Patent classifications
H04S7/307
LOCATION BASED AUDIO SIGNAL MESSAGE PROCESSING
A method of incorporating environmental acoustic sources into a virtual environment by measuring real environment acoustic sources and locations and incorporating them into a virtual environment with virtual acoustic sources.
Apparatus, Method, or Computer Program for Processing an Encoded Audio Scene using a Parameter Smoothing
Apparatus for processing an audio scene representing a sound field, the audio scene having information on a transport signal and a first set of parameters. The apparatus has a parameter processor for processing the first set of parameters to obtain a second set of parameters, wherein the parameter processor is configured to calculate at least one raw parameter for each output time frame using at least one parameter of the first set of parameters for the input time frame, to calculate a smoothing information such as a factor for each raw parameter in accordance with a smoothing rule, and to apply a corresponding smoothing information to the corresponding raw parameter to derive the parameter of the second set of parameters for the output time frame. The apparatus further has an output interface for generating a processed audio scene using the second set of parameters and the information on the transport signal.
SIGNAL PROCESSING APPARATUS AND METHOD, AND PROGRAM TO REDUCE CALCULATION AMOUNT BASED ON MUTE INFORMATION
The present technology relates to a signal processing apparatus and method, and a program that make it possible to reduce an arithmetic operation amount.
The signal processing apparatus performs, on the basis of audio object mute information indicative of whether or not a signal of an audio object is a mute signal, at least either one of a decoding process or a rendering process of an object signal of the audio object. The present technology can be applied to a signal processing apparatus.
VIDEO PROCESSING DEVICE AND METHOD
A video processing apparatus includes a memory storing instructions, and at least one processor configured to execute the instructions to generate a plurality of feature information by analyzing a video signal comprising a plurality of images based on a first DNN, extract a first altitude component and a first planar component corresponding to a movement of an object in a video from the video signal based on a second DNN, extract a second planar component corresponding to a movement of a sound source in audio from a first audio signal based on a third DNN, generate a second altitude component based on the first altitude component, the first planar component, and the second planar component, output a second audio signal comprising the second altitude component based on the feature information, and synchronize the second audio signal with the video signal and output the synchronized second audio signal and video signal.
SIGNAL PROCESSING DEVICE, SIGNAL PROCESSING METHOD, AND PROGRAM
The present technology relates to a signal processing device, a signal processing method, and a program that allow for prevention of distortion of a sound space. The signal processing device includes: a relative azimuth prediction unit configured to predict, on the basis of a delay time in accordance with a distance from a virtual sound source to a listener, a relative azimuth of the virtual sound source when a sound of the virtual sound source reaches the listener; and a BRIR generation unit configured to acquire a head-related transfer function of the relative azimuth for each one of a plurality of the virtual sound sources and generate a BRIR on the basis of a plurality of the acquired head-related transfer functions. The present technology can be applied to the signal processing device.
CONTROL APPARATUS, SIGNAL PROCESSING METHOD, AND SPEAKER APPARATUS
A control apparatus according to an embodiment of the present technology includes an audio control section and a vibration control section.
The audio control section generates audio control signals of a plurality of channels with audio signals of the plurality of channels as input signals, the audio signals each including a first audio component and a second audio component different from the first audio component. The vibration control section generates a vibration control signal for vibration presentation by taking a difference between audio signals of two channels among the plurality of channels.
In-Car Adaptive Sound Quality Output Method, Device, Storage Medium and Car Audio System
The application relates to a sound quality output method. The method includes: obtaining the current volume level input by the user; determine the audio signal gain difference between the current volume level and the baseline volume level according to the current volume level and the preset baseline volume level, determining an equal loudness curve difference between the current volume level and the baseline volume level according to the audio signal gain difference, and the current volume level and the baseline volume level, determining the audio quality response curve corresponding to the current volume level according to the equal loudness curve difference and the pre-stored audio quality response curve corresponding to the baseline volume level, determining the output signal parameters of multiple audio output channels according to the audio quality response curve corresponding to the current volume level. The application optimizes the sound quality effect and enhances acoustic experience.
NOISE SUPPRESSION USING TANDEM NETWORKS
A device includes a memory configured to store instructions and one or more processors configured to execute the instructions. The one or more processors are configured to execute the instructions to receive audio data including a first audio frame corresponding to a first output of a first microphone and a second audio frame corresponding to a second output of a second microphone. The one or more processors are also configured to execute the instructions to provide the audio data to a first noise-suppression network and a second noise-suppression network. The first noise-suppression network is configured to generate a first noise-suppressed audio frame and the second noise-suppression network is configured to generate a second noise-suppressed audio frame. The one or more processors are further configured to execute the instructions to provide the noise-suppressed audio frames to an attention-pooling network. The attention-pooling network is configured to generate an output noise-suppressed audio frame.
DYNAMIC AUDIO EQUALIZATION
Methods and systems for performing automatic speed-based audio control. One method includes receiving, with an electronic control unit included in a vehicle, a speed of the vehicle and receiving, with the electronic control unit, an audio signal. The method also includes accessing, with the electronic control unit, a plurality of equalization curves based on the speed of the vehicle, each of the plurality of equalization curves associated with the speed of the vehicle and each of the plurality of equalization curves defining a gain adjustment for one of a plurality of frequencies, and, for each curve of the plurality of equalization curves, applying the gain adjustment defined by the curve to one of the plurality of frequencies of the audio signal.
COLORLESS GENERATION OF ELEVATION PERCEPTUAL CUES USING ALL-PASS FILTER NETWORKS
A system includes one or more computing devices that encode spatial perceptual cues into a monaural channel to generate a plurality of output channels. A computing device determines a target amplitude response for the mid and side channels of the plurality of output channels, defining a spatial perceptual associated with one or more frequency-dependent phase shifts. The computing device determines a transfer function of a single-input, multi-output allpass filter based on the target amplitude response and determines coefficients of the allpass filter based on the transfer function, and processes the monaural channel with the coefficients of the allpass filter to generate the plurality of channels having the encoded spatial perceptual cues. The allpass filter is configured to be colorless with respect to the individual output channels, allowing for the placement of spatial cues into the audio stream to be decoupled from the overall coloration of the audio.