Patent classifications
H04S2400/01
Systems and methods for sound source virtualization
A system and method for externalizing sound. The system includes a headphone assembly and a localizer configured to collect information related to a location of the user and of an acoustically reflective surface in the environment. A controller is configured to determine a location of at least one virtual sound source, and generate head related transfer functions that simulate characteristics of sound from the virtual sound source directly to the user and to the user via a reflection by the reflective surface. A signal processing assembly is configured to create one or more output signals by filtering the sound signal respectively with the HRTFs. Each speaker of the headphone assembly is configured to produce sound in accordance with the output signal.
Streaming binaural audio from a cloud spatial audio processing system to a mobile station for playback on a personal audio delivery device
Spatial audio is received from an audio server over a first communication link. The spatial audio is converted by a cloud spatial audio processing system into binaural audio. The binauralized audio is streamed from the cloud spatial audio processing system to a mobile station over a second communication link to cause the mobile station to play the binaural audio on the personal audio delivery device.
PROCESSING OF AUDIO SIGNALS FROM MULTIPLE MICROPHONES
A first device includes a memory configured to store instructions and one or more processors configured to receive audio signals from multiple microphones. The one or more processors are configured to process the audio signals to generate direction-of-arrival information corresponding to one or more sources of sound represented in one or more of the audio signals. The one or more processors are also configured to and send, to a second device, data based on the direction-of-arrival information and a class or embedding associated with the direction-of-arrival information.
Sound signal processing method and sound signal processing device
A sound signal processing method includes: receiving a line-inputted sound signal; controlling a volume of the line-inputted sound signal; and generating an early reflected sound control signal using the line-inputted sound signal having the controlled volume.
PERCEPTUAL BASS EXTENSION WITH LOUDNESS MANAGEMENT AND ARTIFICIAL INTELLIGENCE (AI)
One embodiment provides a computer-implemented method that includes implementing a customizable compressor for at least one sidechain processing associated with a loudspeaker. Machine learning is applied to automatically tune one or more parameters of the at least one sidechain processing. One or more channels are extracted, including a low-frequency effects (LFE) channel, for nonlinear signal synthesis. A proportional power-sum-based mix-in of an LFE sidechain channel is applied into a non-LFE sidechain. The LFE sidechain channel is maintained within a specified threshold of being level, before and after nonlinear signal synthesis.
AUTOMATIC SPATIAL CALIBRATION FOR A LOUDSPEAKER SYSTEM USING ARTIFICIAL INTELLIGENCE AND NEARFIELD RESPONSE
One embodiment provides a method of automatic spatial calibration. The method comprises estimating one or more distances from one or more loudspeakers to a listening area based on a machine learning model and one or more propagation delays from the one or more loudspeakers to the listening area. The method further comprises estimating one or more incidence angles of the one or more loudspeakers relative to the listening area based on the one or more propagation delays. The method further comprises applying spatial perception correction to audio reproduced by the one or more loudspeakers based on the one or more distances and the one or more incidence angles. The spatial perception correction comprises delay and gain compensation that corrects misplacement of any of the one or more loudspeakers relative to the listening area.
Presentation of Premixed Content in 6 Degree of Freedom Scenes
A method including: obtaining at least two audio signals for reproduction, each of the at least two audio signals associated with a respective one of at least two reproduction locations within an audio reproduction space; obtaining within the audio reproduction space at least two zones; obtaining at least one location for a user's position within the audio reproduction space, the at least one location being relative to at least one of the at least two zones and the at least two reproduction locations; and processing the at least two audio signals based on the obtained at least one location for the user's position within the audio reproduction space to generate at least one output audio signal, the at least one output audio signal is reproduced from at least one of the at least two reproduction locations.
METHOD AND DEVICE FOR PROCESSING AUDIO SIGNAL, USING METADATA
Disclosed is a device for processing an audio signal, which renders an audio signal. The device for processing an audio signal includes a processor. The processor receives metadata including an audio signal and first element reference distance information and renders a first element signal on the basis of the first element reference distance information, wherein the first element reference distance information indicates the reference distance of an element signal. The audio signal is capable of including a second element signal which may be simultaneously rendered with the first element signal, and the metadata is capable of including second element distance information indicating the distance of the second element signal. The number of bits required for representing the first element reference distance information is smaller than the number of bits required for representing the second element distance information.
Audio Representation and Associated Rendering
An apparatus for immersive audio communication including circuitry configured to: receive at least a first audio data stream and a second audio data stream, wherein at least one of the first and second audio stream includes a spatial audio stream to enable immersive audio during a communication; determine a type of each of the first and second audio streams to identify which of the received first and second audio data streams the spatial audio stream; process the second audio data stream with at least one parameter dependent on the determined type; and render the first audio data stream and the processed second audio data stream.
AUDIO RENDERING METHOD AND APPARATUS
This application discloses an audio rendering method and apparatus. The method includes: obtaining a to-be-rendered audio signal; determining K first combined HRTFS based on K first HRTFs and K second HRTFs; determining K second combined HRTFs based on K third HRTFs and K fourth HRTFs; determining a first target rendered signal based on the K first combined HRTFs and the to-be-rendered audio signal, where the first target rendered signal is a rendered signal output to the left ear of a listener; and determining a second target rendered signal based on the K second combined HRTFs and the to-be-rendered audio signal, where the second target rendered signal is a rendered signal output to the right ear of the listener.