H04S5/00

SYSTEM, APPARATUS, AND METHOD FOR MULTI-DIMENSIONAL ADAPTIVE MICROPHONE-LOUDSPEAKER ARRAY SETS FOR ROOM CORRECTION AND EQUALIZATION

In at least one embodiment, an audio system is provided. The audio system includes a plurality of loudspeakers, a plurality of microphones, and at least one audio controller. The plurality of loudspeakers transmits an audio signal in a listening environment. The plurality of microphones detects the audio signal in the listening environment. The at least one audio controller is configured to determine a first psychoacoustic perceived loudness (PPL) of the audio signal as the audio signal is played back through a first loudspeaker of the plurality of loudspeakers and to determine a second PPL of the audio signal as the audio signal is sensed by a first microphone of the plurality of microphones. The at least one audio controller is further configured to map the first loudspeaker of the plurality of loudspeakers to the first microphone of the plurality of microphones based at least on the first PPL and the second PPL.
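
The mapping step can be illustrated with a minimal sketch. The abstract states only that the mapping is based at least on the first PPL and the second PPL; the nearest-loudness rule, the function name `map_by_loudness`, and the numeric levels below are assumptions made for illustration, not the claimed method.

```python
def map_by_loudness(speaker_ppl, mic_ppl):
    """Pair each loudspeaker with the microphone whose PPL is closest.

    ASSUMPTION: a nearest-value rule; the abstract does not specify one.
    Both arguments are dicts of name -> loudness level (arbitrary units).
    """
    return {
        spk: min(mic_ppl, key=lambda mic: abs(mic_ppl[mic] - level))
        for spk, level in speaker_ppl.items()
    }


# Hypothetical loudness values, invented for this example.
speakers = {"front_left": 70.0, "rear_right": 62.0}
mics = {"mic_1": 61.5, "mic_2": 69.0}
mapping = map_by_loudness(speakers, mics)
```

Here `front_left` is paired with `mic_2` and `rear_right` with `mic_1`, because those pairs have the smallest loudness difference.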

POSITIONING ARRANGEMENT

The innovation relates to a method and a system for positioning objects, the method comprising detecting, by a central unit, signals from a plurality of receiver/transmitter units at least partly surrounding an area around the central unit; detecting, by the central unit, an absence of at least one signal from at least one of the plurality of receiver/transmitter units at least partly surrounding an area around the central unit; and determining a position of at least one object between the central unit and the at least one of the plurality of receiver/transmitter units based on the detected absence of the at least one signal.
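
The absence-based detection can be sketched as follows. This is a simplified illustration under stated assumptions: each receiver/transmitter unit is identified by a hypothetical ID and a known bearing from the central unit, and a missing signal is taken to mean an object lies somewhere along that bearing. The abstract does not specify how the position is computed.

```python
def blocked_units(expected_ids, received_ids):
    """Return the units whose signal was expected but not detected."""
    received = set(received_ids)
    return [uid for uid in expected_ids if uid not in received]


def estimate_object_bearings(unit_bearings_deg, received_ids):
    """Map each silent unit to its bearing (degrees) from the central unit.

    ASSUMPTION: an object sits between the central unit and every silent unit.
    """
    missing = blocked_units(list(unit_bearings_deg), received_ids)
    return {uid: unit_bearings_deg[uid] for uid in missing}


# Hypothetical layout: four units surrounding the central unit.
units = {"A": 0.0, "B": 90.0, "C": 180.0, "D": 270.0}
result = estimate_object_bearings(units, ["A", "C", "D"])
```

With unit `B` silent, the sketch reports an object along the 90 degree bearing.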

Systems and Methods for Upmixing Audiovisual Data

A computer-implemented method for upmixing audiovisual data can include obtaining audiovisual data including input audio data and video data accompanying the input audio data. Each frame of the video data can depict only a portion of a larger scene. The input audio data can have a first number of audio channels. The computer-implemented method can include providing the audiovisual data as input to a machine-learned audiovisual upmixing model. The audiovisual upmixing model can include a sequence-to-sequence model configured to model a respective location of one or more audio sources within the larger scene over multiple frames of the video data. The computer-implemented method can include receiving upmixed audio data from the audiovisual upmixing model.

Decoding of audio scenes

Exemplary embodiments provide encoding and decoding methods, and associated encoders and decoders, for encoding and decoding of an audio scene which is represented by one or more audio signals. The encoder generates a bit stream which comprises downmix signals and side information which includes individual matrix elements of a reconstruction matrix which enables reconstruction of the one or more audio signals in the decoder.
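
As a rough illustration of decoder-side reconstruction, the sketch below applies a reconstruction matrix to the downmix signals. The function name, the matrix values, and the sample data are assumptions; the abstract does not state how the matrix elements are derived or transmitted.

```python
def reconstruct(matrix, downmix):
    """Apply an N x M reconstruction matrix to M downmix signals.

    matrix:  list of N rows, each with M coefficients.
    downmix: list of M signals, each a list of samples of equal length.
    Returns N reconstructed signals.
    """
    n_samples = len(downmix[0])
    return [
        [sum(row[m] * downmix[m][t] for m in range(len(downmix)))
         for t in range(n_samples)]
        for row in matrix
    ]


# Hypothetical data: two downmix signals, three reconstructed signals.
downmix = [[1.0, 2.0], [3.0, 4.0]]
matrix = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
signals = reconstruct(matrix, downmix)
```

The third output is the average of the two downmix signals, which shows how a single matrix row can recover a signal that was mixed into both.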

Binaural rendering for headphones using metadata processing

Embodiments are described for a method of rendering audio for playback through headphones comprising receiving digital audio content, receiving binaural rendering metadata generated by an authoring tool processing the received digital audio content, receiving playback metadata generated by a playback device, and combining the binaural rendering metadata and playback metadata to optimize playback of the digital audio content through the headphones.

METHODS AND APPARATUS FOR DECODING ENCODED AUDIO SIGNAL(S)

There are provided decoding and encoding methods for encoding and decoding of multichannel audio content for playback on a speaker configuration with N channels. The decoding method comprises decoding, in a first decoding module, M input audio signals into M mid signals which are suitable for playback on a speaker configuration with M channels; and for each of the N channels in excess of M channels, receiving an additional input audio signal corresponding to one of the M mid signals and decoding the input audio signal and its corresponding mid signal so as to generate a stereo signal including a first and a second audio signal which are suitable for playback on two of the N channels of the speaker configuration.
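
The channel-count arithmetic can be made concrete with a short sketch. It assumes a conventional sum/difference (mid/side) reconstruction for the stereo step, which the abstract does not confirm; `mid_side_to_stereo`, `decode_channels`, and the sample values are hypothetical.

```python
def mid_side_to_stereo(mid, side):
    """ASSUMPTION: conventional mid/side decoding, L = M + S and R = M - S."""
    left = [m + s for m, s in zip(mid, side)]
    right = [m - s for m, s in zip(mid, side)]
    return left, right


def decode_channels(mids, extras):
    """Expand M mid signals into M + len(extras) output channels.

    mids:   list of M mid signals.
    extras: dict of mid index -> additional input signal for that mid.
    """
    out = []
    for i, mid in enumerate(mids):
        if i in extras:
            out.extend(mid_side_to_stereo(mid, extras[i]))
        else:
            out.append(mid)
    return out


# M = 3 mid signals plus 2 additional signals gives N = 5 channels.
mids = [[1.0, 0.5], [0.0, 0.0], [2.0, 2.0]]
extras = {0: [0.25, -0.5], 2: [1.0, 0.0]}
channels = decode_channels(mids, extras)
```

Each additional input signal turns one mid signal into a pair, so the output has exactly as many channels beyond M as there are additional signals.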

Audio signal processing method

Disclosed is an audio signal processing method. The audio signal processing method according to the present invention comprises the steps of: receiving a bit-stream including at least one of a channel signal and an object signal; receiving a user's environment information; decoding at least one of the channel signal and the object signal on the basis of the received bit-stream; generating the user's reproducing channel information on the basis of the user's received environment information; and generating a reproducing signal through a flexible renderer on the basis of at least one of the channel signal and the object signal and the user's reproducing channel information.

Reducing correlation between higher order ambisonic (HOA) background channels

In general, techniques are described for compression and decoding of audio data. An example device for compressing audio data includes one or more processors configured to apply a decorrelation transform to ambient ambisonic coefficients to obtain a decorrelated representation of the ambient ambisonic coefficients, the ambient ambisonic coefficients having been extracted from a plurality of higher order ambisonic coefficients and being representative of a background component of a soundfield described by the plurality of higher order ambisonic coefficients, wherein at least one of the plurality of higher order ambisonic coefficients is associated with a spherical basis function having an order greater than one.
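
The idea of a decorrelation transform can be shown with a toy example. The abstract does not say which transform is used; the sketch below substitutes a simple sum/difference rotation on two equal-energy channels, and all names and sample values are assumptions.

```python
import math


def dot(x, y):
    """Inner product, used here as an uncentered correlation measure."""
    return sum(a * b for a, b in zip(x, y))


def apply_transform(matrix, channels):
    """Multiply a square matrix by a list of channel signals."""
    n_samples = len(channels[0])
    return [
        [sum(row[c] * channels[c][t] for c in range(len(channels)))
         for t in range(n_samples)]
        for row in matrix
    ]


# ASSUMPTION: a 45 degree rotation (sum/difference), not the patented transform.
g = 1.0 / math.sqrt(2.0)
SUM_DIFF = [[g, g], [g, -g]]

# Hypothetical ambient channels with equal energy and strong correlation.
ambient = [[1.0, 2.0, 3.0], [3.0, 2.0, 1.0]]
decorrelated = apply_transform(SUM_DIFF, ambient)
before = dot(ambient[0], ambient[1])
after = dot(decorrelated[0], decorrelated[1])
```

For two channels of equal energy, the inner product of their sum and difference is zero, so `after` vanishes while `before` does not. Because the rotation is invertible, a decoder could apply the inverse matrix to recover the original channels.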