H04S2400/03

Sound spatialization with room effect
09848274 · 2017-12-19 · ·

A method of sound spatialization, in which at least one filtering process, including summation, is applied, to at least two input signals, the filtering process comprising: the application of at least one first room effect transfer function, the first transfer function being specific to each input signal, and the application of at least one second room effect transfer function, the second transfer function being common to all input signals. The method is such that it comprises a step of weighting at least one input signal with a weighting factor, said weighting factor being specific to each of the input signals.

Decorrelator structure for parametric reconstruction of audio signals

An encoding system encodes multiple audio signals (X) as a downmix signal (Y) together with wet and dry upmix coefficients (P, C). In a decoding system, a pre-multiplier (101) computes an intermediate signal (W) by mapping the downmix signal linearly in accordance with a first set of coefficients (Q); a decorrelating section (102) outputs a decorrelated signal (Z) based on the intermediate signal; a wet upmix section (103) computes a wet upmix signal by mapping the decorrelated signal linearly in accordance with the wet upmix coefficients; a dry upmix section (104) computes a dry upmix signal by mapping the downmix signal linearly in accordance with the dry upmix coefficients; a combining section (105) provides a multidimensional reconstructed signal (X) by combining the wet and dry upmix signals; and a converter (106) computes the first set of coefficients based on the wet and dry upmix coefficients and supplies this to the pre-multiplier.

Decoding of audio scenes

Exemplary embodiments provide encoding and decoding methods, and associated encoders and decoders, for encoding and decoding of an audio scene which is represented by one or more audio signals. The encoder generates a bit stream which comprises downmix signals and side information which includes individual matrix elements of a reconstruction matrix which enables reconstruction of the one or more audio signals in the decoder.

Binaural rendering for headphones using metadata processing

Embodiments are described for a method of rendering audio for playback through headphones comprising receiving digital audio content, receiving binaural rendering metadata generated by an authoring tool processing the received digital audio content, receiving playback metadata generated by a playback device, and combining the binaural rendering metadata and playback metadata to optimize playback of the digital audio content through the headphones.

Binaural rendering method and apparatus for decoding multi channel audio

Disclosed is a binaural rendering method and apparatus for decoding a multichannel audio signal. The binaural rendering method may include: extracting an early reflection component and a late reverberation component from a binaural filter; generating a stereo audio signal by performing binaural rendering of a multichannel audio signal base on the early reflection component; and applying the late reverberation component to the generated stereo audio signal.

Binaural rendering method and apparatus for decoding multi channel audio

Disclosed is a binaural rendering method and apparatus for decoding a multichannel audio signal. The binaural rendering method may include: extracting an early reflection component and a late reverberation component from a binaural filter; generating a stereo audio signal by performing binaural rendering of a multichannel audio signal base on the early reflection component; and applying the late reverberation component to the generated stereo audio signal.

LOUDNESS ADJUSTMENT FOR DOWNMIXED AUDIO CONTENT

Audio content coded for a reference speaker configuration is downmixed to downmix audio content coded for a specific speaker configuration. One or more gain adjustments are performed on individual portions of the downmix audio content coded for the specific speaker configuration. Loudness measurements are then performed on the individual portions of the downmix audio content. An audio signal that comprises the audio content coded for the reference speaker configuration and downmix loudness metadata is generated. The downmix loudness metadata is created based at least in part on the loudness measurements on the individual portions of the downmix audio content.

AUDIO DIGITAL SIGNAL PROCESSOR UTILIZING A HYBRID NETWORK ARCHITECTURE
20170352357 · 2017-12-07 · ·

A system and method executed by audio processing software on one or more electronic devices in a computer system to process digital audio signals. The system comprises a digitizer for digitizing a received audio signal; and processor for performing a plurality of audio processing functions on the digitized audio signals, each of the audio processing functions having at least one programmable parameter, and wherein each of the audio processing functions are categorized and grouped as audio objects, and organized into a channel strip, the channel strip processing digitized audio signals for a particular received audio signal, and wherein, the audio objects are fixed in order, so that the digitized received audio signals are processed by a predefined number of N audio objects, and wherein the N audio objects occur in a fixed sequence, and further wherein, the N audio objects comprise a first subset of non-exchangeable audio objects and a second subset of exchangeable audio objects, such that any one or more of the second subset of audio objects can be exchanged by a replacement audio object, and further wherein when the audio processing functions are programmed, they can be saved without compiling the audio processing software.

APPARATUS AND METHOD FOR GENERATING A PLURALITY OF AUDIO CHANNELS

An apparatus for generating a plurality of audio channels for a speaker setup, comprises a processor repeating an energy distribution from a speaker not contained in the speaker setup to the speakers in the speaker setup to acquire a downmix information for a downmix to the speaker setup; and a renderer for generating the plurality of audio channels using the downmix information.

METHODS AND APPARATUS FOR DECODING ENCODED AUDIO SIGNAL(S)

There are provided decoding and encoding methods for encoding and decoding of multichannel audio content for playback on a speaker configuration with N channels. The decoding method comprises decoding, in a first decoding module, M input audio signals into M mid signals which are suitable for playback on a speaker configuration with M channels; and for each of the N channels in excess of M channels, receiving an additional input audio signal corresponding to one of the M mid signals and decoding the input audio signal and its corresponding mid signal so as to generate a stereo signal including a first and a second audio signal which are suitable for playback on two of the N channels of the speaker configuration.