H04S3/02

SIGNAL PROCESSING METHODS AND SYSTEMS FOR RENDERING AUDIO ON VIRTUAL LOUDSPEAKER ARRAYS
20170245082 · 2017-08-24

Techniques of rendering audio involve applying a balanced-realization state space model to each head-related transfer function (HRTF) to reduce the order of an effective FIR or even an infinite impulse response (IIR) filter. Along these lines, each HRTF G(z) is derived from a head-related impulse response filter (HRIR) via, e.g., a z-transform. The data of the HRIR may be used to construct a first state space representation [A, B, C, D] of the HRTF via the relation $G(z) = C(zI - A)^{-1}B + D$. This first state space representation is not unique, and so for an FIR filter, A and B may be set to simple, binary-valued arrays, while C and D contain the HRIR data. This representation leads to a simple form of a Gramian Q whose eigenvectors provide system states that maximize the system gain as measured by a Hankel norm. Further, a factorization of Q provides a transformation into a balanced state space in which the Gramian is equal to a diagonal matrix of the eigenvalues of Q. By considering only those states associated with an eigenvalue greater than some threshold, the balanced state space representation of the HRTF may be truncated to provide an approximate HRTF that approximates the original HRTF very well while reducing the amount of computation required by as much as 90%.
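
The first state space representation described above can be illustrated with a minimal sketch. It assumes one common choice for an FIR filter: A is a shift matrix, B is a unit vector, C holds the HRIR taps after the first, and D holds the first tap. The function names `fir_state_space` and `impulse_response` are hypothetical, and the Gramian factorization and truncation steps are not shown.

```python
def fir_state_space(h):
    """Build one possible [A, B, C, D] for an FIR filter with taps h[0..N].

    Assumption: A is a binary shift matrix and B a binary unit vector,
    so all HRIR data ends up in C and D.
    """
    n = len(h) - 1  # number of delay states
    # Ones on the subdiagonal: each state is the previous state, one sample later.
    A = [[1.0 if i == j + 1 else 0.0 for j in range(n)] for i in range(n)]
    B = [1.0 if i == 0 else 0.0 for i in range(n)]
    C = list(h[1:])
    D = h[0]
    return A, B, C, D


def impulse_response(A, B, C, D, length):
    """Simulate x[k+1] = A x[k] + B u[k], y[k] = C x[k] + D u[k] for a unit impulse."""
    n = len(B)
    x = [0.0] * n
    out = []
    for k in range(length):
        u = 1.0 if k == 0 else 0.0
        out.append(sum(c * xi for c, xi in zip(C, x)) + D * u)
        x = [sum(A[i][j] * x[j] for j in range(n)) + B[i] * u for i in range(n)]
    return out


# Hypothetical 4-tap HRIR used only for illustration.
hrir = [0.5, -0.25, 0.125, 0.0625]
A, B, C, D = fir_state_space(hrir)
response = impulse_response(A, B, C, D, 6)
```

Under these assumptions the simulated impulse response reproduces the HRIR taps followed by zeros, which is what the relation between G(z) and [A, B, C, D] requires.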

Apparatus and method for center signal scaling and stereophonic enhancement based on a signal-to-downmix ratio

An apparatus for generating a modified audio signal having two or more modified audio channels from an audio input signal comprising two or more audio input channels is provided. The apparatus has an information generator for generating signal-to-downmix information. The information generator is adapted to generate signal information by combining a spectral value of each of the two or more audio input channels in a first way. The information generator is adapted to generate downmix information by combining the spectral value of each of the two or more audio input channels in a second way being different from the first way. Furthermore, the information generator is adapted to combine the signal information and the downmix information to obtain signal-to-downmix information. The apparatus has a signal attenuator for attenuating the two or more audio input channels depending on the signal-to-downmix information to obtain the two or more modified audio channels.
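
As a rough illustration of the signal-to-downmix idea, the sketch below works on a single frequency bin. The abstract does not say what the "first way" and "second way" are, so the choices here are assumptions: signal information is the sum of per-channel powers, and downmix information is the power of the channel sum. The gain mapping in `attenuate` is also hypothetical and is tuned for two channels, where the ratio is about 0.5 for identical (center) content and 1 for a hard-panned source.

```python
def signal_to_downmix_ratio(spectral_values, eps=1e-12):
    """Ratio of summed channel powers to the power of the summed channels."""
    signal_info = sum(abs(x) ** 2 for x in spectral_values)   # assumed "first way"
    downmix_info = abs(sum(spectral_values)) ** 2             # assumed "second way"
    return signal_info / (downmix_info + eps)                 # eps avoids division by zero


def attenuate(spectral_values):
    """Scale all channels of one bin by a gain derived from the ratio.

    Hypothetical mapping for two channels: ratio 0.5 -> gain 0, ratio >= 1 -> gain 1.
    """
    sdr = signal_to_downmix_ratio(spectral_values)
    gain = min(1.0, max(0.0, 2.0 * sdr - 1.0))
    return [gain * x for x in spectral_values]


center_bin = attenuate([1 + 0j, 1 + 0j])   # identical in both channels
panned_bin = attenuate([1 + 0j, 0j])       # present in one channel only
```

With this mapping, content that is identical in both channels is suppressed while hard-panned content passes nearly unchanged; a different mapping would be needed to boost rather than attenuate the center.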

Apparatus and method for efficient object metadata coding

An apparatus for generating one or more audio channels is provided. The apparatus includes a metadata decoder for receiving one or more compressed metadata signals. Each of the one or more compressed metadata signals includes a plurality of first metadata samples. The metadata decoder is configured to generate one or more reconstructed metadata signals and to generate each of the second metadata samples of each reconstructed metadata signal of the one or more reconstructed metadata signals depending on at least two of the first metadata samples of the reconstructed metadata signal. The apparatus includes an audio channel generator for generating the one or more audio channels depending on the one or more audio object signals and depending on the one or more reconstructed metadata signals. An apparatus for generating encoded audio information including one or more encoded audio signals and one or more compressed metadata signals is provided.
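
One way to read "generate each of the second metadata samples depending on at least two of the first metadata samples" is interpolation between sparsely transmitted values. The sketch below assumes linear interpolation and a fixed upsampling factor; both are assumptions for illustration, and `reconstruct_metadata` is a hypothetical name.

```python
def reconstruct_metadata(first_samples, factor):
    """Insert factor - 1 interpolated samples between each pair of received samples.

    Assumption: linear interpolation between the two neighbouring first samples.
    """
    if not first_samples:
        return []
    out = []
    for a, b in zip(first_samples, first_samples[1:]):
        out.append(a)  # first metadata sample, kept as received
        for k in range(1, factor):
            out.append(a + (b - a) * k / factor)  # second metadata sample
    out.append(first_samples[-1])
    return out


# Hypothetical azimuth track in degrees, transmitted at a quarter of the full rate.
azimuth = reconstruct_metadata([0.0, 40.0, 20.0], 4)
```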

DECODING OF AUDIO SCENES

Exemplary embodiments provide encoding and decoding methods, and associated encoders and decoders, for encoding and decoding of an audio scene which is represented by one or more audio signals. The encoder generates a bit stream which comprises downmix signals and side information which includes individual matrix elements of a reconstruction matrix which enables reconstruction of the one or more audio signals in the decoder.
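
The decoder-side use of a reconstruction matrix can be sketched as a plain matrix product applied to one frame of downmix samples. The sketch assumes a single real-valued matrix per frame and ignores any time or frequency tiling; the function name and the example values are hypothetical.

```python
def reconstruct(matrix, downmix):
    """Apply a reconstruction matrix to downmix channels.

    matrix:  one row per reconstructed audio signal, one column per downmix channel
    downmix: one list of samples per downmix channel (all the same length)
    """
    n = len(downmix[0])
    return [
        [sum(coef * channel[t] for coef, channel in zip(row, downmix)) for t in range(n)]
        for row in matrix
    ]


# Hypothetical frame: two downmix channels, three reconstructed signals.
matrix = [[1.0, 0.0], [0.5, 0.5], [0.0, 1.0]]
downmix = [[1.0, 2.0], [3.0, 4.0]]
signals = reconstruct(matrix, downmix)
```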

APPARATUS AND METHOD FOR GENERATING OUTPUT SIGNALS BASED ON AN AUDIO SOURCE SIGNAL, SOUND REPRODUCTION SYSTEM AND LOUDSPEAKER SIGNAL

An apparatus for generating a first multitude of output signals based on at least one audio source signal includes a delay network and a feedback processor. The delay network includes a second multitude of delay paths, each delay path having a delay line and an attenuation filter. Each delay line is configured for delaying delay line input signals and for combining the at least one audio source signal and a reverberated audio signal to obtain a combined signal, wherein the attenuation filter of a delay path is configured for filtering the combined signal from the delay line of the delay path to obtain an output signal. The first multitude of output signals includes the output signal. The feedback processor is configured for reverberating the first multitude of output signals to obtain a third multitude of the reverberated audio signals including the reverberated audio signal.
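
The structure described here resembles a feedback delay network, and a minimal sketch of that reading follows. It makes several simplifying assumptions: each attenuation filter is reduced to a scalar gain, the feedback processor is a fixed mixing matrix, and there is a single source signal. The name `delay_network` and all parameter values are hypothetical.

```python
from collections import deque


def delay_network(source, delays, gains, feedback):
    """Run a source signal through parallel delay paths with matrix feedback.

    delays:   delay length in samples for each path
    gains:    scalar stand-in for each path's attenuation filter (assumption)
    feedback: square mixing matrix standing in for the feedback processor
    Returns one output signal per delay path.
    """
    lines = [deque([0.0] * d, maxlen=d) for d in delays]
    outputs = [[] for _ in delays]
    for s in source:
        # Oldest sample of each delay line, attenuated, is that path's output.
        outs = [g * line[0] for g, line in zip(gains, lines)]
        # Feedback processor mixes the outputs into reverberated signals.
        reverberated = [sum(f * o for f, o in zip(row, outs)) for row in feedback]
        # Each delay line takes the source combined with its reverberated signal.
        for line, r in zip(lines, reverberated):
            line.append(s + r)  # maxlen drops the oldest sample automatically
        for path, o in zip(outputs, outs):
            path.append(o)
    return outputs


# Single path, 2-sample delay, gain 0.5, direct feedback, unit impulse input.
impulse = [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
single = delay_network(impulse, delays=[2], gains=[0.5], feedback=[[1.0]])
```

In the single-path case the output is a decaying echo every two samples. With several paths, the choice of feedback matrix and gains determines whether the loop stays stable.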

Apparatus and method for generating a plurality of audio channels

An apparatus for generating a plurality of audio channels for a first speaker setup is characterized by an imaginary speaker determiner, an energy distribution calculator, a processor and a renderer. The imaginary speaker determiner is configured to determine a position of an imaginary speaker not contained in the first speaker setup to obtain a second speaker setup containing the imaginary speaker. The energy distribution calculator is configured to calculate an energy distribution from the imaginary speaker to the other speakers in the second speaker setup. The processor is configured to repeat the energy distribution to obtain downmix information for a downmix from the second speaker setup to the first speaker setup. The renderer is configured to generate the plurality of audio channels using the downmix information.
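
The repeated energy distribution can be sketched as an iteration that keeps pushing energy out of imaginary speakers until it all rests on real ones. The abstract does not give the distribution rule, so the fractions below, the fixed iteration count, and the use of square roots to turn energy shares into downmix gains are all assumptions; `downmix_gains` is a hypothetical name.

```python
import math


def downmix_gains(distribution, real, iterations=50):
    """Derive downmix gains from imaginary speakers to real speakers.

    distribution[src][dst]: fraction of energy imaginary speaker src passes to dst
                            (fractions for each src are assumed to sum to 1)
    real: set of speaker names present in the first (real) speaker setup
    """
    gains = {}
    for imaginary in distribution:
        energy = {imaginary: 1.0}
        for _ in range(iterations):
            redistributed = {}
            for speaker, e in energy.items():
                if speaker in real:
                    # Real speakers keep whatever energy they have received.
                    redistributed[speaker] = redistributed.get(speaker, 0.0) + e
                else:
                    # Imaginary speakers pass their energy on again.
                    for dst, fraction in distribution[speaker].items():
                        redistributed[dst] = redistributed.get(dst, 0.0) + e * fraction
            energy = redistributed
        gains[imaginary] = {s: math.sqrt(e) for s, e in energy.items() if s in real}
    return gains


# Hypothetical setup: two imaginary speakers that also feed each other.
distribution = {
    "top": {"L": 0.5, "bottom": 0.5},
    "bottom": {"R": 0.5, "top": 0.5},
}
gains = downmix_gains(distribution, real={"L", "R"})
```

In this example the energy that bounces between the two imaginary speakers shrinks by a factor of four per round trip, so the gains converge quickly and their squares sum to approximately one.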