Patent classifications
H04S3/02
Audio decoder for audio channel reconstruction
A method and apparatus for reconstructing N audio channels from M audio channels are disclosed. The method includes receiving a bitstream containing an encoded audio signal representing the M audio channels and decoding the encoded audio signal to obtain a frequency domain representation of the M audio channels. The method further includes extracting a parameter from the bitstream and reconstructing at least one of the N audio channels using the parameter. The parameter represents an angle between two signals, at least one of which is included in the M audio channels.
Rendering audio objects having apparent size
Methods, systems, and computer program products for rendering an audio object having an apparent size are disclosed. An audio processing system receives audio panning data including a first grid mapping first virtual sound sources in a space and speaker positions to speaker gains. The first grid specifies first speaker gains of the first virtual sound sources in the space. The audio processing system determines a second grid of second virtual sound sources in the space, including mapping the first virtual sound sources into the second virtual sound sources. The audio processing system selects at least one of the first grid or second grid for rendering an audio object based on an apparent size of the audio object. The audio processing system renders the audio object based on the selected grid or grids.
Matrix decoder with constant-power pairwise panning
A constant-power pairwise panning upmixing system and method upmix a two-channel stereo signal to a multi-channel surround sound signal having more than two channels. Each output channel is some combination of the two input channels. Closed-form solutions are used to calculate dematrixing coefficients that weight each input channel. The dematrixing coefficients are computed based on an inter-channel level difference and an inter-channel phase difference between the two input signals. The weighted input channels are then mixed uniquely for each output channel to generate a surround sound output from the stereo input signal. Each dematrixing coefficient has an in-phase component and an out-of-phase component. The phase coefficients for each component vary in time and are based on the phase difference between the input signals. The resultant surround sound output faithfully simulates the audio content as originally mixed.
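The abstract does not give the closed-form dematrixing coefficients, so the following is only a minimal sketch of the general constant-power panning principle the title refers to: a source panned between a pair of speakers receives sine/cosine gains whose squares sum to one. The function name `constant_power_pair_gains` and the normalized `position` parameter are assumptions made for illustration and are not taken from the patent.

```python
import math


def constant_power_pair_gains(position):
    """Return (gain_a, gain_b) for a source panned between two speakers.

    `position` is a hypothetical normalized pan position:
    0.0 places the source fully in speaker A, 1.0 fully in speaker B.
    The gains follow the sine/cosine law, so gain_a**2 + gain_b**2 == 1
    (total power stays constant across the pan range).
    """
    if not 0.0 <= position <= 1.0:
        raise ValueError("position must lie in [0, 1]")
    theta = position * math.pi / 2.0
    return math.cos(theta), math.sin(theta)


if __name__ == "__main__":
    for pos in (0.0, 0.25, 0.5, 0.75, 1.0):
        ga, gb = constant_power_pair_gains(pos)
        print(f"pos={pos:.2f} gain_a={ga:.4f} gain_b={gb:.4f} power={ga*ga + gb*gb:.4f}")
```

In a decoder of the kind described, the pan position would presumably be derived from the inter-channel level and phase differences rather than supplied directly; that step is left out here because the abstract does not specify it.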
Decorrelator structure for parametric reconstruction of audio signals
An encoding system encodes multiple audio signals (X) as a downmix signal (Y) together with wet and dry upmix coefficients (P, C). In a decoding system, a pre-multiplier (101) computes an intermediate signal (W) by mapping the downmix signal linearly in accordance with a first set of coefficients (Q); a decorrelating section (102) outputs a decorrelated signal (Z) based on the intermediate signal; a wet upmix section (103) computes a wet upmix signal by mapping the decorrelated signal linearly in accordance with the wet upmix coefficients; a dry upmix section (104) computes a dry upmix signal by mapping the downmix signal linearly in accordance with the dry upmix coefficients; a combining section (105) provides a multidimensional reconstructed signal (X) by combining the wet and dry upmix signals; and a converter (106) computes the first set of coefficients based on the wet and dry upmix coefficients and supplies this to the pre-multiplier.
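The decoder signal flow described above can be sketched roughly as follows, using the abstract's symbols (Y, Q, W, Z, P, C) and reference numerals. This is an illustration under stated assumptions, not the patented implementation: the one-sample delay is only a stand-in for a real decorrelator, the helper names are hypothetical, and Q is passed in directly because the abstract does not say how the converter (106) derives it from P and C.

```python
def matmul(a, b):
    """Multiply matrix a (r x k) by matrix b (k x c); both are lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]


def delay_decorrelator(w, delay=1):
    """Placeholder decorrelator (an assumption): delay each channel by `delay` samples."""
    return [([0.0] * delay + list(row))[:len(row)] for row in w]


def reconstruct(y, c, p, q, decorrelate=delay_decorrelator):
    """Combine dry and wet upmix signals into a reconstructed multichannel signal.

    y: downmix signal, one row per downmix channel, one column per sample
    c: dry upmix coefficients, p: wet upmix coefficients
    q: first set of coefficients for the pre-multiplier (given, not derived here)
    """
    w = matmul(q, y)        # pre-multiplier (101): intermediate signal W
    z = decorrelate(w)      # decorrelating section (102): decorrelated signal Z
    wet = matmul(p, z)      # wet upmix section (103)
    dry = matmul(c, y)      # dry upmix section (104)
    # combining section (105): sample-wise sum of dry and wet signals
    return [[d + v for d, v in zip(dry_row, wet_row)]
            for dry_row, wet_row in zip(dry, wet)]


if __name__ == "__main__":
    y = [[1.0, 2.0, 3.0]]        # one downmix channel, three samples
    c = [[0.5], [0.5]]           # dry coefficients for two output channels
    p = [[1.0], [-1.0]]          # wet coefficients with opposite signs
    q = [[1.0]]
    print(reconstruct(y, c, p, q))
```

Passing the decorrelator as a parameter keeps the linear mapping stages separate from the decorrelation stage, which mirrors the sectioned structure in the abstract.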
Decoding of audio scenes
Exemplary embodiments provide encoding and decoding methods, and associated encoders and decoders, for an audio scene represented by one or more audio signals. The encoder generates a bit stream comprising downmix signals and side information. The side information includes individual matrix elements of a reconstruction matrix that enables the decoder to reconstruct the one or more audio signals.
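As a rough illustration of the decoding side, the sketch below assembles a reconstruction matrix from individually transmitted elements and applies it to the downmix signals. The `(row, column, value)` element format, the zero default for elements that are not transmitted, and the function names are all assumptions made for this example; the abstract does not describe the bit stream syntax.

```python
def build_reconstruction_matrix(elements, n_outputs, n_downmix):
    """Assemble a matrix from (row, column, value) triples.

    Elements that are not listed default to 0.0 (an assumption of this sketch).
    """
    matrix = [[0.0] * n_downmix for _ in range(n_outputs)]
    for row, col, value in elements:
        matrix[row][col] = value
    return matrix


def reconstruct_scene(downmix, matrix):
    """Apply the reconstruction matrix to the downmix, sample by sample.

    downmix: one row per downmix channel, one column per sample
    returns: one row per reconstructed audio signal
    """
    n_samples = len(downmix[0])
    return [[sum(row[k] * downmix[k][t] for k in range(len(downmix)))
             for t in range(n_samples)]
            for row in matrix]


if __name__ == "__main__":
    downmix = [[1.0, 2.0], [3.0, 4.0]]             # two downmix channels, two samples
    side_info = [(0, 0, 1.0), (1, 1, 1.0),         # hypothetical transmitted elements
                 (2, 0, 0.5), (2, 1, 0.5)]
    matrix = build_reconstruction_matrix(side_info, n_outputs=3, n_downmix=2)
    print(reconstruct_scene(downmix, matrix))
```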
Binaural rendering for headphones using metadata processing
Embodiments are described for a method of rendering audio for playback through headphones comprising receiving digital audio content, receiving binaural rendering metadata generated by an authoring tool processing the received digital audio content, receiving playback metadata generated by a playback device, and combining the binaural rendering metadata and playback metadata to optimize playback of the digital audio content through the headphones.