Patent classifications
G10L19/20
Frame loss management in an FD/LPD transition context
A method for decoding a digital signal encoded using predictive coding and transform coding, comprising the following steps: predictive decoding of a preceding frame of the digital signal, encoded by a set of predictive coding parameters; detecting the loss of a current frame of the encoded digital signal; generating by prediction, from at least one predictive coding parameter encoding the preceding frame, a frame for replacing the current frame; generating by prediction, from at least one predictive coding parameter encoding the preceding frame, an additional segment of digital signal; temporarily storing said additional segment of digital signal.
Frame loss management in an FD/LPD transition context
A method for decoding a digital signal encoded using predictive coding and transform coding, comprising the following steps: predictive decoding of a preceding frame of the digital signal, encoded by a set of predictive coding parameters; detecting the loss of a current frame of the encoded digital signal; generating by prediction, from at least one predictive coding parameter encoding the preceding frame, a frame for replacing the current frame; generating by prediction, from at least one predictive coding parameter encoding the preceding frame, an additional segment of digital signal; temporarily storing said additional segment of digital signal.
Apparatus and method for audio rendering employing a geometric distance definition
An apparatus for playing back an audio object associated with a position includes a distance calculator for calculating distances of the position to speakers or for reading the distances of the position to the speakers. The distance calculator is configured to take a solution with a smallest distance. The apparatus is configured to play back the audio object using the speaker corresponding to the solution.
Apparatus and method for audio rendering employing a geometric distance definition
An apparatus for playing back an audio object associated with a position includes a distance calculator for calculating distances of the position to speakers or for reading the distances of the position to the speakers. The distance calculator is configured to take a solution with a smallest distance. The apparatus is configured to play back the audio object using the speaker corresponding to the solution.
APPARATUS AND METHOD FOR SCREEN RELATED AUDIO OBJECT REMAPPING
An apparatus for generating loudspeaker signals includes an object metadata processor configured to receive metadata, to calculate a second position of the audio object depending on the first position of the audio object and on a size of a screen if the audio object is indicated in the metadata as being screen-related, to feed the first position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being not screen-related, and to feed the second position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being screen-related. The apparatus further includes an object renderer configured to receive an audio object and to generate the loudspeaker signals depending on the audio object and on position information.
APPARATUS AND METHOD FOR SCREEN RELATED AUDIO OBJECT REMAPPING
An apparatus for generating loudspeaker signals includes an object metadata processor configured to receive metadata, to calculate a second position of the audio object depending on the first position of the audio object and on a size of a screen if the audio object is indicated in the metadata as being screen-related, to feed the first position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being not screen-related, and to feed the second position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being screen-related. The apparatus further includes an object renderer configured to receive an audio object and to generate the loudspeaker signals depending on the audio object and on position information.
AUDIO SIGNAL ENCODING METHOD AND APPARATUS, AND AUDIO SIGNAL DECODING METHOD AND APPARATUS
An audio signal encoding method and apparatus, and an audio signal decoding method and apparatus are provided. The audio signal encoding method includes: obtaining a frequency-domain coefficient of a current frame and a frequency-domain coefficient of a reference signal of the current frame; performing filtering processing on the frequency-domain coefficient of the current frame to obtain a filtering parameter; determining a target frequency-domain coefficient of the current frame based on the filtering parameter; performing filtering processing on the frequency-domain coefficient of the reference signal and a reference frequency-domain coefficient based on the filtering parameter to obtain a target frequency-domain coefficient of the reference signal; and encoding the target frequency-domain coefficient of the current frame based on the target frequency-domain coefficient of the current frame, the target frequency-domain coefficient of the reference signal, a reference target frequency-domain coefficient. The method can improve audio signal encoding/decoding efficiency.
AUDIO SIGNAL ENCODING METHOD AND APPARATUS, AND AUDIO SIGNAL DECODING METHOD AND APPARATUS
An audio signal encoding method and apparatus, and an audio signal decoding method and apparatus are provided. The audio signal encoding method includes: obtaining a frequency-domain coefficient of a current frame and a frequency-domain coefficient of a reference signal of the current frame; performing filtering processing on the frequency-domain coefficient of the current frame to obtain a filtering parameter; determining a target frequency-domain coefficient of the current frame based on the filtering parameter; performing filtering processing on the frequency-domain coefficient of the reference signal and a reference frequency-domain coefficient based on the filtering parameter to obtain a target frequency-domain coefficient of the reference signal; and encoding the target frequency-domain coefficient of the current frame based on the target frequency-domain coefficient of the current frame, the target frequency-domain coefficient of the reference signal, a reference target frequency-domain coefficient. The method can improve audio signal encoding/decoding efficiency.
METHOD AND APPARATUS FOR DECODING A BITSTREAM INCLUDING ENCODED HIGHER ORDER AMBISONICS REPRESENTATIONS
Higher Order Ambisonics represents three-dimensional sound independent of a specific loudspeaker set-up. However, transmission of an HOA representation results in a very high bit rate. Therefore compression with a fixed number of channels is used, in which directional and ambient signal components are processed differently. For coding, portions of the original HOA representation are predicted from the directional signal components. This prediction provides side information which is required for a corresponding decoding. By using some additional specific purpose bits, a known side information coding processing is improved in that the required number of bits for coding that side information is reduced on average.
METHOD AND APPARATUS FOR DECODING A BITSTREAM INCLUDING ENCODED HIGHER ORDER AMBISONICS REPRESENTATIONS
Higher Order Ambisonics represents three-dimensional sound independent of a specific loudspeaker set-up. However, transmission of an HOA representation results in a very high bit rate. Therefore compression with a fixed number of channels is used, in which directional and ambient signal components are processed differently. For coding, portions of the original HOA representation are predicted from the directional signal components. This prediction provides side information which is required for a corresponding decoding. By using some additional specific purpose bits, a known side information coding processing is improved in that the required number of bits for coding that side information is reduced on average.