G10L19/20

Time-domain stereo encoding and decoding method and related product

An audio encoding and decoding method and a related apparatus are provided. The audio encoding method may include: determining a coding mode of a current frame; when determining that the coding mode of the current frame is an anticorrelated signal coding mode, performing time-domain downmix processing on left and right channel signals in the current frame by using a time-domain downmix processing manner corresponding to the anticorrelated signal coding mode, to obtain a primary channel signal and a secondary channel signal, where the time-domain downmix processing manner corresponding to the anticorrelated signal coding mode is a time-domain downmix processing manner corresponding to an anticorrelated signal channel combination scheme, and the anticorrelated signal channel combination scheme is a channel combination scheme corresponding to a near out of phase signal; and encoding the obtained primary channel signal and secondary channel signal in the current frame.

Time-domain stereo encoding and decoding method and related product

An audio encoding and decoding method and a related apparatus are provided. The audio encoding method may include: determining a coding mode of a current frame; when determining that the coding mode of the current frame is an anticorrelated signal coding mode, performing time-domain downmix processing on left and right channel signals in the current frame by using a time-domain downmix processing manner corresponding to the anticorrelated signal coding mode, to obtain a primary channel signal and a secondary channel signal, where the time-domain downmix processing manner corresponding to the anticorrelated signal coding mode is a time-domain downmix processing manner corresponding to an anticorrelated signal channel combination scheme, and the anticorrelated signal channel combination scheme is a channel combination scheme corresponding to a near out of phase signal; and encoding the obtained primary channel signal and secondary channel signal in the current frame.

HYBRID, PRIORITY-BASED RENDERING SYSTEM AND METHOD FOR ADAPTIVE AUDIO

Embodiments are directed to a method of rendering adaptive audio by receiving input audio comprising channel-based audio, audio objects, and dynamic objects, wherein the dynamic objects are classified as sets of low-priority dynamic objects and high-priority dynamic objects, rendering the channel-based audio, the audio objects, and the low-priority dynamic objects in a first rendering processor of an audio processing system, and rendering the high-priority dynamic objects in a second rendering processor of the audio processing system. The rendered audio is then subject to virtualization and post-processing steps for playback through soundbars and other similar limited height capable speakers.

Decoding of audio scenes

Exemplary embodiments provide encoding and decoding methods, and associated encoders and decoders, for encoding and decoding of an audio scene which is represented by one or more audio signals. The encoder generates a bit stream which comprises downmix signals and side information which includes individual matrix elements of a reconstruction matrix which enables reconstruction of the one or more audio signals in the decoder.

Decoding of audio scenes

Exemplary embodiments provide encoding and decoding methods, and associated encoders and decoders, for encoding and decoding of an audio scene which is represented by one or more audio signals. The encoder generates a bit stream which comprises downmix signals and side information which includes individual matrix elements of a reconstruction matrix which enables reconstruction of the one or more audio signals in the decoder.

APPARATUS AND METHOD FOR GENERATING A PLURALITY OF AUDIO CHANNELS

An apparatus for generating a plurality of audio channels for a speaker setup, comprises a processor repeating an energy distribution from a speaker not contained in the speaker setup to the speakers in the speaker setup to acquire a downmix information for a downmix to the speaker setup; and a renderer for generating the plurality of audio channels using the downmix information.

APPARATUS AND METHOD FOR GENERATING A PLURALITY OF AUDIO CHANNELS

An apparatus for generating a plurality of audio channels for a speaker setup, comprises a processor repeating an energy distribution from a speaker not contained in the speaker setup to the speakers in the speaker setup to acquire a downmix information for a downmix to the speaker setup; and a renderer for generating the plurality of audio channels using the downmix information.

Audio decoding device and method with decoding branches for decoding audio signal encoded in a plurality of domains

An audio encoder has a first information sink oriented encoding branch such as a spectral domain encoding branch, a second information source or SNR oriented encoding branch such as an LPC-domain encoding branch, and a switch for switching between the first and second encoding branches, the second encoding branch having a converter into a specific domain different from the spectral domain such as an LPC analysis stage generating an excitation signal, and the second encoding branch having a specific domain coding branch such as LPC domain processing branch, and a specific spectral domain coding branch such as LPC spectral domain processing branch, and an additional switch for switching between the specific domain coding branch and the specific spectral domain coding branch. An audio decoder has a first domain decoder, a second domain decoder, and a third domain decoder as well as two cascaded switches for switching between the decoders.

Audio object clustering by utilizing temporal variations of audio objects

Embodiments of the present invention relate to audio object clustering by utilizing temporal variation of audio objects. There is provided a method of estimating temporal variation of an audio object for use in audio object clustering. The method comprises obtaining at least one segment of an audio track associated with the audio object, the at least one segment containing the audio object; estimating variation of the audio object over a time duration of the at least one segment based on at least one property of the audio object and adjusting, at least partially based on the estimated variation of the audio object, a contribution of the audio object to the determination of a centroid in the audio object clustering. Corresponding system and computer program product are disclosed.

Audio object clustering by utilizing temporal variations of audio objects

Embodiments of the present invention relate to audio object clustering by utilizing temporal variation of audio objects. There is provided a method of estimating temporal variation of an audio object for use in audio object clustering. The method comprises obtaining at least one segment of an audio track associated with the audio object, the at least one segment containing the audio object; estimating variation of the audio object over a time duration of the at least one segment based on at least one property of the audio object and adjusting, at least partially based on the estimated variation of the audio object, a contribution of the audio object to the determination of a centroid in the audio object clustering. Corresponding system and computer program product are disclosed.