G10L19/18

Decoder for decoding an encoded audio signal and encoder for encoding an audio signal

A schematic block diagram of a decoder for decoding an encoded audio signal is shown. The decoder includes an adaptive spectrum-time converter and an overlap-add processor. The adaptive spectrum-time converter converts successive blocks of spectral values into successive blocks of time values, e.g. via a frequency-to-time transform. Furthermore, the adaptive spectrum-time converter receives control information and, in response to it, switches between transform kernels of a first group, which includes one or more transform kernels having different symmetries at the two sides of the kernel, and transform kernels of a second group, which includes one or more transform kernels having the same symmetry at both sides of the kernel. Moreover, the overlap-add processor overlaps and adds the successive blocks of time values to obtain decoded audio values, which may be a decoded audio signal.
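The kernel switching and overlap-add described above can be sketched roughly as follows. This is a minimal illustration, not the claimed implementation: the kernel names (MDCT-IV, MDST-IV, MDCT-II, MDST-II), the sine window, the per-block `kinds` list standing in for the control information, and every function name are assumptions that do not appear in the abstract. The type-II kernels are included only to show their side symmetries; they omit the edge-coefficient scaling that exact reconstruction would need.

```python
import math

def kernel(kind, n, k, N):
    """Basis function k at sample n of a 2N-sample block (assumed kernel set)."""
    t = n + 0.5 + N / 2  # usual lapped-transform phase offset
    if kind == "MDCT-IV":
        return math.cos(math.pi / N * t * (k + 0.5))
    if kind == "MDST-IV":
        return math.sin(math.pi / N * t * (k + 0.5))
    if kind == "MDCT-II":
        return math.cos(math.pi / N * t * k)
    if kind == "MDST-II":
        return math.sin(math.pi / N * t * (k + 1))
    raise ValueError(kind)

def side_symmetries(kind, N, k=1):
    """Return ('even'|'odd', 'even'|'odd') for the left and right block halves."""
    def classify(offset):
        # Compare each sample with its mirror image inside one half of the block.
        pairs = [(kernel(kind, offset + i, k, N),
                  kernel(kind, offset + N - 1 - i, k, N)) for i in range(N)]
        if all(abs(a - b) < 1e-9 for a, b in pairs):
            return "even"
        if all(abs(a + b) < 1e-9 for a, b in pairs):
            return "odd"
        return "none"
    return classify(0), classify(N)

def sine_window(N):
    # Satisfies w[n]^2 + w[n+N]^2 = 1, so windowed overlap-add sums to unity.
    return [math.sin(math.pi * (n + 0.5) / (2 * N)) for n in range(2 * N)]

def forward(block, kind):
    """Encoder side: 2N windowed time values -> N spectral values."""
    N = len(block) // 2
    w = sine_window(N)
    return [sum(w[n] * block[n] * kernel(kind, n, k, N) for n in range(2 * N))
            for k in range(N)]

def inverse(spec, kind):
    """Spectrum-time conversion: N spectral values -> 2N windowed time values."""
    N = len(spec)
    w = sine_window(N)
    return [2.0 / N * w[n] * sum(spec[k] * kernel(kind, n, k, N) for k in range(N))
            for n in range(2 * N)]

def decode(spectra, kinds):
    """Pick the kernel per block from `kinds`, then overlap-add with hop N."""
    N = len(spectra[0])
    out = [0.0] * (N * (len(spectra) + 1))
    for i, (spec, kind) in enumerate(zip(spectra, kinds)):
        for n, v in enumerate(inverse(spec, kind)):
            out[i * N + n] += v
    return out
```

In this sketch, `side_symmetries` reports an odd left side and an even right side for MDCT-IV (different symmetries), and even on both sides for MDCT-II (same symmetries). Time-domain aliasing cancels in the overlap region only where the right-side symmetry of one block is opposite to the left-side symmetry of the next, which is one plausible reason a decoder would need kernels from both groups when switching.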

AUDIO PROCESSING

An audio communication apparatus comprises a set of two or more audio communication nodes, each audio communication node comprising: an audio encoder controlled by encoding parameters to generate encoded audio data representing a vocal input generated by a user of that audio communication node, the encoded audio data being agnostic to which user generated the vocal input; and an audio decoder controlled by decoding parameters to generate a decoded audio signal as a reproduction of a vocal signal generated by a user of another of the audio communication nodes, the decoding parameters being specific to the user of that other audio communication node.

AUDIO ENCODER FOR ENCODING A MULTICHANNEL SIGNAL AND AUDIO DECODER FOR DECODING AN ENCODED AUDIO SIGNAL

An audio encoder for encoding a multichannel signal is shown. The audio encoder includes: a downmixer for downmixing the multichannel signal to obtain a downmix signal having a low band and a high band; a linear prediction domain core encoder for encoding the downmix signal, configured to apply bandwidth extension processing for parametrically encoding the high band; a filterbank for generating a spectral representation of the multichannel signal; and a joint multichannel encoder configured to process the spectral representation, including the low band and the high band of the multichannel signal, to generate multichannel information.

AUDIO SCENE ENCODER, AUDIO SCENE DECODER AND RELATED METHODS USING HYBRID ENCODER-DECODER SPATIAL ANALYSIS

An audio scene encoder for encoding an audio scene, the audio scene having at least two component signals, has: a core encoder for core encoding the at least two component signals, wherein the core encoder is configured to generate a first encoded representation for a first portion of the at least two component signals, and to generate a second encoded representation for a second portion of the at least two component signals, a spatial analyzer for analyzing the audio scene to derive one or more spatial parameters or one or more spatial parameter sets for the second portion; and an output interface for forming the encoded audio scene signal, the encoded audio scene signal having the first encoded representation, the second encoded representation, and the one or more spatial parameters or one or more spatial parameter sets for the second portion.

Method, apparatus, and system for processing audio data
11727946 · 2023-08-15

A method for processing an audio signal includes: receiving a bitstream corresponding to the audio signal; obtaining a silence insertion descriptor (SID) type of a current frame of the audio signal by decoding the bitstream; obtaining a low-band parameter of the current frame by decoding the bitstream; obtaining a low-band signal of the current frame based on the low-band parameter; obtaining, based on the SID type of the current frame, a high-band parameter of the current frame; obtaining a high-band signal of the current frame based on the high-band parameter; and obtaining a synthesis signal of the current frame based on the low-band signal and the high-band signal.
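The order of the steps in this method can be sketched as below. Everything specific here is a hypothetical placeholder: the abstract does not say what the SID types are, how the bitstream is laid out, or how the high-band parameter is derived, so the `Frame` fields, the `SID_WITH_HIGH_BAND` constant, the fallback to the previous parameter, and the constant-valued "signals" are assumptions chosen only to make the control flow runnable.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

SID_WITH_HIGH_BAND = 1  # hypothetical SID type whose frame carries a high-band parameter

@dataclass
class Frame:
    """Hypothetical result of decoding one frame of the bitstream."""
    sid_type: int
    low_band_gain: float
    high_band_gain: Optional[float] = None

def decode_frame(frame: Frame, prev_high_gain: float,
                 n: int = 4) -> Tuple[List[float], float]:
    # Low-band signal from the low-band parameter (placeholder synthesis).
    low = [frame.low_band_gain] * n

    # The SID type decides where the high-band parameter comes from.
    if frame.sid_type == SID_WITH_HIGH_BAND and frame.high_band_gain is not None:
        high_gain = frame.high_band_gain
    else:
        # Assumption: reuse the last known high-band parameter.
        high_gain = prev_high_gain

    # High-band signal from the high-band parameter (placeholder synthesis).
    high = [high_gain] * n

    # Synthesis signal from the low-band and high-band signals.
    synthesis = [lo + hi for lo, hi in zip(low, high)]
    return synthesis, high_gain
```

A caller would keep the returned parameter and pass it to the next frame, so that a frame whose SID type carries no high-band parameter can still produce a high-band signal.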

Sound quality detection method and device for homologous audio and storage medium

Provided is a sound quality detection method, including: acquiring a plurality of audio files to be detected, wherein the plurality of audio files are homologous audio files; acquiring at least one audio feature of each of the plurality of audio files by performing feature extraction on the audio file, and generating a correspondence list between the at least one audio feature of each of the plurality of audio files and an audio file identifier; and determining, using a sound quality detection model, a sound quality score of each of the plurality of audio files based on the correspondence list between the at least one audio feature of each of the plurality of audio files and the audio file identifier, wherein the sound quality detection model is configured to detect sound quality of homologous audio files.

Selectable linear predictive or transform coding modes with advanced stereo coding

Methods and systems for advanced stereo processing of an audio signal are disclosed. The methods and systems include selecting a coding mode of either transform coding or linear predictive coding and performing advanced stereo processing when in the selected coding mode. Both encoding and decoding operations are provided.