Patent classifications
G10L19/0017
METHOD AND APPARATUS FOR CONTROLLING AUDIO FRAME LOSS CONCEALMENT
In accordance with an example embodiment of the present invention, disclosed is a method and an apparatus thereof for controlling a concealment method for a lost audio frame of a received audio signal. A method for a decoder of concealing a lost audio frame comprises detecting in a property of the previously received and reconstructed audio signal, or in a statistical property of observed frame losses, a condition for which the substitution of a lost frame provides relatively reduced quality. In case such a condition is detected, the concealment method is modified by selectively adjusting a phase or a spectrum magnitude of a substitution frame spectrum.
COMPRESSING AUDIO WAVEFORMS USING NEURAL NETWORKS AND VECTOR QUANTIZERS
Methods, systems and apparatus, including computer programs encoded on computer storage media. One of the methods includes receiving an audio waveform that includes a respective audio sample for each of a plurality of time steps, processing the audio waveform using an encoder neural network to generate a plurality of feature vectors representing the audio waveform, generating a respective coded representation of each of the plurality of feature vectors using a plurality of vector quantizers that are each associated with a respective codebook of code vectors, wherein the respective coded representation of each feature vector identifies a plurality of code vectors, including a respective code vector from the codebook of each vector quantizer, that define a quantized representation of the feature vector, and generating a compressed representation of the audio waveform by compressing the respective coded representation of each of the plurality of feature vectors.
Audio decoding device and method with decoding branches for decoding audio signal encoded in a plurality of domains
An audio encoder has a first information sink oriented encoding branch such as a spectral domain encoding branch, a second information source or SNR oriented encoding branch such as an LPC-domain encoding branch, and a switch for switching between the first and second encoding branches, the second encoding branch having a converter into a specific domain different from the spectral domain such as an LPC analysis stage generating an excitation signal, and the second encoding branch having a specific domain coding branch such as LPC domain processing branch, and a specific spectral domain coding branch such as LPC spectral domain processing branch, and an additional switch for switching between the specific domain coding branch and the specific spectral domain coding branch. An audio decoder has a first domain decoder, a second domain decoder, and a third domain decoder as well as two cascaded switches for switching between the decoders.
Time domain level adjustment for audio signal decoding or encoding
An audio signal decoder for providing a decoded audio signal representation on the basis of an encoded audio signal representation has a decoder preprocessing stage for obtaining a plurality of frequency band signals from the encoded audio signal representation, a clipping estimator, a level shifter, a frequency-to-time-domain converter, and a level shift compensator. The clipping estimator analyzes the encoded audio signal representation and/or side information relative to a gain of the frequency band signals in order to determine a current level shift factor. The level shifter shifts levels of the frequency band signals according to the level shift factor. The frequency-to-time-domain converter converts the level shifted frequency band signals into a time-domain representation. The level shift compensator acts on the time-domain representation for at least partly compensating a corresponding level shift and for obtaining a substantially compensated time-domain representation.
METHOD AND APPARATUS FOR RESTORING AUDIO SIGNAL
Provided is a method of reconstructing an audio signal, the method including detecting a lossy frequency band, based on an energy value of each of frequencies of the audio signal; obtaining a cut-off frequency, based on the lossy frequency band; and reconstructing the audio signal of the lossy frequency band, based on the cut-off frequency.
ADAPTIVE AUDIO CODEC SYSTEM, METHOD AND ARTICLE
A decoder generates decoded signals based on quantized signals. The decoder includes an inverse quantizer and a predictor circuit. The quantized signals are generated in an encoder by low-pass filtering an input signal and encoding the filtered signal using adaptive differential pulse code modulation. The predictor circuit has filter coefficients based on a frequency response of the low-pass filter used to filter the input signal.
ADAPTIVE AUDIO CODEC SYSTEM, METHOD AND ARTICLE
An encoder generates quantized signal words based on a difference signal. The encoder includes an adaptive quantizer. A step size applied by the adaptive quantizer is generated in a feedback loop and based on a loading factor and quantized signal words generated by the adaptive quantizer. The encoder includes coding circuitry which generates code words based on quantized signal words generated by the adaptive quantizer. The coding circuitry generates an escape code in response to a quantized signal word not being associated with a corresponding coding code word.
ADAPTIVE AUDIO CODEC SYSTEM, METHOD AND ARTICLE
An adaptive noise shaping filter flattens signal components below a threshold frequency range in a filtered signal to be encoded. An encoder generates quantized signals based on a difference signal and includes an adaptive quantizer and a decoder. The decoder generates feedback signals and has an inverse quantizer and a predictor. The predictor has determined control parameters based on the threshold frequency range.
Support for generation of comfort noise, and generation of comfort noise
A method for generation of comfort noise for at least two audio channels. The method comprises determining a spatial coherence between audio signals on the respective audio channels, wherein at least one spatial coherence value per frame and frequency band is determined to form a vector of spatial coherence values. A vector of predicted spatial coherence values is formed by a weighted combination of a first coherence prediction and a second coherence prediction that are combined using a weight factor a. The method comprises signaling information about the weight factor a to the receiving node, for enabling the generation of the comfort noise for the at least two audio channels at the receiving node.
TRANSFORMATION APPARATUS, ENCODING APPARATUS, DECODING APPARATUS, TRANSFORMATION METHOD, ENCODING METHOD, DECODING METHOD, AND PROGRAM
Provided is a technique for converting an integer value sequence for encoding/decoding which allows an integer value sequence having a distribution including small values other than a zero value and greatly biased to small values to be encoded with a small average bit number. Provided are: a unary coding unit which subjects an input sequence of non-negative integer values to unary coding to obtain a unary code sequence; a bit reversing unit which replaces a bit value ‘0’ with a bit value ‘1’ and a bit value ‘1’ with a bit value ‘0’ in the bits in the unary code sequence to obtain a replaced code sequence; and a unary decoding unit which subjects the replaced code sequence to unary decoding to obtain a sequence of non-negative integer values.