Patent classifications
G10L19/09
Apparatus and method for generating an adaptive spectral shape of comfort noise
An apparatus for decoding an encoded audio signal to obtain a reconstructed audio signal is provided, having: a receiving interface for receiving one or more frames, a coefficient generator, and a signal reconstructor. The coefficient generator is configured to determine one or more first audio signal coefficients, and one or more noise coefficients. Moreover, the coefficient generator is configured to generate one or more second audio signal coefficients, depending on the one or more first audio signal coefficients and depending on the one or more noise coefficients. The audio signal reconstructor is configured to reconstruct a first portion of the reconstructed audio signal depending on the one or more first audio signal coefficients and the audio signal reconstructor is configured to reconstruct a second portion of the reconstructed audio signal depending on the one or more second audio signal coefficients, if the current frame is not received by the receiving interface or if the current frame being received by the receiving interface is corrupted.
Apparatus and method for generating an adaptive spectral shape of comfort noise
An apparatus for decoding an encoded audio signal to obtain a reconstructed audio signal is provided, having: a receiving interface for receiving one or more frames, a coefficient generator, and a signal reconstructor. The coefficient generator is configured to determine one or more first audio signal coefficients, and one or more noise coefficients. Moreover, the coefficient generator is configured to generate one or more second audio signal coefficients, depending on the one or more first audio signal coefficients and depending on the one or more noise coefficients. The audio signal reconstructor is configured to reconstruct a first portion of the reconstructed audio signal depending on the one or more first audio signal coefficients and the audio signal reconstructor is configured to reconstruct a second portion of the reconstructed audio signal depending on the one or more second audio signal coefficients, if the current frame is not received by the receiving interface or if the current frame being received by the receiving interface is corrupted.
ENCODER, DECODER, ENCODING METHOD AND DECODING METHOD FOR FREQUENCY DOMAIN LONG-TERM PREDICTION OF TONAL SIGNALS FOR AUDIO CODING
An encoder for encoding a current frame of an audio signal depending on one or more previous frames of the audio signal is provided. The previous frames precede the current frame, each of the current frame and the one or more previous frames having one or more harmonic components of the audio signal, each of the current frame and the one or more previous frames having a plurality of spectral coefficients in a frequency domain or in a transform domain. To generate an encoding of the current frame, the encoder is to determine an estimation of two harmonic parameters for each of the harmonic components of a most previous frame of the previous frames. Moreover, the encoder is to determine the estimation of the two harmonic parameters for each of the harmonic components of the most previous frame using a first group of three or more of the plurality of spectral coefficients of each of the previous frames of the audio signal.
ENCODER, DECODER, ENCODING METHOD AND DECODING METHOD FOR FREQUENCY DOMAIN LONG-TERM PREDICTION OF TONAL SIGNALS FOR AUDIO CODING
An encoder for encoding a current frame of an audio signal depending on one or more previous frames of the audio signal is provided. The previous frames precede the current frame, each of the current frame and the one or more previous frames having one or more harmonic components of the audio signal, each of the current frame and the one or more previous frames having a plurality of spectral coefficients in a frequency domain or in a transform domain. To generate an encoding of the current frame, the encoder is to determine an estimation of two harmonic parameters for each of the harmonic components of a most previous frame of the previous frames. Moreover, the encoder is to determine the estimation of the two harmonic parameters for each of the harmonic components of the most previous frame using a first group of three or more of the plurality of spectral coefficients of each of the previous frames of the audio signal.
Very Short Pitch Detection and Coding
A method includes detecting whether there is a very short pitch lag in a speech or audio signal that is shorter than a conventional minimum pitch limitation using a combination of time domain and frequency domain pitch detection techniques. The pitch detection techniques include using pitch correlations in a time domain and detecting a lack of low frequency energy in the speech or audio signal in a frequency domain. The detected very short pitch lag is coded using a pitch range from a predetermined minimum very short pitch limitation that is smaller than the conventional minimum pitch limitation.
Apparatus and method for generating an error concealment signal using individual replacement LPC representations for individual codebook information
An apparatus for generating an error concealment signal includes an LPC (linear prediction coding) representation generator for generating a first replacement LPC representation and a different second replacement LPC representation; an LPC synthesizer for filtering a first codebook information using the first replacement representation to obtain a first replacement signal and for filtering a different second codebook information using the second replacement LPC representation to obtain a second replacement signal; and a replacement signal combiner for combining the first replacement signal and the second replacement signal to obtain the error concealment signal.
Apparatus and method for generating an error concealment signal using individual replacement LPC representations for individual codebook information
An apparatus for generating an error concealment signal includes an LPC (linear prediction coding) representation generator for generating a first replacement LPC representation and a different second replacement LPC representation; an LPC synthesizer for filtering a first codebook information using the first replacement representation to obtain a first replacement signal and for filtering a different second codebook information using the second replacement LPC representation to obtain a second replacement signal; and a replacement signal combiner for combining the first replacement signal and the second replacement signal to obtain the error concealment signal.
Selecting pitch lag
In apparatus, methods, and programs for selecting pitch lag, an encoder obtains a first and a second estimates of a pitch lag for a current frame. A selected value is chosen by selection between the first and the second estimates, based on a first and a second correlation measurements. The second estimate is conditioned by the pitch lag selected at the previous frame. The selection is based on a comparison between: a downscaled version of a first correlation measurement associated to the current frame and obtained at a lag corresponding to the first estimate; and a second correlation measurement associated to the current frame and obtained at a lag corresponding to the second estimate.
Selecting pitch lag
In apparatus, methods, and programs for selecting pitch lag, an encoder obtains a first and a second estimates of a pitch lag for a current frame. A selected value is chosen by selection between the first and the second estimates, based on a first and a second correlation measurements. The second estimate is conditioned by the pitch lag selected at the previous frame. The selection is based on a comparison between: a downscaled version of a first correlation measurement associated to the current frame and obtained at a lag corresponding to the first estimate; and a second correlation measurement associated to the current frame and obtained at a lag corresponding to the second estimate.
System and method for long term prediction in audio codecs
A frequency domain long-term prediction system and method for estimating and applying an optimum long term predictor. Embodiments of the system and method include determining parameters of a single-tap predictor using a frequency-domain analysis having an optimality criteria based on spectral flatness measure. Embodiments of the system and method also include determining parameters of the long-term predictor by accounting for the performance of the vector quantizer in quantizing the various subbands. In some embodiments other encoder metrics (such as signal tonality) are used as well. Other embodiments of the system and method include determining the optimal parameters of the long-term predictor by accounting for some of the decoder operation. Other embodiments of the system and method include extending a 1-tap predictor to a k-th order predictor by convolving the 1-tap predictor with a pre-set filter and selecting from a table of such pre-set filters based on a minimum energy criteria.