Patent classifications
G10L19/07
Audio signal discontinuity correction processing system
An audio signal processing system and method is executed by an audio signal processing device to decode an audio packet to obtain decoded audio and determine an occurrence of a discontinuity occurring with a sudden increase of an amplitude of the decoded audio obtained by decoding the audio packet. The audio packet may be received correctly after an occurrence of a packet loss, and corrected to improve subjective quality of the decoded audio, wherein correcting the discontinuity of the decoded audio comprises causing distances between ISF/LSF parameters corresponding to a frame in which a packet loss has occurred to be equal.
Stereo Signal Encoding Method and Apparatus, and Stereo Signal Decoding Method and Apparatus
A stereo signal encoding method includes performing spectrum broadening on a quantized line spectral frequency (LSF) parameter of a primary channel signal in a current frame in a stereo signal to obtain a spectrum-broadened LSF parameter of the primary channel signal, determining a prediction residual of an LSF parameter of a secondary channel signal in the current frame based on an original LSF parameter of the secondary channel signal and the spectrum-broadened LSF parameter of the primary channel signal, and performing a quantization on the prediction residual of the LSF parameter of the secondary channel signal.
Concept for encoding an audio signal and decoding an audio signal using deterministic and noise like information
An encoder for encoding an audio signal has: an analyzer configured for deriving prediction coefficients and a residual signal from an unvoiced frame of the audio signal; a gain parameter calculator configured for calculating a first gain parameter information for defining a first excitation signal related to a deterministic codebook and for calculating a second gain parameter information for defining a second excitation signal related to a noise-like signal for the unvoiced frame; and a bitstream former configured for forming an output signal based on an information related to a voiced signal frame, the first gain parameter information and the second gain parameter information.
Concept for encoding an audio signal and decoding an audio signal using deterministic and noise like information
An encoder for encoding an audio signal has: an analyzer configured for deriving prediction coefficients and a residual signal from an unvoiced frame of the audio signal; a gain parameter calculator configured for calculating a first gain parameter information for defining a first excitation signal related to a deterministic codebook and for calculating a second gain parameter information for defining a second excitation signal related to a noise-like signal for the unvoiced frame; and a bitstream former configured for forming an output signal based on an information related to a voiced signal frame, the first gain parameter information and the second gain parameter information.
Apparatus and method for generating an adaptive spectral shape of comfort noise
An apparatus for decoding an encoded audio signal to obtain a reconstructed audio signal is provided, having: a receiving interface for receiving one or more frames, a coefficient generator, and a signal reconstructor. The coefficient generator is configured to determine one or more first audio signal coefficients, and one or more noise coefficients. Moreover, the coefficient generator is configured to generate one or more second audio signal coefficients, depending on the one or more first audio signal coefficients and depending on the one or more noise coefficients. The audio signal reconstructor is configured to reconstruct a first portion of the reconstructed audio signal depending on the one or more first audio signal coefficients and the audio signal reconstructor is configured to reconstruct a second portion of the reconstructed audio signal depending on the one or more second audio signal coefficients, if the current frame is not received by the receiving interface or if the current frame being received by the receiving interface is corrupted.
Apparatus and method for generating an adaptive spectral shape of comfort noise
An apparatus for decoding an encoded audio signal to obtain a reconstructed audio signal is provided, having: a receiving interface for receiving one or more frames, a coefficient generator, and a signal reconstructor. The coefficient generator is configured to determine one or more first audio signal coefficients, and one or more noise coefficients. Moreover, the coefficient generator is configured to generate one or more second audio signal coefficients, depending on the one or more first audio signal coefficients and depending on the one or more noise coefficients. The audio signal reconstructor is configured to reconstruct a first portion of the reconstructed audio signal depending on the one or more first audio signal coefficients and the audio signal reconstructor is configured to reconstruct a second portion of the reconstructed audio signal depending on the one or more second audio signal coefficients, if the current frame is not received by the receiving interface or if the current frame being received by the receiving interface is corrupted.
Stereo signal encoding method and apparatus, and stereo signal decoding method and apparatus
A stereo signal encoding method includes performing spectrum broadening on a quantized line spectral frequency (LSF) parameter of a primary channel signal in a current frame in a stereo signal to obtain a spectrum-broadened LSF parameter of the primary channel signal, determining a prediction residual of an LSF parameter of a secondary channel signal in the current frame based on an original LSF parameter of the secondary channel signal and the spectrum-broadened LSF parameter of the primary channel signal, and performing a quantization on the prediction residual of the LSF parameter of the secondary channel signal.
VOICE PROCESSING METHOD, APPARATUS, AND DEVICE AND STORAGE MEDIUM
A voice processing method includes: determining a historical voice frame corresponding to a target voice frame; determining a frequency-domain characteristic of the historical voice frame; invoking a network model to predict the frequency-domain characteristic of the historical voice frame, to obtain a parameter set of the target voice frame, the parameter set including a plurality of types of parameters, the network model including a plurality of neural networks (NNs), and a number of the types of the parameters in the parameter set being determined according to a number of the NNs; and reconstructing the target voice frame according to the parameter set.
Methods, encoder and decoder for linear predictive encoding and decoding of sound signals upon transition between frames having different sampling rates
Methods, an encoder and a decoder are configured for transition between frames with different internal sampling rates. Linear predictive (LP) filter parameters are converted from a sampling rate S1 to a sampling rate S2. A power spectrum of a LP synthesis filter is computed, at the sampling rate S1, using the LP filter parameters. The power spectrum of the LP synthesis filter is modified to convert it from the sampling rate S1 to the sampling rate S2. The modified power spectrum of the LP synthesis filter is inverse transformed to determine autocorrelations of the LP synthesis filter at the sampling rate S2. The autocorrelations are used to compute the LP filter parameters at the sampling rate S2.
Audio signal discontinuity processing system
An audio signal processing device comprises a discontinuity detector configured to determine an occurrence of a discontinuity from a sudden increase of an amplitude of decoded audio obtained by decoding the first audio packet which is received correctly after an occurrence of a packet loss, and a discontinuity corrector for correcting the discontinuity of the decoded audio by changing, in a state buffer, a distance between elements of Immittance Spectral Pair/Immittance Spectral Frequency (ISF/LSF) parameters of a past frame.