Patent classifications
G10L19/07
Methods for improving high frequency reconstruction
The present invention proposes a new method and a new apparatus for enhancement of audio source coding systems utilising high frequency reconstruction (HFR). It utilises a detection mechanism on the encoder side to assess what parts of the spectrum will not be correctly reproduced by the HFR method in the decoder. Information on this is efficiently coded and sent to the decoder, where it is combined with the output of the HFR input.
Methods for improving high frequency reconstruction
The present invention proposes a new method and a new apparatus for enhancement of audio source coding systems utilising high frequency reconstruction (HFR). It utilises a detection mechanism on the encoder side to assess what parts of the spectrum will not be correctly reproduced by the HFR method in the decoder. Information on this is efficiently coded and sent to the decoder, where it is combined with the output of the HFR input.
Method and device for quantizing linear predictive coefficient, and method and device for dequantizing same
A quantization device includes: a trellis-structured vector quantizer which quantizes a first error vector between an N-dimensional (here, “N” is two or more) subvector and a first predictive vector; and an inter-frame predictor which generates a first predictive vector from the quantized N-dimensional subvector, wherein the inter-frame predictor uses a predictive coefficient comprising an N×N matrix and performs an inter-frame prediction using the quantized N-dimensional subvector of a previous stage.
Apparatus and method for determining weighting function having for associating linear predictive coding (LPC) coefficients with line spectral frequency coefficients and immittance spectral frequency coefficients
Proposed is a method and apparatus for determining a weighting function for quantizing a linear predictive coding (LPC) coefficient and having a low complexity. The weighting function determination apparatus may convert an LPC coefficient of a mid-subframe of an input signal to one of a immittance spectral frequency (ISF) coefficient and a line spectral frequency (LSF) coefficient, and may determine a weighting function associated with an importance of the ISF coefficient or the LSF coefficient based on the converted ISF coefficient or LSF coefficient.
Apparatus and method for determining weighting function having for associating linear predictive coding (LPC) coefficients with line spectral frequency coefficients and immittance spectral frequency coefficients
Proposed is a method and apparatus for determining a weighting function for quantizing a linear predictive coding (LPC) coefficient and having a low complexity. The weighting function determination apparatus may convert an LPC coefficient of a mid-subframe of an input signal to one of a immittance spectral frequency (ISF) coefficient and a line spectral frequency (LSF) coefficient, and may determine a weighting function associated with an importance of the ISF coefficient or the LSF coefficient based on the converted ISF coefficient or LSF coefficient.
Methods, Encoder And Decoder For Linear Predictive Encoding And Decoding Of Sound Signals Upon Transition Between Frames Having Different Sampling Rates
Methods, an encoder and a decoder are configured for transition between frames with different internal sampling rates. Linear predictive (LP) filter parameters are converted from a sampling rate S1 to a sampling rate S2. A power spectrum of a LP synthesis filter is computed, at the sampling rate S1, using the LP filter parameters. The power spectrum of the LP synthesis filter is modified to convert it from the sampling rate S1 to the sampling rate S2. The modified power spectrum of the LP synthesis filter is inverse transformed to determine autocorrelations of the LP synthesis filter at the sampling rate S2. The autocorrelations are used to compute the LP filter parameters at the sampling rate S2.
SPEECH SIGNAL PROCESSING METHOD AND APPARATUS, ELECTRONIC DEVICE, AND STORAGE MEDIUM
Embodiments of this disclosure provide a speech signal processing method and apparatus, an electronic device, and a non-transitory computer-readable storage medium. A processing circuitry of the electronic device obtains speech feature parameters of a to-be-processed speech signal. The speech feature parameters include an initial spectrum of the to-be-processed speech signal. The processing circuitry predicts a first quantity of subband spectrum envelopes based on the speech feature parameters. The first set quantity is less than a quantity of initial spectrum coefficients included in the initial spectrum. Each of the subband spectrum envelopes corresponds to a second quantity of initial spectrum coefficients. The processing circuitry adjusts the initial spectrum based on the predicted subband spectrum envelopes to obtain an adjusted spectrum. The processing circuitry obtains an enhanced speech signal corresponding to the to-be-processed speech signal based on the adjusted spectrum. Apparatus and non-transitory computer-readable storage medium counterpart embodiments are also contemplated.
AUDIO SIGNAL PROCESSING DEVICE, AUDIO SIGNAL PROCESSING METHOD, AND AUDIO SIGNAL PROCESSING PROGRAM
An audio signal processing device comprises a discontinuity detector configured to determine an occurrence of a discontinuity from a sudden increase of an amplitude of decoded audio obtained by decoding the first audio packet which is received correctly after an occurrence of a packet loss, and a discontinuity corrector for correcting the discontinuity of the decoded audio.
METHOD AND DEVICE FOR QUANTIZING LINEAR PREDICTIVE COEFFICIENT, AND METHOD AND DEVICE FOR DEQUANTIZING SAME
A quantization device includes: a trellis-structured vector quantizer which quantizes a first error vector between an N-dimensional (here, “N” is two or more) subvector and a first predictive vector; and an inter-frame predictor which generates a first predictive vector from the quantized N-dimensional subvector, wherein the inter-frame predictor uses a predictive coefficient comprising an N×N matrix and performs an inter-frame prediction using the quantized N-dimensional subvector of a previous stage.
Methods, encoder and decoder for linear predictive encoding and decoding of sound signals upon transition between frames having different sampling rates
Methods, an encoder and a decoder are configured for transition between frames with different internal sampling rates. Linear predictive (LP) filter parameters are converted from a sampling rate S1 to a sampling rate S2. A power spectrum of a LP synthesis filter is computed, at the sampling rate S1, using the LP filter parameters. The power spectrum of the LP synthesis filter is modified to convert it from the sampling rate S1 to the sampling rate S2. The modified power spectrum of the LP synthesis filter is inverse transformed to determine autocorrelations of the LP synthesis filter at the sampling rate S2. The autocorrelations are used to compute the LP filter parameters at the sampling rate S2.