G10L2019/0013

SPEECH ENCODING USING A PRE-ENCODED DATABASE
20210104250 · 2021-04-08 ·

Methods, systems, and devices for encoding are described. A device, which may be otherwise known as user equipment (UE), may support standards-compatible audio encoding (e.g., speech encoding) using a pre-encoded database. The device may receive a digital representation of an audio signal and identify, based on receiving the digital representation of the audio signal, a database that is pre-encoded according to a coding standard and that includes a quantity of digital representations of other audio signals. The device may encode the digital representation of the audio signal using a machine learning scheme and information from the database pre-encoded according to the coding standard. The device may generate a bitstream of the digital representation that is compatible with the coding standard based on encoding the digital representation of the audio signal, and output a representation of the bitstream.

SIGNAL PROCESSING METHODS AND APPARATUSES FOR ENHANCING SOUND QUALITY

Provided are a signal processing method and apparatus for enhancing sound quality. The signal processing method performed by a signal transmitting apparatus includes determining, based on a plurality of parameters, a valid bandwidth so as to encode an input signal; performing pre-processing on the input signal, based on the valid bandwidth; and encoding the pre-processed input signal, based on the valid bandwidth, and the signal processing method performed by a signal receiving apparatus includes decoding a bitstream or a packet received via a transmission channel; determining a valid bandwidth, based on a plurality of parameters used in the decoding; and performing post-processing on a decoded signal, based on the valid bandwidth.

Signal processing methods and apparatuses for enhancing sound quality

Provided are a signal processing method and apparatus for enhancing sound quality. The signal processing method performed by a signal transmitting apparatus includes determining, based on a plurality of parameters, a valid bandwidth so as to encode an input signal; performing pre-processing on the input signal, based on the valid bandwidth; and encoding the pre-processed input signal, based on the valid bandwidth, and the signal processing method performed by a signal receiving apparatus includes decoding a bitstream or a packet received via a transmission channel; determining a valid bandwidth, based on a plurality of parameters used in the decoding; and performing post-processing on a decoded signal, based on the valid bandwidth.

Pyramid vector quantizer shape search

An encoder and a method therein for Pyramid Vector Quantizer, PVQ, shape search, the PVQ taking a target vector x as input and deriving a vector y by iteratively adding unit pulses in an inner dimension search loop. The method comprises, before entering a next inner dimension search loop for unit pulse addition, determining, based on the maximum pulse amplitude, maxamp.sub.y, of a current vector y, whether more than a current bit word length is needed to represent enloop.sub.y, in a lossless manner in the upcoming inner dimension loop. The variable enloop.sub.y is related to an accumulated energy of the vector y. The performing of this method enables the encoder to keep the complexity of the search at a reasonable level.

VECTOR QUANTIZER
20190378526 · 2019-12-12 ·

Vector Quantizer and method therein for vector quantization, e.g. in a transform audio codec. The method comprises comparing an input target vector with four centroids C.sub.0, C.sub.1, C.sub.0,flip and C.sub.1,flip, wherein centroid C.sub.0,flip is a flipped version of centroid C.sub.0 and centroid C.sub.1,flip is a flipped version of centroid C.sub.1, each centroid representing a respective class of codevectors. A starting point for a search related to the input target vector in the codebook is determined, based on the comparison. A search is performed in the codebook, starting at the determined starting point, and a codevector is identified to represent the input target vector. A number of input target vectors per block or time segment is variable. A search space is dynamically adjusted to the number of input target vectors. The codevectors are sorted according to a distortion measure reflecting the distance between each codevector and the centroids C.sub.0 and C.sub.1.

Vector quantizer

Vector Quantizer and method therein for vector quantization, e.g. in a transform audio codec. The method comprises comparing an input target vector with four centroids C.sub.0, C.sub.1, C.sub.0,flip and C.sub.1,flip, wherein centroid C.sub.0,flip is a flipped version of centroid C.sub.0 and centroid C.sub.1,flip is a flipped version of centroid C.sub.1, each centroid representing a respective class of codevectors. A starting point for a search related to the input target vector in the codebook is determined, based on the comparison. A search is performed in the codebook, starting at the determined starting point, and a codevector is identified to represent the input target vector. A number of input target vectors per block or time segment is variable. A search space is dynamically adjusted to the number of input target vectors. The codevectors are sorted according to a distortion measure reflecting the distance between each codevector and the centroids C.sub.0 and C.sub.1.

VECTOR QUANTIZATION OF ALGEBRAIC CODEBOOK WITH HIGH-PASS CHARACTERISTIC FOR POLARITY SELECTION
20190214031 · 2019-07-11 ·

Provided are a vector quantization device, a voice coding device, a vector quantization method, and a voice coding method which enable a reduction in the calculation amount of voice codec without deterioration of voice quality. In the vector quantization device, a first reference vector calculation unit (201) calculates a first reference vector by multiplying a target vector (x) by an auditory weighting LPC synthesis filter (H), and a second reference vector calculation unit (202) calculates a second reference vector by multiplying an element of the first reference vector by a filter having a high pass characteristic. A polarity preliminary selection unit (205) generates a polar vector by disposing a unit pulse having a positive or negative polarity, which is selected on the basis of the polarity of an element of the second reference vector, in the position of said element.

SIGNAL PROCESSING METHODS AND APPARATUSES FOR ENHANCING SOUND QUALITY

Provided are a signal processing method and apparatus for enhancing sound quality. The signal processing method performed by a signal transmitting apparatus includes determining, based on a plurality of parameters, a valid bandwidth so as to encode an input signal; performing pre-processing on the input signal, based on the valid bandwidth; and encoding the pre-processed input signal, based on the valid bandwidth, and the signal processing method performed by a signal receiving apparatus includes decoding a bitstream or a packet received via a transmission channel; determining a valid bandwidth, based on a plurality of parameters used in the decoding; and performing post-processing on a decoded signal, based on the valid bandwidth.

Vector quantization of algebraic codebook with high-pass characteristic for polarity selection

Provided are a vector quantization device, a voice coding device, a vector quantization method, and a voice coding method which enable a reduction in the calculation amount of voice codec without deterioration of voice quality. In the vector quantization device, a first reference vector calculation unit (201) calculates a first reference vector by multiplying a target vector (x) by an auditory weighting LPC synthesis filter (H), and a second reference vector calculation unit (202) calculates a second reference vector by multiplying an element of the first reference vector by a filter having a high pass characteristic. A polarity preliminary selection unit (205) generates a polar vector by disposing a unit pulse having a positive or negative polarity, which is selected on the basis of the polarity of an element of the second reference vector, in the position of said element.

SPEECH ENCODING AND DECODING METHODS AND APPARATUSES, COMPUTER DEVICE, AND STORAGE MEDIUM
20240274143 · 2024-08-15 ·

This application relates to a speech encoding method performed by a computer device the method, including: performing subband decomposition on a target speech signal to obtain a plurality of subband excitation signals; obtaining an auditory perception representational value that corresponds to each subband excitation signal; determining at least one first subband excitation signal and at least one second subband excitation signal from the at least two subband excitation signals; obtaining a gain of each of the at least one first subband excitation signal relative to a preset reference excitation signal as an encoding parameter that corresponds to the first subband excitation signal; obtaining a corresponding encoding parameter that is obtained by quantizing each of the at least one second subband excitation signal; and performing encoding on each subband excitation signal based on the encoding parameter corresponding to the subband excitation signal.