G10L2019/0005

FREQUENCY ENVELOPE VECTOR QUANTIZATION METHOD AND APPARATUS
20180018980 · 2018-01-18 · ·

Embodiments of the present application proposes a frequency envelope vector quantization method and apparatus, where the method includes: dividing N frequency envelopes in one frame into N1 vectors; quantizing a first vector in the N1 vectors by using a first codebook, to obtain a code word corresponding to the quantized first vector, where the first codebook is divided into 2.sup.B1 portions; determining, according to the code word corresponding to the quantized first vector; determining a second codebook according to the codebook of the i.sup.th portion; and quantizing a second vector in the N1 vectors based on the second codebook. In the embodiments of the present application, vector quantization can be performed on frequency envelope vectors by using a codebook with a smaller quantity of bits. Therefore, complexity of vector quantization can be reduced, and an effect of vector quantization can also be ensured.

ENCODING DEVICE, DECODING DEVICE, ENCODING METHOD, AND DECODING METHOD

An encoding device comprising: a quantization circuit that generates a quantization parameter that includes information about a vector quantization codebook; and a control circuit that sets the number of available bits according to conditions for encoding based on the difference between the number of bits available for encoding of the target sub-vector and the number of bits for the quantization parameter of the target sub-vector.

Interpolation for decomposed representations of a sound field

In general, techniques are described for performing an interpolation with respect to decomposed versions of a sound field. A device comprising one or more processors may be configured to perform the techniques. The processors may be configured to obtain decomposed interpolated spherical harmonic coefficients for a time segment by, at least in part, performing an interpolation with respect to a first decomposition of a first plurality of spherical harmonic coefficients and a second decomposition of a second plurality of spherical harmonic coefficients.

Extracting decomposed representations of a sound field based on a second configuration mode

In general, techniques are described for obtaining spherical harmonic coefficients (SHC). A device comprising a processor and a memory may be configured to perform the techniques. The processor may obtain a set of coefficients of a vector representative a distinct component of a sound field, the vector having been decomposed from SHC representative of the sound field. The processor may obtain a configuration mode by which to extract the coefficients, where the configuration mode indicates that the coefficients include coefficients corresponding to an order greater than an order of a basis function to which one or more of the spherical harmonic coefficients correspond and exclude at least one of the coefficients corresponding to a greater order. The processor may extract the coefficients of the vector based on the obtained configuration mode. The memory may be configured to store the non-zero set of the coefficients of the vector.

Performing order reduction with respect to higher order ambisonic coefficients

In general, techniques are described for performing order reduction with respect to a plurality of spherical harmonic coefficients. In accordance with the techniques, a device comprising one or more processors may be configured to perform, based on a target bitrate, order reduction with respect to a plurality of spherical harmonic coefficients or decompositions thereof to generate reduced spherical harmonic coefficients or the reduced decompositions thereof, wherein the plurality of spherical harmonic coefficients represent a sound field.

Analysis of decomposed representations of a sound field

In general, techniques are described for identifying distinct audio objects from spherical harmonic coefficients (which may also be denotes as higher order ambisonic coefficients). A device comprising one or more processors may perform the techniques so as to identify the distinct audio objects from the spherical harmonic coefficients (SHC) associated with the audio objects based on a directionality determined for one or more of the audio objects.

Authenticating a user by correlating speech and corresponding lip shape

Provided is a method of authenticating a user by correlating speech and corresponding lip shape. An audiovisual of a user requesting authentication is captured. The audiovisual is processed to generate a speech vector quantization sequence and a corresponding lip vector quantization sequence of the user. A likelihood of the speech vector quantization sequence and the corresponding lip vector quantization sequence with probability distributions of speech vector quantization code words corresponding to different lip shape vector quantization code words of the user requesting authentication weighed by probabilities of speech and lip vector quantization indices of the user requesting authentication is evaluated. If upon evaluation, a likelihood of the user requesting authentication being an authentic user is more than a predefined threshold, the user is authenticated.

Extracting decomposed representations of a sound field based on a first configuration mode

In general, techniques are described for obtaining decomposed versions of spherical harmonic coefficients. A device comprising a processor and a memory may be configured to perform the techniques. The processor may obtain a non-zero set of coefficients of a vector representative a distinct component of a sound field. The vector may have been decomposed from a plurality of spherical harmonic coefficients that describe the sound field. The processor may also obtain one of a plurality of configuration modes by which to extract the non-zero set of coefficients of the vector, where the one of the configuration modes indicates that the coefficients include all of the coefficients except for at least one of the coefficients. The processor may further extract the coefficients of the vector based on the obtained one of the configuration modes. The memory may be configured to store the non-zero set of the coefficients of the vector.

Vector Joint Encoding/Decoding Method and Vector Joint Encoder/Decoder

A vector joint encoding/decoding method and a vector joint encoder/decoder are provided, more than two vectors are jointly encoded, and an encoding index of at least one vector is split and then combined between different vectors, so that encoding idle spaces of different vectors can be recombined, thereby facilitating saving of encoding bits, and because an encoding index of a vector is split and then shorter split indexes are recombined, thereby facilitating reduction of requirements for the bit width of operating parts in encoding/decoding calculation.

APPARATUS FOR QUANTIZING LINEAR PREDICTIVE CODING COEFFICIENTS, SOUND ENCODING APPARATUS, APPARATUS FOR DE-QUANTIZING LINEAR PREDICTIVE CODING COEFFICIENTS, SOUND DECODING APPARATUS, AND ELECTRONIC DEVICE THEREFORE
20170221495 · 2017-08-03 · ·

A quantizing apparatus is provided that includes a quantization path determiner that determines a path from a first path not using inter-frame prediction and a second path using the inter-frame prediction, as a quantization path of an input signal, based on a criterion before quantization of the input signal; a first quantizer that quantizes the input signal, if the first path is determined as the quantization path of the input signal; and a second quantizer that quantizes the input signal, if the second path is determined as the quantization path of the input signal.