G10L2019/0005

Apparatus for quantizing linear predictive coding coefficients, sound encoding apparatus, apparatus for de-quantizing linear predictive coding coefficients, sound decoding apparatus, and electronic device therefore
10224051 · 2019-03-05 · ·

A quantizing apparatus is provided that includes a quantization path determiner that determines a path from a first path not using inter-frame prediction and a second path using the inter-frame prediction, as a quantization path of an input signal, based on a criterion before quantization of the input signal; a first quantizer that quantizes the input signal, if the first path is determined as the quantization path of the input signal; and a second quantizer that quantizes the input signal, if the second path is determined as the quantization path of the input signal.

BIT ERROR DETECTOR FOR AN AUDIO SIGNAL DECODER

A method comprising: receiving lattice vector quantised parameter data, the parameter data representing at least one audio signal; determining within the data at least one bit error; and controlling the decoding of the data to generate an audio signal based on the determining of the bit error.

COMPRESSION OF DECOMPOSED REPRESENTATIONS OF A SOUND FIELD
20240276166 · 2024-08-15 ·

In general, techniques are described for compressing decomposed representations of a sound field. A device comprising a memory and processing circuitry may be configured to perform the techniques. The memory may be configured to store a bitstream representative of scene-based audio data, the scene-based audio data comprising ambisonic coefficients representative of a soundfield. The processing circuitry may be configured to process the bitstream to extract foreground components and corresponding foreground directional information, dequantize the corresponding foreground directional information to obtain corresponding dequantized directional information, and obtain, based on the foreground components and the corresponding dequantized foreground directional information, a reconstructed version of the scene-based audio data.

Vector joint encoding/decoding method and vector joint encoder/decoder

A vector joint encoding/decoding method and a vector joint encoder/decoder are provided, more than two vectors are jointly encoded, and an encoding index of at least one vector is split and then combined between different vectors, so that encoding idle spaces of different vectors can be recombined, thereby facilitating saving of encoding bits, and because an encoding index of a vector is split and then shorter split indexes are recombined, thereby facilitating reduction of requirements for the bit width of operating parts in encoding/decoding calculation.

Frequency envelope vector quantization method and apparatus
10032460 · 2018-07-24 · ·

Embodiments of the present application proposes a frequency envelope vector quantization method and apparatus, where the method includes: dividing N frequency envelopes in one frame into N1 vectors; quantizing a first vector in the N1 vectors by using a first codebook, to obtain a code word corresponding to the quantized first vector, where the first codebook is divided into 2.sup.B1 portions; determining, according to the code word corresponding to the quantized first vector; determining a second codebook according to the codebook of the i.sup.th portion; and quantizing a second vector in the N1 vectors based on the second codebook. In the embodiments of the present application, vector quantization can be performed on frequency envelope vectors by using a codebook with a smaller quantity of bits. Therefore, complexity of vector quantization can be reduced, and an effect of vector quantization can also be ensured.

Method for speaker diarization

Disclosed is a speaker diarization process for determining which speaker is speaking at what time during the course of a conversation. The entire process can be most easily described in five main parts: Segmentation where speech/non-speech decisions are made; frame feature extraction where useful information is obtained from the frames; segment modeling where the information from the frame feature extraction is combined with segment start and end time information to create segment specific features; speaker decisions when the segments are clustered to create speaker models; and corrections where frame level corrections are applied to the information extracted.

Quantization step sizes for compression of spatial components of a sound field
09980074 · 2018-05-22 · ·

In general, techniques are described for determining quantization step sizes for compression of spatial components of a sound field. A device comprising one or more processors may be configured to perform the techniques. In other words, the one or more processors may be configured to determine a quantization step size to be used when compressing a spatial component of a sound field, where the spatial component generated by performing a vector based synthesis with respect to a plurality of spherical harmonic coefficients.

Vector Joint Encoding/Decoding Method and Vector Joint Encoder/Decoder

A vector joint encoding/decoding method and a vector joint encoder/decoder are provided, more than two vectors are jointly encoded, and an encoding index of at least one vector is split and then combined between different vectors, so that encoding idle spaces of different vectors can be recombined, thereby facilitating saving of encoding bits, and because an encoding index of a vector is split and then shorter split indexes are recombined, thereby facilitating reduction of requirements for the bit width of operating parts in encoding/decoding calculation.

Vector joint encoding/decoding method and vector joint encoder/decoder

A vector joint encoding/decoding method and a vector joint encoder/decoder are provided, more than two vectors are jointly encoded, and an encoding index of at least one vector is split and then combined between different vectors, so that encoding idle spaces of different vectors can be recombined, thereby facilitating saving of encoding bits, and because an encoding index of a vector is split and then shorter split indexes are recombined, thereby facilitating reduction of requirements for the bit width of operating parts in encoding/decoding calculation.

Transformed higher order ambisonics audio data

In general, techniques are described for obtaining one or more first vectors describing distinct components of a soundfield and one or more second vectors describing background components of the soundfield, both the one or more first vectors and the one or more second vectors generated at least by performing a transformation with respect to a plurality of spherical harmonic coefficients.