Patent classifications
G10L19/002
Transform ambisonic coefficients using an adaptive network
A device includes a memory configured to store untransformed ambisonic coefficients at different time segments. The device also includes one or more processors configured to obtain the untransformed ambisonic coefficients at the different time segments, where the untransformed ambisonic coefficients at the different time segments represent a soundfield at the different time segments. The one or more processors are also configured to apply one adaptive network, based on a constraint, to the untransformed ambisonic coefficients at the different time segments to generate transformed ambisonic coefficients at the different time segments, wherein the transformed ambisonic coefficients at the different time segments represent a modified soundfield at the different time segments that was modified based on the constraint.
SOUND SIGNAL ENCODING METHOD, SOUND SIGNAL DECODING METHOD, SOUND SIGNAL ENCODING APPARATUS, SOUND SIGNAL DECODING APPARATUS, PROGRAM, AND RECORDING MEDIUM
A downmix unit 110 obtains downmix signals which are signals obtained by mixing input sound signals of a left channel input and input sound signals of a right channel input. A left channel signal subtraction unit 130 and a right channel signal subtraction unit 150 code the difference between the input sound signals and a multiplication value of the downmix signals and a subtraction gain for each of the left channel and the right channel. In such a configuration, a left channel subtraction gain estimation unit 120 and a right channel subtraction gain estimation unit 140 determine the subtraction gain such that the quantization errors resulting from the two processes of coding/decoding are reduced.
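The downmix-and-subtract structure described above can be sketched in a few lines. This is a minimal illustration, not the method of the application: the averaging downmix and the plain least-squares gain are assumptions, whereas the abstract says the gain is chosen to reduce quantization errors from the two coding/decoding processes, which this sketch does not model. The function names `downmix`, `least_squares_gain`, and `residual` are hypothetical.

```python
def downmix(left, right):
    """Mix the two channels by averaging them sample by sample (assumed mixing rule)."""
    return [(l + r) / 2.0 for l, r in zip(left, right)]


def least_squares_gain(channel, mix):
    """Gain g that minimizes sum((channel - g * mix) ** 2).

    Assumption: a plain least-squares projection stands in for the
    quantization-aware gain estimation described in the abstract.
    Returns 0.0 when the downmix is silent.
    """
    mix_energy = sum(m * m for m in mix)
    if mix_energy == 0.0:
        return 0.0
    return sum(c * m for c, m in zip(channel, mix)) / mix_energy


def residual(channel, mix, gain):
    """Difference signal that would be passed on to the per-channel coder."""
    return [c - gain * m for c, m in zip(channel, mix)]


# Two opposite-panned channels: each differs from the downmix by +/-1.
left = [2.0, 0.0]
right = [0.0, 2.0]
mix = downmix(left, right)
gain_left = least_squares_gain(left, mix)
gain_right = least_squares_gain(right, mix)
res_left = residual(left, mix, gain_left)
res_right = residual(right, mix, gain_right)
```

When both channels are identical, the downmix equals either channel, the gain is 1, and the residual is all zeros, so nearly all of the information travels in the downmix.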
Methods and devices for vector segmentation for coding
A method for partitioning of input vectors for coding is presented. The method comprises obtaining an input vector. The input vector is segmented, in a non-recursive manner, into an integer number, N^SEG, of input vector segments. A representation of a respective relative energy difference between parts of the input vector on each side of each boundary between the input vector segments is determined, in a recursive manner. The input vector segments and the representations of the relative energy differences are provided for individual coding. Partitioning units and computer programs for partitioning of input vectors for coding, as well as positional encoders, are presented.
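One way to picture the two stages (a flat split into segments, then a recursive pass over the boundaries) is the sketch below. Several choices are assumptions that the abstract does not state: segments are near-equal in length, the recursion always takes the middle boundary of the current range, and the "relative energy difference" is represented as the left part's share of the combined energy. All function names are hypothetical.

```python
def split_segments(vector, n_seg):
    """Non-recursive split into n_seg contiguous, near-equal segments."""
    base, extra = divmod(len(vector), n_seg)
    segments, start = [], 0
    for i in range(n_seg):
        size = base + (1 if i < extra else 0)  # first `extra` segments get one more
        segments.append(vector[start:start + size])
        start += size
    return segments


def energy(values):
    """Sum of squares of the values."""
    return sum(v * v for v in values)


def boundary_energy_shares(segments, lo=0, hi=None, out=None):
    """Recursively record, per boundary index, the left part's energy share.

    The boundary at index `mid` separates segments[lo:mid] from segments[mid:hi].
    A share of 0.5 is used when both sides are silent (assumed convention).
    """
    if hi is None:
        hi = len(segments)
    if out is None:
        out = {}
    if hi - lo < 2:
        return out  # a single segment has no inner boundary
    mid = (lo + hi) // 2
    e_left = sum(energy(s) for s in segments[lo:mid])
    e_right = sum(energy(s) for s in segments[mid:hi])
    total = e_left + e_right
    out[mid] = e_left / total if total > 0 else 0.5
    boundary_energy_shares(segments, lo, mid, out)
    boundary_energy_shares(segments, mid, hi, out)
    return out


vector = [3.0, 0.0, 0.0, 4.0, 1.0, 0.0, 0.0, 0.0]
segments = split_segments(vector, 4)
shares = boundary_energy_shares(segments)
```

Here the four segments have energies 9, 16, 1, and 0, so the middle boundary carries the share 25/26, and the two inner boundaries carry 9/25 and 1. The segments and the per-boundary shares could then be coded separately, as the abstract describes.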
MULTI-CHANNEL AUDIO SIGNAL ENCODING METHOD AND APPARATUS
Disclosed are a multi-channel audio signal encoding method and apparatus. Audio signals of P channels in a current frame of a multi-channel audio signal can be obtained, where the audio signals of the P channels include audio signals of K channel pairs; respective bit quantities of the K channel pairs are determined based on the respective energy/amplitudes of the audio signals of the P channels and a quantity of available bits; and the audio signals of the P channels are encoded based on the respective bit quantities of the K channel pairs to obtain an encoded bitstream, to improve encoding quality.
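As a rough illustration of splitting a bit budget across channel pairs by energy, the sketch below hands each pair a share proportional to its energy. This is an assumed rule for illustration only; the abstract says only that the bit quantities depend on the energy/amplitudes and the available bits. The function name `allocate_pair_bits` and the idea of one non-negative energy value per pair are hypothetical.

```python
def allocate_pair_bits(pair_energies, available_bits):
    """Split available_bits across channel pairs in proportion to their energy.

    Assumptions: energies are non-negative, one value per pair; bits left over
    after rounding down go to the highest-energy pairs first.
    """
    k = len(pair_energies)
    if k == 0:
        return []
    total = sum(pair_energies)
    if total <= 0:
        # All pairs silent: fall back to an even split.
        shares = [available_bits // k] * k
    else:
        shares = [int(available_bits * e / total) for e in pair_energies]
    leftover = available_bits - sum(shares)
    # Stable sort keeps the original order among pairs with equal energy.
    order = sorted(range(k), key=lambda i: pair_energies[i], reverse=True)
    for n in range(leftover):
        shares[order[n % k]] += 1
    return shares


bits = allocate_pair_bits([6.0, 3.0, 1.0], 100)
even_bits = allocate_pair_bits([1.0, 1.0, 1.0], 100)
```

A real encoder would likely also enforce per-pair minimum and maximum budgets, which this sketch leaves out.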
DECODING DEVICE, DECODING METHOD, PROGRAM, ENCODING DEVICE, AND ENCODING METHOD
An object is to reduce the amount of data to be transmitted while ensuring tactile reproducibility. A decoding device according to the present technology includes: a first decoding unit that decodes first encoded data obtained by encoding a first signal section with a first bit rate, the first signal section being a part of a touch signal section which is a signal section indicating a touch state with an object in a tactile signal, the first signal section being a signal section including a boundary between the touch state and a non-touch state with the object; and a second decoding unit that decodes second encoded data obtained by encoding a second signal section with a bit rate lower than the first bit rate, the second signal section being a signal section except for the first signal section in the touch signal section.
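The two-rate idea can be sketched as a per-sample rate plan: samples of the touch section that lie near a touch/non-touch boundary get the higher rate, and the rest of the touch section gets the lower rate. The fixed window width `edge`, the per-sample granularity, and the function name `plan_bit_rates` are assumptions for illustration; the abstract does not say how the first signal section is sized, nor how non-touch samples are handled, so they are left unassigned (`None`) here.

```python
def plan_bit_rates(touch, edge, high_rate, low_rate):
    """Assign a rate to every sample of a tactile signal.

    touch: sequence of truthy/falsy flags, one per sample (touch state).
    edge:  assumed number of samples on each side of a boundary that
           count as the "first signal section".
    Returns a list with high_rate near boundaries inside the touch
    section, low_rate elsewhere inside it, and None outside it.
    """
    n = len(touch)
    # A boundary sits at index i when the touch state changes between i-1 and i.
    boundaries = [i for i in range(1, n) if bool(touch[i]) != bool(touch[i - 1])]
    plan = []
    for i, in_touch in enumerate(touch):
        if not in_touch:
            plan.append(None)  # non-touch: not covered by this sketch
        elif any(b - edge <= i < b + edge for b in boundaries):
            plan.append(high_rate)
        else:
            plan.append(low_rate)
    return plan


touch = [0, 0, 1, 1, 1, 1, 1, 1, 0, 0]
plan = plan_bit_rates(touch, edge=2, high_rate=64, low_rate=16)
```

In this example the onset and release of the touch each get two samples at the higher rate, while the steady middle of the touch is coded at the lower rate.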
BIT ALLOCATION METHOD AND APPARATUS FOR AUDIO SIGNAL
The present disclosure provides a bit allocation method and apparatus for an audio signal. The bit allocation method for an audio signal includes: obtaining T audio signals in a current frame, where T is a positive integer; determining a first audio signal set based on the T audio signals, where the first audio signal set includes M audio signals, M is a positive integer, T ≥ M; determining M priorities of the M audio signals in the first audio signal set; and performing bit allocation on the M audio signals based on the M priorities of the M audio signals.
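The three steps in the abstract (pick M of the T signals, give each a priority, allocate bits by priority) might look like the sketch below. Every concrete rule here is an assumption: the first set is chosen by an energy threshold, priority is an integer weight derived from energy rank, and bits are split in proportion to those weights. The function names are hypothetical.

```python
def select_first_set(energies, threshold):
    """Indices of the signals whose energy reaches the threshold (assumed rule)."""
    return [i for i, e in enumerate(energies) if e >= threshold]


def rank_priorities(indices, energies):
    """Integer weights by energy rank: the strongest signal gets the weight M."""
    ranked = sorted(indices, key=lambda i: energies[i], reverse=True)
    m = len(ranked)
    return {i: m - rank for rank, i in enumerate(ranked)}


def allocate_by_priority(priorities, total_bits):
    """Split total_bits in proportion to the integer priority weights.

    Bits lost to rounding down are handed out one each, starting with
    the highest-priority signal.
    """
    weight_sum = sum(priorities.values())
    bits = {i: total_bits * w // weight_sum for i, w in priorities.items()}
    leftover = total_bits - sum(bits.values())
    by_priority = sorted(priorities, key=priorities.get, reverse=True)
    for i in by_priority[:leftover]:
        bits[i] += 1
    return bits


energies = [0.9, 0.05, 0.4, 0.2]                   # T = 4 signals
first_set = select_first_set(energies, 0.1)         # M = 3 of them
priorities = rank_priorities(first_set, energies)
allocation = allocate_by_priority(priorities, 100)
```

Signal 1 falls below the threshold and receives no bits, so the whole budget is shared among the three selected signals.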