Patent classifications
G10L19/032
MAINTAINING INVARIANCE OF SENSORY DISSONANCE AND SOUND LOCALIZATION CUES IN AUDIO CODECS
A method including receiving a plurality of audio channels based on an audio stream, applying a model based on at least one acoustic perception algorithm to the plurality of audio channels to generate a first modelled audio stream, quantizing the plurality of audio channels using a first set of quantization parameters, dequantizing the quantized plurality of audio channels using the first set of quantization parameters, applying the model based on at least one acoustic perception algorithm to the dequantized plurality of audio channels to generate a second modelled audio stream, comparing the first modelled audio stream and the second modelled audio stream, in response to determining the comparison of the first modelled audio stream and the second modelled audio stream does not meet a criterion, generating a second set of quantization parameters, and quantizing the plurality of audio channels using the second set of quantization parameters.
Signal encoding method and device and signal decoding method and device
A spectrum encoding method includes selecting an important spectral component in band units for a normalized spectrum and encoding information of the selected important spectral component for a band, based on a number, a position, a magnitude and a sign thereof. A spectrum decoding method includes obtaining from a bitstream, information about an important spectral component for a band of an encoded spectrum and decoding the obtained information of the important spectral component, based on a number, a position, a magnitude and a sign of the important spectral component.
Signal encoding method and device and signal decoding method and device
A spectrum encoding method includes selecting an important spectral component in band units for a normalized spectrum and encoding information of the selected important spectral component for a band, based on a number, a position, a magnitude and a sign thereof. A spectrum decoding method includes obtaining from a bitstream, information about an important spectral component for a band of an encoded spectrum and decoding the obtained information of the important spectral component, based on a number, a position, a magnitude and a sign of the important spectral component.
Methods and systems for encoding frequency-domain data
An illustrative frequency-domain encoder system transforms time-domain data representative of a content instance into frequency-domain data representative of the content instance. The frequency-domain data includes a plurality of complex coefficients each representing different frequency components of a plurality of frequency components incorporated by the content instance. The frequency-domain encoder system generates a frequency-domain data container that includes the complex coefficients of the frequency-domain data and metadata descriptive of the frequency-domain data. Additionally, within the frequency-domain data container, the frequency-domain encoder system integrates the complex coefficients of the frequency-domain data with timing data representative of a time-dependent feature of the content instance. Corresponding systems and methods are also disclosed.
Optimized audio forwarding
Methods and systems for optimizing a routing of audio data to audio transmitting devices using a Bluetooth network are disclosed. One method includes receiving an encoded audio bitstream at a first speaker of the audio rendering system comprising a first and a second audio channels, separating a first set of spectral components of the first audio channel and a second set of spectral components of the second audio channel from the encoded audio bitstream, without decoding the audio bitstream, generating a first encoded bitstream from the first set of spectral components, and forwarding the first encoded bitstream to a second speaker of the audio rendering system over the wireless link.
Optimized audio forwarding
Methods and systems for optimizing a routing of audio data to audio transmitting devices using a Bluetooth network are disclosed. One method includes receiving an encoded audio bitstream at a first speaker of the audio rendering system comprising a first and a second audio channels, separating a first set of spectral components of the first audio channel and a second set of spectral components of the second audio channel from the encoded audio bitstream, without decoding the audio bitstream, generating a first encoded bitstream from the first set of spectral components, and forwarding the first encoded bitstream to a second speaker of the audio rendering system over the wireless link.
Method and device for determining loss function for audio signal
A loss function of a signal including an audio signal is determined. A loss function determining system for an audio signal is provided. A loss function is determined by: determining a reference quantization index by quantizing an original input signal; inputting the original input signal to a neural network classifier and applying an activation function to an output layer of the neural network classifier; and determining a total loss function for the neural network classifier using an output of the activation function and the reference quantization index.
Method and device for determining loss function for audio signal
A loss function of a signal including an audio signal is determined. A loss function determining system for an audio signal is provided. A loss function is determined by: determining a reference quantization index by quantizing an original input signal; inputting the original input signal to a neural network classifier and applying an activation function to an output layer of the neural network classifier; and determining a total loss function for the neural network classifier using an output of the activation function and the reference quantization index.
A PSYCHOACOUSTIC MODEL FOR AUDIO PROCESSING
The present disclosure relates to the field of audio coding, in particular, it relates to a method for encoding audio signals through a masking model based on a hearing threshold of frequency intervals of the audio signal and a measured energy of the audio signal for the corresponding frequency intervals. The disclosure further relates to an encoder that is capable of carrying out the audio encoding method.
A PSYCHOACOUSTIC MODEL FOR AUDIO PROCESSING
The present disclosure relates to the field of audio coding, in particular, it relates to a method for encoding audio signals through a masking model based on a hearing threshold of frequency intervals of the audio signal and a measured energy of the audio signal for the corresponding frequency intervals. The disclosure further relates to an encoder that is capable of carrying out the audio encoding method.