G10L19/24

Backward-compatible integration of harmonic transposer for high frequency reconstruction of audio signals

A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag.

Backward-compatible integration of harmonic transposer for high frequency reconstruction of audio signals

A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag.

Methods and apparatus for rate quality scalable coding with generative models
11621011 · 2023-04-04 · ·

Described herein is a method of decoding an audio or speech signal, the method including the steps of: (a) receiving, by a decoder, a coded bitstream including the audio or speech signal and conditioning information; (b) providing, by a bitstream decoder, decoded conditioning information in a format associated with a first bitrate; (c) converting, by a converter, the decoded conditioning information from the format associated with the first bitrate to a format associated with a second bitrate; and (d) providing, by a generative neural network, a reconstruction of the audio or speech signal according to a probabilistic model conditioned by the conditioning information in the format associated with the second bitrate. Described are further an apparatus for decoding an audio or speech signal, a respective encoder, a system of the encoder and the apparatus for decoding an audio or speech signal as well as a respective computer program product.

Methods and apparatus for rate quality scalable coding with generative models
11621011 · 2023-04-04 · ·

Described herein is a method of decoding an audio or speech signal, the method including the steps of: (a) receiving, by a decoder, a coded bitstream including the audio or speech signal and conditioning information; (b) providing, by a bitstream decoder, decoded conditioning information in a format associated with a first bitrate; (c) converting, by a converter, the decoded conditioning information from the format associated with the first bitrate to a format associated with a second bitrate; and (d) providing, by a generative neural network, a reconstruction of the audio or speech signal according to a probabilistic model conditioned by the conditioning information in the format associated with the second bitrate. Described are further an apparatus for decoding an audio or speech signal, a respective encoder, a system of the encoder and the apparatus for decoding an audio or speech signal as well as a respective computer program product.

Backward-compatible integration of high frequency reconstruction techniques for audio signals

A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag.

ELECTRONIC DEVICE FOR PERFORMING AUDIO STREAMING AND OPERATING METHOD THEREOF

An electronic device includes a memory configured to store computer-executable instructions; and a processor configured to execute the computer-executable instructions to: based on a result of analyzing a transmission environment of a wireless communication channel through which an audio signal is transmitted, determine a bitrate of the audio signal, encode the audio signal into packets according to the bitrate, the packets including a main packet for audio streaming and a plurality of extension packets for sound quality improvement, based on at least one of a type of the packets and the result of analyzing the transmission environment, determine a packet type indicating a modulation scheme and number of time slots used for transmitting each packet of the packets, and configure and transmit audio packets reflecting the packet type for each packet of the packets.

ELECTRONIC DEVICE FOR PERFORMING AUDIO STREAMING AND OPERATING METHOD THEREOF

An electronic device includes a memory configured to store computer-executable instructions; and a processor configured to execute the computer-executable instructions to: based on a result of analyzing a transmission environment of a wireless communication channel through which an audio signal is transmitted, determine a bitrate of the audio signal, encode the audio signal into packets according to the bitrate, the packets including a main packet for audio streaming and a plurality of extension packets for sound quality improvement, based on at least one of a type of the packets and the result of analyzing the transmission environment, determine a packet type indicating a modulation scheme and number of time slots used for transmitting each packet of the packets, and configure and transmit audio packets reflecting the packet type for each packet of the packets.

SYSTEM AND METHOD FOR PROVIDING HIGH QUALITY AUDIO COMMUNICATION OVER LOW BIT RATE CONNECTION
20230154474 · 2023-05-18 ·

A system and method for provide high quality audio in real-time communication over low bit rate network connections. The system includes real-time communication software application having an improved encoder and an improved decoder. The encoder decomposes audio data based on two frequency ranges corresponding to a super wideband mode and a wideband mode into a lower sub-band and a higher sub-band. Audio features are extracted from the lower sub-band and higher sub-band audio data. The audio features are quantized and packaged. The decoder reconstructs the audio data for playback on the receiving device based on the compressed audio features in the super wideband mode and the wideband mode.

RESAMPLING AN AUDIO SIGNAL FOR LOW-DELAY ENCODING/DECODING
20170372714 · 2017-12-28 ·

A method and device for resampling an audio frequency signal in an audio frequency signal coding or decoding. The method includes the following acts for each signal block to be resampled: determining, by adaptive linear prediction, a number of future signal samples, this number being defined as a function of a chosen resampling delay; constructing a resampling support vector from at least samples of the current block and determined future signal samples; applying a resampling filter to the samples of the resampling support vector.

AUDIO DECODING USING INTERMEDIATE SAMPLING RATE

A method for processing a signal includes receiving a first frame of an input audio bitstream at a decoder. The first frame includes at least one signal associated with a frequency range. The method also includes decoding the at least one signal to generate at least one decoded signal having an intermediate sampling rate. The intermediate sampling rate is based on coding information associated with the first frame. The method further includes generating a resampled signal based at least in part on the at least one decoded signal. The resampled signal has an output sampling rate of the decoder.