G10L19/173

METHODS AND APPARATUSES FOR DTX HANGOVER IN AUDIO CODING

Transmitting node and receiving node for audio coding and methods therein. The nodes being operable to encode/decode speech and to apply a discontinuous transmission (DTX) scheme comprising transmission/reception of Silence Insertion Descriptor (SID) frames during speech inactivity. The method in the transmitting node comprising determining, from amongst a number N of hangover frames, a set Y of frames being representative of background noise, and further transmitting the N hangover frames, comprising at least said set Y of frames, to the receiving node. The method further comprises transmitting a first SID frame to the receiving node in association with the transmission of the N hangover frames, where the SID frame comprises information indicating the determined set Y of hangover frames to the receiving node. The method enables the receiving node to generate comfort noise based on the hangover frames most adequate for the purpose.

LOW BITRATE AUDIO ENCODING/DECODING SCHEME HAVING CASCADED SWITCHES

An audio encoder has a first information sink oriented encoding branch, a second information source or SNR oriented encoding branch, and a switch for switching between the first encoding branch and the second encoding branch, wherein the second encoding branch has a converter into a specific domain different from the spectral domain, and wherein the second encoding branch furthermore has a specific domain coding branch, and a specific spectral domain coding branch, and an additional switch for switching between the specific domain coding branch and the specific spectral domain coding branch. An audio decoder has a first domain decoder, a second domain decoder for decoding a signal, and a third domain decoder and two cascaded switches for switching between the decoders.

CODING OF A SOUNDFIELD REPRESENTATION
20190259397 · 2019-08-22 ·

A method includes: receiving a representation of a soundfield, the representation characterizing the soundfield around a point in space; decomposing the received representation into independent signals; and encoding the independent signals, wherein a quantization noise for any of the independent signals has a common spatial profile with the independent signal.

DYNAMIC DECODER CONFIGURATION FOR LIVE TRANSCODING
20240163464 · 2024-05-16 · ·

A method and system for managing transcoding of data in a stream that includes identifying an input source change for the stream with a new input source type, and adding a decoder for the new input source type, the decoder configured to output for a respective encoder in a transcoder pipeline.

Signature tuning filters
10375476 · 2019-08-06 · ·

A method of providing audio information, said method comprising: (i) receiving audio filter settings in a client device; (ii) receiving audio data in the client device, wherein the received audio data is in an audio coding format; (iii) converting the audio filter settings to an audio filter signal in a processor of the client device, where the audio filter signal is a time-varying signal; (iv) converting the received audio data to an audio signal in a processor of the client device, where the audio signal is a time-varying signal; and (v) transmitting the converted audio filter signal and the converted audio signal from the client device to an audio output device, where the audio output device is separate from and in communication with the client device, and where the audio output device is configured for modifying the audio signal according to the audio filter signal to generate a time-varying audio output.

METHODS AND APPARATUS FOR SUPPORTING COMMUNICATION OF CONTENT STREAMS USING EFFICIENT MEMORY ORGANIZATION

Context values used by filters corresponding to different streams are stored in an interleaved manner in a block of contiguous memory locations, e.g., of a buffer, to facilitate coalesced access of data units, e.g., context values, corresponding to different channels to facilitate filtering of multiple streams, e.g., audio streams, in parallel using a graphics processing unit. Context values corresponding to different channels are intentionally grouped together on an interleaved per channel basis in staging memory. This allows context values for multiple different streams to be transferred, e.g., loaded, as a single block e.g., with a context value of each different channel being loaded into a different GPU core. By organizing the context values to facilitate GPU processing, device operation is improved and execution efficiency is achieved as compared to what would be the case if multiple non-contiguous memory locations had to be implemented to load the cores of the GPU.

METHODS AND APPARATUS FOR SUPPORTING ENCODING, DECODING AND/OR TRANSCODING OF CONTENT STREAMS IN A COMMUNICATION SYSTEM
20190221223 · 2019-07-18 ·

Methods and systems for supporting parallel processing utilizing Central Processing Unit(s) (CPU(s)) and at least one Graphics Processing Unit (GPU) device to provide high scale processing of content streams. An exemplary method embodiment including the steps of: receiving at a CPU multiple data units corresponding to a first frame time for each of first through Nth content streams; sequentially processing by the CPU data units corresponding to different content streams and the first frame time; operating a set of cores of a GPU, in parallel, to perform processing on a set of data units, processing including operating each core of the set of cores to perform an operation on a data unit corresponding to a single one of the first plurality of content streams, each core in the set of cores processing a data unit of a different content stream, said processing generating a set of generated data units.

Apparatus and method for providing enhanced guided downmix capabilities for 3D audio

An apparatus for downmixing three or more audio input channels to obtain two or more audio output channels is provided. The apparatus includes a receiving interface for receiving the three or more audio input channels and for receiving side information. Moreover, the apparatus includes a downmixer for downmixing the three or more audio input channels depending on the side information to obtain the two or more audio output channels. The number of the audio output channels is smaller than the number of the audio input channels. The side information indicates a characteristic of at least one of the three or more audio input channels, or a characteristic of one or more sound waves recorded within the one or more audio input channels, or a characteristic of one or more sound sources which emitted one or more sound waves recorded within the one or more audio input channels.

Coding of a soundfield representation

A method includes: receiving a representation of a soundfield, the representation characterizing the soundfield around a point in space; decomposing the received representation into independent signals; and encoding the independent signals, wherein a quantization noise for any of the independent signals has a common spatial profile with the independent signal.

Low bitrate audio encoding/decoding scheme having cascaded switches

An audio encoder has a first information sink oriented encoding branch, a second information source or SNR oriented encoding branch, and a switch for switching between the first encoding branch and the second encoding branch, wherein the second encoding branch has a converter into a specific domain different from the spectral domain, and wherein the second encoding branch furthermore has a specific domain coding branch, and a specific spectral domain coding branch, and an additional switch for switching between the specific domain coding branch and the specific spectral domain coding branch. An audio decoder has a first domain decoder, a second domain decoder for decoding a signal, and a third domain decoder and two cascaded switches for switching between the decoders.