G10L19/18

ENCODING DEVICE, DECODING DEVICE, AND COMMUNICATION SYSTEM FOR EXTENDING VOICE BAND
20170330584 · 2017-11-16

A first encoding unit generates a first encoded signal by encoding a component within a first band in a voice signal. A frequency shifting unit shifts the frequency of a component within a second band in the voice signal, the second band having a frequency higher than that of the first band, to the frequency of a component within the first band. A second encoding unit generates a second encoded signal by encoding the component whose frequency has been shifted in the frequency shifting unit. An output unit outputs both the first encoded signal generated in the first encoding unit and the second encoded signal generated in the second encoding unit.
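
The band split and frequency shift described above can be illustrated with a minimal sketch. It assumes the shift is done on DFT bins and that each "encoder" is a toy uniform quantiser of bin magnitudes; the function names, `band_bins`, and `step` are hypothetical and are not taken from the application.

```python
import cmath
import math


def dft(signal):
    """Naive discrete Fourier transform (adequate for short illustrative frames)."""
    n = len(signal)
    return [
        sum(signal[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def split_and_shift(signal, band_bins):
    """Return the first band and the second band shifted down onto it.

    The first band is bins [0, band_bins). The second band is bins
    [band_bins, 2 * band_bins); re-indexing it from 0 moves each of its
    components to a frequency inside the first band.
    """
    spectrum = dft(signal)
    first_band = spectrum[:band_bins]
    second_band_shifted = spectrum[band_bins:2 * band_bins]
    return first_band, second_band_shifted


def encode_band(band, step=1.0):
    """Toy encoder (assumption): uniform quantisation of bin magnitudes."""
    return [round(abs(c) / step) for c in band]


def encode(signal, band_bins):
    """Output both encoded signals, as the output unit does."""
    first_band, second_band_shifted = split_and_shift(signal, band_bins)
    return {
        "first": encode_band(first_band),
        "second": encode_band(second_band_shifted),
    }


# A tone at bin 12 of a 32-sample frame lies in the second band (bins 8..15).
tone = [math.cos(2 * math.pi * 12 * t / 32) for t in range(32)]
encoded = encode(tone, band_bins=8)
```

After the shift, the tone appears at index 4 of the second encoded signal, so a codec designed for the first band can carry it.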

SYSTEMS AND METHODS OF AUDIO DECODER DETERMINATION AND SELECTION
20220366917 · 2022-11-17

Playback devices can support audio encoded using various encoding schemes. Playing back such content includes receiving, at a playback device, audio data from an audio source; and receiving an indication from the audio source that the audio data is encoded in a compressed audio format. The device determines, independently of the indication received from the audio source, whether the audio data is in fact encoded in the compressed audio format. If the audio data is determined to be encoded in the compressed audio format, the device selects a decoder from among a plurality of decoders, decodes the audio data using the selected decoder, and plays back the decoded audio data via the playback device. If the audio data is determined not to be encoded in the compressed audio format, the device inhibits playback of the audio data.
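
The decision flow in this abstract can be sketched as follows. The magic prefixes `FMTA` and `FMTB`, the toy decoders, and the function names are all hypothetical; the abstract does not say how the device inspects the data, so prefix matching is only an assumption chosen for illustration.

```python
# Hypothetical table of decoders keyed by a made-up magic prefix.
DECODERS = {
    b"FMTA": lambda payload: payload[::-1],    # toy "decoder": reverse bytes
    b"FMTB": lambda payload: payload.upper(),  # toy "decoder": upper-case bytes
}


def detect_format(audio_data):
    """Inspect the data itself; the source's indication is not consulted."""
    for magic in DECODERS:
        if audio_data.startswith(magic):
            return magic
    return None


def handle_audio(audio_data, source_says_compressed):
    """Return decoded audio for playback, or None when playback is inhibited.

    source_says_compressed is received but deliberately unused, which is
    what makes the determination independent of the indication.
    """
    magic = detect_format(audio_data)
    if magic is None:
        return None  # not in a recognised compressed format: inhibit playback
    decoder = DECODERS[magic]  # select one decoder from the plurality
    return decoder(audio_data[len(magic):])
```

For example, `handle_audio(b"rawpcm", True)` returns `None` even though the source claimed the data was compressed.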

Systems, methods, apparatus, and computer-readable media for backward-compatible audio coding
09788133 · 2017-10-10

Systems, methods, and apparatus for backward-compatible coding of a set of basis function coefficients that describe a sound field are presented.

METHOD AND APPARATUS FOR DETERMINING CODING MODE

Provided is a method and apparatus for determining a signal coding mode. The signal coding mode may be determined or changed according to whether a current frame corresponds to a silence period and by using a history of speech or music presence possibilities.
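
One way to read this abstract is that the mode may change only during silence, using a running history of speech-presence estimates. The sketch below makes that assumption explicit; the class name, the energy threshold, the history length, and the 0.5 decision point are hypothetical and are not taken from the application.

```python
from collections import deque


class ModeSelector:
    """Toy coding-mode selector driven by silence and a probability history."""

    def __init__(self, history_len=4, silence_energy=0.01):
        self.history = deque(maxlen=history_len)  # recent speech probabilities
        self.silence_energy = silence_energy      # assumed silence threshold
        self.mode = "speech"                      # assumed initial mode

    def update(self, frame_energy, speech_probability):
        """Process one frame and return the coding mode to use for it."""
        is_silence = frame_energy < self.silence_energy
        if not is_silence:
            # Active frame: record the estimate but keep the current mode.
            self.history.append(speech_probability)
        elif self.history:
            # Silent frame: a mode change is allowed, based on the history.
            average = sum(self.history) / len(self.history)
            self.mode = "speech" if average >= 0.5 else "music"
        return self.mode
```

Restricting switches to silent frames is a design choice of this sketch: a change of coder during silence is less likely to be audible than one in the middle of active content.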

AUDIO ENCODER AND DECODER USING A FREQUENCY DOMAIN PROCESSOR, A TIME DOMAIN PROCESSOR, AND A CROSS PROCESSING FOR CONTINUOUS INITIALIZATION

An audio encoder for encoding an audio signal includes: a first encoding processor for encoding a first audio signal portion in a frequency domain, wherein the first encoding processor includes: a time frequency converter for converting the first audio signal portion into a frequency domain representation having spectral lines up to a maximum frequency of the first audio signal portion; a spectral encoder for encoding the frequency domain representation; a second encoding processor for encoding a second, different audio signal portion in the time domain; a cross-processor for calculating, from the encoded spectral representation of the first audio signal portion, initialization data of the second encoding processor, so that the second encoding processor is initialized to encode the second audio signal portion immediately following the first audio signal portion in time in the audio signal; a controller configured for analyzing the audio signal and for determining which portion of the audio signal is the first audio signal portion encoded in the frequency domain and which portion of the audio signal is the second audio signal portion encoded in the time domain; and an encoded signal former for forming an encoded audio signal including a first encoded signal portion for the first audio signal portion and a second encoded signal portion for the second audio signal portion.
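
The role of the cross-processor can be shown with a heavily simplified sketch. It assumes a naive quantised DFT as the frequency-domain coder and a first-order difference coder as the time-domain coder, whose only state is the previous sample. All names, the quantiser step, and the caller-supplied `is_fd` flags (standing in for the controller) are hypothetical.

```python
import cmath
import math

STEP = 0.5  # assumed quantiser step for the toy spectral encoder


def fd_encode(frame):
    """Frequency-domain path: naive DFT followed by uniform quantisation."""
    n = len(frame)
    spectrum = [
        sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]
    return [(round(c.real / STEP), round(c.imag / STEP)) for c in spectrum]


def fd_decode(code):
    """Inverse of fd_encode, up to quantisation error."""
    n = len(code)
    spectrum = [complex(re * STEP, im * STEP) for re, im in code]
    return [
        sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)).real / n
        for t in range(n)
    ]


def cross_process(fd_code):
    """Derive time-domain coder state from the encoded spectrum only."""
    return fd_decode(fd_code)[-1]  # last reconstructed sample


def td_encode(frame, memory):
    """Time-domain path: first-order difference coder seeded with memory."""
    residuals = []
    previous = memory
    for sample in frame:
        residuals.append(sample - previous)
        previous = sample
    return residuals, previous


def encode_signal(frames, is_fd):
    """Form the encoded signal; is_fd plays the role of the controller."""
    encoded, memory = [], 0.0
    for frame, use_fd in zip(frames, is_fd):
        if use_fd:
            code = fd_encode(frame)
            memory = cross_process(code)  # keeps the other coder initialised
            encoded.append(("fd", code))
        else:
            residuals, memory = td_encode(frame, memory)
            encoded.append(("td", residuals))
    return encoded


frames = [[1.0, 1.0, 1.0, 1.0], [1.0, 2.0, 3.0, 4.0]]
encoded = encode_signal(frames, is_fd=[True, False])
```

Here the first residual of the time-domain frame is about 0 because the coder starts from the reconstructed sample 1.0; without the cross-processing step it would start from 0.0 and produce a spurious jump at the switch.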

CODEC-SPECIFIC RADIO LINK ADAPTATION
20170289319 · 2017-10-05

A codec-specific radio link adaptation procedure can be implemented at the radio access network (RAN) level. A process includes: receiving, from a mobile device, a bitstream carrying speech data as part of a communication session; determining, based at least in part on a result of inspecting packets of the bitstream or a notification message from an Internet Protocol (IP) Multimedia Subsystem (IMS) core, that the bitstream is associated with a codec that provides improved performance of the mobile device as compared to legacy codecs in similar radio conditions; and adapting a radio link for the communication session according to the codec.
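
The three steps of the process can be sketched as below. Everything concrete here is an assumption made for illustration: the codec name `codec-x`, the dictionary-shaped packets and notification, and the two link parameters are hypothetical, since the abstract names neither the codec nor the link settings that are adapted.

```python
from dataclasses import dataclass

# Hypothetical set of codecs treated as more robust than legacy codecs.
ROBUST_CODECS = {"codec-x"}


@dataclass
class LinkConfig:
    """Made-up radio link parameters used only for this illustration."""
    target_block_error_rate: float
    max_retransmissions: int


LEGACY_CONFIG = LinkConfig(target_block_error_rate=0.01, max_retransmissions=2)
ROBUST_CONFIG = LinkConfig(target_block_error_rate=0.05, max_retransmissions=4)


def identify_codec(packets, ims_notification=None):
    """Find the codec from an IMS notification or by inspecting packets."""
    if ims_notification is not None:
        return ims_notification.get("codec")
    for packet in packets:
        codec = packet.get("codec")  # simplified stand-in for packet inspection
        if codec is not None:
            return codec
    return None


def adapt_radio_link(packets, ims_notification=None):
    """Choose a link configuration according to the codec in use."""
    codec = identify_codec(packets, ims_notification)
    return ROBUST_CONFIG if codec in ROBUST_CODECS else LEGACY_CONFIG
```

Either source of information is sufficient on its own in this sketch, mirroring the "or" in the abstract; an unknown codec falls back to the legacy settings.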

Adaptive codebook gain control for speech coding
09747915 · 2017-08-29

In accordance with one aspect of the invention, a selector supports the selection of a first encoding scheme or a second encoding scheme based upon the detection or absence of a triggering characteristic in an interval of an input speech signal. The first encoding scheme has a pitch pre-processing procedure for processing the input speech signal to form a revised speech signal biased toward an ideal voiced and stationary characteristic. The pre-processing procedure allows the encoder to fully capture the benefits of a bandwidth-efficient, long-term predictive procedure for a greater amount of speech components of the input speech signal than would otherwise be possible. In accordance with another aspect of the invention, the second encoding scheme entails a long-term prediction mode for encoding the pitch on a sub-frame by sub-frame basis. The long-term prediction mode is tailored to cases where the generally periodic component of the speech is not stationary or is less than completely periodic, and therefore requires more frequent updates from the adaptive codebook to achieve a desired perceptual quality of the reproduced speech under a long-term predictive procedure.
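
The selection between the two schemes can be sketched as follows. The abstract does not define the triggering characteristic, so this sketch assumes it is "stationary voiced speech", detected by a small spread of per-sub-frame pitch lags; the threshold `max_spread` and all function names are hypothetical, and the pitch pre-processing itself is not modelled.

```python
def is_stationary_voiced(subframe_lags, max_spread=2):
    """Assumed trigger: pitch lags vary little across the frame."""
    return max(subframe_lags) - min(subframe_lags) <= max_spread


def encode_pitch(subframe_lags):
    """Pick a scheme and return the pitch information it would transmit."""
    if is_stationary_voiced(subframe_lags):
        # First scheme: one representative lag for the whole frame, which
        # is what makes the long-term prediction bandwidth-efficient.
        representative = sorted(subframe_lags)[len(subframe_lags) // 2]
        return {"scheme": "first", "pitch": [representative]}
    # Second scheme: the lag is updated on a sub-frame by sub-frame basis.
    return {"scheme": "second", "pitch": list(subframe_lags)}
```

With lags `[40, 41, 40, 41]` the sketch sends a single value, while `[40, 55, 38, 60]` falls back to four per-sub-frame values, reflecting the trade-off between bit rate and update frequency described above.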