Patent classifications
G10L19/0212
Method and apparatus to encode and decode an audio/speech signal
A method and apparatus to encode and decode an audio/speech signal is provided. An inputted audio signal or speech signal may be transformed into at least one of a high frequency resolution signal and a high temporal resolution signal. The signal may be encoded by determining an appropriate resolution, the encoded signal may be decoded, and thus the audio signal, the speech signal, and a mixed signal of the audio signal and the speech signal may be processed.
APPARATUS AND METHOD FOR ENCODING OR DECODING AN AUDIO SIGNAL WITH INTELLIGENT GAP FILLING IN THE SPECTRAL DOMAIN
An apparatus for decoding an encoded audio signal, includes a spectral domain audio decoder for generating a first decoded representation of a first set of first spectral portions, the decoded representation having a first spectral resolution; a parametric decoder for generating a second decoded representation of a second set of second spectral portions having a second spectral resolution being lower than the first spectral resolution; a frequency regenerator for regenerating every constructed second spectral portion having the first spectral resolution using a first spectral portion and spectral envelope information for the second spectral portion; and a spectrum time converter for converting the first decoded representation and the reconstructed second spectral portion into a time representation.
Audio decoder for interleaving signals
A method for decoding an encoded audio bitstream in an audio processing system is disclosed. The method includes extracting from the encoded audio bitstream a first waveform-coded signal including spectral coefficients corresponding to frequencies up to a first cross-over frequency and performing parametric decoding at a second cross-over frequency to generate a reconstructed signal. The second cross-over frequency is above the first cross-over frequency and the parametric decoding uses reconstruction parameters derived from the encoded audio bitstream to generate the reconstructed signal. The method further includes extracting from the encoded audio bitstream a second waveform-coded signal including spectral coefficients corresponding to a subset of frequencies above the first cross-over frequency and interleaving the second waveform-coded signal with the reconstructed signal to produce an interleaved signal. The interleaved signal is then combined with the first waveform-coded signal.
SIGNAL ENCODING METHOD AND APPARATUS AND SIGNAL DECODING METHOD AND APPARATUS
A spectrum coding method includes quantizing spectral data of a current band based on a first quantization scheme, generating a lower bit of the current band using the spectral data and the quantized spectral data, quantizing a sequence of lower bits including the lower bit of the current band based on a second quantization scheme, and generating a bitstream based on a upper bit excluding N bits, where N is 1 or greater, from the quantized spectral data and the quantized sequence of lower bits.
Integrated sensor-array processor
An integrated sensor-array processor and method includes sensor array time-domain input ports to receive sensor signals from time-domain sensors. A sensor transform engine (STE) creates sensor transform data from the sensor signals and applies sensor calibration adjustments. Transducer time-domain input ports receive time-domain transducer signals, and a transducer output transform engine (TTE) generates transducer output transform data from the transducer signals. A spatial filter engine (SFE) applies suppression coefficients to the sensor transform data, to suppress target signals received from noise locations and/or amplification locations. A blocking filter engine (BFE) applies subtraction coefficients to the sensor transform data, to subtract the target signals from the sensor transform data. A noise reduction filter engine (NRE) subtracts noise signals from the BFE output. An inverse transform engine (ITE) generates time-domain data from the NRE output.
Sound system
Methods and systems for processing audio data, such as spatial audio data, which modify sound characteristics of a given component of a spatial audio signal based on a relationship between a direction characteristic of the given component and a defined range of direction characteristics. A spatial audio in a format using a spherical harmonic representation of sound components is decoded by performing a transform on the spherical harmonic representation, where the transform is based on a predefined speaker layout and a predefined rule, the predefined rule indicating a speaker gain of each speaker arranged according to the predefined layout, when reproducing sound incident form a given direction to provide alternatives to existing methods of decoding spatial audio streams, which focus on soundfield reconstruction. A plurality of matrix transforms is combined into a combined transform that is performed on an audio signal; this saves processing resources of the audio system.
Encoding apparatus and decoding apparatus for transforming between modified discrete cosine transform-based coder and different coder
An encoding apparatus and a decoding apparatus in a transform between a Modified Discrete Cosine Transform (MDCT)-based coder and a different coder are provided. The encoding apparatus may encode additional information to restore an input signal encoded according to the MDCT-based coding scheme, when switching occurs between the MDCT-based coder and the different coder. Accordingly, an unnecessary bitstream may be prevented from being generated, and minimum additional information may be encoded.
Apparatus and method for decoding or encoding an audio signal using energy information values for a reconstruction band
An apparatus for decoding an encoded audio signal having an encoded representation of a first set of first spectral portions and an encoded representation of parametric data indicating spectral energies for a second set of second spectral portions, has: an audio decoder for decoding the encoded representation of the first set of the first spectral portions to obtain a first set of first spectral portions and for decoding the encoded representation of the parametric data to obtain a decoded parametric data for the second set of second spectral portions indicating, for individual reconstruction bands, individual energies; a frequency regenerator for reconstructing spectral values in a reconstruction band having a second spectral portion using a first spectral portion of the first set of the first spectral portions and an individual energy for the reconstruction band, the reconstruction band having a first spectral portion and the second spectral portion.
ENCODING DEVICE AND METHOD, DECODING DEVICE AND METHOD, AND PROGRAM
The present technology relates to an encoding device and a method, a decoding device and a method, and a program that enables acquisitions of high-quality sound even in a resource-poor setting.
A demultiplexer demultiplexes a supplied code string, to obtain the quantized low-band spectrum, the spectral characteristic code, and the quantized expansion coefficient(s). At this point, the code string includes a single quantized expansion coefficient or quantized expansion coefficients of the respective bands in the high band depending on the spectral characteristic code. A spectral inverse quantization unit obtains the low-band spectrum by inversely quantizing the quantized low-band spectrum. An expansion coefficient inverse quantization unit obtains the expansion coefficient(s) by inversely quantizing the quantized expansion coefficient(s). An expanded spectrum generation unit generates an expanded spectrum, in accordance with the low-band spectrum and the expansion coefficient(s) depending on the spectral characteristic code. An IMDCT unit generates a band-expanded time-series signal from the low-band spectrum and the expanded spectrum. The present technology can be applied to decoding devices.
APPARATUS AND METHOD FOR ENCODING OR DECODING AN AUDIO SIGNAL USING A TRANSIENT-LOCATION DEPENDENT OVERLAP
An apparatus for encoding an audio or image signal, includes: a controllable windower for windowing the audio or image signal to provide the sequence of blocks of windowed samples; a converter for converting the sequence of blocks of windowed samples into a spectral representation including a sequence of frames of spectral values; a transient location detector for identifying a location of a transient within a transient look-ahead region of a frame; and a controller for controlling the controllable windower to apply a specific window having a specified overlap length to the audio or image signal in response to an identified location of the transient, wherein the controller is configured to select the specific window from a group of at least three windows, wherein the specific window is selected based on the transient location.