G10L19/0212

MDCT-Domain Error Concealment

An error-concealing audio decoding method comprises: receiving a packet comprising a set of MDCT coefficients encoding a frame of time-domain samples of an audio signal; identifying the received packet as erroneous; generating estimated MDCT coefficients to replace the set of MDCT coefficients of the erroneous packet, based on corresponding MDCT coefficients associated with a received packet directly preceding the erroneous packet; assigning signs of a first subset of MDCT coefficients of the estimated MDCT coefficients, wherein the first subset comprises such MDCT coefficients that are associated with tonal-like spectral bins, to coincide with signs of corresponding MDCT coefficients of said preceding packet; randomly assigning signs of a second subset of MDCT coefficients of the estimated MDCT coefficients, wherein the second subset comprises MDCT coefficients associated with noise-like spectral bins; replacing the erroneous packet by a concealment packet containing the estimated MDCT coefficients and the signs assigned.

Perceptual audio coding with adaptive non-uniform time/frequency tiling using subband merging and the time domain aliasing reduction

Embodiments provide an audio processor for processing an audio signal to obtain a subband representation of the audio signal. The audio processor is configured to perform a cascaded lapped critically sampled transform on at least two partially overlapping blocks of samples of the audio signal, to obtain a set of subband samples on the basis of a first block of samples of the audio signal, and to obtain a corresponding set of subband samples on the basis of a second block of samples of the audio signal. Further, the audio processor is configured to perform a weighted combination of two corresponding sets of subband samples, one obtained on the basis of the first block of samples of the audio signal and one obtained on the basis on the second block of samples of the audio signal, to obtain an aliasing reduced subband representation of the audio signal.

Decoder for Decoding an Encoded Audio Signal and Encoder for Encoding an Audio Signal
20170365266 · 2017-12-21 ·

A schematic block diagram of a decoder for decoding an encoded audio signal is shown. The decoder includes an adaptive spectrum-time converter and an overlap-add-processor. The adaptive spectrum-time converter converts successive blocks of spectral values into successive blocks of time values, e.g. via a frequency-to-time transform. Furthermore, the adaptive spectrum-time converter receives a control information and switches, in response to the control information, between transform kernels of a first group of transform kernels including one or more transform kernels having different symmetries at sides of a kernel, and a second group of transform kernels including one or more transform kernels having the same symmetries at sides of a transform kernel. Moreover, the overlap-add-processor overlaps and adds the successive blocks of time values to obtain decoded audio values, which may be a decoded audio signal.

HARMONIC TRANSPOSITION IN AN AUDIO CODING METHOD AND SYSTEM
20230197089 · 2023-06-22 · ·

The present invention relates to transposing signals in time and/or frequency and in particular to coding of audio signals. More particular, the present invention relates to high frequency reconstruction (HFR) methods including a frequency domain harmonic transposer. A method and system for generating a transposed output signal from an input signal using a transposition factor T is described. The system comprises an analysis window of length L.sub.a, extracting a frame of the input signal, and an analysis transformation unit of order M transforming the samples into M complex coefficients. M is a function of the transposition factor T. The system further comprises a nonlinear processing unit altering the phase of the complex coefficients by using the transposition factor T, a synthesis transformation unit of order M transforming the altered coefficients into M altered samples, and a synthesis window of length L.sub.s, generating a frame of the output signal.

Filtering in the transformed domain
09847085 · 2017-12-19 · ·

A method for processing a signal in the form of consecutive sample blocks, the method comprising filtering in a transformed domain of sub-bands, and particularly equalization processing, applied to a current block in the transformed domain, and filtering-adjustment processing that is applied in the transformed domain to at least one block adjacent to the current block.

Apparatus and method for generating an encoded signal or for decoding an encoded audio signal using a multi overlap portion

An apparatus for generating an encoded signal includes: a window sequence controller for generating a window sequence information for windowing an audio or image signal, the window sequence information indicating a first window for generating a first frame of spectral values, a second window function and at least one third window function for generating a second frame of spectral values, wherein the first window function, the second window function and the one or more third window functions overlap within a multi-overlap region; a preprocessor for windowing a second block of samples corresponding to the second window function and the at least one third window functions using an auxiliary window function to acquire a second block of windowed samples, a spectrum converter for applying an aliasing-introducing transform; and a processor for processing the first frame and the second frame to acquire encoded frames of the audio or image signal.

MACHINE LEARNING-BASED KEY GENERATION FOR KEY-GUIDED AUDIO SIGNAL TRANSFORMATION
20230186926 · 2023-06-15 ·

A method comprise: receiving input audio and target audio having a target audio characteristic; using a first neural network, trained to generate key parameters that represent the target audio characteristic based on one or more of the target audio and the input audio, generating the key parameters; and configuring a second neural network, trained to be configured by the key parameters, with the key parameters to cause the second neural network to perform a signal transformation of the input audio, to produce output audio having an output audio characteristic corresponding to and that matches the target audio characteristic.

Audio encoding device and audio coding method
09837085 · 2017-12-05 · ·

An audio encoding device includes a processor; and a memory which stores a plurality of instructions, which when executed by the processor, cause the processor to execute: calculating a similarity in phase of a first channel signal and a second channel signal contained in a plurality of channels of an audio signal; and selecting, based on the similarity, a first output that outputs one of the first channel signal and the second channel signal, or a second output that outputs both of the first channel signal and the second channel signal.

Audio decoding device and method with decoding branches for decoding audio signal encoded in a plurality of domains

An audio encoder has a first information sink oriented encoding branch such as a spectral domain encoding branch, a second information source or SNR oriented encoding branch such as an LPC-domain encoding branch, and a switch for switching between the first and second encoding branches, the second encoding branch having a converter into a specific domain different from the spectral domain such as an LPC analysis stage generating an excitation signal, and the second encoding branch having a specific domain coding branch such as LPC domain processing branch, and a specific spectral domain coding branch such as LPC spectral domain processing branch, and an additional switch for switching between the specific domain coding branch and the specific spectral domain coding branch. An audio decoder has a first domain decoder, a second domain decoder, and a third domain decoder as well as two cascaded switches for switching between the decoders.

Enhanced soundfield coding using parametric component generation

The present document relates to multichannel audio coding and more precisely to techniques for discrete multichannel audio encoding and decoding. In particular, the present document relates to systems and method for coding soundfields. An audio encoder (200) configured to encode a frame of a soundfield signal (110) comprising a plurality of audio signals is described. The audio encoder (200) comprises a transform determination unit (203, 204) configured to determine an energy-compacting orthogonal transform (V) based on the frame of the soundfield signal (110). Furthermore, the encoder (200) comprises a transform unit (202) configured to apply the energy-compacting orthogonal transform (V) to the frame of the soundfield signal (110), and configured to provide a frame of a rotated soundfield signal (112) comprising a plurality of rotated audio signals (E1, E2, E3). The audio encoder (200) comprises a waveform encoding unit (103) configured to encode a first rotated audio signal (E1) of the plurality of rotated audio signals (E1, E2, E3), and a parametric encoding unit (104) configured to determine a set of spatial parameters (ae2, be2) for determining a second rotated audio signal (E2) of the plurality of rotated audio signals (E1, E2, E3) based on the first rotated audio signal (E1).