G10L19/0212

Apparatus and Method for Audio Signal Transformation
20230274749 · 2023-08-31 ·

An apparatus for audio signal transformation is provided. The apparatus includes a determination unit configured for determining, using spherical harmonics information, a transformation rule for transforming an audio input signal within a first domain, being different from a spherical harmonics domain. Moreover, the apparatus includes a transformation unit configured for transforming, using the transformation rule, the audio input signal, being represented in the first domain, to obtain a transformed audio signal being represented in the first domain. The spherical harmonics information includes information on a plurality of spherical harmonics and/or includes information being represented in the spherical harmonics domain.

Downscaled decoding

A downscaled version of an audio decoding procedure may more effectively and/or at improved compliance maintenance be achieved if the synthesis window used for downscaled audio decoding is a downsampled version of a reference synthesis window involved in the non-downscaled audio decoding procedure by downsampling by the downsampling factor by which the downsampled sampling rate and the original sampling rate deviate, and downsampled using a segmental interpolation in segments of ¼ of the frame length.

Signal processing apparatus and method, and program to reduce calculation amount based on mute information

The present technology relates to a signal processing apparatus and method, and a program that make it possible to reduce an arithmetic operation amount. The signal processing apparatus performs, on the basis of audio object mute information indicative of whether or not a signal of an audio object is a mute signal, at least either one of a decoding process or a rendering process of an object signal of the audio object. The present technology can be applied to a signal processing apparatus.

POST FILTER FOR AUDIO SIGNALS

In some embodiments, a pitch filter for filtering a preliminary audio signal generated from an audio bitstream is disclosed. The pitch filter has an operating mode selected from one of either: (i) an active mode where the preliminary audio signal is filtered using filtering information to obtain a filtered audio signal, and (ii) an inactive mode where the pitch filter is disabled. The preliminary audio signal is generated in an audio encoder or audio decoder having a coding mode selected from at least two distinct coding modes, and the pitch filter is capable of being selectively operated in either the active mode or the inactive mode while operating in the coding mode based on control information.

Estimating noise of an audio signal in the log2-domain

A method is described that estimates noise in an audio signal. An energy value for the audio signal is estimated and converted into the logarithmic domain. A noise level for the audio signal is estimated based on the converted energy value.

Decoder for decoding an encoded audio signal and encoder for encoding an audio signal

A schematic block diagram of a decoder for decoding an encoded audio signal is shown. The decoder includes an adaptive spectrum-time converter and an overlap-add-processor. The adaptive spectrum-time converter converts successive blocks of spectral values into successive blocks of time values, e.g. via a frequency-to-time transform. Furthermore, the adaptive spectrum-time converter receives a control information and switches, in response to the control information, between transform kernels of a first group of transform kernels including one or more transform kernels having different symmetries at sides of a kernel, and a second group of transform kernels including one or more transform kernels having the same symmetries at sides of a transform kernel. Moreover, the overlap-add-processor overlaps and adds the successive blocks of time values to obtain decoded audio values, which may be a decoded audio signal.

Audio encoder, audio decoder and related methods using two-channel processing within an intelligent gap filling framework

An apparatus for generating a decoded two-channel signal, comprising: a parametric decoder for providing parametric data for a second set of second spectral portions and a two-channel identification identifying for a second spectral portion of the second set of second spectral portions either a first two-channel representation for the second spectral portion of the second set of second spectral portions or a second two-channel representation for the second spectral portion of the second set of second spectral portions, the second two-channel representation being different from the first two-channel representation; and a frequency regenerator for regenerating the second spectral portion of the second set of second spectral portions depending on a first spectral portion of a first set of first spectral portions, the parametric data for the second spectral portion of the second set of second spectral portions and the two-channel identification for the second spectral portion of the second set of second spectral portions to acquire a regenerated second spectral portion of the second set of second spectral portions.

METHODS FOR PHASE ECU F0 INTERPOLATION SPLIT AND RELATED CONTROLLER
20220148602 · 2022-05-12 ·

Controlling a concealment method for a lost audio frame associated with a received audio signal is provided. At least one bin vector of a spectral representation for at least one tone is obtained, wherein the at least one bin vector includes three consecutive bin values for the at least one tone. Whether each of the three consecutive bin values has a complex value or a real value is determined. Responsive to the determination, the three consecutive bin values are processed to estimate a frequency of the at least one tone based on whether each bin value has a complex value or a real value.

Transform Encoding/Decoding of Harmonic Audio Signals

An encoder for encoding frequency transform coefficients of a harmonic audio signal include the following elements: A peak locator configured to locate spectral peaks having magnitudes exceeding a predetermined frequency dependent threshold. A peak region encoder configured to encode peak regions including and surrounding the located peaks. A low-frequency set encoder configured to encode at least one low-frequency set of coefficients outside the peak regions and below a crossover frequency that depends on the number of bits used to encode the peak regions. A noise-floor gain encoder configured to encode a noise-floor gain of at least one high-frequency set of not yet encoded coefficients outside the peak regions.

AUDIO ENCODER, AUDIO DECODER AND RELATED METHODS USING TWO-CHANNEL PROCESSING WITHIN AN INTELLIGENT GAP FILLING FRAMEWORK

An apparatus for generating a decoded two-channel signal, comprising: a parametric decoder for providing parametric data for a second set of second spectral portions and a two-channel identification identifying for a second spectral portion of the second set of second spectral portions either a first two-channel representation for the second spectral portion of the second set of second spectral portions or a second two-channel representation for the second spectral portion of the second set of second spectral portions, the second two-channel representation being different from the first two-channel representation; and a frequency regenerator for regenerating the second spectral portion of the second set of second spectral portions depending on a first spectral portion of a first set of first spectral portions, the parametric data for the second spectral portion of the second set of second spectral portions and the two-channel identification for the second spectral portion of the second set of second spectral portions to acquire a regenerated second spectral portion of the second set of second spectral portions.