G10L19/0204

Processing of audio signals during high frequency reconstruction
11568880 · 2023-01-31 · ·

The application relates to HFR (High Frequency Reconstruction/Regeneration) of audio signals. In particular, the application relates to a method and system for performing HFR of audio signals having large variations in energy level across the low frequency range which is used to reconstruct the high frequencies of the audio signal. A system configured to generate a plurality of high frequency subband signals covering a high frequency interval from a plurality of low frequency subband signals is described. The system comprises means for receiving the plurality of low frequency subband signals; means for receiving a set of target energies, each target energy covering a different target interval within the high frequency interval and being indicative of the desired energy of one or more high frequency subband signals lying within the target interval; means for generating the plurality of high frequency subband signals from the plurality of low frequency subband signals and from a plurality of spectral gain coefficients associated with the plurality of low frequency subband signals, respectively; and means for adjusting the energy of the plurality of high frequency subband signals using the set of target energies.

Audio signal coding apparatus, audio signal decoding apparatus, audio signal coding method, and audio signal decoding method

An audio signal coding apparatus includes a time-frequency transformer that outputs sub-band spectra from an input signal; a sub-band energy quantizer; a tonality calculator that analyzes tonality of the sub-band spectra; a bit allocator that selects a second sub-band on which quantization is performed by a second quantizer on the basis of the analysis result of the tonality and quantized sub-band energy, and determines a first number of bits to be allocated to a first sub-band on which quantization is performed by a first quantizer; the first quantizer that performs first coding using the first number of bits; the second quantizer that performs coding using a second coding method; and a multiplexer.

METHOD OF ENCODING AUDIO SIGNAL AND ENCODER, METHOD OF DECODING AUDIO SIGNAL AND DECODER

A method of encoding an audio signal and an encoder and a method of decoding an audio signal and a decoder are provided. The method of encoding an audio signal includes outputting a decoded signal by using a bitstream that encodes an audio signal, separating the decoded signal into a low-band signal and a high-band signal by using a sound source separator, upsampling the low-band signal, upsampling the high-band signal, and restoring the audio signal by synthesizing the upsampled low-band signal with the upsampled high-band signal, wherein the bitstream is generated by encoding a superimposed signal in which a signal in a high frequency band of the audio signal is superimposed on a low frequency band of the audio signal.

Apparatus and method for generating an enhanced signal using independent noise-filling

An apparatus for generating an enhanced signal from an input signal, wherein the enhanced signal has spectral values for an enhancement spectral region, the spectral values for the enhancement spectral regions not being contained in the input signal, includes a mapper for mapping a source spectral region of the input signal to a target region in the enhancement spectral region, the source spectral region including a noise-filling region; and a noise filler configured for generating first noise values for the noise-filling region in the source spectral region of the input signal and for generating second noise values for a noise region in the target region, wherein the second noise values are decorrelated from the first noise values or for generating second noise values for a noise region in the target region, wherein the second noise values are decorrelated from first noise values in the source region.

Audio decoder and decoding method

A method for representing a second presentation of audio channels or objects as a data stream, the method comprising the steps of: (a) providing a set of base signals, the base signals representing a first presentation of the audio channels or objects; (b) providing a set of transformation parameters, the transformation parameters intended to transform the first presentation into the second presentation; the transformation parameters further being specified for at least two frequency bands and including a set of multi-tap convolution matrix parameters for at least one of the frequency bands.

Signal encoding method and device and signal decoding method and device
11705142 · 2023-07-18 · ·

A spectrum encoding method includes selecting an important spectral component in band units for a normalized spectrum and encoding information of the selected important spectral component for a band, based on a number, a position, a magnitude and a sign thereof. A spectrum decoding method includes obtaining from a bitstream, information about an important spectral component for a band of an encoded spectrum and decoding the obtained information of the important spectral component, based on a number, a position, a magnitude and a sign of the important spectral component.

Inter-channel bandwidth extension spectral mapping and adjustment

A method includes generating a synthesized non-reference high-band channel based on a non-reference high-band excitation corresponding to a non-reference target channel. The method further includes estimating one or more spectral mapping parameters based on the synthesized non-reference high-band channel and a high-band portion of the non-reference target channel. The method also includes applying the one or more spectral mapping parameters to the synthesized non-reference high-band channel to generate a spectrally shaped synthesized non-reference high-band channel. The method further includes generating an encoded bitstream based on the one or more spectral mapping parameters and the spectrally shaped synthesized non-reference high-band channel.

Methods for phase ECU F0 interpolation split and related controller
11705136 · 2023-07-18 · ·

Controlling a concealment method for a lost audio frame associated with a received audio signal is provided. At least one bin vector of a spectral representation for at least one tone is obtained, wherein the at least one bin vector includes three consecutive bin values for the at least one tone. Whether each of the three consecutive bin values has a complex value or a real value is determined. Responsive to the determination, the three consecutive bin values are processed to estimate a frequency of the at least one tone based on whether each bin value has a complex value or a real value.

ENCODING OF MULTI-CHANNEL AUDO SIGNALS COMPRISING DOWNMIXING OF A PRIMARY AND TWO OR MORE SCALED NON-PRIMARY INPUT CHANNELS

Systems, methods, and computer program products are disclosed for adaptive downmixing of audio signals with improved continuity. An audio encoding system receives an input multi-channel audio signal including a primary input audio channel and L non-primary input audio channels. The system determines a set of L input gains. For each of the channels and gains, the system forms a respective scaled non-primary input audio channel. The system forms a primary output audio channel from the sum of the primary input audio channel and the scaled non-primary input audio channels. The system determines a set of L prediction gains. The system forms a prediction channel from the primary output audio channel. The system forms L non-primary output audio channels. The system forms an output multi-channel audio signal from the primary output audio channel and the L non-primary output audio channels.

Crosstalk data detection method and electronic device
11551706 · 2023-01-10 · ·

A method and an electronic device for detecting crosstalk data are provided. The method for detecting crosstalk data can detect whether an audio data stream includes crosstalk data. The method includes: receiving a first audio data block, a second audio data block, and a reference time difference, wherein the first audio data block and the second audio data block separately include a plurality of audio data segments; using a time difference between an acquisition time of an audio data segment in the first audio data block and a corresponding audio data segment in the second audio data block as an audio segment time difference; and determining that the audio data segment of the first audio data block includes crosstalk data when the audio segment time difference does not match the reference time difference.