G10L19/025

AUDIO PROCESSING FOR TEMPORALLY MISMATCHED SIGNALS

A device includes a processor and a transmitter. The processor is configured to determine a first mismatch value indicative of a first amount of a temporal mismatch between a first audio signal and a second audio signal. The processor is also configured to determine a second mismatch value indicative of a second amount of a temporal mismatch between the first audio signal and the second audio signal. The processor is further configured to determine an effective mismatch value based on the first mismatch value and the second mismatch value. The processor is also configured to generate at least one encoded signal having a bit allocation. The bit allocation is at least partially based on the effective mismatch value. The transmitter configured to transmit the at least one encoded signal to a second device.

AUDIO PROCESSING FOR TEMPORALLY MISMATCHED SIGNALS

A device includes a processor and a transmitter. The processor is configured to determine a first mismatch value indicative of a first amount of a temporal mismatch between a first audio signal and a second audio signal. The processor is also configured to determine a second mismatch value indicative of a second amount of a temporal mismatch between the first audio signal and the second audio signal. The processor is further configured to determine an effective mismatch value based on the first mismatch value and the second mismatch value. The processor is also configured to generate at least one encoded signal having a bit allocation. The bit allocation is at least partially based on the effective mismatch value. The transmitter configured to transmit the at least one encoded signal to a second device.

Audio encoders, audio decoders, systems, methods and computer programs using an increased temporal resolution in temporal proximity of onsets or offsets of fricatives or affricates

An audio encoder for providing an encoded audio information on the basis of an input audio information has a bandwidth extension information provider configured to provide bandwidth extension information using a variable temporal resolution and a detector configured to detect an onset of a fricative or affricate. The audio encoder is configured to adjust a temporal resolution used by the bandwidth extension information provider such that bandwidth extension information is provided with an increased temporal resolution at least for a predetermined period of time before a time at which an onset of a fricative or affricate is detected and for a predetermined period of time following the time at which the onset of the fricative or affricate is detected. Alternatively or in addition, the bandwidth extension information is provided with an increased temporal resolution in response to a detection of an offset of a fricative or affricate. Audio encoders and methods use a corresponding concept.

Audio Coding Method and Apparatus
20210390968 · 2021-12-16 ·

A method comprises determining a first modification weight according to linear spectral frequency (LSF) differences of the current frame and LSF differences of a previous frame of the current frame when a signal characteristic of the current frame meets a preset modification condition, modifying the linear predictive parameter of the current frame according to the determined first modification weight, and coding the current frame according to the modified linear predictive parameter.

Audio Coding Method and Apparatus
20210390968 · 2021-12-16 ·

A method comprises determining a first modification weight according to linear spectral frequency (LSF) differences of the current frame and LSF differences of a previous frame of the current frame when a signal characteristic of the current frame meets a preset modification condition, modifying the linear predictive parameter of the current frame according to the determined first modification weight, and coding the current frame according to the modified linear predictive parameter.

Apparatus for post-processing an audio signal using a transient location detection

Apparatus for post-processing an audio signal, including: a converter for converting the audio signal into a time-frequency representation; a transient location estimator for estimating a location in time of a transient portion using the audio signal or the time-frequency representation; and a signal manipulator for manipulating the time-frequency representation, wherein the signal manipulator is configured to reduce or eliminate a pre-echo in the time-frequency representation at a location in time before the transient location or to perform a shaping of the time-frequency representation at the transient location to amplify an attack of the transient portion.

Apparatus for post-processing an audio signal using a transient location detection

Apparatus for post-processing an audio signal, including: a converter for converting the audio signal into a time-frequency representation; a transient location estimator for estimating a location in time of a transient portion using the audio signal or the time-frequency representation; and a signal manipulator for manipulating the time-frequency representation, wherein the signal manipulator is configured to reduce or eliminate a pre-echo in the time-frequency representation at a location in time before the transient location or to perform a shaping of the time-frequency representation at the transient location to amplify an attack of the transient portion.

METHODS AND DEVICES FOR DETECTING AN ATTACK IN A SOUND SIGNAL TO BE CODED AND FOR CODING THE DETECTED ATTACK
20220180884 · 2022-06-09 · ·

A method and device for detecting an attack in a sound signal to be coded wherein the sound signal is processed in successive frames each including a number of sub-frames. The device comprises a first-stage attack detector for detecting the attack in a last sub-frame of a current frame, and a second-stage attack detector for detecting the attack in one of the sub-frames of the current frame, including the sub-frames preceding the last sub-frame. No attack is detected when the current frame is not an active frame previously classified to be coded using a generic coding mode. A method and device for coding an attack in a sound signal are also provided. The coding device comprises the above mentioned attack detecting device and an encoder of the sub-frame comprising the detected attack using a transition coding mode using a glottal-shape codebook populated with glottal impulse shapes.

Cross product enhanced subband block based harmonic transposition
11355133 · 2022-06-07 · ·

The invention provides an efficient implementation of cross-product enhanced high-frequency reconstruction (HFR), wherein a new component at frequency QΩ+rΩ.sub.0 is generated on the basis of existing components at Ω and Ω+Ω.sub.0. The invention provides a block-based harmonic transposition, wherein a time block of complex subband samples is processed with a common phase modification. Superposition of several modified samples has the net effect of limiting undesirable intermodulation products, thereby enabling a coarser frequency resolution and/or lower degree of oversampling to be used. In one embodiment, the invention further includes a window function suitable for use with block-based cross-product enhanced HFR. A hardware embodiment of the invention may include an analysis filter bank, a subband processing unit configurable by control data and a synthesis filter bank.

Cross product enhanced subband block based harmonic transposition
11355133 · 2022-06-07 · ·

The invention provides an efficient implementation of cross-product enhanced high-frequency reconstruction (HFR), wherein a new component at frequency QΩ+rΩ.sub.0 is generated on the basis of existing components at Ω and Ω+Ω.sub.0. The invention provides a block-based harmonic transposition, wherein a time block of complex subband samples is processed with a common phase modification. Superposition of several modified samples has the net effect of limiting undesirable intermodulation products, thereby enabling a coarser frequency resolution and/or lower degree of oversampling to be used. In one embodiment, the invention further includes a window function suitable for use with block-based cross-product enhanced HFR. A hardware embodiment of the invention may include an analysis filter bank, a subband processing unit configurable by control data and a synthesis filter bank.