G10L19/04

SPEECH ENHANCEMENT METHOD AND APPARATUS, DEVICE, AND STORAGE MEDIUM
20230097520 · 2023-03-30 ·

A speech enhancement method includes: performing pre-enhancement on a target speech frame according to a complex spectrum corresponding to the target speech frame, to obtain a first complex spectrum; performing speech decomposition on the target speech frame according to the first complex spectrum, to obtain a glottal parameter, a gain, and an excitation signal that correspond to the target speech frame; and performing synthesis according to the glottal parameter, the gain, and the excitation signal, to obtain an enhanced speech signal corresponding to the target speech frame.

AUDIO SIGNAL ENHANCEMENT METHOD AND APPARATUS, COMPUTER DEVICE, STORAGE MEDIUM AND COMPUTER PROGRAM PRODUCT
20230099343 · 2023-03-30 ·

This application relates to an audio signal enhancement method, performed by a computer device. The method including decoding received speech packets sequentially to obtain a residual signal, long term filtering parameters and linear filtering parameters; filtering the residual signal to obtain an audio signal; extracting feature parameters from the audio signal, when the audio signal is a feedforward error correction frame signal; converting the audio signal into a filter speech excitation signal based on the linear filtering parameters; performing speech enhancement on the filter speech excitation signal according to the feature parameters, the long term filtering parameters and the linear filtering parameters to obtain an enhanced speech excitation signal; and performing speech synthesis to obtain an enhanced speech signal based on the enhanced speech excitation signal and the linear filtering parameters.

AUDIO SIGNAL ENHANCEMENT METHOD AND APPARATUS, COMPUTER DEVICE, STORAGE MEDIUM AND COMPUTER PROGRAM PRODUCT
20230099343 · 2023-03-30 ·

This application relates to an audio signal enhancement method, performed by a computer device. The method including decoding received speech packets sequentially to obtain a residual signal, long term filtering parameters and linear filtering parameters; filtering the residual signal to obtain an audio signal; extracting feature parameters from the audio signal, when the audio signal is a feedforward error correction frame signal; converting the audio signal into a filter speech excitation signal based on the linear filtering parameters; performing speech enhancement on the filter speech excitation signal according to the feature parameters, the long term filtering parameters and the linear filtering parameters to obtain an enhanced speech excitation signal; and performing speech synthesis to obtain an enhanced speech signal based on the enhanced speech excitation signal and the linear filtering parameters.

Anti-causal filter for audio signal processing

An audio signal processor includes a digital filter block configured to receive an audio signal and output a first filtered audio signal, and a phase linearization block configured to receive the first filtered audio signal and output a second filtered audio signal with a more linear phase.

Anti-causal filter for audio signal processing

An audio signal processor includes a digital filter block configured to receive an audio signal and output a first filtered audio signal, and a phase linearization block configured to receive the first filtered audio signal and output a second filtered audio signal with a more linear phase.

SUPPORT FOR GENERATION OF COMFORT NOISE, AND GENERATION OF COMFORT NOISE

A method for generation of comfort noise for at least two audio channels. The method comprises determining a spatial coherence between audio signals on the respective audio channels, wherein at least one spatial coherence value per frame and frequency band is determined to form a vector of spatial coherence values. A vector of predicted spatial coherence values is formed by a weighted combination of a first coherence prediction and a second coherence prediction that are combined using a weight factor α. The method comprises signaling information about the weight factor α to the receiving node, for enabling the generation of the comfort noise for the at least two audio channels at the receiving node.

SUPPORT FOR GENERATION OF COMFORT NOISE, AND GENERATION OF COMFORT NOISE

A method for generation of comfort noise for at least two audio channels. The method comprises determining a spatial coherence between audio signals on the respective audio channels, wherein at least one spatial coherence value per frame and frequency band is determined to form a vector of spatial coherence values. A vector of predicted spatial coherence values is formed by a weighted combination of a first coherence prediction and a second coherence prediction that are combined using a weight factor α. The method comprises signaling information about the weight factor α to the receiving node, for enabling the generation of the comfort noise for the at least two audio channels at the receiving node.

Low bitrate audio encoding/decoding scheme having cascaded switches

An audio encoder has a first information sink oriented encoding branch such as a spectral domain encoding branch, a second information source or SNR oriented encoding branch such as an LPC-domain encoding branch, and a switch for switching between the first and second encoding branches, the second encoding branch having a converter into a specific domain different from the spectral domain such as an LPC analysis stage generating an excitation signal, and the second encoding branch having a specific domain coding branch such as LPC domain processing branch, and a specific spectral domain coding branch such as LPC spectral domain processing branch, and an additional switch for switching between the specific domain coding branch and the specific spectral domain coding branch. An audio decoder has a first domain decoder, a second domain decoder, and a third domain decoder as well as two cascaded switches for switching between the decoders.

Concept for coding mode switching compensation

A codec allowing for switching between different coding modes is improved by, responsive to a switching instance, performing temporal smoothing and/or blending at a respective transition.

Concept for coding mode switching compensation

A codec allowing for switching between different coding modes is improved by, responsive to a switching instance, performing temporal smoothing and/or blending at a respective transition.