Patent classifications
G10L19/005
ERROR CORRECTION OVERWRITE FOR AUDIO ARTIFACT REDUCTION
Audio communication methods, devices, and systems, are provided with error correction overwrite for audio artifact reduction. One illustrative low-latency audio streaming method includes: receiving packets of digital audio data; applying an error correction code decoder to obtain a data stream that includes error-corrected data samples; providing a correction-limited data stream by replacing any of the error-corrected data samples that are outliers; and converting the correction-limited data stream into an audio signal.
ERROR CORRECTION OVERWRITE FOR AUDIO ARTIFACT REDUCTION
Audio communication methods, devices, and systems, are provided with error correction overwrite for audio artifact reduction. One illustrative low-latency audio streaming method includes: receiving packets of digital audio data; applying an error correction code decoder to obtain a data stream that includes error-corrected data samples; providing a correction-limited data stream by replacing any of the error-corrected data samples that are outliers; and converting the correction-limited data stream into an audio signal.
TRANSMISSION ERROR ROBUST ADPCM COMPRESSOR WITH ENHANCED RESPONSE
Audio streaming devices, systems, and methods may employ adaptive differential pulse code modulation (ADPCM) techniques providing for optimum performance even while ensuring robustness against transmission errors. One illustrative device includes: a difference element that produces a sequence of prediction error values by subtracting predicted values from audio samples; a scaling element that produces scaled error values by dividing each prediction error by a corresponding envelope estimate; a quantizer that operates on the scaled error values to produce quantized error values; a multiplier that uses the corresponding envelope estimates to produce reconstructed error values; a predictor that produces the next audio sample values based on the reconstructed error values; and an envelope estimator. The envelope estimator includes: an updater that applies a dynamic gain to the reconstructed error values to produce update values; and an integrator that combines each of the update values with the corresponding envelope estimate to produce a subsequent envelope estimate.
TRANSMISSION ERROR ROBUST ADPCM COMPRESSOR WITH ENHANCED RESPONSE
Audio streaming devices, systems, and methods may employ adaptive differential pulse code modulation (ADPCM) techniques providing for optimum performance even while ensuring robustness against transmission errors. One illustrative device includes: a difference element that produces a sequence of prediction error values by subtracting predicted values from audio samples; a scaling element that produces scaled error values by dividing each prediction error by a corresponding envelope estimate; a quantizer that operates on the scaled error values to produce quantized error values; a multiplier that uses the corresponding envelope estimates to produce reconstructed error values; a predictor that produces the next audio sample values based on the reconstructed error values; and an envelope estimator. The envelope estimator includes: an updater that applies a dynamic gain to the reconstructed error values to produce update values; and an integrator that combines each of the update values with the corresponding envelope estimate to produce a subsequent envelope estimate.
Voice data transmission with adaptive redundancy
Voice data transmission with adaptive redundancy creates a voice data packet by packetizing the voice data payload and a number of redundant payloads selected from a set of previous voice data payloads. The voice data from the voice data payload is analysed to determine whether it is a critical or non-critical payload by classifying the received voice data as voiced or unvoiced. If at least a portion of the voice data is classified as unvoiced, the voice data payload is determined to be a critical payload. If it is a critical payload, then the voice data payload is added to the set of previous voice data payloads for inclusion as a redundant payload in subsequent voice data packets. The voice data packet is then forwarded for transmission over the network.
Voice data transmission with adaptive redundancy
Voice data transmission with adaptive redundancy creates a voice data packet by packetizing the voice data payload and a number of redundant payloads selected from a set of previous voice data payloads. The voice data from the voice data payload is analysed to determine whether it is a critical or non-critical payload by classifying the received voice data as voiced or unvoiced. If at least a portion of the voice data is classified as unvoiced, the voice data payload is determined to be a critical payload. If it is a critical payload, then the voice data payload is added to the set of previous voice data payloads for inclusion as a redundant payload in subsequent voice data packets. The voice data packet is then forwarded for transmission over the network.
AUDIO SIGNAL ENHANCEMENT METHOD AND APPARATUS, COMPUTER DEVICE, STORAGE MEDIUM AND COMPUTER PROGRAM PRODUCT
This application relates to an audio signal enhancement method, performed by a computer device. The method including decoding received speech packets sequentially to obtain a residual signal, long term filtering parameters and linear filtering parameters; filtering the residual signal to obtain an audio signal; extracting feature parameters from the audio signal, when the audio signal is a feedforward error correction frame signal; converting the audio signal into a filter speech excitation signal based on the linear filtering parameters; performing speech enhancement on the filter speech excitation signal according to the feature parameters, the long term filtering parameters and the linear filtering parameters to obtain an enhanced speech excitation signal; and performing speech synthesis to obtain an enhanced speech signal based on the enhanced speech excitation signal and the linear filtering parameters.
Method and device for decoding signal
An audio signal decoding device includes a non-transitory memory storage stores audio data in a form of a bitstream; and an audio decoder, by which a first spectral coefficient of a first sub-band of a current frame of an audio signal by decoding the bitstream is obtained; a first average quantity of allocated bits per spectral coefficient of the first sub-band is obtained; a first noise filling gain for the first sub-band is obtained when the first average quantity is less than a threshold; a second spectral coefficient is reconstructed according to the first noise filling gain; a frequency domain audio signal is obtained according to the first spectral coefficient and the second spectral coefficient; and a time domain audio signal is generated according to the frequency domain signal.
Method and device for decoding signal
An audio signal decoding device includes a non-transitory memory storage stores audio data in a form of a bitstream; and an audio decoder, by which a first spectral coefficient of a first sub-band of a current frame of an audio signal by decoding the bitstream is obtained; a first average quantity of allocated bits per spectral coefficient of the first sub-band is obtained; a first noise filling gain for the first sub-band is obtained when the first average quantity is less than a threshold; a second spectral coefficient is reconstructed according to the first noise filling gain; a frequency domain audio signal is obtained according to the first spectral coefficient and the second spectral coefficient; and a time domain audio signal is generated according to the frequency domain signal.
METHODS AND APPARATUSES FOR DTX HANGOVER IN AUDIO CODING
Transmitting node and receiving node for audio coding and methods therein. The nodes being operable to encode/decode speech and to apply a discontinuous transmission (DTX) scheme comprising transmission/reception of Silence Insertion Descriptor (SID) frames during speech inactivity. The method in the transmitting node comprising determining, from amongst a number N of hangover frames, a set Y of frames being representative of background noise, and further transmitting the N hangover frames, comprising at least said set Y of frames, to the receiving node. The method further comprises transmitting a first SID frame to the receiving node in association with the transmission of the N hangover frames, where the SID frame comprises information indicating the determined set Y of hangover frames to the receiving node. The method enables the receiving node to generate comfort noise based on the hangover frames most adequate for the purpose.