Patent classifications
G10L19/08
Spatial Audio Representation and Rendering
An apparatus including circuitry configured to: obtain a spatial audio signal including at least one audio signal and spatial metadata associated with the at least one audio signal; obtain at least one data set related to binaural rendering; obtain at least one pre-defined data set related to binaural rendering; and generate a binaural audio signal based on a combination of at least part of the at least one data set and the at least one pre-defined data set, and the spatial audio signal.
AUDIO SIGNAL ENHANCEMENT METHOD AND APPARATUS, COMPUTER DEVICE, STORAGE MEDIUM AND COMPUTER PROGRAM PRODUCT
This application relates to an audio signal enhancement method, performed by a computer device. The method including decoding received speech packets sequentially to obtain a residual signal, long term filtering parameters and linear filtering parameters; filtering the residual signal to obtain an audio signal; extracting feature parameters from the audio signal, when the audio signal is a feedforward error correction frame signal; converting the audio signal into a filter speech excitation signal based on the linear filtering parameters; performing speech enhancement on the filter speech excitation signal according to the feature parameters, the long term filtering parameters and the linear filtering parameters to obtain an enhanced speech excitation signal; and performing speech synthesis to obtain an enhanced speech signal based on the enhanced speech excitation signal and the linear filtering parameters.
LOW-FREQUENCY EMPHASIS FOR LPC-BASED CODING IN FREQUENCY DOMAIN
The invention provides an audio encoder including a combination of a linear predictive coding filter having a plurality of linear predictive coding coefficients and a time-frequency converter, wherein the combination is configured to filter and to convert a frame of the audio signal into a frequency domain in order to output a spectrum based on the frame and on the linear predictive coding coefficients; a low frequency emphasizer configured to calculate a processed spectrum based on the spectrum, wherein spectral lines of the processed spectrum representing a lower frequency than a reference spectral line are emphasized; and a control device configured to control the calculation of the processed spectrum by the low frequency emphasizer depending on the linear predictive coding coefficients of the linear predictive coding filter.
Residual coding method of linear prediction coding coefficient based on collaborative quantization, and computing device for performing the method
Disclosed are a method for coding a residual signal of LPC coefficients based on collaborative quantization and a computing device for performing the method. The residual signal coding method includes: generating encoded LPC coefficients and LPC residual signals by performing LPC analysis and quantization on an input speech; Determining a predicted LPC residual signal by applying the LPC residual signal to cross module residual learning; Performing LPC synthesis using the coded LPC coefficients and the predicted LPC residual signal; It may include the step of determining an output speech that is a synthesized output according to a result of performing the LPC synthesis.
Residual coding method of linear prediction coding coefficient based on collaborative quantization, and computing device for performing the method
Disclosed are a method for coding a residual signal of LPC coefficients based on collaborative quantization and a computing device for performing the method. The residual signal coding method includes: generating encoded LPC coefficients and LPC residual signals by performing LPC analysis and quantization on an input speech; Determining a predicted LPC residual signal by applying the LPC residual signal to cross module residual learning; Performing LPC synthesis using the coded LPC coefficients and the predicted LPC residual signal; It may include the step of determining an output speech that is a synthesized output according to a result of performing the LPC synthesis.
APPARATUS AND METHOD FOR SELECTING ONE OF A FIRST ENCODING ALGORITHM AND A SECOND ENCODING ALGORITHM
An apparatus for selecting one of a first encoding algorithm having a first characteristic and a second encoding algorithm having a second characteristic for encoding a portion of an audio signal to obtain an encoded version of the portion of the audio signal has a first estimator for estimating a first quality measure for the portion of the audio signal, which is associated with the first encoding algorithm, without actually encoding and decoding the portion of the audio signal using the first encoding algorithm. A second estimator is provided for estimating a second quality measure for the portion of the audio signal, which is associated with the second encoding algorithm, without actually encoding and decoding the portion of the audio signal using the second encoding algorithm. The apparatus has a controller for selecting the first or second encoding algorithms based on a comparison between the first and second quality measures.
Audio Transcoding Method and Apparatus, Audio Transcoder, Device, and Storage Medium
Provided is an audio transcoding method, including: (301) performing entropy decoding on a first audio stream with a first bitrate, to obtain an audio feature parameter and an excitation signal of the first audio stream, the excitation signal being a quantized audio signal; (302) obtaining a time-domain audio signal corresponding to the excitation signal based on the audio feature parameter and the excitation signal; (303) re-quantizing the excitation signal and the audio feature parameter based on the time-domain audio signal and a target transcoding bitrate, to obtain a target excitation signal and a target audio feature parameter; and (304) performing entropy coding on the target audio feature parameter and the target excitation signal, to obtain a second audio stream with a second bitrate, the second bitrate being lower than the first bitrate.
Audio Transcoding Method and Apparatus, Audio Transcoder, Device, and Storage Medium
Provided is an audio transcoding method, including: (301) performing entropy decoding on a first audio stream with a first bitrate, to obtain an audio feature parameter and an excitation signal of the first audio stream, the excitation signal being a quantized audio signal; (302) obtaining a time-domain audio signal corresponding to the excitation signal based on the audio feature parameter and the excitation signal; (303) re-quantizing the excitation signal and the audio feature parameter based on the time-domain audio signal and a target transcoding bitrate, to obtain a target excitation signal and a target audio feature parameter; and (304) performing entropy coding on the target audio feature parameter and the target excitation signal, to obtain a second audio stream with a second bitrate, the second bitrate being lower than the first bitrate.
Apparatus and method for audio rendering employing a geometric distance definition
An apparatus for playing back an audio object associated with a position includes a distance calculator for calculating distances of the position to speakers or for reading the distances of the position to the speakers. The distance calculator is configured to take a solution with a smallest distance. The apparatus is configured to play back the audio object using the speaker corresponding to the solution.
Apparatus and method for audio rendering employing a geometric distance definition
An apparatus for playing back an audio object associated with a position includes a distance calculator for calculating distances of the position to speakers or for reading the distances of the position to the speakers. The distance calculator is configured to take a solution with a smallest distance. The apparatus is configured to play back the audio object using the speaker corresponding to the solution.