Patent classifications
G10L19/083
Spatial sound reproduction using multichannel loudspeaker systems
An apparatus for spatial audio signal decoding associated with a plurality of speaker nodes (201, 203, 205, 207, 209) placed within a three dimensional space, the apparatus comprising at least one processor and at least one memory including a computer program code. The at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to determine a non-overlapping virtual surface arrangement (400), the virtual surface arrangement (400) comprising a plurality of virtual surfaces (421, 423, 431, 433) with corners positioned at three or more speaker nodes of the plurality of speaker nodes (201, 203, 205, 207, 209) and sides connecting pairs of corners configured to be non-intersecting with at least one defined virtual plane within the three dimensional space. The apparatus is further caused to generate gains for the speaker nodes based on the determined virtual surface arrangement and to apply the gains to at least one audio signal that is to be positioned within the three dimensional space.
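The abstract does not say how the gains are derived from a virtual surface. As a minimal sketch, assuming a triangular surface and vector base amplitude panning (a commonly used technique, not necessarily the one claimed), the gains for one triangle can be computed by expressing the source direction as a weighted sum of the three speaker directions. The function names and the azimuth/elevation convention below are illustrative assumptions.

```python
import math

def unit_vector(azimuth_deg, elevation_deg):
    """Direction as a unit vector (assumed convention: x front, y left, z up)."""
    az = math.radians(azimuth_deg)
    el = math.radians(elevation_deg)
    return (math.cos(el) * math.cos(az), math.cos(el) * math.sin(az), math.sin(el))

def det3(a, b, c):
    """Determinant of the 3x3 matrix whose columns are a, b and c."""
    return (a[0] * (b[1] * c[2] - b[2] * c[1])
            - b[0] * (a[1] * c[2] - a[2] * c[1])
            + c[0] * (a[1] * b[2] - a[2] * b[1]))

def triangle_gains(source, speakers):
    """Solve g1*l1 + g2*l2 + g3*l3 = source by Cramer's rule.

    Returns energy-normalized gains, or None when the source direction
    falls outside the triangle (at least one gain would be negative).
    """
    l1, l2, l3 = speakers
    d = det3(l1, l2, l3)
    if abs(d) < 1e-12:
        raise ValueError("speaker directions do not span three dimensions")
    raw = (det3(source, l2, l3) / d,
           det3(l1, source, l3) / d,
           det3(l1, l2, source) / d)
    if min(raw) < -1e-9:
        return None
    norm = math.sqrt(sum(g * g for g in raw))
    return tuple(max(g, 0.0) / norm for g in raw)

# Hypothetical triangle: two front speakers and one elevated speaker.
TRIANGLE = (unit_vector(30, 0), unit_vector(-30, 0), unit_vector(0, 45))
```

In a full decoder, a routine like this would be run against each surface of the arrangement until one returns non-negative gains; the non-intersection constraint in the abstract governs which triangles exist, not how the gains inside a triangle are solved.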
Deep neural network based audio processing method, device and storage medium
A deep neural network based audio processing method is provided. The method includes: obtaining a deep neural network based speech extraction model; receiving an audio input object having a speech portion and a non-speech portion, wherein the audio input object includes one or more audio data frames each having a set of audio data samples sampled at a predetermined sampling interval and represented in time domain data format; obtaining a user audiogram and a set of user gain compensation coefficients associated with the user audiogram; and inputting the audio input object and the set of user gain compensation coefficients into the trained speech extraction model to obtain an audio output result represented in time domain data format outputted by the trained speech extraction model, wherein the non-speech portion of the audio input object is at least partially attenuated in or removed from the audio output result.
Post-Quantization Gain Correction in Audio Coding
A gain adjustment apparatus for use in decoding of audio that has been encoded with separate gain and shape representations includes an accuracy meter configured to estimate an accuracy measure of the shape representation, and to determine a gain correction based on the estimated accuracy measure. An envelope adjuster further included in the apparatus is configured to adjust the gain representation based on the determined gain correction.
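The data flow in this abstract (estimate how accurate the decoded shape is, derive a correction, adjust the gain) can be sketched as follows. The accuracy proxy (quantization pulses per coefficient) and the linear mapping with its `floor` and `knee` parameters are assumptions made purely for illustration; the abstract does not define either.

```python
import math

def accuracy_measure(shape):
    """Hypothetical accuracy proxy: integer quantization pulses per coefficient."""
    return sum(abs(p) for p in shape) / len(shape)

def gain_correction(accuracy, floor=0.7, knee=2.0):
    """Illustrative mapping: no correction at or above `knee`,
    progressively more attenuation (down to `floor`) for coarser shapes."""
    if accuracy >= knee:
        return 1.0
    return floor + (1.0 - floor) * (accuracy / knee)

def decode_band(gain, shape):
    """Adjust the decoded gain, then scale the unit-norm shape with it."""
    corrected = gain * gain_correction(accuracy_measure(shape))
    norm = math.sqrt(sum(p * p for p in shape)) or 1.0
    return [corrected * p / norm for p in shape]
```

The point of the sketch is the ordering: the correction is computed from the shape representation alone and applied to the gain representation, so no extra side information has to be transmitted.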
APPARATUS AND METHOD REALIZING IMPROVED CONCEPTS FOR TCX LTP
An apparatus for decoding an encoded audio signal to obtain a reconstructed audio signal is provided. The apparatus includes a receiving interface, a delay buffer, a sample selector and a sample processor for processing selected audio signal samples to obtain reconstructed audio signal samples of the reconstructed audio signal. The sample selector is configured to select, if a current frame is received by the receiving interface and the received frame is not corrupted, a plurality of selected audio signal samples from the audio signal samples stored in the delay buffer, depending on pitch lag information included in the current frame.
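A minimal sketch of the selection step, assuming the delay buffer is a list of past decoded samples with the newest sample last and the pitch lag is an integer number of samples. The frame layout (a dict with `pitch_lag` and `corrupted` keys) and the function names are hypothetical. The abstract does not describe what happens for a lost or corrupted frame, so that branch is left unimplemented here.

```python
def select_samples(delay_buffer, pitch_lag, count):
    """Pick `count` samples starting `pitch_lag` samples before the buffer end.

    If the pitch lag is shorter than `count`, the last pitch cycle is
    repeated periodically to fill the request.
    """
    if not 0 < pitch_lag <= len(delay_buffer):
        raise ValueError("pitch lag must lie within the delay buffer")
    cycle = delay_buffer[len(delay_buffer) - pitch_lag:]
    return [cycle[i % pitch_lag] for i in range(count)]

def select_for_frame(delay_buffer, frame, count):
    """Use the frame's own pitch lag only when the frame arrived intact."""
    if frame is None or frame.get("corrupted", False):
        return None  # not covered by the abstract; left to the caller
    return select_samples(delay_buffer, frame["pitch_lag"], count)
```

For example, with a buffer holding the samples 0 through 9 and a pitch lag of 4, a request for six samples yields `[6, 7, 8, 9, 6, 7]`.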
Spatially aware multiband compression system with priority
An audio signal is compressed in an audio coordinate system using gain factors applied in another audio coordinate system. A first component and a second component in a first audio coordinate system are generated from a third component and a fourth component of the audio signal in a second audio coordinate system. An amplitude threshold defining a level for each of the third component and the fourth component for applying compression is determined. A gain factor for the first component is generated using a compression ratio. The gain factor is applied to the first component when either the third component or the fourth component exceeds the amplitude threshold, to generate an adjusted first component. A first output channel and a second output channel in the second audio coordinate system are generated using the adjusted first component and the second component in the first audio coordinate system.
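One way to read this abstract is with left/right as the second coordinate system and mid/side as the first. The sketch below makes that assumption, works on a single sample pair in a single band, and computes the gain in the linear amplitude domain; the choice of coordinate systems, the gain formula and the function name are illustrative and not taken from the abstract.

```python
def compress_sample(left, right, threshold, ratio):
    """Detect the level in left/right, apply the gain to mid only."""
    # Second coordinate system (left/right) -> first (mid/side).
    mid = 0.5 * (left + right)
    side = 0.5 * (left - right)

    # The threshold is checked against the left/right components.
    peak = max(abs(left), abs(right))
    if peak > threshold:
        # Standard compressor curve: the excess over the threshold
        # is divided by the compression ratio.
        target = threshold + (peak - threshold) / ratio
        mid *= target / peak

    # Back to left/right using the adjusted mid and the untouched side.
    return mid + side, mid - side
```

Because only the mid component is scaled, the side component keeps its level, so the spatial width of the signal is affected less than it would be if both output channels were attenuated equally.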