METHOD AND APPARATUS FOR PROCESSING AN AUDIO SIGNAL, AUDIO DECODER, AND AUDIO ENCODER
20220148609 · 2022-05-12
Inventors
- Emmanuel Ravelli (Erlangen, DE)
- Manuel Jander (Hemhofen, DE)
- Grzegorz PIETRZYK (Nuernberg, DE)
- Martin DIETZ (Nuernberg, DE)
- Marc Gayer (Erlangen, DE)
Cpc classification
G10L19/03
PHYSICS
G10L19/20
PHYSICS
G10L19/06
PHYSICS
G10L19/12
PHYSICS
H04B1/1027
ELECTRICITY
G10L19/005
PHYSICS
International classification
G10L19/022
PHYSICS
G10L19/12
PHYSICS
Abstract
A method is described that processes an audio signal. A discontinuity between a filtered past frame and a filtered current frame of the audio signal is removed using linear predictive filtering.
Claims
1. A method for processing an audio signal, the method comprising: removing a discontinuity between a filtered past frame and a filtered current frame of the audio signal using linear predictive filtering.
2. The method of claim 1, comprising filtering the current frame of the audio signal and removing the discontinuity by modifying a beginning portion of the filtered current frame by a signal acquired by linear predictive filtering a predefined signal with initial states of the linear predictive filter defined on the basis of a last part of the past frame.
3. The method of claim 2, wherein the initial states of the linear predictive filter are defined on the basis of a last part of the unfiltered past frame filtered using the set of filter parameters for filtering the current frame.
4. The method of claim 1, further comprising estimating the linear predictive filter on the filtered or non-filtered audio signal.
5. The method of claim 4, wherein estimating the linear predictive filter comprises estimating the filter based on the past and/or current frame of the audio signal or based on the past filtered frame of the audio signal using the Levinson-Durbin algorithm.
6. The method of claim 1, wherein the linear predictive filter comprises a linear predictive filter of an audio codec.
7. The method of claim 1, wherein removing the discontinuity comprises processing the beginning portion of the filtered current frame, wherein the beginning portion of the current frame comprises a predefined number of samples being less or equal than the total number of samples in the current frame, and wherein processing the beginning portion of the current frame comprises subtracting a beginning portion of a zero-input-response (ZIR) from the beginning portion of the filtered current frame.
8. The method of claim 7, comprising filtering the current frame of the audio signal using a non-recursive filter, like a FIR filter, for producing the filtered current frame.
9. The method of claim 7, comprising processing the unfiltered current frame of the audio signal on a sample-by-sample basis using a recursive filter, like an IIR filter, and wherein processing a sample of the beginning portion of the current frame comprises: filtering the sample with the recursive filter using the filter parameters of the current frame for producing a filtered sample, and subtracting a corresponding ZIR sample from the filtered sample for producing the corresponding sample of the filtered current frame.
10. The method of claim 9, wherein filtering and subtracting are repeated until the last sample in the beginning portion of the current frame is processed, and wherein the method further comprises filtering the remaining samples in the current frame with the recursive filter using the filter parameters of the current frame.
11. The method of claim 7, comprising generating the ZIR, wherein generating the ZIR comprises: filtering the M last samples of the unfiltered past frame with the filter and the filter parameters used for filtering the current frame for producing a first portion of filtered signal, wherein M is the order of the linear predictive filter, subtracting from the first portion of filtered signal the M last samples of the filtered past frame, filtered using the filter parameters of the past frame, for generating a second portion of filtered signal, and generating a ZIR of a linear predictive filter by filtering a frame of zero samples with the linear predictive filter and initial states equal to the second portion of filtered signal.
12. The method of claim 11, comprising windowing the ZIR such that its amplitude decreases faster to zero.
13. A non-transitory digital storage medium having stored thereon a computer program product for performing a method for processing an audio signal, the method comprising: removing a discontinuity between a filtered past frame and a filtered current frame of the audio signal using linear predictive filtering, when said computer program is run by a computer.
14. An apparatus for processing an audio signal, wherein the apparatus comprises a processor for removing a discontinuity between a filtered past frame and a filtered current frame of the audio signal using linear predictive filtering, or wherein the apparatus is configured to operate according to a method for processing an audio signal, the method comprising removing a discontinuity between a filtered past frame and a filtered current frame of the audio signal using linear predictive filtering.
15. An audio decoder, comprising an apparatus of claim 14.
16. An audio encoder, comprising an apparatus of claim 14.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0036] In the following, embodiments of the present invention will be described with reference to the accompanying drawings, in which:
[0037]
[0038]
[0039]
[0040]
[0041]
[0042]
[0043]
[0044]
DETAILED DESCRIPTION OF THE INVENTION
[0045] In the following, embodiments of the inventive approach will be described in further detail and it is noted that in the accompanying drawing elements having the same or similar functionality are denoted by the same reference signs.
[0046]
[0047] The system of
[0048] In the following, embodiments of the inventive approach that may be implemented in at least one of the encoding processor 206 and the decoding processor 256 will be described in further detail.
with M the filter order and α.sub.m the filter coefficients (with α.sub.0=1). This kind of filter is also known as Linear Predictive Coding (LPC). In accordance with embodiments the filtered current frame is processed by applying linear predictive filtering to at least a part of the filtered current frame. The discontinuity may be removed by modifying a beginning portion of the filtered current frame by a signal obtained by linear predictive filtering a predefined signal with initial states of the linear predictive coding filter defined on the basis of a last part of the past frame. The initial states of the linear predictive coding filter may be defined on the basis of a last part of the past frame filtered using the set of filter parameters for the current frame. The inventive approach is advantageous as it does not require filtering the current frame of an audio signal with a filter coefficient that is used for the past frame and thereby avoids problems that arise due to the mismatch of the filter parameters for the current frame and for the past frame as they are experienced in the known approaches described above with reference to
[0049]
[0050] The functionality of the processing blocks 110 and 112 will now be described in further detail.
[0051] The ZIR, as described above with regard to
[0052] In accordance with another embodiment, the linear filer H(z) is a non-recursive filter, like a FIR filter, and the ZIR, as described above with regard to
[0053] The inventive approach may be applied in situations as described above when the audio signal is filtered. In accordance with embodiments, the inventive approach may also be applied at the decoder side, for example, when using an audio codec postfilter for reducing the level of coding noise between signal harmonics. For processing the audio frames at the decoder the postfilter, in accordance with an embodiment, may be as follows:
where B(z) and A(z) are two FIR filters and the H(z) filter parameters are the coefficients of the FIR filters B(z) and A(z), and T indicates the pitch lag. In such a scenario, the filter may also introduce a discontinuity between the two filtered frames, for example when the past filter frame parameters c.sub.0 are different from the current frame filter parameters c.sub.1, and such a discontinuity may produce an artifact in the filtered audio signal 104, for example a “click”. This discontinuity is removed by processing the filtered current frame as described above in detail.
[0054] Although some aspects of the described concept have been described in the context of an apparatus, it is clear that these aspects also represent a description of the corresponding method, where a block or device corresponds to a method step or a feature of a method step. Analogously, aspects described in the context of a method step also represent a description of a corresponding block or item or feature of a corresponding apparatus.
[0055] Depending on certain implementation requirements, embodiments of the invention can be implemented in hardware or in software. The implementation can be performed using a digital storage medium, for example a floppy disk, a DVD, a Blue-Ray, a CD, a ROM, a PROM, an EPROM, an EEPROM or a FLASH memory, having electronically readable control signals stored thereon, which cooperate (or are capable of cooperating) with a programmable computer system such that the respective method is performed. Therefore, the digital storage medium may be computer readable.
[0056] Some embodiments according to the invention comprise a data carrier having electronically readable control signals, which are capable of cooperating with a programmable computer system, such that one of the methods described herein is performed.
[0057] Generally, embodiments of the present invention can be implemented as a computer program product with a program code, the program code being operative for performing one of the methods when the computer program product runs on a computer. The program code may for example be stored on a machine readable carrier.
[0058] Other embodiments comprise the computer program for performing one of the methods described herein, stored on a machine readable carrier.
[0059] In other words, an embodiment of the inventive method is, therefore, a computer program having a program code for performing one of the methods described herein, when the computer program runs on a computer.
[0060] A further embodiment of the inventive methods is, therefore, a data carrier (or a digital storage medium, or a computer-readable medium) comprising, recorded thereon, the computer program for performing one of the methods described herein.
[0061] A further embodiment of the inventive method is, therefore, a data stream or a sequence of signals representing the computer program for performing one of the methods described herein. The data stream or the sequence of signals may for example be configured to be transferred via a data communication connection, for example via the Internet.
[0062] A further embodiment comprises a processing means, for example a computer, or a programmable logic device, configured to or adapted to perform one of the methods described herein.
[0063] A further embodiment comprises a computer having installed thereon the computer program for performing one of the methods described herein.
[0064] In some embodiments, a programmable logic device (for example a field programmable gate array) may be used to perform some or all of the functionalities of the methods described herein. In some embodiments, a field programmable gate array may cooperate with a microprocessor in order to perform one of the methods described herein. Generally, the methods may be performed by any hardware apparatus.
[0065] While this invention has been described in terms of several embodiments, there are alterations, permutations, and equivalents which will be apparent to others skilled in the art and which fall within the scope of this invention. It should also be noted that there are many alternative ways of implementing the methods and compositions of the present invention. It is therefore intended that the following appended claims be interpreted as including all such alterations, permutations, and equivalents as fall within the true spirit and scope of the present invention.