LOW COMPLEXITY HOWLING SUPPRESSION FOR PORTABLE KARAOKE
20230328434 · 2023-10-12
Inventors
Cpc classification
G10H1/366
PHYSICS
H04R2430/20
ELECTRICITY
G10H2210/005
PHYSICS
H04R3/02
ELECTRICITY
H04R2420/07
ELECTRICITY
G10H2250/055
PHYSICS
H04R2203/12
ELECTRICITY
International classification
H04R3/02
ELECTRICITY
Abstract
A low complexity howling suppression system and method for portable karaoke system are provided. In the howling suppression, at least one infinite impulse response (IIR) filters are introduced for estimating the acoustic feedback picked up by the microphone from the real environment, and thereby to cancel out the acoustic feedback from the microphone input signal.
Claims
1. A low complexity howling suppression method for a portable karaoke system, the method comprising the steps of: capturing, by at least one microphone, an input signal which comprises a source signal and an acoustic feedback; compressing and equalizing, in an electro-acoustic path, the input signal, and then feeding to an output signal after being amplified; playing back, by a loudspeaker, the output signal which is propagated through environment; and estimating, by at least one infinite impulse response (IIR) filter, the acoustic feedback, and thereby to cancel out the acoustic feedback from the input signal.
2. The method of claim 1, wherein the acoustic feedback comprises howling generated in a closed signal loop caused by acoustic coupling between the loudspeaker and the at least one microphone.
3. The method of claim 1, further comprising the step of modeling, by the at least one IIR filter, an environmental transfer function.
4. The method of claim 3, wherein relative position of the at least one microphone and the loudspeaker is fixed.
5. The method of claim 3, wherein modeling the environmental transfer function comprises measuring and estimating the environmental transfer function offline and approximating the environmental transfer function.
6. The method of claim 5, wherein approximating the environmental transfer function can be performed by multi-band IIR filters.
7. The method of claim 3, further comprises decorrelating the output signal from the input signal.
8. The method of claim 7, wherein decorrelating the output signal from the input signal can be implemented by frequency shifting the input signal.
9. The method of claim 1, further comprising the step of arranging the at least one microphone in different directivities as a microphone array for beamforming.
10. The method of claim 9, wherein the microphone array forms a cardioid directivity pattern.
11. A portable karaoke system with low complexity howling suppression, the system comprising: at least one microphone for capturing an input signal which comprises a source signal and an acoustic feedback which is propagated through environment; an electro-acoustic path for compressing and equalizing the input signal, and then feeding to an output signal after amplified; a loudspeaker for playback the output signal; and at least one infinite impulse response (IIR) filter for estimating the acoustic feedback, and thereby to cancel out the acoustic feedback from the input signal.
12. The system of claim 11, wherein the acoustic feedback comprises howling generated in a closed signal loop caused by acoustic coupling between the loudspeaker and the at least one microphone.
13. The system of claim 11, wherein the at least one IIR filter further models an environmental transfer function.
14. The system of claim 13, wherein relative position of the at least one microphone and the loudspeaker is fixed.
15. The system of claim 13, wherein the at least one IIR filter further measures and estimates the environmental transfer function offline and approximates the environmental transfer function.
16. The system of claim 15, wherein the environmental transfer function can be approximated by multi-band IIR filters.
17. The system of claim 13, further comprising a decorrelator for decorrelating the output signal from the input signal.
18. The system of claim 17, wherein decorrelating the output signal from the input signal can be implemented by frequency shifting the input signal.
19. The system of claim 11, wherein the at least one microphone includes microphones that are further arranged in different directivities as a microphone array for beamforming.
20. The system of claim 19, wherein the microphone array forms a cardioid directivity pattern.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0008] The present invention may be better understood from reading the following description of non-limiting embodiments, with reference to the attached drawings. In the figures, like reference numeral designates corresponding parts, wherein below:
[0009]
[0010]
[0011]
[0012]
[0013]
DETAILED DESCRIPTION OF THE INVENTION
[0014] The detailed description of one or more embodiments of the present invention is disclosed hereinafter; however, it is understood that the disclosed embodiments are merely exemplary of the invention that may be embodied in various and alternative forms. The figures are not necessarily to scale; some features may be exaggerated or minimized to show details of particular components. Therefore, specific structural and function details disclosed herein are not to be interpreted as limiting, but merely as a representative basis for teaching one skilled in the art to variously employ the present invention.
[0015]
[0016] When the sound signal is picked up by the microphone 110 and subsequently amplified and fed to the loudspeaker 140, the audio sound playback by the loudspeaker 140 can be taken back by the microphone 110 through direct or reflection paths. The acoustic coupling between the loudspeaker 140 and the microphone 110 results in the output signal Y(z) propagated through the environmental transfer function F(z) to form an acoustic feedback, and then also picked up into the microphone 110, and thus the input signal X(z) as shown in
[0017] Therefore, with all the signals and transfer functions denoted in the frequency domain, the process in
Y(z)=X(z).Math.G(z).Math.K (1)
X(z)=Y(z).Math.F(z)+S(z) (2)
[0018] Regarding the above equation (2), it is conceivable that the input signal X(z) into the karaoke system may further include an Audio Stream, such as accompaniment music of a song, input by way of Bluetooth or AUX interface, for example, which has been omitted herein.
[0019] And then we can compute the total transfer function H(z) from the source signal S(z) to the output signal Y(z):
where the term F(z).Math.G(z).Math.K refers to the loop response of the system, and the magnitude and phase responses of it denote the loop gain and loop phase, respectively. Therefore, the howling will occur when the system becomes unstable, which is summarized in the Nyquist stability criterion,
F(z).Math.G(z).Math.K≥1, ∠F(z)G(z)=n2π (4)
[0020] In the past several decades, many methods on howling suppression have been discussed. Certainly, the first precaution to be considered is optimizing the whole karaoke system, such as the directivities of the loudspeaker and the microphone, the distance between them, the overall gain of the system and the amplitude of some potentially problematic frequencies. However, the optimization is usually limited, especially in the portable all-in-one karaoke system, because its form factor and sound performance usually have certain requirements—as small size but as high sound level as possible. For these limited scenarios, the processes must be automated or other measures needs to be taken to avoid howling feedback. Therefore, the present invention provides a portable all-in-one karaoke system with low complexity howling suppression method.
[0021] To better suppress the howling sound in the portable all-in-one karaoke machine, and reduce the processing computation, the power consumption, and the system latency, the present invention provides a low-complexity howling suppression method for portable karaoke system, in which filters are used to cancel unwanted components from the microphone signal.
[0022]
X(z)−X.sub.est(z)=S(z)+F(z)Y(z)−F.sub.est(z)Y(z)≈S(z) (5)
[0023] Using an adaptive filter to estimate the environmental transfer function can achieve good effect of suppressing howling. In practice, however, the howling suppression with the adaptive filter estimating the environmental transfer function still exists some problems. Firstly, the latency may not meet the requirement of this small karaoke machine. Since the adaptive algorithm might need long processing time, while the loudspeaker is very close to the microphone, so that the sound propagation time is probably smaller than the processing time, and thus the algorithm becomes ineffective. Secondly, the adaptive algorithm might also consume high power and the battery will drain quickly, which is the obvious defect of a portable device. Thirdly, the adaptive algorithm sometimes does not converge smoothly, resulting in the significant difference of the adaptive filters, which will constantly affect the timbre of the user's singing. Furthermore, there is high correlation between the loudspeaker and microphone in the karaoke system, which renders this structure to perform poorly in this scenario.
[0024] Therefore, in the example system, several second-order Infinite Impulse Response (IIR) filters 260′ are used to model the transfer function F(z), as shown in
[0025] Besides, the decorrelation 215 can be further introduced into the example system 200 to reduce the correlation between the loudspeaker and microphone signals. In the model as shown in
x.sub.shift(t)=x(t)cos(2πΔft)−{circumflex over (x)}(t)sin(2πΔft) (6)
where Δf is the shifted frequency, and {circumflex over (x)}(t) is the Hilbert transform of the original signal x(t).
[0026] The full advantage of the frequency shifting can be taken into the present method. In this proposed model with several IIR filters modeling the environmental transfer function, there is no need the adaptive processing due to the latency problem and high-power consumption. Moreover, since the acoustic feedback problem occurs because the output signal of the loudspeaker is returned to the input microphone through the part of acoustic coupling in the air, the frequency shift is used to decorrelate the reference signal and the error signal, which can be used to mitigate biased filter estimation.
[0027]
[0028] In Step 310, the input signal is provided into the portable karaoke system, which comprises the source signal such as the user's singing voice, and the acoustic feedback. This part of the input signal captured by the at least one microphone; Moreover, the input signal further comprises the audio stream such as the accompanying music of songs, this part of the input signal may be upload to the karaoke system in a wired or wireless way, such as by Bluetooth or via the AUX interface.
[0029] In Step 315, the decorrelation is introduced into the provided system to decorrelate the input signal. In this step, the loudspeaker signal of the loudspeaker is decorrelated from the input signal by frequency shifting the input signal.
[0030] In Step 320, then the frequency shifted input signal is processed in the electro-acoustic path, including compression and equalization, and then amplified by the gain factor K, to get the output signal.
[0031] In Step 340, the output signal is fed to the loudspeaker for playback and propagated through the environment.
[0032] In this step, the output signal playback from the loudspeaker, after propagated in the environment, is taken back by the microphone through a direct or some reflection paths as acoustic feedback, the acoustic feedback enters the microphone as the other part of the input signal received by the at least one microphone, as mentioned above in Step 310.
[0033] Next, in Step 360, in order to cancel out unwanted components from the microphone signal, several IIR filters are used to model the environmental transfer function. This step comprises measuring and estimating the environmental transfer function offline, and approximates the function by such as multi-band IIR filters. The resulting estimated signal is approximately equal to the acoustic feedback part of the input signal entering the microphone in step 350. Therefore, by subtracting the estimated acoustic feedback from the input signals captured by the microphone, the acoustic feedback that may produce howling in the system can be cancelled out.
[0034]
[0035] In a way of example, two microphones with different directivities as a microphone array 410 are used to form a cardioid directivity pattern as shown in
[0036] The beamforming output X.sub.beam(z) is written as:
X.sub.beam(Z)=Σ.sub.kW.sub.k(z).Math.X.sub.k(Z) (7)
where W.sub.k(z) and X.sub.k(z) are the kth beamforming filter and the kth microphone input signal, respectively.
[0037] Moreover, In the actual portable karaoke products, the microphones 410 can be wrapped with the sound-absorbing cotton 480 to further reduce the sound energy from the loudspeaker 440. We can see that the passive radiator 470 is alternatively used in this example machine, as shown schematically in
[0038]
[0039] For a portable karaoke machine, we always want to have a long play time but still expect a good sound quality, such as high volume and less howling problem. In this invention, the provided low complexity howling suppression method adopts the IIR filter structure to reduce the power consumption and the system latency. To further suppress the howling, nonlinear algorithms, for example frequency shifting, are also combined with the microphone beamforming method.
[0040] The low complexity howling suppression method and system provided in the present invention are suitable for those applications in which a system containing both loudspeaker and microphone provided that their relative positions are fixed, and the loudspeaker plays the input signal of the microphone in real time. The example applications comprise such as but not limited to portable karaoke machines, integrated speakers, and conference systems, etc.
[0041] As used in this application, an element or step recited in the singular and proceeded with the word “a” or “an” should be understood as not excluding plural of said elements or steps, unless such exclusion is stated. Furthermore, references to “one embodiment” or “one example” of the present disclosure are not intended to be interpreted as excluding the existence of additional embodiments that also incorporate the recited features. The terms “first,” “second,” and “third,” etc. are used merely as labels, and are not intended to impose numerical requirements or a particular positional order on their objects.
[0042] While exemplary embodiments are described above, it is not intended that these embodiments describe all possible forms of the invention. Rather, the words used in the specification are words of description rather than limitation, and it is understood that various changes may be made without departing from the spirit and scope of the invention. Additionally, the features of various implementing embodiments may be combined to form further embodiments of the invention.