Applying noise suppression to remote and local microphone signals
11527232 · 2022-12-13
Assignee
Inventors
Cpc classification
G10K11/17881
PHYSICS
H04R2227/009
ELECTRICITY
G10K11/17885
PHYSICS
H04R25/554
ELECTRICITY
International classification
Abstract
A remote microphone signal is obtained from a remote device, and a local microphone signal from a local device. A difference between strength of the remote microphone signal and strength of the local microphone signal is determined. An output audio signal is produced to drive a speaker in the local device. If the difference is greater than a threshold, then the local and remote microphone signals are applied to two input channels, respectively, of a two channel noise suppressor which produces the output audio signal, but if the difference is less than the threshold after a certain delay since the difference was greater than the threshold, then only the remote microphone signal is applied to a single input of a single channel noise suppressor which produces the output audio signal. Other aspects are also described and claimed.
Claims
1. A method for applying noise suppression in a local device using a remote microphone signal and a local microphone signal, the method comprising the following operations performed in a local device: obtaining a remote microphone signal from a remote device, and a local microphone signal from the local device; determining a difference between strength of the remote microphone signal and strength of the local microphone signal; and producing an output audio signal to drive a speaker in the local device, wherein the producing comprises i) if the difference is greater than a threshold, applying the local and remote microphone signals to two input channels, respectively, of a two channel noise suppressor that produces the output audio signal, and ii) if the difference is less than the threshold, applying only the remote microphone signal to a single input of a single channel noise suppressor which produces the output audio signal.
2. The method of claim 1 wherein if the difference is less than the threshold and an amount of time that has passed since the difference was above the threshold is within a certain delay, then the local and remote microphone signals are applied to the two input channels, respectively, of the two channel noise suppressor which produces the output audio signal.
3. The method of claim 2 wherein if the difference is less than the threshold and the amount of time that has passed since the difference was above the threshold is greater than the certain delay, then only the remote microphone signal is applied to the single input of the single channel noise suppressor.
4. The method of claim 1 wherein the local device is a head worn device worn by a user, and the remote device is not worn by the user.
5. The method of claim 4 wherein the local device is a headset, and the remote device is a smartphone or a tablet computer.
6. The method of claim 4 wherein obtaining the remote microphone signal comprises receiving the remote microphone signal from the remote device via a wireless communication link.
7. The method of claim 4 wherein at least one of the remote microphone signal and the local microphone is a beam formed signal.
8. The method of claim 4 wherein determining a difference between strength of the remote microphone signal and strength of the local microphone signal comprises computing a ratio of power of the remote microphone signal to power of the local microphone signal.
9. The method of claim 4 wherein the single channel noise suppressor estimates noise based on the single input channel, and the two channel noise suppressor estimates noise based on the two input channels.
10. The method of claim 4 wherein the threshold comprises an upper value and a lower value that is smaller than the upper value, the upper value being in a range of five to ten dB, and the lower value being in a range of zero to five dB.
11. A local device comprising a headset housing having therein: a speaker; a microphone to produce a local microphone signal; a wireless communications interface to receive a remote microphone signal transmitted wirelessly from a remote device; a processor; and memory having stored therein instructions that configure the processor to apply noise suppression using the remote microphone signal and the local microphone signal to produce an output audio signal that drives the speaker, by determining a difference between strength of the remote microphone signal and strength of the local microphone signal, and if the difference is greater than a threshold, then applying the local and remote microphone signals to respective inputs of a two channel noise suppressor which produces the output audio signal, and if the difference is less than the threshold then applying one, not both, of the local and remote microphone signals to an input of a single channel noise suppressor which produces the output audio signal.
12. The local device of claim 11 wherein at least one of the remote microphone signal and the local microphone signal is a beam formed signal.
13. The local device of claim 11 wherein determining the difference comprises computing a ratio of power of the remote microphone signal to power of the local microphone signal.
14. The local device of claim 11 wherein the processor is further configured to smooth the strength of the remote microphone signal and smooth the strength of the local microphone signal when determining the difference, according to a smoothing parameter, and the threshold comprises an upper value and a lower value.
15. The local device of claim 14 wherein the processor is configured to control a duration in which the two channel noise suppressor is producing the output audio signal after the difference has exceeded the upper value of the threshold, based on the smoothing parameter and based on the lower value of the threshold.
16. The local device of claim 11 wherein if the difference is less than the threshold and an amount of time that has passed since the difference was above the threshold is within a certain delay, then the local and remote microphone signals are applied to the inputs, respectively, of the two channel noise suppressor which produces the output audio signal.
17. The local device of claim 16 wherein if the difference is less than the threshold and the amount of time that has passed since the difference was above the threshold is greater than the certain delay, then only the remote microphone signal is applied to the input of the single channel noise suppressor.
18. The local device of claim 11 wherein the local device is a head worn device worn by a user and the remote device is not worn by the user.
19. The local device of claim 18 wherein the local device is a headset, and the remote device is a smartphone or a tablet computer.
20. The local device of claim 18 wherein the single channel noise suppressor estimates noise based on a single input channel, and the two channel noise suppressor estimates noise based on two input channels.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) Several aspects of the disclosure here are illustrated by way of example and not by way of limitation in the figures of the accompanying drawings in which like references indicate similar elements. It should be noted that references to “an” or “one” aspect in this disclosure are not necessarily to the same aspect, and they mean at least one. Also, in the interest of conciseness and reducing the total number of figures, a given figure may be used to illustrate the features of more than one aspect of the disclosure, and not all elements in the figure may be required for a given aspect.
(2)
(3)
(4)
(5)
(6)
DETAILED DESCRIPTION
(7) Several aspects of the disclosure with reference to the appended drawings are now explained. Whenever the shapes, relative positions and other aspects of the parts described are not explicitly defined, the scope of the invention is not limited only to the parts shown, which are meant merely for the purpose of illustration. Also, while numerous details are set forth, it is understood that some aspects of the disclosure may be practiced without these details. In other instances, well-known circuits, structures, and techniques have not been shown in detail so as not to obscure the understanding of this description.
(8)
(9) As seen in the block diagram of
(10) As seen in
(11) However, using the two channel noise suppressor in the local device to reduce ambient noise as described above works well only when the distance d between the remote microphone and the sound source 5 is much smaller than the distance D between the sound source 5 and the local microphone. Referring briefly back to
(12) Moreover, in real usage scenarios of the remote listening system, the desired condition d<<<D is not always achieved or controllable by the user 2. In addition, the user 2 may not know or be aware that the two channel noise suppressor works better when the remote device 3 is much closer to the sound source than to the user 2 and the local device 1.
(13) Accordingly, one aspect of the disclosure here is an automatic method of changing a noise suppressor mode of operation in a remote listening system, from a two channel noise suppressor to a one channel suppressor, when d which is the distance between the remote microphone 4 and a desired sound source 5 becomes greater than a threshold. The threshold may be, for example, one half of D which is the distance between the desired sound source 5 and the local microphone 6. The threshold represents the situation where the remote device is not placed sufficiently close to the sound source 5, or is too far away from the sound source 5 (e.g., because the talker moves away from the remote device 3, or someone who is holding the remote device 3 moves away from the talker or places it too far away from the talker.)
(14) As in the example with the two channel noise suppressor having the threshold of 6 dB, when d increases to about one half of D, the processor automatically signals a change in how the output audio signal is produced, from using the two channel noise suppressor to using the single channel noise suppressor.
(15) Still referring to
(16) Returning to the comparison block, if the difference is less than the threshold then the comparison block provides a control signal to the 1 chNS/2 chNS block to apply the remote microphone signal only to a single input of a single channel noise suppressor (which produces the output audio signal that drives the speaker 7.) This changing between 1 chNS and 2 chNS modes of operation is illustrated using example remote and local microphone signals in
(17)
(18) Improving Speech Intelligibility Through Spectral Shaping of the Remote Microphone Signal
(19) Referring now to
(20) The elements of the intelligibility enhancer together serve to increase speech intelligibility by preserving the SNR in the ear canal of the user 2 to be the same as the SNR obtained with the un-enhanced remote microphone signal where leaking ambient noise (that leaks past the passive isolation provided by local device 1) is combined with the amplified and noise-reduced remote microphone signal. This intelligibility enhancement is obtained, on top of the enhancement that is due to the remote microphone 4 having a better SNR than the local microphone 6 because of the proximity to the sound source 5 (distance d being shorter than distance D, see
(21) The enhancement EQ block has an EQ filter (a spectral shaping filter) that may be a fixed filter (not adaptive or dynamically varying over time) and may be described as follows. Studies by others have defined a speech intelligibility index, SII, that represents a measure of speech intelligibility which varies as a function of ambient noise level, distance from the source, speaking type, hearing loss and binaural or monaural hearing. The SII models were defined for four different speaking types, namely Normal, Raised, Loud, and Shout. The SII values or scores for the Raised, Loud, and Shout speech are in general progressively higher than those for the Normal speech in the same noise and hearing conditions. Since the SII models contain reference spectra for all these speaking types, it can be observed that the Raised, Loud, and Shout speech spectra are not only higher in level than the Normal speech spectra, but also with a frequency shift progressively towards higher frequencies, respectively. From these studies, the inventor of the present disclosure created three spectral shaping functions that describe the spectral differences between the speech spectra of Raised, Loud, and Shout speech, respectively, to the speech spectrum of Normal speech. Thus applying each of these spectral shaping functions to the Normal speech spectra one can generate the original speech spectra for Raised, Loud, and Shout speech. By subtracting from each of these three shaping functions the overall level difference between the Raised, Loud, and Shout speech spectra and of the Normal speech spectrum, respectively, three new shaping functions (final shaping functions) were obtained that have the same average level as that of the Normal speech spectrum but their speech energies are re-distributed, e.g., attenuated progressively at lower frequencies and boosted at higher frequencies. Laboratory experimentation showed that these new, spectrally shaped speech functions when applied to the original microphone speech spectra do in fact result in reduced word error rate, WER, or equivalently increased intelligibility, as compared to the remote microphone signal to which the spectrally shaped speech functions have not been applied.
(22)
(23) Viewed another way, and as also seen in
(24)
(25) The following are examples of various aspects of the intelligibility enhancer. A method for enhancing speech intelligibility in a local device of a remote listening system, the method comprising: obtaining a remote microphone signal from a remote device; filtering the remote microphone signal using an equalization filter to produce a filtered remote microphone signal, wherein a magnitude response of the equalization filter exhibits progressively greater attenuation below a cross-over frequency and progressively greater boost above the cross-over frequency up to 1000 Hz, wherein the cross-over frequency is between 500 Hz and 800 Hz; and providing the filtered remote microphone signal to drive a speaker in the local device. In aspect of this method, the magnitude response of the equalization filter exhibits monotonically decreasing magnitude from 2500 Hz to 8 kHz. This method may further comprise: performing a comparison between strength of the remote microphone signal as input to the equalization filter and strength of the filtered remote microphone signal; and based on the comparison setting a gain that is applied to the filtered remote microphone signal in such a way that the power of the remote microphone signal is the same before the equalization filter and after the equalization filter.
(26) In another example, a method for enhancing speech intelligibility in a local device, the method comprises: obtaining a remote microphone signal from a remote device; filtering the remote microphone signal using an equalization filter to produce a filtered remote microphone signal, wherein a magnitude response of the equalization filter has a first sub-range below a cross-over frequency in which there is attenuation between 1 dB to 20 dB, and a second sub-range above the cross-over frequency up to 3000 Hz in which there is boost between 1 to 9 dB, wherein the cross-over frequency is between 500 Hz and 800 Hz; and providing the filtered remote microphone signal to drive a speaker in the local device. In this method, the magnitude response of the equalization filter may exhibit monotonically decreasing magnitude from 2500 Hz to 8 kHz. Moreover, this method may further comprise equalizing strengths of the filtered remote microphone signal and the remote microphone signal as input to the equalization filter, by applying a gain to the filtered remote microphone signal.
(27) While certain aspects have been described above and shown in the accompanying drawings, it is to be understood that such are merely illustrative of and not restrictive on the broad invention, and that the invention is not limited to the specific constructions and arrangements shown and described, since various other modifications may occur to those of ordinary skill in the art. For example, although