Method and system for providing hearing assistance to a user
09769576 · 2017-09-19
Assignee
Inventors
- Francois Marquis (Corminboeuf, CH)
- Hans Mulder (Wünnewil, CH)
- Samuel Harsch (Ballaigues, CH)
- Tim Jost (Auvernier, CH)
Cpc classification
H03G3/344
ELECTRICITY
H04R25/50
ELECTRICITY
H04R25/554
ELECTRICITY
International classification
Abstract
Method for providing hearing assistance to a user by capturing audio signals with a microphone arrangement of a transmission unit; analyzing, by a voice activity detector of the transmission unit, the captured audio signals to judge whether a voice is present close to the microphone arrangement or not, analyzing, by a noise level estimator, the captured audio signals to estimate a surrounding noise level, processing the captured audio signals and transmitting, by a transceiver of the transmission unit, the processed audio signals via a wireless link to a receiver unit, and stimulating the user's hearing, by stimulating means at or in at least one user ear, according to the received audio signals, wherein the audio signals are processed via the wireless link by setting the gain applied to the audio signals according to the close voice/no close voice judgment of the voice activity detector and the estimated surrounding noise level.
Claims
1. A method for providing hearing assistance to a user, comprising: capturing audio signals by a microphone arrangement (17) of a transmission unit (10, 110); analyzing, with a voice activity detector (24) of the transmission unit, the captured audio signals in order to judge whether a voice is present close to the microphone arrangement or not; analyzing, by a noise level estimator (25), the captured audio signals in order to estimate a surrounding noise level; processing the captured audio signals and transmitting, with a transceiver (28) of the transmission unit, the processed audio signals via a wireless link (12) to a receiver unit (14, 114, 214); and stimulating the user's hearing with stimulating means (16, 64, 82, 182) worn at or in at least one of the user's ears in accordance with the transmitted processed audio signals received by the receiver unit; wherein the audio signals are processed prior to and/or after transmission via the wireless link by applying a gain to the audio signals according to close voice/not close voice judgment of the voice activity detector and the surrounding noise level estimated, wherein said gain is set to a first gain value (g.sub.1) during times when the presence of close voice is judged, wherein said gain is reduced from the first gain value by an attenuation value (a) to a second gain value (g.sub.2) when a change from close voice to not close voice is judged, and wherein the attenuation value is selected as a function of the surrounding noise level estimated in such a manner that the attenuation value increases as the surrounding noise level estimated increases.
2. The method of claim 1, wherein the audio signals received by the receiver unit (14, 114) are processed, by a gain control unit (62, 74, 120), by applying said gain to the received audio signals.
3. The method of claim 1, wherein the captured audio signals are processed, prior to transmission, by a gain control unit (20B) of the transmission unit (110), by applying said gain to the captured audio signals.
4. The method of claim 1, wherein said gain is increased from the second gain value (g.sub.2) by the attenuation value (a) to the first gain value (g.sub.1) when a change from no close voice to close voice is judged.
5. The method of claim 2, wherein the voice activity detector (24) generates a close voice judgement value, and wherein the noise level estimator (25) generates a surrounding noise level estimation value.
6. The method of claim 5, wherein the attenuation value (a) is determined in the transmission unit (10) from the close voice judgement value and the surrounding noise level estimation value, and wherein the attenuation value or a corresponding command for the gain control unit (62, 74, 120) is transmitted to the receiver unit (14, 114).
7. The method of claim 5, wherein the noise level estimator (25) is part of the transmission unit (10), and wherein audio signals captured by the microphone arrangement (17) are supplied as input to the noise level estimator.
8. The method of claim 7, wherein the close voice judgement value and the surrounding noise level estimation value are transmitted via the digital wireless link (12) to the receiver unit (14, 114) for selecting the attenuation value (a).
9. The method of claim 1, wherein the attenuation value (a) is set to a minimum attenuation value (a.sub.min) when the estimated surrounding noise level is at or below a first threshold value (l.sub.1).
10. The method of claim 9, wherein the first threshold value (l.sub.1) is not more than 66 dBA.
11. The method claim 9, wherein the attenuation value (a) is set to a maximum attenuation value (a.sub.max) when the estimated surrounding noise level is above or at a second threshold value (l.sub.2).
12. The method of claim 11, wherein the maximum attenuation value (a.sub.max) is at least 10 dB.
13. The method of claim 11, wherein the second threshold value (l.sub.2) is at least 70 dBA.
14. The method of claim 11, wherein the attenuation value (a) is selected to increase linearly with a first slope within a first range of the estimated surrounding noise level with increasing estimated surrounding noise level.
15. The method of claim 14, wherein the attenuation value (a) is set to a minimum attenuation value (a.sub.min) when the estimated surrounding noise level is at or below a first threshold value (l.sub.1), wherein the first range is limited by the first threshold value (l.sub.1) and the second threshold value (l.sub.2), respectively.
16. The method of claim 9, wherein the minimum attenuation value (a.sub.min) is not more than 6 dB.
17. The method of claim 1, wherein the first gain value (g.sub.1) is set as a function of the estimated surrounding noise level.
18. The method of claim 17, wherein the first gain value (g.sub.1) is selected to increase with increasing estimated surrounding noise level.
19. The method of claim 18, wherein the first gain value (g.sub.1) is selected to increase linearly with a second slope within a second range of the estimated surrounding noise level with increasing estimated surrounding noise level.
20. The method of claim 14, wherein the first gain value (g.sub.1) is selected to increase linearly with a second slope within a second range of the estimated surrounding noise level with increasing estimated surrounding noise level, wherein the first gain value (g.sub.1) has a lower constant value (g.sub.min) when the estimated surrounding noise level is below or at the lower limit of said second range and has an upper constant value (g.sub.max) when the estimated surrounding noise level is above or at the upper limit (l.sub.2) of said second range.
21. The method of claim 19, wherein the first slope is smaller than the second slope.
22. The method of claim 13, wherein the first gain value (g.sub.1) is selected to increase with increasing estimated surrounding noise level, and wherein the first range and the second range are identical.
23. The method of claim 16, wherein during times when no presence of close speech is judged the second gain value (g.sub.3) is adjusted, upon a change in the estimated surrounding noise level, to the new estimated surrounding noise level according to a function which varies less strongly with the estimated surrounding noise level than the function of the first gain value (g.sub.1) and the function of the attenuation value (a).
24. The method of claim 1, wherein the surrounding noise level estimation is performed only if it has been judged that there is no close voice captured by the microphone arrangement (17).
25. The method of claim 1, wherein the gain control unit (20B, 62, 74, 120) reduces the gain progressively from the first value (g.sub.1) to the second value (g2) during a given release time period when a change from close voice to no close voice is judged.
26. The method of claim 15, wherein the release time period is from 100 ms to 10 seconds.
27. The method of claim 1, wherein the gain control unit (20B, 62, 74, 120) keeps the gain at the first gain value (g.sub.1) for a given hold-on time period when a change from close voice to no close voice is judged, prior to progressively reducing the gain from the first gain value to the second gain value (g.sub.2) during a release time period.
28. The method of claim 17, wherein the hold-on time period is from 100 ms to 10 seconds.
29. The method of claim 1, wherein the gain control unit (20B, 62, 74) increases the gain within an attack time period from the second gain value (g.sub.2) to the first gain value (g.sub.1) when a change from no close voice to close voice is judged.
30. The method of claim 29, wherein the attack time period is from 0.5 ms to 10 ms.
31. The method of claim 1, wherein the microphone arrangement (17) comprises at least two spaced apart microphones (17A, 17B), wherein for judging whether a voice is present close to the microphone arrangement, the total energy contained in the voice spectrum of the audio signals captured at at least one of the microphones is estimated, and wherein the value of the direction of arrival of the captured audio signals is estimated by comparing the audio signals captured by at least two of the spaced apart microphones.
32. The method of claim 1, wherein the transceiver (28) is a digital transceiver and wherein the wireless link (12) is a digitally modulated link.
33. The method of claim 1, wherein said first gain value is applied in the receiver unit and said attenuation value is applied in the transmission unit.
34. A system for providing hearing assistance to a user, comprising: a receiver unit (14, 114): a transmission unit (10) comprising a microphone arrangement (17) for capturing audio signals, a voice activity detector (24) for analyzing the captured audio signals in order to judge whether a voice is present close to the microphone arrangement, an audio signal processing unit (20) for processing the captured audio signals, a transceiver (28) for transmitting the processed audio signals via a wireless link (12) to the receiver unit; the system comprising a noise level estimator (25) for analyzing captured audio signals in order to estimate a surrounding noise level, the system further comprising a gain control unit (62, 74, 120), and means (62, 74) for processing the received audio signals by setting, by said gain control unit, the gain applied to the audio signals according to the close voice judgment and the estimated surrounding noise level; and means (16, 64, 82, 120) to be worn at or in at least one of the user's ears for stimulating the user's hearing according to the audio signals processed by said means for processing the received audio signals; wherein said gain is set to a first gain value (g.sub.1) as a function of the surrounding noise level estimation during times when the presence of close voice is judged, wherein said gain is reduced from the first gain value by an attenuation value (a) to a second gain value (g.sub.2) when a change from close voice to no close voice is judged, and wherein the attenuation value is selected as a function of the estimated surrounding noise level in such a manner that the attenuation value increases with increasing estimated surrounding noise level.
35. The system of one of claim 34, wherein the stimulating means (82) is part of the receiver unit (14, 214) or is directly connected thereto.
36. The system of one of claim 34, wherein the receiver unit (14, 114, 214) is part of or connected to a hearing instrument (16, 64) comprising the stimulating means (182).
37. The system of claim 36, wherein the gain control unit (120) forms part of the hearing instrument (64).
38. The system of claim 36, wherein the gain control unit (62) forms part of the receiver unit (14).
39. The system of claim 34, wherein the noise level estimator (25) is part of the transmission unit (10, 110).
40. A system for providing hearing assistance to a user, comprising: a receiver unit (214); a transmission unit (110) comprising a microphone arrangement (17) for capturing audio signals, a voice activity detector (24) for analyzing the captured audio signals in order to judge whether a voice is present close to the microphone arrangement, an audio signal processing unit (20) for processing the captured audio signals, the audio signal processing unit including a gain control unit (20B) a transceiver (28) for transmitting the processed audio signals via a wireless link (12) to the receiver unit; the system comprising a noise level estimator (25) for analyzing captured audio signals in order to estimate a surrounding noise level, means (16, 64, 82) to be worn at or in at least one of the user's ears for stimulating the user's hearing according to the audio signals received by said receiver unit; wherein said gain control unit is adapted set the gain applied to the captured audio signal to a first gain value (g.sub.1) as a function of the surrounding noise level estimation during times when the presence of close voice is judged, wherein said gain is reduced from the first gain value by an attenuation value (a) to a second gain value (g.sub.2) when a change from close voice to no close voice is judged, and wherein the attenuation value is selected as a function of the estimated surrounding noise level in such a manner that the attenuation value increases with increasing estimated surrounding noise level.
41. The system of one of claim 40, wherein the stimulating means (82) is part of the receiver unit (14, 214) or is directly connected thereto.
42. The system of one of claim 40, wherein the receiver unit (14, 114, 214) is part of or connected to a hearing instrument (16, 64) comprising the stimulating means (182).
43. The system of claim 42, wherein the gain control unit (120) forms part of the hearing instrument (64).
44. The system of claim 42, wherein the gain control unit (62) forms part of the receiver unit (14).
45. The system of claim 40, wherein the noise level estimator (25) is part of the transmission unit (10, 110).
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
DETAILED DESCRIPTION OF THE INVENTION
(11)
(12)
(13) According to one embodiment, the transmission unit 10 may be adapted to be worn by the respective speaker 11 below the speaker's neck, for example with a transmitter using a lapel microphone or a shirt collar microphone.
(14) In
(15) An example of a transmission unit 10 is shown in
(16) The transmission units 10 further includes a voice activity detector (VAD) 24 and a surrounding noise level (SNL) estimator 25. The audio signal processing unit 20, the VAD 24 and the SNL estimator 25 may be implemented by a digital signal processor (DSP) indicated at 22.
(17) In addition, the transmission units 10 also may comprise a microcontroller 26 acting on the DSP 22 and the transmitter 28. The microcontroller 26 may be omitted in case that the DSP 22 is able to take over the function of the microcontroller 26.
(18) The microphone arrangement 17 comprises at least two spaced-apart microphones 17A, 17B, the audio signals of which may be used in the audio signal processing unit 20 for acoustic beamforming in order to provide the microphone arrangement 17 with a directional characteristic.
(19) The VAD 24 uses the audio signals from the microphone arrangement 17 as an input in order to determine the times when the person 11 using the respective transmission unit 10 is speaking.
(20) The VAD 24 may provide a corresponding control output signal to the microcontroller 26 in order to have, for example, the transmitter 28 sleep during times when no voice is detected and to wake up the transmitter 28 during times when voice activity is detected. In addition, a control command corresponding to the output signal of the VAD 24 may be generated and transmitted via the wireless link 12 in order to mute the receiver units 14 or saving power when the user 11 of the transmission unit 10 does not speak. To this end, a unit 32 is provided which serves to generate a digital signal comprising the audio signals from the processing unit 20 and the control data generated by the VAD 24, which digital signal is supplied to the transmitter 28.
(21) The VAD 24 comprises a voice energy estimator unit which uses the microphone signals (or a processed version of the microphone signals) in order to compute the total energy contained in the voice spectrum with a fast attack time in the range of a few milliseconds, preferably not more than 10 milliseconds. By using such short attack time it is ensured that the system is able to react very fast when the speaker 12 begins to speak.
(22) The VAD 24 also comprises a direction of arrival (DOA) estimator which is provided for estimating, by comparing the audio signals captured by the microphone 17A and the audio signals captured by the microphone 17B, the DOA value of the captured audio signals. The DOA value indicates the Direction of Arrival estimated with the phase differences in the audio band of the incoming signal captured by the microphones 17A, 17B.
(23) The VAD 24 decides, depending on the signals provided by the voice energy estimator and the DOA estimator, whether close voice, i.e. the speaker's voice, is present at the microphone arrangement 17 or not. Such type of VAD is described in more detail in WO 2009/138365 A1 and corresponding U.S. Pat. No. 8,345,900.
(24) The SNL estimator 25 serves to estimate the ambient noise level and generates a corresponding output signal which may be supplied to the unit 32 for being transmitted via the wireless link 12.
(25) More in detail, the SNL estimator 25 uses the audio signal produced by the omnidirectional rear microphone 17B in order to estimate the surrounding noise level present at the microphone arrangement 17. However, it can be assumed that the surrounding noise level estimated at the microphone arrangement 17 is a good indication also for the surrounding noise level present at the ears of the user 13, like in classrooms for example. The SNL estimator 25 may be active only if no close voice is presently detected by the VAD 24 (in case that close voice is detected by the VAD 24, the SNL estimator 25 is disabled by a corresponding signal from the VAD 24). A very long time constant in the range of 10 seconds may be applied by the SNL estimator 25. The SNL estimator 25 measures and analyzes the total energy contained in the whole spectrum of the audio signal of the microphone 17B (usually the surrounding noise in a classroom is caused by the voices of other pupils in the classroom). The long time constant ensures that only the time-averaged surrounding noise is measured and analyzed, but not specific short noise events.
(26) The surrounding noise level values may be updated regularly during speech pauses, e.g. with a rate in the range of 20 ms to 5 s.
(27) The A-weighted output of the SNL estimator 25 may be also supplied to the VAD in order to be used to adapt accordingly to it the threshold level for the close voice/no close voice decision made by the VAD 24 in order to maintain a good SNR for the voice detection.
(28) An example of a digital receiver unit 14 is shown in
(29) The amplified audio signals may be supplied to the audio input of a hearing aid 64.
(30) Rather than supplying the audio signals amplified by the variable gain amplifier 62 to the audio input of a hearing aid 64, the receiver unit 14 may include a power amplifier 78 which may be controlled by a manual volume control 80 and which supplies power amplified audio signals to a loudspeaker 82 which may be an ear-worn element integrated within or connected to the receiver unit 14. Volume control also could be done remotely from the transmission unit 10 by transmitting corresponding control commands to the receiver unit 14.
(31) Another alternative implementation of the receiver maybe a neck-worn device having a transmitter 84 for transmitting the received signals via with an magnetic induction link 86 (analog or digital) to the hearing aid 64 (as indicated by dotted lines in
(32) As already explained above, the VAD 24 provides at its output for a parameter signal which may have two different values:
(33) (a) “Voice ON”: This value is provided at the output if the VAD 24 has decided that close voice is present at the microphone arrangement 17. In this case, a control command is issued and is transmitted to the receiver unit 14, according to which the gain is set to a given value for the amplifier 62 and/or the DSP 74.
(b) “Voice OFF”: If the VAD 24 decides that no close voice is present at the microphone arrangement 17, a “voice OFF” command is issued and is transmitted to the receiver unit 14. In this case, the DSP 74 applies a “hold on time” constant and then a “release time” constant to the amplifier 62. During the “hold on time” the gain set by the amplifier 62 remains at the value applied during “voice ON”. During the “release time” the gain set by the amplifier 62 is progressively reduced from the value applied during “voice ON” to a lower value corresponding to a “pause attenuation” value. Hence, in case of “voice OFF” the gain of the microphone arrangement 17 is reduced relative to the gain of the microphone arrangement 17 during “voice ON”. This ensures an optimum SNR of the sound signals present at the user's ear, since at that time no useful audio signal is present at the microphone arrangement 17 of the transmission unit 10, so that user 13 may perceive ambient sound signals (for example voice from his neighbor in the classroom) without disturbance by noise of the microphone arrangement 17.
(34) In general, the gain is set to a first gain value g.sub.1 during times when the presence of close voice is judged by the VAD 24, and the gain is reduced from this first gain value by an attenuation value a to a second gain value g.sub.2 when a change from close voice (voice on) to no close voice (voice off) is judged by the VAD 24. Unlike in the prior art approaches, the attenuation value a is not constant but is selected as a function of the estimated surrounding noise level (i.e. the output signal of the SNL estimator 25) in such a manner that the attenuation value increases with increasing estimated SNL.
(35) On the other hand, the gain is increased from the second gain value g.sub.2 by the attenuation value a to the first gain value g.sub.1 when a change from no close voice (voice off) to close voice (voice on) is judged by the VAD 24.
(36) Typically, the first gain value g.sub.1 is set as a function of the estimated SNL.
(37) An example of the dependence of the first gain value g.sub.1, the second gain value g.sub.2 and the attenuation value a on the SNL is shown in
(38) The attenuation value a may be set to a minimum value a.sub.min when the estimated SNL is at or below a first threshold value (which typically corresponds to the lower limit l.sub.1 of the linear range of the first gain value g.sub.1), and it may be set to a maximum attenuation value a.sub.min when the estimated SNL is at or above a second threshold value (which typically corresponds to the upper limit l.sub.2 of the linear range of the second gain value g.sub.2). The minimum attenuation value a.sub.min may be, for example, 6 dB, and the maximum attenuation value a.sub.max may be, for example, 21 dB.
(39) Typically, the attenuation value a is selected to increase linearly within the range between the minimum value a.sub.min and the maximum value a.sub.max, with that range of the SNL being the same as that for the linear increase of the first gain value g.sub.1. Typically, the slope of the linear increase of the first gain value g.sub.1 is steeper than the slope of the linear increase of the attenuation value a. In
(40) While in the embodiment shown in
(41) A modification of the example of the receiver unit of
(42) A modification of the example of the transmission unit of
(43) In the example of
(44) In the example of
(45) In