Ambient noise aware dynamic range control and variable latency for hearing personalization
11393486 · 2022-07-19
Assignee
Inventors
Cpc classification
H04R2499/11
ELECTRICITY
H04R1/1041
ELECTRICITY
H04R2205/041
ELECTRICITY
G10L21/02
PHYSICS
International classification
Abstract
Signal to noise ratio, SNR, is determined in an acoustic ambient environment of an against-the-ear audio device worn by a user, wherein the acoustic ambient environment contains speech by a talker. When the SNR is above a threshold, dynamic range control is applied, as positive gain versus input level, to an audio signal from one or more microphones of the audio device. When the SNR is below the threshold, the dynamic range control applies as zero gain or negative gain to the audio signal. Other aspects are also described and claimed.
Claims
1. A method for sound enhancement in an against the ear audio device, the method comprising: filtering an audio signal in time domain using a minimum phase filter, while the audio signal is converted into sound by an against the ear audio device; determining a hearing loss level associated with the audio device or a user of the audio device; and delaying the audio signal, ahead or after the minimum phase filter while filtering the audio signal using the minimum phase filter when determining that the hearing loss level is higher than a threshold, and not delaying the audio signal while filtering the audio signal using the minimum phase filter when determining the hearing loss level is lower than the threshold.
2. The method of claim 1 wherein the audio signal is an audio program content signal or an audio downlink communications signal.
3. The method of claim 1 wherein the audio signal is from one or more microphones of the audio device that are picking up ambient sound, and not an audio program content signal or an audio downlink communications signal.
4. The method of claim 1 further comprising filtering the audio signal by performing feedback cancellation.
5. The method of claim 4 further comprising performing dynamic range control upon the audio signal by applying positive gain to the audio signal versus input level of the audio signal, in accordance with the hearing loss level.
6. The method of claim 1 wherein the against the ear audio device is a headphone, and the filtering and delaying are performed by a processor in the headphone.
7. An apparatus comprising: a digital processor configured to filter an audio signal in time domain using a minimum phase filter, determine a hearing loss level, and delay the audio signal, ahead or after the minimum phase filter while filtering the audio signal using the minimum phase filter when determining that the hearing loss level is higher than a threshold, and not delaying the audio signal while filtering the audio signal using the minimum phase filter when determining the hearing loss level is lower than the threshold.
8. The apparatus of claim 7 wherein the audio signal is an audio program content signal or an audio downlink communications signal.
9. The apparatus of claim 7 wherein the audio signal is from one or more microphones that are picking up ambient sound, and not an audio program content signal or an audio downlink communications signal.
10. The apparatus of claim 7 wherein the processor is further configured to filter the audio signal by performing feedback cancellation.
11. The apparatus of claim 10 wherein the processor is further configured to perform dynamic range control upon the audio signal by applying positive gain to the audio signal versus input level of the audio signal, in accordance with the hearing loss level.
12. The apparatus of claim 10 wherein the processor is for use in a headphone.
13. The apparatus of claim 7 wherein the processor is further configured to perform dynamic range control upon the audio signal by applying positive gain to the audio signal versus input level of the audio signal, in accordance with the hearing loss level.
14. The apparatus of claim 7 wherein the processor is further configured to perform a beamforming process upon a plurality of microphone signals, to produce the audio signal.
15. An apparatus comprising: a headphone housing having integrated therein a first microphone, and a digital processor configured to filter an audio signal in time domain using a minimum phase filter, determine a hearing loss level, and delay the audio signal, ahead or after the minimum phase filter while filtering the audio signal using the minimum phase filter when determining that the hearing loss level is higher than threshold, and not delaying the audio signal while filtering the audio signal using the minimum phase filter when determining the hearing loss level is lower than the threshold.
16. The apparatus of claim 15 wherein the audio signal is an audio program content signal or an audio downlink communications signal.
17. The apparatus of claim 16 further comprising a second microphone, and the processor is further configured to perform a beamforming process upon signals from the first and second microphones to produce the audio signal.
18. The apparatus of claim 15 wherein the audio signal is from the first microphone.
19. The apparatus of claim 15 wherein the processor is further configured to filter the audio signal by performing feedback cancellation.
20. The apparatus of claim 19 wherein the processor is further configured to perform dynamic range control upon the audio signal by applying positive gain to the audio signal versus input level of the audio signal, in accordance with the hearing loss level.
21. The apparatus of claim 19 wherein the processor is for use in a headphone.
22. The apparatus of claim 15 wherein the processor is further configured to perform dynamic range control upon the audio signal by applying positive gain to the audio signal versus input level of the audio signal, in accordance with the hearing loss level.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) Several aspects of the disclosure here are illustrated by way of example and not by way of limitation in the figures of the accompanying drawings in which like references indicate similar elements. It should be noted that references to “an” or “one” aspect in this disclosure are not necessarily to the same aspect, and they mean at least one. Also, in the interest of conciseness and reducing the total number of figures, a given figure may be used to illustrate the features of more than one aspect of the disclosure, and not all elements in the figure may be required for a given aspect.
(2)
(3)
(4)
(5)
(6)
DETAILED DESCRIPTION
(7) Several aspects of the disclosure with reference to the appended drawings are now explained. Whenever the shapes, relative positions and other aspects of the parts described are not explicitly defined, the scope of the invention is not limited only to the parts shown, which are meant merely for the purpose of illustration. Also, while numerous details are set forth, it is understood that some aspects of the disclosure may be practiced without these details. In other instances, well-known circuits, structures, and techniques have not been shown in detail so as not to obscure the understanding of this description.
(8) Consider as an example a user who is waiting for a train to arrive at a train station, and is wearing a headset. The user could be talking to a friend standing next to them. The headset occludes the user's ear and therefore passively attenuates the voice of the friend. If the headset has an ambient sound enhancement function (ASE) that picks up the ambient sound before amplifying it and reproducing it at the user's ear, then it allows the friend's speech to be heard more easily. The arrival of the train however will result in the train sound also being picked up, amplified and reproduced, thereby making it difficult to discern the friend's speech. In another example, the user (while wearing the headset or holding the mobile phone handset again their ear) is walking with their friend to a local social club or restaurant, and upon entering will hear an increase in babble noise (being reproduced by the ASE.)
(9) It is also likely that the same ambient sound environment is perceived (heard) differently by different users of the ASE, as some users have lower dynamic range in their hearing than others such that soft or quiet sounds are barely heard by those particular users. Several digital audio signal processing techniques referred to as personalized ambient sound enhancement (ASE) are described that can improve the experience of listening to the ambient environment for such individuals, particularly in changing ambient sound environments including but not limited to those identified above.
(10)
(11) In one aspect, some of the electronics reside in another device, separate from the against-the-ear device 1. For instance, the against-ear-device 1 may be a headphone that is connected to an audio source device 5, depicted in the example of
(12) There are many instances where a user, while wearing the against-the-ear device 1, may have a preference or need for hearing at a higher sound pressure level, SPL, than would the average person. To meet the preference or need of such a user, the ambient sound is amplified by the audio system in accordance with a hearing profile of the user, and reproduced through the speaker 2. This is also referred to here as a personalized ASE function. If the user, while wearing the headset or holding the smartphone against their ear, then enters a social club that has a much louder ambient sound level, the amplified sound may appear (be heard as) distorted or uncomfortably loud. The audio system should automatically reduce the reproduced ambient sound level in such a condition, based on the wearer's hearing profile and based on the ambient sound level. The audio system may do so in accordance with several aspects of the disclosure here.
(13)
(14) The transfer function of the ASE filter 6 is variable, e.g., on a frame by frame basis where each frame may include for example 1-10 milliseconds of the microphone signal, and may be set by an ambient sound environment analyzer 8. The input audio signal is filtered by an ASE filter 6 in the sense of a level-dependent and frequency-dependent gain that varies over time (the filtering here is thus nonlinear and time varying.) The analyzer 8 configures the ASE filter 6 based on combining i) information it has derived from the ambient sound pickup channel (e.g., the audio signal from the external microphone 3) and ii) information relating to a hearing profile of the user provided by a hearing loss compensation block (HLC 7.)
(15) As used herein, the “hearing profile” refers to a set of data that defines the hearing needs and preferences of the user including hearing level or hearing loss, as dB HL, across various frequencies of interest within the range of normal human hearing (also referred to here as auditory sub-bands.) The hearing profile may additionally specify quiet, comfortable and loud listening levels, frequency-dependent amplification preferences across different types of audio content (e.g., voice phone call, podcast, music, movies) or the user's sensitivity to noise or sound processing artifacts. The hearing profile may be derived from for example a stored audiogram of the user and may include outcomes of other standard hearing evaluation procedures such as Speech-in-Noise testing or measurement of otoacoustic emissions. In addition, or as an alternative, to objective hearing evaluations such as the audiogram, the hearing profile may be the result of a process that generates acoustic stimuli using the speakers in the against-the-ear audio device and monitors or evaluates the user's responses to those acoustic stimuli (e.g., as verbal responses that have been picked up by a microphone of the audio device, or as manual responses entered by the user through a graphical user interface of the audio system.) The hearing profile may thus define the hearing preference or hearing sensitivity of the user, for example in terms of hearing level in dB (dB HL.)
(16) It should be noted that while the figures here show a single microphone symbol in each instance (external microphone 3 and internal microphone 2), this is being used to generically refer to a sound pickup channel which is not limited to being produced by a single microphone. In many instances, the sound pickup channel may be the result of combining multiple microphone signals, e.g., by a beamforming process performed on a multi-channel output from a microphone array.
(17) The ambient sound as picked up by the external microphone 3 is amplified by the ASE filter 6, by being upward compressed (in the sense of dynamic range control), in accordance with a gain parameter which is set by the ambient sound environment analyzer 8. This compressed audio signal then drives the speaker 2 resulting in the amplified ambient sound content being reproduced at the user's ear.
(18) The compression (dynamic range control) performed by the ASE filter 6 is customized as follows. The analyzer 8 determines signal to noise ratio, SNR, in the input audio signal (from the external microphone 3.) Here, the acoustic ambient environment (and hence the input audio signal) contains speech by a talker (who is not the user.) When determining that the SNR is above a threshold, the ASE filter 6 becomes configured to apply upward compression to the input audio signal. This is also referred to here as reducing dynamic range by applying positive gain, in terms of dB, versus input level (the level of the input audio signal from the external microphone 3.) But when determining that the SNR is below the threshold, the ASE filter becomes configured to apply zero gain or negative gain, in terms of dB, to the input audio signal. The negative gain as a function of low SNR is depicted in the graph of
(19) In one aspect, determining the SNR comprises processing the input audio signal to produce a noise estimate and a main signal estimate (and computing a ratio of those two estimates.) The noise and main signal estimates may be computed on a per frequency bin basis, and the resulting SNR may be on a per frequency bin basis, and which may be updated in each audio frame. The updated SNR may then be translated into the gain parameter of the ASE filter based on knowledge of the hearing profile of the user. The updated gain parameter, on a per frequency bin basis, may then be applied to the input audio signal (from the external microphone 3) in frequency domain, by the ASE filter 6.
(20) Turning now to
(21) Referring to
(22) The ASE filter 6 is adjusted to perform with low latency, when determining the hearing loss level is below a threshold (the user has a mild hearing loss, for example as given above.) But if the HLC 7 determines the hearing loss level is above the threshold, then the ASE filter 6 is configured to perform with high latency. In terms of group delay of a digital filter, the latency of the ASE filter 6 will thus exhibit the relationship or curve shown in
(23) The above-described control of hearing loss-dependent latency in the ASE path may be applied in conjunction with a feedback cancellation, FBC, filter 10 that is filtering the output of the ASE filter 6. This may be during an ambient sound enhancement mode of operation in which only the ambient sound pick up channel is being amplified and reproduced—there is no playback signal (no user audio content such as music or a phone call.) The FBC filter 10 can also be used when there is playback, such as music or a phone call. The FBC filter 10 attempts to remove the acoustic coupling of the speaker 2 into the external microphone 3, particularly when the feedforward gain being applied to the input audio signal is high. The output of the FBC filter 10 may be added to input audio signal coming from the external microphone 3 to result in a combined signal at the input of the ASE filter 6.
(24) Alternatively, still referring to
(25) The feedback cancellation tends to perform better when latency in the ASE path is increased. This means that greater latency may be desirable when more positive gain is being applied (due to greater hearing loss.)
(26) In one aspect, adjusting the ASE filter 6 for high latency comprises re-configuring the ASE filter 6 from a minimum phase filter into a linear phase filter or a maximum phase filter. In other words, the ASE filter 6 is configured as a minimum phase filter in a base or default configuration, exhibiting low latency, unless the HLC 7 (and perhaps in conjunction with other decision makers such the ambient sound environment analyzer 8 of
(27) In another aspect, the hearing loss dependent latency control method may proceed as follows. An input audio signal is being filtered in time domain, for purposes of noise suppression, using a minimum phase filter (while downstream the audio signal is converted into sound by the speaker 2 of the against the ear audio device.) The audio signal may be from one or more microphones of the audio device that are picking up ambient sound, and not an audio program content signal or an audio downlink communications signal. The noise suppression time domain filtering may be in addition to feedforward gain that is applied as a function of a hearing loss level of a user of the audio device. The hearing loss level associated with the audio device or a user of the audio device is determined. When the determined hearing loss level is high, the audio signal, either upstream or downstream of the minimum phase filter, is delayed but not when the hearing loss level is low. Such a delay may occur by, for example, adding a delay in series with the ASE filter 6. The method may further comprise entering an ambient sound enhancement mode of operation in the audio device in which the feedback cancellation is disabled, in response to the hearing loss level being determined as below a threshold (or the feedforward gain is determined to be below a threshold.) Disabling the feedback cancellation may also be beneficial in that it saves computing resources or reduces power consumption in the audio device 1.
(28) Turning now to
(29) When HLC 7 determines that the output of the FBC filter 10 should be disconnected from the signal chain that originates from the external microphone 3, the feedback canceller 11 should also stop computing updates to the FBC filter 10. However, there could be another mode of operation as shown in
(30) To aid the Patent Office and any readers of any patent issued on this application in interpreting the claims appended hereto, applicant wishes to note that it does not intend any of the claims or claim elements below to invoke 35 U.S.C. 112(f) unless the words “means for” or “step for” are explicitly used in the particular claim.
(31) As would be readily understood, the use of personally identifiable information should follow privacy policies and practices that are generally recognized as meeting or exceeding industry or governmental requirements for maintaining the privacy of users. In particular, personally identifiable information should be managed and handled so as to minimize risks of unintentional or unauthorized access or use, and the nature of authorized use should be clearly indicated to users.
(32) While certain aspects have been described and shown in the accompanying drawings, it is to be understood that such are merely illustrative of and not restrictive on the broad invention, and that the invention is not limited to the specific constructions and arrangements shown and described, since other modifications may occur to those of ordinary skill in the art. The description is thus to be regarded as illustrative instead of limiting.