Ambient noise aware dynamic range control and variable latency for hearing personalization

Abstract

Signal to noise ratio, SNR, is determined in an acoustic ambient environment of an against-the-ear audio device worn by a user, wherein the acoustic ambient environment contains speech by a talker. When the SNR is above a threshold, dynamic range control is applied, as positive gain versus input level, to an audio signal from one or more microphones of the audio device. When the SNR is below the threshold, the dynamic range control applies as zero gain or negative gain to the audio signal. Other aspects are also described and claimed.

Claims

1. A method for sound enhancement in an against the ear audio device, the method comprising: filtering an audio signal in time domain using a minimum phase filter, while the audio signal is converted into sound by an against the ear audio device; determining a hearing loss level associated with the audio device or a user of the audio device; and delaying the audio signal, ahead or after the minimum phase filter while filtering the audio signal using the minimum phase filter when determining that the hearing loss level is higher than a threshold, and not delaying the audio signal while filtering the audio signal using the minimum phase filter when determining the hearing loss level is lower than the threshold.

2. The method of claim 1 wherein the audio signal is an audio program content signal or an audio downlink communications signal.

3. The method of claim 1 wherein the audio signal is from one or more microphones of the audio device that are picking up ambient sound, and not an audio program content signal or an audio downlink communications signal.

4. The method of claim 1 further comprising filtering the audio signal by performing feedback cancellation.

5. The method of claim 4 further comprising performing dynamic range control upon the audio signal by applying positive gain to the audio signal versus input level of the audio signal, in accordance with the hearing loss level.

6. The method of claim 1 wherein the against the ear audio device is a headphone, and the filtering and delaying are performed by a processor in the headphone.

7. An apparatus comprising: a digital processor configured to filter an audio signal in time domain using a minimum phase filter, determine a hearing loss level, and delay the audio signal, ahead or after the minimum phase filter while filtering the audio signal using the minimum phase filter when determining that the hearing loss level is higher than a threshold, and not delaying the audio signal while filtering the audio signal using the minimum phase filter when determining the hearing loss level is lower than the threshold.

8. The apparatus of claim 7 wherein the audio signal is an audio program content signal or an audio downlink communications signal.

9. The apparatus of claim 7 wherein the audio signal is from one or more microphones that are picking up ambient sound, and not an audio program content signal or an audio downlink communications signal.

10. The apparatus of claim 7 wherein the processor is further configured to filter the audio signal by performing feedback cancellation.

11. The apparatus of claim 10 wherein the processor is further configured to perform dynamic range control upon the audio signal by applying positive gain to the audio signal versus input level of the audio signal, in accordance with the hearing loss level.

12. The apparatus of claim 10 wherein the processor is for use in a headphone.

13. The apparatus of claim 7 wherein the processor is further configured to perform dynamic range control upon the audio signal by applying positive gain to the audio signal versus input level of the audio signal, in accordance with the hearing loss level.

14. The apparatus of claim 7 wherein the processor is further configured to perform a beamforming process upon a plurality of microphone signals, to produce the audio signal.

15. An apparatus comprising: a headphone housing having integrated therein a first microphone, and a digital processor configured to filter an audio signal in time domain using a minimum phase filter, determine a hearing loss level, and delay the audio signal, ahead or after the minimum phase filter while filtering the audio signal using the minimum phase filter when determining that the hearing loss level is higher than threshold, and not delaying the audio signal while filtering the audio signal using the minimum phase filter when determining the hearing loss level is lower than the threshold.

16. The apparatus of claim 15 wherein the audio signal is an audio program content signal or an audio downlink communications signal.

17. The apparatus of claim 16 further comprising a second microphone, and the processor is further configured to perform a beamforming process upon signals from the first and second microphones to produce the audio signal.

18. The apparatus of claim 15 wherein the audio signal is from the first microphone.

19. The apparatus of claim 15 wherein the processor is further configured to filter the audio signal by performing feedback cancellation.

20. The apparatus of claim 19 wherein the processor is further configured to perform dynamic range control upon the audio signal by applying positive gain to the audio signal versus input level of the audio signal, in accordance with the hearing loss level.

21. The apparatus of claim 19 wherein the processor is for use in a headphone.

22. The apparatus of claim 15 wherein the processor is further configured to perform dynamic range control upon the audio signal by applying positive gain to the audio signal versus input level of the audio signal, in accordance with the hearing loss level.

Description

BRIEF DESCRIPTION OF THE DRAWINGS

(1) Several aspects of the disclosure here are illustrated by way of example and not by way of limitation in the figures of the accompanying drawings in which like references indicate similar elements. It should be noted that references to “an” or “one” aspect in this disclosure are not necessarily to the same aspect, and they mean at least one. Also, in the interest of conciseness and reducing the total number of figures, a given figure may be used to illustrate the features of more than one aspect of the disclosure, and not all elements in the figure may be required for a given aspect.

(2) FIG. 1 shows an example against-the-ear device.

(3) FIG. 2 is a block diagram of an audio system and a related method for personalized ambient sound enhancement, ASE, using a customized compressor for dynamic range control.

(4) FIG. 3 is a block diagram of an audio system and related method for personalized ASE in which the group delay in the forward path of the ASE is varied as a function of the level of hearing loss.

(5) FIG. 4 is a block diagram an audio system and related method for personalized ASE in which the contribution to the forward path of the ASE by a feedback cancellation subsystem is omitted when the level of hearing loss is below a threshold.

(6) FIG. 5 is a block diagram of an audio system and related method for personalized ASE in which a hearing loss compensation block has disconnected a feedback cancellation filter from the forward path of the ASE and modifies a gain block A in response to the output of the filter.

DETAILED DESCRIPTION

(7) Several aspects of the disclosure with reference to the appended drawings are now explained. Whenever the shapes, relative positions and other aspects of the parts described are not explicitly defined, the scope of the invention is not limited only to the parts shown, which are meant merely for the purpose of illustration. Also, while numerous details are set forth, it is understood that some aspects of the disclosure may be practiced without these details. In other instances, well-known circuits, structures, and techniques have not been shown in detail so as not to obscure the understanding of this description.

(8) Consider as an example a user who is waiting for a train to arrive at a train station, and is wearing a headset. The user could be talking to a friend standing next to them. The headset occludes the user's ear and therefore passively attenuates the voice of the friend. If the headset has an ambient sound enhancement function (ASE) that picks up the ambient sound before amplifying it and reproducing it at the user's ear, then it allows the friend's speech to be heard more easily. The arrival of the train however will result in the train sound also being picked up, amplified and reproduced, thereby making it difficult to discern the friend's speech. In another example, the user (while wearing the headset or holding the mobile phone handset again their ear) is walking with their friend to a local social club or restaurant, and upon entering will hear an increase in babble noise (being reproduced by the ASE.)

(9) It is also likely that the same ambient sound environment is perceived (heard) differently by different users of the ASE, as some users have lower dynamic range in their hearing than others such that soft or quiet sounds are barely heard by those particular users. Several digital audio signal processing techniques referred to as personalized ambient sound enhancement (ASE) are described that can improve the experience of listening to the ambient environment for such individuals, particularly in changing ambient sound environments including but not limited to those identified above.

(10) FIG. 1 shows an example of an against-the-ear device 1 that is part of an audio system in which a method for personalized ASE can be implemented. The against-the-ear device 1 shown is an in-ear earbud (in-ear headphone which may be a sealing-type having for example a flexible ear tip, or a non-sealing or loose fitting type), which may be one of two headphones (left and right) that make up a headset. The methods described below for personalized ASE can be implemented for one or both of the headphones that make up a headset. Alternatives (not shown) to the in-ear earbud include an on-the-ear headphone, an over-the-ear headphone, and a mobile phone handset. The against-the-ear device 1 is shown in use, by its user (who may also be referred to as a listener or a wearer.) The against-the-ear device 1 has an against-the-ear acoustic transducer or speaker 2 (arranged and configured to reproduce sound directly into an ear of a user), an external microphone 3 (arranged and configured to receive ambient sound directly), and an internal microphone 4 (arranged and configured to directly receive the sound reproduced by the speaker 2.) These may all be integrated in a housing of the against-the-ear device 1. In some instances, the transducers and the electronics that process and produce the transducer drive signals (output microphone signals and an input audio signal to drive the speaker 2) can be placed in the same housing. The electronics may include an audio amplifier to drive the speaker 2 with the input audio signal, a microphone sensing circuit or amplifier that receives the microphone signals converts them into a desired format for digital signal processing, and a digital processor and associated memory. The memory stores instructions for configuring the processor (e.g., instructions to be executed by the processor) to perform digital signal processing tasks discussed below in more detail.

(11) In one aspect, some of the electronics reside in another device, separate from the against-the-ear device 1. For instance, the against-ear-device 1 may be a headphone that is connected to an audio source device 5, depicted in the example of FIG. 1 as a smartphone, via a wired connection (in which case there may be no need for a power source in the headphone housing) or via a wireless connection (e.g., a BLUETOOTH link.) In both cases, the connection to the audio source device 5 may be used to deliver a processed version of the input audio signal to the headphone (where it drives the speaker 2), from the external microphone 3 being for example in a housing of the audio source device 5.

(12) There are many instances where a user, while wearing the against-the-ear device 1, may have a preference or need for hearing at a higher sound pressure level, SPL, than would the average person. To meet the preference or need of such a user, the ambient sound is amplified by the audio system in accordance with a hearing profile of the user, and reproduced through the speaker 2. This is also referred to here as a personalized ASE function. If the user, while wearing the headset or holding the smartphone against their ear, then enters a social club that has a much louder ambient sound level, the amplified sound may appear (be heard as) distorted or uncomfortably loud. The audio system should automatically reduce the reproduced ambient sound level in such a condition, based on the wearer's hearing profile and based on the ambient sound level. The audio system may do so in accordance with several aspects of the disclosure here.

(13) FIG. 2 is a block diagram of the audio system and a related method for personalized ASE in which a compressor that performs dynamic range control of the amplified ambient sound is customized not only to the hearing profile of the user but also to the particular acoustic environment of the user. Ambient sound in the acoustic environment of the user (who is wearing or using the against the ear audio device 1) is picked up by the external microphone 3. The output, digitized microphone signal (also referred to as input audio signal) is then filtered by an ASE filter 6. The ASE filter 6 may encompass several digital signal processing operations that are performed upon its input audio signal to help compensate for an individual's reduced hearing sensitivity and otherwise make the reproduced audio signal sound more pleasing to the user (e.g., a talker's speech is more intelligible.) These may include dynamic range control (described further below as upward dynamic range compression), noise suppression (noise reduction), and perhaps other operations. In one aspect, the noise suppressor is active while ASE is active (reproducing ambient sound), and attempts to reduce the undesired parts of the ambient sound. The undesired parts of the ambient sound are those sounds that interfere with for example desired speech by a talker (where both the undesired parts and the desired talker's speech are embedded in the input audio signal.) Such a noise suppressor may use techniques such as fixed spectral shaping, adaptive filtering, and multiple sound pick up channel (multi-channel) statistical filtering.

(14) The transfer function of the ASE filter 6 is variable, e.g., on a frame by frame basis where each frame may include for example 1-10 milliseconds of the microphone signal, and may be set by an ambient sound environment analyzer 8. The input audio signal is filtered by an ASE filter 6 in the sense of a level-dependent and frequency-dependent gain that varies over time (the filtering here is thus nonlinear and time varying.) The analyzer 8 configures the ASE filter 6 based on combining i) information it has derived from the ambient sound pickup channel (e.g., the audio signal from the external microphone 3) and ii) information relating to a hearing profile of the user provided by a hearing loss compensation block (HLC 7.)

(15) As used herein, the “hearing profile” refers to a set of data that defines the hearing needs and preferences of the user including hearing level or hearing loss, as dB HL, across various frequencies of interest within the range of normal human hearing (also referred to here as auditory sub-bands.) The hearing profile may additionally specify quiet, comfortable and loud listening levels, frequency-dependent amplification preferences across different types of audio content (e.g., voice phone call, podcast, music, movies) or the user's sensitivity to noise or sound processing artifacts. The hearing profile may be derived from for example a stored audiogram of the user and may include outcomes of other standard hearing evaluation procedures such as Speech-in-Noise testing or measurement of otoacoustic emissions. In addition, or as an alternative, to objective hearing evaluations such as the audiogram, the hearing profile may be the result of a process that generates acoustic stimuli using the speakers in the against-the-ear audio device and monitors or evaluates the user's responses to those acoustic stimuli (e.g., as verbal responses that have been picked up by a microphone of the audio device, or as manual responses entered by the user through a graphical user interface of the audio system.) The hearing profile may thus define the hearing preference or hearing sensitivity of the user, for example in terms of hearing level in dB (dB HL.)

(16) It should be noted that while the figures here show a single microphone symbol in each instance (external microphone 3 and internal microphone 2), this is being used to generically refer to a sound pickup channel which is not limited to being produced by a single microphone. In many instances, the sound pickup channel may be the result of combining multiple microphone signals, e.g., by a beamforming process performed on a multi-channel output from a microphone array.

(17) The ambient sound as picked up by the external microphone 3 is amplified by the ASE filter 6, by being upward compressed (in the sense of dynamic range control), in accordance with a gain parameter which is set by the ambient sound environment analyzer 8. This compressed audio signal then drives the speaker 2 resulting in the amplified ambient sound content being reproduced at the user's ear.

(18) The compression (dynamic range control) performed by the ASE filter 6 is customized as follows. The analyzer 8 determines signal to noise ratio, SNR, in the input audio signal (from the external microphone 3.) Here, the acoustic ambient environment (and hence the input audio signal) contains speech by a talker (who is not the user.) When determining that the SNR is above a threshold, the ASE filter 6 becomes configured to apply upward compression to the input audio signal. This is also referred to here as reducing dynamic range by applying positive gain, in terms of dB, versus input level (the level of the input audio signal from the external microphone 3.) But when determining that the SNR is below the threshold, the ASE filter becomes configured to apply zero gain or negative gain, in terms of dB, to the input audio signal. The negative gain as a function of low SNR is depicted in the graph of FIG. 1 as a dotted line. Note that in both instances the gain is being applied to the soft sounds, and not the loud sounds (in the input audio signal), and its value is determined in accordance with the hearing profile of the user (who is wearing the against the ear audio device 1). In this manner, the dynamic range control (while personalized for user's hearing profile) also becomes ambient noise aware, so that when SNR in the ambient sound pickup channel drops below a threshold, the dynamic range control gain being applied by the ASE filter drops to zero or negative (in terms of dB). That makes the hearing more comfortable when the user enters a noise or loud environment or when for example a train arrives.

(19) In one aspect, determining the SNR comprises processing the input audio signal to produce a noise estimate and a main signal estimate (and computing a ratio of those two estimates.) The noise and main signal estimates may be computed on a per frequency bin basis, and the resulting SNR may be on a per frequency bin basis, and which may be updated in each audio frame. The updated SNR may then be translated into the gain parameter of the ASE filter based on knowledge of the hearing profile of the user. The updated gain parameter, on a per frequency bin basis, may then be applied to the input audio signal (from the external microphone 3) in frequency domain, by the ASE filter 6.

(20) Turning now to FIG. 3, this is a block diagram of an audio system and related method for personalized ASE in which the group delay in the forward path of the ASE is varied as a function of the level of hearing loss. It has been determined that if the against the ear audio device 1 is reproducing wide audio bandwidth and the user has mild hearing loss, such a user is more likely to be sensitive to the latency introduced by the ASE path (through the ASE filter 6). It is thus desirable to have a latency of less than 1 millisecond introduced by the ASE path, for such users, while other users who have more than mild hearing loss can actually tolerate greater latency (or the greater latency is less noticeable to them.) This result may be achieved as an instance of a more general, digital signal processing method (for ambient sound enhancement in an against the ear audio device), in the audio system of FIG. 3.

(21) Referring to FIG. 3, the ASE filter 6 filters the audio signal coming from the external microphone 3, before the audio signal is converted into sound by the speaker 2 of the against-the-ear audio device 1. The ASE filter 6 does so by applying a feedforward gain (e.g., on a per frequency bin basis, and variable per audio frame.) The feedforward gain may be part of dynamic range control of the input audio signal, applied as a positive gain versus input level to the audio signal in accordance with the hearing profile of the user. The HLC 7 determines the feedforward gain, based on the hearing loss level associated with a user of the audio device 1 (e.g., by accessing a stored audiogram of the user.) The ASE filter 6 may also perform noise suppression.

(22) The ASE filter 6 is adjusted to perform with low latency, when determining the hearing loss level is below a threshold (the user has a mild hearing loss, for example as given above.) But if the HLC 7 determines the hearing loss level is above the threshold, then the ASE filter 6 is configured to perform with high latency. In terms of group delay of a digital filter, the latency of the ASE filter 6 will thus exhibit the relationship or curve shown in FIG. 3, where the group delay is low when the applied gain is low (hearing loss, HL, is low), and high when the applied gain is high (HL is high.) The latency may refer to the delay to which the input audio signal is subjected, between the input of the HLC 7 and the output of the ASE filter 6.

(23) The above-described control of hearing loss-dependent latency in the ASE path may be applied in conjunction with a feedback cancellation, FBC, filter 10 that is filtering the output of the ASE filter 6. This may be during an ambient sound enhancement mode of operation in which only the ambient sound pick up channel is being amplified and reproduced—there is no playback signal (no user audio content such as music or a phone call.) The FBC filter 10 can also be used when there is playback, such as music or a phone call. The FBC filter 10 attempts to remove the acoustic coupling of the speaker 2 into the external microphone 3, particularly when the feedforward gain being applied to the input audio signal is high. The output of the FBC filter 10 may be added to input audio signal coming from the external microphone 3 to result in a combined signal at the input of the ASE filter 6.

(24) Alternatively, still referring to FIG. 3, the output of the FBC filter 10 may be used indirectly to modulate a scalar gain block A at the output of the ASE filter 6. That approach may be useful in instances where the FBC control loop could become unstable. That approach may work as follows. The feedback canceller 11 determines what the FBC filter 10 coefficients should be. The FBC filter coefficients are assumed to represent the state of the feedback path (through the FBC filter 10.) When the feedback canceller 11 determines that the feedback path is too strong (possibly leading to an unstable condition), it will in response reduce the gain of the scalar gain block A. Controlling the gain of block A is thus not based on the output of FBC filter 10, which is an audio signal output. Rather, it is based on the output of feedback canceller 11 which are the filter coefficients that define the FBC filter 10.

(25) The feedback cancellation tends to perform better when latency in the ASE path is increased. This means that greater latency may be desirable when more positive gain is being applied (due to greater hearing loss.)

(26) In one aspect, adjusting the ASE filter 6 for high latency comprises re-configuring the ASE filter 6 from a minimum phase filter into a linear phase filter or a maximum phase filter. In other words, the ASE filter 6 is configured as a minimum phase filter in a base or default configuration, exhibiting low latency, unless the HLC 7 (and perhaps in conjunction with other decision makers such the ambient sound environment analyzer 8 of FIG. 2 above) determine that the user has hearing loss that is above a threshold, in which case the ASE filter 6 is set to a high latency configuration.

(27) In another aspect, the hearing loss dependent latency control method may proceed as follows. An input audio signal is being filtered in time domain, for purposes of noise suppression, using a minimum phase filter (while downstream the audio signal is converted into sound by the speaker 2 of the against the ear audio device.) The audio signal may be from one or more microphones of the audio device that are picking up ambient sound, and not an audio program content signal or an audio downlink communications signal. The noise suppression time domain filtering may be in addition to feedforward gain that is applied as a function of a hearing loss level of a user of the audio device. The hearing loss level associated with the audio device or a user of the audio device is determined. When the determined hearing loss level is high, the audio signal, either upstream or downstream of the minimum phase filter, is delayed but not when the hearing loss level is low. Such a delay may occur by, for example, adding a delay in series with the ASE filter 6. The method may further comprise entering an ambient sound enhancement mode of operation in the audio device in which the feedback cancellation is disabled, in response to the hearing loss level being determined as below a threshold (or the feedforward gain is determined to be below a threshold.) Disabling the feedback cancellation may also be beneficial in that it saves computing resources or reduces power consumption in the audio device 1.

(28) Turning now to FIG. 4, this is a block diagram of an audio system and related method for personalized ASE in which the signal contribution by a feedback cancellation subsystem, to the forward path of the ASE (in an attempt to remove echo), is omitted when the level of hearing loss is determined to be below a threshold. Such an audio system has a digital processor (e.g., in the against the ear audio device 1) that performs the method by first determining a hearing loss level associated with the audio device 1 or a user of the audio device 1. The hearing loss level may be determined by the processor accessing a stored audiogram of the user, or by the processing conducting a hearing test in which it produces speech and other sound stimulus signals to drive the speaker 2 and then monitors the user's response to such stimuli (e.g., verbal responses via the internal microphone 2 or the external microphone 3, manual responses entered by the user through a graphic user interface of the smartphone 5 that is paired with a headphone being the audio device 1.) The processor then sets a feedforward gain (in the forward path of the ASE, in the ASE filter 6) into a low range in response to determining that the hearing loss level is below a threshold, or it sets the feedforward gain into a high range in response to determining that the hearing loss level is above the threshold. In response to the feedforward gain being set into the low range, the processor disables a feedback canceller (that would otherwise modify an audio signal to which the feedforward gain is applied and is processed through the ASE filter 6 before being converted into sound by the against-the-ear audio device 1.) In one aspect, if the feedforward is set into the high range, then the processor enables the feedback canceller.

(29) When HLC 7 determines that the output of the FBC filter 10 should be disconnected from the signal chain that originates from the external microphone 3, the feedback canceller 11 should also stop computing updates to the FBC filter 10. However, there could be another mode of operation as shown in FIG. 5. There, even though the FBC filter 10 output is disconnected, the feedback canceller 11 continues to run with a “side chain” in which it continually monitors the feedback path strength by evaluating the sum of the signal from the microphone 3 and the output of the FBC filter 3. The feedback canceller here applies the necessary gain adjustment to block A, similar to the optional approach depicted in dotted lines in FIG. 3. Thus, referring to FIG. 5, when the FBC filter 10 is disconnected, the signal from external microphone 3 becomes routed directly to the input of the ASE filter 6. At the same time, in the side chain process, a feedback corrected microphone signal is constructed (at the output of the summing junction) and is fed to an input of the feedback canceller 11 which analyzes the feedback corrected microphone signal to determine how to adjust the gain of the block A.

(30) To aid the Patent Office and any readers of any patent issued on this application in interpreting the claims appended hereto, applicant wishes to note that it does not intend any of the claims or claim elements below to invoke 35 U.S.C. 112(f) unless the words “means for” or “step for” are explicitly used in the particular claim.

(31) As would be readily understood, the use of personally identifiable information should follow privacy policies and practices that are generally recognized as meeting or exceeding industry or governmental requirements for maintaining the privacy of users. In particular, personally identifiable information should be managed and handled so as to minimize risks of unintentional or unauthorized access or use, and the nature of authorized use should be clearly indicated to users.

(32) While certain aspects have been described and shown in the accompanying drawings, it is to be understood that such are merely illustrative of and not restrictive on the broad invention, and that the invention is not limited to the specific constructions and arrangements shown and described, since other modifications may occur to those of ordinary skill in the art. The description is thus to be regarded as illustrative instead of limiting.

Ambient noise aware dynamic range control and variable latency for hearing personalization

Assignee

Inventors

Cpc classification

Classification Explorer

H04R1/406

ELECTRICITY

Classification Explorer

G10L2021/02166

PHYSICS

Classification Explorer

H04R29/005

ELECTRICITY

Classification Explorer

H04R3/005

ELECTRICITY

Classification Explorer

H04R2499/11

ELECTRICITY

Classification Explorer

H04R1/1041

ELECTRICITY

Classification Explorer

G10L21/0232

PHYSICS

Classification Explorer

G10L25/84

PHYSICS

Classification Explorer

H04R2205/041

ELECTRICITY

Classification Explorer

G10L21/0208

PHYSICS

Classification Explorer

H04R2460/01

ELECTRICITY

Classification Explorer

G10L25/51

PHYSICS

Classification Explorer

G10L21/0316

PHYSICS

Classification Explorer

G10L21/02

PHYSICS

Classification Explorer

H04R1/1016

ELECTRICITY

Classification Explorer

G10L21/034

PHYSICS

International classification

Classification Explorer

H04R3/00

ELECTRICITY

Classification Explorer

H04R1/40

ELECTRICITY

Classification Explorer

H04R29/00

ELECTRICITY

Classification Explorer

G10L25/51

PHYSICS

Classification Explorer

G10L25/84

PHYSICS

Classification Explorer

G10L21/0316

PHYSICS

Classification Explorer

G10L21/0232

PHYSICS

Abstract

Claims

Description