Headphone acoustic noise cancellation and speaker protection
11361745 · 2022-06-14
Assignee
Inventors
- Tom-Davy W. Saux (Los Altos, CA, US)
- Brian D. Clark (San Jose, CA, US)
- Thomas M. Jensen (San Francisco, CA, US)
- Vladan Bajic (San Francisco, CA, US)
Cpc classification
G10K11/17881
PHYSICS
H04R2430/20
ELECTRICITY
G10K2210/3028
PHYSICS
G10K11/17821
PHYSICS
G10K2210/3014
PHYSICS
G10K2210/1081
PHYSICS
International classification
G10K11/178
PHYSICS
Abstract
An audio system has an ambient sound enhancement (ASE) function, in which an against-the-ear audio device having a speaker converts a digitally processed version of an input audio signal into amplified sound. The amplification may be in accordance with a stored hearing profile of the user. The audio system also has an acoustic noise cancellation (ANC) function that may be combined in various ways with the ASE function, and that may be responsive to voice activity detection. Other aspects are also described and claimed.
Claims
1. A method for audio signal processing of microphone signals of a headphone, the method comprising: filtering an audio signal from a first microphone of a headphone to produce a first filtered signal; filtering an audio signal from a second microphone of the headphone to produce a second filtered signal; performing dynamic range control upon the first filtered signal to produce a first dynamic range adjusted signal by i) side chain processing of the first filtered signal by applying the first filtered signal to a speaker displacement model that yields a speaker displacement function in time domain, and ii) performing gain reduction upon the first filtered signal in response to detecting that a signal level of the displacement function exceeds a threshold; performing dynamic range control upon the second filtered signal to produce a second dynamic range adjusted signal; and combining the first dynamic range adjusted signal and the second dynamic range adjusted signal into an audio signal that drives a speaker of the headphone.
2. The method of claim 1 wherein as integrated in the headphone, the first microphone is more sensitive than the second microphone to sound within a user's ear that is being blocked by the headphone.
3. The method of claim 1 wherein as integrated in the headphone the second microphone is more sensitive than the first microphone to a far field sound source outside of the headphone.
4. The method of claim 1 wherein filtering the audio signal from the first microphone is performed by an acoustic noise cancellation system.
5. The method of claim 1 wherein filtering the audio signal from the second microphone is performed by an acoustic noise cancellation system.
6. The method of claim 1 wherein filtering the audio signal from the first microphone is performed by a feedback signal processing path of an acoustic noise cancellation system, and filtering the audio signal from the second microphone is performed by a feedforward signal processing path of the acoustic noise cancellation system, and the speaker produces anti-noise.
7. The method of claim 1 wherein the headphone is a sealing, in-ear type.
8. The method of claim 1 further comprising a. performing a beamforming process upon signals from a plurality of microphones that include the second microphone, to produce the audio signal from the second microphone.
9. The method of claim 1 wherein performing dynamic range control comprises compressing the first filtered signal.
10. The method of claim 1 wherein performing dynamic range control comprises performing said gain reduction by: filtering the first filtered signal using a low shelf filter that attenuates frequencies below a transition frequency; and varying a transition frequency of the low shelf filter based on the speaker displacement function.
11. The method of claim 10 wherein filtering the audio signal from the first microphone, filtering the first filtered signal using the low shelf filter, and side chain processing of the first filtered signal to determine the transition frequency of the low shelf filter, are performed in time domain.
12. The method of claim 1 wherein performing dynamic range control comprises: performing said gain reduction by filtering the first filtered signal using a first cascade of first and second low shelf filters; filtering the second filtered signal using a second cascade of first and second low shelf filters; and performing side chain processing of input signals to the first and second cascades to vary transition frequencies of the low shelf filters.
13. The method of claim 12 wherein in a given cascade, a transition frequency of the second low shelf filter is greater than a transition frequency of the first low shelf filter.
14. The method of claim 1 wherein performing dynamic range control comprises: combining the first dynamic range adjusted signal and the second dynamic range adjusted signal into a first combination audio signal; applying the first combination audio signal to a speaker displacement function; detecting signal level of the speaker displacement function; and filtering the first and second dynamic range adjusted signals using respective low shelf filters and based upon the signal level of the speaker displacement function.
15. The method of claim 14 wherein performing dynamic range control comprises: varying a transition frequency of the respective low shelf filters based on side chain processing of the first combination audio signal.
16. The method of claim 1 wherein performing dynamic range control upon the second filtered signal comprises: applying the first dynamic range adjusted signal to a speaker displacement function; detecting signal level of the speaker displacement function; and filtering the second filtered signal using a low shelf filter but only if the detected signal level of the speaker displacement function is greater than a threshold.
17. A headphone comprising: a speaker, a first microphone and a second microphone integrated into a headphone housing; a processor; and memory having stored therein instructions that the processor executes for audio signal processing of signals from the first and second microphones by filtering an audio signal from the first microphone to produce a first filtered signal; filtering an audio signal from the second microphone to produce a second filtered signal; performing dynamic range control upon the first filtered signal to produce a first dynamic range adjusted signal by i) filtering the first filtered signal using a low shelf filter that attenuates frequencies below a transition frequency, ii) applying the first filtered signal to a speaker displacement model that yields a speaker displacement function in time domain, and iii) varying a transition frequency of the low shelf filter based on the speaker displacement function; performing dynamic range control upon the second filtered signal to produce a second dynamic range adjusted signal; and combining the first dynamic range adjusted signal and the second dynamic range adjusted signal into an audio signal that drives a speaker of the headphone.
18. The headphone of claim 17 wherein the memory has stored therein instructions that the processor executes to filter the audio signal from the first microphone in a feedback signal processing path of an acoustic noise cancellation system, and to filter the audio signal from the second microphone in a feedforward signal processing path of the acoustic noise cancellation system.
19. The headphone of claim 17 wherein the memory has stored therein instructions that the processor executes so that filtering the audio signal from the first microphone, filtering the first filtered signal using the low shelf filter, and side chain processing of the first filtered signal to determine the transition frequency of the low shelf filter, are performed in time domain.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) Several aspects of the disclosure here are illustrated by way of example and not by way of limitation in the figures of the accompanying drawings in which like references indicate similar elements. It should be noted that references to “an” or “one” aspect in this disclosure are not necessarily to the same aspect, and they mean at least one. Also, in the interest of conciseness and reducing the total number of figures, a given figure may be used to illustrate the features of more than one aspect of the disclosure, and not all elements in the figure may be required for a given aspect.
(2)
(3)
(4)
(5)
(6)
DETAILED DESCRIPTION
(7) Several aspects of the disclosure with reference to the appended drawings are now explained. Whenever the shapes, relative positions and other aspects of the parts described are not explicitly defined, the scope of the invention is not limited only to the parts shown, which are meant merely for the purpose of illustration. Also, while numerous details are set forth, it is understood that some aspects of the disclosure may be practiced without these details. In other instances, well-known circuits, structures, and techniques have not been shown in detail so as not to obscure the understanding of this description.
(8)
(9) The headphone 1 has an against-the-ear acoustic transducer or speaker 7 arranged and configured to reproduce sound (that is represented in an audio signal) directly into the ear of a user, an external microphone 5 (arranged and configured to receive ambient sound directly), and an internal microphone 3 (arranged and configured to directly receive the sound reproduced by the speaker 7.) The headset is configured to acoustically couple the external microphone to an ambient environment of the headphone, in contrast to the internal microphone being acoustically coupled to a trapped volume of air within the ear that is being blocked by the headphone. In one variation, as integrated in the headphone and worn by its user, the external microphone 5 may be more sensitive than the internal microphone 3 to a far field sound source outside of the headphone. Viewed another way, as integrated in the headphone and worn by its user, the external microphone 5 may be less sensitive than the internal microphone 3 to sound within the user's ear. Here it should be noted that while the figures show a single microphone symbol in each instance (external microphone 5 and internal microphone 3), as producing a sound pickup channel, this does not mean that the sound pickup channel must be produced by only one microphone. In some instances, the sound pickup channel may be the result of combining multiple microphone signals, e.g., by a beamforming process performed on a multi-channel output from a microphone array—this variation or option is depicted in dotted lines in the figures, as additional external microphones and a beamforming process.
(10) In one aspect, along with the transducers and the electronics that process and produce the transducer signals (output microphone signals and an input audio signal to drive the speaker), there is also electronics that is integrated in the headphone housing. Such electronics may include an audio amplifier to drive the speaker with an audio signal (that may include program audio), a microphone sensing circuit or amplifier that receives the microphone signals converts them into a desired format for digital signal processing, and a digital processor 2 and associated memory (not shown), where the memory stores instructions for configuring or programing the processor (e.g., instructions to be executed by the processor) to perform digital signal processing methods as described below in detail. A playback signal (program audio) that may contain user content such as music, podcast, or the voice of a far end user during a voice communication session can also be provided to drive the speaker in some modes of operation, e.g., during noise cancellation mode. The playback signal may be provided to the processor from an external, companion audio source device (not shown in the example of
(11) Turning now
(12) Note that in some cases, the noise cancellation mode of operation is performed during user content media playback, where a program audio signal containing for example music or a podcast or the voice of a far end user in a phone call is also combined into the single audio signal that is driving the speaker 7. In other cases, the program audio signal is silent during noise cancellation mode.
(13) As explained above, there are instances where the output signals from one or both of the feedforward and feedback paths of the ANC system can overdrive the speaker 7, such as when the user is walking (footfall events) and/or when the ambient environment is loud (e.g., rock or pop concert.) This problem is more likely when the headphone 1 is a sealing, in-ear type. To mitigate this, dynamic range control is performed upon the first filtered signal from Gfb, to produce a first dynamic range adjusted signal, and upon the second filtered signal from Gff, to produce a second dynamic range adjusted signal, before driving the speaker 7. In the example of
(14) In one instance, the dynamic range control includes downward compressing the first filtered signal, and/or downward compressing the second filtered signal. This reduces the magnitude of a component of the speaker input signal (e.g. the first filtered signal produced by the feedback path) which helps reduce sound pressure in the trapped volume of air. That sound pressure would otherwise increase beyond normal loud sounds, due to footfall events (e.g. the user is walking, hopping, rolling over a bump).
(15) Still referring to
(16) Note that while a low shelf cut filter attenuates frequencies below its transition frequency, its response flattens out to a certain level that still passes through the input signal at a meaningful level. This is contrast to a low pass filter. While it too attenuates frequencies below a cutoff frequency, the frequency response of a low pass filter generally maintains a continuous roll off as the frequency drops until the input signal is essentially no longer passed through.
(17) Also, it should be noted that in this disclosure, detecting signal level (for example when evaluating the speaker displacement function) refers to a generic way of covering different techniques of determining the wide-band strength of a signal. This is in contrast to computing narrow band strengths, in given frequency bins for instance. Detecting the signal level may include for example envelope detection. Time domain techniques for envelope detection may be more suitable here, to ensure low latency in the response by the controller 11.
(18) Similar to the dynamic range control of the feedback path (at the output of the Gfb block), dynamic range control may also be performed in the feedforward path, and particularly upon the second filtered signal that is produced at the output of Gff block. This produces a second dynamic range adjusted signal which is then combined with the first dynamic range adjusted signal (at the summing junction shown) into an audio signal that drives the speaker 7 of the headphone. In this particular case, an approach for dynamic range control that is similar to the one applied to the feedback path is taken, namely using a controller 9 that, similar to the controller 11, performs side chain processing of at least the output of the Gff block in the same manner as described above (applying the filtered signal to the input of a speaker displacement model and comparing the resulting speaking displacement function to a threshold based on which a transition frequency of a low shelf cut filter 8 is computed.) An option here is to also consider the first filtered signal when performing the side chain processing, as indicated by the dotted line connecting the output of the Gfb block to the controller 9. For example, the two filtered signals may be combined, as represented by the summing junction, into a single audio signal that is then input to the speaker displacement model.
(19) The dynamic range control applied to the output of the Gff block may serve to reduce the magnitude of the output of the feedforward path, so that the speaker 7 is less likely to be overdriven when the user is in a loud ambient environment. As to the dynamic range control applied to the output of the Gfb block, that may serve to reduce the magnitude of the output of the feedback path, so that the speaker 7 is less likely to be overdriven during footfall events (e.g., the user is walking or riding over bumps.) As most of the energy in footfall events is below 50 Hz, the transition frequency of the low shelf cut filter 10, and perhaps also that of the low shelf cut filter 8, may vary between 20 Hz to 50 Hz. Thus, the speaker 7 is protected against the disturbances caused by footfall in both quiet and loud ambient environments, while both the feeedforward and feedback paths of the ANC system are active.
(20) One of the problems encountered when seeking to protect the speaker 7 against being overdriven is how to keep the delay in responding to a detected overdriving condition (in the feedback and/or feedforward paths) as short as possible. A solution here is to perform the filtering by the Gfb block, the filtering by the low shelf filter, and the side chain processing (to determine the transition frequency of the low shelf filter), in time domain. For example, the filtering and side chain processing may all be performed without converting any of their input signals into frequency domain or sub-band signals, so as to avoid introducing too much latency into the feedback paths. Also, using a low shelf filter in the dynamic range control also helps keep the delay as short as possible, because of such a filter's desirable phase response characteristics. These observations also apply in a similar manner to reduce latency when responding to a detected overdriving condition in the feedforward path.
(21) Referring now to
(22) It should be noted that if there is no footfall event and the ambient environment is not loud, then the side chain processing performed by each of the controller 9, the controller 11 (
(23) Turning now to
(24) In
(25) The effect of the version shown in
(26) In this disclosure, microphone signals are processed by an ANC system and are translated into speaker displacement functions, for purposes of speaker protection. Thus, the use of personally identifiable information is not likely to be needed in this disclosure. However, it should be understood that any such use should follow privacy policies and practices that are generally recognized as meeting or exceeding industry or governmental requirements for maintaining the privacy of users. In particular, personally identifiable information should be managed and handled so as to minimize risks of unintentional or unauthorized access or use, and the nature of authorized use should be clearly indicated to users.
(27) To aid the Patent Office and any readers of any patent issued on this application in interpreting the claims appended hereto, applicant wishes to note that they do not intend any of the appended claims or claim elements to invoke 35 U.S.C. 112(f) unless the words “means for” or “step for” are explicitly used in the particular claim.
(28) While certain aspects have been described and shown in the accompanying drawings, it is to be understood that such are merely illustrative of and not restrictive on the broad invention, and that the invention is not limited to the specific constructions and arrangements shown and described, since various other modifications may occur to those of ordinary skill in the art. For example, although not shown in