Headphone acoustic noise cancellation and speaker protection or dynamic user experience processing
11166099 · 2021-11-02
Assignee
Inventors
- Tom-Davy W. Saux (Los Altos, CA, US)
- Brian D. Clark (San Jose, CA, US)
- Hanchi Chen (San Jose, CA, US)
- Victoria Chiu (San Jose, CA, US)
- Vladan Bajic (San Francisco, CA, US)
- Dana Massie (Santa Cruz, CA)
- Andrew P. Bright (Los Gatos, CA, US)
- Thomas M. Jensen (San Francisco, CA, US)
Cpc classification
G10K11/17881
PHYSICS
G10K11/17833
PHYSICS
H04R5/04
ELECTRICITY
G10K2210/1081
PHYSICS
International classification
H04R1/10
ELECTRICITY
Abstract
A method for audio signal processing of microphone signals of a headphone. An audio signal from a first microphone of a headphone is filtered by an ANC system to produce a first filtered signal. Dynamic range control is performed upon the first filtered signal to produce a first dynamic range adjusted signal that drives a speaker of the headphone. Other aspects are also described and claimed.
Claims
1. A method for audio signal processing of a microphone signal of a headphone, the method comprising: filtering an audio signal from a microphone of a headphone to produce a filtered signal; driving a speaker of the headphone using the filtered signal; side chain processing of the filtered signal to detect energy or power below 20 Hz; and performing gain reduction upon the filtered signal in response to detecting that a signal level of the detected energy or power exceeds a threshold.
2. The method of claim 1 wherein the microphone is an external microphone or an internal microphone as integrated in the headphone.
3. The method of claim 2 wherein filtering the audio signal is performed by a static or adaptive filter of an acoustic noise cancellation system to produce the filtered signal being an anti-noise signal.
4. The method of claim 3 further comprising performing no gain change upon the filtered signal in response to detecting that the signal level does not exceed the threshold.
5. The method of claim 1 wherein filtering the audio signal is performed by a static or adaptive filter of an acoustic noise cancellation system to produce the filtered signal being an anti-noise signal.
6. The method of claim 5 wherein performing gain reduction upon the filtered signal is by a low shelf filter that attenuates frequencies below a transition frequency, wherein the transition frequency of the low shelf filter is varied based on the signal level of the detected energy or power.
7. The method of claim 6 wherein the headphone is a sealing, in-ear type.
8. The method of claim 1 further comprising performing a beamforming process upon signals from a plurality of microphones that include the microphone, to produce the audio signal from the microphone.
9. The method of claim 1 wherein the headphone is a sealing, in-ear type.
10. The method of claim 1 further comprising performing no gain change upon the filtered signal in response to detecting that the signal level does not exceed the threshold.
11. The method of claim 1 wherein filtering the audio signal from the microphone, performing gain reduction upon the filtered signal, and side chain processing of the filtered signal are performed in time domain.
12. A headphone comprising: a speaker, a microphone integrated into a headphone housing; a processor; and memory having stored therein instructions that the processor executes for audio signal processing by filtering an audio signal from the microphone to produce a filtered signal; driving the speaker of the headphone using the filtered signal; side chain processing of the filtered signal to detect energy or power below 20 Hz; and performing gain reduction upon the filtered signal in response to detecting that a signal level of the detected energy or power exceeds a threshold.
13. The headphone of claim 12 wherein the memory has stored therein instructions that the processor executes to filter the audio signal from the microphone in a feedback signal processing path of an acoustic noise cancellation system.
14. The headphone of claim 13 wherein the microphone is an internal microphone.
15. The headphone of claim 12 wherein the memory has stored therein instructions that the processor executes to filter the audio signal from the microphone in a feedforward signal processing path of an acoustic noise cancellation system.
16. The headphone of claim 15 wherein the microphone is an external microphone.
17. The headphone of claim 12 wherein filtering the audio signal is performed by a static or adaptive filter of an acoustic noise cancellation system to produce the filtered signal being an anti-noise signal, and the memory has stored therein further instructions that the processor executes to perform no gain change upon the filtered signal in response to detecting that the signal level does not exceed the threshold.
18. The headphone of claim 12 wherein the instructions configure the processor to perform gain reduction upon the filtered signal by a low shelf filter that attenuates frequencies below a transition frequency, and vary the transition frequency of the low shelf filter based on the signal level of the detected energy or power.
19. The headphone of claim 12 wherein the headphone housing is of a sealing, in-ear type.
20. The headphone of claim 12 wherein filtering the audio signal from the microphone, performing gain reduction upon the filtered signal, and side chain processing of the filtered signal are performed in time domain.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) Several aspects of the disclosure here are illustrated by way of example and not by way of limitation in the figures of the accompanying drawings in which like references indicate similar elements. It should be noted that references to “an” or “one” aspect in this disclosure are not necessarily to the same aspect, and they mean at least one. Also, in the interest of conciseness and reducing the total number of figures, a given figure may be used to illustrate the features of more than one aspect of the disclosure, and not all elements in the figure may be required for a given aspect.
(2)
(3)
(4)
(5)
(6)
(7)
DETAILED DESCRIPTION
(8) Several aspects of the disclosure with reference to the appended drawings are now explained. Whenever the shapes, relative positions and other aspects of the parts described are not explicitly defined, the scope of the invention is not limited only to the parts shown, which are meant merely for the purpose of illustration. Also, while numerous details are set forth, it is understood that some aspects of the disclosure may be practiced without these details. In other instances, well-known circuits, structures, and techniques have not been shown in detail so as not to obscure the understanding of this description.
(9)
(10) The headphone 1 has an against-the-ear acoustic transducer or speaker 7 arranged and configured to reproduce sound that is represented in an audio signal directly into the ear of a user, an external microphone 5 (arranged and configured to receive ambient sound directly), and an internal microphone 3 (arranged and configured to directly receive the sound reproduced by the speaker 7.) The headset is configured to acoustically couple the external microphone to an ambient environment of the headphone, in contrast to the internal microphone being acoustically coupled to a trapped volume of air within the ear that is being blocked by the headphone. In one variation, as integrated in the headphone and worn by its user, the external microphone 5 may be more sensitive than the internal microphone 3 to a far field sound source outside of the headphone. Viewed another way, as integrated in the headphone and worn by its user, the external microphone 5 may be less sensitive than the internal microphone 3 to sound within the user's ear. Here it should be noted that while the figures show a single microphone symbol in each instance (external microphone 5 and internal microphone 3), as producing a sound pickup channel, this does not mean that the sound pickup channel must be produced by only one microphone. In some instances, the sound pickup channel may be the result of combining multiple microphone signals, e.g., by a beamforming process performed on a multi-channel output from a microphone array—this variation or option is depicted in dotted lines in the figures, as additional external microphones and a beamforming process.
(11) In one aspect, along with the transducers and the electronics that process and produce the transducer signals (output microphone signals and an input audio signal to drive the speaker), there is also electronics that is integrated in the headphone housing. Such electronics may include an audio amplifier to drive the speaker with an audio signal (that may include program audio), a microphone sensing circuit or amplifier that receives the microphone signals converts them into a desired format for digital signal processing, and a digital processor 2 and associated memory (not shown), where the memory stores instructions for configuring or programing the processor (e.g., instructions to be executed by the processor) to perform digital signal processing methods as described below in detail. A playback signal (program audio) that may contain user content such as music, podcast, or the voice of a far end user during a voice communication session can also be provided to drive the speaker in some modes of operation, e.g., during noise cancellation mode. The playback signal may be provided to the processor from an external, companion audio source device (not shown in the example of
(12) Turning now
(13) Note that in some cases, the noise cancellation mode of operation is performed during user content media playback, where a program audio signal containing for example music or a podcast or the voice of a far end user in a phone call is also combined into the single audio signal that is driving the speaker 7. In other cases, the program audio signal is silent during noise cancellation mode.
(14) As explained above, there are instances where the output signals from one or both of the feedforward and feedback paths of the ANC system can overdrive the speaker 7, such as when the user is walking (footfall events) and/or when the ambient environment is loud (e.g., rock or pop concert.) This problem is more likely when the headphone 1 is a sealing, in-ear type. To mitigate this, dynamic range control is performed upon the first filtered signal from Gfb, to produce a first dynamic range adjusted signal, and upon the second filtered signal from Gff, to produce a second dynamic range adjusted signal, before driving the speaker 7. In the example of
(15) In one instance, the dynamic range control includes downward compressing the first filtered signal, and/or downward compressing the second filtered signal. This reduces the magnitude of a component of the speaker input signal (e.g. the first filtered signal produced by the feedback path) which helps reduce sound pressure in the trapped volume of air. That sound pressure would otherwise increase beyond normal loud sounds, due to footfall events (e.g. the user is walking, hopping, rolling over a bump).
(16) Still referring to
(17) Note that while a low shelf cut filter attenuates frequencies below its transition frequency, its response flattens out at some level that still passes through the input signal at a meaningful level. This is contrast to a low pass filter: while it too attenuates frequencies below a cutoff frequency, its response generally maintains a continuous to roll off as the frequency drops until the input signal is essentially no longer passed through.
(18) Also, it should be noted that detecting signal level (for example when evaluating the speaker displacement function) refers to a generic way of covering different techniques of determining the wide-band strength of a signal. This is in contrast to computing narrow band strengths, in given frequency bins for instance. Detecting the signal level may be including for example envelope detection. Time domain techniques for envelope detection may be more suitable here, to ensure low latency in the response by the controller 11.
(19) Similar to the dynamic range control of the feedback path (at the output of the Gfb block), dynamic range control is also performed in the feedforward path, and particularly upon the second filtered signal that is produced at the output of Gff block. This produces a second dynamic range adjusted signal which is then combined with the first dynamic range adjusted signal (at the summing junction shown) into an audio signal that drives the speaker 7 of the headphone. In this particular case, an approach for dynamic range control that is similar to the one applied to the feedback path is taken, namely using a controller 9 that, similar to the controller 11, performs side chain processing of at least the output of the Gff block in the same manner as described above (applying the filtered signal to the input of a speaker displacement model and comparing the resulting speaking displacement function to a threshold based on which a transition frequency of a low shelf cut filter 8 is computed.) An option here is to also consider the first filtered signal when performing the side chain processing, as indicated by the dotted line connecting the output of the Gfb block to the controller 9. For example, the two filtered signals may be combined, as represented by the summing junction, into a single audio signal that is then input to the speaker displacement model.
(20) The dynamic range control applied to the output of the Gff block may serve to reduce the magnitude of the output of the feedforward path, so that the speaker 7 is less likely to be overdriven when the user is in a loud ambient environment. As to the dynamic range control applied to the output of the Gfb block, that may serve to reduce the magnitude of the output of the feedback path, so that the speaker 7 is less likely to be overdriven during footfall events (e.g., the user is walking or riding over bumps.) As most of the energy in footfall events is below 50 Hz, the transition frequency of the low shelf cut filter 10, and perhaps also that of the low shelf cut filter 8, may vary between 20 Hz to 50 Hz. Thus, the speaker 7 is protected against the disturbances caused by footfall in both quiet and loud ambient environments, while both the feeedforward and feedback paths of the ANC system are active.
(21) One of the problems encountered when seeking to protect the speaker 7 against being overdriven is how to keep the delay in responding to a detected overdriving condition (in the feedback and/or feedforward paths) as short as possible. A solution here is to perform the filtering by the Gfb block, the filtering by the low shelf filter, and the side chain processing (to determine the transition frequency of the low shelf filter), in time domain. For example, the filtering and side chain processing may all be performed without converting any of their input signals into frequency domain or sub-band signals, so as to avoid introducing too much latency into the feedback paths. Also, using a low shelf filter in the dynamic range control also helps keep the delay as short as possible, because of such a filter's desirable phase response characteristics. These observations also apply in a similar manner to reduce latency when responding to a detected overdriving condition in the feedforward path.
(22) Referring now to
(23) It should be noted that if there is no footfall event and the ambient environment is not loud, then the side chain processing performed by each of the controller 9, the controller 11 (
(24) Turning now to
(25) In
(26) The effect of the version shown in
(27) Referring now to
(28) In accordance with an aspect of the disclosure here, dynamic range control is performed upon the filtered signal that is produced at the output of the Gff block or Gfb block (or both), by side chain processing of the filtered signal to detect energy or power below 20 Hz, for example on a frame by frame basis (digital audio frames.) Then, gain reduction is performed upon the filtered signal (of the Gff block, the Gfb block, or both as shown), in response to detecting that a signal level of the detected energy or power exceeds a threshold. Thus, in the example of a hybrid ANC system in
(29) In this disclosure, microphone signals are processed by an ANC system and are translated into speaker displacement functions, for purposes of speaker protection, or for detecting infrasound energy or power. Thus, the use of personally identifiable information is not likely to be needed in this disclosure. However, it should be understood that any such use should follow privacy policies and practices that are generally recognized as meeting or exceeding industry or governmental requirements for maintaining the privacy of users. In particular, personally identifiable information should be managed and handled so as to minimize risks of unintentional or unauthorized access or use, and the nature of authorized use should be clearly indicated to users.
(30) To aid the Patent Office and any readers of any patent issued on this application in interpreting the claims appended hereto, applicant wishes to note that they do not intend any of the appended claims or claim elements to invoke 35 U.S.C. 112(f) unless the words “means for” or “step for” are explicitly used in the particular claim.
(31) While certain aspects have been described and shown in the accompanying drawings, it is to be understood that such are merely illustrative of and not restrictive on the broad invention, and that the invention is not limited to the specific constructions and arrangements shown and described, since various other modifications may occur to those of ordinary skill in the art. For example, although not shown in