Controlling wind noise in a bilateral microphone array

Abstract

A pair of earphones have microphone arrays each providing a plurality of microphone signals. A processor receives the microphone signals and applies a first set of filters to a subset of the plurality of microphone signals from each of the arrays, the first set of filters inverting the signals below a cutoff frequency, and provides the first-filtered signals and the remainder of the microphone signals from each of the arrays to a second set of filters. The processor uses the second set of filters to combine the signals to generate a far-field signal that is more sensitive to sounds originating a short distance away from the earphones than to sounds close to the earphones above the cutoff frequency, and omnidirectional below the cutoff frequency, determines a level of wind noise present in the microphone signals, and adjusts the cutoff frequency as a function of the determined level of wind noise.

Claims

1. An apparatus comprising: a first earphone having a first microphone array providing a first plurality of microphone signals, and a first speaker; a second earphone having a second microphone array providing a second plurality of microphone signals, and a second speaker; and a processor receiving the first plurality of microphone signals and second plurality of microphone signals, and configured to: apply a first set of filters to a subset of the plurality of microphone signals from each of the first microphone array and the second microphone array, the first set of filters inverting the signals below a cutoff frequency; provide the first-filtered signals and the remainder of the microphone signals from each of the first microphone array and the second microphone array to a second set of filters; use the second set of filters to combine the microphone signals to generate a far-field signal that is more sensitive to sounds originating a short distance away from the apparatus than to sounds close to the apparatus above the cutoff frequency, and omnidirectional below the cutoff frequency; determine a level of wind noise present in the microphone signals; adjust the cutoff frequency as a function of the determined level of wind noise; and provide the far-field signal to the speakers for output.

2. The apparatus of claim 1, wherein the processor is further configured to: after generating the far-field signal in the second set of filters, apply gain to the output of the filters below a second cutoff frequency which is a function of the first cutoff frequency.

3. The apparatus of claim 1, wherein the processor is further configured to: after generating the far-field signal in the first set of filters, apply a high-pass filter to the output of the filters.

4. The apparatus of claim 1, wherein the processor is further configured to: determine a total low-frequency energy present in the microphone signals; and upon determining that the total sound level is below a first threshold, and the level of wind noise is below a second threshold, increase the cutoff frequency of the first set of filters.

5. The apparatus of claim 1, wherein generating the far-field signal comprises, in the processor: determining a total low-frequency energy present in the microphone signals; computing a sum of the microphone signals; computing a difference of the microphone signals; comparing the sum of the microphone signals to the difference of the microphone signals and to the total low-frequency energy; and determining the cutoff frequency based on the results of the comparison.

6. The apparatus of claim 5, wherein computing the difference of the microphone signals comprises: computing a first difference of microphone signals in the first plurality of microphone signals, computing a second difference of microphone signals in the second plurality of microphone signals, and computing a difference of the first difference and the second difference as the difference of the microphone signals.

7. An apparatus comprising: a first earphone having a first microphone array providing a first plurality of microphone signals, and a first speaker; a second earphone having a second microphone array providing a second plurality of microphone signals, and a second speaker; and a processor receiving the first plurality of microphone signals and second plurality of microphone signals, and configured to: use a first set of filters to combine the microphone signals to generate a far-field signal that is more sensitive to sounds originating a short distance away from the apparatus than to sounds close to the apparatus above a cutoff frequency, and omnidirectional below the cutoff frequency; determine a level of wind noise present in the microphone signals; adjust the cutoff frequency as a function of the determined level of wind noise; provide the far-field signal to the speakers for output; use a second set of filters to combine the microphone signals to generate a near-field signal that is more sensitive to voice signals from a person wearing the earphones than to sounds originating away from the apparatus; combine the microphone signals to generate an omnidirectional signal; combine the near-field signal and the omnidirectional signal using a weighted sum, the weight being a function of the determined level of wind noise to generate a communication signal; and provide the communication signal to a communication system.

8. The apparatus of claim 7, wherein the processor is configured to: determine the level of wind noise for adjusting the cutoff frequency based on a comparison of a sum of the microphone signals to a difference of the microphone signals; and determine the level of wind noise for adjusting the weight applied to the near field signal in the communication signal based on a comparison of the near field signal to the omnidirectional signal.

9. The apparatus of claim 7, wherein generating the far-field signal comprises, in the processor: applying an all-pass filter to a subset of the plurality of microphone signals from each of the first microphone array and the second microphone array, the all-pass filter inverting the signals below the cutoff frequency; and providing the all-pass-filtered signals and the remainder of the microphone signals from each of the first microphone array and the second microphone array to the first set of filters.

10. The apparatus of claim 7, wherein generating the near-field signal and omnidirectional signal comprises, in the processor: applying a third set of filters to a first subset of the plurality of microphone signals from each of the first microphone array and the second microphone array; applying a fourth set of filters to a second subset of the plurality of microphone signals from each of the first microphone array and the second microphone array; combining the filtered first subset with the filtered second subset to generate the near-field signal; and summing the first subset and the second subset to generate the omnidirectional signal.

11. The apparatus of claim 10, wherein generating the near-field signal and omnidirectional signal further comprises: summing the first subset and providing the summed first subset to the third set of filters; summing the second subset and providing the summed second subset to the fourth set of filters; summing the summed first subset and the second summed subset to generate the omnidirectional signal.

12. The apparatus of claim 10, wherein the processor comprises a plurality of sub-processors, and the summing of the first and second subsets is performed by a separate sub-processor from the applying of the third and fourth filters and combining of the filtered subsets.

13. A method comprising, in a processor: receiving, from a first earphone having a first microphone array, a first plurality of microphone signals; receiving, from a second earphone having a second microphone array, a second plurality of microphone signals; and applying a first set of filters to a subset of the plurality of microphone signals from each of the first microphone array and the second microphone array, the first set of filters inverting the signals below a cutoff frequency; providing the first-filtered signals and the remainder of the microphone signals from each of the first microphone array and the second microphone array to a second set of filters; using the second set of filters to combine the microphone signals to generate a far-field signal that is more sensitive to sounds originating a short distance away from the earphones than to sounds close to the apparatus above the cutoff frequency, and omnidirectional below the cutoff frequency; determining a level of wind noise present in the microphone signals; adjusting the cutoff frequency as a function of the determined level of wind noise; and providing the far-field signal to first and second speakers in the respective first and second earphones for output.

14. The method of claim 13, further comprising, in the processor: after generating the far-field signal in the second set of filters, applying gain to the output of the filters below a second cutoff frequency.

15. The method of claim 13, further comprising, in the processor: after generating the far-field signal in the first set of filters, applying a high-pass filter to the output of the filters.

16. The method of claim 13, further comprising, in the processor: determining a total sound level present in the microphone signals; and upon determining that the total sound level is below a first threshold, and the level of wind noise is below a second threshold, increasing the cutoff frequency of the first set of filters.

17. A method comprising, in a processor: receiving, from a first earphone having a first microphone array, a first plurality of microphone signals; receiving, from a second earphone having a second microphone array, a second plurality of microphone signals; using a first set of filters to combine the microphone signals to generate a far-field signal that is more sensitive to sounds originating a short distance away from the apparatus than to sounds close to the apparatus above a cutoff frequency, and omnidirectional below the cutoff frequency; determining a level of wind noise present in the microphone signals; adjusting the cutoff frequency as a function of the determined level of wind noise; providing the far-field signal to first and second speakers in the respective first and second earphones for output; using a second set of filters to combine the microphone signals to generate a near-field signal that is more sensitive to voice signals from a person wearing the earphones than to sounds originating away from the earphones; combining the microphone signals to generate an omnidirectional signal; combining the near-field signal and the omnidirectional signal using a weighted sum, the weight being a function of the determined level of wind noise to generate a communication signal; and providing the communication signal to a communication system.

18. The method of claim 17, further comprising, in the processor: determining the level of wind noise for adjusting the cutoff frequency based on a comparison of a sum of the microphone signals to a difference of the microphone signals; and determining the level of wind noise for adjusting the weight applied to the near field signal in the communication signal based on a comparison of the near field signal to the omnidirectional signal.

19. The method of claim 17, wherein generating the far-field signal comprises, in the processor: applying an all-pass filter to a subset of the plurality of microphone signals from each of the first microphone array and the second microphone array, the all-pass filter inverting the signals below the cutoff frequency; and providing the all-pass-filtered signals and the remainder of the microphone signals from each of the first microphone array and the second microphone array to the first set of filters.

20. The method of claim 17, wherein generating the near-field signal and omnidirectional signal comprises: applying a third set of filters to a first subset of the plurality of microphone signals from each of the first microphone array and the second microphone array; applying a fourth set of filters to a second subset of the plurality of microphone signals from each of the first microphone array and the second microphone array; combining the filtered first subset with the filtered second subset to generate the near-field signal; summing the first subset and the second subset to generate the omnidirectional signal.

21. The method of claim 20, wherein generating the near-field signal and omnidirectional signal further comprises: summing the first subset and providing the summed first subset to the third set of filters; summing the second subset and providing the summed second subset to the fourth set of filters; summing the summed first subset and the second summed subset to generate the omnidirectional signal.

22. The method of claim 20, wherein the processor comprises a plurality of sub-processors, and the summing of the first and second subsets is performed by a separate sub-processor from the applying of the third and fourth filters and combining of the filtered subsets.

Description

BRIEF DESCRIPTION OF THE DRAWINGS

(1) FIG. 1 shows a set of headphones.

(2) FIGS. 2 through 10 show schematic block diagrams.

DESCRIPTION

(3) In a new headphone architecture shown in FIG. 1, two earphones 102, 104 each contain a two-microphone array, 106 and 108. The two earphones 102, 104 are connected to a central unit 110, worn around the user's neck. As shown schematically in FIG. 2, the central unit includes a processor 112, wireless communications system 114, and battery 116. The earphones also each contain a speaker, 118, 120, and additional microphones 122, 124 used for providing feedback-based active noise reduction. The microphones in the two arrays 106 and 108 are labelled as 126, 128, 130, and 132. These microphones serve multiple purposes: their output signals are used as ambient sound to be cancelled in feed-forward noise cancellation, as ambient sound (including the voice of a local conversation partner) to be enhanced for conversation assistance, as voice sounds to be transmitted to a remote conversation partner through the wireless communications system, and as side-tone voice sounds to play back for the user to hear his own voice while speaking. In the example of FIG. 1, the four microphones are arranged with the front microphone on each ear pointing forward, and the rear microphone on each ear pointing rearward. A line through each pair of microphones points generally forward when the headphone is worn by a typical user, to optimize detection of sound from the direction where the user is looking. The earphones are arranged to point their respective pairs of microphones slightly inward when worn, so the lines through the microphone arrays converge a meter or two ahead of user. This has the particular benefit of optimizing the reception of the voice of someone facing the user.

(4) The processor 112 applies a number of configurable filters to the signals from the various microphones. The provision of a high-bandwidth communication channel from all four microphones 126, 128, 130, 132, two located at each ear, to a shared processing system provides new opportunities in both local conversation assistance and communication with a remote person or system. Specifically, as shown in FIG. 3, a first set of filters 202 is used to make the best use of the microphones' physical arrangement, and combine the four microphone signals to form a far-field array optimized for detecting sound from a nearby source, such as a local conversation partner. When we say the array is optimized for detecting sounds from a nearby source, we mean that the sensitivity of the array to signals originating front in front of the headphone wearer at a distance of about one to two meters is greater than the sensitivity to sounds originating closer to or farther from the headphones, or from other directions. The use of all four microphones together, as described in U.S. Patent application publication 2015/0230026, can lead to improved performance over using a separate pair of microphones for each ear. In addition, the arrays can be configured differently for the two ears, for example, to preserve binaural spatial perception, by using two separate sets of filters, 202 and 204.

(5) A third set of filters 206 is used to combine the four microphone signals to form a near-field array optimized for detecting the user's own voice. When we say the array is optimized for detecting the user's own voice, we mean that the sensitivity of the array to signals originating from the user's mouth is greater than the sensitivity to sounds originating farther from the headphones. Even with the microphones 126, 128, 130, 132 physically arranged to optimize far-field pickup in front of the user, the combination of all four microphones has been found to provide near-field voice performance at least as good as, and in some cases better than, a two-microphone array in the same earbud location but physically aimed at the user's mouth.

(6) In some examples, yet another set of filters 208 is used for providing the user's voice back to the user himself, commonly called side-tone. The side-tone voice signal may be filtered differently from the outbound voice signal to account for the effect of the earphone's acoustics on the user's perception of his or her own voice. Finally, active noise reduction (ANR) filters 210, 212 for each ear use at least one of the local microphones to produce noise-cancelling signals. The ANR filters may use one or both external microphones and the feedback microphone for each ear to cancel ambient noise. In some examples, the external microphones from the opposite ear may also be used for ANR in each ear.

(7) The ANR signals, far-field array signals, side-tone signals, and any incoming communication or entertainment signals (not shown) are summed for each ear. As shown in FIG. 4, at least some of the filters are implemented in the processor 112, with the processor handling the distribution of the four microphone signals (plus the feedback microphone signals) to the various filters. Likewise, the processor may handle the summation of the multiple filter outputs and their distribution to the appropriate speakers.

(8) In some examples, as shown in FIG. 5, the processor 112 is provided by a combination of separate dedicated sub-processors, such as left and right ANR processors 302, 304, left and right array processors 306, 308, and communications processor 310. An example of a suitable ANR processor is described in U.S. Pat. No. 8,184,822, the entire contents of which are incorporated here by reference. A similar processor may be used for the array processing. An example of a suitable communications processor is the CSR8670 from Qualcomm Inc., which in some examples also provides general-purpose processing control of the ANR and array processors, as well as providing the wireless communication system 114. In other examples, a single ANR or array processor may handle both sides, or the communication processor may also have separate left- and right-side processors. The ANR and array filters may be provided a single processor per side, or all filtering may be handled by a single processor. The four external microphone signals may each be provided directly to each of the sub-processors, or one or more of the sub-processors, such as the array processors, may receive a subset of the microphone signals directly and transfer those signals over a bus to the other processors (as shown in FIG. 5).

(9) Far-Field Filtering

(10) An example topology for far-field microphone processing is shown in FIG. 6. This represents a sub-set of the processing carried out by the complete product represented in the preceding figures. In this example, each of the four microphone signals LF, LR, RF, and RR is provided to each of two array processors 306, 308. If the same far-field signal is to be provided to each ear, only a single such processor is needed. Each array processor applies a specific filter to each incoming microphone signal before summing the filtered signals to produce a far-field signal for the respective ear. The summed signals are in turn equalized 402, 404, based on the specific filters applied to each individual microphone signal.

(11) The particular filters and related signal processing for generating the far-field signals for output to the left and right ear are described in application U.S. 2015/0230026, incorporated by reference above. All of the filtering, summing, equalizing, and processing shown in FIG. 6 could be performed in a single processor, or a different combination of processors than that used in the example. In some examples, rather than being directly output to the speakers, the array processor outputs are provided as signal inputs to the ANR processors, to provide a directional component to a hear-through feature of the ANR system, such as that described in U.S. Pat. No. 8,798,283, the contents of which are incorporated here by reference.

(12) Near-Field Communication Filters

(13) As noted above, even with the four microphones physically arranged to optimize far-field voice pickup, when all four are combined, they also produce good near-field voice signals for communication purposes. Previous communication headsets have combined two microphones to improve detection of the user's voice, for example, in a beam-forming array aimed at the user's mouth. To a high level, the same type of processing shown in FIG. 6 can be performed to generate a near-field signal, using appropriately different filter coefficients. As compared to FIG. 6, only one set of filters would be needed to generate an outbound voice signal. In some examples, as shown in FIG. 7, one of the array processors 306 or 308 combines the four microphone signals before providing two composite signals to the communications processor 310, which implements the near-field voice filtering. Specifically, the array processor 308 sums the two front microphone signals LF and RF and the two rear microphone signals LR and RR, and provides the two sets of summed signals 502, 504 to the communications processor 310. The communications processor combines the two sets of summed signals to form a near-field array signal that optimizes the user's own voice relative to far-field energy. The front sum and the rear sum are each filtered 506, 508, and the two filtered sums are then combined 510 to generate the near-field array signal 512. This simplifies the design of the communication processor 310 and signal routing between the processors, by providing only two inbound signals to the communication processor. In the particular example of FIG. 7, the wireless communication system 114 is integrated with the communication processor 310 and the near-field signal is provided directly to the outbound communication link. With a more powerful communication processor, the pre-summing may not be needed, and all four microphone signals may be individually filtered to further optimize pickup of the user's voice.

(14) Side-Tone Filters

(15) In headsets that block the user's ear, hearing their own voice played back can help the user control the level at which they speak, and feel more comfortable talking into the headset. As anyone who has listened to a recording of themselves can relate, however, simply providing the outbound communication signal to the user's ear may not sound natural. This is even more pronounced due to the way the earphones 102, 104 change how the user perceives their own voice. U.S. Pat. No. 9,020,160, incorporated here by reference, discusses ways of filtering feedback and feed-forward microphone signals to produce a self-voice signal that sounds more natural. These techniques can be used in the present architecture either using all four microphones, as shown by filter 208 in FIG. 3, or using the pre-summed front microphone signals from the outbound signal processing steps, as shown by filter 514 in FIG. 7. In some examples, the self-voice filtering is done as part of the ANR filtering. This can be particularly advantageous because unmodified feedback-based noise reduction can alleviate a large part of the occlusion effect that amplifies the lower-frequency components of one's voice when wearing headphones. The external microphone signals are then used to re-inject the higher-frequency components of the voice that are lost when the ears are blocked (rather than cancelling them as ambient noise). The cancellation of the occlusion effect may be handled by the ANR processors 302, 304, while the communication processor 310 provides the side-tone signal from the external microphones.

(16) In a simplified example, such as in the example of FIG. 7, the summed front microphone signals from the communications pathway are simply low-pass-filtered and equalized to provide a basic side-tone signal. The side-tone signal is then summed with the other local output signals and provided to the speakers 118, 120

(17) Wind-Noise Mitigation

(18) As noted above, two microphones have previously been used as beam-forming arrays to detect the user's voice. In other examples, as described in U.S. Pat. No. 8,620,650, incorporated here by reference, two microphone signals can be combined to optimize rejection of ambient and wind noise. This can be adapted to the example of FIG. 7, as shown in FIG. 8, to remove wind noise from the near-field array. The term ‘wind noise’ is used here to describe noise caused by air flow directly striking the earphones, as opposed to ‘ambient’ noise, which refers to acoustic noise arriving at the earphones from other sources (which could include distant wind). The method of the '650 patent is used with one microphone signal that is sensitive to wind noise, and one that is less sensitive to wind noise but more sensitive to ambient noise. A weighted sum is used, where the weight given to each signal depends on the relative amount of noise energy present in each signal. In the particular example of FIG. 8, the array signal 512 tends to be sensitive to wind noise. A wind-noise optimizer 556 in the manner of the '650 patent combines the array signal 512 with an omnidirectional signal 552, formed by summing (554) the incoming front sum 502 and rear sum 504. This produces an improved output signal for use as the outbound voice signal. In the particular example of FIG. 8, the processing is done in the communications processor 310, which integrates the wireless communication system 114.

(19) The far-field array signal is also susceptible to wind noise, but different processing is used to manage it. In some examples, as shown in FIG. 9, the processing fades between an omnidirectional mode at low frequencies and the directional far-field array mode at higher frequencies based on the presence of wind noise in the signal. In this example, the four microphone signals are summed, 602, 604, 606, to produce a total energy signal 608. At the same time, a difference (LF−LB) 610 of the two left microphones is computed, a difference (RF−RB) 612 of the two right microphones is computed, and the difference ((LF−LB)−(RF−RB)) 614 of those two differences is computed. The ratio of that final difference signal 616 to the total energy signal 608 is compared 618 to a threshold to produce a wind indicator signal 620. The wind signal 620 serves as an input, along with the total energy signal 608, to a computation 626 that determines a cutoff frequency for two additional sets of filters 622, 624. The wind pre-filters 622 filter the individual microphone signals. In particular, the wind pre-filters apply all-pass filters that invert the phase of the front microphone signals below the computed cutoff frequency. This causes the array to have omnidirectional sensitivity at lower frequencies, and to maintain directivity at higher frequencies. As the wind level increases, the cutoff frequency below which the front microphones are inverted is raised, fading in increasing omnidirectional behavior—at high wind levels, the directional array is not particularly useful anyway, so the entire bandwidth is made omnidirectional.

(20) A second set of wind filters 624 is applied after the far-field array processing 204. This second set of wind filters does two things: it decreases low-frequency gain, and it applies a high-pass filter. In the normal far-field array processing, high gain is applied at lower frequencies to account for the loss of energy due to the directionality of the array. As the sensitivity at lower frequencies is shifted to being omnidirectional, this energy is restored and the gain can be reduced. The cutoff frequency of this low-frequency gain is based on the cutoff frequency of the all-pass filters 622, but may not be exactly the same frequency. At the same time, the high-pass filter removes whatever residual wind noise is still picked up—at particularly high wind levels, this may be more effective than the other techniques. As the wind level increases, both the low-frequency gain cutoff frequency and the high-pass filter cutoff frequency are raised, following the raising inversion frequency of the wind pre-filters. FIG. 9 shows the processing for only the right ear. The same processing is performed for the left ear, and is omitted for clarity. In some examples, the same control signal 620 and cutoff frequencies are used for both ears, and they may be computed once for the whole system, or redundantly in the separate array processors.

(21) Mitigation of White Noise Gain at Low Frequencies

(22) In some examples, also shown in FIG. 9, an additional use is made of the wind filters 622 and 624. When the directional far-field array is used, the effective noise floor at low frequencies is elevated, due to the increased gain needed to make up for loss of energy in the array. This is noticeable to the user when in a quiet environment, but in such an environment, the far-field array is of less benefit than it is in noisy environments. Therefore, the wind noise pre-filter 622 can be used to fade to omnidirectional sensitivity at low frequencies when ambient noise is low, even when wind noise is also low and it would otherwise favor the directional signal. A threshold 628 provides an additional input to the cutoff computation 626, and if the wind detection 620 is low, but the total energy 608 is also below the threshold 628, then the wind pre-filters 622 are still applied. This reduces white-noise gain at low frequencies. The low frequency gain is also restored in this situation by wind filter 624, but the high-pass filter is not used. The cutoff frequency calculated in the low-noise situation may follow a different functional relationship to the total energy signal 608 than in the high wind situation.

(23) Bilateral Wind Mitigation

(24) Rather than combining the left and right microphone signals, as mentioned above in the discussion of near-field voice pickup, the wind-vs-ambient noise mixing algorithm used for the near-field signal can also be adapted to use separate left and right microphone signals to optimize rejection of noise that is asymmetric in the far-field microphone signal, e.g., if wind is striking the user from one side more than the other. In this example, as shown in FIG. 10, the rear microphones are subtracted 702, 704 from the front microphones on each side to produce left and right difference signals 706, 708. These signals are not the same due to shading of the head between the two earpieces. The difference signals are then each low-pass filtered 710, 712 and compared 714 to determine if one side is subject to more wind than the other. If so, the microphone signals from the noisy side are suppressed at low frequencies, where the wind is most problematic by decreasing the gain applied to the microphones from that side at low frequencies by the far-field filters. Alternatively, a pre-filter stage could reduce that gain, similarly to the symmetric wind control method shown in FIG. 9. The system slowly fades back to using all four microphones, and if the wind has died down, this fading continues until full use of all the microphones is restored at all frequencies. If wind is again detected, the system quickly fades back to one-sided operation at low frequencies.

(25) The summing and comparison can be done in each of the array processors (assuming there are two, as in some of the examples), or done in one of them and a control signal provided to the other. If the communication processer were provided with all four microphone signals, rather than with the pre-summed front and rear signal pairs, then a similar left/right wind noise control could be applied to the near-end voice signal in combination with the omnidirectional/directional wind noise control shown in FIG. 7. Alternatively, in the example of FIG. 7, the array processors could decrease the weighting of the left or right microphones in the front/rear sums provided to the communication processor. This approach is also useful with only one microphone per ear, as the total energy on each side can be compared to determine if a noise source is asymmetric, and the signals balanced in the same manner.

(26) Simultaneous Operation

(27) With sufficient processing power, the different sets of filters can be used in parallel to simultaneously produce the near-field and far-field signals. This allows the user to his own voice and a conversation partner's voice simultaneously (i.e., if they are talking over each other), or to talk on the wireless connection at the same time as listening to another person. Aside from simply multitasking, that latter can be useful if more than one person in a conversation is using a device such as the one described herein. See, for example, U.S. Pat. No. 9,190,043, the entire contents of which are incorporated here by reference. Each of the multiple headsets can transmit its user's locally-detected voice, from the near-field filters, to the other headsets, where it can be combined with the results of that headset's far-field filters to provide the user with a complete set of their conversation partner(s) voices.

(28) The simultaneous detection of near-field and far-field voice can also be useful where the near-field is not being used for conversation. For example, if the headset implements or is connected to a voice personal assistant (VPA, the near-field signal can be directed to that system, or to a wake-up word detection process. The near-field signal should provide a higher signal-to-noise ratio for this than simply using ambient microphones.

(29) The near-field and far-field signals can also be compared to each other. One result of this comparison could be to estimate the proximity of the dominant signal—if the correlation of the two is high, it is the user speaking. This can be used for a voice activity detector, or to change other noise reduction algorithms, to name two examples.

(30) In the particular example of FIG. 1, the earphones are connected to the central unit by wires that communicate signals between the microphones and speakers in the earphones and the various processors in the central unit. In other examples, the processing, communications, and battery components are embedded in the earphones, which may be connected to each other by wired or wireless connections. Components and tasks may be split between the earphones, or repeated in both, depending on the architecture and the communication bandwidth. An important consideration of the present disclosure is that the signals from all four microphones, two per ear, are available to at least some of the processors that are generating sound for playback at each ear, and all four signals are ultimately provided to the processor generating signals for transmission over the communication system, though there may be intermediate summing steps for the communication path.

(31) Embodiments of the systems and methods described above comprise computer components and computer-implemented steps that will be apparent to those skilled in the art. For example, it should be understood by one of skill in the art that the computer-implemented steps may be stored as computer-executable instructions on a computer-readable medium such as, for example, Flash ROMS, nonvolatile ROM, and RAM. Furthermore, it should be understood by one of skill in the art that the computer-executable instructions may be executed on a variety of processors such as, for example, microprocessors, digital signal processors, gate arrays, etc. For ease of exposition, not every step or element of the systems and methods described above is described herein as part of a computer system, but those skilled in the art will recognize that each step or element may have a corresponding computer system or software component. Such computer system and/or software components are therefore enabled by describing their corresponding steps or elements (that is, their functionality), and are within the scope of the disclosure.

(32) A number of implementations have been described. Nevertheless, it will be understood that additional modifications may be made without departing from the scope of the inventive concepts described herein, and, accordingly, other embodiments are within the scope of the following claims.

Controlling wind noise in a bilateral microphone array

Assignee

Inventors

Cpc classification

Classification Explorer

H04R1/406

ELECTRICITY

Classification Explorer

G10L2021/02166

PHYSICS

Classification Explorer

H04R3/005

ELECTRICITY

Classification Explorer

H04R1/1075

ELECTRICITY

Classification Explorer

H04R2201/107

ELECTRICITY

Classification Explorer

H04R2410/07

ELECTRICITY

Classification Explorer

H04R5/033

ELECTRICITY

Classification Explorer

H04R1/1016

ELECTRICITY

Classification Explorer

H04R1/1083

ELECTRICITY

Classification Explorer

G10L21/0208

PHYSICS

International classification

Classification Explorer

H04R3/00

ELECTRICITY

Classification Explorer

H04R1/10

ELECTRICITY

Abstract

Claims

Description