System and method for automatic right-left ear detection for headphones
11343605 · 2022-05-24
Assignee
Inventors
Cpc classification
H04R5/04
ELECTRICITY
H04R1/1041
ELECTRICITY
G10K11/17885
PHYSICS
H04R2420/07
ELECTRICITY
H04R2201/109
ELECTRICITY
G10K11/178
PHYSICS
H04R2410/07
ELECTRICITY
G10K2210/1081
PHYSICS
International classification
H04R1/10
ELECTRICITY
G10K11/178
PHYSICS
H04R5/04
ELECTRICITY
Abstract
System for automatic right-left ear detection for headphone comprising: first earcup and second earcup that are identical. Each of first and second earcups includes: first microphone located on perimeter of each earcup, when first earcup is worn on user's right ear first microphone of first earcup is at location farthest from user's mouth when headphone is worn in normal wear position; second microphone located on perimeter of each earcup, when first earcup is worn on user's right ear, second microphone of first earcup is at location closer than first microphone of first earcup to user's mouth; third microphone located inside each earcup facing user's ear cavity, fourth microphone located at perimeter and bottom center portion of each earcup and facing exterior of each earcup, and fifth microphone located on perimeter of each earcup above and to left of second microphone when looking at outside housing of each earcup.
Claims
1. A method for right-left ear detection for a headphone comprising: receiving microphone signals from at least three microphones included in a first earcup, wherein the headphone includes the first earcup and a second earcup, wherein the first and second earcups are interchangeable; determining which one of the first earcup or the second earcup is being worn on which one of the user's right or left ears, wherein determining includes: generating a first output by performing comparisons of i) strengths of the microphone signals from the at least three microphones in the first earcup or ratios of the strengths of microphone signals from the at least three microphones in the first earcup to ii) a first plurality of thresholds, and generating a second output by performing comparisons of i) the strengths of the microphone signals from the at least three microphones in the first earcup or ratios of the strengths of microphone signals from the at least three microphones in the first earcup to ii) a second plurality of thresholds.
2. The method of claim 1, wherein the at least three microphones in each of the first earcup and second earcup include a first microphone generating a first microphone signal, a second microphone generating a second microphone signal and a third microphone generating a third microphone signal.
3. The method of claim 2, wherein the first plurality of thresholds comprises first, second, third, and fourth predetermined thresholds, and wherein the first output indicates that the first earcup is being worn on a particular ear of the user when (i) a strength of the third microphone signal is greater than the first predetermined threshold, (ii) a ratio of the strength of the third microphone signal and the strength of the second microphone signal is greater than the second predetermined threshold, (iii) the ratio of the strength of the third microphone signal and the strength of the second microphone signal is less than the third predetermined threshold, and (iv) a ratio of the strength of the second microphone signal and the strength of the first microphone signal is greater than the fourth predetermined threshold, wherein the particular ear of the user is one of the right ear or the left ear.
4. The method of claim 2, wherein the second plurality of threshold comprises first, second, third, and fourth predetermined thresholds, and wherein the second output indicates that the second earcup is being worn on a particular ear of the user when: (i) a strength of the third microphone signal is greater than the first predetermined threshold, (ii) a ratio of the strength of the third microphone signal and the strength of the first microphone signal is greater than the second predetermined threshold, (iii) the ratio of the strength of the third microphone signal and the strength of the first microphone signal is less than the third predetermined threshold, and (iv) a ratio of the strength of the first microphone signal and the strength of the second microphone signal is greater than the fourth predetermined threshold.
5. The method of claim 2, wherein the first plurality of thresholds comprises first, second, third, fourth, and fifth predetermined thresholds, and wherein the at least three microphones in the first earcup further include a fourth microphone generating a fourth microphone signal, wherein the first output indicates that the first earcup is being worn on a particular ear of the user when (i) a strength of the third microphone signal is greater than the first predetermined threshold, (ii) a ratio of the strength of the third microphone signal and the strength of the second microphone signal is greater than the second predetermined threshold, (iii) the ratio of the strength of the third microphone signal and the strength of the second microphone signal is less than the third predetermined threshold, (iv) a ratio of the strength of the second microphone signal and the strength of the first microphone signal is greater than the fourth predetermined threshold, and (v) a ratio of the strength of the fourth microphone signal and the strength of the second microphone signal is greater than the fifth predetermined threshold.
6. The method of claim 2, wherein the first plurality of thresholds comprises first, second, third, fourth, and fifth predetermined thresholds, and wherein the at least three microphones in the second earcup further include a fourth microphone generating a fourth microphone signal, wherein the second output indicates that the second earcup is worn on a particular ear of the user when: (i) a strength of the third microphone signal is greater than the first predetermined threshold, (ii) a ratio of the strength of the third microphone signal and the strength of the first microphone signal is greater than the second predetermined threshold, (iii) the ratio of the strength of the third microphone signal and the strength of the first microphone signal is less than the third predetermined threshold, (iv) a ratio of the strength of the first microphone signal and the strength of the second microphone signal is greater than the fourth predetermined threshold, and (v) a ratio of the strength of the fourth microphone signal and the strength of the first microphone signal is greater than the fifth predetermined threshold.
7. The method of claim 1, wherein determining which one of the first earcup or the second earcup is being worn on which one of the user's right or left ears further includes: generating a signal indicating which one of the first earcup or the second earcup is being worn on a particular ear of the user by performing an error correction using the first output and the second output to eliminate spurious single frames.
8. The method of claim 7, further comprising: generating a voice beam signal and a noise beam signal based on at least two microphone signals received from the earcup identified by the signal as being worn on the particular ear of the user.
9. The method of claim 1, further comprising: transforming the microphone signals from the at least three microphones in time domain by bandpass filtering the microphone signals in a predetermined frequency band, wherein the strength of the bandpass filtered microphone signals from the at least three microphones is determined from the predetermined frequency band.
10. The method of claim 1, further comprising: transforming the microphone signals from the at least three microphones from a time domain to a frequency domain; and filtering the microphone signals in the frequency domain in a plurality of frequency bins, wherein the strength of the microphone signals from the at least three microphones is determined by summing one of the plurality of frequency bins or all of the plurality of frequency bins.
11. The method of claim 1 wherein the first earcup and second earcup have identical microphone configurations.
12. An article of manufacture comprising: a non-transitory computer-readable medium having stored thereon instructions that when executed by a processor causes the processor to receive microphone signals from at least three microphones included in a first earcup of a headphone, wherein the at least three microphones in the first earcup include a first microphone to generate a first microphone signal, a second microphone to generate a second microphone signal, and a third microphone to generate a third microphone signal, and the headphone includes the first earcup and a second earcup, wherein the first and second earcups are interchangeable, determine which one of the first earcup or the second earcup is being worn on which one of the user's right or left ears, by generating a first output by performing comparisons of i) strengths of the microphone signals from the at least three microphones in the first earcup or ratios of the strengths of microphone signals from the at least three microphones in the first earcup to ii) a first plurality of threshold, and generating a second output by performing comparisons of i) the strengths of the microphone signals from the at least three microphones in the first earcup or ratios of the strengths of the microphone signals from the at least three microphones in the first earcup to ii) a second plurality of thresholds.
13. The article of manufacture of claim 12 wherein the first plurality of thresholds and the second plurality of thresholds comprises first, second, third, and fourth predetermined thresholds, and wherein the first output indicates that the first earcup is being worn on a particular ear of the user when (i) a strength of the third microphone signal is greater than the first predetermined threshold, (ii) a ratio of the strength of the third microphone signal and the strength of the second microphone signal is greater than the second predetermined threshold, (iii) the ratio of the strength of the third microphone signal and the strength of the second microphone signal is less than the third predetermined threshold, and (iv) a ratio of the strength of the second microphone signal and the strength of the first microphone signal is greater than the fourth predetermined threshold, and wherein the second output indicates that the second earcup is being worn on the particular ear of the user when (i) a strength of the third microphone signal is greater than the first predetermined threshold, (ii) a ratio of the strength of the third microphone signal and the strength of the first microphone signal is greater than the second predetermined threshold, (iii) the ratio of the strength of the third microphone signal and the strength of the first microphone signal is less than the third predetermined threshold, and (iv) a ratio of the strength of the first microphone signal and the strength of the second microphone signal is greater than the fourth predetermined threshold.
14. The non-transitory computer-readable medium of claim 12, wherein the first plurality of thresholds and the second plurality of thresholds comprises first, second, third, fourth, and fifth predetermined thresholds, and wherein the at least three microphones in the first earcup further include a fourth microphone generating a fourth microphone signal, wherein the first output indicates that the first earcup is being worn on a particular ear of the user when (i) a strength of the third microphone signal is greater than the first predetermined threshold, (ii) a ratio of the strength of the third microphone signal and the strength of the second microphone signal is greater than the second predetermined threshold, (iii) the ratio of the strength of the third microphone signal and the strength of the second microphone signal is less than the third predetermined threshold, (iv) a ratio of the strength of the second microphone signal and the strength of the first microphone signal is greater than the fourth predetermined threshold, and (v) a ratio of the strength of the fourth microphone signal and the strength of the second microphone signal is greater than the fifth predetermined threshold.
15. The non-transitory computer-readable medium of claim 12, wherein the at least three microphones in the first earcup further include a fourth microphone generating a fourth microphone signal, wherein the second output indicates that the second earcup is being worn on a particular ear of the user when: (i) a strength of the third microphone signal is greater than the first predetermined threshold, (ii) a ratio of the strength of the third microphone signal and the strength of the first microphone signal is greater than the second predetermined threshold, (iii) the ratio of the strength of the third microphone signal and the strength of the first microphone signal is less than a fifth predetermined threshold, (iv) a ratio of the strength of the first microphone signal and the strength of the second microphone signal is greater than a fourth predetermined threshold, and (v) a ratio of the strength of the fourth microphone signal and the strength of the first microphone signal is greater than the fifth predetermined threshold.
16. The non-transitory computer-readable medium of claim 12, wherein the first earcup and second earcup have identical microphone configurations.
17. A processor for use with a headphone, the processor comprising a processor configured to: receive microphone signals from at least three microphones included in a first earcup of the headphone; and determine which one of the first earcup or a second earcup of the headphone is being worn on which one of a user's right or left ears, by: generating a first output by performing comparisons of i) strengths of the microphone signals from the at least three microphones in the first earcup or ratios of the strengths of microphone signals from the at least three microphones in the first earcup to ii) a first plurality of thresholds; and generating a second output by performing comparisons of i) the strengths of the microphone signals from the at least three microphones in the first earcup or ratios of the strengths of microphone signals from the at least three microphones in the first earcup to ii) a second plurality of thresholds.
18. The processor of claim 17, wherein the at least three microphones in each of the first earcup and second earcup include a first microphone generating a first microphone signal, a second microphone generating a second microphone signal and a third microphone generating a third microphone signal.
19. The processor of claim 18, wherein the first plurality of thresholds comprises first, second, third, and fourth predetermined thresholds, and wherein the first output indicates that the first earcup is being worn on a particular ear of the user when (i) a strength of the third microphone signal is greater than the first predetermined threshold, (ii) a ratio of the strength of the third microphone signal and the strength of the second microphone signal is greater than the second predetermined threshold, (iii) the ratio of the strength of the third microphone signal and the strength of the second microphone signal is less than the third predetermined threshold, and (iv) a ratio of the strength of the second microphone signal and the strength of the first microphone signal is greater than the fourth predetermined threshold, wherein the particular ear of the user is one of the right or the left ear.
20. The processor of claim 18, wherein the second plurality of threshold comprises first, second, third, and fourth predetermined thresholds, and wherein the second output indicates that the second earcup is being worn on a particular ear of the user when: (i) a strength of the third microphone signal is greater than the first predetermined threshold, (ii) a ratio of the strength of the third microphone signal and the strength of the first microphone signal is greater than the second predetermined threshold, (iii) the ratio of the strength of the third microphone signal and the strength of the first microphone signal is less than the third predetermined threshold, and (iv) a ratio of the strength of the first microphone signal and the strength of the second microphone signal is greater than the fourth predetermined threshold.
21. The processor of claim 18, wherein the first plurality of thresholds comprises first, second, third, fourth, and fifth predetermined thresholds, and wherein the at least three microphones in the first earcup further include a fourth microphone generating a fourth microphone signal, wherein the first output indicates that the first earcup is being worn on a particular ear of the user when (i) a strength of the third microphone signal is greater than the first predetermined threshold, (ii) a ratio of the strength of the third microphone signal and the strength of the second microphone signal is greater than the second predetermined threshold, (iii) the ratio of the strength of the third microphone signal and the strength of the second microphone signal is less than the third predetermined threshold, (iv) a ratio of the strength of the second microphone signal and the strength of the first microphone signal is greater than the fourth predetermined threshold, and (v) a ratio of the strength of the fourth microphone signal and the strength of the second microphone signal is greater than the fifth predetermined threshold.
22. The processor of claim 18, wherein the first plurality of thresholds comprises first, second, third, fourth, and fifth predetermined thresholds, and wherein the at least three microphones in the second earcup further include a fourth microphone generating a fourth microphone signal, wherein the second output indicates that the second earcup is worn on a particular ear of the user when: (i) a strength of the third microphone signal is greater than the first predetermined threshold, (ii) a ratio of the strength of the third microphone signal and the strength of the first microphone signal is greater than the second predetermined threshold, (iii) the ratio of the strength of the third microphone signal and the strength of the first microphone signal is less than the third predetermined threshold, (iv) a ratio of the strength of the first microphone signal and the strength of the second microphone signal is greater than the fourth predetermined threshold, and (v) a ratio of the strength of the fourth microphone signal and the strength of the first microphone signal is greater than the fifth predetermined threshold.
23. The processor of claim 17, wherein determining which one of the first earcup or the second earcup is being worn on which one of the user's right or left ears further includes: the processor generating an indicator signal indicating which one of the first earcup or the second earcup is being worn on a particular ear of the user by performing an error correction using the first output and the second output to eliminate spurious single frames.
24. The processor of claim 23, further comprising the processor configured to: generate a voice beam signal and a noise beam signal based on at least two microphone signals received from the earcup identified by the indicator signal as being worn on the particular ear of the user.
25. The processor of claim 17, further comprising the processor configured to: transform the microphone signals from the at least three microphones in time domain by bandpass filtering the microphone signals in a predetermined frequency band, wherein the strength of the bandpass filtered microphone signals from the at least three microphones is determined from the predetermined frequency band.
26. The processor of claim 17, further comprising the processor configured to: transform the microphone signals from the at least three microphones from a time domain to a frequency domain; and filter the microphone signals in the frequency domain in a plurality of frequency bins, wherein the strength of the microphone signals from the at least three microphones is determined by summing one of the plurality of frequency bins or all of the plurality of frequency bins.
27. The processor of claim 17, wherein the first earcup and second earcup have identical microphone configurations.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) The embodiments of the invention are illustrated by way of example and not by way of limitation in the figures of the accompanying drawings in which like references indicate similar elements. It should be noted that references to “an” or “one” embodiment of the invention in this disclosure are not necessarily to the same embodiment, and they mean at least one. In the drawings:
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
DETAILED DESCRIPTION
(10) In the following description, numerous specific details are set forth. However, it is understood that embodiments of the invention may be practiced without these specific details. In other instances, well-known circuits, structures, and techniques have not been shown to avoid obscuring the understanding of this description.
(11)
(12) The user may place the earcups 10.sub.1 and 10.sub.2 on her ears in a first placement where the first earcup 10.sub.1 is placed on her right ear and the second earcup 10.sub.2 is placed on her left ear (
(13) The headphone on
(14)
(15) As shown in
(16) In the
(17) In another embodiment, each earcup 10.sub.1, 10.sub.2 includes three microphones being the first microphone 11.sub.1, the second microphone 11.sub.2 and the third microphone 11.sub.3 which is located inside each earcup facing the user's ear cavity. In some embodiments, the first three microphones 11.sub.1, 11.sub.2, 11.sub.3 and the fourth microphone 11.sub.4 can be used to perform active noise cancellation (ANC).
(18) In one embodiment, each earcup 10.sub.1, 10.sub.2 includes four microphones being the first microphone 11.sub.1, the second microphone 11.sub.2, the third microphone 11.sub.3 and the fourth microphone 11.sub.4. In each of these embodiments, at least three of the microphones 11.sub.1-11.sub.4 capture acoustic signals and generate microphone signals that are processed to determine which earcup 10.sub.1, 10.sub.2 is currently being worn on the user's right ear.
(19) In some embodiments, each earcup 10.sub.1, 10.sub.2 may also includes a fifth microphone 11.sub.5 that is located on a perimeter of each earcup 10.sub.1, 10.sub.2 and above and to the left of the second microphone 11.sub.2 when looking at an outside housing of each earcup. When the cup is on the right ear, the fifth microphone 11.sub.5 may be used together with the second microphone to generate beamforming towards the user's mouth. As shown in
(20) Referring to
(21) When a processor (not shown) that may be included in the headphone or in the mobile device 100 that is separate from the headphone determines that the first earcup 10.sub.1 is worn on the user's right ear, the fifth microphone 11.sub.5 and the second microphone 11.sub.2 of the first earcup 10.sub.1 are known to be located on the half of the first earcup 10.sub.1 that is closer to the user's mouth as shown in
(22) For example, the fifth microphone 11.sub.5 and the second microphone 11.sub.2 may be used to create a microphone array (i.e., beamformers) which can be aligned in the direction of user's mouth. Accordingly, the beamforming process, also referred to as spatial filtering, may be a signal processing technique using the microphone array for directional sound reception.
(23) While not shown in the
(24) In another embodiment, the earcups 10.sub.1, 10.sub.2 are wireless and may also include a battery device, a processor, and a communication interface (not shown). In this embodiment, the processor may be a digital signal processing chip that processes the acoustic signal from at least three of the microphones 11.sub.1-11.sub.m. In one embodiment, the processor may control or include at least one of: the earcup detector 131, the microphone selector (beamformer) 132, the noise suppressor 133 or the automatic gain control (AGC) 134 in
(25) The communication interface may include a Bluetooth™ receiver and transmitter which may communicate speaker audio signals or microphone signals from the microphones 11.sub.1-11.sub.m wirelessly in both directions (uplink and downlink) with the electronic device 100. In some embodiments, the communication interface communicates encoded signal from a speech codec (not shown) to the electronic device 100.
(26) In the embodiments described herein, since the headphones that include two identical earcups 10.sub.1, 10.sub.2 that may be worn in two alternative placements (
(27)
(28) The system 700 also includes an earcup detector 131 that includes a first voice activity detector (VAD) 141, a second VAD 142, a selector 144 which may act as a VAD signal combiner or as an OR function, and an error corrector 143. The system 700 may also include a microphone selector (beamformer) 132, a noise suppressor 133, and an AGC controller 134. While not shown, in some embodiments, the system 700 may also include a speech codec wherein the earcups 10.sub.1, 10.sub.2 are coupled to the electronic device 100 wirelessly and communicates the output of the speech codec 160 to the electronic device 100. In this embodiment, the earcups 10.sub.1, 10.sub.2 include the microphone selector (beamformer) 132, noise suppressor 133, AGC controller 134, and speech codec. In other embodiments, the earcups 10.sub.1, 10.sub.2 are coupled to the electronic device 100 via the headphone wire or wirelessly and the electronic device 100 include the microphone selector (beamformer) 132, noise suppressor 133, AGC controller 134, and speech codec.
(29) The earcup detector 131 may be used during a calibration of the headphone. For example, the user may say a short word or phrase after placing the headphones on her ears. Referring to
(30) The first VAD 141 performs comparisons of strengths of the microphone signals or performs comparisons of the ratio of the strengths of the microphone signals to a first plurality of thresholds to generate a first output. The first output indicates which one of the first earcup 10.sub.1 is worn on the user's right ear. In one embodiment, the earcup detector 131 receives at least three microphone signals from the first earcup 10.sub.1. In one embodiment, the at least three microphone signals include the microphone signals from the first microphone 11.sub.1, the second microphone 11.sub.2 and the third microphone 11.sub.3 (
(31) In another embodiment, the at least three microphone signals include the microphone signals from the first microphone 11.sub.1, the second microphone 11.sub.2, the third microphone 11.sub.3 and the fourth microphone 11.sub.4 (
(32) Similarly, the second VAD 142 in
(33) In one embodiment, the earcup detector 131 receives at least three microphone signals from the first earcup 10.sub.1. In one embodiment, the at least three microphone signals include the microphone signals from the first microphone 11.sub.1, the second microphone 11.sub.2 and the third microphone 11.sub.3 (
(34) In another embodiment, the at least three microphone signals include the microphone signals from the first microphone 11.sub.1, the second microphone 11.sub.2, the third microphone 11.sub.3 and the fourth microphone 11.sub.4 (
(35) In one embodiment, prior to being processed by the first and the second VAD 141, 142, the microphone signals may be transformed from a time domain to a frequency domain and bandpass filtered in a predetermined frequency band. In this embodiment, the strength of the microphone signals is computed within the predetermined frequency band. In this embodiment, the strength of the microphone signals is determined from the predetermined frequency band. In one embodiment, the strength of the microphone signals is the sum of spectral magnitudes of each of the microphones between 200 Hz and 400 Hz.
(36) In another embodiment, prior to being processed by the first and the second VAD 141, 142, the microphone signals may also be bandpass filtered in the time domain to a predetermined frequency band and the strength of the microphone signals is thus the output of the bandpass filters within the predetermined frequency band.
(37) Referring back to
(38) In some embodiments, the selector 144 generates a binary output as a voice activity detector (VAD), regardless if first earcup or second earcup is worn on the right ear. This binary output is generated as an OR function from the first VAD 141 and second VAD 142.
(39) As shown in
(40) In another embodiment, each earcup 10.sub.1, 10.sub.2 also includes a sixth microphone 11.sub.6 (not shown) located adjacent to the first microphone 11.sub.1 and between the first microphone 11.sub.1 and the fifth microphone 11.sub.5. The sixth microphone 11.sub.6 and the first microphone 11.sub.1 of the first earcup 10.sub.1 are located on the half of the earphone that is farther from the user's mouth when the first earcup 10.sub.1 is worn on the right ear. In this embodiment, when the first earcup 10.sub.1 is worn on the right ear, the microphone selector (beamformer) 132 may also select first microphone 11.sub.1 and sixth microphone 11.sub.6 to form a voice beam and a noise beam using the microphone signals from the second earcup 10.sub.2 which is worn on the left ear.
(41) In
(42) The noise suppressor 133 may suppress noise in the voice beam signal based on the VAD output received from the earcup detector 131 or based on the spectral separation between the voice beam and the noise beam. The noise suppressed voice beam signal is then outputted to the AGC controller 134. The AGC controller 134 performs AGC on the noise suppressed signal based on the VAD output received from the earcup detector 131.
(43) The following embodiments of the invention may be described as a process, which is usually depicted as a flowchart, a flow diagram, a structure diagram, or a block diagram. Although a flowchart may describe the operations as a sequential process, many of the operations can be performed in parallel or concurrently. In addition, the order of the operations may be re-arranged. A process is terminated when its operations are completed. A process may correspond to a method, a procedure, etc.
(44)
(45)
(46) In Block 602, the earcup detector determines that the second earcup is worn on the user's right ear by generating a second output by performing comparisons of the strengths of the microphone signals from the at least three microphones in the first earcup or ratios of the strengths of microphone signals from the at least three microphones in the first earcup to a second plurality of thresholds.
(47)
(48) Keeping the above points in mind,
(49) An embodiment of the invention may be a machine-readable medium having stored thereon instructions which program a processor to perform some or all of the operations described above. A machine-readable medium may include any mechanism for storing or transmitting information in a form readable by a machine (e.g., a computer), such as Compact Disc Read-Only Memory (CD-ROMs), Read-Only Memory (ROMs), Random Access Memory (RAM), and Erasable Programmable Read-Only Memory (EPROM). In other embodiments, some of these operations might be performed by specific hardware components that contain hardwired logic. Those operations might alternatively be performed by any combination of programmable computer components and fixed hardware circuit components.
(50) While the invention has been described in terms of several embodiments, those of ordinary skill in the art will recognize that the invention is not limited to the embodiments described, but can be practiced with modification and alteration within the spirit and scope of the appended claims. The description is thus to be regarded as illustrative instead of limiting. There are numerous other variations to different aspects of the invention described above, which in the interest of conciseness have not been provided in detail. Accordingly, other embodiments are within the scope of the claims.