Dome shaped microphone array with circularly distributed microphones
09961437 ยท 2018-05-01
Assignee
Inventors
Cpc classification
International classification
H04R1/36
ELECTRICITY
H04R1/04
ELECTRICITY
Abstract
A sound sensing device may include a housing having a dome-shaped shell and a baseplate. The housing may have a symmetrically shaped perimeter. The dome-shaped shell may have an arcuate cross section characterized by tangent lines that monotonically increase in slope from a top of the dome-shaped shell toward a bottom of the dome-shaped shell. At least three sound ports may be provided between an interior volume of the housing and an exterior surrounding of the housing, and disposed symmetrically and peripherally about the perimeter of the housing. The sound ports may be located at most 1 cm above the support surface.
Claims
1. A sound sensing device comprising: a housing having a dome-shaped shell and a baseplate to house electronic components of the sound sensing device, the housing having a rotationally symmetric perimeter, the dome-shaped shell having an arcuate cross section characterized by tangent lines that monotonically increase in slope from a top of the dome-shaped shell toward a bottom of the dome-shaped shell; at least six sound ports formed between an interior volume of the housing and an exterior surrounding of the housing, and disposed symmetrically and peripherally about the perimeter of the housing, the sound ports located at most 1 cm above a support surface; and at least six microphones contained within the housing, the microphones disposed symmetrically and peripherally about the perimeter of the housing, each microphone arranged proximate a respective sound port to receive an acoustic signal via the respective sound port, wherein microphone signals from the microphones are combined together to produce an output signal representative of a sound source for any direction of the sound source, wherein each of the microphone signals is converted into overlapping subband signals, wherein a plurality of subband beams are generated using subband signals from each of the microphone signals, the plurality of subband beams including: a plurality of first subband beams, each of which is hybrid of a delay and sum beam and a differential beam that is generated using subband signals, from the microphone signals of all the microphones, that are in a first frequency range; a second subband beam that is a delay and sum beam comprising non-zero positive coefficients generated using subband signals, from the microphone signals of one or more of the microphones, that are in a second frequency range, and a third subband beam comprising a subband signal from the microphone signal of a primary direction microphone whose location on the housing is representative of a direction of the sound source, wherein the dome shape creates a higher pressure for sound, arriving at the primary direction microphone, that is substantially orthogonal to the support surface at an edge of the dome-shaped shell, wherein the plurality of subband beams are combined to produce the output signal.
2. The device of claim 1, further comprising an annular pad disposed peripherally about a perimeter of a bottom surface of the baseplate to support the housing on the support surface.
3. The device of claim 1, wherein a height of the dome-shaped shell is less than or equal to a distance from a center of the dome-shaped shell to a periphery thereof.
4. The device of claim 1, further comprising surface features on the dome-shaped shell that have dimensions that are less than a quarter wavelength of a predetermined smallest wavelength to be captured by the microphones.
5. The device of claim 4, wherein the predetermined smallest wavelength corresponds to 7 kHz (49 millimeters wavelength).
6. The device of claim 4, wherein the surface features include one or more of surface adornments formed on a surface of the dome-shaped shell and buttons to operate the sound sensing device.
7. The device of claim 1, wherein the perimeter of the housing has a shape of a circle or a regular polygon.
8. The device of claim 1, wherein the sound ports are formed through the dome-shape shell and open through an upper surface of the dome-shaped shell.
9. The device of claim 1, wherein the dome-shaped shell comprises a plurality of posts that extend toward and contact a printed circuit board (PCB), each post having a hollow interior, a first end aligned with one of the microphones, and a second end that terminates at one of the sound ports.
10. The device of claim 9, wherein the plurality of microphones are disposed on the PCB.
11. The device of claim 1, wherein the sound ports are formed through the baseplate and open through a bottom surface of the baseplate.
12. The device of claim 11, where the sound ports have an elastomeric pad to impede sound coming from under the device from entering into the microphone port.
13. The device of claim 1, further comprising a speaker that is centrally disposed at the top of the dome-shaped shell.
14. The device of claim 13, wherein the microphones are equidistant from the speaker and symmetrically disposed about the speaker.
15. The device of claim 1, wherein a height of the device is at least 20 millimeters.
16. The device of claim 1, wherein the arcuate cross section is defined as a circular segment having a height of 3.3 centimeters, a termination of the arcuate cross section is no higher than 1 centimeter above a bottom of the device, and deviations of the arcuate cross section do not exceed 1.225 centimeters.
17. The device of claim 1, wherein a diameter of the device is greater than 10 centimeters.
18. The device of claim 1, wherein a diameter of the device is no more than 20 centimeters.
19. The device of claim 1, wherein a diameter of the device is greater than 12 centimeters and less than 15.2 centimeters.
20. A sound sensing device comprising: a housing having a dome-shaped shell and a baseplate to house electronic components of the sound sensing device, the housing having a perimeter in a shape of a circle or a regular polygon with at least six sides, the dome-shaped shell having a cross section characterized by a substantially constant change in slope, the housing having a diameter of about 15 centimeters and a height of about 3.3 centimeters; a loudspeaker centrally disposed at a top of the dome-shaped shell; at least six sound ports formed between an interior volume of the housing and an exterior surrounding of the housing, and disposed symmetrically and peripherally about the perimeter of the housing, the sound ports located at most 1 centimeter above a support surface as measured from respective centers thereof; and at least six microphones contained within the housing, the microphones disposed symmetrically and peripherally about the perimeter of the housing, each microphone connected to a respective sound port, wherein microphone signals from the microphones are combined together to produce an output signal representative of a sound source for any direction of the sound source, wherein each of the microphone signals is converted into overlapping subband signals, wherein a plurality of subband beams are generated using subband signals from each of the microphone signals, the plurality of subband beams including: a plurality of first subband beams, each of which is hybrid of a delay and sum beam and a differential beam that is generated using subband signals, from the microphone signals of all the microphones, that are in a first frequency range; a second subband beam that is a delay and sum beam comprising non-zero positive coefficients generated using subband signals, from the microphone signals of one or more of the microphones, that are in a second frequency range, and a third subband beam comprising a subband signal from the microphone signal of a primary direction microphone whose location on the housing is representative of a direction of the sound source, wherein the dome shape creates a higher pressure for sound, arriving at the primary direction microphone, that is substantially orthogonal to the support surface at an edge of the dome-shaped shell, wherein the plurality of subband beams are combined to produce the output signal.
21. The device of claim 20, further comprising an annular pad disposed peripherally about a perimeter of a bottom surface of the baseplate to support the housing on the support surface.
22. The device of claim 20, wherein a height of the dome-shaped shell is less than or equal to a distance from a center of the dome-shaped shell to a periphery thereof.
23. The device of claim 20, further comprising surface features on the dome-shaped shell that have dimensions less than a quarter wavelength of a predetermined lowest wavelength to be captured by the microphones.
24. The device of claim 1, wherein the microphone signals of all the microphones are used to identify the primary direction microphone.
25. The device of claim 1, wherein the plurality of subband beams further includes one or more additional second subband beams.
26. The device of claim 25, wherein the plurality of subband beams further includes one or more additional third subband beams.
27. The device of claim 1, further comprising additional subband beams, each additional subband beam being a delay and sum beam comprising non-zero positive coefficients generated using subband signals from the microphone signals of one or more of the microphones.
28. The device of claim 1, wherein the first frequency range is between 0 Hz to 1000 Hz, wherein the second frequency range is between 1000 Hz and 4000 Hz.
29. The device of claim 28, wherein the subband signal that comprises the third subband beam comprise a third frequency range between 4000 Hz and 8000 Hz.
30. The device of claim 29, wherein the first frequency range overlaps the second frequency range in a range between 1000 Hz and 2000 Hz, wherein the second frequency range overlaps the third frequency range in a range between 2000 Hz and 4000 Hz.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) With respect to the discussion to follow and in particular to the drawings, it is stressed that the particulars shown represent examples for purposes of illustrative discussion, and are presented in the cause of providing a description of principles and conceptual aspects of the present disclosure. In this regard, no attempt is made to show implementation details beyond what is needed for a fundamental understanding of the present disclosure. In the discussion to follow, in conjunction with the drawings, make apparent to those of skill in the art how embodiments in accordance with the present disclosure may be practiced. In the accompanying drawings:
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17)
(18)
(19)
(20)
(21)
(22)
(23)
(24)
(25)
(26)
(27)
(28)
(29)
(30)
(31)
(32)
(33)
(34)
(35)
(36)
(37)
DETAILED DESCRIPTION
(38) In the following description, for purposes of explanation, numerous examples and specific details are set forth in order to provide a thorough understanding of the present disclosure. It will be evident, however, to one skilled in the art that the present disclosure as defined by the claims may include some or all of the features in these examples alone or in combination with other features described below, and may further include modifications and equivalents of the features and concepts described herein.
(39) The present disclosure teaches an approach to achieve reasonable width beams and low side lobes using a reasonable number of microphones by utilizing the shape of the device. The present disclosure teaches a method to reduce unwanted sound to achieve high quality speech transmission using microphones mounted strategically and supported by a spatial filtering algorithm to produce a signal that is an accurate representation of the desired talker. This signal can either be transmitted to the network for a far-end human listener, or the signal can be provided to a speech recognition program to convert the speech signal into words.
(40) Embodiments in accordance with the present disclosure generally provide a housing having a dome-shaped shell. The housing contains a microphone array that is configured to capture sound, and utilizes the shape of the housing to enhance the sound. As used herein, the word dome is used to describe a shape having a cross section that is approximately a circular segment. Merely as an illustration,
(41) Embodiments in accordance with the present disclosure can utilize the shape of the housing to enhance the directivity and minimize the number of microphones needed, while providing well-behaved beams that have reasonable width main lobes while minimizing side lobes. Instead of ignoring the shape of the housing, embodiments in accordance with the present disclosure utilize the shape of the housing to help improve the performance.
(42)
(43)
(44)
(45)
(46)
(47) The relative arrangement between the loudspeaker 122 and microphone ports 112 can be advantageous because direct sound from the loudspeaker 122 can couple into (picked up by) each of the microphones (not shown) equally. This arrangement makes it easier to detect the direction of sound from a talker even when the loudspeaker is playing sound.
(48) The device 100 should be able to detect and located sound from any direction simultaneously as sound is being played out the loudspeaker 122. Direction finding, however, can be hampered by echoes. Modern speakerphones use echo cancellation to remove most of the echo, but some echo usually remains; this is called residual echo. Residual echo can corrupt or otherwise make difficult the process of direction-finding in a configuration where the loudspeaker is closer to some microphones than to other microphones. For example, residual echo can give a false indication that the talker's voice is coming from the direction of the nearest microphone. Since the loudspeaker in accordance with the present disclosure is centered and the microphones are equal distance from the loudspeaker then the residual echo is usually about equal in each microphone, and so the misleading effect of residual echo can be reduced.
(49)
(50) The front view, of
(51) In the case of the embodiment with the microphones ported to the top (e.g.,
Gain=20log 10(|(1+e.sup.j2fTd)|),
where T.sub.d is the difference in time of arrival between the direct sound and the reflection off of the table. For sound, directly overhead, at 90, T.sub.d=0.0456 milliseconds. For sound arriving from 45 above the table T.sub.d=0.0456*sin(45)=0.0322 milliseconds. As the angle of arrival becomes more horizontal and less vertical, the time difference between the direction sound and the reflection becomes less and less. Therefore the attenuation of sound due to the reflection is less of a problem for a low angle.
(52) At various frequencies f, the gain for a sound source coming from straight overhead) (90) as compared to an angle relative to the table (e.g., 45) is shown in the table below. The configuration assumes the microphone port is located at a height 10 millimeters above the surface of a table (e.g., table top 10,
(53) TABLE-US-00001 f = Frequency (kHz) Gain (dB) at 90 Gain (dB) at 45 1 5.87 5.94 2 5.42 5.73 4 3.46 4.81 7 4.79 1.85
(54) As can be seen from the table above, at this height the frequency response is effective up to 4 kHz for all angles. It will be rare that a talker will be speaking from directly overhead of the device, so if they are speaking so that the talker's mouth is at 45, there is only about 1 dB of attenuation at 4 kHz. This system would function well for a frequency range commonly called narrowband or telephone speech. As of this writing, this is the most common range for telephony and mobile speech.
(55)
(56) Referring to
(57) Now consider a microphone porting where the microphone port 112 is at a lower height in a similar situation, such as shown in
(58) TABLE-US-00002 f = Frequency (kHz) Gain (dB) at 90 Gain (dB) at 45 1 5.94 5.98 2 5.69 5.86 4 4.64 5.35 7 1.15 3.85
At this height, the response is reasonably acceptable at 45 for a frequency up to 7 kHz.
(59) Now consider a microphone porting where the microphone port 112 is at a lower height in a similar situation as shown in
(60) TABLE-US-00003 f = Frequency (kHz) Gain (dB) at 90 Gain (dB) at 45 1 5.98 6.00 2 5.87 5.95 4 5.43 5.73 7 4.1 5.10
As the height of 5.0 millimeters, the frequency response is acceptable up to 7 kHz. At 7 kHz, the frequency response is useful for a range of frequencies commonly called wideband. As of this writing, the band is common for video conferencing on services such as Skype messaging and is the common frequency range for performing speech recognition.
(61) The fact that the sound pressure arrives at the microphone 122 both from direct sound and the reflection actually doubles the sound pressure. Consequently, except for interference, the sound pressure doubles. As shown above a doubling in pressure results in an increase of sound pressure of 6.02 dB. It should be noted that these numbers are approximate and the actual response is complicated by the shape of the plastic enclosure near the microphone ports and the table.
(62) In general, the sound pressure at any point on a table 10 is double what it would be if there was not a table as long as the microphone port 112 is very close to the table 10. Consider a hypothetical case where a microphone 122 is mounted very close to a corner as shown in
(63) Now consider the side view.
(64) A frequency response for sound coming from a typical talking position (e.g., 50 centimeters away and 30 centimeters above the table) is shown in
(65) The next three figures show polar responses at different frequencies of a single microphone for a dome structure that is 15 centimeters inches in diameter at the base, 3.4 centimeters in height and a circular segment cross section.
(66) It is common in acoustic literature to talk about a ratio called the Directivity Index (DI). The Directivity Index is the ratio of the total noise power in an isotropic noise filled environment, incident on an array, compared to the power actually received by the system, the omni directional noise. DI=10log.sub.10(N_omni/N_processed). When the array is useful the directivity index is positive. The Directivity Index is a method of summarizing the effectiveness of an array. A weakness of using the Directivity Index is that it is possible to design an array with an amazing Directivity Index, but with an unusable beam that is so narrow that it is too fragile for reliable pickup and there may be many and extreme side lobes pointing in the wrong directions. However, for a simple system, i.e., a single microphone the Directivity Index is a very good measure for the effectiveness of the directionality.
(67) Using a device (e.g., 100) configured in accordance with the present disclosure and defining the look direction as horizontal toward a single microphone, the device can achieve directivity that varies with frequency as shown in
(68) The change in frequency response due to the shape can be compensated with an equalizing filter to flatten the response. The response will be slightly different depending on the angle of arrival. In general, a single equalizing filter will be sufficient to compensate for typical angle of arrival. However, it is also possible to determine the angle of arrival by using the microphone array to determine the angle of arrival. Using this information, it would be possible to select an appropriate equalizing filter.
(69) Referring to
(70) For the device presented here, the directionality for a sound that is propagating horizontally maintains 4 dB to 5 dB of directionality down to 2000 Hz, with about 5 dB of directionality at 4000 Hz. In comparison,
(71) Referring to the earlier two-microphone analysis, it was noted that the usefulness of delay and sum beams depended on whether the arrival of sound was broadside or end-fire.
f.sub.b=1.5c/(2d)
f.sub.e=1.5c/(4d)
where c is the speed of sound and d is the distance between microphones, and 1.5 is a factor for estimating the highest frequency where the beam is effective. In the case of the embodiment of the six microphone array, the distance between microphones is 69.85 millimeters. While a six microphone array has a much more complicated polar response, the formulas for the effective frequency ranges for broadside and end-fire arrays are a useful approximation for estimating the effectiveness of delay and sum beams. In this case the frequencies are:
f.sub.b=3.69 kHz
f.sub.e=1.85 kHz
(72) To use the same number of microphones without the benefit of the domed housing would require bringing them much closer together. Bringing them closer together will cause problems with making effective beams at low frequencies. Low frequency beams need to have larger distance between microphones for either differential beams or delay and sum (additive) beams. Therefore, the use of the dome-shaped device with microphones ports positioned on the perimeter and very close to the surface of the table allow for good performance across a wider range of frequencies. Another way to view this benefit, is that it takes fewer microphones to make an effective array. Without the dome-shaped housing the microphones would have to be spaced about one half the distance of the embodiment described above. For similar performance, there would have to be almost twice as many microphones.
(73) Dome Shape. Other formulas for the dome shape could work other than a circular segment. However, the cross sectional shape should not be a shape with abrupt transitions. As an extreme example, the shape of a housing 2502 of a device 2500 could be a short cylinder as shown in
(74) In regard to height of the dome, the height could be as high as the radius of the circle, but would generally be about on half of the radius. The dome shape provides several advantages: (1) at low frequencies, it is not an obstacle to sound so classic beamforming can be used at lower frequencies; (2) a way to pack electrical circuits and loudspeaker inside the housing; and (3) it creates directionality for high frequency sound for microphone pickup, and 3) it creates a baffle for the centrally located loudspeaker. For the loudspeaker, there is an advantage in keeping the dome as flat as possible near the loudspeaker. Practitioners of loudspeaker design call the array around the loudspeaker a baffle. In principle, the ideal baffle is a baffle that is flat and continues forever. In practice, the dome shape approximates an infinite baffle.
(75) For the low frequency beams of the microphone array, it is better for the dome to be lower to create as little distortion as possible to the sound and to allow for utilizing all the microphones for delay and sum beams. For directivity for a single microphone, the dome must have some steepness near the edge of the perimeter of the device in order to create more sound pressure in the direction of sound. The lower the steep area the less directivity will be achieved.
(76) The present disclosure teaches a height of approximately one half of the radius of the circle. The cross section of the shape of the dome can be identically the same shape as a circular segment. The shape of the dome can be interrupted by adornments, cosmetic features, buttons, or visual features. In some embodiments, the physical features may be small compared to about of the shortest wavelength of interest for sound that is captured by the microphones. In some embodiments, the highest frequency may be 7 kHz, where the quarter wavelength is 12.25 millimeters. So any features that have dimensions significantly less than one half of the quarter wavelength, or 6.125 millimeters, will not have a significant effect on the performance of the device.
(77) A dome shape may have the same cross section as the shape of a circle, or it can have a different cross sectional profile. Like a circle, the formula for the cross section should have a shape that progressively has increasing slope as the measurement of the slope moves from the center to the perimeter. This restriction ensures that the shape is progressive and will minimize diffraction patterns. See
(78) Spatial Filter Implementation. The following section describes an implementation of spatial filters, in accordance with the present disclosure, that encompass the response across the whole frequency spectrum. The spatial filters are the same thing as beamformers. In this description, the beams are discrete beams. This disclosure teaches that the beams are discrete beams. There may be a large collection of beams, but it is not continuously variable. Beams can be implemented in a number of different ways including frequency domain implementation. For simplicity, the description here is in the time domain.
(79) Referring again to
(80) In some embodiments, the strategy that is taught is that the upper subband #4 will correspond to a single microphone. The next lower subband #3 may correspond to of one of the following: 1) a delay and sum array with coefficients that are larger on the side where sound is detected, or 2) a hybrid of delay and sum and differential techniques. For the purpose of illustration, we only consider the first case, the delay and sum array with weighting. If there are six microphones, then the delay and sum equation is:
y(k)=a0x0(tT0)+a1x1(tT1)+a2x2(tT2)+a3x3(tT3)+a4x4(tT4)+a5x5(tT5)
(81) For a delay and sum array there are coefficients a0 through a5. These are weights that are all greater than 0.
(82) There will be a multitude of beams, with at least one beam for each possible direction. There can be several beams aiming in the same direction with different shaped lobes. Using several beams in the same direction would be useful to approximate null-steering to eliminate noise coming from a particular direction.
(83) For subband #3, for example, the coefficients a0 through a5 may be weighted so that the coefficients associated with the direction of the sound are weighted with a larger coefficient than the microphones that are facing away from the sound. As an example, if the sound is coming from a direction between microphones x2 and x3 (e.g., as determined by a direction-finding algorithm), then the coefficients a2 and a3 will have the largest coefficients, while weights a1 and a4 will be less, and weights a0 and a5 will be even less or possibly zero. Merely to illustrate this point by example, for the case where the sound is coming from between x2 and x3, then the coefficients may be: a0=0.0, a1=0.216, a2=0.284, a3=0.284, a4=0.216, a5=0.0.
(84) Subband #3 covers the range between 1 kHz to 4 kHz. It crosses over with subband #2 at 1.5 kHz and crosses over with subband #4 at 3 kHz. The subbands are constructed with linear phase filters with the same group delay so that the filters merely need to be added together to form the output signal.
(85)
(86) At block 2212, the data processing components may identify from among M microphones (e.g., 112), a primary direction microphone (primary mic). The direction of the primary mic represents the direction of the sound source. In some embodiments, the primary mic may be determined by comparing the signal strength of the signals from each microphone; the microphone that outputs the greatest signal strength would be deemed the primary mic. In other embodiments, the primary mic may be determined based on signals from two more or microphones.
(87) At block 2214, the data processing components may divide each signal received from each of the M microphones into N subbands; e.g., the signal from microphone #1 may be divided into N subbands, the signal from microphone #2 may be divided into N subbands, and so on. In some embodiments, the subbands may overlap.
(88) At block 2216, the data processing components may produce a first of N intermediate signals (beams) to form a resulting beam (beamforming) that represents the sound source. In some embodiments, the data processing components may simply take the N.sup.th subband of the signal from the primary mic to generate the first intermediate signal (beam). For frequencies in the highest (N.sup.th) subband, the primary mic may utilize the dome shape to create higher pressure for sound arriving substantially orthogonal to the surface of the edge of the dome and thereby simply use the single primary microphone to make the first intermediate signal (beam).
(89) At block 2218, the data processing components may generate additional intermediate signals for each of the remaining subbands. In some embodiments, each of the remaining subbands from the 2.sup.nd subband to the N.sup.th subband of the primary mic signal and additional microphone signals may be filtered with a respective spatial filter (e.g., a spatial filter designed for that subband) to produce 2.sup.nd to N.sup.th filtered subband signals. After a beam is produced by each spatial filter for each subband (the intermediate signals), then all the N intermediate signals may be summed to produce a single output signal that is composed of different beams utilized in the N subbands.
(90) In some embodiments, the beams may be formed depending the subband. Merely to illustrate this point, referring to the configuration shown in
(91) At block 2220, the data processing components may combine the N intermediate signals (beams) produced at block 2218 to form a resulting beam (output signal). The output signal may then be transmitted to a device to drive a loudspeaker in the device, thus reproducing the sound.
(92)
(93) For subband #2, it is most typical that the beams formed for this subband will be a hybrid of differential beams and delay and sum beams. A typical horizontal polar response plot at 1000 Hz is shown in
(94) Subband #1 covers the range between 0 Hz and 750 Hz. The beams in this subband may be similar to subband #2, but it will be designed to minimize self noise from the microphones. For low frequencies the compensation filter will need to add a lot of gain at the low frequencies. This raises noise that is caused by self-noise, because the noise is uncorrelated between the microphones. At the low end of human speech, about 100 Hz, the compensation filter may need to add 15 dB of gain, so if the microphone self-noise is substantial then the spatial filter (beam) needs to be designed to get the best directivity, but minimize noise.
(95) It would be understood by practitioners in the art of beam design that there are a multitude of ways to design the beams. For a device in accordance with the present disclosure, the beam design may follow this order in some embodiments: Highest banda single microphone (primary direction microphone) Next lower banda delay and sum beam Lowest bandshybrid differential beams
(96) An example of a hybrid differential beam would be to take pairs of microphones and take the difference between each microphone element within each pair. Within each pair one microphone is the front microphone and one microphone is the rear microphone. The front microphones would each be delayed the right amount so that the resulting signal from the front microphones is aligned. Then for each pair, the rear microphone is delayed relative to the front microphone so that the difference in delay would be between 0 delay (for a bidirectional beam) and delay d/c (where d is the distance between pairs of microphones and c is the speed of sound). This example is shown in
y(k)=a0x0(tT0)+a1x1(tT1)+a2x2(tT2)+a3x3(tT3)+a4x4(tT4)+a5x5(tT5)
(97) To make each pair a hypercardioid beam then the relative delay between the front and rear microphone of each pair would be d/(3c). The delay selection for the front microphones, x0, x1, and x5 is selected so that x0 aligns with x1 and x5 using the delay associated with the propagation delay between the front most microphone and the two microphones that are further away from the source. This distance is called d.sub.f in the
(98) It is implicit that results of each of the 3 pairs of microphones are given equal weight and added together just as if each of the pairs is treated as a delay and sum microphone element. Hence, this is a hybrid of first-order gradient beams and delay and sum beams.
(99) The resulting delays and attentions of the equation, restated:
y(k)=a0x0(tT0)+a1x1(tT1)+a2x2(tT2)+a3x3(tT3)+a4x4(tT4)+a5x5(tT5),
where a0=a1=a5=1.0
(100) a3=0.877
(101) a4=a2=0.935
(102) T0=c/d.sub.f
(103) T1=T5=0.0
(104) T3=T0+c /(3d03)
(105) T2=T4=c/(3d12)
(106) Substituting these values into the above equation will make a hybrid differential beam. This hybrid beam will be slightly narrower from side to side and slightly more directive vertically than if the beam were merely made from a single pair of microphones.
(107) The present disclosure teaches that the number of microphones can be minimized by using a dome shape while allowing the microphones to be spread further apart. The shape of the dome can allow for the use of a single microphone to pick up high frequency sound in the direction that the microphone is facing.
(108) Alternatively, the same concept can be used to achieve even greater performance by using more microphones placed around the perimeter of the device at a low height from the table. If the microphones are placed closer together then it is possible to use more than one microphone. The first choice for the highest subband would be to use 2 or 3 microphones organized as a delay and sum array. This will increase the Directivity Index by approximately 3-4 dB. Another advantage of using more microphones will lower the self-noise problem for the subband #1 where hybrid differential beams are used.
(109) Regardless of the number of microphones the spatial filter strategy remains that the highest frequency subband uses a single microphone or two or three microphones as a delay and sum beam, then recruit more microphones for the next lower subband, and then use hybrid differential beams for the lowest frequencies.