Low noise differential microphone arrays
09749745 · 2017-08-29
Assignee
Inventors
Cpc classification
H04R2430/21
ELECTRICITY
International classification
Abstract
A differential microphone array includes a number (M) of microphone sensors for converting sound to a number of electrical signals, and a processor, operably coupled to the microphone sensors, to specify a target differential order (N) for the differential microphone array, and wherein M>N+1, specify a steering matrix D comprising N+1 steering vectors, calculate a respective one of a plurality of linearly specify a steering matrix D comprising N+1 steering vectors-constrained minimum variance filters based on the steering matrix, apply the respective one of the plurality of linearly-constrained minimum variance filters to a respective one of the electrical signals to calculate a respective frequency response of the electrical signals, wherein the respective frequency response comprises a plurality of components associated with a plurality of subbands, and sum the frequency responses of the electrical signals with respect to each subband to calculate an estimated frequency spectrum of the sound.
Claims
1. A differential microphone array, comprising: a number (M) of microphone sensors to receive sound signals originated from a sound source and convert the sound signals to a number of electrical signals; and a processor, operably coupled to the microphone sensors, to: specify a target differential order (N) for the differential microphone array, wherein M>N+1; construct a steering matrix D comprising N+1 steering vectors; calculate a respective one of a plurality of linearly-constrained minimum variance filters based on the steering matrix; apply the respective one of the plurality of linearly-constrained minimum variance filters to a respective one of the electrical signals to calculate a respective frequency response of the electrical signals, wherein the respective frequency response comprises a plurality of components associated with a plurality of subbands; calculate an estimated frequency spectrum of the sound source by summing the frequency responses of the electrical signals with respect to each one of the plurality of subbands; and reproduce, based on the estimated frequency spectrum, the sound source, wherein the reproduced sound source is an enhanced version of the sound source.
2. The differential microphone array of claim 1, wherein the processor is further to: prior to applying the respective one of the plurality of linearly-constrained minimum variance filters, calculate a short-time Fourier transform of the respective one of the electrical signals; and calculate an inverse short-time Fourier transform of the estimated frequency spectrum of the sound to generate an estimate of a source of the sound.
3. The differential microphone array of claim 1, wherein the differential microphone array is one of a uniform linear microphone array or a non-uniform linear microphone array.
4. The differential microphone array of claim 1, wherein the steering matrix D is a rectangular matrix, and wherein the steering matrix D=[d.sup.H(ω, 1), d.sup.H(ω, α.sub.N,1) , . . . , d .sup.H(ω, α.sub.N,N)].sup.T, wherein N+1 steering vectors d(ω, α.sub.N,n)=[1, e.sup.−jωτ.sup.
5. The differential microphone array of claim 4, wherein the plurality of linearly-constrained minimum variance filters are minimum-norm filters represented by h(ω,α)=D.sup.H (ω, α)[D(ω, α)D.sup.H (ω, α)].sup.−1β, wherein β is a vector specifying a target beam pattern.
6. A method for operating a differential microphone array that comprises a number (M) of microphone sensors to convert sound signals, originated from a sound source and received by the number (M) of microphones, to a number of electrical signals, the method comprising: specifying a target differential order (N) for the differential microphone array, wherein M>N+1; constructing, by a processor, a steering matrix D comprising N+1 steering vectors; calculating a respective one of a plurality of linearly-constrained minimum variance filters based on the steering matrix; applying the respective one of the plurality of linearly-constrained minimum variance filters to a respective one of the electrical signals to calculate a respective frequency response of the electrical signals, wherein the respective frequency response comprises a plurality of components associated with a plurality of subbands; calculating an estimated frequency spectrum of the sound source by summing the frequency responses of the electrical signals with respect to each one of the plurality of subbands; and reproducing, based on the estimated frequency spectrum, the sound source, wherein the reproduced sound source is an enhanced version of the sound source.
7. The method of claim 6, further comprising: prior to applying the respective one of the plurality of linearly-constrained minimum variance filters, calculating a short-time Fourier transform of the respective one of the electrical signals; and calculating an inverse short-time Fourier transform of the estimated frequency spectrum of the sound to generate an estimate of a source of the sound.
8. The method of claim 6, wherein the differential microphone array is one of a uniform linear microphone array or a non-uniform linear microphone array.
9. The method of claim 6, wherein the steering matrix D is a rectangular matrix, and wherein the steering matrix D=[d.sup.H (ω, 1), d.sup.H (ω, α.sub.N,1), . . . , d.sup.H (ω, α.sub.N,N)].sup.T, wherein N+1 steering vectors d(ω, α.sub.N,n)=[1, e.sup.−jωτ.sup.
10. The method of claim 9, wherein the plurality of linearly-constrained minimum variance filters are minimum-norm filters represented by h(ω, α)=D.sup.H (ω, α)[D(ω, α)D.sup.H (ω, α)].sup.−1β, where β is a vector specifying a target beam pattern.
11. A non-transitory machine-readable storage medium having stored thereon instructions that, when executed, cause a processor to operate a differential microphone array that comprises a number (M) of microphone sensors to convert sound signals, originated from a sound source and received by the number (M) of microphones, to a number of electrical signals, the processor to: specify a target differential order (N) for the differential microphone array, wherein M>N+1; construct, by the processor, a steering matrix D comprising N+1 steering vectors; calculate a respective one of a plurality of linearly-constrained minimum variance filters based on the steering matrix; apply the respective one of the plurality of linearly-constrained minimum variance filters to a respective one of the electrical signals to calculate a respective frequency response of the electrical signals, wherein the respective frequency response comprises a plurality of components associated with a plurality of subbands; calculate an estimated frequency spectrum of the sound source by summing the frequency responses of the electrical signals with respect to each one of the plurality of subbands; and reproduce, based on the estimated frequency spectrum, the sound source, wherein the reproduced sound source is an enhanced version of the sound source.
12. The non-transitory machine-readable storage medium of claim 11, wherein the processor is further to: prior to applying the respective one of the plurality of linearly-constrained minimum variance filters, calculate a short-time Fourier transform of the respective one of the electrical signals; and calculate an inverse short-time Fourier transform of the estimated frequency spectrum of the sound to generate an estimate of a source of the sound.
13. The non-transitory machine-readable storage medium of claim 11, wherein the differential microphone array is one of a uniform linear microphone array or a non-uniform linear microphone array.
14. The non-transitory machine-readable storage medium of claim 11, wherein the steering matrix D is a rectangular matrix, and wherein the steering matrix D=[d.sup.H (ω, 1), d.sup.H (ω, α.sub.N,1), . . . , d.sup.H(ω, α.sub.N,N)].sup.T, wherein N+1 steering vectors d(ω, α.sub.N,n)=[1, e.sup.−jωτ.sup.
15. The non-transitory machine-readable storage medium of claim 14, wherein the plurality of linearly-constrained minimum variance filters are minimum-norm filters represented by h(ω, α)=D.sup.H (ω, α)[D(ω, α)D.sup.H (ω, α)].sup.−1β, where β is a vector specifying a target beam pattern.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
DETAILED DESCRIPTION
(12) There exists a need for differential microphone arrays that are easy to design and can reduce and/or eliminate amplification of sensor noise.
(13) Embodiments of the present invention include a differential microphone array (DMA) that include a number (M) of microphone sensors for converting a sound to a number of electrical signals and a processor that is configured to apply linearly-constrained minimum variance filters on the electrical signals over a time window to calculate frequency responses of the electrical signals over a plurality of subbands and sum the frequency responses of the electrical signals for each subband to calculate an estimated frequency spectrum of the sound.
(14) In embodiments of the present invention, the number of microphone sensors is larger than the order of the DMA plus one, and the linearly-constrained minimum variance filters are minimum-norm filters. In other embodiments of the present invention, the number of microphone sensors is equal to the order of the DMA plus one.
(15) Embodiments of the present invention include a method for operating a differential microphone array that includes a number (M) of microphone sensors for converting sound to electrical signals. The method includes applying linearly-constrained minimum variance filters on the electrical signals over a time window to calculate frequency responses of the electrical signals over a plurality of subbands and summing the frequency responses of the electrical signals for each subband to calculate an estimated frequency spectrum of the sound.
(16) Embodiments of the present invention include a method for designing reconstruction filters for a differential microphone array including a number (M) of microphone sensors. The method includes specifying a target differential order (N) for the differential microphone array, specifying N+1 steering vectors d(ω, α.sub.N,n)=[1, e.sup.−jωτ.sup.
(17) Embodiments of the present invention include a differential microphone array including a plurality of microphone sensors for receiving a speech signal and whose outputs are divided into frames. In an embodiment, the frames of the outputs are transformed into a frequency response by a frequency transform. In an embodiment, the frames are transformed using short-time Fourier transform (STFT). Other types of frequency transform that may be used to generate a frequency response include discrete cosine transform (DCT) and wavelet based transforms. The frequency responses can be divided into a plurality of subbands. In each subband, a differential beamformer is designed and applied to the frequency response coefficients to produce an estimate of clean signal in the subband. Finally, the clean speech signal is reconstructed by summing the inverse frequency transform of the frequency responses.
(18)
Y.sub.m(ω)=e.sup.−j(m−1)ωτ0αX(ω)+V.sub.m(ω) (1)
where X(ω) and V.sub.m(ω) are, respectively, the STFT of the source signal x(k) and the noise component v.sub.m(k), j=√{square root over (−1)} (or the imaginary unit), ω=2πf is the angular frequency, τ.sub.0=δ/c (c is the sound speed) is the delay between two successive microphone sensors at angle θ=0°, and α=cos(θ). Embodiments of the present invention may be similarly applicable to non-uniform array. For a non-uniform array of microphone sensors, for example, Equation (1) can be written as Y(ω)=e.sup.−jωτ.sup.
y(ω)=d(ω,α)X(ω)+v(ω) (2)
where v(ω)=[V.sub.1(ω), V.sub.2(ω), . . . , V.sub.M(ω)].sup.T, and d(ω, α)=[1, e.sup.−jωτ0α, . . . , e.sup.−j(M−1)ωτ0α].sup.T is the steering vector (of length M) at the frequency ω, and the superscript T denotes a transpose operator.
(19) Embodiments of the present invention include the design of DMAs as beamformers that recover the spectrum of the desired signal X(ω) based on the observed y(ω). As shown in
(20) Referring to
(21)
where h(ω)=[H.sub.1(ω), H.sub.2(ω), . . . , H.sub.M(ω)].sup.T. As shown in
(22) The design of the DMA is then to determine the weight vector h(ω) so that Z(ω) is an optimal estimate of X(ω). As indicated by Equation (2), y(ω) includes noise component v(ω) which may include both environmental noise and sensor noise. The weight vector h(ω) may be determined by adaptive beamforming to minimize the noise component. In adaptive beamforming, the noise component may be minimized for certain beam patterns, or
(23)
where the superscript H denotes a transpose complex conjugation. A linearly constrained minimum variance (LCMV) filter solution for Equation (4) is:
h.sub.LCMV(ω)=Φ.sub.V.sup.−1(ω)D.sup.H(ω,α)[D(ω,α)Φ.sub.V.sup.−1(ω)D.sup.H(ω,α)].sup.−1β, (5)
in which α and β include vectors through which the certain beam patterns may be defined, and Φ(
(24) In an embodiment, M=N+1. Thus, D is a fully ranked square matrix, and
h.sub.LCMV(ω)=D.sup.−1(ω,α)β, (6)
which corresponds exactly to the filter of an Nth-order DMA. However, because of h.sub.LCMV(ω) is designed in the frequency domain and is derived directly from the steering vectors d and the beam pattern β, h.sub.LCMV(ω) is designed in the frequency domain. In this way, embodiments of the present invention do not need to calculate the equalization filters which are hard to design, and therefore, embodiments of the present invention have the advantage of easier calculation.
(25) Current art requires that M=N+1 so that steering matrix D is always a square matrix that can be inversed. If M>N+1, the steering matrix D is not a square matrix. In an embodiment of the present invention, when M>N+1, the filter is designed to be a minimum-norm filter, or
h(ω,α,β)=D.sup.H(ω,α)[D(ω,α)D.sup.H(ω,α)].sup.−1β, (7)
where the selection of vectors α and β of length N+1 may determine the response and the order of the DMA. Since M may be much larger than N+1, the DMA designed according to the minimum-norm filter h(ω,α,β) is much more robust against the noise, especially against the sensor noise. This is because, for example, the minimum-norm filter h(ω,α,β) is also be derived from maximizing the white noise gain subject to the Nth order DMA fundamental constraints. Therefore, for a large number of microphone sensors, the white noise gain may approach M. If the value of M is much larger than N+1, the order of the DMA may not be equal to N anymore. However, since the Nth order DMA fundamental constraints is fulfilled, the resulting shape of the directional pattern may be slightly different than the one obtain when M=N+1. In this way, the DMA designed according to the minimum-norm filter h(ω,α,β) may effectively achieve an effective trade-off between good noise suppression and beam forming.
(26) The beam pattern derived using the minimum-norm filter is
B[h(ω,α,β),θ]=d.sup.H(ω, cos θ)D.sup.H(ω,α)[D.sup.H(ω,α)D.sup.H(ω,α)].sup.−1β. (8)
(27) The white noise gain, directivity factor, and the gain for a point noise source for the minimum-norm filters are, respectively,
(28)
where θ.sub.n is the incident angle for a point noise source.
(29) As discussed above, the trade-off is between G.sub.dn[h(ω,α,β)]=G.sub.N and G.sub.Wn[h(ω,α,β)]≧1, where G.sub.N is the directivity factor of the frequency-independent N-th order DMA.
(30) Thus, embodiments of the present invention include a process for calculating a set of filters that can be used to reconstruct the sound signals. For example, the reconstruction filters specify coefficients at a number of subbands.
(31)
d(ω,α.sub.N,n)=[1,e.sup.−jωτ.sup.
where n=1, 2, . . . , N. At 306, the steering matrix D may be constructed from the steering vectors
(32)
which is a M×(N+1) matrix. Thus, if M=N+1, D is a square matrix. However, if M>N+1, D is a rectangular matrix. At 308, a set of linearly-constrained minimum variance filters may be calculated. If the number of microphone sensors M=N+1 (N is the order of the DMA), D is a square matrix and
h.sub.LCMV(ω)=D.sup.−1(ω,α)β.
(33) However, if M>N+1, h(ω, α, β)=D.sup.H(ω, α)[D(ω, α)D.sup.H(ω, α)].sup.−1β, which is a minimum-norm filter which suppresses noise amplification.
(34) For example, the calculated linear-constrained minimum variance filters or the minimum-norm filter is used to reconstruct the sound source.
(35) Embodiments of the present invention can be used to construct DMAs of different orders, including first-order cardioid (in which α=[1, −1].sup.T, β=[1, 0].sup.T), second-order cardioid (α=[1, −1, 0].sup.T, β=[1, 0, 0].sup.T), and third-order cardioid (α=[1, −1, 0, √{square root over (2)}/2].sup.T, β=[1, 0, 0, −√{square root over (2)}/8+1/4].sup.T). The number of microphone sensors used for the construction can equal to the order plus one or be larger than the order plus one. Experimental results have demonstrated that DMAs designed using the minimum-norm filters exhibit superior robustness against noise.
(36) Embodiments of the present invention can use different numbers of microphone sensors to construct a first-order cardioid DMA, in which α=[1, −1].sup.T (namely, the two nulls are placed at 0° and 180°), and β=[1, 0].sup.T (the strength at 0° and 180° are set 1 and 0, respectively).
(37)
(38) Embodiments of the present invention can use different numbers of microphone sensors to construct second-order cardioid DMAs, in which α=[1, −1, 0].sup.T, β=[1, 0, 0].sup.T.
(39) Embodiments of the present invention use different numbers of microphone sensors to construct a third-order cardioid, in which α=[1, −1, 0, −√{square root over (2)}/2].sup.T, β=[1, 0, 0, −√{square root over (2)}/8+1/4].sup.T.
(40) Embodiments of the present invention provide a low noise differential microphone array that is an improvement above known DMAs. Embodiments of the present invention provide a differential microphone array, including a number (M) of microphone sensors for converting a sound to a number of electrical signals; and a processor which is configured to: apply linearly-constrained minimum variance filters on the electrical signals over a time window to calculate frequency responses of the electrical signals over a plurality of subbands; and sum the frequency responses of the electrical signals for each subband to calculate an estimated frequency spectrum of the sound. In embodiments, the processor is configured to, prior to applying the linearly-constrained minimum variance filters, calculate a short-time Fourier transform of the electrical signals; and calculate an inverse short-time Fourier transform of the estimated frequency spectrum of the electrical signals. In embodiments, the differential microphone array is one of a uniform linear microphone array and a non-uniform linear microphone array. In embodiments, a differential order of the differential microphone array is N, and the linearly-constrained minimum variance filters are determined by a beam pattern of the differential microphone array. In embodiments, the linearly-constrained minimum variance filter is calculated as a function of a steering matrix D, and the steering matrix D includes N+1 steering vectors d(ω, α.sub.N,n)=[1,e.sup.−jωτ.sup.
(41) Embodiments of the present invention provide a method and system for operating a differential microphone array that includes a number (M) of microphone sensors for converting sound to electrical signals, including: applying, by a processor, linearly-constrained minimum variance filters on the electrical signals over a time window to calculate frequency responses of the electrical signals over a plurality of subbands; and summing, by the processor, the frequency responses of the electrical signals for each subband to calculate an estimated frequency spectrum of the sound. In embodiments, prior to applying the linearly-constrained minimum variance filters, calculating a short-time Fourier transform of the electrical signals; and calculating an inverse short-time Fourier transform of the estimated frequency spectrum of the electrical signals. In embodiments of the system and method, the differential microphone array is one of a uniform linear microphone array and a non-uniform linear array. In embodiments of the system and method, a differential order of the differential microphone array is N, and the linearly-constrained minimum variance filters are determined by a beam pattern of the differential microphone array. In embodiments of the system and method, the linearly-constrained minimum variance filter is calculated as a function of a steering matrix D, and the steering matrix includes N+1 steering vectors d(ω, α.sub.N,n)=[1,e.sup.−jωτ.sup.
(42) Embodiments of the present invention provide a method and system for designing reconstruction filters for a differential microphone array including a number (M) of microphone sensors, including: specifying, by a processor, a target differential order (N) for the differential microphone array; specifying, by the processor, N+1 steering vectors d(ω, α.sub.N,n)=[1,e.sup.−jωτ.sup.
(43) It will be appreciated that the disclosed methods, systems, and procedures described herein can be implemented using one or more processors executing instructions from one or more computer programs or components. These components may be provided as a series of computer instructions on a computer-readable medium, including, for example, RAM, ROM, flash memory, magnetic, and/or optical disks, optical memory, and/or other storage media. The instructions may be configured to be executed by one or more processors which, when executing the series of computer instructions, performs or facilitates the performance of all or part of the disclosed methods, and procedures.
(44) Although the present disclosure has been described with reference to particular examples and embodiments, it is understood that the present disclosure is not limited to those examples and embodiments. Further, those embodiments may be used in various combinations with and without each other. The present disclosure as claimed therefore includes variations from the specific examples and embodiments described herein, as will be apparent to one of skill in the art.