DESKTOP SPEAKERPHONE
20170187854 ยท 2017-06-29
Inventors
Cpc classification
H04M1/035
ELECTRICITY
H04M3/568
ELECTRICITY
H04M3/002
ELECTRICITY
International classification
H04M1/03
ELECTRICITY
H04M3/56
ELECTRICITY
Abstract
The present invention relates to a desktop speakerphone constructed to make the desktop speakerphone less space-consuming while having improved audio qualities. The desktop speakerphone preferably has two microphone clusters 6, 7 mounted at the upper side of the housing 2 closer towards respective longitudinal ends 8 of the latter, so that each microphone cluster 6, 7 can receive voice sound A.sub.v from one or more of the users. Each microphone cluster 6, 7 preferably comprises three pressure microphones 10, 11, 12. Furthermore, within each microphone cluster 6, 7, the second and third sound inlets 14, 15 are preferably arranged symmetrically on opposite sides of the respective median plane 18. Within each microphone cluster 6, 7, the relative arrangement of the three sound inlets 13, 14, 15 defines a respective microphone axis 9, 19 for each of the microphone pairs 10, 11, 10, 12. The microphone axis 9 of the first microphone pair 10, 11 extends through the first and the second sound inlet 13, 14, while the microphone axis 19 of the second microphone pair 10, 12 extends through the first and the third sound inlet 13, 15. The microphones are connected to multiple array processors, each configured to provide an array signal in dependence on two or more of the multiple microphone signals. First and second array processor configured to provide array signals in dependence on microphone signals.
Claims
1-15. (canceled)
16. A desktop speakerphone comprising: a housing; an output path configured to provide a driver signal in dependence on an audio output signal; a sound driver arranged at the housing and configured to emit speaker sound in dependence on the driver signal; one or more microphone clusters each comprising multiple microphones configured to provide respective microphone signals in dependence on received voice sound; and an input path configured to provide an audio input signal in dependence on the multiple microphone signals, wherein the input path comprises a first array processor configured to provide a first array signal in dependence on a first and a second one of the multiple microphone signals, and wherein the input path further is configured to provide the audio input signal in dependence on the first array signal, wherein the first array signal has a first directivity pattern with a main lobe oriented in a first direction and exhibiting reduced sensitivity towards the sound driver, wherein further a first one of the one or more microphone clusters comprises: a first pressure microphone configured to provide the first microphone signal in dependence on voice sound received at a first sound inlet arranged at the housing; and a second pressure microphone configured to provide the second microphone signal in dependence on voice sound received at a second sound inlet arranged at the housing, wherein further the first array processor is configured to provide the first array signal in dependence on the first and the second microphone signal, and wherein the desktop speakerphone further comprises a directivity controller configured to adaptively control the first directivity pattern to reduce acoustic coupling between the sound driver and the first array signal.
17. A desktop speakerphone according to claim 16, wherein the directivity controller further is configured to halt or slow down adaptation of the first directivity pattern in dependence on detecting adverse conditions for the adaptation.
18. A desktop speakerphone according to claim 16, wherein the directivity controller further is configured to perform a cross-spectral analysis of a signal provided to the sound driver and the first array signal and to control the first directivity pattern in dependence on the output of the cross-spectral analysis.
19. A desktop speakerphone according to claim 16, wherein the directivity controller further is configured to perform a cross-spectral analysis of a signal provided to the sound driver and the first microphone signal and to control the first directivity pattern in dependence on the output of the cross-spectral analysis.
20. A desktop speakerphone according to claim 16, wherein the directivity controller further is configured to perform a cross-spectral analysis of a signal provided to the sound driver and the second microphone signal and to control the first directivity pattern in dependence on the output of the cross-spectral analysis.
21. A desktop speakerphone according to claim 16, wherein the directivity controller further is configured to repeatedly perform a cross-power analysis based on the audio output signal, the first microphone signal and the second microphone signal and to determine the transfer function of a controllable filter in dependence on the result of the cross-power analysis.
22. A desktop speakerphone according to claim 16, wherein the directivity controller further is configured to repeatedly estimate a first average cross-power spectrum based on the audio output signal and the first microphone signal as well as a second average cross-power spectrum based on the audio output signal and the second microphone signal and to determine the transfer function of a controllable filter in dependence on a quotient between the first and the second average cross-power spectra.
23. A desktop speakerphone according to claim 22, wherein the directivity controller further is configured to iteratively determine the transfer function of the controllable filter by repeatedly determining and applying a frequency-dependent adjustment term to the transfer function.
24. A desktop speakerphone according to claim 22, wherein the directivity controller further is configured to apply a spectral-domain low-pass filter function to the determined transfer function to reduce the effect of spurious resonances.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0010] The invention will be explained in more detail below in connection with preferred embodiments and with reference to the drawings in which:
[0011]
[0012]
[0013]
[0014]
[0015]
[0016]
[0017]
[0018]
[0019] The figures are schematic and simplified for clarity, and they just show details essential to understanding the invention, while other details may be left out. Where practical, like reference numerals and/or names are used for identical or corresponding parts.
MODE(S) FOR CARRYING OUT THE INVENTION
[0020] The desktop speakerphone 1 shown in a top view in
[0021] An imaginary center line 16 is defined so that it extends perpendicularly to the support surface 3 through the acoustic center 17 of the sound driver 5. For each microphone cluster 6, 7, an imaginary median plane 18 is defined so that it comprises the center line 16 and further extends through the first sound inlet 13 of the respective microphone cluster 6, 7. In the desktop speakerphone 1 shown in
[0022] All sound inlets 13, 14, 15 are arranged at equal distance from the support surface 3, i.e. in the same horizontal plane when the desktop speakerphone 1 is in its operating position. Furthermore, within each microphone cluster 6, 7, the second and third sound inlets 14, 15 are arranged symmetrically on opposite sides of the respective median plane 18. Within each microphone cluster 6, 7, the first and second microphones 10, 11 constitute a first microphone pair 10, 11, while the first and third microphones 10, 12 constitute a second microphone pair 10, 12.
[0023] Within each microphone cluster 6, 7, the relative arrangement of the three sound inlets 13, 14, 15 defines a respective microphone axis 9, 19 for each of the microphone pairs 10, 11, 10, 12. The microphone axis 9 of the first microphone pair 10, 11 extends through the first and the second sound inlet 13, 14, while the microphone axis 19 of the second microphone pair 10, 12 extends through the first and the third sound inlet 13, 15. The three sound inlets 13, 14, 15 are arranged such that the first and the second microphone axes 9, 19 are perpendicular to each other and so that each of the first and the second microphone axis 9, 19 has an angle of 45 with the median plane 18. The first sound inlet 13 is arranged with a larger distance to the center line 16 than each of the second and third sound inlets 14, 15.
[0024] In the block diagram in
[0025] The desktop speakerphone 1 comprises a transceiver 22 that through a bidirectional connection 23 receives the audio output signal A.sub.o from the audio communication network 20 and/or the gateway device 21, transmits the audio input signal A.sub.i to the audio communication network 20 and/or the gateway device 21 and further handles control functions associated therewith as known from prior art speakerphones. The desktop speakerphone 1 further comprises an output path 24 that provides a driver signal A.sub.d to the sound driver 5 in dependence on the audio output signal A.sub.o that is received through the transceiver 22. The sound driver 5 emits speaker sound A.sub.e to the environment in dependence on the driver signal A.sub.d. The desktop speakerphone 1 further comprises an input path 25 that provides the audio input signal A.sub.i through the transceiver 22 in dependence on microphone signals A.sub.m received from the microphones 10, 11, 12 of the two microphone clusters 6, 7, which provide the microphones signals A.sub.m in response to voice sound A.sub.v received from the environment through the respective sound inlets 13, 14, 15. The input path 25 further receives the audio output signal A.sub.o from the transceiver 22 for use in acoustic feedback reduction and a level-controlled signal A.sub.u from the output path 24 for use in noise reduction as explained further below. The desktop speakerphone 1 further comprises a rechargeable battery or other suitable power supply 26 for supplying electric energy to components of the desktop speakerphone 1, such as e.g. the transceiver 22, the output path 24 and the input path 25. The transceiver 22 may be implemented as a wired or as a wireless transceiver and may further be implemented to connect with the audio communication network 20 and/or the gateway device 21 through an analog connection 23 or preferably a digital connection 23, such as e.g. a Bluetooth connection, an IrDA connection, a DECT connection or a USB connection.
[0026] As shown in
[0027] As shown in
[0028] As shown in
[0029] The first high-frequency array processor 51 provides a first array signal A.sub.a1 in dependence on a first pair of microphone signals A.sub.m1, A.sub.m2 from a first microphone array 10, 11, which comprises the first microphone 10 and the second microphone 11, and in further dependence on the audio output signal A.sub.o. The second high-frequency array processor 52 provides a second array signal A.sub.a2 in dependence on a second pair of microphone signals A.sub.m1, A.sub.m3 from a second microphone array 10, 12, which comprises the first microphone 10 and the third microphone 12, and in further dependence on the audio output signal A.sub.o.
[0030] For ease of reading, the following will be adhered to in the following text: The sound inlet 13 of the first microphone 10 will be referred to as a front sound inlet, while the sound inlets 14, 15 of the second and the third microphones 11, 12 will be referred to as rear sound inlets. Correspondingly, the first microphone 10 will be referred to as a front microphone, while the second and the third microphones 11, 12 will be referred to as rear microphones. Also, the microphone signal A.sub.m1 from the first microphone 10, which is received by both high-frequency array processors 51, 52, will be referred to as a front microphone signal, while the microphone signals A.sub.m1, A.sub.m2 from the second and the third microphones 11, 12, which is each received by only one of the high-frequency array processors 51, 52, will be referred to as rear microphone signals. Also, for each microphone array 10, 11, 10, 12, the direction from the respective rear sound inlet 14, 15 along the respective microphone axis 9, 19 towards the front sound inlet 13 will be referred to as the front direction.
[0031] Thus, each high-frequency array processor 51, 52 receives a front microphone signal A.sub.m1 as well as a respective one of the rear microphone signals A.sub.m2, A.sub.m3 and provides a respective one of the first and the second array signal A.sub.a1, A.sub.a2 in dependence hereon. As explained in further detail further below, each high-frequency array processor 51, 52 controls the directivity pattern of the respective array signal A.sub.a1, A.sub.a2 such that the directivity pattern has a main lobe generally oriented towards the front direction of the respective microphone array 10, 11, 10, 12 and such that the directivity pattern further exhibits reduced sensitivity towards the sound driver 5.
[0032] The first microphone signal A.sub.m1 provided by the first microphone 10 is used for providing both the first and the second array signal A.sub.a1, A.sub.a2, which may make the desktop speakerphone 1 less space-consuming and less expensive to manufacture than prior art speakerphones. Also, the use of pressure microphones, i.e. omnidirectional microphones, may make the desktop speakerphone 1 less expensive to manufacture than prior art speakerphones and may further provide greater versatility with respect to the over-all design of the housing 2 of the desktop speakerphone 1 without compromising the effectiveness of the directional microphone system 6, 7.
[0033] The low-frequency array processor 53 provides a mainly non-directional array signal A.sub.a3 by adding the microphone signals A.sub.m1, A.sub.m2, A.sub.m3 from all of the three microphones 10, 11, 12, which thus form a third microphone array. The non-directionality is achieved through in-phase adding of the microphone signals A.sub.m1, A.sub.m2, A.sub.m3 and subsequent low-pass filtering in the low-pass filter 55 (see below).
[0034] The two high-pass filters 54 each receives and high-pass filters a respective one of the first and the second array signal A.sub.a1, A.sub.a2 to provide a respective high-pass filtered signal A.sub.f1, A.sub.f2. The low-pass filter 55 receives and low-pass filters the mainly non-directional array signal A.sub.a3 to provide a low-pass filtered signal A.sub.f3. Each of the two adders 56 receives a respective one of the high-pass filtered signals A.sub.f1, A.sub.f2 as well as the low-pass filtered signal A.sub.f3 and adds the respective high-pass filtered signal A.sub.f1, A.sub.f2 to the low-pass filtered signal A.sub.f3 to provide a respective combined array signal A.sub.c1, A.sub.c2. Each of the two residual-echo cancellers 57 receives a respective one of the combined array signals A.sub.c1, A.sub.c2, the front microphone signal A.sub.m1, the audio output signal A.sub.o as well as the level-controlled signal A.sub.u from the output path 24 and provides a respective beam signal A.sub.b1, A.sub.b2 in dependence hereon.
[0035] Each residual-echo canceller 57 may employ any know method for cancelling or otherwise suppressing residual feedback from the sound driver 5 in the respective beam signal A.sub.b1, A.sub.b2. One such known method is based on processing the respective combined array signal A.sub.c1, A.sub.c2 in multiple frequency bands and attenuating the combined array signal A.sub.c1, A.sub.c2 in those frequency bands wherein its signal level correlates with the signal level of the audio output signal A.sub.o in the same frequency band.
[0036] As shown in
[0037] The filter controller 64 receives the front microphone signal A.sub.m1, the rear microphone signal A.sub.m2, A.sub.m3 as well as the audio output signal A.sub.o and adaptively determines the first set of filter coefficients C.sub.w such that in the array signal A.sub.a1, A.sub.a2, sound A.sub.e emitted by the sound driver 5 is suppressed or attenuated relative to voice sound A.sub.v arriving from the front direction of the microphone array 10, 11, 10, 12. The filter controller 64 thus controls the directivity pattern of the microphone array 10, 11, 10, 12 such that the directivity pattern has reduced sensitivity towards the sound driver 5, at least when compared to the sensitivity in the front direction, preferably also when compared to the average sensitivity across all directions.
[0038] The filter controller 64 preferably determines the first set of filter coefficients C.sub.w according to an adaptation algorithm that provides a reduction in the coherence between the array signal A.sub.a1, A.sub.a2 and the audio output signal A.sub.o under the constraint that voice sound A.sub.v received from the front direction is substantially maintained in the array signal A.sub.a1, A.sub.a2. Thus, the directivity pattern of the microphone array 10, 11, 10, 12 is adaptively controlled to reduce acoustic feedback from the sound driver 5 in the array signal A.sub.a1, A.sub.a2 and thus also in the audio input signal A.sub.i. Numerous such adaptation algorithms are known from the prior art and may be used for this purpose. Preferred algorithms are described in the following.
[0039] The block diagram shown in
[0040] In the shown embodiment of the high-frequency array processor 51, 52, the equalized spectrum, i.e. the spectrum of the of the array signal A.sub.a1, A.sub.a2, thus equals:
Sq=Q.Math.Sz=Q.Math.(SfW.Math.Sr)(1)
[0041] The sound A.sub.e emitted by the sound driver 5 will be received by each of the front and the rear microphone 10, 11, 12 and will thus also appear in the front and the rear microphone spectrum Sf, Sr. In the following, the portion of the front microphone spectrum Sf that originates from the sound driver 5 is referred to as Sfe, the portion of the rear microphone spectrum Sr that originates from the sound driver 5 is referred to as Sre, and the portion of the difference spectrum Sz that originates from the sound driver 5 is referred to as Sze. Applying equation (1), the portion of the equalized spectrum Sq that originates from the sound driver 5 thus equals:
Sqe=Q.Math.Sze=Q.Math.(SfeW.Math.Sre)(2)
[0042] Acoustic feedback in the array signal A.sub.a1, A.sub.a2 may therefore be reduced or eliminated by controlling W such that Sqe is reduced, ideally to zero. The latter may be achieved by controlling W according to:
W=Sfe/Sre(3)
provided that Sre does not contain any spectral zeroes.
[0043] The sound A.sub.e emitted by the sound driver 5 is derived from the audio output signal A.sub.o, and thus, equation (3) can be expanded to:
W=(Sfe/So)/(Sre/So)=Hfo/Hro(4)
wherein Hfo and Hro are the transfer functions from the audio output signal A.sub.o to respectively the front microphone signal A.sub.m1 and the rear microphone signal A.sub.m2, A.sub.m3. In the general case wherein a signal y dependent on another signal x is contaminated by noise uncorrelated to the other signal x, the transfer function Hyx from x to y may be estimated as:
Hyx=
wherein
W=Hfo/Hro=(
wherein
[0044] The filter controller 64 may thus preferably repeatedly perform a cross-power analysis based on the audio output signal A.sub.o, the front microphone signal A.sub.m1 and the rear microphone signal A.sub.m2, A.sub.m3 and determine the transfer function W of the controllable filter 61 in dependence on the result of the cross-power analysis. The filter controller 64 may e.g. repeatedly estimate the average cross-power spectrum
[0045] The filter controller 64 may preferably repeat the determination of the transfer function W of the controllable filter 61 at a rate fast enough to ensure that typically encountered changes in the acoustic path between the sound driver 5 and the microphones 10, 11, 12 do not cause artifacts in the audio input signal A.sub.i. Such changes may occur e.g. when users relocate or reorient the desktop speakerphone 1, or when users move themselves, their hands or other objects in the vicinity of the desktop speakerphone 1. This adaptation of the transfer function W may enable the desktop speakerphone 1 to provide a more robust suppression of acoustic feedback from the sound driver 5 compared to prior art speakerphones. The adaptation may be made at different speeds dependent on the intended use scenarios for a particular desktop speakerphone 1. The filter controller 64 may e.g. repeat the determination of the transfer function W of the controllable filter 61 once per frame or less frequently. Within the present document, the term frame bears the meaning it commonly has in connection with frequency-domain signals, namely a set of frequency bin values provided in a single step of converting a time-domain signal into a frequency-domain signal.
[0046] In a more robust embodiment, the filter controller 64 may iteratively determine the transfer function W of the controllable filter 61 by repeatedly determining and applying a frequency-dependent adjustment term dW to the transfer function W to counteract acoustic feedback in the difference signal A.sub.z. An advantage of this approach is that the filter controller 64 may halt or slow down the adaptation of the transfer function W when adverse conditions for adaptation prevail, e.g. when local users speak, when the transfer function W is close to its optimum value and/or when Sre does contain spectral zeroes. Also, where or when the adaptation of W is to be made less frequently than once per frame, this may be achieved simply by setting the adjustment term dW equal to zero for intermediate frames, i.e. frames for which no adaptation shall be made.
[0047] The filter controller 64 may preferably determine the transfer function W according to:
W.sub.k+1=W.sub.k+U.sub.k.Math.dW.sub.k(7)
wherein the index k represents the current frame number of the involved frequency-domain signals, W.sub.k is the current value of the transfer function W, W.sub.k+1 is the subsequent value of the transfer function W, dW.sub.k is the adjustment term, and U.sub.k is a frequency-dependent moderation factor between 0 and 1. The filter controller 64 may preferably determine the adjustment term dW.sub.k such that if it were applied in the current frame, the portion Sze of the difference spectrum Sz that originates from the sound driver 5 would become zero. This value of the adjustment term dW.sub.k may be derived from equation (2). First, applying frame indices k to equation (2) and omitting the effect of the equalizer 63 yields:
Sze.sub.k=Sfe.sub.kW.sub.k.Math.Sre.sub.k(8)
[0048] Inserting the adjustment term dW.sub.k and the condition that Sze.sub.k be zero into equation (8) yields:
0=Sfe.sub.k(W.sub.k+dW.sub.k).Math.Sre.sub.k(9)
[0049] Solving the equation set (8) (9) for the adjustment term dW.sub.k yields:
dW.sub.k=Sze.sub.k/Sre.sub.k(10)
which following the reasoning further above from equation (3) through equation (6) may be expanded to:
dW.sub.k=
wherein
[0050] As shown in
[0051] It may be difficult to prevent the sound driver 5 from exitating spurious resonances in the housing 2 and other mechanical structures of the speakerphone 1. Such spurious resonances may cause substantial changes in the sound field surrounding the speakerphone 1 and thus also affect the microphone signals A.sub.m and eventually the determination of the transfer function W. Since such resonances are not correlated with the voice sound S.sub.v, the filter controller 64 may treat the disturbances as feedback from the sound driver 5 and thus cause the transfer function W to deviate from its optimum. Spurious resonances may thus indirectly cause audible artefacts in the audio input signal A.sub.i provided to the audio communication network 20, in particular with a fast adaptation of the transfer function W. The filter controller 64 may preferably apply a spectral-domain low-pass filter function G to the determined transfer function W to reduce the effect of such spurious resonances. The spectral-domain low-pass filter function G acts to reduce differences between neighboring bins in the determined transfer function W. In other words, the spectral-domain low-pass filter function G smoothes the spectral shape of the transfer function W. The smoothing reduces the influence of narrow-band excursions in the spectrum of the acoustic feedback path from the sound driver 5 to the microphones 10, 11, 12, and since such narrow-band excursions are typically caused by resonances, this may generally improve the sound quality perceived by a remote party and/or allow for applying a faster adaptation of the transfer function W without deteriorating the sound quality.
[0052] The filter controller 64 may preferably apply the spectral-domain low-pass filter function G according to:
W.sub.k+1=G(W.sub.k+U.sub.k.Math.dW.sub.k)(12)
which is a modified version of equation (7). Alternatively, the filter controller 64 may apply the spectral-domain low-pass filter function G according to:
W.sub.k+1=W.sub.k=G(U.sub.k.Math.dW.sub.k)(13)
such that the spectral-domain low-pass filter function G works on the moderated adjustment term U.sub.k.Math.dW.sub.k.
[0053] The filter estimator 84 may thus comprise a spectral-domain low-pass filter 86 that operates to reduce differences between neighboring bins in the determined transfer function W. The spectral-domain low-pass filter 86 may e.g. be configured to apply the spectral-domain low-pass filter function G by passing a sliding average window across the spectrum of each instance of the determined transfer function W and/or each instance of the moderated adjustment term U.sub.k.Math.dW.sub.k. Instead of a sliding average window, the spectral-domain low-pass filter 86 may apply one or more other suitable filters selected among low-pass filters generally known in the art.
[0054] The filter estimator 84 may preferably adaptively determine the moderation factor U.sub.k in a manner that favors reliable values of the adjustment term dW.sub.k over unreliable values, e.g. as described in further detail below.
[0055] The reliability of the adjustment term dW.sub.k generally decreases when the amount of acoustic feedback from the sound driver 5 in the microphone signals A.sub.f, A.sub.r decreases relative to other signals, which typically is the case when local users speak. The filter estimator 84 may thus preferably adaptively monitor at least one of the microphone signals A.sub.f, A.sub.r and increase the moderation factor U.sub.k in frequency bins wherein acoustic feedback from the sound driver 5 in a monitored microphone signal A.sub.f, A.sub.r increases relative to other signals and adaptively decrease the moderation factor U.sub.k in frequency bins wherein acoustic feedback from the sound driver 5 in the monitored microphone signal A.sub.f, A.sub.r decreases relative to other signals. To achieve this, the filter estimator 84 may e.g. determine a frequency-dependent coherence Cmo between the audio output signal A.sub.o and one of the front and the rear microphone signal A.sub.f, A.sub.r and determine the moderation factor U.sub.k in dependence on the determined coherence Cmo. For each frequency bin, the coherence Cmo approaches 1 when acoustic feedback from the sound driver 5 dominates the respective microphone signal A.sub.f, A.sub.r and drops towards 0 when other signals are mixed into the microphone signal A.sub.f, A.sub.r. The above approach may thus result in improved values of the transfer function W and thus in increased reduction of acoustic feedback in the audio input signal A.sub.i.
[0056] The reliability of the adjustment term dW.sub.k further generally decreases when the amount of acoustic feedback from the sound driver 5 in the difference signal A.sub.z decreases relative to other signals, which typically is the case when the transfer function W is close to optimum. The filter estimator 84 may thus preferably, additionally or alternatively, adaptively increase the moderation factor U.sub.k in frequency bins wherein acoustic feedback from the sound driver 5 in the difference signal A.sub.z increases relative to other signals and adaptively decrease the moderation factor U.sub.k in frequency bins wherein acoustic feedback from the sound driver 5 in the difference signal A.sub.z decreases relative to other signals. To achieve this, the filter estimator 84 may e.g. determine a frequency-dependent coherence Czo between the audio output signal A.sub.o and the difference signal A.sub.z and determine the moderation factor U.sub.k in dependence on the determined coherence Czo. For each frequency bin, the coherence Czo approaches 1 when acoustic feedback from the sound driver 5 dominates the difference signal A.sub.z and drops towards 0 when other signals are mixed into the microphone signal A.sub.z. The above approach may thus result in improved values of the transfer function W and thus in increased reduction of acoustic feedback in the audio input signal A.sub.i.
[0057] The filter estimator 84 may preferably repeatedly determine the moderation factor U.sub.k in dependence on the coherence Cmo between the audio output signal A.sub.o and one of the front and the rear microphone signal A.sub.f, A.sub.r as well as in dependence on the coherence Czo between the audio output signal A.sub.o and the difference signal A.sub.z, e.g. according to:
U.sub.k=Cmo.sub.k.Math.(Czo.sub.k+)/(1+)(14)
wherein the index k is the current frame number, Cmo.sub.k is the current value of the frequency-dependent coherence Cmo between the audio output signal A.sub.o and one of the front and the rear microphone signal A.sub.f, A.sub.r, Czo.sub.k is the current value of the frequency-dependent coherence Czo between the audio output signal A.sub.o and the difference signal A.sub.z, and is a small, non-zero, non-negative convergence term that may prevent the adaptation of the transfer function W to stop prematurely when approaching the optimum.
[0058] In other embodiments, the filter estimator 84 may apply variants of equation (14). For instance, the convergence term may be set to zero and/or the factor Cmo.sub.k may be set to unity. In other embodiments, the filter estimator 84 may apply other, preferably similar functions for computing the moderation factor U.sub.k.
[0059] The filter controller 64 is preferably further configured to determine the transfer function W in a manner that is robust against spectral zeroes in the portion Sre of the rear microphone spectrum Sr that originates from the sound driver 5. This may e.g. be achieved by configuring the second spectral analyzer 82 to enforce a lower limit on the individual bin values of the average cross-power spectrum
[0060] In the desktop speakerphone 1, the transceiver 22 preferably exchanges the audio output signal A.sub.o and the audio input signal A.sub.i in digital form with the audio communication network 20 and/or the gateway device 21, e.g. through a USB connection or a Bluetooth connection. Also, the output path 24 and the input path 25 are preferably configured as digital circuits operating on digital signals, possibly except for portions thereof that interface to the sound driver 5 and/or the microphones 10, 11, 12. Also, the output path 24 and the input path 25 are preferably configured to operate on spectral signals, in particular in order to facilitate the adaptation of the transfer function W. Most portions of the transceiver 22, the output path 24 and the input path 25 may, however, alternatively or additionally be configured to operate on time-domain signals and/or as analog circuits operating on analog signals. Accordingly, the transceiver 22, the output path 24 and/or the input path 25 may comprise any number of signal domain converters, i.e. analog-to-digital, digital-to-analog, time-to-spectral-domain (FFT) and/or spectral-to-time-domain (IFFT) converters, as well as any number of signal encoders and/or signal decoders to perform any required signal conversions, signal encoding and/or signal decoding.
[0061] Functional blocks of digital circuits may be implemented in hardware, firmware or software, or any combination hereof. Digital circuits may perform the functions of multiple functional blocks in parallel and/or in interleaved sequence, and functional blocks may distributed in any suitable way among multiple hardware units, such as e.g. signal processors, microcontrollers and other integrated circuits.
[0062] The detailed description given herein and the specific examples indicating preferred embodiments of the invention are intended to enable a person skilled in the art to practice the invention and should thus be seen mainly as an illustration of the invention. The person skilled in the art will be able to readily contemplate further applications of the present invention as well as advantageous changes and modifications from this description without deviating from the scope of the invention. Any such changes or modifications mentioned herein are meant to be non-limiting for the scope of the invention.
[0063] Examples of further changes or modifications include: the desktop speakerphone 1 may comprise further sound drivers 5, the housing 2 may have various shapes, the sound driver 5 may be mounted off-center with respect to the housing 2, the number of microphone clusters 6, 7 may be e.g. 1, 3, 4, 5 or 6 and the input path 25 may be modified accordingly, the sound inlets 13, 14, 15 of multiple microphone clusters 6, 7 may be arranged asymmetrically, the output path 24 and/or the input path 25 may comprise further functional blocks known from prior art speakerphones, such as e.g. decoders, audio filters, circulators and the like, the emphasis filter 31, the volume control 32 and/or the limiter 33 may be omitted, the user interface 36 may be omitted or arranged remotely, e.g. in a gateway device 21, the speech detector 42, the speech level normalizer 43 and/or the beam selector 44 may be omitted, the beam selector 44 may employ other or further criteria for selecting the preferred signal, the low-frequency array processor 53 and the low-pass filter 55 may be omitted, the residual-echo cancellers 57 may be omitted, the subtractor 62 may be replaced with an adder if the filtered signal A.sub.w and the front microphone signal A.sub.m1 have opposite phases, etc.
[0064] The invention is not limited to the embodiments disclosed herein, and the invention may be embodied in other ways within the subject-matter defined in the following claims. As an example, features of the described embodiments may be combined arbitrarily, e.g. in order to adapt the devices according to the invention to specific requirements.
[0065] Any reference numerals and names in the claims are intended to be non-limiting for their scope.