Adaptive Reverberation Cancellation System
20180233123 ยท 2018-08-16
Inventors
Cpc classification
H04S7/305
ELECTRICITY
G10K2210/108
PHYSICS
G10K2210/12
PHYSICS
G10K11/178
PHYSICS
H04R3/02
ELECTRICITY
G10K2210/3028
PHYSICS
International classification
Abstract
A signal processor for determining a plurality of drive signals for driving a plurality of loudspeakers to cancel a reverberation effect in a listening area, wherein the signal processor is configured to determine from one or more measured audio signals a plurality of measured physical coefficients in a basis of physical sound functions, such that a sum of the physical sound functions, weighted with the plurality of measured physical coefficients approximates the one or more measured audio signals, wherein at least half of the plurality of measured physical coefficients are zero, determine a residual error between the plurality of measured physical coefficients and a plurality of desired physical coefficients, estimate a transfer function describing a transformation from the plurality of desired physical coefficients to the plurality of measured physical coefficients, based on the determined residual error, and update the plurality of drive signals based on the estimated transfer function.
Claims
1. A sound device comprising: a signal processor configured to: determine from one or more measured audio signals a plurality of measured physical coefficients in a basis of physical sound functions, such that a sum of the physical sound functions weighted with the plurality of measured physical coefficients approximates the one or more measured audio signals, wherein at least half of the plurality of measured physical coefficients are zero; determine a residual error between the plurality of measured physical coefficients and a plurality of desired physical coefficients; estimate a transfer function describing a transformation from the plurality of desired physical coefficients to the plurality of measured physical coefficients, based on the determined residual error; and update a plurality of drive signals based on the estimated transfer function.
2. The sound device of claim 1, wherein the signal processor is further configured to, when determining the plurality of measured physical coefficients; minimize an error measure between the measured audio signals and a linear transformation of the measured physical coefficients; and minimize a number of non-zero entries of the plurality of measured physical coefficients.
3. The sound device of claim 2, wherein the signal processor is further configured to, when minimizing the error measure and minimizing the number of non-zero entries of the plurality of measured physical coefficients, determine a vector b of the plurality of measured physical coefficients according to:
b=argmin.sub.yy.sub.p.sup.p, such that vy.sup.2 for 0p1, wherein y.sub.p is a p-norm of a vector y, is a MN sensing matrix comprising columns with the physical sound functions, N>>M, v is an M1 observation vector which comprises the one or more measured audio signals corresponding to M locations within the listening area, wherein the signal processor is further configured to randomly chose the M locations.
4. The sound device of claim 1, wherein the basis of physical sound functions is orthogonal with regard to an inner product that for a first vector bi and a second vector bj is representable as:
b.sub.ib.sub.j
=.sub.Rb.sub.i(x)b.sub.j(x)w(x)dx=.sub.ij, wherein R is a reproduction region of a plurality of loudspeakers, w(x) is a weighting function, and .sub.ij is 1 for i=j and 0 otherwise.
5. The sound device of claim 1, wherein the basis of physical sound functions comprises an orthonormal set of physical sound functions obtained from a modified Gram-Schmidt process on plane wave functions corresponding to a plurality of angles.
6. The sound device of claim 1, wherein the transfer function assigns a zero-coupling between a first coefficient and a second coefficient of the basis of physical sound functions, wherein the transfer function is representable as a diagonal matrix U(k).
7. The sound device of claim 6, wherein the signal processor is further configured to, when estimating the transfer function, estimate the diagonal matrix U(k) using a Least Mean Squares filter and/or using a Recursive Least Squares filter.
8. The sound device of claim 7, wherein the signal processor is further configured to, when estimating the diagonal matrix U(k), compute an n-th element of the diagonal matrix U(k) according to
9. The sound device of claim 1, wherein the signal processor is further configured to, when updating the plurality of drive signals, compute a drive signal update * such that an energy level of the drive signal update * is limited with an upper bound, wherein the energy level of the drive signal update * is computed as a square value of the drive signal update *.
10. The sound device of claim 9, wherein the signal processor is further configured to, when updating the plurality of drive signals, compute the drive signal update * as:
11. The sound device of claim 1, wherein the signal processor is further configured to perform an initial step of preconditioning a drive signal update * to 0 and/or preconditioning a diagonal matrix U(k) to an identity matrix.
12. (canceled)
13. A method for generating a plurality of drive signals for driving a plurality of loudspeakers to cancel a reverberation effect in a listening area, the method comprising: driving the plurality of loudspeakers with an initial plurality of drive signals; measuring one or more audio signals at one or more measurement locations; determining from the one or more measured audio signals a plurality of measured physical coefficients of in a basis of physical sound functions, such that a sum of the physical sound functions, weighted with the plurality of measured physical coefficients approximates the one or more measured audio signals, wherein at least half of the plurality of measured physical coefficients are zero; determining a residual error between the plurality of measured physical coefficients and a plurality of desired physical coefficients; estimating a transfer function from the plurality of desired physical coefficients to the plurality of measured physical coefficients, based on the determined residual error; and. updating the initial plurality of drive signals based on the estimated transfer function.
14. The method of claim 13, further comprising: minimizing an error measure between the measured audio signals and a linear transformation of the measured physical coefficients; and minimizing the number of non-zero entries of the plurality of measured physical coefficients, wherein minimizing the error measure and minimizing the number of non-zero entries of the plurality of measured physical coefficients comprises: determining a vector b of the plurality of measured physical coefficients according to:
b=argmin.sub.yy.sub.p.sup.p, such that vy.sup.2 for 0p1, wherein y.sub.p is a p-norm of a vector y, is a MN sensing matrix comprising columns with the physical sound functions, N>>M, v is an M1 observation vector which comprises the one or more measured audio signals corresponding to M locations within the listening area, wherein the signal processor is configured to randomly chose the M locations.
15. The method of claim 13, wherein the basis of physical sound functions is orthogonal with regard to an inner product that for a first vector bi and a second vector bj is representable as:
b.sub.i|b.sub.j
=.sub.Rb.sub.i(x)b.sub.j(x)w(x)dx=.sub.ij, wherein R is a reproduction region of the plurality of loudspeakers, w(x) is a weighting function, and .sub.ij is 1 for i=j and 0 otherwise.
16. The method of claim 13, wherein the transfer function assigns a zero-coupling between a first coefficient and a second coefficient of the basis of physical sound functions, wherein the transfer function is representable as a diagonal matrix U(k).
17. The method of claim 16, further comprising, when estimating the diagonal matrix U(k), computing an n-th element of the diagonal matrix U(k) according to:
18. The method of claim 13, further comprising, when updating the plurality of drive signals, computing a drive signal update * such that an energy level of the drive signal update * is limited with an upper bound, wherein the energy level of the drive signal update * is computed as a square value of the drive signal update *.
19. The method of claim 18, further comprising, when updating the drive signal, computing the drive signal update * as
20. A non-transitory computer-readable storage medium comprising instructions that when executed by a signal processor cause the signal processor to: determine from one or more measured audio signals a plurality of measured physical coefficients in a basis of physical sound functions, such that a sum of the physical sound functions weighted with the plurality of measured physical coefficients approximates the one or more measured audio signals, wherein at least half of the plurality of measured physical coefficients are zero; determine a residual error between the plurality of measured physical coefficients and a plurality of desired physical coefficients; estimate a transfer function describing a transformation from the plurality of desired physical coefficients to the plurality of measured physical coefficients, based on the determined residual error; and update a plurality of drive signals based on the estimated transfer function.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0047] To illustrate the technical features of embodiments of the present disclosure more clearly, the accompanying drawings provided for describing the embodiments are introduced briefly in the following. The accompanying drawings in the following description are merely some embodiments of the present disclosure, but modifications on these embodiments are possible without departing from the scope of the present disclosure as defined in the claims.
[0048]
[0049]
[0050]
[0051]
[0052]
[0053]
DETAILED DESCRIPTION OF THE EMBODIMENTS
[0054]
[0055] The signal processor 100 comprises a coefficient unit 110 which is configured to determine from one or more measured audio signals a plurality of measured physical coefficients in a basis of physical sound functions, such that a sum of the physical sound functions, weighted with the plurality of measured physical coefficients approximates the one or more measured audio signals, wherein at least half of the plurality of measured physical coefficients are zero. The basis of physical sound functions can be fixed or there can be several bases of physical sound functions, wherein a specific basis can be chosen, e.g. by setting a basis selection parameter.
[0056] The signal processor 100 further comprises a residual error unit 120 which is configured to determine a residual error between the plurality of measured physical coefficients and a plurality of desired physical coefficients.
[0057] The signal processor 100 further comprises a transfer unit 130, which is configured to estimate a transfer function describing a transformation from the plurality of desired physical coefficients to the plurality of measured physical coefficients, based on the determined residual error.
[0058] The signal processor 100 further comprises an update unit 140 which is configured to update the plurality of drive signals based on the estimated transfer function. The update unit 140 can be configured to generate an initial update as zero, i.e., to initially generate a drive signal that corresponds to an input signal. The input signal can be provided to the signal processor 100 from an external unit or the input signal can be determined in the signal processor 100.
[0059] The signal processor 100 is configured to control its units such that they repeatedly compute updates to the plurality of drive signals.
[0060] The coefficient unit 110, residual error unit 120, transfer unit 130 and the update unit 140 can be realized in the same physical hardware, for example they can be realized as different parts of a programming of the signal processor 100.
[0061]
[0062]
[0063] The method comprises a second step of measuring 320 one or more audio signals at one or more measurement locations. For example, the one or more audio signals can be measured using microphones that are placed at random locations in the listening area. The method can comprise a further step of determining positions of the randomly placed microphones, such that measured audio signals can be correlated with positions of the corresponding microphones.
[0064] In a third step 330 from the one or more measured audio signals a plurality of measured physical coefficients in a basis of physical sound functions is determined, such that a sum of the physical sound functions, weighted with the plurality of measured physical coefficients approximates the one or more measured audio signals, wherein at least half of the plurality of measured physical coefficients are zero. In particular, at least or preferably at least 90% of the plurality of measured physical coefficients can be zero.
[0065] In a fourth step 340 a residual error between the plurality of measured physical coefficients and a plurality of desired physical coefficients is determined.
[0066] In a fifth step 350 a transfer function describing a transformation from the plurality of desired physical coefficients to the plurality of measured physical coefficients is determined based on the determined residual error.
[0067] In a sixth step 360, an updated version of the initial plurality of drive signals is determined based on the estimated transfer function. The updated version of the initial plurality of drive signal is output to a plurality of loudspeakers, and the method can continue in step 320.
[0068] In a further step (not shown in
[0069]
[0070] The adaptive room reverberation cancelation system 420 comprises a sound device, e.g. the sound device of
[0071]
[0072] In a -th iteration, the adaptive reverberation cancelation system 520 generates an updated drive signal l(k)+(k).sub. which drives the plurality of loudspeakers 510. The walls of the reverberant room 512 reflect the generated sound waves.
[0073] Microphones 540 measure a plurality of audio signals 541 in the reproduction region and from these measured audio signals a plurality of measured physical coefficients b.sub.n(k) is determined. A difference between the measured physical coefficients b.sub.n(k) and a plurality of desired physical coefficients is formed in the summing unit 522 and fed back to the adaptive reverberation cancelation system 520. Based on the difference, which represents a residual error 523, the adaptive reverberation cancelation system updates the drive signal, which begins a next iteration of the iterative reverberation cancellation process.
[0074]
[0075] In a first step 602, the loudspeaker drive signals are preconditioned to l(k), i.e., the initial update is 0.
[0076] In a second step 604, a plurality of measured physical coefficients is determined in a basis of physical sound functions, such that a sum of the physical sound functions of the basis, wherein the sum is weighted with the plurality of measured physical coefficients, approximates the one or more measured audio signals.
[0077] Based on a difference between the plurality of measured physical coefficients and a plurality of desired physical coefficients, a new residual error is determined.
[0078] In a third step 606, diagonal entries of a diagonal matrix U(k).sub. are determined using RLS adaptive filtering methods.
[0079] In a fourth step 608, the array of loudspeakers is driven with the updated plurality of drive signals.
[0080] If the residual error is sufficiently small, the method can output the sum of a predefined driving signal (e.g. an input signal times a predefined filter in the frequency domain) l(k) and the update signal (k). In embodiments of the disclosure, the update signal (k) can be determined based on an update filter, e.g. by applying the update filter to the predefined driving signal.
[0081] In further step 610, an Inverse Fourier Transform is applied to the updated plurality of drive signals l(k)+(k).sub. and in further step 612, the Fourier-transformed signals 611 are plaid back with the plurality of speakers. The method then continues in step 604, with an incremented iteration index .
[0082] In the following, it is described in more detail how a sparse approximation method can be used to calculate b.sub.n(k) from the randomly-placed measurements v.sub.m(k) within the selected zones of interest.
[0083] A basic principle of the method is to assume that the reproduced sound field S(x; k) results from only a small number of basis Helmholtz solutions. Based on this assumption, the following lp norm (where 0<p<1) nonconvex optimization problem may be considered:
where y is the basis function coefficient set, the dictionary is an MN sensing matrix (N>>M) whose columns contain the values of G.sub.n(x; k) at M locations and v is an M1 observation vector which contains the values of the actual reproduced sound field S(x; k) at M randomly chosen locations within the desired region. The error is related to the he additive complex Gaussian noise level. Let y be a sparse signal, i.e., y has a limited number of non-zero entries at unknown locations. Therefore, the regularized Iteratively Reweighted Least Squares (IRLS) algorithm may be applied to solve equation (3) and derive the optimal estimator that characterizes the reproduced sound field in reverberant environments:
where has only m (mM) non-zero components and can be used as an estimate of the basis function coefficients b.sub.n(k).
[0084] Overall, the calculation of the sound field coefficients b.sub.n(k) may be formulated based on the sound field measurements in (1) in the following matrix form
b(
where b(
[0085] The desired multi zone sound field S.sup.d(x; k) and the actual reproduced sound field in a reverberant room S(x; k) can be characterized by b.sup.d(k) and b(k) that represents the respective coefficient sets of the orthonormal basis function {G.sub.n}. Note that the coefficients for S.sup.d(x; k) can be derived offline.
[0086] Consider the reverberant room channel as a transformation between the reproduced sound field and the desired sound field, which can be further expressed by a linear transformation of the basis function coefficients:
b(k)=U(k)b.sup.d(k),(6)
where U(k)=diag[U.sub.1(k), . . . , U.sub.N(k)] represents the reverberant room effects at the wavenumber k. Note that U(k) may be parametrized with a diagonal structure following the assumption that the couplings between the sound field coefficients with different indices can be neglected in the defined basis function domain.
[0087] The room channel transformation U(k) can be estimated in an iterative fashion. {tilde over (b)}(k) may be defined as the measured sound field coefficients at the microphones after updating the loudspeaker signals. An accurate estimate of the room channel transformation .Math.(k) can be achieved if the squared norm of the residual error {tilde over (b)}(K)b.sup.d(k).sup.2 is minimized, which also leads to an accurate matching between the actual reproduced sound field and the desired multi zone sound field over the desired reproduction region. This can be treated as an adaptive filtering problem and U(k) can be estimated actively by using algorithms such as a LMS filter and a RLS filter.
[0088] Due to the diagonal structure of U(k), calculating the unknown diagonal entries U.sub.N(k) can be further simplified as a single-tap adaptive filtering problem. Let .Math.(k).sub. be the estimate of U(k) at the th adaption step:
where .sub.n.sup.2() is the gain factor .sub.n.sup.2()=.sub.n.sup.2(1)+|b.sub.n.sup.d(k)|.sup.2. is the forgetting factor. The RLS algorithm may be selected as it provides a fast convergence rate. Therefore, equation (7) can be applied to obtain an iterative estimate of the diagonal elements U.sub.n(k) based on the residual error at the r th adaption step.
[0089] The optimal filter updating signal on the loudspeaker array can be derived based on the active estimate of the room channel transformation. It is designed to minimize the residual error and ensure the estimation convergence. The initial loudspeaker array signals may be preconditioned to reproduce the desired multi zone sound field under free-field assumption. Therefore, the coefficients for the desired sound field b.sup.d(k) can be expressed by replacing C(k) with the direct channel C.sup.d(k) in equation (5):
b.sup.d(k)=TC.sup.d(k)l(k).(8)
[0090] Let G.sup.d(k)=TC.sup.d(k) represent the pre-determined sound field coefficient matrix of the Green's functions for all loudspeakers assuming free-field propagation. Incorporating the room channel model in (6) and the estimator .Math.(k):
b(k)=.Math.(k)G.sup.d(k)l(k).(9)
[0091] Following (9), the measured sound field coefficients {tilde over (b)}.sub.n(k) after adding updating signals (k) to the loudspeakers can be given by
{tilde over (b)}(k)=.Math.(k)G.sup.d(k)[l(k)+(k)].(10)
[0092] The difference between the measured and desired sound field coefficients using (8) and (10) may be written as:
{tilde over (b)}(k)b.sup.d(k)=[.Math.(k)I]G.sup.d(k)l(k)+.Math.(k)G.sup.d(k)(k),(11)
where I is an identity matrix.
[0093] An efficient reverberation compensation and accurate sound field reproduction can be achieved by finding the optimal loudspeaker filter updating signals (k) that minimize {tilde over (b)}(k)b.sup.d(k).sup.2. Therefore, a multi-constraint convex optimization is formulated with the objective of minimizing the error between the measured and desired sound field coefficients, while also guaranteeing the convergence:
s.t.(k).sub.q.sup.2N.sub.1(q=1 . . . Q).
[0094] G.sup.d(k) can be calculated offline. The value of N.sub.1 is adjustable and it depends how reverberant the room environment is. It can be set to be less or equal to (1(k).sup.2)/N.sub.w, where (k) is the reflection coefficients and N.sub.w is the number of walls. Note that the additional constraints on the energy of each of the loudspeaker filter updating signals are applied so that the reverberation effects of (k).sub.q are insignificant and can be consistently mitigate the adaptive process, thereby avoiding the active calculation of pseudo-inverse of the reverberation channel matrix. These formulations guarantee the system convergence and lead to less computational complexity and faster convergence than some approaches.
[0095] To summarize, in embodiments of the disclosure, the reproduced sound field is described as a weighted series of orthonormal basis functions over the desired reproduction region, which is then used to adaptively equalize the desired multi zone sound field in terms of the basis function coefficients. An adaptive reverberation cancelation system for multi zone sound field reproduction using sparse microphone measurements is proposed. The proposed approach expresses the sound field as a space-frequency orthonormal basis function expansion the desired reproduction region. The reproduced sound field may be considered as a linear transformation of the desired sound field. The adaptive channel estimation process may be introduced using sparse methods to identify these transformations directly in the orthogonal basis function domain and derive the loudspeaker updating signals that compensate the room reverberation and guarantee the convergence of the adaptive estimation in reverberant environments.
[0096] Advantages of embodiments of the disclosure include the presented signal processor, sound device and method do not require a prior measurement of the transfer functions of the employed loudspeaker. They can adapt to the alteration of ambient environment condition during the measurement process. The presented signal processor, sound device and method provide an accurate reproduction of the desired sound field under the same hardware provision and environment settings by employing the sparse methods, i.e. the same performance can be achieved using a smaller number of microphone measurements. The presented signal processor, sound device and method show a better convergence behavior to a good reproduction performance, especially in the reverberant rooms that feature low direct-to-reverberant-path power ratios. This is achieved by formulating a novel multi-constraint convex optimization and avoiding the active calculation of pseudo-inverse of the reverberation channel matrix, which guarantee the system convergence. The adaptive reverberation cancelation system rectifies the unwanted reverberation effects based on iterative feedbacks from a small number of microphone measurements, so that the listeners can still enjoy an accurate sound field reproduction even in extreme complex environments (e.g. car chamber). Less computational complexity and faster convergence.
[0097] Applications of embodiments of the disclosure include any sound reproduction system or surround sound system using multiple loudspeakers.
[0098] In particular, embodiments of the presented disclosure can be applied to TV speaker systems, car entertaining systems, teleconference systems, and/or home cinema system, where personal listening environments for one or multiple listeners is desirable.
[0099] The foregoing descriptions are only implementation manners of the present disclosure, the protection of the scope of the present disclosure is not limited to this. Any variations or replacements can be easily made by a person skilled in the art. Therefore, the protection scope of the present disclosure should be subject to the protection scope of the attached claims.