System and method for augmenting an acoustic space

Abstract

A method and system for real-time auralization is described in which room sounds are reverberated and presented over loudspeakers, thereby augmenting the acoustics of the space. Room microphones are used to capture room sound sources, with their outputs processed in a canceler to remove the synthetic reverberation also present in the room. Doing so gives precise control over the auralization while suppressing feedback. It also allows freedom of movement and creates a more natural acoustic environment for performers or participants in music, theater, gaming, home entertainment, and virtual reality applications. Canceler design methods are described, including techniques for handling varying loudspeaker-microphone transfer functions such as would be present in the context of a performance or installation.

Claims

1. A system for reducing feedback resulting from a sound produced by a speaker being captured by a microphone, the sound including auralization effects, the system comprising: an auralizer for producing the auralization effects; and a canceler, wherein the canceler includes a cancellation filter that is based on an impulse response between the microphone and the speaker, and wherein the impulse response is formed according to acoustics of a live acoustic space in which the microphone and the speaker are separately placed, and wherein the acoustics include at least an acoustic propagation delay between the speaker and the microphone.

2. The system of claim 1, wherein the cancellation filter is calibrated based on relative positions of the microphone and the speaker in the live acoustic space.

3. The system of claim 1, wherein the microphone is one of a plurality of microphones, and wherein the speaker is one of a plurality of speakers, and wherein the cancellation filter is based on impulse responses between each microphone-speaker pair of the plurality of microphones and the plurality of speakers.

4. The system of claim 1, wherein the auralization effects include artificial reverberation.

5. The system of claim 4, wherein the artificial reverberation is performed in accordance with a target acoustic space that is different from the live acoustic space.

6. The system of claim 1, wherein the microphone further captures live sound, the canceler being operative to reduce feedback caused by the acoustics of the live acoustic space before the live sound is processed by the auralizer and output to the speaker as the sound with the auralization effects.

7. The system of claim 1, wherein the auralizer and the canceler are implemented by a digital audio workstation.

8. A method for reducing feedback resulting from a sound produced by a speaker being captured by a microphone, the sound including auralization effects, the system comprising: capturing live sound by the microphone; and performing cancelation on the live sound using a cancellation filter that is based on an impulse response between the microphone and the speaker, the cancelation resulting in a live sound estimate, and wherein the impulse response is formed according to acoustics of a live acoustic space in which the microphone and the speaker are separately placed, and wherein the acoustics include at least an acoustic propagation delay between the speaker and the microphone.

9. The method of claim 8, further including adding the auralization effects to the live sound estimate and providing the live sound estimate with the added auralization effects to the speaker.

10. The method of claim 8, wherein the cancellation filter is calibrated based on relative positions of the microphone and the speaker in the live acoustic space.

11. The method of claim 8, wherein the microphone is one of a plurality of microphones, and wherein the speaker is one of a plurality of speakers, and wherein the cancellation filter is based on impulse responses between each microphone-speaker pair of the plurality of microphones and the plurality of speakers.

12. The method of claim 9, wherein adding the auralization effects includes performing artificial reverberation.

13. The method of claim 12, wherein the artificial reverberation is performed in accordance with a target acoustic space that is different from the live acoustic space.

14. A method for reducing feedback resulting from a sound produced by a speaker being captured by a microphone, the sound including auralization effects, the method comprising: generating a live sound without auralization effects from the speaker in a live acoustic space; capturing the live sound by the microphone; measuring an impulse response between the microphone and the speaker using the captured live sound; and using the measured impulse response to obtain characteristics of a cancellation filter wherein the cancellation filter is configured to reduce effects of at least acoustics of a live acoustic space in which the microphone and the speaker are separately placed, and wherein the acoustics include at least an acoustic propagation delay between the speaker and the microphone.

15. The method of claim 14, wherein the measured impulse response is a function of frequency and time.

16. The method of claim 14, wherein the characteristics of the cancellation filter further include a windowing function.

Description

BRIEF DESCRIPTION OF THE DRAWINGS

(1) These and other aspects and features of the present embodiments will become apparent to those ordinarily skilled in the art upon review of the following description of specific embodiments in conjunction with the accompanying figures, wherein:

(2) FIG. 1 is a block diagram illustrating an example system according to embodiments;

(3) FIG. 2 is a signal flow diagram illustrating an example feedback canceling auralization system according to embodiments;

(4) FIG. 3 is a signal flow diagram illustrating another example feedback canceling auralization system according to embodiments;

(5) FIG. 4 is a diagram of a Max/MSP patch showing one possible implementation of a canceling reverberator according to embodiments;

(6) FIG. 5 is a signal flow diagram illustrating aspects of calibrating a feedback canceling auralization system according to embodiments;

(7) FIG. 6 is a flowchart illustrating an example methodology for calibrating a feedback canceling auralization system according to embodiments;

(8) FIG. 7 is a diagram illustrating aspects of an example impulse measurement obtained in connection with the methodology of FIG. 6;

(9) FIGS. 8A and 8B are diagrams illustrating an example cancellation impulse response obtained in accordance with the present embodiments;

(10) FIGS. 9A to 9C are diagrams illustrating aspects of an example canceling auralizer room impulse response according to embodiments;

(11) FIGS. 10A to 10C are spectrograms illustrating aspects of another example canceling auralizer room impulse response according to embodiments;

(12) FIGS. 11A and 11B are diagrams illustrating aspects of impulse response variation according to embodiments;

(13) FIG. 12 is a diagram illustrating further aspects of impulse response variation according to embodiments;

(14) FIGS. 13A to 13C are spectrograms illustrating an example of canceler performance and residual energy in accordance with embodiments; and

(15) FIGS. 14A and 14B are spectrograms illustrating an example performance of feedback cancellation in accordance with embodiments.

DETAILED DESCRIPTION

(16) The present embodiments will now be described in detail with reference to the drawings, which are provided as illustrative examples of the embodiments so as to enable those skilled in the art to practice the embodiments and alternatives apparent to those skilled in the art. Notably, the figures and examples below are not meant to limit the scope of the present embodiments to a single embodiment, but other embodiments are possible by way of interchange of some or all of the described or illustrated elements. Moreover, where certain elements of the present embodiments can be partially or fully implemented using known components, only those portions of such known components that are necessary for an understanding of the present embodiments will be described, and detailed descriptions of other portions of such known components will be omitted so as not to obscure the present embodiments. Embodiments described as being implemented in software should not be limited thereto, but can include embodiments implemented in hardware, or combinations of software and hardware, and vice-versa, as will be apparent to those skilled in the art, unless otherwise specified herein. In the present specification, an embodiment showing a singular component should not be considered limiting; rather, the present disclosure is intended to encompass other embodiments including a plurality of the same component, and vice-versa, unless explicitly stated otherwise herein. Moreover, applicants do not intend for any term in the specification or claims to be ascribed an uncommon or special meaning unless explicitly set forth as such. Further, the present embodiments encompass present and future known equivalents to the known components referred to herein by way of illustration.

(17) According to certain aspects, the present embodiments provide a system and method for real-time auralization that uses standard room microphones, loudspeakers, and inventive signal processing tools to synthesize virtual acoustics while canceling the feedback. The cancellation method described herein uses an adaptive noise cancellation approach (see, e.g., Widrow, B., et al., Adaptive Noise Cancelling: Principles and Applications, Proceedings of the IEEE, 63(12), pp. 1692-1716, 1975, hereinafter [32]) in which a primary signal is the sum of a desired signal and unwanted noise. In that approach, a reference signal, which is correlated with the unwanted noise, is used to estimate and subtract the unwanted noise from the primary signal. Related literature also includes echo cancellation and dereverberation (see, e.g., Emanuel Habets, Fifty years of reverberation reduction: From analog signal processing to machine learning, AES 60th Conference on DREAMS, 2016, hereinafter [12]; Patrick A Naylor and Nikolay D Gaubitch, Eds., Speech Dereverberation, Springer, 2010, hereinafter [13]; and Francis Rumsey, Reverberation . . . and how to remove it, Journal of the Acoustical Society of America, vol. 64, no. 4, pp. 262-6, April 2016, hereinafter [14]).

(18) In one embodiment, a loudspeaker and microphone are configured in a room having a sound source. Room sounds are reverberated according to the acoustics of a desired target space and presented over loudspeakers, thereby augmenting the acoustics of the room. The room microphone captures sound from the room sound sources as well as from the loudspeaker playing the simulated acoustics. Measurements of the impulse response between the loudspeaker and microphone are used to estimate and subtract the simulated acoustics from the microphone signal, thereby eliminating feedback. In another embodiment, impulse responses between a plurality of loudspeakers and microphones are used to cancel simulated acoustics from multiple loudspeakers for each microphone.

(19) In an additional embodiment, multiple impulse response measurements between a loudspeaker and microphone are made, and estimates of the impulse response standard deviation as a function of time and frequency band are formed, and used in designing the processing to cancel the synthesized acoustics from the microphone signals. In a further embodiment, the correlation between a loudspeaker and microphone signal is used to adaptively modify the cancellation processing.

(20) FIG. 1 is a block diagram illustrating an example system according to embodiments.

(21) As shown, example system 100 includes a microphone 102 and speaker 104 that are both connected to an audio interface 106. Audio interface 106 includes an input 108 connected to microphone 102 and an output 110 connected to speaker 104. Audio interface 106 further includes a port 112 connected to computer 114 (e.g. desktop or notebook computer, pad or tablet computer, smart phone, etc.). It should be noted that other embodiments of system 100 can include additional or fewer components than shown in the example of FIG. 1. For example, although FIG. 1 illustrates an example with one microphone 102 and one speaker 104, it should be apparent that there can by two or more microphones 102 and/or two or more speakers 104.

(22) Moreover, although shown separately for ease of illustration, it should be noted that certain components of system 100 can be implemented together. For example, computer 114 can comprise digital audio workstation software (e.g. implementing auralization and cancelation processing according to embodiments) and be configured with an audio interface such as 106 connected to microphone preamps (e.g. input 108) and microphones (e.g. microphone 102) and a set of powered loudspeakers (e.g. speaker 104). In these and other embodiments, certain components can also be integrated into existing speaker arrays, and can be implemented using inexpensive and readily available software. For example, in virtual, augmented, and mixed reality scenarios, the system allows users to dispense with headphones for more immersive virtual acoustic experiences. Other hardware and software, including special-purpose hardware and custom software, may also be designed and used in accordance with the principles of the present embodiments.

(23) In general operation according to aspects of embodiments, room sounds (e.g. a music performance, voices from a virtual reality game participant, etc.) are captured by microphone 102. The captured sounds (i.e. microphone signals) are provided via interface 106 to computer 114, which processes the signals in real time to perform artificial reverberation according to the acoustics of a desired target space (i.e. auralization). The processed sound signals are then presented via interface 106 over speaker 104, thereby augmenting the acoustics of the room and enriching the experience of performers, game players, etc. As should be apparent, the room microphone 102 will also capture sound from the speaker 104, which is playing the simulated acoustics. According to aspects of the present embodiments, and as will be described in more detail below, computer 114 further estimates and subtracts the simulated acoustics in real time from the microphone signal, thereby eliminating feedback.

(24) FIG. 2 is a signal flow diagram illustrating processing performed by system 100 (e.g. computer 114) according to an example embodiment. As shown in FIG. 2, example computer 114 in embodiments includes a canceler 202 and an auralizer 204. In operation of system 100, a room microphone 102 captures contributions from room sound sources d(t) and synthetic acoustics produced by the loudspeaker 104 according to its applied signal 1(t), t denoting time. Auralizer 204 imparts the sonic characteristic of a target space, embodied by the impulse response h(t), on the room sounds d(t) through convolution,
l(t)=h(t)*d(t).(1)

(25) Many known auralization techniques can be used to implement auralizer 204, such as those using fast, low-latency convolution methods to save computation (e.g., William G. Gardner, Efficient convolution without latency, Journal of the Audio Engineering Society, vol. 43, pp. 2, 1993, hereinafter [16]; Guillermo Garcia, Optimal filter partition for efficient convolution with short in-put/output delay, in Proceedings of the 113th Audio Engineering Society Convention, 2002, hereinafter [17]; and Frank Wefers and Michael Vorlnder, Optimal filter partitions for real-time fir filtering using uniformly-partitioned fft-based convolution in the frequency-domain, in Proceedings of the 14th International Conference on Digital Audio Effects, 2011, pp. 155-61, hereinafter [18]). Another modal reverberator approach is disclosed in U.S. Pat. No. 9,805,704, the contents of which are incorporated herein by reference in their entirety. Although these known techniques can provide a form of impulse response h(t) used by auralizer 204, the difficulty is that the room source signals d(t) are not directly available: As described above, the room microphones also pick up the synthesized acoustics, and would cause feedback if the room microphone signal m(t) were reverberated without additional processing.

(26) According to certain aspects, the present embodiments auralize (e.g. using known techniques such as those mentioned above) an estimate of the room source signals d{circumflex over ()}(t), formed by subtracting from the microphone signal m(t) an estimate of the synthesized acoustics (e.g. the output of speaker 104). Assuming the geometry between the loudspeaker and microphone is unchanging, the actual dry signal d(t) is determined by:
d(t)=m(t)g(t)*l(t),(2)
where g(t) is the impulse response between the loudspeaker and microphone. Embodiments design an impulse response c(t), which approximates the loudspeaker-microphone response, and use it to form an estimate of the dry signal, d{circumflex over ()}(t), which is determined by:
d{circumflex over ()}(t)=m(t)c(t)*l(t).(3)
as shown in the signal flow diagram FIG. 2. The synthetic acoustics are canceled from the microphone signal m(t) by canceler 202 and subtractor 206 to estimate the room signal d{circumflex over ()}(t), which signal is reverberated by auralizer 204.

(27) The question then becomes how to obtain the canceling filter c(t). A measurement of the impulse response g(t) provides an excellent starting point, though there are time-frequency regions over which the response is not well known due to measurement noise (typically affecting the low frequencies), or changes over time due to air circulation or performers, participants, or audience members moving about the space (typically affecting the latter part of the impulse response). In regions where the impulse response is not well known, it is preferred that the cancellation be reduced so as to not introduce additional reverberation.

(28) Here, the cancellation filter 202 impulse response c(t) is preferably chosen to minimize the expected energy in the difference between the actual and estimated room microphone loud-speaker signals. For simplicity of presentation and without loss of generality, assume for the moment that the loudspeaker-microphone impulse response is a unit pulse, i.e.
g(t)=g(t),(4)
and that the impulse response measurement g.sup.(t) is equal to the sum of the actual impulse response and zero-mean noise with variance g.sup.2. Consider a canceling filter c(t) which is a windowed version of the measured impulse response g.sup.(t),
c(t)=wg.sup.(t),(5)

(29) In this case, the measured impulse response is scaled according to a one-sample-long window w. The expected energy in the difference between the auralization and cancellation signals at time t is
E[(gl(t)wg.sup.l(t)).sup.2]=l.sup.2(t)[w.sup.2g.sup.2+g.sup.2(1w).sup.2].(6)

(30) Minimizing the residual energy over choices of the window w yields
c*(t)=w*g.sup.(t), w*=g.sup.2/(g.sup.2+g.sup.2)

(31) In other words, the optimum canceler response c*(t) is a Wiener-like weighting of the measured impulse response, w*g.sup.(t). When the loudspeaker-microphone impulse response magnitude is large compared with the impulse response measurement uncertainty, the window w will be near 1, and the cancellation filter will approximate the measured impulse response. By contrast, when the impulse response is poorly known, the window w will be smallroughly the measured impulse response signal-to-noise ratioand the cancellation filter will be attenuated compared to the measured impulse response. In this way, the optimal cancellation filter impulse response is seen to be the measured loudspeaker-microphone impulse response, scaled by a compressed signal-to-noise ratio (CSNR).

(32) Typically, the loudspeaker-microphone impulse response g(t) will last hundreds of milliseconds, and the window will preferably be a function of time t and frequency f that scales the measured impulse response. Denote by g.sup.(t, fb), b=1, 2, . . . N the measured impulse response g.sup.(t) split into a set of N frequency bands fb, for example using a filterbank, such that the sum of the band responses is the original measurement,
g.sup.(t)=Sum(g.sup.(t,fb)), b=1 to N.(8)

(33) In this case, the canceler response c*(t) is the sum of measured impulse response bands g.sup.(t, fb), scaled in each band by a corresponding window w*(t, fb). Expressed mathematically,
c*(t)=Sum(c*(t,fb)), b=1 to N,(9)
where
c*(t,fb)=w*(t,fb)g.sup.(t,fb),(10)
w*(t,fb)=g.sup.2(t,fb)/(g.sup.2(t,fb)+g.sup.2(t,fb))(11)

(34) Embodiments use the measured impulse g.sup.(t, fb) as a stand-in for the actual impulse g(t, fb) in computing the window w(t, fb). Alternatively, repeated measurements of the impulse response g(t, fb) could be made, with the measurement mean used for g(t, fb), and the variation in the impulse response measurements as a function of time and frequency used to form g.sup.2(t, fb). Embodiments also perform smoothing of g.sup.2(t, fb) over time and frequency in computing w(t, fb) so that the window is a smoothly changing function of time and frequency.

(35) It should be noted that the principles described above can be extended to cases other than a single microphone-loudspeaker pair, as shown in FIG. 3. Referring to FIG. 3, in the presence of L loudspeakers and M microphones, a matrix of loudspeaker-microphone impulse responses is measured, and used in subtracting auralization signal estimates from the microphone signals. Stacking the microphone signals into an M-tall column m(t), and the loudspeaker signals into an L-tall column l(t), the cancellation system becomes
l(t)=H(t)*m(t),(12)
d{circumflex over ()}(t)=m(t)C(t)*l(t),(13)
where H(t) is the matrix of auralizer filters of 304 and C(t) the matrix of canceling filters of 302. As in the single speaker-single microphone case, the canceling filter matrix is the matrix of measured impulse responses, each windowed according to its respective CSNR, which may be a function of both time and frequency.

(36) Moreover, a conditioning processor 308, denoted by Q, can be inserted between the microphones and auralizers,
l(t)=H(t)*Q(m(t)),(14)
d{circumflex over ()}(t)=Q(m(t))C(t)*l(t),(15)
as seen in FIG. 3. This processor 308 could serve a number of functions. In one example Q could act as the weights of a mixing matrix to determine how the microphones signals are mapped to the auralizers, and subsequently, the loudspeakers. For example, it might be beneficial for microphones that are on one side send the majority of their energy to loudspeakers on the same side of the room, as could be achieved using a B-format microphone array and Ambisonics processing driving the loudspeaker array. Another use could be for when the speaker array and auralizers are used to create different acoustics in different parts of the room. The processor Q could also be a beamformer or other microphone array processor to auralize different sounds differently according to their source position. Additionally, this mechanism allows the acoustic to be changed from one virtual space to another in realtime, both instantaneously or gradually.

(37) The signal flows of FIGS. 2 and 3 are straightforward to implement in any number of environments. A Max/MSP implementation of a single-microphone, single-loudspeaker canceling auralizer is shown in FIG. 4, in this example making use of Alexander Harker and Pierre Alexandre Tremblay, The HISSTools impulse response toolbox: Convolution for the masses, in Proceedings of International Computer Music Conference, 2012 (hereinafter [19]) for low-latency fast convolution.

(38) As shown in FIG. 4, the example implementation 400 includes auralizer chain 402 and canceler chain 404. The input microphone signal m(t) is digitized at 406 and provided to subtractor 408, which subtracts from it the output from canceler chain 404 including the cancelation filter c(t). The difference signal (e.g. the signal d{circumflex over ()}(t)) is then processed by auralizer chain 402 (including impulse response h(t)), whose output (e.g. the signal l(t)) is also provided to canceler chain 404. The final output (e.g. the signal l(t)) is then converted back to analog at output 410 and provided to the speaker.

(39) According to certain aspects, a system such as described above in connection with FIGS. 1 to 4 can be reconfigured and recalibrated for use in many different environments. For example, the system can be relocated from venue to venue or room to room, and/or the positions and numbers of the microphones and speakers can be changed. After making such changes, a straightforward system calibration process to be described in more detail below can be performed, and the system can then be used to perform auralization and feedback cancelation according to the embodiments in the changed configuration. This contrasts with conventional systems, which require very time-consuming and careful tuning processes.

(40) FIG. 5 is an example signal flow diagram illustrating a calibration process according to embodiments in a single microphone-speaker pair system such as that shown in FIGS. 1 and 2. Those skilled in the art will understand how to implement the process using any numbers of pairs of microphones and speakers after being taught by these examples. As shown in FIG. 5, to calibrate the system after being placed in an environment such as a room and a microphone-speaker pair, the canceler 502 impulse response c(t) may be set to a delayed pulse, and the auralizer 504 filter is turned off. In place of the output of auralizer 504, a sine sweep 506 s(t) is played through the speaker and the impulse response g(t) after subtractor 508 is measured at measurement point 510. Multiple measurements are obtained and used to derive the impulse response g(t, fb) and window w*(t, fb) and thus the canceler filter c*(t, fb) as described above and explained further below.

(41) FIG. 6 is a flowchart illustrating an example calibration methodology according to embodiments.

(42) As shown in FIG. 6, in 602 the system is set up in the desired venue/room/environment. This includes positioning the microphone(s) and speaker(s) in their desired locations. Thereafter, in 604, the system is configured to perform measurements, for example configuring the system according to the signal flow shown in FIG. 5 (e.g. setting the canceler 502 impulse response to a delayed pulse, turning off the auralizer 504 filter, and injecting a sine sweep to the speaker input).

(43) In 606, a single or multiple microphone-speaker pair impulse response measurement(s) are made using a sine sweep or other test signal, preferably covering the entire audio band, fed to the speaker(s). In embodiments, this can include dozens of measurements of the empty space or the space with audience member stand-ins to understand the variation over time of the impulse responses between each pair of the microphones and speakers.

(44) In 608, the impulse response measurements are used to derive the cancellation filter as a function of time t and frequency fb. For example, an average of the measured impulse responses can be used to derive g.sup.(t, fb), and the standard deviation of the measured impulse responses can be used to derive g.sup.2(t, fb). The optimal window w*(t, fb) may then be derived according to (11) described above. Finally, to find the cancellation filter c(t, fb), the measured impulse response g.sup.(t, fb) is shifted and scaled according to the amplitude and arrival time of the c(t)=(t) pulse in the measurement system. For example, FIG. 7 shows an example impulse response measurement (e.g. the signal r(t) in FIG. 5). More particularly, as shown in FIG. 7, the cancellation processor (e.g. output of subtractor 508) reproduces the impulse response 702 between the loudspeaker and microphone. The delayed pulse (t) of the canceler 502 convolution is also visible.

(45) An example cancellation impulse response c(t) obtained using the methodology described above is shown in FIG. 8A, and the associated spectrogram is shown in FIG. 8B.

(46) After obtaining the cancellation filter as described above, the system is configured for run mode in 610, for example in accordance with the signal flows of FIGS. 2 and 3.

(47) It is useful to anticipate the effectiveness of the virtual acoustics cancellation in any given microphone. Substituting the optimal windowing (7) into the expression for the canceler residual energy (6), the virtual acoustics energy in the cancelled microphone signal is expected to be scaled by a factor of
=g.sup.2/(g.sup.2+g.sup.2),(16)
compared to that in the original microphone signal. Note that the reverberation-to-signal energy ratio is improved in proportion to the measurement variance for accurately measured signals, i.e. g.sup.2<<g.sup.2. By contrast, when the impulse response is inaccurately measured, the reverberation-to-signal energy ratio is nearly unchanged, 1.

(48) As an example of the performance of the present embodiments, several versions of the system of FIG. 1 with one or two microphones and one or two loudspeakers were implemented in the CCRMA Listening Room and CCRMA Stage recital hall at Stanford University. The example was implemented using a single loudspeaker source, playing exponentially swept sinusoid test signals, and Suzanne Vega's Tom's Diner as dry program material.

(49) In a first test shown in FIGS. 9A to 9C, the impulse response of the room with the system active was measured. A sine sweep from a separate loudspeaker in the room was used to measure the impulse response between a room source and the canceling reverberator system microphone input (FIG. 9A, 902), and system room source estimate (FIG. 9A, 904). The corresponding spectrograms are also shown. More particularly, as seen in FIG. 9B, the room impulse response contains both the dry room response and the wet synthesized room acoustics of Memorial Church at Stanford University. The 4.5 s reverberation time is plainly visible. Also shown in FIG. 9C is the system dry signal estimate, d{circumflex over ()}(t). Compared to the virtual room impulse response, the canceler produces a substantially dry signal, canceling in excess of 30 dB of the simulated reverberation.

(50) FIGS. 10A to 10C illustrate another example response of the system to a dry source, Suzanne Vega's Tom's Diner. Spectrograms are shown for the microphone signal in FIG. 10A, the room signal estimate in FIG. 10B, and the synthetic acoustics projected into the room in FIG. 10C. Note that the room signal estimate contains little of the synthetic reverberation, and is effectively a mix of the dry Suzanne Vega track, and low-frequency ventilation noise also present in the room. As expected, the room response in FIG. 10C shows the imprint of the Memorial Church acoustics, as added by the system.

(51) To better understand the practical performance of the system, the present applicants made repeated measurements of the loudspeaker-microphone response at the CCRMA Stage in unoccupied and occupied conditions. FIG. 11A shows the mean room impulse response 1102, and FIG. 11B shows the impulse response energy, smoothed over a 10-millisecond-long Hanning window by curve 1104. The sample standard deviation is shown separately by curve 1106 for the unoccupied condition and by curve 1108 for the occupied condition. As can be seen, the impulse response variation is smallest relative to the impulse response energy near the beginning of the impulse response. Also, the variation for the occupied room is modestly larger as the room becomes mixed. As further seen in FIG. 11B, the canceler residual energy is small near the beginning of the response, and increases relative to the decreasing impulse response energy throughout the response, consistent with the notion that the beginning of the impulse response shows little variation.

(52) FIG. 12 shows results of the measurements described in connection with FIGS. 11A and 11B in alternate detail. Similar to curve 1104 in FIG. 11B, the smoothed energy of the mean loudspeaker-microphone impulse response is shown by curve 1202, together with the residual energy of suppressed loudspeaker signals for the unoccupied (curve 1204) and occupied (curve 1206) conditions. Note that the cancellation is most effective at the impulse response start, during which there is little variation.

(53) FIGS. 13A to 13C illustrate corresponding spectrograms for the measurements illustrated in connection with FIGS. 11 and 12 described above. More particularly, the loudspeaker-microphone impulse response spectrogram is shown in FIG. 13A along with the root mean square canceler residual for the unoccupied CCRMA Stage in FIG. 13B and occupied CCRMA Stage in FIG. 13C. Note that a substantial amount of the loudspeaker energy has been canceled, particularly at the impulse response beginning and for frequencies below about 2 kHz. Overall, the residual simulated acoustics energy present in the room signal estimate d{circumflex over ()}(t) was a little over 20 dB for the occupied CCRMA Stage, and slightly more than 22.5 dB for the unoccupied CCRMA Stage.

(54) An example of the ability of a system according to embodiments to suppress feedback resulting from creating a reverberant synthetic acoustic environment is described with reference to FIGS. 14A and 14B. More particularly, FIG. 14B shows a spectrogram of a recording of the inventive system operating in a small room simulating Stanford Memorial Church. Meanwhile, FIG. 14A shows a spectrogram of the same segment, but with the canceler component of the system switched off at about 500 ms, and then switched back on at about 3000 ms. Note the rapid build-up and subsequent suppression of feedback near 1800 Hz with the temporary removal of the cancellation processing.

(55) Although the present embodiments have been particularly described with reference to preferred examples thereof, it should be readily apparent to those of ordinary skill in the art that changes and modifications in the form and details may be made without departing from the spirit and scope of the present disclosure. It is intended that the appended claims encompass such changes and modifications.

System and method for augmenting an acoustic space

Assignee

Inventors

Cpc classification

Classification Explorer

H04S7/305

ELECTRICITY

Classification Explorer

H04R5/02

ELECTRICITY

Classification Explorer

H04R2227/007

ELECTRICITY

Classification Explorer

H04R3/02

ELECTRICITY

Classification Explorer

H04S7/307

ELECTRICITY

Classification Explorer

H04R27/00

ELECTRICITY

Classification Explorer

H04R3/04

ELECTRICITY

International classification

Classification Explorer

H04R3/02

ELECTRICITY

Classification Explorer

H04S7/00

ELECTRICITY

Classification Explorer

H04R3/04

ELECTRICITY

Classification Explorer

H04R5/02

ELECTRICITY

Abstract

Claims

Description