Artificial reverberator room size control
10559295 ยท 2020-02-11
Inventors
Cpc classification
H04S7/305
ELECTRICITY
H04S2420/01
ELECTRICITY
G10K15/10
PHYSICS
International classification
Abstract
In music recording and virtual reality applications, it is desired to control the perceived size of a synthesized acoustic space. In an embodiment, a room size parameter is used to modify characteristics of an artificial reverberator so as to affect a listener's sense of the size of the acoustic space. Properties of the reverberation related to perceived room size such as decay time and equalization are adjusted interactively.
Claims
1. An artificial reverberator comprising: an input for accepting a nominal room impulse response; an input for accepting a perceived room size control, wherein the perceived room size control specifies a perceived room size that is different than a nominal room size associated with the nominal room impulse response; and a convolution operation with a processing room impulse response, wherein the processing room impulse response is derived by resampling a generating room impulse response related to the nominal impulse response according to the perceived room size control.
2. The artificial reverberator of claim 1, wherein the generating room impulse response is the nominal room impulse response, augmented to have high-frequency or low-frequency content that may appear in the audio band under the resampling.
3. The artificial reverberator of claim 1, in which the generating room impulse has decay times that are manipulated versions of nominal impulse response decay times, wherein the manipulation is affected by the room size control.
4. The artificial reverberator of claim 1, wherein the processing room impulse has decay times that are adjusted according to the room size control.
5. A method for artificial reverberation comprising: identifying a nominal room impulse response; identifying a perceived room size control, wherein the perceived room size control specifies a perceived room size that is different than a nominal room size associated with the nominal room impulse response; forming a processing room impulse response by adjusting the nominal room impulse response according to the perceived room size control; and convolving a signal with the processing room impulse response.
6. The method for artificial reverberation of claim 5 further comprising resampling in forming the processing room impulse response.
7. The method for artificial reverberation of claim 5 further comprising adjusting reverberation time in a band of frequencies according to the perceived room size control in forming the processing room impulse response.
8. A method for adjusting a perceived room size of an artificial reverberator comprising: identifying a nominal reverberation decay time; identifying a perceived room size control, wherein the perceived room size control specifies the perceived room size that is different than a nominal room size associated with the nominal room reverberation decay time; identifying a decay time associated with air absorption; and forming a desired reverberation time by adjusting the nominal reverberation decay time according to the perceived room size control and taking into account the air absorption decay time.
9. The method of claim 8 further comprising: forming implicitly or directly an estimate of the nominal reverberation decay time not attributable to air absorption.
10. A system for artificial reverberation comprising a perceived room size control, wherein the perceived room size control specifies a perceived room size that is different than a nominal room size, and a network of delay lines, each having a nominal length associated with the nominal room size, and wherein the length of at least one of the delay lines is adjusted from the nominal length according to the perceived room size control.
11. The system for artificial reverberation according to claim 10 further comprising means to accept a nominal reverberation time; and a feedback filter associated with at least one of the delay lines, wherein the feedback filter is adjusted according to the perceived room size control to substantially achieve a reverberation time determined by the perceived room size control and the nominal reverberation time.
12. The system of claim 11 further comprising: feedback paths associated with outputs of at least two of the delay lines comprising the network and inputs of the at least two delay lines, wherein an amount of feedback from one of the at least two delay lines to another one of the at least two delay lines is adjusted according to the room size control.
13. A system for artificial reverberation comprising: a perceived room size control, wherein the perceived room size control specifies a perceived room size that is different than a nominal room size associated with a nominal room impulse response, a set of resonant filters representing modes of a room, the set of resonant filters being derived from the nominal room impulse response, each resonant filter characterized by a mode frequency and mode decay rate, and a means to adjust the mode decay rate associated with one of the resonant filters in response to the perceived room size control.
14. The system of claim 13 further comprising: a means to adjust the mode frequency associated with one of the resonant filters in response to the perceived room size control.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) These and other aspects and features of the present embodiments will become apparent to those ordinarily skilled in the art upon review of the following description of specific embodiments in conjunction with the accompanying figures, wherein:
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
DETAILED DESCRIPTION
(16) The present embodiments will now be described in detail with reference to the drawings, which are provided as illustrative examples of the embodiments so as to enable those skilled in the art to practice the embodiments and alternatives apparent to those skilled in the art. Notably, the figures and examples below are not meant to limit the scope of the present embodiments to a single embodiment, but other embodiments are possible by way of interchange of some or all of the described or illustrated elements. Moreover, where certain elements of the present embodiments can be partially or fully implemented using known components, only those portions of such known components that are necessary for an understanding of the present embodiments will be described, and detailed descriptions of other portions of such known components will be omitted so as not to obscure the present embodiments. Embodiments described as being implemented in software should not be limited thereto, but can include embodiments implemented in hardware, or combinations of software and hardware, and vice-versa, as will be apparent to those skilled in the art, unless otherwise specified herein. In the present specification, an embodiment showing a singular component should not be considered limiting; rather, the present disclosure is intended to encompass other embodiments including a plurality of the same component, and vice-versa, unless explicitly stated otherwise herein. Moreover, applicants do not intend for any term in the specification or claims to be ascribed an uncommon or special meaning unless explicitly set forth as such. Further, the present embodiments encompass present and future known equivalents to the known components referred to herein by way of illustration.
(17) According to certain aspects, the present embodiments relate to an artificial reverberator that provides control over the perceived size of the simulated space. It can also provide for the perceived room size to vary smoothly over time in response to a smoothly changing room size control. Additionally or alternatively, it provides a room size control that doesn't produce Doppler shifts in response to a continuously changing room size.
(18) Among other things, the present Applicant recognizes that the room response to a transient sound is often described as a sequence of events over time, a direct path followed by early reflections that give way to late-field reverberation, as seen in
(19) If the room size were, for example, doubled, with everything else remaining the same, then the timing of the direct path and early reflections would be stretched by a factor of two. Similarly, if the room size were doubled, then its volume would increase by a factor of eight, while its absorbing area would increase by a factor of four, thereby doubling the reverberation time.
(20) Reasoning along these lines was used by Spandcock [8] in building scale models of pro-posed concert halls to test how they might sound when built. Spandcock argued that a scale model of a concert hall made with the appropriate materials and filled with a dried gas would respond to a given high-frequency sound the way the larger actual space would respond to a low-frequency sound having the same relative wavelength. Spandcock describes using a magnetic tape deck to play back a sound into the scale model at, say, eight-times speed, while simultaneously recording the response in the model. The recording was then played back, slowed by the same factor. In this way, the original pitch was restored, and the reverberation time increased to match that of the hypothesized full-scale hall.
(21) As described in [1], this approach was independently discovered by Walter Murch while working as a sound editor for motion pictures in the late 1960s, and was used to make long-lasting reverberation. Spratt, et al. in [1] present a digital technique for implementing a real-time version of the method, using a loudspeaker and microphone in a physical room. The technique is described as being mathematically equivalent to stretching the room impulse response in time, which has the effect of increasing the reverberation time, and stretching the reflection arrival times.
(22) Spratt, et al. argue that the method similar to slowing the speed of sound or increasing the room size. However, doing either of these will not result in proper reverberation time as a function of frequency, as the relative absorption of sound by air and room materials will not be taken into account.
(23) The present embodiments may be implemented using any number of artificial reverberation methods. One idea is to warp the time and frequency axes, and adjust the decay times of a given reverberation impulse response according to a desired room size. In addition, the source loudness and radiation pattern may be adjusted according to the room size. Different architectures will be clear to those skilled in the art given the examples described below.
(24) Reverberation Time
(25) First, the change in reverberation time in response to a changing room size as a result of different relative contributions of materials absorption and air absorption is described.
(26) As described in [14] and elsewhere, the decay over time of well mixed acoustic energy in a room can be approximated by examining a room with volume V and having objects and surfaces with absorbing area A. The energy density w(t) as a function of time t is assumed to be well mixed, and independent of position within the room. After a period of time t, the total energy in the room, the product of the energy density and the volume, Vw(t+t), will be that at time t less what is lost due to interactions with absorbing surfaces and objects and air propagation,
Vw(t+t)=Vw(t)Acgw(t)tVw(t)t,(1)
where the term Acgt w(t) represents surface interaction absorption, and is proportional to the time interval t, sound speed c, a constant g, absorbing area A, and energy density w(t), and the term Vw(t) represents air absorption, and is proportional to the volume V, an absorption coefficient , time interval t, and energy density w(t). These absorption terms are intuitively sensiblethe greater the time interval, the more energy that can be absorbed; the greater the energy density, the more energy that can leave the space during the time interval. Rearranging terms, and taking t.fwdarw.0, gives
(27)
with w.sub.0 being the energy density at time t=0, and being a time constant increasing with increasing volume, and decreasing with increasing absorbing area,
(28)
(29) In other words, the energy density in a well mixed room with decay exponentially, decreasing by a factor of 1/e every units of time. It is typical to measure reverberation time in terms of the time taken for energy to decrease 60 dB, T.sub.60, in which case it follows:
(30)
for example, measured in units of seconds per 60 dB decay.
(31) Note that the energy density is also a function of frequency , w(t, ), which was dropped from the discussion here for simplicity of presentation. It carries over to frequency-dependent materials and air absorption simply by making the absorbing area A and air absorption a frequency-dependent.
(32) Consider a room described by a nominal length L.sub.0. Now scale the room and all of its surfaces and objects to have a new characteristic length L. It is desired to understand how the decay time changes with a changing room size L. Assuming that the room volume V is proportional to L.sup.3, and the absorbing area A proportional to L.sup.2, the decay time of the resized room T.sub.60(L) is then
(33)
where has been introduced to represent the materials absorption for the nominally sized room, and air absorption a and materials absorption have been expressed in terms of, for example, 60 dB decay per unit time.
(34) Note that the decay time at the nominal room size,
T.sub.0=T.sub.60(L.sub.0),(7)
may be estimated from the room impulse response or otherwise modeled, and that the air absorption a is known, for instance derived assuming a given temperature, pressure, and humidity, or tabulated [15, 16]. Accordingly, setting L=L.sub.0 and solving (6) for the unknown materials absorption gives
(35)
(36) Note that it might be that case that due to estimation errors, the reverberation time, T.sub.60(L.sub.0), might exceed the air absorption-only reverberation time 1/, and (8) would produce a negative value for . In these cases (or at such frequencies that this is true), a value of =0 is preferably used, and the reverberation time will not be affected by room size. If it is desired to have a changing reverberation time with room size, a small value for could be selected.
(37) Substituting for in (6) gives T.sub.60(L), the decay time as a function of room size L. In the case that is given by (8) and not modified, a little algebra gives an expression for T.sub.60(L) in terms of the nominal decay time T.sub.0 and the decay time if the only absorption of sound energy were due to air T.sub.air,
(38)
(39) As an example of a changing reverberation time as a function of room size, consider the reverberation time of a church with a 10-meter nominal size, shown as a line with markers in
(40) In an embodiment, the effect of a finite air absorption may be exaggerated or suppressed by reducing or increasing, or even replacing the air absorption characteristic shown in
(41)
(42) As an example of a changing room size reverberation time with a modified air absorption,
(43) Convolutional Implementation
(44) In the case of a convolutional reverberator [3, 4], the given or nominal room impulse response, h.sub.0(t), associated with a nominal room size L.sub.0, may be resampled according to the new room size L to produce an adjusted impulse response h.sub.L(t),
h.sub.L(t)=h.sub.0(t.Math.L.sub.0/L).(11)
(45) As seen in
(46) In the case that the room size L is smaller than the nominal room size L.sub.0, the resampling will shorten the impulse response, thereby increasing its bandwidth. Accordingly, the resampling preferably would include the step of low-pass filtering so as to avoid aliasing if the increased bandwidth exceeds the Nyquist limit.
(47) If L is larger than L.sub.0, then the resampled (i.e., interpolated) impulse response will be longer than the nominal impulse response, and have decreased bandwidth. In this case, the adjusted impulse response may be extended to the Nyquist limit by first estimating reverberation characteristics such as decay times, equalization, echo density, and the like for that band. For instance, the decay times may be assumed to decrease in a manner typical of air absorption with increasing frequency above the original bandwidth. Additionally (or alternatively), a trend or model could be fit to the decay characteristic of the nominal impulse response, and extended in frequency. Similarly, the equalization could be extrapolated to higher frequencies by noting the trend near the nominal band edge.
(48) The method described in [9] can be used to simulate an extended bandwidth for a room impulse response. Estimates of the echo density of the nominal impulse response are made, and used to synthesize a corresponding sequence of full-bandwidth echoes. This echo sequence (e.g., white Gaussian noise in the case of a perceived fully dense echo sequence) can then be split into frequency bands and windowed to generate a full-bandwidth room impulse response which sounds like the original. In this case, the high-frequency equalization and decay rates need to be synthesized, for example as described above.
(49) In addition to extending the impulse response bandwidth, it is often the case that the impulse response is measured in the presence of a noise floor. As a result many available reverberation impulse responses have an envelope comprising the exponential decay of the late-field reverberation followed by a constant-level background noise.
(50) The prior art mechanism of increasing the reverberation time by multiplying the reverberation impulse response by a growing exponential will generate unwanted artifacts, including a bloom in energy at the end of the impulse response [10, 11]. Resampling the impulse response as described above generally avoids this difficulty, though extending the impulse response to below the noise floor would benefit the present embodiments. It should be noted that such a mechanism for lengthening reverberation time, even when applied to a properly extended room response, is not preferred, as the timing of temporal features, such as significant early reflections, is not appropriately modified.
(51) As described above, the reverberation time of a larger or smaller room is roughly scaled by the relative change in size, and is affected by the different relative absorptions of air and materials, with materials absorption accounting for a greater portion of the decay in smaller rooms. The resampling of the impulse response described above has the effect of simultaneously stretching the reverberation time and compressing the associated frequency axis,
(52)
where {dot over (T)}.sub.60 (L, ) is the frequency-dependent reverberation time of the stretched impulse response h.sub.L(t), and T.sub.0() is that of the given impulse response h(t). For example, if a room impulse response were stretched by a factor of two, the reverberation time at 500 Hz would be twice that of the original impulse response at 1000 Hz. As a result, when the given reverberation time T.sub.0 () isn't relatively constant with frequency, the reverberation time produced by resampling h(t) will differ from the desired one given by (9), and it is preferred to modify the reverberation time of the stretched impulse response accordingly, see
(53) It should be pointed out that while Spratt and Abel [1] describe resampling the room impulse response as similar to changing the sound speed or resizing the room, this is only true if everything about the room is resized, including materials absorption features. In the present embodiments it is desired to scale the room size, without modifying the materials or air properties. It is thus preferred to correct the reverberation time produced by resampling as described above.
(54) Finally, Applicant notes that the method described in [9] to synthesize impulse responses from balloon pop recordings may be adapted to synthesize room impulse responses at different room sizes. The process is shown in
(55) To generate impulse responses at different room sizes, the same process is used, with the pulse times being scaled by the room size or with the echo density used to generate the pulse times being scaled by the inverse room size. This pulse sequence is processed as above, but with the band root energy envelopes resampled according to the room size ratio L/L.sub.0, and preferably the envelopes modified to bring the band reverberation times in line with the desired T.sub.60(L, ) described by (9) or (8) and (6).
(56) Delay Network Implementation
(57) As described in [2], reverberators are often implemented as networks of delay lines with filtering, mixing, and feedback. One such reverberator structure is the feedback delay network (FDN) [5], shown in
(58) Here, the filter q(z) is the late-field equalization, with the N delay lines z.sup.n, n=1, 2, . . . , N having delays n and feedback filtering g.sub.n (z). The feedback filters are typically designed so that they produce similar dB attenuation per unit delay time according to a desired decay time as a function of frequency [5]. The unitary matrix Q represents state mixing, and controls the rate of echo density increase. An identity mixing matrix Q=I feeds each delay line to itself with no mixing between delay lines, and produces a constant echo density. A Hadamard mixing matrix Q=H generates significant mixing between delay lines, producing a rapidly increasing echo density.
(59) To change the room size to L from a nominal L.sub.0, the delay line lengths can be changed proportionately, as seen in
(60)
(61) Interpolated delay lines can be used to implement the desired early reflection delay times, but allpass filters are suggested to implement any fractional portion of delays used in the feedback loop so as to prevent unwanted magnitude filtering that would affect the resulting decay time.
(62) The feedback filters g.sub.n(z) need not be modified, as the increased (or decreased) delay line lengths will result in proportionally longer (or shorter) decay times, the filters, in effect, being applied less (more) often. However, if desired, the feedback filters g.sub.n(z) can be modified so as to properly account for the effect of air absorption on the decay time. Additionally, note that by changing the feedback delay line lengths n, the mixing matrix Q need not be modified in response to a changing room size, as the room mixing time will simply scale with the delay line lengths.
(63) It might be the case that it is desired to leave the feedback delay lines fixed, independent of room size. In such scenarios, the preferred changes in reverberation time and echo density profile (e.g., mixing time) resulting from room size changes can be achieved by (i) adjusting the feedback filters g.sub.n(z), for example using [5], and (ii) modifying the mixing matrix Q so as to slow the state mixing for larger rooms, and speed state mixing for smaller rooms.
(64) Finally, it should be pointed out that if the delay line lengths are to be smoothly changed from one set of delay values to another, they may be smoothly interpolated between successive delay values or they may be crossfaded between successive delay values. Crossfading will eliminate Doppler shifts associated with a continuously changing delay, which might be desirable for music applications, but can introduce subtle artifacts during the crossfade period.
(65) Modal Implementation
(66) As presented in [6, 7], the modal reverberator implements reverberation as a parallel sum of resonant filters h.sub.m(t), each representing a room resonance or mode, and each characterized by a mode frequency .sub.m, mode decay rate .sub.m, and mode amplitude .sub.m,
(67)
where, for example,
h.sub.m(t)=.sub.m exp{j.sub.mt.sub.mt}.(15)
(68) A number of options are described for implementing such filters in [6, 7], including biquad structures, phasor filters, and heterodyning-modulation architectures.
(69) To implement a changing room size in a modal reverberator, the mode parameters are adjusted accordingly. The mode frequencies would be changed in inverse proportion to the varying room size,
(70)
(71) One way to understand this is to consider a closed path among a set of reflecting surfaces that creates a resonance. If the path length were twice as long, the associated travel time would be twice as long, and the frequency reduced to half its original value.
(72) The mode decay rates would be modified according to the scaled decay times at the new mode frequencies as described above in (6),
(73)
where the decay times T.sub.60(L, .sub.m(L)) can be found by interpolation if they are not directly available. The decay rates .sub.m(L) at room size L are then
(74)
(75) If the room size L is made smaller than the nominal room size L.sub.0, then the mode frequencies will be increased. Those modes with frequencies that become larger than the Nyquist limit can be eliminated, for instance, not computed or their amplitudes reduced to zero.
(76) If the room size L is made larger than the nominal room size L.sub.0, then the mode frequencies will be decreased. Those modes with frequencies that become smaller than the audio band lower limit, or the lower limit of what can be reproduced with the target sound reproduction system, can be eliminated. As in the case of manipulating an impulse response in response to room size changes described above, an increase in room size may significantly reduce the bandwidth of the modal reverberator response, and additional bandwidth would be preferably created. This may be done by synthesizing additional high-frequency modes, for example statistically generating 240 additional mode frequencies per octave with decay rates extrapolated from those at lower frequencies. (Similarly, lower-frequency modes may be synthesized in the case of rooms that are smaller than the nominal size L.sub.0.)
(77) As an alternative to eliminating and synthesizing modes to accommodate a changing room size, the mode frequencies .sub.m can be warped within the audio band to generate new frequencies v.sub.m, for example according to a first-order allpass characteristic,
(78)
(79) Here, the allpass parameter is chosen according to the room size ratio L/L.sub.0,
(80)
(81) Doing so will scale the low frequencies according to the desired linear characteristic
(82)
with the high frequencies being warped to map the band edge onto the band edge v.
(83) Note that if it is desired to retain the original reverberation equalization, the mode amplitudes can be adjusted with room size to account for the changing equalization resulting from a changing modal density: Where the modal density is increased, the mode energy (the square of the mode magnitude) is proportionally increased.
(84) The mode filter parameters may be changed continuously, though to eliminate the associated Doppler shifts, it is suggested that mode filters at successive parameter sets be run in parallel and crossfaded. It could also be useful to modify the mode decay times without changing the mode frequencies.
(85) Finally, the circumstance in which only aspects of the room were made larger or smallerfor example only a pair of walls being moved further apartcan be accommodated by having certain modes be unaffected or only modestly affected. Similarly, in the delay network reverberator structures above, only certain delay lines could be affected or others only modestly affected by a changing room size.
(86) Although the present embodiments have been particularly described with reference to preferred ones thereof, it should be readily apparent to those of ordinary skill in the art that changes and modifications in the form and details may be made without departing from the spirit and scope of the present disclosure. It is intended that the appended claims encompass such changes and modifications.
(87) The following publications have been identified by the corresponding reference numerals in brackets [ ] in the above descriptions, and the details below are incorporated into the descriptions above: [1] Kyle S. Spratt, Jonathan S. Abel, All natural room enhancement, in Proc. of the International Computer Music Conference (ICMC 2009), Montreal, Canada, Aug. 16-21, 2009. [2] Vesa Valimaki, et al., Fifty years of artificial reverberation, IEEE Trans. Audio, Speech, and Lang. Proc., Vol. 20, No. 5, July 2012. [3] William Gardner, Efficient convolution without input-output delay, Journal of the Audio Engineering Society, Vol. 43, Issue 3, pp. 127-136, March 1995. [4] Guillermo Garcia, Optimal filter partition for efficient convolution with short in-put/output delay, in Proc. 113th Convention of the Audio Engineering Society, Paper No. 5660, Los Angeles, October 2002. [5] feedback delay network reverberator [6] Jonathan S. Abel, Sean A. Coffin, Kyle S. Spratt, A Modal Architecture for Artificial Reverberation with Application to Room Acoustics Modeling, in Proc. 137th Convention of the Audio Engineering Society, Los Angeles, October 2014. [7] Jonathan S. Abel, Method and system for artificial reverberation using modal decomposition, U.S. Pat. No. 9,805,704, Oct. 31, 2017. [8] Friedrich Spandock, Method and Apparatus for Determining Acoustic Effects, U.S. Pat. No. 3,139,151, June 1964. [9] Jonathan S. Abel, et al., Estimating room impulse responses from recorded balloon pops, in Proc. 129th Convention of the Audio Engineering Society, Paper No. 8171, San Francisco, November 2010. [10] Nicholas J. Bryan, Jonathan S. Abel, Methods for Extending Room Impulse Responses Beyond Their Noise Floor, in Proc. 129th Convention of the Audio Engineering Society, Paper No. 8167, San Francisco, November 2010. [11] Elliot Kermit Canfield-Dafilou and Jonathan S. Abel, On restoring prematurely truncated sine sweep room impulse response measurements, in Proceedings of the 20th International Conference on Digital Audio Effects (DAFx-17), Edinburgh, UK, Sep. 5-9, 2017. [12] James A. Moorer, About This Reverberation Business, Computer Music Journal, Vol. 3, No. 2, pp. 13-28, June 1979. [13] Jeffrey Borish, Extension of the image model to arbitrary polyhedra, J. Acoust. Soc. Am., vol. 75, no. 6, pp. 1827-1836, June 1984. [14] Wallace C. Sabine, Theatre Acoustics, in Collected Papers on Acoustics, Dover Publications Inc., New York, 1964. [15] American National Standards Institute, Committee 51, Acoustics, Method for Calculation of the Absorption of Sound by the Atmosphere, ANSI 51.26-1995, New York, N.Y.: American National Standards Institute, September, (1995). [16] International Organization for Standardization, Committee ISO/TC 43, Acoustics, Sub-Committee SC 1, Noise, Acoustics Attenuation of sound during propagation outdoors-Part 1: Calculation of the absorption of sound by the atmosphere, ISO9613-1, Geneva, Switzerland: International Organization for Standardization, (1993).