Apparatus for reproducing a multi-channel audio signal and a method for producing a multi channel audio signal

10939223 ยท 2021-03-02

Assignee

Inventors

Cpc classification

International classification

Abstract

A method for producing a multi-channel audio signal from one or more sound object signals is disclosed, where, for each sound object signal a plurality of width signals is produced, the amplitudes of the width signals following a substantially Gaussian distribution. The width signals are processed to produce a plurality of pan signals which are mapped to at least one channel. Each channel in the audio signal is produced by combining the pan signals from each sound object. An apparatus for reproducing such a multi-channel audio signal is also disclosed, in which a plurality of first loudspeakers are provided spaced around a first arc forward of a predetermined listening zone, each of the first loudspeakers facing towards the listening zone and substantially equidistant therefrom. A plurality of second loudspeakers are provided spaced around a second arc behind the listening zone, each of the second loudspeakers facing towards the listening zone. An amplifier produces an amplified signal from each channel in the audio signal, each amplified signal being provided to a corresponding first or second loudspeaker whereby in use each sound object is reproduced by one or more loudspeakers such that the SPL at a point spaced from the apparatus is less than the SPL at the listening zone.

Claims

1. A method for producing a multi-channel audio signal from one or more sound object signals, comprising: for each sound object signal: producing a plurality of de-correlated width signals, wherein the amplitudes of said width signals follows a substantially Gaussian distribution, wherein the substantially Gaussian distribution follows a user-configurable standard deviation; normalizing the amplitudes of the width signals such that an amplitude of the sum of the width signals is equal to the amplitude of the sound object signal; processing the plurality of width signals to produce a plurality of pan signals, each pan signal being mapped to at least one channel; for each channel in the audio signal, combining the pan signals from each sound object for that channel.

2. The method of claim 1, wherein each width signal is de-correlated by adding to each width signal a different phase offset, and altering the phase offset of each width signal with a period T.

3. The method of claim 1, wherein the user-configurable standard deviation is configurable for each sound object signal.

4. The method of claim 1, further comprising processing each sound object signal to produce a depth-corrected signal, and producing the plurality of width signals from the depth-corrected signal.

5. The method of claim 4, wherein each sound object signal is processed to produce two depth-corrected signals, a direct signal and a reverberant signal, wherein the plurality of de-correlated width signals are produced from the direct signal, and wherein the reverberant signal is processed to produce a plurality of de-correlated reverberant output signals, each de-correlated reverberant output signal being mapped to at least one channel in the audio signal.

6. The method of claim 1, wherein the step of producing a plurality of de-correlated width signals further comprises processing each sound object signal using a crossover filter to produce a low frequency part and a high frequency part, the plurality of de-correlated width signals being produced from the high frequency part.

7. The method of claim 5, wherein an odd plurality of de-correlated width signals are produced, wherein the low frequency part is applied to a middle signal of the odd plurality of de-correlated width signals.

Description

BRIEF DESCRIPTION OF THE FIGURES

(1) The invention will now be described, by way of example, with reference to the accompanying drawings, in which:

(2) FIG. 1 is a top view, partially cut away, of an apparatus for reproducing a multi-channel audio signal according to one embodiment of the invention;

(3) FIG. 2 is a perspective rear view of the apparatus in FIG. 1;

(4) FIG. 3 is a perspective front view of the apparatus in FIG. 1;

(5) FIG. 4 is shows room sound pressure levels (SPL) when the apparatus of FIG. 1 is in use;

(6) FIG. 5 is shows comparable room SPL using conventional stereophonic loudspeakers and audio system;

(7) FIG. 6 is shows comparable room SPL using conventional multi-channel loudspeakers and audio system; and

(8) FIG. 7 is a signal processing diagram showing a method for producing a multi-channel audio signal according to one embodiment of the invention.

DESCRIPTION OF PREFERRED EMBODIMENTS

(9) FIGS. 1 to 3 show an apparatus 10 for reproducing a multi-channel audio signal according to the embodiment. The apparatus 10 comprises a plurality of first loudspeakers 12 provided spaced around a first arc 14. Each of the first loudspeakers 12 face towards a listening zone 16 provided within the apparatus 10. The first loudspeakers 12 are preferably each substantially equidistant from the listening zone 16. The first arc 14 is preferably circular as shown in the drawings; however, elliptical or other arcuate curves may also be used.

(10) A plurality of second loudspeakers 18 are provided spaced around a second arc 20. Each of the second loudspeakers 18 faces towards the listening zone 16.

(11) A listener 22 is shown in FIG. 1 in the listening zone 16 facing towards the first loudspeakers 12. Throughout the specification, the terms forward and behind are used relative to the listening zone 16 according to the orientation of the listener 22 shown in FIG. 1.

(12) As seen in FIG. 1, the first loudspeakers 12 are positioned forward of the listening zone 16 and surround the forward 180 from the listening zone 16. The second loudspeakers 18 are positioned behind the listening zone 16. In the embodiment, thirteen (13) first loudspeakers 12 and five (5) second loudspeakers 18 are used, though other quantities may be used. It is preferred that at the number of first and second loudspeakers should be at least thirteen, however.

(13) Two low frequency drivers 24 are provided, to either side of and behind the listening zone 16 in an enclosure 26. The low frequency drivers 24 are configured as subwoofers. The second loudspeakers 18 are also provided in the enclosure 26.

(14) The second arc 20 shown in FIG. 1 has a larger radius than the first arc 14. The loudspeakers 18 are positioned closer to the listener 22 than the loudspeakers 12. This reduces the size of the apparatus 10, enabling installation in smaller rooms, without affecting the sound reproduction as experienced by the listener.

(15) An amplifier 28 produces amplified signals from each channel in the audio signal. Preferably, the audio signal has a separate channel for each loudspeaker 12, 18 and 24. Thus, the amplifier 28 provides a separate, amplified signal to each loudspeaker and to the subwoofers. The amplifier 28 is housed behind the listening zone 16 in the enclosure 26. The term amplifier 28 encompasses a multi-channel amplifier, multiple single-channel amplifiers, or a combination of both. Class D amplifiers are preferred for efficiency although other classes may be utilised.

(16) The apparatus 10 has a base 30 on which the enclosure 26 is mounted. Each first loudspeaker 12 is provided in an enclosure 32 mounted to the base 30. Adjacent enclosures 32 are connected via plates 34 extending between their top surfaces. When mounted in this manner, the enclosures 32 form a continuous arc.

(17) The multi-channel audio signal consists of one or more sound objects. Each sound object is present in a plurality of channels of the audio signal as will be described in more detail below.

(18) When the audio signal is reproduced by the apparatus 10, each sound object is reproduced by one or more loudspeakers 12, 18. The sound from each loudspeaker converges on the listening zone 16. Since each loudspeaker 12 is substantially equidistant from the listening zone 16, sounds from adjacent loudspeakers 12 reproducing a sound object arrive at the listening zone 16 at the same time and will add constructively at the listening zone 16.

(19) When the apparatus 10 reproduces the audio signal, the SPL at a point spaced from the apparatus 10 is less than the SPL at the listening zone 16. Two factors contribute to this effect. First, the listening zone 16 is substantially equidistant from the loudspeakers 12 such that their sound outputs combine within the listening zone 16, while at other locations there will be different path lengths from each loudspeaker resulting in some destructive interference. Secondly, the loudspeakers are located near and oriented towards the listening zone 16, while outside the apparatus 10 the average distance to the loudspeakers increases with increasing distance from the apparatus, resulting in a reduced SPL.

(20) FIGS. 4 to 6 show the results of SPL modelling in a 50 m.sup.2 room. In each of these figures, the model was set to produce an SPL of 125 dB at the listening zone, and the SPL throughout the room was then calculated.

(21) FIG. 4 shows the SPL using the apparatus 10, in which the SPL at the walls of the room is at least 10 dB and up to 15-20 dB lower than the listening zone. FIG. 5 shows the SPL using a traditional stereophonic arrangement. The SPL is greatest in this arrangement in the immediate vicinity of the loudspeakers and adjacent walls. FIG. 6 shows the SPL in typical multi-channel systems with loudspeakers at the periphery of the room. As shown, the SPL throughout the room and the walls is relatively even.

(22) Production of conventional audio signals involves arranging monaural tracks, with each track representing a sound object; such tracks are also referred to as sound object signals. For a studio recording, there would be a track for each instrument and vocal singer. The sound engineer arranges these tracks, adjusting relative amplitudes. The tracks are then mixed together and reduced to the number of channels using amplitude panning techniques.

(23) The preferred method of producing an audio signal according to the embodiment involves three process stages applied to the track for each sound objectdepth, width and pandescribed below with reference to FIG. 7.

(24) Depth:

(25) Each track, or sound object signal, is filtered via a low pass second order IIR filter 102, a low shelf second order IIR filter 104 and a high shelf second order IIR filter 106. These filters 102, 104 and 106 are applied in order to represent frequency variations that occur when the distance to a sound source increases. A gain stage 108, provided at the output of the filter 106, produces two depth-corrected output signals, referred to as direct and reverberant signals.

(26) Examples of filters 102, 104 and 106 and gain stage 108 are given below for a depth parameter d having a value between 0 and 1, where 0 is close to the listener and 1 is far away:

(27) Filter 102 may be a Butterworth 2nd order low pass filter with a cut-off frequency fc, where fc=20 kHz if d<=0.2, and fc=20 kHz-15 kHz*(d0.2)/0.8 if d>0.2.

(28) Filter 104 may be a low Shelf second order IIR filter with a corner frequency of 80 Hz, Q=0.5, and gain (dB)=3.0*(1.05*d).sup.2 if d<=0.2, and gain (dB)=6.0*((d0.2)/0.8).sup.2 if d>0.2.

(29) Filter 106 may be a high shelf second order IIR filter with a corner frequency of 16 kHz, Q=0.5, and gain (dB)=6.0*(1.05*d).sup.2 if d<=0.2, and gain (dB)=0.0 if d>0.2.

(30) Gain stage 108 may be a simple gain control where gain (dB)=3.0*(1.05*d).sup.2 if d<=0.2, and gain (dB)=12.0*((d0.2)/0.8).sup.2 if d>0.2.

(31) It should be appreciated that the above values are one example only, and other values may be used.

(32) The direct signal is passed to the Width stage described below. The reverberant signal is processed using an acoustic space simulator 110. The simulator 110 adds a configurable amount of reverberation. Balancing the amplitudes of the direct and reverberant signals, for example in the gain stage 108, provides an additional sense of depth. The simulator 110 uses a 1 input, n outputs algorithm. The n outputs have similar energy content, but are de-correlated using feedback delay networks with a different time constants for each output.

(33) The de-correlated nature of the n outputs enables them to be played by adjacent loudspeakers without affecting the listener 22's location of the sound object (which is located by the direct signal), whilst contributing to focussing acoustic energy at the listening zone 16 and providing a sense of depth. Typically, n<13 and the n outputs may be mapped to all channels in the audio signal, with several of them being fed by the same output. Alternatively, the n outputs may be mapped to a subset of these channels using, for example, standard audio panning techniques.

(34) Width:

(35) The direct signal from the depth stage is input to a fourth order crossover filter 112 that splits the signal into two bands: a low frequency (LF) part, and a high frequency (HF) part. The crossover frequency of the filter 112 is chosen so that it is below the spatial aliasing frequency f.sub.a=2c/d.sub.speaker, where f.sub.a is the spatial aliasing frequency, c is the speed of propagation of sound in air, and d.sub.speaker is the distance between the centers of two adjacent speakers. In the embodiment, the f.sub.a is approximately 500 Hz, but nothing prevents use of a lower frequency.

(36) The HF part of the signal is passed through k parallel gain stages 114, to produce k signals, with FIG. 7 drawn for the instance of k=5. The gain stages 114 apply gains to each of the k signals following a Gaussian distribution, whose standard deviation is controlled by an adjustable Width parameter. It is preferred that the gains of the gain stages 114 are normalised such that the sum of the k gain stage 114 outputs does not show any amplitude deviation from the HF input signal. The greater the value of the Width parameter, the more even the distribution of gains applied by the gain stages 114. This results in more control over the SPL outside the apparatus 10.

(37) It is preferred that k is an odd number, so that the middle of the k signals has a greater amplitude than the other of the k signals, which aids the listener 22 to locate the sound object. In other embodiments, values of k other than 5 may be used.

(38) Each of the k signals passes through one of k all-pass FIR filters 116. Each FIR filter 116 alters the phase of the incoming signal with a spectral period T and a different initial phase offset compared to the other FIR filters 116 to produce one of k width signals, shown in FIG. 7 at 118. The k width signals are de-correlated in phase due to the effect of the filters 116. Phase oscillation patterns such as sinusoids can be used, as well as other phase oscillation patterns.

(39) The effect of the Width processing stage is to produce k width signals with relative phase properties to enable their playback on k adjacent loudspeakers of the apparatus 10, without creating frequency cancellations in the listening zone 16.

(40) FIG. 7 shows the LF part being summed to the middle signal of the k signals. In other embodiments, the LF part could be applied to more than one of the k signals or follow the same gain/pan distribution as the HF part described above.

(41) Pan:

(42) The k width signals are each passed through a second order IIR low shelf filter 120 and gain stage 122 to produce k pan signals. The filter 120 provides a low-frequency gain correction that reduces the change in tonality of a sound object when panned across loudspeakers 12, 18. Typically, the gain of the filter 120 is 3 dB when an object is equidistant to its two closest speakers.

(43) Next, standard amplitude panning techniques are used to map the k pan signals to channels in the audio signal. The k pan signals are panned with an angular step corresponding to the angular distance between loudspeakers 12, 18 depending on the location of the sound object. This results in a set of signals, in k or k+1 of the channels in the audio signal, with similar energy content but de-correlated in phase. This contributes to focussing acoustic energy at the listening zone. The listener's ability to locate the sound object is unaffected: the listener will determine the location of a sound object based on the loudest apparent source of sound; the de-correlated signals to either side of the loudest signal for each sound object to not affect the listener's location of the sound object since de-correlated sound has no apparent location to a listener.

(44) The above processing is performed for each sound object, and the outputs combined for channel to produce the multi-channel audio signal. This processing technique provides a sound stage with superior three-dimensionality, enhanced user ability to locate each sound object with precision, while maintaining a precise control of how the acoustic energy spreads outside the apparatus.

(45) While aspects of the present disclosure have been particularly shown and described with reference to the embodiments above, it will be understood by those skilled in the art that various additional embodiments may be contemplated by the modification of the disclosed apparatuses, systems and methods without departing from the spirit and scope of the invention as defined by the claims.