Distributed spatial audio decoder
09697844 ยท 2017-07-04
Assignee
Inventors
Cpc classification
H04S5/005
ELECTRICITY
H04R2420/07
ELECTRICITY
G10L19/008
PHYSICS
International classification
Abstract
This invention describes a method for decentralized decoding of a multichannel audio signal by broadcasting the original encoded data and distributing the decoding process between a plurality of receiving units. This allows for the design and manufacture of scalable multichannel audio reproduction systems having an arbitrary number of output channels, composed of a plurality of generic decoder and loudspeaker units each generating fewer output channels. With distributed decoding, a manufacturer can use off-the-shelf stereo or mono signal processors, digital-to-analog converters and amplifier components in each generic decoding module, thus reducing manufacturing costs and complexity requirements for each module while offering unlimited scalability in the total number of output channels.
Claims
1. A method for reproducing multichannel audio, comprising: transmitting a multichannel audio encoded source signal to a plurality of decoder processing units in a decentralized decoder system for decentralized decoding of the multichannel audio source signal, each decoder processing unit in the plurality of decoder processing units including an output channel associated with a different physical location in a listening environment, each decoder processing unit being aware of its relative physical location in the listening environment, wherein the decentralized decoder system is configured such that if an initial setup for the plurality of decoder processing units is modified, the decentralized decoder system can reconfigure parameters based on either detecting new positions and number of decoder processing units or detecting new positions and number of output channels in the modified setup, wherein an output signal from each output channel is determined by the output channel position while the source signal is independent of the output channel positions in the listening environment, and wherein the output channel position is determined automatically.
2. The method of claim 1, further comprising: receiving the multichannel audio encoded source signal by the plurality of decoder processing units; and decoding the multichannel audio encoded source signal in determining and generating the output signal.
3. The method of claim 2, further comprising: converting the output signal into a different signal type.
4. The method of claim 3, further comprising: amplifying the converted output signal.
5. The method of claim 2, further comprising: virtualizing the output signal.
6. The method of claim 1, wherein each output channel position corresponds to a loudspeaker position in the listening environment.
7. The method of claim 1, wherein the multichannel audio encoded source signal is 2-channel encoded material.
8. The method of claim 7, wherein the 2-channel encoded material is 2-channel phase-amplitude encoded material.
9. The method of claim 1, wherein the at least two output channels have corresponding positions on a common vertical axis in the listening environment.
10. The method of claim 1, wherein the at least two output channels have corresponding positions on different vertical axes in the listening environment.
11. A system for multichannel audio reproduction, comprising: a distributed network of multichannel audio decoders in a decentralized decoder system for decentralized decoding of an encoded audio data stream, where each decoder is operable to receive an identical version of the encoded audio data stream and reproduce only the audio signals from the encoded audio data stream that are relevant for an associated loudspeaker with an identified physical location relative to a reference position in a listening environment, wherein the identified physical locations are detected and determined automatically and each loudspeaker is aware of its relative physical location in the listening environment, wherein the decentralized decoder system is configured such that if an initial setup for the distributed network of multichannel audio decoders is modified, the decentralized decoder system can reconfigure parameters based on either detecting new positions and number of multichannel audio decoders or detecting new positions and number of loudspeakers in the modified setup.
12. The system of claim 11, further comprising: a network of transaural filters for virtualizing the reproduced audio signals, the network of transaural filters being coupled to the distributed network of multichannel audio decoders.
13. The system of claim 11, wherein the distributed network of multichannel audio decoders are implemented in a wireless setup.
14. The system of claim 11, wherein the distributed network of multichannel audio decoders are implemented in a wired setup.
15. The system of claim 11, wherein the identified positions are determined prior to reproducing the audio signals.
16. The system of claim 11, wherein the multichannel audio decoders are frequency-domain phase-amplitude decoders.
17. The system of claim 11, wherein the encoded audio data stream is a 2-channel encoded material.
18. A method for reproducing a multichannel audio signal, comprising: broadcasting via a wireless stereo audio transmitter a two-channel phase-amplitude encoded audio signal; receiving, via a plurality of stereo wireless receivers in a decentralized decoder system for decentralized decoding of the encoded audio signal, the encoded audio signal; and processing via a phase-amplitude stereo decoder the received audio signal, wherein the processing decodes only the audio signals relevant for at least two associated loudspeakers with different corresponding channels and identified physical locations in a listening environment, wherein the identified physical locations are detected and determined automatically and each loudspeaker is aware of its relative physical location in the listening environment, wherein the decentralized decoder system is configured such that if an initial setup for the plurality of stereo wireless receivers is modified, the decentralized decoder system can reconfigure parameters based on either detecting new positions and number of stereo wireless receivers or detecting new positions and number of loudspeakers in the modified setup.
19. The method of claim 18, wherein the decoded audio signals are determined by the position of at least one loudspeaker relative to a reference position.
20. The method of claim 19, wherein each decoder is coupled to a network of transaural loudspeaker virtualization processors.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1)
(2)
(3)
(4)
(5)
DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS
(6) Reference will now be made in detail to preferred embodiments of the invention. Examples of the preferred embodiments are illustrated in the accompanying drawings. While the invention will be described in conjunction with these preferred embodiments, it will be understood that it is not intended to limit the invention to such preferred embodiments. On the contrary, it is intended to cover alternatives, modifications, and equivalents as may be included within the spirit and scope of the invention as defined by the appended claims. In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention. The present invention may be practiced without some or all of these specific details. In other instances, well known mechanisms have not been described in detail in order not to unnecessarily obscure the present invention.
(7) It should be noted herein that throughout the various drawings like numerals refer to like parts. The various drawings illustrated and described herein are used to illustrate various features of the invention. To the extent that a particular feature is illustrated in one drawing and not another, except where otherwise indicated or where the structure inherently prohibits incorporation of the feature, it is to be understood that those features may be adapted to be included in the embodiments represented in the other figures, as if they were fully illustrated in those figures. Unless otherwise indicated, the drawings are not necessarily to scale. Any dimensions provided on the drawings are not intended to be limiting as to the scope of the invention but merely illustrative.
(8) In general, the present invention provides a multichannel speaker system where each speaker is aware of its position relative to some reference and decodes the audio signals most relevant for that position. Each speaker receives the same encoded data stream but only decodes/outputs the portions of that stream associated to its position. Specifically, each decoder is configurable to produce particular output channels without deriving any of the other ones. The encoded audio stream could be analogue, digital, compressed, stereo, multichannel, etc.
(9) In accordance with one embodiment of the present invention, provided are a method and system comprising a plurality of multichannel audio decoders where each decoder receives the same encoded audio data stream and reproduces only the audio signals relevant for an associated loudspeaker signal output (or a subset of loudspeaker outputs) identified by the position of the associated loudspeaker(s) relative to some reference position.
(10) In accordance with another embodiment of the present invention, provided are a method and system for multichannel audio reproduction comprising a wireless stereo audio transmitter broadcasting a two-channel phase-amplitude encoded audio signal generated, for instance, with an embodiment of the encoder described in U.S. patent application Ser. No. 12/246,491. This broadcast is received by a plurality of separate stereo wireless receivers. The received stereo audio is further processed by a phase-amplitude stereo decoder, such as an embodiment of the decoder described in U.S. patent application Ser. No. 12/246,491, which decodes only the audio signals most relevant for a predetermined position, or a predetermined subset of positions, usually determined by the position of at least one loudspeaker relative to a reference position.
(11) In accordance with another embodiment of the present invention, the plurality of wireless stereo loudspeaker units each contain a stereo wireless receiver, a decoder (e.g., a phase-amplitude decoder such as an embodiment of the decoder described in U.S. patent application Ser. No. 12/246,491), and a network of transaural loudspeaker virtualization filters that provide the perception of more loudspeakers than are physically present in vicinity around the physical location of the reproducing stereo loudspeaker.
(12) To begin,
(13) In some embodiments, two or more channels are reproduced in some DSP and amplification units. This allows a potentially more economical use of common/commodity stereo audio parts to be used in the system, such as stereo DACs and amplifiers. Such an embodiment is illustrated in
(14) In some embodiments, the encoded audio stream transmission is wired and distributed centrally or in a daisy chain from decoder to decoder by means of a SPDIF signal repeater.
(15) In some embodiments, each loudspeaker unit includes post-processing to recalibrate the decoded output signal in order to compensate for improper loudspeaker setup.
(16) The multichannel audio encoding format may be any analog or digital format, e.g. DTS, Dolby Digital, MP3 Surround, MPEG Surround, Microsoft WAV Extensible, WMA etc.
(17) In some embodiments, the soundtrack is broadcast to a plurality of receivers and decoders as part of a public performance installation, such as a movie theater. Possible digital protocols used for broadcast and receipt of the wireless signals might include SPDIF, HDMI, Bluetooth AD2P, Satellite or HD radio, 802.11x, 2.4 GHz etc.
(18) In another preferred embodiment, the source material represents the streamed or stored output of a phase-amplitude 3-D stereo matrix encoder described in U.S. patent application Ser. No. 12/246,491. The encoded material may have originated from a discrete multichannel movie, game or music soundtrack or the encoder may have been a part of a real-time multichannel mixing engine in applications such as interactive gaming. The resulting stereo signal is transmitted wirelessly to a network of receivers, each having an associated subset of decoders, amplifiers and loudspeakers. The stereo signal can be transmitted and received using analog or digital transmission methods. Digital representations can also be compressed before transmission using algorithms such as AAC, MP3 or WMA. The output of each wireless receiver is followed by a DSP which implements a frequency-domain phase-amplitude stereo decoder such as an embodiment of the methods described in U.S. patent application Ser. No. 12/246,491. As described in U.S. patent application Ser. No. 12/246,491, such a decoder is capable of rendering an arbitrary number of output channels, adapting each decoded output for the position of the associated loudspeakers. This property of the decoder results in a scalable, self-configuring, multichannel loudspeaker playback system employing a distributed decoding method according to the present invention.
(19) As shown in
(20) In some embodiments, there is a smaller or larger number of loudspeaker elements 316b on each loudspeaker bar 306b, possibly a single element. In some embodiments, the system comprises a smaller or larger number of subwoofers 306a, 316a.
(21) In some embodiments, the reproduction system is self configuring in that it can sense the initial setup, addition, removal or malfunction of decoder/loudspeaker units and specify or re-specify the parameters of each of the units in the system as a result. That is, the system can self configure based on the position and number of speakers present. Any technique may be used by the DSP system to detect speaker location. For example, speaker location detection techniques may include use of an acoustic calibration test, machine vision technologies, IR, cameras, wireless receiver triangulation, or simple channel labeling (FL, C, FR, SR, SL, etc.).
(22) In another embodiment (illustrated in
(23) In some embodiments, the top loudspeaker unit is not present and the phase-amplitude decoders 410 associated with the front and back loudspeaker units 406 both render the top-left, top-right, and top-center channel signals. The virtual loudspeaker virtualization block for the front and back loudspeaker units now also implement virtual top-left, top-right, and top-center speakers. Since, both the front and back loudspeaker units virtualize the top loudspeaker locations, the gains of the top channels outputs of the decoders can be power-normalized. In some embodiments, a greater or lower number of loudspeaker units 406 are present, each rendering a greater or lower number of virtual loudspeaker positions.
(24) Although the foregoing invention has been described in some detail for purposes of clarity of understanding, it will be apparent that certain changes and modifications may be practiced within the scope of the appended claims. Accordingly, the present embodiments are to be considered as illustrative and not restrictive, and the invention is not to be limited to the details given herein, but may be modified within the scope and equivalents of the appended claims.