H04S3/008

Spatial audio data exchange

A device includes one or more processors configured to execute instructions to obtain, at a first audio output device, first spatial audio data and a first reference time associated with the first spatial audio data, and to cause the first reference time and data representing at least a portion of the first spatial audio data to be transmitted from the first audio output device. The instructions further cause the one or more processors to receive, at the first audio output device from a second audio output device, second spatial audio data and a second reference time. The instructions further cause the one or more processors to, based on the first reference time and the second reference time, time-align the first spatial audio data and the second spatial audio data to generate combined audio data representing a three-dimensional (3D) sound field and to generate audio output based on the combined audio data.
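The time-alignment step can be illustrated with a minimal sketch. It assumes each reference time is the capture time, in seconds, of the first sample of its stream, that both streams share one sample rate, and that combining is a plain sample-wise sum. The names `time_align` and `combine` are hypothetical and do not come from the abstract.

```python
def time_align(first, first_ref, second, second_ref, sample_rate):
    """Trim two sample lists so that index 0 of each refers to the same instant."""
    # Offset of the second stream relative to the first, in whole samples.
    offset = round((second_ref - first_ref) * sample_rate)
    if offset >= 0:
        # The second stream starts later: drop the leading samples of the first.
        first = first[offset:]
    else:
        # The first stream starts later: drop the leading samples of the second.
        second = second[-offset:]
    length = min(len(first), len(second))
    return first[:length], second[:length]


def combine(first, second):
    """Assumed combination rule: sample-wise sum of the aligned streams."""
    return [a + b for a, b in zip(first, second)]


aligned_first, aligned_second = time_align(
    [0, 1, 2, 3, 4], 0.0, [10, 20, 30], 0.002, sample_rate=1000
)
combined = combine(aligned_first, aligned_second)
```

A real system would combine spatial audio formats rather than scalar samples; the sketch only shows how two reference times translate into a sample offset.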

METHODS FOR PARAMETRIC MULTI-CHANNEL ENCODING

The present document relates to audio coding systems. In particular, the present document relates to efficient methods and systems for parametric multi-channel audio coding. An audio encoding system configured to generate a bitstream indicative of a downmix signal and spatial metadata for generating a multi-channel upmix signal from the downmix signal is described. The system comprises a downmix processing unit configured to generate the downmix signal from a multi-channel input signal; wherein the downmix signal comprises m channels and wherein the multi-channel input signal comprises n channels; n, m being integers with m<n. Furthermore, the system comprises a parameter processing unit configured to determine the spatial metadata from the multi-channel input signal. In addition, the system comprises a configuration unit configured to determine one or more control settings for the parameter processing unit based on one or more external settings; wherein the one or more external settings comprise a target data-rate for the bitstream and wherein the one or more control settings comprise a maximum data-rate for the spatial metadata.
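As a rough illustration of the two ideas in this abstract, the sketch below derives a maximum metadata rate from a target bitstream rate and mixes an n-channel frame down to m channels. The fixed percentage rule, the gain matrix, and the function names are assumptions made for the example; the abstract does not specify how the control setting is computed.

```python
def max_metadata_rate(target_bitrate_bps, metadata_percent=10):
    """Cap the spatial-metadata rate at an assumed share of the target bitrate."""
    return target_bitrate_bps * metadata_percent // 100


def downmix(frame, gains):
    """Mix one n-channel frame down to m channels using an m-by-n gain matrix."""
    n, m = len(frame), len(gains)
    if not m < n:
        raise ValueError("the downmix must have fewer channels than the input")
    return [sum(g * s for g, s in zip(row, frame)) for row in gains]


# Three input channels mixed down to two, with the third split equally.
stereo = downmix([2.0, 4.0, 6.0], [[1.0, 0.0, 0.5], [0.0, 1.0, 0.5]])
budget = max_metadata_rate(256000)
```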

SPATIAL AUDIO FOR INTERACTIVE AUDIO ENVIRONMENTS

Systems and methods of presenting an output audio signal to a listener located at a first location in a virtual environment are disclosed. According to embodiments of a method, an input audio signal is received. A first intermediate audio signal corresponding to the input audio signal is determined, based on a location of a sound source in the virtual environment, and the first intermediate audio signal is associated with a first bus. A second intermediate audio signal is determined. The second intermediate audio signal corresponds to a reverberation of the input audio signal in the virtual environment. The second intermediate audio signal is determined based on the location of the sound source, and further based on an acoustic property of the virtual environment. The second intermediate audio signal is associated with a second bus. The output audio signal is presented to the listener via the first and second buses.
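The two-bus arrangement can be sketched as follows. Everything specific here is an assumption for illustration: the direct path uses inverse-distance attenuation, the reverberation is reduced to a single delayed echo scaled by a hypothetical `room_decay` property, and the buses are mixed by sample-wise summation.

```python
import math


def direct_path(samples, source_pos, listener_pos):
    """First intermediate signal: attenuate by distance (clamped to 1.0)."""
    distance = max(math.dist(source_pos, listener_pos), 1.0)
    return [s / distance for s in samples]


def reverb_path(samples, source_pos, listener_pos, room_decay, delay_samples):
    """Second intermediate signal: one delayed echo scaled by a room property."""
    distance = max(math.dist(source_pos, listener_pos), 1.0)
    out = [0.0] * len(samples)
    for i in range(delay_samples, len(samples)):
        out[i] = samples[i - delay_samples] * room_decay / distance
    return out


def mix_buses(*buses):
    """Sum the buses sample by sample to form the output signal."""
    return [sum(frame) for frame in zip(*buses)]


source, listener = (2.0, 0.0, 0.0), (0.0, 0.0, 0.0)
dry_bus = direct_path([1.0, 0.0, 0.0], source, listener)
wet_bus = reverb_path([1.0, 0.0, 0.0], source, listener, room_decay=0.5, delay_samples=1)
output = mix_buses(dry_bus, wet_bus)
```

Keeping the direct and reverberant signals on separate buses lets each be processed independently before the final mix.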

METHODS, APPARATUS AND SYSTEMS FOR 6DOF AUDIO RENDERING AND DATA REPRESENTATIONS AND BITSTREAM STRUCTURES FOR 6DOF AUDIO RENDERING

The present disclosure relates to methods, apparatus and systems for encoding an audio signal into a bitstream, in particular at an encoder, comprising: encoding or including audio signal data associated with 3DoF audio rendering into one or more first bitstream parts of the bitstream, and encoding or including metadata associated with 6DoF audio rendering into one or more second bitstream parts of the bitstream. The present disclosure further relates to methods, apparatus and systems for decoding an audio signal and audio rendering based on the bitstream.

Method and Apparatus for Low Complexity Low Bitrate 6DOF HOA Rendering

An apparatus for generating an immersive audio scene, the apparatus including circuitry configured to: obtain audio scene based sources, wherein the audio scene based sources are associated with one or more positions in an audio scene and each audio scene based source includes at least one spatial parameter and at least one audio signal; determine at least one position associated with at least one of the audio scene based sources; generate at least one audio source based on the determined at least one position, wherein the circuitry is configured to: generate at least one spatial audio parameter; and generate at least one audio source signal; and generate information about a relationship between the generated at least one spatial audio parameter and the at least one audio signal, wherein the generated at least one audio source is selected based on a renderer preference.

Apparatus and method for audio rendering employing a geometric distance definition

An apparatus for playing back an audio object associated with a position includes a distance calculator for calculating distances of the position to speakers or for reading the distances of the position to the speakers. The distance calculator is configured to take a solution with a smallest distance. The apparatus is configured to play back the audio object using the speaker corresponding to the solution.
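A minimal sketch of the smallest-distance selection follows. It assumes plain Euclidean distance in Cartesian coordinates, which may differ from the geometric distance definition of the actual disclosure; the speaker names and positions are hypothetical.

```python
import math


def nearest_speaker(object_pos, speakers):
    """Return the name of the speaker closest to the audio object position."""
    return min(speakers, key=lambda name: math.dist(object_pos, speakers[name]))


# Hypothetical layout: left, right and center speakers in a horizontal plane.
layout = {"L": (-1.0, 0.0, 0.0), "R": (1.0, 0.0, 0.0), "C": (0.0, 1.0, 0.0)}
chosen = nearest_speaker((0.8, 0.1, 0.0), layout)
```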

AUDIO SIGNAL PROCESSING METHOD AND AUDIO SIGNAL PROCESSING APPARATUS
20220329957 · 2022-10-13

An audio signal processing method is provided. The method separates left and right channel audio signals into center-left and center-right channel audio signals and side-left and side-right channel audio signals, processes the center and side channel audio signals with a Head Related Transfer Function (HRTF) so as to simulate the audio signals at several audio source positions relative to a user, and then synthesizes the processed audio signals into dual-channel audio signals according to the user's hearing characteristics by a frequency compensation process. The invention further provides an audio signal processing apparatus to perform the audio signal processing method.

APPARATUS FOR PROVIDING AUDIO DATA TO MULTIPLE AUDIO LOGICAL DEVICES

A system and method that incorporates the subject disclosure may include, for example, receiving a multichannel audio stream; forming a front channel audio stream of the multichannel audio stream, including combining a first subset of audio channels of the multichannel audio stream to form the front channel audio stream; forming a surround channel audio stream of the multichannel audio stream including combining a second subset of audio channels of the multichannel audio stream to form the surround channel audio stream; providing the front channel audio stream to a primary set of speakers positioned in front of a listener and providing the surround channel audio stream to a supplemental speaker positioned behind the listener; and synchronizing the front channel audio stream and the surround channel audio stream. Additional embodiments are disclosed.
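The channel grouping can be sketched as below. The channel names, the use of a sample-wise average as the combining rule, and synchronization by prepending silence are all assumptions for illustration; the abstract does not say how channels are combined or how the streams are synchronized.

```python
def combine_channels(channels, names):
    """Average the named channels sample by sample into one stream."""
    selected = [channels[name] for name in names]
    return [sum(frame) / len(selected) for frame in zip(*selected)]


def delay(stream, samples):
    """Delay a stream by prepending silence, one simple way to synchronize."""
    return [0.0] * samples + list(stream)


# Hypothetical five-channel input, two samples per channel.
multichannel = {
    "FL": [1.0, 2.0], "FR": [3.0, 4.0], "C": [2.0, 3.0],
    "SL": [2.0, 2.0], "SR": [4.0, 6.0],
}
front = combine_channels(multichannel, ["FL", "FR", "C"])
surround = combine_channels(multichannel, ["SL", "SR"])
# Assume the supplemental speaker link adds one sample of latency,
# so the front stream is delayed by one sample to match it.
front_synced = delay(front, 1)
```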

Spatial transformation of ambisonic audio data

A device is configured to decode a bitstream, where the device includes a memory configured to store a temporally encoded representation of spatial audio signals. The device is also configured to receive the bitstream that includes an indication of a spatial transformation, and includes a temporal decoding unit, coupled to the memory, configured to decode one or more spatial audio signals represented in a spatial domain, where the one or more spatial audio signals are associated with different angles in the spatial domain. In addition, the device includes an inverse spatial transformation unit, coupled to the temporal decoding unit, that is configured to convert the one or more spatial audio signals represented in the spatial domain into at least three ambisonic coefficients that, in part, represent a soundfield in an ambisonics domain, and to perform a spatial transformation of the soundfield based on the indication of the spatial transformation received in the bitstream.
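To make the two stages concrete, the sketch below converts spatial-domain signals with known angles into first-order ambisonic coefficients and then applies one possible spatial transformation, a rotation about the vertical axis. It assumes plane-wave sources, a W coefficient equal to the source sample, and angles in radians; the function names are hypothetical, and the disclosure may use other orders, normalizations, or transformations.

```python
import math


def encode_first_order(sources):
    """Sum (sample, azimuth, elevation) plane waves into W, X, Y, Z coefficients."""
    w = x = y = z = 0.0
    for sample, azimuth, elevation in sources:
        w += sample
        x += sample * math.cos(azimuth) * math.cos(elevation)
        y += sample * math.sin(azimuth) * math.cos(elevation)
        z += sample * math.sin(elevation)
    return w, x, y, z


def rotate_yaw(coeffs, angle):
    """Rotate the soundfield about the vertical axis; W and Z are unchanged."""
    w, x, y, z = coeffs
    cos_a, sin_a = math.cos(angle), math.sin(angle)
    return w, x * cos_a - y * sin_a, x * sin_a + y * cos_a, z


# A single source straight ahead, then the whole field turned by 90 degrees.
field = encode_first_order([(1.0, 0.0, 0.0)])
rotated = rotate_yaw(field, math.pi / 2)
```

After the rotation, the energy that was in X (front) appears in Y (left), which is what a 90 degree turn of the soundfield should produce.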

Image display apparatus and method thereof

The present disclosure relates to an image display apparatus and a method thereof. The image display apparatus according to an embodiment of the present disclosure comprises: a display; an external device interface configured to perform short-range communication; and a controller configured to: when the number of external devices communicatively connected with the image display apparatus is one, or when a bandwidth usage for the short-range communication is less than a predetermined reference, transmit an entire signal to the one external device; and when the number of the external devices is two or more and the bandwidth usage is greater than or equal to the predetermined reference, acquire a plurality of channel signals from the entire signal, transmit a first channel signal to a first external device among the external devices, and transmit a second channel signal to a second external device among the external devices.
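The controller's decision can be sketched as a small routing function. The two-channel signal, the device names, and the choice to send the entire signal to every connected device when several devices are connected but bandwidth usage is below the reference are assumptions; the abstract does not spell out that last case.

```python
def plan_transmission(devices, bandwidth_usage, reference, signal):
    """Map each connected device to the channel signals it should receive."""
    if len(devices) >= 2 and bandwidth_usage >= reference:
        # Split the entire signal into per-device channel signals.
        first_channel, second_channel = signal
        return {devices[0]: (first_channel,), devices[1]: (second_channel,)}
    # One device, or low bandwidth usage: send the entire signal.
    return {device: signal for device in devices}


entire_signal = ("left", "right")
single = plan_transmission(["speaker_a"], 0.9, 0.5, entire_signal)
split = plan_transmission(["speaker_a", "speaker_b"], 0.9, 0.5, entire_signal)
```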