H04S2420/03

Spatial audio augmentation and reproduction
11363403 · 2022-06-14 · ·

An apparatus including circuitry configured for: obtaining at least one spatial audio signal including at least one audio signal, wherein the at least one spatial audio signal defines an audio scene forming at least in part media content; rendering an audio scene based on the at least one spatial audio signal; obtaining at least one augmentation audio signal; transforming the at least one augmentation audio signal to at least two audio objects; augmenting the audio scene based on the at least two audio objects.

METHODS AND SYSTEMS FOR RENDERING OBJECT BASED AUDIO

Methods for generating an object based audio program, renderable in a personalizable manner, and including a bed of speaker channels renderable in the absence of selection of other program content (e.g., to provide a default full range audio experience). Other embodiments include steps of delivering, decoding, and/or rendering such a program. Rendering of content of the bed, or of a selected mix of other content of the program, may provide an immersive experience. The program may include multiple object channels (e.g., object channels indicative of user-selectable and user-configurable objects), the bed of speaker channels, and other speaker channels. Another aspect is an audio processing unit (e.g., encoder or decoder) configured to perform, or which includes a buffer memory which stores at least one frame (or other segment) of an object based audio program (or bitstream thereof) generated in accordance with, any embodiment of the method.

Adaptive panner of audio objects

An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.

Sound Field Related Rendering
20220174443 · 2022-06-02 ·

An apparatus including circuitry configured to: obtain at least two audio signals; determine a type of the at least two audio signals; process the at least two audio signals configured to be rendered based on the determined type of the at least two audio signals.

Spatial audio capture

An apparatus for spatial audio signal processing, the apparatus including at least one processor and at least one memory including a computer program code, the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus at least to: receive audio signals from a microphone array, the microphone array including three or more microphones forming a geometry with defined displacements between pairs of the three or more microphones; determine delay information between audio signals associated with the pairs of the three or more microphones; determine an operator based on the geometry with defined displacements between the pairs of the three or more microphones; apply the operator to the delay information to generate at least one direction parameter associated with the audio signals.

MULTI-CHANNEL DECORRELATOR, MULTI-CHANNEL AUDIO DECODER, MULTI-CHANNEL AUDIO ENCODER, METHODS AND COMPUTER PROGRAM USING A PREMIX OF DECORRELATOR INPUT SIGNALS

A multi-channel decorrelator for providing a plurality of decorrelated signals on the basis of a plurality of decorrelator input signals is configured to premix a first set of N decorrelator input signals into a second set of K decorrelator input signals, wherein K<N. The multi-channel decorrelator is configured to provide a first set of K′ decorrelator output signals on the basis of the second set of K decorrelator input signals. The multi-channel decorrelator is further configured to upmix the first set of K′ decorrelator output signals into a second set of N′ decorrelator output signals, wherein N′>K′. The multi-channel decorrelator can be used in a multi-channel audio decoder. A multi-channel audio encoder provides complexity control information for the multi-channel decorrelator.

COMPUTER SYSTEM FOR REALIZING CUSTOMIZED BEING-THERE IN ASSOCATION WITH AUDIO AND METHOD THEREOF

A method by a computer system including generating audio files based on respective audio signals, the audio signals having been respectively generated from a plurality of objects at a venue, generating metadata including spatial features at the venue that are respectively set for the objects, and transmitting the audio files and the metadata for the objects to a first electronic device to cause the first electronic device to realize a being-there at the venue by rendering the audio files based on the spatial features in the metadata may be provided.

Compatible multi-channel coding/decoding

In processing a multi-channel audio signal having at least three original channels, first and second downmix channels derived from the original channels are provided. For a selected original channel of the original channels, channel side information are calculated such that a downmix channel or a combined downmix channel including the first and second downmix channels, when weighted using the channel side information, results in an approximation of the selected original channel. The channel side information and the first and second downmix channels form output data to be transmitted to a low-level decoder, which only decodes the first and second downmix channels, or to a high-level decoder, which provides a full multi-channel audio signal based on the downmix channels and the channel side information. Since the channel side information occupy few bits only and since the decoder does not use dematrixing, an efficient and high quality multi-channel extension for stereo players and enhanced multi-channel players is acquired.

Apparatus for encoding or decoding an encoded multichannel signal using a filling signal generated by a broad band filter

An apparatus for decoding an encoded multichannel signal includes: a base channel decoder for decoding an encoded base channel to obtain a decoded base channel; a decorrelation filter for filtering at least a portion of the decoded base channel to obtain a filling signal; and a multichannel processor for performing a multichannel processing using a spectral representation of the decoded base channel and a spectral representation of the filling signal, wherein the decorrelation filter is a broad band filter and the multichannel processor is configured to apply a narrow band processing to the spectral representation of the decoded base channel and the spectral representation of the filling signal.

Renderer controlled spatial upmix

An audio decoder device for decoding a compressed input audio signal having at least one core decoder having one or more processors for generating a processor output signal based on a processor input signal, wherein a number of output channels of the processor output signal is higher than a number of input channels of the processor input signal, wherein each of the one or more processors has a decorrelator and a mixer, wherein a core decoder output signal having a plurality of channels has the processor output signal, and wherein the core decoder output signal is suitable for a reference loudspeaker setup; at least one format converter device configured to convert the core decoder output signal into an output audio signal, which is suitable for a target loudspeaker setup; and a control device configured to control at least one or more processors in such way that the decorrelator of the processor may be controlled independently from the mixer of the processor, wherein the control device is configured to control at least one of the decorrelators of the one or more processors depending on the target loudspeaker setup.