H04S2420/03

Methods and systems for interactive rendering of object based audio

Methods for generating an object based audio program which is renderable in a personalizable manner, e.g., to provide an immersive, perception of audio content of the program. Other embodiments include steps of delivering (e.g., broadcasting), decoding, and/or rendering such a program. Rendering of audio objects indicated by the program may provide an immersive experience. The audio content of the program may be indicative of multiple object channels (e.g., object channels indicative of user-selectable and user-configurable objects, and typically also a default set of objects which will be rendered in the absence of a selection by a user) and a bed of speaker channels. Another aspect is an audio processing unit (e.g., encoder or decoder) configured to perform, or which includes a buffer memory which stores at least one frame (or other segment) of an object based audio program (or bitstream thereof) generated in accordance with, any embodiment of the method.

Spatial audio augmentation and reproduction
11729574 · 2023-08-15 · ·

A method including: obtaining at least one spatial audio signal including at least one audio signal, wherein the at least one spatial audio signal at least partially defines an audio scene; obtaining at least one augmentation audio signal; determining at least two audio objects based upon the at least one augmentation audio signal; determining audio-object dependency information for the determined at least two audio objects; and augmenting the audio scene based, at least partially, on both the determined at least two audio objects and the determined audio-object dependency information.

Systems, devices and methods for multi-dimensional audio recording and playback
11729571 · 2023-08-15 ·

Systems and methods for recording and playback of multi-dimensional sound are described herein. The systems and methods may include positioning a plurality of multi-dimensional sound recording devices in a location and positioning a plurality of multi-dimensional sound recording sensors within the location. Then, acoustical footprint data can be generated. Next, recording positional data within the location utilizing the plurality of multi-dimensional sound recording devices may occur. The systems and methods may continue to generate spatial data utilizing the recorded positional data and store the generated acoustical footprint data and spatial data. An audio mix-down utilizing the stored acoustical footprint and spatial data is generated. Finally, a consumer-device audio track mix based on the audio mix-down can be generated. Further embodiments may also replace audio tracks to mimic the original recording conditions in other languages and environment. Playback may occur on a device that generates a profile of the playback area.

AUDIO PROCESSING DEVICE AND METHOD THEREFOR
20230254657 · 2023-08-10 ·

An input unit receives input of an assumed listening position of sound of an object, which is a sound source, and outputs assumed listening position information indicating the assumed listening position. A position information correction unit corrects position information of each object on the basis of the assumed listening position information to obtain corrected position information. A gain/frequency characteristic correction unit performs gain correction and frequency characteristic correction on a waveform signal of an object on the basis of the position information and the corrected position information. A spatial acoustic characteristic addition unit further adds a spatial acoustic characteristic to the waveform signal resulting from the gain correction and the frequency characteristic correction on the basis of the position information of the object and the assumed listening position information. The present technology is applicable to an audio processing device.

METHOD AND APPARATUS FOR RENDERING SOUND SIGNAL, AND COMPUTER-READABLE RECORDING MEDIUM
20220132259 · 2022-04-28 · ·

A method of reproducing a multi-channel audio signal including an elevation sound signal in a horizontal layout environment is provided, thereby obtaining a rendering parameter according to a rendering type and configuring a down-mix matrix, and thus effective rendering performance may be obtained with respect to an audio signal that is not suitable for applying virtual rendering. A method of rendering an audio signal includes receiving a multi-channel signal includes a plurality of input channels to be converted into a plurality of output channels; determining a rendering type for elevation rendering based on a parameter determined from a characteristic of the multi-channel signal; and rendering at least one height input channel according to the determined rendering type, wherein the parameter is included in a bitstream of the multi-channel signal.

Audio bandwidth reduction

A first device obtains, from the array, several audio signals and processes the audio signals to produce a speech signal and one or more ambient signals. The first device processes the ambient signals to produce a sound-object sonic descriptor that has metadata describing a sound object within an acoustic environment. The first device transmits, over a communication data link, the speech signal and the descriptor to a second electronic device that is configured to spatially reproduce the sound object using the descriptor mixed with the speech signal, to produce several mixed signals to drive several speakers.

Selectable linear predictive or transform coding modes with advanced stereo coding

Methods and systems for advanced stereo processing of an audio signal are disclosed. The methods and systems include selecting a coding mode of either transform coding or linear predictive coding and performing advanced stereo processing when in the selected coding mode. Both encoding and decoding operations are provided.

Decoding of audio scenes

Exemplary embodiments provide encoding and decoding methods, and associated encoders and decoders, for encoding and decoding of an audio scene which is represented by one or more audio signals. The encoder generates a bit stream which comprises downmix signals and side information which includes individual matrix elements of a reconstruction matrix which enables reconstruction of the one or more audio signals in the decoder.

Methods, apparatus and systems for encoding and decoding of directional sound sources

Some disclosed methods involve encoding or decoding directional audio data. Some encoding methods may involve receiving a mono audio signal corresponding to an audio object and a representation of a radiation pattern corresponding to the audio object. The radiation pattern may include sound levels corresponding to plurality of sample times, a plurality of frequency bands and a plurality of directions. The methods may involve encoding the mono audio signal and encoding the source radiation pattern to determine radiation pattern metadata. Encoding the radiation pattern may involve determining a spherical harmonic transform of the representation of the radiation pattern and compressing the spherical harmonic transform to obtain encoded radiation pattern metadata.

STEREO AUDIO ENCODER AND DECODER

The present disclosure provides methods, devices and computer program products for encoding and decoding a stereo audio signal based on an input signal. According to the disclosure, a hybrid approach of using both parametric stereo coding and a discrete representation of the stereo audio signal is used which may improve the quality of the encoded and decoded audio for certain bitrates.