H04S5/005

METHOD AND APPARATUS FOR RENDERING OBJECT-BASED AUDIO SIGNAL CONSIDERING OBSTACLE

A method and apparatus for rendering an object-based audio signal considering an obstacle are disclosed. A method for rendering an object-based audio signal according to an example embodiment, the method includes identifying an object-based input signal and metadata for the input signal, generating a binaural filter based on the metadata using a binaural room impulse response (BRIR), determining, based on the metadata, whether an obstacle is present between a listener and an object, modifying the generated binaural filter when it is determined that the obstacle is present, and generating a rendered output signal by convolving the modified binaural filter and the input signal.

Systems and methods for providing augmented audio

A system for providing augmented spatialized audio in a vehicle, including a plurality of speakers disposed in a perimeter of a cabin of the vehicle; and a controller configured to receive a position signal indicative of the position of a first user's head in the vehicle and to output to a first binaural device, according to the first position signal, a first spatial audio signal, such that the first binaural device produces a first spatial acoustic signal perceived by the first user as originating from a first virtual source location within the vehicle cabin, wherein the first spatial audio signal comprises at least an upper range of a first content signal, wherein the controller is further configured to drive the plurality of speakers with a driving signal such that a first bass content of the first content signal is produced in the vehicle cabin.

Systems and Methods for Audio Upmixing

Systems and methods for audio in accordance with embodiments of the invention are illustrated. One embodiment includes a method for upmixing audio, including receiving an audio track which includes an input plurality of channels, each channel having an encoded audio signal, decoding the audio signal, calculating a first frequency spectrum for a low frequency component of the signal using a first window, calculating a second frequency spectrum for a high frequency component of the signal using a second window, determining at least one direct signal by estimating panning coefficients, estimating at least one ambient signal based on the at least one direct signal, and generating an output plurality of channels based on the at least one direct signal and the at least one ambient signal.

SPATIAL AUDIO CONTROLLER

A method performed a local device that is communicatively coupled with several remote devices, the method includes: receiving, from each remote device with which the local device is engaged in a communication session, an input audio stream; receiving, for each remote device, a set parameters; determining, for each input audio stream, whether the input audio stream is to be 1) rendered individually or 2) rendered as a mix of input audio streams based on the set of parameters; for each input audio stream that is determined to be rendered individually, spatial rendering the input audio stream as an individual virtual sound source that contains only that input audio stream; and for input audio streams that are determined to be rendered as the mix of input audio streams, spatial rendering the mix of input audio streams as a single virtual sound source that contains the mix of input audio streams.

Dynamic audio upmixer parameters for simulating natural spatial variations

A system and method for creating natural spatial variations in an audio output. At least one parameter in a set of mixer tuning parameters is dynamically modified over time and within a predetermined range that is defined by a set of modification control parameters. The set of mixer tuning parameters that includes the at least one dynamically modified parameter is applied to a mixer allowing the mixer to create natural spatial variations in the audio output to be played at one or more loudspeakers.

Audio processing in adaptive intermediate spatial format

Systems, methods, and computer program products of audio processing based on Adaptive Intermediate Spatial Format (AISF) are described. The AISF is an extension to ISF that allows spatial resolution around an ISF ring to be adjusted dynamically with respect to content of incoming audio objects. An AISF encoder device adaptively warps each ISF ring during ISF encoding to adjust angular distance between objects, resulting in increase in uniformity of energy distribution around the ISF ring. At an AISF decoder device, matrices that decode sound positions to the output speaker take into account the warping that was performed at the AISF encoder device to reproduce the true positions of sound sources.

Spatializing audio data based on analysis of incoming audio data

A system for enabling spatializing audio data is provided. The system analyzes audio data to identify when to generate spatialized audio data. The system can receive incoming audio data including a plurality of channel-based audio signals as well as object-based audio. The system performs an analysis of the audio data and/or metadata associated with the audio data to determine when to generate the spatialized audio data. The system can identify one or more categories associated with the audio data (e.g., stereo, mono, game effect, . . . ) and use the category to determine whether to spatialize the audio data or not spatialize the audio data.

Method and system for surround sound processing in a headset
11503420 · 2022-11-15 · ·

An audio headset may receive a plurality of audio signals corresponding to plurality of surround sound channels. The headset may determine, via its audio processing circuitry, context and/or content of the audio signals. The audio processing circuitry may process the audio signals to generate stereo signals carrying one or more virtual surround channels, wherein the processing comprises automatically controlling, based on the context and the content of the audio signals, a simulated acoustic environment of the virtual surround channels.

SYSTEM AND METHOD FOR ADAPTIVE AUDIO SIGNAL GENERATION, CODING AND RENDERING

Embodiments are described for an adaptive audio system that processes audio data comprising a number of independent monophonic audio streams. One or more of the streams has associated with it metadata that specifies whether the stream is a channel-based or object-based stream. Channel-based streams have rendering information encoded by means of channel name; and the object-based streams have location information encoded through location expressions encoded in the associated metadata. A codec packages the independent audio streams into a single serial bitstream that contains all of the audio data. This configuration allows for the sound to be rendered according to an allocentric frame of reference, in which the rendering location of a sound is based on the characteristics of the playback environment (e.g., room size, shape, etc.) to correspond to the mixer's intent. The object position metadata contains the appropriate allocentric frame of reference information required to play the sound correctly using the available speaker positions in a room that is set up to play the adaptive audio content.

Method and System for Surround Sound Processing in an Audio Device
20230033891 · 2023-02-02 ·

An audio headset may receive a plurality of audio signals corresponding to plurality of surround sound channels. The headset may determine, via its audio processing circuitry, context and/or content of the audio signals. The audio processing circuitry may process the audio signals to generate stereo signals carrying one or more virtual surround channels, wherein the processing comprises automatically controlling, based on the context and the content of the audio signals, a simulated acoustic environment of the virtual surround channels.