H04S5/005

Renderer controlled spatial upmix

An audio decoder device for decoding a compressed input audio signal having at least one core decoder having one or more processors for generating a processor output signal based on a processor input signal, wherein a number of output channels of the processor output signal is higher than a number of input channels of the processor input signal, wherein each of the one or more processors has a decorrelator and a mixer, wherein a core decoder output signal having a plurality of channels has the processor output signal, and wherein the core decoder output signal is suitable for a reference loudspeaker setup; at least one format converter device configured to convert the core decoder output signal into an output audio signal, which is suitable for a target loudspeaker setup; and a control device configured to control at least one or more processors in such way that the decorrelator of the processor may be controlled independently from the mixer of the processor, wherein the control device is configured to control at least one of the decorrelators of the one or more processors depending on the target loudspeaker setup.

Systems and Methods for Spatial Audio Rendering

Systems and methods for rendering spatial audio in accordance with embodiments of the invention are illustrated. One embodiment includes a spatial audio system, including a primary network connected speaker, including a plurality of sets of drivers, where each set of drivers is oriented in a different direction, a processor system, memory containing an audio player application, wherein the audio player application configures the processor system to obtain an audio source stream from an audio source via the network interface, spatially encode the audio source, decode the spatially encoded audio source to obtain driver inputs for the individual drivers in the plurality of sets of drivers, where the driver inputs cause the drivers to generate directional audio.

Methods and Apparatus for Rendering Audio Objects

Multiple virtual source locations may be defined for a volume within which audio objects can move. A set-up process for rendering audio data may involve receiving reproduction speaker location data and pre-computing gain values for each of the virtual sources according to the reproduction speaker location data and each virtual source location. The gain values may be stored and used during “run time,” during which audio reproduction data are rendered for the speakers of the reproduction environment. During run time, for each audio object, contributions from virtual source locations within an area or volume defined by the audio object position data and the audio object size data may be computed. A set of gain values for each output channel of the reproduction environment may be computed based, at least in part, on the computed contributions. Each output channel may correspond to at least one reproduction speaker of the reproduction environment.

Acoustic simulation apparatus
11736881 · 2023-08-22 · ·

A virtual reproduction signal generation unit generates a virtual reproduction signal based on a sound pickup signal of a stereophonic sound at a listening position in a compartment, assuming that virtual speakers are respectively located at portions of Np positions in a vehicle, the virtual reproduction signal causing the virtual speakers of the Np positions to reproduce the stereophonic sound. A virtual prediction signal generation unit generates a virtual prediction signal based on the virtual reproduction signal and an information representing a change of acoustic characteristics when at least part of the portions of the Np positions is changed, the virtual prediction signal causing the virtual speakers of the Np positions to output a predicted sound at the listening position. An output signal generation unit generates an output signal based on the virtual prediction signal, the output signal causing speakers of a plurality of positions to output the predicted sound.

SYSTEMS AND METHODS FOR PROVIDING AUGMENTED AUDIO

A system for providing augmented spatialized audio in a vehicle, including a plurality of speakers disposed in a perimeter of a cabin of the vehicle; and a controller configured to receive a position signal indicative of the position of a first user's head in the vehicle and to output to a first binaural device, according to the first position signal, a first spatial audio signal, such that the first binaural device produces a first spatial acoustic signal perceived by the first user as originating from a first virtual source location within the vehicle cabin, wherein the first spatial audio signal comprises at least an upper range of a first content signal, wherein the controller is further configured to drive the plurality of speakers with a driving signal such that a first bass content of the first content signal is produced in the vehicle cabin.

Audio System Height Channel Up-Mixing
20220139403 · 2022-05-05 ·

Audio system height channel up-mixing that is configured to develop two or more height channels from audio sources that do not include height-related encoding. The up-mixing involves determining correlations and normalized channel energies between input audio signals. At least two height channels (e.g., left and right height audio signals) are developed from the correlations and normalized energies.

Method for audio reproduction in a multi-channel sound system
11722831 · 2023-08-08 · ·

The invention relates to a method for reproducing audio in a multi-channel sound system including two input signals (L and R), wherein output signals are generated for different sound perception levels. In order to develop said method in such a way that audio can be reproduced within a larger range of applications in a multi-channel sound system, according to the invention, only a lower sound perception level (7) and a higher sound perception level (6) are generated, and a maximum of six output signals are generated, a maximum of two output signals being allocated to the lower sound perception level (7) and a maximum of four output signals being allocated to the higher sound perception level (6).

Systems and methods for spatial audio rendering

Systems and methods for rendering spatial audio in accordance with embodiments of the invention are illustrated. One embodiment includes a spatial audio system, including a primary network connected speaker, including a plurality of sets of drivers, where each set of drivers is oriented in a different direction, a processor system, memory containing an audio player application, wherein the audio player application configures the processor system to obtain an audio source stream from an audio source via the network interface, spatially encode the audio source, decode the spatially encoded audio source to obtain driver inputs for the individual drivers in the plurality of sets of drivers, where the driver inputs cause the drivers to generate directional audio.

Selectable linear predictive or transform coding modes with advanced stereo coding

Methods and systems for advanced stereo processing of an audio signal are disclosed. The methods and systems include selecting a coding mode of either transform coding or linear predictive coding and performing advanced stereo processing when in the selected coding mode. Both encoding and decoding operations are provided.

SOUND PROCESSING APPARATUS AND SOUND PROCESSING SYSTEM

The present technology relates to a sound processing apparatus and a sound processing system for enabling more stable localization of a sound image.

A virtual speaker is assumed to exist on the lower side among the sides of a tetragon having its corners formed with four speakers surrounding a target sound image position on a spherical plane. Three-dimensional VBAP is performed with respect to the virtual speaker and the two speakers located at the upper right and the upper left, to calculate gains of the two speakers at the upper right and the upper left and the virtual speaker, the gains being to be used for fixing a sound image at the target sound image position. Further, two-dimensional VBAP is performed with respect to the lower right and lower left speakers, to calculate gains of the lower right and lower left speakers, the gains being to be used for fixing a sound image at the position of the virtual speaker. The values obtained by multiplying these gains by the gain of the virtual speaker are set as the gains of the lower right and lower left speakers for fixing a sound image at the target sound image position. The present technology can be applied to sound processing apparatuses.