H04S2420/11

AUDIO PERSONALISATION METHOD AND SYSTEM

An audio personalisation method for a user, to reproduce an area-based or volumetric sound source, includes the steps of, for a head related transfer function ‘HRTF’ associated with the user, smoothing HRTF coefficients relating to peaks and notches in the HRTF's spectral response, responsive to the size of the area or volume of the sound source; filtering the sound source using the smoothed HRTF for the notional position of the sound source; and outputting the filtered sound source signal for playback to the user.

AUDIO SYSTEM
20230262393 · 2023-08-17 ·

An audio system includes an audio/video receiver, a power supply/wireless audio distribution assembly connected to the audio/video receiver, speaker wire, and speakers compatible with the power supply/wireless audio distribution assembly.

Orientation-aware surround sound playback

Example embodiments disclosed herein relate to orientation-aware surround sound playback. A method for processing audio on an electronic device that includes a plurality of loudspeakers is disclosed, the loudspeakers arranged in more than one dimension of the electronic device. The method includes, responsive to receipt of a plurality of received audio streams, generating a rendering component associated with the plurality of received audio streams, determining an orientation dependent component of the rendering component, processing the rendering component by updating the orientation dependent component according to an orientation of the loudspeakers and dispatching the received audio streams to the plurality of loudspeakers for playback based on the processed rendering component. Corresponding system and computer program products are also disclosed.

SEAMLESS RENDERING OF AUDIO ELEMENTS WITH BOTH INTERIOR AND EXTERIOR REPRESENTATIONS
20230262405 · 2023-08-17 · ·

A method (700) for spatial audio rendering of an audio element having an extent (101). The method includes determining (s702) that a listener is within a transition region that is outside of the extent. The method also includes determining (s704) a first interior rendering with an interior set of virtual loudspeakers. The method also includes determining (s706) an exterior rendering with an exterior set of virtual loudspeakers, wherein the exterior set of virtual loudspeakers comprises first and second virtual loudspeakers. The method also includes, in response to determining that the listener is within the transition region, determining (s708) a transition rendering, wherein the transition rendering includes the interior set of virtual loudspeakers with two loudspeakers in the interior set of virtual loudspeakers replaced by third and fourth virtual loudspeakers, the third and fourth virtual loudspeakers being based on the first and second virtual loudspeakers of the exterior set of virtual loudspeakers. The method also includes rendering (s710) the transition rendering for the listener.

TRANSFORM AMBISONIC COEFFICIENTS USING AN ADAPTIVE NETWORK FOR PRESERVING SPATIAL DIRECTION

A device includes a memory configured to store untransformed ambisonic coefficients at different time segments. The device includes one or more processors configured to obtain the untransformed ambisonic coefficients at the different time segments, where the untransformed ambisonic coefficients at the different time segments represent a soundfield at the different time segments. The one or more processors are configured to apply one adaptive network, based on a constraint that includes preservation of a spatial direction of one or more audio sources in the soundfield at the different time segments, to the untransformed ambisonic coefficients at the different time segments to generate transformed ambisonic coefficients at the different time segments, wherein the transformed ambisonic coefficients at the different time segments represent a modified soundfield at the different time segments, that was modified based on the constraint. The one or more processors are also configured to apply an additional adaptive network.

AUDIO SCENE ENCODER, AUDIO SCENE DECODER AND RELATED METHODS USING HYBRID ENCODER-DECODER SPATIAL ANALYSIS

An audio scene encoder for encoding an audio scene, the audio scene having at least two component signals, has: a core encoder for core encoding the at least two component signals, wherein the core encoder is configured to generate a first encoded representation for a first portion of the at least two component signals, and to generate a second encoded representation for a second portion of the at least two component signals, a spatial analyzer for analyzing the audio scene to derive one or more spatial parameters or one or more spatial parameter sets for the second portion; and an output interface for forming the encoded audio scene signal, the encoded audio scene signal having the first encoded representation, the second encoded representation, and the one or more spatial parameters or one or more spatial parameter sets for the second portion.

Spatial audio augmentation and reproduction
11729574 · 2023-08-15 · ·

A method including: obtaining at least one spatial audio signal including at least one audio signal, wherein the at least one spatial audio signal at least partially defines an audio scene; obtaining at least one augmentation audio signal; determining at least two audio objects based upon the at least one augmentation audio signal; determining audio-object dependency information for the determined at least two audio objects; and augmenting the audio scene based, at least partially, on both the determined at least two audio objects and the determined audio-object dependency information.

Spatial audio monauralization via data exchange
11729570 · 2023-08-15 · ·

A device includes a memory configure to store instructions and one or more processors configured to execute the instructions to obtain spatial audio data at a first audio output device. The one or more processors are further configured to perform data exchange, between the first audio output device and a second audio output device, of exchange data based on the spatial audio data. The one or more processors are also configured to generate first monaural audio output at the first audio output device based on the spatial audio data.

Methods and apparatus for determining for decoding a compressed hoa sound representation

When compressing an HOA data frame representation, a gain control (15, 151) is applied for each channel signal before it is perceptually encoded (16). The gain values are transferred in a differential manner as side information. However, for starting decoding of such streamed compressed HOA data frame representation absolute gain values are required, which should be coded with a minimum number of bits. For determining such lowest integer number (β.sub.e) of bits the HOA data frame representation (C(k)) is rendered in spatial domain to virtual loudspeaker signals lying on a unit sphere, followed by normalisation of the HOA data frame representation (C(k)). Then the lowest integer number of bits is set to β.sub.e=┌log.sub.2(┌log.sub.2(√{square root over (K.sub.MAX)}.Math.O)┐+1)┐.

RECORDING AND RENDERING AUDIO SIGNALS

A method, apparatus and computer program, the method comprising: receiving a plurality of input signals representing a sound space; using the received plurality of input signals to obtain spatial metadata corresponding to the sound space; using the received plurality of input signals to obtain a first spatial audio signal corresponding to the spatial metadata; and associating the first spatial audio signal with the spatial metadata to enable the spatial metadata to be used to process the first spatial audio signal to obtain a second spatial audio signal.