H04S2420/11

Spatial audio signal decoder
11355132 · 2022-06-07 · ·

A method to decode audio signals is provided that includes: receiving an input spatial audio signal; determining directions of arrival of directional audio sources represented in the received input spatial audio signal; determining one of an active input spatial audio signal component and a passive spatial audio signal input component, based upon the determined directions of arrival; determining the other of the active input spatial audio signal component and the passive input spatial audio signal component based upon the determined one of the active input spatial audio signal component and the passive input spatial audio signal component; decoding the active input spatial audio signal component to a first output format; and decoding the passive input spatial audio signal component to a second output format.

Method and apparatus for wind noise detection and beam pattern processing

An image capture device includes a sensor, a microphone array, and a processor. The microphone array may include a first microphone, a second microphone, a third microphone, or any combination thereof. The first microphone may be configured to face a first direction. The second microphone may be configured to face a second direction. The second direction may be diametrically opposed to the first direction. The third microphone may be configured to face a third direction. The third direction may be substantially perpendicular to the first direction, the second direction, or both. The processor may be configured to determine a microphone capture pattern and detect wind noise. The microphone capture pattern may be determined based on data obtained from the sensor. The sensor data may include image data, audio data, image capture device orientation data, location data, accelerometer data, or any combination thereof.

Controlling rendering of audio data

A device may be configured to process one or more audio streams in accordance with the techniques described herein. The device may comprise: one or more processors and a memory. The one or more processors may be configured to obtain an indication of a boundary separating an interior area from an exterior area, and obtain a listener location indicative of a location of the device relative to the interior area. The one or more processors may be configured to obtain, based on the boundary and the listener location, a current renderer as either an interior renderer configured to render audio data for the interior area or an exterior renderer configured to render the audio data for the exterior area, and apply, to the audio data, the current renderer to obtain one or more speaker feeds. The memory may be configured to store the one or more speaker feeds.

Priority-based soundfield coding for virtual reality audio

An example device configured to obtain image data includes a memory configured to store one or more priority values, each of the one or more priority values being associated with a type of image object associated with the image data. The device includes one or more processors coupled to the memory, and configured to associate image objects in the image data with one or more audio sources represented in one or more audio streams. The one or more processors are also configured to assign a respective priority value to each of the one or more audio sources represented in the one or more streams and code ambisonic coefficients based on the assigned priority value.

Sound Field Related Rendering
20220174443 · 2022-06-02 ·

An apparatus including circuitry configured to: obtain at least two audio signals; determine a type of the at least two audio signals; process the at least two audio signals configured to be rendered based on the determined type of the at least two audio signals.

METHOD AND APPARATUS FOR DECODING STEREO LOUDSPEAKER SIGNALS FROM A HIGHER-ORDER AMBISONICS AUDIO SIGNAL
20220182775 · 2022-06-09 · ·

Decoding of Ambisonics representations for a stereo loudspeaker setup is known for first-order Ambisonics audio signals. But such first-order Ambisonics approaches have either high negative side lobes or poor localisation in the frontal region. The invention deals with the processing for stereo decoders for higher-order Ambisonics HOA.

FILE FORMAT FOR SPATIAL AUDIO

An audio asset library containing audio assets formatted in accordance with a file format for spatial audio includes asset metadata that enables simulated reality (SR) application developers to compose sounds for use in SR applications. The audio assets are formatted to include audio data encoding a sound capable of being composed into a SR application along with asset metadata describing not only how the sound was encoded, but also how a listener in SR environment experiences the sound. A SR developer platform is configured so that developers can compose sound for SR objects using audio assets stored in the audio library, including editing the asset metadata to include transformation parameters that support dynamic transformation of the asset metadata in the SR environment to alter how the SR listener experiences the composed sound. Other embodiments are also described and claimed.

Apparatus and Method for Audio Signal Transformation
20230274749 · 2023-08-31 ·

An apparatus for audio signal transformation is provided. The apparatus includes a determination unit configured for determining, using spherical harmonics information, a transformation rule for transforming an audio input signal within a first domain, being different from a spherical harmonics domain. Moreover, the apparatus includes a transformation unit configured for transforming, using the transformation rule, the audio input signal, being represented in the first domain, to obtain a transformed audio signal being represented in the first domain. The spherical harmonics information includes information on a plurality of spherical harmonics and/or includes information being represented in the spherical harmonics domain.

REAL-TIME SOUND FIELD SYNTHESIS BY MODIFYING PRODUCED AUDIO STREAMS
20230276189 · 2023-08-31 ·

A client device is disclosed that receives, from a server, a live video stream and a production quality live ambisonic audio stream generated during performance of a live event at a venue. The live ambisonic audio stream is generated from audio channels captured by audio capture devices disposed at the venue. The audio channels captured at the event, and modified by a producer, can be compared to audio captured by an ambisonic microphone positioned within the event space to determine the phase and relative amplitude of those channels as received by a particular ambisonic microphone channel. In this manner, raw and/or produced audio channels captured at the event can be shifted and mixed together to generate a production quality ambisonic stream.

SPATIAL INFORMATION ENHANCED AUDIO FOR REMOTE MEETING PARTICIPANTS
20230276187 · 2023-08-31 ·

A computer implemented method includes receiving sound at multiple microphones of a microphone array from multiple people at various locations about the microphone array. The received sound is encoded in at least one format capable of representing spatial locations of the multiple people. The encoded sound is transmitted in the at least one format to a remote user system capable of rendering the sound in a manner that conveys the spatial locations to a user of the remote user system.