H04S2420/11

Method for interpolating a sound field, corresponding computer program product and device.
20220132262 · 2022-04-28

A method for interpolating a sound field captured by a plurality of N microphones, each outputting the encoded sound field in a form including at least one captured pressure and an associated pressure gradient vector. The method includes interpolating the sound field at an interpolation position, outputting an interpolated encoded sound field as a linear combination of the N encoded sound fields, each weighted by a corresponding weighting factor. The interpolation includes estimating the N weighting factors from at least: the interpolation position; a position of each of the N microphones; the N pressures captured by the N microphones; and an estimated power of the sound field at the interpolation position.
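The linear-combination step described above can be sketched as follows. This is a minimal illustration, not the claimed method: the names `EncodedField`, `combine_encoded_fields` and `inverse_distance_weights` are hypothetical, and the inverse-distance weights are a simple stand-in that ignores the captured pressures and the estimated power that the abstract lists as inputs to the weight estimation.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence, Tuple

Vec3 = Tuple[float, float, float]


@dataclass
class EncodedField:
    """One microphone's encoded sound field: a pressure and its gradient vector."""
    pressure: float
    gradient: Vec3


def combine_encoded_fields(fields: Sequence[EncodedField],
                           weights: Sequence[float]) -> EncodedField:
    """Return the weighted linear combination of the N encoded sound fields."""
    if len(fields) != len(weights):
        raise ValueError("need exactly one weight per encoded field")
    pressure = sum(w * f.pressure for f, w in zip(fields, weights))
    gx, gy, gz = (
        sum(w * f.gradient[axis] for f, w in zip(fields, weights))
        for axis in range(3)
    )
    return EncodedField(pressure, (gx, gy, gz))


def inverse_distance_weights(mic_positions: Sequence[Vec3],
                             target: Vec3,
                             eps: float = 1e-9) -> List[float]:
    """Stand-in weights (assumption): normalised inverse distance to the target.

    The abstract estimates the weights from positions, captured pressures and
    an estimated power; that estimation is not reproduced here.
    """
    inverse = [1.0 / (math.dist(m, target) + eps) for m in mic_positions]
    total = sum(inverse)
    return [v / total for v in inverse]
```

With two microphones equidistant from the interpolation position, the stand-in weights are equal and the interpolated field is the plain average of the two encoded fields.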

Methods, apparatus and systems for decompressing a Higher Order Ambisonics (HOA) signal

A method for compressing an HOA signal, which is an input HOA representation with input time frames (C(k)) of HOA coefficient sequences, comprises spatial HOA encoding of the input time frames and subsequent perceptual encoding and source encoding. Each input time frame is decomposed (802) into a frame of predominant sound signals (X_PS(k−1)) and a frame of an ambient HOA component (C̃_AMB(k−1)). The ambient HOA component (C̃_AMB(k−1)) comprises, in a layered mode, first HOA coefficient sequences of the input HOA representation (c_n(k−1)) in lower positions and second HOA coefficient sequences (c_AMB,n(k−1)) in remaining higher positions. The second HOA coefficient sequences are part of an HOA representation of a residual between the input HOA representation and the HOA representation of the predominant sound signals.

Systems and methods for spatial audio rendering

Systems and methods for rendering spatial audio in accordance with embodiments of the invention are illustrated. One embodiment includes a spatial audio system, including a primary network connected speaker, including a plurality of sets of drivers, where each set of drivers is oriented in a different direction, a processor system, memory containing an audio player application, wherein the audio player application configures the processor system to obtain an audio source stream from an audio source via the network interface, spatially encode the audio source, decode the spatially encoded audio source to obtain driver inputs for the individual drivers in the plurality of sets of drivers, where the driver inputs cause the drivers to generate directional audio.

Audio bandwidth reduction

A first device obtains, from the array, several audio signals and processes the audio signals to produce a speech signal and one or more ambient signals. The first device processes the ambient signals to produce a sound-object sonic descriptor that has metadata describing a sound object within an acoustic environment. The first device transmits, over a communication data link, the speech signal and the descriptor to a second electronic device that is configured to spatially reproduce the sound object using the descriptor mixed with the speech signal, to produce several mixed signals to drive several speakers.

Spatial audio signal format generation from a microphone array using adaptive capture

Apparatus including a processor configured to: receive at least two microphone audio signals; determine spatial metadata associated with the at least two microphone audio signals; and synthesize adaptively a plurality of spherical harmonic audio signals based on at least one microphone audio signal and the spatial metadata in order to output a pre-determined order spatial audio signal format.
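As a rough illustration of producing spherical harmonic signals from one audio signal plus spatial metadata, the sketch below encodes a single sample with a known direction into first-order ambisonics. It assumes the common AmbiX convention (ACN channel order W, Y, Z, X with SN3D normalisation) and a single direction per sample; the function name `encode_first_order` is hypothetical, and the adaptive synthesis described in the abstract is not modelled.

```python
import math
from typing import Tuple


def encode_first_order(sample: float,
                       azimuth: float,
                       elevation: float) -> Tuple[float, float, float, float]:
    """Encode one sample into first-order ambisonics (assumed ACN/SN3D).

    azimuth is measured counter-clockwise from the front, elevation upward
    from the horizontal plane, both in radians.
    Returns the channels in ACN order: (W, Y, Z, X).
    """
    w = sample
    y = sample * math.sin(azimuth) * math.cos(elevation)
    z = sample * math.sin(elevation)
    x = sample * math.cos(azimuth) * math.cos(elevation)
    return (w, y, z, x)
```

A source straight ahead (azimuth 0, elevation 0) lands entirely in W and X, while a source directly to the left (azimuth π/2) lands in W and Y.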

Spatial audio wind noise detection

A device includes one or more processors configured to obtain audio signals representing sound captured by at least three microphones and determine spatial audio data based on the audio signals. The one or more processors are further configured to determine a metric indicative of wind noise in the audio signals. The metric is based on a comparison of a first value and a second value. The first value corresponds to an aggregate signal based on the spatial audio data, and the second value corresponds to a differential signal based on the spatial audio data.
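One way to picture a metric that compares an aggregate signal with a differential signal is an energy ratio, sketched below. This is an assumption for illustration only: the abstract does not state how the two values are compared, the function names are hypothetical, and the two-channel sum/difference helper merely stands in for signals derived from the spatial audio data.

```python
from typing import List, Sequence, Tuple


def mean_energy(signal: Sequence[float]) -> float:
    """Mean squared amplitude of a block of samples (0.0 for an empty block)."""
    if not signal:
        return 0.0
    return sum(v * v for v in signal) / len(signal)


def wind_noise_metric(aggregate: Sequence[float],
                      differential: Sequence[float],
                      eps: float = 1e-12) -> float:
    """Hypothetical metric: differential energy divided by aggregate energy.

    Sound that reaches the capsules coherently mostly cancels in the
    differential signal, giving a small ratio; uncorrelated noise does not
    cancel, so the ratio grows.
    """
    return mean_energy(differential) / (mean_energy(aggregate) + eps)


def sum_and_difference(a: Sequence[float],
                       b: Sequence[float]) -> Tuple[List[float], List[float]]:
    """Form illustrative aggregate (a + b) and differential (a - b) signals."""
    if len(a) != len(b):
        raise ValueError("channels must have the same length")
    return [x + y for x, y in zip(a, b)], [x - y for x, y in zip(a, b)]
```

Identical channels give a ratio of zero, and two uncorrelated channels of equal power give a ratio near one. Any decision threshold between those cases would have to be tuned and is not suggested by the abstract.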

Soundfield adaptation for virtual reality audio

An example device includes a memory configured to store at least one spatial component and at least one audio source within a plurality of audio streams. The device also includes one or more processors coupled to the memory. The one or more processors are configured to receive, from motion sensors, rotation information. The one or more processors are configured to rotate the at least one spatial component based on the rotation information to form at least one rotated spatial component. The one or more processors are also configured to reconstruct ambisonic signals from the at least one rotated spatial component and the at least one audio source, wherein the at least one spatial component describes spatial characteristics associated with the at least one audio source in a spherical harmonic domain representation.
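Rotation in the spherical harmonic domain can be illustrated for the first-order case, where a rotation about the vertical axis (yaw) only mixes the X and Y coefficients. The sketch below is a simplification under stated assumptions: it handles first-order coefficients and yaw only, the function name `rotate_yaw_first_order` is hypothetical, and whether the sensor angle or its negative is applied depends on whether the field or the listener is being rotated.

```python
import math
from typing import Tuple


def rotate_yaw_first_order(w: float, x: float, y: float, z: float,
                           yaw: float) -> Tuple[float, float, float, float]:
    """Rotate first-order coefficients about the vertical axis by `yaw` radians.

    A source at azimuth phi ends up at azimuth phi + yaw. The omnidirectional
    (W) and vertical (Z) coefficients are unaffected by a yaw rotation.
    Returns (W, X, Y, Z).
    """
    cos_t, sin_t = math.cos(yaw), math.sin(yaw)
    x_rot = x * cos_t - y * sin_t
    y_rot = x * sin_t + y * cos_t
    return (w, x_rot, y_rot, z)
```

To keep a source fixed in the room while the listener turns their head by some angle, one would typically pass the negative of that angle.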

Direction of arrival estimation device, system, and direction of arrival estimation method

Provided is a direction of arrival estimation device wherein: a calculation circuit calculates a frequency weighting factor for each of a plurality of frequency components of signals recorded in a microphone array, on the basis of the differences among unit vectors indicating the directions of the sound sources of each of the plurality of frequency components; and an estimation circuit estimates the direction of arrival of a signal from the sound source, on the basis of the frequency weighting factors.
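The idea of weighting frequency components by how much their direction vectors differ can be sketched as below. The specific weighting rule (inverse of the mean Euclidean distance to the other unit vectors) and the names `frequency_weights` and `estimate_direction` are assumptions chosen for illustration; the abstract does not specify the formula.

```python
import math
from typing import List, Sequence, Tuple

Vec3 = Tuple[float, float, float]


def frequency_weights(unit_vectors: Sequence[Vec3], eps: float = 1e-6) -> List[float]:
    """Hypothetical rule: components that agree with the others get more weight."""
    n = len(unit_vectors)
    if n == 1:
        return [1.0]
    weights = []
    for i, u in enumerate(unit_vectors):
        # Mean distance from this component's direction to all the others.
        mean_diff = sum(
            math.dist(u, v) for j, v in enumerate(unit_vectors) if j != i
        ) / (n - 1)
        weights.append(1.0 / (mean_diff + eps))
    return weights


def estimate_direction(unit_vectors: Sequence[Vec3]) -> Vec3:
    """Weighted sum of the per-frequency unit vectors, renormalised to length 1."""
    weights = frequency_weights(unit_vectors)
    summed = [
        sum(w * u[axis] for w, u in zip(weights, unit_vectors))
        for axis in range(3)
    ]
    norm = math.sqrt(sum(c * c for c in summed))
    if norm == 0.0:
        raise ValueError("direction is undefined for an empty or cancelling input")
    return (summed[0] / norm, summed[1] / norm, summed[2] / norm)
```

With two components pointing along the x axis and one outlier along the y axis, the outlier is down-weighted, so the estimate leans further toward x than an unweighted average would.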

Detection method, detection apparatus, and program

Provided are a detection method, a detection device, and a program whose detected events do not differ even when physical characteristics of an acoustic signal change. The detection method includes a step of acquiring a target sound for detecting an event and a detecting step of detecting a desired event included in the acquired sound. In the detecting step, even when any one of the distance of the event's sound source, the direction of that sound source (both relative to the position where the target sound is collected), and the occurrence time of the event changes, the event is always detected as the same event.

Method of rendering one or more captured audio soundfields to a listener

A computer implemented system for rendering captured audio soundfields to a listener comprises apparatus to deliver the audio soundfields to the listener. The delivery apparatus delivers the audio soundfields to the listener with first and second audio elements perceived by the listener as emanating from first and second virtual source locations, respectively, and with the first audio element and/or the second audio element delivered to the listener from a third virtual source location. The first virtual source location and the second virtual source location are perceived by the listener as being located to the front of the listener, and the third virtual source location is located to the rear or the side of the listener.