H04S3/008

SYSTEM AND METHOD FOR INTERPOLATING A HEAD-RELATED TRANSFER FUNCTION

This disclosure describes a system and method for Head-Related Transfer Function (HRTF) interpolation when an HRTF dataset does not contain a particular direction associated with a desired source. The disclosed HRTF interpolation uses a finite set of HRTFs from a dataset to obtain the HRTF of any possible direction and distance, even if the direction/distance doesn't exist on the current dataset.

Immersive sound for teleoperators

Immersive experiences for users are described herein. In an example, audio data from a plurality of audio sensors associated with a vehicle can be received by an audio data processing system. The audio data processing system can combine individual captured audio channels (e.g., from the plurality of audio sensors) into two or more audio channels for output via two or more speakers proximate a user. A first audio channel of the two or more audio channels can be output via a first speaker and second audio channel of the two or more audio channels to be output via a second speaker, wherein output of the first audio channel and the second audio channel causes a resulting sound corresponding to at least a portion of a sound scene associated with the vehicle. In an example, a user computing device operable by the user can receive an input from the user.

Electronic glasses that provide binaural sound
11606660 · 2023-03-14 ·

Electronic glasses track head movement of a user with respect to a location in empty space on top of a physical object. One or more processors process sound that externally localizes as binaural sound to the location in empty space on top of the physical object. Speakers in the electronic glasses provide the binaural sound to the user.

TRANSMISSION APPARATUS, RECEPTION APPARATUS, AND ACOUSTIC SYSTEM
20220337967 · 2022-10-20 ·

A transmission apparatus includes a first transmission unit that transmits sound data to a first sound channel in a transmission path, and a second transmission unit that transmits meta data related to the sound data to a second sound channel in the transmission path while ensuring synchronization with the sound data.

METHOD AND SYSTEM FOR MONITORING AND REPORTING SPEAKER HEALTH

A method is provided, including: defining a plurality of frequency bins; sending, during a training phase, a test signal at different amplitude levels to one or more speakers, and gathering resulting test voltage (V) and current (I) points for the different amplitude levels and for each frequency bin; for each frequency bin, applying a linear regression algorithm to the gathered test voltage and current points for the different amplitudes to obtain a reference electrical impedance of said one or more speakers; sending, during a monitoring phase subsequent to said training phase, an audio signal to said one or more speakers, and gathering resulting new voltage and current points to obtain an operating electrical impedance for said one or more speakers for each frequency bin, determining a deviation between the operating and the reference electrical impedance, and, if the deviation exceeds a defined tolerance, reporting the deviation to a user.

METHODS AND APPARATUS FOR DECODING A COMPRESSED HOA SIGNAL

Methods and apparatus for decoding a compressed Higher Order Ambisonics (HOA) representation of a sound or soundfield. The method may include receiving a bit stream containing the compressed HOA representation and decoding, based on a determination that there are multiple layers, the compressed HOA representation from the bitstream to obtain a sequence of decoded HOA representations. A first subset of the sequence of decoded HOA representations is determined based only on corresponding ambient HOA components. A second subset of the sequence of decoded HOA representations is determined based on corresponding ambient HOA components and corresponding predominant sound components. For a frame k, the sequence of decoded HOA representations are represented at least in part by

[00001]c^nk1=c^AMB,nk1c^nk1=c^PS,nk1+c^AMB,nk1,for n in the first subsetfor n in the second subset

where

[00002]c^AMB,nk1

corresponds to the corresponding ambient HOA components and

[00003]INTER-CHANNEL PHASE DIFFERENCE PARAMETER ENCODING METHOD AND APPARATUS

20230131892 · 2023-04-27 ·

The present disclosure discloses an inter-channel phase difference parameter encoding method, where a current frame is obtained; a signal type and a previous IPD parameter encoding scheme of a previous frame are obtained; a current IPD parameter encoding scheme is obtained at least based on the signal type of the previous frame and the previous IPD parameter encoding scheme; and an IPD parameter of the current frame is processed based on the current IPD parameter encoding scheme.

SIGNAL PROCESSING DEVICE, METHOD, AND PROGRAM

The present technology relates to a signal processing device, method, and program that can improve encoding efficiency.

A signal processing device includes: an acquisition unit that acquires reverb information including at least one of space reverb information specific to a space around an audio object or object reverb information specific to the audio object and an audio object signal of the audio object; and a reverb processing unit that generates a signal of a reverb component of the audio object on the basis of the reverb information and the audio object signal. The present technology can be applied to a signal processing device.

SYSTEM AND METHOD FOR MULTICHANNEL SPEECH DETECTION

Embodiments of the disclosure provide systems and methods for speech detection. The method may include receiving a multichannel audio input that includes a set of audio signals from a set of audio channels in an audio detection array. The method may further include processing the multichannel audio input using a neural network classifier to generate a series of classification results in a series of time windows for the multichannel audio input. The neural network classifier includes a causal temporal convolutional network (TCN) configured to determine a classification result for each time window based on portions of the multichannel audio input n the corresponding time window and one or more time windows before the corresponding time window. The method may additionally include determining whether the multichannel audio input includes one or more speech segments in the series of time windows based on the series of classification results.

METHOD AND APPARATUS FOR SPACE OF INTEREST OF AUDIO SCENE
20220335955 · 2022-10-20 · ·

Aspects of the disclosure include methods, apparatuses, and non-transitory computer-readable storage mediums for decoding audio data of an audio scene. One apparatus includes processing circuitry that receives first audio source data and second audio source data. The first audio source data corresponds to a space of interest in the audio scene and the second audio source data does not correspond to the space of interest in the audio scene. The space of interest in the audio scene is represented by at least one of a listener space, an audio channel, or an audio object. The processing circuitry decodes the first audio source data based on the space of interest.