H04S7/303

WORLD LOCK SPATIAL AUDIO PROCESSING
20230046341 · 2023-02-16 ·

A method for providing a world-locked experience to a user of a headset in an immersive reality application includes receiving, from an immersive reality application, a first audio waveform from a first acoustic source to provide to a user of a headset, determining a direction of arrival for the first acoustic source relative to the headset, and providing, to a speaker in the headset, an audio signal including the first audio waveform and intended for an ear of the user of the headset, wherein the audio signal includes a time delay and an amplitude for the first audio waveform based on the direction of arrival for the first acoustic source relative to the user of the headset. A non-transitory, computer-readable medium storing instructions which, when executed by a processor, cause a system to perform the above method, and the system, are also provided.

CAMERA-VIEW ACOUSTIC FENCE
20230053202 · 2023-02-16 ·

Determining the angle of sound relative to the centerline of a microphone array. The angle of the centerline of a camera field-of-view (FoV) and the angle of the camera FoV is determined. Knowing the angle from the centerline of the microphone array of the particular sound and then the angle of the centerline of the camera FoV and angles of the camera FoV allows a determination if the sound is inside the FoV of the camera. If so, the microphones are unmuted. If not, the microphones are muted. As the camera zooms or pans, the changes in camera FoV and centerline angle are computed and used with the sound angle, so that the muting and unmuting occurs automatically as the camera zoom and pan angle change.

XR RENDERING FOR 3D AUDIO CONTENT AND AUDIO CODEC
20230051841 · 2023-02-16 ·

A device includes a memory configured to store instructions and also includes one or more processors configured to execute the instructions to obtain audio data corresponding to a sound source and metadata indicative of a direction of the sound source. The one or more processors are configured to execute the instructions to obtain direction data indicating a viewing direction associated with a user of a playback device. The one or more processors are configured to execute the instructions to determine a resolution setting based on a similarity between the viewing direction and the direction of the sound source. The one or more processors are also configured to execute the instructions to process the audio data based on the resolution setting to generate processed audio data.

Audio Processing Methods and Systems for a Multizone Augmented Reality Space

An illustrative audio processing system identifies an experience location with which an augmented reality presentation device is associated. The experience location is included within a multizone augmented reality space that is presented by the augmented reality presentation device. The audio processing system determines that the experience location is within both a first sound zone and a second sound zone of the multizone augmented reality space, and, based on the determining that the experience location is within both the first and second sound zones, generates a binaural audio stream for presentation by the augmented reality presentation device. The binaural audio stream includes an environmental audio component implemented by a mix of a first environmental audio stream associated with the first sound zone and a second environmental audio stream associated with the second sound zone. Corresponding methods and systems are also disclosed.

SOUND EFFECT ADJUSTMENT
20230041730 · 2023-02-09 · ·

A sound effect adjustment method is provided. In the method, a video frame and an audio signal of a corresponding time unit of a target video are obtained. A sound source orientation and a sound source distance of a sound source object in the video frame are determined. Scene information corresponding to the video frame is determined. The audio signal is filtered based on the sound source orientation and the sound source distance. An echo coefficient is determined according to the scene information. Further, an adjusted audio signal with an adjusted sound effect is generated based on the filtered audio signal and the echo coefficient.

AUDIO DEVICE AUTO-LOCATION

A method for estimating an audio device location in an environment may involve obtaining direction of arrival (DOA) data for each audio device of a plurality of audio devices in the environment and determining interior angles for each of a plurality of triangles based on the DOA data. Each triangle may have vertices that correspond with audio device locations. The method may involve determining a side length for each side of each of the triangles, performing a forward alignment process of aligning each of the plurality of triangles produce a forward alignment matrix and performing a reverse alignment process of aligning each of the plurality of triangles in a reverse sequence to produce a reverse alignment matrix. A final estimate of each audio device location may be based, at least in part, on values of the forward alignment matrix and values of the reverse alignment matrix.

SIGNAL PROCESSING APPARATUS AND METHOD, ACOUSTIC REPRODUCTION APPARATUS, AND PROGRAM

The present technology relates to a signal processing apparatus and method, an acoustic reproduction apparatus, and a program that can achieve more realistic acoustic presentation.

The signal processing apparatus includes an acoustic transfer characteristic convolution processing unit that convolves an acoustic transfer characteristic according to a current position of a presentation destination of a virtual sound source among the acoustic transfer characteristic associated with each position in a space and virtual sound source data of the virtual sound source. The present technology can be applied to an acoustic reproduction system.

IMMERSIVE SOUND REPRODUCTION USING MULTIPLE TRANSDUCERS
20230042762 · 2023-02-09 ·

One or more embodiments include techniques for generating immersive audio for an acoustic system. The techniques include determining an apparent location associated with a portion of audio; calculating, for each speaker included in a plurality of speakers of the acoustic system, a perceptual distance between the speaker and the apparent location; selecting a subset of speakers included in the plurality of speakers based on the perceptual distances between the plurality of speakers and the apparent location; generating a set of filters based on the subset of speakers and one or more target characteristics of the acoustic system; and generating, for each speaker included in the subset of speakers, a speaker signal using one or more filters included in the set of filters.

Arrangement for producing head related transfer function filters
11557055 · 2023-01-17 · ·

When three-dimensional audio is produced by using headphones, particular HRTF-filters are used to modify sound for the left and right channels of the headphone. As the morphology of every ear is different, it is beneficial to have HRTF-filters particularly designed for the user of headphones. Such filters may be produced by deriving ear geometry from a plurality of images taken with an ordinary camera, detecting necessary features from images and fitting said features to a model that has been produced from accurately scanned ears comprising representative values for different sizes and shapes. Taken images are sent to a server (52) that performs the necessary computations and submits the data further or produces the requested filter.

SYSTEM AND METHOD FOR VIRTUAL SOUND EFFECT WITH INVISIBLE LOUDSPEAKER(S)

In at least one embodiment, an apparatus for providing a virtual sound effect in a listening environment is provided. The apparatus includes at least one controller and an audio playback device. The audio playback device includes the at least one controller that is programmed to receive an audio input signal from an audio input source and to apply a head related transfer function (HRTF) to the audio input signal. The at least one controller is further programmed to apply crosstalk cancellation to the audio input signal and to generate an audio output signal after applying the HRTF and the crosstalk cancellation to the audio input signal for playback by at least one loudspeaker that is invisible to a listener in the listening environment.