H04S3/004

SYSTEMS AND METHODS FOR GENERATING VIDEO-ADAPTED SURROUND-SOUND
20220360933 · 2022-11-10 ·

Audiovisual presentations, such as film recordings, may have been originally created having an audio soundtrack with multiple audio tracks mixed for a surround sound system that includes a set of speakers physically surrounding a user. The present disclosure presents systems and methods to remix these soundtracks into 3D audio that when presented to the ears of a user can be perceived as a virtual surround sound system that mimics the physical system. What is more, the disclosed systems and methods can enhance the virtual surround sound system by adjusting virtual speakers of the virtual surround sound system according to video content of the audiovisual presentation. Further enhancement may be possible by adjusting the virtual speakers of the virtual surround sound system according to a sensed position of a user.

Method and System for Surround Sound Processing in an Audio Device
20230033891 · 2023-02-02 ·

An audio headset may receive a plurality of audio signals corresponding to plurality of surround sound channels. The headset may determine, via its audio processing circuitry, context and/or content of the audio signals. The audio processing circuitry may process the audio signals to generate stereo signals carrying one or more virtual surround channels, wherein the processing comprises automatically controlling, based on the context and the content of the audio signals, a simulated acoustic environment of the virtual surround channels.

Ambient sound activated device

Environmental sound is recorded using one or more microphones. A source of the recorded environmental sound is classified. The recorded environmental sound is weighted based on the classification of the source and the source media sound using a weighting mode to determine whether to mix the recorded environmental. The recorded environmental sound is mixed with source media sound to produce a mixed sound based on the determination. The mixed sound is played over one or more speakers.

Information processing device and information processing method

Provided is an information processing device that controls and presents sound information in an appropriate form to a user who acts in an environment on the basis of situation recognition including recognition of the environment and recognition of the actions of the user. The information processing device includes: a sensor that detects an object; an open ear style earpiece that is worn on an ear of a listener, and includes an acoustics generation unit, and a sound guide portion that transmits a sound generated by the acoustics generation unit into an earhole; and a processing unit that processes sound information of a sound source, the sound information being generated by the acoustics generation unit, the processing unit acquiring the sound information of the sound source corresponding to the object detected by the sensor, and a process of localizing a sound image of the acquired sound source while varying a position of the sound image in accordance with a position in a three-dimensional acoustic space, the position in the three-dimensional acoustic space corresponding to a position of the detected object.

Head to headset rotation transform estimation for head pose tracking in spatial audio applications

In an embodiment, a method comprises: estimating a first gravity direction in a source device reference frame for a source device; estimating a second gravity direction in a headset reference frame for a headset; estimating a rotation transform from the headset frame into a face reference frame using the first and second estimated gravity directions, a rotation transform from a camera reference frame to the source device reference frame, and a rotation transform from the face reference frame to the camera reference frame; estimating a relative position and attitude using source device motion data, headset motion data and the rotation transform from the headset frame to the face reference frame; using the relative position and attitude to estimate a head pose; and using the estimated head pose to render spatial audio for playback on the headset.

Generating Binaural Audio in Response to Multi-Channel Audio Using at Least One Feedback Delay Network

In some embodiments, virtualization methods for generating a binaural signal in response to channels of a multi-channel audio signal, which apply a binaural room impulse response (BRIR) to each channel including by using at least one feedback delay network (FDN) to apply a common late reverberation to a downmix of the channels. In some embodiments, input signal channels are processed in a first processing path to apply to each channel a direct response and early reflection portion of a single-channel BRIR for the channel, and the downmix of the channels is processed in a second processing path including at least one FDN which applies the common late reverberation. Typically, the common late reverberation emulates collective macro attributes of late reverberation portions of at least some of the single-channel BRIRs. Other aspects are headphone virtualizers configured to perform any embodiment of the method.

SOUND TRACING METHOD AND DEVICE TO IMPROVE SOUND PROPAGATION PERFORMANCE
20230199418 · 2023-06-22 · ·

There is provided a sound tracing method. The method includes: a setup processing step of setting ray information; a ray generation step of generating a sound ray; a traversal/intersection test step of generating hit triangle information; a propagation path validation (PPV) step of searching a sound path; a guide plane sort step of generating and sorting a guide plane; a Reverb Geometry Collect/Reverb Plane Sort (RGC/RPS) step of generating and sorting a reverb plane; and an impulse response calculation step of calculating an impulse response (IR) and storing the calculated IR in a valid path buffer.

Audio content format selection
11683654 · 2023-06-20 · ·

Systems, devices, apparatuses, components, methods, and techniques for providing media content to a media playback system are provided. The techniques provide for determining whether a playback system is connected, or otherwise associated with, a playback device suitable for 3D audio playback. Upon determining that a playback system is or is not connected or otherwise associated with a playback device suitable for 3D audio playback, audio content in a corresponding format may then be transferred and/or played by a requesting system. In some examples, one or more filters may be applied to requested audio content to compensate for determined user head movement and/or to create simulated 3D audio from generic two-channel recording.

Speaker system, sound processing device, sound processing method, and recording medium

A speaker system includes a wearable speaker capable of outputting a first sound which is a voice of a communication partner of a talker and a second sound, a microphone, and a sound processing device which processes a sound output from the wearable speaker and a sound picked up by the microphone. The sound processing device generates a reference signal by synthesizing a first signal indicating the first sound and a second signal indicating the second sound, outputs the first signal and the second signal to the wearable speaker, obtains a sound pickup signal including the voice of the talker from the microphone, performs, on the sound pickup signal, a process of cancelling the sound component output from the wearable speaker by using the reference signal, and outputs the sound pickup signal on which the cancellation process has been performed.

IMPEDANCE MATCHING FILTERS AND EQUALIZATION FOR HEADPHONE SURROUND RENDERING

Embodiments are described for designing a filter in a magnitude domain performing an impedance filtering function over a frequency domain to compensate for directional cues for the left and right ears of the listener as a function of virtual source angles during headphone virtual sound reproduction. The filter is derived by obtaining blocked ear canal and open ear canal transfer functions for loudspeakers placed in a room, obtaining an open ear canal transfer function for a headphone placed on a listening subject, and dividing the loudspeaker transfer functions by the headphone transfer function to invert a headphone response at the entrance of the ear canal and map the ear canal function from the headphone to free field.