H04S7/304

Wearable electronic device that displays a boundary of a three-dimensional zone
11510022 · 2022-11-22 ·

A wearable electronic device (WED) includes one or more sensors and cameras that determine a location of a physical object in a zone where the user is located and that track movement of an electronic device that moves to define a boundary of the zone. The WED includes a processor that generates binaural sound and a display that displays a virtual image of the boundary of the zone and a visual warning that notifies the user of the physical object.

Interaural time difference crossfader for binaural audio rendering

Examples of the disclosure describe systems and methods for presenting an audio signal to a user of a wearable head device. According to an example method, a first input audio signal is received, the first input audio signal corresponding to a source location in a virtual environment presented to the user via the wearable head device. The first input audio signal is processed to generate a left output audio signal and a right output audio signal. The left output audio signal is presented to the left ear of the user via a left speaker associated with the wearable head device. The right output audio signal is presented to the right ear of the user via a right speaker associated with the wearable head device. Processing the first input audio signal comprises applying a delay process to the first input audio signal to generate a left audio signal and a right audio signal; adjusting a gain of the left audio signal; adjusting a gain of the right audio signal; applying a first head-related transfer function (HRTF) to the left audio signal to generate the left output audio signal; and applying a second HRTF to the right audio signal to generate the right output audio signal. Applying the delay process to the first input audio signal comprises applying an interaural time delay (ITD) to the first input audio signal, the ITD determined based on the source location.

Information processing device, information processing method, and information processing program

An information processing device (100) according to the present disclosure includes: an acquisition unit (141) configured to acquire a first image including a content image of an ear of a user; and a calculation unit (142) configured to calculate, based on the first image acquired by the acquisition unit (141), a head-related transfer function corresponding to the user by using a learned model having learned to output a head-related transfer function corresponding to an ear when an image including a content image of the ear is input.

Acoustic output device and buttons thereof

The present disclosure relates to an acoustic output device including an earphone core, a controller, a Bluetooth module, and a button module. The earphone core may include at least one low-frequency acoustic driver configured to output sounds from at least two first guiding holes and at least one high-frequency acoustic driver configured to output sounds from at least two second guiding holes. The controller may be configured to direct the at least one low-frequency acoustic driver to output the sounds in a first frequency range and direct the at least one high-frequency acoustic driver to output the sounds in a second frequency range. The Bluetooth module may be configured to connect the acoustic output device with at least one terminal device. The button module may be configured to implement an interaction between a user of the acoustic output device and the acoustic output device.

Bidirectional propagation of sound

The description relates to rendering directional sound. One implementation includes receiving directional impulse responses corresponding to a scene. The directional impulse responses can correspond to multiple sound source locations and a listener location in the scene. The implementation can also include encoding the directional impulse responses to obtain encoded departure direction parameters for individual sound source locations. The implementation can also include outputting the encoded departure direction parameters, the encoded departure direction parameters providing sound departure directions from the individual sound source locations for rendering of sound.

AUDIO APPARATUS AND METHOD OF OPERATION THEREFOR

An audio apparatus, e.g. for rendering audio for a virtual/ augmented reality application, comprises a receiver (201) for receiving audio data for an audio scene including a first audio component representing a real-world audio source present in an audio environment of a user. A determinator (203) determines a first property of a real-world audio component from the real-world audio source and a target processor (205) determines a target property for a combined audio component being a combination of the real-world audio component received by the user and rendered audio of the first audio component received by the user. An adjuster (207) determines a render property by modifying a property of the first audio component indicated by the audio data for the first audio component in response to the target property and the first property. A renderer (209) renders the first audio component in response to the render property.

Spatial Audio Representation and Rendering
20220369061 · 2022-11-17 ·

An apparatus including circuitry configured to: obtain a spatial audio signal including at least one audio signal and spatial metadata associated with the at least one audio signal; obtain at least one data set related to binaural rendering; obtain at least one pre-defined data set related to binaural rendering; and generate a binaural audio signal based on a combination of at least part of the at least one data set and the at least one pre-defined data set, and the spatial audio signal.

IMMERSIVE MEDIA COMPATIBILITY
20230057207 · 2023-02-23 · ·

Aspects of the disclosure provide methods and apparatuses for audio processing. In some examples, an apparatus for media processing includes processing circuitry. The processing circuitry receives first six degrees of freedom (6 DoF) information associated with a media content for a scene in a media application. The first 6 DoF information includes a first spatial location and a first rotation orientation for rotation about a center at the first spatial location. The processing circuitry determines that a rendering platform for rendering the media content is a three degrees of freedom (3 DoF) platform; and calculates, a revolution orientation of the media content on a sphere centered other than the first spatial location, according to at least the first spatial location. The revolution orientation is 3 DoF information associated with the media content for rendering on the 3 DoF platform.

Autonomous gating selection to reduce noise in direct time-of-flight depth sensing

A depth camera assembly (DCA) includes a direct time of flight system for determining depth information for a local area. The DCA includes an illumination source, a camera, and a controller. The illumination source projects light (e.g., pulse of light) into the local area. The camera detects reflections of the projected light from objects in the local area. Using an internal gating selection procedure, the controller selects a gate window that is likely to be associated with reflection of a pulse of light from an object. The selected gate may be used for depth determination. The internal gating selection procedures may be achieved through external target location and selection or through internal self-selection.

Inertially stable virtual auditory space for spatial audio applications

During an initialization of a head pose tracker for a spatial audio system, a spatial audio ambience bed is rotated about a boresight vector to align the boresight vector with a center channel of the ambience bed. The boresight is computed using source device motion data and headset motion data. The ambience bed includes the center channel and one or more other channels. An ambience bed reference frame is aligned with a horizontal plane of a headset reference frame, such that the ambience bed is horizontally level with a user's ears. A first estimated gravity direction is fixed (made constant) in the ambience bed reference frame. During head pose tracking, the ambience bed reference frame is rolled about the boresight vector to align a second estimated gravity direction in the headset reference frame with the first estimated gravity direction fixed in the ambience bed reference frame.