Patent classifications
H04S2400/11
Autonomous gating selection to reduce noise in direct time-of-flight depth sensing
A depth camera assembly (DCA) includes a direct time of flight system for determining depth information for a local area. The DCA includes an illumination source, a camera, and a controller. The illumination source projects light (e.g., pulse of light) into the local area. The camera detects reflections of the projected light from objects in the local area. Using an internal gating selection procedure, the controller selects a gate window that is likely to be associated with reflection of a pulse of light from an object. The selected gate may be used for depth determination. The internal gating selection procedures may be achieved through external target location and selection or through internal self-selection.
Audio processing apparatus and method therefor
An audio processing apparatus comprises a receiver (705) which receives audio data including audio components and render configuration data including audio transducer position data for a set of audio transducers (703). A renderer (707) generating audio transducer signals for the set of audio transducers from the audio data. The renderer (7010) is capable of rendering audio components in accordance with a plurality of rendering modes. A render controller (709) selects the rendering modes for the renderer (707) from the plurality of rendering modes based on the audio transducer position data. The renderer (707) can employ different rendering modes for different subsets of the set of audio transducers the render controller (709) can independently select rendering modes for each of the different subsets of the set of audio transducers (703). The render controller (709) can select the rendering mode for a first audio transducer of the set of audio transducers (703) in response to a position of the first audio transducer relative to a predetermined position for the audio transducer. The approach may provide improved adaptation, e.g. to scenarios where most speakers are at desired positions whereas a subset deviate from the desired position(s).
Inertially stable virtual auditory space for spatial audio applications
During an initialization of a head pose tracker for a spatial audio system, a spatial audio ambience bed is rotated about a boresight vector to align the boresight vector with a center channel of the ambience bed. The boresight is computed using source device motion data and headset motion data. The ambience bed includes the center channel and one or more other channels. An ambience bed reference frame is aligned with a horizontal plane of a headset reference frame, such that the ambience bed is horizontally level with a user's ears. A first estimated gravity direction is fixed (made constant) in the ambience bed reference frame. During head pose tracking, the ambience bed reference frame is rolled about the boresight vector to align a second estimated gravity direction in the headset reference frame with the first estimated gravity direction fixed in the ambience bed reference frame.
Mixed reality system for context-aware virtual object rendering
A computer-implemented method in conjunction with mixed reality gear (e.g., a headset) includes imaging a real scene encompassing a user wearing a mixed reality output apparatus. The method includes determining data describing a real context of the real scene, based on the imaging; for example, identifying or classifying objects, lighting, sound or persons in the scene. The method includes selecting a set of content including content enabling rendering of at least one virtual object from a content library, based on the data describing a real context, using various selection algorithms. The method includes rendering the virtual object in the mixed reality session by the mixed reality output apparatus, optionally based on the data describing a real context (“context parameters”). An apparatus is configured to perform the method using hardware, firmware, and/or software.
System and method for realistic rotation of stereo or binaural audio
A system for rotating sound or selective listening to sound provides for the ability of the apparent direction of sound sources in a listening environment to remain in consistent orientations in space despite rotations of the microphones used to capture the sound and despite rotations of the head of the listener, even when wearing headphones. Modules are provided in the system to distinguish the sound sources and their apparent directions, as well as to optionally rotate the sound sources in response to detected rotations of the listener's head and/or detected rotations of the microphones.
Differential spatial rendering of audio sources
Methods and systems for intuitive spatial audio rendering with improved intelligibility are disclosed. By establishing a virtual association between an audio source and a location in the listener's virtual audio space, a spatial audio rendering system can generate spatial audio signals that create a natural and immersive audio field for a listener. The system can receive the virtual location of the source as a parameter and map the source audio signal to a source-specific multi-channel audio signal. In addition, the spatial audio rendering system can be interactive and dynamically modify the rendering of the spatial audio in response to a user's active control or tracked movement.
SYSTEM FOR AND METHOD OF GENERATING AN AUDIO IMAGE
A system for and a method of generating an audio image for use in rendering audio. The method comprises accessing an audio stream; accessing positional information, the positional information comprising a first position, a second position and a third position; and generating an audio image. In some embodiments, generating the audio image comprises generating, based on the audio stream, a first virtual wave front to be perceived by a listener as emanating from the first position; generating, based on the audio stream, a second virtual wave front to be perceived by the listener as emanating from the second position; and generating, based on the audio stream, a third virtual wave front to be perceived by the listener as emanating from the third position.
Mapping virtual sound sources to physical speakers in extended reality applications
One or more embodiments include an audio processing system for generating an audio scene for an extended reality (XR) environment. The audio processing system determines that a first virtual sound source associated with the XR environment affects a sound in the audio scene. The audio processing system generates a sound component associated with the first virtual sound source based on a contribution of the first virtual sound source to the audio scene. The audio processing system maps the sound component to a first loudspeaker included in a plurality of loudspeakers. The audio processing system outputs at least a first portion of the component for playback on the first loudspeaker.
Determining corrections to be applied to a multichannel audio signal, associated coding and decoding
A method and device for determining a set of corrections to be made to a multichannel sound signal, in which the set of corrections is determined on the basis of an item of information representative of a spatial image of an original multichannel signal and an item of information representative of a spatial image of the original multichannel signal that has been coded and then decoded.
SIGNAL PROCESSING DEVICE, SIGNAL PROCESSING METHOD, AND PROGRAM
The present technology relates to a signal processing device, signal processing method, and program capable of providing a higher realistic feeling.
A signal processing device includes: an acquisition unit that acquires audio data of an audio object and metadata including position information indicating a position of the audio object and direction information indicating a direction of the audio object; and a signal generation unit that generates a reproduction signal for reproducing a sound of the audio object at a listening position on the basis of listening position information indicating the listening position, listener direction information indicating a direction of a listener at the listening position, the position information, the direction information, and the audio data. The present technology is applicable to a transmission reproduction system.