Patent classifications
H04S2400/15
Methods for obtaining and reproducing a binaural recording
In one aspect, a method for providing a binaural recording to a listener with a head applied in a hearing system, whereas the binaural recording is listened to using a hearing device and whereas the binaural recording consists of a left binaural ear signal intended for a left ear of the listener, and a right binaural ear signal intended for a right ear of the listener, comprises determining a head orientation, determining a source direction of the binaural recording with respect to the head orientation, detecting a change of the head orientation to a new head orientation, adapting the binaural recording considering the source direction of the binaural recording and the new head orientation.
Multi-frequency sensing system with improved smart glasses and devices
The systems and methods described relate to the concept that smart devices can be used to: sense various types of phenomena like sound, blue light exposure, RF and microwave radiation, and, in real-time, analyze, report and/or control outputs (e.g., displays or speakers). The systems are configurable and use standard computing devices, such as wearable electronics (e.g., smart glasses), tablet computers, and mobile phones to measure various frequency bands across multiple points, allowing a single user to visualize and/or adjust environmental conditions.
Audio renderer based on audiovisual information
An audio renderer can have a machine learning model that jointly processes audio and visual information of an audiovisual recording. The audio renderer can generate output audio channels. Sounds captured in the audiovisual recording and present in the output audio channels are spatially mapped based on the joint processing of the audio and visual information by the machine learning model. Other aspects are described.
Head-tracked spatial audio
Spatial filters are generated that map response of an audio capture device to head related transfer functions (HRTFs) for different positions of the audio capture device relative to the HRTFs. A current set of spatial filters are determined based on the plurality of spatial filters and a head position of a user. The microphone signals are convolved with the current set of spatial filters, resulting in a left audio channel and right audio channel that form output binaural audio channels. The binaural audio channels can be used to drive speakers of a headphone set to generate sound that is perceived to have a spatial quality. Other aspects are described and claimed.
Converting Binaural Signals to Stereo Audio Signals
An apparatus including circuitry configured to: obtain a binaural audio signal; obtain, based on the binaural audio signal, at least one direction parameter of at least one frequency band of the binaural audio signal; process the binaural audio signal to generate at least two audio signals for loudspeaker reproduction by modifying an inter-channel difference of the at least one frequency band of the binaural audio signal based on the at least one direction parameter for the at least one frequency band; and output the at least two audio signals for loudspeaker reproduction.
Audio processing apparatus and method, and program
The present technology relates to an audio processing apparatus and method and a program that make it possible to obtain sound of higher quality. An acquisition unit acquires an audio signal and metadata of an object. A vector calculation unit calculates, based on a horizontal direction angle and a vertical direction angle included in the metadata of the object and indicative of an extent of a sound image, a spread vector indicative of a position in a region indicative of the extent of the sound image. A gain calculation unit calculates, based on the spread vector, a VBAP gain of the audio signal in regard to each speaker by VBAP. The present technology can be applied to an audio processing apparatus.
Methods, apparatus and systems for a pre-rendered signal for audio rendering
The present disclosure relates to a method of decoding audio scene content from a bitstream by a decoder that includes an audio renderer with one or more rendering tools. The method comprises receiving the bitstream, decoding a description of an audio scene from the bitstream, determining one or more effective audio elements from the description of the audio scene, determining effective audio element information indicative of effective audio element positions of the one or more effective audio elements from the description of the audio scene, decoding a rendering mode indication from the bitstream, wherein the rendering mode indication is indicative of whether the one or more effective audio elements represent a sound field obtained from pre-rendered audio elements and should be rendered using a predetermined rendering mode, and in response to the rendering mode indication indicating that the one or more effective audio elements represent the sound field obtained from pre-rendered audio elements and should be rendered using the predetermined rendering mode, rendering the one or more effective audio elements using the predetermined rendering mode, wherein rendering the one or more effective audio elements using the predetermined rendering mode takes into account the effective audio element information, and wherein the predetermined rendering mode defines a predetermined configuration of the rendering tools for controlling an impact of an acoustic environment of the audio scene on the rendering output. The disclosure further relates to a method of generating audio scene content and a method of encoding audio scene content into a bitstream.
Method and apparatus for audio data processing
Embodiments of the disclosure provide methods and apparatuses processing audio data. The method can include: acquiring audio data by an audio capturing device, determining feature information of an enclosure in which the audio capturing device is located, and reverberating the feature information into the audio data.
System and method for differentially locating and modifying audio sources
A system and method for differentially locating and modifying audio sources that includes receiving multiple audio inputs from a set of distinct locations; determining a multi-dimensional audio map from the audio inputs; acquiring a set of positional audio control inputs applied to the audio map, each audio control input comprising a location and audio processing property; and generating an audio output according to the audio control inputs and the audio inputs. The audio control inputs capable of configuration through manual, automatic, computer vision analysis, and other configuration modes.
Head-mounted display apparatus, sound image output system, and method of outputting sound image
An HMD includes a display unit mounted on a head of a user, and configured to display an image such that a real object located in a real space is visually recognizable, a right earphone and a left earphone configured to output a sound, a position specification unit configured to specify a position of the real object and a virtual object, and a sound output control unit configured to generate a synthesized sound with the position of the virtual object as a sound source position, and output the synthesized sound from the right earphone and the left earphone. The sound output control unit adjusts the synthesized sound so that the synthesized sound becomes an audible sound bypassing the real object when the position of the real object is located between the position of the virtual object and a position of the display unit.