Patent classifications
H04S3/004
INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND INFORMATION PROCESSING PROGRAM
An information processing device (100) according to the present disclosure includes: an acquisition unit (141) configured to acquire a first image including a content image of an ear of a user; and a calculation unit (142) configured to calculate, based on the first image acquired by the acquisition unit (141), a head-related transfer function corresponding to the user by using a learned model having learned to output a head-related transfer function corresponding to an ear when an image including a content image of the ear is input.
Method and apparatus for processing a stereo signal
The disclosure relates to a method for processing a stereo signal. The method can include obtaining a center channel signal by up-mixing the stereo signal. The method can also include generating a filtered center channel signal by applying one or more peak filters and one or more notch filters to the center channel signal. Furthermore, the method can include generating a binaural signal based on the filtered center channel signal.
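The three steps in this abstract (up-mix, peak/notch filtering, binaural output) can be illustrated with a minimal sketch. The abstract does not say how the up-mix or the binaural stage is done, so the sketch assumes a simple mid-signal up-mix (half the sum of left and right) and a placeholder binaural stage that feeds the same filtered center to both ears. The biquad coefficient formulas follow the widely used Audio EQ Cookbook form; all function names and parameter values are hypothetical.

```python
import math

def peak_coeffs(f0, fs, q, gain_db):
    """Peaking biquad (Audio EQ Cookbook form), normalized so a0 == 1."""
    a_lin = 10 ** (gain_db / 40)
    w0 = 2 * math.pi * f0 / fs
    alpha = math.sin(w0) / (2 * q)
    a0 = 1 + alpha / a_lin
    b = ((1 + alpha * a_lin) / a0, -2 * math.cos(w0) / a0, (1 - alpha * a_lin) / a0)
    a = (-2 * math.cos(w0) / a0, (1 - alpha / a_lin) / a0)
    return b, a

def notch_coeffs(f0, fs, q):
    """Notch biquad (Audio EQ Cookbook form), normalized so a0 == 1."""
    w0 = 2 * math.pi * f0 / fs
    alpha = math.sin(w0) / (2 * q)
    a0 = 1 + alpha
    b = (1 / a0, -2 * math.cos(w0) / a0, 1 / a0)
    a = (-2 * math.cos(w0) / a0, (1 - alpha) / a0)
    return b, a

def biquad(x, coeffs):
    """Apply one second-order section (direct form I) to a list of samples."""
    (b0, b1, b2), (a1, a2) = coeffs
    x1 = x2 = y1 = y2 = 0.0
    out = []
    for xn in x:
        yn = b0 * xn + b1 * x1 + b2 * x2 - a1 * y1 - a2 * y2
        x2, x1 = x1, xn
        y2, y1 = y1, yn
        out.append(yn)
    return out

def upmix_center(left, right):
    # Assumption: the center channel is the mid signal of the stereo pair.
    return [0.5 * (l + r) for l, r in zip(left, right)]

def process_stereo(left, right, fs, peaks, notches):
    """peaks: iterable of (f0, q, gain_db); notches: iterable of (f0, q)."""
    center = upmix_center(left, right)
    for f0, q, gain_db in peaks:
        center = biquad(center, peak_coeffs(f0, fs, q, gain_db))
    for f0, q in notches:
        center = biquad(center, notch_coeffs(f0, fs, q))
    # Assumption: placeholder binaural stage, same filtered center in both ears.
    return list(center), list(center)
```

In this sketch the peak and notch filters are applied in series; a real implementation would choose their frequencies and gains from the spectral cues it wants to impose, which the abstract does not specify.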
VIRTUAL REALITY HEADSET AUDIO SYNCHRONISATION SYSTEM
A system 3 synchronises a plurality of virtual reality headsets 2 to corresponding multichannel audio. System 3 has a plurality of independently operable virtual reality headsets each having one or more images preloaded thereon and a distribution device 4 in the form of a server is in Wi-Fi communication via a wireless access point 5 with each virtual reality headset where the server 4 triggers the one or more preloaded images on each headset. A sound rack device 6 is in Wi-Fi communication with server 4 and configured to send time code signals thereto. A sound mixer 7 is in cable communication with the sound rack device 6 and receives audio file data corresponding to virtual reality image content therefrom simultaneously with the server 4. The server 4 is cable connected to the wireless access point 5 and configured to simultaneously message each headset 2. At least one speaker element is provided for each channel of audio being connected to the mixer 7 such that upon receipt of the audio file, the mixer 7 is configured to relay each channel audio signal to respective speakers 8 in synchronisation with the VR headset images.
SOUND GENERATOR WEARABLE ON THE HEAD, SIGNAL PROCESSOR AND METHOD FOR OPERATING A SOUND GENERATOR OR A SIGNAL PROCESSOR
Sound generator wearable on the head, comprising: a first sound generator element on a first side; and a second sound generator element on a second side, wherein at least a first sound transducer and a second sound transducer are arranged in the first sound generator element such that sound emission directions of the first sound transducer and the second sound transducer are parallel or deviate by less than 30° from a parallel emission direction, and wherein a third sound transducer and a fourth sound transducer are arranged in the second sound generator element such that sound emission directions of the third sound transducer and the fourth sound transducer are parallel to one another or deviate by less than 30° from a parallel emission direction.
Spatial audio controller
A method performed by a local device that is communicatively coupled with several remote devices, the method includes: receiving, from each remote device with which the local device is engaged in a communication session, an input audio stream; receiving, for each remote device, a set of parameters; determining, for each input audio stream, whether the input audio stream is to be 1) rendered individually or 2) rendered as a mix of input audio streams based on the set of parameters; for each input audio stream that is determined to be rendered individually, spatially rendering the input audio stream as an individual virtual sound source that contains only that input audio stream; and for input audio streams that are determined to be rendered as the mix of input audio streams, spatially rendering the mix of input audio streams as a single virtual sound source that contains the mix of input audio streams.
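The individual-versus-mix decision described in this abstract can be sketched as follows. The abstract does not say which parameters drive the decision, so the sketch assumes a single hypothetical per-device parameter, `voice_activity`, compared against a hypothetical threshold. Class and function names are illustrative only, and the spatial rendering itself is left out.

```python
from dataclasses import dataclass

@dataclass
class RemoteStream:
    device_id: str
    samples: list          # one mono audio frame from the remote device
    voice_activity: float  # hypothetical parameter in [0, 1]

@dataclass
class VirtualSource:
    label: str
    samples: list

def plan_rendering(streams, threshold=0.5):
    """Split streams into individual virtual sources and one mixed source."""
    # Assumption: streams at or above the threshold are rendered individually.
    individual = [s for s in streams if s.voice_activity >= threshold]
    mixed = [s for s in streams if s.voice_activity < threshold]

    sources = [VirtualSource(s.device_id, list(s.samples)) for s in individual]
    if mixed:
        # Sum the remaining streams sample by sample into a single source.
        mix = [sum(frame) for frame in zip(*(s.samples for s in mixed))]
        sources.append(VirtualSource("mix", mix))
    return sources
```

Each returned `VirtualSource` would then be handed to a spatial renderer at its own position; grouping the low-priority streams keeps the number of rendered sources small.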
Method and apparatus for performing binaural rendering of audio signal
A method and apparatus for performing binaural rendering of an audio signal are provided. The method includes identifying an input signal that is based on an object, and metadata that includes distance information indicating a distance to the object, generating a binaural filter that is based on the metadata, using a binaural room impulse response, obtaining a binaural filter to which a low-pass filter (LPF) is applied, using a frequency response control that is based on the distance information, and generating a binaural-rendered output signal by performing a convolution of the input signal and the binaural filter to which the LPF is applied.
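A minimal sketch of the pipeline in this abstract: derive a cutoff from the distance metadata, low-pass the binaural room impulse response (BRIR), then convolve the input with the filtered response. The distance-to-cutoff mapping, the one-pole filter, and all names are assumptions chosen for brevity; the abstract does not specify the frequency response control.

```python
import math

def cutoff_for_distance(distance_m, near_cutoff_hz=16000.0):
    # Hypothetical mapping: the farther the object, the lower the cutoff.
    return near_cutoff_hz / (1.0 + distance_m)

def one_pole_lowpass(x, cutoff_hz, fs):
    """First-order low-pass: y[n] = y[n-1] + a * (x[n] - y[n-1])."""
    a = 1.0 - math.exp(-2.0 * math.pi * cutoff_hz / fs)
    state = 0.0
    out = []
    for v in x:
        state += a * (v - state)
        out.append(state)
    return out

def convolve(x, h):
    """Direct-form linear convolution of two sample lists."""
    out = [0.0] * (len(x) + len(h) - 1)
    for i, xv in enumerate(x):
        for j, hv in enumerate(h):
            out[i + j] += xv * hv
    return out

def render_binaural(x, brir_left, brir_right, distance_m, fs):
    """Return (left, right) output for one object at the given distance."""
    fc = cutoff_for_distance(distance_m)
    # The filtered BRIR keeps its original length, so the low-pass tail
    # is truncated; this is a simplification of the sketch.
    left = one_pole_lowpass(brir_left, fc, fs)
    right = one_pole_lowpass(brir_right, fc, fs)
    return convolve(x, left), convolve(x, right)
```

Filtering the BRIR once and then convolving is equivalent to filtering the rendered output, but it lets the filtered response be reused while the distance stays unchanged.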
Binaural beamforming microphone array
A binaural beamformer comprising two beamforming filters may be communicatively coupled to a microphone array to generate two beamforming outputs, one for the left ear and the other for the right ear. The beamforming filters may be configured in such a way that they are orthogonal to each other to make white noise components in the binaural outputs substantially uncorrelated and desired signal components in the binaural outputs highly correlated. As a result, the human auditory system may better separate the desired signal from white noise and intelligibility of the desired signal may be improved.
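The effect of orthogonal filters can be checked numerically with a toy example. For spatially white noise with unit variance at every microphone, the cross-correlation of the two beamformer outputs is proportional to the inner product of the two weight vectors, so orthogonal weights give uncorrelated noise. The sketch below assumes real-valued weights for a single frequency bin, a four-microphone array, and a desired source that arrives identically at all microphones; the weight values and function names are hypothetical, not taken from the patent.

```python
import random

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def beamform(frames, weights):
    """Apply one weight vector to every multi-microphone frame."""
    return [dot(frame, weights) for frame in frames]

def correlation(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / (vx * vy) ** 0.5

# Hypothetical weights: orthogonal to each other, and each sums to 1 so the
# desired source (identical at all microphones) passes with unit gain.
H_LEFT = [0.5, 0.5, 0.5, -0.5]
H_RIGHT = [0.5, 0.5, -0.5, 0.5]

def simulate(num_frames=20000, seed=0):
    rng = random.Random(seed)
    # Independent Gaussian noise at each of the four microphones.
    noise = [[rng.gauss(0.0, 1.0) for _ in range(4)] for _ in range(num_frames)]
    # Desired source modeled as the same sample at every microphone.
    source = [rng.gauss(0.0, 1.0) for _ in range(num_frames)]
    signal = [[s] * 4 for s in source]
    noise_corr = correlation(beamform(noise, H_LEFT), beamform(noise, H_RIGHT))
    signal_corr = correlation(beamform(signal, H_LEFT), beamform(signal, H_RIGHT))
    return noise_corr, signal_corr
```

Calling `simulate()` returns the noise correlation (expected to be close to zero) and the desired-signal correlation (expected to be close to one), which mirrors the separation the abstract attributes to the human auditory system.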
Systems and methods for generating video-adapted surround-sound
Audiovisual presentations, such as film recordings, may have been originally created having an audio soundtrack with multiple audio tracks mixed for a surround sound system that includes a set of speakers physically surrounding a user. The present disclosure presents systems and methods to remix these soundtracks into 3D audio that when presented to the ears of a user can be perceived as a virtual surround sound system that mimics the physical system. What is more, the disclosed systems and methods can enhance the virtual surround sound system by adjusting virtual speakers of the virtual surround sound system according to video content of the audiovisual presentation. Further enhancement may be possible by adjusting the virtual speakers of the virtual surround sound system according to a sensed position of a user.
Spatial Audio Augmentation and Reproduction
A method including: obtaining at least one spatial audio signal including at least one audio signal, wherein the at least one spatial audio signal at least partially defines an audio scene; obtaining at least one augmentation audio signal; determining at least two audio objects based upon the at least one augmentation audio signal; determining audio-object dependency information for the determined at least two audio objects; and augmenting the audio scene based, at least partially, on both the determined at least two audio objects and the determined audio-object dependency information.
Directional Sound Recording and Playback
Systems and methods for providing improved localization of recorded and played back sound are provided by improved microphone arrays for recording sound and by improved systems for playback of sound. Microphone arrays include four microphones with sound transducers located and aimed to mimic capture of sound by human ears. Sound captured by two side-viewing microphones is attenuated, at the time of sound capture and/or recording, at a later processing stage, or at the time of sound playback, by low-pass filtering. The recording maintains four separate channels of sound. Playback occurs through four speakers arranged to reproduce sound in the way human ears hear sound, with appropriate attenuation for side speakers. Playback can also occur through four-channel headphones. Improved playback of two-channel stereo sound can also occur through low-pass filtering of each track and playing the filtered sound through side/rear speakers on the opposite sides.