Patent classifications
H04S2400/11
Methods for obtaining and reproducing a binaural recording
In one aspect, a method for providing a binaural recording to a listener with a head applied in a hearing system, whereas the binaural recording is listened to using a hearing device and whereas the binaural recording consists of a left binaural ear signal intended for a left ear of the listener, and a right binaural ear signal intended for a right ear of the listener, comprises determining a head orientation, determining a source direction of the binaural recording with respect to the head orientation, detecting a change of the head orientation to a new head orientation, adapting the binaural recording considering the source direction of the binaural recording and the new head orientation.
Using metadata to aggregate signal processing operations
A technique including receiving and decoding a coded bitstream encoded with audio content including first audio objects corresponding to a first media content type of two consecutive media content types and second audio objects corresponding to a second media content type of the two consecutive media content types, and audio metadata corresponding to the audio content. The audio metadata including first and second audio object gains, for the first and second audio objects, generated in part based on a first fading curve of the first media content type and a second fading curve of the second media content type, respectively. The technique further includes applying the first and second audio object gains to the first and second audio objects, and rendering a sound field represented by the first audio object with the applied first audio object gain and the second audio object with the applied second audio object gain.
Information processing method, information processing device, and non-transitory storage medium
An information processing method includes: receiving first space information including a first coordinate system of one of a logical space or a physical space, and second space information including a second coordinate system of the other of the logical space or the physical space; receiving first sound localization information indicating a position where a sound image is to be localized in the first coordinate system; and transforming the first sound localization information into second sound localization information indicating a position where the sound image is to be localized in the second coordinate system.
Audio renderer based on audiovisual information
An audio renderer can have a machine learning model that jointly processes audio and visual information of an audiovisual recording. The audio renderer can generate output audio channels. Sounds captured in the audiovisual recording and present in the output audio channels are spatially mapped based on the joint processing of the audio and visual information by the machine learning model. Other aspects are described.
System and method for an audio reproduction device
System and method for enhancing audio reproduced by an audio reproduction device with a first channel and second channel is described. X samples of audio signals are received and stored in a portion of an input buffer with 2x positions and rest of the x positions are padded with zero for both the channels. Contents of the input buffer are transformed to frequency domain (FD) components. FD components are multiplied with a first filter coefficient to generate FD components with short echo effect and with a second filter coefficient to generate FD components with long echo effect. Then, they are converted to time domain (TD) components with short echo effect and TD components with long echo effect. Selective TD components with short echo effect and long echo effect are combined to generate a convolved first channel output and a convolved second channel output.
Apparatus and Method for Synthesizing a Spatially Extended Sound Source Using Cue Information Items
An apparatus for synthesizing a spatially extended sound source includes: a spatial information interface for receiving a spatial range indication indicating a limited spatial range for the spatially extended sound source within a maximum spatial range; a cue information provider for providing one or more cue information items in response to the limited spatial range; and an audio processor for processing an audio signal representing the spatially extended sound source using the one or more cue information items.
SOUND REPRODUCTION METHOD, NON-TRANSITORY MEDIUM, AND SOUND REPRODUCTION DEVICE
A sound reproduction method includes: obtaining a first audio signal indicating a first sound which arrives at a listener from a first range and a second audio signal indicating a second sound which arrives at the listener from a predetermined direction; when the first range and the predetermined direction are determined to be included in a second range which is a back range relative to a front range in the direction that the head part of the listener faces, performing a correction process on at least one of the first audio signal or the second audio signal so that intensity of the second audio signal becomes higher than intensity of the first audio signal; and performing mixing of the at least one of the first audio signal or the second audio signal, and outputting, to an output channel, the first and second audio signals.
ACOUSTIC REPRODUCTION METHOD, RECORDING MEDIUM, AND ACOUSTIC REPRODUCTION SYSTEM
An acoustic reproduction method is an acoustic reproduction method for causing a user to perceive a first sound as a sound arriving from a first position in a three-dimensional sound field and a second sound as a sound arriving from a second position different from the first position in the three-dimensional sound field. The acoustic reproduction method includes: obtaining a movement speed of a head of the user; and generating an output sound signal for causing the user to perceive sounds that arrive from predetermined positions in the three-dimensional sound field. In the generating, when the movement speed obtained is greater than a first threshold, the output sound signal for causing the user to perceive the first sound and the second sound as a sound arriving from a third position between the first position and the second position is generated.
Audio processing apparatus and method, and program
The present technology relates to an audio processing apparatus and method and a program that make it possible to obtain sound of higher quality. An acquisition unit acquires an audio signal and metadata of an object. A vector calculation unit calculates, based on a horizontal direction angle and a vertical direction angle included in the metadata of the object and indicative of an extent of a sound image, a spread vector indicative of a position in a region indicative of the extent of the sound image. A gain calculation unit calculates, based on the spread vector, a VBAP gain of the audio signal in regard to each speaker by VBAP. The present technology can be applied to an audio processing apparatus.
Methods, apparatus and systems for a pre-rendered signal for audio rendering
The present disclosure relates to a method of decoding audio scene content from a bitstream by a decoder that includes an audio renderer with one or more rendering tools. The method comprises receiving the bitstream, decoding a description of an audio scene from the bitstream, determining one or more effective audio elements from the description of the audio scene, determining effective audio element information indicative of effective audio element positions of the one or more effective audio elements from the description of the audio scene, decoding a rendering mode indication from the bitstream, wherein the rendering mode indication is indicative of whether the one or more effective audio elements represent a sound field obtained from pre-rendered audio elements and should be rendered using a predetermined rendering mode, and in response to the rendering mode indication indicating that the one or more effective audio elements represent the sound field obtained from pre-rendered audio elements and should be rendered using the predetermined rendering mode, rendering the one or more effective audio elements using the predetermined rendering mode, wherein rendering the one or more effective audio elements using the predetermined rendering mode takes into account the effective audio element information, and wherein the predetermined rendering mode defines a predetermined configuration of the rendering tools for controlling an impact of an acoustic environment of the audio scene on the rendering output. The disclosure further relates to a method of generating audio scene content and a method of encoding audio scene content into a bitstream.