Patent classifications
H04S2420/13
Methods and apparatus for compressing and decompressing a higher order ambisonics representation
Higher Order Ambisonics represents three-dimensional sound independent of a specific loudspeaker set-up. However, transmission of an HOA representation results in a very high bit rate. Therefore, compression with a fixed number of channels is used, in which directional and ambient signal components are processed differently. The ambient HOA component is represented by a minimum number of HOA coefficient sequences. The remaining channels contain either directional signals or additional coefficient sequences of the ambient HOA component, depending on what will result in optimum perceptual quality. This processing can change on a frame-by-frame basis.
RENDERING AUDIO OBJECTS HAVING APPARENT SIZE
Methods, systems, and computer program products for rending an audio object having an apparent size are disclosed. An audio processing system receives audio panning data including a first grid mapping first virtual sound sources in a space and speaker positions to speaker gains. The first grid specifies first speaker gains of the first virtual sound sources in the space. The audio processing system determines a second grid of second virtual sound sources in the space, including mapping the first virtual sound sources into the second virtual sound sources of the second virtual sources. The audio processing system selects at least one of the first grid or second grid for rendering an audio object based on an apparent size of the audio object. The audio processing system renders the audio object based on the selected grid or grids.
Apparatus, method and computer program for encoding, decoding, scene processing and other procedures related to DirAC based spatial audio coding using diffuse compensation
An apparatus for generating a sound field description from an input signal having one or more channels including: an input signal analyzer for obtaining diffuseness data from the input signal; a sound component generator for generating, from the input signal, one or more sound field components of a first group of sound field components having for each sound field component a direct component and a diffuse component, and for generating, from the input signal, a second group of sound field components having only a direct component, the sound component generator being configured to perform an energy compensation when generating the first group of sound field components, the energy compensation depending on the diffuseness data and a number of sound field components in the second group, a number of diffuse components in the first group, or a maximum order of sound field components.
APPARATUS, METHOD AND COMPUTER PROGRAM FOR ENCODING, DECODING, SCENE PROCESSING AND OTHER PROCEDURES RELATED TO DIRAC BASED SPATIAL AUDIO CODING USING DIRECT COMPONENT COMPENSATION
An apparatus for generating a sound field description from an input signal having at least two channels has: an input signal analyzer for obtaining direction data and diffuseness data from the input signal; an estimator for estimating a first energy- or amplitude-related measure for an omnidirectional component derived from the input signal and for estimating a second energy- or amplitude-related measure for a directional component derived from the input signal, and a sound component generator for generating sound field components of the sound field, wherein the sound component generator is configured to perform an energy compensation of the directional component using the first energy- or amplitude-related measure, the second energy- or amplitude-related measure, the direction data and the diffuseness data.
METHODS AND APPARATUS FOR COMPRESSING AND DECOMPRESSING A HIGHER ORDER AMBISONICS REPRESENTATION
Higher Order Ambisonics represents three-dimensional sound independent of a specific loudspeaker set-up. However, transmission of an HOA representation results in a very high bit rate. Therefore, compression with a fixed number of channels is used, in which directional and ambient signal components are processed differently. The ambient HOA component is represented by a minimum number of HOA coefficient sequences. The remaining channels contain either directional signals or additional coefficient sequences of the ambient HOA component, depending on what will result in optimum perceptual quality. This processing can change on a frame-by-frame basis.
INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND PROGRAM
Provided is an information processing apparatus including: a speaker array that includes a plurality of speakers, and performs wavefront synthesis by using an output of the plurality of speakers; and a presentation unit that presents visual information indicating a state of waves on a wavefront formed in the wavefront synthesis, or presents visual information based on positional information of a virtual sound image that has been formed in a position that is different from a vicinity of the speaker array in the wavefront synthesis.
APPARATUS, METHOD AND COMPUTER PROGRAM FOR ENCODING, DECODING, SCENE PROCESSING AND OTHER PROCEDURES RELATED TO DIRAC BASED SPATIAL AUDIO CODING USING DIFFUSE COMPENSATION
An apparatus for generating a sound field description from an input signal having one or more channels including: an input signal analyzer for obtaining diffuseness data from the input signal; a sound component generator for generating, from the input signal, one or more sound field components of a first group of sound field components having for each sound field component a direct component and a diffuse component, and for generating, from the input signal, a second group of sound field components having only a direct component, the sound component generator being configured to perform an energy compensation when generating the first group of sound field components, the energy compensation depending on the diffuseness data and a number of sound field components in the second group, a number of diffuse components in the first group, or a maximum order of sound field components.
APPARATUS, METHOD AND COMPUTER PROGRAM FOR ENCODING, DECODING, SCENE PROCESSING AND OTHER PROCEDURES RELATED TO DIRAC BASED SPATIAL AUDIO CODING USING DIRECT COMPONENT COMPENSATION
An apparatus for generating a sound field description from an input signal having at least two channels has: an input signal analyzer for obtaining direction data and diffuseness data from the input signal; an estimator for estimating a first energy- or amplitude-related measure for an omnidirectional component derived from the input signal and for estimating a second energy- or amplitude-related measure for a directional component derived from the input signal, and a sound component generator for generating sound field components of the sound field, wherein the sound component generator is configured to perform an energy compensation of the directional component using the first energy- or amplitude-related measure, the second energy- or amplitude-related measure, the direction data and the diffuseness data.
APPARATUS, METHOD AND COMPUTER PROGRAM FOR ENCODING, DECODING, SCENE PROCESSING AND OTHER PROCEDURES RELATED TO DIRAC BASED SPATIAL AUDIO CODING USING LOW-ORDER, MID-ORDER AND HIGH-ORDER COMPONENTS GENERATORS
An apparatus for generating a sound field description using an input signal having a mono-signal or a multi-channel signal comprises: an input signal analyzer for analyzing the input signal to derive direction data and diffuseness data; a low-order components generator for generating a low-order sound field description from the input signal up to a predetermined order and mode; a mid-order components generator for generating a mid-order sound field description above the predetermined order or at the predetermined order and above the predetermined mode and below or at a first truncation order using a synthesis of at least one direct portion and of at least one diffuse portion using the direction data and the diffuseness data; and a high-order components generator for generating a high-order sound field description having a component above the first truncation order using a synthesis of at least one direct portion.
INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND INFORMATION PROCESSING SYSTEM
Provided is an information processing device that processes a dialogue of an audio agent.
The information processing device includes an acquisition unit that acquires audio information of an agent device that is played back through interaction with a user and audio information of other contents different from the audio information of the agent device, and a controller that performs sound field control processing on an audio output signal based on the audio information of the agent device acquired by the acquisition unit. The controller performs wavefront composition of pieces of audio output from a plurality of speakers, controls a sound field of the agent device, and avoids mixing with the other contents different from the audio information of the agent device.