H04S5/02

Method for and apparatus for decoding an ambisonics audio soundfield representation for audio playback using 2D setups

Sound scenes in 3D can be synthesized or captured as a natural sound field. For decoding, a decode matrix is required that is specific to a given loudspeaker setup and is generated using the known loudspeaker positions. However, some source directions are attenuated for 2D loudspeaker setups such as 5.1 surround. An improved method for decoding an encoded audio signal in soundfield format for L loudspeakers at known positions comprises steps of adding (10) a position of at least one virtual loudspeaker to the positions of the L loudspeakers, generating (11) a 3D decode matrix (D′), wherein the positions (Formula I) of the L loudspeakers and the at least one virtual position (Formula II) are used, downmixing (12) the 3D decode matrix (D′), and decoding (14) the encoded audio signal (i14) using the downscaled 3D decode matrix (Formula III). As a result, a plurality of decoded loudspeaker signals (q14) is obtained.
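
The downmixing (12) and decoding (14) steps can be sketched as follows. This is a minimal illustration under stated assumptions, not the claimed implementation: the function names `downmix_decode_matrix` and `decode` are hypothetical, the weighting factor g = 1/sqrt(L) is an assumption, and the 3D decode matrix D′ is taken as given rather than generated from loudspeaker positions.

```python
import math


def downmix_decode_matrix(d_prime, num_real):
    """Fold the rows of the virtual loudspeakers into the rows of the real ones.

    d_prime:  3D decode matrix as a list of rows, one row per loudspeaker.
              The first num_real rows belong to real loudspeakers, the
              remaining rows to virtual loudspeakers (assumed ordering).
    num_real: number L of real loudspeakers.
    """
    g = 1.0 / math.sqrt(num_real)  # assumed weighting factor
    real_rows = d_prime[:num_real]
    virtual_rows = d_prime[num_real:]
    num_coeffs = len(d_prime[0])
    # Per-coefficient sum over all virtual loudspeaker rows.
    virtual_sum = [sum(row[n] for row in virtual_rows) for n in range(num_coeffs)]
    # Distribute the weighted virtual contribution to every real loudspeaker.
    return [[row[n] + g * virtual_sum[n] for n in range(num_coeffs)]
            for row in real_rows]


def decode(decode_matrix, coeffs):
    """Multiply the decode matrix by one frame of soundfield coefficients."""
    return [sum(d * c for d, c in zip(row, coeffs)) for row in decode_matrix]
```

With four real loudspeakers and one virtual loudspeaker, `downmix_decode_matrix(d_prime, 4)` returns a matrix with four rows, so `decode` yields one signal per real loudspeaker.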

TRANSMISSION DEVICE, TRANSMISSION METHOD, RECEPTION DEVICE, AND RECEPTION METHOD
20170263259 · 2017-09-14

The present technology reduces the processing load on the reception side when a plurality of types of audio data is transmitted. A metafile is transmitted that carries meta information used by a reception device to acquire a predetermined number of audio streams including a plurality of groups of encoded data. Attribute information indicating each attribute of the encoded data of the plurality of groups is inserted into the metafile. For example, stream correspondence relation information indicating which audio stream includes the encoded data of each group is also inserted into the metafile.
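
How a receiver might use such a metafile can be sketched as below. This is an illustration only: the dictionary layout, the field names `attribute` and `stream_id`, the group numbers, and the function `streams_for_groups` are all hypothetical and do not reflect the metafile format used in the application.

```python
# Hypothetical metafile: one entry per group of encoded data, carrying
# attribute information and the stream that contains the group.
METAFILE = {
    "groups": {
        1: {"attribute": "channel", "stream_id": 1},
        2: {"attribute": "object", "stream_id": 2},
        3: {"attribute": "object", "stream_id": 2},
        4: {"attribute": "object", "stream_id": 3},
    }
}


def streams_for_groups(metafile, wanted_groups):
    """Return the sorted stream ids that carry the wanted groups."""
    groups = metafile["groups"]
    return sorted({groups[g]["stream_id"] for g in wanted_groups})


def groups_with_attribute(metafile, attribute):
    """Return the sorted group ids whose attribute matches."""
    return sorted(g for g, info in metafile["groups"].items()
                  if info["attribute"] == attribute)
```

Because the correspondence is available before any stream is fetched, the receiver in this sketch can skip streams that hold none of the groups it needs.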

Speaker Device
20170265004 · 2017-09-14

Provided is a speaker device including a cabinet whose direction is changed in accordance with an installation state, and capable of detecting the direction of the device based on received optical signals. The speaker device can be installed in a state in which the speaker device is placed on a rack such that a receiving unit faces a listening position, and a state in which the speaker device is hung on a wall such that a receiving unit faces the listening position. Voltage converting units are configured to convert photocurrents output by light receiving elements according to received light amounts of an infrared ray from an infrared remote controller to voltage signals to output the signals to a controller. The controller compares amplitudes of the voltage signals to each other, thereby detecting a state (direction) of the speaker device.
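
The amplitude comparison performed by the controller can be sketched as follows. This is a minimal sketch, assuming two light receiving elements, peak-to-peak amplitude as the measure, and a hypothetical mapping in which the first element receives more light in the rack placement; the function names are illustrative.

```python
def peak_to_peak(samples):
    """Peak-to-peak amplitude of a sampled voltage signal."""
    return max(samples) - min(samples)


def detect_orientation(voltage_a, voltage_b):
    """Guess the installation state from two sampled voltage signals.

    voltage_a, voltage_b: voltage samples converted from the photocurrents
    of two light receiving elements. The mapping from the stronger signal
    to "rack" or "wall" is an assumption made for illustration.
    """
    if peak_to_peak(voltage_a) >= peak_to_peak(voltage_b):
        return "rack"
    return "wall"
```

For example, `detect_orientation([0.0, 1.2, 0.1], [0.0, 0.3, 0.0])` reports the rack placement, because the first element sees the larger infrared amplitude.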

AUDIO PROCESSING APPARATUS AND METHOD, AND PROGRAM
20210409892 · 2021-12-30

The present technology relates to an audio processing apparatus and method and a program that make it possible to obtain sound of higher quality. An acquisition unit acquires an audio signal and metadata of an object. A vector calculation unit calculates, based on a horizontal direction angle and a vertical direction angle included in the metadata of the object and indicative of an extent of a sound image, a spread vector indicative of a position in a region indicative of the extent of the sound image. A gain calculation unit calculates, based on the spread vector, a VBAP gain of the audio signal in regard to each speaker by VBAP. The present technology can be applied to an audio processing apparatus.
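
The idea of computing VBAP gains from spread vectors can be illustrated with a simplified sketch. Several assumptions apply: the example is 2D (horizontal angles only, speakers on a circle), the spread vectors are spaced evenly across the horizontal extent, and the per-speaker gains of all spread vectors are summed and then normalized to unit power. The function names are hypothetical, and the abstract does not specify these details.

```python
import math


def unit(azimuth_deg):
    """Unit vector for an azimuth in degrees."""
    a = math.radians(azimuth_deg)
    return (math.cos(a), math.sin(a))


def pair_gains(p, l1, l2):
    """Solve p = g1*l1 + g2*l2 with Cramer's rule; None if l1, l2 are collinear."""
    det = l1[0] * l2[1] - l1[1] * l2[0]
    if abs(det) < 1e-12:
        return None
    g1 = (p[0] * l2[1] - p[1] * l2[0]) / det
    g2 = (l1[0] * p[1] - l1[1] * p[0]) / det
    return g1, g2


def vbap_gains(azimuth_deg, speaker_azimuths):
    """2D VBAP: find the adjacent speaker pair with non-negative gains."""
    p = unit(azimuth_deg)
    gains = [0.0] * len(speaker_azimuths)
    order = sorted(range(len(speaker_azimuths)), key=lambda i: speaker_azimuths[i])
    for k in range(len(order)):
        i, j = order[k], order[(k + 1) % len(order)]
        solved = pair_gains(p, unit(speaker_azimuths[i]), unit(speaker_azimuths[j]))
        if solved and solved[0] >= -1e-9 and solved[1] >= -1e-9:
            gains[i], gains[j] = max(solved[0], 0.0), max(solved[1], 0.0)
            break
    return gains


def spread_vbap_gains(azimuth_deg, extent_deg, speaker_azimuths, num_vectors=5):
    """Sum VBAP gains over spread vectors covering the extent, then normalize."""
    total = [0.0] * len(speaker_azimuths)
    for k in range(num_vectors):
        # Evenly spaced offsets from -extent/2 to +extent/2 (assumed layout).
        offset = extent_deg * (k / (num_vectors - 1) - 0.5) if num_vectors > 1 else 0.0
        for i, g in enumerate(vbap_gains(azimuth_deg + offset, speaker_azimuths)):
            total[i] += g
    norm = math.sqrt(sum(g * g for g in total))
    return [g / norm for g in total] if norm > 0 else total
```

In this sketch a zero extent reduces to ordinary pairwise panning, while a wide extent drives more speakers, which is the intended effect of widening the sound image.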

Information processing apparatus, information processing method, and program
11743676 · 2023-08-29

Provided is an information processing apparatus including: a speaker array that includes a plurality of speakers, and performs wavefront synthesis by using an output of the plurality of speakers; and a presentation unit that presents visual information indicating a state of waves on a wavefront formed in the wavefront synthesis, or presents visual information based on positional information of a virtual sound image that has been formed in a position that is different from a vicinity of the speaker array in the wavefront synthesis.

Method for audio reproduction in a multi-channel sound system
11722831 · 2023-08-08

The invention relates to a method for reproducing audio in a multi-channel sound system including two input signals (L and R), wherein output signals are generated for different sound perception levels. In order to develop said method in such a way that audio can be reproduced within a larger range of applications in a multi-channel sound system, according to the invention, only a lower sound perception level (7) and a higher sound perception level (6) are generated, and a maximum of six output signals are generated, a maximum of two output signals being allocated to the lower sound perception level (7) and a maximum of four output signals being allocated to the higher sound perception level (6).

Selectable linear predictive or transform coding modes with advanced stereo coding

Methods and systems for advanced stereo processing of an audio signal are disclosed. The methods and systems include selecting a coding mode of either transform coding or linear predictive coding and performing advanced stereo processing when in the selected coding mode. Both encoding and decoding operations are provided.

Methods and systems for rendering object based audio

Methods for generating an object based audio program, renderable in a personalizable manner, and including a bed of speaker channels renderable in the absence of selection of other program content (e.g., to provide a default full range audio experience). Other embodiments include steps of delivering, decoding, and/or rendering such a program. Rendering of content of the bed, or of a selected mix of other content of the program, may provide an immersive experience. The program may include multiple object channels (e.g., object channels indicative of user-selectable and user-configurable objects), the bed of speaker channels, and other speaker channels. Another aspect is an audio processing unit (e.g., encoder or decoder) configured to perform, or which includes a buffer memory which stores at least one frame (or other segment) of an object based audio program (or bitstream thereof) generated in accordance with, any embodiment of the method.