H04S5/02

System and method for an audio reproduction device

A system and method for enhancing audio reproduced by an audio reproduction device with a first channel and a second channel are described. For each channel, x samples of the audio signal are received and stored in one portion of an input buffer with 2x positions, and the remaining x positions are padded with zeros. The contents of the input buffer are transformed into frequency domain (FD) components. The FD components are multiplied by a first filter coefficient to generate FD components with a short echo effect, and by a second filter coefficient to generate FD components with a long echo effect. These are then converted to time domain (TD) components with the short echo effect and TD components with the long echo effect. Selected TD components with the short echo effect and the long echo effect are combined to generate a convolved first channel output and a convolved second channel output.
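
The buffer handling in this abstract resembles zero-padded FFT convolution, which can be sketched as follows. This is a minimal illustration rather than the patented method: the function names, the power-of-two block size, the example filter taps, and the rule of simply summing the short-echo and long-echo outputs are all assumptions not stated in the abstract.

```python
import cmath

def fft(values, inverse=False):
    """Recursive radix-2 FFT; len(values) must be a power of two."""
    n = len(values)
    if n == 1:
        return list(values)
    even = fft(values[0::2], inverse)
    odd = fft(values[1::2], inverse)
    sign = 1 if inverse else -1
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out

def ifft(values):
    """Inverse FFT, scaled by 1/n."""
    n = len(values)
    return [v / n for v in fft(values, inverse=True)]

def convolve_block(samples, taps):
    """Convolve x samples with at most x filter taps using a 2x-point FFT."""
    x = len(samples)
    # Input buffer with 2x positions: x samples, then x zeros.
    buf = list(samples) + [0.0] * x
    # Zero-pad the filter to 2x as well so the FD product has matching length.
    padded_taps = list(taps) + [0.0] * (2 * x - len(taps))
    fd_signal = fft(buf)
    fd_filter = fft(padded_taps)
    td = ifft([a * b for a, b in zip(fd_signal, fd_filter)])
    return [v.real for v in td]

def enhance_channel(samples, short_taps, long_taps):
    """Assumed combination rule: sum the short-echo and long-echo TD outputs."""
    short_td = convolve_block(samples, short_taps)
    long_td = convolve_block(samples, long_taps)
    return [s + l for s, l in zip(short_td, long_td)]

# Hypothetical filters: an early reflection and a later, quieter one.
left_out = enhance_channel([1.0, 0.0, 0.0, 0.0],
                           [1.0, 0.5, 0.0, 0.0],
                           [0.0, 0.0, 0.0, 0.25])
```

Padding x samples to 2x positions leaves room for the full linear convolution (at most 2x - 1 samples), so the circular convolution computed by the FFT does not wrap around. The same call would be made once per channel.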

Audio processing apparatus and method, and program
11540080 · 2022-12-27

The present technology relates to an audio processing apparatus and method and a program that make it possible to obtain sound of higher quality. An acquisition unit acquires an audio signal and metadata of an object. A vector calculation unit calculates, based on a horizontal direction angle and a vertical direction angle that are included in the metadata of the object and indicate the extent of a sound image, a spread vector indicating a position in the region that represents the extent of the sound image. A gain calculation unit calculates, based on the spread vector, a gain of the audio signal for each speaker by vector base amplitude panning (VBAP). The present technology can be applied to an audio processing apparatus.
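
The spread-vector idea can be illustrated with a small sketch. It assumes a single fixed speaker triplet, five spread vectors (the center plus the four edge midpoints of the extent region), clipping of negative gains, and power normalization of the summed gains. None of these choices come from the abstract, and all function names are hypothetical.

```python
import math

def direction(azimuth_deg, elevation_deg):
    """Unit vector for the given azimuth and elevation in degrees."""
    az, el = math.radians(azimuth_deg), math.radians(elevation_deg)
    return (math.cos(el) * math.cos(az), math.cos(el) * math.sin(az), math.sin(el))

def det3(a, b, c):
    """Determinant of the 3x3 matrix whose rows are a, b, c."""
    return (a[0] * (b[1] * c[2] - b[2] * c[1])
            - a[1] * (b[0] * c[2] - b[2] * c[0])
            + a[2] * (b[0] * c[1] - b[1] * c[0]))

def vbap_gains(p, speakers):
    """Solve p = g1*l1 + g2*l2 + g3*l3 for the gains by Cramer's rule."""
    l1, l2, l3 = speakers
    d = det3(l1, l2, l3)
    return [det3(p, l2, l3) / d, det3(l1, p, l3) / d, det3(l1, l2, p) / d]

def spread_vectors(az, el, h_extent, v_extent):
    """Assumed layout: center plus left, right, lower, and upper edge midpoints."""
    half_h, half_v = h_extent / 2.0, v_extent / 2.0
    return [direction(az, el),
            direction(az - half_h, el), direction(az + half_h, el),
            direction(az, el - half_v), direction(az, el + half_v)]

def spread_gains(az, el, h_extent, v_extent, speakers):
    """Sum the per-vector VBAP gains, then normalize to unit power."""
    total = [0.0, 0.0, 0.0]
    for vec in spread_vectors(az, el, h_extent, v_extent):
        gains = vbap_gains(vec, speakers)
        # Simplification: clip negative gains instead of choosing another triplet.
        total = [t + max(g, 0.0) for t, g in zip(total, gains)]
    norm = math.sqrt(sum(t * t for t in total))
    return [t / norm for t in total]

# Hypothetical layout: front-left, front-right, and an elevated center speaker.
speakers = [direction(30, 0), direction(-30, 0), direction(0, 30)]
gains = spread_gains(0, 10, 10, 10, speakers)
```

A full renderer would select, for each spread vector, the speaker triplet that encloses it; that mesh search is omitted here.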

AUDIO PROCESSING APPARATUS AND METHOD, AND PROGRAM
20230078121 · 2023-03-16

The present technology relates to an audio processing apparatus and method and a program that make it possible to obtain sound of higher quality. An acquisition unit acquires an audio signal and metadata of an object. A vector calculation unit calculates, based on a horizontal direction angle and a vertical direction angle that are included in the metadata of the object and indicate the extent of a sound image, a spread vector indicating a position in the region that represents the extent of the sound image. A gain calculation unit calculates, based on the spread vector, a gain of the audio signal for each speaker by vector base amplitude panning (VBAP). The present technology can be applied to an audio processing apparatus.

Transmission device, transmission method, reception device, and reception method
11636862 · 2023-04-25

A processing load at a receiving side is reduced in a case where a plurality of classes of audio data is transmitted. A predetermined number of audio streams including coded data of a plurality of groups is generated and a container of a predetermined format having this predetermined number of audio streams is transmitted. Command information for creating a command specifying a group to be decoded from among the plurality of groups is inserted into the container and/or the audio stream. For example, a command insertion area for the receiving side to insert a command for specifying a group to be decoded is provided in at least one audio stream among the predetermined number of audio streams.

Decorrelator structure for parametric reconstruction of audio signals

An encoding system encodes multiple audio signals (X) as a downmix signal (Y) together with wet and dry upmix coefficients (P, C). In a decoding system, a pre-multiplier (101) computes an intermediate signal (W) by mapping the downmix signal linearly in accordance with a first set of coefficients (Q); a decorrelating section (102) outputs a decorrelated signal (Z) based on the intermediate signal; a wet upmix section (103) computes a wet upmix signal by mapping the decorrelated signal linearly in accordance with the wet upmix coefficients; a dry upmix section (104) computes a dry upmix signal by mapping the downmix signal linearly in accordance with the dry upmix coefficients; a combining section (105) provides a multidimensional reconstructed signal (X) by combining the wet and dry upmix signals; and a converter (106) computes the first set of coefficients based on the wet and dry upmix coefficients and supplies this to the pre-multiplier.
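
The decoder signal flow described above can be summarized in equation form. The notation is partly assumed: $\hat{X}$ is used here for the reconstructed signal to distinguish it from the encoder input $X$, $D(\cdot)$ stands for the decorrelating section, and $f$ stands for the converter's mapping. The abstract does not specify the form of $D$ or $f$.

```latex
\begin{aligned}
W &= Q\,Y            && \text{pre-multiplier (101)} \\
Z &= D(W)            && \text{decorrelating section (102)} \\
\hat{X} &= C\,Y + P\,Z && \text{dry (104) and wet (103) upmix, combined (105)} \\
Q &= f(P, C)         && \text{converter (106)}
\end{aligned}
```

Because $Q$ is derived from $P$ and $C$, the first set of coefficients does not need to be transmitted separately alongside the wet and dry upmix coefficients.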

Binaural rendering for headphones using metadata processing

Embodiments are described for a method of rendering audio for playback through headphones comprising receiving digital audio content, receiving binaural rendering metadata generated by an authoring tool processing the received digital audio content, receiving playback metadata generated by a playback device, and combining the binaural rendering metadata and playback metadata to optimize playback of the digital audio content through the headphones.

ELECTRONIC DEVICE FOR CONVERTING NUMBER OF AUDIO CHANNELS, AND ELECTRONIC DEVICE OPERATING METHOD

An electronic device includes: an Application Processor (AP) including a CPU; and an audio processing unit configured to: receive an audio stream of M channels from the CPU, convert the M-channel audio stream into an N-channel audio stream, where N differs from M, and output the N-channel audio stream through an output device. The audio processing unit is further configured to: detect a connection with an external electronic device through an audio output interface; determine, based on information received from the CPU, whether to stop converting the M-channel audio stream into the N-channel audio stream; and, in response to determining that the conversion is to be stopped, transmit the M-channel audio stream to the external electronic device via the audio output interface.
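
The conversion and bypass decision can be sketched as below. This is an illustrative sketch only: the mixing-matrix representation, the function names, and the two boolean inputs standing in for the connection detection and the CPU-supplied information are assumptions, not details from the abstract.

```python
def convert_channels(frame, matrix):
    """Map one M-channel frame to N channels using an N x M mixing matrix."""
    return [sum(coeff * sample for coeff, sample in zip(row, frame))
            for row in matrix]

def route_frame(frame, matrix, external_connected, cpu_requests_passthrough):
    """Return (destination, samples) for one frame.

    When an external device is connected and the CPU-supplied information
    says to stop converting, the original M channels are passed through.
    """
    if external_connected and cpu_requests_passthrough:
        return "external", list(frame)
    return "internal", convert_channels(frame, matrix)

# Hypothetical stereo-to-mono matrix (M = 2, N = 1).
stereo_to_mono = [[0.5, 0.5]]
internal = route_frame([0.2, 0.4], stereo_to_mono, False, False)
external = route_frame([0.2, 0.4], stereo_to_mono, True, True)
```

In the first call the frame is converted to a single channel for the built-in output; in the second the two original channels are forwarded unchanged.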

Methods and systems for generating and interactively rendering object based audio

Methods are described for generating an object based audio program that is renderable in a personalizable manner and includes a bed of speaker channels renderable in the absence of selection of other program content (e.g., to provide a default full range audio experience). Other embodiments include steps of delivering, decoding, and/or rendering such a program. Rendering of content of the bed, or of a selected mix of other content of the program, may provide an immersive experience. The program may include multiple object channels (e.g., object channels indicative of user-selectable and user-configurable objects), the bed of speaker channels, and other speaker channels. Another aspect is an audio processing unit (e.g., encoder or decoder) configured to perform any embodiment of the method, or which includes a buffer memory that stores at least one frame (or other segment) of an object based audio program (or bitstream thereof) generated in accordance with any embodiment of the method.
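
The relationship between the bed and the selectable objects can be illustrated with a small sketch. The data layout, the per-speaker object gains, and the function name are hypothetical; the abstract does not describe how objects are panned or mixed.

```python
def render_program(bed, objects, selected=()):
    """Mix the speaker-channel bed with any user-selected object channels.

    bed:      dict mapping speaker name to a list of samples
    objects:  dict mapping object name to {"samples": [...], "gains": {speaker: gain}}
    selected: names of the objects the user chose (empty means bed only)
    """
    out = {speaker: list(samples) for speaker, samples in bed.items()}
    for name in selected:
        obj = objects[name]
        for speaker, gain in obj["gains"].items():
            out[speaker] = [o + gain * s
                            for o, s in zip(out[speaker], obj["samples"])]
    return out

# Hypothetical program: a stereo bed and one selectable commentary object.
bed = {"L": [0.1, 0.2], "R": [0.1, 0.2]}
objects = {"commentary": {"samples": [1.0, 1.0], "gains": {"L": 0.5, "R": 0.5}}}

default_mix = render_program(bed, objects)
personalized = render_program(bed, objects, selected=["commentary"])
```

With no selection the output equals the bed, which corresponds to the default experience the abstract mentions; selecting an object adds it on top of the bed.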