H04S2420/03

Spatial Audio Augmentation and Reproduction
20210385607 · 2021-12-09 ·

An apparatus including circuitry configured for: obtaining at least one spatial audio signal comprising at least one audio signal, wherein the at least one spatial audio signal defines an audio scene forming at least in part media content; rendering an audio scene based on the at least one spatial audio signal; obtaining at least one augmentation audio signal; transforming the at least one augmentation audio signal to at least two audio objects; and augmenting the audio scene based on the at least two audio objects.

Method and apparatus for binaural rendering audio signal using variable order filtering in frequency domain

The present invention relates to a method and an apparatus for binaural rendering of an audio signal using variable order filtering in the frequency domain. To this end, provided are a method for processing an audio signal, including: receiving an input audio signal; receiving a set of truncated subband filter coefficients for filtering each subband signal of the input audio signal, the set of truncated subband filter coefficients being constituted by one or more FFT filter coefficients generated by performing FFT by a predetermined block size; generating at least one subframe for each subband; generating at least one filtered subframe for each subband; performing inverse FFT on the filtered subframe for each subband; and generating a filtered subband signal by overlap-adding the transformed subframe for each subband, as well as an apparatus for processing an audio signal using the same.
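
The FFT, per-block filtering, inverse FFT and overlap-add steps named in this abstract can be illustrated with a minimal sketch. It assumes a single real-valued subband signal and a single block of filter coefficients; the function names, the radix-2 FFT helper and the block size are illustrative choices, not details taken from the patent.

```python
import cmath

def fft(x, inverse=False):
    """Recursive radix-2 FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return list(x)
    even = fft(x[0::2], inverse)
    odd = fft(x[1::2], inverse)
    sign = 1 if inverse else -1
    out = [0j] * n
    for k in range(n // 2):
        tw = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + tw
        out[k + n // 2] = even[k] - tw
    return out

def ifft(x):
    """Inverse FFT with 1/n normalisation."""
    n = len(x)
    return [v / n for v in fft(x, inverse=True)]

def overlap_add_filter(signal, taps, block_size):
    """Filter `signal` with `taps` block by block in the frequency domain."""
    # The FFT size must hold one block plus the filter tail without wrap-around.
    fft_size = 1
    while fft_size < block_size + len(taps) - 1:
        fft_size *= 2
    taps_fd = fft(list(taps) + [0.0] * (fft_size - len(taps)))
    out = [0.0] * (len(signal) + len(taps) - 1)
    for start in range(0, len(signal), block_size):
        sub = list(signal[start:start + block_size])            # one subframe
        sub_fd = fft(sub + [0.0] * (fft_size - len(sub)))       # zero-pad, FFT
        filtered = ifft([a * b for a, b in zip(sub_fd, taps_fd)])
        for i, v in enumerate(filtered):                        # overlap-add
            if start + i < len(out):
                out[start + i] += v.real
    return out

def direct_convolution(signal, taps):
    """Time-domain reference used only to check the block-wise result."""
    out = [0.0] * (len(signal) + len(taps) - 1)
    for i, s in enumerate(signal):
        for j, t in enumerate(taps):
            out[i + j] += s * t
    return out
```

Comparing `overlap_add_filter(sig, taps, 4)` with `direct_convolution(sig, taps)` on a short test signal is a quick way to confirm that the block-wise result matches plain convolution up to rounding error.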

Determination of Targeted Spatial Audio Parameters and Associated Spatial Audio Playback
20210377685 · 2021-12-02 ·

A method for spatial audio signal processing, including determining, for two or more playback audio signals, at least one spatial audio parameter for providing spatial audio reproduction; determining between the two or more playback audio signals at least one audio signal relationship parameter, the at least one audio signal relationship parameter being associated with a determination of inter-channel signal relationship information between the two or more playback audio signals and for at least two frequency bands, such that the two or more playback audio signals are configured to be reproduced based on the at least one spatial audio parameter and the at least one audio signal relationship parameter.

AMBIENT SOUND ADJUSTMENTS DURING CALL HANDLING

Apparatuses, methods and computer programs are described comprising: providing an incoming call indication in response to an incoming call, the incoming call indication including an initial ambient audio signal comprising a combination of first ambient audio and second ambient audio; receiving an ambient audio control command; and adjusting the initial ambient audio signal to generate an adjusted ambient audio signal depending on the ambient audio control command.

METHOD FOR DETERMINING AUDIO CODING/DECODING MODE AND RELATED PRODUCT
20210375292 · 2021-12-02 ·

A non-transitory computer-readable medium is provided. The non-transitory computer-readable medium has computer instructions stored therein which, when executed by one or more processors, cause the one or more processors to perform operations. The operations comprise: determining a channel combination scheme for a current frame, where the determined channel combination scheme for the current frame is one of a plurality of channel combination schemes; and determining a coding mode of the current frame based on a channel combination scheme for a previous frame and the channel combination scheme for the current frame, where the coding mode of the current frame is one of a plurality of coding modes.
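
The decision described here, a coding mode chosen from the previous and current channel combination schemes, can be sketched as a small lookup. The abstract does not name the schemes or modes, so the two scheme names and four mode names below are placeholders chosen for illustration.

```python
from enum import Enum

class Scheme(Enum):
    # Placeholder scheme names; the abstract only says "a plurality of schemes".
    CORRELATED = "correlated"
    ANTICORRELATED = "anticorrelated"

class CodingMode(Enum):
    # Placeholder mode names: two steady-state modes and two switching modes.
    CORRELATED = "correlated"
    ANTICORRELATED = "anticorrelated"
    CORRELATED_TO_ANTICORRELATED = "correlated_to_anticorrelated"
    ANTICORRELATED_TO_CORRELATED = "anticorrelated_to_correlated"

def coding_mode(previous, current):
    """Pick the coding mode of the current frame from both frames' schemes."""
    if previous is current:
        # No scheme change between frames: stay in the matching steady mode.
        if current is Scheme.CORRELATED:
            return CodingMode.CORRELATED
        return CodingMode.ANTICORRELATED
    # Scheme changed between frames: use the matching switching mode.
    if current is Scheme.ANTICORRELATED:
        return CodingMode.CORRELATED_TO_ANTICORRELATED
    return CodingMode.ANTICORRELATED_TO_CORRELATED
```

Keeping the previous frame's scheme as an explicit argument makes the frame-to-frame dependency visible: the same current scheme can map to different coding modes depending on what preceded it.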

METHODS AND DEVICES FOR GENERATING OR DECODING A BITSTREAM COMPRISING IMMERSIVE AUDIO SIGNALS

The present document describes a method (500) for generating a bitstream (101), wherein the bitstream (101) comprises a sequence of superframes (400) for a sequence of frames of an immersive audio signal (111). The method (500) comprises, repeatedly for the sequence of superframes (400), inserting (501) coded audio data (206) for one or more frames of one or more downmix channel signals (203) derived from the immersive audio signal (111), into data fields (411, 421, 412, 422) of a superframe (400); and inserting (502) metadata (202, 205) for reconstructing one or more frames of the immersive audio signal (111) from the coded audio data (206), into a metadata field (403) of the superframe (400).
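
As a rough illustration of a superframe that carries coded audio data in data fields plus one metadata field, the sketch below packs and unpacks a byte layout. The layout itself (16-bit length prefixes, an 8-bit frame count, big-endian order) is an assumption made for the example and is not taken from the document.

```python
import struct

def pack_superframe(metadata, coded_frames):
    """Serialise one superframe.

    Hypothetical layout: [u16 metadata length][metadata][u8 frame count]
    followed by, for each frame, [u16 data length][coded audio data].
    """
    parts = [struct.pack(">H", len(metadata)), metadata,
             struct.pack(">B", len(coded_frames))]
    for data in coded_frames:
        parts.append(struct.pack(">H", len(data)))
        parts.append(data)
    return b"".join(parts)

def unpack_superframe(blob, pos=0):
    """Parse one superframe starting at `pos`; return (metadata, frames, next_pos)."""
    (meta_len,) = struct.unpack_from(">H", blob, pos)
    pos += 2
    metadata = blob[pos:pos + meta_len]
    pos += meta_len
    (count,) = struct.unpack_from(">B", blob, pos)
    pos += 1
    frames = []
    for _ in range(count):
        (size,) = struct.unpack_from(">H", blob, pos)
        pos += 2
        frames.append(blob[pos:pos + size])
        pos += size
    return metadata, frames, pos

def pack_bitstream(superframes):
    """Concatenate superframes, repeating the two insertion steps for each."""
    return b"".join(pack_superframe(m, f) for m, f in superframes)
```

Because `unpack_superframe` returns the position after the superframe it parsed, a decoder can call it repeatedly to walk through the whole sequence.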

Acoustic scene reconstruction device, acoustic scene reconstruction method, and program
11373355 · 2022-06-28 ·

An acoustic scene reconstruction device includes: a sound source localization and separation unit configured to perform sound source localization and sound source separation from a collected sound signal; an identification unit configured to identify a kind of a sound source contained in the sound signal; an analysis processing unit configured to estimate a position of the sound source based on a result obtained through the sound source localization and the sound source separation and a result obtained through the identification, select a separation sound and generate visualization information; and a visualization processing unit configured to generate an image in which an image corresponding to the sound source is displayed at the estimated position of the sound source by using the visualization information and the separation sound, and to generate a sound in which the separation sound is reproduced at the estimated position of the sound source.

EFFICIENT CODING OF AUDIO SCENES COMPRISING AUDIO OBJECTS

There are provided encoding and decoding methods for encoding and decoding of object-based audio. An exemplary encoding method includes inter alia calculating M downmix signals by forming combinations of N audio objects, wherein M≤N, and calculating parameters which allow reconstruction of a set of audio objects formed on the basis of the N audio objects from the M downmix signals. The calculation of the M downmix signals is made according to a criterion which is independent of any loudspeaker configuration.
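
A minimal sketch of the two encoder-side calculations follows, assuming full-band time-domain signals, a caller-supplied object-to-downmix assignment (for example, one derived from object positions rather than from any loudspeaker layout), and a single least-squares gain per object as the reconstruction parameter. All function names are illustrative; a practical codec would work per time/frequency tile.

```python
def downmix(objects, assignment, num_downmix):
    """Form M downmix signals by summing each object into its assigned signal."""
    if num_downmix > len(objects):
        raise ValueError("M must not exceed N")
    length = len(objects[0])
    mix = [[0.0] * length for _ in range(num_downmix)]
    for obj, m in zip(objects, assignment):
        for t, sample in enumerate(obj):
            mix[m][t] += sample
    return mix

def reconstruction_gains(objects, assignment, mix):
    """Least-squares gain g per object so that g * mix[m] approximates it."""
    gains = []
    for obj, m in zip(objects, assignment):
        num = sum(s * d for s, d in zip(obj, mix[m]))
        den = sum(d * d for d in mix[m])
        gains.append(num / den if den > 0 else 0.0)
    return gains

def reconstruct(mix, assignment, gains):
    """Decoder-side approximation of every object from the downmix signals."""
    return [[g * d for d in mix[m]] for m, g in zip(assignment, gains)]
```

With three objects and the assignment `[0, 0, 1]`, the third object has a downmix signal to itself and is recovered exactly, while the first two share one downmix signal and are only approximated.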

DETERMINATION OF THE SIGNIFICANCE OF SPATIAL AUDIO PARAMETERS AND ASSOCIATED ENCODING
20220189494 · 2022-06-16 ·

There is inter alia disclosed an apparatus for spatial audio encoding which can receive or determine, for one or more audio signals (102), spatial audio parameters (106) on a sub band basis for providing spatial audio reproduction; the spatial audio parameters can comprise a coherence value (112) for each sub band of a plurality of sub bands (202) of a frame. The apparatus then determines a significance measure for the coherence values (401) of the plurality of sub bands of the frame and uses the significance measure to determine whether to encode (403) the coherence values of the plurality of sub bands of the frame.
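
The encode-or-skip decision can be sketched as follows. The abstract does not define the significance measure, so this example assumes an energy-weighted mean of the per-sub-band coherence values compared against a fixed threshold; the function names, the weighting and the threshold value are all assumptions.

```python
def coherence_significance(coherences, energies):
    """Energy-weighted mean coherence over the sub bands of one frame.

    This weighting is an assumed measure, not the one defined in the patent.
    """
    total = sum(energies)
    if total == 0:
        return 0.0
    return sum(c * e for c, e in zip(coherences, energies)) / total

def should_encode_coherence(coherences, energies, threshold=0.1):
    """Encode the frame's coherence values only if they are significant."""
    return coherence_significance(coherences, energies) >= threshold
```

The point of the gate is bit-rate saving: when the coherence values of a frame contribute little, the encoder can leave them out for that frame.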

Metadata-preserved audio object clustering

Example embodiments disclosed herein relate to audio object clustering. A method for metadata-preserved audio object clustering is disclosed. The method comprises classifying a plurality of audio objects into a number of categories based on information to be preserved in metadata associated with the plurality of audio objects. The method further comprises assigning a predetermined number of clusters to the categories and allocating an audio object in each of the categories to at least one of the clusters according to the assigning. A corresponding system and computer program product are also disclosed.
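
The three steps (classify, assign a cluster budget, allocate) can be sketched as below. The metadata key `zone`, the proportional budget rule and the round-robin allocation are placeholders chosen for illustration; the abstract does not specify how clusters are assigned to categories or how objects are allocated within a category.

```python
from collections import defaultdict

def classify(objects, key):
    """Group objects by the metadata value that must be preserved."""
    categories = defaultdict(list)
    for obj in objects:
        categories[obj[key]].append(obj)
    return dict(categories)

def assign_cluster_budget(categories, total_clusters):
    """Give every category one cluster, then hand out the rest by object load."""
    if total_clusters < len(categories):
        raise ValueError("need at least one cluster per category")
    budget = {name: 1 for name in categories}
    for _ in range(total_clusters - len(categories)):
        # The category with the most objects per cluster gets the next cluster.
        name = max(categories, key=lambda n: len(categories[n]) / budget[n])
        budget[name] += 1
    return budget

def allocate(categories, budget):
    """Round-robin objects over their category's clusters (placeholder rule)."""
    allocation = {}
    for name, members in categories.items():
        for i, obj in enumerate(members):
            allocation[obj["name"]] = (name, i % budget[name])
    return allocation
```

Because clusters never span categories in this sketch, every object in a cluster shares the same value for the preserved metadata field, which is the property the method is after.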