H04S3/008

APPARATUS AND METHOD FOR SCREEN RELATED AUDIO OBJECT REMAPPING

An apparatus for generating loudspeaker signals includes an object metadata processor configured to receive metadata, to calculate a second position of the audio object depending on the first position of the audio object and on a size of a screen if the audio object is indicated in the metadata as being screen-related, to feed the first position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being not screen-related, and to feed the second position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being screen-related. The apparatus further includes an object renderer configured to receive an audio object and to generate the loudspeaker signals depending on the audio object and on position information.

SYSTEM AND METHOD FOR HEADPHONE EQUALIZATION AND ROOM ADJUSTMENT FOR BINAURAL PLAYBACK IN AUGMENTED REALITY
20230164509 · 2023-05-25 ·

A system is provided. The system includes an analyzer for determining a plurality of binaural room impulse responses, and a loudspeaker signal generator for generating at least two loudspeaker signals depending on the plurality of binaural room impulse responses and depending on the audio source signal of at least one audio source. The analyzer is configured to determine the plurality of the binaural room impulse responses such that each of the plurality of the binaural room impulse responses considers an effect that results from a headphone being worn by a user.

Audio encoder, audio decoder, methods and computer program using jointly encoded residual signals

An audio decoder for providing at least four audio channel signals on the basis of an encoded representation is configured to provide a first residual signal and a second residual signal on the basis of a jointly encoded representation of the first residual signal and of the second residual signal using a multi-channel decoding. The audio decoder is configured to provide a first audio channel signal and a second audio channel signal on the basis of a first downmix signal and the first residual signal using a residual-signal-assisted multi-channel decoding. The audio decoder is configured to provide a third audio channel signal and a fourth audio channel signal on the basis of a second downmix signal and the second residual signal using a residual-signal-assisted multi-channel decoding. An audio encoder is based on corresponding considerations.

Method for generating and outputting an acoustic multichannel signal
11659346 · 2023-05-23 · ·

Method for generating and outputting an acoustic multichannel signal, comprising the steps of: supplying a stereo signal (S), splitting the supplied stereo signal (S) into a plurality of perception-direction-dependent acoustic signal components (S.1-S.5), generating an acoustic multichannel signal by mixing each perception-direction-dependent acoustic signal component (S.1-S.5) onto an output channel (4.1-4.12) of an acoustic output apparatus (4) that comprises a plurality of, in particular more than two, acoustic output channels (4.1-4.12), outputting the generated multichannel signal over respective acoustic output channels (4.1-4.12) of the acoustic output apparatus (4).

DEEP ENCODER FOR PERFORMING AUDIO PROCESSING

Embodiments are disclosed for determining an answer to a query associated with a graphical representation of data. In particular, in one or more embodiments, the disclosed systems and methods comprise receiving an input including an unprocessed audio sequence and a request to perform an audio signal processing effect on the unprocessed audio sequence. The one or more embodiments further include analyzing, by a deep encoder, the unprocessed audio sequence to determine parameters for processing the unprocessed audio sequence. The one or more embodiments further include sending the unprocessed audio sequence and the parameters to one or more audio signal processing effects plugins to perform the requested audio signal processing effect using the parameters and outputting a processed audio sequence after processing of the unprocessed audio sequence using the parameters of the one or more audio signal processing effects plugins.

Processing object-based audio signals

An audio processing system and method which calculates, based on spatial metadata of the audio object, a panning coefficient for each of the audio objects in relation to each of a plurality of predefined channel coverage zones. Converts the audio signal into submixes in relation to the predefined channel coverage zones based on the calculated panning coefficients and the audio objects. Each of the submixes indicating a sum of components of the plurality of the audio objects in relation to one of the predefined channel coverage zones. Generating a submix gain by applying an audio processing to each of the submix and controls an object gain applied to each of the audio objects. The object gain being as a function of the panning coefficients for each of the audio objects and the submix gains in relation to each of the predefined channel coverage zones.

6DOF Rendering of Microphone-Array Captured Audio For Locations Outside The Microphone-Arrays

An apparatus for generating a spatialized audio output based on a listener position, the apparatus including circuitry configured to: obtain two or more audio signal sets; obtain a listener position within an audio environment, wherein the audio environment includes one or more area having one or more inside and outside regions in relation to the respective audio signal set positions; obtain metadata based on a processing of the at least two audio signals; determine, for the listener position within an audio environment outside the inside region, a second listener position; determine modified metadata for the second listener position based on the metadata; determine at least two modified audio signals for the second listener position based on the at least two audio signals; determine spatial metadata for the listener position; and output the at least two modified audio signals and the spatial metadata.

Audio signal processor, system and methods distributing an ambient signal to a plurality of ambient signal channels

An audio signal processor for providing ambient signal channels on the basis of an input audio signal, is configured to extract an ambient signal on the basis of the input audio signal. The signal processor is configured to distribute the ambient signal to a plurality of ambient signal channels in dependence on positions or directions of sound sources within the input audio signal, wherein a number of ambient signal channels is larger than a number of channels of the input audio signal.

Spatial audio parameters and associated spatial audio playback

An apparatus including at least one processor and at least one memory including a computer program code, the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus at least to: determine, for two or more microphone audio signals, at least one spatial audio parameter for providing spatial audio reproduction; determine at least one coherence parameter associated with a sound field based on the two or more microphone audio signals, such that another sound field is configured to be reproduced based on the at least one spatial audio parameter and the at least one coherence parameter.

Layered coding for compressed sound or sound field representations
11626119 · 2023-04-11 · ·

The present document relates to a method of layered encoding of a compressed sound representation of a sound or sound field. The compressed sound representation comprises a basic compressed sound representation comprising a plurality of components, basic side information for decoding the basic compressed sound representation to a basic reconstructed sound representation of the sound or sound field, and enhancement side information including parameters for improving the basic reconstructed sound representation. The method comprises sub-dividing the plurality of components into a plurality of groups of components and assigning each of the plurality of groups to a respective one of a plurality of hierarchical layers, the number of groups corresponding to the number of layers, and the plurality of layers including a base layer and one or more hierarchical enhancement layers, adding the basic side information to the base layer, and determining a plurality of portions of enhancement side information from the enhancement side information and assigning each of the plurality of portions of enhancement side information to a respective one of the plurality of layers, wherein each portion of enhancement side information includes parameters for improving a reconstructed sound representation obtainable from data included in the respective layer and any layers lower than the respective layer. The document further relates to a method of decoding a compressed sound representation of a sound or sound field, wherein the compressed sound representation is encoded in a plurality of hierarchical layers that include a base layer and one or more hierarchical enhancement layers, as well as to an encoder and a decoder for layered coding of a compressed sound representation.