H04S3/00

AUDIO PROVIDING APPARATUS AND AUDIO PROVIDING METHOD

An audio providing apparatus and method are provided. The audio providing apparatus includes: an object renderer configured to render an object audio signal based on geometric information regarding the object audio signal; a channel renderer configured to render an audio signal having a first channel number into an audio signal having a second channel number; and a mixer configured to mix the rendered object audio signal with the audio signal having the second channel number.

Spatial audio processing

An apparatus comprising at least one processor and at least one memory, the memory comprising machine-readable instructions, that when executed cause the apparatus to: store in a non-volatile memory multiple sets of predetermined spatial audio processing parameters for differently moving sound sources; provide in a man machine interface an option for a user to select one of the stored multiple sets of predetermined spatial audio processing parameters for differently moving sound sources; and in response to the user selecting one of the stored multiple sets of predetermined spatial audio processing parameters for differently moving sound sources, the apparatus is further caused to use the selected one of the stored multiple sets of predetermined spatial audio processing parameters to spatially process audio from one or more sound sources.

METHOD FOR ENCODING MULTI-CHANNEL AUDIO SIGNAL AND ENCODING DEVICE FOR PERFORMING ENCODING METHOD, AND METHOD FOR DECODING MULTI-CHANNEL AUDIO SIGNAL AND DECODING DEVICE FOR PERFORMING DECODING METHOD

An encoding method for a multi-channel audio signal, an encoding apparatus for performing the encoding method, and a decoding method for a multi-channel audio signal and a decoding apparatus for performing the decoding method are disclosed. A method and apparatus of bypassing an MPEG Surround (MPS) standard operation and using an arbitrary tree when a number of audio signals of N channels exceeds a channel number defined in an MPS standard, is disclosed.

Audio Signal Processing Apparatuses and Methods
20180012607 · 2018-01-11 ·

Audio signal processing apparatuses and methods are provided, such as an audio signal downmixing apparatus for processing an input audio signal into an output audio signal, wherein the input audio signal comprises a plurality of input channels recorded at a plurality of spatial positions and the output audio signal comprises a plurality of primary output channels. The audio signal downmixing apparatus comprises a downmix matrix determiner configured to determine for each frequency bin j of a plurality of frequency bins a downmix matrix D.sub.U with j being an integer in the range from 1 to N, and a processor configured to process the input audio signal using the downmix matrix D.sub.U into the output audio signal.

AUDIO METADATA PROVIDING APPARATUS AND METHOD, AND MULTICHANNEL AUDIO DATA PLAYBACK APPARATUS AND METHOD TO SUPPORT DYNAMIC FORMAT CONVERSION

An audio metadata providing apparatus and method and a multichannel audio data playback apparatus and method to support a dynamic format conversion are provided. Dynamic format conversion information may include information about a plurality of format conversion schemes that are used to convert a first format set by an author of multichannel audio data into a second format that is based on a playback environment of the multichannel audio data and that are each set for corresponding playback periods of the multichannel audio data. The audio metadata providing apparatus may provide audio metadata including the dynamic format conversion information. The multichannel audio data playback apparatus may identify the dynamic format conversion information from the audio metadata, may convert the first format of the multichannel audio data into the second format based on the identified dynamic format conversion information, and may play back the multichannel audio data in the second format.

Method and apparatus for space of interest of audio scene
11710491 · 2023-07-25 · ·

Aspects of the disclosure include methods, apparatuses, and non-transitory computer-readable storage mediums for decoding audio data of an audio scene. One apparatus includes processing circuitry that receives first audio source data and second audio source data. The first audio source data corresponds to a space of interest in the audio scene and the second audio source data does not correspond to the space of interest in the audio scene. The space of interest in the audio scene is represented by at least one of a listener space, an audio channel, or an audio object. The processing circuitry decodes the first audio source data based on the space of interest.

METHODS AND SYSTEMS FOR GENERATING AND RENDERING OBJECT BASED AUDIO WITH CONDITIONAL RENDERING METADATA

Methods and audio processing units for generating an object based audio program including conditional rendering metadata corresponding to at least one object channel of the program, where the conditional rendering metadata is indicative of at least one rendering constraint, based on playback speaker array configuration, which applies to each corresponding object channel, and methods for rendering audio content determined by such a program, including by rendering content of at least one audio channel of the program in a manner compliant with each applicable rendering constraint in response to at least some of the conditional rendering metadata. Rendering of a selected mix of content of the program may provide an immersive experience.

NON-TRANSITORY COMPUTER-READABLE MEDIUM HAVING COMPUTER-READABLE INSTRUCTIONS AND SYSTEM
20230239650 · 2023-07-27 · ·

A sound controlling system including a user terminal having a sound source, a wireless communication device, a digital to analog converter (DAC) and first processing electronics. The first processing electronics are configured to: provide data of a backing sound to the sound source; control the sound source to generate a sound signal based on the data; receive a first input instruction including a first instruction to transmit the sound signal and a second instruction to play back the backing sound; provide the sound signal to the wireless communication device as the first input instruction being the first instruction, and provide the sound signal to the DAC as being the second instruction; control the wireless communication device to convert the sound signal to a wireless signal and transmit the wireless signal; and convert the sound signal from a digital signal to an analog signal for play back of the backing sound.

Apparatus, Method, or Computer Program for Processing an Encoded Audio Scene using a Parameter Smoothing

Apparatus for processing an audio scene representing a sound field, the audio scene having information on a transport signal and a first set of parameters. The apparatus has a parameter processor for processing the first set of parameters to obtain a second set of parameters, wherein the parameter processor is configured to calculate at least one raw parameter for each output time frame using at least one parameter of the first set of parameters for the input time frame, to calculate a smoothing information such as a factor for each raw parameter in accordance with a smoothing rule, and to apply a corresponding smoothing information to the corresponding raw parameter to derive the parameter of the second set of parameters for the output time frame. The apparatus further has an output interface for generating a processed audio scene using the second set of parameters and the information on the transport signal.

Apparatus, Method, or Computer Program for Processing an Encoded Audio Scene using a Bandwidth Extension

Apparatus for processing an audio scene representing a sound field, the audio scene comprising information on a transport signal and a set of parameters. The apparatus comprising an output interface for generating a processed audio scene using the set of parameters and the information on the transport signal, wherein the output interface is configured to generate a raw representation of two or more channels using the set of parameters and the transport signal and a multichannel enhancer for generating an enhancement representation of the two or more channels using the transport signal, and a signal combiner for combining the raw representation of the two or more channels and the enhancement representation of the two or more channels to obtain the processed audio scene.