H04S2420/07

Mapping virtual sound sources to physical speakers in extended reality applications

One or more embodiments include an audio processing system for generating an audio scene for an extended reality (XR) environment. The audio processing system determines that a first virtual sound source associated with the XR environment affects a sound in the audio scene. The audio processing system generates a sound component associated with the first virtual sound source based on a contribution of the first virtual sound source to the audio scene. The audio processing system maps the sound component to a first loudspeaker included in a plurality of loudspeakers. The audio processing system outputs at least a first portion of the component for playback on the first loudspeaker.

Determining corrections to be applied to a multichannel audio signal, associated coding and decoding
20220358937 · 2022-11-10 ·

A method and device for determining a set of corrections to be made to a multichannel sound signal, in which the set of corrections is determined on the basis of an item of information representative of a spatial image of an original multichannel signal and an item of information representative of a spatial image of the original multichannel signal that has been coded and then decoded.

DYNAMICS PROCESSING ACROSS DEVICES WITH DIFFERING PLAYBACK CAPABILITIES

Individual loudspeaker dynamics processing configuration data, for each of a plurality of loudspeakers of a listening environment, may be obtained. Listening environment dynamics processing configuration data may be determined, based on the individual loudspeaker dynamics processing configuration data. Dynamics processing may be performed on received audio data based on the listening environment dynamics processing configuration data, to generate processed audio data. The processed audio data may be rendered for reproduction via a set of loudspeakers that includes at least some of the plurality of loudspeakers, to produce rendered audio signals. The rendered audio signals may be provided to, and reproduced by, the set of loudspeakers.

Methods and systems for designing and applying numerically optimized binaural room impulse responses

Methods and systems for designing binaural room impulse responses (BRIRs) for use in headphone virtualizers, and methods and systems for generating a binaural signal in response to a set of channels of a multi-channel audio signal, including by applying a BRIR to each channel of the set, thereby generating filtered signals, and combining the filtered signals to generate the binaural signal, where each BRIR has been designed in accordance with an embodiment of the design method. Other aspects are audio processing units configured to perform any embodiment of the inventive method. In accordance with some embodiments, BRIR design is formulated as a numerical optimization problem based on a simulation model (which generates candidate BRIRs) and at least one objective function (which evaluates each candidate BRIR), and includes identification of a best one of the candidate BRIRs as indicated by performance metrics determined for the candidate BRIRs by each objective function.

APPARATUS, METHODS AND COMPUTER PROGRAMS FOR ENABLING REPRODUCTION OF SPATIAL AUDIO SIGNALS
20230096873 · 2023-03-30 · ·

An apparatus (101) for enabling reproduction of spatial audio signals. The apparatus comprises means for obtaining (401) audio signals (501) comprising one or more channels and obtaining (403) spatial metadata (503) relating to the audio signals (501). The spatial metadata (503) comprises information that indicates how to spatially reproduce the audio signals. The apparatus also comprises means for obtaining (405) information relating to a field of view of video (505) wherein the video is for display on a display (205) of a rendering device (201) and wherein the video is associated with the audio signals (501). The apparatus also comprises means for aligning (407) spatial reproduction of the audio signals based, at least in part, on the obtained spatial metadata (503), with objects (309A, 309B) in the video according to the obtained information relating to the field of view of video; and enabling (409) reproduction of the audio signals based on the aligning (407).

AI-BASED DJ SYSTEM AND METHOD FOR DECOMPOSING, MISING AND PLAYING OF AUDIO DATA
20230089356 · 2023-03-23 ·

The present invention relates to a method for processing and playing audio data comprising the steps of receiving mixed input data and playing recombined output data. Furthermore, the invention relates to a device 10 for processing and playing audio data, preferably DJ equipment, comprising an audio input unit for receiving a mixed input signal, a recombination unit 32 and a playing unit 34 for playing recombined output data. In addition, the present invention relates to a method and a device for representing audio data, i.e. on a display.

AUDIO RENDERING METHOD AND APPARATUS
20230089225 · 2023-03-23 ·

This application discloses an audio rendering method and apparatus. The method includes: obtaining a to-be-rendered audio signal; determining K first combined HRTFS based on K first HRTFs and K second HRTFs; determining K second combined HRTFs based on K third HRTFs and K fourth HRTFs; determining a first target rendered signal based on the K first combined HRTFs and the to-be-rendered audio signal, where the first target rendered signal is a rendered signal output to the left ear of a listener; and determining a second target rendered signal based on the K second combined HRTFs and the to-be-rendered audio signal, where the second target rendered signal is a rendered signal output to the right ear of the listener.

Systems and methods for improving audio virtualization

Virtual sound room rendering is most realistic when the listener has themselves been the subject of the binaural room impulse response measurements, and most pleasing when the sound room involved has a high acoustic fidelity. Where the listener has no access to good sound rooms non-personalised high fidelity sound rooms are modified using information from a listener's personalised binaural impulse response data to improve the realism of such rooms. Where sound rooms are available, information from higher fidelity non-personalised sound rooms are used to improve the sound quality of the listener's personalised room data. Alternatively either personalised or non-personalised rooms can be improved through modification of their reverberation characteristics according to the listener's taste.

ELECTRONIC APPARATUS FOR AUDIO SIGNAL PROCESSING AND OPERATING METHOD THEREOF

A method of an electronic apparatus for audio signal processing includes obtaining a parameter related to spatialization of an audio object, obtaining rendering information based on the parameter related to spatialization, and rendering the audio object based on the rendering information. The parameter related to spatialization includes at least one of an object parameter of a feature of at least one of the audio object or a video object associated with the audio object, an electronic apparatus parameter of a feature of the electronic apparatus, or a user parameter of a feature of a user.

Spatial audio parameters

An apparatus including circuitry configured for: defining at least one parameter field associated with an input multi-channel audio signals, the at least one parameter field configured to describe at least one characteristic of the multi-channel audio signals; determining at least one spatial audio parameter associated with the multi-channel audio signals; and controlling a rendering of the multi-channel audio signals by processing the input multichannel audio signals using at least the at least one characteristic of the multi-channel audio signals and the at least one spatial audio parameter.