Patent classifications
H04S2400/01
NEAR-FIELD AUDIO RENDERING
Examples of the disclosure describe systems and methods for presenting an audio signal to a user of a wearable head device. According to an example method, a source location corresponding to the audio signal is identified. For each of the respective left and right ear of the user, a virtual speaker position, of a virtual speaker array, is determined, the virtual speaker position collinear with the source location and with a position of the respective ear. For each of the respective left and right ear of the user, a head-related transfer function (HRTF) corresponding to the virtual speaker position and to the respective ear is determined; and the output audio signal is presented to the respective ear of the user via one or more speakers associated with the wearable head device. Processing the audio signal includes applying the HRTF to the audio signal.
APPARATUS, METHODS AND COMPUTER PROGRAMS FOR ENABLING REPRODUCTION OF SPATIAL AUDIO SIGNALS
An apparatus (101) for enabling reproduction of spatial audio signals. The apparatus comprises means for obtaining (401) audio signals (501) comprising one or more channels and obtaining (403) spatial metadata (503) relating to the audio signals (501). The spatial metadata (503) comprises information that indicates how to spatially reproduce the audio signals. The apparatus also comprises means for obtaining (405) information relating to a field of view of video (505) wherein the video is for display on a display (205) of a rendering device (201) and wherein the video is associated with the audio signals (501). The apparatus also comprises means for aligning (407) spatial reproduction of the audio signals based, at least in part, on the obtained spatial metadata (503), with objects (309A, 309B) in the video according to the obtained information relating to the field of view of video; and enabling (409) reproduction of the audio signals based on the aligning (407).
SOUND SIGNAL PROCESSING METHOD AND SOUND SIGNAL PROCESSING DEVICE
A sound signal processing method includes: obtaining a sound signal; obtaining impulse response data that was measured in a predetermined space before the sound signal is obtained; generating an early reflected sound control signal not including a reverberant sound by convolving impulse response data of an early reflected sound among the obtained impulse response data into the obtained sound signal.
Time-varying always-on compensation for tonally balanced 3D-audio rendering
A system reduces sound coloration caused by rendering of a 3D audio signal. The system renders the 3D audio signal including a plurality of channels using the input audio signal. Input spectra data defining spectral information of the input audio signal is computed. 3D spectra data defining spectral information of a single channel representation of the 3D audio signal is computed. The system generates a tonal balance filter based on the input spectral data and the 3D spectral data. The tonal balance filter, when applied to the 3D audio signal, reduces sound coloration caused by the rendering of the 3D audio signal. The tonal balance filter is applied to the 3D audio signal to generate an output audio signal and the output audio signal is presented via a speaker array.
METHOD FOR PROCESSING AN AUDIO SIGNAL, SIGNAL PROCESSING UNIT, BINAURAL RENDERER, AUDIO ENCODER AND AUDIO DECODER
A method for processing an audio signal in accordance with a room impulse response is described. The audio signal is processed with an early part of the room impulse response separate from a late reverberation of the room impulse response, wherein the processing of the late reverberation has generating a scaled reverberated signal, the scaling being dependent on the audio signal. The processed early part of the audio signal and the scaled reverberated signal are combined.
Method and System for Surround Sound Processing in an Audio Device
An audio headset may receive a plurality of audio signals corresponding to plurality of surround sound channels. The headset may determine, via its audio processing circuitry, context and/or content of the audio signals. The audio processing circuitry may process the audio signals to generate stereo signals carrying one or more virtual surround channels, wherein the processing comprises automatically controlling, based on the context and the content of the audio signals, a simulated acoustic environment of the virtual surround channels.
INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND NON-TRANSITORY COMPUTER READABLE MEDIUM
An information processing device includes a processor configured to output, in a case where a service is being used in which at least speech is exchanged among multiple users such that a conversation takes places among all of the multiple users, a speech of a separate conversation distinctly from a speech of the conversation taking place among all of the multiple users to a device of a user who is engaged in the separate conversation with a specific user from among the multiple users, and output the speech of the conversation taking place among all of the multiple users without outputting the speech of the separate conversation to a device of a user who is not engaged in the separate conversation.
SYSTEM AND METHOD FOR TRANSMITTING AT LEAST ONE MULTICHANNEL AUDIO MATRIX ACROSS A GLOBAL NETWORK
A system and method for transmitting at least one multichannel audio matrix across a global network is shown and described. The method for transmitting at least one multichannel audio matrix across a global network begins by capturing audio from a production source. The captured audio is then converted from an analog format to a digital format. The digital audio may then be encoded using an audio codec. The audio is sent to a network and received at a second location. The second location uses a specialized computing device to ensure the audio is properly received. If the audio was encoded, it is now decoded. Once decoded if needed the audio is converted back to analog format. This will allow for the audio to be mixed on a mixing device.
Audio decoder, audio encoder, method for providing at least four audio channel signals on the basis of an encoded representation, method for providing an encoded representation on the basis of at least four audio channel signals and computer program using a bandwidth extension
An audio decoder for providing at least four bandwidth-extended channel signals on the basis of an encoded representation provides first and second downmix signals on the basis of a jointly encoded representation of the first and second downmix signals using a multi-channel decoding and provides at least first and second audio channel signals on the basis of the first downmix signal using a multi-channel decoding, and provides at least third and fourth audio channel signals on the basis of the second downmix signal using a multi-channel decoding. It performs a multi-channel bandwidth extension on the basis of the first and third audio channel signals, to obtain first and third bandwidth-extended channel signals, and performs a multi-channel bandwidth extension on the basis of the second and fourth audio channel signals, to obtain second and fourth bandwidth extended channel signals. An audio encoder uses a related concept.
Methods for parametric multi-channel encoding
The present document relates to audio coding systems. In particular, the present document relates to efficient methods and systems for parametric multi-channel audio coding. An audio encoding system configured to generate a bitstream indicative of a downmix signal and spatial metadata for generating a multi-channel upmix signal from the downmix signal is described. The system comprises a downmix processing unit configured to generate the downmix signal from a multi-channel input signal; wherein the downmix signal comprises m channels and wherein the multi-channel input signal comprises n channels; n, m being integers with m<n. Furthermore, the system comprises a parameter processing unit configured to determine the spatial metadata from the multi-channel input signal. In addition, the system comprises a configuration unit configured to determine one or more control settings for the parameter processing unit based on one or more external settings; wherein the one or more external settings comprise a target data-rate for the bitstream and wherein the one or more control settings comprise a maximum data-rate for the spatial metadata.