Patent classifications
H04S2420/11
Method for processing data for the estimation of mixing parameters of audio signals, mixing method, devices, and associated computers programs
A method and apparatus are provided for processing data for estimating mixing parameters of at least one audio spot signal captured by a sound recording device, called a spot microphone, arranged in the vicinity of a source among a plurality of acoustic sources constituting a sound scene, and a primary audio signal captured by an ambisonic sound recording device, arranged to capture said plurality of acoustic sources of the sound scene.
Adaptive ambisonic binaural rendering
A user equipment (UE) includes a memory element and a processor. The memory element is configured to store a plurality of head-related transfer functions. The processor is configured to receive an audio signal. The audio signal includes a plurality of ambisonic signals. The processor is also configured to identify an orientation of the UE based on physical properties of the UE. The processor is also configured to rotate the plurality of ambisonic signals based on the orientation of the UE. The processor is also configured to filter the plurality of ambisonic signals using the plurality of head-related transfer functions to form speaker signals. The processor is also configured to output the speaker signals.
METHOD AND APPARATUS FOR PROCESSING AUDIO SIGNAL
Provided are an audio signal processing method and apparatus for adjusting a location of an audio object in correspondence to a location of a visual object. The audio signal processing apparatus includes a matching unit configured to select an audio object corresponding to a visual object extracted from a video signal among at least one audio object extracted from an audio signal, a location adjusting unit configured to adjust a location of a sound image of the audio signal based on a location of the selected audio object and a location of a visual object corresponding to the selected audio, and an output unit configured to output an audio signal whose the location of the sound image is adjusted.
ACOUSTIC SIMULATION APPARATUS
A virtual reproduction signal generation unit generates a virtual reproduction signal based on a sound pickup signal of a stereophonic sound at a listening position in a compartment, assuming that virtual speakers are respectively located at portions of Np positions in a vehicle, the virtual reproduction signal causing the virtual speakers of the Np positions to reproduce the stereophonic sound. A virtual prediction signal generation unit generates a virtual prediction signal based on the virtual reproduction signal and an information representing a change of acoustic characteristics when at least part of the portions of the Np positions is changed, the virtual prediction signal causing the virtual speakers of the Np positions to output a predicted sound at the listening position. An output signal generation unit generates an output signal based on the virtual prediction signal, the output signal causing speakers of a plurality of positions to output the predicted sound.
METHOD FOR CONVERTING A FIRST SET OF SIGNALS REPRESENTATIVE OF A SOUND FIELD INTO A SECOND SET OF SIGNALS AND ASSOCIATED ELECTRONIC DEVICE
A conversion method comprises the following steps: for each of the signals (b.sub.E(t)) of a first set, determining values (α(f)) respectively associated with frequency bands; for each frequency band, converting the values (α(f)) associated with the relevant frequency band into at least one value representative of a virtual sound source oriented along the spatial direction associated with a data item stored for the relevant temporal frequency band; for each temporal frequency band, determining, on the basis of an above-mentioned representative value, a plurality of values (γ(f)) associated with the different signals (b.sub.S(t)) of the second set, respectively; constructing each signal (b.sub.S(t)) of the second set on the basis of the values (γ(f)) associated with this signal of the second set.
SOUND FIELD ADJUSTMENT
A device includes one or more processors configured to receive, via wireless transmission from a playback device, data associated with a pose of the playback device. The one or more processors are also configured to select, based on the data, a particular representation of a sound field from a plurality of representations of the sound field. Each respective representation of the sound field corresponds to a different sector of a set of sectors. A sector represents a range of values associated with movement of the playback device. The one or more processors are further configured to generate audio data corresponding to the selected representation of the sound field. one or more processors are also configured to send, via wireless transmission, the audio data as streaming data to the playback device.
Method and apparatus for decoding a bitstream including encoded higher order ambisonics representations
Higher Order Ambisonics represents three-dimensional sound independent of a specific loudspeaker set-up. However, transmission of an HOA representation results in a very high bit rate. Therefore compression with a fixed number of channels is used, in which directional and ambient signal components are processed differently. For coding, portions of the original HOA representation are predicted from the directional signal components. This prediction provides side information which is required for a corresponding decoding. By using some additional specific purpose bits, a known side information coding processing is improved in that the required number of bits for coding that side information is reduced on average.
Immersive media with media device
Aspects of the subject disclosure may include, for example, a method, comprising: receiving, by a media processor including a processor, spherical audiovisual media content from a content delivery network; rendering, by the media processor, video for a point of view in the spherical audiovisual media content at a display device coupled to the media processor; receiving, from a remote control device coupled to the media processor, a control signal panning the point of view, resulting in a new field of view; and generating, by the media processor, audio signals from the spherical audiovisual media content corresponding to the new field of view, wherein the audio signals are adapted to audio reproduction equipment coupled to the media processor. Other embodiments are disclosed.
Ambience Audio Representation and Associated Rendering
An apparatus including circuitry configured for: defining at least one ambience audio representation, the ambience audio representation includes at least one respective diffuse background audio signal and at least one parameter, the at least one parameter associated with the at least one respective diffuse background audio signal and further associated with at least one frequency range or at least one part of the frequency range, at least one time period or at least one part of the time period and a directional range for a defined position within an audio field, wherein the at least one component representation is configured to be used in rendering an ambiance audio signal by a 6- degrees-of-freedom or enhanced 3-degrees-of-freedom Tenderer by processing, based on the at least one ambience audio representation and a listener position and/or direction, the respective diffuse background audio signal.
Spatial audio signal encoder
A method to encode audio signals is provided for use with an audio capture device that includes multiple microphones having a spatial arrangement on the device, a method to encode audio signals comprising: receiving multiple microphone signals corresponding to the multiple microphones; determining a number and directions of arrival of directional audio sources represented in the one or more microphone signals; determining one of an active microphone signal component and a passive microphone signal component, based upon the determined number and directions of arrival; determining the other of the active microphone signal component and the passive microphone signal component, based upon the determined one of the active input spatial audio signal component and the passive input spatial audio signal component; encoding the active microphone signal component; encoding the passive microphone signal component.