Patent classifications
H04S2420/11
ADAPTIVE AUDIO DELIVERY AND RENDERING
Aspects of the disclosure provide methods and apparatuses (e.g., client devices and server devices) for audio processing. In some examples, a client device includes processing circuitry. The processing circuitry transmits, to a server device, a selection signal indicative of an audio encoding configuration for encoding audio content in an audio input. The processing circuitry receives, from the server device, an encoded bitstream in response to the transmitting of the selection signal. The encoded bitstream includes the audio content that is encoded according to the audio encoding configuration. The processing circuitry renders audio signals based on the encoded bitstream.
Spatial audio processing
According to an example embodiment, a method for processing a spatial audio signal that represents an audio scene, wherein the spatial audio signal is controllable and associated with at least two viewing directions is provided, the method including receiving a focus direction and a focus amount; processing the spatial audio signal by modifying the audio scene so as to control emphasis in, at least in part, a portion of the spatial audio signal in said focus direction according to said focus amount; and outputting the processed spatial audio signal, wherein the modified audio scene enables the emphasis in, at least in part, said portion of the spatial audio signal in said focus direction according to said focus amount.
Apparatus and method for processing volumetric audio
A method including receiving an audio scene including at least one source captured using at least one near field microphone and at least one far field microphone. The method includes determining at least one room-impulse-response associated with the audio scene based on the at least one near field microphone and the at least one far field microphone, accessing a predetermined scene geometry corresponding to the audio scene, and identifying best match to the predetermined scene geometry in a scene geometry database. The method also includes performing RIR comparison based on the at least one RIR and at least one geometric RIR associated with the best matching geometry and rendering a volumetric audio scene based on a result of the RIR comparison.
Audio system
An audio system includes an audio/video receiver, a power supply/wireless audio distribution assembly connected to the audio/video receiver, speaker wire, and speakers compatible with the power supply/wireless audio distribution assembly.
SIGNALLING OF AUDIO EFFECT METADATA IN A BITSTREAM
Methods, systems, computer-readable media, and apparatuses for manipulating a soundfield are presented. Some configurations include receiving a bitstream that comprises metadata and a soundfield description; parsing the metadata to obtain an effect identifier and at least one effect parameter value; and applying, to the soundfield description, an effect identified by the effect identifier. The applying may include using the at least one effect parameter value to apply the identified effect to the soundfield description.
TRAINING DATA EXTENSION APPARATUS, TRAINING DATA EXTENSION METHOD, AND PROGRAM
An input of a first observation signal corresponding to an incoming signal from a first direction is received, an angular rotation operation of the first observation signal is performed to obtain a second observation signal corresponding to an incoming signal from a second direction that is different from the first direction and the second observation signal is added to a set of training data.
AUDIO PROCESSING METHOD AND APPARATUS, READABLE MEDIUM, AND ELECTRONIC DEVICE
Provided are an audio processing method and apparatus, a readable medium, and an electronic device. The method includes: acquiring an original image captured by a terminal; determining a three-dimensional relative position of a target object relative to the terminal as a first three-dimensional relative position according to the original image; and performing three-dimensional effect processing on a target sound according to the first three-dimensional relative position to enable a sound source position of the target sound in audio obtained after the three-dimensional effect processing and the first three-dimensional relative position to conform to a positional relationship between the target object and a sound effect object corresponding to the target object, where the target sound is an effect sound corresponding to the sound effect object.
SPATIAL AUDIO MONAURALIZATION VIA DATA EXCHANGE
A device includes a memory configure to store instructions and one or more processors configured to execute the instructions to obtain spatial audio data at a first audio output device. The one or more processors are further configured to perform data exchange, between the first audio output device and a second audio output device, of exchange data based on the spatial audio data. The one or more processors are also configured to generate first monaural audio output at the first audio output device based on the spatial audio data.
CONVEYING MOTION DATA VIA MEDIA PACKETS
A device includes a memory configured to store instructions and one or more processors configured to execute the instructions to receive a media packet and to determine, based on a field of the media packet, whether the media packet includes motion data. The one or more processors are also configured to execute the instructions to, based on the media packet including motion data, extract the motion data from the media packet.
Audio processing
A method for rendering a spatial audio signal that represents a sound field in a selectable viewpoint audio environment that includes one or more audio objects associated with respective audio content and a respective position in the audio environment. The method includes receiving an indication of a selected listening position and orientation in the audio environment; detecting an interaction concerning a first audio object on basis of one or more predefined interaction criteria; modifying the first audio object and one or more further audio objects linked thereto; and deriving the spatial audio signal that includes at least audio content associated with the modified first audio object in a first spatial position of the sound field that corresponds to its position in the audio environment in relation to said selected listening position and orientation, and audio content associated with the modified one or more further audio objects.