Patent classifications
H04S2420/11
Systems and methods for spatial audio rendering
Systems and methods for rendering spatial audio in accordance with embodiments of the invention are illustrated. One embodiment includes a spatial audio system, including a primary network connected speaker, including a plurality of sets of drivers, where each set of drivers is oriented in a different direction, a processor system, memory containing an audio player application, wherein the audio player application configures the processor system to obtain an audio source stream from an audio source via the network interface, spatially encode the audio source, decode the spatially encoded audio source to obtain driver inputs for the individual drivers in the plurality of sets of drivers, where the driver inputs cause the drivers to generate directional audio.
AUGMENTED REALITY AND VIRTUAL REALITY FEEDBACK ENHANCEMENT SYSTEM, APPARATUS AND METHOD
- Chandrasekaran Sakthivel ,
- Michael Apodaca ,
- Kai Xiao ,
- Altug Koker ,
- Jeffery S. Boles ,
- Adam T. Lake ,
- Nikos Kaburlasos ,
- Joydeep Ray ,
- John H. Feit ,
- Travis T. Schluessler ,
- Jacek Kwiatkowski ,
- James M. Holland ,
- Prasoonkumar Surti ,
- Jonathan Kennedy ,
- Louis Feng ,
- Barnan Das ,
- Narayan Biswal ,
- Stanley J. Baran ,
- Gokcen Cilingir ,
- Nilesh V. Shah ,
- Archie Sharma ,
- Mayuresh M. Varerkar
Systems, apparatuses and methods may provide away to render augmented reality and virtual reality (VR/AR) environment information. More particularly, systems, apparatuses and methods may provide a way to selectively suppress and enhance VR/AR renderings of n-dimensional environments. The systems, apparatuses and methods may deepen a user's VR/AR experience by focusing on particular feedback information, while suppressing other feedback information from the environment.
METHOD AND APPARATUS FOR ENCODING AND DECODING AN HOA REPRESENTATION
The present invention relates to methods and apparatus for encoding an HOA signal representation (c(t)) of a sound field having an order of N and a number O=(N+1).sup.2 of coefficient sequences to a mezzanine HOA signal representation (w.sub.MEZZ(t)). The present invention further relates to methods and apparatus for decoding a reconstructed HOA signal representation from the mezzanine HOA signal representation.
Spatial Audio Augmentation and Reproduction
An apparatus including circuitry configured for: obtaining at least one spatial audio signal comprising including at least one audio signal, wherein the at least one spatial audio signal defines an audio scene forming at least in part media content; rendering an audio scene based on the at least one spatial audio signal; obtaining at least one augmentation audio signal; transforming the at least one augmentation audio signal to at least two audio objects; augmenting the audio scene based on the at least two audio objects.
Audio processing apparatus and method therefor
An audio processing apparatus comprises a receiver (705) which receives audio data including audio components and render configuration data including audio transducer position data for a set of audio transducers (703). A renderer (707) generating audio transducer signals for the set of audio transducers from the audio data. The renderer (7010) is capable of rendering audio components in accordance with a plurality of rendering modes. A render controller (709) selects the rendering modes for the renderer (707) from the plurality of rendering modes based on the audio transducer position data. The renderer (707) can employ different rendering modes for different subsets of the set of audio transducers the render controller (709) can independently select rendering modes for each of the different subsets of the set of audio transducers (703). The render controller (709) can select the rendering mode for a first audio transducer of the set of audio transducers (703) in response to a position of the first audio transducer relative to a predetermined position for the audio transducer. The approach may provide improved adaptation, e.g. to scenarios where most speakers are at desired positions whereas a subset deviate from the desired position(s).
Method and system for creating object-based audio content
Described herein is a method for creating object-based audio content from a text input for use in audio books and/or audio play, the method including the steps of: a) receiving the text input; b) performing a semantic analysis of the received text input; c) synthesizing speech and effects based on one or more results of the semantic analysis to generate one or more audio objects; d) generating metadata for the one or more audio objects; and e) creating the object-based audio content including the one or more audio objects and the metadata. Described herein are further a computer-based system including one or more processors configured to perform said method and a computer program product comprising a computer-readable storage medium with instructions adapted to carry out said method when executed by a device having processing capability.
Determination of Targeted Spatial Audio Parameters and Associated Spatial Audio Playback
A method for spatial audio signal processing, including determining, for two or more playback audio signals, at least one spatial audio parameter for providing spatial audio reproduction; determining between the two or more playback audio signals at least one audio signal relationship parameter, the at least one audio signal relationship parameter being associated with a determination of inter-channel signal relationship information between the two or more playback audio signals and for at least two frequency bands, such that the two or more playback audio signals are configured to be reproduced based on the at least one spatial audio parameter and the at least one audio signal relationship parameter.
AMBIENT SOUND ADJUSTMENTS DURING CALL HANDLING
Apparatuses, methods and computer programs are described comprising: providing an incoming call indication in response to an incoming call, the incoming call indication including an initial ambient audio signal comprising a combination of first ambient audio and second ambient audio; receiving an ambient audio control command; and adjusting the initial ambient audio signal to generate an adjusted ambient audio signal depending on the ambient audio control command.
An Apparatus and Method for Processing Volumetric Audio
A method including receiving an audio scene including at least one source captured using at least one near field microphone and at least one far field microphone. The method includes determining at least one room-impulse-response associated with the audio scene based on the at least one near field microphone and the at least one far field microphone, accessing a predetermined scene geometry corresponding to the audio scene, and identifying best match to the predetermined scene geometry in a scene geometry database. The method also includes performing RIR comparison based on the at least one RIR and at least one geometric RIR associated with the best matching geometry and rendering a volumetric audio scene based on a result of the RIR comparison.
METHODS AND DEVICES FOR GENERATING OR DECODING A BITSTREAM COMPRISING IMMERSIVE AUDIO SIGNALS
The present document describes a method (500) for generating a bitstream (101), wherein the bitstream (101) comprises a sequence of superframes (400) for a sequence of frames of an immersive audio signal (111). The method (500) comprises, repeatedly for the sequence of superframes (400), inserting (501) coded audio data (206) for one or more frames of one or more downmix channel signals (203) derived from the immersive audio signal (111), into data fields (411, 421, 412, 422) of a superframe (400); and inserting (502) metadata (202, 205) for reconstructing one or more frames of the immersive audio signal (111) from the coded audio data (206), into a metadata field (403) of the superframe (400).