Patent classifications
H04S2400/13
IMMERSIVE MEDIA COMPATIBILITY
Aspects of the disclosure provide methods and apparatuses for audio processing. In some examples, an apparatus for media processing includes processing circuitry. The processing circuitry receives first six degrees of freedom (6 DoF) information associated with a media content for a scene in a media application. The first 6 DoF information includes a first spatial location and a first rotation orientation for rotation about a center at the first spatial location. The processing circuitry determines that a rendering platform for rendering the media content is a three degrees of freedom (3 DoF) platform; and calculates, a revolution orientation of the media content on a sphere centered other than the first spatial location, according to at least the first spatial location. The revolution orientation is 3 DoF information associated with the media content for rendering on the 3 DoF platform.
SPEAKER TO ADJUST ITS SPEAKER SETTINGS
Examples disclosed herein include a speaker. The speaker may include a group of microphones and a processor. The processor may determine a first speaker-channel identifier for a multi-speaker system at least partially responsive to a first tone captured at the group of microphones. The processor may also determine a position of a source of the captured first tone relative to the speaker at least partially responsive to position information derived from the captured first tone. The processor may also determine a second speaker-channel identifier at least partially responsive to the first speaker-channel identifier and the position of the source of the captured first tone. The processor may also determine speaker settings at least partially responsive to the second speaker-channel identifier. Related devices, systems and methods are also disclosed.
Encoded audio metadata-based equalization
A system for producing an encoded digital audio recording has an audio encoder that encodes a digital audio recording having a number of audio channels or audio objects. An equalization (EQ) value generator produces a sequence of EQ values which define EQ filtering that is to be applied when decoding the encoded digital audio recording, wherein the EQ filtering is to be applied to a group of one or more of the audio channels or audio objects of the recording independent of any downmix. A bitstream multiplexer combines the encoded digital audio recording with the sequence of EQ values, the latter as metadata associated with the encoded digital audio recording. Other embodiments are also described including a system for decoding the encoded audio recording.
MULTI-DEVICE AUDIO ADJUSTMENT COORDINATION
This relates to intelligent automated assistants and, more specifically, to the intelligent coordination of audio signal output adjustments among multiple electronic devices. An example method includes, generating a local audio intent object associated with a software application stored on a first electronic device, the local audio intent object including one or more local audio parameters; determining that a second electronic device that is outputting an audio signal is proximate to the first electronic device; generating a proximate audio intent object corresponding to the second electronic device based on the one or more local audio adjustment parameters and a round-trip time (RTT) of a communication connection between the first electronic device and the second electronic device; and transmitting the proximate audio intent object to the second electronic device via the communication connection, wherein the proximate audio intent object causes the second electronic device to adjust the output of the audio signal.
Determining corrections to be applied to a multichannel audio signal, associated coding and decoding
A method and device for determining a set of corrections to be made to a multichannel sound signal, in which the set of corrections is determined on the basis of an item of information representative of a spatial image of an original multichannel signal and an item of information representative of a spatial image of the original multichannel signal that has been coded and then decoded.
DYNAMICS PROCESSING ACROSS DEVICES WITH DIFFERING PLAYBACK CAPABILITIES
Individual loudspeaker dynamics processing configuration data, for each of a plurality of loudspeakers of a listening environment, may be obtained. Listening environment dynamics processing configuration data may be determined, based on the individual loudspeaker dynamics processing configuration data. Dynamics processing may be performed on received audio data based on the listening environment dynamics processing configuration data, to generate processed audio data. The processed audio data may be rendered for reproduction via a set of loudspeakers that includes at least some of the plurality of loudspeakers, to produce rendered audio signals. The rendered audio signals may be provided to, and reproduced by, the set of loudspeakers.
METHOD AND APPARATUS FOR RENDERING VOLUME SOUND SOURCE
A method and apparatus for rendering a volume sound source are disclosed. The method of rendering a volume sound source may include identifying information about a listener and information about the volume sound source, determining a corresponding area in which a source element is disposed in the volume sound source in consideration of the information about the listener, determining an angle between the listener and the corresponding area based on the information about the listener and the information about the volume sound source, determining a number of source elements disposed in the corresponding area according to the angle, determining a position and a gain of the source element using i) the number of source elements and ii) a distance between the listener and the volume sound source, and rendering the volume sound source according to the position and the gain of the source element.
SYSTEM AND METHOD FOR WIRELESS AUDIO AND DATA CONNECTION FOR GAMING HEADPHONES AND GAMING DEVICES
In at least one embodiment, an audio system is provided. At least one controller is programmed to encode a first and second audio component and to generate a first and a second encoded audio component. The at least one controller is programmed to apply a first gain to at least one of the first encoded audio component and the second encoded audio component to generate at least one of a first and second increased encoded audio component and to decode the at least one of the first and the second increased encoded audio component to generate at least one of a first and second decoded audio component. The at least one controller is further programmed to amplitude pan the at least one of the first and the second decoded audio component to increase a stereo width for an audio output transmitted by a first loudspeaker and a second loudspeaker.
MULTIBAND LIMITER MODES AND NOISE COMPENSATION METHODS
Some implementations involve receiving a content stream that includes audio data, receiving at least one type of level adjustment indication relating to playback of the audio data and controlling a level of the input audio data, based on the at least one type of level adjustment indication, to produce level-adjusted audio data. Some examples involve determining, based at least in part on the type(s) of level adjustment indication, a multiband limiter configuration, applying the multiband limiter to the level-adjusted audio data, to produce multiband limited audio data and providing the multiband limited audio data to one or more audio reproduction transducers of an audio environment.
APPARATUS, METHODS AND COMPUTER PROGRAMS FOR ENABLING REPRODUCTION OF SPATIAL AUDIO SIGNALS
An apparatus (101) for enabling reproduction of spatial audio signals. The apparatus comprises means for obtaining (401) audio signals (501) comprising one or more channels and obtaining (403) spatial metadata (503) relating to the audio signals (501). The spatial metadata (503) comprises information that indicates how to spatially reproduce the audio signals. The apparatus also comprises means for obtaining (405) information relating to a field of view of video (505) wherein the video is for display on a display (205) of a rendering device (201) and wherein the video is associated with the audio signals (501). The apparatus also comprises means for aligning (407) spatial reproduction of the audio signals based, at least in part, on the obtained spatial metadata (503), with objects (309A, 309B) in the video according to the obtained information relating to the field of view of video; and enabling (409) reproduction of the audio signals based on the aligning (407).