Patent classifications
H04S3/006
SCALABLE UNIFIED AUDIO RENDERER
In general, techniques are described by which to support scalable unified audio rendering. A device comprising an audio decoder, a memory, and a processor may be configured to perform various aspects of the techniques. The audio decoder may decode, from a bitstream, first audio data and second audio data. The memory may store the first audio data and the second audio data. The processor may render the first audio data into first spatial domain audio data for playback by virtual speakers at a set of virtual speaker locations, and render the second audio data into second spatial domain audio data for playback by the virtual speakers at the set of virtual speaker locations. The processor may also mix the first spatial domain audio data and the second spatial domain audio data to obtain mixed spatial domain audio data, and convert the mixed spatial domain audio data to scene-based audio data.
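The render, mix, and convert steps in this abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the four-speaker horizontal layout, the cosine-weighted panning gains, and the unnormalized first-order W/X/Y encoding used as the scene-based format.

```python
import math

# Hypothetical layout: four virtual speakers on the horizontal plane (azimuths in degrees).
SPEAKER_AZIMUTHS = [45.0, 135.0, 225.0, 315.0]


def render_to_speakers(samples, source_azimuth):
    """Render a mono signal to the virtual speakers using cosine-weighted gains (assumed panner)."""
    gains = []
    for azimuth in SPEAKER_AZIMUTHS:
        diff = math.radians(source_azimuth - azimuth)
        gains.append(max(0.0, math.cos(diff)))  # speakers facing away get zero gain
    norm = math.sqrt(sum(g * g for g in gains)) or 1.0
    gains = [g / norm for g in gains]  # keep total power constant
    return [[g * s for s in samples] for g in gains]


def mix(feeds_a, feeds_b):
    """Mix two sets of speaker feeds that share the same virtual speaker locations."""
    return [[a + b for a, b in zip(ch_a, ch_b)] for ch_a, ch_b in zip(feeds_a, feeds_b)]


def speakers_to_first_order(feeds):
    """Encode each speaker feed as a plane wave at its azimuth (unnormalized W, X, Y)."""
    length = len(feeds[0])
    w, x, y = [0.0] * length, [0.0] * length, [0.0] * length
    for azimuth, feed in zip(SPEAKER_AZIMUTHS, feeds):
        cos_az = math.cos(math.radians(azimuth))
        sin_az = math.sin(math.radians(azimuth))
        for i, value in enumerate(feed):
            w[i] += value
            x[i] += value * cos_az
            y[i] += value * sin_az
    return {"W": w, "X": x, "Y": y}


first = render_to_speakers([1.0, 0.5], 45.0)    # first audio data
second = render_to_speakers([0.0, 0.5], 225.0)  # second audio data
scene = speakers_to_first_order(mix(first, second))
```

Because both inputs are rendered to the same virtual speaker set, mixing reduces to a per-speaker sum, and only one conversion to the scene-based format is needed regardless of how many inputs are mixed.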
Audio device
A method and system dynamically adjust the audio of an audio and video signal to improve its overall sound quality and dialog intelligibility. Some embodiments apply gain, equalization, audio signal compression, and spatial enhancement (reverb) to individual channels of a multichannel audio signal.
Loudness level and range processing
Loudness signal processors and methods for processing an input audio signal in order to control a resulting integrated loudness level and a resulting loudness range of an output audio signal by a predetermined target loudness level and by a predetermined target loudness range, the processors and methods comprising level detection and level distribution analysis; transfer function generation based on the level distribution, the predetermined target loudness level and the predetermined target loudness range; and calculation of a gain to apply to said input audio signal, resulting in said output audio signal.
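The chain of level detection, level distribution analysis, transfer function generation, and gain calculation might look roughly like the sketch below. It is a simplified illustration under stated assumptions, not the patented method: levels are plain block RMS values in dB (no frequency weighting or gating), the range is taken between the 10th and 95th percentiles, the centre is the arithmetic mean of the block levels, and the transfer function is a single straight line in the dB domain that only compresses.

```python
import math


def block_levels_db(samples, block_size):
    """Level detection: RMS level of each non-overlapping block, in dB."""
    levels = []
    for start in range(0, len(samples) - block_size + 1, block_size):
        block = samples[start:start + block_size]
        mean_square = sum(s * s for s in block) / block_size
        levels.append(10.0 * math.log10(max(mean_square, 1e-12)))
    return levels


def percentile(sorted_values, fraction):
    """Nearest-rank percentile of an already sorted list."""
    index = int(round(fraction * (len(sorted_values) - 1)))
    return sorted_values[index]


def make_transfer_function(levels, target_level_db, target_range_db):
    """Build an input-level to output-level mapping from the level distribution."""
    ordered = sorted(levels)
    low, high = percentile(ordered, 0.10), percentile(ordered, 0.95)
    measured_range = max(high - low, 1e-6)
    centre = sum(levels) / len(levels)  # assumed stand-in for integrated loudness
    slope = min(1.0, target_range_db / measured_range)  # compress only, never expand

    def transfer(level_db):
        return target_level_db + slope * (level_db - centre)

    return transfer


def block_gains_db(levels, transfer):
    """Gain to apply to each block so that it lands on the transfer curve."""
    return [transfer(level) - level for level in levels]


levels = block_levels_db([1.0] * 4 + [0.1] * 4, block_size=4)
transfer = make_transfer_function(levels, target_level_db=-23.0, target_range_db=10.0)
gains = block_gains_db(levels, transfer)
```

In this toy input the measured range is about 20 dB, so the slope becomes 0.5 and the two blocks end up about 10 dB apart, centred on the target level.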
Adaptive audio construction
Systems, methods, and computer program products for creating an object-based audio signal from an audio input are described. The audio input includes one or more audio channels that are recorded to collectively define an audio scene. The one or more audio channels are captured from a respective one or more spatially separated microphones disposed in a stable spatial configuration. A system receives the audio input. The system performs spatial analysis on the one or more audio channels to identify one or more audio objects within the audio scene. The system determines contextual information relating to the one or more audio objects. The system defines respective audio streams including audio data relating to at least one of the identified one or more audio objects. The system then outputs an object-based audio signal including the audio streams and the contextual information.
Method and device for processing received sound and memory medium, mobile terminal, robot having the same
Disclosed herein are a method and a device for processing received sound. The method includes receiving at least three sets of sound sent by at least three microphones of an apparatus; calculating a position of a sounding point in a three-dimensional space according to the at least three sets of sound; and adjusting a physical position of the apparatus according to the position of the sounding point in the three-dimensional space and an optimal sound receiving region of the apparatus, so that the optimal sound receiving region approaches or contains the position of the sounding point. The physical position includes a spatial position and an orientation of the apparatus. The embodiments can accurately determine the position of the sounding point and then adjust the state of the apparatus according to that position, so as to optimize sound reception.
IMAGING APPARATUS
Imaging apparatus (100) includes selectors (115, 120) that select sound signals having a set number of channels, and a control unit. When the number of channels is set to two at the time of recording sound signals, the control unit records, according to a first format, sound data generated based on the selected sound signals for two channels on one sound track included in a video file. When the number of channels is set to four, the control unit records, in accordance with the first format, two pieces of sound data respectively on two sound tracks included in one video file.
Spatial Audio Apparatus
An apparatus including: an input configured to receive from at least two microphones at least two audio signals; at least two processor instances configured to generate separate output audio signal tracks from the at least two audio signals from the at least two microphones; a file processor configured to link the at least two output audio signal tracks within a file structure.
Spatial audio apparatus
An apparatus comprising: an input configured to receive at least one audio signal from a further apparatus; an input configured to receive at least one audio signal associated with the apparatus; an orientation/location determiner configured to determine a relative orientation/location difference between the apparatus and the further apparatus; an audio processor configured to process the at least one audio signal from the further apparatus based on the relative orientation/location difference between the apparatus and the further apparatus; and a combiner configured to combine the at least one audio signal from the further apparatus having been processed and the at least one audio signal associated with the apparatus.
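As a rough illustration of the determiner, audio processor, and combiner described here, the sketch below pans a mono signal from the further apparatus according to the relative orientation difference and sums it with the local stereo signal. The compass-bearing convention, the constant-power pan law, the clamping to the frontal half-plane, and all function names are assumptions chosen for the example.

```python
import math


def relative_azimuth(local_heading_deg, bearing_to_remote_deg):
    """Signed angle in (-180, 180] from the local heading to the remote device (positive = right)."""
    diff = (bearing_to_remote_deg - local_heading_deg) % 360.0
    return diff - 360.0 if diff > 180.0 else diff


def pan_remote(samples, azimuth_deg):
    """Constant-power stereo pan of a mono signal; azimuth is clamped to -90..90 degrees."""
    clamped = max(-90.0, min(90.0, azimuth_deg))
    theta = (clamped + 90.0) / 180.0 * (math.pi / 2.0)
    left_gain, right_gain = math.cos(theta), math.sin(theta)
    return [s * left_gain for s in samples], [s * right_gain for s in samples]


def combine(local_left, local_right, remote_left, remote_right):
    """Sum the processed remote signal with the local signal, channel by channel."""
    left = [a + b for a, b in zip(local_left, remote_left)]
    right = [a + b for a, b in zip(local_right, remote_right)]
    return left, right


azimuth = relative_azimuth(local_heading_deg=0.0, bearing_to_remote_deg=270.0)
remote_left, remote_right = pan_remote([1.0, 0.5], azimuth)
out_left, out_right = combine([0.1, 0.1], [0.1, 0.1], remote_left, remote_right)
```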
ENCODED AUDIO METADATA-BASED EQUALIZATION
A system for producing an encoded digital audio recording has an audio encoder that encodes a digital audio recording having a number of audio channels or audio objects. An equalization (EQ) value generator produces a sequence of EQ values which define EQ filtering that is to be applied when decoding the encoded digital audio recording, wherein the EQ filtering is to be applied to a group of one or more of the audio channels or audio objects of the recording independent of any downmix. A bitstream multiplexer combines the encoded digital audio recording with the sequence of EQ values, the latter as metadata associated with the encoded digital audio recording. Other embodiments are also described including a system for decoding the encoded audio recording.
3D sound orientation adaptability
An apparatus includes one or more processors configured to receive orientation data and to select, based on the orientation data, a particular filter from among multiple filters. The one or more processors are configured to perform signal processing operations associated with three-dimensional (3D) sound data based on the particular filter.
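One way to picture orientation-based filter selection is a small bank of filters indexed by yaw angle, from which the nearest entry is chosen and applied. The sketch below assumes yaw-only orientation data, a hypothetical filter bank of short FIR tap lists, and nearest-angle selection; none of these details come from the abstract.

```python
def angular_distance(a_deg, b_deg):
    """Smallest absolute difference between two angles, with wraparound at 360 degrees."""
    diff = abs(a_deg - b_deg) % 360.0
    return min(diff, 360.0 - diff)


def select_filter(yaw_deg, filter_bank):
    """Pick the filter whose design angle is closest to the reported yaw."""
    nearest = min(filter_bank, key=lambda angle: angular_distance(yaw_deg, angle))
    return filter_bank[nearest]


def apply_fir(samples, taps):
    """Direct-form FIR filtering; output has the same length as the input."""
    output = []
    for n in range(len(samples)):
        acc = 0.0
        for k, tap in enumerate(taps):
            if n - k >= 0:
                acc += tap * samples[n - k]
        output.append(acc)
    return output


# Hypothetical filter bank: design yaw in degrees mapped to FIR taps.
FILTER_BANK = {
    0.0: [1.0],
    90.0: [0.5, 0.5],
    180.0: [0.0, 1.0],
    270.0: [0.25],
}

taps = select_filter(100.0, FILTER_BANK)
filtered = apply_fir([1.0, 0.0, 2.0], taps)
```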