Patent classifications
H04S2420/01
Audio processing apparatus and method therefor
An audio processing apparatus comprises a receiver (705) which receives audio data including audio components and render configuration data including audio transducer position data for a set of audio transducers (703). A renderer (707) generating audio transducer signals for the set of audio transducers from the audio data. The renderer (7010) is capable of rendering audio components in accordance with a plurality of rendering modes. A render controller (709) selects the rendering modes for the renderer (707) from the plurality of rendering modes based on the audio transducer position data. The renderer (707) can employ different rendering modes for different subsets of the set of audio transducers the render controller (709) can independently select rendering modes for each of the different subsets of the set of audio transducers (703). The render controller (709) can select the rendering mode for a first audio transducer of the set of audio transducers (703) in response to a position of the first audio transducer relative to a predetermined position for the audio transducer. The approach may provide improved adaptation, e.g. to scenarios where most speakers are at desired positions whereas a subset deviate from the desired position(s).
System and method for realistic rotation of stereo or binaural audio
A system for rotating sound or selective listening to sound provides for the ability of the apparent direction of sound sources in a listening environment to remain in consistent orientations in space despite rotations of the microphones used to capture the sound and despite rotations of the head of the listener, even when wearing headphones. Modules are provided in the system to distinguish the sound sources and their apparent directions, as well as to optionally rotate the sound sources in response to detected rotations of the listener's head and/or detected rotations of the microphones.
Differential spatial rendering of audio sources
Methods and systems for intuitive spatial audio rendering with improved intelligibility are disclosed. By establishing a virtual association between an audio source and a location in the listener's virtual audio space, a spatial audio rendering system can generate spatial audio signals that create a natural and immersive audio field for a listener. The system can receive the virtual location of the source as a parameter and map the source audio signal to a source-specific multi-channel audio signal. In addition, the spatial audio rendering system can be interactive and dynamically modify the rendering of the spatial audio in response to a user's active control or tracked movement.
Calibrating an audio system using a user's auditory steady state response
An audio system is described for calibrating one or more transducers for a user using the auditory steady state response (ASSR) of the user. The audio system presents an audio calibration signal to the user via a transducer array in the headset. The system measures electrical signals that are generated in the auditory cortex of the brain of the user in response to the presented audio calibration signal. The audio system determines an ASSR of the user based on the measured electrical signals. The audio system determines a value for one or more sound filter parameter based on the determined ASSR and a model. The audio system calibrates the transducer array using the determined sound filter parameters. The calibrated transducer array is used to present audio content to the user.
METHOD FOR GENERATING A CUSTOMIZED/PERSONALIZED HEAD RELATED TRANSFER FUNCTION
There is provided a method for generating a personalized Head Related Transfer Function (HRTF). The method can include capturing an image of an ear using a portable device, auto-scaling the captured image to determine physical geometries of the ear and obtaining a personalized HRTF based on the determined physical geometries of the ear.
SYSTEM FOR AND METHOD OF GENERATING AN AUDIO IMAGE
A system for and a method of generating an audio image for use in rendering audio. The method comprises accessing an audio stream; accessing positional information, the positional information comprising a first position, a second position and a third position; and generating an audio image. In some embodiments, generating the audio image comprises generating, based on the audio stream, a first virtual wave front to be perceived by a listener as emanating from the first position; generating, based on the audio stream, a second virtual wave front to be perceived by the listener as emanating from the second position; and generating, based on the audio stream, a third virtual wave front to be perceived by the listener as emanating from the third position.
Systems and methods for modifying room characteristics for spatial audio rendering over headphones
An audio rendering system includes a processor that combines audio input signals with personalized spatial audio transfer functions having room responses. The personalized spatial audio transfer functions are selected from a database having a plurality of candidate transfer functions derived from in-ear microphone measurements for a plurality of individuals. Alternatively, the personalized transfer functions are derived from actual in-ear measurements of the listener. A room modification module allows the user to modify the personalized spatial audio transfer functions to substitute a different room or to modify the characteristics of the selected room without requiring additional in ear measurements. The module segments the selected transfer function into regions including one or more of direct; head and torso influenced; early reflection, and late reverberation regions. Extraction and modification operations are performed on one or more of the regions to alter the perceived sound.
Detection of audio panning and synthesis of 3D audio from limited-channel surround sound
A method includes receiving a multi-channel audio signal (101) including multiple input audio channels (102, 104, 106, 108) that are configured to play audio from multiple respective locations relative to a listener. One or more spectral components that undergo a panning effect (1001, 1002, 1003) are identified in the multi-channel audio signal among at least some of the input audio channels. One or more virtual channels (1100, 1200, 1300) are generated, which together with the input audio channels form an extended set (111) of audio channels that retain the identified panning effect. A reduced set (222) of output audio signals, fewer in number than the input audio signals, is generated from the extended set, including recreating the panning effect in the output audio signals. The reduced set of output audio signals is outputted to a user.
MULTI-DEVICE AUDIO ADJUSTMENT COORDINATION
This relates to intelligent automated assistants and, more specifically, to the intelligent coordination of audio signal output adjustments among multiple electronic devices. An example method includes, generating a local audio intent object associated with a software application stored on a first electronic device, the local audio intent object including one or more local audio parameters; determining that a second electronic device that is outputting an audio signal is proximate to the first electronic device; generating a proximate audio intent object corresponding to the second electronic device based on the one or more local audio adjustment parameters and a round-trip time (RTT) of a communication connection between the first electronic device and the second electronic device; and transmitting the proximate audio intent object to the second electronic device via the communication connection, wherein the proximate audio intent object causes the second electronic device to adjust the output of the audio signal.
Capturing and synchronizing data from multiple sensors
Processes, methods, systems, and devices are disclosed for synchronizing multiple wireless data streams captured in action by various sensors, with lost data recovery. For example, a source device may have multiple sensors acquiring data and sending the data streams (e.g., via Bluetooth connections) to a target device. Timing information may be appended for each of the data streams. Data packets of the multiple data streams may be formed with the timing information. The data packets may be transmitted to a target device that is configured to synchronize the multiple data streams using the timing information. The target device, applying the example processes or techniques of this disclosure, may accurately synchronize the multiple data streams. In some cases, the target device may capture additional data streams and the processor synchronizes all data streams of both the source and the target devices.