Patent classifications
H04S2420/07
System and method for providing three-dimensional immersive sound
In one embodiment, a system for providing three-dimensional (3D) immersive sound is provided. The system includes a loudspeaker and at least one controller. The loudspeaker transmits an audio output signal in a listening environment. The at least one controller is programmed to store a plurality of directional bands with each directional band being defined by a narrowband frequency interval and to store at least psychoacoustic scale including a sub-band for each directional band. The at least one controller is further programmed to determine an energy for the sub-band and generate a loudspeaker driving signal based at least on the energy for the sub-band to drive the loudspeaker to transmit the audio output signal.
Spatial Audio Signal Format Generation From a Microphone Array Using Adaptive Capture
Apparatus including a processor configured to: obtain at least two microphone audio signals; determine spatial metadata transmit and/or store the spatial metadata and at least one of: at least one of the at least two microphone audio signals, at least one microphone audio signal from at least one second microphone configured to capture at least part of a same sound scene captured with the at least one first microphone, or at least one signal based, at least partially, on the at least two microphone audio signals, wherein the transmitting and/or storing is configured to enable synthesis of a plurality of spherical harmonic audio signals.
Determination of targeted spatial audio parameters and associated spatial audio playback
A method for spatial audio signal processing, including determining at least one direction parameter for at least one frequency band based on microphone signals received from a microphone array; processing the determined at least one direction parameter to determine at least one distance parameter for the at least one frequency band; and enabling an output and/or store of the at least one distance parameter, at least one audio signal, and the at least one direction parameter.
Methods, systems and apparatus for conversion of spatial audio format(s) to speaker signals
The present disclosure relates to a method of converting an audio signal in an intermediate signal format to a set of speaker feeds suitable for playback by an array of speakers. The audio signal in the intermediate signal format is obtainable from an input audio signal by means of a spatial panning function. The method comprises determining a discrete panning function for the array of speakers, determining a target panning function based on the discrete panning function, wherein determining the target panning function involves smoothing the discrete panning function, and determining a rendering operation for converting the audio signal in the intermediate signal format to the set of speaker feeds, based on the target panning function and the spatial panning function. The present disclosure further relates to a corresponding apparatus and a corresponding computer-readable storage medium.
CROSSTALK CANCELLATION FOR OPPOSITE-FACING TRANSAURAL LOUDSPEAKER SYSTEMS
Embodiments relate to audio processing for opposite facing speaker configurations that results in multiple optimal listening regions around the speakers. A system includes a left speaker and a right speaker in an opposite facing speaker configuration, and a crosstalk cancellation processor connected with the left speaker and the right speaker. The crosstalk cancellation processor applies a crosstalk cancellation to an input audio signal to generate left and right output channels. The left output channel is provided to the left speaker and the right output channel is provided to the right speaker to generate sound including multiple crosstalk cancelled listening regions that are spaced apart.
Joint wideband source localization and acquisition based on a grid-shift approach
Techniques of source localization and acquisition involve a wideband joint acoustic source localization and acquisition approach in light of sparse optimization framework based on an orthogonal matching pursuit-based grid-shift procedure. Along these lines, a specific grid structure is constructed with the same number of grid points as compared to the on-grid case, but which is “shifted” across the acoustic scene. More specifically, it is expected that each source will be located close to a grid point in at least one of the set of shifted grids. The sparse solutions corresponding to the set of shifted grids are combined to obtain the source location estimates. The estimated source positions are used as side information to obtain the original source signals.
Genetic-Algorithm-Based Equalization Using IIR Filters
Systems and methods utilize a modified genetic algorithm for adapting an off-the-shelf audio system, such as in a high-end television, to a given, particular room or other physical location presenting a specific or unique auditory environment with a set of acoustic properties. An audio system is adapted to a given room by determining an IIR based EQ solution via iterative techniques, including an iterative technique based upon a genetic algorithm adapted for an audio frequency response equalization application. In a variant, an audio system is adapted to a particular room, adjust the EQ across a microphone's bandwidth while preserving the factory-calibrated EQ response across the remaining bandwidth.
Efficient coding of audio scenes comprising audio objects
There is provided encoding and decoding methods for encoding and decoding of object based audio. An exemplary encoding method includes inter alia calculating M downmix signals by forming combinations of N audio objects, wherein M≤N, and calculating parameters which allow reconstruction of a set of audio objects formed on basis of the N audio objects from the M downmix signals. The calculation of the M downmix signals is made according to a criterion which is independent of any loudspeaker configuration.
Methods and systems for designing and applying numerically optimized binaural room impulse responses
Methods and systems for designing binaural room impulse responses (BRIRs) for use in headphone virtualizers, and methods and systems for generating a binaural signal in response to a set of channels of a multi-channel audio signal, including by applying a BRIR to each channel of the set, thereby generating filtered signals, and combining the filtered signals to generate the binaural signal, where each BRIR has been designed in accordance with an embodiment of the design method. Other aspects are audio processing units configured to perform any embodiment of the inventive method. In accordance with some embodiments, BRIR design is formulated as a numerical optimization problem based on a simulation model (which generates candidate BRIRs) and at least one objective function (which evaluates each candidate BRIR), and includes identification of a best one of the candidate BRIRs as indicated by performance metrics determined for the candidate BRIRs by each objective function.
SUBBAND SPATIAL PROCESSING AND CROSSTALK CANCELLATION SYSTEM FOR CONFERENCING
Embodiments relate to providing a conference for client devices with spatialized audio. Input audio streams are received from the client devices. For each client device, placement data defining spatial locations of other client devices within a sound field is determined. A mixed stream including a left mixed channel and a right mixed channel for the client device is generated by mixing and panning input audio streams of the other client devices according to the placement data. A spatially enhanced stream including a left enhanced channel for a left speaker and a right enhanced channel for a right speaker is generated by applying subband spatial processing and crosstalk processing on the left mixed channel and the right mixed channel of the mixed stream.