Patent classifications
H04S7/306
Determination of composite acoustic parameter value for presentation of audio content
Determination of a composite acoustic parameter value for a headset is presented herein. A directionally enhanced audio signal is generated based on audio signals from an acoustic sensor array and a spatial signal enhancement filter that is directed for enhancement of a sound source. A SNR improvement value is determined based on a SNR value of the directionally enhanced audio signal and a SNR value of an audio signal from an acoustic sensor of the acoustic sensor array. The SNR improvement value is input into a model that maps SNR improvement values to spatial acoustic parameters to determine a spatial acoustic parameter. A temporal acoustic parameter is determined based on the audio signals. The composite acoustic parameter value is determined based on the spatial acoustic parameter and a temporal acoustic parameter value. Audio content presented to a user is adjusted based in part on the composite acoustic parameter value.
SPATIAL AUDIO DOWNMIXING
Channels of audio data in a spatial audio object are associated with any one or more of a direction and a location of one or more recorded sounds, which channels are to be reproduced as spatial sound. A visualized spatial sound object represents a snapshot/thumbnail of the spatial sound. To preview the spatial sound (by experiencing its snapshot or thumbnail), a user manipulates the orientation of the visualized spatial sound object, and a weighted downmix of the channels is rendered for output as a spatial preview sound, e.g., a single output audio signal is provided to a spatial audio renderer; one or more of the channels that are oriented toward the user are emphasized in the preview sound, more than channels that are oriented away from the user. Other aspects are also described and claimed.
ACOUSTIC PROCESSING APPARATUS, ACOUSTIC PROCESSING METHOD, AND ACOUSTIC PROCESSING PROGRAM
An acoustic processing apparatus (1) according to the present disclosure includes an acquisition unit (41), a storage unit (3), and a processing unit (42). The acquisition unit (41) acquires location information regarding a location viewed by a user who listens to an acoustic content. The storage unit (3) stores acoustic information (34) regarding acoustics at the location. The processing unit (42) converts an acoustic characteristic of the acoustic content into an acoustic characteristic in accordance with the location on the basis of the acoustic information (34) and regenerates a sound field.
BINAURAL RENDERING FOR HEADPHONES USING METADATA PROCESSING
Embodiments are described for a method of rendering audio for playback through headphones comprising receiving digital audio content, receiving binaural rendering metadata generated by an authoring tool processing the received digital audio content, receiving playback metadata generated by a playback device, and combining the binaural rendering metadata and playback metadata to optimize playback of the digital audio content through the headphones.
AUGMENTED REALITY HEADPHONE ENVIRONMENT RENDERING
Accurate modeling of acoustic reverberation can be essential to generating and providing a realistic virtual reality or augmented reality experience for a participant. In an example, a reverberation signal for playback using headphones can be provided. The reverberation signal can correspond to a virtual sound source signal originating at a specified location in a local listener environment. Providing the reverberation signal can include, among other things, using information about a reference impulse response from a reference environment and using characteristic information about reverberation decay in a local environment of the participant. Providing the reverberation signal can further include using information about a relationship between a volume of the reference environment and a volume of the local environment of the participant.
Method of improving externalization of virtual surround sound
Aspects of the present disclosure relate to techniques for processing a source audio signal in order to localize sounds. In particular, aspects of the present disclosure relate to sound localization techniques which externalize sounds for headphone audio, such as a virtual surround sound headphone system. In various implementations, room reverberations and other acoustic effects of the environment may be more accurately modeled using improved room reverberation models. For example, in some implementations, the underlying source signal may be filtered with a filter representing a room impulse response that is a combination of a stereo room impulse response and a mono room impulse response. By way of further example, in some implementations the source signal may be filtered with a combined impulse response filter that is derived from binaural recordings of simulated impulses recorded in a desired reverberant environment.
Virtual stereo synthesis method and apparatus
A virtual stereo synthesis method includes acquiring at least one sound input signal on a first side and at least one sound input signal on a second side, separately performing ratio processing on a preset head related transfer function (HRTF) left-ear component and a preset HRTF right-ear component of each sound input signal on the second side, to obtain a filtering function of each sound input signal on the second side, separately performing convolution filtering on each sound input signal on the second side and the filtering function of the sound input signal on the second side, to obtain the filtered signal on the second side, and synthesizing all of the sound input signals on the first side and all of the filtered signals on the second side into a virtual stereo signal where the method may alleviate a coloration effect, and reduce calculation complexity.
AUDIO SYSTEM FOR ARTIFICIAL REALITY APPLICATIONS
Embodiments of the present disclosure relate to an audio system for artificial reality applications. One or more transducers of the audio system output, in accordance with audio instructions, one or more ultrasonic pressure waves simulating a virtual audio source near an ear of a user of the headset. A controller of the audio system generates the audio instructions such that the one or more ultrasonic pressure waves form at least a portion of audio content for presentation to the user. An array of microphones of the audio system detects audio signals in a local area. A deep neural network of the audio system processes the detected audio signals to generate enhanced audio content, and the one or more transducers present the enhanced audio content to a user.
Generating binaural audio in response to multi-channel audio using at least one feedback delay network
In some embodiments, virtualization methods for generating a binaural signal in response to channels of a multi-channel audio signal, which apply a binaural room impulse response (BRIR) to each channel including by using at least one feedback delay network (FDN) to apply a common late reverberation to a downmix of the channels. In some embodiments, input signal channels are processed in a first processing path to apply to each channel a direct response and early reflection portion of a single-channel BRIR for the channel, and the downmix of the channels is processed in a second processing path including at least one FDN which applies the common late reverberation. Typically, the common late reverberation emulates collective macro attributes of late reverberation portions of at least some of the single-channel BRIRs. Other aspects are headphone virtualizers configured to perform any embodiment of the method.
SPATIALIZED AUDIO RELATIVE TO A PERIPHERAL DEVICE
An audio system, method, and computer program product which includes a wearable audio device and a mobile peripheral device. Each device is capable of determining its respective absolute or relative position and orientation. Once the relative positions and orientations between the devices are known, virtual sound sources are generated at fixed positions and orientations relative to the peripheral device such that any change in position and/or orientation of the peripheral device produces a proportional change in the position and/or orientation of the virtual sound sources. Additionally, first order and second order reflected audio paths may be simulated for each virtual sound source to increase the realism of the simulated sources. Each sound path can be produced by modifying the original audio signal using head-related transfer functions (HRTFs) to simulate audio as though it were perceived by the user's left and right ears as coming from each virtual sound source.