H04S5/005

Reverberation generation for headphone virtualization

The present disclosure relates to reverberation generation for headphone virtualization. A method of generating one or more components of a binaural room impulse response (BRIR) for headphone virtualization is described. In the method, directionally-controlled reflections are generated, wherein directionally-controlled reflections impart a desired perceptual cue to an audio input signal corresponding to a sound source location. Then at least the generated reflections are combined to obtain the one or more components of the BRIR. Corresponding system and computer program products are described as well.

Audio processing apparatus with channel remapper and object renderer

Embodiments are described for rendering spatial audio content through a system that is configured to reflect audio off of one or more surfaces of a listening environment. The system includes an array of audio drivers distributed around a room, wherein at least one driver of the array of drivers is configured to project sound waves toward one or more surfaces of the listening environment for reflection to a listening area within the listening environment and a renderer configured to receive and process audio streams and one or more metadata sets that are associated with each of the audio streams and that specify a playback location in the listening environment.

Encoding of multi-channel audio signal to generate encoded binaural signal, and associated decoding of encoded binaural signal

An audio encoder comprises a multi-channel receiver which receives an M-channel audio signal where M>2. A down-mix processor down-mixes the M-channel audio signal to a first stereo signal and associated parametric data and a spatial processor modifies the first stereo signal to generate a second stereo signal in response to the associated parametric data and spatial parameter data for a binaural perceptual transfer function, such as a Head Related Transfer Function (HRTF). The second stereo signal is a binaural signal and may specifically be a (3D) virtual spatial signal. An output data stream comprising the encoded data and the associated parametric data is generated by an encode processor and an output processor. The HRTF processing may allow the generation of a (3D) virtual spatial signal by conventional stereo decoders. A multi-channel decoder may reverse the process of the spatial processor to generate an improved quality multi-channel signal.

METHOD AND APPARATUS FOR REPRODUCING THREE-DIMENSIONAL AUDIO
20200228908 · 2020-07-16 · ·

A three-dimensional (3D) audio reproducing method and apparatus is provided. The 3D audio reproducing method may include receiving a multichannel signal comprising a plurality of input channels; and performing downmixing according to a frequency range of the multichannel signal in order to format-convert the plurality of input channels into a plurality of output channels having elevation.

LOCATION INFORMATION THROUGH DIRECTIONAL SOUND PROVIDED BY MOBILE COMPUTING DEVICE
20200228914 · 2020-07-16 ·

A mobile computing device that provides location information through directional sound is described herein. The mobile computing device includes a location detection system that provides location signals corresponding to a user location and a destination location, such as a vehicle location or a vertex of a predefined travel route, to a spatial audio generation system to define a spatial audio signal based on a direction from the user location to the destination location. The spatial audio signal is provided to an audio device of the mobile computing device that outputs the spatial audio signal as directional sound having a locus at the destination location.

SIGNAL PROCESSING DEVICE AND IMAGE DISPLAY APPARATUS INCLUDING THE SAME

Disclosed are a signal processing device and an image display apparatus including the same. The signal processing device and an image display apparatus including the same include: a converter configured to convert of frequency of an input stereo audio signal; a primary component analyzer configured to perform primary component analysis based on a signal from the converter; a feature extractor configured to extract a feature of a primary component signal based on a signal from the primary component analyzer; an envelope adjustor configured to perform envelope adjustment based on prediction performed on the basis of a deep neural network model; and an inverse converter configured to inversely convert a signal from the envelope adjustor to output an upmix audio signal of multi-channel. Accordingly, when upmixing the downmix stereo audio signal to a multichannel audio signal, spatial distortion can be improved.

IMMERSIVE AUDIO REPRODUCTION SYSTEMS

Systems and methods can provide an elevated, virtual loudspeaker source in a three-dimensional soundfield using loudspeakers in a horizontal plane. In an example, a processor circuit can receive at least one height audio signal that includes information intended for reproduction using a loudspeaker that is elevated relative to a listener, and optionally offset from the listener's facing direction by a specified azimuth angle. A first virtual height filter can be selected for use based on the specified azimuth angle. A virtualized audio signal can be generated by applying the first virtual height filter to the at least one height audio signal. When the virtualized audio signal is reproduced using one or more loudspeakers in the horizontal plane, the virtualized audio signal can be perceived by the listener as originating from an elevated loudspeaker source that corresponds to the azimuth angle.

COMBINATION OF IMMERSIVE AND BINAURAL SOUND
20200196056 · 2020-06-18 ·

The present subject matter provides a technical solution to the technical problems facing sound localization by separating sounds and reproducing the separated sounds using a set of loudspeakers and a set of headphones. A general soundtrack that is meant to be experienced throughout the room would play through the loudspeakers, and specific sounds that are meant to be experienced near the listener would be played through a binaural representation in the headphones. The headphones may be selected to avoid occluding the ear, allowing sound produced at the loudspeakers to be heard clearly. This separation and reproduction of sounds using a combination of a loudspeaker and headphone provides a technical solution to the technical problem facing typical surround sound systems by localizing sounds for listeners in any location within a room. This improves reproduction accuracy of location-specific audio objects, including audio objects above or below a coplanar speaker configuration.

Methods and Systems for Extended Reality Audio Processing for Near-Field and Far-Field Audio Reproduction

An exemplary mobile edge compute (MEC) server implementing an extended reality audio processing system generates a near-field audio data stream and a far-field audio data stream. The near-field audio data stream is configured to be rendered by a near-field rendering system, while the far-field audio data stream is configured to be rendered by a far-field rendering system. The near-field and far-field audio data streams are each representative of virtual sound presented to an avatar of a user experiencing an extended reality world. The MEC server provides the near-field and far-field audio data streams to a media player device separate from the MEC server and implementing the near-field and far-field rendering systems. Specifically, the MEC server provides the audio data streams for concurrent rendering by the media player device as the user experiences the extended reality world using the media player device. Corresponding methods and systems are also disclosed.

Location information through directional sound provided by mobile computing device

A mobile computing device that provides location information through directional sound is described herein. The mobile computing device includes a location detection system that provides location signals corresponding to a user location and a destination location, such as a vehicle location or a vertex of a predefined travel route, to a spatial audio generation system to define a spatial audio signal based on a direction from the user location to the destination location. The spatial audio signal is provided to an audio device of the mobile computing device that outputs the spatial audio signal as directional sound having a locus at the destination location.