H04S2420/01

SIGNAL PROCESSING DEVICE, SIGNAL PROCESSING METHOD, AND PROGRAM

The present technology relates to a signal processing device, signal processing method, and program capable of providing a higher realistic feeling.

A signal processing device includes: an acquisition unit that acquires audio data of an audio object and metadata including position information indicating a position of the audio object and direction information indicating a direction of the audio object; and a signal generation unit that generates a reproduction signal for reproducing a sound of the audio object at a listening position on the basis of listening position information indicating the listening position, listener direction information indicating a direction of a listener at the listening position, the position information, the direction information, and the audio data. The present technology is applicable to a transmission reproduction system.

AUDIO ZOOM
20220360891 · 2022-11-10 ·

A device includes one or more processors configured to execute instructions to determine a first phase based on a first audio signal of first audio signals and to determine a second phase based on a second audio signal of second audio signals. The one or more processors are also configured to execute the instructions to apply spatial filtering to selected audio signals of the first audio signals and the second audio signals to generate an enhanced audio signal. The one or more processors are further configured to execute the instructions to generate a first output signal including combining a magnitude of the enhanced audio signal with the first phase and to generate a second output signal including combining the magnitude of the enhanced audio signal with the second phase. The first output signal and the second output signal correspond to an audio zoomed signal.

SYSTEMS AND METHODS FOR GENERATING VIDEO-ADAPTED SURROUND-SOUND
20220360933 · 2022-11-10 ·

Audiovisual presentations, such as film recordings, may have been originally created having an audio soundtrack with multiple audio tracks mixed for a surround sound system that includes a set of speakers physically surrounding a user. The present disclosure presents systems and methods to remix these soundtracks into 3D audio that when presented to the ears of a user can be perceived as a virtual surround sound system that mimics the physical system. What is more, the disclosed systems and methods can enhance the virtual surround sound system by adjusting virtual speakers of the virtual surround sound system according to video content of the audiovisual presentation. Further enhancement may be possible by adjusting the virtual speakers of the virtual surround sound system according to a sensed position of a user.

SYSTEM AND METHOD FOR WIRELESS AUDIO AND DATA CONNECTION FOR GAMING HEADPHONES AND GAMING DEVICES
20220360934 · 2022-11-10 ·

In at least one embodiment, an audio system is provided. At least one controller is programmed to encode a first and second audio component and to generate a first and a second encoded audio component. The at least one controller is programmed to apply a first gain to at least one of the first encoded audio component and the second encoded audio component to generate at least one of a first and second increased encoded audio component and to decode the at least one of the first and the second increased encoded audio component to generate at least one of a first and second decoded audio component. The at least one controller is further programmed to amplitude pan the at least one of the first and the second decoded audio component to increase a stereo width for an audio output transmitted by a first loudspeaker and a second loudspeaker.

SYSTEM AND METHOD FOR ADAPTIVE AUDIO SIGNAL GENERATION, CODING AND RENDERING

Embodiments are described for an adaptive audio system that processes audio data comprising a number of independent monophonic audio streams. One or more of the streams has associated with it metadata that specifies whether the stream is a channel-based or object-based stream. Channel-based streams have rendering information encoded by means of channel name; and the object-based streams have location information encoded through location expressions encoded in the associated metadata. A codec packages the independent audio streams into a single serial bitstream that contains all of the audio data. This configuration allows for the sound to be rendered according to an allocentric frame of reference, in which the rendering location of a sound is based on the characteristics of the playback environment (e.g., room size, shape, etc.) to correspond to the mixer's intent. The object position metadata contains the appropriate allocentric frame of reference information required to play the sound correctly using the available speaker positions in a room that is set up to play the adaptive audio content.

ARRANGEMENT FOR GENERATING HEAD RELATED TRANSFER FUNCTION FILTERS
20230038674 · 2023-02-09 ·

Arrangement for acquiring images for producing a head related transfer function filter is disclosed. In the arrangement the camera of a mobile phone or similar portable device is adjusted for the imaging. All acquired images are analyzed and only suitable images are sent further for producing the head related transfer filter. The arrangement is further configured to provide instructions to the user so that the whole head and other relevant body parts are sufficiently covered.

Methods and systems for designing and applying numerically optimized binaural room impulse responses

Methods and systems for designing binaural room impulse responses (BRIRs) for use in headphone virtualizers, and methods and systems for generating a binaural signal in response to a set of channels of a multi-channel audio signal, including by applying a BRIR to each channel of the set, thereby generating filtered signals, and combining the filtered signals to generate the binaural signal, where each BRIR has been designed in accordance with an embodiment of the design method. Other aspects are audio processing units configured to perform any embodiment of the inventive method. In accordance with some embodiments, BRIR design is formulated as a numerical optimization problem based on a simulation model (which generates candidate BRIRs) and at least one objective function (which evaluates each candidate BRIR), and includes identification of a best one of the candidate BRIRs as indicated by performance metrics determined for the candidate BRIRs by each objective function.

NEAR-FIELD AUDIO RENDERING

Examples of the disclosure describe systems and methods for presenting an audio signal to a user of a wearable head device. According to an example method, a source location corresponding to the audio signal is identified. For each of the respective left and right ear of the user, a virtual speaker position, of a virtual speaker array, is determined, the virtual speaker position collinear with the source location and with a position of the respective ear. For each of the respective left and right ear of the user, a head-related transfer function (HRTF) corresponding to the virtual speaker position and to the respective ear is determined; and the output audio signal is presented to the respective ear of the user via one or more speakers associated with the wearable head device. Processing the audio signal includes applying the HRTF to the audio signal.

APPARATUS, METHODS AND COMPUTER PROGRAMS FOR ENABLING REPRODUCTION OF SPATIAL AUDIO SIGNALS
20230096873 · 2023-03-30 · ·

An apparatus (101) for enabling reproduction of spatial audio signals. The apparatus comprises means for obtaining (401) audio signals (501) comprising one or more channels and obtaining (403) spatial metadata (503) relating to the audio signals (501). The spatial metadata (503) comprises information that indicates how to spatially reproduce the audio signals. The apparatus also comprises means for obtaining (405) information relating to a field of view of video (505) wherein the video is for display on a display (205) of a rendering device (201) and wherein the video is associated with the audio signals (501). The apparatus also comprises means for aligning (407) spatial reproduction of the audio signals based, at least in part, on the obtained spatial metadata (503), with objects (309A, 309B) in the video according to the obtained information relating to the field of view of video; and enabling (409) reproduction of the audio signals based on the aligning (407).

SOUND SIGNAL PROCESSING METHOD AND SOUND SIGNAL PROCESSING DEVICE
20230097661 · 2023-03-30 ·

A sound signal processing method includes: obtaining a sound signal; obtaining impulse response data that was measured in a predetermined space before the sound signal is obtained; generating an early reflected sound control signal not including a reverberant sound by convolving impulse response data of an early reflected sound among the obtained impulse response data into the obtained sound signal.