H04S7/302

Dual-zone automotive multimedia system

A dual-zone automotive multimedia system may include a first infotainment device associated with a front zone of a vehicle, at least one second infotainment device associated with a rear zone of a vehicle, wherein the at least one second infotainment device includes a directional loudspeaker arranged facing the rear zone of the vehicle, and a processor programmed to transmit audio signals to the first and second infotainment devices to create sound at each of the front and rear zones, wherein the audio signal transmitted to the directional loudspeaker relates to playback at the rear zone.

MULTI-DEVICE AUDIO ADJUSTMENT COORDINATION
20230048256 · 2023-02-16 ·

This relates to intelligent automated assistants and, more specifically, to the intelligent coordination of audio signal output adjustments among multiple electronic devices. An example method includes, generating a local audio intent object associated with a software application stored on a first electronic device, the local audio intent object including one or more local audio parameters; determining that a second electronic device that is outputting an audio signal is proximate to the first electronic device; generating a proximate audio intent object corresponding to the second electronic device based on the one or more local audio adjustment parameters and a round-trip time (RTT) of a communication connection between the first electronic device and the second electronic device; and transmitting the proximate audio intent object to the second electronic device via the communication connection, wherein the proximate audio intent object causes the second electronic device to adjust the output of the audio signal.

SIGNAL PROCESSING DEVICE, SIGNAL PROCESSING METHOD, AND PROGRAM

The present technology relates to a signal processing device, signal processing method, and program capable of providing a higher realistic feeling.

A signal processing device includes: an acquisition unit that acquires audio data of an audio object and metadata including position information indicating a position of the audio object and direction information indicating a direction of the audio object; and a signal generation unit that generates a reproduction signal for reproducing a sound of the audio object at a listening position on the basis of listening position information indicating the listening position, listener direction information indicating a direction of a listener at the listening position, the position information, the direction information, and the audio data. The present technology is applicable to a transmission reproduction system.

SIGNAL PROCESSING DEVICE, METHOD, AND PROGRAM
20220360930 · 2022-11-10 ·

The present technology relates to a signal processing device, a method, and a program that make it possible for a user to obtain a higher realistic feeling. The signal processing device includes: an audio generation unit that generates a sound source signal according to a type of a sound source on the basis of a recorded signal obtained by sound collection by a microphone attached to a moving object; a correction information generation unit that generates position correction information indicating a distance between the microphone and the sound source; and a position information generation unit that generates sound source position information indicating a position of the sound source in a target space on the basis of microphone position information indicating a position of the microphone in the target space and the position correction information. The present technology can be applied to a recording/transmission/reproduction system.

METHOD AND APPARATUS FOR REPRESENTING SPACE OF INTEREST OF AUDIO SCENE
20220360929 · 2022-11-10 · ·

Aspects of the disclosure include methods, apparatuses, and non-transitory computer-readable storage mediums for representing a space of interest of an audio scene. One apparatus includes processing circuitry that decodes audio scene data for the audio scene. The audio scene data includes (i) audio content for a plurality of items representing the audio scene and (ii) a first syntax element indicating a type of a subset of the plurality of items. The subset of the plurality of items represents the space of interest of the audio scene. The processing circuitry determines a part of the audio content for the subset of the plurality of items based on the type of the subset of the plurality of items indicated in the first syntax element. The processing circuitry renders the determined part of the audio content.

SYSTEM AND METHOD FOR ADAPTIVE AUDIO SIGNAL GENERATION, CODING AND RENDERING

Embodiments are described for an adaptive audio system that processes audio data comprising a number of independent monophonic audio streams. One or more of the streams has associated with it metadata that specifies whether the stream is a channel-based or object-based stream. Channel-based streams have rendering information encoded by means of channel name; and the object-based streams have location information encoded through location expressions encoded in the associated metadata. A codec packages the independent audio streams into a single serial bitstream that contains all of the audio data. This configuration allows for the sound to be rendered according to an allocentric frame of reference, in which the rendering location of a sound is based on the characteristics of the playback environment (e.g., room size, shape, etc.) to correspond to the mixer's intent. The object position metadata contains the appropriate allocentric frame of reference information required to play the sound correctly using the available speaker positions in a room that is set up to play the adaptive audio content.

MULTIBAND LIMITER MODES AND NOISE COMPENSATION METHODS

Some implementations involve receiving a content stream that includes audio data, receiving at least one type of level adjustment indication relating to playback of the audio data and controlling a level of the input audio data, based on the at least one type of level adjustment indication, to produce level-adjusted audio data. Some examples involve determining, based at least in part on the type(s) of level adjustment indication, a multiband limiter configuration, applying the multiband limiter to the level-adjusted audio data, to produce multiband limited audio data and providing the multiband limited audio data to one or more audio reproduction transducers of an audio environment.

APPARATUS, METHODS AND COMPUTER PROGRAMS FOR ENABLING REPRODUCTION OF SPATIAL AUDIO SIGNALS
20230096873 · 2023-03-30 · ·

An apparatus (101) for enabling reproduction of spatial audio signals. The apparatus comprises means for obtaining (401) audio signals (501) comprising one or more channels and obtaining (403) spatial metadata (503) relating to the audio signals (501). The spatial metadata (503) comprises information that indicates how to spatially reproduce the audio signals. The apparatus also comprises means for obtaining (405) information relating to a field of view of video (505) wherein the video is for display on a display (205) of a rendering device (201) and wherein the video is associated with the audio signals (501). The apparatus also comprises means for aligning (407) spatial reproduction of the audio signals based, at least in part, on the obtained spatial metadata (503), with objects (309A, 309B) in the video according to the obtained information relating to the field of view of video; and enabling (409) reproduction of the audio signals based on the aligning (407).

EARBUD ORIENTATION-BASED BEAMFORMING

An earbud includes an earbud speaker, a microphone array including a plurality of microphones, an orientation sensing subsystem, and a beamforming subsystem. The orientation sensing subsystem is configured to output an orientation signal indicating an orientation of the earbud. The beamforming subsystem is configured to output a beamformed signal. The beamformed signal is based at least on the orientation signal and a plurality of microphone signals from the plurality of microphones in the microphone array. The beamformed signal spatially selectively filters the plurality of microphone signals.

INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING SYSTEM, AND INFORMATION PROCESSING METHOD

An information processing apparatus includes a control unit configured to execute a scene detection process, a parameter extraction process, and an output process. The scene detection process detects a scene from an input content. The parameter extraction process extracts a realistic sensation parameter for wave control that corresponds to a scene that is detected by the scene detection process. The output process outputs a wave signal for the content that is produced by processing sound data of the input content by a realistic sensation parameter that is extracted by the parameter extraction process.