H04S3/002

Previewing spatial audio scenes comprising multiple sound sources

An apparatus comprising means for: in response to user input, selecting at least one sound source of a spatial audio scene, comprising multiple sound sources, the spatial audio scene being defined by spatial audio content; selecting at least one related contextual sound source based on the at least one selected sound source; and causing rendering of an audio preview, representing the spatial audio content, that can be selected by a user, wherein the audio preview comprises a mix of sound sources including at least the at least one selected sound source and the at least one related contextual sound source but not all of the multiple sound sources of the spatial audio scene, and wherein selection of the audio preview causes an operation on at least the selected sound source.

APPARATUS AND METHOD
20220182776 · 2022-06-09 · ·

The present disclosure pertains to an apparatus comprising circuitry configured to: determine a loudspeaker dependent spread factor for at least one individual loudspeaker of a loudspeaker arrangement, wherein the loudspeaker dependent spread factor depends on a specification of the at least one individual loudspeaker; and 5 control the outputs of the loudspeakers of the loudspeaker arrangement based on the loudspeaker dependent spread factor for the at least one individual loudspeaker to generate at least one virtual sound source.

ACOUSTIC DEVICE

An acoustic device includes: a generating unit that generates a monaural signal on the basis of a left stereo signal and a right stereo signal in a low frequency band; an extracting unit that extracts a stereo component for an L-channel and a stereo component for an R-channel on the basis of a left stereo signal and a right stereo signal in a high frequency band; a first combining unit that combines the monaural signal and the stereo component for the L-channel; and a second combining unit that combines the monaural signal and the stereo component for the R-channel.

Adaptive panner of audio objects

An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.

Signal processing device, channel setting method, and speaker system

To enable channel setting to be performed easily and accurately to speakers. Thus, provided is a speaker system including: N number of speakers, the N being three or more; and a signal processing device capable of communicating with each speaker. The signal processing device performs processing of recognizing two arrangement reference speakers with reception of notification that a designation operation has been received from a user, from two speakers among the N number of speakers, and processing of acquiring distance information between each speaker. The signal processing device recognizes a relative-position relationship between the N number of speakers, with the two arrangement reference speakers and the distance information between each speaker. Then, the signal processing device automatically sets a channel to each speaker, on the basis of the relative-position relationship recognized.

Acoustic signal mixing apparatus and non-transitory computer readable storage medium
11356774 · 2022-06-07 · ·

A mixing apparatus includes: a first speaker set processing unit to a P-th speaker set processing unit. K-th speaker set processing unit (K being an integer from 1 to P) includes: a mic set processing unit configured to process acoustic signals output by two microphones of a corresponding microphone set and to output a first acoustic signal and a second acoustic signal. The mic set processing unit configured to process acoustic signals output by two microphones of a corresponding microphone set based on an expansion/contraction coefficient for determining an expansion/contraction rate of a sound field, a shift coefficient for determining a shift amount of a sound field, and an attenuation coefficient for determining an attenuation amount of an acoustic signal output by a microphone.

Methods, apparatus and systems for audio reproduction

Audio perception in local proximity to visual cues is provided. A device includes a video display, first row of audio transducers, and second row of audio transducers. The first and second rows can be vertically disposed above and below the video display. An audio transducer of the first row and an audio transducer of the second row form a column to produce, in concert, an audible signal. The perceived emanation of the audible signal is from a plane of the video display (e.g., a location of a visual cue) by weighing outputs of the audio transducers of the column. In certain embodiments, the audio transducers are spaced farther apart at a periphery for increased fidelity in a center portion of the plane and less fidelity at the periphery.

Spatial sound rendering

An apparatus for spatial audio signal decoding including at least one processor and at least one memory including a computer program code configured to cause the apparatus at least to: receive at least one associated audio signal, the at least one associated audio signal based on a spatial audio signal; spatial metadata associated with the at least one associated audio signal, the spatial metadata including at least one parameter representing an ambiance energy distribution of the spatial audio signal and at least one directional parameter representing directional information of the spatial audio signal; synthesize from the at least one associated audio signal at least one output audio signal based on the at least one directional parameter and the least one parameter, wherein the at least one parameter controls ambiance energy distribution of the at least one output signal.

Method, apparatus or systems for processing audio objects

Diffuse or spatially large audio objects may be identified for special processing. A decorrelation process may be performed on audio signals corresponding to the large audio objects to produce decorrelated large audio object audio signals. These decorrelated large audio object audio signals may be associated with object locations, which may be stationary or time-varying locations. For example, the decorrelated large audio object audio signals may be rendered to virtual or actual speaker locations. The output of such a rendering process may be input to a scene simplification process. The decorrelation, associating and/or scene simplification processes may be performed prior to a process of encoding the audio data.

VIRTUAL HEIGHT AND SURROUND EFFECT IN SOUNDBAR WITHOUT UP-FIRING AND SURROUND SPEAKERS

An apparatus to realize the virtual height and surround effect. The apparatus includes at least an input source, a processor and front speaker. The input source provides the input signals on front, surround and height channels input into the processor in which a beamforming, channel separation and/or virtual-height effect are applied on each of the source channels, respectively. After the processing, all produced output channels output by the processor are arranged and combined into existing speakers of the soundbar.