Patent classifications
H04S5/00
Spatial transformation of ambisonic audio data
A device configured to decode a bitstream, where the device includes a memory configured to store a temporally encoded representation of spatial audio signals. The device is also configured to receive the bitstream that includes an indication of a spatial transformation, and includes a temporal decoding unit, coupled to the memory, configured to decode one or more spatial audio signals represented in a spatial domain, where the one or more spatial audio signals are associated with different angles in the spatial domain. In addition, the device includes an inverse spatial transformation unit, coupled to the temporal decoding unit, that is configured to convert the one or more spatial audio signals represented in the spatial domain into at least three ambisonic coefficients that, in part, represent a soundfield in an ambisonics domain, and perform a spatial transformation of the soundfield based on the indication of the spatial transformation received in the bitstream.
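As an illustration of what a spatial transformation of an ambisonic soundfield can look like, the sketch below rotates a first-order soundfield about the vertical axis. The abstract does not say which transformation is signalled, so the choice of a yaw rotation, the function names `encode_horizontal_source` and `rotate_foa_yaw`, and the simplified unit gain on W are all assumptions made for this example only.

```python
import math

def encode_horizontal_source(azimuth_rad, gain=1.0):
    """Encode a horizontal plane-wave source as first-order coefficients (W, X, Y, Z).

    Assumption: W carries the plain gain; real systems apply a normalization
    convention to W that is ignored here.
    """
    return (gain, gain * math.cos(azimuth_rad), gain * math.sin(azimuth_rad), 0.0)

def rotate_foa_yaw(w, x, y, z, yaw_rad):
    """Rotate one first-order ambisonic sample about the vertical axis.

    W (omnidirectional) and Z (vertical) are unchanged by a yaw rotation;
    X and Y are mixed by a 2-D rotation matrix.
    """
    c, s = math.cos(yaw_rad), math.sin(yaw_rad)
    return (w, c * x - s * y, s * x + c * y, z)

# A source straight ahead (azimuth 0) rotated by 90 degrees ends up on the Y axis.
front = encode_horizontal_source(0.0)
rotated = rotate_foa_yaw(*front, math.pi / 2)
```

In a decoder of the kind described, the rotation angle would come from the indication carried in the bitstream rather than from a hard-coded value.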
ELECTRONIC APPARATUS AND CONTROLLING METHOD THEREOF
An electronic apparatus and/or a controlling method are provided. The electronic apparatus may include a camera for photographing an image, a microphone for receiving an input of a sound of a first channel, and a processor for generating sounds of a plurality of channels based on the input sound, wherein the processor is configured to identify an object and the location of the object from the photographed image, classify the input sound based on an audio source, and allot the sound to the corresponding identified object, copy the classified sound and generate sounds of two channels, adjust characteristics of the generated sounds of two channels based on the audio source allotted to the identified object and the location of the identified object, and mix the sounds of two channels wherein the characteristics were adjusted according to the audio source and generate a stereo sound of two channels.
Method for generating and outputting an acoustic multichannel signal
Method for generating and outputting an acoustic multichannel signal, comprising the steps of: supplying a stereo signal (S), splitting the supplied stereo signal (S) into a plurality of perception-direction-dependent acoustic signal components (S.1-S.5), generating an acoustic multichannel signal by mixing each perception-direction-dependent acoustic signal component (S.1-S.5) onto an output channel (4.1-4.12) of an acoustic output apparatus (4) that comprises a plurality of, in particular more than two, acoustic output channels (4.1-4.12), outputting the generated multichannel signal over respective acoustic output channels (4.1-4.12) of the acoustic output apparatus (4).
Audio signal processor, system and methods distributing an ambient signal to a plurality of ambient signal channels
An audio signal processor for providing ambient signal channels on the basis of an input audio signal is configured to extract an ambient signal on the basis of the input audio signal. The signal processor is configured to distribute the ambient signal to a plurality of ambient signal channels in dependence on positions or directions of sound sources within the input audio signal, wherein a number of ambient signal channels is larger than a number of channels of the input audio signal.
CONTENT BASED SPATIAL REMIXING
A trained machine is configured to input a stereo sound track and separate the stereo sound track into multiple N separated stereo audio signals respectively characterized by multiple N audio content classes. All stereo audio as input in the stereo sound track is included in the N separated stereo audio signals. A mixing module is configured to spatially localize symmetrically and without cross-talk, between left and right, the N separated stereo audio signals into multiple output channels. The output channels include respective mixtures of one or more of the N separated stereo audio signals. The gain of the output channels is adjusted into left and right binaural outputs to conserve summed levels of the N separated stereo audio signals distributed over the output channels.
METHOD AND APPARATUS FOR ADAPTIVE CONTROL OF DECORRELATION FILTERS
An audio signal processing method and apparatus for adaptively adjusting a decorrelator. The method comprises obtaining a control parameter and calculating a mean and a variation of the control parameter. A ratio of the variation to the mean of the control parameter is calculated, and a decorrelation parameter is calculated based on said ratio. The decorrelation parameter is then provided to a decorrelator.
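The steps in this abstract can be sketched in a few lines. The abstract does not define how "variation" is measured or how the ratio maps to the decorrelation parameter, so this sketch assumes the population standard deviation as the variation and a simple clamp to a range as the mapping; the function name `decorrelation_parameter` and the bounds `lo` and `hi` are hypothetical.

```python
import statistics

def decorrelation_parameter(control_values, lo=0.0, hi=1.0):
    """Derive a decorrelation parameter from recent values of a control parameter.

    Assumptions for illustration: variation = population standard deviation,
    and the ratio is clamped to [lo, hi] to form the parameter.
    """
    mean = statistics.fmean(control_values)
    variation = statistics.pstdev(control_values)
    # Ratio of variation to mean; guard against a zero mean.
    ratio = variation / abs(mean) if mean != 0 else float("inf")
    return min(hi, max(lo, ratio))

steady = decorrelation_parameter([2.0, 2.0, 2.0])   # no variation
varying = decorrelation_parameter([1.0, 3.0])       # std 1, mean 2
```

A steady control parameter yields a ratio of zero, while a fluctuating one yields a larger value, which is the adaptive behaviour the abstract describes at a high level.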
Systems and methods of spatial audio playback with enhanced immersiveness
A method of playing back audio content with improved immersiveness can include receiving, at a playback device, audio input including vertical content having a high-frequency portion and a low-frequency portion. The playback device can face along a first sound axis and comprise an up-firing transducer configured to direct sound along a second sound axis that is vertically angled with respect to the first sound axis and a side-firing transducer or array configured to direct sound along a third axis that is horizontally angled with respect to the first sound axis. The low-frequency portion of the vertical content can be played back via the side-firing transducer or array, while the high-frequency portion of the vertical content can be played back via the up-firing transducer.
PARAMETRIC RECONSTRUCTION OF AUDIO SIGNALS
An encoding system encodes an N-channel audio signal (X), wherein N≥3, as a single-channel downmix signal (Y) together with dry and wet upmix parameters ($\tilde{C}$, $\tilde{P}$). In a decoding system, a decorrelating section outputs, based on the downmix signal, an (N−1)-channel decorrelated signal (Z); a dry upmix section maps the downmix signal linearly in accordance with dry upmix coefficients (C) determined based on the dry upmix parameters; a wet upmix section populates an intermediate matrix based on the wet upmix parameters and knowing that the intermediate matrix belongs to a predefined matrix class, obtains wet upmix coefficients (P) by multiplying the intermediate matrix by a predefined matrix, and maps the decorrelated signal linearly in accordance with the wet upmix coefficients; and a combining section combines outputs from the upmix sections to obtain a reconstructed signal ($\hat{X}$) corresponding to the signal to be reconstructed.
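The combining step described here amounts to adding a dry contribution (coefficients C applied to the downmix Y) and a wet contribution (coefficients P applied to the decorrelated signal Z). The sketch below shows that sum for a single time sample. It assumes the combination is a plain addition, takes the coefficients as already derived, and does not model how P is obtained from the intermediate matrix; the function name `reconstruct` and the example values are hypothetical.

```python
def reconstruct(y, z, dry, wet):
    """Reconstruct N output samples from one downmix sample.

    y   : downmix sample (single channel)
    z   : list of N-1 decorrelated samples
    dry : list of N dry upmix coefficients (C)
    wet : N rows, each with N-1 wet upmix coefficients (P)
    Assumption: the combining section simply adds the dry and wet parts.
    """
    return [
        c * y + sum(p * zk for p, zk in zip(row, z))
        for c, row in zip(dry, wet)
    ]

# N = 3 channels, so the decorrelated signal has 2 channels.
x_hat = reconstruct(
    y=2.0,
    z=[1.0, -1.0],
    dry=[0.5, 0.25, 0.25],
    wet=[[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]],
)
```

The example values are arbitrary and chosen only so the arithmetic is easy to check by hand.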
Method and apparatus for processing multimedia signals
The present invention relates to a method and an apparatus for processing a signal, which are used for effectively reproducing a multimedia signal, and more particularly, to a method and an apparatus for processing a signal, which are used for implementing filtering for a multimedia signal having a plurality of subbands with a low calculation amount. To this end, provided are a method for processing a multimedia signal, including: receiving a multimedia signal having a plurality of subbands; receiving at least one set of prototype filter coefficients for filtering each subband signal of the multimedia signal; converting the prototype filter coefficients into a plurality of sets of subband filter coefficients; truncating each set of subband filter coefficients based on filter order information obtained by at least partially using characteristic information extracted from the corresponding subband filter coefficients, the length of at least one truncated set of subband filter coefficients being different from the length of the truncated subband filter coefficients of another subband; and filtering the multimedia signal by using the truncated subband filter coefficients corresponding to each subband signal; and an apparatus for processing a multimedia signal using the same.
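To make the per-subband truncation concrete, the sketch below shortens each subband filter independently, so different subbands can end up with different lengths. The abstract does not say which characteristic of the coefficients determines the filter order, so the cumulative-energy criterion, the function name `truncate_by_energy`, and the `keep_fraction` threshold are assumptions for illustration.

```python
def truncate_by_energy(coeffs, keep_fraction=0.99):
    """Keep the shortest prefix of coeffs holding keep_fraction of the total energy.

    Assumption: cumulative energy stands in for the "characteristic
    information" that sets the filter order.
    """
    total = sum(c * c for c in coeffs)
    if total == 0:
        return []
    acc = 0.0
    for i, c in enumerate(coeffs):
        acc += c * c
        if acc >= keep_fraction * total:
            return list(coeffs[: i + 1])
    return list(coeffs)

# Two hypothetical subband filters with different decay rates.
subband_filters = [
    [1.0, 0.5, 0.01, 0.001],  # slower decay
    [1.0, 0.0, 0.0, 0.0],     # all energy in the first tap
]
truncated = [truncate_by_energy(f) for f in subband_filters]
lengths = [len(f) for f in truncated]
```

Shorter filters in subbands whose energy decays quickly are what reduce the overall filtering cost.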