Patent classifications
H04S5/005
Audio encoding device and audio decoding device
An input signal includes a channel-based audio signal and an object-based audio signal, and an audio encoding device includes an audio scene analysis unit configured to determine an audio scene from the input signal and detect audio scene information; a channel-based encoder that encodes the channel-based audio signal output from the audio scene analysis unit; an object-based encoder that encodes the object-based audio signal output from the audio scene analysis unit; and an audio scene encoding unit configured to encode the audio scene information.
Multi-channel audio upmixer
An audio processor may receive audio input channels including stereo channels and one or more surround channels. The audio processor may downmix the audio input channels into stereo output channels; developing the stereo output channels into upmixed audio channels including at least one additional surround channel not present in the audio input channels; delay the audio input channels into delayed audio channels that are time-aligned with the upmixed audio channels; and mix the delayed audio channels and the upmixed audio channels into audio output channels.
METHOD AND DEVICE FOR GENERATING AND PLAYING BACK AUDIO SIGNAL
A method for generating audio according to an embodiment of the present invention, for solving the above technical problem, comprises the steps of: receiving an audio signal through at least one mic; generating an input channel signal respectively corresponding to the at least one mic; generating a virtual input channel signal based on the input channel signal; generating additional information including playback positions of the input channel signal and the virtual input channel signal; and transmitting a multichannel audio signal including the input channel signal and the virtual input channel signal, and the additional information.
Apparatus and method for efficient object metadata coding
An apparatus for generating one or more audio channels is provided. The apparatus includes a metadata decoder for receiving one or more compressed metadata signals. Each of the one or more compressed metadata signals includes a plurality of first metadata samples. The metadata decoder is configured to generate one or more reconstructed metadata signals and to generate each of the second metadata samples of each reconstructed metadata signal of the one or more reconstructed metadata signals depending on at least two of the first metadata samples of the reconstructed metadata signal. The apparatus includes an audio channel generator for generating the one or more audio channels depending on the one or more audio object signals and depending on the one or more reconstructed metadata signals. An apparatus for generating encoded audio information including one or more encoded audio signals and one or more compressed metadata signals is provided.
Audio calibration and adjustment
The subject disclosure is directed towards calibrating sound pressure levels of speakers to determine desired attenuation data for use in later playback. A user may be guided to a calibration location to place a microphone, and each speaker is calibrated to output a desired sound pressure level in its current acoustic environment based upon the attenuation data learned during calibration. During playback, the attenuation data is used. Also described is testing the setup of the speakers, and dynamically adjusting the attenuation data in real time based upon tracking the listener's current location.
METHOD AND APPARATUS FOR AN ULTRASONIC EMITTER SYSTEM FLOOR AUDIO UNIT
Methods and systems are provided for audio devices with enhanced directional operations. In a system that includes one or more audio output components, positioning information associated with at least a part of a user's body maybe determined, the positioning information including at least orientation related information. The one or more audio output components may be controlled based on the determined positioning information, with the determined positioning information corresponding to a positional nature of the user within a three-dimensional space around the user, and where the controlling includes providing a three-dimensional audio environment according to the user.
METHOD AND DEVICE FOR RENDERING ACOUSTIC SIGNAL, AND COMPUTER-READABLE RECORDING MEDIUM
When a channel signal, such as a 22.2 channel signal, is rendered into a 5.1 channel signal, a three-dimensional (3D) audio may be reproduced by using a two-dimensional (2D) output channel, however, when an elevation angle of an input channel is different from a standard elevation angle, if elevation rendering parameters according to the standard elevation angle are used, distortion may occur in a sound image. In order to solve the aforementioned problem according to the related art and to prevent front-back confusion due to a surround output channel, an embodiment of the present invention provides a method of rendering an audio signal, the method including receiving a multichannel signal including a plurality of input channels to be converted to a plurality of output channels; adding a preset delay to a frontal height input channel so as to allow each of the plurality of output channels to provide a sound image having an elevation at a reference elevation angle; changing, based on the added delay, an elevation rendering parameter with respect to the frontal height input channel; and preventing front-back confusion by generating, based on the changed elevation rendering parameter, an elevation-rendered surround output channel delayed with respect to the frontal height input channel.
Audio providing apparatus and audio providing method
An audio providing apparatus and method are provided. The audio providing apparatus includes: an object renderer configured to render an object audio signal based on geometric information regarding the object audio signal; a channel renderer configured to render an audio signal having a first channel number into an audio signal having a second channel number; and a mixer configured to mix the rendered object audio signal with the audio signal having the second channel number.
Producing a multichannel sound from stereo audio signals
The invention relates to a method for generating a multichannel audio signal (600, 700) from a stereo audio signal (100, 200), having the following steps: ascertaining a first panning coefficient (310) and a second panning coefficient (320) of the stereo audio signal; ascertaining a direct signal (515) as well as a first surroundings signal (510) and a second surroundings signal (520) from the first audio signal and the second audio signal and from the first panning coefficient and the second panning coefficient; ascertaining a plurality of repanning coefficients (410, 415, 420) from the panning coefficients, each repanning coefficient of the plurality of repanning coefficients being assigned to an audio channel (580, 585, 590) of a plurality of audio channels of the multichannel audio signal; calculating each direct signal using each of the repanning coefficients of the plurality of repanning coefficients; and converting each audio channel into a playback signal (600, 700) of the multichannel audio signal, each playback signal being provided for a respective playback device.
Extracting a common signal from multiple audio signals
Extracting a common signal from multiple audio signals may include summing a first signal and a second signal to obtain a first+second signal; subtracting the second signal from the first signal to obtain a first−second signal; transforming the first+second signal and the first−second signal to frequency domain representations; calculating absolute value of the frequency domain representations of the first+second signal and the first−second signal; subtracting the absolute value of the frequency domain representation of the first−second signal from the absolute value of the frequency domain representation of the first+second signal to obtain a difference signal; multiplying the difference signal by the frequency domain representation of the first+second signal to obtain a product signal; dividing the product signal by the absolute value of the frequency domain representation of the first+second signal to obtain a frequency domain representation of the common signal; and transforming the frequency domain representation to the common signal.