H04S5/005

Audio signal decoder, audio signal encoder, method for providing an upmix signal representation, method for providing a downmix signal representation, computer program and bitstream using a common inter-object-correlation parameter value

An audio signal decoder for providing an upmix signal representation on the basis of a downmix signal representation and an object-related parametric information and in dependence on a rendering information has an object parameter determinator. The object parameter determinator is configured to obtain inter-object-correlation values for a plurality of pairs of audio objects. The object parameter determinator is configured to evaluate a bitstream signaling parameter in order to decide whether to evaluate individual inter-object-correlation bitstream parameter values to obtain inter-object-correlation values for a plurality of pairs of related audio objects, or to obtain inter-object-correlation values for a plurality of pairs of related audio objects using a common inter-object-correlation bitstream parameter value. The audio signal decoder also has a signal processor configured to obtain the upmix signal representation on the basis of the downmix signal representation and using the inter-object-correlation values for a plurality of pairs of related objects and the rendering information.

Audio reproduction system and method for reproducing audio data of at least one audio object
09807533 · 2017-10-31 · ·

An audio reproduction system for reproducing audio data of at least one audio object and/or at least one sound source of an acoustic scene in a given environment comprising: at least two audio systems acting distantly apart from each other, wherein one of the audio systems is adapted to reproduce the audio object and/or the sound source in a first distance range to a listener and another of the audio systems is adapted to reproduce the audio object and/or the sound source in a second distance range to the listener, wherein the first and second distance ranges are different and possibly spaced apart from each other or placed adjacent to each other; and a panning information provider adapted to process at least one input to generate at least one panning information for each audio system to drive the at least two audio systems.

Spatial audio processing system and method

A spatial audio processing system and method including the steps of: dividing the series of virtual speakers into a series of horizontal planes around the expected listener; rendering the audio source to an intermediate spatial format for playback over a series of virtual speakers arranged in each of the series of planes around the listener, the rendering including: an initial panning of the spatialized virtual audio source to each of the horizontal planes to produce a plane rendered audio emission; a subsequent panning of each of the plane rendered audio emissions to a series of virtual speaker locations within each plane, with the subsequent panning utilizing a series of panning curves which are spatially smoothed to can include spatial frequency components which are less than the Nyquist sampling rate of the audio source.

Encoding device and method, decoding device and method, and program

The present technique relates to an encoding device and a method, a decoding device and a method, and a program capable of obtaining higher quality audio. An encoding unit encodes position information and a gain of an object in a current frame in multiple encoding modes. A compressing unit generates, for each combination of encoding modes of each pieces of position information and gains, encoded meta data including encoding mode information indicating the encoding modes and encoded data which are the encoded position information and gains, and compresses the encoding mode information included in the encoding meta data. A determining unit selects encoded meta data of which amount of data is the least from among the encoded meta data generated for each combination, thus determining the encoding mode of each pieces of position information and gains. The present technique can be applied to an encoder and a decoder.

Production of 3D audio signals

A device which produces the necessary directional audio signals for a 3-dimensional audio playback and which in that case uses as input signals the available channels of an audio recording intended for 2-dimensional audio playback. By taking psychoacoustic effects into account the desired spatial 3D audio effect is produced by a targeted use of signal delays, frequency-dependent amplitude matchings and a limited use of reverberation effects in conjunction with a targetedly asymmetric processing.

Distributed audio system
09800972 · 2017-10-24 ·

Various embodiments manage a distributed audio system is disclosed. In one embodiment, an audio stream is received from each electronic device in a plurality of electronic devices. The audio stream is captured by at least one audio input module of the electronic device. Two or more of the audio streams are aggregated into a single audio stream. The single audio stream is outputted via at least one audio output module.

Reflected sound rendering for object-based audio

Embodiments are described for rendering spatial audio content through a system that is configured to reflect audio off of one or more surfaces of a listening environment. The system includes an array of audio drivers distributed around a room, wherein at least one driver of the array of drivers is configured to project sound waves toward one or more surfaces of the listening environment for reflection to a listening area within the listening environment and a renderer configured to receive and process audio streams and one or more metadata sets that are associated with each of the audio streams and that specify a playback location in the listening environment.

Adaptive diffuse signal generation in an upmixer

An audio processing system, such as an upmixer, may be capable of separating diffuse and non-diffuse portions of N input audio signals. The upmixer may be capable of detecting instances of transient audio signal conditions. During instances of transient audio signal conditions, the up-mixer may be capable of adding a signal-adaptive control to a diffuse signal expansion process in which M audio signals are output. The upmixer may vary the diffuse signal expansion process over time such that during instances of transient audio signal conditions the diffuse portions of audio signals may be distributed substantially only to output channels spatially close to the input channels. During instances of non-transient audio signal conditions, the diffuse portions of audio signals may be distributed in a substantially uniform manner.

Audio signal processing apparatus and audio signal processing method

An audio signal processing apparatus includes: an obtaining unit which obtains a stereo signal including an R signal and an L signal; a control unit which generates a processed R signal and a processed L signal by performing (i) a first process of convolving pairs of right- and left-ear head related transfer functions into the R signal so that a sound image of the R signal is localized at each of two or more different positions at a right side of a listener; and (ii) a second process of convolving pairs of right- and left-ear head related transfer functions into the L signal so that a sound image of the L signal is localized at each of two or more different positions at a left side of the listener; and an output unit which outputs the processed R signal and the processed L signal.

TRANSMISSION APPARATUS, TRANSMISSION METHOD, RECEPTION APPARATUS, AND RECEPTION METHOD
20170289613 · 2017-10-05 · ·

To enable multichannel audio data to be transmitted favorably. Multichannel audio data of a predetermined number of channels is acquired. The multichannel audio data has a sampling frequency corresponding to the predetermined number of channels. Audio data of the respective channels configuring the multichannel audio data are sequentially transmitted to a reception side via a predetermined transmission channel for each unit audio data. Information indicating the sampling frequency is added to the transmission audio data.