H04S3/008

DISPLAY APPARATUS AND CONTROLLING METHOD THEREOF
20230230538 · 2023-07-20 ·

A display apparatus is provided. The display apparatus includes a display panel comprising a plurality of pixels, a driver configured to drive the display panel, and at least one processor. The at least one processor may, based on receiving content comprising video content and audio content, obtain sound location information based on multi-channel information included in the audio content, identify one area of the video content corresponding to the obtained sound location information, and control the driver to adjust brightness of pixels included in the identified one area.

ENHANCEMENT OF SPATIAL AUDIO SIGNALS BY MODULATED DECORRELATION

Some methods involve receiving an input audio signal that includes N input audio channels, the input audio signal representing a first soundfield format having a first soundfield format resolution, N being an integer ≥2. A first decorrelation process may be applied to two or more of the input audio channels to produce a first set of decorrelated channels, the first decorrelation process maintaining an inter-channel correlation of the set of input audio channels. A first modulation process may be applied to the first set of decorrelated channels to produce a first set of decorrelated and modulated output channels. The first set of decorrelated and modulated output channels may be combined with two or more undecorrelated output channels to produce an output audio signal that includes O output audio channels representing a second and relatively higher-resolution soundfield format than the first soundfield format, O being an integer ≥3.

TRANSMISSION DEVICE, TRANSMISSION METHOD, RECEPTION DEVICE, AND RECEPTION METHOD
20230230601 · 2023-07-20 · ·

A processing load at a receiving side is reduced in a case where a plurality of classes of audio data is transmitted. A predetermined number of audio streams including coded data of a plurality of groups is generated and a container of a predetermined format having this predetermined number of audio streams is transmitted. Command information for creating a command specifying a group to be decoded from among the plurality of groups is inserted into the container and/or the audio stream. For example, a command insertion area for the receiving side to insert a command for specifying a group to be decoded is provided in at least one audio stream among the predetermined number of audio streams.

Audio output apparatus and method of controlling thereof

An audio output apparatus is disclosed. The audio output apparatus that outputs a multi-channel audio signal through a plurality of speakers disposed at different locations, the audio output apparatus includes an input interface, and a processor configured to, based on the multi-channel audio signal input through the inputter being received, obtain scene information on a type of audio included in the multi-channel audio signal and sound image angle information about an angle formed by sound image of the type of audio included in the multi-channel audio signal based on a virtual user, and generate an output signal to be output through the plurality of speakers from the multi-channel audio signal based on the obtained scene information and sound image angle information, wherein the type of audio includes at least one of sound effect, shouting sound, music, and voice, and a number of the plurality of speakers is equal to or greater than a number of channels of the multi-channel audio signal.

Efficient coding of audio scenes comprising audio objects

There is provided encoding and decoding methods for encoding and decoding of object based audio. An exemplary encoding method includes inter alia calculating M downmix signals by forming combinations of N audio objects, wherein M≤N, and calculating parameters which allow reconstruction of a set of audio objects formed on basis of the N audio objects from the M downmix signals. The calculation of the M downmix signals is made according to a criterion which is independent of any loudspeaker configuration.

Calibrating listening devices

Techniques for calibrating listening devices are disclosed herein. The techniques include emitting a predetermined audio signal using an outward-facing transducer located on a first portion of a head-mounted device worn by the user, receiving the predetermined audio signal at a microphone located on a second portion of the head-mounted device, the second portion being different from the first portion, determining a transfer function for the user based on the received predetermined audio signal, and applying the transfer function to audio signals transmitted to the user.

Display apparatus

A display apparatus is capable of outputting a stereo sound. The display apparatus includes a display panel configured to display an image; a sound generating device on a rear surface of the display panel; a rear cover on the rear surface of the display panel and configured to support the sound generating device; a partition member between the rear surface of the display panel and the rear cover and configured to divide the display panel into first, second, third, fourth and fifth areas; and first, second, third, fourth, and fifth sound generating devices attached to the rear surface of the display panel and configured to vibrate the display panel. The first, second, third, fourth and fifth sound generating devices are in the first, second, third, fourth and fifth areas, respectively.

Audio Generation Methods and System

A method of generating audio assets, comprising the steps of: receiving an input multi-layered audio asset comprising a plurality of audio layers, generating an input multi-channel image, wherein each channel of the input multi-channel image comprises an input image representative of one of the audio layers, training a generative model on the input multi-channel image and implementing the trained generative model to generate an output multi-channel image, wherein each channel of the output multi-channel image comprises an output image representative of an output audio layer, and generating an output multi-layered audio asset based on a combination of output audio layers derived from the output images.

SPATIALIZED AUDIO CHAT IN A VIRTUAL METAVERSE

Implementations described herein relate to methods, systems, and computer-readable media to provide spatialized audio in virtual experiences. The spatialized audio may be used in voice communications such as, for example, voice and/or video chats. The chats may include spatialized audio that is combined at a client device, or at an online experience platform, and is targeted to a particular user. Individual audio streams may be collected from a plurality of avatars and other objects, and combined based on the target user. The audio may also include background and/or ambient sounds to provide a rich, immersive audio stream in virtual experiences.

METHOD AND SYSTEM FOR ARTIFICIAL REVERBERATION EMPLOYING REVERBERATION IMPULSE RESPONSE SYNTHESIS

The present embodiments relate to audio effect processing, and more particularly to a method for creating reverberation impulse responses from prerecorded or live source materials forms the basis of a family of reverberation effects. In one embodiment, segments of audio are selected and processed to form an evolving sequence of reverberation impulse responses that are applied to the original source material—that is, an audio stream reverberating itself. In another embodiment, impulse responses derived from one audio track are applied to another audio track. In a further embodiment, reverberation impulse responses are formed by summing randomly selected segments of the source audio, and imposing reverberation characteristics, including reverberation time, wet equalization, wet-dry mix, and predelay. By controlling the number and timing of the selected source audio segments, the method produces a collection of impulse responses that represent a trajectory through the source material. In so doing, the evolving impulse responses will have the character of room reverberation while also expressing the changing timbre and dynamics of the source audio.