Patent classifications
H04S5/005
Transmission apparatus, transmission method, reception apparatus, and reception method
To enable multichannel audio data to be transmitted favorably. Multichannel audio data of a predetermined number of channels is acquired. The multichannel audio data has a sampling frequency corresponding to the predetermined number of channels. Audio data of the respective channels configuring the multichannel audio data are sequentially transmitted to a reception side via a predetermined transmission channel for each unit audio data. Information indicating the sampling frequency is added to the transmission audio data.
Electronic apparatus, control method thereof and computer program product using the same
An electronic apparatus includes a memory configured to store instructions; and a processor configured to execute the instructions to: separate an input audio signal into a plurality of channel signals; identify a gain corresponding to a feature difference between a first channel signal, from among the plurality of channel signals, and a second channel signal, from among the plurality of channel signals; and adjust relative ratios among a plurality of output signals according to the identified gain to generate an output audio signal in which a sound image is varied.
Combination of immersive and binaural sound
The present subject matter provides a technical solution to the technical problems facing sound localization by separating sounds and reproducing the separated sounds using a set of loudspeakers and a set of headphones. A general soundtrack that is meant to be experienced throughout the room would play through the loudspeakers, and specific sounds that are meant to be experienced near the listener would be played through a binaural representation in the headphones. The headphones may be selected to avoid occluding the ear, allowing sound produced at the loudspeakers to be heard clearly. This separation and reproduction of sounds using a combination of a loudspeaker and headphone provides a technical solution to the technical problem facing typical surround sound systems by localizing sounds for listeners in any location within a room. This improves reproduction accuracy of location-specific audio objects, including audio objects above or below a coplanar speaker configuration.
Location information through directional sound provided by mobile computing device
A mobile computing device that provides location information through directional sound is described herein. The mobile computing device includes a location detection system that provides location signals corresponding to a user location and a destination location, such as a vehicle location or a vertex of a predefined travel route, to a spatial audio generation system to define a spatial audio signal based on a direction from the user location to the destination location. The spatial audio signal is provided to an audio device of the mobile computing device that outputs the spatial audio signal as directional sound having a locus at the destination location.
Method and system for producing binaural immersive audio for audio-visual content
A method for producing binaural immersive audio for audio-visual content. The method includes: receiving audio-visual content including video and audio; identifying audio-producing object(s) represented in given frame of video; analysing audio corresponding to given frame using first process (FP) to extract first audio track candidate; analysing audio corresponding to given frame using second process (SP) to extract second audio track candidate; determining first and second time-based match scores for first and second audio track candidates, respectively, based on match between given audio track candidate and audio-producing object(s) in given frame; selecting, for given frame of the video, at least one of first audio track candidate and second audio track candidate as playback audio based on first and second time-based match scores; and processing playback audio to produce the binaural immersive audio for the given frame of the video in the audio-visual content.
Mapping Audio To Visual Images on a Display Device Having a Curved Screen
Disclosed herein are system, method, and computer program product embodiments for accurately mapping audio to a visual image on a display device. An embodiment operates by, based on an identified virtual representation of the remote venue, modifying audio received at a local venue to produce a first and second modified audio. Thereafter, the first and second modified audio is correlated to a first and second portion of a prestored image. Afterward, the first and second portion of the prestored image is presented on a first and second portion of the display device's curved screen, and the first and second modified audio is provided on a first and second audio device, respectively, such that the first and second modified audio is provided to the first and second regions of the local venue synchronously with the first and second portions of audio.
METHOD AND APPARATUS FOR AN ULTRASONIC EMITTER SYSTEM FLOOR AUDIO UNIT
Methods and systems are provided for audio devices with enhanced directional operations. A user of an audio system may be prompted to provide an audio input. Once received, the audio input may be processed, and based on processing of the audio input positioning related information associated with the user may be generated. Outputting of an audio output in the audio system may then be controlled based on the positioning related information. Controlling the outputting of the audio output may include configuring at least one audio output related parameter or function for optimizing directionality of the audio output based on the positioning related information.
Apparatus and method for providing enhanced guided downmix capabilities for 3D audio
An apparatus for downmixing three or more audio input channels to obtain two or more audio output channels is provided. The apparatus includes a receiving interface for receiving the three or more audio input channels and for receiving side information. Moreover, the apparatus includes a downmixer for downmixing the three or more audio input channels depending on the side information to obtain the two or more audio output channels. The number of the audio output channels is smaller than the number of the audio input channels. The side information indicates a characteristic of at least one of the three or more audio input channels, or a characteristic of one or more sound waves recorded within the one or more audio input channels, or a characteristic of one or more sound sources which emitted one or more sound waves recorded within the one or more audio input channels.
Audio Processing in Adaptive Intermediate Spatial Format
Systems, methods, and computer program products of audio processing based on Adaptive Intermediate Spatial Format (AISF) are described. The AISF is an extension to ISF that allows spatial resolution around an ISF ring to be adjusted dynamically with respect to content of incoming audio objects. An AISF encoder device adaptively warps each ISF ring during ISF encoding to adjust angular distance between objects, resulting in increase in uniformity of energy distribution around the ISF ring. At an AISF decoder device, matrices that decode sound positions to the output speaker take into account the warping that was performed at the AISF encoder device to reproduce the true positions of sound sources.
METHOD FOR TRANSMITTING AND RECEIVING AUDIO DATA AND APPARATUS THEREFOR
A method for transmitting audio data performed by an audio data transmission apparatus in accordance with the present invention comprises the steps of: generating playback environment information of three-dimensional audio content; encoding a three-dimensional audio signal of the three-dimensional audio content; and transmitting, to an audio data reception apparatus, the encoded three-dimensional audio signal of the three-dimensional audio content and the generated playback environment information, wherein the playback environment information includes environment information of a room in which the three-dimensional audio content is played.