Patent classifications
H04S2400/01
Data control system for a data server and a plurality of cellular phones, a data server for the system, and a cellular phone for the system
A data control system comprises a user terminal such as a cellular phone, or an assist appliance, or a combination thereof, and a server in communication with the user terminal. The user terminal acquires the name of a person and an identification data of the person for storage as a reference on an opportunity of the first meeting with the person, and acquires the identification data of the person on an opportunity of meeting again to inform the name of the person with visual and/or audio display if the identification data is in consistency with the stored reference. The reference is transmitted to a server which allows another person to receive the reference on the condition that the same person has given a self-introduction both to a user of the user terminal and the another person to keep privacy of the same person against unknown persons.
Systems and methods for modifying room characteristics for spatial audio rendering over headphones
An audio rendering system includes a processor that combines audio input signals with personalized spatial audio transfer functions having room responses. The personalized spatial audio transfer functions are selected from a database having a plurality of candidate transfer functions derived from in-ear microphone measurements for a plurality of individuals. Alternatively, the personalized transfer functions are derived from actual in-ear measurements of the listener. A room modification module allows the user to modify the personalized spatial audio transfer functions to substitute a different room or to modify the characteristics of the selected room without requiring additional in ear measurements. The module segments the selected transfer function into regions including one or more of direct; head and torso influenced; early reflection, and late reverberation regions. Extraction and modification operations are performed on one or more of the regions to alter the perceived sound.
Detection of audio panning and synthesis of 3D audio from limited-channel surround sound
A method includes receiving a multi-channel audio signal (101) including multiple input audio channels (102, 104, 106, 108) that are configured to play audio from multiple respective locations relative to a listener. One or more spectral components that undergo a panning effect (1001, 1002, 1003) are identified in the multi-channel audio signal among at least some of the input audio channels. One or more virtual channels (1100, 1200, 1300) are generated, which together with the input audio channels form an extended set (111) of audio channels that retain the identified panning effect. A reduced set (222) of output audio signals, fewer in number than the input audio signals, is generated from the extended set, including recreating the panning effect in the output audio signals. The reduced set of output audio signals is outputted to a user.
Mapping virtual sound sources to physical speakers in extended reality applications
One or more embodiments include an audio processing system for generating an audio scene for an extended reality (XR) environment. The audio processing system determines that a first virtual sound source associated with the XR environment affects a sound in the audio scene. The audio processing system generates a sound component associated with the first virtual sound source based on a contribution of the first virtual sound source to the audio scene. The audio processing system maps the sound component to a first loudspeaker included in a plurality of loudspeakers. The audio processing system outputs at least a first portion of the component for playback on the first loudspeaker.
MULTI-DEVICE AUDIO ADJUSTMENT COORDINATION
This relates to intelligent automated assistants and, more specifically, to the intelligent coordination of audio signal output adjustments among multiple electronic devices. An example method includes, generating a local audio intent object associated with a software application stored on a first electronic device, the local audio intent object including one or more local audio parameters; determining that a second electronic device that is outputting an audio signal is proximate to the first electronic device; generating a proximate audio intent object corresponding to the second electronic device based on the one or more local audio adjustment parameters and a round-trip time (RTT) of a communication connection between the first electronic device and the second electronic device; and transmitting the proximate audio intent object to the second electronic device via the communication connection, wherein the proximate audio intent object causes the second electronic device to adjust the output of the audio signal.
Determining corrections to be applied to a multichannel audio signal, associated coding and decoding
A method and device for determining a set of corrections to be made to a multichannel sound signal, in which the set of corrections is determined on the basis of an item of information representative of a spatial image of an original multichannel signal and an item of information representative of a spatial image of the original multichannel signal that has been coded and then decoded.
DYNAMICS PROCESSING ACROSS DEVICES WITH DIFFERING PLAYBACK CAPABILITIES
Individual loudspeaker dynamics processing configuration data, for each of a plurality of loudspeakers of a listening environment, may be obtained. Listening environment dynamics processing configuration data may be determined, based on the individual loudspeaker dynamics processing configuration data. Dynamics processing may be performed on received audio data based on the listening environment dynamics processing configuration data, to generate processed audio data. The processed audio data may be rendered for reproduction via a set of loudspeakers that includes at least some of the plurality of loudspeakers, to produce rendered audio signals. The rendered audio signals may be provided to, and reproduced by, the set of loudspeakers.
METHOD AND APPARATUS FOR RENDERING VOLUME SOUND SOURCE
A method and apparatus for rendering a volume sound source are disclosed. The method of rendering a volume sound source may include identifying information about a listener and information about the volume sound source, determining a corresponding area in which a source element is disposed in the volume sound source in consideration of the information about the listener, determining an angle between the listener and the corresponding area based on the information about the listener and the information about the volume sound source, determining a number of source elements disposed in the corresponding area according to the angle, determining a position and a gain of the source element using i) the number of source elements and ii) a distance between the listener and the volume sound source, and rendering the volume sound source according to the position and the gain of the source element.
SYSTEMS AND METHODS FOR GENERATING VIDEO-ADAPTED SURROUND-SOUND
Audiovisual presentations, such as film recordings, may have been originally created having an audio soundtrack with multiple audio tracks mixed for a surround sound system that includes a set of speakers physically surrounding a user. The present disclosure presents systems and methods to remix these soundtracks into 3D audio that when presented to the ears of a user can be perceived as a virtual surround sound system that mimics the physical system. What is more, the disclosed systems and methods can enhance the virtual surround sound system by adjusting virtual speakers of the virtual surround sound system according to video content of the audiovisual presentation. Further enhancement may be possible by adjusting the virtual speakers of the virtual surround sound system according to a sensed position of a user.
SYSTEM AND METHOD FOR ADAPTIVE AUDIO SIGNAL GENERATION, CODING AND RENDERING
Embodiments are described for an adaptive audio system that processes audio data comprising a number of independent monophonic audio streams. One or more of the streams has associated with it metadata that specifies whether the stream is a channel-based or object-based stream. Channel-based streams have rendering information encoded by means of channel name; and the object-based streams have location information encoded through location expressions encoded in the associated metadata. A codec packages the independent audio streams into a single serial bitstream that contains all of the audio data. This configuration allows for the sound to be rendered according to an allocentric frame of reference, in which the rendering location of a sound is based on the characteristics of the playback environment (e.g., room size, shape, etc.) to correspond to the mixer's intent. The object position metadata contains the appropriate allocentric frame of reference information required to play the sound correctly using the available speaker positions in a room that is set up to play the adaptive audio content.