H04S3/00

WEARABLE DEVICE AND METHOD FOR CONTROLLING AUDIO OUTPUT USING MULTI DIGITAL TO ANALOG CONVERTER PATH

A wearable device is provided and includes a plurality of speakers including a first speaker, a second speaker, and an N.sup.th speaker, a plurality of digital to analog converter (DAC)s including a first DAC connected to the first speaker, a second DAC connected to the second speaker, and an N.sup.th DAC connected to the N.sup.th speaker, an audio signal processing module including N DAC output paths configured to filter an audio signal according to each frequency band and output the audio signal, a memory; and a processor electrically connected to the plurality of DACs, the audio signal processing module, and the memory, wherein the memory includes instructions causing the processor to, when the audio signal is reproduced, analyze a frequency component included in the audio signal, activate the N DAC output paths when the frequency component included in the audio signal has a full band range, activate only a DAC output path for processing a specific frequency band among the N DAC output paths when the frequency component included in the audio signal has only the specific frequency band, and output the audio signal through a speaker connected to the activated DAC output path.

Parametric joint-coding of audio sources

The following coding scenario is addressed: A number of audio source signals need to be transmitted or stored for the purpose of mixing wave field synthesis, multi-channel surround, or stereo signals after decoding the source signals. The proposed technique offers significant coding gain when jointly coding the source signals, compared to separately coding them, even when no redundancy is present between the source signals. This is possible by considering statistical properties of the source signals, the properties of mixing techniques, and spatial hearing. The sum of the source signals is transmitted plus the statistical properties of the source signals, which mostly determine the perceptually important spatial cues of the final mixed audio channels. Source signals are recovered at the receiver such that their statistical properties approximate the corresponding properties of the original source signals. Subjective evaluations indicate that high audio quality is achieved by the proposed scheme.

Parametric joint-coding of audio sources

The following coding scenario is addressed: A number of audio source signals need to be transmitted or stored for the purpose of mixing wave field synthesis, multi-channel surround, or stereo signals after decoding the source signals. The proposed technique offers significant coding gain when jointly coding the source signals, compared to separately coding them, even when no redundancy is present between the source signals. This is possible by considering statistical properties of the source signals, the properties of mixing techniques, and spatial hearing. The sum of the source signals is transmitted plus the statistical properties of the source signals, which mostly determine the perceptually important spatial cues of the final mixed audio channels. Source signals are recovered at the receiver such that their statistical properties approximate the corresponding properties of the original source signals. Subjective evaluations indicate that high audio quality is achieved by the proposed scheme.

NEAR-FIELD AUDIO RENDERING

Examples of the disclosure describe systems and methods for presenting an audio signal to a user of a wearable head device. According to an example method, a source location corresponding to the audio signal is identified. For each of the respective left and right ear of the user, a virtual speaker position, of a virtual speaker array, is determined, the virtual speaker position collinear with the source location and with a position of the respective ear. For each of the respective left and right ear of the user, a head-related transfer function (HRTF) corresponding to the virtual speaker position and to the respective ear is determined; and the output audio signal is presented to the respective ear of the user via one or more speakers associated with the wearable head device. Processing the audio signal includes applying the HRTF to the audio signal.

APPARATUS, METHODS AND COMPUTER PROGRAMS FOR ENABLING REPRODUCTION OF SPATIAL AUDIO SIGNALS
20230096873 · 2023-03-30 · ·

An apparatus (101) for enabling reproduction of spatial audio signals. The apparatus comprises means for obtaining (401) audio signals (501) comprising one or more channels and obtaining (403) spatial metadata (503) relating to the audio signals (501). The spatial metadata (503) comprises information that indicates how to spatially reproduce the audio signals. The apparatus also comprises means for obtaining (405) information relating to a field of view of video (505) wherein the video is for display on a display (205) of a rendering device (201) and wherein the video is associated with the audio signals (501). The apparatus also comprises means for aligning (407) spatial reproduction of the audio signals based, at least in part, on the obtained spatial metadata (503), with objects (309A, 309B) in the video according to the obtained information relating to the field of view of video; and enabling (409) reproduction of the audio signals based on the aligning (407).

Time-varying always-on compensation for tonally balanced 3D-audio rendering

A system reduces sound coloration caused by rendering of a 3D audio signal. The system renders the 3D audio signal including a plurality of channels using the input audio signal. Input spectra data defining spectral information of the input audio signal is computed. 3D spectra data defining spectral information of a single channel representation of the 3D audio signal is computed. The system generates a tonal balance filter based on the input spectral data and the 3D spectral data. The tonal balance filter, when applied to the 3D audio signal, reduces sound coloration caused by the rendering of the 3D audio signal. The tonal balance filter is applied to the 3D audio signal to generate an output audio signal and the output audio signal is presented via a speaker array.

Method and System for Surround Sound Processing in an Audio Device
20230033891 · 2023-02-02 ·

An audio headset may receive a plurality of audio signals corresponding to plurality of surround sound channels. The headset may determine, via its audio processing circuitry, context and/or content of the audio signals. The audio processing circuitry may process the audio signals to generate stereo signals carrying one or more virtual surround channels, wherein the processing comprises automatically controlling, based on the context and the content of the audio signals, a simulated acoustic environment of the virtual surround channels.

INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND NON-TRANSITORY COMPUTER READABLE MEDIUM
20230100767 · 2023-03-30 · ·

An information processing device includes a processor configured to output, in a case where a service is being used in which at least speech is exchanged among multiple users such that a conversation takes places among all of the multiple users, a speech of a separate conversation distinctly from a speech of the conversation taking place among all of the multiple users to a device of a user who is engaged in the separate conversation with a specific user from among the multiple users, and output the speech of the conversation taking place among all of the multiple users without outputting the speech of the separate conversation to a device of a user who is not engaged in the separate conversation.

SYSTEM AND METHOD FOR TRANSMITTING AT LEAST ONE MULTICHANNEL AUDIO MATRIX ACROSS A GLOBAL NETWORK
20220353628 · 2022-11-03 ·

A system and method for transmitting at least one multichannel audio matrix across a global network is shown and described. The method for transmitting at least one multichannel audio matrix across a global network begins by capturing audio from a production source. The captured audio is then converted from an analog format to a digital format. The digital audio may then be encoded using an audio codec. The audio is sent to a network and received at a second location. The second location uses a specialized computing device to ensure the audio is properly received. If the audio was encoded, it is now decoded. Once decoded if needed the audio is converted back to analog format. This will allow for the audio to be mixed on a mixing device.

Audio decoder, audio encoder, method for providing at least four audio channel signals on the basis of an encoded representation, method for providing an encoded representation on the basis of at least four audio channel signals and computer program using a bandwidth extension

An audio decoder for providing at least four bandwidth-extended channel signals on the basis of an encoded representation provides first and second downmix signals on the basis of a jointly encoded representation of the first and second downmix signals using a multi-channel decoding and provides at least first and second audio channel signals on the basis of the first downmix signal using a multi-channel decoding, and provides at least third and fourth audio channel signals on the basis of the second downmix signal using a multi-channel decoding. It performs a multi-channel bandwidth extension on the basis of the first and third audio channel signals, to obtain first and third bandwidth-extended channel signals, and performs a multi-channel bandwidth extension on the basis of the second and fourth audio channel signals, to obtain second and fourth bandwidth extended channel signals. An audio encoder uses a related concept.