G06F3/162

APPARATUS AND METHOD FOR ENCODING A PLURALITY OF AUDIO OBJECTS USING DIRECTION INFORMATION DURING A DOWNMIXING OR APPARATUS AND METHOD FOR DECODING USING AN OPTIMIZED COVARIANCE SYNTHESIS

An apparatus for encoding a plurality of audio objects and related metadata indicating direction information on the plurality of audio objects has: a downmixer for downmixing the plurality of audio objects to obtain one or more transport channels; a transport channel encoder for encoding the one or more transport channels to obtain one or more encoded transport channels; and an output interface for outputting an encoded audio signal comprising the one or more encoded transport channels, wherein the downmixer is configured to downmix the plurality of audio objects in response to the direction information on the plurality of audio objects.
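A direction-dependent downmix can be illustrated with a minimal sketch. The abstract does not say how the direction information shapes the downmix, so the virtual cardioid microphones, the azimuth-only direction model, and the names `cardioid_gain` and `downmix` are all assumptions made for illustration.

```python
import math


def cardioid_gain(object_azimuth_deg, mic_azimuth_deg):
    """Gain of a hypothetical virtual cardioid aimed at mic_azimuth_deg
    for a source located at object_azimuth_deg (1.0 on axis, 0.0 behind)."""
    delta = math.radians(object_azimuth_deg - mic_azimuth_deg)
    return 0.5 * (1.0 + math.cos(delta))


def downmix(objects, azimuths_deg, mic_azimuths_deg=(90.0, -90.0)):
    """Mix N object signals (equal-length sample lists) into one transport
    channel per virtual microphone, weighting each object by its direction."""
    num_samples = len(objects[0])
    channels = []
    for mic_az in mic_azimuths_deg:
        gains = [cardioid_gain(az, mic_az) for az in azimuths_deg]
        channels.append([
            sum(g * obj[n] for g, obj in zip(gains, objects))
            for n in range(num_samples)
        ])
    return channels


# Two objects: one hard left (+90 degrees), one hard right (-90 degrees).
objects = [[1.0, 1.0], [0.5, -0.5]]
azimuths = [90.0, -90.0]
left, right = downmix(objects, azimuths)
```

In this sketch each object lands mostly in the transport channel whose virtual microphone faces it, which is one simple way a downmix can depend on object direction.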

AUDIO SCENE CHANGE SIGNALING

There is disclosed inter alia a method for rendering a virtual reality audio scene comprising: receiving information defining a limited area audio scene within the virtual reality audio scene (301), wherein the limited area audio scene defines a sub space of the virtual reality audio scene (304), wherein the information defines the limited area audio scene by defining an extent a user can move within the virtual audio scene; determining if the movement of the user within the limited area audio scene meets a condition of an audio scene change (302); and processing the audio scene change when the movement of the user within the limited area audio scene meets the condition of an audio scene change (306).
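The check described above can be sketched as follows. The axis-aligned box model of the limited area, the idea of a trigger region as the scene change condition, and the names `Box`, `SceneChange`, and `check_scene_change` are hypothetical; the abstract only states that a condition is evaluated against the user's movement inside the limited area.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Axis-aligned region given by its minimum and maximum corners."""
    min_corner: tuple
    max_corner: tuple

    def contains(self, point):
        return all(lo <= x <= hi
                   for lo, x, hi in zip(self.min_corner, point, self.max_corner))


@dataclass(frozen=True)
class SceneChange:
    """Assumed condition: the user enters `trigger`, which selects `target_scene`."""
    trigger: Box
    target_scene: str


def check_scene_change(position, limited_area, changes):
    """Return the target scene if the user, while inside the limited area,
    meets one of the scene change conditions; otherwise return None."""
    if not limited_area.contains(position):
        return None
    for change in changes:
        if change.trigger.contains(position):
            return change.target_scene
    return None


area = Box((0.0, 0.0, 0.0), (10.0, 10.0, 3.0))
changes = [SceneChange(Box((8.0, 0.0, 0.0), (10.0, 10.0, 3.0)), "scene_b")]
```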

ACOUSTIC OUTPUT APPARATUS

The present disclosure describes an acoustic output apparatus including at least one acoustic driver, a controller, and a supporting structure. The at least one acoustic driver may be configured to output sounds through at least two sound guiding holes. The at least two sound guiding holes may include a first sound guiding hole and a second sound guiding hole. The controller may be configured to control a phase and an amplitude of the sounds generated by the at least one acoustic driver using a control signal such that the sounds output by the at least one acoustic driver through the first and second sound guiding holes have opposite phases. The supporting structure may be provided with a baffle and configured to support the at least one acoustic driver such that the first and second sound guiding holes are located on both sides of the baffle.

SYSTEM AND METHOD FOR TRANSMITTING DATA OVER A DIGITAL INTERFACE

Systems and techniques are provided to transmit data over a digital interface between a sender and a receiver. The digital interface is configured for transmitting a primary type of data as opposed to a secondary type of data. Nevertheless, systems and techniques are provided where the secondary type of data can be transmitted in the digital interface. As such, the primary and/or secondary types of data are transmitted from the sender to the receiver via the digital interface. The primary and secondary types of data may be different and/or unrelated and could be any type of data including, but not limited to, audio data, general data, and bulk data. Yet, the received primary and secondary types of data are still useful after the transmission.

AUDIO PROCESSING APPARATUS AND AUDIO PROCESSING METHOD FOR DYNAMICALLY ADJUSTING AUDIO CLOCK
20230236790 · 2023-07-27

An audio processing apparatus and an audio processing method for dynamically adjusting an audio clock are provided. The audio processing apparatus includes a first interface, a buffer, a clock generator, a processor, and a second interface. The first interface receives audio data from a host. The buffer stores the audio data to generate a first audio packet and determines relationships between a data volume of the first audio packet and a first upper threshold and a first lower threshold. The second interface outputs the first audio packet and a clock signal to a codec apparatus. In response to the data volume of the first audio packet being less than the first lower threshold, the buffer outputs an underflow interrupt signal. In response to the data volume of the first audio packet being greater than the first upper threshold, the buffer outputs an overflow interrupt signal.
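The threshold comparison can be sketched as below. The class `ThresholdBuffer`, the string status values standing in for interrupt signals, and the `adjust_clock` policy (slow the output clock on underflow, speed it up on overflow) are assumptions; the abstract does not state how the clock is adjusted in response to the interrupts.

```python
from collections import deque


class ThresholdBuffer:
    """Hypothetical buffer that reports its fill level against two thresholds."""

    def __init__(self, lower, upper):
        if lower >= upper:
            raise ValueError("lower threshold must be below upper threshold")
        self.lower = lower
        self.upper = upper
        self._data = deque()

    def write(self, samples):
        self._data.extend(samples)

    def read(self, count):
        n = min(count, len(self._data))
        return [self._data.popleft() for _ in range(n)]

    def status(self):
        """Return 'underflow', 'overflow', or 'ok' for the current data volume."""
        level = len(self._data)
        if level < self.lower:
            return "underflow"
        if level > self.upper:
            return "overflow"
        return "ok"


def adjust_clock(rate_hz, status, step_hz=1):
    """Assumed policy: drain more slowly on underflow, faster on overflow."""
    if status == "underflow":
        return rate_hz - step_hz
    if status == "overflow":
        return rate_hz + step_hz
    return rate_hz


buf = ThresholdBuffer(lower=2, upper=4)
```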

Methods, systems, and media for providing information relating to detected events
11567726 · 2023-01-31

Methods, systems, and media for providing information are provided. In some implementations, a method for providing information is provided, the method comprising: associating a first recording device of a group of recording devices located in an environment of a user with a trigger term; receiving, from a user device, a query that includes the trigger term; in response to receiving the query, determining that audio data is to be transmitted from at least one recording device from the group of recording devices in the environment of the user; identifying the first recording device based on the inclusion of the trigger term in the received query; receiving the audio data from the first recording device; identifying a characteristic of an animate object in the environment of the user based on the received audio data; and presenting information indicating the characteristic of the animate object on the user device.

Dynamically preventing audio artifacts
11567728 · 2023-01-31

The disclosure is directed to a process that can predict and prevent an audio artifact from occurring. The process can monitor the systems, processes, and execution threads on a larger system/device, such as a mobile or in-vehicle device. Using a learning algorithm, such as a deep neural network (DNN), the process can use the collected information to generate a prediction of whether an audio artifact is likely to occur. The process can use a second learning algorithm, which also can be a DNN, to generate recommended system adjustments that can attempt to prevent the audio artifact from occurring. The recommendations can be for various systems and components on the device, such as changing the processing system frequency, the memory frequency, and the audio buffer size. After the audio artifact has been prevented, the system adjustments can be reversed fully or in steps to return the system to its state prior to the system adjustments.
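The final step, reversing the adjustments in steps, can be sketched as follows. The linear interpolation, the function name `rollback_steps`, and the setting keys are assumptions for illustration; the abstract does not specify how the stepwise reversal is scheduled, and the prediction and recommendation models are not modeled here.

```python
def rollback_steps(original, adjusted, steps):
    """Yield intermediate settings that move linearly from the adjusted
    values back to the original values in `steps` equal increments."""
    if steps < 1:
        raise ValueError("steps must be at least 1")
    for i in range(1, steps + 1):
        fraction = i / steps
        yield {
            key: adjusted[key] + (original[key] - adjusted[key]) * fraction
            for key in original
        }


# Hypothetical settings: the buffer was enlarged to avoid an artifact.
original = {"audio_buffer_size": 256}
adjusted = {"audio_buffer_size": 512}
plan = list(rollback_steps(original, adjusted, steps=2))
```

Passing `steps=1` corresponds to reversing the adjustments fully in one move.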

Recording medium and sound processing apparatus having library program for multiple processors
11567727 · 2023-01-31

A library program includes a first executable code that defines steps of first processing of sound processing and that is written using an instruction set for a main processor, a second executable code that defines steps of second processing of the sound processing and that is written using an instruction set for a first processor, and a third executable code that defines the steps of the second processing of the sound processing and that is written using an instruction set for a second processor. The library program, when executed by the main processor provided in the sound processing apparatus, causes the main processor to: execute the first executable code; transmit, in a case where the sound processing apparatus further includes the first processor, the second executable code to the first processor; and transmit, in a case where the sound processing apparatus further includes the second processor, the third executable code to the second processor.
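The dispatch logic can be sketched as below. The function `run_library`, the processor labels, and the `send` callback are hypothetical stand-ins; real executable code for different instruction sets is represented here by plain strings.

```python
def run_library(available_processors, send):
    """Run the first code on the main processor, then transmit the variant of
    the second processing that matches each co-processor that is present."""
    executed_on_main = ["first_code"]  # the main processor always runs this
    code_for_processor = {
        "first_processor": "second_code",  # second processing, first ISA
        "second_processor": "third_code",  # second processing, second ISA
    }
    for processor, code in code_for_processor.items():
        if processor in available_processors:
            send(processor, code)
    return executed_on_main


# Record what would be transmitted on an apparatus with only the second processor.
sent = []
result = run_library({"second_processor"}, lambda proc, code: sent.append((proc, code)))
```

Shipping both variants in one library lets the same program run on apparatuses fitted with either co-processor.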

Noise classification for event detection

In one aspect, a network microphone device includes a plurality of microphones and is configured to detect sound via one or more of the microphones. The network microphone device may capture sound data based on the detected sound in a first buffer, and capture metadata associated with the detected sound in a second buffer. The network microphone device may classify one or more noises in the detected sound and cause the network microphone device to perform an action based on the classification of the respective one or more noises.

Concurrent transmission of audio and ultrasound
11567894 · 2023-01-31

Techniques for concurrent transmission of audio and ultrasound are described. In an example, a computing device generates, in a digital domain, mixed audio data from multiple sets of audio data, each set corresponding to a different audio channel. The computing device also generates, in the digital domain, ultrasound data, and generates serial data by providing the mixed audio data and the ultrasound data as different inputs to an I2S mixing module. In an analog domain, the computing device generates an output signal based at least in part on the serial data, and sends the output signal to a speaker.
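The digital-domain steps can be sketched as follows. This is one possible reading, not the patented design: the "I2S mixing module" is modeled here as interleaving the two inputs into the two slots of each frame, and the averaging mix, 16-bit sample format, tone parameters, and function names are all assumptions.

```python
import math


def mix_channels(channels):
    """Mix equal-length audio channels by averaging them sample by sample."""
    return [sum(samples) / len(channels) for samples in zip(*channels)]


def ultrasound_tone(freq_hz, sample_rate_hz, num_samples, amplitude=0.25):
    """Generate a sine tone; freq_hz must stay below sample_rate_hz / 2."""
    return [
        amplitude * math.sin(2.0 * math.pi * freq_hz * n / sample_rate_hz)
        for n in range(num_samples)
    ]


def to_int16(value):
    """Clamp a float sample to [-1.0, 1.0] and scale it to a 16-bit integer."""
    clamped = max(-1.0, min(1.0, value))
    return int(round(clamped * 32767))


def serialize(audio, ultrasound):
    """Interleave the two inputs as [audio, ultrasound, audio, ultrasound, ...],
    mimicking the two slots of a serial audio frame."""
    frames = []
    for a, u in zip(audio, ultrasound):
        frames.append(to_int16(a))
        frames.append(to_int16(u))
    return frames


mixed = mix_channels([[1.0, 0.0], [0.0, 1.0]])
tone = ultrasound_tone(freq_hz=24000.0, sample_rate_hz=96000.0, num_samples=2)
serial = serialize(mixed, tone)
```

The conversion of the serial data to an analog output signal is outside the scope of this sketch.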