G10L19/0216

Trusted listening

Devices and methods are disclosed for trusted listening. In some examples, an apparatus can include an audio receiving device having a microphone configured to capture sound and produce an audio signal, a processing unit configured to add a trusted signature to the audio signal, and an output configured to provide the audio signal. Further, a method of trusted listening can receive a first audio signal representing a real-time sound, generate a trusted signature in an audible format, and produce a second audio signal including the trusted signature.

Deep learning segmentation of audio using magnitude spectrogram
11837245 · 2023-12-05 · ·

A method, system, and computer readable medium for decomposing an audio signal into different isolated sources. The techniques and mechanisms convert an audio signal into K input spectrogram fragments. The fragments are sent into a deep neural network to isolate for different sources. The isolated fragments are then combined to form full isolated source audio signals.

TRUSTED LISTENING

Devices and methods are disclosed for trusted listening. In some examples, an apparatus can include an audio receiving device having a microphone configured to capture sound and produce an audio signal, a processing unit configured to add a trusted signature to the audio signal, and an output configured to provide the audio signal. Further, a method of trusted listening can receive a first audio signal representing a real-time sound, generate a trusted signature in an audible format, and produce a second audio signal including the trusted signature.

DEEP LEARNING SEGMENTATION OF AUDIO USING MAGNITUDE SPECTROGRAM
20210035590 · 2021-02-04 · ·

A method, system, and computer readable medium for decomposing an audio signal into different isolated sources. The techniques and mechanisms convert an audio signal into K input spectrogram fragments. The fragments are sent into a deep neural network to isolate for different sources. The isolated fragments are then combined to form full isolated source audio signals.

INFORMATION EXCHANGE ON MOBILE DEVICES USING AUDIO
20240005934 · 2024-01-04 ·

In some implementations, a user device may receive input that triggers transmission of information via sound. The user device may select an audio clip based on a setting associated with the device, and may modify a digital representation of the selected audio clip using an encoding algorithm and based on data associated with a user of the device. The user device may transmit, to a remote server, an indication of the selected audio clip, an indication of the encoding algorithm, and the data associated with the user. The user device may use a speaker to play audio, based on the modified digital representation, for recording by other devices. Accordingly, the user device may receive, from the remote server and based on the speaker playing the audio, a confirmation that users associated with the other devices have performed an action based on the data associated with the user of the device.

System, apparatus and method for time synchronization of delayed data streams by matching of wavelet coefficients

In one example, an apparatus includes: a wavelet transform engine to receive a first signal stream and perform a wavelet transform on a first time domain sample of the first signal stream, the first wavelet transform engine to output at least one first coefficient for a first frequency range; an energy calculation circuit to compute a first energy signature for the at least one first coefficient; and a correlation circuit to generate a correlation value using the first energy signature, a second energy signature and a plurality of previous energy signatures.

Low latency audio stream acceleration by selectively dropping and blending audio blocks

A method and device for accelerated audio processing in a streaming environment. The method comprises receiving a streaming audio asset, locating a position to ignore processing of an audio block of the streaming audio asset, ignoring the audio block, compensating for the ignored audio block and playing the compensated audio on an audio device.

System, Apparatus And Method For Time Synchronization Of Delayed Data Streams By Matching Of Wavelet Coefficients
20200013420 · 2020-01-09 ·

In one example, an apparatus includes: a wavelet transform engine to receive a first signal stream and perform a wavelet transform on a first time domain sample of the first signal stream, the first wavelet transform engine to output at least one first coefficient for a first frequency range; an energy calculation circuit to compute a first energy signature for the at least one first coefficient; and a correlation circuit to generate a correlation value using the first energy signature, a second energy signature and a plurality of previous energy signatures.

Method and system for multi-talker babble noise reduction

A system and method for improving intelligibility of speech is provided. The system and method may include obtaining an input audio signal frame, classifying the input audio signal frame into a first category or a second category, wherein the first category corresponds to the noise being stronger than the speech signal, and the second category corresponds to the speech signal being stronger than the noise, decomposing the input audio signal frame into a plurality of sub-band components; de-noising each sub-band component of the input audio signal frame in parallel by applying a first wavelet de-noising method including a first wavelet transform and a predetermined threshold for the sub-band component, and a second wavelet de-noising method including a second wavelet transform and the predetermined threshold for the sub-band component, wherein the predetermined threshold for each sub-band component is based on at least one previous noise-dominant signal frame received by the receiving arrangement.

Method and apparatus for detecting valid voice signal and non-transitory computer readable storage medium

A method and apparatus for detecting a valid voice signal and a non-transitory computer readable storage medium are provided. A first audio signal including at least one audio frame signal is obtained. Multiple wavelet decomposition signals respectively corresponding to the at least one audio frame signal are obtained. A wavelet signal sequence is obtained by combining the multiple wavelet decomposition signals. A maximum value and a minimum value among audio intensity values of all sample points are obtained, and a first audio intensity threshold is determined according to the maximum value and the minimum value. Sample points each having an audio intensity value greater than the first audio intensity threshold in the wavelet signal sequence are obtained, and a signal of sample points in the first audio signal corresponding to the sample points each having an audio intensity value greater than the first audio intensity threshold is determined as the valid voice signal.