G10L21/0316

Audio processing device and associated audio processing method
11545168 · 2023-01-03 · ·

An audio processing device is disclosed. The audio processing device includes a filter and an output circuit. The filter is configured to receive an audio signal to generate a filtered audio signal, wherein the filter includes a plurality of parameters that are adjustable for changing a bandwidth, a center frequency or a gain of response of the filter. The output circuit is configured to receive the filtered audio signal to generate an output audio signal to a speaker. When the parameters of the filter are changed, the filter reduces changes in the audio signal caused by the parameters, and the output circuit continuously receives the filtered audio signal to generate the output audio signal for the speaker to play without interruption.

Audio processing device and associated audio processing method
11545168 · 2023-01-03 · ·

An audio processing device is disclosed. The audio processing device includes a filter and an output circuit. The filter is configured to receive an audio signal to generate a filtered audio signal, wherein the filter includes a plurality of parameters that are adjustable for changing a bandwidth, a center frequency or a gain of response of the filter. The output circuit is configured to receive the filtered audio signal to generate an output audio signal to a speaker. When the parameters of the filter are changed, the filter reduces changes in the audio signal caused by the parameters, and the output circuit continuously receives the filtered audio signal to generate the output audio signal for the speaker to play without interruption.

DATA PROCESSING METHOD AND APPARATUS, DEVICE, AND READABLE STORAGE MEDIUM

A data processing method includes acquiring video frame data including one or more video frames and audio data of a video, and determining position attribute information of a target object in the acquired one or more video frames, the target object being associated with the audio data. The method also includes acquiring a channel encoding parameter associated with the position attribute information, and performing azimuth enhancement processing on the audio data according to the channel encoding parameter to obtain enhanced audio data. Apparatus and non-transitory computer-readable storage medium counterpart embodiments are also contemplated.

SPEECH RECOGNITION APPARATUS AND METHOD

According to one embodiment, a speech recognition apparatus includes processing circuitry. The processing circuitry generates a plurality of augmented speech data, based on input speech data, generates a plurality of acoustic scores, based on the plurality of augmented speech data and an acoustic model, generates a plurality of adjusted acoustic scores by resampling the acoustic scores, generates an integrated acoustic score by integrating the adjusted acoustic scores, generates an integrated lattice, based on the integrated acoustic score, a pronunciation dictionary, and a language model, and searches a speech recognition result with a highest likelihood from the integrated lattice.

METHOD AND SYSTEM FOR PROTECTING USER PRIVACY DURING AUDIO CONTENT PROCESSING
20220375458 · 2022-11-24 ·

A method and system for protecting user privacy in audio content is disclosed. An audio content including private information related to at least one user is received. The audio content is segmented to generate a plurality of audio blocks. Each audio block is associated with a sequence number based on a respective chronological position in the audio content. A random key of predefined length is generated for each audio block. The plurality of audio blocks are randomly distributed to a plurality of agents for audio-to-text transcription. The random distribution is configured to scramble a data context for protecting the user privacy of the at least one user during the audio-to-text transcription. A textual transcript corresponding to the audio content is generated based on the audio-to-text transcription, the sequence number and the random key generated for each audio block.

METHOD AND SYSTEM FOR PROTECTING USER PRIVACY DURING AUDIO CONTENT PROCESSING
20220375458 · 2022-11-24 ·

A method and system for protecting user privacy in audio content is disclosed. An audio content including private information related to at least one user is received. The audio content is segmented to generate a plurality of audio blocks. Each audio block is associated with a sequence number based on a respective chronological position in the audio content. A random key of predefined length is generated for each audio block. The plurality of audio blocks are randomly distributed to a plurality of agents for audio-to-text transcription. The random distribution is configured to scramble a data context for protecting the user privacy of the at least one user during the audio-to-text transcription. A textual transcript corresponding to the audio content is generated based on the audio-to-text transcription, the sequence number and the random key generated for each audio block.

LOW LATENCY AUTOMIXER INTEGRATED WITH VOICE AND NOISE ACTIVITY DETECTION

Systems and methods are disclosed for providing voice and noise activity detection with audio automixers that can reject errant non-voice or non-human noises while maximizing signal-to-noise ratio and minimizing audio latency.

LOW LATENCY AUTOMIXER INTEGRATED WITH VOICE AND NOISE ACTIVITY DETECTION

Systems and methods are disclosed for providing voice and noise activity detection with audio automixers that can reject errant non-voice or non-human noises while maximizing signal-to-noise ratio and minimizing audio latency.

AUDIO CONFLICT RESOLUTION
20230060042 · 2023-02-23 ·

An example playback device is a first playback device in a media system. The first playback device is configured to resolve audio conflicts with one or more other playback devices in the media system by: (i) capturing, via a microphone of the first playback device, audio content played back by a second playback device, (ii) identifying the second playback device as a source of the captured audio content; and (iii) responsive to identifying the second playback device as the source of the captured audio content, altering a playback characteristic of the second playback device or the first playback device to reduce an audio interference between the first and second playback devices.

APPARATUS AND METHOD FOR PROCESSING MULTI-CHANNEL AUDIO SIGNAL

Disclosed is an apparatus and method for processing a multichannel audio signal. A multichannel audio signal processing method may include: generating an N-channel audio signal of N channels by down-mixing an M-channel audio signal of M channels; and generating a stereo audio signal by performing binaural rendering of the N-channel audio signal.