G10L25/21

SOUND PROCESSING METHOD, ELECTRONIC DEVICE AND STORAGE MEDIUM
20230007393 · 2023-01-05 ·

A sound processing method includes: determining a vector of a first residual signal according to a first signal vector and a second signal vector, the first signal vector including a first voice signal and a first noise signal input into the first microphone, the second signal vector including a second voice signal and a second noise signal input into the second microphone, and the first residual signal including the second noise signal and a residual voice signal; determining a gain function of a current frame according to the vector of the first residual signal and the first signal vector; and determining a first voice signal of the current frame according to the first signal vector and the gain function of the current frame.

SOUND PROCESSING METHOD, ELECTRONIC DEVICE AND STORAGE MEDIUM
20230007393 · 2023-01-05 ·

A sound processing method includes: determining a vector of a first residual signal according to a first signal vector and a second signal vector, the first signal vector including a first voice signal and a first noise signal input into the first microphone, the second signal vector including a second voice signal and a second noise signal input into the second microphone, and the first residual signal including the second noise signal and a residual voice signal; determining a gain function of a current frame according to the vector of the first residual signal and the first signal vector; and determining a first voice signal of the current frame according to the first signal vector and the gain function of the current frame.

METHOD FOR PROCESSING AN AUDIO STREAM AND CORRESPONDING SYSTEM

A method and a system for processing an audio stream are described, wherein at least one database of classified voices and at least one database of classified background sounds are provided and a comparison between these classified voices and background sounds with the voices and the sounds extrapolated from a suitably re-processed audio stream is carried out in order to identify possible matches.

APPROACHES TO GENERATING STUDIO-QUALITY RECORDINGS THROUGH MANIPULATION OF NOISY AUDIO
20230230610 · 2023-07-20 ·

Introduced here are computer programs and associated computer-implemented techniques for manipulating noisy audio signals to produce clean audio signals that are sufficiently high quality so as to be largely, if not entirely, indistinguishable from “rich” recordings generated by recording studios. When a noisy audio signal is obtained by a media production platform, the noisy audio signal can be manipulated to sound as if recording occurred with sophisticated equipment in a soundproof environment. Manipulation can be performed by a model that, when applied to the noisy audio signal, can manipulate its characteristics so as to emulate the characteristics of clean audio signals that are learned through training.

APPROACHES TO GENERATING STUDIO-QUALITY RECORDINGS THROUGH MANIPULATION OF NOISY AUDIO
20230230610 · 2023-07-20 ·

Introduced here are computer programs and associated computer-implemented techniques for manipulating noisy audio signals to produce clean audio signals that are sufficiently high quality so as to be largely, if not entirely, indistinguishable from “rich” recordings generated by recording studios. When a noisy audio signal is obtained by a media production platform, the noisy audio signal can be manipulated to sound as if recording occurred with sophisticated equipment in a soundproof environment. Manipulation can be performed by a model that, when applied to the noisy audio signal, can manipulate its characteristics so as to emulate the characteristics of clean audio signals that are learned through training.

Apparatus and method for generating an enhanced signal using independent noise-filling

An apparatus for generating an enhanced signal from an input signal, wherein the enhanced signal has spectral values for an enhancement spectral region, the spectral values for the enhancement spectral regions not being contained in the input signal, includes a mapper for mapping a source spectral region of the input signal to a target region in the enhancement spectral region, the source spectral region including a noise-filling region; and a noise filler configured for generating first noise values for the noise-filling region in the source spectral region of the input signal and for generating second noise values for a noise region in the target region, wherein the second noise values are decorrelated from the first noise values or for generating second noise values for a noise region in the target region, wherein the second noise values are decorrelated from first noise values in the source region.

Apparatus and method for generating an enhanced signal using independent noise-filling

An apparatus for generating an enhanced signal from an input signal, wherein the enhanced signal has spectral values for an enhancement spectral region, the spectral values for the enhancement spectral regions not being contained in the input signal, includes a mapper for mapping a source spectral region of the input signal to a target region in the enhancement spectral region, the source spectral region including a noise-filling region; and a noise filler configured for generating first noise values for the noise-filling region in the source spectral region of the input signal and for generating second noise values for a noise region in the target region, wherein the second noise values are decorrelated from the first noise values or for generating second noise values for a noise region in the target region, wherein the second noise values are decorrelated from first noise values in the source region.

HEARING DEVICE WITH PULSE POWER ESTIMATION, PULSE DETECTION, AND RELATED METHOD
20230018409 · 2023-01-19 · ·

Hearing device and method of power estimation and/or pulse detection in a hearing device is disclosed. The method comprises obtaining a pulse input signal; determining if the pulse input signal satisfies a first rising criterion; in accordance with the input signal satisfying the first rising criterion, increasing a threshold; determining if the pulse input signal satisfies a first down count criterion; in accordance with the pulse input signal satisfying the first down count criterion, initializing a down counter; determining if the down counter satisfies a second down count criterion; in accordance with the down counter satisfying the second down count criterion, decreasing the down counter; determining if the down counter satisfies a pulse detection criterion; and in accordance with the down counter satisfying the pulse detection criterion, outputting a pulse output signal indicative of detection of a pulse.

MULTI-REGISTER-BASED SPEECH DETECTION METHOD AND RELATED APPARATUS, AND STORAGE MEDIUM

This application discloses a multi-sound area-based speech detection method and related apparatus, and a storage medium, which is applied to the field of artificial intelligence. The method includes: obtaining sound area information corresponding to each sound area in N sound areas; using the sound area as a target detection sound area, and generating a control signal corresponding to the target detection sound area according to sound area information corresponding to the target detection sound area; processing a speech input signal corresponding to the target detection sound area by using the control signal corresponding to the target detection sound area, to obtain a speech output signal corresponding to the target detection sound area; and generating a speech detection result of the target detection sound area according to the speech output signal corresponding to the target detection sound area. Speech signals in different directions are processed in parallel based on a plurality of sound areas, so that in a multi-sound source scenario, the speech signals in different directions may be retained or suppressed by a control signal, to separate and enhance speech of a target detection user in real time, thereby improving the accuracy of speech detection.

MULTI-REGISTER-BASED SPEECH DETECTION METHOD AND RELATED APPARATUS, AND STORAGE MEDIUM

This application discloses a multi-sound area-based speech detection method and related apparatus, and a storage medium, which is applied to the field of artificial intelligence. The method includes: obtaining sound area information corresponding to each sound area in N sound areas; using the sound area as a target detection sound area, and generating a control signal corresponding to the target detection sound area according to sound area information corresponding to the target detection sound area; processing a speech input signal corresponding to the target detection sound area by using the control signal corresponding to the target detection sound area, to obtain a speech output signal corresponding to the target detection sound area; and generating a speech detection result of the target detection sound area according to the speech output signal corresponding to the target detection sound area. Speech signals in different directions are processed in parallel based on a plurality of sound areas, so that in a multi-sound source scenario, the speech signals in different directions may be retained or suppressed by a control signal, to separate and enhance speech of a target detection user in real time, thereby improving the accuracy of speech detection.