G10L25/09

Systems and methods for reducing audio artifacts from switching between paths of a multi-path signal processing system

A processing system may include a controller and a plurality of processing paths including a first processing path and a second processing path. The first path may be configured to generate a first digital signal based on an analog input signal and the second path may be configured to generate a second digital signal based on the analog input signal, wherein the first path has a lower gain and a higher noise floor than the second path. The controller may be configured to determine that a transition between the first path and the second path needs to occur, based on the analog input signal crossing a threshold or a prediction that the input signal will cross the threshold, and, in response to that determination, to blend the transition during or near zero-cross points of the analog input signal.
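
The blending behaviour described above can be illustrated with a minimal Python sketch. It is not the patented implementation: the function name `blend_paths`, the peak-envelope level estimate, the linear crossfade, and the parameters `fade_len` and `decay` are all assumptions made for illustration, and both path outputs are assumed to be already normalised to the same scale.

```python
import math

def blend_paths(low_gain, high_gain, threshold, fade_len=8, decay=0.99):
    """Crossfade between two gain-normalised path outputs (hypothetical sketch).

    weight 0.0 selects the high-gain (low-noise) path, 1.0 the low-gain path.
    A change of path is only started at or near a zero crossing.
    """
    out, weights = [], []
    weight = 0.0      # current blend position
    target = 0.0      # blend position we are fading towards
    envelope = 0.0    # decaying peak estimate of the signal level (assumption)
    prev = 0.0
    step = 1.0 / fade_len
    for lo, hi in zip(low_gain, high_gain):
        envelope = max(abs(lo), envelope * decay)
        # Loud signals call for the low-gain path, quiet ones for the high-gain path.
        wanted = 1.0 if envelope >= threshold else 0.0
        at_zero_cross = (prev <= 0.0 <= lo) or (lo <= 0.0 <= prev)
        if wanted != target and at_zero_cross:
            target = wanted          # begin the transition here
        if weight < target:
            weight = min(target, weight + step)
        elif weight > target:
            weight = max(target, weight - step)
        out.append(weight * lo + (1.0 - weight) * hi)
        weights.append(weight)
        prev = lo
    return out, weights
```

Starting the fade where the signal is close to zero keeps the momentary mismatch between the two paths small, which is the motivation the abstract gives for blending at zero-cross points. This sketch reacts only after the threshold is crossed; the predictive variant mentioned in the abstract is not modelled.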

AUDIO CLASSIFIER THAT INCLUDES A FIRST PROCESSOR AND A SECOND PROCESSOR

The disclosure relates to an audio classifier comprising: a first processor having hard-wired logic configured to receive an audio signal and detect audio activity from the audio signal; and a second processor having reconfigurable logic configured to classify the audio signal as a type of audio signal in response to the first processor detecting audio activity.
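
The two-stage arrangement can be sketched in software as a cheap, fixed activity detector that gates a replaceable classifier. This is only an analogy under stated assumptions: the energy threshold stands in for the hard-wired logic, the swappable `classify` callable stands in for the reconfigurable logic, and the zero-crossing classifier and all names are hypothetical.

```python
from typing import Callable, Optional, Sequence

def detect_activity(frame: Sequence[float], energy_threshold: float = 0.01) -> bool:
    """Stage 1 (fixed): mean-square energy detector, standing in for hard-wired logic."""
    energy = sum(s * s for s in frame) / len(frame)
    return energy >= energy_threshold

def zero_crossing_classifier(frame: Sequence[float]) -> str:
    """Toy stage-2 classifier: label a frame by its zero-crossing rate (assumption)."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0))
    rate = crossings / (len(frame) - 1)
    return "noise-like" if rate > 0.3 else "tonal"

class TwoStageClassifier:
    """Runs the replaceable second stage only when the first stage reports activity."""

    def __init__(self, classify: Callable[[Sequence[float]], str]):
        self.classify = classify          # can be swapped, like reconfigurable logic
        self.second_stage_calls = 0

    def process(self, frame: Sequence[float]) -> Optional[str]:
        if not detect_activity(frame):
            return None                   # second stage stays idle
        self.second_stage_calls += 1
        return self.classify(frame)
```

The counter `second_stage_calls` makes the gating visible: silent frames never reach the classifier.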

SOUND SOURCE DETERMINING METHOD AND SYSTEM, ELECTRONIC DEVICE AND READABLE STORAGE MEDIUM

A sound source determining method and system and an electronic device are disclosed. The sound source determining method includes: obtaining initial audio information collected in real time; performing audio recognition processing on the initial audio information to obtain an audio recognition result; using the initial audio information corresponding to the audio recognition result as target audio information in a case that the audio recognition result indicates that the initial audio information meets a preset audio recognition condition; performing audio information activity detection on the target audio information to obtain target audio activity information; and performing sound source positioning on a sound producing object corresponding to the target audio activity information according to sound source positioning parameters corresponding to the target audio activity information to obtain target position information of the sound producing object.
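
The sequence of steps (recognition gate, activity detection, positioning) can be sketched as below. The abstract does not say how positioning is performed; the two-microphone time-delay estimate used here, along with every function name, threshold, and parameter, is an assumption chosen only to make the pipeline concrete.

```python
import math

def is_active(frame, energy_threshold=1e-3):
    """Activity detection: mean-square energy against a threshold (assumption)."""
    return sum(s * s for s in frame) / len(frame) >= energy_threshold

def estimate_delay(left, right, max_lag):
    """Lag in samples by which `right` trails `left`, by brute-force cross-correlation."""
    n = len(left)
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = sum(left[i] * right[i + lag] for i in range(n) if 0 <= i + lag < n)
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag

def bearing_degrees(lag, sample_rate, mic_spacing_m, speed_of_sound=343.0):
    """Far-field bearing from the delay; positive means nearer the left microphone."""
    x = speed_of_sound * lag / (sample_rate * mic_spacing_m)
    return math.degrees(math.asin(max(-1.0, min(1.0, x))))

def locate(left, right, recognise, sample_rate, mic_spacing_m, max_lag):
    """Recognition gate, then activity detection, then positioning."""
    if not recognise(left):      # preset audio recognition condition not met
        return None
    if not is_active(left):      # no audio activity in the target audio
        return None
    lag = estimate_delay(left, right, max_lag)
    return bearing_degrees(lag, sample_rate, mic_spacing_m)
```

Here `recognise` is a caller-supplied predicate standing in for the audio recognition step, so positioning is attempted only for audio that passes both gates.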

Bodily function sound anonymization
09830901 · 2017-11-28

Apparatuses, systems, methods, and software for bodily function sound anonymization.

Providing intelligent transcriptions of sound messages in a messaging application

One or more embodiments described herein include methods and systems of creating transcribed electronic communications based on sound inputs. More specifically, systems and methods described herein provide users the ability to easily and effectively send an electronic communication that includes a textual message transcribed from a sound input. Additionally, systems and methods described herein provide an analysis of a textual message transcribed from a sound input allowing users to correct an inaccurate or incorrect transcription.

System and method for improved audio consistency
09691392 · 2017-06-27

A voice biometrics system adapted to authenticate a user based on speech diagnostics is provided. The system includes a pre-processing module to receive and pre-process an input voice sample. The pre-processing module includes a clipping module to clip the input voice sample based on a clipping threshold and a voice activity detection module to apply a detection model on the input voice sample to determine an audible region and a non-audible region in the input voice sample. The pre-processing module includes a noise reduction module to apply a noise reduction model to remove noise components from the input voice sample. The voice biometrics system includes a feature extraction module to extract features from the pre-processed input voice sample. The voice biometrics system also includes an authentication module to authenticate the user by comparing a plurality of features extracted from the pre-processed input voice sample to a plurality of enrollment features.
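
Two of the pre-processing steps, clipping and voice activity detection, can be sketched as follows. The abstract does not define either step precisely, so this sketch assumes that clipping means limiting sample amplitude to the clipping threshold and that the detection model is a per-frame energy threshold; the function names and parameters are hypothetical.

```python
def clip_sample(samples, clipping_threshold):
    """Limit every sample to the range [-clipping_threshold, +clipping_threshold]."""
    return [max(-clipping_threshold, min(clipping_threshold, s)) for s in samples]

def split_regions(samples, frame_len, energy_threshold):
    """Label fixed-length frames as audible or non-audible by mean-square energy.

    Returns two lists of (start, end) index pairs, end exclusive.
    """
    audible, non_audible = [], []
    for start in range(0, len(samples), frame_len):
        frame = samples[start:start + frame_len]
        energy = sum(s * s for s in frame) / len(frame)
        region = (start, start + len(frame))
        if energy >= energy_threshold:
            audible.append(region)
        else:
            non_audible.append(region)
    return audible, non_audible
```

Noise reduction, feature extraction, and the comparison against enrollment features would follow on the audible regions; they are left out here because the abstract gives no detail on the models involved.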

Pitch marking in speech processing

According to some embodiments of the present invention, there is provided a computerized method for selecting and correcting pitch marks in speech processing and modification. The method comprises an action of receiving a continuous speech signal representing audible speech recorded by a microphone, where a sequence of pitch values and two or more pitch mark temporal values are computed from the continuous speech signal. The method comprises an action of computing, for each of the pitch mark temporal values, a lower limit temporal value and an upper limit temporal value by a cross-correlation function of the continuous speech signal around the pitch mark temporal values associated with pairs of elements in the sequence, and replacing one or more of the pitch mark temporal values with one or more new temporal values between the lower limit temporal value and the upper limit temporal value.
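
A simplified illustration of correcting a pitch mark by cross-correlation follows. It departs from the abstract in one labelled way: instead of deriving the lower and upper limits from the cross-correlation function, it assumes a fixed search range of `search` samples either side of the mark and picks the position whose neighbourhood best matches the neighbourhood of the previous mark. The function name and parameters are hypothetical.

```python
def refine_pitch_mark(signal, previous_mark, mark, search=3, half_window=4):
    """Return a corrected position for `mark` within [mark - search, mark + search].

    The candidate whose surrounding window correlates best with the window
    around `previous_mark` is chosen. Callers must keep all windows in bounds.
    """
    def window(center):
        return signal[center - half_window:center + half_window + 1]

    reference = window(previous_mark)
    best_pos, best_score = mark, float("-inf")
    for candidate in range(mark - search, mark + search + 1):
        # Unnormalised cross-correlation at zero lag between the two windows.
        score = sum(a * b for a, b in zip(reference, window(candidate)))
        if score > best_score:
            best_pos, best_score = candidate, score
    return best_pos
```

For a pulse train with a period of 20 samples and a mark misplaced at 42, the function would move the mark back to 40, where the local waveform lines up with the previous period.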