G10L25/78

AUTOMATED CALLING SYSTEM
20230005482 · 2023-01-05 ·

Methods, systems, and apparatus for an automated calling system are disclosed. Some implementations are directed to using a bot to initiate telephone calls and conduct telephone conversations with a user. The bot may be interrupted while providing synthesized speech during the telephone call. The interruption can be classified into one of multiple disparate interruption types, and the bot can react to the interruption based on the interruption type. Some implementations are directed to determining that a first user is placed on hold by a second user during a telephone conversation, and maintaining the telephone call in an active state in response to determining the first user hung up the telephone call. The first user can be notified when the second user rejoins the call, and a bot associated with the first user can notify the first user that the second user has rejoined the telephone call.

DETECTION OF SPEECH

A method of own voice detection is provided for a user of a device. A first signal is detected, representing air-conducted speech using a first microphone of the device. A second signal is detected, representing bone-conducted speech using a bone-conduction sensor of the device. The first signal is filtered to obtain a component of the first signal at a speech articulation rate, and the second signal is filtered to obtain a component of the second signal at the speech articulation rate. The component of the first signal at the speech articulation rate and the component of the second signal at the speech articulation rate are compared, and it is determined that the speech has not been generated by the user of the device, if a difference between the component of the first signal at the speech articulation rate and the component of the second signal at the speech articulation rate exceeds a threshold value.

METHOD AND DEVICE FOR IMPROVING DYSARTHRIA
20230237928 · 2023-07-27 ·

A method of providing a language training to a user by a computing device comprising a processor and a memory is provided. The method comprises: providing contents corresponding to the language training to a user terminal; receiving the user’s voice data from the user terminal; detecting a pitch and a loudness of the user’s voice by analyzing the voice data; and generating a training evaluation by evaluating the user’s training for the contents corresponding to the language training based on the user’s voice data, further comprising determining a phoneme with poor pronunciation accuracy by analyzing the user’s voice data; and automatically generating and providing at least one of a vocabulary, a sentence, and a paragraph including the determined phoneme.

METHOD AND DEVICE FOR IMPROVING DYSARTHRIA
20230237928 · 2023-07-27 ·

A method of providing a language training to a user by a computing device comprising a processor and a memory is provided. The method comprises: providing contents corresponding to the language training to a user terminal; receiving the user’s voice data from the user terminal; detecting a pitch and a loudness of the user’s voice by analyzing the voice data; and generating a training evaluation by evaluating the user’s training for the contents corresponding to the language training based on the user’s voice data, further comprising determining a phoneme with poor pronunciation accuracy by analyzing the user’s voice data; and automatically generating and providing at least one of a vocabulary, a sentence, and a paragraph including the determined phoneme.

SIGNAL PROCESSING DEVICE, SIGNAL PROCESSING METHOD, PROGRAM, AND SIGNAL PROCESSING SYSTEM
20230005488 · 2023-01-05 ·

Provided is a signal processing device including a main speech detection unit configured to detect, by using a neural network, whether or not a signal input to a sound collection device assigned to each of at least two speakers includes a main speech that is a voice of the corresponding speaker, and output frame information indicating presence or absence of the main speech.

SIGNAL PROCESSING DEVICE, SIGNAL PROCESSING METHOD, PROGRAM, AND SIGNAL PROCESSING SYSTEM
20230005488 · 2023-01-05 ·

Provided is a signal processing device including a main speech detection unit configured to detect, by using a neural network, whether or not a signal input to a sound collection device assigned to each of at least two speakers includes a main speech that is a voice of the corresponding speaker, and output frame information indicating presence or absence of the main speech.

SOUND PROCESSING METHOD, ELECTRONIC DEVICE AND STORAGE MEDIUM
20230007393 · 2023-01-05 ·

A sound processing method includes: determining a vector of a first residual signal according to a first signal vector and a second signal vector, the first signal vector including a first voice signal and a first noise signal input into the first microphone, the second signal vector including a second voice signal and a second noise signal input into the second microphone, and the first residual signal including the second noise signal and a residual voice signal; determining a gain function of a current frame according to the vector of the first residual signal and the first signal vector; and determining a first voice signal of the current frame according to the first signal vector and the gain function of the current frame.

SOUND PROCESSING METHOD, ELECTRONIC DEVICE AND STORAGE MEDIUM
20230007393 · 2023-01-05 ·

A sound processing method includes: determining a vector of a first residual signal according to a first signal vector and a second signal vector, the first signal vector including a first voice signal and a first noise signal input into the first microphone, the second signal vector including a second voice signal and a second noise signal input into the second microphone, and the first residual signal including the second noise signal and a residual voice signal; determining a gain function of a current frame according to the vector of the first residual signal and the first signal vector; and determining a first voice signal of the current frame according to the first signal vector and the gain function of the current frame.

Voice Filtering Other Speakers From Calls And Audio Messages
20230005480 · 2023-01-05 · ·

A method includes receiving a first instance of raw audio data corresponding to a voice-based command and receiving a second instance of the raw audio data corresponding to an utterance of audible contents for an audio-based communication spoken by a user. When a voice filtering recognition routine determines to activate voice filtering for at least the voice of the user, the method also includes obtaining a respective speaker embedding of the user and processing, using the respective speaker embedding, the second instance of the raw audio data to generate enhanced audio data for the audio-based communication that isolates the utterance of the audible contents spoken by the user and excludes at least a portion of the one or more additional sounds that are not spoken by the user The method also includes executing.

Voice-activated call pick-up for mobile device
11570293 · 2023-01-31 · ·

Disclosed embodiments are directed an application program configured to run on a user's mobile device can allow voice-activated call pick-up to the user, without the user having to use his or her hands for picking up the call. For example, the application program can initially be trained to a user's voice command. When an incoming call is received at the mobile device, the user can pick up the call by issuing a voice command. In some embodiments, the application program can determine whether to allow voice-activated pick-up of calls based on data collected from multiple sensors associated with the vehicle, the mobile device, or a remote source.