G10L25/93

APPARATUS AND METHOD FOR SIGNAL PROCESSING

A signal processing apparatus includes a frequency detector configured to receive a user input including at least one of a vibration input and a user voice, vibrate in response to the received user input, and detect a frequency of the received user input, based on the vibration, and a processor configured to determine a type of the user input received by the frequency detector, based on the frequency detected by the frequency detector, and perform a function corresponding to the user input of the determined type.
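
The decision step in this abstract (detected frequency, then input type, then function) can be sketched as follows. This is a minimal illustration only: the voice band limits, the type labels, and the handler functions are hypothetical and are not taken from the abstract.

```python
# Hypothetical sketch: classify a user input by its detected frequency and
# dispatch a function for that type. The band limits below are assumptions.
VOICE_BAND_HZ = (85.0, 4000.0)  # assumed range treated as "user voice"


def classify_input(freq_hz: float) -> str:
    """Return 'voice' if the frequency lies in the assumed voice band,
    otherwise 'vibration'."""
    if freq_hz <= 0:
        raise ValueError("frequency must be positive")
    low, high = VOICE_BAND_HZ
    return "voice" if low <= freq_hz <= high else "vibration"


def on_voice() -> str:
    # Placeholder for a voice-driven function (for example, speech recognition).
    return "start speech recognition"


def on_vibration() -> str:
    # Placeholder for a vibration-driven function (for example, a tap command).
    return "handle tap command"


HANDLERS = {"voice": on_voice, "vibration": on_vibration}


def perform_function(freq_hz: float) -> str:
    """Determine the input type from the frequency and run its handler."""
    return HANDLERS[classify_input(freq_hz)]()
```

For example, `perform_function(200.0)` takes the voice branch under these assumed limits, while `perform_function(20.0)` takes the vibration branch.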

SPEECH PROCESSING METHOD AND APPARATUS, DEVICE, STORAGE MEDIUM AND PROGRAM

The present disclosure provides a speech processing method. A specific implementation solution is: a terminal device sends at least one first speech intention to a server in a process of receiving first speech information, where each first speech intention is a speech intention corresponding to a part of speech information in the first speech information; the server acquires response information corresponding to the at least one first speech intention; the terminal device sends the first speech information to the server in response to completion of receiving the first speech information; the server acquires a second speech intention corresponding to the first speech information, and sends the response information corresponding to the first speech intention to the terminal device; and the terminal device outputs the response information.

Method and system for generating mixed voice data

The present disclosure discloses a method and system for generating mixed voice data, and belongs to the technical field of voice recognition. In the method for generating mixed voice data according to the present disclosure, a pure voice and noise are collected first, normalization processing is performed on the collected voice data, randomization processing is performed on the processed data, then gain processing is performed on the data, and finally filter processing is performed to obtain mixed voice data. The system for generating mixed voice data according to the present disclosure includes a collecting unit, a calculating unit, and a storage unit, the collecting unit being electrically connected to the calculating unit, and the calculating unit being connected to the storage unit through a data transmitting unit. The method and the system are intended to meet the data requirements of deep learning.
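
The four processing steps named in this abstract (normalization, randomization, gain, filtering) can be sketched as a small pipeline. The abstract does not say how each step is carried out or where the voice and noise are combined, so everything concrete here is an assumption: peak normalization, a random wrap-around noise segment, uniformly drawn gains, mixing by summation, and a one-pole low-pass filter. All function names are hypothetical.

```python
import random


def normalize(samples):
    """Peak-normalize to [-1, 1] (assumed form of 'normalization processing')."""
    peak = max((abs(s) for s in samples), default=0.0)
    return [s / peak for s in samples] if peak > 0 else list(samples)


def random_segment(samples, length, rng):
    """Pick a segment at a random start, wrapping around (assumed
    'randomization processing'). Requires a non-empty input."""
    start = rng.randrange(len(samples))
    return [samples[(start + i) % len(samples)] for i in range(length)]


def apply_gain(samples, gain):
    """Scale every sample by a constant gain."""
    return [s * gain for s in samples]


def low_pass(samples, alpha=0.5):
    """One-pole low-pass filter (assumed 'filter processing')."""
    out, prev = [], 0.0
    for s in samples:
        prev = alpha * s + (1.0 - alpha) * prev
        out.append(prev)
    return out


def make_mixed_voice(voice, noise, rng,
                     voice_gain=(0.5, 1.0), noise_gain=(0.1, 0.5)):
    """Normalize, randomize, apply gain, mix by summation, then filter."""
    v = normalize(voice)
    n = random_segment(normalize(noise), len(v), rng)
    v = apply_gain(v, rng.uniform(*voice_gain))
    n = apply_gain(n, rng.uniform(*noise_gain))
    mixed = [a + b for a, b in zip(v, n)]
    return low_pass(mixed)
```

Passing a seeded generator, for example `make_mixed_voice(voice, noise, random.Random(0))`, keeps a generated data set reproducible.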

Voice user interface for intervening in conversation of at least one user by adjusting two different thresholds

An electronic device is provided. The electronic device includes a memory configured to store at least one instruction, and at least one processor configured to execute the instruction to obtain voice data from a conversation of at least one user, convert the voice data to text data, determine at least one parameter indicating a characteristic of the conversation based on at least one of the voice data or the text data, adjust a condition for triggering intervention in the conversation based on the determined at least one parameter, and output feedback based on the text data when the adjusted condition is satisfied, wherein the adjustment of the condition includes adjusting a first threshold and a second threshold based on a change in the at least one parameter.

Vowel sensing voice activity detector
11587579 · 2023-02-21

Methods and apparatuses for detecting user speech are described. In one example, a method for detecting user speech includes receiving a microphone output signal corresponding to sound received at a microphone and identifying a spoken vowel sound in the microphone signal. The method further includes outputting an indication of user speech detection responsive to identifying the spoken vowel sound.
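
The idea of reporting user speech once a vowel sound is identified can be sketched with a simple per-frame heuristic. The abstract does not state how the vowel is identified, so this sketch swaps in a common stand-in: a frame is treated as vowel-like when its energy is high and its zero-crossing rate is low, as is typical of voiced sounds. The thresholds and function names are hypothetical.

```python
import math


def frame_energy(frame):
    """Mean squared amplitude of the frame."""
    return sum(s * s for s in frame) / len(frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def looks_like_vowel(frame, energy_min=0.01, zcr_max=0.15):
    """Assumed heuristic: voiced, vowel-like frames are loud and cross zero rarely."""
    if len(frame) < 2:
        return False
    return frame_energy(frame) >= energy_min and zero_crossing_rate(frame) <= zcr_max


def detect_user_speech(frames):
    """Return True as soon as any frame looks like a spoken vowel."""
    return any(looks_like_vowel(f) for f in frames)


# Synthetic frames at an assumed 8 kHz sample rate.
tone = [0.5 * math.sin(2 * math.pi * 200 * n / 8000) for n in range(240)]
hiss = [0.5 if n % 2 == 0 else -0.5 for n in range(240)]
silence = [0.0] * 240
```

With these synthetic frames, the 200 Hz tone passes the check while the alternating-sign hiss and the silent frame do not. A detector built on formant analysis would be closer to actual vowel identification; the energy and zero-crossing test is only the simplest proxy.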

Voice processing method, apparatus, electronic device, and storage medium

Provided in the present disclosure are a voice processing method, an apparatus, an electronic device, and a storage medium, the method comprising: detecting the working state of a current call system, and when the working state is a two-end speaking state or a remote-end speaking state, performing compression processing on a subsequent remote-end voice signal, acquiring a near-end voice signal by means of a microphone, performing echo processing on the basis of the near-end voice signal and the compression-processed remote-end voice signal to obtain an echo-processed near-end voice signal and a remaining echo signal, performing non-linear suppression processing on the near-end voice signal and the remaining echo signal, and performing gain control on the suppression-processed near-end voice signal.
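
The first stage of this method, compressing the remote-end signal only when the call is in a two-end or remote-end speaking state, can be sketched as below. The abstract gives no compressor design, so the static threshold-and-ratio compressor, the state names, and the parameter values are all assumptions; the echo processing, non-linear suppression, and gain control stages are not shown.

```python
from enum import Enum


class CallState(Enum):
    NEAR_END_SPEAKING = "near_end"
    REMOTE_END_SPEAKING = "remote_end"
    TWO_END_SPEAKING = "two_end"
    SILENCE = "silence"


def compress(samples, threshold=0.5, ratio=4.0):
    """Static compressor: magnitude above the threshold is reduced by the ratio."""
    out = []
    for s in samples:
        mag = abs(s)
        if mag > threshold:
            mag = threshold + (mag - threshold) / ratio
        out.append(mag if s >= 0 else -mag)
    return out


def prepare_remote_signal(samples, state):
    """Compress the remote-end signal only in the two states named in the abstract."""
    if state in (CallState.TWO_END_SPEAKING, CallState.REMOTE_END_SPEAKING):
        return compress(samples)
    return list(samples)
```

The output of `prepare_remote_signal` would then serve as the reference signal for the echo processing step, with the near-end microphone signal as the other input.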