G10L17/04

RELAXED INSTANCE FREQUENCY NORMALIZATION FOR NEURAL-NETWORK-BASED AUDIO PROCESSING

Techniques and apparatus for training a neural network to classify audio into one of a plurality of categories and using such a trained neural network. An example method generally includes receiving a data set including a plurality of audio samples. A relaxed feature-normalized data set is generated by normalizing each audio sample of the plurality of audio samples. A neural network is trained to classify audio into one of a plurality of categories based on the relaxed feature-normalized data set, and the trained neural network is deployed.

DEVICE FINDER USING VOICE AUTHENTICATION
20220328049 · 2022-10-13 ·

A computing device may receive an indication of an audio signal captured by a microphone, wherein the audio signal includes voice input. The computing device may determine that the voice input in the audio signal is from an authorized user of the computing device and includes a trigger phrase associated with a request to trigger device finder functionality based at least in part on comparing the voice input with data provided by the authorized user of the computing device. The computing device may, in response to determining that the voice input in the audio signal is from the authorized user of the computing device and includes the trigger phrase associated with the request to trigger device finder functionality, cause a speaker of the computing device to audibly output the alert sound to assist the authorized user to locate the computing device.

DEVICE FINDER USING VOICE AUTHENTICATION
20220328049 · 2022-10-13 ·

A computing device may receive an indication of an audio signal captured by a microphone, wherein the audio signal includes voice input. The computing device may determine that the voice input in the audio signal is from an authorized user of the computing device and includes a trigger phrase associated with a request to trigger device finder functionality based at least in part on comparing the voice input with data provided by the authorized user of the computing device. The computing device may, in response to determining that the voice input in the audio signal is from the authorized user of the computing device and includes the trigger phrase associated with the request to trigger device finder functionality, cause a speaker of the computing device to audibly output the alert sound to assist the authorized user to locate the computing device.

Electronic apparatus and controlling method thereof

An electronic apparatus is disclosed. The apparatus includes a memory configured to store at least one pre-registered voiceprint and a first voiceprint cluster including the at least one pre-registered voiceprint, and a processor configured to, based on a user recognition command being received, obtain information of time at which the user recognition command is received, change the at least one pre-registered voiceprint included in the first voiceprint cluster based on the obtained information of time, generate a second voiceprint cluster based on the at least one changed voiceprint, and based on a user's utterance being received, perform user recognition with respect to the received user's utterance based on the first voiceprint cluster and the second voiceprint cluster.

Electronic apparatus and controlling method thereof

An electronic apparatus is disclosed. The apparatus includes a memory configured to store at least one pre-registered voiceprint and a first voiceprint cluster including the at least one pre-registered voiceprint, and a processor configured to, based on a user recognition command being received, obtain information of time at which the user recognition command is received, change the at least one pre-registered voiceprint included in the first voiceprint cluster based on the obtained information of time, generate a second voiceprint cluster based on the at least one changed voiceprint, and based on a user's utterance being received, perform user recognition with respect to the received user's utterance based on the first voiceprint cluster and the second voiceprint cluster.

Voice biometric authentication in a virtual assistant
11665153 · 2023-05-30 · ·

Aspects of the disclosure relate to voice biometric authentication in a virtual assistant. In some embodiments, a computing platform may receive, from a user device, an audio file comprising a voice command to access information related to a user account. The computing platform may retrieve one or more voice biometric signatures from a voice biometric database associated with the user account, and apply a voice biometric matching algorithm to compare the voice command of the audio file to the one or more voice biometric signatures to determine if a match exists between the voice command and one of the one or more voice biometric signatures. In response to determining that a match exists, the computing platform may retrieve information associated with the user account, and then send, via the communication interface, the information associated with the user account to the user device.

VOICE COMMAND SYSTEM AND VOICE COMMAND METHOD
20230162734 · 2023-05-25 ·

A voice command system according to a first disclosure comprises a gateway apparatus having an interface configured to receive a voice command, and a controller configured to perform a registration process of registering a speaker permitted to receive the voice command. The controller is configured to perform an authentication process of rejecting a reception of the voice command when a speaker of the voice command is not registered, and permitting a reception of the voice command when a speaker of the voice command is registered. The controller is configured to perform the authentication process for each voice command.

VOICE VERIFICATION AND RESTRICTION METHOD OF VOICE TERMINAL
20230162741 · 2023-05-25 ·

A voice verification and restriction method of the voice terminal includes: a) voice storage step including: inputting and registering voice of a user through a microphone of the voice terminal, receiving and analyzing the input voice using a language processing module, transmitting the analyzed voice to a plurality of voice authentication servers to verify and store each voice, and learning the stored voice using an AI processor; and b) voice verification step including: mutually comparing the input voice with voice stored in at least one server among voices stored in the plurality of voice authentication servers, performing approval and a voice command when the input voice matches the stored voice, and setting restrictions on all or some of functions of the voice terminal and executing a step-by-step action designated by the user when the input voice does not match the stored voice.

NOISE CANCELLATION PROCESSING METHOD, DEVICE AND APPARATUS
20230164477 · 2023-05-25 ·

A noise cancellation processing method, device and apparatus are provided. The noise cancellation processing method includes: collecting first voice data in a surrounding environment by using a noise-cancelling earphone in response to detecting that the noise-cancelling earphone is in a wearing state and a noise cancellation mode is enabled; extracting to-be-recognized voiceprint feature information according to the first voice data; identifying similarities between registered voiceprint feature information stored in a registered voiceprint database and the to-be-recognized voiceprint feature information entry by entry; and in response to at least one of the similarities being greater than a first preset threshold, performing a preset action in the noise-cancelling earphone.

NOISE CANCELLATION PROCESSING METHOD, DEVICE AND APPARATUS
20230164477 · 2023-05-25 ·

A noise cancellation processing method, device and apparatus are provided. The noise cancellation processing method includes: collecting first voice data in a surrounding environment by using a noise-cancelling earphone in response to detecting that the noise-cancelling earphone is in a wearing state and a noise cancellation mode is enabled; extracting to-be-recognized voiceprint feature information according to the first voice data; identifying similarities between registered voiceprint feature information stored in a registered voiceprint database and the to-be-recognized voiceprint feature information entry by entry; and in response to at least one of the similarities being greater than a first preset threshold, performing a preset action in the noise-cancelling earphone.