G10L17/18

Intelligent Interactive Voice Recognition System
20220366915 · 2022-11-17 ·

Systems for performing intelligent interactive voice recognition functions are provided. In some aspects, natural language data may be received from a plurality of users. The natural language data may be used to generate a plurality of user-specific machine learning datasets. Subsequent natural language input data including a user query may be received. The query may be analyzed to identify the user and a user-specific machine learning dataset associated with the user may be identified. The natural language data may be processed to remove noise associated with the data and may be further processed using the identified user-specific machine learning dataset to interpret the query of the user and generate an output. The output may be transmitted to the user and feedback data may be received from the user. The user-specific machine learning dataset may then be validated and/or updated based on the feedback data.

Intelligent Interactive Voice Recognition System
20220366915 · 2022-11-17 ·

Systems for performing intelligent interactive voice recognition functions are provided. In some aspects, natural language data may be received from a plurality of users. The natural language data may be used to generate a plurality of user-specific machine learning datasets. Subsequent natural language input data including a user query may be received. The query may be analyzed to identify the user and a user-specific machine learning dataset associated with the user may be identified. The natural language data may be processed to remove noise associated with the data and may be further processed using the identified user-specific machine learning dataset to interpret the query of the user and generate an output. The output may be transmitted to the user and feedback data may be received from the user. The user-specific machine learning dataset may then be validated and/or updated based on the feedback data.

SPOOFING DETECTION APPARATUS, SPOOFING DETECTION METHOD, AND COMPUTER-READABLE STORAGE MEDIUM

A spoofing detection apparatus 100 includes a multi-channel spectrogram creation unit 10 and an evaluation unit 40. The multi-channel spectrogram creation unit 10 extracts different type of spectrograms from speech data and integrates the different type of spectrograms to create a multi-channel spectrogram. The evaluation unit 40 evaluates the created multi-channel spectrogram by applying the created multi-channel spectrogram to a classifier constructed using labeled multi-channel spectrograms as training data and classifies it to either genuine or spoof.

SPOOFING DETECTION APPARATUS, SPOOFING DETECTION METHOD, AND COMPUTER-READABLE STORAGE MEDIUM

A spoofing detection apparatus 100 includes a multi-channel spectrogram creation unit 10 and an evaluation unit 40. The multi-channel spectrogram creation unit 10 extracts different type of spectrograms from speech data and integrates the different type of spectrograms to create a multi-channel spectrogram. The evaluation unit 40 evaluates the created multi-channel spectrogram by applying the created multi-channel spectrogram to a classifier constructed using labeled multi-channel spectrograms as training data and classifies it to either genuine or spoof.

BIOMETRIC AUTHENTICATION THROUGH VOICE PRINT CATEGORIZATION USING ARTIFICIAL INTELLIGENCE
20220358933 · 2022-11-10 ·

A system is provided to categorize voice prints during a voice authentication. The system includes a processor and a computer readable medium operably coupled thereto, to perform voice authentication operations which include receiving an enrollment of a user in the biometric authentication system, requesting a first voice print comprising a sample of a voice of the user, receiving the first voice print of the user during the enrollment, accessing a plurality of categorizations of the voice prints for the voice authentication, wherein each of the plurality of categorizations comprises a portion of the voice prints based on a plurality of similarity scores of distinct voice prints in the portion to a plurality of other voice prints, determining, using a hidden layer of a neural network, one of the plurality of categorizations for the first voice print, and encoding the first voice print with the one of the plurality of categorizations.

BIOMETRIC AUTHENTICATION THROUGH VOICE PRINT CATEGORIZATION USING ARTIFICIAL INTELLIGENCE
20220358933 · 2022-11-10 ·

A system is provided to categorize voice prints during a voice authentication. The system includes a processor and a computer readable medium operably coupled thereto, to perform voice authentication operations which include receiving an enrollment of a user in the biometric authentication system, requesting a first voice print comprising a sample of a voice of the user, receiving the first voice print of the user during the enrollment, accessing a plurality of categorizations of the voice prints for the voice authentication, wherein each of the plurality of categorizations comprises a portion of the voice prints based on a plurality of similarity scores of distinct voice prints in the portion to a plurality of other voice prints, determining, using a hidden layer of a neural network, one of the plurality of categorizations for the first voice print, and encoding the first voice print with the one of the plurality of categorizations.

System for creating speaker model based on vocal sounds for a speaker recognition system, computer program product, and controller, using two neural networks
11495235 · 2022-11-08 · ·

According to one embodiment, a system for creating a speaker model includes one or more processors. The processors change a part of network parameters from an input layer to a predetermined intermediate layer based on a plurality of patterns and inputs a piece of speech into each of neural networks so as to obtain a plurality of outputs from the intermediate layer. The part of network parameters of the each of the neural networks is changed based on one of the plurality of patterns. The processors create a speaker model with respect to one or more words detected from the speech based on the outputs.

System for creating speaker model based on vocal sounds for a speaker recognition system, computer program product, and controller, using two neural networks
11495235 · 2022-11-08 · ·

According to one embodiment, a system for creating a speaker model includes one or more processors. The processors change a part of network parameters from an input layer to a predetermined intermediate layer based on a plurality of patterns and inputs a piece of speech into each of neural networks so as to obtain a plurality of outputs from the intermediate layer. The part of network parameters of the each of the neural networks is changed based on one of the plurality of patterns. The processors create a speaker model with respect to one or more words detected from the speech based on the outputs.

SYSTEMS AND METHODS FOR PERFORMING COMMANDS IN A VEHICLE USING SPEECH AND IMAGE RECOGNITION

Systems and methods are disclosed herein for implementation of a vehicle command operation system that may use multi-modal technology to authenticate an occupant of the vehicle to authorize a command and receive natural language commands for vehicular operations. The system may utilize sensors to receive data indicative of a voice command from an occupant of the vehicle. The system may receive second sensor data to aid in the determination of the corresponding vehicular operation in response to the received command. The system may retrieve authentication data for the occupants of the vehicle. The system authenticates the occupant to authorize a vehicular operation command using a neural network based on at least one of the first sensor data, the second sensor data, and the authentication data. Responsive to the authentication, the system may authorize the operation to be performed in the vehicle based on the vehicular operation command.

SYSTEMS AND METHODS FOR PERFORMING COMMANDS IN A VEHICLE USING SPEECH AND IMAGE RECOGNITION

Systems and methods are disclosed herein for implementation of a vehicle command operation system that may use multi-modal technology to authenticate an occupant of the vehicle to authorize a command and receive natural language commands for vehicular operations. The system may utilize sensors to receive data indicative of a voice command from an occupant of the vehicle. The system may receive second sensor data to aid in the determination of the corresponding vehicular operation in response to the received command. The system may retrieve authentication data for the occupants of the vehicle. The system authenticates the occupant to authorize a vehicular operation command using a neural network based on at least one of the first sensor data, the second sensor data, and the authentication data. Responsive to the authentication, the system may authorize the operation to be performed in the vehicle based on the vehicular operation command.