Patent classifications
G10L17/10
Method of collating, abstracting, and delivering worldwide viewpoints
The present invention provides a system and method for presenting global issues to users and followers of a social media platform, allowing the users and followers to provide viewpoints on the global issues, ensuring that the users providing the viewpoints are authentic, and analyzing the various viewpoints to develop statistical data including the location of those providing viewpoints. The present invention also allows a user to present a global issue for consideration by users of the platform, for example, a social media internet-based website, and allows followers of the user to provide their viewpoints on such global issue. Simultaneously, the location of said followers will be collected and collated along with their responses.
METHODS AND SYSTEMS FOR VOICE COMMAND TRANSACTION AUTHENTICATION
The disclosure describes techniques for authenticating a voice command transaction initiated by a consumer at a voice-activated digital assistant. The system determine a proximity score for a consumer device. The proximity score is based on a distance between the consumer device and the digital assistant. The system detects motion indicating presence of a person proximate the digital assistant (based on wireless signal data) and determines a motion score. The system receives voice data from the digital assistant and determines a voice differentiation score. The system determines a rank score for the consumer device based on historical transaction data associated with the device. The system analyzes the proximity score, motion score, voice differentiation score, and rank score using a voice command authentication model. Based on the analysis, the system determines an authentication score for the voice command transaction request.
VOICE USER INTERFACE
A received signal represents a user's speech. A first speaker recognition process is performed on a first portion of the received signal, to obtain a first output result. A second speaker recognition process is performed on a second portion of the received signal that is different from the first portion of the received signal, to obtain a second output result. The second speaker recognition process is different from the first speaker recognition process. The first and second output results are combined to obtain a combined output result indicating a likelihood that the user is a registered user.
METHOD AND DEVICE FOR IDENTIFYING USER USING BIO-SIGNAL
Provided is a user identifying method using a bio-signal, the method including sensing a user input; detecting a bio-signal from the sensed user input; determining whether the detected bio-signal is valid, based on status information representing a status of a user at a moment when the user input is sensed; and identifying the user by comparing the bio-signal with at least one pre-stored reference bio-signal, according to a result of the comparing.
METHOD AND DEVICE FOR IDENTIFYING USER USING BIO-SIGNAL
Provided is a user identifying method using a bio-signal, the method including sensing a user input; detecting a bio-signal from the sensed user input; determining whether the detected bio-signal is valid, based on status information representing a status of a user at a moment when the user input is sensed; and identifying the user by comparing the bio-signal with at least one pre-stored reference bio-signal, according to a result of the comparing.
Speaker identification
A method of speaker identification comprises receiving an audio signal representing speech; performing a first voice biometric process on the audio signal to attempt to identify whether the speech is the speech of an enrolled speaker; and, if the first voice biometric process makes an initial determination that the speech is the speech of an enrolled user, performing a second voice biometric process on the audio signal to attempt to identify whether the speech is the speech of the enrolled speaker. The second voice biometric process is selected to be more discriminative than the first voice biometric process.
Speaker identification
A method of speaker identification comprises receiving an audio signal representing speech; performing a first voice biometric process on the audio signal to attempt to identify whether the speech is the speech of an enrolled speaker; and, if the first voice biometric process makes an initial determination that the speech is the speech of an enrolled user, performing a second voice biometric process on the audio signal to attempt to identify whether the speech is the speech of the enrolled speaker. The second voice biometric process is selected to be more discriminative than the first voice biometric process.
METHOD AND SYSTEM FOR VOICE-BASED USER AUTHENTICATION AND CONTENT EVALUATION
The disclosed embodiments illustrate methods for voice-based user authentication and content evaluation. The method includes receiving a voice input of a user from a user-computing device, wherein the voice input corresponds to a response to a query. The method further includes authenticating the user based on a comparison of a voiceprint of the voice input and a sample voiceprint of the user. Further, the method includes evaluating content of the response of the user based on the authentication and a comparison between text content and a set of pre-defined answers to the query, wherein the text content is determined based on the received voice input.
Systems and Methods for Improved Digital Transcript Creation Using Automated Speech Recognition
This disclosure relates generally to systems, methods, and computer readable media for providing improved insights and annotations to enhance recorded audio, video, and/or written transcriptions of testimony. For example, in some embodiments, a method is disclosed for correlating non-verbal cues recognized from an audio and/or video recording of testimony to the corresponding testimony transcript locations. In other embodiments, a method is disclosed for providing testimony-specific artificial intelligence-based insights and annotations to a testimony transcript, e.g., based on the use of machine learning, natural language processing, and/or other techniques. In still other embodiments, a method is disclosed for providing smart citations to a testimony transcript, e.g., which track the location of semantic constructs within the transcript over the course of various modifications being made to the transcript. In yet other embodiments, a method is disclosed for providing intelligent speaker identification-related insights and annotations to an audio recording of a testimony transcript.
Apparatus and method for providing a reliable voice interface between a system and multiple users
A communication interface apparatus for a system and a plurality of users is provided. The communication interface apparatus for the system and the plurality of users includes a first process unit configured to receive voice information and face information from at least one user, and determine whether the received voice information is voice information of at least one registered user based on user models corresponding to the respective received voice information and face information; a second process unit configured to receive the face information, and determine whether the at least one user's attention is on the system based on the received face information; and a third process unit configured to receive the voice information, analyze the received voice information, and determine whether the received voice information is substantially meaningful to the system based on a dialog model that represents conversation flow on a situation basis.