Patent classifications
G10L2015/221
Voice recognition system and recognition result display apparatus
A voice recognition system, when deciding that a user is about to select a voice recognition result displayed on a screen while the system continuously recognizes an uttered voice, stops the display update of the recognition result on the display unit, thereby preventing the recognition result from disappearing or changing to another recognition result at the moment when the user is about to select it. This makes it possible to prevent a user from carrying out an unwanted operation, and to improve the convenience of the user.
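The display-freeze behavior described in this abstract can be illustrated with a minimal Python sketch. The class `ResultDisplay` and its method names are hypothetical, and how the system decides that a user is about to select a result is not specified here, so that decision is reduced to a flag set by the caller.

```python
from typing import List


class ResultDisplay:
    """Hypothetical display that ignores updates while a selection is imminent."""

    def __init__(self) -> None:
        self.shown: List[str] = []
        self.frozen = False

    def set_user_about_to_select(self, about_to_select: bool) -> None:
        # Assumption: some external detector (e.g. touch proximity) sets this flag.
        self.frozen = about_to_select

    def update(self, results: List[str]) -> bool:
        """Show new recognition results unless updates are frozen.

        Returns True if the display changed, False if the update was skipped.
        """
        if self.frozen:
            return False
        self.shown = list(results)
        return True
```

While the flag is set, new recognition results are dropped rather than queued; a real system might instead buffer them and show the latest once the flag clears.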
Voice recognition method, voice controlling method, information processing method, and electronic apparatus
The present disclosure provides a voice recognition method for use in an electronic apparatus comprising a voice input module. The method comprises: receiving voice data by the voice input module; performing a first pattern voice recognition on the received voice data, including identifying whether the voice data comprises a first voice recognition information; performing a second pattern voice recognition on the voice data if the voice data comprises the first voice recognition information; and performing or refusing an operation corresponding to the first voice recognition information according to a result of the second pattern voice recognition. The present disclosure also provides a voice controlling method, an information processing method, and an electronic apparatus.
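The two-stage flow in this abstract can be sketched as follows. This is only an illustration under stated assumptions: `VoiceData`, `two_stage_recognition`, and the two example predicates are hypothetical names, voice data is reduced to a transcript plus a speaker label, and the abstract does not say what the second pattern recognition actually checks (speaker verification is used here purely as an example).

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class VoiceData:
    text: str        # assumed transcript of the utterance
    speaker_id: str  # assumed speaker label, for illustration only


def two_stage_recognition(
    data: VoiceData,
    first_pattern: Callable[[VoiceData], bool],
    second_pattern: Callable[[VoiceData], bool],
) -> str:
    """Run the second recognition only if the first one finds its target."""
    if not first_pattern(data):
        return "no_command"  # first voice recognition information absent
    # The second result decides whether the operation is performed or refused.
    return "performed" if second_pattern(data) else "refused"


def contains_unlock(data: VoiceData) -> bool:
    # Example first pattern: a keyword check.
    return "unlock" in data.text.lower()


def is_enrolled_speaker(data: VoiceData) -> bool:
    # Example second pattern: a stand-in for speaker verification.
    return data.speaker_id in {"alice"}
```

For example, `two_stage_recognition(VoiceData("Unlock the door", "bob"), contains_unlock, is_enrolled_speaker)` yields `"refused"` because the keyword is present but the speaker check fails.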
Multiple parallel delineated topics of a conversation within the same virtual assistant
Provided are embodiments for a computer-implemented method for interacting with a user by an automated response system supporting topic switching and information collection. The computer-implemented method includes receiving a plurality of utterances from the user by the automated response system, and analyzing the utterances to form a first topic thread and an information collection objective. The computer-implemented method also includes utilizing an information collection user interface to gather data to support the information collection objective, and providing responses to the user after gathering the data related to the first topic thread. Also provided are embodiments for a system and computer program product for implementing the techniques described herein.
VOICE PROCESSING SYSTEM AND VOICE PROCESSING METHOD
Provided is a voice processing system including: a voice acquisition unit that acquires an utterance of a user; a voice recognition unit that recognizes contents of the utterance acquired by the voice acquisition unit; an execution unit that executes processing based on a result of the voice recognition unit; and a determination unit that determines a candidate of a word to substitute a certain word, wherein when the voice recognition unit recognizes a request including a first word, the execution unit executes the request including the first word and provides the user with a processing result, the determination unit determines a second word that is a substitute candidate of the first word, and the execution unit also executes a request including the second word instead of the first word and stores a processing result in a storage unit.
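The substitute-word behavior in this abstract amounts to answering the recognized request and precomputing answers for likely alternatives. The sketch below assumes a request can be modeled as a string template with one word slot; `handle_request`, the `substitutes` table, and the `execute` callback are hypothetical and stand in for the determination unit and execution unit.

```python
from typing import Callable, Dict, List


def handle_request(
    template: str,
    first_word: str,
    substitutes: Dict[str, List[str]],
    execute: Callable[[str], str],
    storage: Dict[str, str],
) -> str:
    """Execute the request for first_word and pre-store results for substitutes."""
    # Result for the recognized request, returned to the user.
    result = execute(template.format(word=first_word))
    # Results for substitute candidates are stored, not shown.
    for second_word in substitutes.get(first_word, []):
        storage[second_word] = execute(template.format(word=second_word))
    return result
```

If the first word turns out to be a misrecognition, a result for the substitute is already in `storage` and can be returned without executing the request again.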
VIRTUAL ASSISTANT IDENTIFICATION OF NEARBY COMPUTING DEVICES
In one example, a method comprises: receiving audio data generated by a microphone of a current computing device; identifying, based on the audio data, one or more computing devices that each emitted a respective audio signal in response to speech reception being activated at the current computing device; and selecting either the current computing device or a particular computing device from the identified one or more computing devices to satisfy a spoken utterance determined based on the audio data.
Speech Recognition
A computer system comprises an input configured to receive voice input from a user, the voice input having speech intervals separated by non-speech intervals; an ASR system configured to identify individual words in the voice input during speech intervals thereof, and store the identified words in memory; a response generation module configured to generate based on the words stored in the memory an audio response for outputting to the user; and a response delivery module configured to begin outputting the audio response to the user during a non-speech interval of the voice input, wherein the outputting of the audio response is terminated before it has completed in response to a subsequent speech interval of the voice input commencing whilst the audio response is still being outputted.
Speech Recognition
A computer system comprises an input configured to receive voice input from a user, the voice input having speech intervals separated by non-speech intervals; an ASR system configured to identify individual words in the voice input during speech intervals of the voice input, and store the identified words in memory; a speech overload detection module configured to detect at a time during a speech interval of the voice input a speech overload condition; and a notification module configured to output to the user, in response to said detection, a notification of the speech overload condition.
Conversational Software Agent
Voice input is received from a user. An ASR system generates in memory a set of words it has identified in the voice input, and updates the set each time it identifies a new word in the voice input to add the new word to the set. A condition indicative of speech inactivity in the voice input is detected. A response for outputting to the user is generated based on the set of identified words, in response to the detection of the speech inactivity condition. The generated response is outputted to the user after an interval of time, commencing with the detection of the speech inactivity condition, has ended and only if no more words have been identified in the voice input by the ASR system in that interval of time.
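The timing rule in this abstract (generate a response on inactivity, but output it only if no new word arrives during a waiting interval) can be sketched with explicit timestamps. `ResponseGate`, its methods, and the placeholder response text are hypothetical; a real agent would generate the response from the words with a dialogue model rather than echoing them.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ResponseGate:
    wait_seconds: float
    words: List[str] = field(default_factory=list)
    inactivity_at: Optional[float] = None
    pending: Optional[str] = None

    def on_word(self, word: str) -> None:
        """ASR identified a new word: add it and discard any pending response."""
        self.words.append(word)
        self.inactivity_at = None
        self.pending = None

    def on_inactivity(self, now: float) -> None:
        """Speech inactivity detected: generate a response and start the interval."""
        self.inactivity_at = now
        self.pending = "You said: " + " ".join(self.words)  # placeholder response

    def poll(self, now: float) -> Optional[str]:
        """Return the response once the interval has ended with no new words."""
        if self.pending is None or self.inactivity_at is None:
            return None
        if now - self.inactivity_at < self.wait_seconds:
            return None
        response, self.pending, self.inactivity_at = self.pending, None, None
        return response
```

Passing the clock in as `now` keeps the sketch deterministic; a word arriving inside the interval simply clears the pending response, so nothing is output until inactivity is detected again.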
INFORMATION PROCESSOR AND INFORMATION PROCESSING METHOD
An information processor according to the present disclosure includes an acquisition unit that acquires first information serving as a trigger for interaction, second information indicating an answer to the first information, and third information indicating a response to the second information; and a collection unit that collects a combination of the first information, the second information, and the third information acquired by the acquisition unit.
Intent-specific automatic speech recognition result generation
Features are disclosed for generating intent-specific results in an automatic speech recognition system. The results can be generated by utilizing a decoding graph containing tags that identify portions of the graph corresponding to a given intent. The tags can also identify high-information content slots and low-information carrier phrases for a given intent. The automatic speech recognition system may utilize these tags to provide a semantic representation based on a plurality of different tokens for the content slot portions and low information for the carrier portions. A user can be presented with a user interface containing top intent results with corresponding intent-specific top content slot values.