Patent classifications
G10L15/34
SYLLABLE BASED AUTOMATIC SPEECH RECOGNITION
Systems, methods, and computer programs are described which utilize the structure of syllables as an organizing element of automated speech recognition processing to overcome variations in pronunciation, to efficiently resolve confusable aspects, to exploit context, and to map the speech to orthography.
Collaborative artificial intelligence (AI) voice response system control for authorizing a command associated with a calendar event
Described herein are methods, computer-readable media, techniques, and methodologies for collaborative control of an artificial intelligence (AI) voice response system and an AI voice response system configured for collaborative interaction and control. A user command submitted by a first user to an AI voice response client device is evaluated against a user profile associated with the first user to determine whether the user command requires authorization from a second user prior to execution. If authorization is required, the user command, or the portion(s) of it requiring authorization, is sent to a user device of the second user. The second user is provided with the capability to approve, reject, or modify the user command. If the command is approved or modified, the user command or the modified command is executed and the execution result is presented to the first user via the AI voice response client device or another device.
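The approval flow in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: `UserProfile`, `Decision`, `handle_command`, the `restricted` category set, and the `ask_approver` and `execute` callbacks are all hypothetical names introduced here.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class UserProfile:
    # Hypothetical profile: command categories that need a second user's approval.
    name: str
    restricted: set = field(default_factory=set)
    approver: Optional[str] = None


@dataclass
class Decision:
    action: str                              # "approve", "reject", or "modify"
    modified_command: Optional[str] = None   # only used when action == "modify"


def handle_command(profile, category, command, ask_approver, execute):
    """Execute the command directly, or route it to the approver first."""
    if category not in profile.restricted:
        return execute(command)              # no authorization required
    decision = ask_approver(profile.approver, command)
    if decision.action == "approve":
        return execute(command)
    if decision.action == "modify" and decision.modified_command is not None:
        return execute(decision.modified_command)
    return None                              # rejected: nothing is executed


profile = UserProfile("child", {"calendar"}, "parent")
run = lambda cmd: f"done: {cmd}"
moved = handle_command(
    profile, "calendar", "add party at 9pm",
    lambda approver, cmd: Decision("modify", "add party at 6pm"), run,
)
```

In this sketch the second user's modification replaces the original command before execution, and the value returned by `execute` stands in for the result presented back to the first user.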
Orchestrating execution of a series of actions requested to be performed via an automated assistant
Implementations are set forth herein for creating an order of execution for actions that were requested by a user via a spoken utterance to an automated assistant. The order of execution for the requested actions can be based on how each requested action can, or is predicted to, affect other requested actions. In some implementations, an order of execution for a series of actions can be determined based on an output of a machine learning model, such as a model that has been trained according to supervised learning. A particular order of execution can be selected to mitigate waste of processing, memory, and network resources, at least relative to other possible orders of execution. Using interaction data that characterizes past performances of automated assistants, certain orders of execution can be adapted over time, thereby allowing the automated assistant to learn from past interactions with one or more users.
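One way to picture the ordering step is to score every candidate order and keep the cheapest. The sketch below assumes a scoring function is available; `toy_cost` is a hand-written stand-in for the trained model's output, and the action names and durations are invented for illustration.

```python
from itertools import permutations

# Hypothetical per-action durations in seconds; a real system would not
# hard-code these but obtain a score from a trained model.
DURATIONS = {"set alarm": 1, "play podcast": 30, "tell weather": 5}


def toy_cost(order):
    """Stand-in for a model score: total time the user waits for each result."""
    elapsed = 0
    total_wait = 0
    for action in order:
        elapsed += DURATIONS[action]
        total_wait += elapsed
    return total_wait


def choose_order(actions, predicted_cost):
    """Return the ordering of `actions` with the lowest predicted cost."""
    return min(permutations(actions), key=predicted_cost)


best = choose_order(list(DURATIONS), toy_cost)
```

Enumerating permutations is only practical for the handful of actions a single utterance usually contains; it is used here because it keeps the selection logic obvious.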
METHOD FOR PROCESSING USER INPUT OF VOICE ASSISTANT
Provided is a method of processing a user input to deliver the user input to at least one of a plurality of assistants. The method includes: converting a user input including a voice signal, based on a predetermined rule, to generate an instruction; splitting a complex instruction into partial instructions when the generated instruction is a complex instruction requesting two or more events; and determining a domain of each of the partial instructions and distributing the partial instructions to at least one of a plurality of voice assistants based on the domain. According to an embodiment, the washer may be related to artificial intelligence (AI) modules, unmanned aerial vehicles (UAVs), robots, augmented reality (AR) devices, virtual reality (VR) devices, and 5G service-related devices.
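The split-and-distribute steps might look like the sketch below. It assumes the voice signal has already been converted to text, splits on "and" or commas as a crude stand-in for the abstract's predetermined rule, and uses a hypothetical keyword table (`DOMAIN_KEYWORDS`) to pick a domain.

```python
import re

# Hypothetical keyword table; a real system would use a trained domain classifier.
DOMAIN_KEYWORDS = {
    "music": ("play", "song"),
    "weather": ("weather", "forecast"),
    "home": ("light", "thermostat"),
}


def split_instruction(instruction):
    """Split a complex instruction into partial instructions."""
    parts = re.split(r"\s+and\s+|\s*,\s*", instruction.strip())
    return [p for p in parts if p]


def domain_of(partial):
    """Return the first domain whose keywords appear in the partial instruction."""
    words = partial.lower().split()
    for domain, keywords in DOMAIN_KEYWORDS.items():
        if any(k in words for k in keywords):
            return domain
    return "unknown"


def distribute(instruction):
    """Group partial instructions by the domain of the assistant that should get them."""
    routed = {}
    for partial in split_instruction(instruction):
        routed.setdefault(domain_of(partial), []).append(partial)
    return routed


routed = distribute("play some jazz and turn on the light")
```

Each key of `routed` names the assistant domain that would receive the listed partial instructions.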
Electronic apparatus for recognizing keyword included in your utterance to change to operating state and controlling method thereof
An apparatus comprising one or more processors, a communication circuit, and a memory for storing instructions which, when executed, perform a method of recognizing a user utterance. The method comprises: receiving first data associated with a user utterance, performing a first determination to determine whether the user utterance includes the first data and a specified word, performing a second determination to determine whether the first data includes the specified word, transmitting the first data to an external server, receiving a text generated from the first data by the external server, performing a third determination to determine whether the received text matches the specified word, and determining whether to activate the voice-based input system based on the third determination.
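The three determinations can be read as a cascade in which the final server-side text match decides activation. The sketch below assumes that each stage gates the next, which the abstract does not state explicitly; `first_check`, `second_check`, and `transcribe` are hypothetical callbacks standing in for the two on-device detectors and the external server.

```python
def should_activate(first_data, keyword, first_check, second_check, transcribe):
    """Decide whether to activate the voice-based input system."""
    # First and second determinations: on-device keyword checks (assumed to gate).
    if not first_check(first_data, keyword):
        return False
    if not second_check(first_data, keyword):
        return False
    # Third determination: compare the server's transcript with the keyword.
    text = transcribe(first_data)  # stands in for the external server round trip
    return text.strip().lower() == keyword.lower()


# Toy stand-ins: "audio" is represented as a plain string here.
loose = lambda data, kw: kw[:2].lower() in data.lower()
strict = lambda data, kw: kw.lower() in data.lower()
server = lambda data: data

activated = should_activate("Hello", "hello", loose, strict, server)
```

Running the cheap local checks first means audio is only sent to the server when the keyword is already likely to be present.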
Method for processing the output of a speech recognizer
A method for processing speech, comprising semantically parsing a received natural language speech input with respect to a plurality of predetermined command grammars in an automated speech processing system; determining if the parsed speech input unambiguously corresponds to a command and is sufficiently complete for reliable processing, then processing the command; if the speech input ambiguously corresponds to a single command or is not sufficiently complete for reliable processing, then prompting a user for further speech input to reduce ambiguity or increase completeness, in dependence on a relationship of previously received speech input and at least one command grammar of the plurality of predetermined command grammars, reparsing the further speech input in conjunction with previously parsed speech input, and iterating as necessary. The system also monitors abort, fail or cancel conditions in the speech input.
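The parse, prompt, and reparse loop can be sketched as below. This is an illustration under simplifying assumptions: the two regex-based entries in `GRAMMARS`, the `CANCEL` word list, and the `prompt` callback are hypothetical, and input is treated as text rather than recognizer output.

```python
import re

# Hypothetical command grammars: a trigger pattern plus one regex per required slot.
GRAMMARS = {
    "set_timer": {"trigger": r"\btimer\b", "slots": {"minutes": r"(\d+)\s+minutes?"}},
    "call": {"trigger": r"\bcall\b", "slots": {"contact": r"\bcall\s+(\w+)"}},
}
CANCEL = re.compile(r"\b(cancel|abort|never mind)\b")


def parse(text):
    """Return (command, filled_slots, missing_slots) for every grammar that matches."""
    candidates = []
    for name, grammar in GRAMMARS.items():
        if not re.search(grammar["trigger"], text):
            continue
        slots = {}
        for slot, pattern in grammar["slots"].items():
            match = re.search(pattern, text)
            if match:
                slots[slot] = match.group(1)
        missing = [s for s in grammar["slots"] if s not in slots]
        candidates.append((name, slots, missing))
    return candidates


def process(first_input, prompt, max_turns=3):
    """Reparse accumulated input until one complete command remains, or give up."""
    heard = first_input.lower()
    for turn in range(max_turns + 1):
        if CANCEL.search(heard):
            return None                      # abort or cancel condition
        candidates = parse(heard)
        if len(candidates) == 1 and not candidates[0][2]:
            return candidates[0][0], candidates[0][1]
        if turn == max_turns:
            break
        if len(candidates) == 1:
            question = "Please give: " + ", ".join(candidates[0][2])
        else:
            question = "Which command did you mean?"
        # Further input is parsed together with what was heard before.
        heard = heard + " " + prompt(question).lower()
    return None


answers = iter(["10 minutes"])
result = process("Set a timer", lambda question: next(answers))
```

The follow-up answer is appended to the earlier input before reparsing, which mirrors the abstract's step of reparsing further speech in conjunction with previously parsed speech.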