Patent classifications
G10L2015/223
Query rephrasing using encoder neural network and decoder neural network
A method comprising receiving first data representative of a query. A representation of the query is generated using an encoder neural network and the first data. Words for a rephrased version of the query are selected from a set of words comprising a first subset of words comprising words of the query and a second subset of words comprising words absent from the query. Second data representative of the rephrased version of the query is generated.
Task resumption in a natural understanding system
A speech-processing system may provide access to one or more skills via spoken commands and/or responses in the form of synthesized speech. The system may be capable of keeping one or more skills active in the background while a user interacts (e.g., provides inputs to and/or receives outputs from) with a skill running in the foreground. A background skill may receive some trigger data, and determine to request the system to return the background skill to the foreground to, for example, request a user input regarding an action previously requested by the user. In some cases, the user may invoke a background skill to continue a previous interaction. The system may return the background skill to the foreground. The resumed skill may continue a previous interaction to, for example, to query the user for instructions, provide an update or alert, or continue a previous output.
Electronic device configured to perform action using speech recognition function and method for providing notification related to action using same
A method includes receiving a designated event related to a second application while an execution screen of a first application is displayed on a display. The method also includes executing an artificial intelligent application in response to the designated event. The method further includes transmitting data related to the designated event to an external server, based on the executed artificial intelligent application. Additionally, the method includes sensing a user utterance related to the designated event for a designated period of time. The method also includes transmitting the user utterance to the external server. The method further includes receiving an action order for performing a function related to the user utterance from the external server. The method also includes executing the second application at least based on the received action order. The method further includes outputting a result of performing the function by using the second application.
Providing composite graphical assistant interfaces for controlling various connected devices
Methods, apparatus, systems, and computer-readable media are provided for tailoring composite graphical assistant interfaces for interacting with multiple different connected devices. The composite graphical assistant interfaces can be generated proactively and/or in response to a user providing a request for an automated assistant to cause a connected device to perform a particular function. In response to the automated assistant receiving the request, the automated assistant can identify other connected devices, and other functions capable of being performed by the other connected devices. The other functions can then be mapped to various graphical control elements in order to provide a composite graphical assistant interface from which the user can interact with different connected devices. Each graphical control element can be arranged to reflect how each connected device is operating simultaneous to the presentation of the composite graphical assistant interface.
Method and apparatus for evaluating user intention understanding satisfaction, electronic device and storage medium
A method and apparatus for generating a user intention understanding satisfaction evaluation model, a method and apparatus for evaluating a user intention understanding satisfaction, an electronic device and a storage medium are provided, relating to intelligent voice recognition and knowledge graphs. The method for generating a user intention understanding satisfaction evaluation model is: acquiring a plurality of sets of intention understanding data, at least one set of which comprises a plurality of sequences corresponding to multi-round behaviors of an intelligent device in multi-round man-machine interactions; and learning the plurality of sets of intention understanding data through a first machine learning model, to obtain the user intention understanding satisfaction evaluation model after the learning, wherein the user intention understanding satisfaction evaluation model is configured to evaluate user intention understanding satisfactions of the intelligent device in the multi-round man-machine interactions according to the plurality of sequences corresponding to the multi-round man-machine interactions.
Information processing device, information processing method, and storage medium storing information processing program
An information processing device acquires question information. The information processing device acquires vehicle state information representing a state of the vehicle. The information processing device acquires answer information in response to the question information, the answer information including an image for display. The information processing device, in a case in which the vehicle state information represents that the vehicle is traveling, stores the answer information in a storage. The information processing device, in a case in which the information processing device acquires vehicle state information representing that the vehicle is stopped, outputs the answer information stored in the storage.
Multi-services gateway device at user premises
An application gateway including application service programming positioned at a user premises can provide voice controlled and managed services to a user and one or more endpoint devices associated with the application gateway. The application gateway can be controlled remotely by the application service provider through a service management center and configured to execute an application service provided from the application service provider. The application gateway can execute the application service at the user premises upon voice command by a user and independent of application services executing on the application service provider's network. An application service logic manager can communicate with an application service enforcement manager to verify that the request conforms with the policy and usage rules associated with the application service in order to authorize execution of the application service on the application gateway, either directly or through endpoint devices.
Electronic apparatus and control method thereof
An electronic apparatus is provided. The electronic apparatus includes a microphone, a memory configured to store a plurality of keyword recognition models, and a processor, which is coupled with the microphone and the memory, configured to control the electronic apparatus, wherein the processor is configured to selectively execute at least one keyword recognition model among the plurality of keyword recognition models based on operating state information of the electronic apparatus, based on a first user voice being input through the microphone, identify whether at least one keyword corresponding to the executed keyword recognition model is included in the first user voice by using the executed keyword recognition model, and based on at least one keyword identified as being included in the first user voice, perform an operation of the electronic apparatus corresponding to the at least one keyword.
Photo album management method, storage medium and electronic device
The present disclosure provides a photo album management method. The method includes obtaining voice search information from a user, performing intent recognition on the voice search information to obtain an intent recognition result which indicates an intent of the user for a photo album, obtaining a voiceprint feature from the voice search information to determine identity information of the user, sending the intent recognition result and the identity information of the user, and opening the photo album according to the intent recognition result and the identity information.
Artificial intelligence device and method of operating artificial intelligence device
An artificial intelligence device includes a microphone configured to receive a speech command, a speaker, a communication unit configured to perform communication with an external artificial intelligence device, and a processor configured to receive a wake-up command through the microphone, acquire a first speech quality level of the received wake-up command, receive a second speech quality level of the wake-up command input to the external artificial intelligence device from the external artificial intelligence device through the communication unit, output a notification indicating that the artificial intelligence device is selected as an object to be controlled through the speaker, when the first speech quality level is larger than the second speech quality level, receive an operation command through the microphone, acquire an intention of the received operation command and transmit the operation command to an external artificial intelligence device which will perform operation corresponding to the operation command according to the acquired intention through the communication unit.