Patent classifications
G10L2015/221
INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM
There is provided an information processing device including an analysis unit configured to analyze a character string indicating contents of utterance obtained as a result of speech recognition, and a display control unit configured to display the character string indicating the contents of the utterance and an analysis result on a display screen.
Intelligent list reading
Systems and processes for operating an intelligent automated assistant to perform intelligent list reading are provided. In one example process, a spoken user request associated with a plurality of data items is received. The process determines whether a degree of specificity of the spoken user request is less than a threshold level. In response to determining that a degree of specificity of the spoken user request is less than a threshold level, one or more attributes related to the spoken user request are determined. The one or more attributes are not defined in the spoken user request. Additionally, a list of data items based on the spoken user request and the one or more attributes is obtained. A spoken response comprising a subset of the list of data items is generated and the spoken response is provided.
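The flow in this abstract (check specificity against a threshold, infer undefined attributes, obtain a list, speak a subset) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and is not taken from the patent: the `Item` fields, the specificity measure (a count of attributes the user gave), the threshold, the inferred distance limit, and the sample data.

```python
from dataclasses import dataclass


@dataclass
class Item:
    name: str
    cuisine: str
    distance_km: float


# Hypothetical data items that a spoken request could refer to.
ITEMS = [
    Item("Luigi's", "italian", 1.2),
    Item("Sakura", "japanese", 0.4),
    Item("Taco Hut", "mexican", 3.5),
    Item("Roma", "italian", 0.9),
]

SPECIFICITY_THRESHOLD = 1  # assumed: a request with fewer attributes is "vague"
MAX_SPOKEN_ITEMS = 2       # assumed size of the spoken subset


def specificity(request: dict) -> int:
    """Assumed measure: the number of attributes the user explicitly gave."""
    return len(request)


def infer_attributes(request: dict) -> dict:
    """Supply attributes that the request did not define (here, a distance limit)."""
    inferred = {}
    if "max_distance_km" not in request:
        inferred["max_distance_km"] = 2.0
    return inferred


def respond(request: dict) -> str:
    constraints = dict(request)
    # Only vague requests get extra, inferred attributes.
    if specificity(request) < SPECIFICITY_THRESHOLD:
        constraints.update(infer_attributes(request))
    matches = [
        it for it in ITEMS
        if ("cuisine" not in constraints or it.cuisine == constraints["cuisine"])
        and ("max_distance_km" not in constraints
             or it.distance_km <= constraints["max_distance_km"])
    ]
    matches.sort(key=lambda it: it.distance_km)
    subset = matches[:MAX_SPOKEN_ITEMS]  # speak only part of the list
    if not subset:
        return "I found no matching places."
    return "I found " + ", ".join(it.name for it in subset) + "."
```

In this sketch, `respond({})` is treated as vague and is narrowed by the inferred distance limit, while `respond({"cuisine": "italian"})` is used as given.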
ELECTRONIC DEVICE FOR DISPLAYING VOICE RECOGNITION-BASED IMAGE
Disclosed is an electronic device. The electronic device according to an embodiment may include a microphone, a display, and a processor. The processor may be configured to receive a voice input of a user through the microphone, to identify a word having a plurality of meanings among one or more words recognized based on the voice input, in response to the voice input, and to display an image corresponding to one meaning selected from the plurality of meanings through the display in association with the word. Moreover, various embodiments found through the disclosure are possible.
Systems and methods for voice-assisted media content selection
Systems and methods for media playback via a media playback system include (i) capturing a voice input comprising a request for media content, (ii) receiving information derived at least from the request for media content, (iii) requesting and receiving information from at least one remote computing device associated with a first media content service and at least one remote computing device associated with a second media content service, wherein (a) the information identifies first media content available via the first media content service for playback and identifies second media content available via the second media content service for playback, and (b) the first and second media content are related to the requested media content, and (iv) after receiving at least one of the first information and the second information, (a) selecting the first media content instead of the second media content, and (b) playing back the first media content.
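As a rough illustration of the selection step, the sketch below queries two services for content related to a request and then picks one result over the other. The abstract does not say how the choice is made; the rule used here (prefer the earliest-listed service that returns a match) is an assumption, as are the service names, catalogs, and function names.

```python
from typing import Optional

# Hypothetical catalogs standing in for two remote media content services.
SERVICE_A = {"name": "service_a", "catalog": ["Blue Train", "Kind of Blue"]}
SERVICE_B = {"name": "service_b", "catalog": ["Kind of Blue (Live)", "Giant Steps"]}


def search(service: dict, query: str) -> Optional[str]:
    """Return the first catalog title containing the query, if any."""
    q = query.lower()
    for title in service["catalog"]:
        if q in title.lower():
            return title
    return None


def select_media(query: str, services: list) -> Optional[tuple]:
    """Ask every service, then choose the result of the earliest-listed
    service that has one (assumed preference rule)."""
    results = [(s["name"], search(s, query)) for s in services]
    for name, title in results:
        if title is not None:
            return name, title
    return None
```

With both services holding related content for "kind of blue", the first service's result is selected; a request that only the second service can satisfy falls through to it.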
Speech interface
A system (100) for enabling a user to select media content in an entertainment environment, comprising a remote control device (110) having a set of user-activated keys and a speech activation circuit adapted to enable a speech signal; a speech engine (160) comprising a speech recognizer (170); an application wrapper (180) configured to recognize substantive meaning in the speech signal; and a media content controller (190) configured to select media content. Every function that can be executed by activation of the user-activated keys can also be executed by the speech engine (160) in response to the recognized substantive meaning.
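The property that every key-activated function is also reachable by speech can be sketched by routing both inputs to one shared set of controller functions. The function names, key labels, and phrases below are hypothetical, and the exact-phrase lookup is only a stand-in for the speech recognizer and application wrapper.

```python
# Hypothetical functions of a media content controller.
def channel_up(state):
    state["channel"] += 1


def channel_down(state):
    state["channel"] -= 1


def mute(state):
    state["muted"] = not state["muted"]


# Remote-control keys and spoken phrases both map onto the same functions.
KEY_MAP = {"CH+": channel_up, "CH-": channel_down, "MUTE": mute}
PHRASE_MAP = {
    "channel up": channel_up,
    "next channel": channel_up,
    "channel down": channel_down,
    "mute": mute,
}


def press_key(state, key):
    KEY_MAP[key](state)


def speak(state, utterance):
    """Run the function matching the recognized phrase; return False if none matches."""
    action = PHRASE_MAP.get(utterance.strip().lower())
    if action is None:
        return False
    action(state)
    return True


def speech_covers_all_keys():
    """Check that each key-activated function is also reachable by some phrase."""
    return set(KEY_MAP.values()) <= set(PHRASE_MAP.values())
```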
Device for performing task corresponding to user utterance
An electronic device includes a touchscreen display, a microphone, at least one speaker, a processor, and a memory which stores instructions that cause the processor to receive a user utterance including a request for performing a task with the electronic device, to transmit data associated with the user utterance to an external server, to receive from the external server a response including sample utterances that are representative of an intent of the user utterance and are selected by the external server based on the user utterance, to display the sample utterances on the touchscreen display, to receive a user input to select one of the sample utterances, and to perform the task by causing the electronic device to follow a sequence of states associated with the selected one of the sample utterances.
SYSTEM AND METHOD OF FINDING AND ENGAGING WITH HISTORICAL MARKERS
The present invention relates to an application, a system, and a method for finding and exploring areas, and specifically historical markers. The application alerts travelers to road signs that list historical information and identifies them on a map. User input/output is adapted to accept spoken user commands and to play audio output to the user, the audio output comprising historical marker content and acknowledgement of audio commands. The functionality of the present invention includes a registration process, a database of markers (i.e., data), and locating and navigating the user to the nearest marker (i.e., navigating to a location). The user can add information to the selected marker.
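Locating the nearest marker can be sketched with a great-circle distance. The abstract does not name a distance method, so the haversine formula here is an assumption, and the marker titles and coordinates are made up for the example.

```python
import math

# Hypothetical marker database; titles and coordinates are illustrative only.
MARKERS = [
    {"title": "Old Mill", "lat": 40.0, "lon": -75.0},
    {"title": "Stone Bridge", "lat": 40.5, "lon": -75.0},
    {"title": "Fort Site", "lat": 41.0, "lon": -76.0},
]

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometers between two points given in degrees."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def nearest_marker(lat, lon, markers=MARKERS):
    """Return the marker closest to the user's position."""
    return min(markers, key=lambda m: haversine_km(lat, lon, m["lat"], m["lon"]))
```

A navigation step would then route the user to the coordinates of the returned marker.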
Voice application platform
Among other things, requests are received from voice assistant devices expressed in accordance with different corresponding protocols of one or more voice assistant frameworks. Each of the requests represents a voiced input by a user to the corresponding voice assistant device. The received requests are re-expressed in accordance with a common request protocol. Based on the received requests, responses to the requests are expressed in accordance with a common response protocol. Each of the responses is re-expressed according to a protocol of the framework with respect to which the corresponding request was expressed. The responses are sent to the voice assistant devices for presentation to the users.
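The re-expression steps can be illustrated as a pair of adapters around one common protocol. The request and response shapes for the two frameworks below are invented for the sketch and do not correspond to any real voice assistant framework; the common protocol fields (`framework`, `intent`, `user`, `speech`) are likewise assumptions.

```python
# Inbound adapters: re-express a framework-specific request in the common protocol.
def from_framework_a(raw):
    return {"framework": "a", "intent": raw["intentName"], "user": raw["userId"]}


def from_framework_b(raw):
    return {"framework": "b", "intent": raw["action"], "user": raw["session"]["user"]}


# Outbound adapters: re-express a common response in each framework's protocol.
def to_framework_a(resp):
    return {"outputSpeech": resp["speech"]}


def to_framework_b(resp):
    return {"fulfillmentText": resp["speech"]}


INBOUND = {"a": from_framework_a, "b": from_framework_b}
OUTBOUND = {"a": to_framework_a, "b": to_framework_b}


def handle_common(request):
    """Application logic sees only the common request protocol."""
    if request["intent"] == "greet":
        speech = f"Hello, {request['user']}."
    else:
        speech = "Sorry, I did not understand."
    return {"speech": speech}


def process(framework, raw):
    common_request = INBOUND[framework](raw)
    common_response = handle_common(common_request)
    # Reply in the protocol of the framework the request arrived in.
    return OUTBOUND[common_request["framework"]](common_response)
```

Supporting another framework in this design would mean adding one inbound and one outbound adapter, with `handle_common` left unchanged.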
Determining suggested subsequent user actions during digital assistant interaction
Systems and processes for operating an intelligent automated assistant are provided. An example process includes receiving an utterance including a user request, determining, based on the user request, a domain associated with the user request, determining, based on the domain, a first subsequent user action and a second subsequent user action, determining, based on the domain, a first parameter for the first subsequent user action and a second parameter for the second subsequent user action, in accordance with a determination that a first score associated with the first subsequent user action is higher than a score associated with the second subsequent user action, selecting the first subsequent user action as a suggested subsequent user action, and providing the suggested subsequent user action.
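A minimal sketch of the scoring step follows: determine a domain, look up candidate subsequent actions with their parameters, and suggest the one with the higher score. The keyword-based domain detection, the action names, the parameters, and the scores are all hypothetical placeholders.

```python
# Hypothetical per-domain candidates: each has an action, a parameter, and a score.
DOMAIN_ACTIONS = {
    "weather": [
        {"action": "show_forecast", "parameter": "tomorrow", "score": 0.8},
        {"action": "set_alert", "parameter": "rain", "score": 0.5},
    ],
    "music": [
        {"action": "add_to_playlist", "parameter": "favorites", "score": 0.4},
        {"action": "play_similar", "parameter": "artist", "score": 0.7},
    ],
}

# Assumed keyword lookup standing in for real domain determination.
DOMAIN_KEYWORDS = {"weather": ("weather", "forecast"), "music": ("play", "song")}


def detect_domain(utterance):
    text = utterance.lower()
    for domain, keywords in DOMAIN_KEYWORDS.items():
        if any(k in text for k in keywords):
            return domain
    return None


def suggest(utterance):
    """Return (action, parameter) for the highest-scoring candidate, or None."""
    domain = detect_domain(utterance)
    if domain is None:
        return None
    best = max(DOMAIN_ACTIONS[domain], key=lambda c: c["score"])
    return best["action"], best["parameter"]
```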