Patent classifications
G10L15/08
USING A SMARTPHONE TO CONTROL ANOTHER DEVICE BY VOICE
A method and system for implementing a speech-enabled interface of a host device via an electronic mobile device in a network are provided. The method includes establishing a communication session between the host device and the mobile device via a session service provider. According to some embodiments, a barcode can be adopted to enable the pairing of the host device and mobile device. Furthermore, the present method and system employ the voice interface in conjunction with speech recognition systems and natural language processing to interpret voice input for the hosting device, which can be used to perform one or more actions related to the hosting device.
Methods, systems and apparatuses for improved speech recognition and transcription
Methods, systems, and apparatuses for improved speech recognition and transcription of user utterances are described herein. User utterances may be processed by a speech recognition computing device as well as an acoustic model. The acoustic model may be trained using historical user utterance data and machine learning techniques. The acoustic model may be used to determine whether a transcription determined by the speech recognition computing device should be overridden with an updated transcription.
Methods, systems and apparatuses for improved speech recognition and transcription
Methods, systems, and apparatuses for improved speech recognition and transcription of user utterances are described herein. User utterances may be processed by a speech recognition computing device as well as an acoustic model. The acoustic model may be trained using historical user utterance data and machine learning techniques. The acoustic model may be used to determine whether a transcription determined by the speech recognition computing device should be overridden with an updated transcription.
Enabling speech interactions on web-based user interfaces
Web content with a speech interaction user interface capability is provided. Interactable elements of the web content are identified. For each of the interactable elements, one or more associated identifiers are determined and associated with a corresponding interactable element of the identified interactable elements in a data structure. A speech input is received from a user. Using the data structure, one of the interactable elements is matched to the received speech input. An action is automatically performed on the matched interactable element.
Enabling speech interactions on web-based user interfaces
Web content with a speech interaction user interface capability is provided. Interactable elements of the web content are identified. For each of the interactable elements, one or more associated identifiers are determined and associated with a corresponding interactable element of the identified interactable elements in a data structure. A speech input is received from a user. Using the data structure, one of the interactable elements is matched to the received speech input. An action is automatically performed on the matched interactable element.
Customizing search results in a multi-content source environment
Described herein are various embodiments for customizing search results in a multi-content source environment. An embodiment operates by receiving input corresponding to a search from a user and retrieving a content history indicating which content was previously viewed by the user. It is determined that the content of the content history is organized into one or more preconfigured categories. A new category of content is generated based on the content history for the user. The content of the content history for user is arranged based on both the new category and at least a subset of the one or more preconfigured categories. The arranged content is displayed in a manner customized to the user.
Customizing search results in a multi-content source environment
Described herein are various embodiments for customizing search results in a multi-content source environment. An embodiment operates by receiving input corresponding to a search from a user and retrieving a content history indicating which content was previously viewed by the user. It is determined that the content of the content history is organized into one or more preconfigured categories. A new category of content is generated based on the content history for the user. The content of the content history for user is arranged based on both the new category and at least a subset of the one or more preconfigured categories. The arranged content is displayed in a manner customized to the user.
Inferred activity based conference enhancement method and system
A method and system for optimizing conference session activities within a conference space, the system comprising at least a first sensor for sensing session characteristics and a processor receiving the sensed session characteristics and identifying a conference paradigm based on the sensed session characteristics, identifying an action sequence to perform as a function of the conference paradigm and performing the action sequence, wherein the action sequence results in automatic presentation of at least some information to at least a subset of conferees within the conference space.
Inferred activity based conference enhancement method and system
A method and system for optimizing conference session activities within a conference space, the system comprising at least a first sensor for sensing session characteristics and a processor receiving the sensed session characteristics and identifying a conference paradigm based on the sensed session characteristics, identifying an action sequence to perform as a function of the conference paradigm and performing the action sequence, wherein the action sequence results in automatic presentation of at least some information to at least a subset of conferees within the conference space.
Personal Voice-Based Information Retrieval System
The present invention relates to a system for retrieving information from a network such as the Internet. A user creates a user-defined record in a database that identifies an information source, such as a web site, containing information of interest to the user. This record identifies the location of the information source and also contains a recognition grammar based upon a speech command assigned by the user. Upon receiving the speech command from the user that is described within the recognition grammar, a network interface system accesses the information source and retrieves the information requested by the user.