G10L25/00

Hardware command device with audio privacy features
10629190 · 2020-04-21

A hardware device may receive a command from a user and then respond to that command with information. Such commands may cause an audible response to be broadcast via a speaker. A device's audio reply to a command may include sensitive details that a person may not wish to share. When a device makes such an audio reply, it may therefore divulge sensitive information to one or more other people who are within listening range. A person using such a device may thus inadvertently compromise their own privacy. The present disclosure includes techniques that are usable to mitigate such privacy exposures by detecting the presence of a second person in the surrounding environment and creating a reply that omits some or all of the sensitive information that might otherwise have been broadcast by a command device.
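The reply-filtering idea in this abstract can be illustrated with a minimal sketch. The `ReplyPart` class, its `sensitive` flag, and the `other_person_present` argument are hypothetical names chosen for illustration; the abstract does not specify how presence is detected or how sensitive content is marked.

```python
from dataclasses import dataclass


@dataclass
class ReplyPart:
    """One fragment of a spoken reply (hypothetical structure)."""
    text: str
    sensitive: bool = False  # assumed flag; not defined in the abstract


def build_reply(parts, other_person_present):
    """Join reply parts, omitting sensitive ones when a second person may hear."""
    kept = [
        p.text
        for p in parts
        if not (other_person_present and p.sensitive)
    ]
    return " ".join(kept)


parts = [
    ReplyPart("You have one appointment tomorrow."),
    ReplyPart("It is at the clinic at 9 am.", sensitive=True),
]
alone = build_reply(parts, other_person_present=False)
overheard = build_reply(parts, other_person_present=True)
```

With nobody else in range the full reply is produced; when a second person is detected, only the non-sensitive part remains.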

Expandable dialogue system

A system that allows non-engineer administrators, without programming, machine language, or artificial intelligence system knowledge, to expand the capabilities of a dialogue system. The dialogue system may have a knowledge system, user interface, and learning model. A user interface allows non-engineers to utilize the knowledge system, defined by a small set of primitives and a simple language, to annotate a user utterance. The annotation may include selecting actions to take based on the utterance and subsequent actions and configuring associations. A dialogue state is continuously updated and provided to the user as the actions and associations take place. Rules are generated based on the actions, associations, and dialogue state that allow for computing a wide range of results.

Unified N-best ASR results
10580406 · 2020-03-03

A system and method receive a spoken utterance and convert the spoken utterance into recognized speech results through automatic speech recognition modules. The system and method render a composite recognition speech result comprising the recognized speech results joined in a return function. The system and method interpret the recognized speech results joined in the return function from each of the automatic speech recognition modules through multiple conversation modules.
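One way to picture a composite result is to merge the hypotheses returned by several recognizers into a single list. The sketch below is an assumption-laden illustration: the function name `unify_nbest`, the `(text, confidence)` hypothesis format, and the choice to rank by confidence are all hypothetical and are not taken from the abstract.

```python
def unify_nbest(results_by_module):
    """Merge per-module (text, confidence) hypotheses into one ranked list.

    results_by_module maps a recognizer name to its list of hypotheses.
    Ranking by confidence is an illustrative choice, not the patented method.
    """
    composite = []
    for module, hypotheses in results_by_module.items():
        for text, confidence in hypotheses:
            composite.append(
                {"module": module, "text": text, "confidence": confidence}
            )
    composite.sort(key=lambda h: h["confidence"], reverse=True)
    return composite


results = {
    "asr_a": [("play jazz", 0.9), ("play chess", 0.4)],
    "asr_b": [("play jazz", 0.7)],
}
unified = unify_nbest(results)
```

Each entry keeps the name of the recognizer that produced it, so a downstream conversation module could still weigh hypotheses by their source.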

System and method for continuous media segment identification
10575032 · 2020-02-25

This invention provides a means to identify unknown media programming using the audio component of said programming. The invention extracts audio information from the media received by consumer electronic devices such as smart TVs and TV set-top boxes, then conveys that information to a remote server, which in turn identifies the unknown audio by testing it against a database of known audio segment information. The system identifies unknown media programming in real time so that time-sensitive services may be offered, such as interactive television applications providing contextually related information or television advertisement substitution. Other uses include tracking media consumption, among many other services.

Establishment of audio-based network sessions with non-registered resources
10573322 · 2020-02-25

The present disclosure is generally directed to increasing the scalability of onboarding network resources, such as a digital component, to a voice-based network. The system enables navigation of and interaction with digital components using voice or speech input and output interfaces on a computing device. The system can receive and process an input audio signal to identify a digital component. The system enables voice-based interaction with the previously unregistered digital component via the input and output interfaces.

Vehicle-mounted voice recognition device, vehicle including the same, vehicle-mounted voice recognition system, and method for controlling the same

A vehicle-mounted voice recognition device includes: a storage configured to store a plurality of databases for voice recognition generated based on an address book database sent from a terminal device; a processor configured to detect at least one element from the plurality of databases for voice recognition and determine an order of displaying contact information corresponding to the at least one element; and a user interface configured to display the contact information corresponding to the at least one element in the order of displaying and receive a selection of a piece of the contact information from a user. The processor is further configured to detect a database among the plurality of databases for voice recognition, the detected database including an element corresponding to the selected piece of contact information, and re-determine the order of displaying the contact information based on detection frequencies of the plurality of databases for voice recognition.
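The re-ordering step described here can be sketched as a small ranker that counts how often each recognition database yields the contact the user finally selects. The class `ContactRanker`, its method names, and the example database names are hypothetical; the abstract does not describe the data structures involved.

```python
from collections import Counter


class ContactRanker:
    """Order contacts by how often their source database matched a selection."""

    def __init__(self):
        # database name -> number of times a selected contact was found in it
        self.detections = Counter()

    def record_selection(self, contact, databases):
        """Credit every database that contains the selected contact."""
        for name, elements in databases.items():
            if contact in elements:
                self.detections[name] += 1

    def order(self, candidates):
        """Sort (contact, database_name) pairs by detection frequency.

        sorted() is stable, so ties keep their original order.
        """
        ranked = sorted(candidates, key=lambda cd: -self.detections[cd[1]])
        return [contact for contact, _ in ranked]


databases = {"first_name_db": {"Ann"}, "full_name_db": {"Ann Lee"}}
candidates = [("Ann", "first_name_db"), ("Ann Lee", "full_name_db")]

ranker = ContactRanker()
before = ranker.order(candidates)
ranker.record_selection("Ann Lee", databases)
after = ranker.order(candidates)
```

After the user picks "Ann Lee", results from the database that contained that entry are displayed first on later queries.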

Contextual voice commands
10540976 · 2020-01-21

Among other things, techniques and systems are disclosed for implementing contextual voice commands. On a device, a data item in a first context is displayed. On the device, a physical input selecting the displayed data item in the first context is received. On the device, a voice input that relates the selected data item to an operation in a second context is received. The operation is performed on the selected data item in the second context.

Speech enhancement method and apparatus for same

A speech enhancement method is provided. The speech enhancement method includes: estimating a direction of a speaker by using an input signal, generating direction information indicating the estimated direction, detecting speech of a speaker based on a result of the estimating the direction, and enhancing the speech of the speaker by using the direction information based on a result of the detecting the speech.
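As a rough illustration of the direction and enhancement steps only, the sketch below uses a two-microphone setup: the inter-microphone delay found by cross-correlation stands in for the direction information, and delay-and-sum averaging stands in for the enhancement. Both techniques, the function names, and the two-channel assumption are illustrative substitutes, not the method claimed in the abstract; the speech-detection step is omitted.

```python
def estimate_delay(left, right, max_lag):
    """Return the lag (in samples) of `right` relative to `left` that
    maximizes their cross-correlation, searched over [-max_lag, max_lag]."""
    n = len(left)
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = sum(
            left[i] * right[i + lag] for i in range(n) if 0 <= i + lag < n
        )
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def delay_and_sum(left, right, lag):
    """Align `right` to `left` using the estimated lag and average them.

    Samples that fall outside the signal after shifting are treated as zero.
    """
    n = len(left)
    return [
        (left[i] + (right[i + lag] if 0 <= i + lag < n else 0.0)) / 2
        for i in range(n)
    ]


# A single pulse that reaches the right microphone two samples later.
left = [0, 0, 1, 0, 0, 0, 0, 0]
right = [0, 0, 0, 0, 1, 0, 0, 0]
lag = estimate_delay(left, right, max_lag=3)
enhanced = delay_and_sum(left, right, lag)
```

Signals arriving from the estimated direction add coherently after alignment, while sound from other directions would be attenuated by the averaging.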

Method of and system for providing adaptive respondent training in a speech recognition application

A system for conducting a telephonic speech recognition application includes an automated telephone device for making telephonic contact with a respondent and a speech recognition device which, upon the telephonic contact being made, presents the respondent with at least one introductory prompt for the respondent to reply to; receives a spoken response from the respondent; and performs a speech recognition analysis on the spoken response to determine a capability of the respondent to complete the application. If the speech recognition device, based on the spoken response to the introductory prompt, determines that the respondent is capable of completing the application, the speech recognition device presents at least one application prompt to the respondent. If the speech recognition device, based on the spoken response to the introductory prompt, determines that the respondent is not capable of completing the application, the speech recognition system presents instructions on completing the application to the respondent.
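The branching described above can be sketched as a single decision function. The name `next_prompt`, the `recognized` and `confidence` inputs, and the 0.6 threshold are hypothetical; the abstract does not say how capability is measured.

```python
def next_prompt(recognized, confidence, threshold=0.6):
    """Choose what to play after the introductory prompt.

    `recognized` is whether the response matched an expected answer and
    `confidence` is an assumed recognizer score in [0, 1].
    """
    capable = recognized and confidence >= threshold
    return "application_prompt" if capable else "instructions"


clear_answer = next_prompt(True, 0.9)
mumbled_answer = next_prompt(True, 0.3)
unexpected_answer = next_prompt(False, 0.9)
```

A respondent who answers the introductory prompt clearly goes straight to the application, while anyone else first hears instructions on how to complete it.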