G10L15/222

Systems, methods, and apparatuses for resuming dialog sessions via automated assistant
11817099 · 2023-11-14 · ·

Methods, apparatus, systems, and computer-readable media are provided for storing incomplete dialog sessions between a user and an automated assistant in order that the dialog sessions can be completed in furtherance of certain actions. While interacting with an automated assistant, a user can become distracted and not complete the interaction to the point of the automated assistant performing some action. In response, the automated assistant can store the interaction as a dialog session. Subsequently, the user may express interest, directly or indirectly, in completing the dialog session, and the automated assistant can provide the user with a selectable element that, when selected, causes the dialog session to be reopened. The user can then continue the dialog session with the automated assistant in order that the originally intended action can be performed by the automated assistant.

SYSTEMS AND METHODS FOR CONVERSATIONS WITH DEVICES ABOUT MEDIA USING VOICE PERSONALITY PROFILES
20230352008 · 2023-11-02 ·

Systems and methods are described herein for providing media guidance. Control circuitry may receive a first voice input and access a database of topics to identify a first topic associated with the first voice input. A user interface may generate a first response to the first voice input, and subsequent to generating the first response, the control circuitry may receive a second voice input. The control circuitry may determine a match between the second voice input and an interruption input such as a period of silence or a keyword or a phrase, such as “Ahh,”, “Umm,”, or “Hmm.” The user interface may generate a second response that is associated with a second topic related to the first topic. By interrupting the conversation and changing the subject iron time to time, media guidance systems can appear to be more intelligent and human.

Systems and methods for addressing possible interruption during interaction with digital assistant

Systems and methods are described for handling interruptions during a digital assistant session between a user and a digital assistant by detecting if an interruption event is to occur during the digital assistant session. In response to detecting that the interruption event is to occur, options to address the interruption are provided.

PROACTIVE INCORPORATION OF UNSOLICITED CONTENT INTO HUMAN-TO-COMPUTER DIALOGS

Methods, apparatus, and computer readable media are described related to automated assistants that proactively incorporate, into human-to-computer dialog sessions, unsolicited content of potential interest to a user. In various implementations, based on content of an existing human-to-computer dialog session between a user and an automated assistant, an entity mentioned by the user or automated assistant may be identified. Fact(s)s related to the entity or to another entity that is related to the entity may be identified based on entity data contained in database(s). For each of the fact(s), a corresponding measure of potential interest to the user may be determined. Unsolicited natural language content may then be generated that includes one or more of the facts selected based on the corresponding measure(s) of potential interest. The automated assistant may then incorporate the unsolicited content into the existing human-to-computer dialog session or a subsequent human-to-computer dialog session.

DELAY ESTIMATION METHOD AND APPARATUS FOR SMART REARVIEW MIRROR, AND ELECTRONIC DEVICE
20220284902 · 2022-09-08 ·

A delay estimation method for a smart rearview mirror includes obtaining identification information of an external device in response to connecting to the external device by a smart rearview mirror for screen projection. A target delay estimation upper limit corresponding to the external device is obtained based on the identification information. A delay estimation result is obtained by performing delay estimation based on the target delay estimation upper limit in response to sending a voice signal by the smart rearview mirror to the external device. The delay processing is performed on the voice signal based on the delay estimation result.

METHOD FOR DENOISING VOICE DATA, DEVICE, AND STORAGE MEDIUM
20220284914 · 2022-09-08 ·

The present disclosure provides a method for denoising voice data, an electronic device, and a computer readable storage medium. The present disclosure relates to the technical field of artificial intelligence, such as Internet of Vehicles, smart cockpit, smart voice, and voice recognition. A specific embodiment of the method includes: receiving an input to-be-played first piece of voice data; and invoking, in response to not detecting a synthetic voice interruption signal in a process of playing the first piece of voice data, a preset first denoising algorithm to filter out noise data except for the first piece of voice data.

METHODS, SYSTEMS AND APPARATUSES FOR IMPROVED SPEECH RECOGNITION AND TRANSCRIPTION
20220215840 · 2022-07-07 ·

Methods, systems, and apparatuses for improved speech recognition and transcription of user utterances are described herein. User utterances may be processed by a speech recognition computing device as well as an acoustic model. The acoustic model may be trained using historical user utterance data and machine learning techniques. The acoustic model may be used to determine whether a transcription determined by the speech recognition computing device should be overridden with an updated transcription.

MULTIPLE STATE DIGITAL ASSISTANT FOR CONTINUOUS DIALOG

Systems and processes for operating an intelligent automated assistant are provided. For example, a first speech input is received from a user. In response to receiving the first speech input, a response is provided. A first output is provided corresponding to a digital assistant in a first state, and a second speech input is received from the user. A first plurality of values is obtained. Based on the first plurality of values, a first confidence level corresponding to the second speech input is obtained. In accordance with a determination that the first confidence level exceeds a first threshold confidence level, a second output is provided corresponding to the digital assistant in a second state. The second speech input continues to be received.

Voice interaction method, and device

A voice dialogue method performed by a voice dialog system includes: a voice signal generation unit; a voice dialog agent unit; a voice output unit; and a voice input control unit, the method including: a step of, by the voice signal generation unit, receiving a voice input and generating a voice signal based on the received voice input; a step of, by the voice dialog agent unit, performing voice recognition processing on the voice signal and performing processing based on a result of the voice recognition processing to generate a response signal; a step of, by the voice output unit, outputting a voice based on the response signal; and a step of, when the voice output unit outputs the voice, by the voice input control unit, keeping the voice signal generation unit, for predetermined period after output of the voice, a receivable state in which a voice input is receivable.

Systems and methods for addressing possible interruption during interaction with digital assistant

Systems and methods are described for handling interruptions during a digital assistant session between a user and a digital assistant by detecting if an interruption event is to occur during the digital assistant session. In response to detecting that the interruption event is to occur, an operation that addresses the interruption event may be caused to be performed.