H04M2250/74

Method for responding to user utterance and electronic device for supporting same

In an embodiment of the disclosure, an electronic device includes a communication module, a microphone, a first and a second wake-up recognition module, a memory, and a processor. The processor is configured to: receive a first user utterance through the microphone; recognize the first user utterance based on at least one of the first or the second wake-up recognition module; when the recognized first user utterance includes at least one specified piece of first trigger information, record at least part of the first user utterance by activating a recording function; transmit the recorded data to an external device; and receive, from the external device, at least one of second user utterance information, which is predicted to occur at a time after a function of a speech recognition service is activated by the first wake-up recognition module, or at least one piece of response information associated with the second user utterance.

DETERMINATION AND VISUAL DISPLAY OF SPOKEN MENUS FOR CALLS

Implementations relate to determination and visual display of spoken menus for calls. In some implementations, a computer-implemented method includes receiving audio data output in a call between a call device and a device associated with a target entity. The audio data includes speech indicating one or more selection options for a user of the call device to navigate through a call menu provided by the target entity in the call. Text is determined by programmatically analyzing the audio data, the text representing the speech. The selection options are determined based on programmatically analyzing at least one of the text or the audio data. At least a portion of the text is displayed by the call device during the call, as one or more visual options that correspond to the selection options. The visual options are each selectable via user input to cause corresponding navigation through the call menu.
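
The selection-option step can be pictured with a small sketch that pulls key/label pairs out of text that has already been transcribed from the call audio. The abstract does not say how the analysis is done; the function name `extract_menu_options`, the regular expression, and the sample transcript are all assumptions made for illustration, and only the phrasing "press N for/to ..." is handled.

```python
import re

# Hypothetical pattern: only recognizes "press <key> for/to <label>".
MENU_PATTERN = re.compile(
    r"press\s+(\d|star|pound)\s+(?:for|to)\s+([^.,;]+)",
    re.IGNORECASE,
)


def extract_menu_options(transcript):
    """Return (key, label) pairs found in transcribed call-menu speech."""
    return [
        (match.group(1).lower(), match.group(2).strip())
        for match in MENU_PATTERN.finditer(transcript)
    ]


# Illustrative transcript, not taken from the source.
transcript = (
    "Thank you for calling. Press 1 for billing. "
    "Press 2 to speak with an agent, or press star to repeat this menu."
)

for key, label in extract_menu_options(transcript):
    # Each pair could back one tappable visual option on the call screen.
    print(f"[{key}] {label}")
```

Each returned pair corresponds to one visual option; selecting it would send the matching key press to navigate the menu.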

Attention aware virtual assistant dismissal
11630525 · 2023-04-18

Systems and processes for operating an intelligent automated assistant are provided. An example process includes initiating a virtual assistant session responsive to receiving user input. In accordance with initiating the virtual assistant session, the process includes determining, based on data obtained using one or more sensors of the electronic device, whether one or more criteria representing expressed user disinterest are satisfied. In accordance with determining that the one or more criteria representing expressed user disinterest are satisfied prior to a first time, the process includes automatically deactivating the virtual assistant session prior to the first time. The first time is defined by a setting of the electronic device. In accordance with determining that the one or more criteria representing expressed user disinterest are not satisfied prior to the first time, the process includes automatically deactivating the virtual assistant session at the first time.
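
The timing rule in this abstract can be sketched as a single function. This is a minimal sketch under stated assumptions: the name `deactivation_time`, the representation of sensor results as a list of timestamps at which the disinterest criteria were satisfied, and the choice to deactivate at the earliest such timestamp are hypothetical; the abstract only requires deactivation "prior to the first time" in that case.

```python
def deactivation_time(session_start, timeout_setting, disinterest_times):
    """Return when the assistant session is automatically deactivated.

    session_start:     time the session was initiated (seconds)
    timeout_setting:   device setting that defines the "first time" (seconds)
    disinterest_times: hypothetical timestamps at which the sensor-based
                       disinterest criteria were found to be satisfied
    """
    first_time = session_start + timeout_setting
    # Only disinterest detected before the first time triggers early dismissal.
    early = [t for t in disinterest_times if session_start <= t < first_time]
    # Assumption: dismiss at the moment disinterest is first detected.
    return min(early) if early else first_time
```

For example, with a session starting at 0 s and an 8 s setting, disinterest detected at 3.5 s ends the session at 3.5 s, while no detected disinterest ends it at 8 s.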

System and method for causing messages to be delivered to users of a distributed voice application execution system
11657406 · 2023-05-23

A system and method for delivering a message to a user makes use of a voice applications agent that is located, at least in part, on the user's local device, the voice applications agent being configured to perform a voice application in order to deliver the message to the user. The voice application comprises a set of instructions about how to interact with the user. Performing the voice application on the user's local device comprises the voice applications agent following the set of instructions that comprise the voice application in order to interact with the user.

Method for quickly starting application service, and terminal
11656843 · 2023-05-23

Disclosed are a method for quickly starting an application service, and a terminal. The method includes acquiring, by a terminal, event trigger information; starting, by the terminal, application service software after determining that the event trigger information meets a preset quick startup condition; and acquiring, by the terminal, a voice instruction input by a user, and running the application service software according to the voice instruction. According to the method provided in the embodiments of the present disclosure, the application service software is started by using event trigger information, so that the background of the terminal starts recording only after the application service software is started, and background recording is stopped after the terminal provides an application service for the user. This prevents a recording device in the background of the terminal from always being in a recording state and thereby reduces the power consumption of the terminal.

Preventing unwanted activation of a device

A computing device may be configured to receive a content asset and to determine whether the content asset comprises one or more triggers. The trigger may be a word, phrase, or passcode that alerts a voice activated device to the presence of a voice command and may serve as an instruction to the voice activated device to cause execution of the voice command. In response to determining that the content asset comprises one or more triggers, the computing device may be configured to insert one or more signal markers into the content asset at a location corresponding to the one or more triggers, and to cause transmission and/or presentation of the content asset with the one or more signal markers. The signal markers may cause a voice activated device to ignore a voice command in the content asset, despite the presence of one or more triggers.
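
One way to picture the marker-placement step is to scan a time-aligned transcript of the content asset for trigger phrases and record a marker at each location. This is only a sketch: the function name `find_trigger_markers`, the `(start_seconds, word)` input format, and the dictionary-shaped marker are assumptions, and the abstract does not specify how triggers are detected or what form a signal marker takes.

```python
def find_trigger_markers(timed_words, triggers):
    """Return marker records at the start time of each trigger phrase.

    timed_words: list of (start_seconds, word) pairs, a hypothetical
                 time-aligned transcript of the content asset
    triggers:    list of trigger phrases, e.g. ["hey assistant"]
    """
    # Normalize words so punctuation and case do not hide a trigger.
    tokens = [word.lower().strip(".,!?") for _, word in timed_words]
    markers = []
    for phrase in triggers:
        parts = phrase.lower().split()
        n = len(parts)
        for i in range(len(tokens) - n + 1):
            if tokens[i:i + n] == parts:
                markers.append({"time": timed_words[i][0], "trigger": phrase})
    return sorted(markers, key=lambda marker: marker["time"])


# Illustrative content asset, not taken from the source.
ad_audio = [
    (0.0, "Just"), (0.4, "say"), (0.8, "Hey"),
    (1.1, "Assistant,"), (1.6, "order"), (2.0, "pizza"),
]
print(find_trigger_markers(ad_audio, ["hey assistant"]))
```

A voice activated device that detects a marker at a given time could then ignore the command that follows the trigger at that location.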

Enabling workers to swap between mobile devices
11604675 · 2023-03-14

A method for identifying a second device by a first device for establishing a communication between the first device and the second device is described here. The method includes receiving, by a processor of a first device, a voice command from a worker in a workplace. In an example, the method comprises pausing, by the processor, a workflow operation executing on the first device. The method further comprises performing, by the processor, voice recognition to analyze the voice command of the worker. The method includes activating, by the processor, a communication module of the first device based on the voice recognition, to identify a second device in proximity to the first device. The method includes terminating, by the processor, a connection between the first device and a wearable electronic device, and thereby terminating, by the processor, a second connection of the first device with the second device.

Systems, methods, and apparatus for real-time dictation and transcription with multiple remote endpoints
11468896 · 2022-10-11

A method for real-time dictation and transcription with multiple remote endpoints is provided. The method comprises invoking a primary application and a client device application (APP) that works with a remote hosted application to process audio for the primary application. The APP connects to the hosted application, and the hosted application receives and processes the audio. The hosted application returns text to the client device, where the text populates the primary application. The APP and/or the hosted application also transmits the text to a remote endpoint, such as a desktop or laptop computer, where the user can interact with the primary application and the text returned by the hosted application.

Voice interaction processing method and apparatus
11620995 · 2023-04-04

This application provides a voice interaction processing method and apparatus, to achieve a friendly and natural voice interaction effect and reduce power consumption. In the method, a microprocessor enables an image collector only when determining, based on voice data collected by a voice collector, that a first user is a target user; then the image collector collects user image data and transmits the user image data to the microprocessor; and the microprocessor sends a wakeup instruction to an application processor only when determining, based on the user image data, that the target user is in a voice interaction state. Based on the foregoing method, mis-enabling of the image collector and the application processor is avoided to some extent, and power consumption is reduced.
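
The staged gating described in this abstract can be sketched as a short function in which each more expensive component runs only if the cheaper check before it passes. The function name and the three callables (`is_target_user`, `capture_image`, `is_in_interaction_state`) are hypothetical stand-ins for the voice check, the image collector, and the image check; the abstract does not define such interfaces.

```python
def should_wake_application_processor(
    voice_data, is_target_user, capture_image, is_in_interaction_state
):
    """Decide whether to send a wakeup instruction, enabling hardware lazily."""
    # Stage 1: voice-only check; the image collector stays off on failure.
    if not is_target_user(voice_data):
        return False
    # Stage 2: the image collector is enabled only for a target user.
    image_data = capture_image()
    # Stage 3: wake the application processor only if the image data shows
    # that the target user is in a voice interaction state.
    return is_in_interaction_state(image_data)
```

Because `capture_image` is called only after the voice check succeeds, a non-target speaker never causes the image collector to be enabled, which is the power-saving point of the method.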

ELECTRONIC APPARATUS AND CONTROLLING METHOD THEREOF

An electronic apparatus is disclosed. The electronic apparatus may include a microphone; a memory configured to store a wakeup word; and a processor configured to: identify, based on context information of the electronic apparatus, an occurrence of a pre-determined event; change, based on the occurrence of the pre-determined event, a first threshold value for recognizing the wakeup word; obtain, based on a first user voice input received via the microphone, a similarity value between first text information corresponding to the first user voice input and the wakeup word; and perform, based on the similarity value being greater than or equal to the first threshold value, a voice recognition function on second text information corresponding to a second user voice input received via the microphone after the first user voice input.
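
The threshold comparison can be illustrated with a small sketch. The abstract does not say how the similarity value is computed or what the threshold values are; here the similarity is the ratio from Python's standard `difflib.SequenceMatcher`, and the constants `DEFAULT_THRESHOLD` and `EVENT_THRESHOLD`, the wakeup word, and the function names are all assumptions for illustration.

```python
from difflib import SequenceMatcher

DEFAULT_THRESHOLD = 0.9  # hypothetical first threshold value
EVENT_THRESHOLD = 0.7    # hypothetical value used after the pre-determined event


def wake_threshold(event_occurred):
    """Return the threshold, changed when the pre-determined event occurred."""
    return EVENT_THRESHOLD if event_occurred else DEFAULT_THRESHOLD


def is_wakeup(first_text, wakeup_word, event_occurred):
    """Compare recognized text with the stored wakeup word."""
    # Assumption: character-level similarity stands in for the patent's
    # unspecified similarity measure.
    similarity = SequenceMatcher(
        None, first_text.lower(), wakeup_word.lower()
    ).ratio()
    return similarity >= wake_threshold(event_occurred)


# A slightly misrecognized wakeup word is accepted only under the
# lowered threshold.
print(is_wakeup("hey computa", "hey computer", event_occurred=False))
print(is_wakeup("hey computa", "hey computer", event_occurred=True))
```

When `is_wakeup` returns true, the voice recognition function would then be applied to the text of the second user voice input that follows.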