H04M2250/74

Systems and methods for optimization of transmission of real-time data via network labeling
11677803 · 2023-06-13 · ·

The present invention relates to systems and methods for network labeling in order to enhance real time data transfers. A network for a real time data transfer is identified and predictive models for network performance are compared against to determine if the network is suitable for the data transfer. If so, then the real time data transfer may be completed as expected. However, if the network is predicted to be unsuitable for transmission an alternate means for connection may be suggested. The alternate suggestion may include delaying the data transfer until the network is expected to be in better conditions, connecting to another access point in the network, or switching to another network entirely. During the data transfer, the quality of the network is monitored in order to update the predictive models for the network's quality. Identifiers for the network may be utilized to keep track of the networks. Network signal strength, signal pollution and time may also be tracked in order to identify patterns in the network's performance.

GRAPHICAL INTERFACE FOR SPEECH-ENABLED PROCESSING
20230176813 · 2023-06-08 ·

Methods and devices for sampling applications using a touch input are described herein. In some embodiments, an electronic device detects a touch input, which may cause the electronic device to send identifiers to a backend system. The backend system may then determine an application and sample audio request associated with the received identifiers. The backend system may then receive text data representing the sample audio request and text data representing a response to the sample audio request. The backend system may generate audio data representing the received text data and send the audio data to the electronic device. If, the touch input is still occurring, the backend system may find and send more sample audio requests and the responses thereof. If the touch input stops occurring during the sample, the backend system may send instructions to the electronic device to stop outputting the sample.

Call Processing Method and Device
20170339263 · 2017-11-23 · ·

A call processing method and device are provided. In the call processing method, voice guidance in a call is converted into visual data; and the visual data is displayed as a visual menu to present content of the voice guidance.

UNIFIED MESSAGE SEARCH

The disclosed embodiments include computerized methods, systems, and devices, including computer programs encoded on a computer storage medium, for generating terms of a search query based on a user's spoken utterances, identifying multiple cross-platform messages based on the generated terms, and to generating, via a presentation device, a single interface that enables the user to interact with identified messages. Based on a spoken utterance, the disclosed embodiments may determine user-specified search terms and/or criteria, and based on the user-specified search terms and/or criteria, may obtain cross-platform message data that corresponds to the search query. The communications device may generate one or more interface elements that describe corresponding ones of the cross-platform messages, which may be presented within a unified graphical user interface or voice-user interface by a communications device.

SELECTIVELY RENDERING A KEYBOARD INTERFACE IN RESPONSE TO AN ASSISTANT INVOCATION IN CERTAIN CIRCUMSTANCES
20220365745 · 2022-11-17 ·

Implementations set forth herein relate to an automated assistant that can adapt to circumstances in which a user may invoke an automated assistant with an intention of interacting with the automated assistant via a non-default interface. For example, in some instances, a user may invoke an automated assistant by selecting a selectable GUI element. In response, the automated assistant can determine that, in the current context, spoken utterances may not be suitable for providing to the automated assistant. Based on this determination, the automated assistant can cause a keyboard interface to be rendered and/or initialized for receiving typed inputs from the user. Should the user subsequently change contexts, the automated assistant can determine that voice input is now suitable for user input and initialize an audio interface in response to the user providing an invocation input in the subsequent context.

Speaker diarization
11670287 · 2023-06-06 · ·

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speaker diarization are disclosed. In one aspect, a method includes the actions of receiving audio data corresponding to an utterance. The actions further include determining that the audio data includes an utterance of a predefined hotword spoken by a first speaker. The actions further include identifying a first portion of the audio data that includes speech from the first speaker. The actions further include identifying a second portion of the audio data that includes speech from a second, different speaker. The actions further include transmitting the first portion of the audio data that includes speech from the first speaker and suppressing transmission of the second portion of the audio data that includes speech from the second, different speaker.

METHODS AND APPARATUS FOR DETECTING A VOICE COMMAND

According to some aspects, a method of monitoring an acoustic environment of a mobile device, at least one computer readable medium encoded with instructions that, when executed, perform such a method and/or a mobile device configured to perform such a method is provided. The method comprises receiving acoustic input from the environment of the mobile device while the mobile device is operating in the low power mode, detecting whether the acoustic input includes a voice command based on performing a plurality of processing stages on the acoustic input, wherein at least one of the plurality of processing stages is performed while the mobile device is operating in the low power mode, and using at least one contextual cue to assist in detecting whether the acoustic input includes a voice command.

ENABLING WORKERS TO SWAP BETWEEN MOBILE DEVICES
20230168920 · 2023-06-01 ·

A method for identifying a second device by a first device for establishing a communication between the first device and the second device is described here. The method includes receiving, by a processor of a first device, a voice command from a worker in a workplace. In an example, the method comprises pausing, by the processor, a workflow operation executing on the first device. The method further comprises performing, by the processor, a voice recognition to analyze the voice command of the worker. The method includes activating, by the processor, a communication module of the first device based on the voice recognition, to identify a second device in proximity to the first device. The method includes terminating, by the processor, a connection between the first device and the wearable electronic device. Thus, terminating, by the processor, a second connection of the first device with the second device.

Method and device for waking up voice recognition function in mobile terminal, and computer readable storage medium

The present disclosure provides a method and device for waking up a voice recognition function in a mobile terminal. The method includes: receiving a trigger instruction of an input interface, and activating the input interface; acquiring a rotation angle of the mobile terminal in a first rotation direction with an x-axis as a rotation axis, in which the x-axis is a widthwise direction of a screen of the mobile terminal; and activating the voice recognition function when the rotation angle of the mobile terminal in the first rotation direction with the x-axis as the rotation axis is greater than or equal to a first trigger threshold.

Systems and methods for presenting location related information
09811315 · 2017-11-07 ·

Systems and methods for presenting location related information after a user arrives at a place. In an aspect, when a user gazes at a display of a standby device, information presentation begins. In other aspects, when a user shakes a device or says certain words to a device, and then gazes at it, a presentation starts.