Patent classifications
H04M2250/74
AUDIO INFORMATION PROCESSING METHOD, APPARATUS, ELECTRONIC DEVICE AND STORAGE MEDIUM
The present disclosure relates to an audio information processing method, an apparatus, an electronic device and a computer-readable storage medium. The audio information processing method includes: determining whether an audio recording start condition is satisfied; collecting audio information associated with an electronic device in response to determining that the audio recording start condition is satisfied; performing word segmentation on text information corresponding to the audio information to obtain word-segmented text information; and displaying the word-segmented text information on a user interface of the electronic device.
System and method for controlling a remote computer using an intelligent personal assistant
Described herein is a system and method for controlling a remote computer over a remote session. In an exemplary aspect, the method comprises receiving, by an intelligent personal assistant (IPA), a speech-based instruction from a user of a mobile device, the speech-based instruction comprising a request to execute a task; in response to determining, based on analysis of the speech-based instruction, an intent of the user to initiate the remote session with the remote computer: identifying, by the IPA, a remote client on the mobile device for creating the remote session with the remote computer via the cloud infrastructure; generating, by the IPA, a command object that causes a software agent on the remote computer to execute the task; and transmitting, by the IPA, the command object through the remote session to the remote computer; and presenting results for execution of the task in an interface of the remote client.
Information transmission device for open moving machine
An information transmission device applied to an open moving machine driven by a driver who is exposed to an outside of the moving machine includes: a vibrator configured to vibrate a component of the moving machine, the driver in a driving posture being in contact with the component; and a controller communicably connected to an information providing device and configured to control the vibrator and a voice output device configured to output voice to the driver, the information providing device being configured to provide utterance information to the driver as information to be transmitted to the driver. The controller makes the vibrator operate when making the voice output device output the utterance information to the driver by the voice.
ZERO LATENCY DIGITAL ASSISTANT
An electronic device can implement a zero-latency digital assistant by capturing audio input from a microphone and using a first processor to write audio data representing the captured audio input to a memory buffer. In response to detecting a user input while capturing the audio input, the device can determine whether the user input meets a predetermined criteria. If the user input meets the criteria, the device can use a second processor to identify and execute a task based on at least a portion of the contents of the memory buffer.
DIGITAL ASSISTANT INTERACTION IN A COMMUNICATION SESSION
An example process includes while an electronic device is engaged in a communication session with external device(s): receiving, from a first user of the electronic device, input to invoke a first digital assistant; receiving, from the first user, a natural language input corresponding to a task; in accordance with invoking the first digital assistant, generating, by the first digital assistant, a prompt for further user input about the task; transmitting, to the external device(s), the prompt for further user input about the task; after transmitting the prompt for further user input, receiving, from an external device of the external device(s), a response to the prompt for further user input; initiating, by the first digital assistant, based on the response and information corresponding to the first user stored on the electronic device, the task; and transmitting, to the external device(s), an output indicative of the initiated task.
HOLOGRAPHIC INTERFACE FOR VOICE COMMANDS
A computer implemented method, computer system, and computer program product for executing a voice command. A number of processor units displays a view of a location with voice command devices in response to detecting the voice command from a user. The number of processor units displays a voice command direction for the voice command in the view of the location. The number of processor units changes the voice command direction in response to a user input. The number of processor units identifies a voice command device from the voice command devices in the location based on the voice command direction to form a selected voice command device. The number of processor units executes the voice command using the selected voice command device.
Bluetooth speaker base, method and system for controlling thereof
The present disclosure provides a Bluetooth speaker base, a method and a system for controlling a Bluetooth speaker base. The method includes: acquiring voice data, and determining whether the voice data includes a wake-up word, when positions of the Bluetooth speaker base and a Bluetooth speaker satisfy a preset condition; controlling the Bluetooth speaker base to enter a wake-up recognition state, and compressing the voice data based on a compression ratio, when the voice data includes the wake word; and sending the voice data compressed to a mobile terminal through a first profile, to cause the mobile terminal to decompress the voice data received, send the voice data decompressed to a server for voice recognition to obtain audio data, and send the audio data to the Bluetooth speaker for playback through a second profile.
Methods and systems for passive wakeup of a user interaction device
The embodiments herein disclose methods and systems for passive wakeup of a user interaction device and configuring a dynamic wakeup time for a user interaction device, a method includes detecting an occurrence of at least one first non-voice event associated with at least one device present in an Internet of Things (IoT) environment. The method includes detecting an occurrence of at least one successive event associated with the at least one device. The method includes estimating a contextual probability of initiating at least one interaction by a user with the user interaction device on detecting the occurrence of at least one of the at least one first event and the at least one successive event. On determining the estimated contextual probability is above a pre-defined threshold value, the method includes configuring the dynamic wakeup time to switch the user interaction device to a passive wakeup state.
Systems to enhance data entry in mobile and fixed environment
A mobile phone device includes a housing having a substantially rectangular shape wherein its height dimension substantially corresponds to a distance between an ear and a mouth of a user and wherein its width dimension is less than its height dimension. A display unit is integrated within the front surface of the mobile phone device. The display unit substantially entirely covers the front surface of the mobile phone device. The mobile phone device does not include a physical key on the front surface.
PORTABLE TERMINAL DEVICE AND INFORMATION PROCESSING SYSTEM
A portable terminal device in an information processing system and method includes a camera and a microphone. Data of obtained images and voice are transmitted to a server that identifies operations to be executed based on the received voice and image data. The server transmits an identification of one or more results of the plurality of operations to the portable terminal device. When the portable terminal device receives only one result from the server, an operation corresponding to the one result is executed, and when a plurality of results is received, the portable terminal device displays information corresponding to the plurality of results as candidates. Additional voice is captured for selecting one of the plurality of results during the displaying of the information. A determination of one result from the plurality of results is made based on the captured voice, and an operation corresponding to the determined result is executed.