G10L2015/088

APPENDING ASSISTANT SERVER REQUESTS, FROM A CLIENT ASSISTANT,WITH PROACTIVELY-AGGREGATED PERIPHERAL DEVICE DATA

Implementations relate to proactively aggregating client device data to append to client assistant data that is communicated to a server device in response to a user request to a client automated assistant. When a user request that is associated with, for example, a peripheral client device, is received at a client device, the client device can communicate, to a server device, data that embodies the user request (e.g., audio data and/or local speech recognition data), along with peripheral device data that was received before the client device received the user request. In this way, the client automated assistant can bypass expressly soliciting peripheral device data each time a user request is received at another client device. Instead, a peripheral device can proactively communicate device data to a client device so that the device data can be appended to request data communicated to the server device from a particular client device.

Electronic apparatus and control method thereof
11580964 · 2023-02-14 · ·

An electronic apparatus is provided. The electronic apparatus includes a microphone, a memory configured to store a plurality of keyword recognition models, and a processor, which is coupled with the microphone and the memory, configured to control the electronic apparatus, wherein the processor is configured to selectively execute at least one keyword recognition model among the plurality of keyword recognition models based on operating state information of the electronic apparatus, based on a first user voice being input through the microphone, identify whether at least one keyword corresponding to the executed keyword recognition model is included in the first user voice by using the executed keyword recognition model, and based on at least one keyword identified as being included in the first user voice, perform an operation of the electronic apparatus corresponding to the at least one keyword.

In-vehicle speech processing apparatus

An in-vehicle apparatus is connectable to a device that includes a voice assistant function. The in-vehicle apparatus includes: a voice detector that performs voice recognition of an audio signal input from a microphone and that controls functions of the in-vehicle apparatus based on a result of the voice recognition; and an interface that communicates with the device. When being informed of a detection of a predetermined word in the audio signal as the result of the voice recognition of the audio signal performed by the voice detector, the interface sends to the device, not via the voice detector, the audio signal input from the microphone. The predetermined word is for activating the voice assistant function of the device.

Artificial intelligence device and method of operating artificial intelligence device
11580969 · 2023-02-14 · ·

An artificial intelligence device includes a microphone configured to receive a speech command, a speaker, a communication unit configured to perform communication with an external artificial intelligence device, and a processor configured to receive a wake-up command through the microphone, acquire a first speech quality level of the received wake-up command, receive a second speech quality level of the wake-up command input to the external artificial intelligence device from the external artificial intelligence device through the communication unit, output a notification indicating that the artificial intelligence device is selected as an object to be controlled through the speaker, when the first speech quality level is larger than the second speech quality level, receive an operation command through the microphone, acquire an intention of the received operation command and transmit the operation command to an external artificial intelligence device which will perform operation corresponding to the operation command according to the acquired intention through the communication unit.

Robot and method for recognizing wake-up word thereof
11577379 · 2023-02-14 · ·

Provided is a robot including a microphone configured to acquire a sound signal corresponding to a sound generated near the robot, a camera, an output interface including at least one of a display configured to output a wake-up screen or a speaker configured to output a wake-up sound when the robot wakes up, and a processor configured to recognize whether the acquired sound includes a voice of a person, activate the camera when the sound includes a voice of a person, recognize whether a person is present in an image acquired by the activated camera, set a wake-up word recognition sensitivity based on a recognition result as to whether a person is present, and recognize whether a wake-up word is included voice data of a user acquired through the microphone based on the set wake-up word recognition sensitivity.

Robot teaching device
11580972 · 2023-02-14 · ·

A robot teaching device includes: a display device; an operation key formed of a hard key or a soft key and including an input changeover switch; a microphone; a voice recognition section; a correspondence storage section storing each of a plurality of types of commands and a recognition target word in association with each other; a recognition target word determination section configured to determine whether a phrase represented by character information includes the recognition target word; and a command execution signal output section configured to switch, in response to the input changeover switch being operated, between a first operation in which a signal for executing the command corresponding to an operation to the operation key is outputted and a second operation in which a signal for executing the command associated with the recognition target word represented by the character information is outputted.

Keyword determinations from conversational data
11580993 · 2023-02-14 · ·

Topics of potential interest to a user, useful for purposes such as targeted advertising and product recommendations, can be extracted from voice content produced by a user. A computing device can capture voice content, such as when a user speaks into or near the device. One or more sniffer algorithms or processes can attempt to identify trigger words in the voice content, which can indicate a level of interest of the user. For each identified potential trigger word, the device can capture adjacent audio that can be analyzed, on the device or remotely, to attempt to determine one or more keywords associated with that trigger word. The identified keywords can be stored and/or transmitted to an appropriate location accessible to entities such as advertisers or content providers who can use the keywords to attempt to select or customize content that is likely relevant to the user.

Device and method for activating with voice input
11580976 · 2023-02-14 · ·

An information processing apparatus that detects a voice command via a microphone in order to activate the device and execute certain applications. The apparatus comprises a digital signal processor (DSP) and a host controller which are responsible for processing the voice commands. The DSP recognizes and processes voice commands intermittently while the host processor is in a sleep state, thereby reducing the overall power consumption of the apparatus. Further, when the DSP is configured to recognize voice commands intended, only to activate the device, a memory having a sufficiently lower storage capacity suffices.

Method and device for recognizing speech in vehicle

The present disclosure relates to a method and a device for recognizing speech in a vehicle. The method for recognizing the speech in the vehicle may include collecting one or more types of information, determining information to be linked with each other for speech recognition based on an information processing priority predefined corresponding to each type of the collected information, analyzing the determined information to perform the speech recognition for a signal input through a microphone, and extracting at least one of a wake up voice or a command voice through the speech recognition to control the vehicle. Therefore, the present disclosure has an advantage of more accurately performing the speech recognition by linking collected various information in the vehicle with each other.

Local voice data processing

Example techniques relate to local voice control in a media playback system. A satellite device (e.g., a playback device or microcontroller unit) may be configured to recognize a local set of keywords in voice inputs including context specific keywords (e.g., for controlling an associated smart device) as well as keywords corresponding to a subset of media playback commands for controlling playback devices in the media playback system. The satellite device may fall back to a hub device (e.g., a playback device) configured to recognize a more extensive set of keywords. In some examples, either device may fall back to the cloud for processing of other voice inputs.