G10L15/08

NETWORKED DEVICES, SYSTEMS, & METHODS FOR INTELLIGENTLY DEACTIVATING WAKE-WORD ENGINES

In one aspect, a playback deice is configured to identify in an audio stream, via a second wake-word engine, a false wake word for a first wake-word engine that is configured to receive as input sound data based on sound detected by a microphone. The first and second wake-word engines are configured according to different sensitivity levels for false positives of a particular wake word. Based on identifying the false wake word, the playback device is configured to (i) deactivate the first wake-word engine and (ii) cause at least one network microphone device to deactivate a wake-word engine for a particular amount of time. While the first wake-word engine is deactivated, the playback device is configured to cause at least one speaker to output audio based on the audio stream. After a predetermined amount of time has elapsed, the playback device is configured to reactivate the first wake-word engine.

INFORMATION PROCESSOR, INFORMATION PROCESSING METHOD, AND PROGRAM
20230005481 · 2023-01-05 · ·

An information processor including: an operation control unit that controls a motion of an autonomous mobile body acting on the basis of recognition processing, in a case where a target sound that is a target voice for voice recognition processing is detected, the operation control unit moving the autonomous mobile body to a position, around an approach target, where an input level of a non-target sound that is not the target voice becomes lower, the approach target being determined on the basis of the target sound.

Voice Wake-Up Method, Electronic Device, Wearable Device, and System
20230239800 · 2023-07-27 ·

A voice wake-up method, an electronic device, and a wearable device. The system includes the electronic device and the wearable device. The electronic device communicates with the wearable device through a short-distance wireless connection, and the electronic device is configured to: collect a voice signal in an environment in which the electronic device is located; and when the voice signal meets a preset condition, send a query request to the wearable device, where the query request is used to request information indicating that a user is speaking. The wearable device is configured to send a query result to the electronic device, where the query result includes the information indicating that the user is speaking. The electronic device is further configured to: when it is determined, based on the information indicating that the user is speaking, that the user is speaking, enter a wake-up state.

Voice Wake-Up Method, Electronic Device, Wearable Device, and System
20230239800 · 2023-07-27 ·

A voice wake-up method, an electronic device, and a wearable device. The system includes the electronic device and the wearable device. The electronic device communicates with the wearable device through a short-distance wireless connection, and the electronic device is configured to: collect a voice signal in an environment in which the electronic device is located; and when the voice signal meets a preset condition, send a query request to the wearable device, where the query request is used to request information indicating that a user is speaking. The wearable device is configured to send a query result to the electronic device, where the query result includes the information indicating that the user is speaking. The electronic device is further configured to: when it is determined, based on the information indicating that the user is speaking, that the user is speaking, enter a wake-up state.

COMMUNICATION SYSTEM AND EVALUATION METHOD

A communication system is configured to broadcast utterance voice data received from one of mobile communication terminals to other mobile communication terminals, to control text delivery such that a result of utterance voice recognition from voice recognition processing on the received utterance voice data is displayed on the mobile communication terminals in synchronization, and to use the result of utterance voice recognition to perform communication evaluation. The communication evaluation includes a first evaluation including evaluating a dialogue between users based on a group dialogue index to produce group communication evaluation information, a second evaluation including evaluating utterances constituting the dialogue between the users based on a personal utterance index to produce personal utterance evaluation information, and a third evaluation including using the group communication evaluation information and the personal utterance evaluation information to produce entire communication group evaluation information.

COMMUNICATION SYSTEM AND EVALUATION METHOD

A communication system is configured to broadcast utterance voice data received from one of mobile communication terminals to other mobile communication terminals, to control text delivery such that a result of utterance voice recognition from voice recognition processing on the received utterance voice data is displayed on the mobile communication terminals in synchronization, and to use the result of utterance voice recognition to perform communication evaluation. The communication evaluation includes a first evaluation including evaluating a dialogue between users based on a group dialogue index to produce group communication evaluation information, a second evaluation including evaluating utterances constituting the dialogue between the users based on a personal utterance index to produce personal utterance evaluation information, and a third evaluation including using the group communication evaluation information and the personal utterance evaluation information to produce entire communication group evaluation information.

DIALOGUE APPARATUS, METHOD AND PROGRAM

A dialogue apparatus includes a speech recognition unit (1) configured to perform speech recognition on utterance input to generate a text corresponding to the utterance, a speech waveform corresponding to the utterance, and information regarding a length of sound of the utterance; a language understanding unit (2) configured to grasp contents of the utterance by using the text corresponding to the utterance; a dialogue management unit (3) configured to determine contents of a response corresponding to the utterance by using the content of the utterance; an utterance state extraction unit (4) configured to extract a state of the utterance by using the text corresponding to the utterance, the speech waveform corresponding to the utterance, and the information regarding the length of the sound of the utterance; a response state determination unit (5) configured to determine a state of the response according to the state of the utterance; a response sentence generation unit (6) configured to generate a response sentence by using the content of the response; and a speech synthesis unit (7) configured to synthesize speech corresponding to the response sentence with the state of the response taken into account.

DIALOGUE APPARATUS, METHOD AND PROGRAM

A dialogue apparatus includes a speech recognition unit (1) configured to perform speech recognition on utterance input to generate a text corresponding to the utterance, a speech waveform corresponding to the utterance, and information regarding a length of sound of the utterance; a language understanding unit (2) configured to grasp contents of the utterance by using the text corresponding to the utterance; a dialogue management unit (3) configured to determine contents of a response corresponding to the utterance by using the content of the utterance; an utterance state extraction unit (4) configured to extract a state of the utterance by using the text corresponding to the utterance, the speech waveform corresponding to the utterance, and the information regarding the length of the sound of the utterance; a response state determination unit (5) configured to determine a state of the response according to the state of the utterance; a response sentence generation unit (6) configured to generate a response sentence by using the content of the response; and a speech synthesis unit (7) configured to synthesize speech corresponding to the response sentence with the state of the response taken into account.

METHOD FOR PROCESSING AN AUDIO STREAM AND CORRESPONDING SYSTEM

A method and a system for processing an audio stream are described, wherein at least one database of classified voices and at least one database of classified background sounds are provided and a comparison between these classified voices and background sounds with the voices and the sounds extrapolated from a suitably re-processed audio stream is carried out in order to identify possible matches.

Voice controlled assistant with coaxial speaker and microphone arrangement
11521624 · 2022-12-06 · ·

A voice controlled assistant has a housing to hold one or more microphones, one or more speakers, and various computing components. The housing has an elongated cylindrical body extending along a center axis between a base end and a top end. The microphone(s) are mounted in the top end and the speaker(s) are mounted proximal to the base end. The microphone(s) and speaker(s) are coaxially aligned along the center axis. The speaker(s) are oriented to output sound directionally toward the base end and opposite to the microphone(s) in the top end. The sound may then be redirected in a radial outward direction from the center axis at the base end so that the sound is output symmetric to, and equidistance from, the microphone(s).