Patent classifications
G10L2015/221
CONVERSATION SUPPORT DEVICE, CONVERSATION SUPPORT SYSTEM, CONVERSATION SUPPORT METHOD, AND STORAGE MEDIUM
In a conversation support device, a first voice recognition unit performs voice recognition processing on the basis of a voice signal and defines partial section text information for each partial section that is a part of an utterance section, a second voice recognition unit performs voice recognition processing on the basis of the voice signal and defines utterance section text information for each utterance section, an information integration unit integrates the partial section text information into the utterance section text information to generate integration text information, and an output processing unit outputs the integration text information to the display unit after outputting the partial section text information to the display unit.
OPERATION SUPPORT APPARATUS OF TRANSPORTATION MEANS, OPERATION SUPPORT METHOD OF TRANSPORTATION MEANS, AND RECORDING MEDIUM STORING OPERATION SUPPORT PROGRAM FOR TRANSPORTATION MEANS
An operation support apparatus includes: an acquisition unit that acquires an operation status of transportation means; a generation unit that performs processing of recognizing voices of a first staff who gives an instruction to operate the transportation means and a second staff who is instructed by the first staff, and generates character information obtained by converting the recognized voice into characters; a detection unit that performs syntax analysis on the character information and detects wrong recognition by the first or second staff; and a display control unit that displays the character information and the detection result of the wrong recognition by the detection unit on a display device visually recognizable by the first staff, thereby reducing occurrence of an accident due to the wrong recognition by the staff related to operation of the transportation means at a site where the transportation means is operated.
Audio-triggered augmented reality eyewear device
Systems, methods, and non-transitory computer readable media for augmenting scenes viewed thorough displays of an eyewear devices with audio-related image information. Scenes may be augmented by capturing, via a camera of the eyewear device, initial images of a scene, identifying features within the initial images; receiving audio-related image information (e.g., lyrics and/or images), registering the audio-related image information to the identified features, creating audio-based visual overlays including the audio-related image information registered to the identified features, and displaying the audio-based visual overlays over the scene.
Accelerating agent performance in a natural language processing system
A computer-implemented method for providing agent assisted transcriptions of user utterances. A user utterance is received in response to a prompt provided to the user at a remote client device. An automatic transcription is generated from the utterance using a language model based upon an application or context, and presented to a human agent. The agent reviews the transcription and may replace at least a portion of the transcription with a corrected transcription. As the agent inputs the corrected transcription, accelerants are presented to the user comprising suggested texted to be inputted. The accelerants may be determined based upon an agent input, an application or context of the transcription, the portion of the transcription being replaced, or any combination thereof. In some cases, the user provides textual input, to which the agent transcribes an intent associated with the input with the aid of one or more accelerants.
Speech recognition device, speech recognition method, and recording medium
A speech recognition device includes: an obtaining unit which obtains a speech uttered in a conversation between a first speaker and a second speaker; a storage which stores the speech obtained; an input unit which receives operation input; an utterance start detector which, when the input unit receives the operation input, detects a start position of the speech; and a speaker identification unit which identifies a speaker of the speech as the first speaker who has performed the operation input or the second speaker who has not performed the operation input, based on (i) first timing at which the input unit has received the operation input and (ii) second timing indicating the detected start position of the speech. The first and second timing are set for each speech of the first and second speakers. A speech recognizer performs speech recognition on the speech whose speaker has been identified.
SYSTEMS AND METHODS FOR VOICE-ASSISTED MEDIA CONTENT SELECTION
Systems and methods for media playback via a media playback system include (i) capturing a voice input comprising a request for media content, (ii) receiving information derived at least from the request for media content, (iii) requesting and receiving information from at least one remote computing device associated with a first media content service and at least one remote computing device associated with a second media content service, wherein (a) the information identifies first media content available via the first media content service for playback and identifies second media content available via the second media content service for playback, and (b) the first and second media content are related to the requested media content, and (iv) after receiving at least one of the first information and the second information, (a) selecting the first media content instead of the second media content, and (b) playing back the first media content.
PARTIAL COMPLETION OF COMMAND BY DIGITAL ASSISTANT
One embodiment provides a method, the method including: receiving, at a digital assistant, a command from a user, wherein the command includes a high confidence portion and a low confidence portion; determining, at the digital assistant, that at least a part of the output corresponding to the command can be performed by the digital assistant based upon the high confidence portion; and performing, using the digital assistant and responsive to determining the at least a part of the output of the command can be performed by the digital assistant, the at least a part of the output of the command.
Method for recognizing voice and electronic device supporting the same
An electronic device is provided. The electronic device includes a microphone, a display, a camera, a processor, and a memory. The processor is configured to receive a first utterance input through the microphone. The processor is also configured to obtain first recognized data from a first image displayed on the display or stored in the memory. The processor is further configured to store the first recognized data in association with the first utterance input when the obtained first recognized data matches the first utterance input. Additionally, the processor is configured to activate the camera when the first recognized data does not match the first utterance input. The processor is also configured to obtain second recognized data from a second image collected through the camera and store the second recognized data in association with the first utterance input when the obtained second recognized data matches the first utterance input.
Information processing device, information processing method, and information processing system
The present technology relates to an information processing device, an information processing method, and an information processing system that are capable of establish smooth and natural conversation with a person who has difficulty in hearing. The information processing device includes a sound acquisition unit that acquires sound information of a first user that is input to a sound input device and a display control unit that controls display of text information on a display device for a second user, the text information corresponding to the acquired sound information. The display control unit performs control related to display amount of the text information on the display device on the basis of at least one of the display amount of the text information on the display device or input amount of the sound information input through the sound input device.
Intelligent voice recognizing method, apparatus, and intelligent computing device
Disclosed are an intelligent voice recognizing method, a voice recognizing device, and an intelligent computing device. According to an embodiment of the present invention, an intelligent voice recognizing method of a voice recognizing device may obtain a microphone detection signal, recognize a user's voice from the microphone detection signal based on a pre-learned speech recognition model, output information related to a result of recognition of the user's voice, and update the speech recognition model based on the output speech recognition result information, easily updating the speech recognition model for speech recognition based on the speech recognition result information which is intuitively shown to the user. According to the present invention, one or more of the voice recognizing device, intelligent computing device, and server may be related to artificial intelligence (AI) modules, unmanned aerial vehicles (UAVs), robots, augmented reality (AR) devices, virtual reality (VR) devices, and 5G service-related devices.