Patent classifications
G10L15/01
SPEECH RECOGNITION IN A VEHICLE
An audio sample including speech and ambient sounds is transmitted to a vehicle computer. Recorded audio is received from the vehicle computer, the recorded audio including the audio sample broadcast by the vehicle computer and recorded by the vehicle computer and recognized speech from the recorded audio. The recognized speech and text of the speech are input to a machine learning program that outputs whether the recognized speech matches the text. When the output from the machine learning program indicates that the recognized speech does not match the text, the recognized speech and the text are included in a training dataset for the machine learning program.
METHOD AND DEVICE FOR INFORMATION PROCESSING
An information processing method and an electronic device are provided. The method includes: obtaining audio data collected by a slave device; obtaining contextual data corresponding to the slave device; and obtaining a recognition result of recognizing the audio data based on the contextual data. The contextual data characterizes a voice environment of the audio data collected by the slave device.
METHOD AND DEVICE FOR INFORMATION PROCESSING
An information processing method and an electronic device are provided. The method includes: obtaining audio data collected by a slave device; obtaining contextual data corresponding to the slave device; and obtaining a recognition result of recognizing the audio data based on the contextual data. The contextual data characterizes a voice environment of the audio data collected by the slave device.
System and methods for chatbot and search engine integration
A system and method for chatbot and search engine integration comprising chatbot crawler engine configured to detect all possible paths through a conversational flow between a chatbot and a user, and also comprising a chatbot search integration manager configured to receive a processed conversation flow from the chatbot crawler engine, parse the conversation flow to identify keywords and features, and build an indexable data structure which can be integrated into search engines in order to expose the information and data contained within the chatbot's knowledge base. This integration may allow search engine users to be redirected to a website hosting the chatbot when an indexed data structure comprises information relevant to a search engine query.
SEMIAUTOMATED RELAY METHOD AND APPARATUS
A relay for captioning a hearing user's (HU's) voice signal during a phone call between an HU and a hearing assisted user (AU), the HU using an HU device and the AU using an AU device where the HU voice signal is transmitted from the HU device to the AU device, the relay comprising a display screen, a processor linked to the display and programmed to perform the steps of receiving the HU voice signal from the AU device, transmitting the HU voice signal to a remote automatic speech recognition (ASR) server running ASR software that converts the HU voice signal to ASR generated text, the remote ASR server located at a remote location from the relay, receiving the ASR generated text from the ASR server, present the ASR generated text for viewing by a call assistant (CA) via the display and transmitting the ASR generated text to the AU device.
SEMIAUTOMATED RELAY METHOD AND APPARATUS
A relay for captioning a hearing user's (HU's) voice signal during a phone call between an HU and a hearing assisted user (AU), the HU using an HU device and the AU using an AU device where the HU voice signal is transmitted from the HU device to the AU device, the relay comprising a display screen, a processor linked to the display and programmed to perform the steps of receiving the HU voice signal from the AU device, transmitting the HU voice signal to a remote automatic speech recognition (ASR) server running ASR software that converts the HU voice signal to ASR generated text, the remote ASR server located at a remote location from the relay, receiving the ASR generated text from the ASR server, present the ASR generated text for viewing by a call assistant (CA) via the display and transmitting the ASR generated text to the AU device.
System and method for automatic testing of conversational assistance
A voice recognition system includes a microphone configured to receive one or more spoken dialogue commands from a user in a voice recognition session. The system also includes a processor in communication with the microphone. The processor is configured to receive one or more audio files associated with one or more audio events associated with the voice recognition system, execute the one or more audio files in a voice recognition session in an audio event, and output a log report indicating a result of the audio events with the voice recognition session.
System and method for automatic testing of conversational assistance
A voice recognition system includes a microphone configured to receive one or more spoken dialogue commands from a user in a voice recognition session. The system also includes a processor in communication with the microphone. The processor is configured to receive one or more audio files associated with one or more audio events associated with the voice recognition system, execute the one or more audio files in a voice recognition session in an audio event, and output a log report indicating a result of the audio events with the voice recognition session.
FREE-FORM TEXT PROCESSING FOR SPEECH AND LANGUAGE EDUCATION
Methods, systems, and computer-readable storage media for providing reading performance feedback to a user from a voice recording of the user reading an arbitrary text. A target text comprising a text passage that a user intends to read and a user recording comprising an audio recording of the user reading the target text aloud are received from a user device. The user recording is converted to a user speech hypothesis comprising text corresponding to speech recognized in the audio recording. The user speech hypothesis is then compared to the target text to generate reading performance feedback comprising relevant differences between the speech in the user recording and the target text and the reading performance feedback is displayed to the user on the user device.
FREE-FORM TEXT PROCESSING FOR SPEECH AND LANGUAGE EDUCATION
Methods, systems, and computer-readable storage media for providing reading performance feedback to a user from a voice recording of the user reading an arbitrary text. A target text comprising a text passage that a user intends to read and a user recording comprising an audio recording of the user reading the target text aloud are received from a user device. The user recording is converted to a user speech hypothesis comprising text corresponding to speech recognized in the audio recording. The user speech hypothesis is then compared to the target text to generate reading performance feedback comprising relevant differences between the speech in the user recording and the target text and the reading performance feedback is displayed to the user on the user device.