G10L2015/086

AUTOMATED WORD CORRECTION IN SPEECH RECOGNITION SYSTEMS
20210272550 · 2021-09-02 ·

Systems and methods for correcting recognition errors in speech recognition systems are disclosed herein. Natural conversational variations are identified to determine whether a query intends to correct a speech recognition error or whether the query is a new command. When the query intends to correct a speech recognition error, the system identifies a location of the error and performs the correction. The corrected query can be presented to the user or be acted upon as a command for the system.

AUTOMATED WORD CORRECTION IN SPEECH RECOGNITION SYSTEMS
20230410792 · 2023-12-21 ·

Systems and methods for correcting recognition errors in speech recognition systems are disclosed herein. Natural conversational variations are identified to determine whether a query intends to correct a speech recognition error or whether the query is a new command. When the query intends to correct a speech recognition error, the system identifies a location of the error and performs the correction. The corrected query can be presented to the user or be acted upon as a command for the system.

METHOD AND DEVICE FOR PROVIDING INFORMATION
20210065703 · 2021-03-04 · ·

Disclosed are an information providing device and an information providing method, which provide information enabling a conversation with a user by executing an artificial intelligence (AI) algorithm and/or a machine learning algorithm in a 5G environment connected for Internet-of-Things. An information providing method according to one embodiment of the present disclosure includes gathering first situational information from a home monitoring device, gathering, from the first electronic device, second situational information corresponding to the first situational information, gathering, from the home monitoring device, third situational information containing a behavioral change of the user after gathering the first situational information, generating a spoken sentence to provide to the user on the basis of the first situational information to the third situational information, and converting the spoken sentence to spoken utterance information to be output to the user.

Voice assistant system, server apparatus, device, voice assistant method therefor, and program to be executed by computer

A voice assistant system includes a server apparatus performing voice assistant and a plurality of devices, in which the server apparatus and the devices are communicatively connected to each other. The plurality of devices each records the same user's speech through a microphone, and then transmits recorded data of the same user's speech to the server apparatus. The server apparatus receives the recorded data transmitted from each of the plurality of devices, and then voice-recognizes two or more of the received recorded data in accordance with a predetermined standard to thereby interpret the contents of the user's speech to perform the voice assistant.

Speech recognition system with interactive spelling function
10832675 · 2020-11-10 · ·

An interactive speech recognition system is provided for interactively interpreting a spoken phrase. The speech recognition system includes a phrase interpretation module which attempts to accurately interpret a spoken phrase by interpreting each individual term of the spoken phrase. A term interpretation module attempts to accurately interpret each individual term of the spoken phrase not accurately interpreted by the phrase interpretation module, by using a spoken spelling of the term provided by a user. An interactive spelling module attempts to interactively spell at least a portion of an individual term of the spoken phrase not accurately interpreted by the term interpretation module, by enabling a user to interactively select at least one individual character of the term of the spoken phrase from a plurality of characters.

ALLOWING SPELLING OF ARBITRARY WORDS

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for natural language processing One of the methods includes receiving a first voice input from a user device, generating a first recognition output, receiving a user selection of one or more terms in the first recognition output- receiving a second voice input spelling a correction of the user selection, determining a corrected recognition output for the selected portion; and providing a second recognition output that merges the first recognition output and the corrected recognition output.

EDITING OF WORD BLOCKS GENERATED BY MORPHOLOGICAL ANALYSIS ON A CHARACTER STRING OBTAINED BY SPEECH RECOGNITION
20200105270 · 2020-04-02 · ·

An apparatus displays, on a terminal that enables a touch operation, an edit screen on which a text including word blocks is edited, where the word blocks are generated by performing morphological analysis on a character string obtained by speech recognition. Upon reception of a scroll instruction to scroll the text, the apparatus shifts each of the word blocks displayed on the edit screen in a description direction of the text, based on the scroll instruction.

Allowing spelling of arbitrary words

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for natural language processing. One of the methods includes receiving a first voice input from a user device; generating a first recognition output; receiving a user selection of one or more terms in the first recognition output; receiving a second voice input spelling a correction of the user selection; determining a corrected recognition output for the selected portion; and providing a second recognition output that merges the first recognition output and the corrected recognition output.

Method of creating animated image based on key input, and user terminal for performing the method

A method of creating an animated image based on a key input, and a user terminal for performing the method are provided. The method includes acquiring a snapshot image using a camera installed in a user terminal every time a key is input to the user terminal, and creating an animated image by merging the acquired snapshot image with the input key.

SPEECH RECOGNITION SYSTEM WITH INTERACTIVE SPELLING FUNCTION
20200066265 · 2020-02-27 ·

An interactive speech recognition system is provided for interactively interpreting a spoken phrase. The speech recognition system includes a phrase interpretation module which attempts to accurately interpret a spoken phrase by interpreting each individual term of the spoken phrase. A term interpretation module attempts to accurately interpret each individual term of the spoken phrase not accurately interpreted by the phrase interpretation module, by using a spoken spelling of the term provided by a user. An interactive spelling module attempts to interactively spell at least a portion of an individual term of the spoken phrase not accurately interpreted by the term interpretation module, by enabling a user to interactively select at least one individual character of the term of the spoken phrase from a plurality of characters.