G10L2015/086

Vehicular voice recognition system and method for controlling the same

A vehicular voice recognition system for inferring an intention of a user includes: a storage storing an instruction use history, service use pattern information, and a service preferring item; a controller receiving an input instruction of the user and performing at least one of: a first inference operation of determining a service domain among a plurality of service domains corresponding to the input instruction and providing a service in the determined service domain, a second inference operation of providing a service based on the stored instruction use history, a third inference operation of providing a service based on the stored service use pattern information, and a fourth inference operation of providing a service based on the stored service preferring item; and an output unit provided in a vehicle outputting contents of the provided service using at least one of audio and images.

METHOD FOR SPEECH RECOGNITION DICTATION AND CORRECTION BY SPELLING INPUT, SYSTEM AND STORAGE MEDIUM
20190279623 · 2019-09-12 ·

One aspect of the present disclosure provides a method for speech recognition dictation and correction by spelling input, which is implemented in a system including a terminal. The method includes transforming a speech signal received by the terminal into a speech recognition result. Whether the speech recognition result includes spelling input is determined, and a setting is identified according to a first speech recognition result and the speech recognition result including the spelling input. In response to a correction setting in which the spelling input includes a correction content, the first speech recognition result is modified according to the correction content into an edited speech recognition input, and the edited speech recognition input is displayed on a user interface of the terminal. Accordingly, the speech recognition correction is achieved by spelling input. Another aspect of the present application provides related system and storage medium implementing embodiments of the disclosed method.

Method, apparatus, and computer-readable recording medium for improving at least one semantic unit set
10395645 · 2019-08-27 · ·

A method, system, and a computer-readable recording medium for improving a set of at least one semantic unit are provided. According to the present invention, a set of at least one semantic unit may be improved by using a phonetic sound or text.

Voice and textual interface for closed-domain environment

An improved system and method is disclosed for receiving a spoken or written utterance, identifying and replacing certain words within the utterance with labels to generate a simplified text string representing the utterance, performing intent classification based on the simplified text string, and performing an action based on the intent classification and the original words that were replaced.

MULTI-CHANNEL VOICE RECOGNITION FOR A VEHICLE ENVIRONMENT

A method and device for providing voice command operation in a passenger vehicle cabin having multiple occupants are disclosed. The method and device operate to monitor microphone data relating to voice commands within a vehicle cabin and determine whether the microphone data includes wake-up-word data. When the wake-up-word data relates to more than one of a plurality of vehicle cabin zones and more than one wake-up-words are coincident, the method and device operate to monitor respective microphone data for voice command data from each of the more than one of the respective ones of the plurality of vehicle cabin zones. Upon detection, the voice command data may be processed to produce respective vehicle device commands and the vehicle device command(s) can be transmitted to effect the voice command data.

INFORMATION PROCESSING DEVICE AND INFORMATION PROCESSING METHOD
20190237075 · 2019-08-01 · ·

An information processing device includes: a first reception unit configured to receive an input of one or more characters; a second reception unit configured to receive an input of voice; and a voice recognition unit configured to recognize the voice, and output a voice recognition result beginning with the one or more characters entered into the first reception unit when the second reception unit receives the input of voice with the input of the one or more characters received by the first reception unit.

VEHICULAR VOICE RECOGNITION SYSTEM AND METHOD FOR CONTROLLING THE SAME
20190115015 · 2019-04-18 ·

A vehicular voice recognition system for inferring an intention of a user includes: a storage storing an instruction use history, service use pattern information, and a service preferring item; a controller receiving an input instruction of the user and performing at least one of: a first inference operation of determining a service domain among a plurality of service domains corresponding to the input instruction and providing a service in the determined service domain, a second inference operation of providing a service based on the stored instruction use history, a third inference operation of providing a service based on the stored service use pattern information, and a fourth inference operation of providing a service based on the stored service preferring item; and an output unit provided in a vehicle outputting contents of the provided service using at least one of audio and images.

VOICE AND TEXTUAL INTERFACE FOR CLOSED-DOMAIN ENVIRONMENT
20190088254 · 2019-03-21 · ·

An improved system and method is disclosed for receiving a spoken or written utterance, identifying and replacing certain words within the utterance with labels to generate a simplified text string representing the utterance, performing intent classification based on the simplified text string, and performing an action based on the intent classification and the original words that were replaced.

Allowing spelling of arbitrary words

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for natural language processing. One of the methods includes receiving a first voice input from a user device; generating a first recognition output; receiving a user selection of one or more terms in the first recognition output; receiving a second voice input spelling a correction of the user selection; determining a corrected recognition output for the selected portion; and providing a second recognition output that merges the first recognition output and the corrected recognition output.

METHOD OF CREATING ANIMATED IMAGE BASED ON KEY INPUT, AND USER TERMINAL FOR PERFORMING THE METHOD

A method of creating an animated image based on a key input, and a user terminal for performing the method are provided. The method includes acquiring a snapshot image using a camera installed in a user terminal every time a key is input to the user terminal, and creating an animated image by merging the acquired snapshot image with the input key.