G10L15/14

Hybrid decoding using hardware and software for automatic speech recognition systems
11676585 · 2023-06-13 · ·

Embodiments describe a method for decoding speech including receiving speech input at an audio input device, generating speech data that is a digital representation of the speech input; extracting acoustic features of the speech data, assigning acoustic scores to the acoustic features, receiving data representing the acoustic features and the acoustic scores, decoding the data representing the acoustic features into a word, having a word score, by referencing a WFST language model, modifying the word score into a new word score based on a personalized grammar model stored in the external memory device, the processor is separate from and external to the WFST accelerator, and determining an intent represented by a plurality of words outputted by the WFST accelerator, where the plurality of words include the word and the new word score.

Hybrid decoding using hardware and software for automatic speech recognition systems
11676585 · 2023-06-13 · ·

Embodiments describe a method for decoding speech including receiving speech input at an audio input device, generating speech data that is a digital representation of the speech input; extracting acoustic features of the speech data, assigning acoustic scores to the acoustic features, receiving data representing the acoustic features and the acoustic scores, decoding the data representing the acoustic features into a word, having a word score, by referencing a WFST language model, modifying the word score into a new word score based on a personalized grammar model stored in the external memory device, the processor is separate from and external to the WFST accelerator, and determining an intent represented by a plurality of words outputted by the WFST accelerator, where the plurality of words include the word and the new word score.

Systems and methods for providing information discovery and retrieval
09824150 · 2017-11-21 ·

This invention relates generally to software and computers, and more specifically, to systems and methods for providing information discovery and retrieval. In one embodiment, the invention includes a system for providing information discovery and retrieval, the system including a processor module, the processor module configurable to performing the steps of receiving an information request from a consumer device over a communications network; decoding the information request; discovering information using the decoded information request; preparing instructions for accessing the information; and communicating the prepared instructions to the consumer device, wherein the consumer device is configurable to retrieving the information for presentation using the prepared instructions.

Systems and methods for providing information discovery and retrieval
09824150 · 2017-11-21 ·

This invention relates generally to software and computers, and more specifically, to systems and methods for providing information discovery and retrieval. In one embodiment, the invention includes a system for providing information discovery and retrieval, the system including a processor module, the processor module configurable to performing the steps of receiving an information request from a consumer device over a communications network; decoding the information request; discovering information using the decoded information request; preparing instructions for accessing the information; and communicating the prepared instructions to the consumer device, wherein the consumer device is configurable to retrieving the information for presentation using the prepared instructions.

TWO-STAGE TRAINING OF A SPOKEN DIALOGUE SYSTEM

Described herein are systems and methods for two-stage training of a spoken dialogue system. The first stage trains a policy network using external data to produce a semi-trained policy network. The external data includes one or more known fixed dialogues. The second stage trains the semi-trained policy network through interaction to produce a trained policy network. The interaction may be interaction with a user simulator.

TWO-STAGE TRAINING OF A SPOKEN DIALOGUE SYSTEM

Described herein are systems and methods for two-stage training of a spoken dialogue system. The first stage trains a policy network using external data to produce a semi-trained policy network. The external data includes one or more known fixed dialogues. The second stage trains the semi-trained policy network through interaction to produce a trained policy network. The interaction may be interaction with a user simulator.

GENERATING EVENT OUTPUT
20220358930 · 2022-11-10 ·

A system is provided for determining subscription data when a user requests to receive an output in the future when an event occurs. The system may determine an output type based on the capabilities of the output device and a trigger type. The system may determine a trigger type based on the priority of the triggering event. The system may also determine how many times the subscription is to be executed. Using this information, the system creates the subscription so that the user may receive a notification or an announcement when an event occurs.

Data mining apparatus, method and system for speech recognition using the same
11495234 · 2022-11-08 · ·

A data mining device, and a speech recognition method and system using the same are disclosed. The speech recognition method includes selecting speech data including a dialect from speech data, analyzing and refining the speech data including a dialect, and learning an acoustic model and a language model through an artificial intelligence (AI) algorithm using the refined speech data including a dialect. The user is able to use a dialect speech recognition service which is improved using services such as eMBB, URLLC, or mMTC of 5G mobile communications.

Data mining apparatus, method and system for speech recognition using the same
11495234 · 2022-11-08 · ·

A data mining device, and a speech recognition method and system using the same are disclosed. The speech recognition method includes selecting speech data including a dialect from speech data, analyzing and refining the speech data including a dialect, and learning an acoustic model and a language model through an artificial intelligence (AI) algorithm using the refined speech data including a dialect. The user is able to use a dialect speech recognition service which is improved using services such as eMBB, URLLC, or mMTC of 5G mobile communications.

Speech Processing System And A Method Of Processing A Speech Signal
20220059076 · 2022-02-24 ·

A speech processing system for generating translated speech, the system comprising: an input for receiving a first speech signal comprising a second language; an output for outputting a second speech signal comprising a first language; and a processor configured to: generate a first text signal from a segment of the first speech signal, the first text signal comprising the second language; generate a second text signal from the first text signal, the second text signal comprising the first language; extract a plurality of first feature vectors from the segment of the first speech signal, wherein the first feature vectors comprise information relating to audio data corresponding to the segment of the first speech signal; generate a speaker vector using a first trained algorithm taking one or more of the first feature vectors as input, wherein the speaker vector represents a set of features corresponding to a speaker; generate a second speech signal segment using a second trained algorithm taking information relating to the second text signal as input and using the speaker vector, the second speech signal segment comprising the first language.