G10L2015/0631

INTELLIGENT EXPANDING SIMILAR WORD MODEL SYSTEM AND METHOD THEREOF
20230117438 · 2023-04-20 ·

An intelligent expanding similar word model system and a method thereof are provided. The system is operated in a database system host and includes: a character analysis unit, configured to combine a plurality of key word acoustic models with an interference sound key word test set into a key word forward test module; a candidate word generation unit, configured to generate a plurality of candidate word temporary acoustic models; a recognition rate processing unit, configured to generate a first candidate word acoustic model; a false waking-up rate processing unit, configured to generate a second candidate word acoustic model; and an adjustment unit, configured to combine the plurality of key word acoustic models with the second candidate word acoustic model into a similar word acoustic model.

SYSTEMS AND METHODS FOR UNSUPERVISED STRUCTURE EXTRACTION IN TASK-ORIENTED DIALOGUES
20230120940 · 2023-04-20 ·

Embodiments described herein propose an approach for unsupervised structure extraction in task-oriented dialogues. Specifically, a Slot Boundary Detection (SBD) module is adopted, for which utterances from training domains are tagged with the conventional BIO schema but without the slot names. A transformer-based classifier is trained to detect the boundary of potential slot tokens in the test domain. Next, while the state number is usually unknown, it is more reasonable to assume the slot number is given when analyzing a dialogue system. The detected tokens are clustered into the number of slot of groups. Finally, the dialogue state is represented with a vector recording the modification times of every slot. The slot values are then tracked through each dialogue session in the corpus and label utterances with their dialogue states accordingly. The semantic structure is portrayed by computing the transition frequencies among the unique states.

System and method for detecting cognitive decline using speech analysis

System and method for detecting cognitive decline in a subject using a classification system for detecting cognitive decline in the subject based on a speech sample. The classification system is trained using speech data corresponding to audio recordings of speech from normal and cognitive decline patients to generate an ensemble classifier comprising a plurality of component classifiers and an ensemble module. Each of the plurality of component classifiers is a machine-learning classifier configured to generate a component output identifying a sample data as corresponding to a normal patient or a cognitive patient. The machine-learning classifier is generated based on a subset of available features. The ensemble module receives component outputs from all of the component classifiers and generates an ensemble output identifying the sample data as corresponding to a normal or cognitive decline patient based on the component outputs.

APPARATUS AND METHOD FOR AUTOMATIC GENERATION AND UPDATE OF KNOWLEDGE GRAPH FROM MULTI-MODAL SOURCES
20230065468 · 2023-03-02 ·

The present invention provides an apparatus and method for automatic generation and update of a knowledge graph from multi-modal sources. The apparatus comprises a conversation parsing module configured for updating a dynamic information word set V.sub.D with labelled words generated from extracted from the multi-modal sources; updating a static information word set V.sub.S based on extracted schema of relations extracted from the multi-modal sources; and generating pairs of question and answer based on the dynamic information word set V.sub.D, the static information word set V.sub.S and the one or more sentence patterns; and a knowledge graph container configured for updating a knowledge graph based on the extracted entities of interest and schema of relations. Therefore, an efficient and cost-effective way for question decomposition, query chain construction and entity association from unstructured data is achieved.

Audio translator
11605369 · 2023-03-14 · ·

Audio translation system includes a feature extractor and a style transfer machine learning model. The feature extractor generates for each of a plurality of source voice files one or more source voice parameters encoded as a collection of source feature vectors, and generates for each of a plurality of target voice files one or more target voice parameters encoded as a collection of target feature vectors. The style transfer machine learning model trained on the collection of source feature vectors for the plurality of source voice files and the collection of target feature vectors for the plurality of target voice files to generate a style transformed feature vector.

Multi-turn dialogue response generation with template generation

Machine classifiers in accordance with embodiments of the invention capture long-term temporal dependencies in particular tasks, such as turn-based dialogues. Machine classifiers may be used to help users to perform tasks indicated by the user. When a user utterance is received, natural language processing techniques may be used to understand the user's intent. Templates may be determined based on the user's intent in the generation of responses to solicit information from the user. A variety of persona attributes may be determined for a user. The persona attributes may be determined based on the user's utterances and/or provided as metadata included with the user's utterances. A response persona may be used to generate responses to the user's utterances such that the generated responses match a tone appropriate to the task. A response persona may be used to generate templates to solicit additional information and/or generate responses appropriate to the task.

Pattern recognition robust to influence of a transfer path
11620985 · 2023-04-04 · ·

A pattern recognition apparatus includes: a model storage part that stores a model(s) generated by using transfer path information indicating a difference of transfer paths of a signal(s) for training, additional to the signal(s) for training, and a pattern recognition part that inputs an input signal and transfer path information indicating a difference of transfer paths of the input signal, and performs pattern recognition of the input signal by using the model(s).

Multi-turn dialogue response generation with persona modeling

Machine classifiers in accordance with embodiments of the invention capture long-term temporal dependencies in particular tasks, such as turn-based dialogues. Machine classifiers may be used to help users to perform tasks indicated by the user. When a user utterance is received, natural language processing techniques may be used to understand the user's intent. Templates may be determined based on the user's intent in the generation of responses to solicit information from the user. A variety of persona attributes may be determined for a user. The persona attributes may be determined based on the user's utterances and/or provided as metadata included with the user's utterances. A response persona may be used to generate responses to the user's utterances such that the generated responses match a tone appropriate to the task. A response persona may be used to generate templates to solicit additional information and/or generate responses appropriate to the task.

DATA AUGMENTATION FOR INTENT CLASSIFICATION
20230141398 · 2023-05-11 ·

The present disclosure relates to a data augmentation system and method that uses a large pre-trained encoder language model to generate new, useful intent samples from existing intent samples without fine-tuning. In certain embodiments, for a given class (intent), a limited number of sample utterances of a seed intent classification dataset may be concatenated and provided as input to the encoder language model, which may generate new sample utterances for the given class (intent). Additionally, when the augmented dataset is used to fine-tune an encoder language model of an intent classifier, this technique improves the performance of the intent classifier.

MULTI-TURN DIALOGUE RESPONSE GENERATION WITH AUTOREGRESSIVE TRANSFORMER MODELS
20230206005 · 2023-06-29 ·

Machine classifiers in accordance with embodiments of the invention capture long-term temporal dependencies in the dialogue data better than the existing RNN-based architectures. Additionally, machine classifiers may model the joint distribution of the context and response as opposed to the conditional distribution of the response given the context as employed in sequence-to-sequence frameworks. Machine classifiers in accordance with embodiments further append random paddings before and/or after the input data to reduce the syntactic redundancy in the input data, thereby improving the performance of the machine classifiers for a variety of dialogue-related tasks. The random padding of the input data may further provide regularization during the training of the machine classifier and/or reduce exposure bias. In a variety of embodiments, the input data may be encoded based on subword tokenization.