Patent classifications
G06F40/289
MULTI-MODEL APPROACH TO NATURAL LANGUAGE PROCESSING AND RECOMMENDATION GENERATION
In some implementations, a device may monitor a set of data sources to generate a set of language models corresponding to the set of data sources. The device may determine a plurality of sets of keyword groups. The device may generate a plurality of sets of skill catalogs. The device may receive a source document for processing. The device may process the source document to extract a key phrase set and to determine a first similarity distance. The device may select a corresponding skill catalog and an associated language model based on a relevancy value. The device may determine second similarity distances between the source document and one or more target documents using the corresponding skill catalog and the associated language model. The device may output information associated with one or more target documents based at least in part on the second similarity distances.
MULTI-MODEL APPROACH TO NATURAL LANGUAGE PROCESSING AND RECOMMENDATION GENERATION
In some implementations, a device may monitor a set of data sources to generate a set of language models corresponding to the set of data sources. The device may determine a plurality of sets of keyword groups. The device may generate a plurality of sets of skill catalogs. The device may receive a source document for processing. The device may process the source document to extract a key phrase set and to determine a first similarity distance. The device may select a corresponding skill catalog and an associated language model based on a relevancy value. The device may determine second similarity distances between the source document and one or more target documents using the corresponding skill catalog and the associated language model. The device may output information associated with one or more target documents based at least in part on the second similarity distances.
Algorithm for scoring partial matches between words
Techniques are disclosed relating to scoring partial matches between words. In certain embodiments, a method may include receiving a request to determine a similarity between an input text data and a stored text data. The method also includes determining, based on comparing one or more words included in the input text data with one or more words included in the stored text data, a set of word pairs and a set of unpaired words. Further, in response to determining that the set of unpaired words passes elimination criteria, the method includes calculating a base similarity score between the input text data and the stored text data based on the set of word pairs. The method also includes determining a scoring penalty based on the set of unpaired words and generating a final similarity score between the input text data and the stored text data by modifying the base similarity score based on the scoring penalty.
Algorithm for scoring partial matches between words
Techniques are disclosed relating to scoring partial matches between words. In certain embodiments, a method may include receiving a request to determine a similarity between an input text data and a stored text data. The method also includes determining, based on comparing one or more words included in the input text data with one or more words included in the stored text data, a set of word pairs and a set of unpaired words. Further, in response to determining that the set of unpaired words passes elimination criteria, the method includes calculating a base similarity score between the input text data and the stored text data based on the set of word pairs. The method also includes determining a scoring penalty based on the set of unpaired words and generating a final similarity score between the input text data and the stored text data by modifying the base similarity score based on the scoring penalty.
Method, system and computer-readable medium for information retrieval
In a computer-implemented method for information retrieval and a processing system of a computer-implemented information retrieval system, an input text is received by a Natural Language Processing, NLP, suite, wherein the NLP suite comprises a plurality of models. At least one of the plurality of models is a model trained using selected features. The selected features are determined using a feature selection process. The input text is processed by each one of the plurality of models. An intermediate representation of the input text is generated by each one of the plurality of models. An enhanced representation of the input text is generated by combining a plurality of the generated intermediate representations. Information is retrieved based on the enhanced representation of the input text.
Method, system and computer-readable medium for information retrieval
In a computer-implemented method for information retrieval and a processing system of a computer-implemented information retrieval system, an input text is received by a Natural Language Processing, NLP, suite, wherein the NLP suite comprises a plurality of models. At least one of the plurality of models is a model trained using selected features. The selected features are determined using a feature selection process. The input text is processed by each one of the plurality of models. An intermediate representation of the input text is generated by each one of the plurality of models. An enhanced representation of the input text is generated by combining a plurality of the generated intermediate representations. Information is retrieved based on the enhanced representation of the input text.
Detecting system events based on user sentiment in social media messages
Methods and systems are disclosed herein for using anomaly detection in timeseries data of user sentiment to detect incidents in computing systems and identify events within an enterprise. An anomaly detection system may receive social media messages that include a timestamp indicating when each message was published. The system may generate sentiment identifiers for the social media messages. The sentiment identifiers and timestamps associated with the social media messages may be used to generate a timeseries dataset for each type of sentiment identifier. The timeseries datasets may be input into an anomaly detection model to determine whether an anomaly has occurred. The system may retrieve textual data from the social media messages associated with the detected anomaly and may use the text to determine a computing system or event associated with the detected anomaly.
Detecting system events based on user sentiment in social media messages
Methods and systems are disclosed herein for using anomaly detection in timeseries data of user sentiment to detect incidents in computing systems and identify events within an enterprise. An anomaly detection system may receive social media messages that include a timestamp indicating when each message was published. The system may generate sentiment identifiers for the social media messages. The sentiment identifiers and timestamps associated with the social media messages may be used to generate a timeseries dataset for each type of sentiment identifier. The timeseries datasets may be input into an anomaly detection model to determine whether an anomaly has occurred. The system may retrieve textual data from the social media messages associated with the detected anomaly and may use the text to determine a computing system or event associated with the detected anomaly.
Systems and methods for response selection in multi-party conversations with dynamic topic tracking
Embodiments described herein provide a dynamic topic tracking mechanism that tracks how the conversation topics change from one utterance to another and use the tracking information to rank candidate responses. A pre-trained language model may be used for response selection in the multi-party conversations, which consists of two steps: (1) a topic-based pre-training to embed topic information into the language model with self-supervised learning, and (2) a multi-task learning on the pretrained model by jointly training response selection and dynamic topic prediction and disentanglement tasks.
Method and device for keyword extraction and storage medium
A method and device for keyword extraction and a storage medium. The method includes receiving, at a terminal, an original document, acquiring, at the terminal, a candidate set by extracting at least one candidate phrase from the original document, acquiring, at the terminal, an association degree between the at least one candidate phrase in the candidate set and the original document, acquiring, at the terminal, a divergence degree of the at least one candidate phrase in the candidate set, and updating, at the terminal, a key phrase set of the original document by selecting the at least one candidate phrase from the candidate set as at least one key phrase based on the association degree and the divergence degree.