Patent classifications
G06F40/30
Analyzing documents using machine learning
A document analysis device that includes a memory operable to store a machine learning model configured to receive a sentence as an input and to output a classification identifier that is associated with a sentence type for the received sentence. The device further includes an artificial intelligence (AI) processing engine configured to receive a document comprising text, to sentences within the document, and to classify the sentences using the machine learning model. The AI processing engine is further configured to identify tagging rules for the document and to annotate one or more sentences from the document with a sentence type that matches a sentence type that is identified by the tagging rules for the document.
Intelligent call routing using knowledge graphs
A system and method for intelligently routing calls between customers and agents. The system and method use knowledge graphs to generate route recommendations for a route selection system. The system uses dynamically selected objective functions to generate the route recommendations. The objective functions may be selected according to the intent of the call. The system and method can also be used to reroute ongoing calls when the intent of the call changes.
Augmenting textual explanations with complete discourse trees
Systems, devices, and methods discussed herein provide improved autonomous agent applications that are configured to provide explanations in response to user-submitted questions. Training data comprising a question, and an explanation pair may be accessed. A discourse tree and an explanation chain can be constructed from the explanation. The explanation chain may identify logical relationships between two entities of elementary discourse units identified from the discourse tree. A query may be submitted for the two entities, and a set of search results can be mined to identify text linking the two entities. An additional discourse tree can be generated from the text of a search result. The additional discourse tree can be combined with the original discourse tree to generate a complete discourse tree. A model may be trained using this augmented data (e.g., the complete discourse tree) to improve the quality of explanations provided by the autonomous agent application.
Multi-stage adaptable continuous learning / feedback system for machine learning models
Data is received that specifies a term generated by user input in a graphical user interface. Thereafter, the term is looked up in a dictionary in which there are multiple classes for terms. The term can be classified based on a first class having a top ranked effective count for the term within the dictionary when a ratio of the first class relative to a second class having a second ranked effective count for the term in the dictionary is above a pre-defined threshold. In addition, the term is classified using a machine learning model when the ratio of the first class relative to the second class is below the pre-defined threshold. Data can be provided which characterizes the classifying. Related apparatus, systems, techniques and articles are also described.
Multi-stage adaptable continuous learning / feedback system for machine learning models
Data is received that specifies a term generated by user input in a graphical user interface. Thereafter, the term is looked up in a dictionary in which there are multiple classes for terms. The term can be classified based on a first class having a top ranked effective count for the term within the dictionary when a ratio of the first class relative to a second class having a second ranked effective count for the term in the dictionary is above a pre-defined threshold. In addition, the term is classified using a machine learning model when the ratio of the first class relative to the second class is below the pre-defined threshold. Data can be provided which characterizes the classifying. Related apparatus, systems, techniques and articles are also described.
Descriptor uniqueness for entity clustering
A mechanism is provided in a data processing system to implement a cognitive natural language processing (NLP) system with descriptor uniqueness identification to support named entity mention clustering. The mechanism annotates a set of documents from a corpus of documents for entity types and mentions, collects descriptor usages from all documents in the corpus of documents, analyzes the descriptor usages to classify the descriptors as base terms or modifier terms, generates compatibility scores for the descriptors, and performs entity merging of entity clusters based on the compatibility scores.
Machine learning based abbreviation expansion
Techniques are described herein for determining a long-form of an abbreviation using a machine learning based approach that takes into consideration both sequential context and structural context, where the long-form corresponds to a meaning of the abbreviation as used in a sequence of words that form a sentence. In some embodiments, word representations are generated for different words in the sequence of words, and a combined representation is generated for the abbreviation based on a word representation corresponding to the abbreviation, a sequential context representation, and a structural context representation. The sequential context representation can be generated based on word representations for words positioned near the abbreviation. The structural context representation can be generated based on word representations for words that are syntactically related to the abbreviation. The combined representation can be input to a classification neural network trained to output a label representing the long-form of the abbreviation.
Machine learning based abbreviation expansion
Techniques are described herein for determining a long-form of an abbreviation using a machine learning based approach that takes into consideration both sequential context and structural context, where the long-form corresponds to a meaning of the abbreviation as used in a sequence of words that form a sentence. In some embodiments, word representations are generated for different words in the sequence of words, and a combined representation is generated for the abbreviation based on a word representation corresponding to the abbreviation, a sequential context representation, and a structural context representation. The sequential context representation can be generated based on word representations for words positioned near the abbreviation. The structural context representation can be generated based on word representations for words that are syntactically related to the abbreviation. The combined representation can be input to a classification neural network trained to output a label representing the long-form of the abbreviation.
System for generating topic inference information of lyrics
A system for generating topic inference information of lyrics that can provide more useful for topic interpretation of lyrics. A device for learning topic numbers performs an operation of updating and learning topic numbers, which performs an operation of updating topic numbers on all of a plurality of lyrics data of each of a plurality of artists, for a predetermined number of times. The operation of updating topic numbers updates the topic number assigned to a given lyrics data of a given artist using a random number generator having a deviation of appearance probability corresponding to a probability distribution over topic numbers. An outputting device outputs the topic numbers of the plurality of lyrics data for each of the plurality artists, and a probability distribution over words for each of the topic numbers.
System for generating topic inference information of lyrics
A system for generating topic inference information of lyrics that can provide more useful for topic interpretation of lyrics. A device for learning topic numbers performs an operation of updating and learning topic numbers, which performs an operation of updating topic numbers on all of a plurality of lyrics data of each of a plurality of artists, for a predetermined number of times. The operation of updating topic numbers updates the topic number assigned to a given lyrics data of a given artist using a random number generator having a deviation of appearance probability corresponding to a probability distribution over topic numbers. An outputting device outputs the topic numbers of the plurality of lyrics data for each of the plurality artists, and a probability distribution over words for each of the topic numbers.