Patent classifications
G06F16/3329
Search indexing using discourse trees
Systems, devices, and methods of the present invention create a searchable index that includes informative portions of text. In an example, a computer-implemented method creates a discourse tree from a body of text. For each non-terminal node in the discourse tree, the method identifies a rhetorical relationship associated with the non-terminal node. The method labels each terminal node associated with the non-terminal node as either a nucleus or a satellite. The method further accesses a rule associated with the rhetorical relationship, and selects, based on the rule, selects the fragment associated with the nucleus. The method creates a searchable index including the selected fragments.
NATURAL LANGUAGE PROCESSING COMPREHENSION AND RESPONSE SYSTEM AND METHODS
An automatic, system-generated, multi-faceted comprehension and response capability, using Natural Language Processing, to provide value specific answers from available unstructured data, documents and text. Questions and queries are interpreted by the system's capability to determine the type of questions and provide a response or answer based on the data or information available. If the answer is in the ingested data, a response is provided that is either; a list of documents, a list of document snippets with the answer contained in the snippets, a formalized and templated response, or a highly relevant hand curated response.
QUESTION-AND-ANSWER PROCESSING METHOD, ELECTRONIC DEVICE AND COMPUTER READABLE MEDIUM
The embodiment of the present disclosure provides a question-and-answer processing method, including: acquiring a to-be-answered question; determining standard questions meeting a preset condition as a plurality of candidate standard questions, from a plurality of preset standard questions, according to a text similarity with the to-be-answered question, based on a text statistical algorithm; determining, a candidate standard question with the highest semantic similarity with the to-be-answered question as a matching standard question, from the plurality of candidate standard questions, based on a deep text matching algorithm; and determining an answer to the to-be-answered question at least according to the matching standard question. The embodiment of the present disclosure also provides an electronic device and a computer readable medium.
ANSWER GENERATION USING MACHINE READING COMPREHENSION AND SUPPORTED DECISION TREES
Systems, devices, and methods discussed herein are directed to generating an answer to an input query using machine reading comprehension techniques and a lattice of supported decision trees. A supported decision tree can be generated from the various decision chains (e.g., a sequence of elements comprising a premise and a decision connected by rhetorical relationships), where the nodes of the decision tree are identified from the plurality of decision chains and ordered based on a set of predefined priority rules. A lattice may include nodes that individually correspond to a respective supported decision tree. Nodes of the lattice may be identified for an input query. The passages corresponding to those nodes may be obtained and an answer for the query may be generated from the obtained passages using machine reading comprehension techniques. The generated answer may be provided in response to the query.
Preparing documents for coreference analysis
Unstructured text is identified as larger than a threshold size. Named-entity recognition analysis is executed on the unstructured text. One or more anchor entities of the unstructured text are determined that each occur more than a threshold amount of times within the unstructured text. Two or more instances of the one or more anchor entities that are separated by at least a threshold amount of text of the unstructured text are identified. The unstructured text is partitioned into at least three sections. The unstructured text is partitioned at respective natural language demarcation points associated with each of the two or more instances such that each of the at least three sections is smaller than the threshold size. Separate coreference analyses are performed in parallel on each of the at least three sections.
Semantic cluster formation in deep learning intelligent assistants
Enhanced techniques and circuitry are presented herein for providing responses to questions from among digital documentation sources spanning various documentation formats, versions, and types. One example includes a method comprising receiving an indication of a question directed to subject having a documentation corpus, determining a set of passages of the documentation corpus related to the question, ranking the set of passages according to relevance to the question, forming semantic clusters comprising sentences extracted from ranked ones of the set of passages according to sentence similarity, and providing a response to the question based at least on a selected semantic cluster.
System answering of user inputs
Techniques for structuring knowledge bases specific to a user or group of users and techniques for using the knowledge bases to answer user inputs are described. A knowledge base may be populated with information provided by users associated with the knowledge base. Users associated with a knowledge base may be proactive in providing content to the knowledge base and/or a system may solicit an answer to a user input from users associated with a particular knowledge base. When the system receives an answer, the system may populate the knowledge base with the answer and may output the answer to the user that originated the user input. The system may output user inputs to be answered using messages or by establishing two-way communication sessions.
Systems and methods for coverage analysis of textual queries
A computer based system and method for assigning queries to topics and/or visualizing or analyzing query coverage may include, using a computer processor, searching, using a set of queries, over a set of text documents, to produce for each query a set of search results for the query. Each search result may include a subset of text from a text document of the set of text documents. For each query, a query vector may be calculated based on the set of search results for the query, and for each of a set of topics describing the set of text documents, a topic vector may be calculated. A report or visualization may be generated including the set of queries and the set of topics using the topic vectors and the query vectors.
Descriptor uniqueness for entity clustering
A mechanism is provided in a data processing system to implement a cognitive natural language processing (NLP) system with descriptor uniqueness identification to support named entity mention clustering. The mechanism annotates a set of documents from a corpus of documents for entity types and mentions, collects descriptor usages from all documents in the corpus of documents, analyzes the descriptor usages to classify the descriptors as base terms or modifier terms, generates compatibility scores for the descriptors, and performs entity merging of entity clusters based on the compatibility scores.
Contextual feedback, with expiration indicator, to a natural understanding system in a chat bot
A chat bot computing system includes a bot controller and a natural language processor. The natural language processor receives a first textual input and identifies concepts represented by the first textual input. An indication of the concepts is output to the bot controller which generates a response to the first textual input. The concepts output by the natural language processor are also fed back into the input to the natural language processor, as context information, along with an expiration indicator when a second textual input is received. The natural language processor then identifies concepts represented in the second textual input, based on the second natural language, textual input and unexpired context information.