G06F16/33

METHOD AND APPARATUS FOR QUERYING QUESTIONS, DEVICE, AND STORAGE MEDIUM

Provided is a method for querying questions. The method includes: acquiring input information of a user; acquiring intention information of the user based on the input information of the user; determining an answer generation rule; and generating, based on the input information and the intention information, a first answer in accordance with the answer generation rule, and providing the first answer to the user.

AUTOMATED INTEROPERATIONAL TRACKING IN COMPUTING SYSTEMS
20230040862 · 2023-02-09 ·

Techniques of automated interoperation tracking in computing systems are disclosed herein. One example technique includes tokenizing a first event log from a first software component and a second event log from the second software component by calculating frequencies of appearance corresponding to strings in the first and second event logs and selecting, as tokens, a first subset of the strings in the first event log and a second subset of the strings in the second event log individually having calculated frequencies of appearance above a preset frequency threshold. The example technique can also include generating an overall event log for a task executed by both the first and second software components by matching one of the strings in the first subset to another of the strings in the second subset.

IDENTIFYING AND TRANSFORMING TEXT DIFFICULT TO UNDERSTAND BY USER

A computer-implemented method, system and computer program product for improving understandability of text by a user. A final word vector for each word in a sentence of a document is computed, such as by averaging a first word vector and a second word vector for that word. Furthermore, elements of a user portrait are vectorized. A distance is then computed between a vector for each word in the sentence and a vectorized element in the user’s portrait which is summed to form an evaluation result for the element. An evaluation result is also formed for every other element in the user’s portrait by performing such a computation step. A “final evaluation result” is then generated corresponding to the evaluation results for every element in the user’s portrait. The document is then transformed in response to the final evaluation result indicating a lack of understanding of the sentence by the user.

Preparing documents for coreference analysis

Unstructured text is identified as larger than a threshold size. Named-entity recognition analysis is executed on the unstructured text. One or more anchor entities of the unstructured text are determined that each occur more than a threshold amount of times within the unstructured text. Two or more instances of the one or more anchor entities that are separated by at least a threshold amount of text of the unstructured text are identified. The unstructured text is partitioned into at least three sections. The unstructured text is partitioned at respective natural language demarcation points associated with each of the two or more instances such that each of the at least three sections is smaller than the threshold size. Separate coreference analyses are performed in parallel on each of the at least three sections.

Phrase indexing

Intent-resolution using a phrase index may include obtaining data expressing a usage intent, the data expressing the usage intent including an unresolved data portion, identifying a phrase fragment based on the data expressing the usage intent and a defined phrase pattern, the phrase fragment including the unresolved data portion, identifying, by a processor, an indexed phrase as a candidate phrase by searching a phrase index based on the phrase fragment, wherein the candidate phrase at least partially matches the phrase fragment in accordance with the defined phrase pattern, and outputting the candidate phrase for presentation to a user as a candidate for resolving the unresolved portion.

Descriptor uniqueness for entity clustering

A mechanism is provided in a data processing system to implement a cognitive natural language processing (NLP) system with descriptor uniqueness identification to support named entity mention clustering. The mechanism annotates a set of documents from a corpus of documents for entity types and mentions, collects descriptor usages from all documents in the corpus of documents, analyzes the descriptor usages to classify the descriptors as base terms or modifier terms, generates compatibility scores for the descriptors, and performs entity merging of entity clusters based on the compatibility scores.

Systems and methods for automatically generating content summaries for topics

A method of automatically generating content summaries for topics includes receiving a taxonomy for a concept and a text corpus. The method further includes generating an annotated dataset having term annotations corresponding to the concept from the text corpus based on the taxonomy, parsing the annotated dataset into a custom generated document object having a structured layout, determining features for the term annotations, and extracting snippets from the custom generated document object, where each of the snippets corresponds to a section of the custom generated document object. The method further includes scoring the snippets based on the features such that each of the snippets corresponds to a score, filtering one or more snippets from the snippets when one or more snippet filtering conditions is met, ranking the snippets into an ordered list for the concept based on the score, and providing, to a user computing device, the ordered list.

Generating search commands based on cell selection within data tables

A search interface is displayed in a table format that includes one or more columns, each column including data items of an event attribute, the data items being of a set of events, and a plurality of rows forming cells with the one or more columns, each cell including one or more of the data items of the event attribute of a corresponding column. Based on a user selecting one or more of the cells, a list of options if displayed corresponding to the selection, and one or more commands are added to a search query that corresponds to the set of events, the one or more commands being based on at least an option that is selected from the list of options and the event attribute for each of the one or more of the data items of each of the selected one or more cells.

Generating search commands based on cell selection within data tables

A search interface is displayed in a table format that includes one or more columns, each column including data items of an event attribute, the data items being of a set of events, and a plurality of rows forming cells with the one or more columns, each cell including one or more of the data items of the event attribute of a corresponding column. Based on a user selecting one or more of the cells, a list of options if displayed corresponding to the selection, and one or more commands are added to a search query that corresponds to the set of events, the one or more commands being based on at least an option that is selected from the list of options and the event attribute for each of the one or more of the data items of each of the selected one or more cells.

Decision making analysis engine

The automated collection of online data is enhanced by generating and saving a context between a document and a related named entity, as well as a credibility level of the online source. The context, credibility level, and quality and quantity of collected data are used to enhance the use of the collected data in automated decision-making. Both the quality and the quantity may be continuously updated and honed through machine learning. Three new algorithms—DUPES, CORRAL, and ONTO—have been introduced to support the above, improving current state-of-the-art engineering practice by sharpening the strategy for named-entity searching, for ensuring that topic modeling produces relevant topic tags, and for handling sentiment which may be NEGATIVE, POSITIVE, and NEUTRAL (which includes MISSING and INCONCLUSIVE).