Patent classifications
G06F16/3346
Metadata Aggregation Using a Trained Entity Matching Predictive Model
A metadata aggregation system includes a computing platform having a hardware processor and a memory storing a software code including a trained entity matching predictive model trained using training data obtained from a reference database. The hardware processor executes the software code to obtain metadata inputs from multiple sources, conform the metadata inputs to a common format, match, using the trained entity matching predictive model, at least some of the conformed metadata inputs to the same entity, and determine, using the trained entity matching predictive model, a confidence score for each match. The software code further sends a request to one or more human editor(s) for confirmation of each match having a confidence score greater than a first threshold and less than a second threshold, and updates the reference database, in response to receiving a confirmation that at least one match is a confirmed match, to include the confirmed match.
SCALABLE MINING OF TRENDING INSIGHTS FROM TEXT
A system and method for identifying trending topics in a document corpus are provided. First, multiple topics are identified, some of which topics may be filtered or removed based on co-occurrence. Then, for each remaining topic, a frequency of the topic in the document corpus is determined, one or more frequencies of the topic in one or more other document corpora are determined, a trending score of the topic is generated based on the determined frequencies. Lastly, the remaining topics are ranked based on the generated trending scores.
Evaluating evidential links based on corroboration for intelligence analysis
Mechanisms for evaluating an evidential statement in a corpus of evidence are provided. A first evidential statement for which corroboration is sought is received and a corpus of evidence data is processed to determine a measure of corroboration of the first evidential statement by other evidence data in the corpus of evidence data. An indication of trustworthiness of the first evidential statement is generated based on the measure of corroboration of the first evidential statement by the other evidence data in the corpus of evidence data. A representation of the indication of the trustworthiness of the first evidential statement is output in association with the first evidential statement.
REAL-TIME GUIDANCE FOR CONTENT COLLECTION
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for providing real-time guidance for content collection. One of the methods includes receiving user input from a user through a user interface presentation, determining, from the received user input using a first model, one or more provided data elements occurring in the user input, determining, from the one or more provided data elements occurring in the user input using a second model, one or more intended tasks, determining, for each intended task of the one or more intended tasks using a third model, one or more suggested data elements, ranking the one or more suggested data elements, and updating the user interface presentation with a user interface element suggesting that the user provide the one or more needed data elements.
Question pruning for evaluating a hypothetical ontological link
Mechanisms for generating a set of questions to evaluate a link between concept entities are provided. A hypothetical link between at least two information concept entities is generated. A set of questions corresponding to the hypothetical link is retrieved and pruned into a subset of questions based on at least one of characteristics of the hypothetical link or characteristics of the at least two information concept entities. The pruned set of questions is processed based on a corpus of evidence to generate a measure of support for or against the hypothetical link being an actual link between the at least two information concept entities. A validity indication for the hypothetical link indicating whether or not the hypothetical link is an actual link between the at least two information concept entities is output.
Predictive Text Input Method and Device
A predictive text input method and device, wherein said predictive text input method includes: detecting an input by a user; acquiring a prediction basis according to a historical text already input and a current input position, wherein said prediction basis is a preset word length of an input text based on the current input position; searching in a database according to said prediction basis to obtain a prediction result, wherein said prediction result includes at least two stages of prediction candidate words in subsequent based on said prediction basis. This disclosure can provide an efficient prediction with a prediction result which is more corresponding with the users' expectations, so as to provide a more fluent predictive text input experience.
Predicting a command in a command line interface
An apparatus for predicting a command in a command line interface includes a template command module, a parameter derivation module, and a parameter substitution module. The template command module is configured to determine a template command based on a command line history. The template command includes a command name and a parameter and the command line history includes two or more previously entered commands. The parameter derivation module is configured to determine a parameter derivation rule for deriving the parameter in the template command based on the command line history. The parameter substitution module is configured to substitute a substitute parameter for the parameter of the template command according to the parameter derivation rule.
Methods and devices for collection and heuristic analysis of large-scale biographical information
A computer system crawls a plurality of web pages; parses the crawled information into state events and determines causality between any two of the state events; and stores the state events and the causality in a database. The system receives a first request from a user to determine a path to a target state. The system obtains a current state of the user. The system determines one or more paths from the current state of the user to the target state based on the current state of the user and the state events and the causality, including identifying one or more recommended state events, each recommended state event having a causality value for the target state that satisfies first preselected causality criteria; and provides at least one path from the current state of the user to the target state.
COMPUTER-IMPLEMENTED METHOD, SEARCH PROCESSING DEVICE, AND NON-TRANSITORY COMPUTER-READABLE STORAGE MEDIUM
A computer-implemented method for creating and searching a database, the method including, storing inquiry data within a database, dividing the inquiry data into sentences to generate sentence data, segmenting the sentence data to obtain word string data, identifying a plurality of content words within with the word string data, calculating a first probability for each of the plurality of content words, the first probability indicating a probability of a first word being adjacent to a second word, receiving an instruction including at least one word string, selecting a first extended keyword having a highest probability of being adjacent to the word string, extracting a second extended keyword having a lower probability than the first content word of being adjacent to the word string, searching the database based on a word string, first extended keyword and second extended keyword.
SEARCH SYSTEM AND CORRESPONDING METHOD
There is provided a search system comprising a statistical model trained on text associated with a piece of content. The text associated with the piece of content is drawn from a plurality of different data sources. The system is configured to receive text input and generate using the statistical model an estimate of the likelihood that the piece of content is relevant given the text input. A corresponding method is also provided.