G06F16/3335

Generating search results based on non-linguistic tokens

Systems and methods for generating search results based on non-linguistic tokens are disclosed. In embodiments, a computer-implemented method comprises: mapping, by a computing device, a non-linguistic token to documents during indexing based on associations in a token database between the non-linguistic token and variations of a multi-word term; receiving, by the computing device, the multi-word term in a search query; determining, by the computing device, the non-linguistic token associated with the multi-word term using the token database; and generating, by the computing device, search results based on the multi-word term and the non-linguistic token.

CONCEPTUAL, CONTEXTUAL, AND SEMANTIC-BASED RESEARCH SYSTEM AND METHOD
20210109958 · 2021-04-15 ·

Systems are described in the field of machine learning such as natural language processing for use in researching and searching a corpus of documents in various topical areas such as physical and social sciences. The systems may utilize training, testing, and deployment of models representing a defined space within the corpus. A network of computers and user input devices may be used for receiving research queries via human-computer interface devices and application programming interfaces. Queries may be processed and used as input to the machine learning models. Outputs from the models may include ranking of results reflecting the queries.

Data driven parser selection for parsing event logs to detect security threats in an enterprise system

A method includes obtaining an incoming event log, splitting the incoming event log into a set of tokens, and identifying a subset of the tokens as anchor tokens. The method also includes utilizing an ordered list of the anchor tokens to traverse through a set of anchor token trees and, responsive to identifying a path from (i) a root node of a given one of the anchor token trees to (ii) a given leaf node of the given anchor token tree corresponding to the ordered list of the anchor tokens, selecting a given parser associated with the given leaf node. The method further includes extracting data from the incoming event log utilizing the given parser, detecting one or more security threats affecting at least one asset in an enterprise system based on the extracted data, and applying at least one remediation action to mitigate the detected security threats.

Intelligent term and query generation and feedback based selective execution
10970321 · 2021-04-06 ·

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for automatically generating search terms and queries and selectively executing those queries based on feedback from previous queries. In one aspect, a method using artificial intelligence, natural language processing, and machine learning techniques, includes actions of receiving an input data set and first generating a set of auto-generated terms by using a predetermined and dynamic data filter, a basic utility processor, a multi-word combination generator, and a term tally and sorting processor. Additional actions include generating a primary query for every multi-word combination and generating focus queries for every primary query. Further actions include selectively executing queries in succession on a document server and database only if necessary, based on feedback from previous queries, and aggregating all query search results.

APPARATUS, SYSTEM AND METHOD OF USING TEXT RECOGNITION TO SEARCH FOR CITED AUTHORITIES
20210097095 · 2021-04-01 ·

An apparatus, method and system for electronically providing an underlying cited document from an image. The apparatus, system and method include: an input capable of receiving an image from a camera of a mobile device; an automated text-recognition feature capable of recognizing text in the image; an extractor capable of extracting citations from the recognized text; a comparative database capable of comparing the extracted citations to a plurality of prospective citation types in order to assess a citation type of the extracted citations; based on the assessed citation type, a citation recognizer to recognize the extracted citation; and a user interface capable of presenting the underlying cited document corresponded to the recognized citation.

Optimization technique for database application

Disclosed is a computer-readable medium including a computer program including encoded commands. The computer program is configured to cause one or more processors to perform operations for optimizing a database application when the computer program is executed by the one or more processors of a computer system and the operations include: operation for receiving a query including a bind parameter from an application source; operation for obtaining information related to the bind parameter included in the received query; and operation for rewriting the received query by reflecting the obtained information related to the bind parameter within the received query as hint information.

HIERARCHICAL SEARCH FOR IMPROVED SEARCH RELEVANCE
20210073328 · 2021-03-11 ·

A computer-implemented method is provided that includes receiving a search query and, responsive to the search query, providing one or more textual comments relevant to the search query. This includes tokenizing the search query and calculating a set of query term frequency metrics. A set of records relevant to the search query is then selected, from a persistent storage, based on determined similarities between the query term frequency metrics and frequency metrics determined for the records in the persistent storage. Textual comments within the selected records are associated with usefulness metrics. The textual comments relevant to the search query are selected by selecting those textual comments within the selected records that are associated with usefulness metrics that are within a pre-determined range, e.g., an inter-quartile range for a population of usefulness metrics.

System and method for detecting security risks in a computer system

A system and method are provided for identifying security risks in a computer system. The system includes an event stream generator configured to collect system event data from the computer system. The system further includes a query device configured to receive query requests that specify parameters of a query. Each query request includes at least one anomaly model. The query request and the anomaly model are included in a first syntax in which a system event is expressed as {subject-operation-object}. The system further includes a detection device configured to receive at least one query request from the query device and continuously compare the system event data to the anomaly models of the query requests to detect a system event that poses a security risk. The system also includes a reporting device configured to generate an alert for system events that pose a security risk detected by the detection device.

SYSTEM AND SEMI-SUPERVISED METHODOLOGY FOR PERFORMING MACHINE DRIVEN ANALYSIS AND DETERMINATION OF INTEGRITY DUE DILIGENCE RISK ASSOCIATED WITH THIRD PARTY ENTITIES AND ASSOCIATED INDIVIDUALS AND STAKEHOLDERS
20210026835 · 2021-01-28 ·

Systems and methods for performing machine-driven analysis and determination of integrity due diligence risk associated with third-party entities and individuals associated with third-party entities are disclosed.

Personalized interactive semantic parsing using a graph-to-sequence model

A semantic parsing method using a graph-to-sequence model, system, and computer program product include generating a syntactic graph for a sentence, generating node embeddings for each node based on other nodes the each node is connected to in the syntactic graph, generating a graph embedding over the node embeddings, performing attention-based recurrent neural network (RNN) decoding of the graph embedding and the node embeddings, and providing a logical translation of the sentence based on the decoding.