G06F16/316

Methods and systems for a compliance framework database schema

Generating a compliance framework. The compliance framework facilitates an organization's compliance with multiple authority documents by providing efficient methodologies and refinements to existing technologies, such as providing hierarchical fidelity to the original authority document; separating auditable citations from their context (e.g., prepositions and or informational citations); asset focused citations; SNED and Live values, among others.

Headstart for data scientists

A method, system, and apparatus are provided for recommending machine learning (ML) project resources for completing a user project by generating indexed project metadata for a plurality of ML projects, generating search metadata for a search request for ML project resources to develop an ML project, and then evaluating the search metadata against the indexed project metadata for each ML project to form a relevancy assessment which is used to order trained models from the ML projects and to display one or more recommended ML project resources comprising one or more of the plurality of trained models having a relevancy assessment exceeding a relevancy threshold.

System and Method for Joining Datasets

A computer-implemented method comprising: receiving, with a computer, first and second datasets; performing, with the computer, column discovery on the first and second datasets using a first trained machine-learning model to produce a column map that indexes one or more columns in the first dataset to one or more columns in the second dataset; performing, with the computer, row discovery on the first and second datasets using a second trained machine-learning model, a trained approximate nearest neighbor index, and the column discovery to produce a row map that indexes one or more rows in the first dataset to one or more rows in the second dataset; combining, with the computer, the first and second datasets using the column map and the row map to form a combined dataset; and performing one or more actions with the combined dataset.

METHODS, SYSTEMS, AND COMPUTER-READABLE MEDIA FOR SEMANTICALLY ENRICHING CONTENT AND FOR SEMANTIC NAVIGATION

Methods, systems and computer-readable media enable various techniques related to semantic navigation. One aspect is a technique for displaying semantically derived facets in the search engine interface. Each of the facets comprises faceted search results. Each of the faceted search results is displayed in association with user interface elements for including or excluding the faceted search result as additional search terms to subsequently refine the search query. Another aspect automatically infers new metadata from the content and from existing metadata and then automatically annotates the content with the new metadata to improve recall and navigation. Another aspect identifies semantic annotations by determining semantic connections between the semantic annotations and then dynamically generating a topic page based on the semantic connections.

Training and applying structured data extraction models

A computer system for extracting structured data from unstructured or semi-structured text in an electronic document, the system comprising: a graphical user interface configured to present to a user a graphical view of a document for use in training multiple data extraction models for the document, each data extraction model associated with a user defined question; a user input component configured to enable the user to highlight portions of the document; the system configured to present in association with each highlighted portion an interactive user entry object which presents a menu of question types to a user in a manner to enable the user to select one of the question types, and a field for receiving from the user a question identifier in the form of human readable text, wherein the question identifier and question type selected by the user are used for selecting a data extraction model, and wherein the highlighted portion of the document associated with the question identifier is used to train the selected data extraction model.

Reducing matching documents for a search query

The technology described herein provides for identifying matching documents for a search query using a bit vector search index. When a search query is received, a term is identified from the search index, and a number of bit vectors corresponding to the term are identified. Each bit vector comprises an array of bits in which at least one bit in each bit vector indicates that a corresponding document includes the term. Each bit vector also includes other bits indicating other documents include other terms. A determination is made that an unacceptable number of possible matching documents is likely to be returned. In response to the determination, a strengthening row bit vector is selected to reduce the number of possible matching documents. The identified bit vectors and the selected strengthening row are intersected to identify matching documents that contain the term.

Detection of a topic
11755831 · 2023-09-12 · ·

The present invention relates to a method for performing a detection of a topic of a message introduced in a real-time customer service messaging platform. In the method a message comprising at least one word from which the topic is definable is received; a topic from the received message is extracted; it is inquired from a database if the topic is determinable from a number of messages received chronically earlier than the received message; and an indication is generated to an operator of the real-time customer service messaging platform in accordance with a detection result obtained through an inquiry to the database. Some aspects of the present invention relate to a network node, to a computer program product and to a system.

System and method for pre-indexing filtering and correction of documents in search systems

Embodiments as disclosed herein provide a search system with an pre-indexing filter that provides both a sophisticated and contextually tailored approach to filtering documents and a corrector that is adapted to alter a document that has been designated to be filtered out from the indexing process and determine if the altered document should be indexed. The alteration of the document may be tied to the attributes, rules or thresholds used to initially filter the document from the indexing process. The filtering criteria can thus be tailored to a specific context such that both the initial filtering and the alteration process may be better suited for application in that context.

Traffic-aware route decoding using a probabilistic encoding data structure
11755553 · 2023-09-12 · ·

A mobile apparatus receives a route response comprising an encoded route and one or more delay encoding data structures. The delay encoding data structures are probabilistic data structures configured to not provide false negatives. The mobile apparatus determines a decoded route based on the encoded route and a mobile version of a digital map; determines an expected traffic delay for at least one adjacent traversable map element (TME) of the decoded route based on the one or more delay encoding data structures; and performs one or more navigation functions based at least on the expected traffic delay for the at least one adjacent segment of the decoded route. An adjacent TME is a TME of the digital map that intersects the decoded route and is not a TME of the decoded route.

System and method of context-based predictive content tagging for encrypted data

This disclosure relates to systems, methods, and computer readable media for performing multi-format, multi-protocol message threading in a way that is most beneficial for the individual user. Users desire a system that will provide for ease of message threading by “stitching” together related communications in a manner that is seamless from the user's perspective. Such stitching together of communications across multiple formats and protocols may occur, e.g., by: 1) direct user action in a centralized communications application (e.g., by a user clicking ‘Reply’ on a particular message); 2) using semantic matching (or other search-style message association techniques); 3) element-matching (e.g., matching on subject lines or senders/recipients/similar quoted text, etc.); and 4) “state-matching” (e.g., associating messages if they are specifically tagged as being related to another message, sender, etc. by a third-party service, e.g., a webmail provider or Instant Messaging (IM) service).