G06F16/2458

Systems and methods for caching of managed content in a distributed environment using a multi-tiered architecture including off-line access to cached content

Embodiments as disclosed provide a distributed caching solution that improves the performance and functionality of a content management platform for sites that are physically or logically remote from the primary site of the content management platform. In particular, according to embodiments, a remote cache server may be associated with a remote site to store local copies of documents that are managed by the primary content management platform. Periodically, a portion of the remote site's cache may be synchronized with the content management platform's primary site using an extensible architecture to ensure that content at the remote cache server is current.

Efficient data relationship mining using machine learning
11556838 · 2023-01-17

Techniques and solutions are described for determining association rules using a machine learning technique on a subset of data to which the association rules might apply, and from which they can be determined. In particular, association rules are determined by tracking changes to attribute values of data objects having a type. The changed attribute value can be used as a consequent in an association rule. Values of other attributes of data objects having the changed attribute value can be used as antecedents in association rules. Values used in antecedents can be constrained, such as by limiting values to those associated with scope attributes or values satisfying a threshold occurrence frequency. In some cases, determined association rules can be automatically implemented, such as to process input or stored data for data objects of the type.
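The antecedent/consequent construction described above can be illustrated with a minimal sketch. It makes several assumptions that are not in the abstract: data objects are plain dictionaries, a simple frequency count stands in for the machine learning technique, and the names `mine_rules` and `min_support` are hypothetical.

```python
from collections import Counter


def mine_rules(objects, changed_attr, changed_value, min_support=2):
    """Return (antecedent, consequent) pairs for one changed attribute value.

    Hypothetical helper: a frequency count is used here in place of the
    machine learning technique mentioned in the abstract.
    """
    # Keep only the data objects that carry the changed attribute value.
    matching = [o for o in objects if o.get(changed_attr) == changed_value]
    # Count how often each other attribute value co-occurs with it.
    counts = Counter(
        (attr, value)
        for obj in matching
        for attr, value in obj.items()
        if attr != changed_attr
    )
    consequent = (changed_attr, changed_value)
    # Constrain antecedents to values that meet the occurrence threshold.
    return [(ante, consequent) for ante, n in counts.items() if n >= min_support]


objects = [
    {"status": "blocked", "region": "EU", "type": "A"},
    {"status": "blocked", "region": "EU", "type": "B"},
    {"status": "open", "region": "US", "type": "A"},
]
rules = mine_rules(objects, "status", "blocked")
```

Here only the `region` value occurs often enough among the changed objects to qualify as an antecedent; the two `type` values each occur once and are filtered out by the threshold.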

Method and system for using recycling to improve blockchain mining

A method for awarding blockchain mining fees based on recycling efforts includes: receiving recycling data for each of a plurality of mining systems including an amount of recycled materials associated with the respective mining system; receiving validation data for each mining system from third party entities including, for each mining system, a confirmation of the amount of recycled materials for the respective recycling data; selecting one of the mining systems, which is weighted based on the amount of recycled materials for the respective mining system compared to a total amount of recycled materials for all mining systems; and receiving a new block generated by the selected mining system including a block header and a plurality of blockchain data values including a blockchain data value corresponding to a blockchain transaction for payment of mining fees for the new block to a wallet associated with the selected mining system.
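The weighted selection step can be sketched as follows. This is an illustration only, assuming the validated recycling amounts are already available as a dictionary; the function name `select_mining_system` and the use of Python's `random.choices` are choices made for this sketch, not details from the abstract.

```python
import random


def select_mining_system(recycled, rng=random):
    """Pick one mining system with probability proportional to its share
    of the total validated recycled material (hypothetical helper)."""
    systems = list(recycled)
    weights = [recycled[s] for s in systems]
    if sum(weights) <= 0:
        raise ValueError("no validated recycled material reported")
    # random.choices draws with the given relative weights.
    return rng.choices(systems, weights=weights, k=1)[0]


# A system that recycled nothing has zero weight and is never selected.
winner = select_mining_system({"miner-a": 0, "miner-b": 5}, random.Random(0))
```

A system that recycled twice as much as another is twice as likely to be selected, which matches the proportional weighting described above.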

Insight generation from a tabular dataset

Systems, methods, and software for processing a tabular dataset. In one embodiment, a system extracts raw association rules from the tabular dataset. Each of the raw association rules comprises a relationship between a set of antecedents and a single consequent, and corresponds to one or more transactions. The system determines potential rule merge groups of the raw association rules based on the antecedents, and determines one or more actual rule merge groups of the raw association rules in each potential rule merge group based on the transactions. The system combines the raw association rules in an actual rule merge group to generate a merged association rule. The system then generates a set of insights based on one or more merged association rules, and performs an operation based on the set of insights.

Automatic data-screening framework and preprocessing pipeline to support ML-based prognostic surveillance

The disclosed embodiments relate to a system that automatically selects a prognostic-surveillance technique to analyze a set of time-series signals. During operation, the system receives the set of time-series signals obtained from sensors in a monitored system. Next, the system determines whether the set of time-series signals is univariate or multivariate. When the set of time-series signals is multivariate, the system determines if there exist cross-correlations among signals in the set of time-series signals. If so, the system performs subsequent prognostic-surveillance operations by analyzing the cross-correlations. Otherwise, if the set of time-series signals is univariate, the system performs subsequent prognostic-surveillance operations by analyzing serial correlations for the univariate time-series signal.
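The branching logic above can be sketched in a few lines. The sketch assumes signals are equal-length lists of numbers, uses Pearson correlation with a hypothetical cutoff `min_abs_corr` as the test for cross-correlation, and falls back to serial-correlation analysis for multivariate signals without cross-correlations; the abstract does not specify that last case.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return 0.0 if sx == 0 or sy == 0 else cov / (sx * sy)


def choose_analysis(signals, min_abs_corr=0.5):
    """Return which analysis to run; the cutoff value is an assumption."""
    if len(signals) == 1:
        return "serial-correlation"  # univariate case
    correlated = any(
        abs(pearson(a, b)) >= min_abs_corr for a, b in combinations(signals, 2)
    )
    # Assumption: uncorrelated multivariate signals are analysed serially.
    return "cross-correlation" if correlated else "serial-correlation"
```

For example, `choose_analysis([[1, 2, 3, 4], [2, 4, 6, 8]])` selects the cross-correlation path because the two signals are perfectly correlated.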

Automated database updating and curation

Systems and methods for retrieval of information from read-only databases that hold taxonomic-related and sequence-related data. A method may include receiving organism names from a taxonomy database and detecting new organism names. The method may also include retrieving hierarchical data and assigning the new organism names to buckets based on the hierarchical data. The method may further include receiving sequence data elements from a nucleotide database, identifying particular buckets to correspond to a screener data set, querying organism names assigned to the particular buckets with names of reference sequences of the sequence data elements, generating a mapping between the sequence data elements and organism names returned as a result of the queries, and storing the mapping.

Multi-stage adaptable continuous learning / feedback system for machine learning models

Data is received that specifies a term generated by user input in a graphical user interface. Thereafter, the term is looked up in a dictionary in which there are multiple classes for terms. The term can be classified based on a first class having a top ranked effective count for the term within the dictionary when a ratio of the first class relative to a second class having a second ranked effective count for the term in the dictionary is above a pre-defined threshold. In addition, the term is classified using a machine learning model when the ratio of the first class relative to the second class is below the pre-defined threshold. Data can be provided which characterizes the classifying. Related apparatus, systems, techniques and articles are also described.
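The two-stage decision above can be sketched as follows. The dictionary layout (term mapped to per-class effective counts), the `model_predict` callable, and the default threshold are all assumptions made for illustration. Routing unknown terms and ratios exactly equal to the threshold to the model is also an assumption, since the abstract only covers the above and below cases.

```python
def classify_term(term, dictionary, model_predict, threshold=2.0):
    """Return (label, source) where source is 'dictionary' or 'model'."""
    counts = dictionary.get(term, {})
    ranked = sorted(counts.items(), key=lambda kv: kv[1], reverse=True)
    if not ranked:
        # Assumption: terms missing from the dictionary go to the model.
        return model_predict(term), "model"
    if len(ranked) == 1:
        return ranked[0][0], "dictionary"
    (top_cls, top_n), (_, second_n) = ranked[0], ranked[1]
    # Trust the dictionary only when the top class clearly dominates.
    if second_n == 0 or top_n / second_n > threshold:
        return top_cls, "dictionary"
    return model_predict(term), "model"


dictionary = {
    "apple": {"fruit": 9, "company": 3},    # ratio 3.0, above the threshold
    "jaguar": {"animal": 5, "car": 4},      # ratio 1.25, below the threshold
}
stub_model = lambda term: "unknown"  # stand-in for a trained model
```

With these inputs, "apple" is resolved from the dictionary, while "jaguar" is ambiguous enough to be passed to the model.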

Low latency query processing and data retrieval at the edge

A datastore engine at an edge location of a content delivery network (CDN) may perform low-latency query processing and data retrieval for multiple types of databases at one or more origin servers. When a client sends a query to the edge location, the datastore engine translates the query from a back-end database format into a native format of the local edge datastore. If the requested data is not in the local edge datastore, then the datastore engine retrieves the data from the back-end table and inserts the data into the local edge datastore. By using multiple queries over time to re-construct data from the back-end database tables at the edge, the datastore engine may provide low-latency access to data from the back-end database tables (avoiding the need to retrieve data from the back-end tables to serve subsequent queries).
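The miss-then-populate behaviour described above resembles a read-through cache, which can be sketched as follows. The class `EdgeDatastore`, the `(table, key)` lookup shape, and the `fetch_from_origin` callable are hypothetical stand-ins; query translation is reduced here to building a local key.

```python
class EdgeDatastore:
    """Minimal read-through store: serve locally, fetch from origin on a miss."""

    def __init__(self, fetch_from_origin):
        self._local = {}                  # the local edge datastore
        self._fetch = fetch_from_origin   # hypothetical origin lookup
        self.origin_calls = 0

    def get(self, table, key):
        # Stand-in for translating the back-end query into the local format.
        local_key = (table, key)
        if local_key not in self._local:
            # Miss: retrieve from the back-end table and insert locally.
            self.origin_calls += 1
            self._local[local_key] = self._fetch(table, key)
        return self._local[local_key]


origin = {("users", 1): "ada"}
edge = EdgeDatastore(lambda table, key: origin[(table, key)])
first = edge.get("users", 1)   # goes to the origin
second = edge.get("users", 1)  # served from the edge
```

Only the first lookup reaches the origin; repeated queries gradually rebuild the relevant back-end rows at the edge.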

Data-determinant query terms

Systems and methods are disclosed for flexibly applying a query term to heterogeneous data. A query system can receive a query that includes a data-determinant query term. As the system executes the query, it can generate interim search results. As the query system processes the interim search results based on the query, it can apply the data-determinant query term to records of the interim search results based on the structure of the records.

Generating search commands based on cell selection within data tables

A search interface is displayed in a table format that includes one or more columns, each column including data items of an event attribute, the data items being of a set of events, and a plurality of rows forming cells with the one or more columns, each cell including one or more of the data items of the event attribute of a corresponding column. Based on a user selecting one or more of the cells, a list of options is displayed corresponding to the selection, and one or more commands are added to a search query that corresponds to the set of events, the one or more commands being based on at least an option that is selected from the list of options and the event attribute for each of the one or more of the data items of each of the selected one or more cells.