Patent classifications
G06F16/2246
COMPLEX CHEMICAL SUBSTRUCTURE SEARCH QUERY BUILDING AND EXECUTION
Systems and methods for enabling construction of complex Boolean chemical substructure queries in a structured graphical user interface are provided. The chemical substructures (molecules) may be represented graphically in standard molecular notation, and may be arranged horizontally and vertically on the interface, along with Boolean logical operators. Boolean logical operators of a first type may logically associate molecules arranged in horizontal fashion to form row queries, whereas Boolean logical operators of a different, second type may logically associate the row queries to form a composite query to be applied to a database of molecules. The operators of the first type may comprise disjunctive operators, whereas the operators of the second type may comprise conjunctive operators.
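The row-and-column query structure in this abstract can be sketched as a conjunction of disjunctions. This is a minimal illustration, not the patented implementation: the `Molecule` type, `contains`, `matches_composite`, and `run_query` are hypothetical names, and substructure matching is replaced by a simple set-membership test where a real system would call a chemistry toolkit.

```python
from typing import Dict, FrozenSet, List

# Assumption: a "molecule" is modeled as a set of fragment labels. A real
# system would run a graph-based substructure match instead.
Molecule = FrozenSet[str]


def contains(molecule: Molecule, substructure: str) -> bool:
    """Stand-in for a substructure match."""
    return substructure in molecule


def matches_composite(molecule: Molecule, rows: List[List[str]]) -> bool:
    """Each row is OR-ed internally; the rows are then AND-ed together."""
    return all(any(contains(molecule, s) for s in row) for row in rows)


def run_query(database: Dict[str, Molecule], rows: List[List[str]]) -> List[str]:
    """Apply the composite query to every molecule in the database."""
    return [name for name, mol in database.items() if matches_composite(mol, rows)]


database = {
    "aspirin": frozenset({"benzene", "ester", "carboxylic_acid"}),
    "phenol": frozenset({"benzene", "hydroxyl"}),
    "ethanol": frozenset({"hydroxyl"}),
}

# Row 1: benzene OR pyridine; row 2: hydroxyl OR carboxylic_acid.
query = [["benzene", "pyridine"], ["hydroxyl", "carboxylic_acid"]]
hits = run_query(database, query)
```

Here `hits` contains the molecules that satisfy at least one substructure in every row, which mirrors the disjunctive row queries and the conjunctive composite query described above.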
Using sparse Merkle trees for smart synchronization of S3
One example method, which may be performed in connection with an object store, includes receiving a key of a key-value pair, correlating the key to a location in a base of a Merkle tree, inserting the key at the location, hashing the value associated with the key to produce a data hash, and inserting the data hash in the Merkle tree. The Merkle tree may then be checked for consistency, and synchronized with another Merkle tree. The Merkle tree may be of a fixed size, and insertion of the key in the Merkle tree does not change the location of any keys existing in the Merkle tree prior to insertion of the new key.
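The insertion and consistency check described in this abstract might look roughly like the sketch below. It is written under several assumptions that are not in the source: SHA-256 as the hash, a depth of 64, a slot chosen by hashing the key, and the class and method names (`SparseMerkleTree`, `slot`, `insert`, `root`). Two keys that map to the same slot would overwrite each other here, which a production design would need to handle.

```python
import hashlib

DEPTH = 64  # assumed fixed tree depth: 2**64 leaf slots in the base


def h(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()


class SparseMerkleTree:
    def __init__(self, depth: int = DEPTH):
        self.depth = depth
        self.leaves = {}  # slot index -> data hash; empty slots are implicit
        # Precomputed hashes of all-empty subtrees, one per level.
        self.defaults = [h(b"")]
        for _ in range(depth):
            self.defaults.append(h(self.defaults[-1] + self.defaults[-1]))

    def slot(self, key: str) -> int:
        """Correlate a key to a fixed location in the base of the tree."""
        return int.from_bytes(h(key.encode()), "big") % (1 << self.depth)

    def insert(self, key: str, value: bytes) -> None:
        """Hash the value and store the data hash at the key's slot."""
        self.leaves[self.slot(key)] = h(value)

    def root(self) -> bytes:
        """Recompute the root, touching only the non-empty branches."""
        level = dict(self.leaves)
        for d in range(self.depth):
            parents = {}
            for idx in {i >> 1 for i in level}:
                left = level.get(2 * idx, self.defaults[d])
                right = level.get(2 * idx + 1, self.defaults[d])
                parents[idx] = h(left + right)
            level = parents
        return level.get(0, self.defaults[self.depth])


local = SparseMerkleTree()
remote = SparseMerkleTree()
local.insert("photos/cat.jpg", b"v1")
local.insert("docs/readme.txt", b"hello")
remote.insert("docs/readme.txt", b"hello")
remote.insert("photos/cat.jpg", b"v1")
in_sync = local.root() == remote.root()

remote.insert("photos/cat.jpg", b"v2")
diverged = local.root() != remote.root()
```

Because each key's slot depends only on the key, inserting a new key never moves an existing one, and two stores holding the same key-value pairs produce the same root regardless of insertion order.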
System and method for implementing a contract data management module
A system and method for automatic generation of a data tree structure are disclosed. A database stores contract metadata associated with a plurality of contracts. The metadata includes relationship information data describing parent and/or child relationships between a particular contract and other contracts among the plurality of contracts. A processor, operatively connected to the database via a communication network, accesses the database to retrieve the contract metadata including the relationship information data. The processor also implements a search engine; stores the retrieved contract metadata including the relationship information data in the search engine to create a linkage between data points; and automatically generates, based on the created linkage between the data points, both an application programming interface (API) and a data tree structure that displays inter-dependency between two or more sets of contracts among the plurality of contracts.
PROCESSING INGESTED DATA TO IDENTIFY ANOMALIES
Systems and methods are described for processing ingested data in an asynchronous manner as the data is being ingested to detect potential anomalies. For example, one or more streaming data processors can convert data as the data is ingested into a comparable data structure, determine whether the comparable data structure should be assigned to an existing data pattern or a new data pattern, and optionally update a characteristic of the data pattern to which the comparable data structure is assigned. The streaming data processor(s) can perform these operations automatically in real-time or in periodic batches. Once one or more comparable data structures have been assigned to one or more data patterns, the streaming data processor(s) can analyze the comparable data structures assigned to a particular data pattern to determine whether any of the comparable data structures appear to be anomalous.
ONTOLOGY-BASED GRAPH QUERY OPTIMIZATION
Examples of the present disclosure describe systems and methods for ontology-based graph query optimization. In an example, ontology data relating to a graph or isolated collection may be collected. The ontology data may comprise uniqueness and topology information and may be used to reformulate a query in order to yield a query that is more performant than the original query when retrieving target information from a graph. In an example, reformulating a query may comprise reordering one or more parameters of the query relating to resources, relationships, and/or properties based on uniqueness information. In another example, the query may be reformulated by modifying the resource type to which the query is anchored based on the topology information. The reformulated query may then be executed to identify target information in the isolated collection, thereby identifying the same target information as the original query, but in a manner that is more performant.
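The parameter reordering mentioned in this abstract can be illustrated with a small sketch. Everything here is assumed for illustration: the `uniqueness` scores, the node dictionaries, and the `reorder` and `execute` helpers are hypothetical, and a real graph engine would use its own statistics and query plan.

```python
from typing import Dict, List, Tuple

Constraint = Tuple[str, object]  # (property name, required value)


def reorder(constraints: List[Constraint],
            uniqueness: Dict[str, float]) -> List[Constraint]:
    """Put the most unique (most selective) properties first."""
    return sorted(constraints, key=lambda c: uniqueness.get(c[0], 0.0), reverse=True)


def execute(nodes: List[dict], constraints: List[Constraint]):
    """Return matching node ids and the number of property checks performed."""
    checks = 0
    hits = []
    for node in nodes:
        matched = True
        for prop, value in constraints:
            checks += 1
            if node.get(prop) != value:
                matched = False
                break  # stop at the first failing constraint
        if matched:
            hits.append(node["id"])
    return hits, checks


nodes = [
    {"id": 1, "type": "doc", "author": "ann", "title": "Q3"},
    {"id": 2, "type": "doc", "author": "bob", "title": "Q4"},
    {"id": 3, "type": "doc", "author": "ann", "title": "Q4"},
]
# Assumed ontology data: fraction of distinct values per property.
uniqueness = {"type": 0.01, "author": 0.5, "title": 0.9}

query = [("type", "doc"), ("author", "ann"), ("title", "Q4")]
slow_hits, slow_checks = execute(nodes, query)
fast_hits, fast_checks = execute(nodes, reorder(query, uniqueness))
```

Both orderings identify the same nodes, but checking the most unique property first rejects non-matching nodes with fewer comparisons.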
Trie-Based Indices for Databases
A database system uses byte ordering for keys and a trie index to reference stored data. The keys of a database are converted into byte-comparable sequences of byte values. The trie index is generated including nodes connected by edges defining paths from a root node to leaf nodes. Each edge is associated with at least one byte value such that each path from the root node to a leaf node through one or more edges defines a unique byte prefix for a byte-comparable sequence of byte values. The leaf node of each path is associated with a database location value. A record is accessed in the database using a database location value determined from referencing the trie index using a byte-comparable sequence of byte values of the record generated from a key of the record. A trie structure and byte ordered keys may be used for partition or row indices.
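A simplified sketch of a byte-keyed trie index follows. It assumes keys are unsigned 32-bit integers encoded big-endian so that byte order matches numeric order, and it stores the full encoded key along each path rather than only a distinguishing prefix as the abstract describes. The names `to_byte_comparable`, `TrieIndex`, and the location strings are hypothetical.

```python
import struct


def to_byte_comparable(key: int) -> bytes:
    """Big-endian encoding: comparing the bytes compares the integers."""
    return struct.pack(">I", key)


class TrieNode:
    __slots__ = ("children", "location")

    def __init__(self):
        self.children = {}    # byte value -> child node
        self.location = None  # database location value, set on leaves


class TrieIndex:
    def __init__(self):
        self.root = TrieNode()

    def put(self, key: int, location: str) -> None:
        node = self.root
        for byte in to_byte_comparable(key):
            node = node.children.setdefault(byte, TrieNode())
        node.location = location

    def get(self, key: int):
        node = self.root
        for byte in to_byte_comparable(key):
            node = node.children.get(byte)
            if node is None:
                return None
        return node.location

    def locations_in_key_order(self):
        """Depth-first walk visiting edges in ascending byte value."""
        def walk(node):
            if node.location is not None:
                yield node.location
            for byte in sorted(node.children):
                yield from walk(node.children[byte])
        return list(walk(self.root))


index = TrieIndex()
index.put(300, "page-3")
index.put(2, "page-1")
index.put(70000, "page-7")
ordered = index.locations_in_key_order()
```

Walking the trie in ascending byte order returns locations in key order without a separate sort, which is the property that makes byte-comparable keys useful for partition or row indices.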
System, Method, and Computer Program Product for Maintaining User Privacy in Advertisement Networks
A method for maintaining user privacy in advertisement networks may include receiving first persona data associated with a first user from at least one publisher system. The first persona data may be generalized to form first generalized persona data. A session key may be generated. The first generalized persona data and the session key may be encrypted with a first public key of an advertisement network system to form a first ciphertext. The first ciphertext and first user identity data may be communicated to a mediator system. The first user identity data may be associated with the first user's identity. A first encrypted targeted advertisement may be received based on the first generalized persona data from the advertisement network system via the mediator system. The first encrypted targeted advertisement may be decrypted with the session key to form a first targeted advertisement. A system and computer program product are also disclosed.
Reduction of data stored on a block processing storage system
Techniques and systems for reducing data stored on a block processing storage system are described. A losslessly reduced representation of a data block can include references to one or more prime data element blocks, and optionally a description of a reconstitution program which, when applied to the one or more prime data element blocks, results in the data block.
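One way to picture a reference plus a reconstitution program is as a delta against a stored element. The sketch below is only an analogy under stated assumptions: it uses Python's `difflib.SequenceMatcher` to derive the program, which the source does not mention, and the `make_program` and `reconstitute` names and the op format are hypothetical.

```python
from difflib import SequenceMatcher


def make_program(prime: bytes, block: bytes):
    """Describe `block` as copies from `prime` plus literal bytes."""
    program = []
    matcher = SequenceMatcher(None, prime, block, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            program.append(("copy", i1, i2))            # reuse prime[i1:i2]
        elif tag in ("replace", "insert"):
            program.append(("literal", block[j1:j2]))   # new bytes
        # "delete" emits nothing: those prime bytes are simply skipped
    return program


def reconstitute(prime: bytes, program) -> bytes:
    """Apply the program to the prime element to rebuild the block."""
    out = bytearray()
    for op in program:
        if op[0] == "copy":
            out += prime[op[1]:op[2]]
        else:
            out += op[1]
    return bytes(out)


prime = b"GET /index.html HTTP/1.1\r\nHost: example.com\r\n"
block = b"GET /about.html HTTP/1.1\r\nHost: example.com\r\n"

program = make_program(prime, block)
rebuilt = reconstitute(prime, program)
literal_bytes = sum(len(op[1]) for op in program if op[0] == "literal")
```

The reduction is lossless because applying the program to the prime element yields the original block exactly, while only the literal bytes and the copy ranges need to be stored.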
Generating context tree data based on a tailored data model
A system for providing a context tree based on a data model is disclosed. The system comprises an interface, a processor, and a memory. The interface is configured to receive a data model entry point, and to receive one or more context filters. The processor is configured to determine context tree data based on the one or more context filters and the data model entry point from any context tree provider that has appropriate context tree information. The memory is coupled to the processor and is configured to provide the processor with instructions.
Using a B-tree to store graph information in a database
Techniques to store graph information in a database are disclosed. In various embodiments, each node in a graph may be modeled as a micro b-tree. Node identity, attribute, edge, and edge attribute data may be stored in one or more pages modeled on page formats typically used to store index data for a relational database index. Data associated with a plurality of nodes and edges, each of said edges representing a relationship between two or more of said nodes, may be received. For each node, one or more pages of data may be created, each corresponding to a prescribed page size associated with a storage device in which said one or more pages are to be stored, and each page having a data structure that includes a variable-sized set of fixed length data slots and a variable-sized variable length data region.
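The page layout described at the end of this abstract resembles a slotted page, which can be sketched as follows. This is an assumption-laden illustration: the 256-byte page size, the 4-byte slot format, and the `Page` class are hypothetical, and the slot count and data offset are kept as Python attributes where a real page would store them in a header.

```python
import struct

PAGE_SIZE = 256               # assumed page size, chosen small for illustration
SLOT = struct.Struct("<HH")   # fixed-length slot: (offset, length)


class Page:
    def __init__(self, size: int = PAGE_SIZE):
        self.buf = bytearray(size)
        self.slot_count = 0     # slots grow upward from the start of the page
        self.data_start = size  # variable-length data grows down from the end

    def free_space(self) -> int:
        return self.data_start - self.slot_count * SLOT.size

    def add(self, record: bytes) -> int:
        """Store a record and return its slot number."""
        if len(record) + SLOT.size > self.free_space():
            raise ValueError("page full")
        self.data_start -= len(record)
        self.buf[self.data_start:self.data_start + len(record)] = record
        SLOT.pack_into(self.buf, self.slot_count * SLOT.size,
                       self.data_start, len(record))
        self.slot_count += 1
        return self.slot_count - 1

    def read(self, slot: int) -> bytes:
        if not 0 <= slot < self.slot_count:
            raise IndexError("no such slot")
        offset, length = SLOT.unpack_from(self.buf, slot * SLOT.size)
        return bytes(self.buf[offset:offset + length])


page = Page()
node_slot = page.add(b"node:42")
edge_slot = page.add(b"edge:42->7:knows")
```

Node identity, attribute, and edge records of different lengths can share one page this way, with each fixed-length slot pointing at its record in the variable-length region.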