G06F16/2454

Enhanced database query processing

A method includes receiving, from a client device, an enhanced database query for a union operation of a first query and at least a second query, parsing the enhanced database query to identify two or more parameters controlling handling of duplicate rows in the first query and the second query, evaluating the enhanced database query utilizing the parameters to generate a result table, and providing the result table to the client device.

Reducing memory usage for long standing computations

Representative embodiments disclosed compress expressions so they utilize less physical storage. An expression is placed in a standard representation, such as an expression tree. The system utilizes one or more rules to identify portions of the expression that are likely to be common to other expressions. The common portions are extracted from the expression tree as a template and the remaining portions are hoisted from the expression as unique portions. If the template does not already reside in a cache, the template is stored in the cache. A cache reference is obtained for the template and combined with the unique portions to create a bundle that reduces storage requirements for the expression. The original expression is recovered by retrieving the template from the cache using the cache reference and placing the hoisted unique portions into their original locations in the template.

SOFTWARE VULNERABILITY GRAPH DATABASE

To analyze open-source code at a large scale, a security domain graph language (SGL) has been created that functions as a vulnerability description language and facilitates program analysis queries. The SGL facilitates building and maintaining a graph database to catalogue vulnerabilities found in open-source components. This graphical database can be accessed via a database interface directly or accessed by an agent that interacts with the database interface. To build the graph database, a database interface processes an open-source component and creates graph structures which represent relationships present in the open-source component. The database interface transforms a vulnerability description into a canonical form based on a schema for the graph database and updates the database based on a determination of whether the vulnerability is a duplicate. This ensures quality and consistency of the vulnerability dataset maintained in the graph database.

INTELLIGENT OPTIMIZATION OF PARAMETERIZED QUERIES
20240086404 · 2024-03-14 · ·

A computer implemented method can receive a parameterized query written in a declarative language. The parameterized query comprises a parameter which can be assigned different values. The method can perform a first compilation session of the parameterized query in which the parameter has no assigned value. Performing the first compilation session can generate an intermediate representation of the parameterized query. The intermediate representation describes a relational algebra expression to implement the parameterized query. The method can perform a second compilation session of the parameterized query in which parameter has an assigned value. Performing the second compilation session reuses the intermediate representation of the parameterized query.

Explicit declaration of associations to optimize grouping of elements by large data objects

The disclosure generally describes methods, software, and systems, including a method for optimizing queries referencing large object (LOB) information. A structured query language (SQL) query containing an LOB field is identified. LOB replacement information for the LOB field is obtained from a data model. The SQL query is optimized including substituting, using the obtained LOB replacement information, the LOB field with replacement language to create an updated SQL query, and extending the updated SQL query with a join of LOB field language to create an extended SQL query.

Partition-based analytic systems and methods
10402388 · 2019-09-03 · ·

A persistent distributed database system is disclosed that partitions a database structure into a plurality of partitions to optimize repeated analytics that are executed on the database structure. A repeated analytic is partitioned into sub-analytics that are executed on the database partitions, and upon repeated executions, sub-analytics are only repeated upon database partitions that have been updated since the previous execution. This drastically reduces the time to repeat execution of a repeated analytic, and frees up resources that could be better spent on other processes.

EVALUATION OF PLURAL EXPRESSIONS CORRESPONDING TO INPUT DATA
20190258681 · 2019-08-22 ·

An embodiment of the present invention evaluates plural expressions. A model is generated and configured to evaluate a plurality of expressions each including one or more expression tokens and indicating a data pattern. The model includes a plurality of nodes with one or more of the nodes associated with an expression token and one or more links between the nodes. The links are associated with information indicating each expression including each expression token associated with nodes connected by the links. Data including one or more data tokens is applied to the model. The nodes of the model are traversed over one or more corresponding links based on the one or more data tokens within the data corresponding to expression tokens associated with the nodes. Expressions corresponding to the data are determined based on the expressions associated with the one or more corresponding links.

DYNAMIC COMBINATION OF PROCESSES FOR SUB-QUERIES
20190258639 · 2019-08-22 ·

A tool for combining common processes shared by at least two or more sub-queries within a query is provided. The tool determines whether one or more sub set relationships are shared between the at least two or more sub-queries. Responsive to a determination that one or more sub set relationships are shared between the at least two or more sub-queries, the tool determines an order class for the at least two or more sub-queries based on the one or more sub set relationships, wherein determining the order class includes transforming the query to include one or more differing aspects within the single shared common process, with the one or more differing aspects arranged based, at least in part, on a query style, a query type, and a query function. Responsive to determining an access path for the query, the tool executes the access path during run-time for data accessing.

Compiling optimized database statements
11989177 · 2024-05-21 · ·

Compiling optimized database statements including receiving, by a query compiler from a client computing system, a state specification of a graphical user interface in response to a user manipulating the graphical user interface on the client computing system, wherein the state specification is in a standard file format and describes modifications to a plurality of visual elements of the graphical user interface; compiling, by the query compiler, a database statement from the state specification such that a limit clause within the database statement is processed by a database before at least one join clause; and sending, by the query compiler, the optimized database statement to a database on a cloud-based data warehouse.

Selection of Subexpressions to Materialize for Datacenter Scale

Described herein is a system and method for selecting subexpressions to be materialized. For a predefined storage budget, subexpressions of a set of candidate subexpressions to be materialized to minimize query evaluation cost are selected based upon a calculated utility of the set of candidate subexpressions, interactions of the candidate subexpressions, and, a cost of evaluating the candidate subexpressions. Based upon the subexpressions selected to be materialized, subexpression(s) of the set of candidate subexpressions to use when evaluating particular queries of the set of queries to minimize query evaluation cost are determined.