Patent classifications
G06F16/24545
RESOURCE PROVISIONING SYSTEMS AND METHODS
A method for a first set of processors and a second set of processors comprises, the first set of processors processing a set of queries, as a result of a change in utilization of the first set of processors, processing the set of queries using the second set of processors. The change in processors is independent of a change in storage resources, the storage resources shared by the first set of processors and the second set of processors.
Method and system for implementing subscription barriers in a distributed computation system
Embodiments of the invention relate to a method for managing subscriptions. The method includes initiating execution of a first subscription, in response to the initiating, obtaining a first subscription barrier associated with a first subscription, making, using the first subscription barrier, a first determination to block execution of a first query request associated with the first subscription, and in response to the first determination, ceasing execution of the first subscription.
Robustness metrics for optimization of query execution plans
A method may include responding to a query to retrieve data from a database by identifying a plurality of query execution plans. An overall robustness value may be determined for each query execution plan. The overall robustness value of a query execution plan may correspond to a sum of individual robustness values for each operator included in the query execution plan. Each operator may have an individual robustness value that corresponds to a first change in a total cost of a query execution plan including the operator relative to a second change in an output cardinality of the operator. One of the plurality of query execution plans may be selected based on the overall robustness value of each of the plurality of query execution plans. The query may be executed by performing a sequence of operators included in the selected one of the plurality of query execution plan.
Runtime metric estimations for functions
In some examples, a system receives function descriptors for different types of functions to be used when processing database queries, each function descriptor of the function descriptors comprising information relating to a respective function of the different types of functions. The system computes, based on a first function descriptor for a first function of the different types of functions, an estimate of a runtime metric associated with execution of the first function for processing a database query.
APPLYING QUERY COST DATA BASED ON AN AUTOMATICALLY GENERATED SCHEME
An analytics system is operable to receive a first plurality of query requests from a plurality of requesting entities. Query pricing scheme data is automatically generated based on the first plurality of query requests. A second plurality of query requests are received from the plurality of requesting entities. Query cost data is automatically generated for each of the second plurality of query requests by utilizing the query pricing scheme data. The query cost data for each of the second plurality of query requests is transmitted to a corresponding one of the plurality of requesting entities.
Method and System for Performing Data Cloud Operations
Systems and methods are provided for managing and accessing data using one or more data cloud servers. An exemplary method includes: receiving from one or more data sources, a first data set; stratifying the first data set into first samples; receiving from second one or more data sources, a second data set; stratifying the second data set into second samples; computing a projection factor for each of the second samples using the first samples; computing projected samples using the projection factor for each of the second samples; receiving from third one or more data sources, a third data set; computing a parameter using the third data set; selecting one or more of the projected samples to form a fourth data set; and performing a computer operation for estimating the data using the fourth data set and the parameter.
Systems and methods for accelerating exploratory statistical analysis
Embodiments of the invention utilize a “data canopy” that breaks statistical measures down to basic primitives for various data portions and stores the basic aggregates in a library within an in-memory data structure. When a queried statistical measure involves a basic aggregate stored in the library over a data portion that at least partially overlaps the data portion associated with the basic aggregate, the basic aggregate may be reused in the statistical computation of the queried measure.
Dynamically adjusting statistics collection time in a database management system
Each of one or more commit cycles may be associated with a predicted number of updates. A statistics collection time for a database table can be determined by estimating a sum of predicted updates included in one or more commit cycles. Whether the estimated sum of predicted updates is greater than a first threshold may be determined. In addition, a progress point for a first one of the commit cycles can be determined. A time to collect statistics may be selected based on the progress point of the first commit cycle.
CONFIGURING GRAPH QUERY PARALLELISM FOR HIGH SYSTEM THROUGHPUT
The present disclosure involves systems, software, and computer implemented methods for configuring graph query parallelism for high system throughput. One example method includes receiving a query to be executed against a graph database. System properties are determined of a system in which the query is to be executed. Algorithmic properties are determined of at least one algorithm to be used to execute the query. Graph data statistics are determined for the graph database. Graph traversal estimations are determined for a first iteration of the graph query and an estimated cost model is determined for the first iteration based on the graph traversal estimations. Estimated thread boundaries are determined for performing parallel execution of the first iteration. Work packages of vertices to be processed during the execution of the first iteration are determined based on the first estimated cost model and the work packages are provided to a work package scheduler.
Virtual Warehouse Query Monitoring and Reporting
Methods, systems, and apparatuses for tracking and generating analysis of queries submitted to virtual warehouses are described herein. A request to execute a query may be received. An organization may be determined, and the query may be modified to add an indication of the organization. A plurality of virtual warehouses may be identified. One of the plurality of virtual warehouses may be caused to execute the query. Query results and costs data may be received based on execution of the query by the first virtual warehouse. The costs data may indicate one or more costs associated with the execution of the query by the first virtual warehouse. The indication of the organization may be extracted from the query results. A user interface comprising query cost information corresponding to the organization may be generated.