Patent classifications
G06F16/24565
REAL TIME METHOD AND SYSTEM FOR ANALYZING DATA STREAMS
Data analysis plays a crucial role in extracting significant information from data. A real-time system and method for analyzing data streams are provided. The system can utilize many different data formats, such as numeric, text, video, audio, image, or a combination thereof. The analysis takes place as required using an analytical engine, and an intermediate output is generated. The intermediate output is further processed using a distributed real-time business rule processing engine to determine required conditions in the data. The business rules comprise one or more sets of metadata. On a match of a business rule, the system triggers an alert or propagates the required information to an integrating solution for required actions. The system and method are technology and communication protocol agnostic, and are designed with a highly efficient load-balancing technique, thereby facilitating highly concurrent data processing with minimal latency.
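The rule-matching step described above can be illustrated with a minimal sketch. It assumes each business rule is a named predicate over one field of a record and that an "alert" is simply a (rule name, record) pair; the names `Rule`, `match_rules`, and `process_stream` are hypothetical and do not come from the abstract, which does not specify how rules are represented or distributed.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, Iterable, List, Tuple


@dataclass
class Rule:
    """Hypothetical business rule: a named predicate over one record field."""
    name: str
    field: str
    predicate: Callable[[Any], bool]


def match_rules(record: Dict[str, Any], rules: List[Rule]) -> List[str]:
    """Return the names of all rules whose condition holds for this record."""
    return [
        rule.name
        for rule in rules
        if rule.field in record and rule.predicate(record[rule.field])
    ]


def process_stream(
    records: Iterable[Dict[str, Any]], rules: List[Rule]
) -> List[Tuple[str, Dict[str, Any]]]:
    """Check every incoming record against the rules and collect alerts."""
    alerts = []
    for record in records:
        for rule_name in match_rules(record, rules):
            alerts.append((rule_name, record))
    return alerts
```

In this sketch the records stand in for the intermediate output of the analytical engine; a real deployment would presumably shard the rule evaluation across workers, which is not shown here.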
ESTIMATED EXECUTION TIME FOR QUERY EXECUTION
The subject technology tracks a plurality of queries corresponding to a plurality of query plans based on join operations contained in each of the plurality of queries and a previous time of executing each query. The subject technology selects a first query plan among the plurality of query plans. The subject technology determines a value indicating an estimated improvement in execution time of the first query plan in comparison to a previous execution time of a previous query plan. The subject technology attempts to execute a first query using the first query plan. The subject technology determines that a second query plan selected among the plurality of query plans has a second estimated execution time that is less than an estimated execution time of the first query plan. The subject technology executes the first query corresponding to the first query plan at a subsequent time using the second query plan.
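The plan-switching logic above can be sketched as follows, assuming each plan carries a single estimated execution time in seconds. `QueryPlan`, `estimated_improvement`, and `plan_for_next_run` are hypothetical names chosen for illustration; the abstract does not say how estimates are produced or how join operations are used to group queries.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class QueryPlan:
    """Hypothetical plan record with an estimated execution time."""
    plan_id: str
    estimated_seconds: float


def estimated_improvement(candidate: QueryPlan, previous_seconds: float) -> float:
    """Positive when the candidate is expected to beat the previous run."""
    return previous_seconds - candidate.estimated_seconds


def plan_for_next_run(current: QueryPlan, plans: List[QueryPlan]) -> QueryPlan:
    """Switch to another tracked plan only if its estimate is strictly lower."""
    best = min(plans, key=lambda plan: plan.estimated_seconds)
    return best if best.estimated_seconds < current.estimated_seconds else current
```

Under these assumptions, a query first run with one plan would be re-run with a second plan at a subsequent time whenever the second plan's estimate is lower.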
CREATING VIRTUALIZED DATA ASSETS USING EXISTING DEFINITIONS OF ETL/ELT JOBS
A computer-implemented method, system and computer program product for creating virtualized data assets. A copy of an extract, transform, load (ETL) or an extract, load, transform (ELT) job definition is created. Furthermore, a table function is created in which the table function is configured to trigger the execution of the copy of the ETL/ELT job definition, to read the resulting data from executing the copy of the ETL/ELT job definition and to return the resulting data (data sets), such as in a queue in a federation server. After receiving a query from a user based on a virtual view in which the query includes the ETL/ELT job definition, the table function is executed, which triggers execution of the copy of the ETL/ELT job definition. Data sets are then produced by executing the copied ETL/ELT job definition, and these data sets are read and returned to the user in the virtual view.
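A minimal sketch of the table-function idea, assuming a job definition can be represented as a plain dictionary and that some `run_job` callable knows how to execute it. Both `make_table_function` and `run_job` are hypothetical; the abstract does not describe the federation server's actual interfaces.

```python
import copy
from typing import Any, Callable, Dict, Iterable, List


def make_table_function(
    job_definition: Dict[str, Any],
    run_job: Callable[[Dict[str, Any]], Iterable[Any]],
) -> Callable[[], List[Any]]:
    """Build a table function bound to a private copy of the job definition."""
    # Copy the definition so later edits to the original job do not
    # change what the virtual view returns.
    job_copy = copy.deepcopy(job_definition)

    def table_function() -> List[Any]:
        # Trigger the copied job, read its output, and return the rows.
        return list(run_job(job_copy))

    return table_function
```

A query against the virtual view would then call the returned function, so the data is produced on demand rather than stored.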
Extract-transform-load script generation
One embodiment provides a computer implemented method, including: receiving, from a user, a natural language query for data contained within at least one data repository; identifying at least one concept from the natural language query, wherein the at least one concept includes an entity and an intent; identifying a plurality of datasets satisfying the natural language query by querying the at least one data repository utilizing the at least one concept; ranking the plurality of datasets based on relevance to the query; generating an extract-transform-load script that extracts, transforms, and loads a dataset selected by the user from the plurality of datasets; and retrieving data included in the dataset utilizing the extract-transform-load script, wherein the retrieving includes returning the data to the user.
World knowledge triggers
Methods, systems, and apparatus for associating actions with events that are determined independent of the action type of the action. A system receives, from a user device, identification of an action to be taken, the action being one of a plurality of action types, and an identification of a first event that is determined independent of the action type and for which the occurrence of the first event may be determined by the system. The system associates the action and the first event with instructions for determining if the first event has occurred, and then determines whether the first event has occurred. In response to determining that the first event has occurred, the system causes the action to occur.
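The association between an action and an independently determined event can be sketched as a small registry. This assumes the "instructions for determining if the event has occurred" are a zero-argument check function and that the system polls it; `TriggerRegistry`, `register`, and `poll` are hypothetical names, not part of the described system.

```python
from typing import Any, Callable, List, Tuple


class TriggerRegistry:
    """Hypothetical store of (event check, action) pairs awaiting their event."""

    def __init__(self) -> None:
        self._pending: List[Tuple[Callable[[], bool], Callable[[], Any]]] = []

    def register(self, event_check: Callable[[], bool], action: Callable[[], Any]) -> None:
        """Associate an action with the check that detects its event."""
        self._pending.append((event_check, action))

    def poll(self) -> List[Any]:
        """Run every action whose event has occurred; keep the rest pending."""
        results = []
        still_pending = []
        for event_check, action in self._pending:
            if event_check():
                results.append(action())
            else:
                still_pending.append((event_check, action))
        self._pending = still_pending
        return results
```

Because the check function knows nothing about the action, any action type can be attached to any event, which is the independence the abstract describes.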
Key-value store system
A system includes a processor and a memory storing multiple records in a key-value data structure and processor executable instructions for managing access to records in the key-value data structure. The instructions are executable to perform operations including managing a shared atomic epoch counter and thread epoch counters, determining a maximal safe epoch as a function of the shared atomic epoch counter and the thread epoch counters, maintaining a drain list of trigger actions, and triggering the trigger actions in the drain list as a function of an update of the shared atomic epoch counter and the maximal safe epoch.
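The epoch mechanism above can be sketched in a simplified form. This version assumes the maximal safe epoch is one less than the oldest epoch any active thread still holds (or one less than the current epoch when no thread is active), and it uses a lock in place of a true atomic counter. `EpochManager` and its method names are hypothetical; the abstract only says the safe epoch is "a function of" the counters.

```python
import threading
from typing import Callable, Dict, List, Tuple


class EpochManager:
    """Simplified epoch protection with a drain list of deferred actions."""

    def __init__(self) -> None:
        self._lock = threading.Lock()  # stands in for atomic operations
        self.current_epoch = 1         # shared epoch counter
        self.thread_epochs: Dict[str, int] = {}
        self.drain_list: List[Tuple[int, Callable[[], None]]] = []

    def protect(self, thread_id: str) -> None:
        """Record that a thread is working in the current epoch."""
        with self._lock:
            self.thread_epochs[thread_id] = self.current_epoch

    def release(self, thread_id: str) -> None:
        """Drop a thread's epoch and run any actions that became safe."""
        with self._lock:
            self.thread_epochs.pop(thread_id, None)
            ready = self._take_ready()
        for action in ready:
            action()

    def bump(self, action: Callable[[], None]) -> None:
        """Advance the shared epoch and defer an action until the old epoch is safe."""
        with self._lock:
            self.drain_list.append((self.current_epoch, action))
            self.current_epoch += 1
            ready = self._take_ready()
        for action in ready:
            action()

    def max_safe_epoch(self) -> int:
        with self._lock:
            return self._safe_epoch()

    def _safe_epoch(self) -> int:
        # Assumed rule: every epoch older than the oldest active one is safe.
        oldest = min(self.thread_epochs.values(), default=self.current_epoch)
        return oldest - 1

    def _take_ready(self) -> List[Callable[[], None]]:
        safe = self._safe_epoch()
        ready = [action for epoch, action in self.drain_list if epoch <= safe]
        self.drain_list = [(e, a) for e, a in self.drain_list if e > safe]
        return ready
```

A typical use would be deferring the reclamation of a record until no thread can still be reading it: the action sits in the drain list until every thread has left the epoch in which it was registered.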
Data simulation for regression analysis
A simulated dataset is queried for regression by validating a structured query language (SQL) statement, determining a pattern type of the SQL statement, reconstructing the SQL statement according to a predetermined process for the pattern type, creating a mutated SQL statement for querying a simulated dataset, and validating the mutated SQL statement. The simulated dataset is based on a confidential dataset having the confidential elements removed or replaced.
GENERATING TRANSFORMATIONS OF ANALYTIC ARTIFACTS
A computer-implemented method, system and computer program product for generating a visual transformation of a business analytic artifact. Features from a received business analytic artifact (e.g., dashboard) are extracted. Furthermore, constraints are determined for a transformed representation of the business analytic artifact. A heuristic-based search is then performed on a space of possible transformed representations of the business analytic artifact using the extracted features to identify a transformed version that satisfies the determined constraints and maximizes readability. A final score is then generated for each possible transformed representation of the business analytic artifact based on the extent that the transformed version satisfies the determined constraints and maximizes the readability. The transformed representation of the business analytic artifact that has the highest score is selected and displayed on the target device. In this manner, meaningful transformations of business analytic artifacts are generated and displayed on a computing device with minimal distortion.
Synopsis based advanced partition elimination
Disclosed herein are system, method, and computer program product embodiments for determining that a constraint data statistics object is consistent. An embodiment operates by receiving a query for a database table. The embodiment determines a constraint data statistics object associated with a partition of the database table. The embodiment determines that the constraint data statistics object is consistent with a data state of the partition of the database table based on consistency metadata of the constraint data statistics object. The embodiment processes the query for the database table based on determining that the constraint data statistics object is consistent with the data state of the partition of the database table.
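One way to picture the consistency check is a per-partition min/max synopsis tagged with the data version it was built from. The sketch below assumes exactly that: the "consistency metadata" is a version number, and a partition is skipped only when its synopsis is consistent and excludes the queried range. `Synopsis`, `Partition`, `is_consistent`, and `partitions_to_scan` are hypothetical names, and the version-based check is an assumption rather than the method the abstract discloses.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Synopsis:
    """Hypothetical statistics object: value bounds plus the version they describe."""
    min_value: int
    max_value: int
    built_at_version: int


@dataclass
class Partition:
    name: str
    data_version: int
    synopsis: Optional[Synopsis] = None


def is_consistent(partition: Partition) -> bool:
    """A synopsis is trusted only if it matches the partition's current data state."""
    return (
        partition.synopsis is not None
        and partition.synopsis.built_at_version == partition.data_version
    )


def partitions_to_scan(partitions: List[Partition], low: int, high: int) -> List[str]:
    """Eliminate partitions whose consistent synopsis cannot overlap [low, high]."""
    selected = []
    for partition in partitions:
        if is_consistent(partition):
            synopsis = partition.synopsis
            if synopsis.max_value < low or synopsis.min_value > high:
                continue  # safely eliminated
        selected.append(partition.name)  # overlapping or untrusted: must scan
    return selected
```

A stale synopsis is never used for elimination in this sketch, so a partition modified after its statistics were built is always scanned.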
RESOURCE PROVISIONING IN DATABASE SYSTEMS
Resource provisioning systems and methods are described. In an embodiment, a system includes a plurality of shared storage devices collectively storing database data, an execution platform, and a compute service manager. The compute service manager is configured to determine a task to be executed in response to a trigger event and determine a query plan for executing the task, wherein the query plan comprises a plurality of discrete subtasks. The compute service manager is further configured to assign the plurality of discrete subtasks to one or more nodes of a plurality of nodes of the execution platform, determine whether execution of the task is complete, and in response to determining the execution of the task is complete, store a record in the plurality of shared storage devices indicating the task was completed.
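The assignment and completion-record steps can be sketched as below, assuming subtasks are plain callables, nodes are identified by name, and round-robin is the assignment policy. All of these are illustrative assumptions: the abstract does not name a scheduling policy, and `assign_subtasks`, `run_task`, and the `completion_log` list (standing in for the shared storage devices) are hypothetical.

```python
from itertools import cycle
from typing import Any, Callable, Dict, List


def assign_subtasks(subtasks: List[Any], nodes: List[str]) -> Dict[str, List[Any]]:
    """Spread subtasks over nodes in round-robin order (assumed policy)."""
    if not nodes:
        raise ValueError("at least one node is required")
    assignment: Dict[str, List[Any]] = {node: [] for node in nodes}
    for subtask, node in zip(subtasks, cycle(nodes)):
        assignment[node].append(subtask)
    return assignment


def run_task(
    task_name: str,
    subtasks: List[Callable[[], Any]],
    nodes: List[str],
    completion_log: List[Dict[str, str]],
) -> bool:
    """Run every assigned subtask, then record completion in shared storage."""
    assignment = assign_subtasks(subtasks, nodes)
    finished = 0
    for assigned in assignment.values():
        for subtask in assigned:
            subtask()  # executed locally here; a real node would run it remotely
            finished += 1
    complete = finished == len(subtasks)
    if complete:
        completion_log.append({"task": task_name, "status": "completed"})
    return complete
```

The sketch runs the subtasks sequentially in one process purely to show the control flow from trigger to completion record.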