G06F16/24539

Managing real time data stream processing
11520796 · 2022-12-06 · ·

A method for managing data processing includes receiving, from a user of a data query system, a data query for data stored in a data store in communication with the data query system. The method also includes receiving a staleness parameter indicating an upper time boundary for the data query. The upper time boundary limits a query response to data within the data store that is older than the upper time boundary. The method further includes determining whether the data stored within the data store satisfies the staleness parameter. When a portion of the data within the data store fails to satisfy the staleness parameter, the method includes generating the query response that excludes the portion of the data that fails to satisfy the staleness parameter.

Managing multiple cache specifications within a database environment
11514042 · 2022-11-29 · ·

Technologies are described for managing multiple cache specifications for a single view in a database environment. For example, a plurality of caches can be crated for the single view. Each of the plurality of caches can have a different cache specification. A query can then be received that operates on the single view. One of the plurality of caches can be selected for processing the query based at least in part on the cache specifications. The query can be processed using the selected cache. Results of processing the query can be output.

TECHNIQUES FOR A DETERMINISTIC DISTRIBUTED CACHE TO ACCELERATE SQL QUERIES

Techniques for providing improved distributed caching are disclosed. A distributed computing system can be implemented with a cluster including a plurality of worker nodes configured to host one or more executors for processing data related to a query. The worker nodes can host a cache accessible to the executors. The data can be processed as a plurality of data segments. The worker nodes can be uniformly assigned a plurality of token bounds defining a range of integer token values. A hashing algorithm can be used to compute a token for each data segment associated with the query. Tasks can be launched on the executors preferentially, such that the task for processing a data segment having a token within the token bounds associated with the preferred executor. Executors can be instructed to review the associated cache to identify outlier data segments and inform other nodes in the cluster.

SYSTEM AND METHOD FOR INCREMENTAL VIEW MAINTENANCE BASED ON DIFFERENTIAL CALCULUS OVER NATURAL ALGEBRA OF K-RELATIONS
20220374430 · 2022-11-24 ·

A method for incremental update of materialized views and a system for answering queries against relational databases, or object-oriented databases, or graph databases, are provided. The system comprises a Storage Engine subsystem, configured to store original data as well as materialized views and subviews in a dedicated subsystem, and a Diff Engine subsystem configured to translate Natural Algebra representations of a Natural Algebra view definition into derived Natural Algebra expressions. The system further comprises an Optimizer configured to translate derived Natural Algebra expressions into Incremental View Maintenance plans, and a Delta Extractor subsystem configured to extract any transactional changes to the original data or batches of the said changes in a form that can be passed as input to the Incremental View Maintenance plans in order to compute the changes to the materialized views.

Processing transactions in a synchronized replication system

Embodiments of the present disclosure provide a method, a computer program product and apparatus for processing transactions in a synchronized replication system, wherein the method comprises, at a source site in the synchronized replication system: serializing commits of transactions in the synchronized replication system so that only one of the transactions can be committed at the same time; in response to initiating the commit of the one transaction, generating a log for each of transactions that are ongoing in the synchronized replication system, so as to record impact of all operations of a respective transaction on the synchronized replication system; marking transactions for which the logs have been generated; and completing commits of the marked transactions.

Multi-prefix query optimization
11500883 · 2022-11-15 · ·

The present invention includes systems and methods for retrieving information via a flexible and consistent targeted search model that employs interactive dynamic menu information retrieval techniques that provide context-specific functionality tailored to particular information channels, as well as to records within or across such channels, and other known state information. Users are presented with a consistent search interface among multiple tiers across and within a large domain of information sources, and need not learn different or special search syntax. A thin-client server-controlled architecture enables users of resource-constrained mobile communications devices to locate targeted information more quickly by entering fewer keystrokes and performing fewer query iterations and web page refreshes, which in turn reduces required network bandwidth.

Method and apparatus for optimizing database transactions
11500869 · 2022-11-15 · ·

The disclosure provides a database operation method and apparatus. The method comprises: sequentially acquiring, during a process of executing a target transaction by an application server, database operation commands executed by the application server for the target transaction; executing a prediction algorithm on the database operation commands, returning predicted execution results to the application server so that the application server determines a next to-be-executed database operation command, and locally recording the database operation commands and predicted execution data generated from the executing of the prediction; and when acquiring a transaction commit command regarding the target transaction, controlling a database corresponding to the application server to actually execute the target transaction according to the locally recorded database operation commands and the predicted execution data. The disclosed embodiments improve transaction execution efficiency and increase transaction throughput.

Database performance degradation detection and prevention

Techniques for database performance degradation detection and prevention are described. A statement performance monitor observes queries executed against a database engine and clusters the queries into groups of queries. The index utilization of the query groups and execution metrics are tracked over time, and a sudden change of index utilization can be detected. The change can be reported to users and/or new indexes may be automatically generated to serve affected query groups. Additionally, a statement performance monitor may be deployed to statically analyze code to identify modified queries and the resultant change of use of query indexes.

Visualization Data Reuse In A Data Analysis System
20220358129 · 2022-11-10 ·

A method includes obtaining first data expressing a first usage intent; obtaining an answer object; obtaining, in accordance with first access control data, a first visualization data for the answer object; associating, with the first visualization data, a first generation-identifier that corresponds to at least a subset of the first access control data; generating first instructions for rendering the first visualization data on a user device of the first user; receiving a first snapshot of the first visualization data; storing a first association between the first generation-identifier and the first snapshot; obtaining second data expressing a second usage intent from a second user; identifying the answer object responsive to the second data expressing the second usage intent from the second user; and, responsive to second access control data corresponding to the second user matching the first generation-identifier, outputting second instructions for rendering the first snapshot to the second user.

Stand In Tables
20230098361 · 2023-03-30 · ·

A method for pre-utilizing tables before refresh includes selecting a query for execution in a data analytics application and identifying a table specified by the query, for instance a materialized table. Another operation includes computing an encoding, such as a hash value, for at least one portion of the identified table. Another operation includes comparing the computed encoding to a pre-stored encoding for the at least one portion of the identified table. On the condition that the computed encoding is identical to the pre-stored encoding, the operations include utilizing the identified table in executing the query in lieu of re-materializing the identified table. Otherwise, on the condition that the computed encoding differs from the pre-stored encoding, the operations include re-materializing the identified table before returning the results of the query.