G06F16/24545

Database management system and database processing method

The database management system (DBMS) receives a first instruction specifying anonymization rule information corresponding to a column of the relation table, among anonymization rule information that is present for each column included in the relation table and shows a plurality of generalization rules. The DBMS reads the column from the relation table in response to the first instruction, and generates a temporary result obtained by generalizing each attribute value of the column based on any of the plurality of generalization rules. The DBMS generates an aggregate result obtained by aggregating the temporary result. When the aggregate result satisfies a disclosure rule, the DBMS generates an anonymization method including generalization information indicating a correspondence relationship between each attribute value of the column and any of the plurality of generalization rules. The DBMS generates anonymization information as a result of processing the relation table based on the anonymization method.
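
The generalize, aggregate, and check loop described above can be sketched roughly as follows. This is a minimal illustration, not the patented method: the three rules (`exact`, `decade`, `suppress`), the function name `find_anonymization_method`, and the choice of a k-anonymity-style count threshold as the disclosure rule are all assumptions made for the example. It also simplifies by applying one rule to the whole column, whereas the abstract allows a different rule per attribute value.

```python
from collections import Counter

# Hypothetical generalization rules for a numeric "age" column,
# ordered from least to most general.
def exact(value):
    return str(value)

def decade(value):
    low = value // 10 * 10
    return f"{low}-{low + 9}"

def suppress(value):
    return "*"

RULES = [exact, decade, suppress]

def find_anonymization_method(column, k):
    """Try each rule in order and keep the first whose aggregate passes.

    The assumed disclosure rule: every generalized value must occur
    at least k times in the column.
    """
    for rule in RULES:
        temporary = [rule(v) for v in column]        # temporary result
        aggregate = Counter(temporary)               # aggregate result
        if all(count >= k for count in aggregate.values()):
            # Generalization information: attribute value -> rule name.
            mapping = {v: rule.__name__ for v in column}
            return mapping, temporary
    return None, None

ages = [21, 23, 27, 34, 35, 38]
mapping, anonymized = find_anonymization_method(ages, k=3)
# With k=3 the "decade" rule is the first to pass: two groups of three.
```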

Distributed memory-augmented neural network architecture

A method for using a distributed memory device in a memory augmented neural network system includes receiving, by a controller, an input query to access data stored in the distributed memory device, the distributed memory device comprising a plurality of memory banks. The method further includes determining, by the controller, a memory bank selector that identifies a memory bank from the distributed memory device for memory access, wherein the memory bank selector is determined based on a type of workload associated with the input query. The method further includes computing, by the controller and by using content based access, a memory address in the identified memory bank. The method further includes generating, by the controller, an output in response to the input query by accessing the memory address.
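
As a rough sketch of the two-stage lookup described above, the following picks a bank from the workload type and then locates an address inside that bank by content. The class `MemoryController`, the workload-to-bank table, and the use of cosine similarity with a hard argmax are assumptions for illustration; the abstract does not specify how the selector or the content-based address is computed.

```python
import math

def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

class MemoryController:
    """Hypothetical controller over a dict of memory banks.

    Each bank is a list of (key_vector, stored_value) rows.
    """

    def __init__(self, banks, workload_to_bank):
        self.banks = banks
        self.workload_to_bank = workload_to_bank

    def select_bank(self, workload_type):
        # Memory bank selector: determined by the workload type of the query.
        return self.workload_to_bank[workload_type]

    def content_address(self, bank_id, key):
        # Content-based access: index of the row most similar to the key.
        rows = self.banks[bank_id]
        return max(range(len(rows)), key=lambda i: cosine(rows[i][0], key))

    def query(self, workload_type, key):
        bank_id = self.select_bank(workload_type)
        address = self.content_address(bank_id, key)
        return self.banks[bank_id][address][1]

banks = {
    "vision": [([1.0, 0.0], "cat"), ([0.0, 1.0], "dog")],
    "text": [([1.0, 1.0], "hello"), ([1.0, -1.0], "bye")],
}
controller = MemoryController(banks, {"image": "vision", "language": "text"})
```

A differentiable system would more likely use a soft, softmax-weighted read over all rows; the hard argmax here only keeps the example short.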

OPTIMAL QUERY SCHEDULING FOR RESOURCE UTILIZATION OPTIMIZATION

The present disclosure provides a method, system and computer program product for optimal query scheduling for resource utilization optimization. In an embodiment of the disclosure, a process for optimal query scheduling includes receiving, in an information retrieval data processing system at a contemporaneous time, a request for deferred execution of a specified query at a future time after the contemporaneous time. The method additionally includes determining a frequency of change of data corresponding to a field referenced in the specified query. Then, on condition that the frequency of change is below a threshold value, an intermediate time prior to the future time but after the contemporaneous time can be identified and the specified query scheduled for execution at the intermediate time instead of the future time. Otherwise, the specified query can be scheduled at the future time as originally requested.
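
The scheduling decision above reduces to a single comparison, sketched below. The function name `schedule_query` and its parameters are hypothetical, and the sketch assumes the intermediate time is supplied by the caller (for example, a known low-load window) rather than searched for.

```python
from datetime import datetime

def schedule_query(now, future_time, change_frequency, threshold, candidate_time):
    """Return the time at which the deferred query should run.

    If the referenced data changes rarely (frequency below the threshold)
    and the candidate lies strictly between now and the requested time,
    run early at the candidate; otherwise keep the requested time.
    """
    if not now < future_time:
        raise ValueError("future_time must be after now")
    if change_frequency < threshold and now < candidate_time < future_time:
        return candidate_time
    return future_time

now = datetime(2024, 1, 1, 9)
requested = datetime(2024, 1, 1, 23)
idle_slot = datetime(2024, 1, 1, 12)   # assumed low-load window

early = schedule_query(now, requested, 0.1, 1.0, idle_slot)   # rarely changing data
late = schedule_query(now, requested, 5.0, 1.0, idle_slot)    # frequently changing data
```

Running early is only safe when the data is unlikely to change before the originally requested time, which is what the threshold test stands in for.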

DATA ARRANGEMENT MANAGEMENT IN A DISTRIBUTED DATA CLUSTER ENVIRONMENT OF A SHARED POOL OF CONFIGURABLE COMPUTING RESOURCES
20170371928 · 2017-12-28

Disclosed aspects relate to data arrangement management in a distributed data cluster environment of a shared pool of configurable computing resources. In the distributed data cluster environment, a set of data is monitored for a data redistribution candidate trigger. The data redistribution candidate trigger is detected with respect to the set of data. Based on the data redistribution candidate trigger, the set of data is analyzed with respect to a candidate data redistribution action. Using the candidate data redistribution action, a new data arrangement associated with the set of data is determined. Accordingly, the new data arrangement is established.

METHOD FOR PREDICATE EVALUATION IN RELATIONAL DATABASE SYSTEMS

A method for performing row qualification in database table retrieval and join operations. This method, referred to as bulk qualification, evaluates conditions on multiple rows in a database table at the same time, providing more efficient utilization of memory bandwidth and CPU throughput.
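
One common way to evaluate conditions on many rows at once is to keep one bit per row and combine predicates with bitwise operations. The sketch below assumes that reading of "bulk qualification"; the abstract does not say how the method is implemented, and the helper names are hypothetical. In pure Python the per-column loop is still scalar, so only the combining step is genuinely bulk; a real engine would typically use SIMD comparisons over packed column data.

```python
def predicate_bitmap(column, predicate):
    """Build an integer bitmap with bit i set when row i satisfies the predicate."""
    bits = 0
    for row, value in enumerate(column):
        if predicate(value):
            bits |= 1 << row
    return bits

def qualifying_rows(bitmap):
    """Decode a bitmap back into a sorted list of row numbers."""
    rows = []
    row = 0
    while bitmap:
        if bitmap & 1:
            rows.append(row)
        bitmap >>= 1
        row += 1
    return rows

ages = [25, 41, 33, 52]
cities = ["NY", "SF", "NY", "NY"]

older = predicate_bitmap(ages, lambda a: a > 30)        # rows 1, 2, 3
in_ny = predicate_bitmap(cities, lambda c: c == "NY")   # rows 0, 2, 3

# A single AND qualifies every row against both conditions at once.
both = older & in_ny
```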

Method for en passant workload shift detection

Embodiments of the present invention provide systems, methods, and computer program products for detecting shifts in types of workloads handled by a relational database management system. Embodiments of the present invention can afford relational database administrators with the ability to leverage information pertinent to the current type of workload being handled by the relational database management system. Furthermore, embodiments of the present invention provide relational database administrators with information in regard to system workload states and workload transitions.

Sharded storage of geolocated data with predictable query response times

A system enables an arbitrary number of items to be indexed in a geographic region that provides a predictable query response time across a sharded database. Items indexed to the geographic region are stored on a single shard and additional items are added to that shard as long as an overflow condition indicative of undesirable query response times is not met. If the overflow condition is met the system expands the storage of items indexed to the geographic region to one or more additional shards in order to maintain predictable query response times. The system may maintain a shard count representing the total number of shards being used to store items corresponding to a geographic region, which can be used to query one or more relevant shards. The system may apply deterministic hashing in order to evenly distribute shards across database nodes of the sharded database.
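
A minimal sketch of the overflow-and-expand scheme described above follows. The class `RegionShards`, the in-memory dictionaries, and the use of a fixed item count as the overflow condition are assumptions for illustration; the abstract only says the condition is indicative of undesirable query response times. SHA-256 stands in for whatever deterministic hash the system uses.

```python
import hashlib

class RegionShards:
    """Hypothetical in-memory model of per-region shard expansion."""

    def __init__(self, num_nodes, max_items_per_shard):
        self.num_nodes = num_nodes
        self.max_items_per_shard = max_items_per_shard
        self.shard_count = {}   # region -> number of shards in use
        self.shards = {}        # (region, shard_index) -> list of items

    def node_for(self, region, shard_index):
        # Deterministic hashing spreads shards across database nodes.
        digest = hashlib.sha256(f"{region}:{shard_index}".encode()).digest()
        return int.from_bytes(digest[:8], "big") % self.num_nodes

    def add(self, region, item):
        count = self.shard_count.setdefault(region, 1)
        current = self.shards.setdefault((region, count - 1), [])
        if len(current) >= self.max_items_per_shard:   # assumed overflow condition
            count += 1
            self.shard_count[region] = count
            current = self.shards.setdefault((region, count - 1), [])
        current.append(item)

    def query(self, region):
        # The shard count tells the reader exactly which shards to visit.
        count = self.shard_count.get(region, 0)
        return [item for i in range(count) for item in self.shards[(region, i)]]

store = RegionShards(num_nodes=4, max_items_per_shard=2)
for name in ["a", "b", "c", "d", "e"]:
    store.add("region-1", name)
```

Because each shard holds a bounded number of items, a reader's work per shard stays bounded as the region grows, at the price of visiting more shards.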

SCHEDULING PARALLEL EXECUTION OF QUERY SUB-PLANS
20230195729 · 2023-06-22

Various embodiments provide for executing sub-plans in parallel using a plurality of execution nodes, which can be part of a data platform. In particular, various embodiments identify sub-plans (e.g., fragments or portions of one or more child operators) of a root operator in a query plan that are candidates for execution on a single execution node, determine a cost estimate for causing the candidate sub-plans to be executed in parallel using multiple execution nodes, and cause the candidate sub-plans to be executed in parallel based on the cost estimate.
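
The cost-based choice between single-node and parallel execution can be sketched as below. The cost model (a fixed dispatch overhead per sub-plan and a greedy longest-first assignment to nodes) and the function names are assumptions for illustration only; threads stand in for execution nodes.

```python
from concurrent.futures import ThreadPoolExecutor

def estimate_costs(subplan_costs, num_nodes, dispatch_overhead):
    """Return (serial_cost, parallel_cost) under a simple assumed model."""
    serial = sum(subplan_costs)
    loads = [0.0] * num_nodes
    # Greedy: give each sub-plan, largest first, to the least-loaded node.
    for cost in sorted(subplan_costs, reverse=True):
        target = loads.index(min(loads))
        loads[target] += cost + dispatch_overhead
    return serial, max(loads)

def run_subplans(subplans, subplan_costs, num_nodes, dispatch_overhead):
    """Run callables in parallel only if the estimate says it is cheaper."""
    serial, parallel = estimate_costs(subplan_costs, num_nodes, dispatch_overhead)
    if parallel < serial:
        with ThreadPoolExecutor(max_workers=num_nodes) as pool:
            # pool.map preserves the input order of the sub-plans.
            return "parallel", list(pool.map(lambda plan: plan(), subplans))
    return "serial", [plan() for plan in subplans]

plans = [lambda: "scan A", lambda: "scan B"]
heavy = run_subplans(plans, [10, 10], num_nodes=2, dispatch_overhead=1)
light = run_subplans(plans, [1, 1], num_nodes=2, dispatch_overhead=5)
```

With two sub-plans of cost 10 the parallel estimate (11) beats the serial one (20); with two of cost 1 the dispatch overhead dominates and the sub-plans stay on one node.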

STORAGE RESOURCE PROVISIONING SYSTEMS AND METHODS
20230185824 · 2023-06-15

A method and apparatus for managing a set of storage resources for a set of queries is described. In an exemplary embodiment, a method provisions processing resources of an execution platform and provisions storage resources of a storage platform. The execution platform uses the storage platform, which is shared with the execution platform, to process the set of queries. The method changes a number of the storage resources provisioned for the storage platform based on a storage capacity utilization by the set of queries of the storage resources. The method changes the number of the storage resources independently of a change of the processing resources in the execution platform. The method processes the set of queries using the changed number of the storage resources provisioned for the storage platform.
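
The independence of storage scaling from compute scaling can be illustrated with a small sketch. The `Platform` record, the 70 percent target utilization, and the sizing formula are assumptions made for the example, not details from the abstract.

```python
from dataclasses import dataclass

@dataclass
class Platform:
    compute_nodes: int   # processing resources of the execution platform
    storage_nodes: int   # storage resources of the storage platform

def storage_nodes_needed(used_gb, node_capacity_gb, target_utilization_pct=70):
    """Smallest node count that keeps utilization at or below the target.

    Integer arithmetic avoids floating-point rounding in the ceiling.
    """
    usable_per_node = node_capacity_gb * target_utilization_pct  # scaled by 100
    needed = -(-used_gb * 100 // usable_per_node)                # ceiling division
    return max(1, needed)

def rescale_storage(platform, used_gb, node_capacity_gb):
    """Change only the storage node count; compute is left untouched."""
    return Platform(
        compute_nodes=platform.compute_nodes,
        storage_nodes=storage_nodes_needed(used_gb, node_capacity_gb),
    )

before = Platform(compute_nodes=4, storage_nodes=2)
after = rescale_storage(before, used_gb=710, node_capacity_gb=100)
```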

GENERATING QUERY COST DATA BASED ON AT LEAST ONE QUERY FUNCTION OF A QUERY REQUEST

A query processing system is operable to determine a query pricing scheme and receive a query request from a requesting entity that indicates at least one query function for execution against a database system. Query cost data is generated based on the at least one query function by utilizing the query pricing scheme. The query cost data is transmitted to the requesting entity.
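
A minimal sketch of function-based pricing follows. The per-function price table, the function names, and the additive cost model are hypothetical; the abstract does not describe the structure of the pricing scheme.

```python
# Hypothetical pricing scheme: a flat price per query function.
PRICING_SCHEME = {"SELECT": 1.0, "JOIN": 5.0, "AGGREGATE": 3.0}

def generate_query_cost(query_functions, pricing_scheme):
    """Sum the price of each query function named in the request.

    Raises KeyError for a function the scheme does not price, so an
    unpriced function is never silently treated as free.
    """
    return sum(pricing_scheme[function] for function in query_functions)

request = ["SELECT", "JOIN", "JOIN"]
cost = generate_query_cost(request, PRICING_SCHEME)   # 1.0 + 5.0 + 5.0
```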