G06F16/1844

METHOD, COMPUTER-READABLE MEDIUM AND FILE SYSTEM FOR DEDUPLICATION
20230063119 · 2023-03-02 ·

A method for deduplication applicable to a file chunked into a plurality of deduplicated chunks is provided and includes: defining a calculation range in the file according to types of the chunks in the file, where the calculation range includes a plurality of consecutive chunks in the file; generating an evaluation value according to the types of the chunks in the calculation range to determine whether to mark the chunks in the calculation range; and re-chunking and deduplicating the marked chunks in the file. A computer-readable medium and a file system corresponding to the method for deduplication are also provided.

Systems and methods for facilitating access to private files using a cloud storage system

A method for providing access to objects associated with a particular client in a cloud storage system is disclosed. The method includes the steps of establishing a connection with a user, providing a client namespace associated with the client to the user, where the client namespace represents objects stored on the cloud storage system and objects stored on a private storage system apart from the cloud storage system, receiving a request from the user to access an object stored on the private storage system, and providing information to the user to facilitate access to the object stored on the private storage system by said user. Other systems and methods are also disclosed. Important advantages of the present invention are facilitated by separating the logic for user access (control plane) from the actual storage (Storage plane). Private file system access can still be managed from the cloud, while keeping the client data private.

Efficient database migration using an intermediary secondary storage system

A portion of contents of a database is received from a first server. The received contents of the database is stored in a secondary storage system that tracks changes between different backup versions of contents of the database. A request to migrate the contents of the database to a second server is received. A version of contents of the database is provided to the second server using the secondary storage system. The secondary storage system is configured to determine an amount of changes to the database content from one of the versions of the database content provided to the second server and the amount of changes is utilized in determining whether to quiesce the database hosted on the first server.

Cascading payload replication

Cascading payload replication to target compute nodes is disclosed. Cascading payload replication can be accomplished using a two-stage operation for a replication operation. In the first stage, a plan is generated and distributed for the replication operation. The plan includes an assignment of compute nodes to tree nodes in a tree hierarchy. In the second phase, the payload is distributed according to the plan. The plan is different for at least two replication operations. Thus, the cascading payload replication is adaptable to changing target compute nodes and provides for load balancing.

Map-reduce ready distributed file system

A map-reduce compatible distributed file system that consists of successive component layers that each provide the basis on which the next layer is built provides transactional read-write-update semantics with file chunk replication and huge file-create rates. Containers provide the fundamental basis for data replication, relocation, and transactional updates. A container location database allows containers to be found among all file servers, as well as defining precedence among replicas of containers to organize transactional updates of container contents. Volumes facilitate control of data placement, creation of snapshots and mirrors, and retention of a variety of control and policy information. Also addressed is the use of distributed transactions in a map-reduce system; the use of local and distributed snapshots; replication, including techniques for reconciling the divergence of replicated data after a crash; and mirroring.

DELTA-BASED CONFLICT-FREE REPLICATED DATA TYPE ARRAYS USING DOT STORES THAT MAP DOTS TO DOT STORES
20220327160 · 2022-10-13 ·

An example system includes a processor to receive deltas corresponding to concurrently executed operations of a number of other replicas on an element of a delta-based conflict free replicated data type (CRDT) array of a distributed computing system. The processor is to modify, at the first replica, the element of the delta-based CRDT array based on the deltas using a dot store that maps dots to a dot store.

Selection of storage nodes for storage of data

An exemplary method includes sending a storage query to a plurality of storage nodes of a data storage system, the storage query including a data identifier of a data instance and an inquiry as to which of the storage nodes can store the data instance; receiving, in response to the storage query, a plurality of responses from a subset of storage nodes included in the plurality of storage nodes and that have at least a predetermined minimum amount of free storage space, the responses including information about each of the storage nodes included in the subset; selecting, based on the information included in the responses, multiple storage nodes included in the subset; and sending the data instance and the data identifier to the selected storage nodes for storage by the selected storage nodes.

Method and system for data placement in a linked node system

In general, embodiments of the invention relate to storing data and managing the stored data in linked nodes. More specifically, embodiments of the invention relate to nodes linked together in a daisy chain configuration such as, but not limited to, a single-chain configuration and a dual-chain configuration, which use data protection domain (DPD) information to determine where and/or how to store the data.

Self-healing infrastructure for a dual-database system

A database system could include a first database engine, a second database engine, and a replication engine. The database system could also include processors configured to perform operations. The operations could involve obtaining indicators that are respectively associated with performance issues that can occur in the database system, each indicator defining one or more conditions that, when satisfied, cause the indicator to become active. The operations could also involve obtaining mappings between: (i) at least some of the indicators, and (ii) remediation subroutines. The operations could additionally involve receiving operational data related to the first database engine, the second database engine, or the replication engine; determining, based on the operational data and the conditions defined by the indicators, that a particular indicator is active; determining, based on the mappings, that the particular indicator has an associated remediation subroutine; and executing the associated remediation subroutine.

Updating a remote tree for a client synchronization service
11657067 · 2023-05-23 · ·

The disclosed technology relates to a system configured to receive operations data from a content management system, wherein the operations data comprises a log of operations, execute the log of operations, and update, based on execution of the log of operations, a remote tree representing a server state for content items stored on the content management system.