Patent classifications
G06F11/1662
System and method for speed up data rebuild in a distributed storage system with local deduplication
A method includes obtaining a plurality of representations corresponding respectively to a plurality of blocks of data stored on a source node. A plurality of data pairs are sent to a destination node, where each data pair includes a logical address associated with a block of data from the plurality of blocks of data and the corresponding representation of the block of data. A determination is made whether the blocks of data associated with the respective logical addresses are duplicates of data stored on the destination node. In accordance with an affirmative determination, a reference to a physical address of the block of data stored on the destination node is stored. In accordance with a negative determination, an indication that the data corresponding to the respective logical address is not a duplicate is stored. The data indicated as not being a duplicate is written to the destination node.
Storage system and control software deployment method
A storage system includes a plurality of storage nodes. The storage node includes: one or more storage devices which respectively provide a storage area; and one or more control software which read/write requested data from/into the corresponding storage device according to a request from a higher-level device, wherein each of the control software retains predetermined configuration information required for reading/writing requested data from/into the corresponding storage device according to a request from the higher-level device, wherein a plurality of the control software are managed as a redundancy group, and the configuration information retained in each of the control software belonging to the same redundancy group is synchronously updated, and wherein the plurality of control software configuring the redundancy group are each deployed in respectively different storage nodes so as to distribute a load of each of the storage nodes.
METHODS AND SYSTEMS FOR DATA RESYNCHRONIZATION IN A REPLICATION ENVIRONMENT
Methods, computer program products, computer systems, and the like are disclosed that provide for scalable deduplication in an efficient and effective manner. For example, such methods, computer program products, and computer systems can include determining whether a source data store and a replicated data store are unsynchronized and, in response to a determination that the source data store and the replicated data store are unsynchronized, performing a resynchronization operation. The source data stored in the source data store is replicated to replicated data in the replicated data store. The resynchronization operation resynchronizes the source data and the replicated data.
TECHNIQUES FOR LIF PLACEMENT IN SAN STORAGE CLUSTER SYNCHRONOUS DISASTER RECOVERY
Improved techniques for disaster recover within storage area networks are disclosed. Embodiments include replicating a LIF of a primary cluster on a secondary cluster. LIF configuration information is extracted from the primary cluster. A peer node from a secondary cluster is located. One or more ports are located on the located peer node that match a connectivity of the LIF from the primary cluster. One or more ports are identified based upon one or more filtering criteria to generate a candidate port list. A port from the candidate port list is selected based at least upon a load of the port. Other embodiments are described and claimed.
CLONE-AWARE APPROACH FOR SPACE AND TIME EFFICIENT REPLICATION
In an embodiment, a computer stores source files and source clone files in a source filesystem. The source clone files are shallow copies of the source files, which initially share the same data blocks as the source files. A source file or a source clone file may eventually be modified, where some of the shared blocks are replaced by modified blocks. In a first phase, all data blocks of all source files that are not clones are replicated to a standby filesystem. A second phase compares each clone file on the source filesystem with its base file to detect a set of differed blocks. The second phase copies the differed blocks into the standby filesystem and, in the standby filesystem, applies the differed blocks to a corresponding clone file. Efficiency of keeping the standby filesystem synchronized with the source filesystem is improved by transferring, from the source filesystem to the standby filesystem, only one copy of only data blocks that are modified
System and method of performing recovery using a backup image
Disclosed herein are systems and method for performing recovery using a backup image. In one exemplary aspect, a method comprises scanning a plurality of files on one or more storage devices of a computing device. The method may determine a first set of files from the plurality of files that will be used during recovery of the one or more storage devices, and tag a second set of files that will not be used during recovery. The method may copy the second set of files that have been tagged to an external storage device, and may store the first set of files in a backup image for the computing device (excluding the tagged second set of files from the backup image). The method may add, to the backup image, a respective link to each of the tagged second set of files in the external storage device.
Selection of storage nodes for storage of data
An exemplary method includes sending a storage query to a plurality of storage nodes of a data storage system, the storage query including a data identifier of a data instance and an inquiry as to which of the storage nodes can store the data instance; receiving, in response to the storage query, a plurality of responses from a subset of storage nodes included in the plurality of storage nodes and that have at least a predetermined minimum amount of free storage space, the responses including information about each of the storage nodes included in the subset; selecting, based on the information included in the responses, multiple storage nodes included in the subset; and sending the data instance and the data identifier to the selected storage nodes for storage by the selected storage nodes.
Managing replica unavailability in a distributed file system
Replica unavailability in a distributed file system can be managed. For example, a processing device can detect that a replica of data in a volume of a distributed file system is unavailable. In response to detecting that the replica is unavailable, the processing device can create a copy of the data in a memory location that is within the distributed file system and external to the volume. The processing device can then execute a write request by modifying both the data in the volume and the copy in the memory location (e.g., to ensure consistency between the two).
USE OF CLUSTER-LEVEL REDUNDANCY WITHIN A CLUSTER OF A DISTRIBUTED STORAGE MANAGEMENT SYSTEM TO ADDRESS NODE-LEVEL ERRORS
Systems and methods that make use of cluster-level redundancy within a distributed storage management system to address various node-level error scenarios are provided. According to one embodiment, a KV store of a node of a cluster of a distributed storage management system manages storage of data blocks as values and corresponding block IDs as keys. Data integrity errors are reported to the first node in the form of a list of missing block IDs that are in use but missing from the KV store. A metadata resynchronization process may then be caused to be performed, including for each block ID in the list of missing block IDs: (i) reading a data block corresponding to the block ID from another node of the cluster that maintains redundant information relating to the block ID; and (ii) restoring the block ID within the KV store by writing the data block to the node.
METHOD AND SYSTEM FOR MANAGING STORAGE SYSTEM
Embodiments of the present invention provide a method and a system for managing a storage system. In one embodiment of the present invention, there is provided a method for managing a storage system, where the storage system comprises a first controller, a second controller, a first communication area as well as a second communication area. The method comprising: with respect to a storage device in the storage system, in response to the first controller successfully accessing the storage device, writing to the first communication area a first state that indicates a state relationship between the first controller and the storage device, where the first communication area is readable and writable to the first controller and readable to the second controller; reading from the second communication area a second state that indicates a state relationship between the second controller and the storage device, where the second communication area is readable to the first controller and readable and writable to the second controller; and in response to the second state indicating that the second controller successfully accesses the storage device, initializing the storage system. In one embodiment of the present invention, there is further provided a corresponding system and apparatus.