Patent classifications
G06F3/0608
Managed NAND data compression
Apparatus and methods are disclosed, including providing available data operations for the storage system processor to a host processor, identifying data operations to be performed by the storage system processor, and assigning identified data operations to the storage system processor to reduce bus traffic between the host processor and the storage system processor, to improve host processor performance, and to reduce energy use by the host processor.
Data Storage Arrangement and Method for Anonymization Aware Deduplication
A data storage arrangement includes a memory and a controller, where the controller receives an indication of data to be anonymized. The controller further parses a data element to be stored and generates a copy of one or more data portions to be anonymized. The controller further deletes the one or more data portions to be anonymized to generate a modified data element to be stored. The controller further generates a copy of the modified data element to be stored utilizing deduplication. The data storage arrangement thus takes data anonymization into account during deduplication (i.e., anonymization-aware deduplication).
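The flow in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented controller: the record layout, the `split_for_anonymization` helper, the `DedupStore` class, and the use of SHA-256 as the deduplication fingerprint are all assumptions made for the example.

```python
import hashlib

def split_for_anonymization(record: dict, sensitive_fields: set):
    """Return (modified_record, removed_portions) for one data element."""
    # Copy of the portions to be anonymized, kept apart from the dedup store.
    removed = {k: v for k, v in record.items() if k in sensitive_fields}
    # Modified data element with those portions deleted.
    modified = {k: v for k, v in record.items() if k not in sensitive_fields}
    return modified, removed

class DedupStore:
    """Hypothetical content-addressed store: identical elements are kept once."""
    def __init__(self):
        self.chunks = {}  # digest -> serialized element
        self.refs = []    # one digest per stored element

    def put(self, element: dict) -> str:
        payload = repr(sorted(element.items())).encode()
        digest = hashlib.sha256(payload).hexdigest()
        self.chunks.setdefault(digest, payload)  # store only if unseen
        self.refs.append(digest)
        return digest

store = DedupStore()
side_copies = []
records = [
    {"name": "Alice", "diagnosis": "flu", "ward": "B"},
    {"name": "Bob", "diagnosis": "flu", "ward": "B"},
]
for rec in records:
    modified, removed = split_for_anonymization(rec, {"name"})
    side_copies.append(removed)
    store.put(modified)

print(len(store.chunks), len(store.refs))
```

Once the name field is stripped, both records serialize identically, so the store keeps a single chunk with two references. Deduplicating the unmodified records would have found no match.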
DYNAMIC STORAGE IN KEY VALUE SOLID STATE DRIVE
A method for dynamically storing keys and values includes receiving a request for storing one or more keys in a key-value solid state drive (KV-SSD). The method further includes performing a storage operation for storing each key of the one or more keys in a node of a data structure of the KV-SSD. The storage operation includes allocating a first region in the node for storing the key, such that a size of the first region is equal to a size of the key. The storage operation further includes allocating a second region in the node for storing key metadata associated with the key, such that the second region is of a predetermined size. The storage operation further includes storing the key in the first region and the key metadata in the second region of the node.
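A rough sketch of the per-key allocation described above, under stated assumptions: the `Node` class, the 8-byte `META_SIZE`, and the metadata layout (value length plus value address) are hypothetical and not taken from the abstract.

```python
import struct

META_SIZE = 8  # assumed predetermined size of the metadata region, in bytes

class Node:
    """Hypothetical node: keys get exactly-sized regions, metadata a fixed one."""
    def __init__(self, capacity: int):
        self.buf = bytearray(capacity)
        self.used = 0
        self.entries = []  # (key_offset, key_length, metadata_offset)

    def store_key(self, key: bytes, value_len: int, value_addr: int) -> bool:
        needed = len(key) + META_SIZE
        if self.used + needed > len(self.buf):
            return False  # node is full; a real design would split or chain
        key_off = self.used
        meta_off = key_off + len(key)
        # First region: sized exactly to the key.
        self.buf[key_off:meta_off] = key
        # Second region: fixed size (assumed layout: two little-endian uint32).
        self.buf[meta_off:meta_off + META_SIZE] = struct.pack(
            "<II", value_len, value_addr
        )
        self.used += needed
        self.entries.append((key_off, len(key), meta_off))
        return True

node = Node(64)
node.store_key(b"id", 100, 4096)
node.store_key(b"longer-key", 5, 8192)
print(node.used, node.entries)
```

Because the key region matches the key length, a short key does not waste the space that a fixed-width key slot would reserve. The fixed-size metadata region keeps the offset arithmetic simple.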
DATA REPLICATION SYSTEM AND DATA REPLICATION METHOD
A first storage system compresses data relating to read and write by a primary site and stores the data in a first physical volume. A second storage system compresses data relating to read and write by a secondary site and stores the data in a second physical volume. When performing replication for transferring the data stored in the first physical volume of the first storage system to the second storage system and storing the data in the second physical volume, the first storage system and the second storage system determine, based on a compression scheme executable by the first storage system and a compression scheme executable by the second storage system, a compression scheme to be applied to transfer target data and transfer the transfer target data compressed by the determined compression scheme.
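One way to picture the scheme selection is as a capability negotiation between the two sites. The sketch below is an assumption-laden illustration: the scheme names, the `PREFERENCE` order, and the `choose_scheme` and `replicate` helpers are invented for the example, with Python's standard `zlib` and `lzma` modules standing in for whatever codecs the storage systems actually support.

```python
import lzma
import zlib

# Hypothetical codec registry: name -> (compress, decompress).
CODECS = {
    "lzma": (lzma.compress, lzma.decompress),
    "zlib": (zlib.compress, zlib.decompress),
    "none": (lambda b: b, lambda b: b),
}
# Assumed preference order, strongest compression first.
PREFERENCE = ["lzma", "zlib", "none"]

def choose_scheme(primary_schemes, secondary_schemes) -> str:
    """Pick the most preferred scheme that both systems can execute."""
    common = set(primary_schemes) & set(secondary_schemes)
    for name in PREFERENCE:
        if name in common:
            return name
    raise ValueError("no compression scheme supported by both systems")

def replicate(data: bytes, primary_schemes, secondary_schemes):
    """Compress on the primary side, transfer, and restore on the secondary."""
    scheme = choose_scheme(primary_schemes, secondary_schemes)
    compress, decompress = CODECS[scheme]
    wire = compress(data)        # what would travel between the sites
    restored = decompress(wire)  # what the secondary side recovers
    return scheme, len(wire), restored

payload = b"abc" * 1000
scheme, wire_len, restored = replicate(
    payload, ["lzma", "zlib", "none"], ["zlib", "none"]
)
print(scheme, wire_len < len(payload))
```

Here the secondary system lacks `lzma`, so the transfer falls back to `zlib`, the best scheme both sides share.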
Capturing data in data transfer appliance for transfer to a cloud-computing platform
In one aspect, a computer-implemented method useful for migrating hundreds of terabytes to petabytes of data to a cloud-computing environment with a data transfer appliance includes the step of providing a data transfer appliance. The data transfer appliance includes an operating system, one or more central processing units (CPUs), a memory, and a data storage system. The computer-implemented method includes the step of implementing data capture from a data storage system to the data transfer appliance. The computer-implemented method includes the step of storing the deduplicated form of the data in the data transfer appliance by providing a capture utility, wherein the capture utility comprises a data traversal engine and a data read engine.
Optimizing garbage collection that uses a logical-to-physical table search
A method is described that includes determining, by a memory subsystem, that a garbage collection process is to be performed on a memory device and selecting a first candidate block stripe for folding into a first target block stripe in response to determining that the garbage collection process is to be performed. The method further includes determining, by the memory subsystem, that a physical-to-logical table stored in the first candidate block stripe is unavailable; reducing a write command rate, which controls a rate at which writes are fulfilled by the memory subsystem, in response to determining that the physical-to-logical table stored in the first candidate block stripe is unavailable; and performing folding of the first candidate block stripe into the first target block stripe using a logical-to-physical table.
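The fallback path in this abstract can be sketched as below. This is a simplified model under stated assumptions: the dictionary-based tables, the `fold_block_stripe` function, and the rate values are hypothetical, and the actual data movement into the target block stripe is omitted.

```python
def fold_block_stripe(candidate, l2p, p2l_available, write_rate, reduced_rate):
    """Return (valid physical addresses to fold, write command rate to use).

    candidate: {"addresses": set of physical addresses in the stripe,
                "p2l": physical -> logical map, or None if unavailable}
    l2p: logical -> physical map for the whole device
    """
    if p2l_available:
        # Fast path: a page is still valid if the L2P table points back at it.
        valid = [pa for pa, la in candidate["p2l"].items() if l2p.get(la) == pa]
        return sorted(valid), write_rate
    # Slow path: no P2L table, so search the whole L2P table for entries that
    # point into the stripe, and throttle host writes while the search runs.
    valid = [pa for pa in l2p.values() if pa in candidate["addresses"]]
    return sorted(valid), min(write_rate, reduced_rate)

l2p = {0: 100, 1: 205, 2: 101}
with_p2l = {"addresses": {100, 101, 102}, "p2l": {100: 0, 101: 2, 102: 1}}
without_p2l = {"addresses": {100, 101, 102}, "p2l": None}

fast = fold_block_stripe(with_p2l, l2p, True, 1000, 250)
slow = fold_block_stripe(without_p2l, l2p, False, 1000, 250)
print(fast, slow)
```

Both paths find the same valid pages. The slow path scans every L2P entry, which is presumably why the write command rate is reduced while it runs.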
EFFICIENT INLINE BLOCK-LEVEL DEDUPLICATION USING A BLOOM FILTER AND A SMALL IN-MEMORY DEDUPLICATION HASH TABLE
A method for inline block-level deduplication is provided. The method generally includes receiving an input/output (I/O) to write a first data block in storage as associated with a logical block address (LBA), hashing the first data block to a first hash, determining a match for the first hash is contained in a bloom filter based on set bits in the bloom filter for the first hash, determining an entry for the first data block is contained in a deduplication hash table based on a subset of bits of the first hash, locating a first middle map extent in a middle map based on a middle block address (MBA) included in the entry, verifying the first hash matches the first hash stored in the middle map extent, and adding a logical map extent for the LBA to a logical map, wherein the logical map extent maps the LBA to the MBA.
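The write path described above can be sketched as follows. This is a minimal illustration under stated assumptions: the `BloomFilter` and `DedupWriter` classes, the filter size, the number of hash positions, and the 4-byte hash prefix used as the table key are invented for the example, and SHA-256 stands in for whatever hash the method uses.

```python
import hashlib

class BloomFilter:
    """Small Bloom filter that derives bit positions from slices of a digest."""
    def __init__(self, num_bits: int = 1024, num_hashes: int = 3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = 0

    def _positions(self, digest: bytes):
        for i in range(self.num_hashes):
            chunk = digest[4 * i:4 * i + 4]
            yield int.from_bytes(chunk, "big") % self.num_bits

    def add(self, digest: bytes) -> None:
        for p in self._positions(digest):
            self.bits |= 1 << p

    def might_contain(self, digest: bytes) -> bool:
        return all((self.bits >> p) & 1 for p in self._positions(digest))

class DedupWriter:
    def __init__(self):
        self.bloom = BloomFilter()
        self.dedup_table = {}  # subset of hash bits -> MBA
        self.middle_map = {}   # MBA -> (full hash, block data)
        self.logical_map = {}  # LBA -> MBA
        self.next_mba = 0

    def write(self, lba: int, block: bytes) -> bool:
        """Write a block; return True if it was deduplicated."""
        digest = hashlib.sha256(block).digest()
        key = digest[:4]  # assumed subset of the hash bits
        if self.bloom.might_contain(digest) and key in self.dedup_table:
            mba = self.dedup_table[key]
            stored_hash, _ = self.middle_map[mba]
            if stored_hash == digest:  # verify against the full stored hash
                self.logical_map[lba] = mba
                return True
        # No verified match: store the block under a new MBA.
        mba = self.next_mba
        self.next_mba += 1
        self.middle_map[mba] = (digest, block)
        self.bloom.add(digest)
        self.dedup_table[key] = mba
        self.logical_map[lba] = mba
        return False

w = DedupWriter()
first = w.write(10, b"A" * 4096)
second = w.write(20, b"A" * 4096)
third = w.write(30, b"B" * 4096)
print(first, second, third, w.logical_map)
```

The Bloom filter screens out most blocks that have never been seen before the hash table is consulted. Keying the table on only part of the hash keeps it small, which is why the full hash stored with the middle map extent is checked before the LBA is mapped to an existing MBA.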
Adaptive compression of stored data
Systems, devices and methods for adaptive compression of stored information include a memory management computing device programmed to monitor a size of a plurality of data structures stored in a data repository. The computing device compares the size of each of a plurality of data structures to a predetermined threshold. When a size of an uncompressed data structure meets the threshold, the memory management computing device calculates a value of a first compression parameter based on a value of a first parameter and a value of a second parameter of each data element of the uncompressed data structure, calculates a value of a second compression parameter based on the value of the first parameter of each data element of the uncompressed data structure, generates a compressed data structure based on the value of the first compression parameter and the second compression parameter, and replaces, in the data repository, the uncompressed data structure with the compressed data structure.
Selective compression circuitry in a memory device
Methods and systems are described for selectively compressing data lines of a memory device in selective compression circuitry. The selective compression circuitry includes multiple data lines and compression circuitry that selectively compresses inputs. The selective compression circuitry also includes control circuitry to receive data via the data lines. The control circuitry, when in a compressed mode, transmits data from each of the data lines to the compression circuitry. Alternatively, in an uncompressed mode, the control circuitry transmits data from a first subset of the data lines to the compression circuitry while blocking data from a second subset of the data lines from being transmitted to the compression circuitry.
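The two modes can be modeled in software as a gate in front of a compressor. The sketch below is only a behavioral illustration: the abstract does not say how the compression circuitry combines its inputs, so the XOR reduction in `compress` is an assumption, as are the function names and the list-of-bits representation of the data lines.

```python
from functools import reduce
from operator import xor

def compress(bits):
    """Hypothetical compressor: XOR-reduce the forwarded bits to one bit."""
    return reduce(xor, bits, 0)

def control(data_lines, compressed_mode: bool, first_subset):
    """Forward data lines to the compressor according to the current mode."""
    if compressed_mode:
        # Compressed mode: every data line reaches the compression circuitry.
        forwarded = list(data_lines)
    else:
        # Uncompressed mode: only the first subset is forwarded; all other
        # lines (the second subset) are blocked.
        forwarded = [data_lines[i] for i in first_subset]
    return compress(forwarded)

lines = [1, 1, 0, 1]
all_lines_result = control(lines, True, [0, 1])
subset_result = control(lines, False, [0, 1])
print(all_lines_result, subset_result)
```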
Performing secondary copy operations based on deduplication performance
An improved information management system is described herein in which the information management system can evaluate the deduplication performance of secondary copy operations and dynamically adjust the manner in which secondary copy data is created to minimize the negative effects of performing deduplication. Furthermore, the improved information management system can improve deduplication performance by applying different storage policies to different types of applications running on a client computing device. Moreover, the improved information management system can automatically detect the region of a client computing device and apply an appropriate information management policy to the client computing device to avoid inconsistencies or other errors resulting from administrator control.