G06F11/1084

Variable sparing of disk drives in storage array

In general, embodiments relate to a managing a Redundant Array of Independent Disks (RAID) group. The embodiments include determining a minimum and maximum set of spare disks to allocate to the RAID group, wherein the RAID group comprises a plurality of active members, allocating the minimum number of spare members to the RAID group, allocating an additional spare member to the RAID group, setting a mode of the additional spare member to storage mode, enabling, after the setting, the RAID controller to store data in the plurality of active members and in the additional spare member, wherein the plurality of active members, the minimum number of spare members, and the additional spare member comprise persistent storage.

Write hole protection method and system for raid, and storage medium

A write hole protection method and system for a RAID, and a storage medium. The method comprises: presetting a log area, and after a RAID is degraded, setting the log area to be in an enabled state; when the log area is in the enabled state, determining, before each stripe write operation, whether a data block of a failed member disk of the RAID in a stripe is a check data block; if the data block is not the check data block, determining whether data blocks to be written of the stripe comprise a data block to be written into the failed member disk; if yes, backing up the data block to be written into the failed member disk in the log area; if not, calculating the data block of the failed member disk and backing up the data block in the log area, or backing up the data blocks to be written in the log area; and when the degraded RAID is started after a failure, performing data recovery using the log area. By using the present solution, the write hole issue of the RAID is avoided.

SYSTEMS AND METHODS FOR RESETTING A DEGRADED STORAGE RESOURCE
20230152979 · 2023-05-18 · ·

A method may include, in an operating system of an information handling system: responsive to a determination that a storage resource of the information handling system is experiencing a predictor of a failure of the storage resource, issuing a command to the storage resource to reload firmware code of the storage resource; responsive to the storage resource reloading the firmware code and reset of the storage resource following reloading of the firmware code, determining whether the predictor persists; and responsive to determining whether the predictor persists, performing a responsive action.

Reconfiguring a storage system based on resource availability
11687259 · 2023-06-27 · ·

Reconfiguring a storage system based on resource availability, including: limiting a number of storage devices in a storage system that may be simultaneously servicing write operations; determining that an amount of required write bandwidth has changed; and subsequent to determining that the amount of required write bandwidth has changed, adjusting, by a computer processor, the number of storage devices in the storage system that may be simultaneously servicing write operations.

FLUSHING HOST CACHE DATA BEFORE REBUILDING DEGRADED REDUNDANT VIRTUAL DISK

A storage management method includes receiving a degrade signal indicating a degraded state of a virtual disk associated with a host system. Rebuild-flush operations may be performed. The operations may include writing, to the virtual disk and also to a hot spare drive (HSP) associated with the virtual disk, valid-modified data, stored in a host storage cache, associated with the virtual disk. In contrast, valid-unmodified storage cache data associated with the virtual disk, may be written to the HSP only. After the rebuild-flush completes, the virtual disk may be rebuilt. During rebuild, however, any cached-LBA may be skipped where traditional RAID rebuild operations are performed for un-cached LBAs only.

METHOD FOR CONTROLLING OPERATIONS OF RAID SYSTEM COMPRISING HOST DEVICE AND PLURALITY OF SSDS

Embodiments herein provide a method for controlling operations of a Redundant Array of Independent Disks (RAID) data storage system comprising a host device and a plurality of solid-state drives (SSDs). The method includes performing, by the at least one SSD, recovery of lost data by performing the auto-rebuild operation. The method also includes performing by the at least one SSD, the auto-error correction operation based on the IO error. The method also includes creating a snapshot of an address mapping table by all SSDs of the plurality of SSDs in the RAID data storage system. The auto-rebuild operation, the auto-error correction operation and the creation the snapshot of the address mapping table are all performed without the intervention from the host device.

Extracting selective information from on-die dynamic random access memory (DRAM) error correction code (ECC)

Error correction in a memory subsystem includes a memory device generating internal check bits after performing internal error detection and correction, and providing the internal check bits to the memory controller. The memory device performs internal error detection to detect errors in read data in response to a read request from the memory controller. The memory device selectively performs internal error correction if an error is detected in the read data. The memory device generates check bits indicating an error vector for the read data after performing internal error detection and correction, and provides the check bits with the read data to the memory controller in response to the read request. The memory controller can apply the check bits for error correction external to the memory device.

METHOD FOR WRITING DATA INTO STORAGE SYSTEM AND STORAGE SYSTEM
20170308332 · 2017-10-26 · ·

In a storage system, a control device determines that at least one first storage device of X storage devices is out of service. Then, the control device selects a second storage device from the X storage device. A quantity of stripe members of a data stripe stored in the second storage device is less than a ratio of T divided by X. The control device further writes target data into the second storage device. The target data is a data unit or a check unit used to update at least one stripe member of the data stripe stored in the first storage device.

PRIORITIZED DATA REBUILDING IN A DISPERSED STORAGE NETWORK
20170300374 · 2017-10-19 ·

A method begins with a processing module querying distributed storage network (DSN) storage units regarding storage errors associated with a data segment. The method continues with the processing module receiving query responses and depending on the responses, assigning a first threshold priority or a second threshold priority to encoded data slices (EDSs) associated with the data segment. The method proceeds with the processing module, depending on the assigned threshold priority, issuing read slice requests and rebuilding EDS associated with the data segment.

Prioritized data rebuilding in a dispersed storage network based on consistency requirements

A method begins with a processing module transmitting list slice requests to distributed storage network (DSN) storage units regarding storage errors associated with a data segment. The method continues with the processing module receiving list slice response messages and depending on the list slice response messages, determining whether a first threshold priority or a second threshold priority number of error-free EDSs associated with the first data segment has been stored. The method proceeds with the processing module, depending on the number of error-free EDSs associated with the first data segment that have been stored, issuing read slice requests and rebuilding one or more EDSs associated with the data segment.