Patent classifications
G06F11/2017
Driver switch for device error recovery for assigned devices
An error recovery system includes a memory, a processor in communication with the memory, a primary device, a backup device, a hypervisor executing on the processor, and a virtual machine. The virtual machine includes a guest operating system (OS) executing on the hypervisor, a pass-through device, and a guest driver. The hypervisor executes to detect an error associated with the primary device and to send a request to save a device state to the guest driver. The hypervisor also grants the guest OS access to the backup device. The guest driver receives the request from the hypervisor, and responsive to receiving the request, saves a state signature in the memory. The state signature includes a device signature and the device state of the primary device. Additionally, the guest driver determines a status of the device signature as one of matching and mismatching the backup device.
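The signature check described in this abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the `StateSignature` class, the function names, the string form of a device signature, and the use of plain equality to decide "matching" are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StateSignature:
    """Hypothetical record the guest driver saves when the hypervisor asks."""
    device_signature: str  # assumed to identify the primary device's type/version
    device_state: dict     # opaque snapshot of the primary device's state


def save_state_signature(primary_signature, primary_state):
    # Copy the state so later changes to the live device do not alter the snapshot.
    return StateSignature(primary_signature, dict(primary_state))


def signature_status(saved, backup_signature):
    # The abstract only names two outcomes; equality is an assumed matching rule.
    if saved.device_signature == backup_signature:
        return "matching"
    return "mismatching"


saved = save_state_signature("nic-v2", {"mtu": 1500, "link": "up"})
print(signature_status(saved, "nic-v2"))
print(signature_status(saved, "nic-v3"))
```

What the driver does after each outcome (for example, restoring the saved state to the backup device on a match) is not stated in the abstract and is left out of the sketch.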
Computer system and control method therefor
When hardware is mounted that, despite being a single device, is coupled to other portions of a system by a plurality of paths with different applications, and a failure occurs in any one of the paths, the failure may propagate to other components unless the other paths are also blocked. To solve this problem, in a storage apparatus to which a device coupled by a plurality of coupling paths with different applications can be mounted, the present invention determines in advance the block range to apply when a failure occurs, namely the device and the plurality of coupling paths coupled to the device, and manages that block range as information. Upon an occurrence of a failure, it refers to this information and executes failure handling that blocks the appropriate, predetermined block range.
System and method for redundant object storage
Systems and methods for redundant object storage are disclosed. A method may include storing at least two copies of each of a plurality of objects among a plurality of nodes communicatively coupled to one another in order to provide redundancy of each of the plurality of objects in the event of a fault of one of the plurality of nodes. The method may also include monitoring access to each object to determine a frequency of access for each object. The method may additionally include redistributing one or more of the copies of the objects such that at least one particular node of the plurality of nodes includes copies of only objects accessed at a frequency below a predetermined frequency threshold based on the determined frequency of access for each object. The method may further include placing the at least one particular node in a reduced-power mode.
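The redistribution step can be illustrated with a small placement routine. This is a sketch under stated assumptions: the function name `redistribute`, the choice of the last node as the low-power candidate, and the round-robin placement of the remaining copies are all hypothetical and are not taken from the abstract.

```python
def redistribute(freq, nodes, threshold):
    """Return (placement, cold_node); placement maps node -> set of object ids.

    Every object keeps two copies on distinct nodes. The last node ends up
    holding copies only of objects accessed below `threshold`, so it becomes
    a candidate for reduced-power mode.
    """
    if len(nodes) < 3:
        raise ValueError("this sketch needs at least three nodes")
    cold_node, active = nodes[-1], nodes[:-1]
    placement = {n: set() for n in nodes}
    for i, (obj, f) in enumerate(sorted(freq.items())):
        # First copy always goes to an active node, chosen round-robin.
        placement[active[i % len(active)]].add(obj)
        if f < threshold:
            # Rarely accessed object: second copy lives on the cold node.
            placement[cold_node].add(obj)
        else:
            # Frequently accessed object: second copy on a different active node.
            placement[active[(i + 1) % len(active)]].add(obj)
    return placement, cold_node


freq = {"a": 100, "b": 2, "c": 50, "d": 1}
placement, cold_node = redistribute(freq, ["n1", "n2", "n3"], threshold=10)
print(cold_node, sorted(placement[cold_node]))
```

Because each object still has a copy on an active node, reads of rarely accessed objects can be served without waking the low-power node in this model.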
Data storage system employing a hot spare to store and service accesses to data having lower associated wear
A controller monitors access frequencies of address ranges mapped to a data storage array. Based on the monitoring, the controller identifies frequently accessed ones of the address ranges that have lower associated wear, for example, those that are read more often than written. In response to the identifying, the controller initiates copying of a dataset associated with the identified address ranges from the data storage array to a spare storage device while refraining from copying other data from the data storage array onto the spare storage device. The controller directs read input/output operations (IOPs) targeting the identified address ranges to be serviced by access to the spare storage device. In response to a failure of a storage device among the primary storage devices of the data storage array, the controller rebuilds contents of the failed storage device on the spare storage device in place of the dataset associated with the identified address ranges.
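As a rough illustration of the selection and read-routing steps, the sketch below picks read-mostly address ranges and decides where a read is serviced. The function names, the `(reads, writes)` counters, and the two thresholds are assumptions for the example; the abstract gives no concrete criteria beyond "read more often than written".

```python
def select_ranges_for_spare(stats, min_accesses, min_read_ratio=0.9):
    """stats maps an address range (start, end) to a (reads, writes) pair.

    Returns the ranges that are accessed often and are mostly read,
    i.e. frequently accessed data with lower associated wear.
    """
    chosen = []
    for rng, (reads, writes) in stats.items():
        total = reads + writes
        if total > 0 and total >= min_accesses and reads / total >= min_read_ratio:
            chosen.append(rng)
    return sorted(chosen)


def route_read(address, spare_ranges):
    # Reads that hit a range copied to the spare are serviced by the spare.
    for start, end in spare_ranges:
        if start <= address < end:
            return "spare"
    return "array"


stats = {(0, 100): (950, 50), (100, 200): (500, 500), (200, 300): (5, 0)}
spare_ranges = select_ranges_for_spare(stats, min_accesses=100)
print(spare_ranges, route_read(42, spare_ranges), route_read(150, spare_ranges))
```

The rebuild path is omitted: on a device failure the spare would stop serving these ranges and be overwritten with the rebuilt contents of the failed device.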
Automated restart of paused virtual machines due to input/output errors
An apparatus includes a storage device of a host computing device. The storage device is to store a virtualization manager. The apparatus also includes a processing device of the host computing device that is operatively coupled to the storage device. The processing device is to determine that a first instance of a virtual machine on a first host computing device is paused based on an error associated with a connection to a storage device of the first host computing device, determine whether a second host computing device has access to the storage device of the first host computing device, instantiate a second instance of the virtual machine on the second host computing device when the second host computing device is determined to have access to the storage device of the first host computing device, and stop the first instance of the virtual machine on the first host computing device.
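The decision flow in this abstract can be sketched as a single function. Everything here is a hypothetical model: the `VmInstance` class, the `"io_error"` pause reason, and the representation of storage reachability as a set of `(host, storage)` pairs are assumptions, not part of any real virtualization manager's API.

```python
from dataclasses import dataclass


@dataclass
class VmInstance:
    name: str
    host: str
    state: str              # "running", "paused", or "stopped"
    pause_reason: str = ""  # assumed label, e.g. "io_error"


def restart_if_paused_on_io_error(vm, candidate_host, storage, access):
    """Return a new running instance on candidate_host, or None.

    `access` is a set of (host, storage) pairs that are currently reachable.
    """
    if vm.state != "paused" or vm.pause_reason != "io_error":
        return None  # nothing to recover
    if (candidate_host, storage) not in access:
        return None  # the second host cannot reach the first host's storage
    replacement = VmInstance(vm.name, candidate_host, "running")
    vm.state = "stopped"  # stop the paused first instance
    return replacement


vm = VmInstance("db01", "host-a", "paused", "io_error")
new_vm = restart_if_paused_on_io_error(vm, "host-b", "lun-7", {("host-b", "lun-7")})
print(vm.state, new_vm.host, new_vm.state)
```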
Preventing data corruption and single point of failure in fault-tolerant memory fabrics
An example device in accordance with an aspect of the present disclosure includes a redundancy controller and/or memory module to prevent data corruption and single point of failure in a fault-tolerant memory fabric. Devices include engines to issue and/or respond to primitive requests, identify failures and/or fault conditions, and receive and/or issue containment mode indications.
Chunk redundancy architecture for memory
An integrated circuit (IC) includes addressable blocks of memory, and at least one redundant block of memory. A block of memory includes two or more chunks of memory. The IC also includes redundancy control cells. Control circuitry is included to access a first chunk of a redundant block of memory in place of a first remapped chunk of one of the addressable blocks of memory, and a second chunk of a redundant block of memory in place of a second remapped chunk of one of the addressable blocks of memory, based on the redundancy control cells.
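Chunk-level remapping can be modeled in software as a lookup keyed by block and chunk. In this sketch the dictionary `control_cells` stands in for the redundancy control cells, and the function name and tuple layout are assumptions made for illustration; the real mechanism is hardware circuitry.

```python
def resolve_chunk(block, chunk, control_cells):
    """Return the physical location that serves an access to (block, chunk).

    control_cells maps a remapped (block, chunk) to a chunk index
    inside the redundant block.
    """
    if (block, chunk) in control_cells:
        return ("redundant", control_cells[(block, chunk)])
    return ("addressable", block, chunk)


# Two defective chunks from different addressable blocks share one redundant block.
control_cells = {(3, 1): 0, (7, 0): 1}
print(resolve_chunk(3, 1, control_cells))
print(resolve_chunk(7, 0, control_cells))
print(resolve_chunk(3, 0, control_cells))
```

Remapping per chunk rather than per block means one redundant block can repair defects in several addressable blocks, which is the apparent motivation for the architecture.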
Device and system including adaptive repair circuit
A device, system, and/or method includes an internal circuit configured to perform at least one function, an input-output terminal set and a repair circuit. The input-output terminal set includes a plurality of normal input-output terminals connected to an external device via a plurality of normal signal paths and at least one repair input-output terminal selectively connected to the external device via at least one repair signal path. The repair circuit repairs at least one failed signal path included in the normal signal paths based on a mode signal and a fail information signal, where the mode signal represents whether to use the repair signal path and the fail information signal represents fail information on the normal signal paths. Using the repair circuit, various systems adopting different repair schemes may be repaired and the cost of designing and manufacturing the various systems may be reduced.
Rewriting of data stored in defective storage regions into other storage regions
A storage apparatus includes a storage disk including a plurality of tracks each of which includes a plurality of sectors, a head configured to write data in and read data from the storage disk, and a controller. The controller is configured to control the head to carry out reading of a group of data units from target sectors in a target track of the storage disk, the group of data units being associated with a command received from an external device, determine whether or not the target sectors include one or more defective sectors based on a result of the reading, and control the head to write the group of data units in physically consecutive non-written sectors of the target track or another track, when the target sectors are determined to include the defective sectors.
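The relocation rule can be sketched for a single track as follows. This is a simplified model under stated assumptions: the track is a list of booleans marking written sectors, defective sectors are given as a set, and both function names are hypothetical. The abstract also allows relocating to another track, which this sketch does not cover.

```python
def find_consecutive_free(written, count):
    """Return the start index of the first run of `count` unwritten sectors, or None."""
    if count <= 0:
        raise ValueError("count must be positive")
    run = 0
    for i, is_written in enumerate(written):
        run = 0 if is_written else run + 1
        if run == count:
            return i - count + 1
    return None


def relocate_if_defective(target_sectors, defective, written):
    """Return the sectors that hold the data group after the read."""
    if not any(s in defective for s in target_sectors):
        return list(target_sectors)  # read was clean, nothing to rewrite
    start = find_consecutive_free(written, len(target_sectors))
    if start is None:
        raise RuntimeError("no run of free sectors on this track")
    new_sectors = list(range(start, start + len(target_sectors)))
    for s in new_sectors:
        written[s] = True  # the whole group is rewritten contiguously
    return new_sectors


written = [True, True, True, False, False, False, True]
print(relocate_if_defective([0, 1, 2], {1}, written))
```

Rewriting the whole group into physically consecutive sectors, rather than only the defective unit, keeps the data associated with one command readable in a single sequential pass.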
Memory system
A main memory includes unit memory regions, a redundancy memory region for replacing one or more of the unit memory regions, an address wrapper for generating an address increase/decrease control signal in first and second address wrapping modes, a column decoder for sequentially selecting memory cells in a faulty memory region where a fault has occurred, among the unit memory regions in the first address wrapping mode, and sequentially selecting redundancy memory cells in the redundancy memory region in the second address wrapping mode, based on a column address and the address increase/decrease control signal, and a data input/output circuit for outputting data read from the faulty memory region as backup data to a temporary memory in the first address wrapping mode, and outputting the backup data as restoration data to the redundancy memory region in the second address wrapping mode.