Patent classifications
G06F11/2043
SYSTEM AND METHOD FOR REDUCING FAILOVER TIMES IN A REDUNDANT MANAGEMENT MODULE CONFIGURATION
While the management module of an information handling system is set as a standby module, an enclosure controller provides first requests for attribute data of the information handling system, and receives and stores first response data for attribute data associated with a first subset of the first requests in a local memory of the enclosure controller. The enclosure controller receives request failure responses associated with a second subset of the first requests directed to a subset of the attributes data for the information handling system stored in a shared memory. While the management module is set as an active module, the management module is granted access to the shared memory. The enclosure controller provides retry requests for attributes associated with the request failure responses, and receives and stores second response data associated with the retry requests in the local memory.
MAPPING OF DATA STORAGE SYSTEM FOR A REDUNDANT ARRAY OF INDEPENDENT NODES
Described herein, system that facilitates mapping of redundant array of independent nodes of a storage device. According to an embodiment, a system can comprise generating a first configuration of a storage cluster, wherein the storage cluster comprises a group of nodes and a group of disks, generating a second configuration of the storage cluster using the first configuration, wherein the group of nodes are divided into a first pair of nodes comprising a first node having access to a first group of disks and a second node having access to a second group of disks, and generating a third configuration of the storage cluster using the second configuration, wherein the first node comprises a first mapped node that manages the first group of disks of the first node and enables access to the second group of disks of the second node.
Virtual machine recovery in shared memory architecture
Examples provide for virtual machine recovery using pooled memory. A shared partition is created on pooled memory accessible by a plurality of virtual machine hosts. A set of memory pages for virtual machines running on the hosts is moved to the shared partition. A master agent polls memory page tables associated with the plurality of hosts for write access. If the master agent obtains write access to a memory page table of a given host, the given host that previously held the write access is identified as a failed host or an isolated host. The virtual machines of the given host enabled to resume from pooled memory are respawned on a new host while maintaining memory state of the virtual machines using data within the pooled memory, including the virtual machine memory pages, memory page table, host profile data, and/or host-to-VM table data.
Control system and control unit
A control system includes one control unit connected to input/output units connected to an apparatus and another control unit capable of substituting for the one control unit. In the control system, the one control unit establishes connection to the input/output units. The control system includes a determination layer and a secure communication layer. The determination layer determines to establish connection between the other control unit and the input/output units when the connection between the one control unit and the input/output units is broken. The secure communication layer establishes connection between the other control unit and the input/output units in accordance with a result of the determination by the determination layer.
Core pairing in multicore systems
A method, executed by a computer, includes pairing a first core with a second core to form a first core group, wherein each core of the group has a plurality of functional units, transferring instructions received by the first core to the second core for execution via a first inter-core communication bus, and executing the instructions on the second core. A computer system and computer program product corresponding to the above method are also disclosed herein.
Transaction management for multi-node clusters
Examples include transaction management for a multi-node cluster. Some examples include generating a request log entry at an owner node that identifies a duplicate data location, tagging the duplicate data location with a requested tag, and issuing a zero-copy request that identifies the duplicate data location. In some such examples, a service node may generate a service log entry based on the zero-copy request that identifies the duplicate data location, process the zero-copy request, and tag the duplicate data location with a serviced tag. A determination may be made as to whether at least one of the owner node and the service node have failed and based on the determination that at least one has failed, the failed node may be recovered via at least one of the requested tag and the serviced tag of the duplicate data location.
System and method for data recovery in a distributed data computing environment implementing active persistence
A system and method for automatic recovery of actively persisted data in a distributed data grid. A dynamic active recovery quorum function determines the number of cluster storage members that must be available before recovery starts. The use of the dynamic active recovery quorum allows time for a cluster to start and ensures that partitions can be recovered gracefully without overloading too few storage members. The dynamic active recovery quorum function automatically configures the recovery quorum based on a predefined algorithm and data collected about cluster membership during cluster operation prior to the catastrophic failure or shut down for planned maintenance.
NODE DEVICE, RECOVERY OPERATION CONTROL METHOD, AND NON-TRANSITORY COMPUTER READABLE MEDIUM STORING RECOVERY OPERATION CONTROL PROGRAM
When a node device (10-1) has detected a system failure in a cluster system (1), it determines whether the node device (10-1) is an avoidance-override device. Then, when the node device (10-1) determines that the own node device is an avoidance priority device, the node device (10-1) transmits a request signal to a node device (10-2) other than the node device (10-1). The request signal is a signal for requesting a report about a normal state and an abnormal state of the node device (10-2). Then, the node device (10-1) determines whether to execute a recovery operation of the own node device or to avoid executing a recovery operation of the own node device based on the report from the node device (10-2)
SYSTEMS AND METHODS FOR IN-FIELD CORE FAILOVER
A multicore processor may include multiple processing cores that were previously designated as active cores and at least one processing core that was previously designated as a functional spare. The processor may include an interface to receive, during operation of the processor in an end-user environment, a request to change the designation of at least one of the processing cores. The processor may be to store, into a desired cores configuration data structure in response to the request, data representing a bitmask that reflects the requested change, and to execute a reset sequence. During the reset sequence, the processor may activate, dependent on the bitmask, a processing core previously designated as a functional spare, or may deactivate, dependent on the bitmask, a processing core previously designated as an active core. The processor may include a predetermined maximum number of active cores and a predetermined minimum number of functional spares.
Electronic circuit board
According to an embodiment, an electronic circuit board includes a nonvolatile memory, a reading circuit to read data stored in the nonvolatile memory, a switch, and a communication circuit. When power is supplied from a first power source, the switch performs switching to a first state in which the nonvolatile memory and a host device configured to read and write data from and in the nonvolatile memory are connected. When power is supplied from a second power source, the switch performs switching to a second state in which the host device and the nonvolatile memory are not connected and the reading circuit and the nonvolatile memory are connected. The communication circuit transmits, to an external device, the data read by the reading circuit from the nonvolatile memory when power is being supplied from the second power source.