G06F11/2012

SYSTEM AND METHOD TO DETECT AND RECOVER FROM INOPERABLE DEVICE MANAGEMENT BUS

Embodiments are described for detecting and recovering from an inoperable device management bus. A remote management controller is configured to offload device management bus transactions that use a messaging protocol. The messing protocol transactions are offloaded to a bus protocol controller that is responsible for managing bus operations using the messaging protocol. The bus protocol controller updates a set of bus status counters stored in a shared memory based on the processing of the offloaded messaging protocol transactions. The remote management controller processes device management bus transactions that use a bus protocol and updates the bus status counters based on the status of the bus protocol transactions. The remote management controller determines the status of the device management bus based on the bus status counters in shared memory, if the device management bus is inoperable, resets the remote management controller.

METHODS AND SYSTEMS TO IMPROVE INPUT/OUTPUT (I/O) RESUMPTION TIME BY BATCHING MULTIPLE NON-CONFLICTING OPERATIONS DURING A NON-DISRUPTIVE AUTOMATIC UNPLANNED FAILOVER FROM A PRIMARY COPY OF DATA AT A PRIMARY STORAGE SYSTEM TO A MIRROR COPY OF THE DATA AT A CROSS-SITE SECONDARY STORAGE SYSTEM
20240036996 · 2024-02-01 ·

Multi-site distributed storage systems and computer-implemented methods are described for improving a resumption time of input/output (I/O) operations during an automatic unplanned failover (AUFO). A computer-implemented method includes determining, with a second storage cluster, whether heartbeat information from one or more storage objects of a CG of a first set of CGs is received during a time period, determining an out of sync state for a data replication relationship between the CG of the first set of CGs and a mirrored CG of a second set of CGs when the heartbeat information is not received during the time period and sending a single bulk role change call with a cluster identifier from the second cluster to an external mediator to provide a role change from follower to leader in the second set of CGs.

ZERO DATA LOSS TRANSFER PROTOCOL
20190391885 · 2019-12-26 ·

A method for reliable data synchronization within a network is disclosed. The producer system stories data in a persistent data store and produces one or more data updates. The producer system simultaneously transmits the data updates to a consumer system and initiating storage of the data updates at the producer system. When storage of the data updates at the producer system is complete, the producer system transmits a first acknowledgment to the consumer system. The producer system determines whether a second acknowledgment has been received from the consumer system, wherein the second acknowledgment indicates that the consumer system has successfully stored the data updates at the consumer system. In accordance with a determination that the second acknowledgment has been received from the consumer system, the producer system changes the temporary status of the data updates stored at the producer system to a permanent status.

SYSTEMS AND METHODS FOR ERROR RECOVERY IN REBOOTLESS FIRMWARE UPDATES

Systems and methods include a remote access controller of an IHS (Information Handling System) that provides remote management of the IHS. The remote access controller initiates an update of firmware used to operate a hardware component of the IHS by transmitting a firmware image to the hardware component via a first signaling pathway connecting the remote access controller to the hardware component. The remote access controller detects a failure in transmission of the firmware image to the hardware component via the first signaling pathway and identifies a second signaling pathway connecting the remote access controller to the hardware component. The remote access controller resumes transmission of the firmware image to the hardware component via the second signaling pathway and the firmware used to operate the hardware component is updated using the transmitted firmware image.

Remote control system
11917415 · 2024-02-27 · ·

Remote control system is disclosed. The present techniques relate to a system and a method for improving connection between two devices, e.g. to enable one device to control the other. The method comprises receiving, at a remote computing device, a request from a secondary user device to diagnose a fault with the primary user device. The request is received on a first communication channel. The method further comprises a step of connecting, in response to the request, the remote computing device with the secondary user device. This is performed by using a secure communication channel which is different to the first communication channel. The method further comprises a step of sending a command message for the primary user device from the remote computing device to the secondary user device via the secure communication channel. Any response to the command message is then received from the secondary user device via the secure communication channel. The response is used at least in part to diagnose the fault.

Modular switched fabric for data storage systems

Systems, methods, apparatuses, and software for data storage systems are provided herein. In one example, a data storage system is provided that includes a plurality of storage drives each comprising a Peripheral Component Interconnect Express (PCIe) interface, and configured to store data and retrieve the data stored on associated storage media responsive to storage operations. The data storage system includes one or more processing modules comprising one or more processors communicatively coupled to the plurality of storage drives over a PCIe fabric comprised of one or more PCIe switches. The processors are configured to share a PCIe address space associated with the PCIe fabric for transfer of the storage operations to appropriate ones of the processors that manage ones of the plurality of storage drives.

Failover handling in modular switched fabric for data storage systems

Data storage systems are provided that include network interfaces each configured to receive storage operations issued by one or more host systems for handling by the data storage system, and transfer the storage operations over a Peripheral Component Interconnect Express (PCIe) fabric. The data storage systems also include processors configured to receive the storage operations over the PCIe fabric, process properties of the storage operations against allocation information of a shared PCIe address space to determine target processors to handle the storage operations, and selectively transfer ones of the storage operations to the target processors for handling of the storage operations with associated storage drives coupled over the PCIe fabric. The data storage systems also identifie failed ones of the processors and responsively initiate backup ones of the processors to handle portions of the shared PCIe address space handled by the failed ones of the processors.

Zero data loss transfer protocol

A method for reliable data synchronization within a network is disclosed. The producer system stories data in a persistent data store and produces one or more data updates. The producer system simultaneously transmits the data updates to a consumer system and initiating storage of the data updates at the producer system. When storage of the data updates at the producer system is complete, the producer system transmits a first acknowledgment to the consumer system. The producer system determines whether a second acknowledgment has been received from the consumer system, wherein the second acknowledgment indicates that the consumer system has successfully stored the data updates at the consumer system. In accordance with a determination that the second acknowledgment has been received from the consumer system, the producer system changes the temporary status of the data updates stored at the producer system to a permanent status.

Method and apparatus for communication between master and slave processors

An example system includes a slave processor and a master processor. The master processor is configured to communicate with the slave processor over a digital communication link in a first mode, and over an analog communication link in a second mode. A method is also disclosed.

CONTROL PLANE METHOD AND APPARATUS FOR PROVIDING ERASURE CODE PROTECTION ACROSS MULTIPLE STORAGE DEVICES
20190196909 · 2019-06-27 ·

A system and method for providing erasure code protection across multiple storage devices. A data switch in a storage system connects a plurality of storage devices to a remote host. Each storage device is also connected to a controller, e.g., a baseboard management controller. During normal operation, read and write commands from the remote host are sent to respective storage devices through the data switch. When a write command is executed, the storage device executing the command sends a copy of the data to the controller, which generates and stores erasure codes, e.g., on a storage device that is dedicated to the storage of erasure codes, and invisible to the remote host. When a device fails or is removed, the controller reconfigures the data switch to redirect all traffic addressed to the failed or absent storage device to the controller, and the controller responds to host commands in its stead.