G06F11/2033

AUTOMATED FAILOVER BACKUP RECONFIGURATION MANAGEMENT FOR STORAGE SYSTEMS
20220237086 · 2022-07-28 ·

Techniques for managing backup plans in information processing systems are disclosed. For example, a method comprises managing a data backup plan for an information processing system comprising a set of host devices that execute application instances for which data is stored on a first storage system and mirrored on a second storage system. After a given condition is detected, managing the data backup plan comprises reconfiguring the data backup plan by: (i) automatically subscribing, to the data backup plan, one or more of the application instances that execute on one or more active ones of the set of host devices configured to store data on the second storage system; and (ii) automatically unsubscribing, from the data backup plan, one or more of the application instances that previously executed on one or more non-active ones of the set of host devices configured to store data on the first storage system.

Adapter for synthetic or redundant remote terminals on single 1553 bus
11204884 · 2021-12-21 · ·

A remote terminal adapter device is disclosed. The adapter device includes control processors in communication with a bus controller via a dual redundant data bus (e.g., MIL-STD-1553) having primary and secondary data buses or channels. The adapter device includes analog relays connecting the primary and secondary buses to a main remote terminal (RT) device configured for control of an aircraft subsystem. Additional analog relays connect the data bus to one or more auxiliary or additional RTs (e.g., configured to backup the main RT or simulate the controlled subsystem and its responses. The adapter device may monitor the data bus for traffic and allow the redundant RT to access the data bus (from the same remote terminal) address as the main RT by activating and deactivating the analog relays.

Method and apparatus for redundancy in active-active cluster system
11194679 · 2021-12-07 · ·

A method is applied to a system including a host cluster and at least one pair of storage arrays. The host cluster includes a quorum host, which includes a quorum unit. The quorum host is an application host having a quorum function. A pair of storage arrays includes a first storage array and a second storage array. The quorum host receives a quorum request, temporarily stops delivering a service to the first storage array and the second storage array, determines, from the first storage array and the second storage array, which is a quorum winning storage array and which is a quorum losing storage array according to logic judgment, stops the service with the quorum losing storage array, sends quorum winning information to the quorum winning storage array, and resumes the delivered service between the host cluster and the quorum winning storage array.

Method and system for providing sustained resiliency in mainframe environment

An embodiment of the present invention is directed to enabling a mainframe system to be shutdown and restarted in an alternate region within minutes in a consistent and demonstrated manner ensuring data consistency for various components including disk, storage, coupling facility, etc. This enhances and packages together various software products from a mainframe platform in order to deliver a solution. An embodiment of the present invention is directed to an integrated automation that validates the integrity of the systems after restarting in remote regions.

DEBUGGING SHARED MEMORY ERRORS

There is provided a method for debugging errors in a shared memory. The method comprises executing instrumented machine code of a plurality of processes to generate a recorded execution of each of the plurality of processes for deterministic replay of the recorded execution. The method further comprises logging accesses to the shared memory by each of the plurality of processes in a shared memory log for debugging errors in the shared memory by analysing the recorded executions and the shared memory log. The shared memory log is accessible by each of the plurality of processes.

System and method to increase data center availability using rack-to-rack storage link cable

A first data storage rack has a first computing unit coupled to a first plurality of storage drives via a first storage controller. A second data storage rack has a second computing unit coupled to a second plurality of storage drives via a second storage controller. A first rack-to-rack storage link cable couples the first computing unit to the second storage controller such that the first computing unit can provide access to the second plurality of drives in response to a failure that prevents the second computing unit from providing access to the second plurality of drives via a system network.

Debugging shared memory errors

There is provided a method for debugging errors in a shared memory. The method comprises executing instrumented machine code of a plurality of processes to generate a recorded execution of each of the plurality of processes for deterministic replay of the recorded execution. The method further comprises logging accesses to the shared memory by each of the plurality of processes in a shared memory log for debugging errors in the shared memory by analysing the recorded executions and the shared memory log. The shared memory log is accessible by each of the plurality of processes.

Service takeover method, storage device, and service takeover apparatus
11347603 · 2022-05-31 · ·

The present disclosure describes example service takeover methods, storage devices, and service takeover apparatuses. In one example method, when a communication fault occurs between two storage devices in a storage system, the two storage devices respectively obtain running statuses of the two storage devices. A running status can reflect current usage of one or more system resources of a particular storage device. Then, a delay duration is determined according to the running statuses, where the delay duration is a duration for which the storage device waits before sending an arbitration request to a quorum server. The two storage devices respectively send, after the delay duration, arbitration requests to the quorum server to request to take over a service. The quorum server then can select a storage device in a relatively better running status to take over a host service.

Selective TCP/IP stack reconfiguration

A method, system, and program product are provided. A plurality of recovery groups is defined on each transmission control protocol/internet protocol (TCP/IP) stack in a cluster of servers. The recovery group includes a service, one or more IP addresses associated with the service, a trigger condition, and a recovery action. Each of the recovery groups is monitored for an occurrence of the trigger condition associated with the service. In response to detecting the trigger condition, a backup TCP/IP stack is notified to automatically perform the recovery action defined for a failing recovery group on an owning TCP/IP stack. Only the failing recovery group is recovered and the remaining recovery groups execute uninterrupted.

Mixed-Criticality Network for Common Unmanned System Architecture
20220165163 · 2022-05-26 ·

Apparatus and methods for controlling unmanned systems (UMSs), such as unmanned aircraft, are provided. A UMS can be provided that includes a network, auxiliary systems, and a payload, where the network can connect the auxiliary systems and the payload. A network switch of the network can logically separate the network into at least a second tier of communications and a third tier of communications. The network can be used to control the UMS by at least: controlling the auxiliary systems using messages communicated by the second tier of communications, and communicating with the payload using messages communicated by the third tier of communications.