G06F11/2038

Methods and systems for a non-disruptive planned failover from a primary copy of data at a primary storage system to a mirror copy of the data at a cross-site secondary storage system without using an external mediator
11704207 · 2023-07-18 · ·

Systems and methods are described for a non-disruptive planned failover from a primary copy of data at a primary storage cluster to a mirror copy of the data at a cross-site secondary storage cluster without using an external mediator. According to an example, a planned failover feature of a multi-site distributed storage system provides an order of operations such that a primary copy of a first data center continues to serve I/O operations until a mirror copy of a second data center is ready. This planned failover feature improves functionality and efficiency of the distributed storage system by providing non-disruptiveness during planned failover without using an external mediator based on a primary storage cluster being selected as an authority to implement a state machine with a persistent configuration database to track a planned failover state for the planned failover.

Defect repair for a reconfigurable data processor for homogeneous subarrays

A device architecture includes a spatially reconfigurable array of processors, such as configurable units of a CGRA, having spare homogenous subarrays, and a parameter store on the device which stores parameters that tag one or more elements as unusable. Configuration data is distributed using a statically reconfigurable bus system, to implement the pattern of placement of configuration data, in dependence on the tagged elements. As a result, a spatially reconfigurable array having unusable elements can be repaired.

Redundant automation system, method for creating the automation system, computer program and computer readable medium
11550270 · 2023-01-10 · ·

A method for creating a redundant automation system, a computer program and a computer-readable medium, wherein the redundant automation system includes at least one automation installation to be controlled that is installed at an installation location and two control applications that are communicatively interconnected via a synchronization path, and includes a plurality of communication hubs and communication paths connecting these to one another, where one of the control applications operates as the master and the other control application operates as a reserve, such that when the control application operating as the master fails, the control application operating as the reserve function as the master, and where the locations of the computing resources for the control applications are selected such that the control applications are connected to the at least one automation installation via two different communication paths preferably having no or a minimal number of common communication hubs.

Method and system for disaster recovery of a regional cloud based desktop fabric

A system and method for ensuring the availability of virtual desktops in a cloud based system. The system includes a primary regional datacenter having a primary desktop pool accessible by a desktop client providing access to a desktop to a desktop user. A secondary regional datacenter includes a secondary desktop pool. A control plane orchestrates communication between the desktop client and the regional datacenters. The control plane creates a copy of the desktop from the primary regional datacenter. The control plane performs an activation procedure when a disaster event occurs. The activation procedure includes creating the desktop in the secondary desktop pool from the copy. The activation procedure also directs the desktop client to the secondary desktop pool to access the desktop from the secondary regional datacenter. A deactivation procedure directs the desktop client in the secondary desktop pool to reestablish availability to desktops in the primary desktop pool.

TRANSFERRING TASK DATA BETWEEN EDGE DEVICES IN EDGE COMPUTING
20220413974 · 2022-12-29 ·

Edge device task management by receiving an indicator corresponding to a first container running a task on a first edge device of a cluster of edge devices, wherein the indicator indicates an error status of the first container, and wherein task data of the task is stored in a first local storage of the first edge device, selecting a second edge device from the cluster of edge devices, wherein a second container on the second edge device is to run the task, instructing the first and second edge devices to transfer the task data from the first local storage of the first edge device to a second local storage of the second edge device, and in response to receiving a notification that indicates the task data has been transferred from the first local storage to the second local storage, sending the task to the second container.

Vehicle system for autonomous control in response to abnormality
11535272 · 2022-12-27 · ·

A vehicle system includes a first vehicle platform including a first computer configured to operate by means of electric power from a first electric power source and perform traveling control of a vehicle, a second vehicle platform including a second computer configured to operate by means of electric power from a second electric power source different from the first electric power source and perform traveling control of the vehicle, and an autonomous driving platform including a third computer configured to perform autonomous driving control of the vehicle by transmitting a control instruction including data for autonomously driving the vehicle to the first computer when the first vehicle platform is in a normal state and perform autonomous stoppage control of the vehicle by transmitting a control instruction including data for causing the vehicle to autonomously stop to the second computer when the first vehicle platform is in an abnormal state.

HIGH FREQUENCY SNAPSHOT TECHNIQUE FOR IMPROVING DATA REPLICATION IN DISASTER RECOVERY ENVIRONMENT

A high frequency snapshot technique improves data replication in a disaster recovery (DR) environment. A base snapshot is generated from failover data at a primary site and replicated to a placeholder file at a secondary site. Upon commencement of the base snapshot generation and replication, incremental light weight snapshots (LWSs) of the failover data are captured and replicated to the secondary site. A staging file at the secondary site accumulates the replicated LWSs (“high-frequency snapshots”). The staging file is populated with the LWSs in parallel with the replication of the base snapshot at the placeholder file. At a subsequent predetermined time interval, the accumulated LWSs are synthesized to capture a “checkpoint” snapshot by applying and pruning the accumulated LWSs at the staging file. Once the base snapshot is fully replicated, the pruned LWSs are merged to the base snapshot to synchronize the replicated failover data.

Time-based virtual machine reversion

Recovery points can be used for replicating a virtual machine and reverting the virtual machine to a different state. A filter driver can monitor and capture input/output commands between a virtual machine and a virtual machine disk. The captured input/output commands can be used to create a recovery point. The recovery point can be associated with a bitmap that may be used to identify data blocks that have been modified between two versions of the virtual machine. Using this bitmap, a virtual machine may be reverted or restored to a different state by replacing modified data blocks and without replacing the entire virtual machine disk.

ENSURING HIGH AVAILABLITY OF REPLICATED DATABASE MANAGEMENT SYSTEMS DURING UPGRADES

An online system, such as a multi-tenant system ensures high availability of systems, for example, database management systems. The online system replicates the databases across multiple datacenters including: (1) a master node that receives read and write requests (2) a read-replica that receives only read requests and (3) a spare node that does not receive requests but acts as standby for high availability. One or more application servers may send read and write requests to the databases. The system performs a sweep of upgrades of the database nodes and also performs traffic quiescing of the requests received from the application servers to redirect the traffic across the database nodes as the upgrade sweep is orchestrated. The sweep of upgrades ensures that the availability of the database management system to the end users is maximized during the upgrade process.

Redundancy system and method

A method, computer program product, and computing system for operating an autonomous vehicle; monitoring the operation of a plurality of computing devices within the autonomous vehicle; and in response to detecting the failure of one or more of the plurality of computing devices, switching the autonomous vehicle from a nominal autonomous operational mode to a degraded autonomous operational mode.