G06F2201/85

RE-ESTABLISH CONNECTION WITH DOCKING STATIONS
20230129307 · 2023-04-27 ·

In some examples, the disclosure describes a device that includes a docking station and a processor. The processor may determine that an error condition involving disconnection of a computing device from the docking station couplable to the computing device has occurred and receive, responsive to the determination, a signal indicative of performance of an operation to re-establish communication with the docking station. The processor may further perform, responsive to receipt of the signal, the operation to re-establish communication with the docking station.

System recovery using a failover processor

Techniques for system recovery using a failover processor are disclosed. A first processor, with a first instruction set, is configured to execute operations of a first type; and a second processor, with a second instruction set different from the first instruction set, is configured to execute operations of a second type. A determination is made that the second processor has failed to execute at least one operation of the second type within a particular period of time. Responsive to determining that the second processor has failed to execute at least one operation of the second type within the particular period of time, the first processor is configured to execute both the operations of the first type and the operations of the second type.

Self-descriptive orchestratable modules in software-defined industrial systems

Various systems and methods are provided for implementing a software defined industrial system. In an example, self-descriptive control applications and software modules are provided in the context of orchestratable distributed systems. The self-descriptive control applications may be executed by an orchestrator or like control device, configured to: identify available software modules adapted to perform functional operations in a control system environment; identify operational characteristics that identify characteristics of execution of the available software modules that are available to implement a control system application; select a software module for execution based on the operational configuration and the operational characteristics identified in the manifest; and cause the execution of the selected software module in the control system environment based on an application specification for the control system application.

Distributed application orchestration management in a heterogeneous distributed computing environment

Distributed application orchestration management is provided. A first passive member of a set of passive members sends a notification message to other members indicating that the first passive member is initiating start of a distributed application in response to the first passive member validating that a self-restart by a leader member failed. The first passive member compares timestamps associated with an attempt to start the distributed application by other passive members in the set of passive members. The first passive member stops a particular attempt to start the distributed application in response to the first passive member determining that a timestamp associated with the particular attempt to start the distributed application by the first passive member is newer than another timestamp of another passive member. The first passive member designates the other passive member having an older timestamp as a new leader member to continue starting the distributed application.

Event-driven system failover and failback
11636013 · 2023-04-25 · ·

A system determines that a primary event processor, included in a primary data center, is associated with a failure. The primary event processor is included in the primary data center and configured to process first events stored in a main event store of the primary data center. The system identifies a secondary event processor, in a secondary data center, that is to process one or more first events based on the failure. The primary event processor and the secondary event processor are configured to process a same type of event. The system causes, based on a configuration associated with the primary or secondary event processor, the one or more first events to be retrieved from one of the main event store or a replica event store. The replica event store is included in the secondary data center and mirrors the main event store of the primary data center.

FAULT TOLERANT SYSTEM, SERVER, AND OPERATION METHOD OF FAULT TOLERANT SYSTEM
20220327034 · 2022-10-13 · ·

A first server and a second server use a virtual address to mount the storage synchronous area in a storage by the NFS. The first server obtains a snapshot of memory content of a virtual system operated as an active system and transmits the snapshot to the second server. The first server replicates content of the storage synchronous area in the storage to a storage synchronous area in a storage. When a failure occurs in the first server, the second server sets a virtual address to the storage and uses the virtual address to mount the storage synchronous area in the storage by NFS. The second server uses the snapshot received from the first server to execute the application on the virtual system.

SEMICONDUCTOR DEVICE
20230064905 · 2023-03-02 ·

When one of CPUs that perform a lock step operation fails and the failure type is an SW failure, the semiconductor device copies information held by an SR and a GR of the CPU operating normally to the CPU with the SW failure, thereby continuing a process without stopping the lock step operation. On the other hand, when the failure type is an HW failure, the failed CPU is stopped to continue the process with only the normal CPU.

Substitution apparatus, substitution control program, and substitution method

A substitution apparatus for installation in a vehicle in which a plurality of in-vehicle control apparatuses are implemented, the substitution apparatus including a control unit and a substitute unit. The control unit is configured to control the substitute unit based on transmission data transmitted from the in-vehicle control apparatuses, specify an abnormal in-vehicle control apparatus based on the transmission data, disable the specified abnormal in-vehicle control apparatus, and apply, to the substitute unit, a program for exhibiting functions otherwise normally executed by the specified abnormal in-vehicle control apparatus. The substitute unit is configured to substitute for the disabled in-vehicle control apparatus by executing the applied program.

System and device for data recovery for ephemeral storage

In various embodiments, a method for page cache management is described. The method can include: identifying a storage device fault associated with a fault-resilient storage device; determining that a first region associated with the fault-resilient storage device comprises an inaccessible space and that a second region associated with the fault-resilient storage device comprises an accessible space; identifying a read command at the second storage device for the data and determine, based on the read command, first data requested by a read operation from a local memory of the second storage device; determining, based on the read command, second data requested by the read operation from the second region; retrieving the second data from the second region; and scheduling a transmission of the second data from the fault-resilient storage device to the second storage device.

Storage system and method for data recovery after detection of an uncorrectable error

A storage system caches, in volatile memory, data read from non-volatile memory. After detecting an uncorrectable error in the data cached in the volatile memory, the storage system replaces the cached data with data re-read from the non-volatile memory and updated to reflect any changes made to the data after it was stored in the non-volatile memory. The storage system can also analyze a pattern in data adjacent to the uncorrectable error and predict corrected data based on the pattern.