G06F11/203

Live Migrating Virtual Machines to a Target Host Upon Fatal Memory Errors
20230161674 · 2023-05-25

The disclosed technology provides techniques, systems, and apparatus for containing and recovering from uncorrectable memory errors in a distributed computing environment through migration of virtual machines and associated memory to a target host machine. An aspect of the disclosed technology includes a hypervisor or virtual machine manager that receives signaling of an uncorrectable memory error detected by a host machine. The virtual machine manager then uses information received via the signaling to identify virtual memory addresses or memory pages associated with the corrupted memory element so as to allow for containment and recovery from the error, and for live migration of the virtual machine.
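The page-identification step in the abstract above can be illustrated with a minimal sketch. It is not the claimed implementation: the 4 KiB page size, the `host_page_owners` table, and the function name are all assumptions made for illustration.

```python
# Illustrative sketch only; names and data layout are hypothetical.
PAGE_SIZE = 4096  # assumed page size in bytes


def affected_guest_pages(faulty_host_address, host_page_owners):
    """Return the (vm_id, guest_page_number) pairs backed by the faulty host page.

    host_page_owners is an assumed lookup table that maps a host page
    number to the guest pages it backs.
    """
    host_page = faulty_host_address // PAGE_SIZE
    return list(host_page_owners.get(host_page, []))


# Host page 2 backs guest page 17 of a VM called "vm-a".
owners = {2: [("vm-a", 17)]}
print(affected_guest_pages(2 * PAGE_SIZE + 100, owners))  # → [('vm-a', 17)]
```

A virtual machine manager could use such a list to contain the error to the affected pages before starting a live migration.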

METHOD, APPARATUS, AND NON-TRANSITORY COMPUTER READABLE MEDIUM FOR MIGRATING VIRTUAL MACHINES
20230062733 · 2023-03-02

A method and an apparatus for migrating a virtual machine (VM) include monitoring a status of a compute node; determining whether the compute node meets a trigger condition, wherein the trigger condition comprises either a time period of lost connection of the compute node reaching a predetermined time period or an unstable status of the compute node; and, if the compute node meets the trigger condition, transmitting a message to a control node to migrate the VM.
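The trigger check in the abstract above might be sketched as follows. This is a hedged illustration, not the patented method: `NodeStatus`, the message format, and the `send_to_control_node` callback are hypothetical names introduced here.

```python
from dataclasses import dataclass


@dataclass
class NodeStatus:
    """Hypothetical snapshot of a monitored compute node."""
    seconds_since_last_contact: float
    unstable: bool


def meets_trigger_condition(status, max_lost_seconds):
    """True if the lost-connection period reaches the threshold or the node is unstable."""
    return status.seconds_since_last_contact >= max_lost_seconds or status.unstable


def check_and_request_migration(node_id, status, max_lost_seconds, send_to_control_node):
    """Send a migration request to the control node when the trigger condition is met.

    Returns the message that was sent, or None if no migration was requested.
    """
    if not meets_trigger_condition(status, max_lost_seconds):
        return None
    message = {"action": "migrate_vm", "compute_node": node_id}  # assumed message shape
    send_to_control_node(message)
    return message
```

For example, calling `check_and_request_migration("node-1", NodeStatus(45.0, False), 30.0, print)` would print the migration request, because the node has been unreachable for longer than the assumed 30-second threshold.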

ENSURING TIMELY RESTORATION OF AN APPLICATION
20220334929 · 2022-10-20

Methods, apparatuses, and systems for ensuring timely restoration of an application, including: determining, based on a plurality of factors, a projected time to complete an application recovery operation from a first execution environment to a second execution environment; and generating a report based on the projected time to complete the application recovery operation.

Migrating virtual machines between computing environments
11467886 · 2022-10-11

Virtual machines can be migrated between computing environments. For example, a system can receive a request to perform a migration process involving migrating a virtual machine from a source computing environment to a target computing environment. The target computing environment may be a cloud computing environment. In response to the request, the system can receive first configuration data for a first version of the virtual machine that is located in the source computing environment. The first configuration data can describe virtualized features of the first version of the virtual machine. The system can use the first configuration data to generate second configuration data for a second version of the virtual machine that is to be deployed in the target computing environment. The system can then deploy the second version of the virtual machine within one or more containers of the target computing environment in accordance with the second configuration data.

Computer duplication and configuration management systems and methods

In part, the disclosure relates to systems and methods to rapidly copy the computer operating system, drivers, and applications from a source computer to a target computer using a duplication engine. Once the copy is complete, the source computer will resume execution, and the target computer will first alter its configuration (also referred to as a role or personality) and then resume execution conforming to its new configuration as indicated by a profile stored in protected or specialized memory. The profile can be a value, a file, or other memory structure and is protected in the sense that the profile (and/or the region of memory where it is stored) must not be overwritten by a state transfer from the source computer to the target computer.

Method and system for providing coordinated checkpointing to a group of independent computer applications

A system and method for performing loss-less migration of an application group. In an exemplary embodiment, the system may include a high-availability services module structured for execution in conjunction with an operating system, and one or more computer nodes of a distributed system upon which at least one independent application can be executed. The high-availability services module may be structured to be executable on the one or more computer nodes for loss-less migration of the one or more independent applications, and is operable to perform checkpointing of all state in a transport connection.

Method of recovering application data from a memory of a failed node

A method of recovering application data from the memory of a failed node in a computer system comprising a plurality of nodes connected by an interconnect and of writing the application data to a replacement node; wherein a node of the computer system executes an application which creates application data storing the most recent state of the application in a node memory; the node fails; the node memory of the failed node is then controlled using a failover memory controller; and the failover memory controller copies the application data from the node memory of the failed node to a node memory of the replacement node over the interconnect.

Server clustering in a computing-on-demand system

A device may provision two or more servers, each of the servers including a network interface. In addition, the device may enable the network interface in each of the provisioned servers, create a shared volume, assign the shared volume to each of the provisioned servers, and enable a clustering application on each of the provisioned servers to form a cluster comprising the provisioned servers, the cluster having a heartbeat via the network interfaces.

Reliable virtualized network function system for a cloud computing system

A reliable network function virtualization (rVNF) system includes a virtualized network function (VNF) application instance that includes a plurality of physical VNF instances. A load balancer provides an interface between a client and the VNF application instance. A load balancer interface facilitates delivery of packets related to a particular user context to the same physical VNF instance. A communication interface facilitates communication between the client and the VNF application instance. Application storage stores session data associated with the VNF application instance.

VIRTUAL MACHINE RECOVERY IN SHARED MEMORY ARCHITECTURE

Examples provide for virtual machine recovery using pooled memory. A shared partition is created on pooled memory accessible by a plurality of virtual machine hosts. A set of memory pages for virtual machines running on the hosts is moved to the shared partition. A master agent polls memory page tables associated with the plurality of hosts for write access. If the master agent obtains write access to a memory page table of a given host, the given host that previously held the write access is identified as a failed host or an isolated host. The virtual machines of the given host that are enabled to resume from pooled memory are respawned on a new host while maintaining the memory state of the virtual machines using data within the pooled memory, including the virtual machine memory pages, memory page table, host profile data, and/or host-to-VM table data.
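The polling step in the abstract above can be sketched under a simplifying assumption: write access to each host's page table is modeled as a lock that a healthy host holds. The class and function names are hypothetical and do not come from the patent.

```python
import threading


class HostPageTable:
    """Hypothetical stand-in for a host's memory page table in pooled memory."""

    def __init__(self, host_id):
        self.host_id = host_id
        # Assumption: a healthy host keeps this lock held as its write access.
        self.write_lock = threading.Lock()


def poll_for_failed_hosts(page_tables):
    """Try to obtain write access to each page table without blocking.

    If the master agent obtains the lock, the host no longer holds it, so the
    host is treated as failed or isolated. The agent keeps the lock it obtained.
    """
    failed = []
    for table in page_tables:
        if table.write_lock.acquire(blocking=False):
            failed.append(table.host_id)
    return failed


def plan_respawn(failed_hosts, host_to_vms, new_host):
    """Map each VM of a failed host to the new host, using an assumed host-to-VM table."""
    return {vm: new_host for host in failed_hosts for vm in host_to_vms.get(host, [])}
```

In this model, a host that has crashed or lost access to the pooled memory has released its lock, so the next polling pass detects it and its VMs can be scheduled onto a new host while their memory pages stay in the shared partition.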