G06F9/4416

Integrated network boot operating system installation leveraging hyperconverged storage

A computing node is configure to implement an intra-node network boot and installation protocol (protocol) for booting and installing an operating system (OS) on a virtual machine hosted on the computing node without communicating over a physical network. The protocol includes hosting a dynamic host configuration protocol (DHCP) server instance and/or a network boot server instance on a controller virtual machine of the computing node to emulate DHCP protocol and network boot server protocol communications. In some examples, the protocol further utilizes one or more virtual extensible local area networks (LANs)(VXLANs) and a virtual switch hosted at a hypervisor running on the computing node.

Sharing image installation image streams

A technique includes a first compute node of a cluster of nodes receiving a communication from a cluster maintenance node of the cluster instructing the first compute node to provide an installation image that is received by the first compute node to a second compute node of the cluster. The first node and the second node are peers. The technique includes sharing the first installation stream that communicates the image to the first compute node. Sharing the first installation stream includes, while the first compute node is receiving the first installation stream, the first compute node providing a second installation stream to communicate parts of the image received via the first installation stream to the second compute node.

Control of A Computer System in A Power-Down State
20230066750 · 2023-03-02 ·

Techniques are disclosed relating a computer system in a power-down state receiving a communication from a remote computer system and performing a task indicated by the communication. The computer system in a power-down state performs the task without transitioning from the power-down state into a power-up state. Exemplary tasks performed in the power-down state include uploading one or more files to a remote computer system, downloading one or more files from a remote computer system, deleting one or more files from the computer system, accessing input/output devices, disabling the computer system, and performing a memory check on the computer system.

HIGH-AVAILABILITY (HA) MANAGEMENT NETWORKS FOR HIGH PERFORMANCE COMPUTING PLATFORMS

Embodiments of systems and methods for high-availability (HA) management networks for High Performance Computing (HPC) platforms are described. In some embodiments, an HPC platform may include a first Baseboard Management Controller (BMC) having a first network port; and a hardware accelerator comprising a second BMC having a second network port, where at least one of: (a) the first BMC is configured to share the first network port with the second BMC in response to a determination that the second network port has failed or has insufficient bandwidth, or (b) the second BMC is configured to share the second network port with the first BMC in response to a determination that the first network port has failed or has insufficient bandwidth.

Restart controller

A native restart controller restarts services in a controlled manner. The services run in a cloud computing environment comprising a plurality of computing devices executing a plurality of Kubernetes pods comprising one or more containers. An indication is received of a deployment defining a service to be implemented in the cloud computing environment, the deployment including a desired number of instances of the service. An API is instantiated that is operable to provide a first state indicative of whether restarts are to be controlled by the restart controller and a second state indicating that a restart is required. In response to receiving an indication that restarts for the cluster are to be controlled by the restart controller and that a restart is required, the instances are restarted in a sequence in accordance with restart criteria associated with the service.

Bootstrapping Devices on a Network
20230164549 · 2023-05-25 ·

Methods for operating a device and for managing bootstrapping of devices are disclosed. The method (100) for operating a device comprises computing (102) a derivative of a secret shared between the device and a server entity of a network, generating (104) a temporary bootstrap URI by combining at least a part of the computed derivative with a static bootstrap URI for the network, and sending (106) a bootstrap request to the temporary bootstrap URI. The method for managing bootstrapping of devices comprises generating temporary bootstrap URIs corresponding to devices operable to connect to a network, and updating a network DNS registry to map the generated temporary bootstrap URIs to the IP address of at least one of a bootstrap server instance reachable via the network and/or a bootstrap load balancer. Also disclosed are a device, a bootstrap load balancer, a bootstrap server, and a computer program.

SYSTEM AND METHOD FOR FACILITATING MANAGEMENT OF EDGE COMPUTING NODES OF AN EDGE COMPUTING NETWORK
20230161601 · 2023-05-25 ·

A system and method for facilitating management of edge computing nodes of an edge computing network is disclosed is disclosed. The method includes registering one or more second edge nodes with an edge computing network, reading a MAC address of a network card associated with the one or more second edge nodes, and obtaining installation topic and the MAC address. Also, the method includes publishing a message on the installation topic associated with one or more parameters, performing one or more operations on the set of first edge nodes based on the installation topic, and executing the set of install instructions on the one or more second edge nodes. Further, the method includes publishing a result of the execution of the set of install instructions on the installation topic and the MAC address and updating a state of the edge computing network.

SYSTEM BOOTING METHOD AND RELATED COMPUTER SYSTEM

A system booting method for a computer system having a plurality of central processing units and a booting unit is disclosed. The system booting method includes determining, by the booting unit, a booting mode of the computer system; transmitting a booting signal, which is related to the booting mode, to the plurality of CPUs of the computer system; and entering a multi-CPU booting mode or entering an independent booting mode of the plurality CPUs according to the booting signal.

Dynamic generation of cloud platform application programming interface calls
11630705 · 2023-04-18 · ·

An apparatus comprises a processing device configured to receive a request to execute an action on cloud assets of a cloud platform utilizing an application programming interface (API) exposed by the cloud platform, the request comprising a set of keyword arguments, and to generate a code class instance for the API. The processing device is also configured to instantiate, via the generated code class instance, a client for the cloud platform utilizing a first subset of arguments in the set of keyword arguments, to determine from the set of keyword arguments an identifier of the action to be executed, and to execute the action by running a function of the generated code class instance, the function dynamically generating an API call utilizing the instantiated client for the cloud platform, the determined identifier, and a second subset of arguments in the set of keyword arguments.

MULTI-CLUSTER BOOT-STRAPPING

Disclosed herein are system, method, and computer program product embodiments for multi-cluster boot-strapping. In some embodiments, a server residing on a primary computing cluster receives a first request to establish a temporary connection between the primary computing cluster and a secondary computing cluster. The server establishes the temporary connection between the primary computing cluster and the secondary computing cluster using the first set of credentials. Furthermore, the server receives a second request to establish a persistent connection between the primary computing cluster and the secondary computing cluster. The server establishes the persistent connection by transmitting a third request comprising the configuration settings to the secondary computing cluster thereby causing the secondary computing cluster to generate a second set of credentials corresponding to the primary computing cluster. The server receives and stores the second set of credentials.