G06F2209/5022

HYBRID COMPUTING SYSTEM MANAGEMENT

A method, a system and a computer program product for hybrid computing system management are proposed. In the method, workload information associated with a set of application server instances running in a first computing system is obtained by a server controller in response to a scaling request for changing the number of instances in the set of application server instances from a request controller. The set of application server instances serves at least one application running in a second computing system. A scaling decision indicating whether to change the number of instances in the set of application server instances is determined by a predictor based on the workload information from the server controller. The second computing system is enabled by the request controller to handle requests associated with the at least one application for the set of application server instances based on the scaling decision.

Elastic container platform architecture
11561816 · 2023-01-24 · ·

A method, a device, and a non-transitory storage medium are described in which an elastic platform virtualization service is provided in relation to a virtual device. The elastic platform virtualization service includes logic that provides for the management of a virtualized device during its life cycle. The creation or reconfiguration of the virtualized device is based on a tertiary choice between using dedicated hardware and dedicated kernel; common hardware and common kernel; or a combination of the dedicated hardware, dedicated kernel, common hardware, and common kernel.

Method and apparatus for comparing acquired cloud resource use information to thresholds to recommend a target cloud resource instance

Embodiments of the present disclosure disclose a method and apparatus for acquiring information. The method may include: acquiring cloud resource use information; comparing the cloud resource use information with a use rate threshold value, to obtain a comparison result; obtaining use state information of a cloud resource corresponding to the cloud resource use information according to the comparison result; and generating cloud resource state information based on the use state information.

Method, device and computer program product for processing computing job

A first set of requirements of a first set of computing tasks for computing resources in a computing system is acquired respectively. Based on a determination that the requirement of a computing task in the first set of computing tasks for a computing resource satisfies a resource threshold condition, the computing task is divided into a plurality of sub-tasks. The resource threshold condition describes the threshold of a computing resource provided by a computing device in a plurality of computing devices in the computing system. A merging task for merging a plurality of sub-results of the plurality of sub-tasks into a result of the computing task is generated. Based on other computing tasks than the computing task in the set of computing tasks, the plurality of sub-tasks, and the merging task, a second set of computing tasks of the computing job is determined so as to process the computing job.

Redistribution of processing groups between server nodes based on hardware resource utilization
11704160 · 2023-07-18 · ·

At least one processor is configured to obtain measurement information comprising an indication of an amount of utilization of a hardware resource of a first server node by a plurality of processing groups and to determine that the amount of utilization of the hardware resource is above a threshold amount of utilization. The at least one processor is further configured to select a given processing group for redistribution based at least in part on the determination that the amount of utilization of the hardware resource is above the threshold amount and on an amount of utilization of the hardware resource by the given processing group. The at least one processor is further configured to determine that a second server node comprises enough available capacity of the hardware resource and to redistribute the given processing group to the second server node based at least in part on the determination.

System and method for supporting a usage calculation process in a cloud infrastructure environment

Systems and methods described herein support a usage calculation process in a cloud infrastructure environment. The usage calculation process can be used to determine whether a requested transaction that targets a compartment within a tree-structure of compartments violates any compartment quota or limit within parent compartments within the tree-structure.

CLOUD APPLICATION THRESHOLD BASED THROTTLING
20230221995 · 2023-07-13 ·

Systems and methods are provided for intercepting computing requests and modifying the execution timing thereof based on thresholds and minimum performance criteria and/or adjusting hosted services plans in order to monitor and control costs of hosting software applications on hosted provider computing resources.

PUBLISHING PHYSICAL TOPOLOGY NETWORK LOCALITY INFORMATION FOR GRAPHICAL PROCESSING UNIT WORKLOADS

Discussed herein are techniques that utilize locality information of host machines included in a cluster network for the execution of graphical processing unit based workloads. For each host machine of a plurality of host machines, locality information for the host machine is stored therein. The locality information for a host machine identifies a rack comprising the host machine. Responsive to receiving a request requesting execution of a workload, one or more host machines of the plurality of host machines are identified as being available for executing the workload. For each of the one or more host machines, the locality information for the host machine is obtained. Further, linkage information of the one or more host machines is identified. The locality information and the linkage information of the one or more host machines is provided in response to the request.

Technologies for assigning workloads to balance multiple resource allocation objectives

Technologies for allocating resources of managed nodes to workloads to balance multiple resource allocation objectives include an orchestrator server to receive resource allocation objective data indicative of multiple resource allocation objectives to be satisfied. The orchestrator server is additionally to determine an initial assignment of a set of workloads among the managed nodes and receive telemetry data from the managed nodes. The orchestrator server is further to determine, as a function of the telemetry data and the resource allocation objective data, an adjustment to the assignment of the workloads to increase an achievement of at least one of the resource allocation objectives without decreasing an achievement of another of the resource allocation objectives, and apply the adjustments to the assignments of the workloads among the managed nodes as the workloads are performed. Other embodiments are also described and claimed.

TECHNIQUES TO ENABLE QUALITY OF SERVICE CONTROL FOR AN ACCELERATOR DEVICE
20220413909 · 2022-12-29 ·

Examples include techniques to enable quality of service (QoS) control for an accelerator device. Circuitry at an accelerator device implements QoS control responsive to receipt of a submission descriptor for a work request to execute a workload for an application hosted by a compute device coupled with the accelerator device. An example QoS control includes accepting the submission descriptor to a work queue at the accelerator device based on a work size of submission descriptor submissions of the application to the work queue over a unit of time not exceeding a submission rate threshold. The work queue is associated with an operational unit at the accelerator device to execute the workload based on information included in the submission descriptor. The work queue to be shared with at least one other application hosted by the compute device.