G06F2209/5013

External way allocation circuitry for processor cores

An apparatus includes a plurality of processor cores; a shared cache connected to the plurality of processor cores; a cache control unit connected to the shared cache; and a way allocation circuitry connected to at least one of the plurality of processor cores. The way allocation circuitry is external to the plurality of processor cores. The cache control unit and the way allocation circuitry are cooperatively configured to process an intercepted memory request with respect to designated ways in the shared cache, the designated ways being based on a partition identifier and a partition table.

Systems and methods for dynamic allocation of computing resources for microservice architecture type applications

Systems and methods for auto-scaling computing resources for applications that rely on microservices, such as media streaming applications. In example embodiments, the systems and methods use a hybrid approach that takes into account proactive and reactive metrics to predict future processing needs. The proactive metrics use data obtained from the microservices pipeline. In example embodiments, computing units are deployed or removed from a cluster of computing units in anticipation of a future computational need and in consideration of quality of service requirements.

Information processing device and information processing system

A client terminal and a server system having a plurality of processing units processing game applications are connected to each other via a network. In a game system as a cloud computing system, the server system makes the client terminal wait for a start of processing of an application when there is no processing unit assignable to the client terminal. When the processing unit becomes assignable, the client terminal displays a screen inquiring whether to start the processing of the application. When a user does not respond for a predetermined time, the server system cancels the waiting state of the user.

SCHEDULING SYSTEM FOR COMPUTATIONAL WORK ON HETEROGENEOUS HARDWARE

The technology includes methods, processes, and systems for virtualizing graphics processing unit (GPU) memory. Example embodiments of the technology include managing an amount of GPU memory used by one or more processes, such as Application Programming Interfaces (APIs), that directly or indirectly impact one or more other processes running on the same GPU. Managing and/or virtualizing the amount of GPU memory may ensure that an end user does not receive a GPU out-of-memory error because the API request is impacted by the processing of other API requests. A virtual machine with access to a GPU may be organized with one or more job slots that are configured to specify the number of processes that are able to run concurrently on a specific virtual machine. A process may be configured on each virtual machine running a software program or API and is used to schedule work based on GPU memory requirements.

Method for resource allocation and terminal device

A method for resource allocation, a terminal device, and a non-transitory computer readable storage medium are provided. The method includes the following. A photography application of a terminal device determines a running scenario. The photography application determines a performance improvement strategy for the running scenario. The photography application sends a scenario data packet to an operating system of the terminal device, where the scenario data packet includes the performance improvement strategy and is configured for the operating system to adjust allocation of system resources for the photography application according to the performance improvement strategy.

Self-partitioning distributed computing system

Methods and systems of self-partitioning entities. One system includes an electronic processor configured to access the updated job distribution information. The electronic processor is also configured to query a second predetermined number of entities included within a first secondary partition of the one or more secondary partitions. The electronic processor is also configured to determine whether the first secondary partition includes a second set of remaining entities in addition to the second predetermined number of entities. In response to the first secondary partition including a second set of remaining entities, the electronic processor is also configured to divide the second set of remaining entities of the first secondary partition into a third partition, where the third partition has a third partition range smaller than the second partition range, and generate and transmit, to the central queue server, additional updated job distribution information identifying the third partition.

SYSTEMS AND METHODS FOR TYING TOKEN VALIDITY TO A TASK EXECUTED IN A COMPUTING SYSTEM
20220069997 · 2022-03-03 ·

It is desired to try to increase the security of a computing system running computer applications that may access data in a data storage system. In some embodiments, a token associates a user with a task that is being executed by a computing node. It may therefore be possible to determine which user executed which tasks. In some embodiments, the validity of a token is tied to the lifespan of a task associated with the token, rather than to a fixed amount of time. Therefore, if the task associated with the token is complete, the token may become invalid, rather than remaining valid for a duration of time that possibly exceeds the lifespan of the associated task. In some embodiments, a token is used to enforce data access control, e.g. to deny certain users access to certain data in the data storage system.

Methods, devices, and a computer program product for processing an access request and updating a storage system
11269548 · 2022-03-08 · ·

Techniques for processing an access request and updating a storage system are provided. For instance, a method comprises: receiving an access request for an object associated with a storage system, the storage system including a plurality of physical nodes, each of the plurality of physical nodes including at least one set of virtual units, each set of virtual units including at least one virtual unit; determining, from a plurality of sets of virtual units included in the plurality of physical nodes of the storage system, a target set of virtual units associated with the object; and determining, from the target set of virtual units, a target virtual unit corresponding to the object. With the technical solution of the present disclosure, not only a set of virtual units on a physical node may be easily split and merged, but also huge computing resources that need to be allocated may be saved, so better user experience may be brought about at a lower cost.

INTELLIGENTLY CHOOSING TRANSPORT CHANNELS ACROSS PROTOCOLS BY DRIVE TYPE
20220066822 · 2022-03-03 ·

In an approach to storage level load balancing, the load level of a storage system is monitored, where the load level is a utilization percentage of a plurality of CPU cores in the storage system. An overload condition is detected based on the utilization percentage of one or more CPU cores exceeding a threshold, where the overload condition is caused by an overlap of one or more I/O queues from multiple host computers accessing a single CPU core. Responsive to detecting the overload condition, a new I/O queue is selected on a second CPU core, where the second CPU core has a utilization percentage less than a second threshold. A recommendation is sent to a host computer, where the recommendation is to move I/O traffic from the first CPU core to the new I/O queue on the second CPU core to rebalance the load level of the storage system.

Systems, methods, and apparatuses for implementing a scheduler and workload manager with scheduling redundancy and site fault isolation

In accordance with disclosed embodiments, there are provided systems, methods, and apparatuses for implementing a stateless, deterministic scheduler and work discovery system with interruption recovery. For instance, according to one embodiment, there is disclosed a system to implement a stateless scheduler service, in which the system includes: a processor and a memory to execute instructions at the system; a compute resource discovery engine to identify one or more computing resources available to execute workload tasks; a workload discovery engine to identify a plurality of workload tasks to be scheduled for execution; a cache to store information on behalf of the compute resource discovery engine and the workload discovery engine; a scheduler to request information from the cache specifying the one or more computing resources available to execute workload tasks and the plurality of workload tasks to be scheduled for execution; and further in which the scheduler is to schedule at least a portion of the plurality of workload tasks for execution via the one or more computing resources based on the information requested. Other related embodiments are disclosed.