G06F2209/5011

AI VIDEO PROCESSING METHOD AND APPARATUS
20230049578 · 2023-02-16 ·

The method comprises: connecting to a plurality of AI computing boards in an AI processing resource pool and a plurality of video encoding and decoding boards in a video processing resource pool by means of a unified high-speed interface; respectively allocating a specified number of AI computing boards and video encoding and decoding boards on account of resources and bandwidths required for completing a processing task to form a temporary cooperation relationship based on the processing task; in response to resource overflow or insufficiency in the AI processing resource pool or the video processing resource pool caused by a processing task change, accessing more AI computing boards or video encoding and decoding boards or stopping using redundant AI computing boards or video encoding and decoding boards; performing the processing task on account of the allocated AI computing boards or video encoding and decoding boards, and releasing the temporary cooperation relationship.

SYSTEMS AND METHODS OF HYBRID CENTRALIZED DISTRIBUTIVE SCHEDULING ON SHARED PHYSICAL HOSTS
20230037293 · 2023-02-09 ·

Systems and systems for hybrid centralized distributive scheduling and conflict resolution of multiple scheduler instances that share physical resources in a cloud computing system. The cloud computing system includes a plurality of scheduler instances, a global resource manager (GRM) for high-level resource management and conflict resolution for the scheduler instances, and a plurality of physical hosts. Each physical host has a respective local resource manager (LRM). The scheduler instances are responsible for initially processing of scheduling and resource allocation for resource requests, and proposing candidate physical hosts (and respective resource allocation) for the resource requests to the GRM. The GRM is responsible for conflict resolution through its general conflict resolvers of filtering, sorting and counting. The GRM decides which physical hosts among the candidate physical hosts will run the runtime instances of the resource requests after resolving conflicts among the scheduler instances.

CPU utilization for service level I/O scheduling

One or more aspects of the present disclosure relate to service level input/output scheduling to control central processing unit (CPU) utilization. Input/output (I/O) operations are processed with one or more of a first CPU pool and a second CPU pool of two or more CPU pools. The second CPU pool processes I/O operations that are determined to stall any of the CPU cores.

Frozen indices
11556388 · 2023-01-17 · ·

Methods and systems for searching a frozen index are provided. Exemplary methods include: a method may comprise: receiving an initial search and a subsequent search; loading the initial search and the subsequent search into a throttled thread pool, the throttled thread pool including; getting the initial search from the throttled thread pool; storing a first shard from a mass storage in a memory in response to the initial search; performing the initial search on the first shard; providing first top search result scores from the initial search; and removing the first shard from the memory when the initial search is completed.

Method and system for multi-pronged backup using real-time attributes

A method and system for backup processes that includes identifying a target volume and identifying a number of available threads to back up the target volume. The elements in the target volume are distributed among the available threads based on a currently pending size of data in the threads. The elements are stored from each thread into a backup container, and merged from each of the backup containers into a backup volume.

Memory allocator for I/O operations

Some embodiments provide a novel method for sharing data between user-space processes and kernel-space processes without copying the data. The method dedicates, by a driver of a network interface controller (NIC), a memory address space for a user-space process. The method allocates a virtual region of the memory address space for zero-copy operations. The method maps the virtual region to a memory address space of the kernel. The method allows access to the virtual region by both the user-space process and a kernel-space process.

Memory pooling between selected memory resources

Apparatuses, systems, and methods related to memory pooling between selected memory resources are described. A system using a memory pool formed as such may enable performance of functions, including automated functions critical for prevention of damage to a product, personnel safety, and/or reliable operation, based on increased access to data that may improve performance of a mission profile. For instance, one apparatus described herein includes a memory resource, a processing resource coupled to the memory resource, and a transceiver resource coupled to the processing resource. The memory resource, the processing resource, and the transceiver resource are configured to enable formation of a memory pool between the memory resource and another memory resource at another apparatus responsive to a request to access the other memory resource transmitted from the processing resource via the transceiver.

CONFIGURING NODES FOR DISTRIBUTED COMPUTE TASKS
20230236895 · 2023-07-27 ·

Systems and methods are provided for improving compute job distribution using federated computing nodes. This includes identifying a plurality of independently controlled computing nodes which then receive a token such that they can each be identified as being authorized to participate in a federated computing node cluster. Metrics associated with the particular nodes are then received and based on the received metrics compute jobs are assigned to the particular node by assembling a compute job data packet comprising the one or more compute jobs and transmitting the assembled compute job data packet to the particular node. Other features are also described in which assigned compute jobs and/or unrelated compute tasks can be dynamically modified in order to optimize compute job completion based on the received metrics.

MEMORY POOLING BETWEEN SELECTED MEMORY RESOURCES
20230004444 · 2023-01-05 ·

Apparatuses, systems, and methods related to memory pooling between selected memory resources are described. A system using a memory pool formed as such may enable performance of functions, including automated functions critical for prevention of damage to a product, personnel safety, and/or reliable operation, based on increased access to data that may improve performance of a mission profile. For instance, one apparatus described herein includes a memory resource, a processing resource coupled to the memory resource, and a transceiver resource coupled to the processing resource. The memory resource, the processing resource, and the transceiver resource are configured to enable formation of a memory pool between the memory resource and another memory resource at another apparatus responsive to a request to access the other memory resource transmitted from the processing resource via the transceiver.

CONTROL METHOD AND APPARATUS OF CLUSTER RESOURCE, AND CLOUD COMPUTING SYSTEM
20230004439 · 2023-01-05 ·

This disclosure relates to a control method and apparatus of cluster resources, and a cloud computing system, and relates to the field of computer technologies. The method includes: in the case where a to-be-controlled resource is a to-be-expanded resource, determining a binding relationship between the to-be-expanded resource and an application; adding the to-be-expanded resource that is initialized into a resource pool of a corresponding application having the binding relationship with the to-be-expanded resource; generating a to-be-executed data packet of a to-be-processed application according to a deployment type of the to-be-processed application; and deploying the to-be-executed data packet on the to-be-expanded resource in the resource pool of the to-be-processed application for execution.