Patent classifications
G06F11/2035
CORE PAIRING IN MULTICORE SYSTEMS
A method, executed by a computer, includes pairing a first core with a second core to form a first core group, wherein each core of the group has a plurality of functional units, transferring instructions received by the first core to the second core for execution via a first inter-core communication bus, and executing the instructions on the second core. A computer system and computer program product corresponding to the above method are also disclosed herein.
Maintaining point-in-time granularity for backup snapshots
A method and system for maintaining granular backups is discussed. Backup windows may be defined, and policies may be associated with those windows. Journal and backup files may be retained or deleted in accordance with those policies.
Load balancing and fault tolerant service in a distributed data system
Techniques for load balancing and fault tolerant service are described. An apparatus may comprise load balancing and fault tolerant component operative to execute a load balancing and fault tolerant service in a distributed data system. The load balancing and fault tolerant service distributes a load of a task to a first node in a cluster of nodes using a routing table. The load balancing and fault tolerant service stores information to indicate the first node from the cluster of nodes is assigned to perform the task. The load balancing and fault tolerant service detects a failure condition for the first node. The load balancing and fault tolerant service moves the task to a second node from the cluster of nodes to perform the task for the first node upon occurrence of the failure condition.
Container image arrangement method and non-transitory computer-readable medium
A container image arrangement method executed by a processor included in a computer to execute a process, the computer being connected to each of a plurality of nodes, the process includes, identifying a first node that has a first storage storing a container image and has a largest number of containers started from the container image among the plurality of nodes, determining whether the container operating in the first node is capable of starting in a second node among the plurality of nodes other than the first node, where the second has a second storage storing the container image, and storing the container image in a third storage included in a third node different from each of the first node and the second node among the plurality of the nodes when it is determined that the container is not capable of starting in the second node.
Systems and methods for error recovery
Embodiments of the present disclosure include an error recovery method comprising detecting a computing error, restarting a first artificial intelligence processor of a plurality of artificial intelligence processors processing a data set, and loading a model in the artificial intelligence processor, wherein the model corresponds to a same model processed by the plurality of artificial intelligence processors during a previous processing iteration by the plurality of artificial intelligence processors on data from the data set.
PLUG-IN BASED FRAMEWORK TO PROVIDE FAULT TOLERANCE AND HIGH AVAILABILITY IN DISTRIBUTED SYSTEMS
A plug-in based framework provides high availability (HA), including fault tolerance, in a distributed system, such as provided by a virtualized computing environment. The framework uses blueprints that define entities to be monitored, failure conditions, failover actions, restoration actions, and other aspects associated with HA. Microservices execute the blueprints, and a load balancer may balance the execution of the blueprints amongst microservices.
VIRTUALIZED FILE SERVER DISASTER RECOVERY
In one embodiment, a system for managing a virtualization environment includes a set of host machines, each of which includes a hypervisor, virtual machines, and a virtual machine controller, and a virtualized file server backup system configured to identify backup data, wherein the backup data comprises data stored on the virtual disks and VFS configuration information, and the first data is identified in accordance with a backup policy, send the backup data to one or more remote sites for storage, and, in response to detection of changes in the backup data, send the changes to the remote sites in accordance with a replication policy. The backup data may be identified based on a protection domain associated with the backup policy. The data stored on the VFS may include one or more storage objects. The storage objects may include shares, groups of shares, files, or directories.
Clustered storage network
A data storage network is provided. The network includes a client connected to the data storage network; a plurality nodes on the data storage network, wherein each data node has two or more RAID controllers, wherein a first RAID controller of a first node is configured to receive a data storage request from the client and to generate RAID parity data on a data set received from the client, and to store all of the generated RAID parity data on a single node of the plurality of nodes.
Monitoring of replicated data instances
Replicated instances in a database environment provide for automatic failover and recovery. A monitoring component can obtain a lease enabling the component to periodically communicate with, and monitor, one or more data instances in the data environment, where the data instance can be a replicated instance including a primary and a secondary replica. For a large number of instances, the data environment can be partitioned such that each monitoring component can be assigned a partition of the workload. In the event of a failure of a monitoring component, the instances can be repartitioned and the remaining monitoring components can be assigned to the new partitions to substantially evenly distribute the workload.
Backup backplane management control in a server rack system
A server rack includes a rack management controller (RMC) configured to manage a first function and a backplane including a backplane controller (BPC). The BPC is configured to monitor the RMC, determine that the RMC is unavailable, and manage the first function, in response to determining that the RMC is unavailable.