G06F11/328

Method and Apparatus for Predicting and Exploiting Aperiodic Backup Time Windows on a Storage System
20230259429 · 2023-08-17 ·

A multivariate time series model such as a Vector Auto Regression (VAR) model is built using fabric utilization, disk utilization, and CPU utilization time series data. The VAR model leverages interdependencies between multiple time-dependent variables to predict the start and length of an aperiodic backup time window, and to cause backup operations to occur during the aperiodic backup time window to thereby exploit the aperiodic backup time window for use in connection with backup operations. By automatically starting backup operations during predicted aperiodic backup time windows where the CPU, disk, and fabric utilization values are predicted to be low, it is possible to implement backup operations during time windows where the backup operations are less likely to interfere with primary application workloads, or system application workloads that need to be implemented to maintain optimal operation of the storage system.

Hypervisor-independent reference copies of virtual machine payload data based on block-level pseudo-mount
11321195 · 2022-05-03 · ·

Hypervisor-independent reference copies of virtual machine payload data based on block-level pseudo-mount infrastructure and techniques are generated and stored in an illustrative data storage management system. An illustrative hypervisor-independent reference copy comprises one or more virtual-machine payload data files that originated from a first virtual machine. The hypervisor-independent virtual-machine-payload reference copy is governed by a distinct reference copy policy that controls retention, storage, tiering, scheduling, etc. for the reference copy, independently of how the illustrative system treats other virtual machine payload data files originating from the same virtual machine.

Generating metrics values at component levels of a monolithic application and of a microservice of a microservices-based architecture
11321217 · 2022-05-03 · ·

Monitoring and troubleshooting tools provide the capability to visualize different levels of a client's application that is deployed as a suite of independent but cooperating services (e.g., an application that includes a monolithic application and a microservices-based application), collect values of monitored or tracked metrics at those different levels, and visualize values of the metrics at those levels. For example, metrics values can be generated for components of the monolithic application and/or for components of a microservice of the microservice-based application.

MALFUNCTIONING SYSTEM IDENTIFICATION MECHANISM
20220129361 · 2022-04-28 · ·

A management system is described. The management system includes an interface coupled to a plurality of infrastructure appliances and one or more processors to monitor each of the plurality of infrastructure appliances, detect a malfunction at a first of the infrastructure appliances, and transmit a display message to one or more of the plurality of infrastructure appliances that are adjacent to the first infrastructure appliance, wherein a display message indicates one or more activity light indicators to be activated at an adjacent infrastructure appliance.

Graphical user interface for visual correlation of virtual machine information and storage volume information

The disclosed embodiments include a method for identifying a performance metric to diagnose a cause of a performance issues of virtual machine. The method includes obtaining data of a virtual machine, an indication that a storage volume contains data of the virtual machine, data about the storage volume, and an identification of the storage volume. The data of the virtual machine is correlated with the data about the storage volume based on the indication that the storage volume contains data of the virtual machine and the identification of the storage volume. A performance metric is identified based at least in part on an outcome of the correlating. The performance metric indicates that the storage volume is a cause of a performance issue of the virtual machine. A state related to the storage volume is changed to mitigate the cause of the performance issue of the virtual machine.

Management of internet of things devices

A method and system for communicating with IoT devices to gather information related to device failure or error(s) is disclosed. The system receives log files from an IoT device (e.g., a smart refrigerator) that recently failed. The system determines which log files the IoT device created before and/or after a failure. After gathering this information, the system stores the information in a database, sends it to the IoT device manufacturer, or sends it to a cloud provider. The system can also send the failure-related information to the IoT device-related entities (e.g., IoT device manufacturers), and the entity uses this information to troubleshoot the failure and send a fix or software update to the IoT device.

MANAGING DATA CENTER FAILURE EVENTS

Managing data center recovery from failure events can include a failure event platform having aspects provided via a user interface that integrates multiple failure and recovery management and execution features. The features can include, among others, application drift monitoring between production and recovery environments, real-time health checks of system components, user-modifiable scripting for prioritizing and customizing data center recovery actions, and a recovery execution tool.

BACKUP DATA SECURITY MANAGEMENT SYSTEM AND ASSOCIATED METHOD
20230244805 · 2023-08-03 ·

The disclosure relates to a computer implemented method for assisting a user managing the data-security of backup copies of a computer system having a plurality of nodes, the method comprising: receiving status data for backup copies associated with a plurality of nodes, wherein, for each node, the status data provides a status of one or more backup copies associated with the node with respect to a plurality of data-security criteria; determining a backup security metric for each of the plurality of nodes based on the status data; and providing the security metrics for the user to demonstrate the relative level of backup data-security of the plurality of nodes.

PERFORMANCE INFORMATION VISUALIZATION APPARATUS, PERFORMANCE INFORMATION VISUALIZATION METHOD, AND NON-TRANSITORY COMPUTER-READABLE STORAGE MEDIUM
20220121547 · 2022-04-21 · ·

A performance information visualization apparatus includes a memory and a processor. The memory configured to stores, as node information, information that indicates a connection relationship of a node, and information that indicates a generation in which the node is added to an information processing system and a generation in which the node is deleted from the information processing system. The processor configured to that synthesizes configuration information by, when an event occurrence node where an event has occurred does not exist in configuration information of a specific generation associated with a time when the event has occurred and the event occurrence node is added to the configuration information, adding a node and connection between nodes including a connection relationship of the event occurrence node, based on the node information.

System, Method, and Computer Program Product for Diagnosing Faulty Components in Networked Computer Systems
20230246902 · 2023-08-03 ·

Described are a system, method, and computer program product for diagnosing faulty components in networked computer systems. The method includes receiving a plurality of alerts associated with a fault in a networked computer system. The method also includes generating a graph of a network topology of the networked computer system. The method further includes associating each alert with a node of the graph to determine a set of nodes affected by the fault. The method further includes determining a common node of the graph having a plurality of edges connected to nodes affected by the fault. The method further includes determining a faulty component based on the common node, retrieving a set of records of operational changes to the networked computer system, and determining, based on the set of records and the faulty component, an operational change that caused the fault in the networked computer system.