G06F11/0769

SYSTEM, METHOD, AND COMPUTER PROGRAM PRODUCT FOR LOCATION AWARE DEVICE FAULT DETECTION
20230214287 · 2023-07-06

A system, method, and computer program product for identifying location-specific faults are provided. Some embodiments may include receiving first device status data associated with a first computing device, where the first device status data may comprise first location-indicative data indicative of a location. Some embodiments may include comparing the first device status data with second device status data associated with one or more second computing devices, where the second device status data may comprise second location-indicative data indicative of the location. Some embodiments may include determining, based on the comparison of the first device status data and the second device status data, that the first computing device is affected by one or more of a device-specific fault or a location-specific fault. Some embodiments may include causing information regarding the device-specific fault or the location-specific fault to be displayed via a graphical user interface.
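
The comparison step in this abstract can be illustrated with a minimal Python sketch. It is not the claimed method: the record keys (`device_id`, `location`, `faulty`), the function name `classify_fault`, and the 0.5 threshold are all assumptions introduced here. The idea shown is only that a fault shared by most peers at the same location points to the location, while a fault seen on one device alone points to the device.

```python
def classify_fault(first, others, threshold=0.5):
    """Classify a fault on `first` by comparing it with peers at the same location.

    Each status record is a dict with hypothetical keys:
    'device_id', 'location', and 'faulty' (bool).
    """
    if not first["faulty"]:
        return "no-fault"
    # Keep only second devices whose location-indicative data matches.
    peers = [d for d in others if d["location"] == first["location"]]
    if not peers:
        # Assumption: with nothing to compare against, blame the device.
        return "device-specific"
    faulty_share = sum(1 for d in peers if d["faulty"]) / len(peers)
    # Assumption: a fault shared by at least `threshold` of peers is location-wide.
    return "location-specific" if faulty_share >= threshold else "device-specific"


first = {"device_id": "a", "location": "store-1", "faulty": True}
peers = [
    {"device_id": "b", "location": "store-1", "faulty": True},
    {"device_id": "c", "location": "store-1", "faulty": True},
    {"device_id": "d", "location": "store-2", "faulty": False},
]
print(classify_fault(first, peers))
```

A real system would likely compare richer status fields than a single boolean; the threshold rule stands in for whatever comparison the embodiments actually use.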

System and method for monitoring and upgrading a dual-flash device
11550655 · 2023-01-10

One embodiment provides a computer-implemented method for monitoring and upgrading a dual-flash device. The method includes performing an OS upgrade on a server; writing an upgraded OS to the dual-flash device; updating a grub.cfg file corresponding to the upgraded OS; and deleting old OS files from the dual-flash device.

Lock database code for online patching
11537387 · 2022-12-27

Systems, methods, and other embodiments associated with patching database objects while a database system is online are described. In one embodiment, a patch command is defined and identifies a database code object, wherein the patch command is configured to modify code of the database code object. A serialized lock is requested from the database system for the database code object, wherein the serialized lock prohibits executing applications of the database system from accessing the database code object. In response to receiving the serialized lock from the database system, the code of the database code object is modified with a patch code to generate a modified database code object. The modified database code object is compiled and the serialized lock on the database code object is released to allow other executing applications to call and access the database code object.
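
The lock, modify, compile, release sequence described here can be sketched in Python as an analogy. Nothing below is a database API: `CodeObject`, `apply_patch`, and the use of `threading.Lock` as the "serialized lock" are hypothetical stand-ins chosen to show the ordering of steps.

```python
import threading


class CodeObject:
    """Hypothetical stand-in for a database code object (e.g. a stored procedure)."""

    def __init__(self, name, source):
        self.name = name
        self.source = source
        self.lock = threading.Lock()  # plays the role of the serialized lock
        self._compiled = compile(source, name, "exec")

    def call(self, func, *args):
        # Callers wait here while a patch holds the lock.
        with self.lock:
            namespace = {}
            exec(self._compiled, namespace)
            return namespace[func](*args)


def apply_patch(obj, patch_source):
    """Acquire the lock, swap in the patch code, recompile, then release."""
    with obj.lock:
        obj.source = patch_source
        obj._compiled = compile(patch_source, obj.name, "exec")
    # Leaving the `with` block releases the lock so callers can proceed.


proc = CodeObject("bump", "def bump(x):\n    return x + 1\n")
print(proc.call("bump", 1))
apply_patch(proc, "def bump(x):\n    return x + 2\n")
print(proc.call("bump", 1))
```

Because compilation happens while the lock is held, no caller can observe the object between the source change and the recompile, which is the property the abstract attributes to the serialized lock.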

INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND STORAGE MEDIUM

According to one embodiment, an information processing apparatus includes a processing circuit. The processing circuit calculates a first input/output error related to normal data and a second input/output error related to pseudo abnormal data different from the normal data, for each of a plurality of autoencoders having different network structures. The processing circuit outputs relational data indicating a relation between the network structure and the first input/output error and the second input/output error.

Real-time cognitive root cause analysis (CRCA) computing

An ordered set of root cause analysis (RCA) document entry criteria is identified. RCA input segments are specified using unstructured natural language input, including at least: incident descriptive elements, a single problem statement, a set of why questions and answers, and a single cause categorization. A guided input sequence of the RCA input segments is performed interactively with a user. Quality indicators of content of user input entered during a respective RCA input segment are determined using a scoring algorithm, and the user is assisted with improving precision and consistency of the user input. Responsive to a threshold of consistent user input across the RCA input segments resulting in identification of a single cause categorization of an information technology (IT) problem, an RCA document is generated that identifies the single cause categorization of the new IT problem.

FAULT MANAGEMENT SYSTEM FOR FUNCTIONAL SAFETY OF AUTOMOTIVE GRADE CHIP
20220392280 · 2022-12-08

A fault management system for functional safety of an automotive grade chip includes: an out-of-chip system and an automotive-grade chip, where the automotive-grade chip includes a processor, a system controller, a system configuration module, a fault management device, and an on-chip function module; and the fault management device is configured with a fault classification management model.

COMPUTING CLUSTER HEALTH REPORTING ENGINE
20220391277 · 2022-12-08

A cluster health reporting engine may be a software tool that compiles health data reported by data collection hosts. The health data describes computing resources of backend computing clusters whose failure during the ordinary course of data query and processing functions may impede the normal functioning of those functions. Such techniques may generate compiled health data reported by a data collection host for a particular host of a computing cluster, enabling administrative personnel to quickly narrow the specificity of the health data reported. Such techniques may aggregate health data reported by a data collection host over a dimension of hosted services, and may configure a reporting sub-system to visualize this aggregated health data, enabling administrative personnel to quickly view storage capacity consumed by various hosted services and, through visual highlighting, identify hosted services or sub-services generating adverse health data.

System and method for detecting anomalies by discovering sequences in log entries
11513935 · 2022-11-29

A method for detecting an anomaly includes retrieving a log file that includes log entries, grouping the log entries into clusters of log entry types based on number of occurrences and average time interval, and discovering a sequence of the log entry types within each of the clusters. The sequence of the log entry types is based on a shortest path from a first one of the log entry types to a last one of the log entry types.
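
The two grouping features named in this abstract, number of occurrences and average time interval, can be computed per log entry type as in the Python sketch below. It covers only that feature step; the clustering and shortest-path sequence discovery are not shown. The input shape (pairs of timestamp in seconds and log entry type) and the function name `log_type_features` are assumptions.

```python
from collections import defaultdict


def log_type_features(entries):
    """Return {log_type: (occurrences, average_interval)} for the given entries.

    `entries` is an iterable of (timestamp_seconds, log_type) pairs; this
    shape is an assumption, not taken from the abstract.
    """
    times = defaultdict(list)
    for timestamp, log_type in entries:
        times[log_type].append(timestamp)

    features = {}
    for log_type, stamps in times.items():
        stamps.sort()
        # Gaps between consecutive occurrences of the same log entry type.
        gaps = [later - earlier for earlier, later in zip(stamps, stamps[1:])]
        average_gap = sum(gaps) / len(gaps) if gaps else None  # None if seen once
        features[log_type] = (len(stamps), average_gap)
    return features


entries = [(0, "open"), (10, "open"), (20, "open"), (5, "close")]
print(log_type_features(entries))
```

Log entry types with similar (occurrences, average interval) pairs could then be handed to any clustering routine; which one the method uses is not stated in the abstract.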

Preventing disruption within information technology environments

A determination is made whether an incident that affects one configuration item in a plurality of configuration items within an information technology environment impacts at least one event for at least one other configuration item in the plurality of configuration items. In response to determining that the incident does impact at least one event on at least one other configuration item, one or more pre-defined actions to execute on the at least one other configuration item are identified. The identified one or more pre-defined actions are executed on the at least one other configuration item.
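
A minimal Python sketch of this determine, identify, execute flow follows. The abstract does not say how impact is determined; the sketch assumes a dependency map between configuration items (CIs), and the names `respond_to_incident`, `events`, `depends_on`, and `actions` are hypothetical.

```python
def respond_to_incident(incident_ci, events, depends_on, actions):
    """Run pre-defined actions on other CIs whose events the incident impacts.

    events:     dict mapping CI -> list of scheduled event names
    depends_on: dict mapping CI -> set of CIs it depends on
    actions:    dict mapping CI -> list of callables taking the CI name
    All three structures are illustrative assumptions.
    """
    executed = []
    for ci, scheduled in events.items():
        if ci == incident_ci or not scheduled:
            continue  # only other CIs that have at least one event matter
        # Assumption: an event is impacted if its CI depends on the incident CI.
        if incident_ci in depends_on.get(ci, set()):
            for action in actions.get(ci, []):
                executed.append((ci, action(ci)))
    return executed


events = {"web": ["deploy"], "cache": []}
depends_on = {"web": {"db"}, "cache": {"db"}}
actions = {"web": [lambda ci: f"postpone events on {ci}"]}
print(respond_to_incident("db", events, depends_on, actions))
```

Here an incident on `db` triggers the action for `web`, which has a scheduled event and depends on `db`, while `cache` is skipped because it has no events.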

Autonomous release management in distributed computing systems

Implementations described herein relate to methods, systems, and computer-readable media to provide an alert based on a release of a software application implemented in a distributed computing system. In some implementations, the method includes receiving, at a processor, an indication of the release of the software application, obtaining a first set of metric values for each metric of a list of metrics for a first time period preceding the time of the release, obtaining a second set of metric values for each metric of the list of metrics for a second time period following the time of the release, comparing the first set of metric values to the second set of metric values to determine a deviation score, generating an alert based on the deviation score, and transmitting the alert via one of a user interface and a communication channel.
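
The before-and-after comparison can be sketched in Python as follows. The abstract does not define the deviation score, so the formula used here (the largest shift in a metric's mean, measured in pre-release standard deviations) is an assumption, as are the function names and the alert threshold of 3.0.

```python
from statistics import mean, pstdev


def deviation_score(before, after):
    """Largest shift in any metric's mean, in units of its pre-release spread.

    `before` and `after` map metric name -> list of values for the periods
    preceding and following the release. The scoring rule is an assumption.
    """
    scores = []
    for metric, pre in before.items():
        post = after[metric]
        spread = pstdev(pre) or 1.0  # fall back to 1.0 for a constant metric
        scores.append(abs(mean(post) - mean(pre)) / spread)
    return max(scores, default=0.0)


def check_release(before, after, threshold=3.0):
    """Return the score and whether it warrants an alert (hypothetical threshold)."""
    score = deviation_score(before, after)
    return {"alert": score > threshold, "score": score}


before = {"error_rate": [1.0, 2.0, 3.0]}
after = {"error_rate": [10.0, 11.0, 12.0]}
print(check_release(before, after)["alert"])
```

Transmitting the alert over a user interface or communication channel is left out; the returned dictionary is where such a step would hook in.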