G06F11/2252

Detecting datacenter mass outage with near real-time/offline using ML models

The present embodiments relate to data center outage detection and alert generation. An outage detection service as described herein can process near real-time data from various sources in a datacenter and process the data using a model to determine one or more projected sources of a detected outage. The model as described herein can include one or more machine learning models incorporating a series of rules to process near-real time data and offline data and determine one or more projected sources of an outage. An alert message can be generated to provide the projected sources of the outage and other data relevant to the outage.

PREDICTING AND RESOLVING ISSUES WITHIN A TELECOMMUNICATION NETWORK
20210399972 · 2021-12-23 ·

Disclosed here is a system to automatically predict and resolve issues within a telecommunication network. Initially, the system builds a service registry to store dependence information within the network, which can include software components and hardware components. Various components of the network create logs of their operations. Machine learning models examine the logs and detect any issues. Upon detecting an issue or abnormal event, the system can automatically resolve the issue by determining the most similar issue occurring previously and determining a solution that resolved the previous most similar issue. In addition, the system can propagate the fix to dependent systems and/or notify the dependent systems of the issue.

ERROR HANDLING METHOD AND APPARATUS
20220206891 · 2022-06-30 ·

An error handling method performed by a computing device, the computing device comprises at least one computing device component and a board management controller (BMC) coupled to the at least one computing device component, the method comprises the steps of a BMC detecting an error relating to at least one computing device component, the BMC determining from a database a technical specification to fix the error and generating information for accessing the technical specification. An error handling apparatus comprises a BMC and at least one computing device component coupled to the BMC. The BMC is configured to detect an error relating to the at least one computing device component, determine from in a database a technical specification to fix the error, and generate information for accessing the technical specification.

SYSTEM FOR RECOMMENDING TESTS FOR MOBILE COMMUNICATION DEVICES MAINTENANCE RELEASE CERTIFICATION
20220197765 · 2022-06-23 ·

Techniques for automatically selecting device tests for testing devices configured for operation in wireless communication networks, based upon maintenance releases (MRs) received from original equipment manufacturers. When an MR with changes for a device is received, the MR may be analyzed in order to determine what the changes pertain to with respect to the device. The changes may be clustered with respect to requirements for the changes and a knowledge base may be consulted by a recommendation engine in order to determine candidate tests for testing the MR. The candidate tests may be based upon previous tests, failed tests and, relevant tests. Based at least in part on the identified previous tests, failed tests and relevant tests, one or more tests may be selected for testing devices with respect to the newly received MR.

Fault rectification operation recommendation method and apparatus, and storage medium
11743113 · 2023-08-29 · ·

This application discloses a fault rectification operation recommendation method and apparatus, and a storage medium in the field of communication technologies. In some implementations, after fault information is obtained, a rectification contingency plan corresponding to the fault information may be searched for from a contingency plan library. When the rectification contingency plan corresponding to the fault information cannot be found in the contingency plan library, the fault information may be processed by using a recommendation model to obtain a recommended contingency plan. Further, one operation is selected from one or more candidate operations included in the recommended contingency plan as a fault rectification operation.

DETECTING DATACENTER MASS OUTAGE WITH NEAR REAL-TIME/OFFLINE USING ML MODELS

The present embodiments relate to data center outage detection and alert generation. An outage detection service as described herein can process near real-time data from various sources in a datacenter and process the data using a model to determine one or more projected sources of a detected outage. The model as described herein can include one or more machine learning models incorporating a series of rules to process near-real time data and offline data and determine one or more projected sources of an outage. An alert message can be generated to provide the projected sources of the outage and other data relevant to the outage.

AUTOMATED SYSTEM FOR INTELLIGENT ERROR CORRECTION WITHIN AN ELECTRONIC BLOCKCHAIN LEDGER

A system for automated and intelligent error correction within an electronic blockchain ledger is provided. The system may analyze unformatted/unstructured blockchain event logs using machine learning algorithms in order to identify and label the errors within the event logs. Based on the identified errors, the system may use predictive analysis in conjunction with error or rule repositories and/or machine learning to identify potential solutions to the identified errors. Once the potential solutions have been identified, the system may automatically attempt to rectify the blockchain transaction errors using the potential solutions. The system may further comprise trend/correlation analyses and reporting functions regarding various metrics and may output said metrics in various accessible formats.

Automated system for intelligent error correction within an electronic blockchain ledger

A system for automated and intelligent error correction within an electronic blockchain ledger is provided. The system may analyze unformatted/unstructured blockchain event logs using machine learning algorithms in order to identify and label the errors within the event logs. Based on the identified errors, the system may use predictive analysis in conjunction with error or rule repositories and/or machine learning to identify potential solutions to the identified errors. Once the potential solutions have been identified, the system may automatically attempt to rectify the blockchain transaction errors using the potential solutions. The system may further comprise trend/correlation analyses and reporting functions regarding various metrics and may output said metrics in various accessible formats.

Predicting and resolving issues within a telecommunication network
11831534 · 2023-11-28 · ·

Disclosed here is a system to automatically predict and resolve issues within a telecommunication network. Initially, the system builds a service registry to store dependence information within the network, which can include software components and hardware components. Various components of the network create logs of their operations. Machine learning models examine the logs and detect any issues. Upon detecting an issue or abnormal event, the system can automatically resolve the issue by determining the most similar issue occurring previously and determining a solution that resolved the previous most similar issue. In addition, the system can propagate the fix to dependent systems and/or notify the dependent systems of the issue.

DETECTING DATACENTER MASS OUTAGE WITH NEAR REAL-TIME/OFFLINE USING ML MODELS

The present embodiments relate to data center outage detection and alert generation. An outage detection service as described herein can process near real-time data from various sources in a datacenter and process the data using a model to determine one or more projected sources of a detected outage. The model as described herein can include one or more machine learning models incorporating a series of rules to process near-real time data and offline data and determine one or more projected sources of an outage. An alert message can be generated to provide the projected sources of the outage and other data relevant to the outage.