Patent classifications
G06F11/2263
Software application diagnostic aid
A diagnostics tool aids in the efficient collection of relevant error data for addressing faults in connected software systems. Context information is collected from a software system that is being displayed. Configuration of the software system is collected, and used to identify relevant connected software systems. Error data is collected via respective log interfaces from the error logs of the software system being displayed, and relevant connected systems. The context, configuration, and error data is stored in a database. Based at least upon the configuration information, a query is formulated and posed to the database. A corresponding query result is received and processed to return an error report to a user interface, for inspection (e.g., by a user or a support staff member). Certain embodiments may further generate an appropriate recommendation based upon the query result. The recommendation may be generated with reference to a stored ruleset and/or artificial intelligence.
PERFORM PREEMPTIVE IDENTIFICATION AND REDUCTION OF RISK OF FAILURE IN COMPUTATIONAL SYSTEMS BY TRAINING A MACHINE LEARNING MODULE
A machine learning module is trained by receiving inputs comprising attributes of a computing environment, where the attributes affect a likelihood of failure in the computing environment. In response to an event occurring in the computing environment, a risk score that indicates a predicted likelihood of failure in the computing environment is generated via forward propagation through a plurality of layers of the machine learning module. A margin of error is calculated based on comparing the generated risk score to an expected risk score, where the expected risk score indicates an expected likelihood of failure in the computing environment corresponding to the event. An adjustment is made of weights of links that interconnect nodes of the plurality of layers via back propagation to reduce the margin of error, to improve the predicted likelihood of failure in the computing environment.
AUTOMATIC PREDICTION SYSTEM FOR SERVER FAILURE AND METHOD OF AUTOMATICALLY PREDICTING SERVER FAILURE
The present invention relates to an automatic prediction system for a server failure, which monitors the status of a single server connected to a network and providing web, DB and network services, and predicts and warns a server failure of a target system by using the collected status data of the single server.
The automatic prediction system for a server failure comprises: a data collection module to collect status information of a server and service of a target system; a model generation and optimization module to generate a CNN-based failure prediction model by using the collected data and to optimize model parameters and hyper-parameter values; and a prediction module to perform online failure prediction by using the optimized CNN-based failure prediction model.
SYSTEMS AND METHODS FOR MANAGING DISTRIBUTED SALES, SERVICE AND REPAIR OPERATIONS
The systems and methods of the present disclosure are generally related to managing distributed sales, service and repair operations. In particular, the systems and methods of the present disclosure relate to managing a distributed network of sales, service and/or repair operations that include automated features.
DISK DRIVE FAILURE PREDICTION WITH NEURAL NETWORKS
Techniques are described herein for predicting disk drive failure using a machine learning model. The framework involves receiving disk drive sensor attributes as training data, preprocessing the training data to select a set of enhanced feature sequences, and using the enhanced feature sequences to train a machine learning model to predict disk drive failures from disk drive sensor monitoring data. Prior to the training phase, the RNN LSTM model is tuned using a set of predefined hyper-parameters. The preprocessing, which is performed during the training and evaluation phase as well as later during the prediction phase, involves using predefined values for a set of parameters to generate the set of enhanced sequences from raw sensor reading. The enhanced feature sequences are generated to maintain a desired healthy/failed disk ratio, and only use samples leading up to a last-valid-time sample in order to honor a pre-specified heads-up-period alert requirement.
Missing values imputation of sequential data
A method and system of imputing corrupted sequential data is provided. A plurality of input data vectors of a sequential data is received. For each input data vector of the sequential data, the input data vector is corrupted. The corrupted input data vector is mapped to a staging hidden layer to create a staging vector. The input data vector is reconstructed based on the staging vector, to provide an output data vector. adjusted parameter of the staging hidden layer is iteratively trained until it is within a predetermined tolerance of a loss function. A next input data vector of the sequential data is predicted based on the staging vector. The predicted next input data vector is stored.
Detection of misbehaving components for large scale distributed systems
A method or apparatus for monitoring a system by detecting misbehaving components in the system is presented. A computing device receives historical data points based on a set of monitored signals of a system. The system has components that are monitored through the set of monitored signals. For each monitored component, the computing device performs unsupervised machine learning based on the historical data points to identify expected states and state transitions for the component. The computing device identifies one or more steady components based on the identified states of the monitored components. The computing device also receives real-time data points based on monitoring the set of signals from the system. For each identified steady component, the computing device examines the received real-time data points for deviation from the expected state and state transitions of the steady component. The computing device reports anomaly in the system based on the detected deviations.
FAILED AND CENSORED INSTANCES BASED REMAINING USEFUL LIFE (RUL) ESTIMATION OF ENTITIES
Estimating Remaining Useful Life (RUL) from multi-sensor time series data is difficult through manual inspection. Current machine learning and data analytics methods, for RUL estimation require large number of failed instances for training, which are rarely available in practice, and these methods cannot use information from currently operational censored instances since their failure time is unknown. Embodiments of the present disclosure provide systems and methods for estimating RUL using time series data by implementing an LSTM-RNN based ordinal regression technique, wherein during training RUL value of failed instance(s) is encoded into a vector which is given as a target to the model. Unlike a failed instance, the exact RUL for a censored instance is unknown. For using the censored instances, target vectors are generated and the objective function is modified for training wherein the trained LSTM-RNN based ordinal regression is applied on an input test time series for RUL estimation.
Intra-class adaptation fault diagnosis method for bearing under variable working conditions
The invention relates to a fault diagnosis method for a rolling bearing under variable working conditions. Based on a convolutional neural network, a transfer learning algorithm is combined to handle the problem of the reduced universality of deep learning models. Data acquired under different working conditions is segmented to obtain samples. The samples are preprocessed by using FFT. Low-level features of the samples are extracted by using improved ResNet-50, and a multi-scale feature extractor analyzes the low-level features to obtain high-level features as inputs of a classifier. In a training process, high-level features of training samples and test samples are extracted, and a conditional distribution distance between them is calculated as a part of a target function for backpropagation to implement intra-class adaptation, thereby reducing the impact of domain shift, to enable a deep learning model to better carry out fault diagnosis tasks.
SYSTEM AND METHOD FOR ASSISTING USER TO RESOLVE A HARDWARE ISSUE AND A SOFTWARE ISSUE
The present disclosure relates to system(s) and method(s) for assisting a user to resolve a hardware issue and a software issue. The system identifies, a target cluster, associated with a new ticket received from the user, from the set of clusters. Further, the system recommends one or more runbook scripts, from a runbook repository, associated with the new ticket. The system further identifies a new runbook script, corresponding to the new ticket, from a set of external repositories. Further, the system executes at least one of the one or more runbook scripts or the new runbook script, associated with the new ticket. The system further generates a document based on the execution of the one or more runbook scripts or the new runbook script, thereby assisting the user to resolve a target issue.