Patent classifications
G06F11/079
SYSTEMS AND METHODS FOR POWER LOSS PROTECTION OF STORAGE RESOURCES
In accordance with embodiments of the present disclosure, a method for power loss protection of one or more storage resources may include receiving information from each of the one or more storage resources regarding power loss protection capabilities of such storage resource. The method may also include based on the information, repurposing, for each power loss protection capable storage resource, a communications channel between a logic device and such power loss protection capable storage resource for transmission of a respective early power-off warning signal for such power loss protection capable storage resource. The method may further include in response to a power event of a power supply unit for providing electrical energy to the one or more storage resources, asserting for each power loss protection capable storage resource its respective early power-off warning signal.
INFORMATION PROCESSING DEVICE AND METHOD OF TESTING
An information processing device includes a first port and a processor coupled to the first port and configured to transmit, via the first port, a first signal to a first device coupled to the first port, cause a second device coupled to the first port to determine whether a failure is present in the first port when the information processing device does not receive a first response signal in response to the first signal, and determine that the failure is present in the first device when the second device does not determine that the failure is present in the first port.
Mitigating a voltage condition of a memory cell in a memory sub-system
A determination that a first programming operation has been performed on a particular memory cell can be made. A determination can be made, based on one or more threshold criteria, whether the particular memory cell has transitioned from a state associated with a decreased error rate to another state associated with an increased error rate. In response to determining that the particular memory cell has transitioned from the state associated with the decreased error rate to the another state associated with the increased error rate, an operation can be performed on the particular memory cell to transition the particular memory cell from the another state associated with the increased error rate to the state associated with the decreased error rate.
Error dynamics analysis
A method, a system, and a computer program product for analyzing error messages. A first error log generated as a result of an execution of at least one task of a computing system at a first instance is received. The first error log include a plurality of first error messages. A first association rules model is generated using the first error messages. The first association rules model includes a plurality of association rules defining one or more relationships. A second error log, including a plurality of second error messages, generated as a result of an execution of the task at a second instance is received and a second association rules model is generated using the second error messages. Based on the first and second association rules models, at least one error message pattern associated with execution of the at least one task is determined.
Control system, control method, and control program
A control system includes an information processing device that communicates with a controller that controls a control target. The controller or the information processing device include a storage device that stores one or more SQL statements to be executed and an execution result the one or more SQL statements in association with each other as log data. The information processing device includes a display controller that displays on a display an SQL statement to be corrected that has an unsuccessful execution result; an operation unit that accepts a correction operation on the SQL statement and an execution operation; and a communication interface that sends an execution instruction to execute the corrected SQL statement to the controller upon receipt of the execution operation and to receive an execution result of the corrected SQL statement from the controller. The display controller displays an execution result of the corrected SQL statement.
PROFILING AND DIAGNOSTICS FOR INTERNET OF THINGS
A computing device and method for profiling and diagnostics in an Internet of Things (IoT) system, including matching an observed solution characteristic of the IoT system to an anomaly in an anomaly database.
SYSTEM AND METHOD FOR ANOMALY DETECTION AND ROOT CAUSE AUTOMATION USING SHRUNK DYNAMIC CALL GRAPHS
A system and method for real-time or near real-time anomaly detection and root cause automation in production environments or in other environments using shrunk dynamic call graphs are provided. The system includes an instrumentation agent that generates shrunk dynamic call graphs and exceptions/errors by injecting monitoring code or probes or call-tags into monitored application, a data agent that forwards collected data to the analysis engine over a network, an analysis engine that performs continuous clustering using machine learning, anomaly, and root cause detection. The system also includes a reporting module to report the anomaly.
METHOD AND SYSTEM FOR IDENTIFYING ROOT CAUSE OF A HARDWARE COMPONENT FAILURE
In general, embodiments relate to a method for identifying hardware component failures, comprising: obtaining system logs that show a transition of device states for a device; using a normalization and filtering module to process and extract relevant data from the system logs and important keywords for the device; creating a device state path for the device from a healthy device state to an unhealthy device state using the extracted relevant data; obtaining the device state path for the device from a storage and a current device state of the device; predicting a next device state of the device based on the current device state using an analysis module; generating a device state chain using the device state path, current device state, and next device state; and identifying root cause of a hardware component failure using the device state chain.
SYSTEMS AND METHODS FOR AUTOMATICALLY APPLYING CONFIGURATION CHANGES TO COMPUTING CLUSTERS
A system includes a memory and a processor. The processor is configured to access one or more configuration logs generated by a computing cluster. The processor is further configured to determine, by analyzing the one or more configuration logs, a particular service running on the computing cluster that has generated a plurality of errors within the plurality of log messages. The processor is further configured to determine whether the particular error has previously occurred. The processor is further configured to, in response to determining that the particular error has previously occurred, generate and send one or more commands to the computing cluster. The one or more commands are operable to change a current configuration value for the particular service running on the computing cluster to a new configuration value. The new configuration value is based on a historical value stored in the database of historical configuration errors.
Insider attack resistant system and method for cloud services integrity checking
An insider attack resistant system for providing cloud services integrity checking is disclosed. In particular, the system utilizes an automated integrity checking script and virtual machines to check the integrity of a service. The system may utilize the integrity checking script and virtual machines to execute a set of operations associated with the service so as to check the integrity of the service. When executing the set of operations, the system may only have access to the minimum level of access to peripherals that is required for each operation in the set of operations to be executed. After each operation is executed, the system may log each result for each operation, and analyze each result to determine if a failure exists for any of the operations. If a failure exists, the system may determine that a change in an expected system behavior associated with the service has occurred.