G06F7/20

Methods and Systems for Identifying a Level of Similarity Between a Plurality of Data Representations
20230289137 · 2023-09-14 ·

A reference map generator clusters, into a semantic map, a set of data documents selected according to at least one criterion and associated with a medical diagnosis. A parser generates an enumeration of measurements occurring in the set of data documents. A representation generator generates for each measurement in the enumeration, a sparse distributed representation (SDR). The method includes storing, by a processor on a second computing device, in each of a plurality of memory cells on the second computing device, one of the generated SDRs. A diagnosis support module receives a document comprising a plurality of measurements. The representation generator generates a compound SDR for the document. Each of the plurality of bitwise comparison circuits determine a level of overlap between the compound SDR and the stored generated SDR. The diagnosis support module provides an identification of the medical diagnosis associated with a stored SDR.

Auto tuning data anomaly detection
11442941 · 2022-09-13 · ·

Automatic tuning anomaly detection is described. The context metric keys are established during a training phase based on the surrounding context of data received from devices over time. Anomaly and tuning windows are also established for metric ranges of the context metric keys. After the training phase, incoming data is correlated against the keys to identify sets of the data associated with certain context metric keys. For any given context metric key, metric data values in the associated set of data fall either within or outside the metric range of the context metric key. If they fall outside the range for longer than the anomaly window, an alarm is raised. If they fall outside the range for longer than the tuning window, boundaries for the metric range are updated. Additionally, the contextual parameters of the context metric keys are also updated over time, as new data contexts appear.

Auto tuning data anomaly detection
11442941 · 2022-09-13 · ·

Automatic tuning anomaly detection is described. The context metric keys are established during a training phase based on the surrounding context of data received from devices over time. Anomaly and tuning windows are also established for metric ranges of the context metric keys. After the training phase, incoming data is correlated against the keys to identify sets of the data associated with certain context metric keys. For any given context metric key, metric data values in the associated set of data fall either within or outside the metric range of the context metric key. If they fall outside the range for longer than the anomaly window, an alarm is raised. If they fall outside the range for longer than the tuning window, boundaries for the metric range are updated. Additionally, the contextual parameters of the context metric keys are also updated over time, as new data contexts appear.

Factor analysis apparatus, factor analysis method, and non-transitory storage medium

An apparatus as an aspect of the present invention is a factor analysis apparatus that analyzes a relationship between a target event that is a target of factor analysis and an assumed factor of the target event, and includes a similarity calculator, a first influence calculator, and a second influence calculator. The similarity calculator calculates a degree of similarity between a data item included in provided time-series data and the assumed factor. The first influence calculator calculates a first degree of influence indicating a degree of influence of the data item on the target event on the basis of time-series data of the data item and time-series data of the target event. The second influence calculator calculates a second degree of influence indicating a degree of influence of the assumed factor on the target event on the basis of the degree of similarity and the first degree of influence.

Factor analysis apparatus, factor analysis method, and non-transitory storage medium

An apparatus as an aspect of the present invention is a factor analysis apparatus that analyzes a relationship between a target event that is a target of factor analysis and an assumed factor of the target event, and includes a similarity calculator, a first influence calculator, and a second influence calculator. The similarity calculator calculates a degree of similarity between a data item included in provided time-series data and the assumed factor. The first influence calculator calculates a first degree of influence indicating a degree of influence of the data item on the target event on the basis of time-series data of the data item and time-series data of the target event. The second influence calculator calculates a second degree of influence indicating a degree of influence of the assumed factor on the target event on the basis of the degree of similarity and the first degree of influence.

Methods and Systems for Identifying a Level of Similarity Between a Plurality of Data Representations
20220091817 · 2022-03-24 ·

A reference map generator clusters, into a semantic map, a set of data documents selected according to at least one criterion and associated with a medical diagnosis. A parser generates an enumeration of measurements occurring in the set of data documents. A representation generator generates for each measurement in the enumeration, a sparse distributed representation (SDR). The method includes storing, by a processor on a second computing device, in each of a plurality of memory cells on the second computing device, one of the generated SDRs. A diagnosis support module receives a document comprising a plurality of measurements. The representation generator generates a compound SDR for the document. Each of the plurality of bitwise comparison circuits determine a level of overlap between the compound SDR and the stored generated SDR. The diagnosis support module provides an identification of the medical diagnosis associated with a stored SDR.

Methods and Systems for Identifying a Level of Similarity Between a Plurality of Data Representations
20220091817 · 2022-03-24 ·

A reference map generator clusters, into a semantic map, a set of data documents selected according to at least one criterion and associated with a medical diagnosis. A parser generates an enumeration of measurements occurring in the set of data documents. A representation generator generates for each measurement in the enumeration, a sparse distributed representation (SDR). The method includes storing, by a processor on a second computing device, in each of a plurality of memory cells on the second computing device, one of the generated SDRs. A diagnosis support module receives a document comprising a plurality of measurements. The representation generator generates a compound SDR for the document. Each of the plurality of bitwise comparison circuits determine a level of overlap between the compound SDR and the stored generated SDR. The diagnosis support module provides an identification of the medical diagnosis associated with a stored SDR.

System and method for clustering data
11294624 · 2022-04-05 · ·

A system for clustering data comprises a database for storing a plurality of data items, a clustering unit comprising components operable to receive and cluster the plurality of data items, and output clustered data items. A method of clustering data comprises receiving a plurality of data items from a database; computing distances between each pair of data items in the plurality of data items; until a stopping criterion is reached, identifying key elements, merging data items not identified as key elements with the nearest key elements, updating the computed distances; once the stopping criterion is reached, outputting cluster identities of the plurality of data items.

System and method for clustering data
11294624 · 2022-04-05 · ·

A system for clustering data comprises a database for storing a plurality of data items, a clustering unit comprising components operable to receive and cluster the plurality of data items, and output clustered data items. A method of clustering data comprises receiving a plurality of data items from a database; computing distances between each pair of data items in the plurality of data items; until a stopping criterion is reached, identifying key elements, merging data items not identified as key elements with the nearest key elements, updating the computed distances; once the stopping criterion is reached, outputting cluster identities of the plurality of data items.

METHOD FOR CLASSIFYING AN UNMANAGED DATASET

A computer implemented method for classifying at least one source dataset of a computer system. The method may include providing a plurality of associated reference tables organized and associated in accordance with a reference storage model in the computer system. The method may also include calculating, by a data classifier application of the computer system, a first similarity score between the source dataset and a first reference table of the reference tables based on common attributes in the source dataset and a join of the first reference table with at least one further reference table of the reference tables having a relationship with the first reference table. The method may further include classifying, by the data classifier application, the source dataset by determining using at least the calculated first similarity score whether the source dataset is organized as the first reference table in accordance to the reference storage model.