Patent classifications
G06F18/2137
Subset conditioning using variational autoencoder with a learnable tensor train induced prior
The proposed model is a Variational Autoencoder having a learnable prior that is parametrized with a Tensor Train (VAE-TTLP). The VAE-TTLP can be used to generate new objects, such as molecules, that have specific properties and that can have specific biological activity (when a molecule). The VAE-TTLP can be trained in a way with the Tensor Train so that the provided data may omit one or more properties of the object, and still result in an object with a desired property.
METHOD, ELECTRONIC DEVICE, AND COMPUTER PROGRAM PRODUCT FOR PROCESSING DATA
Embodiments of the present disclosure relate to a method, an electronic device, and a computer program product for processing data. The method includes determining a reference tensor based on a tensor representing multidimensional data, where the reference tensor is associated with a target tensor. The method further includes decomposing the reference tensor to obtain multiple low-rank tensors, where a rank of each of the low-rank tensors is lower than that of the reference tensor. The method further includes determining the target tensor based on the multiple low-rank tensors so as to determine multidimensional data at a specific moment. By means of embodiments of the present disclosure, the overhead of computing resources may be reduced, and the time for processing data may be reduced.
METHOD, ELECTRONIC DEVICE, AND COMPUTER PROGRAM PRODUCT FOR PROCESSING DATA
Embodiments of the present disclosure relate to a method, an electronic device, and a computer program product for processing data. The method includes determining a reference tensor based on a tensor representing multidimensional data, where the reference tensor is associated with a target tensor. The method further includes decomposing the reference tensor to obtain multiple low-rank tensors, where a rank of each of the low-rank tensors is lower than that of the reference tensor. The method further includes determining the target tensor based on the multiple low-rank tensors so as to determine multidimensional data at a specific moment. By means of embodiments of the present disclosure, the overhead of computing resources may be reduced, and the time for processing data may be reduced.
Narrative evaluator
A system includes a narrative repository which stores a plurality of narratives and, for each narrative, a corresponding outcome. A narrative evaluator receives the plurality of narratives and the outcome for each narrative. For each received narrative, a subset of the narrative is determined to retain based on rules. For each determined subset, a entropy matrix is determined which includes, for each word in the subset, a measure associated with whether the word is expected to appear in a sentence with another word in the subset. For each entropy matrix, a distance matrix is determined which includes, for each word in the subset, a numerical representation of a difference in meaning of the word and another word. Using one or more distance matrix(es), a first threshold distance is determined for a first word of the subset. The first word and first threshold are stored as a first word-threshold pair associated with the first outcome.
HYDRAULIC TURBINE CAVITATION ACOUSTIC SIGNAL IDENTIFICATION METHOD BASED ON BIG DATA MACHINE LEARNING
The present invention provides a hydraulic turbine cavitation acoustic signal identification method based on big data machine learning. According to the method, time sequence clustering based on multiple operating conditions under the multi-output condition of the hydraulic turbine set is performed by utilizing an neural network, characteristic quantities of the hydraulic turbine set under a steady condition in a healthy state is screened; a random forest algorithm is introduced to perform feature screening of multiple measuring points under steady-state operation of the hydraulic turbine set, optimal feature measuring points and optimal feature subsets are extracted, finally a health state prediction model is constructed by using gated recurrent units; whether incipient cavitation is present in the equipment is judged. The present invention can effectively identify the occurrence of incipient cavitation in the hydraulic turbine set, reducing unnecessary shutdown of the equipment and prolonging the service life.
Prediction system for simulating the effects of a real-world event
A prediction system for simulating effects of a real-world event can be used for autonomous driving. In operation, the system receives input data regarding a complex system (e.g., roadways) and various real-world events. A full-scale network is constructed of the complex system, such that nodes represent road intersections and edges between nodes represent road segments linking the road intersections. The network is reduced is scaled down to generate a multi-layer model of the complex system. Each layer in the model is simulated to identify equilibrium flows, with the model thereafter destabilized by applying stimuli to reflect the real-world event. An autonomous vehicle can then be caused to chart and traverse a road path based on road segments and intersections that are least affected by the real-world event.
GROUPING SOFTWARE APPLICATIONS BASED ON TECHNICAL FACETS
Embodiments of the present disclosure provide to techniques for automatically grouping software applications based on their technical patterns/characteristics (i.e., technical facets) via machine learning (ML) algorithms. For instance, a first set of software applications that exhibit a high prevalence of one or more first technical facets may be grouped into a first category, a second set of software applications that exhibit a high prevalence of one or more second technical facets may be grouped into a second category, and so on. Once grouped into categories, the software applications in a given category may be assessed, analyzed, and/or processed together for various purposes.
GROUPING SOFTWARE APPLICATIONS BASED ON TECHNICAL FACETS
Embodiments of the present disclosure provide to techniques for automatically grouping software applications based on their technical patterns/characteristics (i.e., technical facets) via machine learning (ML) algorithms. For instance, a first set of software applications that exhibit a high prevalence of one or more first technical facets may be grouped into a first category, a second set of software applications that exhibit a high prevalence of one or more second technical facets may be grouped into a second category, and so on. Once grouped into categories, the software applications in a given category may be assessed, analyzed, and/or processed together for various purposes.
Mapper component for a neuro-linguistic behavior recognition system
Techniques are disclosed for generating a sequence of symbols based on input data for a neuro-linguistic model. The model may be used by a behavior recognition system to analyze the input data. A mapper component of a neuro-linguistic module in the behavior recognition system receives one or more normalized vectors generated from the input data. The mapper component generates one or more clusters based on a statistical distribution of the normalized vectors. The mapper component evaluates statistics and identifies statistically relevant clusters. The mapper component assigns a distinct symbol to each of the identified clusters.
Mapper component for a neuro-linguistic behavior recognition system
Techniques are disclosed for generating a sequence of symbols based on input data for a neuro-linguistic model. The model may be used by a behavior recognition system to analyze the input data. A mapper component of a neuro-linguistic module in the behavior recognition system receives one or more normalized vectors generated from the input data. The mapper component generates one or more clusters based on a statistical distribution of the normalized vectors. The mapper component evaluates statistics and identifies statistically relevant clusters. The mapper component assigns a distinct symbol to each of the identified clusters.