Patent classifications
G06F18/2133
IDENTIFIER VELOCITY ANOMALY DETECTION
A processing system including at least one processor may obtain a personal identifier comprising a plurality of characters and generate a first embedding of the personal identifier in accordance with an embedding model. The processing system may then identify one or more embeddings of other personal identifiers that are within a threshold distance of the first embedding and generate an alert in response to the identifying of the one or more embeddings of the other personal identifiers that are within the threshold distance.
IDENTIFIER VELOCITY ANOMALY DETECTION
A processing system including at least one processor may obtain a personal identifier comprising a plurality of characters and generate a first embedding of the personal identifier in accordance with an embedding model. The processing system may then identify one or more embeddings of other personal identifiers that are within a threshold distance of the first embedding and generate an alert in response to the identifying of the one or more embeddings of the other personal identifiers that are within the threshold distance.
Hierarchical topic model with an interpretable topic hierarchy
Some techniques described herein relate to generating a hierarchical topic model (HTM), which can be used to generate custom content. In one example, a method includes determining first-level topics in a topic hierarchy related to a corpus of documents. A first-level topic of the first-level topics includes multiple words. The multiple words are grouped into clusters based on word embeddings of the multiple words. The multiple words are then subdivided into second-level topics as subtopics of the first-level topic, such that the number of second-level topics equals the number of clusters. A document of the corpus of documents is assigned to the first-level topic and to a second-level topic of the second-level topics, and an indication is received of access by a user to the document. Custom content is generated for the user based on one or more other documents assigned to the first-level topic and the second-level topic.
Hierarchical topic model with an interpretable topic hierarchy
Some techniques described herein relate to generating a hierarchical topic model (HTM), which can be used to generate custom content. In one example, a method includes determining first-level topics in a topic hierarchy related to a corpus of documents. A first-level topic of the first-level topics includes multiple words. The multiple words are grouped into clusters based on word embeddings of the multiple words. The multiple words are then subdivided into second-level topics as subtopics of the first-level topic, such that the number of second-level topics equals the number of clusters. A document of the corpus of documents is assigned to the first-level topic and to a second-level topic of the second-level topics, and an indication is received of access by a user to the document. Custom content is generated for the user based on one or more other documents assigned to the first-level topic and the second-level topic.
MACHINE VISION SYSTEM FOR RECOGNIZING NOVEL OBJECTS
Described is a system for classifying novel objects in imagery. In operation, the system extracts salient patches from a plurality of unannotated images using a multi-layer network. Activations of the multi-layer network are clustered into key attribute, with the key attributes being displayed to a user on a display, thereby prompting the user to annotate the key attributes with class label. An attribute database is then generated based on user prompted annotations of the key attributes. A test image can then be passed through the system, allowing the system to classify at least one object in the test image by identifying an object class in the attribute database. Finally, a device can be caused to operate or maneuver based on the classification of the at least one object in the test image.
Massively parallel processing (MPP) large-scale combination of time series data
Methods and apparatus are provided for performing massively parallel processing (MPP) large-scale combinations of time series data. A given working compute node in a distributed computing environment obtains a given group of time series data of a plurality of groups of time series data; generates a measurement matrix for the given group based on a plurality of selected time series and a plurality of time lags of the selected time series; processes the measurement matrix to generate a first linear model with a predefined number of first independent selected variables; assigns a score to each first independent selected variable; and provides the first independent selected variables and assigned scores to a master compute node that ranks the first independent selected variables for all groups from all working computing nodes according to assigned scores; selects a predefined number of second independent selected variables based on a final rank to create a final group of time series; and processes the final group of time series to generate a final linear model.
SYSTEM IN COMMUNICATION WITH A MANAGED INFRASTRUCTURE
A system is in communication with a managed infrastructure. An extraction engine is in communication with a managed infrastructure. The extraction engine is configured to receive managed infrastructure data and produces events as well as populates an entropy database with a dictionary of event entropy that can be included in the entropy database. A signalizer engine that includes one or more of an NMF engine, a k-means clustering engine and a topology proximity engine. The signalizer engine inputs a list of devices and a list of connections between components or nodes in the managed infrastructure. The signalizer engine determines one or more common characteristics and produces clusters of events relating to failure or errors in at least one of the devices and connections between components or nodes in the managed infrastructure. The events are converted into words and subsets to group the events into clusters that relate to security of the managed infrastructure. In response to grouping the events, physical changes are made to at least a portion of the physical hardware. In response to production of the clusters, security of the managed infrastructure is maintained.
LEARNING DATA SELECTION METHOD, LEARNING DATA SELECTION DEVICE, AND COMPUTER-READABLE RECORDING MEDIUM
A non-transitory computer-readable recording medium stores therein a learning data selection program that causes a computer to execute a process including: extracting a first input data group relating to first input data in correspondence with designation of the first input data included in an input data group input to a machine learning model, the machine learning model classifying or determining transformed data that is transformed from input data; acquiring a first transformed data group of the machine learning model and a first output data group of the machine learning model, respectively, the first transformed data group being input to the machine learning model and corresponding to the first input data group, the first output data group corresponding to the first transformed data group; and selecting learning target data of an estimation model from the first input data group.
Unification of models having respective target classes with distillation
Generating soft labels used for training a unified model is achieved by unification of models having respective target classes with distillation. A collection of samples is prepared. Predictions are generated by individual trained models. Individual trained models have an individual class set to form a unified class set that includes target classes. The unified soft labels are estimated for each sample over the target classes in the unified class set from the predictions using a relation connecting a first output of each individual trained model and a second output of the unified model. The unified soft labels are output to train a unified model having the unified class set.
GENERATIVE AI AND AGENTIC AI SYSTEMS AND METHODS FOR PRODUCT DATA ANALYTICS AND OPTIMIZATION
Generative AI systems and methods are developed to provide recommendations regarding product sales, pricing, inventory, orders, manufacturing, distribution, shipping, packaging or other product analytics as determined from a range of available data sources. A consistent, semantic metadata structure is described as well as a hypothesis generating and testing system capable of generating predictive analytics models in a non-supervised or partially supervised mode. Users and/or AI agents (i.e., a form agentic AI) may then subscribe to the date for the use in economic forecasting.