Patent classifications
G06F18/23211
Multiclassification approach for enhancing natural language classifiers
In an approach to creating models utilizing optimally clustered training sets, one or more computer processors determine an optimal cluster size. The one or more computer processors generate one or more clusters from one or more classes and respectively associated training statements that are contained in a training set, based on the determined optimal cluster size, wherein the one or more generated clusters, respectively, contain fewer classes than the training set. The one or more computer processors identify one or more isolated high confidence classes and associated training statements from one or more cluster classifications generated by a static model trained with the one or more generated clusters. The one or more computer processors create one or more dynamic models trained with the one or more identified isolated high confidence classes. The one or more computer processors perform one or more classifications utilizing the one or more created dynamic models.
Granular Cluster Generation for Real-Time Processing
Generating granular clusters for real-time processing is provided. The systems can identify tokens based on aggregating input from computing devices over a time interval. The systems can identify, based on metrics, a subset of tokens for cluster generation. The systems can generate, via a clustering technique, token clusters from the subset of the tokens, each of the token clusters comprising two or more tokens from the subset of the tokens. The systems can apply a de-duplication technique to each of the token clusters. The systems can apply a filtering technique to the token clusters to remove tokens erroneously grouped in a token cluster. The systems can assign, based on a selection process, a label for each of the token clusters. The systems can activate, based on a number of remaining tokens in each of the token clusters, a subset of the token clusters for real-time content selection.
Granular Cluster Generation for Real-Time Processing
Generating granular clusters for real-time processing is provided. The systems can identify tokens based on aggregating input from computing devices over a time interval. The systems can identify, based on metrics, a subset of tokens for cluster generation. The systems can generate, via a clustering technique, token clusters from the subset of the tokens, each of the token clusters comprising two or more tokens from the subset of the tokens. The systems can apply a de-duplication technique to each of the token clusters. The systems can apply a filtering technique to the token clusters to remove tokens erroneously grouped in a token cluster. The systems can assign, based on a selection process, a label for each of the token clusters. The systems can activate, based on a number of remaining tokens in each of the token clusters, a subset of the token clusters for real-time content selection.
ARTIFICIAL INTELLIGENCE-BASED BASE CALLING
The technology disclosed processes input data through a neural network and produces an alternative representation of the input data. The input data includes per-cycle image data for each of one or more sequencing cycles of a sequencing run. The per-cycle image data depicts intensity emissions of one or more analytes and their surrounding background captured at a respective sequencing cycle. The technology disclosed processes the alternative representation through an output layer and producing an output and base calls one or more of the analytes at one or more of the sequencing cycles based on the output.
ARTIFICIAL INTELLIGENCE-BASED BASE CALLING
The technology disclosed processes input data through a neural network and produces an alternative representation of the input data. The input data includes per-cycle image data for each of one or more sequencing cycles of a sequencing run. The per-cycle image data depicts intensity emissions of one or more analytes and their surrounding background captured at a respective sequencing cycle. The technology disclosed processes the alternative representation through an output layer and producing an output and base calls one or more of the analytes at one or more of the sequencing cycles based on the output.
SEMANTIC MAP PRODUCTION SYSTEM AND METHOD
The system includes a metric map creation unit configured to create a metric map using first image data received from a 3D sensor, an image processing unit configured to recognize an object by creating and classifying a point cloud using second image data received from an RGB camera; a probability-based map production unit configured to create an object location map and a spatial semantic map in a probabilistic expression method using a processing result of the image processing unit, a question creation unit configured to extract a portion of high uncertainty about an object class from a produced map on the basis of entropy and ask a user about the portion, and a map update unit configured to receive a response from the user and update a probability distribution for spatial information according to a change in probability distribution for classification of the object.
Plant data classification device, plant data display processing device, and plant control system
A plant data classification device according to one aspect of the invention includes: a data classification unit that classifies multidimensional operation data into categories; an evaluation index calculation unit that calculates an evaluation index of a category from a value of the operation data; a classification result evaluation unit that calculates a variation in the evaluation index for each category and determines whether the variation in the evaluation index is less than or equal to a reference value; and a parameter changing unit that changes, when it is determined that the variation in the evaluation index exceeds the reference value, a value of a parameter that defines a size of a category of the data classification unit in a direction of decreasing the size of the category.
Artificial Intelligence-Based Determination of Analyte Data for Base Calling
The technology disclosed relates to artificial intelligence based determination of analyte data for base calling. In particular, the technology disclosed uses input image data that is derived from a sequence of images. Each image in the sequence of images represents an imaged region and depicts intensity emissions indicative of one or more analytes and a surrounding background of the intensity emissions at a respective one of a plurality of sequencing cycles of a sequencing run. The input image data comprises image patches extracted from each image in the sequence of images. The input image data is processed through a neural network to generate an alternative representation of the input image data. The alternative representation is processed through an output layer to generate an output indicating properties of respective portions of the imaged region.
Methods, Systems, and Apparatuses for Quantitative Analysis of Heterogeneous Biomarker Distribution
Methods, systems, and apparatuses for detecting and describing heterogeneity in a cell sample are disclosed herein. A plurality of fields of view (FOV) are generated for one or more areas of interest (AOI) within an image of the cell sample are generated. Hyperspectral or multispectral data from each FOV is organized into an image stack containing one or more z-layers, with each z-layer containing intensity data for a single marker at each pixel in the FOV. A cluster analysis is applied to the image stacks, wherein the clustering algorithm groups pixels having a similar ratio of detectable marker intensity across layers of the z-axis, thereby generating a plurality of clusters having similar expression patterns.
Time series clustering analysis for forecasting demand
Product demand forecasting accuracy utilizes partitional clustering of time series data with dynamic time warping. The product demand forecasting disclosed herein is particularly suited to forecasting product demand for products with limited sales data. Time-series sales data of a producs (or group of products) with limited sales data (e.g. a sparse or no time series of sales data) are dynamically time warped with sales data of products, or groups of products, having extensive sales data (e.g., an extensive time series of sales data) to determine a clustering model with an optimal number of clusters and a prototype time series for each cluster in the model. The prototype time series for the cluster in which the product (or group of products) with limited sales data lies is utilized as its product demand forecast.