Patent classifications
G06F18/2451
TWO-STAGE FREQUENCY SELECTION METHOD AND DEVICE FOR MICROWAVE FREQUENCY SWEEP DATA
Disclosed is a two-stage frequency selection method and device for microwave frequency sweep data. The method includes: acquiring microwave frequency sweep data; performing frequency selection on the microwave frequency sweep data by using a random forest-recursive feature elimination algorithm, taking a preset parameter in the random forest-recursive feature elimination algorithm as a hyper-parameter, changing the value of the hyper-parameter, and generating a series of candidate frequency subsets within different frequencies; building prediction models on the basis of the frequency sweep data corresponding to the candidate frequency subsets of different frequencies; evaluating the performance of each prediction model by means of 10 fold cross validation, and calculating evaluation index values of model performance; and taking the evaluation indexes as a voting basis, and selecting an optimal frequency subset by using a majority voting method.
PHENOTYPING TUMOR INFILTRATING LYMPHOCYTES ON HEMATOXYLIN AND EOSIN (H&E) STAINED TISSUE IMAGES TO PREDICT RECURRENCE IN LUNG CANCER
The present disclosure relates to an apparatus including one or more processors configured to receive a digitized image of a region of tissue demonstrating a disease, and containing cellular structures represented in the digitized image, each of the cellular structures being associated with a cell category of a plurality of cell categories; select a cellular structure of the cellular structures based on the cell category for the cellular structure; for the cellular structure selected, compute a set of contextual features; assign, based on the set of contextual features, the cellular structure to at least one cluster of a plurality of clusters; compute cluster features, the cluster features describing characteristics of the at least one cluster of the plurality of clusters; and generate a prediction that describes a pathologic or phenotypic state of the disease based, at least in part, on the cluster features and/or the set of contextual features.
Location sensitive ensemble classifier
Computer-implemented systems and methods for generating and using a location sensitive ensemble classifier for classifying content includes dividing a validation data set into regions. Each region encompasses data points of the validation data set that fall within the region. A regional ensemble classifier is generated for each region based on the data points that fall within the region. A content item is then classified in at least one of a plurality of classes using the regional ensemble classifier for the region to which the content item belongs.
Categorical feature enhancement mechanism for gradient boosting decision tree
A computer implemented method of generating a gradient boosting decision tree for obtaining predictions includes finding split points by sorting variable values of a feature by their gradient during training of the gradient boosting decision tree, performing a linear search to find a subset of variables with maximum split gain, and modifying a node of the gradient boosting decision tree to have multiple split points on the node for a feature as a function of the linear search. In a further example, a computer implemented method of controlling overfitting in a gradient boosting decision tree includes combining values of low population feature values into a virtual bin, fanning out the virtual bin into feature values having a low population, and including the low population feature values into multiple split points on a node of the gradient boosting decision tree.
Categorical feature enhancement mechanism for gradient boosting decision tree
A computer implemented method of generating a gradient boosting decision tree for obtaining predictions includes finding split points by sorting variable values of a feature by their gradient during training of the gradient boosting decision tree, performing a linear search to find a subset of variables with maximum split gain, and modifying a node of the gradient boosting decision tree to have multiple split points on the node for a feature as a function of the linear search. In a further example, a computer implemented method of controlling overfitting in a gradient boosting decision tree includes combining values of low population feature values into a virtual bin, fanning out the virtual bin into feature values having a low population, and including the low population feature values into multiple split points on a node of the gradient boosting decision tree.
STORAGE MEDIUM, MODEL GENERATION METHOD, AND INFORMATION PROCESSING APPARATUS
A non-transitory computer-readable storage medium storing a model generation program that causes at least one computer to execute a process that includes acquiring, on a first assumption that assumes each of individual data items included in a training data set is easy for a user to interpret, each of first values for each of the individual data items by optimizing an objective function that has a loss weight related to ease of interpretation of the data item by using the training data set; acquiring, on a second assumption that assumes each of the individual data items is not easy, each of second values; selecting a specific data item from the individual data items based on each of the first values and each of the second values for each of the individual data items; and generating a linear model using user evaluation for the specific data item.
Sequential minimal optimization algorithm for learning using partially available privileged information
Computational algorithms integrate and analyze data to consider multiple interdependent, heterogeneous sources and forms of patient data, and using a classification model, provide new learning paradigms, including privileged learning and learning with uncertain clinical data, to determine patient status for conditions such as acute respiratory distress syndrome (ARDS) or non-ARDS.
Embeddings + SVM for teaching traversability
A system includes a memory module configured to store image data captured by a camera and an electronic controller communicatively coupled to the memory module. The electronic controller is configured to receive image data captured by the camera, implement a neural network trained to predict a drivable portion in the image data of an environment. The neural network predicts the drivable portion in the image data of the environment. The electronic controller is configured to implement a support vector machine. The support vector machine determines whether the predicted drivable portion of the environment output by the neural network is classified as drivable based on a hyperplane of the support vector machine and output an indication of the drivable portion of the environment.
Machine learning and/or image processing for spectral object classification
In one embodiment, a method of machine learning and/or image processing for spectral object classification is described. In another embodiment, a device is described for using spectral object classification. Other embodiments are likewise described.
Optical time domain reflectometer (OTDR)-based classification for fiber optic cables using machine learning
In one embodiment, a device receives optical time domain reflectometer (OTDR) trace samples, each sample labeled with an associated fiber optic cable condition. The device alters the received OTDR trace samples to generate a set of synthetic OTDR trace samples. Each synthetic sample is labeled with the label of the received sample that was altered to generate the synthetic sample. The device trains a machine learning-based classifier using a training dataset that comprises the synthetic OTDR trace samples. The device uses the trained classifier to identify a condition along a particular fiber optic cable based on OTDR trace data obtained from that cable.