Patent classifications
G06V10/7747
Training image classifiers
Methods, systems, an apparatus, including computer programs encoded on a storage device, for training an image classifier. A method includes receiving an image that includes a depiction of an object; generating a set of poorly localized bounding boxes; and generating a set of accurately localized bounding boxes. The method includes training, at a first learning rate and using the poorly localized bounding boxes, an object classifier to classify the object; and training, at a second learning rate that is lower than the first learning rate, and using the accurately localized bounding boxes, the object classifier to classify the object. The method includes receiving a second image that includes a depiction of an object; and providing, to the trained object classifier, the second image. The method includes receiving an indication that the object classifier classified the object in the second image; and performing one or more actions.
FACE LIVENESS DETECTION METHOD, SYSTEM, APPARATUS, COMPUTER DEVICE, AND STORAGE MEDIUM
A face liveness detection method is provided, and includes: receiving an image transmitted by a terminal, the image including a face of an object; performing data augmentation on the image, to obtain an extended image corresponding to the image, a number of extended images corresponding to the image being more than one; performing liveness detection on the extended images corresponding to the image, to obtain intermediate detection results of the extended images, a liveness detection model used in liveness detection being obtained by performing model training on an initial neural network model according to a sample image and extended sample images corresponding to the sample image; and obtaining a liveness detection result of the object in the image after fusing the intermediate detection results of the extended images.
METHOD AND DEVICE FOR EVALUATING AN IMAGE CLASSIFIER
A computer-implemented method for evaluating an image classifier, in which a classifier output of the image classifier is provided for the actuation of an at least semi-autonomous robot. The evaluation method includes: ascertaining a first dataset including image data and annotations being assigned to the image data, the annotations including information about the scene imaged in the respective image and/or about image regions to be classified and/or about movement information of the robot; ascertaining regions of the scenes that are reachable by the robot based on the annotations; ascertaining relevance values for image regions to be classified by the image classifier; classifying the image data of the first image dataset with the aid of the image classifier; evaluating the image classifier based on image regions correctly classified by the image classifier and incorrectly classified image regions, as well as the calculated relevance values of the corresponding image regions.
LEARNING DATA GENERATION DEVICE AND DEFECT IDENTIFICATION SYSTEM
A learning data generation device that can generate learning data suitable for learning of an identification model. The learning data generation device has a function of cutting out part of first image data as second image data, a function of generating a two-dimensional graphic corresponding to the area of the second image data and representing a pseudo defect, a function of generating third image data by combining the second image data and the two-dimensional graphic, and a function of assigning a label corresponding to the two-dimensional graphic to the third image data. By using the third image data for learning of the identification model, a highly accurate identification model can be generated.
Optimizing inference time of entity matching models
Methods, systems, and computer-readable storage media for receiving input data including a set of entities of a first type and a set of entities of a second type, providing a set of features based on entities of the first type, the set of features including features expected to be included in entities of the second type, filtering entities of the second type based on the set of features to provide a sub-set of entities of the second type, and generating an output by processing the set of entities of the first type and the sub-set of entities of the second type through a ML model, the output comprising a set of matching pairs, each matching pair in the set of matching pairs comprising an entity of the set of entities of the first type and at least one entity of the sub-set of entities of the second type.
Domain adaptation and fusion using weakly supervised target-irrelevant data
Aspects include receiving a request to perform an image classification task in a target domain. The image classification task includes identifying a feature in images in the target domain. Classification information related to the feature is transferred from a source domain to the target domain. The transferring includes receiving a plurality of pairs of task-irrelevant images that each includes a task-irrelevant image in the source domain and in the target domain. The task-irrelevant image in the source domain has a fixed correspondence to the task-irrelevant image in the target domain. A target neural network is trained to perform the image classification task in the target domain. The training is based on the plurality of pairs of task-irrelevant images. The image classification task is performed in the target domain and includes applying the target neural network to an image in the target domain and outputting an identified feature.
A CO-TRAINING FRAMEWORK TO MUTUALLY IMPROVE CONCEPT EXTRACTION FROM CLINICAL NOTES AND MEDICAL IMAGE CLASSIFICATION
A system and method for training a text report identification machine learning model and an image identification machine learning model, including: initially training a text report machine learning model, using a labeled set of text reports including text pre-processing the text report and extracting features from the pre-processed text report, wherein the extracted features are input into the text report machine learning model; initially training an image machine learning model, using a labeled set of images; applying the initially trained text report machine learning model to a first set of unlabeled text reports with associated images to label the associated images; selecting a first portion of labeled associated images; re-training the image machine learning model using the selected first portion of labeled associated images; applying the initially trained image machine learning model to a first set of unlabeled images with associated text reports to label the associated text reports; selecting a first portion of labeled associated text reports; and re-training the text report machine learning model using the selected first portion of labeled associated text reports.
DIAGNOSTIC ASSISTANCE APPARATUS AND MODEL GENERATION APPARATUS
A diagnostic assistance apparatus according to an aspect of the present disclosure determines whether a body part of a target examinee captured in a target medical image is normal, by using a trained first classification model generated by unsupervised learning using a plurality of first learning medical images of normal cases and a trained second classification model generated by supervised learning using a plurality of learning data sets including normal cases and abnormal cases.
INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING SYSTEM, INFORMATION PROCESSING METHOD, AND INFORMATION PROCESSING PROGRAM
An object of the present disclosure is to provide an information processing apparatus, an information processing system, an information processing method, and an information processing program capable of achieving efficient use of training data. An information processing apparatus according to the present disclosure includes: a recognition unit (101) that performs object recognition processing using sensor information acquired by a sensor, the object recognition processing being performed by a first recognizer that has been pretrained; and a training data application determination unit (22d) that determines whether the sensor information is applicable as training data to a second recognizer different from the first recognizer.
LEARNING DATASET GENERATION DEVICE AND LEARNING DATASET GENERATION METHOD
A learning dataset generation device includes: a memory that stores three-dimensional CAD data of a workpiece and a container; and one or more processors including hardware, wherein the one or more processors are configured to use the three-dimensional CAD data of the workpiece and the container, stored in the memory, to generate, in a three-dimensional virtual space, a plurality of imaging objects in which a plurality of the workpieces are bulk-loaded in different forms inside the container, acquire a plurality of virtual distance images by measuring each of the generated imaging objects by means of a virtual three-dimensional measurement machine disposed in the three-dimensional virtual space, accept at least one teaching position for each of the acquired virtual distance images, and generate a learning dataset by associating the accepted teaching position with each of the virtual distance images.