Patent classifications
G06V10/945
INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND STORAGE MEDIUM
An information processing apparatus according to the present invention includes: a display control unit that displays, on a screen, a map of a search area, a camera icon indicating a location of a surveillance camera in the map, and a person image of a search target person; an operation receiving unit that receives an operation of superimposing, on the screen, one of the person image or the camera icon on the other; and a processing request unit that requests a matching process between the person image and a surveillance video captured by the surveillance camera corresponding to the camera icon based on the operation.
SELF-SUPERVISED LEARNING FRAMEWORK TO GENERATE CONTEXT SPECIFIC PRETRAINED MODELS
Systems and methods for self-supervised representation learning as a means to generate context-specific pretrained models include selecting data from a set of available data sets; selecting a pretext task from domain specific pretext tasks; selecting a target problem specific network architecture based on a user selection from available choices or any customized model as per user preference; and generating a pretrained model for the selected network architecture using the selected data obtained from the set of available data sets and a pretext task as obtained from domain specific pretext tasks.
METHOD FOR DETECTING DEFECT AND METHOD FOR TRAINING MODEL
The present disclosure provides a method and device for detecting an image category. The method includes: acquiring a sample data set including a plurality of sample images labeled with a category, the sample data set including a training data set and a verification data set; training a deep learning model using the training data set to obtain, according to different numbers of training rounds, at least two trained models; testing the at least two trained models using the verification data set to generate a verification test result; generating, based on the verification test result, a verification test index; determining, according to the verification test index, a target model from the at least two trained models; and predict a to-be-tested image of the target object using the target model to obtain the category of the to-be-tested image.
Learning data collection device, learning data collection system, and learning data collection method
In collection of training data for image recognition, in order to support a reduction in collection of improper images which are not suitable as training data, a learning data collection device includes a processor which is configured to acquire a captured image from an image capturing device, determine whether or not the captured image is suitable as training data, and when the captured image is determined to be not suitable as training data, perform a notification operation to prompt an image capturing person to reshoot a new image for the captured image.
FLIGHT PLAN GENERATION DEVICE AND FLIGHT PLAN GENERATION METHOD
A flight plan generation device for generating a flight plan of a flight device that transmits a captured image captured by an imaging unit through wireless communication during flight includes a first acquisition unit configured to acquire target information for specifying an imaging target to be imaged by the imaging unit during flight of the flight device and image quality information for specifying required image quality of a captured image of the imaging target, a second acquisition unit configured to acquire communication quality information on wireless communication quality in a predetermined area including at least a flight airspace of the flight device, and a flight plan generation unit configured to generate a flight plan including a flight path of the flight device and an imaging parameter of the imaging unit based on the acquired target information, image quality information, and communication quality information.
Continuous machine learning for extracting description of visual content
Aspects of the present disclosure relate to machine learning techniques for continuous implementation and training of a machine learning system for identifying the natural language meaning of visual content. A computer vision model or other suitable machine learning model can predict whether a given descriptor is associated with the visual content. A set of such models can be used to determine whether particular ones of a set of descriptors are associated with the visual content, with the determined descriptors representing a meaning of the visual content. This meaning can be refined based on a multi-armed bandit tracking and analyzing interactions between the visual content and users associated with certain personas related to the determined descriptors.
System and method for training an artificial intelligence (AI) classifier of scanned items
Systems and methods for training an artificial intelligence (AI) classifier of scanned items. The items may include a training set of sample raw scans. The set may include in-class objects and not-in-class raw scans. An AI classifier may be configured to sample raw scans in the training set, measure errors in the results, update classifier parameters based on the errors, and detect completion of training.
Image tracing system and method
A method includes tagging, by at least one processor, one or more three-dimensional assets with a unique identifier and storing the one or more three-dimensional assets in a database, creating, by the at least one processor, a three-dimensional model based on the one or more three-dimensional assets and loading the three-dimensional model in a simulator, generating, by the at least one processor, a two-dimensional image that is a representation of the three-dimensional model in the simulator, the two-dimensional image comprising metadata that includes each unique identifier for each three-dimensional asset of the three-dimensional model displayed in the two-dimensional image, and assigning, by the at least one processor, the two-dimensional image with a unique identifier and storing each unique identifier for each three-dimensional asset of the three-dimensional model displayed in the two-dimensional image in metadata for the two-dimensional image.
INFORMATION PROCESSING UNIT, INFORMATION PROCESSING METHOD, AND PROGRAM
An information processing unit includes: a diagnostic image input section that inputs the diagnostic image; an operation information obtaining section that obtains display operation history information representing an operation history of a user who controls displaying of the diagnostic image; a query image generation section that extracts a predetermined region of the input diagnostic image to generate a query image; a diagnosed image obtaining section that supplies the generated query image and the display operation history information to a diagnosed image search unit and obtains the diagnosed image obtained as a search result by the diagnosed image search unit; and a display control section that displays the diagnostic image and the obtained diagnosed image for comparison.
Machine learning inference user interface
Two-dimensional objects are displayed upon a user interface; user input selects an area and selects a machine learning model for execution. The results are displayed as an overlay over the objects in the user interface. User input selects a second model for execution; the result of this execution is displayed as a second overlay over the objects. A first overlay from a model is displayed over a set of objects in a user interface and a ground truth corresponding to the objects is displayed as a second overlay on the user interface. User input selects the ground truth overlay as a reference and causes a comparison of the first overlay with the ground truth overlay; the visual data from the comparison is displayed on the user interface. A comparison of M inference overlays with N reference overlays is performed and visual data from the comparison is displayed on the interface.