Patent classifications
G06V10/778
LEARNING DEVICE, TRAFFIC EVENT PREDICTION SYSTEM, AND LEARNING METHOD
To provide a learning device that improves, using appropriate learning data, the accuracy of a prediction model that predicts a traffic event from a video. The learning device: detects, from a video obtained by imaging a road, an object to be detected including at least a vehicle, by a method different from that of a prediction model that predicts a traffic event on the road; generates learning data for the prediction model on the basis of the detected object and the captured video; and learns the prediction model using the generated learning data.
LEARNING DEVICE, TRAFFIC EVENT PREDICTION SYSTEM, AND LEARNING METHOD
To provide a learning device that improves, using appropriate learning data, the accuracy of a prediction model that predicts a traffic event from a video. The learning device: detects, from a video obtained by imaging a road, an object to be detected including at least a vehicle, by a method different from that of a prediction model that predicts a traffic event on the road; generates learning data for the prediction model on the basis of the detected object and the captured video; and learns the prediction model using the generated learning data.
MODEL GENERATING APPARATUS AND MODEL GENERATING METHOD
A model generating apparatus includes a measurement unit and a change unit. The measurement unit measures a size of an object appearing on an image included in data set. The change unit changes, based on a distribution of the size, a layer to be connected to a detection unit of a Convolutional Neural Network (CNN) for detecting an object appearing on the image, among layers for extracting feature included in the CNN.
MODEL GENERATING APPARATUS AND MODEL GENERATING METHOD
A model generating apparatus includes a measurement unit and a change unit. The measurement unit measures a size of an object appearing on an image included in data set. The change unit changes, based on a distribution of the size, a layer to be connected to a detection unit of a Convolutional Neural Network (CNN) for detecting an object appearing on the image, among layers for extracting feature included in the CNN.
AUTOMATIC LOCALIZED EVALUATION OF CONTOURS WITH VISUAL FEEDBACK
A localized evaluation network incorporates a discriminator acting as classifier, which may be included within a generative adversarial network (GAN). GAN may include a generative network such as U-NET for creating segmentations. The localized evaluation network is trained on image pairs including medical images of organs of interest and segmentation (mask) images. The network is trained to distinguish whether an image pair does or does not represent the ground truth. GAN examines interior layers of the discriminator and evaluates how much each localized image region contributes to the final classification. The discriminator may analyze regions of the image pair that contribute to a classification by analyzing layer weights of the machine learning model. Disclosed embodiments include a visual attribute, such as a heat map, that represents contributions of localized regions of a contour to an overall confidence score. These localized regions may be highlighted and reported for quality assurance review.
SYSTEMS AND METHODS FOR CATEGORIZING IMAGE PIXELS
Systems and methods are described to systems and methods for training a machine learning model to categorize each pixel of an input overhead image using received overhead images, and using a trained machine learning model to determine, for each pixel of input overhead images, to which land use or land cover mapping category each pixel of each overhead image belongs. The provided systems and methods may generate a map of a geographic area associated with the plurality of overhead images based on the plurality of overhead images and on the determined categories.
DATABASE MANAGEMENT SYSTEM AND METHOD FOR UPDATING A TRAINING DATASET OF AN ITEM IDENTIFICATION MODEL
A system for updating a training dataset of an item identification model determines that an item is not included in a training dataset. In response to determining that the item is not included in the training dataset, the system obtains an identifier of the item. The system detects a triggering event at a platform, where the triggering event corresponds to a user placing the item on a platform. The system captures images of the item. The system extracts a set of features associated with the item from the images. The system associates the item to the identifier and the set of features. The system adds a new entry to the training dataset, where the new entry represents the item labeled with the identifier and the set of features.
SYSTEM AND METHOD FOR AGGREGATING METADATA FOR ITEM IDENTIFICATION USING DIGITAL IMAGE PROCESSING
A system for identifying items based on aggregated metadata obtains images of an item. The system extracts a set of features from images of the item. The system identifies a first value of a first feature associated with a first image of the item. The system identifies a second value of the first feature associated with a second image of the item. The system aggregates the first value and the second value. The system associates the item to the aggregated first value and the second value, where the aggregated first value and the second value represent the first feature of the item. The system adds a new entry for each image of the item to a training dataset associated with an item identification model.
METHOD FOR RE-RECOGNIZING OBJECT IMAGE BASED ON MULTI-FEATURE INFORMATION CAPTURE AND CORRELATION ANALYSIS
A method for re-recognizing an object image is provided based on multi-feature information capture and correlation analysis weights of an input feature map by using a convolutional layer with a spatial attention mechanism and a channel attention mechanism, causing channel and spatial information to effectively combined, which not only focus on an important feature and suppress an unnecessary feature, but also improve a representation of a feature. A multi-head attention mechanism is used to process a feature after an image is divided into blocks to capture abundant feature information and determine a correlation between features to improve performance and efficiency of object image retrieval. The convolutional layer with the channel attention mechanism and the spatial attention mechanism is combined with a transformer having the multi-head attention mechanism to focus on globally important features and capture fine-grained features, thereby improving performance of re-recognition.
METHOD FOR RE-RECOGNIZING OBJECT IMAGE BASED ON MULTI-FEATURE INFORMATION CAPTURE AND CORRELATION ANALYSIS
A method for re-recognizing an object image is provided based on multi-feature information capture and correlation analysis weights of an input feature map by using a convolutional layer with a spatial attention mechanism and a channel attention mechanism, causing channel and spatial information to effectively combined, which not only focus on an important feature and suppress an unnecessary feature, but also improve a representation of a feature. A multi-head attention mechanism is used to process a feature after an image is divided into blocks to capture abundant feature information and determine a correlation between features to improve performance and efficiency of object image retrieval. The convolutional layer with the channel attention mechanism and the spatial attention mechanism is combined with a transformer having the multi-head attention mechanism to focus on globally important features and capture fine-grained features, thereby improving performance of re-recognition.