G06V20/70

ANALYSIS DEVICE AND ANALYSIS METHOD

An analysis device for visualizing an accuracy of a trained determination device includes an acquisition unit acquiring an image pair of a non-defective product image and a defective product image, an extraction unit extracting an image region of a defective part of the defective product, a generation unit generating a plurality of image regions of pseudo-defective parts, a compositing unit synthesizing each of the image regions of the plurality of pseudo-defective parts with the non-defective product image to generate a plurality of composite images having different feature quantities, an unit outputting the plurality of composite images to the determination device and acquiring a label corresponding to each of the plurality of composite images from the determination device, and a display control unit displaying an object indicating the label corresponding to each of the plurality of composite images in an array based on the feature quantities.

ANALYSIS DEVICE AND ANALYSIS METHOD

An analysis device for visualizing an accuracy of a trained determination device includes an acquisition unit acquiring an image pair of a non-defective product image and a defective product image, an extraction unit extracting an image region of a defective part of the defective product, a generation unit generating a plurality of image regions of pseudo-defective parts, a compositing unit synthesizing each of the image regions of the plurality of pseudo-defective parts with the non-defective product image to generate a plurality of composite images having different feature quantities, an unit outputting the plurality of composite images to the determination device and acquiring a label corresponding to each of the plurality of composite images from the determination device, and a display control unit displaying an object indicating the label corresponding to each of the plurality of composite images in an array based on the feature quantities.

METHOD AND DEVICE FOR CREATING A MACHINE LEARNING SYSTEM INCLUDING A PLURALITY OF OUTPUTS
20230022777 · 2023-01-26 ·

A method for creating a machine learning system, which is configured for segmentation and object detection. The method includes: providing a directed graph, selecting a path through the graph, at least one additional node being selected from a subset and a path being selected through the graph from the input node along the edges via the additional node up to the output node, the path initially being drawn as a function of probabilities of the edges, which defines a drawing probability of all architectures within the graph, creating a machine learning system as a function of the selected path and training the created machine learning system.

AUGMENTED PSEUDO-LABELING FOR OBJECT DETECTION LEARNING WITH UNLABELED IMAGES
20230028042 · 2023-01-26 ·

A method includes obtaining an image of a scene and identifying one or more labels for one or more objects captured in the image. The method also includes generating one or more domain-specific augmented images by modifying the image, where the one or more domain-specific augmented images are associated with the one or more labels. In addition, the method includes training or retraining a machine learning model using the one or more domain-specific augmented images and the one or more labels. Generating the one or more domain-specific augmented images may include at least one of modifying the image to include a different amount of motion blur, modifying the image to include a different lighting condition, and modifying the image to include a different weather condition.

AUGMENTED PSEUDO-LABELING FOR OBJECT DETECTION LEARNING WITH UNLABELED IMAGES
20230028042 · 2023-01-26 ·

A method includes obtaining an image of a scene and identifying one or more labels for one or more objects captured in the image. The method also includes generating one or more domain-specific augmented images by modifying the image, where the one or more domain-specific augmented images are associated with the one or more labels. In addition, the method includes training or retraining a machine learning model using the one or more domain-specific augmented images and the one or more labels. Generating the one or more domain-specific augmented images may include at least one of modifying the image to include a different amount of motion blur, modifying the image to include a different lighting condition, and modifying the image to include a different weather condition.

SEMI-SUPERVISED VIDEO TEMPORAL ACTION RECOGNITION AND SEGMENTATION

Systems, apparatuses, and methods include technology that generates final frame predictions for a first plurality of frames of a video, where the first plurality of frames is associated with unlabeled data. The technology predicts an ordered list of actions for the first plurality of frames based on the final frame predictions, and temporally aligning the ordered list of actions to the final frame predictions to generate labels.

SEMI-SUPERVISED VIDEO TEMPORAL ACTION RECOGNITION AND SEGMENTATION

Systems, apparatuses, and methods include technology that generates final frame predictions for a first plurality of frames of a video, where the first plurality of frames is associated with unlabeled data. The technology predicts an ordered list of actions for the first plurality of frames based on the final frame predictions, and temporally aligning the ordered list of actions to the final frame predictions to generate labels.

IMAGE ANNOTATION TOOLS
20230230384 · 2023-07-20 · ·

A method of annotating known objects in road images captured from a sensor-equipped vehicle, the method implemented in an annotation system and comprising: receiving at the annotation system a road image containing a view of a known object; receiving ego localization data, as computed in a map frame of reference, via localization applied to sensor data captured by the sensor-equipped vehicle, the ego localization data indicating an image capture pose of the road image in the map frame of reference; determining, from a predetermined road map, an object location of the known object in the map frame of reference, the predetermined road map representing a road layout the map frame of reference, wherein the known object is one of: a piece of road structure, and an object on or adjacent a road; computing, in an image plane defined by the image capture pose, an object projection, by projecting an object model of the known object from the object location into the image plane; and storing, in an image database, image data of the road image, in association with annotation data of the object projection for annotating the image data with a location of the known object in the image plane.

IMAGE ANNOTATION TOOLS
20230230384 · 2023-07-20 · ·

A method of annotating known objects in road images captured from a sensor-equipped vehicle, the method implemented in an annotation system and comprising: receiving at the annotation system a road image containing a view of a known object; receiving ego localization data, as computed in a map frame of reference, via localization applied to sensor data captured by the sensor-equipped vehicle, the ego localization data indicating an image capture pose of the road image in the map frame of reference; determining, from a predetermined road map, an object location of the known object in the map frame of reference, the predetermined road map representing a road layout the map frame of reference, wherein the known object is one of: a piece of road structure, and an object on or adjacent a road; computing, in an image plane defined by the image capture pose, an object projection, by projecting an object model of the known object from the object location into the image plane; and storing, in an image database, image data of the road image, in association with annotation data of the object projection for annotating the image data with a location of the known object in the image plane.

LABEL IDENTIFICATION METHOD AND APPARATUS, DEVICE, AND MEDIUM
20230230400 · 2023-07-20 ·

Provided are a label identification method and apparatus, a device, and a medium. The method includes: obtaining a target feature of a first image, in which the target feature characterizes a visual feature of the first image and a word feature of at least one label; and identifying a label of the first image from the at least one label based on the target feature. By characterizing the visual feature of the first image and the target feature of the word feature of the at least one label, the label of the first image is identified from the at least one label. Thus, identification accuracy of the label can be improved.