Patent classifications
G06V10/945
IMAGE RECOGNITION SUPPORT APPARATUS, IMAGE RECOGNITION SUPPORT METHOD, AND IMAGE RECOGNITION SUPPORT PROGRAM
The invention supports creation of models for recognizing attributes in an image with high accuracy. An image recognition support apparatus includes an image input unit configured to acquire an image, a pseudo label generation unit configured to recognize the acquired image based on a plurality of types of image recognition models and output recognition information, and generate pseudo labels indicating attributes of the acquired image based on the output recognition information, and a new label generation unit configured to generate new labels based on the generated pseudo labels.
AUTOMATIC RULE PREDICTION AND GENERATION FOR DOCUMENT CLASSIFICATION AND VALIDATION
A method is provided. The method may include, in response to electronically receiving a document, automatically classifying the document and different parts of the document, by electronically identifying a document type associated with the document and electronically tagging data associated with the different parts of the document based on classification rules. The method may further include automatically extracting the tagged data associated with the automatically classified document based on data extraction rules. The method may further include detecting first feedback associated with the classification rules and second feedback associated with the data extraction rules. The method may further include automatically generating and updating validation rules based on the identified document type, the detected first feedback, and the detected second feedback to validate the automatically classified document and the automatically tagged and extracted data.
Method and system for interfacing with a user to facilitate an image search for an object-of-interest
Methods, systems, and techniques for performing a facet search include receiving facet search commencement user input indicating that a search for a facet is to commence; in response to the facet search commencement user input, searching one or more video recordings for the facet; and displaying, on a display, facet image search results depicting the facet, wherein the facet image search results are selected from the one or more video recordings. An artificial neural network may be used for the facet search, and that network may be trained by generating a facet image training set that comprises training images, with the training images depicting a type of facet common to the training images; and training, by using the facet image training set, that neural network to classify the type of facet when a sample image comprising the type of facet is input to that network.
Information processing apparatus, information processing method, and storage medium for classifying object of interest
An information processing method in which an object of interest is classified using node group information defining a node group having modeled a scheme of classification as a tree structure and having grouped nodes possessing a same parent node, comprises: setting depth information for determining whether to perform classification for a particular node group when sequentially traversing node groups from the parent node using the node group information to classify the object of interest; and classifying the object of interest by sequentially traversing node groups from the parent node using the node group information, and providing a classification result, wherein classifying the object of interest varies a depth up to which node groups are sequentially traversed from the parent node to classify the object of interest, in accordance with setting of the depth information.
Image processing system for providing attribute information, image processing method and storage medium
In a system, a setting window including at least a preview region in which a scanned image is previewed and a text field to which attribute information on the scanned image is input is displayed, when a character region within the scanned image previewed in the setting window is moused over, control to preliminarily display a character string corresponding to the moused-over character region in the text field is performed, and when the moused-over character region is clicked by a mouse, control to fix the character string preliminarily displayed in the text field is performed.
AUTHENTICATION DEVICE, REGISTRATION DEVICE, AUTHENTICATION METHOD, REGISTRATION METHOD, AND STORAGE MEDIUM
An authentication device includes a feature amount generation unit for generating a feature amount of an object in an image, a registered data acquisition unit for acquiring registered data where a feature amount of a predetermined object in an image is registered in advance as a registered feature amount, and NG information associated with the registered feature amount unsuitable for authentication is recorded, a similarity acquisition unit for acquiring a similarity between the predetermined feature amount and the registered feature amount acquired from the registered data acquisition unit, and a determination unit for performing authentication if the degree of similarity acquired by the degree of similarity acquisition unit satisfies a predetermined authentication condition, and even if the degree of similarity satisfies the predetermined authentication condition, if the NG information associated with the registered feature amount is acquired, the predetermined feature amount is not authenticated.
Image processing apparatus, method for controlling the same, and storage medium
Images of the plurality of document pages are scanned to generate image data with one scanning instruction. A single folder named with a received character string is determined as a storage destination of image data corresponding to the plurality of document pages generated with the scanning instruction.
IMAGE OUTPUT DEVICE AND METHOD FOR CONTROLLING THE SAME
The present invention relates to a video output device mounted on a vehicle to implement augmented reality, and a method for controlling the same. The video output device comprises: a video output unit for outputting visual information for implementing the augmented reality; a communication unit for receiving a front video captured of the front of the vehicle; and a processor for investigating, in the front video, at least one to-be-driven lane on which the vehicle is to be driven, and controlling the video output unit such that main carpet images guiding the to-be-driven lanes are output lane by lane.
INFORMATION AUTHENTICATION
An information authentication method is provided. The method includes: determining a plurality of first objects in a target image; determining at least one second object in the target image; and executing, for each of the plurality of first objects, a first authentication operation including: determining whether the at least one second object includes a second object associated with the first object; performing first authentication on the second object associated with the first object in response to determining that the at least one second object includes the second object associated with the first object to obtain a first authentication result of the first object; and outputting the first authentication result.
ANNUNCIATION METHOD AND INFORMATION PROCESSING DEVICE
An annunciation method includes the steps of obtaining first position information as position information of a place where a first object is located, obtaining second position information as position information of a place where a second object is located, obtaining third position information as position information of a place where a third object is located, setting the first object and the second object as a first group when a distance between the first object and the second object calculated based on the first position information and the second position information is smaller than or equal to a first threshold value, obtaining fourth position information representing a position of the first group, and performing annunciation when a distance between the third object and the first group calculated based on the third position information and the fourth position information is smaller than a second threshold value.