G06V30/24

Vision analysis and validation system for improved inspection in robotic assembly

A vision analytics and validation (VAV) system for providing an improved inspection of robotic assembly, the VAV system comprising a trained neural network three-way classifier, to classify each component as good, bad, or do not know, and an operator station configured to enable an operator to review an output of the trained neural network, and to determine whether a board including one or more “bad” or a “do not know” classified components passes review and is classified as good, or fails review and is classified as bad. In one embodiment, a retraining trigger to utilize the output of the operator station to train the trained neural network, based on the determination received from the operator station.

Automated indexing and extraction of information in digital documents

Systems and methods for automated indexing and extraction of information in digital documents are disclosed. A method may comprise selecting a page number of a digital document to identify a page containing targeted information; inputting an image of the page into a visual machine learning network (visual ML), wherein the visual ML is trained to recognize text associated with the targeted information in an image; identifying by the visual ML, a section of the image that contains the targeted information; inputting the page number, the digital document, and coordinates of the section into an extraction module; and extracting the targeted information by the extraction module from the section.

IDENTIFYING REGIONS OF INTEREST FROM WHOLE SLIDE IMAGES
20230252807 · 2023-08-10 ·

The present application relates generally to identifying regions of interest in images, including but not limited to whole slide image region of interest identification, prioritization, de-duplication, and normalization via interpretable rules, nuclear region counting, point set registration, and histogram specification color normalization. This disclosure describes systems and methods for analyzing and extracting regions of interest from images, for example biomedical images depicting a tissue sample from biopsy or ectomy. Techniques directed to quality control estimation, granular classification, and coarse classification of regions of biomedical images are described herein. Using the described techniques, patches of images corresponding to regions of interest can be extracted and analyzed individually or in parallel to determine pixels correspond to features of interest and pixels that do not. Patches that do not include features of interest, or include disqualifying features, can be disqualified from further analysis. Relevant patches can analyzed and stored with various feature parameters.

Means for using microstructure of materials surface as a unique identifier

The present application concerns the visual identification of materials or documents for tracking or authentication purposes. It describes methods to automatically authenticate an object by comparing some object images with reference images, the object images being characterized by the fact that visual elements used for comparison are non-disturbing for the naked eye. In some described approaches it provides the operator with visible features to locate the area to be imaged. It also proposes ways for real-time implementation enabling user friendly detection using mobile devices like smart phones.

Imagery evidence matching system

Systems and methods are provided for generating sets of candidates comprising images and places within a threshold geographic proximity based on geographic information associated with each of the plurality of images and geographic information associated with each place. For each set of candidates, the systems and methods generate a similarity score based on a similarity between text extracted from each image and a place name, and the geographic information associated with each image and each place. For each place with an associated image as a potential match, the systems and methods generate a name similarity score based on matching the extracted text of the image to the place name, and store an image as place data associated with a place based on determining that the name similarity score for the extracted text associated with the image is higher than a second predetermined threshold.

GENERATING DIGITAL IMAGES UTILIZING HIGH-RESOLUTION SPARSE ATTENTION AND SEMANTIC LAYOUT MANIPULATION NEURAL NETWORKS

This disclosure describes one or more implementations of a digital image semantic layout manipulation system that generates refined digital images resembling the style of one or more input images while following the structure of an edited semantic layout. For example, in various implementations, the digital image semantic layout manipulation system builds and utilizes a sparse attention warped image neural network to generate high-resolution warped images and a digital image layout neural network to enhance and refine the high-resolution warped digital image into a realistic and accurate refined digital image.

Hybrid vision system for crop land navigation
11765542 · 2023-09-19 · ·

In an embodiment, autonomous vehicles with global positioning systems (GPS) are used for field inspection to reduce fuel and labor costs and improve reliability with increased consistency in field crop inspection. A vehicle may be programmed to traverse a field while using sensors to detect objects and operating in a first image capture mode, for example, capturing low-resolution images of objects in the field, typically crops. Under program control, machine vision techniques are used with the low-resolution images to recognize crops, non-crop plant material or undefined objects. Under program control, location data is used to correlate recognized objects with digitally stored field maps to resolve whether a particular object is in a location at which crop planting is expected or not expected. Under program control, depending on whether an object in a low-resolution digital image is recognized as a crop, and whether the object is in an expected geo-location for crops, the vehicle may cease traversing temporarily and switch to a second image capture mode, for example, capturing a high-resolution image of the object, for use in disease analysis or classification, weed analysis or classification, alert notifications or other messages, or other processing. In this manner, a field may be rapidly traversed and imaged using coarse-level, rapid techniques that require lower processing resources, storage or memory, while automatically switching to execute special processing only when necessary to resolve unexpected objects or to perform operations such as disease classification that benefit from high-resolution images and more intensive use of processing resources, storage or memory.

Method and system for visio-linguistic understanding using contextual language model reasoners

This disclosure relates generally to visio-linguistic understanding. Conventional methods use contextual visio-linguistic reasoner for visio-linguistic understanding which requires more compute power and large amount of pre-training data. Embodiments of the present disclosure provide a method for visio-linguistic understanding using contextual language model reasoner. The method converts the visual information of an input image into a format that the contextual language model reasoner understands and accepts for a downstream task. The method utilizes the image captions and confidence score associated with the image captions along with a knowledge graph to obtain a combined input in a format compatible with the contextual language model reasoner. Contextual embeddings corresponding to the downstream task is obtained using the combined input. The disclosed method is used to solve several downstream tasks such as scene understanding, visual question answering, visual common-sense reasoning and so on.

MEANS FOR USING MICROSTRUCTURE OF MATERIALS SURFACE AS A UNIQUE IDENTIFIER

The present application concerns the visual identification of materials or documents for tracking or authentication purposes. It describes methods to automatically authenticate an object by comparing some object images with reference images, the object images being characterized by the fact that visual elements used for comparison are non-disturbing for the naked eye. In some described approaches it provides the operator with visible features to locate the area to be imaged. It also proposes ways for real-time implementation enabling user friendly detection using mobile devices like smart phones

Image normalization increasing robustness of machine learning applications for medical images

A computer program, a system and a method for normalizing medical images from a type of image acquisition device using a machine learning unit are disclosed. An embodiment of the method includes receiving a set of image data with images; decomposing each of the images of the set of images into components by incorporating at least information from different settings of the image acquisition device-specific image processing algorithms; and normalizing each of the components via a machine learning unit by processing at least information from the different settings of the image acquisition device-specific processing algorithms to provide a set of normalized images with a relatively decreased variability score.