G06V10/28

Neural network processing for multi-object 3D modeling

Embodiments are directed to neural network processing for multi-object three-dimensional (3D) modeling. An embodiment of a computer-readable storage medium includes executable computer program instructions for obtaining data from multiple cameras, the data including multiple images, and generating a 3D model for 3D imaging based at least in part on the data from the cameras, wherein generating the 3D model includes one or more of performing processing with a first neural network to determine temporal direction based at least in part on motion of one or more objects identified in an image of the multiple images or performing processing with a second neural network to determine semantic content information for an image of the multiple images.

DETECTING SPECIFIED IMAGE IDENTIFIERS ON OBJECTS
20180005390 · 2018-01-04 ·

Embodiments of the present application relate to a method, apparatus, and system for detecting a specified image identifier. The method includes retrieving a target image to be detected from a predetermined area, binarizing the target image to be detected to obtain a target binary image to be detected, calibrating connected domains of the target binary image to be detected, successively retrieving image features of candidate connected domains, and comparing the image features corresponding to the candidate connected domains to image features of a standard specified identifier image, wherein the candidate connected domains are determined based at least in part on the calibration of the connected domains, and determining a candidate connected domain as the location of the specified identifier image based at least in part on the comparison of the image features corresponding to the candidate connected domains to image features of the standard specified identifier image.

METHODS FOR MOBILE IMAGE CAPTURE OF VEHICLE IDENTIFICATION NUMBERS IN A NON-DOCUMENT
20180012100 · 2018-01-11 ·

Various embodiments disclosed herein are directed to methods of capturing Vehicle Identification Numbers (VIN) from images captured by a mobile device. Capturing VIN data can be useful in several applications, for example, insurance data capture applications. There are at least two types of images supported by this technology: (1) images of documents and (2) images of non-documents.

Multiple Stage Image Based Object Detection and Recognition

Systems, methods, tangible non-transitory computer-readable media, and devices for autonomous vehicle operation are provided. For example, a computing system can receive object data that includes portions of sensor data. The computing system can determine, in a first stage of a multiple stage classification using hardware components, one or more first stage characteristics of the portions of sensor data based on a first machine-learned model. In a second stage of the multiple stage classification, the computing system can determine second stage characteristics of the portions of sensor data based on a second machine-learned model. The computing system can generate an object output based on the first stage characteristics and the second stage characteristics. The object output can include indications associated with detection of objects in the portions of sensor data.

Multiple Stage Image Based Object Detection and Recognition

Systems, methods, tangible non-transitory computer-readable media, and devices for autonomous vehicle operation are provided. For example, a computing system can receive object data that includes portions of sensor data. The computing system can determine, in a first stage of a multiple stage classification using hardware components, one or more first stage characteristics of the portions of sensor data based on a first machine-learned model. In a second stage of the multiple stage classification, the computing system can determine second stage characteristics of the portions of sensor data based on a second machine-learned model. The computing system can generate an object output based on the first stage characteristics and the second stage characteristics. The object output can include indications associated with detection of objects in the portions of sensor data.

USER-GUIDED IMAGE SEGMENTATION METHODS AND PRODUCTS

A method for image segmentation includes (a) clustering, based upon k-means clustering, pixels of an image into first clusters, (b) outputting a cluster map of the first clusters (c) re-clustering the pixels into a new plurality of non-disjoint pixel-clusters, and (d) classifying the non-disjoint pixel-clusters in categories, according to a user-indicated classification. Another method for image segmentation includes (a) forming a graph with each node of the graph corresponding to a first respective non-disjoint pixel-cluster of the image and connected to each terminal of the graph and to all other nodes corresponding to other respective non-disjoint pixel-clusters that, in the image, are within a neighborhood of the first respective non-disjoint pixel-cluster, (b) setting weights of connections of the graph according to a user-indicated classification in categories respectively associated with the terminals, and (c) segmenting the image into the categories by cutting the graph based upon the weights.

USER-GUIDED IMAGE SEGMENTATION METHODS AND PRODUCTS

A method for image segmentation includes (a) clustering, based upon k-means clustering, pixels of an image into first clusters, (b) outputting a cluster map of the first clusters (c) re-clustering the pixels into a new plurality of non-disjoint pixel-clusters, and (d) classifying the non-disjoint pixel-clusters in categories, according to a user-indicated classification. Another method for image segmentation includes (a) forming a graph with each node of the graph corresponding to a first respective non-disjoint pixel-cluster of the image and connected to each terminal of the graph and to all other nodes corresponding to other respective non-disjoint pixel-clusters that, in the image, are within a neighborhood of the first respective non-disjoint pixel-cluster, (b) setting weights of connections of the graph according to a user-indicated classification in categories respectively associated with the terminals, and (c) segmenting the image into the categories by cutting the graph based upon the weights.

Image processing device, image processing method, and storage medium for correcting brightness
11710343 · 2023-07-25 · ·

The image processing unit selects multiple subject areas from strobe-ON image data to be corrected, and, from the selected multiple subject areas, the image processing unit acquires a feature amount such as gloss information corresponding to each subject. Subsequently, from each subject area, the image processing unit selects a part of the subject area, based on the acquired feature amount. Then, regarding the partial area of each subject area, which is selected based on the feature amount, the image processing unit estimates the auxiliary light arrival rate corresponding to each subject, based on a pixel value of the strobe-ON image data and a pixel value of strobe-OFF image data. Thereafter, based on the estimated auxiliary light arrival rate, the image processing unit corrects the brightness of each subject area of the strobe-ON image data, in order to generate corrected image data.

Image processing device, image processing method, and storage medium for correcting brightness
11710343 · 2023-07-25 · ·

The image processing unit selects multiple subject areas from strobe-ON image data to be corrected, and, from the selected multiple subject areas, the image processing unit acquires a feature amount such as gloss information corresponding to each subject. Subsequently, from each subject area, the image processing unit selects a part of the subject area, based on the acquired feature amount. Then, regarding the partial area of each subject area, which is selected based on the feature amount, the image processing unit estimates the auxiliary light arrival rate corresponding to each subject, based on a pixel value of the strobe-ON image data and a pixel value of strobe-OFF image data. Thereafter, based on the estimated auxiliary light arrival rate, the image processing unit corrects the brightness of each subject area of the strobe-ON image data, in order to generate corrected image data.

Compact encoded heat maps for keypoint detection networks

A method is presented. The method includes determining a number of landmarks in an image comprising multiple pixels. The method also includes determining a number of channels for the image based on a function of the number of landmarks. The method further includes determining, for each one of the number of channels, a confidence of each pixel of the multiple pixels corresponding to a landmark. The method still further includes identifying the landmark in the image based on the confidence.