Patent classifications
G06V30/2528
Automated indexing and extraction of information in digital documents
Systems and methods for automated indexing and extraction of information in digital documents are disclosed. A method may comprise selecting a page number of a digital document to identify a page containing targeted information; inputting an image of the page into a visual machine learning network (visual ML), wherein the visual ML is trained to recognize text associated with the targeted information in an image; identifying by the visual ML, a section of the image that contains the targeted information; inputting the page number, the digital document, and coordinates of the section into an extraction module; and extracting the targeted information by the extraction module from the section.
ADAPTIVE CONTENT CLASSIFICATION OF A VIDEO CONTENT ITEM
In a method for performing adaptive content classification of a video content item, frames of a video content item are analyzed at a sampling rate for a type of content, wherein the sampling rate dictates a frequency at which frames of the video content item are analyzed. Responsive to identifying content within at least one frame indicative of the type of content, the sampling rate of the frames is increased. Responsive to not identifying content within at least one frame indicative of the type of content, the sampling rate of the frames is decreased. It is determined whether the video content item includes the type of content based on the analyzing the frames.
PROCESSING IMAGE DATA SETS
A method includes obtaining an image data set that depicts semiconductor components, and applying a hierarchical bricking to the image data set. In this case, the bricking includes a plurality of bricks on a plurality of hierarchical levels. The bricks on different hierarchical levels have different image element sizes of corresponding image elements.
Vision-based cell structure recognition using hierarchical neural networks
Methods, systems, and computer program products for vision-based cell structure recognition using hierarchical neural networks and cell boundaries to structure clustering are provided herein. A computer-implemented method includes detecting a style of the given table using at least one style classification model; selecting, based at least in part on the detected style, a cell detection model appropriate for the detected style; detecting cells within the given table using the selected cell detection model; and outputting, to at least one user, information pertaining to the detected cells comprising image coordinates of one or more bounding boxes associated with the detected cells.
Identifying and avoiding obstructions using depth information in a single image
A farming machine includes one or more image sensors for capturing an image as the farming machine moves through the field. A control system accesses an image captured by the one or more sensors and identifies a distance value associated with each pixel of the image. The distance value corresponds to a distance between a point and an object that the pixel represents. The control system classifies pixels in the image as crop, plant, ground, etc. based on depth information in in the pixels. The control system generates a labelled point cloud using the labels and depth information, and identifies features about the crops, plants, ground, etc. in the point cloud. The control system generates treatment actions based on any of the depth information, visual information, point cloud, and feature values. The control system actuates a treatment mechanism based on the classified pixels.
SYSTEM AND METHOD FOR EYEWEAR SIZING
Provided is a process for generating specifications for lenses of eyewear based on locations of extents of the eyewear determined through a pupil location determination process. Some embodiments capture an image and determine, using computer vision image recognition functionality, the pupil locations of a human's eyes based on the captured image depicting the human wearing eyewear.
IDENTIFYING AND AVOIDING OBSTRUCTIONS USING DEPTH INFORMATION IN A SINGLE IMAGE
A farming machine includes one or more image sensors for capturing an image as the farming machine moves through the field. A control system accesses an image captured by the one or more sensors and identifies a distance value associated with each pixel of the image. The distance value corresponds to a distance between a point and an object that the pixel represents. The control system classifies pixels in the image as crop, plant, ground, etc. based on depth information in in the pixels. The control system generates a labelled point cloud using the labels and depth information, and identifies features about the crops, plants, ground, etc. in the point cloud. The control system generates treatment actions based on any of the depth information, visual information, point cloud, and feature values. The control system actuates a treatment mechanism based on the classified pixels.
Multiple task transfer learning
Systems and methods relating to multitask transfer learning. Neural networks are used to accomplish a number of tasks and the results of these tasks are used to determine parameters common to these and other tasks. These parameters can then be used to accomplish other related tasks. In the description, data fitting as well as image related tasks are used. Task conditioning as well as the use of a KL regularizer have greatly improved results when testing the methods of the invention.
Optical character recognition using a combination of neural network models
Embodiments of the present disclosure describe a system and method for optical character recognition. In one embodiment, a system receives an image depicting text. The system extracts features from the image using a feature extractor. The system applies a first decoder to the features to generate a first intermediary output. The system applies a second decoder to the features to generate a second intermediary output, wherein the feature extractor is common to the first decoder and the second decoder. The system determines a first quality metric value for the first intermediary output and a second quality metric value for the second intermediary output based on a language model. Responsive to determining that the first quality metric value is greater than the second quality metric value, the system selects the first intermediary output to represent the text.
Systems and methods for identifying data processing activities based on data discovery results
Aspects of the present invention provide methods, apparatuses, systems, computing devices, computing entities, and/or the like for identifying data processing activities associated with various data assets based on data discovery results. In accordance various aspects, a method is provided comprising: identifying and scanning data assets to detect a subset of the data assets, wherein each asset of the subset is associated with a particular data element used for target data; generating a prediction for each pair of data assets of the subset on the target data flowing between the pair; identifying a data flow for the target data based on the prediction generated for each pair; and identifying a data processing activity associated with handling the target data based on a correlation identified for the particular data element, the subset, and/or the data flow with a known data element, subset, and/or data flow for the data processing activity.