Patent classifications
G06V30/1478
TEX LINE DETECTION
A system and method for text line detection are described Examples include detection of symbols in an image received from an image-capturing device. In examples, for each of at least some of the symbols, neighboring symbols within a local region a given distance from the symbol are analyzed in order to determine a direction for a line in the local regions In examples, based on the determined directions for the lines, text lines in the image are identified.
ENHANCED OPTICAL CHARACTER RECOGNITION (OCR) IMAGE SEGMENTATION SYSTEM AND METHOD
Optical character recognition (OCR) based systems and methods for extracting and automatically evaluating contextual and identification information and associated metadata from an image utilizing enhanced image processing techniques and image segmentation. A unique, comprehensive integration with an account provider system and other third party systems may be utilized to automate the execution of an action associated with an online account. The system may evaluate text extracted from a captured image utilizing machine learning processing to classify an image type for the captured image, and select an optical character recognition model based on the classified image type. They system may compare a data value extracted from the recognized text for a particular data type with an associated online account data value for the particular data type to evaluate whether to automatically execute an action associated with the online account linked to the image based on the data value comparison.
Persistent feature based image rotation and candidate region of interest
Embodiments of a system and method for sorting and delivering articles in a processing facility based on image data are described. Image processing results such as rotation notation information may be included in or with an image to facilitate downstream processing such as when the routing information cannot be extracted from the image using an unattended system and the image is passed to an attended image processing system. The rotation notation information may be used to dynamically adjust the image before presenting the image via the attended image processing system.
IMAGE DEWARPING WITH CURVED DOCUMENT BOUNDARIES
An example non-transitory computer-readable medium includes instructions executable by a processor to detect boundaries of a representation of a document page in a captured image, model the boundaries of the representation of the document page as nonlinear curves, use the nonlinear curves to transform pixels of the representation of the document page into pixels of a dewarped representation of the document page, and output a dewarped image based on the dewarped representation of the document page.
Data normalization and extraction system
A data ingestion system normalizes ingested documents and extracts data based on a template that is applied to the documents. In an aspect, the system accesses a document of a document type and determines a template to apply to the document. The system normalizes the document, extracts data values from the document based at least in part on the template, and generates structured data based at least partly on the extracted data.
DEEP-LEARNING-BASED SYSTEM AND PROCESS FOR IMAGE RECOGNITION
Disclosed are methods and systems for using artificial intelligence (AI) for image recognition by using predefined coordinates to extract a portion of a received image, the extracted portion comprising a word to be identified having at least a first letter and a second letter; executing an image recognition protocol to identify the first letter; when the server is unable to identify the second letter, the server executes an AI model having a nodal data structure to identify the second letter based upon the identified first letter, the nodal data structure comprising a set of nodes where each node represents a letter, each node connected to at least one other node, wherein connection of a first node to a second node corresponds to a probability that a letter corresponding to the second node is used in a word subsequent to a letter corresponding to the first node.
Process for highlighting text with varied orientation
A computer-implemented method and apparatus for highlighting text in an image disposed in a markup language document are disclosed. Location data identifying a location, size and orientation of a text element in the image may be obtained, where the text element is oriented in a direction that is non-orthogonal to vertical and horizontal axes of the image. A context for a canvas element in the document may be obtained and rotated to align the context to the orientation of the text element using the location data. The context may also be translated to the location of the text element using the location data, and a text highlighting element that at least partially overlays the text element may be generated.
DYNAMICALLY OPTIMIZING PHOTO CAPTURE FOR MULTIPLE SUBJECTS
A user device detects, in a field of view of the camera, a first side of a document, and determines first information associated with the first side of the document. The user device selects a first image resolution based on the first information and captures, by the camera, a first image of the first side of the document according to the first image resolution. The user device detects, in the field of view of the camera, a second side of the document, and determines second information associated with the second side of the document. The user device selects a second image resolution based on the second information, and captures, by the camera, a second image of the second side of the document according to the second image resolution. The user device performs an action related to the first image and the second image.
Deep-learning-based system and process for image recognition
Disclosed are methods and systems for using artificial intelligence (AI) for image recognition by using predefined coordinates to extract a portion of a received image, the extracted portion comprising a word to be identified having at least a first letter and a second letter; executing an image recognition protocol to identify the first letter; when the server is unable to identify the second letter, the server executes an AI model having a nodal data structure to identify the second letter based upon the identified first letter, the nodal data structure comprising a set of nodes where each node represents a letter, each node connected to at least one other node, wherein connection of a first node to a second node corresponds to a probability that a letter corresponding to the second node is used in a word subsequent to a letter corresponding to the first node.
Methods, systems, apparatus and articles of manufacture for receipt decoding
Methods, apparatus, systems and articles of manufacture are disclosed for receipt decoding. An example apparatus includes processor circuitry to execute instructions to extract text from the receipt image, the text including bounding boxes; associate ones of the bounding boxes to link horizontally related fields of a the receipt image by selecting a first bounding box; identifying first horizontally aligned bounding boxes, the first horizontally aligned bounding boxes to include at least one bounding box of the bounding boxes that is horizontally aligned relative to the first bounding box; adding the first horizontally aligned bounding boxes to a word sync list; and connecting ones of the first horizontally aligned bounding boxes and the first bounding box based on at least one of an amount of the first horizontally aligned bounding boxes in the word sync list and a relationship among the first horizontally aligned bounding boxes and the first bounding box.