Patent classifications
G06V30/1478
HIGH-SPEED OCR DECODE USING DEPLETED CENTERLINES
A method for template matching can include iteratively selecting a template set of points to project over a centerline of a candidate symbol; conducting a template matching analysis; assigning a score to each template set; and selecting a template set with a highest assigned score. For example, the score can depend on proximity of the template points to a center and/or boundaries of a principal tracing path of the symbol. Additionally, one or more template sets having a top rank can be selected for a secondary analysis of proximity of the template points to a boundary of a printing of the symbol. The method can further include using the template with the highest score to interpret the candidate symbol.
System and method of character recognition using fully convolutional neural networks with attention
Embodiments of the present disclosure include a method that obtains a digital image. The method includes extracting a word block from the digital image. The method includes processing the word block by evaluating a value of the word block against a dictionary. The method includes outputting a prediction equal to a common word in the dictionary when a confidence factor is greater than a predetermined threshold. The method includes processing the word block and assigning a descriptor to the word block corresponding to a property of the word block. The method includes processing the word block using the descriptor to prioritize evaluation of the word block. The method includes concatenating a first output and a second output. The method includes predicting a value of the word block.
Data normalization and extraction system
A data ingestion system normalizes ingested documents and extracts data based on a template that is applied to the documents. In an aspect, the system accesses a document of a document type and determines a template to apply to the document. The system normalizes the document, extracts data values from the document based at least in part on the template, and generates structured data based at least partly on the extracted data.
Method and apparatus for recognizing imaged information-bearing medium, computer device and medium
A method and apparatus for recognizing an imaged information-bearing medium, a computer-readable storage device and a computer device are provided. The method comprising: acquiring a first image of the imaged information-bearing medium; performing text recognition on the first image to acquire a text content of the imaged information-bearing medium; classifying the imaged information-bearing medium to acquire a type of the imaged information-bearing medium; and archiving the text content according to the type.
ALIGNMENT OF USER INPUT ON A SCREEN
A system for automated user input alignment receives the user input at a touchscreen display. A skew of the user input is identified as the user input is being received at a touchscreen display. A skew correction is determined based on the identified skew. The skew correction is applied to the user input to align the user input on the touchscreen display. The skew correction applied in an automated alignment process that. The user input is displayed with the applied skew correction on the touchscreen display with improved efficiency and without user manipulation to perform the alignment.
Dynamically optimizing photo capture for multiple subjects
A user device detects, in a field of view of the camera, a first side of a document, and determines first information associated with the first side of the document. The user device selects a first image resolution based on the first information and captures, by the camera, a first image of the first side of the document according to the first image resolution. The user device detects, in the field of view of the camera, a second side of the document, and determines second information associated with the second side of the document. The user device selects a second image resolution based on the second information, and captures, by the camera, a second image of the second side of the document according to the second image resolution. The user device performs an action related to the first image and the second image.
SYSTEMS AND METHODS FOR ASSIGNING WORD FRAGMENTS TO TEXT LINES IN OPTICAL CHARACTER RECOGNITION-EXTRACTED DATA
Systems and methods for assigning word fragments to lines of text in optical character recognition (OCR) extracted data can include at least one processor obtaining a plurality of word fragments from OCR generated data associated with an image. The at least one processor can determine vertical coordinates of each of the word fragments in the image. The at least one processor can cluster the plurality of word fragments into one or more clusters of word fragments based on the vertical coordinates of the plurality of word fragments. The at least one processor can assign each word fragment of a respective cluster to a corresponding text line based on the clustering.
Multifunction peripheral assisted optical mark recognition using dynamic model and template identification
A system for multifunction peripheral assisted optical mark recognition uses a scanner to scan at least one printed page to generate a scanned image of the at least one printed page and to optically detect a presence of a visible label on the scanned image. Then, the multifunction peripheral may extract a model identification and a template identification from the visible label, select a template, identifying locations from which image data is to be extracted from the scanned image, select a model, specifically identifying at least two types of acceptable marks within the image data to be extracted from the scanned image, and perform optical mark recognition on the location from which image data is to be extracted identified by the template using the at least two types of acceptable marks identified by the model to extract useful data from the image data.
Method, System and Computer Readable Storage Medium for Identifying Information Carried on Sheet
A method for identifying information carried on a sheet is disclosed. The method comprises: identifying, using one or more computing devices, each of one or more areas on the sheet based on an image of the sheet and a pre-trained first model, wherein each of the one or more areas is associated with all or part of the information carried on the sheet, and the first model is a neural network based model; and identifying, using one or more computing devices, characters in each of the one or more areas based on the image of the sheet, each of the one or more areas and a pre-trained second model so as to determine the information carried on the sheet, wherein the second model is a neural network based model.
High-speed OCR decode using depleted centerlines
A method for template matching can include iteratively selecting a template set of points to project over a centerline of a candidate symbol; conducting a template matching analysis; assigning a score to each template set; and selecting a template set with a highest assigned score. For example, the score can depend on proximity of the template points to a center and/or boundaries of a principal tracing path of the symbol. Additionally, one or more template sets having a top rank can be selected for a secondary analysis of proximity of the template points to a boundary of a printing of the symbol. The method can further include using the template with the highest score to interpret the candidate symbol.