Patent classifications
G06V30/26
Systems and Methods for Enhancing Trainable Optical Character Recognition (OCR) Performance
Systems and methods for enhancing trainable optical character recognition (OCR) performance are disclosed herein. An example method includes receiving, at an application executing on a user computing device communicatively coupled to a machine vision camera, an image captured by the machine vision camera, the image including an indicia encoding a payload and a character string. The example method also includes identifying the indicia and the character string; decoding the indicia to determine the payload; and applying an optical character recognition (OCR) algorithm to the image to interpret the character string and identify an unrecognized character within the character string. The example method also includes comparing the payload to the character string to validate the unrecognized character as corresponding to a known character included within the payload; and responsive to validating the unrecognized character, adding the unrecognized character to a font library referenced by the OCR algorithm.
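The validation step in this abstract can be illustrated with a minimal sketch. It assumes the OCR stage returns one (character, glyph) pair per position, with None standing in for an unrecognized character, and that the font library is a plain dictionary from characters to glyph samples. The function name learn_from_payload and these data shapes are hypothetical and do not come from the patent.

```python
def learn_from_payload(ocr_result, payload, font_library):
    """Validate unrecognized glyphs against a decoded payload.

    ocr_result: list of (char_or_None, glyph) pairs, one per character
    payload: string decoded from the indicia (assumed to match the text)
    font_library: dict mapping a character to a list of glyph samples
    Returns the list of characters that were learned.
    """
    # The payload can only vouch for the string if the lengths line up.
    if len(ocr_result) != len(payload):
        return []
    # Every character the OCR did recognize must agree with the payload.
    for (char, _), expected in zip(ocr_result, payload):
        if char is not None and char != expected:
            return []
    learned = []
    for (char, glyph), expected in zip(ocr_result, payload):
        if char is None:
            # The payload tells us which known character this glyph is.
            font_library.setdefault(expected, []).append(glyph)
            learned.append(expected)
    return learned
```

Requiring every recognized character to agree with the payload before learning anything is a conservative choice made for this sketch; it keeps a mismatched indicia from polluting the font library.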
SYSTEM AND METHOD FOR GENERATING BEST POTENTIAL RECTIFIED DATA BASED ON PAST RECORDINGS OF DATA
Various methods, apparatuses/systems, and media for data processing are disclosed. A processor receives a digital document; applies an optical character recognition (OCR) algorithm on said received digital document by utilizing an OCR tool; identifies defective data extracted by the OCR tool resulting from relatively inferior image quality of the received digital document; implements an auto rectification algorithm on the identified defective data; automatically generates, in response to implementing the auto rectification algorithm, corresponding auto-rectified data for each identified defective data; records the defective data and corresponding auto-rectified data at a field level; receives user input data on said recorded auto-rectified data; determines whether the auto-rectified data is correct or not; and populates, based on determining that the auto-rectified data is correct, a machine learning model with said received user input data to be utilized for subsequently received digital documents.
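The field-level recording and feedback loop described above might be sketched as follows. The class names FieldRecord and RectificationLog are hypothetical, and a plain list of training examples stands in for the machine learning model, whose form the abstract does not specify.

```python
from dataclasses import dataclass


@dataclass
class FieldRecord:
    field: str        # name of the document field, e.g. "invoice_total"
    defective: str    # value as extracted by the OCR tool
    rectified: str    # value proposed by the auto-rectification step
    confirmed: bool = False


class RectificationLog:
    """Records defective/rectified pairs per field and collects confirmed ones."""

    def __init__(self):
        self.records = []
        # Stand-in for the model: confirmed examples kept for later training.
        self.training_examples = []

    def record(self, field, defective, rectified):
        rec = FieldRecord(field, defective, rectified)
        self.records.append(rec)
        return rec

    def review(self, rec, user_value):
        # The rectification counts as correct only if the user agrees with it.
        rec.confirmed = user_value == rec.rectified
        if rec.confirmed:
            self.training_examples.append((rec.field, rec.defective, user_value))
        return rec.confirmed
```

For example, recording ("invoice_total", "1O0.00", "100.00") and reviewing it with the user value "100.00" stores one confirmed example, while a disagreeing review stores nothing.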
Automated categorization and assembly of low-quality images into electronic documents
An apparatus includes a memory and processor. The memory stores OCR and NLP algorithms. The processor receives an image of a physical document page and executes the OCR algorithm to convert the image into text. The processor identifies errors in the text, which are associated with noise in the image. The processor generates a feature vector that includes features obtained by executing the NLP algorithm on the text, and features associated with the identified errors in the text. The processor uses the feature vector to assign the image to a document category. Documents assigned to the document category share one or more characteristics, and the feature vector is associated with a probability greater than a threshold that the physical document associated with the image includes those characteristics. The processor then stores the image in a database as a page of an electronic document belonging to the assigned document category.
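A minimal sketch of the feature vector and thresholded category assignment follows. The keyword frequencies are a simple stand-in for the NLP features, and the share of tokens that mix letters and digits is a hypothetical proxy for OCR errors caused by image noise. Neither choice comes from the patent, and the category probabilities are assumed to be supplied by some classifier that is not shown.

```python
def build_feature_vector(text, keywords):
    """Combine keyword frequencies with one OCR-noise feature."""
    tokens = text.lower().split()
    total = max(len(tokens), 1)  # avoid division by zero on empty text
    keyword_features = [tokens.count(k) / total for k in keywords]
    # Tokens mixing letters and digits (e.g. "t0tal") are treated as likely errors.
    noisy = sum(
        1
        for t in tokens
        if any(c.isdigit() for c in t) and any(c.isalpha() for c in t)
    )
    return keyword_features + [noisy / total]


def assign_category(probabilities, threshold):
    """Return the most probable category if it clears the threshold, else None."""
    category, p = max(probabilities.items(), key=lambda kv: kv[1])
    return category if p > threshold else None
```

Returning None when no category clears the threshold leaves the decision about unassigned pages to the caller, which is a design choice of this sketch.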
METHODS, SYSTEMS, ARTICLES OF MANUFACTURE AND APPARATUS TO LABEL TEXT ON IMAGES
Methods, systems, articles of manufacture and apparatus are disclosed to label text on images. An example apparatus includes colorizer circuitry to apply color to text boxes corresponding to optical character recognition (OCR) data associated with an image, OCR manager circuitry to render an OCR text prompt associated with the OCR data, the OCR text prompt to be rendered proximate to respective ones of the text boxes, the OCR text prompt to display a text portion of the OCR data, and edit circuitry to (a) render an interface in response to selection of the OCR text prompt, the interface populated with the text portion of the OCR data, and (b) in response to an overwrite input to the interface, update the text portion of the OCR data in a memory corresponding to the image.
CHARACTER ACQUISITION, PAGE PROCESSING AND KNOWLEDGE GRAPH CONSTRUCTION METHOD AND DEVICE, MEDIUM
A method and a device for acquiring character information in a picture, a non-transitory storage medium, a page processing method, and a knowledge graph construction method are disclosed. The method for acquiring character information in a picture includes: acquiring a picture and extracting at least one piece of character information in the picture; and checking-and-correcting the at least one piece of character information based on a knowledge graph.
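One way to picture the check-and-correct step is to compare each extracted string with the entity names held in the knowledge graph and snap it to the closest one. The sketch below assumes the graph's entities are available as a flat list of names and uses fuzzy matching from Python's standard difflib module. The function name check_and_correct and the cutoff value are hypothetical, and the patent does not state that fuzzy matching is the technique it uses.

```python
import difflib


def check_and_correct(extracted, graph_entities, cutoff=0.8):
    """Return the extracted text, corrected to a known entity when one is close."""
    # Check: an exact match needs no correction.
    if extracted in graph_entities:
        return extracted
    # Correct: take the most similar entity name above the similarity cutoff.
    matches = difflib.get_close_matches(extracted, graph_entities, n=1, cutoff=cutoff)
    return matches[0] if matches else extracted
```

With the entities "photosynthesis" and "mitochondria", the misread string "ph0tosynthesis" is corrected to "photosynthesis", while an unrelated string such as "xyz" is returned unchanged.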
IMAGE RECOGNITION METHOD AND APPARATUS, TRAINING METHOD, ELECTRONIC DEVICE, AND STORAGE MEDIUM
An image recognition method and apparatus, a training method, an electronic device, and a storage medium are provided. The image recognition method includes: acquiring an image to be recognized, the image to be recognized including a target text; and determining text content of the target text based on knowledge information and image information of the image to be recognized.
SYSTEMS AND METHODS FOR DETECTION AND CORRECTION OF OCR TEXT
OCR-text correction system and method embodiments are described. The OCR-text correction embodiments comprise or cooperate with a transformer-based sequence-to-sequence language model. The model is pretrained to denoise corrupted text and is fine-tuned using OCR-correction-specific examples. Text obtained at least in part through OCR is applied to the fine-tuned pretrained transformer model to detect at least one error in a subset of the text. Responsive to detecting the at least one error, the fine-tuned pretrained transformer model outputs an updated subset of the text to correct the at least one error.
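The abstract does not say how the OCR-correction-specific fine-tuning examples are obtained. As one hedged illustration, (noisy, clean) pairs could be synthesized by injecting typical OCR confusions into clean text. The confusion table, the function names corrupt and make_pairs, and the corruption rate below are all assumptions made for this sketch, and the transformer model itself is not shown.

```python
import random

# Hypothetical table of visually confusable characters.
CONFUSIONS = {"0": "O", "O": "0", "1": "l", "l": "1", "5": "S", "S": "5"}


def corrupt(text, rate, rng):
    """Swap each confusable character with probability `rate`."""
    return "".join(
        CONFUSIONS[c] if c in CONFUSIONS and rng.random() < rate else c
        for c in text
    )


def make_pairs(clean_lines, rate=0.3, seed=0):
    """Build (noisy, clean) pairs usable as sequence-to-sequence examples."""
    rng = random.Random(seed)  # seeded so the generated data is reproducible
    return [(corrupt(line, rate, rng), line) for line in clean_lines]
```

Each pair presents the corrupted line as model input and the clean line as the target, which matches the denoising framing in the abstract.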