Patent classifications
G06V30/12
Image processing system for computerizing document, control method thereof, and storage medium
In an image processing system in which when a paper document is computerized, a file name or the like is set by using a recognized character string obtained by performing OCR processing, so that time and effort of a user when a plurality of documents is computerized en bloc is reduced. Learning data is generated by registering positional information relating to a recognized character string used for setting of a property relating to a scanned image in association with a document form of the scanned image. Then, in a case where the learning data is generated in response to setting of the property being performed for a first scanned image that is selected from a plurality of scanned images included in a list, a scanned image having a document form similar to a document form of the first scanned image is determined among other scanned images included in the list.
TEXT IMAGE DEFECT DETECTION METHOD, COMPUTER DEVICE, AND STORAGE MEDIUM
This application provides an image defect detection method. The method includes obtaining a first image and a second image of a flawless image. A third image is obtained from the second image and the first image, and a fourth image is obtained according to the second image and an image to be detected. A fifth image is obtained based on the third image and the second image. A sixth image is obtained based on the third image and the fourth image. A seventh image is obtained from the fifth image and the sixth image. A defect value of the fourth image is obtained according to the third image and the seventh image. A detection result of the fourth image is determined based on the defect value.
TEXT IMAGE DEFECT DETECTION METHOD, COMPUTER DEVICE, AND STORAGE MEDIUM
This application provides an image defect detection method. The method includes obtaining a first image and a second image of a flawless image. A third image is obtained from the second image and the first image, and a fourth image is obtained according to the second image and an image to be detected. A fifth image is obtained based on the third image and the second image. A sixth image is obtained based on the third image and the fourth image. A seventh image is obtained from the fifth image and the sixth image. A defect value of the fourth image is obtained according to the third image and the seventh image. A detection result of the fourth image is determined based on the defect value.
SYSTEMS AND METHODS FOR DETECTION AND CORRECTION OF OCR TEXT
OCR-text correction system and method embodiments are described. The OCR-text correction embodiments comprise or cooperate with a transformer-based sequence-to-sequence language model. The model is pretrained to denoise corrupted text and is fine-tuned using OCR-correction-specific examples. Text obtained at least in part through OCR is applied to the fine-tuned pretrained transformer model to detect at least one error in a subset of the text. Responsive to detecting the at least one error, the fine-tuned pretrained transformer model outputs an updated subset of the text to correct the at least one error.
HAND-DRAWN DIAGRAM RECOGNITION USING VISUAL ARROW-RELATION DETECTION
Computer-readable media, methods, and systems are disclosed for converting a diagram into a digital diagram format. A diagram is received as an image file and a plurality of recognition stages produce a final diagram based on predicted information from one or more of the recognition stages. The plurality of recognition stages including a shape detection stage for detecting a plurality of shapes and at least one arrow detection stage in which arrows are detected as relations between pairs of shapes. The final diagram is generated based on the predicted information and is converted into the digital diagram format compatible with a diagram modeling language.
METHOD AND SYSTEM FOR IDENTIFYING AND DETERMINING VALUATION OF CURRENCY
A method and system is provided for determining the denomination and related data for a currency item using a personal computing device, such as a mobile phone. The device includes or is connected to an image capture device that is preferably a digital video camera. At least one image of a target currency item is captured then processed for image quality. A further processing of the image includes a coordinate mapping. A comparison is made between individual pixels of the processed image based on the assigned coordinate mapping with a database of reference currency images to determine the currency denomination. Additional processing of the currency image provides the date and other data regarding the target currency item. A market value for the target currency item is identified by reference to a valuation database using the data determined for the currency item.
INSPECTION APPARATUS, METHOD, AND NON-TRANSITORY STORAGE MEDIUM FOR INSPECTING PRINT PRODUCT
An inspection apparatus performs dropout color processing on a first inspection area set for an image generated by reading a print product and then performs first recognition processing on the first inspection area, and further performs second recognition processing on a second inspection area set for the image and then performs an inspection of whether sufficient margin areas are allocated on the second inspection area, without performing dropout color processing.
INSPECTION APPARATUS, METHOD, AND NON-TRANSITORY STORAGE MEDIUM FOR INSPECTING PRINT PRODUCT
An inspection apparatus performs dropout color processing on a first inspection area set for an image generated by reading a print product and then performs first recognition processing on the first inspection area, and further performs second recognition processing on a second inspection area set for the image and then performs an inspection of whether sufficient margin areas are allocated on the second inspection area, without performing dropout color processing.
OCR error correction
Implementations of the disclosure are directed to OCR error correction systems and methods. In some implementations, a method comprises: obtaining, at a computing device, optical character recognition (OCR) text extracted from a document image, the text comprising a token; searching, at the computing device, based on a token bigram determined from the token and a mapping between words in a corpus and a corpus bigram set comprised of unique bigrams from the beginning or ending of the words in the corpus, the corpus for a best word to replace the token; and replacing, at the computing device, the token with the best word.
Hand-drawn diagram recognition using visual arrow-relation detection
Computer-readable media, methods, and systems are disclosed for converting a diagram into a digital diagram format. A diagram is received as an image file and a plurality of recognition stages produce a final diagram based on predicted information from one or more of the recognition stages. The plurality of recognition stages including a shape detection stage for detecting a plurality of shapes and at least one arrow detection stage in which arrows are detected as relations between pairs of shapes. The final diagram is generated based on the predicted information and is converted into the digital diagram format compatible with a diagram modeling language.