Patent classifications
G06V30/26
PROCESSING APPARATUS, PROCESSING METHOD, AND NON-STRATEGY MEDIUM
The present invention provides a processing apparatus (10) including an acquisition unit (11) that acquires an image of a fill-in form including a plurality of first fill-in fields where a numerical value is filled in, and a second fill-in field where a sum total of the numerical values filled in a plurality of the first fill-in fields is filled in, an analysis unit (12) that analyzes the image, and recognizes the value filled in a plurality of the first fill-in fields and the value filled in the second fill-in field, a determination unit (13) that determines whether a sum total of recognition results of the value filled in a plurality of the first fill-in fields and a recognition result of the value filled in the second fill-in field match each other, and a processing unit (14) that executes error processing when a sum total of the recognition results of the value filled in a plurality of the first fill-in fields and the recognition result of the value filled in the second fill-in field do not match each other.
PROCESSING APPARATUS, PROCESSING METHOD, AND NON-STRATEGY MEDIUM
The present invention provides a processing apparatus (10) including an acquisition unit (11) that acquires an image of a fill-in form including a plurality of first fill-in fields where a numerical value is filled in, and a second fill-in field where a sum total of the numerical values filled in a plurality of the first fill-in fields is filled in, an analysis unit (12) that analyzes the image, and recognizes the value filled in a plurality of the first fill-in fields and the value filled in the second fill-in field, a determination unit (13) that determines whether a sum total of recognition results of the value filled in a plurality of the first fill-in fields and a recognition result of the value filled in the second fill-in field match each other, and a processing unit (14) that executes error processing when a sum total of the recognition results of the value filled in a plurality of the first fill-in fields and the recognition result of the value filled in the second fill-in field do not match each other.
INFORMATION PROCESSING APPARATUS AND NON-TRANSITORY COMPUTER READABLE MEDIUM STORING PROGRAM
An information processing apparatus includes a processor configured to acquire recognized items obtained by recognizing characters in a first image showing a reading target including one or more first fields in which characters are written and a second field in which a name or a signature is written, if any one of the first fields is inadequate, create a second image having blank fields for the inadequate first field and the second field, if the second field is inadequate, create a second image having a blank field for the second field, and output the created second image.
Creating compact example sets for intent classification
Embodiments for creating compact example subsets for intent classification in a conversational system are provided. A set of content used for training an intent classifier is received from a conversational corpus. Entries within the set of content are separated into a first subset and a second subset, and a cross-validation operation is performed on the first and second subsets to identify a correctly labeled portion and an incorrectly labeled portion of the set of content. A reduced content used for performing a final training of the intent classifier is formed by combining a first number of the entries from the correctly labeled portion and a second number of the entries from the incorrectly labeled portion of the set of content.
Optical character recognition quality evaluation and optimization
A processor may receive an image and determine a number of foreground pixels in the image. The processor may obtain a result of optical character recognition (OCR) processing performed on the image. The processor may identify at least one bounding box surrounding at least one portion of text in the result and overlay the at least one bounding box on the image to form a masked image. The processor may determine a number of foreground pixels in the masked image and a decrease in the number of foreground pixels in the masked image relative to the number of foreground pixels in the image. Based on the decrease, the processor may modify an aspect of the OCR processing for subsequent image processing.
Optical character recognition quality evaluation and optimization
A processor may receive an image and determine a number of foreground pixels in the image. The processor may obtain a result of optical character recognition (OCR) processing performed on the image. The processor may identify at least one bounding box surrounding at least one portion of text in the result and overlay the at least one bounding box on the image to form a masked image. The processor may determine a number of foreground pixels in the masked image and a decrease in the number of foreground pixels in the masked image relative to the number of foreground pixels in the image. Based on the decrease, the processor may modify an aspect of the OCR processing for subsequent image processing.
TEXT RECOGNITION METHOD AND APPARATUS
Disclosed is a text recognition method and apparatus. A text recognition post-processing method for reflecting user post-correction performed by a processor in an apparatus, the text recognition post-processing method includes training a deep learning post-processing model based on post-correction data comprising a partial image including post-correction target text and post-correction text when there is user post-correction for a text recognition result of an input image; and post-processing a text recognition result of another input image by applying the trained deep learning post-processing model.
TEXT RECOGNITION METHOD AND APPARATUS
Disclosed is a text recognition method and apparatus. A text recognition post-processing method for reflecting user post-correction performed by a processor in an apparatus, the text recognition post-processing method includes training a deep learning post-processing model based on post-correction data comprising a partial image including post-correction target text and post-correction text when there is user post-correction for a text recognition result of an input image; and post-processing a text recognition result of another input image by applying the trained deep learning post-processing model.
OPTICAL CHARACTER RECOGNITION QUALITY EVALUATION AND OPTIMIZATION
A processor may receive an image and determine a number of foreground pixels in the image. The processor may obtain a result of optical character recognition (OCR) processing performed on the image. The processor may identify at least one bounding box surrounding at least one portion of text in the result and overlay the at least one bounding box on the image to form a masked image. The processor may determine a number of foreground pixels in the masked image and a decrease in the number of foreground pixels in the masked image relative to the number of foreground pixels in the image. Based on the decrease, the processor may modify an aspect of the OCR processing for subsequent image processing.
OPTICAL CHARACTER RECOGNITION QUALITY EVALUATION AND OPTIMIZATION
A processor may receive an image and determine a number of foreground pixels in the image. The processor may obtain a result of optical character recognition (OCR) processing performed on the image. The processor may identify at least one bounding box surrounding at least one portion of text in the result and overlay the at least one bounding box on the image to form a masked image. The processor may determine a number of foreground pixels in the masked image and a decrease in the number of foreground pixels in the masked image relative to the number of foreground pixels in the image. Based on the decrease, the processor may modify an aspect of the OCR processing for subsequent image processing.