Patent classifications
G06V30/26
METHODS, SYSTEMS, ARTICLES OF MANUFACTURE AND APPARATUS TO LABEL TEXT ON IMAGES
Methods, systems, articles of manufacture and apparatus are disclosed to label text on images. An example apparatus includes colorizer circuitry to apply color to text boxes corresponding to optical character recognition (OCR) data associated with an image, OCR manager circuitry to render an OCR text prompt associated with the OCR data, the OCR text prompt to be rendered proximate to respective ones of the text boxes, the OCR text prompt to display a text portion of the OCR data, and edit circuitry to (a) render an interface in response to selection of the OCR text prompt, the interface populated with the text portion of the OCR data, and (b) in response to an overwrite input to the interface, update the text portion of the OCR data in a memory corresponding to the image.
SYSTEM FOR TRANSPORTATION AND SHIPPING RELATED DATA EXTRACTION
A system is discussed herein that is configured for extracting data from documents. In particular, the system may be utilized for automated, computerized checking of transit and shipping related documents. For example, the documents may include various data, such as delivery dates, prices, inventory identification, personnel identification, container identification, customs documents, transport documents, a combination thereof, and the like.
MOVING TEXT REGION DETECTION FOR BROKEN TEXT RECOVERY
One embodiment provides a method comprising receiving content for presentation on a display, and obtaining one or more sample frames of the content. The method further comprises generating a set of features based on one or more horizontal edge signals and one or more vertical edge signals of the one or more sample frames. The method further comprises utilizing a classification model to detect, based on the set of features, a region of interest of moving text in the one or more sample frames.
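The feature-generation step above can be illustrated with a minimal sketch. It assumes a sample frame is a grayscale image stored as a list of rows, and it takes the "edge signals" to be absolute differences between horizontally and vertically adjacent pixels; the function name `edge_features` and this particular definition are illustrative assumptions, not the patented method. The classification model itself is not shown.

```python
def edge_features(frame):
    """Return (mean horizontal edge strength, mean vertical edge strength).

    frame: list of rows of grayscale integers, at least 2x2 (assumed layout).
    """
    rows, cols = len(frame), len(frame[0])
    # Differences between horizontally adjacent pixels.
    horizontal = [
        abs(frame[r][c + 1] - frame[r][c])
        for r in range(rows)
        for c in range(cols - 1)
    ]
    # Differences between vertically adjacent pixels.
    vertical = [
        abs(frame[r + 1][c] - frame[r][c])
        for r in range(rows - 1)
        for c in range(cols)
    ]
    return (sum(horizontal) / len(horizontal), sum(vertical) / len(vertical))


# A frame of alternating vertical stripes changes only along each row.
striped = [[0, 255, 0, 255], [0, 255, 0, 255]]
features = edge_features(striped)
```

In a fuller pipeline, feature vectors like this one would be computed per region and passed to a trained classifier to flag regions of moving text.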
Processing apparatus, processing method, and non-transitory storage medium
The present invention provides a processing apparatus (10) including an acquisition unit (11) that acquires an image of a fill-in form including a plurality of first fill-in fields where a numerical value is filled in, and a second fill-in field where a sum total of the numerical values filled in a plurality of the first fill-in fields is filled in, an analysis unit (12) that analyzes the image, and recognizes the value filled in a plurality of the first fill-in fields and the value filled in the second fill-in field, a determination unit (13) that determines whether a sum total of recognition results of the value filled in a plurality of the first fill-in fields and a recognition result of the value filled in the second fill-in field match each other, and a processing unit (14) that executes error processing when a sum total of the recognition results of the value filled in a plurality of the first fill-in fields and the recognition result of the value filled in the second fill-in field do not match each other.
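The check performed by the determination unit can be sketched as follows. This is a minimal illustration that assumes the recognition results arrive as strings; the helper names `parse_amount` and `check_total` are hypothetical, and treating an unreadable field as a mismatch is a design assumption rather than something the abstract specifies.

```python
from decimal import Decimal, InvalidOperation


def parse_amount(text):
    """Convert a recognized string such as '1,200' to Decimal, or None if unreadable."""
    try:
        return Decimal(text.replace(",", "").strip())
    except InvalidOperation:
        return None


def check_total(first_fields, second_field):
    """Return True when the recognized values sum to the recognized total."""
    values = [parse_amount(t) for t in first_fields]
    total = parse_amount(second_field)
    # Assumption: any field that fails to parse counts as a mismatch.
    if total is None or any(v is None for v in values):
        return False
    return sum(values) == total


if not check_total(["100", "250", "1,200"], "1,560"):
    # Placeholder for the error processing described above.
    needs_review = True
```

Using `Decimal` rather than `float` avoids rounding surprises when the fields contain monetary amounts with fractional parts.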
Text extraction using optical character recognition
Provided herein are systems and methods for extracting text from a document. Different optical character recognition (OCR) tools are used to extract different versions of the text in the document. Metrics evaluating the quality of the extracted text are compared to identify and select higher quality extracted text. A selected portion of text is compared to a threshold to ensure minimal quality. The selected portion of text is then saved. Error correction can be applied to the selected portion of text based on errors specific to the OCR tools or the document contents.
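The select-then-threshold flow described above might look like the sketch below. The quality metric here (share of characters that are letters, digits, or whitespace) is a stand-in chosen for illustration, since the abstract does not say which metrics are used; the names `quality` and `select_text` and the threshold value are likewise assumptions.

```python
def quality(text):
    """Hypothetical metric: fraction of characters that are alphanumeric or whitespace."""
    if not text:
        return 0.0
    good = sum(ch.isalnum() or ch.isspace() for ch in text)
    return good / len(text)


def select_text(candidates, threshold=0.8):
    """Pick the best-scoring extraction; return None if it falls below the threshold.

    candidates: non-empty dict mapping an OCR tool name to its extracted text.
    """
    best_tool, best_text = max(candidates.items(), key=lambda kv: quality(kv[1]))
    if quality(best_text) < threshold:
        return None
    return best_tool, best_text


candidates = {
    "tool_a": "Inv0ice #@!~ t0tal^^",
    "tool_b": "Invoice total 42",
}
selected = select_text(candidates)
```

A real system would likely combine several metrics, such as dictionary hit rate or per-character confidence reported by each tool, before applying tool-specific error correction to the selected text.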
INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND STORAGE MEDIUM
Provided is an information processing apparatus including: a character recognition unit configured to perform a character recognition process on an image of a processing target document; a generation unit configured to generate an instruction message based on a result of the character recognition process, the instruction message being a message for causing a large language model to reply with a document type of the processing target document; a transmission unit configured to transmit the instruction message in order to obtain a reply to the instruction message from the large language model; and a reception unit configured to receive the reply to the instruction message from the large language model.
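As a rough illustration of the generation unit, the sketch below builds an instruction message from OCR output. The prompt wording, the function name `build_instruction_message`, and the idea of supplying a list of candidate document types are all assumptions made for the example; the transmission and reception steps are omitted because they depend on the particular large language model service.

```python
def build_instruction_message(ocr_text, candidate_types):
    """Compose a prompt asking a language model to name the document type.

    ocr_text: text produced by the character recognition process.
    candidate_types: hypothetical list of allowed document types.
    """
    options = ", ".join(candidate_types)
    return (
        "The following text was extracted from a document by OCR.\n"
        f"Reply with exactly one document type from this list: {options}.\n"
        "---\n"
        f"{ocr_text.strip()}\n"
        "---"
    )


message = build_instruction_message(
    "  INVOICE No. 1234\nTotal due: 1,550  ",
    ["invoice", "receipt", "purchase order"],
)
```

Constraining the reply to a fixed list makes the response easier to validate once it is received.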