G06V30/416

PRINTING SYSTEM, IMAGE PROCESSING APPARATUS, AND COMPARISON METHOD
20230049493 · 2023-02-16 · ·

A printing system includes processing circuitry. The processing circuitry acquires print data of a plurality of pages and extracts comparison data from the print data for each page. The processing circuitry, from first image data read from a printed material on which the print data is printed, acquires second image data at a position corresponding to the comparison data, for each page of the printed material. The processing circuitry outputs a comparison result of the comparison data and the read image data for each page.

PRINTING SYSTEM, IMAGE PROCESSING APPARATUS, AND COMPARISON METHOD
20230049493 · 2023-02-16 · ·

A printing system includes processing circuitry. The processing circuitry acquires print data of a plurality of pages and extracts comparison data from the print data for each page. The processing circuitry, from first image data read from a printed material on which the print data is printed, acquires second image data at a position corresponding to the comparison data, for each page of the printed material. The processing circuitry outputs a comparison result of the comparison data and the read image data for each page.

Systems, Methods, and Devices for a Form Converter

Methods, systems, and devices for automatically converting a static electronic file format and its various elements into a dynamic digital form with executable elements that can be customized before being used. The resulting digital form is compatible with digital workflows and processes. The disclosed systems, methods, and devices go beyond simply extracting data from the original electronic file format and instead enable users to, without using code, convert the source form into a dynamic, malleable digital form while still retaining the source form's original purpose and functionality.

Systems, Methods, and Devices for a Form Converter

Methods, systems, and devices for automatically converting a static electronic file format and its various elements into a dynamic digital form with executable elements that can be customized before being used. The resulting digital form is compatible with digital workflows and processes. The disclosed systems, methods, and devices go beyond simply extracting data from the original electronic file format and instead enable users to, without using code, convert the source form into a dynamic, malleable digital form while still retaining the source form's original purpose and functionality.

Representative document hierarchy generation

In some aspects, a method includes performing optical character recognition (OCR) based on data corresponding to a document to generate text data, detecting one or more bounded regions from the data based on a predetermined boundary rule set, and matching one or more portions of the text data to the one or more bounded regions to generate matched text data. Each bounded region of the one or more bounded regions encloses a corresponding block of text. The method also includes extracting features from the matched text data to generate a plurality of feature vectors and providing the plurality of feature vectors to a trained machine-learning classifier to generate one or more labels associated with the one or more bounded regions. The method further includes outputting metadata indicating a hierarchical layout associated with the document based on the one or more labels and the matched text data.

Representative document hierarchy generation

In some aspects, a method includes performing optical character recognition (OCR) based on data corresponding to a document to generate text data, detecting one or more bounded regions from the data based on a predetermined boundary rule set, and matching one or more portions of the text data to the one or more bounded regions to generate matched text data. Each bounded region of the one or more bounded regions encloses a corresponding block of text. The method also includes extracting features from the matched text data to generate a plurality of feature vectors and providing the plurality of feature vectors to a trained machine-learning classifier to generate one or more labels associated with the one or more bounded regions. The method further includes outputting metadata indicating a hierarchical layout associated with the document based on the one or more labels and the matched text data.

Multi-step document information extraction
11557139 · 2023-01-17 · ·

Briefly, embodiments of a system, method, and article for receiving a document from a remote device and identifying items in the document. Various operations may be performed based on one or more dependencies of the identified items. For example, additional items may be identified in the document. One or more of the identified items may be parsed. A correspondence between the identified items and a second set of items may be determined. The identified items may be validated based on a set of rules. One or more of the identified items may be transmitted to the remote device in response to the performance of the various operations.

Multi-step document information extraction
11557139 · 2023-01-17 · ·

Briefly, embodiments of a system, method, and article for receiving a document from a remote device and identifying items in the document. Various operations may be performed based on one or more dependencies of the identified items. For example, additional items may be identified in the document. One or more of the identified items may be parsed. A correspondence between the identified items and a second set of items may be determined. The identified items may be validated based on a set of rules. One or more of the identified items may be transmitted to the remote device in response to the performance of the various operations.

Collision avoidance for document field placement

Users of a database management engine may generate fillable digital documents by mapping interface elements onto form documents. When a user maps interface elements onto a form document, the user may accidentally overlap two or more interface elements. To rectify this, the database management engine may modify the position of one of interface elements based on a set of positioning rules. In addition, the database management engine may identify and suggest mappings to users based on similar documents that have been previously mapped. The database management engine identifies similar documents using information about the document, the user, and the mapping itself. The mapping associated with the most similar document may be provided to the user as a suggested mapping. The database management engine converts the form document and finalized mapping into a fillable digital document. The fillable digital document is sent to recipients, who complete the fillable digital document.

Collision avoidance for document field placement

Users of a database management engine may generate fillable digital documents by mapping interface elements onto form documents. When a user maps interface elements onto a form document, the user may accidentally overlap two or more interface elements. To rectify this, the database management engine may modify the position of one of interface elements based on a set of positioning rules. In addition, the database management engine may identify and suggest mappings to users based on similar documents that have been previously mapped. The database management engine identifies similar documents using information about the document, the user, and the mapping itself. The mapping associated with the most similar document may be provided to the user as a suggested mapping. The database management engine converts the form document and finalized mapping into a fillable digital document. The fillable digital document is sent to recipients, who complete the fillable digital document.