G06V30/1475

Converting an image into a structured table

A system for converting an image of an unstructured table into a structured table is provided. The system may comprise a memory storing machine readable instructions. The system may include a processor to receive an image of an unstructured table and convert the image of the unstructured table into a structured table. Converting the image of the unstructured table into the structured table may include providing cell mapping and low confidence determination to highlight potentially misconverted content. The low confidence determination may be based on a first input and a second input. The processor may export the structured table, upon validation, to an application that supports structured tables.
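
The abstract does not say what the first and second inputs are. As a minimal sketch, assume they are two independent readings of the same table grid (for example, from two recognition passes), and flag a cell as low confidence when the readings disagree. The names `Cell`, `map_cells`, and `flagged` are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass
class Cell:
    row: int
    col: int
    text: str
    low_confidence: bool


def map_cells(first_input, second_input):
    """Map two readings of the same grid onto one structured table.

    Assumption: each input is a list of rows, each row a list of cell
    strings. A cell is flagged when the two readings differ.
    """
    if len(first_input) != len(second_input):
        raise ValueError("readings must have the same number of rows")
    table = []
    for r, (row_a, row_b) in enumerate(zip(first_input, second_input)):
        if len(row_a) != len(row_b):
            raise ValueError(f"row {r} has mismatched column counts")
        table.append([
            Cell(r, c, a, low_confidence=(a.strip() != b.strip()))
            for c, (a, b) in enumerate(zip(row_a, row_b))
        ])
    return table


def flagged(table):
    """Return (row, col) positions a reviewer should check before export."""
    return [(cell.row, cell.col)
            for row in table for cell in row if cell.low_confidence]


reading_a = [["Item", "Qty"], ["Bolt", "10"]]
reading_b = [["Item", "Qty"], ["Bolt", "1O"]]  # letter O instead of zero
table = map_cells(reading_a, reading_b)
print(flagged(table))
```

Only the flagged positions would need highlighting for validation; the rest of the table could be exported unchanged.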

Image processing apparatus for placing a character recognition target region at a position of a predetermined region in an image conforming to a predetermined format
10970579 · 2021-04-06

An image processing apparatus includes a storage device for storing a position of a predetermined region in an image conforming to a predetermined format, a processor for acquiring an input image including a character recognition target region, cutting out a region corresponding to the character recognition target region from the input image or an image generated from the input image to generate a corrected image in which the region is placed at the position of the predetermined region in the image conforming to the predetermined format, and detecting a character from the corrected image, and an output device for outputting information related to the detected character.
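
The cut-and-place step can be illustrated with a minimal sketch, assuming the image is a plain grid of pixel values and that the detected region and the stored target position are already known. The function name `place_region` and its parameter layout are hypothetical.

```python
def place_region(input_image, source_box, target_pos, canvas_size, background=0):
    """Cut a region out of input_image and place it on a blank canvas.

    Assumptions for illustration:
      source_box  = (top, left, height, width) of the detected region
      target_pos  = (top, left) of the predetermined region in the format
      canvas_size = (height, width) of the format-conforming image
    """
    top, left, height, width = source_box
    t_top, t_left = target_pos
    c_height, c_width = canvas_size
    if t_top + height > c_height or t_left + width > c_width:
        raise ValueError("region does not fit at the target position")
    canvas = [[background] * c_width for _ in range(c_height)]
    for dy in range(height):
        for dx in range(width):
            canvas[t_top + dy][t_left + dx] = input_image[top + dy][left + dx]
    return canvas


image = [
    [0, 0, 0],
    [0, 5, 6],
    [0, 7, 8],
]
# Move the 2x2 region found at (1, 1) to the expected position (0, 0).
corrected = place_region(image, (1, 1, 2, 2), (0, 0), (3, 3))
print(corrected)
```

Character detection would then run on the corrected image, where the text sits at the position the format expects.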

METHOD AND APPARATUS FOR REAL-TIME TEXT REPLACEMENT IN A NATURAL SCENE

In augmented reality (AR) and mixed reality (MR) representations of natural scenes that include text on different kinds of surfaces, real-time text replacement facilitates user involvement with and appreciation of the natural scenes. Determination of surface curvature using a three-dimensional (3D) camera enables determination of consequent textual distortion and necessary compensation in order to read text accurately. Translation, transliteration, or other modification of text and replacement with that text in a natural scene enables a user to participate more fully in the scene.

LIST AND TABULAR DATA EXTRACTION SYSTEM AND METHOD

A system and method for automating and improving tabular and list-based data extraction from a variety of document types is disclosed. The system and method detect and sort which documents include tables or lists, and perform row and column segmentation. In addition, the system and method apply Conditional Random Fields models to localize each table and semantic data understanding to map and export the extracted data to the desired format and arrangement.

SYSTEMS AND METHODS FOR MOBILE IMAGE CAPTURE AND PROCESSING OF DOCUMENTS
20210073786 · 2021-03-11

Techniques for processing images of documents captured using a mobile device are provided. The images can include different sides of a document from a mobile device for an authenticated transaction. In an example implementation, a method includes inspecting the images to detect a feature associated with a first side of the document. In response to determining an image is the first side of the document, a type of content is selected to be analyzed on the image of the first side and one or more regions of interest (ROIs) are identified on the image of the first side that are known to include the selected type of content. A process can include receiving a sub-image of the image of the first side from the preprocessing unit, and performing a content detection test on the sub-image.

IMAGE PROCESSING SYSTEM, IMAGE PROCESSING METHOD, AND PROGRAM
20210209393 · 2021-07-08

To speed up image processing, an obtaining means of an image processing system obtains a captured image of a document that includes a fixed part and an un-fixed part, where the document is captured by an image reader or an image capture device. A first shaping means shapes the captured image based on a feature of the document in a sample image and a feature of the document in the captured image so as to obtain a first shaped image. A detecting means detects a feature part of the fixed part from the first shaped image. A second shaping means shapes the first shaped image such that a position of the feature part detected by the detecting means is aligned with a predetermined position so as to obtain a second shaped image.

Systems and methods for mobile image capture and processing of documents

Techniques for processing images of documents captured using a mobile device are provided. The images can include different sides of a document from a mobile device for an authenticated transaction. In an example implementation, a method includes inspecting the images to detect a feature associated with a first side of the document. In response to determining an image is the first side of the document, a type of content is selected to be analyzed on the image of the first side and one or more regions of interest (ROIs) are identified on the image of the first side that are known to include the selected type of content. A process can include receiving a sub-image of the image of the first side from the preprocessing unit, and performing a content detection test on the sub-image.

IMAGE PROCESSING APPARATUS, NON-TRANSITORY COMPUTER READABLE RECORDING MEDIUM THAT RECORDS AN IMAGE PROCESSING PROGRAM, AND IMAGE PROCESSING METHOD
20200380657 · 2020-12-03

An image processing apparatus includes: a character determining unit configured to divide the read image into multiple blocks, each of the multiple blocks including multiple characters, and determine an inclination of each of the multiple characters included in each of the multiple blocks; a block processing unit configured to detect a change point block, the change point block being a block that includes at least a first threshold number of characters having an inclination in a first inclination interval and at least the first threshold number of characters having an inclination in a second inclination interval, the second inclination interval being different from the first inclination interval; and a fold determining unit configured to determine that the document is folded if the change point block is detected.
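
The change point test described above can be sketched as follows, assuming each block has already been reduced to a list of per-character inclinations in degrees. The function names, the half-open interval convention, and the sample values are assumptions for illustration only.

```python
def count_in(inclinations, interval):
    """Count characters whose inclination lies in [low, high)."""
    low, high = interval
    return sum(1 for angle in inclinations if low <= angle < high)


def is_change_point_block(inclinations, first_interval, second_interval, threshold):
    """A block is a change point block when both intervals are well populated."""
    return (count_in(inclinations, first_interval) >= threshold
            and count_in(inclinations, second_interval) >= threshold)


def is_folded(blocks, first_interval, second_interval, threshold):
    """The document is judged folded if any block is a change point block."""
    return any(
        is_change_point_block(block, first_interval, second_interval, threshold)
        for block in blocks
    )


blocks = [
    [0.5, 1.0, 0.2, 0.8],  # all characters roughly upright
    [0.4, 0.9, 6.1, 7.0],  # two upright, two tilted: straddles a fold
]
print(is_folded(blocks, (-2.0, 2.0), (5.0, 9.0), threshold=2))
```

Requiring the threshold in both intervals keeps a single misread character from being mistaken for a fold.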

SYSTEMS AND METHODS FOR AUTOMATIC IMAGE CAPTURE ON A MOBILE DEVICE

Real-time evaluation and enhancement of image quality prior to capturing an image of a document on a mobile device is provided. An image capture process is initiated on a mobile device during which a user of the mobile device prepares to capture the image of the document, utilizing hardware and software on the mobile device to measure and achieve optimal parameters for image capture. Feedback may be provided to a user of the mobile device to instruct the user on how to manually optimize certain parameters relating to image quality, such as the angle, motion and distance of the mobile device from the document. When the optimal parameters for image capture of the document are achieved, at least one image of the document is automatically captured by the mobile device.
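
A minimal sketch of the feedback-then-capture decision, assuming the device reports angle, motion, and distance as plain numbers. The `Limits` values, the units, and the hint wording are hypothetical; the abstract does not specify them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Limits:
    # Hypothetical thresholds, chosen only for illustration.
    max_angle_deg: float = 5.0
    max_motion: float = 0.2
    min_distance_cm: float = 20.0
    max_distance_cm: float = 40.0


def feedback(angle_deg, motion, distance_cm, limits=Limits()):
    """Return hints for the user; an empty list means every parameter is acceptable."""
    hints = []
    if abs(angle_deg) > limits.max_angle_deg:
        hints.append("hold the device parallel to the document")
    if motion > limits.max_motion:
        hints.append("hold the device steady")
    if distance_cm < limits.min_distance_cm:
        hints.append("move farther from the document")
    elif distance_cm > limits.max_distance_cm:
        hints.append("move closer to the document")
    return hints


def should_capture(angle_deg, motion, distance_cm, limits=Limits()):
    """Capture automatically once no corrective hint remains."""
    return not feedback(angle_deg, motion, distance_cm, limits)


print(feedback(12.0, 0.1, 50.0))
print(should_capture(2.0, 0.1, 30.0))
```

In a live preview loop, `feedback` would be evaluated on each frame and the image captured on the first frame for which `should_capture` is true.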

Extracting data from electronic documents

A structured data processing system includes hardware processors and a memory in communication with the hardware processors. The memory stores a data structure and an execution environment. The data structure includes an electronic document. The execution environment includes a data extraction solver configured to perform operations including identifying a particular page of the electronic document; performing an optical character recognition (OCR) on the page to determine a plurality of alphanumeric text strings on the page; determining a type of the page; determining a layout of the page; determining at least one table on the page based at least in part on the determined type of the page and the determined layout of the page; and extracting a plurality of data from the determined table on the page. The execution environment also includes a user interface module that generates a user interface that renders graphical representations of the extracted data; and a transmission module that transmits data that represents the graphical representations.