G06V30/412

Apparatus and methods for extracting data from lineless table using delaunay triangulation and excess edge removal

A method for extracting data from lineless tables includes storing an image including a table in a memory. A processor operably coupled to the memory identifies a plurality of text-based characters in the image, and defines multiple bounding boxes based on the characters. Each of the bounding boxes is uniquely associated with at least one of the text-based characters. A graph including multiple nodes and multiple edges is generated based on the bounding boxes, using a graph construction algorithm. At least one of the edges is identified for removal from the graph, and removed from the graph to produce a reduced graph. The reduced graph can be sent to a neural network to predict row labels and column labels for the table.

Information processing apparatus for detecting a common attribute indicated in different tables and generating information about the common attribute, and information processing method, and non-transitory computer readable medium

An embodiment of the present invention provides an information processing apparatus for detecting an attribute indicated in different tables in common and generating information about the attribute. An information processing apparatus as an embodiment of the present invention includes a detector and an information generator. The detector detects a common attribute indicated in a first table and a second table. The information generator generates information about the common attribute based on contents of the first table and the second table.

Information processing apparatus for detecting a common attribute indicated in different tables and generating information about the common attribute, and information processing method, and non-transitory computer readable medium

An embodiment of the present invention provides an information processing apparatus for detecting an attribute indicated in different tables in common and generating information about the attribute. An information processing apparatus as an embodiment of the present invention includes a detector and an information generator. The detector detects a common attribute indicated in a first table and a second table. The information generator generates information about the common attribute based on contents of the first table and the second table.

Image processing apparatus and non-transitory computer readable medium

An image processing apparatus includes a first image generator and a second image generator. The first image generator generates a first image, including a predetermined ruled-line image and an inscription image, from a second sheet in a sheet group. The sheet group is obtained by stacking multiple sheets including a single first sheet and the second sheet. The first sheet has inscription information inscribed thereon. The second sheet has the inscription image corresponding to the inscription information transferred thereon and includes the ruled-line image. The second image generator generates a second image in which a surplus image is removed from the first image generated by the first image generator in accordance with a learning model that has learned to remove the surplus image different from the ruled-line image and the inscription image.

Image processing apparatus and non-transitory computer readable medium

An image processing apparatus includes a first image generator and a second image generator. The first image generator generates a first image, including a predetermined ruled-line image and an inscription image, from a second sheet in a sheet group. The sheet group is obtained by stacking multiple sheets including a single first sheet and the second sheet. The first sheet has inscription information inscribed thereon. The second sheet has the inscription image corresponding to the inscription information transferred thereon and includes the ruled-line image. The second image generator generates a second image in which a surplus image is removed from the first image generated by the first image generator in accordance with a learning model that has learned to remove the surplus image different from the ruled-line image and the inscription image.

VISUALIZATION OF THE IMPACT OF TRAINING DATA

An example operation may include one or more of generating a plurality of bounding boxes at a plurality of content areas in an image corresponding to a plurality of pieces of text within the image, converting the plurality of bounding boxes into a plurality of bounding box vectors based on attributes of the plurality of bounding boxes, training a machine learning model to transform a bounding box into a location in vector space based on the plurality of bounding box vectors, and storing the trained machine learning model in memory.

VISUALIZATION OF THE IMPACT OF TRAINING DATA

An example operation may include one or more of generating a plurality of bounding boxes at a plurality of content areas in an image corresponding to a plurality of pieces of text within the image, converting the plurality of bounding boxes into a plurality of bounding box vectors based on attributes of the plurality of bounding boxes, training a machine learning model to transform a bounding box into a location in vector space based on the plurality of bounding box vectors, and storing the trained machine learning model in memory.

DOCUMENT PROCESSING
20230022677 · 2023-01-26 ·

A method of document processing is provided. An implementation solution is: obtaining target text information and target layout information of a target document, the target text information includes target text included in the target document and character position information of the target text, and the target layout information is used to characterize the region where text in the target document is located; fusing the target text information and the target layout information to obtain first multimodal information of the target document; and inputting the first multimodal information into an intelligent document comprehension model, and obtaining at least one target word in the target document and at least one feature vector corresponding to the at least one target word output by the intelligent document comprehension model, each target word is related to semantics of the target document.

DOCUMENT PROCESSING
20230022677 · 2023-01-26 ·

A method of document processing is provided. An implementation solution is: obtaining target text information and target layout information of a target document, the target text information includes target text included in the target document and character position information of the target text, and the target layout information is used to characterize the region where text in the target document is located; fusing the target text information and the target layout information to obtain first multimodal information of the target document; and inputting the first multimodal information into an intelligent document comprehension model, and obtaining at least one target word in the target document and at least one feature vector corresponding to the at least one target word output by the intelligent document comprehension model, each target word is related to semantics of the target document.

FACILITATING IDENTIFICATION OF FILLABLE REGIONS IN A FORM

Methods and systems are provided for facilitating identification of fillable regions and/or data associated therewith. In embodiments, a candidate fillable region indicating a region in a form that is a candidate for being fillable is obtained. Textual context indicating text from the form and spatial context indicating positions of the text within the form are also obtained. Fillable region data associated with the candidate fillable region is generated, via a machine learning model, using the candidate fillable region, the textual context, and the spatial context. Thereafter, a fillable form is generated using the fillable region data, the fillable form having one or more fillable regions for accepting input.