G06V30/00

Machined learning supporting document data extraction

Improved techniques to access content from documents in an automated fashion. The improved techniques permit content within documents to be retrieved and then used by computer systems operating various software programs (e.g., application programs), such as an extraction program. Documents, especially business transaction documents, often have various descriptors (or tables) and values that form key-value pairs. The improved techniques permit key-value pairs within documents to be recognized and extracted from documents. Consequently, RPA systems are able to accurately understand the content of tables within documents so that users and/or software robots can operate on the documents with increased reliability and flexibility.

Machined learning supporting document data extraction

Improved techniques to access content from documents in an automated fashion. The improved techniques permit content within documents to be retrieved and then used by computer systems operating various software programs (e.g., application programs), such as an extraction program. Documents, especially business transaction documents, often have various descriptors (or tables) and values that form key-value pairs. The improved techniques permit key-value pairs within documents to be recognized and extracted from documents. Consequently, RPA systems are able to accurately understand the content of tables within documents so that users and/or software robots can operate on the documents with increased reliability and flexibility.

Searchable table extraction

Tables including cells can be extracted from an input document image, as objects and processed to be output in an XML format. The tables can be filtered based on one or more keywords, to reduce a number of the tables. The filtered tables that include the one or more keywords can be obtained. A query can be executed on the filtered tables, the query including one or more keys composed based on cell level information associated with the filtered tables. One or more cells among the cells can be identified based on the one or more keys. As a response to the query, the target content that corresponds to the one or more cells can be provided.

MACHINED LEARNING SUPPORTING DOCUMENT DATA EXTRACTION

Improved techniques to access content from documents in an automated fashion. The improved techniques permit content within documents to be retrieved and then used by computer systems operating various software programs (e.g., application programs), such as an extraction program. Documents, especially business transaction documents, often have various descriptors (or tables) and values that form key-value pairs. The improved techniques permit key-value pairs within documents to be recognized and extracted from documents. Consequently, RPA systems are able to accurately understand the content of tables within documents so that users and/or software robots can operate on the documents with increased reliability and flexibility.

MACHINED LEARNING SUPPORTING DOCUMENT DATA EXTRACTION

Improved techniques to access content from documents in an automated fashion. The improved techniques permit content within documents to be retrieved and then used by computer systems operating various software programs (e.g., application programs), such as an extraction program. Documents, especially business transaction documents, often have various descriptors (or tables) and values that form key-value pairs. The improved techniques permit key-value pairs within documents to be recognized and extracted from documents. Consequently, RPA systems are able to accurately understand the content of tables within documents so that users and/or software robots can operate on the documents with increased reliability and flexibility.

AUTOMATIC DEVELOPMENT AND ENHANCEMENT OF DEEP LEARNING MODEL FOR DATA EXTRACTION USING FEEDBACK LOOP

Systems and methods for deep learning model development for data extraction using a feedback loop. A system generates an interactive graphical user interface (GUI) on one or more user devices for displaying a document with data extracted from the document by a data extraction model together with a user interaction tool allowing the user to correct the extracted data. The system receives, via the interactive GUI, correction information for the extracted data and monitoring performance characteristics of the extraction model in real-time based on the user correction information. The system automatically updates and trains the extraction model using the correction information responsive to detecting that the performance characteristics meet a predetermined performance reduction condition.

Neural network architecture for classifying documents

A system to classify image of a document using neural network architecture is provided. The system includes a storage device storing the image derived from the document having text information. The system includes a document importer operable to perform optical character recognition to convert image data in the image to machine readable data. The system includes a neural network that perform semantic enrichment and positional context for the terms of interest present in the image. The neural network is configured to take as input the machine-readable data and the image and combine both the machine-readable data and the image to classify the image of the document based on the positional context of the terms of interest.

Handwriting text recognition system based on neural network
12347220 · 2025-07-01 · ·

A handwriting text recognition system based on neural network includes a stroke input processor for receiving tracks of online handwriting texts, a string database for storing a large amount of the tracks; a word recognition neural network; and an after-processor being connected to the string database and the output interface of the text recognition neural network; The handwriting text recognition system based on neural network provides higher rates of confidences. Some natural languages frequently used all over the world can be recognized with a higher accuracy (including languages written from right to left and from left to right). The association relations between the input strokes and the character strings can be provided. It could support any strokes with irregular written orders.

Method, apparatus, and system for character recognition using context

A neural network-based optical character recognition (OCR) method, system, and apparatus may use context provided by a neural network-based language model (LM) to improve character recognition in images. In embodiments, a fuzzy OCR system receives an input image and uses aggregated classes of visually similar characters to identify a most probable aggregated class of visually similar characters. That aggregated class is used as an input to the LM, which, in embodiments, employs a portion of text, for example a word, a phrase, a sentence, or a paragraph, from the input image to provide sufficient context to identify the correct character from the aggregated class of visually similar characters.

Handwriting recognition method and apparatus, and electronic device and storage medium
12380721 · 2025-08-05 · ·

A handwriting recognition method and apparatus, and an electronic device and a storage medium are provided. The method includes: acquiring a text image containing handwritten text; inputting the text image into a convolutional neural network, and extracting a CNN feature and a HOG feature of the text image; and extracting the handwritten text in text image according to the CNN feature and the HOG feature.