Patent classifications
G06V30/226
METHOD, APPARATUS, AND COMPUTER-READABLE STORAGE MEDIUM FOR RECOGNIZING CHARACTERS IN A DIGITAL DOCUMENT
Method, computer readable medium, and apparatus of recognizing character zone in a digital document. In an embodiment, the method includes classifying a segment of the digital document as including text, calculating at least one parameter value associated with the classified segment of the digital document, determining, based on the calculated at least one parameter value, a zonal parameter value, classifying the segment of the digital document as a handwritten text zone or as a printed text zone based on the determined zonal parameter value and a threshold value, the threshold value being based on a selection of an intersection of a handwritten text distribution profile and a printed text distribution profile, each of the handwritten text distribution profile and the printed text distribution profile being associated with a zonal parameter corresponding to the determined zonal parameter value, and generating, based on the classifying, a modified version of the digital document.
METHOD, APPARATUS, AND COMPUTER-READABLE STORAGE MEDIUM FOR RECOGNIZING CHARACTERS IN A DIGITAL DOCUMENT
Method, computer readable medium, and apparatus of recognizing character zone in a digital document. In an embodiment, the method includes classifying a segment of the digital document as including text, calculating at least one parameter value associated with the classified segment of the digital document, determining, based on the calculated at least one parameter value, a zonal parameter value, classifying the segment of the digital document as a handwritten text zone or as a printed text zone based on the determined zonal parameter value and a threshold value, the threshold value being based on a selection of an intersection of a handwritten text distribution profile and a printed text distribution profile, each of the handwritten text distribution profile and the printed text distribution profile being associated with a zonal parameter corresponding to the determined zonal parameter value, and generating, based on the classifying, a modified version of the digital document.
INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND COMPUTER PROGRAM
An information processing apparatus (10) includes a controller (11) that acquires an image containing a figure and a character string and generates association information indicating an association between the figure and the character string based on a positional relationship between the figure and the character string in the image.
OBJECT DETECTION USING NEURAL NETWORKS
Systems and methods for facilitating an automated detection of an object in a test document are disclosed. A system may include a processor including a dataset generator. The dataset generator may obtain a first input image and a first original document from a data lake. The dataset generator may prune a portion of the first original document to obtain a pruned image. The dataset generator may blend the first input image with the pruned image to generate a modified image. The modified image may include the pruned image bearing the first pre-defined representation. The modified image may be combined with the first original document to generate a training dataset. The training dataset may be utilized to train a neural network based model to obtain a trained model for the automated detection of the object in the test document.
OBJECT DETECTION USING NEURAL NETWORKS
Systems and methods for facilitating an automated detection of an object in a test document are disclosed. A system may include a processor including a dataset generator. The dataset generator may obtain a first input image and a first original document from a data lake. The dataset generator may prune a portion of the first original document to obtain a pruned image. The dataset generator may blend the first input image with the pruned image to generate a modified image. The modified image may include the pruned image bearing the first pre-defined representation. The modified image may be combined with the first original document to generate a training dataset. The training dataset may be utilized to train a neural network based model to obtain a trained model for the automated detection of the object in the test document.
Image processing apparatus, image processing method, and storage medium
Character recognition processing suitable to a handwritten character area and a printed character area among character areas in a scanned image of a document is performed. Next, character recognition results for the handwritten character area and character recognition results for the printed character area are integrated and a likelihood indicating a probability of being an extraction target is calculated for a candidate character string that is an extraction candidate among the integrated character recognition results and a character string that is the item value is determined. Then, at the time of the determination, different evaluation indications are used in a case where a character originating from the handwritten character area is included in characters constituting the candidate character string and in a case where such a character is not included.
METHOD FOR PROVIDING COACHING SERVICE BASED ON HANDWRITING INPUT AND SERVER THEREFOR
A method for providing a coaching service based on a handwriting input and a server therefor are disclosed. The method includes receiving handwriting input information of a learner from a learner interface, detecting a behavioral pattern of the learner based on the handwriting input information and information on a learning item which the learner performs learning, determining whether there is an abnormality in the behavioral pattern, and obtaining a feedback based on a determination result about the abnormality.
Method, apparatus, and computer-readable storage medium for recognizing characters in a digital document
Method, computer readable medium, and apparatus of recognizing character zone in a digital document. In an embodiment, the method includes classifying a segment of the digital document as including text, calculating at least one parameter value associated with the classified segment of the digital document, determining, based on the calculated at least one parameter value, a zonal parameter value, classifying the segment of the digital document as a handwritten text zone or as a printed text zone based on the determined zonal parameter value and a threshold value, the threshold value being based on a selection of an intersection of a handwritten text distribution profile and a printed text distribution profile, each of the handwritten text distribution profile and the printed text distribution profile being associated with a zonal parameter corresponding to the determined zonal parameter value, and generating, based on the classifying, a modified version of the digital document.
Method, apparatus, and computer-readable storage medium for recognizing characters in a digital document
Method, computer readable medium, and apparatus of recognizing character zone in a digital document. In an embodiment, the method includes classifying a segment of the digital document as including text, calculating at least one parameter value associated with the classified segment of the digital document, determining, based on the calculated at least one parameter value, a zonal parameter value, classifying the segment of the digital document as a handwritten text zone or as a printed text zone based on the determined zonal parameter value and a threshold value, the threshold value being based on a selection of an intersection of a handwritten text distribution profile and a printed text distribution profile, each of the handwritten text distribution profile and the printed text distribution profile being associated with a zonal parameter corresponding to the determined zonal parameter value, and generating, based on the classifying, a modified version of the digital document.
METHODS AND SYSTEMS FOR ADAPTIVE, TEMPLATE-INDEPENDENT HANDWRITING EXTRACTION FROM IMAGES USING MACHINE LEARNING MODELS
Methods and systems for adaptive, template-independent handwriting extraction from images using machine learning models and without manual localization or review. For example, the system may receive an input image, wherein the input image comprises native printed content and handwritten content. The system may process the input image with a model to generate an output image, wherein the output image comprises extracted handwritten content based on the native handwritten content. The system may process the output image to digitally recognize the extracted handwritten content. The system may generate a digital representation of the input image, wherein the digital representation comprises the native printed content and the digitally recognized extracted handwritten content.