G06V30/2455

Image processing device, image forming apparatus, image processing method, and non-transitory computer-readable storage medium
11252302 · 2022-02-15

An image processing device includes: an image classifying section which, through a convolutional neural network, classifies each pixel of input image data as expressing or not expressing a handwritten image to calculate a classification probability of each pixel, the classification probability being a probability that the handwritten image is expressed; a threshold setting section which sets a first threshold when removal processing to remove the handwritten image is performed and a second threshold which is smaller than the first threshold when emphasis processing to emphasize the handwritten image is performed; and an image processor which adjusts a gradation value of pixels with a classification probability no smaller than the first threshold to remove the handwritten image when the removal processing is performed and adjusts the gradation value of pixels with a classification probability no smaller than the second threshold to emphasize the handwritten image when the emphasis processing is performed.
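
The two-threshold rule in this abstract can be illustrated with a minimal sketch. It assumes the per-pixel classification probabilities have already been produced by the network; the threshold values, the function name `process`, and the choice of white for removal and black for emphasis are hypothetical and are not taken from the patent.

```python
# Hypothetical values: the abstract only requires the emphasis threshold
# to be smaller than the removal threshold.
REMOVAL_THRESHOLD = 0.8
EMPHASIS_THRESHOLD = 0.5


def process(gray, prob, mode, background=255, ink=0):
    """Adjust gradation values of pixels classified as handwriting.

    gray: 2D list of gradation values (0-255).
    prob: 2D list of the same shape with classification probabilities.
    mode: "remove" or "emphasize".
    """
    if mode == "remove":
        threshold, new_value = REMOVAL_THRESHOLD, background
    elif mode == "emphasize":
        threshold, new_value = EMPHASIS_THRESHOLD, ink
    else:
        raise ValueError("mode must be 'remove' or 'emphasize'")
    # "No smaller than the threshold" is implemented as >=.
    return [
        [new_value if p >= threshold else g for g, p in zip(gray_row, prob_row)]
        for gray_row, prob_row in zip(gray, prob)
    ]
```

With these assumed values, a pixel with probability 0.6 is left untouched by removal but is darkened by emphasis, because the emphasis threshold is the lower of the two.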

UTILIZING MACHINE LEARNING AND IMAGE FILTERING TECHNIQUES TO DETECT AND ANALYZE HANDWRITTEN TEXT

In some implementations, a device may receive an image that depicts handwritten text. The device may determine that a section of the image includes the handwritten text. The device may analyze, using a first image processing technique, the section to identify subsections of the section that include individual words of the handwritten text. The device may reconfigure, using a second image processing technique, the subsections to create preprocessed word images associated with the individual words. The device may analyze, using a word recognition model, the preprocessed word images to generate digitized words that are associated with the preprocessed word images. The device may verify, based on a reference data structure, that the digitized words correspond to recognized words of the word recognition model. The device may generate, based on verifying the digitized words, digital text according to a sequence of the digitized words in the section.
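
The last two steps (verification against a reference data structure and assembly of the digital text in sequence) might look like the sketch below. It assumes the reference data structure is a plain set of known words and that each digitized word carries its position in the section; `build_text` and the decision to set unverified words aside are illustrative assumptions, not details from the abstract.

```python
def build_text(recognized, vocabulary):
    """Assemble digital text from verified words.

    recognized: list of (position, word) pairs from the recognition model.
    vocabulary: set of lowercase reference words (assumed data structure).
    Returns the text in section order and the words that failed verification.
    """
    ordered = sorted(recognized, key=lambda item: item[0])
    verified, rejected = [], []
    for position, word in ordered:
        if word.lower() in vocabulary:
            verified.append(word)
        else:
            # Assumption: unverified words are reported, not emitted.
            rejected.append((position, word))
    return " ".join(verified), rejected
```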

METHOD OF GENERATING FONT DATABASE, AND METHOD OF TRAINING NEURAL NETWORK MODEL
20220180650 · 2022-06-09

A method of generating a font database and a method of training a neural network model are provided, which relate to the field of artificial intelligence, in particular to computer vision and deep learning technology. The method of generating the font database includes: determining, by using a trained similarity comparison model, a basic font database most similar to handwriting font data of a target user in a plurality of basic font databases as a candidate font database; and adjusting, by using a trained basic font database model for generating the candidate font database, the handwriting font data of the target user, so as to obtain a target font database for the target user.

PROCESSING DIGITIZED HANDWRITING

A handwritten text processing system processes a digitized document including handwritten text input to generate an output version of the digitized document that allows users to execute text processing functions on the textual content of the digitized document. Each word of the digitized data is extracted by converting the digitized document into images, binarizing the images, and segmenting the images into binary image patches. Each binary image patch is further processed to identify if the word is machine-generated or if the word is handwritten. The output version is generated by combining underlying images of the pages of the digitized document with words from the pages superimposed in a transparent font at positions that coincide with the positions of the words in the underlying images.
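
The binarize-and-segment step can be sketched as follows. This is an illustration under stated assumptions, not the patented method: it uses a fixed global threshold and treats each 4-connected ink region as one patch, whereas a real system would likely use adaptive thresholding and group regions into words. The function names `binarize` and `patches` are hypothetical.

```python
from collections import deque


def binarize(gray, threshold=128):
    """Return 1 for dark (ink) pixels and 0 for light background pixels."""
    return [[1 if value < threshold else 0 for value in row] for row in gray]


def patches(binary):
    """Return bounding boxes (top, left, bottom, right) of 4-connected ink regions."""
    height = len(binary)
    width = len(binary[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for r in range(height):
        for c in range(width):
            if not binary[r][c] or seen[r][c]:
                continue
            # Breadth-first search over one connected region.
            seen[r][c] = True
            queue = deque([(r, c)])
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and binary[ny][nx] and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes
```

Each bounding box could then be cropped and passed to a classifier that decides whether the patch is machine-generated or handwritten.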

Method of generating font database, and method of training neural network model

A method of generating a font database and a method of training a neural network model are provided, which relate to the field of artificial intelligence, in particular to computer vision and deep learning technology. The method of generating the font database includes: determining, by using a trained similarity comparison model, a basic font database most similar to handwriting font data of a target user in a plurality of basic font databases as a candidate font database; and adjusting, by using a trained basic font database model for generating the candidate font database, the handwriting font data of the target user, so as to obtain a target font database for the target user.

Handwritten content removing method and device and storage medium

A handwritten content removing method and device and a storage medium are provided. The handwritten content removing method comprises: acquiring an input image of a text page to be processed, the input image comprising a handwritten region that contains handwritten content (S10); identifying the input image so as to determine the handwritten content in the handwritten region (S11); and removing the handwritten content from the input image so as to obtain an output image (S12).

Method, apparatus, and system for auto-registration of nested tables from unstructured cell association for table-based documentation

In some forms containing keywords and content, there may be nested levels of keywords, also referred to as a hierarchy. Content in the forms may be associated with one or more keywords in one or more of the nested levels, or in the hierarchy. Identifying keywords in adjacent cells in a table (with a nested keyword being either to the right of or below another keyword) enables distinguishing between keywords and content in filled forms, and enables correct association of content with respective keywords.

IDENTITY VERIFICATION OR IDENTIFICATION METHOD USING HANDWRITTEN SIGNATURES AFFIXED TO A DIGITAL SENSOR
20220222954 · 2022-07-14

A method for identifying or for verifying the identity of a user, using a plurality of previously acquired reference signature vectors, a handwritten signature of the user and at least one additional item of handwritten information linked to the user that are affixed beforehand to a digital sensor, in particular a mobile digital sensor, in which method: a) said handwritten signature of the user and said at least one additional item of information are fused in order to generate at least one test signature vector, b) said at least one test signature vector is compared with a plurality of said reference signature vectors, and c) a likelihood score is generated on the basis at least of this comparison in order to identify or to verify the identity of the user.
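
Steps a) to c) can be illustrated with a small sketch. The abstract does not say how the signature and the additional information are fused or how the score is computed, so the choices below are assumptions: fusion by concatenating feature vectors, comparison by Euclidean distance to the closest reference, and a score of 1 / (1 + distance). The names `fuse` and `likelihood` are hypothetical.

```python
import math


def fuse(signature_features, extra_features):
    """Step a): build a test signature vector (assumed: simple concatenation)."""
    return list(signature_features) + list(extra_features)


def likelihood(test_vector, reference_vectors):
    """Steps b) and c): compare with the references and return a score in (0, 1].

    The score is 1.0 for an exact match with some reference vector and
    decreases as the closest reference gets farther away.
    """
    closest = min(math.dist(test_vector, ref) for ref in reference_vectors)
    return 1.0 / (1.0 + closest)
```

A verification decision could then compare the score with an application-specific cutoff, which would have to be tuned on real signature data.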

Information processing device, information processing system and computer readable medium

An information processing device includes a processor configured to: group electronic documents that have been processed into one or more groups based on a degree of similarity of the electronic documents; determine a group, among the one or more groups, to which at least one received electronic document is to belong; determine whether the at least one received electronic document is a modified version of one or more electronic documents belonging to the determined group, the modified version having been partially modified with respect to the one or more electronic documents belonging to the determined group; and specify a blank portion in the at least one received electronic document by comparing the at least one received electronic document with the one or more electronic documents belonging to the determined group.
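
The group-determination step can be sketched as below. The sketch assumes documents are available as plain text and measures similarity with `difflib.SequenceMatcher` from the Python standard library; the abstract does not name a similarity measure, and the 0.6 cutoff and the function names are hypothetical.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Similarity degree between two document texts, in the range 0.0 to 1.0."""
    return SequenceMatcher(None, a, b).ratio()


def assign_group(received, groups, threshold=0.6):
    """Return the name of the most similar group, or None if none is close enough.

    groups: dict mapping a group name to a list of document texts.
    """
    best_name, best_score = None, 0.0
    for name, documents in groups.items():
        score = max((similarity(received, doc) for doc in documents), default=0.0)
        if score > best_score:
            best_name, best_score = name, score
    return best_name if best_score >= threshold else None
```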

INFORMATION PROCESSING APPARATUS, NON-TRANSITORY COMPUTER READABLE MEDIUM, AND CHARACTER RECOGNITION SYSTEM
20210319273 · 2021-10-14

An information processing apparatus includes a processor configured to acquire a result of character recognition of a handwritten character string and, in response to a determination that the handwritten character string includes a character or a symbol that refers to a reference character string that has appeared previously, replace the character or the symbol with the reference character string that is referred to by the character or the symbol.
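
A typical case of such a referring symbol is a ditto mark in a handwritten form. The sketch below assumes the recognition result is a list of field strings, that the referring symbols are a small fixed set, and that a symbol always refers to the most recent non-symbol entry; the set `DITTO_MARKS` and the function name are hypothetical.

```python
# Hypothetical set of characters or strings that refer back to an earlier entry.
DITTO_MARKS = {"〃", '"', "同上"}


def resolve_references(entries):
    """Replace referring symbols with the reference string they point to."""
    resolved = []
    last = None  # most recent entry that is not itself a referring symbol
    for entry in entries:
        is_ditto = entry.strip() in DITTO_MARKS
        if is_ditto and last is not None:
            resolved.append(last)
        else:
            # A symbol with nothing to refer to is kept unchanged.
            resolved.append(entry)
            if not is_ditto:
                last = entry
    return resolved
```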