G06V30/293

PATTERN RECOGNITION DEVICE, PATTERN RECOGNITION METHOD, AND COMPUTER PROGRAM PRODUCT
20180005087 · 2018-01-04 ·

According to an embodiment, a pattern recognition device is configured to divide an input signal into a plurality of elements, convert the divided elements into feature vectors having the same dimensionality to generate a set of feature vectors, and evaluate the set of feature vectors using a recognition dictionary including models corresponding to respective classes, to output a recognition result representing a class or a set of classes to which the input signal belongs. The models each include sub-models each corresponding to one of possible division patterns in which a signal to be classified into a class corresponding to the model can be divided into a plurality of elements. A label expressing a model including a sub-model conforming to the set of feature vectors, or a set of labels expressing a set of models including sub-models conforming to the set of feature vectors is output as the recognition result.

SYSTEMS AND METHODS FOR REPRESENTING AND SEARCHING CHARACTERS
20230230403 · 2023-07-20 ·

Methods and supporting systems for representing and searching characters, comprising: obtaining an image of the character, labelling a structure of the character by defining a plurality of nodes and a plurality of edges on the character in the image, and generating a representation of the character by extracting a set of two-dimensional coordinates to represent the plurality of nodes and by extracting a matrix to represent the plurality of edges, and providing the representation in a searchable database.

OPTICAL CHARACTER RECOGNITION SYSTEMS AND METHODS FOR PERSONAL DATA EXTRACTION

Methods and systems for extracting personal data from a sensitive document are provided. The system includes a document prediction module, a cropping module, a denoising module, and an optical character recognition (OCR) module. The document prediction module predicts type of document of the sensitive document using a keypoint matching-based approach and the cropping module extracts document shape and extracts one or more fields comprising text or pictures from the sensitive document. The denoising module prepares the one or more fields for optical character recognition, and the OCR module performs optical character recognition on the denoised one or more fields to detect characters in the one or more fields.

Method and system for converting font of Chinese character in image, computer device and medium

A method and a system for converting a font of a Chinese character in an image, a computer device and a medium are disclosed. A specific implementation of the method includes: acquiring a stroke of a to-be-converted Chinese character in the image and spatial distribution information of the stroke; and generating a Chinese character in a target font that corresponds to the to-be-converted Chinese character in the image according to the stroke of the to-be-converted Chinese character, the spatial distribution information of the stroke and standard stroke information of the target font, to replace the to-be-converted Chinese character.

INTELLIGENT BUILDING BLOCK-BASED CHINESE CHARACTER LEARNING SYSTEM
20230035696 · 2023-02-02 ·

An intelligent building block-based Chinese character learning system, relating to the technical field of intelligent building block-based Chinese character learning. Chinese characters are combined freely and conveniently. Moreover, Chinese characters combined by a child are combined with online courses, so that online courses are triggered by means of assembly by the child, and interactive Chinese character teaching is implemented. The system comprises: an instruction module, used for bearing Chinese characters and building block assembly of the Chinese characters; a master control board module, used for reading the Chinese characters assembled by the instruction modules, displaying the assembled Chinese characters, and synchronizing the assembled Chinese characters to a mobile phone or an iPad; and an online course module, used for reading the Chinese characters synchronized by the master control board module and searching for an online course.

TEXT RECOGNITION IN IMAGE
20230064122 · 2023-03-02 ·

According to implementations of the subject matter described herein, there is provided a solution for text recognition in an image. In this solution, a target text line area, which is expected to include a text to be recognized, is determined from an image. Probability distribution information of a character model element(s) present in the target text line area is determined using a single character model. The single character model is trained based on training text line areas and respective ground-truth texts in the training text line areas. Texts in the training text line areas are arranged in different orientations, and/or the ground-truth texts comprise texts are related to various languages (e.g., texts related to a Latin and an Eastern languages). The text in the target text line area can be determined based on the determined probability distribution information. The single character model enables more efficient and convenient text recognition.

SYSTEM TO IDENTIFY AUTHORSHIP OF HANDWRITTEN TEXT BASED ON INDIVIDUAL ALPHABETS

A device, method, and non-transitory computer readable medium are described. The method includes receiving a dataset including hand written Arabic words and hand written Arabic alphabets from one or more users. The method further includes removing whitespace around alphabets in the hand written Arabic words and the hand written Arabic alphabets in the dataset. The method further includes splitting the dataset into a training set, a validation set, and a test set. The method further includes classifying one or more user datasets from the training set, the validation set, and the test set. The method further includes identifying the target user from the one or more user datasets. The identification of the target user includes a verification accuracy of the hand written Arabic words being larger than a verification accuracy threshold value.

TEXT INDEPENDENT WRITER VERIFICATION METHOD AND SYSTEM

A device, method, and non-transitory computer readable medium are described. The method includes receiving a dataset including hand written Arabic words and hand written Arabic alphabets from one or more users. The method further includes removing whitespace around alphabets in the hand written Arabic words and the hand written Arabic alphabets in the dataset. The method further includes splitting the dataset into a training set, a validation set, and a test set. The method further includes classifying one or more user datasets from the training set, the validation set, and the test set. The method further includes identifying the target user from the one or more user datasets. The identification of the target user includes a verification accuracy of the hand written Arabic words being larger than a verification accuracy threshold value.

ON DEMAND TESTING AS A SERVICE FOR BASE TEXT DIRECTION VERIFICATION TESTING

Methods and systems for testing base text direction (BTD) include receiving one or more images captured by an end-user system. Each of the one or more images displays respective text test case information. Each of the one or more images is compared to a respective reference image associated with a respective text test case. It is determined whether the end-user system produces BTD errors based on the comparison in accordance with one or more BTD error rules.

Method for image text recognition, apparatus, device and storage medium

The present application discloses a method for image text recognition, an apparatus, a device, and a storage medium, and relates to image processing technologies in the field of cloud computing. A specific implementation is: acquiring an image to be processed, where at least one text line exists in the image to be processed; processing each text line in the image to be processed to obtain a composite encoded vector corresponding to each word in each text line, where the composite encoded vector carries semantic information and position information; and determining a text recognition result of the image to be processed according to the semantic information and the position information carried in the composite encoded vector corresponding to each word in each text line. This technical solution can accurately distinguish adjacent fields with small pixel spacing in the image and improve the accuracy of text recognition in the image.