G06V30/19147

METHOD AND APPARATUS FOR PRESENTING CANDIDATE CHARACTER STRING, AND METHOD AND APPARATUS FOR TRAINING DISCRIMINATIVE MODEL
20220351085 · 2022-11-03 ·

A method and an apparatus for presenting a candidate character string and a method and an apparatus for training a discriminative model are provided. The method for presenting a candidate character string may include: acquiring a character string of alphabetic words inputted by a user; determining a character transition weight of the character string, the character transition weight being used to characterize a transition probability of a character; generating a sort weight corresponding to the character string based on a pre-acquired basic weight matching the character string and the character transition weight; and selecting, according to an order indicated by the sort weight, at least two candidate character strings matching the character string from a pre-acquired candidate character string set for presentation.

Digital image processing

A computer-implemented method for processing a digital image. The digital image comprises one or more text cells, wherein each of the one or more text cells comprises a string and a bounding box. The method comprises receiving the digital image in a first format, the first format providing access to the strings and the bounding boxes of the one more text cells. The methods further comprises encoding the strings of the one or more text cells as visual pattern according to a predefined string encoding scheme and providing the digital image in a second format. The second format comprises the visual pattern of the strings of the one or more text cells. A corresponding system and a related computer program product is provided.

Ground truth generation for image segmentation

A method, system and computer program product to generate a training data set for image segmentation applications, comprising providing a set of input documents of a first format. The input documents each comprise one or more pages. The input documents are split into individual document pages and parsed. Parsing comprises identifying a predefined set of items including position information of the position of the predefined set of items in the individual document pages; generating a bitmap image of a second format for each individual document page of the first format. The bitmap image comprises a predefined number of pixels. A mask is generated for each individual document. The mask comprises the predefined number of pixels of the corresponding bitmap image. Generating the mask comprises assigning an encoded class label to each pixel of the mask based on the position information of identified items of the predefined set of items.

IMAGE PROCESSING SYSTEM AND IMAGE PROCESSING METHOD
20230029990 · 2023-02-02 ·

An image processing system according to the present embodiment acquires a processing target image read from an original that is handwritten and specifies one or more handwritten areas included in the acquired processing target image. In addition, for each specified handwritten area, the present image processing system extracts from the processing target image a handwritten character image and a handwritten area image indicating an approximate shape of a handwritten character. Furthermore, for a handwritten area including a plurality of lines of handwriting among the specified one or more handwritten areas, a line boundary of handwritten characters is determined from a frequency of pixels indicating a handwritten area in a line direction of the handwritten area image, and a corresponding handwritten area is separated into each line.

Manual curation tool for map data using aggregated overhead views

Examples disclosed herein may involve (i) obtaining a first layer of map data associated with sensor data capturing a geographical area, the first layer of map data comprising an aggregated overhead-view image of the geographical area, where the aggregated overhead-view image is generated from aggregated pixel values from a plurality of images associated with the geographical area, (ii) obtaining a second layer of map data, the second layer of map data comprising label data for the geographical area derived from the aggregated overhead-view image of the geographical area, and (iii) causing the first layer of map data and the second layer of map data to be presented to a user for curation of the label data.

SEMANTIC CONCEPT MATCHING USING ENHANCED WEAK SUPERVISION
20230092447 · 2023-03-23 ·

A method augments data labels of a machine learning task. The method includes applying at least one labeling function for each of a plurality of pairs of semantic concepts and producing a labeling matrix, computing pairwise similarity scores using similarity metrics for each of the pairs of semantic concepts, augmenting the labeling matrix using the pairwise similarity scores to increase a density of the labeling matrix, and inputting the labeling matrix to a label aggregator to apply a single label for each of the pairs of semantic concepts labeled by the at least one labeling function.

METHOD AND APPARATUS FOR RECOGNIZING TEXT, DEVICE AND STORAGE MEDIUM

The present disclosure provides a method and apparatus for recognizing a text, a device and a storage medium, and relates to the field of deep learning technology. A specific implementation comprises: receiving a target image; performing a text detection on the target image using a pre-trained lightweight text detection network, to obtain a text detection box; and recognizing a text in the text detection box using a pre-trained lightweight text recognition network, to obtain a text recognition result.

Method and apparatus for evaluating article value based on artificial intelligence, and storage medium

The present disclosure provides a method and apparatus for evaluating article value based on artificial intelligence, and a storage medium. The solution of present disclosure may be employed to pre-mine high-quality articles and low-quality articles as training data, and train according to the training data to obtain a value-scoring model. As such, value evaluation needs to be performed for the to-be-evaluated article, it is feasible to first perform feature extraction for the to-be-evaluated article, determine a score of the to-be-evaluated article based on the extracted features and the value-scoring model, and thereby implement effective evaluation of the article value.

SYSTEMS AND METHODS FOR DETECTION AND CORRECTION OF OCR TEXT
20230083000 · 2023-03-16 ·

OCR-text correction system and method embodiments are described. The OCR-text correction embodiments comprise or cooperate with a transformer-based sequence-to-sequence language model. The model is pretrained to denoise corrupted text and is fine-tuned using OCR-correction-specific examples. Text obtained at least in part through OCR is applied to the fine-tuned pretrained transformer model to detect at least one error in a subset of the text. Responsive to detecting the at least one error, the fine-tuned pretrained transformer model outputs an updated subset of the text to correct the at least one error.

Machine learning-based text recognition system with fine-tuning model

A non-transitory processor-readable medium stores instructions to be executed by a processor. The instructions cause the processor to receive a first trained machine learning model that generates a transcription based on a document. The instructions cause the processor to execute the first trained machine learning model and a second trained machine learning model to generate a refined transcription based on the transcription. The instructions cause the processor to execute a quality assurance program to generate a transcription score based on the document and the transcription. The instructions cause the processor to execute the quality assurance program to generate a refined transcription score based on the refined transcription and at least one of the document or the transcription. The at least one refined transcription score indicates an automation performance better than an automation performance for the at least one transcription score.