Patent classifications
G06V30/26
SYSTEMS AND METHODS FOR DETECTION AND CORRECTION OF OCR TEXT
OCR-text correction system and method embodiments are described. The OCR-text correction embodiments comprise or cooperate with a transformer-based sequence-to-sequence language model. The model is pretrained to denoise corrupted text and is fine-tuned using OCR-correction-specific examples. Text obtained at least in part through OCR is applied to the fine-tuned pretrained transformer model to detect at least one error in a subset of the text. Responsive to detecting the at least one error, the fine-tuned pretrained transformer model outputs an updated subset of the text to correct the at least one error.
Character Restoration Method and Apparatus, Storage Medium, and Electronic Device
A character restoration method and apparatus, a storage medium, and an electronic device are provided. The character restoration method includes: determining a character identifier of a character in a text region, where the character identifier uniquely identifies the character; performing encoding at least according to the character identifier; and sending the encoded data to a receiving end, which decodes the encoded data and restores the character according to the character identifier obtained after decoding. In other words, only a small amount of information is encoded, and the character is restored from that information once it is decoded.
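The encode-and-restore flow in this abstract can be illustrated with a minimal Python sketch. It is not the patented method. It assumes the Unicode code point serves as the unique character identifier and that identifiers are packed as 4-byte big-endian integers; the function names `encode_characters` and `restore_characters` are hypothetical.

```python
import struct


def encode_characters(text: str) -> bytes:
    """Sender side: map each character to an identifier and pack the identifiers."""
    # Assumption: the Unicode code point is the unique character identifier.
    identifiers = [ord(ch) for ch in text]
    # Pack every identifier as an unsigned 4-byte big-endian integer.
    return struct.pack(f">{len(identifiers)}I", *identifiers)


def restore_characters(encoded: bytes) -> str:
    """Receiving end: decode the identifiers and restore the characters."""
    count = len(encoded) // 4
    identifiers = struct.unpack(f">{count}I", encoded)
    return "".join(chr(identifier) for identifier in identifiers)


if __name__ == "__main__":
    payload = encode_characters("OCR text")
    print(len(payload), restore_characters(payload))
```

Only the identifiers travel to the receiving end, so the payload size depends on the number of characters rather than on the size of the image region they came from.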
DISPLAY CONTROL INTEGRATED CIRCUIT APPLICABLE TO PERFORMING REAL-TIME VIDEO CONTENT TEXT DETECTION AND SPEECH AUTOMATIC GENERATION IN DISPLAY DEVICE
A display control integrated circuit (IC) applicable to performing real-time video content text detection and automatic speech generation in a display device may include a pre-processing circuit, a character recognition circuit and a post-processing circuit. The pre-processing circuit may receive a video signal to obtain real-time video content carried by the video signal, and perform preliminary text detection on the real-time video content to generate a series of segmented character images that indicate a subtitle. The character recognition circuit may perform character recognition on the series of segmented character images to generate a corresponding series of characters. The post-processing circuit may perform vocabulary correction on the series of characters, selectively replacing any erroneous character with a correct character, to generate one or more vocabulary words for automatic speech generation.
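The vocabulary-correction step of the post-processing circuit can be approximated in software. The Python sketch below is not the circuit described in the abstract. It assumes a small hypothetical word list (`VOCABULARY`) and swaps in fuzzy string matching from the standard-library `difflib` module as the correction technique; `correct_word` and the `cutoff` value are illustrative choices.

```python
from difflib import get_close_matches

# Hypothetical vocabulary; a real system would use a much larger word list.
VOCABULARY = ["subtitle", "speech", "display", "video"]


def correct_word(chars, vocabulary=VOCABULARY, cutoff=0.6):
    """Join recognized characters and replace the word with its closest vocabulary entry."""
    word = "".join(chars)
    if word in vocabulary:
        return word  # already a known word, nothing to correct
    matches = get_close_matches(word, vocabulary, n=1, cutoff=cutoff)
    # Keep the recognized word unchanged when no entry is similar enough.
    return matches[0] if matches else word


if __name__ == "__main__":
    # "1" misrecognized in place of "l".
    print(correct_word(list("subtit1e")))
```

Returning the original word when nothing clears the cutoff keeps the replacement selective, in line with the abstract's wording.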
OPTICAL CHARACTER RECOGNITION QUALITY EVALUATION AND OPTIMIZATION
A processor may receive an image and determine a number of foreground pixels in the image. The processor may obtain a result of optical character recognition (OCR) processing performed on the image. The processor may identify at least one bounding box surrounding at least one portion of text in the result and overlay the at least one bounding box on the image to form a masked image. The processor may determine a number of foreground pixels in the masked image and a decrease in the number of foreground pixels in the masked image relative to the number of foreground pixels in the image. Based on the decrease, the processor may modify an aspect of the OCR processing for subsequent image processing.
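The foreground-pixel comparison can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the patented implementation: the image is a binarized grid in which 1 marks a foreground pixel, bounding boxes are `(top, left, bottom, right)` tuples with exclusive lower-right corners, and overlaying a box is taken to mean filling it with background. The names `count_foreground`, `mask_boxes`, and `foreground_decrease` are hypothetical.

```python
def count_foreground(image):
    """Count foreground (non-zero) pixels in a binarized image given as rows of 0/1."""
    return sum(1 for row in image for pixel in row if pixel)


def mask_boxes(image, boxes):
    """Return a copy of the image with every bounding box filled with background."""
    masked = [row[:] for row in image]
    for top, left, bottom, right in boxes:
        for y in range(max(top, 0), min(bottom, len(masked))):
            for x in range(max(left, 0), min(right, len(masked[y]))):
                masked[y][x] = 0
    return masked


def foreground_decrease(image, boxes):
    """Fraction of foreground pixels that the OCR bounding boxes cover."""
    before = count_foreground(image)
    if before == 0:
        return 1.0  # nothing to recognize, so treat the image as fully covered
    after = count_foreground(mask_boxes(image, boxes))
    return (before - after) / before


if __name__ == "__main__":
    image = [
        [1, 1, 0, 0],
        [1, 1, 0, 1],
        [0, 0, 0, 1],
    ]
    # One box covers the 2x2 block in the top-left corner.
    print(foreground_decrease(image, [(0, 0, 2, 2)]))
```

A small decrease suggests that much of the foreground lies outside the recognized text, which is the kind of signal the abstract uses to modify later OCR processing. Any threshold for acting on it would be a tuning choice that the abstract does not specify.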
APPARATUS AND METHOD FOR DETERMINING AND TRACKING HANDWRITTEN TIP AMOUNTS
A system and method for determining a value of a handwritten monetary tip amount on a paper payment receipt are provided. One embodiment scans, using a scanner, a paper payment receipt having a handwritten monetary tip amount thereon; generates scan data that corresponds to the scanned paper payment receipt; identifies text from the scan data, wherein the identified text includes handwritten text and machine-printed text; discriminates the handwritten text from the machine-printed text; and determines a value of the handwritten monetary tip amount based on the identified handwritten text.
Handwriting input display apparatus, handwriting input display method and recording medium storing program
A handwriting input display apparatus causes display means to display a stroke generated by an input made by using input means to a screen as a handwritten object. The apparatus includes display control means for causing the display means to display character string candidates including a handwriting recognition candidate when the handwritten object does not change for a predetermined time. When the handwriting recognition candidate is selected, the display control means causes the display means to erase a display of the character string candidates and a display of the handwritten object, and causes the display means to display a character string object at a position where the erased handwritten object was displayed. When selection of the handwriting recognition candidate is not performed for a predetermined time and the display of the character string candidates is erased, the display control means causes the handwritten object to be kept displayed.
INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING SYSTEM, AND NON-TRANSITORY COMPUTER READABLE MEDIUM
An information processing apparatus includes a processor configured to perform processing for displaying character information recognized by reading plural forms, in a descending or ascending order of the number of pieces of character information recognized as being identical.
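The ordering described in this abstract can be sketched in Python with the standard-library `collections.Counter`. The sketch assumes the recognized character information arrives as one string per form; `order_by_frequency` is a hypothetical name and the sample values are illustrative only.

```python
from collections import Counter


def order_by_frequency(recognized, descending=True):
    """Group identical recognized strings and order the groups by their counts."""
    counts = Counter(recognized)
    # sorted() is stable, so groups with equal counts keep first-seen order.
    return sorted(counts.items(), key=lambda item: item[1], reverse=descending)


if __name__ == "__main__":
    # One recognized value per form (illustrative data).
    values = ["Tokyo", "Osaka", "Tokyo", "Kyoto", "Tokyo", "Osaka"]
    for text, count in order_by_frequency(values):
        print(count, text)
```

Passing `descending=False` gives the ascending order that the abstract also mentions.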