G06V30/133

FOCUS DETECTION METHOD, APPARATUS, AND ELECTRONIC DEVICE
20230094297 · 2023-03-30 ·

A focus detection method includes: acquiring an image of a test object through a to-be-tested image acquisition device, the test object including a character, and a clarity of the character corresponding to a minimum clarity with which a content of the character is still able to be recognized using a character recognition technology; performing character recognition on the image to obtain a recognition result; and determining a focus detection result for the to-be-tested image acquisition device based on the recognition result.

METHOD OF DETECTING PRINTING DEFECTS, COMPUTER DEVICE, AND STORAGE MEDIUM
20230093969 · 2023-03-30 ·

This application provides a method of detecting printing defects. The method includes obtaining a first image of each character in a reference image. A third image of each character is obtained based on the first image of each character, a fourth image of each character is obtained based on a second image of each character obtained from an image to be detected. Once a fifth image of each character is obtained based on the third image of each character, a sixth image of each character is obtained according to the fourth image and the fifth image of each character, a detection result of each character in the image to be detected is determined according to the fifth image and the sixth image of the each character.

SYSTEMS AND METHODS FOR DETECTION AND CORRECTION OF OCR TEXT
20230083000 · 2023-03-16 ·

OCR-text correction system and method embodiments are described. The OCR-text correction embodiments comprise or cooperate with a transformer-based sequence-to-sequence language model. The model is pretrained to denoise corrupted text and is fine-tuned using OCR-correction-specific examples. Text obtained at least in part through OCR is applied to the fine-tuned pretrained transformer model to detect at least one error in a subset of the text. Responsive to detecting the at least one error, the fine-tuned pretrained transformer model outputs an updated subset of the text to correct the at least one error.

METHOD AND SYSTEM FOR IDENTIFYING AND DETERMINING VALUATION OF CURRENCY
20230062007 · 2023-03-02 ·

A method and system is provided for determining the denomination and related data for a currency item using a personal computing device, such as a mobile phone. The device includes or is connected to an image capture device that is preferably a digital video camera. At least one image of a target currency item is captured then processed for image quality. A further processing of the image includes a coordinate mapping. A comparison is made between individual pixels of the processed image based on the assigned coordinate mapping with a database of reference currency images to determine the currency denomination. Additional processing of the currency image provides the date and other data regarding the target currency item. A market value for the target currency item is identified by reference to a valuation database using the data determined for the currency item.

INSPECTION APPARATUS, METHOD, AND NON-TRANSITORY STORAGE MEDIUM FOR INSPECTING PRINT PRODUCT
20230067117 · 2023-03-02 ·

An inspection apparatus performs dropout color processing on a first inspection area set for an image generated by reading a print product and then performs first recognition processing on the first inspection area, and further performs second recognition processing on a second inspection area set for the image and then performs an inspection of whether sufficient margin areas are allocated on the second inspection area, without performing dropout color processing.

AUTOMATED CATEGORIZATION AND PROCESSING OF DOCUMENT IMAGES OF VARYING DEGREES OF QUALITY
20230061725 · 2023-03-02 ·

An apparatus includes a memory and a processor. The memory stores a dictionary and a machine learning algorithm trained to classify text. The processor receives an image of a page, converts the image into a set of text, and identifies a plurality of tokens within the text. Each token includes one or more contiguous characters that are both preceded and followed by whitespace within the text. The processor identifies invalid tokens by removing tokens of the plurality of tokens that correspond to words of the dictionary. The processor calculates, based on a ratio of a total number of valid tokens to a total number of tokens, a score. In response to determining that the score is greater than a threshold, the processor applies the machine learning algorithm to classify the text into a category and stores the image and/or text in a database according to the category.

AUTOMATED LICENSE PLATE RECOGNITION SYSTEM AND RELATED METHOD

Systems, methods, devices and computer readable media for determining a geographical location of a license plate are described herein. A first image of a license plate is acquired by a first image acquisition device of a camera unit and a second image of the license plate is acquired by a second image acquisition device of the camera unit. A three-dimensional position of the license plate relative to the camera unit is determined based on stereoscopic image processing of the first image and the second image. A geographical location of the camera unit is obtained. A geographical location of the license plate is determined from the three-dimensional position of the license plate relative to the camera unit and the geographical location of the camera unit. Other systems, methods, devices and computer readable media for detecting a license plate and identifying a license plate are described herein.

DISPLAY CONTROL INTEGRATED CIRCUIT APPLICABLE TO PERFORMING REAL-TIME VIDEO CONTENT TEXT DETECTION AND SPEECH AUTOMATIC GENERATION IN DISPLAY DEVICE

A display control integrated circuit (IC) applicable to performing real-time video content text detection and speech automatic generation in a display device may include a pre-processing circuit, a character recognition circuit and a post-processing circuit. The pre-processing circuit may input a video signal to obtain a real-time video content carried by the video signal, and perform preliminary text detection on the real-time video content to generate a series of segmented character images to indicate a subtitle. The character recognition circuit may perform character recognition on the series of segmented character images to generate a series of characters, respectively. The post-processing circuit may perform vocabulary correction on the series of characters to selectively replace any erroneous character with a correct character to generate one or more vocabularies, for performing speech automatic generation.

Identifying invalid identification documents

The method, system, and non-transitory computer-readable medium embodiments described herein provide for identifying invalid identification documents. In various embodiments, an application executing on a user device prompts the user device to transmit an image of the identification document. The application receives an image including the identification document in response to the identification document being within a field of view of a camera of the user device. The identification document includes a plurality of visual elements, and one or more visual elements of the plurality of visual elements are one or more invalidating marks. The application detects a predetermined pattern on the identification document in the image, the predetermined pattern formed from the one or more invalidating marks. The application determines that the identification document is invalid based on the detected predetermined pattern.

IMAGE PROCESSING APPARATUS, IMAGE PROCESSING METHOD, AND NON-TRANSITORY COMPUTER-READABLE STORAGE MEDIUM STORING PROGRAM

An image processing apparatus comprises: an acquiring unit configured to acquire image data generated by reading an original; a specifying unit configured to specify an object region including a predetermined object in the image data; a removing unit configured to remove, from the object region, noise with a size smaller than a size specified by a first threshold; and a setting unit configured to set the first threshold, for each object region specified by the specifying unit, in accordance with the size of the object region.