G06V30/133

CLIENT SIDE FILTERING OF CARD OCR IMAGES

The technology of the present disclosure includes computer-implemented methods, computer program products, and systems to filter images before transmitting to a system for optical character recognition (OCR). A user computing device obtains a first image of the card from the digital scan of a physical card and analyzes features of the first image, the analysis being sufficient to determine if the first image is likely to be usable by an OCR algorithm. If the user computing device determines that the first image is likely to be usable, then the first image is transmitted to an OCR system associated with the OCR algorithm. Upon a determination that the first image is unlikely to be usable, a second image of the card from the digital scan of the physical card is analyzed. The optical character recognition system performs an optical character recognition algorithm on the filtered card.

PROVIDING IMPROVED OPTICAL CHARACTER RECOGNITION USING AN AUTOMATIC METRIC-BASED EVALUATION PLATFORM

Aspects of the disclosure relate to providing improved optical character recognition (OCR). An OCR evaluation platform may generate a script for evaluating OCR performance. The platform may generate modified resources by executing OCR applications to modify original resources. Based on executing the script, the platform may generate comparative analysis information based on comparing the modified resources to the original resource. The platform may generate metric scores based on the comparative analysis information. The metric scores may be used to generate visual representations of the performance of different OCR applications. The platform may generate weighted scores representing the performance of different OCR applications. The platform may identify a preferred OCR application for performing a particular operation. The platform may store correlations between preferred OCR applications and corresponding operations. The platform my cause execution of preferred OCR applications when performing corresponding operations, based on the stored correlations.

IDENTIFYING INVALID IDENTIFICATION DOCUMENTS

The method, system, and non-transitory computer-readable medium embodiments described herein provide for identifying invalid identification documents. In various embodiments, an application executing on a user device prompts the user device to transmit an image of the identification document. The application receives an image including the identification document in response to the identification document being within a field of view of a camera of the user device. The identification document includes a plurality of visual elements, and one or more visual elements of the plurality of visual elements are one or more invalidating marks. The application detects a predetermined pattern on the identification document in the image, the predetermined pattern formed from the one or more invalidating marks. The application determines that the identification document is invalid based on the detected predetermined pattern.

Systems and methods for measuring document legibility
12225168 · 2025-02-11 · ·

Disclosed embodiments may include a system for measuring document legibility. The system may automatically receive document image data from a user device. The system may then process the image data using optical character recognition to create language data containing a plurality of words. The system may then obtain an overall number by counting the plurality of words in the language data. The system may then identify and count the common words within the plurality of words by comparing the plurality of words to words in a database. A score may be obtained by dividing the common word number by the overall number. The score may then be compared to a legibility threshold. If the score is below the threshold, the system may determine the document is illegible. If the score is above the threshold, the system may determine the document is legible.

Client side filtering of card OCR images

The technology of the present disclosure includes computer-implemented methods, computer program products, and systems to filter images before transmitting to a system for optical character recognition (OCR). A user computing device obtains a first image of the card from the digital scan of a physical card and analyzes features of the first image, the analysis being sufficient to determine if the first image is likely to be usable by an OCR algorithm. If the user computing device determines that the first image is likely to be usable, then the first image is transmitted to an OCR system associated with the OCR algorithm. Upon a determination that the first image is unlikely to be usable, a second image of the card from the digital scan of the physical card is analyzed. The optical character recognition system performs an optical character recognition algorithm on the filtered card.

ADJUSTING DIFFERENT AREAS OF A PAYMENT INSTRUMENT IMAGE INDEPENDENTLY
20170098201 · 2017-04-06 ·

The present disclosure involves systems, software, and computer-implemented methods for allowing independent adjustment for different areas of a payment instrument image. An example method includes updating an image property of an area of a clearing payment instrument image associated with a tangible payment instrument including a payee, a payor, an amount, and an authorization, the tangible payment instrument to be submitted for electronic transaction clearing, wherein the clearing payment instrument image is associated with a first value of the image property, the image property of the area is updated to a second value different than the first value of the image property, and the area of the clearing payment instrument image includes less than the entire clearing payment image; and storing the updated clearing payment instrument image in response to updating the image property.

Processing of images during assessment of suitability of books for conversion to audio format
09613268 · 2017-04-04 · ·

A system to process graphical elements within a book during assessment of the book for suitability for conversion to an audio format includes an image classification subsystem, an image processing subsystem, and a weighting subsystem. The image classification subsystem is configured to classify a graphical element based on at least one of a context of the graphical element and properties of the graphical element. The image processing subsystem is configured process the graphical element to create a processed graphical element, the processing responsive to the classification of the graphical element. The weighting subsystem is configured to produce a weighting corresponding to the processed graphical element, the weighting indicating an impact of the graphical element of suitability of the book for conversion to the audio format.

Text extraction using optical character recognition

Provided herein are systems and methods for extracting text from a document. Different optical character recognition (OCR) tools are used to extract different versions of the text in the document. Metrics evaluating the quality of the extracted text are compared to identify and select higher quality extracted text. A selected portion of text is compared to a threshold to ensure minimal quality. The selected portion of text is then saved. Error correction can be applied to the selected portion of text based on errors specific to the OCR tools or the document contents.

Multiple input machine learning framework for anomaly detection

A method that includes extracting image features of a document image, executing an optical character recognition (OCR) engine on the document image to obtain OCR output, and extracting OCR features from the OCR output. The method further includes executing an anomaly detection model using features including the OCR features and the image features to generate anomaly score, and presenting anomaly score.

METHOD AND SYSTEM FOR ENSURING DUAL BAR CODE AUTHENTICATION OF DOCUMENTS

The embodiments of present disclosure herein address unresolved problems of a file path encryption, a manual quality check and a dual bar-code verification while scanning and transferring of the answer sheets to a server via a communication network. Embodiments herein provide a system and method for ensuring a dual bar code-based authentication of answer sheets. The system and method provide a multilevel security that is achieved by a programmatic scanning and validation. A metadata tagging, and manual quality check can be carried out by an operator. The system and method provide a file transfer over a Hypertext Transfer Protocol (HTTP) network and with the help of a blow-fish algorithm, media files path is stored in an encrypted manner. The system and method restrict any third-party entity to get access to the confidential data and ensure a single user authorization from scanning, monitoring to package creation and an operator management.