Patent classifications
G06V30/166
TABLE DATA RECOVERING IN CASE OF IMAGE DISTORTION
The subject matter of this specification can be implemented in, among other things, a method that includes identifying edges of a section of a document in a source image that includes at least one row of text. The method includes identifying characters in the document. The method includes identifying word portions. The method includes generating polynomials that approximate points of the characters within the word portions. The method includes generating a second polynomial that approximates the points of the characters of word portions. The method includes identifying a stretching coefficient of the row of text based on a length of the section between the edges relative to a length of the second polynomial. The method includes mapping portions of the source image along the row of text to new positions in a corrected image based on the second polynomial and the stretching coefficient.
Systems and methods for mobile image capture and processing of checks
Techniques for processing an image of a check captured using a mobile device are provided. The check image is processed to determine whether the check can be deposited at a bank via a mobile deposit process. The system can identify regions of the check—such as the endorsement area—to determine if the check has been properly endorsed. The system can be implemented on a mobile device and/or a server, where the mobile device routes the check image to the server for processing. If the check cannot be deposited, a rejection is forwarded in real time to the mobile device for possible correction.
SYSTEMS AND METHODS FOR MOBILE AUTOMATED CLEARING HOUSE ENROLLMENT
Systems and methods for mobile enrollment in automated clearing house (ACH) transactions using mobile-captured images of financial documents are provided. Applications running on a mobile device provide for the capture and processing of images of documents needed for enrollment in an ACH transaction, such as a blank check, remittance statement and driver's license. Data from the mobile-captured images that is needed for enrolling in ACH transactions is extracted from the processed images, such as a user's name, address, bank account number and bank routing number. The user can edit the extracted data, select the type of document that is being captured, authorize the creation of an ACH transaction and select an originator of the ACH transaction. The extracted data and originator information is transmitted to a remote server along with the user's authorization so the ACH transaction can be setup between the originator's and receiver's bank accounts.
APPARATUS FOR PREDICTING METADATA OF MEDICAL IMAGE AND METHOD THEREOF
This disclosure relates to a computerized method to perform a machine learning on a relationship between medical images and metadata using a neural network and acquiring metadata by applying a machine learning model to medical images, and a method thereof. The apparatus and method may include training a prediction model for predicting metadata of medical images based on multiple medical images for learning and metadata matched with each of multiple medical images and predicting metadata of input medical image.
APPARATUS FOR PREDICTING METADATA OF MEDICAL IMAGE AND METHOD THEREOF
This disclosure relates to a computerized method to perform a machine learning on a relationship between medical images and metadata using a neural network and acquiring metadata by applying a machine learning model to medical images, and a method thereof. The apparatus and method may include training a prediction model for predicting metadata of medical images based on multiple medical images for learning and metadata matched with each of multiple medical images and predicting metadata of input medical image.
IMAGE PROCESSING APPARATUS, IMAGE PROCESSING METHOD, AND NON-TRANSITORY STORAGE MEDIUM
An image processing apparatus that generates an image for character recognition from a read image includes at least one memory that stores instructions, and at least one processor that executes the instructions to perform extracting of an area of handwritten character information and an area of printed character information from the read image, clipping of a partial image of the area of handwritten character information and a partial image of the area of printed character information out of the read image, and generating of the image for character recognition by combining the partial image of the area of handwritten character information and the partial image of the area of printed character information being associated with each other.
Systems and methods for mobile automated clearing house enrollment
Systems and methods for mobile enrollment in automated clearing house (ACH) transactions using mobile-captured images of financial documents are provided. Applications running on a mobile device provide for the capture and processing of images of documents needed for enrollment in an ACH transaction, such as a blank check, remittance statement and driver's license. Data from the mobile-captured images that is needed for enrolling in ACH transactions is extracted from the processed images, such as a user's name, address, bank account number and bank routing number. The user can edit the extracted data, select the type of document that is being captured, authorize the creation of an ACH transaction and select an originator of the ACH transaction. The extracted data and originator information is transmitted to a remote server along with the user's authorization so the ACH transaction can be setup between the originator's and receiver's bank accounts.
MULTI-DIRECTIONAL SCENE TEXT RECOGNITION METHOD AND SYSTEM BASED ON MULTI-ELEMENT ATTENTION MECHANISM
A method and a system of multi-directional scene text recognition based on multi-element attention mechanism are provided. The method includes: performing normalization processing for a text row/column image I output from an external text detection module by a feature extractor, extracting a feature for the normalized image by using a deep convolutional neural network to acquire an initial feature map F.sub.0, and adding a 2-dimensional directional positional encoding P to an initial feature map F.sub.0 in order to output a multi-channel feature map F; converting the multi-channel feature map F output from a feature extractor by an encoder into a hidden representation H; and converting the hidden representation H output from the encoder into a recognized text by a decoder and using the recognized text as the output result. The method and the system of multi-directional scene text recognition based on multi-element attention mechanism provided by the present invention are applied to multi-oriented scene text images including horizontal text, vertical text, and curved text etc., and have achieved high applicability.
Neural Network-based Optical Character Recognition
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for neural network-based optical character recognition. An embodiment of the system may generate a set of bounding boxes based on reshaped image portions that correspond to image data of a source image. The system may merge any intersecting bounding boxes into a merged bounding box to generate a set of merged bounding boxes indicative of image data portions that likely portray one or more words. Each merged bounding box may be fed by the system into a neural network to identify one or more words of the source image represented in the respective merged bounding box. The one or more identified words may be displayed by the system according to a standardized font and a confidence score.
SYSTEMS AND METHODS FOR MOBILE AUTOMATED CLEARING HOUSE ENROLLMENT
Systems and methods for mobile enrollment in automated clearing house (ACH) transactions using mobile-captured images of financial documents are provided. Applications running on a mobile device provide for the capture and processing of images of documents needed for enrollment in an ACH transaction, such as a blank check, remittance statement and driver's license. Data from the mobile-captured images that is needed for enrolling in ACH transactions is extracted from the processed images, such as a user's name, address, bank account number and bank routing number. The user can edit the extracted data, select the type of document that is being captured, authorize the creation of an ACH transaction and select an originator of the ACH transaction. The extracted data and originator information is transmitted to a remote server along with the user's authorization so the ACH transaction can be setup between the originator's and receiver's bank accounts.