Patent classifications
G06V30/1478
Image processing device, image processing method, and computer program product
According to an embodiment, an image processing device includes a memory, and one or more hardware processors configured to function as a receiving unit, a specifying unit, and a detecting unit. The receiving unit receives input information input to an image. The specifying unit specifies the position of the input information. The detecting unit detects a character string having a smaller distance to the position than another character string, from the image.
MULTIFUNCTION PERIPHERAL ASSISTED OPTICAL MARK RECOGNITION USING DYNAMIC MODEL AND TEMPLATE IDENTIFICATION
A system for multifunction peripheral assisted optical mark recognition uses a scanner to scan at least one printed page to generate a scanned image of the at least one printed page and to optically detect a presence of a visible label on the scanned image. Then, the multifunction peripheral may extract a model identification and a template identification from the visible label, select a template, identifying locations from which image data is to be extracted from the scanned image, select a model, specifically identifying at least two types of acceptable marks within the image data to be extracted from the scanned image, and perform optical mark recognition on the location from which image data is to be extracted identified by the template using the at least two types of acceptable marks identified by the model to extract useful data from the image data.
Extracting card data from multiple cards
Extracting financial card information with relaxed alignment comprises a method to receive an image of a card, determine one or more edge finder zones in locations of the image, and identify lines in the one or more edge finder zones. The method further identifies one or more quadrilaterals formed by intersections of extrapolations of the identified lines, determines an aspect ratio of the one or more quadrilateral, and compares the determined aspect ratios of the quadrilateral to an expected aspect ratio. The method then identifies a quadrilateral that matches the expected aspect ratio and performs an optical character recognition algorithm on the rectified model. A similar method is performed on multiple cards in an image. The results of the analysis of each of the cards are compared to improve accuracy of the data.
Image processing
An example method is of image processing provided in according with one implementation of the present disclosure. The method includes receiving an image, placing a window across the image, and computing a set of all occurring grayscale values within the window. The method further includes computing a threshold value based on the set of all occurring grayscale values within the window and determining an output pixel value of at least one pixel from the window based on the threshold value.
System and method for importing scanned construction project documents
A system and method for efficiently importing scanned construction project documents (e.g., digital images of physical documents) is disclosed. The method includes receiving a digital image of a document and performing a first text recognition operation on a first portion of the digital image. The method includes in response to determining, based on the first text recognition operation, that the first portion does not include machine-readable text, generating a modified image of the document by performing an image modification operation. The image modification operation may include an orientation operation. The method further includes storing the modified image of the document in a database. The image modification operation may also include a de-skewing operation and an alignment operation.
Method and apparatus for transformation of dot text in an image into stroked characters based on dot pitches
A method and apparatus for determining orientation and dot pitch of characters in an image. A statistical neighborhood of a set of dots of an image is determined. The statistical neighborhood includes a set of points and each point is associated with a position and a statistical measure indicative of a likelihood that one or more dots that satisfy a shape and a size criteria are located at that position. A Fast Fourier Transform (FFT) is computed across the set of points of the statistical neighborhood; and based on the FFT of the set of points, a first orientation and a first distance between adjacent dots of characters along the first orientation, and a second orientation and a second distance between adjacent dots of the characters along the second orientation are determined.
RECONSTRUCTING DOCUMENT FROM SERIES OF DOCUMENT IMAGES
Systems and methods for reconstructing a document from a series of document images. An example method comprises: receiving a plurality of image frames, wherein each image frame of the plurality of image frames contains at least a part of an image of an original document; identifying a plurality of visual features in the plurality of image frames; performing spatial alignment of the plurality of image frames based on matching the identified visual features; splitting each of the plurality of image frames into a plurality of image fragments; identifying one or more text-depicting image fragments among the plurality of image fragments; associating each identified text-depicting image fragment with an image frame in which that image fragment has an optimal value of a pre-defined quality metric among values of the quality metric for that image fragment in the plurality of image frames; and producing a reconstructed image frame by blending image fragments from the associated image frames.
SYSTEM AND METHOD FOR PREPROCESSING IMAGES TO IMPROVE OCR EFFICACY
A system to preprocess images to increase accuracy of optical character recognition (OCR) includes a processor, and a memory coupled to the processor. The processor is configured to scan an electronically stored representation of a whole or partial document, identify an image in the electronically stored representation, and recognize row-based text within the electronically stored representation. In addition, the processor is configured to align the row-based text vertically, generate a resultant electronically stored representation of the whole or partial document having the row-based text aligned, and save the resultant electronically stored representation for subsequent OCR processing. The electronically stored representation of the whole or partial document may contain at least one image having a JPG, TIF, GIF, PNG, or BMP, type of format.
Document reorientation processing
Video frames of a document are captured. A current orientation mode of a device having a camera is determined based on the video frames. An optimal orientation mode for capturing a document image is determined. Guided instructions assist in placing the device in the optimal orientation mode and when the document is centered in a lens of the camera, the document image is taken by the camera.
PERSISTENT FEATURE BASED IMAGE ROTATION AND CANDIDATE REGION OF INTEREST
Embodiments of a system and method for sorting and delivering articles in a processing facility based on image data are described. Image processing results such as rotation notation information may be included in or with an image to facilitate downstream processing such as when the routing information cannot be extracted from the image using an unattended system and the image is passed to an attended image processing system. The rotation notation information may be used to dynamically adjust the image before presenting the image via the attended image processing system.