Patent classifications
G06V30/18086
METHOD AND SYSTEM OF CAPTURING AN IMAGE OF A CARD
A method and a system of capturing an image of a card having a magnetic stripe is provided. The method includes obtaining a first image by an imaging device of the card, obtaining a plurality of images of the card via color delta analysis, and obtaining a third image of the card by comparing the first and the plurality of images.
User terminal device and method for controlling the same
A user terminal device and a method for controlling the same are provided. The user terminal device includes a sensor configured to sense a user touch operation for a binarized text image, a controller configured to generate an indicator pointing out a point where the user touch operation is sensed, when the user touch operation is sensed by the sensor, and a display unit configured to display the binarized text image and the generated indicator.
Model based document image enhancement
Systems and methods are disclosed for model based document image enhancement. Instead of requiring paired dirty and clean images for training a model to clean document images (which may cause privacy concerns), two models are trained on the unpaired images such that only the dirty images are accessed or only the clean images are accessed at one time. One model is a first implicit model to translate the dirty images from a source space to a latent space, and the other model is a second implicit model to translate the images from the latent space to clean images in a target space. The second implicit model is trained based on translating electronic document images in the target space to the latent space. In some implementations, the implicit models are diffusion models, such as denoising diffusion implicit models based on solving ordinary differential equations.
Determination of a convolutional neural network (CNN) for automatic target recognition in a resource constrained environment
Methods and structures are presented for implementing an automatic target recognition system as a convolutional neural network (CNN) in a satellite or other environment with constrained resources, such as limited memory capacity and limited processing capability. For example, this allows for the automatic target recognition to be implemented on a field programmable gate array (FPGA). Image data is split into subsets of contiguous pixels, with the subsets processed in parallel in a CNN of a corresponding processing node using quantized weight values that are determined in a training process that accounts for the constraints of the automatic target recognition system. The results of the automatic target recognition process is based on the combined output of the processing nodes.
Methods and systems for accurately recognizing vehicle license plates
Systems can be configured for detecting license plates and recognizing characters in license plates. In an example, a system can receive an image and identify one or more regions in the image that include a license plate. Character recognition can be performed in the one or more regions to determine contents of a candidate license plate. Location-specific information about a license plate format can be used together with the determined contents of the candidate license plate to determine if the recognized characters are valid.
IMAGE READING SYSTEMS, METHODS AND STORAGE MEDIUM FOR PERFORMING GEOMETRIC EXTRACTION
Geometric extraction is performed on an unstructured document by recognizing textual blocks on at least a portion of a page of the unstructured document, generating bounding boxes that surround and correspond to the textual blocks, determining search paths having coordinates of two endpoints and connecting at least two bounding boxes, and generating a graph representation of the at least a portion of the page, the graph representation including the plurality of textual blocks, the coordinates of the vertices of each bounding box and the coordinates of the two endpoints of each search path.
IMAGE PROCESSING APPARATUS AND IMAGE PROCESSING METHOD
An image processing apparatus determines whether each of pixels in a scanned image is a color pixel or a monochrome pixel, determines whether each of blocks including the multiple pixels in the scanned image is a color block or a monochrome block, based on determination results of the respective pixels, and determines that the scanned image is a color image in a case where an arrangement pattern of blocks determined to be color blocks matches a predetermined pattern.
CHARACTER RECOGNITION DEVICE, CHARACTER RECOGNITION METHOD, AND CHARACTER RECOGNITION PROGRAM
A character recognition device includes an acquisition means configured to acquire an image containing a character region, a first recognition means configured to recognize a character from the character region by a first recognition method, a setting means configured to set reference lines along an alignment direction of the characters and passing through a specified position in each character, a second recognition means configured to recognize a character by a second recognition method, the second recognition method being a method that recognizes a character from an image within a recognition window by scanning in a recognition target region in an image while changing a size of the recognition window, and configured to set a position or a height in a vertical direction of the recognition window based on the reference lines, and an output means configured to output a word composed of characters recognized by the second recognition means.
Method for generating a plurality of sets of training image data for training machine learning model
A method for generating a plurality of sets of training image data for training a machine learning model includes: (a) acquiring object image data representing an object image; (b) dividing the object image into T number of partial object images by dividing a region of the object image into T number of partial regions corresponding to respective ones of T number of partial color ranges; (c) generating a plurality of sets of color-modified object image data representing respective ones of a plurality of color-modified object images by performing an adjustment process on the object image data, the adjustment process including a color modification process to modify colors of at least one of the T number of partial object images; and (d) generating the plurality of sets of training image data using one or more sets of background image data and the plurality of sets of color-modified object image data.
Method and system of capturing an image of a card
A method and a system of capturing an image of a card having a magnetic stripe is provided. The method includes obtaining a first image by an imaging device of the card, obtaining a plurality of images of the card via color delta analysis, and obtaining a third image of the card by comparing the first and the plurality of images.