Patent classifications
G06V30/18105
Mobile terminal, image processing method, and computer-readable recording medium
A mobile terminal includes a memory, and a processor coupled to the memory, wherein the processor is configured to execute first acquiring a frame obtained through photographing, second acquiring document image data of a document from the frame, first determining whether a form partial feature in a registered form and a document partial feature at a position corresponding to a position of the partial feature match, the document partial feature being in the document, and third acquiring a frame obtained through re-photographing when it is determined that the form partial feature and the document partial feature do not match.
OBJECT DETECTION AND IMAGE CROPPING USING A MULTI-DETECTOR APPROACH
Computer-implemented methods for detecting objects within digital image data based on color transitions include: receiving or capturing a digital image depicting an object; sampling color information from a first plurality of pixels of the digital image, wherein each of the first plurality of pixels is located in a background region of the digital image; optionally sampling color information from a second plurality of pixels of the digital image, wherein each of the second plurality of pixels is located in a foreground region of the digital image; assigning each pixel a label of either foreground or background using an adaptive label learning process; binarizing the digital image based on the labels assigned to each pixel; detecting contour(s) within the binarized digital image; and defining edge(s) of the object based on the detected contour(s). Corresponding systems and computer program products configured to perform the inventive methods are also described.
OBJECT DETECTION AND IMAGE CROPPING USING A MULTI-DETECTOR APPROACH
According to an exemplary embodiment, a method for pre-cropping digital image data includes: dividing the digital image into segments; computing a color value distance between corresponding pixels of neighboring segments of the digital image; comparing the color value distance(s) against a minimum color distance threshold; clustering neighboring segments having a color value distance less than or equal to the minimum color distance threshold; computing a connected structure based on the clustered segments; computing a polygon bounding the connected structure; comparing a fraction of segments included in the connected structure and the polygon, relative to a total number of segments in the digital image, to a minimum included segment threshold; and in response to determining the fraction of segments in the connected structure and the polygon, relative to the total number of segments meets or exceeds a minimum included segment threshold, cropping the digital image based on edges of the polygon.
DOCUMENT OPTICAL CHARACTER RECOGNITION
Vehicles and other items often have corresponding documentation, such as registration cards, that includes a significant amount of informative textual information that can be used in identifying the item. Traditional OCR may be unsuccessful when dealing with non-cooperative images. Accordingly, features such as dewarping, text alignment, and line identification and removal may aid in OCR of non-cooperative images. Dewarping involves determining curvature of a document depicted in an image and processing the image to dewarp the image of the document to make it more accurately conform to the ideal of a cooperative image. Text alignment involves determining an actual alignment of depicted text, even when the depicted text is not aligned with depicted visual cues. Line identification and removal involves identifying portions of the image that depict lines and removing those lines prior to OCR processing of the image.
SESSION TRIAGE AND REMEDIATION SYSTEMS AND METHODS
A computer system is provided. The computer system includes a memory and at least one processor coupled to the memory. The at least one processor is configured to scan session data representative of operation of a user interface comprising a plurality of user interface elements; detect, at a point in the session data, at least one changed element within the plurality of user interface elements; classify, in response to detecting the at least one changed element, the at least one changed element as either indicating or not indicating an error; store an association between the error and the point in the session data; and provide access to the point in the session data via the association.
Object detection and image cropping using a multi-detector approach
Computer-implemented methods for detecting objects within digital image data based on color transitions include: receiving or capturing a digital image depicting an object; sampling color information from a first plurality of pixels of the digital image; optionally sampling color information from a second plurality of pixels of the digital image; generating or receiving a representative background color profile based on the color information sampled from the first plurality of pixels; generating or receiving a representative foreground color profile based on the color information sampled from the second plurality of pixels and/or the first plurality of pixels; assigning each pixel a label; binarizing the digital image based on the labels; detecting contour(s) within the binarized digital image; and defining edge(s) of the object based on the detected contour(s). Corresponding systems and computer program products configured to perform the inventive methods are also described.
Monitoring mobile device usage
A first image of a mobile device screen is recorded into memory of the mobile device. The first image includes at least one icon that represents an application installed on the mobile device. A second image of the mobile device screen is recorded into the memory of the mobile device. A graphical change in an area of the mobile device screen corresponding to a position of the icon is detected by comparing at least a portion of the second image to the first image. The graphical change results from a user selection of the icon to activate the application represented by the icon. In response to detecting the graphical change, determine an identifier of the application represented by the icon. Send a record of the user selection of the icon to a collection server. The record includes at least the identifier of the application.
IMAGE BOX FILTERING FOR OPTICAL CHARACTER RECOGNITION
A method for box filtering includes obtaining, by a computing device, a form image, and identifying, by the computing device, a region of the form image that includes boxes. Vertical lines in the region of the form image are detected. The boxes in the region are detected according to the plurality of vertical lines, and image content is extracted from the boxes.
Method and system for preparing text images for optical-character recognition
The current document is directed to methods and systems that acquire an image containing text with curved text lines to generate a corresponding corrected image in which the text lines are straightened and have a rectilinear organization. The method may include identifying a page sub-image within the text-containing image, generating a text-line-curvature model for the page sub-image that associates inclination angles with pixels in the page sub-image, generating local displacements, using the text-line-curvature model, for pixels in the page sub-image, and transferring pixels from the page sub-image to a corrected page-sub-image using the local displacements to construct a corrected page sub-image in which the text lines are straightened and in which the text characters and symbols have a rectilinear arrangement.
Method of digitizing and extracting meaning from graphic objects
Using a convolutional neural network, a method for digitizing and extracting meaning from graphic objects such as bar and pie charts, decomposes a chart into its sub-parts (pie and slices or bars, axes and legends) with significant tolerance to the wide range of variations in shape and relative position of pies, bars, axes and legends. A linear regression calibration allows properly reading values even when there are many OCR failures.