Patent classifications
G06V30/1444
Graphical user interface created via inputs from an electronic document
A computing device receives a request to render a listing of item entries on a graphical user interface. The computing device receives an electronic image of the document, analyzes the electronic image, and determines a document type by performing an image recognition on a first portion the electronic image, comparing information extrapolated via the image recognition algorithm to a database of document types, and identifying a match between the extrapolated information a document type. The computing device applies an OCR algorithm that corresponds to the determined document type to a second portion of the electronic image, and identifies items extracted from the second portion. The computing device renders the listing of item entries on the graphical user interface of the user computing device, the listing of items comprising a listing of each item extracted from the second portion of the electronic image.
Systems and methods for managing documents containing one or more hyper texts and related information
According to aspects illustrated herein, a method for preserving one or more hyperlinks while printing a document is disclosed. The method includes receiving the document containing one or more hyper texts, wherein each hyper text is associated with a corresponding hyperlink. The document is parsed to extract the one or more hyper texts. Then information related to the one or more hyper texts is identified and extracted, the information includes a hyper text, a hyperlink corresponding to the hyper text, a page number of the hyper text and an ordinal number of occurrence of the hyper text on the page number. An index page including the information related to the one or more hyper texts is created. Finally, the index page along with the document is printed, the index page includes the one or more hyper texts and information related to the one or more hyper texts.
System for verifying the identity of a user
A system receives an image including a live facial image of the user and an identity document including a photograph of the user. Moreover, the system calculates a facial match score by comparing facial features in the live facial image to facial features in the photograph. The system recognizes data objects and characters in the identity document using optical character recognition (OCR) and computer vision, and then identifies, based on the recognized data objects and characters, a type of the identity document. Further, the system calculates a document validity score by comparing the recognized characters and data objects to character strings and data objects known to be present in the identified type of the identity document. Additionally, the system determines and outputs the user's identity verification status based on comparing the facial match score to a facial match threshold and comparing the document validity score to a document validity threshold.
Apparatus, method, and storage medium for setting information related to scanned image
An apparatus of the invention determines whether or not new scanned image data is similar to past scanned image data based on character string areas and a table area extracted from the new scanned image data, specifies a character string area used to obtain information set to the past scanned image data determined to be similar, detects a target area as a processing target out of the character string areas extracted from the new scanned image data based on the specified character string area, the table included in the past scanned image data determined to be similar, and the table included in the new scanned image data, performs character recognition processing on the detected target area, and sets information to the new scanned image data by using a character obtained as a result of the character recognition processing.
INFORMATION PROCESSING APPARATUS AND NON-TRANSITORY COMPUTER READABLE MEDIUM
An information processing apparatus includes a processor. The processor is configured to receive first image data, and generate, by processing corresponding to information represented in the first image data and corresponding to specific information other than information of a deletion target out of the information represented in the first image data, second image data not representing the information of the deletion target out of the information represented in the first image data but representing the information other than the information of the deletion target.
System and method for processing and identifying content in form documents
The present disclosure generally provides a system and method for processing and identifying data in form. The system and method may distinguish between content data and background data in a form. In some aspects, the content data or background data may be removed, wherein the remaining data may be processed separately. Removal of the background data or the content data may allow for more effective or efficient character recognition of the data. In some embodiments, data may be processed on an element basis, wherein each element of the form may be labeled as background data, content data, noise, or combinations thereof. This system and method may significantly increase the ability to capture and extract relevant information from a form.
SYSTEM AND A METHOD FOR DEVELOPING A TOOL FOR AUTOMATED DATA CAPTURE
The present invention discloses a system and a method for developing a tool for automated data capture. In particular the present invention provides for extracting document records associated with each historical enterprise-document based on a classification of historical enterprise-documents. Further, a meta-data for each historical enterprise-document and corresponding document records is generated. A plurality of data point representation lists are generated based on each document record. A representation template for each historical enterprise-document is generated based on the corresponding meta-data and data representation list. Further, data point identification models are generated for each category of historical documents using plurality of historical enterprise documents of respective category and the corresponding representation templates. Finally, data capture rules for capturing data value associated with data points in each incoming enterprise-document are generated within data point identification model. The generated models are implemented by the tool of the present invention for automated data capture.
DEVICE, PROCESS AND SYSTEM FOR RISK MITIGATION
A system for a computer useable medium, the system having a set of executable code is provided including a first set of computer program code adapted to receive at least a portion of a document comprising at least one classifiable distinct marker, a second set of computer program code adapted to analyze the distinct marker and assign a classifier thereto, and a third set of computer program code adapted to assess the potential risk of the distinct marker and calculate a first risk value associated with the distinct marker as it relates to the classifier and display the first risk value to a user of the system.
INFORMATION PROCESSING DEVICE AND NON-TRANSITORY COMPUTER READABLE MEDIUM
An information processing device includes a processor configured to output an extracted character string entry rule for each item of a form in a case where a regularity related to an entry of a character string of a confirmation result is extracted, the confirmation result being a result of confirming a result of character recognition performed on the form.
IMAGE PROCESSING APPARATUS, IMAGE PROCESSING SYSTEM, AND STORAGE MEDIUM
An image processing apparatus includes a reading unit configured to generate image data by reading an original, a reception unit configured to receive selection of a stored file from a user, an acquisition unit configured to acquire character information from the image data generated by the reading unit, and an execution unit configured to perform processing for inserting the character information acquired by the acquisition unit into the selected file.