Patent classifications
G06V30/416
SYSTEM AND METHOD FOR FORMAT-AGNOSTIC DOCUMENT INGESTION
A system for format-agnostic document ingestion including a document ingestion server and a database is disclosed. The server is configured to receive an image of a document comprising text in an unknown format and to convert the image, using OCR, into a plurality of text elements, each having a content, a size, and an absolute position. The server is also configured to retrieve data detectors from the database, each associated with a data type anticipated to be in the document, and comprising at least one identifier and direction, and at least one validation criterion. The server is also configured to identify a potential descriptor by comparing the content of each text element with the at least one identifier, and then determine whether the text element pointed to by the data detector meets the validation criterion. Finally, the server is configured to associate the validated text element with the data detector and store the content.
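The detector flow in the abstract above can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `TextElement`, `DataDetector`, `find_target`, and `extract` are hypothetical, the OCR step is assumed to have already produced positioned text elements, the direction is limited to "right" or "below", and the validation criterion is assumed to be a regular expression.

```python
import re
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class TextElement:
    """One OCR text element: content, absolute position, and size (assumed layout)."""
    content: str
    x: float       # absolute left edge
    y: float       # absolute top edge
    width: float
    height: float


@dataclass
class DataDetector:
    """Hypothetical detector: identifiers, a direction, and a regex validation criterion."""
    data_type: str
    identifiers: List[str]
    direction: str      # "right" or "below" (assumption; other directions omitted)
    validation: str     # regular expression the target content must fully match


def find_target(descriptor: TextElement, elements: List[TextElement],
                direction: str) -> Optional[TextElement]:
    """Return the nearest element in the given direction from the descriptor, if any."""
    candidates = []
    for e in elements:
        if e is descriptor:
            continue
        if direction == "right":
            # Roughly the same row, and positioned to the right of the descriptor.
            if abs(e.y - descriptor.y) <= descriptor.height and e.x > descriptor.x:
                candidates.append((e.x - descriptor.x, e))
        elif direction == "below":
            # Roughly the same column, and positioned below the descriptor.
            if abs(e.x - descriptor.x) <= descriptor.width and e.y > descriptor.y:
                candidates.append((e.y - descriptor.y, e))
    if not candidates:
        return None
    return min(candidates, key=lambda c: c[0])[1]


def extract(elements: List[TextElement], detectors: List[DataDetector]) -> Dict[str, str]:
    """Associate validated text content with each detector's data type."""
    results: Dict[str, str] = {}
    for det in detectors:
        for el in elements:
            # A potential descriptor is any element whose content contains an identifier.
            if not any(i.lower() in el.content.lower() for i in det.identifiers):
                continue
            target = find_target(el, elements, det.direction)
            if target is not None and re.fullmatch(det.validation, target.content):
                results[det.data_type] = target.content
                break
    return results


elements = [
    TextElement("Invoice No.", 10, 10, 80, 12),
    TextElement("INV-1042", 100, 10, 60, 12),
    TextElement("Date", 10, 40, 30, 12),
    TextElement("2021-03-15", 10, 60, 70, 12),
]
detectors = [
    DataDetector("invoice_number", ["invoice no", "invoice #"], "right", r"INV-\d+"),
    DataDetector("invoice_date", ["date"], "below", r"\d{4}-\d{2}-\d{2}"),
]
result = extract(elements, detectors)
print(result)
```

Because the detectors search by identifier and relative direction rather than by fixed coordinates, the same detector list can be applied to layouts that place the fields in different locations.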
MULTI-PAGE DOCUMENT RECOGNITION IN DOCUMENT CAPTURE
Techniques to capture document data are disclosed. It is determined that a sequence of pages in a stream of document page images comprises a single multi-page document. Data is extracted from two or more different pages included in the sequence. The data extracted from the two or more different pages is used to populate a data entry form associated with the multi-page document.
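The abstract above does not say how the page sequence is recognized. As one possible illustration only, the Python sketch below assumes each page has already been converted to text and that pages carry "Page X of Y" markers. The function names `group_pages` and `populate_form`, the marker heuristic, and the field patterns are all hypothetical.

```python
import re
from typing import Dict, List

# Assumed heuristic: pages of a multi-page document carry a "Page X of Y" marker.
PAGE_MARKER = re.compile(r"Page\s+(\d+)\s+of\s+(\d+)", re.IGNORECASE)


def group_pages(page_texts: List[str]) -> List[List[str]]:
    """Split a stream of page texts into documents using the page markers."""
    documents: List[List[str]] = []
    current: List[str] = []
    for text in page_texts:
        match = PAGE_MARKER.search(text)
        page_no = int(match.group(1)) if match else None
        # A page numbered 1, or a page with no marker, starts a new document.
        if current and (page_no is None or page_no == 1):
            documents.append(current)
            current = []
        current.append(text)
    if current:
        documents.append(current)
    return documents


def populate_form(pages: List[str], field_patterns: Dict[str, str]) -> Dict[str, str]:
    """Fill each form field from the first page of the document that matches its pattern."""
    form: Dict[str, str] = {}
    for name, pattern in field_patterns.items():
        for text in pages:
            match = re.search(pattern, text)
            if match:
                form[name] = match.group(1)
                break
    return form


stream = [
    "Loan Application\nApplicant: Jane Doe\nPage 1 of 2",
    "Requested amount: 25000\nPage 2 of 2",
    "Receipt\nTotal: 18.50",
]
docs = group_pages(stream)
form = populate_form(
    docs[0],
    {"applicant": r"Applicant:\s*(.+)", "amount": r"Requested amount:\s*(\d+)"},
)
print(len(docs), form)
```

In this example the applicant name comes from the first page and the amount from the second, so a single form is populated from two different pages of the same document.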
Enhanced Item Validation and Image Evaluation System
Systems for item validation and image evaluation are provided. In some examples, a system may receive an instrument and associated data. After the instrument is received, at least one of a bill pay profile and a user profile may be retrieved. The bill pay profile and user profile may each include a plurality of previously processed instruments that have been determined to be valid and/or authentic. The instrument may be compared to the plurality of previously processed instruments to determine whether one or more elements of the instrument being evaluated match one or more corresponding elements of the previously processed instruments. Matching or non-matching elements may be identified. In some examples, one or more user interfaces may be generated displaying the instruments and including any highlighting or enhancements identifying matching or non-matching elements.
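The element comparison described above can be sketched in Python as follows. This is an illustration under stated assumptions, not the disclosed system: instruments are assumed to be already reduced to dictionaries of extracted element values, and the function name `compare_instrument` and the field names are hypothetical.

```python
from typing import Dict, List, Optional, Tuple

Instrument = Dict[str, Optional[str]]


def compare_instrument(instrument: Instrument, history: List[Instrument],
                       fields: List[str]) -> Tuple[List[str], List[str]]:
    """Split fields into those that match a previously validated instrument and those that do not."""
    matching: List[str] = []
    non_matching: List[str] = []
    for field in fields:
        # Values of this element seen on previously processed, validated instruments.
        known = {prev.get(field) for prev in history if prev.get(field) is not None}
        if instrument.get(field) in known:
            matching.append(field)
        else:
            non_matching.append(field)
    return matching, non_matching


# Hypothetical profile of previously processed instruments.
history = [
    {"payee": "Acme Utilities", "account": "12345", "signature_hash": "ff00"},
    {"payee": "Acme Utilities", "account": "12345", "signature_hash": "ff00"},
]
instrument = {"payee": "Acme Utilities", "account": "12345", "signature_hash": "ab12"}

matching, non_matching = compare_instrument(
    instrument, history, ["payee", "account", "signature_hash"]
)
print(matching, non_matching)
```

A user interface could then highlight the fields in `non_matching` when displaying the instrument next to the earlier ones.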
Method and system for human-vision-like scans of unstructured text data to detect information-of-interest
A method, system and computer program are disclosed for automatic, highly accurate machine scans of unstructured text data sources, such as information kept or displayed in Web browsers, WORD, POWERPOINT, EXCEL, PDF, and other documents, with the ability to detect, isolate and extract specific text information from unknown and varying locations within the unstructured text data. The system performs multiple human-vision-like but electronic scans of the unstructured data, using artificial intelligence techniques to locate and extract required information despite varying conditions, such as an unknown number of pages, an unknown sequence of pages, unknown data layouts and data arrangements, an unknown number, length and indentation of sections/paragraphs, and, in the case of tabular data, an unknown number of rows and column sequences in the unstructured text data source.
IDENTIFYING REGULATORY DATA CORRESPONDING TO EXECUTABLE RULES
Various embodiments are provided for correlating regulatory data in a computing environment by a processor. A rule may be associated with one or more textual paragraphs extracted from a policy document that describes at least a portion of the rule.