Patent classifications
G06V30/1452
BUILDING CLASSIFICATION AND EXTRACTION MODELS BASED ON ELECTRONIC FORMS
According to one embodiment, a computer-implemented method is configured for building a classification and/or data extraction knowledge base using an electronic form. The method includes: receiving an electronic form having associated therewith a plurality of metadata labels, each metadata label corresponding to at least one element of interest represented within the electronic form; parsing the plurality of metadata labels to determine characteristic features of the element(s) of interest; building a representation of the electronic form based on the plurality of metadata labels; generating a plurality of permutations of the representation of the electronic form by applying a predetermined set of variations to the representation; and training either a classification model, an extraction model, or both using: the representation of the electronic form, and the plurality of permutations of the representation of the electronic form. Corresponding systems and computer program products are also disclosed.
Methods and apparatus for nonintrusive monitoring of web browser usage
Example methods disclosed herein for monitoring web browsing include processing a video image obtained from a video signal of a device implementing a web browser to identify a first image region less than the entire video image, the first image region having a first shape corresponding to an address bar of the web browser. Disclosed example methods also include tagging first textual information identified in the first image region as corresponding to an address of a web page displayed by the web browser in response to determining that the first textual information includes a first string of text matching a reference string of text. Disclosed example methods further include reporting the tagged first textual information to determine usage of the web browser.
CHARACTER RECOGNITION APPARATUS FOR RECOGNIZING CHARACTER STRING OVER MULTIPLE LINES NOT HAVING KNOWN FORMAT
The computing circuit detects a plurality of character regions in the input image, each of the plurality of character regions including a character or a character string made of a plurality of characters. The computing circuit determines a direction of the character or the character string in each of the character regions. The computing circuit recognizes the character or the character string in each of the character regions. The computing circuit generates a connected region by connecting at least two character regions including the character(s) or the character string(s) having a same direction, the at least two character regions being close to each other at a distance less than a threshold. The computing circuit connects the character(s) or the character string(s) included in the connected region to each other.
METHOD OF PHONE NUMBER RECOGNITION AND SYSTEM FOR USING THE SAME
A mobile device includes a display, a non-transitory computer readable medium configured to store instructions thereon, and a processor connected to the non-transitory computer readable medium and the display. The processor is configured to execute the instructions for receiving image data. The processor is further configured to execute the instructions for determining a phone number based on the imaging data. The processor is further configured to execute the instructions for, in response to the phone number being determined, automatically causing the display to: display the phone number, wherein the display is configured to receive instructions for contacting the displayed phone number.
METHOD AND SYSTEM FOR READING AN OPTICAL PRESCRIPTION ON AN OPTICAL PRESCRIPTION IMAGE
A method for reading an optical prescription on an optical prescription image. The method includes detecting a region comprising the optical prescription on the optical prescription image; extracting the optical prescription and converting the optical prescription into machine-encoded optical prescription data; classifying a portion of the optical prescription data into one or more predetermined categories, to generate an optical prescription value associated with a respective one of the one or more predetermined categories; and determining whether the optical prescription value associated with the respective one of the one or more predetermined categories contains an error, and, if the optical prescription value contains the error, correcting the error within the optical prescription value, to generate a corrected optical prescription value associated with the respective one of the one or more predetermined categories. A system for reading an optical prescription on an optical prescription is also disclosed.
Image processing apparatus that obtains item value and performs character recognition process on a document image, image processing method, and non-transitory computer-readable storage medium
An image processing apparatus acquires a character recognition result by performing character recognition processing on a document image, detects a character string candidate described in a predetermined format from the character recognition result, determines a likelihood of the character string candidate based on another character string existing in the vicinity of the detected character string candidate, and outputs, in a case where a plurality of character string candidates is detected, an item value based on a character string candidate having a high likelihood.
KEY-POINT BASED TEXT REGION IDENTIFICATION
Systems and methods for text localization are provided. Various embodiments of the present technology provide systems and methods for improved text localization algorithms that will help in enhancing the efficiency of text identification algorithms used for recognizing text in scanned documents prior to performing OCR, or other related applications. In some embodiments, regions of interest are identified on an image document indicating locations on the image document where text may be present. Individual words in the image document are identified based on space identification and region of interest clustering algorithms applied to the regions of interest in the image document.
MATCHING SYSTEM FOR IMAGES AND TEXT DESCRIPTIONS IN SPECIFICATIONS
Provided is a matching system for images and text descriptions in a specification. The matching system includes an image-and-text recognition device, receiving a specification and recognizing image blocks and text blocks thereon, the image block having corresponding covering range; and a preference value calculation device, assigning preference value to each of the text blocks according to positional relationship between the above-mentioned text block and the above-mentioned image block, and the contents of the above-mentioned text block, for matching the image blocks and the text blocks.
Systems and methods for distributed ledger-based check verification
Systems and methods for distributed ledger-based check verification are disclosed. In one embodiment, a method may include a bank backend computer program: (1) receiving, from a computer application executed by an electronic device, an image of a presented check as part of an electronic check deposit process; (2) performing optical character recognition on the image of the presented check; (3) generating a text file based on the optical character recognition; (4) querying a distributed ledger in a distributed ledger network to determine whether the presented check has been presented or cleared before; (5) determining that the presented check has not been presented or cleared before; (6) processing the presented check for deposit; and (7) writing the text file for the presented check to the distributed ledger.
Information extraction from documents containing handwritten text
A method, computer system, and a computer program product for information extraction is provided. The present invention may include receiving, by a handwriting detection model of an integrated system, a mixed-text document including a combination of typed text and handwritten text, where the received mixed-text document includes at least one key-value pair. The present invention may also include receiving, by the handwriting detection model of the integrated system, a first location information of at least one key from the at least one key-value pair in the received mixed-text document. The present invention may further include detecting, by the handwriting detection model of the integrated system, at least one handwritten text in the received mixed-text document based on the received first location information of the at least one key.