Patent classifications
G06V30/42
METHOD AND PLATFORM OF GENERATING DOCUMENT, ELECTRONIC DEVICE AND STORAGE MEDIUM
A method and a platform of generating a document, an electronic device, and a storage medium are provided, which relate to a field of an artificial intelligence technology, in particular to fields of computer vision and deep learning technologies, and may be applied to a text recognition scenario and other scenarios. The method includes: performing a category recognition on a document picture to obtain a target category result; determining a target structured model matched with the target category result; and performing, by using the target structured model, a structure recognition on the document picture to obtain a structure recognition result, so as to generate an electronic document based on the structure recognition result, wherein the structure recognition result includes a field attribute recognition result and a field position recognition result.
CONTINUOUS MACHINE LEARNING METHOD AND SYSTEM FOR INFORMATION EXTRACTION
Methods and systems for artificial intelligence (AI)-assisted document annotation and training of machine learning-based models for document data extraction are described. The methods and systems described herein take advantage of a continuous machine learning approach to create document processing pipelines that provide accurate and efficient data extraction from documents that include structured text, semi-structured text, unstructured text, or any combination thereof.
INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND COMPUTER PROGRAM
An information processing apparatus (10) includes a controller (11) that acquires an image containing a figure and a character string and generates association information indicating an association between the figure and the character string based on a positional relationship between the figure and the character string in the image.
METHODS, SYSTEMS, ARTICLES OF MANUFACTURE, AND APPARATUS FOR DECODING IMAGES
Methods, apparatus, systems, and articles of manufacture are disclosed for decoding images. An example apparatus to decode an image comprises interface circuitry to receive an image of a purchase document, and processor circuitry to execute the machine readable instructions to extract text from the image of the purchase document, the image of the purchase document to memorialize a transaction that includes at least one product; determine a type of the purchase document to which the image corresponds; apply one of a first pipeline or a second pipeline to the image of the purchase document based on the type of the purchase document; obtain purchase facts corresponding to a respective one of the at least one product memorialized in the image of the purchase document; and map the obtained purchase facts against a products database to identify the at least one product memorialized in the image of the purchase document.
METHODS, SYSTEMS, ARTICLES OF MANUFACTURE, AND APPARATUS FOR DECODING IMAGES
Methods, apparatus, systems, and articles of manufacture are disclosed for decoding images. An example apparatus to decode an image comprises interface circuitry to receive an image of a purchase document, and processor circuitry to execute the machine readable instructions to extract text from the image of the purchase document, the image of the purchase document to memorialize a transaction that includes at least one product; determine a type of the purchase document to which the image corresponds; apply one of a first pipeline or a second pipeline to the image of the purchase document based on the type of the purchase document; obtain purchase facts corresponding to a respective one of the at least one product memorialized in the image of the purchase document; and map the obtained purchase facts against a products database to identify the at least one product memorialized in the image of the purchase document.
SYSTEMS AND METHODS FOR AUTOMATED DIGITIZATION OF AND WORKFLOWS FOR DATA OBJECT MODEL
Methods and systems include a trade finance digital asset platform that generally provides improved visibility, security, and workflow execution for a set of trade finance transactions enabling capabilities for trade finance asset digitization, a trade finance data object model, interfaces to systems used by parties to trade finance transactions, event and state reporting services, and smart contract services that optionally operate using a blockchain.
Identity document verification based on barcode structure
An identity document can be authenticated using format data of a barcode on the document, such as a barcode on a driver's license. Scan data is obtained by decoding a plurality of barcodes. Format features of the plurality of barcodes are extracted. Scan data is classified into two or more clusters. Each cluster is characterized by a set of format features extracted from the scan data. A barcode on an ID to be verified is scanned. Format features from the barcode of the ID to be verified is compared to at least one of the two or more clusters to authenticate the ID.
Identity document verification based on barcode structure
An identity document can be authenticated using format data of a barcode on the document, such as a barcode on a driver's license. Scan data is obtained by decoding a plurality of barcodes. Format features of the plurality of barcodes are extracted. Scan data is classified into two or more clusters. Each cluster is characterized by a set of format features extracted from the scan data. A barcode on an ID to be verified is scanned. Format features from the barcode of the ID to be verified is compared to at least one of the two or more clusters to authenticate the ID.
HANDWRITING RECOGNITION PIPELINES FOR GENEALOGICAL RECORDS
Disclosed herein relates to example embodiments for recognizing handwritten information in a genealogical record. A computing server may receive a genealogical record. The genealogical record may take the form of an image of a physical form having a structured layout, fields, and handwritten information. The computing server may divide the genealogical record into a plurality of areas based on the structured layout. The computing server may identify, for a particular area, a type of field that is included within the particular area. The computing server may select a handwriting recognition model for identifying the handwritten information in the particular area. The handwriting recognition model may be selected based on the type of the field. The computing server may input an image of the particular area to the handwriting recognition model to generate text of the handwritten information. The computing server may store the text of the handwritten information.
METHODS, SYSTEMS, ARTICLES OF MANUFACTURE, AND APPARATUS FOR DECODING PURCHASE DATA USING AN IMAGE
Methods, apparatus, systems, and articles of manufacture are disclosed that decode purchase data using an image. An example apparatus includes a dictionary including associated product descriptions and barcodes, interface circuitry, and processing circuitry to execute machine readable instructions to obtain purchase details and barcodes corresponding to a receipt, the purchase details including receipt product descriptions, generate a search query that includes a first receipt product description of the receipt product descriptions, a list of barcodes corresponding to the barcodes, and a store identifier associated with the receipt, execute a search against the dictionary using the search query to identify a barcode from the list of barcodes that corresponds to the first receipt product description, and in response to identifying the barcode that corresponds to the first receipt product description, associating the barcode and the first receipt product description and adding the association to the dictionary.