G06V30/40

Artificial intelligence (AI) based document processor

An Artificial Intelligence (AI) based document processing system receives a request that includes a message, documents, or both, related to a process to be automatically executed. A process identifier is extracted from the request and used to retrieve guidelines for the automatic execution of the document processing task. Machine Learning (ML) models, each corresponding to a guideline, are used to extract data responsive to the guidelines. Once the document processing task has been executed automatically and the responsive data meets an approval threshold, a recommendation to accept or reject the request, a corresponding letter, or both can be generated automatically.
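The flow in this abstract can be sketched roughly as follows. Everything named here is an assumption made for illustration: the `GUIDELINES` table, the lambda stand-ins for the per-guideline ML models, the averaging of scores, and the default threshold of 0.75 do not come from the patent, and the extraction of the process identifier from the request is skipped by passing it in directly.

```python
# Hypothetical guideline store keyed by process identifier.
GUIDELINES = {
    "claim-review": ["has_signature", "amount_within_limit"],
}

# Stand-ins for the per-guideline ML models: each returns a score in [0, 1].
MODELS = {
    "has_signature": lambda doc: 1.0 if "signed by" in doc.lower() else 0.0,
    "amount_within_limit": lambda doc: 1.0 if "amount: 100" in doc.lower() else 0.0,
}


def process_request(process_id, documents, approval_threshold=0.75):
    """Score the documents against each guideline and draft a recommendation."""
    text = "\n".join(documents)
    scores = {g: MODELS[g](text) for g in GUIDELINES[process_id]}
    # Assumption: the overall score is the mean of the guideline scores.
    overall = sum(scores.values()) / len(scores)
    decision = "accept" if overall >= approval_threshold else "reject"
    letter = f"Recommendation for {process_id}: {decision} (score {overall:.2f})."
    return decision, letter


decision, letter = process_request(
    "claim-review", ["Signed by J. Doe", "Amount: 100"]
)
```

A real system would replace the lambdas with trained models and would probably weight guidelines differently; the sketch only shows how guidelines, models, and a threshold fit together.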

Information processing apparatus, image processing apparatus, and non-transitory computer readable medium storing program
11562122 · 2023-01-24

An information processing apparatus includes a processor configured to extract, from a document, words of plural categories, select one extracted word from each of the plural categories, generate a first character string by arranging the selected words in accordance with a rule, wherein the rule determines positions of the selected words within the first character string based on the categories of the selected words, in response to reception of an operation of changing a first word in the first character string from a user, present to the user one or more candidate words from the category of the first portion of the first character string, generate a second character string by replacing the first word in the first character string with a user-selected word selected by the user from among the one or more candidate words, and store the second character string in a memory in association with the document.
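As a minimal sketch of the string generation described above, assuming a hypothetical rule that orders categories as date, company, document type, and an underscore separator (neither is stated in the abstract):

```python
# Hypothetical rule: the order of categories within the generated string.
RULE = ("date", "company", "doc_type")


def select_words(extracted, rule=RULE):
    """Select one extracted word per category, positioned according to the rule."""
    return [extracted[category][0] for category in rule]


def candidates_for(position, extracted, rule=RULE):
    """Candidates come from the category that occupies the given position."""
    return extracted[rule[position]]


def replace_word(words, position, new_word):
    """Return a copy of the word list with one position replaced."""
    updated = list(words)
    updated[position] = new_word
    return updated


def render(words, sep="_"):
    return sep.join(words)


# Words extracted from a document, grouped by category (sample data).
extracted = {
    "date": ["2023-01-24", "2022-12-01"],
    "company": ["Acme", "Globex"],
    "doc_type": ["invoice", "receipt"],
}

first_words = select_words(extracted)
first_string = render(first_words)

# The user asks to change the word at position 1 and picks the second candidate.
options = candidates_for(1, extracted)
second_string = render(replace_word(first_words, 1, options[1]))

# Store the second string in association with the document (hypothetical key).
memory = {"scan_0001.pdf": second_string}
```

Keeping the selected words as a list, rather than splitting the rendered string, avoids breaking on words that happen to contain the separator.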

Parallel prediction of multiple image aspects

Example embodiments that analyze images to characterize aspects of the images rely on the same neural network to characterize multiple aspects in parallel. Because additional neural networks are not required for additional aspects, such an approach scales as the number of aspects increases.

Color expression conversion apparatus for understanding color perception in document using textual expression, and non-transitory computer readable medium storing program

A color expression conversion apparatus includes a color expression conversion rule storage unit that stores a rule for converting a textual expression of a specific color into another textual expression, and a color expression conversion control unit that converts a textual expression of a color included in document data into another textual expression in accordance with the rule.
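A rule-driven conversion of this kind could look like the sketch below. The contents of `RULES` are invented for illustration; the abstract does not say which expressions are converted or into what.

```python
import re

# Hypothetical rule table: color expression (lowercase) -> replacement expression.
RULES = {
    "red text": "underlined text",
    "green": "green (shown with hatching)",
}


def convert_color_expressions(text, rules=RULES):
    """Replace each color expression found in the text according to the rules."""
    # Longest keys first so multi-word expressions win over shorter ones.
    keys = sorted(rules, key=len, reverse=True)
    pattern = re.compile(
        "|".join(r"\b" + re.escape(key) + r"\b" for key in keys),
        flags=re.IGNORECASE,
    )
    return pattern.sub(lambda m: rules[m.group(0).lower()], text)


converted = convert_color_expressions("See the Red text and the green bar.")
```

The word-boundary anchors keep the rule for "green" from firing inside words such as "greenish".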

METHOD TO DETERMINE AUTHENTICITY OF SECURITY HOLOGRAM

A method to determine authenticity of a security feature of an identification document, characterized by receiving a real-time video feed of the identification document with a light source directed at the identification document to make visible a security hologram; processing the real-time video feed into a plurality of image sequences; analysing each image from the plurality of image sequences for a glare and the security hologram, wherein the glare is a reflection of the light source from the identification document; analysing the position of the glare and the security hologram in each image from the plurality of image sequences; evaluating whether the position of the glare and the position of the security hologram are caused by the light source; and providing an authenticity result for the identification document captured from the real-time video feed.

MULTI-LAYER NEURAL NETWORK AND CONVOLUTIONAL NEURAL NETWORK FOR CONTEXT SENSITIVE OPTICAL CHARACTER RECOGNITION
20230019919 · 2023-01-19

Aspects of the disclosure relate to OCR. A computing platform may train, using historical images, a CNN and an RNN to perform OCR and identify characters in context. The computing platform may receive an image of a document, and may input the image into the CNN, which may cause the CNN to output OCR information for the image and a confidence score. Based on identifying that the confidence score exceeds a confidence threshold, the computing platform may store the OCR information to enable subsequent access of a digital version of the document. Based on identifying that the confidence score does not exceed the confidence threshold, the computing platform may: 1) input the OCR information into the RNN, which may cause the RNN to output contextual OCR information for the image, and 2) store the contextual OCR information to enable subsequent access of the digital version of the document.
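The confidence-based routing in this abstract can be sketched as below. The two model functions are stubs standing in for trained networks, and the threshold value of 0.9 is an assumption; only the branching logic follows the text.

```python
def stub_cnn(image):
    """Stand-in for the trained CNN: returns (ocr_text, confidence)."""
    return image["raw_text"], image["confidence"]


def stub_rnn(ocr_text):
    """Stand-in for the trained RNN: returns context-corrected text."""
    return ocr_text.replace("he1lo", "hello")


def recognize(image, cnn=stub_cnn, rnn=stub_rnn, threshold=0.9):
    """Keep the CNN output when it is confident, otherwise refine it with the RNN."""
    ocr_text, confidence = cnn(image)
    if confidence > threshold:
        return ocr_text
    return rnn(ocr_text)


# Hypothetical store mapping a document id to its digital version.
store = {
    "doc-1": recognize({"raw_text": "hello world", "confidence": 0.97}),
    "doc-2": recognize({"raw_text": "he1lo world", "confidence": 0.42}),
}
```

Only low-confidence results pay the cost of the second model, which is the point of the two-stage design.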

SYSTEM FOR THIRD PARTY SELLERS IN ONLINE RETAIL ENVIRONMENT

A third party item listing management system useable for validation of third party items to be included on a retailer website is disclosed. The third party item listing management system includes an application programming interface (API) accessible by a plurality of third parties and configured to receive item data. An item management process receives the item data and calls an item validation pipeline which includes a plurality of item validation stages including an item legalization stage. In the item legalization stage, the item data and the identity of the third party are validated against a plurality of item listing rules to determine whether the one or more items are allowed to be offered via the retailer website by the third party. The item listing rules can include a rule preventing the third party from listing an item included in a core item collection offered by the retailer via the retailer website.
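A staged validation pipeline of this shape might be sketched as follows. The `schema_stage`, the seller allow-list, and the sample SKUs are hypothetical; the abstract names only the legalization stage and the core-collection rule.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ItemData:
    sku: str
    title: str
    seller_id: str


# Hypothetical data consulted by the item listing rules.
CORE_ITEM_SKUS = {"SKU-100"}
APPROVED_SELLERS = {"seller-1", "seller-2"}


def schema_stage(item):
    """Hypothetical earlier stage: basic shape checks on the item data."""
    return [] if item.title.strip() else ["title is empty"]


def legalization_stage(item):
    """Check the item and the seller identity against the listing rules."""
    errors = []
    if item.seller_id not in APPROVED_SELLERS:
        errors.append("seller is not approved")
    if item.sku in CORE_ITEM_SKUS:
        errors.append("item belongs to the retailer's core collection")
    return errors


PIPELINE = [schema_stage, legalization_stage]


def validate_item(item):
    """Run every stage and return (allowed, collected_errors)."""
    errors = [error for stage in PIPELINE for error in stage(item)]
    return (not errors, errors)


ok_result = validate_item(ItemData("SKU-200", "Desk lamp", "seller-1"))
core_result = validate_item(ItemData("SKU-100", "Mug", "seller-1"))
```

Running all stages and collecting errors, instead of stopping at the first failure, lets the third party fix every problem in one resubmission; a fail-fast variant would be equally plausible.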

Optical character recognition of documents having non-coplanar regions
11699294 · 2023-07-11

Systems and methods for performing OCR of an image depicting text symbols and imaging a document having a plurality of planar regions are disclosed. An example method comprises: receiving a first image of a document having a plurality of planar regions and one or more second images of the document; identifying a plurality of coordinate transformations corresponding to each of the planar regions of the first image of the document; identifying, using the plurality of coordinate transformations, a cluster of symbol sequences of the text in the first image and in the one or more second images; and producing a resulting OCR text comprising a median symbol sequence for the cluster of symbol sequences.
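The abstract does not say how the median symbol sequence is computed. One common reading, assumed here, is the set median: the member of the cluster with the smallest total Levenshtein distance to all other members. The sample cluster stands in for readings of the same text fragment taken from several images.

```python
def levenshtein(a, b):
    """Edit distance between two sequences (insert, delete, substitute)."""
    previous = list(range(len(b) + 1))
    for i, symbol_a in enumerate(a, start=1):
        current = [i]
        for j, symbol_b in enumerate(b, start=1):
            current.append(
                min(
                    previous[j] + 1,  # deletion
                    current[j - 1] + 1,  # insertion
                    previous[j - 1] + (symbol_a != symbol_b),  # substitution
                )
            )
        previous = current
    return previous[-1]


def median_sequence(cluster):
    """Return the cluster member minimizing the summed distance to all members."""
    return min(cluster, key=lambda s: sum(levenshtein(s, t) for t in cluster))


# Hypothetical OCR readings of the same fragment from different images.
cluster = ["invoice", "invoice", "inv0ice", "lnvoice"]
result = median_sequence(cluster)
```

Because recognition errors tend to differ from image to image, the reading closest to all the others is usually the correct one.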