G06V30/19113

Machine learning driven rules engine for dynamic data-driven enterprise application

The present invention generally relates to system, method and graphical user interface for executing one or more tasks in dynamic data driven enterprise application. The invention includes creation of rules on a rule creation interface by one or more syntax from a rule creation syntax data library. The invention provides machine learning models driven rule engine for executing the tasks.

METHODS, SYSTEMS, ARTICLES OF MANUFACTURE, AND APPARATUS TO TAG SEGMENTS IN A DOCUMENT
20240096125 · 2024-03-21 ·

Methods, apparatus, systems, and articles of manufacture are disclosed to tag segments in a document. An example apparatus includes processor circuitry to execute machine readable instructions to generate node embeddings for nodes of a graph, the node embeddings based on features extracted from text segments detected in a document, the text segments to be represented by the nodes of the graph; sample edges corresponding to the nodes to generate the graph; generate first updated node embeddings by passing the node embeddings and the graph through layers of a graph neural network, the first updated embeddings corresponding to the node embeddings augmented with neighbor information; generate second updated node embeddings by passing the first updated embeddings through layers of a recurrent neural network, the second updated embeddings corresponding to the first updated node embeddings augmented with sequential information; and classify the text segments based on the second updated node embeddings.

Low- And High-Fidelity Classifiers Applied To Road-Scene Images
20190311221 · 2019-10-10 ·

Disclosures herein teach applying a set of sections spanning a down-sampled version of an image of a road-scene to a low-fidelity classifier to determine a set of candidate sections for depicting one or more objects in a set of classes. The set of candidate sections of the down-sampled version may be mapped to a set of potential sectors in a high-fidelity version of the image. A high-fidelity classifier may be used to vet the set of potential sectors, determining the presence of one or more objects from the set of classes. The low-fidelity classifier may include a first Convolution Neural Network (CNN) trained on a first training set of down-sampled versions of cropped images of objects in the set of classes. Similarly, the high-fidelity classifier may include a second CNN trained on a second training set of high-fidelity versions of cropped images of objects in the set of classes.

Multi-model system for electronic transaction authorization and fraud detection

A method receives an electronic image and uses the image as an input to a neural network. Based on a determination that the image represents a document, the method uses the image as an input to another neural network to identify a portion of the document containing an identifier. The method extracts the identifier by performing character recognition on the identified portion and determines whether the identifier is valid by using a validation API to determine whether the identifier is associated with a valid account at an institution. Based on a determination that the identifier is associated with a valid account, the method authorizes a transaction associated with the identifier. Based on a determination that the identifier is not associated with a valid account, the method denies the transaction. The first neural network classifies the electronic image into one of multiple valid document types and an invalid document type.

Low- and high-fidelity classifiers applied to road-scene images

Disclosures herein teach applying a set of sections spanning a down-sampled version of an image of a road-scene to a low-fidelity classifier to determine a set of candidate sections for depicting one or more objects in a set of classes. The set of candidate sections of the down-sampled version may be mapped to a set of potential sectors in a high-fidelity version of the image. A high-fidelity classifier may be used to vet the set of potential sectors, determining the presence of one or more objects from the set of classes. The low-fidelity classifier may include a first Convolution Neural Network (CNN) trained on a first training set of down-sampled versions of cropped images of objects in the set of classes. Similarly, the high-fidelity classifier may include a second CNN trained on a second training set of high-fidelity versions of cropped images of objects in the set of classes.

OPTICAL RECEIPT PROCESSING
20240177123 · 2024-05-30 ·

Techniques for providing improved optical character recognition (OCR) for receipts are discussed herein. Some embodiments may provide for a system including one or more servers configured to perform receipt image cleanup, logo identification, and text extraction. The image cleanup may include transforming image data of the receipt by using image parameters values that optimize the logo identification, and performing logo identification using a comparison of the image data with training logos associated with merchants. When a merchant is identified, a second image clean up may be performed by using image parameter values optimized for text extraction. A receipt structure may be used to categorize the extracted text. Improved OCR accuracy is also achieved by applying on format rules of the receipt structure to the extracted text.

Optical character recognition quality evaluation and optimization
12014559 · 2024-06-18 · ·

A processor may receive an image and determine a number of foreground pixels in the image. The processor may obtain a result of optical character recognition (OCR) processing performed on the image. The processor may identify at least one bounding box surrounding at least one portion of text in the result and overlay the at least one bounding box on the image to form a masked image. The processor may determine a number of foreground pixels in the masked image and a decrease in the number of foreground pixels in the masked image relative to the number of foreground pixels in the image. Based on the decrease, the processor may modify an aspect of the OCR processing for subsequent image processing.

Image analysis system for testing in manufacturing

A vision analytics and validation (VAV) system for providing an improved inspection of robotic assembly, the VAV system comprising a trained neural network three-way classifier, to classify each component as good, bad, or do not know, and an operator station configured to enable an operator to review an output of the trained neural network, and to determine whether a board including one or more bad or a do not know classified components passes review and is classified as good, or fails review and is classified as bad. In one embodiment, a retraining trigger to utilize the output of the operator station to train the trained neural network, based on the determination received from the operator station.

Information processing device, image processing system and non-transitory computer readable medium storing program

An information processing device includes: an obtaining unit that obtains a first classification condition for classifying a document by use of image information of an image formed on the document; an acceptance unit that accepts a second classification condition for classifying the document, the second classification condition being defined by a user; and a classification unit that applies the first classification condition and the second classification condition to the image information based on a predetermined rule of a degree of priority, and classifies the document.

INFORMATION PROCESSING SYSTEM, METHOD, AND NON-TRANSITORY COMPUTER-EXECUTABLE MEDIUM
20240257547 · 2024-08-01 ·

An information processing system includes circuitry. The circuitry acquires a captured image by capturing a document. The circuitry performs an analysis process using the captured image. The circuitry selects, for each of at least one setting item of a plurality of setting items relating to image processing to be performed on the captured image, at least one setting value from among configurable setting values as a candidate for a recommended setting. The circuitry performs image processing repeatedly on the captured image while changing setting values of the plurality of setting items with a setting value of the at least one setting item restricted to the at least one setting value selected as the candidate for the recommended setting. The circuitry determines recommended settings for the plurality of setting items relating to image processing to obtain an image suitable for character recognition.