G06V30/19113

MULTI-MODEL SYSTEM FOR ELECTRONIC TRANSACTION AUTHORIZATION AND FRAUD DETECTION

A method receives an electronic image and uses the image as an input to a neural network. Based on a determination that the image represents a document, the method uses the image as an input to another neural network to identify a portion of the document containing an identifier. The method extracts the identifier by performing character recognition on the identified portion and determines whether the identifier is valid by using a validation API to determine whether the identifier is associated with a valid account at an institution. Based on a determination that the identifier is associated with a valid account, the method authorizes a transaction associated with the identifier. Based on a determination that the identifier is not associated with a valid account, the method denies the transaction. The first neural network classifies the electronic image into one of multiple valid document types and an invalid document type.

OPTICAL RECEIPT PROCESSING
20230162165 · 2023-05-25 ·

Techniques for providing improved optical character recognition (OCR) for receipts are discussed herein. Some embodiments may provide for a system including one or more servers configured to perform receipt image cleanup, logo identification, and text extraction. The image cleanup may include transforming image data of the receipt by using image parameters values that optimize the logo identification, and performing logo identification using a comparison of the image data with training logos associated with merchants. When a merchant is identified, a second image clean up may be performed by using image parameter values optimized for text extraction. A receipt structure may be used to categorize the extracted text. Improved OCR accuracy is also achieved by applying on format rules of the receipt structure to the extracted text.

ON-DEVICE TWO STEP APPROXIMATE STRING MATCHING
20230206669 · 2023-06-29 ·

A personalized preview system to receive a request to access a collection of media items from a user of a user device. Responsive to receiving the request to access the collection of media items, the personalized preview system accesses user profile data associated with the user, wherein the user profile data includes an image. For example, the image may comprise a depiction of a face, wherein the face comprises a set of facial landmarks. Based on the image, the personalized preview system generates one or more media previews based on corresponding media templates and the image, and displays the one or more media previews within a presentation of the collection of media items at a client device of the user.

OPTICAL CHARACTER RECOGNITION QUALITY EVALUATION AND OPTIMIZATION
20230186661 · 2023-06-15 · ·

A processor may receive an image and determine a number of foreground pixels in the image. The processor may obtain a result of optical character recognition (OCR) processing performed on the image. The processor may identify at least one bounding box surrounding at least one portion of text in the result and overlay the at least one bounding box on the image to form a masked image. The processor may determine a number of foreground pixels in the masked image and a decrease in the number of foreground pixels in the masked image relative to the number of foreground pixels in the image. Based on the decrease, the processor may modify an aspect of the OCR processing for subsequent image processing.

DATE AND TIME FEATURE IDENTIFICATION
20230177856 · 2023-06-08 ·

Methods and systems for text processing include building a knowledge base using column names and associated functions from a code base. Classifiers are trained using the knowledge base and are cross-validated to determine accuracy scores. Text is processed using a selected classifier having a highest accuracy score from the classifiers to determine date/time features.

Leveraging text profiles to select and configure models for use with textual datasets

Text profiles can be leveraged to select and configure models according to some examples described herein. In one example, a system can analyze a reference textual dataset and a target textual dataset using text-mining techniques to generate a first text profile and a second text profile, respectively. The first text profile can contain first metrics characterizing the reference textual dataset and the second text profile can contain second metrics characterizing the target textual dataset. The system can determine a similarity value by comparing the first text profile to the second text profile. The system can also receive a user selection of a model that is to be applied to the target textual dataset. The system can then generate an insight relating to an anticipated accuracy of the model on the target textual dataset based on the similarity value. The system can output the insight to the user.

Low- and high-fidelity classifiers applied to road-scene images

Disclosures herein teach applying a set of sections spanning a down-sampled version of an image of a road-scene to a low-fidelity classifier to determine a set of candidate sections for depicting one or more objects in a set of classes. The set of candidate sections of the down-sampled version may be mapped to a set of potential sectors in a high-fidelity version of the image. A high-fidelity classifier may be used to vet the set of potential sectors, determining the presence of one or more objects from the set of classes. The low-fidelity classifier may include a first Convolution Neural Network (CNN) trained on a first training set of down-sampled versions of cropped images of objects in the set of classes. Similarly, the high-fidelity classifier may include a second CNN trained on a second training set of high-fidelity versions of cropped images of objects in the set of classes.

Automatic protocol discovery using text analytics

A computing system for learning a device type and message formats used by a device is provided. The computing system includes an interface and a processor. The interface is receptive of documents describing identification information and communication and application protocols of devices. The processor is coupled with the interface to obtain rules of network packet analysis using document analytics and identify identification information and communication and application protocols of network messages from devices using the rules.

SYSTEM AND METHOD FOR MULTI-SENSOR, MULTI-LAYER TARGETED LABELING AND USER INTERFACES THEREFOR

A method includes receiving an input specifying a recognition target. The method further includes selecting a plurality of models of an initial recognition layer based on the recognition target, and selecting a plurality of models of a final recognition layer based on the recognition target. The method includes obtaining sensor data from two or more sensors of a plurality of sensors, providing the sensor data to the plurality of models of the initial recognition layer to obtain an initial set of identifications, providing sensor data to the plurality of models of the final recognition layer to obtain a final set of identifications, and outputting an identification from at least one of the initial set of identifications or the final set of identifications.

Low- and high-fidelity classifiers applied to road-scene images

Disclosures herein teach applying a set of sections spanning a down-sampled version of an image of a road-scene to a low-fidelity classifier to determine a set of candidate sections for depicting one or more objects in a set of classes. The set of candidate sections of the down-sampled version may be mapped to a set of potential sectors in a high-fidelity version of the image. A high-fidelity classifier may be used to vet the set of potential sectors, determining the presence of one or more objects from the set of classes. The low-fidelity classifier may include a first Convolution Neural Network (CNN) trained on a first training set of down-sampled versions of cropped images of objects in the set of classes. Similarly, the high-fidelity classifier may include a second CNN trained on a second training set of high-fidelity versions of cropped images of objects in the set of classes.