G06V30/19

IMPROVED SYSTEM AND METHOD FOR AUTOMATING BUSINESS ACCOUNTING
20230214893 · 2023-07-06 · ·

A system including a database configured to store data characterizing a plurality of emails, a plurality of projects, a plurality of vendors, and a plurality of invoices. Each invoice of the plurality of invoices includes data characterizing a project of the plurality of projects, a vendor of the plurality of vendors, and a plurality of invoice data. The system also includes a computing system communicatively coupled to the database and including at least one processor. The at least one processor is configured to receive a file in a first format, convert the file into a second format, determine a first project, a first vendor, and a first plurality of invoice data based on data in the file, prepare a new invoice based on the first project, the first vendor, and the first plurality of invoice data determined, and provide the new invoice in a third format.

METHODS, APPARATUSES, AND COMPUTER-READABLE STORAGE MEDIA FOR IMAGE-BASED SENSITIVE-TEXT DETECTION
20230215202 · 2023-07-06 ·

“The present disclosure describes a method, an apparatus, and a non-transitory computer-readable medium for detecting sensitive text information such as privacy-related text information from a signal and modifying the signal by removing the detected sensitive text information therefrom. The apparatus receives the signal such as an image, a video clip, or an audio clip, and recognizes a text string therefrom. The apparatus then detects, from the text string, a substring based on a similarity between the substring and a regular expression, and modifies the signal by removing information related to the detected substring from the signal.”

SEARCH DEVICE, SEARCH SYSTEM, SEARCH METHOD, AND STORAGE MEDIUM
20230215204 · 2023-07-06 ·

According to one embodiment, a search device generates a character string image of a first character string by using the first character string. The search device inputs the character string image to a classifier. The classifier outputs a classification of a character string according to an input of an image. The search device outputs an other character string based on a classification result of the classifier. The other character string is different from the first character string.

Machine learning technique for automatic modeling of multiple-valued outputs

A method and system are disclosed for training a model that implements a machine-learning algorithm. The technique utilizes latent descriptor vectors to change a multiple-valued output problem into a single-valued output problem and includes the steps of receiving a set of training data, processing, by a model, the set of training data to generate a set of output vectors, and adjusting a set of model parameters and component values for at least one latent descriptor vector in the plurality of latent descriptor vectors based on the set of output vectors. The set of training data includes a plurality of input vectors and a plurality of desired output vectors, and each input vector in the plurality of input vectors is associated with a particular latent descriptor vector in a plurality of latent descriptor vectors. Each latent descriptor vector comprises a plurality of scalar values that are initialized prior to training the model.

Object detection and image cropping using a multi-detector approach
11694456 · 2023-07-04 · ·

Systems, methods and computer program products for detecting objects using a multi-detector are disclosed, according to various embodiments. In one aspect, a computer-implemented method includes defining an analysis profile comprising an initial number of analysis cycles dedicated to each of a plurality of detectors, where each detector is independently configured to detect objects according to a unique set of analysis parameters and/or a unique detector algorithm. The method also includes: receiving digital video data that depicts at least one object; analyzing the digital video data using some or all of the detectors in accordance with the analysis profile, where the analyzing produces an analysis result for each detector used in the analysis. Further, the method includes updating the analysis profile by adjusting the number of analysis cycles dedicated to at least one of the detectors based on the analysis results.

Re-training a model for abnormality detection in medical scans based on a re-contrasted training set

A method includes generating first contrast significance data for a first computer vision model generated from a first training set of medical scans. First significant contrast parameters are identified based on the first contrast significance data. A first re-contrasted training set is generated based on performing a first intensity transformation function on the first training set of medical scans, where the first intensity transformation function utilizes the first significant contrast parameters. A first re-trained model is generated from the first re-contrasted training set, which is associated with corresponding output labels based on abnormality data for the first training set of medical scans. Re-contrasted image data of a new medical scan is generated based on performing the first intensity transformation function. Inference data indicating at least one abnormality detected in the new medical scan is generated based on utilizing the first re-trained model on the re-contrasted image data.

Optical character recognition method and apparatus, electronic device and storage medium

The present application discloses a method and an apparatus for optical character recognition, an electronic device and a storage medium, and relates to the fields of artificial intelligence and deep learning. The method may include: determining, for a to-be-recognized image, a text bounding box of a text area therein, and extracting a text area image from the to-be-recognized image according to the text bounding box; determining a bounding box of text lines in the text area image, and extracting a text-line image from the text area image according to the bounding box; and performing text sequence recognition on the text-line image, and obtaining a recognition result. The application of the solution in the present application can improve a recognition speed and the like.

Apparatus for generating annotated image information using multimodal input data, apparatus for training an artificial intelligence model using annotated image information, and methods thereof
11694021 · 2023-07-04 · ·

A method for providing a user interface (UI) for generating training data for an artificial intelligence (AI) model may include providing, for display via the UI, image information that depicts an object, a set of operations of the object, and a process associated with the set of operations. The method may include providing, for display via the UI, text information that describes the object, the set of operations of the object, and the process associated with the set of operations. The method may include receiving, via the UI, a user input that associates respective image information of the image information with corresponding text information of the text information. The method may include generating association information that associates the respective image information with the corresponding text information, based on the user input. The method may include generating discourse and semantic information from the text information associated to the image information.

Systems and methods for generating document numerical representations

Described embodiments relate to a method comprising: determining a candidate document comprising image data and character data and extracting the image data and the character data from the candidate document. The method comprises providing, to an image-based numerical representation generation model, the image data, and generating, by the image-based numerical representation generation model, an image-based numerical representation of the image data. The method comprises providing, to a character-based numerical representation generation model, the character data; and generating, by the character-based numerical representation generation model, a character-based numerical representation of the character data. The method comprises providing, to a consolidated image-character based numerical representation generation model, the image-based numerical representation and the character-based numerical representation; and generating, by the consolidated image-character based numerical representation generation model, a combined image-character based numerical representation of the candidate document.

Predictive data analysis using image representations of categorical data to determine temporal patterns

There is a need for more effective and efficient predictive data analysis solutions and/or more effective and efficient solutions for generating image representations of categorical data. In one example, embodiments comprise receiving a categorical input feature, generating an image representation of the categorical input feature, generating an image-based prediction based at least in part on the image representation, and performing one or more prediction-based actions based at least in part on the image-based prediction.