G06V30/19167

AUTOMATED CLASSIFICATION AND INTERPRETATION OF LIFE SCIENCE DOCUMENTS

A computer-implemented tool for automated classification and interpretation of documents, such as life science documents supporting clinical trials, is configured to perform a combination of raw text, document construct, and image analyses to enhance classification accuracy by enabling a more comprehensive machine-based understanding of document content. The combination of analyses provides context for classification by leveraging relative spatial relationships among text and image elements, identifying characteristics and formatting of elements, and extracting additional metadata from the documents as compared to conventional automated classification tools, wherein natural language processing (NLP) is applied to associate text with tokens, and relevant differences and similarities between protocols are identified.

INFORMATION PROCESSING APPARATUS AND NON-TRANSITORY COMPUTER READABLE MEDIUM
20210064867 · 2021-03-04 · ·

An information processing apparatus includes a processor. The processor is configured to acquire an evaluation form image in which an item and correct answer data indicating a correct answer of a recognition result for the item are associated with each other in advance; and output, when the acquired evaluation form image is processed in at least one step of form processing, an evaluation result of the at least one step of the form processing.

Machine learning based extraction of partition objects from electronic documents

An object-extraction method includes generating multiple partition objects based on an electronic document, and receiving a first user selection of a data element via a user interface of a compute device. In response to the first user selection, and using a machine learning model, a first subset of partition objects from the multiple partition objects is detected and displayed via the user interface. A user interaction, via the user interface, with one of the partition objects is detected, and in response, a weight of the machine learning model is modified, to produce a modified machine learning model. A second user selection of the data element is received via the user interface, and in response and using the modified machine learning model, a second subset of partition objects from the multiple partition objects is detected and displayed via the user interface, the second subset different from the first subset.

SYSTEM AND METHOD FOR PROVIDING AN INTERACTIVE VISUAL LEARNING ENVIRONMENT FOR CREATION, PRESENTATION, SHARING, ORGANIZING AND ANALYSIS OF KNOWLEDGE ON SUBJECT MATTER
20210065569 · 2021-03-04 ·

The embodiments herein disclose a system and a method for providing an online web-based interactive audio-visual platform for note creation, presentation, sharing, organizing, and analysis. The system provides a conceptual and interactive interface to content; analyses a student's notes and instantly determines the accuracy of the conceptual connections made and a student's understanding of a topic. The system enables the student to add and use audio, visual, drawing, text notes, and mathematical equations in addition to those suggested by the note taking solution; to collate notes from various sources in a meaningful manner by grouping concepts using colors, images, and text; and to personalize other maps developed within the same environment while maintaining links back to the original source from which the notes are derived. The system highlights keywords in conjunction with spoken text to complement the advantages of using visual maps to improve learning outcomes.

Machine Learning System for Summarizing Tax Documents With Non-Structured Portions

Technologies for summarizing tax documents that include an unstructured portion, such as K1 filings. The system extracts data from both the structured information, such as a K1 facepage, and unstructured information, such as whitepaper statement(s). The system includes machine learning model(s) to determine the information to be extracted from the unstructured information. The machine learning model(s) generate a confidence level associated with the extracted unstructured information that represents a prediction on how likely the extracted unstructured information was accurately extracted. The system generates a document in an electronic interchange format that represents both the structured and unstructured information in the analyzed tax document.

APPARATUS AND METHODS FOR STORING AND DISPENSING MEDICATIONS

An apparatus for automated storage and dispensing of medications. Medications are stored in one or more inventory storage foam storage plates attached to a frame of the apparatus. Medications are delivered to the apparatus via a locked delivery container. A carrier mechanism retrieves medications from the inventory storage container and delivery container and moves medications to various subsystems of the apparatus. Information related to medications is communicated to a remote pharmacist prior to dispensing the medication. Multiple installations of the apparatus are centrally coordinated.

Machine learning-based text recognition system with fine-tuning model

A non-transitory processor-readable medium stores instructions to be executed by a processor. The instructions cause the processor to receive a first trained machine learning model that generates a transcription based on a document. The instructions cause the processor to execute the first trained machine learning model and a second trained machine learning model to generate a refined transcription based on the transcription. The instructions cause the processor to execute a quality assurance program to generate a transcription score based on the document and the transcription. The instructions cause the processor to execute the quality assurance program to generate a refined transcription score based on the refined transcription and at least one of the document or the transcription. The at least one refined transcription score indicates an automation performance better than an automation performance for the at least one transcription score.

Deep learning based adaptive arithmetic coding and codelength regularization
11062211 · 2021-07-13 · ·

A deep learning based compression (DLBC) system applies trained models to compress binary code of an input image to a target codelength. For a set of binary codes representing the quantized coefficents of an input image, the DLBC system applies a first model that is trained to predict feature probabilities based on the context of each bit of the binary codes. The DLBC system compresses the binary code via adaptive arithmetic coding based on the determined probability of each bit. The compressed binary code represents a balance between a reconstruction quality of a reconstruction of the input image and a target compression ratio of the compressed binary code.

IMAGE-DATA-BASED CLASSIFICATION OF MEAT PRODUCTS
20210204553 · 2021-07-08 ·

Meat products can be classified based on image data. Training image data is received that includes image data about first meat products. Labels associated with the first meat products are received, where each of the labels includes a type of one of the first meat products. A trained classification model is developed based on the training image data and the received labels. Image data representative of a second meat product is received. The image data is inputted into the trained classification model, where the trained classification model is configured to classify a type of the second meat product based on the image data. The type of the second meat product is received from the trained classification model.

SYSTEM AND METHOD FOR MULTI-MODAL IMAGE CLASSIFICATION

Systems and methods for classifying images (e.g., ads) are described. An image is accessed. Optical character recognition is performed on at least a first portion of the image. Image recognition is performed via a convolutional neural network on at least a second portion of the image. At least one class for the image is automatically identified, via a fully connected neural network, based on one or more predictions, each of the one or more predictions being based on both the optical character recognition and the image recognition. Finally, the at least one class identified for the image is output.