Patent classifications
G06V30/18057
Generating a response to a user query utilizing visual features of a video segment and a query-response-neural network
The present disclosure relates to systems, methods, and non-transitory computer-readable media for generating a response to a question received from a user during display or playback of a video segment by utilizing a query-response-neural network. The disclosed systems can extract a query vector from a question corresponding to the video segment using the query-response-neural network. The disclosed systems further generate context vectors representing both visual cues and transcript cues corresponding to the video segment using context encoders or other layers from the query-response-neural network. By utilizing additional layers from the query-response-neural network, the disclosed systems generate (i) a query-context vector based on the query vector and the context vectors, and (ii) candidate-response vectors representing candidate responses to the question from a domain-knowledge base or other source. To respond to a user's question, the disclosed systems further select a response from the candidate responses based on a comparison of the query-context vector and the candidate-response vectors.
MACHINE LEARNING FOR DATA EXTRACTION
Computer systems and methods are provided for extracting information from an image of a document. A computer system receives image data, the image data including an image of a document. The computer system determines a portion of the received image data that corresponds to a predefined document field. The computer system utilizes a neural network system to assign a label to the determined portion of the received image data. The computer system performs text recognition on the portion of the received image data and stores the recognized text in association with the assigned label.
IMAGE ANALYSIS APPARATUS, IMAGE ANALYSIS METHOD, AND PROGRAM
There are provided an image analysis apparatus, an image analysis method, and a program for implementing an image analysis method that can, when text information about a structural formula of a compound is generated from an image showing the structural formula, cope with a change in the way of drawing of the structural formula.
An image analysis apparatus according to one embodiment of the present invention includes a processor, and the processor is configured to generate, on the basis of a feature value of a subject image showing a structural formula of a subject compound, symbol information representing the structural formula of the subject compound with a line notation, by using an analysis model. The analysis model is a model created through machine learning using a learning image and symbol information representing a structural formula of a compound shown by the learning image with a line notation.
SYSTEM FOR CHARACTER RECOGNITION IN A DIGITAL IMAGE PROCESSING ENVIRONMENT
Systems, computer program products, and methods are described herein for character recognition in a digital image processing environment. The present invention is configured to electronically retrieve one or more documents from a document repository, wherein the one or more documents are in an image format; initiate one or more image super resolution algorithms on the one or more documents; generate, based on at least the one or more image super resolution algorithms, one or more high-resolution images associated with each of the one or more documents; initiate one or more image bottleneck ensembles (IBE) algorithms on the one or more high-resolution images; extract, using the one or more IBE algorithms, one or more features associated with the one or more high resolution images; and store the one or more features extracted from the one or more high resolution images in a feature repository
AUTOMATED CONTEXTUAL PROCESSING OF UNSTRUCTURED DATA
A system for providing automated and domain specific contextual processing for context based verification may classify a plurality of extracted parameters from a set of digitized training document to assign a document similarity score with respect to a set of reference documents. The system may automatically detect a domain for the set of digitized training documents based on the document similarity score. The system may load a domain based neural model for the detected domain to generate a plurality of pre-defined contextual parameters. The system may receive a set of input documents and perform a contextual processing of the received set of documents based on the pre-defined contextual parameters to obtain an output in form of a plurality of filtered snippets, each bearing a corresponding rank. The context based verification may be performed based on the plurality of filtered snippets and the corresponding rank.
METHOD AND APPARATUS OF TRAINING IMAGE RECOGNITION MODEL, METHOD AND APPARATUS OF RECOGNIZING IMAGE, AND ELECTRONIC DEVICE
The present application provides a method and an apparatus of training an image recognition model, a method and an apparatus of recognizing an image, and an electronic device, which relates to a field of an image processing technology, and in particular to artificial intelligence and computer vision technology. A specific implementation scheme of the present disclosure includes: determining a training sample set including a plurality of sample pictures and a text label for each sample picture; extracting an image feature of each sample picture and a semantic feature of each sample picture based on a feature extraction network of a basic image recognition model; and training the basic image recognition model based on the extracted image feature of each sample picture, the extracted semantic feature of each sample picture, the text label for each sample picture, a predetermined image classification loss function, and a predetermined semantic classification loss function.
SYSTEMS FOR AUTOMATED LESION DETECTION AND RELATED METHODS
Example systems and methods for lesion detection are described herein. An example system includes at least one processor and a memory operably coupled to the at least one processor. The system also includes a candidate selection module configured to receive an image, determine a plurality of candidate points in the image, and select a respective volumetric region centered by each of the candidate points. A portion of a lesion has a high probability of being determined as a candidate point. The system further includes a deep learning network configured to receive the respective volumetric regions selected by the candidate selection module, and determine a respective probability of each respective volumetric region to contain the lesion. Additionally, example methods for training a deep learning network to detect lesions are described herein.
IDENTIFYING HANDWRITTEN SIGNATURES IN DIGITAL IMAGES USING OCR RESIDUES
Technologies are described for automatically identifying handwritten signatures within digital images using OCR residues. For example, a digital image of a scanned document is received. The scanned document comprises typewritten content and handwritten content. Optical character recognition (OCR) is performed on the digital image to identify typewritten text within the digital image. Pixel areas containing the identified typewritten text are removed from the digital image. Density-based clustering is performed on the digital image to cluster remaining pixel data and generate candidate segments. The candidate segments are then processed using a trained image classifier to determine if they contain handwritten signatures.
Machine learning based models for object recognition
Machine learning based models recognize objects in images. Specific features of the object are extracted from the image using machine learning based models. The specific features extracted from the image assist deep learning based models in identifying subtypes of a type of object. The system recognizes the objects and collections of objects and determines whether the arrangement of objects violates any predetermined policies. For example, a policy may specify relative positions of different types of objects, height above ground at which certain types of objects are placed, or an expected number of certain types of objects in a collection.
Automated systems and methods for identifying fields and regions of interest within a document image
Systems and methods are configured to extract text from images (e.g., document images) utilizing a combination of optical character recognition processes and neural network-based analysis of various images to train a machine-learning object recognition model that is configured to identify text within images based on object-comparisons between known and unknown text within images. In certain embodiments, identified text within images may be utilized to identify corresponding regions-of-interest for extraction of image data encompassing a portion of an image that may be indexed based at least in part on text identified as corresponding to the particular region-of-interest.