Patent classifications
G06V30/18181
CONTENT EXTRACTION BASED ON GRAPH MODELING
Methods and systems are presented for extracting categorizable information from an image using a graph that models data within the image. Upon receiving an image, a data extraction system identifies characters in the image. The data extraction system then generates bounding boxes that enclose adjacent characters that are related to each other in the image. The data extraction system also creates connections between the bounding boxes based on locations of the bounding boxes. A graph is generated based on the bounding boxes and the connections such that the graph can accurately represent the data in the image. The graph is provided to a graph neural network that is configured to analyze the graph and produce an output. The data extraction system may categorize the data in the image based on the output.
SYSTEMS AND METHODS FOR MACHINE LEARNING KEY-VALUE EXTRACTION ON DOCUMENTS
A machine learning based key-value extraction model extracts fields/entities from documents. The input images are processed through OCR. A list of words (uni-grams) and their coordinates are extracted from the original images. Following word cleaning and manipulation, n-gram creation (multi-words), and feature engineering, the transformed data is fed into a classification algorithm to predict if a uni-gram or n-gram is one of the target entities or a non-entity. Following the first step that includes unique feature engineering, a second step improves extraction accuracy among the fields/entities.
METHOD AND DEVICE FOR CREATING A MACHINE LEARNING SYSTEM
A method for creating a machine learning system which is designed for segmentation and object detection in images. The method includes: providing a directed graph; selecting a path through the graph, at least one additional node being selected from this subset, a path through the graph from the input node along the edges via the additional node up to the output node being selected; creating a machine learning system as a function of the selected path; and training the machine learning system created.
Character recognition method and apparatus, electronic device and computer readable storage medium
A character recognition method, a character recognition apparatus, an electronic device and a computer readable storage medium are disclosed. The character recognition method includes: determining semantic information and first position information of each individual character recognized from an image; constructing a graph network according to the semantic information and the first position information of each individual character; and determining a character recognition result of the image according to a feature of each individual character calculated by the graph network.
Content extraction based on graph modeling
Methods and systems are presented for extracting categorizable information from an image using a graph that models data within the image. Upon receiving an image, a data extraction system identifies characters in the image. The data extraction system then generates bounding boxes that enclose adjacent characters that are related to each other in the image. The data extraction system also creates connections between the bounding boxes based on locations of the bounding boxes. A graph is generated based on the bounding boxes and the connections such that the graph can accurately represent the data in the image. The graph is provided to a graph neural network that is configured to analyze the graph and produce an output. The data extraction system may categorize the data in the image based on the output.
Systems and methods for context-aware text extraction
Systems and methods are provided to perform context-aware text extraction.
DIGITAL FORENSIC APPARATUS FOR SEARCHING RECOVERY TARGET AREA FOR LARGE-CAPACITY VIDEO EVIDENCE USING TIME MAP AND METHOD OF OPERATING THE SAME
The present disclosure relates to technology for automatically searching and recovering the recovery area of frames corresponding to a desired time for large-capacity video evidence using a time map generated through an optical character recognition (OCR) function. A digital forensic apparatus for searching and recovering a recovery target area for large-capacity video evidence using a time map according to an embodiment of the present disclosure may include a division recovery device for collecting video evidence from a storage device, dividing the collected video evidence into a plurality of spaces in consideration of the physical space of the storage device, and recovering a representative frame in each of the divided spaces; a time information recognizer for recognizing time information from the recovered representative frame using an optical character recognition (OCR) function; a time map generator for generating a time map in which the divided spaces are arranged according to a time criterion based on the recognized time information; and a selective recovery device for searching a recovery target area by matching specific time information input by a user with the generated time map and recovering the searched recovery target area.
METHOD, APPARATUS, DEVICE, STORAGE MEDIUM AND PROGRAM PRODUCT OF PERFORMING TEXT MATCHING
A method, an apparatus, a device, a storage medium and a program product of performing a text matching are provided, which relate to a field of a computer technology, and in particular to natural language processing and deep learning technologies. The method includes: determining a word set and a plurality of semantic units from a text set, the word set is associated with a first predetermined attribute, and the text set contains a plurality of first texts indicating an object information and a plurality of second texts indicating an object demand information; generating a graph; and generating a final feature representation associated with the text set and the word set based on the graph and a graph convolution model, so as to perform the text matching.
IMAGE SEARCH METHOD, APPARATUS, AND DEVICE
Embodiments of the specification provide an image search method, an apparatus, and a device. The method includes: obtaining an input image associated with an image search, wherein the input image includes a plurality of first text blocks; selecting a to-be-processed image from a target database, wherein the to-be-processed image includes a plurality of second text blocks; and generating a first graph structural feature based on the plurality of first text blocks; generating a second graph structural feature based on the plurality of second text blocks; determining that the first graph structural feature and the second graph structural feature satisfy a condition; and in response to determining that the first graph structural feature and the second graph structural feature satisfy the condition, outputting the to-be-processed image as a search result.
System and method for graph search enhancement
A method, computer program product, and computer system for analyzing an image to detect a plurality of geometric shapes in the image. The method may also include building a graph data structure resembling the image based upon, at least in part, analyzing the image. In some embodiments, building the graph data structure may include traversing the image to generate one or more graph data structure clauses.