Patent classifications
G06V30/18181
GRAPH MACHINE LEARNING FOR CASE SIMILARITY
Herein is machine learning for anomalous graph detection based on graph embedding, shuffling, comparison, and unsupervised training techniques that can characterize an unfamiliar graph. In an embodiment, a computer obtains many known vectors that respectively represent known graphs. A new vector is generated that represents a new graph that contains multiple vertices. The new vector may contain an arithmetic aggregation of vertex vectors that respectively represent multiple vertices and/or a vector that represents a virtual vertex that is connected to the multiple vertices by respective virtual edges. In the many known vectors, some similar vectors that are similar to the new vector are identified. The new graph is automatically characterized based on a subset of the known graphs that the similar vectors represent.
METHOD AND SYSTEM FOR TABLE STRUCTURE RECOGNITION VIA DEEP SPATIAL ASSOCIATION OF WORDS
State of art techniques that utilize spatial association based Table structure Recognition (TSR) have limitation in selecting minimal but most informative word pairs to generate digital table representation. Embodiments herein provide a method and system for TSR from an table image via deep spatial association of words using optimal number of word pairs, analyzed by a single classifier to determine word association. The optimal number of word pairs are identified by utilizing immediate left neighbors and immediate top neighbors approach followed redundant word pair elimination, thus enabling accurate capture of structural feature of even complex table images via minimal word pairs. The reduced number of word pairs in combination with the single classifier trained to determine the word associations into classes comprising as same cell, same row, same column and unrelated, provides TSR pipeline with reduced computational complexity, consuming less resources still generating more accurate digital representation of complex tables.
Image search method, apparatus, and device
Embodiments of the specification provide an image search method, an apparatus, and a device. The method includes: obtaining an input image associated with an image search, wherein the input image includes a plurality of first text blocks; selecting a to-be-processed image from a target database, wherein the to-be-processed image includes a plurality of second text blocks; and generating a first graph structural feature based on the plurality of first text blocks; generating a second graph structural feature based on the plurality of second text blocks; determining that the first graph structural feature and the second graph structural feature satisfy a condition; and in response to determining that the first graph structural feature and the second graph structural feature satisfy the condition, outputting the to-be-processed image as a search result.
Deep learning method
Provided is a deep learning method including a step of each of at least two or more deep learning machines learning a web traffic by using a hexadecimal; a step of the deep learning machines learning the web traffic by using an incremental learning using a weight; a step of, when the web traffic is received, each of the deep learning machines encoding a character string of the web traffic with UTF-8 hexadecimal; a step of each of the deep learning machines converting the character string into an image and deep learning the image.
TECHNIQUES FOR GRAPH DATA STRUCTURE AUGMENTATION
A computing device may receive a set of user documents. Data may be extracted from the documents to generate a first graph data structure with one or more initial graphs containing key-value pairs. A model may be trained on the first graph data structure to classify the pairs. Until a set of evaluation metrics for the model exceeds a set of deployment thresholds: generating, a set of evaluation metrics may be generated for the model. The set of evaluation metrics may be compared to the set of deployment thresholds. In response to a determination that the set of evaluation metrics are below the set of deployment thresholds: one or more new graphs may be generated from the one or more initial graphs in the first graph data structure to produce a second graph data structure. The first and second graph can be used to train the model.
ROBUST METHOD FOR TRACING LINES OF TABLE
A method for image processing includes obtaining a mask of a stroke from an image and identifying a plurality of cross edges for the stroke based on the mask and a reference line. The plurality of cross edges includes a group of adjacent cross edges that intersect the reference line. The method further includes (a) calculating a first vector based on positions of at least two of the cross edges in the group, (b) expanding the group, based on the first vector, to include cross edges adjacent to the group that do not intersect the reference line, (c) calculating a second vector based on positions of at least two of the cross edges in the expanded group, and (d) expanding the expanded group, based on the second vector, to include a second group of adjacent cross edges nearby the expanded group that do not intersect the reference line.
CONVERSION OF TABULAR FORMAT DATA TO MACHINE READABLE TEXT FOR QA OPERATIONS
A system and method for table conversion including converting a table containing text in tabular form to an image, labeling each text area of the image with a bounding box, determining for each bounding box, a position information, a semantic information, and an image information, reconstructing the image into a graph form having a plurality of nodes, wherein each node represents the bounding box of the text areas of the image, inputting at least two nodes into a trained neural network to determine a relative relationship between the at least two nodes, building a knowledge graph using the relative relationship of the at least two nodes, and translating the knowledge graph into machine readable natural language.
Image Table Extraction Method And Apparatus, Electronic Device, And Storgage Medium
Embodiments of the present disclosure disclose an image table extraction method and apparatus, an electronic device, a storage media, and a training method for a table extraction model, which relate to the field of artificial intelligence technologies and cloud computing technologies, including: acquiring an image to be processed;
generating a table of the image to be processed according to a table extraction model, where the table extraction model is obtained according to a field position feature, an image feature, and a text feature of a sample image; and filling text information of the image to be processed into the table.
METHOD FOR RECOGNIZING VEHICLE LICENSE PLATE, ELECTRONIC DEVICE AND COMPUTER READABLE STORAGE MEDIUM
A method for recognizing vehicle license plate is provided, the method includes: performing vehicle license plate detection on a Nth image frame in a video stream to obtain a first vehicle license plate detection result which is used to indicate a position of the first vehicle license plate in the Nth image frame if the first vehicle license plate is included in the Nth image frame; segmenting a character region and a number region from the first vehicle license plate according to the position of the first vehicle license plate in the Nth image frame and content information of the first vehicle license plate if the first vehicle license plate is included in the Nth image frame; and recognizing the character region and the number region segmented from the first vehicle license plate to obtain a first recognition result of the first vehicle license plate.
CONTENT EXTRACTION BASED ON GRAPH MODELING
Methods and systems are presented for extracting categorizable information from an image using a graph that models data within the image. Upon receiving an image, a data extraction system identifies characters in the image. The data extraction system then generates bounding boxes that enclose adjacent characters that are related to each other in the image. The data extraction system also creates connections between the bounding boxes based on locations of the bounding boxes. A graph is generated based on the bounding boxes and the connections such that the graph can accurately represent the data in the image. The graph is provided to a graph neural network that is configured to analyze the graph and produce an output. The data extraction system may categorize the data in the image based on the output.