Patent classifications
G06V30/19107
AUTOMATICALLY DISCOVERING DATA TRENDS USING ANONYMIZED DATA
A computer-implemented method of executing a programmed spend management computer system. The computer system comprises a data pre-processor that is communicatively coupled to a plurality of the application instances and accesses historic transaction data from any of the instances and thereby has access to a large community of data across all tenants. The data pre-processor is programmed to normalize transaction descriptions and determine line spend values, unit price values, quantity values, and buyer country data for a plurality of commodities, and to store the data in item sets in digital storage. A statistical processor is coupled to the digital storage to access the item sets and executes statistical calculation on the item sets to generate pricing insight data. Pricing insights and/or prescriptions are generated automatically under stored program control and provided to a presentation processor for output to and/or rendering to an end-user device.
ARTICLE TOPIC ALIGNMENT
A method including: analyzing, by a computing device, a plurality of portions of a document; determining, by the computing device and based on the analyzing, a concept of each of the portions of the document; comparing, by the computing device, a title of the document with the concept of each of the portions of the document; determining, by the computing device and based on the comparing, an alignment of the concept of each of the portions of the document with the title; generating, by the computing device and based on the alignment, a propensity score for each of the portions of the document; and reordering, by the computing device and based on the propensity scores, the portions of the document from most aligned with the title to least aligned with the title.
Document information extraction for computer manipulation
Systems and apparatuses are disclosed for extracting information from document images. An example method includes segmenting a document image into multiple segments and determining formatting information for each segment. Determining formatting information for a segment includes determining one or more features of the segment and comparing the one or more features of the segment to one or more clusters of features associated with different document types. The formatting information for the segment is based on the comparison. The method also includes, for each segment, storing the formatting information in a data structure associated with the segment. The method further includes, for each segment including text to be identified during information extraction, applying OCR to the segment to generate machine-encoded text and storing the machine-encoded text in the associated data structure.
INFORMATION EXTRACTION SYSTEM AND NON-TRANSITORY COMPUTER READABLE RECORDING MEDIUM STORING INFORMATION EXTRACTION PROGRAM
An information extraction system divides learning data items into main clusters by performing clustering on a set of the learning data items for use in generation of clustering models that are information extraction models for extracting information from invoice data and generates the different information extraction models for the different main clusters by performing learning using the learning data items for the individual main clusters.
INFORMATION PROCESSING DEVICE, INFORMATIONPROCESSING METHOD, AND NON-TRANSITORYCOMPUTER READABLE STORAGE MEDIUM
The information processing device acquires a probability image representing a probability of an existence of a character in each of pixels included in a target image including a plurality of characters based on the target image, estimates positions of respective character images included in the target image based on the acquired probability image, classifies the plurality of character images into a plurality of groups based on the estimated positions, acquires a plurality of recognition target images which is generated so as to correspond to the plurality of groups, and includes the plurality of character images respectively belonging to the corresponding groups, and recognizes the plurality of characters from each of the recognition target images.
INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND NON-TRANSITORY COMPUTER READABLE STORAGE MEDIUM
The information processing device obtains a character string image which includes a plurality of characters, and which includes the characters arranged in an arrangement direction, obtains a probability image representing a probability of an existence of a character in each of pixels included in the character string image, obtains a plurality of character regions in which the characters are estimated to respectively exist in the character string image based on the probability image, obtains an additional character region which is located in the character string image, and which does not overlap the plurality of character regions based on a determination result on whether or not a pixel of a non-background color exists in a direction perpendicular to the arrangement direction at every position in the arrangement direction in the character string image, and recognizes the plurality of characters from the character regions and the additional character region.
Systems and methods to process electronic images to provide image-based cell group targeting
Systems and methods are disclosed for grouping cells in a slide image that share a similar target, comprising receiving a digital pathology image corresponding to a tissue specimen, applying a trained machine learning system to the digital pathology image, the trained machine learning system being trained to predict at least one target difference across the tissue specimen, and determining, using the trained machine learning system, one or more predicted clusters, each of the predicted clusters corresponding to a subportion of the tissue specimen associated with a target.
Systems and methods to process electronic images to provide image-based cell group targeting
Systems and methods are disclosed for grouping cells in a slide image that share a similar target, comprising receiving a digital pathology image corresponding to a tissue specimen, applying a trained machine learning system to the digital pathology image, the trained machine learning system being trained to predict at least one target difference across the tissue specimen, and determining, using the trained machine learning system, one or more predicted clusters, each of the predicted clusters corresponding to a subportion of the tissue specimen associated with a target.
SYSTEMS AND METHODS TO PROCESS ELECTRONIC IMAGES TO PROVIDE IMAGE-BASED CELL GROUP TARGETING
Systems and methods are disclosed for grouping cells in a slide image that share a similar target, comprising receiving a digital pathology image corresponding to a tissue specimen, applying a trained machine learning system to the digital pathology image, the trained machine learning system being trained to predict at least one target difference across the tissue specimen, and determining, using the trained machine learning system, one or more predicted clusters, each of the predicted clusters corresponding to a subportion of the tissue specimen associated with a target.
SYSTEMS AND METHODS TO PROCESS ELECTRONIC IMAGES TO PROVIDE IMAGE-BASED CELL GROUP TARGETING
Systems and methods are disclosed for grouping cells in a slide image that share a similar target, comprising receiving a digital pathology image corresponding to a tissue specimen, applying a trained machine learning system to the digital pathology image, the trained machine learning system being trained to predict at least one target difference across the tissue specimen, and determining, using the trained machine learning system, one or more predicted clusters, each of the predicted clusters corresponding to a subportion of the tissue specimen associated with a target.