Patent classifications
G06V30/19107
Method and system for detecting drift in text streams
Methods and systems disclosed herein may quantify the content and nature of a first stream of text to detect when the typical composition of the first stream of text changes. Quantifying the content and nature of the first stream of text may begin by generating a baseline representation of the content of the first stream of text as represented by a first matrix. Once generated, the first matrix may be used as a control against subsequently received sequences of text. In this regard, a second matrix may be generated from a second sequence of text and compared to the first matrix to determine the differences between the first sequence of text and the second sequence of text. Once a difference is determined, the difference may be compared to a threshold value and, when the difference exceeds the threshold value, an administrator may be notified and corrective action taken.
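The baseline-versus-current comparison described above can be sketched as follows. This is a minimal illustration, not the claimed method: the abstract does not say how the matrices are built or compared, so the bigram-frequency matrix, the Frobenius-norm difference, and the threshold of 0.5 are all assumptions, and every function name is hypothetical.

```python
from collections import Counter
from math import sqrt

def bigram_matrix(texts):
    """Build a normalized bigram-frequency matrix, stored sparsely as a dict.

    Assumption: the abstract does not specify the matrix contents; word
    bigram frequencies are used here purely for illustration.
    """
    counts = Counter()
    for text in texts:
        tokens = text.lower().split()
        counts.update(zip(tokens, tokens[1:]))
    total = sum(counts.values())
    return {pair: n / total for pair, n in counts.items()} if total else {}

def matrix_difference(a, b):
    """Frobenius norm of the element-wise difference (an assumed metric)."""
    keys = set(a) | set(b)
    return sqrt(sum((a.get(k, 0.0) - b.get(k, 0.0)) ** 2 for k in keys))

def drift_detected(baseline, current, threshold):
    """Return True when the difference exceeds the threshold."""
    return matrix_difference(baseline, current) > threshold

# Baseline built from the first sequence of text, then used as a control.
baseline = bigram_matrix(["the order has shipped", "the order has arrived"])
same = bigram_matrix(["the order has shipped", "the order has arrived"])
shifted = bigram_matrix(["click here to win", "click here to claim"])

if drift_detected(baseline, shifted, threshold=0.5):
    print("drift detected: notify administrator")
```

Storing the matrix as a dict keyed by bigram keeps the sketch dependency-free; a dense array would work equally well for a fixed vocabulary.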
METHOD OF IDENTIFYING, RANKING, AND PROCESSING INFORMATION OBTAINED FROM A DOCUMENT
Computer-implemented methods of automatically identifying, ranking, and processing information obtained from a document, and computerized systems and computer program products related thereto. The method involves identifying text clusters and a visual layout structure of at least one part of the document, and ranking the text clusters according to their visual properties to obtain a visual ranking. The method further involves identifying a semantic context of the identified text clusters and ranking the text clusters according to the similarity of the identified semantic context to a given semantic context, to obtain a semantic context ranking. A total ranking of the text clusters is then created from a combination of the two rankings, and one or more text clusters are selected according to their positions in the total ranking and provided to at least one downstream application.
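One way to combine a visual ranking with a semantic context ranking is sketched below. The abstract does not state which visual properties, similarity measure, or combination rule is used, so font size, Jaccard token overlap, and a sum of rank positions are assumptions, and all names and sample data are hypothetical.

```python
def rank_by(items, key):
    """Map each cluster name to its 0-based rank (0 = best), descending by key."""
    ordered = sorted(items, key=key, reverse=True)
    return {item["name"]: pos for pos, item in enumerate(ordered)}

def jaccard(a, b):
    """Token-overlap similarity, standing in for a real semantic measure."""
    a, b = set(a.lower().split()), set(b.lower().split())
    return len(a & b) / len(a | b) if a | b else 0.0

def total_ranking(clusters, target_context):
    """Combine the two rankings by summing rank positions (an assumed rule)."""
    visual = rank_by(clusters, key=lambda c: c["font_size"])
    semantic = rank_by(clusters, key=lambda c: jaccard(c["text"], target_context))
    return sorted((c["name"] for c in clusters),
                  key=lambda name: visual[name] + semantic[name])

# Hypothetical text clusters from an invoice-like document.
clusters = [
    {"name": "header", "text": "ACME Corp", "font_size": 16},
    {"name": "total", "text": "invoice total amount due 120.00", "font_size": 18},
    {"name": "footer", "text": "page 1 of 2", "font_size": 8},
]
ranking = total_ranking(clusters, "total amount due")
selected = ranking[0]  # cluster handed to the downstream application
print(ranking)
```

Summing rank positions treats both rankings as equally important; a weighted sum would be a natural variation if one signal should dominate.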
Information processing system
There is provided an information processing system comprising: an obtaining unit configured to obtain read image data generated by reading an image printed on a sheet by a printing apparatus; and a controller configured to determine an abnormality of the printing apparatus by determining, based on the read image data obtained by the obtaining unit, a type of a stain that appears in the read image data from among a plurality of stain types.
Image analysis of data logs
Systems, methods, and software for analyzing data logs. In one embodiment, a method comprises collecting a plurality of the data logs from log-generating elements, converting the data logs into log images, performing image analysis on a plurality of the log images to extract insights, and generating an output based on the insights.
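The conversion step can be pictured with the sketch below, which maps each log line to one row of grayscale pixel values. The abstract does not describe the encoding, so the byte-per-pixel mapping, the fixed row width, and the function name are assumptions; the image analysis step itself is not shown.

```python
def log_to_image(lines, width=16):
    """Convert log lines into a 2D grid of pixel intensities (0-255).

    Assumption: each UTF-8 byte becomes one pixel, and rows are
    truncated or zero-padded to a fixed width.
    """
    image = []
    for line in lines:
        row = list(line.encode("utf-8")[:width])
        row += [0] * (width - len(row))  # pad short lines with black pixels
        image.append(row)
    return image

image = log_to_image(["OK", "ERROR disk full"], width=4)
print(image)
```

The resulting grid can be passed to any image-analysis routine that accepts a list of rows or, after conversion, an array.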
Product identification assistance techniques in an electronic marketplace application
A system for assisting a user in listing items for sale in an electronic marketplace via an electronic marketplace application is disclosed. A product identification technique for assisting the user in listing an item for sale in the electronic marketplace is determined based on initial user input provided by the user. A prompt to provide additional user input is then displayed to the user in the user interface of the electronic marketplace application, where the additional user input corresponds to the determined product identification technique. A listing for the item is generated based on the additional user input, and the listing is displayed to the user in the user interface of the electronic marketplace application.
METHOD AND SERVER FOR OBTAINING TEXT FROM IMAGE
A method performed by a server may include: obtaining an image including a first text and a second text overlapping the first text; separating a first text region corresponding to the first text from the image; extracting pixels corresponding to the first text from the first text region to obtain an undamaged portion and a damaged portion of the first text; and reconstructing the first text by inpainting the damaged portion of the first text, in which the first text overlaps the second text in the image.
UNIFIED SCENE TEXT DETECTION AND LAYOUT ANALYSIS
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for jointly performing text detection and layout analysis. In one aspect, a method comprises processing an image and a set of object queries to generate an encoded representation of the image and an encoded representation of the set of object queries; processing the encoded representation of the image and the encoded representation of the set of object queries to generate a set of text detection masks; processing the encoded representation of the set of object queries to generate layout relevance measures; processing the encoded representation of the set of object queries to generate textness scores for the text detection masks; generating a text detection output that defines respective areas of the image that include text items; and generating a layout analysis output that defines clusters of respective areas of the image identified by the text detection masks.
Systems, methods, and computer program products for object detection and analysis of an image
Systems, methods, and computer program products for intelligent image analysis using object detection models to identify objects and to locate and detect features in an image are disclosed. The systems, methods, and computer program products include automated learning to identify the location of an object, enabling continuous identification and location of the object in an image during periods when the object may be difficult to recognize or during low-visibility conditions.
FAILURE MODE DISCOVERY FOR MACHINE COMPONENTS
The failure modes of mechanical components may be determined based on text analysis. For example, a word embedding may be determined based on a plurality of text documents that include a plurality of maintenance records characterizing failure of mechanical components. A vector representation for a particular maintenance record may then be determined based on the word embedding. Based on the vector representation, the particular maintenance record may then be identified as belonging to a particular failure mode out of a set of possible failure modes.
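The record-to-failure-mode step can be sketched as below. The abstract does not say how the record vector is formed or how a failure mode is chosen, so averaging word vectors and picking the nearest reference vector by cosine similarity are assumptions. The two-dimensional embedding and the failure-mode vectors are invented toy values, not learned from real maintenance records.

```python
from math import sqrt

# Hypothetical word embedding; a real one would be learned from the
# maintenance-record corpus.
EMBEDDING = {
    "bearing": (0.9, 0.1), "vibration": (0.8, 0.2), "noise": (0.7, 0.3),
    "leak": (0.1, 0.9), "seal": (0.2, 0.8), "oil": (0.3, 0.7),
}
# Hypothetical reference vector for each possible failure mode.
FAILURE_MODES = {"bearing_wear": (1.0, 0.0), "seal_leak": (0.0, 1.0)}

def record_vector(text):
    """Average the vectors of known words; None if no word is known."""
    vecs = [EMBEDDING[w] for w in text.lower().split() if w in EMBEDDING]
    if not vecs:
        return None
    return tuple(sum(component) / len(vecs) for component in zip(*vecs))

def cosine(a, b):
    """Cosine similarity between two vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def classify(text):
    """Assign the record to the failure mode with the most similar vector."""
    vec = record_vector(text)
    if vec is None:
        return None
    return max(FAILURE_MODES, key=lambda mode: cosine(vec, FAILURE_MODES[mode]))

print(classify("excessive vibration and bearing noise"))
print(classify("oil leak at seal"))
```

Words missing from the embedding are ignored, so a record with no known words is left unclassified rather than forced into a mode.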
Document Extraction Template Induction
A method for document extraction includes receiving, from a user device associated with a user, an annotated document that includes one or more fields. Each respective field of the one or more fields of the annotated document is labeled by a respective annotation. The method includes clustering, using a template matching algorithm, the annotated document into a cluster and inducing, using the annotated document, a document template for the cluster. The method includes receiving, from the user device, an unannotated document including the one or more fields. The method includes clustering, using the template matching algorithm, the unannotated document into the cluster and, in response to clustering the unannotated document into the cluster, extracting, using the document template, the one or more fields.
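The cluster, induce, and extract flow can be illustrated with the sketch below. The abstract does not define the template matching algorithm or the template format, so treating a document as "label: value" lines, clustering by the ordered tuple of labels, and storing a field-to-label mapping as the template are assumptions. All names and sample documents are hypothetical.

```python
def parse(doc_lines):
    """Split 'label: value' lines into a label -> value dict."""
    pairs = {}
    for line in doc_lines:
        if ":" in line:
            label, value = line.split(":", 1)
            pairs[label.strip()] = value.strip()
    return pairs

def signature(doc_lines):
    """Cluster key: the ordered labels in the document (assumed matching rule)."""
    return tuple(parse(doc_lines))

templates = {}  # cluster signature -> {field name: line label}

def induce_template(doc_lines, annotations):
    """Learn which label carries each annotated field value."""
    pairs = parse(doc_lines)
    template = {}
    for field, annotated_value in annotations.items():
        for label, value in pairs.items():
            if value == annotated_value:
                template[field] = label
    templates[signature(doc_lines)] = template

def extract(doc_lines):
    """Extract fields from an unannotated document via its cluster's template."""
    template = templates.get(signature(doc_lines))
    if template is None:
        return None  # no cluster matched; no template available
    pairs = parse(doc_lines)
    return {field: pairs[label] for field, label in template.items()}

induce_template(["Invoice No: 123", "Total: 50.00"],
                {"invoice_id": "123", "amount": "50.00"})
fields = extract(["Invoice No: 987", "Total: 12.50"])
print(fields)
```

An exact label match is the simplest possible clustering rule; a tolerant matcher would be needed for documents whose layout varies slightly within a cluster.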