G06V30/1983

Multi-word phrase based analysis of electronic documents
12079254 · 2024-09-03

A document processing system is configured to identify, for each accessed electronic document in a first set of multiple electronic documents, a set of identified multi-word phrases determined to be in ordered text information in the accessed electronic document, each multi-word phrase of the set of identified multi-word phrases including adjacent words in the ordered text information; and determine, for each accessed electronic document in the first set of multiple electronic documents, a selected document type from a first set of document types associated with a first document-set type, based at least on an analysis of the set of identified multi-word phrases with respect to multi-word-phrase characteristics identified by a first definition and associated with each document type in the first set of document types.
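
As a rough illustration of the two steps in this abstract (collecting phrases of adjacent words, then choosing a document type from them), the following minimal Python sketch may help. It is not the patented method: the function names, the `definition` mapping, the example phrases, and the simple count-based scoring are all assumptions made for illustration.

```python
from collections import Counter


def adjacent_phrases(text, n=2):
    """Return every run of n adjacent words, in document order."""
    words = text.lower().split()
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]


def select_document_type(text, definition, n=2):
    """Pick the document type whose characteristic phrases occur most often.

    `definition` maps each document type to a set of multi-word phrases.
    Both the mapping and the count-based score are hypothetical.
    """
    counts = Counter(adjacent_phrases(text, n))
    scores = {
        doc_type: sum(counts[phrase] for phrase in phrases)
        for doc_type, phrases in definition.items()
    }
    best = max(scores, key=scores.get)
    # Return None when no characteristic phrase was found at all.
    return best if scores[best] > 0 else None


# Hypothetical definition for a loan-package document set.
definition = {
    "deed of trust": {"deed of", "of trust"},
    "promissory note": {"promissory note", "to pay"},
}

if __name__ == "__main__":
    print(select_document_type(
        "This deed of trust secures the promissory note", definition))
```

A real system would presumably weight phrases and normalize for document length; the raw count is used here only to keep the sketch short.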

Training image-recognition systems based on search queries on online social networks
10083379 · 2018-09-25

In one embodiment, a method includes receiving a plurality of search queries comprising n-grams; identifying a subset of the plurality of search queries as being queries for visual-media items based on one or more n-grams of the search query being associated with visual-media content; calculating, for each of the n-grams of the search queries of the subset, a popularity-score based on a count of the search queries in the subset that include the n-gram; determining popular n-grams, wherein each of the popular n-grams is an n-gram of the search queries of the subset of search queries having a popularity-score greater than a threshold popularity-score; and selecting one or more of the popular n-grams for training a visual-concept recognition system, wherein each of the popular n-grams is selected based on whether it is associated with a visual concept.
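
The counting and thresholding steps described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the helper names, the `visual_terms` set used to decide which queries are visual-media queries, and the choice of unigrams and bigrams are hypothetical, and the final selection by visual concept is left out.

```python
from collections import Counter


def ngrams(query, max_n=2):
    """Return the set of 1..max_n word n-grams in a query."""
    words = query.lower().split()
    return {
        " ".join(words[i:i + n])
        for n in range(1, max_n + 1)
        for i in range(len(words) - n + 1)
    }


def popular_ngrams(queries, visual_terms, threshold, max_n=2):
    """Score n-grams by how many visual-media queries contain them."""
    # Keep only queries with at least one n-gram tied to visual media.
    subset = [q for q in queries if ngrams(q, max_n) & visual_terms]
    counts = Counter()
    for q in subset:
        # ngrams() returns a set, so each query counts once per n-gram.
        counts.update(ngrams(q, max_n))
    # Popular n-grams are those strictly above the threshold score.
    return {gram: c for gram, c in counts.items() if c > threshold}


if __name__ == "__main__":
    queries = ["photos of cats", "cats video", "weather today", "photos of dogs"]
    print(popular_ngrams(queries, {"photos", "video"}, threshold=1))
```

Note that a stop word such as "of" can pass the threshold, which is one reason a later filtering step (here, the association with a visual concept) is needed before using the n-grams as training labels.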

IMAGE SUPPORT FOR COGNITIVE INTELLIGENCE QUERIES
20180268024 · 2018-09-20

A computer-implemented method, a cognitive intelligence system and computer program product adapt a relational database containing image data types. At least one image token in the relational database is converted to a textual form. Text is produced based on relations of tokens in the relational database. A set of word vectors is produced based on the text. A cognitive intelligence query expressed as a structured query language (SQL) query may be applied to the relational database using the set of word vectors. An image token may be converted to textual form by converting the image to a tag, by using a neural network classification model and replacing the image token with a corresponding cluster identifier, by binary comparison or by a user-specified similarity function. An image token may be converted to a plurality of textual forms using more than one conversion method.

CREATION DEVICE, COMPUTER PROGRAM PRODUCT, RECOGNITION SYSTEM, AND CREATION METHOD

According to an embodiment, a creation device creates a code table including a plurality of code words. A recognition device identifies a code word represented by a code image based on a result of character recognition of the code image formed on an object, the code table, and a confusion matrix preliminarily created. The confusion matrix represents probabilities that characters are recognized when the recognition device performs character recognition on an image. The creation device includes a change unit, an evaluation value calculation unit, and a control unit. The change unit changes the code table. The evaluation value calculation unit calculates an evaluation value of the changed code table using the confusion matrix. The control unit causes the change of the code table and the calculation of the evaluation value to be repeated such that the evaluation value becomes small.
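
One way to picture the change, evaluate, and repeat loop is the Python sketch below. The abstract does not say how the evaluation value is computed or how the code table is changed, so everything here is an assumption: the evaluation value is taken as the summed probability of one code word being read as another (characters treated as independent), and the change is a random single-character substitution accepted only when the value drops.

```python
import itertools
import random


def misread_probability(u, v, confusion):
    """Probability that code word u is read as v (independent characters)."""
    p = 1.0
    for a, b in zip(u, v):
        p *= confusion[a][b]
    return p


def evaluate(table, confusion):
    """Assumed evaluation value: total misread probability over word pairs."""
    return sum(
        misread_probability(u, v, confusion)
        for u, v in itertools.permutations(table, 2)
    )


def improve(table, alphabet, confusion, steps=200, seed=0):
    """Repeat change and evaluation, keeping changes that lower the value."""
    rng = random.Random(seed)
    table = list(table)
    best = evaluate(table, confusion)
    for _ in range(steps):
        candidate = list(table)
        i = rng.randrange(len(candidate))
        chars = list(candidate[i])
        chars[rng.randrange(len(chars))] = rng.choice(alphabet)
        new_word = "".join(chars)
        if new_word in candidate:
            continue  # keep the code words distinct
        candidate[i] = new_word
        value = evaluate(candidate, confusion)
        if value < best:
            table, best = candidate, value
    return table, best


# Hypothetical confusion matrix: confusion[a][b] = P(read b | printed a).
confusion = {
    "O": {"O": 0.7, "0": 0.3, "X": 0.0},
    "0": {"O": 0.3, "0": 0.7, "X": 0.0},
    "X": {"O": 0.0, "0": 0.0, "X": 1.0},
}

if __name__ == "__main__":
    print(improve(["OO", "O0"], "O0X", confusion))
```

The greedy acceptance rule is the simplest choice; a search such as simulated annealing could be substituted without changing the evaluation function.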

Generating event definitions based on spatial and relational relationships
10062009 · 2018-08-28

Data from one or more sensors is input to a workflow and fragmented to produce HyperFragments. The HyperFragments of input data are processed by a plurality of Distributed Experts, who make decisions about what is included in the HyperFragments or add details relating to elements included therein, producing tagged HyperFragments, which are maintained as tuples in a Semantic Database. Algorithms are applied to process the HyperFragments to create an event definition corresponding to a specific activity. Based on related activity included in historical data and on ground truth data, the event definition is refined to produce a more accurate event definition. The resulting refined event definition can then be used with the current input data to more accurately detect when the specific activity is being carried out.

METHOD AND SYSTEM FOR PROVIDING ASSISTANCE BY MULTI-FUNCTION DEVICE FOR DOCUMENT PREPARATION
20180232185 · 2018-08-16

The disclosed embodiments illustrate a method and system for providing assistance for document preparation. The method includes processing one or more portions for one or more field names in an electronic document by a multifunction device. The electronic document corresponds to a hand-filled document, which comprises a character string in a first format for a field name. Further, the one or more portions are processed to determine a second format and a location of each character string. A set of information is received in a pre-specified format for the one or more field names from a user-computing device. A field value for each of the processed one or more portions is determined based on a match between the character string and key strings associated with field names. The electronic document is updated by replacing the processed one or more portions with the corresponding determined field value at the location.

Image text analysis for identifying hidden text

Provided are techniques for image text analysis for identifying hidden text. An Optical Character Reader (OCR) is utilized to extract a text string from an image. Context within the image is analyzed. It is determined that the extracted text string is a partial text string based on the context. For a first radius level of a plurality of radius levels, a segmented sub-image is identified around the partial text string within the first radius level, an image search on the segmented sub-image is performed to identify a candidate text string, and, in response to determining that the candidate text string is a complete text string, the complete text string is provided for performing an action.
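
The radius-level loop in this abstract can be sketched as follows. The three callables (`crop`, `image_search`, `is_complete`) are hypothetical stand-ins for the segmentation, image-search, and completeness checks, which the abstract does not specify; only the control flow is illustrated.

```python
def find_complete_text(partial, radius_levels, crop, image_search, is_complete):
    """Widen the search area level by level until a complete string is found.

    crop(partial, radius)  -> sub-image around the partial text (hypothetical)
    image_search(sub_img)  -> candidate text string or None (hypothetical)
    is_complete(candidate) -> True if the candidate is a complete string
    """
    for radius in radius_levels:
        sub_image = crop(partial, radius)
        candidate = image_search(sub_image)
        if candidate is not None and is_complete(candidate):
            return candidate  # hand the complete string to the caller's action
    return None  # no radius level produced a complete text string
```

For example, a caller might pass `radius_levels=[10, 20, 30]` (pixel radii, also an assumption) and act on the first non-`None` result.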

Automated media analysis for sponsor valuation

Systems and methods are provided for analyzing images or video using computer vision. Data comprising real time or near real time information or historical information is retrieved that is associated with a sporting event at a physical location. A time segment of a display device at the physical location is identified for acquisition. The display device is configurable to present visual sponsorship data during the time segment for an assigned sponsor. It is determined that one or more rules, including a first rule, are satisfied by the data. An indication that the first rule is satisfied is transmitted to a computing device of a sponsor. A bid or valuation is generated based at least on the first rule being satisfied. A request to acquire the time segment is received from the computing device of the sponsor, and the display device at the physical location is caused to present visual sponsorship data for the sponsor during the time segment.

APPARATUS AND METHOD FOR PROCESSING CONTENT
20180197094 · 2018-07-12

A content processing apparatus and method are provided, in which content input by a user is processed by determining whether transmission of the content corresponds to an abnormal pattern when a relation between the user and the other party is taken into account, and by automatically adjusting a permissible level by learning whether the content is an abnormal pattern based on the user's response. The content processing apparatus can estimate whether the content corresponds to an abnormal pattern by using a rule-based algorithm or an artificial intelligence (AI) algorithm. When the estimate is made using the AI algorithm, the content processing apparatus can use machine learning, a neural network algorithm, or a deep learning algorithm.

Autonomous control device
09983553 · 2018-05-29

An object of the present invention is to provide an autonomous system that adapts to a dynamically varying external factor, realizes expected operation in a form in which the soundness of the operation can be proved to a third party, and enhances the working ratio in autonomous operation. The autonomous system is provided with a function for dynamically deriving a satisfiable combination of a requirement for the soundness of operation and expected operation on the basis of information about the operating environment acquired via external-world measurement means, a function for generating control logic for realizing the expected operation, a function for recording the control logic, the requirement for sound operation and the expected operation, and a function for presenting the record in a form in which the third party can read the record.