G06V30/1983

UPLOAD MANAGEMENT

Aspects of the present disclosure relate to managing the upload of media items to cloud storage. A model can be configured to distinguish between confidential and non-confidential media items. A media item to be uploaded to a cloud storage can be analyzed using the model to determine whether the media item is confidential. In response to determining that the media item is confidential, an upload control action can be issued on the media item.

Automated control of display devices

Systems and methods are provided for analyzing images or video using computer vision. Data comprising real time or near real time information or historical information is retrieved that is associated with a sporting event at a physical location. A time segment is identified of a display device at the physical location for acquisition. The display device is configurable to present visual sponsorship data during the time segment for an assigned sponsor. It is determined that one or more rules are satisfied by the data. An indication is transmitted that the first rule is satisfied to a computing device of a sponsor. A bid or valuation is generated based at least on the first rule being satisfied. A request to acquire the time segment is received from the computing device of the sponsor, and the display device at the physical location is caused to present visual sponsorship data for the sponsor during the time segment.

IMAGE-PROCESSING DEVICE, IMAGE-ROCESSING METHOD, AND STORAGE MEDIUM ON WHICH PROGRAM IS STORED

An image-processing device includes: a reliability calculation unit configured to calculate reliability of a character recognition result on a document image which is a character recognition target on the basis of a feature amount of a character string of a specific item included in the document image; and an output destination selection unit configured to select an output destination of the character recognition result in accordance with the reliability.

Image processing apparatus
10949662 · 2021-03-16 · ·

There is provided an image processing apparatus including a processor that acquires document image data generated by reading the document and recognizes a character string included in the document image data by character recognition and a storage that saves the document image data, in which the processor compares a folder name of an existing folder in the storage with the character string included in the document image data to select a folder in which at least a part of the folder name matches the character string included in the document image data, as a folder of a save destination of the document image data.

Methods and devices for quantifying text similarity
10929710 · 2021-02-23 · ·

Disclosed herein are computer-implemented methods; computer-implemented systems; and non-transitory, computer-readable media, for quantifying text similarity. One computer-implemented method includes obtaining a plurality of shortest operation paths including one or more edit pairs for correcting an optical correction recognition (OCR) text string with an edit text string, where each of the one or more edit pairs denotes an operation performable to a character of the OCR text string during correction by the edit text string. A plurality of similarity scores is determined, each corresponding to one of the plurality of shortest operation paths and determined by summing historical similarity scores of the one or more edit pairs of each of the plurality of shortest operation paths. A minimum one of the plurality of similarity scores is selected to quantify text similarity between the OCR text string and the edit text string.

NEURAL NETWORKS FOR MULTI-LABEL CLASSIFICATION OF SEQUENTIAL DATA
20210081766 · 2021-03-18 ·

Described techniques for multi-label classification, in which sequential data includes characters that have two or more aspects that require classification, are capable of providing separate classifications for different categories of components. Using an appropriately-trained neural network, the described techniques perform aligning and otherwise combining two or more classifications (e.g., categories, or types of labels) to obtain multi-label characters.

IDENTIFICATION OF CANDIDATE REGIONS IN IMAGES FOR PREDEFINED OBJECT PLACEMENT

According to examples, an apparatus may include a processor and a memory on which are stored machine-readable instructions that when executed by the processor, may cause the processor to receive an image and identify contents in the received image. The processor may identify candidate regions on the image at which a predefined object is placeable. In some examples, the processor may assign scores to the identified candidate regions based on relative positions of the identified candidate regions to respective ones of the identified contents in the image. Based on the assigned scores, the processor may select a candidate region among the identified candidate regions at which the predefined object is to be placed. The processor may determine a size and a position of the predefined object based on the selected candidate region, and may output the determined size and the position of the predefined object on the image.

AUTOMATED CLASSIFICATION AND INTERPRETATION OF LIFE SCIENCE DOCUMENTS
20210034855 · 2021-02-04 ·

A computer-implemented tool for automated classification and interpretation of documents, such as life science documents supporting clinical trials, is configured to perform a combination of raw text, document construct, and image analyses to enhance classification accuracy by enabling a more comprehensive machine-based understanding of document content. The combination of analyses provides context for classification by leveraging relative spatial relationships among text and image elements, identifying characteristics and formatting of elements, and extracting additional metadata from the documents as compared to conventional automated classification tools.

MULTI-WORD PHRASE BASED ANALYSIS OF ELECTRONIC DOCUMENTS
20210027021 · 2021-01-28 ·

A document processing system is configured to identify, for each accessed electronic document in a first set of multiple electronic documents, a set of identified multi-word phrases determined to be in ordered text information in the accessed electronic document, each multi-word phrase of the set of identified multi-word phrases including adjacent words in the ordered text information; and determine, for each accessed electronic document in the first set of multiple electronic documents, a selected document type from the first set of document types based at least on an analysis of the set of identified multi-word phrases with respect to multi-word-phrase characteristics identified by a first definition and associated with each document type in a first set of document types associated with a first document-set type.

OPTICAL CHARACTER RECOGNITION OF DOCUMENTS HAVING NON-COPLANAR REGIONS
20210027087 · 2021-01-28 ·

Systems and methods for performing OCR of an image depicting text symbols and imaging a document having a plurality of planar regions are disclosed. An example method comprises: receiving a first image of a document having a plurality of planar regions and one or more second images of the document; identifying a plurality of coordinate transformations corresponding to each of the planar regions of the first image of the document; identifying, using the plurality of coordinate transformations, a cluster of symbol sequences of the text in the first image and in the one or more second images; and producing a resulting OCR text comprising a median symbol sequence for the cluster of symbol sequences.