G06V30/1908

INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND STORAGE MEDIUM
20220189187 · 2022-06-16 ·

An object is to improve character recognition accuracy of handwritten characters, originally a single continuous character string, described discontinuously. An image area corresponding to a handwritten character is separated from a document image obtained by scanning a document and a character block including characters having the same baseline is extracted. Then, in a case where a plurality of character blocks is extracted from the first image area, a single character block is generated by combining character blocks based on a position relationship of the plurality of character blocks.

IMAGE READING DEVICE
20230252813 · 2023-08-10 ·

According to one embodiment, an image reading device includes an image reading unit, a control unit, and an output unit. The image reading unit reads an image formed on a document to generate read image data. The control unit extracts at least one predetermined area from the read image data to generate extracted image data, compares the extracted image data and reference image data determined for each of the predetermined area, for each of the predetermined area, and generates an aggregated image obtained by aggregating the extracted image data and information indicating a result of the comparison. The output unit outputs the aggregated image.

Vector Object Generation from Raster Objects using Semantic Vectorization
20230154075 · 2023-05-18 · ·

Semantic vectorization techniques are described that support generating and editing of vector objects from raster objects. A raster object, for instance, is received as an input by a semantic vectorization system. The raster object is utilized by the semantic vectorization system to generate a semantic classification for the raster object. The semantic classification identifies semantic objects in the raster image. The semantic vectorization system leverages the semantic classification to generate vector objects. As a result, the vector objects resemble the semantic objects in the raster object.

SYSTEM AND METHOD FOR ZERO-SHOT LEARNING WITH DEEP IMAGE NEURAL NETWORK AND NATURAL LANGUAGE PROCESSING (NLP) FOR OPTICAL CHARACTER RECOGNITION (OCR)
20220284721 · 2022-09-08 · ·

A system and method for constructing a training dataset and training a neural network include obtaining a searchable portable document format (PDF) document, identifying a bounding box defining a region in a background image that is associated with an overlaying text object defined in the PDF document, determining an image crop of the PDF document according to the bounding box, and generating a training data sample for the training dataset, the training data sample comprising a data pair of the image crop and the associated text object.

AUTOMATIC LABELING OF OBJECTS IN SENSOR DATA

Aspects of the disclosure provide for automatically generating labels for sensor data. For instance, first sensor data for a first vehicle may be identified. This first sensor data may have been captured by a first sensor of the vehicle at a first location during a first point in time and may be associated with a first label for an object. Second sensor data for a vehicle may be identified. The second sensor data may have been captured by a second sensor of the vehicle at a second location at a second point in time outside of the first point in time. The second location is different from the first location. The object is a static object may be determined. Based on the determination that the object is a static object, the first label may be used to automatically generate a second label for the second sensor data.

MEDIA MANAGEMENT SYSTEM FOR VIDEO DATA PROCESSING AND ADAPTATION DATA GENERATION

In various embodiments, methods and systems for implementing a media management system, for video data processing and adaptation data generation, are provided. At a high level, a video data processing engine relies on different types of video data properties and additional auxiliary data resources to perform video optical character recognition operations for recognizing characters in video data. In operation, video data is accessed to identify recognized characters. A video OCR operation to perform on the video data for character recognition is determined from video character processing and video auxiliary data processing. Video auxiliary data processing includes processing an auxiliary reference object; the auxiliary reference object is an indirect reference object that is a derived input element used as a factor in determining the recognized characters. The video data is processed based on the video OCR operation and based on processing the video data, at least one recognized character is communicated.

Media management system for video data processing and adaptation data generation

In various embodiments, methods and systems for implementing a media management system, for video data processing and adaptation data generation, are provided. At a high level, a video data processing engine relies on different types of video data properties and additional auxiliary data resources to perform video optical character recognition operations for recognizing characters in video data. In operation, video data is accessed to identify recognized characters. A video OCR operation to perform on the video data for character recognition is determined from video character processing and video auxiliary data processing. Video auxiliary data processing includes processing an auxiliary reference object; the auxiliary reference object is an indirect reference object that is a derived input element used as a factor in determining the recognized characters. The video data is processed based on the video OCR operation and based on processing the video data, at least one recognized character is communicated.

SYSTEM OF DETECTING CHEATING ON AN ONLINE EXAMINATION
20240078824 · 2024-03-07 ·

There is provided a computer system of detecting user cheating in an online test, the system comprising a processing circuitry configured to: obtain capability of performing screen capture of a user computer; perform a screen capture of the user computer; perform at least one of: i) extracting one or more text strings from one or more non-examination application windows of the userscreen image, obtaining at least part of an examination question text that has been presented in an application window on the user computer, and for at least one of the one or more extracted text strings, determining a degree of relevance to the examination question text, and ii) determining, using image analysis techniques, a degree of matching between at least part of the at least one userscreen image, and data associated with a suspicious application.

Information processing apparatus, information processing method, and storage medium
11908215 · 2024-02-20 · ·

An object is to improve character recognition accuracy of handwritten characters, originally a single continuous character string, described discontinuously. An image area corresponding to a handwritten character is separated from a document image obtained by scanning a document and a character block including characters having the same baseline is extracted. Then, in a case where a plurality of character blocks is extracted from the first image area, a single character block is generated by combining character blocks based on a position relationship of the plurality of character blocks.

INFORMATION PROCESSING DEVICE, METHOD, PROGRAM, AND INFORMATION PROCESSING SYSTEM FOR ASSISTING IN EXAMINATION OF IMAGE FOR PRINTING

An information processing device for assisting in examination of an image for printing. The information processing device recognizes a character string and a non-character object included in the image for printing that has been input, determines a plurality of element regions each including the character string recognized, determines whether, in each of the element regions, the recognized character string matches a character string included in document data, determines whether, in each of the element regions, the recognized character string satisfies a predetermined condition, and displays a confirmation screen including the image for printing clearly indicating a region determined not to match the character string included in the document data or a region determined not to satisfy the predetermined condition defined by the regulation data.