G06V30/19013

GENERATION METHOD, NON-TRANSITORY COMPUTER-READABLE STORAGE MEDIUM FOR STORING GENERATION PROGRAM, AND GENERATION DEVICE
20230048143 · 2023-02-16 · ·

A generation method implemented by a computer, the generation method including: acquiring, by a processor circuit of the computer, read information generated from a reading result that is a document image obtained by imaging a paper document; and generating, by the processor circuit, an electronic document with a signature image that includes the electronic document and the signature image by adding the signature image obtained by imaging a signature written or stamped on the paper document to an electronic document that corresponds to the acquired read information.

CONTINUOUS MACHINE LEARNING METHOD AND SYSTEM FOR INFORMATION EXTRACTION

Methods and systems for artificial intelligence (AI)-assisted document annotation and training of machine learning-based models for document data extraction are described. The methods and systems described herein take advantage of a continuous machine learning approach to create document processing pipelines that provide accurate and efficient data extraction from documents that include structured text, semi-structured text, unstructured text, or any combination thereof.

TEXT DETECTION METHOD, TEXT RECOGNITION METHOD AND APPARATUS

The present disclosure provides a text detection method, a text recognition method and an apparatus, which relate to the field of artificial intelligence technology, in particular to the field of deep learning and computer vision technologies, and can be applied to scenarios such as optical character recognition. The text detection method is: acquiring an image feature of a text strip in a to-be-recognized image; performing visual enhancement processing on the to-be-recognized image to obtain an enhanced feature map of the to-be-recognized image; comparing the image feature of the text strip with the enhanced feature map for similarity to obtain a target bounding box of the text strip on the enhanced feature map.

WINE PRODUCT POSITIONING METHOD, WINE PRODUCT INFORMATION MANAGEMENT METHOD AND APPARATUS, DEVICE, AND STORAGE MEDIUM
20230237825 · 2023-07-27 ·

Disclosed are a wine product positioning method, a wine product information management method and apparatus, a computer device, and a computer-readable storage medium. Based on a preset camera in a wine cellar, a wine product image captured by the preset camera and corresponding to a target wine product is acquired (S21). Based on a preset wine label recognition method combining optical character recognition (OCR) and deep learning recognition, the wine product image is recognized to obtain a wine label corresponding to the wine product image (S22). A preset capture position corresponding to the camera is acquired, and the preset capture position is taken as a current position corresponding to the target wine product (S23). A position corresponding to the target wine product is described by using the wine label and the current position, to position the target wine product (S24).

VISUALIZATION OF THE IMPACT OF TRAINING DATA

An example operation may include one or more of generating a plurality of bounding boxes at a plurality of content areas in an image corresponding to a plurality of pieces of text within the image, converting the plurality of bounding boxes into a plurality of bounding box vectors based on attributes of the plurality of bounding boxes, training a machine learning model to transform a bounding box into a location in vector space based on the plurality of bounding box vectors, and storing the trained machine learning model in memory.

ADJUSTING RESOLUTION OF VIDEO STREAM BASED ON OPTICAL CHARACTER RECOGNITION
20230023431 · 2023-01-26 ·

In one aspect, a first device includes at least one processor and storage accessible to the at least one processor. The storage includes instructions executable by the at least one processor to locally generate first optical character recognition (OCR) data related to at least a first video frame of content. The instructions are also executable to receive, from a second device different from the first device, second OCR data related to at least a second video frame of content. The instructions are then executable to compare the first OCR data to the second OCR data and, responsive to the comparison indicating the first OCR data does not match the second OCR data to within a threshold, take at least one action to adjust the resolution of a video stream such as a video conference's video stream.

Method and system for identifying and determining valuation of currency

A method and system is provided for determining the denomination and related data for a currency item using a personal computing device, such as a mobile phone. The device includes or is connected to an image capture device that is preferably a digital video camera. At least one image of a target currency item is captured then processed for image quality. A further processing of the image includes a coordinate mapping. A comparison is made between individual pixels of the processed image based on the assigned coordinate mapping with a database of reference currency images to determine the currency denomination. Additional processing of the currency image provides the date and other data regarding the target currency item. A market value for the target currency item is identified by reference to a valuation database using the data determined for the currency item.

Method and apparatus for determining information about a drug-containing vessel

Information about a drug-containing vessel is determined by capturing image data of the curved surface of a cylindrical portion of a drug-containing vessel. The image data is unfurled from around the curved surface, binarised, and a template matching algorithm employed to determine that the label information comprises candidate information about the vessel and/or the drug.

Recognition and selection of a discrete pattern within a scene containing multiple patterns

A memory device is provided including instructions that, when executed, cause one or more processors to perform the steps including receiving a plurality of images acquired by a camera, the plurality of images including a plurality of optical patterns, wherein an optical pattern of the plurality of optical patterns encodes an object identifier. The steps include presenting the plurality of images comprising the plurality of optical patterns on a display, and presenting a plurality of visual indications overlying the plurality of optical patterns in the plurality of images. The steps also include identifying a selected optical pattern of the plurality of optical patterns based on a user action and a position of the selected optical pattern in one or more of the plurality of images. The steps also include decoding the selected optical pattern to generate the object identifier and storing the object identifier in a second memory device.

APPARATUS AND METHOD FOR GENERATING A SCHEMA

An apparatus and method for generating a schema, the apparatus comprising at least a processor and a memory communicatively connected to the at least a processor, the memory containing instructions configuring the at least a processor to display, at a graphical control interface, a content field window, receive, as a function of the content field window, a criterion element, and generate a schema as a function of the criterion element.