G06V30/19013

CHARACTER RECOGNITION METHOD, MODEL TRAINING METHOD, RELATED APPARATUS AND ELECTRONIC DEVICE

A character recognition method, a model training method, a related apparatus and an electronic device are provided. The specific solution is: obtaining a target picture; performing feature encoding on the target picture to obtain a visual feature of the target picture; performing feature mapping on the visual feature to obtain a first target feature of the target picture, where the first target feature is a feature that has a matching space with a feature of character semantic information of the target picture; inputting the first target feature into a character recognition model for character recognition to obtain a first character recognition result of the target picture.

Medical image processing apparatus and medical observation system

A medical image processing apparatus includes: a superimposed image generation unit configured to generate a superimposed image by superimposing a subject image and a fluorescent image in areas corresponding to each other; a determination unit configured to determine whether or not at least one of a subject and an observation device moves between a timing before the timing at which one of the subject image and the fluorescent image is captured and that capture timing; and a superimposition controller configured to cause the superimposed image generation unit to prohibit superimposition in an area of at least a part of the subject image and the fluorescent image when the determination unit determines that at least one of the subject and the observation device moves.

METHOD AND SYSTEM FOR CURATING A VIRTUAL MODEL FOR FEATURE IDENTIFICATION

Computer-implemented methods and systems for curating virtual models and populating overlays within a virtual environment are described herein. A server may receive a data request from a user electronic device. The data request may comprise a property of interest located at a particular portion of an overall region. The server may then dynamically acquire a virtual model for rendering the property within a virtual environment at the user electronic device based on the data request. The server may then curate the virtual model in accordance with rules that emphasize features of the property that are relevant to assessing risks associated with the property. The server may then identify the curated property modeled by the virtual model, obtain annotation records associated with the features of the property, and populate an annotations overlay rendered in the virtual environment with information included in the annotation records.

INFORMATION PROCESSING APPARATUS, NON-TRANSITORY COMPUTER READABLE MEDIUM, AND INFORMATION PROCESSING METHOD

An information processing apparatus includes a processor configured to: separate a header part and a body part from a read image obtained by reading a facsimile document which is a document received by facsimile; and switch preprocessing in accordance with a header recognition result which is a recognition result obtained through character recognition on the header part, the preprocessing being performed before character recognition on the body part.
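The header-driven switch described above can be sketched as follows. This is only an illustrative sketch: the image representation (rows of grayscale values), the fixed header height, the `RULES` table, the keyword "ACME CLINIC", and the `binarize` step are all hypothetical, since the abstract does not say what the header contains or which preprocessing steps are available. The character recognition itself is assumed to happen elsewhere and is represented here only by its text result.

```python
from typing import Callable

Image = list[list[int]]  # grayscale rows, values 0-255 (assumed representation)


def split_header_body(image: Image, header_rows: int) -> tuple[Image, Image]:
    """Separate the top `header_rows` rows (header part) from the rest (body part)."""
    return image[:header_rows], image[header_rows:]


def binarize(image: Image, threshold: int = 128) -> Image:
    """Example preprocessing step: map every pixel to pure black or white."""
    return [[255 if px >= threshold else 0 for px in row] for row in image]


def no_op(image: Image) -> Image:
    """Fallback preprocessing: leave the body image unchanged."""
    return image


# Hypothetical rule table: keyword found in the recognized header -> preprocessing.
RULES: dict[str, Callable[[Image], Image]] = {"ACME CLINIC": binarize}


def select_preprocessing(header_text: str) -> Callable[[Image], Image]:
    """Pick the body preprocessing from the header recognition result."""
    upper = header_text.upper()
    for keyword, step in RULES.items():
        if keyword in upper:
            return step
    return no_op
```

A caller would recognize the header part first, pass the resulting text to `select_preprocessing`, and apply the returned step to the body part before running character recognition on it.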

Methods, systems, apparatus and articles of manufacture for receipt decoding

Methods, apparatus, systems and articles of manufacture are disclosed for receipt decoding. An example apparatus includes processor circuitry to execute instructions to extract text from a receipt image, the text including bounding boxes; associate ones of the bounding boxes to link horizontally related fields of the receipt image by selecting a first bounding box; identifying first horizontally aligned bounding boxes, the first horizontally aligned bounding boxes to include at least one bounding box of the bounding boxes that is horizontally aligned relative to the first bounding box; adding the first horizontally aligned bounding boxes to a word sync list; and connecting ones of the first horizontally aligned bounding boxes and the first bounding box based on at least one of an amount of the first horizontally aligned bounding boxes in the word sync list and a relationship among the first horizontally aligned bounding boxes and the first bounding box.
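The linking of horizontally related fields can be illustrated with a minimal sketch. The `Box` type, the alignment test (vertical centers within a fraction `tol` of the first box's height), and the left-to-right connection order are assumptions made for illustration; the abstract does not specify how alignment is measured or how the connection decision is made.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    text: str
    x: float  # left edge
    y: float  # top edge
    w: float  # width
    h: float  # height

    @property
    def y_center(self) -> float:
        return self.y + self.h / 2


def horizontally_aligned(first: Box, boxes: list[Box], tol: float = 0.5) -> list[Box]:
    """Boxes whose vertical center lies within tol * first.h of the first box's center."""
    limit = tol * first.h
    return [
        b for b in boxes
        if b is not first and abs(b.y_center - first.y_center) <= limit
    ]


def link_row(first: Box, boxes: list[Box], tol: float = 0.5) -> list[Box]:
    """Collect aligned boxes into a word sync list and connect them left to right."""
    word_sync_list = horizontally_aligned(first, boxes, tol)
    # Assumed connection rule: order the first box and its aligned boxes by x.
    return sorted([first, *word_sync_list], key=lambda b: b.x)


boxes = [
    Box("MILK", 10, 100, 40, 12),
    Box("2.99", 200, 102, 30, 12),
    Box("BREAD", 10, 130, 50, 12),
]
row = link_row(boxes[0], boxes)
print([b.text for b in row])  # the item name followed by its price
```

A tolerance relative to box height is one plausible way to absorb slight skew in a photographed receipt; a production system would likely also consider the count of candidates and their spatial relationships, as the abstract indicates.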

Method, system, and medium for managing suspended customer transactions in a retail environment

A network node associated with a retail store receives digital images from customers who have initiated a customer transaction at a first location. The digital images are unique and are associated with the transaction. The transaction is then temporarily suspended. To resume the transaction, the customer provides information describing features of the previously provided digital image. The network node uses this information to locate the corresponding digital image, and thus, the corresponding suspended transaction, and to authenticate the customer to resume the transaction. Provided the customer is authenticated, the node resumes the suspended transaction.

SYSTEMS AND METHODS FOR LOW COMPUTE DEPTH MAP GENERATION

Systems and methods are provided for performing low compute depth map generation by implementing acts of obtaining a stereo pair of images of a scene, downsampling the stereo pair of images, generating a depth map by stereo matching the downsampled stereo pair of images, and generating an upsampled depth map based on the depth map using an edge-preserving filter for obtaining at least some data of at least one image of the stereo pair of images.
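The downsample, match, and upsample sequence can be sketched in pure Python on small grayscale images. Several choices here are assumptions, not taken from the abstract: matching uses a single-pixel absolute difference along each row (practical systems typically compare windows), the output is a disparity map standing in for depth, and the edge-preserving step is a joint-bilateral-style upsampling guided by the full-resolution left image.

```python
import math


def downsample(img, f=2):
    """Keep every f-th pixel in both directions."""
    return [row[::f] for row in img[::f]]


def disparity_map(left, right, max_disp=4):
    """Per-pixel disparity via absolute-difference matching along each row."""
    h, w = len(left), len(left[0])
    disp = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            best_cost, best_d = None, 0
            for d in range(min(max_disp, x) + 1):
                cost = abs(left[y][x] - right[y][x - d])
                if best_cost is None or cost < best_cost:
                    best_cost, best_d = cost, d
            disp[y][x] = best_d
    return disp


def upsample_edge_preserving(disp, guide, f=2, sigma=10.0):
    """Upsample disp to the guide's size, weighting low-res neighbors by
    how similar their guide intensity is to the target pixel's intensity."""
    H, W = len(guide), len(guide[0])
    h, w = len(disp), len(disp[0])
    out = [[0.0] * W for _ in range(H)]
    for Y in range(H):
        for X in range(W):
            cy, cx = min(Y // f, h - 1), min(X // f, w - 1)
            num = den = 0.0
            for ny in range(max(cy - 1, 0), min(cy + 2, h)):
                for nx in range(max(cx - 1, 0), min(cx + 2, w)):
                    gy, gx = min(ny * f, H - 1), min(nx * f, W - 1)
                    diff = guide[Y][X] - guide[gy][gx]
                    wgt = math.exp(-(diff * diff) / (2 * sigma * sigma))
                    num += wgt * disp[ny][nx] * f  # rescale to full-res pixels
                    den += wgt
            out[Y][X] = num / den if den > 0 else float(disp[cy][cx] * f)
    return out


def low_compute_disparity(left, right, f=2, max_disp=4):
    """Downsample the pair, match at low resolution, then upsample the result."""
    small = disparity_map(downsample(left, f), downsample(right, f), max_disp)
    return upsample_edge_preserving(small, left, f)
```

Matching at reduced resolution cuts the search cost roughly by the square of the downsampling factor per image and shortens the disparity range, which is the "low compute" motivation; the guided upsampling then tries to restore sharp boundaries using the full-resolution image.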

MARKING INSPECTION DEVICE, MARKING INSPECTION METHOD AND ARTICLE INSPECTION APPARATUS
20210342618 · 2021-11-04

A marking region image is obtained by cutting out the part corresponding to a marking region from an article image obtained by imaging an article to be inspected. Then, whether or not the marking is properly provided is determined by performing a character recognition of a marking part for a marking region image. Further, an image of an article having no marking and no defect is stored as a reference image, whereas a marking periphery image obtained by removing the image of the marking part from the marking region image is compared to the reference image. By that comparison, whether or not any defect is included in the marking peripheral part of the marking region except the marking part is determined.

TEXT DETECTION, CARET TRACKING, AND ACTIVE ELEMENT DETECTION
20210342622 · 2021-11-04

Detection of typed and/or pasted text, caret tracking, and active element detection for a computing system are disclosed. The location on the screen associated with a computing system where the user has been typing or pasting text, potentially including hot keys or other keys that do not cause visible characters to appear, can be identified and the physical position on the screen where typing or pasting occurred can be provided based on the current resolution of where one or more characters appeared, where the cursor was blinking, or both. This can be done by identifying locations on the screen where changes occurred and performing text recognition and/or caret detection on these locations. The physical position of the typing or pasting activity allows determination of an active or focused element in an application displayed on the screen.
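The first step, identifying where on the screen changes occurred, can be sketched by comparing two consecutive frames. The frame representation (rows of pixel values) and the function name `changed_region` are assumptions for illustration; text recognition and caret detection would then run only on the returned region and are not shown.

```python
def changed_region(prev, curr):
    """Return the bounding box (left, top, right, bottom) of all pixels that
    differ between two equally sized frames, or None if nothing changed."""
    rows = [r for r, (a, b) in enumerate(zip(prev, curr)) if a != b]
    if not rows:
        return None
    cols = [
        c
        for r in rows
        for c, (p, q) in enumerate(zip(prev[r], curr[r]))
        if p != q
    ]
    return (min(cols), min(rows), max(cols), max(rows))
```

A single bounding box is the simplest choice; a system tracking several simultaneous changes (for example, a blinking caret plus a clock widget) would need to split the changed pixels into separate regions.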

Restricting screenshare of web pages to select list of allowed website URLs

In the context of a co-browse session, one of the participants elects to include a screenshare task in which a screenshare of a browser window displaying a website will be provided to other participants of the co-browse session. When the screenshare task is started, a location of an address bar of the web browser is identified, optional pre-processing is applied to the image of the address bar, and a character recognition process is used to determine the characters of the URL in the browser's address bar. The URL is compared with a list of allowed website URLs, and the screenshare session is selectively allowed only if the URL is contained in the list of allowed URLs. Once the URL has been approved, a slice of pixels of the address bar is obtained and monitored for changes to the pixels that may indicate a change to the URL.
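The allow-list comparison and the pixel-slice monitoring can be sketched as below. The normalization rules (default `https` scheme when the address bar hides it, case-insensitive host, trailing slash and query string ignored), the function names, and the byte-string representation of the pixel slice are all assumptions; the abstract only states that the recognized URL is compared with the list and that a pixel slice is watched for changes.

```python
from urllib.parse import urlsplit


def normalize(url_text: str) -> str:
    """Reduce recognized address-bar text to scheme://host/path for comparison."""
    text = url_text.strip()
    if "://" not in text:
        text = "https://" + text  # assumption: the browser hid the scheme
    parts = urlsplit(text)
    host = (parts.hostname or "").lower()
    path = parts.path.rstrip("/")
    return f"{parts.scheme}://{host}{path}"  # query and fragment are dropped


def screenshare_allowed(recognized_url: str, allowed_urls: set[str]) -> bool:
    """Allow the screenshare only if the recognized URL is on the allow list."""
    return normalize(recognized_url) in {normalize(u) for u in allowed_urls}


def address_bar_changed(approved_slice: bytes, current_slice: bytes) -> bool:
    """Any pixel difference in the monitored slice may indicate a new URL."""
    return approved_slice != current_slice
```

When `address_bar_changed` returns `True`, the natural follow-up would be to rerun character recognition on the address bar and call `screenshare_allowed` again before continuing to share.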