G06V30/1478

APPARATUS AND METHOD FOR USING BACKGROUND CHANGE TO DETERMINE CONTEXT
20190294909 · 2019-09-26 · ·

Devices and a method are provided for providing feedback to a user. In one implementation, the method comprises obtaining a plurality of images from an image sensor. The image sensor is configured to be positioned for movement with the user's head. The method further comprises monitoring the images, and determining whether relative motion occurs between a first portion of a scene captured in the plurality of images and other portions of the scene captured in the plurality of images. If the first portion of the scene moves less than at least one other portion of the scene, the method comprises obtaining contextual information from the first portion of the scene. The method further comprises providing the feedback to the user based on at least part of the contextual information.

System and method for preprocessing images to improve OCR efficacy
10417516 · 2019-09-17 · ·

A system to preprocess images to increase accuracy of optical character recognition (OCR) includes a processor, and a memory coupled to the processor. The processor is configured to scan an electronically stored representation of a whole or partial document, identify an image in the electronically stored representation, and recognize row-based text within the electronically stored representation. In addition, the processor is configured to align the row-based text vertically, generate a resultant electronically stored representation of the whole or partial document having the row-based text aligned, and save the resultant electronically stored representation for subsequent OCR processing. The electronically stored representation of the whole or partial document may contain at least one image having a JPG, TIF, GIF, PNG, or BMP, type of format.

TEXT IMAGE CORRECTION METHOD AND APPARATUS
20240161523 · 2024-05-16 · ·

A text image correction method and a corresponding text image correction apparatus. Frequency information of a row-direction cumulative curve used by the method is sensitive to an error between a compensation angle for a tilt angle and a real tilt angle, and the method thus has good robustness. The method can accurately estimate the compensation angle for a tilt angle and correct a tilted text image. The method and apparatus can be applied to scenarios such as image pre-processing, automatic compensation for angles of scanned text images, automatic compensation for tilt angles of mobile phone photos.

Information processing apparatus, information processing method, and storage medium
10354162 · 2019-07-16 · ·

A detected quadrilateral area is displayed and no group of candidate lines is displayed in a normal state. While a user is selecting a side that the user desires to change, a group of candidate lines corresponding to the selected side is displayed. Then, whether to replace a position of the selected side with a position of a candidate line is determined based on a movement destination position of the selected side.

EXTRACTING CARD DATA FROM MULTIPLE CARDS

Extracting financial card information with relaxed alignment comprises a method to receive an image of a card, determine one or more edge finder zones in locations of the image, and identify lines in the one or more edge finder zones. The method further identifies one or more quadrilaterals formed by intersections of extrapolations of the identified lines, determines an aspect ratio of the one or more quadrilateral, and compares the determined aspect ratios of the quadrilateral to an expected aspect ratio. The method then identifies a quadrilateral that matches the expected aspect ratio and performs an optical character recognition algorithm on the rectified model. A similar method is performed on multiple cards in an image. The results of the analysis of each of the cards are compared to improve accuracy of the data.

Apparatus and method for using background change to determine context
10339406 · 2019-07-02 · ·

Devices and a method are provided for providing feedback to a user. In one implementation, the method comprises obtaining a plurality of images from an image sensor. The image sensor is configured to be positioned for movement with the user's head. The method further comprises monitoring the images, and determining whether relative motion occurs between a first portion of a scene captured in the plurality of images and other portions of the scene captured in the plurality of images. If the first portion of the scene moves less than at least one other portion of the scene, the method comprises obtaining contextual information from the first portion of the scene. The method further comprises providing the feedback to the user based on at least part of the contextual information.

IMAGE PROCESSING APPARATUS AND IMAGE FORMING APPARATUS
20190197336 · 2019-06-27 · ·

An image processing apparatus includes a character recognition section, a translation section, an image processing section, a selection acceptance section, and a control section. The character recognition section performs character recognition processing on image data. The translation section translates an original text obtained through the character recognition processing performed by the character recognition section into a predetermined language and creates a translated text. The image processing section generates a replaced image in which a text portion of an original image shown in the image data is replaced from the original text by the translated text. The selection acceptance section accepts an instruction of selecting, as an output target, either one or both of the original image shown in the image data and the replaced image. The control section performs, in accordance with the accepted instruction, processing of outputting an output target image selected as the output target.

Imaging terminal, imaging sensor to determine document orientation based on bar code orientation and methods for operating the same

Embodiments of an image reader and/or methods of operating an image reader can capture an image, identify a bar code or IBI form within the captured image, and, store or display the captured image responsive to an orientation of the bar code.

Method and apparatus for image recognition

The present disclosure discloses a method and an apparatus for processing image information. A specific implementation of the method comprises: recognizing each character in an original image and acquiring a position of the each character; matching a character in the original image with a character in a layout structured region of a template image, and recording identical characters or character strings in the original image and the template image as a matching point pair; acquiring a projective transformation matrix between the matching point pairs according to the position of the character in the original image and the position of the character in the layout structured region of the template image; registering the original image according to the projective transformation matrix to acquire a registered image; and recognizing the registered image to acquire a recognition result. This implementation simplifies steps of image matching in character recognition, enhances matching accuracy and universality, and reduces cost of development.

RECORDING MEDIUM RECORDING CHARACTER AREA EXTRACTION PROGRAM, INFORMATION PROCESSING APPARATUS AND CHARACTER AREA EXTRACTION METHOD
20190156135 · 2019-05-23 · ·

A non-transitory computer-readable recording medium recording a character area extraction program for causing a computer to execute a process includes changing a relationship in relative sizes between an image and a scanning window that scans the image; scanning the scanning window based on a changed relationship, specifying a scanning position at which an edge density of an image area included in the scanning window is equal to or larger than a threshold value, extracting one or more areas indicated by the scanning window at the specified scanning position as one or more character area candidates, determining, when overlapped character area candidates included in the one or more character area candidates overlap with each other, a maximum character area candidate having a maximum edge density among the overlapped character area candidates, and extracting the image area included in the maximum character area candidate as a character area.