Patent classifications
G06V30/1478
Information processing device, information processing method, and non-transitory computer readable storage medium
The information processing device obtains a character string image that includes a plurality of characters arranged in an arrangement direction; obtains a probability image representing, for each pixel included in the character string image, a probability that a character exists; obtains, based on the probability image, a plurality of character regions in which the characters are estimated to respectively exist in the character string image; obtains an additional character region that is located in the character string image and does not overlap the plurality of character regions, based on a determination result on whether or not a pixel of a non-background color exists in a direction perpendicular to the arrangement direction at every position in the arrangement direction in the character string image; and recognizes the plurality of characters from the character regions and the additional character region.
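The additional-region step can be illustrated with a small sketch. It assumes a horizontal arrangement direction, an image given as a list of pixel rows, and character regions expressed as half-open x-intervals; the function names and the interval representation are hypothetical and are not taken from the patent.

```python
def column_has_ink(image, background=0):
    """For each x position, report whether any pixel in that column
    (the direction perpendicular to the text line) is non-background."""
    height = len(image)
    width = len(image[0]) if height else 0
    return [any(image[y][x] != background for y in range(height))
            for x in range(width)]


def ink_runs(has_ink):
    """Group consecutive inked columns into half-open (start, end) intervals."""
    runs, start = [], None
    for x, flag in enumerate(has_ink):
        if flag and start is None:
            start = x
        elif not flag and start is not None:
            runs.append((start, x))
            start = None
    if start is not None:
        runs.append((start, len(has_ink)))
    return runs


def additional_regions(image, known_regions, background=0):
    """Return inked intervals that overlap none of the known character regions."""
    def overlaps(a, b):
        return a[0] < b[1] and b[0] < a[1]

    return [run for run in ink_runs(column_has_ink(image, background))
            if not any(overlaps(run, known) for known in known_regions)]


# Toy image: ink at columns 1-2, 4, and 7-8.
image = [
    [0, 1, 1, 0, 0, 0, 0, 1, 1, 0],
    [0, 1, 0, 0, 1, 0, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
]
# Regions assumed to have been found from the probability image.
known = [(1, 3), (7, 9)]
extra = additional_regions(image, known)
```

In this toy case the narrow stroke at column 4 is missed by the assumed probability-based regions and is recovered as the only additional region.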
INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING SYSTEM, AND NON-TRANSITORY COMPUTER READABLE MEDIUM
An information processing apparatus includes: a first extracting unit that extracts a position of a character entry box in an input image; a recognizing unit that recognizes a character string written in the character entry box; a calculating unit that calculates recognition accuracy of each of characters of the character string recognized by the recognizing unit; a first detector that detects that a value based on the recognition accuracy is equal to or larger than a preset threshold value; a second extracting unit that extracts a position of a circumscribed rectangle for each character of the character string in the input image; a second detector that detects contact of the circumscribed rectangle with the character entry box; and a display that displays the character string to be corrected on the basis of a result of detection by the first detector and a result of detection by the second detector.
IMAGE PROCESSING DEVICE, IMAGE PROCESSING METHOD, AND COMPUTER PROGRAM PRODUCT
According to an embodiment, an image processing device includes a memory, and one or more hardware processors configured to function as a receiving unit, a specifying unit, and a detecting unit. The receiving unit receives input information input to an image. The specifying unit specifies the position of the input information. The detecting unit detects a character string having a smaller distance to the position than another character string, from the image.
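One way to read the detecting unit is as a nearest-neighbour search over character strings already located in the image. The sketch below assumes each string comes with an axis-aligned bounding box (x0, y0, x1, y1) and measures distance from the input position to the box; both choices are assumptions made for illustration, since the abstract does not fix a distance measure.

```python
import math


def distance_to_box(point, box):
    """Euclidean distance from a point to an axis-aligned box (0 if inside)."""
    px, py = point
    x0, y0, x1, y1 = box
    dx = max(x0 - px, 0, px - x1)
    dy = max(y0 - py, 0, py - y1)
    return math.hypot(dx, dy)


def nearest_string(point, strings):
    """Return the text whose bounding box is closest to the given position.

    `strings` is a list of (text, box) pairs."""
    text, _box = min(strings, key=lambda item: distance_to_box(point, item[1]))
    return text


# Hypothetical detected strings and an input position (e.g. a tap or pen mark).
strings = [("Total", (0, 0, 50, 10)), ("Date", (100, 0, 140, 10))]
picked = nearest_string((90, 5), strings)
```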
IMAGE CORRECTION DEVICE
An image correction device includes a line segment detection module, a shape specification module and an image correction module. The line segment detection module detects from a captured image obtained by photographing a document a plurality of line segments that correspond to the notation on the surface of the document. The shape specification module specifies shape approximation lines that approximate the surface shape of the document from the plurality of line segments. The image correction module utilizes the shape approximation lines specified by the shape specification module to correct the captured image.
Text recognition method and terminal device
A text recognition method includes scaling a to-be-recognized image based on a first scale ratio, determining first coordinate information corresponding to a text line area in the scaled to-be-recognized image, determining, based on the first scale ratio, second coordinate information corresponding to the first coordinate information, where the second coordinate information is coordinate information of the text line area in the to-be-recognized image, performing character recognition on text line images corresponding to the second coordinate information by using a recognition model, and determining text line content corresponding to the text line images, where the to-be-recognized image includes the text line images.
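The mapping from the first coordinate information (in the scaled image) back to the second coordinate information (in the original image) might look like the following sketch. It assumes a single uniform scale ratio, boxes given as (x0, y0, x1, y1), and rounding to the nearest pixel; the function name is hypothetical.

```python
def to_original_coords(box, scale_ratio):
    """Map a text-line box found in the scaled image back to the original image.

    Assumes scaled = original * scale_ratio along both axes."""
    if scale_ratio <= 0:
        raise ValueError("scale_ratio must be positive")
    return tuple(round(value / scale_ratio) for value in box)


# A text line detected at half resolution.
scaled_box = (10, 20, 110, 40)
original_box = to_original_coords(scaled_box, 0.5)
```

Detecting on the scaled image and recognizing on the full-resolution crop keeps detection cheap without discarding the detail that character recognition needs.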
Image processing method
An image processing method for a picture of a participant, photographed in an event, such as a marathon race, increases the accuracy of recognition of a race bib number by performing image processing on a detected race bib area, and associates the recognized race bib number with a person included in the picture. This image processing method detects a person from an input image, estimates an area in which a race bib exists based on a face position of the detected person, detects an area including a race bib number from the estimated area, performs image processing on the detected area to thereby perform character recognition of the race bib number from an image subjected to image processing, and associates the result of character recognition with the input image.
SYSTEM AND METHOD FOR IMPORTING SCANNED CONSTRUCTION PROJECT DOCUMENTS
A system and method for efficiently importing scanned construction project documents (e.g., digital images of physical documents) is disclosed. The method includes receiving a digital image of a document and performing a first text recognition operation on a first portion of the digital image. The method includes in response to determining, based on the first text recognition operation, that the first portion does not include machine-readable text, generating a modified image of the document by performing an image modification operation. The image modification operation may include an orientation operation. The method further includes storing the modified image of the document in a database. The image modification operation may also include a de-skewing operation and an alignment operation.
METHOD, APPARATUS, CLIENT TERMINAL, AND SERVER FOR ASSOCIATING VIDEOS WITH E-BOOKS
Method, apparatus, client terminal, and server for associating videos with e-books are provided. The method for associating the video with the e-book includes: identifying at least one first content in the video; and comparing the first content with the second content in the e-book to determine the association relationship between the video and the e-book. The e-book includes at least one second content, and the association relationship includes the association relationship between a video part in the video corresponding to the first content and an e-book part in the e-book corresponding to the second content.
INFORMATION PROCESSING APPARATUS, STORAGE MEDIUM, AND INFORMATION PROCESSING METHOD
A search area is set on a recognition target image, cutout areas are set at a plurality of positions in the search area, images corresponding to the plurality of set cutout areas are extracted, and similarities of candidate characters, obtained by comparison between the extracted images and dictionary data, are weighted in accordance with the positions of the cutout areas. In this manner, evaluation values of the candidate characters are obtained, and the candidate character with the highest evaluation value among the obtained candidate characters is output as a recognition result. Further, a search area relating to a next character is set based on position information about the cutout area corresponding to the recognition result.
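The position-weighted evaluation can be sketched as follows. The abstract does not specify the weighting, so the inverse-distance weight, the `falloff` parameter, and the one-dimensional cutout position are all assumptions chosen for illustration.

```python
def recognize_character(candidates, expected_x, falloff=0.1):
    """Pick the candidate with the highest position-weighted similarity.

    `candidates` is a list of (cutout_x, character, similarity) tuples, where
    similarity comes from comparing the cutout image with dictionary data.
    The weight shrinks as the cutout moves away from the expected position.
    Returns (evaluation_value, character, cutout_x)."""
    best = None
    for cutout_x, character, similarity in candidates:
        weight = 1.0 / (1.0 + falloff * abs(cutout_x - expected_x))
        evaluation = similarity * weight
        if best is None or evaluation > best[0]:
            best = (evaluation, character, cutout_x)
    return best


candidates = [(10, "A", 0.80), (14, "B", 0.85), (30, "C", 0.90)]
value, character, position = recognize_character(candidates, expected_x=10)
```

Here "C" has the highest raw similarity but sits far from the expected position, so "A" wins after weighting. The returned cutout position could then seed the search area for the next character, as the abstract describes.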
DOCUMENT REORIENTATION PROCESSING
Video frames of a document are captured. A current orientation mode of a device having a camera is determined based on the video frames. An optimal orientation mode for capturing a document image is determined. Guided instructions assist in placing the device in the optimal orientation mode, and when the document is centered in a lens of the camera, the document image is taken by the camera.