G06V30/1607

TEXT-BASED INFORMATION EXTRACTION FROM IMAGES
20240412543 · 2024-12-12 ·

A method for extracting text information from images includes obtaining an extraction request associated with live data comprising an image; generating, using a prediction model, rotational variant features and rotational invariant features associated with the live data; generating, using the prediction model, text embeddings associated with the rotational variant features using overlapping kernel-based embedding on the live data; generating, using the prediction model, attention values for each pixel in the live data using context attention; applying a trained language model to the text embeddings, attention values, and the live data to generate predictions; and performing extraction actions based on the predictions.

Video text tracking method and electronic device
12190612 · 2025-01-07 · ·

A video text tracking method and an electronic device are disclosed. In the method, a text line region is split into sub-regions, the sub-regions are tracked and then processed, and processed sub-regions are combined into a new text line. The technical solutions provided in this application are not only applicable to a straight-line text scenario or a curved text scenario, but also present a good tracking effect for a deformable text line.

Machine learning enabled document deskewing
12165298 · 2024-12-10 · ·

A method may include determining, based at least on an image of a document, a plurality of text bounding boxes enclosing lines of text present in the document. A machine learning model may be trained to determine, based at least on the coordinates defining the text bounding boxes, the coordinates of a document bounding box enclosing the text bounding boxes. The document bounding box may encapsulate the visual aberrations that are present in the image of the document. As such, one or more transformations may be determined based on the coordinates of the document bounding box. The image of the document may be deskewed by applying the transformations. One or more downstream tasks may be performed based on the deskewed image of the document. Related methods and articles of manufacture are also disclosed.

Mobile document capture assist for optimized text recognition
09697431 · 2017-07-04 · ·

A device and method for providing a visual cue for improved text imaging on a mobile device. The method includes determining a minimum text size for accurate optical character recognition (OCR) of an image captured by the mobile device, receiving an image stream of a printed substrate, and displaying the image stream and a visual cue superimposed onto the image stream, wherein the visual cue is indicative of the minimum text size. The method further includes capturing a digital image of the image stream, wherein the digital image does not include the visual cue. Additionally, the method further includes notifying a user of the mobile device when text displayed within the image stream is at least as large as the minimum text size.

Method and system for correction of an image from a hand-held scanning device

A method for correcting an image acquired by a hand-held scanning device. A binarized image of an acquired image is cropped by removing columns on the left end and on the right end of only first components. A work image is created from the cropped image by replacing in each row of components series of first components smaller than a predetermined distance with series of second components. In the work image, a central line is identified. The identified central line in the work image is used to identify the corresponding central line in the cropped image forming a central line image, and in the central line image, the central text line is straightened.

Information processing apparatus, information processing system, information processing method and storage medium

According to one embodiment, an information processing apparatus includes an image acquisition module, an elevation-angle acquisition module, a character deformation specification module, a character detection dictionary storage, a character detection dictionary selector and a character detector. The elevation-angle acquisition module is configured to acquire an elevation angle of a photographic device assumed when the photographic device has obtained an acquired image. The character deformation specification module is configured to specify how an appearance of the character in the acquired image is deformed, based on the acquired elevation angle.

Method and apparatus of extracting particular information from standard card
09665787 · 2017-05-30 · ·

A method of extracting particular information in a standard card is disclosed herein. The method includes: acquiring a card image of a standard card having particular information to be extracted; identifying an image region containing the particular information in the card image; and extracting and outputting the image region as an independent image. Thus, an image related to the part of the particular information only can be obtained from the standard card conveniently, quickly and accurately, thereby improving the working efficiency. In addition, the present disclosure further provides an apparatus of extracting particular information in a standard card and a method of inputting particular information of a standard card in a mobile terminal.

Image resizing for optical character recognition in portable reading machine

A reading machine that operates in various modes includes image correction processing is described. The reading device pre-processes an image for optical character recognition by receiving the image and determining whether text in the image is too large or small for optical character recognition processing by determining that text height falls outside of a range in which optical character recognition software will recognize text in a digitized image. If necessary the image is resized according to whether the text is too large or too small.

METHOD AND SYSTEM FOR CORRECTION OF AN IMAGE FROM A HAND-HELD SCANNING DEVICE

A method for correcting an image acquired by a hand-held scanning device. A binarized image of an acquired image is cropped by removing columns on the left end and on the right end of only first components. A work image is created from the cropped image by replacing in each row of components series of first components smaller than a predetermined distance with series of second components. In the work image, a central line is identified. The identified central line in the work image is used to identify the corresponding central line in the cropped image forming a central line image, and in the central line image, the central text line is straightened.

INSTALLATION INFORMATION ACQUISITION METHOD, CORRECTION METHOD, PROGRAM, AND INSTALLATION INFORMATION ACQUISITION SYSTEM
20250088614 · 2025-03-13 ·

A projector in an installation information acquisition method is installed in a real space, has a changeable projection direction, and projects a projection image based on a virtual image. The virtual image is an image in a case where an image arranged at a display position in a virtual space is viewed from a virtual installation position. The method includes first acquisition processing for acquiring positional information of three or more first adjustment points in the virtual space, projection processing for projecting, by the projector, an index image onto the real space, second acquisition processing for acquiring angle information of the projection direction with respect to a reference direction in a state where the index image matches three or more second adjustment points respectively corresponding to the three or more first adjustment points, and third acquisition processing for acquiring installation information based on the positional information and the angle information.