G06V30/18086

System and method for text line and text block extraction
12548358 · 2026-02-10

The invention concerns a method implemented by a device for displaying strokes of digital ink in a display area and for performing text line extraction to extract text lines from the strokes. In particular, the text line extraction may involve slicing the display area into strips; ordering, for each strip, the strokes into ordered lists that collectively form a first set of ordered lists; forming, for each strip, a second set of ordered lists by filtering out of the first set's ordered lists the strokes that are below a given size threshold; and performing a neural net analysis based on the first and second sets to determine, for each stroke, the text line to which it belongs.
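The slicing, ordering, and filtering steps described above can be sketched as follows. This is a minimal illustration only, not the patented implementation: the `Stroke` class, the `slice_and_order` function, the use of vertical strips of fixed width, the single top-to-bottom ordering per strip, and the bounding-box size measure are all assumptions made for the example. The neural net analysis is not shown.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Stroke:
    """Hypothetical stroke record: an id plus an axis-aligned bounding box."""
    stroke_id: int
    x_min: float
    y_min: float
    x_max: float
    y_max: float

    @property
    def size(self) -> float:
        # Assumed size measure: the longer side of the bounding box.
        return max(self.x_max - self.x_min, self.y_max - self.y_min)


def slice_and_order(strokes, area_width, strip_width, size_threshold):
    """Return (first_set, second_set), each holding one ordered list per strip.

    first_set:  every stroke overlapping the strip, ordered top to bottom.
    second_set: the same lists with strokes below size_threshold removed.
    """
    n_strips = max(1, math.ceil(area_width / strip_width))
    first_set = []
    for i in range(n_strips):
        left, right = i * strip_width, (i + 1) * strip_width
        # A stroke belongs to every strip that its horizontal extent overlaps.
        in_strip = [s for s in strokes if s.x_max >= left and s.x_min < right]
        first_set.append(sorted(in_strip, key=lambda s: (s.y_min, s.x_min)))
    second_set = [
        [s for s in ordered if s.size >= size_threshold] for ordered in first_set
    ]
    return first_set, second_set
```

Small strokes such as diacritics or punctuation stay in the first set but drop out of the second, so a downstream classifier would see both the full and the filtered neighbourhood of each stroke.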

IMAGE READING SYSTEMS, METHODS AND STORAGE MEDIUM FOR PERFORMING GEOMETRIC EXTRACTION

Geometric extraction is performed on an unstructured document by recognizing textual blocks on at least a portion of a page of the unstructured document, generating bounding boxes that surround and correspond to the textual blocks, determining search paths having coordinates of two endpoints and connecting at least two bounding boxes, and generating a graph representation of the at least a portion of the page, the graph representation including the plurality of textual blocks, the coordinates of the vertices of each bounding box and the coordinates of the two endpoints of each search path.
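One way to picture the graph representation described above is the sketch below. It is an assumption-laden illustration rather than the claimed method: the `TextBlock` class, the `build_graph` function, and the rule used to create search paths (link each block to the nearest block on its right that overlaps it vertically) are hypothetical choices made for the example. Text recognition is assumed to have already produced the blocks.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TextBlock:
    """Hypothetical recognized text block with its bounding box corners."""
    text: str
    x0: float
    y0: float
    x1: float
    y1: float

    def vertices(self):
        # Four corners of the bounding box, clockwise from top-left.
        return [(self.x0, self.y0), (self.x1, self.y0),
                (self.x1, self.y1), (self.x0, self.y1)]


def build_graph(blocks):
    """Build a dict with text blocks as nodes and search paths as edges."""
    nodes = [{"text": b.text, "vertices": b.vertices()} for b in blocks]
    edges = []
    for i, a in enumerate(blocks):
        # Assumed rule: candidates lie to the right of `a` and share a row.
        candidates = [
            (b.x0 - a.x1, j)
            for j, b in enumerate(blocks)
            if j != i and b.x0 >= a.x1 and b.y0 < a.y1 and a.y0 < b.y1
        ]
        if not candidates:
            continue
        _, j = min(candidates)  # nearest candidate by horizontal gap
        b = blocks[j]
        # The search path runs between the facing edge midpoints.
        start = (a.x1, (a.y0 + a.y1) / 2)
        end = (b.x0, (b.y0 + b.y1) / 2)
        edges.append({"from": i, "to": j, "endpoints": (start, end)})
    return {"nodes": nodes, "edges": edges}
```

A label such as "Total" and the value printed to its right would end up connected by one edge, which is the kind of geometric relation a later extraction step could query.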

Method and apparatus employing font size determination for resolution-independent rendered text for electronic documents

A method and apparatus determine font point size in bitmapped text without relying on the accuracy of an optical character recognition (OCR) engine or on heuristics (e.g., assumptions about the amounts of different types of text, such as capital, lowercase, ascending, and descending) to estimate a likely font size. A deep learning model for determining text size is based on extraction of features from existing text to obtain a more general solution.

Automated combobox.select in a non-technology way
12608303 · 2026-04-21

A method includes capturing a first image of a GUI at a first time; after the first time, providing the GUI with an input event to change a configuration of at least one of a plurality of graphical elements to include an expanded region; and capturing a second image of the GUI at a second time after the input event. A background of the second image of the GUI changes from a background of the first image of the GUI to include one or more background text blocks and an expanded region text block. The method also includes obtaining a difference image; determining whether the difference image includes the one or more background text blocks and the expanded region text block; and selecting a text block closest to a position of the at least one of the plurality of graphical elements as the expanded region text block.
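The difference-image and closest-block steps can be illustrated with the sketch below. It rests on several assumptions that are not in the abstract: images are plain grayscale lists of rows, text blocks have already been detected and are given as dicts with a `box` of `(x0, y0, x1, y1)`, and closeness is measured from the block center to the element position. The function names are hypothetical.

```python
import math


def difference_image(first, second):
    """Per-pixel absolute difference of two equally sized grayscale images."""
    return [[abs(a - b) for a, b in zip(r1, r2)] for r1, r2 in zip(first, second)]


def block_in_difference(diff, box, threshold=0):
    """True if any pixel inside the box changed by more than the threshold."""
    x0, y0, x1, y1 = box
    return any(diff[y][x] > threshold for y in range(y0, y1) for x in range(x0, x1))


def select_expanded_block(blocks, diff, element_pos):
    """Pick the changed text block whose center is nearest to element_pos.

    Returns None when no text block appears in the difference image.
    """
    changed = [b for b in blocks if block_in_difference(diff, b["box"])]
    if not changed:
        return None

    def distance(block):
        x0, y0, x1, y1 = block["box"]
        cx, cy = (x0 + x1) / 2, (y0 + y1) / 2
        return math.hypot(cx - element_pos[0], cy - element_pos[1])

    return min(changed, key=distance)
```

Filtering by the difference image first discards text that was already on screen, and the distance rule then separates the expanded-region text from background text that happened to change at the same time.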