G06K9/72

TEXT RECOGNITION METHOD AND DEVICE, AND ELECTRONIC DEVICE

A text recognition method includes: acquiring an image including text information, the text information including M characters, M being a positive integer greater than 1; performing text recognition on the image to acquire character information about the M characters; recognizing reading direction information about each character in accordance with the character information about the M characters, the reading direction information being used to indicate a next character corresponding to a current character in a semantic reading order; and ranking the M characters in accordance with the reading direction information about the M characters to acquire a text recognition result of the text information.

Handwriting detector, extractor, and language classifier
11176361 · 2021-11-16 · ·

Disclosed are methods for handwriting recognition. In some aspects, an image representing a page of a sample document is analyzed to identify a region having indications of handwriting. The region is analyzed to determine frequencies of a plurality of geometric features within the region. The frequencies may be compared to profiles or histograms of known language types, to determine if there are similarities between the frequencies in the sample document relative to those of the known language types. In some aspects, machine learning may be used to characterize the document as a particular language type based on the frequencies of the geometric features.

Computing system for extraction of textual elements from a document

Described herein are various technologies pertaining to text extraction from a document. A computing device receives the document. The document comprises computer-readable text and a layout, wherein the layout defines positions of the computer-readable text within a two-dimensional area represented by the document. Responsive to receiving the document, the computing device identifies at least one textual element in the computer-readable text based upon spatial factors between portions of the computer-readable text and contextual relationships between the portions of the computer-readable text. The computing device then outputs the at least one textual element.

METHOD AND APPARATUS FOR CONTEXTUAL INCLUSION OF OBJECTS IN A CONFERENCE

Aspects of the subject disclosure may include, for example, an assessment of a context associated with a conference, an identification of an object associated with the conference in accordance with the context, and a presentation of the object as part of the conference. The conference may include a videoconference and the object may include a physical object, a virtual object, or a combination thereof. Other embodiments are disclosed.

Image processing method and an image processing system
11170265 · 2021-11-09 · ·

An image processing method for recognising characters included in an image. A first character recognition unit performs recognition of a first group of characters corresponding to a first region of the image. A measuring unit calculates a confidence measure of the first group of characters. A determination unit determines whether further recognition is to be performed based on the confidence measure. A selection unit selects a second region of the image that includes the first region, if it is determined that further recognition is to be performed. A second character recognition unit performs further recognition of a second group of characters corresponding to the second region of the image.

POLYSEMANT MEANING LEARNING AND SEARCH RESULT DISPLAY
20210342658 · 2021-11-04 ·

A polysemant meaning learning method is provided. The method includes extracting a plurality of first target terms and at least one adjacent term combinations of each first target term; obtaining a capsule network model by training by taking the encoding of each first target term as an input vector and the encoding of each adjacent term combination corresponding to each first target term as an output vector; when a to-be-recognized second target term is recognized, inputting the second target term into the capsule network model, and determining a plurality of obtained intermediate vectors as feature vectors of the second target term; and clustering the feature vectors with a cosine similarity greater than a similarity threshold to generate representative terms of one or more categories and determining one or more meanings of the one or more categories.

MEDICAL IMAGE PROCESSING METHOD AND APPARATUS, ELECTRONIC MEDICAL DEVICE, AND STORAGE MEDIUM
20210343016 · 2021-11-04 ·

A medical image processing method includes: obtaining a biological tissue image including a biological tissue, recognizing, in the biological tissue image, a first region of a lesion object in the biological tissue; recognizing a lesion attribute matching the lesion object; dividing an image region of the biological tissue in the biological tissue image into a plurality of quadrant regions; obtaining target quadrant position information of a quadrant region in which the first region is located; and generating medical service data according to the target quadrant position information and the lesion attribute.

Keyframe scheduling method and apparatus, electronic device, program and medium

A key frame scheduling method and apparatus include: performing feature extraction on a current frame through a first network layer of a neural network to obtain low-layer features of the current frame acquiring a scheduling probability of the current frame according to low-level features of a previous key frame adjacent to the current frame and the low-level features of the current frame; determining whether the current frame is scheduled as a key frame according to the scheduling probability value of the current frame; and when determining that the current frame is scheduled as a key frame, performing feature extraction on low-level features of a current key frame via a second network layer of the neural network to obtain high-level features of the current key frame, where the network depth of the first network layer is less than the network depth of the second network layer.

Display overlays for prioritization of video subjects

Technology for generating camera viewfinder displays for camera people video recording/broadcasting live events such as sporting events, where the viewfinder displays include overlays that include: (i) priority values for objects shown on and/or off the live event view shown in viewfinder display; (ii) identifications of objects that are outside the viewfinder display; and/or (iii) direction to the locations of objects that are outside the viewfinder display. In response to these indications in the overlay, the cameraperson may move the camera to better capture a high priority object and/or capture an object that was outside the viewfinder display.

VEHICLE SENSOR OCCLUSION DETECTION

A system for detecting a road surface includes a processor programmed to identify, in a vehicle sensor field of view, environment features including a sky, a road surface, a road shoulder, based on map data and data received from the one or more vehicle sensors, upon determining a low confidence in identifying an area of the field of view to be a road surface or sky, to receive a polarimetric image, and to determine, based on the received polarimetric image, whether the identified area is a mirage phenomenon, a wet road surface, or the sky. The low confidence is determined upon determining that a road surface has a vanishing edge, a road lane marker is missing, or an anomalous object is present.