Patent classifications
G06V30/1478
DEVICES AND METHODS FOR GENERATING INPUT
Devices and methods are disclosed for generating input. In one implementation, a stylus is provided for generating writing input. The stylus includes an elongated body having a distal end, and a light source configured to project coherent light on an opposing surface adjacent the distal end. The stylus further includes at least one sensor configured to measure first reflections of the coherent light from the opposing surface while the distal end moves in contact with the opposing surface, and to measure second reflections of the coherent light from the opposing surface while the distal end moves above the opposing surface and out of contact with the opposing surface. The stylus also includes at least one processor configured to receive input from the at least one sensor and to enable determining three dimensional positions of the distal end based on the first reflections and the second reflections.
System and method of character recognition using fully convolutional neural networks with attention
Embodiments of the present disclosure include a method that obtains a digital image. The method includes extracting a word block from the digital image. The method includes processing the word block by evaluating a value of the word block against a dictionary. The method includes outputting a prediction equal to a common word in the dictionary when a confidence factor is greater than a predetermined threshold. The method includes processing the word block and assigning a descriptor to the word block corresponding to a property of the word block. The method includes processing the word block using the descriptor to prioritize evaluation of the word block. The method includes concatenating a first output and a second output. The method includes predicting a value of the word block.
Apparatuses, methods, and systems for 3-channel dynamic contextual script recognition using neural network image analytics and 4-tuple machine learning with enhanced templates and context data
In some embodiments, a method includes training a first machine learning model based on multiple documents and multiple templates associated with the multiple documents. The method further includes executing the first machine learning model to generate multiple relevancy masks, the multiple relevancy masks to remove a visual structure of the multiple templates from a visual structure of the multiple documents. The method further includes generating multiple multichannel field images to include the multiple relevancy masks and at least one of the multiple documents or the multiple templates. The method further includes training a second machine learning model based on the multiple multichannel field images and multiple non-native texts associated with the multiple documents. The method further includes executing the second machine learning model to generate multiple non-native texts from the multiple multichannel field images.
Image processing method, apparatus, electronic device and computer readable storage medium
The present application discloses an image processing method, apparatus, electronic device and computer readable storage medium. The image processing method comprises detecting a text region in an image to be processed, recognizing the text region to obtain a text recognition result. In this application, the text recognition in the image to be processed is realized, the recognition manner for the text in the image is simplified, and the recognition effect for the text is improved.
APPARATUS AND METHOD FOR RECOMMENDING LEARNING USING OPTICAL CHARACTER RECOGNITION
There are provided a learning recommendation apparatus and method for detecting a problem from an image through character recognition and providing at least one sub-topic learning among a plurality of sub-topic learnings related to the detected problem. The provided learning recommendation apparatus recommends, as a recommendation target, a plurality of learning topics including the concept of a formula which has been read through the character recognition for an image, wherein a priority order is set to the plurality of learning topics based on the concept distance between the learning topic and the learning history, and the learning topics are recommended so that the learning topic having a higher priority order is located at a higher position.
METHODS, SYSTEMS, APPARATUS AND ARTICLES OF MANUFACTURE FOR RECEIPT DECODING
Methods, apparatus, systems and articles of manufacture are disclosed for receipt decoding. An example apparatus includes processor circuitry to execute instructions to extract text from the receipt image, the text including bounding boxes; associate ones of the bounding boxes to link horizontally related fields of a the receipt image by selecting a first bounding box; identifying first horizontally aligned bounding boxes, the first horizontally aligned bounding boxes to include at least one bounding box of the bounding boxes that is horizontally aligned relative to the first bounding box; adding the first horizontally aligned bounding boxes to a word sync list; and connecting ones of the first horizontally aligned bounding boxes and the first bounding box based on at least one of an amount of the first horizontally aligned bounding boxes in the word sync list and a relationship among the first horizontally aligned bounding boxes and the first bounding box.
SYSTEM AND METHOD FOR STRAIGHTENING CURVED PAGE CONTENT
The page straightening system includes a word module to determine an enclosing quadrilateral of each connected component of curved page content. Further, a line module in the page straightening system is configured to form text lines by joining enclosing quadrilaterals based on a reading order. Subsequently, a correction module in the page straightening system is configured to generate straightened content from the curved content based on the text lines. As such, the page straightening system can automatically straighten curved page content.
INFORMATION PROCESSING APPARATUS AND PROGRAM
An information processing apparatus capable of displaying an image on a predetermined display unit, includes: a reception unit that receives a written input on an image according to an operation of a user in a state where the image is displayed on the display unit; a generation unit that generates a written object according to the written input received by the reception unit; a reference detection unit that detects a reference direction of the image displayed on the display unit; a correction unit that corrects the written object on the basis of the reference direction detected by the reference detection unit; and a display control unit that displays the written object generated by the generation unit.
RECORDING DOSE DATA FROM DRUG INJECTION DEVICES USING OPTICAL CHARACTER RECOGNITION (OCR)
A method of recording a medicament dose using a data collection device comprises capturing an image of a medicament dose indicator of a medicament delivery device, adjusting a scale of said image, adjusting said image for skew of one or more characters displayed by the medicament dose indicator, determining the position of at least one of said one or more characters in the image, identifying the at least one character using optical character recognition and determining a medicament dose indicator by the medicament dose indicator based on a result of the optical character recognition. The method may be performed using a handheld electronic device comprising a camera, such as a cellphone, a tablet computer or other device. A computer program for controlling a data collection device to perform the method may be provided in the form of a software application or “app”.
Model-based dewarping method and apparatus
An apparatus and method for processing a captured image and, more particularly, for processing a captured image comprising a document. In one embodiment, an apparatus comprising a camera to capture documents is described. In another embodiment, a method for processing a captured image that includes a document comprises the steps of distinguishing an imaged document from its background, adjusting the captured image to reduce distortions created from use of a camera and properly orienting the document is described.