Patent classifications
G06V30/1456
Detection of texts
The present disclosure relates to detection of texts. A text detecting method includes: acquiring a first image to be detected of a text object to be detected; determining whether the first image to be detected contains a predetermined indicator; determining, if the first image to be detected contains the predetermined indicator, a position of the predetermined indicator, and acquiring a second image to be detected of the text object to be detected; determining whether the second image to be detected contains the predetermined indicator; and determining, if the second image to be detected does not contain the predetermined indicator, a text detecting region based on the position of the predetermined indicator.
SYSTEMS AND METHODS FOR STRUCTURAL DATA ANALYSIS
Systems and methods for viewing, tracking, and analyzing data structure. Particularly, systems and methods for recognizing and grouping structural components of data into data shapes for viewing, tracking, and analyzing the data structure irrespective of the data content. An example method of analyzing data may include receiving document data comprising a plurality of data fields and defining a data shape from the document data, the data shape having one or more of the plurality of data fields. The data shape is defined agnostic to data content. The data shape may further include a qualifier associated with a data field. The data shape may be a first data shape, and the method may further include defining a second data shape from the document data, the second data shape having one or more of the plurality of data fields. The second shape may comprise the first data shape and an additional element.
APPROACH FOR CLOUD EMR COMMUNICATION VIA A CONTENT PARSING ENGINE AND A STORAGE SERVICE
An approach provides sending captured Superbill image data and output data generated based on results of parsing the captured Superbill image data to an external system which manages Superbill data via a cloud system and a storage service. The cloud system creates parsing rule data for parsing a captured Superbill image in accordance with user operation at a client device. The cloud system obtains captured Superbill image data from the storage service, in response to receiving a notification indicating that the captured Superbill image data has been stored in the storage service. The cloud system parses the obtained image data based on the created parsing rule and generates output data based on results of parsing. The cloud system sends the generated output data and the obtained image data to the external system via the one or more computer networks.
Voice controlled keyboard typing using computer vision
Techniques for systems and methods that provide for utilizing a robotic system to type commands that correspond to voice commands using a keyboard of a device are described herein. In embodiments, an image of a device may be received from a camera. A keyboard region of the device may be determined based on a keyboard detection algorithm that uses the image. One or more characters in a portion of the image that corresponds to the keyboard region of the device may be detected based on a character detection algorithm. The one or more characters may be grouped into one or more groups based on the portion of the image. A character of a portion of character associated with a group may be edited based on an error detection algorithm.
Information processing apparatus and image forming apparatus performing file conversion of handwriting comment and comment extraction method
Provided is information processing apparatus that extracts a handwriting comment. A comment acquiring part searches handwriting comment from an image data of a scanned manuscript and acquire the handwriting comment in association with position information indicating the handwriting comment for an area of the manuscript. A filing part converts the handwriting comment acquired by the comment acquiring part into a file. An OCR part performs optical character recognition (OCR). A filing part performs OCR of the comment by the OCR part, and when recognizable as a character, converts character data of the handwriting comment into the file, and when unrecognizable as a character, acquires the area of the handwriting comment from image data of the manuscript and converts into the file.
Real-time image capture system
A method for processing camera images in real-time is disclosed. The user starts a process that generates a feed from a mobile device of camera images of a document. An image from the feed is obtained and set as the current image. One or more of a plurality of aspects regarding the image are computed and tested against a threshold. If the aspect fails to satisfy the threshold, then the user is prompted to change the move the mobile device. Once the aspects satisfy their respective thresholds, the camera feed is stopped, the last image is saved, and character recognition of the last image is performed and displayed to the user.
Identification method and electronic device
An identification method and an electronic device are provided. The identification method comprises: detecting at least one object using the electronic device; providing an identification box having a first appearance which corresponds with the at least one object as detected; and displaying the identification box having the first appearance via the electronic device.
INFORMATION PROCESSING DEVICE AND INFORMATION PROCESSING METHOD
An information processing device performs processing on document image data including first image data to undergo character recognition processing and second image data not to undergo character recognition processing. The information processing device includes a detecting section which detects the first image data, an extracting section which extracts the first image data, and a processing section. The processing section includes a counting section which counts first images, a determining section which determines whether the number of the first images exceeds a threshold, a first performing section which performs first processing when the threshold is exceeded, and a second performing section which performs second processing when the threshold is not exceeded. Through the first processing, the second image is masked with a background color of the document image and character recognition is then performed on the document image. Through the second processing, character recognition is performed on the first images.
System and method to facilitate content distribution
Systems and methods are provided that facilitate publishing, distributing, and reading of electronic content. In some embodiments, the systems and methods may include a document conversion module for converting documents uploaded by publishers into an e-reader friendly format (an e-document). The systems and methods may also include a virtual library for making the e-documents available to end users and an active reader module to allow an end user to download and read the e-documents on an end user device. In some embodiments, the systems and methods may include a user management module for digital rights management and control of end user access to the e-documents. In some embodiments, the active reader may include functionality that allows an end user to annotate the e-document and share comments among users.
METHOD AND APPARATUS FOR RECOGNIZING CHARACTERS
A method and an apparatus for recognizing characters using an image are provided. A camera is activated according to a character recognition request and a preview mode is set for displaying an image photographed through the camera in real time. An auto focus of the camera is controlled and an image having a predetermined level of clarity is obtained for character recognition from the images obtained in the preview mode. The image for character recognition is character-recognition-processed so as to extract recognition result data. A final recognition character row is drawn that excludes non-character data from the recognition result data. A first word is combined including at least one character of the final recognition character row and a predetermined maximum number of characters. A dictionary database that stores dictionary information on various languages using the first word is searched, so as to provide the user with the corresponding word.