Patent classifications
G06V30/148
LOW POWER MACHINE LEARNING USING REAL-TIME CAPTURED REGIONS OF INTEREST
Systems and methods are described for generating image content. The systems and methods may include, in response to receiving a request to cause a sensor of a computing device to identify image content associated with optical data captured by the sensor, detecting a first sensor data stream having a first image resolution, and detecting a second sensor data stream having a second image resolution. The systems and method may also include identifying, by processing circuitry of the computing device, at least one region of interest in the first sensor data stream, determining cropping coordinates that define a first plurality of pixels in the at least one region of interest in the first sensor data stream, and generating a cropped image representing the at least one region of interest.
ON-DEVICE ARTIFICIAL INTELLIGENCE SYSTEMS AND METHODS FOR DOCUMENT AUTO-ROTATION
An auto-rotation module having a single-layer neural network on a user device can convert a document image to a monochrome image having black and white pixels and segment the monochrome image into bounding boxes, each bounding box defining a connected segment of black pixels in the monochrome image. The auto-rotation module can determine textual snippets from the bounding boxes and prepare them into input images for the single-layer neural network. The single-layer neural network is trained to process each input image, recognize a correct orientation, and output a set of results for each input image. Each result indicates a probability associated with a particular orientation. The auto-rotation module can examine the results, determine what degree of rotation is needed to achieve a correct orientation of the document image, and automatically rotate the document image by the degree of rotation needed to achieve the correct orientation of the document image.
System and Computer-Implemented Method for Character Recognition in Payment Card
The present disclosure relates to a system and computer-implemented method for character recognition in a payment card. The method includes receiving an image of a payment card and one or more details associated with the payment card. Further, a derivative of the image is determined based on the one or more details and a horizontal sum of pixel values is determined for a plurality of rows in the image. Furthermore, one or more Regions of Interest (ROIs) are identified in the image by comparing the horizontal sum of pixel values with a predefined first threshold. Subsequently, one or more characters in the one or more ROIs are extracted using one or more peak values in a histogram of the one or more ROIs. Finally, each of the one or more characters extracted from the one or more ROIs is recognized using a trained Artificial Intelligence technique.
TEXT RECOGNITION METHOD, ELECTRONIC DEVICE, AND NON-TRANSITORY STORAGE MEDIUM
Provided are a text recognition method, an electronic device, and a non-transitory computer-readable storage medium, which are applicable in an OCR scenario. In the particular solution, a text image to be recognized is acquired. Feature extraction is performed on the text image, to obtain an image feature corresponding to the text image, where a height-wise feature and a width-wise feature of the image feature each have a dimension greater than 1. According to the image feature, sampling features corresponding to multiple sampling points in the text image are determined. According to the sampling features corresponding to the multiple sampling points, a character recognition result corresponding to the text image is determined.
TEXT RECOGNITION METHOD, ELECTRONIC DEVICE, AND NON-TRANSITORY STORAGE MEDIUM
Provided are a text recognition method, an electronic device, and a non-transitory computer-readable storage medium, which are applicable in an OCR scenario. In the particular solution, a text image to be recognized is acquired. Feature extraction is performed on the text image, to obtain an image feature corresponding to the text image, where a height-wise feature and a width-wise feature of the image feature each have a dimension greater than 1. According to the image feature, sampling features corresponding to multiple sampling points in the text image are determined. According to the sampling features corresponding to the multiple sampling points, a character recognition result corresponding to the text image is determined.
CHARACTER RECOGNITION METHOD, COMPUTER PROGRAM PRODUCT WITH STORED PROGRAM AND COMPUTER READABLE MEDIUM WITH STORED PROGRAM
A character recognition method includes inputting an input image of a document, with the input image including a plurality of characters; selecting the plurality of characters through an object detection module to form at least one character region; separating the plurality of characters in the at least one character region to form a plurality of character boxes; performing calculation to determine a format of a character in each of the plurality of character boxes; recognizing the characters in the at least one character region through an object recognition module to determine a symbol content of the character in each of the plurality of character boxes; and converting the plurality of characters according to the format and symbol content of the character in each of the plurality of character boxes, and outputting corresponding editable characters.
Visual domain detection systems and methods
Disclosed is an effective domain name defense solution in which a domain name string may be provided to or obtained by a computer embodying a visual domain analyzer. The domain name string may be rendered or otherwise converted to an image. An optical character recognition function may be applied to the image to read out a text string which can then be compared with a protected domain name to determine whether the text string generated by the optical character recognition function from the image converted from the domain name string is similar to or matches the protected domain name. This visual domain analysis can be dynamically applied in an online process or proactively applied in an offline process to hundreds of millions of domain names.
Semantic cluster formation in deep learning intelligent assistants
Enhanced techniques and circuitry are presented herein for providing responses to questions from among digital documentation sources spanning various documentation formats, versions, and types. One example includes a method comprising receiving an indication of a question directed to subject having a documentation corpus, determining a set of passages of the documentation corpus related to the question, ranking the set of passages according to relevance to the question, forming semantic clusters comprising sentences extracted from ranked ones of the set of passages according to sentence similarity, and providing a response to the question based at least on a selected semantic cluster.
Multi-step document information extraction
Briefly, embodiments of a system, method, and article for receiving a document from a remote device and identifying items in the document. Various operations may be performed based on one or more dependencies of the identified items. For example, additional items may be identified in the document. One or more of the identified items may be parsed. A correspondence between the identified items and a second set of items may be determined. The identified items may be validated based on a set of rules. One or more of the identified items may be transmitted to the remote device in response to the performance of the various operations.
Collision avoidance for document field placement
Users of a database management engine may generate fillable digital documents by mapping interface elements onto form documents. When a user maps interface elements onto a form document, the user may accidentally overlap two or more interface elements. To rectify this, the database management engine may modify the position of one of interface elements based on a set of positioning rules. In addition, the database management engine may identify and suggest mappings to users based on similar documents that have been previously mapped. The database management engine identifies similar documents using information about the document, the user, and the mapping itself. The mapping associated with the most similar document may be provided to the user as a suggested mapping. The database management engine converts the form document and finalized mapping into a fillable digital document. The fillable digital document is sent to recipients, who complete the fillable digital document.