G06V30/10

RECOGNITION DEVICE, RECOGNITION METHOD, AND COMPUTER PROGRAM PRODUCT
20180012112 · 2018-01-11 ·

According to an embodiment, a recognition device includes a candidate detection unit, a recognition unit, a matching unit, and a prohibition processing unit. The candidate detection unit detects, from an input image, character candidates each being a set of pixels estimated to include a character. The recognition unit recognizes each of the character candidates and generates one or more recognition candidates each being a character of a candidate as a recognition result. The matching unit matches each of the one or more recognition candidates with a knowledge dictionary in which a recognition target character string is modeled, and generates matching results obtained by matching a character string estimated to be included in the input image with the knowledge dictionary. The prohibition processing unit deletes, from the matching results, a matching result obtained by matching a character string including a prohibition target character string with the knowledge dictionary.

PATTERN RECOGNITION DEVICE, PATTERN RECOGNITION METHOD, AND COMPUTER PROGRAM PRODUCT
20180012108 · 2018-01-11 ·

According to an embodiment, a pattern recognition device recognizes a pattern of an input signal by converting the input signal to a feature vector and matching the feature vector with a recognition dictionary. The recognition dictionary includes a dictionary subspace basis vector for expressing a dictionary subspace which is a subspace of a space of the feature vector, and a plurality of probability parameters for converting similarity calculated from the feature vector and the dictionary subspace into likelihood. The device includes a recognition unit configured to calculate the similarity using a quadratic polynomial of a value of an inner product of the feature vector and the dictionary subspace basis vector, and calculate the likelihood using the similarity and an exponential function of a linear sum of the probability parameters. The recognition dictionary is trained by using an expectation maximization method using a constraint condition between the probability parameters.

Artificial intelligence based smart data engine

A machine learning computing system for extracting structured data objects from electronic documents comprising unstructured text includes a first data repository storing a plurality of electronic documents including at least one text data object and an expert system computing device. The expert system computing device includes a processor and a non-transitory memory device storing instructions causing the expert system to receive a first data object comprising unstructured data identified from an electronic document stored in the first data repository, process, a first set of rules to identify at least one key-value pair data object from the first data object; process, by an inference engine module, a second set of rules to identify at least one free text data object from the first data object and store, in a non-transitory memory device, the at least one key-value pair and the at least one free text data object.

Artificial intelligence based smart data engine

A machine learning computing system for extracting structured data objects from electronic documents comprising unstructured text includes a first data repository storing a plurality of electronic documents including at least one text data object and an expert system computing device. The expert system computing device includes a processor and a non-transitory memory device storing instructions causing the expert system to receive a first data object comprising unstructured data identified from an electronic document stored in the first data repository, process, a first set of rules to identify at least one key-value pair data object from the first data object; process, by an inference engine module, a second set of rules to identify at least one free text data object from the first data object and store, in a non-transitory memory device, the at least one key-value pair and the at least one free text data object.

SYSTEMS AND METHODS FOR STRIKE THROUGH DETECTION
20180012099 · 2018-01-11 ·

The present disclosure is directed to systems and methods for strike through detection and, more particularly, to systems and methods for detecting a strike through in an address block of a mailpiece. The method is implemented in a computing device and includes: generating edges of lines within a text block identified through optical character recognition processes; locating text lines within the text block; characterizing the edges within the text lines and outside of the text lines; and grouping identified edges of the characterized edges outside of the text lines into co-linear groups.

METHOD AND APPARATUS FOR VERIFYING VEHICLE OWNERSHIP FROM AN IMAGE

Some aspects of the invention relate to a mobile apparatus including an image sensor configured to convert an optical image into an electrical signal. The optical image includes an image of a vehicle license plate. The mobile apparatus includes a license plate detector configured to process the electrical signal to recover information from the vehicle license plate image. The mobile apparatus includes an interface configured to transmit the vehicle license plate information to a remote apparatus and receive verification of vehicle ownership in response to the transmission.

Electronic document data extraction

Methods, systems, and computer storage media are provided for data extraction. A target document representation may be generated based on modified text of a target electronic document. A measure of similarity may be determined between the target document representation and a reference document representation, which may be based on modified text of a reference electronic document. Based on the measure of similarity, the reference document representation may be selected. An extraction model associated with the selected reference document representation can then be used to extract data from the target document.

Electronic document data extraction

Methods, systems, and computer storage media are provided for data extraction. A target document representation may be generated based on modified text of a target electronic document. A measure of similarity may be determined between the target document representation and a reference document representation, which may be based on modified text of a reference electronic document. Based on the measure of similarity, the reference document representation may be selected. An extraction model associated with the selected reference document representation can then be used to extract data from the target document.

SYSTEMS AND METHODS FOR IMPROVED OPTICAL CHARACTER RECOGNITION OF HEALTH RECORDS
20180011974 · 2018-01-11 ·

Systems and methods to improve the optical character recognition of records, and in particular health records, are provided. An image of a medical record is received, and an initial optical image recognition (OCR) on the image is performed to identify text information. The OCR signal quality may be measured, and areas of insufficient OCR signal quality may be isolated. The signal quality is determined by a weighted average of semantic analysis of the resulting text, and/or OCR accuracy measures. The OCR process may be repeated on the isolated regions of lower signal quality, each time using a different OCR transform, until all regions are completed with a desired degree of signal quality (accuracy). All the regions of the document may then be recompiled into a single document for outputting.

SHADOW DETECTION AND REMOVAL IN LICENSE PLATE IMAGES
20180012101 · 2018-01-11 ·

A method, system, and apparatus for license plate relighting comprises collecting an image of a license plate, performing license plate recognition on the image of the license plate; calculating a confidence metric for the license plate recognition; and performing a shadow detection and relighting method if the confidence metric is below a predetermined threshold, comprising identifying a shaded region of said license plate, determining if the shaded region is actually shaded, and relighting the actually shaded region.