Patent classifications
G06V30/268
CONTENT-AWARE SELECTION
An image editing program can include a content-aware selection system. The content-aware selection system can enable a user to select an area of an image using a label or a tag that identifies object in the image, rather than having to make a selection area based on coordinates and/or pixel values. The program can receive a digital image and metadata that describes an object in the image. The program can further receive a label, and can determine from the metadata that the label is associated with the object. The program can then select a bounding box for the object, and identify in the bounding box, pixels that represent the object. The program can then output a selection area that surrounds the pixels.
Handwriting recognition systems and methods
Systems and methods are provided for handwriting recognition. Handwriting data is received. Geometric data of text in handwriting data is determined. Sub-characters of the text are determined. Sub-characters of text are matched to a model. Most probable characters of the text are determined based on the matching.
Object recognition device that determines overlapping states for a plurality of objects
An object recognition device according to an embodiment includes a camera that captures an image of an imaging area. A storage device stores, for each of a plurality of registered objects, dictionary feature information for identifying the corresponding object and dictionary boundary information for identifying an actual boundary area of the corresponding object. A processor receives the captured image from the camera, and determines an object area in the captured image. The processor extracts feature information from the object area, and, based on the extracted feature information compared to the dictionary feature information, identifies each object included in the object area. The processor also extracts boundary information corresponding to each identified object included in the object area, and, based on the extracted boundary information compared to the dictionary boundary information with respect to each identified object, determines an overlap state of each identified object in the object area.
Machine learning data extraction algorithms
Embodiments of the present disclosure pertain to extracting data corresponding to particular data types using machine learning algorithms. In one embodiment, a method includes receiving an image in a backend system, sending the image to an optical character recognition (OCR) component, and in accordance therewith, receiving a plurality of characters recognized in the image. The character set is matched against known values to generate candidate character strings. The character set is processed by one or more machine learning algorithms to produce features. For each candidate character string, the features are then processed by a random forest model to determine a final character string.
Method and System for Detecting Fake News Based on Multi-Task Learning Model
A method, a system, and a computer program product for detecting fake news based on a multi-task learning model. In an embodiment, a multi-task learning model is used to perform joint training on authenticity detection and topic classification of news to be detected, and authenticity of the news to be detected and a topic of the news to be detected are returned simultaneously. Through the implementation of the embodiment of the present invention, the authenticity of the news and the topic of the news can be detected simultaneously, and the accuracy of fake news detection and topic classification is improved.
Content-aware selection
An image editing program can include a content-aware selection system. The content-aware selection system can enable a user to select an area of an image using a label or a tag that identifies object in the image, rather than having to make a selection area based on coordinates and/or pixel values. The program can receive a digital image and metadata that describes an object in the image. The program can further receive a label, and can determine from the metadata that the label is associated with the object. The program can then select a bounding box for the object, and identify in the bounding box, pixels that represent the object. The program can then output a selection area that surrounds the pixels.
Data extraction and duplicate detection
A system provides an end-to-end solution for invoice processing which includes reading invoices (both pdfs and images), extracting key relevant information from the face of invoices, organizing the relevant information in a structured template as a key-value pair, and comparing invoices based on the similarities between different invoice fields to identify potential duplicate invoices.
METHOD AND APPARATUS FOR DETECTING TEXT REGIONS IN IMAGE, DEVICE, AND MEDIUM
Embodiments of the present disclosure provide a method and apparatus for detecting text regions in an image, a device, and a medium. The method may include: detecting, based on feature representation of an image, a first text region in the image, where the first text region covers a text in the image, a region occupied by the text being of a certain shape; determining, based on a feature block of the first text region, text geometry information associated with the text, where the text geometry information includes a text centerline of the text and distance information of the centerline from the upper and lower borders of the text; and adjusting, based on the text geometry information associated with the text, the first text region to a second text region, where the second text region also covers the text and is smaller than the first text region.
Method and apparatus for recognizing characters
A method and an apparatus for recognizing characters using an image are provided. A camera is activated according to a character recognition request and a preview mode is set for displaying an image photographed through the camera in real time. An auto focus of the camera is controlled and an image having a predetermined level of clarity is obtained for character recognition from the images obtained in the preview mode. The image for character recognition is character-recognition-processed so as to extract recognition result data. A final recognition character row is drawn that excludes non-character data from the recognition result data. A first word is combined including at least one character of the final recognition character row and a predetermined maximum number of characters. A dictionary database that stores dictionary information on various languages using the first word is searched, so as to provide the user with the corresponding word.
Correction of misspellings in QA system
Embodiments provide a computer implemented method for identifying and correcting a misspelling in a question answering (QA) system, wherein the QA system is coupled to a document corpus, and the document corpus includes a plurality of documents related to a particular domain. The method includes the following steps: receiving an input question and a plurality of passages, wherein the plurality of passages are extracted from the document corpus by the QA system; providing at least one alternate form for each token extracted from the input question and the plurality of passages; identifying at least one misspelled token; and scoring at least one alternate form of each identified misspelled token.