Patent classifications
G06V30/19147
Collaborative text detection and text recognition
Described are approaches for assigning tasks between machine resources (e.g., AI task performers, AI task validators), human resources (e.g., task performers, task validators), and/or other smart systems to facilitate collaborative text detection, text recognition, and text retrieval in order to optimize system performance along a variety of different selection criteria specifying various performant dimensions, including, but not limited to improving system efficiency, reducing task performer and/or task validator idle time, improving triage outcomes, reducing data processing loads, maintaining client confidentiality, etc., that may be associated with one or more customers.
IMAGE PROCESSING SYSTEM, IMAGE PROCESSING METHOD, AND STORAGE MEDIUM
An image processing system performs tilt correction with respect to a document image having handwritten characters and typed letters mixed with each other. The image processing system separates the document image into an image with handwritten characters determined as handwritten characters and an image without handwritten characters not determined as handwritten characters, estimates a tilt angle of the image without handwritten characters, and corrects the document image on the basis of the tilt angle.
COMPUTER-READABLE, NON-TRANSITORY RECORDING MEDIUM CONTAINING THEREIN IMAGE PROCESSING PROGRAM FOR GENERATING LEARNING DATA OF CHARACTER DETECTION MODEL, AND IMAGE PROCESSING APPARATUS
A computer-readable, non-transitory recording medium contains therein an image processing program. The image processing program is for generating learning data of a character detection model that at least detects, to recognize a character in a document contained in an image, a position of the character in the image, and configured to cause a computer to generate a cropped image by cropping the image, and adopt the cropped image not containing an image representing a split character as the learning data, instead of adopting the cropped image containing the image representing the split character as the learning data.
COLLABORATIVE TEXT DETECTION AND TEXT RECOGNITION
Described are approaches for assigning tasks between machine resources (e.g., AI task performers, AI task validators), human resources (e.g., task performers, task validators), and/or other smart systems to facilitate collaborative text detection, text recognition, and text retrieval in order to optimize system performance along a variety of different selection criteria specifying various performant dimensions, including, but not limited to improving system efficiency, reducing task performer and/or task validator idle time, improving triage outcomes, reducing data processing loads, maintaining client confidentiality, etc., that may be associated with one or more customers.
TEXT RECOGNITION IN IMAGE
According to implementations of the subject matter described herein, there is provided a solution for text recognition in an image. In this solution, a target text line area, which is expected to include a text to be recognized, is determined from an image. Probability distribution information of a character model element(s) present in the target text line area is determined using a single character model. The single character model is trained based on training text line areas and respective ground-truth texts in the training text line areas. Texts in the training text line areas are arranged in different orientations, and/or the ground-truth texts comprise texts are related to various languages (e.g., texts related to a Latin and an Eastern languages). The text in the target text line area can be determined based on the determined probability distribution information. The single character model enables more efficient and convenient text recognition.
DATA NETWORK, SYSTEM AND METHOD FOR DATA INGESTION IN A DATA NETWORK
The present invention provides a data network, a data ingestion system and a method of data ingestion in the data network for a supply chain management enterprise application. The data network includes one or more data objects of different data types received from different data sources structured on multiple distinct architecture, connected to each other for executing multiple functions in the enterprise application.
INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD AND NON-TRANSITORY STORAGE MEDIUM
The information processing apparatus according to the present disclosure synthesizes a handwriting image with a noise image to generate a synthesized image, generates a correct label indicative of handwriting pixels from the handwriting image, and applies the synthesized image and the correct label as learning data to generate a learning model.
Method, apparatus and system for identifying target objects
The present disclosure provides a method, apparatus and system for identifying target objects. The method includes: clipping out a target image from an acquired image, wherein the target image involves a plurality of target objects to be identified that are stacked; adjusting a height of the target image to a preset height, wherein a height direction of the target image corresponds to a direction in which the plurality of target objects are stacked; extracting a feature map of the adjusted target image; segmenting the feature map in a dimension corresponding to the height direction of the target image to obtain a preset number of segment features; and identifying the target objects based on each of the preset number of segment features.
Document processing system and method for performing document classification by machine learning
A document processing system and method for performing document classification by machine learning include an input module, a processing module, and at least one storage module preconfigured with a classification folder matching a code. Upon completion of a first-instance model construction procedure, the input module receives a document image. The processing module compares the document image with a machine learning model information to generate a computation result and stores the document image in the classification folder according to the computation result. Therefore, classification of the document images is automated according to the code of the corresponding classification folder, thereby enhancing the accuracy and efficiency of document classification.
LEARNING USER INTERFACE CONTROLS VIA INCREMENTAL DATA SYNTHESIS
A User Interface (UI) interface object detection system employs an initial dataset comprising a set of images, that may include synthesized images, to train a Machine Learning (ML) engine to generate an initial trained model. A data point generator is employed to generate an updated synthesized image set which is used to further train the ML engine. The data point generator may employ images generated by an application program as a reference by which to generate the updated synthesized image set. The images generated by the application program may be tagged in advance. Alternatively, or in addition, the images generated by the application program may be captured dynamically by a user using the application program.