G06V30/1478

DEEP-LEARNING-BASED SYSTEM AND PROCESS FOR IMAGE RECOGNITION

Disclosed are methods and systems for using artificial intelligence (AI) for image recognition by using predefined coordinates to extract a portion of a received image, the extracted portion comprising a word to be identified having at least a first letter and a second letter; executing an image recognition protocol to identify the first letter; when the server is unable to identify the second letter, the server executes an AI model having a nodal data structure to identify the second letter based upon the identified first letter, the nodal data structure comprising a set of nodes where each node represents a letter, each node connected to at least one other node, wherein connection of a first node to a second node corresponds to a probability that a letter corresponding to the second node is used in a word subsequent to a letter corresponding to the first node.

Text image correction method and apparatus

A text image correction method and a corresponding text image correction apparatus. Frequency information of a row-direction cumulative curve used by the method is sensitive to an error between a compensation angle for a tilt angle and a real tilt angle, and the method thus has good robustness. The method can accurately estimate the compensation angle for a tilt angle and correct a tilted text image. The method and apparatus can be applied to scenarios such as image pre-processing, automatic compensation for angles of scanned text images, automatic compensation for tilt angles of mobile phone photos.

Machine-learning models for image processing

Presented herein are systems and methods for the employment of machine learning models for image processing. A mobile application for client-side image processing and validation, which interacts with and leverages native image processing software of the client device, where the image processing software and the mobile application include any number of machine-learning models for identifying a document and attributes of the document for recognition and validation. This mobile application uses the image processing software from a client operating system to control the camera. The image processing software generates various types of information about a video frame and the document, and the mobile application invokes APIs or software libraries of the image processing software to access the information and validate the frame and document.