G06V10/32

SYSTEMS AND METHODS FOR MOBILE IMAGE CAPTURE AND CONTENT PROCESSING OF DRIVER'S LICENSES
20230091041 · 2023-03-23 ·

Systems and methods are provided for processing and extracting content from an image captured using a mobile device. In one embodiment, an image is captured by a mobile device and corrected to improve the quality of the image. The corrected image is then further processed by adjusting the image, identifying the format and layout of the document, binarizing the image and extracting the content using optical character recognition (OCR). Multiple methods of image adjusting may be implemented to accurately assess features of the document, and a secondary layout identification process may be performed to ensure that the content being extracted is properly classified.

SYSTEMS AND METHODS FOR MOBILE IMAGE CAPTURE AND CONTENT PROCESSING OF DRIVER'S LICENSES
20230091041 · 2023-03-23 ·

Systems and methods are provided for processing and extracting content from an image captured using a mobile device. In one embodiment, an image is captured by a mobile device and corrected to improve the quality of the image. The corrected image is then further processed by adjusting the image, identifying the format and layout of the document, binarizing the image and extracting the content using optical character recognition (OCR). Multiple methods of image adjusting may be implemented to accurately assess features of the document, and a secondary layout identification process may be performed to ensure that the content being extracted is properly classified.

USE OF NEURAL NETWORKS TO PREDICT LANE LINE TYPES
20220350993 · 2022-11-03 ·

A method of predicting lane line types with neural networks includes capturing optical information with one or more optical sensors disposed on a vehicle. The method further includes cropping the optical information to a predetermined size, passing cropped optical information through a neural network, and assessing the optical information to detect locations of a plurality of lane lines in the optical information. The method further includes predicting a plurality of values assigned to predetermined lane line types of the plurality of lane lines. The method further determines a maximum confidence value for each of the plurality of values assigned to the predetermined lane line types for each of the plurality of lane lines; and extracts a lane line label corresponding to the maximum confidence value for each of the plurality of lane lines.

METHOD AND APPARATUS FOR SCENE SEGMENTATION FOR THREE-DIMENSIONAL SCENE RECONSTRUCTION
20230092248 · 2023-03-23 ·

A method includes obtaining, from an image sensor, image data of a real-world scene; obtaining, from a depth sensor, sparse depth data of the real-world scene; and passing the image data to a first neural network to obtain one or more object regions of interest (ROIs) and one or more feature map ROIs. Each object ROI includes at least one detected object. The method also includes passing the image data and sparse depth data to a second neural network to obtain one or more dense depth map ROIs; aligning the one or more object ROIs, one or more feature map ROIs, and one or more dense depth map ROIs; and passing the aligned ROIs to a fully convolutional network to obtain a segmentation of the real-world scene. The segmentation contains one or more pixelwise predictions of one or more objects in the real-world scene.

METHOD AND APPARATUS FOR SCENE SEGMENTATION FOR THREE-DIMENSIONAL SCENE RECONSTRUCTION
20230092248 · 2023-03-23 ·

A method includes obtaining, from an image sensor, image data of a real-world scene; obtaining, from a depth sensor, sparse depth data of the real-world scene; and passing the image data to a first neural network to obtain one or more object regions of interest (ROIs) and one or more feature map ROIs. Each object ROI includes at least one detected object. The method also includes passing the image data and sparse depth data to a second neural network to obtain one or more dense depth map ROIs; aligning the one or more object ROIs, one or more feature map ROIs, and one or more dense depth map ROIs; and passing the aligned ROIs to a fully convolutional network to obtain a segmentation of the real-world scene. The segmentation contains one or more pixelwise predictions of one or more objects in the real-world scene.

HIGH-PRECISION IDENTIFICATION METHOD AND SYSTEM FOR SUBSTATIONS

The present invention provides a high-precision identification method and system for substations, including building a Mask RCNN objection recognition network model based on convolutional neural networks; inputting acquired image information of a object into the Mask RCNN object recognition network model for preliminary recognition and outputting a recognition result of the object; using an information entropy to create a semantic decision tree and correcting the recognition result of the object according to a principle of relative correlation between different objects and outputting a final recognition decision result; reading the recognition decision result to obtain a true type of the object to be recognized. The present invention performs preliminary recognition through the Mask RCNN object recognition network model, performs secondary judgment and correction recognition in combination with a semantic decision tree, and uses the relative correlation between substations to fuse image features on the basis of increasing the judging criteria to obtain the true type. This greatly improves the accuracy of image recognition of substations, and has a positive role in the research and development of automatic inspection equipment for inspection robots.

HIGH-PRECISION IDENTIFICATION METHOD AND SYSTEM FOR SUBSTATIONS

The present invention provides a high-precision identification method and system for substations, including building a Mask RCNN objection recognition network model based on convolutional neural networks; inputting acquired image information of a object into the Mask RCNN object recognition network model for preliminary recognition and outputting a recognition result of the object; using an information entropy to create a semantic decision tree and correcting the recognition result of the object according to a principle of relative correlation between different objects and outputting a final recognition decision result; reading the recognition decision result to obtain a true type of the object to be recognized. The present invention performs preliminary recognition through the Mask RCNN object recognition network model, performs secondary judgment and correction recognition in combination with a semantic decision tree, and uses the relative correlation between substations to fuse image features on the basis of increasing the judging criteria to obtain the true type. This greatly improves the accuracy of image recognition of substations, and has a positive role in the research and development of automatic inspection equipment for inspection robots.

Cross-view image optimizing method, apparatus, computer equipment, and readable storage medium
11611733 · 2023-03-21 · ·

Disclosed is a cross-view image optimizing method and apparatus, and a computer equipment and a readable storage medium. The method includes: acquiring a sample image and a pre-trained cross-view image generating model; generating an multi-dimensional cross-view image of the sample image by a multi-dimensional feature extracting module of the first generator to obtain dimension features and cross-view initial images at multiple dimensions; obtaining a multi-dimensional feature map with corresponding dimension features by the second generator; inputting the multi-dimensional feature map to a multi-channel attention module of the second generator for feature extraction and calculating a feature weight of each attention channel, obtaining attention feature images, attention images and feature weights in a preset number of the attention channels; and weighting and summing the attention images and the attention feature images of all the channels according to the feature weights, and obtaining a cross-view target image.

Cross-view image optimizing method, apparatus, computer equipment, and readable storage medium
11611733 · 2023-03-21 · ·

Disclosed is a cross-view image optimizing method and apparatus, and a computer equipment and a readable storage medium. The method includes: acquiring a sample image and a pre-trained cross-view image generating model; generating an multi-dimensional cross-view image of the sample image by a multi-dimensional feature extracting module of the first generator to obtain dimension features and cross-view initial images at multiple dimensions; obtaining a multi-dimensional feature map with corresponding dimension features by the second generator; inputting the multi-dimensional feature map to a multi-channel attention module of the second generator for feature extraction and calculating a feature weight of each attention channel, obtaining attention feature images, attention images and feature weights in a preset number of the attention channels; and weighting and summing the attention images and the attention feature images of all the channels according to the feature weights, and obtaining a cross-view target image.

EXPRESS TRACKING FOR PATIENT FLOW MANAGEMENT IN A DISTRIBUTED ENVIRONMENT
20230079032 · 2023-03-16 ·

The present invention relates to systems and methods of express tracking for patient flow management in a distributed environment. Particularly, aspects are directed to a computer implemented method that includes initiating a check-in process that includes prompting a user to scan an identifier, processing the identifier using a cascade machine-learning architecture comprising of a multi-task convolutional neural network model and a recurrent neural network model to obtain the classification of the identifier and member identification information, determining whether the user is known user based on the member identification information, when the user is a known user, verifying user data saved in the computing system associated with the identifier, once the user data is verified, determining whether the user has a scheduled appointment; and when the user has a scheduled appointment, checking the user in for the scheduled appointment.