Patent classifications
G06V10/28
Multimodal foreground background segmentation
The subject disclosure is directed towards a framework that is configured to allow different background-foreground segmentation modalities to contribute towards segmentation. In one aspect, pixels are processed based upon RGB background separation, chroma keying, IR background separation, current depth versus background depth and current depth versus threshold background depth modalities. Each modality may contribute as a factor that the framework combines to determine a probability as to whether a pixel is foreground or background. The probabilities are fed into a global segmentation framework to obtain a segmented image.
Multimodal foreground background segmentation
The subject disclosure is directed towards a framework that is configured to allow different background-foreground segmentation modalities to contribute towards segmentation. In one aspect, pixels are processed based upon RGB background separation, chroma keying, IR background separation, current depth versus background depth and current depth versus threshold background depth modalities. Each modality may contribute as a factor that the framework combines to determine a probability as to whether a pixel is foreground or background. The probabilities are fed into a global segmentation framework to obtain a segmented image.
Systems and methods for mobile image capture and content processing of driver's licenses
Systems and methods are provided for processing and extracting content from an image captured using a mobile device. In one embodiment, an image is captured by a mobile device and corrected to improve the quality of the image. The corrected image is then further processed by adjusting the image, identifying the format and layout of the document, binarizing the image and extracting the content using optical character recognition (OCR). Multiple methods of image adjusting may be implemented to accurately assess features of the document, and a secondary layout identification process may be performed to ensure that the content being extracted is properly classified.
Systems and methods for mobile image capture and content processing of driver's licenses
Systems and methods are provided for processing and extracting content from an image captured using a mobile device. In one embodiment, an image is captured by a mobile device and corrected to improve the quality of the image. The corrected image is then further processed by adjusting the image, identifying the format and layout of the document, binarizing the image and extracting the content using optical character recognition (OCR). Multiple methods of image adjusting may be implemented to accurately assess features of the document, and a secondary layout identification process may be performed to ensure that the content being extracted is properly classified.
Automated gauge reading and related systems, methods, and devices
Computing devices and methods for reading gauges are disclosed. A gauge reading method includes capturing image data corresponding to a captured image of one or more gauges, detecting one or more gauges in the captured image, cropping a detected gauge in the captured image to provide a use image including the detected gauge, and classifying the detected gauge to correlate the detected gauge with a template image. The gauge reading method also includes attempting to perform feature detection rectification on the use image to produce a rectified image of the detected gauge, performing template matching rectification on the use image to produce the rectified image responsive to a failure to perform the feature detection rectification, and estimating a gauge reading responsive to the rectified image. A computing device may implement at least a portion of a gauge reading method.
ASSISTING MEDICAL PROCEDURES WITH LUMINESCENCE IMAGES PROCESSED IN LIMITED INFORMATIVE REGIONS IDENTIFIED IN CORRESPONDING AUXILIARY IMAGES
A solution is proposed for assisting a medical procedure. A corresponding method comprises acquiring a luminescence image (205F), based on a luminescence light, and an auxiliary image (205R), based on an auxiliary light different from this luminescence light, of a field of view (103); the field of view (103) contains a region of interest comprising a target body of the medical procedure (containing a luminescence substance) and one or more foreign objects. An auxiliary informative region (210Ri) representative of the region of interest without the foreign objects is identified in the auxiliary image (205R) according to its content, and a luminescence informative region (210Fi) is identified in the luminescence image (205F) according to the auxiliary informative region (210Ri). The luminescence image (205F) is processed limited to the luminescence informative region (210Fi) for facilitating an identification of a representation of the target body therein. A computer program and a corresponding computer program product for implementing the method are also proposed. Moreover, a computing device for performing the method and an imaging system comprising it are proposed. A medical procedure based on the same solution is further proposed.
ASSISTING MEDICAL PROCEDURES WITH LUMINESCENCE IMAGES PROCESSED IN LIMITED INFORMATIVE REGIONS IDENTIFIED IN CORRESPONDING AUXILIARY IMAGES
A solution is proposed for assisting a medical procedure. A corresponding method comprises acquiring a luminescence image (205F), based on a luminescence light, and an auxiliary image (205R), based on an auxiliary light different from this luminescence light, of a field of view (103); the field of view (103) contains a region of interest comprising a target body of the medical procedure (containing a luminescence substance) and one or more foreign objects. An auxiliary informative region (210Ri) representative of the region of interest without the foreign objects is identified in the auxiliary image (205R) according to its content, and a luminescence informative region (210Fi) is identified in the luminescence image (205F) according to the auxiliary informative region (210Ri). The luminescence image (205F) is processed limited to the luminescence informative region (210Fi) for facilitating an identification of a representation of the target body therein. A computer program and a corresponding computer program product for implementing the method are also proposed. Moreover, a computing device for performing the method and an imaging system comprising it are proposed. A medical procedure based on the same solution is further proposed.
ELECTRONIC DEVICE, CONTENTS SEARCHING SYSTEM AND SEARCHING METHOD THEREOF
Optical flow information is determined and used to identify a video clip from among a sequence of frames. The video clip may be identified based on motion features derived in part from the optical flow information. In some embodiments, semantic information is concatenated with the motion features derived from the optical flow information.
NON-INTRUSIVE DETECTION METHOD AND DEVICE FOR POP-UP WINDOW BUTTON
A non-intrusive detection method for detecting at least one pop-up window button of the pop-up window includes the following steps: retrieving a screen image on a display device; comparing the screen image with a preset screen image and generating a differential image area according the screen image and the preset screen image; determining the differential image area as the pop-up window when the differential image area is greater than an image area threshold value; selecting a plurality of contour lengths of the pop-up window matching up with a contour length threshold value by Canny edge detector; and analyzing the contour lengths according to Douglas-Peucker algorithm and an amount of endpoints to generate a contour edge corresponding to the pop-up window button.
Systems and methods for automatic image capture on a mobile device
Real-time evaluation and enhancement of image quality prior to capturing an image of a document on a mobile device is provided. An image capture process is initiated on a mobile device during which a user of the mobile device prepares to capture the image of the document, utilizing hardware and software on the mobile device to measure and achieve optimal parameters for image capture. Feedback may be provided to a user of the mobile device to instruct the user on how to manually optimize certain parameters relating to image quality, such as the angle, motion and distance of the mobile device from the document. When the optimal parameters for image capture of the document are achieved, at least one image of the document is automatically captured by the mobile device.