System and Method for Generating High-Resolution Stereo Image and Depth Map
20190392555 ยท 2019-12-26
Assignee
Inventors
Cpc classification
G06T3/40
PHYSICS
H04N13/122
ELECTRICITY
H04N13/133
ELECTRICITY
H04N2013/0081
ELECTRICITY
G06T3/20
PHYSICS
H04N13/25
ELECTRICITY
International classification
Abstract
A system and method for generating high-resolution stereo images and depth map in multi-camera systems having multiple cameras with different resolutions and view angles. One method is to improve the lower resolution image and combining it with the higher resolution image, then the resulting image is processed by extensive algorithms to ensure utmost high quality. The system can also handle non-planar image contents. The process is to generate a crude depth map first and then divide the map into multiple layers. Each layer will be separately registered. The results from the registered layers will be merged to improve the depth map generation. The improved depth map could be repeatedly fed back to the beginning of the process to further improve the registration performance. The system and method can generate stereo images using uncalibrated cameras with different views and resolutions.
Claims
1. A system for generating high-resolution stereo image and depth map comprising: a first camera having a first image; a second camera having a second image; an up-sampler connected to the first image; a two-step image alignment module is connected to a first output of the up-sampler and the second image; a pan-sharpening module is connected to a second output of the up-sampler; a pan band creator is connected to an output of the two-step image alignment module to provide an input to the pan-sharpening module for producing a first high resolution image; a stereo image generator for combining the first high resolution image and the second image to generate a stereo image; a disparity map generator receives the stereo image to generate a disparity map; and the generated disparity map is connected to a first input of a depth map generator.
2. A system for generating high-resolution stereo image and depth map as claim in claim 1, wherein, the first image is a low resolution image, and the second image is a high resolution image.
3. A system for generating high-resolution stereo image and depth map as claim in claim 2, further comprising: a divider connected between the depth map generator and the two-step image alignment module; and the depth map generator receives a third output from the up-sampler.
4. A system for generating high-resolution stereo image and depth map as claim in claim 3, wherein, the divider divides an output of the depth map generator into multiple levels to manage any non-coplanar image contents.
5. A system for generating high-resolution stereo image and depth map as claim in claim 1, wherein, the first and second cameras are uncalibrated with different views and resolutions.
6. A method for generating high-resolution stereo image and depth map comprising the steps of: upsampling a first image; aligning the upsampled first image with a second image using a two-step image alignment process; creating a Panchromatic (pan) band using multispectral bands of the second image; pansharpening the upsampled first image with the pan band created by the second image to generate a high resolution left image; forming a stereo image using the high resolution left image and the second image; generating a disparity map using the stereo image; and generating a first depth map using the disparity map.
7. A method for generating high-resolution stereo image and depth map as claim in claim 6, wherein the two-step image alignment step further comprising the steps of: combining the upsampled first image and the second image to generate a second depth map; dividing the second generated depth map into multiple levels; registering the multiple levels individually; merging all the registered levels to improve the second depth map; aligning the divided depth map with the upsampled left image.
8. A method for generating high-resolution stereo image and depth map as claim in claim 7, wherein, the second image is of high resolution; and the improved depth map is repeatedly fed back to the first depth map generating step to further improve the registering performance.
9. A method for generating high-resolution stereo image and depth map as claim in claim 7, wherein, the registering step is performed by Scale Invariant Features Transform (SIFT) or Speeded Up Robust Features (SURF) with Random Sample Consensus (RANSAC).
10. A method for generating high-resolution stereo image and depth map as claim in claim 6, wherein, the pansharpening step is performed by Gram-Schmidt Adaptive (GSA) algorithm.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0017]
[0018]
[0019]
[0020]
[0021]
[0022]
[0023]
[0024]
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
[0033]
[0034]
[0035]
DETAILED DESCRIPTION OF THE INVENTION
[0036] The present invention utilizes a new approach to high resolution stereo image formation and high resolution depth map generation. As shown in
[0041]
Two-Step Image Registration
[0042] The block diagram of the two-step image alignment approach is shown in Fig. The first step of the two-step image alignment approach is using RANSAC (Random Sample Consensus) technique, Reference [1], for an initial image alignment. In this first step, we use the two RGB images from the left and right imagers. The left image is first upsampled using bicubic interpolation to the same resolution of the right image. First, SURF features, Reference [2] and SIFT features, References [3][4], are extracted from the two images. These features are then matched within the image pair. This is followed by applying RANSAC to estimate the geometric transformation. Assuming the left camera image is the reference image, the right camera image content is then projected into a new image that is aligned with the reference image using the geometric transformation.
[0043] The second step of the two-step alignment approach uses this aligned right image with RANSAC and the left camera image as inputs and applies the Diffeomorphic Registration, Reference [3], technique. Diffeomorphic Registration is formulated as a constrained optimization problem, which is solved with a step-then-correct strategy, Reference [3]. This second step reduces the registration errors to subpixel levels and makes it possible to conduct accurate pansharpening.
Pansharpening Algorithm
[0044] The goal of pansharpening, References [5]-[14], is to fuse a low-spatial resolution left image with a high-spatial resolution panchromatic image (pan) from the right camera. In the present invention, after the two-step registration, the image from the left camera can be considered as a blurred version of the right one. The images from the left camera are sharpened by pansharpening using high spatial resolution images from the right camera as the panchromatic reference image.
[0045] Pansharpening techniques can be classified into two main categories:
[0046] (1) the Component Substitution (CS) approach; and
[0047] (2) the MultiResolution Analysis (MRA) approach.
[0048] The CS approach is based on the substitution of a component with the pan image and the MRA approach relies on the injection of spatial details that are obtained through a multiresolution decomposition of the pan image into the resampled MS bands. In the present invention, we focus on the CS-based approach. Under the assumption that the components containing the spatial structure of multispectral images at all spectral bands are highly correlated, the transformed low-resolution MS images can be enhanced by substituting the components containing the spatial structure of the pan image through a histogram matching. The output pansharpened data are finally achieved by applying the inverse transformation to project the data back to the original space.
[0049] The Gram-Schmidt Adaptive (GSA), Reference [5], algorithm is applied for its simplicity and performance in our experiments.
Stereo Image Formation
[0050]
[0051]
Disparity Estimation
[0052] Disparity is the difference between two pixels that correspond to the same physical point in the stereo image pair. Once the stereo images are created, a feature correspondence process is needed to determine the pixels that belong to the same physical point. Based the feature correspondence results, the disparity map is computed for every pixel in the image.
Depth Estimation
[0053]
L=Bf/X
where B is the baseline between the two cameras, f is the focal length, and X is the disparity at a particular pixel.
Experimental Results
[0054] In the following figures,
[0055] The stereo images could then be created by using two methods:
[0056] 1. to use the low resolution left image and the downsampled right image; and
[0057] 2. to use the high resolution left and right images.
[0058] Both stereo images are shown in
[0059]
[0060] In
[0061] It will be apparent to those skilled in the art that various modifications and variations can be made to the system and method of the present disclosure without departing from the scope or spirit of the disclosure. It should be perceived that the illustrated embodiments are only preferred examples of describing the invention and should not be taken as limiting the scope of the invention.
REFERENCES
[0062] [1] R. Hartley and A. Zisserman, Multiple View Geometry in Computer Vision, Cambridge University Press, 2003. [0063] [2] H. Bay, A. Ess, T. Tuytelaars, and L. Van Gool. SURF: Speeded Up Robust Features, Computer Vision and Image Understanding (CVIU). Vol. 110, No. 3, pp. 346-359, 2008. [0064] [3] B. Ayhan, M. Dao, C. Kwan, H. Chen, J. F. Bell III, and R. Kidd, A Novel Utilization of Image Registration Techniques to Process Mastcam Images in Mars Rover with Applications to Image Fusion, Pixel Clustering, and Anomaly Detection, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, Vol. 10, Issue: 10, Pages: 4553-4564, 2017. [0065] [4] D. G. Lowe, Object recognition from local scale-invariant features. IEEE International Conference on Computer Vision, vol. 2, pp. 1150-1157, 1999. [0066] [5] G. Vivone, L. Alparone, J. Chanussot, M. Dalla Mura, Garzelli, and G. Licciardi, A critical comparison of pansharpening algorithms, IEEE Int. Conf. Geoscience and Remote Sensing (IGARSS), pp. 191-194, July 2014. [0067] [6] J. Zhou, C. Kwan, and B. Budavari, Hyperspectral Image Super-Resolution: A Hybrid Color Mapping Approach, SPIE Journal of Applied Remote Sensing, Vol. 10, 035024, 2016. [0068] [7] C. Kwan, J. H. Choi, S. Chan, J. Zhou, and B. Budavari, Resolution Enhancement for Hyperspectral Images: A Super-Resolution and Fusion Approach, IEEE International Conference on Acoustics, Speech, and Signal Processing, New Orleans, March 2017. [0069] [8] M. Dao, C. Kwan, B. Ayhan, and J. Bell, Enhancing Mastcam Images for Mars Rover Mission, 14th International Symposium on Neural Networks, Hokkaido, Japan, June 2017. [0070] [9] C. Kwan, B. Budavari, M. Dao, B. Ayhan, and J. F. Bell, Pansharpening of Mastcam images, IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Fort Worth, July 2017. [0071] [10] C. Kwan, B. Ayhan, and B. Budavari, Fusion of THEMIS and TES for Accurate Mars Surface Characterization, IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Fort Worth, July 2017. [0072] [11] C. Kwan, B. Budavari, A. Bovik, and G. Marchisio, Blind Quality Assessment of Fused WorldView-3 Images by Using the Combinations of Pansharpening and Hypersharpening Paradigms, IEEE Geoscience and Remote Sensing Letters, Volume: 14, Issue: 10, pp. 1835-1839, 2017. [0073] [12] B. Ayhan, M. Dao, C. Kwan, H. Chen, J. F. Bell III, and R. Kidd, A Novel Utilization of Image Registration Techniques to Process Mastcam Images in Mars Rover with Applications to Image Fusion, Pixel Clustering, and Anomaly Detection, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, Vol. 10, Issue: 10, Pages: 4553-4564, 2017. [0074] [13] C. Kwan, J. Zhou, and B. Budavari, A New Pansharpening Approach for Hyperspectral Images, Colorimetry and Image Processing, InTech, 2018. [0075] [14] Y. Qu, H. Qi, B. Ayhan, C. Kwan, and R. Kidd, Does Multispectral/Hyperspectral Pansharpening Improve the Performance of Anomaly Detection? IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Fort Worth, July 2017. [0076] [15] X. Li, C. Kwan, and B. Li, Stereo Imaging with Uncalibrated Camera, Advances in Visual Computing, Second International Symposium, ISVC 2006, Lake Tahoe, N.V., USA, Nov. 6-8, 2006. [0077] [16] Y. Qu, H. Qi, B. Ayhan, C. Kwan, and R. Kidd, Does Multispectral/Hyperspectral Pansharpening Improve the Performance of Anomaly Detection? IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Fort Worth, July 2017.