METHOD AND DEVICE FOR FUSION OF IMAGES
20220207673 · 2022-06-30
Inventors
Cpc classification
G06T7/30
PHYSICS
International classification
Abstract
A method of fusing images includes obtaining an optical image of a first scene with a first camera. A thermal image of the first scene is obtained with a second camera. The optical image is fused with the thermal image to generate a fused image.
Claims
1. A method of fusing images comprising: obtaining an optical image of a first scene with a first camera; obtaining a thermal image of the first scene with a second camera; and fusing the optical image with the thermal image to generate a fused image.
2. The method of claim 1, wherein fusing the optical image with the thermal image includes matching pixels from the optical image with corresponding pixels from the thermal image.
3. The method of claim 2, wherein the fused image includes a field of view less than or equal to a field of view of the optical image and a field of view of the thermal image.
4. The method of claim 3, wherein fusing the optical image with the thermal image includes cropping a portion of at least one of the optical image and the thermal image from the fused image.
5. The method of claim 1, wherein the optical image includes a different perspective of the first scene from the thermal image.
6. The method of claim 1, wherein the optical image includes different distortions of objects in the first scene from the thermal image.
7. The method of claim 1, wherein at least one object appears in one of the thermal image or the optical image but not the other of the thermal image or the optical image.
8. The method of claim 1, wherein fusing the optical image with the thermal image is performed by an algorithm on a central processing unit of a controller.
9. The method of claim 8, wherein the algorithm relies on machine learning for fusing the optical image with the thermal image.
10. The method of claim 1, including performing object detection on the fused image to identify at least one relevant portion of the fused image.
11. The method of claim 9, wherein the machine learning occurs on a central processing unit.
12. The method of claim 1, wherein the first scene includes low light conditions.
13. An assembly for capturing and fusing images comprising: an optical camera; a thermal camera; a controller in communication with the optical camera and the thermal camera, wherein the controller is configured for: obtaining an optical image of a first scene with a first camera; obtaining a thermal image of the first scene with a second camera; and fusing the optical image with the thermal image to generate a fused image.
14. The assembly of claim 13, wherein fusing the optical image with the thermal image includes matching pixels from the optical image with corresponding pixels from the thermal image.
15. The assembly of claim 14, wherein the fused image includes a field of view less than or equal to a field of view of the optical image and a field of view of the thermal image.
16. The assembly of claim 3, wherein fusing the optical image with the thermal image includes cropping a portion of at least one of the optical image and the thermal image from the fused image.
17. The assembly of claim 13, wherein fusing the optical image with the thermal image is performed by an algorithm on a central processing unit of the controller.
18. The assembly of claim 17, wherein the algorithm relies on machine learning for fusing the optical image with the thermal image.
19. The assembly of claim 13, including performing object detection on the fused image to identify at least one relevant portion of the fused image.
20. The assembly of claim 19, wherein performing the object detecting occurs on a graphics processing unit.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0023] The various features and advantages of the present disclosure will become apparent to those skilled in the art from the following detailed description. The drawings that accompany the detailed description can be briefly described as follows.
[0024]
[0025]
[0026]
[0027]
[0028]
[0029]
DETAILED DESCRIPTION
[0030]
[0031] The optical camera 24 and the thermal camera 26 are each in electrical communication with a controller 30 on the vehicle 20 (See
[0032]
[0033]
[0034]
[0035]
[0036] In the illustrated example, the fused image 52 is generated entirely with the controller 30. In particular, the fused image 52 is generated through the use of only the CPU on the controller 30. If a graphics processing unit (GPU) was incorporated into the controller 30, it was not needed to the generate the fused image 52. This is accomplished by using the two-dimensional images 32, 34 from the optical camera 24 and the thermal camera 26, respectively. These images 32, 34 require significantly less storage space than a three-dimensional rendering generated by a system incorporating lidar or another three-dimensional image generating approach. Because the images 32, 34 are so much smaller in data size, the CPU on the controller 30 can manage fusing the images 32, 34 into the fused image 52. This reduces the cost of the system by being able to use a lower cost controller 30 with less processing power and may even eliminate the need for the controller 30 to have a GPU.
[0037] Additionally, the generation of the fused image 52 reduces the computation steps needed to perform object detection on the scene 36 by another processing unit or a portion of the controller 30. For example, if an object detection algorithm was performed separately on both the optical image 32 and the thermal image 34, more time would be required to perform the two separate object detections. Furthermore, the two separate object detections would need to be merged into a single view to show all of the identified objects from each image. However, once the fused image 52 is generated, only a single object detection algorithm would need to be performed on the fused image 52. This further reduces the computing power needed for systems performing object detection. Furthermore, the fused image could be used in a process other than object detection.
[0038]
[0039] Once the optical image 32 and the thermal image 34 have been obtained, the controller 30 can fuse the optical image 32 and the thermal image 34 into the fused image 52 (Step 130). The fusing process includes matching the optical image 32 on a pixel-by-pixel basis with the thermal image 34. As shown in
[0040] Additionally, because portions of the optical image 32 do not appear in the thermal image 34 and vice versa, the fused image 52 will crop portions from the fields of view 46 and 48 when generating the fused image 52. Therefore, the field of view 46 and the field of view 48 should be large enough to provide adequate overlapping coverage of the scene 36 to generate the fused image 52 with a large enough portion of the scene 36 needed to identify relevant objects if used in object detection. Accordingly, the field of view 50 of the fused image 52 is less than or equal to the field of view 46 of the optical image 32 and the field of view 48 of the thermal image 34.
[0041] The fusing of the optical image 32 with the thermal image 34 is performed entirely on the CPU in the controller 30 as opposed to being performed on a GPU on the controller 30. The fusing process of the optical image 32 with the thermal image 34 is also performed on a real time basis. One factor contributing to the ability to perform the fusing process on the CPU as opposed to the GPU is the smaller file size of the optical image 32 and the thermal image 34. In particular, some other imaging processors perform object detection on three dimensional space which requires significantly more computing power of a GPU and more storage.
[0042] The fusing of the optical image 32 with the thermal image 34 occurs through an algorithm stored on the controller 30. In one example, the algorithm can perform the fusing process through the use of machine learning to find corresponding pixels in the optical image 32 and the thermal image 34. Because the optical image 32 and the thermal image 34 are able to be fused into a single fused image 52, an object detection algorithm can be run on the single fused image 52 instead of running the object detection algorithm separately for both the optical image 32 and the thermal image 34.
[0043] By performing object detection on the fused image 52 instead, the number of images that the object detection must be performed on is reduced in half and frees up computing power when performing the object detection. For example, without the fusing process, the object detection algorithm would have to identify relevant objects, such as pedestrians, in both the optical image 32 and the thermal image 34 and then the relevant objects from each would need to be combined into a single representation of all of the relevant objects. However, utilizing the disclosed fusing process, the object detection algorithm can run on the fused image 52 with the relevant portions being identified or highlighted in the fused image 52. Additionally, because the optical image 32 and the thermal image 34 are matched on a pixel by pixel basis, any relevant objects identified in the fused image 52 can be transferred to the either the optical image 32 or the thermal image 34. This allows a user to see the objects detected on the optical image 32 which may be easier to relate the scene 36 than the fused image 52 or the thermal image 34.
[0044] Although the different non-limiting examples are illustrated as having specific components, the examples of this disclosure are not limited to those particular combinations. It is possible to use some of the components or features from any of the non-limiting examples in combination with features or components from any of the other non-limiting examples.
[0045] It should be understood that like reference numerals identify corresponding or similar elements throughout the several drawings. It should also be understood that although a particular component arrangement is disclosed and illustrated in these exemplary embodiments, other arrangements could also benefit from the teachings of this disclosure.
[0046] The foregoing description shall be interpreted as illustrative and not in any limiting sense. A worker of ordinary skill in the art would understand that certain modifications could come within the scope of this disclosure. For these reasons, the following claim should be studied to determine the true scope and content of this disclosure.