Method and Apparatus for Detecting Missed Areas during Endoscopy
20210274089 · 2021-09-02
Assignee
Inventors
- Kang-Huai Wang (Saratoga, CA)
- Chenyu Wu (Sunnyvale, CA)
- Mark Hadley (Los Altos, CA, US)
- Gordon C. Wilson (San Francisco, CA)
Cpc classification
A61B5/004
HUMAN NECESSITIES
H04N23/555
ELECTRICITY
G06T3/4038
PHYSICS
H04N23/64
ELECTRICITY
International classification
G06T3/40
PHYSICS
Abstract
A method of processing images captured using an endoscope comprising a camera is disclosed. According to this method, regular images captured by the camera are received while the endoscope is maneuvered by an operator to travel through a human gastrointestinal (GI) tract. The regular images are mosaicked to determine any missed or insufficiently imaged area in a section of the human GI tract already travelled by the endoscope. If any missed or insufficiently imaged area is detected, information regarding any missed or insufficiently imaged area is provided to the operator.
Claims
1. A method of processing images captured using an endoscope comprising a camera, the method comprising: receiving regular images captured by the camera while the endoscope is maneuvered by an operator to travel through a human anatomical lumen; mosaicking the regular images to determine any missed area in a section of the human anatomical lumen travelled by the endoscope; and if any missed area is detected, providing information regarding any missed area to the operator.
2. The method of claim 1, further comprising receiving structured light images associated with the regular images from the camera; and deriving distance information of the regular images based on the structured light images; wherein the structured light images are captured by the camera while the endoscope is maneuvered by the operator to travel through the human anatomical lumen.
3. The method of claim 2, wherein the distance information of the regular images is used to assist said mosaicking the regular images.
4. The method of claim 3, wherein the regular images are normalized according to the distance information of the regular images and optical magnification information to facilitate said mosaicking the regular images.
5. The method of claim 2, wherein the distance information is used to determine whether a target area in one regular image is out of focus or not and if the target area is out of focus in all regular images covering the target area, information of the target area is provided to the operator.
6. The method of claim 2, wherein the endoscope further comprises a motion sensing device to measure camera motion inside the human anatomical lumen.
7. The method of claim 6, wherein the motion sensing device corresponds to an accelerometer or a gyrator.
8. The method of claim 6, wherein the motion sensing device is used to determine camera movement, camera trajectory, camera orientation or any combination thereof.
9. The method of claim 8, wherein said mosaicking the regular images is performed in a space based on the camera trajectory.
10. The method of claim 1, wherein said providing the information regarding any missed area to the operator comprises displaying the regular images with highlight on any missed area.
11. The method of claim 1 further comprising generating a 2D or 3D mosaicked image and displaying the 2D or 3D mosaicked image on a display device with any missed area highlighted.
12. The method of claim 1, wherein the endoscope further comprises a motion sensing device to measure camera motion inside the human anatomical lumen.
13. The method of claim 12, wherein the motion sensing device corresponds to an accelerometer or a gyrator.
14. The method of claim 12, wherein the motion sensing device is used to determine camera movement, camera trajectory, camera orientation or any combination thereof.
15. The method of claim 14, wherein said mosaicking the regular images is performed in a space based on the camera trajectory.
16. The method of claim 1 further comprising generating a 2D or 3D mosaicked image and storing the 2D or 3D mosaicked image.
17. The method of claim 16, wherein information of the 2D or 3D mosaicked image stored is used by the operator during withdraw process of the endoscope to re-image any missed area.
18. The method of claim 16, wherein information of the 2D or 3D mosaicked image stored is used in a subsequent colonoscopy of a same patient.
19. The method of claim 1, wherein a target area in the regular images is lack of parallax, the target area is determined as one missed area.
20. The method of claim 1, wherein a target area in the regular images is lack of a surface area in mosaicked result, the target area is determined as one missed area.
21. The method of claim 1 further comprises deriving camera position, camera movement, camera orientation or a combination thereof inside the human anatomical lumen by using motion estimation based on the regular images.
22. A non-transitory computer-readable medium having stored thereon a computer-readable code executable by a processor to cause the processor to: receive regular images captured by a camera while an endoscope is maneuvered by an operator to travel through a human anatomical lumen; mosaic the regular images to determine any missed area in a section of the human anatomical lumen travelled by the endoscope; and if any missed area is detected, provide information regarding any missed area to the operator.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0015]
[0016]
[0017]
DETAILED DESCRIPTION OF THE INVENTION
[0018] It will be readily understood that the components of the present invention, as generally described and illustrated in the figures herein, may be arranged and designed in a wide variety of different configurations. Thus, the following more detailed description of the embodiments of the systems and methods of the present invention, as represented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention. References throughout this specification to “one embodiment,” “an embodiment,” or similar language mean that a particular feature, structure, or characteristic described in connection with the embodiment may be included in at least one embodiment of the present invention. Thus, appearances of the phrases “in one embodiment” or “in an embodiment” in various places throughout this specification are not necessarily all referring to the same embodiment.
[0019] Furthermore, the described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. One skilled in the relevant art will recognize, however, that the invention can be practiced without one or more of the specific details, or with other methods, components, etc. In other instances, well-known structures, or operations are not shown or described in detail to avoid obscuring aspects of the invention. The illustrated embodiments of the invention will be best understood by reference to the drawings, wherein like parts are designated by like numerals throughout. The following description is intended only by way of example, and simply illustrates certain selected embodiments of apparatus and methods that are consistent with the invention as claimed herein.
[0020] A colonoscopy enables your doctor to examine the lining of your rectum, colon (large intestine) and the lowest part of the small intestine (known as the ileum) for any abnormalities. During colonoscopy, an endoscope is inserted from the anus and then slowly advanced into the rectum, colon and ileum. The colonoscopy is intended to cover all the surface areas of the portion of the human gastrointestinal (GI) tract travelled by the colonoscope. However, the colonoscope has a limited field of view (FOV). In addition, due to various difficult conditions in the GI tract, such as sharp bends, folds, twists and turns in the colon, they present a great challenge to maneuver the endoscope for imaging the GI tract with complete coverage.
[0021] In order to deal with the missed area problem, the present invention discloses techniques based on picture mosaicking to identify possible missed areas or ill-imaged areas (also termed as insufficiently imaged areas) during the colonoscopy. The insufficiently imaged areas refer to areas that do not provide sufficient image quality for picture mosaicking. The insufficiently imaged areas may due to objects being out of focus, objects being partially occluded, etc. The picture mosaicking may correspond to a 3D mosaicking (i.e., 3D registration) or 2D mosaicking (i.e., image stitching). If the missed or insufficiently imaged area can be timely detected for the section imaged by the endoscope, it is possible to alert the colonoscopist about the missed area so that corrective action may be taken. The stitched image can provide a panoramic view of the anatomical lumen. Accordingly, the stitched image is also referred as a panorama image in this disclosure. In order to make timely alert to the colonoscopist, the detection of missed of insufficiently imaged area has to be done in real time or near real time. For example, the detection has to be completed in few seconds or less. The key idea behind the present invention is that the colon is a closed tract so that the images taken from the colonoscope can be stitched the imaged area to form a closed tract without any missed of insufficiently imaged area. If there is any missed or insufficiently imaged area, it implies that there is an area on the tract un-imaged or ill-imaged.
[0022] In order to alert the colonoscopist about a missed area, one embodiment of the present invention will display the stitched image with the un-imaged area highlighted. The highlight of the missed area can be in the form of outline of the missed area. In another example, the highlight of the missed area can be in the form of overlay color. In another embodiment, an audible sound or other notifying signal may be provided to the colonoscopist.
[0023] Various missed area detection methods are disclosed in the present invention. In one embodiment, if an area in the captured image is lack of parallax, the area is considered as a candidate for missed area or insufficiently imaged area. As known in the field, parallax is a displacement or difference in the apparent position of an object viewed along two different lines of sight. The parallax is measured by the angle or semi-angle of inclination between those two lines. Due to foreshortening, nearby objects show a larger parallax than farther objects when observed from different positions. Therefore, the parallax has been used to determine distances in various applications. According to one embodiment of the present invention, picture mosaicking identifies a corresponding target area in multiple captured images and stitches the corresponding target area using the multiple captured images. If such parallax is not available for a corresponding target area, the corresponding target area is determined as a candidate for missed area or insufficiently imaged area.
[0024]
[0025] For the condition described in
[0026] For the intended application, in addition to viewing the captured images and being aware of possible missed or insufficiently imaged area, it is also important to let the colonoscopist know the location or relative location of the camera. Therefore, when there is a need to maneuver the camera to cover a missed or insufficiently imaged area, the colonoscopist can be timely informed about the needed movement. In order to determine the camera location and/or orientation inside the GI tract, in one embodiment of the present invention, the colonoscope is further be equipped with a motion sensing device, such as an accelerometer or gyrator. Therefore, the movement and the trajectory of the camera as well as its orientation can be determined so that the doctor can be better informed of the colon structure and relative position between the camera and the surrounding GI tract associated with the image being displayed. In one embodiment the stitched image could be formed in the space based on the trajectory of the camera movement.
[0027] In another embodiment, the endoscope can be equipped with structured lights to capture regular images along with structured light images. The structured light images (SLIs) can be used to derive the distance of the different areas in the image so that the stitching can be more accurate by using the 3D information of the imaged areas. Furthermore, with the more accurate starting point in iteration of the stitching process, the stitched image can avoid falling into local minimum. In U.S. Pat. No. 9,936,151 that is being incorporated by reference, a camera equipped with a regular light and structured light is disclosed to capture both regular images and structured light images using the same image sensor. The structured light images and corresponding regular images are captured temporally close so that the motion in between is expected to be small. Accordingly, the distance information derived from the structured light image correlates highly with the regular image. Details regarding the camera design with SLI capture capability are disclosed in U.S. Pat. No. 9,936,151.
[0028] These 2D or 3D mosaicked structure may help the endoscopy procedure. For example, the 2D or 3D mosaicked structure may be by a different colonoscopist so that the anatomy information can help the colonoscopist to steer within the lumen. This concept, as well as the other concepts in the invention, could be applied to GI tracts or organ or other areas of the body, such as bladder.
[0029] In addition to the fact that the lumen surface lacks the sharp edges and features to facilitate registration, some studies (e.g., J. Davis, “Mosaics of scenes with moving objects,” Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No. 98CB36231), Santa Barbara, Calif., 1998, pp. 354-360.) in the field of picture mosaicking have found that the close distance between object and camera makes the stitching more difficult. Therefore, the distance information is extremely important for GI images since the distance is typically very close. In one embodiment, during the operation of the colonoscopy, only regular images are displayed on the screen and structured light images are not displayed. However, the structured light images can be fed through the computer or a processor for calculating distance information of the regular image. In one embodiment, the regular images are normalized according to the distance information and optical magnification information to facilitate stitching. In another embodiment, the stitched 2D or 3D structure can be displayed in real time or near real time. In yet another embodiment the missed area is highlighted on the stitched 2D or 3D structure. The stitched image can be shown in the form of a surface of a tract. The stitching algorithm can be rigid or non-rigid to take into account some movement of the bowel during the colonoscopy.
[0030] In the case of colonoscopy equipped with structured light to capture SLIs and deriving the distance information, the regular image with areas out of focus can be determined. During stitching, the overlapped area should be dominated by the pixels properly focused. If there is an area where all the images covering this area are all out of focus for this area, then this area is a candidate for ill-imaged area. This area should be highlighted or a notice should be provided to the colonoscopist.
[0031] In another embodiment, the colonoscope can be equipped with a motion sensing device (e.g. accelerator or gyrator, etc) to determine its movement and the trajectory of the camera as well as one or more structured lights to capture SLIs and derive corresponding distance information. By combining the structured light and the motion sensing device, an accurate 3D structure of the GI tract can be built in real time. Based on the accurate 3D structure of the GI tract, the colonoscopy system can be configured to highlight any un-examined area and to alert the doctor of the area un-examined. Furthermore, the colonoscopy system can be configured to assist the doctor during colonoscopy by knowing physiological topology and the pathological topology when existing. Alternatively, these information can be recorded and used later.
[0032] This invention can also be used in other GI tracts, such as stomach, or non-GI areas such as bladder.
[0033]
[0034] The method may be implemented in software for execution by various types of processors. For example, the processor may correspond a computer associated with the colonoscopy examination station or a workstation coupled to receive images from the colonoscope.
[0035] The above description is presented to enable a person of ordinary skill in the art to practice the present invention as provided in the context of a particular application and its requirements. Various modifications to the described embodiments will be apparent to those with skill in the art, and the general principles defined herein may be applied to other embodiments. Therefore, the present invention is not intended to be limited to the particular embodiments shown and described, but is to be accorded the widest scope consistent with the principles and novel features herein disclosed. In the above detailed description, various specific details are illustrated in order to provide a thorough understanding of the present invention. Nevertheless, it will be understood by those skilled in the art that the present invention may be practiced.
[0036] The invention may be embodied in other specific forms without departing from its spirit or essential characteristics. The described examples are to be considered in all respects only as illustrative and not restrictive. The scope of the invention is, therefore, indicated by the appended claims rather than by the foregoing description. All changes which come within the meaning and range of equivalency of the claims are to be embraced within their scope.