STENT VISUALIZATION ENHANCEMENT USING CASCADED SPATIAL TRANSFORMATION NETWORK
20230214964 · 2023-07-06
Assignee
Inventors
- Shanhui Sun (Cambridge, MA, US)
- Li Chen (Cambridge, MA, US)
- Yikang Liu (Cambridge, MA, US)
- Xiao Chen (Cambridge, MA, US)
- Zhang Chen (Cambridge, MA, US)
Cpc classification
International classification
Abstract
An apparatus for stent visualization includes a hardware processor that is configured to input one or more stent images from a sequence of X-ray images and corresponding balloon marker location data to a cascaded spatial transform network. The background is separated from the one or more stent images using the cascaded spatial transform network and a transformed stent image with a clear background and a non-stent background image is generated. The stent layer and non-stent layer are generated using a neural network without online optimization. A mapping function f maps the inputs, the sequence images and marker coordinates, into the two single image outputs.
Claims
1. An apparatus for stent visualization, the apparatus comprising a hardware processor that is configured to: transform a first stent image to an image space using a first spatial transformer network (STN0) to generate a first transformed stent image; generate a new background image from the first transformed stent image; transform the new background image to a background image space using a second spatial transformer network (STN1) to generate a non-stent background image (B.sup.k); transform the non-stent background image (B.sup.k) to the image space using the second spatial transformer network (STN1); generate a stent image in the image space from the transformed non-stent background image; and transform the stent image in the image space to a stent image space to generate a clear stent image S.sup.k for the stent visualization.
2. The apparatus according to claim 1, wherein the first stent image is an image from a sequence of image frames and the hardware processor is further configured to generate the clear stent image S.sup.k based on image frames in the sequence of image frames.
3. The apparatus according to claim 2, wherein the hardware processor is further configured to: use an average layer to generate the non-stent background image based on the transformed new background image from the second spatial transformer network (STN1) for the image frames in the sequence of image frames; and use the average layer to generate the clear stent image based on the transformed stent image from the first spatial transformer network (STN0) for the image frames in the sequence of image frames.
4. The apparatus according to claim 1, wherein the hardware processor is further configured to generate the first transformed stent image using balloon marker positions as inputs to the first spatial transformer network (STN0).
5. The apparatus according to claim 1, wherein the hardware processor is further configured to use a minus operation to generate the new background image from the first transformed stent image, wherein an input to the minus operation is a corresponding image from an image sequence.
6. The apparatus according to claim 1, wherein a separated background image is used as an input to the second spatial transformer network to transform the new background image to the background image space.
7. The apparatus according to claim 1, wherein a corresponding image from an image sequence is used as an input to the second spatial transformer network (STN1) to transform the non-stent background image (B.sup.k) to the image space.
8. The apparatus according to claim 1, wherein the hardware processor is configured to use a minus operation to generate the stent image in the image space from the transformed non-stent background image, wherein an input to the minus operation is a corresponding image from an image sequence.
9. The apparatus according to claim 1, wherein the hardware processor is further configured to generate the clear stent image using balloon marker positions as inputs to the first spatial transformer network (STN0).
10. The apparatus according to claim 1, wherein the first spatial transformer network and the second spatial transformer network form a cascaded spatial transformer network.
11. A computer implemented method comprising using a hardware processor to generate a clear stent image and a non-stent background image from image frames of a sequence of image frames, the method comprising using the hardware processor to: transform a first stent image of the sequence of image frames to an image space of the sequence of image frames using a first spatial transformer network (STN0) to generate a first transformed stent image; generate a new background image from the first transformed stent image; transform the new background image to a background image space using a second spatial transformer network (STN1) to generate a non-stent background image (B.sup.k); transform the non-stent background image (B.sup.k) to the image space of the sequence of image frames using the second spatial transformer network (STN1); generate a stent image in the image space of the sequence of image frames from the transformed non-stent background image; and transform the stent image in the image space to a stent image space to generate the clear stent image S.sup.k.
12. The computer implemented method according to claim 11, wherein the method further comprises: using an average layer to generate the non-stent background image based on the transformed new background image from the second spatial transformer network (STN1) for the image frames in the sequence of image frames; and using the average layer to generate the clear stent image based on the transformed stent image from the first spatial transformer network (STN0) for the image frames in the sequence of image frames.
13. The computer implemented method according to claim 11, wherein the method further comprises generating the first transformed stent image using balloon marker positions as inputs to the first spatial transformer network (STN0).
14. The computer implemented method according to claim 11, wherein the method further comprises using a minus operation to generate the new background image from the first transformed stent image, wherein an input to the minus operation is a corresponding image from an image sequence.
15. The computer implemented method according to claim 11, wherein the method further comprises using a separated background image an input to the second spatial transformer network to transform the new background image to the background image space.
16. The computer implemented method according to claim 11, wherein the method further comprises using a corresponding image from an image sequence as an input to the second spatial transformer network (STN1) to transform the non-stent background image (B.sup.k) to the image sequence space.
17. The computer implemented method according to claim 11, wherein the method further comprises using a minus operation to generate the stent image in the image sequence space from the transformed non-stent background image, wherein an input to the minus operation is a corresponding image from an image sequence.
18. The computer implemented method according to claim 11, wherein the method further comprises generating the clear stent image using balloon marker positions as inputs to the first spatial transformer network (STN0).
19. The computer implemented method according to claim 19, wherein the method further comprises generating the non-stent background image using the balloon marker positions as inputs to the second spatial transformer network (STN1), wherein an order of the balloon marker position inputs to the second spatial transformer network (STN1) is reversed relative to an order of the balloon marker position inputs to the first spatial transformer network (STN0).
20. A computer program product comprising a non-transitory computer-readable medium having stored thereon machine readable instructions, which when executed by a computer cause the computer to execute the method according to claim 11.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0024] In the following detailed portion of the present disclosure, the invention will be explained in more detail with reference to the example embodiments shown in the drawings, in which:
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
[0033]
[0034]
[0035]
DETAILED DESCRIPTION OF THE DISCLOSED EMBODIMENTS
[0036] The following detailed description illustrates exemplary aspects of the disclosed embodiments and ways in which they can be implemented. Although some modes of carrying out the aspects of the disclosed embodiments have been disclosed, those skilled in the art would recognize that other embodiments for carrying out or practising the aspects of the disclosed embodiments are also possible.
[0037]
[0038] As illustrated in the example of
[0039] As shown in
[0040] The exemplary frame images of
[0041] One example of a clear stent image S.sup.k is illustrated in
[0042] The term “background image” is a single image that does not include the stent. An example of a non-stent background image B.sup.k is shown in
[0043]
[0044] Three major coordinate systems are relied on. These include a coordinate system for each X-ray image I.sub.n in the X-ray image sequence I.sub.0-I.sub.n, a coordinate system for the stent images S.sup.k, and a coordinate system for the background images B.sup.k. According to the aspects of the disclosed embodiments, in the example of
[0045] In the exemplary cascaded network structure 400 illustrated in
[0046]
[0047] As is illustrated in
[0048] This aligned image or result 502 of the first spatial transformer layer STN0 is used to find a new non-stent background image B.sup.k. In the example of
[0049] The second spatial transformer network STN1 then is used to transform the new non-stent background image 506, which is still in the coordinate system of the original X-ray image, also referred to as the “original image space” to the coordinate system of the prior non-stent background image B.sup.k-1. The result 506 of the minus operation 504 and the non-stent background image B.sup.k-1 are the inputs to the second spatial transformer network STN1.
[0050] The result 508 of the second spatial transformer network STN1 in this example is a new non-stent background image B.sup.k. As shown in
[0051] The spatial transformer network STN1 is then used a second time. Following the average pooling layer 510, the order of the first transformer network STN0 and second transformer network STN1 is reversed. In this example, the new non-stent background image B.sup.k, which is a single image, is transformed by the spatial transformer network STN1 back to the coordinate system of the original image space, namely image I.sub.n.
[0052] The result 512 the second spatial transformer network STN1 is a new non-stent background image transformed to the coordinate system of the original image space I.sub.n. In this example, the result 512 of the second spatial transformer network STN1 and the image frame Jo is processed in the minus layer or operation 514. This result 516 is a stent image.
[0053] The first spatial transformer network STN0 is then used a second time to transform the stent image 516 from Xray image space to the stent image space. In this example, the result 516, together with the balloon marker fixed frame index M.sub.ref and the two balloon marker positions M.sub.0, are the inputs to the first spatial transformer network STN0.
[0054] The result 518 of the first spatial transformation network STN0 in this example is processed by an average layer 520. The output of the average pooling layer 520 is the stent image S.sup.k.
[0055] In one embodiment, when k=1, the non-stent background image B.sup.0 can be initialized as a black image, which contains zero in the image. The stent image S.sup.0 can be initialized as one of the images in the image sequence I.sub.0 to I.sub.n.
[0056] For example, in one embodiment, the stent image S.sup.0 can be the first image I.sub.0 in the image sequence I.sub.0 . . . I.sub.n. During the inference stage, the inputs can be fed directly into the network structure 500. The stent image S.sup.k is then obtained for better visualization.
[0057] The aspects of the disclosed embodiments are not limited to a specific network structure. The image order in the sequence illustrated in
[0058]
[0059]
[0060]
[0061]
[0062] In one embodiment, the first spatial transformer network STN0 and the second spatial transformer network STN1 can be trained separately. For the second spatial transformer network STN1, the network can be trained using supervised learning or unsupervised learning. For unsupervised learning, the loss is based on comparing the transformed Image 1 with Input Image 2. For supervised learning, the ground truth of the transformed Image 1 is needed, which can be generated from any suitable image registration algorithm.
[0063]
[0064] A new background image is generated 1004 from the first transformed stent image. In one embodiment, a minus operation is used with a corresponding image frame from the sequence of image frame as an input to the minus operation.
[0065] The new background image is transformed 1006 to a background image space using a second spatial transformer network (STN1) to generate a non-stent background image (B.sup.k). In one embodiment, a separated background image is an input to the second spatial transformer network (STN1). In one embodiment, these frames are repeated for all available frames and the results processed in an average layer to generate the non-stent background image.
[0066] The non-stent background image (Bk) is transformed 1008 to the image space of the sequence of image frames using the second spatial transformer network (STN1). A corresponding frame image from the sequence of image frames is an input top the second spatial transformer network (STN1). In this example, the second spatial transformer network (STN1) is used twice.
[0067] A stent image is generated 1010 in the image space of the sequence of image frames from the transformed non-stent background image. In one embodiment, the minus operation is used, where the corresponding image frame from the sequence of image frames is an input to the minus operation.
[0068] The result of the minus operation is fed to the first spatial transformer network (STN0) where the generated stent image is transformed 1012 to the stent image space. In one embodiment, the balloon marker positions are an input to the first spatial transformer network. The steps are repeated for all available image frames and an average layer is used to generate the clear stent image Sk.
[0069] As illustrated in
[0070] The output of the neural network 110 is the clear stent image S.sup.k and the non-stent background image B.sup.k. The function “f” is realized via the neural network 110 in conjunction with the operation of the processor 106.
[0071] The apparatus 100 includes suitable logic, circuitry, interfaces and/or code that is configured to carry out and execute the processes described herein. Examples of the apparatus 100 may include, but are not limited to, an application server, a web server, a database server, a file server, a cloud server, or a combination thereof.
[0072] The processor 106 includes suitable logic, circuitry, interfaces and/or code that is configured to process the plurality of images (or the sequence of image frames) by use of the neural network 110. The processor 106 is configured to respond to and process instructions that drive the apparatus 100. Examples of the processor 106 include, but are not limited to, a microprocessor, a microcontroller, a complex instruction set computing (CISC) microprocessor, a reduced instruction set (RISC) microprocessor, a very long instruction word (VLIW) microprocessor, or any other type of processing circuit. Optionally, the processor 106 may be one or more individual processors, processing devices and various elements associated with a processing device that may be shared by other processing devices. Additionally, the one or more individual processors, processing devices and elements are arranged in various architectures for responding to and processing the instructions that drive the apparatus 100. In one embodiment, the processor 106 is a hardware processor configured to execute machine readable instructions to carry out the processes generally described herein.
[0073] In one embodiment, the neural network 110 refers to an artificial neural network configured to receive an input, compress the input, and decompress the compressed input to generate an output such that the generated output is similar to the received input. Alternatively stated, the neural network 110 is used to reduce the size of input data into a smaller representation, and whenever original data is needed, it can be reconstructed from the compressed data.
[0074] In one aspect, the disclosed embodiments include a training phase and an operational phase. In the training phase, the neural network 110 is trained, using training data, to enable the neural network 110 to perform specific intended functions in the operational phase. The processor 106 is configured to execute an unsupervised or a semi-supervised training of the neural network 110 using training data. In the unsupervised training of the neural network 110, unlabeled training data is used for training of the neural network 106. Moreover, in the semi-supervised training of the neural network 110, a comparatively small amount of labeled training data and a large amount of unlabeled training data is used for training of the neural network 110.
[0075] Referring also to
[0076] The aspects of the disclosed embodiments separate the stent layer and non-stent layer using neural network without online optimization. The stent motion and non-stent motion does not need to be estimated on-line in an explicit way. A mapping function f maps the inputs, the sequence images and marker coordinates, into two single image outputs. The function f is fully realized via a neural network. Thus, the method disclosed herein is faster than conventional optimization-based approaches. In addition, the network is trained on a large number of data and is much robust than non-learning based methods.
[0077] Various embodiments and variants disclosed above, with respect to the aforementioned apparatus 100, apply mutatis mutandis to the method. The method described herein is computationally efficient and does not cause processing burden on the processor 102.
[0078] Modifications to embodiments of the aspects of the disclosed embodiments described in the foregoing are possible without departing from the scope of the aspects of the disclosed embodiments as defined by the accompanying claims. Expressions such as “including”, “comprising”, “incorporating”, “have”, “is” used to describe and claim the aspects of the disclosed embodiments are intended to be construed in a non-exclusive manner, namely allowing for items, components or elements not explicitly described also to be present. Reference to the singular is also to be construed to relate to the plural.
[0079] Thus, while there have been shown, described and pointed out, fundamental novel features of the invention as applied to the exemplary embodiments thereof, it will be understood that various omissions, substitutions and changes in the form and details of devices and methods illustrated, and in their operation, may be made by those skilled in the art without departing from the spirit and scope of the presently disclosed invention. Further, it is expressly intended that all combinations of those elements, which perform substantially the same function in substantially the same way to achieve the same results, are within the scope of the invention. Moreover, it should be recognized that structures and/or elements shown and/or described in connection with any disclosed form or embodiment of the invention may be incorporated in any other disclosed or described or suggested form or embodiment as a general matter of design choice. It is the intention, therefore, to be limited only as indicated by the scope of the claims appended hereto.