SYSTEMS AND METHODS FOR INTELLIGENTLY COMPRESSING WHOLE SLIDE IMAGES
20230215052 · 2023-07-06
Inventors
Cpc classification
G06T2207/20016
PHYSICS
G06V10/26
PHYSICS
International classification
G06V10/26
PHYSICS
Abstract
Systems and methods for compressing images that include a memory storing an executable code and a processor executing the code to receive a whole slide image, the whole slide image containing a plurality of image layers and metadata associated with each image layer, extract a high-resolution image layer and the corresponding metadata, wherein the high-resolution image layer includes a plurality of image tiles including informative tiles and noninformative tiles, where the informative tiles depict a region of interest of the specimen, analyze the image tiles of the extracted high-resolution image layer, determine a first tile is a noninformative tile, create an informative image layer by removing the first tile from the extracted high-resolution image layer, the informative image layer containing a plurality of informative tiles, compress the informative image layer into a single-layer whole slide image, and save the single-layer whole slide image in the memory.
Claims
1. A system comprising: a non-transitory memory storing an executable code; and a hardware processor executing the executable code to: receive a whole slide image depicting a specimen, the whole slide image having an image pyramid containing a plurality of image layers each depicting the specimen with a corresponding layer resolution and a corresponding plurality of metadata associated with each image layer of the plurality of image layers; extract a high-resolution image layer and the corresponding metadata associated with the high-resolution image layer from the image pyramid, wherein the high-resolution image layer includes a plurality of image tiles including informative tiles and noninformative tiles, where the informative tiles depict an image of a region of interest of the specimen; analyze the plurality of image tiles of the extracted high-resolution image layer; determine a first tile of the plurality of image tiles is a noninformative tile; create an informative image layer by removing the first tile from the extracted high-resolution image layer, the informative image layer containing a plurality of informative tiles; and save the single-layer whole slide image in the non-transitory memory.
2. The system of claim 1, wherein the hardware processor further executes the executable code to reconstruct a multi-layer image pyramid from the compressed single-layer whole slide image comprising a plurality of reconstructed layers, wherein each reconstructed layer of the plurality of reconstructed layers has a corresponding reconstructed layer resolution.
3. The system of claim 2, wherein reconstructing the multi-resolution image pyramid comprises using one of an upsampling algorithm and a downsampling algorithm.
4. The system of claim 1, wherein the informative tiles depict a tissue information of the specimen.
5. The system of claim 1, wherein each noninformative tile is removed using a tile pixel variance algorithm.
6. The system of claim 1, wherein the plurality of noninformative tiles is at least one of a white space around a tissue image and an image of glass borders depicted in an image.
7. The system of claim 1, wherein after removing the first tile from the high-resolution image layer, the hardware processor executes the executable code to insert a color value to represent the first tile that was removed.
8. The system of claim 1, wherein a file size of the single-layer whole slide image is up to 90% less than a file size of the whole slide image, thereby resulting in a faster retrieval time.
9. The system of claim 1, wherein, prior to determining the first tile is a noninformative tile, the hardware processor executes the executable code to calculate a probability that the first tile is a noninformative tile, wherein the determination is based on the probability.
10. The system of claim 1, wherein, prior to saving the single-layer whole slide image in the non-transitory memory, the hardware processor further executes the executable code to compress the informative image layer into a single-layer whole slide image.
11. A method for use with a computing system having a non-transitory memory and a hardware processor, the method comprising: receiving, using the hardware processor, a whole slide image depicting a specimen, the whole slide image having an image pyramid containing a plurality of image layers each depicting the specimen with a corresponding layer resolution and a corresponding plurality of metadata associated with each image layer of the plurality of image layers; extracting, using the hardware processor, a high-resolution image layer and the corresponding metadata associated with the high-resolution image layer from the image pyramid, wherein the high-resolution image layer includes a plurality of image tiles including informative tiles and noninformative tiles, where the informative tiles depict an image of a region of interest of the specimen; analyzing, using the hardware processor, the plurality of image tiles of the extracted high-resolution image layer; determining, using the hardware processor, a first tile of the plurality of image tiles is a noninformative tile; creating, using the hardware processor, an informative image layer by removing the first tile from the extracted high-resolution image layer, the informative image layer containing a plurality of informative tiles; and saving, using the hardware processor, the single-layer whole slide image in the non-transitory memory.
12. The method of claim 11, further comprising reconstructing, using the hardware processor, a multi-layer image pyramid from the compressed single-layer whole slide image comprising a plurality of reconstructed layers, wherein each reconstructed layer of the plurality of reconstructed layers has a corresponding reconstructed layer resolution.
13. The method of claim 12, wherein reconstructing, using the hardware processor, the multi-resolution image pyramid comprises using one of an upsampling algorithm and a downsampling algorithm.
14. The method of claim 11, wherein the informative tiles depict a tissue information of the specimen.
15. The method of claim 11, wherein each noninformative tile is removed using a tile pixel variance algorithm.
16. The method of claim 11, wherein the plurality of noninformative tiles is at least one of a white space around a tissue image and an image of glass borders depicted in an image.
17. The method of claim 11, wherein, after removing the first tile from the high-resolution image layer, the method further comprises inserting, using the hardware processor, a color value to represent the first tile that was removed.
18. The method of claim 11, wherein a file size of the single-layer whole slide image is up to 90% less than a file size of the whole slide image, thereby resulting in a faster retrieval time.
19. The method of claim 11, wherein, prior to determining the first tile is a noninformative tile, the method further comprises calculating, using the hardware processor, a probability that the first tile is a noninformative tile, wherein the determination is based on the probability.
20. The method of claim 11, wherein, prior to saving the single-layer whole slide image in the non-transitory memory, the method further comprises compressing the informative image layer into a single-layer whole slide image.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
DETAILED DESCRIPTION
[0032] The following description contains specific information pertaining to embodiments and implementations in the present disclosure. The drawings in the present application and their accompanying detailed description are directed to merely exemplary implementations. Unless noted otherwise, like or corresponding elements among the figures may be indicated by like or corresponding reference numerals. Moreover, the drawings and illustrations in the present application are generally not to scale and are not intended to correspond to actual relative dimensions.
[0033]
[0034] Computing device 110 is a computing system for intelligently compressing whole slide images. In some implementations, computing device 110 may be a computing system for intelligently decompressing a previously intelligently compressed whole slide image. As shown in
[0035] WSI 131 is a digital image file. In some implementations, WSI 131 may include an image pyramid comprising a plurality of layers each containing the image at a different level of resolution. Each layer of the image pyramid may have metadata associated with it. In some implementations, WSI 131 may be a digital image of a specimen, such as a tissue sample imaged for analysis. WSI 131 may be a digital image of a specimen, such as a tissue sample imaged for diagnosis. The specimen may include a region of interest that includes tissue information related to a condition of the tissue or a condition of the specimen. The plurality of layers may include a high-resolution image, a middle-resolution image, and a low-resolution image. In other implementations, the plurality of layers may include additional layers each depicting the image in a layer resolution. Each layer of WSI 131 may be made up of a plurality of image tiles. Each tile of the plurality of image tiles depicting a portion of the slide. Some image tiles may depict a portion of the specimen. Some image tiles may depict blank slide space. Some image tiles may depict an edge of a slide that is holding the specimen for imaging. An image tile that depicts a portion of the specimen may be considered an informative tile. An image tile that depicts only blank slide space or only an edge of the slide may be considered a noninformative tile.
[0036] Compressed WSI 133 is a compressed digital image. Compressed WSI 133 may be a single layer image compressed to preserve space in a computer memory, such as memory 130, and configured to preserve image data for decompression. In some implementations, compressed WSI 133 may be an image including informative tiles that has been compressed. In some implementations, compressed WSI 133 may be an image including informative tiles with substantially all noninformative tiles removed that has been compressed. Compressed WSI 133 may preserve image data to reconstruct an image pyramid comprising a plurality of image layers each having a reconstructed layer resolution, such as a layer including a reconstructed high-resolution image, a layer including a reconstructed middle-resolution image, and a layer including a reconstructed low-resolution image.
[0037] WSI processing module 141 is a software module stored in memory 130 for execution by processor 120 to process a whole slide image for intelligent compression, according to one implementation of the present disclosure. In some implementations, WSI processing module 141 may receive a whole slide image having an image pyramid containing a plurality of image layers and a metadata associated with each image layer, wherein each image layer of the image pyramid has an layer resolution. In some implementations, WSI processing module 141 may extract a high-resolution image layer from the plurality of image layers and the metadata associated with the high-resolution image layer, wherein the high-resolution image layer includes a plurality of image tiles depicting an image of a region of interest. In some implementations, WSI processing module 141 may convert the high-resolution image layer from a color image for processing.
[0038] Tile processing module 143 is a software module stored in memory 130 for execution by processor 120 to perform analysis of the high-resolution image layer of the WSI, according to one implementation of the present disclosure. In some implementations, tile processing module 143 may analyze the plurality of image tiles of the high-resolution image layer. In some implementations, tile processing module 143 may calculate a probability that a tile in the high-resolution image layer is a noninformative tile. In some implementations, tile processing module 143 may determine that tiles in the high-resolution image layer are noninformative tiles. In some implementations, a first tile may be a noninformative tile. In some implementations, there may be a plurality of noninformative tiles. In some implementations, tile processing module 143 may remove the noninformative tile or the plurality of noninformative tiles from the high-resolution image layer, thereby creating an informative image layer containing a plurality of informative tiles. In some implementations, tile processing module 143 may insert a color value to replace the image data that is removed when the noninformative tiles are removed.
[0039] Compression module 145 is a software module stored in memory 130 for execution by processor 120 to intelligently compress a processed high-resolution image layer. In some implementations, compression module 145 may intelligently compress the informative image layer containing the plurality of informative tiles into a single-layer whole slide image. In some implementations, compression module 145 may save the single-layer whole slide image in the non-transitory memory.
[0040] Decompression module 147 is a software module stored in memory 130 for execution by processor 120 to intelligently decompress the single-layer whole slide image. In some implementations, decompression module 147 may reconstruct a multi-resolution image pyramid from the intelligently compressed single-layer whole slide image.
[0041] Display 150 may include a display suitable for displaying images. In some implementations, display 150 may include a television, a computer monitor, a display of a tablet computer, or a display of a mobile phone. Display 150 may be a light emitting diode (LED) display, an organic light emitting diode (OLED) display, a liquid crystal display (LCD), a plasma display, a cathode ray tube (CRT), an electroluminescent display (ELD), or other display appropriate for viewing images. As depicted in
[0042] Network 155 is a computer network, such as the Internet, a local area network (LAN), a wireless local area network (WLAN), a wide area network (WAN), a metropolitan area network (MAN), a server area network (SAN), etc.
[0043] Storage device 160 is a computing device for storing code for execution by processor 120, and also for storing various data and parameters. Storage device 160 may be a server or other computer storage device. Storage device 160 may be a local storage device or a remote storage device. As depicted in
[0044]
[0045] As shown in
[0046]
[0047]
[0048]
[0049] As depicted at 510, the novel system of the present disclosure may extract a high-resolution image layer of the from the multi-resolution image layers. In the depicted implementation, the extracted image layer is a high-resolution image layer with informative tissue information for diagnostic pathology purposes. The high-resolution image layer includes a plurality of image tiles, wherein each of the plurality of image tiles is one of an informative tile and a noninformative tile.
[0050] At 520, analysis of the plurality of image tiles of the high-resolution image layer may be performed. The plurality of image tiles depict an image of a region of interest, wherein the region of interest depicts the tissue sample having tissue information for diagnostic pathology purposes. An informative tile has relevant tissue information. A noninformative tile has extraneous information, unnecessary for the purposes of diagnostic pathology purposes. In some implementations, the noninformative tiles are images of white space surrounding the region of interest with the tissue information. In some implementations, the noninformative tiles are images of the glass border of the glass slide. As depicted at 520, the presently disclosed novel system may intelligently identify and remove any noninformative tiles using tile pixel variance algorithms. In the depicted implementation at 530, the removed, noninformative tiles are disposed of. In some implementations, the novel system contemplates inserting a color value to represent the noninformative tile that was removed. In some implementations, the color value is white. In some implementations, the color value is a color other than white.
[0051] Also depicted at 530, with the removal of any noninformative tiles, what remains of the high-resolution image layer may be created into an informative image layer containing a plurality of informative tiles having informative tissue information, according to one implementation of the present disclosure. In some implementations, the informative image layer includes the plurality of informative tiles. In some implementations, the informative image layer includes the plurality of informative tiles as well as tiles having the inserted color value that replaced the removed, noninformative tiles. The informative image layer is intelligently compressed into a single-layer whole slide image. Further, as depicted at 530, the single-layer whole slide image is saved. In other words, relevant information, such as the informative tiles are stored. In some implementations, the relevant information may be stored in a storage device. Accordingly, the single-layer whole slide image containing only relevant information is stored, which results in up to 90% in reduced file size.
[0052] At 540, in some implementations, an image pyramid may be reconstructed using the single-layer whole slide image that was saved. In some implementations, upsampling algorithms are used to generate higher resolution layers. In some implementations, downsampling algorithms are used to generate lower resolution layers. As a result, in the depicted implementation at 540, a full image pyramid is reconstructed for an optimalviewing experience without compromising the integrity of the images.
[0053]
[0054] Generally, an image pyramid containing the plurality of image layers includes multiple image levels, so that each image layer of the image pyramid has a specific layer resolution. The different layer resolution may depend on the pixel size of the image layer. For instance, a first image layer may be 50 um*50 um, a second image layer may be 100 um*100 um, a third image layer may be 200 um*200 um, a fourth image layer may be 500 um*500 um, a fifth image layer may be 1 mm*1 mm, and so on. In some implementations, the increments of the pixel size for each image layer may be different. In some implementations, the image pyramid may have less than five image layers. In some implementations, the image pyramid may have more than five image layers.
[0055] At 602, system 100, using processor 120, extracts a high-resolution image layer and the corresponding metadata associated with the high-resolution image layer from the image pyramid, wherein the high-resolution image layer includes a plurality of image tiles including informative tiles and noninformative tiles, where the informative tiles depict an image of a region of interest (ROI) of the specimen. The ROI may include the cells, tissues, organs, and other materials on a glass slide. In some implementations, the ROI contains a full histological information to be examined by a pathologist. In some implementations, the ROI may include materials on a glass slide. The pathologist examines the specimen ROI image to interpret findings and make a diagnosis. In addition to depicting an image of a ROI with the specimen, the high-resolution image layer may depict additional areas surrounding the tissue sample. The area surrounding the tissue sample image may include empty portions of the slide appearing as white space around the tissue sampleimage and images of glass borders from the glass slide.
[0056] At 603, system 100, using processor 120, may optionally convert the high-resolution image layer from a color image for processing. In some implementations, an RGB color image is converted to a YCbCr space image. In some implementations, an RGB color image is converted to a YUV color space image. In some implementations, a color image is converted to a grayscale image. In some implementations, a color image is converted to a color combination appropriate for further processing, according to an implementation of the present disclosure. Table 1 below shows an example of an RGB image converted to a YUV image.
[0057] As shown in Table 1, an RGB color image is converted to a YUV space image. In some implementations, all further image processing is based on a grayscale image of the “Luminance” Y channel, instead of the three-channel RGB image. YUV images are an affine transformation of the RGB image. Y channel is a perceived intensity or “Luminance.” U channel and V channel are chrominance components or “color information.”
[0058] Further, Table 2 below depicts an implementation of a color matrix. YCbCr is used for digital signal. Cb is the blue difference, and Cr is the red difference.
TABLE-US-00001 TABLE 2
[0059] At 604, system 100, using processor 120, analyzes the plurality of image tiles of the high-resolution image layer. The plurality of image tiles includes at least one informative tile and at least one noninformative tile. An informative tile contains relevant information, which may include the region of interest with the specimen image containing histological information. In some implementations, a noninformative tile may contain irrelevant information such as only empty space around a tissue sample image. In some implementations, a noninformative tile contains irrelevant information, which may be an image of the glass or glass border from the glass slide.
[0060] In some implementations, analysis may include discrete wavelet transform (DWT) with a first tile downsampled lower band (LL) frequency matrix. In some implementations, the analysis includes LL sub-image variance and mean computation. Further analysis includes row and column differences between the plurality of tiles. In some implementations, the plurality of image tiles are labeled and categorized as an informative tile or a noninformative tile.
[0061] Table 3 below shows the use of DWT in the analysis of image tiles.
TABLE-US-00002 TABLE 3
[0062] As shown in Table 3, for each image tile, the image is decomposed and downsampled using DWT for four to five levels. Specifically, for each level of decomposition, the low-pass and high-pass filters are used in both row-wise and column-wise directions, wherein the original image can generate four subbands of the image of LL, LH, HL, and HH.
[0063] DWT is an algorithm used to reduce dimensionality of an image, feature extraction process. DWT algorithm decomposes the image into four subbands (sub-image) i.e., LL, LH, HL, HH. LL is the approximate image of input image.LL is low frequency sub-band, so it is used for further decomposition process. LH subband extracts the horizontal features of original image. HL subband gives vertical features. HH subband gives diagonal features. For example, if an original size is 512*512, the first LL level frequency band matrix becomes 256*256, second LL matrix is 128*128, and so on.
[0064] In some implementations, further analysis may include single tile image analysis. In some implementations, the mean, variance, row, and column differences are calculated on the basis of the LL image with size 32*32. As such, this can used to save computational complexity.
[0065] At 605, system 100, using processor 120, may optionally calculate a probability that the first tile is a noninformative tile. Various factors are analyzed, including the pixels associated with the tile, colors, and spectra of light, to name a few. Based on the analysis, the probability of whether a tile is noninformative may be determined. In some implementations, if the probability is above a certain threshold value, then the first tile is noninformative.
[0066] At 606, system 100, using processor 120, determines a first tile of the plurality of image tiles is a noninformative tile. In some implementations, the noninformative tile has irrelevant information or is largely blank, whereas the informative tile generally has the tissue information necessary for diagnostic pathology, research, and the like.
[0067] In some implementations, each image tile is labeled as one of an informative tile and a noninformative tile. For noninformative tiles, the image variances are usually very low, and the mean is relatively high as it mainly consists of “white space,” without contours and sharp intensity changes within the tile. “White” is the greatest intensity value, while “black” is the smallest intensity value of 0. As depicted in
[0068] In some implementations, the row and column differences are used to check on the uniformity of the images. Some tiles have “stripe” or “band” artifacts, wherein the entire row or column have similar patterns. For example, as depicted in the
[0069] At 607, system 100, using processor 120, removes the first tile from the high-resolution image layer, thereby creating an informative image layer containing a plurality of informative tiles. In some implementations, each noninformative tile is removed using a tile pixel variance algorithm.
[0070] In some implementations, a binary image is formed based on the tile labeling information and its location. The contour of the image is drawn, and the isolated and absurd islands or noninformative tiles are removed. Then, the tiles labeled as “1” are dilated for one more pixel to ensure all the edge tiles.
[0071] At 608, system 100, using processor 120, may insert a color value to represent the first tile that was removed. In some implementations, the color value for white is inserted, thereby replacing the removed noninformative tile that contained irrelevant information. Removing the noninformative tile and inserting the color value for white in its place ultimately decreases the ending file size. In some implementations, the color value other than white may be inserted. In some implementations, a color value is not inserted. In some implementations, a removed tile is left without information.
[0072] At 609, system 100, using processor 120, intelligently compresses the informative image layer containing the plurality of informative tiles into a single-layer whole slide image. As a result, the relevant information in the informative tiles is maintained and stored in high resolution. In some implementations, the system may select intelligent compression techniques and/or intelligent compression parameters based on one or more intelligent compression rules, which may be associated with image characteristics, patient characteristics, and medical history, to name a few.
[0073] In some implementations, compression module 145 may write the image tiles. In some implementations, for the informative tiles labeled as “1,” compression module 145 may write the information of the informative tiles to form a single-layer image. In some implementations, where the noninformative tiles are labeled as “0,” the write is disabled and left as a blank tile.
[0074] At 610, system 100, using processor 120, may save the single-layer whole slide image in the non-transitory memory. Consequently, large file sizes in gigabytes (GB) can be intelligently compressed in a timely manner and stored without comprising the integrity of the high-resolutionimage quality. A file size of the single-layer whole slide image is up to 90% less than a file size of the whole slide image, thereby resulting in faster retrieval time and requiring less storage capacity and network bandwidth. In some implementations, the file size of the single-layer wholeslide image is 10% to 90% less than the file size of the whole slide image. In some implementations, the file size of the single-layer whole slide image is any increment between 10% to 90% less than the file size of the whole slide image.
[0075] At 611, system 100, using processor 120, may reconstruct a multi-resolution image pyramid from the intelligently compressed single-layer whole slide image. In some implementations, reconstructing the multi-resolution image pyramid uses an upsampling algorithm to generate higher resolution layers. In some implementations, reconstructing the multi-resolution image pyramid uses a downsampling algorithm to generate lower resolution layers. By reconstructing a multi-resolution image pyramid, pathologists and other professionals still have the high-standard viewing experience that they are accustomed to.
[0076]
[0077] As shown in
[0078] As shown on the right side of
[0079] In some implementations, the present disclosure contemplates lower bandwidth demands for whole slide image retrieval, thereby enabling quicker access to whole slide images.
[0080] In some implementations, also contemplated is the reduced processing time for artificial intelligence (AI) analysis. In some implementations, the present disclosure contemplates incorporating machine learning, thereby reducing processing time, including time required for analysis of image tiles.
[0081] In some implementations, intelligent compression of WSIs may reduce cost of data storage and management implementation.
[0082] In some implementations, the present disclosure also contemplates enabling the development of an intelligent data management system for real world pathology data.
[0083] The present disclosure contemplates implementations of automatically selecting an appropriate intelligent compression technique and intelligent compression parameters for whole slide images (WSI) in order to reduce or prevent loss of significant information that does not impact the usefulness or diagnosis of the digital pathology images. Based on various image characteristics associated with a WSI, the present disclosure contemplates dynamically and intelligently compressing whole slide images using particular intelligent compression techniquesand by adjusting intelligent compression parameters, to maintain diagnostic tissue information ofthe image. The system may select intelligent compression techniques and/or intelligent compression parameters based on one or more intelligent compression rules, which may be associated with image characteristics, patient characteristics, medical history, etc. Further, the system may, based on the one or more intelligent compression rules, intelligently compress the image to a maximum degree of intelligent compression while maintaining the significant information of the image.
[0084] From the above description, it is manifest that various techniques can be used for implementing the concepts described in the present application without departing from the scope of those concepts. Moreover, while the concepts have been described with specific reference to certain implementations, a person having ordinary skill in the art would recognize that changes can be made in form and detail without departing from the scope of those concepts. As such, the described implementations are to be considered in all respects as illustrative and not restrictive. It should also be understood that the present application is not limited to the particular implementations described above, but many rearrangements, modifications, and substitutions are possible without departing from the scope of the present disclosure.