Advanced computer-aided diagnosis of lung nodules

Abstract

Methods are herein provided for decision support in diagnosis of a disease in a subject, and for extracting features from a multi-slice data set. Systems for computer-aided diagnosis are provided. The systems take as input a plurality of medical data and produces as output a diagnosis based upon this data. The inputs may consist of a combination of image data and clinical data. Diagnosis is performed through feature selection and the use of one or more classifier algorithms.

Claims

1. A method of extracting features from a multi-slice data set, the method comprising: representing a spatial distribution of an object mathematically; representing a shape of the object mathematically; determining contour and texture of the object; identifying a border pixel of the object and estimating a derivative; analyzing the derivative as a function of position along the contour; identifying automatically the presence of dark regions or bright regions within the object; and approximating texture of an image in a surrounding region of the object; wherein the features are calculated for the group comprising each slice of the multi-slice data set, a maximum intensity projection taken at an arbitrary angle, a minimum intensity projection taken at an arbitrary angle, and a digitally reconstructed radiograph taken at an arbitrary angle through one or more slices of the image.

2. The method according to claim 1, further comprising selecting an individual slice from the multi-slice data set for analysis by a manual selection by a user or by an automatic selection of a largest slice.

3. The method according to claim 1, wherein the features calculated for each slice of the multi-slice data set are combined by a method selected from the group consisting of: calculating a weighted average in which weights are proportional to a number of pixels on each slice; finding a maximum value across multiple slices of the multi-slice data set; and finding a minimum value across the multiple slices of the multi-slice data set.

4. The method according to claim 1, wherein the features are calculated in each of a plurality of dimensions.

5. The method according to claim 4, wherein the plurality of dimensions is at least one selected from the group consisting of 2 dimensions, 2.5 dimensions, and 3 dimension.

6. The method according to claim 1, wherein the shape of the object is described by at least one of the group consisting of: distribution of coefficients after a Fourier transform of border pixel positions; mathematical moments of a segmented object that are invariant to translation, rotation, and scaling; mathematical moments of a grayscale distribution of image pixels; fractal dimension; and a chain code.

7. The method according to claim 1, wherein the texture of the object is described by at least one of the group consisting of: fractal dimension; energy, entropy, maximum probability, inertia, inverse difference and correlation based on a gray-level co-occurrence matrix; and coarseness, contrast, busyness, complexity and strength based on a neighborhood gray-tone difference matrix.

8. The method according to claim 1, wherein the surrounding region is described by at least one of the group consisting of: a derivative of image intensity along a direction orthogonal to a local contour; a derivative of the image intensity along the direction orthogonal to the local contour and moments of a power spectrum; and an estimate of variance of the image intensity along the direction orthogonal to the local contour.

9. The method according to claim 1, wherein the presence of dark regions and bright regions within the object is described by the intensity or size of clusters of contiguous pixels above or below a given threshold.

10. A system for extracting features from a multi-slice data set, the system comprising: a processor; and a memory storing instructions, which, when executed by the processor, cause the processor to: represent a spatial distribution of an object mathematically; 0067 represent a shape of the object mathematically; determine contour and texture of the object; identify a border pixel of the object and estimate a derivative; analyze the derivative as a function of position along the contour; identify automatically the presence of dark regions or bright regions within the object; and approximate texture of an image in a surrounding region of the object; wherein the features are calculated for the group comprising each slice of the multi-slice data set, a maximum intensity projection taken at an arbitrary angle, a minimum intensity projection taken at an arbitrary angle; and a digitally reconstructed radiograph taken at an arbitrary angle through one or more slices of the image.

11. The system according to claim 10, wherein the memory further stores instructions, which, when executed by the processor, cause the processor to select an individual slice from the multi-slice data set for analysis by a manual selection by a user or by an automatic selection of a largest slice.

12. The system according to claim 10, wherein the features calculated for each slice of the multi-slice data set are combined by one of the group consisting of: calculating a weighted average in which weights are proportional to a number of pixels on each slice; finding a maximum value across multiple slices of the multi-slice data set; and finding a minimum value across the multiple slices of the multi-slice data set.

13. The system according to claim 10, wherein the features are calculated in each of a plurality of dimensions.

14. The system according to claim 13, wherein the plurality of dimensions is at least one selected from the group consisting of 2 dimensions, 2.5 dimensions, and 3 dimension.

15. The system according to claim 10, wherein the shape of the object is described by at least one of the group consisting of: distribution of coefficients after a Fourier transform of border pixel positions; mathematical moments of a segmented object that are invariant to translation, rotation, and scaling; mathematical moments of a grayscale distribution of image pixels; fractal dimension; and a chain code.

16. The system according to claim 10, wherein the texture of the object is described by at least one of the group consisting of: fractal dimension; energy, entropy, maximum probability, inertia, inverse difference and correlation based on a gray-level co-occurrence matrix; and coarseness, contrast, busyness, complexity and strength based on a neighborhood gray-tone difference matrix.

17. The system according to claim 10, wherein the surrounding region is described by at least one of the group consisting of: a derivative of image intensity along a direction orthogonal to a local contour; a derivative of the image intensity along the direction orthogonal to the local contour and moments of a power spectrum; and an estimate of variance of the image intensity along the direction orthogonal to the local contour.

18. The system according to claim 10, wherein the presence of dark regions and bright regions within the object is described by the intensity or size of clusters of contiguous pixels above or below a given threshold.

Description

(1) FIG. 1 is a series of drawings showing a box-counting algorithm for finding fractal dimension of a contour.

(2) FIG. 2 is a graph of a computation of fractal dimension, with log (N [boxes]) on the ordinate and log (1/[box size]) on the abscissa, showing a linear relationship between log (N [boxes]) and log (1/[box size]).

(3) FIG. 3, Panel A is a scan of a nodule.

(4) FIG. 3, Panel B is a corresponding segmented contour drawing and the estimated normal angles.

(5) FIG. 4, Panel A is a graph of the intensity gradient of a nodule, with gradient on the ordinate and position along contour on the abscissa.

(6) FIG. 4, Panel B is a graph of the corresponding power spectrum, with power/frequency on the ordinate and normalized frequency on the abscissa.

(7) FIG. 5 is a block diagram of a CADx system.

(8) FIG. 6 depicts an example image with internal boundary and a chain code encoder template.

(9) A CADx system that provides high confidence to clinicians improves clinician workflow by providing fast and accurate diagnosis (fewer false positives and false negatives). A CADx system can be used as a second reader to increase clinicians' confidence in their diagnosis, leading to significant reduction of unnecessary biopsies of lung lesions such as nodules. Furthermore, a CADx system can facilitate lung cancer screening of asymptomatic patients since diagnosis can be reached quickly and accurately. MSCT scanners, exemplified but not limited to the Philips Brilliance series, offer increasing resolution and allow finer structures to be observed while producing increasing amounts of image data to be interpreted by radiologists. However, even the latest CADx systems often fail to incorporate clinical information, to use optimized feature extraction, or to apply machine learning techniques.

(10) Features of interest that have not been developed include analysis for both thick and thin slice CT scans. Other proposed features include only simple 3D features or features that are not optimal to describe the differences between benign and malignant nodules. These features often result in a low rate of accuracy and are not desirable to use in a CADx algorithm.

(11) The methods and systems provided herein are based on state-of-the art machine learning techniques, such as genetic algorithms and support vector machines, and innovative image processing algorithms for pre-processing of images and feature extraction. An aspect of the methods and systems provided herein is the capability of combining image-based and clinical information about a patient and a patient's lesion into the decision making process. The methods and systems provided herein combine features extracted from high quality medical images, for example a CT scan, with non-imaging data from patient health records through the use of machine learning and image processing methodology.

(12) The methods and systems (exemplified by FIG. 5) provided herein for computer-aided diagnosis of lesions includes several processing units. A pre-processing unit processes images, e.g. MSCT scans, to create isotropic volumes of interest each surrounding a lesion to be diagnosed, or to segment or delineate lesions, such as pulmonary nodules. A feature extraction unit extracts two dimensional (2D), two and a half dimensional (2.5D) and three dimensional (3D) features from images to characterize lesions. These features, together with clinical information including patient history, constitute the feature pool. A clinical information processing unit accepts and transforms the clinical information to be used in the feature pool. Feature selection, a step used in the design phase of the system, is based on a genetic algorithm, and is used to select an optimal feature subset from the feature pool. A classifier or committee of classifiers are used in the feature selection process, and are built using the selected feature subset to classify a lesion as malignant or benign, or to determine the malignancy of a lesion, such as a pulmonary nodule.

(13) In many instances, because of the complexity of the 3D shape of the nodules, the 2D slices may reveal several disconnected islands. As many of the features described below rely upon a unique delineation of border pixels, it is often desirable to operate on only a single object. The methods and systems in one embodiment use an algorithm to remove all but the largest connected object, and perform all nodule analysis on that object. In this case, analysis is performed only for 2D and 2.5D features. The methods and systems provided herein facilitate the use of 2D, 2.5D, and 3D features on a multi-slice dataset. Because of the ease of computation of 2D features, it is often desirable to utilize 2D features even when 3D data is available. This is particularly common in thick slice data (i.e. a slice thickness greater than 3 times the in-plane resolution), where 3D features may not be as robust and sufficient. In such cases, the user of the CADx system may manually identify an individual slice for analysis. The CADx system is then said to be operating in a 2D mode, in which the features extracted from this 2D slice are used in performing the classification. Similarly, only 2D features from the training dataset are used in constructing the optimum classifier when the system is operating in 2D mode. Such systems fail to capture the full range of information present in the multi-slice volume.

(14) To overcome this, the methods and systems provided herein use a 2.5D, or pseudo-3D, mode. The same features are used as in the pure 2D mode, and these features are calculated on each slice of the MSCT image. The slice range may encompass the whole volume, or alternatively can be manually selected by the user. The feature values used in the classifier are then taken as the size-weighted average of these slice-by-slice calculations. For some features, it may be more logical to use the maximum or minimum values across slices. Alternatively, the 2D features are computed on maximum intensity projection (MIP) data taken through the range of slices, or from an arbitrary angle, with the 2D feature extraction run on the projected image.

(15) Pre-Processing of Images

(16) Images, such as MSCT scans, are pre-processed to determine regions of interest (ROIs) or volumes of interest (VOIs) for analysis. These serve as an input into the feature extraction unit.

(17) The methods and systems provided herein include several pre-processing steps. ROIs are constructed that allow feature calculation to be robust to segmentation errors. ROIs are constructed by morphological operations of erosion and dilation on the binary nodule image. This is used to construct an internal, external, and boundary region. An additional ROI consisting of the largest square region that can be embedded in the object is also identified. The features described in these methods and systems can be computed using any one or more of these ROIs.

(18) The chest wall and other irrelevant anatomy are excluded from feature calculation. Many current segmentation algorithms also delineate the borders of the chest wall. Voxels or pixels that belong to the chest wall are explicitly excluded from all of the features to be described. Only those pixels that are labelled as belonging to the lung parenchyma or the nodule of interest are included.

(19) For many features, it is desirable to capture the full range of detail within the nodule, including potential cavities within the nodule. Cavitation, or air pockets within the nodule, is often not identified directly by segmentation. Therefore, the methods and systems provided herein use an algorithm for post-processing nodule segmentation to fill in holes or gaps in the segmentation mask, such that air pockets or cavities within the nodule are explicitly considered during feature calculation. No change is performed on the CT images, but rather, on the segmentation results. These methods and systems relate to the use of the post-processed segmentation masks, as well as the original segmentation masks.

(20) As shown in FIG. 5, the pre-processing unit extracts a VOI surrounding a lung nodule based on its location, which can be provided by a clinician or a CAD system.

(21) Since MSCT scans have higher resolution in the slices than among slices, it is desirable to perform interpolation between the slices to create isotropic voxel representation. Interpolation to isotropic voxels is desirable for segmentation purposes. However, it is also preferable to retain the original data for feature calculation, to avoid the filtering properties of interpolation kernels. The methods and systems provided herein use an interpolation method that keeps the original slices and inserts interpolated slices to make the scan near-isotropic. The methods and systems in one embodiment include a means of limiting the interpolation to a single axis in order to reach near-isotropic voxels. These isotropic volumes can then be used for segmentation, and for computation of 3D features. The original 2D slices can later be extracted from this interpolated 3D dataset.

(22) A segmentation step delineates the pulmonary nodules from the background, generating a binary or trinary image (label VOI), where a nodule, background and lung wall regions are labeled. A segmentation algorithm is run on the interpolated data, resulting in a volume of the same dimension as the interpolated volume. See Philips Press Release, Philips' new CT Lung Nodule Assessment and Comparison Option can enable clinicians to identify and treat lung cancer, 2003; Wiemker et al., Options to improve the performance of the computer aided detection of lung nodules in thin-slice CT, Philips Research Laboratories: Hamburg, p. 1-19, 2003; and Wiemker et al., Computer Aided Tumor Volumetry in CT Data, Invention disclosure, Philips Research Hamburg, 2002. For analysis of 2D features, the slices in the label volume that correspond to the original slices are identified and extracted. For 3D features, the full near-isotropic 3D volume is used.

(23) In one embodiment of the invention, manual selection of segmentation masks from segmentation using varying threshold is used. Thus, various segmentation thresholds and seed placements are tested. Grayscale images overlaid with segmentation contours are presented to a user, who then manually selects the optimum segmentation, a single slice for 2D feature extraction, and a range of slices for 2.5-D feature extraction.

(24) Feature Extraction Unit

(25) Feature extraction is performed to extract 2D, 2.5D, and 3D features images to characterize lesions, such as pulmonary nodules. These features, together with clinical information, constitute a feature pool.

(26) Using a gray-level and labeled VOI, the feature extraction unit calculates different features, such as 2D (using the native slice of the VOI with the largest nodule cross-section), 2.5D (an average of 2D features calculated on all the native slices, weighted by the nodule cross-sectional area), and 3D (based on the near-isotropic VOI) features. The feature extraction step is significant, since the calculated features need to have enough discriminatory power together with the clinical information to distinguish between malignant and benign nodules. Features can be, for example, gray level distributions inside and surrounding the nodules, shape information, texture information inside and outside the nodule, gradient information on the surface of the nodule, or contrast between the inside and outside of the nodule. Each of these features can be calculated in 2D, 2.5D, or 3D.

(27) Clinical Information and its Transformation to Clinical Features

(28) Since clinical information is important in the diagnosis process, the methods and systems provided herein include a unit which converts clinical information into a suitable form so it can be combined with extracted image based features for the feature selection process. For example, clinical information for gender is divided into 2 categories, such as whether the patient is male or whether is the patient female. The clinical information which can be used in the proposed system can include, for example, age, gender, smoking history, cancer history, family history, occupational exposure, recreational exposure, past or present pulmonary diseases (e.g. emphysema), number of satellite nodules around the one to be diagnosed, size of lymph nodes, presence of other suspicious nodules, or location of the nodule in the lung (e.g. upper lobe or lower lobe).

(29) Feature Selection

(30) The feature selection unit finds the most relevant features from the feature pool containing image-based as well clinical features. A GA and SVM-based feature selection process is used. Once the most relevant features are determined based on a training dataset, a classifier or committee of classifiers is built based on the optimal feature subsets and the feature selection unit is no longer required. In the embodiment relating to the committee of classifiers, each committee member can be constructed on a feature subset identified through a separate run of the feature selection algorithm. The diversity of the classifiers in the committee is achieved for example for the GA-based feature selection by giving a different randomly selected set of training and testing data to each GA run. Other feature selection methods, such as statistical difference filters, correlation filters, stepwise linear regression, recursive feature elimination (RFE), and a random feature selection can also be used.

(31) Classifier

(32) Following the supervised learning principle, a classifier is built using the selected optimal feature subset and training data. Possible classifiers are SVMs, decision trees, linear discriminant analysis and neural networks. SVMs are often used since they have shown superior performance with respect to classifiers.

(33) Committees of classifiers can also be used. In this case, several classifiers are built on different feature subsets, which were selected as the best feature subsets via the feature selection process. Each classifier will result in a decision, e.g. whether a nodule is malignant or benign. A majority vote of the classifiers will determine for example, the output of the CADx system for the nodule in question. In likelihood CADx, the output likelihood of the committee may be the fraction of positive votes, or an average (which may be weighted) of the individual likelihood results from each member of the committee.

EXAMPLES

(34) The methods and systems provided herein apply several image processing methods to CADx. Examples of these image processing methods are provided below.

Example 1: Invariant Moments

(35) Moments are a means of mathematically representing the spatial distribution of an object. This includes the shape (binary moments) or density distribution (grayscale moments). Invariant moments are those moments that do not change when the object undergoes some transformation, such as rotation, scaling, and translation. Moment based methods have been used at length in computer vision and optical character recognition. The use of invariant moments, including the first 6 invariant moments, is described by the mathematical formalism of Hu (1962).

(36) For an image I of size N×M, the moments m.sub.pq can be given by:

(37) $m_{pq} = {.Math.}_{x = 1}^{N} {.Math.}_{y = 1}^{M} x^{p} y^{q} I (x, y)$
where I takes on the grayscale intensity of the image at a pixel (x, y) for grayscale moments, or a value of 0 or 1 for binary moments. The centroid of the object is defined using the following moments.
x=m.sub.10/m.sub.00 and y=m.sub.01/m.sub.00

(38) These allow calculation of centralised moments μ.sub.pq that are invariant under translation:

(39) $μ_{pq} = {.Math.}_{x = 1}^{N} {.Math.}_{y = 1}^{M} {(x - \overline{x})}^{p} {(y - \overline{y})}^{q} I (x, y)$
which can be made scale invariant by computing normalised central moments
η.sub.pq=μ.sub.pq/μ.sub.00.sup.γ
where γ=(p+q)/2+1 for all (p+q)≥2

(40) These scale and translation invariant moments can be converted into rotationally invariant moments, using a method described by Hu (IRE Trans Information Theory, IT-8 179-197, 1962). These seven moment invariants are given as the following equations.
H.sub.1=η.sub.20+η.sub.02
H.sub.2=(η.sub.20−η.sub.02).sup.2+4η.sub.11.sup.2
H.sub.3=(η.sub.30−3η.sub.12).sup.2+(3η.sub.21−η.sub.03).sup.2
H.sub.4=(η.sub.30−η.sub.12).sub.2+(η.sub.21+η.sub.03).sup.2
H.sub.5=(η.sub.30−3η.sub.12)(η.sub.30+η.sub.12)+((η.sub.30+η.sub.12).sub.2−3(η.sub.21−η.sub.03).sup.2)
H.sub.6=(η.sub.20−η.sub.02)((η.sub.30+η.sub.12).sup.2−(η.sub.21+η.sub.03).sup.2)+4η.sub.11(η.sub.30+η.sub.12)(η.sub.21+η.sub.03)
H.sub.7=(3η.sub.21−η.sub.03)(η.sub.30+η.sub.12)((η.sub.30η.sub.12).sup.2−3(η.sub.21+η.sub.03).sup.2)+(3η.sub.12)−η.sub.30)(η.sub.21+η.sub.03)(3(η.sub.12+η.sub.30).sup.2−(η.sub.21+η.sub.03).sup.2)

(41) The moments and central moments are computed by iterating over different p and q values and over all pixels in an image, skipping over pixels that are not in the nodule. The image function, I, can be binary or real-valued, and in practice, both are implemented. In this way, each nodule can contribute seven invariant binary moments, and seven invariant grayscale moments. These 14 scalar values can then be used as inputs into a CADx system.

(42) Alternatively, to make the feature extraction robust to segmentation uncertainty, the grayscale moments are calculated by performing the moment calculation on a circular area enclosing the segmented nodule. The pleural wall and irrelevant lung structures are identified via the segmentation and removed. To avoid incorrectly placing 0 values in these removed structures, random noise is inserted into these pixels by sampling from a histogram computed from the retained background.

(43) Further, 3D moments are extracted, and features are derived from the 3D moments in binary mask gray scale data as shown by the following equations.
moment.J11=η.sub.200+η.sub.020+η.sub.002
moment.J21=η.sub.200η.sub.020+η.sub.200η.sub.002+η.sub.020η.sub.002−η.sub.101.sup.2−η.sub.110η.sub.110−η.sub.011.sup.2
moment.J31=η.sub.200η.sub.020η.sub.002−η.sub.002η.sub.110.sup.2+2*η.sub.110*η.sub.101*η.sub.011−η.sub.020*η.sub.101.sup.2−η.sub.200*η.sub.011.sup.2

(44) The three derived features above are also computed for moments derived for the grayscale nodule image.

Example 2: Invariant Fourier Descriptors

(45) Fourier descriptors are a means of mathematically describing the shape of an object. Conceptually, the border of an object is described in terms of frequencies. Low frequencies describe smooth edges and give the general shape of the object, whereas high frequencies describe irregularities and sharp changes in the contour. The Fourier descriptors describe the particular mix of frequencies that constitute a particular object shape. In the methods and systems provided herein, Fourier descriptors that are invariant to scale, translation, and rotation are used. For use in the classifier, two means of condensing the Fourier coefficients into a scalar value are provided.

(46) Initially, the pixels on the edge of a nodule are identified by a number of well-known methods. Care is taken to ensure that these N edge pixels are listed in clockwise or counter-clockwise order around the object. Each edge pixel is described with its x and y coordinate, yielding the vectors x={x.sub.1, x.sub.2, . . . , x.sub.N} and y={y.sub.1, y.sub.2, . . . , y.sub.N}. The discrete Fourier transform for each object is computed using any of a number of techniques (denoted generically as FT), yielding
v=FT{x} and w=FT{y}
which can be decomposed into real and imaginary parts
v.sub.n=a.sub.n+ib.sub.n and w.sub.n=c.sub.n+id.sub.n.
The Fourier descriptors for the shape uniquely defined by x and y is then expressed as:

(47) $f_{n} = \frac{\sqrt{a_{n}^{2} + b_{n}^{2}}}{\sqrt{a_{1}^{2} + b_{1}^{2}}} + \frac{\sqrt{c_{n}^{2} + d_{n}^{2}}}{\sqrt{c_{1}^{2} + d_{1}^{2}}} .$
This yields a vector f which may vary in length between nodules, thus rendering it difficult to use in comparing nodules in a CADx system. In the methods and systems provided herein, vector f is condensed into two scalar descriptors g.sub.1 and g.sub.2:

(48) $g_{1} = ({.Math.}_{n = 2}^{N} \frac{f_{n}}{n}) / {.Math.}_{n = 2}^{N} n and g_{2} = ({.Math.}_{n = 2}^{N} {nf}_{n}) / {.Math.}_{n = 2}^{N} n .$

(49) The mathematical description given here follows that of Nixon et al., Feature Extraction & Image Processing, Butterworth-Heinemann: Woburn, Mass. pp. 269-278 (2002). This calculation is used as an input into a CADx system.

Example 3: Fractal Dimension of Nodule Shape

(50) The contour of an object can be described using the Minkowski-Bouligand fractal dimension, also known as the box-counting dimension. This describes how the measured length of the contour changes with the resolution at which the contour is viewed. A nodule with a high fractal dimension tends to exhibit irregularities in the surface shape, which may indicate malignancy. The fractal dimension can be calculated for 2D, 2.5D, or 3D features, and in fact, all three may be simultaneously used during classification.

(51) A nodule with an outline, shown in the first panel of FIG. 1 (Scale 1), is considered. This outline consists of all pixels that are nearest to the exterior of the nodule. The total number of pixels in the outline is tallied as N (1). The image is then resampled by a factor of ½, shown in FIG. 1 as Scale 2. This can be thought of as tiling 2×2 pixel boxes over the Scale 1 image and seeing how many of these boxes contain and edge pixels. This count of boxes is denoted N (½). This is then repeated with 3×3 pixel boxes to yield N (⅓), and so on. For a fractal object, the value N (1/d) varies with the scale d according to.
N(1/d)=μ((1/d).sup.−FD
where FD is the fractal dimension of the object. By algebraic manipulations, this is changed to:
ln N(1/d)=−FD ln (1/d)+ln μ.

(52) Thus, if the value of of N.sub.d is computed for several d, then the FD can be estimated by a linear fit between ln (1/d) on the x-axis and ln N (1/d) on they-axis. This fit may be a least squares fit or a robust fit, or may be taken by taking the average slope between successive points. An example of such a fit is given in FIG. 2. The slope of this line is used as a feature in a CADx system.

Example 4: Fractal Dimension of Nodule Texture

(53) The texture of an object can be described in terms of fractal dimension. A 2D view of the object is considered as a surface embedded in 3D. The fractal dimension measures the variations in grayscale intensity as the resolution of the image is changed. Higher fractal dimensions suggest greater complexity in the internal structure of the object. As with the border, the fractal dimension of the texture can be calculated for 2D, 2.5D, or 3D features, and in fact, all three may be simultaneously used during classification.

(54) The texture dimension is calculated in a manner similar to that of the border, described above in Example 3, with the following changes: the d×d boxes are not tiled, but overlap; that is, all possible d×d boxes are considered. Then, instead of counting the number of boxes, N (1/d) represents the differences between the maximum and minimum pixel gray value within each d×d box, summed over all boxes of size d×d.

Example 5: Edge Gradient-Based Features

(55) Border pixels of a nodule are identified, and the normal direction pointing away from an object is computed. The derivative is then estimated by computing the difference in grayscale intensity between the border pixel and the pixel some finite distance away in the normal direction. This second external value is typically found by interpolating the value of the image two pixels away. The derivative as a function of position along the contour can then be analyzed statistically to yield scalar features using the mean, the standard deviation, and the root-mean-square variation and first moment of the power spectrum of this function.

(56) As with the Fourier descriptors, each edge pixel and its x and y coordinate are considered, yielding the vectors x={x.sub.1, x.sub.2, . . . , x.sub.N} and y={y.sub.1, y.sub.2, . . . , y.sub.N}. For an edge pixel i located at (x.sub.i, y.sub.i), the normal angle is computed. This is given by:

(57) $α = \frac{1}{2} (\tan^{- 1} (\frac{y_{i + k} - y_{i}}{x_{i + k} - x_{i}}) + \tan^{- 1} (\frac{y_{i} - y_{i - k}}{x_{i} - x_{i - k}})) + 90 °,$
taking care to ensure the consistency of the signs. That is, the normal is computed as being perpendicular to the local curve of the surface, where the local curvature is estimated by averaging between two nearby (but not necessarily adjacent) points. An example of these angles as computed on a nodule is given in FIG. 3.

(58) A point d pixels away from the edge is then defined by the coordinates
x=x.sub.i+d sin α and y=y.sub.i+d cos α,
with an intensity at that location found by bilinear interpolation on the image. The edge gradient is then simply the difference between the interpolated intensity and the intensity of the original edge pixel. This can be computed for every edge pixel in order, as shown in FIG. 4. A large number of statistical features can be calculated from this, including mean and standard deviation. The power spectrum can estimated through many well known techniques, thus yielding information about the frequency content of the fluctuations around the object. The underlying assumption to this analysis is that high frequency variations in intensity may suggest spiculations, which are believed to be an indicator of malignancy. To assess this, the moments and root mean square variation of the power spectrum can be calculated and used as features.

Example 6: Detection and Characterization of Internal Clusters

(59) The presence of dark or bright clusters in the object may be indicative of calcification or cavitation, respectively. The methods and systems provided herein include means of identifying these regions automatically, and using them as features for CADx.

(60) A threshold is iteratively applied to an image, such that the only pixels that remain are above some intensity t. The value oft is lowered, starting at the maximum value in the object to the minimum value of the object. At each threshold, the remaining pixels are grouped into clusters that are connected on the sides. That is, each cluster consists of a set of pixels where it is possible to move from one pixel to any other pixel by travelling in one of the four cardinal directions, without ever leaving the cluster. If the largest cluster has a size of greater than n pixels, then the value of the threshold t is saved and the algorithm halted. The feature that is extracted is the given by either the threshold value expressed in units of image pixel intensity, or as the number of standard deviations between the mean intensity of the rest of the detected cluster and the critical threshold. Similarly, by thresholding below the iterated intensity threshold, it is possible to detect dark clusters; the same two features can be calculated for these clusters.

Example 7: Chain Code

(61) The pixels on the border of an object image can be identified in a continuous order, such that each pixel has a pair of neighbors. By tracking the relative positions of neighboring pixels, a vector describing the shape of an object is identified.

(62) For each pixel on the border, the two neighbors of this pixel are defined using a chain code. Turning to FIG. 6, consider a border given by pixels a-g of image 602.

(63) By placing the centre of template 604 on each of pixel a-g of the boundary, the chain code description is read out {1, 2, 1, 2, 4, 4, . . . } with a difference between successive values of {1, −1, 1, 2, 0, . . . } or an absolute difference of {1, 1, 1, 2, 0, . . . }.

(64) The distribution of these chain code and chain code difference values can be used to calculate features. The fraction of absolute difference values greater than 1, 2, . . . , 6 can be used as a feature. This fraction can be used to detect the number of sudden changes in direction, thus describing the irregularity of nodules, or concepts such as spiculation and lobulation.

Example 8: Texture: Neighbourhood Gray-Tone Difference Matrix

(65) Several mathematical methods exist for approximating the human perception of texture of an image, including the aforementioned fractal method. An alternative method is based on what is known as the neighborhood gray-tone difference matrix (NGTDM). This method seeks to quantitatively describe the differences between each pixel and its surrounding neighborhood, leading to mathematical descriptions that have been shown in psychometric tests to correlate well with subjective ratings of the abstract qualities including coarseness, contrast, busyness, complexity, and strength.

(66) The description given by Amadsun and King (IEEE Trans Sys Man Cybernetics 19(5): 1264-1274, 1989) is followed. The NGTDM is a matrix formed as follows. The N×N image as a whole is quantized to a predetermined number of levels, g. For every pixel in the particular ROI with quantized intensity denoted by f(k,l),

(67) $A (k, l) = \frac{1}{{(2 d + 1)}^{2} - 1} [{.Math.}_{m = - d}^{d} {.Math.}_{n = - d}^{d} f (k + m, l + n)] where (m, n) \neq (0, 0)$
That is, A is the average of a (2d+1)×(2d+1) neighborhood around the pixel of interest, excluding the pixel itself. The NGTDM matrix N has one column and as many rows as their are levels of intensity in the image. The row i is then given by

(68) $N (i) = \underset{k, l}{.Math.} .Math. i - A (k, l) .Math.$
where the summation is taken over all pixels where f(k,l)=i Care is taken to restrict the calculation to the area within the region of interest (ROI), whether this is a nodule itself or the region outside the nodule.

(69) The probability for each bin, p(i), is defined as the fraction of centre pixels that contributed to the calculation of N(i). Two features are shown here; others are described in detail in Amad sun et al., IEEE Trans Sys Man Cybernetics, 19(5): 1264-1274, 1989. These features are:

(70) $coarseness = {[\underset{i}{.Math.} p (i) N (i)]}^{- 1}$ $contrast = [\frac{1}{G (G - 1)} {.Math.}_{i = 0}^{g} {.Math.}_{j = 0}^{g} p (i) p (j) {(i - j)}^{2}] [\frac{1}{n^{2}} {.Math.}_{i = 0}^{g} N (i)]$
where n=(N−2d) and G is the actual number of the g levels that appear in the image.

(71) The methods and systems provided herein include pre-processing CT images for CADx, several new features that can be used as inputs into a CADx system and advanced machine learning techniques for feature selection and classification. The methods and systems provided herein overcome difficulties in dealing with thick-slice CT volumes, provide robustness to errors in identifying the border of lung nodules, and are optimized to improve diagnostic accuracy of classification systems. The methods and systems provided herein use pre-processing and features to solve this problem by providing 2D, 2.5D, and 3D characterization of lesions, such as benign and malignant lung nodules. Therefore, a machine learning system using uses these features can distinguish more accurately benign nodules from malignant nodules and can achieve higher specificity and sensitivity than the systems without them. Furthermore, the proposed system has the capability of combining image-based and clinical information about the patient into the decision making process.

(72) The CADx methods and systems provided herein can be used with several modalities, for example MRI and CT. The methods and systems provided herein can be used in radiology workstations (exemplified but not limited to Philips Extended Brilliance Workstation, Philips Mx8000, and Philips Brilliance CT scanners) or incorporated into PACS systems (e.g. Stentor iSite). The CADx methods and systems provided herein can be used in diagnosing different diseases, including but not limited to colon polyps, liver cancer, and breast cancer.

(73) It will furthermore be apparent that other and further forms of the invention, and embodiments other than the specific and exemplary embodiments described above, may be devised without departing from the spirit and scope of the appended claims and their equivalents, and therefore it is intended that the scope of this invention encompasses these equivalents and that the description and claims are intended to be exemplary and should not be construed as further limiting. The contents of all references cited herein are incorporated by reference.

Advanced computer-aided diagnosis of lung nodules

Assignee

Inventors

Cpc classification

Classification Explorer

G16Z99/00

PHYSICS

Classification Explorer

G06T7/0012

PHYSICS

Classification Explorer

G06T7/48

PHYSICS

Classification Explorer

A61B6/5223

HUMAN NECESSITIES

Classification Explorer

G06T2207/30061

PHYSICS

Classification Explorer

A61B6/5217

HUMAN NECESSITIES

Classification Explorer

G16H50/30

PHYSICS

International classification

Classification Explorer

G06T7/00

PHYSICS

Classification Explorer

G06T7/48

PHYSICS

Classification Explorer

A61B6/00

HUMAN NECESSITIES

Abstract

Claims

Description