METHOD FOR PREDICTING THE DEVELOPMENT OF ARTHRITIS IN INDIVIDUALS PRIOR TO RADIOGRAPHIC OR SYMPTOMATIC PRESENTATION
20170202520 ยท 2017-07-20
Inventors
- Kenneth L. Urish (Sewickley, PA, US)
- Matthew G. Keffalas (Groton, MA, US)
- Timothy J. Mosher (Elizabethtown, PA, US)
- David J. Miller (State College, PA, US)
Cpc classification
G01R33/5608
PHYSICS
A61B5/055
HUMAN NECESSITIES
G01R33/50
PHYSICS
G16H50/20
PHYSICS
A61B5/7264
HUMAN NECESSITIES
G16H50/30
PHYSICS
A61B5/7275
HUMAN NECESSITIES
International classification
Abstract
A method for detecting symptomatic osteoarthritis in a human patient who is otherwise asymptomatic comprises: taking a plurality of magnetic resonance (MR) image signal features from MR sequences in a joint of the patient; submitting the MR image signal features to a classifier for performing a feature reduction for redundant or unnecessary features; calculating a signal texture index (STI) value from the remaining image features; comparing that STI value against the STI values of one group of individuals suspected of being vulnerable to developing osteoarthritis at the first point in time; and the same group of individuals years later; predicting from that comparison of STI values a likelihood of the patient developing osteoarthritis; and treating the patient accordingly.
Claims
1. A method for predicting a type of osteoarthritis of a joint or cartilage of a human patient prior to a radiographic or symptomatic presentation of osteoarthritis in the human patient, said method comprising: (a) taking a plurality of magnetic resonance (MR) sequence maps of one or more regions of cartilage in the joint or cartilage of the human patient; (b) submitting said plurality of MR sequence maps to a classifier for performing a feature reduction that calculates which redundant or unnecessary features may be removed; (c) eliminating said redundant or unnecessary features from said plurality of MR sequence maps to form a minimum grouping of image features for the patient; (d) calculating from said minimum grouping of image features a signal texture index (STI) value; (e) comparing the STI value for the patient against a first database comprising a set of individuals suspected of being vulnerable to developing osteoarthritis and a second database of the same set of individuals from the first database who have gone on to develop osteoarthritis after a point of time has passed; and (f) predicting, based on above comparison step (e), a likelihood of the patient developing osteoarthritis.
2. The method of claim 1, which further comprises: (g) treating the patient based on predicting step (f).
3. The method of claim 1 wherein the joint or cartilage is in the human patient's knee, ankle, hip, shoulder, elbow, or wrist.
4. The method of claim 1 wherein the joint or cartilage is from the human patient's knee, and the plurality of magnetic resonance (MR) sequences for the human patient's knee are taken mostly from a dominant compartment of the human patient's knee, said dominant compartment selected from the human patient's patella, medial or lateral knee compartments.
5. The method of claim 1 wherein the classifier from step (b) is selected from the group consisting of a linear classifier and a non-linear classifier.
6. The method of claim 1 wherein step (d) includes using one or more histogram measures selected from the group consisting of: mean, standard deviation, variance, dispersion, average energy, energy, skewness and kurtosis.
7. The method of claim 1 wherein step (d) includes using one or more measures selected from the group consisting of: gray level co-occurrence matrix (GLCM), gray level run length (GLRL), and Z-scores.
8. The method of claim 1 wherein the point of time in step (e) is three (3) years after initial diagnosis.
9. A method for predicting and treating osteoarthritis of a patient's knee or hip before any radiographic or symptomatic presentation of osteoarthritis is observed in the patient's knee or hip, said method comprising: (a) taking a plurality of magnetic resonance (MR) sequence maps of one or more regions of the patient's knee or hip; (b) submitting said plurality of MR sequence maps to a classifier for performing a feature reduction that calculates which features may be removed; (c) eliminating said features from said plurality of MR sequence maps to form a minimum grouping of image features for the patient; (d) calculating from said minimum grouping of image features a signal texture index (STI) value for the patient; (e) comparing the patient's STI value against two population databases, the first database comprising a plurality of individuals suspected of being vulnerable to developing osteoarthritis; and a second database of the same plurality of individuals years later, some of whom did develop osteoarthritis but also including others in that same plurality of individuals who did not develop osteoarthritis at least about three (3) years later; (f) based on above comparison step (e), predicting a likelihood of the patient developing osteoarthritis; and (g) treating the patient based on predicting step (f).
10. The method of claim 9 wherein step (a) includes taking magnetic resonance (MR) sequences from a dominant compartment of the patient's patella, medial and lateral knee compartments.
11. The method of claim 9 wherein the classifier from step (b) is selected from the group consisting of a linear classifier and a non-linear classifier.
12. The method of claim 9 wherein step (d) includes using one or more histogram measures selected from the group consisting of: mean, standard deviation, variance, dispersion, average energy, energy, skewness and kurtosis.
13. The method of claim 9 wherein step (d) includes using one or more measures selected from the group consisting of: gray level co-occurrence matrix (GLCM), gray level run length (GLRL), and Z-scores.
14. A method for predicting a patient's likelihood of developing osteoarthritis in a knee or hip before any symptomatic presentation of osteoarthritis has been observed in the patient using radiographic and a patient-reported outcome score (such as a WOMAC) comparison against a group of individuals suspected of being vulnerable to developing osteoarthritis at a first point in time and comparing that same group of individuals years later, said method comprising: (a) taking a plurality of magnetic resonance (MR) sequence maps of one or more regions of the patient's knee or hip; (b) submitting said plurality of MR sequence maps to a classifier for performing a feature reduction that calculates which features may be removed; (c) eliminating said features from said plurality of MR sequence maps to form a minimum grouping of image features for the patient; (d) calculating from said minimum grouping of image features a signal texture index (STI) value for the patient; (e) compiling a first index of STI values for the group of individuals suspected of being vulnerable to developing osteoarthritis at the first point in time; and a second index of STI values for the same group of individuals years later; (f) comparing the patient's STI value against the two indexes of STI values from step (e); (g) based on above comparison step (f), predicting the patient's likelihood of developing osteoarthritis; and (h) treating the patient based on predicting step (g).
15. The method of claim 14 wherein step (a) includes taking magnetic resonance (MR) sequences from a dominant compartment of the patient's patella, medial and lateral knee compartments.
16. The method of claim 14 wherein the classifier from step (b) is selected from the group consisting of a linear classifier and a non-linear classifier.
17. The method of claim 14 wherein step (d) includes using one or more histogram measures selected from the group consisting of: mean, standard deviation, variance, dispersion, average energy, energy, skewness and kurtosis.
18. The method of claim 14 wherein step (d) includes using one or more measures selected from the group consisting of: gray level co-occurrence matrix (GLCM), gray level run length (GLRL), and Z-scores.
Description
BRIEF DESCRIPTION OF DRAWINGS
[0019] Further features, objectives and advantages of this invention will be made clearer with the following detailed description made with reference to the accompanying drawings in which:
[0020]
[0021]
[0022]
[0023]
[0024]
[0025]
[0026]
[0027]
DESCRIPTION OF PREFERRED EMBODIMENTS
[0028] First, referring to the accompanying drawings,
[0029]
[0030] The Signal texture index identifies early signs of OA on T2 maps per accompanying
[0031] For
[0032] In
[0033] For the charts at
[0034] For
[0035] Population Cohort: Patients were selected from the OAI cohort. A total of 201 patients were selected, 89 control and 112 symptomatic. Specific inclusion criteria are as follows. Control subjects were selected from the control cohort defined as a low WOMAC score (<5) with low KL score that had no risk factors for OA progression. The OA rapid progression cohort were selected from the incidence cohort based on the initial criteria of a low WOMAC pain score less than 10 that had no radiographic signs of OA (KL<1) and that had a change in WOMAC pain score of >10. The incidence cohort did not have OA risk factors or symptomatic OA (risk factors: 1. previous knee surgery; 2. overweight as defined by ages cutoffs of 45-69 males>92.9 kg and females>77.1 kg; 3. previous knee injury defined by an injury of difficulty walking for at least one week; 4. family history in parent or sibling of total knee replacement; 5. Heberden's Nodes defined as self-report of bony enlargement of one or more enlargements of the distal interphalangeal joints in either hand; symptomatic OA: 1. Kellgren and Lawrence (KL) grade<2 on fixed flexion radiographs; 2. no frequent knee symptoms for at least one month during the past 12 months defined as pain, aching, or stiffness in or around the knee on most days). The incidence cohort did have risk factors for OA progression (OAI exclusion criteria included rheumatoid arthritis, bilateral total knee joint replacement, and a positive pregnancy test. Institutional review board approval had been obtained at all participating institutions in the OAI, and informed consent had been obtained by all participants in the study.
[0036] MR Image Acquisition:
[0037] In the OAI cohort, three dimensional sagital DESS and T2 mapping images were acquired from the imaging database freely available by request [11]. Briefly, MRI of the knee joint was performed on a 3.0 T Siemens whole body MAGNETOM Trio 3T scanner (Siemens, Erlangen, Germany) using a standard extremity coil. For high-spatial-resolution 3D DESS imaging [12], a total of 160 sections were acquired with a field of view (FOV) of 14 cm (matrix 384384) with an in-plane spatial resolution of 0.3650.365 mm and a slice thickness of 0.7 mm with an acquisition time of 11 min. For sagittal 2D dual-echo fast spin echo (FSE) sequence for mapping T2 relaxation time, TR was 2700 ms and 7 echo images with TE ranging 10-80 ms were acquired with matrix of 384384, in-plane resolution of 0.3130.313 mm, FOV of 12 cm, acquisition time 12 min and slice thickness of 3 mm. OAI data sets used included the baseline imaging data set 0.E.1 and 0.C.2.
[0038] Plain Radiographic Assessment:
[0039] Standard bilateral standing posterior-anterior fixed flexion knee radiographs were obtained at the baseline visit. Knees were positioned with a 20-30 flexion and 10 internal rotation of the feet in a plexiglass frame (SynaFlexer, CCBR-Synarc, San Francisco, Calif., USA). Knee radiographs were graded using the using the Kellgren-Lawrence (KL) scoring system (Lawrence, 1957). The patello-femoral joint was not included in the KL score as the OAI protocol used the fixed flexion knee radiograph for KL scoring.
[0040] Standard bilateral, full length lower limb radiographs were obtained at the one-year clinical visit with knees fully extended and feet place six inches apart directly facing the film centered at the knees. Mechanical axis was measured using the standard technique of measuring the angle placed from the center of the femoral head to the medial tibial prominence to the midline of the ankle (McGory 2002, pub med id:12439260). OAI data sets used included the baseline and one year imaging data set 0.E.1 and 0.C.2.
[0041] Clinical Assessment:
[0042] Clinical symptoms were assessed with the Western Ontario and McMaster Universities Osteoarthritis (WOMAC) questionnaire at the time of magnetic resonance screening (Bellamy, id 3068365). The OAI clinical data set 0.2.2 was used for data collection.
[0043] Registration:
[0044] DESS and T2 images were registered using the Mattes mutual information metric. Registration software was built using the insight toolkit, a C++ open source image analysis library (www.itk.org). DESS images possess higher resolution and were transformed through three-dimensional space to preserve the voxel information on the fixed T2 image using a verser transform. Linear interpolation was used in sampling voxels on non-grid positions. A specialized gradient decent optimizer is used to define the transform parameters through successive iterations as the search space is large across 6 degrees of freedom. After the transform, the mutual information metric is used to assess the degree of alignment between the two images and the process is repeated until a maximum degree of overlap has been achieved (Urish 2013, pub med id: 23997865).
[0045] Segmentation:
[0046] Segmentation was completed on DESS images. Segmentation of the femoral and patellar cartilage was completed using custom semi-automated software implementing a global active statistical shape model with a local active contour model. Gross inaccuracies in the segmentation could be corrected by a manual correction of the computer segmentation. Binary masks of the lateral and medial femoral condyle and patella were generated from the segmented images.
[0047] 1. The lateral and medial masks were split into 5 sections for each individual. The patella region was treated as a single section. There were 11 regions of interest (ROI) per individual. The cartilage region for the lateral and medial compartments resembles an arc or semi-circle. In order to add an extra level of spatial localization ability to the feature set, the lateral and medial masks are divided into 5 sections each. Note that the medial and lateral masks are divided independently, so the section boundaries for the medial compartment are not necessarily the same as the ones for the lateral compartment. The automatic mask division algorithm is as follows: Find all segmented masks within the current patient's current compartment (medial or lateral).
[0048] 2. Superimpose all segmented masks onto a single image.
[0049] 3. Find , the angle of the arc of cartilage.
[0050] 4. Divide by 5 to find .sub.S, and then draw section boundaries at intervals of .sub.S.
[0051] 5. Starting on the far left and rotating counterclockwise, number the sections 1-5.
[0052] Note that the patella masks are usually much smaller than the lateral and medial masks, so they are not divided and are treated as a single section. Therefore, there are 5 medial sections, 5 lateral sections, and 1 patella section for a total of 11 sections. Each feature is measured independently in each section, so in general there are 11 instances of each feature. This is an arbitrary number and any number of regions of interest could be selected in operation of this invention. The specific segmentation technique used is not significant for our method and any type of segmentation method selected from the group of automatic, semi-automatic, and manual may be used.
[0053] T2 Maps:
[0054] T2 maps were calculated from the Multi-Slice-Multi-Echo T2 images available in the OAI. Calculation of the T2 maps have been previously described (Smith and Mosher; Pubmed id 11436214). Briefly, the T2 maps are calculated on a voxel-by-voxel basis using a linear least squares fitting with CCHIPS/IDL software (Cincinnati Children's Hospital Image Processing Software/Interactive Data Language, (RSI, Boulder, Colo.). The MR T2 signal decay of cartilage is mono-exponential, and the signal intensity decay can be expressed as an exponential decay as a function of time for each voxel. Quantitative T2 maps can be visualized as a color-coded image using an ordinal rainbow scale.
[0055] Image Feature Extraction:
[0056] Candidate features were calculated from each T2 map using the segmented binary masks region of interest using a matlab script (Mathworks, Natick, Mass.). Each feature was independently measured in each of the 11 sections on each knee. There were four main categories of features: histogram, grey level co-occurrence matrix (GLCM), grey level run length matrix (GLRL), and z-score. The numbers reported below are the totals from all 11 sections. A 32-bin histogram was used to calculate the mean, variance, entropy, and central moments. GLCM features were calculated from the grey level co-occurrence matrices at unit distance and angles 0, 45, 90, 135 degrees, and 90 degrees in the z direction. GLRL features were calculated from grey level run length matrices at angles 0 and 90 degrees. The Z-score was calculated for all voxels in each section. The mean value, variance, minimum value, maximum value, and range of values were then calculated (n=55). In each of the 11 sections, a total of 725 features were measured on each T2 map. All features were normalized to the range [1,1].
[0057] Each mask is used to extract the appropriate voxels from the corresponding T2 map. As mentioned before, there are 5 medial sections, 5 lateral sections, and 1 patella section for a total of 11 sections that voxels can originate from. Features are measured independently for the voxels from each section, creating 11 instances of the same feature, each from a different location in the right knee.
[0058] Primarily, textural features were chosen to represent the images. This is because texture can measure statistical properties and spatial distribution of the image intensities [24]. This fact makes textural features very attractive for predicting OA status from the T2 map, and many of the initial features have been used in previous OA studies. The initial feature set can be split into four categories: 1) histogram; 2) gray level co-occurrence matrix (GLCM); 3) gray level run-length matrix (GLRL); 4) z-score.
[0059] All feature calculations are performed with custom-made Matlab functions. There are 725 total features in the initial feature set. In the following sections, let I.sub.j(u,v,w) be the intensity of the pixel at index (u,v,w) in section j and S.sub.j is the set of all voxel indices in section j.
[0060] Histogram Features:
[0061] For each section, a 32-bin histogram is calculated from the intensity values of the T2 maps. The histogram in section j is calculated according to the equation:
Note that within section j, the histogram p.sub.j actually does not depend on the spatial location of the voxels. This means that all histogram features do not depend on the location of the voxels within their specified section, but instead the histogram features measure the statistical properties of the voxel intensities. Even though there is no spatial dependency, histogram features are still often used in texture measurement. Also, some level of spatial information is included in this feature category since each feature is measured independently in each section, so each feature instance is localized to one specific location in the knee. The following equations define the histogram features:
In addition to these features, the median, mode, minimum value, maximum value, and range of values is calculated from the histogram. An 8-bin histogram is also calculated for each section j and the occupancy of each bin is used as a feature. Also, the following two additional miscellaneous features are included in this category even though they are not calculated using the histogram.
Note that the relative size feature is not calculated for the patella section because there is only a single section in the patella compartment, which would make this feature equal to 1 for all patients.
[0062] Gray Level Co-Occurrence Matrix (GLCM):
[0063] Histogram features alone cannot completely characterize texture since they do not measure the spatial characteristics of the cartilage region. The second-order histogram, called the gray level co-occurrence matrix (GLCM), is a common tool for measuring cartilage. In this study, the GLCM is calculated for a distance of 1 (the voxels must be immediate neighbors in the specified direction) and for direction =0, 45, 90, 135, and 90 in the z (third dimension) direction. Before the GLCM is calculated, the intensities are quantized down to 8 gray levels, which results in each GLCM being an 88 matrix.
If h.sub.j,(x,y) is divided by the total number of neighboring pixels in section j, then the GLCM becomes an estimate of the joint probability f.sub.j,(x,y). The following features are calculated from f.sub.j,(x,y).
Note that .sub.u, .sub.v, and .sub.u, .sub.v are the means and standard deviations of the marginal distributions created by the row and column sums of the matrix, respectively. There are 5 different GLCMs (1 for each direction) calculated for each section, and since there are 11 sections this means that there are 55 total GLCMs calculated for a single patient. Each feature is measured independently for each GLCM.
[0064] Gray Level Run-Length Matrix (GLRL):
[0065] The gray level run-length matrix (GLRL) is another typical tool used in texture analysis. The GLRL g.sub.j,(x,y) is defined as the number of runs of length y in the direction consisting of points with gray level x in section j. Before the GLRL is calculated, the cartilage image intensities are quantized down to 8 gray levels. The GLRL is calculated for =0,90. The largest possible run length is the largest number of voxels that lie in direction , but the run lengths are quantized to 4 possible ranges. Let P.sub.j be the total number of voxels in section j, and let N.sub.j,be the sum of all elements in g.sub.j,. The following features are calculated from the GLRL g.sub.j,(x,y).
There are 2 different GLRLs per section, which means there are 22 total GLRLs calculated per patient. Each feature is calculated independently for each GLRL.
[0066] Z-Score:
[0067] A special normalization procedure that produced features that had a significant correlation with OA damage can be defined as:
where .sub.j,control and .sub.j,control are the mean and standard deviation of section j for only the patients in the control group. This can be seen as normalization to healthy voxels, and therefore it may help the symptomatic voxels stand out more than normal. From the transformed intensities .sub.1, the features calculated are the mean, variance, minimum value, maximum value, and range of values. Similar to the histogram features, the z-score features do not have the ability to identify spatial characteristics of the voxels within section j. However, since the features are measured independently from each section, each feature instance is a localized measurement.
[0068] Feature Normalization:
[0069] For each feature independently, each feature instance is normalized to the range [1,1]. This is done because there might simultaneously be very large and very small feature values, which could cause numerical problems in the classifier training procedure
[0070] Classification, Feature Elimination, and Partial Sum Measurements:
[0071] Given the features, we need to calculate a classification decision for each patient. The classification decision is reduced to a binary decision (healthy or symptomatic) for simplicity. In this case, each patient is treated as a 725-dimensional feature vector and the classifier maps this vector into a binary value. Since the feature space is very high-dimensional (especially considering there are only 210 patients in this 725-dimensional space), decreasing the dimensionality of this space could potentially lead to better generalization accuracy. Feature selection and classifier training are performed using a training set of patients, and then the generalization accuracy is estimated on a patient test set that is disjoint from the training set. This chapter is set-up as follows: the following section describes the support vector machine (SVM) classifier and the reason it was chosen, then the next section describes the feature selection algorithm margin-based feature elimination (MFE), and then the final section explains the structure of the feature selection and classification experiment.
[0072] Support Vector Machine:
[0073] Support vector machine (SVM) training and testing were implemented using the LIBSVM Matlab interface. To assess the performance of the classifier, we randomly divided the entire cohort into 100 equal-sized training and test subsets with equal numbers of control and rapid progression individuals. In each of the 100 trials, the SVM classifier was trained to discriminate between control and rapid progression OA populations using all 725 features on the training set, and the accuracy of the classifier was measured on the independent test set. The confusion matrix was calculated after each trial. Margin based feature elimination (MFE) was used to eliminate redundant and uninformative candidate features. In the same trial, SVM training was coupled with MFE to identify a reduced set of essential features. The accuracy of the reduced feature set was tested on the test data set, and the confusion matrix was again determined (Arrow 80 in
[0074] As a specific example of implementation, each patient is represented as a data point in k-dimensional space, with k being the number of features. As mentioned previously, the number of initial features before the feature selection phase is 725 and the number of classes is 2. For now, assume these data points are linearly separable; in other words, the two classes can be separated using a single linear surface (hyperplane) of dimension k1. This separating hyperplane essentially splits the k-dimensional space in two, with each subspace corresponding to one of the two classes. This hyperplane is known as the linear discriminant function (LDF), and in the case of SVM this hyperplane has the form
0=w.sup.Tx+b
where w is the kx1 SVM weight vector, x is the kx1 feature vector, and b is a scalar bias term. In this case, w is a vector normal to the hyperplane and |b|/w is the perpendicular distance between the hyperplane and the origin where w is the Euclidean norm of the weight vector.
[0075] The choice of this separating hyperplane is not unique, and the choice of a particular hyperplane is the strength of SVM.
[0076] Since there are two classes, there are two class labels: 1 and +1, corresponding to healthy and symptomatic respectively. Let y.sub.i be the class label for patient i and x.sub.i be the feature vector for this patient. The SVM decision score for patient i is d.sub.i and is written as
and the actual classification decision for patient i is {circumflex over (d)}.sub.i=sgn(d.sub.i)
where sgn is the signum function. For all training patients in a linearly separable data set, the following inequality holds:
y.sub.id.sub.i1
It can be shown that the patients from each class closest to the hyperplane are a perpendicular distance 1/w away from it. This distance is called the margin of the hyperplane, and these patients are called the support vectors. The SVM procedure chooses a separating hyperplane such that the margin is maximized while satisfying the inequalities for every training instance. This can be stated formally as an optimization problem:
Minimize w.sup.2 subject to y.sub.id.sub.i1
[0077] The hyperplane is chosen to maximize margin because this hyperplane is proven to have better accuracy on unseen data points. Since we do not know the true distribution of data points in the feature space, the unseen data points may easily cross the separator and therefore be classified incorrectly. The separating hyperplane is chosen such that the margin between support vectors is maximized: the hyperplane with largest margin will typically account for the distribution of the classes better than other separators, and therefore it should perform better on unseen test data.
[0078] Other than maximizing margin between the classes and separating hyperplane, the strength of SVMs is the use of support vectors. In the derivation of the weight vector w and bias b, the support vectors are the only data points that affect w and b. This means that the classifier is uniquely determined by the choice of support vectors and only the support vectors. The number of model parameters in the classifier derivation depends on the number of support vectors. Therefore, unlike in other classifiers, the number of model parameters is not determined by the feature dimensionality and is instead bounded by the number of training instances, which causes the SVM to be more robust against overfitting.
[0079] Margin Based Feature Elimination:
[0080] Feature selection is integrated into the classifier training process for this study. Selecting a subset of features from the initial feature set is necessary in order to eliminate redundant and non-informative features, and it is also possible to improve the generalization performance of the classification process [13].
[0081] Since margin-maximization is the goal of the SVM training procedure, a feature selection method was chosen that uses this margin as the criterion for removing features. This algorithm is called margin-based feature elimination (MFE). The goal of MFE is to maximize the SVM margin with each feature elimination step. This is a wrapper algorithm, meaning the trained classifier is used to determine the order of feature removal. In other words, classifier training is a part of the feature selection process, and the classifier is retrained many times to determine the usefulness of the features. MFE represents a computationally-efficient feature elimination algorithm that works together with the goal of the SVM training procedure and has been shown to perform well on standard datasets.
[0082] The MFE algorithm for linear SVMs has been described in detail [32]. A simplified version of the algorithm is explained: 1. Train a SVM using the current feature set. 2. Calculate the effect on the SVM decision function of removing each feature separately. 3. Remove the feature whose removal results in the largest SVM margin. 4. If linear separability is lost, stop. 5. Go to step 1, using the reduced feature set.
[0083] Note that in the version of MFE used in this study, the feature elimination is terminated when the training data becomes linearly nonseparable. MFE could be altered to continue removing features after this point, but this would require introducing slackness into the SVM training procedure. SVM training and MFE is simplified by not using slackness, and it was found experimentally that this method works well without considering slackness.
[0084] Statistics:
[0085] Data is expressed as a meanstandard deviation, except where noted. Direct comparisons between two cell populations were made using an unpaired, two-tailed Student's t-test. Statistical significance was determined if P<0.05. Multiple group comparisons were made using two-way ANOVA, using the Student-Newman-Keuls pairwise comparison to determine significance levels. Conventions for box plot include the mean outlined by the box representing the 25% and 75% quantiles, whiskers representing the minimum and maximum value, outliers denoted with a circle, and notches representing the 95% confidence interval of the mean.
[0086] Receiver operating characteristic (ROC) analysis was performed on the entire set using standard techniques. The procedure and methods are discussed in detail using the thesis of Matthew Keffalas, The Pennsylvania State University, Electrical Engineering, Schreyer Honors College, 2010.
[0087] Classification Experiment:
[0088] An example of implementing the entire method described above is as follows. The experiment is designed to estimate the performance of the combined SVM/MFE procedure. We must do this by using the 210 fully-preprocessed patients. According to the suggestions in [34], the patient set used to test the classifier performance must be completely disjoint from the patient set that is used to train the classifier and perform feature selection. Therefore we test the performance of the classification and feature selection methods by splitting the entire patient set into two equally-sized disjoint sets, training and test. The training set is formed by randomly sampling (without replacement) the full dataset. Each set has the same ratio of classes as the original dataset. The training set is used to select the sparse feature set and train the SVM, and the test set is used to estimate the generalization accuracy of the trained classifier and the selected feature set. This procedure is repeated for 100 trials using 100 different training sets (and therefore 100 different test sets), and the final performance of these methods are estimated by averaging the results from each trial.
[0089] We hypothesized that T2 map signal heterogeneity could accurately prognosticate OA progression. To test this hypothesis, we used the OAI to identify and compare the texture metrics of two populations of T2 maps: an asymptomatic control and a rapid OA progression population. The asymptomatic group was collected from the OAI control cohort (n=89). The rapid progression population was collected from the incidence cohort (n=112). At the initial time point, the population was asymptomatic (WOMAC<10) and had no radiographic signs of OA (KL2). At the 3 year time point, this population experienced a WOMAC change (greater than 10), signifying both a large and rapid progression of symptoms. These populations were comparable in regards to age, sex, and BMI. As expected by cohort definitions between the control and incidence cohorts, the asymptomatic population did have lower WOMAC and KL scores than the rapidly progression population.
[0090] To assess signal heterogeneity, we quantified signal heterogeneity using a series of texture metrics on each of these populations. Asymptomatic and rapid progression populations based on baseline T2 map image features that described texture. Images were segmented and registered so that image texture features could be extracted. DESS images were used for segmentation because of the increased contrast at cartilage-soft tissue and cartilage-bone interfaces. Multimodality registration was used to align DESS and T2 sequences so that the segmentation masks (subdivided into ROI) could be used to measure a range of histogram and texture based image features. We chose as candidate features some well-known descriptors of image texture (local entropy, variance, cross-correlation, run-lengths, histogram based) and integrated a feature reduction step within the classifier training. An image classifier, SVM, was used to develop a model to predict OA, with classifier training and testing via a series of cross-validation experiments, dividing the subpopulations into training and test subsets. Margin based feature elimination was used to eliminate redundant and uninformative features, sacrificing minimal accuracy in order to simplify the model and to identify image feature biomarkers. The classifier design wraps MFE around SVM training, removing one feature at a time. The classifier found an appropriate hyper-plane in image-feature space that separated these populations. The SVM score is the distance from this plane, and can be described as a signal texture metric (Arrow 55 in
[0091] Three separate cases of classifier accuracy were analyzed. First, the accuracy of using the entire set of all 725 features before feature elimination was measured. The average accuracy of the classifier was greater than 80%, corresponding to an average sensitivity of 82.05.4% and an average specificity of 75.07.3%. Second, MFE was used to remove redundant and uninformative features significantly reducing the feature space. An average of only 20 of the 725 features was needed to maintain a comparable level of accuracy. The average accuracy of the system with MFE feature selection was 76.17.2%, with average sensitivity of 79.16.7% and average specificity of 70.07.7%. Finally, in each of these trials by design, a new separate set of features was selected. If feature reduction was performed on the entire trial set simultaneously, a single set of features for the signal texture index were defined. At a sacrifice of some bias to obtain the single feature set, accuracy was 80% with a sensitivity of 83% and a specificity of 77%. The remainder of the discussion will focus on the second case as it presents the most unbiased classifier accuracy and quantification of signal texture index.
[0092] After feature reduction, the STI had good separation of the asymptomatic and OA populations. By design, the classifier sets a STI value of zero as the decision boundary so that any positive value is determined to be OA progression and any negative value a control decision is made (
[0093] The image texture features that predict rapid progression of OA for most individuals are primarily located in one of the three knee compartments. The STI is calculated from a weighted sum of image feature measurements from the lateral and medial compartment and the patella. By separately considering the features from each compartment (lateral, medial, patella) and finding the partial sum for each section, the effective contribution from each compartment to the overall decision can be determined. The rapid progression population was considered separately in this analysis. The contribution of features in each compartment to the overall signal texture index shows substantial separation between each compartment (
[0094] To test the observation that the signal texture index from one compartment plays a dominant role in OA progression, we isolated the medial and lateral sub-populations from the dominant compartment and compared the compartment to the mechanical axis from standing full limb length radiographs. Individuals with a dominant compartment on the medial condyle were highly correlated with valgus alignment, and individuals associated with a dominant compartment on the lateral condyle were associated with a varus alignment. A comparison of these two populations demonstrated the differences were statistically different as measured by the student's t-test. At a minimum, the dominant compartment's location is highly correlated with mechanical axis (
[0095] For symptomatic patients that are correctly classified, the most positive partial sum amongst the three sections contributes most to the correct decision. It can thus be inferred that the section with this partial sum is likely the one undergoing the most OA changes. Moreover, a significant disparity between the largest and second largest partial sums for an individual patient suggests OA changes may only be occurring in the knee section with the largest partial sum. Thus, this invention may be used to localize cartilage damage, the progression of disease, or identify area of the joint where healthy cartilage resides.
[0096] This invention had been demonstrated to be used on T2 maps for OA symptomatic prognostication but the demonstration is equally valid on a number of other instances. Any MR sequence including but not limited to dGERMIC (i.e, delayed gadolinium-enhanced magnetic resonance imaging of cartilage), T1, T1 rho, and T2 sequences could be substituted or combined with T2 maps. Texture and feature analysis was used to prognosticate but could be used as a diagnostic test for symptomatic progression or differentiate different stages of disease progression. Further, the same invention can be applied to predicting morphologic changes in articular cartilage including but not limited to changes in cartilage volume, area, or thickness and bone, synovial, inflammatory tissue morphometry. The invention is not specific to the disease of OA but any type of pathologic process effecting human or animal cartilage including but not limited to rheumatoid arthritis, post-traumatic arthritis, cartilage trauma, osteochondritis dissecans, or the general state of cartilage health. Also, the invention can be applied to any joint in the body including but not limited to hip, shoulder, elbow, wrist, and ankle. The specific steps used in this invention can be altered in length, method employed, order, or omission.
[0097] A challenge in classification is the curse of dimensionality. There is a relative paucity of available training samples compared to the large dimensionality of the image feature space and to the number of parameters in the classifier model. This implies that the classifier has a tendency to overfit the data which can degrade the accuracy of the model. To avoid this problem, we applied a linear discriminant function classifier, SVM, to maximize the margin between these two populations. In this sense, the SVM maximizes the separation of the two classes. For an SVM, unlike a standard linear discriminate function, the number of model parameters is bounded by the number of training samples, rather than being controlled by the feature dimensionality. Since the number of samples is typically the much smaller number, in this way the SVM mitigates potential overfitting. SVM is not a unique solution to this process. Any linear or non-linear classifier and method of feature reduction can be applied for use in this invention. An emphasis was placed on defining OA by symptomatic progression. OA could be defined by symptoms, biomarkers, imaging criteria, or any other definition.
[0098] In addition to overcoming the problem of the non-linear response of T2 to cartilage degradation, this approach removes systematic bias. Differences in methodology or instrumentation used in the T2 measurement can lead to differences in the magnitude of T2 values. Since texture analysis compares spatial differences in T2 values between neighboring pixels rather than the absolute T2 values, it effectively uses an internal calibration standard to remove systematic bias. This helps eliminate the variation observed in a sequence as a function of the operator, machine, and location.
[0099] There are distinct differences between results discussed here and those reported by Dam [14]. In Dam's work, they make distinctions between non-progressers and early progressors. Dam uses the definition of early progressors based on radiographic progression. For example, at the end of the time period non-progressors have no change in radiographic findings and early-progressors have an increased change in radiographic findings. Our method presented above and as demonstrated by our results can perform the more difficult task of identifying patients that will have symptomatic progression with no radiographic progression or evidence. For example, Dam's method has the potential to misclassify an individual as an arthritis non-progressor based on no predicted change in their radiographs when they may have had arthritis progression based on symptoms. Our method would have improved accuracy and prognostication by correctly classifying these individuals.
[0100] From this perspective our method identifies an even earlier type of progression in the disease process that is more challenging to detect than the Dam group. A large disadvantage of radiographs is that radiographs are a poor method to detect early arthritis changes (CITE). Individuals may develop arthritis symptoms without having radiographic evidence of arthritis (CITE). Using Dam's definition [14], this group would be entirely missed by his analysis and considered non-progressors. Using our definition and method, this group would not be miss-classified and correctly considered part of the progression. This allows are method to be more sensitive and specific for early arthritis progression than Dam's method. Further, Dam demonstrates that there are differences between progressors and non-progressors but does not actually demonstrate that it can be used as a prognostic tool [14]. Our results presented above, demonstrate its prognostic accuracy.
[0101] Footnoted References above: [0102] 1. David-Vaudey, E., et al., T2 relaxation time measurements in osteoarthritis. Magn Reson Imaging, 2004. 22(5): p. 673-82. [0103] 2. Mosher, T. J. and B. J. Dardzinski, Cartilage MRI T2 relaxation time mapping: overview and applications. Semin Musculoskelet Radiol, 2004. 8(4): p. 355-68. [0104] 3. Mosher, T. J., et al., MR imaging and T2 mapping of femoral cartilage: in vivo determination of the magic angle effect. AJR Am J Roentgenol, 2001. 177(3): p. 665-9. [0105] 4. Wheaton, A. J., et al., Correlation of T1rho with fixed charge density in cartilage. J Magn Reson Imaging, 2004. 20(3): p. 519-25. [0106] 5. Burstein, D. and M. L. Gray, Is MRI fulfilling its promise for molecular imaging of cartilage in arthritis? Osteoarthritis Cartilage, 2006. 14(11): p. 1087-90. [0107] 6. Xia, Y., J. B. Moody, and H. Alhadlaq, Orientational dependence of T2 relaxation in articular cartilage: A microscopic MRI (microMRI) study. Magn Reson Med, 2002. 48(3): p. 460-9. [0108] 7. Harrison, M. M., et al., Patterns of knee arthrosis and patellar subluxation. Clin Orthop Relat Res, 1994(309): p. 56-63. [0109] 8. Maroudas, A. I., Balance between swelling pressure and collagen tension in normal and degenerate cartilage. Nature, 1976. 260(5554): p. 808-9. [0110] 9. Yulish, B. S., et al., Juvenile rheumatoid arthritis: assessment with MR imaging. Radiology, 1987. 165(1): p. 149-52. [0111] 10. Mosher, T. J., B. J. Dardzinski, and M. B. Smith, Human articular cartilage: influence of aging and early symptomatic degeneration on the spatial variation of T2preliminary findings at 3 T Radiology, 2000. 214(1): p. 259-66. [0112] 11. Peterfy, C. G., E. Schneider, and M. Nevitt, The osteoarthritis initiative: report on the design rationale for the magnetic resonance imaging protocol for the knee. Osteoarthritis Cartilage, 2008. 16(12): p. 1433-41. [0113] 12. Eckstein, F., et al., Double echo steady state magnetic resonance imaging of knee articular cartilage at 3 Tesla: a pilot study for the Osteoarthritis Initiative. Ann Rheum Dis, 2006. 65(4): p. 433-41. [0114] 13. Aksu, Y., et al., Margin-maximizing feature elimination methods for linear and nonlinear kernel support vector machines. IEEE Trans On Neural Networks, 2010. 21: p. 701-717. [0115] 14. Dam, E. B. K., DK), Qazi, Arish (Kobenhavn k, DK), Karsdal, Morten (Kobenhavn k, DK), Petterson, Paola C. (Koge, DK), Nielsen, Mads (Dragor, DK), Christiansen, Claus (Morcote, CH), Pathology indicating measure related to cartilage structure and automatic quantification thereof. 2010, Nordic Bioscience Imaging A/S (Herlev, DK): United States.