METHOD FOR PREDICTING THE DEVELOPMENT OF ARTHRITIS IN INDIVIDUALS PRIOR TO RADIOGRAPHIC OR SYMPTOMATIC PRESENTATION

Abstract

A method for detecting symptomatic osteoarthritis in a human patient who is otherwise asymptomatic comprises: taking a plurality of magnetic resonance (MR) image signal features from MR sequences in a joint of the patient; submitting the MR image signal features to a classifier for performing a feature reduction for redundant or unnecessary features; calculating a signal texture index (STI) value from the remaining image features; comparing that STI value against the STI values of one group of individuals suspected of being vulnerable to developing osteoarthritis at the first point in time; and the same group of individuals years later; predicting from that comparison of STI values a likelihood of the patient developing osteoarthritis; and treating the patient accordingly.

Claims

1. A method for predicting a type of osteoarthritis of a joint or cartilage of a human patient prior to a radiographic or symptomatic presentation of osteoarthritis in the human patient, said method comprising: (a) taking a plurality of magnetic resonance (MR) sequence maps of one or more regions of cartilage in the joint or cartilage of the human patient; (b) submitting said plurality of MR sequence maps to a classifier for performing a feature reduction that calculates which redundant or unnecessary features may be removed; (c) eliminating said redundant or unnecessary features from said plurality of MR sequence maps to form a minimum grouping of image features for the patient; (d) calculating from said minimum grouping of image features a signal texture index (STI) value; (e) comparing the STI value for the patient against a first database comprising a set of individuals suspected of being vulnerable to developing osteoarthritis and a second database of the same set of individuals from the first database who have gone on to develop osteoarthritis after a point of time has passed; and (f) predicting, based on above comparison step (e), a likelihood of the patient developing osteoarthritis.

2. The method of claim 1, which further comprises: (g) treating the patient based on predicting step (f).

3. The method of claim 1 wherein the joint or cartilage is in the human patient's knee, ankle, hip, shoulder, elbow, or wrist.

4. The method of claim 1 wherein the joint or cartilage is from the human patient's knee, and the plurality of magnetic resonance (MR) sequences for the human patient's knee are taken mostly from a dominant compartment of the human patient's knee, said dominant compartment selected from the human patient's patella, medial or lateral knee compartments.

5. The method of claim 1 wherein the classifier from step (b) is selected from the group consisting of a linear classifier and a non-linear classifier.

6. The method of claim 1 wherein step (d) includes using one or more histogram measures selected from the group consisting of: mean, standard deviation, variance, dispersion, average energy, energy, skewness and kurtosis.

7. The method of claim 1 wherein step (d) includes using one or more measures selected from the group consisting of: gray level co-occurrence matrix (GLCM), gray level run length (GLRL), and Z-scores.

8. The method of claim 1 wherein the point of time in step (e) is three (3) years after initial diagnosis.

9. A method for predicting and treating osteoarthritis of a patient's knee or hip before any radiographic or symptomatic presentation of osteoarthritis is observed in the patient's knee or hip, said method comprising: (a) taking a plurality of magnetic resonance (MR) sequence maps of one or more regions of the patient's knee or hip; (b) submitting said plurality of MR sequence maps to a classifier for performing a feature reduction that calculates which features may be removed; (c) eliminating said features from said plurality of MR sequence maps to form a minimum grouping of image features for the patient; (d) calculating from said minimum grouping of image features a signal texture index (STI) value for the patient; (e) comparing the patient's STI value against two population databases, the first database comprising a plurality of individuals suspected of being vulnerable to developing osteoarthritis; and a second database of the same plurality of individuals years later, some of whom did develop osteoarthritis but also including others in that same plurality of individuals who did not develop osteoarthritis at least about three (3) years later; (f) based on above comparison step (e), predicting a likelihood of the patient developing osteoarthritis; and (g) treating the patient based on predicting step (f).

10. The method of claim 9 wherein step (a) includes taking magnetic resonance (MR) sequences from a dominant compartment of the patient's patella, medial and lateral knee compartments.

11. The method of claim 9 wherein the classifier from step (b) is selected from the group consisting of a linear classifier and a non-linear classifier.

12. The method of claim 9 wherein step (d) includes using one or more histogram measures selected from the group consisting of: mean, standard deviation, variance, dispersion, average energy, energy, skewness and kurtosis.

13. The method of claim 9 wherein step (d) includes using one or more measures selected from the group consisting of: gray level co-occurrence matrix (GLCM), gray level run length (GLRL), and Z-scores.

14. A method for predicting a patient's likelihood of developing osteoarthritis in a knee or hip before any symptomatic presentation of osteoarthritis has been observed in the patient using radiographic and a patient-reported outcome score (such as a WOMAC) comparison against a group of individuals suspected of being vulnerable to developing osteoarthritis at a first point in time and comparing that same group of individuals years later, said method comprising: (a) taking a plurality of magnetic resonance (MR) sequence maps of one or more regions of the patient's knee or hip; (b) submitting said plurality of MR sequence maps to a classifier for performing a feature reduction that calculates which features may be removed; (c) eliminating said features from said plurality of MR sequence maps to form a minimum grouping of image features for the patient; (d) calculating from said minimum grouping of image features a signal texture index (STI) value for the patient; (e) compiling a first index of STI values for the group of individuals suspected of being vulnerable to developing osteoarthritis at the first point in time; and a second index of STI values for the same group of individuals years later; (f) comparing the patient's STI value against the two indexes of STI values from step (e); (g) based on above comparison step (f), predicting the patient's likelihood of developing osteoarthritis; and (h) treating the patient based on predicting step (g).

15. The method of claim 14 wherein step (a) includes taking magnetic resonance (MR) sequences from a dominant compartment of the patient's patella, medial and lateral knee compartments.

16. The method of claim 14 wherein the classifier from step (b) is selected from the group consisting of a linear classifier and a non-linear classifier.

17. The method of claim 14 wherein step (d) includes using one or more histogram measures selected from the group consisting of: mean, standard deviation, variance, dispersion, average energy, energy, skewness and kurtosis.

18. The method of claim 14 wherein step (d) includes using one or more measures selected from the group consisting of: gray level co-occurrence matrix (GLCM), gray level run length (GLRL), and Z-scores.

Description

BRIEF DESCRIPTION OF DRAWINGS

[0019] Further features, objectives and advantages of this invention will be made clearer with the following detailed description made with reference to the accompanying drawings in which:

[0020] FIG. 1A is a schematic flowchart according to one embodiment of this invention;

[0021] FIG. 1B is a schematic flowchart showing a second preferred embodiment of Applicants' method steps;

[0022] FIG. 2A is a graph comparing signal texture index (STI) versus density as means for identifying early signs of OA on a T2 map;

[0023] FIG. 2B is a graph plotting trial data using an SVM classifier from twenty dimensions onto two dimensions;

[0024] FIG. 3 is a graph plotting true versus false positive rates with the invention, the diagonal line therein representing random guessing;

[0025] FIG. 4A is a graph plotting STI versus density with the first, second and third compartments indicated along with the SVM decision boundary;

[0026] FIG. 4B is a graph showing the average STI, by notched box plots, for each of the three compartments; and

[0027] FIG. 4C is a graph plotting for individuals with a dominant medial or lateral compartment, the varus or valgus mechanical axis alignment associated with an increased STI.

DESCRIPTION OF PREFERRED EMBODIMENTS

[0028] First, referring to the accompanying drawings, FIG. 1A shows a schematic representation of one experimental design according to this invention. Particularly therein, Control 10 and OA Progression 20 were fed as input to a Feature Extract 30. Output from that Feature Extract 30 was Split into 2 Groups 40 and used to Train an SVM Classifier 50, the SVM score output from which (Arrow 55) is a Signal Texture Metric. The latter gets forwarded to a Test Classifier 60 that went into a loop for repeating 100 times (Item 70). Output from the latter repeating (Arrow 80) looped back to the step of Splitting into 2 Groups (Item 40), one of which passes directly into Test Classifier 60.

[0029] FIG. 1B is a flowchart showing the steps for one method (generally item 200) of practicing the present invention. Particularly, these method steps include: taking a plurality of MRI sequence maps of one or more regions of a patient's knee or hip (item 202); from the aforesaid maps, extracting image features including texture and histogram metrics (item 204); submitting this group of image features to a set classifier for performing a feature reduction thereon (item 206); and eliminating removable (redundant and/or unnecessary) from these sequence maps to form a minimum grouping of image features for a particular patient (item 208). Next, the method calculates from the remaining minimum image grouping an STI value for the patient in question (item 210). That value is then compared against TWO database populations of the same group of individuals separated in a preset time (preferably 3 years apart). This substep, item 212 uses a first database of individuals initially suspected of being vulnerable to developing osteoarthritis. Then, comparing those same value against the aforementioned populations years later during which time some may have gone on to develop osteoarthritis but others have not. From that comparison of the two set of databases (of the same individuals), the inventors are able to better predict the likelihood of the individual patient developing osteoarthritis near term (item 214). Based on that predicting step 214, the patient can then be treated for his/her condition (item 216).

[0030] The Signal texture index identifies early signs of OA on T2 maps per accompanying FIG. 2. More particularly, FIG. 2A plotted histograms of the Signal Texture Index (STI) for the control and OA populations. A positive score therein corresponded to an OA decision, and a negative score a control decision. Accuracy was about 80%. Therein, the SVM decision boundary is indicated by the solid vertical line to the left of 0. The results shown are the combined 1000 trials with an average number of 20 features needed to build the STI.

[0031] For FIG. 2B visualization, multidimensional scaling was used to project the data of one such trial using the SVM classifier from twenty dimensions onto two dimensions. Admittedly, that has some cost in representation fidelity. The axes are dimensionless, and represent a summation of the different image features used to determine the signal texture index (or STI).

[0032] In FIG. 3, Receiver operating characteristic (or ROC) curveThe sensitivity is equivalent to the true positive rate, and specificity equivalent to true negative rate. The diagonal line therein represents the results of random guessing.

[0033] For the charts at FIG. 4, a signal texture index (STI) that indicates OA is associated primarily with one knee compartment. More particularly, FIG. 4A showed how the features that dominate OA decisions, for most subjects, come from primarily one knee compartment (medial, lateral, or patella). To demonstrate this, the aggregate STI partial scores were calculated for each knee compartment in each subject. A histogram for the compartment with largest partial score (First), second largest partial score (Second), and minimum partial score (labeled Third) across subjects as a function of Density where the area under each curve is unity. In that same FIG. 4A, the SVM decision score is shown as the vertical black line.

[0034] For FIG. 4B, the average STI in each compartment of 4A is statistically different. Notched box plots show the average STI for each of the three compartments. Finally, in FIG. 4C, the dominant compartment that predicted OA progression is strongly correlated with mechanical alignment. Notched box plot of individuals with a dominant medial or lateral compartment contribution to STI was compared as a function of the mechanical axis. Individuals with a varus alignment were associated with an increased STI in the medial compartment and vice versa for valgus alignment. The patella as the dominant compartment in the texture index was excluded. Notches note a 95% confidence interval of the mean. Negative mechanical axis values indicate valgus alignment. * p<0.05.

[0035] Population Cohort: Patients were selected from the OAI cohort. A total of 201 patients were selected, 89 control and 112 symptomatic. Specific inclusion criteria are as follows. Control subjects were selected from the control cohort defined as a low WOMAC score (<5) with low KL score that had no risk factors for OA progression. The OA rapid progression cohort were selected from the incidence cohort based on the initial criteria of a low WOMAC pain score less than 10 that had no radiographic signs of OA (KL<1) and that had a change in WOMAC pain score of >10. The incidence cohort did not have OA risk factors or symptomatic OA (risk factors: 1. previous knee surgery; 2. overweight as defined by ages cutoffs of 45-69 males>92.9 kg and females>77.1 kg; 3. previous knee injury defined by an injury of difficulty walking for at least one week; 4. family history in parent or sibling of total knee replacement; 5. Heberden's Nodes defined as self-report of bony enlargement of one or more enlargements of the distal interphalangeal joints in either hand; symptomatic OA: 1. Kellgren and Lawrence (KL) grade<2 on fixed flexion radiographs; 2. no frequent knee symptoms for at least one month during the past 12 months defined as pain, aching, or stiffness in or around the knee on most days). The incidence cohort did have risk factors for OA progression (OAI exclusion criteria included rheumatoid arthritis, bilateral total knee joint replacement, and a positive pregnancy test. Institutional review board approval had been obtained at all participating institutions in the OAI, and informed consent had been obtained by all participants in the study.

[0036] MR Image Acquisition:

[0037] In the OAI cohort, three dimensional sagital DESS and T2 mapping images were acquired from the imaging database freely available by request [11]. Briefly, MRI of the knee joint was performed on a 3.0 T Siemens whole body MAGNETOM Trio 3T scanner (Siemens, Erlangen, Germany) using a standard extremity coil. For high-spatial-resolution 3D DESS imaging [12], a total of 160 sections were acquired with a field of view (FOV) of 14 cm (matrix 384384) with an in-plane spatial resolution of 0.3650.365 mm and a slice thickness of 0.7 mm with an acquisition time of 11 min. For sagittal 2D dual-echo fast spin echo (FSE) sequence for mapping T2 relaxation time, TR was 2700 ms and 7 echo images with TE ranging 10-80 ms were acquired with matrix of 384384, in-plane resolution of 0.3130.313 mm, FOV of 12 cm, acquisition time 12 min and slice thickness of 3 mm. OAI data sets used included the baseline imaging data set 0.E.1 and 0.C.2.

[0038] Plain Radiographic Assessment:

[0039] Standard bilateral standing posterior-anterior fixed flexion knee radiographs were obtained at the baseline visit. Knees were positioned with a 20-30 flexion and 10 internal rotation of the feet in a plexiglass frame (SynaFlexer, CCBR-Synarc, San Francisco, Calif., USA). Knee radiographs were graded using the using the Kellgren-Lawrence (KL) scoring system (Lawrence, 1957). The patello-femoral joint was not included in the KL score as the OAI protocol used the fixed flexion knee radiograph for KL scoring.

[0040] Standard bilateral, full length lower limb radiographs were obtained at the one-year clinical visit with knees fully extended and feet place six inches apart directly facing the film centered at the knees. Mechanical axis was measured using the standard technique of measuring the angle placed from the center of the femoral head to the medial tibial prominence to the midline of the ankle (McGory 2002, pub med id:12439260). OAI data sets used included the baseline and one year imaging data set 0.E.1 and 0.C.2.

[0041] Clinical Assessment:

[0042] Clinical symptoms were assessed with the Western Ontario and McMaster Universities Osteoarthritis (WOMAC) questionnaire at the time of magnetic resonance screening (Bellamy, id 3068365). The OAI clinical data set 0.2.2 was used for data collection.

[0043] Registration:

[0044] DESS and T2 images were registered using the Mattes mutual information metric. Registration software was built using the insight toolkit, a C++ open source image analysis library (www.itk.org). DESS images possess higher resolution and were transformed through three-dimensional space to preserve the voxel information on the fixed T2 image using a verser transform. Linear interpolation was used in sampling voxels on non-grid positions. A specialized gradient decent optimizer is used to define the transform parameters through successive iterations as the search space is large across 6 degrees of freedom. After the transform, the mutual information metric is used to assess the degree of alignment between the two images and the process is repeated until a maximum degree of overlap has been achieved (Urish 2013, pub med id: 23997865).

[0045] Segmentation:

[0046] Segmentation was completed on DESS images. Segmentation of the femoral and patellar cartilage was completed using custom semi-automated software implementing a global active statistical shape model with a local active contour model. Gross inaccuracies in the segmentation could be corrected by a manual correction of the computer segmentation. Binary masks of the lateral and medial femoral condyle and patella were generated from the segmented images.

[0047] 1. The lateral and medial masks were split into 5 sections for each individual. The patella region was treated as a single section. There were 11 regions of interest (ROI) per individual. The cartilage region for the lateral and medial compartments resembles an arc or semi-circle. In order to add an extra level of spatial localization ability to the feature set, the lateral and medial masks are divided into 5 sections each. Note that the medial and lateral masks are divided independently, so the section boundaries for the medial compartment are not necessarily the same as the ones for the lateral compartment. The automatic mask division algorithm is as follows: Find all segmented masks within the current patient's current compartment (medial or lateral).

[0048] 2. Superimpose all segmented masks onto a single image.

[0049] 3. Find , the angle of the arc of cartilage.

[0050] 4. Divide by 5 to find .sub.S, and then draw section boundaries at intervals of .sub.S.

[0051] 5. Starting on the far left and rotating counterclockwise, number the sections 1-5.

[0052] Note that the patella masks are usually much smaller than the lateral and medial masks, so they are not divided and are treated as a single section. Therefore, there are 5 medial sections, 5 lateral sections, and 1 patella section for a total of 11 sections. Each feature is measured independently in each section, so in general there are 11 instances of each feature. This is an arbitrary number and any number of regions of interest could be selected in operation of this invention. The specific segmentation technique used is not significant for our method and any type of segmentation method selected from the group of automatic, semi-automatic, and manual may be used.

[0053] T2 Maps:

[0054] T2 maps were calculated from the Multi-Slice-Multi-Echo T2 images available in the OAI. Calculation of the T2 maps have been previously described (Smith and Mosher; Pubmed id 11436214). Briefly, the T2 maps are calculated on a voxel-by-voxel basis using a linear least squares fitting with CCHIPS/IDL software (Cincinnati Children's Hospital Image Processing Software/Interactive Data Language, (RSI, Boulder, Colo.). The MR T2 signal decay of cartilage is mono-exponential, and the signal intensity decay can be expressed as an exponential decay as a function of time for each voxel. Quantitative T2 maps can be visualized as a color-coded image using an ordinal rainbow scale.

[0055] Image Feature Extraction:

[0056] Candidate features were calculated from each T2 map using the segmented binary masks region of interest using a matlab script (Mathworks, Natick, Mass.). Each feature was independently measured in each of the 11 sections on each knee. There were four main categories of features: histogram, grey level co-occurrence matrix (GLCM), grey level run length matrix (GLRL), and z-score. The numbers reported below are the totals from all 11 sections. A 32-bin histogram was used to calculate the mean, variance, entropy, and central moments. GLCM features were calculated from the grey level co-occurrence matrices at unit distance and angles 0, 45, 90, 135 degrees, and 90 degrees in the z direction. GLRL features were calculated from grey level run length matrices at angles 0 and 90 degrees. The Z-score was calculated for all voxels in each section. The mean value, variance, minimum value, maximum value, and range of values were then calculated (n=55). In each of the 11 sections, a total of 725 features were measured on each T2 map. All features were normalized to the range [1,1].

[0057] Each mask is used to extract the appropriate voxels from the corresponding T2 map. As mentioned before, there are 5 medial sections, 5 lateral sections, and 1 patella section for a total of 11 sections that voxels can originate from. Features are measured independently for the voxels from each section, creating 11 instances of the same feature, each from a different location in the right knee.

[0058] Primarily, textural features were chosen to represent the images. This is because texture can measure statistical properties and spatial distribution of the image intensities [24]. This fact makes textural features very attractive for predicting OA status from the T2 map, and many of the initial features have been used in previous OA studies. The initial feature set can be split into four categories: 1) histogram; 2) gray level co-occurrence matrix (GLCM); 3) gray level run-length matrix (GLRL); 4) z-score.

[0059] All feature calculations are performed with custom-made Matlab functions. There are 725 total features in the initial feature set. In the following sections, let I.sub.j(u,v,w) be the intensity of the pixel at index (u,v,w) in section j and S.sub.j is the set of all voxel indices in section j.

[0060] Histogram Features:

[0061] For each section, a 32-bin histogram is calculated from the intensity values of the T2 maps. The histogram in section j is calculated according to the equation:

[00001] $p_{j} (x) = \frac{\begin{matrix} number .Math. .Math. of .Math. .Math. voxels .Math. .Math. in .Math. .Math. section .Math. .Math. j \\ with .Math. .Math. quantized .Math. .Math. gray .Math. .Math. level .Math. .Math. q_{z} .Math. .Math. in .Math. .Math. S_{j} \end{matrix}}{total .Math. .Math. number .Math. .Math. of .Math. .Math. voxels .Math. .Math. in .Math. .Math. S_{j}}, x = 0, 1, .Math. .Math., 31$

Note that within section j, the histogram p.sub.j actually does not depend on the spatial location of the voxels. This means that all histogram features do not depend on the location of the voxels within their specified section, but instead the histogram features measure the statistical properties of the voxel intensities. Even though there is no spatial dependency, histogram features are still often used in texture measurement. Also, some level of spatial information is included in this feature category since each feature is measured independently in each section, so each feature instance is localized to one specific location in the knee. The following equations define the histogram features:

[00002] $\begin{matrix} Mean .Math. : & m_{j} = {.Math.}_{x = 0}^{31} .Math. {xp}_{j} (x) \\ Variance .Math. : & _{j}^{2} = {.Math.}_{x = 0}^{31} .Math. {(x - m_{j})}^{2} .Math. p_{j} (x) \\ Dispersion .Math. : & {.Math.}_{x = 0}^{31} .Math. .Math. x - m_{j} .Math. .Math. p_{j} (x) \\ Average .Math. .Math. Energy .Math. : & {.Math.}_{x = 0}^{31} .Math. x^{2} .Math. p_{j} (x) \\ Energy .Math. : & {.Math.}_{x = 0}^{31} .Math. {[p_{j} (x)]}^{2} \\ Entropy : & - {.Math.}_{x = 0}^{31} .Math. .Math. p_{j} (x) .Math. \log_{2} (p_{j} (x)) \\ Skewness : & _{j}^{- 3} .Math. {.Math.}_{x = 0}^{31} .Math. {(x - m_{j})}^{3} .Math. p_{j} (x) \\ Kurtosis : & _{j}^{- 4} ({.Math.}_{x = 0}^{31} .Math. {(x - m_{j})}^{4} .Math. p_{j} (x)) - 3 \end{matrix}$

In addition to these features, the median, mode, minimum value, maximum value, and range of values is calculated from the histogram. An 8-bin histogram is also calculated for each section j and the occupancy of each bin is used as a feature. Also, the following two additional miscellaneous features are included in this category even though they are not calculated using the histogram.

[00003] $\begin{matrix} Relative .Math. .Math. Size .Math. : & \frac{number .Math. .Math. of .Math. .Math. voxels .Math. .Math. in .Math. .Math. S_{j}}{number .Math. .Math. of .Math. .Math. voxels .Math. .Math. in .Math. .Math. corresponding .Math. .Math. compartment} \\ L .Math. .Math. 2 .Math. .Math. norm .Math. : & \sqrt{\underset{(u, v, w) S_{j}}{.Math.} .Math. {[I_{j} (u, v, w)]}^{2}} \end{matrix}$

Note that the relative size feature is not calculated for the patella section because there is only a single section in the patella compartment, which would make this feature equal to 1 for all patients.

[0062] Gray Level Co-Occurrence Matrix (GLCM):

[0063] Histogram features alone cannot completely characterize texture since they do not measure the spatial characteristics of the cartilage region. The second-order histogram, called the gray level co-occurrence matrix (GLCM), is a common tool for measuring cartilage. In this study, the GLCM is calculated for a distance of 1 (the voxels must be immediate neighbors in the specified direction) and for direction =0, 45, 90, 135, and 90 in the z (third dimension) direction. Before the GLCM is calculated, the intensities are quantized down to 8 gray levels, which results in each GLCM being an 88 matrix.

If h.sub.j,(x,y) is divided by the total number of neighboring pixels in section j, then the GLCM becomes an estimate of the joint probability f.sub.j,(x,y). The following features are calculated from f.sub.j,(x,y).

[00004] $\begin{matrix} Angular .Math. .Math. Second .Math. .Math. Moment .Math. : & {.Math.}_{x = 0}^{7} .Math. {.Math.}_{y = 0}^{7} .Math. {[f_{j,} (x, y)]}^{2} \\ Contrast .Math. : & {.Math.}_{x = 0}^{7} .Math. {.Math.}_{y = 0}^{7} .Math. {(x - y)}^{2} .Math. f_{j,} (x, y) \\ Absolute .Math. .Math. Value .Math. : & {.Math.}_{x = 0}^{7} .Math. {.Math.}_{y = 0}^{7} .Math. .Math. x - y .Math. .Math. f_{j .Math.,} (x, y) \\ Inverse .Math. .Math. Difference .Math. : & {.Math.}_{x = 0}^{7} .Math. {.Math.}_{y = 0}^{7} .Math. \frac{f_{j .Math. .Math.} (x, y)}{1 + {(x - y)}^{2}} \\ GLCM .Math. .Math. Entropy .Math. : & - {.Math.}_{x = 0}^{7} .Math. {.Math.}_{y = 0}^{7} .Math. f_{j, .Math.} (x, y) .Math. \log_{2} .Math. f_{j, .Math.} (x, y) \\ Correlation .Math. : & {.Math.}_{x = 0}^{7} .Math. {.Math.}_{y = 0}^{7} .Math. \frac{{xyf}_{j, .Math.} (x, y) -_{u} .Math._{v}}{_{u} .Math._{v}} \end{matrix}$

Note that .sub.u, .sub.v, and .sub.u, .sub.v are the means and standard deviations of the marginal distributions created by the row and column sums of the matrix, respectively. There are 5 different GLCMs (1 for each direction) calculated for each section, and since there are 11 sections this means that there are 55 total GLCMs calculated for a single patient. Each feature is measured independently for each GLCM.

[0064] Gray Level Run-Length Matrix (GLRL):

[0065] The gray level run-length matrix (GLRL) is another typical tool used in texture analysis. The GLRL g.sub.j,(x,y) is defined as the number of runs of length y in the direction consisting of points with gray level x in section j. Before the GLRL is calculated, the cartilage image intensities are quantized down to 8 gray levels. The GLRL is calculated for =0,90. The largest possible run length is the largest number of voxels that lie in direction , but the run lengths are quantized to 4 possible ranges. Let P.sub.j be the total number of voxels in section j, and let N.sub.j,be the sum of all elements in g.sub.j,. The following features are calculated from the GLRL g.sub.j,(x,y).

[00005] $\begin{matrix} Short .Math. .Math. Runs .Math. .Math. Emphasis .Math. : & {.Math.}_{x = 0}^{7} .Math. {.Math.}_{y = 0}^{3} .Math. \frac{g_{j,} (x, y)}{y^{2}} / N_{j,} \\ Long .Math. .Math. Runs .Math. .Math. Emphasis .Math. : & {.Math.}_{x = 0}^{7} .Math. {.Math.}_{y = 0}^{3} .Math. y^{2} .Math. g_{j,} (x, y) .Math. /_{N_{j,}} \\ Gray .Math. .Math. Level .Math. .Math. Nonuniformity .Math. : & {.Math.}_{x = 0}^{7} .Math. {({.Math.}_{y = 0}^{3} .Math. g_{j,} (x, y))}^{2} .Math. /_{N_{j,}} \\ Run .Math. .Math. Percentage .Math. : & {.Math.}_{x = 0}^{7} .Math. {.Math.}_{y = 0}^{3} .Math. g_{j,} (x, y) .Math. /_{P_{1}} \end{matrix}$

There are 2 different GLRLs per section, which means there are 22 total GLRLs calculated per patient. Each feature is calculated independently for each GLRL.

[0066] Z-Score:

[0067] A special normalization procedure that produced features that had a significant correlation with OA damage can be defined as:

[00006] $\hat{I_{j}} (u, v, w) = \frac{I_{j} (u, v, w) -_{jcontrol}}{_{j, control}}$

where .sub.j,control and .sub.j,control are the mean and standard deviation of section j for only the patients in the control group. This can be seen as normalization to healthy voxels, and therefore it may help the symptomatic voxels stand out more than normal. From the transformed intensities .sub.1, the features calculated are the mean, variance, minimum value, maximum value, and range of values. Similar to the histogram features, the z-score features do not have the ability to identify spatial characteristics of the voxels within section j. However, since the features are measured independently from each section, each feature instance is a localized measurement.

[0068] Feature Normalization:

[0069] For each feature independently, each feature instance is normalized to the range [1,1]. This is done because there might simultaneously be very large and very small feature values, which could cause numerical problems in the classifier training procedure

[0070] Classification, Feature Elimination, and Partial Sum Measurements:

[0071] Given the features, we need to calculate a classification decision for each patient. The classification decision is reduced to a binary decision (healthy or symptomatic) for simplicity. In this case, each patient is treated as a 725-dimensional feature vector and the classifier maps this vector into a binary value. Since the feature space is very high-dimensional (especially considering there are only 210 patients in this 725-dimensional space), decreasing the dimensionality of this space could potentially lead to better generalization accuracy. Feature selection and classifier training are performed using a training set of patients, and then the generalization accuracy is estimated on a patient test set that is disjoint from the training set. This chapter is set-up as follows: the following section describes the support vector machine (SVM) classifier and the reason it was chosen, then the next section describes the feature selection algorithm margin-based feature elimination (MFE), and then the final section explains the structure of the feature selection and classification experiment.

[0072] Support Vector Machine:

[0073] Support vector machine (SVM) training and testing were implemented using the LIBSVM Matlab interface. To assess the performance of the classifier, we randomly divided the entire cohort into 100 equal-sized training and test subsets with equal numbers of control and rapid progression individuals. In each of the 100 trials, the SVM classifier was trained to discriminate between control and rapid progression OA populations using all 725 features on the training set, and the accuracy of the classifier was measured on the independent test set. The confusion matrix was calculated after each trial. Margin based feature elimination (MFE) was used to eliminate redundant and uninformative candidate features. In the same trial, SVM training was coupled with MFE to identify a reduced set of essential features. The accuracy of the reduced feature set was tested on the test data set, and the confusion matrix was again determined (Arrow 80 in FIG. 1). After classification was completed and the signal texture index was calculated, the signal texture index of each compartment was determined. In each of the 100 trials, the partial weighted linear sum of the medial femoral condyle, lateral femoral condyle, and patella contribution to the each individuals overall SVM score was determined. Results were normalized based on the number of regions in each compartment (5 for the medial and lateral condyle, one for the patella), and averaged across the 100 separate trials. Any number of trial numbers, SVM or related classification methods, or methods to form the test and training sets on the available data sets could be used, and the method presented here is only an example of implementation.

[0074] As a specific example of implementation, each patient is represented as a data point in k-dimensional space, with k being the number of features. As mentioned previously, the number of initial features before the feature selection phase is 725 and the number of classes is 2. For now, assume these data points are linearly separable; in other words, the two classes can be separated using a single linear surface (hyperplane) of dimension k1. This separating hyperplane essentially splits the k-dimensional space in two, with each subspace corresponding to one of the two classes. This hyperplane is known as the linear discriminant function (LDF), and in the case of SVM this hyperplane has the form

0=w.sup.Tx+b

where w is the kx1 SVM weight vector, x is the kx1 feature vector, and b is a scalar bias term. In this case, w is a vector normal to the hyperplane and |b|/w is the perpendicular distance between the hyperplane and the origin where w is the Euclidean norm of the weight vector.

[0075] The choice of this separating hyperplane is not unique, and the choice of a particular hyperplane is the strength of SVM.

[0076] Since there are two classes, there are two class labels: 1 and +1, corresponding to healthy and symptomatic respectively. Let y.sub.i be the class label for patient i and x.sub.i be the feature vector for this patient. The SVM decision score for patient i is d.sub.i and is written as

[00007] $d_{i} = {.Math.}_{i = 0}^{h - 1} .Math. w_{j} .Math. x_{i, j} + b$

and the actual classification decision for patient i is {circumflex over (d)}.sub.i=sgn(d.sub.i)
where sgn is the signum function. For all training patients in a linearly separable data set, the following inequality holds:

y.sub.id.sub.i1

It can be shown that the patients from each class closest to the hyperplane are a perpendicular distance 1/w away from it. This distance is called the margin of the hyperplane, and these patients are called the support vectors. The SVM procedure chooses a separating hyperplane such that the margin is maximized while satisfying the inequalities for every training instance. This can be stated formally as an optimization problem:

Minimize w.sup.2 subject to y.sub.id.sub.i1

[0077] The hyperplane is chosen to maximize margin because this hyperplane is proven to have better accuracy on unseen data points. Since we do not know the true distribution of data points in the feature space, the unseen data points may easily cross the separator and therefore be classified incorrectly. The separating hyperplane is chosen such that the margin between support vectors is maximized: the hyperplane with largest margin will typically account for the distribution of the classes better than other separators, and therefore it should perform better on unseen test data.

[0078] Other than maximizing margin between the classes and separating hyperplane, the strength of SVMs is the use of support vectors. In the derivation of the weight vector w and bias b, the support vectors are the only data points that affect w and b. This means that the classifier is uniquely determined by the choice of support vectors and only the support vectors. The number of model parameters in the classifier derivation depends on the number of support vectors. Therefore, unlike in other classifiers, the number of model parameters is not determined by the feature dimensionality and is instead bounded by the number of training instances, which causes the SVM to be more robust against overfitting.

[0079] Margin Based Feature Elimination:

[0080] Feature selection is integrated into the classifier training process for this study. Selecting a subset of features from the initial feature set is necessary in order to eliminate redundant and non-informative features, and it is also possible to improve the generalization performance of the classification process [13].

[0081] Since margin-maximization is the goal of the SVM training procedure, a feature selection method was chosen that uses this margin as the criterion for removing features. This algorithm is called margin-based feature elimination (MFE). The goal of MFE is to maximize the SVM margin with each feature elimination step. This is a wrapper algorithm, meaning the trained classifier is used to determine the order of feature removal. In other words, classifier training is a part of the feature selection process, and the classifier is retrained many times to determine the usefulness of the features. MFE represents a computationally-efficient feature elimination algorithm that works together with the goal of the SVM training procedure and has been shown to perform well on standard datasets.

[0082] The MFE algorithm for linear SVMs has been described in detail [32]. A simplified version of the algorithm is explained: 1. Train a SVM using the current feature set. 2. Calculate the effect on the SVM decision function of removing each feature separately. 3. Remove the feature whose removal results in the largest SVM margin. 4. If linear separability is lost, stop. 5. Go to step 1, using the reduced feature set.

[0083] Note that in the version of MFE used in this study, the feature elimination is terminated when the training data becomes linearly nonseparable. MFE could be altered to continue removing features after this point, but this would require introducing slackness into the SVM training procedure. SVM training and MFE is simplified by not using slackness, and it was found experimentally that this method works well without considering slackness.

[0084] Statistics:

[0085] Data is expressed as a meanstandard deviation, except where noted. Direct comparisons between two cell populations were made using an unpaired, two-tailed Student's t-test. Statistical significance was determined if P<0.05. Multiple group comparisons were made using two-way ANOVA, using the Student-Newman-Keuls pairwise comparison to determine significance levels. Conventions for box plot include the mean outlined by the box representing the 25% and 75% quantiles, whiskers representing the minimum and maximum value, outliers denoted with a circle, and notches representing the 95% confidence interval of the mean.

[0086] Receiver operating characteristic (ROC) analysis was performed on the entire set using standard techniques. The procedure and methods are discussed in detail using the thesis of Matthew Keffalas, The Pennsylvania State University, Electrical Engineering, Schreyer Honors College, 2010.

[0087] Classification Experiment:

[0088] An example of implementing the entire method described above is as follows. The experiment is designed to estimate the performance of the combined SVM/MFE procedure. We must do this by using the 210 fully-preprocessed patients. According to the suggestions in [34], the patient set used to test the classifier performance must be completely disjoint from the patient set that is used to train the classifier and perform feature selection. Therefore we test the performance of the classification and feature selection methods by splitting the entire patient set into two equally-sized disjoint sets, training and test. The training set is formed by randomly sampling (without replacement) the full dataset. Each set has the same ratio of classes as the original dataset. The training set is used to select the sparse feature set and train the SVM, and the test set is used to estimate the generalization accuracy of the trained classifier and the selected feature set. This procedure is repeated for 100 trials using 100 different training sets (and therefore 100 different test sets), and the final performance of these methods are estimated by averaging the results from each trial.

[0089] We hypothesized that T2 map signal heterogeneity could accurately prognosticate OA progression. To test this hypothesis, we used the OAI to identify and compare the texture metrics of two populations of T2 maps: an asymptomatic control and a rapid OA progression population. The asymptomatic group was collected from the OAI control cohort (n=89). The rapid progression population was collected from the incidence cohort (n=112). At the initial time point, the population was asymptomatic (WOMAC<10) and had no radiographic signs of OA (KL2). At the 3 year time point, this population experienced a WOMAC change (greater than 10), signifying both a large and rapid progression of symptoms. These populations were comparable in regards to age, sex, and BMI. As expected by cohort definitions between the control and incidence cohorts, the asymptomatic population did have lower WOMAC and KL scores than the rapidly progression population.

[0090] To assess signal heterogeneity, we quantified signal heterogeneity using a series of texture metrics on each of these populations. Asymptomatic and rapid progression populations based on baseline T2 map image features that described texture. Images were segmented and registered so that image texture features could be extracted. DESS images were used for segmentation because of the increased contrast at cartilage-soft tissue and cartilage-bone interfaces. Multimodality registration was used to align DESS and T2 sequences so that the segmentation masks (subdivided into ROI) could be used to measure a range of histogram and texture based image features. We chose as candidate features some well-known descriptors of image texture (local entropy, variance, cross-correlation, run-lengths, histogram based) and integrated a feature reduction step within the classifier training. An image classifier, SVM, was used to develop a model to predict OA, with classifier training and testing via a series of cross-validation experiments, dividing the subpopulations into training and test subsets. Margin based feature elimination was used to eliminate redundant and uninformative features, sacrificing minimal accuracy in order to simplify the model and to identify image feature biomarkers. The classifier design wraps MFE around SVM training, removing one feature at a time. The classifier found an appropriate hyper-plane in image-feature space that separated these populations. The SVM score is the distance from this plane, and can be described as a signal texture metric (Arrow 55 in FIG. 1).

[0091] Three separate cases of classifier accuracy were analyzed. First, the accuracy of using the entire set of all 725 features before feature elimination was measured. The average accuracy of the classifier was greater than 80%, corresponding to an average sensitivity of 82.05.4% and an average specificity of 75.07.3%. Second, MFE was used to remove redundant and uninformative features significantly reducing the feature space. An average of only 20 of the 725 features was needed to maintain a comparable level of accuracy. The average accuracy of the system with MFE feature selection was 76.17.2%, with average sensitivity of 79.16.7% and average specificity of 70.07.7%. Finally, in each of these trials by design, a new separate set of features was selected. If feature reduction was performed on the entire trial set simultaneously, a single set of features for the signal texture index were defined. At a sacrifice of some bias to obtain the single feature set, accuracy was 80% with a sensitivity of 83% and a specificity of 77%. The remainder of the discussion will focus on the second case as it presents the most unbiased classifier accuracy and quantification of signal texture index.

[0092] After feature reduction, the STI had good separation of the asymptomatic and OA populations. By design, the classifier sets a STI value of zero as the decision boundary so that any positive value is determined to be OA progression and any negative value a control decision is made (FIG. 2A). One of the trials with similar accuracy (76.6%) to the entire set of trial corresponds with good separation of the two populations (FIG. 2B). ROC analysis showed excellent classifier performance, and tradeoffs between specificity and sensitivity as a function of the SVM decision boundary (FIG. 3). This invention can be used to prognosticate and/or diagnose cartilage damage, arthritis, or the general state of cartilage health. This demonstrates the minimum accuracy of the invention. Small modifications will improve the accuracy.

[0093] The image texture features that predict rapid progression of OA for most individuals are primarily located in one of the three knee compartments. The STI is calculated from a weighted sum of image feature measurements from the lateral and medial compartment and the patella. By separately considering the features from each compartment (lateral, medial, patella) and finding the partial sum for each section, the effective contribution from each compartment to the overall decision can be determined. The rapid progression population was considered separately in this analysis. The contribution of features in each compartment to the overall signal texture index shows substantial separation between each compartment (FIG. 4A) and the mean of each of these compartments are statistically different (FIG. 4B). This suggests that, for most subjects, a single knee compartment plays a dominant role in rapid progression to symptomatic OA.

[0094] To test the observation that the signal texture index from one compartment plays a dominant role in OA progression, we isolated the medial and lateral sub-populations from the dominant compartment and compared the compartment to the mechanical axis from standing full limb length radiographs. Individuals with a dominant compartment on the medial condyle were highly correlated with valgus alignment, and individuals associated with a dominant compartment on the lateral condyle were associated with a varus alignment. A comparison of these two populations demonstrated the differences were statistically different as measured by the student's t-test. At a minimum, the dominant compartment's location is highly correlated with mechanical axis (FIG. 4C).

[0095] For symptomatic patients that are correctly classified, the most positive partial sum amongst the three sections contributes most to the correct decision. It can thus be inferred that the section with this partial sum is likely the one undergoing the most OA changes. Moreover, a significant disparity between the largest and second largest partial sums for an individual patient suggests OA changes may only be occurring in the knee section with the largest partial sum. Thus, this invention may be used to localize cartilage damage, the progression of disease, or identify area of the joint where healthy cartilage resides.

[0096] This invention had been demonstrated to be used on T2 maps for OA symptomatic prognostication but the demonstration is equally valid on a number of other instances. Any MR sequence including but not limited to dGERMIC (i.e, delayed gadolinium-enhanced magnetic resonance imaging of cartilage), T1, T1 rho, and T2 sequences could be substituted or combined with T2 maps. Texture and feature analysis was used to prognosticate but could be used as a diagnostic test for symptomatic progression or differentiate different stages of disease progression. Further, the same invention can be applied to predicting morphologic changes in articular cartilage including but not limited to changes in cartilage volume, area, or thickness and bone, synovial, inflammatory tissue morphometry. The invention is not specific to the disease of OA but any type of pathologic process effecting human or animal cartilage including but not limited to rheumatoid arthritis, post-traumatic arthritis, cartilage trauma, osteochondritis dissecans, or the general state of cartilage health. Also, the invention can be applied to any joint in the body including but not limited to hip, shoulder, elbow, wrist, and ankle. The specific steps used in this invention can be altered in length, method employed, order, or omission.

[0097] A challenge in classification is the curse of dimensionality. There is a relative paucity of available training samples compared to the large dimensionality of the image feature space and to the number of parameters in the classifier model. This implies that the classifier has a tendency to overfit the data which can degrade the accuracy of the model. To avoid this problem, we applied a linear discriminant function classifier, SVM, to maximize the margin between these two populations. In this sense, the SVM maximizes the separation of the two classes. For an SVM, unlike a standard linear discriminate function, the number of model parameters is bounded by the number of training samples, rather than being controlled by the feature dimensionality. Since the number of samples is typically the much smaller number, in this way the SVM mitigates potential overfitting. SVM is not a unique solution to this process. Any linear or non-linear classifier and method of feature reduction can be applied for use in this invention. An emphasis was placed on defining OA by symptomatic progression. OA could be defined by symptoms, biomarkers, imaging criteria, or any other definition.

[0098] In addition to overcoming the problem of the non-linear response of T2 to cartilage degradation, this approach removes systematic bias. Differences in methodology or instrumentation used in the T2 measurement can lead to differences in the magnitude of T2 values. Since texture analysis compares spatial differences in T2 values between neighboring pixels rather than the absolute T2 values, it effectively uses an internal calibration standard to remove systematic bias. This helps eliminate the variation observed in a sequence as a function of the operator, machine, and location.

[0099] There are distinct differences between results discussed here and those reported by Dam [14]. In Dam's work, they make distinctions between non-progressers and early progressors. Dam uses the definition of early progressors based on radiographic progression. For example, at the end of the time period non-progressors have no change in radiographic findings and early-progressors have an increased change in radiographic findings. Our method presented above and as demonstrated by our results can perform the more difficult task of identifying patients that will have symptomatic progression with no radiographic progression or evidence. For example, Dam's method has the potential to misclassify an individual as an arthritis non-progressor based on no predicted change in their radiographs when they may have had arthritis progression based on symptoms. Our method would have improved accuracy and prognostication by correctly classifying these individuals.

[0100] From this perspective our method identifies an even earlier type of progression in the disease process that is more challenging to detect than the Dam group. A large disadvantage of radiographs is that radiographs are a poor method to detect early arthritis changes (CITE). Individuals may develop arthritis symptoms without having radiographic evidence of arthritis (CITE). Using Dam's definition [14], this group would be entirely missed by his analysis and considered non-progressors. Using our definition and method, this group would not be miss-classified and correctly considered part of the progression. This allows are method to be more sensitive and specific for early arthritis progression than Dam's method. Further, Dam demonstrates that there are differences between progressors and non-progressors but does not actually demonstrate that it can be used as a prognostic tool [14]. Our results presented above, demonstrate its prognostic accuracy.

[0101] Footnoted References above: [0102] 1. David-Vaudey, E., et al., T2 relaxation time measurements in osteoarthritis. Magn Reson Imaging, 2004. 22(5): p. 673-82. [0103] 2. Mosher, T. J. and B. J. Dardzinski, Cartilage MRI T2 relaxation time mapping: overview and applications. Semin Musculoskelet Radiol, 2004. 8(4): p. 355-68. [0104] 3. Mosher, T. J., et al., MR imaging and T2 mapping of femoral cartilage: in vivo determination of the magic angle effect. AJR Am J Roentgenol, 2001. 177(3): p. 665-9. [0105] 4. Wheaton, A. J., et al., Correlation of T1rho with fixed charge density in cartilage. J Magn Reson Imaging, 2004. 20(3): p. 519-25. [0106] 5. Burstein, D. and M. L. Gray, Is MRI fulfilling its promise for molecular imaging of cartilage in arthritis? Osteoarthritis Cartilage, 2006. 14(11): p. 1087-90. [0107] 6. Xia, Y., J. B. Moody, and H. Alhadlaq, Orientational dependence of T2 relaxation in articular cartilage: A microscopic MRI (microMRI) study. Magn Reson Med, 2002. 48(3): p. 460-9. [0108] 7. Harrison, M. M., et al., Patterns of knee arthrosis and patellar subluxation. Clin Orthop Relat Res, 1994(309): p. 56-63. [0109] 8. Maroudas, A. I., Balance between swelling pressure and collagen tension in normal and degenerate cartilage. Nature, 1976. 260(5554): p. 808-9. [0110] 9. Yulish, B. S., et al., Juvenile rheumatoid arthritis: assessment with MR imaging. Radiology, 1987. 165(1): p. 149-52. [0111] 10. Mosher, T. J., B. J. Dardzinski, and M. B. Smith, Human articular cartilage: influence of aging and early symptomatic degeneration on the spatial variation of T2preliminary findings at 3 T Radiology, 2000. 214(1): p. 259-66. [0112] 11. Peterfy, C. G., E. Schneider, and M. Nevitt, The osteoarthritis initiative: report on the design rationale for the magnetic resonance imaging protocol for the knee. Osteoarthritis Cartilage, 2008. 16(12): p. 1433-41. [0113] 12. Eckstein, F., et al., Double echo steady state magnetic resonance imaging of knee articular cartilage at 3 Tesla: a pilot study for the Osteoarthritis Initiative. Ann Rheum Dis, 2006. 65(4): p. 433-41. [0114] 13. Aksu, Y., et al., Margin-maximizing feature elimination methods for linear and nonlinear kernel support vector machines. IEEE Trans On Neural Networks, 2010. 21: p. 701-717. [0115] 14. Dam, E. B. K., DK), Qazi, Arish (Kobenhavn k, DK), Karsdal, Morten (Kobenhavn k, DK), Petterson, Paola C. (Koge, DK), Nielsen, Mads (Dragor, DK), Christiansen, Claus (Morcote, CH), Pathology indicating measure related to cartilage structure and automatic quantification thereof. 2010, Nordic Bioscience Imaging A/S (Herlev, DK): United States.

METHOD FOR PREDICTING THE DEVELOPMENT OF ARTHRITIS IN INDIVIDUALS PRIOR TO RADIOGRAPHIC OR SYMPTOMATIC PRESENTATION

Inventors

Cpc classification

Classification Explorer

A61B5/4528

HUMAN NECESSITIES

Classification Explorer

A61B5/4571

HUMAN NECESSITIES

Classification Explorer

G01R33/5608

PHYSICS

Classification Explorer

A61B5/4595

HUMAN NECESSITIES

Classification Explorer

A61B5/055

HUMAN NECESSITIES

Classification Explorer

G01R33/50

PHYSICS

Classification Explorer

A61B5/459

HUMAN NECESSITIES

Classification Explorer

A61B5/7267

HUMAN NECESSITIES

Classification Explorer

G16H50/20

PHYSICS

Classification Explorer

A61B5/7264

HUMAN NECESSITIES

Classification Explorer

G16H50/30

PHYSICS

Classification Explorer

A61B5/4514

HUMAN NECESSITIES

Classification Explorer

A61B5/4585

HUMAN NECESSITIES

Classification Explorer

A61B5/4576

HUMAN NECESSITIES

Classification Explorer

A61B5/7275

HUMAN NECESSITIES

Classification Explorer

G16H30/40

PHYSICS

Classification Explorer

A61B5/458

HUMAN NECESSITIES

International classification

Classification Explorer

A61B5/00

HUMAN NECESSITIES

Classification Explorer

A61B5/055

HUMAN NECESSITIES

Abstract

Claims

Description