System, method and computer-accessible medium for evaluating a malignancy status in at-risk populations and during patient treatment management
09734122 · 2017-08-15
Assignee
Inventors
- Marko I. Vuskovic (Bonsall, CA, US)
- Margaret E. Huflejt (La Jolla, CA, US)
- Harvey I. Pass (Bronxville, NY, US)
Cpc classification
G16B40/00
PHYSICS
G16B45/00
PHYSICS
International classification
G01N33/50
PHYSICS
G06F15/00
PHYSICS
Abstract
In accordance with certain exemplary embodiments of the present disclosure, exemplary visualization system, method, procedure and computer-accessible medium can be provided, which individually and/or collectively can be called, e.g., ImmunoRuler (IR). Certain exemplary embodiments of the exemplary IR in accordance with the present disclosure can be used for, e.g., comparing two groups of objects with specific and distinguished properties, such as a group of case subjects with a particular disease and a group of control subjects without that particular disease, and/or a group of subjects with a particular disease who are given a treatment and a group of subjects with the same disease but under another treatment or no treatment at all.
Claims
1. A non-transitory computer-accessible medium having instructions thereon for determining probabilistic cancer information about at least one patient, wherein, when a computing arrangement executes the instructions, the computing arrangement is configured to execute procedures comprising: (a) receiving serum information related to a serum received from the at least one patient; (b) receiving Printed Glycan Array (PGA) information including a set of glycans determined using a training procedure applied to further serum information, wherein the further serum information is related to further serums received from a plurality of further patients; (c) generating further PGA information from the serum information by selecting a further set of glycans based on the set of glycans; and (d) determining the probabilistic cancer information by applying the further PGA information to the PGA information.
2. The computer-accessible medium of claim 1, wherein the scalar quantities comprise continuous scalar quantities.
3. The computer-accessible medium of claim 1, wherein the computing arrangement is configured to determine the probabilistic cancer information by: assigning, to each of a plurality of objects associated with the further PGA information, a scalar quantity that has a respective value; providing the scalar quantity assigned to each of the objects in a single diagram; and sorting the scalar quantities by respective magnitudes, and grouping the scalar quantities according to the respective values thereof.
4. The computer-accessible medium of claim 3, wherein the scalar quantities are grouped according to the respective values.
5. The computer-accessible medium of claim 3, wherein at least one of the scalar quantities is provided as a bar in a bar graph.
6. The computer-accessible medium of claim 5, wherein the providing procedure includes assigning each bar with a particular color based the respective value of the corresponding scalar quantities.
7. The computer-accessible medium of claim 6, wherein each of the colors corresponds to one of the groups formed by the grouping procedure.
8. The computer-accessible medium of claim 7, wherein each of the bars representing a respective value within lower or upper quartiles of each of the groups is assigned a particular shade of the color corresponding to the group.
9. The computer-accessible medium of claim 5, wherein the bar graph includes a cutoff line determined as a function of the respective values.
10. The computer-accessible medium of claim 9, wherein the computing arrangement is further configured to determine the cutoff line using at least one of a specificity or a sensitivity of the objects.
11. The computer-accessible medium of claim 5, wherein the computing arrangement is further configured to classify at least one of the scalar quantities, and assigns a bar representing a value associated with the classified scalar quantity with a color that is different than a color associated with any of the groups generated by the grouping procedure.
12. The computer-accessible medium of claim 3, wherein the computing arrangement is further configured to determine the value of each of the scalar quantities by a linear combination of predictor variables associated with each of the objects, wherein the predictor variables are continuous quantities which characterize at least one inherent property of each of the groups of the objects.
13. The computer-accessible medium of claim 12, wherein the at least one inherent property includes at least one of (i) a binding characteristic of a type of antibodies from a human serum to a library of carbohydrates on a printed glycan array, or (ii) a gene expression on a nucleic acid array.
14. The computer-accessible medium of claim 12, wherein the training procedure is based on predictor variables, which are based on historical information associated with at least one of the objects so as to train the computing arrangement.
15. The computer-accessible medium of claim 3, wherein the computing arrangement is further configured to determine the linear combination using a set of coefficients obtained by at least one of (i) a logistic regression procedure, (ii) a linear regression procedure, (iii) a linear discriminant analysis procedure, or (iv) a support vector machine.
16. The computer-accessible medium of claim 3, wherein the computing arrangement is further configured to adjust each of the values to make the respective values negative for most of the objects of a first group of the groups, and to make the respective values positive for most of objects of a second group of the groups.
17. The computer-accessible medium of claim 16, wherein the values for most of the objects of the first group are less than 0.5, and the respective values for most of the objects of the second group are greater than 0.5.
18. The computer-accessible medium of claim 3, wherein the computing arrangement is further configured to normalize each of the values to be between zero and one.
19. The computer-accessible medium of claim 3, wherein the computing arrangement is further configured to determine the respective value of each of the scalar quantities as a natural logarithm of a ratio of a first distance between the respective value of one of the objects to a center of mass of one group obtained by the grouping procedure, and second distance between the value of the one of the objects to a center of mass of another group obtained by the grouping procedure, wherein the distance is determined based on at least one of a Mahalanobis distance or a Hotelling distance.
20. The computer-accessible medium of claim 3, wherein the scalar quantities are sorted in an increasing numerical order of the respective values from a smallest value to a largest value.
21. The computer-accessible medium of claim 1, wherein the probabilistic cancer information includes at least one of (i) a probability that the at least one patient has an elevated cancer risk, (ii) a probability of a malignancy progression recurrence for the at least one patient, or (iii) a probability of survival from cancer by the at least one patient.
22. A method for determining probabilistic cancer information about at least one patient, comprising: (a) receiving serum information related to a serum received from the at least one patient; (b) receiving Printed Glycan Array (PGA) information including a set of glycans determined using a training procedure applied to further serum information, wherein the further serum information is related to further serums received from a plurality of further patients; (c) generating further PGA information from the serum information by selecting a further set of glycans based on the set of glycans; and (d) using a computer arrangement, determining the probabilistic cancer information by applying the further PGA information to the PGA information.
23. The method of claim 22, further comprising at least one of displaying or storing information associated with at least one of (i) sorting of scalar quantities associated with the serum, or (ii) grouping of the scalar quantities associated with the serum, in a storage arrangement in at least one of a user-accessible format or a user-readable format.
24. The method of claim 22, wherein the probabilistic cancer information includes at least one of (i) a probability that the at least one patient has an elevated cancer risk, (ii) a probability of a malignancy progression recurrence for the at least one patient, or (iii) a probability of survival from cancer by the at least one patient.
25. A system for determining probabilistic cancer information about at least one patient, comprising: a non-transitory computer-accessible medium having executable instructions thereon, wherein when at least one computing arrangement executes the instructions, the at least one computing arrangement is configured to: (a) receive serum information related to a serum received from the at least one patient; (b) receive Printed Glycan Array (PGA) information including a set of glycans determined using a training procedure applied to further serum information, wherein the further serum information is related to further serums received from a plurality of further patients; (c) generate further PGA information from the serum information by selecting a further set of glycans based on the set of glycans; and (d) determine the probabilistic cancer information by applying the further PGA information to the PGA information.
26. The system of claim 25, wherein the probabilistic cancer information includes at least one of (i) a probability that the at least one patient has an elevated cancer risk, (ii) a probability of a malignancy progression recurrence for the at least one patient, or (iii) a probability of survival from cancer by the at least one patient.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1) The foregoing and other objects of the present disclosure will be apparent upon consideration of the following detailed description, taken in conjunction with the accompanying exemplary drawings and claims showing illustrative embodiments of the invention, in which:
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17)
(18) Throughout the figures, the same reference numerals and characters, unless otherwise stated, are used to denote like features, elements, components or portions of the illustrated embodiments. Moreover, while the subject disclosure will now be described in detail with reference to the figures, it is done so in connection with the illustrative embodiments. It is intended that changes and modifications can be made to the described embodiments without departing from the true scope and spirit of the subject disclosure.
DETAILED DESCRIPTION OF EXEMPLARY EMBODIMENTS OF THE DISCLOSURE
(19) Exemplary embodiments of the present disclosure can be implemented by the exemplary system, method, and/or computer-accessible medium in accordance with the present disclosure, which can be collectively and/or individually called, e.g., ImmunoRuler (IR).
(20) For example, the exemplary IR can be, but not limited to, a set of exemplary procedures that can be integrated as a computer-based visual tool. The tool can, for example, provide for discovery and evaluation of glycan-based biomarkers, which can be used to discriminate, determine, distinguish and/or classify a high risk population from patients with malignancy and malignant patients with good and poor prognosis. Certain exemplary embodiments of IR can incorporate all, substantially all and/or a majority of bioinformatics information, including risk scores that can be generated by logistic regression, as well as other feature selection and classification algorithms and/or procedures. Exemplary embodiments of IR can provide, e.g., an easy-to-understand visualization of risk scores that can be associated with control and case samples from a physician's data repository of previously diagnosed/treated patients, as well as of new patients whom have not yet been diagnosed. This exemplary information can be used to assess a probability of patient(s) having an elevated cancer risk and/or a malignancy progression recurrence (e.g., early detection and diagnostic application), and/or to assess the survival probability (e.g., prognostic application).
(21) Certain exemplary embodiments of IR can, e.g., support diagnostic and/or prognostic analysis of data obtained from Printed Glycan Arrays (PGA). For example, IR can provide a visual tool that can, e.g., be easily understood and used by medical doctors, nurses, and/or researchers in cancer biology, immunology and/or oncology. While most of the existing diagnostic tools can provide a limited range of diagnostic classification, the exemplary IR can determine an exemplary risk score of an undiagnosed patient, and present it graphically together with risk scores of already diagnosed patients from a clinical data repository. Such exemplary embodiments can, e.g., facilitate physicians to better estimate an exemplary level of a risk, and to compare it to levels of risk of other known patients. The exemplary IR also can incorporate other information that can be useful, such as, e.g., quartiles, medians, area under a ROC curve (AUC) and/or training precision, and the like.
(22) By including diagnostic information (e.g., characteristics of different malignancies), IR can provide for the identification of patients potentially presenting with—or patients at-risk for, e.g., unexpected malignancies, for example, when utilized during medical check-ups and/or cancer early-detection screening. For example, exemplary embodiments of IR according to the present disclosure can provide for early detection of: (a) breast cancer and/or a breast cancer risk while screening for lung cancer and/or a lung cancer risk; (b) ovarian cancer and/or ovarian cancer risk while screening for breast cancer and/or a breast cancer risk; and/or (c) mesothelioma risk while screening for lung cancer and/or a lung cancer risk. Further, since certain exemplary embodiments of IR can be pre-trained (e.g., pre-configured, pre-programmed, etc.) for various pathologies and malignancy types, such exemplary embodiments of IR can be used to, e.g., screen/diagnose patients for various pathologies malignancies using a single (e.g., one) blood sample. This exemplary polymorphic feature of certain exemplary embodiments of the present disclosure can eliminate a need for multiple tests and/or multiple patient visits.
(23) According to certain exemplary embodiments of the present disclosure, IR can be implemented as, e.g., a web-enabled application which can provide easy access to clinical data and/or performing patient screening, diagnosis and/or prognosis from remote locations via any networked computing arrangement, such as, a, desktop computer, a laptop, a notebook, a tablet, a smartphone, and/or mobile computers, personal digital assistants (PDAs), interactive television consoles, etc.
(24) According to certain exemplary embodiments of the present disclosure, IR can include, e.g., an integrated collection of one or more procedures, computer-accessible medium and/or systems, as well as one or more exemplary visual tools, which can perform classifier training, testing of patients with unknown class membership (e.g., labels) and/or visualization of exemplary results. While certain exemplary embodiments of IR can be configured as an exemplary diagnostic and/or prognostic tool that can complement (e.g., can be compatible and work well with) an exemplary Printed Glycan Arrays (PGA) platform, certain exemplary embodiments of IR can also (or alternatively) be used in other areas of medicine, biology, and/or pattern recognition.
(25) The name “ImmunoRuler” can also be configured to, e.g., process information that can include PGA data, which can be used to quantify a response of an immune system by measuring a level of binding of human antibodies against Tumor Associated Carbohydrates Antigens (TACA) and/or other glycans.
(26) Exemplary embodiments of the system, method, procedure, and/or computer-accessible medium in accordance with the present disclosure can include, e.g., the following:
(27) Exemplary Training Mode: Input: Quantified raw PGA training data (e.g., physician's data repository), pre-processing parameters, feature selection methods, and classifier configuration; Step 1: Data preprocessing (e.g., screening for noise, normalization, transformation and computation of quality control indicators, such as, e.g., Coefficient of variation, Concordance Correlation Coefficient (CCC), and Interclass Correlation Coefficient (ICC)); Step 2: Feature pre-filtering (e.g., selecting a subset of features which pass a univariate test, such as Wilcoxon-Mann-Whitney rank-sum test, with a desired p-value); Step 3: Feature selection (e.g., selecting a specified number of features based on a configured univariate feature selection method, such as Wilcoxon-Mann-Whitney rank-sum test (WMW), or multivariate feature selection method, such as Forward Sequential Feature Selection (FSFS): Backward Sequential Feature Selection (BSFS), Stepwise Feature Selection (SFS), Classification and Regression Trees (CART), Random Forests, Markoy Blankets, Genetic Algorithm (GA) and Ant Colony Optimization (ACO), etc.; Step 4: Generation of the IR discriminant function (e.g., either by logistic regression, Generalized Linear Model (GLM), or by some other projection method such as Fisher Linear Discriminant (FLD), Support Vector Machine (SVM), linear regression etc. The discriminant function preferably depends on configured decision strategy/cutoff value which can be based on equal odds, middle of cluster centroids, ratio of the cost of false positive rate vs. false negative rate, etc.); Step 5: Transformation of the linear discriminant into the risk score; There can be three definitions of risk scores used in IR (described herein): untransformed discriminant function, shifted discriminant function, and discriminant function transformed with sigmoid (logistic) function; Step 6: Computation of IR-performance indicators, such as training precision, sensitivity, specificity, positive predictive value and the area under the ROC curve (AUC); Step 7: (if requested): Computation of sensitivity functions of risk scores with respect to the binding level for various selected features (e.g., glycans); and Step 8: Plotting of a colored bar graph of sorted risk scores (e.g., risk scores can be separated into control and case group, and possibly the test samples).
Exemplary Classification/Test Mode: Input: Quantified raw PGA data of patient(s) with unknown disease status. The exemplary IR can automatically retain the pre-processing parameters, selected features (e.g., signature), and classifier configuration obtained, for example, in training mode. The exemplary IR can also retain the data used for training. Step 1: Removing features rejected by the screening process in training mode; Step 2: Normalization and transformation of new data with the same parameters determined in training node; Step 3: Extracting data that correspond to features selected in training mode, e.g., composing the design matrix; Step 4: Computing the discriminant value(s) using the discriminant function generated in training mode; Step 5: Computing the risk score; and Step 6: Inserting the risk score(s) of test observations into IR diagram generated in training mode.
(28)
z.sub.i=x.sub.i.sup.Tw+w.sub.o (1)
where x.sub.i can represent an exemplary vector of, e.g., exemplary normalized and transformed binding intensities for exemplary patient i (e.g., i=1, 2, . . . , n) for selected exemplary features (e.g., glycans) that can generally include linear, interaction, and/or quadratic terms; w can represent an exemplary vector of exemplary logistic regression coefficients; and w.sub.o can represent an exemplary regression intercept coefficient. An exemplary class membership can be determined by, e.g., the sign of z.sub.i, e.g., z.sub.i≦0 can determine, classify, and/or designate exemplary control values (e.g., based on control observations), while z.sub.i>0 can determine, classify, and/or designate exemplary case values (e.g., based on case observations). Although in this example the projection vector w can be obtained by, e.g., multivariate logistic regression, certain exemplary embodiments of the present disclosure can provide for other projection methods, as described herein above.
(29) As shown in
(30) In this example, an exemplary feature selection procedure can be performed, e.g., with an exemplary forward stepwise feature selection procedure, method and/or procedure (FSFS), and an exemplary projection vector can be obtained, e.g., utilizing an exemplary logistic regression procedure with an exemplary design matrix that contained four interaction terms. The exemplary interaction pairs of selected glycans can be, e.g., GID=(311, 121), (189, 507), (328, 121) and (121, 172). The resulting exemplary training precision can be approximately 87%, exemplary specificity can be approximately 89.2%, exemplary sensitivity can be approximately 84% and exemplary positive predictive value, e.g., can be approximately 85.7%, and the exemplary training AUC value can be approximately 0.924. In this example, exemplary contingency values, e.g., number of true negatives (TN) 104, false positives (FP) 105, false negatives (FN) 106, and true positives (TP) 107, are shown, for example, in a graph of the exemplary diagram 101 of
(31) In accordance with certain exemplary embodiments of the present disclosure, a solution of a larger number of exemplary interaction terms can result in, e.g., higher training precisions but with a potential consequence of overfitting. The utilization of four exemplary interaction terms can be considered as having provided the best unbiased cross-validation precision (e.g., over 75%) and therefore can have an expected minimal overfit. According to certain exemplary embodiments of the present disclosure, if more interaction and linear terms are selected, it can be preferable to use, e.g., larger training samples.
(32)
(33) Diagram 201 includes an exemplary horizontal line 204 that can represent an exemplary cutoff (e.g., decision) value, which, as in this example, can be determined by equal odds. According to certain exemplary embodiments of the present disclosure, the horizontal line 204 can be determined by other decision strategies, as described herein. As shown in
(34)
(35)
(36) For example, if exemplary projection vector w is obtained by logistic regression, the exemplary risk scores can have meaning of conditional probabilities Pr{Yεcase|X}, e.g., exemplary risk score r.sub.i can represent an exemplary probability that exemplary observation i can belong to an exemplary case sample.
(37) An exemplary cutoff value, shown in the exemplary diagram 301 of
(38) As described herein, the risk scores that can be obtained by logistic regression and sigmoid transformation can represent the probabilities of the subject's membership to the case sample. This can also be applied to other linear projection methods and/or procedures, e.g., including FLD and SVM, if the sigmoid function is parameterized, for example, as follows:
(39)
(40) where the parameters m and s can be obtained by maximizing the log-likelihood function:
(41)
And where n.sub.1 and n.sub.2 can be the control and case sample sizes. (See, e.g., Platt, J. C. (2000) “Probabilistic outputs for support vector machines and comparison to regularized likelihood methods”, In: Smola, A, Barlet, P., Scholkopf, B. and Schuuramans, D. (Eds.) “Advances in Large Margin Classifiers”, Cambridge, Mass., MIT Press) In certain exemplary embodiments of the present disclosure, a rapidly converging algorithm for finding the optimal values of m and s can be integrated with the exemplary IR.
(42) According to certain exemplary embodiments of the present disclosure, further information can be added to exemplary IR. For example, each exemplary diagram can be accompanied an exemplary title which can show certain parameters that can be relevant to the IR training, such as, e.g., an exemplary feature selection method, a prefitering parameter, a projection method, selected features, regression coefficients (e.g., exemplary intercept coefficient w.sub.o and exemplary projection vector w), corresponding p-values for the coefficients, a decision strategy, a presentation type, a training precision, a specificity, a sensitivity and a positive predictive value, a training AUC value, a source of training data, a concentration, etc. An example of an exemplary title that can correspond to the exemplary diagram 101 illustrated in
(43) For example,
(44) According to certain exemplary embodiments of the present disclosure, the additional information which can be useful can be estimated (e.g., cross-validated) precisions and AUC values that can include, e.g., an exemplary confidence interval for an exemplary AUC value. These exemplary values, which can be optional in certain exemplary embodiments of the present disclosure, can involve additional computations and/or complex computations, and thus, possibly a longer computational time than may be involved with other exemplary embodiments of the present disclosure.
(45) According to certain exemplary embodiments of the present disclosure, in order to relate exemplary risk scores (e.g., IR bars) to a particular patient from the exemplary training samples, the IR can be accompanied with, e.g., an exemplary list of patient identification numbers (PID), which can be sorted in the same manner as the risk score bars can be sorted. An example of an exemplary list, which can correspond to the three exemplary versions of IR illustrated in diagrams 101, 201, and 301 of
(46) TABLE-US-00001 TABLE 1 List of patients for each group: Control Sample (65): 707, 735, 760, 716, 756, 768, 720, 733, 737, 755 700, 702, 708, 706, 730, 729, 762, 747, 754, 718 713, 705, 721, 743, 739, 725, 764, 775, 704, 738 740, 710, 734, 750, 766, 748, 719, 759, 771, 717 712, 765, 763, 728, 736, 701, 732, 749, 770, 767 744, 731, 714, 774, 741, 769, 745, 772, 758, 715 709, 746, 810, 757, 711 Case Sample (50): 605, 307, 158, 78, 621, 125, 647, 143, 866, 509 107, 278, 317, 529, 145, 59, 523, 494, 322, 49 887, 294, 194, 337, 259, 112, 251, 291, 258, 149 133, 144, 132, 128, 331, 312, 272, 172, 79, 519 440, 347, 636, 298, 184, 313, 168, 99, 179, 361
(47) According to certain exemplary embodiments of the present disclosure, an interactive version of IR can be provided in which exemplary bar values and exemplary PIDs can be obtained by selecting any of the bars (e.g., using a computer mouse, placing the on-screen pointer to a position over an exemplary particular bar and clicking a mouse button to select such exemplary particular bar). According to further exemplary embodiments of this present disclosure, a particular bar can be selected by using, e.g., a computer keyboard, touch-screen, etc.
(48) A snapshot of an exemplary the GUI 2201 is shown, for example, in
(49) According to additional exemplary embodiments of the present disclosure, in order to provide for increased contrast and/or visualization, the exemplary bars 102, 202, and 302 can be filled with the color blue and have black edges, and the exemplary bars 103, 203, and 303 can be filled with the color red and have black edges. Other colors and/or color combinations can also be used. Accordingly, it is possible that, in certain exemplary cases that can involve large samples, the exemplary black bar edges can dominate over the colors of the exemplary bars, which can render the bar colors to be effectively and/or substantially invisible. In such exemplary cases, the bar edges can be removed so that the colors can be visible.
(50)
(51)
(52)
(53)
(54) According to certain exemplary embodiments of the present disclosure, an exemplary IR diagram can provide visible information about, e.g., quartiles of training samples, which can be available as a standard feature in box plots. For example, such information can be made visible by, e.g., brightening the color (or grayscale shade) of the exemplary bars of certain control and case samples, such that a darker color/shade can contain and/or represent exemplary observations, measurements, values, etc., which can be, e.g., within the 25 and 75 percentiles.
(55)
(56)
(57)
(58) In accordance with certain exemplary embodiments of the present disclosure, IR can be configured and/or adapted to be used for or in, e.g., a particular exemplary application. For example, in an exemplary case of screening of an exemplary general population for breast cancer, where the prevalence of disease can be very low, it can be important to have a high exemplary specificity, while an exemplary sensitivity can be considered to be not important as long as it has values of, e.g., above approximately 40%, above approximately 45%, and/or above 50%. A low exemplary specificity can result in a higher amount of exemplary false positives, which can lead to, e.g., unnecessary requests for additional, more painful and more expensive tests such as mammograms or biopsies. Similarly, exemplary high sensitivity values can be preferable in accordance with certain exemplary embodiments of the present disclosure, such as, e.g., when used for in screening and the diagnosis of lung cancer among smokers and/or hereditary high risk patients.
(59) Therefore, the exemplary IR in accordance with certain exemplary embodiments of the present disclosure can provide an exemplary provision of adjusting the decision cutoff value such that the exemplary IR can be configured for various decision strategies. For example, instead of (or in conjunction with) using an exemplary zero discriminant value or exemplary equal probabilities (e.g., r.sub.c=0.5), a particular cutoff value which increases the sensitivity on account of specificity, or vice versa, can be used. The IR can, e.g., specify an exemplary ratio between exemplary costs of misclassification of controls (e.g., false positive rate (FPR)) and misclassification of cases (e.g., false negative rate (FNR)), which exemplary ratio can be expressed as, e.g.:
(60)
(61) Accordingly, an exemplary total cost of misclassification can be expressed as, e.g.:
C.sub.tot=C.sub.fprFPR+C.sub.fnrFNR=C.sub.fpr(1−S.sub.p)+C.sub.fnr(1−S.sub.n) (4)
where S.sub.p and S.sub.n can represent specificity and sensitivity, respectively.
(62) Since FPR, FNR, S.sub.p and S.sub.n can be functions of an exemplary decision threshold r.sub.c, an exemplary optimal r.sub.o for an exemplary specified γ can be obtained (and/or measured, received, determined, calculated, etc.) by maximizing the exemplary function:
(63)
(64) An exemplary IR system, method and/or procedure can, e.g., include an exemplary maximization procedure (e.g., exemplary Equation 6). Exemplary illustrations are provided in
(65) In particular,
(66)
(67) As shown in the example of
(68) Certain exemplary embodiments can provide an exemplary IR to test unlabeled observations, measurements, etc., in patients with an unknown health status. For example,
(69) The projected risk scores of e.g., test observations, measurements, etc., can be furnished with exemplary whiskers, e.g., 1306, 1406, 1506, 1606 and 1706, as shown in
{tilde over (x)}.sub.ij.sup.L={tilde over (x)}.sub.ij−δ.sub.ij;{tilde over (x)}.sub.ij.sup.U={tilde over (x)}.sub.ij−δ.sub.ij (7)
where {tilde over (x)}.sub.ij and δ.sub.ij can represent a median and a median absolute deviation (MAD) across (and/or substantially all and/or a majority of) the exemplary replicates for subject i and glycan j, respectively (e.g., the exemplary tilde located above x can denote an exemplary raw signal). Exemplary signals {tilde over (x)}.sub.ij, {tilde over (x)}.sub.ij.sup.L and {tilde over (x)}.sub.ij.sup.U can be subjected to an exemplary normalization procedure and/or an exemplary transformation procedure, and become, e.g., x.sub.ij, x.sub.ij.sup.L and x.sub.ij.sup.U, respectively. According to certain exemplary embodiments of the present disclosure, these exemplary signals can be considered in a context of m selected features, and each measurement can be represented as, e.g., an exemplary m-dimensional hypercube in m-dimensional feature space, where the vertices in dimension j can be, e.g., x.sub.ij.sup.L and x.sub.ij.sup.U, while the center of the hypercube can be, e.g., vector x.sub.i=(x.sub.i1, x.sub.i2, . . . , x.sub.im). A projection of this hypercube by an exemplary projection vector w, such as, exemplary regression coefficients w.sub.j=0, 1, . . . , m, can provide exemplary 2.sup.m projected values for each exemplary patient, z.sub.ik, k=1, 2, . . . , 2.sup.m.
(70) Exemplary lower and upper values for variation whiskers 1306, 1406, 1506, 1606, 1706 can be computed (and/or determined, calculated, processed, etc.) as, e.g.:
(71)
while an exemplary projection of an exemplary center of the hypercube can yield, e.g., value z.sub.i which can be used in exemplary Equation (1). Exemplary final risk scores r.sub.i.sup.L and r.sub.i.sup.U which can be shown as the variation whiskers 1306, 1406, 1506, 1606 and 1706 in the diagrams 1301-1701, respectively, can be obtained directly, through shifting and/or through sigmoid transformation of, e.g., z.sub.i.sup.L and z.sub.i.sup.U, depending on the type of exemplary IR. Such exemplary described procedure can be implemented as a part of the exemplary IR in accordance with certain exemplary embodiments of the present disclosure.
(72) As described herein above, for example,
(73)
(74)
(75)
(76)
(77) The exemplary embodiments of IR can have an additional exemplary provision such as, e.g., exemplary sensitivities of exemplary risk scores r.sub.1 with respect to exemplary binding intensities for particular glycan, x.sub.ij:
(78)
(79) From exemplary Equations 1 and 2, described herein above, it follows that:
S.sub.ij=w.sub.jr.sub.i(1−r.sub.i). (10)
(80) In accordance with certain exemplary embodiments of the present disclosure, a more practical form of sensitivity can be, e.g., relative sensitivity, which can show and/or provide, e.g., an exemplary relative change of an exemplary risk score caused by the relative change of an exemplary binding intensity, e.g.:
(81)
(82) For example, s.sub.ij=4 can mean that an approximately 1% increase of x.sub.ij can cause an approximately 4% increase in r.sub.i.
(83)
(84)
(85)
(86) As shown in
(87) Further, the exemplary processing arrangement 2010 can be provided with or include an input/output arrangement 2070, which can include, e.g., a wired network, a wireless network, the internet, an intranet, a data collection probe, a sensor, etc. As shown, the exemplary processing arrangement 2010 can be in communication with an exemplary display arrangement 2060, which, according to certain exemplary embodiments of the present disclosure, can be a touch-screen configured for inputting information to the processing arrangement in addition to outputting information from the processing arrangement, for example. Further, the exemplary display 2060 and/or a storage arrangement 2050 can be used to display and/or store data in a user-accessible format and/or user-readable format.
(88)
(89) Examples in this document are used solely to explain and illustrate the basic concepts of IR, not to demonstrate the strength of existing data and platform, for example. The examples described herein can be based on, e.g., exemplary mesothelioma assay with 65 asbestos exposed subjects (control sample) and 50 patients diagnosed with malignant mesothelioma (case sample). The data can be obtained, e.g., on the original platform and with original glycan library (e.g., with 211 glycans termed “PGA version 6”). The exemplary choice of selected feature size was thus chosen for the exemplary embodiments described herein to provide an exemplary cross-validation tests, instead of selecting larger glycan sets which could give larger training precisions, and can be, e.g., subject of overfitting.
(90) The foregoing merely illustrates the principles of the disclosure. Various modifications and alterations to the described embodiments will be apparent to those skilled in the art in view of the teachings herein. It will thus be appreciated that those skilled in the art will be able to devise numerous systems, arrangements, and methods which, although not explicitly shown or described herein, embody the principles of the disclosure and are thus within the spirit and scope of the disclosure. In addition, all publications and references referred to above are incorporated herein by reference in their entireties. It should be understood that the exemplary procedures described herein can be stored on any computer accessible medium, including a hard drive, RAM, ROM, removable disks, CD-ROM, memory sticks, etc., and executed by a processing arrangement which can be a microprocessor, mini, macro, mainframe, etc. In addition, to the extent that the prior art knowledge has not been explicitly incorporated by reference herein above, it is explicitly incorporated herein in its entirety. All publications referenced above are incorporated herein by reference in their entireties.