Detecting neoplasm

Abstract

This document relates to methods and materials for detecting premalignant and malignant neoplasms. For example, methods and materials for determining whether or not a stool sample from a mammal contains nucleic acid markers or polypeptide markers of a neoplasm are provided.

Claims

1. A system, comprising: (a) reagents for extracting genomic DNA from a biological sample of a human individual suspected of having precancer or colorectal cancer; (b) primers specific for K-ras, wherein the primers specific for K-ras are selected from SEQ ID NOs: 2, 3, 4, 5, 6, 7 and 8; (c) reagents sufficient for measuring the methylation level of one or more CpG sites in BMP3, wherein the reagents sufficient for measuring the methylation level of one or more CpG sites in BMP3 comprise reagents sufficient for conducting one or more of methylation-specific PCR, quantitative methylation-specific PCR, methylation-sensitive DNA restriction enzyme analysis, quantitative bisulfite pyrosequencing, and bisulfite genomic sequencing PCR, wherein the reagents sufficient for conducting one or more of methylation-specific PCR, quantitative methylation-specific PCR, methylation-sensitive DNA restriction enzyme analysis, quantitative bisulfite pyrosequencing, and bisulfite genomic sequencing PCR comprise BMP3 specific primers and a bisulfite reagent; and (d) a stool stability buffer.

2. The system of claim 1, wherein the biological sample is a stool sample.

3. The system of claim 1, further comprising reagents sufficient for measuring a K-ras mutation score.

4. The system of claim 3, wherein the reagents sufficient for measuring a K-ras mutation score comprises reagents sufficient for conducting a digital melt curve analysis, and/or a quantitative allele-specific PCR.

5. The system of claim 1, wherein the bisulfite reagent is sodium bisulfite.

6. The system of claim 1, further comprising reagents sufficient for converting unmethylated cytosine residues to uracil residues within said biological sample while leaving 5-methylcytosine residues unchanged.

7. The system of claim 1, wherein the stool stability buffer is a DNA stabilization buffer.

8. The system of claim 1, wherein the primers specific for K-ras are specific for a K-ras mutation at amino acid number 12.

Description

DESCRIPTION OF THE DRAWINGS

(1) FIG. 1: Adjusted Cut-off Levels with Quantitative Stool Markers to Achieve 95% Specificity across Age and Gender using the Q-LEAD Model. Solid line for women, dotted line for men.

(2) FIG. 2: Sensitive and specific detection of pancreatic cancer by fecal ratio of carboxypeptidase B: carboxypeptidase A2. Note that ratio in stools from patients with colorectal cancer is no different from ratios with healthy controls.

(3) FIG. 3: Elastase levels quantified in stools from patients with pancreatic cancer, patients with colorectal cancer, and healthy controls.

(4) FIG. 4: Ratio of elastase 3A: pancreatic alpha amylase differentiates patients with pancreatic cancer from patients with colorectal cancer and from healthy controls.

(5) FIG. 5: Digital Melt Curve to detect mutations by targeted gene scanning (temperature (x-axis) v, temperature-normalized fluorescence (y-axis)). Eight pairs of primers, which amplify 100-350 bp gene fragments, were used to scan K-ras and APC genes and detect mutations (substitution and deletion mutations, respectively) at 1% mutant/wild type ratio.

(6) FIG. 6: Quantitative detection of low abundance mutations by Digital Melt Curve Assay. Varying Mutant: Wild-type ratios of K-ras and APC gene mixtures were prepared and assayed blindly by Digital Melt Curve.

(7) FIG. 7: High analytical sensitivity by Digital Melt Curve (temperature (x-axis) v, temperature-normalized fluorescence (y-axis)) To test the detection limit of digital melt curve (DMC) assay, mutant copies were spiked in wild-type copies at 0.1, 0.5, 1, 5, and 10% dilutions. DMC could detect up to 0.1% mutant/wild-type level when 1000 copies were dispersed to one 96-well plate. The numbers of positive wells increased proportionally when spiked mutant copies were increased. A pair of primers that amplify 248 bp K-ras gene fragment were used as an example here. Primers that amplify 119 bp K-ras gene, 162 bp APC gene, and 346 bp APC genes were also used to test the detection limit and quantitative property of DMC.

(8) FIG. 8: Superior screen detection of colorectal precancerous polyps by Digital Melt Curve (DMC). Histogram compares sensitivity by DMC with that by common fecal occult blood tests (Hemoccult and HemoccultSENSA) and by the commercial stool DNA test (PreGenPlus, Exact Sciences). Detection by DMC was significantly better than by any other test (p<0.05).

(9) FIG. 9: Distributions of short fragment human DNA (short DNA) and long fragment human DNA (long DNA) in stools from patients with normal colonoscopy, large precancerous adenomas, and colorectal cancer Human DNA quantified by an assay of Alu repeats. Short DNA represents 45 bp fragment amplification products, and long DNA represents 245 bp amplification products.

(10) FIG. 10: Stool distributions of short and long DNA in patients with pancreatic cancer and in healthy controls. Short DNA represents 45 bp fragment amplification products, and long DNA represents 245 bp amplification products.

(11) FIG. 11: Methylated BMP3 gene in stool for detection of colorectal neoplasia. Methylated BMP3 was blindly quantified in stools from patients with colorectal cancers, precancerous adenomas, and normal individuals with real-time methylation-specific PCR. Each circle represents a stool sample.

(12) FIG. 12: Frequency of Specific Base Changes in Colorectal Tumors.

DETAILED DESCRIPTION

(13) This document provides methods and materials related to detecting a neoplasm in a mammal (e.g., a human). For example, this document provides methods and materials for using nucleic acid markers, polypeptide markers, and combinations of markers present in a biological sample (e.g., a stool sample) to detect a neoplasm m a mammal. Such a neoplasm can be a cancer or precancer in the head and neck, lungs and airways, esophagus, stomach, pancreas, bile ducts, small bowel, or colorectum. It will be appreciated that the methods and materials provided herein can be used to detect neoplasm markers in a mammal having a combination of different neoplasms. For example, the methods and materials provided herein can be used to detect nucleic acid and polypeptide markers in a human having lung and stomach neoplasms.

(14) In some cases, the methods and materials provided herein can be used to quantify multiple markers in biological samples (e.g., stool) to yield high sensitivity for detection of lesions (e.g., neoplasms), while preserving high specificity. Such methods can include, for example, a logistic model that adjusts specificity cut-offs based on age, gender, or other variables in a target population to be tested or screened.

(15) In some cases, the methods and materials provided herein can be used to determine whether a mammal (e.g., a human) has colorectal cancer or pancreatic cancer. For example, serotransferin, methylated BMP3, and mutant BRAF markers in stool can be used to identify a mammal as likely having colorectal cancer, while mutant p16, carboxypeptidase B/A, and elastase 2A markers can be used to identify a mammal as likely having pancreatic cancer.

(16) Any suitable method can be used to detect a nucleic acid marker in a mammalian stool sample. For example, such methods can involve isolating DNA from a stool sample, separating out one or more particular DNAs from the total DNA, subjecting the DNAs to bisulfite treatment, and determining whether the separated DNAs are abnormally methylated (e.g., hypermethylated or hypomethylated). In some cases, such methods can involve isolating DNA from a stool sample and determining the presence or absence of DNA having a particular size (e.g., short DNA). It is noted that a single stool sample can be analyzed for one nucleic acid marker or for multiple nucleic acid markers. For example, a stool sample can be analyzed using assays that detect a panel of different nucleic acid markers. In addition, multiple stool samples can be collected from a single mammal and analyzed as described herein.

(17) Nucleic acid can be isolated from a stool sample using, for example, a kit such as the QIAamp DNA Stool Mini Kit (Qiagen Inc., Valencia, Calif.). In addition, nucleic acid can be isolated from a stool sample using the following procedure: (1) homogenizing samples in an excess volume (>1:7 w:v) of a stool stability buffer (0.5M Tris pH 9.0, 150 mM EDTA, 10 mM NaCl) by shaking or mechanical mixing; (2) centrifuging a 10 gram stool equivalent of each sample to remove all particulate matter; (3) adding 1 μL of 100 μg/μL RNase A to the supernatant and incubating at 37° C. for 1 hour; (4) precipitating total nucleic acid with 1/10 volume 3M NaAc and an equal volume isopropanol; and (5) centrifuging and then resuspending the DNA pellet in TE (0.01 M Tris pH 7.4, 0.001 M EDTA). U.S. Pat. Nos. 5,670,325; 5,741,650, 5,928,870; 5,952,178, and 6,020,137 also describe various methods that can be used to prepare and analyze stool samples.

(18) One or more specific nucleic acid fragments can be purified from a nucleic acid preparation using, for example, a modified sequence-specific hybrid capture technique (see, e.g., Ahlquist et al. (2000) Gastroenterology, 119:1219-1227). Such a protocol can involve: (1) adding 300 μL of sample preparation to an equal volume of a 6 M guanidine isothiocyanate solution containing 20 μmol biotinylated oligonucleotides (obtained from, for example, Midland Certified Reagent Co., Midland, Tex.) with sequences specific for the DNA fragments to be analyzed; (2) incubating for two hours at 25° C.; (3) adding streptavidin coated magnetic beads to the solution and incubating for an additional hour at room temperature; (4) washing the bead/hybrid capture complexes four times with IX B+W buffer (I M NaCl, 0.01 M Tris-HCl pH 7.2, 0.001 M EDTA, 0.1% Tween 20); and (5) eluting the sequence specific captured DNA into 35 μL L-TE (1 mM Tris pH 7.4, 0.1 M EDTA) by heat denaturation of the bead/hybrid capture complexes. Any other suitable technique also can be used to isolate specific nucleic acid fragments.

(19) Nucleic acid can be subjected to bisulfite treatment to convert unmethylated cytosine residues to uracil residues, while leaving any 5-methylcytosine residues unchanged. A bisulfite reaction can be performed using, for example, standard techniques: (1) denaturing approximately 1 μg of genomic DNA (the amount of DNA can be less when using micro-dissected DNA specimens) for 15 minutes at 45° C. with 2 N NaOH; (2) incubating with 0.1 M hydroquinone and 3.6 M sodium bisulfite (pH 5.0) at 55° C. for 4-12 hours, (3) purifying the DNA from the reaction mixture using standard (e.g., commercially-available) DNA miniprep columns or other standard techniques for DNA purification; (4) resuspending the purified DNA sample in 55 μL water and adding 5 μl 3 N NaOH for a desulfonation reaction that typically is performed at 40° C. for 5-10 minutes; (5) precipitating the DNA sample with ethanol, washing the DNA, and resuspending the DNA in an appropriate volume of water. Bisulfite conversion of cytosine residues to uracil also can be achieved using other methods (e.g., the CpGenome™ DNA Modification Kit from Serologicals Corp., Norcross, Ga.).

(20) Any appropriate method can be used to determine whether a particular DNA is hypermethylated or hypomethylated. Standard PCR techniques, for example, can be used to determine which residues are methylated, since unmethylated cytosines converted to uracil are replaced by thymidine residues during PCR PCR reactions can contain, for example, 10 μL of captured DNA that either has or has not been treated with sodium bisulfite, IX PCR buffer, 0.2 mM dNTPs, 0.5 μM sequence specific primers (e.g., primers flanking a CpG island within the captured DNA), and 5 units DNA polymerase (e.g., Amplitaq DNA polymerase from PE Applied Biosystems, Norwalk, Conn.) in a total volume of 50 μl. A typical PCR protocol can include, for example, an initial denaturation step at 94° C. for 5 min, 40 amplification cycles consisting of 1 minute at 94° C. 1 minute at 60° C., and 1 minute at 72° C. and a final extension step at 72° C. for 5 minutes.

(21) To analyze which residues within a captured DNA are methylated, the sequences of PCR products corresponding to samples treated with and without sodium bisulfite can be compared. The sequence from the untreated DNA will reveal the positions of all cytosine residues within the PCR product. Cytosines that were methylated will be converted to thymidine residues in the sequence of the bisulfite-treated DNA, while residues that were not methylated will be unaffected by bisulfite treatment.

(22) Purified nucleic acid fragments from a stool sample or samples can be analyzed to determine the presence or absence of one or more somatic mutations. Mutations can be single base changes, short insertion/deletions, or combinations thereof. Methods of analysis can include conventional Sanger based sequencing, pyrosequencing, next generation sequencing, single molecule sequencing, and sequencing by synthesis. In some cases, mutational status can be determined by digital PCR followed by high resolution melting curve analysis. In other cases, allele specific primers or probes in conjunction with amplification methods can be used to detect specific mutations in stool DNA. The mutational signature can comprise not only the event of a base or sequence change in a specific gene, but also the location of the change within the gene, whether it is coding, non-coding, synonymous or non-synonymous, a transversion or transition, and the dinucleotide sequence upstream and downstream from the alteration.

(23) In some cases, a sample can be assessed for the presence or absence of a polypeptide marker. For example, any appropriate method can be used to assess a stool sample for a polypeptide marker indicative of a neoplasm. For example, a stool sample can be used in assays designed to detect one or more polypeptide markers. Appropriate methods such as those described elsewhere (Aebersold and Mann, Nature, 422-198-207 (2003) and McDonald and Yates, Dis. Markers, 18:99-105 (2002)) can be adapted or designed to detect polypeptides in a stool. For example, single-reaction monitoring using a TSQ mass spectrometer can specifically target polypeptides in a stool sample. High resolution instruments like the LTQ-FT or LTQ orbitrap can be used to detect polypeptides present in a stool sample.

(24) The term “increased level” as used herein with respect to the level of an elastase 3A polypeptide is any level that is above a median elastase 3A polypeptide level in a stool sample from a random population of mammals (e.g., a random population of 10, 20, 30, 40, 50, 100, or 500 mammals) that do not have an aero-digestive cancer. Elevated polypeptide levels of an elastase 3A polypeptide can be any level provided that the level is greater than a corresponding reference level. For example, an elevated level of an elastase 3A polypeptide can be 0.5, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more fold greater than the reference level of elastase 3A polypeptide in a normal sample. It is noted that a reference level can be any amount. For example, a reference level for an elastase 3A polypeptide can be zero. In some cases, an increased level of an elastase 3A polypeptide can be any detectable level of an elastase 3A polypeptide in a stool sample.

(25) The term “increased level” as used herein with respect to the level of an carboxypeptidase B polypeptide level is any level that is above a median carboxypeptidase B polypeptide level in a stool sample from a random population of mammals (e.g., a random population of 10, 20, 30, 40, 50, 100, or 500 mammals) that do not have an aero-digestive cancer. Elevated polypeptide levels of carboxypeptidase B polypeptide can be any level provided that the level is greater than a corresponding reference level. For example, an elevated level of carboxypeptidase B polypeptide can be 0.5, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more fold greater than the reference level carboxypeptidase B polypeptide observed in a normal stool sample. It is noted that a reference level can be any amount. For example, a reference level for a carboxypeptidase B polypeptide can be zero. In some cases, an increased level of a carboxypeptidase B polypeptide can be any detectable level of a carboxypeptidase B polypeptide m a stool sample.

(26) The term “increased level” as used herein with respect to the level of DNA fragments less than about 200 or less than about 70 base pairs in length is any level that is above a median level of DNA fragments less than about 200 or less than about 70 base pairs in length in a stool sample from a random population of mammals (e.g., a random population of 10, 20, 30, 40, 50, 100, or 500 mammals) that do not have an aero-digestive cancer. In some cases, an increased level of DNA fragments less than about 200 or less than about 70 base pairs in length can be any detectable level of DNA fragments less than about 200 or less than about 70 base pairs in length in a stool sample.

(27) The term “elevated methylation” as used herein with respect to the methylation status of a BMP3 or ALX nucleic acid is any methylation level that is above a median methylation level in a stool sample from a random population of mammals (e.g., a random population of 10, 20, 30, 40, 50, 100, or 500 mammals) that do not have an aero-digestive cancer. Elevated levels of BMP3 or ALX methylation can be any level provided that the level is greater than a corresponding reference level. For example, an elevated level of BMP3 or ALX methylation can be 0.5, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more fold greater than the reference level methylation observed in a normal stool sample. It is noted that a reference level can be any amount.

(28) The term “elevated mutation score” as used herein with respect to detected mutations in a matrix panel of particular nucleic acid markers is any mutation score that is above a median mutation score in a stool sample from a random population of mammals (e.g., a random population of 10, 20, 30, 40, 50, 100, or 500 mammals) that do not have an aero-digestive cancer. An elevated mutation score in a matrix panel of particular nucleic acid markers can be any score provided that the score is greater than a corresponding reference score. For example, an elevated score of K-ras or APC mutations can be 0.5, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more fold greater than the reference score of K-ras or APC mutations observed in a normal stool sample. It is noted that a reference score can be any amount.

(29) In some cases, a ratio of particular polypeptide markers can be determined and used to identify a mammal having an aero-digestive cancer (e.g., a colorectal cancer or a pancreatic cancer) For example, a ratio provided herein (e.g., the ratio of carboxypeptidase B polypeptide levels to carboxypeptidase A2 polypeptide levels) to can be used as described herein to identify a mammal having a particular neoplasm (e.g., pancreatic cancer).

(30) In some cases, a matrix marker panel can be used to identify mammals having an aero-digestive cancer (e.g., a colorectal cancer or a pancreatic cancer). In some cases, such panel also can identify the location of the aero-digestive cancer. Such a panel can include nucleic acid markers, polypeptide markers, and combinations thereof and can provide information about a mutated marker gene, the mutated region of the marker gene, and/or type of mutation. For example, data can be analyzed using a statistical model to predict tumor site (e.g., anatomical location or tissue of origin) based on inputs from sequencing data (such as by specific nucleic acid or combination of nucleic acids mutated, specific mutational location on a nucleic acid, and nature of mutation (e.g. insertion, deletion, transition, or transversion) or by any combination thereof) and/or data from polypeptide or other types of markers. For example, a Site of Tumor Estimate (SITE) model can be used to predict tumor site using a matrix panel of markers that are present to variable extent across tumors.

(31) In some cases, data can be analyzed using quantified markers to create a logistic model, which can have both high sensitivity and high specificity. For example, a logistic model can also incorporate population variables like gender and age to adjust cut-off levels for test positively and thereby optimize assay performance in a screening setting. In some cases, a Quantitative Logistic to Enhance Accurate Detection (Q-LEAD) Model can be used with any marker class or combination of markers as long as they can be quantified.

(32) This document also provides methods and materials to assist medical or research professionals in determining whether or not a mammal has an aero-digestive cancer. Medical professionals can be, for example, doctors, nurses, medical laboratory technologists, and pharmacists. Research professionals can be, for example, principle investigators, research technicians, postdoctoral trainees, and graduate students. A professional can be assisted by (1) determining the ratio of particular polypeptide markers in a stool sample, and (2) communicating information about the ratio to that professional, for example. In some cases, a professional can be assisted by (1) determining the level of human DNA, the methylation status of genes such as BMP3, and the mutation score of genes such as APC and K-ras, and (2) communicating information about the level of DNA, the methylation status of particular genes, and the mutation score of particular genes to the professional. In some cases, a professional can be assisted by (I) detecting mutations in cancer-related genes such as K-ras, p53, APC, p16, EGFR, CTNNB1, BRAF, and SMAD4, as a matrix marker panel, and (2) communicating information regarding the mutations to the professional.

(33) After the ratio of particular polypeptide markers, or presence of particular nucleic acid markers in a stool sample is reported, a medical professional can take one or more actions that can affect patient care. For example, a medical professional can record the results in a patient's medical record. In some cases, a medical professional can record a diagnosis of an aero-digestive cancer, or otherwise transform the patient's medical record, to reflect the patient's medical condition. In some cases, a medical professional can review and evaluate a patient's entire medical record, and assess multiple treatment strategies, for clinical intervention of a patient's condition. In some cases, a medical professional can record a tumor site prediction with the reported mutations. In some cases, a medical professional can request a determination of the ratio of particular polypeptide markers to predict tumor site. In some cases, a medical professional can review and evaluate a patient's entire medical record and assess multiple treatment strategies, for clinical intervention of a patient's condition.

(34) A medical professional can initiate or modify treatment of an aero-digestive cancer after receiving information regarding a ratio of particular polypeptide markers or the presence of nucleic acid markers in a patient's stool sample. In some cases, a medical professional can compare previous reports and the recently communicated ratio of particular polypeptide markers, or presence of nucleic acid markers, and recommend a change in therapy. In some cases, a medical professional can enroll a patient in a clinical trial for novel therapeutic intervention of an aero-digestive cancer. In some cases, a medical professional can elect waiting to begin therapy until the patient's symptoms require clinical intervention.

(35) A medical professional can communicate the ratio of particular polypeptide markers to a patient or a patient's family. In some cases, a medical professional can provide a patient and/or a patient's family with information regarding aero-digestive cancers, including treatment options, prognosis, and referrals to specialists, e.g., oncologists and/or radiologists. In some cases, a medical professional can provide a copy of a patient's medical records to communicate the ratio of particular polypeptide markers to a specialist.

(36) A research professional can apply information regarding a subject's ratio of particular polypeptide markers to advance aero-digestive cancer research. For example, a researcher can compile data on the ratio of particular polypeptide markers, and/or presence of particular nucleic acid markers, with information regarding the efficacy of a drug for treatment of aero-digestive cancer to identify an effective treatment. In some cases, a research professional can obtain a subject's ratio of particular polypeptide markers, and/or determine the presence of particular nucleic acid markers to evaluate a subject's enrollment, or continued participation in a research study or clinical trial. In some cases, a research professional can classify the severity of a subject's condition, based on the ratio of particular polypeptide markers and/or the levels of particular nucleic acid markers. In some cases, a research professional can communicate a subject's ratio of particular polypeptide markers, and/or the presence of particular nucleic acid markers to a medical professional. In some cases, a research professional can refer a subject to a medical professional for clinical assessment of an aero-digestive cancer, and treatment of an aero-digestive cancer.

(37) Any appropriate method can be used to communicate information to another person (e.g., a professional). For example, information can be given directly or indirectly to a professional. For example, a laboratory technician can input the ratio of particular polypeptide markers and/or particular nucleic acid markers into a computer-based record. In some cases, information is communicated by making a physical alteration to medical or research records. For example, a medical professional can make a permanent notation or flag a medical record for communicating a diagnosis to other medical professionals reviewing the record. In addition, any type of communication can be used to communicate the information. For example, mail, e-mail, telephone, and face-to-face interactions can be used. The information also can be communicated to a professional by making that information electronically available to the professional. For example, the information can be communicated to a professional by placing the information on a computer database such that the professional can access the information. In addition, the information can be communicated to a hospital, clinic, or research facility serving as an agent for the professional.

(38) The invention will be further described in the following examples, which do not limit the scope of the invention described in the claims.

EXAMPLES

Example 1—Multi-Marker Quantitation and a Q-LEAD Model

(39) Most approaches at marker detection in stool have been qualitative. When such qualitative approaches are applied to assay of multiple markers (targeting multiple markers is required with neoplasm detection due molecular heterogeneity), sensitivity is achieved at the expense of compounded non-specificity. Non-specificity can lead to prohibitive programmatic cost with population screening due to the expensive and unnecessary evaluations of false-positive tests. However, if markers are quantified, then a logistic model can be created to achieve both high sensitivity and high specificity. Such a logistic model can also incorporate population variables like gender and age to adjust cut-off levels for test positively and thereby optimize assay performance in a screening setting (FIG. 1). This Quantitative Logistic to Enhance Accurate Detection (Q-LEAD) Model can be used with any marker class or combination of markers as long as they can be quantified.

(40) A combination of more than one marker was undertaken to achieve the desired sensitivity and specificity for cancer detection. Binary regression methods predicting disease as a function of diagnostic tests estimate the optimal combination of the tests for classifying a subject as diseased or not. McIntosh and Pepe, Biometrics 58: 657-664 (2002). A logistic regression model can assess the relationship between a binary dependent response variable such as presence or absence of disease and one or more independent predictor variables. The independent predictors may be qualitative (e.g., binary) or quantitative (e.g., a continuous endpoint). In the Q-LEAD model, the independent predictors can include such biological markers as K-ras, BMP3 and DNA concentration, and others. Importantly, the model incorporates the demographic variables of gender and age, as we have observed that both age and gender influence molecular marker levels in stool. As average stool marker levels increase with age and male gender, failure to adjust for these variables would yield suboptimal specificity in men and elderly persons tested. Coefficients are estimated from the sample data for each term in the model. The result of the model is a risk score for each subject. Cutoffs for predicting disease state from this risk score can be determined in order to maximize sensitivity and specificity of the marker combinations for predicting disease as desired. The inclusion of demographic variables allows these cutoffs to be determined as a function of age and gender.

(41) As an application of the Q-LEAD Model, the following was performed to evaluate a quantitative stool DNA assay approach targeting three informative markers for the detection of colorectal neoplasia. Subjects included 34 with colorectal cancer, 20 with adenomas >1 cm, and 26 with normal colonoscopy. Subjects added a DNA stabilization buffer with stool collection, and stools were frozen at −80° C. within 72 hours From thawed stool aliquots, crude DNA was extracted by standard methods, and target genes were enriched by sequence capture. K-ras mutation score, methylation of BMP3 gene, and concentration of human DNA (245 bp length) were respectively quantified by a digital melt curve assay, real-time methylation-specific PCR, and real-time Alu PCR, respectively. Assays were performed blinded. A logistic model, which incorporates three markers and gender, was constructed to analyze discrimination by combined markers.

(42) Age medians were 60 for patients with colorectal cancer, 66 for those with adenomas, and 61 for normal controls; and male/female distributions were 23/11, 9/11, and 10/16, respectively. Detection rates of colorectal neoplasms were determined by individual quantitative markers at specificity cut-offs of 96 percent and by combined markers (Table 1). Discrimination by combined markers was calculated using a qualitative binomial method (each marker considered as positive or negative based on individual 96 percent specificity) and by the Q-LEAD model (sensitivity data shown at overall specificity of 96 percent).

(43) TABLE-US-00001 TABLE 1 Specificity and sensitivity of cancer markers. Sensitivity Specificity Cancers Adenomas Both Individual Markers k-ras mutation 96% 42% 32% 38% BMP3 methylation 96% 45% 32% 40% DNA concentration 96% 65% 40% 56% Combined Markers Qualitative method 88% 90% 58% 78% Q-LEAD Model 96% 90% 47% 76%

(44) By quantitative assay and multivariable analysis of an informative marker panel, stool DNA testing can achieve high sensitivity while preserving high specificity for detection of colorectal neoplasia. The particular three-marker combination of mutant K-ras, BMP3 methylation, and human DNA concentration represents a complementary, high-yield panel.

(45) The above data set and additional data were analyzed as follows. A quantitative stool DNA assay approach targeting four informative markers for use in the detection of colorectal cancer and advanced adenoma was evaluated. Subjects comprised 74 patients with colorectal cancer, 27 with an adenoma >1 cm, and 100 with normal colonoscopy. Stools were collected with a stabilization buffer before or >1 week after colonoscopy and were frozen at −80° C. within 24 hours of collection. From thawed stool aliquots, crude DNA was extracted as described above, and target genes were enriched by sequence-specific capture. Human DNA concentration. K-ras and APC mutation scores, and BMP3 methylation were sensitively quantified by real-time Alu PCR, a digital melt curve assay (Zou et al., Gastroenterology, “High Detection Rates of Colorectal Neoplasia by Stool DNA Testing With a Novel Digital Melt Curve Assay,” (2008)), and real-time methylation-specific PCR, respectively. Assays were performed blindly. Sensitivities and specificities of single makers and their combinations were analyzed.

(46) Age medians were 61 for patients with colorectal cancer, 67 for those with adenomas, and 59 for normal controls; and, male/female ratios were 52/22, 15/12, and 37/63, respectively. The table displays detection rates of colorectal neoplasms by individual quantitative markers at specificities of 90% and by combined markers at two specificities (Table 2) Data in this table represent a training set and have not been adjusted for age and gender. Yet, it is clear that the full panel of Alu, K-ras, APC, and BMP3 detected more neoplasms than any individual marker, p<0.05. At 90% specificity, the full panel detects more adenomas >3 cm (90%, 9/10) than <3 cm (47%, 8/17), p<0.05, and more colorectal cancers at stages III-IV (89%, 40/45) than at stages 1-II (69%, 20/29) p<0.05. Neoplasm detection rates were not affected by tumor location.

(47) TABLE-US-00002 TABLE 2 Specificity and sensitivity of a four marker panel Sensitivity Colorectal cancers Adenomas Specificity I-II III-IV ≤3 cm >3 cm Individual Markers APC mutation 90% 38% 40% 47% 50% k-ras mutation 90% 46% 42% 24% 50% BMP3 methylation 90% 36% 38% 12% 30% DNA concentration 90% 52% 76% 41% 50% Combined Markers 90% 69% 89% 47% 90%

(48) In conclusion, a quantitative stool DNA assay system that incorporates a stabilization buffer with specimen collection, high analytical sensitivity, and a panel of broadly informative markers can achieve high detection rates of both colorectal cancers and advanced adenoma.

Example 2—SITE Model and Matrix Marker Panel

(49) A statistical model (Site of Tumor Estimate (SITE)) can be used to predict tumor site (e.g., anatomical location or tissue of origin) based on inputs from sequencing data (such as by specific nucleic acid or combination of nucleic acids mutated, specific mutational location on a nucleic acid, and nature of mutation (e.g. insertion, deletion, transition, or transversion) or by any combination thereof) and/or data from polypeptide or other types of markers.

(50) A matrix marker panel was developed to include eight cancer-related genes: K-ras, p53, APC, p16, EGFR, CTNNB1, BRAF, and SMAD4. The mutation frequencies of these genes were tabulated against the six major aero-digestive cancers based on literature or public database reviews and on actual sequencing observations (Table 3). Literature frequencies were derived from the COSMIC somatic mutation database, review articles, and texts.

(51) TABLE-US-00003 TABLE 3 Matrix Panel of Markers by Tumor Site AD Cancer Site Literature p16 p53 K-ras APC SMAD4 EGFR CTNNB1 BRAF Colorectal <5% 50-75% 40% 85% 14% NA 13% 20% Pancreatic 85-100 50-60% 80-90% 10-40% 30% NA 3-8% <5% Lung 15-25% 25-75% 20-40% 5% 7% 30% 6% <5% Bile Duct 15-60% 30-60% 40% 30-40% 17% NA 1% 14% Gastric 5-30% 20-50% 10% 20-60% NA <1% 30-50% <5% Esophageal 5-90% 40-90% 5-12% 5-60% NA NA 1% <5% Actual (non-dbSNP) N Total Unique Colorectal 57 5% 47% 26% 75% 25% 12% 2% 30% 98% Pancreatic 24 29% 17% 62% 54% 8% 8% 4% 8% 83% Lung 56 9% 57% 9% 16% 14% 14% 2% 4% 77% Bile Duct 15 13% 27% 13% 20% 13% 0 0 7% 67% Gastric 23 17% 22% 4% 35% 17% 4% 4% 0 65% Esophageal 24 4% 46% 4% 33% 17% 4% 0 4% 79%

(52) Some of the frequencies include other genetic alterations than simply single base changes and small insertions/deletions such as methylation events, large homozygous deletions, and copy number changes. Such alterations would not be reflected in the actual frequency table. Actual frequencies were derived by sequencing coding and flanking gene regions from 245 patient tissue samples reflecting the spectrum of aero-digestive cancers. Only non-synonymous and splice site alterations were tabulated. When specific mutational hot-spot sites were able to be identified for particular genes, only those sites were analyzed.

(53) The matrix panel includes markers that are present to variable extent across these tumors so that their aggregate use achieves high overall sensitivity and allows prediction of tumor site using the SITE Model. 70% of tumors harbored one or more mutations from the eight gene panel. Some gene mutations, like those associated with p16, are common in tumors above the colon but are rare for those in the colon. Mutant K-ras is frequent with colorectal and pancreatic cancers but infrequent in the other cancers. Mutations in EGFR clustered with lung and colorectal tumors and mutations in SMA4 clustered with stomach and colorectal tumors. Genes such as p53, are commonly mutated across many different types of cancers, but specific mutational locations or types of mutations within p53 and other genes differ between tumor site (e.g., Greenman et al, Nature, 446(7132)-153-8 (2007); Soussi and Lozano, Biochem. Biophys. Res Commun., 331(3):834-42 (2005); Stephens et al., Nat. Genet., 37(6):590-2 (2005); Sjoblom et al., Science, 314(5797):268-74 (2006); Wood et al., Science, 318(5853):1108-13 (2007); and Davies, Cancer Res, 65(17):7591-5(2005)) and can be factored in to the SITE Model to predict tumor site. Single base substitutions were the most common type of mutation throughout the panel and those that predicted colorectal tumors included C-G and A-T transversions (FIG. 12). Other tumor sites had similarly unique base change profiles (Table 4). Insertion/deletions mutations were most common with colorectal tumors, particularly adenomas.

(54) TABLE-US-00004 TABLE 4 Specific Base Change Fractions in AD Tumors C > T T > C G > C A > T G > T T > G Tumor (G > A) (A > G) (C > G) (T > A) (C > A) (A > C) Head and Neck 0.38 0.12 0.38 0.12 Esophageal 0.8 0.07 0.13 Lung 0.3 0.11 0.02 0.13 0.34 0.09 Stomach 0.5 0.25 0.17 0.08 Pancreas 0.41 0.15 0.04 0.07 0.33 Bile Duct 0.5 0.12 0.25 0.12 CRA 0.34 0.05 0.11 0.11 0.34 0.03 CRC 0.4 0.05 0.02 0.24 0.26 0.03

(55) Polypeptide markers found in stool, such as by proteomic approaches, can also be used to detect aero-digestive neoplasms and predict tumor site. The following was performed to identify and explore candidate polypeptide markers in stool for the discriminate detection of pancreatic cancer. Subjects included 16 cases with pancreatic cancer, 10 disease controls (colorectal cancer), and 24 healthy controls. Whole stools were collected and frozen promptly in aliquots at −80° C. Thawed aliquots were centrifuged, and the aqueous supernatant from each was analyzed. Polypeptides were separated by 1-D electrophoresis, excised from gels, and digested for mass spectrometric analysis using an LTQ-Orbitrap. Data outputs were searched using Mascot, Sequest, and X! Tandem programs against an updated

(56) Swissprot database that included all cataloged species. Unique peptide counts and ratio calculations were performed using Scaffold software.

(57) Median age for pancreatic cancer cases was 67, for colorectal cancer controls 63, and for healthy controls 62; and male/female distributions were 9/7, 6/4, and 9/15, respectively. Using shotgun-proteomic techniques on stools, two pancreatic enzymes (carboxypeptidases B and A2) were conspicuous, as unique spectral counts of the former were commonly elevated with pancreatic cancer and of the latter commonly decreased. Considered together as the ratio of carboxypeptidase B/carboxypeptidase A2, pancreatic cancer cases were almost completely separated from colorectal cancer and healthy control groups. Median ratios were 0.9, 0.2, and 0.3, respectively. At a specificity cut-off for the carboxypeptidase B/A2 ratio at 100% (i.e., ratios from normal control and colorectal cancer stools all below cut-off), sensitivity for pancreatic cancer was 86 percent (FIG. 2). Only two pancreatic cancers were misclassified.

(58) These results demonstrate that a stool assay of polypeptide markers can be a feasible non-invasive approach to the detection pancreatic cancer. These results also demonstrate that multivariable analysis of specific polypeptide ratios can be used.

(59) In addition, polypeptide markers unique to colorectal neoplasms were identified (Table 5). For example, serotransferrin was found in stools from patients with colorectal cancer but not in those with pancreatic cancer. These markers when considered as part of a matrix panel contribute both to overall sensitivity for tumor detection and help discriminate colorectal from pancreatic cancer.

(60) TABLE-US-00005 TABLE 5 Positive Stool Findings. Carboxypeptidase B/A2* Serotransferrin Colorectal Cancer 0 60% Pancreatic Cancer 86% 0 Normal controls 0 0 *ratio >0.75 considered positive

(61) Another polypeptide in stool that is pancreatic cancer specific is elastase 3A. Methods and Results demonstrating this are as follows:

(62) Stool Preparation

(63) Samples were collected in phosphate buffered saline and either dropped off in clinic or mailed in collection tub. Samples were homogenized and frozen within 72 hours after receipt. Frozen stools were diluted 1:3 w:v in PBS (Roche, Cat #1666789). Diluted stools were stomached in a filter bag (Brinkman, BA6041/STR 177×305 mm) for 60 seconds on control setting and spun at 10,000 rpms for 30 minutes. Following an additional 10 minute spin at 14,000 rpm, the supernatant was filtered through a 0.45-μm syringe filter and analyzed. Total protein present in stool was quantitated using a Bradford Protein Assay kit (Pierce).

(64) 1-Dimensional Electrophoresis

(65) Stool supernatants were diluted 1:1 in Leammli-BME buffer and run on a 10.5-14% gradient gel. Vertical slices were cut from 250 kDa to 15 kDa and in-gel digested using methods described elsewhere (e.g., Wilm et al., Nature, 379:466-469 (1996)). Bands were destained, dehydrated, digested in trypsin, extracted, and lyophilized for MS analysis.

(66) Mass Spectrometry

(67) Lyophilized samples were reconstituted and injected with a flow of 500 nL/min and a 75 minute gradient from 5-90% 98% acetonitrile. MS was performed in data dependent mode to switch automatically between MS and MS.sup.2 acquisition on the three most abundant ions. Survey scans were acquired with resolution r=60,000 at 40 m/z using FWHM with a target accumulation of 10.sup.6 counts. An isolation width of 2.5 m/z was applied. Exclusion mass width was 0.6 m/z on low end and 1.5 m/x on high end. All acquisition and method development was performed using Xcaliber version 2.0.

(68) Database Searching all ms/ms

(69) Samples were analyzed using Mascot (Matrix Science, London, UK; version 2.1.03), Sequest (ThermoFinnigan, San Jose, Calif.; version 27, rev. 12) and X! Tandem (World Wide Web at “thegpm.org”; version 2006.09.15.3). Mascot and X! Tandem were searched with a fragment ion mass tolerance of 0.80 Da and a parent ion tolerance of 10.0 PPM. Sequest was searched with a fragment ion mass tolerance of 1.00 Da. Nitration of tyrosine was specified in Mascot as a variable modification.

(70) Criteria for Polypeptide Identification

(71) Scaffold (version Scaffold-01_06_06, Proteome Software Inc., Portland, Oreg.) was used to validate MS/MS based polypeptide identifications. Peptide identifications were accepted if they could be established at greater than 95.0% probability as specified by the Peptide Prophet algorithm (Keller et al, Anal. Chem., 74(20):5383-92 (2002)). Polypeptide identifications were accepted if they could be established at greater than 99.0 percent probability and contained at least two identified peptides. Polypeptide probabilities were assigned by the Protein Prophet algorithm (Nesvizhskii, Anal. Chem., 75(17):4646-58 (2003)). Polypeptides that contained similar peptides and could not be differentiated based on MS/MS analysis alone were grouped to satisfy the principles of parsimony.

(72) Specific to Elastase S4 Ratio Determination

(73) Ratios of elastase 3A were determined using spectral counts for each polypeptide Ratios were determined by dividing the number of unique peptides of elastase 3A (determined using a composite ID from database search modules Mascot, XTandem, and Seaquest and compiled in Scaffold) by the number of unique peptides from another polypeptide such as pancreatic alpha-amylase

(74) Results

(75) The concentration of a specific pancreatic enzyme, elastase 3A, was consistently found to be elevated in the fecal supernatant of patients with pancreatic cancer as compared to normal controls or patients with non-pancreatic cancer (FIG. 3). These finding indicate that fecal concentration of elastase 3A is an accurate marker for pancreatic cancer. In addition, the ratio of elastase 3A against other pancreatic enzymes (or other stable fecal polypeptides) was found to be especially discriminant for pancreatic cancer and obviates the need to determine absolute elastase 3A concentrations (FIG. 4). While mass spectrometry was used to make these observations, elastase 3A levels and ratios including elastase 3A can be measured using other methods as well.

Example 3—Digital Melt Curve Assay for Scanning Mutations

(76) A sensitive, rapid, and affordable method for scanning mutations in bodily fluids at high-throughput was developed A melt curve assay is a post-PCR technique that can be used to scan for mutations in PCR amplicons. Mutations in PCR products can be detected by changes in the shape of the melting curve (heterozygote from mutant sample) compared to a reference sample (homozygote from wild-type sample) (FIG. 5). Melt curve assay can scan all mutations in a DNA fragment <400 bp in less than 10 minutes, rather than individually targeting single mutations. Regular melt curve assays can detect mutations down to a limit of 5% mutant:wild-type and, thus, are not sensitive enough to detect mutations in many biological samples. For instance, in stool, an analytical sensitivity of 1% or less is required in order to detect precancerous polyps or small early stage cancers. Importantly, a quantitative score can be given to density of target mutations (FIG. 6).

(77) Digital PCR can augment the sensitivity of PCR to detect low abundance mutations. Gene copies can be diluted and distributed into 96 wells of a plate to increase the percentage of mutant copy to wild-type copies in certain wells. For example, if a stool DNA sample contains only 1% of mutant BRAF copies compared to wild-type copies, distributing 300 copies of BRAF gene into a 96-well plate can lead to three wells with an average mutant ratio of 33 percent (1:3) After PCR amplification, these three wells with mutant copies can be detected by sequencing or other approaches. Since digital PCR requires PCR on a whole 96-well plate and 96 sequencings (or other approaches) for each target, it can be slow and costly.

(78) The concept of digital melt curve assay is to combine the scanning ability and speed of high resolution melt curve assay with the sensitivity of digital PCR. Miniaturizing and automating this technology dramatically lowers per assay cost and achieves high-throughput necessary for population screening.

(79) The following procedure was used to perform a digital melt curve assay. To prepare a DNA sample, gene target fragments (e.g., BRAF, K-ras, APC, p16, etc) were captured from stool DNA using a sequence-specific capture method and were quantified with real-time PCR. About 200 to 2000 gene copies were mixed in tube with all PCR reagents. An average of 2 to 20 copies (variable) were distributed to each well of a 96-well plate. PCR amplification was performed using specific primers on the plate (e.g., one target per plate). Final concentrations of PCR mastermix for Digital Melt Curve assays in a 96-well plate (500 μL dispersed to 96 wells with each well containing 5 μL) were as follows' 2×pfx amplification buffer (Invitrogen), 0.3 mM each dNTP, 200 nM forward primer, 200 nM reverse primer, 1 mM MgSO.sub.4, 0.02 unit/μL Platinum® pfx polymerase (Invitrogen), and 0.1 unit/μL LcGreen+ dye (Idaho Tech). A high resolution melt curve assay was used to identify the wells with mutant copies Sequencing was optionally performed to confirm 1 to 2 representative wells.

(80) In some cases, emulsion PCR can be used in place of digital PCR. In such cases, each lipid drop can become a tiny PCR reactor of one single molecule of gene.

(81) TABLE-US-00006 TABLE 6 Sequence Specific Capture Probes and DNA Primer Capture SEQ Target Probe/ ID Gene Region Primer Oligo Sequence (5′.fwdarw.3′) No. KRAS Condons Probe GTGGACGAATATGATCCAACAATAGA 1 12/13 GGTAAATCTTG Condons Primer AGGCCTGCTGAAAATGACTG 2 12/13 1 TTGTTGGATCATATTCGTCCAC 3 Condons Primer TAAGGCCTGCTGAAAATGAC 4 12/13 2 ATCAAAGAATGGTCCTGCAC 5 Condons Primer CGTCTGCAGTCAACTGGAATTT 6 12/13 3 TGTATCGTCAAGGCACTCTTGC 7 Condons Primer CTTAAGCGTCGATGGAGGAG 8 12/13 4 TTGTTGGATCATATTCGTCCAC 3 BRAF V600E Probe CCAGACAACTGTTCAAACTGATGGGA 9 CCCACTCCATC V600E Primer CCACAAAATGGATCCAGACA 10 TGCTTGCTCTGATAGGAAAATG 11 APC MCR Probe CAGATAGCCCTGGACAAACCATGCCA 12 1 CCAAGCAGAAG MCR Probe TTCCAGCAGTGTCACAGCACCCTAGA 13 2 ACCAAATCCAG MCR Probe ATGACAATGGGAATGAAACAGAATCA 14 3 GAGCAGCCTAAAG Condons Primer TTCATTATCATCTTTGTCATCAGC 15 1286- 1 1346 CGCTCCTGAAGAAAATTCAA 16 Condons Primer TGCAGGGTTCTAGTTTATCTTCA 17 1346- 2 1367 CTGGCAATCGAACGACTCTC 18 Condons Primer CAGGAGACCCCACTCATGTT 19 1394- 3 1480 TGGCAAAATGTAATAAAGTATCAGC 20 Condons Primer CATGCCACCAAGCAGAAGTA 21 1450- 4 1489 CACTCAGGCTGGATGAACAA 22 Condon Primer GAGCCTCGATGAGCCATTTA 23 1554 5 TCAATATCATCATCATCTGAATCATC 24 102457 Primer GTGAACCATGCAGTGGAATG 25 delC 6 ACTTCTCGCTTGGTTTGAGC 26 102457 Primer CAGGAGACCCCACTCATGTT 19 delC 7 CATGGTTTGTCCAGGGCTAT 27 102457 Primer GTGAACCATGCAGTGGAATG 25 delC 8 AGCATCTGGAAGAACCTGGA 28 TP53 Exon 4 Probe AAGACCCAGGTCCAGATGAAGCTCCC 29 AGAATGCCAGA Exon 4 Primer CCCTTCCCAGAAAACCTACC 30 GCCAGGCATTGAAGTCTCAT 31 Exon 5 Probe CATGGCCATCTACAAGCAGTCACAGC 32 ACATGACGGAG Exon 5 Primer CACTTGTGCCCTGACTTTCA 33 AACCAGCCCTGTCGTCTCT 34 Exon 6 Probe AGTGGAAGGAAATTTGCGTGTGGAGT 35 ATTTGGATGAC Exon 6 Primer CAGGCCTCTGATTCCTCACT 36 CTTAACCCCTCCTCCCAGAG 37 Exon 7 Probe ATGTGTAACAGTTCCTGCATGGGCGG 38 CATGAACCGGA Exon 7 Primer CTTGGGCCTGTGTTATCTCC 39 GGGTCAGAGGCAAGCAGA 40 Exon 8 Probe CGCACAGAGGAAGAGAATCTCCGCAA 41 GAAAGGGGAGC Exon 8 Primer GGGAGTAGATGGAGCCTGGT 42 GCTTCTTGTCCTGCTTGCTT 43

Example 4—Sensitive Detection of Mutations Using a Digital Melt Curve Assay

(82) The following was performed to develop a quantitative method for scanning gene mutations and to evaluate the sensitivity of the quantitative method for detecting target mutations in stool. A digital melt curve assay was designed by combining digital PCR to a modified melt curve assay. Target genes in low concentration were PCR amplified with a saturated DNA dye. LcGreen+, in a 96-well plate. Each well contained a small number of gene copies, which allowed high mutation/wild-type ratios in some wells that were then detected by melt curve scanning using a LightScanner. Mutations were scored based on the number of wells containing mutant copies in a 96-well plate. To test sensitivity, mutant genes were spiked into a wild-type pool at 0.1, 0.5, 1, 5, and 10% dilutions, and analyzed using digital melt curve assay with 250-1000 gene copies per 96-well plate. This method was then applied in the stool detection of APC, p53, K-ras, and BRAF mutations from 48 patients known to have mutations in one of these genes in matched tumor tissue. Subjects included 9 patients with pancreatic cancer, 31 with colorectal cancer, and 8 with colorectal adenoma >1 cm. All mutations detected by digital melt curve were further confirmed by Sanger sequencing.

(83) The digital melt curve assay detected as few as 0.1% mutant copies for amplicons <350 bp using one 96-well plate (FIG. 7), compared to the detection limit for regular melt curve of ≥5 percent. Each mutation scanning took 8-10 minutes with this manual approach. Mutations of APC, p53, K-ras, and BRAF genes were all successfully scanned with digital melt curve in quantitative fashion. Tissue-confirmed mutations were detected from matched stools in 88 percent (42/48) of patients with gastrointestinal neoplasms, including 89 percent with pancreatic cancer, 90 percent with colorectal cancer, and 75 percent with colorectal adenoma >1 cm.

(84) These results demonstrate that a digital melt curve assay can be a highly sensitive approach for detecting mutations in stool, and that it has potential for diagnostic application with both upper and lower gastrointestinal neoplasms.

Example 5—Using a Digital Melt Curve Assay to Detect Adenomas

(85) Archived stools were used to evaluate a digital melt curve assay of DNA markers for detection of advanced adenomas and to compare the accuracy of the digital melt curve assay with that of occult blood testing and a commercial DNA marker assay method (EXACT Sciences). Average risk subjects collected stools without a preservative buffer and mailed them to central processing laboratories for banking and blinded stool testing by Hemoccult, HemoccultSENSA, and DNA marker assay. All subjects underwent a colonoscopy, and tissue from advanced adenomas was archived. Archival stools were selected from the 27 patients with a colorectal adenoma >1 cm found to harbor mutant K-ras on tissue analyses and from the first 25 age and gender matched subjects with normal colonoscopy. Standard methods were used to extract crude DNA from fecal aliquots, and K-ras gene was enriched by sequence capture. Mutations in the K-ras gene were quantified by a digital melt curve assay based on the number of wells containing mutant gene copies in a 96-well plate and confirmed by sequencing.

(86) Median age with adenomas was 67 and controls 71; and males/females were 12/15 and 13/14, respectively. Median adenoma size was 1.5 cm (range 1-3 cm) Based on a cut-off of >3 wells with mutant K-ras, the digital melt curve assay yielded an overall sensitivity of 59 percent for adenomas with a specificity of 92 percent; sensitivity for adenomas >2 cm was 80 percent (8/10) and for those <2 cm was 47 percent (8/17), p=0.1. In these same stools, overall adenoma detection rates were 7 percent by Hemoccult, 15 percent by HemoccultSENSA, and 26 percent by the EXACT Sciences K-ras assay (p<0.05 for each vs. digital melt curve) (FIG. 8). Respective specificities were 92 percent, 92 percent, and 100 percent.

(87) These results demonstrate that an analytically-sensitive digital melt curve assay method can be used to detect a majority of advanced colorectal adenomas and improve yield over current stool test approaches.

Example 6—Short DNA as a Cancer Marker

(88) Free human DNA is present in all human stools and arises from cells shed (exfoliated) from the normal surface (mucosa) of the aero-digestive tract (mouth/throat, lungs, and all digestive organs) and from tumors or other lesions that may be present. It has been generally accepted that “long DNA” in stool reflects that presence of colorectal and other aero-digestive tumors, in that cells exfoliated from cancers do not undergo typical cell death (apoptosis) which would shorten DNA. Specifically, because DNA from apoptotic cells would be broken down to fragment lengths shorter than 100 bp, long DNA was defined as being longer than 100 bp. Indeed, levels of long DNA were elevated in stools from patients with colorectal and other cancers as compared to those from healthy controls (Zou et al., Cancer Epidiemiol. Biomarkers Prev., 15′ 115 (2006); Ahlquist et al., Gastroenterology, 119:1219 (2000); and Boynton et al. Clin. Chem., 49:1058 (2003)). As such, long DNA in stool can serve as a marker for colorectal and other tumors.

(89) “Short DNA” (i.e., <100 bp in length), however, was found to be as or more discriminant than long DNA as a tumor marker in stool for detection of both colorectal (FIG. 9) and pancreatic (FIG. 10) neoplasia.

(90) Briefly, methods and materials similar to those described elsewhere were used to detect short DNA present in stool samples (Zou et al., Cancer Epidemiol. Bionmarkers Prev., 15(6): 1115 (2006)). Total DNA was extracted by isopropanol precipitation from 19 blinded stool samples: 9 pancreatic adenocarcinoma, and 10 age/gender matched normals. The DNA pellets were taken up in 8 mL of 10-fold diluted TE, pH 8. The Alu sequence consists of conserved regions and variable regions. In the putative consensus Alu sequence, the conserved regions are the 25-bp span between nucleotide positions 23 and 47 and the 16-bp span between nucleotide positions 245 and 260. Although primers can be designed in any part of the Alu sequences, for more effectively amplifying Alu sequences, the PCR primers are preferably completely or partially (at least the 3′-regions of the primers) located in the conserved regions. Primers specific for the human Alu sequences were used to amplify fragments of differing lengths inside Alu repeats. The sequences were as follows:

(91) TABLE-US-00007 Amplicon size Primer Sequences 245 bp Forward Primer: 5′-ACGCCTGTAATCCCAGCACTT-3′ (SEQ ID NO: 44) Reverse Primer: 5′-TCGCCCAGGCTGGAGTGCA-3′ (SEQ ID NO: 45) 130 bp Forward Primer: 5′-TGGTGAAACCCCGTCTCTAC-3′ (SEQ ID NO: 46) Reverse Primer: 5′-CTCACTGCAACCTCCACCTC-3′ (SEQ ID NO: 47) 45 bp Forward Primer: 5′-TGGTGAAACCCCGTCTCTAC-3′ (SEQ ID NO: 46) Reverse Primer: 5′-CGCccGGCTAATTTTTGTAT-3′ (SEQ ID No: 48)

(92) Stool DNA was diluted 1:5 with Ix Tris-EDTA buffer (pH 7.5) for PCR amplification. Tris-EDTA buffer-diluted stool DNA (1 μL) was amplified in a total volume of 25 μL containing Ix iQ SYBR Green Supermix (Bio-Rad, Hercules, Calif.), 200 nmol/L each primer under the following conditions. 95° C. for 3 minutes followed by 40 cycles of 95° C., 60° C., and 72° C. for 30 seconds each. A standard curve was created for each plate by amplifying 10-fold serially diluted human genomic DNA samples (Novagen, Madison, Wis.). Melting curve analysis was made after each PCR to guarantee that only one product was amplified for all samples.

(93) Amplification was carried out in 96-well plates in an iCycler (Blo-Rad). Each plate consisted of stool DNA samples and multiple positive and negative controls. Each assay was done in duplicate.

(94) The following was performed to compare long DNA (245 bp) and short (45 bp) human DNA in stool for detection of upper and lower GI neoplasms, and to assess the effect of GI tumor site on human DNA levels in stool. Subjects included 33 patients with colorectal cancer. 20 with colorectal adenomas >1 cm. 13 with pancreatic cancer, and 33 colonoscopically-normal controls. Subjects added a preservative buffer to stools at time of collection to prevent post-defecation bacterial metabolism of DNA, and stools were frozen within 8 hours at −80° C. Using a validated quantitative assay for human DNA (Zou et al. Epidemiol Biomarkers Prev., 15:1115 (2006)). 245 bp and 45 bp Alu sequences were amplified from all stools in blinded fashion. Sensitivities for long and short DNA were based on 97 percent specificity cut-offs.

(95) Age medians were 60, 66, 69, and 62 for colorectal cancer, colorectal adenoma, pancreatic cancer, and control groups, respectively, and male/female distributions were 22/11, 9/11, 9/4, 11/21, respectively. In stools from neoplasm and control groups, amplification products were quantitatively greater for short DNA versus long DNA Respective sensitivities by long and short DNA were 66 percent and 62 percent with the 29 distal colorectal neoplasms. 46 percent and 46 percent with the 24 proximal colorectal neoplasms, and 15 percent and 31 percent (p=0.16) with the 13 pancreatic cancers. By Wilcoxan Rank-Sum test, effect of neoplasm site on detection rates was significant for both long DNA (p=0.004) and short DNA (p=0.02). Among colorectal neoplasms, respective sensitivities by long and short DNA were 48 percent and 52 percent with lesions <3 cm, 63 percent and 63 percent with those >3 cm, 64 percent and 61 percent with cancers, and 35 percent and 45 percent with adenomas.

(96) These results demonstrate that short and long DNA can be comparably sensitive for stool detection of GI neoplasms. However, detection rates vary with tumor site, being greatest with the most distal lesions and lowest with the most proximal ones. These results were consistent with substantial luminal degradation of DNA exfoliated from more proximal GI neoplasms.

(97) It was also demonstrated that mutant gene markers in stool can be detected to a greater extent if amplicon size is less than 70 bp, consistent with luminal degradation. Thus, short DNA can serve as a marker per se and as the target size for mutation detection.

Example 7—Use of Fecal Methylated BMP3 as a Neoplasia Marker

(98) Stools from patients with colorectal tumors were found to contain significantly elevated amounts of methylated BMP3 gene copies, but those from normal individuals were found to contain none or only trace amounts. When fecal methylated BMP3 was assayed with an appropriate amplification method, colorectal cancers and premalignant adenomas were specifically detected (FIG. 11). Fecal methylated BMP3 detected a higher percentage of proximal colon tumors than distal tumors, so it can be combined with markers for distal colorectal tumors to create complementary marker panels. Fecal methylated BMP3 was very specific with few false-positive reactions.

(99) Similar results can be obtained using other genes and methods such as those described elsewhere (Zou et al., Cancer Epidemiol Biomarkers Prev., 16(12):2686 (2007)).

Example 8—Detecting Aero-Digestive Cancers by Stool DNA Testing

(100) Tissue samples from patients with confirmed aero-digestive tumors were extracted and sequenced to assess the presence or absence of somatic gene alterations. Germline DNA from the same patients were used as controls. Once an alteration was confirmed, a matched stool sample was tested for that alteration. Two separate methods were utilized to detect the mutation in stool: Allele specific PCR and digital melt curve analysis. For both methods, we focused on amplifying the shortest fragments possible (>100 bp) that have been shown to contain higher levels of the mutant sequence.

(101) Digital Melt Curve (DMC)

(102) We studied 138 patients (69 cases with a GI neoplasm and 69 age/sex-matched asymptomatic controls with normal colonoscopy) by first, identifying a mutation in neoplasm tissue, and then determining if that specific mutation could be detected in stool from that individual. Stools were collected with a stability buffer and frozen at −80° C. until assayed.

(103) Genes commonly mutated in GI neoplasms (TP53, KRAS, APC, CDH1, CTNNB1, BRAF, SMAD4, and P16) were sequenced from DNA extracted from tumor tissue, to identify a target mutation for each case. Target genes were isolated by hybrid capture (Table 7) and the tissue-confirmed somatic mutations were assayed in stool by the digital melt curve method, as described in Example 1. Mutations detected in stool were confirmed by sequencing. Assays were performed blinded.

(104) TABLE-US-00008 TABLE 7 Sequence Specific Capture Probes and Primers for AD Cancer Mutation Detection SEQ SEQ SEQ ID SENSE PRIMER 1 ID ANTISENSE PRIMER 2 ID MUTATION IN TISSUE CAPTURE PROBE No. (5′ TO 3′) No. (5′ TO 3) No. 12487C > CT: ATGGCCATCTACAAGCAGTCA 49 AGTACTCCCCTGCCCTCAAC 128 CTCACAACCTCCGTCATGTG 169 167Q > Q/X TAGCACATGACGGAGGTTGT 102447_102450het_ AGAGTGAACCATGCAGTGGAA 50 TTTGAGAGTCGTTCGATTGC 129 CATGGTTTGTCCAGGGCTAT 27 delTGGT AAGTGGCATTATAAGCCC 12410G > GA, TTTGCCAACTGGCCAAGACCT 51 AGTACTCCCCTGCCCTCAAC 128 CTCCGTCATGTGCTGTGACT 170 141C > C/Y ACCCTGTGCAGCTGTG 102678het_delA CAGATGCTGATACTTTATTAC 52 TCCAGGTTCTTCCAGATGCT 130 CACTCAGGCTGGATGAACAA 22 TTTTGCCACGGAAAGTACT 102594_102598het_ AAAGCACCTACTGCTGAAAGA 53 AGCTCAAACCAAGCGAGAAG 131 AGCATCTGGAAGAACCTGGA 28 delAGAGA GAGTGGACCTAAGCAAG 102644_102646het_ ATGCTGCAGTTCAGAGGGGTC 54 GGACCTAAGCAAGCTGCAGT 132 CACTCAGGCTGGATGAACAA 22 insG CAGGTTCTTCCAGATGC A 102594_102595het_ TAAAGCACCTACTGCTGAAAA 55 AGCTCAAACCAAGCGAGAAG 131 AGCATCTGGAAGAACCTGGA 28 delAG GAGAGAGTGGACCTAAGCAAG 102106het_delT CACAGGAAGCAGATTCTGCAA 56 CAGACGACACAGGAAGCAGA 133 TGCTGGATTTGGTTCTAGGG 171 TACCCTGCAAATAGCA 102442het_delT TTCAGAGTGAACCATGCAGGG 57 TTTGAGAGTCGTTCGATTGC 129 CATGGTTTGTCCAGGGCTAT 27 AATGGTAAGTGGCATTAT apc 102494C > CT; TCCAGATAGCCCTGGATAAAC 58 GTGAACCATGCAGTGGAATG 25 AGCTGTTTGAGGAGGTGGTG 172 1429Q > Q/X CATGCCACCAAG apc 102557C > CT; CTCAAACAGCTCAAACCAAGT 59 ACCACCTCCTCAAACAGCTC 134 GCAGCTTGCTTAGGTCCACT 173 1450R > R/X GAGAAGTACCTAAAAATAAA apc AGCAGAAATAAAAGAAAAGTT 60 CAGACGACACAGGAAGCAGA 133 TGCTGGATTTGGTTCTAGGG 171 102140het_del GGAACTAGGTCAGCTGA A apc 102494C > CT; TCCAGATAGCCCTGGATAAAC 58 GTGAACCATGCAGTGGAATG 25 AGCTGTTTGAGGAGGTGGTG 172 1429Q > Q/X CATGCCACCAAG apc 102554het_delA CTCAAACAGCTCAAACCAGCG 61 CATGCCACCAAGCAGAAGTA 21 GCAGCTTGCTTAGGTCCACT 173 AGAAGTACCTAAA tp53 E5 12647A > TCTGGCCCCTCCTCAGCGTCT 62 CAGGCCTCTGATTCCTCACT 36 ACACGCAAATTTCCTTCCAC 174 AG: 193H > H/R TATCCGAGTGGAAG tp53 E5 12742G > CCTATGAGCCGCCTGAGATCT 63 CATAGTGTGGTGGTGCCCTA 135 AACCACCCTTAACCCCTCCT 175 GA GGTTTGCAACTGGG tp53 E5 12706C > ATGACAGAAACACTTTTTGAC 64 GTGGAAGGAAATTTGCGTGT 136 CAGTTGCAAACCAGACCTCA 176 CT: 213R > R/X ATAGTGTGGTGGTG tp53 E412712A > GAAACACTTTTCGACATGGTG 65 GTGGAAGGAAATTTGCGTGT 136 CAGTTGCAAACCAGACCTCA 176 AG: 215S > S/G TGGTGGTGCCCTAT tp53 E4 12388T > CTGCCCTCAACAAGATGCTTT 66 TGTTCACTTGTGCCCTGACT 137 GCAGGTCTTGGCCAGTTG 177 TC: 134F > F/L GCCAACTGGCCAAG tp53 E3 11606G > AAGTCTGTGACTTGCACAGTC 67 GTCTGGGCTTCTTGCATTCT 138 GCCAGGCATTGAAGTCTCAT 31 GA: 125T > T/T AGTTGCCCTGAGGG tp53 E6 13379C > GCATGGGCGGCATGAACTGGA 68 TGGCTCTGACTGTACCACCA 139 CCAGTGTGATGATGGTGAGG 178 CT: 248R > R/W GGCCCATCCTCACC tp53 12E3 11326A > TCTTTTCACCCATCTACCGTC 69 ACCTGGTCCTCTGACTGCTC 140 GGGGACAGCATCAAATCATC 179 AC (splice site) CCCCTTGCCGTCCC tp53 E6 13412G > CCATCATCACACTGGAATACT 70 CCTCACCATCATCACACTGG 141 GGGTCAGAGGCAAGCAGA 40 GT: 259D > D/Y CCAGGTCAGGAGCC tp53 E4 12449G > ACCCCCGCCCGTCACCCGCGT 71 GTGCAGCTGTGGGTTGATT 142 CTCCGTCATGTGCTGTGACT 170 GT: 154G > G/V CC tp53 E7 13872G > GGAACAGCTTTGAGGTGTGTG 72 GGAAGAGAATCTCCGCAAGA 143 GCTTCTTGTCCTGCTTGCTT 43 GT, 298E > E/X TTTGTGCCTGTCCT APC 102843C > CG: TCAGAGCAGCCTAAAGAATGA 73 ATGCCTCCAGTTCAGGAAAA 144 TTTTTCTGCCTCTTTCTCTT 180 1545S > S/X AATGAAAACCAAGAGAAA GG tp53 E4 12392G > CCTCAACAAGATGTTTTTCCA 74 TGCCCTGACTTTCAACTCTG 145 CTGCACAGGGCAGGTCTT 181 GT, 135C > C/F ACTGGCCAAGACCT T APC 102557C > CT: CTCAAACAGCTCAAACCAAGT 59 ACCACCTCCTCAAACAGCTC 134 GCAGCTTGCTTAGGTCCACT 173 1450R > R/X GAGAAGTACCTAAAAATAAA tp53 E7 13819G > CCTGTCCTGGGATAGACCGGC 75 CTACTGGGACGGAACAGCTT 146 GCGGAGATTCTCTTCCTCTG 182 T: 280R > I GCAC tp53 13E4 11326A > TCTTTTCACCCATCTACCGTC 69 ACCTGGTCCTCTGACTGCTC 140 GGGGACAGCATCAAATCATC 179 AC (splice site) CCCCTTGCCGTCCC tp53 E7 13412G > CCATCATCACACTGGAATACT 70 CCTCACCATCATCACACTGG 41 GGGTCAGAGGCAAGCAGA 40 GT: 259D > D/Y CCAGGTCAGGAGCC tp53 E5 12449G > ACCCCCGCCCGTCACCCGCGT 71 GTGCAGCTGTGGGTTGATT 142 CTCCGTCATGTGCTGTGACT 170 GT: 154G > G/V CC tp53 E8 13872G > GGAGCCTCACCACTAGCTGCC 76 GGAAGAGAATCTCCGCAAGA 143 GCTTCTTGTCCTGCTTGCTT 43 GT: 298E > E/X CCCAGG tp53 E8 13813C > GTGTTTGTGCCTGTCGTGGGA 77 CTACTGGGACGGAACAGCTT 146 GCGGAGATTCTCTTCCTCTG 182 CG, 278P > P/R GAGACCGGCG tp53 E8 13851A > GGAAGAGAATCTCCGCTAGAA 78 GCGCACAGAGGAAGAGAATC 147 TTCTTGTCCTGCTTGCTTAC 183 AT, 291K > K/X AGGGGAGCCTCA C smad4 E2 19049G > GTTAAATATTGTCAGTATGCA 79 AGGTGGCCTGATCTTCACAA 148 TGGATTCACACAGACACTAT 184 GA, 18118A > A/A TTTGACTTAAAATGTGATAG CACA tp53 E8 13777G > TAGTGGTAATCTACTGGAACG 80 TTTCCTTACTGCCTCTTGCT 149 CACAAACACGCACCTCAAAG 185 GA, 266G > G/E GAACAGCTTTGAGGTG TC tp53 E6 12653T > CCTCCTCAGCATCTTACCCGA 81 CAGGCCTCTGATTCCTCACT 36 ACACGCAAATTTCCTTCCAC 174 TC: 195I > I/T GTGGAAGGAAAT tp53 E7 13379C > GCATGGGCGGCATGAACTGGA 68 TGGCTCTGACTGTACCACCA 139 CCAGTGTGATGATGGTGAGG 178 CT: 248R > R/W GGCCCATCCTCACC tp53 E6 12647A > TCTGGCCCCTCCTCAGCGTCT 62 CAGGCCTCTGATTCCTCACT 36 ACACGCAAATTTCCTTCCAC 174 AG: 193H > H/R TATCCGAGTGGAAG tp53 E6 12712A > GAAACACTTTTCGACATGGTG 65 GTGGAAGGAAATTTGCGTGT 136 CAGTTGCAAACCAGACCTCA 76 AG: 215S > S/G TGGTGGTGCCCTAT tp53 E8 13872G > GGAGCCTCACCACTAGCTGCC 76 GGAAGAGAATCTCCGCAAGA 143 GCTTCTTGTCCTGCTTGCTT 43 GT, 298E > E/X CCCAGG tp53 E7 13370G > AGTTCCTGCATGGGCAGCATG 82 TGGCTCTGACTGTACCACCA 139 CCAGTGTGATGATGGTGAGG 178 GA: 245G > G/S AACCGGAGGC tp53 E4 11580het_ CTGGGCTTCTTGCATTCTGGA 83 CCCTTCCCAGAAAACCTACC 30 ACTGACCGTGCAAGTCACAG 186 delG CAGCCAAGTCTGTGA tp53 E5 12524A > TGCCCCCACCGTGAGCGCTGC 84 TGGCCATCTACAAGCAGTCA 150 CTGCTCACCATCGCTATCTG 187 AG, 179H > H/R >tp53 E6 12661G > TCAGCATCTTATCCGAGTGTA 85 CAGGCCTCTGATTCCTCACT 36 CCAAATACTCCACACGCAAA 188 GT, 198E > E/X AGGAAATTTGCGTGTGGA tp53 E8 13872G > GGAGCCTCACCACTAGCTGCC 76 GGAAGAGAATCTCCGCAAGA 143 GCTTCTTGTCCTGCTTGCTT 43 GT 298E > E/X CCCAGG apc 102494C > TCCAGATAGCCCTGGATAAAC 58 CAGGAGACCCCACTCATGTT 19 TGGCAAAATGTAATAAAGTA 20 CT; 1429Q > Q/X CATGCCACCAAG TCAGC apc 102557C > CT; CTCAAACAGCTCAAACCAAGT 59 CAGGAGACCCCACTCATGTT 19 TGGCAAAATGTAATAAAGTA 20 1450R > R/X GAGAAGTACCTAAAAATAAA TCAGC apc 102140het_delA AGCAGAAATAAAAGAAAAGTT 60 TTCATTATCATCTTTGTCAT 15 CGCTCCTGAAGAAAATTCAA 16 GGAACTAGGTCAGCTGA CAGC apc 102494C > CT; TCCAGATAGCCCTGGATAAAC 58 CAGGAGACCCCACTCATGTT 19 TGGCAAAATGTAATAAAGTA 20 1429Q > Q/X CATGCCACCAAG TCAGC apc 102134G > GT; TGCAAATAGCAGAAATAAAAT 86 TTCATTATCATCTTTGTCAT 15 CGCTCCTGAAGAAAATTCAA 16 1309E > E/X AAAAGATTGGAACTAGGTCA CAGC apc 102554het_delA CTCAAACAGCTCAAACCAGCG 61 CAGGAGACCCCACTCATGTT 19 TGGCAAAATGTAATAAAGTA 20 AGAAGTACCTAAA TCAGC apc 102852het_insA CTAAAGAATCAAATGAAAAAC 87 GAGCCTCGATGAGCCATTTA 23 TCAATATCATCATCATCTGA 24 CAAGAGAAAGAGGCAGAA ATCATC Kras 5571G > GA; GTGGTAGTTGGAGCTGATGGC 88 AGGCCTGCTGAAAATGACTG 2 TTGTTGGATCATATTCGTCC 3 12G > G/D GTAGGCAAGAGT AC tp53 E4 12392G > CCTCAACAAGATGTTTTACCA 89 TGTTCACTTGTGCCCTGACT 137 GCAGGTCTTGGCCAGTTG 177 GA; 135C > C/Y ACTGGCCAAGACCT tp53 E5 12655C > CCTCCTCAGCATCTTATCTGA 90 CAGGCCTCTGATTCCTCACT 36 ACACGCAAATTTCCTTCCAC 174 CT; 196R > R/X GTGGAAGGAAATTTGC tp53 E6 13350G > TCCACTACAACTACATGTATA 91 TGGCTCTGACTGTACCACCA 139 CCAGTGTGATGATGGTGAGG 178 GA; 238C > C/Y ACAGTTCCTGCATGGG tp53 E6 13420G > CACTGGAAGACTCCAGATCAG 92 CCTCACCATCATCACACTGG 141 GGGTCAGAGGCAAGCAGA 40 GA GAGCCACTTGCC tp53 E5 12712A > GAAACACTTTTCGACATGGTG 65 GTGGAAGGAAATTTGCGTGT 136 CAGTTGCAAACCAGACCTCA 176 AG; 215S > S/G TGGTGGTGCCCTAT Kras 5571G > GA; GTGGTAGTTGGAGCTGATGGC 88 AGGCCTGCTGAAAATGACTG 2 TTGTTGGATCATATTCGTCC 3 12G > G/D GTAGGCAAGAGT AC P16(ink4a) E1 GGAGAGGGGGAGTGCAGGCAG 93 AGCCAGTCAGCCGAAGG 151 GAGGGGCTGGCTGGTC 189 19638A > AT CGGG P16(ink4a) E2 CCCAACTGCGCCTACCCCGCC 94 CACCCTGGCTCTGACCAT 152 GGGTCGGGTGAGAGTGG 190 23353G > GT; ACTC 447D > DY P16(ink4a) E1 GGAGAGGGGGAGTGCAGGCAG 93 AGCCAGTCAGCCGAAGG 151 GAGGGGCTGGCTGGTC 189 19638A > AT CGGG p16(ink4a) E2 GCCCGGGAGGGCTCCTGGACA 95 GACCCCGCCACTCTCAC 153 CAGCTCCTCAGCCAGGTC 191 23402het_delT_ CGCTG p16(ink4a) E2 GCCCGGGAGGGCTTACTGGAC 96 GACCCCGCCACTCTCAC 153 CAGCTCCTCAGCCAGGTC 191 23403C > CA; ACGCTGGT 484F > F/ ctnnb 1 CAATGGGTCATATCACAGATT 97 ATATTTCAATGGGTCATATC 154 TCAAATCAGCTATAAATACG 192 25541het_delT CTTTTTTTTAAATTAAAGTAA ACAG AAACA CA cdh 1 E9 TCTTATCTCAAAAGAACAACA 98 GCCATGATCGCTCAAATACA 155 TCTCAGGGGGCTAAAGGATT 193 76435het_delA AAAAAGAGGAATCCTTTAG cdh 1 E1 GCGCCCAGCCCTGCGCCCATT 99 ACTTGCGAGGGACGCATT 156 GAAGAAGGGAAGCGGTGAC 194 743_744het_ CCTC insAGCCCTGCGCCCA cdh 1 E13 AAGTAAGTCCAGCTGGCAAAG 100 CATTCTGGGGATTCTTGGAG 157 GGAAATAAACCTCCTCCATT 195 8685386854het_insA TGACTCAGCCTTTGACTT TTT cdh 1 E14 91472C > AGGATGACACCCGGGACAATG 101 CTGTTTCTTCGGAGGAGAGC 158 CCGCCTCCTTCTTCATCATA 196 CT; 751N > N/N TTTATTACTATGATGAAG cdh 1 E 15 TTTTTTCTCCAAAGGACTGAC 102 TTCCTACTCTTCATTGTACT 159 TGCAACGTCGTTACGAGTCA 197 92868_92896het_ GCTCGGCCTGAAGTG TCAACC delTTGACTTGA GCCAGCTGCACAGGGGC CTG cdh 1 E4 71669* CAAGCAGAATTGCTCACTTTC 103 CGTTTCTGGAATCCAAGCAG 160 GCAGCTGATGGGAGGAATAA 198 het_delA CCAACTCCTCTCC cdh 1 E7 74926G > GGTCACAGCCACAGACACGGA 104 CCAGGAACCTCTGTGATGGA 161 TGAGGATGGTGTAAGCGATG 199 GA; 289A > A/T CGATGATGTGAA cdh 1 E1 736_ AGCCCTGCGCCCCTTCCTCTC 105 ACTTGCGAGGGACGCATT 156 GAAGAAGGGAAGCGGTGAC 194 742het_delTGCGCCC CCG p16(ink4a) E1 GGAGAGGGGGAGTGCAGGCAG 93 AGCCAGTCAGCCGAAGG 151 CTCACAACCTCCGTCATGTG 169 19638A > AT CGGG tp53 E4 12365A > TTCCTCTTCCTACAGTGCTCC 106 CACTTGTGCCCTGACTTTCA 33 GCCAGTTGGCAAAACATCT 200 AG; 126Y > Y/C CCTGCCCTCAAC tp53 E4 12548G > TGCTCAGATAGCGATGATGAG 107 CACATGACGGAGGTTGTGAG 162 AACCAGCCCTGTCGTCTCT 34 GA CAGCTGGGGCTG p16(ink4a) E1 GGTCGGAGGCCGAGCCAGGTG 108 TTCCAATTCCCCTGCAAA 163 CCCAACGCACCGAATAGT 201 19810T > TG; GGTAGA 491I > S tp53 E7 13757G > GCTTCTCTTTTCCTATCCTAA 109 GGGACAGGTAGGACCTGATT 164 AGCTGTTCCGTCCCAGTAGA 202 GA GTAGTGGTAATCTACTGG T tp53 E7 13815G > TTGTGCCTGTCCTCGGAGAGA 110 CTACTGGGACGGAACAGCTT 146 GCGGAGATTCTCTTCCTCTG 182 GC; 279G > G/R CCGGCG tp53 E7 13816G > TGTGCCTGTCCTGAGAGAGAC 111 CTACTGGGACGGAACAGCTT 146 GCGGAGATTCTCTTCCTCTG 182 GA; 279G > G/E CGGCGC tp53 E5 12365A > TTCCTCTTCCTACAGTCCTCC 112 CACTTGTGCCCTGACTTTCA 33 GCCAGTTGGCAAAACATCT 200 AC, 126Y > Y/S CCTGCCCTCAAC tp53 E5 12491A > TCTACAAGCAGTCACAGCTCA 113 TGGCCATCTACAAGCAGTCA 150 CTGCTCACCATCGCTATCTG 187 AT, 168H > HL TGACGGAGGTTGTGGA 113 TGGCCATCTACAAGCAGTCA 150 TCACCATCGCTATCTGAGCA 203 113 TGGCCATCTACAAGCAGTCA 150 AACCAGCCCTGTCGTCTCT 34 kras 5570G > GC, GTGGTAGTTGGAGCTGATGGC 88 AGGCCTGCTGAAAATGACTG 2 TTGTTGGATCATATTCGTCC 3 12G > G/R GTAGGCAAGAGT AC tp53 E7 13370G > AGTTCCTGCATGGGCAGCATG 82 TGGCTCTGACTGTACCACCA 139 CCAGTGTGATGATGGTGAGG 178 GA, 245G > G/S AACCGGAGGC apc 102864_ AAATGAAAACCAAGAGAAAGG 114 TGACAATGGGAATGAAACAG 165 GGTCCTTTTCAGAATCAATA 204 102865het_delAG CAGAAAAAACTATTGATTC A GTTTT tp53 E5 12386T > CCTGCCCTCAACAAGACGTTT 115 TGTTCACTTGTGCCCTGACT 137 GCAGGTCTTGGCCAGTTG 177 TC, 133M > M/T TGCCAACTGGCC cdh1 E15 93059G > GCTCATCTCTAAGCTCAGGAA 116 CCAAAGCATGGCTCATCTCT 205 CTCAGGCAAGCTGAAAACAT 206 GA GAGTTGTGTCAAAAATGAGA A tp53 E8 13798G > CGGAACAGCTTTGAGGTGCAT 117 CTACTGGGACGGAACAGCTT 146 GCGGAGATTCTCTTCCTCTG 182 GA: 273R > R/H GTTTGTGCCTGTCCTGGG p53 E6, 12698_ TGGAAGGAAATTTGCGTGTGG 118 GTGGAAGGAAATTTGCGTGT 136 AGCTGTTTGAGGAGGTGGTG 172 12701het_delAC AGTATTTGGATGACAG (1 or 2 AC repeats) P53 E8, 13824C > TGTCCTGGGAGAGACTGGCGC 119 CTACTGGGACGGAACAGCTT 146 GCGGAGATTCTCTTCCTCTG 182 CT, 282R > R/W ACAGAGGAAGAGAAT APC 102151G > GA, AAGAAAAGATTGGAACTAGAT 120 CAGACGACACAGGAAGCAGA 133 GTGACACTGCTGGAACTTCG 207 1314R > R/R CAGCTGAAGATCCTGTG P53 E5 12457 G > CGCCCGGCACCCGCTTCCGCG 121 GTGCAGCTGTGGGTTGATT 142 CTCCGTCATGTGCTGTGACT 170 G/T CCATGGCCA p53 E813812C > CG, GTGTTTGTGCCTGTGCTGGGA 122 CTACTGGGACGGAACAGCTT 146 GCGGAGATTCTCTTCCTCTG 182 278P > P/A GAGACCGGCG APC 102686het_delA AGGTTCTTCCAGATGCTGATA 123 CTGCAGTTCAGAGGGTCCAG 210 CACTCAGGCTGGATGAACAA 22 CTTTATTACATTTTGC APC het_delAG GCGAGAAGTACCTAAAAATAA 124 AGCTCAAACCAAGCGAGAAG 131 AGCATCTGGAAGAACCTGGA 28 between 102594_ AGCACCTACTGCTGAA 102603 (1 of 5 AG repeats) APC 102240C > CA, CAGGGTTCTAGTTTATCTTAA 125 CCCTAGAACCAAATCCAGCA 166 TGTCTGAGCACCACTTTTGG 208 1344S > S/X GAATCAGCCAGGCACA 102676102680del CCAGATGCTGATACTTTATTT 126 CTGCAGTTCAGAGGGTCCAG 167 CACTCAGGCTGGATGAACAA 22 ACATT TGCCACGGAAAGTACTC 12487C > CT: ATGGCCATCTACAAGCAGTCA 49 AGTACTCCCCTGCCCTCAAC 128 CTCACAACCTCCGTCATGTG 169 167Q > Q/X TAGCACATGACGGAGGTTGT 102447_102450het_ AGAGTGAACCATGCAGTGGAA 50 TTTGAGAGTCGTTCGATTGC 129 CATGGTTTGTCCAGGGCTAT 27 delTGGT AAGTGGCATTATAAGCCC 12410G > GA, TTTGCCAACTGGCCAAGACCT 51 AGTACTCCCCTGCCCTCAAC 128 CTCCGTCATGTGCTGTGACT 170 141C > C/Y ACCCTGTGCAGCTGTG 102678het_delA CAGATGCTGATACTTTATTAC 52 TCCAGGTTCTTCCAGATGCT 130 CACTCAGGCTGGATGAACAA 22 TTTTGCCACGGAAAGTACT 102594_102598het_ AAAGCACCTACTGCTGAAAGA 127 AGCTCAAACCAAGCGAGAAG 131 AGCATCTGGAAGAACCTGGA 28 delAGAGA GAGTGGACCTAAGCAAG 102776A > AT: ATGACAATGGGAATGAAACAG 14 TTTGCCACGGAAAGTACTCC 168 TTTCCTGAACTGGAGGCATT 209 1523R > R/X AATCAGAGCAGCCTAAAG 102644_102645het_ ATGCTGCAGTTCAGAGGGGTC 54 GGACCTAAGCAAGCTGCAGT 132 CACTCAGGCTGGATGAACAA 22 insG CAGGTTCTTCCAGATGC A 102594_102595het_ TAAAGCACCTACTGCTGAAAA 55 AGCTCAAACCAAGCGAGAAG 131 AGCATCTGGAAGAACCTGGA 28 delAG GAGAGAGTGGACCTAAGCAAG 102106het_delT CACAGGAAGCAGATTCTGCAA 56 CAGACGACACAGGAAGCAGA 133 TGCTGGATTTGGTTCTAGGG 171 TACCCTGCAAATAGCA 102442het_delT TTCAGAGTGAACCATGCAGGG 57 TTTGAGAGTCGTTCGATTGC 129 CATGGTTTGTCCAGGGCTAT 27 AATGGTAAGTGGCATTAT apc 102494C > CT; TCCAGATAGCCCTGGATAAAC 58 GTGAACCATGCAGTGGAATG 25 AGCTGTTTGAGGAGGTGGTG 172 1429Q > Q/X CATGCCACCAAG apc 102140het_delA AGCAGAAATAAAAGAAAAGTT 60 CAGACGACACAGGAAGCAGA 133 TGCTGGATTTGGTTCTAGGG 171 GGAACTAGGTCAGCTGA apc 102554het_delA CTCAAACAGCTCAAACCAGCG 61 CATGCCACCAAGCAGAAGTA 21 GCAGCTTGCTTAGGTCCACT 173 AGAAGTACCTAAA

(105) Target mutations were not detected in control stools. Target mutations were detected in stools from 68% (47/69) of patients with a GI neoplasm. Specifically, target mutations were detected in stools from 71% (36/51) of patients with cancer [40% (2/5) with oropharyngeal, 65% (11/17) with esophageal, 100% (4/14) with gastric, 55% (6/11) with pancreatic, 75% (3/4) with biliary or gallbladder, and 100% (10/10) with colorectal] and from 61% (11/18) with precancers [100% (2:2) with pancreatic intraductular papillary mucinous neoplasia and 56% (9/16) with colorectal advanced adenoma]. Mutant copies in genes recovered from stool averaged 0.4% (range 0.05-13.4%) for supracolonic and 1.4% (0.1-15.6%) for colorectal neoplasms, p=0.004 (Table 8).

(106) TABLE-US-00009 TABLE 8 Digital Melt Curve Detection of Validated Mutations in AD Cancer Patient Stool # ID Site Age Gender Tissue Mutation Stool Detection Mutation Frequency % Normal Control 1 1163 Head/Neck(pharynx) 73 M tp53 YES 0.8 Neg 2 1250 Head/Neck(pharynx) 49 M tp53 NO Neg 3 1295 Head/Neck(pharynx) 47 F tp53 NO Neg 4 1391 Head/Neck 65 M tp53 TP53 NO (both) Neg 5 1427 Head/Neck 60 M tp53 tp53 YES(p53-1), No (p53-2) 0.05 Neg 1 745 Esophagus 84 F tp53 YES 0.4 Neg 2 769 Esophagus 56 F tp53 YES 0.4 Neg 3 782 Esophagus 55 M tp53 NO Neg 4 789 Esophagus 61 M tp53 YES 1.6 Neg 5 819 Esophagus 53 M tp53 YTES 0.2 Neg 6 873 Esophagus 61 M tp53 YES 0.2 Neg 7 906 Esophagus 55 M APC YES 0.8 Neg 8 1049 Esophagus 57 M tp53 NO Neg 9 1064 Esophagus 72 F tp53 NO Neg 10 1067 Esophagus 72 M tp53 YES 0.7 Neg 11 1103 Esophagus 78 M tp53 NO Neg 12 1199 Esophagus 66 M tp53 YES 0.5 NEG 13 1307 Esophagus 51 M tp53 NO NEG 14 1373 Esophagus 76 M tp53 YES 0.5 NEG 15 1414 Esophagus 66 M tp53 YES 0.1 NEG 16 1448 Esophagus 82 M tp53 NO NEG 17 1072 Esophagus tp53 YES 0.4 NEG 1 798 Stomach 81 M cdh1 YES 13.2 NEG 3 1221 Stomach 55 M cdh1 cdh1 YES(both) 8, 1.3 NEG 4 1224 Stomach 75 F smad4 cdh1 YES(smad4), No(CDH1) 0.2 NEG 5 1402 Stomach 56 M APC tp53 YES (p53) 0.1 NEG 1 848 Gall Bladder 67 M tp53 YES 0.1 NEG 2 1315 Gall Bladder 57 F tp53 YES 1.4 NEG 1 1043 Bile Duct 51 F APC NO NEG 2 1554 Bile Duct 77 M cdh1 YES 13.4 NEG 1 757 Pancreatic Cancer in situ 78 M K-ras YES 0.2 NEG 2 1349 Pancreatic Cancer in situ 64 M K-ras YES 0.2 NEG 1 839 Pancreas 69 F tp53 YES 0.2 NEG 2 1204 Pancreas 65 F p16 NO NEG 3 1253 Pancreas 63 F K-ras tp53 Yes(k-ras), No(p53) 2 NEG 4 1400 Pancreas 71 F tp53 K-ras NO (both) NEG 5 1547 Pancreas 77 F tp53 NO NEG 6 1217 Pancreas K-ras NO NEG 7 1073 Pancreas K-ras NO NEG 8 532 Pancreas K-ras YES 1 NEG 9 1592 Pancreas K-ras P53 YES (both) 0.3 NEG 10 1695 Pancreas K-ras YES 0.2 NEG 11 1058 Pancreas K-ras P53 APC YES(K-ras) 0.2 NEG 1 438 Colorectal Cancer 78 F APC YES 1.2 NEG 2 446 Colorectal Cancer 74 M BRAF YES 0.4 NEG 3 529 Colorectal Cancer 46 M K-RAS YES 1 NEG 4 489 Colorectal Cancer 73 M K-RAS YES 2.6 NEG 5 549 Colorectal Cancer 79 M BRAF YES 1.6 NEG 6 551 Colorectal Cancer 69 M K-RAS YES 5.8 NEG 7 584 Colorectal Cancer 68 M K-RAS YES 1.4 NEG 8 894 Colorectal Cancer 57 M P53 APC YES(p53, APC) 1.6, 5 NEG 9 998 Colorectal Cancer 45 F APC KRAS YES(K-ras, APC) 0.6, 0.8 NEG 10 1009 Colorectal Adenoma 65 F P53 YES 12.9 NEG 1 513 Colorectal Adenoma 65 F APC YES 0.1 NEG 2 546 Colorectal Adenoma 61 M APC NO NEG 3 547 Colorectal Adenoma 52 F APC NO NEG 4 568 Colorectal Adenoma 52 M APC YES 7.8 NEG 5 578 Colorectal Adenoma 71 F APC NO NEG 6 590 Colorectal Adenoma 54 F APC YES 3.2 NEG 7 701 Colorectal Adenoma 72 F APC NO NEG 8 855 Colorectal Adenoma 75 M K-RAS YES 0.4 NEG 9 860 Colorectal Adenoma 53 M APC YES 15.6 NEG 10 900 Colorectal Adenoma 64 F APC K-RAS No(both) NEG 11 962 Colorectal Adenoma 56 M K-RAS Yes 1 NEG 12 965 Colorectal Adenoma 82 M APC K-RAS No (both) NEG 13 991 Colorectal Adenoma 79 M APC K-RAS YES(K-ras), No(APC) 0.2 NEG 14 1135 Colorectal Adenoma 59 M K-RAS YES 13 NEG 15 1231 Colorectal Adenoma 50 M APC NO NEG 16 1559 Colorectal Adenoma K-RAS YES 1 NEG

(107) We also performed an initial pilot study with 10 stool samples from patients with confirmed bile duct cancers to determine if DMC technology could detect mutations in k-ras, a well characterized gene known be mutated in this population. K-ras mutations were detected in stools for 3/10 or 4/10 bile duct cancers (depending on mutation score of 5 or 3, respectively) (Table 9). As K-ras is mutant in 30-40% of bile duct cancers, these results indicate that the detection assay is picking up the appropriate proportion of cancer samples.

(108) TABLE-US-00010 TABLE 9 K-ras mutation scores for patients with bile duct cancer. K-ras Sample Mutation Mutation Detected # Pathology Score A B 520 BD Cancer 0 528 BD Cancer 0 559 BD Cancer 0 558 BD Cancer 1 Codon 12 GAT 515 BD Cancer 2 Codon 12 GAT 543 BD Cancer 2 Codon 13 GAC 806 BD Cancer 3 Codon 12 TGT 539 BD Cancer 5 Codon 12 GAT Codon 13 GGA 512 BD Cancer 6 Codon 12 GAT Codon 12 GAT 725 BD Cancer 25 Codon 13 GAC Codon 12 GAT; Codon 12 GTT
Allele Specific PCR

(109) The allele specific-PCR assay was a modified version of a previously published method (e.g., Cha et al., Mismatch Amplification Mutation Assay (MAMA): Application to the c-H-ras Gene PCR Methods and Applications. 2-14-20 (1992) Cold Spring Harbor Laboratory). TP53 gene fragments were captured from stool DNA samples with probes specific to mutations identified in the matched tissue (Table 7). Copy numbers were assessed by qPCR. Samples were adjusted to 10,000 fragments each and amplified with allele specific primer sets.

(110) TABLE-US-00011 Sample F Primer R Primer A745 GACAGAAACACTTTAT CGGCTCATAGGG (SEQ ID No: 211) (SEQ ID NO: 217) A848 ACACTTTTCGACAAG AAACCAGACCTCAG (SEQ ID No: 212) (SEQ ID No: 218) A789 CCTCAACAAGATAC CAGCTGCACAGG (SEQ ID No: 213) (SEQ ID NO: 219) A782 GCCGCCTGAAA AGACCCCAGTTGC (SEQ ID No: 214) (SEQ ID No: 220) A873 GCGGCATGAAAT TTCCAGTGTGATGAT (SEQ ID No: 215) (SEQ D NO: 221) A769 CCCCTCCTCAGAG CTTCCACTCGGATAA (SEQ ID No: 216) (SEQ ID No: 222)
The Forward Primer in Each Case is Specific for Each TP53 Mutation.
Esophagus and Stomach

(111) Targeting mutations found in esophageal cancers or those from gastroesophageal junction (on p53, APC, or K-ras), the same mutation was detected by allele-specific PCR in matched stools from five of five (100%) cancers but in none of the controls (Table 10). The threshold cycle (Ct), designates the PCR cycle at which the product enters the exponential phase of amplification.

(112) Gallbladder

(113) Targeting a mutation confirmed in a gallbladder cancer, the same mutation was found in the matched stool from that patient using allele-specific PCR (Table 10).

(114) TABLE-US-00012 TABLE 10 Quantitative Mutant Allele Specific-PCR Results for Matched Aero-digestive Cancers Sample Gene # fragments Ct A769 esophageal/gastric cancer p53 10K 71 N normal p53 10K >80 A782 esophageal/gastric cancer p53 10K 38.8 N normal p53 10K 44.1 A745 esophageal/gastric cancer p53 30K 42.5 N normal p53 30K 45.6 A873 esophageal/gastric cancer p53 30K 37.8 N normal p53 30K 40.4 A848 gall bladder cancer p53 10K 22.4 N normal p53 10K 36.3 A789 esophageal/gastric cancer p53 10K 25.9 N normal p53 10K 28.7

Example 9—Candidate Stool Polypeptide Markers Identified for Colorectal Cancer and Precancerous Adenomas

(115) The following list of polypeptides were identified by a statistical analysis model using all data generated from mass spectral of fecal protein extracts: β2-macroglobulin, compliment C3 protein, serotransferrin, haptoglobin, carbonic anhydrase 1, xaa-pro dipeptidase, leukocyte elastase inhibitor, hemoglobin, glucose-6-phosphate, and catalase. This list of polypeptides is in order of difference from normal. Thus, the mean spectral abundance for β2-macroglobulin is most different from normal for cancer and adenoma.

(116) The statistical significance of relative poly peptide abundance between normal, adenoma, and colorectal cancer (CRC) was obtained using normalized spectral count data from a zero inflated poisson regression model as an offset term in the protein specific differential expression analysis. The differential expression analysis also incorporated the zero inflated poisson regression model. Polypeptides were then ranked according to their statistical significance and whether the expression profile followed the clinically relevant pattern of Normal<Adenoma<CRC. Using a rule that a positive test required that any three of top six markers be positive, the sensitivity and specificity of this panel were both 100% in a training set.

(117) The listed polypeptides can be used individually or in any combination to detect colorectal cancer or precancerous adenomas.

Example 10—Identification of Polypeptide Markers for Pancreatic Cancer

(118) Potential polypeptide markers for pancreatic cancer prediction were identified. Utilizing a Scaffold (Proteome Software) side-by-side comparison of spectral abundances, ratios of the spectral counts of carboxypeptidase B (CBPB1_HUMAN) and Carboxypeptidase A1 (CBPA1_HUMAN) were compared. A value for carboxypeptidase B/Al of 0.7 or higher was predictive of pancreatic cancer (in normal stools, an average ratio of 2:3 B/Al was observed). Specificity for training data was 100% with a sensitivity of 88%, while sensitivity from a validation set was 82% at the same specificity.

Example 11—Use of Fecal Methylated ALX4 as a Neoplasia Marker

(119) Stools from patients with colorectal tumors were found to contain significantly elevated amounts of methylated ALX4 gene copies, but those from normal individuals were found to contain none or only trace amounts. When fecal methylated ALX4 was assayed with an appropriate amplification method, colorectal cancers and premalignant adenomas were specifically detected. At 90% specificity, fecal methylated ALX4 detected 59% colorectal cancer and 54% premalignant adenomas, allowing for the detection of both colorectal cancer and premalignant adenomas.

OTHER EMBODIMENTS

(120) It is to be understood that while the invention has been described in conjunction with the detailed description thereof, the foregoing description is intended to illustrate and not limit the scope of the invention, which is defined by the scope of the appended claims. Other aspects, advantages, and modifications are within the scope of the following claims.

Detecting neoplasm

Assignee

Inventors

Cpc classification

Classification Explorer

C12Q1/37

CHEMISTRY; METALLURGY

Classification Explorer

G01N2333/966

PHYSICS

Classification Explorer

C12Q1/40

CHEMISTRY; METALLURGY

Classification Explorer

C12Q2600/154

CHEMISTRY; METALLURGY

Classification Explorer

G01N2560/00

PHYSICS

Classification Explorer

G01N33/57419

PHYSICS

Classification Explorer

C12Y302/01001

CHEMISTRY; METALLURGY

Classification Explorer

C12Q2600/158

CHEMISTRY; METALLURGY

Classification Explorer

G01N2333/928

PHYSICS

Classification Explorer

G01N33/57407

PHYSICS

Classification Explorer

C12Q2600/156

CHEMISTRY; METALLURGY

Classification Explorer

C12Y304/2107

CHEMISTRY; METALLURGY

Classification Explorer

G01N33/57438

PHYSICS

Classification Explorer

C12Q1/6886

CHEMISTRY; METALLURGY

International classification

Classification Explorer

C12Q1/6886

CHEMISTRY; METALLURGY

Classification Explorer

C12Q1/37

CHEMISTRY; METALLURGY

Classification Explorer

C12Q1/40

CHEMISTRY; METALLURGY

Classification Explorer

G01N33/574

PHYSICS

Abstract

Claims

Description