MULTI-PRIMER AMPLIFICATION METHOD FOR TAGGING OF TARGET NUCLEIC ACIDS
20200102594 ยท 2020-04-02
Inventors
- Andrew May (San Francisco, CA)
- Peilin Chen (Richmond, CA)
- Jun Wang (Palo Alto, CA)
- Fiona Kaper (San Mateo, CA)
- Megan Anderson (Washington, DC)
Cpc classification
C12Q2525/161
CHEMISTRY; METALLURGY
B01L3/502738
PERFORMING OPERATIONS; TRANSPORTING
C12Q2525/155
CHEMISTRY; METALLURGY
B01L7/52
PERFORMING OPERATIONS; TRANSPORTING
B01L3/50273
PERFORMING OPERATIONS; TRANSPORTING
B01L2300/0864
PERFORMING OPERATIONS; TRANSPORTING
B01L2300/0867
PERFORMING OPERATIONS; TRANSPORTING
B01L2400/0487
PERFORMING OPERATIONS; TRANSPORTING
C12Q1/6806
CHEMISTRY; METALLURGY
C12Q1/6806
CHEMISTRY; METALLURGY
B01L2300/0816
PERFORMING OPERATIONS; TRANSPORTING
B01L2300/087
PERFORMING OPERATIONS; TRANSPORTING
C12Q2525/155
CHEMISTRY; METALLURGY
International classification
C12Q1/6806
CHEMISTRY; METALLURGY
Abstract
In certain embodiments, the present invention provides amplification methods in which nucleotide tag(s) and, optionally, a barcode nucleotide sequence are added to target nucleotide sequences. In other embodiments, the present invention provides a microfluidic device that includes a plurality of first input lines and a plurality of second input lines. The microfluidic device also includes a plurality of sets of first chambers and a plurality of sets of second chambers. Each set of first chambers is in fluid communication with one of the plurality of first input lines. Each set of second chambers is in fluid communication with one of the plurality of second input lines. The microfluidic device further includes a plurality of first pump elements in fluid communication with a first portion of the plurality of second input lines and a plurality of second pump elements in fluid communication with a second portion of the plurality of second input lines.
Claims
1-81. (canceled)
82. A method of preparing reaction products, the method comprising: providing M samples; providing N assays; mixing the M samples and N assays to form MN pairwise combinations, each of the MN pairwise combinations being contained in a closed volume; forming MN reaction products from the MN pairwise combinations; and recovering the MN reaction products.
83. The method of claim 82 wherein the M samples are contained in M sets of first chambers and the N assays are contained in N sets of second chambers.
84. The method of claim 83 wherein the first chambers containing one of the M samples are associated with a set of second chambers, each of the set of second chambers containing one of the N assays.
85. The method of claim 84 wherein mixing the M samples and N assays comprises opening fluid lines to provide for fluid communication between the firsrt chambers containing one of the M samples and each of the set of second chambers containing one of the N assays.
86. The method of claim 82 wherein the first chambers are characterized by a volume of less than or equal to 100 nl.
87. The method of claim 86 wherein the volume is less than or equal to 40 nl.
88. The method of claim 82 wherein the second chambers are characterized by a volume of less than or equal to 10 nl.
89. The method of claim 88 wherein the volume is less than or equal to 2 nl.
90. The method of claim 82 wherein the closed volume is characterized by a volume of less than or equal to 100 nl.
91. The method of claim 82 further comprising thermocycling the MN pairwise combinations.
92. The method of claim 82 wherein the M samples are provided at M sample ports of a microfluidic device.
93. The method of claim 91 wherein the MN samples are recovered at the M sample ports of the microfluidic device.
94. The method of claim 82 wherein forming the MN pairwise combinations is performed concurrently.
95. The method of claim 82 wherein forming the MN pairwise combinations is performed sequentially.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0044] The invention may be understood by reference to the following description taken in conjunction with the accompanying drawings that illustrate certain specific embodiments of the present invention.
[0045]
[0046]
[0047]
[0048]
[0049]
[0050]
[0051]
[0052]
[0053]
[0054]
[0055]
[0056]
[0057]
[0058]
[0059]
[0060]
[0061]
[0062]
[0063]
[0064]
DETAILED DESCRIPTION
[0065] In certain embodiments, the present invention provides amplification methods in which nucleotide tag(s) and a barcode nucleotide sequence are added to target nucleotide sequences. The added sequences can then serve as primer and/or probe-binding sites. The barcode nucleotide sequence can encode information, such as, e.g., sample origin, about the target nucleotide sequence to which it is attached. Tagging and/or barcoding target nucleotide sequences can increase the number of samples that can be analyzed for one or multiple targets in a single assay, while minimizing increases in assay cost. The methods are particularly well-suited for increasing the efficiency of assays performed on microfluidic devices.
[0066] In particular embodiments, the methods are used to prepare nucleic acids for DNA sequencing by, e.g., adding binding sites for DNA sequencing primers, optionally followed by sample calibration for DNA sequencing. In specific, illustrative embodiments, the method can be employed to add binding sites for DNA sequencing primers in a microfluidic device that permits recovery of reaction products. In illustrative devices of this type, dilation pumping can utilized to remove substantially all of the reaction products from the microfluidic device, providing uniformity between the various reaction product pools. Thus, it is possible to produce pools of barcoded reaction products that are uniform with respect to volume and copy number. In various embodiments, the volume and/or copy number uniformity is such that the variability, with respect to volume and/or copy number, of each pool recovered from the device is less than about 100 percent, less than about 90 percent, less than about 80 percent, less than about 70 percent, less than about 60 percent, less than about 50 percent, less than about 40 percent, less than about 30 percent, less than about 20 percent, less than about 17 percent, or less than about 15, 12, 10, 9, 8, 7, 6, 5, 4.5, 4, 3.5, 3, 2.5, 2, 1.5, 1, or 0.5 percent. Those of skill in the art appreciate that the volume and/or copy number variability may fall within any range bounded by any of these values (e.g., about 2 to about 7 percent). In an illustrative embodiment, the volume samples recovered from a microfluidic device vary by no more than approximately 10%. Standard pipetting error is on the order of between 5 and 10%. Thus, the observed variability in volumes is largely attributable to pipetting error. Utilizing the systems and methods described herein, the time and labor required to prepare sequencing libraries is reduced in comparison with conventional techniques.
[0067] It is understood that the invention is not limited to the particular methodology, protocols, and reagents, etc., described herein, as these can be varied by the skilled artisan. It is also understood that the terminology used herein is used for the purpose of describing particular illustrative embodiments only, and is not intended to limit the scope of the invention. It also noted that as used herein and in the appended claims, the singular forms a, an, and the include the plural reference unless the context clearly dictates otherwise. Thus, for example, a reference to a cell is a reference to one or more cells and equivalents thereof known to those skilled in the art.
[0068] The embodiments of the invention and the various features and advantageous details thereof are explained more fully with reference to the non-limiting embodiments and examples that are described and/or illustrated in the accompanying drawings and detailed in the following description. It should be noted that the features illustrated in the drawings are not necessarily drawn to scale, and features of one embodiment may be employed with other embodiments as the skilled artisan would recognize, even if not explicitly stated herein. Descriptions of well-known components and processing techniques may be omitted so as to not unnecessarily obscure the embodiments of the invention.
Definitions
[0069] Terms used in the claims and specification are defined as set forth below unless otherwise specified. These terms are defined specifically for clarity, but all of the definitions are consistent with how a skilled artisan would understand these terms.
[0070] The term adjacent, when used herein to refer two nucleotide sequences in a nucleic acid, can refer to nucleotide sequences separated by 0 to about 20 nucleotides, more specifically, in a range of about 1 to about 10 nucleotides, or sequences that directly abut one another.
[0071] The term nucleic acid refers to a nucleotide polymer, and unless otherwise limited, includes known analogs of natural nucleotides that can function in a similar manner (e.g., hybridize) to naturally occurring nucleotides.
[0072] The term nucleic acid includes any form of DNA or RNA, including, for example, genomic DNA; complementary DNA (cDNA), which is a DNA representation of mRNA, usually obtained by reverse transcription of messenger RNA (mRNA) or by amplification; DNA molecules produced synthetically or by amplification; and mRNA.
[0073] The term nucleic acid encompasses double- or triple-stranded nucleic acids, as well as single-stranded molecules. In double- or triple-stranded nucleic acids, the nucleic acid strands need not be coextensive (i.e, a double-stranded nucleic acid need not be double-stranded along the entire length of both strands).
[0074] The term nucleic acid also encompasses any chemical modification thereof, such as by methylation and/or by capping. Nucleic acid modifications can include addition of chemical groups that incorporate additional charge, polarizability, hydrogen bonding, electrostatic interaction, and functionality to the individual nucleic acid bases or to the nucleic acid as a whole. Such modifications may include base modifications such as 2-position sugar modifications, 5-position pyrimidine modifications, 8-position purine modifications, modifications at cytosine exocyclic amines, substitutions of 5-bromo-uracil, backbone modifications, unusual base pairing combinations such as the isobases isocytidine and isoguanidine, and the like.
[0075] More particularly, in certain embodiments, nucleic acids, can include polydeoxyribonucleotides (containing 2-deoxy-D-ribose), polyribonucleotides (containing D-ribose), and any other type of nucleic acid that is an N- or C-glycoside of a purine or pyrimidine base, as well as other polymers containing nonnucleotidic backbones, for example, polyamide (e.g., peptide nucleic acids (PNAs)) and polymorpholino (commercially available from the Anti-Virals, Inc., Corvallis, Oreg., as Neugene) polymers, and other synthetic sequence-specific nucleic acid polymers providing that the polymers contain nucleobases in a configuration which allows for base pairing and base stacking, such as is found in DNA and RNA. The term nucleic acid also encompasses linked nucleic acids (LNAs), which are described in U.S. Pat. Nos. 6,794,499, 6,670,461, 6,262,490, and 6,770,748, which are incorporated herein by reference in their entirety for their disclosure of LNAs.
[0076] The nucleic acid(s) can be derived from a completely chemical synthesis process, such as a solid phase-mediated chemical synthesis, from a biological source, such as through isolation from any species that produces nucleic acid, or from processes that involve the manipulation of nucleic acids by molecular biology tools, such as DNA replication, PCR amplification, reverse transcription, or from a combination of those processes.
[0077] The term target nucleic acids is used herein to refer to particular nucleic acids to be detected in the methods of the invention.
[0078] As used herein the term target nucleotide sequence refers to a molecule that includes the nucleotide sequence of a target nucleic acid, such as, for example, the amplification product obtained by amplifying a target nucleic acid or the cDNA produced upon reverse transcription of an RNA target nucleic acid.
[0079] As used herein, the term complementary refers to the capacity for precise pairing between two nucleotides. I.e., if a nucleotide at a given position of a nucleic acid is capable of hydrogen bonding with a nucleotide of another nucleic acid, then the two nucleic acids are considered to be complementary to one another at that position. Complementarity between two single-stranded nucleic acid molecules may be partial, in which only some of the nucleotides bind, or it may be complete when total complementarity exists between the single-stranded molecules. The degree of complementarity between nucleic acid strands has significant effects on the efficiency and strength of hybridization between nucleic acid strands.
[0080] Specific hybridization refers to the binding of a nucleic acid to a target nucleotide sequence in the absence of substantial binding to other nucleotide sequences present in the hybridization mixture under defined stringency conditions. Those of skill in the art recognize that relaxing the stringency of the hybridization conditions allows sequence mismatches to be tolerated.
[0081] In particular embodiments, hybridizations are carried out under stringent hybridization conditions. The phrase stringent hybridization conditions generally refers to a temperature in a range from about 5 C. to about 20 C. or 25 C. below than the melting temperature (T.sub.m) for a specific sequence at a defined ionic strength and pH. As used herein, the T.sub.m is the temperature at which a population of double-stranded nucleic acid molecules becomes half-dissociated into single strands. Methods for calculating the T.sub.m of nucleic acids are well known in the art (see, e.g., Berger and Kimmel (1987) METHODS IN ENZYMOLOGY, VOL.152: GUIDE TO MOLECULAR CLONING TECHNIQUES, San Diego: Academic Press, Inc. and Sambrook et al. (1989) MOLECULAR CLONING: A LABORATORY MANUAL, 2ND ED., VOLS. 1-3, Cold Spring Harbor Laboratory), both incorporated herein by reference). As indicated by standard references, a simple estimate of the T.sub.m value may be calculated by the equation: T.sub.m=81.5+0.41(% G+C), when a nucleic acid is in aqueous solution at 1 M NaCl (see, e.g., Anderson and Young, Quantitative Filter Hybridization in NUCLEIC ACID HYBRIDIZATION (1985)). The melting temperature of a hybrid (and thus the conditions for stringent hybridization) is affected by various factors such as the length and nature (DNA, RNA, base composition) of the primer or probe and nature of the target nucleic acid (DNA, RNA, base composition, present in solution or immobilized, and the like), as well as the concentration of salts and other components (e.g., the presence or absence of formamide, dextran sulfate, polyethylene glycol). The effects of these factors are well known and are discussed in standard references in the art. Illustrative stringent conditions suitable for achieving specific hybridization of most sequences are: a temperature of at least about 60 C. and a salt concentration of about 0.2 molar at pH7.
[0082] The term oligonucleotide is used to refer to a nucleic acid that is relatively short, generally shorter than 200 nucleotides, more particularly, shorter than 100 nucleotides, most particularly, shorter than 50 nucleotides. Typically, oligonucleotides are single-stranded DNA molecules.
[0083] The term primer refers to an oligonucleotide that is capable of hybridizing (also termed annealing) with a nucleic acid and serving as an initiation site for nucleotide (RNA or DNA) polymerization under appropriate conditions (i.e., in the presence of four different nucleoside triphosphates and an agent for polymerization, such as DNA or RNA polymerase or reverse transcriptase) in an appropriate buffer and at a suitable temperature. The appropriate length of a primer depends on the intended use of the primer, but primers are typically at least 7 nucleotides long and, more typically range from 10 to 30 nucleotides, or even more typically from 15 to 30 nucleotides, in length. Other primers can be somewhat longer, e.g., 30 to 50 nucleotides long. In this context, primer length refers to the portion of an oligonucleotide or nucleic acid that hybridizes to a complementary target sequence and primes nucleotide synthesis. Short primer molecules generally require cooler temperatures to form sufficiently stable hybrid complexes with the template. A primer need not reflect the exact sequence of the template but must be sufficiently complementary to hybridize with a template. The term primer site or primer binding site refers to the segment of the target nucleic acid to which a primer hybridizes.
[0084] A primer is said to anneal to another nucleic acid if the primer, or a portion thereof, hybridizes to a nucleotide sequence within the nucleic acid. The statement that a primer hybridizes to a particular nucleotide sequence is not intended to imply that the primer hybridizes either completely or exclusively to that nucleotide sequence. For example, in certain embodiments, amplification primers used herein are said to anneal to a nucleotide tag. This description encompasses primers that anneal wholly to the nucleotide tag, as well as primers that anneal partially to the nucleotide tag and partially to an adjacent nucleotide sequence, e.g., a target nucleotide sequence. Such hybrid primers can increase the specificity of the amplification reaction.
[0085] As used herein, the selection of primers so as to avoid substantial annealing to the target nucleic acids means that primers are selected so that the majority of the amplicons detected after amplification are full-length in the sense that they result from priming at the expected sites at each end of the target nucleic acid, as opposed to amplicons resulting from priming within the target nucleic acid, which produces shorter-than-expected amplicons. In various embodiments, primers are selected to that at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% are full-length.
[0086] The term primer pair refers to a set of primers including a 5 upstream primer or forward primer that hybridizes with the complement of the 5 end of the DNA sequence to be amplified and a 3 downstream primer or reverse primer that hybridizes with the 3 end of the sequence to be amplified. As will be recognized by those of skill in the art, the terms upstream and downstream or forward and reverse are not intended to be limiting, but rather provide illustrative orientation in particular embodiments.
[0087] A probe is a nucleic acid capable of binding to a target nucleic acid of complementary sequence through one or more types of chemical bonds, generally through complementary base pairing, usually through hydrogen bond formation, thus forming a duplex structure. The probe binds or hybridizes to a probe binding site. The probe can be labeled with a detectable label to permit facile detection of the probe, particularly once the probe has hybridized to its complementary target. Alternatively, however, the probe may be unlabeled, but may be detectable by specific binding with a ligand that is labeled, either directly or indirectly. Probes can vary significantly in size. Generally, probes are at least 7 to 15 nucleotides in length. Other probes are at least 20, 30, or 40 nucleotides long. Still other probes are somewhat longer, being at least 50, 60, 70, 80, or 90 nucleotides long. Yet other probes are longer still, and are at least 100, 150, 200 or more nucleotides long. Probes can also be of any length that is within any range bounded by any of the above values (e.g., 15-20 nucleotides in length).
[0088] The primer or probe can be perfectly complementary to the target nucleic acid sequence or can be less than perfectly complementary. In certain embodiments, the primer has at least 65% identity to the complement of the target nucleic acid sequence over a sequence of at least 7 nucleotides, more typically over a sequence in the range of 10-30 nucleotides, and often over a sequence of at least 14-25 nucleotides, and more often has at least 75% identity, at least 85% identity, at least 90% identity, or at least 95%, 96%, 97%. 98%, or 99% identity. It will be understood that certain bases (e.g., the 3 base of a primer) are generally desirably perfectly complementary to corresponding bases of the target nucleic acid sequence. Primer and probes typically anneal to the target sequence under stringent hybridization conditions.
[0089] The term nucleotide tag is used herein to refer to a predetermined nucleotide sequence that is added to a target nucleotide sequence. The nucleotide tag can encode an item of information about the target nucleotide sequence, such the identity of the target nucleotide sequence or the identity of the sample from which the target nucleotide sequence was derived. In certain embodiments, such information may be encoded in one or more nucleotide tags, e.g., a combination of two nucleotide tags, one on either end of a target nucleotide sequence, can encode the identity of the target nucleotide sequence.
[0090] As used herein the term barcode primer refers to a primer that includes a specific barcode nucleotide sequence that encodes information about the amplicon produced when the barcode primer is employed in an amplification reaction. For example, a different barcode primer can be employed to amplify one or more target sequences from each of a number of different samples, such that the barcode nucleotide sequence indicates the sample origin of the resulting amplicons.
[0091] As used herein, the term encoding reaction refers to reaction in which at least one nucleotide tag is added to a target nucleotide sequence. Nucleotide tags can be added, for example, by an encoding PCR in which the at least one primer comprises a target-specific portion and a nucleotide tag located on the 5 end of the target-specific portion, and a second primer that comprises only a target-specific portion or a target-specific portion and a nucleotide tag located on the 5 end of the target-specific portion. For illustrative examples of PCR protocols applicable to encoding PCR, see pending WO Application US03/37808 as well as U.S. Pat. No.6,605,451. Nucleotide tags can also be added by an encoding ligation reaction that can comprise a ligation reaction in which at least one primer comprises a target-specific portion and nucleotide tag located on the 5 end of the target-specific portion, and a second primer that comprises a target-specific portion only or a target-specific portion and a nucleotide tag located on the 5 end of the target specific portion. Illustrative encoding ligation reactions are described, for example, in U.S. Patent Publication No. 2005/0260640, which is hereby incorporated by reference in its entirety, and in particular for ligation reactions.
[0092] As used herein an encoding reaction produces a tagged target nucleotide sequence, which includes a nucleotide tag linked to a target nucleotide sequence.
[0093] As used herein with reference to a portion of a primer, the term target-specific nucleotide sequence refers to a sequence that can specifically anneal to a target nucleic acid or a target nucleotide sequence under suitable annealing conditions.
[0094] As used herein with reference to a portion of a primer, the term nucleotide tag-specific nucleotide sequence refers to a sequence that can specifically anneal to a nucleotide tag under suitable annealing conditions.
[0095] Amplification according to the present teachings encompasses any means by which at least a part of at least one target nucleic acid is reproduced, typically in a template-dependent manner, including without limitation, a broad range of techniques for amplifying nucleic acid sequences, either linearly or exponentially. Illustrative means for performing an amplifying step include ligase chain reaction (LCR), ligase detection reaction (LDR), ligation followed by Q-replicase amplification, PCR, primer extension, strand displacement amplification (SDA), hyperbranched strand displacement amplification, multiple displacement amplification (MDA), nucleic acid strand-based amplification (NASBA), two-step multiplexed amplifications, rolling circle amplification (RCA), and the like, including multiplex versions and combinations thereof, for example but not limited to, OLA/PCR, PCR/OLA, LDR/PCR, PCR/PCR/LDR, PCR/LDR, LCR/PCR, PCR/LCR (also known as combined chain reactionCCR), and the like. Descriptions of such techniques can be found in, among other sources, Ausbel et al.; PCR Primer: A Laboratory Manual, Diffenbach, Ed., Cold Spring Harbor Press (1995); The Electronic Protocol Book, Chang Bioscience (2002); Msuih et al., J. Clin. Micro. 34:501-07 (1996); The Nucleic Acid Protocols Handbook, R. Rapley, ed., Humana Press, Totowa, N.J. (2002); Abramson et al., Curr Opin Biotechnol. 1993 Feb.;4(1):41-7, U.S. Pat. No. 6,027,998; U.S. Pat. No. 6,605,451, Barany et al., PCT Publication No. WO 97/31256; Wenz et al., PCT Publication No. WO 01/92579; Day et al., Genomics, 29(1): 152-162 (1995), Ehrlich et al., Science 252:1643-50 (1991); Innis et al., PCR Protocols: A Guide to Methods and Applications, Academic Press (1990); Favis et al., Nature Biotechnology 18:561-64 (2000); and Rabenau et al., Infection 28:97-102 (2000); Belgrader, Barany, and Lubin, Development of a Multiplex Ligation Detection Reaction DNA Typing Assay, Sixth International Symposium on Human Identification, 1995 (available on the world wide web at: promega.com/geneticidproc/ussymp6proc/blegrad.html-); LCR Kit Instruction Manual, Cat. #200520, Rev. #050002, Stratagene, 2002; Barany, Proc. Natl. Acad. Sci. USA 88:188-93 (1991); Bi and Sambrook, Nucl. Acids Res. 25:2924-2951 (1997); Zirvi et al., Nucl. Acid Res. 27:e40i-viii (1999); Dean et al., Proc Natl Acad Sci USA 99:5261-66 (2002); Barany and Gelfand, Gene 109:1-11 (1991); Walker et al., Nucl. Acid Res. 20:1691-96 (1992); Polstra et al., BMC Inf. Dis. 2:18-(2002); Lage et al., Genome Res. 2003 Feb.;13(2):294-307, and Landegren et al., Science 241:1077-80 (1988), Demidov, V., Expert Rev Mol Diagn. 2002 Nov.;2(6):542-8., Cook et al., J Microbiol Methods. 2003 May;53(2):165-74, Schweitzer et al., Curr Opin Biotechnol. 2001 Feb.;12(1):21-7, U.S. Pat. No. 5,830,711, U.S. Pat. No. 6,027,889, U.S. Pat. No. 5,686,243, PCT Publication No. W00056927A3, and PCT Publication No. W09803673A1.
[0096] In some embodiments, amplification comprises at least one cycle of the sequential procedures of: annealing at least one primer with complementary or substantially complementary sequences in at least one target nucleic acid; synthesizing at least one strand of nucleotides in a template-dependent manner using a polymerase; and denaturing the newly-formed nucleic acid duplex to separate the strands. The cycle may or may not be repeated. Amplification can comprise thermocycling or can be performed isothermally.
[0097] The term qPCR is used herein to refer to quantitative real-time polymerase chain reaction (PCR), which is also known as real-time PCR or kinetic polymerase chain reaction.
[0098] A reagent refers broadly to any agent used in a reaction, other than the analyte (e.g., nucleic acid being analyzed). Illustrative reagents for a nucleic acid amplification reaction include, but are not limited to, buffer, metal ions, polymerase, reverse transcriptase, primers, template nucleic acid, nucleotides, labels, dyes, nucleases, and the like. Reagents for enzyme reactions include, for example, substrates, cofactors, buffer, metal ions, inhibitors, and activators.
[0099] The term universal detection probe is used herein to refer to any probe that identifies the presence of an amplification product, regardless of the identity of the target nucleotide sequence present in the product.
[0100] The term universal qPCR probe is used herein to refer to any such probe that identifies the presence of an amplification product during qPCR. In particular embodiments, nucleotide tags according to the invention can comprise a nucleotide sequence to which a detection probe, such as a universal qPCR probe binds. Where a tag is added to both ends of a target nucleotide sequence, each tag can, if desired, include a sequence recognized by a detection probe. The combination of such sequences can encode information about the identity or sample source of the tagged target nucleotide sequence. In other embodiments, one or more amplification primers can comprise a nucleotide sequence to which a detection probe, such as a universal qPCR probe binds. In this manner, one, two, or more probe binding sites can be added to an amplification product during the amplification step of the methods of the invention. Those of skill in the art recognize that the possibility of introducing multiple probe binding sites during preamplification (if carried out) and amplification facilitates multiplex detection, wherein two or more different amplification products can be detected in a given amplification mixture or aliquot thereof.
[0101] The term universal detection probe is also intended to encompass primers labeled with a detectable label (e.g., a fluorescent label), as well as non-sequence-specific probes, such as DNA binding dyes, including double-stranded DNA (dsDNA) dyes, such as SYBR Green.
[0102] The term target-specific qPCR probe is used herein to refer to a qPCR probe that identifies the presence of an amplification product during qPCR, based on hybridization of the qPCR probe to a target nucleotide sequence present in the product.
[0103] Hydrolysis probes are generally described in U.S. Pat. No. 5,210,015, which is incorporated herein by reference in its entirety for its description of hydrolysis probes. Hydrolysis probes take advantage of the 5-nuclease activity present in the thermostable Taq polymerase enzyme typically used in the PCR reaction (TaqMan probe technology, Applied Biosystems, Foster City Calif.). The hydrolysis probe is labeled with a fluorescent detector dye such as fluorescin, and an acceptor dye or quencher. In general, the fluorescent dye is covalently attached to the 5 end of the probe and the quencher is attached to the 3 end of the probe, and when the probe is intact, the fluorescence of the detector dye is quenched by fluorescence resonance energy transfer (FRET). The probe anneals downstream of one of the primers that defines one end of the target nucleic acid in a PCR reaction. Using the polymerase activity of the Taq enzyme, amplification of the target nucleic acid is directed by one primer that is upstream of the probe and a second primer that is downstream of the probe but anneals to the opposite strand of the target nucleic acid. As the upstream primer is extended, the Taq polymerase reaches the region where the labeled probe is annealed, recognizes the probe-template hybrid as a substrate, and hydrolyzes phosphodiester bonds of the probe. The hydrolysis reaction irrevocably releases the quenching effect of the quencher dye on the reporter dye, thus resulting in increasing detector fluorescence with each successive PCR cycle. In particular, hydrolysis probes suitable for use in the invention can be capable of detecting 8-mer or 9-mer motifs that are common in the human and other genomes and/or transcriptomes and can have a high T.sub.m of about 70 C. enabled by the use of linked nucleic acid (LNA) analogs.
[0104] The term label, as used herein, refers to any atom or molecule that can be used to provide a detectable and/or quantifiable signal. In particular, the label can be attached, directly or indirectly, to a nucleic acid or protein. Suitable labels that can be attached to probes include, but are not limited to, radioisotopes, fluorophores, chromophores, mass labels, electron dense particles, magnetic particles, spin labels, molecules that emit chemiluminescence, electrochemically active molecules, enzymes, cofactors, and enzyme substrates.
[0105] The term dye, as used herein, generally refers to any organic or inorganic molecule that absorbs electromagnetic radiation at a wavelength greater than or equal 340 nm.
[0106] The term fluorescent dye, as used herein, generally refers to any dye that emits electromagnetic radiation of longer wavelength by a fluorescent mechanism upon irradiation by a source of electromagnetic radiation, such as a lamp, a photodiode, or a laser.
[0107] The term elastomer has the general meaning used in the art. Thus, for example, Allcock et al. (Contemporary Polymer Chemistry, 2nd Ed.) describes elastomers in general as polymers existing at a temperature between their glass transition temperature and liquefaction temperature. Elastomeric materials exhibit elastic properties because the polymer chains readily undergo torsional motion to permit uncoiling of the backbone chains in response to a force, with the backbone chains recoiling to assume the prior shape in the absence of the force. In general, elastomers deform when force is applied, but then return to their original shape when the force is removed.
[0108] A polymorphic marker or polymorphic site is a locus at which nucleotide sequence divergence occurs. Illustrative markers have at least two alleles, each occurring at frequency of greater than 1%, and more typically greater than 10% or 20% of a selected population. A polymorphic site may be as small as one base pair. Polymorphic markers include restriction fragment length polymorphism (RFLPs), variable number of tandem repeats (VNTR's), hypervariable regions, minisatellites, dinucleotide repeats, trinucleotide repeats, tetranucleotide repeats, simple sequence repeats, deletions, and insertion elements such as Alu. The first identified allelic form is arbitrarily designated as the reference form and other allelic forms are designated as alternative or variant alleles. The allelic form occurring most frequently in a selected population is sometimes referred to as the wildtype form. Diploid organisms may be homozygous or heterozygous for allelic forms. A diallelic polymorphism has two forms. A triallelic polymorphism has three forms.
[0109] A single nucleotide polymorphism (SNP) occurs at a polymorphic site occupied by a single nucleotide, which is the site of variation between allelic sequences. The site is usually preceded by and followed by highly conserved sequences of the allele (e.g., sequences that vary in less than 1/100 or 1/1000 members of the populations). A SNP usually arises due to substitution of one nucleotide for another at the polymorphic site. A transition is the replacement of one purine by another purine or one pyrimidine by another pyrimidine. A transversion is the replacement of a purine by a pyrimidine or vice versa. SNPs can also arise from a deletion of a nucleotide or an insertion of a nucleotide relative to a reference allele.
Amplification Methods
[0110] In General
[0111] In particular embodiments, the invention provides an amplification method for introducing a plurality (e.g., at least three) of selected nucleotide sequences into one or more target nucleic acid(s). The method entails amplifying a plurality of target nucleic acids, typically, in a plurality of samples. In illustrative embodiments, the same set of target nucleic acids can be amplified in each of two or more different samples. The samples can differ from one another in any way, e.g., the samples can be from different tissues, subjects, environmental sources, etc. At least three primers can be used to amplify each target nucleic acid, namely: forward and reverse amplification primers, each primer including a target-specific portion and one or both primers including a nucleotide tag. The target-specific portions can specifically anneal to a target under suitable annealing conditions. The nucleotide tag for the forward primer can have a sequence that is the same as, or different from, the nucleotide tag for the reverse primer. Generally, the nucleotide tags are 5 of the target-specific portions. The third primer is a barcode primer comprising a barcode nucleotide sequence and a first and/or second nucleotide tag-specific portion. The barcode nucleotide sequence is a sequence selected to encode information about the amplicon produced when the barcode primer is employed in an amplification reaction. The tag-specific portion can specifically anneal to the one or both nucleotide tags in the forward and reverse primers. The barcode primer is generally 5 of the tag-specific portion.
[0112] The barcode primer is typically present in the amplification mixture in excess of the forward and/or reverse primer(s). More specifically, if the barcode primer anneals to the nucleotide tag in the forward primer, the barcode primer is generally present in excess of the forward primer. If the barcode primer anneals to the nucleotide tag in the reverse primer, the barcode primer is generally present in excess of the reverse primer. In each instance the third primer in the amplification mixture, i.e., the reverse primer or the forward primer, respectively, can be present, in illustrative embodiments, at a concentration approximately similar to that of the barcode primer. Generally the barcode primer is present in substantial excess. For example, the concentration of the barcode primer in the amplification mixtures can be at least 2-fold, at least 4-fold, at least 5-fold, at least 10-fold, at least 15-fold, at least 20-fold, at least 25-fold, at least 30-fold, at least 35-fold, at least 40-fold, at least 45-fold, at least 50-fold, at least 100-fold, at least 500-fold, at least 10.sup.3-fold, at least 510.sup.3-fold, at least 10.sup.4-fold, at least 510.sup.4-fold, at least 10.sup.5-fold, at least 510.sup.5-fold, at least 10.sup.6-fold, or higher, relative to the concentration of the forward and/or reverse primer(s). In addition, the concentration excess of the barcode primer can fall within any range having any of the above values as endpoints (e.g., 2-fold to 10.sup.5-fold). In illustrative embodiments, where the barcode primer has a tag-specific portion that is specific for the nucleotide tag on the forward primer, the forward primer can be present in picomolar to nanomolar concentrations, e.g., about 5 pM to 500 nM, about 5 pM to 100 nM, about 5 pM to 50 nM, about 5 pM to 10 nM, about 5 pM to 5 nM, about 10 pM to 1 nM, about 50 pM to about 500 pM, about 100 pM or any other range having any of these values as endpoints (e.g., 10 pM to 50 pM). Suitable, illustrative concentrations of barcode primer that could be used on combination with any of these concentrations of forward primer include about 10 nM to about 10 M, about 25 nM to about 7.5 M, about 50 nM to about 5 M, about 75 nM to about 2.5 M, about 100 nM to about 1 M, about 250 nM to about 750 nM, about 500 nM or any other range having any of these values as endpoints (e.g., 100 nM to 500 nM). In amplification reactions using such concentrations of forward and barcode primers, the reverse primer have a concentration on the same order as the barcode primer (e.g. within about 10-fold, within about 5-fold, or equal).
[0113] Each amplification mixture can be subjected to amplification to produce target amplicons comprising tagged target nucleotide sequences, each comprising first and second nucleotide tags flanking the target nucleotide sequence, and at least one barcode nucleotide sequence at the 5 or 3 end of the target amplicon (relative to one strand of the target amplicon). In certain embodiments, the first and second nucleotide tags and/or the barcode nucleotide sequence are selected so as to avoid substantial annealing to the target nucleic acids. In such embodiments, the tagged target nucleotide sequences can include molecules having the following elements: 5-(barcode nucleotide sequence)-(first nucleotide tag from the forward primer)-(target nucleotide sequence)-(second nucleotide tag sequence from the reverse primer)-3 or 5-(first nucleotide tag from the forward primer)-(target nucleotide sequence)-(second nucleotide tag sequence from the reverse primer)-(barcode nucleotide sequence)-3.
[0114] In illustrative embodiments, the barcode nucleotide sequence identifies a particular sample. Thus, for example, a set of T target nucleic acids can be amplified in each of S samples, where S and T are integers, typically greater than one. In such embodiments, amplification can be performed separately for each sample, wherein the same set of forward and reverse primers is used for each sample and the set of forward and reverse primers has at least one nucleotide tag that is common to all primers in the set. A different barcode primer can be used for each sample, wherein the bar code primers have different barcode nucleotide sequences, but the same tag-specific portion that can anneal to the common nucleotide tag. This embodiment has the advantage of reducing the number of different primers that would need to be synthesized to encode sample origin in amplicons produced for a plurality of target sequences. Alternatively, different sets of forward and reverse primers can be employed for each sample, wherein each set has a nucleotide tag that is different from the primers in the other set, and different barcode primers are used for each sample, wherein the barcode primers have different barcode nucleotide sequences and different tag-specific portions. In either case, the amplification produces a set of T amplicons from each sample that bear sample-specific barcodes.
[0115] In embodiments, wherein the same set of forward and reverse primers is used for each sample, the forward and reverse primers for each target can be initially combined separately from the sample, and each barcode primer can be initially combined with its corresponding sample. Aliquots of the initially combined forward and reverse primers can then be added to aliquots of the initially combined sample and barcode primer to produce ST amplification mixtures. These amplification mixtures can be formed in any article that can be subjected to conditions suitable for amplification. For example, the amplification mixtures can be formed in, or distributed into, separate compartments of a microfluidic device prior to amplification. Suitable microfluidic devices include, in illustrative embodiments, matrix-type microfluidic devices, such as those described below.
[0116] Any amplification method can be employed to produce amplicons from the amplification mixtures. In illustrative embodiments, PCR is employed. The amplification is generally carried out for at least three cycles to introduce the first and second nucleotide tags and the barcode nucleotide sequence. In various embodiments, amplification is carried out for 5, 10, 15, 20, 25, 30, 35, 40, 45, or 50 cycles, or for any number of cycles falling within a range having any of these values as endpoints (e.g. 5-10 cycles). In particular embodiments, amplification is carried out for a sufficient number of cycles to normalize target amplicon copy number across targets and across samples (e.g., 15, 20, 25, 30, 35, 40, 45, or 50 cycles, or for any number of cycles falling within a range having any of these values as endpoints).
[0117] Particular embodiments of the above-described method provide substantially uniform amplification, yielding a plurality of target amplicons wherein the majority of amplicons are present at a level relatively close to the average copy number calculated for the plurality of target amplicons. Thus, in various embodiments, at least 50, at least 55, at least 60, at least 65, at least 70, at least 75, at least 80, at least 85, at least 90, at least 91, at least 92, at least 93, at least 94, at least 95, at least 96, at least 97, at least 98, or at least 99 percent of the target amplicons are present at greater than 50 percent of the average number of copies of target amplicons and less than 2-fold the average number of copies of target amplicons.
[0118] The invention also provides, in certain embodiments, a method for amplifying a plurality of target nucleotides in which barcoding is, optionally, omitted and the target nucleotide sequences are tagged after amplification. More specifically, the invention provides a method for amplifying a plurality of target nucleic acids, typically, in a plurality of samples, that entails preparing an amplification mixture for each target nucleic acid. Each amplification mixture includes a forward primer including a target-specific sequence and a reverse primer including a target-specific sequence. The amplification mixtures are subjected to amplification to produce a plurality of target nucleotide sequences. The target nucleotide sequences are then tagged to produce a plurality of target amplicons, each including first and/or second nucleotide tags flanking the target nucleotide sequence. This method produces a plurality of target amplicons, wherein at least 50 percent of the target amplicons are present at greater than 50 percent of the average number of copies of target amplicons and less than 2-fold the average number of copies of target amplicons. In various embodiments of this method at least 55, at least 60, at least 65, at least 70, at least 75, at least 80, at least 85, at least 90, at least 91, at least 92, at least 93, at least 94, at least 95, at least 96, at least 97, at least 98, or at least 99 percent of the target amplicons are present at greater than 50 percent of the average number of copies of target amplicons and less than 2-fold the average number of copies of target amplicons.
[0119] In various embodiments, the target nucleotide sequence amplified can be, e.g., 25 bases, 50 bases, 100 bases, 200 bases, 500 bases, or 750 bases. In certain embodiments of the above-described methods, a long-range amplification method, such as long-range PCR can be employed to produce amplicons from the amplification mixtures. Long-range PCR permits the amplification of target nucleotide sequences ranging from one or a few kilobases (kb) to over 50 kb. In various embodiments, the target nucleotide sequences that are amplified by long-range PCR are at least about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 35, 40, 45, or 50 kb in length. Target nucleotide sequences can also fall within any range having any of these values as endpoints (e.g., 25 bases to 100 bases or 5-15 kb). The use of long-range PCR in the above-described methods can, in some embodiments, yield a plurality of target amplicons wherein at least 50, at least 55, at least 60, at least 65, or at least 70 percent of the target amplicons are present at greater than 50 percent of the average number of copies of target amplicons and less than 2-fold the average number of copies of target amplicons.
[0120] Long-range PCR is well known in the art. See, e.g., Cheng S, Fockler C, Barnes W M, Higuchi R (June 1994). Effective amplification of long targets from cloned inserts and human genomic DNA. Proc. Natl. Acad. Sci. U.S.A. 91 (12): 5695-9. Enzymes, protocols, and kits for long-range PCR that are suitable for use in the methods described here are commercially available; examples include: SequalPrep Long PCR Kit (Invitrogen, USA), PfuUltra II Fusion HS DNA polymerase (Stratagene), Phusion DNA polymerases, Phusion Flash High Fidelity PCR Master Mix (Finnzymes).
[0121] In certain embodiments, the target amplicons can be recovered from the amplification mixtures. For example, a matrix-type microfluidic device that is adapted to permit recovery of the contents of each reaction chamber (see below) can be employed for the amplification to generate the target amplicons. In variations of these embodiments, the target amplicons can be subjected to further amplification and/or analysis. For example, one or more target amplicon(s) can be subjected to amplification using primers specific for the first and second nucleotide tags to produce a target amplicon lacking the barcode nucleotide sequence. In certain embodiments, the amount of target amplicons produced in the amplification mixtures can be quantified during amplication, e.g., by quantitative real-time PCR, or after.
[0122] In particular embodiments, the above-described amplification methods are employed to produce amplicons suitable for automated DNA sequencing. In particular, the ability of the methods to provide substantially uniform amplification, as described above, of target nucleotide sequences is helpful in preparing DNA sequencing libraries having good coverage. In the context of automated DNA sequencing, the term coverage refers to the number of times the sequence is measured upon sequencing. A DNA sequencing library that has substantially uniform coverage can yield sequence data where the coverage is also substantially uniform. Thus, in various embodiments, upon performing automated sequencing of a plurality of target amplicons prepared as described herein, the sequences of at least 50 percent of the target amplicons are present at greater than 50 percent of the average number of copies of target amplicon sequences and less than 2-fold the average number of copies of target amplicon sequences. In various embodiments of this method at least 55, at least 60, at least 65, at least 70, at least 75, at least 80, at least 85, at least 90, at least 91, at least 92, at least 93, at least 94, at least 95, at least 96, at least 97, at least 98, or at least 99 percent of the target amplicon sequences are present at greater than 50 percent of the average number of copies of target amplicon sequences and less than 2-fold the average number of copies of target amplicon sequences.
[0123] Preparation of Nucleic Acids for DNA Sequencing
[0124] Many current DNA sequencing techniques rely on sequencing by synthesis. These techniques entail library creation, massively parallel PCR amplification of library molecules, and sequencing. Library creation starts with conversion of sample nucleic acids to appropriately sized fragments, ligation of adaptor sequences onto the ends of the fragments, and selection for molecules properly appended with adaptors. The presence of the adaptor sequences on the ends of the library molecules enables amplification of random-sequence inserts. The above-described methods for tagging nucleotide sequences can be substituted for ligation, to introduce adaptor sequences, as described in greater detail below.
[0125] In particular embodiments, the number of library DNA molecules produced in the massively parallel PCR step is low enough that the chance of two molecules associating with the same substrate, e.g. the same bead (in 454 DNA sequencing) or the same surface patch (in Solexa DNA sequencing) is low, but high enough so that the yield of amplified sequences is sufficient to provide a high throughput. As discussed further below, after suitable adaptor sequences are introduced, digital PCR can be employed to calibrate the number of library DNA molecules prior to sequencing by synthesis.
[0126] Addition of DNA Sequencing Primers to Nucleic Acids
[0127] The DNA to be sequenced can be any type of DNA. In particular embodiments, the DNA is genomic DNA from an organism. In variations of such embodiments, total genomic DNA obtained from a sample taken from an organism or from a DNA library is prepared for sequencing.
[0128] As described above, at least three primers are employed to prepare the DNA for sequencing: forward, reverse, and barcode primers. However, one or more of the forward primer, reverse primer, and barcode primer includes at least one additional primer binding site. In specific embodiments, the barcode primer includes at least a first additional primer binding site upstream of the barcode nucleotide sequence, which is upstream of the first nucleotide tag-specific portion. In certain embodiments, two of the forward primer, reverse primer, and barcode primer include at least one additional primer binding site (i.e, such that the amplicon produced upon amplification includes the nucleotide tag sequences, the barcode nucleotide sequence, and the two additional binding sites). For example, if the barcode primer includes a first additional primer binding site upstream of the barcode nucleotide sequence, in specific embodiments, the reverse primer can include at least a second additional primer binding site downstream of the second nucleotide tag. Amplification then yields a molecule having the following elements: 5-(first additional primer binding site)-(barcode nucleotide sequence)-(first nucleotide tag from the forward primer)-(target nucleotide sequence)-(second nucleotide tag from the reverse primer)-(second additional primer binding site)-3. In specific embodiments, the first and second additional primer binding sites are capable of being bound by DNA sequencing primers, to facilitate sequencing of the entire amplicon, including the barcode, which can, as discussed above, indicate sample origin.
[0129] In some embodiments, more than three primers can be employed to add desired elements to a target nucleotide sequence. For example, four primers can be employed to produce molecules having the same five elements discussed above, plus an optional additional barcode e.g., 5-(first additional primer binding site)-(barcode nucleotide sequence)-(first nucleotide tag from the forward primer)-(target nucleotide sequence)-(second nucleotide tag from the reverse primer)-(additional barcode nucleotide sequence)-(second additional primer binding site)-3. In an illustrative four-primer embodiment, the forward primer includes a target-specific portion and first nucleotide tag, and the reverse primer includes a target-specific portion and a second nucleotide tag. Together, these two primers constitute the inner primers. The remaining two primers are the outer primers, which anneal to the first and second nucleotide tags present in the inner primers. One outer primer is the barcode primer, which can contain at least a first additional primer binding site upstream of the barcode nucleotide sequence, which is upstream of the first nucleotide tag-specific portion (i.e., the same barcode primer discussed in the previous paragraph). The second outer primer can include a second tag-specific portion, an additional barcode nucleotide sequence and, downstream of this, a second additional primer binding site.
[0130] Amplification to incorporate elements from more than three primers can be carried out in one or multiple amplification reactions. For example, a four-primer amplification can be carried out in one amplification reaction, in which all four primers are present. Alternatively, a four-primer amplification can be carried out, e.g., in two amplification reactions: one to incorporate the inner primers and a separate amplification reaction to incorporate the outer primers. Where all four primers are present in one amplification reaction, the outer primers are generally present in the reaction mixture in excess. The relative concentration values give above for the barcode primer relative to the forward and/or reverse primers also apply to the relative concentrations of the outer primers relative to inner primers in a one-step, four-primer amplification reaction.
[0131] In an illustrative embodiment of the four-primer amplification reaction, each of the outer primers contains a unique barcode. For example, one barcode primer would be constructed of the elements 5-(first additional primer binding site)-(first barcode nucleotide sequence)-(first nucleotide tag)-3, and the second barcode primer would be constructed of the elements 5-(second additional primer binding site)-(second barcode nucleotide sequence)-(second nucleotide tag)-3. In this embodiment, a number (J) of first barcode primers can be combined with a number (K) of second barcode primers to create JK unique amplification products.
[0132] In a further illustrative embodiment of the invention, more than four primers can be combined in a single reaction to append different combinations of additional primer binding sites, barcode sequences, and nucleotide tags. For example, outer barcode primers containing the following elements: 5-(first additional primer binding site)-(first barcode nucleotide sequence)-(first nucleotide tag)-3, 5-(first additional primer binding site)-(first barcode nucleotide sequence)-(second nucleotide tag)-3, 5-(second additional primer binding site)-(first barcode nucleotide sequence)-(first nucleotide tag)-3, 5-(second additional primer binding site)-(first barcode nucleotide sequence)-(second nucleotide tag)-3, can be combined with inner target-specific primers as described above to produce amplification product pools containing all combinations of the barcode primers with the desired amplicon sequence.
[0133] In other illustrative embodiments of the invention, outer barcode primers in any of the combinations described above, or other combinations that would be obvious to one of skill in the art, can be combined with more than one pair of target primer sequences bearing the same first and second nucleotide tag sequences. For example, inner primers containing up to ten different target-specific forward primer sequences combined with the same first nucleotide tag and up to ten different target-specific reverse primer sequences combined with the same second nucleotide tag can be combined with the up to 2 or up to 4 outer barcode primers to generate multiple amplification products as described above. In various embodiments, at least 10, at least 20, at least 50, at least 100, at least 200, at least 500, at least 1000, at least 2000, at least 5000 or at least 10000 different target-specific primer pairs bearing the same first nucleotide tag and second nucleotide tag would be combined with the up to 2 or up to 4 outer barcode primers to generate multiple amplification products.
[0134] The methods of the invention can include subjecting at least one target amplicon to DNA sequencing using any available DNA sequencing method. In particular embodiments, a plurality of target amplicons is sequenced using a high throughput sequencing method. Such methods typically use an in vitro cloning step to amplify individual DNA molecules. Emulsion PCR (emPCR) isolates individual DNA molecules along with primer-coated beads in aqueous droplets within an oil phase. PCR produces copies of the DNA molecule, which bind to primers on the bead, followed by immobilization for later sequencing. emPCR is used in the methods by Marguilis et al. (commercialized by 454 Life Sciences, Branford, Conn.), Shendure and Porreca et al. (also known as polony sequencing) and SOLiD sequencing, (Applied Biosystems Inc., Foster City, Calif.). See M. Margulies, et al. (2005) Genome sequencing in microfabricated high-density picolitre reactors Nature 437: 376-380; J. Shendure, et al. (2005) Accurate Multiplex Polony Sequencing of an Evolved Bacterial Genome Science 309 (5741): 1728-1732. In vitro clonal amplification can also be carried out by bridge PCR, where fragments are amplified upon primers attached to a solid surface. Braslaysky et al. developed a single-molecule method (commercialized by Helicos Biosciences Corp., Cambridge, Mass.) that omits this amplification step, directly fixing DNA molecules to a surface. I. Braslaysky, et al. (2003) Sequence information can be obtained from single DNA molecules Proceedings of the National Academy of Sciences of the United States of America 100: 3960-3964.
[0135] DNA molecules that are physically bound to a surface can be sequenced in parallel. Sequencing by synthesis, like dye-termination electrophoretic sequencing, uses a DNA polymerase to determine the base sequence. Reversible terminator methods (commercialized by Illumina, Inc., San Diego, Calif. and Helicos Biosciences Corp., Cambridge, Mass.) use reversible versions of dye-terminators, adding one nucleotide at a time, and detect fluorescence at each position in real time, by repeated removal of the blocking group to allow polymerization of another nucleotide. Pyrosequencing also uses DNA polymerization, adding one nucleotide at a time and detecting and quantifying the number of nucleotides added to a given location through the light emitted by the release of attached pyrophosphates (commercialized by 454 Life Sciences, Branford, Conn.). See M. Ronaghi, et al. (1996). Real-time DNA sequencing using detection of pyrophosphate release Analytical Biochemistry 242: 84-89.
[0136] Sample Preparation by Digital PCR
[0137] In some embodiments, samples are loaded into an amplification device, for example, a PCR plate or a microfluidic device, at sample concentrations containing on average less than one amplification template per well or chamber. Each well or chamber in the device is prepared such that it contains suitable tagged target-specific primers and a unique combination of forward and reverse barcode primers. For example, one well can contain barcode primers containing the elements 5-(first additional primer binding site)-(first barcode sequence)-(first nucleotide tag)-3, 5-(second additional primer binding site)-(second barcode sequence)-(second nucleotide tag)-3. A second well or chamber can contain barcode primers containing the elements 5-(first additional primer binding site)-(third barcode sequence)-(first nucleotide tag)-3, 5-(second additional primer binding site)-(fourth barcode sequence)-(second nucleotide tag)-3. Amplification products produced in each well would be labeled uniquely with the combinations of barcode sequences loaded into these wells.
[0138] Sample Calibration by Digital PCR
[0139] In particular embodiments, the number of target amplicons produced, e.g. from a DNA library, using the above-described methods can be calibrated using a digital amplification method. The step is finds particular application in preparing DNA for sequencing by synthesis. For discussions of digital PCR see, for example, Vogelstein and Kinzler, 1999, Proc Natl Acad Sci USA 96:9236-41; McBride et al., U.S. Patent Application Publication No. 20050252773, especially Example 5 (each of these publications are hereby incorporated by reference in their entirety, and in particular for their disclosures of digital amplification). Digital amplification methods can make use of certain-high-throughput devices suitable for digital PCR, such as microfluidic devices typically including a large number and/or high density of small-volume reaction sites (e.g., nano-volume reaction sites or reaction chambers). In illustrative embodiments, digital amplification is performed using a microfluidic device, such as the Digital Array microfluidic devices described below. Digital amplification can entail distributing or partitioning a sample among hundreds to thousands of reaction mixtures disposed in a reaction/assay platform or microfluidic device. In such embodiments, a limiting dilution of the sample is made across a large number of separate amplification reactions such that most of the reactions have no template molecules and give a negative amplification result. In counting the number of positive amplification results, e.g, at the reaction endpoint, one is counting the individual template molecules present in the input sample one-by-one. A major advantage of digital amplification is that the quantification is independent of variations in the amplification efficiencysuccessful amplifications are counted as one molecule, independent of the actual amount of product.
[0140] In certain embodiments, digital amplification can be carried out after preamplification of sample nucleic acids. Typically, preamplification prior to digital amplification is performed for a limited number of thermal cycles (e.g., 5 cycles, or 10 cycles). In certain embodiments, the number of thermal cycles during preamplification can range from about 4 to 15 thermal cycles, or about 4-10 thermal cycles. In certain embodiments the number of thermal cycles can be 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, or more than 15. The above-described amplification to produce adaptor sequence-containing amplicons for DNA sequencing can be substituted for the typical preamplification step.
[0141] Digital amplication methods are described in U.S. Publication No. 20090239308, published Sep. 24, 2009, which is hereby incorporated by reference in its entirety and, in particular, for its disclosure of digital amplification methods and devices. Generally, in digital amplification, identical (or substantially similar) amplification reactions are run on a nucleic acid sample, such as genomic DNA. The number of individual reactions for a given nucleic acid sample may vary from about 2 to over 1,000,000. Typically, the number of reactions performed on a sample is about 100 or greater, more typically about 200 or greater, and even more typically about 300 or greater. Larger scale digital amplification can also be performed in which the number of reactions performed on a sample is about 500 or greater, about 700 or greater, about 765 or greater, about 1,000 or greater, about 2,500 or greater, about 5,000 or greater, about 7,500 or greater, or about 10,000or greater. The number of reactions performed may also be significantly higher, such up to about 25,000, up to about 50,000, up to about 75,000, up to about 100,000, up to about 250,000, up to about 500,000, up to about 750,000, up to about 1,000,000, or even greater than 1,000,000 assays per genomic sample.
[0142] In particular embodiments, the quantity of nucleic acid subjected to digital amplification is generally selected such that, when distributed into discrete reaction mixtures, each individual amplification reaction is expected to include one or fewer amplifiable nucleic acids. One of skill in the art can determine the concentration of target amplicon(s) produced as described above and calculate an appropriate amount for use in digital amplification. More conveniently, a set of serial dilutions of the target amplicon(s) can be tested. For example, a device that is commercially available from Fluidigm Corp. as the 12.765 Digital Array microfluidic device allows 12 different dilutions to be tested simultaneously. Optionally, a suitable dilution can be determined by generating a linear regression plot. For the optimal dilution, the line should be straight and pass through the origin. Subsequently the concentration of the original samples can be calculated from the plot.
[0143] The appropriate quantity of target amplicon(s) can be distributed into discrete locations or reaction wells or chambers such that each reaction includes, for example, an average of no more than about one amplicon per volume. The target amplicon(s) can be combined with reagents selected for quantitative or nonquantitative amplification, prior to distribution or after.
[0144] Following distribution, the reaction mixtures are subjected to amplification to identify those reaction mixtures that contained a target amplicon. Any amplification method can be employed, but conveniently, PCR is used, e.g., real-time PCR or endpoint PCR. This amplification can employ any primers capable of amplifying the target amplicon(s). Thus, in particular embodiments, the primers can be DNA sequencing primers that anneal to the primer binding sites introduced in the previous amplification step.
[0145] The concentration of any target amplicon (copies/L) is correlated with the number of positive (i.e., amplification product-containing) reaction mixtures. See copending U.S. application Ser. No. 12/170,414, entitled Method and Apparatus for Determining Copy Number Variation Using Digital PCR, which is incorporated by reference for all purposes, and, in particular, for analysis of digital PCR results. Also see Dube et al., 2008, Mathematical Analysis of Copy Number Variation in a DNA Sample Using Digital PCR on a Nanofluidic Device PLoS ONE 3(8): e2876. doi:10.1371/journal.pone.0002876, which is incorporated by reference for all purposes and, in particular, for analysis of digital PCR results.
[0146] In an illustrative embodiment of sample calibration for DNA sequencing by digital PCR, a PCR reaction mix containing roughly 100-360 amplicons per l can be loaded onto a Digital Array microfluidic device, such as Fluidigm Corporation's (South San Francisco, Calif.) 12.765 Digital Array microfluidic device, described below. The microfluidic chip has 12 panels and each panel contains 765 chambers. Replicate panels on the digital chip can be assayed in order to obtain absolute quantification of the initial concentration of library. The diluted samples having typical relative coefficients of variation (between replicates) within 9-12% (or lower) can be used for sequencing. See. e.g., White III RA, Blainey PC, Fan CH, Quake SR. Digital PCR provides sensitive and absolute calibration for high throughput sequencing BMC Genomics 10:116 doi:10.1186/1471-2164-10-116.
Sample Nucleic Acids
[0147] Preparations of nucleic acids (samples) can be obtained from biological sources and prepared using conventional methods known in the art. In particular, DNA or RNA useful in the methods described herein can be extracted and/or amplified from any source, including bacteria, protozoa, fungi, viruses, organelles, as well higher organisms such as plants or animals, particularly mammals, and more particularly humans. Suitable nucleic acids can also be obtained from environmental sources (e.g., pond water), from man-made products (e.g., food), from forensic samples, and the like. Nucleic acids can be extracted or amplified from cells, bodily fluids (e.g., blood, a blood fraction, urine, etc.), or tissue samples by any of a variety of standard techniques. Illustrative samples include samples of plasma, serum, spinal fluid, lymph fluid, peritoneal fluid, pleural fluid, oral fluid, and external sections of the skin; samples from the respiratory, intestinal genital, and urinary tracts; samples of tears, saliva, blood cells, stem cells, or tumors. For example, samples of fetal DNA can be obtained from an embryo or from maternal blood. Samples can be obtained from live or dead organisms or from in vitro cultures. Illustrative samples can include single cells, paraffin-embedded tissue samples, and needle biopsies. Nucleic acids useful in the invention can also be derived from one or more nucleic acid libraries, including cDNA, cosmid, YAC, BAC, P1, PAC libraries, and the like.
[0148] Nucleic acids of interest can be isolated using methods well known in the art, with the choice of a specific method depending on the source, the nature of nucleic acid, and similar factors. The sample nucleic acids need not be in pure form, but are typically sufficiently pure to allow the amplification steps of the methods of the invention to be performed. Where the target nucleic acids are RNA, the RNA can be reversed transcribed into cDNA by standard methods known in the art and as described in Sambrook, J., Fritsch, E. F., and Maniatis, T., Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press, N.Y., Vol. 1, 2, 3 (1989), for example. The cDNA can then be analyzed according to the methods of the invention.
Target Nucleic Acids
[0149] Any target nucleic acid that can be tagged in an encoding reaction of the invention (described herein) can be detected using the methods of the invention. In typical embodiments, at least some nucleotide sequence information will be known for the target nucleic acids. For example, if the encoding reaction employed is PCR, sufficient sequence information is generally available for each end of a given target nucleic acid to permit design of suitable amplification primers. In an alternative embodiment, the target-specific sequences in primers could be replaced by random or degenerate nucleotide sequences.
[0150] The targets can include, for example, nucleic acids associated with pathogens, such as viruses, bacteria, protozoa, or fungi; RNAs, e.g., those for which over- or under-expression is indicative of disease, those that are expressed in a tissue- or developmental-specific manner; or those that are induced by particular stimuli; genomic DNA, which can be analyzed for specific polymorphisms (such as SNPs), alleles, or haplotypes, e.g., in genotyping. Of particular interest are genomic DNAs that are altered (e.g., amplified, deleted, and/or mutated) in genetic diseases or other pathologies; sequences that are associated with desirable or undesirable traits; and/or sequences that uniquely identify an individual (e.g., in forensic or paternity determinations).
Primer Design
[0151] Primers suitable for nucleic acid amplification are sufficiently long to prime the synthesis of extension products in the presence of the agent for polymerization. The exact length and composition of the primer will depend on many factors, including, for example, temperature of the annealing reaction, source and composition of the primer, and where a probe is employed, proximity of the probe annealing site to the primer annealing site and ratio of primer:probe concentration. For example, depending on the complexity of the target nucleic acid sequence, an oligonucleotide primer typically contains in the range of about 15 to about 30 nucleotides, although it may contain more or fewer nucleotides. The primers should be sufficiently complementary to selectively anneal to their respective strands and form stable duplexes. One skilled in the art knows how to select appropriate primer pairs to amplify the target nucleic acid of interest.
[0152] For example, PCR primers can be designed by using any commercially available software or open source software, such as Primer3 (see, e.g., Rozen and Skaletsky (2000) Meth. Mol. Biol., 132: 365-386; www.broad.mit.edu/node/1060, and the like) or by accessing the Roche UPL website. The amplicon sequences are input into the Primer3 program with the UPL probe sequences in brackets to ensure that the Primer3 program will design primers on either side of the bracketed probe sequence.
[0153] Primers may be prepared by any suitable method, including, for example, cloning and restriction of appropriate sequences or direct chemical synthesis by methods such as the phosphotriester method of Narang et al. (1979) Meth. Enzymol. 68: 90-99; the phosphodiester method of Brown et al. (1979) Meth. Enzymol. 68: 109-151; the diethylphosphoramidite method of Beaucage et al. (1981) Tetra. Lett., 22: 1859-1862; the solid support method of U.S. Pat. No. 4,458,066 and the like, or can be provided from a commercial source.
[0154] Primers may be purified by using a Sephadex column (Amersham Biosciences, Inc., Piscataway, N.J.) or other methods known to those skilled in the art. Primer purification may improve the sensitivity of the methods of the invention.
Microfluidic Devices
[0155] In certain embodiments, any of the methods of the invention can be carried out using a microfluidic device. In illustrative embodiments, the device is a matrix-type microfluidic device is one that allows the simultaneous combination of a plurality of substrate solutions with reagent solutions in separate isolated reaction chambers. It will be recognized, that a substrate solution can comprise one or a plurality of substrates and a reagent solution can comprise one or a plurality of reagents. For example, the microfluidic device can allow the simultaneous pair-wise combination of a plurality of different amplification primers and samples. In certain embodiments, the device is configured to contain a different combination of primers and samples in each of the different chambers. In various embodiments, the number of separate reaction chambers can be greater than 50, usually greater than 100, more often greater than 500, even more often greater than 1000, and sometimes greater than 5000, or greater than 10,000.
[0156] In particular embodiments, the matrix-type microfluidic device is a Dynamic Array (DA) microfluidic device, an example of which is shown in
[0157] U.S. Patent Publication No. 2008/0223721 and PCT Publication No. WO 05/107938A2 describe illustrative matrix-type devices that can be used to practice the methods described herein.
[0158] Although the DA microfluidic devices described above in WO 05/107938 are well suited for conducting the methods described herein, the invention is not limited to any particular device or design. Any device that partitions a sample and/or allows independent pair-wise combinations of reagents and sample may be used. U.S. Patent Publication No. 20080108063 (which is hereby incorporated by reference it its entirety) includes a diagram illustrating the 48.48 Dynamic Array IFC (Integrated Fluidic Circuit), a commercially available device available from Fluidigm Corp. (South San Francisco Calif.). It will be understood that other configurations are possible and contemplated such as, for example, 4896; 9696; 30120; etc.
[0159] In specific embodiments, the microfluidic device can be a Digital Array microfluidic device, which is adapted to perform digital amplification. Such devices can have integrated channels and valves that partition mixtures of sample and reagents into nanolitre volume reaction chambers. In some embodiments, the Digital Array microfluidic device is fabricated, at least in part, from an elastomer. Illustrative Digital Array microfluidic devices are described in copending U.S. Applications owned by Fluidigm, Inc., such as U.S. application Ser. No. 12/170,414, entitled Method and Apparatus for Determining Copy Number Variation Using Digital PCR. One illustrative embodiment has 12 input ports corresponding to 12 separate sample inputs to the device. The device can have 12 panels, and each of the 12 panels can contain 765 6 nL reaction chambers with a total volume of 4.59 L per panel. Microfluidic channels can connect the various reaction chambers on the panels to fluid sources. Pressure can be applied to an accumulator in order to open and close valves connecting the reaction chambers to fluid sources. In illustrative embodiments, 12 inlets can be provided for loading of the sample reagent mixture. 48 inlets can be used to provide a source for reagents, which are supplied to the biochip when pressure is applied to accumulator. Additionally, two or more inlets can be provided to provide hydration to the biochip. Hydration inlets are in fluid communication with the device to facilitate the control of humidity associated with the reaction chambers. As will be understood to one of skill in the art, some elastomeric materials that can utilized in the fabrication of the device are gas permeable, allowing evaporated gases or vapor from the reaction chambers to pass through the elastomeric material into the surrounding atmosphere. In a particular embodiment, fluid lines located at peripheral portions of the device provide a shield of hydration liquid, for example, a buffer or master mix, at peripheral portions of the biochip surrounding the panels of reaction chambers, thus reducing or preventing evaporation of liquids present in the reaction chambers. Thus, humidity at peripheral portions of the device can be increased by adding a volatile liquid, for example water, to hydration inlets. In a specific embodiment, a first inlet is in fluid communication with the hydration fluid lines surrounding the panels on a first side of the biochip and the second inlet is in fluid communication with the hydration fluid lines surrounding the panels on the other side of the biochip.
[0160] While the Digital Array microfluidic devices are well-suited for carrying out the digital amplification methods described herein, one of ordinary skill in the art would recognize many variations and alternatives to these devices. The microfluidic device which is the 12.765 Dynamic Array commercially available from Fluidigm Corp. (South San Francisco, Calif.), includes 12 panels, each having 765 reaction chambers with a volume of 6 nL per reaction chamber. However, this geometry is not required for the digital amplification methods described herein. The geometry of a given Digital Array microfluidic device will depend on the particular application. Additional description related to devices suitable for use in the methods described herein is provided in U.S. Patent Application Publication No. 2005/0252773, incorporated herein by reference for its disclosure of Digital Array microfluidic devices.
[0161] In certain embodiments, the methods described herein can be performed using a microfluidic device that provides for recovery of reaction products. Such devices are described in detail in copending U.S. Application No. 61/166,105, filed Apr. 2, 2009, which is hereby incorporated by reference in its entirety and specifically for its description of microfluidic devices that permit reaction product recovery and related methods. For example, the digital PCR method for calibrating DNA samples prior to sequencing can be preformed on such devices, permitting recovery of amplification products, which can then serve as templates for DNA sequencing.
[0162]
[0163] The carrier 100 also includes four sources 130, 132, 134, and 136, which may be used to actuate control lines present in the microfluidic device. In an embodiment, sources 130, 132, and 134 are used to pressurize control lines operable to open and close valves present in the microfluidic device. For example, application of pressure greater than atmospheric pressure to source 132 will result in the liquid present in source 132 flowing into control lines present on the microfluidic device, thereby actuating valves operable to obstruct flow through one or more fluid input lines also present on the microfluidic device. In an embodiment, source 130 is used as a fluid well containing harvesting reagent. Pressure can be applied to source 130, forcing the harvesting reagent to flow through fluid lines provided on the carrier to fluid lines provided on the microfluidic device. Thus, application of pressure to source 130 can result in the flow of a harvesting reagent or other suitable fluid through the microfluidic device. The control lines that are in fluid communication with the sources 130-136 can include control lines for interface valves, containment valves, valves used in dilation pumping, fluid lines for the flow of harvesting reagent, or the like. In a particular embodiment, valve 1 is controlled by source 132, valve 2 is controlled by source 134, harvesting reagent is provided in source 130, and hydration reagent is provided in source 136. In this particular embodiment, the interface valves are controlled by source 150 and containment valves are controlled by source 152. This particular embodiment is not intended to limit the present invention, but merely to provide an example of one configuration. Other configurations can be utilized as appropriate to the particular application.
[0164] As described more fully in relation to
[0165] Pressure accumulators 150 and 152 may be utilized to pressurize other control lines, provide for hydration of the microfluidic device, or they may not be used in some embodiments. Although 48 sample input ports and 48 assay input ports are shown in the embodiment of the present invention illustrated in
[0166]
[0167] As described more fully throughout the present specification, after samples and assays are mixed and reacted, the reaction products can be recovered from the microfluidic device by flowing a recovery fluid through the fluid input lines 212, through the array portion 230 of the microfluidic device as illustrated in
[0168] The particular number of sample and assay input lines illustrated in
[0169]
[0170] Utilizing interface valves 330 and containment valves 340, each of the sample chambers can be isolated from each of the other sample chambers as well as the assay chambers. The assay chambers can be isolated from the other assay chambers using the containment valves. Both the isolation and containment valves are actuated by application of pressure to a corresponding control line present on the carrier or by other means, for example, electrostatic actuation.
[0171]
[0172] Opening of the interface valves 330 enables the samples and the assays to mix in pairwise combinations via free interface diffusion. After the samples and assays are mixed, thermocycling can be performed to form reaction products. Reaction products are recovered from the microfluidic device by opening harvest valves 350, which enable the reaction products to flow into portions 360 of the sample input lines adjacent the sample chambers. Using sample input lines 316 and on-chip pumps (not shown), reaction products flow through the sample input lines toward recovery ports on the carrier.
[0173] In the embodiment illustrated in
[0174] A benefit provided by the systems described herein is that the volume of samples and assays used in the reactions are fixed, regardless of the pipetting volume dispensed into the sample input and the assay input ports. If the volume in the sample and/or assay input ports is above a predetermined threshold sufficient to fill the sample/assay input lines and the sample/assay chambers, then application of pressure to the sample/assay input ports will result in complete filling of the sample/assay chambers. The completely filled chambers thus provide a fixed reaction volume not available with conventional microtiter plate techniques.
[0175] Although systems have been developed by the present assignee to perform many simultaneous binding assays, including, but not limited to immunological experiments such as ELISA assays, embodiments of the present invention provide for dilation pumping on-chip as well as separate sample and assay chambers. Thus, pairwise combinations of samples and assays are possible using embodiments described herein that are not possible with previously developed techniques. Additional description of binding assays is provided in U.S. Patent Application Publication No. 2007/0074972, filed on Sep. 13, 2006, the disclosure of which is hereby incorporated by reference in its entirety for all purposes.
[0176] Embodiments of the present invention provide a system suitable for PCR sample preparation that features reduced cost, time, and labor in the preparation of amplicon libraries from an input DNA template. In a typical use case, the first amplification will be used to generate libraries for next-generation sequencing. Utilizing embodiments of the present invention, samples and encoded primers are combined with amplicon-specific (AS) primers to create a mixture that is suitable for desired reactions. Based on an MN architecture of the microfluidic device, each of the M samples is combined with each of the N AS primers (i.e., assays) to form MN pairwise combinations. That is, one reaction site is provided for each sample and assay pair. After the completion of the reaction (e.g., PCR), the reaction products are recovered from the system, typically using a harvest reagent that flows through the microfluidic device. In a specific embodiment, reaction products associated with each sample are recovered in a separate reaction pool, enabling further processing or study of the pool containing a given sample reacted with each of the various assays.
[0177] Thus, in embodiments described herein, a microfluidic device is provided in which independent sample inputs are combined with primer inputs in an MN array configuration. Thus, each reaction is a unique combination of a particular sample and a particular primer. As described more fully throughout the present specification, samples are loaded into sample chambers in the microfluidic device through sample input lines arranged as columns in one implementation. AS primers or assays are loaded into assay chambers in the microfluidic device through assay input lines arranged as rows crossing the columns. The sample chambers and the assay chambers are in fluidic isolation during loading. After the loading process is completed, an interface valve operable to obstruct a fluid line passing between pairs of sample and assay chambers is opened to enable free interface diffusion of the pairwise combinations of samples and assays. Precise mixture of the samples and assays enables reactions to occur between the various pairwise combinations, producing a reaction product including a set of specific PCR reactions for which each sample has been effectively coded with a unique barcode. The reaction products are harvested and can then be used for subsequent sequencing processes. The terms assay and sample as used herein are descriptive of particular uses of the devices in some embodiments. However, the uses of the devices are not limited to the use of sample(s) and assay(s) in all embodiments. For example, in other embodiments, sample(s) may refer to a first reagent or a plurality of first reagents and assay(s) may refer to a second reagent or a plurality of second reagents. The MN character of the devices enable the combination of any set of first reagents to be combined with any set of second reagents.
[0178] According to one particular process implemented using an embodiment of the present invention, after 25 cycles of PCR, the reaction products from the MN pairwise combinations will be recovered from the microfluidic device in discrete pools, one for each of the M samples. Typically, the discrete pools are contained in a sample input port provided on the carrier. In some processes, the reaction products may be harvested on a per amplicon basis for purposes of normalization. Utilizing embodiments of the present invention, it is possible to achieve results (for replicate experiments assembled from the same input solutions of samples and assays) for which the copy number of amplification products varies by no more than 25% within a sample and no more than 25% between samples. Thus, the amplification products recovered from the microfluidic device will be representative of the input samples as measured by the distribution of specific known genotypes. Preferably, output sample concentration will be greater than 2,000 copies/amplicon/microliter and recovery of reaction products will be performed in less than two hours.
[0179] Applications in which embodiments of the present invention can be used include sequencer-ready amplicon preparation and long-range PCR amplicon library production. For the sequencer-ready amplicon preparation, multiple-forward primer and 3-primer combination protocols can be utilized.
[0180]
[0181]
[0182] As discussed in relation to
[0183] In an embodiment, even assay input lines 420 and odd assay input lines 422 are in fluid communication with vias 424 that are aligned with assay input lines 140, which are in fluid communication with assay input ports 122 as illustrated in
[0184] As discussed above, the various fluid lines can be integrated into the carrier, the microfluidic device, or other suitable structure. In a 48 sample48 assay array configuration, the 24 even assay input lines 420 will provide inputs to half of the rows of assay input lines 518 shown in
[0185] The harvesting reagent input line 430 provides for harvesting reagent used in recovering reaction products from the microfluidic device. The harvesting reagent input line 430 illustrated in
[0186] As described in relation to
[0187] By applying pressure to the bank of sample input ports 120 and the bank of assay input ports 122, samples and reagents can be loaded through the illustrated sample and assay input lines into sample and assay chambers present in the microfluidic device. By applying pressure to the harvesting reagent input port 136, the reaction products can be recovered from the sample chambers and delivered to the sample input ports. Valves present in the microfluidic device are utilized to control the flow of samples, assays, and reaction products, as described more fully throughout the present specification.
[0188]
[0189]
[0190] As described more fully throughout the present specification, reaction products are recovered from the microfluidic device using the sample input lines 516 and pumps (not shown). Containment valves 540 provide for containment between the various sample and assay chambers in each row. Utilizing the interface valves 530 and the containment valves 540, each of the sample chambers can be isolated from each of the other sample chambers as well as the assay chambers. The assay chambers can be isolated from the other assay chambers using the containment valves. Both the isolation and containment valves are actuated by application of pressure to a corresponding control line in fluid communication with sources 130-134 or by other means, for example, electrostatic actuation.
[0191] In
[0192] Embodiments of the present invention provide unit cells with dimensions on the order of several hundred microns, for example unit cells with dimension of 500500 m, 525525 m, 550550 m, 575575 m, 600600 m, 625625 m, 650650 m, 675675, m, 700700 m, or the like. The dimensions of the sample chambers and the assay chambers are selected to provide amounts of materials sufficient for desired processes while reducing sample and assay usage. As examples, sample chambers can have dimensions on the order of 100-400 m in width200-600 m in length100-500 m in height. For example, the width can be 100 m, 125 m, 150 m, 175 m, 200 m, 225 m, 250 m, 275 m, 300 m, 325 m, 350 m, 375 m, 400 m, or the like. For example, the length can be 200 m, 225 m, 250 m, 275 m, 300 m, 325 m, 350 m, 375 m, 400 m, 425 m, 450 m, 475 m, 500 m, 525 m, 550 m, 575 m, 600 m, or the like. For example, the height can be 100 m, 125 m, 150 m, 175 m, 200 m, 225 m, 250 m, 275 m, 300 m, 325 m, 350 m, 375 m, 400 m, 425 m, 450 m, 475 m, 500 m, 525 m, 550 m, 575 m, 600 m, or the like. Assay chambers can have similar dimensional ranges, typically providing similar steps sizes over smaller ranges than the smaller chamber volumes. In some embodiments, the ratio of the sample chamber volume to the assay chamber volume is about 5:1, 10:1, 15:1, 20:1, 25:1, or 30:1. Smaller chamber volumes than the listed ranges are included within the scope of the invention and are readily fabricated using microfluidic device fabrication techniques.
[0193] Higher density microfluidic devices will typically utilize smaller chamber volumes in order to reduce the footprint of the unit cells. In applications for which very small sample sizes are available, reduced chamber volumes will facilitate testing of such small samples.
[0194] The dimensions of the interface valves 530 are selected to provide for complete obstruction of the fluid lines 514 connecting the sample and assay chambers. In some embodiments, the valve dimensions range from about 10-200 m10-200 m, for example, 5050 m, 5065 m, 5080 m, 50100 m, 6550 m, 6565 m, 6580 m, 65100 m, 8050 m, 8065 m, 8080 m, 80100 m, 10050 m, 10065 m, 10080 m, 100100 m, or the like. The sample input lines may have various widths depending on the number of sample input lines and the sample chamber volumes, and desired flow rates for loading and product recovery. As examples, the sample input lines may have a cross-section of 1-20 m in height and 50-100 m in width. For example, the sample input lines may have heights of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 m and widths of 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, or 100 m.
[0195] Other device parameters, including layer to layer alignment, ranging from 20-100 m, and via size, ranging from 50-200 microns, are selected to provide desired system performance characteristics. One of ordinary skill in the art would recognize many variations, modifications, and alternatives.
[0196] In some embodiments, an extra assay inlet is provided at the side of the microfluidic device adjacent the harvesting reagent input lines. Additionally, no assay inlet is provided at the side of the microfluidic device adjacent the sample input ports on the carrier. In this configuration, the extra assay inlet can be used for dehydration chamber loading. Typically, loading of the dehydration chambers will use more than 5 l of assay solution. Alternatively, a separate dehydration solution could be used to keep assay volumes uniform across the microfluidic device.
[0197]
[0198] The intersection of the control line 532 with the fluid line 514 forms a valve at the intersection, referred to as an interface valve 530 because the valve prevents mixing at the interface between the sample and the assay. The interface valve 530 is actuated in response to fluid pressure in the control line and is operative to prevent fluid flow through the fluid lines. Generally, the multilayer microfluidic device discussed herein includes a number of elastomeric layers and the valves 530 include a deflectable membrane between the first layer and the second layer. In a push-up configuration, the deflectable membrane of the valve is deflectable into the fluid line 514 positioned above the intersection with the control line 532. In this configuration, the deflectable membrane deflects up into the fluid line to close the fluid line at the position of the valve, thus the reference to push-up valves. Releasing the pressure in the control line will result in the deflectable membrane returning to the undeflected position and thereby opening of the closed valve. Additional description of microfluidic devices including valves is provided in U.S. Patent Application No. 2005/0226742, the entire disclosure of which is hereby incorporated by reference in its entirety for all purposes.
[0199] As illustrated in
[0200] Referring once again to
[0201] The mixture is reacted to form a reaction product (620). A typical reaction included within the scope of the present invention is PCR, which involves thermocycling of the microfluidic device through a number of cycles as will be evident to one of skill in the art. The fluid line between the assay chamber and the sample chamber is closed (622) by actuation of interface valves 530. Closure of the interface valves separates the reaction product present in the sample chambers from the reaction products present in the assay chambers. Additionally, the containment valves 540 can be closed during thermocycling in order to prevent precipitation during the thermocycling process. A harvesting reagent flows from the harvesting port to the sample chamber (624) in order to begin the process of harvesting the reaction products present in the sample chambers. The harvesting port 136 is an example of a fluid input port useful in the harvesting process. As illustrated in
[0202] Dilation pumping is used in the illustrated embodiment to remove the reaction products from the microfluidic device as discussed in additional detail in relation to
[0203] It should be appreciated that the specific steps illustrated in
[0204]
[0205]
[0206]
[0207] It should be noted that the straight line representing the interface between the harvesting reagent and the reaction products is shown merely for purposes of simplicity and it will be apparent to one of skill in the art that in practice, a more complicated interface will be present.
[0208]
[0209]
[0210] Samples are loaded into sample chambers 510 and assays are loaded into assay chambers 512 as described in relation to
[0211] Isolation valve 840 is open during the sample and assay loading process to enable the depletion front to flow into the loading bowls 830. Valve 822 is open, allowing the samples to flow through the sample input lines to the various sample chambers. Since valve 820 is closed, samples are not allowed to pass into the harvesting reagent input line 810. It should be noted that containment valves 540 are illustrated in the closed state in
[0212] In
[0213]
[0214] Although
[0215]
[0216] Fluid pressure resulting from the flow of the harvesting reagent into the array portion of the microfluidic device results in expansion of the sample input lines and sample chambers above the valve 822. The pump cycle is initiated by this pressurization of the sample chambers. As described below, closing of valve 820 and opening of valve 822 will enable the pressurized harvesting reagent and reaction products to be recovered from the microfluidic device as it flows through the microfluidic device.
[0217]
[0218] Dilation pumping (also know as volumetric capacitive pumping) is a method of operating a properly configured integrated fluidic circuit (microfluidic device) to obtain precise, low rate, low volume pumping through all configured elements of the microfluidic device. Dilation pumping is unique to microfluidic circuits that utilize channels that have one or more channel walls formed from an elastomeric material. As an example, the flow of the harvesting reagent through the sample input lines and sample chambers is considered volumetric capacitive pumping. Pumping proceeds by the closure of valves 822 and the opening of valves 820. As discussed above, harvesting reagent ports (not illustrated) are pressurized to introduce the harvesting reagent into the topmost sample input lines and sample chambers, which can be considered as a channel. The pressurization of microfluidic channels with at least one channel wall formed from an elastomeric material results in expansion of the elastomeric wall(s) outward from the channel with a resulting increase in channel volume that is proportional to the fluidic pressure (or gaseous pressure in alternate embodiments) within the channel, the elastic properties of the elastomeric channel wall material such as Young's modulus, and the length and cross sectional area of the channel. The sample input lines and sample chambers are allowed to pressurize and then valves 820 is closed as illustrated in
[0219] Thus embodiments provide a method of dilation pumping that includes closing a first valve disposed between the sample chamber and the sample input port (i.e., valve 822), opening a second valve disposed between the harvesting port and the sample chamber (i.e., valve 820), closing the second valve, opening the first valve, and repeating these steps a predetermined number of times. Between the steps of opening the second valve and closing the second valve, the harvesting reagent flows into the sample input lines and sample chambers, pressurizing the channel as described above. After the dilation pumping process is complete, harvesting reagent substantially fills the sample input lines and sample chambers (e.g., recovery rates>95%), thereby pooling the reaction products associated with a given sample in the sample input port from which the given sample was initially dispensed.
[0220] Dilation pumping provides benefits not typically available using conventional techniques. For example, dilation pumping enables for a slow removal of the reaction products from the microfluidic device. In an exemplary embodiment, the reaction products are recovered at a fluid flow rate of less than 100 l per hour. In this example, for 48 reaction products distributed among the reaction chambers in each column, with a volume of each reaction product of about 1.5 l, removal of the reaction products in a period of about 30 minutes, will result in a fluid flow rate of 72 l/hour. (i.e., 48*1.5/0.5 hour). In other embodiments, the removal rate of the reaction products is performed at a rate of less than 90 l/hr, 80 l/hr, 70 l/hr, 60 l/hr, 50 l/hr, 40 l/hr, 30 l/hr, 20 l/hr, 10 l/hr, 9 l/hr, less than 8 l/hr, less than 7 l/hr, less than 6 l/hr, less than 5 l/hr, less than 4 l/hr, less than 3 l/hr, less than 2 l/hr, less than 1 l/hr, or less than 0.5 l/hr.
[0221] Dilation pumping results in clearing of substantially a high percentage and potentially all the reaction products present in the microfluidic device. Some embodiments remove more than 75% of the reaction products present in the reaction chambers (e.g., sample chambers) of the microfluidic device. As an example, some embodiments remove more than 80%, 85%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% of the reaction products present in the reaction chambers.
[0222] In some embodiments, a harvesting valve is provided on the microfluidic device to obstruct the flow of harvesting reagent through the device. Application of a pressure source to a harvesting input port results in flow of harvesting fluid (e.g., a harvesting liquid) through harvest reagent input lines up to the harvesting valve. The permeability of the materials utilized to fabricate the microfluidic device enables such a harvesting fluid to fill the harvest reagent input lines, typically expelling air initially present in such lines. The presence of the harvesting valve will obstruct the flow of the harvest reagent at the location of the harvesting valve. Actuation (i.e., opening) of the harvesting valve will result in the harvesting fluid flowing through the harvest reagent input lines downstream of the harvesting valve. In other embodiments, a harvesting valve is replaced with one or more other suitable valves as appropriate to the particular application. For example, in the embodiment illustrated in
[0223] Fabrication methods using elastomeric materials and methods for design of devices and their components have been described in detail in the scientific and patent literature. See, e.g., Unger et al. (2000) Science 288:113-116; U.S. Pat. Nos. US 6,960,437 (Nucleic acid amplification utilizing microfluidic devices); 6,899,137 (Microfabricated elastomeric valve and pump systems); 6,767,706 (Integrated active flux microfluidic devices and methods); 6,752,922 (Microfluidic chromatography); 6,408,878 (Microfabricated elastomeric valve and pump systems); 6,645,432 (Microfluidic systems including three-dimensionally arrayed channel networks); U.S. Patent Application Publication Nos. 2004/0115838; 2005/0072946; 2005/0000900; 2002/0127736; 2002/0109114; 2004/0115838; 2003/0138829; 2002/0164816; 2002/0127736; and 2002/0109114; PCT Publication Nos. WO 2005/084191; WO 05/030822A2; and WO 01/01025; Quake & Scherer, 2000, From micro to nanofabrication with soft materials Science 290: 1536-40; Unger et al., 2000, Monolithic microfabricated valves and pumps by multilayer soft lithography Science 288:113-116; Thorsen et al., 2002, Microfluidic large-scale integration Science 298:580-584; Chou et al., 2000, Microfabricated Rotary Pump Biomedical Microdevices 3:323-330; Liu et al., 2003, Solving the world-to-chip interface problem with a microfluidic matrix Analytical Chemistry 75, 4718-23, Hong et al, 2004, A nanoliter-scale nucleic acid processor with parallel architecture Nature Biotechnology 22:435-39.
[0224] According to certain embodiments describer herein, the detection and/or quantification of one or more target nucleic acids from one or more samples may generally be carried out on a microfluidic device by obtaining a sample, optionally pre-amplifying the sample, and distributing the optionally pre-amplified sample, or aliquots thereof, into reaction chambers of a microfluidic device containing the appropriate buffers, primers, optional probe(s), and enzyme(s), subjecting these mixtures to amplification, and querying the aliquots for the presence of amplified target nucleic acids. The sample aliquots may have a volume of less than 1 picoliter or, in various embodiments, in the range of about 1 picoliter to about 500 nanoliters, in a range of about 2 picoliters to about 50 picoliters, in a range of about 5 picoliters to about 25 picoliters, in the range of about 100 picoliters to about 20 nanoliters, in the range of about 1 nanoliter to about 20 nanoliters, and in the range of about 5 nanoliters to about 15 nanoliters. In many embodiments, sample aliquots account for the majority of the volume of the amplification mixtures. Thus, amplification mixtures can have a volume of less than 1 picoliter or, in various embodiments about 2, about 5 about 7, about 10, about 15, about 20, about 25, about 50, about 100, about 250, about 500, and about 750 picoliters; or about 1, about 2, about 5, about 7, about 15, about 20, about 25, about 50, about 250, and about 500 nanoliters. The amplification mixtures can also have a volume within any range bounded by any of these values (e.g., about 2 picoliters to about 50 picoliters).
[0225] In certain embodiments, multiplex detection is carried out in individual amplification mixture, e.g., in individual reaction chambers of a microfluidic device, which can be used to further increase the number of samples and/or targets that can be analyzed in a single assay or to carry out comparative methods, such as comparative genomic hybridization (CGH). In various embodiments, up to 2, 3, 4, 5, 6, 7, 8, 9, 10, 50, 100, 500, 1000, 5000, 10000 or more amplification reactions are carried out in each individual reaction chamber.
[0226] In specific embodiments, the assay usually has a dynamic range of at least 3 orders of magnitude, more often at least 4, at least 5, at least 6, at least 7, or at least 8 orders of magnitude.
Quantitative Real-Time PCR and Other Detection and Quantification Methods
[0227] Any method of detection and/or quantification of nucleic acids can be used in the invention to detect amplification products. In one embodiment, PCR (polymerase chain reaction) is used to amplify and/or quantify target nucleic acids. In other embodiments, other amplification systems or detection systems are used, including, e.g., systems described in U.S. Pat. No. 7,118,910 (which is incorporated herein by reference in its entirety for its description of amplification/detection systems) and Invader assays; PE BioSystems). In particular embodiments, real-time quantification methods are used. For example, quantitative real-time PCR methods can be used to determine the quantity of a target nucleic acid present in a sample by measuring the amount of amplification product formed during the amplification process itself.
[0228] Fluorogenic nuclease assays are one specific example of a real-time quantification method that can be used successfully in the methods described herein. This method of monitoring the formation of amplification product involves the continuous measurement of PCR product accumulation using a dual-labeled fluorogenic oligonucleotide probean approach frequently referred to in the literature as the TaqMan method. See U.S. Pat. No. 5,723,591; Heid et al., 1996, Real-time quantitative PCR Genome Res. 6:986-94, each incorporated herein by reference in their entireties for their descriptions of fluorogenic nuclease assays. It will be appreciated that while TaqMan probes are the most widely used for qPCR, the invention is not limited to use of these probes; any suitable probe can be used.
[0229] Other detection/quantification methods that can be employed in the present invention include FRET and template extension reactions, molecular beacon detection, Scorpion detection, Invader detection, and padlock probe detection.
[0230] FRET and template extension reactions utilize a primer labeled with one member of a donor/acceptor pair and a nucleotide labeled with the other member of the donor/acceptor pair. Prior to incorporation of the labeled nucleotide into the primer during a template-dependent extension reaction, the donor and acceptor are spaced far enough apart that energy transfer cannot occur. However, if the labeled nucleotide is incorporated into the primer and the spacing is sufficiently close, then energy transfer occurs and can be detected. These methods are particularly useful in conducting single base pair extension reactions in the detection of single nucleotide polymorphisms and are described in U.S. Pat. No. 5,945,283 and PCT Publication WO 97/22719.
[0231] With molecular beacons, a change in conformation of the probe as it hybridizes to a complementary region of the amplified product results in the formation of a detectable signal. The probe itself includes two sections: one section at the 5 end and the other section at the 3 end. These sections flank the section of the probe that anneals to the probe binding site and are complementary to one another. One end section is typically attached to a reporter dye and the other end section is usually attached to a quencher dye. In solution, the two end sections can hybridize with each other to form a hairpin loop. In this conformation, the reporter and quencher dye are in sufficiently close proximity that fluorescence from the reporter dye is effectively quenched by the quencher dye. Hybridized probe, in contrast, results in a linearized conformation in which the extent of quenching is decreased. Thus, by monitoring emission changes for the two dyes, it is possible to indirectly monitor the formation of amplification product. Probes of this type and methods of their use are described further, for example, by Piatek et al., 1998, Nat. Biotechnol. 16:359-63; Tyagi, and Kramer, 1996, Nat. Biotechnology 14:303-308; and Tyagi, et al., 1998, Nat. Biotechnol. 16:49-53 (1998).
[0232] The Scorpion detection method is described, for example, by Thelwell et al. 2000, Nucleic Acids Research, 28:3752-3761 and Solinas et al., 2001, Duplex Scorpion primers in SNP analysis and FRET applications Nucleic Acids Research 29:20. Scorpion primers are fluorogenic PCR primers with a probe element attached at the 5-end via a PCR stopper. They are used in real-time amplicon-specific detection of PCR products in homogeneous solution. Two different formats are possible, the stem-loop format and the duplex format. In both cases the probing mechanism is intramolecular. The basic elements of Scorpions in all formats are: (i) a PCR primer; (ii) a PCR stopper to prevent PCR read-through of the probe element; (iii) a specific probe sequence; and (iv) a fluorescence detection system containing at least one fluorophore and quencher. After PCR extension of the Scorpion primer, the resultant amplicon contains a sequence that is complementary to the probe, which is rendered single-stranded during the denaturation stage of each PCR cycle. On cooling, the probe is free to bind to this complementary sequence, producing an increase in fluorescence, as the quencher is no longer in the vicinity of the fluorophore. The PCR stopper prevents undesirable read-through of the probe by Taq DNA polymerase.
[0233] Invader assays (Third Wave Technologies, Madison, Wis.) are used particularly for SNP genotyping and utilize an oligonucleotide, designated the signal probe, that is complementary to the target nucleic acid (DNA or RNA) or polymorphism site. A second oligonucleotide, designated the Invader Oligo, contains the same 5 nucleotide sequence, but the 3 nucleotide sequence contains a nucleotide polymorphism. The Invader Oligo interferes with the binding of the signal probe to the target nucleic acid such that the 5 end of the signal probe forms a flap at the nucleotide containing the polymorphism. This complex is recognized by a structure specific endonuclease, called the Cleavase enzyme. Cleavase cleaves the 5 flap of the nucleotides. The released flap binds with a third probe bearing FRET labels, thereby forming another duplex structure recognized by the Cleavase enzyme. This time, the Cleavase enzyme cleaves a fluorophore away from a quencher and produces a fluorescent signal. For SNP genotyping, the signal probe will be designed to hybridize with either the reference (wild type) allele or the variant (mutant) allele. Unlike PCR, there is a linear amplification of signal with no amplification of the nucleic acid. Further details sufficient to guide one of ordinary skill in the art are provided by, for example, Neri, B. P., et al., Advances in Nucleic Acid and Protein Analysis 3826:117-125, 2000) and U.S. Pat. No. 6,706,471.
[0234] Padlock probes (PLPs) are long (e.g., about 100 bases) linear oligonucleotides. The sequences at the 3 and 5 ends of the probe are complementary to adjacent sequences in the target nucleic acid. In the central, noncomplementary region of the PLP there is a tag sequence that can be used to identify the specific PLP. The tag sequence is flanked by universal priming sites, which allow PCR amplification of the tag. Upon hybridization to the target, the two ends of the PLP oligonucleotide are brought into close proximity and can be joined by enzymatic ligation. The resulting product is a circular probe molecule catenated to the target DNA strand. Any unligated probes (i.e., probes that did not hybridize to a target) are removed by the action of an exonuclease. Hybridization and ligation of a PLP requires that both end segments recognize the target sequence. In this manner, PLPs provide extremely specific target recognition.
[0235] The tag regions of circularized PLPs can then be amplified and resulting amplicons detected. For example, TaqMan real-time PCR can be carried out to detect and quantify the amplicon. The presence and amount of amplicon can be correlated with the presence and quantity of target sequence in the sample. For descriptions of PLPs see, e.g., Landegren et al., 2003, Padlock and proximity probes for in situ and array-based analyses: tools for the post-genomic era, Comparative and Functional Genomics 4:525-30; Nilsson et al., 2006, Analyzing genes using closing and replicating circles Trends Biotechnol. 24:83-8; Nilsson et al., 1994, Padlock probes: circularizing oligonucleotides for localized DNA detection, Science 265:2085-8.
[0236] In particular embodiments, fluorophores that can be used as detectable labels for probes include, but are not limited to, rhodamine, cyanine 3 (Cy 3), cyanine 5 (Cy 5), fluorescein, Vic, Liz, Tamra, 5-Fam, 6-Fam, and Texas Red (Molecular Probes). (Vic, Liz, Tamra, 5-Fam, 6-Fam are all available from Applied Biosystems, Foster City, Calif.).
[0237] Devices have been developed that can perform a thermal cycling reaction with compositions containing a fluorescent indicator, emit a light beam of a specified wavelength, read the intensity of the fluorescent dye, and display the intensity of fluorescence after each cycle. Devices comprising a thermal cycler, light beam emitter, and a fluorescent signal detector, have been described, e.g., in U.S. Pat. Nos. 5,928,907; 6,015,674; and 6,174,670.
[0238] In some embodiments, each of these functions can be performed by separate devices. For example, if one employs a Q-beta replicase reaction for amplification, the reaction may not take place in a thermal cycler, but could include a light beam emitted at a specific wavelength, detection of the fluorescent signal, and calculation and display of the amount of amplification product.
[0239] In particular embodiments, combined thermal cycling and fluorescence detecting devices can be used for precise quantification of target nucleic acids. In some embodiments, fluorescent signals can be detected and displayed during and/or after one or more thermal cycles, thus permitting monitoring of amplification products as the reactions occur in real-time. In certain embodiments, one can use the amount of amplification product and number of amplification cycles to calculate how much of the target nucleic acid sequence was in the sample prior to amplification.
[0240] According to some embodiments, one can simply monitor the amount of amplification product after a predetermined number of cycles sufficient to indicate the presence of the target nucleic acid sequence in the sample. One skilled in the art can easily determine, for any given sample type, primer sequence, and reaction condition, how many cycles are sufficient to determine the presence of a given target nucleic acid.
[0241] According to certain embodiments, one can employ an internal standard to quantify the amplification product indicated by the fluorescent signal. See, e.g., U.S. Pat. No. 5,736,333.
[0242] In various embodiments, employing preamplification, the number of preamplification cycles is sufficient to add one or more nucleotide tags to the target nucleotide sequences, so that the relative copy numbers of the tagged target nucleotide sequences is substantially representative of the relative copy numbers of the target nucleic acids in the sample. For example, preamplification can be carried out for 2-20 cycles to introduce the sample-specific or set-specific nucleotide tags. In other embodiments, detection is carried out at the end of exponential amplification, i.e., during the plateau phase, or endpoint PCR is carried out. In this instance, preamplification will normalize amplicon copy number across targets and across samples. In various embodiments, preamplification and/or amplification can be carried out for about: 2, 4, 10, 15, 20, 25, 30, 35, or 40 cycles or for a number of cycles falling within any range bounded by any of these values.
Labeling Strategies
[0243] Any suitable labeling strategy can be employed in the methods of the invention. Where the assay mixture is aliquoted, and each aliquot is analyzed for presence of a single amplification product, a universal detection probe can be employed in the amplification mixture. In particular embodiments, real-time PCR detection can be carried out using a universal qPCR probe. Suitable universal qPCR probes include double-stranded DNA dyes, such as SYBR Green, Pico Green (Molecular Probes, Inc., Eugene, Oreg.), Eva Green (Biotinum), ethidium bromide, and the like (see Zhu et al., 1994, Anal. Chem. 66:1941-48). Suitable universal qPCR probes also include sequence-specific probes that bind to a nucleotide sequence present in all amplification products. Binding sites for such probes can be conveniently introduced into the tagged target nucleic acids during amplification.
[0244] Alternatively, one or more target-specific qPCR probes (i.e., specific for a target nucleotide sequence to be detected) is employed in the amplification mixtures to detect amplification products. Target-specific probes could be useful, e.g., when only a few target nucleic acids are to be detected in a large number of samples. For example, if only three targets were to be detected, a target-specific probe with a different fluorescent label for each target could be employed. By judicious choice of labels, analyses can be conducted in which the different labels are excited and/or detected at different wavelengths in a single reaction. See, e.g., Fluorescence Spectroscopy (Pesce et al., Eds.) Marcel Dekker, New York, (1971); White et al., Fluorescence Analysis: A Practical Approach, Marcel Dekker, New York, (1970); Berlman, Handbook of Fluorescence Spectra of Aromatic Molecules, 2nd ed., Academic Press, New York, (1971); Griffiths, Colour and Constitution of Organic Molecules, Academic Press, New York, (1976); Indicators (Bishop, Ed.). Pergamon Press, Oxford, 19723; and Haugland, Handbook of Fluorescent Probes and Research Chemicals, Molecular Probes, Eugene (1992).
Removal of Undesired Reaction Components
[0245] It will be appreciated that reactions involving complex mixtures of nucleic acids in which a number of reactive steps are employed can result in a variety of unincorporated reaction components, and that removal of such unincorporated reaction components, or reduction of their concentration, by any of a variety of clean-up procedures can improve the efficiency and specificity of subsequently occurring reactions. For example, it may be desirable, in some embodiments, to remove, or reduce the concentration of preamplification primers prior to carrying out the amplification steps described herein.
[0246] In certain embodiments, the concentration of undesired components can be reduced by simple dilution. For example, preamplified samples can be diluted about 2-, 5-, 10-, 50-, 100-, 500-, 1000-fold prior to amplification to improve the specificity of the subsequent amplification step.
[0247] In some embodiments, undesired components can be removed by a variety of enzymatic means. Alternatively, or in addition to the above-described methods, undesired components can be removed by purification. For example, a purification tag can be incorporated into any of the above-described primers (e.g., into the barcode nucleotide sequence) to facilitate purification of the tagged target nucleotides.
[0248] In particular embodiments, clean-up includes selective immobilization of the desired nucleic acids. For example, desired nucleic acids can be preferentially immobilized on a solid support. In an illustrative embodiment, an affinity moiety, such as biotin (e.g., photo-biotin), is attached to desired nucleic acid, and the resulting biotin-labeled nucleic acids immobilized on a solid support comprising an affinity moiety-binder such as streptavidin. Immobilized nucleic acids can be queried with probes, and non-hybridized and/or non-ligated probes removed by washing (See, e.g., Published P.C.T. Application WO 03/006677 and U.S. Ser. No. 09/931,285.) Alternatively, immobilized nucleic acids can be washed to remove other components and then released from the solid support for further analysis. This approach can be used, for example, in recovering target amplicons from amplification mixtures after the addition of primer binding sites for DNA sequencing. In particular embodiments, an affinity moiety, such as biotin, can be attached to an amplification primer such that amplification produces an affinity moiety-labeled (e.g., biotin-labeled) amplicon. Thus, for example, where three primers are employed to add barcode and nucleotide tag elements to a target nucleotide sequence, as described above, at least one of the barcode or reverse primers can include an affinity moiety. Where four primers (two inner primers and two outer primers) are employed to add desired element to a target nucleotide sequence, at least one of the outer primers can include an affinity moiety.
Data Output and Analysis
[0249] In certain embodiments, when the methods of the invention are carried out on a matrix-type microfluidic device, the data can be output as a heat matrix (also termed heat map). In the heat matrix, each square, representing a reaction chamber on the DA matrix, has been assigned a color value which can be shown in gray scale, but is more typically shown in color. In gray scale, black squares indicate that no amplification product was detected, whereas white squares indicate the highest level of amplification produce, with shades of gray indicating levels of amplification product in between. In a further aspect, a software program may be used to compile the data generated in the heat matrix into a more reader-friendly format.
Applications
[0250] The methods of the invention are applicable to any technique aimed at detecting the presence or amount of one or more target nucleic acids in a nucleic acid sample. Thus, for example, these methods are applicable to identifying the presence of particular polymorphisms (such as SNPs), alleles, or haplotypes, or chromosomal abnormalities, such as amplifications, deletions, or aneuploidy. The methods may be employed in genotyping, which can be carried out in a number of contexts, including diagnosis of genetic diseases or disorders, pharmacogenomics (personalized medicine), quality control in agriculture (e.g., for seeds or livestock), the study and management of populations of plants or animals (e.g., in aquaculture or fisheries management or in the determination of population diversity), or paternity or forensic identifications. The methods of the invention can be applied in the identification of sequences indicative of particular conditions or organisms in biological or environmental samples. For example, the methods can be used in assays to identify pathogens, such as viruses, bacteria, and fungi). The methods can also be used in studies aimed at characterizing environments or microenvironments, e.g., characterizing the microbial species in the human gut.
[0251] These methods can also be employed in determinations DNA or RNA copy number. Determinations of aberrant DNA copy number in genomic DNA is useful, for example, in the diagnosis and/or prognosis of genetic defects and diseases, such as cancer. Determination of RNA copy number, i.e., expression level is useful for expression monitoring of genes of interest, e.g., in different individuals, tissues, or cells under different conditions (e.g., different external stimuli or disease states) and/or at different developmental stages.
[0252] In addition, the methods can be employed to prepare nucleic acid samples for further analysis, such as, e.g., DNA sequencing.
[0253] Finally, nucleic acid samples can be tagged as a first step, prior subsequent analysis, to reduce the risk that mislabeling or cross-contamination of samples will compromise the results. For example, any physician's office, laboratory, or hospital could tag samples immediately after collection, and the tags could be confirmed at the time of analysis. Similarly, samples containing nucleic acids collected at a crime scene could be tagged as soon as practicable, to ensure that the samples could not be mislabeled or tampered with. Detection of the tag upon each transfer of the sample from one party to another could be used to establish chain of custody of the sample.
Kits
[0254] Kits according to the invention include one or more reagents useful for practicing one or more assay methods of the invention. A kit generally includes a package with one or more containers holding the reagent(s) (e.g., primers and/or probe(s)), as one or more separate compositions or, optionally, as admixture where the compatibility of the reagents will allow. The kit can also include other material(s) that may be desirable from a user standpoint, such as a buffer(s), a diluent(s), a standard(s), and/or any other material useful in sample processing, washing, or conducting any other step of the assay.
[0255] Kits according to the invention generally include instructions for carrying out one or more of the methods of the invention. Instructions included in kits of the invention can be affixed to packaging material or can be included as a package insert. While the instructions are typically written or printed materials they are not limited to such. Any medium capable of storing such instructions and communicating them to an end user is contemplated by this invention. Such media include, but are not limited to, electronic storage media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), RF tags, and the like. As used herein, the term instructions can include the address of an internet site that provides the instructions.
[0256] It is understood that the examples and embodiments described herein are for illustrative purposes only and that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be included within the spirit and purview of this application and scope of the appended claims.
[0257] In addition, all other publications, patents, and patent applications cited herein are hereby incorporated by reference in their entirety for all purposes.
EXAMPLES
Example 1
Multi-Primer Amplification Method for Barcoding of Target Nucleic Acids in Preparation for DNA Sequencing
[0258] Genomic DNA samples (BioChain, USA) at 100 and 0 ng/ml (negative control [NTC]) were amplified for 25 cycles 7900HT Fast Real-Time PCR System (Applied Biosystems, USA) with the following primer pairs at 200 nM per primer: 1) 454 tails; 2) A5 specific primers; and 3) the three primers shown in
Example 2
Multi-Primer Amplification Method for Quantifying Target Nucleic Acids in Preparation for DNA Sequencing
[0259] Primers for preparing genomic DNA for sequencing using various DNA conventional DNA sequencing methods are shown below.
TABLE-US-00001 ShotGunForward: (SEQIDNO:1) 5-CCATCTCATCCCTGCGTGTC-3 ShotGunReverse: (SEQIDNO:2) 5-CCTATCCCCTGTGTGCCTTG-3 ShotGunUPRForward: (SEQIDNO:3) 5-GGCGGCGACCATCTCATCCCTGCGTGTC-3 MIDForward: (SEQIDNO:4) 5-GCCTCCCTCGCGCCATCAG-3 MIDReverse: (SEQIDNO:5) 5-GCCTTGCCAGCCCGCTCAG-3 MIDUPRForward: (SEQIDNO:6) 5-GGCGGCGAGCCTCCCTCGCGCCATCAG-3 SolexaForward: (SEQIDNO:7) 5-ACACTCTTTCCCTACACGA-3 SolexaReverse: (SEQIDNO:8) 5-CAAGCAGAAGACGGCATA-3 SolexaUPRForward: (SEQIDNO:9) 5-GGCGGCGAACACTCTTTCCCTACACGA-3 SolidForward: (SEQIDNO:10) 5-CCACTACGCCTCCGCTTTCCTCTCTATG-3 SolidReverse: (SEQIDNO:11) 5-CTGCCCCGGGTTCCTCATTCT-3 SolidUPRForward: (SEQIDNO:12) 5-GGCGGCGACCACTACGCCTCCGCTTTCCTCTCTATG-3 454TitaniumForward: (SEQIDNO:13) 5-CCATCTCATCCCTGCGTG-3 454TitaniumReverse: (SEQIDNO:14) 5-CCTATCCCCTGTGTGCCTTG-3 454TitaniumUPRForward: (SEQIDNO:15) 5-GGCGGCGACCATCTCATCCCTGCGTG-3 SolexasmRNAForward: (SEQIDNO:16) 5-TAATGATACGGCGACCACC-3 SolexasmRNAReverse: (SEQIDNO:17) 5-ACAAGCAGAAGACGGCATAC-3 SolexasmRNAUPLForward: (SEQIDNO:18) 5-GGCGGCGATAATGATACGGCGACCAC-3
[0260] The properties of these primers is shown in Table 1 below.
TABLE-US-00002 TABLE 1 Length Tm Primer (nt) CG % ( C.) Primer-Dimer 454- standard (ShotGun) ShotGun 20 60 68.4 No self/cross- Forward: dimer, 1.5 C. diff in Tm ShotGun 20 60 66.9 Reverse: ShotGun 28 67.8 84.8 UPR Forward: 454-MID MID 19 73.6 74.9 4-bases of self- Forward: dimer(F.UPL) & cross-dimer(F./UPL, R/UPL) MID 19 73.6 74.9 High GC Reverse: MID UPR 27 77.7 88.5 Forward: Solexa Solexa 19 47.3 57.8 No dimer, Forward: 2.1 C. diff in Tm Solexa 18 50 60.6 Low GC Reverse: Solexa 27 59.2 78.4 UPR Forward: Solid Solid 28 57.1 74.7 Strong self-dimer Forward: & cross-dimer Solid 21 61.9 72.5 variety of GC Reverse: & Tm Solid 36 63.8 85.6 UPR Forward:
[0261] The reaction mixture used for amplification of genomic DNA to incorporate primer sequences is given below in Table 2.
TABLE-US-00003 TABLE 2 Add V l of TE into dry probe tube 100 uM stock solution V = Total nmol value of the dry probe * 10 10X Fluidigm Assay 100 Mol Forward: 4 2000 nM UPR Forward: 4 2000 nM Reverse: 8 4000 nM TE: 184 Total: 200
Example 3
Additional Illustrative Primers for Barcoding of Target Nucleic Acids in Preparation for 454 DNA Sequencing
[0262] Tables 3 and 4 below show additional illustrative primers for barcoding of target nucleic acids in preparation for 454 DNA sequencing. 454F refers to a 454 forward primer binding site; 454R refers to 454 reverse primer binding site. BC refers to a nucleotide barcode. TAG refers to a nucleotide tag. P53 refers to a target-specific primer sequence.
TABLE-US-00004 TABLE3 SequenceName Sequence SEQID 454F-BC1-TAG8 GCCTCCCTCGCGCCATCAGGCATGCACACTGACGACATGGTTCTACA (SEQIDNO:19) 454F-BC2-TAG8 GCCTCCCTCGCGCCATCAGCGTACGACACTGACGACATGGTTCTACA (SEQIDNO:20) 454F-BC3-TAG8 GCCTCCCTCGCGCCATCAGGTCAGCACACTGACGACATGGTTCTACA (SEQIDNO:21) 454F-BC4-TAG8 GCCTCCCTCGCGCCATCAGAGCTGCACACTGACGACATGGTTCTACA (SEQIDNO:22) 454F-BC5-TAG8 GCCTCCCTCGCGCCATCAGTGCATCACACTGACGACATGGTTCTACA (SEQIDNO:23) 454F-BC6-TAG8 GCCTCCCTCGCGCCATCAGCTGATGACACTGACGACATGGTTCTACA (SEQIDNO:24) 454F-BC7-TAG8 GCCTCCCTCGCGCCATCAGGTAGTCACACTGACGACATGGTTCTACA (SEQIDNO:25) 454F-BC8-TAG8 GCCTCCCTCGCGCCATCAGGTCGATACACTGACGACATGGTTCTACA (SEQIDNO:26) 454F-BC9-TAG8 GCCTCCCTCGCGCCATCAGGATACGACACTGACGACATGGTTCTACA (SEQIDNO:27) 454F-BC10-TAG8 GCCTCCCTCGCGCCATCAGTGATGCACACTGACGACATGGTTCTACA (SEQIDNO:28) 454F-BC11-TAG8 GCCTCCCTCGCGCCATCAGAGCTGAACACTGACGACATGGTTCTACA (SEQIDNO:29) 454F-BC12-TAG8 GCCTCCCTCGCGCCATCAGACTGTAACACTGACGACATGGTTCTACA (SEQIDNO:30) 454F-BC13-TAG8 GCCTCCCTCGCGCCATCAGTGCATGACACTGACGACATGGTTCTACA (SEQIDNO:31) 454F-BC14-TAG8 GCCTCCCTCGCGCCATCAGAGTCTAACACTGACGACATGGTTCTACA (SEQIDNO:32) 454F-BC15-TAG8 GCCTCCCTCGCGCCATCAGTGTCTGACACTGACGACATGGTTCTACA (SEQIDNO:33) 454F-BC16-TAG8 GCCTCCCTCGCGCCATCAGGCTAGCACACTGACGACATGGTTCTACA (SEQIDNO:34) 454F-BC17-TAG8 GCCTCCCTCGCGCCATCAGGATAGCACACTGACGACATGGTTCTACA (SEQIDNO:35) 454F-BC18-TAG8 GCCTCCCTCGCGCCATCAGGCTACTACACTGACGACATGGTTCTACA (SEQIDNO:36) 454F-BC19-TAG8 GCCTCCCTCGCGCCATCAGCTATGCACACTGACGACATGGTTCTACA (SEQIDNO:37) 454F-BC20-TAG8 GCCTCCCTCGCGCCATCAGGCTATGACACTGACGACATGGTTCTACA (SEQIDNO:38) 454F-BC21-TAG8 GCCTCCCTCGCGCCATCAGCGTGCAACACTGACGACATGGTTCTACA (SEQIDNO:39) 454F-BC22-TAG8 GCCTCCCTCGCGCCATCAGATAGCTACACTGACGACATGGTTCTACA (SEQIDNO:40) 454F-BC23-TAG8 GCCTCCCTCGCGCCATCAGTGTAGCACACTGACGACATGGTTCTACA (SEQIDNO:41) 454F-BC24-TAG8 GCCTCCCTCGCGCCATCAGGTGCTAACACTGACGACATGGTTCTACA (SEQIDNO:42) 454F-BC25-TAG8 GCCTCCCTCGCGCCATCAGGTCATGACACTGACGACATGGTTCTACA (SEQIDNO:43) 454F-BC26-TAG8 GCCTCCCTCGCGCCATCAGATCGTGACACTGACGACATGGTTCTACA (SEQIDNO:44) 454F-BC27-TAG8 GCCTCCCTCGCGCCATCAGTGTACGACACTGACGACATGGTTCTACA (SEQIDNO:45) 454F-BC28-TAG8 GCCTCCCTCGCGCCATCAGAGTGTAACACTGACGACATGGTTCTACA (SEQIDNO:46) 454F-BC29-TAG8 GCCTCCCTCGCGCCATCAGTGACAGACACTGACGACATGGTTCTACA (SEQIDNO:47) 454F-BC30-TAG8 GCCTCCCTCGCGCCATCAGGATCACACACTGACGACATGGTTCTACA (SEQIDNO:48) 454F-BC31-TAG8 GCCTCCCTCGCGCCATCAGCTAGAGACACTGACGACATGGTTCTACA (SEQIDNO:49) 454F-BC32-TAG8 GCCTCCCTCGCGCCATCAGCTAGTCACACTGACGACATGGTTCTACA (SEQIDNO:50) 454F-BC33-TAG8 GCCTCCCTCGCGCCATCAGAGCTAGACACTGACGACATGGTTCTACA (SEQIDNO:51) 454F-BC34-TAG8 GCCTCCCTCGCGCCATCAGTGACTGACACTGACGACATGGTTCTACA (SEQIDNO:52) 454F-BC35-TAG8 GCCTCCCTCGCGCCATCAGTGATAGACACTGACGACATGGTTCTACA (SEQIDNO:53) 454F-BC36-TAG8 GCCTCCCTCGCGCCATCAGCGTATCACACTGACGACATGGTTCTACA (SEQIDNO:54) 454F-BC37-TAG8 GCCTCCCTCGCGCCATCAGGTCTGAACACTGACGACATGGTTCTACA (SEQIDNO:55) 454F-BC38-TAG8 GCCTCCCTCGCGCCATCAGCATGACACACTGACGACATGGTTCTACA (SEQIDNO:56) 454F-BC39-TAG8 GCCTCCCTCGCGCCATCAGCGATGAACACTGACGACATGGTTCTACA (SEQIDNO:57) 454F-BC40-TAG8 GCCTCCCTCGCGCCATCAGGCTGATACACTGACGACATGGTTCTACA (SEQIDNO:58) 454F-BC41-TAG8 GCCTCCCTCGCGCCATCAGCAGTACACACTGACGACATGGTTCTACA (SEQIDNO:59) 454F-BC42-TAG8 GCCTCCCTCGCGCCATCAGGCGACTACACTGACGACATGGTTCTACA (SEQIDNO:60) 454F-BC43-TAG8 GCCTCCCTCGCGCCATCAGGTACGAACACTGACGACATGGTTCTACA (SEQIDNO:61) 454F-BC44-TAG8 GCCTCCCTCGCGCCATCAGACGCTAACACTGACGACATGGTTCTACA (SEQIDNO:62) 454F-BC45-TAG8 GCCTCCCTCGCGCCATCAGAGCATCACACTGACGACATGGTTCTACA (SEQIDNO:63) 454F-BC46-TAG8 GCCTCCCTCGCGCCATCAGGATGCTACACTGACGACATGGTTCTACA (SEQIDNO:64) 454F-BC47-TAG8 GCCTCCCTCGCGCCATCAGGTCTGCACACTGACGACATGGTTCTACA (SEQIDNO:65) 454F-BC48-TAG8 GCCTCCCTCGCGCCATCAGATGCGAACACTGACGACATGGTTCTACA (SEQIDNO:66)
TABLE-US-00005 TABLE4 SequenceName Sequence SEQID TAG8-P53-1+ ACACTGACGACATGGTTCTACAACTGTCCAGCTTTGTGCC (SEQIDNO:67) TAG8-P53-2+ ACACTGACGACATGGTTCTACAGATCATCATAGGAGTTGCATTGTTG (SEQIDNO:68) TAG8-P53-3+ ACACTGACGACATGGTTCTACACGGACCTTTGTCCTTCCT (SEQIDNO:69) TAG8-P53-4+ ACACTGACGACATGGTTCTACAATGCAAACCTCAATCCCTCC (SEQIDNO:70) TAG8-P53-5+ ACACTGACGACATGGTTCTACAAGTTTCTTCCCATGCACCTG (SEQIDNO:71) TAG8-P53-6+ ACACTGACGACATGGTTCTACAGTGAATCCCCGTCTCTACTAAAA (SEQIDNO:72) TAG8-P53-7+ ACACTGACGACATGGTTCTACATGTTTCCCATTTGCGGTTATGA (SEQIDNO:73) TAG8-P53-8+ ACACTGACGACATGGTTCTACAAGTTGTGGGACTGCTTTATACATT (SEQIDNO:74) 454R-P53-1 GCCTTGCCAGCCCGCTCAGTCCTCTGCCTAGGCGTT (SEQIDNO:75) 454R-P53-2 GCCTTGCCAGCCCGCTCAGGAAATGTAAATGTGGAGCCAAACA (SEQIDNO:76) 454R-P53-3 GCCTTGCCAGCCCGCTCAGACTCATTCTTGAAAATACCTCCGG (SEQIDNO:77) 454R-P53-4 GCCTTGCCAGCCCGCTCAGAAATGCCACCTCGATTTAGGAAA (SEQIDNO:78) 454R-P53-5 GCCTTGCCAGCCCGCTCAGTCACCCTCCCGAATAGCT (SEQIDNO:79) 454R-P53-6 GCCTTGCCAGCCCGCTCAGAGTGTAAAATGGTACAACCGCT (SEQIDNO:80) 454R-P53-7 GCCTTGCCAGCCCGCTCAGCCTCTTAAGATACTGTAAACTCTGTAAAGC (SEQIDNO:81) 454R-P53-8 GCCTTGCCAGCCCGCTCAGATTGTGCCATTGTACTCTAGCC (SEQIDNO:82) TAG8-P53-9+ ACACTGACGACATGGTTCTACACTTCCTTTCTCTACTGAATGCTTTTAATTT (SEQIDNO:83) TAG8-P53-10+ ACACTGACGACATGGTTCTACATCTTACACAAACTCTTCAGAAAACAGA (SEQIDNO:84) TAG8-P53-11+ ACACTGACGACATGGTTCTACAGTACCAAAACCAAACAAGGACAT (SEQIDNO:85) TAG8-P53-12+ ACACTGACGACATGGTTCTACAGGTGAAACGCCATCTCTACTAA (SEQIDNO:86) TAG8-P53-13+ ACACTGACGACATGGTTCTACATCATGATTGTAGCTGATTCAACATTCA (SEQIDNO:87) TAG8-P53-14+ ACACTGACGACATGGTTCTACAACTAGCATGCTGAAACCCC (SEQIDNO:88) TAG8-P53-15+ ACACTGACGACATGGTTCTACATCAGGAGATCGAGACCATCC (SEQIDNO:89) TAG8-P53-16+ ACACTGACGACATGGTTCTACATCATGCCTGTAATCCCAGC (SEQIDNO:90) 454R-P53-9 GCCTTGCCAGCCCGCTCAGACCTCAAATGATCCCCTGC (SEQIDNO:91) 454R-P53-10 GCCTTGCCAGCCCGCTCAGATTACAGGCGTGAGCCAC (SEQIDNO:92) 454R-P53-11 GCCTTGCCAGCCCGCTCAGTTTTGAGATGAAGTCTTGCTCTGT (SEQIDNO:93) 454R-P53-12 GCCTTGCCAGCCCGCTCAGTAAAGACCAGTCTGACTATGTTGC (SEQIDNO:94) 454R-P53-13 GCCTTGCCAGCCCGCTCAGACCATGCCCGGCTAATTTT (SEQIDNO:95) 454R-P53-14 GCCTTGCCAGCCCGCTCAGAGTTCACGCCATTCTCCTG (SEQIDNO:96) 454R-P53-15 GCCTTGCCAGCCCGCTCAGCACTACGCCCGGCTAATTTT (SEQIDNO:97) 454R-P53-16 GCCTTGCCAGCCCGCTCAGTGGCCCCATTAGGACATGTAT (SEQIDNO:98) TAG8-P53-17+ ACACTGACGACATGGTTCTACATTGTCCCATTGCACTCCAG (SEQIDNO:99) TAG8-P53-18+ ACACTGACGACATGGTTCTACATGGGCAACAAGAGTGAAACT (SEQIDNO:100) TAG8-P53-19+ ACACTGACGACATGGTTCTACAAAATAAATATAGCAGGGTTGCAGGT (SEQIDNO:101) TAG8-P53-20+ ACACTGACGACATGGTTCTACATGCATTTCTCTTGGCTCCC (SEQIDNO:102) TAG8-P53-21+ ACACTGACGACATGGTTCTACAACTTTCCTCAACTCTACATTTCCC (SEQIDNO:103) TAG8-P53-22+ ACACTGACGACATGGTTCTACATCAGTGCAAACAACAGAAAAGTG (SEQIDNO:104) TAG8-P53-23+ ACACTGACGACATGGTTCTACACATGTTTCTTAGCAAATCTGATGACA (SEQIDNO:105) TAG8-P53-24+ ACACTGACGACATGGTTCTACATCTGTGGTCCCAGCTACT (SEQIDNO:106) 454R-P53-17 GCCTTGCCAGCCCGCTCAGTTTCACCATGTTAGGTTGGTCTC (SEQIDNO:107) 454R-P53-18 GCCTTGCCAGCCCGCTCAGTGTAGGTTAAATCCAAATACTATACCGTC (SEQIDNO:108) 454R-P53-19 GCCTTGCCAGCCCGCTCAGTCTCAAATCTTCAGTAGCAACTAAAATCT (SEQIDNO:109) 454R-P53-20 GCCTTGCCAGCCCGCTCAGTCCCGACCTCAGGTGATC (SEQIDNO:110) 454R-P53-21 GCCTTGCCAGCCCGCTCAGTGGTCTTGAACTCCCAACTTC (SEQIDNO:111) 454R-P53-22 GCCTTGCCAGCCCGCTCAGCCTCCGACTCCCAAAGTG (SEQIDNO:112) 454R-P53-23 GCCTTGCCAGCCCGCTCAGACTACAGCCTCGGACTCC (SEQIDNO:113) 454R-P53-24 GCCTTGCCAGCCCGCTCAGATCTTGCACGAAGTTATGCAACTA (SEQIDNO:114) TAG8-P53-25+ ACACTGACGACATGGTTCTACAACCACTGCACTCCAGC (SEQIDNO:115) TAG8-P53-26+ ACACTGACGACATGGTTCTACAACAAGGAAAAGTATCAGACAATGTAAGT (SEQIDNO:116) TAG8-P53-27+ ACACTGACGACATGGTTCTACAACGGTAGCTCACACCTGTAAT (SEQIDNO:117) TAG8-P53-28+ ACACTGACGACATGGTTCTACATGGAAGTCCCTCTCTGATTGT (SEQIDNO:118) TAG8-P53-29+ ACACTGACGACATGGTTCTACAACTGACTTTCTGCTCTTGTCTTTC (SEQIDNO:119) TAG8-P53-30+ ACACTGACGACATGGTTCTACAATTCTGGGACAGCCAAGTC (SEQIDNO:120) TAG8-P53-31+ ACACTGACGACATGGTTCTACAAGGAGTTCAAGACCAGCCT (SEQIDNO:121) TAG8-P53-32+ ACACTGACGACATGGTTCTACATCTGTCTCCTTCCTCTTCCTAC (SEQIDNO:122) 454R-P53-25 GCCTTGCCAGCCCGCTCAGCCTCTTCCCCAAAAGCTCT (SEQIDNO:123) 454R-P53-26 GCCTTGCCAGCCCGCTCAGTCTCGAACTCCTTACTTCAGGT (SEQIDNO:124) 454R-P53-27 GCCTTGCCAGCCCGCTCAGCCCAACACCATGCCAGTG (SEQIDNO:125) 454R-P53-28 GCCTTGCCAGCCCGCTCAGTCCCCAGCCCTCCAG (SEQIDNO:126) 454R-P53-29 GCCTTGCCAGCCCGCTCAGATTGAAGTCTCATGGAAGCCAG (SEQIDNO:127) 454R-P53-30 GCCTTGCCAGCCCGCTCAGTCAAGTGATCTTCCCACCTCA (SEQIDNO:128) 454R-P53-31 GCCTTGCCAGCCCGCTCAGACAACCTCCGTCATGTGC (SEQIDNO:129) 454R-P53-32 GCCTTGCCAGCCCGCTCAGACCCATTTACTTTGCACATCTCA (SEQIDNO:130) TAG8-P53-33+ ACACTGACGACATGGTTCTACATTAAGGGTGGTTGTCAGTGG (SEQIDNO:131) TAG8-P53-34+ ACACTGACGACATGGTTCTACATTGCAGTGAGCTGAGATCAC (SEQIDNO:132) TAG8-P53-35+ ACACTGACGACATGGTTCTACAATCTCCTTACTGCTCCCACT (SEQIDNO:133) TAG8-P53-36+ ACACTGACGACATGGTTCTACATTTTATCACCTTTCCTTGCCTCTT (SEQIDNO:134) TAG8-P53-37+ ACACTGACGACATGGTTCTACAACTCGTCGTAAGTTGAAAATATTGTAAGT (SEQIDNO:135) TAG8-P53-38+ ACACTGACGACATGGTTCTACATCCCAAAGTGCTGGGATTAC (SEQIDNO:136) TAG8-P53-39+ ACACTGACGACATGGTTCTACATCCATCCTCCCAGCTCAG (SEQIDNO:137) TAG8-P53-40+ ACACTGACGACATGGTTCTACAATCTCAGCTCACTGCAGC (SEQIDNO:138) 454R-P53-33 GCCTTGCCAGCCCGCTCAGAGCCAACCTAGGAGATAACACA (SEQIDNO:139) 454R-P53-34 GCCTTGCCAGCCCGCTCAGAGGCTCCATCTACTCCCAA (SEQIDNO:140) 454R-P53-35 GCCTTGCCAGCCCGCTCAGTTGATAAGAGGTCCCAAGACTTAGTA (SEQIDNO:141) 454R-P53-36 GCCTTGCCAGCCCGCTCAGTGGGTGACAGAGTGAGACT (SEQIDNO:142) 454R-P53-37 GCCTTGCCAGCCCGCTCAGACATCACTGTAATCCAGCCTG (SEQIDNO:143) 454R-P53-38 GCCTTGCCAGCCCGCTCAGAGATCATGCCACTGCACTC (SEQIDNO:144) 454R-P53-39 GCCTTGCCAGCCCGCTCAGGGCATGTGCCTGTAGTCC (SEQIDNO:145) 454R-P53-40 GCCTTGCCAGCCCGCTCAGTGGTCTTGAACTCCTGACCT (SEQIDNO:146) TAG8-P53-41+ ACACTGACGACATGGTTCTACAAAACAGCATGGTTGCATGAAAG (SEQIDNO:147) TAG8-P53-42+ ACACTGACGACATGGTTCTACAAGTCGCATGCACATGTAGTC (SEQIDNO:148) TAG8-P53-43+ ACACTGACGACATGGTTCTACAAAAAGTCAGCTGTATAGGTACTTGAAG (SEQIDNO:149) TAG8-P53-44+ ACACTGACGACATGGTTCTACACCTCAGTGTATCCACAGAACA (SEQIDNO:150) TAG8-P53-45+ ACACTGACGACATGGTTCTACAATGCATGCCTGTAATCCCAG (SEQIDNO:151) TAG8-P53-46+ ACACTGACGACATGGTTCTACAAACTCATGTTCAAGACAGAAGGG (SEQIDNO:152) TAG8-P53-47+ ACACTGACGACATGGTTCTACAATTTTCTCTAACTTCAAGGCCCATAT (SEQIDNO:153) TAG8-P53-48+ ACACTGACGACATGGTTCTACATGGATCCACCAAGACTTGTTTTAT (SEQIDNO:154) 454R-P53-41 GCCTTGCCAGCCCGCTCAGGATTACAGGTGTGAGCCACT (SEQIDNO:155) 454R-P53-42 GCCTTGCCAGCCCGCTCAGACAGTACCTGAGTTAAAAGATGGTTC (SEQIDNO:156) 454R-P53-43 GCCTTGCCAGCCCGCTCAGTGAGACCCTCCAGCTCTG (SEQIDNO:157) 454R-P53-44 GCCTTGCCAGCCCGCTCAGATCTTCCCTTACCCCATTTTACTTTATT (SEQIDNO:158) 454R-P53-45 GCCTTGCCAGCCCGCTCAGTTCAAAGACCCAAAACCCAAAATG (SEQIDNO:159) 454R-P53-46 GCCTTGCCAGCCCGCTCAGGTCAAGTTCTAGACCCCATGTAATA (SEQIDNO:160) 454R-P53-47 GCCTTGCCAGCCCGCTCAGTGTGGTCCCAGCTACTCC (SEQIDNO:161) 454R-P53-48 GCCTTGCCAGCCCGCTCAGAGCAAAGTTTTATTGTAAAATAAGAGATCGAT (SEQIDNO:162)
Example 4
4-Primer Barcoding of Target Nucleic Acids in Preparation for 454 DNA Sequencing Using a Microfluidic Device that Permits Recovery of Amplication Products
[0263] Target-specific primers were designed for 48 genomic regions associated with prostate cancer. In addition to the target-specific regions, the primers were designed to contain additional tag sequences at the 5 end. Forward primers contained the sequence ACACTGACGACATGGTTCTACA (SEQ ID NO:163). Reverse primers contained the sequence TACGGTAGCAGAGACTTGGTCT (SEQ ID NO:164). The sequences of the primers containing both tag sequences and the target-specific regions are listed in Table 5.
TABLE-US-00006 TABLE5 Amplicon Assay TaggedForward TaggedReverse size # AssayName primersequence primersequence Ampliconposition (notags) 1 MSMB-1 ACACTGACGACATGG TACGGTAGCAGAGACT chr10:51219512+ 157 TTCTACAGTGGTTGC TGGTCTGCACACGCAT 51219668 CCTCTCCAGTA ATTAAAATAGGAA (SEQIDNO:547) (SEQIDNO:165) 2 MSMB-2 ACACTGACGACATGGT TACGGTAGCAGAGACT chr10:51225703+ 208 TCTACATCATTCTCCA TGGTCTTTCATCTGCA 51225910 CCCTGACCTT GACAGGTCCA (SEQIDNO:548) (SEQIDNO:166) 3 MSMB-3 ACACTGACGACATGGT TACGGTAGCAGAGACT chr10:51226702+ 183 TCTACAAGGCCTTGTT TGGTCTCCAGCACTGG 51226884 CTCATTGCAT CTTGAGACTT (SEQIDNO:549) (SEQIDNO:167) 4 MSMB-4 ACACTGACGACATGGT TACGGTAGCAGAGACT chr10:51232232+ 229 TCTACAGGGTCCTTTC TGGTCTAGGCCAGAGG 51232460 TCTTCTAACAGG AGAATGAGG (SEQIDNO:550) (SEQIDNO:168) 5 HNF1B-1 ACACTGACGACATGGT TACGGTAGCAGAGACT chr17:33121423+ 138 TCTACACAGAGGGTGA TGGTCTATGACCCTGC 33121560 TGGTGTGGA CAAATGACAC (SEQIDNO:551) (SEQIDNO:169) 6 HNF1B-5 ACACTGACGACATGGT TACGGTAGCAGAGACT chr17:33138980+ 252 TCTACATGCTTCCCAT TGGTCTTGGAAACTGC 33139231 TCTTCTTCTCC TCTTTGTGGTC (SEQIDNO:552) (SEQIDNO:170) 7 HNF1B-6 ACACTGACGACATGGT TACGGTAGCAGAGACT chr17:33144574+ 254 TCTACATGCCTCTTAT TGGTCTTGGTGGCACT 33144827 CTTATCAGCTCCA AATGTTCCCTA (SEQIDNO:553) (SEQIDNO:171) 8 HNF1B-7 ACACTGACGACATGGT TACGGTAGCAGAGACT chr17:33165634+ 234 TCTACATAAGATCCGT TGGTCTGAGGTCCGTG 33165867 GGCAAGAACC TCTACAACTGG (SEQIDNO:554) (SEQIDNO:172) 9 HNF1B-8 ACACTGACGACATGGT TACGGTAGCAGAGACT chr17:33165796+ 175 TCTACAGTCCATGGCC TGGTCTCCCCTCACTC 33165970 AGCTTTTG ACCATCTCC (SEQIDNO:555) (SEQIDNO:173) 10 HNF1B-9 ACACTGACGACATGGT TACGGTAGCAGAGACT chr17:33167605+ 215 TCTACAAGGGTTCCTG TGGTCTAGTCCGATGA 33167819 GGTCTGTGTA TGCCTGCT (SEQIDNO:556) (SEQIDNO:174) 11 HNF1B-10 ACACTGACGACATGGT TACGGTAGCAGAGACT chr17:33167782+ 194 TCTACACTTCTTGTTG TGGTCTTGAGTGAAGG 33167975 GTGGGCTCAG CTACAGACCCTA (SEQIDNO:557) (SEQIDNO:175) 12 HNF1B-11 ACACTGACGACATGGT TACGGTAGCAGAGACT chr17:33173490+ 192 TCTACATGAGAGGGCA TGGTCTAGAGGGAGGT 33173681 AAGGTCACTT GGTCGATGT (SEQIDNO:558) (SEQIDNO:176) 13 HNF1B-12 ACACTGACGACATGGT TACGGTAGCAGAGACT chr17:33173623+ 160 TCTACAGTTGAGATGC TGGTCTTCTCCCACTA 33173782 TGGGAGAGGT GTACCCTAACCATC (SEQIDNO:559) (SEQIDNO:177) 14 MYC-1 ACACTGACGACATGGT TACGGTAGCAGAGACT chr8:128817980+ 142 TCTACAGACCCGCTTC TGGTCTGCATTCGACT 128818121 TCTGAAAGG CATCTCAGCA (SEQIDNO:560) (SEQIDNO:178) 15 MYC-2 ACACTGACGACATGGT TACGGTAGCAGAGACT chr8:128819612+ 247 TCTACACAGGTTTCCG TGGTCTCAGCAGCTCG 128819858 CACCAAGA AATTTCTTCC (SEQIDNO:561) (SEQIDNO:179) 16 MYC-6 ACACTGACGACATGGT TACGGTAGCAGAGACT chr8:128821784+ 255 TCTACAAACCTTGCTA TGGTCTCCTCTTGGCA 128822038 AAGGAGTGATTTCT GCAGGATAGT (SEQIDNO:562) (SEQIDNO:180) 17 MYC-7 ACACTGACGACATGGT TACGGTAGCAGAGACT chr8:128821968+ 250 TCTACAACGTCTCCAC TGGTCTAACTCCGGGA 128822217 ACATCAGCAC TCTGGTCAC (SEQIDNO:563) (SEQIDNO:181) 18 MYC-8 ACACTGACGACATGGT TACGGTAGCAGAGACT chr8:128822158+ 263 TCTACACCAGAGGAGG TGGTCTTTCTGTTAGA 128822420 AACGAGCTAA AGGAATCGTTTTCC (SEQIDNO:564) (SEQIDNO:182) 19 JAZF1-2 ACACTGACGACATGGT TACGGTAGCAGAGACT chr7:27846803+ 244 TCTACATTCCATGTGG TGGTCTCTCCTGACAG 27847046 TTATGCCAAG TCCTTGCACTT (SEQIDNO:565) (SEQIDNO:183) 20 JAZF1-4 ACACTGACGACATGGT TACGGTAGCAGAGACT chr7:27998002+ 195 TCTACACAATAAGCAG TGGTCTCTTTGTGTTA 27998196 CAGATATAAGGTTGTT GGTAGCCTCATATATT (SEQIDNO:566) C (SEQIDNO:184) 21 NCOA4-1 ACACTGACGACATGGT TACGGTAGCAGAGACT chr10:51249073+ 265 TCTACATTCAAAGGTG TGGTCTGCCCTGTGTC 51249337 GTTTTTGGTTG AAGAGTCCAG (SEQIDNO:567) (SEQIDNO:185) 22 NCOA4-2 ACACTGACGACATGGT TACGGTAGCAGAGACT chr10:51250503+ 246 TCTACATTGGGAAACA TGGTCTACCAGAAGCC 51250748 TCATTCTTTGG ATGCTCAAAC (SEQIDNO:568) (SEQIDNO:186) 23 NCOA4-3 ACACTGACGACATGGT TACGGTAGCAGAGACT chr10:51250847+ 250 TCTACATGGTGTCATT TGGTCTTGATCTTATC 51251096 GTGGCTAGTTG CTAGCAACACAGAAG (SEQIDNO:569) (SEQIDNO:187) 24 NCOA4-4 ACACTGACGACATGGT TACGGTAGCAGAGACT chr10:51251218+ 201 TCTACATGAAGTTGAT TGGTCTAGAAGTGCCC 51251418 GAAACGATATTCCTTA AGTGAAGCAT (SEQIDNO:570) (SEQIDNO:188) 25 NCOA4-5 ACACTGACGACATGGT TACGGTAGCAGAGACT chr10:51252141+ 197 TCTACATTGGCAGCAT TGGTCTCCCAAAGGAA 51251418 AGCATAAATAACA GTATAAGCCAAG (SEQIDNO:571) (SEQIDNO:189) 26 NCOA4-6 ACACTGACGACATGGT TACGGTAGCAGAGACT chr10:51252768+ 227 TCTACACTGCATTTGA TGGTCTTCCACCTACT 51252994 CATTCCTTGTTT GCTGTGTCTACTG (SEQIDNO:572) (SEQIDNO:190) 27 NCOA4-7 ACACTGACGACATGGT TACGGTAGCAGAGACT chr10:51254556+ 260 TCTACAGCAGACAGAA TGGTCTTCTGATAGGT 51254815 TCTCCAAAGCA CCATCTCATCTTGA (SEQIDNO:573) (SEQIDNO:191) 28 NCOA4-8 ACACTGACGACATGGT TACGGTAGCAGAGACT chr10:51254768+ 255 TCTACAGGTTGGAGAT TGGTCTTGGTCATTCA 51255022 CAAGAGCTTCC GGCACTTCAG (SEQIDNO:574) (SEQIDNO:192) 29 NCOA4-9 ACACTGACGACATGGT TACGGTAGCAGAGACT chr10:51254962+ 253 TCTACAGAAACCAGCC TGGTCTCCTTCTTTCT 51255214 CAAAGGTGT TCAGAAGCCACT (SEQIDNO:575) (SEQIDNO:193) 30 NCOA4-10 ACACTGACGACATGGT TACGGTAGCAGAGACT chr10:51255167+ 266 TCTACAGAATTGTGAG TGGTCTTGGGACTTCC 51255432 AAGGAGGCTCTG TTCTTTGTATGG (SEQIDNO:576) (SEQIDNO:194) 31 NCOA4-11 ACACTGACGACATGGT TACGGTAGCAGAGACT chr10:51255385+ 249 TCTACACCTTGTCGGA TGGTCTCCAGTGCTAT 51255633 GTGGCTTATC TTTGATGTTTATGC (SEQIDNO:577) (SEQIDNO:195) 32 NCOA4-13 ACACTGACGACATGGT TACGGTAGCAGAGACT chr10:51259156+ 155 TCTACAGGAGCTTTAA TGGTCTTTGGCAAGCT 51259310 GGCAGGGAAA GCAGTCAC (SEQIDNO:578) (SEQIDNO:196) 33 NUDT11-1 ACACTGACGACATGGT TACGGTAGCAGAGACT chrX:51255496+ 253 TCTACAAGCGAGGCAG TGGTCTGTACTGACTG 51255748 ACAAATAGAAG TCACGGAGCTG (SEQIDNO:579) (SEQIDNO:197) 34 SLC22A3-4 ACACTGACGACATGGT TACGGTAGCAGAGACT chr6:160738955+ 209 TCTACATCTGCATTCT TGGTCTTCCCCGTATT 160739163 GGCATGTCTC AATGCATGGTAT (SEQIDNO:580) (SEQIDNO:198) 35 SLC22A3-5 ACACTGACGACATGGT TACGGTAGCAGAGACT chr6:160748030+ 245 TCTACAAAGGTGAGCT TGGTCTTTGTTGGCTA 160748274 CTTTTCCTGTCTT TCTGGCCCTA (SEQIDNO:581) (SEQIDNO:199) 36 SLC22A3-6 ACACTGACGACATGGT TACGGTAGCAGAGACT chr6:160749740+ 268 TCTACATGCTTCTGTG TGGTCTGTCTGTTTGG 160750007 ACCTCTTGTGT AGTCTAATTTCTGC (SEQIDNO:582) (SEQIDNO:200) 37 SLC22A3-7 ACACTGACGACATGGT TACGGTAGCAGAGACT chr6:160751720+ 201 TCTACACATAACTCAC TGGTCTAATCAATTCA 160751920 AACAGCCTCCTTC CCAGCTTTAGCAA (SEQIDNO:583) (SEQIDNO:201) 38 SLC22A3-10 ACACTGACGACATGGT TACGGTAGCAGAGACT chr6:160778107+ 202 TCTACAGTGGTGGAAC TGGTCTGGCTCCCTAT 160778308 TGCCAGGA ACTTGATTGTGG (SEQIDNO:584) (SEQIDNO:202) 39 SLC22A3-11 ACACTGACGACATGGT TACGGTAGCAGAGACT chr6:160783754+ 189 TCTACACCTCCCTTTC TGGTCTCGCTGGTCTA 160783942 AAACTTTCTGTG CAGAGTTACTTAGGA (SEQIDNO:585) (SEQIDNO:203) 40 SLC22A3-12 ACACTGACGACATGGT TACGGTAGCAGAGACT chr6:160784591+ 208 TCTACATGATTATCTT TGGTCTTGAAGGCTCT 160784798 GAAGTCACTTGTTGAA TAAGAATAGCAAATG (SEQIDNO:586) (SEQIDNO:204) 41 SLC22A3-13 ACACTGACGACATGGT TACGGTAGCAGAGACT chr6:160788700+ 235 TCTACAGTGTCTTCCT TGGTCTTTCCCTGTGG 160788934 GGAGCGGTAA ATATTCAATTTTCT (SEQIDNO:587) (SEQIDNO:205) 42 SLC22A3-14 ACACTGACGACATGGT TACGGTAGCAGAGACT chr6:160791984+ 169 TCTACATCTTTCCTAA TGGTCTATCTCTGCAA 160792152 AGACTTTCTCCTTTG GGCACAGCTT (SEQIDNO:588) (SEQIDNO:206) 43 KLK3-1 ACACTGACGACATGGT TACGGTAGCAGAGACT chr19:56049936+ 205 TCTACAAGTCCTGGGG TGGTCTGGAAAGAGCC 56050140 AATGAAGGTT TCAGCTTGAC (SEQIDNO:589) (SEQIDNO:207) 44 KLK3-2 ACACTGACGACATGGT TACGGTAGCAGAGACT chr19:56051260+ 256 TCTACAGTTCCTCCTG TGGTCTCCTCTGGGAC 56051515 TCAACCCTGA ACAGACACCT (SEQIDNO:590) (SEQIDNO:208) 45 KLK3-3 ACACTGACGACATGGT TACGGTAGCAGAGACT chr19:56053051+ 250 TCTACATCCTTATCAT TGGTCTTTCACAGCAT 56053300 CCTCGCTCCT CCGTGAGC (SEQIDNO:591) (SEQIDNO:209) 46 KLK3-4 ACACTGACGACATGGT TACGGTAGCAGAGACT chr19:56053237+ 200 TCTACAACTCCAGCCA TGGTCTCCCTCAGACC 56053436 CGACCTCAT CAGGCATC (SEQIDNO:592) (SEQIDNO:210) 47 KLK3-5 ACACTGACGACATGGT TACGGTAGCAGAGACT chr19:56053490+ 240 TCTACAGGTCCAGCCC TGGTCTCCCAGCCCAG 56053729 ACAACAGT AATTAAGGT (SEQIDNO:593) (SEQIDNO:211) 48 KLK3-8 ACACTGACGACATGGT TACGGTAGCAGAGACT chr19:56054924+ 192 TCTACATCTTCCAAAG TGGTCTGGGCACATGG 56055115 CTGGGAACTG TTCACTGC (SEQIDNO:594) (SEQIDNO:212)
Preparation of Reaction Mixtures
[0264] Primers were synthesized by IDT at 10 nmol scale, and provided resuspended in water at a concentration of 100 uM. The forward and reverse primer for each region in Table 5 were combined in separate wells in a 96-well PCR plate (USA scientific) to a final concentration of 1 M of each primer in PCR-quality water (Teknova) containing 0.05% Tween-20.
[0265] 48 human genomic DNA samples from the HapMap sample collection were resuspended at 50 ng/l in low-EDTA TE buffer (Teknova), and prepared for PCR as follows.
[0266] A pre-sample mixture was prepared as follows:
TABLE-US-00007 TABLE 6 Volume per Volume for 64 Pre-sample mixture sample (l) samples (l) Faststart High Fidelity 0.5 32 reaction Buffer with MgCl.sub.2 DMSO 0.1 6.4 PCR-Grade Nucleotide 0.1 6.4 Mixture Faststart High-Fidelity 0.05 3.2 Enzyme Blend (Roche 04 738 292 001) 20x Access Array Loading 0.25 16 Reagent (PN: 100-0883) 20x Evagreen (Biotium- 0.25 16 31000) 20x ROX dye (Invitrogen 0.25 16 12223-012) PCR-Grade water 0.5 32 Total 2 128
[0267] For each sample, a sample mixture containing forward and reverse barcode primers, genomic DNA, and pre-sample mix was prepared in an individual well in a 96-well PCR plate.
TABLE-US-00008 TABLE 7 Sample Mixture Volume (l) Pre-sample Mixture 2 2 M forward barcode primer 0.5 2 m reverse barcode primer 0.5 Genomic DNA (50 ng/l) 1 PCR-grade water 1
[0268] Each sample was mixed with one pair of barcode primers selected from
[0269] Table 8.
TABLE-US-00009 TABLE8 Reverse Reverse Forward barcodeprimer barcodeprimer Forwardbarcodeprimer barcodeprimer (454B-BC#-CS1) SEQIDNO. (454A-BC#-CS2) SEQIDNO. 1 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGGCAT (SEQID TCAGGCATGCTACGG NO:165) GCACACTGACGACATGGTTCTAC NO:166) TAGCAGAGACTTGGT A CT 2 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGCGTA (SEQID TCAGCGTACGTACGG NO:167) CGACACTGACGACATGGTTCTAC NO:168) TAGCAGAGACTTGGT A CT 3 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGGTCA (SEQID TCAGGTCAGCTACGG NO:169) GCACACTGACGACATGGTTCTAC NO:170) TAGCAGAGACTTGGT A CT 4 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGAGCT (SEQID TCAGAGCTGCTACGG NO:171) GCACACTGACGACATGGTTCTAC NO:172) TAGCAGAGACTTGGT A CT 5 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGTGCA (SEQID TCAGTGCATCTACGG NO:173) TCACACTGACGACATGGTTCTAC NO:174) TAGCAGAGACTTGGT A CT 6 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGCTGA (SEQID TCAGCTGATGTACGG NO:175) TGACACTGACGACATGGTTCTAC NO:176) TAGCAGAGACTTGGT A CT 7 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGGTAG (SEQID TCAGGTAGTCTACGG NO:177) TCACACTGACGACATGGTTCTAC NO:178) TAGCAGAGACTTGGT A CT 8 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGGTCG (SEQID TCAGGTCGATTACGG NO:179) ATACACTGACGACATGGTTCTAC NO:180) TAGCAGAGACTTGGT A CT 9 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGGATA (SEQID TCAGGATACGTACGG NO:181) CGACACTGACGACATGGTTCTAC NO:182) TAGCAGAGACTTGGT A CT 10 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGTGAT (SEQID TCAGTGATGCTACGG NO:183) GCACACTGACGACATGGTTCTAC NO:184) TAGCAGAGACTTGGT A CT 11 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGAGCT (SEQID TCAGAGCTGATACGG NO:185) GAACACTGACGACATGGTTCTAC NO:186) TAGCAGAGACTTGGT A CT 12 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGACTG (SEQID TCAGACTGTATACGG NO:187) TAACACTGACGACATGGTTCTAC NO:188) TAGCAGAGACTTGGT A CT 13 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGTGCA (SEQID TCAGTGCATGTACGG NO:189) TGACACTGACGACATGGTTCTAC NO:190) TAGCAGAGACTTGGT A CT 14 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGAGTC (SEQID TCAGAGTCTATACGG NO:191) TAACACTGACGACATGGTTCTAC NO:192) TAGCAGAGACTTGGT A CT 15 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGTGTC (SEQID TCAGTGTCTGTACGG NO:193) TGACACTGACGACATGGTTCTAC NO:194) TAGCAGAGACTTGGT A CT 16 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGGCTA (SEQID TCAGGCTAGCTACGG NO:195) GCACACTGACGACATGGTTCTAC NO:196) TAGCAGAGACTTGGT A CT 17 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGGATA (SEQID TCAGGATAGCTACGG NO:197) GCACACTGACGACATGGTTCTAC NO:198) TAGCAGAGACTTGGT A CT 18 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGGCTA (SEQID TCAGGCTACTTACGG NO:199) CTACACTGACGACATGGTTCTAC NO:200) TAGCAGAGACTTGGT A CT 19 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGCTAT (SEQID TCAGCTATGCTACGG NO:201) GCACACTGACGACATGGTTCTAC NO:202) TAGCAGAGACTTGGT A CT 20 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGGCTA (SEQID TCAGGCTATGTACGG NO:203) TGACACTGACGACATGGTTCTAC NO:204) TAGCAGAGACTTGGT A CT 21 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGCGTG (SEQID TCAGCGTGCATACGG NO:205) CAACACTGACGACATGGTTCTAC NO:206) TAGCAGAGACTTGGT A CT 22 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGATAG (SEQID TCAGATAGCTTACGG NO:207) CTACACTGACGACATGGTTCTAC NO:208) TAGCAGAGACTTGGT A CT 23 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGTGTA (SEQID TCAGTGTAGCTACGG NO:209) GCACACTGACGACATGGTTCTAC NO:210) TAGCAGAGACTTGGT A CT 24 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGGTGC (SEQID TCAGGTGCTATACGG NO:211) TAACACTGACGACATGGTTCTAC NO:212) TAGCAGAGACTTGGT A CT 25 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGGTCA (SEQID TCAGGTCATGTACGG NO:213) TGACACTGACGACATGGTTCTAC NO:214) TAGCAGAGACTTGGT A CT 26 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGATCG (SEQID TCAGATCGTGTACGG NO:215) TGACACTGACGACATGGTTCTAC NO:216) TAGCAGAGACTTGGT A CT 27 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGTGTA (SEQID TCAGTGTACGTACGG NO:217) CGACACTGACGACATGGTTCTAC NO:218) TAGCAGAGACTTGGT A CT 28 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGAGTG (SEQID TCAGAGTGTATACGG NO:219) TAACACTGACGACATGGTTCTAC NO:220) TAGCAGAGACTTGGT A CT 29 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGTGAC (SEQID TCAGTGACAGTACGG NO:221) AGACACTGACGACATGGTTCTAC NO:222) TAGCAGAGACTTGGT A CT 30 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGGATC (SEQID TCAGGATCACTACGG NO:223) ACACACTGACGACATGGTTCTAC NO:224) TAGCAGAGACTTGGT A CT 31 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGCTAG (SEQID TCAGCTAGAGTACGG NO:225) AGACACTGACGACATGGTTCTAC NO:226) TAGCAGAGACTTGGT A CT 32 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGCTAG (SEQID TCAGCTAGTCTACGG NO:227) TCACACTGACGACATGGTTCTAC NO:228) TAGCAGAGACTTGGT A CT 33 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGAGCT (SEQID TCAGAGCTAGTACGG NO:229) AGACACTGACGACATGGTTCTAC NO:230) TAGCAGAGACTTGGT A CT 34 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGTGAC (SEQID TCAGTGACTGTACGG NO:231) TGACACTGACGACATGGTTCTAC NO:232) TAGCAGAGACTTGGT A CT 35 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGTGAT (SEQID TCAGTGATAGTACGG NO:233) AGACACTGACGACATGGTTCTAC NO:234) TAGCAGAGACTTGGT A CT 36 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGCGTA (SEQID TCAGCGTATCTACGG NO:235) TCACACTGACGACATGGTTCTAC NO:236) TAGCAGAGACTTGGT A CT 37 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGGTCT (SEQID TCAGGTCTGATACGG NO:237) GAACACTGACGACATGGTTCTAC NO:238) TAGCAGAGACTTGGT A CT 38 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGCATG (SEQID TCAGCATGACTACGG NO:239) ACACACTGACGACATGGTTCTAC NO:240) TAGCAGAGACTTGGT A CT 39 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGCGAT (SEQID TCAGCGATGATACGG NO:241) GAACACTGACGACATGGTTCTAC NO:242) TAGCAGAGACTTGGT A CT 40 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGGCTG (SEQID TCAGGCTGATTACGG NO:243) ATACACTGACGACATGGTTCTAC NO:244) TAGCAGAGACTTGGT A CT 41 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGCAGT (SEQID TCAGCAGTACTACGG NO:245) ACACACTGACGACATGGTTCTAC NO:246) TAGCAGAGACTTGGT A CT 42 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGGCGA (SEQID TCAGGCGACTTACGG NO:247) CTACACTGACGACATGGTTCTAC NO:248) TAGCAGAGACTTGGT A CT 43 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGGTAC (SEQID TCAGGTACGATACGG NO:249) GAACACTGACGACATGGTTCTAC NO:250) TAGCAGAGACTTGGT A CT 44 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGACGC (SEQID TCAGACGCTATACGG NO:251) TAACACTGACGACATGGTTCTAC NO:252) TAGCAGAGACTTGGT A CT 45 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGAGCA (SEQID TCAGAGCATCTACGG NO:253) TCACACTGACGACATGGTTCTAC NO:254) TAGCAGAGACTTGGT A CT 46 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGGATG (SEQID TCAGGATGCTTACGG NO:255) CTACACTGACGACATGGTTCTAC NO:256) TAGCAGAGACTTGGT A CT 47 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGGTCT (SEQID TCAGGTCTGCTACGG NO:257) GCACACTGACGACATGGTTCTAC NO:258) TAGCAGAGACTTGGT A CT 48 GCCTTGCCAGCCCGC (SEQID GCCTCCCTCGCGCCATCAGATGC (SEQID TCAGATGCGATACGG NO:259) GAACACTGACGACATGGTTCTAC NO:260) TAGCAGAGACTTGGT A CT
Running the Access Array IFC
[0270] The containment and interface accumulator reservoirs were filled with 300 l of Control Line Fluid (Fluidigm PN 89000020) and the H1-H4 reagent wells were loaded with 500 l of 0.05% Tween-20 in PCR-grade water prior to Access Array IFC loading. 5 l of each sample mixture was loaded into the sample ports, and 5 l of each primer mixture was loaded into the primer inlets on the Access Array IFC.
[0271] The Access Array IFC was thermal cycled and imaged using a BioMark Real-Time PCR system manufactured by Fluidigm Corporation. The Access Array IFC thermal cycling protocol contains a thermal mix step [50 C. for 2 min, 70 C. for 20 min], a hotstart step [95 C. for 10 min], a 35 cycle touch down PCR strategy [2 cycles of 95 C. for 15 sec and 63 C. for 1 min, 2 cycles of 95 C. for 15 sec and 62 C. for 1 min, 2 cycles of 95 C. for 15 sec and 61 C. for 1 min, 2 cycles of 95 C. for 15 sec and 60 C. for 1 min, 2 cycles of 95 C. for 15 sec and 58 C. for 1 min, 25 cycles of 95 C. for 15 sec and 72 C. for 1 min], and an elongation step [72 C. for 3 min]. The real-time data was analyzed with Fluidigm Real-Time PCR Analysis software to obtain C.sub.T values for each reaction chamber.
[0272] After amplification, the PCR products were harvested from the Access Array IFC using the Post-PCR IFC Loader AX. Before harvesting, each sample port was filled with 2 l of 0.05% Tween-20. Residual solution was removed from the H1-H4 reagent wells, and they were refilled with 600 l of 1 Access Array Harvesting Reagent (0.05% tween-20). After harvesting, each sample port became a PCR product outlet that contained 10 l (10%) of 48 pooled PCR products. The pooled PCR products were removed from the Access Array IFC and stored in a microtiter plate at 4 C.
[0273] 1 l of each PCR product pool for each sample was taken and loaded onto an Agilent 1K Bioanalyzer chip.
[0274] PCR Products for each sample were pooled based on concentrations calculated from the Agilent Bioanalyzer traces. The product pool was purified using AMPure beads (Agencourt) according to the manufacturer's instructions.
[0275] The purified product pool was subjected to emulsion PCR followed by pyrosequencing on a 454 FLX sequencer (Roche Analytical Sciences) according to manufacturer's instructions. The sequence file output by the sequencer was then analyzed for the presence of barcoded PCR products.
[0276] The number of sequences obtained for each barcode were counted, and plotted (
[0277] The number of sequences counted for each individual PCR product in each sample were then analyzed (
Example 5
Multi-Primer Amplification Using Four Outer Primers With Different Combinations of Primer Binding Site and Nucleotide Tags
[0278] Sets of primer pairs were designed to amplify specific regions from the EGFR and MET genes. These were then combined in an Access Array IFC with human genomic DNA and four outer primers (
Preparation of Reaction Mixtures
[0279] Primers were synthesized by Eurofins MWG Operon at 10 nmol scale and provided resuspended in water at a concentration of 100 M. The forward and reverse primer for each region in Table 9 were combined in separate wells in a 96-well PCR plate (USA scientific) to a final concentration of 1 M of each primer in PCR-quality water (Teknova) containing 0.05% Tween-20.
TABLE-US-00010 TABLE9 ForwardPrimer ReversePrimer Assay ForwardPrimer SEQIDNO. ReversePrimer SEQIDNO. EGFR_Exon3 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACATTCTTA NO:261) TCTCCAGCCTCTCACCCTG NO:262) GACCATCCAGGAGG TAAA TG EGFR_Exon4 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACAAGCTGG NO:263) TCTTAGGAGCTGGAGGCAG NO:264) AAAGAGTGCTCACC AGAT EGFR_Exon5 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACAGCGTCA NO:265) TCTACATGGGTCTGAGGCT NO:266) TCAGTTTCTCATCA GTTC TT EGFR_Exon6 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACACCCTGG NO:267) TCTTCTTACCAGGCAGTCG NO:268) GAAATGATCCTACC CTCT EGFR_Exon7 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACACCAGCG NO:269) TCTGACAAGGATGCCTGAC NO:270) TGTCCTCTCTCCT CAGT EGFR_Exon8 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACACAAAGG NO:271) TCTGATGTGTTCCTTTGGA NO:272) AGGATGGAGCCTTT GGTGG C EGFR_Exon9 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACATCCAAC NO:273) TCTCAAGCAACTGAACCTG NO:274) AAATGTGAACGGAA TGACTC T EGFR_Exon10 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACAGATCAA NO:275) TCTTTCCAAGGGAACAGGA NO:276) TAATCACCCTGTTG AATATG TTTG EGFR_Exon11 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACATCCTAC NO:277) TCTGCTTTGGCTGTGGTCA NO:278) GTGGTGTGTGTCTG ACTT A EGFR_Exon12 ACACTGACGACATG (SEQID (SEQID GTTCTACACCACAT NO:279) TACGGTAGCAGAGACTTGG NO:280) GATTTTTCTTCTCT TCTCGGTGACTTACTGCAG CCA CTGTT EGFR_Exon13 ACACTGACGACATG (SEQID (SEQID GTTCTACAGCTCTG NO:281) TACGGTAGCAGAGACTTGG NO:282) TCACTGACTGCTGT TCTGCTATAACAACAACCT G GGAGCCT EGFR_Exon14 ACACTGACGACATG (SEQID (SEQID GTTCTACAGCTGAC NO:283) TACGGTAGCAGAGACTTGG NO:284) GGGTTTCCTCTTC TCTGACGTGGATAGCAGCA AGG EGFR_Exon15 ACACTGACGACATG (SEQID (SEQID GTTCTACAGCATGA NO:285) TACGGTAGCAGAGACTTGG NO:286) ACATTTTTCTCCAC TCTTTCTGTTCTCCTTCAC CT TTTCCAC EGFR_Exon16 ACACTGACGACATG (SEQID (SEQID GTTCTACATTTCTC NO:287) TACGGTAGCAGAGACTTGG NO:288) TTTCACTTCCTACA TCTCCACAGCAGTGTGGTC GATGC ATTC EGFR_Exon17 ACACTGACGACATG (SEQID (SEQID GTTCTACATGGAAT NO:289) TACGGTAGCAGAGACTTGG NO:290) CTGTCAGCAACCTC TCTCCCAGGACTGGCACTC A EGFR_Exon18 ACACTGACGACATG (SEQID (SEQID GTTCTACAGCTGAG NO:291) TACGGTAGCAGAGACTTGG NO:292) GTGACCCTTGTCTC TCTCCCACCAGACCATGAG AGG EGFR_Exon19 ACACTGACGACATG (SEQID (SEQID GTTCTACATCACAA NO:293) TACGGTAGCAGAGACTTGG NO:294) TTGCCAGTTAACGT TCTCCACACAGCAAAGCAG CT AAAC EGFR_Exon20 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACACCACAC NO:295) TCTCCGTATCTCCCTTCCC NO:296) TGACGTGCCTCTC TGAT EGFR_Exon21 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACACCTCAC NO:297) TCTCTGACCTAAAGCCACC NO:298) AGCAGGGTCTTCTC TCCTT EGFR_Exon22 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACACACTGC NO:299) TCTCCAGCTTGGCCTCAGT NO:300) CTCATCTCTCACCA ACA EGFR_Exon23 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACACATGAT NO:301) TCTAGTGTGGACAGACCCA NO:302) CCCACTGCCTTCTT CCA EGFR_Exon24 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACATTCCAG NO:303) TCTGAGGGACTCTTCCCAA NO:304) TGTTCTAATTGCAC TGGA TGTT EGFR_Exon25 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACACTAATA NO:305) TCTTTTGTTCAAATGAGTA NO:306) GCCTCAAAATCTCT GACACAGC GCAC EGFR_Exon26 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACACATTCC NO:307) TCTTTCTGGCTTATAAGGT NO:308) ATGGGCAACTTCTC GTTCATACA EGFR_Exon27 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACACCTTCC NO:309) TCTTCCAGACAAGCCACTC NO:310) CTCATTTCCTCCTG ACC EGFR_Exon2 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID 8-1 GTTCTACAcctctg NO:311) TCTCTAATTTGGTGGCTGC NO:312) atttctttccactt CTTT tca EGFR_Exon2 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID 8-2 GTTCTACATGTCAA NO:313) TCTGGTCCTGGGTATCGAA NO:314) CAGCACATTCGACA AGAGT G EGFR_Exon2 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACATTTCTT NO:315) TCTAGGAAAATCAAAGTCA NO:316) CCAGTTTGCCAAGG CCAACC MET_Exon1-1 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACACTCTCG NO:317) TCTCAGCACAGGCCCAGTC NO:318) CCTTGAACCTGTTT TT MET_Exon1-2 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACATTCCTT NO:319) TCTGGGAGAATATGCAGTG NO:320) GGTGCCACTAACTA AACCTC CA MET_Exon2 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACATGGATT NO:321) TCTTTGCACAATACCAGAT NO:322) CACATTAACTCTAT AGAACAGAC GACCA MET_Exon3 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACATGAGCT NO:323) TCTCGTCTATGGAAATTCC NO:324) TGTTGGAATAAGGA CTGTG TG MET_Exon4 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACAGAAGCT NO:325) TCTTGCCAGCTGTTAGAGA NO:326) CTTTCCACCCCTTC TTCCT MET_Exon5 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACATGTCCT NO:327) TCTCCCCAGCAAAGCATTT NO:328) TGTAGGTTTTCCCA TAAG AA MET_Exon6 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACAGAAAAT NO:329) TCTCATGATAGGATAGAAT NO:330) TCCTTGGATTTGTC CTTCCTTACCA ATG MET_Exon7 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACAGTTTTG NO:331) TCTTTCAAATTGACAGATG NO:332) TTTTTATCTCCCCT CAACAA CCA MET_Exon8 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACAGGAACC NO:333) TCTTTGTTTTCTTATACCC NO:334) ATTGAGTTATATCC ATCAGAAGC TTTTG MET_Exon9 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACATTGGTG NO:335) TCTCAGGTACCATGAAAGC NO:336) GAAAGAACCTCTCA CACA A MET_Exon10 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACATGTTGC NO:337) TCTTTTGAGCTGATGATTT NO:338) CAAGCTGTATTCTG AAGACAGTG TT MET_Exon12 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACAGGACCC NO:339) TCTCAAGAATCGACGACAA NO:340) AAAGTGCTACAACC TCTTAAAC MET_Exon13 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACAGCCCAT NO:341) TCTCAACAATGTCACAACC NO:342) GATAGCCGTCTTTA CACTG MET_Exon14 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACACCTTCA NO:343) TCTGCTTACTGGAAAATCG NO:344) TCTTACAGATCAGT TATTTAACAAA TTCCT MET_Exon15 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACAACGCAG NO:345) TCTTCCACAAGGGGAAAGT NO:346) TGCTAACCAAGTTC GTAAA T MET_Exon16 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACATGTCTC NO:347) TCTGGCTTACAGCTAGTTT NO:348) CACCACTGGATTTC GCCAGT T MET_Exon17 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACATGCTTT NO:349) TCTTCCTCCTTGTCACTTA NO:350) TCTAACTCTCTTTG ATTTGGA ACTGC MET_Exon18 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACATTCTAT NO:351) TCTAGAGGAGAAACTCAGA NO:352) TTCAGCCACGGGTA GATAACCAA A MET_Exon19 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACACTCACC NO:353) TCTGGCATTTCTGTAAAAG NO:354) TCATCTGTCCTGTT TAAAGAACG TCT MET_Exon20 ACACTGACGACATG (SEQID TACGGTAGCAGAGACTTGG (SEQID GTTCTACACCTGCC NO:355) TCTGTGTGGACTGTTGCTT NO:356) TTCAAAGGGTCTCT TGACA
[0280] A single human Genomic DNA sample (Coriell NA10830) was resuspended at 50 ng/l in low-EDTA TE buffer (Teknova) and prepared for PCR as follows.
[0281] A pre-sample mixture was prepared as follows:
TABLE-US-00011 TABLE 10 Volume per Volume for 64 Pre-sample mixture sample (l) samples (l) Faststart High Fidelity 0.5 32 reaction Buffer with MgCl.sub.2 DMSO 0.1 6.4 PCR-Grade Nucleotide 0.1 6.4 Mixture Faststart High-Fidelity 0.05 3.2 Enzyme Blend (Roche 04 738 292 001) 20x Access Array Loading 0.25 16 Reagent (PN: 100-0883) PCR-Grade water 0.5 32 Total 2.5 160
[0282] For each sample replicate, a sample mixture containing forward and reverse barcode primers, genomic DNA and pre-sample mix was prepared in an individual well in a 96-well PCR plate.
TABLE-US-00012 TABLE 11 Sample Mixture Volume (l) Pre-sample Mixture 2 4 M forward barcode primer 0.5 4 m reverse barcode primer 0.5 Genomic DNA (50 ng/l) 1 PCR-grade water 1
[0283] Four replicate samples were prepared by mixing each sample with one pair of barcode primers selected from Table 12.
TABLE-US-00013 TABLE12 Reverse Reverse Forward Forward barcodeprimer barcodeprimer barcodeprimer barcodeprimer (454B-BC#-CS1) SEQIDNO. (454A-BC#-CS2) SEQIDNO. 1 GCCTTGCCAGCCCG (SEQIDNO:357) GCCTCCCTCGCGCC (SEQIDNO:358) CTCAGGCATGCTAC ATCAGGCATGCACA GGTAGCAGAGACTT CTGACGACATGGTT GGTCT CTACA 2 GCCTTGCCAGCCCG (SEQIDNO:359) GCCTCCCTCGCGCC (SEQIDNO:360) CTCAGCGTACGTAC ATCAGCGTACGACA GGTAGCAGAGACTT CTGACGACATGGTT GGTCT CTACA 3 GCCTTGCCAGCCCG (SEQIDNO:361) GCCTCCCTCGCGCC (SEQIDNO:362) CTCAGGTCAGCTAC ATCAGGTCAGCACA GGTAGCAGAGACTT CTGACGACATGGTT GGTCT CTACA 4 GCCTTGCCAGCCCG (SEQIDNO:363) GCCTCCCTCGCGCC (SEQIDNO:364) CTCAGAGCTGCTAC ATCAGAGCTGCACA GGTAGCAGAGACTT CTGACGACATGGTT GGTCT CTACA 5 GCCTTGCCAGCCCG (SEQIDNO:365) GCCTCCCTCGCGCC (SEQIDNO:366) CTCAGTGCATCTAC ATCAGTGCATCACA GGTAGCAGAGACTT CTGACGACATGGTT GGTCT CTACA 6 GCCTTGCCAGCCCG (SEQIDNO:367) GCCTCCCTCGCGCC (SEQIDNO:368) CTCAGCTGATGTAC ATCAGCTGATGACA GGTAGCAGAGACTT CTGACGACATGGTT GGTCT CTACA 7 GCCTTGCCAGCCCG (SEQIDNO:369) GCCTCCCTCGCGCC (SEQIDNO:370) CTCAGGTAGTCTAC ATCAGGTAGTCACA GGTAGCAGAGACTT CTGACGACATGGTT GGTCT CTACA 8 GCCTTGCCAGCCCG (SEQIDNO:371) GCCTCCCTCGCGCC (SEQIDNO:372) CTCAGGTCGATTAC ATCAGGTCGATACA GGTAGCAGAGACTT CTGACGACATGGTT GGTCT CTACA 9 GCCTTGCCAGCCCG (SEQIDNO:373) GCCTCCCTCGCGCC (SEQIDNO:374) CTCAGGATACGTAC ATCAGGATACGACA GGTAGCAGAGACTT CTGACGACATGGTT GGTCT CTACA 10 GCCTTGCCAGCCCG (SEQIDNO:375) GCCTCCCTCGCGCC (SEQIDNO:376) CTCAGTGATGCTAC ATCAGTGATGCACA GGTAGCAGAGACTT CTGACGACATGGTT GGTCT CTACA 11 GCCTTGCCAGCCCG (SEQIDNO:377) GCCTCCCTCGCGCC (SEQIDNO:378) CTCAGAGCTGATAC ATCAGAGCTGAACA GGTAGCAGAGACTT CTGACGACATGGTT GGTCT CTACA 12 GCCTTGCCAGCCCG (SEQIDNO:379) GCCTCCCTCGCGCC (SEQIDNO:380) CTCAGACTGTATAC ATCAGACTGTAACA GGTAGCAGAGACTT CTGACGACATGGTT GGTCT CTACA 13 GCCTTGCCAGCCCG (SEQIDNO:381) GCCTCCCTCGCGCC (SEQIDNO:382) CTCAGTGCATGTAC ATCAGTGCATGACA GGTAGCAGAGACTT CTGACGACATGGTT GGTCT CTACA 14 GCCTTGCCAGCCCG (SEQIDNO:383) GCCTCCCTCGCGCC (SEQIDNO:384) CTCAGAGTCTATAC ATCAGAGTCTAACA GGTAGCAGAGACTT CTGACGACATGGTT GGTCT CTACA 15 GCCTTGCCAGCCCG (SEQIDNO:385) GCCTCCCTCGCGCC (SEQIDNO:386) CTCAGTGTCTGTAC ATCAGTGTCTGACA GGTAGCAGAGACTT CTGACGACATGGTT GGTCT CTACA 16 GCCTTGCCAGCCCG (SEQIDNO:387) GCCTCCCTCGCGCC (SEQIDNO:388) CTCAGGCTAGCTAC ATCAGGCTAGCACA GGTAGCAGAGACTT CTGACGACATGGTT GGTCT CTACA
Running the Access Array IFC
[0284] The containment and interface accumulator reservoirs were filled with 300 l of Control Line Fluid (Fluidigm PN 89000020), and the H1-H4 reagent wells were loaded with 500 l of 0.05% Tween-20 in PCR-grade water prior to Access Array IFC loading. 5 l of each sample mixture was loaded into the sample ports, and 5 l of each primer mixture was loaded into the primer inlets on the Access Array IFC.
[0285] The Access Array IFC was thermal cycled and imaged using an IFC Stand-Alone Thermal Cycler (Fluidigm Corporation). The thermal cycling protocol contains a thermal mix step [50 C. for 2 min, 70 C. for 20 min], a hotstart step [95 C. for 10 min], a 35 cycle PCR strategy [2 cycles of 95 C. for 15 sec and 60 C. for 4 min, 33 cycles of 95 C. for 15 sec, 60 C. for 15 sec, 72 C. for 1 min, and an elongation step [72 C. for 3 min].
[0286] After amplification, the PCR products were harvested from the Access Array IFC using the Post-PCR IFC Loader AX. Before harvesting, each sample port was filled with 2 l of 0.05% Tween-20. Residual solution was removed from the H1-H4 reagent wells, and they were refilled with 600 l of 1 Access Array Harvesting Reagent (0.05% tween-20). After harvesting each sample port became a PCR product outlet that contained 10 l (10%) of 48 pooled PCR products. The pooled PCR products were removed from the Access Array IFC and stored in a microtiter plate at 4 C.
[0287] PCR products for each sample were pooled based on concentrations calculated from the Agilent Bioanalyzer traces. The purified product pool was subjected to emulsion PCR followed by pyrosequencing on a 454 FLX sequencer (Roche Analytical Sciences) according to manufacturer's instructions. Emulsion PCR reactions were run with beads containing both A and B primer sequences attached, enabling sequence reads for both strands of the amplicon.
[0288] The number of sequences counted for each individual PCR product in each sample were analyzed to demonstrate representation of the PCR products shown in
Example 6
4-Primer Barcoding of Target Nucleic Acids for Illumina DNA Sequencing Using a Microfluidic Device that Permits Recovery of Amplication Products
[0289] Sequences designed for a 4-primer tagging scheme to be used on the Illumina Genome Analyzer II are shown in Tables 13 and 14. The tag sequence is the inner primer sequence.
TABLE-US-00014 TABLE13 Innerprimers Target-Specific TagSequence Sequence(Forward) OligonucleotideSequence ACACTCTTTCCCTACA ACTGTCCAGCTTTGTGCC ACACTCTTTCCCTACACGACGCTCTTCCG CGACGCTCTTCCGATC (SEQIDNO:390) ATCTACTGTCCAGCTTTGTGCC T (SEQIDNO:391) (SEQIDNO:389) ACACTCTTTCCCTACA GATCATCATAGGAGTTGC ACACTCTTTCCCTACACGACGCTCTTCCG CGACGCTCTTCCGATC ATTGTTG ATCTGATCATCATAGGAGTTGCATTGTTG T (SEQIDNO:393) (SEQIDNO:394) (SEQIDNO:392) Target-Specific TagSequence Sequence(Reverse) OligonucleotideSequence CTCGGCATTCCTGCTG TCCTCTGCCTAGGCGTT CTCGGCATTCCTGCTGAACCGCTCTTCCG AACCGCTCTTCCGATC (SEQIDNO:396) ATCTTCCTCTGCCTAGGCGTT T (SEQIDNO:397) (SEQIDNO:395) CTCGGCATTCCTGCTG GAAATGTAAATGTGGAGC CTCGGCATTCCTGCTGAACCGCTCTTCCG AACCGCTCTTCCGATC CAAACA ATCTGAAATGTAAATGTGGAGCCAAACA T (SEQIDNO:399) (SEQIDNO:400) (SEQIDNO:398)
TABLE-US-00015 TABLE14 Barcode Primers Direction ILMN_PE1sh_F Forward AATGATACGGCGACCACCGAGATCTAC ACTCTTTCCCTACACGA (SEQIDNO:401) ILMN_PE2sh_R Reverse CAAGCAGAAGACGGCATACGAGATCGG TCTCGGCATTCCTGCTGAAC (SEQIDNO:402)
[0290] The successful amplication of a PCR product using the 4-primer strategy designed for use on the Illumina GA II sequencer is shown in
Example 7
Barcoding of Target Nucleic Acids for Titanium Chemistry on the 454 FLX Sequencer (Roche Analytical Sciences)
[0291] Table 15 shows forward barcode sequences for use with Titanium chemistry on the 454 FLX Sequencer (Roche Analytical Sciences). Table 16 shows reverse barcode sequences for use with Titanium chemistry on the 454 FLX Sequencer (Roche Analytical Sciences).
TABLE-US-00016 TABLE15 Well Barcode ForwardOligoName ForwardOligosequence SEQIDNO. A1 TI-MID1 TI-F-MID1-TAG8 CGTATCGCCTCCCTCGCGCCATCAGACGAGTGCG (SEQIDNO:403) TACACTGACGACATGGTTCTACA B1 TI-MID2 TI-F-MID2-TAG8 CGTATCGCCTCCCTCGCGCCATCAGACGCTCGAC (SEQIDNO:404) AACACTGACGACATGGTTCTACA C1 TI-MID3 TI-F-MID3-TAG8 CGTATCGCCTCCCTCGCGCCATCAGAGACGCACT (SEQIDNO:405) CACACTGACGACATGGTTCTACA D1 TI-MID67 TI-F-MID67-TAG8 CGTATCGCCTCCCTCGCGCCATCAGTCGATAGTG (SEQIDNO:406) AACACTGACGACATGGTTCTACA E1 TI-MID5 TI-F-MID5-TAG8 CGTATCGCCTCCCTCGCGCCATCAGATCAGACAC (SEQIDNO:407) GACACTGACGACATGGTTCTACA F1 TI-MID6 TI-F-MID6-TAG8 CGTATCGCCTCCCTCGCGCCATCAGATATCGCGA (SEQIDNO:408) GACACTGACGACATGGTTCTACA G1 TI-MID7 TI-F-MID7-TAG8 CGTATCGCCTCCCTCGCGCCATCAGCGTGTCTCT (SEQIDNO:409) AACACTGACGACATGGTTCTACA H1 TI-MID8 TI-F-MID8-TAG8 CGTATCGCCTCCCTCGCGCCATCAGCTCGCGTGT (SEQIDNO:410) CACACTGACGACATGGTTCTACA A2 TI-MID10 TI-F-MID10-TAG8 CGTATCGCCTCCCTCGCGCCATCAGTCTCTATGC (SEQIDNO:411) GACACTGACGACATGGTTCTACA B2 TI-MID11 TI-F-MID11-TAG8 CGTATCGCCTCCCTCGCGCCATCAGTGATACGTC (SEQIDNO:412) TACACTGACGACATGGTTCTACA C2 TI-MID13 TI-F-MID13-TAG8 CGTATCGCCTCCCTCGCGCCATCAGCATAGTAGT (SEQIDNO:413) GACACTGACGACATGGTTCTACA D2 TI-MID14 TI-F-MID14-TAG8 CGTATCGCCTCCCTCGCGCCATCAGCGAGAGATA (SEQIDNO:414) CACACTGACGACATGGTTCTACA E2 TI-MID15 TI-F-MID15-TAG8 CGTATCGCCTCCCTCGCGCCATCAGATACGACGT (SEQIDNO:415) AACACTGACGACATGGTTCTACA F2 TI-MID16 TI-F-MID16-TAG8 CGTATCGCCTCCCTCGCGCCATCAGTCACGTACT (SEQIDNO:416) AACACTGACGACATGGTTCTACA G2 TI-MID17 TI-F-MID17-TAG8 CGTATCGCCTCCCTCGCGCCATCAGCGTCTAGTA (SEQIDNO:417) CACACTGACGACATGGTTCTACA H2 TI-MID18 TI-F-MID18-TAG8 CGTATCGCCTCCCTCGCGCCATCAGTCTACGTAG (SEQIDNO:418) CACACTGACGACATGGTTCTACA A3 TI-MID19 TI-F-MID19-TAG8 CGTATCGCCTCCCTCGCGCCATCAGTGTACTACT (SEQIDNO:419) CACACTGACGACATGGTTCTACA B3 TI-MID20 TI-F-MID20-TAG8 CGTATCGCCTCCCTCGCGCCATCAGACGACTACA (SEQIDNO:420) GACACTGACGACATGGTTCTACA C3 TI-MID21 TI-F-MID21-TAG8 CGTATCGCCTCCCTCGCGCCATCAGCGTAGACTA (SEQIDNO:421) GACACTGACGACATGGTTCTACA D3 TI-MID22 TI-F-MID22-TAG8 CGTATCGCCTCCCTCGCGCCATCAGTACGAGTAT (SEQIDNO:422) GACACTGACGACATGGTTCTACA E3 TI-MID23 TI-F-MID23-TAG8 CGTATCGCCTCCCTCGCGCCATCAGTACTCTCGT (SEQIDNO:423) GACACTGACGACATGGTTCTACA F3 TI-MID24 TI-F-MID24-TAG8 CGTATCGCCTCCCTCGCGCCATCAGTAGAGACGA (SEQIDNO:424) GACACTGACGACATGGTTCTACA G3 TI-MID25 TI-F-MID25-TAG8 CGTATCGCCTCCCTCGCGCCATCAGTCGTCGCTC (SEQIDNO:425) GACACTGACGACATGGTTCTACA H3 TI-MID26 TI-F-MID26-TAG8 CGTATCGCCTCCCTCGCGCCATCAGACATACGCG (SEQIDNO:426) TACACTGACGACATGGTTCTACA A4 TI-MID27 TI-F-MID27-TAG8 CGTATCGCCTCCCTCGCGCCATCAGACGCGAGTA (SEQIDNO:427) TACACTGACGACATGGTTCTACA B4 TI-MID28 TI-F-MID28-TAG8 CGTATCGCCTCCCTCGCGCCATCAGACTACTATG (SEQIDNO:428) TACACTGACGACATGGTTCTACA C4 TI-MID68 TI-F-MID68-TAG8 CGTATCGCCTCCCTCGCGCCATCAGTCGCTGCGT (SEQIDNO:429) AACACTGACGACATGGTTCTACA D4 TI-MID30 TI-F-MID30-TAG8 CGTATCGCCTCCCTCGCGCCATCAGAGACTATAC (SEQIDNO:430) TACACTGACGACATGGTTCTACA E4 TI-MID31 TI-F-MID31-TAG8 CGTATCGCCTCCCTCGCGCCATCAGAGCGTCGTC (SEQIDNO:431) TACACTGACGACATGGTTCTACA F4 TI-MID32 TI-F-MID32-TAG8 CGTATCGCCTCCCTCGCGCCATCAGAGTACGCTA (SEQIDNO:432) TACACTGACGACATGGTTCTACA G4 TI-MID33 TI-F-MID33TAG8 CGTATCGCCTCCCTCGCGCCATCAGATAGAGTAC (SEQIDNO:433) TACACTGACGACATGGTTCTACA H4 TI-MID34 TI-F-MID34-TAG8 CGTATCGCCTCCCTCGCGCCATCAGCACGCTACG (SEQIDNO:434) TACACTGACGACATGGTTCTACA A5 TI-MID35 TI-F-MID35-TAG8 CGTATCGCCTCCCTCGCGCCATCAGCAGTAGACG (SEQIDNO:435) TACACTGACGACATGGTTCTACA B5 TI-MID36 TI-F-MID36-TAG8 CGTATCGCCTCCCTCGCGCCATCAGCGACGTGAC (SEQIDNO:436) TACACTGACGACATGGTTCTACA C5 TI-MID37 TI-F-MID37-TAG8 CGTATCGCCTCCCTCGCGCCATCAGTACACACAC (SEQIDNO:437) TACACTGACGACATGGTTCTACA D5 TI-MID38 TI-F-MID38-TAG8 CGTATCGCCTCCCTCGCGCCATCAGTACACGTGA (SEQIDNO:438) TACACTGACGACATGGTTCTACA E5 TI-MID39 TI-F-MID39-TAG8 CGTATCGCCTCCCTCGCGCCATCAGTACAGATCG (SEQIDNO:439) TACACTGACGACATGGTTCTACA F5 TI-MID40 TI-F-MID40-TAG8 CGTATCGCCTCCCTCGCGCCATCAGTACGCTGTC (SEQIDNO:440) TACACTGACGACATGGTTCTACA G5 TI-MID41 TI-F-MID41-TAG8 CGTATCGCCTCCCTCGCGCCATCAGTCTGACGTC (SEQIDNO:441) AACACTGACGACATGGTTCTACA H5 TI-MID42 TI-F-MID42-TAG8 CGTATCGCCTCCCTCGCGCCATCAGTCGATCACG (SEQIDNO:442) TACACTGACGACATGGTTCTACA A6 TI-MID43 TI-F-MID43-TAG8 CGTATCGCCTCCCTCGCGCCATCAGTCGCACTAG (SEQIDNO:443) TACACTGACGACATGGTTCTACA B6 TI-MID44 TI-F-MID44-TAG8 CGTATCGCCTCCCTCGCGCCATCAGTCTAGCGAC (SEQIDNO:444) TACACTGACGACATGGTTCTACA C6 TI-MID45 TI-F-MID45-TAG8 CGTATCGCCTCCCTCGCGCCATCAGTCTATACTA (SEQIDNO:445) TACACTGACGACATGGTTCTACA D6 TI-MID46 TI-F-MID46-TAG8 CGTATCGCCTCCCTCGCGCCATCAGTGACGTATG (SEQIDNO:446) TACACTGACGACATGGTTCTACA E6 TI-MID47 TI-F-MID47-TAG8 CGTATCGCCTCCCTCGCGCCATCAGTGTGAGTAG (SEQIDNO:447) TACACTGACGACATGGTTCTACA F6 TI-MID48 TI-F-MID48-TAG8 CGTATCGCCTCCCTCGCGCCATCAGACAGTATAT (SEQIDNO:448) AACACTGACGACATGGTTCTACA G6 TI-MID49 TI-F-MID49-TAG8 CGTATCGCCTCCCTCGCGCCATCAGACGCGATCG (SEQIDNO:449) AACACTGACGACATGGTTCTACA H6 TI-MID50 TI-F-MID50-TAG8 CGTATCGCCTCCCTCGCGCCATCAGACTAGCAGT (SEQIDNO:450) AACACTGACGACATGGTTCTACA
TABLE-US-00017 TABLE16 Well Barcode ReverseOligoName ReverseOligoSequence SEQIDNO. A1 TI-MID1 TI-R-MID1-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:451) GACGAGTGCGTTACGGTAGCAGAG ACTTGGTCT B1 TI-MID2 TI-R-MID2-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:452) GACGCTCGACATACGGTAGCAGAG ACTTGGTCT C1 TI-MID3 TI-R-MID3-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:453) GAGACGCACTCTACGGTAGCAGAG ACTTGGTCT D1 TI-MID67 TI-R-MID67-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:454) GTCGATAGTGATACGGTAGCAGAG ACTTGGTCT E1 TI-MID5 TI-R-MID5-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:455) GATCAGACACGTACGGTAGCAGAG ACTTGGTCT F1 TI-MID6 TI-R-MID6-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:456) GATATCGCGAGTACGGTAGCAGAG ACTTGGTCT G1 TI-MID7 TI-R-MID7-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:457) GCGTGTCTCTATACGGTAGCAGAG ACTTGGTCT H1 TI-MID8 TI-R-MID8-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:458) GCTCGCGTGTCTACGGTAGCAGAG ACTTGGTCT A2 TI-MID10 TI-R-MID10-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:459) GTCTCTATGCGTACGGTAGCAGAG ACTTGGTCT B2 TI-MID11 TI-R-MID11-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:460) GTGATACGTCTTACGGTAGCAGAG ACTTGGTCT C2 TI-MID13 TI-R-MID13-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:461) GCATAGTAGTGTACGGTAGCAGAG ACTTGGTCT D2 TI-MID14 TI-R-MID14-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:462) GCGAGAGATACTACGGTAGCAGAG ACTTGGTCT E2 TI-MID15 TI-R-MID15-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:463) GATACGACGTATACGGTAGCAGAG ACTTGGTCT F2 TI-MID16 TI-R-MID16-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:464) GTCACGTACTATACGGTAGCAGAG ACTTGGTCT G2 TI-MID17 TI-R-MID17-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:465) GCGTCTAGTACTACGGTAGCAGAG ACTTGGTCT H2 TI-MID18 TI-R-MID18-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:466) GTCTACGTAGCTACGGTAGCAGAG ACTTGGTCT A3 TI-MID19 TI-R-MID19-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:467) GTGTACTACTCTACGGTAGCAGAG ACTTGGTCT B3 TI-MID20 TI-R-MID20-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:468) GACGACTACAGTACGGTAGCAGAG ACTTGGTCT C3 TI-MID21 TI-R-MID21-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:469) GCGTAGACTAGTACGGTAGCAGAG ACTTGGTCT D3 TI-MID22 TI-R-MID22-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:470) GTACGAGTATGTACGGTAGCAGAG ACTTGGTCT E3 TI-MID23 TI-R-MID23-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:471) GTACTCTCGTGTACGGTAGCAGAG ACTTGGTCT F3 TI-MID24 TI-R-MID24-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:472) GTAGAGACGAGTACGGTAGCAGAG ACTTGGTCT G3 TI-MID25 TI-R-MID25-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:473) GTCGTCGCTCGTACGGTAGCAGAG ACTTGGTCT H3 TI-MID26 TI-R-MID26-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:474) GACATACGCGTTACGGTAGCAGAG ACTTGGTCT A4 T--MID27 TI-R-MID27-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:475) GACGCGAGTATTACGGTAGCAGAG ACTTGGTCT B4 TI-MID28 TI-R-MID28-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:476) GACTACTATGTTACGGTAGCAGAG ACTTGGTCT C4 TI-MID68 TI-R-MID68-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:477) GTCGCTGCGTATACGGTAGCAGAG ACTTGGTCT D4 TI-MID30 TI-R-MID30-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:478) GAGACTATACTTACGGTAGCAGAG ACTTGGTCT E4 TI-MID31 TI-R-MID31-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:479) GAGCGTCGTCTTACGGTAGCAGAG ACTTGGTCT F4 TI-MID32 TI-R-MID32-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:480) GAGTACGCTATTACGGTAGCAGAG ACTTGGTCT G4 TI-MID33 TI-R-MID33-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:481) GATAGAGTACTTACGGTAGCAGAG ACTTGGTCT H4 TI-MID34 TI-R-MID34-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:482) GCACGCTACGTTACGGTAGCAGAG ACTTGGTCT A5 TI-MID35 TI-R-MID35-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:483) GCAGTAGACGTTACGGTAGCAGAG ACTTGGTCT B5 TI-MID36 TI-R-MID36-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:484) GCGACGTGACTTACGGTAGCAGAG ACTTGGTCT C5 TI-MID37 TI-R-MID37-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:485) GTACACACACTTACGGTAGCAGAG ACTTGGTCT D5 TI-MID38 TI-R-MID38-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:486) GTACACGTGATTACGGTAGCAGAG ACTTGGTCT E5 TI-MID39 TI-R-MID39-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:487) GTACAGATCGTTACGGTAGCAGAG ACTTGGTCT F5 TI-MID40 TI-R-MID40-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:488) GTACGCTGTCTTACGGTAGCAGAG ACTTGGTCT G5 TI-MID69 TI-R-MID69-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:489) GTCTGACGTCATACGGTAGCAGAG ACTTGGTCT H5 TI-MID42 TI-R-MID42-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:490) GTCGATCACGTTACGGTAGCAGAG ACTTGGTCT A6 TI-MID43 TI-R-MID43-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:491) GTCGCACTAGTTACGGTAGCAGAG ACTTGGTCT B6 TI-MID44 TI-R-MID44-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:492) GTCTAGCGACTTACGGTAGCAGAG ACTTGGTCT C6 TI-MID45 TI-R-MID45-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:493) GTCTATACTATTACGGTAGCAGAG ACTTGGTCT D6 TI-MID46 TI-R-MID46-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:494) GTGACGTATGTTACGGTAGCAGAG ACTTGGTCT E6 TI-MID47 TI-R-MID47-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:495) GTGTGAGTAGTTACGGTAGCAGAG ACTTGGTCT F6 TI-MID48 TI-R-MID48-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:496) GACAGTATATATACGGTAGCAGAG ACTTGGTCT G6 TI-MID49 TI-R-MID49-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:497) GACGCGATCGATACGGTAGCAGAG ACTTGGTCT H6 TI-MID50 TI-R-MID50-TAG5 CTATGCGCCTTGCCAGCCCGCTCA (SEQIDNO:498) GACTAGCAGTATACGGTAGCAGAG ACTTGGTCT
Example 8
Muliplex Barcoding of Target Nucleic Acids
[0292] Three pools of 10 primers were assembled from the primers listed in Table 9. PCR conditions were identical to those listed in Example 4, with the exception that primer concentrations were varied.