SEQUENCE CONVERSION REACTION
20230220474 · 2023-07-13
Inventors
- Nancy SCHOENBRUNNER (Charlestown, MA, US)
- Manuel DUVAL (Charlestown, MA, US)
- Erhard Ralf SCHOENBRUNNER (Charlestown, MA, US)
Cpc classification
C12Q2525/161
CHEMISTRY; METALLURGY
C12Q1/6883
CHEMISTRY; METALLURGY
C12Q2525/161
CHEMISTRY; METALLURGY
International classification
Abstract
Disclosed herein are methods, compositions, and kits related to the identification and/or quantification of target molecules.
Claims
1. A method, comprising: combining a first plurality of sequence conversion probes (SCPs) with a sample mixture including one or more target measurand, wherein each SCP comprises a polynucleotide including a target-specific region (TSR) and a circularizable signal specific region (SSR), the circularizable SSR comprising a first engineered polynucleotide marker including one or more primer regions, further wherein the first plurality of SCPs comprise a plurality of different TSRs configured to hybridize to different target measurands, wherein each SCP of the first plurality of SCPs includes the same SSR; hybridizing the TSRs to the one or more target measurand within the sample mixture; cutting SCPs having TSRs that are hybridized to the one or more target measurands to release cut SSRs (cSSRs) without releasing SSRs of un-hybridized SCPs; and hybridizing each of the cSSRs to a lock probe to produce circularization complexes; contacting the circularization complexes with a circularizing agent to circularize the cSSRs; and detecting the first engineered polynucleotide marker from the circularized cSSRs.
2. The method of claim 1, wherein hybridizing each of the cSSRs to the lock probe comprises hybridizing each of the cSSRs to the lock probe and an SSR extension probe, wherein the SSR extension probe comprises a second primer for use in detecting the first engineered polynucleotide marker.
3. The method of claim 1, wherein the SSR is configured not to hybridize to a measurand in the sample mixture.
4. The method of claim 1, wherein detecting the first engineered polynucleotide marker comprises performing digital PCR using the circularized cSSRs (cirSSR) and one or more primers configured to hybridize to the one or more primer regions to amplify the first engineered polynucleotide markers.
5. The method of claim 4, wherein performing digital PCR comprises distributing the pool of cirSSRs into a plurality of reaction samples at a dilution such that at least some of the reaction samples contain cirSSRs and some do not contain cirSSRs.
6. The method of claim 3, further comprising analyzing the first engineered polynucleotide markers of the cSSRs present or absent in individual reaction samples to provide a first number of binary results representing presence or absence of a presumably euploid chromosome in the reaction samples and a second number of binary results representing presence or absence of a presumably aneuploid chromosome in the reaction samples.
7. The method of claim 4, further comprising determining the presence or absence of an aneuploidy by comparing said first number with said second number, wherein a differential between the first number and the second number indicates the presence of aneuploidy.
8. The method of claim 1, wherein hybridizing each of the cSSRs to the lock probe comprises using a same polynucleotide sequence for the lock probe to hybridize to all of the cSSRs.
9. The method of claim 1, further comprising digesting non-circular DNA in the mixture with one or more exonuclease after contacting the circularization complexes with the circularizing agent.
10. The method of claim 1, wherein the SSR of each set of SCPs indicates a different chromosome.
11. The method of claim 1, wherein the one or more target measurand in the sample mixture comprises cell-free DNA (cfDNA).
12. The method of claim 1, wherein the sample mixture comprises a mixture of maternal and fetal genetic material from a maternal blood sample.
13. The method of claim 1, wherein the circularizing agent is a ligase.
14. The method of claim 1, wherein the circularizable SSR further comprises a first hybridization region and a second hybridization region, wherein the first hybridization region and the second hybridization region are configured to hybridize to the lock probe.
15. The method of claim 1, wherein the one or more primer regions comprises a forward primer region and a reverse primer region arranged on the circularizable SSR so that PCR amplification primers directed to the forward primer region and the second primer region cannot amplify the first engineered polynucleotide marker on the circularizable SSR unless the circularizable SSR is circularized.
16. The method of claim 15, wherein the lock probe is configured to hybridize to at least a portion of the first engineered polynucleotide marker, and to at least a portion of the forward primer region to produce circularization complexes.
17. The method of claim 1, wherein combining the first plurality of SCPs further comprises combining a second plurality of SCPs with the sample mixture, wherein the second plurality of SCPs comprise a plurality of different TSRs configured to hybridize to different target measurands, wherein each SCP of the second plurality of SCPs includes a same second SSR, wherein the same second SSR is different from the SSR of the first plurality of SCPs.
18. The method of claim 1, wherein detecting the first engineered polynucleotide marker from the circularized cSSRs comprises a method of detecting minimal residual disease monitoring.
19. The method of claim 1, wherein detecting the first engineered polynucleotide marker from the circularized cSSRs comprises a method of detecting transplant rejection.
20-244. (canceled)
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0121] The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
[0122] The novel features of the invention are set forth with particularity in the claims that follow. A better understanding of the features and advantages of the present invention will be obtained by reference to the following detailed description that sets forth illustrative embodiments, in which the principles of the invention are utilized, and the accompanying drawings of which:
[0123]
[0124]
[0125]
[0126]
[0127]
[0128]
[0129]
[0130]
[0131]
[0132]
[0133]
[0134]
[0135]
[0136]
[0137]
[0138]
[0139]
[0140]
[0141]
[0142]
[0143]
[0144]
[0145]
[0146]
[0147]
[0148]
[0149]
[0150]
[0151]
[0152]
[0153]
[0154]
[0155]
[0156]
[0157]
[0158]
[0159]
[0160]
[0161]
[0162]
[0163]
[0164]
DETAILED DESCRIPTION
[0165] Provided herein are compositions, kits, and methods for the detection and quantitation of nucleic acids. Practice of some methods and use of some compositions herein can simplify and improve the detection and quantitation of analytes quantitated through measuring their nucleic acids (e.g. the quantification of bacteria, human or other chromosomes, genes, pathogens, etc.). The disclosure herein provides methods for converting a plurality of measurands into a smaller number or even a single unified measurand. For example, the disclosure herein provides methods for converting a plurality of different nucleic acid targets for one or more alleles present on a chromosome into a single measurand. This approach thus may simplify assays using nucleic acid amplification and hybridization quantification methods, such as digital counting methods. It may also improve the accuracy of those methods. The current state of the art for performing non-invasive prenatal testing (NIPT) by PCR would require many unique primers and/or probes as each unique measurand requires two unique primers and may require a unique probe as well. The methods and compositions disclosed herein simplifies the chemistry to a few or even as little as one primer pair (2 primers) and 3 unique probes for NIPT for detecting trisomy 13, 18 and 21. This approach is broadly applicable to hybridization quantification methods.
[0166] This disclosure may significantly simplify the design for NIPT dPCR assays, offer flexibility to easily select target measurand sequences including shorter sequences, reduce the complexity of an assay for NIPT to a few or even a single primer pair for all chromosomes and may only require a single probe per chromosome, while allowing enough alleles on each chromosome to be targeted/tested to meet the statistically required number of replicates to achieve needed clinical performance.
[0167] This methods and compositions (including kits) described herein may require only two primers and three probes to allow digital PCR analysis, resulting in a clean reaction. Starting with maternal and fetal DNA extracted from 1-2 mL of plasma to achieve 100 000 counts per chromosome 13, 18 and 21 it is estimated with conventional methods 600 primers and 300 probes are needed, for 200 000 counts 1200 primers and 600 probes.
[0168] These simplifications increase the robustness of the assay and allow targeting a high number of sequences without having to worry about primer/primer/probe interaction causing an unspecific reaction.
Definitions
[0169] As used herein: accuracy refers to closeness of a measurement to a specific or correct value. Analyte refers to a substance of interest in an analysis, such as a nucleic acid of a specific chromosome, a specific virus, a specific bacterium, etc.
[0170] Analyte specific SSR refers to a common signal specific region (SSR) sequence for a defined analyte (e.g. a specific chromosome, a specific virus, a specific bacterium, etc.). The “analyte specific SSR” is connected on sequence conversion probe (SCPs) with different “measurand specific TSR”. These sequence conversion probe (SCP) detect a common analyte by detecting a plurality of analyte specific measurands (e.g. different alleles on chromosomes or pathogens).
[0171] Aneuploidy refers to the state where the wrong number of chromosomes (e.g., the wrong number of full chromosomes or the wrong number of chromosome segments, such as the presence of deletions or duplications of a chromosome segment) is present in a cell. In the case of a somatic human cell it may refer to the case where a cell does not contain 22 pairs of autosomal chromosomes and one pair of sex chromosomes. In the case of a human gamete, it may refer to the case where a cell does not contain one of each of the 23 chromosomes. In the case of a single chromosome type, it may refer to the case where more or less than two homologous, but non-identical chromosome copies are present, or where there are two chromosome copies present that originate from the same parent. In some embodiments, the deletion of a chromosome segment is a microdeletion.
[0172] In the figures, “B” refers to an affinity tag, including (but not limited to) biotin or another affinity tag. The affinity tag may allow removal of unwanted moieties such as following hybridization and cutting steps.
[0173] Chromosome refers to either a full chromosome, or a segment or section of a chromosome.
[0174] CRISPR-Cas refers to a system for controlled or programmable genome engineering. CRISPR refers to clustered regularly interspaced short palindromic repeats. Cas is an endonuclease that cuts DNA at a location specified by a specific RNA (may be referred to as a guide RNA).
[0175] cirSSR is a circularized exonuclease resistant signal specific region (SSR) derived from a cSSR.
[0176] cSSR is a signal specific region (SSR) of a sequence conversion probe (SCP) after cutting such as by e.g., a nuclease.
[0177] cTSR is a target-specific region (TSR) portion of the sequence conversion probe (SCP) after cutting by e.g., a nuclease.
[0178] Digital PCR (dPCR) is an alternate method to conventional φRT-PCR for absolute quantification and detection of nucleic acid molecules. dPCR works by partitioning a sample of DNA or cDNA into many individual, parallel PCR reactions; some of these reactions contain the target nucleic acid molecule (positive) while others do not (negative) and amplifying the sample. A single starting molecule of DNA or cDNA can be amplified a million-fold or more. During amplification, dye-labeled probes are used to detect sequence-specific targets. When no target sequence is present, no signal accumulates. Following PCR analysis, the fraction of negative reactions is used to generate an absolute count of the number of target molecules in the sample, without the need for standards or endogenous controls.
[0179] Duplex refers to DNA:DNA double strand or DNA:RNA double strand.
[0180] Lock Probe refers to an oligonucleotide which, in the presence of ligase, facilitates formation of a cirSIR.
[0181] The term “kit” (or “kit of parts”) refers to an article of manufacture including one or more containers and, optionally, a data carrier. The one or more containers may be filled with one or more of the reagents described herein. Additional containers may be included in the kit that contain, e.g., diluents, buffers and further reagents such as dNTPs. The data carrier may be a non-electronic data carrier, e.g., a graphical data carrier such as an information leaflet, an information sheet, a bar code or an access code, or an electronic data carrier such as a floppy disk, a compact disk (CD), a digital versatile disk (DVD), a microchip or another semiconductor-based electronic data carrier. The access code may allow the access to a database, e.g., an internet database, a centralized, or a decentralized database. The data carrier may include instructions for the use of the kit in the methods described herein. The data carrier may include a cut-off value or reference level. In case that the data carrier includes an access code which allows the access to a database, the threshold value or reference level is deposited in this database. In addition, the data carrier may comprise information or instructions on how to carry out the methods in this disclosure.
[0182] As used herein, a measurand refers to a nucleic acid representing one of many possible sequences/alleles of (or in) an analyte.
[0183] Measurand specific TSR refers to a target-specific region (TSR) specific for a measurand. Different “measurand specific TSRs” can be linked to a common “analyte specific SSR”.
[0184] NIPT refers to non-invasive prenatal testing and in some cases may refer to trisomy-based analysis on cell-free DNA in maternal blood or plasma.
[0185] Nucleic acid or nucleic acid molecule refers to DNA and/or RNA.
[0186] Nuclease refers to an enzyme that degrades nucleic acids by breaking bonds between nucleotides in the nucleic acids. Nucleases can act on DNA (DNase) and/or RNA (RNase). A duplex specific nuclease (DSN) selectively cuts DNA in a nucleic acid duplex (DNA/DNA) or heteroduplex (e.g., RNA/DNA). A duplex specific nuclease may leave one strand or part of one strand intact. Many nucleases are commercially available and include, but are not limited to: Restriction enzymes, 5′-3′ nucleases, RNase H, RNase H2, RNase HII, RNase V, duplex-specific nuclease (DSN), lambda exonuclease, T7 exonuclease, exonuclease III, RecJf, exonuclease I, exonuclease T, exonuclease V, BAL-31 nuclease, mung bean nuclease, DNase I, micrococcal nuclease, T7 endonuclease I, T5 exonuclease, Cas9 and any combination thereof. Information regarding nucleases and their substrates is found, e.g., in Mishra, Nawin C; (Nucleases: Molecular Biology and Applications, Wiley-Interscience (ISBN: 978-0-471-39461-7). Additional nucleases of interest include but are not limited to: RNase V1 (e.g., for cutting dsRNA), oligoribonucleases, (e.g., for cutting oligonucleotides), exoribonuclease II (e.g., for cutting mature miRNAs).
[0187] Oligonucleotide refers to a single-stranded multimer of nucleotides from 2 to about 500 nucleotides, e.g., 2 to 200 nucleotides. Oligonucleotides may be synthetic or may be made enzymatically, and, in some embodiments, are under 10 to 50 nucleotides in length. Oligonucleotides may contain ribonucleotide monomers (i.e., may be oligoribonucleotides or “RNA oligonucleotides”) or deoxyribonucleotide monomers (i.e., may be oligodeoxyribonucleotides or “DNA oligonucleotides”) or chemical linker. Oligonucleotides may be 10 to 20, 11 to 30, 31 to 40, 41 to 50, 51-60, 61 to 70, 71 to 80, 80 to 100, 100 to 150 or 150 to 200, up to 500 or more nucleotides in length, for example.
[0188] Padlock Probe refers to an oligonucleotide composed of two target complementary segments connected by a linker that may carry detectable functions. The two ends of the linear oligonucleotide probes are brought in juxtaposition by hybridization to a target sequence (e.g. a Lock Probe). This juxtaposition allows the two probe segments to be covalently joined by the action of a DNA ligase. As used herein, a padlock probe may be a specific type of inversion probe in which there is no gap between the target complementary regions at the ends of the probe (e.g., the target complementary regions span the entire target region upon hybridization, leaving no gaps). In general, an inversion probe may include no gaps (as in a padlock probe) or may include a gap (e.g., a gap of 1 bp, a gap of 2 bp, a gap of 3 bp, etc.). Inversion probes having larger gaps (e.g., >2-5 bp, >2 bp, >3 bp, >4 bp, >5 bp, etc.) may be referred to as connector inversion probes.
[0189] Primer refers to a single DNA molecule (a DNA oligomer) or a collection of DNA molecules (DNA oligomers) in which the DNA molecules are identical, or nearly so, and where the primer contains a region that is designed to hybridize to a targeted locus (e.g. a targeted polymorphic locus or a nonpolymorphic locus) and may contain a priming sequence designed to allow PCR amplification.
[0190] R refers to one or more Ribonucleotides or other nucleotide analog or a recognition site for a Restriction Enzyme; they may allow site-specific cutting upon hybridization to a nucleic acid target.
[0191] SCA refers to sequence conversion method.
[0192] SCORE refers to a sequence conversion reaction. A sequence conversion reaction “converts” a nucleic acid into a different form (sometimes referred to as a “proxy”).
[0193] SCP refers to sequence conversion probe.
[0194] SSR refers to a signal specific region.
[0195] Sequence refers to a DNA sequence or a genetic sequence. It may refer to the primary, physical structure of the DNA molecule or strand in an individual. It may refer to the sequence of nucleotides found in that DNA molecule, or the complementary strand to the DNA molecule. It may refer to the information contained in the DNA molecule as its representation in silico.
[0196] Subject relates to any organism such as a vertebrate, particularly any mammal, including both a human and another mammal, e.g., an animal such as a rodent, a rabbit, or a non-human primate (e.g., a monkey). The rodent may be a mouse, rat, hamster, guinea pig, or chinchilla. In some embodiments, the subject is a pregnant human female.
[0197] A substrate refers to a substance on which an enzyme acts. For example,
[0198] T21, T18, T13 refer to trisomy of chromosomes 21, 18, 13, respectively.
[0199] TSR refers to target-specific region. A unique target-specific region represents a unique measurand. (e.g. a specific allele sequence)
[0200] An analyte can be quantified by a sum of measurements, e.g. the analyte human chromosome 21 includes potentially thousands of measurands represented by thousands of detectable sequences across chromosome 21. Chromosome 21 has a length of ˜48 Mbases. This theoretically represents 480,000 unique measurands of an average length of 100 bases.
[0201] This method uses in some embodiments “analyte specific SSRs” [meaning e.g. chromosome specific SSR] linked to “measurand specific TSR” [specific to a unique sequence/allele on the chromosome].
Sequence Conversion Method
[0202]
[0203]
[0204] The affinity tag in
[0205] Direct visualization of the cleaved SSR (cSSR) may also be used. For example,
[0206] The SSR may be engineered to have a predetermined length, or range of lengths, polynucleotide content (e.g., GC ratio, etc.), and/or sequence. In particular in variations in which multiple different SSRs are used, either to indicate different specific target measurands (individual measurands) or to identify groups of target measurands (e.g., grouped by gene, by chromosome, by genetic pathway, by pathogen, mutations indicative of cancer, etc.), the SSRs may be distinct from each other, but may be engineered to have similar functional properties, such as melting temperature, primer regions, length, etc. This may help prevent bias that may otherwise occur during detection (e.g., amplification and/or binding).
[0207] Any of these assays, including the one shown in
[0208]
[0209] In
[0210] The carryover of SCPs that are not removed by the affinity tag, either because they are not successfully coupled to an affinity tag, or because the un-cut SCP is not sufficiently retained and/or removed by the affinity tag following nuclease treatment (see, e.g.,
[0211] Two potential sources of SCP carryover, and therefore background, include sequence conversion probe that is not bound to an affinity tag (as may occur with truncated SCPs), and sequence conversion probe that includes an affinity tag, but that remains unbound (as may occur with lower affinity or lower retention binding). The issue of SCP carryover is of particular relevance, as in highly sensitive applications such as NIPT, it may be critical to ensure that the final signal is only generated by the released SSR. When sequence conversion reactions are used for NIPT as described in some variations herein, such assays may contain ˜200 or more different SCPs per chromosome to achieve significant clinical performance. For example, in cfDNA there may be ˜1,000 genome copies per targeted allele in 1 ml of plasma, resulting in 200,000 molecules of SSR per chromosome. With a 50 pM concentration of each SCP in 100 μl (or approximately 6×10″ molecules of SCP per chromosome) there is a 3×10.sup.6 ratio of SCP:released SSR (per chromosome). For NIPT, as little as a 2% difference in aggregate released SSR from two chromosomes should ideally be detectable for samples with 4% fetal fraction. Thus, the maximal level of SCP carryover that would lead to intolerable background levels may be calculated. A reasonable specification could be that less than 10% of the signal difference between normal and trisomy attributable to the desired released SSR may come from carryover of uncleaved SCP. This translates into very strict tolerances: 10% (max background specification)×2% (copy number difference for 4% fetal fraction)×200,000 (molecules of SSR per chromosome)=400 molecules carryover SCP tolerated. 400 molecules of carryover intact SCP translates into a requirement to remove 400/6×10″, or 99.9999999% of the SCP in order to resolve signal within the desired range. This strict tolerance may be highly demanding in practice.
[0212] Described herein are methods and compositions (including kits) that may address this. In general, the SCPs provided as part of these methods and compositions may be labeled with an affinity tag (such as biotin) in different configurations: at the 5′ end, internal or at the 3′ end (and in some variations, internally). 5′ end and 3′ end biotinylation is described in
[0213] However, in variations in which all of the SCP is labeled with a 3′ biotin that is synthesized as described above, all of the probes will have the 3′ biotin. This is therefore may be a preferred label configuration. See, e.g.,
[0214] An alternative variation is illustrated above in
[0215] If greater specificity is desired, addition affinity tags (e.g., orthogonal affinity tags), such as a digoxin, may be included on the SCP near the 5′ end to exclude truncated impurities. It must be an orthogonal (i.e. not biotin) so that the 3′ biotin-SCP are not removed. It may be near, but not at, the 5′ PO4 end so that it does not inhibit ligation.
[0216] In general, any of these methods and compositions may include multiple affinity tags, including multiple biotin groups. For example, multiple biotin groups may be sequentially added to the 5′ end of the SCP or in some variations the 3′ end of the SCP. To increase the strength of binding the avidity of the SCP to bind the affinity partner (e.g., SA) may be increased by using more than one affinity tag (e.g., biotin) per SCP. Avidity refers to the accumulated strength of multiple affinities of individual non-covalent binding interactions, such as between an antibody and an antigen, and is commonly referred to as functional affinity. To increase avidity, multiple (e.g. two to four) biotin tags may be added on the SCP which can interact with the four binding sites on each streptavidin (SA). Theoretically, if all four biotins are engaged in binding, the K.sub.d would be approximately 10.sup.−60 M, more than sufficient for the estimated stringency described above.
[0217] Even though the affinity of the biotin-streptavidin interaction is extremely strong, with a best-case disassociation constant (K.sub.d) of 10.sup.−15 M, a small amount of the biotin-SCP probe may still remain unbound, because even this extremely strong interaction is still an equilibrium between bound and unbound species. The theoretical level of unbound biotin-SCP was calculated under the expected reaction conditions at the reported (10.sup.−15 M) and 10-fold lower K.sub.d (10.sup.−14 M) as shown in Table 1.
TABLE-US-00001 molecules free [streptavidin], M* [biotin-SCP], M** K.sub.d, M biotin-SCP 1.00E−06 3.00E−08 1.00E−15 1,862 1.00E−06 3.00E−08 1.00E−14 18,619 1.00E−06 3.00E−08 1.00E−60 0
[0218] The last row of Table 1 illustrates the theoretical effect of including four biotin on, e.g., the 3′ end of the SCP.
[0219] There are various well-known biotin modifications that may be added to the 3′,5′ end or placed internally, such as a standard (C6 linker) biotin, biotin-dT (which can be placed internally), and a dual biotin modification resulting in two functional biotin groups, which act to increase biotin-streptavidin binding affinity, and may be used for applications requiring high sensitivity, as described above.
[0220] Any of the methods and compositions (e.g., kits) described herein may provide cleavage of the SCP to release the SSR immediately adjacent to, or as close as possible (e.g., within 1 bp, within 2 bp, within 3 bp, within 4 bp, etc.) to the TSR that is hybridized to measurand, depending on the nuclease used. For example, restriction enzymes may be used, as described herein. Restriction enzymes may specifically cleave double stranded DNA. Cleavage specificity may be enhanced by cutting the double stranded TSR-target DNA as close as possible to the transition to the single stranded SSR. By engineering the cut-site to be as close as possible to the Y-intersection of double to single-stand DNA (e.g., the TSR-SSR boundary) the specificity of the DNA cutting may be enhanced. In some variations Type IIS restriction enzymes, such as BsaI or BspQI, which cleave outside of their recognition sequence to one side, and thus allow cleavage closer to the Y-transition from the double stranded TSR to the single stranded SSR, may be used. In one example, taking the enzymological requirements into consideration, the cut site can be moved to within two bases to the intersection. For example, BspQI may be used by targeting the BspQI recognition site as part of the TSR, at the TSR-SSR boundary.
[0221] In any of the sequence conversion methods (and assays including the sequence conversion method) multiple SCPs, directed to a variety of target measurands and including a subset of different SSRs may be used to consolidate groups of different target measurands by category or class identified by a common SSR.
[0222] For example, any of these assays may include a plurality of different sequence conversion probes (SCPs) that may be combined with a sample mixture of genetic material that includes target measurands. The combined mixture may be referred to as an assay mixture. As described above, each SCP may include a polynucleotide target-specific region (TSR), an affinity tag, and a signal specific region (SSR). The SSR includes an engineered polynucleotide marker. The SCPs may be configured to hybridize to a number (x) of different targets (e.g., may have different TSRs). Within the plurality of SCPs, different TSRs for which their target sequences (e.g., target RNA, target DNA, etc.) share a common class or category may have the same SSR. Thus, the number of different SSRs (y) may be less than the number of TSRs (e.g., there may be 2 time as many or more TSRs as SSRs, 3× or more as many, 4× or more as many, 5× or more as many 6× or more as may, 7× or more as many, 8× or more as many, 9× or more as many, 10× or more as many, 15× or more as many, 20× or more as many, 25× or more as many, 30× or more as many, 40× or more or many, 50× or more or many, 100× or more or many, 200× or more as many, 300× or more as many, etc.). Thus, these assays may be reductive assays, reducing a large number of diverse target measurands to a smaller number of uniform SSRs (e.g., engineered polynucleotide markers). The uniform SSRs may be configured to represent a category to which subsets of the target measurands are a member.
[0223]
[0224]
[0225] These examples (e.g.,
[0226] Examples of detection classes for target measurands include chromosomes (e.g., different genes on the same chromosome or groups of chromosomes), genes (e.g., different alleles of a same gene), genetic pathways (e.g., different genes in a genetic pathway), exons (e.g., different exons of a same gene), pathogens (e.g., different genes/alleles, etc. of a common pathogen, including virus), different mutations known to drive cancer (e.g. lung cancer), or the like. Any grouping of target measurands may be included with a common engineered SSR.
[0227] For example, for an assay for non-invasive prenatal testing, the plurality of subsets of SCPs could correspond to different chromosomes so that for each chromosome (within each subset) a plurality of different TSRs could be directed to different parts of a single chromosome and share a same SSR.
[0228]
[0229]
[0230] In some variations it may be desirable to remove the target material (e.g., RNA target measurand) from the reaction.
[0231] The sequence conversion probes described in
[0232]
[0233]
[0234] As mentioned above, when only the TSR is degraded (e.g., by DSN or RNase H2) the intact target measurand may be released and allowed to hybridize to another copy of the SCP. This is illustrated in
[0235] Alternatively,
[0236] For example, any of the methods and compositions (e.g., kits) described herein may be used to provide input into a procedure including digital PCR to provide quantitative data.
[0237] The methods and compositions described herein may also be used to detect very short target measurand sequences. For example,
[0238] A similar technique may be used for direct detection of small target measurands, as shown in
[0239] In
[0240] As mentioned above, in some variations the methods described herein for performing the sequence conversion method may include a restriction endonuclease as the nuclease. This is illustrated, for example, in
[0241] In general, the target-specific region may be designed so that binding to the target measurand is optimal. For example, the TSR may be configured to bind to at least a 5 mer and can be directed to the coding (gene) portions of the genome. The sequences may be restricted to contain at least half GC (so 3 to 6 for 6 mers). When identifying good target measurand regions, regions that are repeated (e.g., multiple copy regions) may be preferred, because they may increase the overall signal or may be avoided if e.g. accurate chromosome counting is desired.
[0242]
[0243]
[0244] For example, in some variations the cSCP may be configured for use with NIPT and may include first and second target-specific regions (e.g., R1 & R2) that are the selected target sequences on the chromosome. PB1 and PB2 may be a universal primer (e.g., one pair), and the TAG region may be a specific TaqMan probe for each chromosome (e.g., 3 in total, one for each of chromosomes 13, 18 and 21).
[0245] In use, the cSCP may be hybridized to the target measurand as shown in
[0246]
[0247] The use of a sequence conversion assay in which the sequence conversion probe is configured to be an inversion probe (e.g., a padlock probe) may be particular advantageous. As described above in reference to
[0248] As descried above, in some variations, the SSR may be engineered so that the SSR only becomes PCR-detectable after being separated from the SCP and circularized, e.g., by including primers oriented away from each other rather than towards each other (
[0249]
[0250] For example, in
[0251] In general, the use of a nuclease (e.g., an endonuclease, such as a DNase) in the method shown in
[0252]
[0253]
[0254] In
[0255] This workflow, which in this example does not depend on a biotin streptavidin clean-up, may be highly specific for cut SSR amplification and may prevent all or most of the background that may otherwise arise from un-cut SCP, when using an exonuclease digestion destroying the un-cut SCP.
[0256] Another example of a workflow for use with a sequence conversion method as described herein, similar to that of
[0257] In
[0258] In some examples, as described in greater detail below, lock probe(s), SSR extension probe, and ligase may then be added (together or sequentially) to ligate the SSR extension probe to an end of the short cSSR. In some examples this may result in circular cSSR (cirSSR); alternatively the cSSR (“long cSSR” comprising the short cSSR and SSR extension probe) may be left linear. For example, in
[0259] Another variation of a sequence conversion method is shown in
[0260]
[0261] In the variation shown in
[0262] In general, the methods and compositions described herein may allow for the conversion of one oligonucleotide sequence into another one, including converting several oligonucleotide sequences into one sequence. Examples of this may include converting RNA into DNA, converting long sequences into short sequences, short sequences into long sequences, converting a first sequence into another sequence, increasing the apparent copy number of an analyte, and the like.
[0263] In general the engineered SSR sequence may be designated such that it has desirable properties that the target measurand sequence does not have. Thus, these methods and compositions may simplify (e.g., convert, translate) many sequences into few, e.g., consolidating them into a single sequence. As described herein, this may change the length of a target sequence to a length making it detectable or easier to detect. For example, PCR amplicons typically need to have a certain length to accommodate forward and backward primer and, in the case of TaqMan PCR, an additional probe binding region is needed. Existing technologies for detecting polynucleotides are often limited by the size of the polynucleotide, typically requiring a target region length of about 100 bases or longer, preventing shorter DNA or RNA targets (e.g., of e.g. 25 bases or smaller) from being easily detected. The methods and compositions described herein may enable detection of such targets.
[0264] An SCP may be synthesized in any appropriate manner. SCP probes can be made for example by coupling several different multi-mers that have been pre-manufactured. For example, it might be advantageous to add the SSR as a single building block, or several building blocks, since it would be constant, and the chemical linkage does not have to be a nucleotide or DNA. It might include any other chemical linker. From a structural perspective the SCP does not need to be 100% oligonucleotide.
[0265] Another variation of a sequence conversion method is shown
[0266] In any of the methods described herein a plurality of SCPs may be used together. The plurality of SCPs may include a plurality of different TSRs configured to hybridize to different target measurands, while each of the SCP in this (first) plurality may include the same SSR, as described above. In
[0267] In
[0268] Optionally, in some examples an SSR extension probe may be added to the end (e.g., the 3′ end) of the short cSSR. In
[0269] In
[0270] The use of an SSR extension probe is optional, and may be used to dramatically reduce background, which may be particularly helpful in some examples. In general, the SSR extension probe may include an appropriate number of nucleotides (e.g., between 6 and 40 or more), and the sequence may be specific to the second primer region.
[0271] The workflow shown in
EXAMPLES
[0272] The sequence conversion methods described herein (e.g., using the SCP probes described) may be used for and/or as part of a variety of different techniques. For example, these methods may be used with dPCR for applications including (but not limited to): detection of aneuploidy by digital chromosome counting using cffDNA, somatic mutation panels for oncology, syndromic pathogen panels, etc.
[0273] For example, the somatic mutation panels for oncology may be beneficial to detect a number of known cancer driver somatic mutations in oncogenes, such as EGFR, KRAS, BRAF to help guide therapy selection or screen for cancer. Such mutations are known in the literature and can be found in various databases (e.g. COSMIC). One would hybridize SCPs to mutation sites of interest from gene A (e.g. EGFR) with multiple SCP probes each with unique TSRs but all with a common SSR (e.g. with a probe binding site for a FAM-labeled probe). The TSRs would be perfectly matched to the mutations of interest (e.g. EGFR L858R, T790M, etc.) but would be mismatched to the wild type gene. As a result, only the SCPs bound to the perfectly matched mutant sites would be cleaved by the nuclease. The multiple SCPs for a second gene (gene B, e.g. KRAS) would similarly have unique TSRs for the mutations of interest in gene B but would have a common SSR (e.g. with a probe binding site for a HEX-labeled probe). One would amplify and detect the resulting SSRs in dPCR with as few as one primer pair and a probe specifically labeled for each gene.
[0274] Any of the methods and kits described herein may be used for minimal residual disease monitoring. Minimal residual disease (MRD) refers to the small number of cancer cells or cell-free circulating DNA derived from such cancer cells that remain in the body after treatment. The number of remaining cells may be so small that they do not cause any physical signs or symptoms and often cannot even be detected through traditional methods, such as viewing cells under a microscope and/or by tracking abnormal serum proteins in the blood. A MRD positive test result means that residual (remaining) disease was detected. A negative result means that residual disease was not detected. The methods and kit described herein may therefore be used to detect markers of cancer cells or cell-free circulating DNA from minimal residual disease. After treating cancer, any remaining cancer cells in the body can become active and start to multiply, causing a relapse of the disease. Detecting MRD may indicate that the treatment was not completely effective or that the treatment was incomplete. Minimal residual disease may be present after treatment because not all of the cancer cells responded to the therapy, or because the cancer cells became resistant to the medications used.
[0275] Any of the methods and system (e.g., kits) described herein may be used for detecting transplant rejection. For example, the methods described herein may detect donor-derived cell-free DNA and may therefore tracks DNA markers from the organ donor that appear in the blood (or other tissues or fluids) of the transplant recipient. For example, injured or dying cells from the donor organ release donor DNA fragments into the bloodstream, and higher amounts of donor DNA indicate a higher risk for transplant rejection in the recipient. The methods and systems (e.g., kits) described herein can be used to detect relative amounts of donor and patient DNA, similar to the methods for detecting fetal and maternal DNA descried herein. Current tests used to detect heart transplant rejection rely on frequent and painful biopsies of heart tissue. Those biopsies run the risk of damaging the heart and are limited by their invasiveness (ability to obtain tissue samples) and reliability for detecting acute rejection. Acute, or rapid, rejection tends to happen in the first three to six months after transplantation as opposed to chronic rejection, which occurs after many years. For example, blood samples may be monitored the patients for signs of acute rejection using the assays described herein.
[0276] These methods and compositions may be configured as syndromic pathogen panels. Such panels may be beneficial to detect groups of pathogenic bacteria, yeast and/or fungi with SCP probes in conjunction with detection (e.g., by dPCR). For example, multiple enteric pathogens causing gastrointestinal distress could be detected with SCPs including one or more target sequences (e.g. 16S RNA) in one or more bacteria (e.g. Campylobacter, C. difficile, etc.) that could lead to the same symptoms (e.g. acute diarrhea) and that are treated in the same manner (e.g. same classes of antibiotics). The SCPs would have multiple unique TSRs but share a common SSR for common treatment modality (Fam for the antibiotic class A; Hex for antibiotic class B).
Example 1
[0277] In one example, the methods described herein may be used to identify the ploidy of a genetic sample.
[0278] Traditional methods for using dPCR to detect aneuploidy require a large number of primers and probes to achieve statistical significance. For example,
[0279] For example, these methods may be used to identify target measurands (e.g., 100 to 200 target measurands) on each of chromosomes 21, 18 and 13 each. This information (e.g., the number of identified target measurands from each chromosome) may be used to determine the ploidy of each of these chromosomes. In some variations the method may include determining the length of each TSR with a ribose to hybridize at 70° C. plus minus 10° C. that will not dissociate after cleavage. Ribose may be placed as close to the 3′ end as is possible to minimize cycling/amplification. The method may also optionally include determining for the SSR amplification part as an ideal amplicon. The primers for the C21/C18/C13 may all be identical, and probes to each of C21, C18 and C13 may be standard TaqMan probes specific for C21, C18 and C13. Each Signal Conversion probe (SCP) may have an affinity tag at one end, such as, but not limited to biotin at the 5′ end. Assuming 200 target measurands each for C21/13/18 and a 100,000 well or droplet array, the method may include getting 200 different SCPs for C21, 200 different SCPs for C18, and getting 200 different SCPs for C13. Probes may be mixed at 1:1:1 ratio. There should be now roughly 600 SCPs in equimolar ratio. The total amount may be determined experimentally but can be expected to be at least 10-fold higher than the target DNA concentration. A typical concentration of an individual probe could be in the range of 5 pM-500 pM. Incubation time, temperature, and buffer condition may impact the optimal conditions. The nuclease (e.g., RNase H2) can be added before, together or after addition of SCP, however for some nucleases (e.g. restriction enzymes) it is often preferred to be added last. For example, restriction enzymes need to be added after SCP hybridization otherwise they could destroy the target DNA allele before the SCP can bind. Once the reaction has been completed, uncleaved SCP and cTSR will be removed with streptavidin bound to a solid phase (e.g. beads, column, magnetic beads, etc.). The supernatant or an aliquot may be mixed with the PCR master mix for the digital PCR reaction and run following standard protocols known in the art.
[0280] The number of positive T 21, T 18 and T 13 counts may then be compared and based on mathematical transformations (e.g. difference, ratio or other) to determine if the result is normal or abnormal.
Example 2: Sequence Conversion Method for NIPT (Detection of Trisomy 13, 18 and 21) Using Restriction Enzyme
[0281] As mentioned above, in some variations a restriction endonuclease (“restriction enzyme”) may be used. For example, in some variations the ploidy of a sample of genetic material including chromosomal material may be determined using the methods described herein.
[0282] In any of the methods described herein in which the nuclease used to separate the SSR from the SCP, the restriction enzyme to be used may be determined and the restriction site may be engineered into the SCP. For example, selection criterion for determining a restriction enzyme may be based on the specific needs of the methodology used, and may include the recognition site being approximately ˜6 bases long (other length restriction sites work as well); the enzyme begin stable and effective at above 45° C. The enzyme chosen may be selected because it can be heat inactivated. In some variations, the recognition site may be found in the potential targets' sequences with sufficient frequency (ideally >500) on each of chromosome 13, 18 and 21, to support assay statistical requirements (e.g., the number of probes per chromosome).
[0283] In designing the SCPs, the identified restriction sites may be extended to the target-specific region (TSR) sequences. For example, the method may include identifying 200 or more sequence targets on chromosome 21, 18 and 13 each based on the chosen restriction enzyme. Possible candidate restriction enzymes may include BsmI (NEB cat #R0134L), BsmBI-v2 (NEB cat #R0739S), BspQI (NEB cat #R0712S), BsaI (NEB cat #R0535S), BsaI-HF-v2 (NEB cat#R3733S) or BssHII (NEB cat #R0199L). Alleles with the selected restriction enzyme recognition site sequence may be identified for use on chromosomes 13, 18 and 21. To determine the 5′-end for the target-specific region the Tm of the probe may be defined. Since many of these restriction enzyme work well above 45° C. (the reaction temperature), the TSR should ideally be fully bound at the reaction temperature. Therefore, a Tm of the probe above 55° C. may be used as a target. Based on this Tm in 50 mM NaCl, a TSR probe length of 10-40 bases may be ideal, depending on GC content. The TSR binding sequence may be extended (using bioinformatic techniques) for more than 6 bases downstream (direction 3′). The sequence upstream (direction 5′) may be extended as needed to achieve the desired Tm. Unique sequences may be identified: sequences that are found on chromosomes other than the intended chromosome with an identity (or homology) of more than 80% may be discarded, especially those carrying the enzyme's recognition site. Sequences binding to common SNPs (e.g. >−1% frequency in the human population) in the restriction site may be filtered out. The method may also include ensuring the remaining sequences do not overlap with each other.
[0284] As used herein, when comparing polynucleotide sequences, two sequences are said to be “identical” if the sequence of nucleotides in each of the two sequences is the same when the sequences are aligned for maximum correspondence. The percentage identity between two nucleotide sequences as described herein may be determined according to art-accepted practices and criteria, for instance, the BLAST and BLAST 2.0 algorithms described in Altschul et al., Nucl. Acids Res. 25:3389-3402 (1977) and Altschul et al., J. Mol. Biol. 215:403-410 (1990).
[0285] The sequences for the Signal Specific Region (SSR) may also be determined. For example, by identifying (using bioinformatics) 3 oligo sequences not found in the human genome or human pathogens long enough to harbor a PCR amplicon. The 3 sequences may be synthetic (have very little homology or identity with the human genome), may contain two universal primer binding sites (one at the 5′-end, one at the 3′-end of the SSR) same for all three oligonucleotides, the oligo may contain a TaqMan probe binding site specific for Chromosome 13, 18 or 21 between the primer sites.
[0286] The final Sequence Conversion Probe (SCP) design may be determined by concatenating each of 200 identified Target-specific Regions (TSRs) for Chromosome 13, 18 and 21 with their respective universal Signal Specific Region (C13 SSR, C18 SSR and C21 SSR). The TSR may be at the 5′ end and will be synthesized with a 5′ biotin tag; the SSR is at the 3′ end. This results in 200 each chromosome specific Sequence Conversion Probes. The approximately 600 HPLC purified probes may be generated, providing, for example, approximately 200 identified SCPs each for Chromosomes 13, 18 and 21.
[0287] Primers and probes may be optimized for dPCR cycling conditions with an extension temperature of 58° C. For example, the primer Tm may be 58-60° C., the probe Tm may be 68-70° C. The first probe (e.g., probe 1, C21) may include a reporter 1 (e.g., FAM) and a Quencher (e.g., Iowa Black). The second probe (probe 2, C18) may include a Reporter 2 (e.g., HEX) and a Quencher (e.g., Iowa Black). The third probe (probe 3, C13) may include a reporter 3 (e.g., Cy5), and a Quencher (e.g., Iowa Black). The primers and probe for dPCR may be optimized to be compatible with the Bio Rad ddPCR Supermix for Probes (Catalog number: 186-3010 for 500 reactions).
[0288] The method may include: (1) maternal and fetal cfDNA Extraction, (2) sequence Conversion Reaction, (3) cleanup of sequence conversion reaction, (4) partitioning (e.g. droplet generation) of cSSRs, (5) dPCR and (6) data analysis.
[0289] Sample collection and DNA extraction may include isolating plasma (e.g., within five days from blood draw). An average of 9.5 mL whole blood may be collected in cell-free DNA BCT® blood collection tubes (Streck) via a double centrifugation protocol consisting of a first centrifugation step at 1342× g for 30 minutes, transfer of the plasma fraction to a secondary tube and a second centrifugation step at 2267× g for 20 minutes. Plasma is stored at −80 C until further processing.
[0290] Extract cfDNA from 1 mL of plasma of a pregnant female as described by the manufacturer. Reagents: Maxwell® RSC ccfDNA Plasma Kit (Catalog number: AS1480); Instrument: ProMega Maxwell® RSC 48 Instrument (AS8500). Elute cfDNA in 50 μL, elution buffer. Minimize salt carry over.
Sequence Conversion Reaction
[0291] 50 μl cfDNA eluate is transferred to a microtiter plate and 10 μL of SCP solution is added which contains 5-500 pM (final) locus-specific SCP probe (3-300 nM total SCP probe concentration for 600 SCP probes. Probes are at equimolar concentration). Denature at 95° C. for 60 sec, then lower temperature to 50-65° C. Add and mix 10 μL of Restriction Enzyme Buffer (containing 1-10 U BsaI-HF-v2 (New England Biolabs), 7 μL of 10× CutSmart Buffer Cat number B7204S). Incubation time and probe concentration at 50-65° C. is optimized following guidelines by Zhang et al (Nat Chem. 2018 January; 10(1): 91-98.) for 5-500 pM probe (individual SCP) at 0.5-24 hours. Temperature, concentrations and time interact with each other and allow a number of solutions to accommodate user requirements.
[0292] Cleanup of sequence conversion reaction may include, once the reaction has been completed, removal of any uncleaved SCP and cTSR by binding with streptavidin bound to a solid phase (e.g. beads, column, magnetic beads). The cSSR may be partitioned for dPCR. For example, the supernatant from the cleanup of the sequence conversion reaction may be mixed with the dPCR master mix and any additional reagents as required for the specific dPCR system (e.g. oil for droplet generation for the BioRad QX ONE Droplet Digital PCR System). The number of partitions is selected such that the required statistical significance for the intended use of the assay can be achieved. The dPCR workflow, including droplet generation, is described in “QX ONE Droplet Digital PCR System and QX ONE Software User Guide”.
[0293] Digital PCR (dPCR) may be performed on the resulting SSRs. For example, a Bio-Rad QX ONE Droplet Digital PCR System (M/N 12006536) may be used with PX1 PCR Plate Sealer (1814000), and Supermix for Probes (Biorad Catalog number: 186-3010 for 500 reactions).
[0294] Table 2, below illustrates the volumes required of 10 μM stock solutions for forward primer, reverse primer, and probe to achieve final concentrations of 900 nM primer concentration and 250 nM probe concentration for one reaction:
TABLE-US-00002 Bio-Rad ddPCR SuperMix for Probes 11 μl Forward Primer (10 μM) 2.0 μl Reverse Primer (10 μM) 2.0 μl Probe (10 μM) 0.55 μl Molecular Grade Water 4.45 μl Sample 2 μl
[0295] Note that if higher sample input is needed, reduce volume of Molecular Grade Water. The total volume accommodates for dead volume for pipetting.
[0296] Table 3 Shows an example of a thermal cycling protocol for dPCR that may be used:
TABLE-US-00003 Cycle Time Temperature Ramp Rate Step 1 1 10 minutes 95° C. 2 C./sec Step 2 40 30 seconds 94° C. 2 C./sec 60 seconds 58° C. 2 C./sec Step 3 1 10 minutes 98° C. 2 C./sec Step 4 1 infinite 12° C. 2 C./sec
[0297] The number of positive chromosome 21, 18 and 13 counts (e.g. empty vs occupied partitions, using Poisson distribution to determine absolute copy numbers) may be compared and based on mathematical transformations (e.g. difference, ratio or other) to ascertain if the risk for aneuploidy of chromosome 21, 18 or 13 of a particular sample is increased. Further risk factors like maternal age, being carriers of the genetic translocation, having had one child with an aneuploidy etc. can be used in combination with the assay result to generate a risk score.
Design and Selection of SCPs
[0298] Described herein are methods for designing and selecting the sequence conversion probes (SCPs) that may be used with any of the Sequence Conversion Reactions described and illustrated herein (including but not limited to the assay illustrated in
[0299] For example, a sequence conversion assay may start with an extracted subject (or pooled subjects) sample, e.g., of cell free DNA (cfDNA). SCP probes are then added to extracted cfDNA, which may be heated to denature the double-stranded cfDNA and cooled to allow specific annealing of SCP probes to target alleles within the cfDNA. Nuclease (e.g., restriction enzyme) is then added, and incubated to cleave the hybridized SCP probes at the pre-determined position, releasing the SSR portion of the SCP probe (cut SSR, or cSSR) only from those SCP probes whose TSR regions have hybridized to target regions of the cfDNA. A lock probe and ligase may then be added to circularize the cSSR. Optionally non-circular ssDNA and dsDNA may be digested; this step is not be necessary, particularly where the “primer away” amplicon configuration (as shown in
[0300] Thus, described herein are methods of forming sequence conversion probes (SCPs) that may be used to form a functional SCP. These SCPs may work in any of the sequence conversion assay. Although specific examples of sequences for SCPs, and the SSRs and TSRs that form them are provided, for example in SEQ ID Nos. 1-5 and 8-75, it should be understood that the methods described herein teach and enable one of skill in the art to make SCPs that will work with the sequence conversion assays described herein and are not limited to a particular sequence. SCPs having virtually any sequence that conforms to the properties described herein for selecting and combining the TSR and SSR will work.
[0301] For example, an SCP may generally include a target-specific region (TSR) extending between 15 and 80 base pairs (bp), wherein a polynucleotide sequence of the TSR is at least 80% identity (e.g., at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, etc.) to only a single region of one chromosome within a target (e.g., human) genome. In some cases the polynucleotide sequence of the TSR may be between 15 and 50 bp, between 15 and 45 bp, between 15 and 35 bp, between 15 and 30 bp, etc. Preferably the TSR is between 15 and 35 bp. As will be described in greater detail below, the TSR is homologous (e.g., having a signification identity) to only a single region of one chromosome within a target genome, so that the TSR will bind with reasonable affinity to only a single site within the genome. As described in greater detail, the TSR may be selected so that it occurs on a known chromosome (e.g., chromosome 21) in the genome. Although the examples described herein are specific to assays of human genomes, it should be understood that these assays may be directed to targets for other genomes (including other animal genomes, plant genomes, etc.).
[0302] The TSR is selected explicitly to include within the sequence of the TSR a cognate restriction site for a type IIs restriction enzyme. Example of type IIs restriction enzymes that may be used are provided herein. In some examples the cut site for the restriction enzyme may be at or near a first end of the TSR. The TSR may have a GC content of greater than 50%.
[0303] As mentioned, the TSR probe is specific to a given chromosome and occurs only once in that chromosome. Typically, multiple instances of SCPs are included that have the same SSR but different TSRs, as described above. All of the TSRs within a set of SCPs (including a variety of different TSR probes) may have the same restriction site. The restriction site (the cognate restriction site for a type IIs restriction enzyme) may be positioned in approximately the same position in all of the TSRs of the set of SPCs. For example, the cognate site may be positioned with the cut site within 8 bp (e.g., within 7 bp, within 6 bp, within 5 bp, within 4 pb, within 3 bp, within 2 bp, within 1 bp, etc.) of the 3′ end of the TSR where the TSR is coupled to the SSR. For example, all of the TSRs may have the cognate restriction site within about 10 bp downstream and upstream of the 5′ and 3′ ends of the TSR, respectively. The cognate site may be adjacent to the end (e.g., the 5′ end) of the TSR where the TSR is coupled to the SSR. In some examples all of the TSRs of the set of TSRs within the set of SCPs have approximately the same size (e.g., about 13 bp, about 14 bp, about 15 bp, about 16 bp, about 17 bp, about 18 bp, about 19 bp, about 20 bp, about 21 bp, about 22 bp, about 23 bp, about 24 bp, about 25 bp, about 26 bp, about 27 bp, about 28 bp, between about 13-50 bp, between about 15-40 bp, between about 15-35 bp, between about 15-30 bp, between about 15-25 bp, etc.).
[0304] The different TSRs of a set of SCPs are each unique and non-overlapping; the TSRs of a set of SCPs do not overlap with each other (e.g., all hybridize to different, non-overlapping regions of the genome.
[0305] In general, the GC content of each TSR is greater than 50%, and the Tm (melting temperature) for each TSR is typically within a set range (e.g., greater than 55 degrees C., greater than 56 degrees C., greater than 57 degrees C., greater than 58 degrees C., greater than 59 degrees C., between about 55-65 degrees C., between about 57-67 degrees C., between about 57-65 degrees C., etc.).
[0306] Finally, as will be described in greater detail below, each TRS probe may be selected to minimize their predicted secondary structure. The secondary structure may be calculated or approximated by calculating a minimum free energy (MFE) in kcal/mol and determining if the cognate restriction site for a type IIs restriction enzyme within the TSR prove is not part of a hairpin loop or otherwise blocked or disrupted by second structure. This determination may also or alternatively be made at the SCP stage, once the putative TSR has been combined with the SSR.
[0307] The sequence-specific region (SSR) may have a polynucleotide sequence extending greater than 40 bp (e.g., greater than 45 bp, greater than 50 bp, greater than 55 bp, between 40 and 100 bp, between 40-90 bp, between 40-80 bp, between 50-100 bp, between 50-90 bp, between 50-80 bp, etc.). The polynucleotide sequence of the SSR sequence does not occur in the target genome. For example, the SSR may have a sequence that is engineered and compared to the target genome (e.g., the human genome, including and non-chromosomal DNA, such as mitochondrial DNA) to confirm that it does not appear within the sequence of the target genome (so that it would not hybridize to the target genome). The SSR may have a CG content of greater than 50%.
[0308] In general, the end of the TSR nearest to the cut site of the type IIs restriction enzyme region (e.g., a 5′ end) may be joined to the SSR. As already mentioned above, the cognate restriction site is not part of a hairpin structure in the assembled SCP.
[0309] In general the SSR of each SCP in a set of SCPs may be the same polynucleotide sequence and may share a set of forward and backward primers that hybridize with high selectively and efficiency. For example, the SSR may include a forward primer region and a reverse primer region.
[0310] The SSR may include one or more regions that prevent or block enzymatic extension (e.g., polymerase) based on a modification of the polynucleotide sequence. For example, any of the SSRs may include a forward primer region and a reverse primer region and an abasic region between the forward primer region and the reverse primer region.
[0311] An abasic region (also referred to as an abasic site or an AP site, or apurinic/apyrimidinic site) may occur naturally by hydrolysis of nucleoside residues in DNA to generate abasic sites. Most commonly, dA sites are hydrolyzed causing depurination and leading to abasic residues. These sites may be included in engineered polynucleotides, including the SSRs described herein. For example, a 1,2′-Dideoxyribose modification can be used to insert a single base space into a DNA oligonucleotide sequence, which replicates the loss of base pairing ability by a nucleotide, which can occur naturally through depurination or other mechanisms. A C3 Spacer phosphoramidite (iSpC3) can be used as a spacer arm in an oligonucleotide. The compounds may be added in multiple additions when a longer spacer is required. 3′-Spacer C3 CPG or phosphoramidite may also act as a blocker of exonuclease and polymerase and may be used to introduce a stable abasic site within an oligonucleotide. For example, dSpacer CE Phosphoramidite (5′-O-Dimethoxytrityl-1′,T-Dideoxyribose-3′-[(2-cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite) may be used. dSpacer is also referred to as an abasic site, tetrahydrofuran (THF), or apurinic/apyrimidinic (AP) site. It can be incorporated into an oligonucleotide sequence internally, or at the 5′ or 3′ end of a sequence. Any number of spacers may be used to form the abasic region/site (e.g., 1, 2, 3, 4, 5, etc.).
[0312]
[0313]
[0314] SEQ ID NOS. 8, 12, 15, 18, 21, 24, 27, 30, 33, 36, 39, 42, 45, 48, 51, 54, 57, 60, 63, 66, 69, 72, or 75 illustrate examples of an TSRs that were engineered using the methods described herein.
[0315] This GCF_000001405.26_GRCh38_genomic.fna features all the other short contigs (˜430 of them) that have not been assigned yet to a chromosome. In the example shown in
[0316] The target genome is scanned for all locations within the genome having the target type IIs restriction site 2402 (e.g., in this example, BsaI), to provide genomic coordinates for these sites. A fixed region upstream and downstream (e.g., 16 nucleotides upstream, 5′, and 5 nucleotides downstream, 3′) 2404 were chosen from the genomic location, and putative TSRs were then vetted to remove any that included a duplicate 2406, since each TSR must be entirely unique within the entire genome 2408. The putative TSRs were then filtered to identify just those that are present on the target chromosome 2410, in this example, chromosome 21. The resulting putative TSRs were then examined to identify their Tm (e.g., using a homoduplex Tm prediction model) 2412. Any putative TSRs having a predicted Tm of less than 58 degrees C. were eliminated. The resulting (“high Tm” putative TSRs were then examined to confirm that they were unique within about 80% identity with the entire genome; for example a mismatch of more than two (or in some cases more than 3, more than 4, mover than 5, etc.) was allowed to remain a putative TSR, otherwise they were eliminated 2416. Calculations of how many hits a given candidate e.g. chr 21 BsaI TSR, has against the rest of a genome with a given number of mismatches may be used as a filter for potential cross-reactivity. Any overlapping putative TSRs were also eliminated 2418. The similarity between all putative TSRs 2420 and the rest of the genome TSR was also estimated based on their string distance using an approximate string matching method. The putative TSR(s) were then combined with the intended SSR (see below) 2422 to form putative SCPs, and the putative SCP sequences were examined to eliminate any having secondary structure 2424 that would inhibit enzymatic activity by the target type Hs restriction enzyme (e.g., BsaI).
[0317] Secondary structure may be estimated and calculated using any appropriate method, a variety of which are known and available to those of skill in the art. For example, secondary structure and hairpin folding at or near the type Hs restriction enzyme cognate site may be determined using RNAFold ViennaRNA Package. For example,
[0318] In the example shown in
[0319] As mentioned, any appropriate type II restriction enzyme may be used. These enzymes may be selected from the list of commercially available enzymes having a recognition site that is 6 or 7 bases long. The enzymes should be heat inactivated, and preferably recombinant. The recognition site for the enzyme should occur often enough (ideally, e.g., more than about 500 times) on each target chromosome (e.g., chromosome 13, 18 and/or 21), which may support assay statistical requirements. Preferably the enzyme may hybridize at an elevated temperature (e.g., 45 degrees C., 50 degrees C., etc.), such as enzymes which work well around 50° C. Restriction enzymes that have a low salt tolerance and/or that cannot be heat inactivated and/or that had a high “star activity” (e.g., a form of lack of unspecific cleavage activity) were eliminated as options for use in the TSR design.
[0320] A parallel process may also be used to select the SSR. Although a single SSR may be used, multiple putative SSRs may be formed. For example, In
[0321] The SSR identified in SEQ ID NO. 1 shows one example of an SSR; alternative SSRs are shown in SEQ ID NOS. 2-5. For example, SEQ ID NO. 2 shows an SSR portion. This portion may be combined with the SSR portion of SEQ ID NO. 3 or SEQ ID NO. 5 (e.g., connected by an abasic region as shown in
[0322] For example, detection may be performed using digital PCR (dPCR). In some examples the system may include TaqMan probes. The TaqMan probe site may be engineered on the SSR so that the TaqMan probe does not interfere with the method described herein. For example, TaqMan probes may be positioned at the 3′ end of the SSR, rather than at the SSR 5′ end or straddling the cut site, both of which may interfere with the method. For example, if the TaqMan probe straddles the cut site, the TaqMan probe may also act to bridge the forward and reverse primers, resulting in undesired amplification (and therefore undesired PCR signal). Alternatively, it has been found that positioning the TaqMan probe site at the SSR 5′ end may result in sequestering of the TaqMan probe.
[0323]
[0324] An SCP may be an approximately 90 nucleotide long single strand DNA with a high GC content per design, which is prone to adopt stable secondary structure. Examples of SCPs determined as described above for human chromosome 21 are provided in SEQ ID NOS. 8, 12, 15, 18, 21, 24, 27, 30, 33, 36, 39, 42, 45, 48, 51, 54, 57, 60, 63, 66, 69, 72, or 75. Partial SCPs, which may be linked, e.g., via an abasic region, to additional SSR regions (such as shown in SEQ ID NO. 10) are shown in SEQ ID NOS. 11, 14, 17, 20, 23, 26, 29, 32, 35, 38, 41, 44, 47, 50, 53, 56, 59, 62, 65, 68, 71, 74 and 77.
Further Examples
[0325] The various methods and steps described herein were examined using sample primers and template as described herein. One example of a workflow (referred to herein as a “full primer away circularization workflow”) was demonstrated using an SCP (“SCR0083”) which includes a TSR at the 3′ end with a BspQI recognition sequence and is specific for Chromosome 21 at site NC_000021.9_27305245.
[0326] In this example, SCP SCR0083 (0.5 nM; SEQ ID No. 83; IDT) was annealed to target SCR0113 (SEQ ID No. 84, 3.4e5 copy; IDT) in 10 mM Tris-HCl, 1 mM Na2-EDTA, 100 mM NaCl, 10 mM MgCl2, pH 8.0 (40u1) by heating to 95° C. for 5 min, cooling at 0.015 C/sec to 50° C. and holding for 5 min. The SCP annealed to its TSR synthetic target was digested with BspQI (20 U; NEB cat #R0712) for 60 min at 50° C. in a final digestion buffer comprised of 50 mM Tris-HCl, 100 mM NaCl, 10 mM MgCl2, 100 mg/ml BSA. The digestion was followed by a 20-minute heat inactivation step at 80° C. The SSR of the digested SCP was then ligated by treatment with Hi-T4 ligase (60 U; NEB cat #M2622) for 70 minutes at 50° C. in 50 mM Tris-HCl, 20 mM MgCl2, 1.5 mM ATP, 0.5 nM Lock Probe SCR0080 (SEQ ID No. 85; IDT). The ligation was followed by a 10 min heat inactivation step at 65° C. The circularized SSR was then amplified and detected by real-time PCR using TaqMan Fast Advanced MasterMix (ThermoFisher cat #4444556), 200 nM forward and reverse primers (SSR1_FP, SEQ ID No. 86; SSR1_RP, SEQ ID No. 87; IDT) and 100 nM TaqMan probe SCR0078 (SEQ ID No. 88; IDT) by denaturation at 95° C. for 20″ followed by 40 cycles of 95° C. for 1″ and 60° C. for 20″ QuantStudio Real Time PCR System (ThermoFisher Scientific).
[0327] Amplification plots for digested, circularized chromosome 21-specific Sequence Conversion Probe SCR0083 (SEQ ID No. 83) using the “primer away” qPCR assay for SCR0083 and a no template control (NTC) are shown in
[0328] This workflow illustrates the multiple, sequential steps completed successfully, starting with the annealing of the SCP to target, then cleaving the SCP by the specific restriction endonuclease (BspQI in this example), thereby separating the TSR from the SSR, and ligating the resulting cleaved SSR (cSSR) into a covalently closed single-stranded circular DNA and, finally, detecting the circularized product by “primer away” qPCR.
[0329] In some examples the TSR may be cut so that there is no TSR (no target sequence) in the resulting cSSR.
[0330] Multiplex SCR with qPCR Readout was also examined. Highly efficient and specific amplification and detection of two SSRs with a universal primer pair and two different TaqMan probes, one labeled with FAM, the other labeled with HEX, was demonstrated. TaqMan PCR was used as the detection method in this example. The respective probes are highly specific for their intended SSR and do not cross-react with the unintended SSR or with human genomic DNA.
[0331] In this example, synthetic Signal Specific Region (SSR) oligonucleotides SCR0076 (SEQ ID No. 89; IDT) and SCR0136 (SEQ ID No. 90; IDT) were amplified in the presence of common forward and reverse primers and detected with either the cognate or non-cognate TaqMan probe, i.e. SCR0078 (SEQ ID No. 88) for SCR0076 (SEQ ID No. 89) or SCR0134 (SEQ ID No. 91) for SCR0136 (SEQ ID No. 90). The SSR targets correspond to the SSRs in the Sequence Conversion Probe (SCP) found in SCR0083 (SEQ ID No. 83) (for chromosome 21) and SCR0133 (SEQ ID No. 99, for chromosome 18) after cleavage and circularization via ligation.
[0332] Additionally, specificity was examined by using either water (no template control; NTC) or in the presence of 100 ng of human genomic DNA derived from HeLa cell line (gDNA; NEB; cat #N4006). The real-time PCR was performed using TaqMan Fast Advanced MasterMix (ThermoFisher cat #4444556), 200 nM forward and reverse primers (SSR1_FP, SEQ ID No. 86; SSR1_RP, SEQ ID No. 87; IDT) and 100 nM TaqMan probe SCR0078 (SEQ ID No. 88; IDT) or SCR0134 (SEQ ID No. 91; IDT) by denaturation at 95° C. for 20″ followed by 40 cycles of 95° C. for 1″ and 60° C. for 20″ on the QuantStudio Real Time PCR System (ThermoFisher Scientific).
[0333]
[0334] A full workflow with multiplex detection of targes on two different human chromosomes (with dPCR Readout) was also performed. Specific multiplex detection of targets on two human chromosomes by two SCPs, one specific for chromosome 21 (SCR0083; site NC_000021.9_27305245), the other specific for chromosome 18 (SCR0133; site NC_000018.10_647934), both mapped to GRCh38.p13 Primary Assembly_647934. Digital PCR was used as the detection method for absolute quantitation. Human genomic DNA was not detected when in the absence of the SCPs.
[0335] Sequence Conversion Probes (SCPs) SCR0083 (0.5 nM; SEQ ID No. 83; IDT) specific for chromosome 21 (NC_000021.9_27305245) and SCR0133 (0.5 nM; SEQ ID No. 99; IDT) specific for chromosome 18 (NC_000018.10_647934) were annealed with 100 ng of human genomic DNA derived from HeLa cell line (NEB; cat #N4006) in 10 mM Tris-HCl, 1 mM Na2-EDTA, 100 mM NaCl, 10 mM MgCl2, pH 8.0 (40 ml) by heating to 95° C. for 5 min, cooling at 0.015 C/sec to 50° C. and holding for 5 min. The SCPs annealed to genomic DNA were digested with BspQI (20 U; NEB cat #R0712) for 60 min at 60° C. in a final digestion buffer comprised of 50 mM Tris-HCl, 100 mM NaCl, 10 mM MgCl2, 100 mg/ml BSA. The digestion was followed by a 20-minute heat inactivation step at 80° C. The SSRs of the digested SCPs were then ligated by treatment with Hi-T4 ligase (60 U; NEB cat #M2622) for 70 minutes at 50° C. in 50 mM Tris-HCl, 20 mM MgCl2, 1.5 mM ATP, 0.5 nM of each Lock Probe SCR0080 (SeqID 3; IDT) and SCR0135 (SEQ ID No. 93; IDT). The ligation was followed by a 10 min heat inactivation step at 65° C. The circularized SSRs were then amplified and detected by digital PCR using QlAcuity Probe PCR Kit (Qiagen cat #250101), 800 nM forward and reverse primers (SSR1_FP, SEQ ID No. 85; SSR1_RP, SEQ ID No. 87; IDT) and 400 nM TaqMan probe SCR0078 (SEQ ID No. 88; IDT) and SCR0134 (SEQ ID No. 91; IDT) by denaturation at 95° C. for 2′ followed by 40 cycles of 95° C. for 15″ and 60° C. for 30″ on the QlAcuity digital PCR System (Qiagen, Netherlands) using a 26K 24-well nanoplate. The positive and negative partitions in the FAM and HEX channel were quantified using the QlAcuity Software Suite 1.2.18 (Qiagen) and converted into absolute concentration (copies/ml) as shown in Table 2. As a negative control genomic DNA was treated in the absence of SCPs to the full workflow.
[0336] The absolute quantification for chromosome 21—and chromosome 18-derived products are shown in the Table 2. The ratio is not 1:1 likely because HeLa cell lines are known to be hyperploid. The negative control with HeLa gDNA only (no SCPs) yielded 0 positive partitions and 0 copy/ul in both the Fam and Hex channels.
TABLE-US-00004 TABLE 2 Conc Partitions Partitions Partitions Sample Target (copies/μL) (valid) (positive) (negative) SCR083 and Chr 300.4 25480 5671 19809 SCR0133 on 18 HeLa gDNA Chr 1821.2 25479 19941 5538 21
[0337] A two-part assay with an SCP, including Linear Ligation and qPCR was also performed, similar to
[0338]
[0339] When a feature or element is herein referred to as being “on” another feature or element, it can be directly on the other feature or element or intervening features and/or elements may also be present. In contrast, when a feature or element is referred to as being “directly on” another feature or element, there are no intervening features or elements present. It will also be understood that, when a feature or element is referred to as being “connected”, “attached” or “coupled” to another feature or element, it can be directly connected, attached or coupled to the other feature or element or intervening features or elements may be present. In contrast, when a feature or element is referred to as being “directly connected”, “directly attached” or “directly coupled” to another feature or element, there are no intervening features or elements present. Although described or shown with respect to one embodiment, the features and elements so described or shown can apply to other embodiments. It will also be appreciated by those of skill in the art that references to a structure or feature that is disposed “adjacent” another feature may have portions that overlap or underlie the adjacent feature.
[0340] Terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. For example, as used herein, the singular forms “a”, “an” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will be further understood that the terms “comprises” and/or “comprising,” when used in this specification, specify the presence of stated features, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, steps, operations, elements, components, and/or groups thereof. As used herein, the term “and/or” includes any and all combinations of one or more of the associated listed items and may be abbreviated as “/”.
[0341] Spatially relative terms, such as “under”, “below”, “lower”, “over”, “upper” and the like, may be used herein for ease of description to describe one element or feature's relationship to another element(s) or feature(s) as illustrated in the figures. It will be understood that the spatially relative terms are intended to encompass different orientations of the device in use or operation in addition to the orientation depicted in the figures. For example, if a device in the figures is inverted, elements described as “under” or “beneath” other elements or features would then be oriented “over” the other elements or features. Thus, the exemplary term “under” can encompass both an orientation of over and under. The device may be otherwise oriented (rotated 90 degrees or at other orientations) and the spatially relative descriptors used herein interpreted accordingly. Similarly, the terms “upwardly”, “downwardly”, “vertical”, “horizontal” and the like are used herein for the purpose of explanation only unless specifically indicated otherwise.
[0342] Although the terms “first” and “second” may be used herein to describe various features/elements (including steps), these features/elements should not be limited by these terms, unless the context indicates otherwise. These terms may be used to distinguish one feature/element from another feature/element. Thus, a first feature/element discussed below could be termed a second feature/element, and similarly, a second feature/element discussed below could be termed a first feature/element without departing from the teachings of the present invention.
[0343] Throughout this specification and the claims which follow, unless the context requires otherwise, the word “comprise”, and variations such as “comprises” and “comprising” means various components can be co-jointly employed in the methods and articles (e.g., compositions and apparatuses including device and methods). For example, the term “comprising” will be understood to imply the inclusion of any stated elements or steps but not the exclusion of any other elements or steps.
[0344] In general, any of the apparatuses and methods described herein should be understood to be inclusive, but all or a sub-set of the components and/or steps may alternatively be exclusive, and may be expressed as “consisting of” or alternatively “consisting essentially of” the various components, steps, sub-components or sub-steps.
[0345] As used herein in the specification and claims, including as used in the examples and unless otherwise expressly specified, all numbers may be read as if prefaced by the word “about” or “approximately,” even if the term does not expressly appear. The phrase “about” or “approximately” may be used when describing magnitude and/or position to indicate that the value and/or position described is within a reasonable expected range of values and/or positions. For example, a numeric value may have a value that is +/−0.1% of the stated value (or range of values), +/−1% of the stated value (or range of values), +/−2% of the stated value (or range of values), +/−5% of the stated value (or range of values), +/−10% of the stated value (or range of values), etc. Any numerical values given herein should also be understood to include about or approximately that value, unless the context indicates otherwise. For example, if the value “10” is disclosed, then “about 10” is also disclosed. Any numerical range recited herein is intended to include all sub-ranges subsumed therein. It is also understood that when a value is disclosed that “less than or equal to” the value, “greater than or equal to the value” and possible ranges between values are also disclosed, as appropriately understood by the skilled artisan. For example, if the value “X” is disclosed the “less than or equal to X” as well as “greater than or equal to X” (e.g., where X is a numerical value) is also disclosed. It is also understood that the throughout the application, data is provided in a number of different formats, and that this data, represents endpoints and starting points, and ranges for any combination of the data points. For example, if a particular data point “10” and a particular data point “15” are disclosed, it is understood that greater than, greater than or equal to, less than, less than or equal to, and equal to 10 and 15 are considered disclosed as well as between 10 and 15. It is also understood that each unit between two particular units are also disclosed. For example, if 10 and 15 are disclosed, then 11, 12, 13, and 14 are also disclosed.
[0346] Although various illustrative embodiments are described above, any of a number of changes may be made to various embodiments without departing from the scope of the invention as described by the claims. For example, the order in which various described method steps are performed may often be changed in alternative embodiments, and in other alternative embodiments one or more method steps may be skipped altogether. Optional features of various device and system embodiments may be included in some embodiments and not in others. Therefore, the foregoing description is provided primarily for exemplary purposes and should not be interpreted to limit the scope of the invention as it is set forth in the claims.
[0347] The examples and illustrations included herein show, by way of illustration and not of limitation, specific embodiments in which the subject matter may be practiced. As mentioned, other embodiments may be utilized and derived there from, such that structural and logical substitutions and changes may be made without departing from the scope of this disclosure. Such embodiments of the inventive subject matter may be referred to herein individually or collectively by the term “invention” merely for convenience and without intending to voluntarily limit the scope of this application to any single invention or inventive concept, if more than one is, in fact, disclosed. Thus, although specific embodiments have been illustrated and described herein, any arrangement calculated to achieve the same purpose may be substituted for the specific embodiments shown. This disclosure is intended to cover any and all adaptations or variations of various embodiments. Combinations of the above embodiments, and other embodiments not specifically described herein, will be apparent to those of skill in the art upon reviewing the above description.
TABLE-US-00005 SEQUENCE LISTING SEQ ID NO. 1 SSR Example 1 GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGA GCCCCGTCGT SEQ ID NO. 2 SSR Example 1-Before abasic region GCTCGGACCCTGCTGTATCGTTCGCGGGCCT SEQ ID NO. 3 SSR Example 1-After abasic region CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGT SEQ ID NO. 4 SSR Example 2 GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGA GCCCCGTCGTGATTACCCGCTGA SEQ ID NO. 5 SSR Example 2 CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTGATTACCCGCTGA SEQ ID NO. 6 Forward primer for SSR Examples 1 and 2 CCTCCTCAAAAACGCGCC SEQ ID NO. 7 Reverse primer for SSR Examples 1 and 2 AGGCCCGCGAACGATACA SEQ ID NO. 8 TSR example 1 Chromosome 21 unique site with BSAI recognition site TAAACCCTAAGAGACCAGATAAAGCT SEQ ID NO. 9 SCP combines Example 1 SSR with TSR example 1 Chromosome 21 unique site with BSAI GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGA GCCCCGTCGTTAAACCCTAAGAGACCAGATAAAGCT SEQ ID NO. 10 Partial SCP-Example 1 SSR before abasic site GCTCGGACCCTGCTGTATCGTTCGCGGGCCT SEQ ID NO. 11 Partial SCP-Example 1 SSR with TSR example 1 Chromosome 21 unique site with BSAI- after abasic site CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTTAAACCCTAAGAGACCAGATAA AGCT SEQ ID NO. 12 TSR example 2 Chromosome 21 unique site with BSAI TCCCCTCTGTGAGACCTTCAGAAATC SEQ ID NO. 13 SCP combines Example 1 SSR with TSR example 2 Chromosome 21 unique site with BSAI GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGA GCCCCGTCGTTCCCCTCTGTGAGACCTTCAGAAATC SEQ ID NO. 14 Partial SCP-Example 1 SSR with TSR example 2 Chromosome 21 unique site with BSAI- after abasic site (can be combined with SEQ ID NO. 10 for full SCP with abasic site between) CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTTCCCCTCTGTGAGACCTTCAG AAATC SEQ ID NO. 15 TSR example 3 Chromosome 21 unique site with BSAI TAATCACCGTGAGACCAGTTAATGAG SEQ ID NO. 16 SCP combines Example 1 SSR with TSR example 3 Chromosome 21 unique site with BSAI GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGA GCCCCGTCGTTAATCACCGTGAGACCAGTTAATGAG SEQ ID NO. 17 Partial SCP-Example 1 SSR with TSR example 3 Chromosome 21 unique site with BSAI- after abasic site (can be combined with SEQ ID NO. 10 for full SCP with abasic site between) CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTTAATCACCGTGAGACCAGTTAA TGAG SEQ ID NO. 18 TSR example 4 Chromosome 21 unique site with BSAI AATTTTATGAGAGACCCTAGACAGCT SEQ ID NO. 19 SCP combines Example 1 SSR with TSR example 4 Chromosome 21 unique site with BSAI GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGA GCCCCGTCGTAATTTTATGAGAGACCCTAGACAGCT SEQ ID NO. 20 Partial SCP-Example 1 SSR with TSR example 4 Chromosome 21 unique site with BSAI- after abasic site (can be combined with SEQ ID NO. 10 for full SCP with abasic site between) CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTAATTTTATGAGAGACCCTAGAC AGCT SEQ ID NO. 21 TSR example 5 Chromosome 21 unique site with BSAI GAATTCTTCTGAGACCTGAAGTGTAG SEQ ID NO. 22 SCP combines Example 1 SSR with TSR example 5 Chromosome 21 unique site with BSAI GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGA GCCCCGTCGTGAATTCTTCTGAGACCTGAAGTGTAG SEQ ID NO. 23 Partial SCP-Example 1 SSR with TSR example 5 Chromosome 21 unique site with BSAI- after abasic site (can be combined with SEQ ID NO. 10 for full SCP with abasic site between) CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTGAATTCTTCTGAGACCTGAAGT GTAG SEQ ID NO. 24 TSR example 6 Chromosome 21 unique site with BSAI GTCACAAAAAGAGACCCTGTGTTTAA SEQ ID NO. 25 SCP combines Example 1 SSR with TSR example 6 Chromosome 21 unique site with BSAI GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGA GCCCCGTCGTGTCACAAAAAGAGACCCTGTGTTTAA SEQ ID NO. 26 Partial SCP-Example 1 SSR with TSR example 6 Chromosome 21 unique site with BSAI- after abasic site (can be combined with SEQ ID NO. 10 for full SCP with abasic site between) CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTGTCACAAAAAGAGACCCTGTG TTTAA SEQ ID NO. 27 TSR example 7 Chromosome 21 unique site with BSAI AGCCAAAAAAGAGACCTTTCAACCCA SEQ ID NO. 28 SCP combines Example 1 SSR with TSR example 7 Chromosome 21 unique site with BSAI GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGAGCCCC GTCGTAGCCAAAAAAGAGACCTTTCAACCCA SEQ ID NO. 29 Partial SCP-Example 1 SSR with TSR example 7 Chromosome 21 unique site with BSAI- after abasic site (can be combined with SEQ ID NO. 10 for full SCP with abasic site between) CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTAGCCAAAAAAGAGACCTTTCAA CCCA SEQ ID NO. 30 TSR example 8 Chromosome 21 unique site with BSAI AGCCCCCTAAGAGACCAAAATAGCAA SEQ ID NO. 31 SCP combines Example 1 SSR with TSR example 8 Chromosome 21 unique site with BSAI GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGA GCCCCGTCGTAGCCCCCTAAGAGACCAAAATAGCAA SEQ ID NO. 32 Partial SCP-Example 1 SSR with TSR example 8 Chromosome 21 unique site with BSAI- after abasic site (can be combined with SEQ ID NO. 10 for full SCP with abasic site between) CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTAGCCCCCTAAGAGACCAAAAT AGCAA SEQ ID NO. 33 TSR example 9 Chromosome 21 unique site with BSAI AATTATAAATGAGACCTAGGATGCAG SEQ ID NO. 34 SCP combines Example 1 SSR with TSR example 9 Chromosome 21 unique site with BSAI GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGA GCCCCGTCGTAATTATAAATGAGACCTAGGATGCAG SEQ ID NO. 35 Partial SCP-Example 1 SSR with TSR example 9 Chromosome 21 unique site with BSAI- after abasic site (can be combined with SEQ ID NO. 10 for full SCP with abasic site between) CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTAATTATAAATGAGACCTAGGAT GCAG SEQ ID NO. 36 TSR example 10 Chromosome 21 unique site with BSAI TCACCCTAAAGAGACCAAGGAAGAAT SEQ ID NO. 37 SCP combines Example 1 SSR with TSR example 10 Chromosome 21 unique site with BSAI GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGA GCCCCGTCGTTCACCCTAAAGAGACCAAGGAAGAAT SEQ ID NO. 38 Partial SCP-Example 1 SSR with TSR example 10 Chromosome 21 unique site with BSAI- after abasic site (can be combined with SEQ ID NO. 10 for full SCP with abasic site between) CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTTCACCCTAAAGAGACCAAGGA AGAAT SEQ ID NO. 39 TSR example 11 Chromosome 21 unique site with BSAI TCTAACTAAAGAGACCATTAACGCAA SEQ ID NO. 40 SCP combines Example 1 SSR with TSR example 11 Chromosome 21 unique site with BSAI GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGA GCCCCGTCGTTCTAACTAAAGAGACCATTAACGCAA SEQ ID NO. 41 Partial SCP-Example 1 SSR with TSR example 11 Chromosome 21 unique site with BSAI- after abasic site (can be combined with SEQ ID NO. 10 for full SCP with abasic site between) CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTTCTAACTAAAGAGACCATTAAC GCAA SEQ ID NO. 42 TSR example 12 Chromosome 21 unique site with BspQI GTCCCGAGCTGAAGAGCTTTAAAACCT SEQ ID NO. 43 SCP combines Example 1 SSR with TSR example 12 Chromosome 21 unique site with BspQI GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGA GCCCCGTCGTGTCCCGAGCTGAAGAGCTTTAAAACCT SEQ ID NO. 44 Partial SCP-Example 1 SSR with TSR example 12 Chromosome 21 unique site with BspQI- after abasic site (can be combined with SEQ ID NO. 10 for full SCP with abasic site between) CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTGTCCCGAGCTGAAGAGCTTTA AAACCT SEQ ID NO. 45 TSR example 13 Chromosome 21 unique site with BspQI CCACCTAACTGAAGAGCGTTGGGAGCC SEQ ID NO. 46 SCP combines Example 1 SSR with TSR example 13 Chromosome 21 unique site with BspQI GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGA GCCCCGTCGTCCACCTAACTGAAGAGCGTTGGGAGCC SEQ ID NO. 47 Partial SCP-Example 1 SSR with TSR example 13 Chromosome 21 unique site with BspQI- after abasic site (can be combined with SEQ ID NO. 10 for full SCP with abasic site between) CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTCCACCTAACTGAAGAGCGTTG GGAGCC SEQ ID NO. 48 TSR example 14 Chromosome 21 unique site with BspQI CTTTGGTGGTGAAGAGCTGCCACCAAA SEQ ID NO. 49 SCP combines Example 1 SSR with TSR example 14 Chromosome 21 unique site with BspQI GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGA GCCCCGTCGTCTTTGGTGGTGAAGAGCTGCCACCAAA SEQ ID NO. 50 Partial SCP-Example 1 SSR with TSR example 14 Chromosome 21 unique site with BspQI- after abasic site (can be combined with SEQ ID NO. 10 for full SCP with abasic site between) CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTCTTTGGTGGTGAAGAGCTGCCAC CAAA SEQ ID NO. 51 TSR example 15 Chromosome 21 unique site with BspQI AAGTGGTCTGGAAGAGCCGGGGGAGTG SEQ ID NO. 52 SCP combines Example 1 SSR with TSR example 15 Chromosome 21 unique site with BspQI GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGA GCCCCGTCGTAAGTGGTCTGGAAGAGCCGGGGGAGTG SEQ ID NO. 53 Partial SCP-Example 1 SSR with TSR example 15 Chromosome 21 unique site with BspQI- after abasic site (can be combined with SEQ ID NO. 10 for full SCP with abasic site between) CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTAAGTGGTCTGGAAGAGCCGGG GGAGTG SEQ ID NO. 54 TSR example 16 Chromosome 21 unique site with BspQI GTCACCTAAAGAAGAGCCTTCAGGAAT SEQ ID NO. 55 SCP combines Example 1 SSR with TSR example 16 Chromosome 21 unique site with BspQI GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGA GCCCCGTCGTGTCACCTAAAGAAGAGCCTTCAGGAAT SEQ ID NO. 56 Partial SCP-Example 1 SSR with TSR example 16 Chromosome 21 unique site with BspQI- after abasic site (can be combined with SEQ ID NO. 10 for full SCP with abasic site between) CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTGTCACCTAAAGAAGAGCCTTC AGGAAT SEQ ID NO. 57 TSR example 17 Chromosome 21 unique site with BspQI TCTAGATTCTGAAGAGCCAGAACTCTA SEQ ID NO. 58 SCP combines Example 1 SSR with TSR example 17 Chromosome 21 unique site with BspQI GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGA GCCCCGTCGTTCTAGATTCTGAAGAGCCAGAACTCTA SEQ ID NO. 59 Partial SCP-Example 1 SSR with TSR example 17 Chromosome 21 unique site with BspQI- after abasic site (can be combined with SEQ ID NO. 10 for full SCP with abasic site between) CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTTCTAGATTCTGAAGAGCCAGAA CTCTA SEQ ID NO. 60 TSR example 18 Chromosome 21 unique site with BspQI CCTGGAACTGGAAGAGCCGTGTGTTTA SEQ ID NO. 61 SCP combines Example 1 SSR with TSR example 18 Chromosome 21 unique site with BspQI GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGA GCCCCGTCGTCCTGGAACTGGAAGAGCCGTGTGTTTA SEQ ID NO. 62 Partial SCP-Example 1 SSR with TSR example 18 Chromosome 21 unique site with BspQI- after abasic site (can be combined with SEQ ID NO. 10 for full SCP with abasic site between) CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTCCTGGAACTGGAAGAGCCGTG TGTTTA SEQ ID NO. 63 TSR example 19 Chromosome 21 unique site with BspQI CCCATTTGTGGAAGAGCATGAAATGGG SEQ ID NO. 64 SCP combines Example 1 SSR with TSR example 19 Chromosome 21 unique site with BspQI GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGA GCCCCGTCGTCCCATTTGTGGAAGAGCATGAAATGGG SEQ ID NO. 65 Partial SCP-Example 1 SSR with TSR example 19 Chromosome 21 unique site with BspQI- after abasic site (can be combined with SEQ ID NO. 10 for full SCP with abasic site between) CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTCCCATTTGTGGAAGAGCATGA AATGGG SEQ ID NO. 66 TSR example 20 Chromosome 21 unique site with BspQI CCAGGTCTGAGAAGAGCACATTTCATC SEQ ID NO. 67 SCP combines Example 1 SSR with TSR example 20 Chromosome 21 unique site with BspQI GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGA GCCCCGTCGTCCAGGTCTGAGAAGAGCACATTTCATC SEQ ID NO. 68 Partial SCP-Example 1 SSR with TSR example 20 Chromosome 21 unique site with BspQI- after abasic site (can be combined with SEQ ID NO. 10 for full SCP with abasic site between) CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTCCAGGTCTGAGAAGAGCACAT TTCATC SEQ ID NO. 69 TSR example 21 Chromosome 21 unique site with BspQI CAGGATGTGAGAAGAGCCTGGTGCATA SEQ ID NO. 70 SCP combines Example 1 SSR with TSR example 21 Chromosome 21 unique site with BspQI GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGA GCCCCGTCGTCAGGATGTGAGAAGAGCCTGGTGCATA SEQ ID NO. 71 Partial SCP-Example 1 SSR with TSR example 21 Chromosome 21 unique site with BspQI- after abasic site (can be combined with SEQ ID NO. 10 for full SCP with abasic site between) CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTCAGGATGTGAGAAGAGCCTGG TGCATA SEQ ID NO. 72 TSR example 22 Chromosome 21 unique site with BspQI TAAACCATTGGAAGAGCTTGTGCTCTG SEQ ID NO. 73 SCP combines Example 1 SSR with TSR example 22 Chromosome 21 unique site with BspQI GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGA GCCCCGTCGTTAAACCATTGGAAGAGCTTGTGCTCTG SEQ ID NO. 74 Partial SCP-Example 1 SSR with TSR example 22 Chromosome 21 unique site with BspQI- after abasic site (can be combined with SEQ ID NO. 10 for full SCP with abasic site between) CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTTAAACCATTGGAAGAGCTTGTG CTCTG SEQ ID NO. 75 TSR example 23 Chromosome 21 unique site with BspQI TGTGGAAGAGCATGAAATGGG SEQ ID NO. 76 SCP combines Example 1 SSR with TSR example 23 Chromosome 21 unique site with BspQI GCTCGGACCCTGCTGTATCGTTCGCGGGCCTCCTCCTCAAAAACGCGCCATATGA GCCCCGTCGTCCCATTTGTGGAAGAGCATGAAATGGG SEQ ID NO. 77 Partial SCP-Example 1 SSR with TSR example 23 Chromosome 21 unique site with BspQI- after abasic site (can be combined with SEQ ID NO. 10 for full SCP with abasic site between SEQ ID NO. 10 and this sequence) CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTCCCATTTGTGGAAGAGCATGA AATGGG SEQ ID NO. 78 Partial SCP-Example 3 partial SSR region before abasic site. GAGCCCCGTCGTGCTCGGACCCTGCTGTATCGTTCGCGGGCCT SEQ ID NO. 79 Partial SCP-Example 3 partial SSR region after abasic site. CCTCCTCAAAAACGCGCCATATGTCAGAAT SEQ ID NO. 80 Partial SCP-Example 19 TSR with partial SSR region after abasic site (can be combined with SEQ ID NO. 10 for full SCP with abasic site between the SEQ ID NO. 10 and this sequence) CCTCCTCAAAAACGCGCCATATGTCAGAATTGTGGAAGAGCATGAAATGGG SEQ ID NO. 81 Lock Probe Example 1 GAGCCCCGTCGTGCTCGGACCCTGC SEQ ID NO. 82 Lock Probe Example 2 GAGCCCCGTCGTCCCATTGCTCGGACCCTGC SEQ ID NO. 83 SCR0083 (iSpc3 are internal 3 carbon propane diol spacers, which serve as extension blockers) 5′PGCTCGGACCCTGCTGTATCGTTCGCGGGCCT/iSpC3//iSpC3/CCTCCTCAAAAAC GCGCCATATGAGCCCCGTCGTGATTACCCGCTGATGGTGAAGAGCTGCCACCAAA-3′P SEQ ID NO. 84 SCR0113 TTTGGTGGCAGCTCTTCACCA SEQ ID NO. 85 SCR0080 CAGGGTCCGAGCTCAGCGGGTAAT-3′P SEQ ID NO. 86 SSR1_FP CCT CAA AAA CGC GCC SEQ ID NO. 87 SSR1_RP AGG CCC GCG AAC GAT ACA SEQ ID NO. 88 SCR0078 5′ FAM-TC AGC GGG T/ZEN Quencher/A ATC ACG GGG CTC-3′ IowaBlackQuencher SEQ ID NO. 89 SCR0076 CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTGATTACCCGCTGAGCTCGGAC CCTGCTGTATCGTTCGCGGGCCT SEQ ID NO. 90 SCR0136 CCTCCTCAAAAACGCGCCATATAACGGGCGGACGGCGTAGTTCTGCTGCTCGGAC CCTGCTGTATCGTTCGCGGGCCT SEQ ID NO. 91 SCR0134 5′ HEX-AG CAG AAC T/ZEN/A CGC CGT CCG CCC GTT-3′ IowaBlack Quencer SEQ ID NO. 92 (iSpc3 are internal 3 carbon propane diol spacers, which serve as extension blockers) 5′P-GCTCGGACCCTGCTGTATCGTTCGCGGGCCT\iSpc3\\iSpc3\CCTCCTCAAAAAC GCGCCATATAACGGGCGGACGGCGTAGTTCTGCTAAGAGAAGAGCAAAAACCCTA-3′P SEQ ID NO. 93 SCR0135 CAGGGTCCGAGCAGCAGAACTACG-3′P SEQ ID NO. 94 5′-HEX-AG CAG AAC T/ZEN/A CGC CGT CCG CCC GTT-3′ IowaBlack Quencher SEQ ID NO. 95 SCR0142 5′P-TGTATCGTTCGCGGGCCT SEQ ID NO. 96 SCR0142a 5′P-TGTATCGTTCGCGGGCCT-3′P SEQ ID NO. 97 SCR0146 CCTCCTCAAAAACGCGCCATATGAGCCCCGTCGTGATTACCCGCTGA SEQ ID NO. 98 SCR0147 GCGAACGATACATCAGCGGGTA-3P SEQ ID No. 99 SCR0133 (iSpc3 are internal 3 carbon propane diol spacers, which serve as extension blockers) 5′P-GCTCGGACCCTGCTGTATCGTTCGCGGGCCT\iSpc3\\iSpc3\CCTCCTCAAAAAC GCGCCATATAACGGGCGGACGGCGTAGTTCTGCTAAGAGAAGAGCAAAAACCCTA-3′P