METHOD FOR DETECTING THE QUANTITY OF BIOMARKER AND IDENTIFYING DISEASE STATUS
20200141941 ยท 2020-05-07
Inventors
- PEI-JER CHEN (Taipei, TW)
- SHIOU-HWEI YEH (Taipei, TW)
- CHIAO-LING LI (Taipei, TW)
- DING-SHINN CHEN (Taipei, TW)
- Ya-Chun Wang (New Taipei, TW)
- Sheng-Tai Tzeng (New Taipei, TW)
Cpc classification
G01N2800/085
PHYSICS
C12Q2600/112
CHEMISTRY; METALLURGY
G01N2800/52
PHYSICS
C12Q2600/106
CHEMISTRY; METALLURGY
G01N2800/56
PHYSICS
C12Q1/6809
CHEMISTRY; METALLURGY
C12Q1/6806
CHEMISTRY; METALLURGY
International classification
C12Q1/6806
CHEMISTRY; METALLURGY
Abstract
The present invention provides a method of identifying a viral-host junction sequence from a subject with a hepatocellular carcinoma caused by chronic infection of hepatitis B virus. The viral-host junction sequence has a length of less than 200 bps and comprises a hepatitis B viral genome sequence and a host genome sequence.
Claims
1. A method of identifying a viral-host junction sequence from a subject with a hepatocellular carcinoma caused by chronic infection of hepatitis B virus, comprising: obtaining a cfDNA in a serum or plasma from a subject with a hepatocellular carcinoma caused by chronic infection of HBV, before or after a tumor resection of the subject; ligating the cfDNA with an adaptor; amplifying the cfDNA ligated with the adaptor by using a plurality of primers, wherein each of the primers is complementary to a sequence of the corresponding adaptor; hybridizing at least two polynucleotide probes with the cfDNA ligated with the adaptor; capturing and isolating a target ctDNA in the cfDNA hybridized with the polynucleotide probes; sequencing the target ctDNA by a sequencing system, wherein the target ctDNA has a viral-host junction sequence, and the viral-host junction sequence has a length of less than 200 bps and comprises a hepatitis B viral genome sequence and a host genome sequence.
2. The method according to claim 1, further comprising quantifying a concentration of the viral-host junction sequence.
3. The method according to claim 2, wherein quantifying the concentration of the viral-host junction sequence is performed by droplet digital PCR (ddPCR).
4. The method according to claim 2, wherein the concentration of the viral-host junction sequence comprises a copy number in each millimeter of the plasma or the serum.
5. The method according to claim 2, further comprising showing the concentration of the viral-host junction sequence in the target ctDNA before tumor resection and the concentration of the viral-host junction sequence in the target ctDNA after the tumor resection.
6. The method according to claim 1, wherein the target ctDNA is enriched by the polynucleotide probes complementary to a part of the sequence derived from hepatitis B viral genome.
7. The method according to claim 1, wherein the polynucleotide probes cover the whole hepatitis B viral genome sequence.
8. The method according to claim 1, wherein the cfDNA ligated with the corresponding adaptor comprises a first ctDNA with one end thereof ligated with the corresponding adaptor and a second ctDNA with two ends thereof ligated with the corresponding adaptors.
9. A product for identifying a viral-host junction sequence from a subject with a hepatocellular carcinoma caused by chronic infection of hepatitis B virus, comprising: a cfDNA extraction kit configured to extract a cfDNA in a serum or plasma from a subject with a hepatocellular carcinoma caused by chronic infection of HBV; an adaptor configured to ligate to an end of the extracted cfDNA; a nucleotide amplification kit comprising a plurality of primers complementary to a sequence of the adaptor; at least two polynucleotide probes complementary to a part of the sequence derived from hepatitis B viral genome, cover the whole hepatitis B viral genome sequence, and configured to hybridize with the amplified cfDNA ligated with the adaptor; a hybridization kit comprising a bead and biotin, configured to capture and isolate a target ctDNA in the cfDNA hybridized with the polynucleotide probes; a sequencing system configured to sequence a viral-host junction sequence of less than 200 bps in the target ctDNA.
10. The product according to claim 9, further comprises a droplet digital PCR (ddPCR) kit configured to quantify a concentration of the viral-host junction sequence.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0009] Implementations of the present technology will now be described, by way of example only, with reference to the attached figures. The aspect of the present disclosure can be better understood with reference to the following figures. The components in the figures are not necessarily drawn to scale with the emphasis instead being placed upon clearly illustrating the principles of the present disclosure.
[0010]
[0011]
[0012]
[0013]
[0014]
[0015]
[0016]
[0017]
DETAILED DESCRIPTION
[0018] It will be appreciated that for simplicity and clarity of illustration, where appropriate, reference numerals have been repeated among the different figures to indicate corresponding or analogous elements. In addition, numerous specific details are set forth in order to provide a thorough understanding of the embodiments described herein. However, it will be understood by those of ordinary skill in the art that the embodiments described herein can be practiced without these specific details. In other instances, methods, procedures and components have not been described in detail so as not to obscure the related relevant feature being described. The drawings are not necessarily to scale and the proportions of certain parts may be exaggerated to better illustrate details and features. The description is not to be considered as limiting the scope of the embodiments described herein.
[0019] Several definitions that apply throughout this disclosure will now be presented.
[0020] The term coupled is defined as connected, whether directly or indirectly through intervening components, and is not necessarily limited to physical connections. The connection can be such that the objects are permanently connected or releasably connected. The term comprising, when utilized, means including, but not necessarily limited to; it specifically indicates open-ended inclusion or membership in the so-described combination, group, series and the like.
[0021] The term subject refers to an object of studies or experimental samples, may include human, monkey, groundhog or any animal. The term integration or integrant is making up or being a part of a whole. The integrant herein means single nucleic acid base pair, a part of nucleic acid sequence, a fragment of nucleic acid sequence or a gene sequence from a viral chromosome embed in a host chromosome sequence. The term junction is a combination of a fragment of a viral chromosome sequence and a part of a host chromosome sequence. The term read number refers to the times of next generation sequencing (NGS) reads. Each NGS read is analyzed to see whether it contained junction sequences or not. The reads containing junction sequences are then assembled into the finalized junction sequences. Therefore, each finalized junction sequence is assembled by the NGS reads containing the same junction region, and the accumulated read number represents the number of reads assembled into that specific finalized junction sequence. The term amount means the relative quantification of copy numbers of the specific junction cfDNA fragment in subject serum or plasma. The specific junction sequences are obtained from NGS analyzed results. According to the sequence, the specific junction cfDNA fragment from serum or plasma are detected by droplet digital PCR (ddPCR) platform in absolute quantification which can estimate the concentration (copy numbers) of the specific junction cfDNA fragment in each milliliter serum or plasma.
[0022] The present disclosure is described in relation to the innovation of using circulating cell-free nucleic acids in a subject to identify and monitor a disease development in the subject.
[0023] Certain human tumors/cancers, for example hepatocellular carcinoma (HCC), are caused by chronic infection of hepatitis B virus (HBV). These tumors/cancers accumulate genetic alterations in their genomes. Among such alterations, a unique one is the integration of viral genome into the host genome, usually occurring in the early stage of infections. Superimposed upon these mutations are other somatic mutations that continue to occur and finally transform the cells to tumors/cancers.
[0024] As noted, when HCC cells turn over, fragmented genetic nucleic acids will be released into the body fluids, comprising blood, urine or interstitial fluid. Circulating cell free nucleic acids which floats freely in the circulatory system, for example blood circulation, usually comprises DNA fragments. These fragments include those from host genome, from viral genome, and/or from the viral integration sites, for example the viral-host junction.
[0025] Infected cells, for example HBV-infected hepatocytes, proliferate if they become cancerous and so is the amount of the viral integrants carried by the infected cells. The amount of viral integrants thus is in proportion to the size of tumor/cancer in general. In addition, as the viral integrates into the host genome at different sites, each tumor/cancer carries a unique spectrum of viral integration sites. The viral integration sites and/or the viral-host junction, are cancer/tumor-specific and can be used to identify and monitor the tumor/cancer development.
[0026] In one embodiment, human subjects are employed in the tests to illustrate the present invention. Subject 1 has a 12109 (cm) tumor diagnosed by computer tomography. According to the histological report when Subject 1 is employed in this test, Subject 1 is defined as a Grade III HCC patient. Subject 2 has a 1813.59 (cm) tumor diagnosed by computer tomography. According to the histological report, Subject 2 is defined as a Grade III HCC patient. Subject 3 has s 87.57 (cm) tumor identified by computed tomography. According to the histological report, Subject 3 is defined as a Grade III HCC patient. Subject 4 has a 222 (cm) tumor and is at Grade II. Subject 5 has a tumor smaller than 222 (cm) and the stage of the cancer development is not determined and/or not available at the time of test enrollment. Subject 6 is defined as a Grade III HCC patent, and has a tumor size of 9.1 cm.sup.3. Subject 7 is defined as a Grade II HCC patent, and has a tumor size of 11 cm.sup.3. Subject 8 has a tumor size of 3 cm.sup.3. Subject 9 has a tumor size of 11.58 cm.sup.3. Subject 10 has a tumor size of 4.6 cm.sup.3.
[0027] In another embodiment, multiple blood samples are obtained from human subjects. Each time, blood is drawn, collected in a clinically suitable container and, if needed, stored in a suitable condition for later analysis. Each blood sample is processed to obtain serum, such as by centrifugation. The cfDNA can be extracted by using a commercial kit, comprising MagNA Pure LC Total Nucleic acid Isolation kit (Roche). The tumor tissues are obtained. Genomic DNAs of tumor cells are extracted.
[0028] In another embodiment, in order to proportionally amplify the all ctDNA obtained from the human subject, the ctDNA can be attached or ligated with at least one adaptor to at least one end or both ends of the ctDNA. The ligating at least one adaptor to at least one end or both ends of the ctDNA can be performed by using TruSeq DNA Sample Preparation (Illumina), TruSeq Nano DNA LT Library Preparation Kit (Illumina), IonTorrent (Life Technologies) or other equivalent reagents.
[0029] In another embodiment, after the ctDNA is ligated with at least one adaptor, each ctDNA in the sample from the human subject can be amplified by using TruSeq DNA Sample Preparation (Illumina), TruSeq Nano DNA LT Library Preparation Kit (Illumina), IonTorrent (Life Technologies) or other equivalent reagents.
[0030] In another embodiment, polynucleotides having HBV genome sequence are used as probes here. The probes can be either designed or synthesized from the fragmentation of HBV genome. The probes portfolio can be cover the whole HBV genome sequence. The whole HBV genome sequences can be obtained from the National Center for Biotechnology Information. The probes can be synthesized by using a commercial kit, comprising Ion TargetSeq Custom Enrichment Kit (Life Technologies). The length of the probes is synthesized in a range from about 20 bases to about 200 bases. The length of the probes is preferably synthesized in a range from about 50 bases to about 180 bases. After the synthesis of the probes, each probe can be labeled, comprising biotinylated, at least one end of the probe. Biotinylation of probes can be performed by using a commercial kit, comprising ruSeq Nano DNA LT Library Preparation Kit (Illumina), Ion TargetSeq Custom Enrichment Kit (Life Technologies) or other equivalent kit. The probes can be subsequently attached or linked to a bead, for example through biotin.
[0031] In another embodiment, the all amplified ctDNA are mixed and incubated with the beads coated with the biotinylated probes to allow hybridization between the ctDNA and the biotinylated probes. The certain ctDNA that have at least partial viral sequences anneal to the complementary sequences on the probes and can form a bead-probe-ctDNA complex. The other ctDNA that does not bind to the probes float freely and does not form any complex. The bead-probe-ctDNA complexes are separated from non-binding ctDNA by, for example, centrifugation. The bead-probe-ctDNA complexes are obtained. After the bead and the probe are removed from the bead-probe-ctDNA complexes, target ctDNA can be collected. Capturing of the ctDNA hybridized with the probes can be performed by using TargetSeq Hybridization & Wash Buffer Kit (Life Technologies) or other equivalent kit.
[0032] In another embodiment, in order to sequence and identify the target ctDNA. The target ctDNA can be further sequenced using IonTorrent platform, HiSeq 2500 (Illumina) or other equivalent sequencing platform.
[0033] In another embodiment, the copy number of specific target ctDNA junctions are quantified by BIO-RAD Eva Green Droplet Digital PCR kit. The copy number of each of the specific target ctDNA junction in the subject can be determined. Other equivalent DNA quantification kits can also be used.
[0034]
[0035] Referring again to
[0036]
[0037] The amplified cfDNA can be categorized into cfDNA having only host genome sequences (cfDNA D), cfDNA having only viral genome sequences (not shown), and cfDNA having both viral and host genome sequences and thus comprising viral-host junctions 22 (cfDNA A, B, and C). According to a preferred approach, all amplified cfDNA are incubated with polynucleotide probes 23 (designed and derived from the viral genome sequence) to allow hybridization to occur. It is to be noted that the polynucleotide probes 23 may have different sequences even though all drawn alike. Referring again to
[0038] Table 1 shows top ten target sequences identified in the DNA samples obtained from Subject 1 tumor tissue. As shown, a junction sequence is inserted into the host chromosome (Host Chromosome #) at a specific integration position (Integration Position) with an accumulated read number (Accumulated Reads). Accumulated read number is obtained by NGS sequencing result. Sequences having the same junction are counted to give the number of the junction present in the sample. Each sequence includes at least partial viral chromosome sequence (underlined) and a partial host chromosome sequence to form a viral-host junction.
TABLE-US-00001 TABLE1 JunctionDataofSubject1TumorTissue Host Integra- Accu- SEQ Chromo- tion mulated ID # some# Position JunctionSequence Reads NO. 1 17 2224 GGTCTTACATAAGAGGACTC 290 1 7083 AGAAAATACTTTGTGATGAT 2 17 2225 AACTCCTTTTGAGAGCGCAG 234 2 1295 TGTTCGGTGCAGGTCCCCAG 3 1 12136 ATCATCACAAAGTATTTTCT 192 3 0041 GAGTCCTCTTATGTAAGACC 4 12 11887 TGAGGTGAGAGGATCTCTTG 115 4 6274 AGCACAGATGATGGGATAGG 5 X 5856 AAACGTCCACTTGCAGATTT 102 5 8585 TATGTAATTGGAAGTTGGGG 6 8 5689 AGCAGGAAAATATATGCCCC 106 6 5765 ACCTTCCCTTTCTCTGACCC 7 1 12147 AGGAAGACTGCCTACTCCCA 85 7 5300 CAGGCCTGAAAGCGCTCCAA 8 X 5856 AGCATTCGGGCCAGGGTTCA 67 8 3641 CTCAGGCTCAGGGCACATTG 9 16 2152 GCATTTGGTGGTCTATAAGC 38 9 5068 ACACCCGCCCACACCAATCT 10 18 7793 CAAGACCAGCCTGAGGATGA 26 10 2557 CTGTCTCTTAGAGGTGGAGA
[0039] Table 2 shows the target sequences identified in the ctDNA samples obtained from the serum of Subject 1. The ctDNA samples are obtained from Subject 1 13 days before a tumor excision. As shown, each sequence contains at least partial viral chromosome sequence (underlined) and a partial host chromosome sequence to form a viral-host junction.
TABLE-US-00002 TABLE2 JunctionDataofSubject1SerumSample Host Integra- Accu- SEQ Chromo- tion mulated ID # some# Position JunctionSequence Reads NO. 11 17 2225 CACTCCTTTTGAGAGCGCAG 94 11 1295 TGTTCAGGTGCAGGGTCCCC 12 1 12136 ATCATCACAAAGTATTTTCT 82 12 0041 GAGTCCTCTTATGTAAGACC 13 1 13727 AACAGAAAGATTCGTCCCCA 68 13 AATCCAATCTGTCTTCCATC 14 8 5689 AGCAGGAAAATATATGCCCC 62 14 5765 ACCTTCCCTTTCTCTGCCCT 15 17 2224 GGTCTTACATAAGAGGACTC 42 15 7083 AGAAAATACTTTGTGATGAT 16 16 2152 GCATTTGGTGGTCTATAAGC 31 16 5068 ACACCCGCCCACACCAATCT 17 8 5689 ATCATCCTGGGCTTTCTGCA 16 17 5953 CTTCCCATAGGTAATCAAAG 18 X 5856 AGCATTCGGGCCAGGGTTCA 9 18 3641 CTCAGGCTCAGGGCACATTG
[0040] As illustrated in Tables 1 and 2, at least #3 (from tumor sample) and #12 (from serum sample), #1 (from tumor sample) and #15 (from serum sample), and #2 (from tumor sample) and #11 (from serum sample) each pair have the same viral-host junction sequences. Similar patterns (including the relative read numbers) of viral-host junction sequences identified in both tumor DNA and ctDNA indicate that chimera ctDNA in serum is derived from tumor DNA. By selectively enriching the ctDNAs carrying at least a portion of the viral genome in the serum, viral-host junctions are identified to provide tumor-specific information about the subject.
[0041] Table 3 shows the target sequences identified in the DNA samples obtained from Subject 2 tumor tissue. As shown, each sequence contains at least partial viral genome sequence (underlined) and partial host genome sequence and forms a viral-host junction.
TABLE-US-00003 TABLE3 JunctionDataofSubject2TumorTissue Host Integra- Accu- SEQ Chromo- tion mulated ID # some# Position JunctionSequence Reads NO. 1 3 11165 ATGAAGCTATTTATAATAAA 4183 19 3312 ACAAACTTTATTAAATCTAG TTTAAATGCCTTACTCTCTT TTTTGCCTTCTGACTTCTTT CCTTCTATTCGAGATCTCCT 2 2 8027 TTTCATTGTTGCTGTTTTTC 3772 20 8757 AAATTGATTTTGGGATCCAG CCTGTTATTCTACTCCCTTA ACTTCATGGGATATGTAATT GGAAGTTGGGGTACTTTACC 3 3 11165 TCTCCCTTTAGACTTCAAAC 1269 21 3206 ACTTCAAAATATGACTTCAC TACAAAGCTTTATAGAATGC CAGCCTTCCACAGAGTATGT AAATAATGCCTAGTTTTGAA 4 2 8027 CCAGCACATTTGTCTATAAA 752 22 8655 TTTACATTCTTGGATATTAG CAAAATTGCAAACAGACCAA TTTATGCCTACAGCCTCCTA GTACAAAGACCTTTAACCTA 5 1 18987 TCCAGTGTTTGTGGGTTGAG 485 23 9551 CAGTATTATTGCATGGCCCA GTGGTGGTGGTTGATGTTCC TGGAAGTAGAGGACAAACGG GCAACATACCTTGGTAGTCC 6 1 18987 TGCAAGTGGTTGCAGTTCTT 174 24 9474 TTGCTTTGCCACCACCACTG GGCCATGCAAAACCTGCACG ATTCCTGCTCAAGGAACCTC TATGTTTCCCTCTTGTTGCT 7 20 6022 CAGGAGGAGGTGATGGACCC 169 25 7034 ACTGGGTGGTGAAGAACAGT TTCTCTTCCAAAATTACTTC CCACCCAGGTGGCCAGATTC ATCAACTCACCCCAACACAG 8 22 2694 ATCTGTAAAATTGGGATCAT 100 26 1239 CACACTTTCCTTTTATTGGG GTTTAAATGAATACCCAAAG ACAAAAGAAAATTGGTAATA GAGGTAAAAAGGGACTCAAG 9 20 6022 TGGCCGAGGCCATCTTCTAA 93 27 7112 ATAAATGTGTGGAAGAGAAA CTGTTCTTCAGTATTTGGTG TCTTTTGGAGTGTGGATTCG CACTCCTCCCGCTTACAGAC 10 5 1295 AGGACGGGTGCCCGGGTCCC 37 28 309 CAGTCCCTCCGCCACGTGGG AAGCGCGGTCCAGACCAATT TATGCCTACAGCCTCCTAGT ACAAAGACCTTTAACCTAAT
[0042] Table 4 shows the target sequences identified in the ctDNA samples obtained from serum of Subject 2. Serum samples are obtained from Subject 2 at tumor excision. As shown, each sequence contains at least partial viral genome sequence (underlined) and partial host genome sequence and forms a viral-host junction.
TABLE-US-00004 TABLE4 JunctionDataofSubject2SerumSample Host Integra- Accu- SEQ Chromo- tion mulated ID # some# Position JunctionSequence Reads NO. 11 3 11165 ATGAAGCTATTTATAATAAA 3277 29 3312 ACAAACTTTATTAAATCTAG TTTAAATGCCTTACTCTCTT TTTTGCCTTCTGACTTCTTT CCTTCTATTCGAGATCTCCT 12 20 6022 CAGGAGGAGGTGATGGACCC 642 30 7034 ACTGGGTGGTGAAGAACAGT TTCTCTTCCAAAATTACTTC CCACCCAGGTGGCCAGATTC ATCAACTCACCCCAACACAG 13 1 18987 TCCAGTGTTTGTGGGTTGAG 373 31 9551 CAGTATTATTGCATGGCCCA GTGGTGGTGGTTGATGTTCC TGGAAGTAGAGGACAAACGG GCAACATACCTTGGTAGTCC 14 2 5001 GTCCGTTGGTGGTGAACTGG 372 32 2582 GCAAGATAATTGCATGGCCC AGTGGTGGTGGTTGATGTTC CTGGAAGTAGAGGACAAACG GGCAACATACCTTGGTAGTC 15 15 4834 AGATTGGTCTATAATTTTCT 237 33 4568 TTTACTATCTTCAGTATTTG GTATCTTTGGGAGTGTGGAT TCGCACTCCTCCCGCTTACA GACCACCAAATGCCCCTATC 16 2 8027 TTTCATTGTTGCTGTTTTTC 230 34 8757 AAATTGATTTTGGGATCCAG CCTGTTATTCTACTCCCTTA ACTTCATGGGATATGTAATT GGAAGTTGGGGTACTTTACC 17 20 6022 TGGCCGAGGCCATCTTCTAA 209 35 7112 ATAAATGTGTGGAAGAGAAA CTGTTCTTCAGTATTTGGTG TCTTTTGGAGTGTGGATTCG CACTCCTCCCGCTTACAGAC 18 1 18987 TGCAAGTGGTTGCAGTTCTT 205 36 9474 TTGCTTTGCCACCACCACTG GGCCATGCAAAACCTGCACG ATTCCTGCTCAAGGAACCTC TATGTTTCCCTCTTGTTGCT 19 2 5001 GTAAGCCATTGTGGCTTTCC 205 37 2660 TGACCAGCCCACCACCACTG GGCCATGCAAAACCTGCACG ATTCCTGCTCAAGGAACCTC TATGTTTCCCTCTTGTTGCT 20 2 8027 CCAGCACATTTGTCTATAAA 64 38 8655 TTTACATTCTTGGATATTAG CAAAATTGCAAACAGACCAA TTTATGCCTACAGCCTCCTA GTACAAAGACCTTTAACCTA
[0043] As illustrated in Tables 3 and 4, at least #1 (from tumor sample) and #11 (from serum sample), #7 (from tumor sample) and #12 (from serum sample), and #5 (from tumor sample) and #3 (from serum sample) both have the same viral-host junction sequences. Similar patterns of viral-host junction sequences identified in both tumor DNA and ctDNA show that chimera ctDNA in serum is derived from tumor DNA. By selectively enriching the target ctDNA in the serum, viral-host junctions are identified to provide tumor-specific information about the subject.
[0044] Table 5 shows the target sequences identified in the DNA samples obtained from Subject 3 tumor tissue. As shown, each sequence contains at least partial viral genome sequence (underlined) and partial host genome sequence and forms a viral-host junction.
TABLE-US-00005 TABLE5 JunctionDataofSubject3Tumor Host Integra- Accu- SEQ Chromo- tion mulated ID # some# Position JunctionSequence Reads NO. 1 5 1295 GGAAATGGAGCCAGGCGCTC 3024 39 930 CTGCTGGCCGCGCACCGGGC GCCTCACACCAGAACATCGC ATCAGGACTCCTAGGACCCC TGCTCGTGTTACAGGCGGGG 2 8 11163 TCAAGCAGAAAAACCATGAA 635 40 6420 GATTTAAAAACTTGTAAATA TTTGAATGTGGGCTCCACCC CAACAGTCCCCCGTGGGGAG GGGTGAACCCTGGCCCGAAT 3 14 5259 CTAAGGGACACTACAGGAAA 354 41 1737 CCAGCCCCGAAGTGATTTCT TTTGAAATTCCAAATCTTTC TGTCCCCAATCCCCTGGGAT TCTTCCCCGATCATCAGTTG 4 9 13885 CCTCGAAGCCTGTGCCAACC 190 42 7330 TAGCCCATTCCTCAGGCTCA GGGCCTCCTCACATCTGTGC CAGCAGCTCCTCCTCCTGCC TCCACCAATCGGCAGTCAGG 5 1 6854 CATTGTTACTGTGATATGCT 188 43 9419 ATAATTATTCTCACCTTATG TGTCCAAGGAATACTAACAT TGAGATTCCCGAGATTGAGA TCTTCTGCGACGCGGCGATT 6 9 3145 ATGGAGAATACAGCACATTA 172 44 5679 TTAGGAGTAAGTTTCCTTAA ACACATTTTGATTTTTTGTA CAATATGTTCCTGTGGCAAT GTGCCCCAACTCCCAATTAC 7 17 7143 TTTGCCACCTTCCTGCCACT 138 45 4403 TTGTAGATGCAAGATCTTGG GCAAGTTCCCGTGGGCGTTC ACGGTGGTTTCCATGCGACG TGCAGAGGTGAAGCGAAGTG 8 12 12623 CAGTGGAAACAAAGCCACTG 135 46 0889 GGAAGTTCAAACTGAGAGAA GCCCACCACAAGTCTAGACT CTGTGGTATTGTGAGGATTT TTGTCAACAAGAAAAACCCC 9 X 3591 AGTATATCATCAGTTATTTT 124 47 1295 TCAAGGTTTTCTAAGTAAAC AGTTTCTCAACCTTTACCCC GTTGCTCGGCAACGGCCTGG TCTGTGCCAAGTGTTTGCTG 10 10 7539 TCAGGGAGGGGATGTTGACT 58 48 7400 GCATTTTGGAGGTTCAGGGC CTACTAACAACTGTGCCAGC AGCTCCTCCTCCTGCCTCCA CCAATCGGCAGTCAGGAAGG
[0045] Table 6 shows the target sequences identified in the ctDNA samples obtained from serum of Subject 3. Serum samples are obtained from Subject 3 at tumor excision. As shown, each sequence contains at least partial viral genome sequence (underlined) and partial host genome sequence and forms a viral-host junction.
TABLE-US-00006 TABLE6 JunctionDataofSubject3SerumSample Host Integra- Accu- SEQ Chromo- tion mulated ID # some# Position JunctionSequence Reads NO. 11 5 1295 GGAAATGGAGCCAGGCGCTC 153 49 930 CTGCTGGCCGCGCACCGGGC GCCTCACACCAGAACATCGC ATCAGGACTCCTAGGACCCC TGCTCGTGTTACAGGCGGGG 12 8 11163 TCAAGCAGAAAAACCATGAA 52 50 6420 GATTTAAAAACTTGTAAATA TTTGAATGTGGGCTCCACCC CAACAGTCCCCCGTGGGGAG GGGTGAACCCTGGCCCGAAT 13 21 4756 CCCGGGACCGACCCCAGGAA 27 51 5536 GAGCCAGGGGCCCGGGTGAT CCCTGCGGGGGTCTGGCTTT CAGTTATATGGATGATGTGG TATTGGGGGCCAAGTCTGTA 14 21 2857 AATGAAAATCTCATTGATTT 25 52 3066 TTCACTTATAGGTTTTACCT TAGAGCTCCTCCTCTGCCTA ATCATCTCATGTTCATGTCC TACTGTTCAAGCCTCCAAGC 15 7 8784 AGAATTGATACCTAAGCTGA 24 53 2849 GCAGAAATGAGGCCGACCAT GAAGTGAGTGCCTAATCATC TCATGTTCATGTCCTACTGT TCAAGCCTCCAAGCTGTGCC 16 7 14850 CGTAGGAAAGACAAGGTGGC 19 54 3201 ATTGATGGAAAGCAGTAGTT TTTGAGCCCTTCGCAGACGA AGGTCTCAATCGCCGCGTCG CAGAAGATCTCAATCTCGGG 17 1 16227 TTAAAAAGGAGTTTTGTTTG 16 55 7132 TTAGTCTATTCACTCATTTC AAGGAACATAGAAGAAGAAC TCCCTCGCCTCGCAGACGAA GGTCTCAATCGCCGCGTCGC 18 12 12504 CAGTTCCCTGGCTCCAAGCT 15 56 8731 CCCTCAAAAGATGCCCAGCT GGCCTTTCCCAAAGGCCTTG TAAGTTGGCGAGAAAGTAAA AGCCTGTTTTGCTTGTATAC 19 7 3041 ACATGCCCTTCACTTCAGCC 13 57 2226 TGATGCTCCTGGCATAAGCT CAGCAATTTTGGAGTGCGAA TCCACACTCCAAAAGACACC AAATATTCAAGAACAGTTTC 20 13 8450 AATTTCCCCTGAATAGCTGC 13 58 5952 AGTACTCACAGACACACTGG ATGCTACTCACCTCTGCCTA ATCATCTCATGTTCATGTCC TACTGTTCAAGCCTCCAAGC
[0046] As illustrated in Tables 5 and 6, similar patterns of viral-host junction sequences identified in both tumor DNA and ctDNA show that ctDNA in serum is derived from tumor DNA. By selectively enriching the target ctDNA in the serum, viral-host junctions are identified to provide tumor-specific information about the subject.
[0047]
TABLE-US-00007 (viralgenomesequenceunderlined;) SEQIDNO.59 GGTCTTACATAAGAGGACTCAGAAAATACTTTGTGATGAT,
Subject 4
[0048]
TABLE-US-00008 (SEQIDNO.60) ACTTCAAAGACTGTGTGTTTCTAATTATTTTGGGGGACAT,
and Subject 5
[0049]
TABLE-US-00009 (SEQIDNO.61) GTAGGCATAAATTGGTCTGTACCTCACTTCCCTGCTTTCC.
The presence of the three specific viral-host junctions is determined in the tumor gDNA (T) and non-tumor gDNA (N). Porphobilinogen deaminase (PBGD) and miR-122 are used as internal control. No-template control (NTC) is also included. As illustrated in
[0050]
[0051]
[0052] After accumulating read number of junction sequences in patient's serum or plasma in Table 2, 4 and 6, the junction sequence in each of the subjects with the most read number are selected to analyze the concentration of specific viral-host junction sequence in patient's serum or plasma. The specific viral-host junction sequence, and its' read number are identified in Subject 8, 9 and 10 by similar methods conducted in Table 2, 4 and 6. To determine the concentration of the specific viral-host sequence in Subject 8, 9 and 10, the plasma samples are analyzed with BIO-RAD Eva Green droplet digital PCR kit. Each samples diluted and the diluted samples are mixed with a reaction mix containing one or more fluorescence dyes and other reagents, the sample-reaction mix are then subjected QX2000 Droplet Generator to generation a plurality of small droplets. The droplets are then transferred to C1000 Touch Thermal Cycler to conduct polymerase chain reaction. Finally, the QX2000 Droplet Reader is used to quantify fluorescence signals presented. The fluorescence signal indicated by QX2000 Droplet Reader provides absolute quantifications of the specific viral-host junctions.
[0053]
[0054]
[0055]
[0056] Table 7 shows the information of Subject 8, 9 and 10, including the gender, age, tumor size and ctDNA junction concentration.
TABLE-US-00010 ctDNA junction Tumor size concentration Sample Gender Age (cm.sup.3) (copy number per ml) Subject 8 M 52 3 15.2 Subject 9 M 66 11.58 448 Subject 10 M 43 4.6 29
[0057] The above results from Subject 8, Subject 9 and Subject 10 suggest a relationship of tumor size. The copy number of specific viral-host junctions can be inferred. The tumor size is positively correlated to the concentration of ctDNA junctions. The larger tumor size represents higher copy number of ctDNA junction in patient's blood. The presence of ctDNA junction in patient's serum or plasma can be indicative of tumor status. Specifically, the copy number of ctDNA junction in patient's blood can be used to monitor the size of tumor within one patient when evaluating the prognosis after the surgical removal, radiotherapy, chemotherapy or other therapeutic approach on the tumor. The copy number of ctDNA junction in patient's blood can also be used to assess the size of tumor when diagnosing the tumor status. The ctDNA junction concentration in the plasma or serum of more than 30 copies per milliliter of plasma or serum may represent a tumor size of more than 4.6 cm.sup.3. The relationship between the ctDNA junction concentration in the plasma or serum and the tumor size may be an exponential function or a linear function.
[0058] The read number of ctDNA junction in patient's serum or plasma provides a non-invasive diagnosis for tumor. Therefore, the read number of ctDNA junction can be used to diagnose the presence of tumor to a patient with hepatitis-B virus infection, or to evaluate the tumor status before surgical removal, radiotherapy, chemotherapy of other therapeutic approach on the tumor. The read number of ctDNA junction in patient's blood can also be used to assess the presence and the size of tumor.
[0059] It is to be noted that by using the approach described in the present invention, mutated p53 or beta-catenin genes cannot be detected in the ctDNAs despite the mutations are identified in the tumor tissues (data not shown). The result shows that by using the method of present invention, tumor specific viral-host junctions (viral genome sequence insertion into host genome), and not conventional somatic mutations, are selectively enriched and obtained to provide cancer/tumor information.
[0060] The embodiments shown and described above are only examples. Many details are often found in the art for example the other features of a circuit board assembly. Therefore, many such details are neither shown nor described. Even though numerous characteristics and advantages of the present technology have been set forth in the foregoing description, together with details of the structure and function of the present disclosure, the disclosure is illustrative only, and changes may be made in the detail, including in matters of shape, size and arrangement of the parts within the principles of the present disclosure up to, and including the full extent established by the broad general meaning of the terms used in the claims. It will therefore be appreciated that the embodiments described above may be modified within the scope of the claims.