Activated reporter protein for the detection of infection in a biological sample

20230257734 · 2023-08-17

    Inventors

    Cpc classification

    International classification

    Abstract

    The invention relates to novel means and processes for the detection of a virus in a biological sample comprising cells infected by the virus. In particular, the invention relates to a fluorescent reporter protein designed as a recombinant inactive form of flipGFP suitable for specific activation by viral components in particular by viral proteins, such as viral protease, wherein the viral component recognizes a cleavage site inserted in the recombinant flipGFP. The fluorescent reporter protein is suitable for use in an in vitro method of detection of virus infection in a biological sample when the virus is related to the viral components activating the inactive form of flipGFP into an active fluorescent flipGFP in a biological sample, especially a sample comprising cells, in particular unaltered cells.

    Claims

    1. A nucleic acid construct which comprises an operon wherein the operon comprises (a) a recombinant transgene that encodes a recombinant inactive form of a fluorescent reporter protein, and a cleavage site for a viral protease, and (b) a nucleic acid coding for a detectable expression control protein, wherein the nucleic acid sequences of (a) and (b) are operably assembled in said operon under the control of a single promoter and optionally of additional control sequence(s) for transcription and/or translation and wherein the nucleic acid sequences of (a) and (b) are optionally separated by the sequence of a polyprotein separating site such as the sequence of the separating 2A-peptide originating from Thosea asigna virus capsid such as the sequence of SEQ ID No.147.

    2. The nucleic acid construct according to claim 1, wherein the recombinant transgene encoding the inactive form of the fluorescent reporter protein comprises a nucleotide sequence of an altered form of the Open Reading Frame (ORF) that encodes the active form of the fluorescent reporter protein and wherein said alteration in the ORF comprises switching position in the ORF of at least one nucleotide sequence encoding specific structure domains of the active form of the fluorescent protein to prevent assembly of the expressed structure domains as a functional protein enabling maturation of the chromophore and wherein the nucleotide sequence encoding the inactive form of the fluorescent reporter protein additionally comprises a nucleotide sequence encoding a cleavage site for a determined protease.

    3. The nucleic acid construct according to claim 2, wherein the nucleic acid coding for a detectable expression control protein is interposed within the sequence encoding the structural domains of the sequence of the active form of a fluorescent reporter protein.

    4. The nucleic acid according to any one of claims 1 to 3, wherein the active fluorescent reporter protein is an active flipGFP and the inactive fluorescent reporter protein is an inactive flipGFP.

    5. The nucleic acid construct according to claim 4, wherein the recombinant transgene comprises from 5′-end to 3′-end polynucleotides encoding the beta10 strand of flipGFP, a linker having from 3 to 15, in particular about 10, amino acid residues, the E5 domain of flipGFP, the beta11 strand of flipGFP, the cleavage site of the viral protease and the K5 domain of the flipGFP, the sequence of a polyprotein separating site such as the sequence of the separating 2A-peptide originating from Thosea asigna virus capsid, a polynucleotide encoding the beta1-9 strand of flipGFP wherein these polynucleotides together encode the inactive form of the recombinant flipGFP with the viral protease cleavage site.

    6. The nucleic acid construct according to claim 3, which comprises from 5′-end to 3′-end polynucleotides encoding the beta1-9 strand of flipGFP, the sequence of a polyprotein separating site such as the sequence of the separating 2A-peptide originating from Thosea asigna virus capsid, the nucleic acid coding for a detectable expression control protein, polynucleotides encoding the beta10 strand of flipGFP, a linker having from 3 to 15, in particular about 10, amino acid residues, the E5 domain of flipGFP, the beta11 strand of flipGFP, the cleavage site of the viral protease and the K5 domain of the flipGFP, wherein these polynucleotides together encode the inactive form of the recombinant flipGFP with the viral protease cleavage site and the detectable expression control protein.

    7. The nucleic acid construct according to any one of claims 1 to 6, wherein the transgene further comprises upstream from the sequence encoding the inactive fluorescent reporter protein in particular the inactive flipGFP, a sequence encoding a signal peptide, in particular a signal peptide for retention of the expressed polypeptides in the endoplasmic reticulum (ER retention signal) or a signal peptide for targeting the expressed polypeptides to the cell membrane (membrane targeting signal), especially a Cytochrome P450 ER retention signal of sequence MDPVVVLGLCLSCLLLLSLWQSHGGGK (SEQ ID No.105) or a membrane targeting signal of sequence MGCCFSKT (SEQ ID No.107)..

    8. The nucleic acid construct according to any one of claims 1 to 6, which further comprises operably associated with the sequence encoding the inactive fluorescent reporter protein in particular the inactive flipGFP, a sequence encoding a peptide for retention of the expressed part of the fluorescent reporter polypeptide associated thereto in the endoplasmic reticulum (ER retention signal) or for targeting the expressed polypeptides into the ER membrane or the cell membrane.

    9. The nucleic acid construct according to claim 7 or 8, which comprises a nucleotide sequence encoding the HCV Core TM peptide that is the sequence of SEQ ID No.152) or encoding the HIV Vpu peptide that is the sequence SEQ ID No.153.

    10. The nucleic acid construct according to claim 7, wherein the polynucleotide encoding the signal peptide contains or consists of the sequence of ATGGACCCCGTGGTGGTGCTGGGCCTGTGCCTGAGCTGCCTGCTGCTGCTGAG CCTGTGGAAGCAGAGCCACGGCGGCGGCAAG (SEQ ID No.104) encoding the Cytochrome P450 ER retention signal peptide or contains or consists of the sequence of ATGGGCTGCTGCTTCAGCAAGACC (SEQ ID No.106) encoding the membrane targeting signal peptide.

    11. The nucleic acid construct according to claim 9, wherein the polynucleotide encoding the transmembrane peptide contains or consists of the sequence of SEQ ID No.152 encoding the HCV Core TM peptide for retention into the ER membrane, or the sequence of SEQ ID No.153 encoding the HIV Vpu peptide and this sequence is operably associated with the inactive flipGFP.

    12. The nucleic acid construct according to any one of claims 1 to 11, wherein the expression control protein is a fluorescent protein with a fluorophore of a color that is different from the color of the reporter fluorescent protein, in particular is mCherry protein or mTurquoise protein or cyan fluorescent protein (ECFP), yellow fluorescent protein such as mVenus, in particular the nucleic acid construct contains the polynucleotide of SEQ ID No.146 coding for mCherry.

    13. The nucleic acid construct according to any one of claims 1 to 12, wherein the cleavage site is recognized by a protease of a determined virus family selected in the group of Alphaviruses, Coronaviruses, Enteroviruses, Retroviruses and Flaviviruses.

    14. The nucleic acid construct according to any one of claims 1 to 13, wherein the nucleic acid sequence for the protease cleavage site encodes an amino acid sequence selected from the group of ITTLGKFGQ (SEQ ID No.126) for the Enterovirus 2A protease, EALFQGPK (SEQ ID No.127) or SYFASEQGEIQWV (SEQ ID No.128) for the Enterovirus 3C protease, RAGAYIFS (SEQ ID No.129) for Alphaviruses, RELNGGAYTRYV (SEQ ID No.130), FTLKGGAPTKVT (SEQ ID No.131), IALKGGKIVNNW (SEQ ID No.132), TSAVLQSGFRKM (SEQ ID No.133), KVATVQSKMSDV (SEQ ID No.134), SAVKLQNNELSP (SEQ ID No.135), ATVRLQAGNATE (SEQ ID No.136), REPMLQSADAQS (SEQ ID No.137), SGVTFQSAVKRT (SEQ ID No.138) for SARS-CoV-2 coronavirus, SGVTFQGKFKK (SEQ ID No.139) for SARS virus coronavirus, YAKRGGVF (SEQ ID No140) for Flaviviruses, and more particularly KERKRRGADTSI (SEQ ID No.141), TRSGKRSWPPSE (SEQ ID No.142), EPEKQRSPQDNQ (SEQ ID No.143), GLVKRRGGGTGE (SEQ ID No.144) for ZIKA viruses.

    15. The nucleic acid construct according to any one of claims 1 to 13, wherein the polynucleotide encoding the viral protease cleavage site consists of a polynucleotide selected from the group of: GAGGACAGGGCCGGCGCCGGCATCATCGAGACCCCC for CHKV-1 (SEQ ID No.108) or GCCACCAGGGCCGGCTGCGCCCCCAGCTACAGGGTG for CHKV-2 (SEQ ID No.109) or CTGGACAGGGCCGGCGGCTACATCTTCAGCAGCGAC for CHKV-3 (SEQ ID No.110) or, ATCACCACTCTTGGGAAATTTGGACAA for EV71_2A (SEQ ID No.111) or AGCTACTTCGCCAGCGAGCAGGGCGAGATCCAGTGGGTG for EV71_3C (SEQ ID No.112) or, AGGGAGCTGAACGGCGGCGCCTACACCAGGTACGTG for SARSCoV2_1 (SEQ ID No.113) or TTCACCCTGAAGGGCGGCGCCCCCACCAAGGTGACC for SARSCoV2_2 (SEQ ID No.114) or ATCGCCCTGAAGGGCGGCAAGATCGTGAACAACTGG for SARSCoV2_3 (SEQ ID No.115) or ACCAGCGCCGTGCTGCAGAGCGGCTTCAGGAAGATG for SARSCoV2_4 (SEQ ID No.116) or AGCGGCGTGACCTTCCAGAGCGCCGTGAAGAGGACC for SARSCoV2_5 (SEQ ID No.117) or AAGGTGGCCACCGTGCAGAGCAAGATGAGCGACGTG for SARSCoV2_6 (SEQ ID No.118) or AGCGCCGTGAAGCTGCAGAACAACGAGCTGAGCCCC for SARSCoV2_7 (SEQ ID No.119) or GCCACCGTGAGGCTGCAGGCCGGCAACGCCACCGAG for SARSCoV2_8 (SEQ ID No.120) or AGGGAGCCCATGCTGCAGAGCGCCGACGCCCAGAGC for SARSCoV2_9 (SEQ ID No.121) or, AAGGAGAGGAAGAGGAGGGGCGCCGACACCAGCATC for ZIKV_1 (SEQ ID No.122) or ACCAGGAGCGGCAAGAGGAGCTGGCCCCCCAGCGAG for ZIKV_2 (SEQ ID No.123) or GAGCCCGAGAAGCAGAGGAGCCCCCAGGACAACCAG for ZIKV_3 (SEQ ID No.124) or GGCCTGGTGAAGAGGAGGGGCGGCGGCACCGGCGAG for ZIKV_4 (SEQ ID No.125).

    16. The nucleic acid construct according to any one of claims 4 to 15, wherein the flipGFP is encoded by the sequence of SEQ ID No.145 and optionally wherein the sequence of the cleavage site for the protease is inserted at position 262 in said sequence.

    17. The nucleic acid construct according to any one of claims 1 to 16, wherein the polynucleotide that encodes the inactive recombinant flipGFP is selected from the group of SEQ ID No 47 to SEQ ID No.100, SEQ ID No. 150 and SEQ ID No.151 respectively for flipGFP_CHKV_1, flipGFP_CHKV_2, flipGFP_CHKV_3, flipGFP_ER_CHKV_1, flipGFP_ER_CHKV_2, flipGFP_ER_CHKV_3, flipGFP_ER_EV71_2A, flipGFP_ER_EV71_3C, flipGFP_ER_SARSCoV2_1, flipGFP_ER_SARSCoV2_2, flipGFP_ER_SARSCoV2_3, flipGFP_ER_SARSCoV2_4, flipGFP_ER_SARSCoV2_5, flipGFP_ER_SARSCoV2_6, flipGFP_ER_SARSCoV2_7, flipGFP_ER_SARSCoV2_8, flipGFP_ER_SARSCoV2_9, flipGFP_ER_ZIKV_1, flipGFP_ER_ZIKV_2, flipGFP_ER_ZIKV_3, flipGFP_ER_ZIKV_4, flipGFP_EV71_2A, flipGFP_EV71_3C, flipGFP_Membrane_CHKV_1, flipGFP_Membrane_CHKV_2, flipGFP_Membrane_CHKV_3, flipGFP_Membrane_EV71_2A, flipGFP_Membrane_EV71_3C, flipGFP_Membrane_SARSCoV2_1, flipGFP_Membrane_SARSCoV2_2, flipGFP_Membrane_SARSCoV2_3, flipGFP_Membrane_SARSCoV2_4, flipGFP_Membrane_SARSCoV2_5, flipGFP_Membrane_SARSCoV2_6, flipGFP_Membrane_SARSCoV2_7, flipGFP_Membrane_SARSCoV2_8, flipGFP_Membrane_SARSCoV2_9, flipGFP_Membrane_ZIKV_1, flipGFP_Membrane_ZIKV_2, flipGFP_Membrane_ZIKV_3, flipGFP_Membrane_ZIKV_4, flipGFP_SARSCoV2_1, flipGFP_SARSCoV2_2, flipGFP_SARSCoV2_3, flipGFP_SARSCoV2_4, flipGFP_SARSCoV2_5, flipGFP_SARSCoV2_6, flipGFP_SARSCoV2_7, flipGFP_SARSCoV2_8, flipGFP_SARSCoV2_9, flipGFP_ZIKV_1, flipGFP_ZIKV_2, flipGFP_ZIKV_3, flipGFP_ZIKV_4, flipGFP-SARS2-HCVCore, and flipGFP-SARS2-HIVVpu.

    18. The recombinant nucleic acid according to any one of claims 1 to 14, which is selected from the group of pLentiPuro_flipGFP_ER_3C (SEQ ID No.101), pLentiPuro_flipGFP_Membrane_3C (SEQ ID No.102), and pLentiPuro-flipGFP-3C (SEQ ID No.103).

    19. The recombinant nucleic acid according to any one of claims 1, 8, 9 or 11 to 18, which is selected from the group of pLentiPuro-flipGFP-SARS2-HCVCore (SEQ ID No.150), and pLentiPuro-flipGFP-SARS2-HIVVpu (SEQ ID No.151).

    20. A set of at least two nucleic acid constructs according to any one of claims 1 to 19.

    21. The nucleic acid construct according to any one of claims 1 to 20, wherein the promoter of the transgene is active in cells selected from the group of prokaryotic cells, in particular in bacterial cells or in archaeal cells, and/or is active in eukaryotic cells, in particular in mammalian cells or in insect cells, in particular the promoter is the CMV promoter.

    22. A transformation vector which comprises a nucleic acid construct according to any one of claims 1 to 21, in particular a vector which is a plasmid for transfection or for transduction, especially a lentiviral vector plasmid.

    23. A cell which is a prokaryotic or a eukaryotic cell or cell line transformed with the nucleic acid construct according to any one of claims 1 to 130or with a transformation vector according to claim 22, in particular a stable cell line, especially a stable cell line transduced with lentiviral vector particles expressing a nucleic acid construct according to any one of claims 1 to 21.

    24. The cell according to claim 23, which is a cell selected for its sensibility to infection by a determined human virus targeting the cleavage site of the inactive fluorescent protein expressed in the cell, wherein the cell is optionally a stable cell line.

    25. The cell according to claim 23 or 24, which is a cell line stably expressing the nucleic acid construct according to any one of claims 1 to 21, wherein the nucleic acid construct is inserted in the genome of the cell.

    26. A polynucleotide encoding a viral protease cleavage site which is selected from the group of GAGGACAGGGCCGGCGCCGGCATCATCGAGACCCCC for CHKV-1 (SEQ ID No.108) or GCCACCAGGGCCGGCTGCGCCCCCAGCTACAGGGTG for CHKV-2 (SEQ ID No.109) or CTGGACAGGGCCGGCGGCTACATCTTCAGCAGCGAC for CHKV-3 (SEQ ID No.110) or, ATCACCACTCTTGGGAAATTTGGACAA for EV71_2A (SEQ ID No.111) or AGCTACTTCGCCAGCGAGCAGGGCGAGATCCAGTGGGTG for EV71_3C (SEQ ID No.112) or, AGGGAGCTGAACGGCGGCGCCTACACCAGGTACGTG for SARSCoV2_1 (SEQ ID No.113) or TTCACCCTGAAGGGCGGCGCCCCCACCAAGGTGACC for SARSCoV2_2 (SEQ ID No.114) or ATCGCCCTGAAGGGCGGCAAGATCGTGAACAACTGG for SARSCoV2_3 (SEQ ID No.115) or ACCAGCGCCGTGCTGCAGAGCGGCTTCAGGAAGATG for SARSCoV2_4 (SEQ ID No.116) or AGCGGCGTGACCTTCCAGAGCGCCGTGAAGAGGACC for SARSCoV2_5 (SEQ ID No.117) or AAGGTGGCCACCGTGCAGAGCAAGATGAGCGACGTG for SARSCoV2_6 (SEQ ID No.118) or AGCGCCGTGAAGCTGCAGAACAACGAGCTGAGCCCC for SARSCoV2_7 (SEQ ID No.119) or GCCACCGTGAGGCTGCAGGCCGGCAACGCCACCGAG for SARSCoV2_8 (SEQ ID No.120) or AGGGAGCCCATGCTGCAGAGCGCCGACGCCCAGAGC for SARSCoV2_9 (SEQ ID No.121) or, AAGGAGAGGAAGAGGAGGGGCGCCGACACCAGCATC for ZIKV_1 (SEQ ID No.122) or ACCAGGAGCGGCAAGAGGAGCTGGCCCCCCAGCGAG for ZIKV_2 (SEQ ID No.123) or GAGCCCGAGAAGCAGAGGAGCCCCCAGGACAACCAG for ZIKV_3 (SEQ ID No.124) or GGCCTGGTGAAGAGGAGGGGCGGCGGCACCGGCGAG for ZIKV_4 (SEQ ID No.125).

    27. Use of a nucleic acid construct according to any one of claims 1 to 21 or of a vector according to claim 22, for in vitro detection and optionally quantification of a viral infection in a biological sample of a human or animal subject, wherein the detection targets a virus recognizing the cleavage site recombined in the inactive fluorescent reporter protein.

    28. Use of a cell line according to claim 25, for in vitro detection and optionally quantification of a viral infection in a biological sample of a human or animal subject, wherein the detection targets a virus recognizing the cleavage site recombined in the inactive fluorescent reporter protein.

    29. The use according to claim 27 or 28, wherein fluorescence of the reporter protein is detected or measured directly on the cells.

    30. An in vitro method of detecting or monitoring a pathogen infection, in particular a virus infection, in a biological sample previously obtained from a human or an animal subject, which comprises the steps of: a. Providing cells according to claim 23 to 25 that express either transiently or stably an inactive fluorescent reporter protein comprising a cleavage site for a protease of the virus to be detected, b. Contacting said cells with the assayed biological sample in conditions that enable the virus when present in the sample, to infect the cells and the viral protease to cleave the protease cleavage site, c. Allowing fluorescence to increase in the cells that have been contacted with the biological sample in b., following activation of the inactive fluorescent reporter protein by cleavage in step b. of the viral protease cleavage site and measuring said fluorescence of the reporter protein, d. Optionally comparing the fluorescence level of the active fluorescent reporter protein to a standard or fluorescent control protein expressed in the cells and optionally concluding on virus infectious activity in the subject and/or quantitating the virus.

    31. An in vitro method of detecting or monitoring a virus infection, in a biological sample previously obtained from a human or an animal subject, which comprises the steps of: a. Providing an inactive fluorescent protein as a reporter protein expressed from the nucleic acid construct of any one of claims 1 to 21, together with a control protein expressed from the same nucleic acid construct wherein the inactive fluorescent reporter protein comprises a cleavage site for a protease of the virus to be detected, b. Contacting said inactive fluorescent reporter protein with the assayed biological sample in conditions that enable the virus protease to target and to cleave the protease cleavage site in the inactive fluorescent reporter protein, c. Allowing fluorescence to increase in the biological sample following activation of the inactive fluorescent reporter protein by cleavage in step b. and measuring said fluorescence, d. Optionally comparing the fluorescence level to a standard or to the fluorescence of the control protein and optionally concluding on the virus infectious activity in the subject and/or quantitating the virus.

    32. A method of any one of claims 30 or 31 of detecting or monitoring a virus infection, in a biological sample previously obtained from a human or an animal subject, wherein the virus is a determined virus selected in the group of Alphaviruses, Coronaviruses, Enteroviruses, Retroviruses and Flaviviruses, in particular is a coronavirus, especially is SARS-CoV-2 (or SARS-2) responsible for Covid-19.

    33. A method according to any one of claims 30 to 32, wherein detection is carried out as soon as 24 hours, in particular as soon as 14 hours, more particularly within a range of 8 to 24 hours, following suspicion of infection.

    34. A laboratory animal for experimental or clinical observation of the response to a virus infection, wherein the animal has been transformed to enable its genome to express a nucleic acid construct according to any one of claims 1 to 21 either transiently or stably, the animal being in particular a rodent, an insect or a non-human mammal.

    Description

    LEGEND OF THE FIGURES

    [0086] FIG. 1: sequence alignment for GFP

    [0087] FIG. 2: nucleotide sequence and encoded amino acid sequence for pLentiPuro_flipGFP_3C

    [0088] FIG. 3: nucleotide sequence and encoded amino acid sequence for pLentiPuro_flipGFP_ER_3C

    [0089] FIG. 4: nucleotide sequence and encoded amino acid sequence for pLentiPuro_flipGFP_Membrane_3C

    [0090] FIGS. 5 to 7: flipGFP_Schematics

    [0091] FIG. 8: flipGFP by Transfection and Transfected Protease - flipGFP detection of SARS CoV-2 protease:

    [0092] HEK293T cells were seeded in 24 well plates. Cells were transfected with the SARS-CoV-2 cleavage-site 3 ER Retention signal peptide flipGFP construct alone (Mock) or together with a plasmid expressing the Nsp5 protease (+Nsp5). Cells were incubated for 24 hours, before fixing with 4% Formalin. Cells were imaged using an EVOS FL microscope.

    [0093] FIG. 9: flipGFP by Transfection and Virus Infection - flipGFP detection of chikungunya virus infection, enterovirus 71 infection, CHKV virus infection, SARS CoV-2 infection, ZIK virus infection: Vero E6 cells were seeded in 24 well plates. Cells were transfected with the individual flipGFP constructs. After 24 hours cells were infected with either enterovirus 71 (EV71), chikungunya virus (CHKV), SARS-CoV-2 (BetaCoV/France/IDF0372/2020, SARS-CoV-2) or Zika virus (ZIKV) at an MOI = 1 (SARS2). Cells were incubated for a further 48 hours, before fixing with 4% Formalin. Cells were imaged using an EVOS FL microscope.

    [0094] FIG. 10: flipGFP expressed in Stable Cell Line and Infected- Stable cell line FlipGFP detection of SARS Cov-2 infection - A549 cells overexpressing Ace2 were transduced using a lentivirus encoding the SARS-CoV-2 flipGFP Membrane 3 construct. Cells were selected using puromycin, followed by flow-cytometry sorting, gating for high mCherry and GFP negative expressing cells. Cells were seeded in 24 well plates. Cells were infected with SARS-CoV-2 (BetaCoV/France/IDF0372/2020) at an MOI = 1 (SARS2) or left uninfected (Mock). Cells were incubated for 48 hours, before fixing with 4% Formalin. Cells were imaged using an EVOS FL microscope.

    [0095] FIG. 11: A: Map of lentiviral plasmid pLentiPuro-flipGFP-SARS2-HCVCore (encompassing the sequence of flipGFP-SARS2-HCVCore) B: detail of the transgene C: nucleotide sequence of the insert of flipGFP-SARS2-HCVCore: SEQ ID No.150. In this figure the segment designated as flipGFP (SEQ ID No. 149) refers to the subdomains of GFP that have been switched (flipped) from their position in the native GFP and also contain the sequence of the protease cleavage site. More accurately flipGFP is featured by this segment together with the GFP1-9 segment (SEQ ID No. 148).

    [0096] FIG. 12: A: Map of lentiviral plasmid pLentiPuro-flipGFP-SARS2-HIVVpu (encompassing the sequence of flipGFP-SARS2-HIVVpu) B: detail of the transgene C: nucleotide sequence of the insert of flipGFP-SARS2-HIVVpu: SEQ ID No.151. In this figure the segment designated as flipGFP refers to the subdomains of GFP that have been switched (flipped) from their position in the native GFP and also contain the sequence of the protease cleavage site (SEQ ID No. 149). More accurately flipGFP is featured by this segment together with the GFP1-9 segment (SEQ ID No. 148).

    [0097] FIG. 13: VeroE6 cells were transfected with flipGFP-SARS2-10-HCV plasmids. After 24 hours cells were trypsinised and seeded on .Math.-slides (ibidi). After further 24 hours cells were either Mock infected or infected with SARS-CoV-2 for 1 hour, before replacing the media. The samples were incubated and imaged in a Nikon Biostation with 5% CO.sub.2 at 37° C. Images were taken every 20 minutes over 48 hours using 20x magnification and 488 nm (GFP), as well as phase contrast channels. Subsequently still images were isolated in ImageJ software at the indicated times. Contrast of the images were inverted. Scale bars (black) represent 50 .Math.m.

    [0098] FIG. 14: VeroE6 cells transfected with flipGFP-SARS2-10-HCV were infected with SARS-CoV-2 and imaged under life-cell conditions every 20 minutes for 48 hours using a Nikon Biostation at 20x magnification. Images from the GFP channel were processed in ImageJ and the signal was inverted. The earliest detection of a GFP signal from SARS-CoV-2 infected cells was at around 9.6 hours post infection. The development of the reporter cell signal was stable and increased over time. Times of GFP channel images are shown. White arrows indicate cells with GFP signal. Black scale bars represent 50 .Math.m.

    EXPERIMENTAL DATA

    Methods

    Cells and Viruses

    [0099] Vero E6 (Vero 76, clone E6, Vero E6, ATCC® CRL-1586TM) were maintained in a humidified atmosphere at 37° C. with 5% CO.sub.2, in Dulbecco’s modified Eagle’s medium (DMEM, Life Technologies) containing 10% (v/v) fetal bovine serum (FBS, Life Technologies). ACE2-expressing A549 cells, a human lung epithelial cell line (Institut Pasteur; Bouhaddou M. et al - The Global Phosphorylation landscape of SARS-CoV-2 infection https://doi.org/10.1016/j.cell.2020.06.034). A549-ACE2 cells were cultured in DMEM supplemented with 10% (v/v) FBS and maintained at 37° C. with 5% CO.sub.2. HEK293T cells were DMEM supplemented with 10% (v/v) FBS and maintained at 37° C. with 5% CO.sub.2.

    [0100] SARS-CoV-2 (BetaCoV/France/IDF0372/2020 isolate) was supplied through the European Virus Archive goes Global (EVAg) platform. The virus was propagated in Vero E6 cells in DMEM supplemented with 2% FBS.

    [0101] Enterovirus 71 (EV71) Sep006 isolate (GenBank: KX197462.1) was generated using plasmid-based reverse genetics system. For this, the virus sequence was synthesized and cloned into a pCAGGS plasmid that flanked the virus genome sequence with a hammer-head ribozyme (HHRz) at the 5′ end and a hepatitis delta ribozyme (HdRz) sequence at the 3′ end.

    [0102] Zika virus used for the identification of DVGs is the African strain MR766 of Zika virus (ZIKV). Accession number for this strain is Genbank # LC002520.1.

    [0103] Chikungunya virus was generated from CHIKV infectious clones derived from the Indian Ocean lineage, ECSA genotype. Accession number for such strain SZ 1050 of this lineage is Genbank # MG664850.1.

    [0104] All experiments with live viruses were performed in compliance with Institut Pasteur Paris’s guidelines for Biosafety Level 3 (BSL-3) containment procedures in approved laboratories. All experiments were performed in at least three biologically independent samples.

    flipGFP Plasmids

    [0105] The template for the flipGFP plasmids was kindly provided by Xiaokun Shu (University of California, San Francisco) (Zhang et al., 2019). The lentivial plasmid pLenti-puro (Guan et al., 2011) was acquired through Addgene (#39481). The plasmid and flipGFP insert were linearized by PCR (see Table 1 for primer sequence). The two fragments were joined using InFusion cloning (Clonetech) according to the manufacturer’s instructions. The plasmid was confirmed by Sanger sequencing. The nucleotide sequences for the individual protease cleavage sites were changed by single PCR reactions (Q5, Thermo Scientific) (see Table 1 for primers). The correctness of the inserted sequences was confirmed by Sanger sequencing.

    [0106] The membrane signal peptide was inserted by a single PCR reaction (Q5, Thermo Scientific) (see Table 1 for primers) before changing the nucleotide sequences for the individual protease cleavage sites by PCR.

    [0107] The endoplasmatic reticulum (ER) retention signal peptide was inserted by linearizing the pLenti-puro-flipGFP plasmid (see Table 1 for primers) and inserting the nucleotide sequence of the ER retention signal peptide using annealed oligonucleotides by InFusion cloning (Clonetech) according to the manufacturer’s instructions. The correct insertion of the peptide sequence was confirmed by Sanger sequencing. The individual protease cleavage sites were altered by PCR (Q5, Thermo Scientific).

    Transient flipGFP Expression

    [0108] HEK293T or Vero E6 cells were seeded on cover slips. The following day cells were transfected with individual flipGFP plasmids using TransIT-LT1 (Mirus) according to the manufacturer’s instructions. For infections, transfected cells were incubated for 24 hours prior to infections. To assess the activity of individual SARS-CoV-2 flipGFP constructs, flipGFP plasmids were co-transfected with either pLVX-EF1alpha-nCoV2019-nsp5-2xStrep-IRES-Puro or pLVX-EF1alpha-nCoV2019-nsp5-C145A-2xStrep-IRES-Puro (kindly provided by Nevan Krogan, University of California, San Francisco, Gordon et al., 2020) for the expression of the viral protease.

    Stable flipGFP Expression Cells

    [0109] Individual flipGFP constructs in pLenti-puro were transfected together with pAX2 (Proc Natl Acad Sci U S A. 2011 Jun 21;108(25):10343-8. Epub 2011 Jun 6. This plasmid is available through Addgene) and pCMV-VSV-G (published in RNA 2003 Apr;9(4):493-501. This plasmid is available through Addgene) in HEK293T. Lentiviruses were harvested 48 hours post transfection and supernatants were cleared from cellular debris by centrifugation. A549-Ace2 cells were seeded and individual lentivirus containing the flipGFP constructs were used to transduce the target cells using 6 .Math.g/ml Diethylaminoethyl (DEAE)-dextran (Sigma) together with centrifugation using 1,000x g for 1 hour. After two days of incubation, cells containing flipGFP were selected using 10 .Math.g/ml puromycin (Sigma). To obtain a homogeneous population of flipGFP expressing cells, cells were FACS-sorted using mCherry expression as marker for the presence and expression of flipGFP.

    Detection of Viral Protease and Virus Infection in Transfected Cells and in Stable Cells

    [0110] The protocol for transfection and infection of cells has been described in the legends of the figures and the results of the detection are provided on FIGS. 8, 9 and 10.

    [0111] This protocol was repeated with VeroE6 cells that were transfected with flipGFP-SARS2-10-HCV plasmids. These cells were then either infected with SARS-CoV-2 or Mock infected and followed over time in 12-hour intervals to give an overview (FIG. 13). Alternatively, the GFP channel alone was followed at shorter time intervals to focus on early detection (FIG. 14). The white arrows show GFP signal of these cells. These experiments show that virus infected cells can be detected early after infection, especially as early as about 9 hours post infection and that in average such detection may be achieved 16 hours after infection.

    [0112] The results show that the use of flipGFP for the detection of virus infections work in principle. The activation and generation of a GFP signal is achieved by cleavage events during viral infection, wherein the events are carried out by the viral protease. The specificity and sensitivity of the protease cleavage peptide can be altered by its sequence as well as overall accessibility of the peptide to the viral protease. Many viral proteases appear to be located around replication sites, where the enzymes process the viral polyproteins into functional peptides. Accordingly, at the same time, this allows for targeting of these specific replication sites for a viral reporter system, such as the use of flipGFP. It was possible to identify the endoplasmatic reticulum (ER) as such an important site inside the cell, which brings the reporter and the viral protease in close proximity for cleavage and subsequent activation.

    [0113] Location of the flipGFP reporter to subcellular compartments, such as the ER, can be achieved by either the use of signal peptides that transport the reporter to this site and retain it there, or by the use of transmembrane peptides of known proteins to be inserted in individual organelle membrane, such as using the HCV Core transmembrane domain for the targeting and insertion into the ER membrane. The cleaved flipGFP peptide can disperse in the cell and bind to the GFP1-10 peptide to form an active, fluorescent GFP protein.

    [0114] The flipGFP reporter can be cleaved upon the recognition by viral proteases. This can occur within few hours following viral infection. However, this signal detection depends on sensitivity of the system used, such as sensitivity of microscopes. The detection can be mitigated by the increase of the overall expression level of flipGFP in the cell. Thus, the detection of GFP can be shifted to earlier time points, when more of the reporter flipGFP is present in the cell. At the same time, more sensitive systems than the microscope used, including flow cytometry methods, may achieve superior results and limits of detection. The experiments show that detection is possible within a few hours after infection, here as early as about 9 hours post infection with SARS-CoV-2 in VeroE6 cells. The signal in early to mid-stages of infection seem to be specific to the viral protease and can easily be differentiated from control samples. Late events during infection, such as the induction of apoptosis, and therefore activation of caspases, seem to be able to activate the flipGFP reporter in a rather unspecific manner. This allows for a time window of specific flipGFP activation of at least several hours, in VeroE6 cells infected with SARS-CoV-2 such as about 8-14 hours. However, this could be mitigated in systems that either use cells that do not easily undergo apoptosis during viral infection or by the use of molecules that prevent caspase activation.

    TABLE-US-00001 SEQ ID No. and Name 5′-3′ Sequence Purpose 1.pLenti-Puro Linear For GACGAGCTGTACAAGTAATAACACATCGACAATCAACCTCTGG Linearizing plasmid for InFusion Cloning to insert flipGFP 2.pLenti-Puro Linear Rev CGTCAGGCAGGTCCATGGATCCCGTACGCCCGGGCGGTGTC Linearizing plasmid for InFusion Cloning to insert flipGFP 3.flipGFP For ATGGACCTGCCTGACGACCACTAC Preparing flipGFP for Infusion Cloning 4.flipGFP Rev TTACTTGTACAGCTCGTCCATGCC Preparing flipGFP for Infusion Cloning 5.Membrane For GCTGCTGCTTCAGCAAGACCATGGACCTGCCTGACGACCACTAC Inserting Membrane Signal Peptide into pLenti-Puro plasmid 6.Membrane Rev CTTGCTGAAGCAGCAGCCCATGGATCCCGTACGCCCGGGCGG Inserting Membrane Signal Peptide into pLenti-Puro plasmid 7.pLenti-Puro ER For CACGGCGGCGGCAAGATGGACCTGCCTGACGACCAC Linearizing plasmid for InFusion Cloning to insert ER Retention Signal Peptide 8.pLenti-Puro ER Rev CCACCACGGGGTCCATGGATCCCGTACGCCCGGGCGG Linearizing plasmid for InFusion Cloning to insert ER Retention Signal Peptide 9.ER Signal For ATGGACCCCGTGGTGGTGCTGGGCCTGTGCCTGAGCTGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCCACGGCGGCGGCAAG Inserting ER Retention Signal Peptide into pLenti-Puro plasmid 10.ER Signal Rev CTTGCCGCCGCCGTGGCTCTGCTTCCACAGGCTCAGCAGCAGCAGGCAGCTCAGGCACAGGCCCAGCACCACCACGGGGTCCAT Inserting ER Retention Signal Peptide into pLenti-Puro plasmid 11.SARS2-1 For CGGCGGCGCCTACACCAGGTACGTGAAGGTGTCCGCCCTGAAGGAAAAAG Generation of flipGFP recognized by SARS-CoV-2 Nsp3 protease 12.SARS2-1 Rev GGTGTAGGCGCCGCCGTTCAGCTCCCTTGATGCATCGGTAATGCCAGCCGC Generation of flipGFP recognized by SARS-CoV-2 Nsp3 protease 13.SARS2-2 For GGGCGGCGCCCCCACCAAGGTGACCAAGGTGTCCGCCCTGAAGGAAAAAG Generation of flipGFP recognized by SARS-CoV-2 Nsp3 protease 14.SARS2-2 Rev GGTGGGGGCGCCGCCCTTCAGGGTGAATGATGCATCGGTAATGCCAGCCGC Generation of flipGFP recognized by SARS-CoV-2 Nsp3 protease 15.SARS2-3 For GAAGGGCGGCAAGATCGTGAACAACTGGAAGGTGTCCGCCCTGAAGGAAAAAG Generation of flipGFP recognized by SARS-CoV-2 Nsp5 protease 16.SARS2-3 Rev CACGATCTTGCCGCCCTTCAGGGCGATTGATGCATCGGTAATGCCAGCCGC Generation of flipGFP recognized by SARS-CoV-2 Nsp5 protease 17.SARS2-4 For GTGCTGCAGAGCGGCTTCAGGAAGATGAAGGTGTCCGCCCTGAAGGAAAAAG Generation of flipGFP recognized by SARS-CoV-2 Nsp5 protease 18.SARS2-4 Rev CTGAAGCCGCTCTGCAGCACGGCGCTGGTTGATGCATCGGTAATGCCAGCCGC Generation of flipGFP recognized by SARS-CoV-2 Nsp5 protease 19.SARS2-5 For GACCTTCCAGAGCGCCGTGAAGAGGACCAAGGTGTCCGCCCTGAAGGAAAAAG Generation of flipGFP recognized by SARS-CoV-2 Nsp5 protease 20.SARS2-5 Rev CACGGCGCTCTGGAAGGTCACGCCGCTTGATGCATCGGTAATGCCAGCCGC Generation of flipGFP recognized by SARS-CoV-2 Nsp5 protease 21.SARS2-6 For CACCGTGCAGAGCAAGATGAGCGACGTGAAGGTGTCCGCCCTGAAGGAAAAAG Generation of flipGFP recognized by SARS-CoV-2 Nsp5 protease 22.SARS2-6 Rev CATCTTGCTCTGCACGGTGGCCACCTTTGATGCATCGGTAATGCCAGCCGC Generation of flipGFP recognized by SARS-CoV-2 Nsp5 protease 23.SARS2-7 For GAAGCTGCAGAACAACGAGCTGAGCCCCAAGGTGTCCGCCCTGAAGGAAAAAG Generation of flipGFP recognized by SARS-CoV-2 Nsp5 protease 24.SARS2-7 Rev GCTCGTTGTTCTGCAGCTTCACGGCGCTTGATGCATCGGTAATGCCAGCCGC Generation of flipGFP recognized by SARS-CoV-2 Nsp5 protease 25.SARS2-8 For GAGGCTGCAGGCCGGCAACGCCACCGAGAAGGTGTCCGCCCTGAAGGAAAAAG Generation of flipGFP recognized by SARS-CoV-2 Nsp5 protease 26.SARS2-8 Rev GTTGCCGGCCTGCAGCCTCACGGTGGCTGATGCATCGGTAATGCCAGCCGC Generation of flipGFP recognized by SARS-CoV-2 Nsp5 protease 27.SARS2-9 For CATGCTGCAGAGCGCCGACGCCCAGAGCAAGGTGTCCGCCCTGAAGGAAAAAG Generation of flipGFP recognized by SARS-CoV-2 Nsp5 protease 28.SARS2-9 Rev GTCGGCGCTCTGCAGCATGGGCTCCCTTGATGCATCGGTAATGCCAGCCGC Generation of flipGFP recognized by SARS-CoV-2 Nsp5 protease 29.ZIKV-1 For GGAAGAGGAGGGGCGCCGACACCAGCATCAAGGTGTCCGCCCTGAAGGAAAAAG Generation of flipGFP recognized by ZIKV NS2B-NS3 protease 30.ZIKV-1 Rev GTCGGCGCCCCTCCTCTTCCTCTCCTTTGATGCATCGGTAATGCCAGCCGC Generation of flipGFP recognized by ZIKV NS2B-NS3 protease 31.ZIKV-2 For GCGGCAAGAGGAGCTGGCCCCCCAGCGAGAAGGTGTCCGCCCTGAAGGAAAAAG Generation of flipGFP recognized by ZIKV NS2B-NS3 protease 32.ZIKV-2 Rev GGGGCCAGCTCCTCTTGCCGCTCCTGGTTGATGCATCGGTAATGCCAGCCGC Generation of flipGFP recognized by ZIKV NS2B-NS3 protease 33.ZIKV-3 For GAAGCAGAGGAGCCCCCAGGACAACCAGAAGGTGTCCGCCCTGAAGGAAAAAG Generation of flipGFP recognized by ZIKV NS2B-NS3 protease 34.ZIKV-3 Rev CTGGGGGCTCCTCTGCTTCTCGGGCTCTGATGCATCGGTAATGCCAGCCGC Generation of flipGFP recognized by ZIKV NS2B-NS3 protease 35.ZIKV-4 For GTGAAGAGGAGGGGCGGCGGCACCGGCGAGAAGGTGTCCGCCCTGAAGGAAAAAG Generation of flipGFP recognized by ZIKV NS2B-NS3 protease 36.ZIKV-4 Rev GCCGCCGCCCCTCCTCTTCACCAGGCCTGATGCATCGGTAATGCCAGCCGC Generation of flipGFP recognized by ZIKV NS2B-NS3 protease 37.CHKV-1 For GGGCCGGCGCCGGCATCATCGAGACCCCCAAGGTGTCCGCCCTGAAGGAAAAAG Generation of flipGFP recognized by CHKV Nsp2 protease 38.CHKV-1 Rev GATGATGCCGGCGCCGGCCCTGTCCTCTGATGCATCGGTAATGCCAGCCGC Generation of flipGFP recognized by CHKV Nsp2 protease 39.CHKV-2 For CACCCTGTAGCTGGGGGCGCAGCCGGCCCAAGGTGTCCGCCCTGAAGGAAAAAG Generation of flipGFP recognized by CHKV Nsp2 protease 40.CHKV-2 Rev GCTGGGGGCGCAGCCGGCCCTGGTGGCTGATGCATCGGTAATGCCAGCCGC Generation of flipGFP recognized by CHKV Nsp2 protease 41.CHKV-3 For GGGCCGGCGGCTACATCTTCAGCAGCGACAAGGTGTCCGCCCTGAAGGAAAAAG Generation of flipGFP recognized by CHKV Nsp2 protease 42.CHKV-3 Rev GAAGATGTAGCCGCCGGCCCTGTCCAGTGATGCATCGGTAATGCCAGCCGC Generation of flipGFP recognized by CHKV Nsp2 protease 43.EV71-2A For CACTCTTGGGAAATTTGGACAAAAGGTGTCCGCCCTGAAGGAAAAAG Generation of flipGFP recognized by EV71 2A protease 44.EV71-2A Rev CAAATTTCCCAAGAGTGGTGATTGATGCATCGGTAATGCCAGCCGC Generation of flipGFP recognized by EV71 2A protease 45.EV71-3C For CGCCAGCGAGCAGGGCGAGATCCAGTGGGTGAAGGTGTCCGCCCTGAAGGAAAAAG Generation of flipGFP recognized by EV71 3C protease 46.EV71-3C Rev GATCTCGCCCTGCTCGCTGGCGAAGTAGCTTGATGCATCGGTAATGCCAGCCGC Generation of flipGFP recognized by EV71 3C protease

    [0115] The sequences contained in the sequence listing are numbered as follows:

    TABLE-US-00002 (SEQ ID) Nom 1 pLenti-Puro Linear For 2 pLenti-Puro Linear Rev 3 flipGFP For 4 flipGFP Rev 5 Membrane For (nt) 6 Membrane Rev (nt) 7 pLenti-Puro ER For (nt) 8 pLenti-Puro ER Rev (nt) 9 ER Signal For (nt) 10 ER Signal Rev (nt) 11 SARS2-1 For (nt) 12 SARS2-1 Rev (nt) 13 SARS2-2 For (nt) 14 SARS2-2 Rev (nt) 15 SARS2-3 For (nt) 16 SARS2-3 Rev (nt) 17 SARS2-4 For (nt) 18 SARS2-4 Rev (nt) 19 SARS2-5 For (nt) 20 SARS2-5 Rev (nt) 21 SARS2-6 For (nt) 22 SARS2-6 Rev (nt) 23 SARS2-7 For (nt) 24 SARS2-7 Rev (nt) 25 SARS2-8 For (nt) 26 SARS2-8 Rev (nt) 27 SARS2-9 For (nt) 28 SARS2-9 Rev (nt) 29 ZIKV-1 For (nt) 30 ZIKV-1 Rev (nt) 31 ZIKV-2 For (nt) 32 ZIKV-2 Rev (nt) 33 ZIKV-3 For (nt) 34 ZIKV-3 Rev (nt) 35 ZIKV-4 For (nt) 36 ZIKV-4 Rev (nt) 37 CHKV-1 For (nt) 38 CHKV-1 Rev (nt) 39 CHKV-2 For (nt) 40 CHKV-2 Rev (nt) 41 CHKV-3 For (nt) 42 CHKV-3 Rev (nt) 43 EV71-2A For (nt) 44 EV71-2A Rev (nt) 45 EV71-3C For (nt) 46 EV71-3C Rev (nt) 47 flipGFP_CHKV_1 (nt) 48 flipGFP_CHKV_2 (nt) 49 flipGFP_CHKV_3 (nt) 50 flipGFP_ER_CHKV_1 (nt) 51 flipGFP_ER_CHKV_2 (nt) 52 flipGFP_ER_CHKV_3 (nt) 53 flipGFP_ER_EV71_2A (nt) 54 flipGFP_ER_EV71_3C (nt) 55 flipGFP_ER_SARSCoV2_1 (nt) 56 flipGFP_ER_SARSCoV2_2 (nt) 57 flipGFP_ER_SARSCoV2_3 (nt) 58 flipGFP_ER_SARSCoV2_4 (nt) 59 flipGFP_ER_SARSCoV2_5 (nt) 60 flipGFP_ER_SARSCoV2_6 (nt) 61 flipGFP_ER_SARSCoV2_7 (nt) 62 flipGFP_ER_SARSCoV2_8 (nt) 63 flipGFP_ER_SARSCoV2_9 (nt) 64 flipGFP_ER_ZIKV_1 (nt) 65 flipGFP_ER_ZIKV_2 (nt) 66 flipGFP_ER_ZIKV_3 (nt) 67 flipGFP_ER_ZIKV_4 (nt) 68 flipGFP_EV71_2A (nt) 69 flipGFP_EV71_3C (nt) 70 flipGFP_Membrane_CHKV_1 (nt) 71 flipGFP_Membrane_CHKV_2 (nt) 72 flipGFP_Membrane_CHKV_3 (nt) 73 flipGFP_Membrane_EV71_2A (nt) 74 flipGFP_Membrane_EV71_3C (nt) 75 flipGFP_Membrane_SARSCoV2_1 (nt) 76 flipGFP_Membrane_SARSCoV2_2 (nt) 77 flipGFP_Membrane_SARSCoV2_3 (nt) 78 flipGFP_Membrane_SARSCoV2 4 (nt) 79 flipGFP_Membrane_SARSCoV2_5 (nt) 80 flipGFP_Membrane_SARSCoV2_6 (nt) 81 flipGFP_Membrane SARSCoV2 7 (nt) 82 flipGFP_Membrane_SARSCoV2 8 (nt) 83 flipGFP_Membrane_SARSCoV2_9 (nt) 84 flipGFP_Membrane_ZIKV_1 (nt) (nt) 85 flipGFP_Membrane_ZIKV_2 (nt) 86 flipGFP_Membrane_ZIKV_3 (nt) 87 flipGFP_Membrane_ZIKV_4 (nt) 88 flipGFP_SARSCoV2_1 (nt) 89 flipGFP_SARSCoV2_2 (nt) 90 flipGFP_SARSCoV2_3 (nt) 91 flipGFP_SARSCoV2_4 (nt) 92 flipGFP_SARSCoV2_5 (nt) 93 flipGFP_SARSCoV2_6 (nt) 94 flipGFP_SARSCoV2_7 (nt) 95 flipGFP_SARSCoV2_8 (nt) 96 flipGFP_SARSCoV2_9 (nt) 97 flipGFP_ZIKV_1 (nt) 98 flipGFP_ZIKV_2 (nt) 99 flipGFP_ZIKV_3 (nt) 100 flipGFP_ZIKV_4 (nt) 101 pLentiPuro_flipGFP_ER_3C(nt) 102 pLentiPuro_flipGFP_Membrane_3C (nt) 103 pLentiPuro-flipGFP-3C (nt) 104 ER retention signal (nt) 105 ER retention signal/1 (aa) 106 Membrane targeting signal (nt) 107 Membrane targeting signal (aa) 108 cleavage site for for CHKV-1 (nt) 109 cleavage site for for CHKV-2 110 cleavage site for for CHKV-3 111 EV71_2A 112 EV71_3C 113 SARSCoV2_1 114 SARSCoV2_2 115 SARSCoV2_3 116 SARSCoV2_4 117 SARSCoV2_5 118 SARSCoV2_6 119 SARSCoV2_7 120 SARSCoV2_8 121 SARSCoV2_9 122 ZIKV_1 123 ZIKV_2 124 ZIKV_3 125 ZIKV_4 126 Enterovirus 2A protease cleavage site (aa) 127 Enterovirus 3C protease cleavage site (aa) 128 Enterovirus 3C protease cleavage site (aa) 129 Alphaviruses protease cleavage site (aa) 130 SARS-CoV-2 protease cleavage site (aa) 131 SARS-CoV-2 protease cleavage site/1 (aa) 132 SARS-CoV-2 protease cleavage site/2 (aa) 133 SARS-CoV-2 protease cleavage site/3 (aa) 134 SARS-CoV-2 protease cleavage site/4 (aa) 135 SARS-CoV-2 protease cleavage site/5 (aa) 136 SARS-CoV-2 protease cleavage site/6 (aa) 137 SARS-CoV-2 protease cleavage site/7 (aa) 138 SARS-CoV-2 protease cleavage site/8 (aa) 139 SARS virus protease cleavage site (aa) 140 Flavivirus protease cleavage site (aa) 141 ZIK viruses protease cleavage site (aa) 142 ZIK viruses protease cleavage site/1 (aa) 143 ZIK viruses protease cleavage site/2 (aa) 144 ZIK viruses protease cleavage site/3 (aa) 145 FlipGFP 146 mCherry (nt) 147 2A peptide from Thosea asigna virus capsid 148 GFP1-9 149 Flipped GFP domains with protease cleavage site for SARS-2 protease 150 pLentiPuro-flipGFP-SARS2-HCVCore 151 pLentiPuro-flipGFP-SARS2-HIVVpu 152 HCV Core TM 153 HIV Vpu

    TABLE-US-00003 47 ATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAGAGGACAGGGCCGGCGCCGGCATCATCGAGACCCCCAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 48 ATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAGCCACCAGGGCCGGCTGCGCCCCCAGCTACAGGGTGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 49 ATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCACTGGACAGGGCCGGCGGCTACATCTTCAGCAGCGACAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 50 ATGGACCCCGTGGTGGTGCTGGGCCTGTGCCTGAGCTGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCCACGGCGGCGGCAAGATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAGAGGACAGGGCCGGCGCCGGCATCATCGAGACCCCCAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 51 ATGGACCCCGTGGTGGTGCTGGGCCTGTGCCTGAGCTGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCCACGGCGGCGGCAAGATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAGCCACCAGGGCCGGCTGCGCCCCCAGCTACAGGGTGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 52 ATGGACCCCGTGGTGGTGCTGGGCCTGTGCCTGAGCTGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCCACGGCGGCGGCAAGATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCACTGGACAGGGCCGGCGGCTACATCTTCAGCAGCGACAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 53 ATGGACCCCGTGGTGGTGCTGGGCCTGTGCCTGAGCTGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCCACGGCGGCGGCAAGATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAATCACCACTCTTGGGAAATTTGGACAAAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 54 ATGGACCCCGTGGTGGTGCTGGGCCTGTGCCTGAGCTGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCCACGGCGGCGGCAAGATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAGCTACTTCGCCAGCGAGCAGGGCGAGATCCAGTGGGTGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 55 ATGGACCCCGTGGTGGTGCTGGGCCTGTGCCTGAGCTGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCCACGGCGGCGGCAAGATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAGGGAGCTGAACGGCGGCGCCTACACCAGGTACGTGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGG CATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 56 ATGGACCCCGTGGTGGTGCTGGGCCTGTGCCTGAGCTGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCCACGGCGGCGGCAAGATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCATTCACCCTGAAGGGCGGCGCCCCCACCAAGGTGACCAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 57 ATGGACCCCGTGGTGGTGCTGGGCCTGTGCCTGAGCTGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCCACGGCGGCGGCAAGATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAATCGCCCTGAAGGGCGGCAAGATCGTGAACAACTGGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 58 ATGGACCCCGTGGTGGTGCTGGGCCTGTGCCTGAGCTGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCCACGGCGGCGGCAAGATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAACCAGCGCCGTGCTGCAGAGCGGCTTCAGGAAGATGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 59 ATGGACCCCGTGGTGGTGCTGGGCCTGTGCCTGAGCTGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCCACGGCGGCGGCAAGATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAGCGGCGTGACCTTCCAGAGCGCCGTGAAGAGGACCAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 60 ATGGACCCCGTGGTGGTGCTGGGCCTGTGCCTGAGCTGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCCACGGCGGCGGCAAGATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAAGGTGGCCACCGTGCAGAGCAAGATGAGCGACGTGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 61 ATGGACCCCGTGGTGGTGCTGGGCCTGTGCCTGAGCTGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCCACGGCGGCGGCAAGATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAGCGCCGTGAAGCTGCAGAACAACGAGCTGAGCCCCAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 62 ATGGACCCCGTGGTGGTGCTGGGCCTGTGCCTGAGCTGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCCACGGCGGCGGCAAGATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAGCCACCGTGAGGCTGCAGGCCGGCAACGCCACCGAGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 63 ATGGACCCCGTGGTGGTGCTGGGCCTGTGCCTGAGCTGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCCACGGCGGCGGCAAGATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAGGGAGCCCATGCTGCAGAGCGCCGACGCCCAGAGCAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 64 ATGGACCCCGTGGTGGTGCTGGGCCTGTGCCTGAGCTGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCCACGGCGGCGGCAAGATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAAGGAGAGGAAGAGGAGGGGCGCCGACACCAGCATCAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 65 ATGGACCCCGTGGTGGTGCTGGGCCTGTGCCTGAGCTGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCCACGGCGGCGGCAAGATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAACCAGGAGCGGCAAGAGGAGCTGGCCCCCCAGCGAGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 66 ATGGACCCCGTGGTGGTGCTGGGCCTGTGCCTGAGCTGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCCACGGCGGCGGCAAGATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAGAGCCCGAGAAGCAGAGGAGCCCCCAGGACAACCAGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 67 ATGGACCCCGTGGTGGTGCTGGGCCTGTGCCTGAGCTGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCCACGGCGGCGGCAAGATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAGGCCTGGTGAAGAGGAGGGGCGGCGGCACCGGCGAGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 68 ATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAATCACCACTCTTGGGAAATTTGGACAAAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 69 ATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAGCTACTTCGCCAGCGAGCAGGGCGAGATCCAGTGGGTGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 70 ATGGGCTGCTGCTTCAGCAAGACCATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAGAGGACAGGGCCGGCGCCGGCATCATCGAGACCCCCAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 71 ATGGGCTGCTGCTTCAGCAAGACCATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAGCCACCAGGGCCGGCTGCGCCCCCAGCTACAGGGTGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 72 ATGGGCTGCTGCTTCAGCAAGACCATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCACTGGACAGGGCCGGCGGCTACATCTTCAGCAGCGACAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 73 ATGGGCTGCTGCTTCAGCAAGACCATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAATCACCACTCTTGGGAAATTTGGACAAAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 74 ATGGGCTGCTGCTTCAGCAAGACCATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAGCTACTTCGCCAGCGAGCAGGGCGAGATCCAGTGGGTGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 75 ATGGGCTGCTGCTTCAGCAAGACCATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAGGGAGCTGAACGGCGGCGCCTACACCAGGTACGTGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 76 ATGGGCTGCTGCTTCAGCAAGACCATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCATTCACCCTGAAGGGCGGCGCCCCCACCAAGGTGACCAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 77 ATGGGCTGCTGCTTCAGCAAGACCATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAATCGCCCTGAAGGGCGGCAAGATCGTGAACAACTGGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 78 ATGGGCTGCTGCTTCAGCAAGACCATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAACCAGCGCCGTGCTGCAGAGCGGCTTCAGGAAGATGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 79 ATGGGCTGCTGCTTCAGCAAGACCATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAGCGGCGTGACCTTCCAGAGCGCCGTGAAGAGGACCAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 80 ATGGGCTGCTGCTTCAGCAAGACCATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAAGGTGGCCACCGTGCAGAGCAAGATGAGCGACGTGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 81 ATGGGCTGCTGCTTCAGCAAGACCATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAGCGCCGTGAAGCTGCAGAACAACGAGCTGAGCCCCAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 82 ATGGGCTGCTGCTTCAGCAAGACCATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAGCCACCGTGAGGCTGCAGGCCGGCAACGCCACCGAGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 83 ATGGGCTGCTGCTTCAGCAAGACCATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAGGGAGCCCATGCTGCAGAGCGCCGACGCCCAGAGCAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 84 ATGGGCTGCTGCTTCAGCAAGACCATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAAGGAGAGGAAGAGGAGGGGCGCCGACACCAGCATCAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 85 ATGGGCTGCTGCTTCAGCAAGACCATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAACCAGGAGCGGCAAGAGGAGCTGGCCCCCCAGCGAGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 86 ATGGGCTGCTGCTTCAGCAAGACCATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAGAGCCCGAGAAGCAGAGGAGCCCCCAGGACAACCAGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 87 ATGGGCTGCTGCTTCAGCAAGACCATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAGGCCTGGTGAAGAGGAGGGGCGGCGGCACCGGCGAGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 88 ATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAGGGAGCTGAACGGCGGCGCCTACACCAGGTACGTGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 89 ATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCATTCACCCTGAAGGGCGGCGCCCCCACCAAGGTGACCAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 90 ATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAATCGCCCTGAAGGGCGGCAAGATCGTGAACAACTGGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGG CGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 91 ATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAACCAGCGCCGTGCTGCAGAGCGGCTTCAGGAAGATGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 92 ATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAGCGGCGTGACCTTCCAGAGCGCCGTGAAGAGGACCAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 93 ATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAAGGTGGCCACCGTGCAGAGCAAGATGAGCGACGTGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 94 ATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAGCGCCGTGAAGCTGCAGAACAACGAGCTGAGCCCCAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 95 ATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAGCCACCGTGAGGCTGCAGGCCGGCAACGCCACCGAGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 96 ATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAGGGAGCCCATGCTGCAGAGCGCCGACGCCCAGAGCAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 97 ATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAAGGAGAGGAAGAGGAGGGGCGCCGACACCAGCATCAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 98 ATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAACCAGGAGCGGCAAGAGGAGCTGGCCCCCCAGCGAGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 99 ATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAGAGCCCGAGAAGCAGAGGAGCCCCCAGGACAACCAGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 100 ATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAGGCCTGGTGAAGAGGAGGGGCGGCGGCACCGGCGAGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGAGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAGAATTGATGGTGAGCAAGGGCGAGGAGGATAACATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTGCACATGGAGGGCTCCGTGAACGGCCACGAGTTCGAGATCGAGGGCGAGGGCGAGGGCCGCCCCTACGAGGGCACCCAGACCGCCAAGCTGAAGGTGACCAAGGGTGGCCCCCTGCCCTTCGCCTGGGACATCCTGTCCCCTCAGTTCATGTACGGCTCCAAGGCCTACGTGAAGCACCCCGCCGACATCCCCGACTACTTGAAGCTGTCCTTCCCCGAGGGCTTCAAGTGGGAGCGCGTGATGAACTTCGAGGACGGCGGCGTGGTGACCGTGACCCAGGACTCCTCCCTGCAGGACGGCGAGTTCATCTACAAGGTGAAGCTGCGCGGCACCAACTTCCCCTCCGACGGCCCCGTAATGCAGAAGAAGACCATGGGCTGGGAGGCCTCCTCCGAGCGGATGTACCCCGAGGACGGCGCCCTGAAGGGCGAGATCAAGCAGAGGCTGAAGCTGAAGGACGGCGGCCACTACGACGCTGAGGTCAAGACCACCTACAAGGCCAAGAAGCCCGTGCAGCTGCCCGGCGCCTACAACGTCAACATCAAGTTGGACATCACCTCCCACAACGAGGACTACACCATCGTGGAACAGTACGAACGCGCCGAGGGCCGCCACTCCACCGGCGGCATGGACGAGCTGTACAAGtaataaCACatcgacaatcaacctctggattacaaaatttgtgaaagattgactggtattcttaactatgttgctccttttacgctatgtggatacgctgctttaatgcctttgtatcatgctattgcttcccgtatggctttcattttctcctccttgtataaatcctggttgctgtctctttatgaggagttgtggcccgttgtcaggcaacgtggcgtggtgtgcactgtgfttgctgacgcaacccccactggttggggcattgccaccacctgtcagctcctttccgggactttcgctttccccctccctattgccacggcggaactcatcgccgcctgccttgcccgctgctggacaggggctcggctgftgggcactgacaattccgtggtgttgtcggggaagctgacgtcctttccatggctgctcgcctgtgttgccacctggattctgcgcgggacgtccttctgctacgtcccttcgGccctcaatccagcggaccttccttcccgcggcctgctgccggctctgcggcctcttccgcgtcttcgccttcgccctcagacgagtcggatctccctttgggccgcctccccgcctggaattcgagctcggtacctttaagaccaatgacttacaaggcagctgtagatcttagccactttttaaaagaaaaggggggactggaagggctaattcactcccaacgaagacaagatctgctttttgcttgtactgggtctctctggttagaccagatctgagcctgggagctctctggctaactagggaacccactgcttaagcctcaataaagcttgccttgagtgcttcaagtagtgtgtgcccgtctgttgtgtgactctggtaactagagatccctcagacccttttagtcagtgtggaaaatctctagcagtagtagttcatgtcatcttattattcagtatttataacttgcaaagaaatgaatatcagagagtgagaggaacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttatcatgtctggctctagctatcccgcccctaactccgcccatcccgcccctaactccgcccagttccgcccattctccgccccatggctgactaattttttttatttatgcagaggccgaggccgcctcggcctctgagctattccagaagtagtgaggaggcttttttggaggcctagggacgtacccaattcgccctatagtgagtcgtattacgcgcgctcactggccgtcgttttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttgcagcacatccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgcccttcccaacagttgcgcagcctgaatggcgaatgggacgcgccctgtagcggcgcattaagcgcggcgggtgtggtggttacgcgcagcgtgaccgctacacttgccagcgccctagcgcccgctcctttcgctttcttcccttcctttctcgccacgttcgccggctttccccgtcaagctctaaatcgggggctccctttagggttccgatttagtgctttacggcacctcgaccccaaaaaacttgattagggtgatggttcacgtagtgggccatcgccctgatagacggtttttcgccctttgacgttggagtccacgttctttaatagtggactcttgttccaaactggaacaacactcaaccctatctcggtctattcttttgatttataagggattttgccgatttcggcctattggttaaaaaatgagctgatttaacaaaaatttaacgcgaattttaacaaaatattaacgcttacaatttaggtggcacttttcggggaaatgtgcgcggaacccctatttgtttatttttctaaatacattcaaatatgtatccgctcatgagacaataaccctgataaatgcttcaataatattgaaaaaggaagagtatgagtattcaacatttccgtgtcgcccttattcccttttttgcggcattttgccttcctgtttttgctcacccagaaacgctggtgaaagtaaaagatgctgaagatcagttgggtgcacgagtgggttacatcgaactggatctcaacagcggtaagatccttgagagttttcgccccgaagaacgttttccaatgatgagcacttttaaagttctgctatgtggcgcggtattatcccgtattgacgccgggcaagagcaactcggtcgccgcatacactattctcagaatgacttggttgagtactcaccagtcacagaaaagcatcttacggatggcatgacagtaagagaattatgcagtgctgccataaccatgagtgataacactgcggccaacttacttctgacaacgatcggaggaccgaaggagctaaccgcttttttgcacaacatgggggatcatgtaactcgccttgatcgttgggaaccggagctgaatgaagccataccaaacgacgagcgtgacaccacgatgcctgtagcaatggcaacaacgttgcgcaaactattaactggcgaactacttactctagcttcccggcaacaattaatagactggatggaggcggataaagttgcaggaccacttctgcgctcggcccttccggctggctggtttattgctgataaatctggagccggtgagcgtgggtctcgcggtatcattgcagcactggggccagatggtaagccctcccgtatcgtagttatctacacgacggggagtcaggcaactatggatgaacgaaatagacagatcgctgagataggtgcctcactgattaagcattggtaactgtcagaccaagtttactcatatatactttagattgatttaaaacttcatttttaatttaaaaggatctaggtgaagatcctttttgataatctcatgaccaaaatcccttaacgtgagttttcgttccactgagcgtcagaccccgtagaaaagatcaaaggatcttcttgagatcctttttttctgcgcgtaatctgctgcttgcaaacaaaaaaaccaccgctaccagcggtggtttgtttgccggatcaagagctaccaactctttttccgaaggtaactggcttcagcagagcgcagataccaaatactgttcttctagtgtagccgtagttaggccaccacttcaagaactctgtagcaccgcctacatacctcgctctgctaatcctgttaccagtggctgctgccagtggcgataagtcgtgtcttaccgggttggactcaagacgatagttaccggataaggcgcagcggtcgggctgaacggggggttcgtgcacacagcccagcttggagcgaacgacctacaccgaactgagatacctacagcgtgagctatgagaaagcgccacgcttcccgaagggagaaaggcggacaggtatccggtaagcggcagggtcggaacaggagagcgcacgagggagcttccagggggaaacgcctggtatctttatagtcctgtcgggtttcgccacctctgacttgagcgtcgatttttgtgatgctcgtcaggggggcggagcctatggaaaaacgccagcaacgcggcctttttacggttcctggccttttgctggccttttgctcacatgttctttcctgcgttatcccctgattctgtggataaccgtattaccgcctttgagtgagctgataccgctcgccgcagccgaacgaccgagcgcagcgagtcagtgagcgaggaagcggaagagcgcccaatacgcaaaccgcctctccccgcgcgttggccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcgcaacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgcttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagctatgaccatgattacgccaagcgcgcaattaaccctcactaaagggaacaaaagctggagctgcaagctt 101 aatgtagtcttatgcaatactcttgtagtcttgcaacatggtaacgatgagttagcaacatgccttacaaggagagaaaaagcaccgtgcatgccgattggtggaagtaaggtggtacgatcgtgccttattaggaaggcaacagacgggtctgacatggattggacgaaccactgaattgccgcattgcagagatattgtatttaagtgcctagctcgatacataaacgggtctctctggttagaccagatctgagcctgggagctctctggctaactagggaacccactgcttaagcctcaataaagcttgccttgagtgcttcaagtagtgtgtgcccgtctgttgtgtgactctggtaactagagatccctcagacccttttagtcagtgtggaaaatctctagcagtggcgcccgaacagggacttgaaagcgaaagggaaaccagaggagctctctcgacgcaggactcggcttgctgaagcgcgcacggcaagaggcgaggggcggcgactggtgagtacgccaaaaattttgactagcggaggctagaaggagagagatgggtgcgagagcgtcagtattaagcgggggagaattagatcgcgatgggaaaaaattcggttaaggccagggggaaagaaaaaatataaattaaaacatatagtatgggcaagcagggagctagaacgattcgcagttaatcctggcctgttagaaacatcagaaggctgtagacaaatactgggacagctacaaccatcccttcagacaggatcagaagaacttagatcattatataatacagtagcaaccctctattgtgtgcatcaaaggatagagataaaagacaccaaggaagctttagacaagatagaggaagagcaaaacaaaagtaagaccaccgcacagcaagcggccgctgatcttcagacctggaggaggagatatgagggacaattggagaagtgaattatataaatataaagtagtaaaaattgaaccattaggagtagcacccaccaaggcaaagagaagagtggtgcagagagaaaaaagagcagtgggaataggagctttgttccttgggttcttgggagcagcaggaagcactatgggcgcagcgtcaatgacgctgacggtacaggccagacaattattgtctggtatagtgcagcagcagaacaatttgctgagggctattgaggcgcaacagcatctgttgcaactcacagtctggggcatcaagcagctccaggcaagaatcctggctgtggaaagatacctaaaggatcaacagctcctggggatttggggttgctctggaaaactcatttgcaccactgctgtgccttggaatgctagttggagtaataaatctctggaacagatttggaatcacacgacctggatggagtgggacagagaaattaacaattacacaagcttaatacactccttaattgaagaatcgcaaaaccagcaagaaaagaatgaacaagaattattggaattagataaatgggcaagtttgtggaattggtttaacataacaaattggctgtggtatataaaattattcataatgatagtaggaggcttggtaggtttaagaatagtttttgctgtactttctatagtgaatagagttaggcagggatattcaccattatcgtttcagacccacctcccaaccccgaggggacccgacaggcccgaaggaatagaagaagaaggtggagagagagacagagacagatccattcgattagtgaacggatctcgacggtatcggttaacttttaaaagaaaaggggggattggggggtacagtgcaggggaaagaatagtagacataatagcaacagacatacaaactaaagaattacaaaaacaaattacaaaattcaaaattttatcgataagcttgggagttccgcgttacataacttacggtaaatggcccgcctggctgaccgcccaacgacccccgcccattgacgtcaataatgacgtatgttcccatagtaacgccaatagggactftccaftgacgtcaatgggtggagtatttacggtaaactgcccacttggcagtacatcaagtgtatcatatgccaagtacgccccctattgacgtcaatgacggtaaatggcccgcctggcattatgcccagtacatgaccttatgggactttcctacttggcagtacatctacgtattagtcatcgctattaccatggtgatgcggttttggcagtacatcaatgggcgtggatagcggtttgactcacggggatttccaagtctccaccccattgacgtcaatgggagtttgttttggcaccaaaatcaacgggactttccaaaatgtcgtaacaactccgccccattgacgcaaatgggcggtaggcgtgtacggtgggaggtctatataagcagagctcgtttagtgaaccgtcagatcgcctggagacgccatccacgctgttttgacctccatagaagacaccgcccgggcgtacgggatccATGGACCCCGTGGTGGTGCTGGGCCTGTGCCTGAGCTGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCCACGGCGGCGGCAAGATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAGCTACTTCGCCAGCGAGCAGGGCGAGATCCAGTGGGTGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCCAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGAGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAGAATTGATGGTGAGCAAGGGCGAGGAGGATAACATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTGCACATGGAGGGCTCCGTGAACGGCCACGAGTTCGAGATCGAGGGCGAGGGCGAGGGCCGCCCCTACGAGGGCACCCAGACCGCCAAGCTGAAGGTGACCAAGGGTGGCCCCCTGCCCTTCGCCTGGGACATCCTGTCCCCTCAGTTCATGTACGGCTCCAAGGCCTACGTGAAGCACCCCGCCGACATCCCCGACTACTTGAAGCTGTCCTTCCCCGAGGGCTTCAAGTGGGAGCGCGTGATGAACTTCGAGGACGGCGGCGTGGTGACCGTGACCCAGGACTCCTCCCTGCAGGACGGCGAGTTCATCTACAAGGTGAAGCTGCGCGGCACCAACTTCCCCTCCGACGGCCCCGTAATGCAGAAGAAGACCATGGGCTGGGAGGCCTCCTCCGAGCGGATGTACCCCGAGGACGGCGCCCTGAAGGGCGAGATCAAGCAGAGGCTGAAGCTGAAGGACGGCGGCCACTACGACGCTGAGGTCAAGACCACCTACAAGGCCAAGAAGCCCGTGCAGCTGCCCGGCGCCTACAACGTCAACATCAAGTTGGACATCACCTCCCACAACGAGGACTACACCATCGTGGAACAGTACGAACGCGCCGAGGGCCGCCACTCCACCGGCGGCATGGACGAGCTGTACAAGtaataaCACatcgacaatcaacctctggattacaaaatttgtgaaagattgactggtattcttaactatgttgctccttttacgctatgtggatacgctgctttaatgcctttgtatcatgctattgcttcccgtatggctttcattttctcctccttgtataaatcctggttgctgtctctttatgaggagttgtggcccgttgtcaggcaacgtggcgtggtgtgcactgtgtttgctgacgcaacccccactggttggggcattgccaccacctgtcagctcctttccgggactttcgctttccccctccctattgccacggcggaactcatcgccgcctgccttgcccgctgctggacaggggctcggctgttgggcactgacaattccgtggtgttgtcggggaagctgacgtcctttccatggctgctcgcctgtgttgccaccctggattctgcgcgggacgtccttctgctacgtcccttcggccctcaatccagcggaccttccttcccgcggcctgctgccggctctgcggcctcttccgcgtcttcgccttcgccctcagacgagtcggatctccctttgggccgcctccccgcctggaattcgagctcggtacctttaagaccaatgacttacaaggcagctgtagatcttagccactttttaaaagaaaaggggggactggaagggctaattcactcccaacgaagacaagatctgctttttgcttgtactgggtctctctggttagaccagatctgagcctgggagctctctggctaactagggaacccactgcttaagcctcaataaagcttgccttgagtgcttcaagtagtgtgtgcccgtctgttgtgtgactctggtaactagagatccctcagacccttttagtcagtgtggaaaatctctagcagtagtagttcatgtcatcttattattcagtattttataacttgcaaagaaatgaatatcagagagtgagaggaacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttatcatgtctggctctagctatcccgcccctaactccgcccatcccgcccctaactccgcccagttccgcccattctccgccccatggctgactaatttttttatttatgcagaggccgaggccgcctcggcctctgagctattccagaagtagtgaggaggcttttttggaggcctagggacgtacccaattcgccctatagtgagtcgtattacgcgcgctcactggccgtcgttttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttgcagcacatccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgcccttcccaacagttgcgcagcctgaatggcgaatgggacgcgccctgtagcggcgcattaagcgcggcgggtgtggtggttacgcgcagcgtgaccgctacacttgccagcgccctagcgcccgctcctttcgctttcttcccttcctttctcgccacgttcgcccggctttccccgtcaagctctaaatcgggggctccctttagggttccgatttagtgctttacggcacctcgaccccaaaaaacttgattagggtgatggttcacgtagtgggccatcgccctgatagacggtttttcgccctttgacgttggagtccacgttctttaatagtggactcttgttccaaactggaacaacactcaaccctatctcggtctattcttttgatttataagggattttgccgatttcggcctattggttaaaaaatgagctgatttaacaaaaatttaacgcgaattttaacaaaatattaacgcttacaatttaggtggcacttttcggggaaatgtgcgcggaacccctatttgtttatttttctaaatacattcaaatatgtatccgctcatgagacaataaccctgataaatgcttcaataatattgaaaaaggaagagtatgagtattcaacatttccgtgtcgcccttattccctttttgcggcattttgccttcctgtttttgctcacccagaaacgctggtgaaagtaaaagatgctgaagatcagttgggtgcacgagtgggttacatcgaactggatctcaacagcggtaagatccttgagagttttcgccccgaagaacgttttccaatgatgagcacttttaaagttctgctatgtggcgcggtattatcccgtattgacgccgggcaagagcaactcggtcgccgcatacactattctcagaatgacttggttgagtactcaccagtcacagaaaagcatcttacggatggcatgacagtaagagaattatgcagtgctgccataaccatgagtgataacactgcggccaacttacttctgacaacgatcggaggaccgaaggagctaaccgcttttttgcacaacatgggggatcatgtaactcgccttgatcgttgggaaccggagctgaatgaagccataccaaacgacgagcgtgacaccacgatgcctgtagcaatggcaacaacgttgcgcaaactattaactggcgaactacttactctagcttcccggcaacaa ttaatagactggatggaggcggataaagttgcaggaccacttctgcgctcggcccttccggctggctggtttattgctgataaatctggagccggtgagcgtgggtctgcggtatcattgcagcactggggccagatggtaagccctcccgtatcgtagttatctacacgacggggagtcaggcaactatggatgaacgaaaatagacagatcgctgagataggtgcctcactgattaagcattggtaactgtcagaccaagtttactcatatatactttagattgatttaaaacttcatttttaatttaaaaggatctaggtgaagatcctttttgataatctcatgaccaaaatcccttaacgtgagttttcgttccactgagcgtcagaccccgtagaaaagatcaaaggatcttcttgagatcctttttttctgcgcgtaatctgctgcttgcaaacaaaaaaaccaccgctaccagcggtggtttgtttgccggatcaagagctaccaactctttttccgaaggtaactggcttcagcagagcgcagataccaaatactgttcttctagtgtagccgtagttaggccaccacttcaagaactctgtagcaccgcctacatacctcgctctgctaatcctgttaccagtggctgctgccagtggcgataagtcgtgtcttaccgggttggactcaagacgatagttaccggataaggcgcagcggtcgggctgaacggggggttcgtgcacacagcccagcttggagcgaacgacctacaccgaactgagatacctacagcgtgagctatgagaaagcgccacgcttcccgaagggagaaaggcggacaggtatccggtaagcggcagggtcggaacaggagagcgcacgagggagcttccagggggaaacgcctggtatctttatagtcctgtcgggtttcgccacctctgacttgagcgtcgatttttgtgatgctcgtcaggggggcggagcctatggaaaaacgccagcaacgcggcctttttacggttcctggccttttgctggccttttgctcacatgttctttcctgcgttatcccctgattctgtggataaccgtattaccgcctttgagtgagctgataccgctcgccgcagccgaacgaccgagcgcagcgagtcagtgagcgaggaagcggaagagcgcccaatacgcaaaccgcctctcccgcgcgttggccgattcattaatgcagctggcacgacaggtttcccactggaaagcgggcagtgagcgcaacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgcttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagctatgaccatgattacgccaagcgcgcaattaaccctcactaaagggaacaaaagctggagctgcaagctt 102 aatgtagtcttatgcaatactcttgtagtcttgcaacatggtaacgatgagttagcaacatgccttacaaggagagaaaaagcaccgtgcatgccgattggtggaagtaaggtggtacgatcgtgccttattaggaaggcaacagacgggtctgacatggattggacgaaccactgaattgccgcattgcagagatattgtatttaagtgcctagctcgatacataaacgggtctctctggttagaccagatctgagcctgggagctctctggctaactagggaacccactgcttaagcctcaataaagcttgccttgagtgcttcaagtagtgtgtgcccgtctgttgtgtgactctggtaactagagatccctcagacccttttagtcagtgtggaaaatctctagcagtggcgcccgaacagggacttgaaagcgaaagggaaaccagaggagctctctcgacgcaggactcggcttgctgaagcgcgcacggcaagaggcgaggggcggcgactggtgagtacgccaaaaattttgactagcggaggctagaaggagagagatgggtgcgagagcgtcagtattaagcgggggagaattagatcgcgatgggaaaaaattcggttaaggccagggggaaagaaaaaatataaattaaaacatatagtatgggcaagcagggagctagaacgattcgcagttaatcctggcctgttagaaacatcagaaggctgtagacaaatactgggacagctacaaccatcccttcagacaggatcagaagaacttagatcattatataatacagtagcaaccctctattgtgtgcatcaaaggatagagataaaagacaccaaggaagctttagacaagatagaggaagagcaaaacaaaagtaagaccaccgcacagcaagcggccgctgatcttcagacctggaggaggagatatgagggacaattggagaagtgaattatataaatataaagtagtaaaaattgaaccattaggagtagcacccaccaaggcaaagagaagagtggtgcagagagaaaaaagagcagtgggaataggagctttgttccttgggttcttgggagcagcaggaagcactatgggcgcagcgtcaatgacgctgacggtacaggccagacaattattgtctggtatagtgcagcagcagaacaatttgctgagggctattgaggcgcaacagcatctgttgcaactcacagtctggggcatcaagcagctccaggcaagaatcctggctgtggaaagatacctaaaggatcaacagctcctggggatttggggttgctctggaaaactcatttgcaccactgctgtgccttggaatgctagttggagtaataaatctctggaacagatttggaatcacacgacctggatggagtgggacagagaaattaacaattacacaagcttaatacactccttaattgaagaatcgcaaaaccagcaagaaaagaatgaacaagaattattggaattagataaatgggcaagtttgtggaattggtttaacataacaaattggctgtggtatataaaattattcataatgatagtaggaggcttggtaggtttaagaatagtttttgctgtactttctatagtgaatagagttaggcagggatattcaccattatcgtttcagacccacctcccaaccccgaggggacccgacaggcccgaaggaatagaagaagaaggtggagagagagacagagacagatccattcgattagtgaacggatctcgacggtatcggttaacttttaaaagaaaaggggggattggggggtacagtgcaggggaaagaatagtagacataatagcaacagacatacaaactaaagaattacaaaaacaaattacaaaattcaaaattttatcgataagcttgggagttccgcgttacataacttacggtaaatggcccgcctggctgaccgcccaacgacccccgcccattgacgtcaataatgacgtatgttcccatagtaacgccaatagggactttccattgacgtcaatgggtggagtatttacggtaaactgcccacttggcagtacatcaagtgtatcatatgccaagtacgccccctattgacgtcaatgacggtaaatggcccgcctggcattatgcccagtacatgaccttatgggactttcctacttggcagtacatctacgtattagtcatcgctattaccatggtgatgcggttttggcagtacatcaatgggcgtggatagcggtttgactcacggggatttccaagtctccaccccattgacgtcaatgggagtttgttttggcaccaaaatcaacgggactttccaaaatgtcgtaacaactccgccccattgacgcaaatgggcggtaggcgtgtacggtgggaggtctatataagcagagctcgtttagtgaaccgtcagatcgcctggagacgccatccacgctgttttgacctccatagaagacaccgcccgggcgtacgggatccATGGGCTGCTGCTTCAGCAAGACCATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAGCTACTTCGCCAGCGAGCAGGGCGAGATCCAGTGGGTGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAATTTTTTGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGAGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAGAATTGATGGTGAGCAAGGGCGAGGAGGATAACATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTGCACATGGAGGGCTCCGTGAACGGCCACGAGTTCGAGATCGAGGGCGAGGGCGAGGGCCGCCCCTACGAGGGCACCCAGACCGCCAAGCTGAAGGTGACCAAGGGTGGCCCCCTGCCCTTCGCCTGGGACATCCTGTCCCCTCAGTTCATGTACGGCTCCAAGGCCTACGTGAAGCACCCCGCCGACATCCCCGACTACTTGAAGCTGTCCTTCCCCGAGGGCTTCAAGTGGGAGCGCGTGATGAACTTCGAGGACGGCGGCGTGGTGACCGTGACCCAGGACTCCTCCCTGCAGGACGGCGAG TTCATCTACAAGGTGAAGCTGCGCGGCACCAACTTCCCCTCCGACGGCCCCGTAATGCAGAAGAAGACCATGGGCTGGGAGGCCTCCTCCGAGCGGATGTACCCCGAGGACGGCGCCCTGAAGGGCGAGATCAAGCAGAGGCTGAAGCTGAAGGACGGCGGCCACTACGACGCTGAGGTCAAGACCACCTACAAGGCCAAGAAGCCCGTGCAGCTGCCCGGCGCCTACAACGTCAACATCAAGTTGGACATCACCTCCCACAACGAGGACTACACCATCGTGGAACAGTACGAACGCGCCGAGGGCCGCCACTCCACCGGCGGCATGGACGAGCTGTACAAGtaataaCACatcgacaatcaacctctggattacaaaatttgtgaaagattgactggtattcttaactatgttgctccttttacgctatgtggatacgctgctttaatgcctttgtatcatgctattgcttcccgtatggctttcattttctcctccttgtataaatcctggttgctgtctctttatgaggagttgtggcccgttgtcaggcaacgtggcgtggtgtgcactgtgtttgctgacgcaacccccactggttggggcattgccaccacctgtcagctcctttccgggactttcgctttccccctccctattgccacggcggaactcatcgccgcctgccttgcccgctgctggacaggggctcggctgttgggcactgacaattccgtggtgttgtcggggaagctgacgtcctttccatggctgctcgcctgtgttgccacctggattctgcgcgggacgtccttctgctacgtcccttcggccctcaatccagcggaccttccttcccgcggcctgctgccggctctgcggcctcttccgcgtcttcgccttcgccctcagacgagtcggatctccctttgggccgcctccccgcctggaattcgagctcggtacctttaagaccaatgacttacaaggcagctgtagatcttagccactttttaaaagaaaaggggggactggaagggctaattcactcccaacgaagacaagatctgctttttgcttgtactgggtctctctggttagaccagatctgagcctgggagctctctggctaactagggaacccactgcttaagcctcaataaagcttgccttgagtgcttcaagtagtgtgtgcccgtctgttgtgtgactctggtaactagagatccctcagacccttttagtcagtgtggaaaatctctagcagtagtagttcatgtcatcttattattcagtatttataacttgcaaagaaatgaatatcagagagtgagaggaacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttatcatgtctggctctagctatcccgcccctaactccgccatccegeoccctaactcegceccagttcecgeoccattctccgecccatggctgactaattttttttatttatgcagaggccgaggccgcctcggcctctgagctattccagaagtagtgaggaggcttttttggaggcctagggacgtacccaattcgccctatagtgagtcgtattacgcgcgctcactggccgtcgttttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttgcagcacatccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgcccttcccaacagttgcgcagcctgaatggcgaatgggacgcgccctgtagcggcgcattaagcgcggcgggtgtggtggttacgcgcagcgtgaccgctacacttgccagcgccctagcgcccgctcctttcgctttcttcccttcctttctcgccacgttcgccggctttccccgtcaagctctaaatcgggggctccctttagggttccgatttagtgctttacggcacctcgaccccaaaaaacttgattagggtgatggttcacgtagtgggccatcgccctgatagacggtttttcgccctttgacgttggagtccacgttctttaatagtggactcttgttccaaactggaacaacactcaaccctatctcggtctattcttttgatttataagggattttgccgatttcggcctattggttaaaaaatgagctgatttaacaaaaatttaacgcgaattttaacaaaatattaacgcttacaatttaggtggcacttttcggggaaatgtgcgcggaacccctatttgtttatttttctaaatacattcaaatatgtatccgctcatgagacaataaccctgataaatgcttcaataatattgaaaaaggaagagtatgagtattcaacatttccgtgtcgcccttattcccttttttgcggcattttgccttcctgtttttgctcacccagaaacgctggtgaaagtaaaagatgctgaagatcagttgggtgcacgagtgggttacatcgaactggatctcaacagcggtaagatccttgagagttttcgccccgaagaacgttttccaatgatgagcacttttaaagttctgctatgtggcgcggtattatcccgtattgacgccgggcaagagcaactcggtcgccgcatacactattctcagaatgacttggttgagtactcaccagtcacagaaaagcatcttacggatggcatgacagtaagagaattatgcagtgctgccataaccatgagtgataacactgcggccaacttacttctgacaacgatcggaggaccgaaggagctaaccgcttttttgcacaacatgggggatcatgtaactcgccttgatcgttgggaaccggagctgaatgaagccataccaaacgacgagcgtgacaccacgatgcctgtagcaatggcaacaacgttgcgcaaactattaactggcgaactacttactctagcttcccggcaacaattaatagactggatggaggcggataaagttgcaggaccacttctgcgctcggcccttccggctggctggtttattgctgataaatctggagccggtgagcgtgggtctcgcggtatcattgcagcactggggccagatggtaagccctcccgtatcgtagttatctacacgacggggagtcaggcaactatggatgaacgaaatagacagatcgctgagataggtgcctcactgattaagcattggtaactgtcagaccaagtttactcatatatactttagattgatttaaaacttcatttttaatttaaaaggatctaggtgaagatcctttttgataatctcatgaccaaaatcccttaacgtgagttttcgttccactgagcgtcagaccccgtagaaaagatcaaaggatcttcttgagatcctttttttctgcgcgtaatctgctgcttgcaaacaaaaaaaccaccgctaccagcggtggtttgtttgccggatcaagagctaccaactctttttccgaaggtaactggcttcagcagagcgcagataccaaatactgttcttctagtgtagccgtagttaggccaccacttcaagaactctgtagcaccgcctacatacctcgctctgctaatcctgttaccagtggctgctgccagtggcgataagtcgtgtcttaccgggttggactcaagacgatagttaccggataaggcgcagcggtcgggctgaacggggggttcgtgcacacagcccagcttggagcgaacgacctacaccgaactgagatacctacagcgtgagctatgagaaagcgccacgcttcccgaagggagaaaggcggacaggtatccggtaagcggcagggtcggaacaggagagcgcacgagggagcttccagggggaaacgcctggtatctttatagtcctgtcgggtttcgccacctctgacttgagcgtcgatttttgtgatgctcgtcaggggggcggagcctatggaaaaacgccagcaacgcggcctttttacggttcctggccttttgctggccttttgctcacatgttctttcctgcgttatcccctgattctgtggataaccgtattaccgcctttgagtgagctgataccgctcgccgcagccgaacgaccgagcgcagcgagtcagtgagcgaggaagcggaagagcgcccaatacgcaaaccgcctctccccgcgcgttggccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcgcaacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgcttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagctatgaccatgattacgccaagcgcgcaattaaccctcactaaagggaacaaaagctggagctgcaagcttGGCACCAACTTCCCCTCCGACGGCCCCGTAATGCAGAAGAAGACCATGGGCTGGGAGGCCTCCTCCGAGCGGATGTACCCCGAGGACGGCGCCCTGAAGGGCGAGATCAAGCAGAGGCTGAAGCTGAAGGACGGCGGCCACTACGACGCTGAGGTCAAGACCACCTACAAGGCCAAGAAGCCCGTGCAGCTGCCCGGCGCCTACAACGTCAACATCAAGTTGGACATCACCTCCCACAACGAGGACTACACCATCGTGGAACAGTACGAACGCGCCGAGGGCCGCCACTCCACCGGCGGCATGGACGAGCTGTACAAGtaataaCACatcgacaatcaacctctggattacaaaatttgtgaaagattgactggtattcttaactatgttgctccttttacgctatgtggatacgctgctttaatgcctttgtatcatgctattgcttcccgtatggctttcattttctcctccttgtataaatcctggttgctgtctctttatgaggagttgtggcccgttgtcaggcaacgtggcgtggtgtgcactgtgtttgctgacgcaacccccactggttggggcttgccaccacctgtcagctcctttccgggactttcgctttccccctccctattgccacggcggaactcatcgccgcctgccttgcccgctgctggacaggggctcggctgttgggcactgacaattccgtggtgttgtcggggaagctgacgtcctttccatggctgctcgcctgtgttgccacctggattctgcgcgggacgtccttctgctacgtcccttcggccctcaatccagcggaccttccttcccgcggcctgctgccggctctgcggcctcttccgcgtcttcgccttcgccctcagacgagtcggatctccctttgggccgcctccccgcctggaattcgagctcggtacctttaagaccaatgacttacaaggcagctgtagatcttagccactttttaaaagaaaaggggggactggaagggctaattcactcccaacgaagacaagatctgctttttgcttgtactgggtctctctggttagaccagatctgagcctgggagctctctggctaactagggaacccactgcttaagcctcaataaagcttgccttgagtgcttcaagtagtgtgtgcccgtctgttgtgtgactctggtaactagagatccctcagacccttttagtcagtgtggaaaatctctagcagtagtagttcatgtcatcttattattcagtatttataacttgcaaagaaatgaatatcagagagtgagaggaacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttatcatgtctggctctagctatcccgcccctaactccgcccatcccgcccctaactccgcccagttccgcccattctccgccccatggctgactaattttttttatttatgcagaggccgaggccgcctcggcctctgagctattccagaagtagtgaggaggcttttttggaggcctagggacgtacccaattcgccctatagtgagtcgtattacgcgcgctcactggccgtcgttttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttgcagcacatccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgcccttcccaacagttgcgcagcctgaatggcgaatgggacgcgccctgtagcggcgcattaagcgcggcgggtgtggtggttacgcgcagcgtgaccgctacacttgccagcgccctagcgcccgctcctttcgctttcttcccttcctttctcgccacgttcgccggctttccccgtcaagctctaaatcgggggctccctttagggttccgatttagtgctttacggcacctcgaccccaaaaaacttgattagggtgatggttcacgtagtgggccatcgccctgatagacggtttttcgccctttgacgttggagtccacgttctttaatagtggactcttgttccaaactggaacaacactcaaccctatctcggtctattcttttgatttataagggattttgccgatttcggcctattggttaaaaaatgagctgatttaacaaaaatttaacgcgaattttaacaaaatattaacgcttacaatttaggtggcacttttcggggaaatgtgcgcggaacccctatttgtttatttttctaaatacattcaaatatgtatccgctcatgagacaataaccctgataaatgcttcaataatattgaaaaaggaagagtatgagtattcaacatttccgtgtcgcccttattcccttttttgcggcattttgccttcctgtttttgctcacccagaaacgctggtgaaagtaaaagatgctgaagatcagttgggtgcacgagtgggttacatcgaactggatctcaacagcggtaagatccttgagagttttcgccccgaagaacgttttccaatgatgagcacttttaaagttctgctatgtggcgcggtattatcccgtattgacgccgggcaagagcaactcggtcgccgcatacactattctcagaatgacttggttgagtactcaccagtcacagaaaagcatcttacggatggcatgacagtaagagaattatgcagtgctgccataaccatgagtgataacactgcggccaacttacttctgacaacgatcggaggaccgaaggagctaaccgcttttttgcacaacatgggggatcatgtaactcgccttgatcgttgggaaccggagctgaatgaagccataccaaacgacgagcgtgacaccacgatgcctgtagcaatggcaacaacgttgcgcaaactattaactggcgaactacttactctagcttcccggcaacaattaatagactggatggaggcggataaagttgcaggaccacttctgcgctcggcccttccggctggctggtttattgctgataaatctggagccggtgagcgtgggtctcgcggtatcattgcagcactggggccagatggtaagccctcccgtatcgtagttatctacacgacggggagtcaggcaactatggatgaacgaaatagacagatcgctgagataggtgcctcactgattaagcattggtaactgtcagaccaagtttactcatatatactttagattgatttaaaacttcatttttaatttaaaaggatctaggtgaagatcctttttgataatctcatgaccaaaatcccttaacgtgagttttcgttccactgagcgtcagaccccgtagaaaagatcaaaggatcttcttgagatcctttttttctgcgcgtaatctgctgcttgcaaacaaaaaaaccaccgctaccagcggtggtttgtttgccggatcaagagctaccaactctttttccgaaggtaactggcttcagcagagcgcagataccaaatactgttcttctagtgtagccgtagttaggccaccacttcaagaactctgtagcaccgcctacatacctcgctctgctaatcctgttaccagtggctgctgccagtggcgataagtcgtgtcttaccgggttggactcaagacgatagttaccggataaggcgcagcggtcgggctgaacggggggttcgtgcacacagcccagcttggagcgaacgacctacaccgaactgagatacctacagcgtgagctatgagaaagcgccacgcttcccgaagggagaaaggcggacaggtatccggtaagcggcagggtcggaacaggagagcgcacgagggagcttccagggggaaacgcctggtatctttatagtcctgtcgggtttcgccacctctgacttgagcgtcgatttttgtgatgctcgtcaggggggcggagcctatggaaaaacgccagcaacgcggcctttttacggttcctggccttttgctggccttttgctcacatgttctttcctgcgttatcccctgattctgtggataaccgtattaccgcctttgagtgagctgataccgctcgccgcagccgaacgaccgagcgcagcgagtcagtgagcgaggaagcggaagagcgcccaatacgcaaaccgcctctccccgcgcgttggccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcgcaacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgcttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagctatgaccatgattacgccaagcgcgcaattaaccctcactaaagggaacaaaagctggagctgcaagctt 103 aatgtagtcttatgcaatactcttgtagtcttgcaacatggtaacgatgagttagcaacatgccttacaaggagagaaaaagcaccgtgcatgccgattggtggaagtaaggtggtacgatcgtgccttattaggaaggcaacagacgggtctgacatggattggacgaaccactgaattgccgcattgcagagatattgtatttaagtgcctagctcgatacataaacgggtctctctggttagaccagatctgagcctgggagctctctggctaactagggaacccactgcttaagcctcaataaagcttgccttgagtgcttcaagtagtgtgtgcccgtctgttgtgtgactctggtaactagagatccctcagacccttttagtcagtgtggaaaatctctagcagtggcgcccgaacagggacttgaaagcgaaagggaaaccagaggagctctctcgacgcaggactcggcttgctgaagcgcgcacggcaagaggcgaggggcggcgactggtgagtacgccaaaaattttgactagcggaggctagaaggagagagatgggtgcgagagcgtcagtattaagcgggggagaattagatcgcgatgggaaaaaattcggttaaggccagggggaaagaaaaaatataaattaaaacatatagtatgggcaagcagggagctagaacgattcgcagttaatcctggcctgttagaaacatcagaaggctgtagacaaatactgggacagctacaaccatcccttcagacaggatcagaagaacttagatcattatataatacagtagcaaccctctattgtgtgcatcaaaggatagagataaaagacaccaaggaagctttagcaagatagaggaagagcaaaacaaaagtaagaccaccgcacagcaagcggccgctgatcttcagacctggaggaggagatatgagggacaattggagaagtgaattatataaatataaagtagtaaaaattgaaccattaggagtagcacccaccaaggcaaagagaagagtggtgcagagagaaaaaagagcagtgggaataggagctttgttccttgggttcttgggagcagcaggaagcactatgggcgcagcgtcaatgacgctgacggtacaggccagacaattattgtctggtatagtgcagcagcagaacaatttgctgagggctattgaggcgcaacagcatctgttgcaactcacagtctggggcatcaagcagctccaggcaagaatcctggctgtggaaagatacctaaaggatcaacagctcctggggatttggggttgctctggaaaactcatttgcaccactgctgtgccttggaatgctagttggagtaataaatctctggaacagatttggaatcacacgacctggatggagtgggacagagaaattaacaattacacaagcttaatacactccttaattgaagaatcgcaaaaccagcaagaaaagaatgaacaagaattattggaattagataaatgggcaagtttgtggaattggtttaacataacaaattggctgtggtatataaaattattcataatgatagtaggaggcttggtaggtttaagaatagtttttgctgtactttctatagtgaatagagttaggcagggatattcaccattatcgtttcagacccacctcccaaccccgaggggacccgacaggcccgaaggaatagaagaagaaggtggagagagagacagagacagatccattcgattagtgaacggatctcgacggtatcggttaacttttaaaagaaaaggggggattggggggtacagtgcaggggaaagaatagtagacataatagcaacagacatacaaactaaagaattacaaaaacaaattacaaaattcaaaattttatcgataagcttgggagttccgcgttacataacttacggtaaatggcccgcctggctgaccgcccaacgacccccgcccattgacgtcaataatgacgtatgttcccatagtaacgccaatagggactttccattgacgtcaatgggtggagtatttacggtaaactgcccacttggcagtacatcaagtgtatcatatgccaagtacgccccctattgacgtcaatgacggtaaatggcccgcctggcattatgcccagtacatgaccttatgggactttcctacttggcagtacatctacgtattagtcatcgctattaccatggtgatgcggttttggcagtacatcaatgggcgtggatagcggtttgactcacggggatttccaagtctccaccccattgacgtcaatgggagtttgttttggcaccaaaatcaacgggactttccaaaatgtcgtaacaactccgccccattgacgcaaatgggcggtaggcgtgtacggtgggaggtctatataagcagagctcgtttagtgaaccgtcagatcgcctggagacgccatccacgctgttttgacctccatagaagacaccgcccgggcgtacgggatccATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAGCTACTTCGCCAGCGAGCAGGGCGAGATCCAGTGGGTGAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAACTGGATGGCGATGTGAACGGCCATAAAITTTTIGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGAGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAGAATTGATGGTGAGCAAGGGCGAGGAGGATAACATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTGCACATGGAGGGCTCCGTGAACGGCCACGAGTTCGAGATCGAGGGCGAGGGCGAGGGCCGCCCCTACGAGGGCACCCAGACCGCCAAGCTGAAGGTGACCAAGGGTGGCCCCCTGCCCTTCGCCTGGGACATCCTGTCCCCTCAGTTCATGTACGGCTCCAAGGCCTACGTGAAGCACCCCGCCGACATCCCCGACTACTTGAAGCTGTCCTTCCCCGAGGGCTTCAAGTGGGAGCGCGTGATGAACTTCGAGGACGGCGGCGTGGTGACCGTGACCCAGGACTCCTCCCTGCAGGACGGCGAGTTCATCTACAAGGTGAAGCTGCGCGGCACCAACTTCCCCTCCGACGGCCCCGTAATGCAGAAGAAGACCATGGGCTGGGAGGCCTCCTCCGAGCGGATGTACCCCGAGGACGGCGCCCTGAAGGGCGAGATCAAGCAGAGGCTGAAGCTGAAGGACGGCGGCCACTACGACGCTTGAGGTCAAGACCACCTACAAGGCCAAGAAGCCCGTGCAGCTGCCCGGCGCCTACAACGTCAACATCAAGTTGGACATCACCTCCCACAACGAGGACTACACCATCGTGGAACAGTACGAACGCGCCGAGGGCCGCCACTCCACCGGCGGCATGGACGAGCTGTACAAGtaataaCACatcgacaatcaacctctggattacaaaatttgtgaaagattgactggtattcttaactatgttgctccttttacgctatgtggatacgctgctttaatgcctttgtatcatgctattgcttcccgtatggctttcattttctcctccttgtataaatcctggttgctgtctctttatgaggagttgtggcccgttgtcaggcaacgtggcgtggtgtgcactgtgtttgctgacgcaacccccactggttggggcattgccaccacctgtcagctcctttccgggactttcgctttccccctccctattgccacggcggaactcatcgccgcctgccttgcccgctgctggacaggggctcggctgttgggcactgacaattccgtggtgttgtcggggaagctgacgtcctttccatggctgctcgcctgtgttgccacctggattctgcgcgggacgtccttctgctacgtcccttcggccctcaatccagcggaccttccttcccgcggcctgctgccggctctgcggcctcttccgcgtcttcgccttcgccctcagacgagtcggatctccctttgggcgcctccccgcctggaattcgagctcggtacctttaagaccaatgacttacaaggcagctgtagatcttagccactttttaaaagaaaaggggggactggaaggctaattcactcccaacgaagacaagatctgctttttgcttgtactgggtctctctggttagaccagatctgagcctgggagctctctggctaactagggaacccactgcttaagcctcaataaagcttgccttgagtgcttcaagtagtgtgtgcccgtctgttgtgtgactctggtaactagagatccctcagacccttttagtcagtgtggaaaatctctagcagtagtagttcatgtcatcttattattcagtatttataacttgcaaagaaatgaatatcagagagtgagaggaacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttatcatgtctggctctagctatcccgcccctaactccgcccatcccgcccctaactccgcccagttccgcccattctccgccccatggctgactaattttttttatttatgcagaggccgaggccgcctcggcctctgagctattccagaagtagtgaggaggcttttttggaggcctagggacgtacccaattcgccctatagtgagtcgtattacgcgcgctcactggccgtcgttttacaacgtcgtgactgggaaaaccctggcgttacccaacttaatcgccttgcagcacatccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgcccttcccaacagttgcgcagcctgaatggcgaatgggacgcgccctgtagcggcgcattaagcgcggcgggtgtggtggttacgcgcagcgtgaccgctacacttgccagcgccctagcgcccgctcctttcgctttcttcccttcctttctcgccacgttcgccggctttccccgtcaagctctaaatcgggggctccctttagggttccgatttagtgctttacggcacctcgaccccaaaaaacttgattagggtgatggttcacgtagtgggccatcgccctgatagacggtttttcgccctttgacgttggagtccacgttctttaatagtggactcttgttccaaactggaacaacactcaaccctatctcggtctattcttttgatttataagggattttgccgatttcggcctattggttaaaaaatgagctgatttaacaaaaatttaacgcgaattttaacaaaatattaacgcttacaatttaggtggcacttttcggggaaatgtgcgcggaacccctatttgtttatttttctaaatacattcaaatatgtatccgctcatgagacaataaccctgataaatgcttcaataatattgaaaaaggaagagtatgagtattcaacatttccgtgtcgcccttattccctttttgcggcattttgccttcctgtttttgctcacccagaaacgctggtgaaagtaaaagatgctgaagatcagttgggtgcacgagtgggttacatcgaactggatctcaacagcggtaagatccttgaggttttcgccccgaagaacgttttccaatgatgagcacttttaaagttctgctatgtggcgcggtattatcccgtattgacgccgggcaagagcaactcggtcgccgcatacactattctcagaatgacttggttgagtactcaccagtcacagaaaagcatcttacggatggcatgacagtaagagaattatgcagtgctgccataaccatgagtgataacactgcggccaacttacttctgacaacgatcggaggaccgaaggagctaaccgcttttttgcacaacatgggggatcatgtaactcgccttgatcgttgggaaccggagctgaatgaagccataccaaacgacgagcgtgacaccacgatgcctgtagcaatggcaacaacgttgcgcaaactattaactggcgaactacttactctagcttcccggcaacaattaatagactggatggaggcggataaagttgcaggaccacttctgcgctcggcccttccggctggctggtttattgctgataaatctggagccggtgagcgtgggtctcgcggtatcattgcagcactggggccagatggtaagccctcccgtatcgtagttatctacacgacggggagtcaggcaactatggatgaacgaaatagacagatcgctgagataggtgcctcactgattaagcattggtaactgtcagaccaagtttactcatatatactttagattgatttaaaacttcatttttaatttaaaaggatctaggtgaagatcctttttgataatctcatgaccaaaatcccttaacgtgagttttcgttccactgagcgtcagaccccgtagaaaagatcaaaggatcttcttgagatcctttttttctgcgcgtaatctgctgcttgcaaacaaaaaaaccaccgctaccagcggtggtttgtttgccggatcaagagctaccaactctttttccgaaggtaactggcttcagcagagcgcagataccaaatactgttcttctagtgtagccgtagttaggccaccacttcaagaactctgtagcaccgcctacatacctcgctctgctaatcctgttaccagtggctgctgccagtggcgataagtcgtgtcttaccgggttggactcaagacgatagttaccggataaggcgcagcggtcgggctgaacggggggttcgtgcacacagcccagcttggagcgaacgacctacaccgaactgagatacctacagcgtgagctatgagaaagcgccacgcttcccgaagggagaaaggcggacaggtatccggtaagcggcagggtcggaacaggagagcgcacgagggagcttccagggggaaacgcctggtatctttatagtcctgtcgggtttcgccacctctgacttgagcgtcgatttttgtgatgctcgtcaggggggcggagcctatggaaaaacgccagcaacgcggcctttttacggttcctggccttttgctggccttttgctcacatgttctttctgcgttatcccctgattctgtggataaccgtattaccgcctttgagtgagctgataccgctcgccgcagccgaacgaccgagcgcagcgagtcagtgagcgaggaagcggaagagcgcccaatacgcaaaccgcctctccccgcgcgttggccgattcattaatgcagctggcacgacaggtttcccgactggaaagcgggcagtgagcgcaacgcaattaatgtgagttagctcactcattaggcaccccaggctttacactttatgcttccggctcgtatgttgtgtggaattgtgagcggataacaatttcacacaggaaacagctatgaccatgattacgccaagcgcgcaattaaccctcactaaagggaacaaaagctggagctgcaagctt 104 ATGGACCCCGTGGTGGTGCTGGGCCTGTGCCTGAGCTGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCCACGGCGGCGGCAAG 105 MDPVVVLGLCLSCLLLLSLWQSHGGGK 106 ATGGGCTGCTGCTTCAGCAAGACC 107 MGCCFSKT 108 GAGGACAGGGCCGGCGCCGGCATCATCGAGACCCCC 109 GCCACCAGGGCCGGCTGCGCCCCCAGCTACAGGGTG 110 CTGGACAGGGCCGGCGGCTACATCTTCAGCAGCGAC 111 ATCACCACTCTTGGGAAATTTGGACAA 112 AGCTACTTCGCCAGCGAGCAGGGCGAGATCCAGTGGGTG 113 AGGGAGCTGAACGGCGGCGCCTACACCAGGTACGTG 114 TTCACCCTGAAGGGCGGCGCCCCCACCAAGGTGACC 115 ATCGCCCTGAAGGGCGGCAAGATCGTGAACAACTGG 116 ACCAGCGCCGTGCTGCAGAGCGGCTTCAGGAAGATG 117 AGCGGCGTGACCTTCCAGAGCGCCGTGAAGAGGACC 118 AAGGTGGCCACCGTGCAGAGCAAGATGAGCGACGTG 119 AGCGCCGTGAAGCTGCAGAACAACGAGCTGAGCCCC 120 GCCACCGTGAGGCTGCAGGCCGGCAACGCCACCGAG 121 AGGGAGCCCATGCTGCAGAGCGCCGACGCCCAGAGC 122 AAGGAGAGGAAGAGGAGGGGCGCCGACACCAGCATC 123 ACCAGGAGCGGCAAGAGGAGCTGGCCCCCCAGCGAG 124 GAGCCCGAGAAGCAGAGGAGCCCCCAGGACAACCAG 125 GGCCTGGTGAAGAGGAGGGGCGGCGGCACCGGCGAG 126 ITTLGKFGQ 127 EALFQGPK 128 SYFASEQGEIQWV 129 RAGAYIFS 130 RELNGGAYTRYV 131 FTLKGGAPTKVT 132 IALKGGKIVNNW 133 TSAVLQSGFRKM 134 KVATVQSKMSDV 135 SAVKLQNNELSP 136 ATVRLQAGNATE 137 REPMLQSADAQS 138 SGVTFQSAVKRT 139 SGVTFQGKFKK 140 YAKRGGVF 141 KERKRRGADTSI 142 TRSGKRSWPPSE 143 EPEKQRSPQDNQ 144 GLVKRRGGGTGE 145 ATGGACCTGCCTGACGACCACTACCTGTCCACCCAGACCATCCTGTCCAAGGACCTGAACTCCGGACTCAGATCTGGCAGCGGTCTCGAGATGGAAGTTAGCGCTCTGGAAAAAGAAGTGTCTGCACTCGAGAAAGAAGTAAGTGCCCTTGAGAAGGAGGTGTCCGCACTCGAGAAGGAGGTCAGCGCCCTGGAAAAGGAAAAGCGAGACCATATGGTTTTGCTTGAGTATGTTACAGCGGCTGGCATTACCGATGCATCAAAGGTGTCCGCCCTGAAGGAAAAAGTAAGCGCACTGAAAGAAAAGGTGAGCGCGCTGAAGGAGAAAGTGAGCGCCCTGAAAGAGAAAGTCTCTGCCCTTAAGGAGGATATCGAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCAAAGCTTGCCACCATGCGCAAAGGCGAAGAACTGTTTACCGGCATTGTGCCGATTCTGGTGGAAC TGGATGGCGATGTGAACGGCCATAAATTTTTTIGTGCGCGGCGAAGGCGAAGGCGATGCGACCATTGGCAAACTGAGCCTGAAATTTATTTGCACCACCGGCAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTATGGCGTGCAGTGCTTTAGCCGCTATCCGGATCATATGAAACGCCATGATTTTTTTAAAAGCGCGATGCCGGAAGGCTATGTGCAGGAACGCACCATTTATTTTAAAGATGATGGCACCTATAAAACCCGCGCGGAAGTGAAATTTGAAGGCGATACCCTGGTGAACCGCATTGAACTGAAAGGCATTGATTTTAAAGAAGATGGCAACATTCTGGGCCATAAACTGGAATATAACTTTAACAGCCATAAAGTGTATATTACCGCGGATAAACAGAACAACGGCATTAAAGCGAACTTTACCATTCGCCATAACGTGGAAGATGGCAGCGTGCAGCTGGCGGATCATTATCAGCAGAACACCCCGATTGGCGATGGCCCGGTTCTTCTTCCTGGCGGCCGCTCTAGA 146 TGAGCAAGGGCGAGGAGGATAACATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTGCACATGGAGGGCTCCGTGAACGGCCACGAGTTCGAGATCGAGGGCGAGGGCGAGGGCCGCCCCTACGAGGGCACCCAGACCGCCAAGCTGAAGGTGACCAAGGGTGGCCCCCTGCCCTTCGCCTGGGACATCCTGTCCCCTCAGTTCATGTACGGCTCCAAGGCCTACGTGAAGCACCCCGCCGACATCCCCGACTACTTGAAGCTGTCCTTCCCCGAGGGCTTCAAGTGGGAGCGCGTGATGAACTTCGAGGACGGCGGCGTGGTGACCGTGACCCAGGACTCCTCCCTGCAGGACGGCGAGTTCATCTACAAGGTGAAGCTGCGCGGCACCAACTTCCCCTCCGACGGCCCCGTAATGCAGAAGAAGACCATGGGCTGGGAGGCCTCCTCCGAGCGGATGTACCCCGAGGACGGCGCCCTGAAGGGCGAGATCAAGCAGAGGCTGAAGCTGAAGGACGGCGGCCACTACGACGCTGAGGTCAAGACCACCTACAAGGCCAAGAAGCCCGTGCAGCTGCCCGGCGCCTACAACGTCAACATCAAGTTGGACATCACCTCCCACAACGAGGACTACACCATCGTGGAACAGTACGAACGCGCCGAGGGCCGCCACTCCACCGGCGGCATGGACGAGCTGTACAAGtaa 147 GAGGGCAGAGGAAGTCTGCTAACATGCGGTGACGTCGAGGAGAATCCTGGCCCA