SYSTEM FOR PRODUCTION OF ANTIBODIES AND THEIR DERIVATIVES

20170314013 · 2017-11-02

    Abstract

    The present disclosure provides methods and compositions for the production of chimeric antibodies that specifically bind an antigen of interest.

    Claims

    1.-15. (canceled)

    16. A method for detecting an antigen of interest in a sample, comprising the steps of (a) contacting the sample with an antibody that specifically binds the antigen under conditions that promote the formation of an antibody-antigen complex, (b) contacting the antibody-antigen complex with a fusion protein comprising (i) the immunoglobulin-binding domains of staphylococcal protein A and streptococcal protein G, and (ii) Metridia longa luciferase or a derivative lacking the N-terminal region, under conditions that promote binding of the fusion protein to the antibody-antigen complex, and (c) detecting the Metridia longa luciferase.

    17. The method of claim 16, wherein the fusion protein is encoded by a vector selected from the group consisting of pS14L-spAG-MLuc16, pETspAG-N-MLuc1, and pS14L-spAG-N-MLuc15.

    18. The method of claim 17, wherein the fusion protein is encoded by pS14L-spAG-MLuc16 or pETspAG-N-MLuc1.

    19. The method of claim 17, wherein the fusion protein is encoded by pS14L-spAG-N-MLuc15.

    20. An IgG fusion protein comprising IgG heavy chains fused with a peptide or polypeptide selected from the group consisting of green fluorescent protein (GFP), Metridia longa luciferase, cellulose binding domain, 6 histidine, or a biotinylatable peptide.

    Description

    BRIEF DESCRIPTION OF THE FIGURES

    [0022] FIG. 1 shows the structure of the pVLentry-Hyg10 and pVHentry-Cm5 vectors. Plac and Pamp: bacterial promoters; PCMV ie: the immediate early promoter of CMV; IRES: internal ribosome entry site; SV40 poly A and HSV TK polyA: transcription terminators; f1 ori and pUC ori: phage and plasmid origins of replication; 10b, IGHG1, and lacZ: sequences encoding phage T7 protein 10b, constant part of human IgG, and α-peptide of β-galactosidase, respectively; Ap(R), CM(R), Km(R), and Hygromycin-delEsp: sequences encoding resistance to antibiotics ampicillin, chloramphenicol, G418, and Hygromycin B (this sequence was modified to remove the Esp3I site), respectively. Underlined are sequences of cohesive ends generated by Esp3I.

    [0023] FIG. 2 shows the assembly of IgG-encoding sequences using cohesive ends generated by DNA polymerase T4. DNApolT4 (dCTP) designates treatment with DNA polymerase T4 in a mixture containing only dCTP. Esp3I and ligase: two additional types of treatment, with endonuclease Esp3I and DNA ligase, respectively, that are required for assembly of IgG-encoding sequences. IG-V, IGHG1, and 10b: sequences encoding variable and constant parts of the IgG chain and protein 10b, respectively.

    [0024] FIG. 3 shows the interaction of gfpBoNT/A-CH5 with its receptors on the surface of the neuroblastoma cell. gfpBoNT/A-CH5 was added to SH-SY5Y cells and, after 15 minutes, the cells were subjected to microscopy.

    [0025] FIG. 4 shows the effect of antibiotic resistance selection on production of human IgG by CHO cells. Dilutions of media from the original IgG-producing culture and its derivative selected at higher concentrations of antibiotics were loaded into wells of a 96-well plate coated with BoNT/A-CH. Immobilized IgGs were visualized by treatment of wells with biotinylated anti-human antibodies followed by treatment with streptavidin-horseradish peroxidase and 1-STEP Slow TMB-ELISA (Pierce, Inc.).

    [0026] FIG. 5 shows the composition of proteins purified from cell culture media. Proteins were separated by SDS-PAGE and were either stained by Coomassie (right portion) or transferred onto a nitrocellulose membrane and treated with biotinylated anti-human IgG. Bound antibodies were visualized by treatment with streptavidin-horseradish peroxidase conjugate and 1-STEP Slow TMB-ELISA (Pierce, Inc.) and 1-STEP Ultra TMB (Pierce, Inc.). Lane 1 contains pre-stained molecular weight markers from Fermentas, Inc.; lane 2: protein purified from media of cells generated by transfection with a plasmid encoding both chains of IgG; lane 3: protein from cells transfected with a plasmid encoding human IgG whose heavy chain is fused with GFP; lane 4: protein from cells transfected with a plasmid encoding human IgG whose heavy chain is fused with MLuc.

    [0027] FIG. 6 shows the interaction of purified human IgGs with the receptor-recognizing domain of BoNT/A. Dilutions of IgGs purified from media of isolated cell cultures were loaded into wells of a 96-well plate coated with BoNT/A-CH5. Immobilized IgGs were visualized by treatment of wells with biotinylated anti-human antibodies followed by treatment with streptavidin-horseradish peroxidase and Metal Enhanced DAB Substrate Kit (Pierce, Inc.). The control line corresponds to the highest OD.sub.450 of wells that were treated the same way as the others but did not contain BoNT/A-CH5.

    DETAILED DESCRIPTION

    [0028] The present disclosure provides methods and compositions for robust generation of human monoclonal antibodies targeted at pathogens of interest.

    [0029] In addition to the set of products that address existing needs, this technology advances our understanding of structure-function relationships in the neurotoxin molecule and provides information about mechanisms of inactivation of this molecule by antibodies.

    [0030] In practicing the present disclosure, many conventional techniques in cell biology, molecular biology, protein biochemistry, immunology, and bacteriology are used. These techniques are well-known in the art and are provided in any number of available publications, including Current Protocols in Molecular Biology, Vols. I-III, Ausubel, Ed. (1997); Sambrook et al., Molecular Cloning: A Laboratory Manual, Second Ed. (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989).

    [0031] Certain terms used herein are defined below. Unless defined otherwise, all technical and scientific terms used herein have the same general meaning as commonly understood by one skilled in the art.

    [0032] Unless defined otherwise, all technical and scientific terms used herein generally have the same meaning as commonly understood by one of ordinary skill in the art to which this technology belongs. As used in this specification and the appended claims, the singular forms "a," "an," and "the" include plural referents unless the content clearly dictates otherwise. For example, reference to "a cell" includes a combination of two or more cells, and the like. Generally, the nomenclature used herein and the laboratory procedures in cell culture, molecular genetics, organic chemistry, analytical chemistry and nucleic acid chemistry and hybridization described below are those well-known and commonly employed in the art. All references cited herein are incorporated by reference in their entirety for all purposes to the same extent as if each individual publication, patent, or patent application were specifically and individually incorporated by reference in its entirety for all purposes.

    [0033] As used herein, "about" will be understood by persons of ordinary skill in the art and will vary to some extent depending upon the context in which it is used. If there are uses of the term which are not clear to persons of ordinary skill in the art, given the context in which it is used, "about" will mean up to plus or minus 10% of the particular term.

    [0034] As used herein, administration of a composition to a subject includes any route of delivering the compound to the subject to perform its intended function. Administration can be carried out by any suitable route including oral, intranasal, parenteral (intravenous, intramuscular, intraperitoneal, or subcutaneous), or topical. Administration includes self-administration and administration by another.

    [0035] As used herein, the terms "antigen" and "antigenic" refer to molecules with the capacity to be recognized by an antibody or otherwise act as a member of an antibody-ligand pair. "Specific binding" refers to the interaction of an antigen with the variable regions of immunoglobulin heavy and light chains. Antibody-antigen binding may occur in vivo or in vitro. The skilled artisan will understand that macromolecules, including proteins, nucleic acids, fatty acids, lipids, lipopolysaccharides and polysaccharides, have the potential to act as an antigen. The skilled artisan will further understand that nucleic acids encoding a protein with the potential to act as an antibody ligand necessarily encode an antigen. The artisan will further understand that antigens are not limited to full-length proteins, but can also include partial amino acid sequences. Moreover, sequences from different sources may be combined to generate mosaic antigens, depending on the specific intended use. In some embodiments, the mosaic antigen will include epitopes derived from different proteins. In some embodiments, the mosaic antigen will include epitopes derived from the same protein. The term "antigenic" is an adjectival reference to molecules having the properties of an antigen. In some embodiments, the antigen of interest is a bacterial toxin. In some embodiments, the antigen of interest is a botulinum neurotoxin.

    [0036] As used herein, the term epitope refers to that portion of a molecule that forms a site specifically recognized by an antibody or immune cell. A protein epitope may comprise amino acid residues directly involved in antibody binding, as well as residues not directly involved in binding that are nonetheless included in the antibody-epitope footprint and excluded from the solvent surface. Epitopes may derive from a variety of physical characteristics of a protein, including primary, secondary, and tertiary amino acid structure, and amino acid/protein charge. Epitopes present within a molecule are referred to as real epitopes. Real epitopes encompass wild-type sequences and variants of wild-type sequences. Real epitopes may exist within a wild-type protein, a naturally occurring variant of a wild-type protein, or an engineered variant of a wild-type protein. The term mimetic epitope refers to a molecule whose primary structure is unrelated to the primary structure of a given real epitope that nonetheless specifically binds to antibodies that recognize the real epitope. Epitopes may be isolated, purified, or otherwise prepared by those skilled in the art. They may be obtained from natural sources including cells and tissues, or they may be isolated from host cells expressing a recombinant form of the epitope.

    [0037] As used herein, effective amount refers to a quantity sufficient to achieve a desired effect. In the context of therapeutic or prophylactic applications, the effective amount will depend on the type and severity of the condition at issue and on the characteristics of the individual subject, such as general health, age, sex, body weight, and tolerance to pharmaceutical compositions. In the context of an antigenic composition, in some embodiments, an effective amount is an amount sufficient to result in a protective response against a pathogen. In other embodiments, an effective amount of an antigenic composition is an amount sufficient to result in antibody generation against the antigen. With respect to antigenic compositions, in some embodiments, an effective amount will depend on the intended use, the degree of immunogenicity of a particular antigenic compound, and the health/responsiveness of the subject's immune system, in addition to the factors described above. The skilled artisan will be able to determine appropriate amounts depending on these and other factors. In the case of a biochemical application, in some embodiments, an effective amount will depend on the size and nature of the sample in question. It will also depend on the nature and sensitivity of the methods in use. The skilled artisan will be able to determine the effective amount based on these and other considerations.

    [0038] As used herein, the term "polymer resin" refers to resins such as, but not limited to, polysaccharide polymers such as agarose, cellulose, and Sepharose. The skilled artisan will understand that proteins may be covalently attached to the resin using methods well known in the art, including but not limited to cyanogen bromide activation, reductive amination of aldehydes, and the addition of iodoacetyl functional groups. The skilled artisan will further understand that functional equivalents of polysaccharide polymers may also be used to immobilize proteins.

    [0039] As used herein, the term "BoNT" refers to any of the seven serologically distinct botulinum neurotoxins produced by Clostridium botulinum, Clostridium argentinense, and Clostridium baratii. Individual serotypes are referred to as BoNT/A, BoNT/B, BoNT/C, BoNT/D, BoNT/E, BoNT/F, and BoNT/G. Exemplary, non-limiting nucleic acid sequences of BoNT/A, /B, /C, /D, /E, /F, and /G are found in GenBank Accession numbers DQ409059, FM865705, AB200364, NZ_ACSJ01000015, AM695754, X81714, and X74162, respectively. Exemplary, non-limiting amino acid sequences of BoNT/A, /B, /C, /D, /E, /F, and /G are found in GenBank Accession numbers ABD65472, CAR97779, BAD90572, ZP_04863672, CAM91137, CAA57358, and CAA52275, respectively. Exemplary, non-limiting nucleic and amino acid sequences of C. tetani tetanus toxin are found in GenBank Accession numbers AF154828 and AAF73267, respectively. As used herein, the term "BoNT/A-L" refers to the full-length botulinum neurotoxin A light chain. As used herein, the term "BoNT/B-L" refers to the full-length botulinum neurotoxin B light chain.

    [0040] As used herein, the term anti-BoNT antibody refers to an antibody capable of specifically binding to BoNT. As used herein, an antibody includes a polyclonal antibody, a monoclonal antibody, and also refers to functional fragments (e.g., fragments which bind an antigen/epitope), such as Fv, Fab, Fc and CDRs.

    [0041] As used herein, the terms "immunogen" and "immunogenic" refer to molecules with the capacity to elicit an immune response. The response may involve antibody production or the activation of immune cells. The response may occur in vivo or in vitro. The skilled artisan will understand that a variety of macromolecules, including proteins, have the potential to be immunogenic. The skilled artisan will further understand that nucleic acids encoding a molecule capable of eliciting an immune response necessarily encode an immunogen. The artisan will further understand that immunogens are not limited to full-length molecules, but may include partial amino acid sequences (e.g., epitopes). Moreover, sequences from different sources may be combined to generate mosaic immunogens, depending on the specific intended use.

    [0042] As used herein, the terms isolate and purify refer to processes of obtaining a biological substance that is substantially free of material and/or contaminants normally found in its natural environment (e.g., from the cells or tissues from which a protein is derived, or substantially free from chemical precursors or other chemicals when chemically synthesized).

    [0043] As used herein, the terms "polypeptide," "peptide," and "protein" are used interchangeably to mean a polymer comprising two or more amino acids joined to each other by peptide bonds or modified peptide bonds (i.e., peptide isosteres). Polypeptides may include amino acids other than the naturally-occurring amino acids, as well as amino acid analogs and mimetics prepared by techniques that are well known in the art. The skilled artisan will understand that polypeptides, peptides, and proteins may be obtained in a variety of ways, including isolation from cells and tissues expressing the protein endogenously, isolation from cells or tissues expressing a recombinant form of the molecule, or chemical synthesis.

    [0044] As used herein, the term subject refers to a member of any vertebrate species. In some embodiments, the subject is avian and includes domestic (e.g., chicken, turkey) and wild bird species. In some embodiments, subjects include mammals such as humans, as well as those mammals of importance due to being endangered, of economic importance (animals raised on farms for consumption by humans) and/or social importance (animals kept as pets or in zoos) to humans. In particular embodiments, the subject is a human. In other embodiments, the subject is not human.

    [0045] As used herein, the term "pathogen" refers to any entity that causes disease, including, for example, but not limited to, mycoplasma, fungi, bacteria, viruses, viroids, virus-like organisms, protozoa, nematodes, toxins, and prions. In some embodiments, the pathogen is a Clostridium. In some embodiments, the pathogen is Clostridium botulinum.

    [0046] As used herein, the terms "chimera" and "chimeric" refer to biological molecules comprising materials derived from two or more organisms of the same or different species. For example, the terms "chimeric antibody" and "chimeric IgG" refer to antibodies comprising amino acid sequences derived from two or more organisms of the same or different species. In some embodiments, the organisms are both of the same species. In some embodiments, the organisms are both human. In some embodiments, the organisms are from different species. In some embodiments, the terms refer to nucleic acid sequences encoding chimeric polypeptide sequences.

    [0047] The present disclosure provides methods and compositions for high-throughput production of chimeric antibodies that specifically bind to an antigen of interest. The methods combine three procedures into one streamlined process: 1) isolation of lymphocytes producing antibodies of interest from the blood of immunized individuals, 2) amplification of sequences encoding variable domains of light and heavy chains of immunoglobulin from individual isolated cells, and 3) assembly of amplified sequences into specially designed vectors and construction of cells encoding human/human chimeras targeted at antigens of interest. The uniqueness of this process is its ability to generate multiple (up to 100) immunoglobulin-producing clones within a very short time (one to two months). Each such clone encodes an IgG whose variable domains of light and heavy chains originate from the same lymphocyte.

    [0048] Since the required antibody-producing blood cells could come from a patient recovered from the infection, this system does not depend on the availability of a developed vaccine. Consequently, this system could be used to develop protective entities against rare and even new natural and engineered pathogens at very early signs of appearance. Additionally, the system does not involve use of viruses and, consequently, is safe to use.

    [0049] The methods allow for rapid generation of IgGs whose heavy chains carry additional polypeptides at the C-termini. This grants the opportunity to produce derivatives of antibodies that can be used to monitor corresponding antigens (IgGs fused with reporter molecules) or to immobilize those pathogens (IgGs fused with polypeptides like Cellulose Binding Domain). Among other fusions, the system allows creation of fusions with Metridia longa luciferase, which allows fast and inexpensive examination of conditions to identify those for optimal production of antibodies. Also, the methods allow for the use of fluorescence activated cell sorting (FACS) for fast selection of clones producing increased levels of IgGs.

    [0050] The present disclosure provides methods and compositions for robust development of human antibodies targeted at specific antigens of interest. The chosen approach required the ability to 1) isolate individual human lymphocytes specific to the chosen antigen, 2) isolate immunoglobulin-encoding sequences from a single selected cell, and 3) assemble immunoglobulin-encoding constructs that can be introduced into chosen cell cultures for production of corresponding antibodies. Prior to this work, it was unknown whether the dynamics of antibody secretion and the limited number of antigen-specific lymphocytes in the peripheral blood would permit efficient separation of these specific cells from all others. It was unclear whether protocols for rtPCR at the single cell level would be robust enough to allow their application in a high throughput format. Finally, described procedures for assembling expression vectors carrying IgG-encoding sequences were suitable for manipulation with just a very small number of IgG-encoding sequences at a time. By contrast, suitable methods for high throughput production must be capable of simultaneous handling of tens and even hundreds of different sequences.

    [0051] In some embodiments, the compositions comprise expression vectors encoding constant regions of either light or heavy chains of human IgG. In some embodiments, the compositions comprise an expression vector encoding the constant regions of both the IgG heavy and light chains.

    [0052] In some embodiments, the methods comprise isolating sequences encoding variable domains of light and heavy chains of IgG from single cells and assembly of Ig-encoding vectors.

    [0053] In some embodiments, the methods comprise introducing designed IgG-encoding constructs into mammalian cells and evaluation of conditions for efficient IgG production. In some embodiments, the methods comprise producing and characterizing chimeric IgGs. In some embodiments, the chimeric IgGs are specific for botulinum neurotoxin serotype A (BoNT/A).

    [0054] Embodiments described herein are set forth in the following non-limiting examples.

    EXAMPLES

    Example 1

    Development of Expression Vectors

    [0055] This Example demonstrates the construction of expression vectors for the cloning and production of chimeric IgG antibodies that specifically bind an antigen of interest.

    [0056] In order to create a system for generation of human antibodies that is capable of working in a high throughput format, vectors were necessary that would allow 1) a 100%-certain assembly of sequences encoding light and heavy chains of immunoglobulins, 2) simple assembly of such sequences into one plasmid, and 3) robust selection of cells carrying such plasmids and expressing both chains of immunoglobulins. Plasmids pVLentry-Hyg10 and pVHentry-Cm5, designed for assembly of expression-competent sequences for light and heavy chains of IgG, respectively, meet all of these requirements (FIG. 1). Specifically, both of these plasmids possess two recognition sites for restriction endonuclease Esp3I per plasmid, and these sites flank the sequence encoding protein 10b of bacteriophage T7. These two features ensure that practically 100% of colonies growing after cloning experiments utilizing vectors pVLentry-Hyg10 and pVHentry-Cm5 carry inserts of interest in a pre-determined orientation.

    [0057] Restriction endonuclease Esp3I cuts DNA outside of its recognition sequence and generates four-nucleotide-long cohesive 5′-overhanging ends. As depicted in FIG. 1, each Esp3I cleavage site in plasmids pVLentry-Hyg10 and pVHentry-Cm5 is unique. Therefore, fragments generated as a result of treatment of these plasmids with Esp3I and removal of the protein 10b-encoding sequence are not able to form a viable circular DNA unless the reaction is supplemented with a DNA fragment carrying appropriate sticky ends. As demonstrated in FIG. 2, the insertion of such a DNA fragment will occur in only one orientation, thus eliminating the need for subsequent analysis of recombinant clones. The sequence encoding protein 10b of bacteriophage T7 functions as a safeguard, preventing re-assembly of the original vector.
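    The cutting behaviour described above can be illustrated with a short Python sketch. It assumes the commonly documented Esp3I pattern CGTCTC(N)1/(N)5, that is, a cut one nucleotide past the site on one strand and five nucleotides past it on the other, which leaves a four-nucleotide 5′ overhang. The sequence TOY_VECTOR and the function name esp3i_overhangs are hypothetical and are not taken from pVLentry-Hyg10 or pVHentry-Cm5.

```python
import re

SITE = "CGTCTC"      # Esp3I recognition sequence on the top strand
SITE_RC = "GAGACG"   # the same site when it lies on the bottom strand

def esp3i_overhangs(seq):
    """Return (site_position, strand, 4-nt overhang) for each Esp3I site.

    Overhangs are reported in top-strand coordinates, assuming the
    CGTCTC(N)1/(N)5 cut pattern (an assumption stated in the lead-in).
    """
    seq = seq.upper()
    hits = []
    for m in re.finditer(f"(?={SITE})", seq):
        start = m.start() + len(SITE) + 1          # skip the site and one N
        if start + 4 <= len(seq):
            hits.append((m.start(), "+", seq[start:start + 4]))
    for m in re.finditer(f"(?={SITE_RC})", seq):
        end = m.start() - 1                        # one N precedes the site
        if end - 4 >= 0:
            hits.append((m.start(), "-", seq[end - 4:end]))
    return sorted(hits)

# Hypothetical toy vector: two inward-facing sites with different overhangs.
TOY_VECTOR = "AAACGTCTCAGGCA" + "TTTTTTTT" + "TAATTGAGACGAAA"

for position, strand, overhang in esp3i_overhangs(TOY_VECTOR):
    print(position, strand, overhang)
```

    Because the two overhangs in the toy sequence differ and neither is palindromic, a fragment carrying the matching ends could ligate in only one orientation, which is the property the paragraph above relies on.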

    [0058] In our vectors, its expression is controlled by the lactose promoter. Expression of this sequence is lethal to F plasmid-containing E. coli (17). Therefore, while our vectors are maintained in F-negative cells, cloning experiments require strains carrying F factor and, after transformation, cells are grown in the presence of IPTG and the corresponding antibiotic (ampicillin in the case of plasmid pVLentry-Hyg10 and chloramphenicol in the case of plasmid pVHentry-Cm5). Under these conditions, only cells carrying plasmids in which the protein 10b-encoding fragment has been substituted with a new insert survive.

    [0059] Another important element of our vectors is a strong promoter that can direct transcription of the inserted sequence in mammalian cells. In vectors pVLentry-Hyg10 and pVHentry-Cm5, this role is served by the sequence from cytomegalovirus (CMV). However, we also designed plasmids in which a sequence from Rous sarcoma virus is used for this purpose. Plasmids pVLentry-Hyg10 and pVHentry-Cm5 are designed in such a way that transcripts initiated from the CMV promoter incorporate not only a sequence lying immediately downstream of the promoter, but also an Internal Ribosome Entry Site (IRES) and a sequence for antibiotic resistance. In the case of plasmid pVLentry-Hyg10, this is resistance to Hygromycin B and, in the case of plasmid pVHentry-Cm5, this sequence confers resistance to G418. The presence of the IRES makes synthesis of the antibiotic-inactivating protein proportional to synthesis of the protein encoded by the preceding portion of the transcript (the immunoglobulin chain in the derivatives of these plasmids). This feature is not absolutely necessary for selection of stable transfectants (in some of our plasmids it is not present); however, it makes further maintenance of selected clones easier and opens opportunities for their further improvement.

    [0060] In addition, the design of our vectors allows simple combination of sequences encoding light and heavy chains of IgG in the same plasmid, which, in turn, ensures that equal amounts of IgG chain-encoding sequences are introduced into the cell during transfection. I-SceI recognition sites are one of the elements enabling such combination.

    [0061] I-SceI is a site-specific homing endonuclease that recognizes an 18 nucleotide-long sequence and generates DNAs with cohesive ends that can be used for cloning. Due to the length of the target sequence, its occurrence in the sequence encoding a variable domain of Ig is practically impossible. Therefore, using this enzyme enabled transfer of entire IgG-encoding sequences from one plasmid into another without destroying the integrity of these sequences. Nonsymmetrical cohesive ends generated by I-SceI ensure that, in all generated plasmids, the relative orientation of IgG-encoding sequences is the same. This feature allows further improvement of the reproducibility of IgG production experiments. As shown in FIG. 1, plasmids pVLentry-Hyg10 and pVHentry-Cm5 possess two I-SceI sites each. However, in plasmid pVLentry-Hyg10, I-SceI sites flank the Ig-encoding cassette, while in plasmid pVHentry-Cm5, both I-SceI sites are located on one side of the Ig-encoding cassette and flank the gene of the alpha peptide of beta-galactosidase (lacZ).
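    The claim that an 18-nucleotide site is practically never found in a variable-domain sequence can be checked with rough arithmetic. The sketch below assumes uniformly random bases, strict recognition of all 18 positions, and a variable-domain coding sequence of about 400 bp; all three are simplifying assumptions made here for illustration, and the function name is hypothetical.

```python
def expected_random_hits(site_len, target_len, both_strands=True):
    """Expected number of chance occurrences of one fixed site in random DNA."""
    positions = max(target_len - site_len + 1, 0)   # possible start positions
    strands = 2 if both_strands else 1               # non-palindromic site
    return strands * positions / 4 ** site_len

# 18-nt I-SceI site versus a 6-nt site such as that of Esp3I, in ~400 bp.
print(f"18-nt site: {expected_random_hits(18, 400):.2e}")
print(f" 6-nt site: {expected_random_hits(6, 400):.2f}")
```

    Under these assumptions the expected number of chance I-SceI sites per variable domain is on the order of one in a hundred million, whereas a six-nucleotide site would be expected in roughly one of every five such sequences. This is consistent with the choice of I-SceI for moving whole IgG-encoding cassettes and with the avoidance of restriction digestion of the amplified variable regions themselves.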

    [0062] In addition to differences in the location of I-SceI sites, the two plasmids possess different antibiotic-resistance markers. Both of these plasmids use the same origin of replication for propagation in E. coli cells and therefore are not able to coexist in the same cell. All of these features allow us to speed up the process of assembly and identification of the plasmid carrying both L- and H-chain encoding sequences. Indeed, a simple treatment of the mixture of L- and H-chain encoding plasmids with I-SceI and ligase generates the required hybrid plasmid. Like one of its parents, this plasmid inherits the chloramphenicol-resistance gene; unlike this parent, it is not able to produce the alpha-peptide of beta-galactosidase. As a result, only cells carrying the required plasmid, and not the three others present in the mixture, are able to form white colonies on media supplemented with chloramphenicol, X-Gal, and isopropyl-β-D-thiogalactopyranoside (IPTG).
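    The selection logic in this paragraph can be written out as a small truth table. The Python sketch below is a simplified model, not a description of the actual constructs: the plasmid labels are hypothetical, and the identity of the fourth plasmid (the L-chain backbone carrying the lacZ alpha fragment) is an assumption inferred from the placement of the I-SceI sites described above.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Plasmid:
    name: str
    bacterial_resistance: str   # "Ap" (ampicillin) or "Cm" (chloramphenicol)
    lacz_alpha: bool            # carries the alpha-peptide of beta-galactosidase

def colony_on_cm_xgal_iptg(p):
    """Colony phenotype on chloramphenicol + X-Gal + IPTG, or None if no growth."""
    if p.bacterial_resistance != "Cm":
        return None
    return "blue" if p.lacz_alpha else "white"

# Hypothetical labels for the four plasmids present after I-SceI + ligase.
MIXTURE = [
    Plasmid("L-chain parent", "Ap", False),
    Plasmid("H-chain parent", "Cm", True),
    Plasmid("hybrid (L + H chains)", "Cm", False),
    Plasmid("L backbone + lacZ alpha (assumed by-product)", "Ap", True),
]

for plasmid in MIXTURE:
    print(f"{plasmid.name}: {colony_on_cm_xgal_iptg(plasmid)}")
```

    In this model only the hybrid plasmid yields white colonies, which matches the statement that the required plasmid can be identified directly on the selection plate.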

    [0063] Also disclosed are four derivatives of plasmid pVHentry-Cm5. These derivatives have all elements described above. However, instead of the sequence encoding the constant part of IgG heavy chain alone, all these plasmids contain sequences that encode fusions of the same part of IgG heavy chain with different polypeptides. One of them encodes a fusion with green fluorescent protein (GFP); the second, a fusion with luciferase from Metridia longa (MLuc) (18, 19); the third, a fusion with a His-tag and a peptide that can be biotinylated by biotin ligase; and the fourth, a fusion with a polypeptide that specifically binds cellulose (20, 21).

    Example 2

    Isolation of Sequences Encoding Variable Domains of Light and Heavy Chains of IgG

    [0064] A single individual, who had been vaccinated with pentavalent botulinum toxoid vaccine six years prior, received several boosts and served as a donor of blood cells. These cells were subjected to fractionation on a Ficoll gradient, enrichment on BD IMag Anti-human CD19 Particles-DM, and, finally, cell sorting. As a marker for cells producing anti-BoNT/A, we used a fusion between Green Fluorescent Protein and the receptor-recognizing domain of BoNT/A (gfpBoNT/A-CH5). This protein was constructed in our lab and, prior to use in cell sorting experiments, was tested for the ability to recognize specific receptors present in neuroblastoma cells (FIG. 3).

    [0065] Cells simultaneously binding APC-Mouse-anti-human CD19 and gfpBoNT/A-CH5 were sorted into wells of a 96-well plate, one cell per well.

    [0066] Isolated cells were used as a source of sequences encoding V.sub.H- and V.sub.L-regions. We have developed a procedure for rtPCR of these sequences that includes three steps: 1) reverse transcription of mRNA released from the cell by perfringolysin O, 2) simultaneous amplification of cDNAs encoding V.sub.H- and V.sub.L-regions in the same tube by PCR and 3) re-amplification of sequences encoding each region in its own tube. Each step has its own set of primers. The whole procedure takes less than 8 hours. The number of cells that can be processed during this time is mostly limited by the capacity of the available thermo-cycler. Primers were designed based on the analysis of available human Ig-encoding sequences known in the art (8, 22). Primers used during each step are summarized in Table 1. Primers used in the re-amplification step were designed to introduce unique sequences, which can be converted into four-nucleotide-long cohesive ends compatible with ends generated by Esp3I restriction endonuclease in the corresponding vectors (see previous section), into the ends of amplified fragments. The conversion occurs as a result of treatment of purified DNA fragments by DNA polymerase T4 in the presence of dCTP as demonstrated in FIG. 2. The lack of restriction endonucleases at this stage guarantees that none of the sequences is lost due to the presence of sites for corresponding restriction endonucleases in some of them.
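    The conversion of primer-encoded ends into cohesive ends can be sketched in Python. The sketch assumes the usual rationale for this kind of treatment: with dCTP as the only nucleotide present, the 3′→5′ exonuclease activity of DNA polymerase T4 removes bases from each 3′ end until it reaches the first C, where re-incorporation of dCTP balances removal. The source shows the treatment only schematically (FIG. 2), so this mechanism and the function name are assumptions; the primer sequences are copied from Table 1.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def overhang_after_t4_dctp(primer_5to3):
    """Return the 5' overhang left at the primer-derived end of a PCR product.

    The opposite strand, read from its 3' end, is the base-by-base complement
    of the primer read from its 5' end. It is trimmed until the first C.
    """
    top = primer_5to3.upper()
    bottom_from_3_end = top.translate(COMPLEMENT)
    removed = 0
    for base in bottom_from_3_end:
        if base == "C":          # dCTP is available, so trimming stalls here
            break
        removed += 1
    return top[:removed]

# Re-amplification primers from Table 1.
PRIMERS = {
    "Vk-3-5T7": "TTTAGGCATGGAAACCCCAGCGCAGCTTCT",
    "hIgGk-3": "TAATGGCCTAACACTCTCCCCTGTTGAAGCTCTT",
    "Vh-3-5T7": "TATAGccatggagtttgggctgagc",
    "IgG-CH": "TATTGGCGAGCTGGCCTCTCACCAACTGTCTTGTCCACCTTGGTGTTG",
}

for name, sequence in PRIMERS.items():
    print(name, overhang_after_t4_dctp(sequence))
```

    Under this model each primer tail yields a distinct four-nucleotide 5′ overhang, and the two ends of a given fragment differ, which agrees with the four-nucleotide cohesive ends and single-orientation insertion described in the text.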

    TABLE-US-00001 TABLE 1 Primers used for amplification of sequences encoding variable domains of human immunoglobulins.

    Primers used for reverse transcription
    IgG-CHH     GGGGAAGAGGAAGACTGACGGTC
    Cm1         CAGTACTGCGATGAGTGGCA
    Clv-3       TGTGGCCTTGTTGGCTTG
    Oligo dT

    Primers used at the PCR amplification stage
    pVk-1       GAGTCAGDYYCDRYCAGGACACAGCATG
    pVk-2       AGACCCTGTCAGGACACAGCATAGACATG
    pVk-3       GGACTCCTCAGTTCACCTTCTCACAATG
    pVk-4       TGCTCAGTTAGGACCCAGAGGAACCATG
    hIgGk-3     TAATGGCCTAACACTCTCCCCTGTTGAAGCTCTT
    IgGH-1      TGAGVDMMGYWCHTCACCATGGACTG
    IgGH-2      ACTGAACACAGAGGACTCACCATGGA
    IgGH-3      CAGTGACTCCTGTGCCCCACCATGGACA
    IgGH-4      TTTCTGTCCTCCACCATCATGGGGTC
    IgGH-5      GCACTGAACACAGACCACCAATCATGG
    IgG-CHH     GGGGAAGAGGAAGACTGACGGTC
    M1          CCTGGGAGCACAGCTCATCACCATGGA
    M2          CACTGAACACAGAGGACTCACCATGGA
    M3          CATGGACCTCCTGCACAAGAACATGAA
    M4          ACTGAACAGAGAGAACTCACCATGGA
    Cm1         CAGTACTGCGATGAGTGGCA
    Vl1-5T7     TTTAGGCCATGGCCTGGACCCCTCTCCTGCTC
    Vl2-5T7     TTTAGGCCATGGCCTGGACCKTTCTCCTCCTC
    Vl3-5T7     TTTAGGCCATGGCCTGGDCTCYKCTCCTYCTC
    Vl4-5T7     TTTAGGCCATGGCATGGCCAGCTTCCCTCTCCTCCTC
    Vl5-5T7     TTTAGGCCATGACCTGCTCCCCTCTCCTCCTC
    C1-3        CCTGCAGCTCTAGTCTCCCGTGG

    Primers used at the re-amplification stage
    Vk-1/2-5T7  TTTAGGCATGGACATGAGGGTCCCCGCTCAGCTCCTGG
    Vk-3-5T7    TTTAGGCATGGAAACCCCAGCGCAGCTTCT
    Vk-4-5T7    TTTAGGCATGGTGTTGCAGACCCAGGTCTT
    hIgGk-3     TAATGGCCTAACACTCTCCCCTGTTGAAGCTCTT
    IgG-CH      TATTGGCGAGCTGGCCTCTCACCAACTGTCTTGTCCACCTTGGTGTTG
    Vh-1-3T7    CACTGGAGACGGTGACCAGBGTBCCYTGKCCCCA
    Vh-1-3T75   TATTGGCactcacggaagagacggtgaccagBgtBccYtg
    Vh-1-5T7    TATAGccatggactggacctgga
    Vh-2-5T7    TATAGccatggacatactttgttccac
    Vh-3-5T7    TATAGccatggagtttgggctgagc
    Vh-4-5T7    TATAGccatgaaacacctgtggttctt
    Vh-5-5T7    TATAGccatggggtcaaccgccatcct
    Vh-6-5T7    TATAGccatgtctgtctccttcctcat
    Vh-7-5T7    TATAGccatggaatttgggettagct
    Vh-8-5T7    TATAGccatggaattggggctgag
    Vh-1-3T75   TATTGGCactcacggaagagacggtgaccagBgtBccYtg
    Vm-1-5T7    TATAGaccatggactggacctggaggttcct
    Vm-2-5T7    TATAGaccatggagtttgggctgagctgggt
    Vm-3-5T7    TATAGaacatgaaacacctgtggttcttcct
    Vh-1-3T75   TATTGGCactcacggaagagacggtgaccagBgtBccYtg
    Vl1-5T7     TTTAGGccatggcctggacccctctcctgctc
    Vl2-5T7     TTTAGGccatggcctggacckttctcctcctc
    Vl3-5T7     TTTAGGccatggcctggdctcykctcctyctc
    Vl4-5T7     TTTAGGccatggcatggccagcttccctctcctcctc
    Vl5-5T7     TTTAGGccatgacctgctcccctctcctcctc
    hIgG1-3     taatggcCTATGAACATTCTGTAGGGGCCAC
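
    Several primers in Table 1 contain degenerate positions (for example D, Y, R, V, M, W, H, K and B). A minimal sketch, assuming these letters follow the standard IUPAC nucleotide ambiguity codes, shows how many distinct oligonucleotides a degenerate primer represents. The helper names degeneracy and expand are hypothetical and not part of this disclosure.

```python
from itertools import product
from math import prod

# Standard IUPAC nucleotide ambiguity codes (assumed to be the convention
# used for the degenerate positions in Table 1).
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degeneracy(primer: str) -> int:
    """Number of distinct sequences encoded by a degenerate primer."""
    return prod(len(IUPAC[base]) for base in primer.upper())


def expand(primer: str) -> list[str]:
    """List every concrete sequence encoded by a degenerate primer."""
    choices = (IUPAC[base] for base in primer.upper())
    return ["".join(combo) for combo in product(*choices)]


# PCR-stage primers copied from Table 1.
pvk1 = "GAGTCAGDYYCDRYCAGGACACAGCATG"
iggh1 = "TGAGVDMMGYWCHTCACCATGGACTG"

print("pVk-1 variants:", degeneracy(pvk1))
print("IgGH-1 variants:", degeneracy(iggh1))
print("first pVk-1 variant:", expand(pvk1)[0])
```

    Counting variants in this way gives a quick check on how broadly a single degenerate primer covers a gene family before it is ordered or pooled.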

    [0067] In the end, only 24% of the originally sorted cells yielded sequences for both V.sub.H- and V.sub.L-regions. This may seem a relatively low success rate. However, given that hundreds of cells can be collected and processed in just a few days, this rate still allows the accumulation of tens of sequence pairs for further antibody assembly. In the future, we expect to increase this rate by including anti-CD27 or anti-B220 monoclonal antibodies in the cell sorting protocol, thereby increasing the proportion of selected cells that produce antibodies relative to cells that may merely absorb them.

    [0068] Sequencing of 11 pairs of isolated DNA fragments revealed that practically all pairs were unique. Even when two pairs had one identical chain, the second chains were different (sequences of the variable domains of light and heavy chains are listed in Appendices 2 and 3).

    Example 3

    Introduction of Designed IgG-Encoding Constructs into Mammalian Cells and Evaluation of Conditions for Efficient IgG Production

    [0069] Eight pairs of isolated sequences were incorporated into the previously described vectors, and the resulting plasmids were introduced into CHO and HEK cells. ELISA registered accumulation of human antibodies in the media of both cultures. In isolated stable cell lines, the level of production varied but did not exceed 1-2 μg/ml (the level of production was determined on the basis of the amount of anti-BoNT/A purified from 100 ml of culture media, as described below). In our experience, HEK cells proved to be more robust and capable of producing more antibodies from the same volume of media. These cells were also easier to adapt to growth and IgG production in serum-free media. This is why we preferred to use HEK cells in most of our later analyses.

    [0070] To select clones with higher production, we decided to exploit the correlation, built into our system and discussed earlier, between translation of the sequences encoding the light and heavy chains of IgGs and translation of those encoding antibiotic-inactivating proteins. Specifically, by gradually increasing the amounts of antibiotics in the culture media, we were able to select cell lines whose resistance to antibiotics was 3-4 times higher than that of the originally selected cultures. As demonstrated in FIG. 4, however, ELISA revealed that cells with increased resistance to antibiotics did not produce substantially more immunoglobulins than cells possessing a lower level of resistance to these antibiotics.

    [0071] These data suggest that the bottleneck of production lies somewhere at the post-translational level. The conventional way of identifying cells with increased production of IgGs is limiting dilution cloning. The low-throughput nature of this method significantly limits the number of clones that can feasibly be screened. We tested whether fluorescence-activated cell sorting (FACS) can be used to increase throughput. As a marker for IgG-producing cells, we used the previously mentioned gfpBoNT/A-CH5. Cells were released from the solid support by treatment with trypsin and washed twice with fresh RPMI media to remove the trypsin. The cells were then incubated in RPMI media for 1 hour, co-incubated with gfpBoNT/A-CH5 for 10 minutes and subjected to FACS. From the 1% of cells with the highest fluorescence intensity, corresponding to the highest antibody production rates, single cells were sorted directly into 96-well plates at one cell per well. One plate was assembled for each IgG-producing cell line. Table 2 demonstrates that we were able to find clones with increased production of IgG-luciferase hybrids for five of the seven cell lines used in the experiment. These results demonstrate the potential of FACS for further development of cell lines producing high quantities of IgGs.
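
    The gating step described above, in which only the brightest 1% of cells are eligible for single-cell sorting, can be sketched as follows. This is an illustration under stated assumptions and not the instrument software used: the function name top_fraction_gate, the list of per-cell fluorescence values and the example numbers are all hypothetical.

```python
def top_fraction_gate(intensities, fraction=0.01):
    """Return indices of the brightest `fraction` of cells, brightest first.

    `intensities` is a list of per-cell fluorescence values (hypothetical
    input; a real sorter applies the gate in hardware or vendor software).
    At least one cell is always returned.
    """
    if not intensities:
        raise ValueError("no cells to gate")
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    n_keep = max(1, int(len(intensities) * fraction))
    ranked = sorted(range(len(intensities)),
                    key=lambda i: intensities[i], reverse=True)
    return ranked[:n_keep]


# Hypothetical example: 1000 cells with made-up intensities 0..999.
cells = list(range(1000))
gated = top_fraction_gate(cells, fraction=0.01)

# One cell per well, at most 96 wells per plate.
plate = gated[:96]
print(len(gated), "cells pass the gate;", len(plate), "wells filled")
```

    Keeping the gate as a fraction of the population rather than a fixed intensity threshold makes the selection comparable between cell lines whose overall staining differs.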

    TABLE-US-00002 TABLE 2 Production of IgG-MLuc by original cultures and individual clones selected from these cultures

    Original culture   Luminescence   Clone   Luminescence
    HEK-1HL-MLuc       657,148        1E7     1,641,522
    HEK-7HL-MLuc       1,387,980      7B8     8,013,339
    HEK-8HL-MLuc       981,702        8E8     3,783,486
    HEK-9HL-MLuc       1,991,512      9F6     2,778,794
    HEK-14HL-MLuc      951,132        14G11   721,576
    HEK-15HL-MLuc      104,466        15F2    594,677
    HEK-41HL-MLuc      3,274,119      41C9    3,163,750
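
    The comparison summarized in Table 2 can be reproduced with a few lines of arithmetic. The sketch below computes, for each cell line, the ratio of clone luminescence to original-culture luminescence and counts the lines in which the selected clone outperforms its parent. The values are copied from Table 2; the entry printed as 2.778.794 is assumed to mean 2,778,794. The variable and function names are hypothetical.

```python
# (original culture, luminescence, clone, clone luminescence) from Table 2.
# Assumption: the 9F6 value printed as "2.778.794" is read as 2,778,794.
TABLE_2 = [
    ("HEK-1HL-MLuc", 657_148, "1E7", 1_641_522),
    ("HEK-7HL-MLuc", 1_387_980, "7B8", 8_013_339),
    ("HEK-8HL-MLuc", 981_702, "8E8", 3_783_486),
    ("HEK-9HL-MLuc", 1_991_512, "9F6", 2_778_794),
    ("HEK-14HL-MLuc", 951_132, "14G11", 721_576),
    ("HEK-15HL-MLuc", 104_466, "15F2", 594_677),
    ("HEK-41HL-MLuc", 3_274_119, "41C9", 3_163_750),
]


def fold_changes(rows):
    """Map each clone to (clone luminescence / original-culture luminescence)."""
    return {clone: clone_lum / parent_lum
            for _, parent_lum, clone, clone_lum in rows}


changes = fold_changes(TABLE_2)
improved = [clone for clone, fc in changes.items() if fc > 1.0]

for clone, fc in changes.items():
    print(f"{clone}: {fc:.2f}-fold")
print(f"{len(improved)} of {len(changes)} clones exceed their original culture")
```

    Read this way, the table gives five improved clones out of seven, which matches the count stated in the text.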

    [0072] Production of the chimera IgGs and their characterization. For the reasons mentioned in the previous section, most of the IgG constructs were purified from the culture media of HEK cells. Our analysis of the accumulation of luciferase activity in the culture media of two cell lines encoding IgG-MLuc fusions revealed that accumulation continued in both for seven days. Therefore, all HEK cultures were grown for seven days in the same media, which was then passed through a column containing the hybrid between staphylococcal protein A and streptococcal protein G. In the case of CHO cells, the media was collected after three days. Bound IgGs were eluted by a buffer change to 0.1 M glycine HCl (pH 2.3). Immediately after elution, the pH of the collected fractions was raised by addition of 1 M Tris-Base. Fractions were then subjected to buffer exchange and concentrated by ultrafiltration.

    [0073] In addition to IgGs alone, we purified fusions of these IgGs with luciferase, GFP, and His-tag connected to the peptide that serves as a target for biotin ligase (BirA). Analysis confirmed the presence of polypeptides with expected molecular weights and recognized by anti-human antibodies in isolated fractions (FIG. 5).

    [0074] Fractions with IgG-MLuc fusions produced light in the presence of luciferase's substrate, coelenterazine. The IgG-GFP fusion emitted the green light characteristic of GFP upon illumination with UV light. Finally, the IgG fusion with His-tag and BirA substrate interacted with Ni-column and, after treatment with BirA in the presence of biotin and ATP, was recognized by streptavidin-alkaline phosphatase substrate (data not presented).

    [0075] ELISA revealed that all eight of the different IgGs that we purified recognized the receptor-recognizing domain of BoNT/A (FIG. 6). These data suggest that practically all isolated cells from which we were able to recover IgG-encoding sequences produced BoNT/A-specific antibodies.

    [0076] IgGs were recognized by hybrid proteins developed in our lab that are composed of staphylococcal protein A, streptococcal protein G and Metridia longa luciferase (spAG-MLuc and spAG-N-MLuc; sequences of plasmids encoding these proteins are presented in Appendix 4). These hybrids allowed quantitative monitoring of IgG present in the wells of a 96-well plate. Hybrid spAG-MLuc possessed luciferase activity only when it was purified from the culture media of mammalian cells. Hybrid spAG-N-MLuc possessed luciferase activity irrespective of where it was expressed, in E. coli or in mammalian cells.

    [0077] Examples 1-3 demonstrate that: 1) the number of peripheral blood cells encoding specific IgGs in blood and the efficiency of the cell sorting protocols used are sufficient to produce hundreds of cells that can serve as a source of Ig-encoding sequences; 2) the methods disclosed herein permit reliable isolation of cDNA encoding variable domains of both Ig-chains from isolated individual lymphocytes; 3) practically all isolated cDNA pairs encode IgG specific to the antigen used in the cell sorting procedure; 4) the expression vectors described herein are suitable for high throughput assembly of plasmids encoding both full size human IgGs and their derivatives carrying polypeptides that allow monitoring and/or specific binding of these IgGs to other molecules; 5) the vectors allow efficient selection of cells producing both IgG chains; and 6) FACS can be used as an efficient tool for selecting clones that produce increased quantities of IgGs and their derivatives.

    [0078] Accordingly, the compositions and methods described herein are useful in methods comprising one or more of these aspects.

    Example 4

    Construction and Expression of Libraries of Anti-Botulinum Chimeras that Recognize Regions of BoNT/A

    [0079] This example demonstrates the construction and use of libraries of anti-botulinum chimeras that recognize regions of BoNT/A.

    [0080] First, we will use conventional methods of gene engineering to create fusions of the corresponding domains with GFP. Similar to the previously mentioned gfpBoNT/A-CH5, these fusions will be used as markers for lymphocytes producing antibodies specific for the catalytic and transport domains of BoNT/A. As a source of lymphocytes, we will use white blood cells from the blood of an immunized individual that were generated and tested previously, and preserved under liquid nitrogen. It has been demonstrated that such cells can be used as a source of immunoglobulin-encoding sequences (25). These cells will be subjected to enrichment on BD IMag Anti-human CD19 Particles-DM and then sorted into wells of a 96-well plate, one cell per well. Prior to FACS, cells will be labeled with APC Mouse Anti-Human CD19 (BD Biosciences) and the corresponding GFP-BoNT/A fusion. To better discriminate IgG-producing cells from cells that do not produce IgGs but instead absorb them from serum, we will include an additional marker, a memory B cell marker. Bleesing and Fleisher reported that human B cells expose either B220 or CD27 on their surface [30]. Therefore, as the third component of the cell labeling mixture, we will use anti-CD27 (Ancell Co.) and/or anti-B220 (Beckman Coulter) monoclonal antibodies, each conjugated to R-Phycoerythrin.

    [0081] Isolated cells will be used as a source of sequences encoding V.sub.H- and V.sub.L-regions. Isolation and further handling of these sequences will be done according to the protocols described above. At this stage, the goal will be to isolate, for each BoNT/A domain, 10-20 V.sub.H- and V.sub.L-encoding pairs that have unique sequences.

    [0082] Unique V.sub.H- and V.sub.L-encoding pairs will be used to assemble and produce human/human IgG chimeras as described above.

    Example 5

    Identification of IgGs and Their Combinations that can Neutralize Toxic Activity of BoNT/A

    [0083] This Example demonstrates the identification of chimeric IgG antibodies with the capacity to neutralize toxicity of BoNT/A using phage display.

    [0084] Choosing V.sub.H- and V.sub.L-encoding pairs with unique sequences does not guarantee that they will recognize different epitopes. Therefore, prior to conducting expensive toxin neutralizing experiments, we will sort developed IgGs according to their epitope specificities. For this, we will use phage display known in the art. This technology involves a library of random peptides. Sequences of these peptides are incorporated in the region of the phage genome that encodes the capsid protein. As a result, each phage particle in the library encodes and exposes on its surface only one type of peptide. We previously demonstrated that incubation of such a library with immobilized polyclonal antibodies raised against BoNT/A allows isolation of phage particles that encode peptides mimicking BoNT/A epitopes (mimetics).

    [0085] We will use a similar approach to sort developed IgGs according to their epitope specificities. Specifically, each developed IgG will be purified and immobilized on a solid support. Then, each immobilized IgG will be co-incubated with the phage display library MD-12 (Alpha Universe, LLC). Phages that do not bind to the IgG will be removed by washing, and those bound to the IgG will be released and grown on appropriate host cells. Following this amplification, phages will be subjected to two additional cycles of the above-described screening procedure. According to our previous experience, practically all phages released after the third cycle will possess affinity to the IgG used in selection. To ensure that selected phages carry mimetics of BoNT/A, we have to prevent isolation of phages that interact with parts of the IgG other than the antigen-binding region. To do this, phages will be subjected to depletion with human naïve serum each time prior to incubation with the immobilized developed IgG. After mixing with phages, components of the human naïve serum, as well as phage particles bound to them, will be removed by adding magnetic beads carrying immobilized staphylococcal protein A-streptococcal protein G hybrid to the mixture.

    [0086] Individual phages carrying BoNT/A mimetics will be used for characterization of developed IgGs. Specifically, each IgG will be immobilized in wells of a 96-well plate, and each immobilized IgG will be incubated with all chosen mimetic-exposing phages. Wells with bound phages will be identified using M13 phage-specific antibodies conjugated with horseradish peroxidase (GE Healthcare) and 1-Step Slow TMB-ELISA (PIERCE). IgGs interacting with the same phage will be considered as recognizing the same epitope.
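
    The grouping rule stated above, under which IgGs that interact with the same phage are treated as recognizing the same epitope, can be sketched as a small clustering routine. The sketch assumes that the rule is applied transitively (if A and B share a phage and B and D share another, all three fall into one group), which is an interpretation and not something this disclosure specifies. The IgG and phage names are hypothetical placeholders.

```python
from collections import defaultdict


def group_by_shared_phage(binding):
    """Group IgGs that share at least one bound phage clone.

    `binding` maps an IgG name to the set of phage clones that gave an ELISA
    signal with it (hypothetical data layout). Sharing is treated as
    transitive, which is an assumption of this sketch.
    """
    parent = {igg: igg for igg in binding}

    def find(x):
        # Follow parent links to the group representative (with path halving).
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    first_binder = {}
    for igg, phages in binding.items():
        for phage in phages:
            if phage in first_binder:
                parent[find(igg)] = find(first_binder[phage])
            else:
                first_binder[phage] = igg

    groups = defaultdict(list)
    for igg in binding:
        groups[find(igg)].append(igg)
    return sorted(sorted(members) for members in groups.values())


# Hypothetical ELISA readout: IgG -> phage clones bound.
elisa = {
    "A": {"p1"},
    "B": {"p1", "p2"},
    "C": {"p3"},
    "D": {"p2"},
    "E": set(),
}
print(group_by_shared_phage(elisa))
```

    An IgG that binds none of the chosen phages, such as E in the example, remains in a group of its own and would need a separate characterization.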

    [0087] In addition to classifying developed IgGs according to their epitope (actually, mimetic) specificity, we will characterize these IgGs according to the nature of the recognized epitopes (linear or structural). In these experiments, we will compare the interaction of developed IgGs with the corresponding recombinant domains that have or have not been subjected to denaturing treatment. For this, the corresponding BoNT/A fragments will be subjected to native or SDS polyacrylamide gel electrophoresis, transferred onto a nitrocellulose membrane and probed with each chosen IgG separately. Then, filters will be treated with biotinylated anti-human IgGs, followed by treatment with streptavidin-horseradish peroxidase conjugate and Metal Enhanced DAB Substrate Kit (Pierce, Inc.). IgGs recognizing both forms of a BoNT/A fragment will be considered as recognizing linear epitopes. Those that recognize only BoNT/A fragments not subjected to denaturing conditions will be considered as recognizing structural epitopes.
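
    The decision rule for linear versus structural epitopes can be written down explicitly. The sketch below encodes only the two outcomes defined above; the remaining two outcomes (binding to the denatured form only, or no binding at all) are not addressed in this disclosure, so the labels returned for them are assumptions. The function name is hypothetical.

```python
def classify_epitope(binds_native: bool, binds_denatured: bool) -> str:
    """Classify an IgG's epitope from its two blot results.

    The first two branches follow the rule in the text; the last two cover
    outcomes the text does not discuss and are labeled accordingly.
    """
    if binds_native and binds_denatured:
        return "linear"
    if binds_native:
        return "structural"
    if binds_denatured:
        return "unclassified (denatured form only)"  # not covered by the text
    return "no binding"  # not covered by the text


# Hypothetical blot results for three IgGs: (native, denatured).
results = {"IgG-1": (True, True), "IgG-2": (True, False), "IgG-3": (False, False)}
for name, (native, denatured) in results.items():
    print(name, "->", classify_epitope(native, denatured))
```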

    [0088] The information about the nature of the recognized epitope will not only be used to verify epitope-based grouping of IgGs, but also to gain information about locations of corresponding epitopes on the BoNT/A molecule. Specifically, our previous experience suggests that, in the case of mimetics of linear epitopes, some similarities between sequences of these mimetics and the BoNT/A sequence can be observed. Such similarities may be used as indicators of the location of the corresponding epitope in the structure of the molecule.

    [0089] After developed IgGs are classified and grouped, representatives from each group will be tested for the ability to neutralize BoNT/A.

    [0090] It has been demonstrated that even when individual monoclonal antibodies do not have substantial protective activity, their combination may have such activity (24). This is why the analysis will include testing of the BoNT/A-neutralization potential of each chosen IgG separately and, then, testing of such potential for selected groups of IgGs.
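
    The testing order described above, individual IgGs first and combinations afterwards, can be laid out as an explicit plan. The sketch below enumerates every single IgG and then every combination up to a chosen size; the cap of three antibodies per combination and the placeholder names are assumptions made for illustration, since the text leaves the choice of groups open.

```python
from itertools import combinations


def neutralization_test_plan(representatives, max_size=3):
    """List single IgGs first, then all combinations of 2..max_size IgGs.

    `representatives` holds one IgG per epitope group (hypothetical names).
    `max_size` is an illustrative cap, not a value given in the text.
    """
    plan = [(igg,) for igg in representatives]
    for size in range(2, max_size + 1):
        plan.extend(combinations(representatives, size))
    return plan


reps = ["A", "B", "C", "D"]
plan = neutralization_test_plan(reps)
print(len(plan), "test articles")
for entry in plan:
    print(" + ".join(entry))
```

    Because the number of combinations grows quickly with the number of representatives, grouping IgGs by epitope before this step keeps the number of animal experiments manageable.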

    [0091] The goal of this analysis will be to identify IgGs or their combinations that will be able to protect mice from at least 1000 MLD.sub.50 of BoNT/A, where one MLD.sub.50 is the minimal dose lethal to fifty percent of mice. In addition, the aim will be to determine which of the three regions of the BoNT/A molecule (catalytic, transport, or receptor-recognizing) contains the highest number of protective epitopes. This information will be instrumental for the development of antibodies capable of neutralizing other serotypes of BoNTs.

    Example 6

    Development of Human/Human IgG Chimeras Capable of Neutralizing BoNT/B

    [0092] This Example demonstrates the development of human/human IgG chimeras capable of neutralizing BoNT/B.

    [0093] Previously, we demonstrated that different serotypes of BoNTs have similar epitopes and that information about the locations of epitopes in one serotype can be used to predict the locations of epitopes in other serotypes (26). We will use this phenomenon to speed up the development of IgGs capable of neutralizing BoNT serotype B. Specifically, instead of developing IgGs to the whole molecule of BoNT/B, we will focus on just one region. This region will be the same one as that found in BoNT/A to possess the most potent protective epitopes. Once the targeted region of BoNT/B is determined, we will create a fusion between GFP and the corresponding fragment of BoNT/B. This fusion will be used to isolate the corresponding lymphocytes from the same cryopreserved fractions of blood cells mentioned earlier. FACS and the subsequent isolation of cDNAs, their PCR, cloning, expression of assembled sequences, purification of IgGs, and analysis of their protective properties will be done in the same way as described in the previous two sections.

    [0094] As in the case of BoNT/A, our goal will be to identify IgGs or their combinations that will ensure protection of mice from at least 1000 MLD.sub.50.

    [0095] Optimization of protocols for production of chosen chimeras. The ability to efficiently produce the developed protective IgGs is a key element for the system to become commercially viable. Earlier analysis of different monoclonal antibody-producing cell lines conducted by O'Callaghan and coauthors revealed that each cell line had its own bottleneck limiting production of antibodies (27). This research supports the approach of selecting high producers from a population of cells already producing IgG. This approach has been successfully used by many groups, including ours. However, such selection often requires multiple cycles and is very lengthy. Development of a strain in which the bottlenecks are widened or even removed will substantially increase the potential for high throughput development of cells producing high quantities of IgGs. Recent reports of successful increases in antibody production via introduction of specific DNA sequences into the cells suggest that such an approach is possible (28-30).

    [0096] To create a cell line inherently capable of producing increased quantities of IgGs, we will produce IgG derivatives carrying different polypeptides on the C-termini of heavy chains. Specifically, we will engineer a plasmid encoding one of the anti-BoNT/A IgGs fused with the trans-membrane domain of platelet derived growth factor receptor (31). This plasmid will allow generation of transiently transfected cells expressing IgG anchored in the cell membrane. Such cells will be stained with gfpBoNT/A-CH5 and subjected to FACS. Individual cells carrying the highest levels of fluorescent label will be sorted into wells of a 96-well plate and allowed to grow. We anticipate that the majority of such cells will lose the IgG-encoding plasmids. As a result, such cells will stop producing the corresponding IgG derivative and the antibiotic-inactivating enzymes encoded by the plasmid. Cell lines grown from such cells will be transfected again. This time, we will use a plasmid encoding an IgG-luciferase hybrid formed by a V.sub.H- and V.sub.L-pair different from the one used in the previous transfection. Parental cell lines of those transient transfectants whose culture media contains the highest amounts of luciferase will be tested further for the ability to produce high quantities of other types of IgG-luciferase fusions. Eventually, we expect to be able to isolate a cell line that will produce increased quantities of IgGs irrespective of the sequences of their V.sub.H- and V.sub.L-regions.

    [0097] To increase the success rate of the above-described selection, we will use a cell line whose diversity will be increased by chemical mutagenesis. Further, to eliminate difficulties associated with sorting originally adherent cells, we will use FREESTYLE CHO-S cells (Invitrogen, Inc.). This cell line has been adapted to grow in suspension in serum-free media. The latter feature will be beneficial for future production of antibodies.

    [0098] Even with a developed host cell line capable of increased production of IgGs, we do not exclude the need for additional selection of super-producers among the created IgG-producing cells. Traditionally, such selection is done by limiting dilution cloning, which is a very labor-intensive process. Instead, we will use FACS protocols to isolate from the population those cells that bind the highest amounts of the label after a very short exposure to it, and then to isolate the cells that lose this label faster than others.

    [0099] As a result of these activities, we will not only generate cell lines producing high quantities of chosen IgGs, but will also determine the best way to efficiently develop new IgG-producing cell lines.

    REFERENCES

    [0100] 1. Smith, K., Garman, L., Wrammert, J., Zheng, N., Capra, J. D., and Wilson, P. C. (2009) Nat Protoc. 4, 372-384
    [0101] 2. Arnon, S. S., Schechter, R., Inglesby, T. V, Henderson, D. A., Bartlett, J. G., Ascher, M. S., Eitzen, E., Fine, A. D., Hauer, J., Layton, M., Lillibridge, S., Osterholm, M. T., O'Toole, T., Parker, G., Perl, T. M., Russell, P. K., Swerdlow, D. L., and Tonat, K. (2001) JAMA 285, 1059-1070 [online] http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=11209178.
    [0102] 3. St John, R., Finlay, B., and Blair, C. (2001) The Canadian journal of infectious diseases = Journal canadien des maladies infectieuses 12, 275-84 [online] http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2094836&tool=pmcentrez&rendertype=abstract (Accessed Nov. 23, 2012).
    [0103] 4. Smith, L. A., and Rusnak, J. M. (2007) Critical reviews in immunology 27, 303-18 [online] http://www.ncbi.nlm.nih.gov/pubmed/18197811 (Accessed Nov. 21, 2012).
    [0104] 5. Notice of CDC's discontinuation of investigational pentavalent (ABCDE) botulinum toxoid vaccine for workers at risk for occupational exposure to botulinum toxins (2011) MMWR Morb Mortal Wkly Rep 60, 1454-1455 [online] http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=22031218.
    [0105] 6. Clayton, M. A., Clayton, J. M., Brown, D. R., and Middlebrook, J. L. (1995) Infect Immun 63, 2738-42.
    [0106] 7. Black, R. E., and Gunn, R. A. (1980) The American journal of medicine 69, 567-70 [online] http://www.ncbi.nlm.nih.gov/pubmed/7191633 (Accessed Nov. 23, 2012).
    [0107] 8. Wang, X., and Stollar, B. D. (2000) 244, 217-225
    [0108] 9. Orlandi, R., Gussow, D. H., Jones, P. T., and Winter, G. (1992) Biotechnology 24, 527-31.
    [0109] 10. Beidler, C. B., Ludwig, J. R., Cardenas, J., Phelps, J., Papworth, C. G., Melcher, E., Sierzega, M., Myers, L. J., Unger, B. W., and Fisher, M. (1988) J Immunol 141, 4053-60.
    [0110] 11. Zhao, Y., and Hammarström, L. (2003) Immunology 108, 288-95 [online] http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=1782897&tool=pmcentrez&rendertype=abstract (Accessed Nov. 14, 2012).
    [0111] 12. CDC (2011) MMWR. Morbidity and mortality weekly report 60, 1454-5 [online] http://www.ncbi.nlm.nih.gov/pubmed/22031218 (Accessed Aug. 24, 2012).
    [0112] 13. Beidler, C. B., Ludwig, J. R., Cardenas, J., Phelps, J., Papworth, C. G., Melcher, E., Sierzega, M., Myers, L. J., Unger, B. W., and Fisher, M. (1988) Journal of immunology (Baltimore, Md.: 1950) 141, 4053-60 [online] http://www.ncbi.nlm.nih.gov/pubmed/3141512 (Accessed Nov. 24, 2012).
    [0113] 14. Gillies, S. D., Lo, K. M., and Wesolowski, J. (1989) Journal of immunological methods 125, 191-202 [online] http://www.ncbi.nlm.nih.gov/pubmed/2514231 (Accessed Nov. 24, 2012).
    [0114] 15. Norderhaug, L., Olafsen, T., Michaelsen, T. E., and Sandlie, I. (1997) Journal of immunological methods 204, 77-87 [online] http://www.ncbi.nlm.nih.gov/pubmed/9202712 (Accessed Nov. 24, 2012).
    [0115] 16. Liu, A. Y., Mack, P. W., Champion, C. I., and Robinson, R. R. (1987) Gene 54, 33-40 [online] http://www.ncbi.nlm.nih.gov/pubmed/3111940 (Accessed Nov. 24, 2012).
    [0116] 17. Schmitt, C. K., and Molineux, I. J. (1991) Journal of bacteriology 173, 1536-43 [online] http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=207293&tool=pmcentrez&rendertype=abstract (Accessed Nov. 10, 2012).
    [0117] 18. Markova, S. V, Golz, S., Frank, L. A., Kalthof, B., and Vysotski, E. S. (2004) The Journal of biological chemistry 279, 3212-7 [online] http://www.ncbi.nlm.nih.gov/pubmed/14583604 (Accessed Nov. 24, 2012).
    [0118] 19. Markova, S. V, Burakova, L. P., and Vysotski, E. S. (2012) Biochemical and biophysical research communications 417, 98-103 [online] http://www.ncbi.nlm.nih.gov/pubmed/22138240 (Accessed Jul. 20, 2012).
    [0119] 20. Shpigel, E., Goldlust, A., Efroni, G., Avraham, A., Eshel, A., Dekel, M., and Shoseyov, O. (1999) Biotechnology and bioengineering 65, 17-23 [online] http://www.ncbi.nlm.nih.gov/pubmed/10440667.
    [0120] 21. Cao, Y., Zhang, Q., Wang, C., Zhu, Y., and Bai, G. (2007) Journal of chromatography. A 1149, 228-35 [online] http://www.ncbi.nlm.nih.gov/pubmed/17391680 (Accessed Jul. 20, 2012).
    [0121] 22. Smith, K., Garman, L., Wrammert, J., Zheng, N., Capra, J. D., Ahmed, R., and Wilson, P. C. (2009)
    [0122] 23. Adekar, S. P., Takahashi, T., Jones, R. M., Al-Saleem, F. H., Ancharski, D. M., Root, M. J., Kapadnis, B. P., Simpson, L. L., and Dessain, S. K. (2008) PloS one 3, e3023 [online] http://dx.plos.org/10.1371/journal.pone.0003023 (Accessed Nov. 15, 2012).
    [0123] 24. Nowakowski, A., Wang, C., Powers, D. B., Amersdorfer, P., Smith, T. J., Montgomery, V. A., Sheridan, R., Blake, R., Smith, L. A., and Marks, J. D. (2002) Proceedings of the National Academy of Sciences of the United States of America 99, 11346-50 [online] http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=123259&tool=pmcentrez&rendertype=abstract (Accessed Nov. 25, 2012).
    [0124] 25. Hansen, A., Reiter, K., Dorner, T., and Pruss, A. (2005) Cell Tissue Bank 6, 299-308 [online] http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=16308769.
    [0125] 26. Zdanovsky, A., Zdanovsky, D., and Zdanovskaia, M. (2012) Toxicon: official journal of the International Society on Toxinology 60, 1277-86 [online] http://www.ncbi.nlm.nih.gov/pubmed/22922018 (Accessed Nov. 4, 2012).
    [0126] 27. O'Callaghan, P. M., McLeod, J., Pybus, L. P., Lovelady, C. S., Wilkinson, S. J., Racher, A. J., Porter, A., and James, D. C. (2010) Biotechnology and bioengineering 106, 938-51 [online] http://www.ncbi.nlm.nih.gov/pubmed/20589672 (Accessed Nov. 26, 2012).
    [0127] 28. Florin, L., Pegel, A., Becker, E., Hausser, A., Olayioye, M. A., and Kaufmann, H. (2009) Journal of biotechnology 141, 84-90 [online] http://www.ncbi.nlm.nih.gov/pubmed/19428735 (Accessed Nov. 16, 2012).
    [0128] 29. Peng, R., Abellan, E., and Fussenegger, M. (2011) Biotechnol Bioeng 108, 611-620
    [0129] 30. Peng, R.-W., and Fussenegger, M. (2009) Biotechnology and bioengineering 102, 1170-81 [online] http://www.ncbi.nlm.nih.gov/pubmed/18989903 (Accessed Nov. 27, 2012).
    [0130] 31. Zhou, C., Jacobsen, F. W., Cai, L., Chen, Q., and Shen, W. D. mAbs 2, 508-18 [online] http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2958572&tool=pmcentrez&rendertype=abstract (Accessed Nov. 16, 2012).

    TABLE-US-00003 APPENDIX1 Nucleotidesequencesofconstructedplasmids pVLentry-Hyg10: 1 TGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAA ACCGGGCGGACCGACTGGCGGGTTGCTGGGGGCGGGTAACTGCAGTTATTACTGCATACAAGGGTATCATTGCGGTTATCCCTGAAAGGTAACTGCAGTT 101 TGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCG ACCCACCTCATAAATGCCATTTGACGGGTGAACCGTCATGTAGTTCACATAGTATACGGTTCATGCGGGGGATAACTGCAGTTACTGCCATTTACCGGGC 201 CCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCA GGACCGTAATACGGGTCATGTACTGGAATACCCTGAAAGGATGAACCGTCATGTAGATGCATAATCAGTAGCGATAATGGTACCACTACGCCAAAACCGT 301 GTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGG CATGTAGTTACCCGCACCTATCGCCAAACTGAGTGCCCCTAAAGGTTCAGAGGTGGGGTAACTGCAGTTACCCTCAAACAAAACCGTGGTTTTAGTTGCC 401 GACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTGGTTTAGTGAACC CTGAAAGGTTTTACAGCATTGTTGAGGCGGGGTAACTGCGTTTACCCGCCATCCGCACATGCCACCCTCCAGATATATTCGTCTCGACCAAATCACTTGG Esp3I ~~~~~~~ 501 GTCAGATCCGCTAGACGTCTCATTTAACTTTAAGAAGGAGATATACATATGGCTAGCATGACTGGTGGACAGCAAATGGGTACTAACCAAGGTAAAGGTG CAGTCTAGGCGATCTGCAGAGTAAATTGAAATTCTTCCTCTATATGTATACCGATCGTACTGACCACCTGTCGTTTACCCATGATTGGTTCCATTTCCAC 601 TAGTTGCTGCTGGAGATAAACTGGCGTTGTTCTTGAAGGTATTTGGCGGTGAAGTCCTGACTGCGTTCGCTCGTACCTCCGTGACCACTTCTCGCCACAT ATCAACGACGACCTCTATTTGACCGCAACAAGAACTTCCATAAACCGCCACTTCAGGACTGACGCAAGCGAGCATGGAGGCACTGGTGAAGAGCGGTGTA 701 GGTACGTTCCATCTCCAGCGGTAAATCCGCTCAGTTCCCTGTTCTGGGTCGCACTCAGGCAGCGTATCTGGCTCCGGGCGAGAACCTCGACGATAAACGT CCATGCAAGGTAGAGGTCGCCATTTAGGCGAGTCAAGGGACAAGACCCAGCGTGAGTCCGTCGCATAGACCGAGGCCCGCTCTTGGAGCTGCTATTTGCA 801 AAGGACATCAAACACACCGAGAAGGTAATCACCATTGACGGTCTCCTGACGGCTGACGTTCTGATTTATGATATTGAGGACGCGATGAACCACTACGACG TTCCTGTAGTTTGTGTGGCTCTTCCATTAGTGGTAACTGCCAGAGGACTGCCGACTGCAAGACTAAATACTATAACTCCTGCGCTACTTGGTGATGCTGC 901 
TTCGCTCTGAGTATACCTCTCAGTTGGGTGAATCTCTGGCGATGGCTGCGGATGGTGCGGTTCTGGCTGAGATTGCCGGTCTGTGTAACGTGGAAAGCAA AAGCGAGACTCATATGGAGAGTCAACCCACTTAGAGACCGCTACCGACGCCTACCACGCCAAGACCGACTCTAACGGCCAGACACATTGCACCTTTCGTT 1001 ATATAATGAGAACATCGAGGGCTTAGGTACTGCTACCGTAATTGAGACCACTCAGAACAAGGCCGCACTTACCGACCAAGTTGCGCTGGGTAAGGAGATT TATATTACTCTTGTAGCTCCCGAATCCATGACGATGGCATTAACTCTGGTGAGTCTTGTTCCGGCGTGAATGGCTGGTTCAACGCGACCCATTCCTCTAA 1101 ATTGCGGCTCTGACTAAGGCTCGTGCGGCTCTGACCAAGAACTATGTTCCGGCTGCTGACCGTGTGTTCTACTGTGACCCAGATAGCTACTCTGCGATTC TAACGCCGAGACTGATTCCGAGCACGCCGAGACTGGTTCTTGATACAAGGCCGACGACTGGCACACAAGATGACACTGGGTCTATCGATGAGACGCTAAG 1201 TGGCAGCACTGATGCCGAACGCAGCAAACTACGCTGCTCTGATTGACCCTGAGAAGGGTTCTATCCGCAACGTTATGGGCTTTGAGGTTGTAGAAGTTCC ACCGTCGTGACTACGGCTTGCGTCGTTTGATGCGACGAGACTAACTGGGACTCTTCCCAAGATAGGCGTTGCAATACCCGAAACTCCAACATCTTCAAGG 1301 GCACCTCACCGCTGGTGGTGCTGGTACCGCTCGTGAGGGCACTACTGGTCAGAAGCACGTCTTCCCTGCCAATAAAGGTGAGGGTAATGTCAAGGTTGCT CGTGGAGTGGCGACCACCACGACCATGGCGAGCACTCCCGTGATGACCAGTCTTCGTGCAGAAGGGACGGTTATTTCCACTCCCATTACAGTTCCAACGA 1401 AAGGACAACGTTATCGGCCTGTTCATGCACCGCTCTGCGGTAGGTACTGTTAAGCTGCGTGACTTGGCTCTGGAGCGCGCTCGCCGTGCTAACTTCCAAG TTCCTGTTGCAATAGCCGGACAAGTACGTGGCGAGACGCCATCCATGACAATTCGACGCACTGAACCGAGACCTCGCGCGAGCGGCACGATTGAAGGTTC Esp3I ~~~~~~ 1501 CGGACCAGATTATCGCTAAGTACGCAATGGGCCACGGTGGTCTTCGCCCAGAAGCTGCAGGAGCTGTCGTATTCCAGTCAGGTTAATTACGAGACGCTCG GCCTGGTCTAATAGCGATTCATGCGTTACCCGGTGCCACCAGAAGCGGGTCTTCGACGTCCTCGACAGCATAAGGTCAGTCCAATTAATGCTCTGCGAGC 1601 AGCCGATCCGCATCAAAGCATGCTGTTTTCTGTCTGTCCCTAACATGCCCTGTGATTATCCGCAAACAACACACCCAAGGGCAGAACTTTGTTACTTAAA TCGGCTAGGCGTAGTTTCGTACGACAAAAGACAGACAGGGATTGTACGGGACACTAATAGGCGTTTGTTGTGTGGGTTCCCGTCTTGAAACAATGAATTT 1701 CACCATCCTGTTTGCTTCTTTCCTCAGGAACTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCATCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGT GTGGTAGGACAAACGAAGAAAGGAGTCCTTGACACCGACGTGGTAGACAGAAGTAGAAGGGCGGTAGACTACTCGTCAACTTTAGACCTTGACGGAGACA 1801 TGTGTGCCTGCTGAATAACTTCTATCCCAGAGAGGCCAAAGTACAGTGGAAGGTGGATAACGCCCTCCAATCGGGTAACTCCCAGGAGAGTGTCACAGAG 
ACACACGGACGACTTATTGAAGATAGGGTCTCTCCGGTTTCATGTCACCTTCCACCTATTGCGGGAGGTTAGCCCATTGAGGGTCCTCTCACAGTGTCTC 1901 CAGGACAGCAAGGACAGCACCTACAGCCTCAGCAGCACCCTGACGCTGAGCAAAGCAGACTACGAGAAACACAAAGTCTACGCCTGCGAAGTCACCCATC GTCCTGTCGTTCCTGTCGTGGATGTCGGAGTCGTCGTGGGACTGCGACTCGTTTCGTCTGATGCTCTTTGTGTTTCAGATGCGGACGCTTCAGTGGGTAG 2001 AGGGCCTGAGCTCGCCCGTCACAAAGAGCTTCAACAGGGGAGAGTGTTAGCGGCCAATTGGCGGCCGCAATTTAATTCCGGTTATTTTCCACCATATTGC TCCCGGACTCGAGCGGGCAGTGTTTCTCGAAGTTGTCCCCTCTCACAATCGCCGGTTAACCGCCGGCGTTAAATTAAGGCCAATAAAAGGTGGTATAACG 2101 CGTCTTTTGGCAATGTGAGGGCCCGGAAACCTGGCCCTGTCTTCTTGACGAGCATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAGGTCTGTT GCAGAAAACCGTTACACTCCCGGGCCTTTGGACCGGGACAGAAGAACTGCTCGTAAGGATCCCCAGAAAGGGGAGAGCGGTTTCCTTACGTTCCAGACAA 2201 GAATGTCGTGAAGGAAGCAGTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGGCAGCGGAACCCCCCACCTGGCGACAGG CTTACAGCACTTCCTTCGTCAAGGAGACCTTCGAAGAACTTCTGTTTGTTGCAGACATCGCTGGGAAACGTCCGTCGCCTTGGGGGGTGGACCGCTGTCC 2301 TGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCACAACCCCAGTGCCACGTTGTGAGTTGGATAGTTGTGGAAAGAGTCAAAT ACGGAGACGCCGGTTTTCGGTGCACATATTCTATGTGGACGTTTCCGCCGTGTTGGGGTCACGGTGCAACACTCAACCTATCAACACCTTTCTCAGTTTA 2401 GGCTCACCTCAAGCGTATTCAACAAGGGGCTGAAGGATGCCCAGAAGGTACCCCATTGTATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATG CCGAGTGGAGTTCGCATAAGTTGTTCCCCGACTTCCTACGGGTCTTCCATGGGGTAACATACCCTAGACTAGACCCCGGAGCCACGTGTACGAAATGTAC 2501 TGTTTAGTCGAGGTTAAAAAACGTCTAGGCCCCCCGAACCACGGGGACGTGGTTTTCCTTTGAAAAACACGATGATAATATGGCCACCACCCATACCTAG ACAAATCAGCTCCAATTTTTTGCAGATCCGGGGGGCTTGGTGCCCCTGCACCAAAAGGAAACTTTTTGTGCTACTATTATACCGGTGGTGGGTATGGATC 2601 GCTTTTGCAAAGATCGATCAGATCCCGGGGGGCAATGAGATATGAAAAAGCCTGAACTCACCGCGACGTCTGTCGAGAAGTTTCTGATCGAAAAGTTCGA CGAAAACGTTTCTAGCTAGTCTAGGGCCCCCCGTTACTCTATACTTTTTCGGACTTGAGTGGCGCTGCAGACAGCTCTTCAAAGACTAGCTTTTCAAGCT 2701 CAGCGTATCCGACCTGATGCAGCTCTCGGAGGGCGAAGAATCTCGTGCTTTCAGCTTCGATGTAGGAGGGCGTGGATATGTCCTGCGGGTAAATAGCTGC GTCGCATAGGCTGGACTACGTCGAGAGCCTCCCGCTTCTTAGAGCACGAAAGTCGAAGCTACATCCTCCCGCACCTATACAGGACGCCCATTTATCGACG 2801 
GCCGATGGTTTCTACAAAGATCGTTATGTTTATCGGCACTTTGCATCGGCCGCGCTCCCGATTCCGGAAGTGCTTGACATTGGGGAATTCAGCGAGAGCC CGGCTACCAAAGATGTTTCTAGCAATACAAATAGCCGTGAAACGTAGCCGGCGCGAGGGCTAAGGCCTTCACGAACTGTAACCCCTTAAGTCGCTCTCGG 2901 TGACCTATTGCATCTCCCGCCGTGCACAGGGTGTCACGTTGCAAGACCTGCCTGAAACCGAACTGCCCGCTGTTCTGCAGCCGGTCGCGGAGGCCATGGA ACTGGATAACGTAGAGGGCGGCACGTGTCCCACAGTGCAACGTTCTGGACGGACTTTGGCTTGACGGGCGACAAGACGTCGGCCAGCGCCTCCGGTACCT 3001 TGCGATCGCTGCGGCCGATCTTAGCCAGACGAGCGGGTTCGGCCCATTCGGACCGCAAGGAATCGGTCAATACACTACATGGCGTGATTTCATATGCGCG ACGCTAGCGACGCCGGCTAGAATCGGTCTGCTCGCCCAAGCCGGGTAAGCCTGGCGTTCCTTAGCCAGTTATGTGATGTACCGCACTAAAGTATACGCGC 3101 ATTGCTGATCCCCATGTGTATCACTGGCAAACTGTGATGGACGACACCGTCAGTGCGTCCGTCGCGCAGGCTCTCGATGAGCTGATGCTTTGGGCCGAGG TAACGACTAGGGGTACACATAGTGACCGTTTGACACTACCTGCTGTGGCAGTCACGCAGGCAGCGCGTCCGAGAGCTACTCGACTACGAAACCCGGCTCC 3201 ACTGCCCCGAAGTCCGGCACCTCGTGCACGCGGATTTCGGCTCCAACAATGTCCTGACGGACAATGGCCGCATAACAGCGGTCATTGACTGGAGCGAGGC TGACGGGGCTTCAGGCCGTGGAGCACGTGCGCCTAAAGCCGAGGTTGTTACAGGACTGCCTGTTACCGGCGTATTGTCGCCAGTAACTGACCTCGCTCCG 3301 GATGTTCGGGGATTCCCAATACGAGGTCGCCAACATCTTCTTCTGGAGGCCGTGGTTGGCTTGTATGGAGCAGCAGACGCGCTACTTCGAGCGGAGGCAT CTACAAGCCCCTAAGGGTTATGCTCCAGCGGTTGTAGAAGAAGACCTCCGGCACCAACCGAACATACCTCGTCGTCTGCGCGATGAAGCTCGCCTCCGTA 3401 CCGGAGCTTGCAGGATCGCCGCGGCTCCGGGCGTATATGCTCCGCATTGGTCTTGACCAACTCTATCAGAGCTTGGTTGACGGCAATTTCGATGATGCAG GGCCTCGAACGTCCTAGCGGCGCCGAGGCCCGCATATACGAGGCGTAACCAGAACTGGTTGAGATAGTCTCGAACCAACTGCCGTTAAAGCTACTACGTC 3501 CTTGGGCGCAGGGTCGATGCGACGCAATCGTCCGATCCGGAGCCGGGACTGTCGGGCGTACACAAATCGCCCGCAGAAGCGCGGCCGTCTGGACCGATGG GAACCCGCGTCCCAGCTACGCTGCGTTAGCAGGCTAGGCCTCGGCCCTGACAGCCCGCATGTGTTTAGCGGGCGTCTTCGCGCCGGCAGACCTGGCTACC 3601 CTGTGTAGAAGTACTCGCCGATAGTGGAAACCGACGCCCCAGCACTCGTCCGGATCGGGAGATGGGGGAGGCTAACTGAAACACGGAAGGAGACAATACC GACACATCTTCATGAGCGGCTATCACCTTTGGCTGCGGGGTCGTGAGCAGGCCTAGCCCTCTACCCCCTCCGATTGACTTTGTGCCTTCCTCTGTTATGG I-SceI ~~~~~~~~~~ 3701 GGAAGGAACCTCGACGTTAACTTGTTTATTGCAGCTTATAATGGTTACAAATAAAGCAATAGCATCACAAATTTCACAAATAAAGCATTTATTACCCTGT 
CCTTCCTTGGAGCTGCAATTGAACAAATAACGTCGAATATTACCAATGTTTATTTCGTTATCGTAGTGTTTAAAGTGTTTATTTCGTAAATAATGGGACA I-SceI ~~~~~~~~ 3801 TATCCCTAGAATTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCA ATAGGGATCTTAAGTGACCGGCAGCAAAATGTTGCAGCACTGACCCTTTTGGGACCGCAATGGGTTGAATTAGCGGAACGTCGTGTAGGGGGAAAGCGGT 3901 GCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATGGCGCCTGATGCGGTATTTTCTCCTTACGCA CGACCGCATTATCGCTTCTCCGGGCGTGGCTAGCGGGAAGGGTTGTCAACGCGTCGGACTTACCGCTTACCGCGGACTACGCCATAAAAGAGGAATGCGT 4001 TCTGTGCGGTATTTCACACCGCATACGTCAAAGCAACCATAGTACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGA AGACACGCCATAAAGTGTGGCGTATGCAGTTTCGTTGGTATCATGCGCGGGACATCGCCGCGTAATTCGCGCCGCCCACACCACCAATGCGCGTCGCACT 4101 CCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGG GGCGATGTGAACGGTCGCGGGATCGCGGGCGAGGAAAGCGAAAGAAGGGAAGGAAAGAGCGGTGCAAGCGGCCGAAAGGGGCAGTTCGAGATTTAGCCCC 4201 GCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAACTTGATTTGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACG CGAGGGAAATCCCAAGGCTAAATCACGAAATGCCGTGGAGCTGGGGTTTTTTGAACTAAACCCACTACCAAGTGCATCACCCGGTAGCGGGACTATCTGC 4301 GTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGGGCTATTCTTTTGATT CAAAAAGCGGGAAACTGCAACCTCAGGTGCAAGAAATTATCACCTGAGAACAAGGTTTGACCTTGTTGTGAGTTGGGATAGAGCCCGATAAGAAAACTAA 4401 TATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAATATTAACGTTTACAATTTT ATATTCCCTAAAACGGCTAAAGCCGGATAACCAATTTTTTACTCGACTAAATTGTTTTTAAATTGCGCTTAAAATTGTTTTATAATTGCAAATGTTAAAA 4501 ATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCCGACACCCGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTC TACCACGTGAGAGTCATGTTAGACGAGACTACGGCGTATCAATTCGGTCGGGGCTGTGGGCGGTTGTGGGCGACTGCGCGGGACTGCCCGAACAGACGAG 4601 CCGGCATCCGCTTACAGACAAGCTGTGACCGTCTAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAATAATGGTTTCTTAG GGCCGTAGGCGAATGTCTGTTCGACACTGGCAGATCTGCTTTCCCGGAGCACTATGCGGATAAAAATATCCAATTACAGTACTATTATTACCAAAGAATC 4701 
ACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCT TGCAGTCCACCGTGAAAAGCCCCTTTACACGCGCCTTGGGGATAAACAAATAAAAAGATTTATGTAAGTTTATACATAGGCGAGTACTCTGTTATTGGGA 4801 GATAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTT CTATTTACGAAGTTATTATAACTTTTTCCTTCTCATACTCATAAGTTGTAAAGGCACAGCGGGAATAAGGGAAAAAACGCCGTAAAACGGAAGGACAAAA 4901 TGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTT ACGAGTGGGTCTTTGCGACCACTTTCATTTTCTACGACTTCTAGTCAACCCACGTGCTCACCCAATGTAGCTTGACCTAGAGTTGTCGCCATTCTAGGAA 5001 GAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAAC CTCTCAAAAGCGGGGCTTCTTGCAAAAGGTTACTACTCGTGAAAATTTCAAGACGATACACCGCGCCATAATAGGGCATAACTGCGGCCCGTTCTCGTTG 5101 TCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAG AGCCAGCGGCGTATGTGATAAGAGTCTTACTGAACCAACTCATGAGTGGTCAGTGTCTTTTCGTAGAATGCCTACCGTACTGTCATTCTCTTAATACGTC 5201 TGCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGCACAACATGGGGGAT ACGACGGTATTGGTACTCACTATTGTGACGCCGGTTGAATGAAGACTGTTGCTAGCCTCCTGGCTTCCTCGATTGGCGAAAAAACGTGTTGTACCCCCTA 5301 CATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGC GTACATTGAGCGGAACTAGCAACCCTTGGCCTCGACTTACTTCGGTATGGTTTGCTGCTCGCACTGTGGTGCTACGGACATCGTTACCGTTGTTGCAACG 5401 GCAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGC CGTTTGATAATTGACCGCTTGATGAATGAGATCGAAGGGCCGTTGTTAATTATCTGACCTACCTCCGCCTATTTCAACGTCCTGGTGAAGACGCGAGCCG 5501 CCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGT GGAAGGCCGACCGACCAAATAACGACTATTTAGACCTCGGCCACTCGCACCCAGAGCGCCATAGTAACGTCGTGACCCCGGTCTACCATTCGGGAGGGCA 5601 ATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGT 
TAGCATCAATAGATGTGCTGCCCCTCAGTCCGTTGATACCTACTTGCTTTATCTGTCTAGCGACTCTATCCACGGAGTGACTAATTCGTAACCATTGACA 5701 CAGACCAAGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAA GTCTGGTTCAAATGAGTATATATGAAATCTAACTAAATTTTGAAGTAAAAATTAAATTTTCCTAGATCCACTTCTAGGAAAAACTATTAGAGTACTGGTT 5801 AATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGC TTAGGGAATTGCACTCAAAAGCAAGGTGACTCGCAGTCTGGGGCATCTTTTCTAGTTTCCTAGAAGAACTCTAGGAAAAAAAGACGCGCATTAGACGACG 5901 TTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGA AACGTTTGTTTTTTTGGTGGCGATGGTCGCCACCAAACAAACGGCCTAGTTCTCGATGGTTGAGAAAAAGGCTTCCATTGACCGAAGTCGTCTCGCGTCT 6001 TACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGT ATGGTTTATGACAGGAAGATCACATCGGCATCAATCCGGTGGTGAAGTTCTTGAGACATCGTGGCGGATGTATGGAGCGAGACGATTAGGACAATGGTCA 6101 GGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGC CCGACGACGGTCACCGCTATTCAGCACAGAATGGCCCAACCTGAGTTCTGCTATCAATGGCCTATTCCGCGTCGCCAGCCCGACTTGCCCCCCAAGCACG 6201 ACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACA TGTGTCGGGTCGAACCTCGCTTGCTGGATGTGGCTTGACTCTATGGATGTCGCACTCGATACTCTTTCGCGGTGCGAAGGGCTTCCCTCTTTCCGCCTGT 6301 GGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCT CCATAGGCCATTCGCCGTCCCAGCCTTGTCCTCTCGCGTGCTCCCTCGAAGGTCCCCCTTTGCGGACCATAGAAATATCAGGACAGCCCAAAGCGGTGGA 6401 CTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGG GACTGAACTCGCAGCTAAAAACACTACGAGCAGTCCCCCCGCCTCGGATACCTTTTTGCGGTCGTTGCGCCGGAAAAATGCCAAGGACCGGAAAACGACC 6501 CCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGA GGAAAACGAGTGTACAAGAAAGGACGCAATAGGGGACTAAGACACCTATTGGCATAATGGCGGAAACTCACTCGACTATGGCGAGCGGCGTCGGCTTGCT 6601 
CCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGCACGA GGCTCGCGTCGCTCAGTCACTCGCTCCTTCGCCTTCTCGCGGGTTATGCGTTTGGCGGAGAGGGGCGCGCAACCGGCTAAGTAATTACGTCGACCGTGCT 6701 CAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAGTTAGCTCACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCT GTCCAAAGGGCTGACCTTTCGCCCGTCACTCGCGTTGCGTTAATTACACTCAATCGAGTGAGTAATCCGTGGGGTCCGAAATGTGAAATACGAAGGCCGA I-SceI ~~~~~~~~~~~~~~~~~~~~ 6801 CGTATGTTGTGTGGAATTGTGAGCGGATAACAATTTCACACAGGAAACAGCTATGACCATGATTACGCCAAGCTTTAGGGATAACAGGGTAATCGCCATG GCATACAACACACCTTAACACTCGCCTATTGTTAAAGTGTGTCCTTTGTCGATACTGGTACTAATGCGGTTCGAAATCCCTATTGTCCCATTAGCGGTAC 6901 CATTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAA GTAATCAATAATTATCATTAGTTAATGCCCCAGTAATCAAGTATCGGGTATATACCTCAAGGCGCAATGTATTGAATGCCATTT pVHentry-Cm5: Esp3I ~~~~~~~ 1 GGTTTAGTGAACCGTCAGATCCGCTAGACGTCTCATATACCTGACTGGAATACGACAGCTCCTGCAGCTTCTGGGCGAAGACCACCGTGGCCCATTGCGT CCAAATCACTTGGCAGTCTAGGCGATCTGCAGAGTATATGGACTGACCTTATGCTGTCGAGGACGTCGAAGACCCGCTTCTGGTGGCACCGGGTAACGCA 101 ACTTAGCGATAATCTGGTCCGCTTGGAAGTTAGCACGGCGAGCGCGCTCCAGAGCCAAGTCACGCAGCTTAACAGTACCTACCGCAGAGCGGTGCATGAA TGAATCGCTATTAGACCAGGCGAACCTTCAATCGTGCCGCTCGCGCGAGGTCTCGGTTCAGTGCGTCGAATTGTCATGGATGGCGTCTCGCCACGTACTT 201 CAGGCCGATAACGTTGTCCTTAGCAACCTTGACATTACCCTCACCTTTATTGGCAGGGAAGACGTGCTTCTGACCAGTAGTGCCCTCACGAGCGGTACCA GTCCGGCTATTGCAACAGGAATCGTTGGAACTGTAATGGGAGTGGAAATAACCGTCCCTTCTGCACGAAGACTGGTCATCACGGGAGTGCTCGCCATGGT 301 GCACCACCAGCGGTGAGGTGCGGAACTTCTACAACCTCAAAGCCCATAACGTTGCGGATAGAACCCTTCTCAGGGTCAATCAGAGCAGCGTAGTTTGCTG CGTGGTGGTCGCCACTCCACGCCTTGAAGATGTTGGAGTTTCGGGTATTGCAACGCCTATCTTGGGAAGAGTCCCAGTTAGTCTCGTCGCATCAAACGAC 401 CGTTCGGCATCAGTGCTGCCAGAATCGCAGAGTAGCTATCTGGGTCACAGTAGAACACACGGTCAGCAGCCGGAACATAGTTCTTGGTCAGAGCCGCACG GCAAGCCGTAGTCACGACGGTCTTAGCGTCTCATCGATAGACCCAGTGTCATCTTGTGTGCCAGTCGTCGGCCTTGTATCAAGAACCAGTCTCGGCGTGC 501 AGCCTTAGTCAGAGCCGCAATAATCTCCTTACCCAGCGCAACTTGGTCGGTAAGTGCGGCCTTGTTCTGAGTGGTCTCAATTACGGTAGCAGTACCTAAG 
TCGGAATCAGTCTCGGCGTTATTAGAGGAATGGGTCGCGTTGAACCAGCCATTCACGCCGGAACAAGACTCACCAGAGTTAATGCCATCGTCATGGATTC 601 CCCTCGATGTTCTCATTATATTTGCTTTCCACGTTACACAGACCGGCAATCTCAGCCAGAACCGCACCATCCGCAGCCATCGCCAGAGATTCACCCAACT GGGAGCTACAAGAGTAATATAAACGAAAGGTGCAATGTGTCTGGCCGTTAGAGTCGGTCTTGGCGTGGTAGGCGTCGGTAGCGGTCTCTAAGTGGGTTGA 701 GAGAGGTATACTCAGAGCGAACGTCGTAGTGGTTCATCGCGTCCTCAATATCATAAATCAGAACGTCAGCCGTCAGGAGACCGTCAATGGTGATTACCTT CTCTCCATATGAGTCTCGCTTGCAGCATCACCAAGTAGCGCAGGAGTTATAGTATTTAGTCTTGCAGTCGGCAGTCCTCTGGCAGTTACCACTAATGGAA 801 CTCGGTGTGTTTGATGTCCTTACGTTTATCGTCGAGGTTCTCGCCCGGAGCCAGATACGCTGCCTGAGTGCGACCCAGAACAGGGAACTGAGCGGATTTA GAGCCACACAAACTACAGGAATGCAAATAGCAGCTCCAAGAGCGGGCCTCGGTCTATGCGACGGACTCACGCTGGGTCTTGTCCCTTGACTCGCCTAAAT 901 CCGCTGGAGATGGAACGTACCATGTGGCGAGAAGTGGTCACGGAGGTACGAGCGAACGCAGTCAGGACTTCACCGCCAAATACCTTCAAGAACAACGCCA GGCGACCTCTACCTTGCATGGTACACCGCTCTTCACCAGTGCCTCCATGCTCGCTTGCGTCAGTCCTGAAGTGGCGGTTTATGGAAGTTCTTGTTGCGGT Esp3I ~~~~~ 1001 GTTTATCTCCAGCAGCAACTACACCTTTACCTTGGTTAGTACCCATTTGCTGTCCACCAGTCATGCTAGCCATATGTATATCTCCTTCTTAAAGTCGTCT CAAATAGAGGTCGTCGTTGATGTGGAAATGGAACCAATCATGGGTAAACGACAGGTGGTCAGTACGATCGGTATACATATAGAGGAAGAATTTCAGCAGA Esp3I ~ 1101 CCAGTGCCTCCACCAAGGGCCCATCGGTCTTCCCCCTGGCGCCCTGCTCCAGGAGCACCTCCGAGAGCACAGCGGCCCTGGGCTGCCTGGTCAAGGACTA GGTCACGGAGGTGGTTCCCGGGTAGCCAGAAGGGGGACCGCGGGACGAGGTCCTCGTGGAGGCTCTCGTGTCGCCGGGACCCGACGGACCAGTTCCTGAT 1201 CTTCCCCGAACCGGTGACGGTGTCGTGGAACTCAGGCGCTCTGACCAGCGGCGTGCACACCTTCCCAGCTGTCCTACAGTCCTCAGGACTCTACTCCCTC GAAGGGGCTTGGCCACTGCCACAGCACCTTGAGTCCGCGAGACTGGTCGCCGCACGTGTGGAAGGGTCGACAGGATGTCAGGAGTCCTGAGATGAGGGAG 1301 AGCAGCGTGGTGACCGTGCCCTCCAGCAGCTTGGGCACCCAGACCTACATCTGCAACGTGAATCACAAGCCCAGCAACACCAAGGTGGACAAGAAAGTTG TCGTCGCACCACTGGCACGGGAGGTCGTCGAACCCGTGGGTCTGGATGTAGACGTTGCACTTAGTGTTCGGGTCGTTGTGGTTCCACCTGTTCTTTCAAC 1401 AGCCCAAATCTTGTGACAAAACTCACACATGCCCACCGTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCMAAACCCAAGGA TCGGGTTTAGAACACTGTTTTGAGTGTGTACGGGTGGCACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAGAAGGAGAAGGGGGGKTTTGGGTTCCT 1501 
CACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTG GTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCAC 1601 GAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATG CTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTAC 1701 GCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTA CGTTCCTCATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCCACAT 1801 CACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAG GTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTC 1901 AGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCA TCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGT 2001 GGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATGAGC CCACCGTCGTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCATTTACTCG 2101 GGCCGCAATTTAATTCCGGTTATTTTCCACCATATTGCCGTCTTTTGGCAATGTGAGGGCCCGGAAACCTGGCCCTGTCTTCTTGACGAGCATTCCTAGG CCGGCGTTAAATTAAGGCCAATAAAAGGTGGTATAACGGCAGAAAACCGTTACACTCCCGGGCCTTTGGACCGGGACAGAAGAACTGCTCGTAAGGATCC 2201 GGTCTTTCCCCTCTCGCCAAAGGAATGCAAGGTCTGTTGAATGTCGTGAAGGAAGCAGTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCGA CCAGAAAGGGGAGAGCGGTTTCCTTACGTTCCAGACAACTTACAGCACTTCCTTCGTCAAGGAGACCTTCGAAGAACTTCTGTTTGTTGCAGACATCGCT 2301 CCCTTTGCAGGCAGCGGAACCCCCCACCTGGCGACAGGTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCACAACCCCAGTG GGGAAACGTCCGTCGCCTTGGGGGGTGGACCGCTGTCCACGGAGACGCCGGTTTTCGGTGCACATATTCTATGTGGACGTTTCCGCCGTGTTGGGGTCAC 2401 CCACGTTGTGAGTTGGATAGTTGTGGAAAGAGTCAAATGGCTCACCTCAAGCGTATTCAACAAGGGGCTGAAGGATGCCCAGAAGGTACCCCATTGTATG 
GGTGCAACACTCAACCTATCAACACCTTTCTCAGTTTACCGAGTGGAGTTCGCATAAGTTGTTCCCCGACTTCCTACGGGTCTTCCATGGGGTAACATAC 2501 GGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGGTTAAAAAACGTCTAGGCCCCCCGAACCACGGGGACGTGGTTTTCCTTTG CCTAGACTAGACCCCGGAGCCACGTGTACGAAATGTACACAAATCAGCTCCAATTTTTTGCAGATCCGGGGGGCTTGGTGCCCCTGCACCAAAAGGAAAC 2601 AAAAACACGATGATAATATGGCCACCACCCATACCTAGGCTTTTGCAAAGATCGATCAAGAGACAGGATGAGGATCGTTTCGCATGATTGAACAAGATGG TTTTTGTGCTACTATTATACCGGTGGTGGGTATGGATCCGAAAACGTTTCTAGCTAGTTCTCTGTCCTACTCCTAGCAAAGCGTACTAACTTGTTCTACC 2701 ATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGACTGGGCACAACAGACAATCGGCTGCTCTGATGCCGCCGTGTTCCGGCTG TAACGTGCGTCCAAGAGGCCGGCGAACCCACCTCTCCGATAAGCCGATACTGACCCGTGTTGTCTGTTAGCCGACGAGACTACGGCGGCACAAGGCCGAC 2801 TCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAAGACCGACCTGTCCGGTGCCCTGAATGAACTGCAAGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGA AGTCGCGTCCCCGCGGGCCAAGAAAAACAGTTCTGGCTGGACAGGCCACGGGACTTACTTGACGTTCTGCTCCGTCGCGCCGATAGCACCGACCGGTGCT 2901 CGGGCGTTCCTTGCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCCTGTCATCTCA GCCCGCAAGGAACGCGTCGACACGAGCTGCAACAGTGACTTCGCCCTTCCCTGACCGACGATAACCCGCTTCACGGCCCCGTCCTAGAGGACAGTAGAGT 3001 CCTTGCTCCTGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTACCTGCCCATTCGACCACCAAGCGAAACAT GGAACGAGGACGGCTCTTTCATAGGTAGTACCGACTACGTTACGCCGCCGACGTATGCGAACTAGGCCGATGGACGGGTAAGCTGGTGGTTCGCTTTGTA 3101 CGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGACGAAGAGCATCAGGGGCTCGCGCCAGCCGAACTGTTCGCCA GCGTAGCTCGCTCGTGCATGAGCCTACCTTCGGCCAGAACAGCTAGTCCTACTAGACCTGCTTCTCGTAGTCCCCGAGCGCGGTCGGCTTGACAAGCGGT 3201 GGCTCAAGGCGAGCATGCCCGACGGCGAGGATCTCGTCGTGACCCATGGCGATGCCTGCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGATT CCGAGTTCCGCTCGTACGGGCTGCCGCTCCTAGAGCAGCACTGGGTACCGCTACGGACGAACGGCTTATAGTACCACCTTTTACCGGCGAAAAGACCTAA 3301 CATCGACTGTGGCCGGCTGGGTGTGGCGGACCGCTATCAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGGGCTGACCGC GTAGCTGACACCGGCCGACCCACACCGCCTGGCGATAGTCCTGTATCGCAACCGATGGGCACTATAACGACTTCTCGAACCGCCGCTTACCCGACTGGCG 3401 
TTCCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACGAGTTCTTCTGAGCGGGACTCTGGGGTTCGGGCC AAGGAGCACGAAATGCCATAGCGGCGAGGGCTAAGCGTCGCGTAGCGGAAGATAGCGGAAGAACTGCTCAAGAAGACTCGCCCTGAGACCCCAAGCCCGG 3501 GCACTCGAGCATAAACTTGTTTATTGCAGCTTATAATGGTTACAAATAAAGCAATAGCATCACAAATTTCACAAATAAAGCATTTTTTTCACTGCATTCT CGTGAGCTCGTATTTGAACAAATAACGTCGAATATTACCAATGTTTATTTCGTTATCGTAGTGTTTAAAGTGTTTATTTCGTAAAAAAAGTGACGTAAGA I-SceI ~~~~~~~~~~~~~~~~~~~~ 3601 AGTTGTGGTTTGTCCAAACTCATCAATGTATCTTAAGTAGGGATAACAGGGTAATTTTGTTAAATCAGCTCATTTTTTAACCAATAGGAACGCCATCAAA TCAACACCAAACAGGTTTGAGTAGTTACATAGAATTCATCCCTATTGTCCCATTAAAACAATTTAGTCGAGTAAAAAATTGGTTATCCTTGCGGTAGTTT 3701 AATAATTCGCGTCTGGCCTTCCTGTAGCCAGCTTTCATCAACATTAAATGTGAGCGAGTAACAACCCGTCGGATTCTCCGTGGGAACAAACGGCGGATTG TTATTAAGCGCAGACCGGAAGGACATCGGTCGAAAGTAGTTGTAATTTACACTCGCTCATTGTTGGGCAGCCTAAGAGGCACCCTTGTTTGCCGCCTAAC 3801 ACCGTAATGGGATAGGTTACGTTGGTGTAGATGGGCGCATCGTAACCGTGCATCTGCCAGTTTGAGGGGACGACGACCGTATCGGCCTCAGGAAGATCGC TGGCATTACCCTATCCAATGCAACCACATCTACCCGCGTAGCATTGGCACGTAGACGGTCAAACTCCCCTGCTGCTGGCATAGCCGGAGTCCTTCTAGCG 3901 ACTCCAGCCAGCTTTCCGGCACCGCTTCTGGTGCCGGAAACCAGGCAAAGCGCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCG TGAGGTCGGTCGAAAGGCCGTGGCGAAGACCACGGCCTTTGGTCCGTTTCGCGGTAAGCGGTAAGTCCGACGCGTTGACAACCCTTCCCGCTAGCCACGC 4001 GGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACGACGTTGTAAAACG CCGGAGAAGCGATAATGCGGTCGACCGCTTTCCCCCTACACGACGTTCCGCTAATTCAACCCATTGCGGTCCCAAAAGGGTCAGTGCTGCAACATTTTGC 4101 ACGGCCAGTGAATTGCAATTCGTAATCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACACAACATACGAGCCGGAAGCATA TGCCGGTCACTTAACGTTAAGCATTAGTACCAGTATCGACAAAGGACACACTTTAACAATAGGCGAGTGTTAAGGTGTGTTGTATGCTCGGCCTTCGTAT I-SceI ~~~~~~~~~~~~~~~~~~~~ 4201 AAGTGTAAAGCCTGGGGTGCCTAATGAGTGAGCTAACTCACATTAATTGCGTTGCGCTCACTGCCATTACCCTGTTATCCCTAGTGAACCATCACCCTAA TTCACATTTCGGACCCCACGGATTACTCACTCGATTGAGTGTAATTAACGCAACGCGAGTGACGGTAATGGGACAATAGGGATCACTTGGTAGTGGGATT 4301 
TCAAGTTTTTTGGGGTCGAGGTGCCGTAAAGCACTAAATCGGAACCCTAAAGGGAGCCCCCGATTTAGAGCTTGACGGGGAAAGCCGGCGAACGTGGCGA AGTTCAAAAAACCCCAGCTCCACGGCATTTCGTGATTTAGCCTTGGGATTTCCCTCGGGGGCTAAATCTCGAACTGCCCCTTTCGGCCGCTTGCACCGCT 4401 GAAAGGAAGGGAAGAAAGCGAAAGGAGCGGGCGCTAGGGCGCTGGCAAGTGTAGCGGTCACGCTGCGCGTAACCACCACACCCGCCGCGCTTAATGCGCC CTTTCCTTCCCTTCTTTCGCTTTCCTCGCCCGCGATCCCGCGACCGTTCACATCGCCAGTGCGACGCGCATTGGTGGTGTGGGCGGCGCGAATTACGCGG 4501 GCTACAGGGCGCGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGA CGATGTCCCGCGCAGTCCACCGTGAAAAGCCCCTTTACACGCGCCTTGGGGATAAACAAATAAAAAGATTTATGTAAGTTTATACATAGGCGAGTACTCT 4601 CAATAACCCTGATAAATGCTTCAATAATAACGACCGGTAATGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCG GTTATTGGGACTATTTACGAAGTTATTATTGCTGGCCATTACTTTTTCCTTCTCATACTCATAAGTTGTAAAGGCACAGCGGGAATAAGGGAAAAAACGC 4701 GCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATC CGTAAAACGGAAGGACAAAAACGAGTGGGTCTTTGCGACCACTTTCATTTTCTACGACTTCTAGTCAACCCACGTGCTCACCCAATGTAGCTTGACCTAG 4801 TCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTAT AGTTGTCGCCATTCTAGGAACTCTCAAAAGCGGGGCTTCTTGCAAAAGGTTACTACTCGTGAAAATTTCAAGACGATACACCGCGCCATAATAGGGCATA 4901 TGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTCTAGCGTTGATCGGCACGTAAGAGGTTCCAACTTTCAC ACTGCGGCCCGTTCTCGTTGAGCCAGCGGCGTATGTGATAAGAGTCTTACTGAACCAACTCAGATCGCAACTAGCCGTGCATTCTCCAAGGTTGAAAGTG 5001 CATAATGAAATAAGATCACTACCGGGCGTATTTTTTGAGTTATCGAGATTTTCAGGAGCTAAGGAAGCTAAAATGGAGAAAAAAATCACTGGATATACCA GTATTACTTTATTCTAGTGATGGCCCGCATAAAAAACTCAATAGCTCTAAAAGTCCTCGATTCCTTCGATTTTACCTCTTTTTTTAGTGACCTATATGGT 5101 CCGTTGATATATCCCAATGGCATCGTAAAGAACATTTTGAGGCATTTCAGTCAGTTGCTCAATGTACCTATAACCAGACCGTTCAGCTGGATATTACGGC GGCAACTATATAGGGTTACCGTAGCATTTCTTGTAAAACTCCGTAAAGTCAGTCAACGAGTTACATGGATATTGGTCTGGCAAGTCGACCTATAATGCCG 5201 CTTTTTAAAGACCGTAAAGAAAAATAAGCACAAGTTTTATCCGGCCTTTATTCACATTCTTGCCCGCCTGATGAATGCTCATCCGGAATTCCGTATGGCA 
GAAAAATTTCTGGCATTTCTTTTTATTCGTGTTCAAAATAGGCCGGAAATAAGTGTAAGAACGGGCGGACTACTTACGAGTAGGCCTTAAGGCATACCGT 5301 ATGAAAGACGGTGAGCTGGTGATATGGGATAGTGTTCACCCTTGTTACACCGTTTTCCATGAGCAAACTGAAACGTTTTCATCGCTCTGGAGTGAATACC TACTTTCTGCCACTCGACCACTATACCCTATCACAAGTGGGAACAATGTGGCAAAAGGTACTCGTTTGACTTTGCAAAAGTAGCGAGACCTCACTTATGG 5401 ACGACGATTTCCGGCAGTTTCTACACATATATTCGCAAGATGTGGCGTGTTACGGTGAAAACCTGGCCTATTTCCCTAAAGGGTTTATTGAGAATATGTT TGCTGCTAAAGGCCGTCAAAGATGTGTATATAAGCGTTCTACACCGCACAATGCCACTTTTGGACCGGATAAAGGGATTTCCCAAATAACTCTTATACAA 5501 TTTCGTATCAGCCAATCCCTGGGTGAGTTTCACCAGTTTTGATTTAAACGTGGCCAATATGGACAACTTCTTCGCCCCCGTTTTCACCATGGGCAAATAT AAAGCATAGTCGGTTAGGGACCCACTCAAAGTGGTCAAAACTAAATTTGCACCGGTTATACCTGTTGAAGAAGCGGGGGCAAAAGTGGTACCCGTTTATA 5601 TATACGCAAGGCGACAAGGTGCTGATGCCGCTGGCGATTCAGGTTCATCATGCCGTCTGTGATGGCTTCCATGTCGGCAGAATGCTTAATGAATTACAAC ATATGCGTTCCGCTGTTCCACGACTACGGCGACCGCTAAGTCCAAGTAGTACGGCAGACACTACCGAAGGTACAGCCGTCTTACGAATTACTTAATGTTG 5701 AGTACTGCGATGAGTGGCAGGGCGGGGCGTAATTTTTTTAAGGCAGTTATTGGTGCCCTTAAACGCCTGGTGCTACGCCTGAATAAGTGATAATAAGCGG TCATGACGCTACTCACCGTCCCGCCCCGCATTAAAAAAATTCCGTCAATAACCACGGGAATTTGCGGACCACGATGCGGACTTATTCACTATTATTCGCC 5801 ATGAATGGCAGAAATTCGAAATGACCGACCAAGCGACGCCCAACCTGCCATCACGAGATTTCGATTCCACCGCCGCCTTCTATGAAAGGTTGGGCTTCGG TACTTACCGTCTTTAAGCTTTACTGGCTGGTTCGCTGCGGGTTGGACGGTAGTGCTCTAAAGCTAAGGTGGCGGCGGAAGATACTTTCCAACCCGAAGCC 5901 AATCGTTTTCCGGGACGCCGGCTGGATGATCCTCCAGCGCGGGGATCTCATGCTGGAGTTCTTCGCCCACCCTAGGGGGAGGCTAACTGAAACACGGAAG TTAGCAAAAGGCCCTGCGGCCGACCTACTAGGAGGTCGCGCCCCTAGAGTACGACCTCAAGAAGCGGGTGGGATCCCCCTCCGATTGACTTTGTGCCTTC 6001 GAGACAATACCGGAAGGAACCCGCGCTATGACGGCAATAAAAAGACAGAATAAAACGCACGGTGTTGGGTCGTTTGTTCATAAACGCGGGGTTCGGTCCC CTCTGTTATGGCCTTCCTTGGGCGCGATACTGCCGTTATTTTTCTGTCTTATTTTGCGTGCCACAACCCAGCAAACAAGTATTTGCGCCCCAAGCCAGGG 6101 AGGGCTGGCACTCTGTCGATACCCCACCGAGACCCCATTGGGGCCAATACGCCCGCGTTTCTTCCTTTTCCCCACCCCACCCCCCAAGTTCGGGTGAAGG TCCCGACCGTGAGACAGCTATGGGGTGGCTCTGGGGTAACCCCGGTTATGCGGGCGCAAAGAAGGAAAAGGGGTGGGGTGGGGGGTTCAAGCCCACTTCC 6201 
CCCAGGGCTCGCAGCCAACGTCGGGGCGGCAGGCCCTGCCATAGCCTCAGGTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAA GGGTCCCGAGCGTCGGTTGCAGCCCCGCCGTCCGGGACGGTATCGGAGTCCAATGAGTATATATGAAATCTAACTAAATTTTGAAGTAAAAATTAAATTT 6301 AGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAG TCCTAGATCCACTTCTAGGAAAAACTATTAGAGTACTGGTTTTAGGGAATTGCACTCAAAAGCAAGGTGACTCGCAGTCTGGGGCATCTTTTCTAGTTTC 6401 GATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACC CTAGAAGAACTCTAGGAAAAAAAGACGCGCATTAGACGACGAACGTTTGTTTTTTTGGTGGCGATGGTCGCCACCAAACAAACGGCCTAGTTCTCGATGG 6501 AACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTA TTGAGAAAAAGGCTTCCATTGACCGAAGTCGTCTCGCGTCTATGGTTTATGACAGGAAGATCACATCGGCATCAATCCGGTGGTGAAGTTCTTGAGACAT 6601 GCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTAC CGTGGCGGATGTATGGAGCGAGACGATTAGGACAATGGTCACCGACGAGCGTCACCGCTATTCAGCACAGAATGGCCCAACCTGAGTTCTGCTATCAATG 6701 CGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCT GCCTATTCCGCGTCGCCAGCCCGACTTGCCCCCCAAGCACGTGTGTCGGGTCGAACCTCGCTTGCTGGATGTGGCTTGACTCTATGGATGTCGCACTCGA 6801 ATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGA TACTCTTTCGCGGTGCGAAGGGCTTCCCTCTTTCCGCCTGTCCATAGGCCATTCGCCGTCCCAGCCTTGTCCTCTCGCGTGCTCCCTCGAAGGTCCCCCT 6901 AACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACG TTGCGGACCATAGAAATATCAGGACAGCCCAAAGCGGTGGAGACTGAACTCGCAGCTAAAAACACTACGAGCAGTCCCCCCGCCTCGGATACCTTTTTGC 7001 CCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTAC GGTCGTTGCGCCGGAAAAATGCCAAGGACCGGAAAACGACCGGAAAACGAGTGTACAAGAAAGGACGCAATAGGGGACTAAGACACCTATTGGCATAATG 7101 CGCCATGCATTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCC 
GCGGTACGTAATCAATAATTATCATTAGTTAATGCCCCAGTAATCAAGTATCGGGTATATACCTCAAGGCGCAATGTATTGAATGCCATTTACCGGGCGG 7201 TGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAG ACCGACTGGCGGGTTGCTGGGGGCGGGTAACTGCAGTTATTACTGCATACAAGGGTATCATTGCGGTTATCCCTGAAAGGTAACTGCAGTTACCCACCTC 7301 TATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATT ATAAATGCCATTTGACGGGTGAACCGTCATGTAGTTCACATAGTATACGGTTCATGCGGGGGATAACTGCAGTTACTGCCATTTACCGGGCGGACCGTAA 7401 ATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAA TACGGGTCATGTACTGGAATACCCTGAAAGGATGAACCGTCATGTAGATGCATAATCAGTAGCGATAATGGTACCACTACGCCAAAACCGTCATGTAGTT 7501 TGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGGGACTTTCCA ACCCGCACCTATCGCCAAACTGAGTGCCCCTAAAGGTTCAGAGGTGGGGTAACTGCAGTTACCCTCAAACAAAACCGTGGTTTTAGTTGCCCTGAAAGGT 7601 AAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCT TTTACAGCATTGTTGAGGCGGGGTAACTGCGTTTACCCGCCATCCGCACATGCCACCCTCCAGATATATTCGTCTCGA pVHentry-GFP1 Esp3I ~~~~~~~ 1 GGTTTAGTGAACCGTCAGATCCGCTAGACGTCTCATATACCTGACTGGAATACGACAGCTCCTGCAGCTTCTGGGCGAAGACCACCGTGGCCCATTGCGT CCAAATCACTTGGCAGTCTAGGCGATCTGCAGAGTATATGGACTGACCTTATGCTGTCGAGGACGTCGAAGACCCGCTTCTGGTGGCACCGGGTAACGCA 101 ACTTAGCGATAATCTGGTCCGCTTGGAAGTTAGCACGGCGAGCGCGCTCCAGAGCCAAGTCACGCAGCTTAACAGTACCTACCGCAGAGCGGTGCATGAA TGAATCGCTATTAGACCAGGCGAACCTTCAATCGTGCCGCTCGCGCGAGGTCTCGGTTCAGTGCGTCGAATTGTCATGGATGGCGTCTCGCCACGTACTT 201 CAGGCCGATAACGTTGTCCTTAGCAACCTTGACATTACCCTCACCTTTATTGGCAGGGAAGACGTGCTTCTGACCAGTAGTGCCCTCACGAGCGGTACCA GTCCGGCTATTGCAACAGGAATCGTTGGAACTGTAATGGGAGTGGAAATAACCGTCCCTTCTGCACGAAGACTGGTCATCACGGGAGTGCTCGCCATGGT 301 GCACCACCAGCGGTGAGGTGCGGAACTTCTACAACCTCAAAGCCCATAACGTTGCGGATAGAACCCTTCTCAGGGTCAATCAGAGCAGCGTAGTTTGCTG CGTGGTGGTCGCCACTCCACGCCTTGAAGATGTTGGAGTTTCGGGTATTGCAACGCCTATCTTGGGAAGAGTCCCAGTTAGTCTCGTCGCATCAAACGAC 401 
CGTTCGGCATCAGTGCTGCCAGAATCGCAGAGTAGCTATCTGGGTCACAGTAGAACACACGGTCAGCAGCCGGAACATAGTTCTTGGTCAGAGCCGCACG GCAAGCCGTAGTCACGACGGTCTTAGCGTCTCATCGATAGACCCAGTGTCATCTTGTGTGCCAGTCGTCGGCCTTGTATCAAGAACCAGTCTCGGCGTGC 501 AGCCTTAGTCAGAGCCGCAATAATCTCCTTACCCAGCGCAACTTGGTCGGTAAGTGCGGCCTTGTTCTGAGTGGTCTCAATTACGGTAGCAGTACCTAAG TCGGAATCAGTCTCGGCGTTATTAGAGGAATGGGTCGCGTTGAACCAGCCATTCACGCCGGAACAAGACTCACCAGAGTTAATGCCATCGTCATGGATTC 601 CCCTCGATGTTCTCATTATATTTGCTTTCCACGTTACACAGACCGGCAATCTCAGCCAGAACCGCACCATCCGCAGCCATCGCCAGAGATTCACCCAACT GGGAGCTACAAGAGTAATATAAACGAAAGGTGCAATGTGTCTGGCCGTTAGAGTCGGTCTTGGCGTGGTAGGCGTCGGTAGCGGTCTCTAAGTGGGTTGA 701 GAGAGGTATACTCAGAGCGAACGTCGTAGTGGTTCATCGCGTCCTCAATATCATAAATCAGAACGTCAGCCGTCAGGAGACCGTCAATGGTGATTACCTT CTCTCCATATGAGTCTCGCTTGCAGCATCACCAAGTAGCGCAGGAGTTATAGTATTTAGTCTTGCAGTCGGCAGTCCTCTGGCAGTTACCACTAATGGAA 801 CTCGGTGTGTTTGATGTCCTTACGTTTATCGTCGAGGTTCTCGCCCGGAGCCAGATACGCTGCCTGAGTGCGACCCAGAACAGGGAACTGAGCGGATTTA GAGCCACACAAACTACAGGAATGCAAATAGCAGCTCCAAGAGCGGGCCTCGGTCTATGCGACGGACTCACGCTGGGTCTTGTCCCTTGACTCGCCTAAAT 901 CCGCTGGAGATGGAACGTACCATGTGGCGAGAAGTGGTCACGGAGGTACGAGCGAACGCAGTCAGGACTTCACCGCCAAATACCTTCAAGAACAACGCCA GGCGACCTCTACCTTGCATGGTACACCGCTCTTCACCAGTGCCTCCATGCTCGCTTGCGTCAGTCCTGAAGTGGCGGTTTATGGAAGTTCTTGTTGCGGT Esp3I ~~~~~ 1001 GTTTATCTCCAGCAGCAACTACACCTTTACCTTGGTTAGTACCCATTTGCTGTCCACCAGTCATGCTAGCCATATGTATATCTCCTTCTTAAAGTCGTCT CAAATAGAGGTCGTCGTTGATGTGGAAATGGAACCAATCATGGGTAAACGACAGGTGGTCAGTACGATCGGTATACATATAGAGGAAGAATTTCAGCAGA Esp3I ~ 1101 CCAGTGCCTCCACCAAGGGCCCATCGGTCTTCCCCCTGGCGCCCTGCTCCAGGAGCACCTCCGAGAGCACAGCGGCCCTGGGCTGCCTGGTCAAGGACTA GGTCACGGAGGTGGTTCCCGGGTAGCCAGAAGGGGGACCGCGGGACGAGGTCCTCGTGGAGGCTCTCGTGTCGCCGGGACCCGACGGACCAGTTCCTGAT 1201 CTTCCCCGAACCGGTGACGGTGTCGTGGAACTCAGGCGCTCTGACCAGCGGCGTGCACACCTTCCCAGCTGTCCTACAGTCCTCAGGACTCTACTCCCTC GAAGGGGCTTGGCCACTGCCACAGCACCTTGAGTCCGCGAGACTGGTCGCCGCACGTGTGGAAGGGTCGACAGGATGTCAGGAGTCCTGAGATGAGGGAG 1301 AGCAGCGTGGTGACCGTGCCCTCCAGCAGCTTGGGCACCCAGACCTACATCTGCAACGTGAATCACAAGCCCAGCAACACCAAGGTGGACAAGAAAGTTG 
TCGTCGCACCACTGGCACGGGAGGTCGTCGAACCCGTGGGTCTGGATGTAGACGTTGCACTTAGTGTTCGGGTCGTTGTGGTTCCACCTGTTCTTTCAAC 1401 AGCCCAAATCTTGTGACAAAACTCACACATGCCCACCGTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCMAAACCCAAGGA TCGGGTTTAGAACACTGTTTTGAGTGTGTACGGGTGGCACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAGAAGGAGAAGGGGGGKTTTGGGTTCCT 1501 CACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTG GTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCAC 1601 GAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATG CTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTAC 1701 GCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTA CGTTCCTCATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCCACAT 1801 CACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTACCCCAGCGACATCGCCGTGGAGTGGGAG GTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATGGGGTCGCTGTAGCGGCACCTCACCCTC 1901 AGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCATGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCA TCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGTACGACCTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGT 2001 GGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAAGGGAG CCACCGTCGTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCATTTCCCTC 2101 CTCGCCAGATAAGTGGTCAGATCCACCGGTCGCCACCATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGC GAGCGGTCTATTCACCAGTCTAGGTGGCCAGCGGTGGTACCACTCGTTCCCGCTCCTCGACAAGTGGCCCCACCACGGGTAGGACCAGCTCGACCTGCCG 2201 GACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGC CTGCATTTGCCGGTGTTCAAGTCGCACAGGCCGCTCCCGCTCCCGCTACGGTGGATGCCGTTCGACTGGGACTTCAAGTAGACGTGGTGGCCGTTCGACG 2301 
CCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGC GGCACGGGACCGGGTGGGAGCACTGGTGGGACTGGATGCCGCACGTCACGAAGTCGGCGATGGGGCTGGTGTACTTCGTCGTGCTGAAGAAGTTCAGGCG 2401 CATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTG GTACGGGCTTCCGATGCAGGTCCTCGCGTGGTAGAAGAAGTTCCTGCTGCCGTTGATGTTCTGGGCGCGGCTCCACTTCAAGCTCCCGCTGTGGGACCAC 2501 AACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGG TTGGCGTAGCTCGACTTCCCGTAGCTGAAGTTCCTCCTGCCGTTGTAGGACCCCGTGTTCGACCTCATGTTGATGTTGTCGGTGTTGCAGATATAGTACC 2601 CCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCC GGCTGTTCGTCTTCTTGCCGTAGTTCCACTTGAAGTTCTAGGCGGTGTTGTAGCTCCTGCCGTCGCACGTCGAGCGGCTGGTGATGGTCGTCTTGTGGGG 2701 CATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTG GTAGCCGCTGCCGGGGCACGACGACGGGCTGTTGGTGATGGACTCGTGGGTCAGGCGGGACTCGTTTCGTGGGTTGCTCTTCGCGCTAGTGTACCAGGAC 2801 CTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGTAAAGCGGCCGCAATTTAATTCCGGTTATTTTCCACCATATTGCCG GACCTCAAGCACTGGCGGCGGCCCTAGTGAGAGCCGTACCTGCTCGACATGTTCATTTCGCCGGCGTTAAATTAAGGCCAATAAAAGGTGGTATAACGGC 2901 TCTTTTGGCAATGTGAGGGCCCGGAAACCTGGCCCTGTCTTCTTGACGAGCATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAGGTCTGTTGA AGAAAACCGTTACACTCCCGGGCCTTTGGACCGGGACAGAAGAACTGCTCGTAAGGATCCCCAGAAAGGGGAGAGCGGTTTCCTTACGTTCCAGACAACT 3001 ATGTCGTGAAGGAAGCAGTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGGCAGCGGAACCCCCCACCTGGCGACAGGTG TACAGCACTTCCTTCGTCAAGGAGACCTTCGAAGAACTTCTGTTTGTTGCAGACATCGCTGGGAAACGTCCGTCGCCTTGGGGGGTGGACCGCTGTCCAC 3101 CCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCACAACCCCAGTGCCACGTTGTGAGTTGGATAGTTGTGGAAAGAGTCAAATGG GGAGACGCCGGTTTTCGGTGCACATATTCTATGTGGACGTTTCCGCCGTGTTGGGGTCACGGTGCAACACTCAACCTATCAACACCTTTCTCAGTTTACC 3201 CTCACCTCAAGCGTATTCAACAAGGGGCTGAAGGATGCCCAGAAGGTACCCCATTGTATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTG 
GAGTGGAGTTCGCATAAGTTGTTCCCCGACTTCCTACGGGTCTTCCATGGGGTAACATACCCTAGACTAGACCCCGGAGCCACGTGTACGAAATGTACAC 3301 TTTAGTCGAGGTTAAAAAACGTCTAGGCCCCCCGAACCACGGGGACGTGGTTTTCCTTTGAAAAACACGATGATAATATGGCCACCACCCATACCTAGGC AAATCAGCTCCAATTTTTTGCAGATCCGGGGGGCTTGGTGCCCCTGCACCAAAAGGAAACTTTTTGTGCTACTATTATACCGGTGGTGGGTATGGATCCG 3401 TTTTGCAAAGATCGATCAAGAGACAGGATGAGGATCGTTTCGCATGATTGAACAAGATGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTA AAAACGTTTCTAGCTAGTTCTCTGTCCTACTCCTAGCAAAGCGTACTAACTTGTTCTACCTAACGTGCGTCCAAGAGGCCGGCGAACCCACCTCTCCGAT 3501 TTCGGCTATGACTGGGCACAACAGACAATCGGCTGCTCTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAAGACCGACC AAGCCGATACTGACCCGTGTTGTCTGTTAGCCGACGAGACTACGGCGGCACAAGGCCGACAGTCGCGTCCCCGCGGGCCAAGAAAAACAGTTCTGGCTGG 3601 TGTCCGGTGCCCTGAATGAACTGCAAGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCCTTGCGCAGCTGTGCTCGACGTTGTCACTGA ACAGGCCACGGGACTTACTTGACTGGCTGCTCCGTCGCGCCGATAGCACCGACCGGTGCTGCCCGCAAGGAACGCGTCGACACGAGCTGCAACAGTGACT 3701 AGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCCTGTCATCTCACCTTGCTCCTGCCGAGAAAGTATCCATCATGGCTGATGCA TCGCCCTTCCCTGACCGACGATAACCCGCTTCACGGCCCCGTCCTAGAGGACAGTAGAGTGGAACGAGGACGGCTCTTTCATAGGTAGTACCGACTACGT 3801 ATGCGGCGGCTGCATACGCTTGATCCGGCTACCTGCCCATTCGACCACCAAGCGAAACATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTG TACGCCGCCGACGTATGCGAACTAGGCCGATGGACGGGTAAGCTGGTGGTTCGCTTTGTAGCGTAGCTCGCTCGTGCATGAGCCTACCTTCGGCCAGAAC 3901 TCGATCAGGATGATCTGGACGAAGAGCATCAGGGGCTCGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGAGCATGCCCGACGGCGAGGATCTCGTCGT AGCTAGTCCTACTAGACCTGCTTCTCGTAGTCCCCGAGCGCGGTCGGCTTGACAAGCGGTCCGAGTTCCGCTCGTACGGGCTGCCGCTCCTAGAGCAGCA 4001 GACCCATGGCGATGCCTGCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCATCGACTGTGGCCGGCTGGGTGTGGCGGACCGCTATCAG CTGGGTACCGCTACGGACGAACGGCTTATAGTACCACCTTTTACCGGCGAAAAGACCTAAGTAGCTGACACCGGCCGACCCACACCGCCTGGCGATAGTC 4101 GACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGGGCTGACCGCTTCCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGC CTGTATCGCAACCGATGGGCACTATAACGACTTCTCGAACCGCCGCTTACCCGACTGGCGAAGGAGCACGAAATGCCATAGCGGCGAGGGCTAAGCGTCG 4201 
GCATCGCCTTCTATCGCCTTCTTGACGAGTTCTTCTGAGCGGGACTCTGGGGTTCGGGCCGCACTCGAGCATAAACTTGTTTATTGCAGCTTATAATGGT CGTAGCGGAAGATAGCGGAAGAACTGCTCAAGAAGACTCGCCCTGAGACCCCAAGCCCGGCGTGAGCTCGTATTTGAACAAATAACGTCGAATATTACCA I- SceI ~~~ 4301 TACAAATAAAGCAATAGCATCACAAATTTCACAAATAAAGCATTTTTTTCACTGCATTCTAGTTGTGGTTTGTCCAAACTCATCAATGTATCTTAAGTAG ATGTTTATTTCGTTATCGTAGTGTTTAAAGTGTTTATTTCGTAAAAAAAGTGACGTAAGATCAACACCAAACAGGTTTGAGTAGTTACATAGAATTCATC I-SceI ~~~~~~~~~~~~~~~~ 4401 GGATAACAGGGTAATTTTGTTAAATCAGCTCATTTTTTAACCAATAGGAACGCCATCAAAAATAATTCGCGTCTGGCCTTCCTGTAGCCAGCTTTCATCA CCTATTGTCCCATTAAAACAATTTAGTCGAGTAAAAAATTGGTTATCCTTGCGGTAGTTTTTATTAAGCGCAGACCGGAAGGACATCGGTCGAAAGTAGT 4501 ACATTAAATGTGAGCGAGTAACAACCCGTCGGATTCTCCGTGGGAACAAACGGCGGATTGACCGTAATGGGATAGGTTACGTTGGTGTAGATGGGCGCAT TGTAATTTACACTCGCTCATTGTTGGGCAGCCTAAGAGGCACCCTTGTTTGCCGCCTAACTGGCATTACCCTATCCAATGCAACCACATCTACCCGCGTA 4601 CGTAACCGTGCATCTGCCAGTTTGAGGGGACGACGACCGTATCGGCCTCAGGAAGATCGCACTCCAGCCAGCTTTCCGGCACCGCTTCTGGTGCCGGAAA GCATTGGCACGTAGACGGTCAAACTCCCCTGCTGCTGGCATAGCCGGAGTCCTTCTAGCGTGAGGTCGGTCGAAAGGCCGTGGCGAAGACCACGGCCTTT 4701 CCAGGCAAAGCGCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGATGT GGTCCGTTTCGCGGTAAGCGGTAAGTCCGACGCGTTGACAACCCTTCCCGCTAGCCACGCCCGGAGAAGCGATAATGCGGTCGACCGCTTTCCCCCTACA 4801 GCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACGACGTTGTAAAACGACGGCCAGTGAATTGCAATTCGTAATCATGGTCATAGCTG CGACGTTCCGCTAATTCAACCCATTGCGGTCCCAAAAGGGTCAGTGCTGCAACATTTTGCTGCCGGTCACTTAACGTTAAGCATTAGTACCAGTATCGAC 4901 TTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACACAACATACGAGCCGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAATGAGTGAGCTAACTCA AAAGGACACACTTTAACAATAGGCGAGTGTTAAGGTGTGTTGTATGCTCGGCCTTCGTATTTCACATTTCGGACCCCACGGATTACTCACTCGATTGAGT I-SceI ~~~~~~~~~~~~~~~~~~~~ 5001 CATTAATTGCGTTGCGCTCACTGCCATTACCCTGTTATCCCTAGTGAACCATCACCCTAATCAAGTTTTTTGGGGTCGAGGTGCCGTAAAGCACTAAATC GTAATTAACGCAACGCGAGTGACGGTAATGGGACAATAGGGATCACTTGGTAGTGGGATTAGTTCAAAAAACCCCAGCTCCACGGCATTTCGTGATTTAG 5101 
GGAACCCTAAAGGGAGCCCCCGATTTAGAGCTTGACGGGGAAAGCCGGCGAACGTGGCGAGAAAGGAAGGGAAGAAAGCGAAAGGAGCGGGCGCTAGGGC CCTTGGGATTTCCCTCGGGGGCTAAATCTCGAACTGCCCCTTTCGGCCGCTTGCACCGCTCTTTCCTTCCCTTCTTTCGCTTTCCTCGCCCGCGATCCCG 5201 GCTGGCAAGTGTAGCGGTCACGCTGCGCGTAACCACCACACCCGCCGCGCTTAATGCGCCGCTACAGGGCGCGTCAGGTGGCACTTTTCGGGGAAATGTG CGACCGTTCACATCGCCAGTGCGACGCGCATTGGTGGTGTGGGCGGCGCGAATTACGCGGCGATGTCCCGCGCAGTCCACCGTGAAAAGCCCCTTTACAC 5301 CGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATAACGACCGGTAA GCGCCTTGGGGATAAACAAATAAAAAGATTTATGTAAGTTTATACATAGGCGAGTACTCTGTTATTGGGACTATTTACGAAGTTATTATTGCTGGCCATT 5401 TGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGG ACTTTTTCCTTCTCATACTCATAAGTTGTAAAGGCACAGCGGGAATAAGGGAAAAAACGCCGTAAAACGGAAGGACAAAAACGAGTGGGTCTTTGCGACC 5501 TGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGA ACTTTCATTTTCTACGACTTCTAGTCAACCCACGTGCTCACCCAATGTAGCTTGACCTAGAGTTGTCGCCATTCTAGGAACTCTCAAAAGCGGGGCTTCT 5601 ACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTAT TGCAAAAGGTTACTACTCGTGAAAATTTCAAGACGATACACCGCGCCATAATAGGGCATAACTGCGGCCCGTTCTCGTTGAGCCAGCGGCGTATGTGATA 5701 TCTCAGAATGACTTGGTTGAGTCTAGCGTTGATCGGCACGTAAGAGGTTCCAACTTTCACCATAATGAAATAAGATCACTACCGGGCGTATTTTTTGAGT AGAGTCTTACTGAACCAACTCAGATCGCAACTAGCCGTGCATTCTCCAAGGTTGAAAGTGGTATTACTTTATTCTAGTGATGGCCCGCATAAAAAACTCA 5801 TATCGAGATTTTCAGGAGCTAAGGAAGCTAAAATGGAGAAAAAAATCACTGGATATACCACCGTTGATATATCCCAATGGCATCGTAAAGAACATTTTGA ATAGCTCTAAAAGTCCTCGATTCCTTCGATTTTACCTCTTTTTTTAGTGACCTATATGGTGGCAACTATATAGGGTTACCGTAGCATTTCTTGTAAAACT 5901 GGCATTTCAGTCAGTTGCTCAATGTACCTATAACCAGACCGTTCAGCTGGATATTACGGCCTTTTTAAAGACCGTAAAGAAAAATAAGCACAAGTTTTAT CCGTAAAGTCAGTCAACGAGTTACATGGATATTGGTCTGGCAAGTCGACCTATAATGCCGGAAAAATTTCTGGCATTTCTTTTTATTCGTGTTCAAAATA 6001 CCGGCCTTTATTCACATTCTTGCCCGCCTGATGAATGCTCATCCGGAATTCCGTATGGCAATGAAAGACGGTGAGCTGGTGATATGGGATAGTGTTCACC 
GGCCGGAAATAAGTGTAAGAACGGGCGGACTACTTACGAGTAGGCCTTAAGGCATACCGTTACTTTCTGCCACTCGACCACTATACCCTATCACAAGTGG 6101 CTTGTTACACCGTTTTCCATGAGCAAACTGAAACGTTTTCATCGCTCTGGAGTGAATACCACGACGATTTCCGGCAGTTTCTACACATATATTCGCAAGA GAACAATGTGGCAAAAGGTACTCGTTTGACTTTGCAAAAGTAGCGAGACCTCACTTATGGTGCTGCTAAAGGCCGTCAAAGATGTGTATATAAGCGTTCT 6201 TGTGGCGTGTTACGGTGAAAACCTGGCCTATTTCCCTAAAGGGTTTATTGAGAATATGTTTTTCGTATCAGCCAATCCCTGGGTGAGTTTCACCAGTTTT ACACCGCACAATGCCACTTTTGGACCGGATAAAGGGATTTCCCAAATAACTCTTATACAAAAAGCATAGTCGGTTAGGGACCCACTCAAAGTGGTCAAAA 6301 GATTTAAACGTGGCCAATATGGACAACTTCTTCGCCCCCGTTTTCACCATGGGCAAATATTATACGCAAGGCGACAAGGTGCTGATGCCGCTGGCGATTC CTAAATTTGCACCGGTTATACCTGTTGAAGAAGCGGGGGCAAAAGTGGTACCCGTTTATAATATGCGTTCCGCTGTTCCACGACTACGGCGACCGCTAAG 6401 AGGTTCATCATGCCGTCTGTGATGGCTTCCATGTCGGCAGAATGCTTAATGAATTACAACAGTACTGCGATGAGTGGCAGGGCGGGGCGTAATTTTTTTA TCCAAGTAGTACGGCAGACACTACCGAAGGTACAGCCGTCTTACGAATTACTTAATGTTGTCATGACGCTACTCACCGTCCCGCCCCGCATTAAAAAAAT 6501 AGGCAGTTATTGGTGCCCTTAAACGCCTGGTGCTACGCCTGAATAAGTGATAATAAGCGGATGAATGGCAGAAATTCGAAATGACCGACCAAGCGACGCC TCCGTCAATAACCACGGGAATTTGCGGACCACGATGCGGACTTATTCACTATTATTCGCCTACTTACCGTCTTTAAGCTTTACTGGCTGGTTCGCTGCGG 6601 CAACCTGCCATCACGAGATTTCGATTCCACCGCCGCCTTCTATGAAAGGTTGGGCTTCGGAATCGTTTTCCGGGACGCCGGCTGGATGATCCTCCAGCGC GTTGGACGGTAGTGCTCTAAAGCTAAGGTGGCGGCGGAAGATACTTTCCAACCCGAAGCCTTAGCAAAAGGCCCTGCGGCCGACCTACTAGGAGGTCGCG 6701 GGGGATCTCATGCTGGAGTTCTTCGCCCACCCTAGGGGGAGGCTAACTGAAACACGGAAGGAGACAATACCGGAAGGAACCCGCGCTATGACGGCAATAA CCCCTAGAGTACGACCTCAAGAAGCGGGTGGGATCCCCCTCCGATTGACTTTGTGCCTTCCTCTGTTATGGCCTTCCTTGGGCGCGATACTGCCGTTATT 6801 AAAGACAGAATAAAACGCACGGTGTTGGGTCGTTTGTTCATAAACGCGGGGTTCGGTCCCAGGGCTGGCACTCTGTCGATACCCCACCGAGACCCCATTG TTTCTGTCTTATTTTGCGTGCCACAACCCAGCAAACAAGTATTTGCGCCCCAAGCCAGGGTCCCGACCGTGAGACAGCTATGGGGTGGCTCTGGGGTAAC 6901 GGGCCAATACGCCCGCGTTTCTTCCTTTTCCCCACCCCACCCCCCAAGTTCGGGTGAAGGCCCAGGGCTCGCAGCCAACGTCGGGGCGGCAGGCCCTGCC CCCGGTTATGCGGGCGCAAAGAAGGAAAAGGGGTGGGGTGGGGGGTTCAAGCCCACTTCCGGGTCCCGAGCGTCGGTTGCAGCCCCGCCGTCCGGGACGG 7001 
ATAGCCTCAGGTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCA TATCGGAGTCCAATGAGTATATATGAAATCTAACTAAATTTTGAAGTAAAAATTAAATTTTCCTAGATCCACTTCTAGGAAAAACTATTAGAGTACTGGT 7101 AAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTG TTTAGGGAATTGCACTCAAAAGCAAGGTGACTCGCAGTCTGGGGCATCTTTTCTAGTTTCCTAGAAGAACTCTAGGAAAAAAAGACGCGCATTAGACGAC 7201 CTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAG GAACGTTTGTTTTTTTGGTGGCGATGGTCGCCACCAAACAAACGGCCTAGTTCTCGATGGTTGAGAAAAAGGCTTCCATTGACCGAAGTCGTCTCGCGTC 7301 ATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAG TATGGTTTATGACAGGAAGATCACATCGGCATCAATCCGGTGGTGAAGTTCTTGAGACATCGTGGCGGATGTATGGAGCGAGACGATTAGGACAATGGTC 7401 TGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTG ACCGACGACGGTCACCGCTATTCAGCACAGAATGGCCCAACCTGAGTTCTGCTATCAATGGCCTATTCCGCGTCGCCAGCCCGACTTGCCCCCCAAGCAC 7501 CACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGAC GTGTGTCGGGTCGAACCTCGCTTGCTGGATGTGGCTTGACTCTATGGATGTCGCACTCGATACTCTTTCGCGGTGCGAAGGGCTTCCCTCTTTCCGCCTG 7601 AGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACC TCCATAGGCCATTCGCCGTCCCAGCCTTGTCCTCTCGCGTGCTCCCTCGAAGGTCCCCCTTTGCGGACCATAGAAATATCAGGACAGCCCAAAGCGGTGG 7701 TCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTG AGACTGAACTCGCAGCTAAAAACACTACGAGCAGTCCCCCCGCCTCGGATACCTTTTTGCGGTCGTTGCGCCGGAAAAATGCCAAGGACCGGAAAACGAC 7801 GCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCATGCATTAGTTATTAATAGTAATCAATTACGGGGTC CGGAAAACGAGTGTACAAGAAAGGACGCAATAGGGGACTAAGACACCTATTGGCATAATGGCGGTACGTAATCAATAATTATCATTAGTTAATGCCCCAG 7901 ATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATA 
TAATCAAGTATCGGGTATATACCTCAAGGCGCAATGTATTGAATGCCATTTACCGGGCGGACCGACTGGCGGGTTGCTGGGGGCGGGTAACTGCAGTTAT 8001 ATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGT TACTGCATACAAGGGTATCATTGCGGTTATCCCTGAAAGGTAACTGCAGTTACCCACCTCATAAATGCCATTTGACGGGTGAACCGTCATGTAGTTCACA 8101 ATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCA TAGTATACGGTTCATGCGGGGGATAACTGCAGTTACTGCCATTTACCGGGCGGACCGTAATACGGGTCATGTACTGGAATACCCTGAAAGGATGAACCGT 8201 GTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGT CATGTAGATGCATAATCAGTAGCGATAATGGTACCACTACGCCAAAACCGTCATGTAGTTACCCGCACCTATCGCCAAACTGAGTGCCCCTAAAGGTTCA 8301 CTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCG GAGGTGGGGTAACTGCAGTTACCCTCAAACAAAACCGTGGTTTTAGTTGCCCTGAAAGGTTTTACAGCATTGTTGAGGCGGGGTAACTGCGTTTACCCGC 8401 GTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCT CATCCGCACATGCCACCCTCCAGATATATTCGTCTCGA pVHentry-MLuc7 Esp3I ~~~~~~~ 1 GGTTTAGTGAACCGTCAGATCCGCTAGACGTCTCATATACCTGACTGGAATACGACAGCTCCTGCAGCTTCTGGGCGAAGACCACCGTGGCCCATTGCGT CCAAATCACTTGGCAGTCTAGGCGATCTGCAGAGTATATGGACTGACCTTATGCTGTCGAGGACGTCGAAGACCCGCTTCTGGTGGCACCGGGTAACGCA 101 ACTTAGCGATAATCTGGTCCGCTTGGAAGTTAGCACGGCGAGCGCGCTCCAGAGCCAAGTCACGCAGCTTAACAGTACCTACCGCAGAGCGGTGCATGAA TGAATCGCTATTAGACCAGGCGAACCTTCAATCGTGCCGCTCGCGCGAGGTCTCGGTTCAGTGCGTCGAATTGTCATGGATGGCGTCTCGCCACGTACTT 201 CAGGCCGATAACGTTGTCCTTAGCAACCTTGACATTACCCTCACCTTTATTGGCAGGGAAGACGTGCTTCTGACCAGTAGTGCCCTCACGAGCGGTACCA GTCCGGCTATTGCAACAGGAATCGTTGGAACTGTAATGGGAGTGGAAATAACCGTCCCTTCTGCACGAAGACTGGTCATCACGGGAGTGCTCGCCATGGT 301 GCACCACCAGCGGTGAGGTGCGGAACTTCTACAACCTCAAAGCCCATAACGTTGCGGATAGAACCCTTCTCAGGGTCAATCAGAGCAGCGTAGTTTGCTG CGTGGTGGTCGCCACTCCACGCCTTGAAGATGTTGGAGTTTCGGGTATTGCAACGCCTATCTTGGGAAGAGTCCCAGTTAGTCTCGTCGCATCAAACGAC 401 CGTTCGGCATCAGTGCTGCCAGAATCGCAGAGTAGCTATCTGGGTCACAGTAGAACACACGGTCAGCAGCCGGAACATAGTTCTTGGTCAGAGCCGCACG 
GCAAGCCGTAGTCACGACGGTCTTAGCGTCTCATCGATAGACCCAGTGTCATCTTGTGTGCCAGTCGTCGGCCTTGTATCAAGAACCAGTCTCGGCGTGC 501 AGCCTTAGTCAGAGCCGCAATAATCTCCTTACCCAGCGCAACTTGGTCGGTAAGTGCGGCCTTGTTCTGAGTGGTCTCAATTACGGTAGCAGTACCTAAG TCGGAATCAGTCTCGGCGTTATTAGAGGAATGGGTCGCGTTGAACCAGCCATTCACGCCGGAACAAGACTCACCAGAGTTAATGCCATCGTCATGGATTC 601 CCCTCGATGTTCTCATTATATTTGCTTTCCACGTTACACAGACCGGCAATCTCAGCCAGAACCGCACCATCCGCAGCCATCGCCAGAGATTCACCCAACT GGGAGCTACAAGAGTAATATAAACGAAAGGTGCAATGTGTCTGGCCGTTAGAGTCGGTCTTGGCGTGGTAGGCGTCGGTAGCGGTCTCTAAGTGGGTTGA 701 GAGAGGTATACTCAGAGCGAACGTCGTAGTGGTTCATCGCGTCCTCAATATCATAAATCAGAACGTCAGCCGTCAGGAGACCGTCAATGGTGATTACCTT CTCTCCATATGAGTCTCGCTTGCAGCATCACCAAGTAGCGCAGGAGTTATAGTATTTAGTCTTGCAGTCGGCAGTCCTCTGGCAGTTACCACTAATGGAA 801 CTCGGTGTGTTTGATGTCCTTACGTTTATCGTCGAGGTTCTCGCCCGGAGCCAGATACGCTGCCTGAGTGCGACCCAGAACAGGGAACTGAGCGGATTTA GAGCCACACAAACTACAGGAATGCAAATAGCAGCTCCAAGAGCGGGCCTCGGTCTATGCGACGGACTCACGCTGGGTCTTGTCCCTTGACTCGCCTAAAT 901 CCGCTGGAGATGGAACGTACCATGTGGCGAGAAGTGGTCACGGAGGTACGAGCGAACGCAGTCAGGACTTCACCGCCAAATACCTTCAAGAACAACGCCA GGCGACCTCTACCTTGCATGGTACACCGCTCTTCACCAGTGCCTCCATGCTCGCTTGCGTCAGTCCTGAAGTGGCGGTTTATGGAAGTTCTTGTTGCGGT Esp3I ~~~~~ 1001 GTTTATCTCCAGCAGCAACTACACCTTTACCTTGGTTAGTACCCATTTGCTGTCCACCAGTCATGCTAGCCATATGTATATCTCCTTCTTAAAGTCGTCT CAAATAGAGGTCGTCGTTGATGTGGAAATGGAACCAATCATGGGTAAACGACAGGTGGTCAGTACGATCGGTATACATATAGAGGAAGAATTTCAGCAGA Esp3I ~ 1101 CCAGTGCCTCCACCAAGGGCCCATCGGTCTTCCCCCTGGCGCCCTGCTCCAGGAGCACCTCCGAGAGCACAGCGGCCCTGGGCTGCCTGGTCAAGGACTA GGTCACGGAGGTGGTTCCCGGGTAGCCAGAAGGGGGACCGCGGGACGAGGTCCTCGTGGAGGCTCTCGTGTCGCCGGGACCCGACGGACCAGTTCCTGAT 1201 CTTCCCCGAACCGGTGACGGTGTCGTGGAACTCAGGCGCTCTGACCAGCGGCGTGCACACCTTCCCAGCTGTCCTACAGTCCTCAGGACTCTACTCCCTC GAAGGGGCTTGGCCACTGCCACAGCACCTTGAGTCCGCGAGACTGGTCGCCGCACGTGTGGAAGGGTCGACAGGATGTCAGGAGTCCTGAGATGAGGGAG 1301 AGCAGCGTGGTGACCGTGCCCTCCAGCAGCTTGGGCACCCAGACCTACATCTGCAACGTGAATCACAAGCCCAGCAACACCAAGGTGGACAAGAAAGTTG TCGTCGCACCACTGGCACGGGAGGTCGTCGAACCCGTGGGTCTGGATGTAGACGTTGCACTTAGTGTTCGGGTCGTTGTGGTTCCACCTGTTCTTTCAAC 1401 
AGCCCAAATCTTGTGACAAAACTCACACATGCCCACCGTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCMAAACCCAAGGA TCGGGTTTAGAACACTGTTTTGAGTGTGTACGGGTGGCACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAGAAGGAGAAGGGGGGKTTTGGGTTCCT 1501 CACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTG GTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCAC 1601 GAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATG CTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTAC 1701 GCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTA CGTTCCTCATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCCACAT 1801 CACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTACCCCAGCGACATCGCCGTGGAGTGGGAG GTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATGGGGTCGCTGTAGCGGCACCTCACCCTC 1901 AGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCATGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCA TCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGTACGACCTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGT 2001 GGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAAGGGTA CCACCGTCGTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCATTTCCCAT 2101 CATGTCCCATATGCTCGACATGGCAAGCAGCCTGAGACAGATTCTGGACTCCCAGAAAATGGAGTGGAGGTCCAACGCCGGGGGCAGCGGTAGGGATAAG GTACAGGGTATACGAGCTGTACCGTTCGTCGGACTCTGTCTAAGACCTGAGGGTCTTTTACCTCACCTCCAGGTTGCGGCCCCCGTCGCCATCCCTATTC 2201 TGGTCAGATCTTCGCGACAATTCCAAATCAACTGAGTTCGATCCTAACATTGACATTGTTGGTTTAGAAGGAAAATTTGGTATTACAAACCTAGAAACGG ACCAGTCTAGAAGCGCTGTTAAGGTTTAGTTGACTCAAGCTAGGATTGTAACTGTAACAACCAAATCTTCCTTTTAAACCATAATGTTTGGATCTTTGCC 2301 ATTTATTCACAATCTGGGAGACAATGGAGGTCATGATCAAAGCAGATATTGCAGATACTGATAGAGCCAGCAACTTTGTTGCAACTGAAACCGATGCTAA 
TAAATAAGTGTTAGACCCTCTGTTACCTCCAGTACTAGTTTCGTCTATAACGTCTATGACTATCTCGGTCGTTGAAACAACGTTGACTTTGGCTACGATT 2401 CCGCGGAAAAATGCCTGGCAAAAAACTGCCACTGGCAGTTATCATGGAAATGGAAGCCAATGCTTTCAAAGCTGGCTGCACCAGGGGATGCCTTATCTGT GGCGCCTTTTTACGGACCGTTTTTTGACGGTGACCGTCAATAGTACCTTTACCTTCGGTTACGAAAGTTTCGACCGACGTGGTCCCCTACGGAATAGACA 2501 CTTTCAAAAATTAAGTGTACAGCCAAAATGAAGGTATACATTCCAGGAAGGTGTCACGATTATGGTGGTGACAAGAAAACTGGACAGGCAGGAATTGTTG GAAAGTTTTTAATTCACATGTCGGTTTTACTTCCATATGTAAGGTCCTTCCACAGTGCTAATACCACCACTGTTCTTTTGACCTGTCCGTCCTTAACAAC 2601 GTGCAATTGTTGACATTCCCGAAATCTCTGGATTTAAGGAGATGGCACCCATGGAACAGTTCATTGCTCAAGTTGATCGCTGCGCTTCCTGCACTACTGG CACGTTAACAACTGTAAGGGCTTTAGAGACCTAAATTCCTCTACCGTGGGTACCTTGTCAAGTAACGAGTTCAACTAGCGACGCGAAGGACGTGATGACC 2701 ATGTCTCAAAGGTCTTGCCAATGTTAAGTGCTCTGAACTCCTGAAGAAATGGCTGCCTGACAGGTGTGCAAGTTTTGCTGACAAGATTCAAAAAGAAGTT TACAGAGTTTCCAGAACGGTTACAATTCACGAGACTTGAGGACTTCTTTACCGACGGACTGTCCACACGTTCAAAACGACTGTTCTAAGTTTTTCTTCAA 2801 CACAATATCAAAGGCATGGCCGGCGATCGATGAGCGGCCGCAATTTAATTCCGGTTATTTTCCACCATATTGCCGTCTTTTGGCAATGTGAGGGCCCGGA GTGTTATAGTTTCCGTACCGGCCGCTAGCTACTCGCCGGCGTTAAATTAAGGCCAATAAAAGGTGGTATAACGGCAGAAAACCGTTACACTCCCGGGCCT 2901 AACCTGGCCCTGTCTTCTTGACGAGCATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAGGTCTGTTGAATGTCGTGAAGGAAGCAGTTCCTCT TTGGACCGGGACAGAAGAACTGCTCGTAAGGATCCCCAGAAAGGGGAGAGCGGTTTCCTTACGTTCCAGACAACTTACAGCACTTCCTTCGTCAAGGAGA 3001 GGAAGCTTCTTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGGCAGCGGAACCCCCCACCTGGCGACAGGTGCCTCTGCGGCCAAAAGCCACGTGTA CCTTCGAAGAACTTCTGTTTGTTGCAGACATCGCTGGGAAACGTCCGTCGCCTTGGGGGGTGGACCGCTGTCCACGGAGACGCCGGTTTTCGGTGCACAT 3101 TAAGATACACCTGCAAAGGCGGCACAACCCCAGTGCCACGTTGTGAGTTGGATAGTTGTGGAAAGAGTCAAATGGCTCACCTCAAGCGTATTCAACAAGG ATTCTATGTGGACGTTTCCGCCGTGTTGGGGTCACGGTGCAACACTCAACCTATCAACACCTTTCTCAGTTTACCGAGTGGAGTTCGCATAAGTTGTTCC 3201 GGCTGAAGGATGCCCAGAAGGTACCCCATTGTATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGGTTAAAAAACGTCTA CCGACTTCCTACGGGTCTTCCATGGGGTAACATACCCTAGACTAGACCCCGGAGCCACGTGTACGAAATGTACACAAATCAGCTCCAATTTTTTGCAGAT 3301 
GGCCCCCCGAACCACGGGGACGTGGTTTTCCTTTGAAAAACACGATGATAATATGGCCACCACCCATACCTAGGCTTTTGCAAAGATCGATCAAGAGACA CCGGGGGGCTTGGTGCCCCTGCACCAAAAGGAAACTTTTTGTGCTACTATTATACCGGTGGTGGGTATGGATCCGAAAACGTTTCTAGCTAGTTCTCTGT 3401 GGATGAGGATCGTTTCGCATGATTGAACAAGATGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGACTGGGCACAACAGA CCTACTCCTAGCAAAGCGTACTAACTTGTTCTACCTAACGTGCGTCCAAGAGGCCGGCGAACCCACCTCTCCGATAAGCCGATACTGACCCGTGTTGTCT 3501 CAATCGGCTGCTCTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAAGACCGACCTGTCCGGTGCCCTGAATGAACTGCA GTTAGCCGACGAGACTACGGCGGCACAAGGCCGACAGTCGCGTCCCCGCGGGCCAAGAAAAACAGTTCTGGCTGGACAGGCCACGGGACTTACTTGACGT 3601 AGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCCTTGCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGACTGGCTGCTATTG TCTGCTCCGTCGCGCCGATAGCACCGACCGGTGCTGCCCGCAAGGAACGCGTCGACACGAGCTGCAACAGTGACTTCGCCCTTCCCTGACCGACGATAAC 3701 GGCGAAGTGCCGGGGCAGGATCTCCTGTCATCTCACCTTGCTCCTGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATC CCGCTTCACGGCCCCGTCCTAGAGGACAGTAGAGTGGAACGAGGACGGCTCTTTCATAGGTAGTACCGACTACGTTACGCCGCCGACGTATGCGAACTAG 3801 CGGCTACCTGCCCATTCGACCACCAAGCGAAACATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGACGAAGA GCCGATGGACGGGTAAGCTGGTGGTTCGCTTTGTAGCGTAGCTCGCTCGTGCATGAGCCTACCTTCGGCCAGAACAGCTAGTCCTACTAGACCTGCTTCT 3901 GCATCAGGGGCTCGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGAGCATGCCCGACGGCGAGGATCTCGTCGTGACCCATGGCGATGCCTGCTTGCCG CGTAGTCCCCGAGCGCGGTCGGCTTGACAAGCGGTCCGAGTTCCGCTCGTACGGGCTGCCGCTCCTAGAGCAGCACTGGGTACCGCTACGGACGAACGGC 4001 AATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCATCGACTGTGGCCGGCTGGGTGTGGCGGACCGCTATCAGGACATAGCGTTGGCTACCCGTGATA TTATAGTACCACCTTTTACCGGCGAAAAGACCTAAGTAGCTGACACCGGCCGACCCACACCGCCTGGCGATAGTCCTGTATCGCAACCGATGGGCACTAT 4101 TTGCTGAAGAGCTTGGCGGCGAATGGGCTGACCGCTTCCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTTGA AACGACTTCTCGAACCGCCGCTTACCCGACTGGCGAAGGAGCACGAAATGCCATAGCGGCGAGGGCTAAGCGTCGCGTAGCGGAAGATAGCGGAAGAACT 4201 CGAGTTCTTCTGAGCGGGACTCTGGGGTTCGGGCCGCACTCGAGCATAAACTTGTTTATTGCAGCTTATAATGGTTACAAATAAAGCAATAGCATCACAA 
GCTCAAGAAGACTCGCCCTGAGACCCCAAGCCCGGCGTGAGCTCGTATTTGAACAAATAACGTCGAATATTACCAATGTTTATTTCGTTATCGTAGTGTT I-SceI ~~~~~~~~~~~~~~~~~~~~ 4301 ATTTCACAAATAAAGCATTTTTTTCACTGCATTCTAGTTGTGGTTTGTCCAAACTCATCAATGTATCTTAAGTAGGGATAACAGGGTAATTTTGTTAAAT TAAAGTGTTTATTTCGTAAAAAAAGTGACGTAAGATCAACACCAAACAGGTTTGAGTAGTTACATAGAATTCATCCCTATTGTCCCATTAAAACAATTTA 4401 CAGCTCATTTTTTAACCAATAGGAACGCCATCAAAAATAATTCGCGTCTGGCCTTCCTGTAGCCAGCTTTCATCAACATTAAATGTGAGCGAGTAACAAC GTCGAGTAAAAAATTGGTTATCCTTGCGGTAGTTTTTATTAAGCGCAGACCGGAAGGACATCGGTCGAAAGTAGTTGTAATTTACACTCGCTCATTGTTG 4501 CCGTCGGATTCTCCGTGGGAACAAACGGCGGATTGACCGTAATGGGATAGGTTACGTTGGTGTAGATGGGCGCATCGTAACCGTGCATCTGCCAGTTTGA GGCAGCCTAAGAGGCACCCTTGTTTGCCGCCTAACTGGCATTACCCTATCCAATGCAACCACATCTACCCGCGTAGCATTGGCACGTAGACGGTCAAACT 4601 GGGGACGACGACCGTATCGGCCTCAGGAAGATCGCACTCCAGCCAGCTTTCCGGCACCGCTTCTGGTGCCGGAAACCAGGCAAAGCGCCATTCGCCATTC CCCCTGCTGCTGGCATAGCCGGAGTCCTTCTAGCGTGAGGTCGGTCGAAAGGCCGTGGCGAAGACCACGGCCTTTGGTCCGTTTCGCGGTAAGCGGTAAG 4701 AGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGATTAAGTTGGGTAA TCCGACGCGTTGACAACCCTTCCCGCTAGCCACGCCCGGAGAAGCGATAATGCGGTCGACCGCTTTCCCCCTACACGACGTTCCGCTAATTCAACCCATT 4801 CGCCAGGGTTTTCCCAGTCACGACGTTGTAAAACGACGGCCAGTGAATTGCAATTCGTAATCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGC GCGGTCCCAAAAGGGTCAGTGCTGCAACATTTTGCTGCCGGTCACTTAACGTTAAGCATTAGTACCAGTATCGACAAAGGACACACTTTAACAATAGGCG 4901 TCACAATTCCACACAACATACGAGCCGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAATGAGTGAGCTAACTCACATTAATTGCGTTGCGCTCACTGCC AGTGTTAAGGTGTGTTGTATGCTCGGCCTTCGTATTTCACATTTCGGACCCCACGGATTACTCACTCGATTGAGTGTAATTAACGCAACGCGAGTGACGG I-SceI ~~~~~~~~~~~~~~~~~~~ 5001 ATTACCCTGTTATCCCTAGTGAACCATCACCCTAATCAAGTTTTTTGGGGTCGAGGTGCCGTAAAGCACTAAATCGGAACCCTAAAGGGAGCCCCCGATT TAATGGGACAATAGGGATCACTTGGTAGTGGGATTAGTTCAAAAAACCCCAGCTCCACGGCATTTCGTGATTTAGCCTTGGGATTTCCCTCGGGGGCTAA 5101 TAGAGCTTGACGGGGAAAGCCGGCGAACGTGGCGAGAAAGGAAGGGAAGAAAGCGAAAGGAGCGGGCGCTAGGGCGCTGGCAAGTGTAGCGGTCACGCTG 
ATCTCGAACTGCCCCTTTCGGCCGCTTGCACCGCTCTTTCCTTCCCTTCTTTCGCTTTCCTCGCCCGCGATCCCGCGACCGTTCACATCGCCAGTGCGAC 5201 CGCGTAACCACCACACCCGCCGCGCTTAATGCGCCGCTACAGGGCGCGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTT GCGCATTGGTGGTGTGGGCGGCGCGAATTACGCGGCGATGTCCCGCGCAGTCCACCGTGAAAAGCCCCTTTACACGCGCCTTGGGGATAAACAAATAAAA 5301 TCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATAACGACCGGTAATGAAAAAGGAAGAGTATGAGTATTC AGATTTATGTAAGTTTATACATAGGCGAGTACTCTGTTATTGGGACTATTTACGAAGTTATTATTGCTGGCCATTACTTTTTCCTTCTCATACTCATAAG 5401 AACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCA TTGTAAAGGCACAGCGGGAATAAGGGAAAAAACGCCGTAAAACGGAAGGACAAAAACGAGTGGGTCTTTGCGACCACTTTCATTTTCTACGACTTCTAGT 5501 GTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTT CAACCCACGTGCTCACCCAATGTAGCTTGACCTAGAGTTGTCGCCATTCTAGGAACTCTCAAAAGCGGGGCTTCTTGCAAAAGGTTACTACTCGTGAAAA 5601 AAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTCTA TTTCAAGACGATACACCGCGCCATAATAGGGCATAACTGCGGCCCGTTCTCGTTGAGCCAGCGGCGTATGTGATAAGAGTCTTACTGAACCAACTCAGAT 5701 GCGTTGATCGGCACGTAAGAGGTTCCAACTTTCACCATAATGAAATAAGATCACTACCGGGCGTATTTTTTGAGTTATCGAGATTTTCAGGAGCTAAGGA CGCAACTAGCCGTGCATTCTCCAAGGTTGAAAGTGGTATTACTTTATTCTAGTGATGGCCCGCATAAAAAACTCAATAGCTCTAAAAGTCCTCGATTCCT 5801 AGCTAAAATGGAGAAAAAAATCACTGGATATACCACCGTTGATATATCCCAATGGCATCGTAAAGAACATTTTGAGGCATTTCAGTCAGTTGCTCAATGT TCGATTTTACCTCTTTTTTTAGTGACCTATATGGTGGCAACTATATAGGGTTACCGTAGCATTTCTTGTAAAACTCCGTAAAGTCAGTCAACGAGTTACA 5901 ACCTATAACCAGACCGTTCAGCTGGATATTACGGCCTTTTTAAAGACCGTAAAGAAAAATAAGCACAAGTTTTATCCGGCCTTTATTCACATTCTTGCCC TGGATATTGGTCTGGCAAGTCGACCTATAATGCCGGAAAAATTTCTGGCATTTCTTTTTATTCGTGTTCAAAATAGGCCGGAAATAAGTGTAAGAACGGG 6001 GCCTGATGAATGCTCATCCGGAATTCCGTATGGCAATGAAAGACGGTGAGCTGGTGATATGGGATAGTGTTCACCCTTGTTACACCGTTTTCCATGAGCA CGGACTACTTACGAGTAGGCCTTAAGGCATACCGTTACTTTCTGCCACTCGACCACTATACCCTATCACAAGTGGGAACAATGTGGCAAAAGGTACTCGT 6101 
AACTGAAACGTTTTCATCGCTCTGGAGTGAATACCACGACGATTTCCGGCAGTTTCTACACATATATTCGCAAGATGTGGCGTGTTACGGTGAAAACCTG TTGACTTTGCAAAAGTAGCGAGACCTCACTTATGGTGCTGCTAAAGGCCGTCAAAGATGTGTATATAAGCGTTCTACACCGCACAATGCCACTTTTGGAC 6201 GCCTATTTCCCTAAAGGGTTTATTGAGAATATGTTTTTCGTATCAGCCAATCCCTGGGTGAGTTTCACCAGTTTTGATTTAAACGTGGCCAATATGGACA CGGATAAAGGGATTTCCCAAATAACTCTTATACAAAAAGCATAGTCGGTTAGGGACCCACTCAAAGTGGTCAAAACTAAATTTGCACCGGTTATACCTGT 6301 ACTTCTTCGCCCCCGTTTTCACCATGGGCAAATATTATACGCAAGGCGACAAGGTGCTGATGCCGCTGGCGATTCAGGTTCATCATGCCGTCTGTGATGG TGAAGAAGCGGGGGCAAAAGTGGTACCCGTTTATAATATGCGTTCCGCTGTTCCACGACTACGGCGACCGCTAAGTCCAAGTAGTACGGCAGACACTACC 6401 CTTCCATGTCGGCAGAATGCTTAATGAATTACAACAGTACTGCGATGAGTGGCAGGGCGGGGCGTAATTTTTTTAAGGCAGTTATTGGTGCCCTTAAACG GAAGGTACAGCCGTCTTACGAATTACTTAATGTTGTCATGACGCTACTCACCGTCCCGCCCCGCATTAAAAAAATTCCGTCAATAACCACGGGAATTTGC 6501 CCTGGTGCTACGCCTGAATAAGTGATAATAAGCGGATGAATGGCAGAAATTCGAAATGACCGACCAAGCGACGCCCAACCTGCCATCACGAGATTTCGAT GGACCACGATGCGGACTTATTCACTATTATTCGCCTACTTACCGTCTTTAAGCTTTACTGGCTGGTTCGCTGCGGGTTGGACGGTAGTGCTCTAAAGCTA 6601 TCCACCGCCGCCTTCTATGAAAGGTTGGGCTTCGGAATCGTTTTCCGGGACGCCGGCTGGATGATCCTCCAGCGCGGGGATCTCATGCTGGAGTTCTTCG AGGTGGCGGCGGAAGATACTTTCCAACCCGAAGCCTTAGCAAAAGGCCCTGCGGCCGACCTACTAGGAGGTCGCGCCCCTAGAGTACGACCTCAAGAAGC 6701 CCCACCCTAGGGGGAGGCTAACTGAAACACGGAAGGAGACAATACCGGAAGGAACCCGCGCTATGACGGCAATAAAAAGACAGAATAAAACGCACGGTGT GGGTGGGATCCCCCTCCGATTGACTTTGTGCCTTCCTCTGTTATGGCCTTCCTTGGGCGCGATACTGCCGTTATTTTTCTGTCTTATTTTGCGTGCCACA 6801 TGGGTCGTTTGTTCATAAACGCGGGGTTCGGTCCCAGGGCTGGCACTCTGTCGATACCCCACCGAGACCCCATTGGGGCCAATACGCCCGCGTTTCTTCC ACCCAGCAAACAAGTATTTGCGCCCCAAGCCAGGGTCCCGACCGTGAGACAGCTATGGGGTGGCTCTGGGGTAACCCCGGTTATGCGGGCGCAAAGAAGG 6901 TTTTCCCCACCCCACCCCCCAAGTTCGGGTGAAGGCCCAGGGCTCGCAGCCAACGTCGGGGCGGCAGGCCCTGCCATAGCCTCAGGTTACTCATATATAC AAAAGGGGTGGGGTGGGGGGTTCAAGCCCACTTCCGGGTCCCGAGCGTCGGTTGCAGCCCCGCCGTCCGGGACGGTATCGGAGTCCAATGAGTATATATG 7001 TTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTT 
AAATCTAACTAAATTTTGAAGTAAAAATTAAATTTTCCTAGATCCACTTCTAGGAAAAACTATTAGAGTACTGGTTTTAGGGAATTGCACTCAAAAGCAA 7101 CCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTA GGTGACTCGCAGTCTGGGGCATCTTTTCTAGTTTCCTAGAAGAACTCTAGGAAAAAAAGACGCGCATTAGACGACGAACGTTTGTTTTTTTGGTGGCGAT 7201 CCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGT GGTCGCCACCAAACAAACGGCCTAGTTCTCGATGGTTGAGAAAAAGGCTTCCATTGACCGAAGTCGTCTCGCGTCTATGGTTTATGACAGGAAGATCACA 7301 AGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTC TCGGCATCAATCCGGTGGTGAAGTTCTTGAGACATCGTGGCGGATGTATGGAGCGAGACGATTAGGACAATGGTCACCGACGACGGTCACCGCTATTCAG 7401 GTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACG CACAGAATGGCCCAACCTGAGTTCTGCTATCAATGGCCTATTCCGCGTCGCCAGCCCGACTTGCCCCCCAAGCACGTGTGTCGGGTCGAACCTCGCTTGC 7501 ACCTACACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCG TGGATGTGGCTTGACTCTATGGATGTCGCACTCGATACTCTTTCGCGGTGCGAAGGGCTTCCCTCTTTCCGCCTGTCCATAGGCCATTCGCCGTCCCAGC 7601 GAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTG CTTGTCCTCTCGCGTGCTCCCTCGAAGGTCCCCCTTTGCGGACCATAGAAATATCAGGACAGCCCAAAGCGGTGGAGACTGAACTCGCAGCTAAAAACAC 7701 ATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCT TACGAGCAGTCCCCCCGCCTCGGATACCTTTTTGCGGTCGTTGCGCCGGAAAAATGCCAAGGACCGGAAAACGACCGGAAAACGAGTGTACAAGAAAGGA 7801 GCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCATGCATTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAG CGCAATAGGGGACTAAGACACCTATTGGCATAATGGCGGTACGTAATCAATAATTATCATTAGTTAATGCCCCAGTAATCAAGTATCGGGTATATACCTC 7901 TTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGC AAGGCGCAATGTATTGAATGCCATTTACCGGGCGGACCGACTGGCGGGTTGCTGGGGGCGGGTAACTGCAGTTATTACTGCATACAAGGGTATCATTGCG 8001 
CAATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTAT GTTATCCCTGAAAGGTAACTGCAGTTACCCACCTCATAAATGCCATTTGACGGGTGAACCGTCATGTAGTTCACATAGTATACGGTTCATGCGGGGGATA 8101 TGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCT ACTGCAGTTACTGCCATTTACCGGGCGGACCGTAATACGGGTCATGTACTGGAATACCCTGAAAGGATGAACCGTCATGTAGATGCATAATCAGTAGCGA 8201 ATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGA TAATGGTACCACTACGCCAAAACCGTCATGTAGTTACCCGCACCTATCGCCAAACTGAGTGCCCCTAAAGGTTCAGAGGTGGGGTAACTGCAGTTACCCT 8301 GTTTGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTA CAAACAAAACCGTGGTTTTAGTTGCCCTGAAAGGTTTTACAGCATTGTTGAGGCGGGGTAACTGCGTTTACCCGCCATCCGCACATGCCACCCTCCAGAT 8401 TATAAGCAGAGCT ATATTCGTCTCGA pVHentry-Hisbio1 Esp3I ~~~~~~~ 1 GGTTTAGTGAACCGTCAGATCCGCTAGACGTCTCATATACCTGACTGGAATACGACAGCTCCTGCAGCTTCTGGGCGAAGACCACCGTGGCCCATTGCGT CCAAATCACTTGGCAGTCTAGGCGATCTGCAGAGTATATGGACTGACCTTATGCTGTCGAGGACGTCGAAGACCCGCTTCTGGTGGCACCGGGTAACGCA 101 ACTTAGCGATAATCTGGTCCGCTTGGAAGTTAGCACGGCGAGCGCGCTCCAGAGCCAAGTCACGCAGCTTAACAGTACCTACCGCAGAGCGGTGCATGAA TGAATCGCTATTAGACCAGGCGAACCTTCAATCGTGCCGCTCGCGCGAGGTCTCGGTTCAGTGCGTCGAATTGTCATGGATGGCGTCTCGCCACGTACTT 201 CAGGCCGATAACGTTGTCCTTAGCAACCTTGACATTACCCTCACCTTTATTGGCAGGGAAGACGTGCTTCTGACCAGTAGTGCCCTCACGAGCGGTACCA GTCCGGCTATTGCAACAGGAATCGTTGGAACTGTAATGGGAGTGGAAATAACCGTCCCTTCTGCACGAAGACTGGTCATCACGGGAGTGCTCGCCATGGT 301 GCACCACCAGCGGTGAGGTGCGGAACTTCTACAACCTCAAAGCCCATAACGTTGCGGATAGAACCCTTCTCAGGGTCAATCAGAGCAGCGTAGTTTGCTG CGTGGTGGTCGCCACTCCACGCCTTGAAGATGTTGGAGTTTCGGGTATTGCAACGCCTATCTTGGGAAGAGTCCCAGTTAGTCTCGTCGCATCAAACGAC 401 CGTTCGGCATCAGTGCTGCCAGAATCGCAGAGTAGCTATCTGGGTCACAGTAGAACACACGGTCAGCAGCCGGAACATAGTTCTTGGTCAGAGCCGCACG GCAAGCCGTAGTCACGACGGTCTTAGCGTCTCATCGATAGACCCAGTGTCATCTTGTGTGCCAGTCGTCGGCCTTGTATCAAGAACCAGTCTCGGCGTGC 501 
AGCCTTAGTCAGAGCCGCAATAATCTCCTTACCCAGCGCAACTTGGTCGGTAAGTGCGGCCTTGTTCTGAGTGGTCTCAATTACGGTAGCAGTACCTAAG TCGGAATCAGTCTCGGCGTTATTAGAGGAATGGGTCGCGTTGAACCAGCCATTCACGCCGGAACAAGACTCACCAGAGTTAATGCCATCGTCATGGATTC 601 CCCTCGATGTTCTCATTATATTTGCTTTCCACGTTACACAGACCGGCAATCTCAGCCAGAACCGCACCATCCGCAGCCATCGCCAGAGATTCACCCAACT GGGAGCTACAAGAGTAATATAAACGAAAGGTGCAATGTGTCTGGCCGTTAGAGTCGGTCTTGGCGTGGTAGGCGTCGGTAGCGGTCTCTAAGTGGGTTGA 701 GAGAGGTATACTCAGAGCGAACGTCGTAGTGGTTCATCGCGTCCTCAATATCATAAATCAGAACGTCAGCCGTCAGGAGACCGTCAATGGTGATTACCTT CTCTCCATATGAGTCTCGCTTGCAGCATCACCAAGTAGCGCAGGAGTTATAGTATTTAGTCTTGCAGTCGGCAGTCCTCTGGCAGTTACCACTAATGGAA 801 CTCGGTGTGTTTGATGTCCTTACGTTTATCGTCGAGGTTCTCGCCCGGAGCCAGATACGCTGCCTGAGTGCGACCCAGAACAGGGAACTGAGCGGATTTA GAGCCACACAAACTACAGGAATGCAAATAGCAGCTCCAAGAGCGGGCCTCGGTCTATGCGACGGACTCACGCTGGGTCTTGTCCCTTGACTCGCCTAAAT 901 CCGCTGGAGATGGAACGTACCATGTGGCGAGAAGTGGTCACGGAGGTACGAGCGAACGCAGTCAGGACTTCACCGCCAAATACCTTCAAGAACAACGCCA GGCGACCTCTACCTTGCATGGTACACCGCTCTTCACCAGTGCCTCCATGCTCGCTTGCGTCAGTCCTGAAGTGGCGGTTTATGGAAGTTCTTGTTGCGGT Esp3I ~~~~~ 1001 GTTTATCTCCAGCAGCAACTACACCTTTACCTTGGTTAGTACCCATTTGCTGTCCACCAGTCATGCTAGCCATATGTATATCTCCTTCTTAAAGTCGTCT CAAATAGAGGTCGTCGTTGATGTGGAAATGGAACCAATCATGGGTAAACGACAGGTGGTCAGTACGATCGGTATACATATAGAGGAAGAATTTCAGCAGA Esp3I ~ 1101 CCAGTGCCTCCACCAAGGGCCCATCGGTCTTCCCCCTGGCGCCCTGCTCCAGGAGCACCTCCGAGAGCACAGCGGCCCTGGGCTGCCTGGTCAAGGACTA GGTCACGGAGGTGGTTCCCGGGTAGCCAGAAGGGGGACCGCGGGACGAGGTCCTCGTGGAGGCTCTCGTGTCGCCGGGACCCGACGGACCAGTTCCTGAT 1201 CTTCCCCGAACCGGTGACGGTGTCGTGGAACTCAGGCGCTCTGACCAGCGGCGTGCACACCTTCCCAGCTGTCCTACAGTCCTCAGGACTCTACTCCCTC GAAGGGGCTTGGCCACTGCCACAGCACCTTGAGTCCGCGAGACTGGTCGCCGCACGTGTGGAAGGGTCGACAGGATGTCAGGAGTCCTGAGATGAGGGAG 1301 AGCAGCGTGGTGACCGTGCCCTCCAGCAGCTTGGGCACCCAGACCTACATCTGCAACGTGAATCACAAGCCCAGCAACACCAAGGTGGACAAGAAAGTTG TCGTCGCACCACTGGCACGGGAGGTCGTCGAACCCGTGGGTCTGGATGTAGACGTTGCACTTAGTGTTCGGGTCGTTGTGGTTCCACCTGTTCTTTCAAC 1401 AGCCCAAATCTTGTGACAAAACTCACACATGCCCACCGTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCMAAACCCAAGGA 
TCGGGTTTAGAACACTGTTTTGAGTGTGTACGGGTGGCACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAGAAGGAGAAGGGGGGKTTTGGGTTCCT 1501 CACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTG GTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCAC 1601 GAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATG CTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTAC 1701 GCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTA CGTTCCTCATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCCACAT 1801 CACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTACCCCAGCGACATCGCCGTGGAGTGGGAG GTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATGGGGTCGCTGTAGCGGCACCTCACCCTC 1901 AGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCATGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCA TCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGTACGACCTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGT 2001 GGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAAGGGTA CCACCGTCGTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCATTTCCCAT 2101 CATGTCCCATATGCTCGACATGGCAAGCAGCCTGAGACAGATTCTGGACTCCCAGAAAATGGAGTGGAGGTCCAACGCCGGGGGCAGCGGTAGGGATAAG GTACAGGGTATACGAGCTGTACCGTTCGTCGGACTCTGTCTAAGACCTGAGGGTCTTTTACCTCACCTCCAGGTTGCGGCCCCCGTCGCCATCCCTATTC 2201 TGGTCAGATCTTCGCATGGGCAGCAGCCATCATCATCATCATCACAGCAGCGGCATGGCAAGCAGCCTGAGACAGATTCTGGACTCCCAGAAAATGGAGT ACCAGTCTAGAAGCGTACCCGTCGTCGGTAGTAGTAGTAGTAGTGTCGTCGCCGTACCGTTCGTCGGACTCTGTCTAAGACCTGAGGGTCTTTTACCTCA I-SceI ~~~~~~~~~~~~~~~~~~~~ 2301 GGAGGTCCAACGCCGGGGGCAGCGGTAGGGATAACAGGGTAATCCATATGCTCGAGGGGGCCAAGGCCGCGCCGGCCTGCAGGCATGCAAGCTTGGCGTA CCTCCAGGTTGCGGCCCCCGTCGCCATCCCTATTGTCCCATTAGGTATACGAGCTCCCCCGGTTCCGGCGCGGCCGGACGTCCGTACGTTCGAACCGCAT 2401 
ATCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACACAACATACGAGCCGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAA TAGTACCAGTATCGACAAAGGACACACTTTAACAATAGGCGAGTGTTAAGGTGTGTTGTATGCTCGGCCTTCGTATTTCACATTTCGGACCCCACGGATT 2501 TGAGTGAGCTAACTCACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGCGAGCTCGAATTGTTGACATTCCCGAAA ACTCACTCGATTGAGTGTAATTAACGCAACGCGAGTGACGGGCGAAAGGTCAGCCCTTTGGACAGCACGGTCGCTCGAGCTTAACAACTGTAAGGGCTTT 2601 TCTCTGGATTTAAGGAGATGGCACCCATGGAACAGTTCATTGCTCAAGTTGATCGCTGCGCTTCCTGCACTACTGGATGTCTCAAAGGTCTTGCCAATGT AGAGACCTAAATTCCTCTACCGTGGGTACCTTGTCAAGTAACGAGTTCAACTAGCGACGCGAAGGACGTGATGACCTACAGAGTTTCCAGAACGGTTACA 2701 TAAGTGCTCTGAACTCCTGAAGAAATGGCTGCCTGACAGGTGTGCAAGTTTTGCTGACAAGATTCAAAAAGAAGTTCACAATATCAAAGGCATGGCCGGC ATTCACGAGACTTGAGGACTTCTTTACCGACGGACTGTCCACACGTTCAAAACGACTGTTCTAAGTTTTTCTTCAAGTGTTATAGTTTCCGTACCGGCCG 2801 GATCGATGAGCGGCCGCAATTTAATTCCGGTTATTTTCCACCATATTGCCGTCTTTTGGCAATGTGAGGGCCCGGAAACCTGGCCCTGTCTTCTTGACGA CTAGCTACTCGCCGGCGTTAAATTAAGGCCAATAAAAGGTGGTATAACGGCAGAAAACCGTTACACTCCCGGGCCTTTGGACCGGGACAGAAGAACTGCT 2901 GCATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAGGTCTGTTGAATGTCGTGAAGGAAGCAGTTCCTCTGGAAGCTTCTTGAAGACAAACAAC CGTAAGGATCCCCAGAAAGGGGAGAGCGGTTTCCTTACGTTCCAGACAACTTACAGCACTTCCTTCGTCAAGGAGACCTTCGAAGAACTTCTGTTTGTTG 3001 GTCTGTAGCGACCCTTTGCAGGCAGCGGAACCCCCCACCTGGCGACAGGTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCA CAGACATCGCTGGGAAACGTCCGTCGCCTTGGGGGGTGGACCGCTGTCCACGGAGACGCCGGTTTTCGGTGCACATATTCTATGTGGACGTTTCCGCCGT 3101 CAACCCCAGTGCCACGTTGTGAGTTGGATAGTTGTGGAAAGAGTCAAATGGCTCACCTCAAGCGTATTCAACAAGGGGCTGAAGGATGCCCAGAAGGTAC GTTGGGGTCACGGTGCAACACTCAACCTATCAACACCTTTCTCAGTTTACCGAGTGGAGTTCGCATAAGTTGTTCCCCGACTTCCTACGGGTCTTCCATG 3201 CCCATTGTATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGGTTAAAAAACGTCTAGGCCCCCCGAACCACGGGGACGTG GGGTAACATACCCTAGACTAGACCCCGGAGCCACGTGTACGAAATGTACACAAATCAGCTCCAATTTTTTGCAGATCCGGGGGGCTTGGTGCCCCTGCAC 3301 GTTTTCCTTTGAAAAACACGATGATAATATGGCCACCACCCATACCTAGGCTTTTGCAAAGATCGATCAAGAGACAGGATGAGGATCGTTTCGCATGATT 
CAAAAGGAAACTTTTTGTGCTACTATTATACCGGTGGTGGGTATGGATCCGAAAACGTTTCTAGCTAGTTCTCTGTCCTACTCCTAGCAAAGCGTACTAA 3401 GAACAAGATGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGACTGGGCACAACAGACAATCGGCTGCTCTGATGCCGCCG CTTGTTCTACCTAACGTGCGTCCAAGAGGCCGGCGAACCCACCTCTCCGATAAGCCGATACTGACCCGTGTTGTCTGTTAGCCGACGAGACTACGGCGGC 3501 TGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAAGACCGACCTGTCCGGTGCCCTGAATGAACTGCAAGACGAGGCAGCGCGGCTATCGTG ACAAGGCCGACAGTCGCGTCCCCGCGGGCCAAGAAAAACAGTTCTGGCTGGACAGGCCACGGGACTTACTTGACGTTCTGCTCCGTCGCGCCGATAGCAC 3601 GCTGGCCACGACGGGCGTTCCTTGCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTC CGACCGGTGCTGCCCGCAAGGAACGCGTCGACACGAGCTGCAACAGTGACTTCGCCCTTCCCTGACCGACGATAACCCGCTTCACGGCCCCGTCCTAGAG 3701 CTGTCATCTCACCTTGCTCCTGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTACCTGCCCATTCGACCACC GACAGTAGAGTGGAACGAGGACGGCTCTTTCATAGGTAGTACCGACTACGTTACGCCGCCGACGTATGCGAACTAGGCCGATGGACGGGTAAGCTGGTGG 3801 AAGCGAAACATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGACGAAGAGCATCAGGGGCTCGCGCCAGCCGA TTCGCTTTGTAGCGTAGCTCGCTCGTGCATGAGCCTACCTTCGGCCAGAACAGCTAGTCCTACTAGACCTGCTTCTCGTAGTCCCCGAGCGCGGTCGGCT 3901 ACTGTTCGCCAGGCTCAAGGCGAGCATGCCCGACGGCGAGGATCTCGTCGTGACCCATGGCGATGCCTGCTTGCCGAATATCATGGTGGAAAATGGCCGC TGACAAGCGGTCCGAGTTCCGCTCGTACGGGCTGCCGCTCCTAGAGCAGCACTGGGTACCGCTACGGACGAACGGCTTATAGTACCACCTTTTACCGGCG 4001 TTTTCTGGATTCATCGACTGTGGCCGGCTGGGTGTGGCGGACCGCTATCAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAAT AAAAGACCTAAGTAGCTGACACCGGCCGACCCACACCGCCTGGCGATAGTCCTGTATCGCAACCGATGGGCACTATAACGACTTCTCGAACCGCCGCTTA 4101 GGGCTGACCGCTTCCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACGAGTTCTTCTGAGCGGGACTCTG CCCGACTGGCGAAGGAGCACGAAATGCCATAGCGGCGAGGGCTAAGCGTCGCGTAGCGGAAGATAGCGGAAGAACTGCTCAAGAAGACTCGCCCTGAGAC 4201 GGGTTCGGGCCGCACTCGAGCATAAACTTGTTTATTGCAGCTTATAATGGTTACAAATAAAGCAATAGCATCACAAATTTCACAAATAAAGCATTTTTTT CCCAAGCCCGGCGTGAGCTCGTATTTGAACAAATAACGTCGAATATTACCAATGTTTATTTCGTTATCGTAGTGTTTAAAGTGTTTATTTCGTAAAAAAA I-SceI ~~~~~~~~~~~~~~ 4301 
CACTGCATTCTAGTTGTGGTTTGTCCAAACTCATCAATGTATCTTAAGTAGGGATAACAGGGTAATTTTGTTAAATCAGCTCATTTTTTAACCAATAGGA GTGACGTAAGATCAACACCAAACAGGTTTGAGTAGTTACATAGAATTCATCCCTATTGTCCCATTAAAACAATTTAGTCGAGTAAAAAATTGGTTATCCT 4401 ACGCCATCAAAAATAATTCGCGTCTGGCCTTCCTGTAGCCAGCTTTCATCAACATTAAATGTGAGCGAGTAACAACCCGTCGGATTCTCCGTGGGAACAA TGCGGTAGTTTTTATTAAGCGCAGACCGGAAGGACATCGGTCGAAAGTAGTTGTAATTTACACTCGCTCATTGTTGGGCAGCCTAAGAGGCACCCTTGTT 4501 ACGGCGGATTGACCGTAATGGGATAGGTTACGTTGGTGTAGATGGGCGCATCGTAACCGTGCATCTGCCAGTTTGAGGGGACGACGACCGTATCGGCCTC TGCCGCCTAACTGGCATTACCCTATCCAATGCAACCACATCTACCCGCGTAGCATTGGCACGTAGACGGTCAAACTCCCCTGCTGCTGGCATAGCCGGAG 4601 AGGAAGATCGCACTCCAGCCAGCTTTCCGGCACCGCTTCTGGTGCCGGAAACCAGGCAAAGCGCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGG TCCTTCTAGCGTGAGGTCGGTCGAAAGGCCGTGGCGAAGACCACGGCCTTTGGTCCGTTTCGCGGTAAGCGGTAAGTCCGACGCGTTGACAACCCTTCCC 4701 CGATCGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACGAC GCTAGCCACGCCCGGAGAAGCGATAATGCGGTCGACCGCTTTCCCCCTACACGACGTTCCGCTAATTCAACCCATTGCGGTCCCAAAAGGGTCAGTGCTG 4801 GTTGTAAAACGACGGCCAGTGAATTGCAATTCGTAATCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACACAACATACGAG CAACATTTTGCTGCCGGTCACTTAACGTTAAGCATTAGTACCAGTATCGACAAAGGACACACTTTAACAATAGGCGAGTGTTAAGGTGTGTTGTATGCTC I-SceI ~~~~~~~~~~~~~~~~~~~~ 4901 CCGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAATGAGTGAGCTAACTCACATTAATTGCGTTGCGCTCACTGCCATTACCCTGTTATCCCTAGTGAAC GGCCTTCGTATTTCACATTTCGGACCCCACGGATTACTCACTCGATTGAGTGTAATTAACGCAACGCGAGTGACGGTAATGGGACAATAGGGATCACTTG 5001 CATCACCCTAATCAAGTTTTTTGGGGTCGAGGTGCCGTAAAGCACTAAATCGGAACCCTAAAGGGAGCCCCCGATTTAGAGCTTGACGGGGAAAGCCGGC GTAGTGGGATTAGTTCAAAAAACCCCAGCTCCACGGCATTTCGTGATTTAGCCTTGGGATTTCCCTCGGGGGCTAAATCTCGAACTGCCCCTTTCGGCCG 5101 GAACGTGGCGAGAAAGGAAGGGAAGAAAGCGAAAGGAGCGGGCGCTAGGGCGCTGGCAAGTGTAGCGGTCACGCTGCGCGTAACCACCACACCCGCCGCG CTTGCACCGCTCTTTCCTTCCCTTCTTTCGCTTTCCTCGCCCGCGATCCCGCGACCGTTCACATCGCCAGTGCGACGCGCATTGGTGGTGTGGGCGGCGC 5201 CTTAATGCGCCGCTACAGGGCGCGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATC 
GAATTACGCGGCGATGTCCCGCGCAGTCCACCGTGAAAAGCCCCTTTACACGCGCCTTGGGGATAAACAAATAAAAAGATTTATGTAAGTTTATACATAG 5301 CGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATAACGACCGGTAATGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTC GCGAGTACTCTGTTATTGGGACTATTTACGAAGTTATTATTGCTGGCCATTACTTTTTCCTTCTCATACTCATAAGTTGTAAAGGCACAGCGGGAATAAG 5401 CCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACAT GGAAAAAACGCCGTAAAACGGAAGGACAAAAACGAGTGGGTCTTTGCGACCACTTTCATTTTCTACGACTTCTAGTCAACCCACGTGCTCACCCAATGTA 5501 CGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTA GCTTGACCTAGAGTTGTCGCCATTCTAGGAACTCTCAAAAGCGGGGCTTCTTGCAAAAGGTTACTACTCGTGAAAATTTCAAGACGATACACCGCGCCAT 5601 TTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTCTAGCGTTGATCGGCACGTAAGAGGTT AATAGGGCATAACTGCGGCCCGTTCTCGTTGAGCCAGCGGCGTATGTGATAAGAGTCTTACTGAACCAACTCAGATCGCAACTAGCCGTGCATTCTCCAA 5701 CCAACTTTCACCATAATGAAATAAGATCACTACCGGGCGTATTTTTTGAGTTATCGAGATTTTCAGGAGCTAAGGAAGCTAAAATGGAGAAAAAAATCAC GGTTGAAAGTGGTATTACTTTATTCTAGTGATGGCCCGCATAAAAAACTCAATAGCTCTAAAAGTCCTCGATTCCTTCGATTTTACCTCTTTTTTTAGTG 5801 TGGATATACCACCGTTGATATATCCCAATGGCATCGTAAAGAACATTTTGAGGCATTTCAGTCAGTTGCTCAATGTACCTATAACCAGACCGTTCAGCTG ACCTATATGGTGGCAACTATATAGGGTTACCGTAGCATTTCTTGTAAAACTCCGTAAAGTCAGTCAACGAGTTACATGGATATTGGTCTGGCAAGTCGAC 5901 GATATTACGGCCTTTTTAAAGACCGTAAAGAAAAATAAGCACAAGTTTTATCCGGCCTTTATTCACATTCTTGCCCGCCTGATGAATGCTCATCCGGAAT CTATAATGCCGGAAAAATTTCTGGCATTTCTTTTTATTCGTGTTCAAAATAGGCCGGAAATAAGTGTAAGAACGGGCGGACTACTTACGAGTAGGCCTTA 6001 TCCGTATGGCAATGAAAGACGGTGAGCTGGTGATATGGGATAGTGTTCACCCTTGTTACACCGTTTTCCATGAGCAAACTGAAACGTTTTCATCGCTCTG AGGCATACCGTTACTTTCTGCCACTCGACCACTATACCCTATCACAAGTGGGAACAATGTGGCAAAAGGTACTCGTTTGACTTTGCAAAAGTAGCGAGAC 6101 GAGTGAATACCACGACGATTTCCGGCAGTTTCTACACATATATTCGCAAGATGTGGCGTGTTACGGTGAAAACCTGGCCTATTTCCCTAAAGGGTTTATT CTCACTTATGGTGCTGCTAAAGGCCGTCAAAGATGTGTATATAAGCGTTCTACACCGCACAATGCCACTTTTGGACCGGATAAAGGGATTTCCCAAATAA 6201 
GAGAATATGTTTTTCGTATCAGCCAATCCCTGGGTGAGTTTCACCAGTTTTGATTTAAACGTGGCCAATATGGACAACTTCTTCGCCCCCGTTTTCACCA CTCTTATACAAAAAGCATAGTCGGTTAGGGACCCACTCAAAGTGGTCAAAACTAAATTTGCACCGGTTATACCTGTTGAAGAAGCGGGGGCAAAAGTGGT 6301 TGGGCAAATATTATACGCAAGGCGACAAGGTGCTGATGCCGCTGGCGATTCAGGTTCATCATGCCGTCTGTGATGGCTTCCATGTCGGCAGAATGCTTAA ACCCGTTTATAATATGCGTTCCGCTGTTCCACGACTACGGCGACCGCTAAGTCCAAGTAGTACGGCAGACACTACCGAAGGTACAGCCGTCTTACGAATT 6401 TGAATTACAACAGTACTGCGATGAGTGGCAGGGCGGGGCGTAATTTTTTTAAGGCAGTTATTGGTGCCCTTAAACGCCTGGTGCTACGCCTGAATAAGTG ACTTAATGTTGTCATGACGCTACTCACCGTCCCGCCCCGCATTAAAAAAATTCCGTCAATAACCACGGGAATTTGCGGACCACGATGCGGACTTATTCAC 6501 ATAATAAGCGGATGAATGGCAGAAATTCGAAATGACCGACCAAGCGACGCCCAACCTGCCATCACGAGATTTCGATTCCACCGCCGCCTTCTATGAAAGG TATTATTCGCCTACTTACCGTCTTTAAGCTTTACTGGCTGGTTCGCTGCGGGTTGGACGGTAGTGCTCTAAAGCTAAGGTGGCGGCGGAAGATACTTTCC 6601 TTGGGCTTCGGAATCGTTTTCCGGGACGCCGGCTGGATGATCCTCCAGCGCGGGGATCTCATGCTGGAGTTCTTCGCCCACCCTAGGGGGAGGCTAACTG AACCCGAAGCCTTAGCAAAAGGCCCTGCGGCCGACCTACTAGGAGGTCGCGCCCCTAGAGTACGACCTCAAGAAGCGGGTGGGATCCCCCTCCGATTGAC 6701 AAACACGGAAGGAGACAATACCGGAAGGAACCCGCGCTATGACGGCAATAAAAAGACAGAATAAAACGCACGGTGTTGGGTCGTTTGTTCATAAACGCGG TTTGTGCCTTCCTCTGTTATGGCCTTCCTTGGGCGCGATACTGCCGTTATTTTTCTGTCTTATTTTGCGTGCCACAACCCAGCAAACAAGTATTTGCGCC 6801 GGTTCGGTCCCAGGGCTGGCACTCTGTCGATACCCCACCGAGACCCCATTGGGGCCAATACGCCCGCGTTTCTTCCTTTTCCCCACCCCACCCCCCAAGT CCAAGCCAGGGTCCCGACCGTGAGACAGCTATGGGGTGGCTCTGGGGTAACCCCGGTTATGCGGGCGCAAAGAAGGAAAAGGGGTGGGGTGGGGGGTTCA 6901 TCGGGTGAAGGCCCAGGGCTCGCAGCCAACGTCGGGGCGGCAGGCCCTGCCATAGCCTCAGGTTACTCATATATACTTTAGATTGATTTAAAACTTCATT AGCCCACTTCCGGGTCCCGAGCGTCGGTTGCAGCCCCGCCGTCCGGGACGGTATCGGAGTCCAATGAGTATATATGAAATCTAACTAAATTTTGAAGTAA 7001 TTTAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGA AAATTAAATTTTCCTAGATCCACTTCTAGGAAAAACTATTAGAGTACTGGTTTTAGGGAATTGCACTCAAAAGCAAGGTGACTCGCAGTCTGGGGCATCT 7101 AAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGAT 
TTTCTAGTTTCCTAGAAGAACTCTAGGAAAAAAAGACGCGCATTAGACGACGAACGTTTGTTTTTTTGGTGGCGATGGTCGCCACCAAACAAACGGCCTA 7201 CAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCA GTTCTCGATGGTTGAGAAAAAGGCTTCCATTGACCGAAGTCGTCTCGCGTCTATGGTTTATGACAGGAAGATCACATCGGCATCAATCCGGTGGTGAAGT 7301 AGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAG TCTTGAGACATCGTGGCGGATGTATGGAGCGAGACGATTAGGACAATGGTCACCGACGACGGTCACCGCTATTCAGCACAGAATGGCCCAACCTGAGTTC 7401 ACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTA TGCTATCAATGGCCTATTCCGCGTCGCCAGCCCGACTTGCCCCCCAAGCACGTGTGTCGGGTCGAACCTCGCTTGCTGGATGTGGCTTGACTCTATGGAT 7501 CAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGC GTCGCACTCGATACTCTTTCGCGGTGCGAAGGGCTTCCCTCTTTCCGCCTGTCCATAGGCCATTCGCCGTCCCAGCCTTGTCCTCTCGCGTGCTCCCTCG 7601 TTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCTT AAGGTCCCCCTTTGCGGACCATAGAAATATCAGGACAGCCCAAAGCGGTGGAGACTGAACTCGCAGCTAAAAACACTACGAGCAGTCCCCCCGCCTCGGA 7701 ATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGAT TACCTTTTTGCGGTCGTTGCGCCGGAAAAATGCCAAGGACCGGAAAACGACCGGAAAACGAGTGTACAAGAAAGGACGCAATAGGGGACTAAGACACCTA 7801 AACCGTATTACCGCCATGCATTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTA TTGGCATAATGGCGGTACGTAATCAATAATTATCATTAGTTAATGCCCCAGTAATCAAGTATCGGGTATATACCTCAAGGCGCAATGTATTGAATGCCAT 7901 AATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTC TTACCGGGCGGACCGACTGGCGGGTTGCTGGGGGCGGGTAACTGCAGTTATTACTGCATACAAGGGTATCATTGCGGTTATCCCTGAAAGGTAACTGCAG 8001 AATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCC TTACCCACCTCATAAATGCCATTTGACGGGTGAACCGTCATGTAGTTCACATAGTATACGGTTCATGCGGGGGATAACTGCAGTTACTGCCATTTACCGG 8101 
CGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGG GCGGACCGTAATACGGGTCATGTACTGGAATACCCTGAAAGGATGAACCGTCATGTAGATGCATAATCAGTAGCGATAATGGTACCACTACGCCAAAACC 8201 CAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAAC GTCATGTAGTTACCCGCACCTATCGCCAAACTGAGTGCCCCTAAAGGTTCAGAGGTGGGGTAACTGCAGTTACCCTCAAACAAAACCGTGGTTTTAGTTG 8301 GGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCT CCCTGAAAGGTTTTACAGCATTGTTGAGGCGGGGTAACTGCGTTTACCCGCCATCCGCACATGCCACCCTCCAGATATATTCGTCTCGA pVHentry-CBD1 Esp3I ~~~~~~~ 1 GGTTTAGTGAACCGTCAGATCCGCTAGACGTCTCATATACCTGACTGGAATACGACAGCTCCTGCAGCTTCTGGGCGAAGACCACCGTGGCCCATTGCGT CCAAATCACTTGGCAGTCTAGGCGATCTGCAGAGTATATGGACTGACCTTATGCTGTCGAGGACGTCGAAGACCCGCTTCTGGTGGCACCGGGTAACGCA 101 ACTTAGCGATAATCTGGTCCGCTTGGAAGTTAGCACGGCGAGCGCGCTCCAGAGCCAAGTCACGCAGCTTAACAGTACCTACCGCAGAGCGGTGCATGAA TGAATCGCTATTAGACCAGGCGAACCTTCAATCGTGCCGCTCGCGCGAGGTCTCGGTTCAGTGCGTCGAATTGTCATGGATGGCGTCTCGCCACGTACTT 201 CAGGCCGATAACGTTGTCCTTAGCAACCTTGACATTACCCTCACCTTTATTGGCAGGGAAGACGTGCTTCTGACCAGTAGTGCCCTCACGAGCGGTACCA GTCCGGCTATTGCAACAGGAATCGTTGGAACTGTAATGGGAGTGGAAATAACCGTCCCTTCTGCACGAAGACTGGTCATCACGGGAGTGCTCGCCATGGT 301 GCACCACCAGCGGTGAGGTGCGGAACTTCTACAACCTCAAAGCCCATAACGTTGCGGATAGAACCCTTCTCAGGGTCAATCAGAGCAGCGTAGTTTGCTG CGTGGTGGTCGCCACTCCACGCCTTGAAGATGTTGGAGTTTCGGGTATTGCAACGCCTATCTTGGGAAGAGTCCCAGTTAGTCTCGTCGCATCAAACGAC 401 CGTTCGGCATCAGTGCTGCCAGAATCGCAGAGTAGCTATCTGGGTCACAGTAGAACACACGGTCAGCAGCCGGAACATAGTTCTTGGTCAGAGCCGCACG GCAAGCCGTAGTCACGACGGTCTTAGCGTCTCATCGATAGACCCAGTGTCATCTTGTGTGCCAGTCGTCGGCCTTGTATCAAGAACCAGTCTCGGCGTGC 501 AGCCTTAGTCAGAGCCGCAATAATCTCCTTACCCAGCGCAACTTGGTCGGTAAGTGCGGCCTTGTTCTGAGTGGTCTCAATTACGGTAGCAGTACCTAAG TCGGAATCAGTCTCGGCGTTATTAGAGGAATGGGTCGCGTTGAACCAGCCATTCACGCCGGAACAAGACTCACCAGAGTTAATGCCATCGTCATGGATTC 601 CCCTCGATGTTCTCATTATATTTGCTTTCCACGTTACACAGACCGGCAATCTCAGCCAGAACCGCACCATCCGCAGCCATCGCCAGAGATTCACCCAACT 
GGGAGCTACAAGAGTAATATAAACGAAAGGTGCAATGTGTCTGGCCGTTAGAGTCGGTCTTGGCGTGGTAGGCGTCGGTAGCGGTCTCTAAGTGGGTTGA 701 GAGAGGTATACTCAGAGCGAACGTCGTAGTGGTTCATCGCGTCCTCAATATCATAAATCAGAACGTCAGCCGTCAGGAGACCGTCAATGGTGATTACCTT CTCTCCATATGAGTCTCGCTTGCAGCATCACCAAGTAGCGCAGGAGTTATAGTATTTAGTCTTGCAGTCGGCAGTCCTCTGGCAGTTACCACTAATGGAA 801 CTCGGTGTGTTTGATGTCCTTACGTTTATCGTCGAGGTTCTCGCCCGGAGCCAGATACGCTGCCTGAGTGCGACCCAGAACAGGGAACTGAGCGGATTTA GAGCCACACAAACTACAGGAATGCAAATAGCAGCTCCAAGAGCGGGCCTCGGTCTATGCGACGGACTCACGCTGGGTCTTGTCCCTTGACTCGCCTAAAT 901 CCGCTGGAGATGGAACGTACCATGTGGCGAGAAGTGGTCACGGAGGTACGAGCGAACGCAGTCAGGACTTCACCGCCAAATACCTTCAAGAACAACGCCA GGCGACCTCTACCTTGCATGGTACACCGCTCTTCACCAGTGCCTCCATGCTCGCTTGCGTCAGTCCTGAAGTGGCGGTTTATGGAAGTTCTTGTTGCGGT Esp3I ~~~~~ 1001 GTTTATCTCCAGCAGCAACTACACCTTTACCTTGGTTAGTACCCATTTGCTGTCCACCAGTCATGCTAGCCATATGTATATCTCCTTCTTAAAGTCGTCT CAAATAGAGGTCGTCGTTGATGTGGAAATGGAACCAATCATGGGTAAACGACAGGTGGTCAGTACGATCGGTATACATATAGAGGAAGAATTTCAGCAGA Esp3I ~ 1101 CCAGTGCCTCCACCAAGGGCCCATCGGTCTTCCCCCTGGCGCCCTGCTCCAGGAGCACCTCCGAGAGCACAGCGGCCCTGGGCTGCCTGGTCAAGGACTA GGTCACGGAGGTGGTTCCCGGGTAGCCAGAAGGGGGACCGCGGGACGAGGTCCTCGTGGAGGCTCTCGTGTCGCCGGGACCCGACGGACCAGTTCCTGAT 1201 CTTCCCCGAACCGGTGACGGTGTCGTGGAACTCAGGCGCTCTGACCAGCGGCGTGCACACCTTCCCAGCTGTCCTACAGTCCTCAGGACTCTACTCCCTC GAAGGGGCTTGGCCACTGCCACAGCACCTTGAGTCCGCGAGACTGGTCGCCGCACGTGTGGAAGGGTCGACAGGATGTCAGGAGTCCTGAGATGAGGGAG 1301 AGCAGCGTGGTGACCGTGCCCTCCAGCAGCTTGGGCACCCAGACCTACATCTGCAACGTGAATCACAAGCCCAGCAACACCAAGGTGGACAAGAAAGTTG TCGTCGCACCACTGGCACGGGAGGTCGTCGAACCCGTGGGTCTGGATGTAGACGTTGCACTTAGTGTTCGGGTCGTTGTGGTTCCACCTGTTCTTTCAAC 1401 AGCCCAAATCTTGTGACAAAACTCACACATGCCCACCGTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCMAAACCCAAGGA TCGGGTTTAGAACACTGTTTTGAGTGTGTACGGGTGGCACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAGAAGGAGAAGGGGGGKTTTGGGTTCCT 1501 CACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTG GTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCAC 1601 
GAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATG CTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTAC 1701 GCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTA CGTTCCTCATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCCACAT 1801 CACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTACCCCAGCGACATCGCCGTGGAGTGGGAG GTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATGGGGTCGCTGTAGCGGCACCTCACCCTC 1901 AGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCATGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCA TCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGTACGACCTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGT 2001 GGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAAGGGTA CCACCGTCGTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCATTTCCCAT 2101 CATGTCCCATATGCTCGACATGGCAAGCAGCCTGAGACAGATTCTGGACTCCCAGAAAATGGAGTGGAGGTCCAACGCCGGGGGCAGCGGTAGGGATAAG GTACAGGGTATACGAGCTGTACCGTTCGTCGGACTCTGTCTAAGACCTGAGGGTCTTTTACCTCACCTCCAGGTTGCGGCCCCCGTCGCCATCCCTATTC 2201 TGGTCAGATCTGGTACCGCGGGCGGCGACCAGCAGCATGAGCGTGGAATTTTATAACAGCAACAAAAGCGCGCAGACCAACAGCATTACCCCGATTATTA ACCAGTCTAGACCATGGCGCCCGCCGCTGGTCGTCGTACTCGCACCTTAAAATATTGTCGTTGTTTTCGCGCGTCTGGTTGTCGTAATGGGGCTAATAAT 2301 AAATTACCAACACCAGCGATAGCGATCTGAACCTGAACGATGTGAAAGTGCGCTATTATTATACCAGCGATGGCACCCAGGGCCAGACCTTTTGGTGCGA TTTAATGGTTGTGGTCGCTATCGCTAGACTTGGACTTGCTACACTTTCACGCGATAATAATATGGTCGCTACCGTGGGTCCCGGTCTGGAAAACCACGCT 2401 TCATGCGGGCGCGCTGCTGGGCAACAGCTATGTGGATAACACCAGCAAAGTGACCGCGAACTTTGTGAAAGAAACCGCGAGCCCGACCAGCACCTATGAT AGTACGCCCGCGCGACGACCCGTTGTCGATACACCTATTGTGGTCGTTTCACTGGCGCTTGAAACACTTTCTTTGGCGCTCGGGCTGGTCGTGGATACTA 2501 ACCTATGTGGAATTTGGCTTTGCGAGTGGCCGCGCGACCCTGAAAAAAGGCCAGTTTATTACCATTCAGGGCCGCATTACCAAAAGCGATTGGAGCAACT 
TGGATACACCTTAAACCGAAACGCTCACCGGCGCGCTGGGACTTTTTTCCGGTCAAATAATGGTAAGTCCCGGCGTAATGGTTTTCGCTAACCTCGTTGA 2601 ATACCCAGACCAACGATTATAGCTTTGATGCGAGCAGCAGCACCCCGGTGGTGAACCCGAAAGTGACCGGCTATATTGGCGGCGCGAAAGTGCTGGGCAC TATGGGTCTGGTTGCTAATATCGAAACTACGCTCGTCGTCGTGGGGCCACCACTTGGGCTTTCACTGGCCGATATAACCGCCGCGCTTTCACGACCCGTG 2701 CGCGCCGTAAAGCGGCCGCAATTTAATTCCGGTTATTTTCCACCATATTGCCGTCTTTTGGCAATGTGAGGGCCCGGAAACCTGGCCCTGTCTTCTTGAC GCGCGGCATTTCGCCGGCGTTAAATTAAGGCCAATAAAAGGTGGTATAACGGCAGAAAACCGTTACACTCCCGGGCCTTTGGACCGGGACAGAAGAACTG 2801 GAGCATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAGGTCTGTTGAATGTCGTGAAGGAAGCAGTTCCTCTGGAAGCTTCTTGAAGACAAACA CTCGTAAGGATCCCCAGAAAGGGGAGAGCGGTTTCCTTACGTTCCAGACAACTTACAGCACTTCCTTCGTCAAGGAGACCTTCGAAGAACTTCTGTTTGT 2901 ACGTCTGTAGCGACCCTTTGCAGGCAGCGGAACCCCCCACCTGGCGACAGGTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGG TGCAGACATCGCTGGGAAACGTCCGTCGCCTTGGGGGGTGGACCGCTGTCCACGGAGACGCCGGTTTTCGGTGCACATATTCTATGTGGACGTTTCCGCC 3001 CACAACCCCAGTGCCACGTTGTGAGTTGGATAGTTGTGGAAAGAGTCAAATGGCTCACCTCAAGCGTATTCAACAAGGGGCTGAAGGATGCCCAGAAGGT GTGTTGGGGTCACGGTGCAACACTCAACCTATCAACACCTTTCTCAGTTTACCGAGTGGAGTTCGCATAAGTTGTTCCCCGACTTCCTACGGGTCTTCCA 3101 ACCCCATTGTATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGGTTAAAAAACGTCTAGGCCCCCCGAACCACGGGGACG TGGGGTAACATACCCTAGACTAGACCCCGGAGCCACGTGTACGAAATGTACACAAATCAGCTCCAATTTTTTGCAGATCCGGGGGGCTTGGTGCCCCTGC 3201 TGGTTTTCCTTTGAAAAACACGATGATAATATGGCCACCACCCATACCTAGGCTTTTGCAAAGATCGATCAAGAGACAGGATGAGGATCGTTTCGCATGA ACCAAAAGGAAACTTTTTGTGCTACTATTATACCGGTGGTGGGTATGGATCCGAAAACGTTTCTAGCTAGTTCTCTGTCCTACTCCTAGCAAAGCGTACT 3301 TTGAACAAGATGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGACTGGGCACAACAGACAATCGGCTGCTCTGATGCCGC AACTTGTTCTACCTAACGTGCGTCCAAGAGGCCGGCGAACCCACCTCTCCGATAAGCCGATACTGACCCGTGTTGTCTGTTAGCCGACGAGACTACGGCG 3401 CGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAAGACCGACCTGTCCGGTGCCCTGAATGAACTGCAAGACGAGGCAGCGCGGCTATCG GCACAAGGCCGACAGTCGCGTCCCCGCGGGCCAAGAAAAACAGTTCTGGCTGGACAGGCCACGGGACTTACTTGACGTTCTGCTCCGTCGCGCCGATAGC 3501 
TGGCTGGCCACGACGGGCGTTCCTTGCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATC ACCGACCGGTGCTGCCCGCAAGGAACGCGTCGACACGAGCTGCAACAGTGACTTCGCCCTTCCCTGACCGACGATAACCCGCTTCACGGCCCCGTCCTAG 3601 TCCTGTCATCTCACCTTGCTCCTGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTACCTGCCCATTCGACCA AGGACAGTAGAGTGGAACGAGGACGGCTCTTTCATAGGTAGTACCGACTACGTTACGCCGCCGACGTATGCGAACTAGGCCGATGGACGGGTAAGCTGGT 3701 CCAAGCGAAACATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGACGAAGAGCATCAGGGGCTCGCGCCAGCC GGTTCGCTTTGTAGCGTAGCTCGCTCGTGCATGAGCCTACCTTCGGCCAGAACAGCTAGTCCTACTAGACCTGCTTCTCGTAGTCCCCGAGCGCGGTCGG 3801 GAACTGTTCGCCAGGCTCAAGGCGAGCATGCCCGACGGCGAGGATCTCGTCGTGACCCATGGCGATGCCTGCTTGCCGAATATCATGGTGGAAAATGGCC CTTGACAAGCGGTCCGAGTTCCGCTCGTACGGGCTGCCGCTCCTAGAGCAGCACTGGGTACCGCTACGGACGAACGGCTTATAGTACCACCTTTTACCGG 3901 GCTTTTCTGGATTCATCGACTGTGGCCGGCTGGGTGTGGCGGACCGCTATCAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGA CGAAAAGACCTAAGTAGCTGACACCGGCCGACCCACACCGCCTGGCGATAGTCCTGTATCGCAACCGATGGGCACTATAACGACTTCTCGAACCGCCGCT 4001 ATGGGCTGACCGCTTCCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACGAGTTCTTCTGAGCGGGACTC TACCCGACTGGCGAAGGAGCACGAAATGCCATAGCGGCGAGGGCTAAGCGTCGCGTAGCGGAAGATAGCGGAAGAACTGCTCAAGAAGACTCGCCCTGAG 4101 TGGGGTTCGGGCCGCACTCGAGCATAAACTTGTTTATTGCAGCTTATAATGGTTACAAATAAAGCAATAGCATCACAAATTTCACAAATAAAGCATTTTT ACCCCAAGCCCGGCGTGAGCTCGTATTTGAACAAATAACGTCGAATATTACCAATGTTTATTTCGTTATCGTAGTGTTTAAAGTGTTTATTTCGTAAAAA I-SceI ~~~~~~~~~~~~~~~~~~~ 4201 TTCACTGCATTCTAGTTGTGGTTTGTCCAAACTCATCAATGTATCTTAAGTAGGGATAACAGGGTAATTTTGTTAAATCAGCTCATTTTTTAACCAATAG AAGTGACGTAAGATCAACACCAAACAGGTTTGAGTAGTTACATAGAATTCATCCCTATTGTCCCATTAAAACAATTTAGTCGAGTAAAAAATTGGTTATC 4301 GAACGCCATCAAAAATAATTCGCGTCTGGCCTTCCTGTAGCCAGCTTTCATCAACATTAAATGTGAGCGAGTAACAACCCGTCGGATTCTCCGTGGGAAC CTTGCGGTAGTTTTTATTAAGCGCAGACCGGAAGGACATCGGTCGAAAGTAGTTGTAATTTACACTCGCTCATTGTTGGGCAGCCTAAGAGGCACCCTTG 4401 AAACGGCGGATTGACCGTAATGGGATAGGTTACGTTGGTGTAGATGGGCGCATCGTAACCGTGCATCTGCCAGTTTGAGGGGACGACGACCGTATCGGCC 
TTTGCCGCCTAACTGGCATTACCCTATCCAATGCAACCACATCTACCCGCGTAGCATTGGCACGTAGACGGTCAAACTCCCCTGCTGCTGGCATAGCCGG 4501 TCAGGAAGATCGCACTCCAGCCAGCTTTCCGGCACCGCTTCTGGTGCCGGAAACCAGGCAAAGCGCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAG AGTCCTTCTAGCGTGAGGTCGGTCGAAAGGCCGTGGCGAAGACCACGGCCTTTGGTCCGTTTCGCGGTAAGCGGTAAGTCCGACGCGTTGACAACCCTTC 4601 GGCGATCGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACG CCGCTAGCCACGCCCGGAGAAGCGATAATGCGGTCGACCGCTTTCCCCCTACACGACGTTCCGCTAATTCAACCCATTGCGGTCCCAAAAGGGTCAGTGC 4701 ACGTTGTAAAACGACGGCCAGTGAATTGCAATTCGTAATCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACACAACATACG TGCAACATTTTGCTGCCGGTCACTTAACGTTAAGCATTAGTACCAGTATCGACAAAGGACACACTTTAACAATAGGCGAGTGTTAAGGTGTGTTGTATGC I-SceI ~~~~~~~~~~~~~~~~~~~~ 4801 AGCCGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAATGAGTGAGCTAACTCACATTAATTGCGTTGCGCTCACTGCCATTACCCTGTTATCCCTAGTGA TCGGCCTTCGTATTTCACATTTCGGACCCCACGGATTACTCACTCGATTGAGTGTAATTAACGCAACGCGAGTGACGGTAATGGGACAATAGGGATCACT 4901 ACCATCACCCTAATCAAGTTTTTTGGGGTCGAGGTGCCGTAAAGCACTAAATCGGAACCCTAAAGGGAGCCCCCGATTTAGAGCTTGACGGGGAAAGCCG TGGTAGTGGGATTAGTTCAAAAAACCCCAGCTCCACGGCATTTCGTGATTTAGCCTTGGGATTTCCCTCGGGGGCTAAATCTCGAACTGCCCCTTTCGGC 5001 GCGAACGTGGCGAGAAAGGAAGGGAAGAAAGCGAAAGGAGCGGGCGCTAGGGCGCTGGCAAGTGTAGCGGTCACGCTGCGCGTAACCACCACACCCGCCG CGCTTGCACCGCTCTTTCCTTCCCTTCTTTCGCTTTCCTCGCCCGCGATCCCGCGACCGTTCACATCGCCAGTGCGACGCGCATTGGTGGTGTGGGCGGC 5101 CGCTTAATGCGCCGCTACAGGGCGCGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTA GCGAATTACGCGGCGATGTCCCGCGCAGTCCACCGTGAAAAGCCCCTTTACACGCGCCTTGGGGATAAACAAATAAAAAGATTTATGTAAGTTTATACAT 5201 TCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATAACGACCGGTAATGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTAT AGGCGAGTACTCTGTTATTGGGACTATTTACGAAGTTATTATTGCTGGCCATTACTTTTTCCTTCTCATACTCATAAGTTGTAAAGGCACAGCGGGAATA 5301 TCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTAC AGGGAAAAAACGCCGTAAAACGGAAGGACAAAAACGAGTGGGTCTTTGCGACCACTTTCATTTTCTACGACTTCTAGTCAACCCACGTGCTCACCCAATG 5401 
ATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGG TAGCTTGACCTAGAGTTGTCGCCATTCTAGGAACTCTCAAAAGCGGGGCTTCTTGCAAAAGGTTACTACTCGTGAAAATTTCAAGACGATACACCGCGCC 5501 TATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTCTAGCGTTGATCGGCACGTAAGAGG ATAATAGGGCATAACTGCGGCCCGTTCTCGTTGAGCCAGCGGCGTATGTGATAAGAGTCTTACTGAACCAACTCAGATCGCAACTAGCCGTGCATTCTCC 5601 TTCCAACTTTCACCATAATGAAATAAGATCACTACCGGGCGTATTTTTTGAGTTATCGAGATTTTCAGGAGCTAAGGAAGCTAAAATGGAGAAAAAAATC AAGGTTGAAAGTGGTATTACTTTATTCTAGTGATGGCCCGCATAAAAAACTCAATAGCTCTAAAAGTCCTCGATTCCTTCGATTTTACCTCTTTTTTTAG 5701 ACTGGATATACCACCGTTGATATATCCCAATGGCATCGTAAAGAACATTTTGAGGCATTTCAGTCAGTTGCTCAATGTACCTATAACCAGACCGTTCAGC TGACCTATATGGTGGCAACTATATAGGGTTACCGTAGCATTTCTTGTAAAACTCCGTAAAGTCAGTCAACGAGTTACATGGATATTGGTCTGGCAAGTCG 5801 TGGATATTACGGCCTTTTTAAAGACCGTAAAGAAAAATAAGCACAAGTTTTATCCGGCCTTTATTCACATTCTTGCCCGCCTGATGAATGCTCATCCGGA ACCTATAATGCCGGAAAAATTTCTGGCATTTCTTTTTATTCGTGTTCAAAATAGGCCGGAAATAAGTGTAAGAACGGGCGGACTACTTACGAGTAGGCCT 5901 ATTCCGTATGGCAATGAAAGACGGTGAGCTGGTGATATGGGATAGTGTTCACCCTTGTTACACCGTTTTCCATGAGCAAACTGAAACGTTTTCATCGCTC TAAGGCATACCGTTACTTTCTGCCACTCGACCACTATACCCTATCACAAGTGGGAACAATGTGGCAAAAGGTACTCGTTTGACTTTGCAAAAGTAGCGAG 6001 TGGAGTGAATACCACGACGATTTCCGGCAGTTTCTACACATATATTCGCAAGATGTGGCGTGTTACGGTGAAAACCTGGCCTATTTCCCTAAAGGGTTTA ACCTCACTTATGGTGCTGCTAAAGGCCGTCAAAGATGTGTATATAAGCGTTCTACACCGCACAATGCCACTTTTGGACCGGATAAAGGGATTTCCCAAAT 6101 TTGAGAATATGTTTTTCGTATCAGCCAATCCCTGGGTGAGTTTCACCAGTTTTGATTTAAACGTGGCCAATATGGACAACTTCTTCGCCCCCGTTTTCAC AACTCTTATACAAAAAGCATAGTCGGTTAGGGACCCACTCAAAGTGGTCAAAACTAAATTTGCACCGGTTATACCTGTTGAAGAAGCGGGGGCAAAAGTG 6201 CATGGGCAAATATTATACGCAAGGCGACAAGGTGCTGATGCCGCTGGCGATTCAGGTTCATCATGCCGTCTGTGATGGCTTCCATGTCGGCAGAATGCTT GTACCCGTTTATAATATGCGTTCCGCTGTTCCACGACTACGGCGACCGCTAAGTCCAAGTAGTACGGCAGACACTACCGAAGGTACAGCCGTCTTACGAA 6301 AATGAATTACAACAGTACTGCGATGAGTGGCAGGGCGGGGCGTAATTTTTTTAAGGCAGTTATTGGTGCCCTTAAACGCCTGGTGCTACGCCTGAATAAG 
TTACTTAATGTTGTCATGACGCTACTCACCGTCCCGCCCCGCATTAAAAAAATTCCGTCAATAACCACGGGAATTTGCGGACCACGATGCGGACTTATTC 6401 TGATAATAAGCGGATGAATGGCAGAAATTCGAAATGACCGACCAAGCGACGCCCAACCTGCCATCACGAGATTTCGATTCCACCGCCGCCTTCTATGAAA ACTATTATTCGCCTACTTACCGTCTTTAAGCTTTACTGGCTGGTTCGCTGCGGGTTGGACGGTAGTGCTCTAAAGCTAAGGTGGCGGCGGAAGATACTTT 6501 GGTTGGGCTTCGGAATCGTTTTCCGGGACGCCGGCTGGATGATCCTCCAGCGCGGGGATCTCATGCTGGAGTTCTTCGCCCACCCTAGGGGGAGGCTAAC CCAACCCGAAGCCTTAGCAAAAGGCCCTGCGGCCGACCTACTAGGAGGTCGCGCCCCTAGAGTACGACCTCAAGAAGCGGGTGGGATCCCCCTCCGATTG 6601 TGAAACACGGAAGGAGACAATACCGGAAGGAACCCGCGCTATGACGGCAATAAAAAGACAGAATAAAACGCACGGTGTTGGGTCGTTTGTTCATAAACGC ACTTTGTGCCTTCCTCTGTTATGGCCTTCCTTGGGCGCGATACTGCCGTTATTTTTCTGTCTTATTTTGCGTGCCACAACCCAGCAAACAAGTATTTGCG 6701 GGGGTTCGGTCCCAGGGCTGGCACTCTGTCGATACCCCACCGAGACCCCATTGGGGCCAATACGCCCGCGTTTCTTCCTTTTCCCCACCCCACCCCCCAA CCCCAAGCCAGGGTCCCGACCGTGAGACAGCTATGGGGTGGCTCTGGGGTAACCCCGGTTATGCGGGCGCAAAGAAGGAAAAGGGGTGGGGTGGGGGGTT 6801 GTTCGGGTGAAGGCCCAGGGCTCGCAGCCAACGTCGGGGCGGCAGGCCCTGCCATAGCCTCAGGTTACTCATATATACTTTAGATTGATTTAAAACTTCA CAAGCCCACTTCCGGGTCCCGAGCGTCGGTTGCAGCCCCGCCGTCCGGGACGGTATCGGAGTCCAATGAGTATATATGAAATCTAACTAAATTTTGAAGT 6901 TTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTA AAAAATTAAATTTTCCTAGATCCACTTCTAGGAAAAACTATTAGAGTACTGGTTTTAGGGAATTGCACTCAAAAGCAAGGTGACTCGCAGTCTGGGGCAT 7001 GAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGG CTTTTCTAGTTTCCTAGAAGAACTCTAGGAAAAAAAGACGCGCATTAGACGACGAACGTTTGTTTTTTTGGTGGCGATGGTCGCCACCAAACAAACGGCC 7101 ATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTT TAGTTCTCGATGGTTGAGAAAAAGGCTTCCATTGACCGAAGTCGTCTCGCGTCTATGGTTTATGACAGGAAGATCACATCGGCATCAATCCGGTGGTGAA 7201 CAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCA GTTCTTGAGACATCGTGGCGGATGTATGGAGCGAGACGATTAGGACAATGGTCACCGACGACGGTCACCGCTATTCAGCACAGAATGGCCCAACCTGAGT 7301 
AGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACC TCTGCTATCAATGGCCTATTCCGCGTCGCCAGCCCGACTTGCCCCCCAAGCACGTGTGTCGGGTCGAACCTCGCTTGCTGGATGTGGCTTGACTCTATGG 7401 TACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGA ATGTCGCACTCGATACTCTTTCGCGGTGCGAAGGGCTTCCCTCTTTCCGCCTGTCCATAGGCCATTCGCCGTCCCAGCCTTGTCCTCTCGCGTGCTCCCT 7501 GCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGC CGAAGGTCCCCCTTTGCGGACCATAGAAATATCAGGACAGCCCAAAGCGGTGGAGACTGAACTCGCAGCTAAAAACACTACGAGCAGTCCCCCCGCCTCG 7601 CTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGG GATACCTTTTTGCGGTCGTTGCGCCGGAAAAATGCCAAGGACCGGAAAACGACCGGAAAACGAGTGTACAAGAAAGGACGCAATAGGGGACTAAGACACC 7701 ATAACCGTATTACCGCCATGCATTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGG TATTGGCATAATGGCGGTACGTAATCAATAATTATCATTAGTTAATGCCCCAGTAATCAAGTATCGGGTATATACCTCAAGGCGCAATGTATTGAATGCC 7801 TAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACG ATTTACCGGGCGGACCGACTGGCGGGTTGCTGGGGGCGGGTAACTGCAGTTATTACTGCATACAAGGGTATCATTGCGGTTATCCCTGAAAGGTAACTGC 7901 TCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGG AGTTACCCACCTCATAAATGCCATTTGACGGGTGAACCGTCATGTAGTTCACATAGTATACGGTTCATGCGGGGGATAACTGCAGTTACTGCCATTTACC 8001 CCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTT GGGCGGACCGTAATACGGGTCATGTACTGGAATACCCTGAAAGGATGAACCGTCATGTAGATGCATAATCAGTAGCGATAATGGTACCACTACGCCAAAA 8101 GGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCA CCGTCATGTAGTTACCCGCACCTATCGCCAAACTGAGTGCCCCTAAAGGTTCAGAGGTGGGGTAACTGCAGTTACCCTCAAACAAAACCGTGGTTTTAGT 8201 ACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCT 
TGCCCTGAAAGGTTTTACAGCATTGTTGAGGCGGGGTAACTGCGTTTACCCGCCATCCGCACATGCCACCCTCCAGATATATTCGTCTCGA

    TABLE-US-00004 APPENDIX 2 Sequences of cloned light chains. Each entry gives the row's position index in parentheses and its embedded-image reference in brackets; "gap" marks a row shown only as dashes.

    Clone   Block 1         Block 2         Block 3         Block 4
    16      (1) [00001]     (64) [00017]    (133) [00034]   (203) [00051]
    6       (1) [00002]     (64) [00018]    (134) [00035]   (204) [00052]
    22      (1) [00003]     (64) [00019]    (134) [00036]   (204) [00053]
    1       (1) [00004]     (71) [00020]    (141) [00037]   (211) [00054]
    21      (1) [00005]     (71) [00021]    (140) [00038]   (210) [00055]
    24      (1) gap         (1) [00022]     (24) [00039]    (94) [00056]
    33      (1) [00006]     (66) [00023]    (135) [00040]   (205) [00057]
    33-35   (1) [00007]     (66) [00024]    (135) [00041]   (205) [00058]
    41      (1) [00008]     (66) [00025]    (135) [00042]   (205) [00059]
    7       (1) [00009]     (66) [00026]    (135) [00043]   (205) [00060]
    7-7     (1) [00010]     (66) [00027]    (135) [00044]   (205) [00061]
    41-40   (1) [00011]     (66) [00028]    (135) [00045]   (205) [00062]
    8       (1) [00012]     (65) [00029]    (134) [00046]   (204) [00063]
    4       (1) [00013]     (64) [00030]    (132) [00047]   (179) gap
    9       (1) [00014]     (65) [00031]    (134) [00048]   (204) [00064]
    31      (1) [00015]     (65) [00032]    (135) [00049]   (205) [00065]
    17      (1) [00016]     (66) [00033]    (135) [00050]   (205) [00066]

    TABLE-US-00005 APPENDIX 3 Alignment of sequences of cloned variable domains of heavy chains. Each entry gives the row's position index in parentheses and its embedded-image reference in brackets; "gap" marks a row shown only as dashes.

    Clone   Block 1         Block 2         Block 3         Block 4
    14      (1) [00067]     (69) [00079]    (137) [00091]   (207) [00103]
    15      (1) [00068]     (69) [00080]    (136) [00092]   (206) [00104]
    1       (1) [00069]     (71) [00081]    (128) [00093]   (198) [00105]
    21      (1) [00070]     (69) [00082]    (135) [00094]   (148) gap
    33      (1) [00071]     (69) [00083]    (129) [00095]   (142) gap
    41      (1) [00072]     (69) [00084]    (130) [00096]   (143) gap
    6       (1) [00073]     (69) [00085]    (137) [00097]   (207) [00106]
    7       (1) [00074]     (69) [00086]    (127) [00098]   (197) [00107]
    8       (1) [00075]     (69) [00087]    (129) [00099]   (199) [00108]
    9       (1) [00076]     (69) [00088]    (133) [00100]   (203) [00109]
    32      (1) [00077]     (69) [00089]    (133) [00101]   (203) [00110]
    31      (1) [00078]     (69) [00090]    (133) [00102]   (203) [00111]

    TABLE-US-00006 APPENDIX4 SequencesofplasmidsencodingspAG-MLucandspAG-N-MLuchybrids. pETspAG-N-MLuc1 1 GGAAAAATGCCTGGCAAAAAACTGCCACTGGCAGTTATCATGGAAATGGAAGCCAATGCTTTCAAAGCTGGCTGCACCAG CCTTTTTACGGACCGTTTTTTGACGGTGACCGTCAATAGTACCTTTACCTTCGGTTACGAAAGTTTCGACCGACGTGGTC GGGATGCCTTATCTGTCTTT CCCTACGGAATAGACAGAAA 101 CAAAAATTAAGTGTACAGCCAAAATGAAGGTATACATTCCAGGAAGGTGTCACGATTATGGTGGTGACAAGAAAACTGGA GTTTTTAATTCACATGTCGGTTTTACTTCCATATGTAAGGTCCTTCCACAGTGCTAATACCACCACTGTTCTTTTGACCT CAGGCAGGAATTGTTGGTGC GTCCGTCCTTAACAACCACG 201 AATTGTTGACATTCCCGAAATCTCTGGATTTAAGGAGATGGCACCCATGGAACAGTTCATTGCTCAAGTTGATCGCTGCG TTAACAACTGTAAGGGCTTTAGAGACCTAAATTCCTCTACCGTGGGTACCTTGTCAAGTAACGAGTTCAACTAGCGACGC CTTCCTGCACTACTGGATGT GAAGGACGTGATGACCTACA 301 CTCAAAGGTCTTGCCAATGTTAAGTGCTCTGAACTCCTGAAGAAATGGCTGCCTGACAGGTGTGCAAGTTTTGCTGACAA GAGTTTCCAGAACGGTTACAATTCACGAGACTTGAGGACTTCTTTACCGACGGACTGTCCACACGTTCAAAACGACTGTT GATTCAAAAAGAAGTTCACA CTAAGTTTTTCTTCAAGTGT 401 ATATCAAAGGCATGGCCGTACAGCTGCAGGTCGAGCACCACCACCACCACCACTGAGATCCGGCTGCTAACAAAGCCCGA TATAGTTTCCGTACCGGCATGTCGACGTCCAGCTCGTGGTGGTGGTGGTGGTGACTCTAGGCCGACGATTGTTTCGGGCT AAGGAAGCTGAGTTGGCTGC TTCCTTCGACTCAACCGACG 501 TGCCACCGCTGAGCAATAACTAGCATAACCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTGCTGAAAGGAGGAA ACGGTGGCGACTCGTTATTGATCGTATTGGGGAACCCCGGAGATTTGCCCAGAACTCCCCAAAAAACGACTTTCCTCCTT CTATATCCGGATTGGCGAAT GATATAGGCCTAACCGCTTA 601 GGGACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCC CCCTGCGCGGGACATCGCCGCGTAATTCGCGCCGCCCACACCACCAATGCGCGTCGCACTGGCGATGTGAACGGTCGCGG CTAGCGCCCGCTCCTTTCGC GATCGCGGGCGAGGAAAGCG 701 TTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGAT AAAGAAGGGAAGGAAAGAGCGGTGCAAGCGGCCGAAAGGGGCAGTTCGAGATTTAGCCCCCGAGGGAAATCCCAAGGCTA TTAGTGCTTTACGGCACCTC AATCACGAAATGCCGTGGAG 801 GACCCCAAAAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGACGTT CTGGGGTTTTTTGAACTAATCCCACTACCAAGTGCATCACCCGGTAGCGGGACTATCTGCCAAAAAGCGGGAAACTGCAA GGAGTCCACGTTCTTTAATA CCTCAGGTGCAAGAAATTAT 901 
GTGGACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGGTCTATTCTTTTGATTTATAAGGGATTTTGCCGATT CACCTGAGAACAAGGTTTGACCTTGTTGTGAGTTGGGATAGAGCCAGATAAGAAAACTAAATATTCCCTAAAACGGCTAA TCGGCCTATTGGTTAAAAAA AGCCGGATAACCAATTTTTT 1001 TGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAATATTAACGTTTACAATTTCAGGTGGCACTTTTCGGGGAA ACTCGACTAAATTGTTTTTAAATTGCGCTTAAAATTGTTTTATAATTGCAAATGTTAAAGTCCACCGTGAAAAGCCCCTT ATGTGCGCGGAACCCCTATT TACACGCGCCTTGGGGATAA 1101 TGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAATTAATTCTTAGAAAAACTCATCGAGCATCAAATGAA ACAAATAAAAAGATTTATGTAAGTTTATACATAGGCGAGTACTTAATTAAGAATCTTTTTGAGTAGCTCGTAGTTTACTT ACTGCAATTTATTCATATCA TGACGTTAAATAAGTATAGT 1201 GGATTATCAATACCATATTTTTGAAAAAGCCGTTTCTGTAATGAAGGAGAAAACTCACCGAGGCAGTTCCATAGGATGGC CCTAATAGTTATGGTATAAAAACTTTTTCGGCAAAGACATTACTTCCTCTTTTGAGTGGCTCCGTCAAGGTATCCTACCG AAGATCCTGGTATCGGTCTG TTCTAGGACCATAGCCAGAC 1301 CGATTCCGACTCGTCCAACATCAATACAACCTATTAATTTCCCCTCGTCAAAAATAAGGTTATCAAGTGAGAAATCACCA GCTAAGGCTGAGCAGGTTGTAGTTATGTTGGATAATTAAAGGGGAGCAGTTTTTATTCCAATAGTTCACTCTTTAGTGGT TGAGTGACGACTGAATCCGG ACTCACTGCTGACTTAGGCC 1401 TGAGAATGGCAAAAGTTTATGCATTTCTTTCCAGACTTGTTCAACAGGCCAGCCATTACGCTCGTCATCAAAATCACTCG ACTCTTACCGTTTTCAAATACGTAAAGAAAGGTCTGAACAAGTTGTCCGGTCGGTAATGCGAGCAGTAGTTTTAGTGAGC CATCAACCAAACCGTTATTC GTAGTTGGTTTGGCAATAAG 1501 ATTCGTGATTGCGCCTGAGCGAGACGAAATACGCGATCGCTGTTAAAAGGACAATTACAAACAGGAATCGAATGCAACCG TAAGCACTAACGCGGACTCGCTCTGCTTTATGCGCTAGCGACAATTTTCCTGTTAATGTTTGTCCTTAGCTTACGTTGGC GCGCAGGAACACTGCCAGCG CGCGTCCTTGTGACGGTCGC 1601 CATCAACAATATTTTCACCTGAATCAGGATATTCTTCTAATACCTGGAATGCTGTTTTCCCGGGGATCGCAGTGGTGAGT GTAGTTGTTATAAAAGTGGACTTAGTCCTATAAGAAGATTATGGACCTTACGACAAAAGGGCCCCTAGCGTCACCACTCA AACCATGCATCATCAGGAGT TTGGTACGTAGTAGTCCTCA 1701 ACGGATAAAATGCTTGATGGTCGGAAGAGGCATAAATTCCGTCAGCCAGTTTAGTCTGACCATCTCATCTGTAACATCAT TGCCTATTTTACGAACTACCAGCCTTCTCCGTATTTAAGGCAGTCGGTCAAATCAGACTGGTAGAGTAGACATTGTAGTA TGGCAACGCTACCTTTGCCA ACCGTTGCGATGGAAACGGT 1801 TGTTTCAGAAACAACTCTGGCGCATCGGGCTTCCCATACAATCGATAGATTGTCGCACCTGATTGCCCGACATTATCGCG 
ACAAAGTCTTTGTTGAGACCGCGTAGCCCGAAGGGTATGTTAGCTATCTAACAGCGTGGACTAACGGGCTGTAATAGCGC AGCCCATTTATACCCATATA TCGGGTAAATATGGGTATAT 1901 AATCAGCATCCATGTTGGAATTTAATCGCGGCCTAGAGCAAGACGTTTCCCGTTGAATATGGCTCATAACACCCCTTGTA TTAGTCGTAGGTACAACCTTAAATTAGCGCCGGATCTCGTTCTGCAAAGGGCAACTTATACCGAGTATTGTGGGGAACAT TTACTGTTTATGTAAGCAGA AATGACAAATACATTCGTCT 2001 CAGTTTTATTGTTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAA GTCAAAATAACAAGTACTGGTTTTAGGGAATTGCACTCAAAAGCAAGGTGACTCGCAGTCTGGGGCATCTTTTCTAGTTT GGATCTTCTTGAGATCCTTT CCTAGAAGAACTCTAGGAAA 2101 TTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTAC AAAAGACGCGCATTAGACGACGAACGTTTGTTTTTTTGGTGGCGATGGTCGCCACCAAACAAACGGCCTAGTTCTCGATG CAACTCTTTTTCCGAAGGTA GTTGAGAAAAAGGCTTCCAT 2201 ACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGT TGACCGAAGTCGTCTCGCGTCTATGGTTTATGACAGGAAGATCACATCGGCATCAATCCGGTGGTGAAGTTCTTGAGACA AGCACCGCCTACATACCTCG TCGTGGCGGATGTATGGAGC 2301 CTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTA GAGACGATTAGGACAATGGTCACCGACGACGGTCACCGCTATTCAGCACAGAATGGCCCAACCTGAGTTCTGCTATCAAT CCGGATAAGGCGCAGCGGTC GGCCTATTCCGCGTCGCCAG 2401 GGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGC CCCGACTTGCCCCCCAAGCACGTGTGTCGGGTCGAACCTCGCTTGCTGGATGTGGCTTGACTCTATGGATGTCGCACTCG TATGAGAAAGCGCCACGCTT ATACTCTTTCGCGGTGCGAA 2501 CCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGG GGGCTTCCCTCTTTCCGCCTGTCCATAGGCCATTCGCCGTCCCAGCCTTGTCCTCTCGCGTGCTCCCTCGAAGGTCCCCC AAACGCCTGGTATCTTTATA TTTGCGGACCATAGAAATAT 2601 GTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAAC CAGGACAGCCCAAAGCGGTGGAGACTGAACTCGCAGCTAAAAACACTACGAGCAGTCCCCCCGCCTCGGATACCTTTTTG GCCAGCAACGCGGCCTTTTT CGGTCGTTGCGCCGGAAAAA 2701 ACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTA TGCCAAGGACCGGAAAACGACCGGAAAACGAGTGTACAAGAAAGGACGCAATAGGGGACTAAGACACCTATTGGCATAAT CCGCCTTTGAGTGAGCTGAT 
GGCGGAAACTCACTCGACTA 2801 ACCGCTCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCTGATGCGGTATTTTCT TGGCGAGCGGCGTCGGCTTGCTGGCTCGCGTCGCTCAGTCACTCGCTCCTTCGCCTTCTCGCGGACTACGCCATAAAAGA CCTTACGCATCTGTGCGGTA GGAATGCGTAGACACGCCAT 2901 TTTCACACCGCATATATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGTATACACTCCGCTATC AAAGTGTGGCGTATATACCACGTGAGAGTCATGTTAGACGAGACTACGGCGTATCAATTCGGTCATATGTGAGGCGATAG GCTACGTGACTGGGTCATGG CGATGCACTGACCCAGTACC 3001 CTGCGCCCCGACACCCGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGACAAGCT GACGCGGGGCTGTGGGCGGTTGTGGGCGACTGCGCGGGACTGCCCGAACAGACGAGGGCCGTAGGCGAATGTCTGTTCGA GTGACCGTCTCCGGGAGCTG CACTGGCAGAGGCCCTCGAC 3101 CATGTGTCAGAGGTTTTCACCGTCATCACCGAAACGCGCGAGGCAGCTGCGGTAAAGCTCATCAGCGTGGTCGTGAAGCG GTACACAGTCTCCAAAAGTGGCAGTAGTGGCTTTGCGCGCTCCGTCGACGCCATTTCGAGTAGTCGCACCAGCACTTCGC ATTCACAGATGTCTGCCTGT TAAGTGTCTACAGACGGACA 3201 TCATCCGCGTCCAGCTCGTTGAGTTTCTCCAGAAGCGTTAATGTCTGGCTTCTGATAAAGCGGGCCATGTTAAGGGCGGT AGTAGGCGCAGGTCGAGCAACTCAAAGAGGTCTTCGCAATTACAGACCGAAGACTATTTCGCCCGGTACAATTCCCGCCA TTTTTCCTGTTTGGTCACTG AAAAAGGACAAACCAGTGAC 3301 ATGCCTCCGTGTAAGGGGGATTTCTGTTCATGGGGGTAATGATACCGATGAAACGAGAGAGGATGCTCACGATACGGGTT TACGGAGGCACATTCCCCCTAAAGACAAGTACCCCCATTACTATGGCTACTTTGCTCTCTCCTACGAGTGCTATGCCCAA ACTGATGATGAACATGCCCG TGACTACTACTTGTACGGGC 3401 GTTACTGGAACGTTGTGAGGGTAAACAACTGGCGGTATGGATGCGGCGGGACCAGAGAAAAATCACTCAGGGTCAATGCC CAATGACCTTGCAACACTCCCATTTGTTGACCGCCATACCTACGCCGCCCTGGTCTCTTTTTAGTGAGTCCCAGTTACGG AGCGCTTCGTTAATACAGAT TCGCGAAGCAATTATGTCTA 3501 GTAGGTGTTCCACAGGGTAGCCAGCAGCATCCTGCGATGCAGATCCGGAACATAATGGTGCAGGGCGCTGACTTCCGCGT CATCCACAAGGTGTCCCATCGGTCGTCGTAGGACGCTACGTCTAGGCCTTGTATTACCACGTCCCGCGACTGAAGGCGCA TTCCAGACTTTACGAAACAC AAGGTCTGAAATGCTTTGTG 3601 GGAAACCGAAGACCATTCATGTTGTTGCTCAGGTCGCAGACGTTTTGCAGCAGCAGTCGCTTCACGTTCGCTCGCGTATC CCTTTGGCTTCTGGTAAGTACAACAACGAGTCCAGCGTCTGCAAAACGTCGTCGTCAGCGAAGTGCAAGCGAGCGCATAG GGTGATTCATTCTGCTAACC CCACTAAGTAAGACGATTGG 3701 AGTAAGGCAACCCCGCCAGCCTAGCCGGGTCCTCAACGACAGGAGCACGATCATGCGCACCCGTGGGGCCGCCATGCCGG 
TCATTCCGTTGGGGCGGTCGGATCGGCCCAGGAGTTGCTGTCCTCGTGCTAGTACGCGTGGGCACCCCGGCGGTACGGCC CGATAATGGCCTGCTTCTCG GCTATTACCGGACGAAGAGC 3801 CCGAAACGTTTGGTGGCGGGACCAGTGACGAAGGCTTGAGCGAGGGCGTGCAAGATTCCGAATACCGCAAGCGACAGGCC GGCTTTGCAAACCACCGCCCTGGTCACTGCTTCCGAACTCGCTCCCGCACGTTCTAAGGCTTATGGCGTTCGCTGTCCGG GATCATCGTCGCGCTCCAGC CTAGTAGCAGCGCGAGGTCG 3901 GAAAGCGGTCCTCGCCGAAAATGACCCAGAGCGCTGCCGGCACCTGTCCTACGAGTTGCATGATAAAGAAGACAGTCATA CTTTCGCCAGGAGCGGCTTTTACTGGGTCTCGCGACGGCCGTGGACAGGATGCTCAACGTACTATTTCTTCTGTCAGTAT AGTGCGGCGACGATAGTCAT TCACGCCGCTGCTATCAGTA 4001 GCCCCGCGCCCACCGGAAGGAGCTGACTGGGTTGAAGGCTCTCAAGGGCATCGGTCGAGATCCCGGTGCCTAATGAGTGA CGGGGCGCGGGTGGCCTTCCTCGACTGACCCAACTTCCGAGAGTTCCCGTAGCCAGCTCTAGGGCCACGGATTACTCACT GCTAACTTACATTAATTGCG CGATTGAATGTAATTAACGC 4101 TTGCGCTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGG AACGCGAGTGACGGGCGAAAGGTCAGCCCTTTGGACAGCACGGTCGACGTAATTACTTAGCCGGTTGCGCGCCCCTCTCC CGGTTTGCGTATTGGGCGCC GCCAAACGCATAACCCGCGG 4201 AGGGTGGTTTTTCTTTTCACCAGTGAGACGGGCAACAGCTGATTGCCCTTCACCGCCTGGCCCTGAGAGAGTTGCAGCAA TCCCACCAAAAAGAAAAGTGGTCACTCTGCCCGTTGTCGACTAACGGGAAGTGGCGGACCGGGACTCTCTCAACGTCGTT GCGGTCCACGCTGGTTTGCC CGCCAGGTGCGACCAAACGG 4301 CCAGCAGGCGAAAATCCTGTTTGATGGTGGTTAACGGCGGGATATAACATGAGCTGTCTTCGGTATCGTCGTATCCCACT GGTCGTCCGCTTTTAGGACAAACTACCACCAATTGCCGCCCTATATTGTACTCGACAGAAGCCATAGCAGCATAGGGTGA ACCGAGATATCCGCACCAAC TGGCTCTATAGGCGTGGTTG 4401 GCGCAGCCCGGACTCGGTAATGGCGCGCATTGCGCCCAGCGCCATCTGATCGTTGGCAACCAGCATCGCAGTGGGAACGA CGCGTCGGGCCTGAGCCATTACCGCGCGTAACGCGGGTCGCGGTAGACTAGCAACCGTTGGTCGTAGCGTCACCCTTGCT TGCCCTCATTCAGCATTTGC ACGGGAGTAAGTCGTAAACG 4501 ATGGTTTGTTGAAAACCGGACATGGCACTCCAGTCGCCTTCCCGTTCCGCTATCGGCTGAATTTGATTGCGAGTGAGATA TACCAAACAACTTTTGGCCTGTACCGTGAGGTCAGCGGAAGGGCAAGGCGATAGCCGACTTAAACTAACGCTCACTCTAT TTTATGCCAGCCAGCCAGAC AAATACGGTCGGTCGGTCTG 4601 GCAGACGCGCCGAGACAGAACTTAATGGGCCCGCTAACAGCGCGATTTGCTGGTGACCCAATGCGACCAGATGCTCCACG CGTCTGCGCGGCTCTGTCTTGAATTACCCGGGCGATTGTCGCGCTAAACGACCACTGGGTTACGCTGGTCTACGAGGTGC CCCAGTCGCGTACCGTCTTC 
GGGTCAGCGCATGGCAGAAG 4701 ATGGGAGAAAATAATACTGTTGATGGGTGTCTGGTCAGAGACATCAAGAAATAACGCCGGAACATTAGTGCAGGCAGCTT TACCCTCTTTTATTATGACAACTACCCACAGACCAGTCTCTGTAGTTCTTTATTGCGGCCTTGTAATCACGTCCGTCGAA CCACAGCAATGGCATCCTGG GGTGTCGTTACCGTAGGACC 4801 TCATCCAGCGGATAGTTAATGATCAGCCCACTGACGCGTTGCGCGAGAAGATTGTGCACCGCCGCTTTACAGGCTTCGAC AGTAGGTCGCCTATCAATTACTAGTCGGGTGACTGCGCAACGCGCTCTTCTAACACGTGGCGGCGAAATGTCCGAAGCTG GCCGCTTCGTTCTACCATCG CGGCGAAGCAAGATGGTAGC 4901 ACACCACCACGCTGGCACCCAGTTGATCGGCGCGAGATTTAATCGCCGCGACAATTTGCGACGGCGCGTGCAGGGCCAGA TGTGGTGGTGCGACCGTGGGTCAACTAGCCGCGCTCTAAATTAGCGGCGCTGTTAAACGCTGCCGCGCACGTCCCGGTCT CTGGAGGTGGCAACGCCAAT GACCTCCACCGTTGCGGTTA 5001 CAGCAACGACTGTTTGCCCGCCAGTTGTTGTGCCACGCGGTTGGGAATGTAATTCAGCTCCGCCATCGCCGCTTCCACTT GTCGTTGCTGACAAACGGGCGGTCAACAACACGGTGCGCCAACCCTTACATTAAGTCGAGGCGGTAGCGGCGAAGGTGAA TTTCCCGCGTTTTCGCAGAA AAAGGGCGCAAAAGCGTCTT 5101 ACGTGGCTGGCCTGGTTCACCACGCGGGAAACGGTCTGATAAGAGACACCGGCATACTCTGCGACATCGTATAACGTTAC TGCACCGACCGGACCAAGTGGTGCGCCCTTTGCCAGACTATTCTCTGTGGCCGTATGAGACGCTGTAGCATATTGCAATG TGGTTTCACATTCACCACCC ACCAAAGTGTAAGTGGTGGG 5201 TGAATTGACTCTCTTCCGGGCGCTATCATGCCATACCGCGAAAGGTTTTGCGCCATTCGATGGTGTCCGGGATCTCGACG ACTTAACTGAGAGAAGGCCCGCGATAGTACGGTATGGCGCTTTCCAAAACGCGGTAAGCTACCACAGGCCCTAGAGCTGC CTCTCCCTTATGCGACTCCT GAGAGGGAATACGCTGAGGA 5301 GCATTAGGAAGCAGCCCAGTAGTAGGTTGAGGCCGTTGAGCACCGCCGCCGCAAGGAATGGTGCATGCAAGGAGATGGCG CGTAATCCTTCGTCGGGTCATCATCCAACTCCGGCAACTCGTGGCGGCGGCGTTCCTTACCACGTACGTTCCTCTACCGC CCCAACAGTCCCCCGGCCAC GGGTTGTCAGGGGGCCGGTG 5401 GGGGCCTGCCACCATACCCACGCCGAAACAAGCGCTCATGAGCCCGAAGTGGCGAGCCCGATCTTCCCCATCGGTGATGT CCCCGGACGGTGGTATGGGTGCGGCTTTGTTCGCGAGTACTCGGGCTTCACCGCTCGGGCTAGAAGGGGTAGCCACTACA CGGCGATATAGGCGCCAGCA GCCGCTATATCCGCGGTCGT 5501 ACCGCACCTGTGGCGCCGGTGATGCCGGCCACGATGCGTCCGGCGTAGAGGATCGAGATCTCGATCCCGCGAAATTAATA TGGCGTGGACACCGCGGCCACTACGGCCGGTGCTACGCAGGCCGCATCTCCTAGCTCTAGAGCTAGGGCGCTTTAATTAT CGACTCACTATAGGGGAATT GCTGAGTGATATCCCCTTAA 5601 GTGAGCGGATAACAATTCCCCTCTAGAAATAATTTTGTTTAACTTTAAGAAGGAGATATACCATGGGCAGCAGCCATCAT 
CACTCGCCTATTGTTAAGGGGAGATCTTTATTAAAACAAATTGAAATTCTTCCTCTATATGGTACCCGTCGTCGGTAGTA CATCATCATCACAGCAGCGG GTAGTAGTAGTGTCGTCGCC 5701 CCTGGTGCCGCGCGGCAGCCATAGGTCGACTCTAGAGGATCCAAGCCAAAGCACTAACGTTTTAGGTGAAGCTAAAAAAT GGACCACGGCGCGCCGTCGGTATCCAGCTGAGATCTCCTAGGTTCGGTTTCGTGATTGCAAAATCCACTTCGATTTTTTA TAAACGAATCTCAAGCACCG ATTTGCTTAGAGTTCGTGGC 5801 AAAGCTGACAACAATTTCAACAAAGAACAACAAAATGCTTTCTATGAAATCTTGAACATGCCTAACTTGAACGAAGAACA TTTCGACTGTTGTTAAAGTTGTTTCTTGTTGTTTTACGAAAGATACTTTAGAACTTGTACGGATTGAACTTGCTTCTTGT ACGCAATGGTTTCATCCAAA TGCGTTACCAAAGTAGGTTT 5901 GCTTAAAAGATGACCCAAGTCAAAGTGCTAACCTTTTAGCAGAAGCTAAAAAGTTAAATGAATCTCAAGCACCGAAAGCT CGAATTTTCTACTGGGTTCAGTTTCACGATTGGAAAATCGTCTTCGATTTTTCAATTTACTTAGAGTTCGTGGCTTTCGA GATAACAAATTCAACAAAGA CTATTGTTTAAGTTGTTTCT 6001 ACAACAAAATGCTTTCTATGAAATCTTACATTTACCTAACTTAAATGAAGAACAACGCAATGGTTTCATCCAAAGCTTAA TGTTGTTTTACGAAAGATACTTTAGAATGTAAATGGATTGAATTTACTTCTTGTTGCGTTACCAAAGTAGGTTTCGAATT AAGATGACCCAAGCCAAAGC TTCTACTGGGTTCGGTTTCG 6101 GCTAACCTTTTAGCAGAAGCTAAAAAGCTAAATGATGCACAAGCACCAAAAGCTGACAACAAATTCAACAAAGAACAACA CGATTGGAAAATCGTCTTCGATTTTTCGATTTACTACGTGTTCGTGGTTTTCGACTGTTGTTTAAGTTGTTTCTTGTTGT AAATGCTTTCTATGAAATTT TTTACGAAAGATACTTTAAA 6201 TACATTTACCTAACTTAACTGAAGAACAACGTAACGGCTTCATCCAAAGCCTTAAAGACGATCCCCGGTCGACTCTAGCG ATGTAAATGGATTGAATTGACTTCTTGTTGCATTGCCGAAGTAGGTTTCGGAATTTCTGCTAGGGGCCAGCTGAGATCGC GCAGCTTCCGGTGCTAGCAC CGTCGAAGGCCACGATCGTG 6301 TGACACTTACAAATTAATCCTTAATGGTAAAACATTGAAAGGCGAAACAACTACTGAAGCTGTTGATGCTGCTACTGCAG ACTGTGAATGTTTAATTAGGAATTACCATTTTGTAACTTTCCGCTTTGTTGATGACTTCGACAACTACGACGATGACGTC AAAAAGTCTTCAAACAATAC TTTTTCAGAAGTTTGTTATG 6401 GCTAACGACAACGGTGTTGACGGTGAATGGACTTACGACGATGCGACTAAGACCTTTACAGTTACTGAAAAACCAGAAGT CGATTGCTGTTGCCACAACTGCCACTTACCTGAATGCTGCTACGCTGATTCTGGAAATGTCAATGACTTTTTGGTCTTCA GATCGATGCGTCTGAATTAA CTAGCTACGCAGACTTAATT 6501 CACCAGCCGTGACAACTTACAAACTTGTTATTAATGGTAAAACATTGAAAGGCGAAACAACTACTAAAGCAGTAGACGCA GTGGTCGGCACTGTTGAATGTTTGAACAATAATTACCATTTTGTAACTTTCCGCTTTGTTGATGATTTCGTCATCTGCGT GAAACTGCAGAAAAAGCCTT 
CTTTGACGTCTTTTTCGGAA 6601 CAAACAATACGCTAACGACAACGGTGTTGATGGTGTTTGGACTTATGATGATGCGACTAAGACCTTTACGGTAACTGAAA GTTTGTTATGCGATTGCTGTTGCCACAACTACCACAAACCTGAATACTACTACGCTGATTCTGGAAATGCCATTGACTTT TGGTTACAGAGGTACCAGAT ACCAATGTCTCCATGGTCTA 6701 CTTAGCAACTTTGTTGCAACTGAAACCGATGCTAACCGC GAATCGTTGAAACAACGTTGACTTTGGCTACGATTGGCG pS14L-spAG-MLuc16 1 AGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGCACGACAGGTTTCCCGACTGG TCGCGGGTTATGCGTTTGGCGGAGAGGGGCGCGCAACCGGCTAAGTAATTACGTCGACCGTGCTGTCCAAAGGGCTGACC AAAGCGGGCAGTGAGCGCAA TTTCGCCCGTCACTCGCGTT 101 CGCAATTAATGTGAGTTAGCTCACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGAA GCGTTAATTACACTCAATCGAGTGAGTAATCCGTGGGGTCCGAAATGTGAAATACGAAGGCCGAGCATACAACACACCTT TTGTGAGCGGATAACAATTT AACACTCGCCTATTGTTAAA 201 CACACAGGAAACAGCTATGACCATGATTACGCCAAGCTTTAGGGATAACAGGGTAATCGCCATGCATTAGTTATTAATAG GTGTGTCCTTTGTCGATACTGGTACTAATGCGGTTCGAAATCCCTATTGTCCCATTAGCGGTACGTAATCAATAATTATC TAATCAATTACGGGGTCATT ATTAGTTAATGCCCCAGTAA 301 AGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCC TCAAGTATCGGGTATATACCTCAAGGCGCAATGTATTGAATGCCATTTACCGGGCGGACCGACTGGCGGGTTGCTGGGGG GCCCATTGACGTCAATAATG CGGGTAACTGCAGTTATTAC 401 ACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTT TGCATACAAGGGTATCATTGCGGTTATCCCTGAAAGGTAACTGCAGTTACCCACCTCATAAATGCCATTTGACGGGTGAA GGCAGTACATCAAGTGTATC CCGTCATGTAGTTCACATAG 501 ATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGG TATACGGTTCATGCGGGGGATAACTGCAGTTACTGCCATTTACCGGGCGGACCGTAATACGGGTCATGTACTGGAATACC GACTTTCCTACTTGGCAGTA CTGAAAGGATGAACCGTCAT 601 CATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACT GTAGATGCATAATCAGTAGCGATAATGGTACCACTACGCCAAAACCGTCATGTAGTTACCCGCACCTATCGCCAAACTGA CACGGGGATTTCCAAGTCTC GTGCCCCTAAAGGTTCAGAG 701 CACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCC GTGGGGTAACTGCAGTTACCCTCAAACAAAACCGTGGTTTTAGTTGCCCTGAAAGGTTTTACAGCATTGTTGAGGCGGGG ATTGACGCAAATGGGCGGTA TAACTGCGTTTACCCGCCAT 801 
GGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTGGTTTAGTGAACCGTCAGATCCGCTAGACGTCTCATTTAGGCATG CCGCACATGCCACCCTCCAGATATATTCGTCTCGACCAAATCACTTGGCAGTCTAGGCGATCTGCAGAGTAAATCCGTAC GAAACCCCAGCGCAGCTTCT CTTTGGGGTCGCGTCGAAGA 901 CTTCCTCCTGCTACTCTGGATCCCAGACACCATTGAAGAAATAGTGATGACGCAGTCTCCAGCCACCCTGTCTGTGTCTC GAAGGAGGACGATGAGACCTAGGGTCTGTGGTAACTTCTTTATCACTACTGCGTCAGAGGTCGGTGGGACAGACACAGAG CAGGGGAAAGAGTCACCCTC GTCCCCTTTCTCAGTGGGAG 1001 TCCAGCAGCCATCATCATCATCATCACAGCAGCGGCCTGGTGCCGCGCGGCAGCCATAGGTCGACTCTAGAGGATCCAAG AGGTCGTCGGTAGTAGTAGTAGTAGTGTCGTCGCCGGACCACGGCGCGCCGTCGGTATCCAGCTGAGATCTCCTAGGTTC CCAAAGCACTAACGTTTTAG GGTTTCGTGATTGCAAAATC 1101 GTGAAGCTAAAAAATTAAACGAATCTCAAGCACCGAAAGCTGACAACAATTTCAACAAAGAACAACAAAATGCTTTCTAT CACTTCGATTTTTTAATTTGCTTAGAGTTCGTGGCTTTCGACTGTTGTTAAAGTTGTTTCTTGTTGTTTTACGAAAGATA GAAATCTTGAACATGCCTAA CTTTAGAACTTGTACGGATT 1201 CTTGAACGAAGAACAACGCAATGGTTTCATCCAAAGCTTAAAAGATGACCCAAGTCAAAGTGCTAACCTTTTAGCAGAAG GAACTTGCTTCTTGTTGCGTTACCAAAGTAGGTTTCGAATTTTCTACTGGGTTCAGTTTCACGATTGGAAAATCGTCTTC CTAAAAAGTTAAATGAATCT GATTTTTCAATTTACTTAGA 1301 CAAGCACCGAAAGCTGATAACAAATTCAACAAAGAACAACAAAATGCTTTCTATGAAATCTTACATTTACCTAACTTAAA GTTCGTGGCTTTCGACTATTGTTTAAGTTGTTTCTTGTTGTTTTACGAAAGATACTTTAGAATGTAAATGGATTGAATTT TGAAGAACAACGCAATGGTT ACTTCTTGTTGCGTTACCAA 1401 TCATCCAAAGCTTAAAAGATGACCCAAGCCAAAGCGCTAACCTTTTAGCAGAAGCTAAAAAGCTAAATGATGCACAAGCA AGTAGGTTTCGAATTTTCTACTGGGTTCGGTTTCGCGATTGGAAAATCGTCTTCGATTTTTCGATTTACTACGTGTTCGT CCAAAAGCTGACAACAAATT GGTTTTCGACTGTTGTTTAA 1501 CAACAAAGAACAACAAAATGCTTTCTATGAAATTTTACATTTACCTAACTTAACTGAAGAACAACGTAACGGCTTCATCC GTTGTTTCTTGTTGTTTTACGAAAGATACTTTAAAATGTAAATGGATTGAATTGACTTCTTGTTGCATTGCCGAAGTAGG AAAGCCTTAAAGACGATCCC TTTCGGAATTTCTGCTAGGG 1601 CGGTCGACTCTAGCGGCAGCTTCCGGTGCTAGCACTGACACTTACAAATTAATCCTTAATGGTAAAACATTGAAAGGCGA GCCAGCTGAGATCGCCGTCGAAGGCCACGATCGTGACTGTGAATGTTTAATTAGGAATTACCATTTTGTAACTTTCCGCT AACAACTACTGAAGCTGTTG TTGTTGATGACTTCGACAAC 1701 ATGCTGCTACTGCAGAAAAAGTCTTCAAACAATACGCTAACGACAACGGTGTTGACGGTGAATGGACTTACGACGATGCG 
TACGACGATGACGTCTTTTTCAGAAGTTTGTTATGCGATTGCTGTTGCCACAACTGCCACTTACCTGAATGCTGCTACGC ACTAAGACCTTTACAGTTAC TGATTCTGGAAATGTCAATG 1801 TGAAAAACCAGAAGTGATCGATGCGTCTGAATTAACACCAGCCGTGACAACTTACAAACTTGTTATTAATGGTAAAACAT ACTTTTTGGTCTTCACTAGCTACGCAGACTTAATTGTGGTCGGCACTGTTGAATGTTTGAACAATAATTACCATTTTGTA TGAAAGGCGAAACAACTACT ACTTTCCGCTTTGTTGATGA 1901 AAAGCAGTAGACGCAGAAACTGCAGAAAAAGCCTTCAAACAATACGCTAACGACAACGGTGTTGATGGTGTTTGGACTTA TTTCGTCATCTGCGTCTTTGACGTCTTTTTCGGAAGTTTGTTATGCGATTGCTGTTGCCACAACTACCACAAACCTGAAT TGATGATGCGACTAAGACCT ACTACTACGCTGATTCTGGA 2001 TTACGGTAACTGAAATGGTTACAGAGGTACCGCGGGCCCGGGATCCACCGGCTAGCGGGAATTCCAAATCAACTGAGTTC AATGCCATTGACTTTACCAATGTCTCCATGGCGCCCGGGCCCTAGGTGGCCGATCGCCCTTAAGGTTTAGTTGACTCAAG GATCCTAACATTGACATTGT CTAGGATTGTAACTGTAACA 2101 TGGTTTAGAAGGAAAATTTGGTATTACAAACCTAGAGACGGATTTATTCACAATCTGGGAGACAATGGAGGTCATGATCA ACCAAATCTTCCTTTTAAACCATAATGTTTGGATCTCTGCCTAAATAAGTGTTAGACCCTCTGTTACCTCCAGTACTAGT AAGCAGATATTGCAGATACT TTCGTCTATAACGTCTATGA 2201 GATAGAGCCAGCAACTTTGTTGCAACTGAAACCGATGCTAACCGCGGAAAAATGCCTGGCAAAAAACTGCCACTGGCAGT CTATCTCGGTCGTTGAAACAACGTTGACTTTGGCTACGATTGGCGCCTTTTTACGGACCGTTTTTTGACGGTGACCGTCA TATCATGGAAATGGAAGCCA ATAGTACCTTTACCTTCGGT 2301 ATGCTTTCAAAGCTGGCTGCACCAGGGGATGCCTTATCTGTCTTTCAAAAATTAAGTGTACAGCCAAAATGAAGGTATAC TACGAAAGTTTCGACCGACGTGGTCCCCTACGGAATAGACAGAAAGTTTTTAATTCACATGTCGGTTTTACTTCCATATG ATTCCAGGAAGGTGTCACGA TAAGGTCCTTCCACAGTGCT 2401 TTATGGTGGTGACAAGAAAACTGGACAGGCAGGAATTGTTGGTGCAATTGTTGACATTCCCGAAATCTCTGGATTTAAGG AATACCACCACTGTTCTTTTGACCTGTCCGTCCTTAACAACCACGTTAACAACTGTAAGGGCTTTAGAGACCTAAATTCC AGATGGCACCCATGGAACAG TCTACCGTGGGTACCTTGTC 2501 TTCATTGCTCAAGTTGATCGCTGCGCTTCCTGCACTACTGGATGTCTCAAAGGTCTTGCCAATGTTAAGTGCTCTGAACT AAGTAACGAGTTCAACTAGCGACGCGAAGGACGTGATGACCTACAGAGTTTCCAGAACGGTTACAATTCACGAGACTTGA CCTGAAGAAATGGCTGCCTG GGACTTCTTTACCGACGGAC 2601 ACAGGTGTGCAAGTTTTGCTGACAAGATTCAAAAAGAAGTTCACAATATCAAAGGCATGGCCGGCGATCGATGAGCGGCC TGTCCACACGTTCAAAACGACTGTTCTAAGTTTTTCTTCAAGTGTTATAGTTTCCGTACCGGCCGCTAGCTACTCGCCGG GCAATTTAATTCCGGTTATT 
CGTTAAATTAAGGCCAATAA 2701 TTCCACCATATTGCCGTCTTTTGGCAATGTGAGGGCCCGGAAACCTGGCCCTGTCTTCTTGACGAGCATTCCTAGGGGTC AAGGTGGTATAACGGCAGAAAACCGTTACACTCCCGGGCCTTTGGACCGGGACAGAAGAACTGCTCGTAAGGATCCCCAG TTTCCCCTCTCGCCAAAGGA AAAGGGGAGAGCGGTTTCCT 2801 ATGCAAGGTCTGTTGAATGTCGTGAAGGAAGCAGTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCGACCCT TACGTTCCAGACAACTTACAGCACTTCCTTCGTCAAGGAGACCTTCGAAGAACTTCTGTTTGTTGCAGACATCGCTGGGA TTGCAGGCAGCGGAACCCCC AACGTCCGTCGCCTTGGGGG 2901 CACCTGGCGACAGGTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCACAACCCCAGTGCCAC GTGGACCGCTGTCCACGGAGACGCCGGTTTTCGGTGCACATATTCTATGTGGACGTTTCCGCCGTGTTGGGGTCACGGTG GTTGTGAGTTGGATAGTTGT CAACACTCAACCTATCAACA 3001 GGAAAGAGTCAAATGGCTCACCTCAAGCGTATTCAACAAGGGGCTGAAGGATGCCCAGAAGGTACCCCATTGTATGGGAT CCTTTCTCAGTTTACCGAGTGGAGTTCGCATAAGTTGTTCCCCGACTTCCTACGGGTCTTCCATGGGGTAACATACCCTA CTGATCTGGGGCCTCGGTGC GACTAGACCCCGGAGCCACG 3101 ACATGCTTTACATGTGTTTAGTCGAGGTTAAAAAACGTCTAGGCCCCCCGAACCACGGGGACGTGGTTTTCCTTTGAAAA TGTACGAAATGTACACAAATCAGCTCCAATTTTTTGCAGATCCGGGGGGCTTGGTGCCCCTGCACCAAAAGGAAACTTTT ACACGATGATAATATGGCCA TGTGCTACTATTATACCGGT 3201 CCACCCATACCTAGGCTTTTGCAAAGATCGATCAGATCCCGGGGGGCAATGAGATATGAAAAAGCCTGAACTCACCGCGA GGTGGGTATGGATCCGAAAACGTTTCTAGCTAGTCTAGGGCCCCCCGTTACTCTATACTTTTTCGGACTTGAGTGGCGCT CGTCTGTCGAGAAGTTTCTG GCAGACAGCTCTTCAAAGAC 3301 ATCGAAAAGTTCGACAGCGTCTCCGACCTGATGCAGCTCTCGGAGGGCGAAGAATCTCGTGCTTTCAGCTTCGATGTAGG TAGCTTTTCAAGCTGTCGCAGAGGCTGGACTACGTCGAGAGCCTCCCGCTTCTTAGAGCACGAAAGTCGAAGCTACATCC AGGGCGTGGATATGTCCTGC TCCCGCACCTATACAGGACG 3401 GGGTAAATAGCTGCGCCGATGGTTTCTACAAAGATCGTTATGTTTATCGGCACTTTGCATCGGCCGCGCTCCCGATTCCG CCCATTTATCGACGCGGCTACCAAAGATGTTTCTAGCAATACAAATAGCCGTGAAACGTAGCCGGCGCGAGGGCTAAGGC GAAGTGCTTGACATTGGGGA CTTCACGAACTGTAACCCCT 3501 ATTCAGCGAGAGCCTGACCTATTGCATCTCCCGCCGTGCACAGGGTGTCACGTTGCAAGACCTGCCTGAAACCGAACTGC TAAGTCGCTCTCGGACTGGATAACGTAGAGGGCGGCACGTGTCCCACAGTGCAACGTTCTGGACGGACTTTGGCTTGACG CCGCTGTTCTGCAGCCGGTC GGCGACAAGACGTCGGCCAG 3601 GCGGAGGCCATGGATGCGATCGCTGCGGCCGATCTTAGCCAGACGAGCGGGTTCGGCCCATTCGGACCGCAAGGAATCGG 
CGCCTCCGGTACCTACGCTAGCGACGCCGGCTAGAATCGGTCTGCTCGCCCAAGCCGGGTAAGCCTGGCGTTCCTTAGCC TCAATACACTACATGGCGTG AGTTATGTGATGTACCGCAC 3701 ATTTCATATGCGCGATTGCTGATCCCCATGTGTATCACTGGCAAACTGTGATGGACGACACCGTCAGTGCGTCCGTCGCG TAAAGTATACGCGCTAACGACTAGGGGTACACATAGTGACCGTTTGACACTACCTGCTGTGGCAGTCACGCAGGCAGCGC CAGGCTCTCGATGAGCTGAT GTCCGAGAGCTACTCGACTA 3801 GCTTTGGGCCGAGGACTGCCCCGAAGTCCGGCACCTCGTGCACGCGGATTTCGGCTCCAACAATGTCCTGACGGACAATG CGAAACCCGGCTCCTGACGGGGCTTCAGGCCGTGGAGCACGTGCGCCTAAAGCCGAGGTTGTTACAGGACTGCCTGTTAC GCCGCATAACAGCGGTCATT CGGCGTATTGTCGCCAGTAA 3901 GACTGGAGCGAGGCGATGTTCGGGGATTCCCAATACGAGGTCGCCAACATCTTCTTCTGGAGGCCGTGGTTGGCTTGTAT CTGACCTCGCTCCGCTACAAGCCCCTAAGGGTTATGCTCCAGCGGTTGTAGAAGAAGACCTCCGGCACCAACCGAACATA GGAGCAGCAGACGCGCTACT CCTCGTCGTCTGCGCGATGA 4001 TCGAGCGGAGGCATCCGGAGCTTGCAGGATCGCCGCGGCTCCGGGCGTATATGCTCCGCATTGGTCTTGACCAACTCTAT AGCTCGCCTCCGTAGGCCTCGAACGTCCTAGCGGCGCCGAGGCCCGCATATACGAGGCGTAACCAGAACTGGTTGAGATA CAGAGCTTGGTTGACGGCAA GTCTCGAACCAACTGCCGTT 4101 TTTCGATGATGCAGCTTGGGCGCAGGGTCGATGCGACGCAATCGTCCGATCCGGAGCCGGGACTGTCGGGCGTACACAAA AAAGCTACTACGTCGAACCCGCGTCCCAGCTACGCTGCGTTAGCAGGCTAGGCCTCGGCCCTGACAGCCCGCATGTGTTT TCGCCCGCAGAAGCGCGGCC AGCGGGCGTCTTCGCGCCGG 4201 GTCTGGACCGATGGCTGTGTAGAAGTACTCGCCGATAGTGGAAACCGACGCCCCAGCACTCGTCCGGATCGGGAGATGGG CAGACCTGGCTACCGACACATCTTCATGAGCGGCTATCACCTTTGGCTGCGGGGTCGTGAGCAGGCCTAGCCCTCTACCC GGAGGCTAACTGAAACACGG CCTCCGATTGACTTTGTGCC 4301 AAGGAGACAATACCGGAAGGAACCTCGACGTTAACTTGTTTATTGCAGCTTATAATGGTTACAAATAAAGCAATAGCATC TTCCTCTGTTATGGCCTTCCTTGGAGCTGCAATTGAACAAATAACGTCGAATATTACCAATGTTTATTTCGTTATCGTAG ACAAATTTCACAAATAAAGC TGTTTAAAGTGTTTATTTCG 4401 ATTTATTACCCTGTTATCCCTAGAATTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGTTACCCAA TAAATAATGGGACAATAGGGATCTTAAGTGACCGGCAGCAAAATGTTGCAGCACTGACCCTTTTGGGACCGCAATGGGTT CTTAATCGCCTTGCAGCACA GAATTAGCGGAACGTCGTGT 4501 TCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCG AGGGGGAAAGCGGTCGACCGCATTATCGCTTCTCCGGGCGTGGCTAGCGGGAAGGGTTGTCAACGCGTCGGACTTACCGC AATGGCGCCTGATGCGGTAT 
TTACCGCGGACTACGCCATA 4601 TTTCTCCTTACGCATCTGTGCGGTATTTCACACCGCATACGTCAAAGCAACCATAGTACGCGCCCTGTAGCGGCGCATTA AAAGAGGAATGCGTAGACACGCCATAAAGTGTGGCGTATGCAGTTTCGTTGGTATCATGCGCGGGACATCGCCGCGTAAT AGCGCGGCGGGTGTGGTGGT TCGCGCCGCCCACACCACCA 4701 TACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGT ATGCGCGTCGCACTGGCGATGTGAACGGTCGCGGGATCGCGGGCGAGGAAAGCGAAAGAAGGGAAGGAAAGAGCGGTGCA TCGCCGGCTTTCCCCGTCAA AGCGGCCGAAAGGGGCAGTT 4801 GCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAACTTGATTTGGGTGA CGAGATTTAGCCCCCGAGGGAAATCCCAAGGCTAAATCACGAAATGCCGTGGAGCTGGGGTTTTTTGAACTAAACCCACT TGGTTCACGTAGTGGGCCAT ACCAAGTGCATCACCCGGTA 4901 CGCCCTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACA GCGGGACTATCTGCCAAAAAGCGGGAAACTGCAACCTCAGGTGCAAGAAATTATCACCTGAGAACAAGGTTTGACCTTGT ACACTCAACCCTATCTCGGG TGTGAGTTGGGATAGAGCCC 5001 CTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAAAATTTAACG GATAAGAAAACTAAATATTCCCTAAAACGGCTAAAGCCGGATAACCAATTTTTTACTCGACTAAATTGTTTTTAAATTGC CGAATTTTAACAAAATATTA GCTTAAAATTGTTTTATAAT 5101 ACGTTTACAATTTTATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCCGACACCCGCCAAC TGCAAATGTTAAAATACCACGTGAGAGTCATGTTAGACGAGACTACGGCGTATCAATTCGGTCGGGGCTGTGGGCGGTTG ACCCGCTGACGCGCCCTGAC TGGGCGACTGCGCGGGACTG 5201 GGGCTTGTCTGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGTCTAGACGAAAGGGCCTCGTGATACGCCTATTTTT CCCGAACAGACGAGGGCCGTAGGCGAATGTCTGTTCGACACTGGCAGATCTGCTTTCCCGGAGCACTATGCGGATAAAAA ATAGGTTAATGTCATGATAA TATCCAATTACAGTACTATT 5301 TAATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACAT ATTACCAAAGAATCTGCAGTCCACCGTGAAAAGCCCCTTTACACGCGCCTTGGGGATAAACAAATAAAAAGATTTATGTA TCAAATATGTATCCGCTCAT AGTTTATACATAGGCGAGTA 5401 GAGACAATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTA CTCTGTTATTGGGACTATTTACGAAGTTATTATAACTTTTTCCTTCTCATACTCATAAGTTGTAAAGGCACAGCGGGAAT TTCCCTTTTTTGCGGCATTT AAGGGAAAAAACGCCGTAAA 5501 TGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTA 
ACGGAAGGACAAAAACGAGTGGGTCTTTGCGACCACTTTCATTTTCTACGACTTCTAGTCAACCCACGTGCTCACCCAAT CATCGAACTGGATCTCAACA GTAGCTTGACCTAGAGTTGT 5601 GCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCG CGCCATTCTAGGAACTCTCAAAAGCGGGGCTTCTTGCAAAAGGTTACTACTCGTGAAAATTTCAAGACGATACACCGCGC GTATTATCCCGTATTGACGC CATAATAGGGCATAACTGCG 5701 CGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATC GCCCGTTCTCGTTGAGCCAGCGGCGTATGTGATAAGAGTCTTACTGAACCAACTCATGAGTGGTCAGTGTCTTTTCGTAG TTACGGATGGCATGACAGTA AATGCCTACCGTACTGTCAT 5801 AGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAA TCTCTTAATACGTCACGACGGTATTGGTACTCACTATTGTGACGCCGGTTGAATGAAGACTGTTGCTAGCCTCCTGGCTT GGAGCTAACCGCTTTTTTGC CCTCGATTGGCGAAAAAACG 5901 ACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAGCGTGAC TGTTGTACCCCCTAGTACATTGAGCGGAACTAGCAACCCTTGGCCTCGACTTACTTCGGTATGGTTTGCTGCTCGCACTG ACCACGATGCCTGTAGCAAT TGGTGCTACGGACATCGTTA 6001 GGCAACAACGTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGG CCGTTGTTGCAACGCGTTTGATAATTGACCGCTTGATGAATGAGATCGAAGGGCCGTTGTTAATTATCTGACCTACCTCC CGGATAAAGTTGCAGGACCA GCCTATTTCAACGTCCTGGT 6101 CTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCAT GAAGACGCGAGCCGGGAAGGCCGACCGACCAAATAACGACTATTTAGACCTCGGCCACTCGCACCCAGAGCGCCATAGTA TGCAGCACTGGGGCCAGATG ACGTCGTGACCCCGGTCTAC 6201 GTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAG CATTCGGGAGGGCATAGCATCAATAGATGTGCTGCCCCTCAGTCCGTTGATACCTACTTGCTTTATCTGTCTAGCGACTC ATAGGTGCCTCACTGATTAA TATCCACGGAGTGACTAATT 6301 GCATTGGTAACTGTCAGACCAAGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCT CGTAACCATTGACAGTCTGGTTCAAATGAGTATATATGAAATCTAACTAAATTTTGAAGTAAAAATTAAATTTTCCTAGA AGGTGAAGATCCTTTTTGAT TCCACTTCTAGGAAAAACTA 6401 AATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTC TTAGAGTACTGGTTTTAGGGAATTGCACTCAAAAGCAAGGTGACTCGCAGTCTGGGGCATCTTTTCTAGTTTCCTAGAAG TTGAGATCCTTTTTTTCTGC 
AACTCTAGGAAAAAAAGACG 6501 GCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTT CGCATTAGACGACGAACGTTTGTTTTTTTGGTGGCGATGGTCGCCACCAAACAAACGGCCTAGTTCTCGATGGTTGAGAA TTTCCGAAGGTAACTGGCTT AAAGGCTTCCATTGACCGAA 6601 CAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGC GTCGTCTCGCGTCTATGGTTTATGACAGGAAGATCACATCGGCATCAATCCGGTGGTGAAGTTCTTGAGACATCGTGGCG CTACATACCTCGCTCTGCTA GATGTATGGAGCGAGACGAT 6701 ATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAA TAGGACAATGGTCACCGACGACGGTCACCGCTATTCAGCACAGAATGGCCCAACCTGAGTTCTGCTATCAATGGCCTATT GGCGCAGCGGTCGGGCTGAA CCGCGTCGCCAGCCCGACTT 6801 CGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCTATGAGAA GCCCCCCAAGCACGTGTGTCGGGTCGAACCTCGCTTGCTGGATGTGGCTTGACTCTATGGATGTCGCACTCGATACTCTT AGCGCCACGCTTCCCGAAGG TCGCGGTGCGAAGGGCTTCC 6901 GAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCT CTCTTTCCGCCTGTCCATAGGCCATTCGCCGTCCCAGCCTTGTCCTCTCGCGTGCTCCCTCGAAGGTCCCCCTTTGCGGA GGTATCTTTATAGTCCTGTC CCATAGAAATATCAGGACAG 7001 GGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAA CCCAAAGCGGTGGAGACTGAACTCGCAGCTAAAAACACTACGAGCAGTCCCCCCGCCTCGGATACCTTTTTGCGGTCGTT CGCGGCCTTTTTACGGTTCC GCGCCGGAAAAATGCCAAGG 7101 TGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCTTT ACCGGAAAACGACCGGAAAACGAGTGTACAAGAAAGGACGCAATAGGGGACTAAGACACCTATTGGCATAATGGCGGAAA GAGTGAGCTGATACCGCTCG CTCACTCGACTATGGCGAGC 7201 CCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAG GGCGTCGGCTTGCTGGCTCGCGTCGCTCAGTCACTCGCTCCTTCGCCTTC pS14L-spAG-N-MLuc15 1 AGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGCACGACAGGTTTCCCGACTGG TCGCGGGTTATGCGTTTGGCGGAGAGGGGCGCGCAACCGGCTAAGTAATTACGTCGACCGTGCTGTCCAAAGGGCTGACC AAAGCGGGCAGTGAGCGCAA TTTCGCCCGTCACTCGCGTT 101 CGCAATTAATGTGAGTTAGCTCACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGAA GCGTTAATTACACTCAATCGAGTGAGTAATCCGTGGGGTCCGAAATGTGAAATACGAAGGCCGAGCATACAACACACCTT 
TTGTGAGCGGATAACAATTT AACACTCGCCTATTGTTAAA 201 CACACAGGAAACAGCTATGACCATGATTACGCCAAGCTTTAGGGATAACAGGGTAATCGCCATGCATTAGTTATTAATAG GTGTGTCCTTTGTCGATACTGGTACTAATGCGGTTCGAAATCCCTATTGTCCCATTAGCGGTACGTAATCAATAATTATC TAATCAATTACGGGGTCATT ATTAGTTAATGCCCCAGTAA 301 AGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCC TCAAGTATCGGGTATATACCTCAAGGCGCAATGTATTGAATGCCATTTACCGGGCGGACCGACTGGCGGGTTGCTGGGGG GCCCATTGACGTCAATAATG CGGGTAACTGCAGTTATTAC 401 ACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTT TGCATACAAGGGTATCATTGCGGTTATCCCTGAAAGGTAACTGCAGTTACCCACCTCATAAATGCCATTTGACGGGTGAA GGCAGTACATCAAGTGTATC CCGTCATGTAGTTCACATAG 501 ATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGG TATACGGTTCATGCGGGGGATAACTGCAGTTACTGCCATTTACCGGGCGGACCGTAATACGGGTCATGTACTGGAATACC GACTTTCCTACTTGGCAGTA CTGAAAGGATGAACCGTCAT 601 CATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACT GTAGATGCATAATCAGTAGCGATAATGGTACCACTACGCCAAAACCGTCATGTAGTTACCCGCACCTATCGCCAAACTGA CACGGGGATTTCCAAGTCTC GTGCCCCTAAAGGTTCAGAG 701 CACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCC GTGGGGTAACTGCAGTTACCCTCAAACAAAACCGTGGTTTTAGTTGCCCTGAAAGGTTTTACAGCATTGTTGAGGCGGGG ATTGACGCAAATGGGCGGTA TAACTGCGTTTACCCGCCAT 801 GGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTGGTTTAGTGAACCGTCAGATCCGCTAGACGTCTCATTTAGGCATG CCGCACATGCCACCCTCCAGATATATTCGTCTCGACCAAATCACTTGGCAGTCTAGGCGATCTGCAGAGTAAATCCGTAC GAAACCCCAGCGCAGCTTCT CTTTGGGGTCGCGTCGAAGA 901 CTTCCTCCTGCTACTCTGGATCCCAGACACCATTGAAGAAATAGTGATGACGCAGTCTCCAGCCACCCTGTCTGTGTCTC GAAGGAGGACGATGAGACCTAGGGTCTGTGGTAACTTCTTTATCACTACTGCGTCAGAGGTCGGTGGGACAGACACAGAG CAGGGGAAAGAGTCACCCTC GTCCCCTTTCTCAGTGGGAG 1001 TCCAGCAGCCATCATCATCATCATCACAGCAGCGGCCTGGTGCCGCGCGGCAGCCATAGGTCGACTCTAGAGGATCCAAG AGGTCGTCGGTAGTAGTAGTAGTAGTGTCGTCGCCGGACCACGGCGCGCCGTCGGTATCCAGCTGAGATCTCCTAGGTTC CCAAAGCACTAACGTTTTAG GGTTTCGTGATTGCAAAATC 1101 
GTGAAGCTAAAAAATTAAACGAATCTCAAGCACCGAAAGCTGACAACAATTTCAACAAAGAACAACAAAATGCTTTCTAT CACTTCGATTTTTTAATTTGCTTAGAGTTCGTGGCTTTCGACTGTTGTTAAAGTTGTTTCTTGTTGTTTTACGAAAGATA GAAATCTTGAACATGCCTAA CTTTAGAACTTGTACGGATT 1201 CTTGAACGAAGAACAACGCAATGGTTTCATCCAAAGCTTAAAAGATGACCCAAGTCAAAGTGCTAACCTTTTAGCAGAAG GAACTTGCTTCTTGTTGCGTTACCAAAGTAGGTTTCGAATTTTCTACTGGGTTCAGTTTCACGATTGGAAAATCGTCTTC CTAAAAAGTTAAATGAATCT GATTTTTCAATTTACTTAGA 1301 CAAGCACCGAAAGCTGATAACAAATTCAACAAAGAACAACAAAATGCTTTCTATGAAATCTTACATTTACCTAACTTAAA GTTCGTGGCTTTCGACTATTGTTTAAGTTGTTTCTTGTTGTTTTACGAAAGATACTTTAGAATGTAAATGGATTGAATTT TGAAGAACAACGCAATGGTT ACTTCTTGTTGCGTTACCAA 1401 TCATCCAAAGCTTAAAAGATGACCCAAGCCAAAGCGCTAACCTTTTAGCAGAAGCTAAAAAGCTAAATGATGCACAAGCA AGTAGGTTTCGAATTTTCTACTGGGTTCGGTTTCGCGATTGGAAAATCGTCTTCGATTTTTCGATTTACTACGTGTTCGT CCAAAAGCTGACAACAAATT GGTTTTCGACTGTTGTTTAA 1501 CAACAAAGAACAACAAAATGCTTTCTATGAAATTTTACATTTACCTAACTTAACTGAAGAACAACGTAACGGCTTCATCC GTTGTTTCTTGTTGTTTTACGAAAGATACTTTAAAATGTAAATGGATTGAATTGACTTCTTGTTGCATTGCCGAAGTAGG AAAGCCTTAAAGACGATCCC TTTCGGAATTTCTGCTAGGG 1601 CGGTCGACTCTAGCGGCAGCTTCCGGTGCTAGCACTGACACTTACAAATTAATCCTTAATGGTAAAACATTGAAAGGCGA GCCAGCTGAGATCGCCGTCGAAGGCCACGATCGTGACTGTGAATGTTTAATTAGGAATTACCATTTTGTAACTTTCCGCT AACAACTACTGAAGCTGTTG TTGTTGATGACTTCGACAAC 1701 ATGCTGCTACTGCAGAAAAAGTCTTCAAACAATACGCTAACGACAACGGTGTTGACGGTGAATGGACTTACGACGATGCG TACGACGATGACGTCTTTTTCAGAAGTTTGTTATGCGATTGCTGTTGCCACAACTGCCACTTACCTGAATGCTGCTACGC ACTAAGACCTTTACAGTTAC TGATTCTGGAAATGTCAATG 1801 TGAAAAACCAGAAGTGATCGATGCGTCTGAATTAACACCAGCCGTGACAACTTACAAACTTGTTATTAATGGTAAAACAT ACTTTTTGGTCTTCACTAGCTACGCAGACTTAATTGTGGTCGGCACTGTTGAATGTTTGAACAATAATTACCATTTTGTA TGAAAGGCGAAACAACTACT ACTTTCCGCTTTGTTGATGA 1901 AAAGCAGTAGACGCAGAAACTGCAGAAAAAGCCTTCAAACAATACGCTAACGACAACGGTGTTGATGGTGTTTGGACTTA TTTCGTCATCTGCGTCTTTGACGTCTTTTTCGGAAGTTTGTTATGCGATTGCTGTTGCCACAACTACCACAAACCTGAAT TGATGATGCGACTAAGACCT ACTACTACGCTGATTCTGGA 2001 TTACGGTAACTGAAATGGTTACAGAGGTACCAGATCTTAGCAACTTTGTTGCAACTGAAACCGATGCTAACCGCGGAAAA 
AATGCCATTGACTTTACCAATGTCTCCATGGTCTAGAATCGTTGAAACAACGTTGACTTTGGCTACGATTGGCGCCTTTT ATGCCTGGCAAAAAACTGCC TACGGACCGTTTTTTGACGG 2101 ACTGGCAGTTATCATGGAAATGGAAGCCAATGCTTTCAAAGCTGGCTGCACCAGGGGATGCCTTATCTGTCTTTCAAAAA TGACCGTCAATAGTACCTTTACCTTCGGTTACGAAAGTTTCGACCGACGTGGTCCCCTACGGAATAGACAGAAAGTTTTT TTAAGTGTACAGCCAAAATG AATTCACATGTCGGTTTTAC 2201 AAGGTATACATTCCAGGAAGGTGTCACGATTATGGTGGTGACAAGAAAACTGGACAGGCAGGAATTGTTGGTGCAATTGT TTCCATATGTAAGGTCCTTCCACAGTGCTAATACCACCACTGTTCTTTTGACCTGTCCGTCCTTAACAACCACGTTAACA TGACATTCCCGAAATCTCTG ACTGTAAGGGCTTTAGAGAC 2301 GATTTAAGGAGATGGCACCCATGGAACAGTTCATTGCTCAAGTTGATCGCTGCGCTTCCTGCACTACTGGATGTCTCAAA CTAAATTCCTCTACCGTGGGTACCTTGTCAAGTAACGAGTTCAACTAGCGACGCGAAGGACGTGATGACCTACAGAGTTT GGTCTTGCCAATGTTAAGTG CCAGAACGGTTACAATTCAC 2401 CTCTGAACTCCTGAAGAAATGGCTGCCTGACAGGTGTGCAAGTTTTGCTGACAAGATTCAAAAAGAAGTTCACAATATCA GAGACTTGAGGACTTCTTTACCGACGGACTGTCCACACGTTCAAAACGACTGTTCTAAGTTTTTCTTCAAGTGTTATAGT AAGGCATGGCCGGCGATCGA TTCCGTACCGGCCGCTAGCT 2501 TGAGCGGCCGCAATTTAATTCCGGTTATTTTCCACCATATTGCCGTCTTTTGGCAATGTGAGGGCCCGGAAACCTGGCCC ACTCGCCGGCGTTAAATTAAGGCCAATAAAAGGTGGTATAACGGCAGAAAACCGTTACACTCCCGGGCCTTTGGACCGGG TGTCTTCTTGACGAGCATTC ACAGAAGAACTGCTCGTAAG 2601 CTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAGGTCTGTTGAATGTCGTGAAGGAAGCAGTTCCTCTGGAAGCTTCT GATCCCCAGAAAGGGGAGAGCGGTTTCCTTACGTTCCAGACAACTTACAGCACTTCCTTCGTCAAGGAGACCTTCGAAGA TGAAGACAAACAACGTCTGT ACTTCTGTTTGTTGCAGACA 2701 AGCGACCCTTTGCAGGCAGCGGAACCCCCCACCTGGCGACAGGTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACAC TCGCTGGGAAACGTCCGTCGCCTTGGGGGGTGGACCGCTGTCCACGGAGACGCCGGTTTTCGGTGCACATATTCTATGTG CTGCAAAGGCGGCACAACCC GACGTTTCCGCCGTGTTGGG 2801 CAGTGCCACGTTGTGAGTTGGATAGTTGTGGAAAGAGTCAAATGGCTCACCTCAAGCGTATTCAACAAGGGGCTGAAGGA GTCACGGTGCAACACTCAACCTATCAACACCTTTCTCAGTTTACCGAGTGGAGTTCGCATAAGTTGTTCCCCGACTTCCT TGCCCAGAAGGTACCCCATT ACGGGTCTTCCATGGGGTAA 2901 GTATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGGTTAAAAAACGTCTAGGCCCCCCGA CATACCCTAGACTAGACCCCGGAGCCACGTGTACGAAATGTACACAAATCAGCTCCAATTTTTTGCAGATCCGGGGGGCT ACCACGGGGACGTGGTTTTC 
TGGTGCCCCTGCACCAAAAG 3001 CTTTGAAAAACACGATGATAATATGGCCACCACCCATACCTAGGCTTTTGCAAAGATCGATCAGATCCCGGGGGGCAATG GAAACTTTTTGTGCTACTATTATACCGGTGGTGGGTATGGATCCGAAAACGTTTCTAGCTAGTCTAGGGCCCCCCGTTAC AGATATGAAAAAGCCTGAAC TCTATACTTTTTCGGACTTG 3101 TCACCGCGACGTCTGTCGAGAAGTTTCTGATCGAAAAGTTCGACAGCGTCTCCGACCTGATGCAGCTCTCGGAGGGCGAA AGTGGCGCTGCAGACAGCTCTTCAAAGACTAGCTTTTCAAGCTGTCGCAGAGGCTGGACTACGTCGAGAGCCTCCCGCTT GAATCTCGTGCTTTCAGCTT CTTAGAGCACGAAAGTCGAA 3201 CGATGTAGGAGGGCGTGGATATGTCCTGCGGGTAAATAGCTGCGCCGATGGTTTCTACAAAGATCGTTATGTTTATCGGC GCTACATCCTCCCGCACCTATACAGGACGCCCATTTATCGACGCGGCTACCAAAGATGTTTCTAGCAATACAAATAGCCG ACTTTGCATCGGCCGCGCTC TGAAACGTAGCCGGCGCGAG 3301 CCGATTCCGGAAGTGCTTGACATTGGGGAATTCAGCGAGAGCCTGACCTATTGCATCTCCCGCCGTGCACAGGGTGTCAC GGCTAAGGCCTTCACGAACTGTAACCCCTTAAGTCGCTCTCGGACTGGATAACGTAGAGGGCGGCACGTGTCCCACAGTG GTTGCAAGACCTGCCTGAAA CAACGTTCTGGACGGACTTT 3401 CCGAACTGCCCGCTGTTCTGCAGCCGGTCGCGGAGGCCATGGATGCGATCGCTGCGGCCGATCTTAGCCAGACGAGCGGG GGCTTGACGGGCGACAAGACGTCGGCCAGCGCCTCCGGTACCTACGCTAGCGACGCCGGCTAGAATCGGTCTGCTCGCCC TTCGGCCCATTCGGACCGCA AAGCCGGGTAAGCCTGGCGT 3501 AGGAATCGGTCAATACACTACATGGCGTGATTTCATATGCGCGATTGCTGATCCCCATGTGTATCACTGGCAAACTGTGA TCCTTAGCCAGTTATGTGATGTACCGCACTAAAGTATACGCGCTAACGACTAGGGGTACACATAGTGACCGTTTGACACT TGGACGACACCGTCAGTGCG ACCTGCTGTGGCAGTCACGC 3601 TCCGTCGCGCAGGCTCTCGATGAGCTGATGCTTTGGGCCGAGGACTGCCCCGAAGTCCGGCACCTCGTGCACGCGGATTT AGGCAGCGCGTCCGAGAGCTACTCGACTACGAAACCCGGCTCCTGACGGGGCTTCAGGCCGTGGAGCACGTGCGCCTAAA CGGCTCCAACAATGTCCTGA GCCGAGGTTGTTACAGGACT 3701 CGGACAATGGCCGCATAACAGCGGTCATTGACTGGAGCGAGGCGATGTTCGGGGATTCCCAATACGAGGTCGCCAACATC GCCTGTTACCGGCGTATTGTCGCCAGTAACTGACCTCGCTCCGCTACAAGCCCCTAAGGGTTATGCTCCAGCGGTTGTAG TTCTTCTGGAGGCCGTGGTT AAGAAGACCTCCGGCACCAA 3801 GGCTTGTATGGAGCAGCAGACGCGCTACTTCGAGCGGAGGCATCCGGAGCTTGCAGGATCGCCGCGGCTCCGGGCGTATA CCGAACATACCTCGTCGTCTGCGCGATGAAGCTCGCCTCCGTAGGCCTCGAACGTCCTAGCGGCGCCGAGGCCCGCATAT TGCTCCGCATTGGTCTTGAC ACGAGGCGTAACCAGAACTG 3901 CAACTCTATCAGAGCTTGGTTGACGGCAATTTCGATGATGCAGCTTGGGCGCAGGGTCGATGCGACGCAATCGTCCGATC 
GTTGAGATAGTCTCGAACCAACTGCCGTTAAAGCTACTACGTCGAACCCGCGTCCCAGCTACGCTGCGTTAGCAGGCTAG CGGAGCCGGGACTGTCGGGC GCCTCGGCCCTGACAGCCCG 4001 GTACACAAATCGCCCGCAGAAGCGCGGCCGTCTGGACCGATGGCTGTGTAGAAGTACTCGCCGATAGTGGAAACCGACGC CATGTGTTTAGCGGGCGTCTTCGCGCCGGCAGACCTGGCTACCGACACATCTTCATGAGCGGCTATCACCTTTGGCTGCG CCCAGCACTCGTCCGGATCG GGGTCGTGAGCAGGCCTAGC 4101 GGAGATGGGGGAGGCTAACTGAAACACGGAAGGAGACAATACCGGAAGGAACCTCGACGTTAACTTGTTTATTGCAGCTT CCTCTACCCCCTCCGATTGACTTTGTGCCTTCCTCTGTTATGGCCTTCCTTGGAGCTGCAATTGAACAAATAACGTCGAA ATAATGGTTACAAATAAAGC TATTACCAATGTTTATTTCG 4201 AATAGCATCACAAATTTCACAAATAAAGCATTTATTACCCTGTTATCCCTAGAATTCACTGGCCGTCGTTTTACAACGTC TTATCGTAGTGTTTAAAGTGTTTATTTCGTAAATAATGGGACAATAGGGATCTTAAGTGACCGGCAGCAAAATGTTGCAG GTGACTGGGAAAACCCTGGC CACTGACCCTTTTGGGACCG 4301 GTTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCC CAATGGGTTGAATTAGCGGAACGTCGTGTAGGGGGAAAGCGGTCGACCGCATTATCGCTTCTCCGGGCGTGGCTAGCGGG TTCCCAACAGTTGCGCAGCC AAGGGTTGTCAACGCGTCGG 4401 TGAATGGCGAATGGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTATTTCACACCGCATACGTCAAAGCAAC ACTTACCGCTTACCGCGGACTACGCCATAAAAGAGGAATGCGTAGACACGCCATAAAGTGTGGCGTATGCAGTTTCGTTG CATAGTACGCGCCCTGTAGC GTATCATGCGCGGGACATCG 4501 GGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTT CCGCGTAATTCGCGCCGCCCACACCACCAATGCGCGTCGCACTGGCGATGTGAACGGTCGCGGGATCGCGGGCGAGGAAA CGCTTTCTTCCCTTCCTTTC GCGAAAGAAGGGAAGGAAAG 4601 TCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCAC AGCGGTGCAAGCGGCCGAAAGGGGCAGTTCGAGATTTAGCCCCCGAGGGAAATCCCAAGGCTAAATCACGAAATGCCGTG CTCGACCCCAAAAAACTTGA GAGCTGGGGTTTTTTGAACT 4701 TTTGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTA AAACCCACTACCAAGTGCATCACCCGGTAGCGGGACTATCTGCCAAAAAGCGGGAAACTGCAACCTCAGGTGCAAGAAAT ATAGTGGACTCTTGTTCCAA TATCACCTGAGAACAAGGTT 4801 ACTGGAACAACACTCAACCCTATCTCGGGCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAA TGACCTTGTTGTGAGTTGGGATAGAGCCCGATAAGAAAACTAAATATTCCCTAAAACGGCTAAAGCCGGATAACCAATTT AAATGAGCTGATTTAACAAA 
TTTACTCGACTAAATTGTTT 4901 AATTTAACGCGAATTTTAACAAAATATTAACGTTTACAATTTTATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCA TTAAATTGCGCTTAAAATTGTTTTATAATTGCAAATGTTAAAATACCACGTGAGAGTCATGTTAGACGAGACTACGGCGT TAGTTAAGCCAGCCCCGACA ATCAATTCGGTCGGGGCTGT 5001 CCCGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGTCTAGA GGGCGGTTGTGGGCGACTGCGCGGGACTGCCCGAACAGACGAGGGCCGTAGGCGAATGTCTGTTCGACACTGGCAGATCT CGAAAGGGCCTCGTGATACG GCTTTCCCGGAGCACTATGC 5101 CCTATTTTTATAGGTTAATGTCATGATAATAATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAA GGATAAAAATATCCAATTACAGTACTATTATTACCAAAGAATCTGCAGTCCACCGTGAAAAGCCCCTTTACACGCGCCTT CCCCTATTTGTTTATTTTTC GGGGATAAACAAATAAAAAG 5201 TAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTAT ATTTATGTAAGTTTATACATAGGCGAGTACTCTGTTATTGGGACTATTTACGAAGTTATTATAACTTTTTCCTTCTCATA GAGTATTCAACATTTCCGTG CTCATAAGTTGTAAAGGCAC 5301 TCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCT AGCGGGAATAAGGGAAAAAACGCCGTAAAACGGAAGGACAAAAACGAGTGGGTCTTTGCGACCACTTTCATTTTCTACGA GAAGATCAGTTGGGTGCACG CTTCTAGTCAACCCACGTGC 5401 AGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGA TCACCCAATGTAGCTTGACCTAGAGTTGTCGCCATTCTAGGAACTCTCAAAAGCGGGGCTTCTTGCAAAAGGTTACTACT GCACTTTTAAAGTTCTGCTA CGTGAAAATTTCAAGACGAT 5501 TGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGT AAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAAC TGAGTACTCACCAGTCACAG TTACTTCTGACAACGATCGG 5601 TTTTCGTAGAATGCCTACCGTACTGTCATTCTCTTAATACGTCACGACGGTATTGGTACTCACTATTGTGACGCCGGTTG TTTTCGTAGAATGCCTACCGTACTGTCATTCTCTTAATACGTCACGACGGTATTGGTACTCACTATTGTGACGCCGGTTG AATGAAGACTGTTGCTAGCC AATGAAGACTGTTGCTAGCC 5701 AGGACCGAAGGAGCTAACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGA TCCTGGCTTCCTCGATTGGCGAAAAAACGTGTTGTACCCCCTAGTACATTGAGCGGAACTAGCAACCCTTGGCCTCGACT ATGAAGCCATACCAAACGAC TACTTCGGTATGGTTTGCTG 5801 GAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTC 
CTCGCACTGTGGTGCTACGGACATCGTTACCGTTGTTGCAACGCGTTTGATAATTGACCGCTTGATGAATGAGATCGAAG CCGGCAACAATTAATAGACT GGCCGTTGTTAATTATCTGA 5901 GGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGA CCTACCTCCGCCTATTTCAACGTCCTGGTGAAGACGCGAGCCGGGAAGGCCGACCGACCAAATAACGACTATTTAGACCT GCCGGTGAGCGTGGGTCTCG CGGCCACTCGCACCCAGAGC 6001 CGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTA GCCATAGTAACGTCGTGACCCCGGTCTACCATTCGGGAGGGCATAGCATCAATAGATGTGCTGCCCCTCAGTCCGTTGAT TGGATGAACGAAATAGACAG ACCTACTTGCTTTATCTGTC 6101 ATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATATATACTTTAGATTGATTT TAGCGACTCTATCCACGGAGTGACTAATTCGTAACCATTGACAGTCTGGTTCAAATGAGTATATATGAAATCTAACTAAA AAAACTTCATTTTTAATTTA TTTTGAAGTAAAAATTAAAT 6201 AAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCA TTTCCTAGATCCACTTCTAGGAAAAACTATTAGAGTACTGGTTTTAGGGAATTGCACTCAAAAGCAAGGTGACTCGCAGT GACCCCGTAGAAAAGATCAA CTGGGGCATCTTTTCTAGTT 6301 AGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTT TCCTAGAAGAACTCTAGGAAAAAAAGACGCGCATTAGACGACGAACGTTTGTTTTTTTGGTGGCGATGGTCGCCACCAAA GTTTGCCGGATCAAGAGCTA CAAACGGCCTAGTTCTCGAT 6401 CCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGG GGTTGAGAAAAAGGCTTCCATTGACCGAAGTCGTCTCGCGTCTATGGTTTATGACAGGAAGATCACATCGGCATCAATCC CCACCACTTCAAGAACTCTG GGTGGTGAAGTTCTTGAGAC 6501 TAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGG ATCGTGGCGGATGTATGGAGCGAGACGATTAGGACAATGGTCACCGACGACGGTCACCGCTATTCAGCACAGAATGGCCC TTGGACTCAAGACGATAGTT AACCTGAGTTCTGCTATCAA 6601 ACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAAC TGGCCTATTCCGCGTCGCCAGCCCGACTTGCCCCCCAAGCACGTGTGTCGGGTCGAACCTCGCTTGCTGGATGTGGCTTG TGAGATACCTACAGCGTGAG ACTCTATGGATGTCGCACTC 6701 CTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCG GATACTCTTTCGCGGTGCGAAGGGCTTCCCTCTTTCCGCCTGTCCATAGGCCATTCGCCGTCCCAGCCTTGTCCTCTCGC CACGAGGGAGCTTCCAGGGG 
GTGCTCCCTCGAAGGTCCCC 6801 GAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGG CTTTGCGGACCATAGAAATATCAGGACAGCCCAAAGCGGTGGAGACTGAACTCGCAGCTAAAAACACTACGAGCAGTCCC GGGCGGAGCCTATGGAAAAA CCCGCCTCGGATACCTTTTT 6901 CGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTG GCGGTCGTTGCGCCGGAAAAATGCCAAGGACCGGAAAACGACCGGAAAACGAGTGTACAAGAAAGGACGCAATAGGGGAC ATTCTGTGGATAACCGTATT TAAGACACCTATTGGCATAA 7001 ACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAG TGGCGGAAACTCACTCGACTATGGCGAGCGGCGTCGGCTTGCTGGCTCGCGTCGCTCAGTCACTCGCTCCTTCGCCTTC

    TABLE-US-00007 APPENDIX 5 Sequence of the plasmid encoding bioSNAP25-N-MLuc hybrid. pS14LbioSNAP25-N-MLuc-CITE-Hyg1 1 AGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGCACGACAGGTTTCCCGACTGG TCGCGGGTTATGCGTTTGGCGGAGAGGGGCGCGCAACCGGCTAAGTAATTACGTCGACCGTGCTGTCCAAAGGGCTGACC AAAGCGGGCAGTGAGCGCAA TTTCGCCCGTCACTCGCGTT 101 CGCAATTAATGTGAGTTAGCTCACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGAA GCGTTAATTACACTCAATCGAGTGAGTAATCCGTGGGGTCCGAAATGTGAAATACGAAGGCCGAGCATACAACACACCTT TTGTGAGCGGATAACAATTT AACACTCGCCTATTGTTAAA 201 CACACAGGAAACAGCTATGACCATGATTACGCCAAGCTTTAGGGATAACAGGGTAATCGCCATGCATTAGTTATTAATAG GTGTGTCCTTTGTCGATACTGGTACTAATGCGGTTCGAAATCCCTATTGTCCCATTAGCGGTACGTAATCAATAATTATC TAATCAATTACGGGGTCATT ATTAGTTAATGCCCCAGTAA 301 AGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCC TCAAGTATCGGGTATATACCTCAAGGCGCAATGTATTGAATGCCATTTACCGGGCGGACCGACTGGCGGGTTGCTGGGGG GCCCATTGACGTCAATAATG CGGGTAACTGCAGTTATTAC 401 ACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTT TGCATACAAGGGTATCATTGCGGTTATCCCTGAAAGGTAACTGCAGTTACCCACCTCATAAATGCCATTTGACGGGTGAA GGCAGTACATCAAGTGTATC CCGTCATGTAGTTCACATAG 501 ATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGG TATACGGTTCATGCGGGGGATAACTGCAGTTACTGCCATTTACCGGGCGGACCGTAATACGGGTCATGTACTGGAATACC GACTTTCCTACTTGGCAGTA CTGAAAGGATGAACCGTCAT 601 CATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACT GTAGATGCATAATCAGTAGCGATAATGGTACCACTACGCCAAAACCGTCATGTAGTTACCCGCACCTATCGCCAAACTGA CACGGGGATTTCCAAGTCTC GTGCCCCTAAAGGTTCAGAG 701 CACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCC GTGGGGTAACTGCAGTTACCCTCAAACAAAACCGTGGTTTTAGTTGCCCTGAAAGGTTTTACAGCATTGTTGAGGCGGGG ATTGACGCAAATGGGCGGTA TAACTGCGTTTACCCGCCAT 801 GGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTGGTTTAGTGAACCGTCAGATCCGCTAGACGTCTCATTTAGGCATG CCGCACATGCCACCCTCCAGATATATTCGTCTCGACCAAATCACTTGGCAGTCTAGGCGATCTGCAGAGTAAATCCGTAC GAAACCCCAGCGCAGCTTCT CTTTGGGGTCGCGTCGAAGA 901 
CTTCCTCCTGCTACTCTGGATCCCAGACACCATTGAAGAAATAGTGATGACGCAGTCTCCAGCCACCCTGTCTGTGTCTC GAAGGAGGACGATGAGACCTAGGGTCTGTGGTAACTTCTTTATCACTACTGCGTCAGAGGTCGGTGGGACAGACACAGAG CAGGGGAAAGAGTCACCCTC GTCCCCTTTCTCAGTGGGAG 1001 TCCTCAGGCGGCGCAAGCAGCCTGAGACAGATTCTGGACTCCCAGAAAATGGAGTGGAGGTCCAACGCCGGGGGCAGCGG AGGAGTCCGCCGCGTTCGTCGGACTCTGTCTAAGACCTGAGGGTCTTTTACCTCACCTCCAGGTTGCGGCCCCCGTCGCC TAGGGATAACAGGGTAATCG ATCCCTATTGTCCCATTAGC 1101 CCGAGGACGCAGACATGCGTAATGAACTGGAGGAGATGCAGAGGAGGGCTGACCAGCTGGCTGATGAGTCCCTGGAAAGC GGCTCCTGCGTCTGTACGCATTACTTGACCTCCTCTACGTCTCCTCCCGACTGGTCGACCGACTACTCAGGGACCTTTCG ACCCGTCGCATGCTGCAGCT TGGGCAGCGTACGACGTCGA 1201 GGTCGAAGAGAGTAAAGATGCTGGCATCAGGACTTTGGTTATGTTGGATGAGCAAGGCGAACAACTGGAACGCATTGAGG CCAGCTTCTCTCATTTCTACGACCGTAGTCCTGAAACCAATACAACCTACTCGTTCCGCTTGTTGACCTTGCGTAACTCC AAGGGATGGACCAAATCAAT TTCCCTACCTGGTTTAGTTA 1301 AAGGATATGAAAGAAGCAGAAAAGAATTTGACGGACCTAGGAAAATTCTGCGGGCTTTGTGTGTGTCCCTGTAACAAGCT TTCCTATACTTTCTTCGTCTTTTCTTAAACTGCCTGGATCCTTTTAAGACGCCCGAAACACACACAGGGACATTGTTCGA TAAATCCAGTGATGCTTACA ATTTAGGTCACTACGAATGT 1401 AAAAAGCCTGGGGCAATAATCAGGATGGAGTAGTGGCCAGCCAGCCTGCCCGTGTGGTGGATGAACGGGAGCAGATGGCC TTTTTCGGACCCCGTTATTAGTCCTACCTCATCACCGGTCGGTCGGACGGGCACACCACCTACTTGCCCTCGTCTACCGG ATCAGTGGTGGCTTCATCCG TAGTCACCACCGAAGTAGGC 1501 CAGGGTAACAAACGATGCCCGGGAAAATGAAATGGATGAAAACCTAGAGCAGGTGAGCGGCATCATCGGAAACCTCCGTC GTCCCATTGTTTGCTACGGGCCCTTTTACTTTACCTACTTTTGGATCTCGTCCACTCGCCGTAGTAGCCTTTGGAGGCAG ATATGGCCCTAGACATGGGC TATACCGGGATCTGTACCCG 1601 AATGAGATTGACACCCAGAATCGCCAGATTGACAGGATCATGGAGAAGGCTGACTCCAACAAAACCAGAATTGATGAAGC TTACTCTAACTGTGGGTCTTAGCGGTCTAACTGTCCTAGTACCTCTTCCGACTGAGGTTGTTTTGGTCTTAACTACTTCG CAACCAACGTGCAACAAAGA GTTGGTTGCACGTTGTTTCT 1701 TGCTGGGAAGTGGGGAGATCTCCGCGGCCCGGGATCCACCGGCTAGCGGGAATTCCAAATCAACTGAGTTCGATCCTAAC ACGACCCTTCACCCCTCTAGAGGCGCCGGGCCCTAGGTGGCCGATCGCCCTTAAGGTTTAGTTGACTCAAGCTAGGATTG ATTGACATTGTTGGTTTAGA TAACTGTAACAACCAAATCT 1801 AGGAAAATTTGGTATTACAAACCTAGAGACGGATTTATTCACAATCTGGGAGACAATGGAGGTCATGATCAAAGCAGATA 
TCCTTTTAAACCATAATGTTTGGATCTCTGCCTAAATAAGTGTTAGACCCTCTGTTACCTCCAGTACTAGTTTCGTCTAT TTGCAGATACTGATAGAGCC AACGTCTATGACTATCTCGG 1901 AGCAACTTTGTTGCAACTGAAACCGATGCTAACCGCGGAAAAATGCCTGGCAAAAAACTGCCACTGGCAGTTATCATGGA TCGTTGAAACAACGTTGACTTTGGCTACGATTGGCGCCTTTTTACGGACCGTTTTTTGACGGTGACCGTCAATAGTACCT AATGGAAGCCAATGCTTTCA TTACCTTCGGTTACGAAAGT 2001 AAGCTGGCTGCACCAGGGGATGCCTTATCTGTCTTTCAAAAATTAAGTGTACAGCCAAAATGAAGGTATACATTCCAGGA TTCGACCGACGTGGTCCCCTACGGAATAGACAGAAAGTTTTTAATTCACATGTCGGTTTTACTTCCATATGTAAGGTCCT AGGTGTCACGATTATGGTGG TCCACAGTGCTAATACCACC 2101 TGACAAGAAAACTGGACAGGCAGGAATTGTTGGTGCAATTGTTGACATTCCCGAAATCTCTGGATTTAAGGAGATGGCAC ACTGTTCTTTTGACCTGTCCGTCCTTAACAACCACGTTAACAACTGTAAGGGCTTTAGAGACCTAAATTCCTCTACCGTG CCATGGAACAGTTCATTGCT GGTACCTTGTCAAGTAACGA 2201 CAAGTTGATCGCTGCGCTTCCTGCACTACTGGATGTCTCAAAGGTCTTGCCAATGTTAAGTGCTCTGAACTCCTGAAGAA GTTCAACTAGCGACGCGAAGGACGTGATGACCTACAGAGTTTCCAGAACGGTTACAATTCACGAGACTTGAGGACTTCTT ATGGCTGCCTGACAGGTGTG TACCGACGGACTGTCCACAC 2301 CAAGTTTTGCTGACAAGATTCAAAAAGAAGTTCACAATATCAAAGGCATGGCCGGCGATCGATGAGCGGCCGCAATTTAA GTTCAAAACGACTGTTCTAAGTTTTTCTTCAAGTGTTATAGTTTCCGTACCGGCCGCTAGCTACTCGCCGGCGTTAAATT TTCCGGTTATTTTCCACCAT AAGGCCAATAAAAGGTGGTA 2401 ATTGCCGTCTTTTGGCAATGTGAGGGCCCGGAAACCTGGCCCTGTCTTCTTGACGAGCATTCCTAGGGGTCTTTCCCCTC TAACGGCAGAAAACCGTTACACTCCCGGGCCTTTGGACCGGGACAGAAGAACTGCTCGTAAGGATCCCCAGAAAGGGGAG TCGCCAAAGGAATGCAAGGT AGCGGTTTCCTTACGTTCCA 2501 CTGTTGAATGTCGTGAAGGAAGCAGTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGGCA GACAACTTACAGCACTTCCTTCGTCAAGGAGACCTTCGAAGAACTTCTGTTTGTTGCAGACATCGCTGGGAAACGTCCGT GCGGAACCCCCCACCTGGCG CGCCTTGGGGGGTGGACCGC 2601 ACAGGTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCACAACCCCAGTGCCACGTTGTGAGT TGTCCACGGAGACGCCGGTTTTCGGTGCACATATTCTATGTGGACGTTTCCGCCGTGTTGGGGTCACGGTGCAACACTCA TGGATAGTTGTGGAAAGAGT ACCTATCAACACCTTTCTCA 2701 CAAATGGCTCACCTCAAGCGTATTCAACAAGGGGCTGAAGGATGCCCAGAAGGTACCCCATTGTATGGGATCTGATCTGG GTTTACCGAGTGGAGTTCGCATAAGTTGTTCCCCGACTTCCTACGGGTCTTCCATGGGGTAACATACCCTAGACTAGACC GGCCTCGGTGCACATGCTTT 
CCGGAGCCACGTGTACGAAA 2801 ACATGTGTTTAGTCGAGGTTAAAAAACGTCTAGGCCCCCCGAACCACGGGGACGTGGTTTTCCTTTGAAAAACACGATGA TGTACACAAATCAGCTCCAATTTTTTGCAGATCCGGGGGGCTTGGTGCCCCTGCACCAAAAGGAAACTTTTTGTGCTACT TAATATGGCCACCACCCATA ATTATACCGGTGGTGGGTAT 2901 CCTAGGCTTTTGCAAAGATCGATCAGATCCCGGGGGGCAATGAGATATGAAAAAGCCTGAACTCACCGCGACGTCTGTCG GGATCCGAAAACGTTTCTAGCTAGTCTAGGGCCCCCCGTTACTCTATACTTTTTCGGACTTGAGTGGCGCTGCAGACAGC AGAAGTTTCTGATCGAAAAG TCTTCAAAGACTAGCTTTTC 3001 TTCGACAGCGTCTCCGACCTGATGCAGCTCTCGGAGGGCGAAGAATCTCGTGCTTTCAGCTTCGATGTAGGAGGGCGTGG AAGCTGTCGCAGAGGCTGGACTACGTCGAGAGCCTCCCGCTTCTTAGAGCACGAAAGTCGAAGCTACATCCTCCCGCACC ATATGTCCTGCGGGTAAATA TATACAGGACGCCCATTTAT 3101 GCTGCGCCGATGGTTTCTACAAAGATCGTTATGTTTATCGGCACTTTGCATCGGCCGCGCTCCCGATTCCGGAAGTGCTT CGACGCGGCTACCAAAGATGTTTCTAGCAATACAAATAGCCGTGAAACGTAGCCGGCGCGAGGGCTAAGGCCTTCACGAA GACATTGGGGAATTCAGCGA CTGTAACCCCTTAAGTCGCT 3201 GAGCCTGACCTATTGCATCTCCCGCCGTGCACAGGGTGTCACGTTGCAAGACCTGCCTGAAACCGAACTGCCCGCTGTTC CTCGGACTGGATAACGTAGAGGGCGGCACGTGTCCCACAGTGCAACGTTCTGGACGGACTTTGGCTTGACGGGCGACAAG TGCAGCCGGTCGCGGAGGCC ACGTCGGCCAGCGCCTCCGG 3301 ATGGATGCGATCGCTGCGGCCGATCTTAGCCAGACGAGCGGGTTCGGCCCATTCGGACCGCAAGGAATCGGTCAATACAC TACCTACGCTAGCGACGCCGGCTAGAATCGGTCTGCTCGCCCAAGCCGGGTAAGCCTGGCGTTCCTTAGCCAGTTATGTG TACATGGCGTGATTTCATAT ATGTACCGCACTAAAGTATA 3401 GCGCGATTGCTGATCCCCATGTGTATCACTGGCAAACTGTGATGGACGACACCGTCAGTGCGTCCGTCGCGCAGGCTCTC CGCGCTAACGACTAGGGGTACACATAGTGACCGTTTGACACTACCTGCTGTGGCAGTCACGCAGGCAGCGCGTCCGAGAG GATGAGCTGATGCTTTGGGC CTACTCGACTACGAAACCCG 3501 CGAGGACTGCCCCGAAGTCCGGCACCTCGTGCACGCGGATTTCGGCTCCAACAATGTCCTGACGGACAATGGCCGCATAA GCTCCTGACGGGGCTTCAGGCCGTGGAGCACGTGCGCCTAAAGCCGAGGTTGTTACAGGACTGCCTGTTACCGGCGTATT CAGCGGTCATTGACTGGAGC GTCGCCAGTAACTGACCTCG 3601 GAGGCGATGTTCGGGGATTCCCAATACGAGGTCGCCAACATCTTCTTCTGGAGGCCGTGGTTGGCTTGTATGGAGCAGCA CTCCGCTACAAGCCCCTAAGGGTTATGCTCCAGCGGTTGTAGAAGAAGACCTCCGGCACCAACCGAACATACCTCGTCGT GACGCGCTACTTCGAGCGGA CTGCGCGATGAAGCTCGCCT 3701 GGCATCCGGAGCTTGCAGGATCGCCGCGGCTCCGGGCGTATATGCTCCGCATTGGTCTTGACCAACTCTATCAGAGCTTG 
CCGTAGGCCTCGAACGTCCTAGCGGCGCCGAGGCCCGCATATACGAGGCGTAACCAGAACTGGTTGAGATAGTCTCGAAC GTTGACGGCAATTTCGATGA CAACTGCCGTTAAAGCTACT 3801 TGCAGCTTGGGCGCAGGGTCGATGCGACGCAATCGTCCGATCCGGAGCCGGGACTGTCGGGCGTACACAAATCGCCCGCA ACGTCGAACCCGCGTCCCAGCTACGCTGCGTTAGCAGGCTAGGCCTCGGCCCTGACAGCCCGCATGTGTTTAGCGGGCGT GAAGCGCGGCCGTCTGGACC CTTCGCGCCGGCAGACCTGG 3901 GATGGCTGTGTAGAAGTACTCGCCGATAGTGGAAACCGACGCCCCAGCACTCGTCCGGATCGGGAGATGGGGGAGGCTAA CTACCGACACATCTTCATGAGCGGCTATCACCTTTGGCTGCGGGGTCGTGAGCAGGCCTAGCCCTCTACCCCCTCCGATT CTGAAACACGGAAGGAGACA GACTTTGTGCCTTCCTCTGT 4001 ATACCGGAAGGAACCTCGACGTTAACTTGTTTATTGCAGCTTATAATGGTTACAAATAAAGCAATAGCATCACAAATTTC TATGGCCTTCCTTGGAGCTGCAATTGAACAAATAACGTCGAATATTACCAATGTTTATTTCGTTATCGTAGTGTTTAAAG ACAAATAAAGCATTTATTAC TGTTTATTTCGTAAATAATG 4101 CCTGTTATCCCTAGAATTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGTTACCCAACTTAATCGC GGACAATAGGGATCTTAAGTGACCGGCAGCAAAATGTTGCAGCACTGACCCTTTTGGGACCGCAATGGGTTGAATTAGCG CTTGCAGCACATCCCCCTTT GAACGTCGTGTAGGGGGAAA 4201 CGCCAGCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATGGCGCC GCGGTCGACCGCATTATCGCTTCTCCGGGCGTGGCTAGCGGGAAGGGTTGTCAACGCGTCGGACTTACCGCTTACCGCGG TGATGCGGTATTTTCTCCTT ACTACGCCATAAAAGAGGAA 4301 ACGCATCTGTGCGGTATTTCACACCGCATACGTCAAAGCAACCATAGTACGCGCCCTGTAGCGGCGCATTAAGCGCGGCG TGCGTAGACACGCCATAAAGTGTGGCGTATGCAGTTTCGTTGGTATCATGCGCGGGACATCGCCGCGTAATTCGCGCCGC GGTGTGGTGGTTACGCGCAG CCACACCACCAATGCGCGTC 4401 CGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCT GCACTGGCGATGTGAACGGTCGCGGGATCGCGGGCGAGGAAAGCGAAAGAAGGGAAGGAAAGAGCGGTGCAAGCGGCCGA TTCCCCGTCAAGCTCTAAAT AAGGGGCAGTTCGAGATTTA 4501 CGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAACTTGATTTGGGTGATGGTTCACG GCCCCCGAGGGAAATCCCAAGGCTAAATCACGAAATGCCGTGGAGCTGGGGTTTTTTGAACTAAACCCACTACCAAGTGC TAGTGGGCCATCGCCCTGAT ATCACCCGGTAGCGGGACTA 4601 AGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAAC TCTGCCAAAAAGCGGGAAACTGCAACCTCAGGTGCAAGAAATTATCACCTGAGAACAAGGTTTGACCTTGTTGTGAGTTG CCTATCTCGGGCTATTCTTT 
GGATAGAGCCCGATAAGAAA 4701 TGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAAAATTTAACGCGAATTTTA ACTAAATATTCCCTAAAACGGCTAAAGCCGGATAACCAATTTTTTACTCGACTAAATTGTTTTTAAATTGCGCTTAAAAT ACAAAATATTAACGTTTACA TGTTTTATAATTGCAAATGT 4801 ATTTTATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCCGACACCCGCCAACACCCGCTGA TAAAATACCACGTGAGAGTCATGTTAGACGAGACTACGGCGTATCAATTCGGTCGGGGCTGTGGGCGGTTGTGGGCGACT CGCGCCCTGACGGGCTTGTC GCGCGGGACTGCCCGAACAG 4901 TGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGTCTAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAA ACGAGGGCCGTAGGCGAATGTCTGTTCGACACTGGCAGATCTGCTTTCCCGGAGCACTATGCGGATAAAAATATCCAATT TGTCATGATAATAATGGTTT ACAGTACTATTATTACCAAA 5001 CTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATG GAATCTGCAGTCCACCGTGAAAAGCCCCTTTACACGCGCCTTGGGGATAAACAAATAAAAAGATTTATGTAAGTTTATAC TATCCGCTCATGAGACAATA ATAGGCGAGTACTCTGTTAT 5101 ACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTT TGGGACTATTTACGAAGTTATTATAACTTTTTCCTTCTCATACTCATAAGTTGTAAAGGCACAGCGGGAATAAGGGAAAA TTGCGGCATTTTGCCTTCCT AACGCCGTAAAACGGAAGGA 5201 GTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACT CAAAAACGAGTGGGTCTTTGCGACCACTTTCATTTTCTACGACTTCTAGTCAACCCACGTGCTCACCCAATGTAGCTTGA GGATCTCAACAGCGGTAAGA CCTAGAGTTGTCGCCATTCT 5301 TCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCC AGGAACTCTCAAAAGCGGGGCTTCTTGCAAAAGGTTACTACTCGTGAAAATTTCAAGACGATACACCGCGCCATAATAGG CGTATTGACGCCGGGCAAGA GCATAACTGCGGCCCGTTCT 5401 GCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATG CGTTGAGCCAGCGGCGTATGTGATAAGAGTCTTACTGAACCAACTCATGAGTGGTCAGTGTCTTTTCGTAGAATGCCTAC GCATGACAGTAAGAGAATTA CGTACTGTCATTCTCTTAAT 5501 TGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAAC ACGTCACGACGGTATTGGTACTCACTATTGTGACGCCGGTTGAATGAAGACTGTTGCTAGCCTCCTGGCTTCCTCGATTG CGCTTTTTTGCACAACATGG GCGAAAAAACGTGTTGTACC 5601 GGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAGCGTGACACCACGATG 
CCCTAGTACATTGAGCGGAACTAGCAACCCTTGGCCTCGACTTACTTCGGTATGGTTTGCTGCTCGCACTGTGGTGCTAC CCTGTAGCAATGGCAACAAC GGACATCGTTACCGTTGTTG 5701 GTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAG CAACGCGTTTGATAATTGACCGCTTGATGAATGAGATCGAAGGGCCGTTGTTAATTATCTGACCTACCTCCGCCTATTTC TTGCAGGACCACTTCTGCGC AACGTCCTGGTGAAGACGCG 5801 TCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACT AGCCGGGAAGGCCGACCGACCAAATAACGACTATTTAGACCTCGGCCACTCGCACCCAGAGCGCCATAGTAACGTCGTGA GGGGCCAGATGGTAAGCCCT CCCCGGTCTACCATTCGGGA 5901 CCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCC GGGCATAGCATCAATAGATGTGCTGCCCCTCAGTCCGTTGATACCTACTTGCTTTATCTGTCTAGCGACTCTATCCACGG TCACTGATTAAGCATTGGTA AGTGACTAATTCGTAACCAT 6001 ACTGTCAGACCAAGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGA TGACAGTCTGGTTCAAATGAGTATATATGAAATCTAACTAAATTTTGAAGTAAAAATTAAATTTTCCTAGATCCACTTCT TCCTTTTTGATAATCTCATG AGGAAAAACTATTAGAGTAC 6101 ACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCC TGGTTTTAGGGAATTGCACTCAAAAGCAAGGTGACTCGCAGTCTGGGGCATCTTTTCTAGTTTCCTAGAAGAACTCTAGG TTTTTTTCTGCGCGTAATCT AAAAAAAGACGCGCATTAGA 6201 GCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAG CGACGAACGTTTGTTTTTTTGGTGGCGATGGTCGCCACCAAACAAACGGCCTAGTTCTCGATGGTTGAGAAAAAGGCTTC GTAACTGGCTTCAGCAGAGC CATTGACCGAAGTCGTCTCG 6301 GCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACC CGTCTATGGTTTATGACAGGAAGATCACATCGGCATCAATCCGGTGGTGAAGTTCTTGAGACATCGTGGCGGATGTATGG TCGCTCTGCTAATCCTGTTA AGCGAGACGATTAGGACAAT 6401 CCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCG GGTCACCGACGACGGTCACCGCTATTCAGCACAGAATGGCCCAACCTGAGTTCTGCTATCAATGGCCTATTCCGCGTCGC GTCGGGCTGAACGGGGGGTT CAGCCCGACTTGCCCCCCAA 6501 CGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACG GCACGTGTGTCGGGTCGAACCTCGCTTGCTGGATGTGGCTTGACTCTATGGATGTCGCACTCGATACTCTTTCGCGGTGC CTTCCCGAAGGGAGAAAGGC 
GAAGGGCTTCCCTCTTTCCG 6601 GGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTT CCTGTCCATAGGCCATTCGCCGTCCCAGCCTTGTCCTCTCGCGTGCTCCCTCGAAGGTCCCCCTTTGCGGACCATAGAAA ATAGTCCTGTCGGGTTTCGC TATCAGGACAGCCCAAAGCG 6701 CACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTT GTGGAGACTGAACTCGCAGCTAAAAACACTACGAGCAGTCCCCCCGCCTCGGATACCTTTTTGCGGTCGTTGCGCCGGAA TTTACGGTTCCTGGCCTTTT AAATGCCAAGGACCGGAAAA 6801 GCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCT CGACCGGAAAACGAGTGTACAAGAAAGGACGCAATAGGGGACTAAGACACCTATTGGCATAATGGCGGAAACTCACTCGA GATACCGCTCGCCGCAGCCG CTATGGCGAGCGGCGTCGGC 6901 AACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAG TTGCTGGCTCGCGTCGCTCAGTCACTCGCTCCTTCGCCTTC