METHODS FOR GENERATING BARCODED COMBINATORIAL LIBRARIES
20170369870 · 2017-12-28
Inventors
- Ryan T. Gill (Denver, CO)
- Andrew Garst (Boulder, CO, US)
- Tanya Elizabeth Warnecke Lipscomb (Boulder, CO, US)
- Marcelo Colika Bassalo (Boulder, CO, US)
- Ramsey Ibrahim Zeitoun (San Francisco, CA, US)
Cpc classification
C12N2310/20
CHEMISTRY; METALLURGY
C12N15/1065
CHEMISTRY; METALLURGY
C12N15/1082
CHEMISTRY; METALLURGY
C12N15/1079
CHEMISTRY; METALLURGY
International classification
Abstract
Provided herein are methods and composition for trackable genetic variant libraries. Further provided herein are methods and compositions for recursive engineering. Further provided herein are methods and compositions for multiplex engineering. Further provided herein are methods and compositions for enriching for editing and trackable engineered sequences and cells using nucleic acid-guided nucleases.
Claims
1. A composition comprising: i) a first donor nucleic acid comprising: a) a modified first target nucleic acid sequence; b) a first protospacer adjacent motif (PAM) mutation; and c) a first guide nucleic acid sequence comprising a first spacer region complementary to a portion of the first target nucleic acid; and ii) a second donor nucleic acid comprising: a) a barcode corresponding to the modified first target nucleic acid sequence; and b) a second guide nucleic acid sequence comprising a second spacer region complementary to a portion of a second target nucleic acid.
2. The composition of claim 1, wherein the modified first target nucleic acid sequence comprises at least one inserted, deleted, or substituted nucleic acid compared to a corresponding un-modified first target nucleic acid.
3. The composition of claim 1, wherein the first guide nucleic acid and second guide nucleic acid are compatible with a nucleic acid-guided nuclease.
4. The composition of claim 3, wherein the nucleic acid-guided nuclease is a Type II or Type V Cas protein.
5. The composition of claim 3, wherein the nucleic acid-guided nuclease is a Cas9 homologue or a Cpf1 homologue.
6. The composition of claim 1, wherein the second donor nucleic acid comprises a second PAM mutation.
7. The composition of claim 1, wherein the second donor nucleic acid sequence comprises a regulatory sequence or a mutation to turn a screenable or selectable marker on or off.
8. The composition of claim 1, wherein the second donor nucleic acid sequence targets a unique landing site.
9. A method of genome engineering, the method comprising: a) contacting a population of cells with a polynucleotide, wherein each cell comprises a first target nucleic acid, a second target nucleic acid, and a nucleic acid-guided nuclease, wherein the polynucleotide comprises 1) an editing cassette comprising: i) a modified first target nucleic acid sequence; ii) a first protospacer adjacent motif (PAM) mutation; iii) a first guide nucleic acid sequence comprising a spacer region complementary to a portion of the first target nucleic acid and compatible with the nucleic acid-guided nuclease; and 2) a recorder cassette comprising i) a barcode corresponding to the modified first target nucleic acid sequence; and ii) a second guide nucleic acid sequence comprising a second spacer region complementary to a portion of the second target nucleic acid and compatible with the nucleic acid-guided nuclease; b) allowing the first guide nucleic acid sequence, the second guide nucleic acid sequence, and the nucleic acid-guided nuclease to create a genome edit within the first target nucleic acid and the second target nucleic acid.
10. The method of claim 9, further comprising c) sequencing a portion of the barcode, thereby identifying the modified first target nucleic acid that was inserted within the first target nucleic acid in step a).
11. The method of claim 9, wherein the nucleic acid-guided nuclease is a CRISPR nuclease.
12. The method of claim 9, wherein the PAM mutation is not recognized by the nucleic acid-guided nuclease.
13. The method of claim 9, wherein the nucleic acid-guided nuclease is a Type II or Type V Cas protein.
14. The method of claim 9, wherein the nucleic acid-guided nuclease is a Cas9 homologue or a Cpf1 homologue.
15. The method of claim 9, wherein the recorder cassette further comprises a second PAM mutation that is not recognized by the nucleic acid-guided nuclease.
16. A method of selectable recursive genetic engineering comprising a) contacting cells comprising a nucleic acid-guided nuclease with a polynucleotide comprising a recorder cassette, said recorder cassette comprising i) a nucleic acid sequence that recombines into a unique landing site incorporated during a previous round of engineering, wherein the nucleic acid sequence comprises a unique barcode; and ii) a guide RNA compatible with the nucleic acid-guided nuclease that targets the unique landing site; and b) allowing the nucleic acid-guided nuclease to edit the unique landing site, thereby incorporating the unique barcode into the unique landing site.
17. The method of claim 16, wherein the nucleic acid sequence further comprises a regulatory sequence that turns transcription of a screenable or selectable marker on or off.
18. The method of claim 16, wherein the nucleic acid sequence further comprises a PAM mutation that is not compatible with the nucleic acid-guided nuclease.
19. The method of claim 16, wherein the nucleic acid sequence further comprises a second unique landing site for subsequent engineering rounds.
20. The method of claim 16, wherein the polynucleotide further comprises an editing cassette comprising a) a modified first target nucleic acid sequence; b) a first protospacer adjacent motif (PAM) mutation; and c) a first guide nucleic acid sequence comprising a first spacer region complementary to a portion of the first target nucleic acid, wherein the unique barcode corresponds to the modified first target nucleic acid such that the modified target nucleic acid can be identified by the unique barcode.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0015] This patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
[0016]
[0017]
[0018]
[0019]
[0020]
[0021]
[0022]
[0023]
[0024]
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
[0033]
[0034]
[0035]
[0036]
[0037]
[0038]
[0039]
[0040]
[0041]
[0042]
[0043]
[0044]
[0045]
[0046]
[0047]
[0048]
[0049]
[0050]
DETAILED DESCRIPTION OF THE DISCLOSURE
[0051] While preferred embodiments of the present invention have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the invention. It should be understood that various alternatives to the embodiments of the invention described herein may be employed in practicing the invention.
[0052] Methods and compositions for enabling sophisticated combinatorial engineering strategies to optimize and explore complex phenotypes are provided herein. Many phenotypes of interest to basic research and biotechnology are the result of combinations of mutations that occur at distal loci. For example, cancer is often linked to mutations that influence multiple hallmark gene functions rather than a single chromosomal edit. Likewise, many metabolic and regulatory processes that are the target of continuing engineering efforts require the activities of many proteins acting in concert to produce the phenotypic output of interest. Methods and compositions disclosed herein can provide ways of rapid engineering and prototyping of such functions since they can provide rapid construction and accurate reporting on the mutational effects at many sites in parallel.
[0053] The methods and compositions described herein can be carried out or used in any type of cell in which a nucleic acid-guided nuclease system, such as CRISPR or Argonaute, or other targetable nuclease systems, such as TALEN, ZFN, or meganuclease can function (e.g., target and cleave DNA), including prokaryotic, eukaryotic, or archaeal cells. The cell can be a bacterial cell, such as Escherichia spp. (e.g., E. coli). The cell can be a fungal cell, such as a yeast cell, e.g., Saccharomyces spp. The cell can be a human cell. The cell can be an algal cell, a plant cell, an insect cell, or a mammalian cell, including a human cell. Additionally or alternatively, the methods described herein can be carried out in vitro or in cell-free systems in which a nucleic acid guided nuclease system, such as CRISPR or Argonaute, or other nuclease systems, such as TALEN, ZFN, or meganuclease can function (e.g., target and cleave DNA).
[0054] Disclosed herein are compositions and methods for genetic engineering. Disclosed are methods and compositions suitable for trackable or recursive genetic engineering. Disclosed method and compositions can use massively multiplexed oligonucleotide synthesis and cloning to enable high fidelity, trackable, multiplexed genome editing at single nucleotide resolution on a whole genome scale.
Trackable Plasmids
[0055] Methods and compositions can be used to perform high-fidelity trackable editing, for example, at single-nucleotide resolution and can be used to perform editing at a whole genome scale or on episomal nucleic acid molecules. Massively multiplexed oligonucleotide synthesis and/or cloning can be used in combination with a targetable nuclease system, such as a CRISPR system, MAD2 system, MAD7 system, or other nucleic acid-guided nuclease system, for editing.
[0056] As used herein, “cassette” often refers to a single molecule polynucleotide. A cassette can comprise DNA. A cassette can comprise RNA. A cassette can comprise a combination of DNA and RNA. A cassette can comprise non-naturally occurring nucleotides or modified nucleotides. A cassette can be single stranded. A cassette can be double stranded. A cassette can be synthesized as a single molecule. A cassette can be assembled from other cassettes, oligonucleotides, or other nucleic acid molecules. A cassette can comprise one or more elements. Such elements can include, as non-limiting examples, one or more of any of editing sequences, recorder sequences, guide nucleic acids, promoters, regulatory elements, mutant PAM sequences, homology arms, primer sites, linker regions, unique landing sites, a cassette, and any other element disclosed herein. Such elements can be in any order or combination. Any two or more elements can be contiguous or non-contiguous. A cassette can be comprised within a larger polynucleic acid. Such a larger polynucleic acid can be linear or circular, such as a plasmid or viral vector. A cassette can be a synthesized cassette. A cassette can be a trackable cassette.
[0057] A cassette can be designed to be used in any method or composition disclosed herein, including multiplex engineering methods and trackable engineering methods. An exemplary cassette can couple two or more elements, such as 1) a guide nucleic acid (e.g. gRNAs or gDNAs) designed for targeting a user specified target sequence in the genome and 2) an editing sequence and/or recorder sequence as disclosed herein (e.g.
[0058] A cassette can comprise one or more guide nucleic acids and editing cassette as a contiguous polynucleotide. In other examples, one or more guide nucleic acids and editing cassette are contiguous. In other examples, one or more guide nucleic acids and editing cassette are non-contiguous. In other examples, two or more guide nucleic acids and editing cassette are non-contiguous.
[0059] A cassette can comprise one or more guide nucleic acids, an editing cassette, and a recorder cassette as a contiguous polynucleotide. In other examples, one or more guide nucleic acids, editing cassette, and recorder cassette are contiguous. In other examples, two or more guide nucleic acids, editing cassette, and recorder cassette are contiguous. In other examples, one or more guide nucleic acids, editing cassette, and recorder cassette are non-contiguous. In other examples, two or more guide nucleic acids, editing cassette, and recorder cassette are non-contiguous.
[0060] A cassette can comprise one or more guide nucleic acids, one or more editing cassettes, and one or more recorder cassettes as a contiguous polynucleotide. In other examples, one or more guide nucleic acids, one or more editing cassettes, and one or more recorder cassettes are contiguous. In other examples, two or more guide nucleic acids, two or more editing cassettes, and two or more recorder cassettes are contiguous. In other examples, one or more guide nucleic acids, one or more editing cassettes, and one or more recorder cassettes are non-contiguous. In other examples, two or more guide nucleic acids, two or more editing cassettes, and two or more recorder cassettes are non-contiguous.
[0061] A cassette can comprise one or more guide nucleic acids and editing sequence as a contiguous polynucleotide. In other examples, one or more guide nucleic acids and editing sequence are contiguous. In other examples, one or more guide nucleic acids and editing sequence are non-contiguous. In other examples, two or more guide nucleic acids and editing sequence are non-contiguous.
[0062] A cassette can comprise one or more guide nucleic acids, an editing sequence, and a recorder sequence as a contiguous polynucleotide. In other examples, one or more guide nucleic acids, editing sequence, and recorder sequence are contiguous. In other examples, two or more guide nucleic acids, editing sequence, and recorder sequence are contiguous. In other examples, one or more guide nucleic acids, editing sequence, and recorder sequence are non-contiguous. In other examples, two or more guide nucleic acids, editing sequence, and recorder sequence are non-contiguous.
[0063] A cassette can comprise one or more guide nucleic acids, one or more editing sequences, and one or more recorder sequences as a contiguous polynucleotide. In other examples, one or more guide nucleic acids, one or more editing sequences, and one or more recorder sequences are contiguous. In other examples, two or more guide nucleic acids, two or more editing sequences, and two or more recorder sequences are contiguous. In other examples, one or more guide nucleic acids, one or more editing sequences, and one or more recorder sequences are non-contiguous. In other examples, two or more guide nucleic acids, two or more editing sequences, and two or more recorder sequences are non-contiguous.
[0064] An editing cassette can comprise an editing sequence. An editing sequence can comprise a mutation, such as a synonymous or non-synonymous mutation, and homology arms (HAs). An editing sequence can comprise a mutation, such as a synonymous or non-synonymous mutation, and homology arms (HAs) designed to undergo homologous recombination with the target sequence at the site of nucleic acid-guided nuclease-mediated double strand break (e.g.
[0065] A recorder cassette can comprise a recorder sequence. A recorder sequence can comprise a trackable sequence, such as a barcode or marker, and homology arms (HAs). A recorder sequence can comprise a trackable sequence, such as a barcode or marker, and homology arms (HAs) designed to undergo homologous recombination with the chromosome at the site of nucleic acid-guided nuclease-mediated double strand break (e.g.
[0066] A cassette can encode machinery (e.g. targetable nuclease, guide nucleic acid, editing cassette, and/or recorder cassette as disclosed herein) necessary to induce strand breakage as well as designed repair that can be selectively enriched and/or tracked in cells. A cell can be any cell such as eukaryotic cell, archaeal cell, prokaryotic cell, or microorganisms such as E. coli (e.g.
[0067] A cassette can comprise an editing cassette. A cassette can comprise a recorder cassette. A cassette can comprise a guide nucleic acid and an editing cassette. A cassette can comprise a guide nucleic acid and a recorder cassette. A cassette can comprise a guide nucleic acid, an editing cassette, and a recorder cassette. A cassette can comprise two guide nucleic acids, an editing cassette, and a recorder cassette. A cassette can comprise more than two guide nucleic acids, one or more editing cassettes, and one or more recorder cassettes. These elements of a cassette can be linked covalently. These elements of a cassette can be contiguous. These elements of a cassette can be contiguous.
[0068] A cassette can comprise an editing sequence. A cassette can comprise a recorder sequence. A cassette can comprise a guide nucleic acid and an editing sequence. A cassette can comprise a guide nucleic acid and a recorder sequence. A cassette can comprise a guide nucleic acid, an editing sequence, and a recorder sequence. A cassette can comprise two guide nucleic acids, an editing sequence, and a recorder sequence. A cassette can comprise more than two guide nucleic acids, one or more editing sequences, and one or more recorder sequences. These elements of a cassette can be linked covalently. These elements of a cassette can be contiguous. These elements of a cassette can be contiguous.
[0069] Single genome edits can be tracked using sequencing technologies, e.g. short read sequencing technologies (e.g.
[0070] In some embodiments, upon transformation, each editing cassette generates the designed genetic modification within the transformed cell. In some examples, the editing cassette can act in trans as a barcode of the genetic mutation introduced by the editing cassette and can enable the tracking of this mutation frequency in a complex population over time and across many different growth conditions (e.g.
[0071] In some examples, a recording cassette inserts the designed trackable sequence, such as a marker or barcode sequence, within the transformed cell. In some examples, the recorder cassette can act in cis as a barcode of the chromosomal mutation and can enable the tracking of this mutation frequency in a complex population over time and across many different growth conditions.
[0072] By providing cis and/or trans tracking of designed genomic mutations, the methods provided herein simplify sample preparation and depth of coverage for mapping diversity genome wide, and provide powerful tools for engineering on a genome scale (e.g.
[0073] A plurality of cassettes can be pooled into a library of cassettes. A library of cassettes can comprise at least 2 cassettes. A library of cassettes can comprise from 5 to a million cassettes. A library of cassettes can comprise at least a million cassettes. It should be understood, that a library of cassettes can comprise any number of cassettes.
[0074] A library of cassettes can comprise cassettes that have any combination of common elements and non-common or unique elements as compared to the other cassettes within the pool. For example, a library of cassettes can comprise common priming sites or common homology arms while also containing non-common or unique barcodes. Common elements can be shared by a plurality, majority, or all of the cassettes within a library of cassettes. Non-common elements can be shared by a plurality, minority, or sub-population of cassettes within the library of cassettes. Unique elements can be shared by a one, a few, or a sub-population of cassettes within the library of cassettes, such that it is able to identify or distinguish the one, few, or sub-population of cassettes from the other cassettes within the library of cassettes. Such combinations of common and non-common are advantageous for multiplexing techniques as disclosed herein.
[0075] Cassettes disclosed herein can generate the designed genetic modification or insert the designed marker or barcode sequence with high efficiency within a transformed cell. In many examples, the efficiency is greater than 50%. In some examples the efficiency is 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 99%, or 100% (e.g.,
[0076] In some examples, transformation, editing, and/or recording efficiency can be increased by modulating the expression of one or more components disclosed herein, such as a nucleic acid-guided nuclease. Methods for modulating components are disclosed herein and are known in the art. Such methods can include expressing a component, such as a nucleic acid-guided nuclease or CRISPR enzyme of a subject system on a low or high copy plasmid, depending on the experimental design.
[0077] Disclosed herein are methods and compositions for generating cassettes. A cassettes can comprise a cassettes as disclosed herein. For example, a cassette can comprise any combination of an editing cassette and/or recorder cassette disclosed herein. Such a cassette can be comprised on a larger polynucleic acid molecule. Such a larger polynucleic acid molecule can be linear or circular, such as a plasmid or viral vector.
[0078] An editing cassette can comprise a mutation relative to a target nucleic acid sequence. The editing cassette can comprise sequence homologous to the target sequence flanking the desired mutation or editing sequence. The editing cassette can comprise a region which recognizes, or hybridizes to, a target sequence of a nucleic acid in a cell or population of cells, is homologous to the target sequence of the nucleic acid of the cell and includes a mutation, or a desired mutation, of at least one nucleotide relative to the target sequence.
[0079] An editing cassette can comprise a first editing sequence comprising a first mutation relative to a target sequence. A first mutation can comprise a mutation such as an insertion, deletion, or substitution of at least one nucleotide compared to the non-editing target sequence. The mutation can be incorporated into a coding region or non-coding region.
[0080] An editing cassette can comprise a second editing sequence comprising a second mutation relative to a target sequence. The second mutation can be designed to mutate or otherwise silence a PAM sequence such that a corresponding nucleic acid guided nuclease or CRISPR nuclease is no longer able to cleave the target sequence. In such cases, this mutation or silencing of a PAM can serve as a method for selecting transformants in which the first editing sequence has been incorporated.
[0081] In some examples, an editing cassette comprises at least two mutations, wherein one mutation is a PAM mutation. In some examples, the PAM mutation can be in a second editing cassette. Such a second editing cassette can be covalently linked and can be continuous or non-contiguous to the other elements in the cassette.
[0082] An editing cassette can comprise a guide nucleic acid, such as a gRNA encoding gene, optionally operably linked to a promoter. The guide nucleic acid can be designed to hybridize with the targeted nucleic acid sequence in which the editing sequence will be incorporated.
[0083] A recording cassette can comprise a recording sequence. A recorder sequence can comprise a barcoding sequence, or other screenable or selectable marker or fragment thereof. The recording sequence can be comprised within a recorder cassette. Recorder cassettes can comprise regions homologous to an insertion site within a target nucleic acid sequence such that the recording sequence is incorporated by homologous recombination or homology-driven repair systems. The site of incorporation of the recording cassette can be comprised on the same DNA molecule as the target nucleic acid to be edited by an editing cassette. The recorder sequence can comprise a barcode, unique DNA sequence, and/or a complete copy or fragment of a selectable or screenable element or marker.
[0084] A recorder cassette can comprise a mutation relative to the target sequence. The mutation can be designed to mutate or otherwise silence a PAM sequence such that a corresponding nucleic acid guided nuclease or CRISPR nuclease is no longer able to cleave the target sequence. In such cases, this mutation or silencing of a PAM site can serve as a method for selecting transformants in which the first recording sequence has been incorporated. A recorder cassette can comprise a PAM mutation. The PAM mutation can be designed to mutate or otherwise silence a PAM site such that a corresponding CRISPR nuclease is no longer able to cleave the target sequence. In such cases, this mutation or silencing of a PAM site can serve as a method for selecting transformants in which the recorder sequence has been incorporated.
[0085] A recorder cassette can comprise a guide nucleic acid, such as a gene encoding a gRNA. A promoter can be operably linked to a nucleic acid sequence encoding a guide nucleic acid capable of targeting a nucleic acid-guided nuclease to the desired target sequence. A guide nucleic acid can target a unique site within the target site. In some cases, the guide nucleic acid targets a unique landing site that was incorporated in a prior round of engineering. In some cases, the guide nucleic acid targets a unique landing site that was incorporated by a recorder cassette in a prior round of engineering.
[0086] A recorder cassette can comprise a barcode. A barcode can be a unique barcode or relatively unique such that the corresponding mutation can be identified based on the barcode. In some examples, the barcode is a non-naturally occurring sequence that is not found in nature. In most examples, the combination of the desired mutation and the barcode within the editing cassette is non-naturally occurring and not found in nature. A barcode can be any number of nucleotides in length. A barcode can be 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, or more than 30 nucleotides in length. In some cases, the barcode is more than 30 nucleotides in length. A barcode can be generated by degenerate oligonucleotide synthesis. A barcode can be rationally designed or user-specified.
[0087] A recorder cassette can comprise a landing site. A landing site can serve as a target site for a recorder cassette for a successive engineering round. A landing site can comprise a PAM. A landing site can be a unique sequence. A landing site can be at least about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50 nucleotides in length. In some cases, the landing site is greater than 50 nucleotides in length.
[0088] A recorder cassette can comprise a selectable or screenable marker, or a regulatory sequence or mutation that turns a selectable or screenable marker on or off. In such cases, the turning on or off of a selectable marker can be used of selection or counter-selection, respectively, of iterative rounds of engineering. An example regulatory sequence includes a ribosome-binding site (RBS), though other such regulatory sequences are envisioned. Mutations that turn a selectable or screenable marker on can include any possible start codon that is recognized by the host transcription machinery. A mutation that turns off a selectable or screenable marker includes a mutation that deletes a start codon or one that inserts a premature stop codon or a reading frame shift mutation.
[0089] A recorder cassette can comprise one or more of a guide nucleic acid targeting a target site into which the recorder sequence is to be incorporated, a PAM mutation to silence a PAM used by the guide RNA, a barcode corresponding to an editing cassette, a unique site to serve as a landing site for a recorder cassette of a subsequent rounds of engineering, a regulatory sequence or mutation that turns a screenable or selectable marker on or off, these one or more elements being flanked by homology arms that are designed to promote recombination of these one or more elements into the cleaved target site that is targeted by the guide RNA.
[0090] A recorder cassette can comprise a first homology arm, a PAM mutation, a barcode, a unique landing site, a regulatory sequence or mutation for a screenable or selectable marker, a second homology arm, and guide RNA. The first homology arm can be an upstream homology arm. The second homology arm can be a downstream homology arm. The homology arms can be homologous to sequences flanking a cleavage site that is targeted by the guide RNA.
[0091] A cassette can comprise two guide nucleic acids designed to target two distinct target nucleic acid sequences. In any case, the guide nucleic acid can comprise a single gRNA or chimeric gRNA consisting of a crRNA and trRNA sequences, or alternatively, the gRNA can comprise separated crRNA and trRNAs, or a guide nucleic acid can comprise a crRNA. In other examples, guide nucleic acid can be introduced simultaneously with a trackable polynucleic acid or plasmid comprising an editing cassette and/or recorder cassette. In these cases, the guide nucleic acid can be encoded on a separate plasmid or be delivered in RNA form via delivery methods well known in the art.
[0092] A cassette can comprise a gene encoding a nucleic acid-guided nuclease, such as a CRISPR nuclease, functional with the chosen guide nucleic acid. A nucleic acid-guided nuclease or CRISPR nuclease gene can be provided on a separate plasmid. A nucleic acid-guided nuclease or CRISPR nuclease can be provided on the genome or episomal plasmid of a host organism to which a trackable polynucleic acid or plasmid will be introduced. In any of these examples, the nucleic acid-guided nuclease or CRISPR nuclease gene can be operably linked to a constitutive or inducible promoter. Examples of suitable constitutive and inducible promoters are well known in the art. A nucleic acid-guided nuclease or CRISPR nuclease can be provided as mRNA or polypeptide using delivery systems well known in the art. Such mRNA or polypeptide delivery systems can include, but are not limited to, nanoparticles, viral vectors, or other cell-permeable technologies.
[0093] A cassette can comprise a selectable or screenable marker, for example, such as that comprised within a recorder cassette. For example, the recorder cassette can comprise a barcode, such as trackable nucleic acid sequence which can be uniquely correlated with a genetic mutation of the corresponding editing cassette, or otherwise identifiably correlated with such a genetic mutation such that sequencing the barcode will allow identification of the corresponding genetic mutation introduced by the editing cassette. In other examples, recorder cassette can comprise a complete copy of or a fragment of a gene encoding an antibiotic resistance gene, auxotrophic marker, fluorescent protein, or other known selectable or screenable markers.
Trackable Plasmid Libraries
[0094] A trackable library can comprise a plurality of cassettes as disclosed herein. A trackable library can comprise a plurality of trackable polynucleic acids or plasmids comprising a cassette as disclosed herein. A cassette, polynucleotide, or plasmid comprising a recorder sequence or recorder cassette as disclosed herein can be referred to as a trackable cassette, polynucleotide, or plasmid. A cassette, polynucleotide, or plasmid comprising an editing sequence or editing cassette as disclosed herein can be referred to as a trackable cassette, polynucleotide, or plasmid.
[0095] In some cases, within the trackable library are distinct editing cassette and recorder cassette combinations that are sequenced to determine which editing sequence corresponds with a given marker or barcode sequence comprised within the recorder cassette. Therefore, when the editing and recorder sequences are incorporated into a target sequence, you can determine the edit that was incorporated by sequencing the recorder sequence. Sequence the recorder sequence or barcode can significantly cut down on sequencing time and cost.
[0096] Library size can depend on the experiment design. For example, if the aim is to edit each amino acid within a protein of interest, then the library size can depend on the number (N) of amino acids in a protein of interest, with a full saturation library (all 20 amino acids at each position or non-naturally occurring amino acids) scaling as 19 (or more)×N and an alanine-mapping library scaling as 1×N. Thus, screening of even very large proteins of more than 1,000 amino acids can be tractable given current multiplex oligo synthesis capabilities (e.g. 120,000 oligos). In addition to or as an alternative to activity screens, more general properties with developed high-throughput screens and selections can be efficiently tested using the libraries disclosed herein. It should be readily understood that libraries can be designed to mutate any number of amino acids within a target protein, including 1, at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, etc. up to the total number of amino acids within a target protein. Additionally, select amino acids can be targeted, such as catalytically active amino acids, or those involved in protein-protein interactions. Each amino acid that is targeted for mutation can be mutated into any number of alternate amino acids, such as any other natural or non-naturally occurring amino acid or amino acid analog. In some examples, all targeted amino acids are mutated to the same amino acid, such as alanine. In other cases, the targeted amino acids are independently mutated to any other amino acid in any combination or permutation.
[0097] Trackable libraries can comprise trackable mutations in individual residues or sequences of interest. Trackable libraries can be generated using custom-synthesized oligonucleotide arrays. Trackable plasmids can be generated using any cloning or assembly methods known in the art. For example, CREATE-Recorder plasmids can be generated by chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, other in vitro oligo assembly techniques, traditional ligation-based cloning, or any combination thereof.
[0098] Recorder sequences, such as barcodes, can be designed in silico via standard code with a degenerate mutation at the target codon. The degenerate mutation can comprise 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, or more than 30 nucleic acid residues. In some examples, the degenerate mutations can comprise 15 nucleic acid residues (N15).
[0099] Homology arms can be added to a recorder sequence and/or editing sequence to allow incorporation of the recorder and/or editing sequence into the desired location via homologous recombination or homology-driven repair. Homology arms can be added by synthesis, in vitro assembly, PCR, or other known methods in the art. For example, homology arms can be assembled via overlapping oligo extension, Gibson assembly, or any other method disclosed herein. A homology arm can be added to both ends of a recorder and/or editing sequence, thereby flanking the sequence with two distinct homology arms, for example, a 5′ homology arm and a 3′ homology arm.
[0100] The same 5′ and 3′ homology arms can be added to a plurality of distinct recorder sequences, thereby generating a library of unique recorder sequences that each have the same spacer target or targeted insertion site. The same 5′ and 3′ homology arms can be added to a plurality of distinct editing sequences, thereby generating a library of unique editing sequences that each have the same spacer target or targeted insertion site. In alternative examples, different or a variety of 5′ or 3′ homology arms can be added to a plurality of recorder sequences or editing sequences.
[0101] A recorder sequence library comprising flanking homology arms can be cloned into a vector backbone. In some examples, the recorder sequence and homology arms are cloned into a recorder cassette. Recorder cassettes can, in some cases, further comprise a nucleic acid sequence encoding a guide nucleic acid or gRNA engineered to target the desired site of recorder sequence insertion. In many cases, the nucleic acid sequences flanking the CRISPR/Cas-mediated cleavage site are homologous or substantially homologous to the homology arms comprised within the recorder cassette.
[0102] An editing sequence library comprising flanking homology arms can be cloned into a vector backbone. In some examples, the editing sequence and homology arms are cloned into an editing cassette. Editing cassettes can, in some cases, further comprise a nucleic acid sequence encoding a guide nucleic acid or gRNA engineered to target the desired site of editing sequence insertion. In many cases, the nucleic acid sequences flanking the CRISPR/Cas-mediated cleavage site are homologous or substantially homologous to the homology arms comprised within the editing cassette.
[0103] Gene-wide or genome-wide editing libraries can be subcloned into a vector backbone. In some cases, the vector backbone comprises a recorder cassette as disclosed herein. The editing sequence library can be inserted or assembled into a second site to generate competent trackable plasmids that can embed the recording barcode at a fixed locus while integrating the editing libraries at a wide variety of user defined sites.
[0104] A recorder sequence and/or cassette can be assembled or inserted into a vector backbone first, followed by insertion of an editing sequence and/or cassette. In other cases, an editing sequence and/or cassette can be inserted or assembled into a vector backbone first, followed by insertion of a recorder sequence and/or cassette. In other cases, a recorder sequence and/or cassette and an editing sequence and/or cassette are simultaneous inserted or assembled into a vector. In other cases, a recorder sequence and/or cassette and an editing sequence and/or cassette are comprised on the same cassette prior to simultaneous insertion or assembly into a vector. In other cases, a recorder sequence and/or cassette and an editing sequence and/or cassette are linked prior to simultaneous insertion or assembly into a vector. In other cases, a recorder sequence and/or cassette and an editing sequence and/or cassette are covalently linked prior to simultaneous insertion or assembly into a vector. In any of these cases, trackable plasmids or plasmid libraries can be generated.
[0105] A cassette or nucleic acid molecule can be synthesized which comprises one or more elements disclosed herein. For example, a nucleic acid molecule can be synthesized that comprises an editing cassette and a guide nucleic acid. A nucleic acid molecule can be synthesized that comprises an editing cassette and a recorder cassette. A nucleic acid molecule can be synthesized that comprises an editing cassette, a guide nucleic acid, and a recorder cassette. A nucleic acid molecule can be synthesized that comprises an editing cassette, a recorder cassette, and two guide nucleic acids. A nucleic acid molecule can be synthesized that comprises a recorder cassette and a guide nucleic acid. A nucleic acid molecule can be synthesized that comprises a recorder cassette. A nucleic acid molecule can be synthesized that comprises an editing cassette. In any of these cases, the guide nucleic acid can optionally be operably linked to a promoter. In any of these cases, the nucleic acid molecule can further include one or more barcodes.
[0106] Synthesized cassettes or synthesized nucleic acid molecules can be synthesized using any oligonucleotide synthesis method known in the art. For example, cassettes can be synthesized by array based oligonucleotide synthesis. In such examples, following synthesis of the oligonucleotides, the oligonucleotides can be cleaved from the array. Cleavage of oligonucleotides from an array can create a pool of oligonucleotides.
[0107] Software and automation methods can be used for multiplex synthesis and generation. For example, software and automation can be used to create 10, 10.sup.2, 10.sup.3, 10.sup.4, 10.sup.5, 10.sup.6, or more cassettes, such as trackable cassettes. An automation method can generate trackable plasmids in rapid fashion. Trackable cassettes can be processed through a workflow with minimal steps to produce precisely defined genome-wide libraries.
[0108] Cassette libraries, such as trackable cassette libraries, can be generated which comprise two or more nucleic acid molecules or plasmids comprising any combination disclosed herein of recorder sequence, editing sequence, guide nucleic acid, and optional barcode, including combinations of one or more of any of the previously mentioned elements. For example, such a library can comprise at least 2, 3, 4, 5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 200, 300, 400, 500, 600, 700, 800, 1000, 1500, 2000, 2500, 3000, 3500, 4000, 4500, 5000, 5500, 6000, 6500, 7000, 7500, 8000, 8500, 9000, 9500, 10.sup.4, 10.sup.5, 10.sup.6, 10.sup.7, 10.sup.8, 10.sup.9, 10.sup.10, or more nucleic acid molecules or plasmids of the present disclosure. It should be understood that such a library can include any number of nucleic acid molecules or plasmids, even if the specific number is not explicit listed above.
[0109] Cassettes or cassette libraries can be sequenced in order to determine the recorder sequence and editing sequence pair that is comprised on each cassette. In other cases, a known recorder sequence is paired with a known editing sequence during the library generation process. Other methods of determining the association between a recorder sequence and editing sequence comprised on a common nucleic acid molecule or plasmid are envisioned such that the editing sequence can be identified by identification or sequencing of the recorder sequence.
[0110] Methods and compositions for tracking edited episomal libraries that are shuttled between E. coli and other organisms/cell lines are provided herein. The libraries can be comprised on plasmids, Bacterial artificial chromosomes (BACs), Yeast artificial chromosomes (YACs), synthetic chromosomes, or viral or phage genomes. These methods and compositions can be used to generate portable barcoded libraries in host organisms, such as E. coli. Library generation in such organisms can offer the advantage of established techniques for performing homologous recombination. Barcoded plasmid libraries can be deep-sequenced at one site to track mutational diversity targeted across the remaining portions of the plasmid allowing dramatic improvements in the depth of library coverage (e.g.
Trackable Engineering Methods
[0111] An example of trackable engineering workflow is depicted in
[0112] Through a multiplexed cloning approach, the recorder cassette can be covalently coupled to at least one editing cassette in a plasmid (e.g.,
[0113] Methods are provided herein for combining multiplex oligonucleotide synthesis with recombineering, to create libraries of specifically designed and trackable mutations. Screens and/or selections followed by high-throughput sequencing and/or barcode microarray methods can allow for rapid mapping of mutations leading to a phenotype of interest.
[0114] Methods and compositions disclosed herein can be used to simultaneously engineer and track engineering events in a target nucleic acid sequence.
[0115] Trackable plasmids can be generated using in vitro assembly or cloning techniques. For example, the CREATE-Recorder plasmids can be generated using chemical synthesis, Gibson assembly, SLIC, CPEC, PCA, ligation-free cloning, other in vitro oligo assembly techniques, traditional ligation-based cloning, or any combination thereof.
[0116] Trackable plasmids can comprise at least one recording sequence, such as a barcode, and at least one editing sequence. In most cases, the recording sequence is used to record and track engineering events. Each editing sequence can be used to incorporate a desired edit into a target nucleic acid sequence. The desired edit can include insertion, deletion, substitution, or alteration of the target nucleic acid sequence. In some examples, the one or more recording sequence and editing sequences are comprised on a single cassette comprised within the trackable plasmid such that they are incorporated into the target nucleic acid sequence by the same engineering event. In other examples, the recording and editing sequences are comprised on separate cassettes within the trackable plasmid such that they are each incorporated into the target nucleic acid by distinct engineering events. In some examples, the trackable plasmid comprises two or more editing sequences. For example, one editing sequence can be used to alter or silence a PAM sequence while a second editing sequence can be used to incorporate a mutation into a distinct sequence.
[0117] Recorder sequences can be inserted into a site separated from the editing sequence insertion site. The inserted recorder sequence can be separated from the editing sequence by 1 bp or any number of base pairs. For example, the separation distance can be about 1 bp, 10 bp, 50 bp, 100 bp, 500 bp, 1 kp, 2 kb, 5 kb, 10 kb, or greater. The separation distance can be any discrete integer of base pairs. It should be readily understood that there the limit of the number of base pairs separating the two insertion sites can be limited by the size of the genome, chromosome, or polynucleotide into which the insertions are being made. In some examples, the maximum distance of separation depends on the size of the target nucleic acid or genome.
[0118] Recorder sequences can be inserted adjacent to editing sequences, or within proximity to the editing sequence. For example, the recorder sequence can be inserted outside of the open reading frame within which the editing sequence is inserted. Recorder sequence can be inserted into an untranslated region adjacent to an open reading frame within which an editing sequence has been inserted. The recorder sequence can be inserted into a functionally neutral or non-functional site. The recorder sequence can be inserted into a screenable or selectable marker gene.
[0119] In some examples, the target nucleic acid sequence is comprised within a genome, artificial chromosome, synthetic chromosome, or episomal plasmid. In various examples, the target nucleic acid sequence can be in vitro or in vivo. When the target nucleic acid sequence is in vivo, the CREATE-Recorder plasmid can be introduced into the host organisms by transformation, transfection, conjugation, biolistics, nanoparticles, cell-permeable technologies, or other known methods for DNA delivery, or any combination thereof. In such examples, the host organism can be a eukaryote, prokaryote, bacterium, archaea, yeast, or other fungi.
[0120] The engineering event can comprise recombineering, non-homologous end joining, homologous recombination, or homology-driven repair. In some examples, the engineering event is performed in vitro or in vivo.
[0121] The methods described herein can be carried out in any type of cell in which a nucleic acid-guided nuclease system can function (e.g., target and cleave DNA), including prokaryotic and eukaryotic cells or in vitro. In some embodiments the cell is a bacterial cell, such as Escherichia spp. (e.g., E. coli). In other embodiments, the cell is a fungal cell, such as a yeast cell, e.g., Saccharomyces spp. In other embodiments, the cell is an algal cell, a plant cell, an insect cell, or a mammalian cell, including a human cell.
[0122] In some examples, a cell is a recombinant organism. For example, the cell can comprise a non-native nucleic acid-guided nuclease system. Additionally or alternatively, the cell can comprise recombination system machinery. Such recombination systems can include lambda red recombination system, Cre/Lox, attB/attP, or other integrase systems. Where appropriate, the trackable plasmid can have the complementary components or machinery required for the selected recombination system to work correctly and efficiently.
[0123] A method for genome editing can comprise: (a) introducing a vector that encodes at least one editing cassette and at least one guide nucleic acid into a first population of cells, thereby producing a second population of cells comprising the vector; (b) maintaining the second population of cells under conditions in which a nucleic acid-guided nuclease is expressed or maintained, wherein the nucleic acid-guided nuclease is encoded on the vector, a second vector, on the genome of cells of the second population of cells, or otherwise introduced into the cell, resulting in DNA cleavage and incorporation of the editing cassette; (c) obtaining viable cells. Such a method can optionally further comprise (d) sequencing the target DNA molecule in at least one cell of the second population of cells to identify the mutation of at least one codon.
[0124] A method for genome editing can comprise: (a) introducing a vector that encodes at least one editing cassette comprising a PAM mutation as disclosed herein and at least one guide nucleic acid into a first population of cells, thereby producing a second population of cells comprising the vector; (b) maintaining the second population of cells under conditions in which nucleic acid-guided nuclease is expressed or maintained, wherein the nucleic acid-guided nuclease is encoded on the vector, a second vector, on the genome of cells of the second population of cells, or otherwise introduced into the cell, resulting in DNA cleavage, incorporation of the editing cassette, and death of cells of the second population of cells that do not comprise the PAM mutation, whereas cells of the second population of cells that comprise the PAM mutation are viable; (c) obtaining viable cells. Such a method can optionally further comprise (d) sequencing the target DNA in at least one cell of the second population of cells to identify the mutation of at least one codon.
[0125] Method for trackable genome editing can comprise: (a) introducing a vector that encodes at least one editing cassette, at least one recorder cassette, and at least two gRNA into a first population of cells, thereby producing a second population of cells comprising the vector; (b) maintaining the second population of cells under conditions in which a nucleic acid-guided nuclease is expressed or maintained, wherein the nucleic acid-guided nuclease is encoded on the vector, a second vector, on the genome of cells of the second population of cells, or otherwise introduced into the cell, resulting in DNA cleavage and incorporation of the editing and recorder cassettes; (c) obtaining viable cells. Such a method can optionally further comprise (d) sequencing the recorder sequence of the target DNA molecule in at least one cell of the second population of cells to identify the mutation of at least one codon.
[0126] In some examples where the trackable plasmid comprises an editing cassette designed to silence a PAM site, a method for trackable genome editing can comprise: (a) introducing a vector that encodes at least one editing cassette, a recorder cassette, and at least two gRNA into a first population of cells, thereby producing a second population of cells comprising the vector; (b) maintaining the second population of cells under conditions in which a nucleic acid-guided nuclease is expressed or maintained, wherein the nucleic acid-guided nuclease is encoded on the vector, a second vector, on the genome of cells of the second population of cells, or otherwise introduced into the cell, resulting in DNA cleavage, incorporation of the editing cassette and recorder cassette, and death of cells of the second population of cells that do not comprise the PAM mutation, whereas cells of the second population of cells that comprise the PAM mutation are viable; and (c) obtaining viable cells. Such a method can optionally further comprise (d) sequencing the recorder sequence of the target DNA in at least one cell of the second population of cells to identify the mutation of at least one codon. Such methods can also further comprise a recorder cassette comprising a second PAM mutation, such that both PAMs must be silences by the editing cassette PAM mutation and recorder cassette PAM mutation in order to escape cell death.
[0127] In some examples transformation efficiency is determined by using a non-targeting guide nucleic acid control, which allows for validation of the recombineering procedure and CFU/ng calculations. In some cases, absolute efficient is obtained by counting the total number of colonies on each transformation plate, for example, by counting both red and white colonies from a galK control. In some examples, relative efficiency is calculated by the total number of successful transformants (for example, white colonies) out of all colonies from a control (for example, galK control).
[0128] The methods of the disclosure can provide, for example, greater than 1000× improvements in the efficiency, scale, cost of generating a combinatorial library, and/or precision of such library generation.
[0129] The methods of the disclosure can provide, for example, greater than: 10×, 50×, 100×, 200×, 300×, 400×, 500×, 600×, 700×, 800×, 900×, 1000×, 1100×, 1200×, 1300×, 1400×, 1500×, 1600×, 1700×, 1800×, 1900×, 2000×, or greater improvements in the efficiency of generating genomic or combinatorial libraries.
[0130] The methods of the disclosure can provide, for example, greater than: 10×, 50×, 100×, 200×, 300×, 400×, 500×, 600×, 700×, 800×, 900×, 1000×, 1100×, 1200×, 1300×, 1400×, 1500×, 1600×, 1700×, 1800×, 1900×, 2000×, or greater improvements in the scale of generating genomic or combinatorial libraries.
[0131] The methods of the disclosure can provide, for example, greater than: 10×, 50×, 100×, 200×, 300×, 400×, 500×, 600×, 700×, 800×, 900×, 1000×, 1100×, 1200×, 1300×, 1400×, 1500×, 1600×, 1700×, 1800×, 1900×, 2000×, or greater decrease in the cost of generating genomic or combinatorial libraries.
[0132] The methods of the disclosure can provide, for example, greater than: 10×, 50×, 100×, 200×, 300×, 400×, 500×, 600×, 700×, 800×, 900×, 1000×, 1100×, 1200×, 1300×, 1400×, 1500×, 1600×, 1700×, 1800×, 1900×, 2000×, or greater improvements in the precision of genomic or combinatorial library generation.
Recursive Tracking for Combinatorial Engineering
[0133] Disclosed herein are methods and compositions for iterative rounds of engineering. Disclosed herein are recursive engineering strategies that allow implementation of trackable engineering at the single cell level through several serial engineering cycles (e.g.,
[0134] Combinatorial engineering methods can comprise multiple rounds of engineering. Methods disclosed herein can comprise 2 or more rounds of engineering. For example, a method can comprise 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, or more than 30 rounds of engineering.
[0135] In some examples, during each round of engineering a new recorder sequence, such as a barcode, is incorporated at the same or nearby locus in a target site (e.g.,
[0136] Disclosed herein are methods for selecting for successive rounds of engineering. Selection can occur by a PAM mutation incorporated by an editing cassette. Selection can occur by a PAM mutation incorporated by a recorder cassette. Selection can occur using a screenable, selectable, or counter-selectable marker. Selection can occur by targeting a site for editing or recording that was incorporated by a prior round of engineering, thereby selecting for variants that successfully incorporated edits and recorder sequences from both rounds or all prior rounds of engineering.
[0137] Quantitation of these genotypes can be used for understanding combinatorial mutational effects on large populations and investigation of important biological phenomena such as epistasis.
[0138] Serial editing and combinatorial tracking can be implemented using recursive vector systems as disclosed herein. These recursive vector systems can be used to move rapidly through the transformation procedure (e.g.,
[0139] In some examples, the recursive vector system disclosed herein comprises 2, 3, 4, 5, 6, 7, 8, 9, 10, or more than 10 unique plasmids. In some examples, the recursive vector system can use a particular plasmid more than once as long as a distinct plasmid is used in the previous round and in the subsequent round.
[0140] Recursive methods and compositions disclosed herein can be used to restore function to a selectable or screenable element in a targeted genome or plasmid. The selectable or screenable element can include an antibiotic resistance gene, a fluorescent gene, a unique DNA sequence or watermark, or other known reporter, screenable, or selectable gene. In some examples, each successive round of engineering can incorporate a fragment of the selectable or screenable element, such that at the end of the engineering rounds, the entire selectable or screenable element has been incorporated into the target genome or plasmid. In such examples, only those genome or plasmids, which have successfully incorporated all of the fragments, and therefore all of the desired corresponding mutations, can be selected or screened for. In this way, the selected or screened cells will be enriched for those that have incorporated the edits from each and every iterative round of engineering.
[0141] Recursive methods can be used to switch a selectable or screenable marker between an on and an off position, or between an off and an on position, with each successive round of engineering. Using such a method allows conservation of available selectable or screenable markers by requiring, for example, the use of only one screenable or selectable marker. Furthermore, short regulatory sequence or start codon or non-start codons can be used to turn the screenable or selectable marker on and off. Such short sequences can easily fit within a cassette or polynucleotide, such as a synthesized cassette.
[0142] One or more rounds of engineering can be performed using the methods and compositions disclosed herein. In some examples, each round of engineering is used to incorporate an edit unique from that of previous rounds. Each round of engineering can incorporate a unique recording sequence. Each round of engineering can result in removal or curing of the CREATE plasmid used in the previous round of engineering. In some examples, successful incorporation of the recording sequence of each round of engineering results in a complete and functional screenable or selectable marker or unique sequence combination.
[0143] Unique recorder cassettes comprising recording sequences such as barcodes or screenable or selectable markers can be inserted with each round of engineering, thereby generating a recorder sequence that is indicative of the combination of edits or engineering steps performed. Successive recording sequences can be inserted adjacent to one another. Successive recording sequences can be inserted within proximity to one another. Successive sequences can be inserted at a distance from one another.
[0144] Successive sequences can be inserted at a distance from one another. For example, successive recorder sequences can be inserted and separated by 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, or greater than 100 bp. In some examples, successive recorder sequences are separated by about 10, 50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1100, 1200, 1300, 1400, 1500, or greater than 1500 bp.
[0145] Successive recorder sequences can be separated by any desired number of base pairs and can be dependent and limited on the number of successive recorder sequences to be inserted, the size of the target nucleic acid or target genomes, and/or the design of the desired final recorder sequence. For example, if the compiled recorder sequence is a functional screenable or selectable marker, than the successive recording sequences can be inserted within proximity and within the same reading frame from one another. If the compiled recorder sequence is a unique set of barcodes to be identified by sequencing and have no coding sequence element, then the successive recorder sequences can be inserted with any desired number of base pairs separating them. In these cases, the separation distance can be dependent on the sequencing technology to be used and the read length limit.
[0146] In some examples, a recorder cassette comprises a landing site to be used as a target site for the recorder cassette of the next round of engineering. By using such a method, successive rounds of recorder cassettes can only be introduced into the target site if the recorder cassette from the previous round was successfully incorporated, thereby providing the target site for the present engineering round (e.g.,
Guide Nucleic Acid
[0147] A guide nucleic acid can complex with a compatible nucleic acid-guided nuclease and can hybridize with a target sequence, thereby directing the nuclease to the target sequence. A subject nucleic acid-guided nuclease capable of complexing with a guide nucleic acid can be referred to as a nucleic acid-guided nuclease that is compatible with the guide nucleic acid. Likewise, a guide nucleic acid capable of complexing with a nucleic acid-guided nuclease can be referred to as a guide nucleic acid that is compatible with the nucleic acid-guided nucleases.
[0148] A guide nucleic acid can be DNA. A guide nucleic acid can be RNA. A guide nucleic acid can comprise both DNA and RNA. A guide nucleic acid can comprise modified of non-naturally occurring nucleotides. In cases where the guide nucleic acid comprises RNA, the RNA guide nucleic acid can be encoded by a DNA sequence on a polynucleotide molecule such as a plasmid, linear construct, or editing cassette as disclosed herein.
[0149] A guide nucleic acid can comprise a guide sequence. A guide sequence is a polynucleotide sequence having sufficient complementarity with a target polynucleotide sequence to hybridize with the target sequence and direct sequence-specific binding of a complexed nucleic acid-guided nuclease to the target sequence. The degree of complementarity between a guide sequence and its corresponding target sequence, when optimally aligned using a suitable alignment algorithm, is about or more than about 50%, 60%, 75%, 80%, 85%, 90%, 95%, 97.5%, 99%, or more. Optimal alignment may be determined with the use of any suitable algorithm for aligning sequences. In some embodiments, a guide sequence is about or more than about 5, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, 75, or more nucleotides in length. In some embodiments, a guide sequence is less than about 75, 50, 45, 40, 35, 30, 25, 20 nucleotides in length. Preferably the guide sequence is 10-30 nucleotides long. The guide sequence can be 15-20 nucleotides in length. The guide sequence can be 15 nucleotides in length. The guide sequence can be 16 nucleotides in length. The guide sequence can be 17 nucleotides in length. The guide sequence can be 18 nucleotides in length. The guide sequence can be 19 nucleotides in length. The guide sequence can be 20 nucleotides in length.
[0150] A guide nucleic acid can comprise a scaffold sequence. In general, a “scaffold sequence” includes any sequence that has sufficient sequence to promote formation of a targetable nuclease complex, wherein the targetable nuclease complex comprises a nucleic acid-guided nuclease and a guide nucleic acid comprising a scaffold sequence and a guide sequence. Sufficient sequence within the scaffold sequence to promote formation of a targetable nuclease complex may include a degree of complementarity along the length of two sequence regions within the scaffold sequence, such as one or two sequence regions involved in forming a secondary structure. In some cases, the one or two sequence regions are comprised or encoded on the same polynucleotide. In some cases, the one or two sequence regions are comprised or encoded on separate polynucleotides. Optimal alignment may be determined by any suitable alignment algorithm, and may further account for secondary structures, such as self-complementarity within either the one or two sequence regions. In some embodiments, the degree of complementarity between the one or two sequence regions along the length of the shorter of the two when optimally aligned is about or more than about 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 97.5%, 99%, or higher. In some embodiments, at least one of the two sequence regions is about or more than about 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 40, 50, or more nucleotides in length.
[0151] A scaffold sequence of a subject guide nucleic acid can comprise a secondary structure. A secondary structure can comprise a pseudoknot region. In some example, the compatibility of a guide nucleic acid and nucleic acid-guided nuclease is at least partially determined by sequence within or adjacent to a pseudoknot region of the guide RNA. In some cases, binding kinetics of a guide nucleic acid to a nucleic acid-guided nuclease is determined in part by secondary structures within the scaffold sequence. In some cases, binding kinetics of a guide nucleic acid to a nucleic acid-guided nuclease is determined in part by nucleic acid sequence with the scaffold sequence.
[0152] In aspects of the invention the terms “guide nucleic acid” refers to a polynucleotide comprising 1) a guide sequence capable of hybridizing to a target sequence and 2) a scaffold sequence capable of interacting with or complexing with an nucleic acid-guided nuclease as described herein.
[0153] A guide nucleic acid can be compatible with a nucleic acid-guided nuclease when the two elements can form a functional targetable nuclease complex capable of cleaving a target sequence. Often, a compatible scaffold sequence for a compatible guide nucleic acid can be found by scanning sequences adjacent to a native nucleic acid-guided nuclease loci. In other words, native nucleic acid-guided nucleases can be encoded on a genome within proximity to a corresponding compatible guide nucleic acid or scaffold sequence.
[0154] Nucleic acid-guided nucleases can be compatible with guide nucleic acids that are not found within the nucleases endogenous host. Such orthogonal guide nucleic acids can be determined by empirical testing. Orthogonal guide nucleic acids can come from different bacterial species or be synthetic or otherwise engineered to be non-naturally occurring.
[0155] Orthogonal guide nucleic acids that are compatible with a common nucleic acid-guided nuclease can comprise one or more common features. Common features can include sequence outside a pseudoknot region. Common features can include a pseudoknot region. Common features can include a primary sequence or secondary structure.
[0156] A guide nucleic acid can be engineered to target a desired target sequence by altering the guide sequence such that the guide sequence is complementary to the target sequence, thereby allowing hybridization between the guide sequence and the target sequence. A guide nucleic acid with an engineered guide sequence can be referred to as an engineered guide nucleic acid. Engineered guide nucleic acids are often non-naturally occurring and are not found in nature.
More Methods
[0157] Disclosed herein are methods for genome engineering that employ a nuclease, such as a nucleic acid-guided nuclease to perform directed genome evolution/produce changes (deletions, substitutions, additions) in a target sequence, such as DNA or RNA, for example, genomic DNA or episomal DNA. Suitable nucleases can include, for example, RNA-guided nucleases such as Cas9, Cpf1, MAD2, or MAD7, DNA-guided nucleases such as Argonaute, or other nucleases such as zinc-finger nucleases, TALENs, or meganucleases. Nuclease genes can be obtained from any source, such as from a bacterium, archaea, prokaryote, eukaryote, or virus. For example, a Cas9 gene can be obtained from a bacterium harboring the corresponding Type II CRISPR system, such as the bacterium S. pyogenes (SEQ ID NO: 110). The nucleic acid sequence and/or amino acid sequence of the nuclease may be mutated, relative to the sequence of a naturally occurring nuclease. A mutation can be, for example, one or more insertions, deletions, substitutions or any combination of two or three of the foregoing. In some cases, the resulting mutated nuclease can have enhanced or reduced nuclease activity relative to the naturally occurring nuclease. In some cases, the resulting mutated nuclease can have no nuclease activity relative to the naturally occurring nuclease.
[0158] Methods for nucleic acid-guided nuclease-mediated genome editing are provided herein. Some disclosed methods can include a two-stage construction process which relies on generation of cassette libraries that incorporate directed mutations from an editing cassettes directly into a genome, episomal nucleic acid molecule, or isolated nucleic acid molecule. In some examples, during the first stage of cassette library construction, rationally designed editing cassettes can be cotransformed into cells with a guide nucleic acid (e.g., guide RNA) that hybridizes to or targets a target DNA sequence. In some examples, the guide nucleic acid is introduced as an RNA molecule, or encoded on a DNA molecule.
[0159] Editing cassettes can be designed such that they couple deletion or mutation of a PAM site with the mutation of one or more desired codons or nucleic acid residues in the adjacent nucleic acid sequence. The deleted or mutated PAM site, in some cases, can no longer be recognized by the chosen nucleic acid-guided nuclease. In some examples, at least one PAM or more than one PAM can be deleted or mutated, such as two, three, four, or more PAMs.
[0160] Methods disclosed herein can enable generation of an entire cassette library in a single transformation. The cassette library can be retrieved, in some cases, by amplification of the recombinant chromosomes, e.g. by a PCR reaction, using a synthetic feature or priming site from the editing cassettes. In some examples, a second PAM deletion or mutation is simultaneously incorporated. This approach can covalently couple the codon-targeted mutations directly to a PAM deletion.
[0161] In some examples, there is a second stage to construction of cassette libraries. During the second stage the PCR amplified cassette libraries carrying the destination PAM deletion/mutation and the targeted mutations, such as a desired mutation of one or more nucleotides, such as one or more nucleotides in one or more codons, can be co-transformed into naive cells. The cells can be eukaryotic cell, archaeal cell, or prokaryotic cells. The cassette libraries can be co-transformed with a guide nucleic acid or plasmid encoding the same to generate a population of cells that express a rationally designed protein library. The libraries can be co-transformed with a guide nucleic acid such as a gRNA, chimeric gRNA, split gRNA, or a crRNA and trRNA set. The cassette library can comprise a plurality of cassettes wherein each cassette comprises an editing cassette and guide nucleic acid. The cassette library can comprise a plurality of cassettes wherein each cassette comprises an editing cassette, recorder cassettes and two guide nucleic acids.
[0162] In some targetable nuclease systems, the guide nucleic acid can guide selection of a target sequence. As used herein, a target sequence refers to any locus in vitro or in in vivo, or in the nucleic acid of a cell or population of cells in which a mutation of at least one nucleotide, such as a mutation of at least one nucleotide in at least one codon, is desired. The target sequence can be, for example, a genomic locus, target genomic sequence, or extrachromosomal locus. The guide nucleic acid can be expressed as a DNA molecule, referred to as a guide DNA, or as a RNA molecule, referred to as a guide RNA. A guide nucleic acid can comprise a guide sequence, that is complementary to a region of the target region. A guide nucleic acid can comprise a scaffold sequence that can interact with a compatible nucleic acid-guided nuclease, and can optionally form a secondary structure. A guide nucleic acid can functions to recruit a nucleic acid-guided nuclease to the target site. A guide sequence can be complementary to a region upstream of the target site. A guide sequence can be complementary to at least a portion of the target site. A guide sequence can be completely complementary (100% complementary) to the target site or include one or more mismatches, provided that it is sufficiently complementary to the target site to specifically hybridize/guide and recruit the nuclease. Suitable nucleic acid guided nuclease include, as non-limiting examples, CRISPR nucleases, Cas nucleases, such as Cas9 or Cpf1, MAD2, and MAD7.
[0163] In some CRISPR systems, the CRISPR RNA (crRNA or spacer-containing RNA) and trans-activating CRISPR RNA (tracrRNA or trRNA) can guide selection of a target sequence. As used herein, a target sequence refers to any locus in vitro or in in vivo, or in the nucleic acid of a cell or population of cells in which a mutation of at least one nucleotide, such as a mutation of at least one nucleotide in at least one codon, is desired. The target sequence can be, for example, a genomic locus, target genomic sequence, or extrachromosomal locus. The tracrRNA and crRNA can be expressed as a single, chimeric RNA molecule, referred to as a single-guide RNA, guide RNA, or gRNA. The nucleic acid sequence of the gRNA comprises a first nucleic acid sequence, also referred to as a first region, that is complementary to a region of the target region and a second nucleic acid sequence, also referred to a second region, that forms a stem loop structure and functions to recruit a CRISPR nuclease to the target region. The first region of the gRNA can be complementary to a region upstream of the target genomic sequence. The first region of the gRNA can be complementary to at least a portion of the target region. The first region of the gRNA can be completely complementary (100% complementary) to the target genomic sequence or include one or more mismatches, provided that it is sufficiently complementary to the target genomic sequence to specifically hybridize/guide and recruit a CRISPR nuclease, such as Cas9 or Cpf1.
[0164] A guide sequence or first region of the gRNA can be at least 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or at least 30 nucleotides in length. The guide sequence or first region of the gRNA can be at least 20 nucleotides in length.
[0165] A stem loop structure that can be formed by the scaffold sequence or second nucleic acid sequence of a gRNA can be at least 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 7, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100 nucleotides in length. A stem loop structure can be from 80 to 90 or 82 to 85 nucleotides in length. A scaffold sequence or second region of the gRNA that forms a stem loop structure can be 83 nucleotides in length.
[0166] A guide nucleic acid of a cassette that is introduced into a first cell using the methods disclosed herein can be the same as the guide nucleic acid of a second cassette that is introduced into a second cell. More than one guide nucleic acid can be introduced into the population of first cells and/or the population of second cells. The more than one guide nucleic acids can comprise guide sequences that are complementary to more than one target region.
[0167] Methods disclosed herein can comprise using oligonucleotides. Such oligonucleotides can be obtained or derived from many sources. For example, an oligonucleotide can be derived from a nucleic acid library that has been diversified by nonhomologous random recombination (NRR); such a library is referred to as an NRR library. An oligonucleotide can be synthesized, for example by array-based synthesis or other known chemical synthesis method. The length of an oligonucleotide can be dependent on the method used in obtaining the oligonucleotide. An oligonucleotide can be approximately 50-200 nucleotides, 75-150 nucleotides, or between 80-120 nucleotides in length. An oligonucleotide can be about 10, 20, 30, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, or more nucleotides in length, including any integer, for example, 51, 52, 53, 54, 201, 202, etc. An oligonucleotide can be about 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1250, 1500, 1750, 2000, or more nucleotides in length, including any integer, for example, 101, 203, 1001, 2001, 2010, etc.
[0168] Oligonucleotides and/or other nucleic acid molecules can be combined or assembled to generate a cassette. Such a cassette can comprise (a) a region that is homologous to a target region of the nucleic acid of the cell and includes a desired mutation of at least one nucleotide or one codon relative to the target region, and (b) a protospacer adjacent motif (PAM) mutation. The PAM mutation can be any insertion, deletion or substitution of one or more nucleotides that mutates the sequence of the PAM such that it is no longer recognized by a nucleic acid-guided nuclease system or CRISPR nuclease system. A cell that comprises such a PAM mutation may be said to be “immune” to nuclease-mediated killing. The desired mutation relative to the sequence of the target region can be an insertion, deletion, and/or substitution of one or more nucleotides. In some examples, the insertion, deletion, and/or substitution of one or more nucleotides is in at least one codon of the target region. Alternatively, the cassette can be synthesized in a single synthesis, comprising (a) a region that is homologous to a target region of the nucleic acid of the cell and includes a desired mutation of at least one nucleotide or one codon relative to the target region, (b) a protospacer adjacent motif (PAM) mutation, and optionally (c) a region that is homologous to a second target region of the nucleic acid of the cell and includes a recorder sequence.
[0169] The methods disclosed herein can be applied to any target nucleic acid molecule of interest, from any prokaryote including bacteria and archaea, or any eukaryote, including yeast, mammalian, and human genes, or any viral particle. The nucleic acid module can be a non-coding nucleic acid sequence, gene, genome, chromosome, plasmid, episomal nucleic acid molecule, artificial chromosome, synthetic chromosome, or viral nucleic acid.
[0170] Methods for assessing recovery efficiency of donor strain libraries are disclosed herein. Recovery efficiency can be verified based on the presence of a PCR product or on changes in amplicon or PCR product sizes or sequence obtained with primers directed at the selected target locus. Primers can be designed to hybridize with endogenous sequences or heterologous sequences contained on the donor nucleic acid molecule. For example, the PCR primer can be designed to hybridize to a heterologous sequence such that PCR will only be possible if the donor nucleic acid is incorporated. Sequencing of PCR products from the recovered libraries indicates the heterologous sequence or synthetic priming site from the dsDNA cassettes or donor sequences can be incorporated with about 90-100% efficiency. In other examples, the efficiency can be about 5%, 10% 20%, 30%, 40%, 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 99% or 100%.
[0171] In some cases, the ability to improve final editing efficiencies of the methods disclosed herein can be assessed by carrying out cassette construction in gene deficient strains before transferring to a wild-type donor strain in an effort to prevent loss of mutations during the donor construction phase. Additionally or alternatively, efficiency of the disclosed methods can be assessed by targeting an essential gene. Essential genes can include any gene required for survival or replication of a viral particle, cell, or organism. In some examples, essential genes include dxs, metA, and folA. Essential genes have been effectively targeted using guide nucleic acid design strategies described. Other suitable essential genes are well known in the art.
[0172] Provided herein are method of increasing editing efficiencies by modulating the level of a nucleic acid-guided nuclease. This could be done by using copy control plasmids, such as high copy number plasmids or low copy number plasmids. Low copy number plasmids could be plasmids that can have about 20 or less copies per cell, as opposed to high copy number plasmids that can have about 1000 copies per cell. High copy number plasmids and low copy number plasmids are well known in the art and it is understood that an exact plasmid copy per cell does not need to be known in order to characterize a plasmid as either high or low copy number.
[0173] In some cases, the decreasing expression level of a nucleic acid-guided nuclease, such as Cas9, Cpf1, MAD2, or MAD7, can increase transformation, editing, and/or recording efficiencies. In some cases, decreasing expression level of the nucleic acid-guided nuclease is done by expressing the nucleic acid-guided nuclease on a low copy number plasmid.
[0174] In some cases, the increasing expression level of a nucleic acid-guided nuclease, such as Cas9, Cpf1, MAD2, or MAD7, can increase transformation, editing, and/or recording efficiencies. In some cases, increasing expression level of the nucleic acid-guided nuclease is done by expressing the nucleic acid-guided nuclease on a high copy number plasmid.
[0175] Other methods of modulating the expression level of a protein are also envisioned and are known in the art. Such methods include using a inducible or constitutive promoter, incorporating enhancers or other expression regulatory elements onto an expression plasmid, using RNAi, amiRNAi, or other RNA silencing techniques to modulate transcript level, fusing the protein of interest to a degradation domain, or any other method known in the art.
[0176] Provided herein are methods for generating mutant libraries. In some examples, the mutant library can be effectively constructed and retrieved within 1-3 hours post recombineering. In some examples, the mutant library is constructed within 0.5, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, or 24 hours post recombineering. In some examples, the mutant library can be retrieved within 0.5, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 24, 36, or 48 hours post recombineering and/or post-constructing by recombineering.
[0177] Some methods disclosed herein can be used for trackable, precision genome editing. In some examples, methods disclosed herein can achieve high efficiency editing/mutating using a single cassette that encodes both an editing cassette and guide nucleic acid, and optionally a recorder cassette and second guide nucleic acid. Alternatively, a single vector can encode an editing cassette while a guide nucleic acid is provided sequentially or concomitantly. When used with parallel DNA synthesis, such as array-based DNA synthesis, methods disclosed herein can provide single step generation of hundreds or thousands of precision edits/mutations. Mutations can be mapped by sequencing the editing cassette on the vector, rather than by sequencing of the genome or a section of the genome of the cell or organism.
[0178] The methods disclosed herein can have broad utility in protein and genome engineering applications, as well as for reconstruction of mutations, such as mutations identified in laboratory evolution experiments. In some examples, the methods and compositions disclosed here can combine an editing cassette, which could include a desired mutation and a PAM mutation, with a gene encoding a guide nucleic acid on a single vector.
[0179] In some examples, a trackable mutant library can be generated in a single transformation or single reaction.
[0180] Methods disclosed herein can comprise introducing a cassette comprising an editing cassette that includes the desired mutation and the PAM mutation into a cell or population of cells. In some embodiments, the cell into which the cassette or vector is introduced also comprises a nucleic acid-guided nuclease, such as Cas9, Cpf1, MAD2, or MAD7. In some embodiments, a gene or mRNA encoding the nucleic acid-guided nuclease is concomitantly, sequentially, or subsequently introduced into the cell or population of cells. Expression of a targetable nuclease system, including nucleic acid-guided nuclease and a guide nucleic acid, in the cell or cell population can be activated such that the guide nucleic acid recruits the nucleic acid-guided nuclease to the target region, where dsDNA cleavage occurs.
[0181] In some examples, without wishing to be bound by any particular theory, the homologous region of an editing cassette complementary to the target sequence mutates the PAM and the one or more codon of the target sequence. Cells of the population of cells that did not integrate the PAM mutation can undergo unedited cell death due to nucleic acid-guided nuclease mediated dsDNA cleavage. In some examples, cells of the population of cells that integrate the PAM mutation do not undergo cell death; they remain viable and are selectively enriched to high abundance. Viable cells can be obtained and can provide a library of trackable or targeted mutations.
[0182] In some examples, without wishing to be bound by any particular theory, the homologous region of a recorder cassette complementary to the target sequence mutates the PAM and introduces a barcode into a target sequence. Cells of the population of cells that did not integrate the PAM mutation can undergo unedited cell death due to nucleic acid-guided nuclease mediated dsDNA cleavage. In some examples, cells of the population of cells that integrate the PAM mutation do not undergo cell death; they remain viable and are selectively enriched to high abundance. Viable cells can be obtained and can provide a library of trackable mutations.
[0183] A separate vector or mRNA encoding a nucleic acid-guided nuclease can be introduced into the cell or population of cells. Introducing a vector or mRNA into a cell or population of cells can be performed using any method or technique known in the art. For example, vectors can be introduced by standard protocols, such as transformation including chemical transformation and electroporation, transduction and particle bombardment. Additionally or alternatively, mRNA can be introduced by standard protocols, such as transformation as disclosed herein, and/or by techniques involving cell permeable peptides or nanoparticles.
[0184] An editing cassette can include (a) a region, which recognizes (hybridizes to) a target region of a nucleic acid in a cell or population of cells, is homologous to the target region of the nucleic acid of the cell and includes a mutation, referred to a desired mutation, of at least one nucleotide that can be in at least one codon relative to the target region, and (b) a protospacer adjacent motif (PAM) mutation. In some examples, the editing cassette also comprises a barcode. The barcode can be a unique barcode or relatively unique such that the corresponding mutation can be identified based on the barcode. The PAM mutation may be any insertion, deletion or substitution of one or more nucleotides that mutates the sequence of the PAM such that the mutated PAM (PAM mutation) is not recognized by a chosen nucleic acid-guided nuclease system. A cell that comprises such as a PAM mutation may be said to be “immune” to nucleic acid-guided nuclease-mediated killing. The desired mutation relative to the sequence of the target region may be an insertion, deletion, and/or substitution of one or more nucleotides and may be at least one codon of the target region. In some embodiments, the distance between the PAM mutation and the desired mutation is at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 40, 50, 60, 70, 80, 90, or 100 nucleotides on the editing cassette. In some embodiments, the PAM mutation is located at least 9 nucleotides from the end of the editing cassette. In some embodiments, the desired mutation is located at least 9 nucleotides from the end of the editing cassette.
[0185] A desired mutation can be an insertion of a nucleic acid sequence relative to the sequence of the target sequence. The nucleic acid sequence inserted into the target sequence can be of any length. In some embodiments, the nucleic acid sequence inserted is at least 1, 2, 3, 4, 5, 10, 20, 30, 40, 50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800, 1900, or at least 2000 nucleotides in length. In embodiments in which a nucleic acid sequence is inserted into the target sequence, the editing cassette comprises a region that is at least 10, 15, 20, 25, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50 51, 52, 53, 54, 55, 56, 57, 58, 59, or at least 60 nucleotides in length and homologous to the target sequence. The homology arms or homologous region can be about 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, or more nucleotides in length, including any integer therein. The homology arms or homologous region can be over 200 nucleotides in length.
[0186] A barcode can be a unique barcode or relatively unique such that the corresponding mutation can be identified based on the barcode. In some examples, the barcode is a non-naturally occurring sequence that is not found in nature. In most examples, the combination of the desired mutation and the barcode within the editing cassette is non-naturally occurring and not found in nature. A barcode can be any number of nucleotides in length. A barcode can be 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, or more than 30 nucleotides in length. In some cases, the barcode is more than 30 nucleotides in length.
[0187] An editing cassette or recorder cassette can comprise at least a portion of a gene encoding a guide nucleic acid, and optionally a promoter operable linked to the encoded guide nucleic acid. In some embodiments, the portion of the gene encoding the guide nucleic acid encodes the portion of the guide nucleic acid that is complementary to the target sequence. The portion of the guide nucleic acid that is complementary to the target sequence, or the guide sequence, can be at least 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or at least 30 nucleotides in length. In some embodiments, the guide sequence is 24 nucleotides in length. In some embodiments, the guide sequence is 18 nucleotides in length.
[0188] In some embodiments, the editing cassette or recorder cassette further comprises at least two priming sites. The priming sites may be used to amplify the cassette, for example by PCR. In some embodiments, the portion of the guide sequence is used as a priming site.
[0189] Editing cassettes or recorder cassettes for use in the described methods can be obtained or derived from many sources. For example, the cassettes can be synthesized, for example by array-based synthesis, multiplex synthesis, multi-parallel synthesis, PCR assembly, in vitro assembly, Gibson assembly, or any other synthesis method known in the art. In some embodiments, the editing cassette or recorder cassette is synthesized, for example by array-based synthesis, multiplex synthesis, multi-parallel synthesis, PCR assembly, in vitro assembly, Gibson assembly, or any other synthesis method known in the art. The length of the editing cassette or recorder cassette may be dependent on the method used in obtaining said cassette.
[0190] An editing cassette can be approximately 50-300 nucleotides, 75-200 nucleotides, or between 80-120 nucleotides in length. In some embodiments, the editing cassette can be any discrete length between 50 nucleotide and 1 Mb.
[0191] A recorder cassette can be approximately 50-300 nucleotides, 75-200 nucleotides, or between 80-120 nucleotides in length. In some embodiments, the recorder cassette can be any discrete length between 50 nucleotide and 1 Mb.
[0192] Methods disclosed herein can also involve obtaining editing cassettes and recorder cassettes and constructing a trackable plasmid or vector. Methods of constructing a vector will be known to one ordinary skill in the art and may involve ligating the cassettes into a vector backbone. In some examples, plasmid construction occurs by in vitro DNA assembly methods, oligonucleotide assembly, PCR-based assembly, SLIC, CPEC, or other assembly methods well known in the art. In some embodiments, the cassettes or a subset (pool) of the cassettes can be amplified prior to construction of the vector, for example by PCR.
[0193] The cell or population of cells comprising a polynucleotide encoding a nucleic acid-guided nuclease can be maintained or cultured under conditions in which the nuclease is expressed. Nucleic acid-guided nuclease expression can be controlled or can be constitutively on. The methods described herein can involve maintaining cells under conditions in which nuclease expression is activated, resulting in production of the nuclease, for example, Cas9, Cpf1, MAD2, or MAD7. Specific conditions under which the nucleic acid-guided nuclease is expressed can depend on factors, such as the nature of the promoter used to regulate expression of the nuclease. Nucleic acid-guided nuclease expression can be induced in the presence of an inducer molecule, such as arabinose. When the cell or population of cells comprising nucleic acid-guided nuclease encoding DNA are in the presence of the inducer molecule, expression of the nuclease can occur. CRISPR-nuclease expression can be repressed in the presence of a repressor molecule. When the cell or population of cells comprising nucleic acid-guided nuclease encoding DNA are in the absence of a molecule that represses expression of the nuclease, expression of the nuclease can occur.
[0194] Cells or the population of cells that remain viable can be obtained or separated from the cells that undergo unedited cell death as a result of nucleic acid-guided nuclease-mediated killing; this can be done, for example, by spreading the population of cells on culture surface, allowing growth of the viable cells, which are then available for assessment.
[0195] Disclosed herein are methods for the identification of the mutation without the need to sequence the genome or large portions of the genome of the cell. The methods can involve sequencing of the editing cassette, recorder cassette, or barcode to identify the mutation of one of more codon. Sequencing of the editing cassette can be performed as a component of the vector or after its separation from the vector and, optionally, amplification. Sequencing can be performed using any sequencing method known in the art, such as by Sanger sequencing or next-generation sequencing methods.
[0196] Some methods described herein can be carried out in any type of cell in which a targetable nuclease system can function, or target and cleave DNA, including prokaryotic and eukaryotic cells. In some embodiments, the cell is a bacterial cell, such as Escherichia spp., e.g., E. coli. In other embodiments, the cell is a fungal cell, such as a yeast cell, e.g., Saccharomyces spp. In other embodiments, the cell is an algal cell, a plant cell, an insect cell, or a mammalian cell, including a human cell.
[0197] A “vector” is any of a variety of nucleic acids that comprise a desired sequence or sequences to be delivered to or expressed in a cell. A desired sequence can be included in a vector, such as by restriction and ligation or by recombination or assembly methods know in the art. Vectors are typically composed of DNA, although RNA vectors are also available. Vectors include, but are not limited to plasmids, fosmids, phagemids, virus genomes, artificial chromosomes, and synthetic nucleic acid molecules.
[0198] Vectors useful in the methods disclosed herein can comprise at least one editing cassette as described herein, at least one gene encoding a gRNA, and optionally a promoter and/or a barcode. More than one editing cassette can be included on the vector, for example 2, 3, 4, 5, 6, 7, 8, 9, 10 or more editing cassettes. The more than one editing cassettes can be designed to target different target regions, for example, there could be different editing cassettes, each of which contains at least one region homologous with a different target region. In other examples, each editing cassette target the same target region while each editing cassette comprises a different desired mutation relative to the target region. In other examples, the plurality of editing cassettes can comprise a combination of editing cassettes targeting the same target region and editing cassettes targeting different target regions. Each editing cassette can comprise an identifying barcode. Alternatively or additionally, the vector can include one or more genes encoding more than one gRNA, e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10 or more gRNAs. The more than one gRNAs can contain regions that are complementary to a portion of different target regions, for example, if there are different gRNAs, each of which can be complementary to a portion of a different target region. In other examples, the more than one gRNA can each target the same target region. In other examples, the more than one gRNA can be a combination of gRNAs targeting the same and different target regions.
[0199] A cassette comprising a gene encoding a portion of a guide nucleic acid, can be ligated or assembled into a vector that encodes another portion of a guide nucleic acid. Upon ligation or assembly, the portion of the guide nucleic acid from the cassette and the other portion of the guide nucleic acid can form a functional guide nucleic acid. A promoter and a gene encoding a guide nucleic acid can be operably linked.
[0200] In some embodiments, the methods involve introduction of a second vector encoding a nucleic acid-guided nuclease, such as Cas9, Cpf1, MAD2, or MAD7. The vector may further comprise one or more promoters operably linked to a gene encoding the nucleic acid-guided nuclease.
[0201] As used herein, “operably” linked can mean the promoter affects or regulates transcription of the DNA encoding a gene, such as the gene encoding the gRNA or the gene encoding a CRISPR nuclease.
[0202] A promoter can be a native promoter such as a promoter present in the cell into which the vector is introduced. A promoter can be an inducible or repressible promoter, for example, the promoter can be regulated allowing for inducible or repressible transcription of a gene, such as the gene encoding the guide nucleic acid or the gene encoding a nucleic acid-guided nuclease. Such promoters that are regulated by the presence or absence of a molecule can be referred to as an inducer or a repressor, respectively. The nature of the promoter needed for expression of the guide nucleic acid or nucleic acid-guided nuclease can vary based on the species or cell type and can be recognized by one of ordinary skill in the art.
[0203] A separate vector encoding a nucleic acid-guided nuclease can be introduced into a cell or population of cells before or at the same time as introduction of a trackable plasmid as disclosed herein. The gene encoding a nucleic acid-guided nuclease can be integrated into the genome of the cell or population of cells, or the gene can be maintained episomally. The nucleic acid-guided nuclease-encoding DNA can be integrated into the cellular genome before introduction of the trackable plasmid, or after introduction of the trackable plasmid. In some examples, a nucleic acid molecule, such as DNA-encoding a nucleic acid-guided nuclease, can be expressed from DNA integrated into the genome. In some embodiments, a gene encoding Cas9, Cpf1, MAD2, or MAD7 is integrated into the genome of the cell.
[0204] Vectors or cassettes useful in the methods described herein can further comprise two or more priming sites. In some embodiments, the presence of flanking priming sites allows amplification of the vector or cassette.
[0205] In some embodiments, a cassette or vector encodes a nucleic acid-guided nuclease comprising one or more nuclear localization sequences (NLSs), such as about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more NLSs. In some embodiments, the engineered nuclease comprises about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more NLSs at or near the amino-terminus, about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more NLSs at or near the carboxy-terminus, or a combination of these (e.g. one or more NLS at the amino-terminus and one or more NLS at the carboxy terminus). When more than one NLS is present, each may be selected independently of the others, such that a single NLS may be present in more than one copy and/or in combination with one or more other NLSs present in one or more copies. In a preferred embodiment of the invention, the engineered nuclease comprises at most 6 NLSs. In some embodiments, an NLS is considered near the N- or C-terminus when the nearest amino acid of the NLS is within about 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50, or more amino acids along the polypeptide chain from the N- or C-terminus. Non-limiting examples of NLSs include an NLS sequence derived from: the NLS of the SV40 virus large T-antigen, having the amino acid sequence PKKKRKV (SEQ ID NO: 111); the NLS from nucleoplasmin (e.g. the nucleoplasmin bipartite NLS with the sequence KRPAATKKAGQAKKKK (SEQ ID NO:112)); the c-myc NLS having the amino acid sequence PAAKRVKLD (SEQ ID NO:113) or RQRRNELKRSP (SEQ ID NO:114); the hRNPA1 M9 NLS having the sequence NQSSNFGPMKGGNFGGRSSGPYGGGGQYFAKPRNQGGY (SEQ ID NO: 115); the sequence RMRIZFKNKGKDTAELRRRRVEVSVELRKAKKDEQILKRRNV (SEQ ID NO:1 116) of the IBB domain from importin-alpha; the sequences VSRKRPRP (SEQ ID NO:117) and PPKKARED (SEQ ID NO:115) of the myoma T protein; the sequence PQPKKKPL (SEQ ID NO: 119) of human p53; the sequence SALIKKKKKMAP (SEQ ID NO: 120) of mouse c-abl IV; the sequences DRLRR (SEQ ID NO:121) and PKQKKRK (SEQ ID NO:122) of the influenza virus NS1; the sequence RKLKKKIKKL (SEQ ID NO:123) of the Hepatitis virus delta antigen; the sequence REKKKFLKRR (SEQ ID NO: 124) of the mouse Mx1 protein; the sequence KRKGDEVDGVDEVAKKKSKK (SEQ ID NO: 125) of the human poly(ADP-ribose) polymerase; and the sequence RKCLQAGMNLEARKTKK (SEQ ID NO: 126) of the steroid hormone receptors (human) glucocorticoid.
[0206] In general, the one or more NLSs are of sufficient strength to drive accumulation of the nucleic acid-guided nuclease in a detectable amount in the nucleus of a eukaryotic cell. In general, strength of nuclear localization activity may derive from the number of NLSs, the particular NLS(s) used, or a combination of these factors. Detection of accumulation in the nucleus may be performed by any suitable technique. For example, a detectable marker may be fused to the nucleic acid-guided nuclease, such that location within a cell may be visualized, such as in combination with a means for detecting the location of the nucleus (e.g. a stain specific for the nucleus such as DAPI). Cell nuclei may also be isolated from cells, the contents of which may then be analyzed by any suitable process for detecting protein, such as immunohistochemistry, Western blot, or enzyme activity assay. Accumulation in the nucleus may also be determined indirectly, such as by an assay for the effect of the nucleic acid-guided nuclease complex formation (e.g. assay for DNA cleavage or mutation at the target sequence, or assay for altered gene expression activity affected by targetable nuclease complex formation and/or nucleic acid-guided nuclease activity), as compared to a control not exposed to the nucleic acid-guided nuclease or targetable nuclease complex, or exposed to a nucleic acid-guided nuclease lacking the one or more NLSs.
ProSAR
[0207] Methods disclosed herein are capable of engineering a few to hundreds of genetic sequence or proteins simultaneously. These methods can permit one to map in a single experiment many or all possible residue changes over a collection of desired proteins onto a trait of interest, as part of individual proteins of interest or as part of a pathway. This approach can be used at least for the following by mapping i) any number of residue changes for any number of proteins of interest in a specific biochemical pathway or that catalyze similar reactions or ii) any number of residues in the regulatory sites of any number of proteins or interest with a specific regulon or iii) any number of residues of a biological agent used to treat a health condition.
[0208] In some embodiments, methods described herein include identifying genetic variations of one or more target genes that affect any number or residues, such as one or more, or all residues of one or more target proteins. In accordance with these embodiments, compositions and methods disclosed herein permit parallel analysis of two or more target proteins or proteins that contribute to a trait. Parallel analysis of multiple proteins by a single experiment described can facilitate identification, modification and design of superior systems for example for producing a eukaryotic or prokaryotic byproduct, producing a eukaryotic byproduct, for example, a biological agent such as a growth factor or antibody, in a prokaryotic organism and the like. Relevant biologics used in analysis and treatment of disease can be produced in these genetically engineered environments that could reduce production time, increase quality all while reducing costs to the manufacturers and the consumers.
[0209] Some embodiments disclosed herein comprise constructs of use for studying genetic variations of a gene or gene segment wherein the gene or gene segment is capable of generating a protein. A construct can be generated for any number of residues, such as one, two, more than two, or all residue modifications of a target protein that is linked to a trackable agent such as a barcode. A barcode indicative of a genetic variation of a gene of a target protein can be located outside of the open reading frame of the gene. In some embodiments such a barcode can be located many hundreds or thousands of bases away from the gene. It is contemplated herein that these methods can be performed in vivo. In some examples, such a construct comprises a trackable polynucleic acid or plasmid as disclosed herein.
[0210] Constructs described herein can be used to compile a comprehensive library of genetic variations encompassing all residue changes of one target protein, more than one target protein or target proteins that contribute to a trait. In certain embodiments, libraries disclosed herein can be used to select proteins with improved qualities to create an improved single or multiple protein system for example for producing a byproduct, such as a chemical, biofuels, biological agent, pharmaceutical agent, or for biomass, or biologic compared to a non-selective system.
Protein Sequence-Activity Relationship (ProSAR) Mapping
[0211] Understanding the relationship between a protein's amino acid structure and its overall function continues to be of great practical, clinical, and scientific significance for biologists and engineers. Directed evolution can be a powerful engineering and discovery tool, but the random and often combinatorial nature of mutations makes their individual impacts difficult to quantify and thus challenges further engineering. More systematic analysis of contributions of individual residues or saturation mutagenesis remains labor- and time-intensive for entire proteins and simply is not possible on reasonable timescales for multiple proteins in parallel, such as metabolic pathways or multi-protein complexes, using standard methods.
[0212] Provided herein are methods which can be used to rapidly and efficiently examine the roles of some or all genes in a viral, microbial, or eukaryotic genome using mixtures of barcoded oligonucleotides. In some embodiments, these compositions and methods can be used to develop a powerful new technology for comprehensively mapping protein structure-activity relationships (ProSAR).
[0213] Using methods and compositions disclosed herein, multiplex cassette synthesis can be combined with recombineering, to create mutant libraries of specifically designed and barcoded mutations along one or more genes of interest in parallel. Screens and/or selections followed by high-throughput sequencing and/or barcode microarray methods can allow for rapid mapping of protein sequence-activity relationships (ProSAR). In some embodiments, systematic ProSAR mapping can elucidate individual amino acid mutations for improved function and/or activity and/or stability etc.
[0214] Methods can be iterated to combinatorially improve the function, activity, or stability. Cassettes can be generated by oligonucleotide synthesis. Given that existing capabilities of multiplex oligonucleotide synthesis can reach over 120,000 oligonucleotides per array, combined with recombineering, the methods disclosed herein can be scaled to construct mutant libraries for dozens to hundreds of proteins in a single experiment. In some examples, at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 25, 50, 75, 100, 150, 200, 250, 300, 400, 500, 600, 700, 800, 900, 1000, or more proteins can be partially or completely covered by mutant libraries generated by the methods disclosed herein.
[0215] Disclosed herein are strategies to construct barcoded substitution libraries for several different proteins at the same time. Using existing multiplex DNA synthesis technology, as disclosed, a partial or complete substitution library for one or more protein constructs can be barcoded, or non-barcoded if desired, for one or for several hundred proteins at the same time. In some examples, such libraries comprise trackable plasmids as disclosed herein.
[0216] Some embodiments herein apply to analysis and structure/function/stability library construction of any protein with a corresponding screen or selection for activity. Cassette library size can depend on the number (N) of amino acids in a protein of interest, with a full saturation library, including all 20 amino acids at each position and optionally non-naturally occurring amino acids, scaling as 19 (or more)×N and an alanine-mapping library scaling as 1×N. Thus, in some examples, screening of even very large proteins of more than 1,000 amino acids can be tractable given current multiplex oligo synthesis capabilities of at least 120,000 oligos per array.
[0217] In addition or as an alternative to activity screens, more general properties with developed high-throughput screens and selections can be efficiently tested using methods and cassettes disclosed herein. For example, universal protein folding and solubility reporters can be engineered for expression in the cytoplasm, periplasm, and the inner membrane. In some examples, a protein library can be screened under different conditions such as different temperatures, different substrates or co-factors, in order to identify residue changes required for expression of various traits. In other embodiments, because residues can be analyzed one at a time, mutations at residues important for a particular trait, such as thermostability, resistance to environmental pressures, or increases or decreases in functionality or production, can be combined via multiplex recombineering with mutations important for various other traits, such as catalytic activity, to create combinatorial libraries for multi-trait optimization.
[0218] Methods disclosed herein can provide for creating and/or evaluating comprehensive, in vivo, mutational libraries of one or more target protein(s). These approaches can be extended via a recorder cassettes or barcoding technology to generate trackable mutational libraries for any number of residues or every residue in a protein. This approach can be based on protein sequence-activity relationship mapping method extended to work in vivo, capable of working on one or a few to hundreds of proteins simultaneously depending on the technology selected. For example, these methods permit one to map in a single experiment any number of, the majority of, or all possible residue changes over a collection of desired proteins onto a trait of interest, as part of individual proteins of interest or as part of a pathway.
[0219] In some examples, these approaches can be used at least for the following by mapping i) any number of or all residue changes for any number of or all proteins in a specific biochemical pathway, such as lycopene production, or that catalyze similar reactions, such as dehydrogenases or other enzymes of a pathway of use to produce a desired effect or produce a product, or ii) any number of or all residues in the regulatory sites of any number of or all proteins with a specific regulatory mechanism, such as heat shock response, or iii) any number of or all residues of a biological agent used to treat a health condition, such as insulin, a growth factor (HCG), an anti-cancer biologic, or a replacement protein for a deficient population.
[0220] Scores related to various input parameters can be assigned in order to generate one or more composite score(s) for designing genomically-engineered organisms or systems. These scores can reflect quality of genetic variations in genes or genetic loci as they relate to selection of an organism or design of an organism for a predetermined production, trait or traits. Certain organisms or systems can be designed based on a need for improved organisms for biorefining, biomass, such as crops, trees, grasses, crop residues, or forest residues, biofuel production, and using biological conversion, fermentation, chemical conversion and catalysis to generate and use compounds, biopharmaceutical production and biologic production. In certain embodiments, this can be accomplished by modulating growth or production of microorganism through genetic manipulation methods disclosed herein.
[0221] Genetic manipulation by methods disclosed herein of genes encoding a protein can be used to make desired genetic changes that can result in desired phenotypes and can be accomplished through numerous techniques including but not limited to, i) introduction of new genetic material, ii) genetic insertion, disruption or removal of existing genetic material, as well as, iii) mutation of genetic material, such as a point mutation, or any combinations of i, ii, and iii, that results in desired genetic changes with desired phenotypic changes. Mutations can be directed or random, in addition to those including, but not limited to, error prone or directed mutagenesis through PCR, mutator strains, and random mutagenesis. Mutations can be incorporated using trackable plasmids and methods as disclosed herein.
[0222] Disclosed methods can be used for inserting and accumulating higher order modifications into a microorganism's genome or a target protein; for example, multiple different site-specified mutations in the same genome, at high efficiency to generate libraries of genomes with over 1, 5, 10, 15, 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, or more targeted modifications are described. In some examples, these mutations are within regulatory modules, regulatory elements, protein-coding regions, or non-coding regions. Protein coding modifications can include, but are not limited to, amino acid changes, codon optimization, and translation tuning.
[0223] In some instances, methods are provided for the co-delivery of reagents to a single biological cell. The methods generally involve the attachment or linkage of two or more cassettes, followed by delivery of the linked cassettes to a single cell. Generally, the methods provided herein involve the delivery of two or more cassettes to a single cell. In many cases, it is desirable that each individual cell receives the two or more cassettes. Traditional methods of reagent delivery may often be inefficient and/or inconsistent, leading to situations in which some cells receive only one of the cassettes. The methods provided herein may improve the efficiency and/or consistency of reagent delivery, such that a majority of cells in a cell population each receive the two or more cassettes. For example, more than 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% of the cells in a cell population may receive the two or more cassettes.
[0224] The two or more cassettes may be linked by any known method in the art and generally the method chosen will be commensurate with the chemistry of the cassettes. Generally, the two or more cassettes are linked by a covalent bond (i.e., covalently-linked), however, other types of non-covalent chemical bonds are envisioned, such as hydrogen bonds, ionic bonds, and metallic bonds. In this way, the editing cassette and the recorder cassette may be linked and delivered into a single cell. A known edit is then associated with a known recorder or barcode sequence for that cell.
[0225] In one example, the two or more cassettes are nucleic acids, such as two or more nucleic acids. The nucleic acids may be RNA, DNA, or a combination of both, and may contain any number of chemically-modified nucleotides or nucleotide analogues. In some cases, two or more RNA cassettes are linked for delivery to a single cell. In other cases, two or more DNA cassettes are linked for delivery to a single cell. In yet other cases, a DNA cassettes and an RNA cassettes are linked for delivery to a single cell. The nucleic acids may be derived from genomic RNA, complementary DNA (cDNA), or chemically or enzymatically synthesized DNA.
[0226] A cassettes may be of 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, about 110, about 120, about 130, about 140, about 150, about 160, about 170, about 180, about 190, about 200, about 210, about 220, about 230, about 240, about 250, about 275, about 300, about 325, about 350, about 375, about 400, about 425, about 450, about 475, about 500, about 525, about 550, about 575, about 600, about 625, about 650, about 675, about 700, about 725, about 750, about 775, about 800, about 825, about 850, about 875, about 900, about 925, about 950, about 975, about 1000, about 1100, about 1200, about 1300, about 1400, about 1500, about 1750, about 2000, about 2500, about 3000, about 4000, about 5000, about 6000, about 7000, about 8000, about 9000, about 10,000 or greater nucleotide residues in length, up to a full length protein encoding or regulatory genetic element.
[0227] Two or more cassettes may be linked on a linear nucleic acid molecule or may be linked on a plasmid or circular nucleic acid molecule. The two or more cassettes may be linked directly to one another or may be separated by one or more nucleotide spacers or linkers.
[0228] Two or more cassettes may be covalently linked on a linear cassettes or may be covalently linked on a plasmid or circular nucleic acid molecule. The two or more cassettes may be covalently linked directly to one another or may be separated by one or more nucleotide spacers or linkers.
[0229] Any number and variety of cassettes may be linked for co-delivery. For example, the two or more cassettes may include nucleic acids, lipids, proteins, peptides, small molecules, or any combination thereof. The two or more cassettes may be essentially any cassettes that are amenable to linkage.
[0230] In preferred examples, the two or more cassettes are covalently linked (e.g., by a chemical bond). Covalent linkage may help to ensure that the two or more cassettes are co-delivered to a single cell. Generally, the two or more cassettes are covalently linked prior to delivery to a cell. Any method of covalently linking two or more molecules may be utilized, and it should be understood that the methods used will be at least partly determined by the types of cassettes to be linked.
[0231] In some instances, methods are provided for the co-delivery of reagents to a single biological cell. The methods generally involve the covalent attachment or linkage of two or more cassettes, followed by delivery of the covalently-linked cassettes into a single cell. The methods provided may help to ensure that an individual cell receives the two or more cassettes. Any known method of reagent delivery may be utilized to deliver the linked cassettes to a cell and will at least partly depend on the chemistry of the cassettes to be delivered. Non-limiting examples of reagent delivery methods may include: transformation, lipofection, electroporation, transfection, nanoparticles, and the like.
[0232] In various embodiments, cassettes, or isolated, donor, or editing nucleic acids may be introduced to a cell or microorganism to alter or modulate an aspect of the cell or microorganism, for example survival or growth of the microorganism as disclosed herein. The isolated nucleic acid may be derived from genomic RNA, complementary DNA (cDNA), chemically or enzymatically synthesized DNA. Additionally or alternatively, isolated nucleic acids may be of use for capture probes, primers, labeled detection oligonucleotides, or fragments for DNA assembly.
[0233] A “nucleic acid” can include single-stranded and/or double-stranded molecules, as well as DNA, RNA, chemically modified nucleic acids and nucleic acid analogs. It is contemplated that a nucleic acid may be of 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, about 110, about 120, about 130, about 140, about 150, about 160, about 170, about 180, about 190, about 200, about 210, about 220, about 230, about 240, about 250, about 275, about 300, about 325, about 350, about 375, about 400, about 425, about 450, about 475, about 500, about 525, about 550, about 575, about 600, about 625, about 650, about 675, about 700, about 725, about 750, about 775, about 800, about 825, about 850, about 875, about 900, about 925, about 950, about 975, about 1000, about 1100, about 1200, about 1300, about 1400, about 1500, about 1750, about 2000, about 2500, about 3000, about 4000, about 5000, about 6000, about 7000, about 8000, about 9000, about 10,000 or greater nucleotide residues in length, up to a full length protein encoding or regulatory genetic element.
[0234] Isolated nucleic acids may be made by any method known in the art, for example using standard recombinant methods, assembly methods, synthetic techniques, or combinations thereof. In some embodiments, the nucleic acids may be cloned, amplified, assembled, or otherwise constructed.
[0235] The nucleic acids may conveniently comprise sequences in addition to a portion of a lysine riboswitch. For example, a multi-cloning site comprising one or more endonuclease restriction sites may be added. A nucleic acid may be attached to a vector, adapter, or linker for cloning of a nucleic acid. Additional sequences may be added to such cloning and sequences to optimize their function, to aid in isolation of the nucleic acid, or to improve the introduction of the nucleic acid into a cell. Use of cloning vectors, expression vectors, adapters, and linkers is well known in the art.
[0236] Isolated nucleic acids may be obtained from cellular, bacterial, or other sources using any number of cloning methodologies known in the art. In some embodiments, oligonucleotide probes which selectively hybridize, under stringent conditions, to other oligonucleotides or to the nucleic acids of an organism or cell. Methods for construction of nucleic acid libraries are known and any such known methods may be used.
[0237] Cellular genomic DNA, RNA, or cDNA may be screened for the presence of an identified genetic element of interest using a probe based upon one or more sequences. Various degrees of stringency of hybridization may be employed in the assay.
[0238] High stringency conditions for nucleic acid hybridization are well known in the art. For example, conditions may comprise low salt and/or high temperature conditions, such as provided by about 0.02 M to about 0.15 M NaCl at temperatures of about 50° C. to about 70° C. It is understood that the temperature and ionic strength of a desired stringency are determined in part by the length of the particular nucleic acid(s), the length and nucleotide content of the target sequence(s), the charge composition of the nucleic acid(s), and by the presence or concentration of formamide, tetramethylammonium chloride or other solvent(s) in a hybridization mixture. Nucleic acids may be completely complementary to a target sequence or may exhibit one or more mismatches.
[0239] Nucleic acids of interest may also be amplified using a variety of known amplification techniques. For instance, polymerase chain reaction (PCR) technology may be used to amplify target sequences directly from DNA, RNA, or cDNA. PCR and other in vitro amplification methods may also be useful, for example, to clone nucleic acid sequences, to make nucleic acids to use as probes for detecting the presence of a target nucleic acid in samples, for nucleic acid sequencing, or for other purposes.
[0240] Isolated nucleic acids may be prepared by direct chemical synthesis by methods such as the phosphotriester method, or using an automated synthesizer. Chemical synthesis generally produces a single stranded oligonucleotide. This may be converted into double stranded DNA by hybridization with a complementary sequence or by polymerization with a DNA polymerase using the single strand as a template.
[0241] Any method known in the art for identifying, isolating, purifying, using and assaying activities of target proteins contemplated herein are contemplated. Target proteins contemplated herein include protein agents used to treat a human condition or to regulate processes (e.g. part of a pathway such as an enzyme) involved in disease of a human or non-human mammal. Any method known for selection and production of antibodies or antibody fragments is also contemplated. Additionally or alternatively, target proteins can be proteins or enzymes involved in a pathway or process in a virus, cell, or organism.
Targetable Nucleic Acid Cleavage Systems
[0242] Some methods disclosed herein comprise targeting cleavage of specific nucleic acid sequences using a site-specific, targetable, and/or engineered nuclease or nuclease system. Such nucleases can create double-stranded break (DSBs) at desired locations in a genome or nucleic acid molecule. In other examples, a nuclease can create a single strand break. In some cases, two nucleases are used, each of which generates a single strand break.
[0243] The one or more double or single strand break can be repaired by natural processes of homologous recombination (HR) and non-homologous end-joining (NHEJ) using the cell's endogenous machinery. Additionally or alternatively, endogenous or heterologous recombination machinery can be used to repair the induced break or breaks.
[0244] Engineered nucleases such as zinc finger nucleases (ZFNs), Transcription Activator-Like Effector Nucleases (TALENs), engineered homing endonucleases, and RNA or DNA guided endonucleases, such as CRISPR/Cas such as Cas9 or CPF 1, and/or Argonaute systems, are particularly appropriate to carry out some of the methods of the present invention. Additionally or alternatively, RNA targeting systems can use used, such as CRISPR/Cas systems including c2c2 nucleases.
[0245] Methods disclosed herein can comprise cleaving a target nucleic acid using a CRISPR systems, such as a Type I, Type II, Type III, Type IV, Type V, or Type VI CRISPR system. CRISPR/Cas systems can be multi-protein systems or single effector protein systems. Multi-protein, or Class 1, CRISPR systems include Type I, Type III, and Type IV systems. Alternatively, Class 2 systems include a single effector molecule and include Type II, Type VI, and Type VI.
[0246] CRISPR systems used in methods disclosed herein can comprise a single or multiple effector proteins. An effector protein can comprise one or multiple nuclease domains. An effector protein can target DNA or RNA, and the DNA or RNA may be single stranded or double stranded. Effector proteins can generate double strand or single strand breaks. Effector proteins can comprise mutations in a nuclease domain thereby generating a nickase protein. Effector proteins can comprise mutations in one or more nuclease domains, thereby generating a catalytically dead nuclease that is able to bind but not cleave a target sequence. CRISPR systems can comprise a single or multiple guiding RNAs. The gRNA can comprise a crRNA. The gRNA can comprise a chimeric RNA with crRNA and tracrRNA sequences. The gRNA can comprise a separate crRNA and tracrRNA. Target nucleic acid sequences can comprise a protospacer adjacent motif (PAM) or a protospacer flanking site (PFS). The PAM or PFS may be 3′ or 5′ of the target or protospacer site. Cleavage of a target sequence may generate blunt ends, 3′ overhangs, or 5′ overhangs.
[0247] A gRNA can comprise a spacer sequence. Spacer sequences can be complementary to target sequences or protospacer sequences. Spacer sequences can be 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, or 36 nucleotides in length. In some examples, the spacer sequence can be less than 10 or more than 36 nucleotides in length.
[0248] A gRNA can comprise a repeat sequence. In some cases, the repeat sequence is part of a double stranded portion of the gRNA. A repeat sequence can be 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 nucleotides in length. In some examples, the spacer sequence can be less than 10 or more than 50 nucleotides in length.
[0249] A gRNA can comprise one or more synthetic nucleotides, non-naturally occurring nucleotides, nucleotides with a modification, deoxyribonucleotide, or any combination thereof. Additionally or alternatively, a gRNA may comprise a hairpin, linker region, single stranded region, double stranded region, or any combination thereof. Additionally or alternatively, a gRNA may comprise a signaling or reporter molecule.
[0250] A CRISPR nuclease can be endogenously or recombinantly expressed within a cell. A CRISPR nuclease can be encoded on a chromosome, extrachromosomally, or on a plasmid, synthetic chromosome, or artificial chromosome. A CRISPR nuclease can be provided or delivered to the cell as a polypeptide or mRNA encoding the polypeptide. In such examples, polypeptide or mRNA can be delivered through standard mechanisms known in the art, such as through the use of cell permeable peptides, nanoparticles, or viral particles.
[0251] gRNAs can be encoded by genetic or episomal DNA within a cell. In some examples, gRNAs can be provided or delivered to a cell expressing a CRISPR nuclease. gRNAs can be provided or delivered concomitantly with a CRISPR nuclease or sequentially. Guide RNAs can be chemically synthesized, in vitro transcribed, or otherwise generated using standard RNA generation techniques known in the art.
[0252] A CRISPR system can be a Type II CRISPR system, for example a Cas9 system. The Type II nuclease can comprise a single effector protein, which, in some cases, comprises a RuvC and HNH nuclease domains. In some cases a functional Type II nuclease can comprise two or more polypeptides, each of which comprises a nuclease domain or fragment thereof. The target nucleic acid sequences can comprise a 3′ protospacer adjacent motif (PAM). In some examples, the PAM may be 5′ of the target nucleic acid. Guide RNAs (gRNA) can comprise a single chimeric gRNA, which contains both crRNA and tracrRNA sequences. Alternatively, the gRNA can comprise a set of two RNAs, for example a crRNA and a tracrRNA. The Type II nuclease can generate a double strand break, which is some cases creates two blunt ends. In some cases, the Type II CRISPR nuclease is engineered to be a nickase such that the nuclease only generates a single strand break. In such cases, two distinct nucleic acid sequences can be targeted by gRNAs such that two single strand breaks are generated by the nickase. In some examples, the two single strand breaks effectively create a double strand break. In some cases where a Type II nickase is used to generate two single strand breaks, the resulting nucleic acid free ends can either be blunt, have a 3′ overhang, or a 5′ overhang. In some examples, a Type II nuclease may be catalytically dead such that it binds to a target sequence, but does not cleave. For example, a Type II nuclease could have mutations in both the RuvC and HNH domains, thereby rendering the both nuclease domains non-functional. A Type II CRISPR system can be one of three sub-types, namely Type II-A, Type II-B, or Type II-C.
[0253] A CRISPR system can be a Type V CRISPR system, for example a Cpf1, C2c1, or C2c3 system. The Type V nuclease can comprise a single effector protein, which in some cases comprises a single RuvC nuclease domain. In other cases, a function Type V nuclease comprises a RuvC domain split between two or more polypeptides. In such cases, the target nucleic acid sequences can comprise a 5′ PAM or 3′ PAM. Guide RNAs (gRNA) can comprise a single gRNA or single crRNA, such as can be the case with Cpf1. In some cases, a tracrRNA is not needed. In other examples, such as when C2c1 is used, a gRNA can comprise a single chimeric gRNA, which contains both crRNA and tracrRNA sequences or the gRNA can comprise a set of two RNAs, for example a crRNA and a tracrRNA. The Type V CRISPR nuclease can generate a double strand break, which in some cases generates a 5′ overhang. In some cases, the Type V CRISPR nuclease is engineered to be a nickase such that the nuclease only generates a single strand break. In such cases, two distinct nucleic acid sequences can be targeted by gRNAs such that two single strand breaks are generated by the nickase. In some examples, the two single strand breaks effectively create a double strand break. In some cases where a Type V nickase is used to generate two single strand breaks, the resulting nucleic acid free ends can either be blunt, have a 3′ overhang, or a 5′ overhang. In some examples, a Type V nuclease may be catalytically dead such that it binds to a target sequence, but does not cleave. For example, a Type V nuclease could have mutations a RuvC domain, thereby rendering the nuclease domain non-functional.
[0254] A CRISPR system can be a Type VI CRISPR system, for example a C2c2 system. A Type VI nuclease can comprise a HEPN domain. In some examples, the Type VI nuclease comprises two or more polypeptides, each of which comprises a HEPN nuclease domain or fragment thereof. In such cases, the target nucleic acid sequences can by RNA, such as single stranded RNA. When using Type VI CRISPR system, a target nucleic acid can comprise a protospacer flanking site (PFS). The PFS may be 3′ or 5′ or the target or protospacer sequence. Guide RNAs (gRNA) can comprise a single gRNA or single crRNA. In some cases, a tracrRNA is not needed. In other examples, a gRNA can comprise a single chimeric gRNA, which contains both crRNA and tracrRNA sequences or the gRNA can comprise a set of two RNAs, for example a crRNA and a tracrRNA. In some examples, a Type VI nuclease may be catalytically dead such that it binds to a target sequence, but does not cleave. For example, a Type VI nuclease could have mutations in a HEPN domain, thereby rendering the nuclease domains non-functional.
[0255] Non-limiting examples of suitable nucleases, including nucleic acid-guided nucleases, for use in the present disclosure include C2c1, C2c2, C2c3, Cas1, Cas1B, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csn1 and Csx12), Cas10, Cpf1, Csy1, Csy2, Csy3, Cse1, Cse2, Csc1, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmrl, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx100, Csx16, CsaX, Csx3, Csx1, Csx15, Csf1, Csf2, Csf3, Csf4, homologues thereof, orthologues thereof, or modified versions thereof. Suitable nucleic acid-guided nucleases can be from an organism from a genus which includes but is not limited to Thiomicrospira, Succinivibrio, Candidatus, Porphyromonas, Acidomonococcus, Prevotella, Smithella, Moraxella, Synergistes, Francisella, Leptospira, Catenibacterium, Kandleria, Clostridium, Dorea, Coprococcus, Enterococcus, Fructobacillus, Weissella, Pediococcus, Corynebacter, Sutterella, Legionella, Treponema, Roseburia, Filifactor, Eubacterium, Streptococcus, Lactobacillus, Mycoplasma, Bacteroides, Flaviivola, Flavobacterium, Sphaerochaeta, Azospirillum, Gluconacetobacter, Neisseria, Roseburia, Parvibaculum, Staphylococcus, Nitratifractor, Mycoplasma, Alicyclobacillus, Brevibacilus, Bacillus, Bacteroidetes, Brevibacilus, Carnobacterium, Clostridiaridium, Clostridium, Desulfonatronum, Desulfovibrio, Helcococcus, Leptotrichia, Listeria, Methanomethyophilus, Methylobacterium, Opitutaceae, Paludibacter, Rhodobacter, Sphaerochaeta, Tuberibacillus, and Campylobacter. Species of organism of such a genus can be as otherwise herein discussed. Suitable nucleic acid-guided nucleases can be from an organism from a genus or unclassified genus within a kingdom, which includes but is not limited to Firmicute, Actinobacteria, Bacteroidetes, Proteobacteria, Spirochates, and Tenericutes. Suitable nucleic acid-guided nucleases can be from an organism from a genus or unclassified genus within a phylum which includes but is not limited to Erysipelotrichia, Clostridia, Bacilli, Actinobacteria, Bacteroidetes, Flavobacteria, Alphaproteobacteria, Betaproteobacteria, Gammaproteobacteria, Deltaproteobacteria, Epsilonproteobacteria, Spirochaetes, and Mollicutes. Suitable nucleic acid-guided nucleases can be from an organism from a genus or unclassified genus within an order which includes but is not limited to Clostridiales, Lactobacillales, Actinomycetales, Bacteroidales, Flavobacteriales, Rhizobiales, Rhodospirillales, Burkholderiales, Neisseriales, Legionellales, Nautiliales, Campylobacterales, Spirochaetales, Mycoplasmatales, and Thiotrichales. Suitable nucleic acid-guided nucleases can be from an organism from a genus or unclassified genus within a family which includes but is not limited to Lachnospiraceae, Enterococcaceae, Leuconostocaceae, Lactobacillaceae, Streptococcaceae, Peptostreptococcaceae, Staphylococcaceae, Eubacteriaceae, Corynebacterineae, Bacteroidaceae, Flavobacterium, Cryomoorphaceae, Rhodobiaceae, Rhodospirillaceae, Acetobacteraceae, Sutterellaceae, Neisseriaceae, Legionellaceae, Nautiliaceae, Campylobacteraceae, Spirochaetaceae, Mycoplasmataceae, Pisciririckettsiaceae, and Francisellaceae.
[0256] Other nucleic acid-guided nucleases suitable for use in the methods, systems, and compositions of the present disclosure include those derived from an organism such as, but not limited to, Thiomicrospira sp. XS5, Eubacterium rectale, Succinivibrio dextrinosolvens, Candidatus Methanoplasma termitum, Candidatus Methanomethylophilus alvus, Porphyromonas crevioricanis, Flavobacterium branchiophilum, Acidomonococcus sp., Lachnospiraceae bacterium COE1, Prevotella brevis ATCC 19188, Smithella sp. SCADC, Moraxella bovoculi, Synergistes jonesii, Bacteroidetes oral taxon 274, Francisella tularensis, Leptospira inadai serovar Lyme str. 10, Acidomonococcus sp. crystal structure (5B43) S. mutans, S. agalactiae, S. equisimilis, S. sanguinis, S. pneumonia; C. jejuni, C. coli; N. salsuginis, N. tergarcus; S. auricularis, S. carnosus; N. meningitides, N. gonorrhoeae; L. monocytogenes, L. ivanovii; C. botulinum, C. difficile, C. tetani, C. sordellii; Francisella tularensis 1, Prevotella albensis, Lachnospiraceae bacterium MC2017 1, Butyrivibrio proteoclasticus, Peregrinibacteria bacterium GW2011_GWA2_33_10, Parcubacteria bacterium GW2011_GWC2_44_17, Smithella sp. SCADC, Acidaminococcus sp. BV3L6, Lachnospiraceae bacterium MA2020, Candidatus Methanoplasma termitum, Eubacterium eligens, Moraxella bovoculi 237, Leptospira inadai, Lachnospiraceae bacterium ND2006, Porphyromonas crevioricanis 3, Prevotella disiens, Porphyromonas macacae, Catenibacterium sp. CAG:290, Kandleria vitulina, Clostridiales bacterium KA00274, Lachnospiraceae bacterium 3-2, Dorea longicatena, Coprococcus catus GD/7, Enterococcus columbae DSM 7374, Fructobacillus sp. EFB-N1, Weissella halotolerans, Pediococcus acidilactici, Lactobacillus curvatus, Streptococcus pyogenes, Lactobacillus versmoldensis, and Filifactor alocis ATCC 35896.
[0257] Suitable nucleases for use in any of the methods disclosed herein include, but are not limited to, nucleases having the sequences listed in Table 1, or homologues having at least 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or 99% sequence identity to any of the nucleases listed in Table 1.
TABLE-US-00001 TABLE 1 Amino acid MAD nuclease sequence MAD1 SEQ ID NO: 1 MAD2 SEQ ID NO: 2 MAD3 SEQ ID NO: 3 MAD4 SEQ ID NO: 4 MAD5 SEQ ID NO: 5 MAD6 SEQ ID NO: 6 MAD7 SEQ ID NO: 7 MAD8 SEQ ID NO: 8 MAD9 SEQ ID NO: 9 MAD10 SEQ ID NO: 10 MAD11 SEQ ID NO: 11 MAD12 SEQ ID NO: 12 MAD13 SEQ ID NO: 13 MAD14 SEQ ID NO: 14 MAD15 SEQ ID NO: 15 MAD16 SEQ ID NO: 16 MAD17 SEQ ID NO: 17 MAD18 SEQ ID NO: 18 MAD19 SEQ ID NO: 19 MAD20 SEQ ID NO: 20
[0258] In some methods disclosed herein, Argonaute (Ago) systems can be used to cleave target nucleic acid sequences. Ago protein can be derived from a prokaryote, eukaryote, or archaea. The target nucleic acid may be RNA or DNA. A DNA target may be single stranded or double stranded. In some examples, the target nucleic acid does not require a specific target flanking sequence, such as a sequence equivalent to a protospacer adjacent motif or protospacer flanking sequence. The Ago protein may create a double strand break or single strand break. In some examples, when a Ago protein forms a single strand break, two Ago proteins may be used in combination to generate a double strand break. In some examples, an Ago protein comprises one, two, or more nuclease domains. In some examples, an Ago protein comprises one, two, or more catalytic domains. One or more nuclease or catalytic domains may be mutated in the Ago protein, thereby generating a nickase protein capable of generating single strand breaks. In other examples, mutations in one or more nuclease or catalytic domains of an Ago protein generates a catalytically dead Ago protein that can bind but not cleave a target nucleic acid.
[0259] Ago proteins can be targeted to target nucleic acid sequences by a guiding nucleic acid. In many examples, the guiding nucleic acid is a guide DNA (gDNA). The gDNA can have a 5′ phosphorylated end. The gDNA can be single stranded or double stranded. Single stranded gDNA can be 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 nucleotides in length. In some examples, the gDNA can be less than 10 nucleotides in length. In some examples, the gDNA can be more than 50 nucleotides in length.
[0260] Argonaute-mediated cleavage can generate blunt end, 5′ overhangs, or 3′ overhangs. In some examples, one or more nucleotides are removed from the target site during or following cleavage.
[0261] Argonaute protein can be endogenously or recombinantly expressed within a cell. Argonaute can be encoded on a chromosome, extrachromosomally, or on a plasmid, synthetic chromosome, or artificial chromosome. Additionally or alternatively, an Argonaute protein can be provided or delivered to the cell as a polypeptide or mRNA encoding the polypeptide. In such examples, polypeptide or mRNA can be delivered through standard mechanisms known in the art, such as through the use of cell permeable peptides, nanoparticles, or viral particles.
[0262] Guide DNAs can be provided by genetic or episomal DNA within a cell. In some examples, gDNA are reverse transcribed from RNA or mRNA within a cell. In some examples, gDNAs can be provided or delivered to a cell expressing an Ago protein. Guide DNAs can be provided or delivered concomitantly with an Ago protein or sequentially. Guide DNAs can be chemically synthesized, assembled, or otherwise generated using standard DNA generation techniques known in the art. Guide DNAs can be cleaved, released, or otherwise derived from genomic DNA, episomal DNA molecules, isolated nucleic acid molecules, or any other source of nucleic acid molecules.
[0263] In some instances, compositions are provided comprising a nuclease such as an nucleic acid-guided nuclease (e.g., Cas9, Cpf1, MAD2, or MAD7) or a DNA-guided nuclease (e.g., Ago), linked to a chromatin-remodeling enzyme. Without wishing to be bound by theory, a nuclease fusion protein as described herein may provide improved accessibility to regions of highly-structured DNA. Non-limiting examples of chromatin-remodeling enzymes that can be linked to a nucleic-acid guided nuclease may include: histone acetyl transferases (HATs), histone deacetylases (HDACs), histone methyltransferases (HMTs), chromatin remodeling complexes, and transcription activator-like (Tal) effector proteins. Histone deacetylases may include HDAC1, HDAC2, HDAC3, HDAC4, HDAC5, HDAC6, HDAC7, HDAC8, HDAC9, HDAC10, HDAC11, sirtuin 1, sirtuin 2, sirtuin 3, sirtuin 4, sirtuin 5, sirtuin 6, and sirtuin 7. Histone acetyl transferases may include GCN5, PCAF, Hat1, Elp3, Hpa2, Hpa3, ATF-2, Nut1, Esa1, Sas2, Sas3, Tip60, MOF, MOZ, MORF, HBO1, p300, CBP, SRC-1, ACTR, TIF-2, SRC-3, TAFII250, TFIIIC, Rtt109, and CLOCK. Histone methyltransferases may include ASH1L, DOT1L, EHMT1, EHMT2, EZH1, EZH2, MLL, MLL2, MLL3, MLL4, MLL5, NSD1, PRDM2, SET, SETBP1, SETD1A, SETD1B, SETD2, SETD3, SETD4, SETD5, SETD6, SETD7, SETD8, SETD9, SETDB1, SETDB2, SETMAR, SMYD1, SMYD2, SMYD3, SMYD4, SMYD5, SUV39H1, SUV39H2, SUV420H1, and SUV420H2. Chromatin-remodeling complexes may include SWI/SNF, ISWI, NuRD/Mi-2/CHD, IN080 and SWR1.
[0264] In some instances, the nuclease is a wild-type nuclease. In other instances, the nuclease is a chimeric engineered nuclease. Chimeric engineered nucleases as disclosed herein can comprise one or more fragments or domains, and the fragments or domains can be of a nuclease, such as nucleic acid-guided nuclease, orthologs of organisms of genuses, species, or other phylogenetic groups disclosed herein; advantageously the fragments are from nuclease orthologs of different species. A chimeric engineered nuclease can be comprised of fragments or domains from at least two different nucleases. A chimeric engineered nuclease can be comprised of fragments or domains from at least two different species. A chimeric engineered nuclease can be comprised of fragments or domains from at least 2, 3, 4, 5, 6, 7, 8, 9, 10, or more different nucleases or different species. In some cases, more than one fragment or domain from one nuclease or species, wherein the more than one fragment or domain are separated by fragments or domains from a second nuclease or species. In some examples, a chimeric engineered nuclease comprises 2 fragments, each from a different protein or nuclease. In some examples, a chimeric engineered nuclease comprises 3 fragments, each from a different protein or nuclease. In some examples, a chimeric engineered nuclease comprises 4 fragments, each from a different protein or nuclease. In some examples, a chimeric engineered nuclease comprises 5 fragments, each from a different protein or nuclease.
[0265] Nuclease fusion proteins can be recombinantly expressed within a cell. A nuclease fusion protein can be encoded on a chromosome, extrachromosomally, or on a plasmid, synthetic chromosome, or artificial chromosome. A nuclease and a chromatin-remodeling enzyme may be engineered separately, and then covalently linked, prior to delivery to a cell. A nuclease fusion protein can be provided or delivered to the cell as a polypeptide or mRNA encoding the polypeptide. In such examples, polypeptide or mRNA can be delivered through standard mechanisms known in the art, such as through the use of cell permeable peptides, nanoparticles, or viral particles.
Cell-Cycle-Dependent Expression of Targeted Nucleases.
[0266] In some instances, compositions comprising a cell-cycle-dependent nuclease are provided. A cell-cycle dependent nuclease generally includes a targeted nuclease as described herein linked to an enzyme that leads to degradation of the targeted nuclease during G1 phase of the cell cycle, and expression of the targeted nuclease during G2/M phase of the cell cycle. Such cell-cycle dependent expression may, for example, bias the expression of the nuclease in cells where homology-directed repair (HDR) is most active (e.g., during G2/M phase). In some cases, the nuclease is covalently linked to cell-cycle regulated protein such as one that is actively degraded during G1 phase of the cell cycle and is actively expressed during G2/M phase of the cell cycle. In a non-limiting example, the cell-cycle regulated protein is Geminin. Other non-limiting examples of cell-cycle regulated proteins may include: Cyclin A, Cyclin B, Hsll, Cdc6, Fin1, p21 and Skp2.
[0267] In some instances, the nuclease is a wild-type nuclease.
[0268] In other instances, the nuclease is a engineered nuclease. Engineered nucleases can be non-naturally occurring.
[0269] Non-naturally occurring targetable nucleases and non-naturally occurring targetable nuclease systems can address many of these challenges and limitations.
[0270] Disclosed herein are non-naturally targetable nuclease systems. Such targetable nuclease systems are engineered to address one or more of the challenges described above and can be referred to as engineered nuclease systems. Engineered nuclease systems can comprise one or more of an engineered nuclease, such as an engineered nucleic acid-guided nuclease, an engineered guide nucleic acid, an engineered polynucleotides encoding said nuclease, or an engineered polynucleotides encoding said guide nucleic acid. Engineered nucleases, engineered guide nucleic acids, and engineered polynucleotides encoding the engineered nuclease or engineered guide nucleic acid are not naturally occurring and are not found in nature. It follows that engineered nuclease systems including one or more of these elements are non-naturally occurring.
[0271] Non-limiting examples of types of engineering that can be done to obtain a non-naturally occurring nuclease system are as follows. Engineering can include codon optimization to facilitate expression or improve expression in a host cell, such as a heterologous host cell. Engineering can reduce the size or molecular weight of the nuclease in order to facilitate expression or delivery. Engineering can alter PAM selection in order to change PAM specificity or to broaden the range of recognized PAMs. Engineering can alter, increase, or decrease stability, processivity, specificity, or efficiency of a targetable nuclease system. Engineering can alter, increase, or decrease protein stability. Engineering can alter, increase, or decrease processivity of nucleic acid scanning. Engineering can alter, increase, or decrease target sequence specificity. Engineering can alter, increase, or decrease nuclease activity. Engineering can alter, increase, or decrease editing efficiency. Engineering can alter, increase, or decrease transformation efficiency. Engineering can alter, increase, or decrease nuclease or guide nucleic acid expression.
[0272] Examples of non-naturally occurring nucleic acid sequences which are disclosed herein include sequences codon optimized for expression in bacteria, such as E. coli (e.g., SEQ ID NO: 41-60), sequences codon optimized for expression in single cell eukaryotes, such as yeast (e.g., SEQ ID NO: 127-146), sequences codon optimized for expression in multi cell eukaryotes, such as human cells (e.g., SEQ ID NO: 147-166), polynucleotides used for cloning or expression of any sequences disclosed herein (e.g., SEQ ID NO: 61-80), plasmids comprising nucleic acid sequences (e.g., SEQ ID NO: 21-40) operably linked to a heterologous promoter or nuclear localization signal or other heterologous element, proteins generated from engineered or codon optimized nucleic acid sequences (e.g., SEQ ID NO: 1-20), or engineered guide nucleic acids comprising any one of SEQ ID NO: 84-107. Such non-naturally occurring nucleic acid sequences can be amplified, cloned, assembled, synthesized, generated from synthesized oligonucleotides or dNTPs, or otherwise obtained using methods known by those skilled in the art.
[0273] Additional examples of non-naturally occurring nucleic acid sequences which are disclosed herein include sequences codon optimized for expression in bacteria, such as E. coli (e.g., SEQ ID NO: 168), sequences codon optimized for expression in single cell eukaryotes, such as yeast (e.g., SEQ ID NO: 169), sequences codon optimized for expression in multi cell eukaryotes, such as human cells (e.g., SEQ ID NO: 170), polynucleotides used for cloning or expression of any sequences disclosed herein (e.g., SEQ ID NO: 171), plasmids comprising nucleic acid sequences (e.g., SEQ ID NO: 167) operably linked to a heterologous promoter or nuclear localization signal or other heterologous element, proteins generated from engineered or codon optimized nucleic acid sequences (e.g., SEQ ID NO: 108-110), or engineered guide nucleic acids compatible with any targetable nuclease disclosed herein. Such non-naturally occurring nucleic acid sequences can be amplified, cloned, assembled, synthesized, generated from synthesized oligonucleotides or dNTPs, or otherwise obtained using methods known by those skilled in the art.
[0274] A guide nucleic acid can be DNA. A guide nucleic acid can be RNA. A guide nucleic acid can comprise both DNA and RNA. A guide nucleic acid can comprise modified of non-naturally occurring nucleotides. In cases where the guide nucleic acid comprises RNA, the RNA guide nucleic acid can be encoded by a DNA sequence on a polynucleotide molecule such as a plasmid, linear construct, or editing cassette as disclosed herein.
[0275] Nucleic acid-guided nucleases can be compatible with guide nucleic acids that are not found within the nucleases endogenous host. Such orthogonal guide nucleic acids can be determined by empirical testing. Orthogonal guide nucleic acids can come from different bacterial species or be synthetic or otherwise engineered to be non-naturally occurring.
[0276] Orthogonal guide nucleic acids that are compatible with a common nucleic acid-guided nuclease can comprise one or more common features. Common features can include sequence outside a pseudoknot region. Common features can include a pseudoknot region (e.g., 172-181). Common features can include a primary sequence or secondary structure.
[0277] A guide nucleic acid can be engineered to target a desired target sequence by altering the guide sequence such that the guide sequence is complementary to the target sequence, thereby allowing hybridization between the guide sequence and the target sequence. A guide nucleic acid with an engineered guide sequence can be referred to as an engineered guide nucleic acid. Engineered guide nucleic acids are often non-naturally occurring and are not found in nature.
[0278] In other instances, the nuclease is a chimeric nuclease. Chimeric nucleases can be engineered nucleases. Chimeric nucleases as disclosed herein can comprise one or more fragments or domains, and the fragments or domains can be of a nuclease, such as nucleic acid-guided nuclease, orthologs of organisms of genuses, species, or other phylogenetic groups; advantageously the fragments are from nuclease orthologs of different species. A chimeric nuclease can be comprised of fragments or domains from at least two different nucleases. A chimeric nuclease can be comprised of fragments or domains from at least two different species. A chimeric nuclease can be comprised of fragments or domains from at least 2, 3, 4, 5, 6, 7, 8, 9, 10, or more different nucleases or different species. In some cases, more than one fragment or domain from one nuclease or species, wherein the more than one fragment or domain are separated by fragments or domains from a second nuclease or species. In some examples, a chimeric nuclease comprises 2 fragments, each from a different protein or nuclease. In some examples, a chimeric nuclease comprises 3 fragments, each from a different protein or nuclease. In some examples, a chimeric nuclease comprises 4 fragments, each from a different protein or nuclease. In some examples, a chimeric nuclease comprises 5 fragments, each from a different protein or nuclease.
EXAMPLES
Example 1—CREATE-Plasmids and Libraries
[0279]
Example 2—CREATE Plasmid Validation
[0280]
[0281]
[0282]
[0283]
Example 3—CREATE-Recording Used to Engineer Trackable Episomal DNA Libraries
[0284]
[0285]
[0286]
Example 4—CREATE-Mediated Editing of Episomal DNA
[0287] Methods and compositions disclosed herein were used to mutate a key residue of the cas9 gene used for the CREATE process (e.g.
Example 5—CREATE-Mediated Editing and Tracking of E. coli Genome—Double Cassette
[0288] To test the performance of the recording strategy in a genomic context we tested the ability to edit two distal genomic loci in the E. coli genome (e.g.
Example 6—Plasmid Curing for Combinatorial Engineering
[0289]
[0290]
Example 7—Recursive Engineering Using Iterative CREATE-Recording Engineering Events
[0291] The example of recursive engineering depicted in
[0292]
Example 8—CREATE Design and Workflow
[0293] An example overview of CRISPR EnAbled Trackable genome Engineering (CREATE) design workflow is depicted in
Example 9—CREATE Design Validation
[0294]
Example 10—Scanning Saturation Mutagenesis of an Essential Chromosomal Gene
[0295]
Example 11—Reconstruction of ALE Mutation Set and Forward Engineering of Thermotolerant Genotypes
[0296]
Example 12—Genome Scale Mapping of Amino Acid Substitutions for the Study of Antibiotic Resistance and Tolerance
[0297]
Example 13—CREATE-Enabled Flexible Design Strategies
[0298] Illustration of example designs compatible with CREATE strategy are depicted in
Example 14—Engineering the Lycopene Pathway
[0299]
Example 15—Cas9 Editing Efficiency Controls
[0300]
Example 16—Toxicity of gRNA dsDNA Cleavage in E. coli
[0301]
[0302]
[0303]
[0304]
Example 17—CREATE Strategy for Gene Deletion
[0305]
Example 18—Editing Efficiency Controls by Cotransformation of gRNA and Linear dsDNA Cassettes
[0306]
Example 19—Library Cloning Analysis and Statistics
[0307]
Example 20—Precision of CREATE Cassette Tracking of Recombineered Populations
[0308]
Example 21—Growth Characteristics of folA Mutations in M9 Minimal Media
[0309]
Example 22—Enrichment Profiles for folA CREATE Cassettes in Minimal Media
[0310]
Example 23—Validation of Newly Identified acrB Mutations for Improved Solvent and Antibiotic Tolerance
[0311]
Example 24—Benefits of Rational Mutagenesis for Sampling Novel Adaptive Genotypes
[0312]
Example 25—Reconstruction of Mutations Identified by Erythromycin Selection
[0313]
Example 26—Validation of Crp S28P Mutation for Furfural or Thermal Tolerance
[0314]
Example 27—Genome-Scale Sequence to Activity Relationship Mapping at Single Nucleotide Resolution
[0315] Advances in DNA synthesis and sequencing have motivated increasingly complex efforts to rationally program genomic modifications on laboratory timescales. Realization of such efforts requires strategies that span the design-build-test forward-engineering cycle by not only precisely and efficiently generating large numbers of mutant designs but also by mapping the effects of these mutations at similar throughputs. CRISPR EnAbled Trackable genome Engineering (CREATE) couples highly efficient CRISPR editing with massively parallel oligomer synthesis to enable trackable precision editing on a genome wide scale. This can be accomplished using synthetic cassettes that link a targeting guide RNA with rationally programmable homologous repair cassettes that can be systematically designed to edit loci across a genome and track their phenotypic effects. We demonstrated the flexibility and ease of use of CREATE for genome engineering by parallel mapping of sequence-activity relationships for applications ranging from site saturation mutagenesis, rational protein engineering, complete residue substitution libraries and reconstruction of prior adaptive laboratory evolution experiments.
[0316] Additional methods are described in Garst A D, et al.; Nature Biotechnol. 2017 January; 35(1)L48-55, which is incorporated herein by reference in its entirety. Additional methods are described in Garst, A, et al.; Microb Cell Fact. 2013 Oct. 30; 12:99, which is incorporated herein by reference in its entirety.
[0317] Validation of CREATE Cassette Design
[0318] In order to realize our engineering objectives we took into account a number of key design considerations to both maximize the editing efficiency as well as distill a complex design process into an easily executable workflow. For example, each CREATE cassette is designed to include both a targeting guide RNA (gRNA) and a homology arm (HA) that introduces rational mutations at the chromosomal cleavage site (e.g.
[0319] To test this concept we first performed control experiments using a CREATE cassette designed to inactivate the galK gene by introducing a single point mutation to convert codon 145 from TAT to a TAA stop codon (e.g.
[0320] To further characterize how changes in the CREATE cassette design influence the editing efficiency we varied the HA length (80-120 bp) and the distance between the PAM-codon/TS (17-59 bp) (e.g.
[0321] High-Throughput Design and Multiplexed Library Construction
[0322] To scale the CREATE process for genome-wide applications we developed a custom software to automate cassette design that takes into account the above mentioned criteria to systematically identify a PAM sequence nearest to a target site (TS) of interest and modify it to create a synonymous PAM mutation. This design software is part of a suite of web-based design tools that can be implemented for E. coli and is under further development for other organisms as well as an expanded set of CRISPR-Cas systems. This software platform enables high-throughput rational design of genomic libraries in a format that is compatible with parallelized array based oligo synthesis and simple homology based cloning methods that can be performed in batch for library construction (e.g.
[0323] Using this design software we generated a total of 52,356 CREATE cassettes for a range of applications where sequence to activity mapping by traditional methods would be time-consuming and prohibitively expensive. Briefly, the library designs included: 1) a complete saturation of the folA gene to map the entire mutational landscape of an essential gene in its chromosomal context 2) saturation mutagenesis of functional residues in 35 global regulators, efflux pumps and metabolic enzymes implicated in a wide range of tolerance and production phenotypes in E. coli 3) a reconstruction of the complete set of nonsynonymous mutations identified by a recent adaptive laboratory evolution (ALE) study of thermotolerance, and 4) promoter engineering libraries designed to incorporate UP elements or CAP binding elements at transcription start sites annotated in RegulonDB (e.g.
[0324] The pooled oligo libraries were amplified and cloned in parallel and a subset of single variants were isolated to further characterize editing efficiency at different loci (e.g.
[0325] To further characterize the fidelity of the multiplexed synthesis and cloning procedures we performed deep sequencing on the pooled libraries (e.g.
[0326] CREATE Based Protein Engineering
[0327] To test the robustness of the CREATE methodology for protein engineering at a single gene level we performed deep-scanning mutagenesis of the essential folA gene. This gene encodes the dihydrofolate reductase (DHFR) enzyme responsible for the production of tetrahydrofolate and the biosynthesis of pyrimidines, purines and nucleic acids. DHFR is also the primary target of the antibiotic trimethoprim (TMP) and other antifolates that are used as antibiotics or chemotherapeutics. The wealth of structural and biochemical data DHFR function and antibiotic resistance make it an ideal model for validation of the approach.
[0328] A CREATE library designed to saturate every codon from 2-158 of the DHFR enzyme was recombineered into E. coli MG1655 and allowed to recover overnight. Following recovery ˜10.sup.9 cells (1 mL saturated culture) was transferred into media containing inhibitory TMP concentrations and allowed to grow for 48 hours. The resulting plasmid populations were then sequenced to assess our ability to capture information at the level of single amino acid substitutions that can confer TMP resistance (e.g.
[0329] These results also support the fact that we probe more deeply into the mutation space of improved fitness variants using rational mutagenesis strategies. For example, we observed 7 significantly enriched substitutions at position F153 (e.g.
[0330] In addition to mapping substitutions that confer TMP resistance, we also attempted to identify substitutions that affect the native activity of DHFR. To do so, we compared the frequencies of each plasmid variant after overnight growth in M9 (e.g.
[0331] As a separate validation of protein engineering applications, we generated a 4,240 variant library targeting the AcrB multidrug efflux pump in E. coli (e.g.
[0332] Parallel Evaluation of Genotype Fitness from Large Scale Adaptation Studies
[0333] We next sought to expand our efforts from the single protein scale and validate the use of CREATE at the genome-scale. To do so we chose to reconstruct and map mutations resulting from a prior adaptive laboratory evolution study of E. coli thermal tolerance. ALE has been used extensively as a tool to study the bacterial adaptation in response to a broad range of environmental stressors. However, in the majority of cases the genome undergoes multiple mutations making it difficult to assess the contribution of each mutation to the phenotype in question. Here, we designed and constructed a CREATE library to include all 645 nonsynonymous mutants from the Tenaillon et al ALE experiment and then subjected this library to growth selection in minimal media at 42.2° C. To assess any possible effects that could arise from the synonymous PAM mutation we included redundancy in the design of this library such that each target codon was coupled to two different PAM mutations to provide a 4 fold design redundancy for each nonsynonymous mutation. For calibration purposes the ALE library was pooled with the protein targeting libraries to allow for relative enrichment comparisons from the non-ALE derived libraries as a benchmark (e.g.
[0334] For each mutant we also calculated the number of mutations required to convert the wt codon to each of the other 19 amino acids (e.g.
[0335] High-Throughput Mapping of Selectable Precision Edits on a Genome Wide Scale
[0336] To further validate the method for genome-scale mapping and exploration we challenged genome wide targeting libraries with antibiotics or solvents relevant to bioproduction (e.g.
[0337] To compare these results to an antibiotic that interferes with translation we performed another round of selections in the presence of erythromycin (e.g. outer circle
[0338] Finally, we performed selections using furfural or acetate, common components of cellulosic hydrolysate that inhibit bacterial growth under industrial fermentation conditions and are thus the target of many strain engineering efforts (e.g.
[0339] In contrast to the weak acid tolerance of acetate, the enrichment profiles obtained the presence of growth inhibiting concentrations of furfural (2 g/L) were significantly different with the most frequently observed mutations targeting the oxidative stress response regulator rpoS (e.g.
[0340] Collectively, these selections validate the CREATE strategy by demonstrating the ability to map known associations as well as highlight power of this method for rapid mapping of novel mutations to traits of interest. It is also important to note that in contrast to the most other functional genomics technologies that mainly identify loss of function mutations, the ability to perform such broad scale scanning mutagenesis opens the door for more general genomic searches that can also identify novel gain of function mutations.
[0341] In this work we have demonstrated that CREATE allows parallel mapping of tens of thousands of amino acid and promoter mutations in a single experiment. The construction, selection, and mapping of >50,000 genome-wide mutations (e.g.
[0342] Notably, as a further distinction from prior approaches, the high efficiency mutagenesis (e.g.
[0343] The CREATE strategy provides a streamlined approach for sequence to activity mapping and directed evolution by integrating multiplexed oligo synthesis, CRISPR-CAS editing, and high-throughput sequencing.
Example 28—Genome-Scale Sequence to Activity Relationship Mapping at Single Nucleotide Resolution, Additional Examples
[0344] Possible Effects of Inconsistent Mapping of Plasmid Barcode to Genomic Edit
[0345] We note that the initial CREATE library included designs that we would expect to have low confidence mapping between the plasmid barcode and the genomic edit (as explained primarily by distance between the PAM and target mutation in the CREATE cassette, see
[0346] Tracking of High Fitness Variants (Positive Enrichment Tracking)
[0347] In cases where there is a strong selective advantage for the genomic modification (and thus the associated plasmid) we will only observe cells with the edit in the chromosome post selection. Thus, this is almost always a true positive particularly when selection times are short, thus limiting the possibility of random mutations due to replication error sweeping the population. While this phenomenon may lead to a quantitative underestimation of the true fitness of a mutation due to an enrichment profile that represents the convolution of modified and wt fitness, it will not produce false positives. Moreover, the use of replicated experiments and/or longer selections can also address this potential issue and eliminate erroneous conclusions regarding a mutations impact on fitness.
[0348] Tracking of Low Fitness Variants (Negative Enrichment Tracking)
[0349] In cases where the encoded mutation has a negative fitness contribution but is linked to a PAM only or unmodified chromosome we would incorrectly overestimate the fitness of the mutant and assume that it is closer to wt, especially for longer selection times (e.g. see
[0350] Incomplete Coverage
[0351] In cases where a variant is not present in the initial population (due to both low transformation efficiency and low editing efficiency) a couple of scenarios could arise. As implied by the points above, if the mutation is beneficial one could falsely conclude that it does not confer a fitness advantage, and if it is truly deleterious it also could be incorrectly assigned a neutral fitness score. This appears to be encountered sometimes in this work and impacts both the error associated with replicate measurements and our ability to distinguish low fitness variants from a synonymous control. However, our ability to identify beneficial mutants is robust despite these issues as evidenced by our ability to readily identify novel and previously validated mutations. Strategies to address this by overcoming Cas9 toxicity and improving recombineering efficiencies hold promise to largely eliminate such problems. Furthermore, increasing the number of replicates, increasing sequencing depth, and/or improving the library coverage by performing larger scale transformation also can help to address these issues.
[0352] Off Target gRNA Cleavage
[0353] Off target gRNA cleavage should be rare in E. coli due to the relatively small size of its genome (4 Mb), and thus lack of (non-targeted) regions of homology to the CREATE cassette. Moreover, the toxicity of gRNAs in the presence of Cas9 (e.g.
[0354] Random Off Target Mutagenesis (Evolution)
[0355] The probability that a CREATE variant is strongly enriched due to an off target mutation even is highly improbable due to 2 factors: 1) the toxicity effect for the reasons stated above and 2) the low mutation rates of MG1655 or other mutation repair proficient strains compared with the mutagenesis rates of CREATE, particularly in multiple replicates of selection. We also have validated that we can transfer the plasmid pool back into a naive parental background and rapidly verify the enrichment of fitness improving CREATE plasmids from the initial population. Like replicate data, this allows us to decouple each CREATE plasmid from the potential of background mutations that would interfere with our analysis. These factors simplify the assumptions made during our analysis, the validity of which is supported both by externally and internally validated genotypes that were identified during this work.
[0356] Possible Effects of Synonymous Mutations
[0357] Synonymous mutations (e.g. in the PAM region) can confer unexpected effects on phenotype. We have controlled for this in a number of manners. In every experiment we included an internal control that consists of a library of synonymous mutations (1/20 at each codon or 5% of total input), each of which samples different PAM and codon combinations and thus give us an idea of the range of possible effects we may have on a gene by measuring the enrichment profile of many synonymous changes. Using this population as a control we can accurately identify significant fitness changes at the resolution of single amino acids as the work suggests. We can also control for this effect by utilizing redundant sampling approaches where a site is coupled to multiple PAM mutations similar to what was done for the ALE study described herein.
[0358] CREATE Library Design Considerations
[0359] A variety of design principles were implemented in the gene targeting libraries described in some work disclosed herein. For example, the folA library (3140 cassettes) was designed to be an unbiased, exploratory library for full single site saturation mutagenesis and sequence activity. However, for the majority of the genes we sought to maximize the probability of interesting genotypes by choosing to focus the diversity of sites most likely to have a functional impact on the targeted protein (e.g. DNA binding sites, active sites, regions identified as mutational hotspots by previous selections). The sites that were included in these library designs were selected based on information deposited in databases including Ecocyc (biocyc.org/), Uniprot (uniprot.org/), and the PDB (rcsb.org/pdb) as well as relevant literature citations that identified residues or regions of interest using directed evolution approaches. The Uniprot and Ecocyc databases provide manually curated sequence features that indicate mutational effects and important domains of each protein. In cases where there was enough structural information to model ligand or DNA binding sites the relevant crystal structures were loaded into Pymol and manual residue selections were made and exported as numerical lists. For promoter libraries we took into account the spacing of these sites relative to the transcription start site and the canonical recognition sequence of either the CRP binding site (AAATGTGAtctagaTCACATTT located between −72 and −40 relative to the transcription start site) or the UP element (AAAATTTTTTTTCAAAAGTA-60 from the transcription start site) that directly recruit the alpha subunit of the RNA polymerase. These sequences were designed to integrate at these positions relative to the publicly available transcriptional start site annotations in RegulonDB using a variation of the automated CREATE design software designed for protein targeting (e.g.
[0360] Cassette Design and Automation Principles
[0361] Based on the control experiments with galK (e.g.
[0362] Design of the CREATE cassettes was automated using custom Python scripts. The basic algorithm takes a gene sequence, a list of target residues, and a list of codons as inputs. The gene sequence is searched for all available PAM sites with the corresponding spacer sequence. This list is then sorted according to relative proximity to the targeted codon position. For each PAM site in the initial list the algorithm checks for synonymous mutations that can be made in-frame that also directly disrupt the PAM site, in the event that this condition is met the algorithm proceeds to making the prescribed codon change and designing the full CREATE cassette with the accompanying spacer and iterates for each input codon and position respectively. For each PAM mutation, all possible synonymous codon substitutions are checked before proceeding to the next PAM site. For the codon saturation libraries in this study we chose the most frequent codons (genscript.com/cgi-bin/tools/codon_freq table) for each designed amino acid substitution according to the E. coli usage statistics. The script can be run rapidly on a laptop computer and was used to generate the full design of these libraries in <10 minutes. The algorithm used in this study was designed to make the most conservative mutations possible by sometimes using only the PAM as the selectable mutation marker.
[0363] Plasmids
[0364] The X2-cas9 broad host range vector was constructed by amplifying the cas9 gene from genomic S. pyogenes DNA into the pBTBX2 backbone (Lucigen). A vector map and sequence of this vector and the galK_Y145*_120/17 CREATE cassette are provided at the following locations: benchling.com/s/3c941j/edit; benchling.com/s/xRBDwcMy/edit. The editing experiments performed in some of this work employed the X2-cas9 vector in combination with the pSIM5 vector (redrecombineering.ncifcrf.gov/strains--plasmids.html) to achieve the reported efficiencies.
[0365] Recombineering of CREATE Libraries
[0366] Genomic libraries were prepared by transforming CREATE plasmid libraries into a wildtype E. coli MG1655 strain carrying the temperature sensitive pSIM5 plasmid (lambda RED) and a broad host range plasmid containing an inducible cas9 gene from cloned from S. pyogenes genomic DNA into the pBTBX-2 backbone (X2cas9, e.g.
[0367] For the control experiments with galK we used CREATE cassettes designed to convert Y145 (TAT) into a stop codon (TAA) with a single point mutation at this position and a second point mutation to make a synonymous mutation that abolishes the targeted PAM site (e.g.
[0368] Selection Procedures
[0369] Following overnight recovery, the cells were harvested by pelleting and resuspension in fresh selection media. All selections were performed in shake flask and inoculated at an initial OD600 of 0.1. Three serial dilutions (48-96 hrs depending on growth rates in the target condition) were carried out for each selection by transferring 1/100th the media volume after the cultures reached stationary phase. The 42° C. selections were performed in M9 media+0.2% glucose to mimic low carbon availability from the initial adaptation. Antibiotic selections were carried out in LB+500 μg/mL rifampicin or erythromycin to ensure stringent selection. The solvent selections were performed in M9+0.4% glucose and either 10 g/L acetate (unbuffered) or 2 g/L furfural. Selections were harvested by pelleting 1 mL of the final culture and the cell pellet was boiled in 100 μL TE buffer to preserve both the plasmid and the genomic DNA for further desired analyses.
[0370] Library Preparation and Sequencing
[0371] Custom Illumina compatible primers were designed to allow a single amplification step from the CREATE plasmid and assignment of experimental reads using barcodes. The CREATE cassettes were amplified directly from the plasmid sequences of boiled cell lysates using 20 cycles of PCR with the Phusion (NEB) polymerase using 60° C. annealing and 1:30 minute extension times. As in the cloning procedure a minimal number of PCR cycles was maintained to prevent accumulation of mutations and recombined CREATE cassettes that were observed when an excessive number of PCR cycles was implemented (e.g. >25-30). Amplified fragments were verified and quantified by 1% agarose gel electrophoresis and pooled according to the desired read depth for each sample. The pooled library was cleaned using Qiaquick PCR cleanup kit and processed for NGS using standard Illumina preparation kits. The Illumina sequencing and sample preparation were performed with the primers.
[0372] Preprocessing of High-Throughput Sequencing and Count Generation
[0373] Paired-end Illumina sequencing reads were sorted according to the golay barcode index with allowance of up to 3 mismatches then merged using the usearch-fastq_merge algorithm. Sorted reads were then matched against the database of designed CREATE cassettes using the usearch_global algorithm at an identity threshold of 90% allowing up to 60 possible hits for each read. The resulting hits were further sorted according to percent identity and read assignment was made using the best matching CREATE cassette design at a final cutoff 98% identity to the initial design. It should be noted that this read assignment strategy attempts to identify correlations between the designed genotypes and may therefore miss other important features that arise due to mutations that could occur during the experimental procedure. This approach was taken both to simplify data analysis as well as evaluate the ‘forward’ design and annotation procedure and it's ability to accurately identify meaningful genetic phenomena.
[0374] Data Analysis and Fitness Calculation
[0375] Enrichment scores (or absolute fitness scores) were calculated as the log 2 enrichment score using the following equation:
where F.sub.x,f is the frequency of cassette X at the final time point and F.sub.x,i is the initial frequency of cassette X and W is the absolute fitness of each variant. Frequencies were determined by dividing the read counts for each variant by the total experimental counts including those that were lost to filtering. Each selection was performed in duplicate and the count weighted average of the two measurements was used to infer the average fitness score of each mutation as follows:
[0376] These scores were used to rank and assess the fitness contributions of each mutation under the various selection pressures investigated. For all selections we took average absolute fitness scores for all of the synonymous mutants as a composite measure of the average growth rate. Absolute enrichment scores were considered significant if the mutant enrichment was at least +/−2*σ (e.g. p=0.05 assuming a normal distribution) of the wild-type value. We performed two replicates of each selection reported in this study to derive these figures and applied a cutoff threshold of 10 across the replicate experiments for inclusion in each analysis.
[0377] For every codon targeted our designs also included a synonymous variant to provide an internal experimental control. Thus 5% of the protein targeting cassettes encoded synonymous mutations that allow us to estimate confidence intervals for mutation effects using custom Python bootstrapping scripts. The enrichment data for each experiment was resampled with replacement 20000 to obtain 95% confidence interval estimations that were used to infer statistical significance of enrichment scores for each analysis presented in the manuscript.
[0378] Mutant Reconstructions and Growth Measurements
[0379] The AcrB T60N and Crp S28P and FolA F153R/W CREATE cassettes were ordered as separate gblocks from IDT, cloned and sequence verified. Each cassette was transformed into MG1655 and colony screened to identify a clone with the designed genomic edit. These strains (e.g.
[0380] Software and Figure Generation
[0381] Circle plots were generated using Circos v0.67. Plots were generated in Python 2.7 using the matplotlib plotting libraries and figures were made using Adobe Illustrator CS5. Entropy scores for the FolA (
[0382] Figures of the protein libraries and high fitness mutations were made using The PyMol Molecular Graphics System, Schrodinger, LLC. The following are the proteins and PDBs used in the figure generation: AcrB (3W9H, 4K7Q, 3AOC), Fis (3JR9), Ihf (ulHF), RNA polymerase (4KMU, 4IGC), Crp (3N4M), MarA (1BLO), and SoxR (2ZHG).
Example 29: Testing Edit-Barcode Correlation
[0383] A strain expressing a low copy number plasmid (Ec23) which is a Cas9-pSIM5 dual vector, was tested using different gene editing cassettes (lacZ, xylA, and rhaA) and recorder cassettes with different barcodes and insertion sites (galK site 1, galK site 2, and galK site 3) (Summarized in
[0384] The transformations were plated on selective media that allowed for enrichment of cells containing the gene edits. 30 colonies from each combination transformation were sequenced to determine if they contained the desired barcode.
[0385]
[0386] Overall, 89 out of 90 tested colonies has the designed gene edit and barcode.
Example 30: Selectable Recording
[0387] When a barcode is not selected for, it allows for enrichment of non-barcoded cells even if the corresponding gene edit is incorporated and selected for.
[0388] As depicted in
[0389] This strategy uses multiple methods to increase the efficiency of isolating desired variants that contain all of the engineered edits from each round of engineering. The PAM mutation, selectable marker switch, and unique landing site incorporated in each round separately increase efficiency and together increase efficiency as well. These tools allow for selection of each recording round and allow design of highly active recording guide RNAs. An array of equally spaced (or not equally spaced, depending on the design) barcodes is generated and facilitates downstream analysis such as sequencing the barcode array to determine which corresponding edits are incorporated throughout the genome.
[0390]
[0391] The data in
Example 31: Expression of MAD Nucleases
[0392] Wild-type nucleic acid sequences for MAD1-MAD20 include SEQ ID NOs 21-40, respectively. These MAD nucleases were codon optimized for expression in E. coli and the codon optimized sequences are listed as SEQ ID NO: 41-60, respectively (summarized in Table 2). Codon optimized MAD1-MAD20 were cloned into an expression construct comprising a constitutive or inducible promoter (e.g., T7 promoter SEQ ID NO: 83, or pBAD promoter SEQ ID NO: 81 or SEQ ID NO: 82) and an optional 6×-His tag. The generated MAD1-MAD20 expression constructs are provided as SEQ ID NOs: 61-80, respectively.
TABLE-US-00002 TABLE 2 Codon optimized WT nucleic acid nucleic acid Amino acid Expression MAD nuclease sequence sequence sequence constructs MAD1 SEQ ID NO: 21 SEQ ID NO: 41 SEQ ID NO: 1 SEQ ID NO: 61 MAD2 SEQ ID NO: 22 SEQ ID NO: 42 SEQ ID NO: 2 SEQ ID NO: 62 MAD3 SEQ ID NO: 23 SEQ ID NO: 43 SEQ ID NO: 3 SEQ ID NO: 63 MAD4 SEQ ID NO: 24 SEQ ID NO: 44 SEQ ID NO: 4 SEQ ID NO: 64 MAD5 SEQ ID NO: 25 SEQ ID NO: 45 SEQ ID NO: 5 SEQ ID NO: 65 MAD6 SEQ ID NO: 26 SEQ ID NO: 46 SEQ ID NO: 6 SEQ ID NO: 66 MAD7 SEQ ID NO: 27 SEQ ID NO: 47 SEQ ID NO: 7 SEQ ID NO: 67 MAD8 SEQ ID NO: 28 SEQ ID NO: 48 SEQ ID NO: 8 SEQ ID NO: 68 MAD9 SEQ ID NO: 29 SEQ ID NO: 49 SEQ ID NO: 9 SEQ ID NO: 69 MAD10 SEQ ID NO: 30 SEQ ID NO: 50 SEQ ID NO: 10 SEQ ID NO: 70 MAD11 SEQ ID NO: 31 SEQ ID NO: 51 SEQ ID NO: 11 SEQ ID NO: 71 MAD12 SEQ ID NO: 32 SEQ ID NO: 52 SEQ ID NO: 12 SEQ ID NO: 72 MAD13 SEQ ID NO: 33 SEQ ID NO: 53 SEQ ID NO: 13 SEQ ID NO: 73 MAD14 SEQ ID NO: 34 SEQ ID NO: 54 SEQ ID NO: 14 SEQ ID NO: 74 MAD15 SEQ ID NO: 35 SEQ ID NO: 55 SEQ ID NO: 15 SEQ ID NO: 75 MAD16 SEQ ID NO: 36 SEQ ID NO: 56 SEQ ID NO: 16 SEQ ID NO: 76 MAD17 SEQ ID NO: 37 SEQ ID NO: 57 SEQ ID NO: 17 SEQ ID NO: 77 MAD18 SEQ ID NO: 38 SEQ ID NO: 58 SEQ ID NO: 18 SEQ ID NO: 78 MAD19 SEQ ID NO: 39 SEQ ID NO: 59 SEQ ID NO: 19 SEQ ID NO: 79 MAD20 SEQ ID NO: 40 SEQ ID NO: 60 SEQ ID NO: 20 SEQ ID NO: 80
Example 32: MAD2 and MAD7 Nucleases
[0393] MAD2 and MAD7 nucleases are nucleic acid-guided nuclease that can be used in the methods disclosed herein. Nucleases Mad2 (SEQ ID NO: 2) and Mad 7 (SEQ ID NO: 7) were cloned and transformed into cells. Editing cassettes designed to mutate a target site in a galK gene were designed with mutations, which allowed for white/red screening of successfully editing colonies. The editing cassettes also encoded a guide nucleic acid designed to target galK. The editing cassettes were transformed into E. coli cells expressing MAD2, MAD7, or Cas9.
[0394]
[0395]
[0396] Further details and characterization of MAD2, MAD7, and other MAD nucleases are described in U.S. application Ser. No. 15/631,989, filed Jun. 23, 2017, and U.S. application Ser. No. 15/632,001, filed Jun. 23, 2017, each of which are incorporated herein in their entirety.
TABLE-US-00003 TABLE 3 Nucleic acid- Editing guided Guide nucleic acid scaffold sequence Target # nuclease sequence mutation gene 1 MAD2 Scaffold-12; SEQ ID NO: 95 N89KpnI galK 2 MAD2 Scaffold-10; SEQ ID NO: 93 L80** galK 3 MAD2 Scaffold-5; SEQ ID NO: 88 L80** galK 4 MAD2 Scaffold-12; SEQ ID NO: 95 D70KpnI galK 5 MAD2 Scaffold-12; SEQ ID NO: 95 Y145** galK 6 MAD2 Scaffold-11; SEQ ID NO: 94 Y145** galK 7 MAD2 Scaffold-10; SEQ ID NO: 93 Y145** galK 8 MAD2 Scaffold-12; SEQ ID NO: 95 L10KpnI galK 9 MAD2 Scaffold-11; SEQ ID NO: 94 L80** galK 10 SpCas9 S. pyogenese gRNA Y145** galK 11 MAD2 Scaffold-2; SEQ ID NO: 85 Y145** galK 12 MAD2 Scaffold-4; SEQ ID NO: 87 Y145** galK 13 MAD2 Scaffold-1; SEQ ID NO: 84 L80** galK 14 MAD2 Scaffold-13; SEQ ID NO: 96 Y145** galK
TABLE-US-00004 TABLE 4 Nucleic acid- Editing guided Guide nucleic acid scaffold sequence Target # nuclease sequence mutation gene 1 MAD7 Scaffold-1; SEQ ID NO: 84 L80** galK 2 MAD7 Scaffold-2; SEQ ID NO: 85 Y145** galK 3 MAD7 Scaffold-4; SEQ ID NO: 87 Y145** galK 4 MAD7 Scaffold-10; SEQ ID NO: 93 Y145** galK 5 MAD7 Scaffold-11; SEQ ID NO: 95 L80** galK
[0397] While preferred embodiments of the present invention have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the invention. It should be understood that various alternatives to the embodiments of the invention described herein may be employed in practicing the invention. It is intended that the following claims define the scope of the invention and that methods and structures within the scope of these claims and their equivalents be covered thereby.
SEQUENCE LISTING
[0398]
TABLE-US-00005 TABLE 5 SEQ ID NO Sequence SEQ MGKMYYLGLDIGTNSVGYAVTDPSYHLLKFKGEPMWGAHVFAAGNQSAERRSFRTSRRRLDRRQQRVKLV ID QEIFAPVISPIDPRFFIRLHESALWRDDVAETDKHIFFNDPTYTDKEYYSDYPTIHHLIVDLMESSEKHDPRLVY NO: LAVAWLVAHRGHFLNEVDKDNIGDVLSFDAFYPEFLAFLSDNGVSPWVCESKALQATLLSRNSVNDKYKAL 1 KSLIFGSQKPEDNFDANISEDGLIQLLAGKKVKVNKLFPQESNDASFTLNDKEDAIEEILGTLTPDECEWIAHIR RLFDWAIMKHALKDGRTISESKVKLYEQHHHDLTQLKYFVKTYLAKEYDDIFRNVDSETTKNYVAYSYHVK EVKGTLPKNKATQEEFCKYVLGKVKNIECSEADKVDFDEMIQRLTDNSFMPKQVSGENRVIPYQLYYYELKT ILNKAASYLPFLTQCGKDAISNQDKLLSIMTFRIPYFVGPLRKDNSEHAWLERKAGKIYPWNFNDKVDLDKSE EAFIRRMTNTCTYYPGEDVLPLDSLIYEKFMILNEINNIRIDGYPISVDVKQQVFGLFEKKRRVTVKDIQNLLLS LGALDKHGKLTGIDTTIHSNYNTYHHFKSLMERGVLTRDDVERIVERMTYSDDTKRVRLWLNNNYGTLTAD DVKHISRLRKHDFGRLSKMFLTGLKGVHKETGERASILDFMWNTNDNLMQLLSECYTFSDEITKLQEAYYA KAQLSLNDFLDSMYISNAVKRPIYRTLAVVNDIRKACGTAPKRIFIEMARDGESKKKRSVTRREQIKNLYRSIR KDFQQEVDFLEKILENKSDGQLQSDALYLYFAQLGRDMYTGDPIKLEHIKDQSFYNIDHIYPQSMVKDDSLD NKVLVQSEINGEKSSRYPLDAAIRNKMKPLWDAYYNHGLISLKKYQRLTRSTPFTDDEKWDFINRQLVETRQ STKALAILLKRKFPDTEIVYSKAGLSSDFRHEFGLVKSRNINDLHHAKDAFLAIVTGNVYHERFNRRWFMVN QPYSVKTKTLFTHSIKNGNFVAWNGEEDLGRIVKMLKQNKNTIHFTRFSFDRKEGLFDIQPLKASTGLVPRKA GLDVVKYGGYDKSTAAYYLLVRFTLEDKKTQHKLMMIPVEGLYKARIDHDKEFLTDYAQTTISEILQKDKQ KVINIMFPMGTRHIKLNSMISIDGFYLSIGGKSSKGKSVLCHAMVPLIVPHKIECYIKAMESFARKFKENNKLRI VEKFDKITVEDNLNLYELFLQKLQHNPYNKFFSTQFDVLTNGRSTFTKLSPEEQVQTLLNILSIFKTCRSSGCD LKSINGSAQAARIMISADLTGLSKKYSDIRLVEQSASGLFVSKSQNLLEYL* SEQ MSSLTKFTNKYSKQLTIKNELIPVGKTLENIKENGLIDGDEQLNENYQKAKIIVDDFLRDFINKALNNTQIGNW ID RELADALNKEDEDNIEKLQDKIRGIIVSKFETFDLFSSYSIKKDEKIIDDDNDVEEEELDLGKKTSSFKYIFKKN NO: LFKLVLPSYLKTTNQDKLKIISSFDNFSTYFRGFFENRKNIFTKKPISTSIAYRIVHDNFPKFLDNIRCFNVWQTE 2 CPQLIVKADNYLKSKNVIAKDKSLANYFTVGAYDYFLSQNGIDFYNNIIGGLPAFAGHEKIQGLNEFINQECQ KDSELKSKLKNRHAFKMAVLFKQILSDREKSFVIDEFESDAQVIDAVKNFYAEQCKDNNVIFNLLNLIKNIAF LSDDELDGIFIEGKYLSSVSQKLYSDWSKLRNDIEDSANSKQGNKELAKKIKTNKGDVEKAISKYEFSLSELNS IVHDNTKFSDLLSCTLHKVASEKLVKVNEGDWPKHLKNNEEKQKIKEPLDALLEIYNTLLIFNCKSFNKNGNF YVDYDRCINELSSVVYLYNKTRNYCTKKPYNTDKFKLNFNSPQLGEGFSKSKENDCLTLLFKKDDNYYVGII RKGAKINFDDTQAIADNTDNCIFKMNYFLLKDAKKFIPKCSIQLKEVKAHFKKSEDDYILSDKEKFASPLVIKK STFLLATAHVKGKKGNIKKFQKEYSKENPTEYRNSLNEWIAFCKEFLKTYKAATIFDITTLKKAEEYADIVEF YKDVDNLCYKLEFCPIKTSFIENLIDNGDLYLFRINNKDFSSKSTGTKNLHTLYLQAIFDERNLNNPTIMLNGG AELFYRKESIEQKNRITHKAGSILVNKVCKDGTSLDDKIRNEIYQYENKFIDTLSDEAKKVLPNVIKKEATHDI TKDKRFTSDKFFFHCPLTINYKEGDTKQFNNEVLSFLRGNPDINIIGIDRGERNLIYVTVINQKGEILDSVSFNT VTNKSSKIEQTVDYEEKLAVREKERIEAKRSWDSISKIATLKEGYLSAIVHEICLLMIKHNAIVVLENLNAGFK RIRGGLSEKSVYQKFEKMLINKLNYFVSKKESDWNKPSGLLNGLQLSDQFESFEKLGIQSGFIFYVPAAYTSKI DPTTGFANVLNLSKVRNVDAIKSFFSNFNEISYSKKEALFKFSFDLDSLSKKGFSSFVKFSKSKWNVYTFGERII KPKNKQGYREDKRINLTFEMKKLLNEYKVSFDLENNLIPNLTSANLKDTFWKELFFIFKTTLQLRNSVTNGKE DVLISPVKNAKGEFFVSGTHNKTLPQDCDANGAYHIALKGLMILERNNLVREEKDTKKIMAISNVDWFEYVQ KRRGVL* SEQ MNNYDEFTKLYPIQKTIRFELKPQGRTMEHLETFNFFEEDRDRAEKYKILKEAIDEYHKKFIDEHLTNMSLDW ID NSLKQISEKYYKSREEKDKKVFLSEQKRMRQEIVSEFKKDDRFKDLFSKKLFSELLKEEIYKKGNHQEIDALK NO: SFDKFSGYFIGLHENRKNMYSDGDEITAISNRIVNENFPKFLDNLQKYQEARKKYPEWIIKAESALVAHNIKM 3 DEVFSLEYFNKVLNQEGIQRYNLALGGYVTKSGEKMMGLNDALNLAHQSEKSSKGRIHMTPLFKQILSEKES FSYIPDVFTEDSQLLPSIGGFFAQIENDKDGNIFDRALELISSYAEYDTERIYIRQADINRVSNVIFGEWGTLGGL MREYKADSINDINLERTCKKVDKWLDSKEFALSDVLEAIKRTGNNDAFNEYISKMRTAREKIDAARKEMKFI SEKISGDEESIHIIKTLLDSVQQFLHFFNLFKARQDIPLDGAFYAEFDEVHSKLFAIVPLYNKVRNYLTKNNLNT KKIKLNFKNPTLANGWDQNKVYDYASLIFLRDGNYYLGIINPKRKKNIKFEQGSGNGPFYRKMVYKQIPGPN KNLPRVFLTSTKGKKEYKPSKEIIEGYEADKHIRGDKFDLDFCHKLIDFFKESIEKHKDWSKFNFYFSPTESYG DISEFYLDVEKQGYRMHFENISAETIDEYVEKGDLFLFQIYNKDFVKAATGKKDMHTIYWNAAFSPENLQDV VVKLNGEAELFYRDKSDIKEIVHREGEILVNRTYNGRTPVPDKIHKKLTDYHNGRTKDLGEAKEYLDKVRYF KAHYDITKDRRYLNDKIYFHVPLTLNFKANGKKNLNKMVIEKFLSDEKAHIIGIDRGERNLLYYSIIDRSGKII DQQSLNVIDGFDYREKLNQREIEMKDARQSWNAIGKIKDLKEGYLSKAVHEITKMAIQYNAIVVMEELNYGF KRGRFKVEKQIYQKFENMLIDKMNYLVFKDAPDESPGGVLNAYQLTNPLESFAKLGKQTGILFYVPAAYTSK IDPTTGFVNLFNTSSKTNAQERKEFLQKFESISYSAKDGGIFAFAFDYRKFGTSKTDHKNVWTAYTNGERMR YIKEKKRNELFDPSKEIKEALTSSGIKYDGGQNILPDILRSNNNGLIYTMYSSFIAAIQMRVYDGKEDYIISPIKN SKGEFFRTDPKRRELPIDADANGAYNIALRGELTMRAIAEKFDPDSEKMAKLELKHKDWFEFMQTRGD* SEQ MTKTFDSEFFNLYSLQKTVRFELKPVGETASFVEDFKNEGLKRVVSEDERRAVDYQKVKEIIDDYHRDFIEES ID LNYFPEQVSKDALEQAFHLYQKLKAAKVEEREKALKEWEALQKKLREKVVKCFSDSNKARFSRIDKKELIK NO: EDLINWLVAQNREDDIPTVETFNNFTTYFTGFHENRKNIYSKDDHATAISFRLIHENLPKFFDNVISFNKLKEG 4 FPELKFDKVKEDLEVDYDLKHAFEIEYFVNFVTQAGIDQYNYLLGGKTLEDGTKKQGMNEQINLFKQQQTR DKARQIPKLIPLFKQILSERTESQSFIPKQFESDQELFDSLQKLHNNCQDKFTVLQQAILGLAEADLKKVFIKTS DLNALSNTIFGNYSVFSDALNLYKESLKTKKAQEAFEKLPAHSIHDLIQYLEQFNSSLDAEKQQSTDTVLNYFI KTDELYSRFIKSTSEAFTQVQPLFELEALSSKRRPPESEDEGAKGQEGFEQIKRIKAYLDTLMEAVHFAKPLYL VKGRKMIEGLDKDQSFYEAFEMAYQELESLIIPIYNKARSYLSRKPFKADKFKINFDNNTLLSGWDANKETAN ASILFKKDGLYYLGIMPKGKTFLFDYFVSSEDSEKLKQRRQKTAEEALAQDGESYFEKIRYKLLPGASKMLPK VFFSNKNIGFYNPSDDILRIRNTASHTKNGTPQKGHSKVEFNLNDCHKMIDFFKSSIQKHPEWGSFGFTFSDTS DFEDMSAFYREVENQGYVISFDKIKETYIQSQVEQGNLYLFQIYNKDFSPYSKGKPNLHTLYWKALFEEANL NNVVAKLNGEAEIFFRRHSIKASDKVVHPANQAIDNKNPHTEKTQSTFEYDLVKDKRYTQDKFFFHVPISLNF KAQGVSKFNDKVNGFLKGNPDVNIIGIDRGERHLLYFTVVNQKGEILVQESLNTLMSDKGHVNDYQQKLDK KEQERDAARKSWTTVENIKELKEGYLSHVVHKLAHLIIKYNAIVCLEDLNFGFKRGRFKVEKQVYQKFEKAL IDKLNYLVFKEKELGEVGHYLTAYQLTAPFESFKKLGKQSGILFYVPADYTSKIDPTTGFVNFLDLRYQSVEK AKQLLSDFNAIRFNSVQNYFEFEIDYKKLTPKRKVGTQSKWVICTYGDVRYQNRRNQKGHWETEEVNVTEK LKALFASDSKTTTVIDYANDDNLIDVILEQDKASFFKELLWLLKLTMTLRHSKIKSEDDFILSPVKNEQGEFYD SRKAGEVWPKDADANGAYHIALKGLWNLQQINQWEKGKTLNLAIKNQDWFSFIQEKPYQE* SEQ MHTGGLLSMDAKEFTGQYPLSKTLRFELRPIGRTWDNLEASGYLAEDRHRAECYPRAKELLDDNHRAFLNR ID VLPQIDMDWHPIAEAFCKVHKNPGNKELAQDYNLQLSKRRKEISAYLQDADGYKGLFAKPALDEAMKIAKE NO: NGNESDIEVLEAFNGFSVYFTGYHESRENIYSDEDMVSVAYRITEDNFPRFVSNALIFDKLNESHPDIISEVSGN 5 LGVDDIGKYFDVSNYNNFLSQAGIDDYNHIIGGHTTEDGLIQAFNVVLNLRHQKDPGFEKIQFKQLYKQILSV RTSKSYIPKQFDNSKEMVDCICDYVSKIEKSETVERALKLVRNISSFDLRGIFVNKKNLRILSNKLIGDWDAIET ALMHSSSSENDKKSVYDSAEAFTLDDIFSSVKKFSDASAEDIGNRAEDICRVISETAPFINDLRAVDLDSLNDD GYEAAVSKIRESLEPYMDLFHELEIFSVGDEFPKCAAFYSELEEVSEQLIEIIPLFNKARSFCTRKRYSTDKIKVN LKFPTLADGWDLNKERDNKAAILRKDGKYYLAILDMKKDLSSIRTSDEDESSFEKMEYKLLPSPVKMLPKIF VKSKAAKEKYGLTDRMLECYDKGMHKSGSAFDLGFCHELIDYYKRCIAEYPGWDVFDFKFRETSDYGSMK EFNEDVAGAGYYMSLRKIPCSEVYRLLDEKSIYLFQIYNKDYSENAHGNKNMHTMYWEGLFSPQNLESPVF KLSGGAELFFRKSSIPNDAKTVHPKGSVLVPRNDVNGRRIPDSIYRELTRYFNRGDCRISDEAKSYLDKVKTK KADHDIVKDRRFTVDKMMFHVPIAMNFKAISKPNLNKKVIDGIIDDQDLKIIGIDRGERNLIYVTMVDRKGNI LYQDSLNILNGYDYRKALDVREYDNKEARRNWTKVEGIRKMKEGYLSLAVSKLADMIIENNAIIVMEDLNH GFKAGRSKIEKQVYQKFESMLINKLGYMVLKDKSIDQSGGALHGYQLANHVTTLASVGKQCGVIFYIPAAFT SKIDPTTGFADLFALSNVKNVASMREFFSKMKSVIYDKAEGKFAFTFDYLDYNVKSECGRTLWTVYTVGERF TYSRVNREYVRKVPTDIIYDALQKAGISVEGDLRDRIAESDGDTLKSIFYAFKYALDMRVENREEDYIQSPVK NASGEFFCSKNAGKSLPQDSDANGAYNIALKGILQLRMLSEQYDPNAESIRLPLITNKAWLTFMQSGMKTWK N* SEQ MDSLKDFTNLYPVSKTLRFELKPVGKTLENIEKAGILKEDEHRAESYRRVKKIIDTYHKVFIDSSLENMAKMG ID IENEIKAMLQSFCELYKKDHRTEGEDKALDKIRAVLRGLIVGAFTGVCGRRENTVQNEKYESLFKEKLIKEILP NO: DFVLSTEAESLPFSVEEATRSLKEFDSFTSYFAGFYENRKNIYSTKPQSTAIAYRLIHENLPKFIDNILVFQKIKE 6 PIAKELEHIRADFSAGGYIKKDERLEDIFSLNYYIHVLSQAGIEKYNALIGKIVTEGDGEMKGLNEHINLYNQQ RGREDRLPLFRPLYKQILSDREQLSYLPESFEKDEELLRALKEFYDHIAEDILGRTQQLMTSISEYDLSRIYVRN DSQLTDISKKMLGDWNAIYMARERAYDHEQAPKRITAKYERDRIKALKGEESISLANLNSCIAFLDNVRDCR VDTYLSTLGQKEGPHGLSNLVENVFASYHEAEQLLSFPYPEENNLIQDKDNVVLIKNLLDNISDLQRFLKPLW GMGDEPDKDERFYGEYNYIRGALDQVIPLYNKVRNYLTRKPYSTRKVKLNFGNSQLLSGWDRNKEKDNSC VILRKGQNFYLAIMNNRHKRSFENKVLPEYKEGEPYFEKMDYKFLPDPNKMLPKVFLSKKGIEIYKPSPKLLE QYGHGTHKKGDTFSMDDLHELIDFFKHSIEAHEDWKQFGFKFSDTATYENVSSFYREVEDQGYKLSFRKVSE SYVYSLIDQGKLYLFQIYNKDFSPCSKGTPNLHTLYWRMLFDERNLADVIYKLDGKAEIFFREKSLKNDHPTH PAGKPIKKKSRQKKGEESLFEYDLVKDRHYTMDKFQFHVPITMNFKCSAGSKVNDMVNAHIREAKDMHVIG IDRGERNLLYICVIDSRGTILDQISLNTINDIDYHDLLESRDKDRQQERRNWQTIEGIKELKQGYLSQAVHRIAE LMVAYKAVVALEDLNMGFKRGRQKVESSVYQQFEKQLIDKLNYLVDKKKRPEDIGGLLRAYQFTAPFKSFK EMGKQNGFLFYIPAWNTSNIDPTTGFVNLFHAQYENVDKAKSFFQKFDSISYNPKKDWFEFAFDYKNFTKKA EGSRSMWILCTHGSRIKNFRNSQKNGQWDSEEFALTEAFKSLFVRYEIDYTADLKTAIVDEKQKDFFVDLLKL FKLTVQMRNSWKEKDLDYLISPVAGADGRFFDTREGNKSLPKDADANGAYNIALKGLWALRQIRQTSEGGK LKLAISNKEWLQFVQERSYEKD* SEQ MNNGTNNFQNFIGISSLQKTLRNALIPTETTQQFIVKNGIIKEDELRGENRQILKDIMDDYYRGFISETLSSIDDI ID DWTSLFEKMEIQLKNGDNKDTLIKEQTEYRKAIHKKFANDDRFKNMFSAKLISDILPEFVIHNNNYSASEKEE NO: KTQVIKLFSRFATSFKDYFKNRANCFSADDISSSSCHRIVNDNAEIFFSNALVYRRIVKSLSNDDINKISGDMKD 7 SLKEMSLEEIYSYEKYGEFITQEGISFYNDICGKVNSFMNLYCQKNKENKNLYKLQKLHKQILCIADTSYEVP YKFESDEEVYQSVNGFLDNISSKHIVERLRKIGDNYNGYNLDKIYIVSKFYESVSQKTYRDWETINTALEIHYN NILPGNGKSKADKVKKAVKNDLQKSITEINELVSNYKLCSDDNIKAETYIHEISHILNNFEAQELKYNPEIHLV ESELKASELKNVLDVIMNAFHWCSVFMTEELVDKDNNFYAELEEIYDEIYPVISLYNLVRNYVTQKPYSTKKI KLNFGIPTLADGWSKSKEYSNNAIILMRDNLYYLGIFNAKNKPDKKIIEGNTSENKGDYKKMIYNLLPGPNK MIPKVFLSSKTGVETYKPSAYILEGYKQNKHIKSSKDFDITFCHDLIDYFKNCIAIHPEWKNFGFDFSDTSTYED ISGFYREVELQGYKIDWTYISEKDIDLLQEKGQLYLFQIYNKDFSKKSTGNDNLHTMYLKNLFSEENLKDIVL KLNGEAEIFFRKSSIKNPIIHKKGSILVNRTYEAEEKDQFGNIQIVRKNIPENIYQELYKYFNDKSDKELSDEAA KLKNVVGHHEAATNIVKDYRYTYDKYFLHMPITINFKANKTGFINDRILQYIAKEKDLHVIGIDRGERNLIYV SVIDTCGNIVEQKSFNIVNGYDYQIKLKQQEGARQIARKEWKEIGKIKEIKEGYLSLVIHEISKMVIKYNAIIAM EDLSYGFKKGRFKVERQVYQKFETMLINKLNYLVFKDISITENGGLLKGYQLTYIPDKLKNVGHQCGCIFYVP AAYTSKIDPTTGFVNIFKFKDLTVDAKREFIKKFDSIRYDSEKNLFCFTFDYNNFITQNTVMSKSSWSVYTYGV RIKRRFVNGRFSNESDTIDITKDMEKTLEMTDINWRDGHDLRQDIIDYEIVQHIFEIFRLTVQMRNSLSELEDR DYDRLISPVLNENNIFYDSAKAGDALPKDADANGAYCIALKGLYEIKQITENWKEDGKFSRDKLKISNKDWF DFIQNKRYL* SEQ MTNKFTNQYSLSKTLRFELIPQGKTLEFIQEKGLLSQDKQRAESYQEMKKTIDKFHKYFIDLALSNAKLTHLE ID TYLELYNKSAETKKEQKFKDDLKKVQDNLRKEIVKSFSDGDAKSIFAILDKKELITVELEKWFENNEQKDIYF NO: DEKFKTFTTYFTGFHQNRKNMYSVEPNSTAIAYRLIHENLPKFLENAKAFEKIKQVESLQVNFRELMGEFGDE 8 GLIFVNELEEMFQINYYNDVLSQNGITIYNSIISGFTKNDIKYKGLNEYINNYNQTKDKKDRLPKLKQLYKQIL SDRISLSFLPDAFTDGKQVLKAIFDFYKINLLSYTIEGQEESQNLLLLIRQTIENLSSFDTQKIYLKNDTHLTTISQ QVFGDFSVFSTALNYWYETKVNPKFETEYSKANEKKREILDKAKAVFTKQDYFSIAFLQEVLSEYILTLDHTS DIVKKHSSNCIADYFKNHFVAKKENETDKTFDFIANITAKYQCIQGILENADQYEDELKQDQKLIDNLKFFLD AILELLHFIKPLHLKSESITEKDTAFYDVFENYYEALSLLTPLYNMVRNYVTQKPYSTEKIKLNFENAQLLNG WDANKEGDYLTTILKKDGNYFLAIMDKKHNKAFQKFPEGKENYEKMVYKLLPGVNKMLPKVFFSNKNIAY FNPSKELLENYKKETHKKGDTFNLEHCHTLIDFFKDSLNKHEDWKYFDFQFSETKSYQDLSGFYREVEHQGY KINFKNIDSEYIDGLVNEGKLFLFQIYSKDFSPFSKGKPNMHTLYWKALFEEQNLQNVIYKLNGQAEIFFRKAS IKPKNIILHKKKIKIAKKHFIDKKTKTSEIVPVQTIKNLNMYYQGKISEKELTQDDLRYIDNFSIFNEKNKTIDIIK DKRFTVDKFQFHVPITMNFKATGGSYINQTVLEYLQNNPEVKIIGLDRGERHLVYLTLIDQQGNILKQESLNTI TDSKISTPYHKLLDNKENERDLARKNWGTVENIKELKEGYISQVVHKIATLMLEENAIVVMEDLNFGFKRGR FKVEKQIYQKLEKMLIDKLNYLVLKDKQPQELGGLYNALQLTNKFESFQKMGKQSGFLFYVPAWNTSKIDP TTGFVNYFYTKYENVDKAKAFFEKFEAIRFNAEKKYFEFEVKKYSDFNPKAEGTQQAWTICTYGERIETKRQ KDQNNKFVSTPINLTEKIEDFLGKNQIVYGDGNCIKSQIASKDDKAFFETLLYWFKMTLQMRNSETRTDIDYL ISPVMNDNGTFYNSRDYEKLENPTLPKDADANGAYHIAKKGLMLLNKIDQADLTKKVDLSISNRDWLQFVQ KNK* SEQ MEQEYYLGLDMGTGSVGWAVTDSEYHVLRKHGKALWGVRLFESASTAEERRMFRTSRRRLDRRNWRIEIL ID QEIFAEEISKKDPGFFLRMKESKYYPEDKRDINGNCPELPYALFVDDDFTDKDYHKKFPTIYHLRKMLMNTEE NO: TPDIRLVYLAIHHMMKHRGHFLLSGDINEIKEFGTTFSKLLENIKNEELDWNLELGKEEYAVVESILKDNMLN 9 RSTKKTRLIKALKAKSICEKAVLNLLAGGTVKLSDIFGLEELNETERPKISFADNGYDDYIGEVENELGEQFYII ETAKAVYDWAVLVEILGKYTSISEAKVATYEKHKSDLQFLKKIVRKYLTKEEYKDIFVSTSDKLKNYSAYIG MTKINGKKVDLQSKRCSKEEFYDFIKKNVLKKLEGQPEYEYLKEELERETFLPKQVNRDNGVIPYQIHLYELK KILGNLRDKIDLIKENEDKLVQLFEFRIPYYVGPLNKIDDGKEGKFTWAVRKSNEKIYPWNFENVVDIEASAE KFIRRMTNKCTYLMGEDVLPKDSLLYSKYMVLNELNNVKLDGEKLSVELKQRLYTDVFCKYRKVTVKKIK NYLKCEGIISGNVEITGIDGDFKASLTAYHDFKEILTGTELAKKDKENIITNIVLFGDDKKLLKKRLNRLYPQIT PNQLKKICALSYTGWGRFSKKFLEEITAPDPETGEVWNIITALWESNNNLMQLLSNEYRFMEEVETYNMGKQ TKTLSYETVENMYVSPSVKRQIWQTLKIVKELEKVMKESPKRVFIEMAREKQESKRTESRKKQLIDLYKACK NEEKDWVKELGDQEEQKLRSDKLYLYYTQKGRCMYSGEVIELKDLWDNTKYDIDHIYPQSKTMDDSLNNR VLVKKKYNATKSDKYPLNENIRHERKGFWKSLLDGGFISKEKYERLIRNTELSPEELAGFIERQIVETRQSTKA VAEILKQVFPESEIVYVKAGTVSRFRKDFELLKVREVNDLHHAKDAYLNIVVGNSYYVKFTKNASWFIKENP GRTYNLKKMFTSGWNIERNGEVAWEVGKKGTIVTVKQIMNKNNILVTRQVHEAKGGLFDQQIMKKGKGQI AIKETDERLASIEKYGGYNKAAGAYFMLVESKDKKGKTIRTIEFIPLYLKNKIESDESIALNFLEKGRGLKEPKI LLKKIKIDTLFDVDGFKMWLSGRTGDRLLFKCANQLILDEKIIVTMKKIVKFIQRRQENRELKLSDKDGIDNE VLMEIYNTFVDKLENTVYRIRLSEQAKTLIDKQKEFERLSLEDKSSTLFEILHIFQCQSSAANLKMIGGPGKAGI LVMNNNISKCNKISIINQSPTGIFENEIDLLK SEQ MNKFENFTGLYPISKTLRFELIPQGKTLEYIEKSEILENDNYRAEKYEEVKDIIDGYHKWFINETLHDLHINWSE ID LKVALENNRIEKSDASKKELQRVQKIKREEIYNAFIEHEAFQYLFKENLLSDLLPIQIEQSEDLDAEKKKQAVE NO: TFNRFSTYFTGFHENRKNIYSKEGISTSVTYRIVHDNFPKFLENMKVFEILRNECPEVISDTANELAPFIDGVRIE 10 DIFLIDFFNSTFSQNGIDYYNRILGGVTTETGEKYRGINEFTNLYRQQHPEFGKSKKATKMVVLFKQILSDRDT LSFIPEMFGNDKQVQNSIQLFYNREISQFENEGVKTDVCTALATLTSKIAEFDTEKIYIQQPELPNVSQRLFGSW NELNACLFKYAELKFGTAEKVANRKKIDKWLKSDLFSFTELNKALEFSGKDERIENYFSETGIFAQLVKTGFD EAQSILETEYTSEVHLKDQQTDIEKIKTFLDALQNLMHLLKSLCVSEEADRDAAFYNEFDMLYNQLKLVVPL YNKVRNYITQKLFRSDKIKIYFENKGQFLGGWVDSQTENSDNGTQAGGYIFRKENVINEYDYYLGICSDPKLF RRTTIVSENDRSSFERLDYYQLKTASVYGNSYCGKHPYTEDKNELVNSIDRFVHLSGNNILIEKIAKDKVKSNP TTNTPSGYLNFIHREAPNTYECLLQDENFVSLNQRVVSALKATLATLVRVPKALVYAKKDYHLFSEIINDIDE LSYEKAFSYFPVSQTEFENSSNRTIKPLLLFKISNKDLSFAENFEKGNRQKIGKKNLHTLYFEALMKGNQDTIDI GTGMVFHRVKSLNYNEKTLKYGHHSTQLNEKFSYPIIKDKRFASDKFLFHLSTEINYKEKRKPLNNSIIEFLTN NPDINIIGLDRGERHLIYLTLINQKGEILRQKTFNIVGNTNYHEKLNQREKERDNARKSWATIGKIKELKEGFLS LVIHEIAKIMVENNAIVVLEDLNFGFKRGRFKVEKQIYQKFEKMLIDKLNYLVFKDKKANEAGGVLKGYQLA EKFESFQKMGKQSGFLFYVPAAYTSKIDPTTGFVNMLNLNYTNMKDAQTLLSGMDKISFNADANYFEFELD YEKFKTNQTDHTNKWTICTVGEKRFTYNSATKETTTVNVTEDLKKLLDKFEVKYSNGDNIKDEICRQTDAKF FEIILWLLKLTMQMRNSNTKTEEDFILSPVKNSNGEFFRSNDDANGIWPADADANGAYHIALKGLYLVKECF NKNEKSLKIEHKNWFKFAQTRFNGSLTKNG* SEQ MENFKNLYPINKTLRFELRPYGKTLENFKKSGLLEKDAFKANSRRSMQAIIDEKFKETIEERLKYTEFSECDLG ID NMTSKDKKITDKAATNLKKQVILSFDDEIFNNYLKPDKNIDALFKNDPSNPVISTFKGFTTYFVNFFEIRKHIFK NO: GESSGSMAYRIIDENLTTYLNNIEKIKKLPEELKSQLEGIDQIDKLNNYNEFITQSGITHYNEIIGGISKSENVKIQ 11 GINEGINLYCQKNKVKLPRLTPLYKMILSDRVSNSFVLDTIENDTELIEMISDLINKTEISQDVIMSDIQNIFIKY KQLGNLPGISYSSIVNAICSDYDNNFGDGKRKKSYENDRKKHLETNVYSINYISELLTDTDVSSNIKMRYKEL EQNYQVCKENFNATNWMNIKNIKQSEKTNLIKDLLDILKSIQRFYDLFDIVDEDKNPSAEFYTWLSKNAEKLD FEFNSVYNKSRNYLTRKQYSDKKIKLNFDSPTLAKGWDANKEIDNSTIIMRKFNNDRGDYDYFLGIWNKSTP ANEKIIPLEDNGLFEKMQYKLYPDPSKMLPKQFLSKIWKAKHPTTPEFDKKYKEGRHKKGPDFEKEFLHELID CFKHGLVNHDEKYQDVFGFNLRNTEDYNSYTEFLEDVERCNYNLSFNKIADTSNLINDGKLYVFQIWSKDFSI DSKGTKNLNTIYFESLFSEENMIEKMFKLSGEAEIFYRPASLNYCEDIIKKGHHHAELKDKFDYPIIKDKRYSQ DKFFFHVPMVINYKSEKLNSKSLNNRTNENLGQFTHIIGIDRGERHLIYLTVVDVSTGEIVEQKHLDEIINTDTK GVEHKTHYLNKLEEKSKTRDNERKSWEAIETIKELKEGYISHVINEIQKLQEKYNALIVMENLNYGFKNSRIK VEKQVYQKFETALIKKFNYIIDKKDPETYIHGYQLTNPITTLDKIGNQSGIVLYIPAWNTSKIDPVTGFVNLLYA DDLKYKNQEQAKSFIQKIDNIYFENGEFKFDIDFSKWNNRYSISKTKWTLTSYGTRIQTFRNPQKNNKWDSAE YDLTEEFKLILNIDGTLKSQDVETYKKFMSLFKLMLQLRNSVTGTDIDYMISPVTDKTGTHFDSRENIKNLPA DADANGAYNIARKGIMAIENIMNGISDPLKISNEDYLKYIQNQQE SEQ MTQFEGFTNLYQVSKTLRFELIPQGKTLKHIQEQGFIEEDKARNDHYKELKPIIDRIYKTYADQCLQLVQLDW ID ENLSAAIDSYRKEKTEETRNALIEEQATYRNAIHDYFIGRTDNLTDAINKRHAEIYKGLFKAELFNGKVLKQL NO: GTVTTTEHENALLRSFDKFTTYFSGFYENRKNVFSAEDISTAIPHRIVQDNFPKFKENCHIFTRLITAVPSLREH 12 FENVKKAIGIFVSTSIEEVFSFPFYNQLLTQTQIDLYNQLLGGISREAGTEKIKGLNEVLNLAIQKNDETAHIIAS LPHRFIPLFKQILSDRNTLSFILEEFKSDEEVIQSFCKYKTLLRNENVLETAEALFNELNSIDLTHIFISHKKLETIS SALCDHWDTLRNALYERRISELTGKITKSAKEKVQRSLKHEDINLQEIISAAGKELSEAFKQKTSEILSHAHAA LDQPLPTTLKKQEEKEILKSQLDSLLGLYHLLDWFAVDESNEVDPEFSARLTGIKLEMEPSLSFYNKARNYAT KKPYSVEKFKLNFQMPTLASGWDVNKEKNNGAILFVKNGLYYLGIMPKQKGRYKALSFEPTEKTSEGFDKM YYDYFPDAAKMIPKCSTQLKAVTAHFQTHTTPILLSNNFIEPLEITKEIYDLNNPEKEPKKFQTAYAKKTGDQK GYREALCKWIDFTRDFLSKYTKTTSIDLSSLRPSSQYKDLGEYYAELNPLLYHISFQRIAEKEIMDAVETGKLY LFQIYNKDFAKGHHGKPNLHTLYWTGLFSPENLAKTSIKLNGQAELFYRPKSRMKRMAHRLGEKMLNKKLK DQKTPIPDTLYQELYDYVNHRLSHDLSDEARALLPNVITKEVSHEIIKDRRFTSDKFFFHVPITLNYQAANSPS KFNQRVNAYLKEHPETPIIGIDRGERNLIYITVIDSTGKILEQRSLNTIQQFDYQKKLDNREKERVAARQAWSV VGTIKDLKQGYLSQVIHEIVDLMIHYQAVVVLENLNFGFKSKRTGIAEKAVYQQFEKMLIDKLNCLVLKDYP AEKVGGVLNPYQLTDQFTSFAKMGTQSGFLFYVPAPYTSKIDPLTGFVDPFVWKTIKNHESRKHFLEGFDFL HYDVKTGDFILHFKMNRNLSFQRGLPGFMPAWDIVFEKNETQFDAKGTPFIAGKRIVPVIENHRFTGRYRDLY PANELIALLEEKGIVFRDGSNILPKLLENDDSHAIDTMVALIRSVLQMRNSNAATGEDYINSPVRDLNGVCFDS RFQNPEWPMDADANGAYHIALKGQLLLNHLKESKDLKLQNGISNQDWLAYIQELRN* SEQ MAVKSIKVKLRLDDMPEIRAGLWKLHKEVNAGVRYYTEWLSLLRQENLYRRSPNGDGEQECDKTAEECKA ID ELLERLRARQVENGHRGPAGSDDELLQLARQLYELLVPQAIGAKGDAQQIARKFLSPLADKDAVGGLGIAKA NO: GNKPRWVRMREAGEPGWEEEKEKAETRKSADRTADVLRALADFGLKPLMRVYTDSEMSSVEWKPLRKGQ 13 AVRTWDRDMFQQAIERMMSWESWNQRVGQEYAKLVEQKNRFEQKNFVGQEHLVHLVNQLQQDMKEASP GLESKEQTAHYVTGRALRGSDKVFEKWGKLAPDAPFDLYDAEIKNVQRRNTRRFGSHDLFAKLAEPEYQAL WREDASFLTRYAVYNSILRKLNHAKMFATFTLPDATAHPIWTRFDKLGGNLHQYTFLFNEFGERRHAIRFHK LLKVENGVAREVDDVTVPISMSEQLDNLLPRDPNEPIALYFRDYGAEQHFTGEFGGAKIQCRRDQLAHMHRR RGARDVYLNVSVRVQSQSEARGERRPPYAAVFRLVGDNHRAFVHFDKLSDYLAEHPDDGKLGSEGLLSGLR VMSVDLGLRTSASISVFRVARKDELKPNSKGRVPFFFPIKGNDNLVAVHERSQLLKLPGETESKDLRAIREER QRTLRQLRTQLAYLRLLVRCGSEDVGRRERSWAKLIEQPVDAANHMTPDWREAFENELQKLKSLHGICSDK EWMDAVYESVRRVWRHMGKQVRDWRKDVRSGERPKIRGYAKDVVGGNSIEQIEYLERQYKFLKSWSFFG KVSGQVIRAEKGSRFAITLREHIDHAKEDRLKKLADRIIMEALGYVYALDERGKGKWVAKYPPCQLILLEELS EYQFNNDRPPSENNQLMQWSHRGVFQELINQAQVHDLLVGTMYAAFSSRFDARTGAPGIRCRRVPARCTQE HNPEPFPWWLNKFVVEHTLDACPLRADDLIPTGEGEIFVSPFSAEEGDFHQIHADLNAAQNLQQRLWSDFDIS QIRLRCDWGEVDGELVLIPRLTGKRTADSYSNKVFYTNTGVTYYERERGKKRRKVFAQEKLSEEEAELLVEA DEAREKSVVLMRDPSGIINRGNWTRQKEFWSMVNQRIEGYLVKQIRSRVPLQDSACENTGDI* SEQ MATRSFILKIEPNEEVKKGLWKTHEVLNHGIAYYMNILKLIRQEAIYEHHEQDPKNPKKVSKAEIQAELWDFV ID LKMQKCNSFTHEVDKDVVFNILRELYEELVPSSVEKKGEANQLSNKFLYPLVDPNSQSGKGTASSGRKPRWY NO: NLKIAGDPSWEEEKKKWEEDKKKDPLAKILGKLAEYGLIPLFIPFTDSNEPIVKEIKWMEKSRNQSVRRLDKD 14 MFIQALERFLSWESWNLKVKEEYEKVEKEHKTLEERIKEDIQAFKSLEQYEKERQEQLLRDTLNTNEYRLSK RGLRGWREIIQKWLKMDENEPSEKYLEVFKDYQRKHPREAGDYSVYEFLSKKENHFIWRNHPEYPYLYATF CEIDKKKKDAKQQATFTLADPINHPLWVRFEERSGSNLNKYRILTEQLHTEKLKKKLTVQLDRLIYPTESGG WEEKGKVDIVLLPSRQFYNQIFLDIEEKGKHAFTYKDESIKFPLKGTLGGARVQFDRDHLRRYPHKVESGNV GRIYFNMTVNIEPTESPVSKSLKIHRDDFPKFVNFKPKELTEWIKDSKGKKLKSGIESLEIGLRVMSIDLGQRQA AAASIFEVVDQKPDIEGKLFFPIKGTELYAVHRASFNIKLPGETLVKSREVLRKAREDNLKLMNQKLNFLRNV LHFQQFEDITEREKRVTKWISRQENSDVPLVYQDELIQIRELMYKPYKDWVAFLKQLHKRLEVEIGKEVKHW RKSLSDGRKGLYGISLKNIDEIDRTRKFLLRWSLRPTEPGEVRRLEPGQRFAIDQLNHLNALKEDRLKKMANT IIMHALGYCYDVRKKKWQAKNPACQIILFEDLSNYNPYEERSRFENSKLMKWSRREIPRQVALQGEIYGLQV GEVGAQFSSRFHAKTGSPGIRCSVVTKEKLQDNRFFKNLQREGRLTLDKIAVLKEGDLYPDKGGEKFISLSKD RKLVTTHADINAAQNLQKRFWTRTHGFYKVYCKAYQVDGQTVYIPESKDQKQKIIEEFGEGYFILKDGVYE WGNAGKLKIKKGSSKQSSSELVDSDILKDSFDLASELKGEKLMLYRDPSGNVFPSDKWMAAGVFFGKLERIL ISKLTNQYSISTIEDDSSKQSM* SEQ MPTRTINLKLVLGKNPENATLRRALFSTHRLVNQATKRIEEFLLLCRGEAYRTVDNEGKEAEIPRHAVQEEAL ID AFAKAAQRHNGCISTYEDQEILDVLRQLYERLVPSVNENNEAGDAQAANAWVSPLMSAESEGGLSVYDKVL NO: DPPPVWMKLKEEKAPGWEAASQIWIQSDEGQSLLNKPGSPPRWIRKLRSGQPWQDDFVSDQKKKQDELTKG 15 NAPLIKQLKEMGLLPLVNPFFRHLLDPEGKGVSPWDRLAVRAAVAHFISWESWNHRTRAEYNSLKLRRDEFE AASDEFKDDFTLLRQYEAKRHSTLKSIALADDSNPYRIGVRSLRAWNRVREEWIDKGATEEQRVTILSKLQT QLRGKFGDPDLFNWLAQDRHVHLWSPRDSVTPLVRINAVDKVLRRRKPYALMTFAHPRFHPRWILYEAPGG SNLRQYALDCTENALHITLPLLVDDAHGTWIEKKIRVPLAPSGQIQDLTLEKLEKKKNRLYYRSGFQQFAGLA GGAEVLFHRPYMEHDERSEESLLERPGAVWFKLTLDVATQAPPNWLDGKGRVRTPPEVHHFKTALSNKSKH TRTLQPGLRVLSVDLGMRTFASCSVFELIEGKPETGRAFPVADERSMDSPNKLWAKHERSFKLTLPGETPSRK EEEERSIARAEIYALKRDIQRLKSLLRLGEEDNDNRRDALLEQFFKGWGEEDVVPGQAFPRSLFQGLGAAPFR STPELWRQHCQTYYDKAEACLAKHISDWRKRTRPRPTSREMWYKTRSYHGGKSIWMLEYLDAVRKLLLSW SLRGRTYGAINRQDTARFGSLASRLLHHINSLKEDRIKTGADSIVQAARGYIPLPHGKGWEQRYEPCQLILFED LARYRFRVDRPRRENSQLMQWNHRAIVAETTMQAELYGQIVENTAAGFSSRFHAATGAPGVRCRFLLERDF DNDLPKPYLLRELSWMLGNTKVESEEEKLRLLSEKIRPGSLVPWDGGEQFATLHPKRQTLCVIHADMNAAQ NLQRRFFGRCGEAFRLVCQPHGDDVLRLASTPGARLLGALQQLENGQGAFELVRDMGSTSQMNRFVMKSL GKKKIKPLQDNNGDDELEDVLSVLPEEDDTGRITVFRDSSGIFFPCNVWIPAKQFWPAVRAMIWKVMASHSL G* SEQ MTKLRHRQKKLTHDWAGSKKREVLGSNGKLQNPLLMPVKKGQVTEFRKAFSAYARATKGEMTDGRKNMF ID THSFEPFKTKPSLHQCELADKAYQSLHSYLPGSLAHFLLSAHALGFRIFSKSGEATAFQASSKIEAYESKLASE NO: LACVDLSIQNLTISTLFNALTTSVRGKGEETSADPLIARFYTLLTGKPLSRDTQGPERDLAEVISRKIASSFGTW 16 KEMTANPLQSLQFFEEELHALDANVSLSPAFDVLIKMNDLQGDLKNRTIVFDPDAPVFEYNAEDPADIIIKLTA RYAKEAVIKNQNVGNYVKNAITTTNANGLGWLLNKGLSLLPVSTDDELLEFIGVERSHPSCHALIELIAQLEA PELFEKNVFSDTRSEVQGMIDSAVSNHIARLSSSRNSLSMDSEELERLIKSFQIHTPHCSLFIGAQSLSQQLESLP EALQSGVNSADILLGSTQYMLTNSLVEESIATYQRTLNRINYLSGVAGQINGAIKRKAIDGEKIHLPAAWSELI SLPFIGQPVIDVESDLAHLKNQYQTLSNEFDTLISALQKNFDLNFNKALLNRTQHFEAMCRSTKKNALSKPEIV SYRDLLARLTSCLYRGSLVLRRAGIEVLKKHKIFESNSELREHVHERKHFVFVSPLDRKAKKLLRLTDSRPDL LHVIDEILQHDNLENKDRESLWLVRSGYLLAGLPDQLSSSFINLPIITQKGDRRLIDLIQYDQINRDAFVMLVTS AFKSNLSGLQYRANKQSFVVTRTLSPYLGSKLVYVPKDKDWLVPSQMFEGRFADILQSDYMVWKDAGRLC VIDTAKHLSNIKKSVFSSEEVLAFLRELPHRTFIQTEVRGLGVNVDGIAFNNGDIPSLKTFSNCVQVKVSRTNT SLVQTLNRWFEGGKVSPPSIQFERAYYKKDDQIHEDAAKRKIRFQMPATELVHASDDAGWTPSYLLGIDPGE YGMGLSLVSINNGEVLDSGFIHINSLINFASKKSNHQTKVVPRQQYKSPYANYLEQSKDSAAGDIAHILDRLIY KLNALPVFEALSGNSQSAADQVWTKVLSFYTWGDNDAQNSIRKQHWFGASHWDIKGMLRQPPTEKKPKPYI AFPGSQVSSYGNSQRCSCCGRNPIEQLREMAKDTSIKELKIRNSEIQLFDGTIKLFNPDPSTVIERRRHNLGPSRI PVADRTFKNISPSSLEFKELITIVSRSIRHSPEFIAKKRGIGSEYFCAYSDCNSSLNSEANAAANVAQKFQKQLFF EL* SEQ MKRILNSLKVAALRLLFRGKGSELVKTVKYPLVSPVQGAVEELAEAIRHDNLHLFGQKEIVDLMEKDEGTQV ID YSVVDFWLDTLRLGMFFSPSANALKITLGKFNSDQVSPFRKVLEQSPFFLAGRLKVEPAERILSVEIRKIGKRE NO: NRVENYAADVETCFIGQLSSDEKQSIQKLANDIWDSKDHEEQRMLKADFFAIPLIKDPKAVTEEDPENETAGK 17 QKPLELCVCLVPELYTRGFGSIADFLVQRLTLLRDKMSTDTAEDCLEYVGIEEEKGNGMNSLLGTFLKNLQG DGFEQIFQFMLGSYVGWQGKEDVLRERLDLLAEKVKRLPKPKFAGEWSGHRMFLHGQLKSWSSNFFRLFNE TRELLESIKSDIQHATMLISYVEEKGGYHPQLLSQYRKLMEQLPALRTKVLDPEIEMTHMSEAVRSYIMIHKS VAGFLPDLLESLDRDKDREFLLSIFPRIPKIDKKTKEIVAWELPGEPEEGYLFTANNLFRNFLENPKHVPRFMA ERIPEDWTRLRSAPVWFDGMVKQWQKVVNQLVESPGALYQFNESFLRQRLQAMLTVYKRDLQTEKFLKLL ADVCRPLVDFFGLGGNDIIFKSCQDPRKQWQTVIPLSVPADVYTACEGLAIRLRETLGFEWKNLKGHEREDFL RLHQLLGNLLFWIRDAKLVVKLEDWMNNPCVQEYVEARKAIDLPLEIFGFEVPIFLNGYLFSELRQLELLLRR KSVMTSYSVKTTGSPNRLFQLVYLPLNPSDPEKKNSNNFQERLDTPTGLSRRFLDLTLDAFAGKLLTDPVTQE LKTMAGFYDHLFGFKLPCKLAAMSNHPGSSSKMVVLAKPKKGVASNIGFEPIPDPAHPVFRVRSSWPELKYL EGLLYLPEDTPLTIELAETSVSCQSVSSVAFDLKNLTTILGRVGEFRVTADQPFKLTPIIPEKEESFIGKTYLGLD AGERSGVGFAIVTVDGDGYEVQRLGVHEDTQLMALQQVASKSLKEPVFQPLRKGTFRQQERIRKSLRGCYW NFYHALMIKYRAKVVHEESVGSSGLVGQWLRAFQKDLKKADVLPKKGGKNGVDKKKRESSAQDTLWGGA FSKKEEQQIAFEVQAAGSSQFCLKCGWWFQLGMREVNRVQESGVVLDWNRSIVTFLIESSGEKVYGFSPQQL EKGFRPDIETFKKMVRDFMRPPMFDRKGRPAAAYERFVLGRRHRRYRFDKVFEERFGRSALFICPRVGCGNF DHSSEQSAVVLALIGYIADKEGMSGKKLVYVRLAELMAEWKLKKLERSRVEEQSSAQ* SEQ MAESKQMQCRKCGASMKYEVIGLGKKSCRYMCPDCGNHTSARKIQNKKKRDKKYGSASKAQSQRIAVAG ID ALYPDKKVQTIKTYKYPADLNGEVHDSGVAEKIAQAIQEDEIGLLGPSSEYACWIASQKQSEPYSVVDFWFD NO: AVCAGGVFAYSGARLLSTVLQLSGEESVLRAALASSPFVDDINLAQAEKFLAVSRRTGQDKLGKRIGECFAE 18 GRLEALGIKDRMREFVQAIDVAQTAGQRFAAKLKIFGISQMPEAKQWNNDSGLTVCILPDYYVPEENRADQL VVLLRRLREIAYCMGIEDEAGFEHLGIDPGALSNFSNGNPKRGFLGRLLNNDIIALANNMSAMTPYWEGRKG ELIERLAWLKHRAEGLYLKEPHFGNSWADHRSRIFSRIAGWLSGCAGKLKIAKDQISGVRTDLFLLKRLLDAV PQSAPSPDFIASISALDRFLEAAESSQDPAEQVRALYAFHLNAPAVRSIANKAVQRSDSQEWLIKELDAVDHL EFNKAFPFFSDTGKKKKKGANSNGAPSEEEYTETESIQQPEDAEQEVNGQEGNGASKNQKKFQRIPRFFGEGS RSEYRILTEAPQYFDMFCNNMRAIFMQLESQPRKAPRDFKCFLQNRLQKLYKQTFLNARSNKCRALLESVLIS WGEFYTYGANEKKFRLRHEASERSSDPDYVVQQALEIARRLFLFGFEWRDCSAGERVDLVEIHKKAISFLLAI TQAEVSVGSYNWLGNSTVSRYLSVAGTDTLYGTQLEEFLNATVLSQMRGLAIRLSSQELKDGFDVQLESSCQ DNLQHLLVYRASRDLAACKRATCPAELDPKILVLPVGAFIASVMKMIERGDEPLAGAYLRHRPHSFGWQIRV RGVAEVGMDQGTALAFQKPTESEPFKIKPFSAQYGPVLWLNSSSYSQSQYLDGFLSQPKNWSMRVLPQAGS VRVEQRVALIWNLQAGKMRLERSGARAFFMPVPFSFRPSGSGDEAVLAPNRYLGLFPHSGGIEYAVVDVLDS AGFKILERGTIAVNGFSQKRGERQEEAHREKQRRGISDIGRKKPVQAEVDAANELHRKYTDVATRLGCRIVV QWAPQPKPGTAPTAQTVYARAVRTEAPRSGNQEDHARMKSSWGYTWGTYWEKRKPEDILGISTQVYWTG GIGESCPAVAVALLGHIRATSTQTEWEKEEVVFGRLKKFFPS* SEQ MEKRINKIRKKLSADNATKPVSRSGPMKTLLVRVMTDDLKKRLEKRRKKPEVMPQVISNNAANNLRMLLDD ID YTKMKEAILQVYWQEFKDDHVGLMCKFAQPASKKIDQNKLKPEMDEKGNLTTAGFACSQCGQPLFVYKLE NO: QVSEKGKAYTNYFGRCNVAEHEKLILLAQLKPEKDSDEAVTYSLGKFGQRALDFYSIHVTKESTHPVKPLAQ 19 IAGNRYASGPVGKALSDACMGTIASFLSKYQDIIIEHQKVVKGNQKRLESLRELAGKENLEYPSVTLPPQPHT KEGVDAYNEVIARVRMWVNLNLWQKLKLSRDDAKPLLRLKGFPSFPVVERRENEVDWWNTINEVKKLIDA KRDMGRVFWSGVTAEKRNTILEGYNYLPNENDHKKREGSLENPKKPAKRQFGDLLLYLEKKYAGDWGKVF DEAWERIDKKIAGLTSHIEREEARNAEDAQSKAVLTDWLRAKASFVLERLKEMDEKEFYACEIQLQKWYGD LRGNPFAVEAENRVVDISGFSIGSDGHSIQYRNLLAWKYLENGKREFYLLMNYGKKGRIRFTDGTDIKKSGK WQGLLYGGGKAKVIDLTFDPDDEQLIILPLAFGTRQGREFIWNDLLSLETGLIKLANGRVIEKTIYNKKIGRDE PALFVALTFERREVVDPSNIKPVNLIGVDRGENIPAVIALTDPEGCPLPEFKDSSGGPTDILRIGEGYKEKQRAI QAAKEVEQRRAGGYSRKFASKSRNLADDMVRNSARDLFYHAVTHDAVLVFENLSRGFGRQGKRTFMTERQ YTKMEDWLTAKLAYEGLTSKTYLSKTLAQYTSKTCSNCGFTITTADYDGMLVRLKKTSDGWATTLNNKEL KAEGQITYYNRYKRQTVEKELSAELDRLSEESGNNDISKWTKGRRDEALFLLKKRFSHRPVQEQFVCLDCGH EVHADEQAALNIARSWLFLNSNSTEFKSYKSGKQPFVGAWQAFYKRRLKEVWKPNA SEQ MKRINKIRRRLVKDSNTKKAGKTGPMKTLLVRVMTPDLRERLENLRKKPENIPQPISNTSRANLNKLLTDYTE ID MKKAILHVYWEEFQKDPVGLMSRVAQPAPKNIDQRKLIPVKDGNERLTSSGFACSQCCQPLYVYKLEQVND NO: KGKPHTNYFGRCNVSEHERLILLSPHKPEANDELVTYSLGKFGQRALDFYSIHVTRESNHPVKPLEQIGGNSC 20 ASGPVGKALSDACMGAVASFLTKYQDIILEHQKVIKKNEKRLANLKDIASANGLAFPKITLPPQPHTKEGIEA YNNVVAQIVIWVNLNLWQKLKIGRDEAKPLQRLKGFPSFPLVERQANEVDWWDMVCNVKKLINEKKEDGK VFWQNLAGYKRQEALLPYLSSEEDRKKGKKFARYQFGDLLLHLEKKHGEDWGKVYDEAWERIDKKVEGLS KHIKLEEERRSEDAQSKAALTDWLRAKASFVIEGLKEADKDEFCRCELKLQKWYGDLRGKPFAIEAENSILDI SGFSKQYNCAFIWQKDGVKKLNLYLIINYFKGGKLRFKKIKPEAFEANRFYTVINKKSGEIVPMEVNFNFDDP NLIILPLAFGKRQGREFIWNDLLSLETGSLKLANGRVIEKTLYNRRTRQDEPALFVALTFERREVLDSSNIKPM NLIGIDRGENIPAVIALTDPEGCPLSRFKDSLGNPTHILRIGESYKEKQRTIQAAKEVEQRRAGGYSRKYASKA KNLADDMVRNTARDLLYYAVTQDAMLIFENLSRGFGRQGKRTFMAERQYTRMEDWLTAKLAYEGLPSKT YLSKTLAQYTSKTCSNCGFTITSADYDRVLEKLKKTATGWMTTINGKELKVEGQITYYNRYKRQNVVKDLS VELDRLSEESVNNDISSWTKGRSGEALSLLKKRFSHRPVQEKFVCLNCGFETHADEQAALNIARSWLFLRSQE YKKYQTNKTTGNTDKRAFVETWQSFYRKKLKEVWKP SEQ atgGGAAAAATGTATTATCTTGGTCTGGATATAGGAACAAATTCTGTTGGATATGCCGTAACCGACCCATC ID GTACCATTTGCTCAAATTTAAAGGCGAACCGATGTGGGGTGCCCACGTGTTTGCTGCGGGGAATCAATC NO: AGCTGAACGGAGAAGCTTTCGTACGAGCCGCAGACGCCTTGACCGCAGGCAACAGCGTGTCAAACTGGT 21 TCAAGAAATCTTTGCTCCCGTGATTAGTCCCATTGATCCACGTTTTTTTATCAGACTTCATGAGAGCGCTT TATGGCGGGATGATGTGGCTGAAACGGATAAACATATTTTCTTTAATGACCCGACCTATACGGATAAGG AATATTATTCTGACTATCCAACCATCCATCATCTCATTGTGGACCTTATGGAAAGCAGTGAAAAGCATGA CCCGCGGCTTGTTTATTTGGCTGTTGCCTGGCTGGTTGCTCATCGTGGTCATTTCCTCAATGAAGTGGATA AGGATAATATTGGGGATGTCCTGAGTTTTGACGCCTTTTATCCTGAGTTTCTGGCATTTCTTTCCGATAAT GGGGTGTCACCTTGGGTATGTGAGTCAAAAGCACTCCAAGCGACCCTGCTTTCACGAAACTCCGTCAAC GATAAGTATAAAGCCTTGAAGTCTCTGATCTTTGGCAGCCAAAAGCCGGAGGATAATTTTGATGCCAAT ATCAGTGAAGATGGACTTATCCAACTTTTAGCAGGAAAAAAGGTCAAGGTCAATAAACTTTTTCCTCAA GAAAGTAATGATGCTTCCTTTACACTCAATGATAAGGAAGATGCAATTGAGGAAATCTTAGGAACGCTT ACACCGGATGAGTGTGAATGGATTGCGCATATTAGGAGGCTGTTTGATTGGGCCATCATGAAACATGCT CTCAAAGATGGCAGAACAATCTCCGAATCGAAAGTAAAGCTCTATGAACAGCATCACCATGACTTGACA CAGCTCAAGTATTTTGTGAAGACCTATCTAGCAAAGGAATATGATGACATTTTTCGAAACGTAGATAGT GAAACAACCAAAAACTATGTCGCATATTCCTATCATGTAAAAGAAGTCAAGGGTACATTGCCCAAAAAT AAGGCAACCCAAGAAGAATTTTGCAAGTATGTCCTTGGAAAGGTAAAGAACATCGAATGCAGTGAAGC TGATAAGGTTGATTTTGATGAAATGATTCAGCGTCTTACAGACAATTCCTTTATGCCGAAACAAGTATCA GGTGAAAACAGGGTTATCCCTTACCAGCTTTACTATTATGAACTAAAGACTATTTTGAATAAAGCCGCTT CTTATCTGCCTTTTTTGACCCAATGCGGAAAAGATGCCATCTCCAATCAAGATAAGCTCCTTTCCATCAT GACCTTTCGGATTCCGTATTTCGTTGGGCCCTTGCGCAAGGACAATTCAGAGCATGCCTGGCTGGAACGA AAAGCAGGGAAAATCTATCCGTGGAATTTTAACGACAAAGTTGACCTTGATAAAAGTGAAGAAGCGTTC ATTCGGAGAATGACGAATACCTGCACTTATTATCCCGGTGAAGATGTTTTGCCACTTGACTCCCTTATTT ATGAAAAATTCATGATCCTCAATGAAATCAATAATATCCGAATTGATGGTTATCCTATTTCTGTAGATGT AAAACAGCAGGTTTTTGGCCTCTTTGAAAAGAAGAGAAGAGTGACCGTAAAGGATATCCAGAATCTCCT GCTTTCCTTGGGTGCCTTGGATAAGCATGGTAAATTGACGGGAATCGATACTACCATCCATAGCAATTAC AATACATACCATCATTTTAAATCGCTCATGGAGCGTGGCGTTCTTACTCGTGATGATGTGGAACGCATTG TGGAGCGTATGACCTATAGTGATGATACAAAACGCGTCCGTCTTTGGCTGAACAATAATTATGGAACGC TCACTGCTGACGACGTAAAGCATATTTCAAGGCTCCGAAAGCATGATTTTGGCCGGCTTTCCAAAATGTT CCTCACAGGCCTAAAGGGAGTTCATAAGGAAACGGGGGAACGAGCTTCCATTTTGGATTTTATGTGGAA TACCAATGATAACTTGATGCAGCTTTTATCTGAATGTTATACTTTTTCGGATGAAATTACCAAGCTGCAG GAAGCATACTATGCCAAGGCGCAGCTTTCCCTGAATGATTTTCTGGACTCCATGTATATTTCAAATGCTG TCAAACGTCCTATCTATCGAACTCTTGCCGTTGTAAATGACATACGCAAAGCCTGTGGGACGGCGCCAA AACGCATTTTTATCGAAATGGCAAGAGATGGGGAAAGCAAAAAGAAAAGGAGCGTAACAAGAAGAGA ACAAATCAAGAATCTTTATAGGTCCATCCGCAAGGATTTTCAGCAGGAGGTAGATTTCCTTGAAAAAAT CCTTGAAAACAAAAGCGATGGACAGCTGCAAAGCGATGCGCTCTATCTATACTTTGCGCAGCTTGGAAG GGATATGTATACCGGGGACCCTATCAAGTTGGAGCATATCAAGGACCAGTCCTTCTATAATATTGATCAT ATCTATCCCCAAAGCATGGTCAAGGACGATAGTCTTGATAACAAGGTGTTGGTTCAATCGGAAATTAAT GGAGAGAAGAGCAGTCGATATCCTCTTGATGCTGCTATCCGTAATAAAATGAAGCCTCTTTGGGATGCTT ATTATAACCATGGCCTGATTTCCCTCAAGAAGTATCAGCGTTTGACGCGGAGCACTCCCTTTACAGATGA TGAAAAGTGGGATTTCATCAATCGGCAGCTTGTTGAGACAAGACAATCCACGAAGGCCTTGGCAATCTT ACTAAAAAGGAAGTTCCCTGATACGGAGATTGTCTACTCCAAGGCAGGGCTTTCTTCTGATTTTCGGCAT GAGTTTGGTCTCGTAAAATCGAGGAATATCAATGACCTGCACCATGCAAAGGACGCATTTCTTGCGATT GTAACAGGAAATGTCTATCATGAACGCTTTAATCGCCGGTGGTTTATGGTGAACCAGCCCTATTCCGTCA AGACCAAGACGTTGTTTACGCATTCTATTAAAAATGGTAATTTTGTAGCTTGGAATGGAGAAGAGGATC TTGGCCGCATTGTTAAAATGTTAAAGCAAAATAAGAACACTATTCATTTCACGCGGTTCTCTTTTGATCG AAAGGAAGGCCTGTTTGATATTCAGCCACTAAAAGCGTCAACCGGTCTTGTACCAAGAAAAGCCGGACT AGACGTGGTAAAATATGGTGGCTATGACAAATCGACAGCAGCTTATTATCTCCTTGTTCGATTTACACTA GAAGATAAAAAGACTCAACATAAATTGATGATGATTCCTGTAGAAGGCTTGTATAAAGCTCGAATTGAC CATGATAAGGAATTCTTAACGGACTATGCACAAACTACAATCAGTGAAATCCTACAAAAAGATAAACAA AAGGTGATAAATATAATGTTTCCAATGGGAACAAGGCACATTAAACTGAATTCCATGATTTCAATCGAT GGTTTTTATCTTTCCATTGGAGGAAAGTCTAGTAAGGGAAAATCGGTGTTGTGTCATGCTATGGTACCTC TTATTGTACCTCATAAGATAGAATGTTATATTAAGGCGATGGAGTCTTTTGCACGTAAATTTAAAGAAAA TAATAAATTAAGGATTGTGGAAAAGTTTGATAAGATTACGGTGGAAGATAACTTGAACCTATACGAACT ATTTTTACAAAAACTTCAACATAACCCATATAATAAGTTCTTCTCCACACAATTTGATGTGCTGACTAAT GGAAGAAGTACATTTACTAAATTATCTCCAGAGGAACAAGTTCAAACGTTATTGAATATCTTATCAATTT TTAAAACTTGTCGGAGCTCTGGCTGCGATTTAAAATCCATTAACGGTTCTGCTCAAGCTGCCAGAATTAT GATCAGCGCAGATTTAACTGGACTCTCAAAAAAATATTCCGATATTCGGCTTGTTGAGCAATCAGCATCT GGACTTTTTGTTAGTAAATCACAAAATCTTTTGGAGTATTTAtga SEQ atgtcttcattaacaaaatttacaaataaatacagtaagcagctaaccataaaaaatgaactcatcccagtaggaaagactctcgagaacattaaggaa ID aacggtctcatagatggagatgaacagctaaacgagaattatcaaaaagcaaagataatcgttgatgattttctacgagatttcataaataaagcttta NO: aataatacccaaataggaaattggagagaattagcagatgctttaaataaagaagatgaagataacatagaaaagctccaagacaaaatcagaggaata 22 attgtaagtaaattcgagacatttgatttgttttcttcttactcgataaagaaagacgaaaagataatagatgatgataatgatgttgaagaagaggag ctagatctaggaaaaaaaacttcctcatttaaatatatttttaagaaaaacctttttaaattagtacttccttcttatttaaagacaacaaatcaggat aaactgaaaataatctcttcttttgataatttttctacctatttcagaggattctttgagaacagaaaaaatattttcactaagaagcctatatctacg tcaattgcctacagaattgtccatgataactttccaaagtttctagataacatcagatgttttaatgtgtggcaaacagaatgcccacagttaattgta aaggctgataattatttaaaatcaaagaacgtcatagctaaagataaatctttagcaaactattttactgtaggagcatatgattacttcttatcccag aatggcattgatttctacaacaacattatcggcggtctaccagcatttgctggtcatgagaaaatccaaggacttaatgaatttataaatcaagaatgc caaaaggacagcgaactaaaatctaaactgaaaaacagacatgctttcaaaatggctgttctatttaagcaaattctttcagatagagaaaaaagtttt gttatagacgagttcgaatctgatgctcaggtcatagatgcggttaagaacttctatgcagaacaatgtaaggataataatgttatttttaaccttcta aatcttatcaagaatatagcgttcttatctgatgatgaattagatggaatttttatagaaggcaagtatttaagctctgtttcccaaaagctatattca gattggtcgaagcttcgaaatgatattgaagatagtgcaaacagtaaacaaggaaataaagagttagcaaagaaaattaaaacaaataaaggcgatgtt gaaaaggccataagtaaatatgagttttctttatcagaacttaactcaattgtacatgataatacaaaattcagtgaccttctttcttgtacgttacat aaagtggctagcgaaaaactagtgaaagttaatgaaggggactggccaaaacacctgaaaaataatgaagaaaaacaaaagataaaagagcctttagat gcattgttagaaatttataatacattgctgatattcaactgcaagtcatttaataagaacggtaatttctatgttgattatgacagatgcataaatgag ctttctagtgttgtttatttatataacaaaacaagaaattactgtacaaagaaaccttataacacagacaaattcaaattaaactttaacagtcctcaa ttaggagagggctttagtaagtcgaaagaaaatgactgtctgacattattatttaaaaaagacgacaattactatgttggaattatcagaaaaggggca aaaattaactttgatgatacacaagccattgcagacaatacagataactgtatatttaagatgaattatttcctattaaaagatgctaaaaagtttatt cctaaatgttcaattcagttaaaagaagtaaaagcacattttaaaaaatcagaggatgattatatcctgagtgacaaagaaaaatttgcctctcccctt gttattaagaaatcaacatttttattagcaacagcacatgtaaaaggaaagaaaggaaacataaaaaaattccaaaaggaatattctaaggaaaatcca acagaatatagaaattctctgaatgaatggattgcattttgtaaagaatttctaaaaacatataaggcggcaacaatctttgacattacaacgttaaaa aaagctgaagaatatgctgatattgttgagttttataaggatgtagataatctttgttataaactagagttttgccctattaaaacatctttcattgag aatcttattgataatggggacttatatttattcagaatcaataataaagatttcagttcaaaatctactggtacaaagaatcttcatacgctctatctt caggcaatctttgatgaaagaaacctcaataatcctactattatgttaaatggcggagcagagttattttatcgaaaagaaagcattgaacagaaaaat aggataactcataaggcaggatcaattcttgtaaacaaggtttgtaaggatggaacaagtctagatgacaaaatcagaaacgaaatatatcaatatgaa aacaagtttattgatacattgtctgatgaagctaaaaaagttttacctaatgtaataaaaaaagaagcaactcacgacataacaaaagataagcgattt acatcagataagttctttttccattgcccattaacaattaactataaggaaggagatacaaaacaatttaacaatgaggttttatctttccttagaggt aatccagacattaatatcatcggaattgacagaggagaaagaaaccttatatacgtaactgttattaatcagaaaggcgaaatacttgacagcgtttcg tttaacacagtaacaaacaagtcgagcaaaattgaacaaactgttgattatgaggaaaagcttgctgttagggaaaaagaaagaatagaagcaaaaaga tcctgggattcaatatcaaagatagcaaccttaaaagaaggttatctatcagctattgttcatgagatatgcctactgatgatcaaacacaacgcaatc gttgtacttgagaatctaaatgcaggatttaagagaattagaggaggattatcagaaaagtctgtttatcagaaattcgagaagatgcttattaacaaa ctaaattactttgtatctaaaaaagaatcagactggaataaacctagtggacttttaaatggtttacaactttcagaccagttcgagtcatttgagaaa ttaggaattcaatctgggttcatcttctatgttcctgcagcatatacatctaagattgatcctacaacaggatttgcaaatgttcttaacttatccaag gtaagaaatgttgatgcaataaagagttttttcagtaatttcaatgaaatttcatatagcaaaaaagaagctctctttaaattctcttttgatttagat tccttatcaaagaagggcttcagctcatttgtaaaattcagtaaatctaaatggaatgtatatacatttggagagagaataataaaaccaaagaataag caagggtatcgtgaagataagagaattaatttaacatttgaaatgaaaaaacttctgaatgaatataaagtaagttttgatcttgaaaacaacttaatt ccaaatctaacctctgcaaatctgaaagataccttctggaaagaactattctttatttttaaaacaactctgcagcttagaaacagtgtaacaaatggc aaagaagatgtactgatttctccagtaaagaacgctaaaggagagttctttgtatcaggaactcataacaagacattacctcaagactgtgatgcaaat ggagcatatcatatcgccctaaaaggtctgatgattcttgaacgtaacaatcttgttagagaagaaaaagacacaaagaagataatggcaatttctaat gttgactggtttgagtatgttcaaaaaaggagaggtgtcctgtaa SEQ ATGAACAACTATGATGAGTTTACCAAACTGTACCCAATACAGAAAACGATAAGGTTCGAATTGAAGCCG ID CAGGGAAGAACGATGGAACACCTCGAAACATTCAACTTTTTCGAAGAGGACAGGGATAGAGCGGAGAA NO: ATATAAGATTTTAAAGGAAGCAATCGACGAGTATCATAAGAAGTTTATAGACGAACATCTAACAAATAT 23 GTCTCTTGACTGGAATTCTTTAAAACAGATTTCAGAGAAATACTATAAGAGTAGAGAGGAAAAAGACAA GAAAGTTTTTCTGTCAGAACAGAAACGCATGAGGCAAGAGATAGTTTCTGAGTTCAAAAAAGACGATCG GTTTAAAGATCTTTTTTCAAAAAAATTGTTTTCTGAACTTCTCAAGGAAGAGATTTACAAAAAAGGAAAC CATCAGGAAATTGACGCATTGAAAAGTTTTGATAAATTCTCAGGCTATTTTATTGGGTTGCATGAGAACC GAAAAAATATGTATTCTGACGGAGACGAGATCACGGCTATCTCTAACCGTATTGTAAATGAGAATTTCC CGAAGTTCCTCGACAACCTTCAGAAATATCAGGAAGCTCGTAAAAAATATCCAGAGTGGATCATTAAGG CAGAATCTGCTTTAGTTGCACATAATATCAAGATGGATGAAGTCTTTTCCTTAGAGTATTTCAACAAAGT CCTGAATCAAGAAGGAATACAGAGATACAATCTCGCCCTAGGTGGCTATGTGACCAAAAGTGGTGAGA AAATGATGGGGCTTAATGATGCACTTAATCTTGCCCATCAAAGTGAAAAAAGCAGCAAGGGAAGGATA CACATGACTCCACTCTTCAAACAGATTCTGAGTGAAAAAGAGTCCTTTTCTTATATACCAGATGTTTTTA CAGAAGACTCTCAACTTTTACCATCCATTGGTGGGTTCTTTGCACAAATAGAAAATGATAAGGACGGGA ATATTTTTGACAGAGCATTAGAATTGATATCTTCTTATGCAGAATACGATACAGAAAGGATATATATCAG GCAAGCGGACATAAACAGAGTTTCTAATGTTATTTTCGGGGAGTGGGGAACACTGGGGGGGTTAATGAG GGAATACAAAGCAGACTCTATCAACGACATCAATTTGGAGAGAACATGCAAGAAGGTAGACAAGTGGC TCGACTCAAAGGAGTTTGCGTTATCAGATGTATTAGAGGCAATAAAAAGAACCGGCAATAATGATGCTT TTAATGAATATATCTCAAAGATGCGCACTGCCAGGGAAAAGATTGACGCTGCAAGAAAGGAAATGAAA TTCATTTCGGAAAAAATATCTGGAGACGAAGAATCGATCCATATTATCAAAACCTTATTGGACTCGGTGC AACAGTTTTTACATTTTTTCAATTTATTCAAAGCGCGTCAGGACATTCCTCTTGATGGAGCATTCTATGCG GAGTTCGATGAAGTCCATAGCAAACTGTTTGCTATTGTTCCGTTGTATAATAAGGTTAGGAACTATCTTA CGAAAAATAACCTTAACACGAAAAAGATAAAGCTAAACTTCAAGAATCCAACTCTGGCAAACGGATGG GATCAAAACAAGGTATATGACTACGCCTCCTTAATCTTTCTCCGCGATGGTAATTATTATCTCGGAATAA TAAATCCAAAAAGGAAAAAGAATATTAAATTCGAACAAGGGTCTGGAAATGGCCCATTCTACCGGAAG ATGGTGTACAAACAAATTCCAGGGCCGAACAAGAACTTACCAAGAGTCTTCCTCACATCTACGAAAGGC AAAAAAGAGTACAAGCCGTCAAAGGAGATAATAGAAGGATATGAAGCGGACAAACACATAAGAGGAG ATAAATTCGATCTGGATTTCTGTCATAAGCTGATAGACTTCTTCAAGGAATCCATCGAGAAGCACAAGG ACTGGAGTAAGTTCAACTTCTATTTCTCTCCAACTGAATCATATGGAGACATCAGCGAATTCTATCTGGA TGTAGAAAAACAGGGATACCGGATGCATTTTGAGAATATTTCTGCCGAGACGATTGATGAGTATGTCGA AAAGGGGGACTTATTCCTCTTCCAGATATACAACAAAGACTTTGTGAAAGCGGCAACCGGAAAAAAAG ATATGCACACCATTTATTGGAACGCGGCATTCTCGCCCGAGAACCTTCAGGATGTGGTAGTGAAACTGA ACGGTGAAGCAGAACTTTTCTACAGAGACAAGAGCGACATCAAGGAGATAGTTCACAGGGAGGGAGAG ATACTGGTCAATCGTACCTACAACGGCAGGACACCTGTGCCTGACAAGATCCACAAAAAATTAACAGAT TATCATAATGGCCGTACCAAAGATCTCGGAGAAGCAAAAGAATACCTCGATAAGGTCAGATATTTCAAA GCGCACTACGACATCACAAAGGATCGCAGATACCTGAATGATAAAATATACTTCCATGTGCCTCTGACA TTGAATTTCAAAGCAAACGGGAAGAAGAATCTCAATAAGATGGTAATTGAAAAGTTCCTCTCGGACGAA AAAGCGCATATTATTGGGATTGATCGCGGGGAAAGGAATCTTCTTTACTATTCTATCATTGACAGGTCAG GTAAAATAATCGATCAACAGAGCCTCAACGTCATCGATGGATTCGATTACCGAGAGAAACTGAATCAGA GGGAGATCGAGATGAAGGATGCCAGACAAAGCTGGAATGCTATCGGGAAGATAAAGGACCTCAAGGAA GGGTATCTTTCAAAAGCGGTCCACGAAATTACCAAGATGGCGATACAATACAATGCCATTGTTGTCATG GAGGAACTCAATTATGGGTTCAAACGCGGACGTTTCAAAGTTGAGAAGCAGATATATCAGAAATTCGAG AATATGCTGATTGACAAGATGAATTATCTGGTATTCAAGGATGCTCCGGATGAAAGTCCGGGAGGAGTC CTCAATGCATATCAGCTTACTAATCCGCTTGAAAGTTTCGCTAAACTTGGGAAACAGACAGGAATTCTTT TCTATGTTCCGGCAGCCTATACTTCGAAGATAGATCCGACGACCGGGTTTGTCAATCTTTTCAATACTTC AAGTAAAACGAACGCACAGGAAAGAAAAGAATTCTTGCAAAAATTCGAGTCGATCTCCTATTCCGCTAA AGACGGAGGAATATTCGCATTCGCGTTCGATTATCGGAAGTTCGGAACGTCAAAAACAGACCACAAAAA TGTATGGACCGCATACACGAACGGGGAAAGGATGAGGTACATAAAAGAGAAAAAACGCAACGAACTGT TCGACCCCTCGAAGGAGATCAAAGAGGCTCTCACTTCATCAGGAATCAAATATGACGGCGGACAGAACA TATTGCCAGATATCCTGAGGAGCAACAATAACGGTCTGATCTACACAATGTATTCCTCTTTCATAGCGGC CATTCAAATGAGGGTCTATGACGGGAAAGAAGACTATATCATCTCGCCGATAAAGAACAGCAAGGGAG AGTTCTTCAGGACCGATCCGAAAAGAAGGGAACTTCCGATAGACGCGGATGCGAACGGCGCGTATAAC ATTGCTCTCAGGGGCGAATTGACGATGCGTGCGATAGCGGAGAAGTTCGATCCGGACTCGGAAAAGATG GCGAAGCTAGAACTGAAACATAAGGACTGGTTCGAATTCATGCAGACAAGGGGGGATTGA SEQ ATGACAAAAACATTTGATTCAGAATTTTTTAATTTATATTCTCTTCAAAAAACAGTTCGTTTTGAACTCAA ID GCCGGTTGGTGAAACAGCCTCGTTTGTTGAAGATTTTAAAAACGAAGGTTTGAAACGAGTTGTTTCAGA NO: GGATGAACGGCGTGCGGTTGATTACCAAAAAGTGAAAGAAATTATTGATGACTACCACCGAGATTTTAT 24 TGAAGAATCGCTGAACTATTTTCCTGAGCAGGTCTCAAAAGACGCTTTGGAACAAGCTTTTCACCTTTAT CAAAAACTAAAAGCCGCTAAGGTTGAAGAGCGTGAAAAAGCATTGAAAGAATGGGAAGCCCTTCAGAA AAAACTGCGCGAAAAAGTTGTTAAATGTTTTTCAGATTCAAACAAAGCACGCTTTTCCCGCATTGATAAA AAAGAACTGATTAAAGAAGATTTAATTAACTGGTTGGTTGCACAAAATCGCGAAGATGACATTCCAACC GTTGAAACCTTTAACAACTTTACGACTTATTTTACGGGGTTTCATGAAAACCGAAAAAACATTTATTCAA AAGACGATCATGCCACAGCCATTTCATTTCGACTCATTCATGAAAACCTGCCTAAGTTTTTTGATAATGT GATCAGCTTTAATAAATTGAAGGAAGGATTTCCAGAGCTGAAATTTGATAAGGTTAAGGAAGATTTAGA AGTTGATTATGACTTGAAACATGCCTTTGAAATCGAATACTTTGTCAATTTTGTTACCCAAGCCGGAATT GACCAATATAACTATCTTTTGGGGGGTAAAACCTTAGAAGACGGCACCAAAAAGCAAGGCATGAATGA ACAAATCAATCTGTTCAAGCAACAGCAAACCCGAGACAAAGCCCGACAAATTCCCAAACTCATACCATT GTTTAAACAAATTCTAAGCGAACGAACGGAAAGCCAATCGTTTATTCCAAAACAATTTGAATCAGACCA AGAGCTATTTGACTCACTGCAAAAACTGCATAACAACTGCCAAGATAAATTTACCGTACTGCAACAAGC CATTTTAGGCTTAGCCGAAGCAGATCTGAAAAAAGTATTCATTAAAACATCTGATCTTAATGCGCTATCA AATACCATTTTTGGAAATTACAGTGTGTTTTCGGATGCGTTGAATTTATACAAAGAATCGCTCAAAACAA AAAAGGCGCAAGAAGCGTTTGAAAAACTACCCGCTCACAGCATTCATGACTTGATTCAATATTTGGAGC AATTTAATAGCTCTTTGGATGCAGAAAAACAGCAATCAACTGACACCGTACTGAATTACTTTATTAAAAC AGACGAGCTGTATTCTCGGTTCATAAAATCAACGAGCGAAGCCTTCACACAAGTACAACCACTCTTTGA ATTGGAAGCATTAAGCTCAAAACGTCGTCCACCGGAAAGTGAAGACGAAGGCGCAAAAGGTCAGGAAG GGTTTGAGCAAATTAAACGCATAAAAGCCTATTTGGATACCTTGATGGAGGCGGTGCATTTTGCAAAAC CACTTTATCTGGTGAAGGGGCGCAAAATGATTGAAGGTCTGGACAAAGACCAAAGTTTCTATGAAGCCT TTGAAATGGCTTACCAAGAACTAGAAAGTCTGATTATTCCAATCTACAACAAAGCTCGTAGTTATTTAAG TCGTAAACCGTTTAAAGCGGACAAATTCAAAATTAATTTTGATAATAATACATTGCTTTCCGGTTGGGAT GCTAATAAAGAAACGGCTAACGCTTCAATTTTGTTTAAGAAGGATGGTTTGTATTATTTAGGAATCATGC CTAAAGGAAAAACGTTTTTGTTCGATTACTTCGTTTCATCGGAAGATTCTGAAAAGTTAAAACAAAGAA GACAAAAAACCGCCGAAGAAGCGCTTGCGCAAGATGGCGAAAGCTACTTTGAAAAAATTCGTTACAAG CTGTTACCTGGCGCCAGCAAAATGTTGCCGAAAGTATTTTTTTCCAACAAAAACATAGGGTTTTACAACC CAAGTGATGACATACTTCGTATCAGGAATACAGCCTCTCACACTAAAAACGGAACACCGCAAAAAGGGC ACTCTAAAGTAGAGTTTAATTTGAATGATTGTCATAAGATGATTGATTTCTTTAAATCAAGCATTCAAAA GCATCCAGAGTGGGGAAGTTTTGGATTCACCTTTTCAGATACATCAGATTTTGAAGATATGAGCGCCTTT TATCGAGAAGTCGAAAACCAAGGTTATGTCATTAGTTTCGATAAAATAAAAGAAACTTACATTCAGAGT CAAGTTGAACAGGGGAACCTATATTTATTCCAAATCTACAATAAAGACTTCTCGCCCTACAGCAAAGGC AAACCAAATTTACACACGCTTTACTGGAAAGCGTTGTTTGAGGAAGCCAACCTAAATAATGTGGTGGCA AAACTCAATGGTGAAGCTGAAATTTTCTTTAGGCGACACTCAATCAAAGCATCTGATAAAGTGGTGCAC CCAGCGAATCAAGCCATTGACAATAAAAACCCGCATACCGAAAAAACGCAAAGCACCTTTGAATATGAT CTTGTAAAAGACAAGCGCTATACCCAAGACAAATTCTTCTTCCATGTACCGATTTCATTGAACTTTAAGG CACAAGGTGTTTCAAAATTTAACGATAAAGTGAATGGATTTTTAAAGGGTAACCCAGATGTCAATATTA TTGGCATTGACCGAGGCGAACGACACCTTCTGTATTTCACTGTGGTGAATCAGAAAGGTGAAATTTTGGT TCAAGAGTCGCTTAATACCCTAATGAGTGATAAAGGGCATGTGAATGACTACCAGCAAAAACTCGACAA AAAAGAACAAGAACGCGATGCCGCTCGCAAAAGCTGGACGACGGTTGAAAATATCAAAGAATTAAAAG AAGGCTATTTATCTCATGTTGTTCATAAGTTGGCACACCTGATTATTAAATACAATGCCATTGTTTGCTTG GAAGACCTGAATTTTGGTTTCAAACGCGGGCGTTTTAAAGTGGAAAAACAAGTTTATCAGAAATTTGAA AAAGCGCTTATTGATAAGCTTAACTACTTGGTATTTAAAGAAAAAGAGTTAGGCGAGGTGGGCCATTAT CTAACCGCCTATCAGTTGACCGCACCGTTTGAAAGTTTCAAGAAGTTAGGCAAGCAAAGTGGCATATTG TTTTATGTTCCGGCGGATTACACCTCCAAAATTGACCCAACCACCGGGTTTGTCAACTTTCTTGATCTGCG TTATCAGAGTGTCGAAAAAGCCAAACAGCTCTTAAGCGACTTTAATGCCATTCGTTTTAATTCAGTACAA AACTATTTTGAGTTCGAAATAGATTACAAAAAACTCACACCCAAACGTAAAGTTGGTACTCAGAGTAAA TGGGTGATTTGTACCTATGGAGATGTCCGCTATCAAAATCGGCGTAATCAAAAAGGTCACTGGGAAACG GAAGAAGTCAATGTGACTGAAAAACTAAAAGCCCTTTTCGCCAGTGATTCCAAAACTACAACCGTAATC GATTACGCCAATGACGACAACCTAATTGACGTCATTCTGGAACAGGACAAAGCCAGCTTCTTCAAAGAA CTGTTATGGTTATTAAAACTCACCATGACGCTCCGCCACAGCAAAATCAAAAGTGAAGACGACTTTATTC TTTCACCCGTTAAAAACGAACAAGGCGAGTTTTACGATAGTCGAAAAGCGGGCGAGGTGTGGCCTAAAG ATGCAGACGCCAATGGCGCTTATCACATAGCGTTGAAAGGCTTGTGGAATCTGCAACAGATCAATCAGT GGGAAAAGGGTAAAACACTTAATCTGGCGATTAAAAACCAGGATTGGTTCAGTTTTATTCAAGAAAAGC CCTATCAAGAATAA SEQ ATGCACACAGGCGGATTACTTAGCATGGATGCCAAGGAGTTTACCGGACAGTACCCCCTTTCGAAGACT ID CTGCGTTTTGAACTGAGACCGATAGGCAGAACGTGGGACAATCTCGAAGCATCGGGGTATCTTGCGGAG NO: GACAGACACCGTGCAGAATGCTATCCCAGGGCAAAAGAGCTCTTGGACGACAACCATCGTGCATTCCTC 25 AACCGTGTCCTGCCTCAGATCGATATGGATTGGCACCCGATCGCAGAGGCATTCTGCAAAGTCCACAAG AATCCGGGAAACAAGGAATTGGCTCAGGATTACAATCTTCAGCTGTCCAAACGCAGAAAGGAGATTTCG GCCTATCTGCAGGATGCGGACGGCTATAAAGGTCTGTTTGCCAAACCTGCATTGGATGAAGCAATGAAG ATCGCGAAAGAAAACGGAAATGAATCGGACATAGAGGTTCTTGAGGCATTCAACGGTTTCTCCGTATAC TTCACCGGATATCATGAGAGCAGGGAGAACATCTATTCGGACGAGGATATGGTGTCGGTAGCTTATCGC ATCACCGAAGACAATTTCCCGAGATTCGTTTCCAATGCGCTTATATTCGATAAGCTGAATGAGTCGCACC CCGATATAATCTCGGAAGTATCCGGAAATCTGGGCGTAGACGACATCGGAAAATATTTTGATGTGTCTA ACTACAATAATTTCCTGTCGCAGGCCGGTATAGATGACTACAATCACATCATCGGCGGCCATACGACGG AGGACGGTCTGATCCAGGCATTCAATGTTGTTCTGAATCTCAGGCATCAGAAAGACCCCGGATTCGAAA AAATCCAATTCAAACAGCTGTACAAACAGATACTCAGCGTCCGTACATCCAAATCCTATATCCCGAAAC AGTTCGATAATTCGAAGGAGATGGTGGACTGCATCTGCGACTATGTGTCCAAGATCGAAAAATCCGAAA CGGTCGAGAGAGCATTGAAGCTGGTAAGGAACATATCTTCTTTTGATTTGCGCGGAATATTCGTAAACA AGAAGAATCTCCGCATTCTTTCCAACAAACTGATTGGTGATTGGGACGCGATCGAAACCGCGCTGATGC ACTCCTCCTCTTCGGAAAATGATAAGAAATCCGTCTACGACAGCGCCGAGGCATTTACGCTGGATGATA TCTTTTCGTCCGTTAAAAAATTCTCAGATGCATCTGCAGAGGATATCGGAAACCGGGCGGAGGACATAT GCAGAGTCATATCTGAGACCGCTCCGTTCATAAACGATCTGAGGGCTGTCGATTTGGACAGTTTGAATG ACGACGGTTACGAGGCGGCGGTTTCCAAGATAAGGGAATCTCTGGAACCATATATGGATCTGTTTCATG AACTGGAGATATTCTCCGTAGGCGATGAATTCCCGAAATGTGCAGCTTTCTACAGTGAACTTGAAGAAG TCTCCGAACAGCTAATCGAGATTATACCGTTATTCAACAAGGCCCGTTCGTTCTGTACGCGCAAGAGATA CAGTACGGACAAGATAAAGGTCAATTTGAAATTCCCGACACTCGCCGACGGATGGGATCTCAACAAAGA ACGCGACAACAAAGCCGCAATACTCAGGAAAGACGGAAAGTACTACCTGGCCATACTGGATATGAAGA AAGATCTTTCTTCGATCAGAACTTCGGATGAAGACGAATCCAGTTTTGAGAAAATGGAGTACAAGCTTC TTCCGAGTCCGGTAAAGATGCTGCCAAAGATCTTCGTAAAATCGAAGGCGGCCAAGGAGAAGTACGGTC TGACCGACCGTATGCTGGAGTGCTACGATAAAGGGATGCACAAGAGCGGCAGTGCATTCGATCTCGGAT TTTGTCACGAATTGATCGATTACTACAAGAGGTGCATCGCAGAATATCCCGGCTGGGACGTCTTCGATTT CAAGTTCAGGGAAACATCGGATTATGGCAGCATGAAGGAGTTCAATGAGGATGTTGCAGGGGCCGGAT ACTATATGTCCCTCAGAAAGATCCCTTGTTCGGAGGTCTACAGGCTTCTTGATGAGAAATCGATATATCT TTTCCAGATCTACAACAAAGATTATTCGGAAAACGCTCATGGGAATAAGAACATGCATACCATGTATTG GGAAGGGCTCTTTTCCCCCCAGAATCTGGAATCCCCTGTGTTTAAACTCAGCGGCGGTGCGGAGCTTTTC TTCCGTAAATCCTCCATACCCAATGACGCCAAAACGGTCCATCCGAAGGGAAGCGTCCTGGTTCCGCGC AATGATGTAAACGGCCGCAGGATACCTGACAGCATATATCGGGAGCTCACCAGATATTTCAACCGCGGA GATTGCCGCATAAGCGACGAGGCAAAGAGTTATCTGGACAAGGTGAAAACCAAGAAAGCTGACCACGA TATCGTGAAAGACAGGAGGTTCACGGTGGACAAGATGATGTTCCACGTCCCTATCGCCATGAATTTCAA AGCGATTTCGAAGCCGAATCTCAATAAAAAGGTGATTGACGGCATAATCGACGACCAAGATCTGAAGAT CATCGGCATAGACCGCGGAGAGCGCAACCTCATCTACGTAACCATGGTGGATCGCAAAGGGAACATCCT CTATCAGGATAGCCTCAATATTCTGAACGGATACGATTACCGTAAGGCCCTCGACGTCCGCGAATATGA CAATAAAGAGGCTCGGAGGAACTGGACGAAGGTCGAAGGCATCCGTAAGATGAAAGAGGGGTATCTGT CGCTTGCAGTCAGCAAATTGGCAGATATGATCATAGAGAACAATGCGATTATCGTCATGGAGGATCTCA ATCACGGATTCAAGGCAGGGCGTTCGAAGATAGAGAAACAGGTCTATCAGAAGTTCGAATCCATGCTCA TAAACAAACTCGGTTACATGGTCCTCAAGGATAAGTCTATCGATCAGAGCGGCGGAGCTCTCCACGGAT ACCAGCTTGCCAACCATGTGACAACATTGGCATCTGTAGGTAAACAATGTGGAGTGATATTCTACATCCC TGCTGCATTTACATCCAAGATAGATCCGACAACAGGATTTGCAGATCTGTTCGCCCTCAGCAATGTTAAA AACGTGGCATCTATGAGAGAATTTTTCTCCAAGATGAAGTCTGTAATCTATGATAAGGCGGAGGGAAAA TTCGCATTTACCTTCGACTATCTTGATTATAATGTGAAATCCGAGTGCGGAAGGACCCTTTGGACCGTGT ATACGGTCGGAGAGAGATTCACATACAGCAGGGTCAATAGAGAATATGTCAGAAAAGTTCCGACAGAC ATAATCTACGACGCATTGCAAAAGGCAGGAATATCTGTTGAAGGGGATCTCAGGGACAGGATTGCTGAA TCGGATGGCGACACTCTGAAGAGCATATTCTATGCATTCAAGTATGCATTGGATATGAGAGTAGAGAAC CGCGAAGAGGATTACATACAGTCTCCTGTCAAAAATGCCTCCGGAGAATTCTTCTGTTCCAAGAACGCA GGCAAATCGCTCCCTCAGGATTCCGATGCGAACGGTGCATACAATATCGCACTCAAGGGGATCCTGCAG CTACGTATGCTTTCCGAGCAGTATGATCCGAATGCAGAGAGCATACGGTTGCCACTGATAACCAACAAG GCCTGGCTGACCTTTATGCAGTCCGGTATGAAGACATGGAAGAACTGA SEQ atgGATAGTTTGAAAGATTTCACCAATCTGTACCCTGTCAGTAAGACATTGAGATTTGAATTAAAGCCCGT ID TGGAAAGACTTTAGAAAATATCGAGAAAGCAGGTATTTTGAAAGAGGATGAGCATCGTGCAGAAAGTT NO: ATCGGAGGGTGAAGAAAATAATTGATACTTATCATAAGGTATTTATCGATTCTTCTCTTGAAAATATGGC 26 TAAAATGGGTATTGAGAATGAAATAAAAGCAATGCTCCAAAGTTTCTGCGAATTGTATAAAAAAGATCA TCGCACTGAGGGTGAAGACAAGGCATTAGATAAAATTCGAGCAGTACTTCGTGGCCTGATTGTTGGGGC TTTCACTGGTGTTTGCGGAAGACGGGAAAATACAGTCCAAAACGAGAAGTACGAGAGTTTGTTCAAAGA AAAGTTGATAAAAGAAATTTTACCTGATTTTGTGCTCTCTACTGAGGCTGAAAGCTTGCCTTTCTCTGTTG AAGAAGCTACGAGGTCACTGAAGGAGTTTGATAGCTTTACATCCTACTTTGCTGGTTTTTACGAGAATAG AAAGAATATATACTCGACGAAACCTCAATCCACTGCCATTGCTTATCGTCTTATTCATGAGAACTTGCCG AAGTTCATTGATAATATTCTTGTTTTTCAGAAGATCAAAGAGCCTATAGCCAAAGAGCTGGAACATATTC GTGCGGACTTTTCTGCCGGGGGGTACATAAAAAAGGATGAGAGATTGGAGGATATTTTTTCGTTGAACT ATTATATCCACGTGTTATCTCAGGCTGGGATCGAAAAATATAACGCATTGATTGGGAAGATTGTGACAG AAGGAGATGGAGAGATGAAAGGGCTCAATGAACACATCAACCTTTACAACCAACAAAGAGGCAGAGAG GATCGGCTCCCTCTTTTTAGGCCTCTTTATAAACAGATATTGAGTGACAGAGAGCAATTATCATACTTGC CTGAGAGTTTTGAAAAAGATGAGGAGCTCCTCAGGGCTCTAAAAGAGTTCTATGATCATATCGCAGAAG ACATTCTCGGACGTACTCAACAGTTGATGACTTCTATTTCAGAATATGATTTATCTCGGATATACGTAAG GAACGATAGCCAATTGACTGATATATCAAAAAAAATGTTGGGAGATTGGAATGCTATCTACATGGCTAG AGAACGAGCATATGACCACGAGCAGGCTCCCAAAAGAATCACGGCGAAATACGAGAGGGACAGGATTA AAGCTCTTAAAGGAGAAGAGAGTATAAGTCTGGCAAATCTTAATAGTTGTATTGCCTTTCTGGACAATGT TAGAGATTGCCGTGTAGATACTTATCTTTCCACACTGGGCCAGAAGGAAGGACCACATGGTCTATCTAAT CTCGTTGAGAACGTTTTTGCCTCATACCATGAAGCAGAGCAATTGTTGAGCTTTCCATACCCCGAAGAGA ATAATCTGATTCAGGACAAGGACAATGTGGTGTTAATTAAGAATCTTCTCGACAATATCAGTGATCTGCA GAGGTTCTTGAAACCTCTTTGGGGTATGGGAGACGAACCCGATAAAGATGAAAGATTTTATGGAGAGTA TAATTATATCCGAGGAGCTCTAGATCAGGTGATCCCTCTGTACAATAAGGTAAGGAACTACCTCACTCG GAAGCCTTATTCGACCAGAAAAGTAAAACTCAATTTTGGGAATTCTCAATTGCTTAGTGGTTGGGATAG AAATAAGGAAAAGGATAATAGCTGTGTGATTTTGCGTAAGGGGCAGAACTTCTATTTGGCTATTATGAA CAATAGGCACAAAAGAAGTTTCGAAAACAAGGTGTTGCCCGAGTATAAGGAGGGAGAACCTTACTTCG AAAAGATGGATTATAAATTTTTGCCTGATCCTAATAAAATGCTTCCTAAGGTTTTTCTTTCGAAAAAAGG AATAGAGATATACAAACCAAGTCCGAAGCTTTTAGAACAATATGGACATGGAACTCACAAAAAGGGAG ATACCTTTAGTATGGATGATTTGCACGAACTGATCGATTTCTTCAAACACTCAATCGAGGCTCATGAAGA TTGGAAGCAATTCGGATTCAAATTTTCTGATACGGCTACTTATGAGAATGTATCTAGTTTCTATAGAGAA GTTGAGGATCAGGGGTATAAGCTCTCTTTCCGAAAAGTTTCGGAATCTTATGTCTATTCATTAATAGATC AAGGCAAGTTGTATTTATTTCAGATATACAACAAGGACTTTTCTCCCTGCAGCAAAGGGACACCTAATCT GCATACCTTGTATTGGAGAATGCTTTTTGACGAGCGCAATTTGGCAGATGTCATATACAAACTGGATGGG AAGGCTGAAATCTTTTTCCGAGAGAAGAGTTTGAAAAATGATCATCCCACGCATCCGGCTGGTAAGCCT ATCAAAAAGAAAAGTCGACAAAAAAAAGGAGAGGAGAGTCTGTTTGAGTATGATTTAGTCAAGGATAG GCACTATACGATGGATAAGTTCCAGTTTCATGTGCCTATTACTATGAATTTTAAATGTTCTGCAGGAAGC AAAGTCAATGATATGGTTAATGCTCATATTCGAGAGGCAAAGGATATGCATGTCATTGGAATTGATCGT GGAGAACGCAATCTGCTGTATATATGCGTGATAGATAGTCGAGGGACGATTTTGGATCAAATTTCTCTG AATACGATTAACGATATAGACTATCATGATTTATTGGAGAGTCGAGACAAAGACCGTCAGCAGGAGCGC CGAAACTGGCAAACTATCGAAGGGATCAAGGAGCTAAAACAAGGCTACCTTAGTCAGGCGGTTCATCG GATAGCCGAACTGATGGTGGCTTATAAGGCTGTAGTTGCTTTGGAGGATTTGAATATGGGGTTCAAACG TGGGCGGCAGAAAGTAGAAAGTTCTGTTTATCAGCAGTTTGAGAAACAGCTGATAGATAAGCTCAACTA TCTTGTGGACAAGAAGAAAAGGCCTGAAGATATTGGAGGATTGTTGAGAGCCTATCAATTTACGGCCCC ATTTAAGAGTTTTAAGGAAATGGGAAAGCAAAACGGCTTCTTGTTTTATATCCCGGCTTGGAACACGAG CAACATAGATCCGACTACTGGATTTGTTAATTTATTTCATGCCCAGTATGAAAATGTAGATAAAGCGAAG AGCTTCTTTCAAAAGTTTGATTCAATTAGTTACAACCCGAAGAAAGACTGGTTTGAGTTTGCATTCGATT ATAAAAACTTTACTAAAAAGGCTGAAGGAAGTCGTTCTATGTGGATATTATGCACACATGGTTCCCGAA TAAAGAATTTTAGAAATTCCCAGAAGAATGGTCAATGGGATTCCGAAGAATTCGCCTTGACGGAGGCTT TTAAGTCTCTTTTTGTGCGATATGAGATAGATTATACCGCTGATTTGAAAACAGCTATTGTGGACGAAAA GCAAAAAGACTTCTTCGTGGATCTTCTGAAGCTATTCAAATTGACAGTACAGATGCGCAACAGCTGGAA AGAGAAGGATTTGGATTATCTAATCTCTCCTGTAGCAGGGGCTGATGGCCGTTTCTTCGATACAAGAGA GGGAAATAAAAGTCTGCCTAAGGATGCAGATGCCAATGGAGCTTATAATATTGCCCTAAAAGGACTTTG GGCTCTACGCCAGATTCGGCAAACTTCAGAAGGCGGTAAACTCAAATTGGCGATTTCCAATAAGGAATG GCTACAGTTTGTGCAAGAGAGATCTTACGAGAAAGACtga SEQ atgaataatggaacaaataactttcagaattttatcggaatttcttctttgcagaagactcttaggaatgctctcattccaaccgaaacaacacagcaa ID tttattgttaaaaacggaataattaaagaagatgagctaagaggagaaaatcgtcagatacttaaagatatcatggatgattattacagaggtttcatt NO: tcagaaactttatcgtcaattgatgatattgactggacttctttatttgagaaaatggaaattcagttaaaaaatggagataacaaagacactcttata 27 aaagaacagactgaataccgtaaggcaattcataaaaaatttgcaaatgatgatagatttaaaaatatgttcagtgcaaaattaatctcagatattctt cctgaatttgtcattcataacaataattattctgcatcagaaaaggaagaaaaacacaggtaattaaattattttccagatttgcaacgtcattcaagg actattttaaaaacagggctaattgtttttcggctgatgatatatcttcatcttcttgtcatagaatagttaatgataatgcagagatattttttagta atgcattggtgtataggagaattgtaaaaagtctttcaaatgatgatataaataaaatatccggagatatgaaggattcattaaaggaaatgtctctgg aagaaatttattcttatgaaaaatatggggaatttattacacaggaaggtatatctttttataatgatatatgtggtaaagtaaattcatttatgaatt tatattgccagaaaaataaagaaaacaaaaatctctataagctgcaaaagcttcataaacagatactgtgcatagcagatacttcttatgaggtgccgt ataaatttgaatcagatgaagaggtttatcaatcagtgaatggatttttggacaatattagttcgaaacatatcgttgaaagattgcgtaagattggag acaactataacggctacaatcttgataagatttatattgttagtaaattctatgaatcagtttcacaaaagacatatagagattgggaaacaataaata ctgcattagaaattcattacaacaatatattacccggaaatggtaaatctaaagctgacaaggtaaaaaaagcggtaaagaatgatctgcaaaaaagca ttactgaaatcaatgagcttgttagcaattataaattatgttcggatgataatattaaagctgagacatatatacatgaaatatcacatattttgaata attttgaagcacaggagcttaagtataatcctgaaattcatctggtggaaagtgaattgaaagcatctgaattaaaaaatgttctcgatgtaataatga atgcttttcattggtgttcggttttcatgacagaggagctggtagataaagataataatttttatgccgagttagaagagatatatgacgaaatatatc cggtaatttcattgtataatcttgtgcgtaattatgtaacgcagaagccatatagtacaaaaaaaattaaattgaattttggtattcctacactagcgg atggatggagtaaaagtaaagaatatagtaataatgcaattattctcatgcgtgataatttgtactatttaggaatatttaatgcaaaaaataagcctg acaaaaagataattgaaggtaatacatcagaaaataaaggggattataagaagatgatttataatcttctgccaggaccaaataaaatgatccccaagg tattcctctcttcaaaaaccggagtggaaacatataagccgtctgcctatatattggagggctataaacaaaacaagcatattaaatcctctaaggatt ttgatataacattttgtcacgatttgattgattattttaagaactgtatagcaatacatcctgaatggaagaattttggctttgatttttctgacacct ccacatatgaagatatcagcggattttacagagaagtcgaattacaaggttataaaatcgactggacatatatcagcgaaaaggatattgatttgttgc aggaaaaaggacagttatatttattccaaatatataacaaagatttttccaagaaaagtaccggaaatgataatcttcatactatgtatttgaagaatt tgtttagtgaagagaatttaaaggatattgtactgaaattaaacggtgaggcggaaatcttctttagaaaatcaagcataaagaatccaataattcata aaaaaggctctattcttgttaatagaacatatgaagcagaggaaaaagatcaatttggaaatatccagatagtcagaaaaaacataccggaaaatatat atcaggagctttataaatatttcaatgataaaagtgataaagaactttcggatgaagcagctaagcttaagaatgtagtaggtcatcatgaggctgcta caaacatagtaaaagattatagatatacatatgataaatattttcttcatatgcctattacaatcaattttaaagccaataagacaggctttattaatg acagaatattacaatatattgctaaagaaaaggatttgcatgtaataggcattgatcgtggtgaaagaaacctgatatatgtttcagtaattgatactt gtggaaatattgttgaacaaaatcgtttaacattgttaatggatatgattatcagattaagctcaagcagcaggagggggcgcgacaaatcgcacgaaa agaatggaaagaaatcggcaaaataaaagaaattaaagaaggctatttatctcttgtaattcatgaaatttcaaagatggttattaaatataatgccat aattgcaatggaggatttaagctacggatttaaaaaaggtcgtttcaaggttgagcgacaggtttaccagaagtttgagacaatgcttatcaacaaact caactatctggtatttaaagatatatccataacggaaaacggtggtcttctaaagggataccagcttacatatattccagataaactgaaaaatgtggg tcatcaatgtggctgtatattttatgtacctgctgcctatacatcaaaaatagatcctacaaccggatttgtaaatatattcaaatttaaagatttaac agttgatgcgaagagagaatttataaaaaaatttgacagtatcagatatgattcagaaaaaaatctgttttgttttacattcgattataataactttat tacgcaaaatactgttatgtcaaagtcaagctggagtgtatatacgtacggagttaggataaaaagaagatttgtcaatggcaggttctcaaatgaatc ggatacaattgatataacaaaagatatggaaaaaacactcgaaatgacagatataaattggagagatggtcatgatctgaggcaggatattattgatta tgaaatcgtacaacacatatttgagatttttagattgactgtacaaatgagaaacagtttaagtgaattagaagacagggattatgaccgtttgatttc tccggtgctcaatgaaaataatatattttatgattcagctaaagcaggagatgcgttacctaaagacgcagatgctaatggtgcatattgtatagctct aaaaggcttgtatgaaatcaaacaaattacagagaattggaaagaagacggtaagttttcaagagataaacttaaaatttccaataaggactggtttga ctttattcaaaataaaaggtatttataa SEQ atgacaaacaaatttacaaaccagtactcgctttccaaaacacttcgatttgagttgattccacaaggaaaaacattggaatttattcaagaaaaagga ID ttgctctctcaagataaacaacgagcggagagttatcaagaaatgaaaaaaactattgataaatttcataaatactttatcgatttagctttaagcaat NO: gctaaactaactcatttagaaacttacttggaattatacaataaaagtgctgaaacaaaaaaagaacaaaaatttaaagacgatttaaagaaagtacaa 28 gacaatttacgaaaagaaatcgttaaatctttttcagatggtgatgcaaaatcaatttttgcaattttggataaaaaagaactgattaccgtagaactt gaaaaatggtttgaaaacaacgaacaaaaagacatttattttgacgaaaaattcaaaacgtttactacttattttactggttttcatcaaaacagaaaa aacatgtattcggttgaacccaattctacagcaattgcttttcgattgattcatgaaaatttacctaaattttagaaaatgctaaagcatttgaaaaaa taaaacaagtagaaagtttgcaagttaattttagagaattaatgggggaataggagatgaagggctaattttcgtaaatgaattagaagaaatgtttca aatcaattattataatgatgtgctttcacaaaatggaattacaatttataatagtataatttcaggatttaccaaaaatgatataaaatataaaggtct aaatgaatacataaataattacaatcaaaccaaagacaaaaaagaccgtttgccaaaattaaaacaattgtataaacagattttgagtgataggatttc actttcgtttttgcccgatgcttttacggatgggaaacaagttttgaaagccatatttgacttttataaaatcaacttactttcttataccattgaagg acaggaagaaagccaaaatcttttactattaattcgtcagacaattgaaaacctttctagttttgatacccaaaaaatttatctaaaaaatgataccca tttaaccactatttcacaacaagtatttggcgatttttcggtgttttcaactgctttaaattattggtatgaaactaaagtaaatccaaaatttgaaac ggaatatagcaaagccaacgaaaaaaaacgagaaattttagataaagccaaagcggtatttacaaaacaagattatttttcaattgcttttttacaaga agtactttcggaatacattcttaccttagatcacacttctgatattgtaaaaaagcattcctccaactgtattgcggatttttaaaaatcattttgtag ccaaaaaagaaaatgaaaccgacaaaacctttgattttattgctaatattactgcaaaataccaatgtattcaaggtattttagaaaatgcagaccaat aacgaagacgaactcaacaagaccaaaaattaattgataatttgaaattctttttagatgctattttagaattgttgcattttattaaacctttgcatt taaaatcagaaagcattaccgaaaaagacactgctttttatgatgtgtttgaaaattattacgaagcattgagtttgttgaccccattatataatatgg tgcgaaactatgtaacgcaaaagccgtacagcaccgaaaaaataaaattaaattttgaaaatgcacaattattgaatggttgggatgccaataaagaag gtgattacctaactaccattttgaaaaaagacggtaattattttttagccataatggataaaaagcataacaaagcgtttcaaaagtttccagaaggaa aagaaaattatgaaaaaatggtgtataaactattgcctggagtaaataagatgttgccaaaagtatttttttccaataaaaatattgcttacttcaacc catcaaaagagttattagaaaactataaaaaagagacgcacaaaaaaggagacacattcaatttagaacattgtcatacgttgatcgattttttcaagg actctttaaacaaacatgaagactggaaatactttgattttcaattttctgaaacaaaatcgtatcaagatttgagtggtttttatagagaagtagaac atcaaggctacaaaatcaattttaaaaatatcgattcagaatatattgatggtttggtgaacgaaggtaaattgtttctatttcaaatttacagcaaag atttttcgcctttttccaaagggaaaccgaacatgcacactttgtattggaaagccttatttgaagaacaaaatttgcaaaatgtaatctataaattga atggacaagccgaaatattttttagaaaagcctctataaaacctaaaaatataatattgcacaaaaagaaaattaaaattgccaaaaagcattttattg ataaaaaaacaaaaacatctgaaattgttcctgttcaaacaataaaaaacctcaatatgtactaccaaggaaaaataagtgaaaaagaattaacacaag atgatttaaggtatattgataattttagcattttcaatgaaaaaaataaaacaattgatattataaaagacaaacgatttacggttgataaatttcagt ttcatgtgccgattaccatgaactttaaagcaacgggcggaagttatatcaatcaaaccgtattagaatatttgcaaaacaatcccgaagttaagatta ttggattggatagaggcgaacgccatttggtatatctgacactgatagaccagcaaggaaacatcttgaaacaagaaagtttgaatacaatcaccgatt ctaaaatctcgacaccttatcataagttgttggataacaaggaaaacgagcgtgacttggctcgaaaaaattggggaacggtggaaaacatcaaagaac tcaaagaaggctacatcagtcaagtggtgcataaaattgctacgttgatgctggaagaaaatgccattgtggtaatggaagatttgaattttggattta aacgtggacgttttaaagtggaaaaacaaatttatcaaaagctggaaaaaatgttgattgacaaattgaattatttggttttaaaagacaaacaacctc aggaattaggcggattgtacaacgcattacaactcaccaataaatttgaaagtttccaaaaaatgggtaaacaatcgggctttttgttttatgtacccg cttggaacacctccaaaatagacccaaccacagggtttgtcaattatttttataccaaatatgaaaatgttgacaaagccaaagccttttttgaaaaat ttgaggcgattcgtttcaatgcagaaaagaagtattttgaatttgaagtaaaaaaatatagcgattttaacccaaaagccgaaggcactcaacaagcct ggaccatttgcacgtatggcgaacgaatagaaaccaaacgacaaaaagaccaaaacaacaaatttgtaagcactccaattaatctaaccgaaaagatag aagactttttgggtaaaaaccaaattgtttatggtgatggtaattgcatcaaatctcaaattgctagcaaagacgacaaggctttttttgaaaccttat tgtattggttcaaaatgactttacaaatgcgaaacagcgaaacaagaacagatatagattatctaatttcgcccgtgatgaatgacaacggaacatttt acaacagccgagattatgaaaaattagaaaatccaactttgcccaaagatgccgatgccaacggagcgtatcatattgccaaaaaaggattgatgcttt tgaataaaatagaccaagccgacttgacaaaaaaagtggatttatctattagtaacagagattggttgcaatttgtacaaaaaaataaataa SEQ atggaacaggagtactatttaggactggatatgggaaccggatctgtaggatgggctgttacagattcggaatatcatgtcttgcgtaaacatggaaaa ID gcactatggggagtccgattatttgaaagtgcatcgacagcagaagaacgaagaatgttccgaacatcaagaagaagactagatcgaagaaactggaga NO: attgaaattttacaggaaatttttgcagaggaaataagtaagaaagatccaggatttttcttgcgaatgaaagaaagcaaatattatccagaagataag 29 cgagatatcaatggaaattgtccggaactgccatatgcattatttgttgatgacgattttacagataaagattatcataaaaaatttccgacaatttat catctcaggaaaatgttgatgaatacagaggagacaccggatatccggttggtgtatctggcaattcatcatatgatgaagcataggggccatttcttg ttatctggtgacattaatgagattaaggagttcggaacgacattttcaaaattgttggagaatatcaaaaatgaggaattggattggaatcttgaactg ggaaaagaagaatatgctgttgtagaaagtattttaaaagataacatgttaaaccgatccacaaagaaaaccagattaataaaagcattaaaagcaaaa tcaatatgtgaaaaggctgtactgaatttattggctggtggaacggtgaaattgagtgatatatttggtcttgaagaattaaatgagacagaaagaccg aagatttcctttgctgataatggatacgatgattatatcggagaagttgaaaatgagctgggagaacaattctatattatagagacggcaaaagcagtg tatgactgggcggtattagttgaaatattgggaaaatatacgtcaatttcagaagcgaaagtagcaacgtatgaaaaacataaatcggatttacaattt ttgaaaaagatagttcggaaatatctgacaaaggaggaatataaagatatttttgtaagtacgagtgacaaattgaaaaattactctgcttatatagga atgacgaaaataaatggaaaaaaggttgatttgcagagcaaacggtgcagtaaagaagaattctatgattttattaagaaaaacgtacttaaaaagcta gaaggacaacctgaatatgaatatttgaaagaagagctagaaagagaaacatttctaccaaaacaggtgaacagggataatggtgtaataccgtatcag attcatttgtacgagttgaaaaagatattaggaaatttacgggataaaatagacctcattaaagagaacgaagataaactggttcaattatttgaattc agaattccgtattatgttggtccgctgaataagatagatgacggaaaagagggaaaatttacatgggctgtacggaaaagtaatgaaaagatatatcca tggaattttgaaaatgtagttgatatagaagcaagtgcagaaaaatttatccggagaatgacaaataagtgtacatatctgatgggcgaagatgtattg ccgaaggattcattgctttacagtaaatatatggttttaaatgaattaaataatgtaaagttggatggcgaaaaattatctgtagaattgaaacaacgg ttgtatacagatgtattttgtaagtatcggaaagtaactgtaaagaagataaaaaattacttgaaatgtgaaggtatcatatccggcaatgtcgaaata actggaattgatggtgattttaaggcatcgttaacggcatatcatgattttaaagaaatcttgacaggaacagaattggctaaaaaggacaaagaaaat attattaccaatatagtattgtttggagatgataaaaagctgctgaaaaagagactgaatcgattatatcctcagattacgccgaatcagttgaagaaa atatgtgcgctatcctatacaggctggggaagattttctaaaaagttcttagaagaaataacagctccagatccggaaacgggagaggtatggaatatc attacggcattgtgggaatcgaataataatctgatgcaattattaagtaatgaatatcggtttatggaagaagtcgaaacatacaatatgggaaaacag actaaaacattgtcgtacgaaacagtagagaatatgtatgtttctccatctgtgaaaagacagatatggcagacgctgaaaatcgtgaaagaattagaa aaagtaatgaaagaatctccgaaacgtgtatttattgagatggcgagagaaaagcaagaaagtaagagaaccgaatcgcgtaaaaaacaactaatagat ttgtataaggcttgtaaaaatgaagaaaaagattgggtaaaagaactgggagatcaggaagaacagaaattacgaagcgataagttgtacctatattat acgcaaaagggtcgttgtatgtattctggcgaggtaatagaactgaaagacttatgggataatacaaaatatgatattgatcatatatatccacaatct aaaacgatggatgacagtcttaataatcgcgtattggtaaaaaagaaatataatgcaacaaaatcagataagtatccattaaatgaaaatatacgacat gagagaaaaggcttttggaagtcactgttagatggagggtttataagtaaagaaaaatatgaacgcttaataagaaatacagaattgagtccggaagaa ttagcaggatttattgaaaggcagattgttgaaacgaggcagagtacaaaagctgtagcggaaatattaaagcaagtgtttccggaaagtgaaattgta tatgtcaaagcaggtacggtttcaagattcagaaaagattttgaattactgaaagttcgagaagtgaatgatttgcatcacgcaaaggatgcgtattta aatattgtagttggtaatagttattatgtgaaatttactaagaatgcatcatggtttataaaagaaaatccgggacgtacttacaacttaaaaaagatg tttacatcaggttggaatattgaacgaaatggagaagttgcatgggaagtcgggaaaaaaggaacaattgtaacggtaaaacaaataatgaataaaaat aatatattggtgacaagacaggttcatgaagcgaaaggtgggctgtttgatcagcagattatgaaaaaaggaaaaggtcagattgctataaaggaaact gatgaacgtcttgcatcaatagaaaagtatggaggctataataaagctgccggggcatattttatgctggtagaatctaaagataaaaaaggaaaaaca attcgaacgatagaatttataccattatatttaaagaataaaatcgagtcggatgaatcaatagcattgaactttttagaaaaaggcagaggtttgaaa gaaccaaagatactattgaaaaaaattaagattgatacattatttgatgtggacggattcaaaatgtggttgtctggaagaacaggggacagactacta tttaaatgtgcaaatcaattgattttggatgagaaaataattgtaacaatgaaaaaaattgtaaagtttattcaaaggagacaagaaaatagagaatta aaattatctgataaagatggaattgataatgaagtacttatggaaatatataacacttttgtggataagttagaaaacacagtgtatagaatacgatta tccgaacaggcaaaaacgcttatagataaacaaaaagaatttgaaaggttatcactagaggataaaagtagtactttgtttgaaattttacatattttt cagtgtcaaagtagtgcggccaatttaaaaatgataggcggacctggaaaagcaggaatattagttatgaataataatataagtaagtgtaacaaaatt tctattataaatcagtctccaacaggaattttcgaaaatgagattgatttgttaaagat SEQ ATGAAATCTTTCGATTCATTCACAAATCTTTATTCTCTTTCAAAAACCTTGAAATTTGAGATGAGACCTGT ID CGGAAATACCCAAAAAATGCTCGACAATGCAGGAGTATTTGAAAAAGACAAACTAATTCAAAAAAAGT NO: ACGGAAAAACAAAGCCGTATTTCGACAGACTCCACAGAGAATTTATAGAAGAAGCGCTCACGGGGGTA 30 GAGCTAATAGGACTAGATGAGAACTTTAGGACACTTGTTGACTGGCAAAAAGATAAGAAAAATAATGTC GCAATGAAAGCGTATGAAAATAGTTTGCAGCGGCTGAGAACGGAAATAGGTAAAATATTTAACCTAAA GGCTGAGGATTGGGTAAAGAACAAATATCCAATATTAGGGCTGAAAAATAAAAATACCGATATTTTATT CGAAGAGGCTGTATTCGGGATATTGAAAGCCCGATATGGAGAAGAAAAAGATACTTTTATAGAAGTAG AGGAAATAGATAAAACCGGCAAATCAAAGATCAATCAAATATCAATTTTCGATAGTTGGAAAGGATTTA CAGGATATTTCAAAAAATTTTTTGAAACCAGAAAGAATTTTTACAAAAACGACGGAACTTCTACAGCAA TTGCTACAAGGATCATTGATCAAAATCTGAAAAGATTCATAGATAATCTGTCAATAGTTGAAAGTGTGA GACAAAAGGTTGATCTCGCCGAGACAGAAAAATCTTTCAGCATATCTCTATCGCAATTCTTCTCAATAGA CTTTTATAACAAGTGTCTCCTTCAAGATGGTATTGATTACTACAACAAGATAATCGGTGGAGAAACTCTC AAAAATGGCGAAAAACTAATAGGTCTCAATGAACTAATAAATCAATATAGGCAGAATAATAAGGATCA GAAAATCCCATTTTTCAAACTTCTTGATAAACAAATTTTGAGTGAAAAGATATTATTTTTGGATGAAATA AAAAATGACACAGAACTGATCGAGGCGCTGAGTCAGTTCGCAAAAACAGCCGAAGAAAAAACAAAAAT TGTCAAAAAGCTTTTTGCCGATTTTGTAGAAAATAATTCCAAATACGATCTTGCACAGATTTATATTTCC CAAGAAGCATTCAATACTATATCAAACAAGTGGACAAGCGAAACTGAGACGTTCGCTAAATATCTATTC GAAGCAATGAAGAGTGGAAAACTTGCAAAGTATGAGAAAAAAGATAATAGCTATAAATTTCCTGATTTT ATTGCCCTTTCACAGATGAAGAGTGCTTTATTAAGTATCAGCCTTGAGGGACATTTTTGGAAAGAGAAAT ACTACAAAATTTCAAAATTCCAAGAGAAGACCAATTGGGAGCAGTTTCTTGCAATTTTTCTATACGAGTT TAACTCTCTTTTCAGCGACAAAATAAATACAAAAGATGGAGAAACAAAGCAAGTTGGATACTATCTATT TGCCAAAGACCTGCATAATCTTATCTTAAGTGAGCAGATTGATATTCCAAAAGATTCAAAAGTCACAAT AAAAGATTTTGCCGATTCTGTACTCACAATCTACCAAATGGCAAAATATTTTGCGGTAGAAAAAAAACG AGCGTGGCTTGCCGAGTATGAACTAGATTCATTTTATACCCAGCCAGACACAGGCTATTTACAGTTTTAT GATAACGCCTACGAGGATATTGTGCAGGTATACAACAAGCTTCGAAACTATCTGACCAAAAAGCCATAT AGCGAGGAGAAATGGAAGTTGAATTTTGAAAATTCTACGCTGGCAAATGGATGGGATAAGAACAAAGA ATCTGATAATTCAGCAGTTATTCTACAAAAAGGTGGAAAATATTATTTGGGACTGATTACTAAAGGACA CAACAAAATTTTTGATGACCGTTTTCAAGAAAAATTTATTGTGGGAATTGAAGGTGGAAAATATGAAAA AATAGTCTATAAATTTTTCCCCGACCAGGCAAAAATGTTTCCCAAAGTGTGCTTTTCTGCAAAAGGACTC GAATTTTTTAGACCGTCTGAAGAAATTTTAAGAATTTATAACAATGCAGAGTTTAAAAAAGGAGAAACT TATTCAATAGATAGTATGCAGAAGTTGATTGATTTTTATAAAGATTGCTTGACTAAATATGAAGGCTGGG CATGTTATACCTTTCGGCATCTAAAACCCACAGAAGAATACCAAAACAATATTGGAGAGTTTTTTCGAG ATGTTGCAGAGGACGGATACAGGATTGATTTTCAAGGCATTTCAGATCAATATATTCATGAAAAAAACG AGAAAGGCGAACTTCACCTTTTTGAAATCCACAATAAAGATTGGAATTTGGATAAGGCACGAGACGGAA AGTCAAAAACAACACAAAAAAACCTTCATACACTCTATTTCGAATCGCTCTTTTCAAACGATAATGTTGT TCAAAACTTTCCAATAAAACTCAATGGTCAAGCTGAAATTTTTTATAGACCGAAAACGGAAAAAGACAA ATTAGAATCAAAAAAAGATAAGAAAGGGAATAAAGTGATTGACCATAAACGCTATAGTGAGAATAAGA TTTTTTTTCATGTTCCTCTCACACTAAACCGCACTAAAAATGACTCATATCGCTTTAATGCTCAAATCAAC AACTTTCTCGCAAATAATAAAGATATCAACATCATCGGTGTAGATAGGGGAGAAAAGCATTTAGTCTAT TATTCGGTGATTACACAAGCTAGTGACATCTTAGAAAGTGGCTCACTAAATGAGCTAAATGGCGTGAAT TATGCTGAAAAACTGGGAAAAAAGGCAGAAAATCGAGAACAAGCACGCAGAGACTGGCAAGACGTAC AAGGGATCAAAGACCTCAAGAAAGGATATATTTCACAGGTGGTGCGAAAGCTTGCTGATTTAGCAATTA AACACAATGCCATTATCATTCTTGAAGATTTGAATATGAGATTTAAACAAGTTCGGGGCGGTATCGAAA AATCCATTTATCAACAGTTAGAAAAAGCACTGATAGATAAATTAAGCTTTCTTGTAGACAAAGGTGAAA AAAATCCCGAGCAAGCAGGACATCTTCTGAAAGCATATCAGCTTTCGGCCCCATTTGAGACATTTCAAA AAATGGGCAAACAGACGGGTATAATCTTTTATACACAAGCTTCGTATACCTCAAAAAGTGACCCTGTAA CAGGTTGGCGACCACACCTGTATCTCAAATATTTCAGTGCCAAAAAAGCAAAAGACGATATTGCAAAGT TTACAAAAATAGAATTTGTAAACGATAGGTTTGAGCTTACCTATGATATAAAGGACTTTCAGCAAGCAA AAGAATATCCAAATAAAACTGTTTGGAAAGTTTGCTCAAATGTAGAGAGATTCAGGTGGGACAAAAACC TCAATCAAAACAAAGGCGGATATACTCACTACACAAATATAACTGAGAATATCCAAGAGCTTTTTACAA AATATGGAATTGATATCACAAAAGATTTGCTCACACAGATTTCTACAATTGATGAAAAACAAAATACCT CATTTTTTAGAGATTTTATTTTTTATTTCAACCTTATTTGCCAAATCAGAAATACCGATGATTCTGAGATT GCTAAAAAGAATGGGAAAGATGATTTTATACTGTCACCTGTTGAGCCGTTTTTCGATAGCCGAAAAGAC AATGGAAATAAACTTCCTGAGAATGGAGATGATAACGGCGCGTATAACATAGCAAGAAAAGGGATTGT CATACTCAACAAAATCTCACAATATTCAGAGAAAAACGAAAATTGCGAGAAAATGAAATGGGGGGATT TGTATGTATCAAACATTGACTGGGACAATTTTGTAACCCAAGCTAATGCACGGCATTAA SEQ ATGATTATCTTATATATTAGTACCTCGAATATGAACATGGAAGGAGTATTTATGGAAAATTTTAAAAACT ID TGTATCCAATAAACAAAACACTTCGATTTGAATTAAGACCCTATGGAAAAACATTGGAAAATTTTAAAA NO: AATCCGGACTTTTAGAAAAAGATGCCTTTAAGGCAAATAGTAGACGAAGTATGCAAGCTATAATCGATG 31 AAAAATTCAAAGAGACTATCGAAGAACGCTTAAAGTACACTGAATTCAGTGAATGTGATCTTGGAAACA TGACATCAAAAGATAAAAAAATAACTGATAAAGCAGCTACAAATTTAAAAAAGCAAGTTATCTTATCTT TTGACGATGAAATATTTAATAATTACCTAAAACCTGATAAAAATATTGACGCATTATTTAAAAATGATCC TTCAAATCCTGTAATCTCTACATTTAAAGGTTTTACGACATATTTTGTGAATTTTTTTGAAATTCGAAAAC ATATTTTCAAGGGAGAATCATCAGGCTCAATGGCATACCGAATTATAGATGAAAACCTGACAACATACT TGAATAATATTGAAAAAATAAAAAAACTGCCAGAAGAATTAAAATCACAGCTAGAAGGCATTGATCAG ATTGATAAACTTAATAATTATAATGAGTTCATTACACAGTCAGGTATAACACACTATAATGAAATCATCG GCGGTATATCAAAATCAGAGAATGTCAAAATACAGGGAATTAATGAAGGAATTAATCTATACTGTCAGA AGAACAAAGTTAAACTTCCTCGACTGACTCCGCTATACAAAATGATATTATCAGACAGAGTTTCCAACTC TTTTGTATTAGACACTATTGAAAATGACACAGAATTAATTGAAATGATAAGTGATTTGATTAATAAGACT GAGATTTCGCAAGATGTTATAATGTCAGATATTCAAAATATTTTCATAAAATACAAACAACTTGGTAATT TGCCGGGTATCTCATATTCTTCAATAGTTAATGCTATTTGCTCGGATTATGACAACAATTTCGGAGATGG GAAGCGAAAAAAATCTTACGAAAATGATCGCAAAAAGCATTTGGAGACTAATGTATACTCCATAAATTA TATTTCTGAATTGCTTACAGATACCGATGTTTCATCAAATATCAAGATGAGATATAAAGAGCTTGAGCAA AATTATCAGGTTTGCAAAGAAAATTTTAATGCCACAAACTGGATGAATATTAAAAATATAAAACAATCT GAAAAAACAAACCTTATTAAAGATTTGTTAGATATACTTAAATCGATTCAACGTTTCTATGATTTGTTTG ATATTGTTGACGAAGATAAAAATCCAAGTGCTGAATTTTATACCTGGTTATCAAAAAATGCTGAAAAGC TTGACTTTGAATTCAATTCTGTATATAACAAGTCACGAAACTATCTCACCAGGAAACAATACTCTGATAA AAAAATCAAGCTGAATTTTGATTCTCCAACATTGGCCAAAGGGTGGGATGCTAACAAAGAAATAGATAA CTCCACGATTATAATGCGTAAATTTAATAATGACAGAGGCGATTATGATTACTTCCTTGGCATATGGAAT AAATCCACACCTGCAAATGAAAAAATAATCCCACTGGAGGATAATGGATTATTCGAAAAAATGCAATAT AAGCTGTATCCAGATCCTAGTAAGATGTTACCGAAACAATTTCTATCAAAAATATGGAAGGCAAAGCAT CCTACGACACCTGAATTTGATAAAAAATATAAAGAGGGAAGACATAAAAAAGGTCCTGATTTCGAAAA AGAATTCCTGCATGAATTGATTGATTGCTTCAAACATGGTCTTGTTAATCACGATGAAAAATATCAGGAT GTTTTTGGCTTCAATCTCCGTAACACTGAAGATTATAATTCATATACAGAGTTTCTCGAAGATGTGGAAA GATGCAATTACAATCTTTCATTTAACAAAATTGCTGATACTTCAAACCTTATTAATGATGGGAAATTGTA TGTATTTCAGATATGGTCAAAAGACTTTTCTATTGATTCAAAAGGTACTAAAAACTTGAATACAATCTAT TTTGAATCACTATTTTCAGAAGAAAACATGATAGAAAAAATGTTCAAGCTTTCTGGAGAGGCTGAGATA TTCTATCGACCAGCATCGTTGAATTATTGTGAAGATATCATAAAAAAAGGTCATCACCATGCAGAATTA AAAGATAAGTTTGACTATCCTATAATAAAAGATAAGCGATATTCACAAGATAAGTTTTTCTTTCATGTGC CAATGGTTATAAATTATAAATCTGAGAAACTGAATTCCAAAAGCCTTAACAACCGAACAAATGAAAACC TGGGACAGTTTACACATATTATAGGTATAGACAGGGGCGAGCGGCACTTGATTTATTTAACTGTTGTTGA TGTTTCCACTGGTGAAATCGTTGAACAGAAACATCTGGACGAAATTATCAATACTGATACCAAGGGAGT TGAACACAAAACCCATTATTTGAATAAATTGGAAGAAAAATCTAAAACAAGAGATAACGAGCGTAAAT CATGGGAAGCTATTGAAACTATCAAAGAATTAAAAGAAGGCTATATTTCTCATGTAATTAATGAAATAC AAAAGCTGCAAGAAAAATATAATGCCTTAATCGTAATGGAAAATCTTAACTATGGGTTCAAAAACTCAC GAATCAAAGTTGAAAAACAGGTTTATCAAAAATTCGAGACAGCATTGATTAAAAAGTTCAATTATATTA TTGATAAAAAAGATCCAGAAACCTATATACATGGTTACCAGCTTACAAATCCTATTACCACTCTGGATAA GATTGGAAATCAATCTGGAATAGTGCTGTATATTCCTGCGTGGAATACTTCTAAGATAGATCCCGTCACA GGATTTGTAAACCTTCTGTACGCAGATGATTTGAAGTATAAAAATCAGGAGCAGGCCAAATCATTCATT CAGAAAATAGACAACATATATTTTGAAAATGGAGAGTTTAAATTTGATATTGATTTTTCCAAATGGAATA ATCGCTACTCAATAAGTAAAACTAAATGGACGTTAACAAGTTATGGGACTCGCATCCAGACATTTAGAA ATCCCCAGAAAAACAATAAGTGGGATTCTGCTGAATATGATTTGACAGAAGAGTTTAAATTAATTTTAA ATATAGACGGAACGTTAAAGTCACAGGACGTAGAAACATACAAAAAATTCATGTCTTTATTTAAACTAA TGCTACAGCTTCGAAACTCTGTTACAGGAACCGACATTGATTATATGATCTCTCCTGTCACTGATAAAAC AGGAACACATTTCGATTCAAGAGAAAATATTAAAAATCTTCCTGCCGATGCAGATGCCAATGGTGCCTA CAACATTGCGCGCAAAGGAATAATGGCTATTGAAAATATAATGAACGGTATAAGCGATCCACTAAAAAT AAGCAACGAAGACTATTTAAAGTATATTCAGAATCAACAGGAATAA SEQ ATGACCCAATTTGAAGGTTTTACCAATTTATACCAAGTTTCGAAGACCCTTCGTTTTGAACTGATTCCCC ID AAGGAAAAACACTCAAACATATCCAGGAGCAAGGGTTCATTGAGGAGGATAAAGCTCGCAATGACCAT NO: TACAAAGAGTTAAAACCAATCATTGACCGCATCTATAAGACTTATGCTGATCAATGTCTCCAACTGGTAC 32 AGCTTGACTGGGAGAATCTATCTGCAGCCATAGACTCCTATCGTAAGGAAAAAACCGAAGAAACACGA AATGCGCTGATTGAGGAGCAAGCAACATATAGAAATGCGATTCATGACTACTTTATAGGTCGGACGGAT AATCTGACAGATGCCATAAATAAGCGCCATGCTGAAATCTATAAAGGACTTTTTAAAGCTGAACTTTTCA ATGGAAAAGTTTTAAAGCAATTAGGGACCGTAACCACGACAGAACATGAAAATGCTCTACTCCGTTCGT TTGACAAATTTACGACCTATTTTTCCGGCTTTTATGAAAACCGAAAAAATGTCTTTAGCGCTGAAGATAT CAGCACGGCAATTCCCCATCGAATCGTCCAGGACAATTTCCCTAAATTTAAGGAAAACTGCCATATTTTT ACAAGATTGATAACCGCAGTTCCTTCTTTGCGGGAGCATTTTGAAAATGTCAAAAAGGCCATTGGAATCT TTGTTAGTACGTCTATTGAAGAAGTCTTTTCCTTTCCCTTTTATAATCAACTTCTAACCCAAACGCAAATT GATCTTTATAATCAACTTCTCGGCGGCATATCTAGGGAAGCAGGCACAGAAAAAATCAAGGGACTTAAT GAAGTTCTCAATCTGGCTATCCAAAAAAATGATGAAACAGCCCATATAATCGCGTCCCTGCCGCATCGTT TTATTCCTCTTTTTAAACAAATTCTTTCCGATCGAAATACGTTATCCTTTATTTTGGAAGAATTCAAAAGC GATGAGGAAGTCATCCAATCCTTCTGCAAATATAAAACCCTCTTGAGAAACGAAAATGTACTGGAGACT GCAGAAGCCCTTTTCAATGAATTAAATTCCATTGATTTGACTCATATCTTTATTTCCCATAAAAAGTTAG AAACCATCTCTTCAGCGCTTTGTGACCATTGGGATACCTTGCGCAATGCACTTTACGAAAGACGGATTTC TGAACTCACTGGCAAAATAACAAAAAGTGCCAAAGAAAAAGTTCAAAGGTCATTAAAACATGAGGATA TAAATCTCCAAGAAATTATTTCTGCTGCAGGAAAAGAACTATCAGAAGCATTCAAACAAAAAACAAGTG AAATTCTTTCCCATGCCCATGCTGCACTTGACCAGCCTCTTCCCACAACATTAAAAAAACAGGAAGAAA AAGAAATCCTCAAATCACAGCTCGATTCGCTTTTAGGCCTTTATCATCTTCTTGATTGGTTTGCTGTCGAT GAAAGCAATGAAGTCGACCCAGAATTCTCAGCACGGCTGACAGGCATTAAACTAGAAATGGAACCAAG CCTTTCGTTTTATAATAAAGCAAGAAATTATGCGACAAAAAAGCCCTATTCGGTGGAAAAATTTAAATT GAATTTTCAAATGCCAACCCTTGCCTCTGGTTGGGATGTCAATAAAGAAAAAAATAATGGAGCTATTTTA TTCGTAAAAAATGGTCTCTATTACCTTGGTATCATGCCTAAACAGAAGGGGCGCTATAAAGCCCTGTCTT TTGAGCCGACAGAAAAAACATCAGAAGGATTCGATAAGATGTACTATGACTACTTCCCAGATGCCGCAA AAATGATTCCTAAGTGTTCCACTCAGCTAAAGGCTGTAACCGCTCATTTTCAAACTCATACCACCCCCAT TCTTCTCTCAAATAATTTCATTGAACCTCTTGAAATCACAAAAGAAATTTATGACCTGAACAATCCTGAA AAGGAGCCTAAAAAGTTTCAAACGGCTTATGCAAAGAAGACAGGCGATCAAAAAGGCTATAGAGAAGC GCTTTGCAAATGGATTGACTTTACGCGGGATTTTCTCTCTAAATATACGAAAACAACTTCAATCGATTTA TCTTCACTCCGCCCTTCTTCGCAATATAAAGATTTAGGGGAATATTACGCCGAACTGAATCCGCTTCTCT ATCATATCTCCTTCCAACGAATTGCTGAAAAGGAAATCATGGATGCTGTAGAAACGGGAAAATTGTATC TGTTCCAAATCTACAATAAGGATTTTGCGAAGGGCCATCACGGGAAACCAAATCTCCACACCCTGTATT GGACAGGTCTCTTCAGTCCTGAAAACCTTGCGAAAACCAGCATCAAACTTAATGGTCAAGCAGAATTGT TCTATCGACCTAAAAGCCGCATGAAGCGGATGGCCCATCGTCTTGGGGAAAAAATGCTGAACAAAAAAC TAAAGGACCAGAAGACACCGATTCCAGATACCCTCTACCAAGAACTGTACGATTATGTCAACCACCGGC TAAGCCATGATCTTTCCGATGAAGCAAGGGCCCTGCTTCCAAATGTTATCACCAAAGAAGTCTCCCATGA AATTATAAAGGATCGGCGGTTTACTTCCGATAAATTTTTCTTCCATGTTCCCATTACACTGAATTATCAAG CAGCCAATAGTCCCAGTAAATTCAACCAGCGTGTCAATGCCTACCTTAAGGAGCATCCGGAAACGCCCA TCATTGGTATCGATCGTGGAGAACGCAATCTAATCTATATTACCGTCATTGACAGTACTGGGAAAATTTT GGAGCAGCGTTCCCTGAATACCATCCAGCAATTTGACTACCAAAAAAAATTGGACAACAGGGAAAAAG AGCGTGTTGCCGCCCGTCAAGCCTGGTCCGTCGTCGGAACGATCAAAGACCTTAAACAAGGCTACTTGT CACAGGTCATCCATGAAATTGTAGACCTGATGATTCATTACCAAGCTGTTGTCGTCCTTGAAAACCTCAA CTTCGGATTTAAATCAAAACGGACAGGCATTGCCGAAAAAGCAGTCTACCAACAATTTGAAAAGATGCT AATAGATAAACTCAACTGTTTGGTTCTCAAAGATTATCCTGCTGAGAAAGTGGGAGGCGTCTTAAACCC GTATCAACTTACAGATCAGTTCACGAGCTTTGCAAAAATGGGCACGCAAAGCGGCTTCCTTTTCTATGTA CCGGCCCCTTATACCTCAAAGATTGATCCCCTGACTGGTTTTGTCGATCCCTTTGTATGGAAGACCATTA AAAATCATGAAAGTCGGAAGCATTTCCTAGAAGGATTTGATTTCCTGCATTATGATGTCAAAACAGGTG ATTTTATCCTCCATTTTAAAATGAATCGGAATCTCTCTTTCCAGAGAGGGCTTCCTGGCTTCATGCCAGCT TGGGATATTGTTTTCGAAAAGAATGAAACCCAATTTGATGCAAAAGGGACGCCCTTCATTGCAGGAAAA CGAATTGTTCCTGTAATCGAAAATCATCGTTTTACGGGTCGTTACAGAGACCTCTATCCCGCTAATGAAC TCATTGCCCTTCTGGAAGAAAAAGGCATTGTCTTTAGAGACGGAAGTAATATATTACCCAAACTTTTAGA AAATGATGATTCTCATGCAATTGATACGATGGTCGCCTTGATTCGCAGTGTACTCCAAATGAGAAACAG CAATGCCGCAACGGGGGAAGACTACATCAACTCTCCCGTTAGGGATCTGAACGGGGTGTGTTTCGACAG TCGATTCCAAAATCCAGAATGGCCAATGGATGCGGATGCCAACGGAGCTTATCATATTGCCTTAAAAGG GCAGCTTCTTCTGAACCACCTCAAAGAAAGCAAAGATCTGAAATTACAAAACGGCATCAGCAACCAAGA TTGGCTGGCCTACATTCAGGAACTGAGAAACTGA SEQ ATGGCCGTCAAATCCATCAAAGTGAAACTTCGTCTCGACGATATGCCGGAGATTCGGGCCGGTCTATGG ID AAACTTCATAAGGAAGTCAATGCGGGGGTTCGATATTACACGGAATGGCTCAGTCTTCTCCGTCAAGAG NO: AACTTGTATCGAAGAAGTCCGAATGGGGACGGAGAGCAAGAATGTGATAAGACTGCAGAAGAATGCAA 33 AGCCGAATTGTTGGAGCGGCTGCGCGCGCGTCAAGTGGAGAATGGACACCGTGGTCCGGCGGGATCGG ACGATGAATTGCTGCAGTTGGCGCGTCAACTCTATGAGTTGTTGGTTCCGCAGGCGATAGGTGCGAAAG GCGACGCGCAGCAAATTGCCCGCAAATTTTTGAGCCCCTTGGCCGACAAGGACGCAGTTGGTGGGCTTG GAATCGCGAAGGCGGGGAACAAACCGCGGTGGGTTCGCATGCGCGAAGCGGGGGAACCAGGCTGGGAA GAGGAGAAGGAGAAGGCTGAGACGAGGAAATCTGCGGATCGGACTGCGGATGTTTTGCGCGCGCTCGC GGATTTTGGGTTAAAGCCACTGATGCGCGTATACACCGATTCTGAGATGTCATCGGTGGAGTGGAAACC GCTTCGGAAGGGACAAGCCGTTCGGACGTGGGATAGGGACATGTTCCAACAAGCTATCGAACGGATGAT GTCGTGGGAGTCGTGGAATCAGCGCGTTGGGCAAGAGTACGCGAAACTCGTAGAACAAAAAAATCGAT TTGAGCAGAAGAATTTCGTCGGCCAGGAACATCTGGTCCATCTCGTCAATCAGTTGCAACAAGATATGA AAGAAGCATCGCCCGGACTCGAATCGAAAGAGCAAACCGCGCACTATGTGACGGGACGGGCATTGCGC GGATCGGACAAGGTATTTGAGAAGTGGGGGAAACTCGCCCCCGATGCACCTTTCGATTTGTACGACGCC GAAATCAAGAATGTGCAGAGACGTAACACGAGACGATTCGGATCACATGACTTGTTCGCAAAATTGGCA GAGCCAGAGTATCAGGCCCTGTGGCGCGAAGATGCTTCGTTTCTCACGCGTTACGCGGTGTACAACAGC ATCCTTCGCAAACTGAATCACGCCAAAATGTTCGCGACGTTTACTTTGCCGGATGCAACGGCGCACCCG ATTTGGACTCGCTTCGATAAATTGGGTGGGAATTTGCACCAGTACACCTTTTTGTTCAACGAATTTGGAG AACGCAGGCACGCGATTCGTTTTCACAAGCTATTGAAAGTCGAGAATGGTGTCGCAAGAGAAGTTGATG ATGTCACCGTGCCCATTTCAATGTCAGAGCAATTGGATAATCTGCTTCCCAGAGATCCCAATGAACCGAT TGCGCTATATTTTCGAGATTACGGAGCCGAACAGCATTTCACAGGTGAATTTGGTGGCGCGAAGATCCA GTGCCGCCGGGATCAGCTGGCTCATATGCACCGACGCAGAGGGGCGAGGGATGTTTATCTCAATGTCAG CGTACGTGTGCAGAGTCAGTCTGAGGCGCGGGGAGAACGTCGCCCGCCGTATGCGGCAGTATTTCGTCT GGTCGGGGACAACCATCGCGCGTTTGTCCATTTCGATAAACTATCGGATTATCTTGCGGAACATCCGGAT GATGGGAAGCTCGGGTCGGAGGGGTTGCTTTCCGGGCTGCGGGTGATGAGTGTCGATCTCGGCCTTCGC ACATCTGCATCGATTTCCGTTTTTCGCGTTGCCCGGAAGGACGAGTTGAAGCCGAACTCAAAAGGTCGTG TACCGTTTTTCTTTCCGATAAAAGGGAATGACAATCTCGTCGCGGTTCATGAGCGATCACAACTCTTGAA GCTGCCTGGCGAAACGGAGTCGAAGGACCTGCGTGCTATCCGAGAAGAACGCCAACGGACATTGCGGC AGTTGCGGACGCAACTGGCGTATTTGCGGCTGCTCGTGCGGTGTGGGTCGGAAGATGTGGGGCGGCGTG AACGGAGTTGGGCAAAGCTTATCGAGCAGCCGGTGGATGCGGCCAATCACATGACACCGGATTGGCGC GAGGCTTTTGAAAACGAACTTCAGAAGCTTAAGTCACTCCATGGTATCTGTAGCGACAAGGAATGGATG GATGCTGTCTACGAGAGCGTTCGCCGCGTGTGGCGTCACATGGGCAAACAGGTTCGCGATTGGCGAAAG GACGTACGAAGCGGAGAGCGGCCCAAGATTCGCGGCTATGCGAAAGACGTGGTCGGTGGAAACTCGAT TGAGCAAATCGAGTATCTGGAACGTCAGTACAAGTTCCTCAAGAGTTGGAGCTTCTTTGGTAAGGTGTC GGGACAAGTGATTCGTGCGGAGAAGGGATCTCGTTTTGCGATCACGCTGCGCGAACACATTGATCACGC GAAGGAAGATCGGCTGAAGAAATTGGCGGATCGCATCATTATGGAGGCTCTCGGCTATGTGTACGCGTT GGATGAGCGCGGCAAAGGAAAGTGGGTTGCGAAGTATCCGCCGTGCCAGCTCATCCTGCTGGAGGAATT GAGCGAGTACCAGTTCAATAACGACAGGCCTCCGAGCGAAAACAACCAGTTGATGCAATGGAGTCATC GCGGCGTGTTCCAGGAGTTGATAAATCAGGCCCAAGTCCATGATTTACTCGTTGGGACGATGTATGCAG CGTTCTCGTCGCGATTCGACGCGCGAACTGGGGCACCGGGTATCCGCTGTCGCCGGGTTCCGGCGCGTTG CACCCAGGAGCACAATCCAGAACCATTTCCTTGGTGGCTGAACAAGTTTGTGGTGGAACATACGTTGGA TGCTTGTCCCCTACGCGCAGACGACCTCATCCCAACGGGTGAAGGAGAGATTTTTGTCTCGCCGTTCAGC GCGGAGGAGGGGGACTTTCATCAGATTCACGCCGACCTGAATGCGGCGCAAAATCTGCAGCAGCGACTC TGGTCTGATTTTGATATCAGTCAAATTCGGTTGCGGTGTGATTGGGGTGAAGTGGACGGTGAACTCGTTC TGATCCCAAGGCTTACAGGAAAACGAACGGCGGATTCATATAGCAACAAGGTGTTTTATACCAATACAG GTGTCACCTATTATGAGCGAGAGCGGGGGAAGAAGCGGAGAAAGGTTTTCGCGCAAGAGAAATTGTCG GAGGAAGAGGCGGAGTTGCTCGTGGAAGCAGACGAGGCGAGGGAGAAATCGGTCGTTTTGATGCGTGA TCCGTCTGGCATCATCAATCGGGGAAATTGGACCAGGCAAAAGGAATTTTGGTCGATGGTGAACCAGCG GATCGAAGGATACTTGGTCAAGCAGATTCGCTCGCGCGTTCCATTACAAGATAGTGCGTGTGAAAACAC GGGGGATATTTAA SEQ ATGGCGACACGCAGTTTTATTTTAAAAATTGAACCAAATGAAGAAGTTAAAAAGGGATTATGGAAGACG ID CATGAGGTATTGAATCATGGAATTGCCTACTACATGAATATTCTGAAACTAATTAGACAGGAAGCTATTT NO: ATGAACATCATGAACAAGATCCTAAAAATCCGAAAAAAGTTTCAAAAGCAGAAATACAAGCCGAGTTA 34 TGGGATTTTGTTTTAAAAATGCAAAAATGTAATAGTTTTACACATGAAGTTGACAAAGATGTTGTTTTTA ACATCCTGCGTGAACTATATGAAGAGTTGGTCCCTAGTTCAGTCGAGAAAAAGGGTGAAGCCAATCAAT TATCGAATAAGTTTCTGTACCCGCTAGTTGATCCGAACAGTCAAAGTGGGAAAGGGACGGCATCATCCG GACGTAAACCTCGGTGGTATAATTTAAAAATAGCAGGCGACCCATCGTGGGAGGAAGAAAAGAAAAAA TGGGAAGAGGATAAAAAGAAAGATCCCCTTGCTAAAATCTTAGGTAAGTTAGCAGAATATGGGCTTATT CCGCTATTTATTCCATTTACTGACAGCAACGAACCAATTGTAAAAGAAATTAAATGGATGGAAAAAAGT CGTAATCAAAGTGTCCGGCGACTTGATAAGGATATGTTTATCCAAGCATTAGAGCGTTTTCTTTCATGGG AAAGCTGGAACCTTAAAGTAAAGGAAGAGTATGAAAAAGTTGAAAAGGAACACAAAACACTAGAGGA AAGGATAAAAGAGGACATTCAAGCATTTAAATCCCTTGAACAATATGAAAAAGAACGGCAGGAGCAAC TTCTTAGAGATACATTGAATACAAATGAATACCGATTAAGCAAAAGAGGATTACGTGGTTGGCGTGAAA TTATCCAAAAATGGCTAAAGATGGATGAAAATGAACCATCAGAAAAATATTTAGAAGTATTTAAAGATT ATCAACGGAAACATCCACGAGAAGCCGGGGACTATTCTGTCTATGAATTTTTAAGCAAGAAAGAAAATC ATTTTATTTGGCGAAATCATCCTGAATATCCTTATTTGTATGCTACATTTTGTGAAATTGACAAAAAAAA GAAAGACGCTAAGCAACAGGCAACTTTTACTTTGGCTGACCCGATTAACCATCCGTTATGGGTACGATTT GAAGAAAGAAGCGGTTCGAACTTAAACAAATATCGAATTTTAACAGAGCAATTACACACTGAAAAGTTA AAAAAGAAATTAACAGTTCAACTTGATCGTTTAATTTATCCAACTGAATCCGGCGGTTGGGAGGAAAAA GGTAAAGTAGATATCGTTTTGTTGCCGTCAAGACAATTTTATAATCAAATCTTCCTTGATATAGAAGAAA AGGGGAAACATGCTTTTACTTATAAGGATGAAAGTATTAAATTCCCCCTTAAAGGTACACTTGGTGGTGC AAGAGTGCAGTTTGACCGTGACCATTTGCGGAGATATCCGCATAAAGTAGAATCAGGAAATGTTGGACG GATTTATTTTAACATGACAGTAAATATTGAACCAACTGAGAGCCCTGTTAGTAAGTCTTTGAAAATACAT AGGGACGATTTCCCCAAGTTCGTTAATTTTAAACCGAAAGAGCTCACCGAATGGATAAAAGATAGTAAA GGGAAAAAATTAAAAAGTGGTATAGAATCCCTTGAAATTGGTCTACGGGTGATGAGTATCGACTTAGGT CAACGTCAAGCGGCTGCTGCATCGATTTTTGAAGTAGTTGATCAGAAACCGGATATTGAAGGGAAGTTA TTTTTTCCAATCAAAGGAACTGAGCTTTATGCTGTTCACCGGGCAAGTTTTAACATTAAATTACCGGGTG AAACATTAGTAAAATCACGGGAAGTATTGCGGAAAGCTCGGGAGGACAACTTAAAATTAATGAATCAA AAGTTAAACTTTCTAAGAAATGTTCTACATTTCCAACAGTTTGAAGATATCACAGAAAGAGAGAAGCGT GTAACTAAATGGATTTCTAGACAAGAAAATAGTGATGTTCCTCTTGTATATCAAGATGAGCTAATTCAAA TTCGTGAATTAATGTATAAACCCTATAAAGATTGGGTTGCCTTTTTAAAACAACTCCATAAACGGCTAGA AGTCGAGATTGGCAAAGAGGTTAAGCATTGGCGAAAATCATTAAGTGACGGGAGAAAAGGTCTTTACG GAATCTCCCTAAAAAATATTGATGAAATTGATCGAACAAGGAAATTCCTTTTAAGATGGAGCTTACGTC CAACAGAACCTGGGGAAGTAAGACGCTTGGAACCAGGACAGCGTTTTGCGATTGATCAATTAAACCACC TAAATGCATTAAAAGAAGATCGATTAAAAAAGATGGCAAATACGATTATCATGCATGCCTTAGGTTACT GTTATGATGTAAGAAAGAAAAAGTGGCAGGCAAAAAATCCAGCATGTCAAATTATTTTATTTGAAGATT TATCTAACTACAATCCTTACGAGGAAAGGTCCCGTTTTGAAAACTCAAAACTGATGAAGTGGTCACGGA GAGAAATTCCACGACAAGTCGCCTTACAAGGTGAAATTTACGGATTACAAGTTGGGGAAGTAGGTGCCC AATTCAGTTCAAGATTCCATGCGAAAACCGGGTCGCCGGGAATTCGTTGCAGTGTTGTAACGAAAGAAA AATTGCAGGATAATCGCTTTTTTAAAAATTTACAAAGAGAAGGACGACTTACTCTTGATAAAATCGCAG TTTTAAAAGAAGGAGACTTATATCCAGATAAAGGTGGAGAAAAGTTTATTTCTTTATCAAAGGATCGAA AGTTGGTAACTACGCATGCTGATATTAACGCGGCCCAAAATTTACAGAAGCGTTTTTGGACAAGAACAC ATGGATTTTATAAAGTTTACTGCAAAGCCTATCAGGTTGATGGACAAACTGTTTATATTCCGGAGAGCAA GGACCAAAAACAAAAAATAATTGAAGAATTTGGGGAAGGCTATTTTATTTTAAAAGATGGTGTATATGA ATGGGGTAATGCGGGGAAACTAAAAATTAAAAAAGGTTCCTCTAAACAATCATCGAGTGAATTAGTAGA TTCGGACATACTGAAAGATTCATTTGATTTAGCAAGTGAACTTAAGGGAGAGAAACTCATGTTATATCG AGATCCGAGTGGAAACGTATTTCCTTCCGACAAGTGGATGGCAGCAGGAGTATTTTTTGGCAAATTAGA AAGAATATTGATTTCTAAGTTAACAAATCAATACTCAATATCAACAATAGAAGATGATTCTTCAAAACA ATCAATGTAA SEQ ATGCCCACCCGCACCATCAATCTGAAACTTGTTCTTGGGAAAAATCCTGAAAACGCAACATTGCGACGC ID GCCCTATTTTCGACACACCGTTTGGTTAACCAAGCGACGAAACGTATTGAGGAATTCTTGTTGCTGTGTC NO: GTGGAGAAGCCTACAGAACAGTGGATAATGAGGGGAAGGAAGCCGAGATTCCACGTCATGCAGTCCAA 35 GAAGAAGCTCTTGCCTTTGCCAAAGCTGCTCAACGCCACAACGGCTGTATATCCACCTATGAAGACCAA GAGATTCTTGATGTACTGCGGCAACTGTACGAACGTCTTGTTCCTTCGGTCAACGAAAACAACGAGGCA GGCGATGCTCAAGCTGCTAACGCCTGGGTCAGTCCGCTCATGTCGGCAGAAAGCGAAGGAGGCTTGTCG GTCTACGACAAGGTGCTTGATCCACCGCCGGTTTGGATGAAGCTTAAAGAAGAAAAGGCTCCAGGATGG GAAGCCGCTTCTCAAATTTGGATTCAGAGTGATGAGGGACAGTCGTTACTTAATAAGCCAGGTAGCCCT CCCCGCTGGATTCGAAAACTGCGATCTGGGCAACCGTGGCAAGATGATTTCGTCAGTGACCAAAAGAAA AAGCAAGATGAGCTGACCAAAGGGAACGCACCACTTATAAAACAACTCAAAGAAATGGGGTTGTTGCC TCTTGTTAACCCATTTTTTAGACATCTTCTTGACCCTGAAGGTAAAGGCGTGAGTCCATGGGACCGTCTT GCTGTACGCGCTGCAGTGGCTCACTTTATCTCCTGGGAAAGTTGGAATCATAGAACACGTGCAGAATAC AATTCCTTGAAACTACGGCGAGACGAGTTTGAGGCAGCATCCGACGAATTCAAAGACGATTTTACTTTG CTCCGACAATATGAAGCCAAACGCCATAGTACATTGAAAAGCATCGCGCTGGCCGACGATTCGAACCCT TACCGGATTGGAGTACGTTCTCTGCGTGCCTGGAACCGCGTTCGTGAAGAATGGATAGACAAGGGTGCA ACAGAAGAACAACGCGTGACCATATTGTCAAAGCTTCAAACACAACTTCGGGGAAAATTCGGCGATCCC GATCTGTTCAACTGGCTAGCTCAGGATAGGCATGTCCATTTGTGGTCTCCTCGGGACAGCGTGACACCAT TGGTTCGCATCAATGCGGTAGATAAAGTTCTGCGTCGACGAAAACCGTATGCATTGATGACCTTTGCCCA TCCCCGCTTCCACCCTCGATGGATACTGTACGAGGCTCCAGGAGGAAGCAATCTCCGTCAATATGCATTG GATTGTACAGAAAACGCTCTACACATCACGTTGCCTTTGCTTGTCGACGATGCGCACGGAACCTGGATTG AAAAAAAGATCAGGGTGCCGCTGGCACCATCCGGACAAATTCAAGATTTAACTCTGGAAAAACTTGAGA AGAAAAAAAATCGTTTATACTACCGTTCCGGTTTTCAGCAGTTTGCCGGCTTGGCTGGCGGAGCTGAGGT TCTTTTCCACAGACCCTATATGGAACACGACGAACGCAGCGAGGAGTCTCTTTTGGAACGTCCGGGAGC CGTTTGGTTCAAATTGACCCTGGATGTGGCAACACAGGCTCCCCCGAACTGGCTTGATGGTAAGGGCCG TGTCCGTACACCGCCGGAGGTACATCATTTTAAAACCGCATTGTCGAATAAAAGCAAACATACACGTAC GCTGCAGCCGGGTCTCCGTGTCTTGTCAGTAGACTTGGGCATGCGAACATTCGCCTCCTGCTCAGTATTT GAACTCATCGAGGGAAAGCCTGAGACAGGCCGTGCCTTCCCTGTTGCCGATGAGAGATCAATGGACAGC CCGAATAAACTGTGGGCCAAGCATGAACGTAGTTTTAAACTGACGCTCCCCGGCGAAACCCCTTCTCGA AAGGAAGAGGAAGAGCGTAGCATAGCAAGAGCGGAAATTTATGCACTGAAACGCGACATACAACGCCT CAAAAGCCTACTCCGCTTAGGTGAAGAAGATAACGATAACCGTCGTGATGCATTGCTTGAACAGTTCTTT AAAGGATGGGGAGAAGAAGACGTTGTGCCTGGACAAGCGTTTCCACGCTCTCTTTTCCAAGGGTTGGGA GCTGCCCCGTTTCGCTCAACTCCAGAGTTATGGCGTCAGCATTGCCAAACATATTATGACAAAGCGGAA GCCTGTCTGGCTAAACATATCAGTGATTGGCGCAAGCGAACTCGTCCCCGTCCGACATCGCGGGAGATG TGGTACAAAACACGTTCCTATCATGGCGGCAAGTCCATTTGGATGTTGGAATATCTTGATGCCGTTCGAA AACTGCTTCTCAGTTGGAGCTTACGTGGTCGTACTTACGGTGCCATTAATCGCCAGGATACAGCCCGGTT TGGTTCTTTGGCATCACGGCTGCTCCACCATATCAATTCCCTAAAGGAAGACCGCATCAAAACAGGAGC CGACTCTATCGTTCAGGCTGCTCGCGGGTATATTCCTCTCCCTCATGGCAAGGGTTGGGAACAAAGATAT GAGCCTTGTCAGCTCATATTATTTGAAGACCTCGCACGATATCGCTTTCGCGTGGATCGACCTCGTCGAG AGAACAGCCAACTCATGCAGTGGAACCATCGAGCCATCGTGGCAGAAACAACGATGCAAGCCGAACTC TACGGACAAATTGTCGAAAATACTGCAGCGGGGTTCAGCAGTCGTTTTCACGCGGCGACAGGTGCCCCC GGTGTACGTTGTCGTTTTCTTCTAGAAAGAGACTTTGATAACGATTTGCCCAAACCGTACCTTCTCAGGG AACTTTCTTGGATGCTCGGCAATACAAAAGTCGAGTCTGAAGAAGAAAAGCTTCGATTGCTGTCTGAAA AAATCAGGCCAGGCAGTCTTGTTCCTTGGGATGGAGGCGAACAGTTCGCTACCCTGCATCCCAAAAGAC AAACACTTTGCGTCATTCATGCCGATATGAATGCTGCCCAAAATTTACAACGCCGGTTTTTCGGTCGATG CGGCGAGGCCTTTCGGCTTGTTTGTCAACCCCACGGTGACGACGTGTTACGACTCGCATCCACCCCAGGA GCTCGTCTTCTTGGAGCCCTGCAGCAGCTTGAAAATGGACAAGGAGCTTTCGAGTTGGTTCGAGACATG GGGTCAACAAGTCAAATGAACCGGTTCGTCATGAAGTCTTTGGGAAAAAAGAAAATAAAACCCCTTCAG GACAACAATGGAGACGACGAGCTTGAAGACGTGTTGTCCGTACTCCCGGAGGAAGACGACACAGGACG TATCACAGTCTTCCGCGATTCATCAGGAATCTTTTTTCCTTGCAACGTCTGGATACCGGCCAAACAGTTTT GGCCAGCAGTACGCGCCATGATTTGGAAGGTCATGGCTTCCCATTCTTTGGGGTGA SEQ ATGACAAAGTTAAGACACCGACAGAAAAAATTAACACACGACTGGGCTGGCTCCAAAAAGAGGGAAGT ID ATTAGGCTCAAATGGCAAGCTTCAGAATCCGTTGTTAATGCCGGTTAAAAAAGGTCAGGTTACTGAGTT NO: CCGGAAAGCGTTTTCTGCGTATGCTCGCGCAACGAAAGGAGAAATGACTGACGGCCGAAAGAATATGTT 36 TACGCATAGTTTCGAGCCATTTAAGACAAAGCCCTCGCTTCATCAGTGTGAATTGGCAGATAAAGCATAT CAATCTTTACATTCGTATCTGCCTGGTTCTCTTGCTCATTTTCTATTATCTGCTCACGCATTAGGTTTTCGT ATTTTTTCAAAATCTGGTGAAGCAACTGCATTCCAGGCATCCTCTAAAATTGAAGCTTACGAATCAAAAT TGGCAAGCGAATTAGCTTGTGTAGATTTATCTATTCAAAACTTGACTATTTCAACGCTTTTTAATGCGCTT ACAACGTCTGTAAGAGGGAAGGGCGAAGAAACTAGCGCTGACCCCTTAATTGCACGATTTTACACCTTA CTTACTGGCAAGCCTCTGTCTCGAGACACTCAAGGGCCTGAACGTGATTTAGCAGAAGTTATCTCGCGTA AGATAGCTAGTTCTTTTGGCACATGGAAAGAAATGACGGCAAACCCTCTTCAGTCATTACAATTTTTTGA AGAGGAACTCCATGCGCTGGATGCCAATGTCTCGCTCTCACCCGCCTTCGACGTTTTAATTAAAATGAAT GATTTGCAGGGCGATTTAAAAAATCGAACCATTGTTTTTGATCCTGACGCCCCTGTTTTTGAATATAACG CAGAAGACCCTGCCGACATAATTATTAAACTTACAGCTCGTTACGCTAAAGAAGCTGTCATCAAAAATC AAAACGTAGGAAATTACGTTAAAAACGCTATTACTACCACAAATGCCAATGGTCTTGGTTGGCTTTTGA ACAAAGGTTTGTCGTTACTCCCTGTCTCGACCGATGACGAATTGCTAGAGTTTATTGGCGTTGAACGATC TCATCCCTCATGCCATGCCTTAATTGAATTGATTGCACAATTAGAAGCCCCCGAGCTCTTTGAGAAGAAC GTATTTTCAGATACTCGTTCTGAAGTTCAAGGTATGATTGATTCAGCTGTTTCTAATCATATTGCTCGTCT TTCCAGCTCTAGAAATAGCTTGTCAATGGATAGTGAAGAATTAGAACGTTTAATCAAAAGCTTTCAGAT ACACACACCTCATTGCTCACTTTTTATTGGCGCCCAATCACTTTCACAGCAGTTAGAATCTTTGCCTGAA GCCCTTCAATCGGGCGTTAATTCAGCCGATATTTTACTAGGCTCTACTCAATATATGCTCACCAATTCTTT GGTTGAAGAGTCAATTGCAACTTATCAAAGAACACTTAATCGCATCAATTACTTGTCAGGTGTTGCAGGT CAGATTAACGGCGCAATAAAGCGAAAAGCGATAGATGGAGAAAAAATTCACTTGCCTGCAGCTTGGTC AGAGTTGATATCTTTACCATTTATAGGCCAGCCTGTTATAGATGTTGAAAGCGATTTAGCTCATCTAAAA AATCAATACCAAACACTTTCAAATGAGTTTGATACTCTTATATCTGCTTTGCAAAAGAATTTTGATTTGA ACTTTAATAAAGCGCTCCTTAATCGTACTCAGCATTTTGAAGCCATGTGTAGAAGCACTAAGAAAAACG CTTTATCCAAACCAGAGATCGTTTCCTATCGCGACCTGCTTGCTCGATTAACTTCTTGTTTGTATCGAGGC TCTTTAGTTTTGCGTCGTGCCGGCATTGAAGTGTTAAAAAAACATAAAATATTTGAGTCAAACAGCGAAC TTCGTGAACATGTTCATGAAAGAAAGCATTTCGTGTTTGTTAGTCCTCTAGATCGCAAAGCCAAGAAACT CCTTCGATTAACTGATTCGCGTCCAGACTTGTTACATGTTATTGATGAAATATTGCAGCACGATAATCTT GAAAACAAAGACCGCGAGTCACTTTGGCTAGTTCGCTCTGGTTATTTGCTTGCAGGACTTCCAGATCAAC TTTCTTCATCTTTTATTAACTTGCCTATCATTACTCAAAAAGGAGATAGACGCCTTATAGACCTGATTCAG TATGATCAAATTAATCGTGATGCTTTTGTTATGTTAGTGACCTCTGCATTCAAGTCTAATTTGTCTGGTCT GCAGTATCGTGCCAATAAGCAATCGTTCGTTGTTACTCGCACGCTAAGCCCTTATCTCGGCTCAAAACTT GTCTACGTACCCAAGGATAAAGATTGGTTAGTTCCTTCTCAAATGTTTGAAGGACGATTTGCTGACATTC TTCAATCAGATTATATGGTCTGGAAAGATGCCGGTCGTCTTTGTGTTATTGATACTGCAAAACACCTTTC TAATATAAAGAAGTCTGTATTTTCATCCGAAGAAGTTCTCGCTTTTTTAAGAGAACTCCCTCACCGCACA TTTATCCAGACCGAAGTTCGCGGCCTTGGCGTTAATGTCGATGGAATTGCATTTAATAATGGTGATATTC CGTCATTAAAAACCTTTTCAAATTGCGTTCAGGTAAAAGTTTCTCGGACTAATACATCCCTAGTTCAAAC ACTTAATCGTTGGTTTGAAGGAGGAAAAGTTTCTCCTCCGAGCATTCAATTTGAACGGGCGTATTATAAA AAAGACGATCAAATTCATGAAGACGCAGCGAAAAGAAAGATACGATTCCAGATGCCCGCAACTGAGTT GGTTCATGCTTCTGACGATGCGGGGTGGACACCAAGTTATTTGCTCGGCATTGATCCTGGCGAGTATGGA ATGGGTCTTTCATTGGTTTCGATTAATAACGGAGAAGTCTTAGATTCAGGCTTTATTCATATTAATTCTCT GATCAATTTTGCCTCTAAAAAGAGCAACCATCAAACTAAGGTTGTTCCGCGTCAGCAGTACAAATCTCCT TATGCAAATTATTTAGAACAATCTAAAGATTCTGCTGCTGGTGATATTGCGCATATACTCGATCGACTTA TATACAAATTAAATGCGTTGCCTGTTTTTGAGGCTCTTTCAGGTAATTCTCAGAGTGCTGCTGATCAAGTT TGGACGAAAGTCTTATCGTTTTACACTTGGGGTGATAATGACGCTCAGAATTCTATTAGAAAGCAGCATT GGTTTGGAGCCAGTCATTGGGATATCAAAGGTATGTTAAGGCAACCCCCTACGGAGAAGAAGCCTAAAC CGTATATTGCTTTTCCTGGCTCTCAGGTTTCTTCGTATGGTAATTCCCAACGTTGCTCTTGCTGCGGTCGC AATCCTATTGAACAACTTCGAGAAATGGCAAAGGATACCTCTATTAAAGAGCTAAAAATTCGCAATTCT GAGATACAGCTTTTTGACGGAACCATTAAATTATTTAATCCAGACCCATCCACTGTGATAGAGAGAAGG CGACATAATCTTGGTCCATCAAGAATTCCTGTTGCTGACCGTACTTTCAAAAACATCAGTCCATCAAGTC TAGAATTTAAAGAATTGATTACTATCGTGTCTCGATCTATCCGTCATTCACCTGAGTTTATCGCTAAAAA ACGCGGCATAGGGTCTGAGTATTTTTGCGCTTATTCCGATTGCAACTCATCCTTAAATTCTGAAGCTAAC GCAGCTGCTAACGTAGCGCAAAAATTTCAAAAACAGTTATTTTTTGAGTTATAA SEQ ATGAAGAGAATTCTGAACAGTCTGAAAGTTGCTGCCTTGAGACTTCTGTTTCGAGGCAAAGGTTCTGAAT ID TAGTGAAGACAGTCAAATATCCATTGGTTTCCCCGGTTCAAGGCGCGGTTGAAGAACTTGCTGAAGCAA NO: TTCGGCACGACAACCTGCACCTTTTTGGGCAGAAGGAAATAGTGGATCTTATGGAGAAAGACGAAGGAA 37 CCCAGGTGTATTCGGTTGTGGATTTTTGGTTGGATACCCTGCGTTTAGGGATGTTTTTCTCACCATCAGCG AATGCGTTGAAAATCACGCTGGGAAAATTCAATTCTGATCAGGTTTCACCTTTTCGTAAGGTTTTGGAGC AGTCACCTTTTTTTCTTGCGGGTCGCTTGAAGGTTGAACCTGCGGAAAGGATACTTTCTGTTGAAATCAG AAAGATTGGTAAAAGAGAAAACAGAGTTGAGAACTATGCCGCCGATGTGGAGACATGCTTCATTGGTCA GCTTTCTTCAGATGAGAAACAGAGTATCCAGAAGCTGGCAAATGATATCTGGGATAGCAAGGATCATGA GGAACAGAGAATGTTGAAGGCGGATTTTTTTGCTATACCTCTTATAAAAGACCCCAAAGCTGTCACAGA AGAAGATCCTGAAAATGAAACGGCGGGAAAACAGAAACCGCTTGAATTATGTGTTTGTCTTGTTCCTGA GTTGTATACCCGAGGTTTCGGCTCCATTGCTGATTTTCTGGTTCAGCGACTTACCTTGCTGCGTGACAAA ATGAGTACCGACACGGCGGAAGATTGCCTCGAGTATGTTGGCATTGAGGAAGAAAAAGGCAATGGAAT GAATTCCTTGCTCGGCACTTTTTTGAAGAACCTGCAGGGTGATGGTTTTGAACAGATTTTTCAGTTTATGC TTGGGTCTTATGTTGGCTGGCAGGGGAAGGAAGATGTACTGCGCGAACGATTGGATTTGCTGGCCGAAA AAGTCAAAAGATTACCAAAGCCAAAATTTGCCGGAGAATGGAGTGGTCATCGTATGTTTCTCCATGGTC AGCTGAAAAGCTGGTCGTCGAATTTCTTCCGTCTTTTTAATGAGACGCGGGAACTTCTGGAAAGTATCAA GAGTGATATTCAACATGCCACCATGCTCATTAGCTATGTGGAAGAGAAAGGAGGCTATCATCCACAGCT GTTGAGTCAGTATCGGAAGTTAATGGAACAATTACCGGCGTTGCGGACTAAGGTTTTGGATCCTGAGAT TGAGATGACGCATATGTCCGAGGCTGTTCGAAGTTACATTATGATACACAAGTCTGTAGCGGGATTTCTG CCGGATTTACTCGAGTCTTTGGATCGAGATAAGGATAGGGAATTTTTGCTTTCCATCTTTCCTCGTATTCC AAAGATAGATAAGAAGACGAAAGAGATCGTTGCATGGGAGCTACCGGGCGAGCCAGAGGAAGGCTATT TGTTCACAGCAAACAACCTTTTCCGGAATTTTCTTGAGAATCCGAAACATGTGCCACGATTTATGGCAGA GAGGATTCCCGAGGATTGGACGCGTTTGCGCTCGGCCCCTGTGTGGTTTGATGGGATGGTGAAGCAATG GCAGAAGGTGGTGAATCAGTTGGTTGAATCTCCAGGCGCCCTTTATCAGTTCAATGAAAGTTTTTTGCGT CAAAGACTGCAAGCAATGCTTACGGTCTATAAGCGGGATCTCCAGACTGAGAAGTTTCTGAAGCTGCTG GCTGATGTCTGTCGTCCACTCGTTGATTTTTTCGGACTTGGAGGAAATGATATTATCTTCAAGTCATGTCA GGATCCAAGAAAGCAATGGCAGACTGTTATTCCACTCAGTGTCCCAGCGGATGTTTATACAGCATGTGA AGGCTTGGCTATTCGTCTCCGCGAAACTCTTGGATTCGAATGGAAAAATCTGAAAGGACACGAGCGGGA AGATTTTTTACGGCTGCATCAGTTGCTGGGAAATCTGCTGTTCTGGATCAGGGATGCGAAACTTGTCGTG AAGCTGGAAGACTGGATGAACAATCCTTGTGTTCAGGAGTATGTGGAAGCACGAAAAGCCATTGATCTT CCCTTGGAGATTTTCGGATTTGAGGTGCCGATTTTTCTCAATGGCTATCTCTTTTCGGAACTGCGCCAGCT GGAATTGTTGCTGAGGCGTAAGTCGGTGATGACGTCTTACAGCGTCAAAACGACAGGCTCGCCAAATAG GCTCTTCCAGTTGGTTTACCTACCTCTAAACCCTTCAGATCCGGAAAAGAAAAATTCCAACAACTTTCAG GAGCGCCTCGATACACCTACCGGTTTGTCGCGTCGTTTTCTGGATCTTACGCTGGATGCATTTGCTGGCA AACTCTTGACGGATCCGGTAACTCAGGAACTGAAGACGATGGCCGGTTTTTACGATCATCTCTTTGGCTT CAAGTTGCCGTGTAAACTGGCGGCGATGAGTAACCATCCAGGATCCTCTTCCAAAATGGTGGTTCTGGC AAAACCAAAGAAGGGTGTTGCTAGTAACATCGGCTTTGAACCTATTCCCGATCCTGCTCATCCTGTGTTC CGGGTGAGAAGTTCCTGGCCGGAGTTGAAGTACCTGGAGGGGTTGTTGTATCTTCCCGAAGATACACCA CTGACCATTGAACTGGCGGAAACGTCGGTCAGTTGTCAGTCTGTGAGTTCAGTCGCTTTCGATTTGAAGA ATCTGACGACTATCTTGGGTCGTGTTGGTGAATTCAGGGTGACGGCAGATCAACCTTTCAAGCTGACGCC CATTATTCCTGAGAAAGAGGAATCCTTCATCGGGAAGACCTACCTCGGTCTTGATGCTGGAGAGCGATC TGGCGTTGGTTTCGCGATTGTGACGGTTGACGGCGATGGGTATGAGGTGCAGAGGTTGGGTGTGCATGA AGATACTCAGCTTATGGCGCTTCAGCAAGTCGCCAGCAAGTCTCTTAAGGAGCCGGTTTTCCAGCCACTC CGTAAGGGCACATTTCGTCAGCAGGAGCGCATTCGCAAAAGCCTCCGCGGTTGCTACTGGAATTTCTATC ATGCATTGATGATCAAGTACCGAGCTAAAGTTGTGCATGAGGAATCGGTGGGTTCATCCGGTCTGGTGG GGCAGTGGCTGCGTGCATTTCAGAAGGATCTCAAAAAGGCTGATGTTCTGCCCAAGAAGGGTGGAAAAA ATGGTGTAGACAAAAAAAAGAGAGAAAGCAGCGCTCAGGATACCTTATGGGGAGGAGCTTTCTCGAAG AAGGAAGAGCAGCAGATAGCCTTTGAGGTTCAGGCAGCTGGATCAAGCCAGTTTTGTCTGAAGTGTGGT TGGTGGTTTCAGTTGGGGATGCGGGAAGTAAATCGTGTGCAGGAGAGTGGCGTGGTGCTGGACTGGAAC CGGTCCATTGTAACCTTCCTCATCGAATCCTCAGGAGAAAAGGTATATGGTTTCAGTCCTCAGCAACTGG AAAAAGGCTTTCGTCCTGACATCGAAACGTTCAAAAAAATGGTAAGGGATTTTATGAGACCCCCCATGT TTGATCGCAAAGGTCGGCCGGCCGCGGCGTATGAAAGATTCGTACTGGGACGTCGTCACCGTCGTTATC GCTTTGATAAAGTTTTTGAAGAGAGATTTGGTCGCAGTGCTCTTTTCATCTGCCCGCGGGTCGGGTGTGG GAATTTCGATCACTCCAGTGAGCAGTCAGCCGTTGTCCTTGCCCTTATTGGTTACATTGCTGATAAGGAA GGGATGAGTGGTAAGAAGCTTGTTTATGTGAGGCTGGCTGAACTTATGGCTGAGTGGAAGCTGAAGAAA CTGGAGAGATCAAGGGTGGAAGAACAGAGCTCGGCACAATAA SEQ ATGGCAGAAAGCAAGCAGATGCAATGCCGCAAGTGCGGCGCAAGCATGAAGTATGAAGTAATTGGATT ID GGGCAAGAAGTCATGCAGATATATGTGCCCAGATTGCGGCAATCACACCAGCGCGCGCAAGATTCAGA NO: ACAAGAAAAAGCGCGACAAAAAGTATGGATCCGCAAGCAAAGCGCAGAGCCAGAGGATAGCTGTGGCT 38 GGCGCGCTTTATCCAGACAAAAAAGTGCAGACCATAAAGACCTACAAATACCCAGCGGATCTTAATGGC GAAGTTCATGACAGCGGCGTCGCAGAGAAGATTGCGCAGGCGATTCAGGAAGATGAGATCGGCCTGCTT GGCCCGTCCAGCGAATACGCTTGCTGGATTGCTTCACAAAAACAGAGCGAGCCGTATTCAGTTGTAGAT TTTTGGTTTGACGCGGTGTGCGCAGGCGGAGTATTCGCGTATTCTGGCGCGCGCCTGCTTTCCACAGTCC TCCAGTTGAGTGGCGAGGAAAGCGTTTTGCGCGCTGCTTTAGCATCTAGCCCGTTTGTAGATGACATTAA TTTGGCGCAAGCGGAAAAGTTCCTAGCCGTTAGCCGGCGCACAGGCCAAGATAAGCTAGGCAAGCGCAT TGGAGAATGTTTTGCGGAAGGCCGGCTTGAAGCGCTTGGCATCAAAGATCGCATGCGCGAATTCGTGCA AGCGATTGATGTGGCCCAAACCGCGGGCCAGCGGTTCGCGGCCAAGCTAAAGATATTCGGCATCAGTCA GATGCCTGAAGCCAAGCAATGGAACAATGATTCCGGGCTCACTGTATGTATTTTGCCGGATTATTATGTC CCGGAAGAAAACCGCGCGGACCAGCTGGTTGTTTTGCTTCGGCGCTTACGCGAGATCGCGTATTGCATG GGAATTGAGGATGAAGCAGGATTTGAGCATCTAGGCATTGACCCTGGTGCTCTTTCCAATTTTTCCAATG GCAATCCAAAGCGAGGATTTCTCGGCCGCCTGCTCAATAATGACATTATAGCGCTGGCAAACAACATGT CAGCCATGACGCCGTATTGGGAAGGCAGAAAAGGCGAGTTGATTGAGCGCCTTGCATGGCTTAAACATC GCGCTGAAGGATTGTATTTGAAAGAGCCACATTTCGGCAACTCCTGGGCAGACCACCGCAGCAGGATTT TCAGTCGCATTGCGGGCTGGCTTTCCGGATGCGCGGGCAAGCTCAAGATTGCCAAGGATCAGATTTCAG GCGTGCGTACGGATTTGTTTCTGCTCAAGCGCCTTCTGGATGCGGTACCGCAAAGCGCGCCGTCGCCGGA CTTTATTGCTTCCATCAGCGCGCTGGATCGGTTTTTGGAAGCGGCAGAAAGCAGCCAGGATCCGGCAGA ACAGGTACGCGCTTTGTACGCGTTTCATCTGAACGCGCCTGCGGTCCGATCCATCGCCAACAAGGCGGT ACAGAGGTCTGATTCCCAGGAGTGGCTTATCAAGGAACTGGATGCTGTAGATCACCTTGAATTCAACAA AGCATTTCCGTTTTTTTCGGATACAGGAAAGAAAAAGAAGAAAGGAGCGAATAGCAACGGAGCGCCTTC TGAAGAAGAATACACGGAAACAGAATCCATTCAACAACCAGAAGATGCAGAGCAGGAAGTGAATGGTC AAGAAGGAAATGGCGCTTCAAAGAACCAGAAAAAGTTTCAGCGCATTCCTCGATTTTTCGGGGAAGGGT CAAGGAGTGAGTATCGAATTTTAACAGAAGCGCCGCAATATTTTGACATGTTCTGCAATAATATGCGCG CGATCTTTATGCAGCTAGAGAGTCAGCCGCGCAAGGCGCCTCGTGATTTCAAATGCTTTCTGCAGAATCG TTTGCAGAAGCTTTACAAGCAAACCTTTCTCAATGCTCGCAGTAATAAATGCCGCGCGCTTCTGGAATCC GTCCTTATTTCATGGGGAGAATTTTATACTTATGGCGCGAATGAAAAGAAGTTTCGTCTGCGCCATGAAG CGAGCGAGCGCAGCTCGGATCCGGACTATGTGGTTCAGCAGGCATTGGAAATCGCGCGCCGGCTTTTCT TGTTCGGATTTGAGTGGCGCGATTGCTCTGCTGGAGAGCGCGTGGATTTGGTTGAAATCCACAAAAAAG CAATCTCATTTTTGCTTGCAATCACTCAGGCCGAGGTTTCAGTTGGTTCCTATAACTGGCTTGGGAATAG CACCGTGAGCCGGTATCTTTCGGTTGCTGGCACAGACACATTGTACGGCACTCAACTGGAGGAGTTTTTG AACGCCACAGTGCTTTCACAGATGCGTGGGCTGGCGATTCGGCTTTCATCTCAGGAGTTAAAAGACGGA TTTGATGTTCAGTTGGAGAGTTCGTGCCAGGACAATCTCCAGCATCTGCTGGTGTATCGCGCTTCGCGCG ACTTGGCTGCGTGCAAACGCGCTACATGCCCGGCTGAATTGGATCCGAAAATTCTTGTTCTGCCGGTTGG TGCGTTTATCGCGAGCGTAATGAAAATGATTGAGCGTGGCGATGAACCATTAGCAGGCGCGTATTTGCG TCATCGGCCGCATTCATTCGGCTGGCAGATACGGGTTCGTGGAGTGGCGGAAGTAGGCATGGATCAGGG CACAGCGCTAGCATTCCAGAAGCCGACTGAATCAGAGCCGTTTAAAATAAAGCCGTTTTCCGCTCAATA CGGCCCAGTACTTTGGCTTAATTCTTCATCCTATAGCCAGAGCCAGTATCTGGATGGATTTTTAAGCCAG CCAAAGAATTGGTCTATGCGGGTGCTACCTCAAGCCGGATCAGTGCGCGTGGAACAGCGCGTTGCTCTG ATATGGAATTTGCAGGCAGGCAAGATGCGGCTGGAGCGCTCTGGAGCGCGCGCGTTTTTCATGCCAGTG CCATTCAGCTTCAGGCCGTCTGGTTCAGGAGATGAAGCAGTATTGGCGCCGAATCGGTACTTGGGACTTT TTCCGCATTCCGGAGGAATAGAATACGCGGTGGTGGATGTATTAGATTCCGCGGGTTTCAAAATTCTTGA GCGCGGTACGATTGCGGTAAATGGCTTTTCCCAGAAGCGCGGCGAACGCCAAGAGGAGGCACACAGAG AAAAACAGAGACGCGGAATTTCTGATATAGGCCGCAAGAAGCCGGTGCAAGCTGAAGTTGACGCAGCC AATGAATTGCACCGCAAATACACCGATGTTGCCACTCGTTTAGGGTGCAGAATTGTGGTTCAGTGGGCG CCCCAGCCAAAGCCGGGCACAGCGCCGACCGCGCAAACAGTATACGCGCGCGCAGTGCGGACCGAAGC GCCGCGATCTGGAAATCAAGAGGATCATGCTCGTATGAAATCCTCTTGGGGATATACCTGGGGCACCTA TTGGGAGAAGCGCAAACCAGAGGATATTTTGGGCATCTCAACCCAAGTATACTGGACCGGCGGTATAGG CGAGTCATGTCCCGCAGTCGCGGTTGCGCTTTTGGGGCACATTAGGGCAACATCCACTCAAACTGAATG GGAAAAAGAGGAGGTTGTATTCGGTCGACTGAAGAAGTTCTTTCCAAGCTAG SEQ ATGGAAAAGAGAATAAACAAGATACGAAAGAAACTATCGGCCGATAATGCCACAAAGCCTGTGAGCAG ID GAGCGGCCCCATGAAAACACTCCTTGTCCGGGTCATGACGGACGACTTGAAAAAAAGACTGGAGAAGC NO: GTCGGAAAAAGCCGGAAGTTATGCCGCAGGTTATTTCAAATAACGCAGCAAACAATCTTAGAATGCTCC 39 TTGATGACTATACAAAGATGAAGGAGGCGATACTACAAGTTTACTGGCAGGAATTTAAGGACGACCATG TGGGCTTGATGTGCAAATTTGCCCAGCCTGCTTCCAAAAAAATTGACCAGAACAAACTAAAACCGGAAA TGGATGAAAAAGGAAATCTAACAACTGCCGGTTTTGCATGTTCTCAATGCGGTCAGCCGCTATTTGTTTA TAAGCTTGAACAGGTGAGTGAAAAAGGCAAGGCTTATACAAATTACTTCGGCCGGTGTAATGTGGCCGA GCATGAGAAATTGATTCTTCTTGCTCAATTAAAACCTGAAAAAGACAGTGACGAAGCAGTGACATACTC CCTTGGCAAATTCGGCCAGAGGGCATTGGACTTTTATTCAATCCACGTAACAAAAGAATCCACCCATCC AGTAAAGCCCCTGGCACAGATTGCGGGCAACCGCTATGCAAGCGGACCTGTTGGCAAGGCCCTTTCCGA TGCCTGTATGGGCACTATAGCCAGTTTTCTTTCGAAATATCAAGACATCATCATAGAACATCAAAAGGTT GTGAAGGGTAATCAAAAGAGGTTAGAGAGTCTCAGGGAATTGGCAGGGAAAGAAAATCTTGAGTACCC ATCGGTTACACTGCCGCCGCAGCCGCATACGAAAGAAGGGGTTGACGCTTATAACGAAGTTATTGCAAG GGTACGTATGTGGGTTAATCTTAATCTGTGGCAAAAGCTGAAGCTCAGCCGTGATGACGCAAAACCGCT ACTGCGGCTAAAAGGATTCCCATCTTTCCCTGTTGTGGAGCGGCGTGAAAACGAAGTTGACTGGTGGAA TACGATTAATGAAGTAAAAAAACTGATTGACGCTAAACGAGATATGGGACGGGTATTCTGGAGCGGCGT TACCGCAGAAAAGAGAAATACCATCCTTGAAGGATACAACTATCTGCCAAATGAGAATGACCATAAAA AGAGAGAGGGCAGTTTGGAAAACCCTAAGAAGCCTGCCAAACGCCAGTTTGGAGACCTCTTGCTGTATC TTGAAAAGAAATATGCCGGAGACTGGGGAAAGGTCTTCGATGAGGCATGGGAGAGGATAGATAAGAAA ATAGCCGGACTCACAAGCCATATAGAGCGCGAAGAAGCAAGAAACGCGGAAGACGCTCAATCCAAAGC CGTACTTACAGACTGGCTAAGGGCAAAGGCATCATTTGTTCTTGAAAGACTGAAGGAAATGGATGAAAA GGAATTCTATGCGTGTGAAATCCAACTTCAAAAATGGTATGGCGATCTTCGAGGCAACCCGTTTGCCGTT GAAGCTGAGAATAGAGTTGTTGATATAAGCGGGTTTTCTATCGGAAGCGATGGCCATTCAATCCAATAC AGAAATCTCCTTGCCTGGAAATATCTGGAGAACGGCAAGCGTGAATTCTATCTGTTAATGAATTATGGC AAGAAAGGGCGCATCAGATTTACAGATGGAACAGATATTAAAAAGAGCGGCAAATGGCAGGGACTATT ATATGGCGGTGGCAAGGCAAAGGTTATTGATCTGACTTTCGACCCCGATGATGAACAGTTGATAATCCT GCCGCTGGCCTTTGGCACAAGGCAAGGCCGCGAGTTTATCTGGAACGATTTGCTGAGTCTTGAAACAGG CCTGATAAAGCTCGCAAACGGAAGAGTTATCGAAAAAACAATCTATAACAAAAAAATAGGGCGGGATG AACCGGCTCTATTCGTTGCCTTAACATTTGAGCGCCGGGAAGTTGTTGATCCATCAAATATAAAGCCTGT AAACCTTATAGGCGTTGACCGCGGCGAAAACATCCCGGCGGTTATTGCATTGACAGACCCTGAAGGTTG TCCTTTACCGGAATTCAAGGATTCATCAGGGGGCCCAACAGACATCCTGCGAATAGGAGAAGGATATAA GGAAAAGCAGAGGGCTATTCAGGCAGCAAAGGAGGTAGAGCAAAGGCGGGCTGGCGGTTATTCACGGA AGTTTGCATCCAAGTCGAGGAACCTGGCGGACGACATGGTGAGAAATTCAGCGCGAGACCTTTTTTACC ATGCCGTTACCCACGATGCCGTCCTTGTCTTTGAAAACCTGAGCAGGGGTTTTGGAAGGCAGGGCAAAA GGACCTTCATGACGGAAAGACAATATACAAAGATGGAAGACTGGCTGACAGCGAAGCTCGCATACGAA GGTCTTACGTCAAAAACCTACCTTTCAAAGACGCTGGCGCAATATACGTCAAAAACATGCTCCAACTGC GGGTTTACTATAACGACTGCCGATTATGACGGGATGTTGGTAAGGCTTAAAAAGACTTCTGATGGATGG GCAACTACCCTCAACAACAAAGAATTAAAAGCCGAAGGCCAGATAACGTATTATAACCGGTATAAAAG GCAAACCGTGGAAAAAGAACTCTCCGCAGAGCTTGACAGGCTTTCAGAAGAGTCGGGCAATAATGATAT TTCTAAGTGGACCAAGGGTCGCCGGGACGAGGCATTATTTTTGTTAAAGAAAAGATTCAGCCATCGGCC TGTTCAGGAACAGTTTGTTTGCCTCGATTGCGGCCATGAAGTCCACGCCGATGAACAGGCAGCCTTGAAT ATTGCAAGGTCATGGCTTTTTCTAAACTCAAATTCAACAGAATTCAAAAGTTATAAATCGGGTAAACAG CCCTTCGTTGGTGCTTGGCAGGCCTTTTACAAAAGGAGGCTTAAAGAGGTATGGAAGCCCAACGCC SEQ ATGAAAAGGATAAATAAAATACGAAGGAGATTGGTAAAGGATAGCAACACGAAAAAAGCCGGCAAAA ID CCGGCCCTATGAAAACCTTGCTCGTTCGGGTTATGACACCTGACCTGAGAGAAAGGTTAGAGAATCTTC NO: GCAAAAAGCCGGAAAACATTCCTCAGCCCATTTCAAATACTTCACGTGCAAATTTAAATAAACTCCTCA 40 CTGACTATACGGAAATGAAGAAAGCAATCCTGCATGTTTATTGGGAAGAGTTCCAAAAAGACCCTGTCG GATTGATGAGCAGGGTTGCACAACCAGCGCCCAAGAATATTGATCAGAGAAAATTGATTCCGGTGAAGG ACGGAAATGAGAGACTAACAAGTTCTGGATTTGCCTGTTCTCAGTGCTGTCAACCCCTCTATGTTTATAA GCTTGAACAAGTGAATGACAAGGGTAAGCCCCATACAAATTACTTTGGCCGTTGTAATGTCTCCGAGCA TGAACGTTTGATATTGCTCTCGCCGCATAAACCGGAGGCAAATGACGAGCTAGTAACGTATTCGTTGGG GAAGTTCGGTCAAAGGGCATTGGACTTTTATTCAATCCACGTAACAAGAGAATCGAACCATCCTGTAAA GCCGCTAGAACAGATCGGTGGCAATAGCTGCGCAAGTGGTCCCGTTGGTAAGGCTTTATCTGATGCCTG TATGGGAGCAGTAGCCAGTTTCCTTACAAAGTACCAGGACATCATCCTCGAACACCAAAAGGTTATAAA AAAAAACGAAAAGAGATTGGCAAATCTAAAGGATATAGCAAGTGCAAACGGGCTTGCATTTCCTAAAA TCACTCTTCCACCGCAACCGCATACAAAAGAAGGGATTGAAGCTTATAACAATGTTGTTGCTCAGATAG TGATCTGGGTAAACCTGAATCTTTGGCAGAAACTCAAAATTGGCAGGGATGAGGCAAAGCCCTTACAGC GGCTTAAGGGTTTTCCGTCCTTCCCTCTTGTTGAACGCCAGGCGAATGAGGTTGATTGGTGGGATATGGT CTGTAATGTCAAAAAGTTGATTAACGAAAAGAAAGAGGACGGGAAGGTCTTCTGGCAAAATCTTGCTGG ATATAAAAGGCAGGAAGCCTTGCTTCCATATCTTTCGTCTGAAGAAGACCGTAAAAAAGGAAAAAAGTT TGCGCGTTATCAGTTTGGTGACCTTTTGCTTCACCTTGAAAAGAAACACGGTGAAGATTGGGGCAAAGTT TATGATGAGGCATGGGAAAGAATAGATAAAAAAGTTGAAGGTCTGAGTAAGCACATAAAGTTGGAGGA AGAAAGAAGGTCTGAAGATGCTCAATCAAAGGCTGCCCTCACTGATTGGCTCAGGGCAAAGGCCTCTTT TGTTATTGAAGGGCTCAAAGAAGCTGATAAGGATGAGTTTTGCAGGTGTGAGTTAAAGCTTCAAAAGTG GTATGGAGATTTGAGAGGAAAACCATTTGCTATAGAAGCAGAGAACAGCATTTTAGATATAAGCGGATT TTCTAAACAGTATAATTGTGCATTTATATGGCAGAAAGACGGCGTAAAGAAGTTAAATCTTTATTTAATA ATAAATTACTTCAAAGGTGGTAAGCTACGCTTCAAAAAAATCAAGCCAGAAGCTTTTGAAGCAAATAGG TTTTATACAGTAATTAATAAAAAAAGCGGTGAGATTGTGCCTATGGAGGTCAACTTCAATTTTGATGACC CGAATTTGATAATTCTGCCTTTGGCCTTTGGAAAAAGGCAGGGGAGGGAGTTTATCTGGAACGACCTATT GAGCCTTGAGACGGGTTCATTGAAACTCGCCAATGGCAGGGTTATTGAAAAAACGCTCTATAACAGAAG GACGAGACAGGATGAACCAGCACTTTTTGTTGCCCTGACATTTGAAAGAAGAGAGGTGCTTGACTCATC GAATATAAAACCGATGAATCTGATAGGAATAGACCGGGGAGAAAATATCCCGGCAGTCATAGCATTAA CAGACCCGGAAGGATGCCCCTTGTCAAGATTCAAAGATTCATTGGGCAATCCAACGCATATTTTGCGAA TAGGAGAAAGTTATAAGGAAAAACAACGGACTATTCAGGCTGCTAAAGAAGTTGAACAAAGGCGGGCA GGCGGATATTCGAGAAAATATGCATCAAAGGCGAAGAATCTGGCGGACGATATGGTAAGAAATACAGC TCGTGACCTCTTATATTATGCTGTTACTCAAGATGCAATGCTCATTTTTGAAAATCTTTCCCGCGGTTTTG GTAGACAAGGCAAGAGGACTTTTATGGCGGAAAGGCAGTACACGAGGATGGAAGACTGGCTGACTGCA AAGCTTGCCTATGAAGGTCTGCCATCAAAAACCTATCTTTCAAAGACTCTGGCACAGTATACCTCAAAG ACATGTTCTAATTGTGGTTTTACAATCACAAGTGCAGATTATGACAGGGTGCTCGAAAAGCTCAAGAAG ACGGCTACTGGATGGATGACTACAATCAATGGAAAAGAGTTAAAAGTTGAAGGACAGATAACATACTA TAACCGGTATAAAAGGCAGAATGTGGTAAAAGACCTCTCTGTAGAGCTGGATAGACTTTCGGAAGAGTC GGTAAATAATGATATTTCTAGTTGGACAAAAGGCCGCAGTGGTGAAGCTTTATCTCTGCTAAAAAAGAG ATTTAGTCACAGGCCGGTGCAGGAAAAGTTTGTTTGCCTGAACTGTGGTTTTGAAACCCATGCAGACGA ACAAGCAGCACTGAATATTGCAAGGTCGTGGCTCTTTCTCCGTTCTCAAGAATATAAGAAGTATCAAAC CAATAAAACGACCGGAAATACTGACAAAAGGGCATTTGTTGAAACATGGCAATCCTTTTACAGAAAGAA GCTCAAAGAAGTATGGAAACCA SEQ ATGGGTAAAATGTATTACCTTGGTTTAGACATTGGCACGAATTCCGTGGGCTACGCGGTGACCGACCCCT ID CATACCACCTGCTGAAGTTTAAGGGGGAACCAATGTGGGGTGCGCACGTATTTGCCGCCGGTAATCAGA NO: GCGCGGAACGACGCTCGTTCCGCACATCGCGTCGTCGTTTGGACCGACGCCAACAGCGCGTTAAACTGG 41 TACAGGAGATTTTTGCCCCGGTGATTAGTCCGATCGACCCACGCTTCTTCATTCGTCTGCATGAATCCGC CCTGTGGCGCGATGACGTCGCGGAGACGGATAAACATATCTTTTTCAATGATCCTACCTATACCGATAAG GAATATTATAGCGATTACCCGACTATCCATCACCTGATCGTTGATCTGATGGAAAGCTCTGAGAAACAC GATCCGCGGCTGGTGTACCTTGCAGTGGCGTGGTTAGTGGCACACCGTGGTCATTTTCTGAACGAGGTGG ACAAGGATAATATTGGAGATGTGTTGTCGTTCGACGCATTTTATCCGGAGTTTCTCGCGTTCCTGTCGGA CAACGGTGTATCACCGTGGGTGTGCGAAAGCAAAGCGCTGCAGGCGACCTTGCTGAGCCGTAACTCAGT GAACGACAAATATAAAGCCCTTAAGTCTCTGATCTTCGGATCCCAGAAACCTGAAGATAACTTCGATGC CAATATTTCGGAAGATGGACTCATTCAACTGCTGGCCGGCAAAAAGGTAAAAGTTAACAAACTGTTCCC TCAGGAATCGAACGATGCATCCTTCACATTGAATGATAAAGAAGACGCGATAGAAGAAATCCTGGGTAC GCTTACACCAGATGAATGTGAATGGATTGCGCATATACGCCGCCTTTTTGACTGGGCTATCATGAAACAT GCTCTGAAAGATGGCAGGACTATTAGCGAGTCAAAAGTCAAACTGTATGAGCAGCACCATCACGATCTG ACCCAACTTAAATACTTCGTGAAAACCTACCTTGCAAAAGAATACGACGATATTTTCCGCAACGTGGAT AGCGAAACAACGAAAAACTATGTAGCGTATTCCTATCATGTGAAAGAGGTGAAAGGCACTCTGCCTAAA AATAAGGCAACGCAAGAAGAGTTTTGTAAGTATGTCCTGGGCAAGGTTAAAAACATTGAATGCTCTGAA GCAGACAAGGTTGACTTTGATGAGATGATTCAGCGTCTTACCGACAACTCTTTTATGCCTAAGCAGGTTT CGGGCGAAAACCGCGTTATTCCTTATCAGTTATATTATTATGAACTGAAGACAATTCTGAATAAAGCAGC CTCGTACCTGCCTTTCCTGACGCAGTGTGGAAAAGATGCAATTTCGAACCAGGACAAACTACTGTCGATC ATGACGTTCCGTATTCCTTACTTCGTCGGACCCTTGCGAAAAGATAATTCGGAACATGCATGGCTCGAAC GAAAGGCCGGTAAGATTTATCCGTGGAACTTTAACGACAAAGTGGACTTGGATAAATCAGAAGAAGCGT TCATTCGCCGAATGACCAATACCTGTACCTATTATCCCGGCGAAGATGTTTTACCGTTGGATTCGCTGAT CTATGAGAAATTTATGATTTTAAATGAAATCAATAATATTCGTATTGACGGCTACCCGATTAGTGTTGAC GTTAAACAGCAGGTTTTTGGCTTGTTCGAAAAAAAACGACGCGTAACCGTGAAAGATATTCAGAACCTG CTGCTGTCTCTCGGAGCTCTGGACAAACACGGGAAGCTGACAGGCATCGATACCACTATCCACTCAAAC TATAATACGTATCACCATTTTAAATCTCTCATGGAACGCGGCGTCCTGACCCGGGATGACGTGGAACGC ATCGTTGAAAGGATGACCTACAGCGACGATACTAAGCGTGTGCGTCTGTGGCTGAATAACAACTATGGT ACTTTAACCGCCGACGATGTGAAACACATTTCGCGTCTGCGCAAACACGATTTTGGCCGTTTATCCAAAA TGTTCTTAACAGGTCTGAAGGGTGTCCATAAGGAGACCGGTGAACGTGCCTCCATACTGGATTTCATGTG GAACACGAACGATAACCTGATGCAGCTCCTTTCCGAATGCTACACGTTCAGTGATGAAATCACAAAGCT GCAAGAGGCGTATTATGCAAAAGCCCAGTTGTCTTTAAACGATTTTTTAGACTCGATGTACATCTCTAAC GCGGTGAAACGTCCGATTTACAGAACTCTGGCAGTGGTGAACGATATTCGAAAAGCATGTGGGACGGCC CCTAAACGCATTTTCATCGAAATGGCTCGTGATGGTGAATCAAAAAAAAAGAGAAGTGTTACACGTCGC GAGCAGATCAAAAACCTGTACCGCTCGATTCGTAAAGATTTCCAGCAGGAAGTTGATTTTCTGGAAAAG ATCCTGGAAAATAAATCTGATGGTCAACTTCAGTCAGATGCTTTGTATCTTTACTTTGCACAATTAGGGC GCGATATGTACACGGGCGATCCAATAAAGCTGGAGCACATCAAAGATCAGAGTTTCTATAACATAGACC ATATTTACCCGCAGTCTATGGTGAAAGACGATTCCCTAGATAACAAAGTGCTGGTGCAAAGCGAAATTA ACGGCGAGAAAAGCTCGCGATACCCTTTGGACGCCGCGATCCGCAATAAAATGAAGCCCCTTTGGGACG CTTACTATAATCATGGCCTGATCTCCTTAAAGAAATACCAGCGTCTAACGCGCTCGACCCCGTTTACCGA TGATGAAAAATGGGACTTTATTAATCGCCAGTTAGTGGAAACCCGTCAATCTACCAAAGCGCTGGCCAT TTTGTTGAAGCGTAAGTTTCCAGACACCGAAATTGTGTATTCGAAGGCGGGGTTATCGTCCGACTTCAGA CATGAATTCGGCCTTGTAAAAAGTCGCAATATTAATGATTTGCACCACGCTAAAGACGCATTCTTGGCTA TCGTTACCGGCAATGTGTACCATGAAAGATTCAATCGCAGATGGTTTATGGTGAACCAGCCGTACTCAGT TAAAACTAAAACTCTTTTTACCCACAGCATAAAGAATGGCAACTTCGTTGCCTGGAACGGCGAAGAAGA TCTCGGTCGTATTGTAAAAATGCTGAAGCAAAACAAAAATACCATTCACTTCACGCGCTTCTCCTTCGAT CGCAAAGAAGGATTATTTGATATCCAACCTCTGAAAGCCAGCACCGGCTTAGTCCCACGAAAAGCCGGT CTGGATGTCGTTAAATACGGCGGATATGACAAATCTACCGCGGCCTATTACCTGCTGGTGAGGTTCACGC TCGAGGACAAGAAAACCCAGCACAAGCTGATGATGATTCCTGTAGAAGGCCTGTACAAGGCTCGCATTG ATCATGACAAGGAATTTCTTACCGATTATGCGCAAACGACTATAAGCGAAATCCTACAGAAAGATAAAC AGAAAGTGATCAATATTATGTTTCCAATGGGTACGAGGCATATAAAACTCAATTCAATGATTAGTATCG ATGGCTTCTATCTTAGTATCGGCGGAAAGTCCTCTAAAGGTAAGTCAGTTCTATGTCACGCAATGGTTCC ACTGATCGTCCCTCACAAAATCGAATGTTACATTAAAGCAATGGAAAGCTTCGCCCGGAAGTTTAAAGA AAACAACAAGCTGCGCATCGTAGAAAAATTCGATAAAATCACCGTTGAAGACAACCTGAATCTCTACGA GCTCTTTCTCCAAAAACTGCAGCATAATCCCTATAATAAGTTTTTTTCGACACAGTTTGACGTACTGACG AACGGCCGTTCTACTTTCACAAAACTGTCGCCGGAGGAACAGGTACAGACGCTCTTGAACATTTTAAGT ATCTTTAAAACATGCCGCAGTTCGGGTTGCGACCTGAAATCCATCAACGGCAGTGCCCAGGCAGCGCGC ATCATGATTAGCGCTGACTTAACTGGACTGTCGAAAAAATATTCAGATATTAGGTTGGTTGAACAGTCA GCTTCTGGTTTGTTCGTATCCAAAAGTCAGAACTTACTGGAGTATCTCTAA SEQ ATGTCATCGCTCACGAAATTCACTAACAAATACTCTAAACAGCTCACCATTAAGAATGAACTCATCCCA ID GTTGGCAAAACACTGGAGAACATCAAAGAGAATGGTCTGATAGATGGCGACGAACAGCTGAATGAGAA NO: TTATCAGAAGGCGAAAATTATTGTGGATGATTTTCTGCGGGACTTCATTAATAAAGCACTGAATAATACG 42 CAGATCGGGAACTGGCGCGAACTGGCGGATGCCCTTAATAAAGAGGATGAAGATAACATCGAGAAATT GCAGGATAAAATTCGGGGAATCATTGTATCCAAATTTGAAACGTTTGATCTGTTTAGCAGCTATTCTATT AAGAAAGATGAAAAGATTATTGACGACGACAATGATGTTGAAGAAGAGGAACTGGATCTGGGCAAGAA GACCAGCTCATTTAAATACATATTTAAAAAAAACCTGTTTAAGTTAGTGTTGCCATCCTACCTGAAAACC ACAAACCAGGACAAGCTGAAGATTATTAGCTCGTTTGATAATTTTTCAACGTACTTCCGCGGGTTCTTTG AAAACCGGAAAAACATTTTTACCAAGAAACCGATCTCCACAAGTATTGCGTATCGCATTGTTCATGATA ACTTCCCGAAATTCCTTGATAACATTCGTTGTTTTAATGTGTGGCAGACGGAATGCCCGCAACTAATCGT GAAAGCAGATAACTATCTGAAAAGCAAAAATGTTATAGCGAAAGATAAAAGTTTGGCAAACTATTTTAC CGTGGGCGCGTATGACTATTTCCTGTCTCAGAATGGTATAGATTTTTACAACAATATTATAGGTGGACTG CCAGCGTTCGCCGGCCATGAGAAAATCCAAGGTCTCAATGAATTCATCAATCAAGAGTGCCAAAAAGAC AGCGAGCTGAAAAGTAAGCTGAAAAACCGTCACGCGTTCAAAATGGCGGTACTGTTCAAACAGATACTC AGCGATCGTGAAAAAAGTTTTGTAATTGATGAGTTCGAGTCGGATGCTCAAGTTATTGACGCCGTTAAA AACTTTTACGCCGAACAGTGCAAAGATAACAATGTTATTTTTAACTTATTAAATCTTATCAAGAATATCG CTTTCTTAAGTGATGACGAACTGGACGGCATATTCATTGAAGGGAAATACCTGTCGAGCGTTAGTCAAA AACTCTATAGCGATTGGTCAAAATTACGTAACGACATTGAGGATTCGGCTAACTCTAAACAAGGCAATA AAGAGCTGGCCAAGAAGATCAAAACCAACAAAGGGGATGTAGAAAAAGCGATCTCGAAATATGAGTTC TCGCTGTCGGAACTGAACTCGATTGTACATGATAACACCAAGTTTTCTGACCTCCTTAGTTGTACACTGC ATAAGGTGGCTTCTGAGAAACTGGTGAAGGTCAATGAAGGCGACTGGCCGAAACATCTCAAGAATAAT GAAGAGAAACAAAAAATCAAAGAGCCGCTTGATGCTCTGCTGGAGATCTATAATACACTTCTGATTTTT AACTGCAAAAGCTTCAATAAAAACGGCAACTTCTATGTCGACTATGATCGTTGCATCAATGAACTGAGT TCGGTCGTGTATCTGTATAATAAAACACGTAACTATTGCACTAAAAAACCCTATAACACGGACAAGTTC AAACTCAATTTTAACAGTCCGCAGCTCGGTGAAGGCTTTTCCAAGTCGAAAGAAAATGACTGTCTGACT CTTTTGTTTAAAAAAGACGACAACTATTATGTAGGCATTATCCGCAAAGGTGCAAAAATCAATTTTGATG ATACACAAGCAATCGCCGATAACACCGACAATTGCATCTTTAAAATGAATTATTTCCTACTTAAAGACGC AAAAAAATTTATCCCGAAATGTAGCATTCAGCTGAAAGAAGTCAAGGCCCATTTTAAGAAATCTGAAGA TGATTACATTTTGTCTGATAAAGAGAAATTTGCTAGCCCGCTGGTCATTAAAAAGAGCACATTTTTGCTG GCAACTGCACATGTGAAAGGGAAAAAAGGCAATATCAAGAAATTTCAGAAAGAATATTCGAAAGAAAA CCCCACTGAGTATCGCAATTCTTTAAACGAATGGATTGCTTTTTGTAAAGAGTTCTTAAAAACTTATAAA GCGGCTACCATTTTTGATATAACCACATTGAAAAAGGCAGAGGAATATGCTGATATTGTAGAATTCTAC AAGGATGTCGATAATCTGTGCTACAAACTGGAGTTCTGCCCGATTAAAACCTCGTTTATAGAAAACCTG ATAGATAACGGCGACCTGTATCTGTTTCGCATCAATAACAAAGACTTCAGCAGTAAATCGACCGGCACC AAGAACCTTCATACGTTATATTTACAAGCTATATTCGATGAACGTAATCTGAACAATCCGACAATTATGC TGAATGGGGGAGCAGAACTGTTCTATCGTAAAGAAAGTATTGAGCAGAAAAACCGTATCACACACAAA GCCGGTTCAATTCTCGTGAATAAGGTGTGTAAAGACGGTACAAGCCTGGATGATAAGATACGTAATGAA ATTTATCAATATGAGAATAAATTTATTGATACCCTGTCTGATGAAGCTAAAAAGGTGTTACCGAATGTCA TTAAAAAGGAAGCTACCCATGACATTACAAAAGATAAACGTTTCACTAGTGACAAATTCTTCTTTCACTG CCCCCTGACAATTAATTATAAGGAAGGCGATACCAAGCAGTTCAATAACGAAGTGCTGAGTTTTCTGCG TGGAAATCCTGACATCAACATTATCGGCATTGACCGCGGAGAGCGTAATTTAATCTATGTAACGGTTATA AACCAGAAAGGCGAGATTCTGGATTCGGTTTCATTCAATACCGTGACCAACAAGAGTTCAAAAATCGAG CAGACAGTCGATTATGAAGAGAAATTGGCAGTCCGCGAGAAAGAGAGGATTGAAGCAAAACGTTCCTG GGACTCTATCTCAAAAATTGCGACACTAAAGGAAGGTTATCTGAGCGCAATAGTTCACGAGATCTGTCT GTTAATGATTAAACACAACGCGATCGTTGTCTTAGAGAATCTTAATGCAGGCTTTAAGCGTATTCGTGGC GGTTTATCAGAAAAAAGTGTTTATCAAAAATTCGAAAAAATGTTGATTAACAAACTGAACTATTTTGTCA GCAAGAAGGAATCCGACTGGAATAAACCGTCTGGTCTGCTGAATGGACTGCAGCTTTCGGATCAGTTTG AAAGCTTCGAAAAACTGGGTATTCAGTCTGGTTTTATTTTTTACGTGCCGGCTGCATATACCTCAAAGAT TGATCCGACCACGGGCTTCGCCAATGTTCTGAATCTGTCGAAGGTACGCAATGTTGATGCGATCAAAAG CTTTTTTTCTAACTTCAACGAAATTAGTTATAGCAAGAAAGAAGCCCTTTTCAAATTCTCATTCGATCTGG ATTCACTGAGTAAGAAAGGCTTTAGTAGCTTTGTGAAATTTAGTAAGAGTAAATGGAACGTCTACACCTT TGGAGAACGTATCATAAAGCCAAAGAATAAGCAAGGTTATCGGGAGGACAAAAGAATCAACTTGACCT TCGAGATGAAGAAGTTACTTAACGAGTATAAGGTTTCTTTTGATCTTGAAAATAACTTGATTCCGAATCT CACGAGTGCCAACCTGAAGGATACTTTTTGGAAAGAGCTATTCTTTATCTTCAAGACTACGCTGCAGCTC CGTAACAGCGTTACTAACGGTAAAGAAGATGTGCTCATCTCTCCGGTCAAAAATGCGAAGGGTGAATTC TTCGTTTCGGGAACGCATAACAAGACTCTTCCGCAAGATTGCGATGCGAACGGTGCATACCATATTGCGT TGAAAGGTCTGATGATACTCGAACGTAACAACCTTGTACGTGAGGAGAAAGATACGAAAAAGATTATG GCGATTTCAAACGTGGATTGGTTCGAGTACGTGCAGAAACGTAGAGGCGTTCTGTAA SEQ ATGAACAACTACGACGAATTCACCAAACTGTACCCGATCCAGAAAACCATCCGTTTCGAACTGAAACCG ID CAGGGTCGTACCATGGAACACCTGGAAACCTTCAACTTCTTCGAAGAAGACCGTGACCGTGCGGAAAAA NO: TACAAAATCCTGAAAGAAGCGATCGACGAATACCACAAAAAATTCATCGACGAACACCTGACCAACAT 43 GTCTCTGGACTGGAACTCTCTGAAACAGATCTCTGAAAAATACTACAAATCTCGTGAAGAAAAAGACAA AAAAGTTTTCCTGTCTGAACAGAAACGTATGCGTCAGGAAATCGTTTCTGAATTCAAAAAAGACGACCG TTTCAAAGACCTGTTCTCTAAAAAACTGTTCTCTGAACTGCTGAAAGAAGAAATCTACAAAAAAGGTAA CCACCAGGAAATCGACGCGCTGAAATCTTTCGACAAATTCTCTGGTTACTTCATCGGTCTGCACGAAAAC CGTAAAAACATGTACTCTGACGGTGACGAAATCACCGCGATCTCTAACCGTATCGTTAACGAAAACTTC CCGAAATTCCTGGACAACCTGCAGAAATACCAGGAAGCGCGTAAAAAATACCCGGAATGGATCATCAA AGCGGAATCTGCGCTGGTTGCGCACAACATCAAAATGGACGAAGTTTTCTCTCTGGAATACTTCAACAA AGTTCTGAACCAGGAAGGTATCCAGCGTTACAACCTGGCGCTGGGTGGTTACGTTACCAAATCTGGTGA AAAAATGATGGGTCTGAACGACGCGCTGAACCTGGCGCACCAGTCTGAAAAATCTTCTAAAGGTCGTAT CCACATGACCCCGCTGTTCAAACAGATCCTGTCTGAAAAAGAATCTTTCTCTTACATCCCGGACGTTTTC ACCGAAGACTCTCAGCTGCTGCCGTCTATCGGTGGTTTCTTCGCGCAGATCGAAAACGACAAAGACGGT AACATCTTCGACCGTGCGCTGGAACTGATCTCTTCTTACGCGGAATACGACACCGAACGTATCTACATCC GTCAGGCGGACATCAACCGTGTTTCTAACGTTATCTTCGGTGAATGGGGTACCCTGGGTGGTCTGATGCG TGAATACAAAGCGGACTCTATCAACGACATCAACCTGGAACGTACCTGCAAAAAAGTTGACAAATGGCT GGACTCTAAAGAATTCGCGCTGTCTGACGTTCTGGAAGCGATCAAACGTACCGGTAACAACGACGCGTT CAACGAATACATCTCTAAAATGCGTACCGCGCGTGAAAAAATCGACGCGGCGCGTAAAGAAATGAAAT TCATCTCTGAAAAAATCTCTGGTGACGAAGAATCTATCCACATCATCAAAACCCTGCTGGACTCTGTTCA GCAGTTCCTGCACTTCTTCAACCTGTTCAAAGCGCGTCAGGACATCCCGCTGGACGGTGCGTTCTACGCG GAATTCGACGAAGTTCACTCTAAACTGTTCGCGATCGTTCCGCTGTACAACAAAGTTCGTAACTACCTGA CCAAAAACAACCTGAACACCAAAAAAATCAAACTGAACTTCAAAAACCCGACCCTGGCGAACGGTTGG GACCAGAACAAAGTTTACGACTACGCGTCTCTGATCTTCCTGCGTGACGGTAACTACTACCTGGGTATCA TCAACCCGAAACGTAAAAAAAACATCAAATTCGAACAGGGTTCTGGTAACGGTCCGTTCTACCGTAAAA TGGTTTACAAACAGATCCCGGGTCCGAACAAAAACCTGCCGCGTGTTTTCCTGACCTCTACCAAAGGTA AAAAAGAATACAAACCGTCTAAAGAAATCATCGAAGGTTACGAAGCGGACAAACACATCCGTGGTGAC AAATTCGACCTGGACTTCTGCCACAAACTGATCGACTTCTTCAAAGAATCTATCGAAAAACACAAAGAC TGGTCTAAATTCAACTTCTACTTCTCTCCGACCGAATCTTACGGTGACATCTCTGAATTCTACCTGGACGT TGAAAAACAGGGTTACCGTATGCACTTCGAAAACATCTCTGCGGAAACCATCGACGAATACGTTGAAAA AGGTGACCTGTTCCTGTTCCAGATCTACAACAAAGACTTCGTTAAAGCGGCGACCGGTAAAAAAGACAT GCACACCATCTACTGGAACGCGGCGTTCTCTCCGGAAAACCTGCAGGACGTTGTTGTTAAACTGAACGG TGAAGCGGAACTGTTCTACCGTGACAAATCTGACATCAAAGAAATCGTTCACCGTGAAGGTGAAATCCT GGTTAACCGTACCTACAACGGTCGTACCCCGGTTCCGGACAAAATCCACAAAAAACTGACCGACTACCA CAACGGTCGTACCAAAGACCTGGGTGAAGCGAAAGAATACCTGGACAAAGTTCGTTACTTCAAAGCGCA CTACGACATCACCAAAGACCGTCGTTACCTGAACGACAAAATCTACTTCCACGTTCCGCTGACCCTGAAC TTCAAAGCGAACGGTAAAAAAAACCTGAACAAAATGGTTATCGAAAAATTCCTGTCTGACGAAAAAGC GCACATCATCGGTATCGACCGTGGTGAACGTAACCTGCTGTACTACTCTATCATCGACCGTTCTGGTAAA ATCATCGACCAGCAGTCTCTGAACGTTATCGACGGTTTCGACTACCGTGAAAAACTGAACCAGCGTGAA ATCGAAATGAAAGACGCGCGTCAGTCTTGGAACGCGATCGGTAAAATCAAAGACCTGAAAGAAGGTTA CCTGTCTAAAGCGGTTCACGAAATCACCAAAATGGCGATCCAGTACAACGCGATCGTTGTTATGGAAGA ACTGAACTACGGTTTCAAACGTGGTCGTTTCAAAGTTGAAAAACAGATCTACCAGAAATTCGAAAACAT GCTGATCGACAAAATGAACTACCTGGTTTTCAAAGACGCGCCGGACGAATCTCCGGGTGGTGTTCTGAA CGCGTACCAGCTGACCAACCCGCTGGAATCTTTCGCGAAACTGGGTAAACAGACCGGTATCCTGTTCTA CGTTCCGGCGGCGTACACCTCTAAAATCGACCCGACCACCGGTTTCGTTAACCTGTTCAACACCTCTTCT AAAACCAACGCGCAGGAACGTAAAGAATTCCTGCAGAAATTCGAATCTATCTCTTACTCTGCGAAAGAC GGTGGTATCTTCGCGTTCGCGTTCGACTACCGTAAATTCGGTACCTCTAAAACCGACCACAAAAACGTTT GGACCGCGTACACCAACGGTGAACGTATGCGTTACATCAAAGAAAAAAAACGTAACGAACTGTTCGAC CCGTCTAAAGAAATCAAAGAAGCGCTGACCTCTTCTGGTATCAAATACGACGGTGGTCAGAACATCCTG CCGGACATCCTGCGTTCTAACAACAACGGTCTGATCTACACCATGTACTCTTCTTTCATCGCGGCGATCC AGATGCGTGTTTACGACGGTAAAGAAGACTACATCATCTCTCCGATCAAAAACTCTAAAGGTGAATTCT TCCGTACCGACCCGAAACGTCGTGAACTGCCGATCGACGCGGACGCGAACGGTGCGTACAACATCGCGC TGCGTGGTGAACTGACCATGCGTGCGATCGCGGAAAAATTCGACCCGGACTCTGAAAAAATGGCGAAAC TGGAACTGAAACACAAAGACTGGTTCGAATTCATGCAGACCCGTGGTGACTAA SEQ ATGACTAAAACATTTGATTCAGAGTTTTTTAATTTGTACTCGCTGCAAAAAACGGTACGCTTTGAGTTAA ID AACCCGTGGGAGAAACCGCGTCATTTGTGGAAGACTTTAAAAACGAGGGCTTGAAACGTGTTGTGAGCG NO: AAGATGAAAGGCGAGCCGTCGATTACCAGAAAGTTAAGGAAATAATTGACGATTACCATCGGGATTTCA 44 TTGAAGAAAGTTTAAATTATTTTCCGGAACAGGTGAGTAAAGATGCTCTTGAGCAGGCGTTTCATCTTTA TCAGAAACTGAAGGCAGCAAAAGTTGAGGAAAGGGAAAAAGCGCTGAAAGAATGGGAAGCGCTGCAG AAAAAGCTACGTGAAAAAGTGGTGAAATGCTTCTCGGACTCGAATAAAGCCCGCTTCTCAAGGATTGAT AAAAAGGAACTGATTAAGGAAGACCTGATAAATTGGTTGGTCGCCCAGAATCGCGAGGATGATATCCCT ACGGTCGAAACGTTTAACAACTTCACCACATATTTTACCGGCTTCCATGAGAATCGTAAAAATATTTACT CCAAAGATGATCACGCCACCGCTATTAGCTTTCGCCTTATTCATGAAAATCTTCCAAAGTTTTTTGACAA CGTGATTAGCTTCAATAAGTTGAAAGAGGGTTTCCCTGAATTAAAATTTGATAAAGTGAAAGAGGATTT AGAAGTAGATTATGATCTGAAGCATGCGTTTGAAATAGAATATTTCGTTAACTTCGTGACCCAAGCGGG CATAGATCAGTATAATTATCTGTTAGGAGGGAAAACCCTGGAGGACGGGACGAAAAAACAAGGGATGA ATGAGCAAATTAATCTGTTCAAACAACAGCAAACGCGAGATAAAGCGCGTCAGATTCCCAAACTGATCC CCCTGTTCAAACAGATTCTTAGCGAAAGGACTGAAAGCCAGTCCTTTATTCCTAAACAATTTGAAAGTGA TCAGGAGTTGTTCGATTCACTGCAGAAGTTACATAATAACTGCCAGGATAAATTCACCGTGCTGCAACA AGCCATTCTCGGTCTGGCAGAGGCGGATCTTAAGAAGGTCTTCATCAAAACCTCTGATTTAAATGCCTTA TCTAACACCATTTTCGGGAATTACAGCGTCTTTTCCGATGCACTGAACCTGTATAAAGAAAGCCTGAAAA CGAAAAAAGCGCAGGAGGCTTTTGAGAAACTACCGGCCCATTCTATTCACGACCTCATTCAATACTTGG AACAGTTCAATTCCAGCCTGGACGCGGAAAAACAACAGAGCACCGACACCGTCCTGAACTACTTCATCA AGACCGATGAATTATATTCTCGCTTCATTAAATCCACTAGCGAGGCTTTCACTCAGGTGCAGCCTTTGTT CGAACTGGAAGCCCTGTCATCTAAGCGCCGCCCACCGGAATCGGAAGATGAAGGGGCAAAAGGGCAGG AAGGCTTCGAGCAGATCAAGCGTATTAAAGCTTACCTGGATACGCTTATGGAAGCGGTACACTTTGCAA AGCCGTTGTATCTTGTTAAGGGTCGTAAAATGATCGAAGGGCTCGATAAAGACCAGTCCTTTTATGAAG CGTTTGAAATGGCGTACCAAGAACTTGAATCGTTAATCATTCCTATCTATAACAAAGCGCGGAGCTATCT GTCGCGGAAACCTTTCAAGGCCGATAAATTCAAGATTAATTTTGACAACAACACGCTACTGAGCGGATG GGATGCGAACAAGGAAACTGCTAACGCGTCCATTCTGTTTAAGAAAGACGGGTTATATTACCTTGGAAT TATGCCGAAAGGTAAGACCTTTCTCTTTGACTACTTTGTATCGAGCGAGGATTCAGAGAAACTGAAACA GCGTCGCCAGAAGACCGCCGAAGAAGCTCTGGCGCAGGATGGTGAAAGTTACTTCGAAAAAATTCGTTA TAAACTGTTACCAGGGGCTTCAAAGATGTTACCGAAAGTCTTTTTTAGCAACAAAAATATTGGCTTTTAC AACCCGTCGGATGACATTTTACGCATTCGCAACACAGCCTCTCACACCAAAAACGGGACCCCTCAGAAA GGCCACTCAAAAGTTGAGTTTAACCTGAATGATTGTCATAAGATGATTGATTTCTTCAAATCATCAATTC AGAAACACCCGGAATGGGGGTCTTTTGGCTTTACGTTTTCTGATACCAGTGATTTTGAAGACATGAGTGC CTTCTACCGGGAAGTAGAAAACCAGGGTTACGTAATTAGCTTTGACAAAATCAAAGAGACCTATATACA GAGCCAGGTGGAACAGGGTAATCTCTACTTATTCCAGATTTATAACAAGGATTTCTCGCCCTACAGCAA AGGCAAACCAAACCTGCATACTCTGTACTGGAAAGCCCTGTTTGAAGAAGCGAACCTGAATAACGTAGT GGCGAAGTTGAACGGTGAAGCGGAAATCTTCTTCCGTCGTCACTCCATTAAGGCCTCTGATAAAGTTGTC CATCCGGCAAATCAGGCCATTGATAATAAGAATCCACACACGGAAAAAACGCAGTCAACCTTTGAATAT GACCTCGTTAAAGACAAACGCTACACGCAAGATAAGTTCTTTTTCCACGTCCCAATCAGCCTCAACTTTA AAGCACAAGGGGTTTCAAAGTTTAATGATAAAGTCAATGGGTTCCTCAAGGGCAACCCGGATGTCAACA TTATAGGTATAGACAGGGGCGAACGCCATCTGCTTTACTTTACCGTAGTGAATCAGAAAGGTGAAATAC TGGTTCAGGAATCATTAAATACCTTGATGTCGGACAAAGGGCACGTTAATGATTACCAGCAGAAACTGG ATAAAAAAGAACAGGAACGTGATGCTGCGCGTAAATCGTGGACCACGGTTGAGAACATTAAAGAGCTG AAAGAGGGGTATCTAAGCCATGTGGTACACAAACTGGCGCACCTCATCATTAAATATAACGCAATAGTC TGCCTAGAAGACTTGAATTTTGGCTTTAAACGCGGCCGCTTCAAAGTGGAAAAACAAGTTTATCAAAAA TTTGAAAAGGCGCTTATAGATAAACTGAATTATCTGGTTTTTAAAGAAAAGGAACTTGGTGAGGTAGGG CACTACTTGACAGCTTATCAACTGACGGCCCCGTTCGAATCATTCAAAAAACTGGGCAAACAGTCTGGC ATTCTGTTTTACGTGCCGGCAGATTATACTTCAAAAATCGATCCAACAACTGGCTTTGTGAACTTCCTGG ACCTGAGATATCAGTCTGTAGAAAAAGCTAAACAACTTCTTAGCGATTTTAATGCCATTCGTTTTAACAG CGTTCAGAATTACTTTGAATTCGAAATTGACTATAAAAAACTTACTCCGAAACGTAAAGTCGGAACCCA AAGTAAATGGGTAATTTGTACGTATGGCGATGTCAGGTATCAGAACCGTCGGAATCAAAAAGGTCATTG GGAGACCGAAGAAGTGAACGTGACCGAAAAGCTGAAGGCTCTGTTCGCCAGCGATTCAAAAACTACAA CTGTGATCGATTACGCAAATGATGATAACCTGATAGATGTGATTTTAGAGCAGGATAAAGCCAGCTTTTT TAAAGAACTGTTGTGGCTCCTGAAACTTACGATGACCTTACGACATTCCAAGATCAAATCGGAAGATGA TTTTATTCTGTCACCGGTCAAGAATGAGCAGGGTGAATTCTATGATAGTAGGAAAGCCGGCGAAGTGTG GCCGAAAGACGCCGACGCCAATGGCGCCTATCATATCGCGCTCAAAGGGCTTTGGAATTTGCAGCAGAT TAACCAGTGGGAAAAAGGTAAAACCCTGAATCTGGCTATCAAAAACCAGGATTGGTTTAGCTTTATCCA AGAGAAACCGTATCAGGAATGA SEQ ATGCATACAGGCGGTCTTCTTAGTATGGACGCGAAAGAGTTCACAGGTCAGTATCCGTTGTCGAAAACA ID TTACGATTCGAACTTCGGCCCATCGGCCGCACGTGGGATAACCTGGAGGCCTCAGGCTACTTAGCGGAA NO: GACCGCCATCGTGCCGAATGTTATCCTCGTGCGAAAGAGTTATTGGATGACAACCATCGTGCCTTCCTGA 45 ATCGTGTGTTGCCACAAATCGATATGGATTGGCACCCGATTGCGGAGGCCTTTTGTAAGGTACATAAAA ACCCTGGTAATAAAGAACTTGCCCAGGATTACAACCTTCAGTTGTCAAAGCGCCGTAAGGAGATCAGCG CATATCTTCAGGATGCAGATGGCTATAAAGGCCTGTTCGCGAAGCCCGCCTTAGACGAAGCTATGAAAA TTGCGAAAGAAAACGGGAACGAAAGTGATATTGAGGTTCTCGAAGCGTTTAACGGTTTTAGCGTATACT TCACCGGTTATCATGAGTCACGCGAGAACATTTATAGCGATGAGGATATGGTGAGCGTAGCCTACCGAA TTACTGAGGATAATTTCCCGCGCTTTGTCTCAAACGCTTTGATCTTTGATAAATTAAACGAAAGCCATCC GGATATTATCTCTGAAGTATCGGGCAATCTTGGAGTTGATGACATTGGTAAGTACTTTGACGTGTCGAAC TATAACAATTTTCTTTCCCAGGCCGGTATAGATGACTACAATCACATTATTGGCGGCCATACAACCGAAG ACGGACTGATACAAGCGTTTAATGTCGTATTGAACTTACGTCACCAAAAAGACCCTGGCTTTGAAAAAA TTCAGTTCAAACAGCTCTACAAACAAATCCTGAGCGTGCGTACCAGCAAAAGCTACATCCCGAAACAGT TTGACAACTCTAAGGAGATGGTTGACTGCATTTGCGATTATGTCAGCAAAATAGAGAAATCCGAAACAG TAGAACGGGCCCTGAAACTAGTCCGTAATATCAGTTCTTTCGACTTGCGCGGGATCTTTGTCAATAAAAA GAACTTGCGCATACTGAGCAACAAACTGATAGGAGATTGGGACGCGATCGAAACCGCATTGATGCATAG TTCTTCATCAGAAAACGATAAGAAAAGCGTATATGATAGCGCGGAGGCTTTTACGTTGGATGACATCTTT TCAAGCGTGAAAAAATTTTCTGATGCCTCTGCCGAAGATATTGGCAACAGGGCGGAAGACATCTGTAGA GTGATAAGTGAGACGGCCCCTTTTATCAACGATCTGCGAGCGGTGGACCTGGATAGCCTGAACGACGAT GGTTATGAAGCGGCCGTCTCAAAAATTCGGGAGTCGCTGGAGCCTTATATGGATCTTTTCCATGAACTGG AAATTTTCTCGGTTGGCGATGAGTTCCCAAAATGCGCAGCATTTTACAGCGAACTGGAGGAAGTCAGCG AACAGCTGATCGAAATTATTCCGTTATTCAACAAGGCGCGTTCGTTCTGCACCCGGAAACGCTATAGCAC CGATAAGATTAAAGTGAACTTAAAATTCCCGACCTTGGCGGACGGGTGGGACCTGAACAAAGAGAGAG ACAACAAAGCCGCGATTCTGCGGAAAGACGGTAAGTATTATCTGGCAATTCTGGATATGAAGAAAGATC TGTCAAGCATTAGGACCAGCGACGAAGATGAATCCAGCTTCGAAAAGATGGAGTATAAACTGTTACCGA GTCCAGTAAAAATGCTGCCAAAGATATTCGTAAAATCGAAAGCCGCTAAGGAAAAATATGGCCTGACA GATCGTATGCTTGAATGCTACGATAAAGGTATGCATAAGTCGGGTAGTGCGTTTGATCTTGGCTTTTGCC ATGAACTCATTGATTATTACAAGCGTTGTATCGCGGAGTACCCAGGCTGGGATGTGTTCGATTTCAAGTT TCGCGAAACTTCCGATTATGGGTCCATGAAAGAGTTCAATGAAGATGTGGCCGGAGCCGGTTACTATAT GAGTCTGAGAAAAATTCCGTGCAGCGAAGTGTACCGTCTGTTAGACGAGAAATCGATTTATCTATTTCA AATTTATAACAAAGATTACTCTGAAAATGCACATGGTAATAAGAACATGCATACCATGTACTGGGAGGG TCTCTTTTCCCCGCAAAACCTGGAGTCGCCCGTTTTCAAGTTGTCGGGTGGGGCAGAACTTTTCTTTCGA AAATCCTCAATCCCTAACGATGCCAAAACAGTACACCCGAAAGGCTCAGTGCTGGTTCCACGTAATGAT GTTAACGGTCGGCGTATTCCAGATTCAATCTACCGCGAACTGACACGCTATTTTAACCGTGGCGATTGCC GAATCAGTGACGAAGCCAAAAGTTATCTTGACAAGGTTAAGACTAAAAAAGCGGACCATGACATTGTG AAAGATCGCCGCTTTACCGTGGATAAAATGATGTTCCACGTCCCGATTGCGATGAACTTTAAGGCGATC AGTAAACCGAACTTAAACAAAAAAGTCATTGATGGCATCATTGATGATCAGGATCTGAAAATCATTGGT ATTGATCGTGGCGAGCGGAACTTAATTTACGTCACGATGGTTGACAGAAAAGGGAATATCTTATATCAG GATTCTCTTAACATCCTCAATGGCTACGACTATCGTAAAGCTCTGGATGTGCGCGAATATGACAACAAG GAAGCGCGTCGTAACTGGACTAAAGTGGAGGGCATTCGCAAAATGAAGGAAGGCTATCTGTCATTAGCG GTCTCGAAATTAGCGGATATGATTATCGAAAATAACGCCATCATCGTTATGGAGGACCTGAACCACGGA TTCAAAGCGGGCCGCTCAAAGATTGAAAAACAAGTTTATCAGAAATTTGAGAGTATGCTGATTAACAAA CTGGGCTATATGGTGTTAAAAGACAAGTCAATTGACCAATCAGGTGGCGCGCTGCATGGATACCAGCTG GCGAACCATGTTACCACCTTAGCATCAGTTGGAAAGCAGTGTGGGGTTATCTTTTATATACCGGCAGCGT TCACTAGTAAAATAGATCCGACCACTGGTTTCGCCGATCTCTTTGCCCTGAGTAACGTTAAAAACGTAGC GAGCATGCGTGAATTCTTTTCCAAAATGAAATCTGTCATTTATGATAAAGCTGAAGGCAAATTCGCATTC ACCTTTGATTACTTGGATTACAACGTGAAGAGCGAATGTGGTCGTACGCTGTGGACCGTTTACACCGTTG GTGAGCGCTTCACCTATTCCCGTGTGAACCGCGAATATGTACGTAAAGTCCCCACCGATATTATCTATGA TGCCCTCCAGAAAGCAGGCATTAGCGTCGAAGGAGACTTAAGGGACAGAATTGCCGAAAGCGATGGCG ATACGCTGAAGTCTATTTTTTACGCATTCAAATACGCGCTAGATATGCGCGTTGAGAATCGCGAGGAAG ACTACATTCAATCACCTGTGAAAAATGCCTCTGGGGAATTTTTTTGTTCAAAAAATGCTGGTAAAAGCCT CCCACAAGATAGCGATGCAAACGGTGCATATAACATTGCCCTGAAAGGTATTCTTCAATTACGCATGCT GTCTGAGCAGTACGACCCCAACGCGGAATCTATTAGACTTCCGCTGATAACCAATAAAGCCTGGCTGAC ATTCATGCAGTCTGGCATGAAGACCTGGAAAAATTAG SEQ ATGGATAGTTTAAAAGATTTTACGAATCTATATCCCGTAAGCAAAACTCTTCGTTTTGAACTGAAACCTG ID TTGGAAAAACGTTGGAGAATATCGAGAAAGCGGGCATCCTGAAAGAAGACGAGCACCGTGCCGAAAGC NO: TACAGGCGTGTCAAAAAGATTATCGATACTTATCACAAAGTGTTCATTGATAGCAGTCTGGAGAACATG 46 GCAAAAATGGGCATAGAAAATGAAATCAAAGCAATGCTGCAGAGCTTTTGCGAGCTCTACAAGAAAGA TCACCGAACGGAAGGTGAAGATAAAGCACTGGACAAAATTCGCGCCGTTCTTCGCGGTCTGATTGTTGG CGCGTTCACCGGCGTGTGCGGCCGCCGTGAAAACACCGTGCAGAACGAAAAGTACGAGTCGCTGTTCAA AGAAAAACTGATAAAAGAAATTTTGCCTGACTTTGTGCTTTCGACCGAAGCGGAATCCCTGCCATTTTCT GTCGAAGAAGCGACCCGCAGCCTGAAAGAATTTGACTCATTCACAAGTTACTTTGCAGGCTTCTACGAA AACCGTAAAAACATCTACAGCACGAAGCCACAGAGCACGGCTATTGCTTATCGCCTGATTCATGAGAAC CTGCCGAAGTTCATCGATAACATCCTTGTTTTTCAAAAAATTAAAGAGCCGATTGCGAAAGAGTTAGAA CATATTCGAGCTGACTTTTCTGCGGGTGGGTACATTAAAAAAGATGAGCGGCTGGAAGACATCTTCAGT CTAAACTATTATATCCACGTTCTGTCGCAGGCAGGCATTGAGAAATATAATGCGCTGATTGGTAAGATTG TCACAGAAGGCGATGGTGAGATGAAAGGTCTTAATGAACATATCAATCTGTATAACCAGCAGCGTGGTC GCGAAGACCGTCTTCCACTGTTCCGCCCACTGTATAAACAGATCCTGTCTGACCGGGAACAGCTGTCCTA CCTGCCGGAAAGCTTTGAAAAGGATGAAGAGCTACTTCGCGCATTAAAGGAGTTTTACGACCATATTGC GGAAGACATTTTGGGTAGAACGCAGCAACTGATGACGTCAATTTCTGAATACGATCTGAGTAGAATCTA CGTTAGGAATGATAGCCAGCTGACCGATATTAGCAAAAAAATGCTGGGCGACTGGAACGCTATCTATAT GGCACGTGAACGTGCATATGATCATGAACAAGCACCGAAACGTATAACCGCGAAATATGAGCGTGATC GCATTAAGGCGCTAAAGGGAGAAGAAAGCATCTCACTCGCAAACCTGAACTCCTGTATCGCTTTCTTAG ATAACGTGCGCGATTGTCGCGTCGACACGTATCTGTCAACCCTTGGGCAGAAAGAGGGTCCACATGGTC TGTCTAACCTGGTGGAAAATGTCTTTGCGAGTTACCATGAAGCGGAACAACTGCTGTCTTTTCCATACCC CGAAGAAAACAATCTAATACAGGATAAAGATAACGTGGTGTTAATCAAAAACCTGCTGGACAACATCA GCGATCTGCAACGTTTCCTGAAACCTTTGTGGGGTATGGGTGACGAGCCAGACAAAGACGAACGTTTTT ATGGTGAGTATAATTATATACGTGGCGCCCTTGACCAAGTTATTCCGCTGTATAACAAAGTACGGAACTA TCTGACCCGTAAGCCATATTCTACCCGTAAAGTGAAACTGAACTTTGGCAACTCGCAACTGCTGTCGGGT TGGGATCGTAACAAAGAAAAAGATAATAGTTGTGTTATCCTGCGTAAGGGACAAAATTTTTACCTCGCG ATTATGAACAACAGACACAAGCGTTCATTTGAAAATAAGGTTCTGCCGGAGTATAAAGAGGGCGAACCG TACTTCGAGAAAATGGATTATAAGTTCTTACCAGACCCTAATAAGATGTTACCGAAAGTCTTTCTTTCGA AAAAAGGCATAGAAATCTATAAGCCGTCCCCGAAATTACTCGAACAGTATGGGCACGGGACCCACAAG AAAGGGGATACTTTTAGCATGGACGATCTGCACGAACTGATCGATTTTTTTAAACACTCCATCGAAGCCC ATGAAGACTGGAAACAGTTTGGGTTCAAGTTCTCTGATACAGCCACATACGAGAATGTGTCTAGTTTTTA TCGGGAAGTGGAGGATCAGGGCTACAAACTTAGTTTTCGTAAAGTTTCAGAGAGTTATGTTTATAGTTTA ATTGATCAGGGAAAACTTTACCTGTTCCAGATCTACAACAAAGATTTCTCGCCATGTAGTAAGGGTACCC CGAATCTGCATACACTCTATTGGAGAATGTTATTCGATGAGCGTAACTTAGCGGATGTCATTTATAAATT GGACGGGAAAGCAGAGATCTTTTTTCGTGAAAAATCACTGAAGAATGACCACCCGACTCATCCGGCCGG GAAACCGATCAAAAAAAAATCCCGCCAGAAAAAAGGAGAAGAGTCTCTGTTTGAATATGATCTGGTGA AAGACCGTCATTACACTATGGATAAATTTCAATTTCATGTTCCAATTACAATGAACTTCAAATGTTCGGC GGGTTCCAAAGTAAATGATATGGTAAACGCCCATATTCGCGAAGCGAAAGATATGCATGTTATTGGCAT CGATAGAGGCGAAAGAAACCTGCTTTATATTTGCGTAATTGACAGCCGTGGTACCATTCTGGACCAGAT CTCTTTAAACACCATCAATGACATCGATTATCACGACCTGTTGGAGTCTCGGGACAAGGACCGCCAGCA GGAGCGCCGTAATTGGCAGACAATTGAAGGCATAAAAGAATTAAAACAGGGTTACCTTTCCCAGGCCGT ACACCGCATAGCGGAACTGATGGTGGCCTACAAAGCCGTAGTTGCCCTGGAAGACTTGAATATGGGGTT TAAACGTGGCCGTCAAAAAGTCGAGAGCAGCGTGTATCAGCAATTTGAAAAACAGTTGATTGACAAGTT GAATTATTTGGTTGATAAAAAGAAACGTCCAGAAGATATTGGTGGCTTACTGCGTGCATACCAGTTTAC GGCACCTTTTAAGTCCTTCAAAGAAATGGGTAAACAGAACGGGTTTCTGTTTTACATCCCGGCCTGGAAT ACATCCAACATCGATCCTACCACCGGGTTTGTCAACCTGTTTCATGCACAATATGAAAACGTGGATAAA GCGAAGAGTTTTTTCCAAAAATTCGATAGTATTTCGTATAACCCAAAAAAAGATTGGTTTGAGTTTGCGT TCGATTATAAAAATTTTACTAAAAAGGCTGAGGGATCCCGCAGTATGTGGATCCTCTGCACCCATGGCA GTCGTATTAAAAATTTTCGTAATTCGCAAAAGAATGGCCAGTGGGACTCGGAAGAGTTTGCCCTGACCG AAGCGTTCAAATCGCTGTTTGTACGCTACGAAATTGACTACACAGCAGATCTGAAAACAGCCATCGTCG ATGAAAAACAGAAAGATTTTTTTGTAGATCTCCTAAAACTGTTCAAACTGACTGTTCAGATGCGCAATTC CTGGAAAGAGAAAGACCTGGATTATCTGATTAGCCCGGTAGCCGGTGCTGATGGACGATTTTTCGATAC TCGTGAAGGTAACAAAAGTCTCCCGAAAGATGCTGATGCCAATGGTGCATACAATATTGCATTAAAGGG GCTATGGGCCTTGCGACAGATCCGCCAGACCAGCGAAGGCGGCAAGCTGAAATTGGCCATATCGAATAA GGAATGGTTACAATTTGTTCAGGAACGTAGCTATGAAAAAGATTGA SEQ ATGAACAACGGCACAAATAATTTTCAGAACTTCATCGGGATCTCAAGTTTGCAGAAAACGCTGCGCAAT ID GCTCTGATCCCCACGGAAACCACGCAACAGTTCATCGTCAAGAACGGAATAATTAAAGAAGATGAGTTA NO: CGTGGCGAGAACCGCCAGATTCTGAAAGATATCATGGATGACTACTACCGCGGATTCATCTCTGAGACT 47 CTGAGTTCTATTGATGACATAGATTGGACTAGCCTGTTCGAAAAAATGGAAATTCAGCTGAAAAATGGT GATAATAAAGATACCTTAATTAAGGAACAGACAGAGTATCGGAAAGCAATCCATAAAAAATTTGCGAA CGACGATCGGTTTAAGAACATGTTTAGCGCCAAACTGATTAGTGACATATTACCTGAATTTGTCATCCAC AACAATAATTATTCGGCATCAGAGAAAGAGGAAAAAACCCAGGTGATAAAATTGTTTTCGCGCTTTGCG ACTAGCTTTAAAGATTACTTCAAGAACCGTGCAAATTGCTTTTCAGCGGACGATATTTCATCAAGCAGCT GCCATCGCATCGTCAACGACAATGCAGAGATATTCTTTTCAAATGCGCTGGTCTACCGCCGGATCGTAAA ATCGCTGAGCAATGACGATATCAACAAAATTTCGGGCGATATGAAAGATTCATTAAAAGAAATGAGTCT GGAAGAAATATATTCTTACGAGAAGTATGGGGAATTTATTACCCAGGAAGGCATTAGCTTCTATAATGA TATCTGTGGGAAAGTGAATTCTTTTATGAACCTGTATTGTCAGAAAAATAAAGAAAACAAAAATTTATA CAAACTTCAGAAACTTCACAAACAGATTCTATGCATTGCGGACACTAGCTATGAGGTCCCGTATAAATTT GAAAGTGACGAGGAAGTGTACCAATCAGTTAACGGCTTCCTTGATAACATTAGCAGCAAACATATAGTC GAAAGATTACGCAAAATCGGCGATAACTATAACGGCTACAACCTGGATAAAATTTATATCGTGTCCAAA TTTTACGAGAGCGTTAGCCAAAAAACCTACCGCGACTGGGAAACAATTAATACCGCCCTCGAAATTCAT TACAATAATATCTTGCCGGGTAACGGTAAAAGTAAAGCCGACAAAGTAAAAAAAGCGGTTAAGAATGA TTTACAGAAATCCATCACCGAAATAAATGAACTAGTGTCAAACTATAAGCTGTGCAGTGACGACAACAT CAAAGCGGAGACTTATATACATGAGATTAGCCATATCTTGAATAACTTTGAAGCACAGGAATTGAAATA CAATCCGGAAATTCACCTAGTTGAATCCGAGCTCAAAGCGAGTGAGCTTAAAAACGTGCTGGACGTGAT CATGAATGCGTTTCATTGGTGTTCGGTTTTTATGACTGAGGAACTTGTTGATAAAGACAACAATTTTTAT GCGGAACTGGAGGAGATTTACGATGAAATTTATCCAGTAATTAGTCTGTACAACCTGGTTCGTAACTAC GTTACCCAGAAACCGTACAGCACGAAAAAGATTAAATTGAACTTTGGAATACCGACGTTAGCAGACGGT TGGTCAAAGTCCAAAGAGTATTCTAATAACGCTATCATACTGATGCGCGACAATCTGTATTATCTGGGCA TCTTTAATGCGAAGAATAAACCGGACAAGAAGATTATCGAGGGTAATACGTCAGAAAATAAGGGTGAC TACAAAAAGATGATTTATAATTTGCTCCCGGGTCCCAACAAAATGATCCCGAAAGTTTTCTTGAGCAGCA AGACGGGGGTGGAAACGTATAAACCGAGCGCCTATATCCTAGAGGGGTATAAACAGAATAAACATATC AAGTCTTCAAAAGACTTTGATATCACTTTCTGTCATGATCTGATCGACTACTTCAAAAACTGTATTGCAA TTCATCCCGAGTGGAAAAACTTCGGTTTTGATTTTAGCGACACCAGTACTTATGAAGACATTTCCGGGTT TTATCGTGAGGTAGAGTTACAAGGTTACAAGATTGATTGGACATACATTAGCGAAAAAGACATTGATCT GCTGCAGGAAAAAGGTCAACTGTATCTGTTCCAGATATATAACAAAGATTTTTCGAAAAAATCAACCGG GAATGACAACCTTCACACCATGTACCTGAAAAATCTTTTCTCAGAAGAAAATCTTAAGGATATCGTCCTG AAACTTAACGGCGAAGCGGAAATCTTCTTCAGGAAGAGCAGCATAAAGAACCCAATCATTCATAAAAA AGGCTCGATTTTAGTCAACCGTACCTACGAAGCAGAAGAAAAAGACCAGTTTGGCAACATTCAAATTGT GCGTAAAAATATTCCGGAAAACATTTATCAGGAGCTGTACAAATACTTCAACGATAAAAGCGACAAAGA GCTGTCTGATGAAGCAGCCAAACTGAAGAATGTAGTGGGACACCACGAGGCAGCGACGAATATAGTCA AGGACTATCGCTACACGTATGATAAATACTTCCTTCATATGCCTATTACGATCAATTTCAAAGCCAATAA AACGGGTTTTATTAATGATAGGATCTTACAGTATATCGCTAAAGAAAAAGACTTACATGTGATCGGCATT GATCGGGGCGAGCGTAACCTGATCTACGTGTCCGTGATTGATACTTGTGGTAATATAGTTGAACAGAAA AGCTTTAACATTGTAAACGGCTACGACTATCAGATAAAACTGAAACAACAGGAGGGCGCTAGACAGATT GCGCGGAAAGAATGGAAAGAAATTGGTAAAATTAAAGAGATCAAAGAGGGCTACCTGAGCTTAGTAAT CCACGAGATCTCTAAAATGGTAATCAAATACAATGCAATTATAGCGATGGAGGATTTGTCTTATGGTTTT AAAAAAGGGCGCTTTAAGGTCGAACGGCAAGTTTACCAGAAATTTGAAACCATGCTCATCAATAAACTC AACTATCTGGTATTTAAAGATATTTCGATTACCGAGAATGGCGGTCTCCTGAAAGGTTATCAGCTGACAT ACATTCCTGATAAACTTAAAAACGTGGGTCATCAGTGCGGCTGCATTTTTTATGTGCCTGCTGCATACAC GAGCAAAATTGATCCGACCACCGGCTTTGTGAATATCTTTAAATTTAAAGACCTGACAGTGGACGCAAA ACGTGAATTCATTAAAAAATTTGACTCAATTCGTTATGACAGTGAAAAAAATCTGTTCTGCTTTACATTT GACTACAATAACTTTATTACGCAAAACACGGTCATGAGCAAATCATCGTGGAGTGTGTATACATACGGC GTGCGCATCAAACGTCGCTTTGTGAACGGCCGCTTCTCAAACGAAAGTGATACCATTGACATAACCAAA GATATGGAGAAAACGTTGGAAATGACGGACATTAACTGGCGCGATGGCCACGATCTTCGTCAAGACATT ATAGATTATGAAATTGTTCAGCACATATTCGAAATTTTCCGTTTAACAGTGCAAATGCGTAACTCCTTGT CTGAACTGGAGGACCGTGATTACGATCGTCTCATTTCACCTGTACTGAACGAAAATAACATTTTTTATGA CAGCGCGAAAGCGGGGGATGCACTTCCTAAGGATGCCGATGCAAATGGTGCGTATTGTATTGCATTAAA AGGGTTATATGAAATTAAACAAATTACCGAAAATTGGAAAGAAGATGGTAAATTTTCGCGCGATAAACT CAAAATCAGCAATAAAGATTGGTTCGACTTTATCCAGAATAAGCGCTATCTCTAA SEQ ATGACCAATAAATTCACTAACCAGTATTCTCTCTCTAAGACCCTGCGCTTTGAACTGATTCCGCAGGGGA ID AAACCTTGGAGTTCATTCAAGAAAAAGGCCTCTTGTCTCAGGATAAACAGAGGGCTGAATCTTACCAAG NO: AAATGAAGAAAACTATTGATAAGTTTCATAAATATTTCATTGATTTAGCCTTGTCTAACGCCAAATTAAC 48 TCACTTGGAAACGTATCTGGAGTTATACAACAAATCTGCCGAAACTAAGAAAGAACAGAAATTTAAAGA CGATTTGAAAAAAGTACAGGACAATCTGCGTAAAGAAATTGTCAAATCCTTCAGTGACGGCGATGCTAA AAGCATTTTTGCCATTCTGGACAAAAAAGAGTTGATTACTGTGGAATTAGAAAAGTGGTTTGAAAACAA TGAGCAGAAAGACATCTACTTCGATGAGAAATTCAAAACTTTCACCACCTATTTTACAGGATTTCATCAA AACCGGAAGAACATGTACTCAGTAGAACCGAACTCCACGGCCATTGCGTATCGTTTGATCCATGAGAAT CTGCCTAAATTTCTGGAGAATGCGAAAGCCTTTGAAAAGATTAAGCAGGTCGAATCGCTGCAAGTGAAT TTTCGTGAACTCATGGGCGAATTTGGTGACGAAGGTCTAATCTTCGTTAACGAACTGGAAGAAATGTTTC AGATTAATTACTACAATGACGTGCTATCGCAGAACGGTATCACAATCTACAATAGTATTATCTCAGGGTT CACAAAAAACGATATAAAATACAAAGGCCTGAACGAGTATATCAATAACTACAACCAAACAAAGGACA AAAAGGATAGGCTTCCGAAACTGAAGCAGTTATACAAACAGATTTTATCTGACAGAATCTCCCTGAGCT TTCTGCCGGATGCTTTCACTGATGGGAAGCAGGTTCTGAAAGCGATTTTCGATTTTTATAAGATTAACTT ACTGAGCTACACGATTGAAGGTCAAGAAGAATCTCAAAACTTACTGCTCTTGATCCGTCAAACCATTGA AAATCTATCATCGTTCGATACGCAGAAAATCTACCTCAAAAACGATACTCACCTGACTACGATCTCTCAG CAGGTTTTCGGGGATTTTAGTGTATTTTCAACAGCTCTGAACTACTGGTATGAAACCAAAGTCAATCCGA AATTCGAGACGGAATATTCTAAGGCCAACGAAAAAAAACGTGAGATTCTTGATAAAGCTAAAGCCGTAT TTACTAAACAGGATTACTTTTCTATTGCTTTCCTGCAGGAAGTTTTATCGGAGTATATCCTGACCCTGGAT CATACATCTGATATCGTTAAAAAACACAGCAGCAATTGCATCGCTGACTATTTCAAAAACCACTTTGTCG CCAAAAAAGAAAACGAAACAGACAAGACTTTCGATTTCATTGCTAACATCACCGCAAAATACCAGTGTA TTCAGGGTATCTTGGAAAACGCCGACCAATACGAAGACGAACTGAAACAAGATCAGAAGCTGATCGAT AATTTAAAATTCTTCTTAGATGCAATCCTGGAGCTGCTGCACTTCATCAAACCGCTTCATTTAAAGAGCG AGTCCATTACCGAAAAGGACACCGCCTTCTATGACGTTTTTGAAAATTATTATGAAGCCCTCTCCTTGCT GACTCCGCTGTATAATATGGTACGCAATTACGTAACCCAGAAACCATATTCTACCGAAAAAATTAAACT GAACTTTGAAAACGCACAGCTGCTCAACGGTTGGGACGCGAATAAAGAAGGTGACTACCTCACCACCAT CCTGAAAAAAGATGGTAACTATTTTCTGGCAATTATGGATAAGAAACATAATAAAGCATTCCAGAAATT TCCTGAAGGGAAAGAAAATTACGAAAAGATGGTGTACAAACTCTTACCTGGAGTTAACAAAATGTTGCC GAAAGTATTTTTTAGTAATAAGAACATCGCGTACTTTAACCCGTCCAAAGAACTGCTGGAAAATTATAA AAAGGAGACGCATAAGAAAGGGGATACCTTTAACCTGGAACATTGCCATACCTTAATAGACTTCTTCAA GGATTCCCTGAATAAACACGAGGATTGGAAATATTTCGATTTTCAGTTTAGTGAGACCAAGTCATACCA GGATCTTAGCGGCTTTTATCGCGAAGTAGAACACCAAGGCTATAAAATTAACTTCAAAAACATCGACAG CGAATACATCGACGGTTTAGTTAACGAGGGCAAACTGTTTCTGTTCCAGATCTATTCAAAGGATTTTAGC CCGTTCTCTAAAGGCAAACCAAATATGCATACGTTGTACTGGAAAGCACTGTTTGAAGAGCAAAACCTG CAGAATGTGATTTATAAACTGAACGGCCAAGCTGAGATTTTTTTCCGTAAAGCCTCGATTAAACCGAAA AATATCATCCTTCATAAGAAGAAAATAAAGATCGCTAAAAAACACTTCATAGATAAAAAAACCAAAAC CTCCGAAATAGTGCCTGTTCAAACAATTAAGAACTTGAATATGTACTACCAGGGCAAGATATCGGAAAA GGAGTTGACTCAAGACGATCTTCGCTATATCGATAACTTTTCGATTTTTAACGAAAAAAACAAGACGATC GACATCATCAAAGATAAACGCTTCACTGTAGATAAGTTCCAGTTTCATGTGCCGATTACTATGAACTTCA AAGCTACCGGGGGTAGCTATATCAACCAAACGGTGTTGGAATACCTGCAGAATAACCCGGAAGTCAAA ATCATTGGGCTGGACCGCGGAGAACGTCACCTTGTGTACTTGACCTTAATCGATCAGCAAGGCAACATC TTAAAACAAGAATCGCTGAATACCATTACGGATTCAAAGATTAGCACCCCGTATCATAAGCTGCTCGAT AACAAGGAGAATGAGCGCGACCTGGCCCGTAAAAACTGGGGCACGGTGGAAAACATTAAGGAGTTAAA GGAGGGTTATATTTCCCAGGTAGTGCATAAGATCGCCACTCTCATGCTCGAGGAAAATGCGATCGTTGTC ATGGAAGACTTAAACTTCGGATTTAAACGTGGGCGATTTAAAGTAGAGAAACAAATCTACCAGAAGTTA GAAAAAATGCTGATTGACAAATTAAATTACTTGGTCCTAAAAGACAAACAGCCGCAAGAATTGGGTGGA TTATACAACGCCCTCCAACTTACCAATAAATTCGAAAGTTTTCAGAAAATGGGTAAACAGTCAGGCTTTC TTTTTTATGTTCCTGCGTGGAACACATCCAAAATCGACCCTACAACCGGCTTCGTCAATTACTTCTATACT AAATATGAAAACGTCGACAAAGCAAAAGCATTCTTTGAAAAGTTCGAAGCAATACGTTTTAACGCTGAG AAAAAATATTTCGAGTTCGAAGTCAAGAAATACTCAGACTTTAACCCCAAAGCTGAGGGCACACAGCAA GCGTGGACAATCTGCACCTACGGCGAGCGCATCGAAACGAAGCGTCAAAAAGATCAGAATAACAAATT TGTTTCAACACCTATCAACCTGACCGAGAAGATTGAAGACTTCTTAGGTAAAAATCAGATTGTTTATGGC GACGGTAACTGTATAAAATCTCAAATAGCCTCAAAGGATGATAAAGCATTTTTCGAAACATTATTATATT GGTTCAAAATGACACTGCAGATGCGCAATAGTGAGACGCGTACAGATATTGATTATCTTATCAGCCCGG TCATGAACGACAACGGTACTTTTTACAACTCCAGAGACTATGAAAAACTTGAGAATCCAACTCTCCCCA AAGATGCTGATGCGAACGGTGCTTATCACATCGCGAAAAAAGGTCTGATGCTGCTGAACAAAATCGACC AAGCCGATCTGACTAAGAAAGTTGACCTAAGCATTTCAAATCGGGACTGGTTACAGTTTGTTCAAAAGA ACAAATGA SEQ ATGGAACAGGAATATTATCTGGGCTTGGACATGGGCACCGGTTCCGTCGGCTGGGCTGTTACTGACAGT ID GAATATCACGTTCTAAGAAAGCATGGTAAGGCATTGTGGGGTGTAAGACTTTTCGAATCTGCTTCCACTG NO: CTGAAGAGCGTAGAATGTTTAGAACGAGTCGACGTAGGCTAGACAGGCGCAATTGGAGAATCGAAATTT 49 TACAAGAAATTTTTGCGGAAGAGATATCTAAGAAAGACCCAGGCTTTTTCCTGAGAATGAAGGAATCTA AGTATTACCCTGAGGATAAAAGAGATATAAATGGTAACTGTCCCGAATTGCCTTACGCATTATTTGTGGA CGATGATTTTACCGATAAGGATTACCATAAAAAGTTCCCAACTATCTACCATTTACGCAAAATGTTAATG AATACAGAGGAAACCCCAGACATAAGACTAGTTTATCTGGCAATACACCATATGATGAAACATAGAGGC CATTTCTTACTTTCCGGGGATATCAACGAAATCAAAGAGTTTGGTACCACATTTAGTAAGTTACTGGAAA ACATAAAGAATGAAGAATTGGATTGGAACTTAGAACTCGGAAAAGAAGAATACGCGGTTGTCGAATCT ATCCTGAAGGATAATATGCTGAATAGGTCGACCAAAAAAACTAGGCTGATCAAAGCACTGAAAGCCAA ATCTATCTGCGAAAAAGCTGTTTTAAATTTACTTGCTGGTGGCACTGTTAAGTTATCAGACATTTTTGGTT TGGAAGAATTGAACGAAACCGAGCGTCCAAAAATTAGTTTCGCTGATAATGGCTACGATGATTACATTG GTGAGGTGGAAAACGAGTTGGGCGAACAATTTTATATTATAGAGACAGCTAAGGCAGTCTATGACTGGG CTGTTTTAGTAGAAATCCTTGGTAAATACACATCTATCTCCGAAGCGAAAGTTGCTACTTACGAAAAGCA CAAGTCCGATCTCCAGTTTTTGAAGAAAATTGTCAGGAAATATCTGACTAAGGAAGAATATAAAGATAT TTTCGTTAGTACCTCTGACAAACTGAAAAATTACTCCGCTTACATCGGGATGACCAAGATTAATGGCAAA AAAGTTGATCTGCAAAGCAAAAGGTGTTCGAAGGAAGAATTTTATGATTTCATTAAAAAGAATGTCTTA AAAAAATTAGAAGGTCAGCCAGAATACGAATATTTGAAAGAAGAACTGGAAAGAGAGACATTCTTACC AAAACAAGTCAACAGAGATAATGGGGTAATTCCATATCAAATTCACCTCTACGAATTAAAAAAAATTTT AGGCAATTTACGCGATAAAATTGACCTTATCAAAGAAAATGAGGATAAGCTGGTTCAACTCTTTGAATT CAGAATACCCTATTATGTGGGCCCACTGAACAAGATTGATGACGGCAAAGAAGGTAAATTCACATGGGC CGTCCGCAAATCCAATGAAAAAATTTACCCATGGAACTTTGAAAATGTAGTAGATATTGAAGCGTCTGC GGAGAAATTTATTCGAAGAATGACTAATAAATGCACTTACTTGATGGGAGAGGATGTTCTGCCTAAAGA CAGCTTATTATACAGCAAGTACATGGTTCTAAACGAACTTAACAACGTTAAGTTGGACGGTGAGAAATT AAGTGTAGAATTGAAACAAAGATTGTATACTGACGTCTTCTGCAAGTACAGAAAAGTGACAGTTAAAAA AATTAAGAATTACTTGAAGTGCGAAGGTATAATTTCTGGAAACGTAGAGATTACTGGTATTGATGGTGA TTTCAAAGCATCCCTAACAGCTTACCACGATTTCAAGGAAATCCTGACAGGAACTGAACTCGCAAAAAA AGATAAAGAAAACATTATTACTAATATTGTTCTTTTCGGTGATGACAAGAAATTGTTGAAGAAAAGACT GAATAGACTTTACCCCCAGATTACTCCCAATCAACTTAAGAAAATTTGTGCTTTGTCTTACACAGGATGG GGTCGTTTTTCAAAAAAGTTCTTAGAAGAGATTACCGCACCTGATCCAGAAACAGGCGAAGTATGGAAT ATAATTACCGCCTTATGGGAATCGAACAATAATCTTATGCAACTTCTGAGCAATGAATATCGTTTCATGG AAGAAGTTGAGACTTACAACATGGGCAAACAGACGAAGACTTTATCCTATGAAACTGTGGAAAATATGT ATGTATCACCTTCTGTCAAGAGACAAATTTGGCAAACCTTAAAAATTGTCAAAGAATTAGAAAAGGTAA TGAAGGAGTCTCCTAAACGTGTGTTTATTGAAATGGCTAGAGAAAAACAAGAGTCAAAAAGAACCGAG TCAAGAAAGAAGCAGTTAATCGATTTATATAAGGCTTGTAAAAACGAAGAGAAAGATTGGGTTAAAGA ATTGGGGGACCAAGAGGAACAAAAACTACGGTCGGATAAGTTGTATTTATACTATACGCAAAAGGGAC GATGTATGTATTCCGGCGAGGTAATAGAATTGAAGGATTTATGGGACAATACAAAATATGACATAGACC ATATATATCCCCAATCAAAAACGATGGACGATAGCTTGAACAATAGAGTACTCGTGAAAAAAAAATATA ATGCGACCAAATCTGATAAGTATCCTCTGAATGAAAATATCAGACATGAAAGAAAGGGGTTCTGGAAGT CCTTGTTAGATGGTGGGTTTATAAGCAAAGAAAAGTACGAGCGTCTAATAAGAAACACGGAGTTATCGC CAGAAGAACTCGCTGGTTTTATTGAGAGGCAAATCGTGGAAACGAGACAATCTACCAAAGCCGTTGCTG AGATCCTAAAGCAAGTTTTCCCAGAGTCGGAGATTGTCTATGTCAAAGCTGGCACAGTGAGCAGGTTTA GGAAAGACTTCGAACTATTAAAGGTAAGAGAAGTGAACGATTTACATCACGCAAAGGACGCTTACCTAA ATATCGTTGTAGGTAACTCATATTATGTTAAATTTACCAAGAACGCCTCTTGGTTTATAAAGGAGAACCC AGGTAGAACATATAACCTGAAAAAGATGTTCACCTCTGGTTGGAATATTGAGAGAAACGGAGAAGTCGC ATGGGAAGTTGGTAAGAAAGGGACTATAGTGACAGTAAAGCAAATTATGAACAAAAATAATATCCTCG TTACAAGGCAGGTTCATGAAGCAAAGGGCGGCCTTTTTGACCAACAAATTATGAAGAAAGGGAAAGGT CAAATTGCAATAAAAGAAACCGATGAGAGACTAGCGTCAATAGAAAAGTATGGTGGCTATAATAAAGC TGCGGGTGCATACTTTATGCTTGTTGAATCAAAAGACAAGAAAGGTAAGACTATTAGAACTATAGAATT TATACCCCTGTACCTTAAAAACAAAATTGAATCGGATGAGTCAATCGCGTTAAATTTTCTAGAGAAAGG AAGGGGTTTAAAAGAACCAAAGATCCTGTTAAAAAAGATTAAGATTGACACCTTGTTCGATGTAGATGG ATTTAAAATGTGGTTATCTGGCAGAACAGGCGATAGACTTTTGTTTAAGTGCGCTAATCAATTAATTTTG GATGAGAAAATCATTGTCACAATGAAAAAAATAGTTAAGTTTATTCAGAGAAGACAAGAAAACAGGGA GTTGAAATTATCTGATAAAGATGGTATCGACAATGAAGTTTTAATGGAAATCTACAATACATTCGTTGAT AAACTTGAAAATACCGTATATCGAATCAGGTTAAGTGAACAAGCCAAAACATTAATTGATAAACAAAAA GAATTTGAAAGGCTATCACTGGAAGACAAATCCTCCACCCTATTTGAAATTTTGCATATATTCCAGTGCC AATCTTCAGCAGCTAATTTAAAAATGATTGGCGGACCTGGGAAAGCCGGCATCCTAGTGATGAACAATA ATATCTCCAAGTGTAACAAAATATCAATTATTAACCAATCTCCGACAGGTATTTTTGAAAATGAAATAGA CTTGCTTAAGATATAA SEQ ATGTCTTTCGACTCTTTCACCAACCTGTACTCTCTGTCTAAAACCCTGAAATTCGAAATGCGTCCGGTTGG ID TAACACCCAGAAAATGCTGGACAACGCGGGTGTTTTCGAAAAAGACAAACTGATCCAGAAAAAATACG NO: GTAAAACCAAACCGTACTTCGACCGTCTGCACCGTGAATTCATCGAAGAAGCGCTGACCGGTGTTGAAC 50 TGATCGGTCTGGACGAAAACTTCCGTACCCTGGTTGACTGGCAGAAAGACAAAAAAAACAACGTTGCGA TGAAAGCGTACGAAAACTCTCTGCAGCGTCTGCGTACCGAAATCGGTAAAATCTTCAACCTGAAAGCGG AAGACTGGGTTAAAAACAAATACCCGATCCTGGGTCTGAAAAACAAAAACACCGACATCCTGTTCGAAG AAGCGGTTTTCGGTATCCTGAAAGCGCGTTACGGTGAAGAAAAAGACACCTTCATCGAAGTTGAAGAAA TCGACAAAACCGGTAAATCTAAAATCAACCAGATCTCTATCTTCGACTCTTGGAAAGGTTTCACCGGTTA CTTCAAAAAATTCTTCGAAACCCGTAAAAACTTCTACAAAAACGACGGTACCTCTACCGCGATCGCGAC CCGTATCATCGACCAGAACCTGAAACGTTTCATCGACAACCTGTCTATCGTTGAATCTGTTCGTCAGAAA GTTGACCTGGCGGAAACCGAAAAATCTTTCTCTATCTCTCTGTCTCAGTTCTTCTCTATCGACTTCTACAA CAAATGCCTGCTGCAGGACGGTATCGACTACTACAACAAAATCATCGGTGGTGAAACCCTGAAAAACGG TGAAAAACTGATCGGTCTGAACGAACTGATCAACCAGTACCGTCAGAACAACAAAGACCAGAAAATCC CGTTCTTCAAACTGCTGGACAAACAGATCCTGTCTGAAAAAATCCTGTTCCTGGACGAAATCAAAAACG ACACCGAACTGATCGAAGCGCTGTCTCAGTTCGCGAAAACCGCGGAAGAAAAAACCAAAATCGTTAAA AAACTGTTCGCGGACTTCGTTGAAAACAACTCTAAATACGACCTGGCGCAGATCTACATCTCTCAGGAA GCGTTCAACACCATCTCTAACAAATGGACCTCTGAAACCGAAACCTTCGCGAAATACCTGTTCGAAGCG ATGAAATCTGGTAAACTGGCGAAATACGAAAAAAAAGACAACTCTTACAAATTCCCGGACTTCATCGCG CTGTCTCAGATGAAATCTGCGCTGCTGTCTATCTCTCTGGAAGGTCACTTCTGGAAAGAAAAATACTACA AAATCTCTAAATTCCAGGAAAAAACCAACTGGGAACAGTTCCTGGCGATCTTCCTGTACGAATTCAACT CTCTGTTCTCTGACAAAATCAACACCAAAGACGGTGAAACCAAACAGGTTGGTTACTACCTGTTCGCGA AAGACCTGCACAACCTGATCCTGTCTGAACAGATCGACATCCCGAAAGACTCTAAAGTTACCATCAAAG ACTTCGCGGACTCTGTTCTGACCATCTACCAGATGGCGAAATACTTCGCGGTTGAAAAAAAACGTGCGT GGCTGGCGGAATACGAACTGGACTCTTTCTACACCCAGCCGGACACCGGTTACCTGCAGTTCTACGACA ACGCGTACGAAGACATCGTTCAGGTTTACAACAAACTGCGTAACTACCTGACCAAAAAACCGTACTCTG AAGAAAAATGGAAACTGAACTTCGAAAACTCTACCCTGGCGAACGGTTGGGACAAAAACAAAGAATCT GACAACTCTGCGGTTATCCTGCAGAAAGGTGGTAAATACTACCTGGGTCTGATCACCAAAGGTCACAAC AAAATCTTCGACGACCGTTTCCAGGAAAAATTCATCGTTGGTATCGAAGGTGGTAAATACGAAAAAATC GTTTACAAATTCTTCCCGGACCAGGCGAAAATGTTCCCGAAAGTTTGCTTCTCTGCGAAAGGTCTGGAAT TCTTCCGTCCGTCTGAAGAAATCCTGCGTATCTACAACAACGCGGAATTCAAAAAAGGTGAAACCTACT CTATCGACTCTATGCAGAAACTGATCGACTTCTACAAAGACTGCCTGACCAAATACGAAGGTTGGGCGT GCTACACCTTCCGTCACCTGAAACCGACCGAAGAATACCAGAACAACATCGGTGAATTCTTCCGTGACG TTGCGGAAGACGGTTACCGTATCGACTTCCAGGGTATCTCTGACCAGTACATCCACGAAAAAAACGAAA AAGGTGAACTGCACCTGTTCGAAATCCACAACAAAGACTGGAACCTGGACAAAGCGCGTGACGGTAAA TCTAAAACCACCCAGAAAAACCTGCACACCCTGTACTTCGAATCTCTGTTCTCTAACGACAACGTTGTTC AGAACTTCCCGATCAAACTGAACGGTCAGGCGGAAATCTTCTACCGTCCGAAAACCGAAAAAGACAAA CTGGAATCTAAAAAAGACAAAAAAGGTAACAAAGTTATCGACCACAAACGTTACTCTGAAAACAAAAT CTTCTTCCACGTTCCGCTGACCCTGAACCGTACCAAAAACGACTCTTACCGTTTCAACGCGCAGATCAAC AACTTCCTGGCGAACAACAAAGACATCAACATCATCGGTGTTGACCGTGGTGAAAAACACCTGGTTTAC TACTCTGTTATCACCCAGGCGTCTGACATCCTGGAATCTGGTTCTCTGAACGAACTGAACGGTGTTAACT ACGCGGAAAAACTGGGTAAAAAAGCGGAAAACCGTGAACAGGCGCGTCGTGACTGGCAGGACGTTCAG GGTATCAAAGACCTGAAAAAAGGTTACATCTCTCAGGTTGTTCGTAAACTGGCGGACCTGGCGATCAAA CACAACGCGATCATCATCCTGGAAGACCTGAACATGCGTTTCAAACAGGTTCGTGGTGGTATCGAAAAA TCTATCTACCAGCAGCTGGAAAAAGCGCTGATCGACAAACTGTCTTTCCTGGTTGACAAAGGTGAAAAA AACCCGGAACAGGCGGGTCACCTGCTGAAAGCGTACCAGCTGTCTGCGCCGTTCGAAACCTTCCAGAAA ATGGGTAAACAGACCGGTATCATCTTCTACACCCAGGCGTCTTACACCTCTAAATCTGACCCGGTTACCG GTTGGCGTCCGCACCTGTACCTGAAATACTTCTCTGCGAAAAAAGCGAAAGACGACATCGCGAAATTCA CCAAAATCGAATTCGTTAACGACCGTTTCGAACTGACCTACGACATCAAAGACTTCCAGCAGGCGAAAG AATACCCGAACAAAACCGTTTGGAAAGTTTGCTCTAACGTTGAACGTTTCCGTTGGGACAAAAACCTGA ACCAGAACAAAGGTGGTTACACCCACTACACCAACATCACCGAAAACATCCAGGAACTGTTCACCAAAT ACGGTATCGACATCACCAAAGACCTGCTGACCCAGATCTCTACCATCGACGAAAAACAGAACACCTCTT TCTTCCGTGACTTCATCTTCTACTTCAACCTGATCTGCCAGATCCGTAACACCGACGACTCTGAAATCGC GAAAAAAAACGGTAAAGACGACTTCATCCTGTCTCCGGTTGAACCGTTCTTCGACTCTCGTAAAGACAA CGGTAACAAACTGCCGGAAAACGGTGACGACAACGGTGCGTACAACATCGCGCGTAAAGGTATCGTTAT CCTGAACAAAATCTCTCAGTACTCTGAAAAAAACGAAAACTGCGAAAAAATGAAATGGGGTGACCTGT ACGTTTCTAACATCGACTGGGACAACTTCGTT SEQ ATGGAAAACTTTAAAAACTTATACCCAATAAACAAAACGTTACGTTTTGAACTGCGTCCATATGGTAAA ID ACACTGGAAAACTTTAAAAAAAGCGGTTTGTTGGAGAAGGATGCATTTAAAGCGAACTCTCGCAGATCC NO: ATGCAGGCCATCATTGATGAAAAATTTAAAGAGACGATCGAAGAACGTCTGAAATACACGGAATTTAGT 51 GAGTGTGACTTAGGTAATATGACTTCTAAAGATAAGAAAATCACCGATAAGGCGGCGACCAACCTGAAG AAGCAAGTCATTTTATCTTTTGATGATGAAATCTTTAACAACTATTTGAAACCGGACAAAAACATCGATG CCTTATTTAAAAATGACCCTTCGAACCCGGTGATTAGCACATTTAAGGGCTTCACAACGTATTTTGTCAA TTTTTTTGAAATTCGTAAACATATCTTCAAAGGAGAATCAAGCGGCTCTATGGCTTATCGCATTATTGAT GAAAACCTGACGACCTATTTGAATAACATTGAAAAAATCAAAAAACTGCCAGAGGAATTAAAGTCTCAG TTAGAAGGCATCGACCAGATCGACAAACTCAACAACTATAACGAATTTATTACGCAGTCTGGTATCACC CACTATAATGAAATTATTGGAGGTATCAGTAAATCAGAAAATGTGAAAATCCAAGGGATTAATGAAGGC ATTAACCTCTATTGCCAGAAAAATAAAGTGAAACTGCCGAGGCTGACTCCACTCTACAAAATGATCCTG TCTGACCGCGTCTCGAATAGCTTTGTCCTGGACACAATTGAAAACGATACGGAATTGATTGAGATGATA AGCGATCTGATTAACAAAACCGAAATTTCACAGGATGTAATCATGAGTGATATACAAAACATCTTTATT AAATATAAACAGCTTGGTAATCTGCCTGGAATTAGCTATTCGTCAATAGTGAACGCAATCTGTTCTGATT ATGATAACAATTTTGGCGACGGTAAGCGTAAAAAGAGTTATGAAAACGATAGGAAAAAACACCTGGAA ACTAACGTGTATTCTATCAACTATATCAGCGAACTGCTTACGGACACCGATGTGAGTTCAAACATTAAGA TGCGGTATAAGGAGCTTGAACAGAACTACCAGGTCTGTAAGGAAAACTTCAACGCAACCAACTGGATGA ACATTAAAAATATCAAACAATCCGAGAAGACCAACTTAATCAAAGATCTGCTGGATATTTTGAAGAGCA TTCAACGTTTTTATGATCTGTTCGATATCGTTGATGAAGACAAGAATCCTAGTGCGGAATTTTATACATG GCTGTCTAAAAATGCGGAGAAATTGGATTTCGAATTCAATTCTGTTTATAATAAATCACGCAACTATTTG ACCCGCAAACAATACAGCGACAAAAAGATAAAACTAAACTTCGACAGTCCGACATTGGCAAAGGGCTG GGACGCAAATAAGGAAATCGATAACTCTACGATAATTATGCGTAAGTTCAATAATGATCGAGGTGATTA TGATTATTTCTTAGGCATTTGGAACAAAAGCACCCCGGCCAACGAAAAGATAATTCCACTGGAGGATAA CGGTCTGTTCGAAAAAATGCAGTACAAATTATATCCGGATCCAAGCAAGATGCTTCCAAAGCAGTTTCT GTCTAAAATTTGGAAAGCTAAGCATCCGACCACCCCAGAATTTGACAAGAAATATAAGGAAGGCCGCCA TAAGAAAGGTCCCGATTTTGAAAAAGAATTCTTGCACGAACTGATTGATTGCTTTAAACATGGCTTAGTC AATCACGATGAAAAGTATCAAGATGTTTTTGGATTCAATTTGAGAAACACAGAAGACTACAATTCCTAC ACTGAGTTTCTCGAAGATGTGGAACGATGTAATTATAATCTGAGCTTTAACAAAATCGCGGACACCTCG AATCTGATTAACGATGGTAAACTTTATGTTTTCCAGATCTGGAGCAAGGATTTCTCTATTGACAGCAAAG GCACCAAAAACCTGAACACCATTTACTTTGAAAGTCTCTTCAGCGAAGAAAATATGATTGAGAAAATGT TTAAACTTAGCGGTGAAGCTGAAATATTCTATCGCCCGGCAAGCCTGAACTATTGCGAAGACATTATCA AAAAGGGTCATCACCACGCTGAACTGAAAGATAAATTTGATTATCCTATCATAAAAGATAAACGCTATA GCCAGGATAAATTTTTTTTTCATGTTCCTATGGTCATTAACTACAAATCAGAAAAACTGAACTCTAAAAG CCTCAATAATCGAACCAATGAAAACCTTGGGCAGTTTACCCATATAATTGGAATTGATCGCGGAGAGCG TCATTTAATCTACCTGACCGTAGTCGATGTATCGACCGGCGAGATCGTCGAGCAGAAGCACTTAGACGA GATTATCAACACTGATACCAAAGGTGTTGAGCATAAGACGCACTATCTAAACAAGCTGGAGGAAAAATC GAAAACCCGTGATAATGAACGTAAGAGTTGGGAGGCAATTGAAACGATTAAAGAACTGAAGGAGGGTT ATATCAGCCACGTAATCAATGAAATTCAAAAACTGCAGGAAAAATACAACGCCCTGATCGTTATGGAAA ATCTGAATTACGGTTTCAAAAATTCTCGCATCAAAGTGGAAAAACAGGTATATCAGAAGTTCGAGACGG CATTAATTAAAAAGTTTAATTACATCATTGACAAAAAAGATCCGGAAACTTATATTCATGGCTATCAGCT GACGAACCCGATCACCACACTGGATAAAATTGGTAACCAGTCTGGTATCGTGCTTTACATCCCTGCCTGG AATACCAGTAAAATCGATCCGGTAACGGGATTCGTCAACCTTCTATATGCAGATGACCTCAAATATAAG AATCAGGAACAGGCCAAGTCTTTTATTCAGAAAATCGATAACATTTACTTTGAGAATGGGGAATTCAAA TTTGATATTGATTTTTCTAAATGGAACAATCGTTATAGTATATCTAAGACGAAATGGACGCTCACCTCGT ACGGAACCCGAATCCAGACATTCCGCAATCCGCAGAAGAACAATAAATGGGACAGCGCCGAGTATGAT CTCACTGAAGAATTCAAATTGATTCTGAACATTGACGGTACCCTGAAAAGCCAGGATGTCGAAACCTAT AAAAAATTTATGTCTCTGTTCAAGCTGATGCTGCAACTTAGGAACTCTGTTACCGGCACTGATATCGATT ATATGATCTCCCCTGTCACTGATAAAACAGGTACGCATTTCGATTCGCGCGAAAATATCAAAAATCTGCC CGCAGATGCCGACGCCAATGGGGCGTACAATATTGCACGCAAGGGTATCATGGCGATCGAAAACATTAT GAATGGTATCAGCGACCCGCTGAAAATCTCAAACGAAGATTATTTGAAATATATCCAAAACCAGCAGGA ATAA SEQ ATGACCCAGTTCGAAGGTTTCACCAACCTGTACCAGGTTTCTAAAACCCTGCGTTTCGAACTGATCCCGC ID AGGGTAAAACCCTGAAACACATCCAGGAACAGGGTTTCATCGAAGAAGACAAAGCGCGTAACGACCAC NO: TACAAAGAACTGAAACCGATCATCGACCGTATCTACAAAACCTACGCGGACCAGTGCCTGCAGCTGGTT 52 CAGCTGGACTGGGAAAACCTGTCTGCGGCGATCGACTCTTACCGTAAAGAAAAAACCGAAGAAACCCGT AACGCGCTGATCGAAGAACAGGCGACCTACCGTAACGCGATCCACGACTACTTCATCGGTCGTACCGAC AACCTGACCGACGCGATCAACAAACGTCACGCGGAAATCTACAAAGGTCTGTTCAAAGCGGAACTGTTC AACGGTAAAGTTCTGAAACAGCTGGGTACCGTTACCACCACCGAACACGAAAACGCGCTGCTGCGTTCT TTCGACAAATTCACCACCTACTTCTCTGGTTTCTACGAAAACCGTAAAAACGTTTTCTCTGCGGAAGACA TCTCTACCGCGATCCCGCACCGTATCGTTCAGGACAACTTCCCGAAATTCAAAGAAAACTGCCACATCTT CACCCGTCTGATCACCGCGGTTCCGTCTCTGCGTGAACACTTCGAAAACGTTAAAAAAGCGATCGGTATC TTCGTTTCTACCTCTATCGAAGAAGTTTTCTCTTTCCCGTTCTACAACCAGCTGCTGACCCAGACCCAGAT CGACCTGTACAACCAGCTGCTGGGTGGTATCTCTCGTGAAGCGGGTACCGAAAAAATCAAAGGTCTGAA CGAAGTTCTGAACCTGGCGATCCAGAAAAACGACGAAACCGCGCACATCATCGCGTCTCTGCCGCACCG TTTCATCCCGCTGTTCAAACAGATCCTGTCTGACCGTAACACCCTGTCTTTCATCCTGGAAGAATTCAAA TCTGACGAAGAAGTTATCCAGTCTTTCTGCAAATACAAAACCCTGCTGCGTAACGAAAACGTTCTGGAA ACCGCGGAAGCGCTGTTCAACGAACTGAACTCTATCGACCTGACCCACATCTTCATCTCTCACAAAAAA CTGGAAACCATCTCTTCTGCGCTGTGCGACCACTGGGACACCCTGCGTAACGCGCTGTACGAACGTCGTA TCTCTGAACTGACCGGTAAAATCACCAAATCTGCGAAAGAAAAAGTTCAGCGTTCTCTGAAACACGAAG ACATCAACCTGCAGGAAATCATCTCTGCGGCGGGTAAAGAACTGTCTGAAGCGTTCAAACAGAAAACCT CTGAAATCCTGTCTCACGCGCACGCGGCGCTGGACCAGCCGCTGCCGACCACCCTGAAAAAACAGGAAG AAAAAGAAATCCTGAAATCTCAGCTGGACTCTCTGCTGGGTCTGTACCACCTGCTGGACTGGTTCGCGGT TGACGAATCTAACGAAGTTGACCCGGAATTCTCTGCGCGTCTGACCGGTATCAAACTGGAAATGGAACC GTCTCTGTCTTTCTACAACAAAGCGCGTAACTACGCGACCAAAAAACCGTACTCTGTTGAAAAATTCAA ACTGAACTTCCAGATGCCGACCCTGGCGTCTGGTTGGGACGTTAACAAAGAAAAAAACAACGGTGCGAT CCTGTTCGTTAAAAACGGTCTGTACTACCTGGGTATCATGCCGAAACAGAAAGGTCGTTACAAAGCGCT GTCTTTCGAACCGACCGAAAAAACCTCTGAAGGTTTCGACAAAATGTACTACGACTACTTCCCGGACGC GGCGAAAATGATCCCGAAATGCTCTACCCAGCTGAAAGCGGTTACCGCGCACTTCCAGACCCACACCAC CCCGATCCTGCTGTCTAACAACTTCATCGAACCGCTGGAAATCACCAAAGAAATCTACGACCTGAACAA CCCGGAAAAAGAACCGAAAAAATTCCAGACCGCGTACGCGAAAAAAACCGGTGACCAGAAAGGTTACC GTGAAGCGCTGTGCAAATGGATCGACTTCACCCGTGACTTCCTGTCTAAATACACCAAAACCACCTCTAT CGACCTGTCTTCTCTGCGTCCGTCTTCTCAGTACAAAGACCTGGGTGAATACTACGCGGAACTGAACCCG CTGCTGTACCACATCTCTTTCCAGCGTATCGCGGAAAAAGAAATCATGGACGCGGTTGAAACCGGTAAA CTGTACCTGTTCCAGATCTACAACAAAGACTTCGCGAAAGGTCACCACGGTAAACCGAACCTGCACACC CTGTACTGGACCGGTCTGTTCTCTCCGGAAAACCTGGCGAAAACCTCTATCAAACTGAACGGTCAGGCG GAACTGTTCTACCGTCCGAAATCTCGTATGAAACGTATGGCGCACCGTCTGGGTGAAAAAATGCTGAAC AAAAAACTGAAAGACCAGAAAACCCCGATCCCGGACACCCTGTACCAGGAACTGTACGACTACGTTAA CCACCGTCTGTCTCACGACCTGTCTGACGAAGCGCGTGCGCTGCTGCCGAACGTTATCACCAAAGAAGTT TCTCACGAAATCATCAAAGACCGTCGTTTCACCTCTGACAAATTCTTCTTCCACGTTCCGATCACCCTGA ACTACCAGGCGGCGAACTCTCCGTCTAAATTCAACCAGCGTGTTAACGCGTACCTGAAAGAACACCCGG AAACCCCGATCATCGGTATCGACCGTGGTGAACGTAACCTGATCTACATCACCGTTATCGACTCTACCGG TAAAATCCTGGAACAGCGTTCTCTGAACACCATCCAGCAGTTCGACTACCAGAAAAAACTGGACAACCG TGAAAAAGAACGTGTTGCGGCGCGTCAGGCGTGGTCTGTTGTTGGTACCATCAAAGACCTGAAACAGGG TTACCTGTCTCAGGTTATCCACGAAATCGTTGACCTGATGATCCACTACCAGGCGGTTGTTGTTCTGGAA AACCTGAACTTCGGTTTCAAATCTAAACGTACCGGTATCGCGGAAAAAGCGGTTTACCAGCAGTTCGAA AAAATGCTGATCGACAAACTGAACTGCCTGGTTCTGAAAGACTACCCGGCGGAAAAAGTTGGTGGTGTT CTGAACCCGTACCAGCTGACCGACCAGTTCACCTCTTTCGCGAAAATGGGTACCCAGTCTGGTTTCCTGT TCTACGTTCCGGCGCCGTACACCTCTAAAATCGACCCGCTGACCGGTTTCGTTGACCCGTTCGTTTGGAA AACCATCAAAAACCACGAATCTCGTAAACACTTCCTGGAAGGTTTCGACTTCCTGCACTACGACGTTAA AACCGGTGACTTCATCCTGCACTTCAAAATGAACCGTAACCTGTCTTTCCAGCGTGGTCTGCCGGGTTTC ATGCCGGCGTGGGACATCGTTTTCGAAAAAAACGAAACCCAGTTCGACGCGAAAGGTACCCCGTTCATC GCGGGTAAACGTATCGTTCCGGTTATCGAAAACCACCGTTTCACCGGTCGTTACCGTGACCTGTACCCGG CGAACGAACTGATCGCGCTGCTGGAAGAAAAAGGTATCGTTTTCCGTGACGGTTCTAACATCCTGCCGA AACTGCTGGAAAACGACGACTCTCACGCGATCGACACCATGGTTGCGCTGATCCGTTCTGTTCTGCAGAT GCGTAACTCTAACGCGGCGACCGGTGAAGACTACATCAACTCTCCGGTTCGTGACCTGAACGGTGTTTG CTTCGACTCTCGTTTCCAGAACCCGGAATGGCCGATGGACGCGGACGCGAACGGTGCGTACCACATCGC GCTGAAAGGTCAGCTGCTGCTGAACCACCTGAAAGAATCTAAAGACCTGAAACTGCAGAACGGTATCTC TAACCAGGACTGGCTGGCGTACATCCAGGAACTGCGTAACTA SEQ ATGGCGGTTAAATCTATCAAAGTTAAACTGCGTCTGGACGACATGCCGGAAATCCGTGCGGGTCTGTGG ID AAACTGCACAAAGAAGTTAACGCGGGTGTTCGTTACTACACCGAATGGCTGTCTCTGCTGCGTCAGGAA NO: AACCTGTACCGTCGTTCTCCGAACGGTGACGGTGAACAGGAATGCGACAAAACCGCGGAAGAATGCAA 53 AGCGGAACTGCTGGAACGTCTGCGTGCGCGTCAGGTTGAAAACGGTCACCGTGGTCCGGCGGGTTCTGA CGACGAACTGCTGCAGCTGGCGCGTCAGCTGTACGAACTGCTGGTTCCGCAGGCGATCGGTGCGAAAGG TGACGCGCAGCAGATCGCGCGTAAATTCCTGTCTCCGCTGGCGGACAAAGACGCGGTTGGTGGTCTGGG TATCGCGAAAGCGGGTAACAAACCGCGTTGGGTTCGTATGCGTGAAGCGGGTGAACCGGGTTGGGAAG AAGAAAAAGAAAAAGCGGAAACCCGTAAATCTGCGGACCGTACCGCGGACGTTCTGCGTGCGCTGGCG GACTTCGGTCTGAAACCGCTGATGCGTGTTTACACCGACTCTGAAATGTCTTCTGTTGAATGGAAACCGC TGCGTAAAGGTCAGGCGGTTCGTACCTGGGACCGTGACATGTTCCAGCAGGCGATCGAACGTATGATGT CTTGGGAATCTTGGAACCAGCGTGTTGGTCAGGAATACGCGAAACTGGTTGAACAGAAAAACCGTTTCG AACAGAAAAACTTCGTTGGTCAGGAACACCTGGTTCACCTGGTTAACCAGCTGCAGCAGGACATGAAAG AAGCGTCTCCGGGTCTGGAATCTAAAGAACAGACCGCGCACTACGTTACCGGTCGTGCGCTGCGTGGTT CTGACAAAGTTTTCGAAAAATGGGGTAAACTGGCGCCGGACGCGCCGTTCGACCTGTACGACGCGGAAA TCAAAAACGTTCAGCGTCGTAACACCCGTCGTTTCGGTTCTCACGACCTGTTCGCGAAACTGGCGGAACC GGAATACCAGGCGCTGTGGCGTGAAGACGCGTCTTTCCTGACCCGTTACGCGGTTTACAACTCTATCCTG CGTAAACTGAACCACGCGAAAATGTTCGCGACCTTCACCCTGCCGGACGCGACCGCGCACCCGATCTGG ACCCGTTTCGACAAACTGGGTGGTAACCTGCACCAGTACACCTTCCTGTTCAACGAATTCGGTGAACGTC GTCACGCGATCCGTTTCCACAAACTGCTGAAAGTTGAAAACGGTGTTGCGCGTGAAGTTGACGACGTTA CCGTTCCGATCTCTATGTCTGAACAGCTGGACAACCTGCTGCCGCGTGACCCGAACGAACCGATCGCGCT GTACTTCCGTGACTACGGTGCGGAACAGCACTTCACCGGTGAATTCGGTGGTGCGAAAATCCAGTGCCG TCGTGACCAGCTGGCGCACATGCACCGTCGTCGTGGTGCGCGTGACGTTTACCTGAACGTTTCTGTTCGT GTTCAGTCTCAGTCTGAAGCGCGTGGTGAACGTCGTCCGCCGTACGCGGCGGTTTTCCGTCTGGTTGGTG ACAACCACCGTGCGTTCGTTCACTTCGACAAACTGTCTGACTACCTGGCGGAACACCCGGACGACGGTA AACTGGGTTCTGAAGGTCTGCTGTCTGGTCTGCGTGTTATGTCTGTTGACCTGGGTCTGCGTACCTCTGCG TCTATCTCTGTTTTCCGTGTTGCGCGTAAAGACGAACTGAAACCGAACTCTAAAGGTCGTGTTCCGTTCT TCTTCCCGATCAAAGGTAACGACAACCTGGTTGCGGTTCACGAACGTTCTCAGCTGCTGAAACTGCCGG GTGAAACCGAATCTAAAGACCTGCGTGCGATCCGTGAAGAACGTCAGCGTACCCTGCGTCAGCTGCGTA CCCAGCTGGCGTACCTGCGTCTGCTGGTTCGTTGCGGTTCTGAAGACGTTGGTCGTCGTGAACGTTCTTG GGCGAAACTGATCGAACAGCCGGTTGACGCGGCGAACCACATGACCCCGGACTGGCGTGAAGCGTTCG AAAACGAACTGCAGAAACTGAAATCTCTGCACGGTATCTGCTCTGACAAAGAATGGATGGACGCGGTTT ACGAATCTGTTCGTCGTGTTTGGCGTCACATGGGTAAACAGGTTCGTGACTGGCGTAAAGACGTTCGTTC TGGTGAACGTCCGAAAATCCGTGGTTACGCGAAAGACGTTGTTGGTGGTAACTCTATCGAACAGATCGA ATACCTGGAACGTCAGTACAAATTCCTGAAATCTTGGTCTTTCTTCGGTAAAGTTTCTGGTCAGGTTATC CGTGCGGAAAAAGGTTCTCGTTTCGCGATCACCCTGCGTGAACACATCGACCACGCGAAAGAAGACCGT CTGAAAAAACTGGCGGACCGTATCATCATGGAAGCGCTGGGTTACGTTTACGCGCTGGACGAACGTGGT AAAGGTAAATGGGTTGCGAAATACCCGCCGTGCCAGCTGATCCTGCTGGAAGAACTGTCTGAATACCAG TTCAACAACGACCGTCCGCCGTCTGAAAACAACCAGCTGATGCAGTGGTCTCACCGTGGTGTTTTCCAGG AACTGATCAACCAGGCGCAGGTTCACGACCTGCTGGTTGGTACCATGTACGCGGCGTTCTCTTCTCGTTT CGACGCGCGTACCGGTGCGCCGGGTATCCGTTGCCGTCGTGTTCCGGCGCGTTGCACCCAGGAACACAA CCCGGAACCGTTCCCGTGGTGGCTGAACAAATTCGTTGTTGAACACACCCTGGACGCGTGCCCGCTGCGT GCGGACGACCTGATCCCGACCGGTGAAGGTGAAATCTTCGTTTCTCCGTTCTCTGCGGAAGAAGGTGAC TTCCACCAGATCCACGCGGACCTGAACGCGGCGCAGAACCTGCAGCAGCGTCTGTGGTCTGACTTCGAC ATCTCTCAGATCCGTCTGCGTTGCGACTGGGGTGAAGTTGACGGTGAACTGGTTCTGATCCCGCGTCTGA CCGGTAAACGTACCGCGGACTCTTACTCTAACAAAGTTTTCTACACCAACACCGGTGTTACCTACTACGA ACGTGAACGTGGTAAAAAACGTCGTAAAGTTTTCGCGCAGGAAAAACTGTCTGAAGAAGAAGCGGAAC TGCTGGTTGAAGCGGACGAAGCGCGTGAAAAATCTGTTGTTCTGATGCGTGACCCGTCTGGTATCATCA ACCGTGGTAACTGGACCCGTCAGAAAGAATTCTGGTCTATGGTTAACCAGCGTATCGAAGGTTACCTGG TTAAACAGATCCGTTCTCGTGTTCCGCTGCAGGACTCTGCGTGCGAAAACACCGGTGACATCTAA SEQ ATGGCGACCCGTTCTTTCATCCTGAAAATCGAACCGAACGAAGAAGTTAAAAAAGGTCTGTGGAAAACC ID CACGAAGTTCTGAACCACGGTATCGCGTACTACATGAACATCCTGAAACTGATCCGTCAGGAAGCGATC NO: TACGAACACCACGAACAGGACCCGAAAAACCCGAAAAAAGTTTCTAAAGCGGAAATCCAGGCGGAACT 54 GTGGGACTTCGTTCTGAAAATGCAGAAATGCAACTCTTTCACCCACGAAGTTGACAAAGACGTTGTTTTC AACATCCTGCGTGAACTGTACGAAGAACTGGTTCCGTCTTCTGTTGAAAAAAAAGGTGAAGCGAACCAG CTGTCTAACAAATTCCTGTACCCGCTGGTTGACCCGAACTCTCAGTCTGGTAAAGGTACCGCGTCTTCTG GTCGTAAACCGCGTTGGTACAACCTGAAAATCGCGGGTGACCCGTCTTGGGAAGAAGAAAAAAAAAAA TGGGAAGAAGACAAAAAAAAAGACCCGCTGGCGAAAATCCTGGGTAAACTGGCGGAATACGGTCTGAT CCCGCTGTTCATCCCGTTCACCGACTCTAACGAACCGATCGTTAAAGAAATCAAATGGATGGAAAAATC TCGTAACCAGTCTGTTCGTCGTCTGGACAAAGACATGTTCATCCAGGCGCTGGAACGTTTCCTGTCTTGG GAATCTTGGAACCTGAAAGTTAAAGAAGAATACGAAAAAGTTGAAAAAGAACACAAAACCCTGGAAGA ACGTATCAAAGAAGACATCCAGGCGTTCAAATCTCTGGAACAGTACGAAAAAGAACGTCAGGAACAGC TGCTGCGTGACACCCTGAACACCAACGAATACCGTCTGTCTAAACGTGGTCTGCGTGGTTGGCGTGAAA TCATCCAGAAATGGCTGAAAATGGACGAAAACGAACCGTCTGAAAAATACCTGGAAGTTTTCAAAGACT ACCAGCGTAAACACCCGCGTGAAGCGGGTGACTACTCTGTTTACGAATTCCTGTCTAAAAAAGAAAACC ACTTCATCTGGCGTAACCACCCGGAATACCCGTACCTGTACGCGACCTTCTGCGAAATCGACAAAAAAA AAAAAGACGCGAAACAGCAGGCGACCTTCACCCTGGCGGACCCGATCAACCACCCGCTGTGGGTTCGTT TCGAAGAACGTTCTGGTTCTAACCTGAACAAATACCGTATCCTGACCGAACAGCTGCACACCGAAAAAC TGAAAAAAAAACTGACCGTTCAGCTGGACCGTCTGATCTACCCGACCGAATCTGGTGGTTGGGAAGAAA AAGGTAAAGTTGACATCGTTCTGCTGCCGTCTCGTCAGTTCTACAACCAGATCTTCCTGGACATCGAAGA AAAAGGTAAACACGCGTTCACCTACAAAGACGAATCTATCAAATTCCCGCTGAAAGGTACCCTGGGTGG TGCGCGTGTTCAGTTCGACCGTGACCACCTGCGTCGTTACCCGCACAAAGTTGAATCTGGTAACGTTGGT CGTATCTACTTCAACATGACCGTTAACATCGAACCGACCGAATCTCCGGTTTCTAAATCTCTGAAAATCC ACCGTGACGACTTCCCGAAATTCGTTAACTTCAAACCGAAAGAACTGACCGAATGGATCAAAGACTCTA AAGGTAAAAAACTGAAATCTGGTATCGAATCTCTGGAAATCGGTCTGCGTGTTATGTCTATCGACCTGG GTCAGCGTCAGGCGGCGGCGGCGTCTATCTTCGAAGTTGTTGACCAGAAACCGGACATCGAAGGTAAAC TGTTCTTCCCGATCAAAGGTACCGAACTGTACGCGGTTCACCGTGCGTCTTTCAACATCAAACTGCCGGG TGAAACCCTGGTTAAATCTCGTGAAGTTCTGCGTAAAGCGCGTGAAGACAACCTGAAACTGATGAACCA GAAACTGAACTTCCTGCGTAACGTTCTGCACTTCCAGCAGTTCGAAGACATCACCGAACGTGAAAAACG TGTTACCAAATGGATCTCTCGTCAGGAAAACTCTGACGTTCCGCTGGTTTACCAGGACGAACTGATCCAG ATCCGTGAACTGATGTACAAACCGTACAAAGACTGGGTTGCGTTCCTGAAACAGCTGCACAAACGTCTG GAAGTTGAAATCGGTAAAGAAGTTAAACACTGGCGTAAATCTCTGTCTGACGGTCGTAAAGGTCTGTAC GGTATCTCTCTGAAAAACATCGACGAAATCGACCGTACCCGTAAATTCCTGCTGCGTTGGTCTCTGCGTC CGACCGAACCGGGTGAAGTTCGTCGTCTGGAACCGGGTCAGCGTTTCGCGATCGACCAGCTGAACCACC TGAACGCGCTGAAAGAAGACCGTCTGAAAAAAATGGCGAACACCATCATCATGCACGCGCTGGGTTACT GCTACGACGTTCGTAAAAAAAAATGGCAGGCGAAAAACCCGGCGTGCCAGATCATCCTGTTCGAAGACC TGTCTAACTACAACCCGTACGAAGAACGTTCTCGTTTCGAAAACTCTAAACTGATGAAATGGTCTCGTCG TGAAATCCCGCGTCAGGTTGCGCTGCAGGGTGAAATCTACGGTCTGCAGGTTGGTGAAGTTGGTGCGCA GTTCTCTTCTCGTTTCCACGCGAAAACCGGTTCTCCGGGTATCCGTTGCTCTGTTGTTACCAAAGAAAAA CTGCAGGACAACCGTTTCTTCAAAAACCTGCAGCGTGAAGGTCGTCTGACCCTGGACAAAATCGCGGTT CTGAAAGAAGGTGACCTGTACCCGGACAAAGGTGGTGAAAAATTCATCTCTCTGTCTAAAGACCGTAAA CTGGTTACCACCCACGCGGACATCAACGCGGCGCAGAACCTGCAGAAACGTTTCTGGACCCGTACCCAC GGTTTCTACAAAGTTTACTGCAAAGCGTACCAGGTTGACGGTCAGACCGTTTACATCCCGGAATCTAAA GACCAGAAACAGAAAATCATCGAAGAATTCGGTGAAGGTTACTTCATCCTGAAAGACGGTGTTTACGAA TGGGGTAACGCGGGTAAACTGAAAATCAAAAAAGGTTCTTCTAAACAGTCTTCTTCTGAACTGGTTGAC TCTGACATCCTGAAAGACTCTTTCGACCTGGCGTCTGAACTGAAAGGTGAAAAACTGATGCTGTACCGT GACCCGTCTGGTAACGTTTTCCCGTCTGACAAATGGATGGCGGCGGGTGTTTTCTTCGGTAAACTGGAAC GTATCCTGATCTCTAAACTGACCAACCAGTACTCTATCTCTACCATCGAAGACGACTCTTCTAAACAGTC TATGTAA SEQ ATGCCGACCCGTACCATCAACCTGAAACTGGTTCTGGGTAAAAACCCGGAAAACGCGACCCTGCGTCGT ID GCGCTGTTCTCTACCCACCGTCTGGTTAACCAGGCGACCAAACGTATCGAAGAATTCCTGCTGCTGTGCC NO: GTGGTGAAGCGTACCGTACCGTTGACAACGAAGGTAAAGAAGCGGAAATCCCGCGTCACGCGGTTCAG 55 GAAGAAGCGCTGGCGTTCGCGAAAGCGGCGCAGCGTCACAACGGTTGCATCTCTACCTACGAAGACCAG GAAATCCTGGACGTTCTGCGTCAGCTGTACGAACGTCTGGTTCCGTCTGTTAACGAAAACAACGAAGCG GGTGACGCGCAGGCGGCGAACGCGTGGGTTTCTCCGCTGATGTCTGCGGAATCTGAAGGTGGTCTGTCT GTTTACGACAAAGTTCTGGACCCGCCGCCGGTTTGGATGAAACTGAAAGAAGAAAAAGCGCCGGGTTGG GAAGCGGCGTCTCAGATCTGGATCCAGTCTGACGAAGGTCAGTCTCTGCTGAACAAACCGGGTTCTCCG CCGCGTTGGATCCGTAAACTGCGTTCTGGTCAGCCGTGGCAGGACGACTTCGTTTCTGACCAGAAAAAA AAACAGGACGAACTGACCAAAGGTAACGCGCCGCTGATCAAACAGCTGAAAGAAATGGGTCTGCTGCC GCTGGTTAACCCGTTCTTCCGTCACCTGCTGGACCCGGAAGGTAAAGGTGTTTCTCCGTGGGACCGTCTG GCGGTTCGTGCGGCGGTTGCGCACTTCATCTCTTGGGAATCTTGGAACCACCGTACCCGTGCGGAATACA ACTCTCTGAAACTGCGTCGTGACGAATTCGAAGCGGCGTCTGACGAATTCAAAGACGACTTCACCCTGC TGCGTCAGTACGAAGCGAAACGTCACTCTACCCTGAAATCTATCGCGCTGGCGGACGACTCTAACCCGT ACCGTATCGGTGTTCGTTCTCTGCGTGCGTGGAACCGTGTTCGTGAAGAATGGATCGACAAAGGTGCGA CCGAAGAACAGCGTGTTACCATCCTGTCTAAACTGCAGACCCAGCTGCGTGGTAAATTCGGTGACCCGG ACCTGTTCAACTGGCTGGCGCAGGACCGTCACGTTCACCTGTGGTCTCCGCGTGACTCTGTTACCCCGCT GGTTCGTATCAACGCGGTTGACAAAGTTCTGCGTCGTCGTAAACCGTACGCGCTGATGACCTTCGCGCAC CCGCGTTTCCACCCGCGTTGGATCCTGTACGAAGCGCCGGGTGGTTCTAACCTGCGTCAGTACGCGCTGG ACTGCACCGAAAACGCGCTGCACATCACCCTGCCGCTGCTGGTTGACGACGCGCACGGTACCTGGATCG AAAAAAAAATCCGTGTTCCGCTGGCGCCGTCTGGTCAGATCCAGGACCTGACCCTGGAAAAACTGGAAA AAAAAAAAAACCGTCTGTACTACCGTTCTGGTTTCCAGCAGTTCGCGGGTCTGGCGGGTGGTGCGGAAG TTCTGTTCCACCGTCCGTACATGGAACACGACGAACGTTCTGAAGAATCTCTGCTGGAACGTCCGGGTGC GGTTTGGTTCAAACTGACCCTGGACGTTGCGACCCAGGCGCCGCCGAACTGGCTGGACGGTAAAGGTCG TGTTCGTACCCCGCCGGAAGTTCACCACTTCAAAACCGCGCTGTCTAACAAATCTAAACACACCCGTACC CTGCAGCCGGGTCTGCGTGTTCTGTCTGTTGACCTGGGTATGCGTACCTTCGCGTCTTGCTCTGTTTTCGA ACTGATCGAAGGTAAACCGGAAACCGGTCGTGCGTTCCCGGTTGCGGACGAACGTTCTATGGACTCTCC GAACAAACTGTGGGCGAAACACGAACGTTCTTTCAAACTGACCCTGCCGGGTGAAACCCCGTCTCGTAA AGAAGAAGAAGAACGTTCTATCGCGCGTGCGGAAATCTACGCGCTGAAACGTGACATCCAGCGTCTGAA ATCTCTGCTGCGTCTGGGTGAAGAAGACAACGACAACCGTCGTGACGCGCTGCTGGAACAGTTCTTCAA AGGTTGGGGTGAAGAAGACGTTGTTCCGGGTCAGGCGTTCCCGCGTTCTCTGTTCCAGGGTCTGGGTGCG GCGCCGTTCCGTTCTACCCCGGAACTGTGGCGTCAGCACTGCCAGACCTACTACGACAAAGCGGAAGCG TGCCTGGCGAAACACATCTCTGACTGGCGTAAACGTACCCGTCCGCGTCCGACCTCTCGTGAAATGTGGT ACAAAACCCGTTCTTACCACGGTGGTAAATCTATCTGGATGCTGGAATACCTGGACGCGGTTCGTAAACT GCTGCTGTCTTGGTCTCTGCGTGGTCGTACCTACGGTGCGATCAACCGTCAGGACACCGCGCGTTTCGGT TCTCTGGCGTCTCGTCTGCTGCACCACATCAACTCTCTGAAAGAAGACCGTATCAAAACCGGTGCGGACT CTATCGTTCAGGCGGCGCGTGGTTACATCCCGCTGCCGCACGGTAAAGGTTGGGAACAGCGTTACGAAC CGTGCCAGCTGATCCTGTTCGAAGACCTGGCGCGTTACCGTTTCCGTGTTGACCGTCCGCGTCGTGAAAA CTCTCAGCTGATGCAGTGGAACCACCGTGCGATCGTTGCGGAAACCACCATGCAGGCGGAACTGTACGG TCAGATCGTTGAAAACACCGCGGCGGGTTTCTCTTCTCGTTTCCACGCGGCGACCGGTGCGCCGGGTGTT CGTTGCCGTTTCCTGCTGGAACGTGACTTCGACAACGACCTGCCGAAACCGTACCTGCTGCGTGAACTGT CTTGGATGCTGGGTAACACCAAAGTTGAATCTGAAGAAGAAAAACTGCGTCTGCTGTCTGAAAAAATCC GTCCGGGTTCTCTGGTTCCGTGGGACGGTGGTGAACAGTTCGCGACCCTGCACCCGAAACGTCAGACCC TGTGCGTTATCCACGCGGACATGAACGCGGCGCAGAACCTGCAGCGTCGTTTCTTCGGTCGTTGCGGTGA AGCGTTCCGTCTGGTTTGCCAGCCGCACGGTGACGACGTTCTGCGTCTGGCGTCTACCCCGGGTGCGCGT CTGCTGGGTGCGCTGCAGCAGCTGGAAAACGGTCAGGGTGCGTTCGAACTGGTTCGTGACATGGGTTCT ACCTCTCAGATGAACCGTTTCGTTATGAAATCTCTGGGTAAAAAAAAAATCAAACCGCTGCAGGACAAC AACGGTGACGACGAACTGGAAGACGTTCTGTCTGTTCTGCCGGAAGAAGACGACACCGGTCGTATCACC GTTTTCCGTGACTCTTCTGGTATCTTCTTCCCGTGCAACGTTTGGATCCCGGCGAAACAGTTCTGGCCGGC GGTTCGTGCGATGATCTGGAAAGTTATGGCGTCTCACTCTCTGGGTTAA SEQ ATGACCAAACTGCGTCACCGTCAGAAAAAACTGACCCACGACTGGGCGGGTTCTAAAAAACGTGAAGTT ID CTGGGTTCTAACGGTAAACTGCAGAACCCGCTGCTGATGCCGGTTAAAAAAGGTCAGGTTACCGAATTC NO: CGTAAAGCGTTCTCTGCGTACGCGCGTGCGACCAAAGGTGAAATGACCGACGGTCGTAAAAACATGTTC 56 ACCCACTCTTTCGAACCGTTCAAAACCAAACCGTCTCTGCACCAGTGCGAACTGGCGGACAAAGCGTAC CAGTCTCTGCACTCTTACCTGCCGGGTTCTCTGGCGCACTTCCTGCTGTCTGCGCACGCGCTGGGTTTCCG TATCTTCTCTAAATCTGGTGAAGCGACCGCGTTCCAGGCGTCTTCTAAAATCGAAGCGTACGAATCTAAA CTGGCGTCTGAACTGGCGTGCGTTGACCTGTCTATCCAGAACCTGACCATCTCTACCCTGTTCAACGCGC TGACCACCTCTGTTCGTGGTAAAGGTGAAGAAACCTCTGCGGACCCGCTGATCGCGCGTTTCTACACCCT GCTGACCGGTAAACCGCTGTCTCGTGACACCCAGGGTCCGGAACGTGACCTGGCGGAAGTTATCTCTCG TAAAATCGCGTCTTCTTTCGGTACCTGGAAAGAAATGACCGCGAACCCGCTGCAGTCTCTGCAGTTCTTC GAAGAAGAACTGCACGCGCTGGACGCGAACGTTTCTCTGTCTCCGGCGTTCGACGTTCTGATCAAAATG AACGACCTGCAGGGTGACCTGAAAAACCGTACCATCGTTTTCGACCCGGACGCGCCGGTTTTCGAATAC AACGCGGAAGACCCGGCGGACATCATCATCAAACTGACCGCGCGTTACGCGAAAGAAGCGGTTATCAA AAACCAGAACGTTGGTAACTACGTTAAAAACGCGATCACCACCACCAACGCGAACGGTCTGGGTTGGCT GCTGAACAAAGGTCTGTCTCTGCTGCCGGTTTCTACCGACGACGAACTGCTGGAATTCATCGGTGTTGAA CGTTCTCACCCGTCTTGCCACGCGCTGATCGAACTGATCGCGCAGCTGGAAGCGCCGGAACTGTTCGAA AAAAACGTTTTCTCTGACACCCGTTCTGAAGTTCAGGGTATGATCGACTCTGCGGTTTCTAACCACATCG CGCGTCTGTCTTCTTCTCGTAACTCTCTGTCTATGGACTCTGAAGAACTGGAACGTCTGATCAAATCTTTC CAGATCCACACCCCGCACTGCTCTCTGTTCATCGGTGCGCAGTCTCTGTCTCAGCAGCTGGAATCTCTGC CGGAAGCGCTGCAGTCTGGTGTTAACTCTGCGGACATCCTGCTGGGTTCTACCCAGTACATGCTGACCAA CTCTCTGGTTGAAGAATCTATCGCGACCTACCAGCGTACCCTGAACCGTATCAACTACCTGTCTGGTGTT GCGGGTCAGATCAACGGTGCGATCAAACGTAAAGCGATCGACGGTGAAAAAATCCACCTGCCGGCGGC GTGGTCTGAACTGATCTCTCTGCCGTTCATCGGTCAGCCGGTTATCGACGTTGAATCTGACCTGGCGCAC CTGAAAAACCAGTACCAGACCCTGTCTAACGAATTCGACACCCTGATCTCTGCGCTGCAGAAAAACTTC GACCTGAACTTCAACAAAGCGCTGCTGAACCGTACCCAGCACTTCGAAGCGATGTGCCGTTCTACCAAA AAAAACGCGCTGTCTAAACCGGAAATCGTTTCTTACCGTGACCTGCTGGCGCGTCTGACCTCTTGCCTGT ACCGTGGTTCTCTGGTTCTGCGTCGTGCGGGTATCGAAGTTCTGAAAAAACACAAAATCTTCGAATCTAA CTCTGAACTGCGTGAACACGTTCACGAACGTAAACACTTCGTTTTCGTTTCTCCGCTGGACCGTAAAGCG AAAAAACTGCTGCGTCTGACCGACTCTCGTCCGGACCTGCTGCACGTTATCGACGAAATCCTGCAGCAC GACAACCTGGAAAACAAAGACCGTGAATCTCTGTGGCTGGTTCGTTCTGGTTACCTGCTGGCGGGTCTGC CGGACCAGCTGTCTTCTTCTTTCATCAACCTGCCGATCATCACCCAGAAAGGTGACCGTCGTCTGATCGA CCTGATCCAGTACGACCAGATCAACCGTGACGCGTTCGTTATGCTGGTTACCTCTGCGTTCAAATCTAAC CTGTCTGGTCTGCAGTACCGTGCGAACAAACAGTCTTTCGTTGTTACCCGTACCCTGTCTCCGTACCTGG GTTCTAAACTGGTTTACGTTCCGAAAGACAAAGACTGGCTGGTTCCGTCTCAGATGTTCGAAGGTCGTTT CGCGGACATCCTGCAGTCTGACTACATGGTTTGGAAAGACGCGGGTCGTCTGTGCGTTATCGACACCGC GAAACACCTGTCTAACATCAAAAAATCTGTTTTCTCTTCTGAAGAAGTTCTGGCGTTCCTGCGTGAACTG CCGCACCGTACCTTCATCCAGACCGAAGTTCGTGGTCTGGGTGTTAACGTTGACGGTATCGCGTTCAACA ACGGTGACATCCCGTCTCTGAAAACCTTCTCTAACTGCGTTCAGGTTAAAGTTTCTCGTACCAACACCTC TCTGGTTCAGACCCTGAACCGTTGGTTCGAAGGTGGTAAAGTTTCTCCGCCGTCTATCCAGTTCGAACGT GCGTACTACAAAAAAGACGACCAGATCCACGAAGACGCGGCGAAACGTAAAATCCGTTTCCAGATGCC GGCGACCGAACTGGTTCACGCGTCTGACGACGCGGGTTGGACCCCGTCTTACCTGCTGGGTATCGACCC GGGTGAATACGGTATGGGTCTGTCTCTGGTTTCTATCAACAACGGTGAAGTTCTGGACTCTGGTTTCATC CACATCAACTCTCTGATCAACTTCGCGTCTAAAAAATCTAACCACCAGACCAAAGTTGTTCCGCGTCAGC AGTACAAATCTCCGTACGCGAACTACCTGGAACAGTCTAAAGACTCTGCGGCGGGTGACATCGCGCACA TCCTGGACCGTCTGATCTACAAACTGAACGCGCTGCCGGTTTTCGAAGCGCTGTCTGGTAACTCTCAGTC TGCGGCGGACCAGGTTTGGACCAAAGTTCTGTCTTTCTACACCTGGGGTGACAACGACGCGCAGAACTC TATCCGTAAACAGCACTGGTTCGGTGCGTCTCACTGGGACATCAAAGGTATGCTGCGTCAGCCGCCGAC CGAAAAAAAACCGAAACCGTACATCGCGTTCCCGGGTTCTCAGGTTTCTTCTTACGGTAACTCTCAGCGT TGCTCTTGCTGCGGTCGTAACCCGATCGAACAGCTGCGTGAAATGGCGAAAGACACCTCTATCAAAGAA CTGAAAATCCGTAACTCTGAAATCCAGCTGTTCGACGGTACCATCAAACTGTTCAACCCGGACCCGTCTA CCGTTATCGAACGTCGTCGTCACAACCTGGGTCCGTCTCGTATCCCGGTTGCGGACCGTACCTTCAAAAA CATCTCTCCGTCTTCTCTGGAATTCAAAGAACTGATCACCATCGTTTCTCGTTCTATCCGTCACTCTCCGG AATTCATCGCGAAAAAACGTGGTATCGGTTCTGAATACTTCTGCGCGTACTCTGACTGCAACTCTTCTCT GAACTCTGAAGCGAACGCGGCGGCGAACGTTGCGCAGAAATTCCAGAAACAGCTGTTCTTCGAACTGTAA SEQ ATGAAACGTATCCTGAACTCTCTGAAAGTTGCGGCGCTGCGTCTGCTGTTCCGTGGTAAAGGTTCTGAAC ID TGGTTAAAACCGTTAAATACCCGCTGGTTTCTCCGGTTCAGGGTGCGGTTGAAGAACTGGCGGAAGCGA NO: TCCGTCACGACAACCTGCACCTGTTCGGTCAGAAAGAAATCGTTGACCTGATGGAAAAAGACGAAGGTA 57 CCCAGGTTTACTCTGTTGTTGACTTCTGGCTGGACACCCTGCGTCTGGGTATGTTCTTCTCTCCGTCTGCG AACGCGCTGAAAATCACCCTGGGTAAATTCAACTCTGACCAGGTTTCTCCGTTCCGTAAAGTTCTGGAAC AGTCTCCGTTCTTCCTGGCGGGTCGTCTGAAAGTTGAACCGGCGGAACGTATCCTGTCTGTTGAAATCCG TAAAATCGGTAAACGTGAAAACCGTGTTGAAAACTACGCGGCGGACGTTGAAACCTGCTTCATCGGTCA GCTGTCTTCTGACGAAAAACAGTCTATCCAGAAACTGGCGAACGACATCTGGGACTCTAAAGACCACGA AGAACAGCGTATGCTGAAAGCGGACTTCTTCGCGATCCCGCTGATCAAAGACCCGAAAGCGGTTACCGA AGAAGACCCGGAAAACGAAACCGCGGGTAAACAGAAACCGCTGGAACTGTGCGTTTGCCTGGTTCCGG AACTGTACACCCGTGGTTTCGGTTCTATCGCGGACTTCCTGGTTCAGCGTCTGACCCTGCTGCGTGACAA AATGTCTACCGACACCGCGGAAGACTGCCTGGAATACGTTGGTATCGAAGAAGAAAAAGGTAACGGTA TGAACTCTCTGCTGGGTACCTTCCTGAAAAACCTGCAGGGTGACGGTTTCGAACAGATCTTCCAGTTCAT GCTGGGTTCTTACGTTGGTTGGCAGGGTAAAGAAGACGTTCTGCGTGAACGTCTGGACCTGCTGGCGGA AAAAGTTAAACGTCTGCCGAAACCGAAATTCGCGGGTGAATGGTCTGGTCACCGTATGTTCCTGCACGG TCAGCTGAAATCTTGGTCTTCTAACTTCTTCCGTCTGTTCAACGAAACCCGTGAACTGCTGGAATCTATC AAATCTGACATCCAGCACGCGACCATGCTGATCTCTTACGTTGAAGAAAAAGGTGGTTACCACCCGCAG CTGCTGTCTCAGTACCGTAAACTGATGGAACAGCTGCCGGCGCTGCGTACCAAAGTTCTGGACCCGGAA ATCGAAATGACCCACATGTCTGAAGCGGTTCGTTCTTACATCATGATCCACAAATCTGTTGCGGGTTTCC TGCCGGACCTGCTGGAATCTCTGGACCGTGACAAAGACCGTGAATTCCTGCTGTCTATCTTCCCGCGTAT CCCGAAAATCGACAAAAAAACCAAAGAAATCGTTGCGTGGGAACTGCCGGGTGAACCGGAAGAAGGTT ACCTGTTCACCGCGAACAACCTGTTCCGTAACTTCCTGGAAAACCCGAAACACGTTCCGCGTTTCATGGC GGAACGTATCCCGGAAGACTGGACCCGTCTGCGTTCTGCGCCGGTTTGGTTCGACGGTATGGTTAAACA GTGGCAGAAAGTTGTTAACCAGCTGGTTGAATCTCCGGGTGCGCTGTACCAGTTCAACGAATCTTTCCTG CGTCAGCGTCTGCAGGCGATGCTGACCGTTTACAAACGTGACCTGCAGACCGAAAAATTCCTGAAACTG CTGGCGGACGTTTGCCGTCCGCTGGTTGACTTCTTCGGTCTGGGTGGTAACGACATCATCTTCAAATCTT GCCAGGACCCGCGTAAACAGTGGCAGACCGTTATCCCGCTGTCTGTTCCGGCGGACGTTTACACCGCGT GCGAAGGTCTGGCGATCCGTCTGCGTGAAACCCTGGGTTTCGAATGGAAAAACCTGAAAGGTCACGAAC GTGAAGACTTCCTGCGTCTGCACCAGCTGCTGGGTAACCTGCTGTTCTGGATCCGTGACGCGAAACTGGT TGTTAAACTGGAAGACTGGATGAACAACCCGTGCGTTCAGGAATACGTTGAAGCGCGTAAAGCGATCGA CCTGCCGCTGGAAATCTTCGGTTTCGAAGTTCCGATCTTCCTGAACGGTTACCTGTTCTCTGAACTGCGTC AGCTGGAACTGCTGCTGCGTCGTAAATCTGTTATGACCTCTTACTCTGTTAAAACCACCGGTTCTCCGAA CCGTCTGTTCCAGCTGGTTTACCTGCCGCTGAACCCGTCTGACCCGGAAAAAAAAAACTCTAACAACTTC CAGGAACGTCTGGACACCCCGACCGGTCTGTCTCGTCGTTTCCTGGACCTGACCCTGGACGCGTTCGCGG GTAAACTGCTGACCGACCCGGTTACCCAGGAACTGAAAACCATGGCGGGTTTCTACGACCACCTGTTCG GTTTCAAACTGCCGTGCAAACTGGCGGCGATGTCTAACCACCCGGGTTCTTCTTCTAAAATGGTTGTTCT GGCGAAACCGAAAAAAGGTGTTGCGTCTAACATCGGTTTCGAACCGATCCCGGACCCGGCGCACCCGGT TTTCCGTGTTCGTTCTTCTTGGCCGGAACTGAAATACCTGGAAGGTCTGCTGTACCTGCCGGAAGACACC CCGCTGACCATCGAACTGGCGGAAACCTCTGTTTCTTGCCAGTCTGTTTCTTCTGTTGCGTTCGACCTGAA AAACCTGACCACCATCCTGGGTCGTGTTGGTGAATTCCGTGTTACCGCGGACCAGCCGTTCAAACTGACC CCGATCATCCCGGAAAAAGAAGAATCTTTCATCGGTAAAACCTACCTGGGTCTGGACGCGGGTGAACGT TCTGGTGTTGGTTTCGCGATCGTTACCGTTGACGGTGACGGTTACGAAGTTCAGCGTCTGGGTGTTCACG AAGACACCCAGCTGATGGCGCTGCAGCAGGTTGCGTCTAAATCTCTGAAAGAACCGGTTTTCCAGCCGC TGCGTAAAGGTACCTTCCGTCAGCAGGAACGTATCCGTAAATCTCTGCGTGGTTGCTACTGGAACTTCTA CCACGCGCTGATGATCAAATACCGTGCGAAAGTTGTTCACGAAGAATCTGTTGGTTCTTCTGGTCTGGTT GGTCAGTGGCTGCGTGCGTTCCAGAAAGACCTGAAAAAAGCGGACGTTCTGCCGAAAAAAGGTGGTAA AAACGGTGTTGACAAAAAAAAACGTGAATCTTCTGCGCAGGACACCCTGTGGGGTGGTGCGTTCTCTAA AAAAGAAGAACAGCAGATCGCGTTCGAAGTTCAGGCGGCGGGTTCTTCTCAGTTCTGCCTGAAATGCGG TTGGTGGTTCCAGCTGGGTATGCGTGAAGTTAACCGTGTTCAGGAATCTGGTGTTGTTCTGGACTGGAAC CGTTCTATCGTTACCTTCCTGATCGAATCTTCTGGTGAAAAAGTTTACGGTTTCTCTCCGCAGCAGCTGGA AAAAGGTTTCCGTCCGGACATCGAAACCTTCAAAAAAATGGTTCGTGACTTCATGCGTCCGCCGATGTTC GACCGTAAAGGTCGTCCGGCGGCGGCGTACGAACGTTTCGTTCTGGGTCGTCGTCACCGTCGTTACCGTT TCGACAAAGTTTTCGAAGAACGTTTCGGTCGTTCTGCGCTGTTCATCTGCCCGCGTGTTGGTTGCGGTAA CTTCGACCACTCTTCTGAACAGTCTGCGGTTGTTCTGGCGCTGATCGGTTACATCGCGGACAAAGAAGGT ATGTCTGGTAAAAAACTGGTTTACGTTCGTCTGGCGGAACTGATGGCGGAATGGAAACTGAAAAAACTG GAACGTTCTCGTGTTGAAGAACAGTCTTCTGCGCAGTAA SEQ ATGGCGGAATCTAAACAGATGCAGTGCCGTAAATGCGGTGCGTCTATGAAATACGAAGTTATCGGTCTG ID GGTAAAAAATCTTGCCGTTACATGTGCCCGGACTGCGGTAACCACACCTCTGCGCGTAAAATCCAGAAC NO: AAAAAAAAACGTGACAAAAAATACGGTTCTGCGTCTAAAGCGCAGTCTCAGCGTATCGCGGTTGCGGGT 58 GCGCTGTACCCGGACAAAAAAGTTCAGACCATCAAAACCTACAAATACCCGGCGGACCTGAACGGTGA AGTTCACGACTCTGGTGTTGCGGAAAAAATCGCGCAGGCGATCCAGGAAGACGAAATCGGTCTGCTGGG TCCGTCTTCTGAATACGCGTGCTGGATCGCGTCTCAGAAACAGTCTGAACCGTACTCTGTTGTTGACTTC TGGTTCGACGCGGTTTGCGCGGGTGGTGTTTTCGCGTACTCTGGTGCGCGTCTGCTGTCTACCGTTCTGCA GCTGTCTGGTGAAGAATCTGTTCTGCGTGCGGCGCTGGCGTCTTCTCCGTTCGTTGACGACATCAACCTG GCGCAGGCGGAAAAATTCCTGGCGGTTTCTCGTCGTACCGGTCAGGACAAACTGGGTAAACGTATCGGT GAATGCTTCGCGGAAGGTCGTCTGGAAGCGCTGGGTATCAAAGACCGTATGCGTGAATTCGTTCAGGCG ATCGACGTTGCGCAGACCGCGGGTCAGCGTTTCGCGGCGAAACTGAAAATCTTCGGTATCTCTCAGATG CCGGAAGCGAAACAGTGGAACAACGACTCTGGTCTGACCGTTTGCATCCTGCCGGACTACTACGTTCCG GAAGAAAACCGTGCGGACCAGCTGGTTGTTCTGCTGCGTCGTCTGCGTGAAATCGCGTACTGCATGGGT ATCGAAGACGAAGCGGGTTTCGAACACCTGGGTATCGACCCGGGTGCGCTGTCTAACTTCTCTAACGGT AACCCGAAACGTGGTTTCCTGGGTCGTCTGCTGAACAACGACATCATCGCGCTGGCGAACAACATGTCT GCGATGACCCCGTACTGGGAAGGTCGTAAAGGTGAACTGATCGAACGTCTGGCGTGGCTGAAACACCGT GCGGAAGGTCTGTACCTGAAAGAACCGCACTTCGGTAACTCTTGGGCGGACCACCGTTCTCGTATCTTCT CTCGTATCGCGGGTTGGCTGTCTGGTTGCGCGGGTAAACTGAAAATCGCGAAAGACCAGATCTCTGGTG TTCGTACCGACCTGTTCCTGCTGAAACGTCTGCTGGACGCGGTTCCGCAGTCTGCGCCGTCTCCGGACTT CATCGCGTCTATCTCTGCGCTGGACCGTTTCCTGGAAGCGGCGGAATCTTCTCAGGACCCGGCGGAACA GGTTCGTGCGCTGTACGCGTTCCACCTGAACGCGCCGGCGGTTCGTTCTATCGCGAACAAAGCGGTTCAG CGTTCTGACTCTCAGGAATGGCTGATCAAAGAACTGGACGCGGTTGACCACCTGGAATTCAACAAAGCG TTCCCGTTCTTCTCTGACACCGGTAAAAAAAAAAAAAAAGGTGCGAACTCTAACGGTGCGCCGTCTGAA GAAGAATACACCGAAACCGAATCTATCCAGCAGCCGGAAGACGCGGAACAGGAAGTTAACGGTCAGGA AGGTAACGGTGCGTCTAAAAACCAGAAAAAATTCCAGCGTATCCCGCGTTTCTTCGGTGAAGGTTCTCG TTCTGAATACCGTATCCTGACCGAAGCGCCGCAGTACTTCGACATGTTCTGCAACAACATGCGTGCGATC TTCATGCAGCTGGAATCTCAGCCGCGTAAAGCGCCGCGTGACTTCAAATGCTTCCTGCAGAACCGTCTGC AGAAACTGTACAAACAGACCTTCCTGAACGCGCGTTCTAACAAATGCCGTGCGCTGCTGGAATCTGTTCT GATCTCTTGGGGTGAATTCTACACCTACGGTGCGAACGAAAAAAAATTCCGTCTGCGTCACGAAGCGTC TGAACGTTCTTCTGACCCGGACTACGTTGTTCAGCAGGCGCTGGAAATCGCGCGTCGTCTGTTCCTGTTC GGTTTCGAATGGCGTGACTGCTCTGCGGGTGAACGTGTTGACCTGGTTGAAATCCACAAAAAAGCGATC TCTTTCCTGCTGGCGATCACCCAGGCGGAAGTTTCTGTTGGTTCTTACAACTGGCTGGGTAACTCTACCG TTTCTCGTTACCTGTCTGTTGCGGGTACCGACACCCTGTACGGTACCCAGCTGGAAGAATTCCTGAACGC GACCGTTCTGTCTCAGATGCGTGGTCTGGCGATCCGTCTGTCTTCTCAGGAACTGAAAGACGGTTTCGAC GTTCAGCTGGAATCTTCTTGCCAGGACAACCTGCAGCACCTGCTGGTTTACCGTGCGTCTCGTGACCTGG CGGCGTGCAAACGTGCGACCTGCCCGGCGGAACTGGACCCGAAAATCCTGGTTCTGCCGGTTGGTGCGT TCATCGCGTCTGTTATGAAAATGATCGAACGTGGTGACGAACCGCTGGCGGGTGCGTACCTGCGTCACC GTCCGCACTCTTTCGGTTGGCAGATCCGTGTTCGTGGTGTTGCGGAAGTTGGTATGGACCAGGGTACCGC GCTGGCGTTCCAGAAACCGACCGAATCTGAACCGTTCAAAATCAAACCGTTCTCTGCGCAGTACGGTCC GGTTCTGTGGCTGAACTCTTCTTCTTACTCTCAGTCTCAGTACCTGGACGGTTTCCTGTCTCAGCCGAAAA ACTGGTCTATGCGTGTTCTGCCGCAGGCGGGTTCTGTTCGTGTTGAACAGCGTGTTGCGCTGATCTGGAA CCTGCAGGCGGGTAAAATGCGTCTGGAACGTTCTGGTGCGCGTGCGTTCTTCATGCCGGTTCCGTTCTCT TTCCGTCCGTCTGGTTCTGGTGACGAAGCGGTTCTGGCGCCGAACCGTTACCTGGGTCTGTTCCCGCACT CTGGTGGTATCGAATACGCGGTTGTTGACGTTCTGGACTCTGCGGGTTTCAAAATCCTGGAACGTGGTAC CATCGCGGTTAACGGTTTCTCTCAGAAACGTGGTGAACGTCAGGAAGAAGCGCACCGTGAAAAACAGCG TCGTGGTATCTCTGACATCGGTCGTAAAAAACCGGTTCAGGCGGAAGTTGACGCGGCGAACGAACTGCA CCGTAAATACACCGACGTTGCGACCCGTCTGGGTTGCCGTATCGTTGTTCAGTGGGCGCCGCAGCCGAA ACCGGGTACCGCGCCGACCGCGCAGACCGTTTACGCGCGTGCGGTTCGTACCGAAGCGCCGCGTTCTGG TAACCAGGAAGACCACGCGCGTATGAAATCTTCTTGGGGTTACACCTGGGGTACCTACTGGGAAAAACG TAAACCGGAAGACATCCTGGGTATCTCTACCCAGGTTTACTGGACCGGTGGTATCGGTGAATCTTGCCCG GCGGTTGCGGTTGCGCTGCTGGGTCACATCCGTGCGACCTCTACCCAGACCGAATGGGAAAAAGAAGAA GTTGTTTTCGGTCGTCTGAAAAAATTCTTCCCGTCTTAA SEQ ATGGAAAAACGTATCAACAAAATCCGTAAAAAACTGTCTGCGGACAACGCGACCAAACCGGTTTCTCGT ID TCTGGTCCGATGAAAACCCTGCTGGTTCGTGTTATGACCGACGACCTGAAAAAACGTCTGGAAAAACGT NO: CGTAAAAAACCGGAAGTTATGCCGCAGGTTATCTCTAACAACGCGGCGAACAACCTGCGTATGCTGCTG 59 GACGACTACACCAAAATGAAAGAAGCGATCCTGCAGGTTTACTGGCAGGAATTCAAAGACGACCACGTT GGTCTGATGTGCAAATTCGCGCAGCCGGCGTCTAAAAAAATCGACCAGAACAAACTGAAACCGGAAAT GGACGAAAAAGGTAACCTGACCACCGCGGGTTTCGCGTGCTCTCAGTGCGGTCAGCCGCTGTTCGTTTA CAAACTGGAACAGGTTTCTGAAAAAGGTAAAGCGTACACCAACTACTTCGGTCGTTGCAACGTTGCGGA ACACGAAAAACTGATCCTGCTGGCGCAGCTGAAACCGGAAAAAGACTCTGACGAAGCGGTTACCTACTC TCTGGGTAAATTCGGTCAGCGTGCGCTGGACTTCTACTCTATCCACGTTACCAAAGAATCTACCCACCCG GTTAAACCGCTGGCGCAGATCGCGGGTAACCGTTACGCGTCTGGTCCGGTTGGTAAAGCGCTGTCTGAC GCGTGCATGGGTACCATCGCGTCTTTCCTGTCTAAATACCAGGACATCATCATCGAACACCAGAAAGTTG TTAAAGGTAACCAGAAACGTCTGGAATCTCTGCGTGAACTGGCGGGTAAAGAAAACCTGGAATACCCGT CTGTTACCCTGCCGCCGCAGCCGCACACCAAAGAAGGTGTTGACGCGTACAACGAAGTTATCGCGCGTG TTCGTATGTGGGTTAACCTGAACCTGTGGCAGAAACTGAAACTGTCTCGTGACGACGCGAAACCGCTGC TGCGTCTGAAAGGTTTCCCGTCTTTCCCGGTTGTTGAACGTCGTGAAAACGAAGTTGACTGGTGGAACAC CATCAACGAAGTTAAAAAACTGATCGACGCGAAACGTGACATGGGTCGTGTTTTCTGGTCTGGTGTTAC CGCGGAAAAACGTAACACCATCCTGGAAGGTTACAACTACCTGCCGAACGAAAACGACCACAAAAAAC GTGAAGGTTCTCTGGAAAACCCGAAAAAACCGGCGAAACGTCAGTTCGGTGACCTGCTGCTGTACCTGG AAAAAAAATACGCGGGTGACTGGGGTAAAGTTTTCGACGAAGCGTGGGAACGTATCGACAAAAAAATC GCGGGTCTGACCTCTCACATCGAACGTGAAGAAGCGCGTAACGCGGAAGACGCGCAGTCTAAAGCGGTT CTGACCGACTGGCTGCGTGCGAAAGCGTCTTTCGTTCTGGAACGTCTGAAAGAAATGGACGAAAAAGAA TTCTACGCGTGCGAAATCCAGCTGCAGAAATGGTACGGTGACCTGCGTGGTAACCCGTTCGCGGTTGAA GCGGAAAACCGTGTTGTTGACATCTCTGGTTTCTCTATCGGTTCTGACGGTCACTCTATCCAGTACCGTA ACCTGCTGGCGTGGAAATACCTGGAAAACGGTAAACGTGAATTCTACCTGCTGATGAACTACGGTAAAA AAGGTCGTATCCGTTTCACCGACGGTACCGACATCAAAAAATCTGGTAAATGGCAGGGTCTGCTGTACG GTGGTGGTAAAGCGAAAGTTATCGACCTGACCTTCGACCCGGACGACGAACAGCTGATCATCCTGCCGC TGGCGTTCGGTACCCGTCAGGGTCGTGAATTCATCTGGAACGACCTGCTGTCTCTGGAAACCGGTCTGAT CAAACTGGCGAACGGTCGTGTTATCGAAAAAACCATCTACAACAAAAAAATCGGTCGTGACGAACCGG CGCTGTTCGTTGCGCTGACCTTCGAACGTCGTGAAGTTGTTGACCCGTCTAACATCAAACCGGTTAACCT GATCGGTGTTGACCGTGGTGAAAACATCCCGGCGGTTATCGCGCTGACCGACCCGGAAGGTTGCCCGCT GCCGGAATTCAAAGACTCTTCTGGTGGTCCGACCGACATCCTGCGTATCGGTGAAGGTTACAAAGAAAA ACAGCGTGCGATCCAGGCGGCGAAAGAAGTTGAACAGCGTCGTGCGGGTGGTTACTCTCGTAAATTCGC GTCTAAATCTCGTAACCTGGCGGACGACATGGTTCGTAACTCTGCGCGTGACCTGTTCTACCACGCGGTT ACCCACGACGCGGTTCTGGTTTTCGAAAACCTGTCTCGTGGTTTCGGTCGTCAGGGTAAACGTACCTTCA TGACCGAACGTCAGTACACCAAAATGGAAGACTGGCTGACCGCGAAACTGGCGTACGAAGGTCTGACCT CTAAAACCTACCTGTCTAAAACCCTGGCGCAGTACACCTCTAAAACCTGCTCTAACTGCGGTTTCACCAT CACCACCGCGGACTACGACGGTATGCTGGTTCGTCTGAAAAAAACCTCTGACGGTTGGGCGACCACCCT GAACAACAAAGAACTGAAAGCGGAAGGTCAGATCACCTACTACAACCGTTACAAACGTCAGACCGTTG AAAAAGAACTGTCTGCGGAACTGGACCGTCTGTCTGAAGAATCTGGTAACAACGACATCTCTAAATGGA CCAAAGGTCGTCGTGACGAAGCGCTGTTCCTGCTGAAAAAACGTTTCTCTCACCGTCCGGTTCAGGAAC AGTTCGTTTGCCTGGACTGCGGTCACGAAGTTCACGCGGACGAACAGGCGGCGCTGAACATCGCGCGTT CTTGGCTGTTCCTGAACTCTAACTCTACCGAATTCAAATCTTACAAATCTGGTAAACAGCCGTTCGTTGG TGCGTGGCAGGCGTTCTACAAACGTCGTCTGAAAGAAGTTTGGAAACCGAACGCG SEQ ATGAAACGTATCAACAAAATCCGTCGTCGTCTGGTTAAAGACTCTAACACCAAAAAAGCGGGTAAAACC ID GGTCCGATGAAAACCCTGCTGGTTCGTGTTATGACCCCGGACCTGCGTGAACGTCTGGAAAACCTGCGT NO: AAAAAACCGGAAAACATCCCGCAGCCGATCTCTAACACCTCTCGTGCGAACCTGAACAAACTGCTGACC 60 GACTACACCGAAATGAAAAAAGCGATCCTGCACGTTTACTGGGAAGAATTCCAGAAAGACCCGGTTGGT CTGATGTCTCGTGTTGCGCAGCCGGCGCCGAAAAACATCGACCAGCGTAAACTGATCCCGGTTAAAGAC GGTAACGAACGTCTGACCTCTTCTGGTTTCGCGTGCTCTCAGTGCTGCCAGCCGCTGTACGTTTACAAAC TGGAACAGGTTAACGACAAAGGTAAACCGCACACCAACTACTTCGGTCGTTGCAACGTTTCTGAACACG AACGTCTGATCCTGCTGTCTCCGCACAAACCGGAAGCGAACGACGAACTGGTTACCTACTCTCTGGGTA AATTCGGTCAGCGTGCGCTGGACTTCTACTCTATCCACGTTACCCGTGAATCTAACCACCCGGTTAAACC GCTGGAACAGATCGGTGGTAACTCTTGCGCGTCTGGTCCGGTTGGTAAAGCGCTGTCTGACGCGTGCAT GGGTGCGGTTGCGTCTTTCCTGACCAAATACCAGGACATCATCCTGGAACACCAGAAAGTTATCAAAAA AAACGAAAAACGTCTGGCGAACCTGAAAGACATCGCGTCTGCGAACGGTCTGGCGTTCCCGAAAATCAC CCTGCCGCCGCAGCCGCACACCAAAGAAGGTATCGAAGCGTACAACAACGTTGTTGCGCAGATCGTTAT CTGGGTTAACCTGAACCTGTGGCAGAAACTGAAAATCGGTCGTGACGAAGCGAAACCGCTGCAGCGTCT GAAAGGTTTCCCGTCTTTCCCGCTGGTTGAACGTCAGGCGAACGAAGTTGACTGGTGGGACATGGTTTGC AACGTTAAAAAACTGATCAACGAAAAAAAAGAAGACGGTAAAGTTTTCTGGCAGAACCTGGCGGGTTA CAAACGTCAGGAAGCGCTGCTGCCGTACCTGTCTTCTGAAGAAGACCGTAAAAAAGGTAAAAAATTCGC GCGTTACCAGTTCGGTGACCTGCTGCTGCACCTGGAAAAAAAACACGGTGAAGACTGGGGTAAAGTTTA CGACGAAGCGTGGGAACGTATCGACAAAAAAGTTGAAGGTCTGTCTAAACACATCAAACTGGAAGAAG AACGTCGTTCTGAAGACGCGCAGTCTAAAGCGGCGCTGACCGACTGGCTGCGTGCGAAAGCGTCTTTCG TTATCGAAGGTCTGAAAGAAGCGGACAAAGACGAATTCTGCCGTTGCGAACTGAAACTGCAGAAATGGT ACGGTGACCTGCGTGGTAAACCGTTCGCGATCGAAGCGGAAAACTCTATCCTGGACATCTCTGGTTTCTC TAAACAGTACAACTGCGCGTTCATCTGGCAGAAAGACGGTGTTAAAAAACTGAACCTGTACCTGATCAT CAACTACTTCAAAGGTGGTAAACTGCGTTTCAAAAAAATCAAACCGGAAGCGTTCGAAGCGAACCGTTT CTACACCGTTATCAACAAAAAATCTGGTGAAATCGTTCCGATGGAAGTTAACTTCAACTTCGACGACCC GAACCTGATCATCCTGCCGCTGGCGTTCGGTAAACGTCAGGGTCGTGAATTCATCTGGAACGACCTGCTG TCTCTGGAAACCGGTTCTCTGAAACTGGCGAACGGTCGTGTTATCGAAAAAACCCTGTACAACCGTCGT ACCCGTCAGGACGAACCGGCGCTGTTCGTTGCGCTGACCTTCGAACGTCGTGAAGTTCTGGACTCTTCTA ACATCAAACCGATGAACCTGATCGGTATCGACCGTGGTGAAAACATCCCGGCGGTTATCGCGCTGACCG ACCCGGAAGGTTGCCCGCTGTCTCGTTTCAAAGACTCTCTGGGTAACCCGACCCACATCCTGCGTATCGG TGAATCTTACAAAGAAAAACAGCGTACCATCCAGGCGGCGAAAGAAGTTGAACAGCGTCGTGCGGGTG GTTACTCTCGTAAATACGCGTCTAAAGCGAAAAACCTGGCGGACGACATGGTTCGTAACACCGCGCGTG ACCTGCTGTACTACGCGGTTACCCAGGACGCGATGCTGATCTTCGAAAACCTGTCTCGTGGTTTCGGTCG TCAGGGTAAACGTACCTTCATGGCGGAACGTCAGTACACCCGTATGGAAGACTGGCTGACCGCGAAACT GGCGTACGAAGGTCTGCCGTCTAAAACCTACCTGTCTAAAACCCTGGCGCAGTACACCTCTAAAACCTG CTCTAACTGCGGTTTCACCATCACCTCTGCGGACTACGACCGTGTTCTGGAAAAACTGAAAAAAACCGC GACCGGTTGGATGACCACCATCAACGGTAAAGAACTGAAAGTTGAAGGTCAGATCACCTACTACAACCG TTACAAACGTCAGAACGTTGTTAAAGACCTGTCTGTTGAACTGGACCGTCTGTCTGAAGAATCTGTTAAC AACGACATCTCTTCTTGGACCAAAGGTCGTTCTGGTGAAGCGCTGTCTCTGCTGAAAAAACGTTTCTCTC ACCGTCCGGTTCAGGAAAAATTCGTTTGCCTGAACTGCGGTTTCGAAACCCACGCGGACGAACAGGCGG CGCTGAACATCGCGCGTTCTTGGCTGTTCCTGCGTTCTCAGGAATACAAAAAATACCAGACCAACAAAA CCACCGGTAACACCGACAAACGTGCGTTCGTTGAAACCTGGCAGTCTTTCTACCGTAAAAAACTGAAAG AAGTTTGGAAACCG SEQ AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTGAGATCCCTCTCCCTGACAGGATGATTACATA ID AATAATAGTGACAAAAATAAATTATTTATTTATCCAGAAAATGAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTca NO: aaCAGGTtgccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattctgtaacaaagcgggaccaaagc 61 catgacaaaaacgcgtaacaaaagtgtctataatcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgccatagcattttt atccataagattagcggatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtg agataggcggagatacgaactttaagAAGGAGatataccATGGGTAAAATGTATTACCTTGGTTTAGACATTGGCACGAATTCCGTGGGCTACGCGGTG ACCGACCCCTCATACCACCTGCTGAAGTTTAAGGGGGAACCAATGTGGGGTGCGCACGTATTTGCCGCCGGTAATCAGAGCGCGGAACGACGCTCGTTC CGCACATCGCGTCGTCGTTTGGACCGACGCCAACAGCGCGTTAAACTGGTACAGGAGATTTTTGCCCCGGTGATTAGTCCGATCGACCCACGCTTCTTC ATTCGTCTGCATGAATCCGCCCTGTGGCGCGATGACGTCGCGGAGACGGATAAACATATCTTTTTCAATGATCCTACCTATACCGATAAGGAATATTAT AGCGATTACCCGACTATCCATCACCTGATCGTTGATCTGATGGAAAGCTCTGAGAAACACGATCCGCGGCTGGTGTACCTTGCAGTGGCGTGGTTAGTG GCACACCGTGGTCATTTTCTGAACGAGGTGGACAAGGATAATATTGGAGATGTGTTGTCGTTCGACGCATTTTATCCGGAGTTTCTCGCGTTCCTGTCG GACAACGGTGTATCACCGTGGGTGTGCGAAAGCAAAGCGCTGCAGGCGACCTTGCTGAGCCGTAACTCAGTGAACGACAAATATAAAGCCCTTAAGTCT CTGATCTTCGGATCCCAGAAACCTGAAGATAACTTCGATGCCAATATTTCGGAAGATGGACTCATTCAACTGCTGGCCGGCAAAAAGGTAAAAGTTAAC AAACTGTTCCCTCAGGAATCGAACGATGCATCCTTCACATTGAATGATAAAGAAGACGCGATAGAAGAAATCCTGGGTACGCTTACACCAGATGAATGT GAATGGATTGCGCATATACGCCGCCTTTTTGACTGGGCTATCATGAAACATGCTCTGAAAGATGGCAGGACTATTAGCGAGTCAAAAGTCAAACTGTAT GAGCAGCACCATCACGATCTGACCCAACTTAAATACTTCGTGAAAACCTACCTTGCAAAAGAATACGACGATATTTTCCGCAACGTGGATAGCGAAACA ACGAAAAACTATGTAGCGTATTCCTATCATGTGAAAGAGGTGAAAGGCACTCTGCCTAAAAATAAGGCAACGCAAGAAGAGTTTTGTAAGTATGTCCTG GGCAAGGTTAAAAACATTGAATGCTCTGAAGCAGACAAGGTTGACTTTGATGAGATGATTCAGCGTCTTACCGACAACTCTTTTATGCCTAAGCAGGTT TCGGGCGAAAACCGCGTTATTCCTTATCAGTTATATTATTATGAACTGAAGACAATTCTGAATAAAGCAGCCTCGTACCTGCCTTTCCTGACGCAGTGT GGAAAAGATGCAATTTCGAACCAGGACAAACTACTGTCGATCATGACGTTCCGTATTCCTTACTTCGTCGGACCCTTGCGAAAAGATAATTCGGAACAT GCATGGCTCGAACGAAAGGCCGGTAAGATTTATCCGTGGAACTTTAACGACAAAGTGGACTTGGATAAATCAGAAGAAGCGTTCATTCGCCGAATGACC AATACCTGTACCTATTATCCCGGCGAAGATGTTTTACCGTTGGATTCGCTGATCTATGAGAAATTTATGATTTTAAATGAAATCAATAATATTCGTATT GACGGCTACCCGATTAGTGTTGACGTTAAACAGCAGGTTTTTGGCTTGTTCGAAAAAAAACGACGCGTAACCGTGAAAGATATTCAGAACCTGCTGCTG TCTCTCGGAGCTCTGGACAAACACGGGAAGCTGACAGGCATCGATACCACTATCCACTCAAACTATAATACGTATCACCATTTTAAATCTCTCATGGAA CGCGGCGTCCTGACCCGGGATGACGTGGAACGCATCGTTGAAAGGATGACCTACAGCGACGATACTAAGCGTGTGCGTCTGTGGCTGAATAACAACTAT GGTACTTTAACCGCCGACGATGTGAAACACATTTCGCGTCTGCGCAAACACGATTTTGGCCGTTTATCCAAAATGTTCTTAACAGGTCTGAAGGGTGTC CATAAGGAGACCGGTGAACGTGCCTCCATACTGGATTTCATGTGGAACACGAACGATAACCTGATGCAGCTCCTTTCCGAATGCTACACGTTCAGTGAT GAAATCACAAAGCTGCAAGAGGCGTATTATGCAAAAGCCCAGTTGTCTTTAAACGATTTTTTAGACTCGATGTACATCTCTAACGCGGTGAAACGTCCG ATTTACAGAACTCTGGCAGTGGTGAACGATATTCGAAAAGCATGTGGGACGGCCCCTAAACGCATTTTCATCGAAATGGCTCGTGATGGTGAATCAAAA AAAAAGAGAAGTGTTACACGTCGCGAGCAGATCAAAAACCTGTACCGCTCGATTCGTAAAGATTTCCAGCAGGAAGTTGATTTTCTGGAAAAGATCCTG GAAAATAAATCTGATGGTCAACTTCAGTCAGATGCTTTGTATCTTTACTTTGCACAATTAGGGCGCGATATGTACACGGGCGATCCAATAAAGCTGGAG CACATCAAAGATCAGAGTTTCTATAACATAGACCATATTTACCCGCAGTCTATGGTGAAAGACGATTCCCTAGATAACAAAGTGCTGGTGCAAAGCGAA ATTAACGGCGAGAAAAGCTCGCGATACCCTTTGGACGCCGCGATCCGCAATAAAATGAAGCCCCTTTGGGACGCTTACTATAATCATGGCCTGATCTCC TTAAAGAAATACCAGCGTCTAACGCGCTCGACCCCGTTTACCGATGATGAAAAATGGGACTTTATTAATCGCCAGTTAGTGGAAACCCGTCAATCTACC AAAGCGCTGGCCATTTTGTTGAAGCGTAAGTTTCCAGACACCGAAATTGTGTATTCGAAGGCGGGGTTATCGTCCGACTTCAGACATGAATTCGGCCTT GTAAAAAGTCGCAATATTAATGATTTGCACCACGCTAAAGACGCATTCTTGGCTATCGTTACCGGCAATGTGTACCATGAAAGATTCAATCGCAGATGG TTTATGGTGAACCAGCCGTACTCAGTTAAAACTAAAACTCTTTTTACCCACAGCATAAAGAATGGCAACTTCGTTGCCTGGAACGGCGAAGAAGATCTC GGTCGTATTGTAAAAATGCTGAAGCAAAACAAAAATACCATTCACTTCACGCGCTTCTCCTTCGATCGCAAAGAAGGATTATTTGATATCCAACCTCTG AAAGCCAGCACCGGCTTAGTCCCACGAAAAGCCGGTCTGGATGTCGTTAAATACGGCGGATATGACAAATCTACCGCGGCCTATTACCTGCTGGTGAGG TTCACGCTCGAGGACAAGAAAACCCAGCACAAGCTGATGATGATTCCTGTAGAAGGCCTGTACAAGGCTCGCATTGATCATGACAAGGAATTTCTTACC GATTATGCGCAAACGACTATAAGCGAAATCCTACAGAAAGATAAACAGAAAGTGATCAATATTATGTTTCCAATGGGTACGAGGCATATAAAACTCAAT TCAATGATTAGTATCGATGGCTTCTATCTTAGTATCGGCGGAAAGTCCTCTAAAGGTAAGTCAGTTCTATGTCACGCAATGGTTCCACTGATCGTCCCT CACAAAATCGAATGTTACATTAAAGCAATGGAAAGCTTCGCCCGGAAGTTTAAAGAAAACAACAAGCTGCGCATCGTAGAAAAATTCGATAAAATCACC GTTGAAGACAACCTGAATCTCTACGAGCTCTTTCTCCAAAAACTGCAGCATAATCCCTATAATAAGTTTTTTTCGACACAGTTTGACGTACTGACGAAC GGCCGTTCTACTTTCACAAAACTGTCGCCGGAGGAACAGGTACAGACGCTCTTGAACATTTTAAGTATCTTTAAAACATGCCGCAGTTCGGGTTGCGAC CTGAAATCCATCAACGGCAGTGCCCAGGCAGCGCGCATCATGATTAGCGCTGACTTAACTGGACTGTCGAAAAAATATTCAGATATTAGGTTGGTTGAA CAGTCAGCTTCTGGTTTGTTCGTATCCAAAAGTCAGAACTTACTGGAGTATCTCTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAA ATTTATTATATCGCGTTGATTATTGATGCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCATTCATAATAAGTA TGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAAGGGATG TTATTTCC SEQ AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTGAGATCCCTCTCCCTGACAGGATGATTACATA ID AATAATAGTGACAAAAATAAATTATTTATTTATCCAGAAAATGAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTca NO: aaCAGGTtgccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattctgtaacaaagcgggaccaaagc 62 catgacaaaaacgcgtaacaaaagtgtctataatcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgccatagcattttt atccataagattagcggatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtg agataggcggagatacgaactttaagAAGGAGatataccATGTCATCGCTCACGAAATTCACTAACAAATACTCTAAACAGCTCACCATTAAGAATGAA CTCATCCCAGTTGGCAAAACACTGGAGAACATCAAAGAGAATGGTCTGATAGATGGCGACGAACAGCTGAATGAGAATTATCAGAAGGCGAAAATTATT GTGGATGATTTTCTGCGGGACTTCATTAATAAAGCACTGAATAATACGCAGATCGGGAACTGGCGCGAACTGGCGGATGCCCTTAATAAAGAGGATGAA GATAACATCGAGAAATTGCAGGATAAAATTCGGGGAATCATTGTATCCAAATTTGAAACGTTTGATCTGTTTAGCAGCTATTCTATTAAGAAAGATGAA AAGATTATTGACGACGACAATGATGTTGAAGAAGAGGAACTGGATCTGGGCAAGAAGACCAGCTCATTTAAATACATATTTAAAAAAAACCTGTTTAAG TTAGTGTTGCCATCCTACCTGAAAACCACAAACCAGGACAAGCTGAAGATTATTAGCTCGTTTGATAATTTTTCAACGTACTTCCGCGGGTTCTTTGAA AACCGGAAAAACATTTTTACCAAGAAACCGATCTCCACAAGTATTGCGTATCGCATTGTTCATGATAACTTCCCGAAATTCCTTGATAACATTCGTTGT TTTAATGTGTGGCAGACGGAATGCCCGCAACTAATCGTGAAAGCAGATAACTATCTGAAAAGCAAAAATGTTATAGCGAAAGATAAAAGTTTGGCAAAC TATTTTACCGTGGGCGCGTATGACTATTTCCTGTCTCAGAATGGTATAGATTTTTACAACAATATTATAGGTGGACTGCCAGCGTTCGCCGGCCATGAG AAAATCCAAGGTCTCAATGAATTCATCAATCAAGAGTGCCAAAAAGACAGCGAGCTGAAAAGTAAGCTGAAAAACCGTCACGCGTTCAAAATGGCGGTA CTGTTCAAACAGATACTCAGCGATCGTGAAAAAAGTTTTGTAATTGATGAGTTCGAGTCGGATGCTCAAGTTATTGACGCCGTTAAAAACTTTTACGCC GAACAGTGCAAAGATAACAATGTTATTTTTAACTTATTAAATCTTATCAAGAATATCGCTTTCTTAAGTGATGACGAACTGGACGGCATATTCATTGAA GGGAAATACCTGTCGAGCGTTAGTCAAAAACTCTATAGCGATTGGTCAAAATTACGTAACGACATTGAGGATTCGGCTAACTCTAAACAAGGCAATAAA GAGCTGGCCAAGAAGATCAAAACCAACAAAGGGGATGTAGAAAAAGCGATCTCGAAATATGAGTTCTCGCTGTCGGAACTGAACTCGATTGTACATGAT AACACCAAGTTTTCTGACCTCCTTAGTTGTACACTGCATAAGGTGGCTTCTGAGAAACTGGTGAAGGTCAATGAAGGCGACTGGCCGAAACATCTCAAG AATAATGAAGAGAAACAAAAAATCAAAGAGCCGCTTGATGCTCTGCTGGAGATCTATAATACACTTCTGATTTTTAACTGCAAAAGCTTCAATAAAAAC GGCAACTTCTATGTCGACTATGATCGTTGCATCAATGAACTGAGTTCGGTCGTGTATCTGTATAATAAAACACGTAACTATTGCACTAAAAAACCCTAT AACACGGACAAGTTCAAACTCAATTTTAACAGTCCGCAGCTCGGTGAAGGCTTTTCCAAGTCGAAAGAAAATGACTGTCTGACTCTTTTGTTTAAAAAA GACGACAACTATTATGTAGGCATTATCCGCAAAGGTGCAAAAATCAATTTTGATGATACACAAGCAATCGCCGATAACACCGACAATTGCATCTTTAAA ATGAATTATTTCCTACTTAAAGACGCAAAAAAATTTATCCCGAAATGTAGCATTCAGCTGAAAGAAGTCAAGGCCCATTTTAAGAAATCTGAAGATGAT TACATTTTGTCTGATAAAGAGAAATTTGCTAGCCCGCTGGTCATTAAAAAGAGCACATTTTTGCTGGCAACTGCACATGTGAAAGGGAAAAAAGGCAAT ATCAAGAAATTTCAGAAAGAATATTCGAAAGAAAACCCCACTGAGTATCGCAATTCTTTAAACGAATGGATTGCTTTTTGTAAAGAGTTCTTAAAAACT TATAAAGCGGCTACCATTTTTGATATAACCACATTGAAAAAGGCAGAGGAATATGCTGATATTGTAGAATTCTACAAGGATGTCGATAATCTGTGCTAC AAACTGGAGTTCTGCCCGATTAAAACCTCGTTTATAGAAAACCTGATAGATAACGGCGACCTGTATCTGTTTCGCATCAATAACAAAGACTTCAGCAGT AAATCGACCGGCACCAAGAACCTTCATACGTTATATTTACAAGCTATATTCGATGAACGTAATCTGAACAATCCGACAATTATGCTGAATGGGGGAGCA GAACTGTTCTATCGTAAAGAAAGTATTGAGCAGAAAAACCGTATCACACACAAAGCCGGTTCAATTCTCGTGAATAAGGTGTGTAAAGACGGTACAAGC CTGGATGATAAGATACGTAATGAAATTTATCAATATGAGAATAAATTTATTGATACCCTGTCTGATGAAGCTAAAAAGGTGTTACCGAATGTCATTAAA AAGGAAGCTACCCATGACATTACAAAAGATAAACGTTTCACTAGTGACAAATTCTTCTTTCACTGCCCCCTGACAATTAATTATAAGGAAGGCGATACC AAGCAGTTCAATAACGAAGTGCTGAGTTTTCTGCGTGGAAATCCTGACATCAACATTATCGGCATTGACCGCGGAGAGCGTAATTTAATCTATGTAACG GTTATAAACCAGAAAGGCGAGATTCTGGATTCGGTTTCATTCAATACCGTGACCAACAAGAGTTCAAAAATCGAGCAGACAGTCGATTATGAAGAGAAA TTGGCAGTCCGCGAGAAAGAGAGGATTGAAGCAAAACGTTCCTGGGACTCTATCTCAAAAATTGCGACACTAAAGGAAGGTTATCTGAGCGCAATAGTT CACGAGATCTGTCTGTTAATGATTAAACACAACGCGATCGTTGTCTTAGAGAATCTTAATGCAGGCTTTAAGCGTATTCGTGGCGGTTTATCAGAAAAA AGTGTTTATCAAAAATTCGAAAAAATGTTGATTAACAAACTGAACTATTTTGTCAGCAAGAAGGAATCCGACTGGAATAAACCGTCTGGTCTGCTGAAT GGACTGCAGCTTTCGGATCAGTTTGAAAGCTTCGAAAAACTGGGTATTCAGTCTGGTTTTATTTTTTACGTGCCGGCTGCATATACCTCAAAGATTGAT CCGACCACGGGCTTCGCCAATGTTCTGAATCTGTCGAAGGTACGCAATGTTGATGCGATCAAAAGCTTTTTTTCTAACTTCAACGAAATTAGTTATAGC AAGAAAGAAGCCCTTTTCAAATTCTCATTCGATCTGGATTCACTGAGTAAGAAAGGCTTTAGTAGCTTTGTGAAATTTAGTAAGAGTAAATGGAACGTC TACACCTTTGGAGAACGTATCATAAAGCCAAAGAATAAGCAAGGTTATCGGGAGGACAAAAGAATCAACTTGACCTTCGAGATGAAGAAGTTACTTAAC GAGTATAAGGTTTCTTTTGATCTTGAAAATAACTTGATTCCGAATCTCACGAGTGCCAACCTGAAGGATACTTTTTGGAAAGAGCTATTCTTTATCTTC AAGACTACGCTGCAGCTCCGTAACAGCGTTACTAACGGTAAAGAAGATGTGCTCATCTCTCCGGTCAAAAATGCGAAGGGTGAATTCTTCGTTTCGGGA ACGCATAACAAGACTCTTCCGCAAGATTGCGATGCGAACGGTGCATACCATATTGCGTTGAAAGGTCTGATGATACTCGAACGTAACAACCTTGTACGT GAGGAGAAAGATACGAAAAAGATTATGGCGATTTCAAACGTGGATTGGTTCGAGTACGTGCAGAAACGTAGAGGCGTTCTGTAAGAAATCATCCTTAGC GAAAGCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTG AATGAATTTTATCATTCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACAGAA TTATCTCATAACAAGTGTTAAGGGATGTTATTTCC SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTC ID TTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGA NO: CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 63 TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTT TTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGG GTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCAT CACCATAACAACTACGACGAATTCACCAAACTGTACCCGATCCAGAAAACCATCCGTTTCGAACTGAAA CCGCAGGGTCGTACCATGGAACACCTGGAAACCTTCAACTTCTTCGAAGAAGACCGTGACCGTGCGGAA AAATACAAAATCCTGAAAGAAGCGATCGACGAATACCACAAAAAATTCATCGACGAACACCTGACCAA CATGTCTCTGGACTGGAACTCTCTGAAACAGATCTCTGAAAAATACTACAAATCTCGTGAAGAAAAAGA CAAAAAAGTTTTCCTGTCTGAACAGAAACGTATGCGTCAGGAAATCGTTTCTGAATTCAAAAAAGACGA CCGTTTCAAAGACCTGTTCTCTAAAAAACTGTTCTCTGAACTGCTGAAAGAAGAAATCTACAAAAAAGG TAACCACCAGGAAATCGACGCGCTGAAATCTTTCGACAAATTCTCTGGTTACTTCATCGGTCTGCACGAA AACCGTAAAAACATGTACTCTGACGGTGACGAAATCACCGCGATCTCTAACCGTATCGTTAACGAAAAC TTCCCGAAATTCCTGGACAACCTGCAGAAATACCAGGAAGCGCGTAAAAAATACCCGGAATGGATCATC AAAGCGGAATCTGCGCTGGTTGCGCACAACATCAAAATGGACGAAGTTTTCTCTCTGGAATACTTCAAC AAAGTTCTGAACCAGGAAGGTATCCAGCGTTACAACCTGGCGCTGGGTGGTTACGTTACCAAATCTGGT GAAAAAATGATGGGTCTGAACGACGCGCTGAACCTGGCGCACCAGTCTGAAAAATCTTCTAAAGGTCGT ATCCACATGACCCCGCTGTTCAAACAGATCCTGTCTGAAAAAGAATCTTTCTCTTACATCCCGGACGTTT TCACCGAAGACTCTCAGCTGCTGCCGTCTATCGGTGGTTTCTTCGCGCAGATCGAAAACGACAAAGACG GTAACATCTTCGACCGTGCGCTGGAACTGATCTCTTCTTACGCGGAATACGACACCGAACGTATCTACAT CCGTCAGGCGGACATCAACCGTGTTTCTAACGTTATCTTCGGTGAATGGGGTACCCTGGGTGGTCTGATG CGTGAATACAAAGCGGACTCTATCAACGACATCAACCTGGAACGTACCTGCAAAAAAGTTGACAAATGG CTGGACTCTAAAGAATTCGCGCTGTCTGACGTTCTGGAAGCGATCAAACGTACCGGTAACAACGACGCG TTCAACGAATACATCTCTAAAATGCGTACCGCGCGTGAAAAAATCGACGCGGCGCGTAAAGAAATGAA ATTCATCTCTGAAAAAATCTCTGGTGACGAAGAATCTATCCACATCATCAAAACCCTGCTGGACTCTGTT CAGCAGTTCCTGCACTTCTTCAACCTGTTCAAAGCGCGTCAGGACATCCCGCTGGACGGTGCGTTCTACG CGGAATTCGACGAAGTTCACTCTAAACTGTTCGCGATCGTTCCGCTGTACAACAAAGTTCGTAACTACCT GACCAAAAACAACCTGAACACCAAAAAAATCAAACTGAACTTCAAAAACCCGACCCTGGCGAACGGTT GGGACCAGAACAAAGTTTACGACTACGCGTCTCTGATCTTCCTGCGTGACGGTAACTACTACCTGGGTAT CATCAACCCGAAACGTAAAAAAAACATCAAATTCGAACAGGGTTCTGGTAACGGTCCGTTCTACCGTAA AATGGTTTACAAACAGATCCCGGGTCCGAACAAAAACCTGCCGCGTGTTTTCCTGACCTCTACCAAAGG TAAAAAAGAATACAAACCGTCTAAAGAAATCATCGAAGGTTACGAAGCGGACAAACACATCCGTGGTG ACAAATTCGACCTGGACTTCTGCCACAAACTGATCGACTTCTTCAAAGAATCTATCGAAAAACACAAAG ACTGGTCTAAATTCAACTTCTACTTCTCTCCGACCGAATCTTACGGTGACATCTCTGAATTCTACCTGGAC GTTGAAAAACAGGGTTACCGTATGCACTTCGAAAACATCTCTGCGGAAACCATCGACGAATACGTTGAA AAAGGTGACCTGTTCCTGTTCCAGATCTACAACAAAGACTTCGTTAAAGCGGCGACCGGTAAAAAAGAC ATGCACACCATCTACTGGAACGCGGCGTTCTCTCCGGAAAACCTGCAGGACGTTGTTGTTAAACTGAAC GGTGAAGCGGAACTGTTCTACCGTGACAAATCTGACATCAAAGAAATCGTTCACCGTGAAGGTGAAATC CTGGTTAACCGTACCTACAACGGTCGTACCCCGGTTCCGGACAAAATCCACAAAAAACTGACCGACTAC CACAACGGTCGTACCAAAGACCTGGGTGAAGCGAAAGAATACCTGGACAAAGTTCGTTACTTCAAAGCG CACTACGACATCACCAAAGACCGTCGTTACCTGAACGACAAAATCTACTTCCACGTTCCGCTGACCCTGA ACTTCAAAGCGAACGGTAAAAAAAACCTGAACAAAATGGTTATCGAAAAATTCCTGTCTGACGAAAAA GCGCACATCATCGGTATCGACCGTGGTGAACGTAACCTGCTGTACTACTCTATCATCGACCGTTCTGGTA AAATCATCGACCAGCAGTCTCTGAACGTTATCGACGGTTTCGACTACCGTGAAAAACTGAACCAGCGTG AAATCGAAATGAAAGACGCGCGTCAGTCTTGGAACGCGATCGGTAAAATCAAAGACCTGAAAGAAGGT TACCTGTCTAAAGCGGTTCACGAAATCACCAAAATGGCGATCCAGTACAACGCGATCGTTGTTATGGAA GAACTGAACTACGGTTTCAAACGTGGTCGTTTCAAAGTTGAAAAACAGATCTACCAGAAATTCGAAAAC ATGCTGATCGACAAAATGAACTACCTGGTTTTCAAAGACGCGCCGGACGAATCTCCGGGTGGTGTTCTG AACGCGTACCAGCTGACCAACCCGCTGGAATCTTTCGCGAAACTGGGTAAACAGACCGGTATCCTGTTC TACGTTCCGGCGGCGTACACCTCTAAAATCGACCCGACCACCGGTTTCGTTAACCTGTTCAACACCTCTT CTAAAACCAACGCGCAGGAACGTAAAGAATTCCTGCAGAAATTCGAATCTATCTCTTACTCTGCGAAAG ACGGTGGTATCTTCGCGTTCGCGTTCGACTACCGTAAATTCGGTACCTCTAAAACCGACCACAAAAACGT TTGGACCGCGTACACCAACGGTGAACGTATGCGTTACATCAAAGAAAAAAAACGTAACGAACTGTTCGA CCCGTCTAAAGAAATCAAAGAAGCGCTGACCTCTTCTGGTATCAAATACGACGGTGGTCAGAACATCCT GCCGGACATCCTGCGTTCTAACAACAACGGTCTGATCTACACCATGTACTCTTCTTTCATCGCGGCGATC CAGATGCGTGTTTACGACGGTAAAGAAGACTACATCATCTCTCCGATCAAAAACTCTAAAGGTGAATTC TTCCGTACCGACCCGAAACGTCGTGAACTGCCGATCGACGCGGACGCGAACGGTGCGTACAACATCGCG CTGCGTGGTGAACTGACCATGCGTGCGATCGCGGAAAAATTCGACCCGGACTCTGAAAAAATGGCGAAA CTGGAACTGAAACACAAAGACTGGTTCGAATTCATGCAGACCCGTGGTGACTAAGAAATCATCCTTAGC GAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTAT TACTCAGGAAGCAAAGAGGATTACA SEQ AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTGAGATCCCTCTCCCTGACAGGATGATTACATA ID AATAATAGTGACAAAAATAAATTATTTATTTATCCAGAAAATGAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTca NO: aaCAGGTtgccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattctgtaacaaagcgggaccaaagc 64 catgacaaaaacgcgtaacaaaagtgtctataatcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgccatagcattttt atccataagattagcggatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtg agataggcggagatacgaactttaagAAGGAGatataccATGACTAAAACATTTGATTCAGAGTTTTTTAATTTGTACTCGCTGCAAAAAACGGTACGC TTTGAGTTAAAACCCGTGGGAGAAACCGCGTCATTTGTGGAAGACTTTAAAAACGAGGGCTTGAAACGTGTTGTGAGCGAAGATGAAAGGCGAGCCGTC GATTACCAGAAAGTTAAGGAAATAATTGACGATTACCATCGGGATTTCATTGAAGAAAGTTTAAATTATTTTCCGGAACAGGTGAGTAAAGATGCTCTT GAGCAGGCGTTTCATCTTTATCAGAAACTGAAGGCAGCAAAAGTTGAGGAAAGGGAAAAAGCGCTGAAAGAATGGGAAGCGCTGCAGAAAAAGCTACGT GAAAAAGTGGTGAAATGCTTCTCGGACTCGAATAAAGCCCGCTTCTCAAGGATTGATAAAAAGGAACTGATTAAGGAAGACCTGATAAATTGGTTGGTC GCCCAGAATCGCGAGGATGATATCCCTACGGTCGAAACGTTTAACAACTTCACCACATATTTTACCGGCTTCCATGAGAATCGTAAAAATATTTACTCC AAAGATGATCACGCCACCGCTATTAGCTTTCGCCTTATTCATGAAAATCTTCCAAAGTTTTTTGACAACGTGATTAGCTTCAATAAGTTGAAAGAGGGT TTCCCTGAATTAAAATTTGATAAAGTGAAAGAGGATTTAGAAGTAGATTATGATCTGAAGCATGCGTTTGAAATAGAATATTTCGTTAACTTCGTGACC CAAGCGGGCATAGATCAGTATAATTATCTGTTAGGAGGGAAAACCCTGGAGGACGGGACGAAAAAACAAGGGATGAATGAGCAAATTAATCTGTTCAAA CAACAGCAAACGCGAGATAAAGCGCGTCAGATTCCCAAACTGATCCCCCTGTTCAAACAGATTCTTAGCGAAAGGACTGAAAGCCAGTCCTTTATTCCT AAACAATTTGAAAGTGATCAGGAGTTGTTCGATTCACTGCAGAAGTTACATAATAACTGCCAGGATAAATTCACCGTGCTGCAACAAGCCATTCTCGGT CTGGCAGAGGCGGATCTTAAGAAGGTCTTCATCAAAACCTCTGATTTAAATGCCTTATCTAACACCATTTTCGGGAATTACAGCGTCTTTTCCGATGCA CTGAACCTGTATAAAGAAAGCCTGAAAACGAAAAAAGCGCAGGAGGCTTTTGAGAAACTACCGGCCCATTCTATTCACGACCTCATTCAATACTTGGAA CAGTTCAATTCCAGCCTGGACGCGGAAAAACAACAGAGCACCGACACCGTCCTGAACTACTTCATCAAGACCGATGAATTATATTCTCGCTTCATTAAA TCCACTAGCGAGGCTTTCACTCAGGTGCAGCCTTTGTTCGAACTGGAAGCCCTGTCATCTAAGCGCCGCCCACCGGAATCGGAAGATGAAGGGGCAAAA GGGCAGGAAGGCTTCGAGCAGATCAAGCGTATTAAAGCTTACCTGGATACGCTTATGGAAGCGGTACACTTTGCAAAGCCGTTGTATCTTGTTAAGGGT CGTAAAATGATCGAAGGGCTCGATAAAGACCAGTCCTTTTATGAAGCGTTTGAAATGGCGTACCAAGAACTTGAATCGTTAATCATTCCTATCTATAAC AAAGCGCGGAGCTATCTGTCGCGGAAACCTTTCAAGGCCGATAAATTCAAGATTAATTTTGACAACAACACGCTACTGAGCGGATGGGATGCGAACAAG GAAACTGCTAACGCGTCCATTCTGTTTAAGAAAGACGGGTTATATTACCTTGGAATTATGCCGAAAGGTAAGACCTTTCTCTTTGACTACTTTGTATCG AGCGAGGATTCAGAGAAACTGAAACAGCGTCGCCAGAAGACCGCCGAAGAAGCTCTGGCGCAGGATGGTGAAAGTTACTTCGAAAAAATTCGTTATAAA CTGTTACCAGGGGCTTCAAAGATGTTACCGAAAGTCTTTTTTAGCAACAAAAATATTGGCTTTTACAACCCGTCGGATGACATTTTACGCATTCGCAAC ACAGCCTCTCACACCAAAAACGGGACCCCTCAGAAAGGCCACTCAAAAGTTGAGTTTAACCTGAATGATTGTCATAAGATGATTGATTTCTTCAAATCA TCAATTCAGAAACACCCGGAATGGGGGTCTTTTGGCTTTACGTTTTCTGATACCAGTGATTTTGAAGACATGAGTGCCTTCTACCGGGAAGTAGAAAAC CAGGGTTACGTAATTAGCTTTGACAAAATCAAAGAGACCTATATACAGAGCCAGGTGGAACAGGGTAATCTCTACTTATTCCAGATTTATAACAAGGAT TTCTCGCCCTACAGCAAAGGCAAACCAAACCTGCATACTCTGTACTGGAAAGCCCTGTTTGAAGAAGCGAACCTGAATAACGTAGTGGCGAAGTTGAAC GGTGAAGCGGAAATCTTCTTCCGTCGTCACTCCATTAAGGCCTCTGATAAAGTTGTCCATCCGGCAAATCAGGCCATTGATAATAAGAATCCACACACG GAAAAAACGCAGTCAACCTTTGAATATGACCTCGTTAAAGACAAACGCTACACGCAAGATAAGTTCTTTTTCCACGTCCCAATCAGCCTCAACTTTAAA GCACAAGGGGTTTCAAAGTTTAATGATAAAGTCAATGGGTTCCTCAAGGGCAACCCGGATGTCAACATTATAGGTATAGACAGGGGCGAACGCCATCTG CTTTACTTTACCGTAGTGAATCAGAAAGGTGAAATACTGGTTCAGGAATCATTAAATACCTTGATGTCGGACAAAGGGCACGTTAATGATTACCAGCAG AAACTGGATAAAAAAGAACAGGAACGTGATGCTGCGCGTAAATCGTGGACCACGGTTGAGAACATTAAAGAGCTGAAAGAGGGGTATCTAAGCCATGTG GTACACAAACTGGCGCACCTCATCATTAAATATAACGCAATAGTCTGCCTAGAAGACTTGAATTTTGGCTTTAAACGCGGCCGCTTCAAAGTGGAAAAA CAAGTTTATCAAAAATTTGAAAAGGCGCTTATAGATAAACTGAATTATCTGGTTTTTAAAGAAAAGGAACTTGGTGAGGTAGGGCACTACTTGACAGCT TATCAACTGACGGCCCCGTTCGAATCATTCAAAAAACTGGGCAAACAGTCTGGCATTCTGTTTTACGTGCCGGCAGATTATACTTCAAAAATCGATCCA ACAACTGGCTTTGTGAACTTCCTGGACCTGAGATATCAGTCTGTAGAAAAAGCTAAACAACTTCTTAGCGATTTTAATGCCATTCGTTTTAACAGCGTT CAGAATTACTTTGAATTCGAAATTGACTATAAAAAACTTACTCCGAAACGTAAAGTCGGAACCCAAAGTAAATGGGTAATTTGTACGTATGGCGATGTC AGGTATCAGAACCGTCGGAATCAAAAAGGTCATTGGGAGACCGAAGAAGTGAACGTGACCGAAAAGCTGAAGGCTCTGTTCGCCAGCGATTCAAAAACT ACAACTGTGATCGATTACGCAAATGATGATAACCTGATAGATGTGATTTTAGAGCAGGATAAAGCCAGCTTTTTTAAAGAACTGTTGTGGCTCCTGAAA CTTACGATGACCTTACGACATTCCAAGATCAAATCGGAAGATGATTTTATTCTGTCACCGGTCAAGAATGAGCAGGGTGAATTCTATGATAGTAGGAAA GCCGGCGAAGTGTGGCCGAAAGACGCCGACGCCAATGGCGCCTATCATATCGCGCTCAAAGGGCTTTGGAATTTGCAGCAGATTAACCAGTGGGAAAAA GGTAAAACCCTGAATCTGGCTATCAAAAACCAGGATTGGTTTAGCTTTATCCAAGAGAAACCGTATCAGGAATGAGAAATCATCCTTAGCGAAAGCTAA GGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGAATTT TATCATTCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACAGAATTATCTCAT AACAAGTGTTAAGGGATGTTATTTCC SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTC ID TTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGA NO: CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 65 TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTT TTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGG GTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCAT CACCATCATACAGGCGGTCTTCTTAGTATGGACGCGAAAGAGTTCACAGGTCAGTATCCGTTGTCGAAA ACATTACGATTCGAACTTCGGCCCATCGGCCGCACGTGGGATAACCTGGAGGCCTCAGGCTACTTAGCG GAAGACCGCCATCGTGCCGAATGTTATCCTCGTGCGAAAGAGTTATTGGATGACAACCATCGTGCCTTCC TGAATCGTGTGTTGCCACAAATCGATATGGATTGGCACCCGATTGCGGAGGCCTTTTGTAAGGTACATAA AAACCCTGGTAATAAAGAACTTGCCCAGGATTACAACCTTCAGTTGTCAAAGCGCCGTAAGGAGATCAG CGCATATCTTCAGGATGCAGATGGCTATAAAGGCCTGTTCGCGAAGCCCGCCTTAGACGAAGCTATGAA AATTGCGAAAGAAAACGGGAACGAAAGTGATATTGAGGTTCTCGAAGCGTTTAACGGTTTTAGCGTATA CTTCACCGGTTATCATGAGTCACGCGAGAACATTTATAGCGATGAGGATATGGTGAGCGTAGCCTACCG AATTACTGAGGATAATTTCCCGCGCTTTGTCTCAAACGCTTTGATCTTTGATAAATTAAACGAAAGCCAT CCGGATATTATCTCTGAAGTATCGGGCAATCTTGGAGTTGATGACATTGGTAAGTACTTTGACGTGTCGA ACTATAACAATTTTCTTTCCCAGGCCGGTATAGATGACTACAATCACATTATTGGCGGCCATACAACCGA AGACGGACTGATACAAGCGTTTAATGTCGTATTGAACTTACGTCACCAAAAAGACCCTGGCTTTGAAAA AATTCAGTTCAAACAGCTCTACAAACAAATCCTGAGCGTGCGTACCAGCAAAAGCTACATCCCGAAACA GTTTGACAACTCTAAGGAGATGGTTGACTGCATTTGCGATTATGTCAGCAAAATAGAGAAATCCGAAAC AGTAGAACGGGCCCTGAAACTAGTCCGTAATATCAGTTCTTTCGACTTGCGCGGGATCTTTGTCAATAAA AAGAACTTGCGCATACTGAGCAACAAACTGATAGGAGATTGGGACGCGATCGAAACCGCATTGATGCAT AGTTCTTCATCAGAAAACGATAAGAAAAGCGTATATGATAGCGCGGAGGCTTTTACGTTGGATGACATC TTTTCAAGCGTGAAAAAATTTTCTGATGCCTCTGCCGAAGATATTGGCAACAGGGCGGAAGACATCTGT AGAGTGATAAGTGAGACGGCCCCTTTTATCAACGATCTGCGAGCGGTGGACCTGGATAGCCTGAACGAC GATGGTTATGAAGCGGCCGTCTCAAAAATTCGGGAGTCGCTGGAGCCTTATATGGATCTTTTCCATGAAC TGGAAATTTTCTCGGTTGGCGATGAGTTCCCAAAATGCGCAGCATTTTACAGCGAACTGGAGGAAGTCA GCGAACAGCTGATCGAAATTATTCCGTTATTCAACAAGGCGCGTTCGTTCTGCACCCGGAAACGCTATA GCACCGATAAGATTAAAGTGAACTTAAAATTCCCGACCTTGGCGGACGGGTGGGACCTGAACAAAGAG AGAGACAACAAAGCCGCGATTCTGCGGAAAGACGGTAAGTATTATCTGGCAATTCTGGATATGAAGAA AGATCTGTCAAGCATTAGGACCAGCGACGAAGATGAATCCAGCTTCGAAAAGATGGAGTATAAACTGTT ACCGAGTCCAGTAAAAATGCTGCCAAAGATATTCGTAAAATCGAAAGCCGCTAAGGAAAAATATGGCCT GACAGATCGTATGCTTGAATGCTACGATAAAGGTATGCATAAGTCGGGTAGTGCGTTTGATCTTGGCTTT TGCCATGAACTCATTGATTATTACAAGCGTTGTATCGCGGAGTACCCAGGCTGGGATGTGTTCGATTTCA AGTTTCGCGAAACTTCCGATTATGGGTCCATGAAAGAGTTCAATGAAGATGTGGCCGGAGCCGGTTACT ATATGAGTCTGAGAAAAATTCCGTGCAGCGAAGTGTACCGTCTGTTAGACGAGAAATCGATTTATCTATT TCAAATTTATAACAAAGATTACTCTGAAAATGCACATGGTAATAAGAACATGCATACCATGTACTGGGA GGGTCTCTTTTCCCCGCAAAACCTGGAGTCGCCCGTTTTCAAGTTGTCGGGTGGGGCAGAACTTTTCTTT CGAAAATCCTCAATCCCTAACGATGCCAAAACAGTACACCCGAAAGGCTCAGTGCTGGTTCCACGTAAT GATGTTAACGGTCGGCGTATTCCAGATTCAATCTACCGCGAACTGACACGCTATTTTAACCGTGGCGATT GCCGAATCAGTGACGAAGCCAAAAGTTATCTTGACAAGGTTAAGACTAAAAAAGCGGACCATGACATT GTGAAAGATCGCCGCTTTACCGTGGATAAAATGATGTTCCACGTCCCGATTGCGATGAACTTTAAGGCG ATCAGTAAACCGAACTTAAACAAAAAAGTCATTGATGGCATCATTGATGATCAGGATCTGAAAATCATT GGTATTGATCGTGGCGAGCGGAACTTAATTTACGTCACGATGGTTGACAGAAAAGGGAATATCTTATAT CAGGATTCTCTTAACATCCTCAATGGCTACGACTATCGTAAAGCTCTGGATGTGCGCGAATATGACAACA AGGAAGCGCGTCGTAACTGGACTAAAGTGGAGGGCATTCGCAAAATGAAGGAAGGCTATCTGTCATTA GCGGTCTCGAAATTAGCGGATATGATTATCGAAAATAACGCCATCATCGTTATGGAGGACCTGAACCAC GGATTCAAAGCGGGCCGCTCAAAGATTGAAAAACAAGTTTATCAGAAATTTGAGAGTATGCTGATTAAC AAACTGGGCTATATGGTGTTAAAAGACAAGTCAATTGACCAATCAGGTGGCGCGCTGCATGGATACCAG CTGGCGAACCATGTTACCACCTTAGCATCAGTTGGAAAGCAGTGTGGGGTTATCTTTTATATACCGGCAG CGTTCACTAGTAAAATAGATCCGACCACTGGTTTCGCCGATCTCTTTGCCCTGAGTAACGTTAAAAACGT AGCGAGCATGCGTGAATTCTTTTCCAAAATGAAATCTGTCATTTATGATAAAGCTGAAGGCAAATTCGC ATTCACCTTTGATTACTTGGATTACAACGTGAAGAGCGAATGTGGTCGTACGCTGTGGACCGTTTACACC GTTGGTGAGCGCTTCACCTATTCCCGTGTGAACCGCGAATATGTACGTAAAGTCCCCACCGATATTATCT ATGATGCCCTCCAGAAAGCAGGCATTAGCGTCGAAGGAGACTTAAGGGACAGAATTGCCGAAAGCGAT GGCGATACGCTGAAGTCTATTTTTTACGCATTCAAATACGCGCTAGATATGCGCGTTGAGAATCGCGAG GAAGACTACATTCAATCACCTGTGAAAAATGCCTCTGGGGAATTTTTTTGTTCAAAAAATGCTGGTAAAA GCCTCCCACAAGATAGCGATGCAAACGGTGCATATAACATTGCCCTGAAAGGTATTCTTCAATTACGCA TGCTGTCTGAGCAGTACGACCCCAACGCGGAATCTATTAGACTTCCGCTGATAACCAATAAAGCCTGGC TGACATTCATGCAGTCTGGCATGAAGACCTGGAAAAATTAGGAAATCATCCTTAGCGAAAGCTAAGGAT TTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCA AAGAGGATTACA SEQ AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTGAGATCCCTCTCCCTGACAGGATGATTACATA ID AATAATAGTGACAAAAATAAATTATTTATTTATCCAGAAAATGAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTca NO: aaCAGGTtgccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattctgtaacaaagcgggaccaaagc 66 catgacaaaaacgcgtaacaaaagtgtctataatcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgccatagcattttt atccataagattagcggatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtg agataggcggagatacgaactttaagAAGGAGatataccatgGATAGTTTGAAAGATTTCACCAATCTGTACCCTGTCAGTAAGACATTGAGATTTGAA TTAAAGCCCGTTGGAAAGACTTTAGAAAATATCGAGAAAGCAGGTATTTTGAAAGAGGATGAGCATCGTGCAGAAAGTTATCGGAGGGTGAAGAAAATA ATTGATACTTATCATAAGGTATTTATCGATTCTTCTCTTGAAAATATGGCTAAAATGGGTATTGAGAATGAAATAAAAGCAATGCTCCAAAGTTTCTGC GAATTGTATAAAAAAGATCATCGCACTGAGGGTGAAGACAAGGCATTAGATAAAATTCGAGCAGTACTTCGTGGCCTGATTGTTGGGGCTTTCACTGGT GTTTGCGGAAGACGGGAAAATACAGTCCAAAACGAGAAGTACGAGAGTTTGTTCAAAGAAAAGTTGATAAAAGAAATTTTACCTGATTTTGTGCTCTCT ACTGAGGCTGAAAGCTTGCCTTTCTCTGTTGAAGAAGCTACGAGGTCACTGAAGGAGTTTGATAGCTTTACATCCTACTTTGCTGGTTTTTACGAGAAT AGAAAGAATATATACTCGACGAAACCTCAATCCACTGCCATTGCTTATCGTCTTATTCATGAGAACTTGCCGAAGTTCATTGATAATATTCTTGTTTTT CAGAAGATCAAAGAGCCTATAGCCAAAGAGCTGGAACATATTCGTGCGGACTTTTCTGCCGGGGGGTACATAAAAAAGGATGAGAGATTGGAGGATATT TTTTCGTTGAACTATTATATCCACGTGTTATCTCAGGCTGGGATCGAAAAATATAACGCATTGATTGGGAAGATTGTGACAGAAGGAGATGGAGAGATG AAAGGGCTCAATGAACACATCAACCTTTACAACCAACAAAGAGGCAGAGAGGATCGGCTCCCTCTTTTTAGGCCTCTTTATAAACAGATATTGAGTGAC AGAGAGCAATTATCATACTTGCCTGAGAGTTTTGAAAAAGATGAGGAGCTCCTCAGGGCTCTAAAAGAGTTCTATGATCATATCGCAGAAGACATTCTC GGACGTACTCAACAGTTGATGACTTCTATTTCAGAATATGATTTATCTCGGATATACGTAAGGAACGATAGCCAATTGACTGATATATCAAAAAAAATG TTGGGAGATTGGAATGCTATCTACATGGCTAGAGAACGAGCATATGACCACGAGCAGGCTCCCAAAAGAATCACGGCGAAATACGAGAGGGACAGGATT AAAGCTCTTAAAGGAGAAGAGAGTATAAGTCTGGCAAATCTTAATAGTTGTATTGCCTTTCTGGACAATGTTAGAGATTGCCGTGTAGATACTTATCTT TCCACACTGGGCCAGAAGGAAGGACCACATGGTCTATCTAATCTCGTTGAGAACGTTTTTGCCTCATACCATGAAGCAGAGCAATTGTTGAGCTTTCCA TACCCCGAAGAGAATAATCTGATTCAGGACAAGGACAATGTGGTGTTAATTAAGAATCTTCTCGACAATATCAGTGATCTGCAGAGGTTCTTGAAACCT CTTTGGGGTATGGGAGACGAACCCGATAAAGATGAAAGATTTTATGGAGAGTATAATTATATCCGAGGAGCTCTAGATCAGGTGATCCCTCTGTACAAT AAGGTAAGGAACTACCTCACTCGGAAGCCTTATTCGACCAGAAAAGTAAAACTCAATTTTGGGAATTCTCAATTGCTTAGTGGTTGGGATAGAAATAAG GAAAAGGATAATAGCTGTGTGATTTTGCGTAAGGGGCAGAACTTCTATTTGGCTATTATGAACAATAGGCACAAAAGAAGTTTCGAAAACAAGGTGTTG CCCGAGTATAAGGAGGGAGAACCTTACTTCGAAAAGATGGATTATAAATTTTTGCCTGATCCTAATAAAATGCTTCCTAAGGTTTTTCTTTCGAAAAAA GGAATAGAGATATACAAACCAAGTCCGAAGCTTTTAGAACAATATGGACATGGAACTCACAAAAAGGGAGATACCTTTAGTATGGATGATTTGCACGAA CTGATCGATTTCTTCAAACACTCAATCGAGGCTCATGAAGATTGGAAGCAATTCGGATTCAAATTTTCTGATACGGCTACTTATGAGAATGTATCTAGT TTCTATAGAGAAGTTGAGGATCAGGGGTATAAGCTCTCTTTCCGAAAAGTTTCGGAATCTTATGTCTATTCATTAATAGATCAAGGCAAGTTGTATTTA TTTCAGATATACAACAAGGACTTTTCTCCCTGCAGCAAAGGGACACCTAATCTGCATACCTTGTATTGGAGAATGCTTTTTGACGAGCGCAATTTGGCA GATGTCATATACAAACTGGATGGGAAGGCTGAAATCTTTTTCCGAGAGAAGAGTTTGAAAAATGATCATCCCACGCATCCGGCTGGTAAGCCTATCAAA AAGAAAAGTCGACAAAAAAAAGGAGAGGAGAGTCTGTTTGAGTATGATTTAGTCAAGGATAGGCACTATACGATGGATAAGTTCCAGTTTCATGTGCCT ATTACTATGAATTTTAAATGTTCTGCAGGAAGCAAAGTCAATGATATGGTTAATGCTCATATTCGAGAGGCAAAGGATATGCATGTCATTGGAATTGAT CGTGGAGAACGCAATCTGCTGTATATATGCGTGATAGATAGTCGAGGGACGATTTTGGATCAAATTTCTCTGAATACGATTAACGATATAGACTATCAT GATTTATTGGAGAGTCGAGACAAAGACCGTCAGCAGGAGCGCCGAAACTGGCAAACTATCGAAGGGATCAAGGAGCTAAAACAAGGCTACCTTAGTCAG GCGGTTCATCGGATAGCCGAACTGATGGTGGCTTATAAGGCTGTAGTTGCTTTGGAGGATTTGAATATGGGGTTCAAACGTGGGCGGCAGAAAGTAGAA AGTTCTGTTTATCAGCAGTTTGAGAAACAGCTGATAGATAAGCTCAACTATCTTGTGGACAAGAAGAAAAGGCCTGAAGATATTGGAGGATTGTTGAGA GCCTATCAATTTACGGCCCCATTTAAGAGTTTTAAGGAAATGGGAAAGCAAAACGGCTTCTTGTTTTATATCCCGGCTTGGAACACGAGCAACATAGAT CCGACTACTGGATTTGTTAATTTATTTCATGCCCAGTATGAAAATGTAGATAAAGCGAAGAGCTTCTTTCAAAAGTTTGATTCAATTAGTTACAACCCG AAGAAAGACTGGTTTGAGTTTGCATTCGATTATAAAAACTTTACTAAAAAGGCTGAAGGAAGTCGTTCTATGTGGATATTATGCACACATGGTTCCCGA ATAAAGAATTTTAGAAATTCCCAGAAGAATGGTCAATGGGATTCCGAAGAATTCGCCTTGACGGAGGCTTTTAAGTCTCTTTTTGTGCGATATGAGATA GATTATACCGCTGATTTGAAAACAGCTATTGTGGACGAAAAGCAAAAAGACTTCTTCGTGGATCTTCTGAAGCTATTCAAATTGACAGTACAGATGCGC AACAGCTGGAAAGAGAAGGATTTGGATTATCTAATCTCTCCTGTAGCAGGGGCTGATGGCCGTTTCTTCGATACAAGAGAGGGAAATAAAAGTCTGCCT AAGGATGCAGATGCCAATGGAGCTTATAATATTGCCCTAAAAGGACTTTGGGCTCTACGCCAGATTCGGCAAACTTCAGAAGGCGGTAAACTCAAATTG GCGATTTCCAATAAGGAATGGCTACAGTTTGTGCAAGAGAGATCTTACGAGAAAGACtgaGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCT GAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCATTCATAATAA GTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAAGGG ATGTTATTTCC SEQ AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTGAGATCCCTCTCCCTGACAGGATGATTACATA ID AATAATAGTGACAAAAATAAATTATTTATTTATCCAGAAAATGAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTca NO: aaCAGGTtgccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattctgtaacaaagcgggaccaaagc 67 catgacaaaaacgcgtaacaaaagtgtctataatcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgccatagcattttt atccataagattagcggatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtg agataggcggagatacgaactttaagAAGGAGatataccATGAACAACGGCACAAATAATTTTCAGAACTTCATCGGGATCTCAAGTTTGCAGAAAACG CTGCGCAATGCTCTGATCCCCACGGAAACCACGCAACAGTTCATCGTCAAGAACGGAATAATTAAAGAAGATGAGTTACGTGGCGAGAACCGCCAGATT CTGAAAGATATCATGGATGACTACTACCGCGGATTCATCTCTGAGACTCTGAGTTCTATTGATGACATAGATTGGACTAGCCTGTTCGAAAAAATGGAA ATTCAGCTGAAAAATGGTGATAATAAAGATACCTTAATTAAGGAACAGACAGAGTATCGGAAAGCAATCCATAAAAAATTTGCGAACGACGATCGGTTT AAGAACATGTTTAGCGCCAAACTGATTAGTGACATATTACCTGAATTTGTCATCCACAACAATAATTATTCGGCATCAGAGAAAGAGGAAAAAACCCAG GTGATAAAATTGTTTTCGCGCTTTGCGACTAGCTTTAAAGATTACTTCAAGAACCGTGCAAATTGCTTTTCAGCGGACGATATTTCATCAAGCAGCTGC CATCGCATCGTCAACGACAATGCAGAGATATTCTTTTCAAATGCGCTGGTCTACCGCCGGATCGTAAAATCGCTGAGCAATGACGATATCAACAAAATT TCGGGCGATATGAAAGATTCATTAAAAGAAATGAGTCTGGAAGAAATATATTCTTACGAGAAGTATGGGGAATTTATTACCCAGGAAGGCATTAGCTTC TATAATGATATCTGTGGGAAAGTGAATTCTTTTATGAACCTGTATTGTCAGAAAAATAAAGAAAACAAAAATTTATACAAACTTCAGAAACTTCACAAA CAGATTCTATGCATTGCGGACACTAGCTATGAGGTCCCGTATAAATTTGAAAGTGACGAGGAAGTGTACCAATCAGTTAACGGCTTCCTTGATAACATT AGCAGCAAACATATAGTCGAAAGATTACGCAAAATCGGCGATAACTATAACGGCTACAACCTGGATAAAATTTATATCGTGTCCAAATTTTACGAGAGC GTTAGCCAAAAAACCTACCGCGACTGGGAAACAATTAATACCGCCCTCGAAATTCATTACAATAATATCTTGCCGGGTAACGGTAAAAGTAAAGCCGAC AAAGTAAAAAAAGCGGTTAAGAATGATTTACAGAAATCCATCACCGAAATAAATGAACTAGTGTCAAACTATAAGCTGTGCAGTGACGACAACATCAAA GCGGAGACTTATATACATGAGATTAGCCATATCTTGAATAACTTTGAAGCACAGGAATTGAAATACAATCCGGAAATTCACCTAGTTGAATCCGAGCTC AAAGCGAGTGAGCTTAAAAACGTGCTGGACGTGATCATGAATGCGTTTCATTGGTGTTCGGTTTTTATGACTGAGGAACTTGTTGATAAAGACAACAAT TTTTATGCGGAACTGGAGGAGATTTACGATGAAATTTATCCAGTAATTAGTCTGTACAACCTGGTTCGTAACTACGTTACCCAGAAACCGTACAGCACG AAAAAGATTAAATTGAACTTTGGAATACCGACGTTAGCAGACGGTTGGTCAAAGTCCAAAGAGTATTCTAATAACGCTATCATACTGATGCGCGACAAT CTGTATTATCTGGGCATCTTTAATGCGAAGAATAAACCGGACAAGAAGATTATCGAGGGTAATACGTCAGAAAATAAGGGTGACTACAAAAAGATGATT TATAATTTGCTCCCGGGTCCCAACAAAATGATCCCGAAAGTTTTCTTGAGCAGCAAGACGGGGGTGGAAACGTATAAACCGAGCGCCTATATCCTAGAG GGGTATAAACAGAATAAACATATCAAGTCTTCAAAAGACTTTGATATCACTTTCTGTCATGATCTGATCGACTACTTCAAAAACTGTATTGCAATTCAT CCCGAGTGGAAAAACTTCGGTTTTGATTTTAGCGACACCAGTACTTATGAAGACATTTCCGGGTTTTATCGTGAGGTAGAGTTACAAGGTTACAAGATT GATTGGACATACATTAGCGAAAAAGACATTGATCTGCTGCAGGAAAAAGGTCAACTGTATCTGTTCCAGATATATAACAAAGATTTTTCGAAAAAATCA ACCGGGAATGACAACCTTCACACCATGTACCTGAAAAATCTTTTCTCAGAAGAAAATCTTAAGGATATCGTCCTGAAACTTAACGGCGAAGCGGAAATC TTCTTCAGGAAGAGCAGCATAAAGAACCCAATCATTCATAAAAAAGGCTCGATTTTAGTCAACCGTACCTACGAAGCAGAAGAAAAAGACCAGTTTGGC AACATTCAAATTGTGCGTAAAAATATTCCGGAAAACATTTATCAGGAGCTGTACAAATACTTCAACGATAAAAGCGACAAAGAGCTGTCTGATGAAGCA GCCAAACTGAAGAATGTAGTGGGACACCACGAGGCAGCGACGAATATAGTCAAGGACTATCGCTACACGTATGATAAATACTTCCTTCATATGCCTATT ACGATCAATTTCAAAGCCAATAAAACGGGTTTTATTAATGATAGGATCTTACAGTATATCGCTAAAGAAAAAGACTTACATGTGATCGGCATTGATCGG GGCGAGCGTAACCTGATCTACGTGTCCGTGATTGATACTTGTGGTAATATAGTTGAACAGAAAAGCTTTAACATTGTAAACGGCTACGACTATCAGATA AAACTGAAACAACAGGAGGGCGCTAGACAGATTGCGCGGAAAGAATGGAAAGAAATTGGTAAAATTAAAGAGATCAAAGAGGGCTACCTGAGCTTAGTA ATCCACGAGATCTCTAAAATGGTAATCAAATACAATGCAATTATAGCGATGGAGGATTTGTCTTATGGTTTTAAAAAAGGGCGCTTTAAGGTCGAACGG CAAGTTTACCAGAAATTTGAAACCATGCTCATCAATAAACTCAACTATCTGGTATTTAAAGATATTTCGATTACCGAGAATGGCGGTCTCCTGAAAGGT TATCAGCTGACATACATTCCTGATAAACTTAAAAACGTGGGTCATCAGTGCGGCTGCATTTTTTATGTGCCTGCTGCATACACGAGCAAAATTGATCCG ACCACCGGCTTTGTGAATATCTTTAAATTTAAAGACCTGACAGTGGACGCAAAACGTGAATTCATTAAAAAATTTGACTCAATTCGTTATGACAGTGAA AAAAATCTGTTCTGCTTTACATTTGACTACAATAACTTTATTACGCAAAACACGGTCATGAGCAAATCATCGTGGAGTGTGTATACATACGGCGTGCGC ATCAAACGTCGCTTTGTGAACGGCCGCTTCTCAAACGAAAGTGATACCATTGACATAACCAAAGATATGGAGAAAACGTTGGAAATGACGGACATTAAC TGGCGCGATGGCCACGATCTTCGTCAAGACATTATAGATTATGAAATTGTTCAGCACATATTCGAAATTTTCCGTTTAACAGTGCAAATGCGTAACTCC TTGTCTGAACTGGAGGACCGTGATTACGATCGTCTCATTTCACCTGTACTGAACGAAAATAACATTTTTTATGACAGCGCGAAAGCGGGGGATGCACTT CCTAAGGATGCCGATGCAAATGGTGCGTATTGTATTGCATTAAAAGGGTTATATGAAATTAAACAAATTACCGAAAATTGGAAAGAAGATGGTAAATTT TCGCGCGATAAACTCAAAATCAGCAATAAAGATTGGTTCGACTTTATCCAGAATAAGCGCTATCTCTAAGAAATCATCCTTAGCGAAAGCTAAGGATTT TTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTAGTTTTAACGGCAATTAATATATGTGTTATTAATTGAATGAATTTTATCAT TCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACAGAATTATCTCATAACAAG TGTTAAGGGATGTTATTTCC SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTC ID TTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGA NO: CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 68 TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTT TTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGG GTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCAT CACCATACCAATAAATTCACTAACCAGTATTCTCTCTCTAAGACCCTGCGCTTTGAACTGATTCCGCAGG GGAAAACCTTGGAGTTCATTCAAGAAAAAGGCCTCTTGTCTCAGGATAAACAGAGGGCTGAATCTTACC AAGAAATGAAGAAAACTATTGATAAGTTTCATAAATATTTCATTGATTTAGCCTTGTCTAACGCCAAATT AACTCACTTGGAAACGTATCTGGAGTTATACAACAAATCTGCCGAAACTAAGAAAGAACAGAAATTTAA AGACGATTTGAAAAAAGTACAGGACAATCTGCGTAAAGAAATTGTCAAATCCTTCAGTGACGGCGATGC TAAAAGCATTTTTGCCATTCTGGACAAAAAAGAGTTGATTACTGTGGAATTAGAAAAGTGGTTTGAAAA CAATGAGCAGAAAGACATCTACTTCGATGAGAAATTCAAAACTTTCACCACCTATTTTACAGGATTTCAT CAAAACCGGAAGAACATGTACTCAGTAGAACCGAACTCCACGGCCATTGCGTATCGTTTGATCCATGAG AATCTGCCTAAATTTCTGGAGAATGCGAAAGCCTTTGAAAAGATTAAGCAGGTCGAATCGCTGCAAGTG AATTTTCGTGAACTCATGGGCGAATTTGGTGACGAAGGTCTAATCTTCGTTAACGAACTGGAAGAAATG TTTCAGATTAATTACTACAATGACGTGCTATCGCAGAACGGTATCACAATCTACAATAGTATTATCTCAG GGTTCACAAAAAACGATATAAAATACAAAGGCCTGAACGAGTATATCAATAACTACAACCAAACAAAG GACAAAAAGGATAGGCTTCCGAAACTGAAGCAGTTATACAAACAGATTTTATCTGACAGAATCTCCCTG AGCTTTCTGCCGGATGCTTTCACTGATGGGAAGCAGGTTCTGAAAGCGATTTTCGATTTTTATAAGATTA ACTTACTGAGCTACACGATTGAAGGTCAAGAAGAATCTCAAAACTTACTGCTCTTGATCCGTCAAACCAT TGAAAATCTATCATCGTTCGATACGCAGAAAATCTACCTCAAAAACGATACTCACCTGACTACGATCTCT CAGCAGGTTTTCGGGGATTTTAGTGTATTTTCAACAGCTCTGAACTACTGGTATGAAACCAAAGTCAATC CGAAATTCGAGACGGAATATTCTAAGGCCAACGAAAAAAAACGTGAGATTCTTGATAAAGCTAAAGCC GTATTTACTAAACAGGATTACTTTTCTATTGCTTTCCTGCAGGAAGTTTTATCGGAGTATATCCTGACCCT GGATCATACATCTGATATCGTTAAAAAACACAGCAGCAATTGCATCGCTGACTATTTCAAAAACCACTTT GTCGCCAAAAAAGAAAACGAAACAGACAAGACTTTCGATTTCATTGCTAACATCACCGCAAAATACCAG TGTATTCAGGGTATCTTGGAAAACGCCGACCAATACGAAGACGAACTGAAACAAGATCAGAAGCTGATC GATAATTTAAAATTCTTCTTAGATGCAATCCTGGAGCTGCTGCACTTCATCAAACCGCTTCATTTAAAGA GCGAGTCCATTACCGAAAAGGACACCGCCTTCTATGACGTTTTTGAAAATTATTATGAAGCCCTCTCCTT GCTGACTCCGCTGTATAATATGGTACGCAATTACGTAACCCAGAAACCATATTCTACCGAAAAAATTAA ACTGAACTTTGAAAACGCACAGCTGCTCAACGGTTGGGACGCGAATAAAGAAGGTGACTACCTCACCAC CATCCTGAAAAAAGATGGTAACTATTTTCTGGCAATTATGGATAAGAAACATAATAAAGCATTCCAGAA ATTTCCTGAAGGGAAAGAAAATTACGAAAAGATGGTGTACAAACTCTTACCTGGAGTTAACAAAATGTT GCCGAAAGTATTTTTTAGTAATAAGAACATCGCGTACTTTAACCCGTCCAAAGAACTGCTGGAAAATTAT AAAAAGGAGACGCATAAGAAAGGGGATACCTTTAACCTGGAACATTGCCATACCTTAATAGACTTCTTC AAGGATTCCCTGAATAAACACGAGGATTGGAAATATTTCGATTTTCAGTTTAGTGAGACCAAGTCATAC CAGGATCTTAGCGGCTTTTATCGCGAAGTAGAACACCAAGGCTATAAAATTAACTTCAAAAACATCGAC AGCGAATACATCGACGGTTTAGTTAACGAGGGCAAACTGTTTCTGTTCCAGATCTATTCAAAGGATTTTA GCCCGTTCTCTAAAGGCAAACCAAATATGCATACGTTGTACTGGAAAGCACTGTTTGAAGAGCAAAACC TGCAGAATGTGATTTATAAACTGAACGGCCAAGCTGAGATTTTTTTCCGTAAAGCCTCGATTAAACCGAA AAATATCATCCTTCATAAGAAGAAAATAAAGATCGCTAAAAAACACTTCATAGATAAAAAAACCAAAA CCTCCGAAATAGTGCCTGTTCAAACAATTAAGAACTTGAATATGTACTACCAGGGCAAGATATCGGAAA AGGAGTTGACTCAAGACGATCTTCGCTATATCGATAACTTTTCGATTTTTAACGAAAAAAACAAGACGA TCGACATCATCAAAGATAAACGCTTCACTGTAGATAAGTTCCAGTTTCATGTGCCGATTACTATGAACTT CAAAGCTACCGGGGGTAGCTATATCAACCAAACGGTGTTGGAATACCTGCAGAATAACCCGGAAGTCAA AATCATTGGGCTGGACCGCGGAGAACGTCACCTTGTGTACTTGACCTTAATCGATCAGCAAGGCAACAT CTTAAAACAAGAATCGCTGAATACCATTACGGATTCAAAGATTAGCACCCCGTATCATAAGCTGCTCGA TAACAAGGAGAATGAGCGCGACCTGGCCCGTAAAAACTGGGGCACGGTGGAAAACATTAAGGAGTTAA AGGAGGGTTATATTTCCCAGGTAGTGCATAAGATCGCCACTCTCATGCTCGAGGAAAATGCGATCGTTG TCATGGAAGACTTAAACTTCGGATTTAAACGTGGGCGATTTAAAGTAGAGAAACAAATCTACCAGAAGT TAGAAAAAATGCTGATTGACAAATTAAATTACTTGGTCCTAAAAGACAAACAGCCGCAAGAATTGGGTG GATTATACAACGCCCTCCAACTTACCAATAAATTCGAAAGTTTTCAGAAAATGGGTAAACAGTCAGGCT TTCTTTTTTATGTTCCTGCGTGGAACACATCCAAAATCGACCCTACAACCGGCTTCGTCAATTACTTCTAT ACTAAATATGAAAACGTCGACAAAGCAAAAGCATTCTTTGAAAAGTTCGAAGCAATACGTTTTAACGCT GAGAAAAAATATTTCGAGTTCGAAGTCAAGAAATACTCAGACTTTAACCCCAAAGCTGAGGGCACACAG CAAGCGTGGACAATCTGCACCTACGGCGAGCGCATCGAAACGAAGCGTCAAAAAGATCAGAATAACAA ATTTGTTTCAACACCTATCAACCTGACCGAGAAGATTGAAGACTTCTTAGGTAAAAATCAGATTGTTTAT GGCGACGGTAACTGTATAAAATCTCAAATAGCCTCAAAGGATGATAAAGCATTTTTCGAAACATTATTA TATTGGTTCAAAATGACACTGCAGATGCGCAATAGTGAGACGCGTACAGATATTGATTATCTTATCAGCC CGGTCATGAACGACAACGGTACTTTTTACAACTCCAGAGACTATGAAAAACTTGAGAATCCAACTCTCC CCAAAGATGCTGATGCGAACGGTGCTTATCACATCGCGAAAAAAGGTCTGATGCTGCTGAACAAAATCG ACCAAGCCGATCTGACTAAGAAAGTTGACCTAAGCATTTCAAATCGGGACTGGTTACAGTTTGTTCAAA AGAACAAATGAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTC AGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA SEQ AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTGAGATCCCTCTCCCTGACAGGATGATTACATA ID AATAATAGTGACAAAAATAAATTATTTATTTATCCAGAAAATGAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTca NO: aaCAGGTtgccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattctgtaacaaagcgggaccaaagc 69 catgacaaaaacgcgtaacaaaagtgtctataatcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgccatagcattttt atccataagattagcggatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtg agataggcggagatacgaactttaagAAGGAGatataccATGGAACAGGAATATTATCTGGGCTTGGACATGGGCACCGGTTCCGTCGGCTGGGCTGTT ACTGACAGTGAATATCACGTTCTAAGAAAGCATGGTAAGGCATTGTGGGGTGTAAGACTTTTCGAATCTGCTTCCACTGCTGAAGAGCGTAGAATGTTT AGAACGAGTCGACGTAGGCTAGACAGGCGCAATTGGAGAATCGAAATTTTACAAGAAATTTTTGCGGAAGAGATATCTAAGAAAGACCCAGGCTTTTTC CTGAGAATGAAGGAATCTAAGTATTACCCTGAGGATAAAAGAGATATAAATGGTAACTGTCCCGAATTGCCTTACGCATTATTTGTGGACGATGATTTT ACCGATAAGGATTACCATAAAAAGTTCCCAACTATCTACCATTTACGCAAAATGTTAATGAATACAGAGGAAACCCCAGACATAAGACTAGTTTATCTG GCAATACACCATATGATGAAACATAGAGGCCATTTCTTACTTTCCGGGGATATCAACGAAATCAAAGAGTTTGGTACCACATTTAGTAAGTTACTGGAA AACATAAAGAATGAAGAATTGGATTGGAACTTAGAACTCGGAAAAGAAGAATACGCGGTTGTCGAATCTATCCTGAAGGATAATATGCTGAATAGGTCG ACCAAAAAAACTAGGCTGATCAAAGCACTGAAAGCCAAATCTATCTGCGAAAAAGCTGTTTTAAATTTACTTGCTGGTGGCACTGTTAAGTTATCAGAC ATTTTTGGTTTGGAAGAATTGAACGAAACCGAGCGTCCAAAAATTAGTTTCGCTGATAATGGCTACGATGATTACATTGGTGAGGTGGAAAACGAGTTG GGCGAACAATTTTATATTATAGAGACAGCTAAGGCAGTCTATGACTGGGCTGTTTTAGTAGAAATCCTTGGTAAATACACATCTATCTCCGAAGCGAAA GTTGCTACTTACGAAAAGCACAAGTCCGATCTCCAGTTTTTGAAGAAAATTGTCAGGAAATATCTGACTAAGGAAGAATATAAAGATATTTTCGTTAGT ACCTCTGACAAACTGAAAAATTACTCCGCTTACATCGGGATGACCAAGATTAATGGCAAAAAAGTTGATCTGCAAAGCAAAAGGTGTTCGAAGGAAGAA TTTTATGATTTCATTAAAAAGAATGTCTTAAAAAAATTAGAAGGTCAGCCAGAATACGAATATTTGAAAGAAGAACTGGAAAGAGAGACATTCTTACCA AAACAAGTCAACAGAGATAATGGGGTAATTCCATATCAAATTCACCTCTACGAATTAAAAAAAATTTTAGGCAATTTACGCGATAAAATTGACCTTATC AAAGAAAATGAGGATAAGCTGGTTCAACTCTTTGAATTCAGAATACCCTATTATGTGGGCCCACTGAACAAGATTGATGACGGCAAAGAAGGTAAATTC ACATGGGCCGTCCGCAAATCCAATGAAAAAATTTACCCATGGAACTTTGAAAATGTAGTAGATATTGAAGCGTCTGCGGAGAAATTTATTCGAAGAATG ACTAATAAATGCACTTACTTGATGGGAGAGGATGTTCTGCCTAAAGACAGCTTATTATACAGCAAGTACATGGTTCTAAACGAACTTAACAACGTTAAG TTGGACGGTGAGAAATTAAGTGTAGAATTGAAACAAAGATTGTATACTGACGTCTTCTGCAAGTACAGAAAAGTGACAGTTAAAAAAATTAAGAATTAC TTGAAGTGCGAAGGTATAATTTCTGGAAACGTAGAGATTACTGGTATTGATGGTGATTTCAAAGCATCCCTAACAGCTTACCACGATTTCAAGGAAATC CTGACAGGAACTGAACTCGCAAAAAAAGATAAAGAAAACATTATTACTAATATTGTTCTTTTCGGTGATGACAAGAAATTGTTGAAGAAAAGACTGAAT AGACTTTACCCCCAGATTACTCCCAATCAACTTAAGAAAATTTGTGCTTTGTCTTACACAGGATGGGGTCGTTTTTCAAAAAAGTTCTTAGAAGAGATT ACCGCACCTGATCCAGAAACAGGCGAAGTATGGAATATAATTACCGCCTTATGGGAATCGAACAATAATCTTATGCAACTTCTGAGCAATGAATATCGT TTCATGGAAGAAGTTGAGACTTACAACATGGGCAAACAGACGAAGACTTTATCCTATGAAACTGTGGAAAATATGTATGTATCACCTTCTGTCAAGAGA CAAATTTGGCAAACCTTAAAAATTGTCAAAGAATTAGAAAAGGTAATGAAGGAGTCTCCTAAACGTGTGTTTATTGAAATGGCTAGAGAAAAACAAGAG TCAAAAAGAACCGAGTCAAGAAAGAAGCAGTTAATCGATTTATATAAGGCTTGTAAAAACGAAGAGAAAGATTGGGTTAAAGAATTGGGGGACCAAGAG GAACAAAAACTACGGTCGGATAAGTTGTATTTATACTATACGCAAAAGGGACGATGTATGTATTCCGGCGAGGTAATAGAATTGAAGGATTTATGGGAC AATACAAAATATGACATAGACCATATATATCCCCAATCAAAAACGATGGACGATAGCTTGAACAATAGAGTACTCGTGAAAAAAAAATATAATGCGACC AAATCTGATAAGTATCCTCTGAATGAAAATATCAGACATGAAAGAAAGGGGTTCTGGAAGTCCTTGTTAGATGGTGGGTTTATAAGCAAAGAAAAGTAC GAGCGTCTAATAAGAAACACGGAGTTATCGCCAGAAGAACTCGCTGGTTTTATTGAGAGGCAAATCGTGGAAACGAGACAATCTACCAAAGCCGTTGCT GAGATCCTAAAGCAAGTTTTCCCAGAGTCGGAGATTGTCTATGTCAAAGCTGGCACAGTGAGCAGGTTTAGGAAAGACTTCGAACTATTAAAGGTAAGA GAAGTGAACGATTTACATCACGCAAAGGACGCTTACCTAAATATCGTTGTAGGTAACTCATATTATGTTAAATTTACCAAGAACGCCTCTTGGTTTATA AAGGAGAACCCAGGTAGAACATATAACCTGAAAAAGATGTTCACCTCTGGTTGGAATATTGAGAGAAACGGAGAAGTCGCATGGGAAGTTGGTAAGAAA GGGACTATAGTGACAGTAAAGCAAATTATGAACAAAAATAATATCCTCGTTACAAGGCAGGTTCATGAAGCAAAGGGCGGCCTTTTTGACCAACAAATT ATGAAGAAAGGGAAAGGTCAAATTGCAATAAAAGAAACCGATGAGAGACTAGCGTCAATAGAAAAGTATGGTGGCTATAATAAAGCTGCGGGTGCATAC TTTATGCTTGTTGAATCAAAAGACAAGAAAGGTAAGACTATTAGAACTATAGAATTTATACCCCTGTACCTTAAAAACAAAATTGAATCGGATGAGTCA ATCGCGTTAAATTTTCTAGAGAAAGGAAGGGGTTTAAAAGAACCAAAGATCCTGTTAAAAAAGATTAAGATTGACACCTTGTTCGATGTAGATGGATTT AAAATGTGGTTATCTGGCAGAACAGGCGATAGACTTTTGTTTAAGTGCGCTAATCAATTAATTTTGGATGAGAAAATCATTGTCACAATGAAAAAAATA GTTAAGTTTATTCAGAGAAGACAAGAAAACAGGGAGTTGAAATTATCTGATAAAGATGGTATCGACAATGAAGTTTTAATGGAAATCTACAATACATTC GTTGATAAACTTGAAAATACCGTATATCGAATCAGGTTAAGTGAACAAGCCAAAACATTAATTGATAAACAAAAAGAATTTGAAAGGCTATCACTGGAA GACAAATCCTCCACCCTATTTGAAATTTTGCATATATTCCAGTGCCAATCTTCAGCAGCTAATTTAAAAATGATTGGCGGACCTGGGAAAGCCGGCATC CTAGTGATGAACAATAATATCTCCAAGTGTAACAAAATATCAATTATTAACCAATCTCCGACAGGTATTTTTGAAAATGAAATAGACTTGCTTAAGATA TAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTAGTTTTAACGGCAATTAA TATATGTGTTATTAATTGAATGAATTTTATCATTCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAA GCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAAGGGATGTTATTTCC SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTC ID TTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGA NO: CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 70 TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTT TTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGG GTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCAT CACCATTCTTTCGACTCTTTCACCAACCTGTACTCTCTGTCTAAAACCCTGAAATTCGAAATGCGTCCGGT TGGTAACACCCAGAAAATGCTGGACAACGCGGGTGTTTTCGAAAAAGACAAACTGATCCAGAAAAAAT ACGGTAAAACCAAACCGTACTTCGACCGTCTGCACCGTGAATTCATCGAAGAAGCGCTGACCGGTGTTG AACTGATCGGTCTGGACGAAAACTTCCGTACCCTGGTTGACTGGCAGAAAGACAAAAAAAACAACGTTG CGATGAAAGCGTACGAAAACTCTCTGCAGCGTCTGCGTACCGAAATCGGTAAAATCTTCAACCTGAAAG CGGAAGACTGGGTTAAAAACAAATACCCGATCCTGGGTCTGAAAAACAAAAACACCGACATCCTGTTCG AAGAAGCGGTTTTCGGTATCCTGAAAGCGCGTTACGGTGAAGAAAAAGACACCTTCATCGAAGTTGAAG AAATCGACAAAACCGGTAAATCTAAAATCAACCAGATCTCTATCTTCGACTCTTGGAAAGGTTTCACCG GTTACTTCAAAAAATTCTTCGAAACCCGTAAAAACTTCTACAAAAACGACGGTACCTCTACCGCGATCG CGACCCGTATCATCGACCAGAACCTGAAACGTTTCATCGACAACCTGTCTATCGTTGAATCTGTTCGTCA GAAAGTTGACCTGGCGGAAACCGAAAAATCTTTCTCTATCTCTCTGTCTCAGTTCTTCTCTATCGACTTCT ACAACAAATGCCTGCTGCAGGACGGTATCGACTACTACAACAAAATCATCGGTGGTGAAACCCTGAAAA ACGGTGAAAAACTGATCGGTCTGAACGAACTGATCAACCAGTACCGTCAGAACAACAAAGACCAGAAA ATCCCGTTCTTCAAACTGCTGGACAAACAGATCCTGTCTGAAAAAATCCTGTTCCTGGACGAAATCAAA AACGACACCGAACTGATCGAAGCGCTGTCTCAGTTCGCGAAAACCGCGGAAGAAAAAACCAAAATCGT TAAAAAACTGTTCGCGGACTTCGTTGAAAACAACTCTAAATACGACCTGGCGCAGATCTACATCTCTCA GGAAGCGTTCAACACCATCTCTAACAAATGGACCTCTGAAACCGAAACCTTCGCGAAATACCTGTTCGA AGCGATGAAATCTGGTAAACTGGCGAAATACGAAAAAAAAGACAACTCTTACAAATTCCCGGACTTCAT CGCGCTGTCTCAGATGAAATCTGCGCTGCTGTCTATCTCTCTGGAAGGTCACTTCTGGAAAGAAAAATAC TACAAAATCTCTAAATTCCAGGAAAAAACCAACTGGGAACAGTTCCTGGCGATCTTCCTGTACGAATTC AACTCTCTGTTCTCTGACAAAATCAACACCAAAGACGGTGAAACCAAACAGGTTGGTTACTACCTGTTC GCGAAAGACCTGCACAACCTGATCCTGTCTGAACAGATCGACATCCCGAAAGACTCTAAAGTTACCATC AAAGACTTCGCGGACTCTGTTCTGACCATCTACCAGATGGCGAAATACTTCGCGGTTGAAAAAAAACGT GCGTGGCTGGCGGAATACGAACTGGACTCTTTCTACACCCAGCCGGACACCGGTTACCTGCAGTTCTAC GACAACGCGTACGAAGACATCGTTCAGGTTTACAACAAACTGCGTAACTACCTGACCAAAAAACCGTAC TCTGAAGAAAAATGGAAACTGAACTTCGAAAACTCTACCCTGGCGAACGGTTGGGACAAAAACAAAGA ATCTGACAACTCTGCGGTTATCCTGCAGAAAGGTGGTAAATACTACCTGGGTCTGATCACCAAAGGTCA CAACAAAATCTTCGACGACCGTTTCCAGGAAAAATTCATCGTTGGTATCGAAGGTGGTAAATACGAAAA AATCGTTTACAAATTCTTCCCGGACCAGGCGAAAATGTTCCCGAAAGTTTGCTTCTCTGCGAAAGGTCTG GAATTCTTCCGTCCGTCTGAAGAAATCCTGCGTATCTACAACAACGCGGAATTCAAAAAAGGTGAAACC TACTCTATCGACTCTATGCAGAAACTGATCGACTTCTACAAAGACTGCCTGACCAAATACGAAGGTTGG GCGTGCTACACCTTCCGTCACCTGAAACCGACCGAAGAATACCAGAACAACATCGGTGAATTCTTCCGT GACGTTGCGGAAGACGGTTACCGTATCGACTTCCAGGGTATCTCTGACCAGTACATCCACGAAAAAAAC GAAAAAGGTGAACTGCACCTGTTCGAAATCCACAACAAAGACTGGAACCTGGACAAAGCGCGTGACGG TAAATCTAAAACCACCCAGAAAAACCTGCACACCCTGTACTTCGAATCTCTGTTCTCTAACGACAACGTT GTTCAGAACTTCCCGATCAAACTGAACGGTCAGGCGGAAATCTTCTACCGTCCGAAAACCGAAAAAGAC AAACTGGAATCTAAAAAAGACAAAAAAGGTAACAAAGTTATCGACCACAAACGTTACTCTGAAAACAA AATCTTCTTCCACGTTCCGCTGACCCTGAACCGTACCAAAAACGACTCTTACCGTTTCAACGCGCAGATC AACAACTTCCTGGCGAACAACAAAGACATCAACATCATCGGTGTTGACCGTGGTGAAAAACACCTGGTT TACTACTCTGTTATCACCCAGGCGTCTGACATCCTGGAATCTGGTTCTCTGAACGAACTGAACGGTGTTA ACTACGCGGAAAAACTGGGTAAAAAAGCGGAAAACCGTGAACAGGCGCGTCGTGACTGGCAGGACGTT CAGGGTATCAAAGACCTGAAAAAAGGTTACATCTCTCAGGTTGTTCGTAAACTGGCGGACCTGGCGATC AAACACAACGCGATCATCATCCTGGAAGACCTGAACATGCGTTTCAAACAGGTTCGTGGTGGTATCGAA AAATCTATCTACCAGCAGCTGGAAAAAGCGCTGATCGACAAACTGTCTTTCCTGGTTGACAAAGGTGAA AAAAACCCGGAACAGGCGGGTCACCTGCTGAAAGCGTACCAGCTGTCTGCGCCGTTCGAAACCTTCCAG AAAATGGGTAAACAGACCGGTATCATCTTCTACACCCAGGCGTCTTACACCTCTAAATCTGACCCGGTTA CCGGTTGGCGTCCGCACCTGTACCTGAAATACTTCTCTGCGAAAAAAGCGAAAGACGACATCGCGAAAT TCACCAAAATCGAATTCGTTAACGACCGTTTCGAACTGACCTACGACATCAAAGACTTCCAGCAGGCGA AAGAATACCCGAACAAAACCGTTTGGAAAGTTTGCTCTAACGTTGAACGTTTCCGTTGGGACAAAAACC TGAACCAGAACAAAGGTGGTTACACCCACTACACCAACATCACCGAAAACATCCAGGAACTGTTCACCA AATACGGTATCGACATCACCAAAGACCTGCTGACCCAGATCTCTACCATCGACGAAAAACAGAACACCT CTTTCTTCCGTGACTTCATCTTCTACTTCAACCTGATCTGCCAGATCCGTAACACCGACGACTCTGAAATC GCGAAAAAAAACGGTAAAGACGACTTCATCCTGTCTCCGGTTGAACCGTTCTTCGACTCTCGTAAAGAC AACGGTAACAAACTGCCGGAAAACGGTGACGACAACGGTGCGTACAACATCGCGCGTAAAGGTATCGT TATCCTGAACAAAATCTCTCAGTACTCTGAAAAAAACGAAAACTGCGAAAAAATGAAATGGGGTGACCT GTACGTTTCTAACATCGACTGGGACAACTTCGTTGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTA TCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGA TTACA SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTC ID TTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGA NO: CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 71 TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTT TTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGG GTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCAT CACCATAACAAATTCGAAAACTTCACCGGTCTGTACCCGATCTCTAAAACCCTGCGTTTCGAACTGATCC CGCAGGGTAAAACCCTGGAATACATCGAAAAATCTGAAATCCTGGAAAACGACAACTACCGTGCGGAA AAATACGAAGAAGTTAAAGACATCATCGACGGTTACCACAAATGGTTCATCAACGAAACCCTGCACGAC CTGCACATCAACTGGTCTGAACTGAAAGTTGCGCTGGAAAACAACCGTATCGAAAAATCTGACGCGTCT AAAAAAGAACTGCAGCGTGTTCAGAAAATCAAACGTGAAGAAATCTACAACGCGTTCATCGAACACGA AGCGTTCCAGTACCTGTTCAAAGAAAACCTGCTGTCTGACCTGCTGCCGATCCAGATCGAACAGTCTGA AGACCTGGACGCGGAAAAAAAAAAACAGGCGGTTGAAACCTTCAACCGTTTCTCTACCTACTTCACCGG TTTCCACGAAAACCGTAAAAACATCTACTCTAAAGAAGGTATCTCTACCTCTGTTACCTACCGTATCGTT CACGACAACTTCCCGAAATTCCTGGAAAACATGAAAGTTTTCGAAATCCTGCGTAACGAATGCCCGGAA GTTATCTCTGACACCGCGAACGAACTGGCGCCGTTCATCGACGGTGTTCGTATCGAAGACATCTTCCTGA TCGACTTCTTCAACTCTACCTTCTCTCAGAACGGTATCGACTACTACAACCGTATCCTGGGTGGTGTTACC ACCGAAACCGGTGAAAAATACCGTGGTATCAACGAATTCACCAACCTGTACCGTCAGCAGCACCCGGAA TTCGGTAAATCTAAAAAAGCGACCAAAATGGTTGTTCTGTTCAAACAGATCCTGTCTGACCGTGACACCC TGTCTTTCATCCCGGAAATGTTCGGTAACGACAAACAGGTTCAGAACTCTATCCAGCTGTTCTACAACCG TGAAATCTCTCAGTTCGAAAACGAAGGTGTTAAAACCGACGTTTGCACCGCGCTGGCGACCCTGACCTC TAAAATCGCGGAATTCGACACCGAAAAAATCTACATCCAGCAGCCGGAACTGCCGAACGTTTCTCAGCG TCTGTTCGGTTCTTGGAACGAACTGAACGCGTGCCTGTTCAAATACGCGGAACTGAAATTCGGTACCGCG GAAAAAGTTGCGAACCGTAAAAAAATCGACAAATGGCTGAAATCTGACCTGTTCTCTTTCACCGAACTG AACAAAGCGCTGGAATTCTCTGGTAAAGACGAACGTATCGAAAACTACTTCTCTGAAACCGGTATCTTC GCGCAGCTGGTTAAAACCGGTTTCGACGAAGCGCAGTCTATCCTGGAAACCGAATACACCTCTGAAGTT CACCTGAAAGACCAGCAGACCGACATCGAAAAAATCAAAACCTTCCTGGACGCGCTGCAGAACCTGAT GCACCTGCTGAAATCTCTGTGCGTTTCTGAAGAAGCGGACCGTGACGCGGCGTTCTACAACGAATTCGA CATGCTGTACAACCAGCTGAAACTGGTTGTTCCGCTGTACAACAAAGTTCGTAACTACATCACCCAGAA ACTGTTCCGTTCTGACAAAATCAAAATCTACTTCGAAAACAAAGGTCAGTTCCTGGGTGGTTGGGTTGAC TCTCAGACCGAAAACTCTGACAACGGTACCCAGGCGGGTGGTTACATCTTCCGTAAAGAAAACGTTATC AACGAATACGACTACTACCTGGGTATCTGCTCTGACCCGAAACTGTTCCGTCGTACCACCATCGTTTCTG AAAACGACCGTTCTTCTTTCGAACGTCTGGACTACTACCAGCTGAAAACCGCGTCTGTTTACGGTAACTC TTACTGCGGTAAACACCCGTACACCGAAGACAAAAACGAACTGGTTAACTCTATCGACCGTTTCGTTCA CCTGTCTGGTAACAACATCCTGATCGAAAAAATCGCGAAAGACAAAGTTAAATCTAACCCGACCACCAA CACCCCGTCTGGTTACCTGAACTTCATCCACCGTGAAGCGCCGAACACCTACGAATGCCTGCTGCAGGA CGAAAACTTCGTTTCTCTGAACCAGCGTGTTGTTTCTGCGCTGAAAGCGACCCTGGCGACCCTGGTTCGT GTTCCGAAAGCGCTGGTTTACGCGAAAAAAGACTACCACCTGTTCTCTGAAATCATCAACGACATCGAC GAACTGTCTTACGAAAAAGCGTTCTCTTACTTCCCGGTTTCTCAGACCGAATTCGAAAACTCTTCTAACC GTACCATCAAACCGCTGCTGCTGTTCAAAATCTCTAACAAAGACCTGTCTTTCGCGGAAAACTTCGAAAA AGGTAACCGTCAGAAAATCGGTAAAAAAAACCTGCACACCCTGTACTTCGAAGCGCTGATGAAAGGTA ACCAGGACACCATCGACATCGGTACCGGTATGGTTTTCCACCGTGTTAAATCTCTGAACTACAACGAAA AAACCCTGAAATACGGTCACCACTCTACCCAGCTGAACGAAAAATTCTCTTACCCGATCATCAAAGACA AACGTTTCGCGTCTGACAAATTCCTGTTCCACCTGTCTACCGAAATCAACTACAAAGAAAAACGTAAAC CGCTGAACAACTCTATCATCGAATTCCTGACCAACAACCCGGACATCAACATCATCGGTCTGGACCGTG GTGAACGTCACCTGATCTACCTGACCCTGATCAACCAGAAAGGTGAAATCCTGCGTCAGAAAACCTTCA ACATCGTTGGTAACACCAACTACCACGAAAAACTGAACCAGCGTGAAAAAGAACGTGACAACGCGCGT AAATCTTGGGCGACCATCGGTAAAATCAAAGAACTGAAAGAAGGTTTCCTGTCTCTGGTTATCCACGAA ATCGCGAAAATCATGGTTGAAAACAACGCGATCGTTGTTCTGGAAGACCTGAACTTCGGTTTCAAACGT GGTCGTTTCAAAGTTGAAAAACAGATCTACCAGAAATTCGAAAAAATGCTGATCGACAAACTGAACTAC CTGGTTTTCAAAGACAAAAAAGCGAACGAAGCGGGTGGTGTTCTGAAAGGTTACCAGCTGGCGGAAAA ATTCGAATCTTTCCAGAAAATGGGTAAACAGTCTGGTTTCCTGTTCTACGTTCCGGCGGCGTACACCTCT AAAATCGACCCGACCACCGGTTTCGTTAACATGCTGAACCTGAACTACACCAACATGAAAGACGCGCAG ACCCTGCTGTCTGGTATGGACAAAATCTCTTTCAACGCGGACGCGAACTACTTCGAATTCGAACTGGACT ACGAAAAATTCAAAACCAACCAGACCGACCACACCAACAAATGGACCATCTGCACCGTTGGTGAAAAA CGTTTCACCTACAACTCTGCGACCAAAGAAACCACCACCGTTAACGTTACCGAAGACCTGAAAAAACTG CTGGACAAATTCGAAGTTAAATACTCTAACGGTGACAACATCAAAGACGAAATCTGCCGTCAGACCGAC GCGAAATTCTTCGAAATCATCCTGTGGCTGCTGAAACTGACCATGCAGATGCGTAACTCTAACACCAAA ACCGAAGAAGACTTCATCCTGTCTCCGGTTAAAAACTCTAACGGTGAATTCTTCCGTTCTAACGACGACG CGAACGGTATCTGGCCGGCGGACGCGGACGCGAACGGTGCGTACCACATCGCGCTGAAAGGTCTGTACC TGGTTAAAGAATGCTTCAACAAAAACGAAAAATCTCTGAAAATCGAACACAAAAACTGGTTCAAATTCG CGCAGACCCGTTTCAACGGTTCTCTGACCAAAAACGGTTAAGAAATCATCCTTAGCGAAAGCTAAGGAT TTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCA AAGAGGATTACA SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTC ID TTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGA NO: CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 72 TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTT TTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGG GTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCAT CACCATACCCAGTTCGAAGGTTTCACCAACCTGTACCAGGTTTCTAAAACCCTGCGTTTCGAACTGATCC CGCAGGGTAAAACCCTGAAACACATCCAGGAACAGGGTTTCATCGAAGAAGACAAAGCGCGTAACGAC CACTACAAAGAACTGAAACCGATCATCGACCGTATCTACAAAACCTACGCGGACCAGTGCCTGCAGCTG GTTCAGCTGGACTGGGAAAACCTGTCTGCGGCGATCGACTCTTACCGTAAAGAAAAAACCGAAGAAACC CGTAACGCGCTGATCGAAGAACAGGCGACCTACCGTAACGCGATCCACGACTACTTCATCGGTCGTACC GACAACCTGACCGACGCGATCAACAAACGTCACGCGGAAATCTACAAAGGTCTGTTCAAAGCGGAACT GTTCAACGGTAAAGTTCTGAAACAGCTGGGTACCGTTACCACCACCGAACACGAAAACGCGCTGCTGCG TTCTTTCGACAAATTCACCACCTACTTCTCTGGTTTCTACGAAAACCGTAAAAACGTTTTCTCTGCGGAA GACATCTCTACCGCGATCCCGCACCGTATCGTTCAGGACAACTTCCCGAAATTCAAAGAAAACTGCCAC ATCTTCACCCGTCTGATCACCGCGGTTCCGTCTCTGCGTGAACACTTCGAAAACGTTAAAAAAGCGATCG GTATCTTCGTTTCTACCTCTATCGAAGAAGTTTTCTCTTTCCCGTTCTACAACCAGCTGCTGACCCAGACC CAGATCGACCTGTACAACCAGCTGCTGGGTGGTATCTCTCGTGAAGCGGGTACCGAAAAAATCAAAGGT CTGAACGAAGTTCTGAACCTGGCGATCCAGAAAAACGACGAAACCGCGCACATCATCGCGTCTCTGCCG CACCGTTTCATCCCGCTGTTCAAACAGATCCTGTCTGACCGTAACACCCTGTCTTTCATCCTGGAAGAAT TCAAATCTGACGAAGAAGTTATCCAGTCTTTCTGCAAATACAAAACCCTGCTGCGTAACGAAAACGTTCT GGAAACCGCGGAAGCGCTGTTCAACGAACTGAACTCTATCGACCTGACCCACATCTTCATCTCTCACAA AAAACTGGAAACCATCTCTTCTGCGCTGTGCGACCACTGGGACACCCTGCGTAACGCGCTGTACGAACG TCGTATCTCTGAACTGACCGGTAAAATCACCAAATCTGCGAAAGAAAAAGTTCAGCGTTCTCTGAAACA CGAAGACATCAACCTGCAGGAAATCATCTCTGCGGCGGGTAAAGAACTGTCTGAAGCGTTCAAACAGAA AACCTCTGAAATCCTGTCTCACGCGCACGCGGCGCTGGACCAGCCGCTGCCGACCACCCTGAAAAAACA GGAAGAAAAAGAAATCCTGAAATCTCAGCTGGACTCTCTGCTGGGTCTGTACCACCTGCTGGACTGGTT CGCGGTTGACGAATCTAACGAAGTTGACCCGGAATTCTCTGCGCGTCTGACCGGTATCAAACTGGAAAT GGAACCGTCTCTGTCTTTCTACAACAAAGCGCGTAACTACGCGACCAAAAAACCGTACTCTGTTGAAAA ATTCAAACTGAACTTCCAGATGCCGACCCTGGCGTCTGGTTGGGACGTTAACAAAGAAAAAAACAACGG TGCGATCCTGTTCGTTAAAAACGGTCTGTACTACCTGGGTATCATGCCGAAACAGAAAGGTCGTTACAA AGCGCTGTCTTTCGAACCGACCGAAAAAACCTCTGAAGGTTTCGACAAAATGTACTACGACTACTTCCC GGACGCGGCGAAAATGATCCCGAAATGCTCTACCCAGCTGAAAGCGGTTACCGCGCACTTCCAGACCCA CACCACCCCGATCCTGCTGTCTAACAACTTCATCGAACCGCTGGAAATCACCAAAGAAATCTACGACCT GAACAACCCGGAAAAAGAACCGAAAAAATTCCAGACCGCGTACGCGAAAAAAACCGGTGACCAGAAA GGTTACCGTGAAGCGCTGTGCAAATGGATCGACTTCACCCGTGACTTCCTGTCTAAATACACCAAAACC ACCTCTATCGACCTGTCTTCTCTGCGTCCGTCTTCTCAGTACAAAGACCTGGGTGAATACTACGCGGAAC TGAACCCGCTGCTGTACCACATCTCTTTCCAGCGTATCGCGGAAAAAGAAATCATGGACGCGGTTGAAA CCGGTAAACTGTACCTGTTCCAGATCTACAACAAAGACTTCGCGAAAGGTCACCACGGTAAACCGAACC TGCACACCCTGTACTGGACCGGTCTGTTCTCTCCGGAAAACCTGGCGAAAACCTCTATCAAACTGAACG GTCAGGCGGAACTGTTCTACCGTCCGAAATCTCGTATGAAACGTATGGCGCACCGTCTGGGTGAAAAAA TGCTGAACAAAAAACTGAAAGACCAGAAAACCCCGATCCCGGACACCCTGTACCAGGAACTGTACGAC TACGTTAACCACCGTCTGTCTCACGACCTGTCTGACGAAGCGCGTGCGCTGCTGCCGAACGTTATCACCA AAGAAGTTTCTCACGAAATCATCAAAGACCGTCGTTTCACCTCTGACAAATTCTTCTTCCACGTTCCGAT CACCCTGAACTACCAGGCGGCGAACTCTCCGTCTAAATTCAACCAGCGTGTTAACGCGTACCTGAAAGA ACACCCGGAAACCCCGATCATCGGTATCGACCGTGGTGAACGTAACCTGATCTACATCACCGTTATCGA CTCTACCGGTAAAATCCTGGAACAGCGTTCTCTGAACACCATCCAGCAGTTCGACTACCAGAAAAAACT GGACAACCGTGAAAAAGAACGTGTTGCGGCGCGTCAGGCGTGGTCTGTTGTTGGTACCATCAAAGACCT GAAACAGGGTTACCTGTCTCAGGTTATCCACGAAATCGTTGACCTGATGATCCACTACCAGGCGGTTGTT GTTCTGGAAAACCTGAACTTCGGTTTCAAATCTAAACGTACCGGTATCGCGGAAAAAGCGGTTTACCAG CAGTTCGAAAAAATGCTGATCGACAAACTGAACTGCCTGGTTCTGAAAGACTACCCGGCGGAAAAAGTT GGTGGTGTTCTGAACCCGTACCAGCTGACCGACCAGTTCACCTCTTTCGCGAAAATGGGTACCCAGTCTG GTTTCCTGTTCTACGTTCCGGCGCCGTACACCTCTAAAATCGACCCGCTGACCGGTTTCGTTGACCCGTTC GTTTGGAAAACCATCAAAAACCACGAATCTCGTAAACACTTCCTGGAAGGTTTCGACTTCCTGCACTACG ACGTTAAAACCGGTGACTTCATCCTGCACTTCAAAATGAACCGTAACCTGTCTTTCCAGCGTGGTCTGCC GGGTTTCATGCCGGCGTGGGACATCGTTTTCGAAAAAAACGAAACCCAGTTCGACGCGAAAGGTACCCC GTTCATCGCGGGTAAACGTATCGTTCCGGTTATCGAAAACCACCGTTTCACCGGTCGTTACCGTGACCTG TACCCGGCGAACGAACTGATCGCGCTGCTGGAAGAAAAAGGTATCGTTTTCCGTGACGGTTCTAACATC CTGCCGAAACTGCTGGAAAACGACGACTCTCACGCGATCGACACCATGGTTGCGCTGATCCGTTCTGTTC TGCAGATGCGTAACTCTAACGCGGCGACCGGTGAAGACTACATCAACTCTCCGGTTCGTGACCTGAACG GTGTTTGCTTCGACTCTCGTTTCCAGAACCCGGAATGGCCGATGGACGCGGACGCGAACGGTGCGTACC ACATCGCGCTGAAAGGTCAGCTGCTGCTGAACCACCTGAAAGAATCTAAAGACCTGAAACTGCAGAACG GTATCTCTAACCAGGACTGGCTGGCGTACATCCAGGAACTGCGTAACTAGAAATCATCCTTAGCGAAAG CTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTC AGGAAGCAAAGAGGATTACA SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTC ID TTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGA NO: CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 73 TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTT TTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGG GTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCAT CACCATGCGGTTAAATCTATCAAAGTTAAACTGCGTCTGGACGACATGCCGGAAATCCGTGCGGGTCTG TGGAAACTGCACAAAGAAGTTAACGCGGGTGTTCGTTACTACACCGAATGGCTGTCTCTGCTGCGTCAG GAAAACCTGTACCGTCGTTCTCCGAACGGTGACGGTGAACAGGAATGCGACAAAACCGCGGAAGAATG CAAAGCGGAACTGCTGGAACGTCTGCGTGCGCGTCAGGTTGAAAACGGTCACCGTGGTCCGGCGGGTTC TGACGACGAACTGCTGCAGCTGGCGCGTCAGCTGTACGAACTGCTGGTTCCGCAGGCGATCGGTGCGAA AGGTGACGCGCAGCAGATCGCGCGTAAATTCCTGTCTCCGCTGGCGGACAAAGACGCGGTTGGTGGTCT GGGTATCGCGAAAGCGGGTAACAAACCGCGTTGGGTTCGTATGCGTGAAGCGGGTGAACCGGGTTGGG AAGAAGAAAAAGAAAAAGCGGAAACCCGTAAATCTGCGGACCGTACCGCGGACGTTCTGCGTGCGCTG GCGGACTTCGGTCTGAAACCGCTGATGCGTGTTTACACCGACTCTGAAATGTCTTCTGTTGAATGGAAAC CGCTGCGTAAAGGTCAGGCGGTTCGTACCTGGGACCGTGACATGTTCCAGCAGGCGATCGAACGTATGA TGTCTTGGGAATCTTGGAACCAGCGTGTTGGTCAGGAATACGCGAAACTGGTTGAACAGAAAAACCGTT TCGAACAGAAAAACTTCGTTGGTCAGGAACACCTGGTTCACCTGGTTAACCAGCTGCAGCAGGACATGA AAGAAGCGTCTCCGGGTCTGGAATCTAAAGAACAGACCGCGCACTACGTTACCGGTCGTGCGCTGCGTG GTTCTGACAAAGTTTTCGAAAAATGGGGTAAACTGGCGCCGGACGCGCCGTTCGACCTGTACGACGCGG AAATCAAAAACGTTCAGCGTCGTAACACCCGTCGTTTCGGTTCTCACGACCTGTTCGCGAAACTGGCGG AACCGGAATACCAGGCGCTGTGGCGTGAAGACGCGTCTTTCCTGACCCGTTACGCGGTTTACAACTCTAT CCTGCGTAAACTGAACCACGCGAAAATGTTCGCGACCTTCACCCTGCCGGACGCGACCGCGCACCCGAT CTGGACCCGTTTCGACAAACTGGGTGGTAACCTGCACCAGTACACCTTCCTGTTCAACGAATTCGGTGAA CGTCGTCACGCGATCCGTTTCCACAAACTGCTGAAAGTTGAAAACGGTGTTGCGCGTGAAGTTGACGAC GTTACCGTTCCGATCTCTATGTCTGAACAGCTGGACAACCTGCTGCCGCGTGACCCGAACGAACCGATCG CGCTGTACTTCCGTGACTACGGTGCGGAACAGCACTTCACCGGTGAATTCGGTGGTGCGAAAATCCAGT GCCGTCGTGACCAGCTGGCGCACATGCACCGTCGTCGTGGTGCGCGTGACGTTTACCTGAACGTTTCTGT TCGTGTTCAGTCTCAGTCTGAAGCGCGTGGTGAACGTCGTCCGCCGTACGCGGCGGTTTTCCGTCTGGTT GGTGACAACCACCGTGCGTTCGTTCACTTCGACAAACTGTCTGACTACCTGGCGGAACACCCGGACGAC GGTAAACTGGGTTCTGAAGGTCTGCTGTCTGGTCTGCGTGTTATGTCTGTTGACCTGGGTCTGCGTACCT CTGCGTCTATCTCTGTTTTCCGTGTTGCGCGTAAAGACGAACTGAAACCGAACTCTAAAGGTCGTGTTCC GTTCTTCTTCCCGATCAAAGGTAACGACAACCTGGTTGCGGTTCACGAACGTTCTCAGCTGCTGAAACTG CCGGGTGAAACCGAATCTAAAGACCTGCGTGCGATCCGTGAAGAACGTCAGCGTACCCTGCGTCAGCTG CGTACCCAGCTGGCGTACCTGCGTCTGCTGGTTCGTTGCGGTTCTGAAGACGTTGGTCGTCGTGAACGTT CTTGGGCGAAACTGATCGAACAGCCGGTTGACGCGGCGAACCACATGACCCCGGACTGGCGTGAAGCGT TCGAAAACGAACTGCAGAAACTGAAATCTCTGCACGGTATCTGCTCTGACAAAGAATGGATGGACGCGG TTTACGAATCTGTTCGTCGTGTTTGGCGTCACATGGGTAAACAGGTTCGTGACTGGCGTAAAGACGTTCG TTCTGGTGAACGTCCGAAAATCCGTGGTTACGCGAAAGACGTTGTTGGTGGTAACTCTATCGAACAGAT CGAATACCTGGAACGTCAGTACAAATTCCTGAAATCTTGGTCTTTCTTCGGTAAAGTTTCTGGTCAGGTT ATCCGTGCGGAAAAAGGTTCTCGTTTCGCGATCACCCTGCGTGAACACATCGACCACGCGAAAGAAGAC CGTCTGAAAAAACTGGCGGACCGTATCATCATGGAAGCGCTGGGTTACGTTTACGCGCTGGACGAACGT GGTAAAGGTAAATGGGTTGCGAAATACCCGCCGTGCCAGCTGATCCTGCTGGAAGAACTGTCTGAATAC CAGTTCAACAACGACCGTCCGCCGTCTGAAAACAACCAGCTGATGCAGTGGTCTCACCGTGGTGTTTTCC AGGAACTGATCAACCAGGCGCAGGTTCACGACCTGCTGGTTGGTACCATGTACGCGGCGTTCTCTTCTCG TTTCGACGCGCGTACCGGTGCGCCGGGTATCCGTTGCCGTCGTGTTCCGGCGCGTTGCACCCAGGAACAC AACCCGGAACCGTTCCCGTGGTGGCTGAACAAATTCGTTGTTGAACACACCCTGGACGCGTGCCCGCTG CGTGCGGACGACCTGATCCCGACCGGTGAAGGTGAAATCTTCGTTTCTCCGTTCTCTGCGGAAGAAGGT GACTTCCACCAGATCCACGCGGACCTGAACGCGGCGCAGAACCTGCAGCAGCGTCTGTGGTCTGACTTC GACATCTCTCAGATCCGTCTGCGTTGCGACTGGGGTGAAGTTGACGGTGAACTGGTTCTGATCCCGCGTC TGACCGGTAAACGTACCGCGGACTCTTACTCTAACAAAGTTTTCTACACCAACACCGGTGTTACCTACTA CGAACGTGAACGTGGTAAAAAACGTCGTAAAGTTTTCGCGCAGGAAAAACTGTCTGAAGAAGAAGCGG AACTGCTGGTTGAAGCGGACGAAGCGCGTGAAAAATCTGTTGTTCTGATGCGTGACCCGTCTGGTATCA TCAACCGTGGTAACTGGACCCGTCAGAAAGAATTCTGGTCTATGGTTAACCAGCGTATCGAAGGTTACC TGGTTAAACAGATCCGTTCTCGTGTTCCGCTGCAGGACTCTGCGTGCGAAAACACCGGTGACATCTAAG AAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTC ACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTC ID TTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGA NO: CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 74 TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTT TTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGG GTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCAT CACCATGCGACCCGTTCTTTCATCCTGAAAATCGAACCGAACGAAGAAGTTAAAAAAGGTCTGTGGAAA ACCCACGAAGTTCTGAACCACGGTATCGCGTACTACATGAACATCCTGAAACTGATCCGTCAGGAAGCG ATCTACGAACACCACGAACAGGACCCGAAAAACCCGAAAAAAGTTTCTAAAGCGGAAATCCAGGCGGA ACTGTGGGACTTCGTTCTGAAAATGCAGAAATGCAACTCTTTCACCCACGAAGTTGACAAAGACGTTGTT TTCAACATCCTGCGTGAACTGTACGAAGAACTGGTTCCGTCTTCTGTTGAAAAAAAAGGTGAAGCGAAC CAGCTGTCTAACAAATTCCTGTACCCGCTGGTTGACCCGAACTCTCAGTCTGGTAAAGGTACCGCGTCTT CTGGTCGTAAACCGCGTTGGTACAACCTGAAAATCGCGGGTGACCCGTCTTGGGAAGAAGAAAAAAAA AAATGGGAAGAAGACAAAAAAAAAGACCCGCTGGCGAAAATCCTGGGTAAACTGGCGGAATACGGTCT GATCCCGCTGTTCATCCCGTTCACCGACTCTAACGAACCGATCGTTAAAGAAATCAAATGGATGGAAAA ATCTCGTAACCAGTCTGTTCGTCGTCTGGACAAAGACATGTTCATCCAGGCGCTGGAACGTTTCCTGTCT TGGGAATCTTGGAACCTGAAAGTTAAAGAAGAATACGAAAAAGTTGAAAAAGAACACAAAACCCTGGA AGAACGTATCAAAGAAGACATCCAGGCGTTCAAATCTCTGGAACAGTACGAAAAAGAACGTCAGGAAC AGCTGCTGCGTGACACCCTGAACACCAACGAATACCGTCTGTCTAAACGTGGTCTGCGTGGTTGGCGTG AAATCATCCAGAAATGGCTGAAAATGGACGAAAACGAACCGTCTGAAAAATACCTGGAAGTTTTCAAA GACTACCAGCGTAAACACCCGCGTGAAGCGGGTGACTACTCTGTTTACGAATTCCTGTCTAAAAAAGAA AACCACTTCATCTGGCGTAACCACCCGGAATACCCGTACCTGTACGCGACCTTCTGCGAAATCGACAAA AAAAAAAAAGACGCGAAACAGCAGGCGACCTTCACCCTGGCGGACCCGATCAACCACCCGCTGTGGGT TCGTTTCGAAGAACGTTCTGGTTCTAACCTGAACAAATACCGTATCCTGACCGAACAGCTGCACACCGA AAAACTGAAAAAAAAACTGACCGTTCAGCTGGACCGTCTGATCTACCCGACCGAATCTGGTGGTTGGGA AGAAAAAGGTAAAGTTGACATCGTTCTGCTGCCGTCTCGTCAGTTCTACAACCAGATCTTCCTGGACATC GAAGAAAAAGGTAAACACGCGTTCACCTACAAAGACGAATCTATCAAATTCCCGCTGAAAGGTACCCTG GGTGGTGCGCGTGTTCAGTTCGACCGTGACCACCTGCGTCGTTACCCGCACAAAGTTGAATCTGGTAACG TTGGTCGTATCTACTTCAACATGACCGTTAACATCGAACCGACCGAATCTCCGGTTTCTAAATCTCTGAA AATCCACCGTGACGACTTCCCGAAATTCGTTAACTTCAAACCGAAAGAACTGACCGAATGGATCAAAGA CTCTAAAGGTAAAAAACTGAAATCTGGTATCGAATCTCTGGAAATCGGTCTGCGTGTTATGTCTATCGAC CTGGGTCAGCGTCAGGCGGCGGCGGCGTCTATCTTCGAAGTTGTTGACCAGAAACCGGACATCGAAGGT AAACTGTTCTTCCCGATCAAAGGTACCGAACTGTACGCGGTTCACCGTGCGTCTTTCAACATCAAACTGC CGGGTGAAACCCTGGTTAAATCTCGTGAAGTTCTGCGTAAAGCGCGTGAAGACAACCTGAAACTGATGA ACCAGAAACTGAACTTCCTGCGTAACGTTCTGCACTTCCAGCAGTTCGAAGACATCACCGAACGTGAAA AACGTGTTACCAAATGGATCTCTCGTCAGGAAAACTCTGACGTTCCGCTGGTTTACCAGGACGAACTGAT CCAGATCCGTGAACTGATGTACAAACCGTACAAAGACTGGGTTGCGTTCCTGAAACAGCTGCACAAACG TCTGGAAGTTGAAATCGGTAAAGAAGTTAAACACTGGCGTAAATCTCTGTCTGACGGTCGTAAAGGTCT GTACGGTATCTCTCTGAAAAACATCGACGAAATCGACCGTACCCGTAAATTCCTGCTGCGTTGGTCTCTG CGTCCGACCGAACCGGGTGAAGTTCGTCGTCTGGAACCGGGTCAGCGTTTCGCGATCGACCAGCTGAAC CACCTGAACGCGCTGAAAGAAGACCGTCTGAAAAAAATGGCGAACACCATCATCATGCACGCGCTGGG TTACTGCTACGACGTTCGTAAAAAAAAATGGCAGGCGAAAAACCCGGCGTGCCAGATCATCCTGTTCGA AGACCTGTCTAACTACAACCCGTACGAAGAACGTTCTCGTTTCGAAAACTCTAAACTGATGAAATGGTCT CGTCGTGAAATCCCGCGTCAGGTTGCGCTGCAGGGTGAAATCTACGGTCTGCAGGTTGGTGAAGTTGGT GCGCAGTTCTCTTCTCGTTTCCACGCGAAAACCGGTTCTCCGGGTATCCGTTGCTCTGTTGTTACCAAAG AAAAACTGCAGGACAACCGTTTCTTCAAAAACCTGCAGCGTGAAGGTCGTCTGACCCTGGACAAAATCG CGGTTCTGAAAGAAGGTGACCTGTACCCGGACAAAGGTGGTGAAAAATTCATCTCTCTGTCTAAAGACC GTAAACTGGTTACCACCCACGCGGACATCAACGCGGCGCAGAACCTGCAGAAACGTTTCTGGACCCGTA CCCACGGTTTCTACAAAGTTTACTGCAAAGCGTACCAGGTTGACGGTCAGACCGTTTACATCCCGGAATC TAAAGACCAGAAACAGAAAATCATCGAAGAATTCGGTGAAGGTTACTTCATCCTGAAAGACGGTGTTTA CGAATGGGGTAACGCGGGTAAACTGAAAATCAAAAAAGGTTCTTCTAAACAGTCTTCTTCTGAACTGGT TGACTCTGACATCCTGAAAGACTCTTTCGACCTGGCGTCTGAACTGAAAGGTGAAAAACTGATGCTGTA CCGTGACCCGTCTGGTAACGTTTTCCCGTCTGACAAATGGATGGCGGCGGGTGTTTTCTTCGGTAAACTG GAACGTATCCTGATCTCTAAACTGACCAACCAGTACTCTATCTCTACCATCGAAGACGACTCTTCTAAAC AGTCTATGTAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCA GGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTC ID TTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGA NO: CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 75 TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTT TTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGG GTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCAT CACCATCCGACCCGTACCATCAACCTGAAACTGGTTCTGGGTAAAAACCCGGAAAACGCGACCCTGCGT CGTGCGCTGTTCTCTACCCACCGTCTGGTTAACCAGGCGACCAAACGTATCGAAGAATTCCTGCTGCTGT GCCGTGGTGAAGCGTACCGTACCGTTGACAACGAAGGTAAAGAAGCGGAAATCCCGCGTCACGCGGTTC AGGAAGAAGCGCTGGCGTTCGCGAAAGCGGCGCAGCGTCACAACGGTTGCATCTCTACCTACGAAGACC AGGAAATCCTGGACGTTCTGCGTCAGCTGTACGAACGTCTGGTTCCGTCTGTTAACGAAAACAACGAAG CGGGTGACGCGCAGGCGGCGAACGCGTGGGTTTCTCCGCTGATGTCTGCGGAATCTGAAGGTGGTCTGT CTGTTTACGACAAAGTTCTGGACCCGCCGCCGGTTTGGATGAAACTGAAAGAAGAAAAAGCGCCGGGTT GGGAAGCGGCGTCTCAGATCTGGATCCAGTCTGACGAAGGTCAGTCTCTGCTGAACAAACCGGGTTCTC CGCCGCGTTGGATCCGTAAACTGCGTTCTGGTCAGCCGTGGCAGGACGACTTCGTTTCTGACCAGAAAA AAAAACAGGACGAACTGACCAAAGGTAACGCGCCGCTGATCAAACAGCTGAAAGAAATGGGTCTGCTG CCGCTGGTTAACCCGTTCTTCCGTCACCTGCTGGACCCGGAAGGTAAAGGTGTTTCTCCGTGGGACCGTC TGGCGGTTCGTGCGGCGGTTGCGCACTTCATCTCTTGGGAATCTTGGAACCACCGTACCCGTGCGGAATA CAACTCTCTGAAACTGCGTCGTGACGAATTCGAAGCGGCGTCTGACGAATTCAAAGACGACTTCACCCT GCTGCGTCAGTACGAAGCGAAACGTCACTCTACCCTGAAATCTATCGCGCTGGCGGACGACTCTAACCC GTACCGTATCGGTGTTCGTTCTCTGCGTGCGTGGAACCGTGTTCGTGAAGAATGGATCGACAAAGGTGC GACCGAAGAACAGCGTGTTACCATCCTGTCTAAACTGCAGACCCAGCTGCGTGGTAAATTCGGTGACCC GGACCTGTTCAACTGGCTGGCGCAGGACCGTCACGTTCACCTGTGGTCTCCGCGTGACTCTGTTACCCCG CTGGTTCGTATCAACGCGGTTGACAAAGTTCTGCGTCGTCGTAAACCGTACGCGCTGATGACCTTCGCGC ACCCGCGTTTCCACCCGCGTTGGATCCTGTACGAAGCGCCGGGTGGTTCTAACCTGCGTCAGTACGCGCT GGACTGCACCGAAAACGCGCTGCACATCACCCTGCCGCTGCTGGTTGACGACGCGCACGGTACCTGGAT CGAAAAAAAAATCCGTGTTCCGCTGGCGCCGTCTGGTCAGATCCAGGACCTGACCCTGGAAAAACTGGA AAAAAAAAAAAACCGTCTGTACTACCGTTCTGGTTTCCAGCAGTTCGCGGGTCTGGCGGGTGGTGCGGA AGTTCTGTTCCACCGTCCGTACATGGAACACGACGAACGTTCTGAAGAATCTCTGCTGGAACGTCCGGGT GCGGTTTGGTTCAAACTGACCCTGGACGTTGCGACCCAGGCGCCGCCGAACTGGCTGGACGGTAAAGGT CGTGTTCGTACCCCGCCGGAAGTTCACCACTTCAAAACCGCGCTGTCTAACAAATCTAAACACACCCGTA CCCTGCAGCCGGGTCTGCGTGTTCTGTCTGTTGACCTGGGTATGCGTACCTTCGCGTCTTGCTCTGTTTTC GAACTGATCGAAGGTAAACCGGAAACCGGTCGTGCGTTCCCGGTTGCGGACGAACGTTCTATGGACTCT CCGAACAAACTGTGGGCGAAACACGAACGTTCTTTCAAACTGACCCTGCCGGGTGAAACCCCGTCTCGT AAAGAAGAAGAAGAACGTTCTATCGCGCGTGCGGAAATCTACGCGCTGAAACGTGACATCCAGCGTCTG AAATCTCTGCTGCGTCTGGGTGAAGAAGACAACGACAACCGTCGTGACGCGCTGCTGGAACAGTTCTTC AAAGGTTGGGGTGAAGAAGACGTTGTTCCGGGTCAGGCGTTCCCGCGTTCTCTGTTCCAGGGTCTGGGT GCGGCGCCGTTCCGTTCTACCCCGGAACTGTGGCGTCAGCACTGCCAGACCTACTACGACAAAGCGGAA GCGTGCCTGGCGAAACACATCTCTGACTGGCGTAAACGTACCCGTCCGCGTCCGACCTCTCGTGAAATGT GGTACAAAACCCGTTCTTACCACGGTGGTAAATCTATCTGGATGCTGGAATACCTGGACGCGGTTCGTA AACTGCTGCTGTCTTGGTCTCTGCGTGGTCGTACCTACGGTGCGATCAACCGTCAGGACACCGCGCGTTT CGGTTCTCTGGCGTCTCGTCTGCTGCACCACATCAACTCTCTGAAAGAAGACCGTATCAAAACCGGTGCG GACTCTATCGTTCAGGCGGCGCGTGGTTACATCCCGCTGCCGCACGGTAAAGGTTGGGAACAGCGTTAC GAACCGTGCCAGCTGATCCTGTTCGAAGACCTGGCGCGTTACCGTTTCCGTGTTGACCGTCCGCGTCGTG AAAACTCTCAGCTGATGCAGTGGAACCACCGTGCGATCGTTGCGGAAACCACCATGCAGGCGGAACTGT ACGGTCAGATCGTTGAAAACACCGCGGCGGGTTTCTCTTCTCGTTTCCACGCGGCGACCGGTGCGCCGG GTGTTCGTTGCCGTTTCCTGCTGGAACGTGACTTCGACAACGACCTGCCGAAACCGTACCTGCTGCGTGA ACTGTCTTGGATGCTGGGTAACACCAAAGTTGAATCTGAAGAAGAAAAACTGCGTCTGCTGTCTGAAAA AATCCGTCCGGGTTCTCTGGTTCCGTGGGACGGTGGTGAACAGTTCGCGACCCTGCACCCGAAACGTCA GACCCTGTGCGTTATCCACGCGGACATGAACGCGGCGCAGAACCTGCAGCGTCGTTTCTTCGGTCGTTGC GGTGAAGCGTTCCGTCTGGTTTGCCAGCCGCACGGTGACGACGTTCTGCGTCTGGCGTCTACCCCGGGTG CGCGTCTGCTGGGTGCGCTGCAGCAGCTGGAAAACGGTCAGGGTGCGTTCGAACTGGTTCGTGACATGG GTTCTACCTCTCAGATGAACCGTTTCGTTATGAAATCTCTGGGTAAAAAAAAAATCAAACCGCTGCAGG ACAACAACGGTGACGACGAACTGGAAGACGTTCTGTCTGTTCTGCCGGAAGAAGACGACACCGGTCGTA TCACCGTTTTCCGTGACTCTTCTGGTATCTTCTTCCCGTGCAACGTTTGGATCCCGGCGAAACAGTTCTGG CCGGCGGTTCGTGCGATGATCTGGAAAGTTATGGCGTCTCACTCTCTGGGTTAAGAAATCATCCTTAGCG AAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATT ACTCAGGAAGCAAAGAGGATTACA SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTC ID TTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGA NO: CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 76 TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTT TTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGG GTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCAT CACCATACCAAACTGCGTCACCGTCAGAAAAAACTGACCCACGACTGGGCGGGTTCTAAAAAACGTGAA GTTCTGGGTTCTAACGGTAAACTGCAGAACCCGCTGCTGATGCCGGTTAAAAAAGGTCAGGTTACCGAA TTCCGTAAAGCGTTCTCTGCGTACGCGCGTGCGACCAAAGGTGAAATGACCGACGGTCGTAAAAACATG TTCACCCACTCTTTCGAACCGTTCAAAACCAAACCGTCTCTGCACCAGTGCGAACTGGCGGACAAAGCG TACCAGTCTCTGCACTCTTACCTGCCGGGTTCTCTGGCGCACTTCCTGCTGTCTGCGCACGCGCTGGGTTT CCGTATCTTCTCTAAATCTGGTGAAGCGACCGCGTTCCAGGCGTCTTCTAAAATCGAAGCGTACGAATCT AAACTGGCGTCTGAACTGGCGTGCGTTGACCTGTCTATCCAGAACCTGACCATCTCTACCCTGTTCAACG CGCTGACCACCTCTGTTCGTGGTAAAGGTGAAGAAACCTCTGCGGACCCGCTGATCGCGCGTTTCTACAC CCTGCTGACCGGTAAACCGCTGTCTCGTGACACCCAGGGTCCGGAACGTGACCTGGCGGAAGTTATCTC TCGTAAAATCGCGTCTTCTTTCGGTACCTGGAAAGAAATGACCGCGAACCCGCTGCAGTCTCTGCAGTTC TTCGAAGAAGAACTGCACGCGCTGGACGCGAACGTTTCTCTGTCTCCGGCGTTCGACGTTCTGATCAAAA TGAACGACCTGCAGGGTGACCTGAAAAACCGTACCATCGTTTTCGACCCGGACGCGCCGGTTTTCGAAT ACAACGCGGAAGACCCGGCGGACATCATCATCAAACTGACCGCGCGTTACGCGAAAGAAGCGGTTATC AAAAACCAGAACGTTGGTAACTACGTTAAAAACGCGATCACCACCACCAACGCGAACGGTCTGGGTTGG CTGCTGAACAAAGGTCTGTCTCTGCTGCCGGTTTCTACCGACGACGAACTGCTGGAATTCATCGGTGTTG AACGTTCTCACCCGTCTTGCCACGCGCTGATCGAACTGATCGCGCAGCTGGAAGCGCCGGAACTGTTCG AAAAAAACGTTTTCTCTGACACCCGTTCTGAAGTTCAGGGTATGATCGACTCTGCGGTTTCTAACCACAT CGCGCGTCTGTCTTCTTCTCGTAACTCTCTGTCTATGGACTCTGAAGAACTGGAACGTCTGATCAAATCTT TCCAGATCCACACCCCGCACTGCTCTCTGTTCATCGGTGCGCAGTCTCTGTCTCAGCAGCTGGAATCTCT GCCGGAAGCGCTGCAGTCTGGTGTTAACTCTGCGGACATCCTGCTGGGTTCTACCCAGTACATGCTGACC AACTCTCTGGTTGAAGAATCTATCGCGACCTACCAGCGTACCCTGAACCGTATCAACTACCTGTCTGGTG TTGCGGGTCAGATCAACGGTGCGATCAAACGTAAAGCGATCGACGGTGAAAAAATCCACCTGCCGGCG GCGTGGTCTGAACTGATCTCTCTGCCGTTCATCGGTCAGCCGGTTATCGACGTTGAATCTGACCTGGCGC ACCTGAAAAACCAGTACCAGACCCTGTCTAACGAATTCGACACCCTGATCTCTGCGCTGCAGAAAAACT TCGACCTGAACTTCAACAAAGCGCTGCTGAACCGTACCCAGCACTTCGAAGCGATGTGCCGTTCTACCA AAAAAAACGCGCTGTCTAAACCGGAAATCGTTTCTTACCGTGACCTGCTGGCGCGTCTGACCTCTTGCCT GTACCGTGGTTCTCTGGTTCTGCGTCGTGCGGGTATCGAAGTTCTGAAAAAACACAAAATCTTCGAATCT AACTCTGAACTGCGTGAACACGTTCACGAACGTAAACACTTCGTTTTCGTTTCTCCGCTGGACCGTAAAG CGAAAAAACTGCTGCGTCTGACCGACTCTCGTCCGGACCTGCTGCACGTTATCGACGAAATCCTGCAGC ACGACAACCTGGAAAACAAAGACCGTGAATCTCTGTGGCTGGTTCGTTCTGGTTACCTGCTGGCGGGTCT GCCGGACCAGCTGTCTTCTTCTTTCATCAACCTGCCGATCATCACCCAGAAAGGTGACCGTCGTCTGATC GACCTGATCCAGTACGACCAGATCAACCGTGACGCGTTCGTTATGCTGGTTACCTCTGCGTTCAAATCTA ACCTGTCTGGTCTGCAGTACCGTGCGAACAAACAGTCTTTCGTTGTTACCCGTACCCTGTCTCCGTACCT GGGTTCTAAACTGGTTTACGTTCCGAAAGACAAAGACTGGCTGGTTCCGTCTCAGATGTTCGAAGGTCGT TTCGCGGACATCCTGCAGTCTGACTACATGGTTTGGAAAGACGCGGGTCGTCTGTGCGTTATCGACACCG CGAAACACCTGTCTAACATCAAAAAATCTGTTTTCTCTTCTGAAGAAGTTCTGGCGTTCCTGCGTGAACT GCCGCACCGTACCTTCATCCAGACCGAAGTTCGTGGTCTGGGTGTTAACGTTGACGGTATCGCGTTCAAC AACGGTGACATCCCGTCTCTGAAAACCTTCTCTAACTGCGTTCAGGTTAAAGTTTCTCGTACCAACACCT CTCTGGTTCAGACCCTGAACCGTTGGTTCGAAGGTGGTAAAGTTTCTCCGCCGTCTATCCAGTTCGAACG TGCGTACTACAAAAAAGACGACCAGATCCACGAAGACGCGGCGAAACGTAAAATCCGTTTCCAGATGC CGGCGACCGAACTGGTTCACGCGTCTGACGACGCGGGTTGGACCCCGTCTTACCTGCTGGGTATCGACC CGGGTGAATACGGTATGGGTCTGTCTCTGGTTTCTATCAACAACGGTGAAGTTCTGGACTCTGGTTTCAT CCACATCAACTCTCTGATCAACTTCGCGTCTAAAAAATCTAACCACCAGACCAAAGTTGTTCCGCGTCAG CAGTACAAATCTCCGTACGCGAACTACCTGGAACAGTCTAAAGACTCTGCGGCGGGTGACATCGCGCAC ATCCTGGACCGTCTGATCTACAAACTGAACGCGCTGCCGGTTTTCGAAGCGCTGTCTGGTAACTCTCAGT CTGCGGCGGACCAGGTTTGGACCAAAGTTCTGTCTTTCTACACCTGGGGTGACAACGACGCGCAGAACT CTATCCGTAAACAGCACTGGTTCGGTGCGTCTCACTGGGACATCAAAGGTATGCTGCGTCAGCCGCCGA CCGAAAAAAAACCGAAACCGTACATCGCGTTCCCGGGTTCTCAGGTTTCTTCTTACGGTAACTCTCAGCG TTGCTCTTGCTGCGGTCGTAACCCGATCGAACAGCTGCGTGAAATGGCGAAAGACACCTCTATCAAAGA ACTGAAAATCCGTAACTCTGAAATCCAGCTGTTCGACGGTACCATCAAACTGTTCAACCCGGACCCGTCT ACCGTTATCGAACGTCGTCGTCACAACCTGGGTCCGTCTCGTATCCCGGTTGCGGACCGTACCTTCAAAA ACATCTCTCCGTCTTCTCTGGAATTCAAAGAACTGATCACCATCGTTTCTCGTTCTATCCGTCACTCTCCG GAATTCATCGCGAAAAAACGTGGTATCGGTTCTGAATACTTCTGCGCGTACTCTGACTGCAACTCTTCTC TGAACTCTGAAGCGAACGCGGCGGCGAACGTTGCGCAGAAATTCCAGAAACAGCTGTTCTTCGAACTGT AAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAAT ATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTC ID TTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGA NO: CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 77 TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTT TTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGG GTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCAT CACCATAAACGTATCCTGAACTCTCTGAAAGTTGCGGCGCTGCGTCTGCTGTTCCGTGGTAAAGGTTCTG AACTGGTTAAAACCGTTAAATACCCGCTGGTTTCTCCGGTTCAGGGTGCGGTTGAAGAACTGGCGGAAG CGATCCGTCACGACAACCTGCACCTGTTCGGTCAGAAAGAAATCGTTGACCTGATGGAAAAAGACGAAG GTACCCAGGTTTACTCTGTTGTTGACTTCTGGCTGGACACCCTGCGTCTGGGTATGTTCTTCTCTCCGTCT GCGAACGCGCTGAAAATCACCCTGGGTAAATTCAACTCTGACCAGGTTTCTCCGTTCCGTAAAGTTCTGG AACAGTCTCCGTTCTTCCTGGCGGGTCGTCTGAAAGTTGAACCGGCGGAACGTATCCTGTCTGTTGAAAT CCGTAAAATCGGTAAACGTGAAAACCGTGTTGAAAACTACGCGGCGGACGTTGAAACCTGCTTCATCGG TCAGCTGTCTTCTGACGAAAAACAGTCTATCCAGAAACTGGCGAACGACATCTGGGACTCTAAAGACCA CGAAGAACAGCGTATGCTGAAAGCGGACTTCTTCGCGATCCCGCTGATCAAAGACCCGAAAGCGGTTAC CGAAGAAGACCCGGAAAACGAAACCGCGGGTAAACAGAAACCGCTGGAACTGTGCGTTTGCCTGGTTC CGGAACTGTACACCCGTGGTTTCGGTTCTATCGCGGACTTCCTGGTTCAGCGTCTGACCCTGCTGCGTGA CAAAATGTCTACCGACACCGCGGAAGACTGCCTGGAATACGTTGGTATCGAAGAAGAAAAAGGTAACG GTATGAACTCTCTGCTGGGTACCTTCCTGAAAAACCTGCAGGGTGACGGTTTCGAACAGATCTTCCAGTT CATGCTGGGTTCTTACGTTGGTTGGCAGGGTAAAGAAGACGTTCTGCGTGAACGTCTGGACCTGCTGGC GGAAAAAGTTAAACGTCTGCCGAAACCGAAATTCGCGGGTGAATGGTCTGGTCACCGTATGTTCCTGCA CGGTCAGCTGAAATCTTGGTCTTCTAACTTCTTCCGTCTGTTCAACGAAACCCGTGAACTGCTGGAATCT ATCAAATCTGACATCCAGCACGCGACCATGCTGATCTCTTACGTTGAAGAAAAAGGTGGTTACCACCCG CAGCTGCTGTCTCAGTACCGTAAACTGATGGAACAGCTGCCGGCGCTGCGTACCAAAGTTCTGGACCCG GAAATCGAAATGACCCACATGTCTGAAGCGGTTCGTTCTTACATCATGATCCACAAATCTGTTGCGGGTT TCCTGCCGGACCTGCTGGAATCTCTGGACCGTGACAAAGACCGTGAATTCCTGCTGTCTATCTTCCCGCG TATCCCGAAAATCGACAAAAAAACCAAAGAAATCGTTGCGTGGGAACTGCCGGGTGAACCGGAAGAAG GTTACCTGTTCACCGCGAACAACCTGTTCCGTAACTTCCTGGAAAACCCGAAACACGTTCCGCGTTTCAT GGCGGAACGTATCCCGGAAGACTGGACCCGTCTGCGTTCTGCGCCGGTTTGGTTCGACGGTATGGTTAA ACAGTGGCAGAAAGTTGTTAACCAGCTGGTTGAATCTCCGGGTGCGCTGTACCAGTTCAACGAATCTTTC CTGCGTCAGCGTCTGCAGGCGATGCTGACCGTTTACAAACGTGACCTGCAGACCGAAAAATTCCTGAAA CTGCTGGCGGACGTTTGCCGTCCGCTGGTTGACTTCTTCGGTCTGGGTGGTAACGACATCATCTTCAAAT CTTGCCAGGACCCGCGTAAACAGTGGCAGACCGTTATCCCGCTGTCTGTTCCGGCGGACGTTTACACCGC GTGCGAAGGTCTGGCGATCCGTCTGCGTGAAACCCTGGGTTTCGAATGGAAAAACCTGAAAGGTCACGA ACGTGAAGACTTCCTGCGTCTGCACCAGCTGCTGGGTAACCTGCTGTTCTGGATCCGTGACGCGAAACTG GTTGTTAAACTGGAAGACTGGATGAACAACCCGTGCGTTCAGGAATACGTTGAAGCGCGTAAAGCGATC GACCTGCCGCTGGAAATCTTCGGTTTCGAAGTTCCGATCTTCCTGAACGGTTACCTGTTCTCTGAACTGC GTCAGCTGGAACTGCTGCTGCGTCGTAAATCTGTTATGACCTCTTACTCTGTTAAAACCACCGGTTCTCC GAACCGTCTGTTCCAGCTGGTTTACCTGCCGCTGAACCCGTCTGACCCGGAAAAAAAAAACTCTAACAA CTTCCAGGAACGTCTGGACACCCCGACCGGTCTGTCTCGTCGTTTCCTGGACCTGACCCTGGACGCGTTC GCGGGTAAACTGCTGACCGACCCGGTTACCCAGGAACTGAAAACCATGGCGGGTTTCTACGACCACCTG TTCGGTTTCAAACTGCCGTGCAAACTGGCGGCGATGTCTAACCACCCGGGTTCTTCTTCTAAAATGGTTG TTCTGGCGAAACCGAAAAAAGGTGTTGCGTCTAACATCGGTTTCGAACCGATCCCGGACCCGGCGCACC CGGTTTTCCGTGTTCGTTCTTCTTGGCCGGAACTGAAATACCTGGAAGGTCTGCTGTACCTGCCGGAAGA CACCCCGCTGACCATCGAACTGGCGGAAACCTCTGTTTCTTGCCAGTCTGTTTCTTCTGTTGCGTTCGACC TGAAAAACCTGACCACCATCCTGGGTCGTGTTGGTGAATTCCGTGTTACCGCGGACCAGCCGTTCAAACT GACCCCGATCATCCCGGAAAAAGAAGAATCTTTCATCGGTAAAACCTACCTGGGTCTGGACGCGGGTGA ACGTTCTGGTGTTGGTTTCGCGATCGTTACCGTTGACGGTGACGGTTACGAAGTTCAGCGTCTGGGTGTT CACGAAGACACCCAGCTGATGGCGCTGCAGCAGGTTGCGTCTAAATCTCTGAAAGAACCGGTTTTCCAG CCGCTGCGTAAAGGTACCTTCCGTCAGCAGGAACGTATCCGTAAATCTCTGCGTGGTTGCTACTGGAACT TCTACCACGCGCTGATGATCAAATACCGTGCGAAAGTTGTTCACGAAGAATCTGTTGGTTCTTCTGGTCT GGTTGGTCAGTGGCTGCGTGCGTTCCAGAAAGACCTGAAAAAAGCGGACGTTCTGCCGAAAAAAGGTG GTAAAAACGGTGTTGACAAAAAAAAACGTGAATCTTCTGCGCAGGACACCCTGTGGGGTGGTGCGTTCT CTAAAAAAGAAGAACAGCAGATCGCGTTCGAAGTTCAGGCGGCGGGTTCTTCTCAGTTCTGCCTGAAAT GCGGTTGGTGGTTCCAGCTGGGTATGCGTGAAGTTAACCGTGTTCAGGAATCTGGTGTTGTTCTGGACTG GAACCGTTCTATCGTTACCTTCCTGATCGAATCTTCTGGTGAAAAAGTTTACGGTTTCTCTCCGCAGCAG CTGGAAAAAGGTTTCCGTCCGGACATCGAAACCTTCAAAAAAATGGTTCGTGACTTCATGCGTCCGCCG ATGTTCGACCGTAAAGGTCGTCCGGCGGCGGCGTACGAACGTTTCGTTCTGGGTCGTCGTCACCGTCGTT ACCGTTTCGACAAAGTTTTCGAAGAACGTTTCGGTCGTTCTGCGCTGTTCATCTGCCCGCGTGTTGGTTGC GGTAACTTCGACCACTCTTCTGAACAGTCTGCGGTTGTTCTGGCGCTGATCGGTTACATCGCGGACAAAG AAGGTATGTCTGGTAAAAAACTGGTTTACGTTCGTCTGGCGGAACTGATGGCGGAATGGAAACTGAAAA AACTGGAACGTTCTCGTGTTGAAGAACAGTCTTCTGCGCAGTAAGAAATCATCCTTAGCGAAAGCTAAG GATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAA GCAAAGAGGATTACA SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTC ID TTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGA NO: CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 78 TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTT TTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGG GTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCAT CACCATGCGGAATCTAAACAGATGCAGTGCCGTAAATGCGGTGCGTCTATGAAATACGAAGTTATCGGT CTGGGTAAAAAATCTTGCCGTTACATGTGCCCGGACTGCGGTAACCACACCTCTGCGCGTAAAATCCAG AACAAAAAAAAACGTGACAAAAAATACGGTTCTGCGTCTAAAGCGCAGTCTCAGCGTATCGCGGTTGCG GGTGCGCTGTACCCGGACAAAAAAGTTCAGACCATCAAAACCTACAAATACCCGGCGGACCTGAACGGT GAAGTTCACGACTCTGGTGTTGCGGAAAAAATCGCGCAGGCGATCCAGGAAGACGAAATCGGTCTGCTG GGTCCGTCTTCTGAATACGCGTGCTGGATCGCGTCTCAGAAACAGTCTGAACCGTACTCTGTTGTTGACT TCTGGTTCGACGCGGTTTGCGCGGGTGGTGTTTTCGCGTACTCTGGTGCGCGTCTGCTGTCTACCGTTCTG CAGCTGTCTGGTGAAGAATCTGTTCTGCGTGCGGCGCTGGCGTCTTCTCCGTTCGTTGACGACATCAACC TGGCGCAGGCGGAAAAATTCCTGGCGGTTTCTCGTCGTACCGGTCAGGACAAACTGGGTAAACGTATCG GTGAATGCTTCGCGGAAGGTCGTCTGGAAGCGCTGGGTATCAAAGACCGTATGCGTGAATTCGTTCAGG CGATCGACGTTGCGCAGACCGCGGGTCAGCGTTTCGCGGCGAAACTGAAAATCTTCGGTATCTCTCAGA TGCCGGAAGCGAAACAGTGGAACAACGACTCTGGTCTGACCGTTTGCATCCTGCCGGACTACTACGTTC CGGAAGAAAACCGTGCGGACCAGCTGGTTGTTCTGCTGCGTCGTCTGCGTGAAATCGCGTACTGCATGG GTATCGAAGACGAAGCGGGTTTCGAACACCTGGGTATCGACCCGGGTGCGCTGTCTAACTTCTCTAACG GTAACCCGAAACGTGGTTTCCTGGGTCGTCTGCTGAACAACGACATCATCGCGCTGGCGAACAACATGT CTGCGATGACCCCGTACTGGGAAGGTCGTAAAGGTGAACTGATCGAACGTCTGGCGTGGCTGAAACACC GTGCGGAAGGTCTGTACCTGAAAGAACCGCACTTCGGTAACTCTTGGGCGGACCACCGTTCTCGTATCTT CTCTCGTATCGCGGGTTGGCTGTCTGGTTGCGCGGGTAAACTGAAAATCGCGAAAGACCAGATCTCTGG TGTTCGTACCGACCTGTTCCTGCTGAAACGTCTGCTGGACGCGGTTCCGCAGTCTGCGCCGTCTCCGGAC TTCATCGCGTCTATCTCTGCGCTGGACCGTTTCCTGGAAGCGGCGGAATCTTCTCAGGACCCGGCGGAAC AGGTTCGTGCGCTGTACGCGTTCCACCTGAACGCGCCGGCGGTTCGTTCTATCGCGAACAAAGCGGTTCA GCGTTCTGACTCTCAGGAATGGCTGATCAAAGAACTGGACGCGGTTGACCACCTGGAATTCAACAAAGC GTTCCCGTTCTTCTCTGACACCGGTAAAAAAAAAAAAAAAGGTGCGAACTCTAACGGTGCGCCGTCTGA AGAAGAATACACCGAAACCGAATCTATCCAGCAGCCGGAAGACGCGGAACAGGAAGTTAACGGTCAGG AAGGTAACGGTGCGTCTAAAAACCAGAAAAAATTCCAGCGTATCCCGCGTTTCTTCGGTGAAGGTTCTC GTTCTGAATACCGTATCCTGACCGAAGCGCCGCAGTACTTCGACATGTTCTGCAACAACATGCGTGCGAT CTTCATGCAGCTGGAATCTCAGCCGCGTAAAGCGCCGCGTGACTTCAAATGCTTCCTGCAGAACCGTCTG CAGAAACTGTACAAACAGACCTTCCTGAACGCGCGTTCTAACAAATGCCGTGCGCTGCTGGAATCTGTT CTGATCTCTTGGGGTGAATTCTACACCTACGGTGCGAACGAAAAAAAATTCCGTCTGCGTCACGAAGCG TCTGAACGTTCTTCTGACCCGGACTACGTTGTTCAGCAGGCGCTGGAAATCGCGCGTCGTCTGTTCCTGT TCGGTTTCGAATGGCGTGACTGCTCTGCGGGTGAACGTGTTGACCTGGTTGAAATCCACAAAAAAGCGA TCTCTTTCCTGCTGGCGATCACCCAGGCGGAAGTTTCTGTTGGTTCTTACAACTGGCTGGGTAACTCTACC GTTTCTCGTTACCTGTCTGTTGCGGGTACCGACACCCTGTACGGTACCCAGCTGGAAGAATTCCTGAACG CGACCGTTCTGTCTCAGATGCGTGGTCTGGCGATCCGTCTGTCTTCTCAGGAACTGAAAGACGGTTTCGA CGTTCAGCTGGAATCTTCTTGCCAGGACAACCTGCAGCACCTGCTGGTTTACCGTGCGTCTCGTGACCTG GCGGCGTGCAAACGTGCGACCTGCCCGGCGGAACTGGACCCGAAAATCCTGGTTCTGCCGGTTGGTGCG TTCATCGCGTCTGTTATGAAAATGATCGAACGTGGTGACGAACCGCTGGCGGGTGCGTACCTGCGTCAC CGTCCGCACTCTTTCGGTTGGCAGATCCGTGTTCGTGGTGTTGCGGAAGTTGGTATGGACCAGGGTACCG CGCTGGCGTTCCAGAAACCGACCGAATCTGAACCGTTCAAAATCAAACCGTTCTCTGCGCAGTACGGTC CGGTTCTGTGGCTGAACTCTTCTTCTTACTCTCAGTCTCAGTACCTGGACGGTTTCCTGTCTCAGCCGAAA AACTGGTCTATGCGTGTTCTGCCGCAGGCGGGTTCTGTTCGTGTTGAACAGCGTGTTGCGCTGATCTGGA ACCTGCAGGCGGGTAAAATGCGTCTGGAACGTTCTGGTGCGCGTGCGTTCTTCATGCCGGTTCCGTTCTC TTTCCGTCCGTCTGGTTCTGGTGACGAAGCGGTTCTGGCGCCGAACCGTTACCTGGGTCTGTTCCCGCAC TCTGGTGGTATCGAATACGCGGTTGTTGACGTTCTGGACTCTGCGGGTTTCAAAATCCTGGAACGTGGTA CCATCGCGGTTAACGGTTTCTCTCAGAAACGTGGTGAACGTCAGGAAGAAGCGCACCGTGAAAAACAGC GTCGTGGTATCTCTGACATCGGTCGTAAAAAACCGGTTCAGGCGGAAGTTGACGCGGCGAACGAACTGC ACCGTAAATACACCGACGTTGCGACCCGTCTGGGTTGCCGTATCGTTGTTCAGTGGGCGCCGCAGCCGA AACCGGGTACCGCGCCGACCGCGCAGACCGTTTACGCGCGTGCGGTTCGTACCGAAGCGCCGCGTTCTG GTAACCAGGAAGACCACGCGCGTATGAAATCTTCTTGGGGTTACACCTGGGGTACCTACTGGGAAAAAC GTAAACCGGAAGACATCCTGGGTATCTCTACCCAGGTTTACTGGACCGGTGGTATCGGTGAATCTTGCCC GGCGGTTGCGGTTGCGCTGCTGGGTCACATCCGTGCGACCTCTACCCAGACCGAATGGGAAAAAGAAGA AGTTGTTTTCGGTCGTCTGAAAAAATTCTTCCCGTCTTAAGAAATCATCCTTAGCGAAAGCTAAGGATTT TTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAA GAGGATTACA SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTC ID TTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGA NO: CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 79 TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTT TTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGG GTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCAT CACCATGAAAAACGTATCAACAAAATCCGTAAAAAACTGTCTGCGGACAACGCGACCAAACCGGTTTCT CGTTCTGGTCCGATGAAAACCCTGCTGGTTCGTGTTATGACCGACGACCTGAAAAAACGTCTGGAAAAA CGTCGTAAAAAACCGGAAGTTATGCCGCAGGTTATCTCTAACAACGCGGCGAACAACCTGCGTATGCTG CTGGACGACTACACCAAAATGAAAGAAGCGATCCTGCAGGTTTACTGGCAGGAATTCAAAGACGACCA CGTTGGTCTGATGTGCAAATTCGCGCAGCCGGCGTCTAAAAAAATCGACCAGAACAAACTGAAACCGGA AATGGACGAAAAAGGTAACCTGACCACCGCGGGTTTCGCGTGCTCTCAGTGCGGTCAGCCGCTGTTCGT TTACAAACTGGAACAGGTTTCTGAAAAAGGTAAAGCGTACACCAACTACTTCGGTCGTTGCAACGTTGC GGAACACGAAAAACTGATCCTGCTGGCGCAGCTGAAACCGGAAAAAGACTCTGACGAAGCGGTTACCT ACTCTCTGGGTAAATTCGGTCAGCGTGCGCTGGACTTCTACTCTATCCACGTTACCAAAGAATCTACCCA CCCGGTTAAACCGCTGGCGCAGATCGCGGGTAACCGTTACGCGTCTGGTCCGGTTGGTAAAGCGCTGTC TGACGCGTGCATGGGTACCATCGCGTCTTTCCTGTCTAAATACCAGGACATCATCATCGAACACCAGAA AGTTGTTAAAGGTAACCAGAAACGTCTGGAATCTCTGCGTGAACTGGCGGGTAAAGAAAACCTGGAATA CCCGTCTGTTACCCTGCCGCCGCAGCCGCACACCAAAGAAGGTGTTGACGCGTACAACGAAGTTATCGC GCGTGTTCGTATGTGGGTTAACCTGAACCTGTGGCAGAAACTGAAACTGTCTCGTGACGACGCGAAACC GCTGCTGCGTCTGAAAGGTTTCCCGTCTTTCCCGGTTGTTGAACGTCGTGAAAACGAAGTTGACTGGTGG AACACCATCAACGAAGTTAAAAAACTGATCGACGCGAAACGTGACATGGGTCGTGTTTTCTGGTCTGGT GTTACCGCGGAAAAACGTAACACCATCCTGGAAGGTTACAACTACCTGCCGAACGAAAACGACCACAA AAAACGTGAAGGTTCTCTGGAAAACCCGAAAAAACCGGCGAAACGTCAGTTCGGTGACCTGCTGCTGTA CCTGGAAAAAAAATACGCGGGTGACTGGGGTAAAGTTTTCGACGAAGCGTGGGAACGTATCGACAAAA AAATCGCGGGTCTGACCTCTCACATCGAACGTGAAGAAGCGCGTAACGCGGAAGACGCGCAGTCTAAA GCGGTTCTGACCGACTGGCTGCGTGCGAAAGCGTCTTTCGTTCTGGAACGTCTGAAAGAAATGGACGAA AAAGAATTCTACGCGTGCGAAATCCAGCTGCAGAAATGGTACGGTGACCTGCGTGGTAACCCGTTCGCG GTTGAAGCGGAAAACCGTGTTGTTGACATCTCTGGTTTCTCTATCGGTTCTGACGGTCACTCTATCCAGT ACCGTAACCTGCTGGCGTGGAAATACCTGGAAAACGGTAAACGTGAATTCTACCTGCTGATGAACTACG GTAAAAAAGGTCGTATCCGTTTCACCGACGGTACCGACATCAAAAAATCTGGTAAATGGCAGGGTCTGC TGTACGGTGGTGGTAAAGCGAAAGTTATCGACCTGACCTTCGACCCGGACGACGAACAGCTGATCATCC TGCCGCTGGCGTTCGGTACCCGTCAGGGTCGTGAATTCATCTGGAACGACCTGCTGTCTCTGGAAACCGG TCTGATCAAACTGGCGAACGGTCGTGTTATCGAAAAAACCATCTACAACAAAAAAATCGGTCGTGACGA ACCGGCGCTGTTCGTTGCGCTGACCTTCGAACGTCGTGAAGTTGTTGACCCGTCTAACATCAAACCGGTT AACCTGATCGGTGTTGACCGTGGTGAAAACATCCCGGCGGTTATCGCGCTGACCGACCCGGAAGGTTGC CCGCTGCCGGAATTCAAAGACTCTTCTGGTGGTCCGACCGACATCCTGCGTATCGGTGAAGGTTACAAA GAAAAACAGCGTGCGATCCAGGCGGCGAAAGAAGTTGAACAGCGTCGTGCGGGTGGTTACTCTCGTAA ATTCGCGTCTAAATCTCGTAACCTGGCGGACGACATGGTTCGTAACTCTGCGCGTGACCTGTTCTACCAC GCGGTTACCCACGACGCGGTTCTGGTTTTCGAAAACCTGTCTCGTGGTTTCGGTCGTCAGGGTAAACGTA CCTTCATGACCGAACGTCAGTACACCAAAATGGAAGACTGGCTGACCGCGAAACTGGCGTACGAAGGTC TGACCTCTAAAACCTACCTGTCTAAAACCCTGGCGCAGTACACCTCTAAAACCTGCTCTAACTGCGGTTT CACCATCACCACCGCGGACTACGACGGTATGCTGGTTCGTCTGAAAAAAACCTCTGACGGTTGGGCGAC CACCCTGAACAACAAAGAACTGAAAGCGGAAGGTCAGATCACCTACTACAACCGTTACAAACGTCAGA CCGTTGAAAAAGAACTGTCTGCGGAACTGGACCGTCTGTCTGAAGAATCTGGTAACAACGACATCTCTA AATGGACCAAAGGTCGTCGTGACGAAGCGCTGTTCCTGCTGAAAAAACGTTTCTCTCACCGTCCGGTTCA GGAACAGTTCGTTTGCCTGGACTGCGGTCACGAAGTTCACGCGGACGAACAGGCGGCGCTGAACATCGC GCGTTCTTGGCTGTTCCTGAACTCTAACTCTACCGAATTCAAATCTTACAAATCTGGTAAACAGCCGTTC GTTGGTGCGTGGCAGGCGTTCTACAAACGTCGTCTGAAAGAAGTTTGGAAACCGAACGCGTAAGAAATC ATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGACCCTCAGGTTAAATATTCACTCA GGAAGTTATTACTCAGGAAGCAAAGAGGATTACA SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTGCCGTCACTGCGTC ID TTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACAAAGCGGGA NO: CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATT 80 TGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTT TTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGCTAGCAGTAATACGACTCACTATAGGG GTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTTTAAGAGGAGGATATACCATGCACCATCATCAT CACCATAAACGTATCAACAAAATCCGTCGTCGTCTGGTTAAAGACTCTAACACCAAAAAAGCGGGTAAA ACCGGTCCGATGAAAACCCTGCTGGTTCGTGTTATGACCCCGGACCTGCGTGAACGTCTGGAAAACCTG CGTAAAAAACCGGAAAACATCCCGCAGCCGATCTCTAACACCTCTCGTGCGAACCTGAACAAACTGCTG ACCGACTACACCGAAATGAAAAAAGCGATCCTGCACGTTTACTGGGAAGAATTCCAGAAAGACCCGGTT GGTCTGATGTCTCGTGTTGCGCAGCCGGCGCCGAAAAACATCGACCAGCGTAAACTGATCCCGGTTAAA GACGGTAACGAACGTCTGACCTCTTCTGGTTTCGCGTGCTCTCAGTGCTGCCAGCCGCTGTACGTTTACA AACTGGAACAGGTTAACGACAAAGGTAAACCGCACACCAACTACTTCGGTCGTTGCAACGTTTCTGAAC ACGAACGTCTGATCCTGCTGTCTCCGCACAAACCGGAAGCGAACGACGAACTGGTTACCTACTCTCTGG GTAAATTCGGTCAGCGTGCGCTGGACTTCTACTCTATCCACGTTACCCGTGAATCTAACCACCCGGTTAA ACCGCTGGAACAGATCGGTGGTAACTCTTGCGCGTCTGGTCCGGTTGGTAAAGCGCTGTCTGACGCGTG CATGGGTGCGGTTGCGTCTTTCCTGACCAAATACCAGGACATCATCCTGGAACACCAGAAAGTTATCAA AAAAAACGAAAAACGTCTGGCGAACCTGAAAGACATCGCGTCTGCGAACGGTCTGGCGTTCCCGAAAA TCACCCTGCCGCCGCAGCCGCACACCAAAGAAGGTATCGAAGCGTACAACAACGTTGTTGCGCAGATCG TTATCTGGGTTAACCTGAACCTGTGGCAGAAACTGAAAATCGGTCGTGACGAAGCGAAACCGCTGCAGC GTCTGAAAGGTTTCCCGTCTTTCCCGCTGGTTGAACGTCAGGCGAACGAAGTTGACTGGTGGGACATGGT TTGCAACGTTAAAAAACTGATCAACGAAAAAAAAGAAGACGGTAAAGTTTTCTGGCAGAACCTGGCGG GTTACAAACGTCAGGAAGCGCTGCTGCCGTACCTGTCTTCTGAAGAAGACCGTAAAAAAGGTAAAAAAT TCGCGCGTTACCAGTTCGGTGACCTGCTGCTGCACCTGGAAAAAAAACACGGTGAAGACTGGGGTAAAG TTTACGACGAAGCGTGGGAACGTATCGACAAAAAAGTTGAAGGTCTGTCTAAACACATCAAACTGGAAG AAGAACGTCGTTCTGAAGACGCGCAGTCTAAAGCGGCGCTGACCGACTGGCTGCGTGCGAAAGCGTCTT TCGTTATCGAAGGTCTGAAAGAAGCGGACAAAGACGAATTCTGCCGTTGCGAACTGAAACTGCAGAAAT GGTACGGTGACCTGCGTGGTAAACCGTTCGCGATCGAAGCGGAAAACTCTATCCTGGACATCTCTGGTTT CTCTAAACAGTACAACTGCGCGTTCATCTGGCAGAAAGACGGTGTTAAAAAACTGAACCTGTACCTGAT CATCAACTACTTCAAAGGTGGTAAACTGCGTTTCAAAAAAATCAAACCGGAAGCGTTCGAAGCGAACCG TTTCTACACCGTTATCAACAAAAAATCTGGTGAAATCGTTCCGATGGAAGTTAACTTCAACTTCGACGAC CCGAACCTGATCATCCTGCCGCTGGCGTTCGGTAAACGTCAGGGTCGTGAATTCATCTGGAACGACCTGC TGTCTCTGGAAACCGGTTCTCTGAAACTGGCGAACGGTCGTGTTATCGAAAAAACCCTGTACAACCGTC GTACCCGTCAGGACGAACCGGCGCTGTTCGTTGCGCTGACCTTCGAACGTCGTGAAGTTCTGGACTCTTC TAACATCAAACCGATGAACCTGATCGGTATCGACCGTGGTGAAAACATCCCGGCGGTTATCGCGCTGAC CGACCCGGAAGGTTGCCCGCTGTCTCGTTTCAAAGACTCTCTGGGTAACCCGACCCACATCCTGCGTATC GGTGAATCTTACAAAGAAAAACAGCGTACCATCCAGGCGGCGAAAGAAGTTGAACAGCGTCGTGCGGG TGGTTACTCTCGTAAATACGCGTCTAAAGCGAAAAACCTGGCGGACGACATGGTTCGTAACACCGCGCG TGACCTGCTGTACTACGCGGTTACCCAGGACGCGATGCTGATCTTCGAAAACCTGTCTCGTGGTTTCGGT CGTCAGGGTAAACGTACCTTCATGGCGGAACGTCAGTACACCCGTATGGAAGACTGGCTGACCGCGAAA CTGGCGTACGAAGGTCTGCCGTCTAAAACCTACCTGTCTAAAACCCTGGCGCAGTACACCTCTAAAACCT GCTCTAACTGCGGTTTCACCATCACCTCTGCGGACTACGACCGTGTTCTGGAAAAACTGAAAAAAACCG CGACCGGTTGGATGACCACCATCAACGGTAAAGAACTGAAAGTTGAAGGTCAGATCACCTACTACAACC GTTACAAACGTCAGAACGTTGTTAAAGACCTGTCTGTTGAACTGGACCGTCTGTCTGAAGAATCTGTTAA CAACGACATCTCTTCTTGGACCAAAGGTCGTTCTGGTGAAGCGCTGTCTCTGCTGAAAAAACGTTTCTCT CACCGTCCGGTTCAGGAAAAATTCGTTTGCCTGAACTGCGGTTTCGAAACCCACGCGGACGAACAGGCG GCGCTGAACATCGCGCGTTCTTGGCTGTTCCTGCGTTCTCAGGAATACAAAAAATACCAGACCAACAAA ACCACCGGTAACACCGACAAACGTGCGTTCGTTGAAACCTGGCAGTCTTTCTACCGTAAAAAACTGAAA GAAGTTTGGAAACCGGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATGTAGGGAGAC CCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAAGCAAAGAGGATTACA SEQ tgccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattctgtaacaaagcgggaccaaagccatgaca ID aaaacgcgtaacaaaagtgtctataatcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgccatagcatttttatccata NO: agattagcggatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtgagatagg 81 cggagatacgaactttaagAAGGAGatatacc SEQ TGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCT ID GTAACAAAGCGGGACCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAG NO: TCCACATTGATTATTTGCACGGCGTCACACTTTGCTATGCCATAGCATTTTTATCCATAAGATTAGCGGAT 82 CCTACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTTTTGGGTAGCGGATCCTAC CTGAC SEQ AATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTCAAACAGGTTTCTAGAGCACAGCT ID AACACCACGTCGTCCCTATCTGCTGCCCTAGGTCTATGAGTGGTTGCTGGATAACTTTACGGGCATGCAT NO: AAGGCTCGTAATATATATTCAGGGAGACCACAACGGTTTCCCTCTACAAATAATTTTGTTTAACTTTTAC 83 TAGAGCTAGCAGTAATACGACTCACTATAGGGGTCTCATCTCGTGTGAGATAGGCGGAGATACGAACTT TAAGAGGAGGATATACCA SEQ GTTTGAGAGATATGTAAATTCAAAGGATAATCAAAC ID NO: 84 SEQ actacattttttaagacctaattttgagt ID NO: 85 SEQ ctcaaaactcattcgaatctctactctttgtagat ID NO: 86 SEQ CTCTAGCAGGCCTGGCAAATTTCTACTGTTGTAGAT ID NO: 87 SEQ CCGTCTAAAACTCATTCAGAATTTCTACTAGTGTAGAT ID NO: 88 SEQ GTCTAGGTACTCTCTTTAATTTCTACTATTGT ID NO: 89 SEQ gttaagttatatagaataatttctactgttgtaga ID NO: 90 SEQ gtttaaaaccactttaaaatttctactattgta ID NO: 91 SEQ GTTTGAGAATGATGTAAAAATGTATGGTACACAGAAATGTTTTAATACCATATTTTTACATCACTCTCAA ID ACATACATCTCTTGTTACTGTTTATCGTATCCAGATTAAATTTCACGTTTTT NO: 92 SEQ CTCTACAACTGATAAAGAATTTCTACTTTTGTAGAT ID NO: 93 SEQ GTCTGGCCCCAAATTTTAATTTCTACTGTTGTAGAT ID NO: 94 SEQ GTCAAAAGACCTTTTTAATTTCTACTCTTGTAGAT ID NO: 95 SEQ GTCTAGAGGACAGAATTTTTCAACGGGTGTGCCAATGGCCACTTTCCAGGTGGCAAAGCCCGTTGAGCT ID TCTACGGAAGTGGCAC NO: 96 SEQ CGAGGTTCTGTCTTTTGGTCAGGACAACCGTCTAGCTATAAGTGCTGCAGGGGTGTGAGAAACTCCTATT ID GCTGGACGATGTCTCTTTTAACGAGGCATTAGCAC NO: 97 SEQ GAACGAGGGACGTTTTGTCTCCAATGATTTTGCTATGACGACCTCGAACTGTGCCTTCAAGTCTGAGGCG ID AAAAAGAAATGGAAAAAAGTGTCTCATCGCTCTACCTCGTAGTTAGAGG NO: 98 SEQ AATTACTGATGTTGTGATGAAGG ID NO: 99 SEQ TATACCATAAGGATTTAAAGACT ID NO: 100 SEQ GTCTTTACTCTCACCTTTCCACCTG ID NO: 101 SEQ ATTTGAAGGTATCTCCGATAAGTAAAACGCATCAAAG ID NO: 102 SEQ GTTTGAAGATATCTCCGATAAATAAGAAGCATCAAAG ID NO: 103 SEQ TTGTTTTAATACCATATTTTTACATCACTCTCAAAC ID NO: 104 SEQ AAAGAACGCTCGCTCAGTGTTCTGACCTTTCGAGCGCCTGTTCAGGGCGAAAACCCTGGGAGGCGCTCG ID AATCATAGGTGGGACAAGGGATTCGCGGCGAAAA NO: 105 SEQ GTTTGAGAATGATGTAAAAATGTATGGTACACAGAAATGTTTTAATACCATATTTTTACATCACTCTCAA ID ACATACATCTCTTGTTACTGTTTATCGTATCCAGATTAAATTTCACGTTTTT NO: 106 SEQ GTCTAGAGGACAGAATTTTTCAACGGGTGTGCCAATGGCCACTTTCCAGGTGGCAAAGCCCGTTGAGCT ID TCTACGGAAGTGGCAC NO: 107 SEQ MSIYQEFVNKYSLSKTLRFELIPQGKTLENIKARGLILDDEKRAKDYKKAKQIIDKYHQFFIEEILSSVC ID ISEDLLQNYSDVYFKLKKSDDDNLQKDFKSAKDTIKKQISEYIKDSEKFKNLFNQNLIDAKKGQESDLIL NO: WLKQSKDNGIELFKANSDITDIDEALEIIKSFKGWTTYFKGFHENRKNVYSSNDIPTSIIYRIVDDNLPK 108 FLENKAKYESLKDKAPEAINYEQIKKDLAEELTFDIDYKTSEVNQRVFSLDEVFEIANFNNYLNQSGITK FNTIIGGKFVNGENTKRKGINEYINLYSQQINDKTLKKYKMSVLFKQILSDTESKSFVIDKLEDDSDVVT TMQSFYEQIAAFKTVEEKSIKETLSLLFDDLKAQKLDLSKIYFKNDKSLTDLSQQVFDDYSVIGTAVLEY ITQQIAPKNLDNPSKKEQELIAKKTEKAKYLSLETIKLALEEFNKHRDIDKQCRFEEILANFAAIPMIFD EIAQNKDNLAQISIKYQNQGKKDLLQASAEDDVKAIKDLLDQTNNLLHKLKIFHISQSEDKANILDKDEH FYLVFEECYFELANIVPLYNKIRNYITQKPYSDEKFKLNFENSTLANGWDKNKEPDNTAILFIKDDKYYL GVMNKKNNKIFDDKAIKENKGEGYKKIVYKLLPGANKMLPKVFFSAKSIKFYNPSEDILRIRNHSTHTKN GSPQKGYEKFEFNIEDCRKFIDFYKQSISKHPEWKDFGFRFSDTQRYNSIDEFYREVENQGYKLTFENIS ESYIDSVVNQGKLYLFQIYNKDFSAYSKGRPNLHTLYWKALFDERNLQDVVYKLNGEAELFYRKQSIPKK ITHPAKEAIANKNKDNPKKESVFEYDLIKDKRFTEDKFFFHCPITINFKSSGANKFNDEINLLLKEKAND VHILSIDRGERHLAYYTLVDGKGNIIKQDTFNIIGNDRMKTNYHDKLAAIEKDRDSARKDWKKINNIKEM KEGYLSQVVHEIAKLVIEYNAIVVFEDLNFGFKRGRFKVEKQVYQKLEKMLIEKLNYLVFKDNEFDKTGG VLRAYQLTAPFETFKKMGKQTGIIYYVPAGFTSKICPVTGFVNQLYPKYESVSKSQEFFSKFDKICYNLD KGYFEFSFDYKNFGDKAAKGKWTIASFGSRLINFRNSDKNHNWDTREVYPTKELEKLLKDYSIEYGHGEC IKAAICGESDKKFFAKLTSVLNTILQMRNSKTGTELDYLISPVADVNGNFFDSRQAPKNMPQDADANGAY HIGLKGLMLLGRIKNNQEGKKLNLVIKNEEYFEFVQNRNN SEQ MSIYQEFVNKYSLSKTLRFELIPQGKTLENIKARGLILDDEKRAKDYKKAKQIIDKYHQFFIEEILSSVC ID ISEDLLQNYSDVYFKLKKSDDDNLQKDFKSAKDTIKKQISEYIKDSEKFKNLFNQNLIDAKKGQESDLIL NO: WLKQSKDNGIELFKANSDITDIDEALEIIKSFKGWTTYFKGFHENRKNVYSSDDIPTSIIYRIVDDNLPK 109 FLENKAKYESLKDKAPEAINYEQIKKDLAEELTFDIDYKTSEVNQRVFSLDEVFEIANFNNYLNQSGITK FNTIIGGKFVNGENTKRKGINEYINLYSQQINDKTLKKYKMSVLFKQILSDTESKSFVIDKLEDDSDVVT TMQSFYEQIAAFKTVEEKSIKETLSLLFDDLKAQKLDLSKIYFKNDKSLTDLSQQVFDDYSVIGTAVLEY ITQQVAPKNLDNPSKKEQDLIAKKTEKAKYLSLETIKLALEEFNKHRDIDKQCRFEEILANFAAIPMIFD EIAQNKDNLAQISLKYQNQGKKDLLQASAEEDVKAIKDLLDQTNNLLHRLKIFHISQSEDKANILDKDEH FYLVFEECYFELANIVPLYNKIRNYITQKPYSDEKFKLNFENSTLANGWDKNKEPDNTAILFIKDDKYYL GVMNKKNNKIFDDKAIKENKGEGYKKIVYKLLPGANKMLPKVFFSAKSIKFYNPSEDILRIRNHSTHTKN GNPQKGYEKFEFNIEDCRKFIDFYKESISKHPEWKDFGFRFSDTQRYNSIDEFYREVENQGYKLTFENIS ESYIDSVVNQGKLYLFQIYNKDFSAYSKGRPNLHTLYWKALFDERNLQDVVYKLNGEAELFYRKQSIPKK ITHPAKEAIANKNKDNPKKESVFEYDLIKDKRFTEDKFFFHCPITINFKSSGANKFNDEINLLLKEKAND VHILSIDRGERHLAYYTLVDGKGNIIKQDTFNIIGNDRMKTNYHDKLAAIEKDRDSARKDWKKINNIKEM KEGYLSQVVHEIAKLVIEHNAIVVFEDLNFGFKRGRFKVEKQVYQKLEKMLIEKLNYLVFKDNEFDKTGG VLRAYQLTAPFETFKKMGKQTGIIYYVPAGFTSKICPVTGFVNQLYPKYESVSKSQEFFSKFDKICYNLD KGYFEFSFDYKNFGDKAAKGKWTIASFGSRLINFRNSDKNHNWDTREVYPTKELEKLLKDYSIEYGHGEC IKAAICGESDKKFFAKLTSVLNTILQMRNSKTGTELDYLISPVADVNGNFFDSRQAPKNMPQDADANGAY HIGLKGLMLLDRIKNNQEGKKLNLVIKNEEYFEFVQNRNN SEQ MDKKYSIGLDIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTARRRY ID TRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHLRKKLVDST NO: DKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVDAKAILSARLSKSR 110 RLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLA AKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDG GASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREK IEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSL LYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVED RFNASLGTYHDLLKIIKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTG WGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSP AIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENT QLQNEKLYLYYLQNGRDMYVDQELDINRLSDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEV VKKMKNYWRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDEN DKLIREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDV RKMIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQ VNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSKKLKSVKEL LGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGRKRMLASAGELQKGNELALPSKYVNFLY LASHYEKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLSAYNKHRDKPIREQAENIIH LFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDLSQLGGD SEQ PKKKRKV ID NO: 111 SEQ KRPAATKKAGQAKKKK ID NO: 112 SEQ PAAKRVKLD ID NO: 113 SEQ RQRRNELKRSP ID NO: 114 SEQ NQSSNFGPMKGGNFGGRSSGPYGGGGQYFAKPRNQGGY ID NO: 115 SEQ RMRIZFKNKGKDTAELRRRRVEVSVELRKAKKDEQILKRRNV ID NO: 116 SEQ VSRKRPRP ID NO: 117 SEQ PPKKARED ID NO: 118 SEQ PQPKKKPL ID NO: 119 SEQ SALIKKKKKMAP ID NO: 120 SEQ DRLRR ID NO: 121 SEQ PKQKKRK ID NO: 122 SEQ RKLKKKIKKL ID NO: 123 SEQ REKKKFLKRR ID NO: 124 SEQ KRKGDEVDGVDEVAKKKSKK ID NO: 125 SEQ RKCLQAGMNLEARKTKK ID NO: 126 SEQ ATGGGTAAGATGTATTATCTGGGTTTGGATATAGGCACTAACTCTGTGGGATATGCAGTAACTGATCCCT ID CGTATCACTTGTTAAAGTTCAAAGGCGAACCCATGTGGGGAGCACATGTATTTGCTGCGGGTAATCAGA NO: GTGCCGAAAGGCGATCTTTCAGAACATCCAGGAGGCGATTAGATAGGAGACAGCAAAGAGTAAAGCTT 127 GTGCAAGAGATCTTTGCTCCTGTCATTTCACCTATAGACCCTCGTTTTTTTATAAGATTGCACGAATCGGC TCTATGGAGAGACGATGTTGCCGAAACAGATAAACATATCTTTTTCAATGATCCCACTTATACAGACAA GGAATACTACTCCGACTACCCGACAATTCATCATTTGATCGTCGATCTTATGGAGAGCTCTGAAAAGCAT GACCCCCGACTTGTCTATTTGGCTGTAGCTTGGTTAGTTGCTCATAGAGGTCATTTCTTGAATGAAGTAG ATAAAGACAATATAGGTGATGTACTTTCTTTTGATGCTTTCTACCCGGAATTTTTGGCCTTTTTGTCAGAC AATGGCGTCAGTCCCTGGGTCTGTGAGTCGAAGGCCCTTCAAGCTACTCTGCTGTCTAGGAATAGCGTCA ACGACAAATATAAAGCATTAAAATCGCTGATATTCGGATCGCAAAAACCGGAAGATAACTTTGACGCTA ACATCTCTGAAGATGGTTTAATCCAATTGCTGGCGGGTAAGAAAGTTAAAGTAAACAAACTATTCCCAC AAGAGTCCAACGATGCTAGCTTTACGTTGAATGATAAAGAAGACGCTATTGAAGAAATTCTAGGTACTT TAACGCCTGACGAGTGCGAATGGATCGCTCATATTCGCAGATTGTTCGATTGGGCCATCATGAAACACG CGCTAAAGGATGGCAGGACGATATCTGAATCAAAAGTGAAGCTATACGAGCAGCATCATCATGACTTGA CTCAGTTAAAGTACTTTGTGAAGACCTACCTAGCTAAAGAGTATGATGATATCTTCAGAAACGTAGACTC CGAGACAACTAAAAATTATGTAGCTTATTCTTACCATGTGAAGGAAGTGAAAGGCACATTACCAAAAAA TAAAGCAACGCAAGAAGAATTTTGTAAATACGTCCTTGGCAAAGTCAAAAACATTGAATGTTCCGAAGC AGACAAGGTTGATTTTGATGAAATGATACAACGACTTACGGACAATTCTTTTATGCCAAAGCAAGTCTC AGGTGAAAATAGAGTAATACCATACCAGTTGTACTACTATGAATTAAAGACAATTTTAAACAAAGCCGC CTCATATCTACCTTTTTTGACACAATGCGGTAAAGATGCTATTTCTAACCAAGACAAATTACTGTCTATA ATGACATTTCGCATACCATATTTCGTCGGCCCTTTAAGGAAAGATAATTCAGAACATGCCTGGTTGGAAC GTAAAGCGGGTAAAATTTACCCGTGGAACTTTAATGATAAAGTAGATCTTGATAAATCGGAGGAAGCCT TTATCCGTAGGATGACCAATACTTGCACGTATTACCCAGGAGAAGACGTGTTACCATTAGATTCACTTAT CTATGAAAAGTTTATGATCTTGAATGAGATAAACAATATTAGGATTGACGGATACCCCATTTCTGTTGAT GTGAAACAACAAGTATTTGGTTTATTTGAGAAGAAAAGGCGAGTAACAGTTAAGGATATTCAAAATCTA CTATTATCTCTTGGAGCGTTGGATAAACACGGTAAGCTGACTGGTATTGACACGACAATACACTCTAATT ATAACACTTATCATCATTTTAAATCTCTTATGGAGCGGGGAGTATTGACCAGAGATGATGTGGAAAGAA TAGTGGAAAGAATGACATATTCTGACGATACTAAGAGGGTCAGACTGTGGTTAAATAATAATTATGGAA CTCTAACAGCTGACGATGTTAAGCATATCTCAAGACTCAGAAAACACGATTTCGGCCGTTTGTCTAAAAT GTTTTTGACAGGATTGAAAGGTGTTCATAAGGAGACAGGCGAGAGAGCAAGTATACTGGATTTTATGTG GAATACTAACGACAATTTAATGCAACTACTGTCCGAATGTTACACATTCTCGGATGAGATCACCAAATTA CAAGAGGCCTACTACGCAAAAGCTCAATTATCGCTAAATGACTTCTTGGACTCTATGTATATATCAAACG CCGTTAAGAGACCTATTTATCGGACCTTAGCGGTAGTAAATGATATTAGAAAGGCATGCGGGACGGCAC CTAAAAGAATTTTCATCGAGATGGCGCGAGATGGAGAGTCTAAGAAGAAAAGATCTGTGACTCGTAGA GAGCAAATTAAAAATCTCTATAGATCAATTCGTAAAGACTTTCAACAAGAAGTTGATTTTCTGGAAAAG ATATTGGAAAATAAGAGTGACGGGCAGCTTCAGTCTGACGCTTTATATTTGTATTTTGCTCAATTAGGCA GAGACATGTACACAGGTGATCCAATCAAATTAGAACATATTAAAGACCAATCTTTTTACAACATTGATC ATATTTATCCTCAATCGATGGTGAAAGATGACAGTTTGGATAACAAGGTACTAGTCCAAAGCGAAATCA ATGGCGAAAAGAGTTCGCGCTATCCATTAGACGCAGCCATTAGAAACAAAATGAAGCCGTTGTGGGATG CCTACTATAATCATGGATTAATTTCTCTTAAGAAATACCAGCGTTTGACGAGATCTACTCCATTTACGGA CGACGAGAAGTGGGATTTTATCAATCGTCAGCTAGTTGAAACTAGGCAATCTACTAAAGCTTTAGCAAT ATTGTTAAAGCGTAAGTTTCCAGATACTGAAATAGTTTACTCAAAGGCTGGACTATCCAGCGATTTTAGA CATGAATTCGGCCTGGTTAAGAGTAGGAATATTAATGATCTACACCATGCTAAAGATGCCTTTCTCGCAA TAGTTACTGGGAACGTTTATCATGAAAGATTTAATAGAAGATGGTTTATGGTTAACCAGCCATACTCTGT GAAAACTAAGACATTGTTTACCCATTCAATTAAGAATGGCAACTTTGTCGCTTGGAATGGAGAAGAAGA TCTTGGACGTATCGTAAAGATGTTGAAACAAAACAAGAACACAATCCACTTCACCAGGTTTTCCTTTGAT AGGAAGGAGGGATTGTTCGATATTCAACCTCTCAAAGCTTCTACCGGATTGGTTCCACGAAAAGCAGGG TTGGATGTTGTTAAATATGGAGGATACGATAAAAGCACTGCCGCGTATTATTTATTAGTACGTTTTACAC TCGAGGATAAGAAGACTCAACACAAATTGATGATGATTCCTGTTGAAGGTCTCTACAAAGCACGTATTG ACCATGATAAAGAGTTTTTAACAGATTATGCTCAGACCACGATCAGCGAAATTCTTCAAAAGGACAAGC AGAAAGTGATCAACATCATGTTCCCTATGGGCACGAGACATATCAAACTGAATTCGATGATTTCTATTGA TGGATTCTATCTTTCTATTGGTGGGAAGAGTAGCAAAGGTAAGTCAGTACTATGTCATGCTATGGTGCCA TTAATCGTCCCACACAAGATAGAATGTTATATCAAGGCTATGGAATCGTTTGCAAGAAAATTCAAAGAA AATAATAAATTGAGGATCGTTGAAAAGTTTGATAAAATAACTGTTGAAGATAACTTGAACTTATACGAG CTTTTTCTACAAAAGTTGCAACATAACCCATATAATAAATTTTTCTCTACACAATTTGATGTGTTGACGA ACGGTAGAAGTACATTCACCAAATTGTCTCCAGAGGAGCAAGTCCAGACTTTACTTAATATACTGAGTA TATTTAAAACTTGTCGTTCTTCTGGGTGTGATTTAAAATCAATAAATGGTTCCGCTCAAGCGGCTAGAAT TATGATATCCGCTGATTTAACTGGCTTATCAAAAAAGTATTCAGATATTAGATTAGTTGAGCAAAGCGCA TCAGGTCTATTTGTTTCAAAATCTCAAAATCTCTTGGAATACTTGCCAAAAAAGAAAAGGAAAGTTTAG SEQ ATGAGTAGTTTAACAAAGTTTACCAATAAATATAGTAAGCAACTAACTATAAAGAACGAATTGATACCG ID GTCGGTAAGACTTTGGAAAACATAAAAGAAAATGGGTTGATTGATGGAGACGAGCAATTGAATGAGAA NO: TTATCAAAAAGCAAAGATAATAGTAGATGATTTTTTGAGAGACTTTATTAATAAAGCTCTAAATAACACT 128 CAAATTGGTAACTGGAGAGAGCTAGCCGACGCCTTGAACAAGGAAGATGAGGATAATATTGAGAAATT ACAAGATAAGATTAGAGGGATTATCGTGTCTAAGTTTGAGACTTTTGATCTGTTCAGTTCGTATTCGATT AAAAAGGACGAGAAAATCATCGATGATGATAACGATGTGGAAGAAGAGGAGCTAGACCTTGGGAAGAA GACATCTAGCTTCAAATACATATTCAAGAAAAATTTGTTCAAACTTGTCCTTCCTTCATATTTAAAAACA ACAAATCAAGATAAGTTAAAAATCATTTCTTCCTTCGATAATTTTAGTACTTATTTTCGTGGTTTTTTCGA AAACAGGAAAAATATATTCACTAAAAAGCCTATATCTACCTCTATAGCTTATAGAATTGTTCACGATAAT TTCCCAAAATTTCTAGATAATATCAGGTGTTTTAATGTTTGGCAAACCGAGTGTCCTCAGTTAATAGTCA AGGCCGACAACTACCTTAAAAGCAAGAATGTGATTGCAAAAGATAAGTCTTTGGCTAACTATTTTACAG TCGGTGCCTATGATTATTTTCTGAGTCAAAATGGTATCGATTTCTATAACAACATTATTGGCGGCTTACC AGCTTTTGCCGGGCATGAGAAGATTCAGGGTTTGAACGAATTTATCAATCAAGAATGTCAAAAGGATTC TGAATTAAAGTCTAAGCTCAAGAATAGGCACGCTTTCAAAATGGCAGTCTTATTCAAACAAATCCTTTCA GACAGAGAAAAGTCATTTGTGATTGACGAGTTCGAATCAGACGCTCAGGTAATTGATGCTGTTAAAAAT TTTTACGCGGAACAATGCAAAGATAATAACGTCATATTTAATTTATTGAATCTGATCAAGAATATTGCTT TTTTGTCGGATGATGAGTTAGACGGCATTTTCATAGAGGGTAAATACCTGTCCTCTGTGTCTCAAAAATT GTATAGTGATTGGTCAAAGTTGAGAAATGATATTGAAGATTCGGCTAATTCTAAACAGGGTAACAAAGA ATTAGCGAAGAAAATCAAAACTAACAAGGGTGATGTTGAAAAGGCTATAAGTAAGTACGAGTTCAGTTT ATCTGAACTAAATTCAATTGTTCATGATAACACAAAATTTTCCGATCTTTTATCATGCACATTACATAAA GTTGCAAGTGAAAAATTAGTCAAAGTAAACGAAGGTGATTGGCCAAAACATCTAAAAAACAACGAGGA AAAACAGAAGATAAAAGAACCTCTTGACGCTTTATTGGAAATATACAATACTCTATTAATATTTAACTGT AAAAGTTTTAACAAAAATGGTAATTTCTATGTCGACTACGATCGCTGCATTAATGAGTTGTCCAGTGTTG TGTACTTGTATAATAAAACTCGTAATTATTGTACGAAAAAGCCGTACAACACTGACAAATTTAAGTTGA ATTTCAACTCCCCACAACTGGGTGAGGGCTTCTCTAAAAGTAAAGAGAATGATTGCCTTACATTATTATT TAAAAAAGATGATAATTATTATGTCGGAATCATAAGAAAGGGGGCAAAGATCAACTTCGATGACACTCA GGCCATAGCAGACAACACAGATAACTGTATATTCAAAATGAATTATTTTTTGCTGAAGGATGCTAAAAA ATTTATCCCCAAATGTTCAATACAATTAAAAGAGGTTAAGGCCCATTTCAAAAAGTCGGAAGATGACTA TATTTTGTCCGATAAGGAAAAATTCGCTAGTCCGCTTGTTATTAAAAAATCCACATTTCTTCTCGCTACG GCTCATGTGAAAGGAAAGAAGGGCAATATTAAGAAATTTCAGAAAGAATACTCCAAAGAAAATCCTAC GGAGTATAGAAATAGTCTGAACGAATGGATAGCATTCTGCAAAGAGTTCTTGAAGACCTATAAAGCTGC CACCATCTTTGATATTACAACTTTGAAAAAGGCCGAGGAATACGCTGACATTGTGGAATTCTATAAGGA TGTAGATAATCTTTGTTACAAGTTAGAATTTTGCCCTATCAAAACTTCTTTTATCGAAAATCTTATAGATA ATGGCGATTTATACCTGTTTAGAATTAATAACAAGGACTTTTCTTCAAAAAGTACAGGCACGAAAAACTT ACACACATTATACTTGCAGGCTATATTTGACGAGCGAAACTTAAACAACCCCACGATAATGTTGAATGG AGGTGCAGAGTTATTCTACAGAAAAGAATCTATAGAACAGAAAAATCGGATCACGCACAAAGCCGGTA GTATCTTAGTGAATAAAGTGTGCAAAGATGGTACAAGTCTAGATGACAAAATCCGTAACGAAATTTACC AGTATGAAAACAAATTCATTGATACTCTTTCGGACGAAGCTAAAAAGGTTCTGCCAAACGTTATTAAGA AAGAGGCTACGCATGATATAACAAAAGATAAACGTTTCACTAGCGACAAATTCTTCTTTCATTGTCCTTT AACAATCAACTACAAGGAAGGTGACACCAAACAATTTAATAATGAAGTGCTCTCATTCCTTAGAGGTAA CCCCGATATCAATATTATCGGCATTGATAGAGGAGAAAGAAACCTAATCTATGTAACAGTCATTAACCA AAAAGGCGAAATATTGGATAGCGTCTCCTTCAATACTGTCACCAATAAGTCATCGAAGATAGAACAAAC TGTTGATTACGAAGAAAAATTGGCCGTTAGAGAAAAGGAACGTATCGAAGCGAAGAGATCTTGGGATA GCATATCCAAGATTGCCACCTTGAAGGAGGGTTATCTAAGCGCGATCGTACATGAAATCTGCTTATTAAT GATTAAGCATAATGCTATTGTCGTGTTAGAAAACCTGAATGCCGGTTTTAAAAGGATTAGAGGTGGTTTG TCAGAAAAGTCAGTATATCAAAAGTTTGAAAAGATGCTTATTAATAAACTCAACTACTTCGTTAGCAAG AAAGAAAGTGATTGGAATAAACCGTCAGGTTTGCTCAATGGTCTTCAGTTAAGTGATCAATTTGAGTCTT TCGAAAAATTAGGAATTCAAAGTGGATTCATTTTTTATGTACCAGCCGCGTACACTTCAAAAATTGACCC TACGACCGGATTTGCCAACGTCTTGAATTTGTCCAAGGTCAGAAATGTTGACGCCATCAAAAGTTTTTTT AGCAACTTCAATGAAATCTCTTATTCCAAAAAGGAAGCCCTTTTCAAGTTTTCTTTTGACCTAGACTCGTT ATCGAAGAAAGGATTTTCATCTTTCGTAAAGTTTAGCAAGTCCAAGTGGAATGTATACACATTCGGCGA GAGAATTATCAAGCCCAAGAACAAACAGGGCTATAGAGAAGACAAGAGAATCAACTTGACTTTTGAGA TGAAAAAATTACTCAACGAATACAAGGTTTCATTTGATTTGGAGAACAACTTGATTCCCAATTTGACATC AGCTAACTTGAAGGATACGTTCTGGAAGGAGTTATTCTTTATATTCAAAACGACATTACAACTGCGTAAT AGTGTTACAAACGGTAAAGAAGATGTATTAATCTCACCTGTAAAGAATGCCAAAGGAGAATTTTTCGTA TCCGGTACTCACAATAAGACACTACCACAGGATTGCGACGCTAACGGTGCGTATCATATTGCGTTGAAA GGATTAATGATACTTGAAAGAAATAACCTTGTTCGCGAAGAAAAAGACACCAAGAAGATCATGGCTATT AGCAATGTTGATTGGTTTGAATACGTGCAAAAGAGGAGAGGTGTTTTGTAA SEQ ATGAACAATTATGACGAGTTCACAAAGCTATACCCTATCCAAAAAACTATCAGGTTCGAATTGAAACCA ID CAAGGGAGAACAATGGAACATCTGGAGACATTCAACTTTTTTGAAGAGGACAGAGACAGAGCGGAGAA NO: ATACAAAATTTTAAAAGAGGCCATCGATGAATATCACAAAAAGTTTATCGACGAGCATTTAACAAACAT 129 GTCTTTGGACTGGAATTCACTTAAACAAATTTCTGAGAAATATTATAAGTCTCGGGAGGAAAAAGACAA AAAGGTCTTTTTGTCCGAGCAAAAGAGAATGAGACAAGAAATTGTCTCGGAGTTTAAAAAAGATGATCG GTTCAAAGATTTGTTTAGCAAGAAATTGTTTTCTGAATTGTTGAAGGAGGAGATATACAAGAAAGGCAA CCATCAAGAAATAGATGCTTTGAAATCGTTTGACAAGTTCAGCGGTTACTTCATTGGTTTACATGAAAAT AGGAAGAACATGTATAGCGACGGCGATGAGATCACCGCTATATCGAATAGAATCGTTAACGAAAATTTT CCGAAATTTTTGGATAATTTGCAAAAATACCAGGAAGCTAGGAAAAAGTACCCTGAATGGATAATAAAG GCGGAATCAGCTTTGGTGGCTCACAACATAAAGATGGATGAAGTCTTCTCGCTGGAATATTTTAACAAA GTATTAAATCAGGAAGGAATCCAAAGATACAACTTAGCCTTGGGTGGATACGTAACCAAATCAGGTGAG AAAATGATGGGCTTAAATGATGCACTTAATCTAGCTCACCAATCCGAAAAGTCCTCTAAAGGGAGGATA CACATGACACCATTGTTTAAGCAAATCCTTTCGGAGAAAGAATCTTTTTCATATATCCCCGATGTTTTCA CTGAGGATAGTCAATTGTTGCCCAGCATTGGTGGATTTTTTGCACAAATAGAAAATGATAAAGATGGTA ACATCTTCGATAGAGCCTTGGAATTGATAAGCTCCTATGCAGAATACGATACGGAACGAATATACATTA GACAAGCTGACATCAACAGAGTAAGCAATGTTATTTTTGGTGAGTGGGGAACTTTAGGTGGATTAATGC GGGAGTACAAAGCTGACTCAATCAATGATATTAATTTGGAACGTACGTGCAAAAAAGTCGATAAGTGGC TTGATAGTAAGGAGTTTGCTCTGTCGGATGTACTAGAAGCAATTAAGAGAACAGGAAACAATGATGCAT TTAATGAATATATTAGTAAAATGAGGACGGCTAGAGAAAAGATAGACGCCGCACGTAAGGAAATGAAG TTTATTTCCGAGAAAATATCTGGCGATGAAGAGTCGATTCACATCATCAAGACCCTACTCGATTCTGTTC AGCAATTTCTCCATTTTTTTAACCTCTTCAAAGCAAGACAAGACATTCCCTTAGATGGGGCTTTTTATGCC GAATTTGATGAAGTTCATTCAAAGTTGTTTGCTATTGTTCCTCTTTACAATAAGGTCCGTAATTACCTTAC TAAAAATAACTTGAACACCAAGAAAATAAAGTTAAACTTCAAGAATCCGACTCTTGCCAACGGGTGGGA TCAGAATAAAGTTTATGATTATGCTAGCTTAATATTTCTAAGAGATGGGAATTATTACTTAGGAATCATC AATCCAAAGCGTAAGAAAAACATTAAATTTGAACAAGGGTCAGGCAATGGCCCATTCTATAGAAAAAT GGTGTATAAGCAAATACCAGGACCTAACAAGAACTTGCCTCGCGTATTTTTAACTTCAACAAAGGGTAA AAAAGAATATAAACCAAGCAAAGAAATTATTGAAGGTTACGAAGCAGATAAACACATCAGAGGTGATA AGTTCGATCTGGATTTCTGCCATAAATTGATTGACTTTTTTAAGGAATCTATAGAAAAACATAAGGACTG GTCCAAATTTAATTTCTACTTCTCACCTACAGAAAGTTATGGTGACATTTCAGAATTTTATTTAGACGTTG AGAAACAAGGATATAGGATGCATTTTGAAAATATTTCAGCGGAAACCATCGACGAATACGTTGAGAAG GGTGATTTATTCTTGTTCCAAATTTACAATAAAGACTTCGTTAAAGCTGCAACCGGAAAGAAGGATATGC ATACCATATATTGGAACGCTGCATTCTCGCCAGAAAACTTACAAGATGTCGTTGTAAAGCTTAATGGAG AAGCTGAGCTGTTCTATAGAGACAAGAGTGATATAAAAGAGATTGTGCATCGGGAAGGTGAAATTCTGG TGAACAGAACTTACAATGGTCGTACACCCGTTCCAGACAAAATACATAAAAAACTGACCGATTATCATA ATGGTAGGACAAAGGACTTGGGCGAGGCCAAGGAGTACCTCGATAAAGTTAGATATTTCAAGGCACACT ATGATATTACGAAAGACAGGAGATATTTAAACGATAAAATTTACTTTCATGTCCCTTTGACCCTTAACTT TAAAGCTAATGGTAAAAAGAATTTGAACAAAATGGTAATTGAGAAGTTTTTATCGGACGAAAAAGCTCA CATAATCGGAATCGACCGCGGAGAGAGAAATTTACTGTATTATAGTATCATCGACAGAAGTGGAAAGAT TATTGATCAGCAATCTTTGAACGTCATTGATGGGTTTGACTATCGGGAAAAGTTAAATCAAAGGGAAAT TGAAATGAAGGATGCGAGACAATCATGGAATGCCATTGGTAAAATTAAAGATCTCAAGGAGGGGTACTT ATCAAAAGCTGTACACGAGATAACTAAAATGGCTATCCAATATAATGCAATTGTTGTAATGGAAGAATT GAATTATGGTTTTAAACGCGGCAGGTTTAAAGTCGAAAAACAAATATACCAAAAGTTTGAAAACATGTT AATTGATAAGATGAACTATCTTGTTTTCAAAGATGCACCTGATGAGAGTCCTGGCGGTGTGCTGAACGCC TATCAATTAACAAACCCATTAGAGTCCTTTGCTAAACTGGGTAAACAAACTGGCATTCTATTTTATGTTC CAGCCGCTTACACCTCAAAGATCGATCCAACGACCGGTTTTGTAAACTTATTTAATACTTCTTCCAAAAC AAACGCGCAAGAACGCAAAGAATTCCTACAAAAATTTGAATCAATATCCTATAGCGCAAAAGATGGAG GTATATTCGCTTTCGCTTTTGACTACAGAAAGTTTGGCACTTCCAAGACAGATCATAAAAATGTGTGGAC CGCTTATACCAACGGAGAAAGGATGCGTTATATTAAAGAAAAAAAGAGGAACGAACTATTTGATCCATC GAAAGAAATTAAAGAAGCTTTGACAAGCAGCGGAATCAAATATGATGGAGGTCAAAACATACTTCCAG ATATTCTCAGATCTAATAATAACGGTCTTATTTACACGATGTATTCATCTTTTATCGCTGCCATCCAAATG CGTGTGTATGATGGCAAGGAAGATTATATTATATCTCCTATTAAAAATTCAAAGGGTGAATTTTTTCGCA CGGATCCAAAAAGAAGAGAGCTTCCAATTGACGCCGATGCTAACGGTGCTTACAATATTGCATTGCGTG GTGAACTTACTATGAGAGCCATCGCCGAAAAGTTTGATCCGGACAGTGAAAAAATGGCGAAATTGGAGC TAAAGCACAAGGATTGGTTTGAATTCATGCAGACCCGTGGCGATTGA SEQ ATGACTAAAACGTTCGACTCCGAGTTTTTTAATCTCTATTCCTTGCAAAAGACCGTTAGGTTTGAATTGA ID AACCAGTTGGTGAAACTGCCTCATTTGTCGAAGACTTTAAAAACGAGGGATTGAAAAGAGTGGTTAGTG NO: AAGATGAAAGAAGGGCAGTAGACTATCAAAAGGTTAAAGAAATCATTGACGATTACCACAGAGATTTT 130 ATAGAAGAATCTCTGAACTATTTTCCAGAGCAGGTTTCAAAAGATGCTCTAGAGCAAGCGTTTCATTTGT ATCAAAAGTTGAAAGCAGCGAAGGTGGAAGAAAGGGAAAAAGCTTTAAAAGAATGGGAAGCATTACA GAAAAAATTGCGAGAAAAAGTCGTCAAATGTTTCAGCGACTCTAATAAAGCTCGCTTTTCTAGAATCGA TAAAAAAGAATTGATTAAGGAAGATTTAATAAATTGGCTGGTAGCACAAAACAGAGAGGATGATATTCC TACTGTTGAAACGTTCAATAATTTTACTACTTACTTCACTGGTTTCCATGAGAACAGGAAGAATATTTAC TCTAAAGATGATCACGCTACTGCTATAAGTTTTAGGTTGATTCACGAAAACTTGCCTAAATTTTTTGACA ATGTCATCAGTTTTAACAAGTTGAAAGAAGGTTTCCCGGAATTAAAATTCGACAAAGTTAAAGAAGATT TAGAAGTAGATTACGACTTGAAGCATGCGTTTGAAATTGAATATTTCGTTAATTTCGTCACACAAGCTGG TATCGACCAATATAATTACCTGCTTGGAGGCAAAACTCTAGAAGACGGTACGAAGAAACAAGGAATGA ATGAACAGATTAATTTATTTAAGCAACAACAAACTCGCGATAAAGCTAGACAGATTCCAAAACTGATTC CACTTTTCAAACAGATTCTATCTGAGAGAACTGAATCTCAGAGTTTTATCCCTAAGCAGTTCGAGTCTGA TCAGGAACTATTCGATTCCCTGCAGAAATTGCATAACAACTGTCAAGATAAGTTTACCGTTTTGCAACAG GCGATCTTGGGATTGGCTGAGGCAGATCTTAAAAAGGTCTTTATTAAAACTAGTGATCTAAACGCATTGT CTAACACTATTTTTGGAAATTATTCTGTGTTCTCAGACGCGCTCAATTTATATAAAGAGTCGCTAAAAAC TAAAAAGGCTCAAGAAGCTTTTGAAAAGTTGCCTGCACATAGTATTCATGATTTAATCCAATACTTAGAA CAATTTAATTCGTCTCTCGATGCTGAAAAGCAACAGTCTACCGATACTGTATTAAACTACTTTATTAAAA CCGACGAATTATATAGTCGTTTCATTAAATCCACCTCTGAGGCATTCACCCAAGTACAACCTCTCTTTGA ACTGGAAGCTTTGAGCTCCAAAAGAAGACCCCCAGAAAGTGAAGATGAGGGGGCTAAAGGCCAAGAAG GTTTCGAACAAATTAAGAGAATCAAAGCTTATCTAGACACTCTAATGGAGGCTGTCCACTTTGCTAAGCC TTTGTATCTTGTCAAGGGTAGAAAGATGATAGAGGGTCTAGACAAGGATCAAAGCTTCTACGAAGCGTT TGAAATGGCCTACCAGGAGTTGGAGTCTTTAATCATCCCCATTTACAATAAGGCCAGATCTTACCTGTCT AGGAAGCCATTTAAAGCGGATAAATTCAAAATTAATTTTGACAATAATACACTTCTATCTGGGTGGGAT GCTAACAAGGAGACGGCTAACGCCAGCATATTGTTTAAGAAGGATGGTTTATACTACCTGGGAATCATG CCAAAAGGCAAAACTTTCTTGTTCGATTATTTCGTTAGTTCAGAAGATTCTGAAAAGTTGAAACAACGGA GACAGAAAACCGCAGAGGAAGCGCTCGCACAGGATGGAGAATCCTATTTTGAAAAAATACGGTATAAA CTCCTACCAGGTGCTAGTAAGATGTTGCCAAAGGTATTTTTTAGCAATAAAAATATTGGGTTTTACAATC CCTCAGATGATATTCTACGAATTCGGAATACGGCCTCTCATACTAAGAATGGTACTCCCCAGAAGGGTC ATTCCAAGGTAGAATTTAACTTGAATGACTGTCACAAAATGATTGATTTTTTTAAATCTTCCATACAGAA ACATCCCGAGTGGGGATCCTTTGGTTTCACTTTTTCTGATACGTCGGACTTTGAAGATATGAGTGCTTTCT ACCGAGAAGTTGAAAATCAAGGTTACGTTATAAGTTTTGATAAAATAAAAGAAACTTACATTCAGTCTC AAGTTGAGCAAGGTAACTTATATTTATTTCAAATTTACAACAAAGATTTTAGTCCGTATTCAAAGGGAAA GCCAAACCTGCACACTTTATACTGGAAAGCTCTGTTTGAAGAGGCTAATTTGAATAACGTAGTGGCTAA GCTAAACGGCGAAGCAGAAATCTTTTTCAGAAGACACAGTATCAAAGCATCTGATAAAGTGGTACATCC TGCTAATCAAGCTATAGATAATAAGAATCCCCATACTGAGAAGACGCAGTCCACATTTGAATATGACTT GGTCAAAGACAAAAGATATACCCAAGACAAATTTTTTTTTCATGTACCGATATCTTTAAACTTTAAGGCT CAGGGCGTTTCAAAGTTTAATGATAAGGTAAATGGATTCTTAAAGGGCAATCCCGACGTTAATATAATC GGTATAGATCGAGGTGAGAGACATCTTTTATACTTTACCGTGGTGAATCAAAAAGGAGAAATATTAGTG CAAGAGTCCTTGAATACATTAATGTCTGACAAGGGTCATGTCAACGATTATCAACAGAAATTGGACAAG AAGGAACAGGAAAGGGACGCTGCCAGGAAGTCCTGGACGACAGTAGAAAATATTAAAGAATTAAAAGA AGGTTATTTATCACATGTGGTTCATAAACTTGCACATTTAATCATCAAATATAACGCAATAGTGTGCTTG GAAGATCTTAATTTTGGCTTCAAGAGGGGTAGGTTCAAGGTCGAAAAACAGGTCTACCAGAAGTTCGAG AAAGCTCTGATCGATAAATTGAATTATCTTGTTTTCAAAGAAAAAGAATTAGGAGAAGTTGGTCATTATC TTACAGCATACCAACTCACTGCACCATTTGAAAGCTTCAAAAAGCTAGGCAAGCAATCTGGGATTTTGTT CTATGTTCCGGCTGATTATACATCAAAGATAGATCCTACCACAGGCTTTGTAAATTTTTTAGATCTTAGG TACCAATCCGTTGAAAAAGCTAAACAGTTGCTGTCCGATTTTAATGCGATAAGATTTAATAGTGTTCAGA ATTATTTTGAGTTCGAAATTGATTATAAAAAATTGACACCAAAACGTAAAGTAGGAACACAATCTAAAT GGGTTATTTGTACCTATGGAGATGTTAGATACCAAAACAGAAGAAATCAGAAAGGTCACTGGGAAACTG AAGAAGTTAACGTTACTGAAAAACTTAAAGCTCTATTTGCGAGCGATTCAAAAACGACGACGGTGATCG ATTATGCAAATGATGATAACCTTATTGATGTAATTCTGGAACAAGATAAGGCATCATTTTTTAAAGAACT ACTATGGTTGTTAAAGCTAACCATGACCCTAAGGCACTCCAAGATAAAGTCAGAGGATGATTTTATCCTC TCTCCAGTGAAAAACGAACAAGGTGAGTTTTACGACTCAAGAAAGGCGGGTGAAGTCTGGCCTAAGGAT GCTGATGCCAATGGAGCTTATCACATCGCTCTGAAGGGGCTATGGAACTTACAGCAAATTAACCAATGG GAAAAAGGTAAAACTTTAAACCTCGCCATAAAGAACCAGGATTGGTTCAGCTTTATCCAAGAAAAACCA TATCAAGAATAA SEQ ATGCACACAGGAGGTCTACTCTCGATGGATGCTAAGGAATTTACCGGTCAATATCCGCTGTCCAAAACTT ID TGCGTTTTGAGCTTAGACCTATTGGCCGAACGTGGGATAACCTAGAGGCTTCTGGTTATTTGGCGGAAGA NO: TAGACATAGAGCTGAGTGTTATCCCCGAGCTAAAGAATTGCTGGATGATAACCACAGGGCGTTCCTGAA 131 TAGAGTTCTACCGCAAATCGATATGGATTGGCATCCAATTGCTGAAGCTTTCTGCAAGGTGCACAAAAA TCCAGGTAATAAAGAATTGGCTCAGGATTATAATTTGCAGCTTAGTAAGAGAAGAAAAGAAATTTCCGC TTATTTGCAGGATGCTGATGGATACAAGGGGTTGTTCGCGAAACCTGCCCTGGACGAAGCTATGAAAAT AGCTAAGGAAAACGGCAATGAATCTGATATTGAAGTTTTGGAAGCCTTCAATGGATTTTCCGTTTATTTC ACTGGTTATCATGAGAGTAGGGAGAATATATACTCAGACGAAGATATGGTATCCGTCGCCTATCGCATA ACTGAAGATAATTTTCCAAGGTTCGTGTCGAACGCGTTAATTTTTGATAAACTAAATGAATCGCACCCGG ATATTATTTCGGAAGTGTCCGGTAATCTGGGGGTAGACGATATTGGTAAATATTTTGATGTGTCCAACTA CAATAATTTCCTTAGTCAAGCAGGAATTGATGACTACAACCATATTATAGGAGGGCATACAACTGAAGA CGGTCTCATTCAAGCTTTTAACGTAGTGTTAAACCTAAGGCACCAAAAAGACCCAGGTTTTGAGAAAAT TCAATTTAAGCAACTCTACAAGCAGATACTGAGCGTTAGGACTAGTAAGTCATATATCCCAAAGCAATT CGATAACTCAAAGGAAATGGTCGACTGTATATGCGACTACGTCTCAAAAATAGAAAAATCTGAAACAGT AGAAAGAGCTCTGAAATTGGTAAGAAATATATCTTCTTTTGATTTAAGAGGTATTTTCGTAAATAAAAAA AACCTTCGAATTTTGTCTAATAAGTTAATTGGAGACTGGGACGCAATAGAGACAGCTTTGATGCACAGTT CCAGCAGTGAAAACGATAAGAAATCAGTGTATGACTCTGCAGAGGCATTCACCCTTGATGATATCTTCA GTTCTGTGAAAAAGTTCAGCGACGCCTCCGCTGAGGATATAGGAAACCGCGCTGAAGACATATGTCGTG TTATCTCAGAAACAGCTCCTTTCATTAACGACTTAAGGGCTGTAGATTTGGATTCTTTAAATGATGACGG CTATGAAGCGGCCGTGTCTAAAATACGGGAATCTCTTGAACCCTACATGGATCTATTTCACGAATTGGAG ATCTTTAGCGTGGGTGATGAGTTTCCTAAATGTGCTGCCTTTTATAGCGAGTTGGAAGAGGTCTCAGAAC AACTGATTGAAATCATTCCTTTATTTAACAAAGCAAGAAGTTTTTGCACAAGGAAAAGGTATTCAACCG ACAAAATCAAAGTCAATTTAAAATTCCCTACTCTGGCAGATGGATGGGATCTAAATAAAGAAAGGGATA ACAAAGCCGCAATTCTAAGAAAAGACGGTAAATACTACCTGGCAATTTTAGACATGAAGAAAGATCTCA GTAGTATTCGTACGAGCGATGAGGACGAGTCTTCTTTTGAAAAGATGGAATATAAATTGCTCCCTTCTCC TGTGAAAATGCTTCCAAAAATTTTTGTTAAATCGAAAGCCGCCAAAGAAAAGTACGGGTTGACCGATAG AATGTTAGAATGCTACGATAAAGGTATGCATAAGTCGGGTAGTGCTTTTGATTTGGGTTTTTGTCATGAA TTGATCGATTACTATAAGCGCTGCATTGCCGAGTACCCAGGCTGGGATGTTTTCGACTTTAAATTTCGTG AGACAAGCGATTACGGATCCATGAAAGAATTTAATGAAGACGTCGCTGGCGCAGGTTACTATATGTCAC TTAGAAAGATTCCATGTTCCGAAGTTTATCGTTTACTGGACGAGAAGTCAATTTACTTGTTTCAAATATA TAATAAGGATTATAGCGAAAACGCACATGGGAATAAGAATATGCATACGATGTATTGGGAGGGCTTGTT CTCACCACAAAATTTGGAATCACCAGTCTTCAAATTGTCCGGAGGCGCAGAACTTTTTTTCAGAAAGTCA TCTATTCCTAATGACGCTAAAACGGTACATCCGAAAGGTTCAGTTCTTGTTCCCAGAAACGACGTCAATG GTAGAAGAATACCAGACTCGATCTACAGAGAGTTGACAAGGTATTTTAACCGTGGGGATTGCAGGATCA GTGATGAAGCTAAGTCTTACCTGGACAAGGTCAAGACAAAAAAAGCGGACCATGACATTGTTAAGGAT AGAAGATTTACTGTAGATAAGATGATGTTCCATGTTCCGATTGCCATGAATTTTAAAGCTATAAGTAAAC CAAATCTTAATAAGAAAGTTATTGATGGCATAATAGATGATCAAGATTTGAAAATCATCGGTATCGATC GTGGTGAGAGAAATCTTATTTATGTGACCATGGTCGATAGGAAGGGGAATATATTGTATCAAGACAGTC TTAATATTTTAAATGGATACGATTACCGCAAAGCTTTAGACGTGAGGGAATATGATAACAAAGAAGCTA GAAGGAATTGGACTAAAGTAGAAGGTATTAGAAAAATGAAAGAAGGTTATTTATCTTTAGCTGTTAGTA AATTGGCCGATATGATCATCGAAAATAATGCTATAATCGTAATGGAAGATTTGAATCACGGGTTTAAGG CAGGTCGTTCCAAAATTGAAAAGCAGGTGTATCAAAAATTCGAATCAATGTTAATCAACAAGTTAGGAT ACATGGTGCTAAAAGACAAGTCCATTGACCAGTCTGGTGGAGCCCTTCATGGTTACCAATTAGCCAATC ATGTTACGACCTTAGCTAGCGTGGGTAAACAATGTGGAGTAATTTTTTACATACCTGCAGCTTTTACTTC GAAGATTGATCCCACCACGGGCTTTGCTGATTTATTCGCTCTCTCTAATGTGAAGAATGTCGCTTCTATG AGAGAGTTCTTCTCCAAAATGAAGTCAGTAATATATGACAAGGCGGAAGGCAAATTCGCCTTTACATTT GATTATTTGGATTATAACGTTAAAAGCGAATGTGGACGTACCTTATGGACTGTGTATACAGTTGGTGAAC GCTTCACCTACTCTAGAGTAAACCGAGAGTATGTTCGGAAAGTCCCAACAGATATCATCTATGATGCATT ACAAAAAGCTGGTATTAGCGTCGAAGGTGACCTTAGAGATAGAATCGCGGAAAGCGACGGTGACACAT TAAAGTCTATATTCTACGCTTTTAAATACGCGTTGGATATGAGAGTCGAAAACAGAGAGGAAGACTATA TACAGTCACCTGTGAAGAATGCTTCTGGTGAGTTCTTTTGTTCAAAAAACGCCGGAAAGTCTTTGCCGCA GGATTCAGATGCAAATGGTGCCTATAATATAGCTCTGAAAGGGATCCTACAACTCAGAATGTTGAGCGA ACAATACGATCCAAATGCAGAATCGATTAGATTGCCACTTATAACTAACAAGGCATGGTTAACTTTTATG CAATCCGGTATGAAAACTTGGAAGAATTAA SEQ ATGGATTCTCTTAAGGATTTCACTAATTTATATCCAGTCTCGAAAACATTGCGGTTCGAATTGAAACCAG ID TTGGGAAAACTCTAGAAAACATTGAAAAAGCCGGTATATTGAAAGAAGATGAACACAGAGCGGAATCC NO: TACCGCCGGGTAAAAAAGATAATTGACACATACCATAAAGTGTTTATTGACAGCTCCTTAGAGAACATG 132 GCTAAAATGGGGATAGAAAATGAAATCAAGGCTATGCTGCAGTCTTTTTGTGAACTCTATAAGAAAGAC CACAGGACAGAAGGAGAAGATAAAGCTCTTGATAAAATTAGAGCTGTTCTTAGAGGTTTAATCGTTGGG GCTTTCACTGGTGTATGTGGAAGACGAGAAAACACAGTACAAAATGAAAAGTACGAGAGTTTGTTCAAA GAAAAATTGATAAAGGAAATTTTGCCAGATTTCGTGTTGTCCACCGAGGCTGAGTCTCTTCCATTCAGCG TTGAAGAAGCAACAAGGAGCTTAAAAGAGTTTGACTCATTCACTTCTTATTTTGCTGGTTTTTACGAAAA TAGAAAGAATATTTATTCCACGAAACCGCAAAGTACTGCGATAGCCTACAGATTAATTCATGAAAACTT GCCTAAATTTATAGATAATATTTTGGTCTTCCAGAAGATTAAAGAACCAATCGCTAAAGAACTTGAGCA CATAAGAGCAGATTTTAGCGCAGGCGGATATATCAAAAAAGATGAACGGCTAGAAGACATATTCTCATT AAATTACTACATTCATGTCCTTTCTCAAGCTGGTATAGAAAAATATAATGCTTTAATCGGGAAGATAGTG ACGGAAGGTGATGGTGAAATGAAAGGTCTTAATGAACATATTAACTTATATAACCAACAGAGGGGTCGA GAGGATAGGTTGCCCTTGTTTAGGCCTCTATACAAGCAAATCCTGTCCGATAGAGAGCAATTGTCTTATT TACCTGAATCATTTGAAAAAGATGAAGAGCTGCTTAGAGCACTTAAGGAATTTTACGATCACATCGCCG AAGACATCTTGGGTAGAACACAGCAATTGATGACTTCAATTTCTGAATACGACTTGTCCCGTATTTATGT CAGAAATGATTCTCAACTTACAGACATCTCGAAGAAAATGCTAGGAGATTGGAACGCCATTTATATGGC TAGAGAACGAGCCTACGACCACGAACAGGCTCCTAAACGTATTACTGCTAAATACGAACGTGATAGAAT CAAGGCCTTAAAAGGTGAAGAGTCAATTTCATTGGCGAATCTGAACAGCTGTATAGCTTTCTTGGACAA TGTAAGGGATTGTCGAGTTGACACATACCTATCAACTTTGGGGCAGAAAGAGGGTCCTCATGGCTTAAG TAACTTGGTGGAAAACGTCTTCGCCTCATATCATGAAGCAGAACAGTTATTGTCGTTTCCTTACCCCGAA GAGAACAACCTTATTCAGGACAAAGACAATGTAGTTTTGATCAAAAACCTATTGGATAATATAAGTGAT TTACAACGTTTCCTTAAACCTTTGTGGGGAATGGGCGATGAACCTGACAAAGACGAAAGGTTTTACGGT GAATACAACTATATTAGAGGAGCGCTTGACCAGGTAATACCTTTGTACAATAAAGTAAGGAACTACTTG ACTCGTAAACCATATTCTACTAGAAAAGTTAAATTGAACTTTGGTAATTCACAGCTGCTGAGTGGTTGGG ATCGTAATAAAGAAAAAGATAACTCCTGTGTTATCTTGCGAAAAGGACAAAACTTTTACTTGGCAATTA TGAACAACCGTCACAAAAGGTCCTTCGAGAACAAAGTTCTGCCTGAATACAAAGAAGGTGAACCATATT TTGAAAAAATGGACTATAAATTCCTGCCAGATCCTAATAAAATGTTGCCTAAGGTCTTCTTGTCTAAAAA AGGTATAGAAATATATAAACCATCCCCGAAGTTGCTGGAGCAATATGGTCATGGAACGCACAAAAAAG GTGACACTTTTAGTATGGATGACTTGCACGAGTTGATTGATTTTTTTAAACATTCCATTGAAGCGCACGA AGATTGGAAACAATTTGGTTTCAAGTTCTCTGACACAGCCACTTACGAAAATGTATCGTCCTTTTATAGA GAAGTGGAAGATCAGGGTTATAAACTGTCATTCCGTAAGGTTAGTGAAAGCTATGTGTACTCGTTGATC GATCAAGGGAAGCTTTATCTTTTTCAAATCTATAATAAAGATTTCTCTCCTTGTTCAAAGGGCACACCTA ATCTTCATACACTATACTGGAGAATGCTTTTCGATGAAAGAAATTTGGCTGATGTGATCTATAAATTAGA CGGTAAAGCTGAGATTTTTTTCAGAGAGAAATCCCTGAAAAACGACCATCCAACTCATCCGGCAGGTAA ACCGATTAAAAAGAAATCCCGGCAAAAAAAGGGCGAAGAGAGTTTATTCGAGTATGATTTAGTTAAGG ACAGACATTATACAATGGACAAATTTCAATTTCATGTGCCCATTACTATGAACTTTAAGTGTAGTGCAGG GTCTAAGGTTAATGATATGGTAAACGCACATATTAGAGAAGCTAAAGATATGCACGTCATCGGTATTGA TCGCGGAGAAAGAAATTTACTTTACATTTGCGTTATCGATTCTAGGGGCACCATCTTGGATCAAATCTCT TTGAACACTATAAATGATATTGACTATCATGATCTACTAGAGAGTCGGGATAAAGACAGGCAACAAGAA AGAAGAAATTGGCAAACAATTGAAGGTATTAAAGAATTAAAGCAAGGCTATCTAAGCCAGGCTGTACA CAGAATTGCCGAATTAATGGTAGCATATAAAGCTGTCGTAGCTCTAGAAGACTTGAACATGGGTTTCAA AAGAGGGCGCCAGAAGGTCGAAAGTAGTGTTTATCAACAATTTGAAAAACAGTTAATAGATAAGTTGA ATTATCTAGTGGATAAAAAAAAGCGTCCTGAGGACATTGGCGGTTTATTAAGAGCCTACCAATTCACTG CGCCATTTAAATCGTTCAAAGAAATGGGTAAACAAAACGGTTTTCTATTCTACATCCCCGCATGGAATAC CTCAAATATAGATCCAACTACCGGTTTCGTCAACTTATTTCATGCTCAATATGAGAATGTGGACAAAGCA AAATCATTCTTTCAAAAATTTGATAGCATTAGCTACAATCCTAAAAAAGATTGGTTTGAATTTGCGTTCG ATTATAAAAATTTCACCAAGAAGGCTGAAGGTTCCAGATCTATGTGGATATTGTGCACCCACGGAAGTA GAATTAAGAACTTCCGTAATTCACAGAAAAACGGCCAGTGGGACAGCGAAGAATTCGCCCTAACCGAA GCTTTCAAAAGTCTTTTCGTAAGATACGAGATAGACTATACAGCTGATCTAAAGACAGCTATTGTGGATG AGAAGCAAAAAGACTTCTTTGTCGACCTTCTTAAGTTGTTCAAGTTAACTGTGCAGATGAGAAATAGTTG GAAGGAAAAAGACCTAGATTACTTGATTAGCCCAGTCGCTGGTGCAGATGGCAGATTTTTTGATACACG TGAAGGCAATAAATCACTACCAAAAGACGCGGACGCTAATGGCGCATACAACATCGCATTGAAGGGTTT GTGGGCTCTCAGGCAGATTAGGCAGACAAGTGAGGGTGGTAAGCTTAAGCTGGCGATTTCTAATAAGGA ATGGTTACAGTTTGTTCAAGAAAGATCCTACGAAAAAGATTAA SEQ ATGAACAATGGTACTAATAATTTTCAAAACTTCATAGGGATTTCTAGCCTTCAAAAGACATTGAGAAAT ID GCTTTAATTCCAACAGAAACGACTCAACAATTCATAGTGAAAAATGGTATTATAAAAGAAGACGAGTTG NO: CGTGGCGAGAATAGACAAATTTTGAAAGATATCATGGATGACTACTACAGAGGGTTCATCTCCGAAACA 133 TTGTCTTCTATTGACGACATTGACTGGACCAGCTTATTCGAAAAAATGGAAATACAGCTGAAGAACGGA GATAACAAGGACACTCTTATAAAGGAGCAAACGGAATATAGAAAGGCTATACACAAAAAGTTTGCTAA TGACGATAGATTTAAAAACATGTTTAGTGCGAAGTTAATTTCTGATATTCTACCCGAGTTTGTCATTCAT AATAATAACTACTCTGCATCTGAAAAAGAGGAGAAGACCCAGGTTATAAAGTTGTTTTCAAGATTTGCC ACATCATTTAAAGACTACTTCAAGAACAGGGCGAATTGCTTCTCTGCTGATGATATTAGCTCTTCCAGCT GTCATAGAATTGTTAACGATAATGCCGAAATTTTTTTTAGTAATGCCTTGGTATATAGACGCATAGTCAA GTCACTAAGCAATGATGATATAAACAAGATTAGTGGTGATATGAAAGATAGCCTTAAAGAAATGAGCCT TGAAGAGATATATTCATATGAGAAGTACGGTGAATTTATAACTCAAGAAGGAATTTCTTTTTATAACGAT ATTTGTGGTAAGGTTAATTCTTTTATGAATTTGTATTGCCAGAAGAACAAGGAAAATAAGAATCTATATA AACTACAAAAGTTGCATAAACAGATTTTGTGTATAGCTGATACATCCTACGAAGTTCCGTATAAATTTGA ATCTGATGAGGAAGTTTATCAATCGGTAAACGGTTTTCTTGACAACATTTCCAGCAAACATATCGTTGAG AGACTACGTAAAATTGGAGACAACTATAATGGTTACAATCTAGATAAAATATACATAGTGTCCAAGTTT TATGAGTCTGTCTCTCAAAAGACATATCGTGATTGGGAGACCATTAATACTGCACTTGAAATTCATTATA ACAACATATTGCCTGGTAACGGGAAGAGTAAAGCTGATAAGGTTAAAAAGGCCGTCAAAAACGACTTG CAAAAGTCTATTACCGAGATAAATGAATTAGTGTCAAACTACAAACTATGCTCAGATGATAATATTAAA GCGGAAACATACATCCACGAAATTTCCCACATACTGAATAACTTTGAAGCTCAGGAGCTTAAATATAAC CCGGAAATACACTTGGTTGAGAGCGAGTTAAAAGCATCTGAGTTGAAAAATGTATTAGACGTCATCATG AATGCGTTTCATTGGTGTTCAGTTTTCATGACTGAAGAATTAGTCGACAAAGATAACAATTTTTATGCCG AATTAGAGGAAATATATGATGAAATTTATCCCGTAATTAGTTTATACAATCTAGTTAGAAATTATGTTAC ACAAAAGCCGTATAGTACCAAGAAAATAAAGCTTAATTTCGGAATACCTACGCTTGCTGATGGTTGGTC AAAAAGTAAAGAATATAGCAATAATGCAATAATTTTAATGAGAGATAACCTATATTATTTGGGTATTTTT AACGCTAAGAACAAACCAGACAAGAAAATAATTGAAGGTAATACATCTGAAAACAAGGGCGACTATAA AAAGATGATATACAATTTGCTCCCAGGTCCTAATAAAATGATTCCTAAGGTTTTCCTGAGTAGCAAGACT GGCGTTGAAACTTACAAGCCTAGTGCGTATATCCTGGAGGGTTATAAACAGAACAAGCATATCAAATCC TCTAAGGACTTCGATATCACCTTTTGCCATGACTTAATCGATTATTTTAAAAATTGTATCGCAATTCATCC AGAATGGAAAAATTTCGGATTTGATTTTAGTGATACCAGCACTTACGAGGATATCTCTGGGTTCTACAGA GAAGTGGAGTTGCAGGGCTACAAAATCGATTGGACTTACATATCTGAAAAGGACATAGATTTGCTGCAG GAGAAAGGTCAGCTATATTTGTTTCAAATCTACAACAAAGACTTTTCTAAAAAGTCTACCGGTAATGAC AATCTGCACACAATGTACTTGAAGAACTTATTCTCCGAGGAGAACTTAAAGGACATTGTACTCAAGTTG AATGGAGAAGCCGAGATTTTTTTTAGAAAGAGCAGTATAAAGAATCCTATAATCCACAAGAAGGGCTCA ATTCTCGTGAATAGGACGTATGAGGCAGAAGAAAAGGACCAATTTGGGAATATACAAATTGTAAGAAA AAACATCCCAGAAAATATCTACCAGGAATTATATAAGTATTTTAATGACAAATCTGATAAGGAACTGTC TGACGAAGCCGCTAAGCTCAAGAATGTTGTGGGCCACCATGAAGCTGCTACTAATATAGTGAAGGACTA CAGATATACCTACGATAAATATTTCCTGCATATGCCAATTACTATAAACTTCAAAGCAAATAAAACAGG TTTTATAAATGATAGAATCCTGCAGTATATTGCTAAAGAAAAGGATTTACATGTAATTGGGATTGATAGA GGTGAACGCAATCTGATCTATGTCAGCGTAATAGATACTTGTGGTAATATTGTGGAACAAAAGTCCTTTA ATATTGTGAACGGATATGATTACCAAATCAAGTTGAAACAACAAGAGGGAGCACGCCAAATTGCCCGTA AGGAATGGAAAGAGATAGGTAAGATCAAGGAAATTAAGGAAGGTTATCTTTCATTAGTTATTCACGAAA TTTCGAAGATGGTAATCAAATACAACGCAATAATTGCTATGGAGGACCTGTCATATGGATTTAAGAAAG GTAGATTCAAGGTTGAGAGACAGGTATACCAGAAATTTGAAACTATGTTGATCAACAAATTAAATTACT TAGTCTTTAAGGACATATCAATAACGGAAAACGGCGGGCTTTTAAAAGGGTATCAACTTACATACATAC CTGATAAGTTGAAAAATGTGGGTCATCAGTGTGGGTGCATCTTTTATGTTCCAGCCGCTTACACATCAAA AATCGATCCTACTACTGGGTTCGTAAACATATTTAAATTTAAAGATCTAACCGTTGATGCAAAAAGAGA GTTTATCAAGAAATTTGATAGCATTAGGTACGATTCAGAAAAAAATCTATTCTGTTTTACTTTTGACTAC AACAACTTTATAACGCAGAATACAGTGATGTCAAAATCGTCCTGGTCAGTGTATACTTATGGTGTTAGAA TTAAGAGACGTTTCGTAAACGGTCGTTTTTCTAACGAGTCCGATACAATCGACATCACTAAAGATATGGA AAAAACTTTGGAAATGACAGATATAAACTGGAGAGATGGTCACGACCTTAGACAAGATATAATCGATTA TGAAATCGTACAGCATATTTTTGAAATTTTTCGCTTAACAGTTCAGATGCGTAACTCTCTTAGTGAGCTA GAAGATAGAGATTATGATAGACTTATCTCGCCTGTTCTTAACGAAAATAATATCTTCTATGACTCGGCAA AAGCCGGTGATGCACTTCCAAAAGATGCTGATGCAAATGGCGCGTACTGCATCGCATTGAAGGGGCTCT ACGAGATTAAACAAATCACCGAAAACTGGAAAGAAGATGGTAAATTTTCTAGGGATAAGTTGAAAATC AGTAATAAAGATTGGTTCGATTTTATACAAAATAAGCGATACTTATAG SEQ ATGACCAATAAGTTTACTAATCAATACTCATTGTCTAAAACGTTAAGATTCGAGTTAATTCCCCAGGGAA ID AGACACTAGAATTTATTCAAGAAAAAGGTCTTCTCTCTCAGGATAAACAAAGAGCAGAATCATACCAGG NO: AGATGAAAAAAACCATAGATAAATTTCATAAGTACTTCATCGACTTGGCACTATCGAACGCCAAGCTAA 134 CACATTTGGAAACCTACCTGGAGTTGTATAATAAATCGGCAGAGACGAAAAAGGAACAAAAATTCAAG GATGACCTGAAGAAGGTTCAAGATAATCTGCGAAAGGAAATAGTGAAGTCGTTTAGTGATGGTGATGCA AAGTCAATCTTTGCTATTTTAGACAAGAAGGAATTAATAACCGTGGAACTTGAAAAGTGGTTTGAAAAT AACGAACAGAAAGATATTTACTTCGACGAAAAATTTAAAACGTTTACTACGTACTTTACAGGGTTCCATC AGAACCGCAAAAACATGTACTCCGTTGAACCAAACTCTACTGCAATCGCCTACAGATTAATACACGAAA ATTTGCCTAAGTTTTTAGAAAATGCAAAGGCTTTTGAAAAGATAAAGCAAGTCGAATCGTTACAGGTAA ACTTTCGCGAATTAATGGGCGAATTTGGAGATGAAGGTCTTATTTTTGTCAATGAATTAGAGGAAATGTT TCAAATTAATTATTATAACGATGTCTTGAGTCAGAACGGCATTACTATCTACAACTCAATTATCAGTGGT TTCACTAAGAATGATATAAAATATAAAGGTTTGAATGAATACATTAATAATTATAATCAAACTAAAGAT AAGAAGGACAGGCTTCCGAAATTGAAGCAATTGTACAAGCAGATTCTAAGTGATAGGATTAGTTTGTCT TTCTTGCCAGACGCATTTACTGATGGCAAGCAAGTCTTAAAGGCTATATTCGATTTCTACAAGATTAACC TACTTTCGTACACAATTGAAGGTCAAGAAGAATCTCAAAATCTGCTGCTTTTGATTAGGCAAACTATAGA AAATTTGTCGTCCTTTGACACTCAAAAAATTTACCTGAAGAATGATACACACCTGACTACAATATCACAG CAGGTCTTTGGGGATTTTTCTGTCTTCTCCACGGCCCTAAACTATTGGTATGAGACAAAAGTTAATCCAA AATTTGAAACAGAATATAGTAAGGCGAATGAAAAAAAGAGAGAAATTTTGGATAAAGCGAAGGCAGTA TTCACAAAACAAGACTATTTTTCTATCGCATTTCTCCAAGAAGTCTTATCCGAATATATTTTGACACTCGA TCACACCTCTGATATAGTTAAGAAACATTCGTCCAACTGCATCGCAGATTACTTCAAGAATCACTTCGTG GCTAAGAAAGAAAACGAAACGGATAAAACTTTTGACTTCATTGCTAACATAACCGCTAAATACCAATGT ATTCAGGGCATATTAGAAAATGCAGACCAGTACGAAGACGAGTTAAAACAGGACCAAAAGTTAATAGA TAATCTAAAGTTTTTCTTAGATGCTATACTTGAGTTATTACATTTTATAAAGCCATTGCATCTAAAATCGG AAAGTATTACTGAAAAAGACACTGCGTTCTATGATGTGTTCGAAAATTATTATGAGGCTTTATCTTTATT GACCCCCCTTTACAACATGGTCCGCAATTATGTTACTCAGAAGCCTTACTCTACTGAAAAGATCAAATTA AACTTTGAAAATGCTCAGTTGCTGAATGGTTGGGATGCCAATAAGGAAGGTGACTACCTGACGACTATT CTAAAAAAAGACGGTAATTATTTCTTAGCAATCATGGATAAAAAACATAACAAGGCATTTCAAAAATTT CCAGAAGGAAAAGAAAACTATGAAAAGATGGTTTATAAATTGTTGCCTGGAGTTAATAAAATGTTGCCA AAAGTTTTTTTTAGCAATAAGAACATAGCTTACTTTAATCCATCTAAGGAACTGCTCGAGAACTACAAGA AGGAAACACATAAAAAAGGTGATACATTTAATTTGGAACATTGCCATACTCTGATTGATTTTTTTAAGGA CTCTCTTAATAAACATGAAGACTGGAAATATTTTGATTTTCAATTTTCGGAAACTAAATCATACCAAGAT CTAAGTGGATTTTACAGAGAAGTTGAACACCAAGGTTATAAGATTAACTTCAAGAATATAGATTCTGAA TACATTGATGGTCTTGTAAACGAGGGTAAACTATTCCTGTTCCAAATCTACTCTAAGGACTTCTCACCTTT TTCCAAAGGAAAACCTAATATGCATACGTTGTACTGGAAGGCTCTATTTGAAGAACAAAATTTGCAAAA TGTAATCTACAAACTGAACGGCCAAGCTGAAATATTCTTCAGAAAAGCCTCAATTAAGCCAAAAAACAT TATTCTTCATAAAAAGAAGATCAAGATTGCGAAGAAACATTTTATTGATAAGAAGACCAAGACTTCCGA AATTGTACCAGTACAAACAATCAAGAATCTCAATATGTATTATCAAGGCAAGATAAGTGAGAAAGAGTT AACCCAGGATGATTTACGTTATATAGACAATTTCTCTATATTCAACGAGAAGAACAAAACAATAGACAT TATCAAAGATAAAAGGTTTACTGTTGACAAATTTCAATTTCATGTGCCTATCACAATGAACTTTAAGGCC ACAGGTGGTTCGTACATTAATCAAACTGTTTTAGAATATCTGCAAAATAACCCAGAGGTCAAGATCATC GGTCTTGATAGGGGTGAGAGACATCTGGTGTATCTAACACTCATTGATCAACAAGGCAACATCTTGAAG CAAGAATCATTGAACACTATCACAGACTCCAAGATCTCGACTCCATATCACAAACTCCTTGACAATAAA GAAAACGAAAGGGATCTTGCCAGAAAAAATTGGGGTACAGTTGAAAATATTAAGGAACTAAAAGAAGG TTACATTTCGCAAGTAGTTCACAAGATTGCAACACTCATGTTGGAAGAAAACGCAATCGTTGTCATGGA AGATTTAAATTTCGGATTTAAGAGAGGAAGATTTAAAGTAGAAAAGCAAATCTACCAGAAGTTGGAGA AGATGTTAATTGACAAATTGAACTACTTAGTGCTGAAAGACAAACAGCCTCAAGAATTGGGCGGTCTAT ACAACGCTTTACAACTGACAAATAAATTTGAGTCATTCCAAAAGATGGGTAAGCAGAGTGGTTTTTTGTT TTATGTTCCGGCATGGAACACATCCAAAATCGATCCAACTACAGGCTTCGTGAATTATTTCTACACTAAA TATGAAAATGTGGATAAAGCAAAAGCTTTCTTTGAGAAGTTCGAGGCGATCCGTTTTAACGCTGAAAAG AAGTACTTCGAGTTCGAGGTCAAAAAGTATTCAGATTTTAACCCCAAGGCTGAAGGCACCCAGCAAGCA TGGACTATTTGCACGTACGGTGAGCGAATCGAAACTAAAAGGCAAAAGGATCAAAATAATAAGTTTGTA AGCACACCCATTAACTTGACAGAAAAGATAGAAGATTTTCTTGGAAAAAACCAAATTGTATATGGTGAC GGTAACTGTATCAAGTCACAAATTGCTTCTAAAGACGATAAGGCCTTCTTCGAAACTCTGCTATACTGGT TTAAAATGACGTTGCAAATGAGAAACAGTGAAACTAGAACTGATATCGACTATTTAATATCACCCGTGA TGAACGATAATGGTACCTTTTACAATTCAAGAGATTACGAGAAATTGGAGAACCCCACACTACCAAAAG ACGCAGACGCTAATGGTGCCTACCATATTGCTAAAAAGGGACTGATGTTGTTGAACAAGATAGATCAAG CCGACTTAACTAAAAAAGTTGATTTGTCAATTTCGAATAGAGATTGGTTGCAATTCGTCCAGAAAAATA AGTAA SEQ ATGGAACAGGAATACTACTTGGGTTTGGATATGGGAACTGGTTCAGTCGGTTGGGCTGTTACGGACTCC ID GAGTACCACGTGTTGAGAAAACACGGAAAGGCTTTATGGGGTGTCAGACTATTCGAATCAGCATCGACC NO: GCGGAAGAGAGAAGAATGTTTAGAACTTCAAGAAGAAGGCTGGATCGTAGGAATTGGCGGATAGAAAT 135 TTTACAAGAAATATTCGCCGAAGAAATCTCTAAAAAAGATCCAGGATTTTTTCTACGTATGAAGGAATC CAAATACTATCCGGAAGATAAACGTGATATTAATGGCAATTGTCCAGAGTTACCCTATGCTTTATTTGTG GACGACGATTTCACCGATAAAGATTACCATAAGAAGTTCCCAACAATTTACCATCTGAGAAAGATGTTA ATGAACACTGAAGAAACCCCGGATATAAGACTGGTCTATCTAGCCATTCATCATATGATGAAACACAGG GGACACTTCTTGCTATCAGGGGATATAAATGAAATTAAAGAATTTGGTACAACATTTTCTAAATTATTGG AAAATATTAAAAACGAAGAATTAGATTGGAATTTAGAATTAGGCAAGGAGGAATACGCAGTTGTCGAA TCGATTCTGAAAGATAACATGTTGAACAGATCAACGAAAAAAACAAGGCTGATCAAGGCTTTAAAAGC GAAATCAATATGCGAAAAAGCAGTATTGAATTTGTTAGCTGGGGGGACTGTCAAGTTGTCTGATATTTTC GGATTGGAAGAATTGAATGAAACAGAGAGACCGAAGATATCCTTCGCCGATAATGGCTACGATGATTAT ATAGGCGAAGTCGAAAATGAGCTGGGCGAACAATTCTACATTATCGAGACTGCCAAGGCTGTTTATGAT TGGGCGGTGTTAGTCGAAATCCTTGGCAAATACACTTCCATCTCCGAAGCTAAGGTGGCAACCTACGAA AAGCATAAAAGTGATTTGCAATTCCTTAAGAAAATTGTCCGAAAGTACTTGACCAAAGAAGAGTACAAG GATATTTTCGTATCAACATCGGACAAACTGAAGAATTATTCAGCTTATATTGGCATGACGAAAATTAATG GTAAGAAAGTTGATTTGCAATCCAAGAGATGTTCTAAAGAAGAATTTTACGATTTCATTAAAAAAAATG TCCTAAAAAAGTTGGAGGGACAACCTGAATATGAGTATTTAAAGGAAGAACTGGAAAGAGAAACTTTC CTACCAAAGCAAGTTAATCGTGATAATGGCGTTATTCCATACCAAATACACTTGTACGAATTAAAGAAG ATCTTGGGTAACTTGAGGGACAAAATTGATTTAATCAAGGAAAATGAAGACAAACTGGTACAATTATTT GAATTTAGAATACCTTACTACGTGGGCCCTTTAAACAAAATAGACGATGGTAAGGAAGGGAAGTTCACA TGGGCAGTCAGAAAGTCCAATGAAAAAATTTACCCATGGAATTTCGAAAACGTTGTAGATATTGAAGCT TCTGCTGAGAAATTTATTAGGAGAATGACAAATAAATGCACTTATCTTATGGGGGAAGACGTGTTGCCT AAAGATAGTTTATTATATTCAAAGTATATGGTCTTAAATGAATTAAACAATGTTAAATTAGATGGTGAAA AACTTTCCGTCGAATTGAAACAAAGATTGTATACAGATGTATTCTGCAAATATAGAAAAGTAACTGTAA AGAAGATTAAAAACTACCTTAAATGTGAAGGCATTATCAGCGGAAATGTTGAGATCACTGGTATCGATG GTGATTTTAAGGCATCTTTAACCGCATATCACGACTTTAAGGAAATATTGACGGGTACTGAGCTTGCTAA AAAAGACAAAGAGAACATTATCACCAATATCGTGCTCTTCGGAGACGACAAGAAATTATTGAAAAAGA GATTGAACCGCCTATACCCTCAGATTACCCCTAACCAATTGAAGAAAATCTGCGCTCTGTCTTATACTGG ATGGGGTCGTTTTAGCAAGAAGTTTCTAGAAGAAATTACTGCTCCGGATCCTGAAACTGGGGAAGTCTG GAATATAATTACCGCGCTATGGGAATCGAATAATAATTTAATGCAATTACTATCTAATGAATACAGATTT ATGGAAGAAGTCGAAACTTACAATATGGGAAAACAAACAAAAACTTTGAGCTACGAAACAGTAGAGAA TATGTATGTCTCACCATCTGTAAAGCGGCAGATCTGGCAAACCTTGAAGATAGTTAAAGAATTAGAAAA AGTGATGAAGGAAAGTCCAAAAAGGGTTTTTATTGAAATGGCCCGAGAAAAACAAGAATCTAAAAGGA CGGAAAGTAGGAAAAAGCAACTTATAGATCTATATAAAGCCTGCAAAAATGAAGAAAAAGATTGGGTA AAGGAATTAGGTGACCAGGAAGAGCAAAAATTGAGATCTGACAAGCTGTACTTGTATTATACGCAAAA GGGCCGGTGTATGTATTCGGGTGAGGTAATAGAATTGAAAGATTTATGGGATAACACTAAGTATGACAT TGACCATATTTACCCCCAGTCTAAGACAATGGACGATTCATTAAATAACCGAGTTCTTGTCAAAAAGAA GTACAATGCCACAAAGAGCGATAAGTACCCATTGAACGAAAATATAAGACATGAACGAAAAGGTTTCT GGAAATCATTGTTGGACGGTGGATTTATTTCCAAAGAAAAATACGAGAGATTGATTAGAAACACTGAAC TATCTCCAGAGGAGTTAGCTGGCTTTATCGAAAGACAAATTGTTGAAACTAGACAGTCTACAAAAGCAG TTGCAGAAATCTTAAAACAAGTATTTCCAGAATCCGAAATTGTGTACGTCAAAGCCGGAACAGTAAGTA GATTTAGAAAAGACTTTGAATTATTGAAAGTACGAGAGGTTAACGACCTACATCATGCTAAGGATGCTT ATTTAAATATAGTCGTTGGTAATTCGTATTACGTGAAATTCACAAAAAACGCATCTTGGTTCATCAAGGA GAATCCTGGTAGGACATACAACTTGAAAAAGATGTTTACATCAGGATGGAATATCGAAAGAAATGGTGA GGTTGCGTGGGAGGTAGGCAAGAAGGGAACCATTGTTACTGTAAAGCAAATTATGAATAAAAACAATA TACTTGTTACGAGACAGGTGCACGAAGCCAAAGGAGGGTTGTTTGACCAGCAAATCATGAAGAAAGGT AAAGGTCAGATAGCAATAAAAGAGACTGATGAGCGTTTAGCTAGTATAGAAAAATATGGGGGCTACAA TAAGGCAGCTGGTGCTTACTTCATGTTGGTCGAATCAAAGGATAAAAAAGGGAAGACGATCCGGACCAT AGAGTTTATCCCTCTGTACTTGAAGAATAAGATTGAGTCTGACGAAAGCATCGCATTGAATTTCTTGGAA AAGGGGCGCGGTCTAAAGGAGCCAAAAATATTGTTAAAGAAAATTAAAATAGACACCCTATTCGACGTC GATGGGTTTAAGATGTGGCTTAGTGGTCGTACTGGGGACAGATTATTATTCAAGTGTGCCAATCAGTTAA TCCTTGACGAGAAAATCATTGTTACAATGAAAAAAATTGTTAAGTTTATTCAAAGGCGACAAGAAAATA GAGAACTAAAGTTGAGTGATAAGGATGGAATCGATAATGAAGTGTTAATGGAGATTTATAACACTTTTG TCGACAAATTGGAGAATACGGTGTACAGAATTAGGCTATCTGAACAGGCTAAAACCCTAATTGATAAAC AGAAGGAGTTTGAGCGACTTTCTCTTGAAGACAAATCTTCAACTCTTTTCGAGATCCTACATATCTTTCA GTGTCAATCTTCTGCAGCTAATTTGAAAATGATTGGAGGTCCTGGTAAGGCTGGTATATTAGTCATGAAC AACAACATATCTAAGTGTAATAAGATTAGTATAATTAACCAATCACCGACAGGTATCTTTGAAAATGAA ATTGATTTACTTAAA SEQ ATGAAATCATTCGACTCGTTCACCAACTTGTACTCCCTGTCTAAAACATTGAAATTTGAAATGCGACCTG ID TTGGTAACACCCAAAAGATGTTAGATAATGCAGGAGTTTTCGAAAAGGATAAACTGATCCAGAAAAAAT NO: ACGGTAAAACGAAACCATATTTCGATAGGTTGCATCGGGAATTTATAGAAGAAGCTTTGACTGGTGTAG 136 AATTAATTGGCTTAGATGAGAATTTCCGTACTCTAGTCGATTGGCAAAAAGATAAAAAGAACAATGTTG CCATGAAGGCATACGAAAATAGTCTACAAAGACTAAGAACAGAGATCGGGAAAATTTTCAATTTGAAG GCAGAAGACTGGGTGAAGAACAAATATCCAATATTGGGTCTTAAGAATAAGAATACTGATATATTGTTC GAGGAGGCCGTTTTCGGTATTCTTAAGGCAAGATATGGTGAAGAGAAAGACACGTTTATTGAAGTTGAG GAGATTGATAAAACCGGTAAGTCCAAAATCAACCAGATCTCTATCTTCGACAGTTGGAAGGGCTTCACT GGTTATTTTAAGAAGTTCTTCGAAACTAGGAAGAACTTCTATAAAAACGATGGTACTTCCACGGCTATTG CTACAAGAATTATCGACCAAAACCTTAAGCGTTTTATTGATAACCTATCAATTGTTGAAAGTGTTCGACA GAAAGTAGATTTGGCTGAAACTGAAAAATCTTTTAGTATCTCCTTATCCCAGTTTTTCTCTATAGATTTTT ATAATAAATGTTTGCTGCAAGATGGCATTGACTACTATAATAAAATAATTGGTGGAGAGACATTGAAAA ACGGAGAGAAGCTGATTGGCCTTAATGAGTTGATAAATCAATATAGACAAAATAATAAGGACCAGAAA ATCCCTTTCTTTAAATTGCTAGACAAACAGATTTTGTCTGAAAAGATCCTATTCTTGGATGAAATAAAGA ACGATACTGAATTGATTGAAGCTTTGTCCCAGTTTGCTAAAACAGCTGAAGAAAAGACAAAGATTGTGA AAAAATTGTTTGCTGATTTCGTAGAAAACAATTCTAAATATGATCTAGCCCAGATTTATATAAGTCAAGA AGCTTTCAATACAATAAGTAATAAGTGGACAAGTGAAACAGAAACTTTTGCTAAGTATTTATTCGAAGC CATGAAGTCTGGTAAACTTGCCAAATACGAAAAAAAAGATAACAGTTATAAATTTCCAGACTTTATAGC CCTTTCACAGATGAAGTCTGCCTTATTGTCGATATCCTTAGAAGGTCATTTTTGGAAGGAAAAATATTAT AAGATAAGCAAGTTCCAAGAAAAGACTAATTGGGAACAATTTTTGGCTATATTTCTATATGAGTTCAATT CATTATTTTCCGATAAAATCAACACTAAGGATGGAGAGACTAAGCAAGTTGGCTACTATTTGTTCGCAA AAGATCTGCACAATTTGATTCTATCAGAACAAATAGATATACCAAAAGATTCAAAGGTAACTATAAAGG ATTTCGCAGATTCCGTCCTCACCATTTATCAAATGGCTAAATATTTTGCCGTTGAAAAAAAGAGAGCGTG GTTAGCAGAATACGAGTTGGACTCGTTTTATACTCAGCCAGATACTGGATACTTGCAATTCTACGATAAT GCATACGAAGACATTGTACAGGTATACAATAAACTTAGAAATTACTTAACCAAGAAGCCCTACAGTGAA GAAAAATGGAAGCTGAACTTTGAAAATTCGACTTTGGCAAATGGTTGGGATAAAAATAAAGAAAGTGA CAACTCCGCAGTGATTTTGCAAAAGGGTGGGAAATATTACTTGGGTTTAATCACAAAAGGCCACAATAA GATTTTTGATGATAGATTTCAAGAAAAATTCATAGTTGGTATAGAAGGTGGCAAATACGAGAAAATTGT CTATAAATTCTTCCCTGATCAAGCCAAAATGTTCCCAAAAGTTTGCTTTTCTGCTAAAGGATTGGAGTTTT TCCGGCCTAGCGAGGAGATCCTTCGTATCTACAACAATGCTGAATTCAAAAAAGGAGAAACCTATAGCA TAGATTCTATGCAAAAACTGATAGATTTTTATAAGGATTGTTTAACAAAGTACGAAGGCTGGGCCTGCTA TACATTTAGACATTTAAAGCCCACAGAAGAATACCAAAATAACATTGGTGAATTCTTTCGGGACGTTGC CGAAGACGGCTATAGGATCGATTTTCAAGGTATCTCAGATCAATATATCCACGAAAAGAACGAGAAGGG TGAGCTGCACCTTTTCGAAATTCATAATAAGGACTGGAATTTGGATAAGGCGAGAGATGGTAAATCGAA GACCACTCAAAAGAACTTGCATACTTTATATTTTGAGTCCTTGTTTTCTAATGATAACGTCGTCCAAAATT TTCCAATAAAGTTGAATGGACAAGCGGAAATTTTCTATCGGCCTAAGACAGAGAAAGACAAATTAGAAT CAAAGAAAGATAAAAAGGGAAATAAAGTCATTGATCACAAACGATACTCTGAGAATAAAATATTTTTCC ACGTACCATTGACACTCAACAGGACTAAGAATGACTCTTATAGATTTAATGCTCAGATTAATAATTTTTT GGCAAATAACAAGGATATTAACATAATTGGGGTGGATAGAGGTGAAAAGCACTTGGTATATTACTCTGT CATCACTCAGGCTTCTGATATATTGGAAAGCGGGTCTCTAAATGAATTGAACGGTGTTAACTACGCCGA AAAGCTAGGTAAAAAAGCTGAAAACAGAGAGCAGGCTCGGCGCGATTGGCAAGATGTTCAAGGAATTA AAGACCTTAAAAAAGGCTACATTAGTCAAGTAGTTAGAAAGTTAGCCGATCTTGCTATTAAACATAACG CAATCATTATTCTGGAGGACCTAAATATGCGTTTTAAGCAAGTTAGGGGTGGCATAGAAAAAAGTATTT ATCAGCAGCTTGAGAAGGCTTTGATAGATAAGTTATCGTTCCTAGTTGACAAAGGTGAAAAAAATCCTG AACAAGCTGGTCATCTGTTGAAAGCTTATCAGCTGAGCGCACCTTTTGAAACATTTCAAAAAATGGGAA AACAAACAGGTATTATTTTCTATACTCAAGCGAGTTATACAAGTAAATCTGACCCAGTGACAGGATGGA GACCACACCTTTATCTAAAATATTTTTCTGCTAAAAAGGCCAAAGATGACATCGCTAAGTTTACAAAAAT AGAATTTGTCAACGATAGATTTGAATTGACTTACGATATTAAAGATTTTCAGCAAGCAAAAGAATACCC AAATAAGACAGTGTGGAAAGTATGCTCCAATGTGGAGAGATTTAGATGGGATAAAAATCTCAATCAAA ACAAGGGTGGTTACACACATTATACTAATATAACTGAAAATATTCAAGAATTGTTTACTAAGTACGGAA TTGACATAACCAAAGACTTACTAACTCAGATTTCAACTATTGACGAAAAACAAAATACCTCATTTTTCCG CGACTTTATTTTTTATTTCAACTTGATCTGTCAAATTCGTAACACGGATGATTCCGAAATTGCCAAGAAG AACGGAAAAGATGATTTCATCCTATCTCCAGTGGAACCATTTTTTGACTCAAGAAAAGATAATGGTAAT AAGTTGCCTGAGAACGGAGATGATAACGGCGCTTATAATATCGCTCGGAAGGGTATTGTAATTCTTAAT AAAATATCTCAGTACTCTGAAAAGAACGAAAACTGCGAGAAAATGAAGTGGGGCGACTTGTATGTATCT AATATAGATTGGGATAATTTCGTTACTCAAGCCAACGCGAGACATTGA SEQ ATGGAAAATTTTAAAAACCTATATCCAATTAATAAGACACTTAGATTCGAGCTTAGGCCATACGGCAAA ID ACACTAGAAAATTTTAAGAAGTCAGGCCTATTAGAAAAAGACGCCTTTAAGGCAAATTCCAGAAGATCA NO: ATGCAGGCAATTATTGATGAGAAATTTAAAGAGACTATCGAGGAAAGGTTGAAATACACTGAATTCTCT 137 GAGTGCGATCTGGGAAACATGACTTCCAAGGATAAAAAGATTACCGATAAGGCTGCTACCAACCTCAAA AAGCAAGTCATCTTATCGTTTGATGATGAAATTTTTAATAACTACTTAAAGCCGGACAAAAACATTGACG CCCTATTCAAAAATGATCCGTCCAACCCCGTAATTTCAACTTTTAAGGGTTTTACCACGTACTTTGTAAAT TTTTTTGAGATTCGTAAACATATCTTCAAAGGAGAATCGTCGGGTTCCATGGCCTATAGGATAATTGATG AAAATCTTACGACTTACTTAAACAATATCGAAAAGATAAAAAAGTTACCAGAAGAATTAAAGTCTCAAT TGGAAGGTATTGACCAAATAGACAAATTAAATAACTATAATGAGTTCATAACTCAAAGCGGTATCACAC ATTACAATGAAATTATCGGTGGTATATCTAAAAGTGAGAACGTAAAAATACAGGGAATAAACGAGGGG ATCAATCTATACTGTCAGAAGAATAAAGTAAAATTACCAAGACTAACGCCATTATACAAAATGATTCTG TCTGATAGAGTTTCCAACTCGTTCGTGCTTGATACTATAGAAAATGATACTGAATTAATTGAGATGATTA GCGACTTGATTAATAAAACAGAAATATCTCAAGACGTAATAATGTCAGACATTCAGAACATTTTCATAA AATATAAACAGCTTGGTAATTTACCGGGGATAAGTTACTCTAGCATCGTGAATGCTATTTGCTCCGATTA TGACAATAATTTTGGTGACGGAAAAAGAAAAAAATCATATGAGAACGATAGGAAGAAACACCTTGAAA CAAACGTATACTCAATTAACTATATATCGGAACTGTTAACAGACACCGATGTATCATCTAATATAAAAAT GAGATATAAGGAACTTGAACAAAATTACCAGGTGTGTAAGGAGAATTTCAATGCTACCAACTGGATGAA CATTAAGAATATTAAACAGAGTGAAAAGACAAACTTGATTAAAGATCTACTAGATATACTGAAATCAAT ACAGAGATTCTACGATCTGTTTGATATAGTTGATGAAGACAAAAATCCTAGTGCTGAGTTTTACACGTGG CTAAGTAAAAATGCGGAAAAGTTAGATTTCGAGTTCAACTCTGTTTATAATAAATCTAGGAATTATTTAA CTAGAAAGCAGTATTCTGATAAAAAGATAAAATTGAACTTCGACTCCCCTACGTTGGCAAAGGGTTGGG ATGCAAACAAAGAAATCGATAACTCCACCATAATAATGCGTAAGTTTAACAATGATAGGGGGGATTACG ATTATTTTTTGGGAATTTGGAACAAATCTACCCCAGCGAATGAAAAAATTATTCCCCTTGAAGACAATGG TCTTTTTGAAAAAATGCAGTATAAATTATATCCAGACCCATCCAAGATGCTTCCAAAGCAATTTCTGTCA AAAATTTGGAAGGCTAAACACCCTACTACTCCTGAATTTGATAAGAAGTATAAGGAGGGCCGACACAAA AAGGGTCCAGATTTTGAAAAAGAATTCCTGCATGAATTGATAGATTGTTTTAAGCATGGTTTGGTAAATC ATGATGAAAAATATCAGGATGTCTTTGGATTCAATTTGAGAAATACAGAGGATTACAACTCATATACAG AATTTCTCGAGGACGTCGAACGTTGCAATTATAATCTCAGTTTCAACAAGATCGCAGACACTTCAAACTT AATTAACGACGGAAAATTGTACGTTTTTCAAATCTGGTCGAAAGACTTTAGTATTGATTCAAAGGGTACA AAAAACCTAAATACAATATATTTCGAAAGTCTATTCTCGGAAGAAAACATGATCGAAAAAATGTTCAAA CTGTCAGGCGAAGCTGAAATATTCTACCGTCCCGCAAGCCTTAATTATTGTGAGGATATCATTAAAAAA GGACATCACCATGCAGAGTTAAAAGATAAATTCGATTACCCAATAATTAAAGATAAAAGATACTCCCAG GATAAGTTCTTTTTCCATGTACCTATGGTTATTAACTACAAGTCGGAAAAACTAAACTCGAAGTCATTAA ATAATAGAACTAACGAGAACTTGGGACAATTCACACATATAATTGGTATTGATCGTGGCGAAAGACATT TAATATATCTGACTGTTGTTGATGTTTCAACAGGAGAAATTGTTGAACAGAAACATCTTGATGAAATTAT AAACACAGATACAAAAGGCGTTGAGCATAAAACTCATTATCTAAATAAATTGGAGGAAAAGTCGAAGA CTCGCGATAACGAGAGAAAGAGTTGGGAAGCAATTGAAACCATAAAAGAGCTTAAAGAAGGTTACATT AGTCACGTCATCAATGAAATACAAAAGTTACAAGAAAAGTATAACGCTTTGATTGTAATGGAAAATCTA AATTATGGTTTTAAGAATTCAAGAATCAAAGTCGAAAAGCAGGTCTATCAGAAATTTGAAACGGCACTT ATTAAAAAGTTTAACTACATTATTGATAAAAAGGACCCAGAAACTTATATTCATGGTTACCAACTGACG AACCCAATCACAACATTGGACAAAATTGGAAACCAAAGTGGAATTGTTTTATACATTCCAGCTTGGAAT ACATCCAAAATAGACCCTGTCACGGGGTTTGTCAACTTGTTATATGCCGACGATTTAAAGTATAAAAACC AAGAACAAGCAAAGTCTTTTATTCAAAAGATTGATAATATTTATTTCGAAAACGGTGAATTTAAATTCGA CATAGATTTTTCTAAATGGAACAACCGTTATTCAATAAGTAAAACTAAATGGACACTCACCTCATACGGC ACTCGTATCCAAACCTTTCGGAATCCCCAAAAAAATAACAAATGGGATTCTGCAGAATACGACTTGACC GAGGAATTTAAATTAATTCTTAATATAGACGGTACACTCAAAAGTCAAGACGTGGAGACATACAAGAAG TTTATGTCGTTATTCAAGCTTATGCTTCAGTTGAGGAACTCCGTTACAGGCACTGATATTGATTACATGAT TTCACCAGTAACGGATAAGACTGGGACTCATTTCGATTCTAGGGAAAATATTAAAAATTTACCTGCTGAC GCAGACGCAAACGGCGCATACAATATAGCAAGAAAAGGGATTATGGCCATTGAGAATATTATGAATGG CATATCAGATCCATTAAAGATAAGCAATGAAGACTACTTAAAATACATTCAGAATCAGCAAGAATAA SEQ ATGACCCAGTTTGAAGGTTTCACCAATTTGTACCAAGTAAGTAAAACCTTGAGGTTCGAATTGATCCCAC ID AGGGCAAGACATTGAAGCATATTCAAGAGCAAGGATTTATAGAAGAAGATAAAGCGAGAAACGATCAC NO: TATAAAGAGTTAAAACCCATTATTGACAGGATCTATAAAACATACGCCGATCAATGCCTTCAATTAGTG 138 CAATTAGATTGGGAAAACTTGAGCGCTGCCATCGATTCCTACAGGAAGGAAAAAACAGAAGAAACAAG AAATGCCTTAATCGAGGAACAAGCAACCTATAGAAACGCTATACACGATTACTTCATCGGTAGAACTGA TAATCTAACAGATGCAATAAATAAGAGACATGCTGAGATATATAAAGGACTATTTAAAGCAGAATTATT CAACGGAAAGGTGTTGAAACAGTTAGGTACCGTTACAACTACTGAGCATGAAAATGCCTTGCTGAGAAG CTTTGACAAGTTTACTACCTACTTTTCGGGTTTCTACGAAAATCGCAAAAATGTATTTTCTGCGGAAGAT ATTTCAACTGCAATCCCTCATAGGATTGTTCAAGATAATTTCCCTAAGTTTAAAGAGAACTGTCACATTT TTACAAGGTTAATTACTGCGGTTCCAAGTCTAAGAGAACATTTTGAGAATGTAAAAAAAGCGATTGGTA TATTTGTATCCACTAGCATTGAAGAGGTTTTCAGCTTCCCTTTTTATAACCAATTACTTACCCAAACACAG ATCGACCTGTACAACCAATTGTTAGGTGGTATATCGAGGGAGGCTGGTACGGAAAAGATTAAAGGATTA AATGAAGTTCTTAATTTGGCCATACAAAAAAATGATGAAACCGCGCACATTATCGCATCTTTACCACATA GGTTTATACCGTTATTCAAGCAAATATTATCTGATCGTAATACCTTATCGTTCATATTAGAGGAGTTTAA ATCTGACGAAGAAGTTATACAATCTTTTTGCAAGTATAAGACGCTATTGAGAAACGAAAACGTTCTGGA AACAGCCGAAGCACTGTTCAATGAATTAAACAGTATCGACTTGACTCATATTTTTATATCGCATAAAAAG TTGGAGACAATTTCTTCAGCATTGTGCGATCACTGGGACACTTTAAGGAACGCACTATATGAACGTAGG ATCTCAGAATTGACAGGTAAGATAACGAAGTCTGCTAAAGAGAAAGTGCAGAGATCCCTAAAACACGA GGATATAAATTTGCAGGAGATAATTTCAGCTGCAGGTAAAGAGTTGTCTGAAGCGTTCAAGCAAAAGAC TTCCGAAATCTTGTCACACGCACACGCCGCATTAGATCAACCTTTACCCACTACTTTGAAAAAACAAGAA GAGAAGGAGATATTAAAATCACAACTTGATTCTTTACTTGGCCTTTATCATCTTTTAGATTGGTTCGCTGT TGACGAGAGCAATGAAGTGGATCCAGAGTTTTCCGCAAGATTGACCGGTATAAAGTTGGAAATGGAACC TTCGTTATCATTTTACAACAAAGCTAGGAACTATGCTACAAAAAAACCTTATTCTGTCGAAAAATTTAAA CTGAACTTCCAAATGCCTACTCTAGCAAGTGGCTGGGATGTTAATAAAGAAAAGAACAATGGCGCTATT TTGTTTGTAAAAAATGGCCTATACTATCTTGGAATTATGCCTAAACAAAAAGGTCGCTACAAGGCTTTGT CATTTGAACCTACTGAAAAGACTAGCGAAGGTTTCGATAAGATGTATTACGATTATTTCCCGGATGCCGC TAAAATGATCCCCAAGTGCTCTACTCAATTGAAGGCAGTAACTGCTCATTTCCAAACGCATACCACGCCA ATACTGCTTTCTAACAACTTTATAGAACCACTAGAAATAACGAAAGAAATTTACGACCTAAATAACCCA GAGAAAGAACCAAAAAAGTTCCAGACGGCCTACGCCAAAAAGACAGGGGACCAAAAAGGTTACCGCG AGGCGTTATGTAAATGGATTGATTTTACTAGGGACTTTTTATCAAAATACACTAAAACGACGTCTATTGA TCTTAGCTCCTTACGCCCGTCCTCCCAATACAAGGATCTAGGTGAGTATTACGCAGAGTTGAACCCGCTA TTATACCATATTTCCTTCCAAAGGATTGCTGAAAAGGAAATTATGGACGCTGTTGAAACTGGGAAATTGT ACCTGTTTCAGATTTATAATAAGGACTTCGCAAAGGGTCACCATGGTAAGCCTAACCTTCACACTTTGTA CTGGACCGGACTATTCTCGCCTGAAAATTTGGCTAAAACAAGTATCAAGTTAAACGGTCAGGCCGAGTT ATTTTATAGACCCAAATCTAGAATGAAAAGAATGGCCCATAGATTAGGCGAAAAGATGTTAAACAAGA AATTAAAGGACCAAAAAACCCCGATACCAGACACTCTATACCAAGAACTGTACGACTATGTGAATCACA GGCTTAGTCACGATTTATCAGATGAAGCGAGGGCTTTATTGCCAAATGTCATCACCAAGGAAGTATCAC ATGAAATAATTAAGGATAGAAGGTTCACATCTGATAAATTCTTTTTTCATGTCCCAATTACATTGAATTA TCAAGCAGCGAACTCACCATCTAAATTTAATCAGCGCGTCAACGCCTATTTGAAAGAACATCCCGAAAC ACCAATCATCGGCATAGATCGAGGTGAGAGAAACTTAATATATATAACTGTGATTGATTCTACAGGAAA AATCCTGGAGCAACGATCTTTAAATACCATACAACAGTTTGATTATCAAAAAAAGTTGGATAACAGAGA AAAAGAACGTGTTGCCGCTAGGCAGGCTTGGTCTGTGGTAGGAACAATTAAGGACTTAAAGCAGGGCTA TCTGTCCCAAGTTATTCATGAAATAGTCGATCTGATGATACATTATCAGGCAGTTGTCGTGTTGGAAAAT TTGAATTTTGGCTTTAAATCAAAAAGAACTGGCATAGCAGAAAAAGCTGTGTACCAGCAGTTTGAAAAG ATGTTAATCGATAAGCTAAACTGCCTTGTTCTTAAAGATTACCCCGCAGAAAAAGTAGGTGGTGTTCTTA ATCCATATCAGTTGACAGACCAATTTACATCCTTTGCGAAAATGGGTACGCAAAGCGGGTTCTTATTCTA CGTACCGGCCCCCTATACTTCTAAGATCGACCCACTAACAGGTTTTGTGGACCCTTTTGTTTGGAAGACG ATAAAGAACCACGAGTCACGCAAACATTTCTTAGAGGGCTTTGATTTCTTGCACTACGACGTGAAAACT GGTGATTTTATCTTACACTTTAAAATGAACAGAAATCTCTCTTTCCAACGTGGACTGCCCGGATTCATGC CGGCTTGGGACATCGTTTTTGAAAAGAATGAAACGCAGTTTGACGCCAAAGGTACACCATTTATAGCGG GTAAGAGAATTGTGCCGGTCATAGAAAACCATAGATTTACAGGTAGATATAGGGATCTGTACCCTGCTA ATGAATTGATTGCATTACTCGAAGAGAAAGGAATTGTGTTTCGAGATGGATCGAATATTTTACCTAAGTT GTTGGAAAATGATGATTCACACGCAATTGATACTATGGTTGCCCTCATAAGATCGGTATTGCAAATGAG AAACTCAAATGCTGCTACGGGAGAGGATTATATAAACAGCCCCGTTCGCGATCTTAATGGTGTTTGTTTT GATTCACGTTTTCAGAACCCCGAATGGCCAATGGATGCCGACGCAAACGGAGCATATCATATTGCTCTT AAAGGCCAACTACTATTAAATCACTTAAAGGAATCCAAAGACCTAAAATTGCAAAACGGGATATCTAAT CAGGATTGGCTGGCTTACATACAAGAACTACGTAACTAG SEQ ATGGCCGTTAAGTCAATCAAAGTGAAACTTAGACTGGATGACATGCCAGAGATTCGTGCGGGGTTATGG ID AAACTTCATAAGGAAGTTAACGCAGGGGTAAGATATTATACCGAATGGTTATCATTACTTCGACAAGAG NO: AATTTGTACAGAAGGTCCCCGAACGGCGACGGTGAGCAAGAATGCGATAAGACGGCTGAAGAATGTAA 139 GGCAGAACTTTTGGAGCGCCTGAGAGCCCGTCAGGTTGAAAATGGCCATAGAGGTCCTGCGGGATCTGA TGATGAGCTTTTACAGCTAGCTAGACAATTGTATGAATTGTTGGTCCCTCAGGCTATTGGGGCTAAAGGA GACGCTCAACAAATCGCCAGAAAGTTCTTGTCACCTCTGGCTGACAAAGATGCCGTGGGAGGATTAGGT ATCGCTAAAGCAGGTAATAAACCAAGATGGGTTAGAATGAGAGAAGCAGGCGAACCTGGTTGGGAAGA AGAGAAAGAAAAGGCCGAAACTAGAAAAAGCGCTGACAGAACCGCAGATGTTTTACGGGCCTTGGCTG ATTTTGGACTGAAGCCTTTGATGAGAGTGTATACTGATTCAGAAATGTCTTCCGTTGAATGGAAGCCCCT AAGGAAGGGACAAGCGGTCAGAACCTGGGATAGGGATATGTTTCAACAGGCTATTGAAAGGATGATGT CATGGGAATCCTGGAATCAAAGAGTAGGTCAAGAATACGCTAAACTGGTCGAACAAAAGAATAGATTT GAACAAAAAAATTTTGTAGGTCAAGAACATTTAGTACATTTGGTTAATCAACTTCAACAAGATATGAAA GAGGCATCTCCTGGTTTGGAATCAAAAGAACAAACAGCACACTATGTTACCGGCCGAGCTTTGCGAGGT TCTGACAAAGTATTTGAAAAGTGGGGGAAATTAGCTCCCGATGCCCCCTTTGATCTATATGATGCTGAAA TTAAAAACGTTCAAAGAAGGAACACTAGACGTTTTGGATCCCATGATCTTTTTGCAAAGCTAGCTGAGC CAGAATACCAGGCTCTATGGCGTGAAGACGCCTCGTTTTTGACTAGATACGCAGTATACAATTCAATACT CAGAAAACTAAACCATGCCAAGATGTTTGCTACATTCACCCTGCCCGATGCTACCGCTCATCCTATTTGG ACTAGATTTGACAAGTTGGGGGGGAATCTACATCAGTACACATTTTTATTTAATGAATTCGGTGAAAGA AGACACGCTATTAGATTCCACAAGCTCCTAAAGGTTGAAAACGGCGTTGCGAGAGAAGTTGATGATGTA ACAGTTCCCATTTCTATGTCGGAGCAATTGGATAATCTATTGCCTAGAGACCCTAATGAACCAATTGCTT TGTACTTTCGTGACTACGGTGCAGAACAACACTTTACAGGTGAATTCGGCGGAGCCAAGATTCAATGTA GACGTGATCAACTCGCACACATGCATAGAAGAAGAGGCGCTCGTGATGTTTATTTAAATGTGTCTGTTA GAGTTCAATCCCAATCGGAGGCTAGAGGTGAAAGAAGGCCACCATACGCAGCAGTTTTTAGGTTAGTAG GTGATAATCATAGGGCATTTGTCCACTTCGACAAATTAAGTGATTATTTAGCAGAGCACCCTGATGATGG AAAGTTGGGCAGTGAGGGATTATTAAGTGGGTTGAGGGTAATGTCTGTAGATCTTGGTCTTCGTACTTCT GCGAGTATCTCTGTCTTTAGAGTAGCACGTAAGGATGAGTTGAAACCTAATAGCAAAGGAAGAGTCCCG TTTTTTTTTCCTATTAAGGGTAACGATAACCTGGTGGCCGTGCATGAAAGATCACAACTTTTGAAATTGC CAGGAGAAACGGAGTCCAAGGACTTGAGGGCAATTAGAGAGGAACGTCAGCGTACATTGCGACAGCTG AGAACTCAATTGGCTTATTTGAGGTTGTTGGTTAGGTGTGGTTCCGAGGATGTTGGCAGAAGAGAAAGG TCTTGGGCCAAATTGATAGAACAACCAGTGGACGCCGCAAATCACATGACACCAGATTGGAGAGAAGCT TTCGAAAATGAACTCCAGAAATTAAAGAGCCTACATGGCATATGCTCTGATAAAGAGTGGATGGATGCC GTATACGAATCCGTTCGTAGAGTCTGGCGCCACATGGGTAAGCAAGTACGGGACTGGAGAAAGGATGTT CGTTCCGGCGAAAGACCGAAGATAAGGGGGTATGCAAAGGACGTTGTAGGCGGTAATTCTATTGAACA GATTGAGTATTTGGAAAGGCAGTACAAATTTCTTAAATCCTGGAGCTTCTTCGGCAAAGTGTCAGGACA AGTCATCAGGGCTGAAAAAGGTTCCAGATTTGCTATTACGCTAAGGGAACATATTGATCATGCGAAAGA AGATAGACTGAAAAAACTAGCAGATAGAATAATTATGGAAGCACTTGGTTACGTCTATGCACTTGATGA AAGAGGCAAGGGGAAATGGGTAGCTAAATACCCGCCTTGTCAACTTATTTTATTAGAAGAATTAAGCGA GTACCAATTTAACAACGATAGACCTCCATCCGAAAATAATCAGCTGATGCAATGGTCCCATAGGGGTGT TTTTCAAGAATTGATAAATCAAGCTCAAGTACACGATTTGCTGGTAGGTACTATGTACGCAGCGTTTTCG AGCCGTTTTGATGCAAGAACTGGTGCCCCAGGTATCAGATGTCGACGTGTTCCGGCCAGATGTACACAG GAACATAACCCTGAGCCATTTCCGTGGTGGCTTAATAAGTTTGTTGTCGAGCACACATTAGACGCATGCC CTCTGAGAGCAGATGACCTTATACCCACTGGAGAAGGCGAAATATTTGTTAGTCCATTCTCTGCAGAAG AAGGTGACTTTCACCAGATACATGCAGACTTAAATGCAGCACAGAATCTCCAACAAAGGTTGTGGTCGG ATTTTGATATTTCGCAAATAAGACTAAGATGCGATTGGGGAGAGGTTGATGGAGAATTGGTGCTGATTC CAAGATTAACCGGAAAGCGAACTGCCGATTCCTATTCTAACAAGGTGTTTTACACAAATACTGGTGTTAC CTATTACGAAAGAGAAAGGGGTAAGAAGAGACGTAAAGTATTTGCTCAAGAAAAATTGTCAGAAGAGG AGGCAGAACTGTTAGTAGAAGCAGACGAAGCCAGAGAAAAATCAGTTGTGCTTATGCGTGACCCTTCCG GCATTATAAATCGTGGTAATTGGACACGACAAAAAGAATTTTGGTCTATGGTCAATCAACGTATCGAAG GCTACCTAGTTAAGCAAATCAGGTCTAGGGTTCCACTACAAGATAGCGCATGTGAAAATACGGGTGATA TATAA SEQ ATGGCTACTAGATCTTTCATTTTAAAAATTGAACCTAATGAAGAAGTGAAGAAGGGTCTCTGGAAAACT ID CACGAAGTACTTAATCATGGCATTGCCTATTATATGAATATCCTGAAGCTTATTCGTCAAGAAGCTATAT NO: ACGAGCATCATGAGCAAGATCCTAAGAACCCTAAGAAAGTAAGCAAAGCGGAAATTCAGGCTGAATTG 140 TGGGACTTCGTCTTGAAGATGCAGAAGTGTAACAGTTTTACGCACGAAGTTGATAAAGATGTGGTGTTT AATATTTTGAGGGAGCTATATGAGGAGTTGGTGCCCTCGAGTGTCGAAAAAAAAGGAGAAGCTAATCAG CTGTCAAATAAATTTTTATATCCTCTGGTGGATCCAAACTCTCAATCAGGTAAAGGCACTGCCAGTAGTG GTCGAAAACCGAGATGGTATAATTTGAAAATCGCAGGTGATCCATCGTGGGAAGAAGAAAAAAAAAAA TGGGAAGAAGATAAAAAAAAAGATCCCCTTGCCAAAATACTAGGTAAGCTAGCCGAGTATGGACTTAT ACCATTATTCATTCCTTTCACGGACTCTAATGAACCAATTGTGAAGGAAATCAAATGGATGGAAAAATC ACGTAATCAGTCTGTTAGGAGGTTGGACAAAGATATGTTTATACAGGCTCTTGAGAGGTTTTTGTCGTGG GAGTCCTGGAATTTGAAAGTGAAAGAAGAATATGAAAAAGTGGAAAAGGAGCATAAGACGTTGGAAGA AAGGATTAAGGAAGATATTCAGGCCTTTAAGAGTCTGGAACAGTACGAAAAAGAAAGACAGGAACAGT TATTGAGAGATACTCTAAACACTAATGAATATAGGCTTTCCAAGAGGGGCTTGCGAGGATGGAGAGAGA TAATTCAGAAATGGTTGAAAATGGATGAGAACGAGCCATCGGAGAAATATCTAGAGGTGTTTAAAGATT ACCAAAGAAAGCACCCTCGCGAAGCTGGTGATTACTCTGTTTATGAATTCCTTTCGAAGAAGGAAAATC ACTTCATCTGGCGAAATCATCCAGAGTACCCATATTTATATGCTACATTTTGCGAAATTGACAAGAAAAA AAAAGATGCTAAACAGCAAGCGACATTCACCCTCGCTGATCCCATCAACCACCCATTATGGGTCAGGTT CGAAGAGAGATCAGGCTCGAACCTGAATAAGTACAGGATCTTGACTGAGCAATTGCATACTGAGAAGTT AAAAAAGAAATTGACGGTCCAACTTGACAGATTGATTTATCCCACTGAATCTGGTGGATGGGAGGAGAA AGGTAAGGTTGATATTGTCCTATTGCCTTCTCGTCAATTTTACAACCAAATATTTCTGGACATCGAAGAG AAGGGTAAACATGCTTTTACCTATAAGGATGAGAGTATTAAATTTCCATTGAAGGGAACGCTTGGCGGC GCTAGAGTTCAGTTCGATAGAGATCATTTGAGAAGATACCCGCATAAAGTGGAATCTGGTAATGTAGGT CGGATCTACTTTAACATGACGGTAAATATTGAACCTACCGAGTCACCAGTCAGTAAGTCTTTAAAGATTC ATAGGGATGATTTCCCTAAATTTGTCAACTTCAAGCCTAAGGAACTAACCGAGTGGATCAAAGACAGTA AAGGCAAAAAGTTAAAGAGCGGTATTGAGTCCCTGGAGATAGGTCTTAGAGTCATGTCTATCGATTTGG GTCAAAGACAAGCAGCCGCAGCATCTATTTTCGAAGTTGTTGACCAAAAACCGGATATCGAGGGGAAAT TATTTTTTCCAATAAAAGGAACTGAGCTATACGCTGTGCATCGCGCATCCTTCAATATAAAACTGCCAGG AGAAACACTAGTAAAATCTAGAGAGGTCTTGCGTAAAGCACGTGAGGACAATCTCAAATTAATGAATCA GAAGTTAAATTTCCTTAGGAACGTGTTGCATTTCCAACAGTTCGAGGACATAACTGAACGCGAGAAAAG AGTCACTAAGTGGATCTCAAGACAAGAAAATAGTGATGTGCCATTAGTGTATCAAGACGAACTTATTCA AATAAGAGAGCTAATGTATAAACCATATAAAGACTGGGTGGCATTCTTAAAACAATTACACAAGCGGCT TGAAGTAGAAATAGGAAAAGAAGTAAAGCATTGGAGGAAGAGTCTGTCCGATGGTCGCAAAGGCCTGT ACGGGATATCACTTAAAAATATTGATGAAATTGACAGAACACGAAAATTTTTGTTAAGATGGTCATTGA GACCAACCGAACCAGGTGAGGTTAGAAGGTTGGAACCAGGCCAAAGGTTTGCCATCGATCAATTAAACC ATCTTAACGCACTGAAAGAAGATAGATTGAAGAAGATGGCGAACACTATTATTATGCACGCTCTAGGTT ATTGCTATGATGTGAGAAAGAAAAAATGGCAAGCCAAGAACCCTGCATGCCAAATTATTTTGTTTGAAG ATCTTTCTAATTACAATCCATACGAAGAGCGTTCACGTTTTGAAAACTCTAAATTGATGAAATGGTCTAG AAGAGAGATTCCGAGACAGGTCGCTCTACAAGGGGAGATTTACGGTCTTCAAGTCGGTGAGGTTGGTGC TCAATTTTCTTCCAGATTTCATGCAAAAACTGGGTCTCCAGGCATTAGGTGTTCGGTCGTTACTAAGGAA AAGTTACAGGACAACCGTTTCTTCAAAAATTTGCAACGTGAAGGCCGTTTAACACTTGATAAGATAGCT GTCCTTAAGGAAGGCGATCTGTACCCAGATAAAGGTGGTGAGAAATTCATATCTTTGAGTAAAGACAGG AAACTGGTTACAACACACGCCGACATTAACGCAGCTCAGAACTTGCAAAAGAGATTCTGGACAAGGACC CACGGCTTCTATAAGGTGTACTGTAAAGCTTATCAAGTAGATGGACAAACGGTTTATATTCCTGAATCAA AGGACCAGAAACAAAAAATTATAGAAGAATTTGGTGAAGGATACTTTATCTTGAAGGATGGAGTTTATG AGTGGGGCAATGCAGGTAAGTTAAAGATAAAGAAAGGTTCATCAAAGCAATCAAGTAGCGAACTGGTC GATTCGGATATTTTAAAGGATAGCTTTGATCTAGCTAGTGAATTGAAGGGAGAAAAGTTAATGTTATAC AGAGATCCCAGTGGGAATGTATTTCCATCTGATAAGTGGATGGCCGCCGGAGTGTTTTTTGGCAAATTAG AGAGAATCTTGATTTCTAAACTGACCAATCAATACTCAATTTCGACCATCGAAGACGACTCTTCAAAACA ATCCATGTGA SEQ ATGCCTACTCGCACCATCAATCTGAAGTTAGTTTTGGGGAAGAACCCAGAAAATGCGACTCTAAGACGG ID GCACTATTCTCTACACATAGACTTGTCAACCAAGCGACTAAGAGAATTGAAGAATTTTTACTGTTGTGTA NO: GAGGAGAAGCTTATCGTACCGTAGATAATGAAGGTAAAGAAGCTGAGATCCCACGCCATGCTGTTCAAG 141 AAGAGGCGCTTGCTTTTGCAAAAGCTGCACAACGACATAACGGCTGTATCTCCACATATGAGGACCAGG AAATCTTGGATGTGCTTAGACAATTGTATGAAAGATTAGTACCTAGCGTCAATGAAAACAACGAGGCTG GGGATGCCCAAGCCGCTAACGCTTGGGTGAGTCCATTAATGAGTGCAGAGTCCGAAGGTGGACTATCGG TCTATGATAAAGTGTTAGACCCGCCGCCAGTATGGATGAAACTCAAAGAAGAGAAAGCGCCTGGTTGGG AAGCTGCTTCTCAGATTTGGATACAGTCCGACGAAGGTCAATCGCTGCTAAATAAACCGGGTAGCCCAC CACGTTGGATTAGAAAACTTAGATCTGGTCAACCGTGGCAAGATGACTTCGTTTCAGACCAAAAAAAAA AGCAAGATGAACTAACGAAAGGTAACGCACCACTCATAAAACAATTGAAAGAGATGGGCCTCTTGCCTT TAGTTAATCCCTTTTTTAGACATTTGTTGGATCCCGAGGGTAAGGGTGTATCCCCATGGGACAGATTGGC CGTAAGGGCCGCGGTGGCGCACTTCATCTCTTGGGAAAGTTGGAACCACAGAACAAGAGCTGAGTATAA CAGTTTGAAACTGCGAAGAGATGAATTTGAGGCCGCATCTGATGAATTCAAGGACGATTTTACATTGCT ACGACAATATGAGGCTAAGCGACATAGTACGCTTAAGTCAATTGCCTTAGCTGATGACTCTAACCCGTA CCGAATTGGTGTAAGGTCCTTGAGAGCCTGGAATAGGGTTAGAGAAGAATGGATTGACAAAGGCGCAA CCGAGGAACAAAGGGTTACCATCCTTAGTAAGCTTCAAACACAATTACGGGGTAAATTCGGTGATCCAG ACCTATTTAATTGGCTAGCCCAAGATAGACACGTACACCTGTGGTCCCCGAGAGATTCCGTCACGCCCCT CGTAAGGATTAATGCCGTCGACAAAGTGCTTAGAAGACGTAAGCCTTATGCACTGATGACTTTTGCACA TCCGAGATTCCATCCAAGATGGATTCTATACGAAGCGCCTGGTGGTTCTAACTTGCGACAATACGCTTTA GATTGTACTGAAAATGCTCTGCATATTACACTTCCATTACTCGTCGACGACGCCCATGGTACATGGATTG AGAAAAAAATCCGCGTACCACTCGCTCCTAGTGGACAAATACAAGATTTAACTTTAGAAAAACTTGAAA AGAAAAAAAACAGATTATACTATAGATCAGGATTCCAACAATTTGCTGGATTAGCCGGTGGTGCTGAGG TGTTGTTTCATAGGCCGTATATGGAACATGATGAGAGATCAGAAGAATCTCTGTTGGAAAGGCCAGGCG CTGTGTGGTTCAAATTAACCTTAGATGTTGCTACCCAAGCACCACCTAACTGGTTAGATGGTAAAGGCAG AGTTAGGACACCTCCAGAAGTTCATCATTTCAAAACCGCTCTGTCAAATAAATCTAAACATACGAGAAC CTTGCAACCAGGATTGAGAGTCCTTTCTGTTGATTTGGGTATGAGAACATTTGCTTCTTGTTCTGTTTTCG AATTGATCGAAGGTAAACCTGAAACAGGTAGAGCATTCCCTGTTGCTGACGAAAGATCAATGGATAGTC CAAATAAGTTATGGGCCAAGCACGAGAGAAGCTTTAAACTAACTCTGCCTGGAGAAACACCGAGCAGA AAGGAGGAAGAAGAGAGAAGCATTGCTAGGGCAGAGATTTACGCGCTGAAAAGAGATATTCAAAGACT GAAATCACTCCTAAGATTAGGTGAGGAAGATAATGATAATAGAAGAGATGCTTTGTTAGAGCAATTCTT TAAAGGATGGGGTGAAGAGGACGTAGTTCCTGGTCAAGCTTTCCCTAGAAGCCTCTTTCAGGGATTAGG CGCTGCACCCTTTAGGTCAACACCCGAATTGTGGAGACAGCACTGTCAGACGTATTACGACAAAGCGGA AGCTTGCCTGGCAAAGCATATTTCCGACTGGAGGAAGAGAACTAGACCTCGTCCGACTTCGAGAGAGAT GTGGTATAAGACAAGATCTTACCATGGTGGCAAAAGTATTTGGATGCTAGAATACTTAGATGCTGTCCG CAAATTACTACTTTCATGGTCGTTAAGAGGTCGTACTTACGGAGCTATTAATAGACAAGACACCGCTCGT TTTGGTTCCTTAGCTTCTAGATTGTTGCATCATATCAACTCTTTAAAGGAAGACCGCATCAAAACCGGTG CAGATAGTATTGTGCAGGCCGCAAGGGGCTATATTCCTCTCCCACATGGCAAGGGTTGGGAACAGCGTT ATGAACCCTGTCAGTTGATATTATTTGAAGATCTAGCTAGGTACAGATTTCGTGTAGACAGACCTCGGAG AGAGAATTCGCAATTGATGCAGTGGAATCATCGAGCTATAGTAGCAGAAACGACGATGCAAGCTGAACT ATACGGTCAAATAGTCGAAAATACCGCTGCTGGTTTCTCCTCAAGATTTCATGCTGCAACTGGTGCTCCT GGTGTCAGATGTCGCTTTTTGTTAGAACGAGATTTCGATAATGACCTACCAAAGCCGTACTTACTGAGAG AACTAAGTTGGATGTTAGGTAACACAAAGGTTGAATCAGAGGAAGAAAAATTGCGTCTTCTAAGCGAGA AAATTAGACCAGGTTCATTAGTCCCTTGGGATGGGGGTGAACAATTCGCGACATTACACCCGAAAAGAC AAACTCTTTGTGTCATTCACGCAGATATGAACGCTGCTCAAAACCTGCAACGCAGATTTTTCGGAAGGTG TGGGGAAGCCTTTCGCCTTGTGTGTCAGCCACATGGTGATGATGTTTTGAGGCTAGCGTCTACACCAGGT GCAAGACTTTTGGGTGCATTACAACAACTGGAAAATGGTCAGGGAGCTTTCGAATTAGTTCGTGATATG GGTAGCACATCACAAATGAATCGTTTCGTCATGAAGTCGTTGGGCAAAAAAAAGATCAAGCCATTACAA GACAATAACGGGGATGATGAACTAGAAGACGTGCTATCTGTTTTACCTGAAGAAGATGATACCGGACGA ATTACTGTATTTCGGGACTCTTCGGGTATATTCTTCCCTTGTAACGTTTGGATCCCGGCAAAACAGTTCTG GCCTGCGGTCCGTGCTATGATTTGGAAGGTTATGGCATCACATTCATTGGGTTAG SEQ ATGACAAAGTTAAGGCATAGACAGAAGAAGTTAACTCACGATTGGGCGGGGTCTAAAAAGAGAGAAGT ID TCTAGGGAGCAATGGTAAATTACAGAATCCATTGCTAATGCCCGTCAAAAAAGGTCAGGTGACAGAATT NO: TCGAAAAGCATTTTCCGCATACGCCCGAGCAACCAAAGGGGAAATGACGGATGGCAGAAAAAATATGT 142 TTACTCACTCATTTGAACCATTCAAGACCAAGCCTTCGTTACATCAGTGCGAACTGGCTGACAAAGCCTA CCAGAGCTTGCATTCATATTTACCGGGTTCTTTGGCGCATTTTCTTTTATCTGCCCATGCACTTGGTTTTA GGATTTTTAGCAAATCAGGGGAAGCCACTGCATTCCAAGCGTCCTCAAAGATTGAAGCTTACGAAAGCA AGTTAGCTAGCGAGCTTGCTTGTGTTGATTTGTCTATTCAGAACTTGACTATTTCAACTTTGTTCAACGCA TTAACGACTTCCGTAAGAGGTAAAGGTGAGGAGACATCGGCAGATCCACTGATAGCTAGATTTTACACC TTACTTACCGGTAAACCACTAAGCAGAGACACTCAGGGCCCAGAACGAGATTTAGCCGAGGTGATAAGC AGAAAAATTGCAAGTTCTTTTGGAACTTGGAAGGAGATGACTGCCAATCCACTTCAATCTCTTCAATTTT TTGAAGAGGAGTTGCATGCGCTAGATGCAAATGTTAGTTTGTCACCTGCCTTCGATGTTCTGATTAAGAT GAACGACCTGCAGGGTGACTTGAAGAACAGAACGATAGTTTTTGATCCAGATGCTCCTGTGTTTGAATA TAATGCTGAGGATCCTGCTGACATCATCATTAAACTGACAGCTAGATATGCGAAAGAAGCAGTGATTAA AAATCAAAATGTCGGGAATTATGTTAAGAACGCTATTACGACAACTAACGCAAACGGACTAGGTTGGTT GCTGAACAAAGGCCTTTCCTTATTGCCTGTCTCCACTGATGACGAACTATTGGAGTTTATTGGGGTCGAG AGATCCCATCCTAGCTGTCATGCGTTGATAGAACTTATCGCTCAGTTAGAAGCACCTGAACTGTTCGAAA AAAATGTTTTTTCTGATACTCGTTCCGAGGTTCAAGGTATGATAGATTCAGCTGTAAGCAATCATATCGC CAGGCTGTCAAGCTCTCGTAATTCATTGAGCATGGACTCAGAGGAACTTGAGAGATTGATAAAATCTTTT CAAATTCATACACCACATTGTTCATTATTTATAGGGGCTCAATCCTTATCTCAACAATTGGAAAGCCTAC CCGAAGCATTGCAGTCAGGAGTGAACAGTGCTGATATTCTGCTCGGCTCAACCCAATACATGTTGACAA ATTCTTTGGTCGAGGAGTCAATCGCTACGTATCAGAGAACCTTAAATAGAATTAACTACCTGTCCGGCGT TGCAGGACAGATTAACGGTGCTATTAAGAGGAAAGCTATTGATGGTGAGAAGATACATTTACCCGCTGC TTGGTCAGAGTTAATTTCTTTACCCTTTATTGGGCAACCAGTGATTGATGTTGAATCAGATTTAGCCCACT TAAAGAACCAATACCAGACATTGTCTAACGAATTTGATACGCTGATTTCCGCACTGCAAAAGAATTTCG ACTTAAATTTTAATAAAGCCTTGCTTAATCGAACACAACATTTCGAGGCTATGTGTAGATCAACAAAAA AGAATGCCCTTTCTAAGCCTGAGATCGTTAGTTATAGAGATTTGCTAGCCAGGTTGACTTCTTGTCTTTAT AGGGGCTCTCTAGTCTTGAGGAGGGCGGGTATAGAAGTACTGAAAAAGCACAAGATATTTGAGTCCAAC TCTGAATTAAGAGAGCACGTTCATGAAAGAAAACACTTCGTATTTGTTTCTCCGCTCGATAGAAAAGCC AAGAAGCTCCTACGTTTGACTGACTCTAGGCCTGATTTATTGCACGTAATTGATGAAATACTACAACATG ATAATTTAGAGAACAAGGATAGAGAATCTTTGTGGTTAGTTCGATCTGGTTATTTACTGGCCGGCCTACC AGACCAACTCTCCTCTTCCTTTATAAATCTTCCAATCATTACTCAAAAAGGCGATCGTCGCTTGATAGAT CTCATTCAATACGACCAAATTAATAGAGATGCTTTTGTGATGTTGGTAACTTCCGCTTTTAAGTCGAACT TAAGTGGGCTGCAGTACAGAGCAAACAAACAATCTTTTGTGGTTACGCGCACTTTGTCACCATATTTGGG ATCTAAATTGGTTTATGTGCCCAAAGATAAAGATTGGCTGGTCCCTTCCCAAATGTTCGAGGGGAGATTT GCGGACATTTTGCAATCCGATTATATGGTGTGGAAGGACGCTGGAAGATTGTGTGTTATTGACACAGCT AAGCATTTGTCTAACATTAAAAAATCTGTATTCTCAAGTGAAGAAGTCCTCGCGTTTTTAAGAGAATTGC CACACCGTACGTTTATCCAAACTGAGGTCAGGGGTTTAGGGGTGAATGTGGACGGTATTGCATTTAATA ACGGGGATATACCCTCTCTGAAGACGTTTAGCAATTGCGTGCAAGTCAAAGTGAGTCGGACAAACACTA GTCTGGTCCAAACATTAAATAGATGGTTTGAAGGCGGTAAGGTCTCGCCGCCTAGCATCCAATTTGAGA GAGCATATTACAAAAAAGATGATCAAATCCACGAGGACGCTGCAAAAAGGAAGATAAGGTTTCAAATG CCAGCTACAGAGTTGGTACACGCGTCAGACGACGCAGGATGGACCCCCTCCTATTTACTTGGTATCGATC CCGGTGAATATGGTATGGGTTTGTCATTGGTCTCAATAAATAATGGCGAAGTTTTAGATAGCGGATTTAT ACACATAAATTCATTGATAAATTTCGCTTCTAAGAAATCAAATCATCAAACCAAAGTTGTTCCGAGGCA GCAATACAAGTCACCATACGCCAACTATCTAGAACAATCTAAAGATTCTGCAGCAGGAGACATAGCTCA TATTTTGGATAGACTTATCTACAAGTTGAACGCCCTACCCGTTTTCGAAGCTCTATCTGGCAATAGTCAA AGCGCAGCGGATCAGGTTTGGACAAAAGTCCTCAGCTTCTACACCTGGGGAGATAATGATGCACAAAAT TCAATTCGTAAGCAACATTGGTTCGGTGCTTCACACTGGGACATTAAAGGCATGTTGAGGCAACCGCCA ACAGAAAAAAAGCCCAAACCATACATTGCCTTTCCCGGTTCACAAGTTTCTTCTTATGGTAATTCTCAAA GGTGTTCATGTTGTGGACGTAACCCAATTGAACAATTGCGCGAAATGGCGAAGGACACATCCATTAAGG AGTTGAAGATTAGAAATTCAGAAATTCAATTGTTCGACGGTACTATAAAGTTATTTAATCCAGACCCGTC AACGGTCATAGAAAGAAGAAGACATAATTTAGGGCCATCAAGAATTCCTGTAGCTGATAGAACTTTCAA AAATATAAGTCCAAGCTCACTAGAATTCAAAGAACTAATAACGATTGTGTCACGGTCTATACGTCATTCC CCAGAATTTATTGCTAAAAAAAGAGGTATAGGTAGTGAGTACTTTTGTGCTTATAGTGATTGTAATTCCT CCTTAAATTCAGAAGCAAATGCGGCTGCGAACGTTGCCCAAAAGTTCCAAAAGCAATTGTTTTTCGAATT ATAG SEQ ATGAAAAGAATCTTGAACTCTTTAAAGGTTGCCGCCCTGCGTTTGTTATTTAGAGGTAAAGGATCTGAAC ID TTGTCAAGACTGTTAAATACCCTTTGGTCTCGCCGGTTCAGGGTGCAGTTGAGGAGTTAGCTGAGGCGAT NO: CCGCCATGATAACCTACATCTGTTTGGTCAAAAAGAAATTGTTGACCTTATGGAAAAGGATGAAGGTAC 143 GCAAGTTTACTCAGTGGTTGATTTCTGGTTAGATACCCTTCGTTTGGGGATGTTTTTCAGTCCATCAGCAA ACGCATTAAAAATCACGCTGGGTAAGTTTAATTCTGATCAGGTTAGCCCTTTTAGGAAAGTGTTAGAGCA GTCTCCATTCTTCTTGGCTGGTAGGCTGAAGGTTGAACCGGCAGAACGTATATTATCTGTCGAGATCCGT AAGATTGGGAAGAGGGAAAACAGAGTTGAGAACTATGCTGCTGACGTAGAAACGTGTTTTATAGGCCA ATTAAGTTCAGATGAGAAACAGTCAATACAAAAATTAGCTAATGATATCTGGGATAGTAAAGATCATGA AGAGCAAAGAATGTTAAAGGCAGATTTCTTCGCTATCCCTTTGATTAAGGATCCAAAGGCTGTGACCGA AGAGGATCCTGAAAATGAAACTGCTGGTAAACAAAAACCCTTGGAGTTGTGTGTCTGCCTTGTCCCAGA AATAGAAGCATAGTTACCTTTTTAATAGAATCATCCGGCGAAAAAGTTTATGGTTTCTCCCCACAGCAAT TAGAGAAGGGTTTCAGACCAGACATCGAAACTTTTAAAAAGATGGTAAGAGACTTTATGAGACCTCCTA TGTTTGATAGAAAAGGCAGACCGGCCGCAGCTTACGAGAGATTTGTTTTAGGAAGGAGACATCGAAGGT ACAGGTTTGATAAAGTATTTGAGGAAAGATTTGGGAGGTCTGCTCTTTTCATTTGTCCTAGAGTAGGTTG TGGAAATTTTGACCACAGCTCCGAACAGTCCGCGGTTGTTTTGGCCTTGATCGGATATATTGCCGATAAG GAGGGAATGTCAGGTAAGAAGTTGGTTTATGTACGGCTGGCCGAACTTATGGCCGAATGGAAACTAAAA AAATTAGAAAGATCCAGAGTTGAAGAACAATCATCCGCTCAATAA SEQ ATGGCAGAAAGCAAACAAATGCAGTGTAGGAAATGTGGAGCTAGTATGAAGTACGAAGTCATCGGTTT ID GGGTAAAAAGTCATGTAGATACATGTGTCCCGATTGTGGCAACCATACCTCGGCAAGAAAGATACAAAA NO: CAAAAAAAAAAGAGATAAAAAATATGGGTCAGCCAGTAAAGCCCAATCTCAAAGAATTGCTGTAGCAG 144 GTGCTCTTTACCCTGACAAAAAAGTACAAACTATCAAAACCTATAAATATCCAGCAGACTTGAATGGTG AGGTGCATGATAGCGGTGTTGCCGAGAAAATCGCACAAGCAATACAAGAGGACGAGATTGGACTTTTG GGACCAAGCTCAGAATATGCATGCTGGATTGCATCTCAAAAACAGTCTGAGCCTTACAGTGTAGTCGAT TTCTGGTTTGATGCAGTGTGCGCAGGGGGAGTCTTCGCCTACTCTGGCGCTAGATTATTGAGTACAGTTT TACAGTTATCCGGTGAGGAATCGGTGCTTAGAGCTGCCTTAGCCTCGTCTCCATTCGTTGACGATATAAA CTTAGCGCAAGCCGAAAAGTTTTTGGCGGTTAGCAGGCGTACAGGTCAAGATAAGTTAGGTAAGAGAAT TGGGGAGTGCTTTGCAGAAGGAAGATTGGAAGCTTTAGGGATAAAAGATAGAATGAGGGAATTTGTTCA AGCTATCGATGTTGCACAGACCGCCGGACAACGTTTCGCTGCCAAATTGAAGATATTCGGTATAAGTCA GATGCCAGAAGCTAAGCAATGGAATAACGATTCCGGACTGACTGTCTGTATACTACCTGATTATTATGTT CCCGAAGAGAATCGCGCGGACCAACTTGTAGTGTTGTTAAGAAGACTTCGCGAGATTGCATATTGCATG GGTATTGAAGATGAAGCGGGTTTCGAACATCTTGGAATAGATCCTGGTGCTCTTTCGAATTTTTCAAACG GTAACCCTAAGAGAGGATTTCTAGGGAGGCTGTTAAATAACGATATTATTGCGTTGGCAAACAATATGA GTGCGATGACTCCATATTGGGAAGGGCGTAAGGGTGAACTCATAGAAAGGCTTGCGTGGTTAAAGCACA GGGCAGAAGGGCTGTATCTTAAAGAACCTCATTTCGGTAACTCCTGGGCCGATCATAGGTCACGAATTTT CTCAAGGATCGCAGGCTGGTTATCTGGTTGCGCTGGCAAGTTGAAAATTGCGAAAGACCAAATTTCTGG AGTACGTACAGATCTATTTCTGCTAAAAAGACTGCTGGACGCAGTTCCGCAATCGGCGCCATCCCCCGAT TTTATTGCGTCAATTTCGGCACTTGACAGGTTTTTAGAAGCTGCAGAATCGAGCCAGGACCCTGCTGAAC AAGTGAGGGCTCTCTACGCTTTTCACTTGAACGCACCTGCAGTCCGAAGTATAGCCAATAAAGCAGTGC AAAGGTCCGACAGCCAAGAATGGCTGATAAAAGAACTAGACGCTGTTGACCATTTAGAATTTAACAAAG CGTTCCCATTTTTCTCTGACACAGGAAAAAAAAAAAAAAAAGGTGCTAATAGCAACGGTGCTCCATCGG AAGAAGAGTACACTGAAACGGAATCAATACAACAACCTGAGGACGCGGAACAGGAAGTAAACGGACA AGAAGGGAACGGAGCGTCTAAAAATCAAAAGAAATTTCAAAGAATACCTAGATTCTTCGGTGAAGGCT CCAGATCTGAATACAGAATTTTAACGGAAGCTCCACAGTATTTCGATATGTTTTGTAATAACATGAGGGC TATATTTATGCAGTTAGAAAGTCAACCCCGTAAAGCTCCCAGAGATTTTAAATGTTTCCTACAAAATCGA TTACAAAAATTATACAAACAGACTTTCTTGAATGCACGAAGCAACAAGTGTCGCGCTCTGCTTGAGTCA GTTTTAATCTCTTGGGGAGAATTTTATACATACGGTGCCAACGAAAAGAAATTTAGATTAAGACATGAA GCTTCAGAACGCAGCAGTGACCCAGATTACGTAGTTCAGCAAGCCTTGGAAATCGCGCGTCGTCTATTC CTTTTTGGCTTCGAATGGAGAGATTGCTCCGCTGGTGAAAGAGTGGATTTGGTTGAAATTCACAAAAAG GCTATCAGTTTTTTGTTGGCTATTACTCAAGCTGAGGTCTCTGTTGGTTCATACAATTGGCTTGGCAACTC AACAGTATCGAGATATTTATCCGTTGCGGGAACTGATACCTTATACGGTACCCAATTGGAAGAATTCCTG AACGCTACAGTGTTGAGTCAAATGCGTGGTCTGGCCATTAGATTGAGTTCTCAAGAACTTAAGGACGGT TTTGATGTGCAGCTCGAGTCTTCCTGCCAGGACAATCTGCAACACCTATTGGTGTATAGGGCTTCGAGAG ATTTGGCGGCTTGCAAGCGCGCTACTTGTCCAGCCGAACTCGATCCTAAGATTTTAGTTTTACCGGTAGG TGCATTCATCGCTTCCGTAATGAAAATGATAGAAAGAGGTGACGAACCTTTAGCTGGTGCTTATTTACGG CATAGGCCACACTCTTTCGGATGGCAAATTAGGGTCCGCGGTGTTGCTGAGGTAGGGATGGATCAGGGT ACAGCATTGGCCTTTCAAAAGCCAACAGAGTCAGAACCTTTTAAAATTAAGCCCTTCTCTGCACAGTATG GACCAGTTCTGTGGTTGAACAGTAGTAGTTATTCTCAATCACAATATTTGGACGGTTTTCTATCTCAACC AAAAAATTGGAGTATGAGGGTGTTGCCTCAGGCGGGTTCAGTTCGCGTCGAACAACGAGTTGCTTTGAT ATGGAACTTACAAGCAGGCAAGATGAGACTAGAACGCTCCGGTGCGAGGGCCTTTTTCATGCCTGTACC GTTTTCATTTAGGCCATCCGGCAGTGGGGACGAAGCAGTTTTGGCGCCCAACCGGTACTTGGGTCTGTTC CCTCATTCCGGAGGTATAGAATACGCTGTAGTGGATGTCCTGGATTCTGCTGGATTTAAAATTCTTGAAA GAGGCACTATTGCTGTCAATGGTTTCTCTCAGAAAAGGGGAGAGCGCCAAGAAGAAGCCCATCGTGAAA AACAAAGAAGGGGGATAAGTGATATAGGGCGAAAGAAGCCTGTGCAGGCAGAAGTCGATGCGGCGAA CGAATTGCATAGAAAGTACACTGATGTTGCCACAAGATTAGGTTGTAGAATCGTCGTTCAATGGGCACC ACAACCTAAACCAGGGACAGCACCGACAGCGCAAACTGTTTACGCGAGGGCTGTTAGGACAGAAGCTC CGAGGAGCGGCAACCAAGAAGATCATGCAAGAATGAAAAGTTCTTGGGGTTACACCTGGGGTACGTATT GGGAGAAACGAAAACCAGAAGATATTTTAGGGATTTCTACACAGGTGTATTGGACAGGAGGTATAGGC GAATCCTGTCCTGCTGTAGCAGTCGCTTTATTAGGTCATATTAGAGCAACTTCAACACAAACGGAGTGGG AAAAGGAAGAAGTTGTCTTTGGAAGACTGAAGAAGTTCTTTCCGAGTTAA SEQ ATGGAGAAGAGAATTAATAAGATACGGAAAAAATTATCTGCGGATAATGCAACAAAGCCAGTCTCTCGT ID TCAGGCCCCATGAAAACCCTGCTTGTAAGAGTAATGACGGATGATTTAAAAAAGAGGTTGGAAAAGCGT NO: AGAAAAAAACCAGAAGTGATGCCGCAAGTGATCTCAAATAACGCAGCTAATAATCTAAGGATGCTACTT 145 GATGATTATACAAAAATGAAAGAAGCAATCCTGCAAGTTTACTGGCAGGAATTCAAGGATGACCATGTT GGACTAATGTGCAAATTCGCACAACCAGCGTCTAAGAAAATTGACCAAAATAAATTGAAACCCGAAATG GACGAAAAAGGGAATTTAACAACTGCCGGGTTTGCCTGCTCGCAATGTGGGCAACCATTATTTGTTTATA AATTAGAGCAGGTTTCGGAAAAAGGAAAGGCTTACACAAATTACTTCGGCAGATGTAATGTTGCCGAAC ACGAAAAACTCATATTGTTAGCTCAGTTGAAGCCTGAGAAAGACTCTGATGAGGCCGTTACTTACTCGTT GGGGAAGTTTGGTCAAAGAGCTCTCGATTTTTATTCTATTCATGTGACAAAGGAGTCCACACATCCCGTC AAGCCCTTGGCACAAATTGCGGGTAATAGATACGCTTCGGGTCCAGTTGGGAAGGCCCTTTCTGATGCA TGTATGGGCACAATTGCTAGCTTTCTTAGTAAATACCAGGATATCATAATAGAGCATCAAAAAGTTGTA AAGGGTAACCAAAAGAGATTAGAATCGCTGCGTGAGTTGGCGGGTAAAGAAAACTTGGAATATCCATCT GTCACTCTGCCTCCTCAACCTCATACTAAGGAAGGTGTAGATGCGTACAATGAAGTTATCGCTAGAGTCC GTATGTGGGTGAATTTAAATTTGTGGCAAAAATTGAAGTTATCGCGTGATGATGCAAAACCTCTTCTTAG ACTAAAGGGCTTTCCTAGCTTCCCTGTAGTGGAAAGACGCGAAAATGAAGTCGATTGGTGGAATACAAT TAACGAAGTCAAAAAACTGATCGATGCAAAGCGAGATATGGGTCGAGTTTTTTGGTCTGGTGTTACAGC TGAAAAAAGGAATACGATCTTAGAAGGTTACAACTACTTGCCAAATGAGAACGATCATAAAAAAAGAG AAGGCAGTTTAGAAAATCCAAAAAAGCCAGCTAAGAGACAATTTGGTGATTTGCTACTTTACCTAGAAA AAAAGTACGCCGGAGATTGGGGGAAAGTCTTTGACGAAGCTTGGGAGAGAATAGATAAAAAAATAGCA GGATTGACGTCACACATTGAAAGAGAAGAGGCGAGAAATGCAGAAGATGCTCAGTCCAAAGCTGTCCT CACCGACTGGTTGAGAGCCAAAGCGTCCTTTGTTCTCGAACGCCTAAAAGAAATGGATGAGAAGGAATT TTATGCCTGCGAAATCCAGCTACAAAAATGGTACGGAGACTTGAGAGGTAACCCCTTTGCCGTGGAAGC AGAGAACCGTGTTGTAGATATCTCCGGTTTCTCAATCGGTAGCGATGGACACTCCATTCAGTATCGCAAC TTGTTGGCCTGGAAATATTTGGAAAACGGTAAGAGGGAATTCTATTTACTTATGAATTATGGCAAGAAA GGTAGAATCAGGTTTACTGACGGAACAGACATTAAAAAGAGTGGTAAGTGGCAAGGCCTTTTGTACGGT GGTGGCAAGGCCAAAGTAATAGACTTAACATTTGACCCCGACGACGAACAACTGATAATACTGCCTTTA GCTTTTGGTACTCGACAGGGGCGAGAGTTCATTTGGAATGATCTTTTGTCACTCGAGACTGGTTTGATAA AACTTGCAAATGGAAGAGTCATCGAGAAGACAATTTACAACAAAAAGATAGGTCGCGATGAGCCTGCA CTATTTGTGGCCTTGACCTTTGAGAGAAGGGAAGTTGTCGACCCATCCAATATTAAACCAGTCAACCTAA TCGGTGTAGATAGAGGTGAAAACATCCCAGCTGTTATCGCTCTGACAGACCCTGAAGGTTGCCCTTTGCC AGAATTTAAAGATTCGTCTGGTGGACCAACAGATATATTACGTATTGGGGAAGGCTATAAAGAGAAACA ACGTGCTATTCAGGCTGCAAAAGAAGTTGAACAGAGGAGAGCTGGAGGTTACAGTAGAAAATTCGCCA GTAAAAGTAGAAACTTAGCAGATGACATGGTTAGAAACTCTGCCCGGGATTTGTTCTATCATGCGGTTA CTCACGATGCAGTCTTAGTCTTTGAAAATCTATCGCGCGGTTTTGGTAGGCAAGGCAAGAGGACTTTTAT GACAGAGAGACAATATACAAAAATGGAAGATTGGTTAACCGCGAAGCTCGCATATGAAGGTCTTACTTC GAAAACGTACCTCAGCAAAACGCTGGCTCAATATACTTCTAAAACTTGTTCAAATTGTGGTTTTACTATT ACCACGGCAGACTACGACGGGATGTTGGTGAGATTGAAGAAGACGAGCGATGGTTGGGCAACAACATT GAATAATAAGGAATTAAAAGCAGAAGGACAGATTACGTATTACAATCGTTATAAACGCCAAACGGTTG AGAAAGAGTTGTCAGCCGAGTTGGATAGACTAAGTGAAGAGAGCGGTAACAATGATATCTCAAAGTGG ACTAAAGGGAGGCGGGATGAAGCCCTCTTTTTACTAAAGAAGAGATTCTCACATAGACCTGTGCAAGAA CAATTCGTTTGTTTAGATTGTGGCCATGAGGTTCATGCAGACGAACAGGCTGCGTTAAATATTGCGAGAA GCTGGCTATTTCTAAATTCTAATTCAACAGAGTTCAAGAGCTATAAATCCGGAAAACAACCTTTCGTAGG CGCGTGGCAAGCCTTCTATAAAAGGAGATTAAAAGAGGTTTGGAAACCAAATGCA SEQ ATGAAAAGAATTAACAAAATTAGAAGGAGGCTGGTCAAAGATTCTAATACCAAGAAAGCTGGTAAGAC ID TGGTCCGATGAAAACCCTATTAGTCAGAGTTATGACCCCAGATTTGAGAGAAAGATTGGAGAACCTCAG NO: GAAAAAGCCCGAAAACATCCCACAACCCATTAGTAACACATCAAGAGCTAATTTAAACAAGTTATTAAC 146 TGACTACACTGAAATGAAAAAAGCAATATTGCATGTTTACTGGGAAGAGTTCCAGAAAGATCCTGTTGG GTTGATGTCTAGAGTTGCTCAACCGGCCCCAAAGAATATAGATCAAAGGAAACTTATTCCTGTGAAGGA CGGCAATGAAAGATTAACCAGCTCCGGTTTCGCTTGCTCCCAGTGCTGCCAACCCCTGTATGTATACAAA CTGGAACAAGTAAATGATAAAGGTAAGCCACATACTAACTACTTTGGTAGGTGTAATGTATCCGAGCAT GAAAGATTGATCTTGTTAAGTCCCCATAAACCAGAAGCTAATGATGAGTTAGTAACTTATAGTTTAGGTA AGTTCGGACAACGAGCTTTAGATTTCTATAGCATCCATGTTACAAGAGAAAGCAATCACCCCGTCAAAC CACTGGAACAAATCGGTGGTAATAGTTGTGCGTCAGGTCCAGTAGGCAAAGCTTTATCAGACGCTTGCA TGGGTGCCGTGGCTAGTTTTTTGACGAAATACCAAGATATTATACTGGAACATCAAAAGGTAATTAAAA AGAATGAAAAGAGACTCGCTAACTTAAAAGATATTGCAAGTGCCAATGGTTTAGCTTTTCCTAAAATTA CCTTGCCACCTCAGCCACATACAAAGGAGGGAATTGAAGCTTACAATAATGTAGTAGCCCAAATAGTTA TTTGGGTGAACCTTAACCTATGGCAAAAGTTAAAAATTGGTAGAGACGAAGCCAAACCCCTGCAGAGGC TGAAGGGTTTTCCCTCCTTCCCCTTAGTAGAGAGACAAGCTAATGAAGTGGACTGGTGGGATATGGTGT GCAATGTTAAAAAATTGATTAATGAGAAGAAAGAGGATGGTAAAGTGTTTTGGCAGAATCTTGCTGGCT ACAAGAGACAGGAAGCTTTACTGCCTTATTTATCTTCTGAGGAAGATAGGAAAAAAGGTAAAAAATTTG CTAGATATCAATTCGGAGACCTACTTCTGCATTTAGAAAAAAAACATGGCGAAGATTGGGGTAAAGTTT ATGATGAAGCCTGGGAAAGAATTGATAAGAAGGTAGAAGGTCTCTCCAAACATATTAAATTAGAGGAA GAACGTAGGTCCGAAGACGCTCAATCAAAGGCAGCATTAACTGATTGGTTGAGAGCAAAAGCCTCTTTC GTTATTGAAGGATTAAAAGAAGCCGACAAAGATGAATTTTGTAGATGTGAGTTAAAGTTGCAAAAGTGG TATGGAGACCTCCGTGGTAAACCTTTTGCTATTGAGGCTGAAAATTCTATACTCGATATCTCTGGATTTTC AAAACAATATAACTGCGCATTTATATGGCAGAAAGATGGTGTTAAAAAGCTAAATCTATACTTAATTAT CAATTACTTTAAAGGTGGTAAATTGCGTTTTAAGAAGATAAAGCCTGAAGCCTTTGAGGCAAACCGTTTT TACACTGTTATCAATAAAAAATCTGGGGAAATCGTACCAATGGAAGTTAATTTCAATTTCGATGATCCTA ATCTTATTATTTTACCTCTTGCTTTCGGCAAAAGGCAAGGTAGGGAGTTTATTTGGAATGATTTATTGTCG CTGGAAACGGGGTCTCTCAAACTCGCAAACGGTAGGGTGATAGAAAAAACATTATACAACAGGAGAAC TCGGCAGGATGAGCCAGCTCTTTTTGTGGCTCTGACATTCGAGAGAAGGGAAGTTTTAGATTCATCTAAC ATCAAACCAATGAATTTAATAGGTATTGACCGGGGTGAAAATATACCTGCAGTTATTGCTTTAACTGATC CTGAGGGATGTCCTCTTAGCAGATTCAAGGACTCGTTGGGTAACCCTACTCACATCTTAAGGATTGGAGA AAGTTACAAGGAGAAACAAAGGACAATACAAGCTGCTAAAGAAGTAGAACAAAGGAGGGCGGGTGGA TATAGTCGGAAATATGCCAGCAAGGCCAAGAATTTAGCTGACGACATGGTTAGGAATACAGCTAGAGAC CTTTTATACTATGCCGTCACCCAGGATGCCATGTTGATATTTGAAAATTTAAGTAGAGGCTTCGGTAGAC AAGGTAAGCGCACCTTCATGGCAGAGAGACAATATACTAGAATGGAAGATTGGTTGACTGCCAAATTGG CATACGAAGGTCTACCTAGTAAGACGTACTTATCTAAAACACTAGCGCAGTATACTTCCAAGACATGCA GTAATTGTGGTTTCACAATCACTTCTGCCGATTACGATCGCGTCTTGGAAAAACTAAAAAAAACAGCGA CAGGTTGGATGACTACTATTAATGGGAAAGAATTGAAGGTCGAAGGACAAATAACTTACTATAATAGAT ATAAACGGCAAAACGTTGTAAAAGACCTGTCAGTCGAACTCGATCGACTTAGTGAAGAATCTGTTAATA ATGATATTAGTTCGTGGACAAAAGGTAGATCCGGTGAAGCTTTGAGCCTCCTGAAAAAACGTTTTAGCC ATAGGCCTGTCCAAGAAAAGTTTGTATGTTTAAACTGTGGTTTTGAGACCCATGCAGACGAGCAGGCCG CTCTTAATATTGCTAGATCATGGTTATTTTTAAGATCTCAGGAATACAAGAAGTACCAGACTAACAAGAC AACAGGCAACACAGATAAGCGAGCATTCGTTGAGACTTGGCAATCTTTTTATAGAAAGAAATTGAAGGA AGTCTGGAAACCA SEQ ATGGGAAAAATGTATTATCTAGGCCTGGACATAGGGACCAATTCAGTAGGCTACGCTGTCACTGACCCC ID TCCTACCATTTGCTGAAGTTCAAGGGGGAACCCATGTGGGGAGCACACGTGTTTGCGGCCGGCAACCAG NO: AGCGCAGAGCGGAGAAGCTTCCGCACCTCCAGGAGAAGGCTGGATCGCAGGCAGCAGCGTGTGAAGCT 147 GGTCCAAGAGATATTTGCCCCAGTGATTTCCCCCATCGATCCGCGCTTCTTTATTAGGCTCCACGAGTCC GCTCTCTGGCGCGACGACGTGGCCGAAACTGATAAACATATTTTCTTTAATGACCCAACATACACTGACA AGGAGTACTATTCAGATTACCCAACAATTCACCATTTGATCGTGGACCTTATGGAAAGTTCGGAGAAGC ATGATCCTCGACTTGTCTATTTGGCCGTGGCGTGGCTCGTGGCACATAGGGGCCACTTCTTGAACGAGGT GGACAAGGATAACATCGGGGATGTGTTATCTTTCGACGCTTTCTATCCTGAATTCCTTGCTTTTCTGTCTG ACAATGGCGTCAGCCCGTGGGTCTGCGAATCCAAGGCCCTCCAGGCTACGCTATTGTCAAGAAATAGCG TGAACGACAAGTACAAGGCTCTTAAGTCTTTGATTTTTGGAAGCCAGAAGCCCGAGGACAACTTTGATG CAAATATCTCGGAGGACGGGCTGATTCAGCTCCTCGCTGGGAAAAAGGTCAAGGTCAATAAGCTGTTTC CACAGGAGTCAAATGACGCGAGCTTCACCCTTAACGACAAAGAGGATGCCATTGAAGAGATCCTGGGG ACACTCACCCCAGACGAGTGCGAGTGGATAGCCCATATTAGGCGCCTCTTTGATTGGGCCATAATGAAA CATGCGCTTAAGGACGGGCGCACGATATCCGAAAGCAAGGTCAAATTGTACGAGCAGCACCACCATGAT CTGACCCAGCTAAAATATTTTGTAAAAACATATCTGGCCAAGGAGTACGATGATATCTTCCGCAACGTG GATAGTGAGACCACCAAAAACTACGTCGCGTACTCATACCACGTGAAAGAAGTTAAGGGCACGCTGCCT AAGAACAAGGCAACACAAGAGGAGTTCTGCAAGTACGTTCTCGGGAAAGTTAAAAATATAGAGTGCAG CGAGGCCGACAAAGTGGATTTTGACGAGATGATTCAACGCCTGACCGACAATTCGTTTATGCCTAAACA GGTGAGTGGAGAGAATCGCGTGATTCCATATCAGCTCTATTACTATGAACTCAAGACTATTCTGAATAA GGCCGCTAGCTATTTACCCTTCCTTACGCAGTGCGGGAAGGATGCCATTTCTAACCAGGATAAACTCTTG AGTATAATGACATTTCGAATTCCCTATTTCGTGGGTCCGCTTCGTAAGGATAACAGTGAGCACGCTTGGC TGGAGCGGAAGGCTGGCAAAATTTATCCATGGAATTTCAACGACAAGGTGGATCTGGACAAATCCGAAG AAGCCTTTATCCGCAGGATGACCAATACTTGCACATACTATCCTGGGGAGGATGTCCTTCCACTGGACTC TCTGATCTACGAAAAGTTCATGATTTTGAATGAAATTAACAACATAAGGATCGATGGGTATCCTATTTCC GTCGACGTGAAGCAGCAGGTGTTCGGGCTCTTTGAGAAGAAGCGACGGGTGACCGTGAAGGATATTCAG AATCTTCTCTTATCGCTGGGAGCCCTGGATAAACACGGAAAACTGACCGGGATAGATACTACGATTCAT TCTAATTACAACACGTATCACCATTTTAAGTCACTGATGGAGAGGGGCGTCCTAACAAGAGATGACGTG GAGAGAATAGTGGAACGAATGACATATTCTGATGACACCAAGAGAGTGCGGCTTTGGCTGAATAACAA CTACGGCACTCTGACGGCGGATGATGTAAAGCATATTTCCCGACTCCGTAAGCATGACTTCGGGCGGCT GTCTAAGATGTTTCTAACAGGCCTCAAGGGTGTGCATAAGGAAACTGGGGAGCGCGCTAGCATCCTGGA TTTTATGTGGAACACCAATGATAACCTGATGCAGCTCCTGTCAGAATGCTACACATTTTCGGACGAAATC ACCAAGCTGCAGGAGGCTTACTATGCCAAGGCCCAACTAAGCTTGAATGATTTCCTGGATTCTATGTACA TCAGCAACGCCGTAAAACGACCAATTTATAGGACACTGGCAGTGGTTAACGACATTAGGAAAGCATGCG GAACAGCTCCCAAGCGAATCTTTATCGAGATGGCCCGCGACGGCGAGAGTAAGAAGAAAAGGTCAGTG ACTAGGCGGGAGCAGATCAAGAACCTTTACCGCTCTATCCGAAAAGACTTCCAGCAAGAGGTTGATTTC CTTGAGAAGATCTTAGAGAACAAGTCAGATGGACAGCTCCAATCCGATGCTCTGTATCTGTACTTCGCTC AGCTGGGACGAGATATGTACACTGGCGACCCCATTAAACTAGAACATATCAAGGACCAATCGTTTTATA ATATCGACCACATCTACCCTCAGTCCATGGTGAAAGACGATAGTCTGGACAATAAGGTGCTCGTCCAAA GTGAGATTAACGGAGAAAAGTCGAGCAGATATCCTTTGGACGCTGCGATCCGCAACAAGATGAAGCCCC TGTGGGATGCTTACTACAATCATGGACTGATCAGCCTGAAGAAGTATCAGAGACTGACCCGGAGTACCC CTTTCACAGACGATGAGAAGTGGGATTTTATCAATAGACAACTGGTGGAAACCAGGCAGTCCACGAAAG CTCTGGCCATTCTTCTGAAGAGAAAGTTTCCAGACACAGAGATCGTCTATTCAAAGGCCGGCCTCAGTTC CGACTTTAGACATGAGTTCGGACTCGTTAAATCACGAAATATAAACGATCTCCACCATGCAAAGGACGC ATTCCTCGCGATTGTGACTGGAAATGTCTATCACGAAAGATTTAATAGGCGGTGGTTCATGGTTAACCAG CCATACTCAGTGAAGACCAAGACCCTTTTCACTCACTCTATTAAAAATGGCAACTTCGTGGCTTGGAATG GTGAGGAGGATCTTGGAAGAATTGTGAAGATGTTAAAACAGAATAAGAATACCATCCACTTTACTAGAT TCAGCTTTGACCGAAAAGAGGGGCTATTCGATATTCAACCGTTAAAGGCTTCAACAGGTCTCGTTCCACG AAAGGCCGGACTGGACGTAGTGAAATACGGCGGCTATGATAAGAGCACCGCAGCTTACTACCTCCTTGT GCGATTTACGCTCGAGGATAAGAAGACCCAACACAAGCTGATGATGATTCCCGTGGAGGGACTGTACAA AGCTCGAATTGACCATGATAAAGAGTTTCTCACAGATTACGCACAAACCACCATCTCTGAGATTCTCCAG AAAGACAAACAAAAAGTTATAAACATAATGTTTCCAATGGGTACAAGGCATATTAAACTGAACAGCATG ATCTCCATTGATGGCTTTTATTTGTCCATTGGAGGAAAGTCTAGTAAAGGCAAGTCTGTCCTCTGCCATG CCATGGTACCCCTAATCGTCCCACACAAGATTGAATGCTACATCAAGGCTATGGAGAGTTTTGCTCGGA AATTTAAAGAGAATAATAAGCTGCGTATTGTGGAAAAATTCGACAAGATAACCGTTGAAGACAATCTGA ATCTGTACGAGCTCTTTCTGCAGAAGCTGCAGCATAACCCCTATAATAAGTTCTTCTCCACACAGTTCGA TGTACTGACCAACGGGCGATCAACTTTCACAAAGCTAAGTCCTGAGGAACAGGTGCAAACACTCCTAAA CATTCTTTCCATTTTTAAGACCTGCAGATCTTCAGGATGCGACTTGAAGAGCATTAACGGGAGCGCACAG GCAGCTAGGATCATGATCTCAGCTGACCTGACAGGGCTGAGTAAAAAATACTCCGACATTCGGCTTGTA GAGCAAAGCGCCAGTGGGTTGTTCGTTAGTAAGTCGCAGAACCTGCTGGAATACCTGTAA SEQ ATGTCTTCTTTGACGAAGTTTACAAACAAATACTCTAAGCAGCTTACAATTAAGAACGAACTGATTCCCG ID TAGGAAAGACTCTGGAAAACATCAAAGAGAATGGGCTGATAGACGGCGACGAACAACTGAATGAGAAC NO: TATCAGAAGGCCAAAATTATCGTGGATGACTTCCTGAGGGATTTTATTAACAAGGCCCTGAATAATACC 148 CAGATCGGCAATTGGCGGGAACTGGCCGACGCTCTGAACAAAGAAGATGAGGACAATATCGAAAAATT ACAAGACAAAATCAGGGGCATTATTGTCAGTAAGTTCGAGACATTCGATCTGTTCTCTTCGTACTCCATT AAGAAGGACGAGAAAATCATCGATGATGACAATGACGTTGAGGAAGAAGAACTGGACTTGGGTAAAAA GACCTCATCCTTCAAGTATATTTTTAAAAAAAATCTGTTTAAATTAGTGCTCCCCAGTTATTTAAAGACA ACTAACCAGGACAAGCTTAAGATTATCTCCTCTTTTGACAACTTTAGCACCTATTTTAGAGGCTTCTTTGA AAATCGCAAGAATATTTTCACTAAGAAGCCCATAAGCACCTCTATTGCCTACAGAATCGTACATGATAA CTTCCCAAAATTTTTGGATAACATTAGATGTTTTAATGTATGGCAGACCGAATGTCCTCAGTTAATTGTG AAGGCGGATAACTACCTCAAATCCAAGAATGTGATCGCCAAAGATAAGTCTCTTGCTAACTACTTTACG GTCGGAGCCTACGATTACTTCTTATCTCAAAACGGTATTGACTTTTACAATAACATTATCGGGGGATTGC CTGCCTTCGCCGGCCATGAGAAAATTCAGGGCTTAAACGAGTTCATAAATCAGGAATGTCAAAAGGACT CAGAGCTGAAATCAAAGCTTAAGAATCGACACGCATTTAAAATGGCGGTCTTGTTCAAACAGATCCTCA GCGATAGAGAGAAAAGCTTCGTTATTGATGAATTCGAGAGCGACGCACAGGTGATTGATGCCGTGAAGA ACTTCTATGCGGAACAGTGTAAAGACAATAATGTTATTTTCAACCTATTAAACTTGATTAAGAATATCGC GTTTTTAAGTGACGATGAACTCGACGGTATCTTTATAGAAGGCAAGTACCTGTCCTCTGTCAGCCAAAAA CTCTACTCAGATTGGTCCAAGCTAAGAAATGACATCGAGGACAGTGCTAACAGCAAACAGGGCAATAA AGAGCTGGCAAAGAAAATCAAGACTAATAAAGGGGATGTGGAGAAGGCGATATCTAAATATGAGTTCT CCCTCTCCGAACTGAACTCCATCGTCCACGATAATACCAAGTTTAGTGATCTGTTGTCGTGTACACTGCA CAAAGTGGCCAGTGAAAAACTCGTCAAGGTGAACGAAGGCGATTGGCCCAAACACCTGAAAAATAATG AGGAGAAACAGAAGATCAAAGAACCTTTGGATGCGTTGCTCGAAATATATAACACACTGTTGATCTTCA ACTGTAAAAGCTTCAACAAGAACGGGAACTTTTATGTAGACTACGATCGATGTATAAATGAACTGAGCA GCGTCGTTTACCTGTACAACAAGACTCGCAATTATTGTACGAAAAAACCATATAACACCGATAAGTTCA AGCTTAATTTCAACAGTCCCCAGCTGGGAGAAGGGTTCAGCAAATCAAAAGAAAACGATTGCCTGACAT TACTCTTTAAAAAGGATGATAATTATTATGTTGGGATTATTAGGAAAGGCGCTAAGATCAACTTTGACGA CACACAGGCCATAGCTGACAACACTGATAACTGCATCTTTAAAATGAATTACTTTCTGTTGAAGGACGCC AAAAAATTCATTCCAAAATGCTCTATTCAGCTCAAGGAGGTTAAGGCCCATTTCAAGAAGTCTGAAGAT GACTACATCCTCTCTGACAAGGAAAAATTCGCTAGTCCTCTGGTTATCAAAAAAAGTACCTTCTTGCTGG CTACAGCTCACGTGAAAGGCAAGAAAGGGAACATTAAGAAGTTCCAAAAGGAATACAGCAAAGAGAAT CCAACCGAGTACAGAAATTCTCTGAACGAATGGATCGCATTCTGTAAAGAATTTCTAAAGACGTACAAG GCCGCTACCATTTTCGATATTACCACCTTGAAAAAAGCCGAGGAGTACGCCGACATCGTCGAATTCTATA AAGACGTGGATAACCTGTGTTACAAATTGGAATTCTGCCCAATTAAGACCTCTTTCATTGAAAACCTCAT CGACAATGGGGACCTCTACTTATTTAGAATTAACAATAAGGATTTTTCTTCGAAATCTACCGGAACTAAA AATCTGCACACACTGTATCTGCAAGCAATCTTCGATGAACGTAATCTCAACAACCCTACAATAATGCTGA ACGGCGGTGCTGAACTGTTCTACCGTAAAGAGAGTATTGAACAGAAGAATCGAATCACACACAAAGCG GGCAGTATTCTCGTCAATAAGGTGTGCAAAGACGGGACCAGCCTGGACGATAAGATCAGGAATGAAAT ATATCAGTATGAGAACAAGTTTATCGACACCTTGTCGGATGAGGCAAAGAAGGTGCTACCTAACGTTAT CAAGAAGGAAGCTACCCATGACATAACCAAGGATAAGCGGTTCACTTCTGACAAGTTCTTCTTCCACTG TCCTCTGACCATTAACTACAAGGAAGGAGACACTAAACAATTCAATAATGAAGTACTTAGCTTTTTGCG GGGTAATCCCGATATTAACATAATTGGTATCGACCGGGGAGAACGGAACCTGATATACGTGACAGTAAT TAATCAGAAAGGAGAAATCCTGGATTCCGTATCCTTCAATACCGTGACTAATAAATCTAGTAAAATCGA GCAGACGGTCGACTACGAGGAAAAGTTAGCAGTCAGAGAGAAGGAGAGAATCGAGGCCAAACGTTCCT GGGATAGTATCAGCAAGATTGCTACTCTGAAAGAAGGATATCTGTCCGCTATCGTCCATGAGATCTGTTT GTTGATGATCAAGCACAATGCTATAGTGGTTCTGGAGAACCTGAACGCAGGCTTCAAGCGAATTAGAGG GGGCCTGTCGGAAAAAAGCGTTTACCAGAAGTTTGAAAAGATGCTAATCAATAAGTTAAATTACTTTGT AAGTAAAAAAGAAAGCGATTGGAATAAGCCATCAGGACTTTTAAACGGGCTGCAACTGAGCGACCAGT TTGAGTCATTCGAAAAACTGGGTATTCAGAGTGGTTTCATATTCTACGTACCTGCCGCTTACACTTCAAA GATCGATCCTACAACTGGTTTTGCGAATGTCCTGAATCTGTCTAAGGTGAGGAATGTGGACGCAATCAA GTCTTTCTTCAGCAACTTCAACGAGATATCTTACAGCAAGAAAGAGGCTCTGTTTAAATTCAGTTTTGAT CTGGATAGCCTGAGCAAGAAAGGATTCTCTTCTTTCGTAAAGTTTTCTAAGTCCAAATGGAACGTCTACA CGTTCGGAGAGAGAATCATTAAACCAAAGAACAAGCAGGGGTATCGGGAAGACAAAAGGATCAATCTG ACTTTCGAAATGAAGAAACTATTGAATGAGTACAAAGTCTCATTCGATTTGGAGAACAATCTGATCCCC AATCTGACCAGCGCTAACCTCAAAGACACATTCTGGAAGGAGCTGTTTTTCATCTTTAAGACCACCCTGC AGCTACGGAATAGTGTCACAAATGGGAAAGAGGATGTACTGATCTCACCTGTGAAAAACGCCAAGGGG GAGTTCTTTGTGTCCGGCACCCATAACAAAACCCTGCCTCAGGACTGTGACGCGAACGGGGCCTACCAC ATCGCGCTAAAGGGGTTAATGATTCTCGAACGTAATAATCTGGTGCGCGAAGAAAAAGACACAAAGAA AATTATGGCCATCAGCAACGTTGACTGGTTTGAGTACGTGCAGAAGCGTCGAGGAGTTTTGTAA SEQ ATGAACAACTATGACGAGTTCACTAAACTTTACCCCATTCAGAAAACCATCAGATTTGAACTGAAGCCT ID CAGGGTCGTACCATGGAACACTTGGAAACTTTCAACTTTTTCGAGGAGGACAGGGATAGAGCTGAGAAA NO: TACAAGATCTTGAAAGAGGCCATCGACGAGTATCACAAAAAATTCATCGATGAGCATCTCACCAACATG 149 TCGCTGGATTGGAACAGTCTCAAGCAGATTTCCGAGAAGTACTATAAATCTCGGGAGGAGAAAGATAAA AAGGTGTTTTTGAGCGAGCAAAAGCGAATGCGACAGGAGATAGTCTCTGAATTTAAGAAAGATGATCGG TTTAAAGACCTATTTTCCAAAAAGCTTTTTTCAGAGCTGCTGAAGGAAGAGATCTATAAAAAAGGCAAT CACCAAGAAATTGATGCCCTGAAATCATTCGACAAATTCAGTGGGTATTTCATAGGACTGCATGAGAAC CGGAAGAATATGTATAGTGATGGAGACGAGATCACAGCCATAAGCAATCGAATCGTTAACGAGAATTTC CCGAAGTTCCTGGATAACCTGCAGAAGTATCAAGAGGCTAGGAAAAAGTACCCTGAGTGGATCATCAAG GCTGAATCAGCTCTGGTGGCTCACAATATCAAGATGGATGAAGTCTTTAGTCTTGAGTACTTTAATAAAG TCCTTAACCAGGAGGGCATCCAGCGCTATAACCTGGCTCTCGGTGGCTACGTCACAAAAAGCGGAGAAA AGATGATGGGTCTCAACGATGCACTGAATTTGGCTCATCAGTCGGAGAAGTCATCTAAGGGACGCATAC ACATGACACCACTGTTTAAACAAATCCTGAGCGAAAAGGAATCATTTTCCTACATTCCCGACGTATTCAC CGAGGACTCACAACTGCTGCCTAGTATAGGGGGGTTTTTCGCTCAGATAGAGAACGACAAAGATGGCAA CATTTTTGACAGAGCCTTGGAGTTGATTTCATCTTACGCCGAGTACGATACGGAGCGCATTTATATTCGC CAGGCGGATATCAACAGGGTTTCCAATGTGATCTTTGGCGAGTGGGGAACGCTGGGCGGGCTGATGCGG GAATACAAAGCCGACTCGATCAATGACATCAACCTGGAGAGAACATGCAAGAAGGTCGATAAATGGTT GGATAGCAAAGAGTTCGCCCTGAGTGACGTCTTGGAAGCTATCAAAAGAACCGGAAATAATGACGCGTT CAACGAGTATATCTCTAAAATGAGGACCGCGAGAGAAAAAATTGATGCAGCAAGGAAGGAGATGAAGT TTATATCTGAGAAGATCTCAGGCGATGAAGAGTCCATCCATATTATTAAAACTCTTCTGGACTCAGTGCA GCAATTCCTGCACTTTTTTAACCTCTTCAAGGCCAGGCAGGATATACCGTTAGACGGGGCTTTTTATGCC GAGTTTGATGAAGTTCATTCGAAACTTTTTGCTATAGTGCCTCTCTATAATAAAGTTCGCAATTACCTGA CAAAGAATAACTTAAACACAAAGAAAATCAAGCTCAACTTCAAAAACCCAACACTGGCAAACGGATGG GATCAGAACAAGGTATATGATTACGCCTCATTGATTTTCCTCCGGGACGGGAATTACTATCTGGGGATCA TCAACCCTAAGCGCAAAAAGAACATTAAGTTCGAACAGGGATCTGGCAATGGTCCCTTCTATAGGAAAA TGGTATACAAACAGATTCCTGGCCCCAACAAGAATCTCCCACGCGTCTTTCTGACGTCCACTAAGGGAA AGAAGGAGTACAAGCCGTCTAAAGAAATTATCGAGGGCTATGAGGCAGACAAGCATATTAGGGGTGAC AAGTTTGACCTAGACTTTTGTCATAAGCTTATCGACTTTTTCAAGGAGTCCATAGAGAAGCACAAAGATT GGTCAAAGTTTAATTTCTATTTTTCTCCAACAGAGTCCTACGGGGATATCTCTGAGTTCTATCTGGATGTT GAAAAGCAGGGGTACAGAATGCACTTCGAAAATATCTCAGCAGAAACTATCGATGAGTACGTAGAGAA AGGAGATCTGTTTCTTTTCCAAATCTACAATAAGGATTTTGTGAAGGCCGCCACTGGGAAGAAGGACAT GCACACTATTTACTGGAACGCTGCATTTTCCCCTGAAAATCTGCAGGACGTAGTAGTGAAATTAAATGGT GAGGCAGAACTGTTTTACCGCGATAAATCAGACATCAAGGAAATAGTGCACCGGGAAGGCGAGATTCTT GTTAACCGAACATATAATGGCAGGACACCTGTCCCTGATAAAATTCATAAGAAACTGACCGATTACCAC AACGGTCGAACCAAGGATCTGGGCGAGGCCAAGGAATACCTCGATAAGGTGAGGTACTTCAAAGCCCA TTATGACATCACCAAGGACCGAAGATACCTTAACGACAAAATCTACTTCCATGTCCCACTCACCTTGAAC TTCAAAGCTAACGGTAAGAAGAACCTCAATAAAATGGTGATTGAAAAATTTCTGTCCGATGAGAAGGCC CATATCATCGGCATTGATCGCGGCGAGAGAAATCTCCTTTACTATTCTATCATTGATCGGTCGGGAAAGA TTATCGACCAACAATCACTGAATGTCATCGACGGATTCGACTATAGAGAGAAGCTGAACCAACGGGAAA TCGAGATGAAGGACGCGCGCCAGTCCTGGAACGCTATCGGCAAAATTAAAGATTTGAAAGAAGGTTACC TCTCCAAAGCAGTGCACGAAATTACCAAAATGGCAATCCAGTACAATGCTATTGTGGTAATGGAGGAGT TAAATTACGGATTTAAGCGCGGGAGGTTCAAGGTTGAAAAGCAAATTTACCAAAAATTTGAGAACATGT TGATTGATAAGATGAACTACCTGGTGTTCAAGGACGCACCTGACGAGTCGCCAGGCGGCGTGTTAAATG CATATCAGCTGACAAATCCACTGGAGAGCTTTGCCAAGCTAGGAAAGCAGACTGGCATTCTCTTTTACGT CCCTGCAGCGTATACATCCAAAATTGACCCCACCACTGGCTTCGTCAATCTGTTTAACACCTCCTCCAAA ACCAACGCACAAGAACGGAAAGAATTTTTGCAAAAGTTTGAGTCCATTAGCTACTCTGCCAAAGACGGC GGGATCTTTGCTTTCGCATTCGACTACAGGAAATTCGGGACGAGTAAGACAGACCACAAGAACGTCTGG ACCGCGTACACTAATGGGGAACGCATGCGCTACATCAAAGAGAAAAAGAGGAATGAACTTTTTGACCCT TCAAAGGAAATCAAGGAAGCTCTCACCTCAAGCGGTATCAAATACGATGGCGGGCAGAATATTTTGCCA GATATCCTCAGATCGAACAATAATGGACTTATCTATACTATGTACTCCTCCTTCATTGCAGCAATTCAAA TGAGAGTGTACGATGGAAAGGAGGATTACATTATATCGCCAATTAAGAACTCCAAAGGCGAATTCTTCC GCACGGATCCTAAGCGAAGAGAACTCCCAATCGACGCTGATGCGAACGGCGCCTATAATATAGCCCTGC GGGGTGAATTAACAATGCGCGCTATTGCCGAGAAGTTCGACCCCGATTCAGAAAAAATGGCTAAGCTTG AGCTGAAACACAAAGATTGGTTCGAATTCATGCAGACAAGAGGCGACTAA SEQ ATGACTAAGACCTTCGATTCCGAGTTCTTCAACCTTTATTCCCTGCAGAAAACTGTAAGGTTTGAGCTGA ID AGCCGGTGGGCGAGACAGCCAGCTTCGTAGAGGATTTCAAGAATGAGGGTCTCAAACGGGTAGTTAGTG NO: AGGATGAGAGGAGAGCAGTGGACTATCAGAAGGTGAAAGAGATCATCGATGACTATCACCGGGATTTC 150 ATAGAGGAGTCGTTGAATTACTTCCCTGAGCAAGTATCCAAAGACGCGCTGGAACAGGCCTTTCATCTTT ACCAGAAACTGAAGGCAGCGAAGGTTGAGGAGCGGGAAAAGGCCTTGAAAGAGTGGGAAGCCCTGCA GAAAAAGCTCAGAGAAAAGGTTGTCAAATGCTTCAGCGACAGCAACAAAGCCAGGTTCAGTAGGATCG ATAAGAAAGAACTGATCAAAGAAGACTTGATCAATTGGCTGGTTGCACAGAACCGGGAAGATGATATTC CCACCGTAGAGACCTTCAACAACTTCACAACTTACTTCACCGGCTTCCATGAGAATCGTAAAAACATCTA CAGTAAAGATGATCATGCAACCGCCATCTCCTTCCGGTTGATCCACGAGAATCTCCCCAAGTTCTTTGAC AACGTGATAAGTTTCAATAAGTTGAAAGAGGGATTTCCCGAACTCAAGTTCGATAAAGTGAAGGAGGAT CTGGAAGTGGATTATGACCTTAAGCACGCTTTCGAGATAGAGTACTTCGTGAACTTTGTGACTCAGGCCG GCATCGATCAGTATAACTACCTCCTCGGGGGTAAGACGCTCGAGGACGGTACTAAGAAGCAAGGAATG AATGAGCAAATTAATCTATTTAAACAGCAGCAGACCAGGGATAAGGCTAGACAGATCCCCAAGCTTATT CCTCTTTTTAAACAGATCCTAAGTGAAAGGACAGAAAGTCAAAGCTTCATACCTAAGCAATTTGAAAGT GATCAGGAGCTGTTTGACTCCCTGCAAAAGCTGCACAACAATTGCCAGGACAAGTTTACCGTGCTGCAG CAGGCTATCCTCGGACTGGCTGAGGCGGATCTTAAGAAGGTATTCATTAAGACTAGCGACCTCAATGCC CTTAGTAACACCATCTTTGGAAATTACTCCGTTTTCAGCGATGCCCTCAATCTATACAAAGAGAGCTTGA AGACTAAAAAAGCTCAGGAAGCTTTTGAAAAATTACCGGCACATTCTATACACGACCTTATACAATACT TAGAGCAGTTCAACAGCAGCCTCGACGCTGAGAAACAGCAATCCACAGACACCGTCCTGAATTACTTCA TCAAAACCGATGAACTGTACTCCCGATTTATCAAGAGCACTTCAGAAGCCTTCACGCAAGTTCAGCCTCT GTTCGAGCTGGAGGCACTGTCCAGCAAGAGACGACCGCCAGAGTCTGAAGACGAGGGAGCCAAGGGTC AAGAGGGGTTTGAACAGATAAAGCGAATTAAGGCTTACTTGGATACTCTCATGGAGGCGGTGCATTTCG CTAAGCCTTTGTACCTGGTTAAAGGCCGAAAAATGATTGAGGGGCTAGATAAGGATCAGTCTTTTTACG AGGCTTTTGAAATGGCCTACCAGGAATTGGAATCCTTGATCATTCCAATCTATAATAAAGCCCGGAGTTA TCTGAGCAGGAAGCCCTTCAAAGCCGACAAGTTCAAAATAAATTTTGACAATAATACGCTACTGTCTGG TTGGGACGCTAACAAGGAAACAGCCAATGCTTCCATCCTGTTTAAGAAAGACGGCCTGTACTACCTGGG AATTATGCCAAAAGGCAAAACTTTTTTGTTCGATTACTTTGTGTCATCAGAGGATAGCGAGAAGTTAAAG CAAAGACGGCAGAAGACCGCCGAAGAAGCCCTCGCACAAGACGGAGAATCATATTTCGAGAAAATTCG ATATAAGCTCCTGCCTGGCGCATCAAAGATGTTGCCAAAAGTCTTCTTTTCCAACAAAAACATCGGCTTT TATAACCCCAGCGATGATATCCTTCGCATCCGGAACACCGCCTCACATACCAAAAATGGAACTCCACAG AAGGGCCACTCGAAGGTTGAATTCAACCTTAACGATTGTCACAAAATGATTGATTTTTTTAAGAGCTCCA TTCAGAAACACCCCGAATGGGGGTCCTTTGGCTTCACCTTTTCTGATACTTCAGACTTCGAGGACATGTC CGCCTTCTACAGGGAGGTGGAGAACCAGGGCTATGTCATCTCCTTCGACAAAATAAAAGAGACATACAT TCAGAGCCAGGTCGAGCAGGGAAATCTGTACCTGTTTCAGATCTATAACAAGGATTTCAGTCCCTATAG CAAGGGCAAGCCCAATTTACATACCCTGTACTGGAAGGCCCTGTTCGAAGAGGCAAACCTTAACAATGT AGTTGCTAAGCTGAATGGGGAAGCAGAGATCTTCTTCCGAAGGCACAGCATCAAGGCAAGCGACAAAG TTGTACATCCTGCTAACCAGGCCATCGATAACAAGAACCCGCATACAGAAAAGACACAGTCAACCTTTG AATACGACCTCGTGAAGGACAAGAGGTACACACAAGATAAATTCTTCTTCCACGTGCCCATCAGCTTGA ATTTTAAAGCGCAGGGAGTGAGCAAATTTAACGACAAGGTCAACGGCTTCCTGAAGGGAAACCCCGAC GTGAATATCATCGGAATTGATCGCGGTGAAAGACATCTCCTCTACTTTACTGTGGTGAACCAGAAGGGT GAGATCCTAGTACAGGAGAGCCTGAACACCCTTATGAGTGATAAGGGCCATGTGAATGATTACCAGCAG AAGCTGGACAAGAAGGAACAGGAAAGGGACGCAGCGCGGAAGTCCTGGACCACTGTTGAGAATATCAA AGAACTGAAGGAGGGATATCTTAGCCATGTGGTACACAAACTTGCACATCTGATTATCAAGTATAATGC CATAGTCTGCCTGGAAGACTTGAACTTCGGTTTCAAGCGAGGAAGGTTTAAAGTGGAGAAGCAGGTGTA CCAGAAGTTTGAGAAAGCCCTTATTGATAAGCTAAACTACCTTGTCTTTAAGGAAAAAGAACTCGGCGA AGTTGGCCACTATTTAACCGCCTACCAACTAACCGCCCCTTTCGAGTCTTTTAAGAAACTGGGAAAGCAG AGCGGAATACTCTTCTATGTGCCTGCAGACTACACCTCTAAGATCGACCCCACTACCGGCTTTGTAAACT TTCTAGATCTCCGCTATCAGTCAGTAGAAAAAGCCAAACAGCTCTTGTCAGATTTTAACGCCATCCGATT TAATTCCGTCCAAAATTACTTCGAGTTCGAAATCGACTATAAAAAACTTACCCCCAAGAGAAAGGTTGG GACGCAGTCTAAGTGGGTAATCTGCACTTACGGTGACGTGAGATACCAGAACCGCCGAAACCAGAAAG GTCATTGGGAAACCGAGGAAGTGAATGTGACTGAGAAGCTCAAGGCCCTCTTCGCTAGCGACAGTAAAA CAACAACAGTTATCGATTACGCCAATGACGATAATCTTATAGACGTGATCTTGGAACAAGACAAAGCCT CTTTTTTTAAGGAATTGTTGTGGTTGCTGAAACTTACAATGACCCTTAGGCACAGCAAGATCAAATCAGA GGATGACTTCATCCTCAGCCCGGTGAAGAATGAACAGGGAGAGTTCTACGATTCACGGAAGGCTGGAGA GGTGTGGCCCAAGGATGCCGACGCGAACGGGGCCTACCACATAGCTCTAAAAGGTCTGTGGAACCTGCA ACAAATCAATCAATGGGAGAAAGGTAAGACACTGAACCTGGCCATCAAAAATCAAGATTGGTTCTCATT CATCCAGGAAAAGCCTTATCAAGAGTGA SEQ ATGCATACGGGAGGCCTTTTATCAATGGACGCAAAAGAGTTCACCGGGCAGTATCCATTATCTAAGACA ID CTCCGCTTCGAGCTGAGGCCCATTGGCAGGACCTGGGACAACCTGGAGGCGTCGGGCTACCTGGCTGAG NO: GACAGACATCGCGCAGAATGCTATCCGAGAGCTAAGGAGCTTTTGGACGACAATCATCGCGCGTTCCTT 151 AACCGGGTGCTCCCACAGATCGATATGGACTGGCACCCGATCGCTGAGGCTTTTTGCAAGGTCCATAAG AACCCTGGGAACAAAGAGCTCGCCCAGGACTACAACTTGCAGCTGAGCAAGCGACGGAAAGAGATTTC TGCCTACCTTCAAGACGCCGATGGCTACAAAGGGCTCTTCGCAAAGCCCGCATTGGATGAGGCCATGAA AATCGCCAAGGAGAACGGGAATGAAAGTGACATCGAAGTTCTCGAAGCGTTTAACGGATTTAGCGTGTA CTTTACCGGCTATCATGAGTCAAGGGAGAATATTTATAGCGATGAGGACATGGTCTCTGTGGCCTACCG GATTACCGAGGATAATTTCCCGAGGTTTGTTTCAAATGCACTAATATTCGACAAGTTAAATGAGAGCCAC CCAGACATCATCTCGGAGGTCAGCGGCAACCTCGGAGTTGACGATATTGGCAAATACTTCGACGTGAGC AACTATAACAACTTCCTCTCACAGGCTGGCATCGACGACTATAATCATATTATAGGCGGCCACACTACTG AGGATGGTCTCATTCAGGCATTCAATGTAGTCTTGAATCTTAGGCACCAGAAGGACCCTGGGTTTGAAA AGATACAGTTCAAGCAGCTGTATAAGCAGATATTATCCGTGCGAACATCTAAAAGTTACATCCCCAAAC AGTTTGATAACTCAAAGGAGATGGTGGATTGCATATGCGATTATGTGTCAAAAATTGAAAAGAGCGAGA CTGTGGAGCGGGCTCTGAAGCTCGTCAGGAACATTAGCTCCTTTGACCTTAGAGGAATTTTCGTCAATAA AAAGAATCTGAGGATCCTGAGCAATAAGCTAATAGGAGATTGGGACGCCATAGAGACAGCATTGATGC ATTCCAGCTCAAGCGAGAATGATAAGAAGTCTGTCTACGATAGCGCTGAAGCCTTCACGCTGGACGATA TCTTCTCTTCCGTGAAAAAATTTAGTGATGCGTCCGCAGAAGATATCGGGAATCGAGCCGAAGATATCT GCAGGGTAATTTCAGAGACCGCCCCTTTCATCAATGACCTGCGCGCCGTGGACCTGGATAGCCTGAATG ACGATGGTTACGAAGCTGCAGTTTCTAAGATCAGGGAGTCTCTGGAGCCATATATGGACTTGTTTCACGA ACTTGAGATCTTTAGCGTGGGCGACGAGTTCCCGAAATGCGCAGCTTTCTATAGCGAGTTAGAGGAGGT CAGCGAGCAATTAATCGAGATCATACCCCTGTTTAATAAGGCACGGAGCTTTTGTACTCGCAAGCGCTA CAGCACCGACAAGATTAAAGTTAATCTGAAATTTCCAACTCTCGCAGACGGGTGGGACCTAAACAAGGA ACGCGATAATAAAGCCGCCATCCTTAGAAAGGACGGAAAGTACTATCTTGCCATCCTAGATATGAAAAA AGATCTGAGTTCCATTCGTACTAGCGATGAAGACGAATCTTCTTTCGAAAAAATGGAGTATAAGCTGCTC CCCTCGCCAGTCAAGATGCTACCCAAGATCTTTGTGAAGAGCAAAGCAGCCAAGGAAAAGTACGGGCTG ACGGACAGGATGCTGGAGTGCTACGATAAGGGAATGCATAAATCAGGGTCAGCTTTTGACTTGGGCTTT TGCCATGAGCTAATCGATTACTACAAGCGCTGTATCGCCGAGTATCCAGGATGGGACGTTTTCGACTTTA AATTTCGGGAGACTTCTGATTATGGTTCAATGAAGGAGTTCAACGAAGATGTCGCTGGTGCCGGTTACTA CATGAGCCTTCGCAAGATTCCTTGTTCCGAAGTCTACCGGCTACTGGACGAGAAATCTATATATTTGTTC CAGATATATAACAAGGACTACAGTGAGAATGCACATGGGAATAAGAATATGCATACTATGTATTGGGAA GGTCTCTTTTCACCCCAAAATTTGGAGTCACCCGTGTTCAAACTTAGCGGTGGCGCAGAGCTGTTCTTTA GGAAATCCAGTATACCCAATGACGCCAAGACAGTCCACCCAAAGGGTAGCGTCCTGGTGCCCAGAAAC GATGTGAACGGCAGGAGAATCCCTGACAGCATTTACCGAGAACTTACCAGGTACTTCAACCGCGGCGAC TGTAGAATCTCTGATGAGGCAAAGTCTTATCTGGATAAGGTGAAGACTAAGAAGGCAGATCATGACATT GTGAAAGACCGCCGCTTTACTGTCGACAAAATGATGTTTCACGTGCCTATCGCAATGAATTTTAAGGCAA TCTCAAAACCGAATCTGAACAAGAAGGTGATAGATGGCATTATCGATGACCAGGACCTCAAGATCATCG GAATCGACAGAGGTGAGCGAAACCTGATATACGTCACAATGGTAGATCGGAAGGGTAATATTCTGTACC AGGATTCACTAAACATCCTCAATGGATATGACTATCGAAAAGCTCTCGATGTCAGGGAATACGACAACA AGGAGGCGCGACGGAATTGGACAAAGGTGGAAGGCATACGGAAGATGAAGGAAGGCTATCTGTCACTA GCTGTCTCCAAATTGGCTGATATGATTATAGAGAACAACGCCATTATCGTGATGGAAGATCTCAACCAT GGATTCAAGGCAGGAAGAAGTAAAATTGAGAAGCAGGTGTATCAGAAGTTCGAAAGCATGCTTATTAA TAAGTTGGGTTATATGGTCTTAAAGGACAAGTCTATCGATCAGAGCGGCGGCGCACTCCATGGGTATCA GCTGGCTAACCATGTCACCACACTAGCATCCGTAGGCAAACAGTGTGGCGTGATTTTCTACATTCCTGCT GCGTTCACTTCTAAGATCGATCCTACCACGGGATTCGCAGACCTGTTCGCACTGAGCAATGTTAAAAACG TGGCCTCCATGAGGGAGTTCTTTAGCAAAATGAAAAGCGTGATTTATGACAAGGCCGAGGGCAAGTTCG CTTTCACATTTGACTACCTGGACTACAATGTGAAATCAGAGTGCGGGAGAACCCTGTGGACCGTATACA CGGTAGGGGAAAGATTCACTTACAGTCGAGTTAATCGGGAGTATGTCCGTAAAGTGCCAACTGACATCA TCTACGATGCCCTTCAGAAGGCTGGCATAAGTGTTGAGGGGGATCTAAGGGACAGGATCGCTGAATCGG ATGGCGATACTCTCAAATCAATCTTCTACGCCTTCAAGTATGCCCTCGACATGAGGGTAGAGAACCGGG AGGAGGACTATATACAGTCTCCCGTGAAGAATGCGTCGGGAGAGTTCTTCTGCTCAAAAAACGCCGGGA AATCTTTGCCGCAGGATTCTGATGCAAATGGGGCTTATAACATTGCTCTCAAAGGCATCCTGCAGCTGCG CATGCTATCTGAACAATATGACCCAAACGCTGAAAGCATTAGATTGCCATTGATCACCAATAAGGCTTG GCTGACTTTCATGCAGAGCGGTATGAAGACATGGAAAAACTAA SEQ ATGGATTCCCTTAAGGACTTCACAAATCTTTACCCCGTGAGTAAAACCCTGAGATTTGAACTCAAGCCCG ID TGGGAAAGACTCTCGAGAATATCGAGAAGGCCGGGATTTTGAAGGAAGACGAGCATCGGGCGGAAAGT NO: TACAGACGGGTGAAGAAGATTATAGATACTTATCACAAGGTCTTTATAGACAGCTCTTTAGAGAACATG 152 GCAAAGATGGGCATCGAGAACGAAATCAAGGCCATGCTGCAGTCCTTCTGCGAGCTGTATAAAAAGGAT CATCGGACCGAAGGCGAAGACAAGGCGCTGGATAAGATCAGGGCAGTGCTGCGCGGCCTCATTGTGGG TGCCTTCACTGGGGTGTGCGGGCGGAGAGAGAACACTGTGCAGAATGAGAAATACGAGAGTTTGTTCAA AGAGAAACTCATCAAGGAAATCCTGCCCGACTTCGTCTTAAGCACAGAAGCCGAATCTCTCCCATTTTCT GTCGAGGAGGCCACGCGTTCCCTTAAAGAGTTCGACAGTTTCACTTCATACTTTGCCGGATTTTATGAAA ACCGTAAAAATATATACTCCACTAAACCACAGTCAACTGCAATAGCTTACAGGTTAATCCACGAAAACC TGCCAAAATTCATCGACAATATACTCGTCTTTCAAAAAATCAAGGAACCAATCGCGAAGGAACTTGAAC ACATCCGGGCTGACTTTAGTGCGGGAGGATACATCAAAAAAGACGAGCGCCTGGAGGATATATTTTCAC TAAATTATTATATTCATGTACTGAGCCAGGCTGGCATAGAAAAGTACAACGCTCTAATTGGGAAAATCG TGACAGAAGGTGACGGGGAAATGAAAGGGCTAAACGAACATATTAACTTATATAACCAACAGCGGGGT CGAGAAGATCGTCTGCCCCTGTTCAGACCTCTGTATAAGCAAATACTCTCCGACAGAGAGCAGCTATCA TATCTGCCCGAGTCCTTTGAGAAAGATGAAGAGCTGCTCCGGGCGCTCAAGGAGTTCTATGATCATATA GCCGAGGACATTTTGGGCAGAACTCAGCAACTCATGACGTCTATTTCTGAATATGATCTGTCTCGTATCT ATGTCAGGAATGATAGCCAGCTGACCGATATATCCAAGAAGATGCTGGGGGACTGGAACGCCATTTATA TGGCGAGGGAGCGAGCATACGATCACGAGCAGGCACCCAAGAGAATCACAGCCAAATATGAGAGAGAC CGCATTAAGGCGCTGAAGGGCGAAGAAAGTATCAGTCTGGCCAATCTGAACTCCTGCATAGCTTTCCTT GATAACGTGAGGGATTGCAGAGTTGATACTTACCTGAGTACCCTGGGCCAGAAGGAAGGGCCTCACGGC CTCTCTAATCTAGTGGAGAATGTATTTGCCTCCTACCACGAAGCTGAGCAGCTGCTGTCATTTCCGTACC CAGAGGAAAATAATTTAATACAGGATAAGGACAACGTAGTGCTTATCAAAAATCTACTGGATAACATTT CCGACCTCCAGCGCTTTCTCAAACCACTTTGGGGGATGGGCGACGAGCCTGATAAGGATGAGCGCTTTT ACGGCGAGTACAACTACATCAGGGGCGCCTTGGACCAGGTGATTCCCCTCTATAATAAAGTCAGGAATT ACCTGACCCGAAAGCCATACAGTACAAGAAAGGTGAAATTAAATTTCGGCAATAGTCAGCTGCTGTCTG GTTGGGACCGAAATAAGGAGAAAGACAACAGCTGCGTAATTCTCAGAAAAGGACAGAACTTTTATTTGG CCATCATGAATAACAGACACAAGAGATCTTTCGAGAACAAAGTGCTCCCTGAGTATAAGGAGGGGGAA CCCTACTTCGAGAAGATGGACTATAAATTCCTTCCTGATCCAAATAAAATGCTGCCTAAAGTATTTCTGT CAAAAAAAGGTATAGAAATCTACAAACCTTCACCTAAGCTACTTGAACAGTATGGCCACGGCACCCATA AAAAAGGGGACACGTTCAGCATGGACGACCTACACGAACTGATTGACTTCTTTAAGCACAGCATAGAAG CTCATGAGGACTGGAAACAGTTCGGATTCAAATTCTCAGATACCGCGACCTACGAAAACGTGTCTAGTT TTTACCGGGAAGTCGAGGACCAGGGCTACAAGCTCAGCTTCAGAAAAGTTAGCGAATCTTACGTCTACT CCCTTATAGATCAAGGTAAGCTGTATCTCTTTCAAATCTACAACAAGGACTTTTCCCCATGTAGCAAGGG CACCCCCAATCTGCACACTCTCTACTGGCGGATGCTGTTCGACGAGCGTAACCTGGCAGACGTGATCTAC AAATTAGATGGTAAAGCTGAGATCTTCTTTCGTGAAAAGAGCCTAAAGAACGATCACCCCACTCACCCC GCCGGAAAGCCCATTAAGAAGAAAAGTAGGCAGAAGAAAGGAGAAGAATCGCTATTTGAGTACGACCT CGTCAAGGATCGGCATTATACAATGGATAAGTTCCAGTTCCATGTGCCAATAACTATGAATTTCAAGTGC AGTGCTGGCAGTAAGGTGAATGACATGGTAAACGCTCATATCCGGGAGGCAAAGGACATGCATGTTATT GGAATTGATAGGGGTGAGCGTAATCTCCTCTACATCTGTGTTATTGACTCCCGCGGCACAATCCTCGATC AGATTTCCTTGAATACAATTAATGATATAGACTACCATGACTTGCTTGAGTCTCGCGACAAAGATAGACA GCAGGAGAGAAGAAATTGGCAGACCATCGAAGGCATCAAGGAACTCAAGCAAGGCTACCTTTCTCAGG CAGTGCATCGAATAGCCGAGCTGATGGTGGCTTATAAAGCCGTCGTGGCACTAGAAGACCTAAATATGG GATTTAAACGAGGCAGGCAGAAGGTGGAATCATCCGTATACCAGCAGTTCGAAAAACAGTTGATAGAC AAACTCAATTACCTTGTAGACAAGAAGAAGCGGCCTGAGGACATAGGGGGCCTGCTTAGAGCGTATCAA TTTACAGCCCCATTCAAGTCTTTCAAAGAAATGGGTAAACAGAACGGTTTTCTGTTTTACATCCCAGCGT GGAACACCAGCAATATAGATCCAACCACTGGCTTCGTCAATCTGTTTCATGCTCAGTATGAAAATGTGG ACAAGGCCAAATCCTTCTTTCAGAAATTTGACAGCATCTCCTATAACCCAAAGAAAGACTGGTTTGAATT CGCCTTTGACTATAAGAATTTCACTAAGAAGGCCGAGGGATCAAGAAGCATGTGGATATTGTGCACGCA TGGCTCACGTATAAAGAACTTTAGAAACTCGCAAAAAAACGGGCAGTGGGACTCAGAAGAATTCGCACT CACCGAGGCTTTCAAATCCCTCTTCGTCCGGTATGAGATCGATTACACCGCCGATCTGAAGACGGCAATC GTCGACGAGAAACAGAAAGACTTCTTTGTAGATCTACTTAAGCTCTTTAAGCTAACCGTTCAGATGCGA AACAGTTGGAAAGAAAAGGATCTCGACTATCTCATTAGTCCAGTGGCTGGCGCGGATGGTAGATTTTTC GATACCCGGGAAGGTAACAAGTCCCTTCCCAAAGACGCCGACGCGAATGGTGCCTACAATATTGCACTA AAGGGGCTCTGGGCGCTGCGGCAAATTAGACAGACATCTGAAGGGGGCAAGCTTAAGCTGGCTATTTCT AATAAAGAGTGGTTGCAGTTTGTGCAGGAAAGGAGTTATGAGAAGGACTAG SEQ ATGAACAACGGCACCAACAACTTCCAGAACTTCATCGGCATATCGTCTCTGCAGAAAACACTTAGGAAT ID GCCCTGATTCCAACTGAGACAACACAGCAGTTTATTGTGAAGAATGGGATCATCAAAGAGGACGAATTG NO: CGCGGGGAGAATAGGCAGATCCTGAAGGACATCATGGACGATTACTACAGGGGTTTTATCTCCGAAACG 153 CTGAGCTCGATTGACGATATTGACTGGACGTCCCTCTTTGAGAAGATGGAAATCCAACTTAAAAATGGC GATAATAAAGATACCCTGATAAAGGAACAAACCGAATATAGAAAGGCTATACACAAAAAATTCGCAAA TGACGACCGCTTTAAGAACATGTTTTCTGCAAAACTGATTAGCGATATTCTGCCCGAGTTTGTGATTCAC AATAATAACTATTCCGCTTCGGAGAAGGAGGAAAAGACTCAGGTGATTAAACTGTTTTCTCGGTTCGCC ACTTCTTTCAAAGATTATTTCAAAAATCGCGCCAACTGTTTTTCCGCTGACGACATCTCCTCCTCTTCCTG CCACCGGATCGTAAACGACAATGCCGAGATCTTTTTTAGTAACGCCCTTGTGTATCGGAGGATAGTGAA GAGCCTGTCCAATGATGACATAAACAAAATTTCTGGCGATATGAAGGATAGCCTCAAAGAGATGAGCCT TGAAGAAATTTACTCCTACGAGAAGTATGGGGAGTTCATCACCCAGGAGGGGATTTCCTTCTATAATGA CATCTGTGGCAAGGTGAACAGCTTCATGAACCTGTACTGCCAGAAGAATAAGGAAAACAAAAATCTGTA CAAGCTTCAGAAGTTACATAAGCAGATCCTGTGTATCGCGGATACCTCATATGAGGTTCCTTATAAGTTC GAGAGTGATGAAGAAGTGTACCAGTCTGTAAATGGATTCTTAGACAATATTTCGTCCAAACATATAGTG GAGAGACTGAGAAAGATCGGGGACAATTACAATGGGTACAATCTCGACAAGATTTATATCGTGTCGAAG TTTTACGAATCTGTGAGCCAGAAAACATACAGGGATTGGGAAACCATTAATACCGCGCTTGAAATTCAC TACAATAATATTCTGCCTGGCAACGGAAAAAGCAAGGCCGATAAGGTAAAAAAGGCAGTCAAAAATGA CCTTCAGAAAAGTATCACCGAAATCAATGAGTTGGTGAGCAACTACAAATTGTGTTCAGACGATAATAT TAAAGCGGAAACGTACATACATGAAATTAGCCATATTCTGAATAACTTTGAGGCGCAGGAACTTAAGTA CAACCCTGAAATTCATCTCGTCGAAAGCGAATTGAAGGCCTCTGAATTGAAAAACGTTCTTGACGTGAT AATGAACGCTTTCCATTGGTGCTCTGTGTTTATGACTGAAGAGCTGGTTGATAAGGACAACAACTTTTAT GCTGAACTTGAGGAAATCTACGACGAGATCTACCCTGTGATTAGCTTGTATAACCTCGTCAGAAACTAC GTTACCCAGAAGCCGTACAGCACGAAAAAAATAAAGCTGAACTTTGGTATTCCGACTCTCGCCGATGGA TGGAGCAAGTCGAAGGAATATTCCAACAATGCCATCATTCTTATGCGAGACAATCTGTATTACCTCGGC ATCTTTAACGCCAAAAACAAGCCGGATAAGAAAATCATTGAAGGGAATACGAGCGAGAATAAGGGCGA CTATAAGAAAATGATCTACAACTTACTGCCAGGTCCCAATAAAATGATTCCTAAGGTGTTTCTGTCATCG AAAACAGGTGTAGAAACATATAAGCCCAGCGCATACATCCTGGAAGGCTACAAGCAAAACAAACACAT CAAAAGCAGCAAGGACTTTGATATCACATTCTGCCACGATCTAATCGACTACTTCAAAAATTGCATCGCC ATTCACCCTGAGTGGAAGAACTTCGGCTTTGACTTCTCCGACACCAGTACCTACGAAGACATTTCTGGAT TCTACCGTGAGGTTGAGCTGCAGGGTTATAAAATTGACTGGACATACATCAGTGAAAAAGACATCGATC TACTGCAGGAGAAGGGGCAGCTCTATCTCTTCCAGATTTATAATAAGGATTTCAGCAAGAAGTCCACTG GAAACGACAATCTGCATACAATGTATCTTAAGAACTTGTTTAGCGAAGAGAATTTGAAAGATATCGTTC TAAAGTTAAACGGGGAAGCCGAGATTTTCTTTCGAAAGTCTTCCATTAAGAATCCAATTATTCACAAGA AGGGCAGTATCCTGGTCAACAGAACCTATGAGGCCGAGGAAAAGGACCAGTTCGGTAATATACAAATT GTGCGCAAGAACATCCCCGAGAACATTTACCAGGAGCTCTATAAATACTTCAACGACAAAAGCGATAAG GAGCTTTCCGACGAGGCTGCCAAGCTGAAAAACGTGGTGGGACACCATGAAGCAGCCACCAACATCGTC AAAGATTATCGTTATACATATGACAAATATTTTCTGCACATGCCTATTACAATAAACTTTAAGGCAAACA AGACCGGGTTCATCAATGACCGGATACTCCAGTACATCGCAAAAGAGAAGGACCTGCATGTGATCGGCA TCGACCGCGGTGAAAGAAATCTCATTTACGTCAGCGTTATCGACACTTGTGGAAACATTGTGGAGCAGA AGTCCTTCAACATTGTTAACGGCTATGACTATCAGATCAAGCTCAAACAGCAGGAAGGTGCTCGTCAGA TTGCGAGGAAAGAATGGAAAGAGATCGGCAAGATCAAGGAGATCAAAGAAGGGTATCTGAGCTTGGTC ATTCACGAGATCTCCAAAATGGTCATCAAGTACAACGCTATTATCGCGATGGAAGACCTCTCTTACGGCT TTAAGAAGGGGCGCTTTAAAGTGGAGCGCCAGGTCTATCAGAAGTTCGAGACTATGCTTATCAATAAGC TGAATTACTTGGTCTTTAAGGATATCAGTATCACCGAGAACGGAGGACTGCTGAAAGGTTACCAGCTCA CATATATTCCCGATAAGCTCAAGAATGTGGGCCACCAATGCGGTTGTATTTTTTACGTTCCAGCTGCCTA CACATCTAAGATCGATCCTACCACCGGATTCGTCAATATATTTAAATTTAAAGATCTAACCGTTGATGCC AAGCGTGAGTTTATTAAGAAATTTGATTCAATCAGGTACGACAGCGAAAAGAACCTCTTCTGTTTCACTT TCGACTACAACAACTTCATCACACAAAATACTGTGATGAGCAAGTCATCATGGAGCGTTTATACTTATGG TGTAAGGATAAAAAGGCGCTTTGTTAATGGAAGGTTTTCCAATGAAAGCGATACAATAGACATCACAAA AGACATGGAGAAGACACTGGAGATGACAGATATTAATTGGAGGGACGGGCATGACCTTAGACAGGACA TCATCGACTACGAAATCGTCCAACACATTTTTGAGATATTCAGACTCACTGTCCAGATGCGAAACAGCCT GTCGGAACTCGAAGACCGGGACTACGATAGACTGATCTCCCCGGTGTTAAACGAAAATAATATTTTCTA CGATTCTGCTAAGGCAGGAGACGCTCTTCCTAAAGATGCGGACGCCAATGGCGCTTACTGTATAGCGTT GAAGGGATTGTATGAGATTAAACAGATCACTGAGAATTGGAAAGAAGACGGTAAATTCTCCAGAGACA AGCTGAAAATCTCCAACAAAGACTGGTTTGATTTTATTCAAAATAAGCGCTACCTGTAA SEQ ATGACAAACAAATTTACTAATCAGTACAGCCTGTCAAAGACCCTCCGCTTCGAACTGATTCCACAAGGG ID AAGACCCTTGAATTCATCCAGGAAAAGGGTTTATTATCCCAGGATAAACAACGCGCAGAAAGCTATCAA NO: GAGATGAAGAAGACGATCGATAAATTTCATAAGTATTTCATAGATTTAGCCCTGAGCAACGCTAAATTG 154 ACCCACCTGGAAACCTATTTGGAGCTGTACAACAAGTCAGCCGAGACAAAGAAAGAGCAGAAGTTTAA GGACGACCTGAAAAAAGTACAGGACAATTTGCGAAAAGAGATCGTCAAGTCTTTTTCCGACGGAGACGC CAAGTCAATATTTGCCATCCTGGACAAAAAGGAACTCATCACTGTGGAGTTGGAGAAGTGGTTTGAGAA TAATGAGCAGAAGGACATCTATTTTGACGAAAAGTTCAAGACATTTACTACTTACTTCACCGGATTTCAC CAAAACCGGAAGAACATGTACTCTGTTGAGCCGAACTCAACCGCCATCGCCTACCGCCTTATTCACGAA AATCTGCCAAAGTTTCTCGAGAATGCTAAAGCCTTTGAGAAAATTAAGCAGGTCGAGTCGCTCCAGGTG AACTTTCGAGAGCTGATGGGTGAATTCGGGGACGAGGGCCTGATTTTCGTGAATGAACTCGAAGAGATG TTTCAGATCAACTACTATAATGATGTACTCTCACAGAACGGGATCACTATCTACAACAGCATTATCTCTG GATTCACTAAGAACGATATCAAGTATAAAGGGCTGAATGAATACATCAACAATTATAATCAGACTAAGG ACAAAAAGGACAGGCTGCCTAAATTGAAACAGCTGTATAAGCAGATCCTCAGTGATAGAATTAGCTTGT CATTTCTCCCAGATGCCTTCACTGACGGAAAGCAGGTGCTTAAGGCGATATTCGATTTCTATAAGATCAA CCTCCTCTCTTATACAATCGAGGGCCAGGAGGAGTCACAGAACCTCCTGCTCCTGATTCGACAAACTATT GAAAATCTGTCCTCTTTCGATACGCAGAAGATATACCTGAAAAATGACACCCATCTCACTACAATATCCC AACAGGTATTCGGAGATTTCTCCGTCTTCAGTACAGCCCTGAATTACTGGTACGAGACAAAGGTGAACC CTAAGTTCGAAACAGAGTACAGCAAGGCGAACGAAAAGAAGAGGGAGATCCTGGACAAAGCCAAAGC CGTTTTCACCAAGCAAGATTACTTTAGCATCGCATTTCTGCAGGAAGTCCTGTCTGAGTACATACTGACA CTCGATCACACAAGCGACATAGTTAAGAAGCACTCTTCCAATTGTATCGCGGACTACTTCAAAAATCATT TTGTCGCGAAAAAGGAGAACGAGACAGATAAGACCTTCGATTTTATCGCGAATATTACCGCAAAGTATC AATGCATTCAGGGTATCTTGGAGAACGCCGACCAGTACGAAGACGAGCTTAAACAGGATCAGAAGCTC ATCGACAACCTAAAGTTCTTTTTGGACGCTATACTGGAACTCCTTCATTTTATTAAGCCACTACATCTGA AGAGTGAGTCTATCACTGAGAAGGACACTGCTTTTTACGACGTTTTCGAGAATTACTACGAAGCACTGTC TCTGCTAACCCCTCTGTATAACATGGTGAGAAACTATGTGACACAGAAACCTTATAGTACCGAGAAGAT TAAGTTGAACTTCGAGAACGCACAATTGCTGAATGGGTGGGATGCAAACAAAGAGGGTGATTACCTCAC AACAATCCTCAAGAAAGATGGCAATTACTTCCTGGCCATTATGGATAAAAAACATAACAAGGCATTTCA GAAATTTCCCGAGGGGAAGGAAAATTATGAAAAGATGGTATACAAGTTGCTGCCCGGGGTGAACAAAA TGCTCCCGAAGGTGTTTTTCTCGAATAAGAATATCGCGTACTTTAACCCGTCCAAGGAACTGTTGGAAAA TTATAAAAAGGAAACACACAAGAAGGGGGACACTTTTAATTTGGAGCACTGCCACACACTCATTGACTT CTTTAAAGATAGTCTCAACAAACATGAGGATTGGAAATATTTTGACTTTCAGTTTAGCGAGACCAAGTCT TATCAGGATCTGTCGGGATTTTATAGGGAAGTTGAGCACCAGGGTTACAAGATAAATTTCAAGAACATC GATAGCGAGTACATTGACGGACTGGTGAACGAAGGGAAGCTGTTCCTGTTTCAGATTTACAGCAAAGAT TTCTCTCCTTTCTCAAAAGGCAAGCCGAACATGCATACCCTGTATTGGAAGGCCCTGTTCGAGGAGCAAA ACCTTCAGAATGTGATTTACAAGCTGAACGGTCAGGCCGAGATTTTTTTTAGGAAGGCCTCTATCAAGCC CAAAAACATCATTCTGCACAAGAAAAAGATAAAGATCGCCAAAAAACACTTCATTGATAAAAAGACAA AGACTTCTGAGATCGTACCTGTTCAGACAATCAAGAATCTCAACATGTATTATCAGGGGAAGATTAGCG AGAAAGAGCTGACACAGGACGATTTGAGGTACATCGACAACTTCTCTATCTTTAACGAGAAGAACAAGA CAATCGATATCATCAAGGACAAGCGGTTTACCGTCGATAAATTCCAGTTCCATGTGCCTATCACGATGAA TTTCAAGGCCACCGGTGGGAGTTATATCAACCAGACTGTGCTGGAGTATCTGCAGAACAACCCCGAAGT AAAAATTATTGGCCTGGACAGAGGAGAGCGGCATCTGGTGTACTTGACCCTCATCGATCAGCAGGGAAA TATCCTGAAACAAGAATCTCTGAATACTATTACGGACTCCAAAATCAGCACACCTTACCACAAGCTGCTT GATAATAAAGAGAATGAGAGGGACTTGGCCCGCAAAAATTGGGGCACCGTCGAGAATATTAAGGAATT GAAAGAAGGATACATCTCACAGGTGGTTCACAAAATCGCAACCCTGATGTTAGAAGAGAACGCTATTGT GGTGATGGAGGACTTAAACTTCGGATTTAAAAGAGGAAGATTTAAAGTCGAGAAACAGATTTATCAGAA ACTGGAAAAAATGCTCATTGACAAATTAAATTACCTGGTGCTGAAAGATAAACAGCCACAGGAGCTGGG TGGCCTGTATAATGCTCTGCAGCTGACCAACAAGTTCGAGTCGTTTCAGAAAATGGGCAAGCAGTCAGG CTTCCTTTTTTACGTGCCCGCTTGGAACACCTCAAAAATCGACCCTACAACAGGCTTTGTGAATTATTTCT ATACCAAGTATGAAAACGTGGACAAGGCAAAGGCCTTTTTCGAGAAGTTTGAAGCAATCAGGTTCAATG CCGAGAAAAAATACTTTGAGTTCGAGGTCAAAAAATATAGCGACTTCAACCCTAAGGCCGAAGGCACGC AACAAGCCTGGACAATATGCACGTATGGGGAGAGAATTGAGACTAAGCGGCAGAAGGATCAGAATAAC AAATTCGTGAGCACACCGATTAACCTGACAGAGAAGATAGAGGACTTCCTCGGCAAGAATCAGATCGTG TACGGCGACGGCAATTGCATCAAGTCACAAATTGCATCTAAAGATGACAAAGCATTCTTCGAAACACTG CTGTATTGGTTCAAGATGACACTCCAGATGCGAAATAGCGAAACAAGAACAGATATTGACTACCTCATC AGCCCTGTGATGAATGATAACGGCACGTTTTACAATTCCCGGGACTATGAAAAATTAGAGAACCCGACA CTGCCAAAAGACGCCGACGCAAATGGTGCATATCACATCGCAAAGAAAGGTTTGATGCTGTTGAACAAA ATTGATCAGGCTGATCTGACAAAAAAGGTCGATCTGAGTATCAGTAACCGCGACTGGTTGCAGTTTGTC CAGAAGAACAAATAA SEQ ATGGAACAAGAGTACTATCTGGGCCTGGACATGGGCACCGGGAGTGTCGGATGGGCAGTCACCGACTCA ID GAGTACCACGTCCTCAGAAAGCACGGTAAGGCACTTTGGGGAGTGCGACTCTTCGAGTCCGCTAGTACT NO: GCTGAAGAGAGGAGGATGTTTCGAACTTCCAGGCGCAGGCTGGATCGGCGAAACTGGAGAATAGAGAT 155 TCTCCAGGAGATATTTGCTGAAGAGATTTCAAAGAAGGATCCTGGTTTTTTCCTGCGCATGAAAGAATCT AAGTATTACCCCGAAGATAAACGCGACATCAACGGCAATTGTCCTGAACTGCCCTATGCTCTGTTTGTCG ACGACGATTTCACCGACAAAGATTACCACAAGAAATTCCCCACCATATACCACCTGAGAAAGATGTTGA TGAACACCGAGGAGACACCCGACATACGTCTGGTTTACCTGGCTATCCATCATATGATGAAGCACCGCG GGCATTTCCTGCTGTCTGGAGACATCAATGAGATAAAGGAATTTGGTACTACGTTCTCCAAGTTGTTAGA AAACATTAAGAATGAAGAGTTGGACTGGAATCTTGAACTGGGAAAGGAAGAGTATGCAGTTGTAGAGT CGATTTTGAAAGATAACATGTTAAACCGGTCAACTAAGAAAACCAGGTTAATTAAGGCACTAAAGGCCA AATCGATATGCGAGAAGGCTGTGCTAAATCTGCTGGCTGGAGGCACCGTGAAACTGTCTGATATTTTCG GCCTGGAAGAGCTCAATGAAACCGAGCGGCCTAAAATTTCTTTCGCCGATAACGGATACGATGACTATA TTGGGGAGGTGGAAAACGAGCTCGGAGAACAATTCTACATTATTGAAACCGCTAAGGCAGTCTATGACT GGGCCGTGCTCGTCGAGATTTTAGGCAAGTACACCAGCATTAGCGAAGCAAAGGTGGCTACCTATGAAA AGCACAAATCTGACCTCCAGTTTCTGAAAAAGATTGTGCGCAAATACTTAACAAAAGAAGAGTACAAGG ACATCTTTGTGAGCACATCAGATAAGCTCAAGAATTACTCAGCATACATTGGAATGACAAAGATTAACG GGAAGAAGGTGGATCTCCAAAGCAAACGTTGTTCAAAGGAGGAGTTTTACGATTTCATAAAGAAGAAC GTGCTGAAGAAACTGGAGGGACAACCGGAGTACGAGTATTTAAAGGAGGAGCTCGAGCGAGAAACTTT CCTGCCCAAGCAAGTGAACAGAGACAATGGTGTCATTCCTTACCAGATTCACTTATATGAGCTGAAGAA AATCCTGGGGAACTTGAGAGACAAGATAGACCTCATCAAGGAAAATGAAGATAAGTTGGTCCAGTTGTT CGAATTCAGAATCCCATATTACGTCGGCCCGCTCAATAAGATCGACGACGGCAAGGAAGGCAAATTCAC TTGGGCGGTGCGAAAAAGCAACGAAAAAATATACCCATGGAACTTTGAGAACGTCGTTGACATCGAGG CCAGCGCCGAGAAATTTATAAGACGCATGACTAATAAGTGTACTTACCTCATGGGCGAGGATGTTCTGC CCAAGGACAGCCTGCTGTATTCCAAGTACATGGTGCTTAACGAGCTGAATAATGTAAAGTTAGATGGTG AGAAGCTCAGCGTGGAGCTTAAACAGAGGCTGTACACTGATGTGTTTTGCAAGTATCGGAAAGTTACCG TTAAGAAGATAAAGAATTACCTGAAATGCGAAGGGATCATTTCCGGCAACGTGGAAATTACCGGAATCG ACGGCGATTTTAAGGCGTCGTTGACCGCTTATCATGATTTCAAGGAGATTTTAACCGGCACGGAGCTCGC GAAGAAAGACAAGGAGAACATAATCACGAATATAGTTCTGTTTGGGGACGATAAAAAACTTCTTAAAA AACGACTCAATCGACTGTATCCGCAGATTACCCCCAACCAGCTGAAGAAGATTTGCGCTCTGAGCTATA CCGGGTGGGGCCGGTTCTCTAAGAAATTCCTCGAGGAGATCACAGCACCAGACCCAGAGACTGGTGAGG TGTGGAATATTATTACAGCTCTGTGGGAATCCAATAATAACCTTATGCAATTGTTGAGCAATGAATATAG GTTCATGGAGGAAGTGGAAACCTACAATATGGGCAAGCAGACAAAGACCCTATCTTACGAGACCGTTGA GAATATGTATGTCTCCCCTTCAGTGAAACGGCAAATCTGGCAAACTTTGAAGATCGTGAAGGAGCTCGA AAAGGTGATGAAAGAGAGCCCGAAGAGGGTTTTTATTGAAATGGCCAGAGAGAAACAGGAGAGCAAGA GAACAGAGTCTAGGAAGAAGCAGCTAATCGATTTGTATAAAGCCTGCAAGAACGAGGAAAAAGACTGG GTCAAGGAGCTAGGCGATCAGGAAGAACAGAAGTTGCGCTCTGATAAGCTGTACTTATATTATACCCAG AAAGGACGGTGCATGTACTCAGGTGAGGTCATTGAGCTGAAAGATCTGTGGGACAATACTAAGTATGAT ATTGATCACATCTACCCTCAGTCAAAAACTATGGACGACTCCCTCAACAACAGGGTGTTGGTTAAGAAG AAATACAATGCTACAAAGTCCGATAAATACCCTCTTAACGAAAACATCCGGCACGAAAGAAAGGGCTTC TGGAAGTCCCTGCTGGATGGGGGTTTTATCAGTAAAGAAAAGTATGAGAGGCTGATCCGAAATACCGAG CTCTCCCCCGAGGAACTGGCTGGCTTTATCGAAAGGCAGATCGTAGAGACTAGGCAATCTACAAAGGCA GTCGCTGAGATCCTGAAGCAAGTGTTTCCTGAGTCAGAAATCGTGTACGTCAAAGCTGGCACAGTGTCA CGGTTCCGAAAGGACTTTGAGTTGTTAAAAGTTCGGGAGGTGAATGACCTGCACCACGCTAAAGACGCC TATCTGAATATCGTTGTGGGGAACTCCTATTATGTTAAGTTTACTAAGAATGCGTCCTGGTTTATTAAGG AGAACCCGGGGCGCACCTATAACCTGAAGAAGATGTTCACCTCCGGCTGGAACATAGAACGGAACGGA GAAGTCGCGTGGGAGGTGGGTAAGAAAGGGACCATTGTGACCGTCAAACAGATTATGAACAAAAACAA CATATTGGTAACTCGCCAGGTGCATGAGGCCAAAGGGGGCCTCTTTGATCAGCAGATTATGAAAAAGGG CAAAGGACAGATCGCAATCAAGGAAACCGACGAGCGCCTGGCATCCATTGAGAAGTACGGAGGCTACA ACAAGGCGGCAGGTGCGTACTTCATGCTCGTCGAGTCCAAAGATAAGAAAGGCAAAACTATTAGAACA ATCGAGTTCATCCCTCTATATTTGAAAAATAAGATCGAAAGTGACGAAAGCATCGCCCTTAACTTCTTGG AGAAGGGCCGGGGCTTAAAGGAACCAAAGATTCTGCTCAAGAAGATCAAGATCGACACACTCTTCGAT GTGGATGGTTTTAAGATGTGGCTGTCAGGCAGGACAGGGGATCGCTTGCTGTTCAAATGCGCAAATCAG TTGATTCTGGACGAAAAGATCATTGTGACGATGAAGAAGATCGTTAAATTCATTCAGCGGAGACAGGAA AACAGAGAACTGAAACTCTCCGATAAGGATGGAATTGACAATGAAGTCCTCATGGAGATTTACAATACC TTTGTGGACAAGCTTGAGAACACAGTCTATCGGATCCGACTGTCCGAACAGGCAAAGACTCTGATCGAC AAACAGAAAGAATTCGAAAGACTAAGCTTAGAGGACAAAAGTTCAACTCTCTTTGAAATTCTCCACATC TTCCAATGTCAAAGTAGTGCAGCCAACTTGAAGATGATCGGGGGTCCCGGCAAGGCTGGAATCTTAGTC ATGAACAACAACATCTCCAAATGTAACAAAATCTCCATCATAAACCAGTCTCCCACCGGCATTTTCGAG AACGAAATTGATTTACTCAAG SEQ ATGAAATCTTTCGATTCTTTCACCAACCTCTACTCCCTTAGCAAAACCCTTAAGTTTGAAATGAGGCCGG ID TGGGGAATACACAGAAGATGCTTGACAATGCTGGCGTCTTTGAAAAGGACAAATTAATCCAGAAGAAGT NO: ATGGTAAAACAAAGCCATATTTTGACCGATTGCATCGGGAATTCATTGAAGAGGCTCTTACAGGAGTAG 156 AATTGATCGGACTGGACGAGAACTTCCGTACCTTAGTAGACTGGCAGAAGGACAAGAAGAACAACGTG GCAATGAAGGCCTATGAGAACTCACTCCAGCGCCTTAGAACCGAGATCGGAAAGATCTTTAATCTTAAG GCGGAAGATTGGGTAAAAAATAAGTACCCGATCCTGGGACTGAAAAACAAAAACACAGACATCCTGTT TGAAGAAGCCGTCTTTGGTATCTTGAAGGCCAGGTATGGAGAGGAGAAAGACACGTTTATAGAGGTAGA GGAGATTGATAAAACAGGCAAGAGTAAGATTAATCAGATCAGTATCTTTGATTCTTGGAAGGGGTTCAC AGGCTACTTTAAGAAGTTTTTCGAAACCAGGAAAAATTTCTATAAGAACGATGGCACCTCCACAGCTAT CGCGACACGCATCATAGATCAGAATCTGAAACGGTTCATTGATAATCTGAGCATTGTTGAATCCGTGCG CCAGAAGGTCGACCTAGCTGAGACTGAGAAGTCTTTCTCTATATCACTCTCCCAGTTCTTCTCAATAGAT TTTTATAATAAGTGCCTTCTGCAAGATGGCATAGACTACTATAACAAGATCATCGGCGGCGAAACTCTCA AAAACGGTGAAAAGCTCATTGGCCTGAATGAGCTCATCAACCAATATAGACAAAATAACAAGGATCAG AAAATCCCATTCTTTAAGCTGCTAGATAAACAGATCCTATCAGAAAAAATCCTGTTCCTCGACGAAATCA AAAACGACACCGAACTCATCGAGGCTCTCTCGCAGTTTGCCAAGACGGCTGAGGAGAAGACGAAGATT GTGAAAAAGCTGTTTGCAGACTTTGTGGAGAACAACTCTAAATACGATTTGGCTCAGATTTATATCTCCC AGGAAGCATTTAACACAATCTCCAATAAGTGGACTAGCGAGACTGAAACCTTCGCCAAATACCTGTTCG AGGCCATGAAAAGCGGCAAGCTCGCCAAATACGAGAAGAAGGACAATTCCTATAAGTTTCCCGATTTCA TCGCATTATCTCAGATGAAGTCCGCGCTACTTAGCATTAGCCTGGAAGGCCATTTTTGGAAGGAGAAAT ACTATAAGATTTCCAAATTCCAAGAAAAGACCAATTGGGAGCAGTTCTTGGCTATTTTTCTATACGAGTT CAACTCTTTGTTCAGTGACAAGATCAACACTAAGGACGGTGAGACCAAACAAGTGGGGTACTACCTCTT CGCCAAAGATCTTCATAACCTGATACTGTCCGAACAGATCGACATACCCAAGGATTCAAAGGTGACCAT CAAGGATTTTGCGGATTCGGTATTGACGATCTATCAGATGGCGAAGTATTTCGCTGTCGAGAAAAAGCG GGCATGGCTGGCCGAATACGAGTTGGACTCCTTCTATACTCAACCCGATACAGGGTACCTGCAGTTTTAC GATAATGCATACGAGGATATAGTCCAGGTGTACAATAAACTCAGGAACTACCTCACTAAGAAACCATAC TCCGAAGAAAAATGGAAACTTAATTTTGAGAATAGTACACTGGCCAATGGATGGGACAAGAACAAGGA ATCAGACAACTCCGCTGTAATTCTCCAGAAGGGTGGCAAGTATTATCTGGGACTGATAACAAAGGGCCA TAACAAGATTTTCGATGACCGTTTTCAGGAGAAGTTTATAGTGGGCATAGAGGGTGGCAAGTATGAAAA AATAGTCTACAAGTTCTTTCCCGATCAGGCGAAGATGTTCCCCAAAGTATGCTTCAGTGCTAAAGGCCTC GAGTTTTTCCGGCCATCTGAAGAGATACTCCGCATCTATAATAACGCAGAGTTTAAAAAGGGAGAGACG TACTCAATCGACTCGATGCAGAAACTCATTGACTTCTACAAAGATTGTCTCACAAAATACGAGGGCTGG GCTTGCTACACGTTTCGGCACTTGAAGCCAACCGAGGAATATCAAAACAACATCGGGGAGTTCTTCCGT GACGTCGCCGAAGACGGCTATAGAATTGACTTTCAGGGCATAAGTGATCAGTATATTCACGAGAAGAAT GAGAAAGGTGAGTTGCATCTTTTCGAAATCCACAATAAAGACTGGAATCTTGACAAGGCTCGCGATGGA AAATCAAAGACTACCCAGAAGAATCTTCATACACTTTACTTCGAGTCCCTCTTTTCCAACGACAACGTCG TACAGAATTTCCCAATAAAACTGAACGGCCAGGCCGAAATTTTTTACAGGCCCAAAACCGAAAAAGATA AACTGGAATCCAAGAAAGACAAGAAGGGAAATAAGGTGATAGATCACAAAAGGTATTCCGAGAACAAG ATTTTTTTCCACGTACCTCTTACCCTGAACAGAACGAAGAACGACTCTTATAGATTCAATGCCCAGATAA ACAACTTTCTCGCAAACAACAAAGATATCAATATTATCGGCGTCGATAGAGGTGAGAAGCACTTGGTAT ATTATTCTGTGATCACGCAAGCATCCGATATCTTGGAGTCCGGTTCTTTGAACGAACTGAATGGTGTCAA CTACGCCGAGAAACTCGGTAAGAAAGCTGAGAATCGGGAGCAGGCTAGAAGGGACTGGCAGGACGTTC AGGGTATCAAGGACCTGAAGAAGGGCTACATTTCTCAGGTGGTTCGAAAACTGGCTGATTTGGCCATTA AGCACAATGCAATCATCATTTTAGAAGATTTGAACATGCGGTTTAAACAAGTCAGGGGGGGGATAGAGA AATCAATTTACCAACAGCTGGAAAAAGCTCTGATTGATAAACTCTCTTTTTTGGTTGATAAGGGCGAAAA GAACCCCGAGCAAGCAGGACATCTCCTTAAAGCCTATCAACTGAGCGCACCTTTCGAGACATTCCAGAA GATGGGAAAGCAAACCGGCATCATTTTCTATACCCAGGCTTCCTATACATCCAAGTCTGATCCAGTGACT GGGTGGAGACCCCATCTCTACCTCAAGTACTTTTCTGCCAAAAAAGCTAAGGACGACATTGCTAAGTTC ACAAAAATCGAGTTCGTGAACGACAGGTTCGAGCTGACTTATGACATAAAAGATTTCCAGCAGGCCAAG GAGTACCCAAACAAGACAGTTTGGAAAGTGTGTTCCAATGTGGAGAGGTTTCGGTGGGACAAGAATCTG AATCAGAATAAAGGGGGATATACTCACTACACCAACATTACCGAGAACATCCAAGAGTTGTTCACCAAA TACGGCATCGACATTACTAAAGATCTGCTGACACAGATCTCCACCATCGATGAGAAGCAGAACACATCT TTCTTCCGGGATTTCATCTTTTATTTTAACTTGATCTGTCAGATTAGAAATACCGACGACAGTGAGATAG CTAAAAAAAACGGGAAAGACGATTTCATTCTCTCTCCCGTGGAGCCGTTTTTTGACTCCCGCAAAGACA ATGGCAATAAGCTTCCGGAAAACGGGGACGATAACGGCGCCTACAACATCGCTCGTAAGGGAATCGTTA TCCTCAATAAAATAAGCCAGTATTCCGAGAAGAACGAGAATTGTGAAAAAATGAAGTGGGGGGACCTTT ACGTCAGCAACATCGATTGGGATAACTTTGTGACACAAGCCAATGCGAGACACTAG SEQ ATGGAAAACTTCAAAAACCTCTACCCCATCAACAAGACCTTGAGGTTTGAGCTCCGGCCATATGGGAAG ID ACACTGGAGAACTTCAAAAAGTCCGGTCTGCTGGAAAAGGATGCTTTTAAGGCTAACTCTAGGAGGTCT NO: ATGCAGGCCATTATCGATGAGAAATTCAAGGAGACCATAGAGGAGCGTCTGAAATATACTGAGTTTTCC 157 GAGTGTGACCTAGGAAATATGACCAGTAAGGACAAAAAGATCACCGACAAGGCAGCGACAAACCTGAA GAAACAGGTGATTTTAAGCTTTGATGATGAGATTTTCAATAACTACTTGAAGCCGGACAAAAACATCGA CGCTCTGTTCAAGAATGATCCAAGCAACCCGGTCATCTCTACTTTCAAGGGCTTCACCACATACTTTGTA AATTTCTTCGAAATACGGAAACACATCTTCAAGGGAGAGTCTTCCGGTAGCATGGCTTACAGAATAATC GATGAGAACCTAACTACATATCTAAACAATATCGAGAAGATCAAGAAATTGCCTGAAGAACTGAAATCT CAGCTTGAGGGAATCGATCAAATTGACAAACTGAACAACTATAACGAGTTCATCACCCAGTCCGGCATT ACTCATTATAACGAAATTATTGGAGGGATTTCGAAGTCTGAAAATGTCAAAATTCAAGGCATTAACGAA GGGATTAATCTTTACTGTCAAAAGAATAAAGTGAAGCTACCACGCTTAACTCCTCTGTATAAGATGATTC TCTCTGATCGGGTCTCTAATTCCTTTGTGCTGGATACCATTGAAAATGATACCGAGTTAATTGAAATGAT CTCTGATCTGATAAATAAGACAGAGATAAGTCAGGATGTTATTATGTCCGACATCCAAAATATTTTCATC AAATATAAACAACTCGGCAACTTGCCGGGGATTAGCTACTCATCTATAGTGAATGCTATCTGTTCGGATT ACGACAATAACTTTGGTGACGGCAAACGTAAAAAAAGCTATGAGAATGATCGCAAAAAACACCTCGAG ACTAACGTGTATAGCATTAACTATATCTCAGAGTTACTGACAGACACCGACGTCTCCAGCAACATAAAG ATGCGGTACAAAGAGCTGGAGCAGAATTATCAGGTATGCAAGGAAAATTTCAACGCCACTAACTGGATG AACATCAAAAACATTAAGCAGTCTGAGAAAACCAATCTGATCAAGGACCTTCTTGACATCCTCAAGAGC ATCCAGCGGTTTTATGATTTGTTTGACATCGTGGATGAAGACAAAAATCCTAGTGCTGAGTTCTATACCT GGCTGTCTAAAAACGCGGAGAAACTGGACTTCGAGTTTAATTCAGTGTACAACAAGAGCAGGAACTACC TCACGAGAAAGCAGTACTCCGATAAAAAGATTAAGTTGAACTTCGATAGTCCTACTCTCGCCAAGGGGT GGGATGCGAACAAAGAAATTGATAATAGCACAATTATCATGAGGAAGTTCAACAACGACCGGGGCGAT TACGATTACTTCTTGGGGATCTGGAATAAGAGCACACCTGCCAACGAAAAGATCATCCCATTAGAGGAT AATGGACTGTTTGAAAAAATGCAATATAAGCTGTATCCCGATCCTAGTAAAATGCTGCCAAAGCAATTC CTTTCTAAGATCTGGAAAGCTAAACATCCAACTACACCCGAGTTTGATAAGAAGTACAAAGAAGGTCGG CACAAGAAGGGGCCTGATTTTGAGAAAGAGTTTCTGCACGAGTTGATCGATTGCTTTAAGCATGGATTG GTAAACCACGACGAAAAATATCAGGATGTGTTCGGGTTCAATCTGCGCAACACGGAAGACTACAACTCT TATACAGAGTTTCTGGAGGACGTCGAAAGGTGCAACTATAATCTTAGTTTCAATAAAATCGCTGACACG TCTAACTTGATAAATGATGGGAAACTCTATGTTTTTCAGATCTGGAGCAAGGATTTCAGCATAGATAGCA AGGGAACAAAAAACTTGAACACAATATACTTTGAATCCCTCTTCTCGGAGGAAAATATGATCGAGAAGA TGTTCAAGCTCTCAGGGGAAGCCGAAATATTCTATCGTCCAGCAAGTTTGAATTATTGTGAAGATATTAT CAAGAAGGGACACCACCACGCCGAACTGAAGGACAAATTCGACTATCCCATCATCAAGGACAAGCGAT ATAGCCAGGACAAATTTTTTTTTCATGTCCCCATGGTTATCAACTACAAAAGCGAGAAGTTAAACTCCAA ATCACTTAACAATAGGACGAACGAAAATTTAGGCCAATTCACGCACATCATCGGTATCGACCGCGGAGA GCGACATCTCATCTACCTGACCGTGGTGGATGTGTCCACCGGTGAGATCGTTGAGCAAAAGCACCTGGA TGAAATTATAAATACAGATACAAAAGGCGTCGAGCATAAAACTCATTATCTCAATAAATTAGAAGAGAA GTCCAAGACGCGGGATAATGAAAGAAAGTCCTGGGAAGCAATCGAGACGATTAAGGAGCTGAAAGAAG GCTATATTAGCCACGTGATCAATGAAATCCAGAAATTGCAGGAAAAGTATAACGCACTGATAGTGATGG AGAACCTCAATTATGGGTTTAAGAACTCGCGTATCAAAGTGGAAAAGCAGGTCTACCAGAAATTCGAGA CCGCCCTGATTAAAAAGTTTAATTACATCATTGACAAGAAAGATCCTGAAACCTACATTCATGGATACC AACTGACGAATCCAATCACTACACTCGATAAAATTGGTAACCAGAGCGGTATTGTGTTGTACATTCCGG CTTGGAATACAAGCAAGATTGATCCAGTCACTGGTTTCGTTAACCTCCTGTATGCAGACGATTTGAAATA CAAGAACCAGGAGCAGGCTAAAAGCTTTATCCAGAAAATCGATAATATCTACTTCGAAAATGGTGAGTT TAAATTTGATATAGATTTCAGCAAATGGAACAACCGCTACTCAATTAGCAAGACGAAATGGACACTGAC AAGCTACGGAACCCGGATACAGACGTTCCGAAACCCCCAGAAAAATAACAAGTGGGACAGCGCCGAGT ATGACCTGACCGAAGAGTTTAAATTAATCCTGAACATCGATGGTACTCTGAAATCTCAGGATGTGGAAA CCTATAAGAAATTCATGTCTTTATTCAAGCTGATGTTGCAGCTGCGAAACTCCGTTACTGGAACAGACAT TGACTACATGATTAGCCCTGTGACAGATAAAACTGGAACCCACTTTGATTCACGGGAGAATATCAAGAA CCTGCCCGCCGATGCTGATGCGAACGGAGCTTACAACATTGCTAGGAAGGGCATCATGGCAATCGAGAA TATTATGAACGGCATTAGCGACCCTCTGAAGATCAGTAATGAGGACTACCTGAAGTACATTCAGAACCA ACAAGAGTAA SEQ ATGACCCAGTTTGAGGGTTTCACCAATCTTTATCAGGTGTCAAAAACACTCAGATTTGAGCTCATCCCAC ID AGGGTAAAACTTTAAAGCATATTCAAGAGCAGGGCTTTATAGAGGAAGACAAAGCCAGAAACGACCAT NO: TATAAGGAACTAAAACCGATCATTGACCGCATCTACAAAACCTATGCCGACCAATGCCTTCAGCTCGTC 158 CAACTCGATTGGGAGAATCTGAGCGCCGCTATTGACAGCTACAGGAAGGAGAAGACCGAGGAGACTAG AAACGCCCTGATCGAGGAGCAGGCGACCTATAGAAACGCTATTCACGATTATTTTATCGGCCGCACCGA CAATTTGACAGATGCCATCAACAAGCGGCACGCCGAAATTTATAAGGGGTTATTTAAGGCCGAGCTGTT CAATGGAAAAGTACTGAAACAGCTGGGCACCGTAACAACCACCGAACACGAGAATGCTCTGTTGAGGT CCTTCGACAAGTTTACTACCTACTTTAGCGGCTTCTACGAAAACCGTAAAAACGTGTTTTCCGCGGAGGA TATTTCAACAGCCATTCCTCATAGGATCGTGCAGGATAATTTCCCCAAGTTTAAGGAGAACTGCCATATC TTTACCAGACTTATCACTGCTGTGCCAAGTTTACGAGAACACTTCGAGAATGTTAAGAAGGCTATAGGC ATATTCGTTTCCACCTCCATCGAAGAAGTATTCAGTTTTCCATTCTACAATCAGTTACTCACGCAGACCC AGATAGATCTCTACAATCAGCTGCTCGGAGGCATTTCTAGAGAAGCAGGCACGGAAAAGATCAAGGGCT TAAATGAAGTACTCAATCTTGCAATTCAGAAGAACGATGAGACAGCACACATTATTGCATCTCTCCCTCA CAGATTCATTCCCCTGTTCAAACAGATCCTGTCCGATCGCAACACACTAAGCTTTATACTTGAGGAGTTT AAGTCAGATGAGGAAGTGATCCAGAGCTTCTGTAAGTATAAGACTTTGCTCCGTAATGAAAACGTGCTT GAGACAGCAGAGGCTCTCTTTAACGAGTTGAATTCCATCGACCTGACACACATTTTTATCAGCCATAAAA AGCTGGAAACGATTAGCTCTGCCTTGTGCGACCACTGGGACACCCTGCGTAACGCCCTCTATGAAAGGC GCATTTCCGAGCTCACCGGGAAGATCACAAAAAGTGCCAAGGAAAAAGTCCAGAGGTCCCTTAAACAT GAAGACATCAACCTACAAGAGATCATCTCTGCGGCTGGGAAAGAGCTGTCAGAAGCATTTAAACAGAA GACTTCCGAGATCCTGAGCCACGCACACGCCGCATTAGACCAGCCCCTGCCTACAACTCTTAAAAAACA GGAGGAGAAGGAGATTTTAAAGAGCCAGCTGGACTCATTACTCGGCCTGTATCATCTCCTGGACTGGTT CGCCGTGGACGAATCCAACGAGGTGGACCCAGAATTTAGCGCCAGGCTGACAGGAATTAAACTGGAAA TGGAGCCAAGTTTGAGCTTTTACAACAAGGCTCGGAACTATGCCACTAAAAAGCCCTACAGCGTGGAAA AGTTCAAGCTGAATTTTCAGATGCCGACCCTGGCTTCCGGGTGGGATGTTAATAAGGAAAAGAATAATG GGGCTATACTGTTCGTCAAAAATGGTCTCTACTACCTGGGAATCATGCCCAAACAGAAGGGCAGGTACA AAGCCCTTTCGTTTGAGCCGACCGAAAAAACCAGCGAAGGCTTTGATAAGATGTATTACGACTATTTCCC AGATGCAGCCAAGATGATCCCAAAATGTAGCACTCAGTTGAAGGCGGTAACCGCTCACTTTCAGACACA CACCACTCCTATCTTGCTCTCCAACAACTTTATTGAGCCGCTGGAGATCACGAAGGAAATCTACGACCTT AACAACCCAGAGAAGGAACCCAAGAAATTCCAAACAGCTTATGCTAAGAAGACTGGGGATCAAAAGGG CTATCGAGAGGCTTTGTGTAAGTGGATTGACTTTACACGGGATTTCCTGAGTAAGTATACCAAGACCACA TCTATTGACCTGTCCTCACTGAGACCTTCCTCACAATATAAGGATCTCGGAGAGTATTATGCCGAACTCA ACCCTCTACTCTATCACATCTCTTTCCAGAGGATCGCCGAAAAGGAAATTATGGACGCCGTCGAGACAG GCAAGCTGTACCTCTTCCAGATTTACAACAAGGATTTCGCAAAGGGCCACCACGGAAAACCCAATTTGC ACACTTTGTACTGGACAGGGCTCTTCTCTCCCGAAAATTTGGCCAAAACTTCAATAAAACTGAACGGGC AAGCCGAGCTGTTCTATCGGCCCAAGTCACGTATGAAGCGGATGGCCCACCGGCTGGGCGAGAAGATGC TCAACAAGAAACTGAAGGATCAGAAGACGCCCATACCAGACACTCTTTACCAAGAGCTGTATGACTACG TGAATCACAGACTGAGTCACGACCTGTCTGATGAAGCCCGGGCTCTTCTTCCAAATGTGATTACCAAAG AAGTTTCCCACGAAATTATCAAGGACCGGCGCTTCACCTCTGACAAATTCTTTTTCCACGTCCCAATCAC CCTCAACTACCAGGCAGCCAATTCCCCTTCAAAGTTTAACCAGCGTGTGAATGCCTACCTGAAAGAGCA TCCGGAGACCCCCATCATAGGGATAGACAGAGGAGAGCGGAATCTTATCTACATTACTGTGATTGACAG CACAGGTAAGATCTTGGAGCAGAGATCTTTAAATACAATCCAGCAGTTTGACTACCAGAAGAAACTGGA TAACCGAGAGAAGGAAAGGGTTGCTGCAAGACAGGCCTGGTCAGTGGTCGGCACCATCAAAGACCTGA AGCAGGGCTACTTATCCCAAGTAATTCACGAAATTGTCGATCTTATGATTCATTATCAAGCCGTTGTTGT GCTGGAGAACCTGAATTTTGGCTTCAAAAGCAAACGAACAGGTATCGCCGAGAAAGCCGTGTATCAGCA GTTCGAAAAGATGCTCATAGACAAGCTGAACTGCTTAGTGCTGAAGGATTATCCTGCTGAGAAGGTCGG CGGCGTACTTAACCCATACCAGCTGACCGATCAGTTCACTAGTTTCGCCAAGATGGGAACGCAAAGTGG CTTCCTTTTCTACGTGCCCGCTCCCTACACGAGTAAGATCGACCCTCTGACCGGCTTCGTCGACCCATTCG TCTGGAAGACCATCAAGAATCACGAATCACGGAAACACTTCTTAGAGGGGTTTGACTTCCTGCACTACG ACGTGAAGACAGGGGACTTCATCTTACACTTTAAGATGAATCGAAACCTCTCCTTCCAGCGGGGCCTGC CTGGTTTCATGCCCGCATGGGACATCGTGTTTGAGAAAAACGAGACACAGTTTGACGCTAAGGGAACCC CCTTTATTGCGGGGAAGCGGATTGTCCCAGTCATCGAAAACCATCGGTTCACCGGGCGATACCGGGATC TGTACCCGGCCAACGAGCTCATCGCGCTGCTGGAGGAGAAGGGTATTGTGTTTAGGGATGGATCCAACA TTCTGCCTAAGTTGCTGGAAAATGATGATTCGCACGCCATTGATACCATGGTTGCACTGATTAGATCCGT ACTGCAGATGAGGAATAGCAATGCTGCAACCGGGGAGGATTATATTAATTCCCCAGTGCGAGATCTGAA TGGTGTCTGTTTTGACTCGCGCTTTCAGAATCCAGAATGGCCAATGGATGCAGACGCTAACGGGGCGTA CCACATTGCTCTGAAAGGCCAGCTACTCCTGAACCACCTCAAGGAGAGCAAAGATCTGAAGCTGCAGAA CGGCATTTCCAACCAAGACTGGCTCGCCTACATACAAGAACTGCGCAATTAA SEQ ATGGCTGTCAAATCCATCAAGGTTAAATTACGGCTTGATGACATGCCCGAGATCCGCGCCGGGCTCTGG ID AAACTCCATAAAGAAGTGAATGCTGGCGTTAGATACTACACAGAATGGCTCTCCCTGCTGCGCCAGGAA NO: AATTTGTACCGCCGGTCACCTAATGGAGATGGAGAGCAGGAATGCGATAAAACAGCAGAAGAGTGCAA 159 AGCCGAATTGCTGGAGCGACTGCGGGCACGGCAGGTTGAGAATGGACACCGAGGTCCGGCGGGATCGG ACGACGAGCTGCTCCAGCTCGCCAGACAATTATATGAACTGCTGGTGCCTCAGGCTATTGGGGCAAAGG GTGACGCACAGCAGATTGCTAGAAAATTTCTGTCTCCCCTCGCCGACAAAGACGCTGTCGGCGGCCTTG GGATAGCCAAAGCCGGCAACAAACCCCGATGGGTGCGCATGAGGGAGGCTGGTGAGCCTGGCTGGGAG GAAGAAAAGGAAAAGGCCGAAACCAGAAAGTCCGCCGACAGGACCGCGGACGTACTCCGAGCATTGGC CGATTTTGGGCTGAAGCCCTTAATGCGAGTCTACACCGATAGTGAAATGTCTAGCGTGGAGTGGAAGCC ATTACGCAAAGGGCAGGCAGTGCGGACGTGGGACCGTGACATGTTCCAGCAAGCCATCGAGCGAATGA TGAGCTGGGAGAGCTGGAACCAGAGAGTGGGGCAGGAGTATGCCAAGCTGGTCGAGCAGAAAAACCGG TTTGAGCAAAAAAATTTTGTAGGTCAGGAACACCTGGTGCATCTCGTTAACCAGCTCCAGCAAGATATG AAGGAAGCTTCGCCTGGATTAGAGAGCAAAGAGCAGACTGCACACTATGTAACCGGAAGAGCACTGAG GGGCAGTGACAAAGTGTTCGAAAAATGGGGAAAACTGGCTCCCGATGCCCCCTTTGACCTGTACGACGC AGAAATAAAAAACGTGCAGCGGCGAAACACCAGGCGATTTGGTAGCCATGATCTGTTCGCCAAATTGGC AGAGCCGGAATATCAGGCTCTTTGGCGAGAAGACGCATCATTTCTCACTAGGTACGCGGTCTATAACTC CATTTTGAGGAAATTGAACCACGCAAAAATGTTTGCCACCTTCACGTTGCCTGACGCCACCGCTCATCCC ATTTGGACACGGTTTGATAAGCTGGGCGGCAATCTGCATCAGTATACATTCCTGTTTAACGAGTTTGGAG AGCGAAGACATGCGATACGATTCCACAAGCTACTGAAGGTCGAAAATGGCGTGGCACGTGAGGTGGAC GATGTCACCGTGCCCATCAGCATGAGCGAACAGCTGGATAATTTGTTGCCGCGGGACCCAAATGAACCT ATAGCCCTTTATTTTAGGGACTACGGGGCGGAGCAACATTTCACTGGGGAGTTTGGCGGCGCAAAAATT CAGTGCCGACGCGACCAGCTCGCCCACATGCATAGAAGACGCGGGGCCCGGGACGTATACCTTAACGTC TCTGTGAGGGTGCAGTCCCAGTCAGAGGCAAGAGGGGAACGCAGACCACCTTACGCAGCAGTATTCAG GCTGGTAGGCGATAACCACCGGGCGTTTGTACACTTTGATAAACTTTCTGACTACCTGGCCGAACACCCG GATGACGGCAAATTAGGATCGGAGGGGCTGCTTAGCGGCCTGCGTGTGATGAGCGTCGATCTGGGGCTA CGGACCTCTGCTTCCATCTCTGTGTTCCGTGTGGCCCGAAAGGACGAGTTGAAACCTAATTCGAAGGGCC GTGTACCATTCTTTTTCCCTATTAAGGGAAATGATAATCTCGTCGCGGTGCACGAGCGTTCCCAACTGCT GAAACTGCCTGGCGAGACCGAGTCCAAAGATCTCAGAGCAATCCGGGAGGAGCGACAACGTACACTTA GGCAACTCCGCACCCAGCTGGCCTATCTGCGCTTGCTGGTGCGGTGCGGCTCCGAGGATGTAGGGAGAA GAGAGCGAAGCTGGGCAAAGCTGATAGAGCAACCAGTTGACGCCGCGAATCACATGACCCCCGACTGG CGCGAAGCGTTTGAAAATGAGCTGCAGAAGTTGAAATCTCTGCATGGGATTTGCTCAGATAAGGAGTGG ATGGACGCCGTATACGAGTCTGTTCGCCGGGTATGGCGGCACATGGGGAAGCAGGTGAGAGATTGGAG AAAGGACGTTCGCTCTGGGGAACGGCCGAAAATTCGGGGATACGCAAAGGATGTCGTGGGCGGCAATA GCATTGAGCAGATCGAGTACCTGGAAAGGCAATACAAATTTCTGAAATCTTGGTCTTTCTTTGGGAAGGT AAGCGGACAAGTTATCAGAGCCGAAAAGGGATCTCGCTTTGCTATCACATTGAGGGAACACATTGATCA CGCCAAAGAAGACAGGTTGAAAAAGTTGGCTGATCGCATTATCATGGAAGCACTCGGTTACGTCTACGC CCTTGATGAGCGCGGTAAAGGGAAGTGGGTAGCCAAGTATCCCCCATGTCAGCTGATCCTGCTCGAGGA ACTTTCTGAGTATCAGTTCAATAACGACCGTCCTCCCTCCGAAAATAATCAGCTCATGCAATGGTCCCAC CGGGGTGTGTTCCAAGAACTGATCAATCAGGCTCAGGTGCACGACCTCCTCGTAGGCACTATGTATGCA GCCTTTAGCTCCCGTTTTGACGCGCGCACAGGCGCCCCTGGAATACGATGTAGGCGAGTTCCCGCACGGT GCACTCAAGAACATAACCCGGAGCCTTTCCCATGGTGGCTCAATAAGTTTGTTGTGGAGCATACCCTCGA CGCTTGCCCATTGAGGGCGGATGACTTGATTCCCACAGGCGAGGGGGAGATCTTCGTGAGCCCATTTTCT GCCGAAGAAGGGGATTTCCACCAAATACATGCCGACTTGAATGCTGCCCAAAATCTGCAGCAAAGGCTG TGGTCAGACTTCGACATCTCGCAAATCAGACTGCGGTGTGACTGGGGCGAAGTAGACGGCGAGCTGGTG CTGATACCTAGACTGACGGGTAAGCGTACCGCCGATAGCTATAGTAATAAGGTTTTTTATACGAATACG GGGGTGACATATTACGAGCGTGAGAGAGGCAAGAAGCGTCGGAAGGTGTTCGCGCAGGAGAAGCTGAG CGAAGAGGAGGCGGAGCTACTGGTAGAGGCAGATGAGGCAAGAGAAAAGTCCGTCGTCCTGATGCGGG ATCCTAGCGGGATTATTAACAGAGGTAATTGGACACGGCAGAAAGAATTCTGGAGCATGGTGAATCAAA GAATCGAGGGTTACCTGGTGAAGCAAATTCGAAGCCGGGTGCCCCTTCAAGACAGCGCATGTGAAAACA CTGGGGACATCTAG SEQ ATGGCTACTCGGTCCTTCATCCTGAAAATCGAGCCAAATGAAGAGGTGAAAAAGGGCCTGTGGAAGACC ID CATGAGGTACTTAACCACGGCATAGCATACTATATGAATATCCTAAAACTTATACGGCAGGAGGCTATC NO: TACGAGCATCACGAGCAAGATCCTAAAAATCCAAAGAAGGTTAGTAAGGCTGAAATCCAGGCTGAATT 160 GTGGGACTTCGTGCTGAAGATGCAGAAATGCAACAGTTTCACGCATGAAGTTGATAAGGACGTCGTGTT TAATATACTCCGGGAGCTGTACGAAGAACTGGTACCAAGCTCTGTGGAAAAGAAAGGAGAGGCCAACC AGCTAAGTAATAAGTTCCTCTATCCTCTCGTGGACCCCAATTCACAGAGCGGCAAAGGTACCGCATCTTC TGGGAGGAAACCACGCTGGTACAACTTGAAGATCGCTGGCGATCCCAGCTGGGAGGAGGAAAAGAAGA AATGGGAAGAGGATAAAAAGAAAGACCCCCTGGCCAAAATCTTAGGCAAGCTCGCCGAGTACGGTCTG ATTCCACTTTTCATCCCGTTCACAGATAGCAATGAGCCGATCGTCAAGGAGATTAAGTGGATGGAAAAG AGCCGCAATCAGAGTGTGCGGAGGCTGGACAAAGACATGTTTATTCAGGCCCTGGAACGCTTCCTTAGC TGGGAAAGCTGGAACCTGAAGGTTAAGGAAGAGTACGAAAAAGTCGAGAAGGAGCATAAGACTTTGGA GGAGCGCATCAAAGAAGACATCCAGGCCTTTAAGTCTCTAGAACAGTATGAGAAAGAACGGCAGGAAC AGCTGCTGCGTGATACACTGAACACAAACGAATATCGCCTGAGCAAGAGGGGACTCAGAGGCTGGAGA GAAATCATTCAAAAGTGGCTCAAAATGGATGAAAATGAGCCGTCTGAAAAATACCTTGAAGTTTTCAAG GACTACCAGCGGAAGCACCCTAGAGAAGCCGGCGACTATAGTGTTTACGAATTCTTGAGCAAGAAGGA GAATCATTTTATATGGAGGAATCACCCGGAGTACCCATATCTGTACGCAACCTTCTGCGAAATCGACAA GAAAAAAAAAGACGCCAAGCAACAGGCTACATTTACTCTGGCCGACCCTATCAATCACCCTCTATGGGT CCGGTTTGAGGAGCGCTCCGGAAGCAATCTGAATAAATATCGTATTCTGACTGAACAGTTACACACAGA GAAGCTCAAGAAGAAACTTACGGTGCAGCTGGACCGCCTGATATACCCAACAGAGTCCGGAGGATGGG AAGAGAAAGGAAAGGTTGACATCGTACTGCTTCCATCTCGTCAGTTTTACAACCAGATATTCCTGGACAT CGAGGAGAAGGGGAAACACGCCTTCACATACAAGGACGAGTCCATAAAGTTCCCACTGAAGGGTACTTT AGGCGGTGCTAGGGTGCAGTTCGACCGCGATCACCTGAGACGGTACCCCCACAAGGTGGAGAGCGGGA ACGTGGGACGAATCTACTTTAATATGACAGTGAACATTGAACCCACAGAGAGTCCAGTTAGTAAATCCC TGAAAATTCACCGTGACGACTTTCCGAAATTTGTGAATTTCAAGCCAAAGGAGCTTACGGAGTGGATCA AGGATTCAAAGGGAAAGAAGCTGAAATCTGGTATCGAATCTCTCGAGATCGGTCTCCGTGTCATGAGCA TCGATCTGGGACAGCGCCAGGCAGCTGCCGCCAGTATATTCGAGGTGGTAGACCAAAAGCCTGACATCG AGGGAAAGCTCTTCTTCCCAATCAAAGGCACAGAGCTGTATGCGGTGCACCGGGCGTCCTTTAATATAA AGCTGCCCGGTGAAACCCTGGTGAAGTCACGGGAGGTGCTTAGAAAAGCGCGAGAGGATAACCTCAAA CTGATGAACCAAAAACTGAACTTTCTGAGGAACGTCCTGCACTTTCAGCAGTTCGAAGATATTACCGAA CGCGAAAAGAGAGTAACCAAGTGGATATCTCGTCAAGAGAACAGCGACGTCCCGTTAGTCTATCAGGAC GAACTCATCCAAATACGGGAGTTGATGTATAAGCCCTACAAGGATTGGGTCGCCTTTCTTAAGCAGCTTC ACAAACGCCTAGAGGTCGAAATAGGTAAAGAGGTGAAACATTGGCGGAAGTCGCTCAGCGACGGGAGG AAGGGACTTTATGGCATCTCTTTGAAGAACATTGACGAAATCGATAGAACCAGAAAATTTTTGTTGAGA TGGTCCCTCCGACCCACCGAGCCTGGAGAGGTGAGGCGGTTAGAACCAGGACAGAGGTTCGCTATCGAT CAGCTGAATCACCTCAATGCTCTGAAGGAGGACCGCCTCAAGAAAATGGCCAATACAATCATAATGCAC GCCCTTGGCTACTGCTACGACGTCCGAAAGAAGAAGTGGCAGGCCAAGAATCCCGCCTGTCAAATTATC CTTTTTGAGGATCTTAGCAATTACAACCCCTATGAAGAGCGGTCCAGATTCGAAAATAGTAAGCTCATG AAGTGGAGCCGCAGGGAGATCCCGCGCCAAGTGGCCCTTCAGGGGGAAATTTATGGGCTGCAGGTAGG CGAGGTCGGGGCCCAATTCTCCTCGCGCTTTCATGCGAAAACTGGAAGTCCTGGAATCCGGTGCTCAGT GGTGACAAAGGAGAAGTTGCAAGACAATCGGTTTTTTAAAAACTTACAGCGGGAGGGAAGGCTGACCC TGGATAAGATAGCCGTACTTAAGGAAGGAGATCTGTACCCTGACAAAGGCGGTGAAAAGTTCATTAGCT TGAGCAAGGACCGAAAACTTGTGACCACCCACGCTGACATCAATGCGGCACAGAACCTGCAGAAGAGA TTTTGGACTCGCACCCACGGATTCTACAAAGTTTACTGCAAAGCATATCAAGTAGACGGACAGACCGTA TACATCCCCGAGTCCAAAGATCAGAAGCAGAAAATTATTGAAGAGTTTGGGGAAGGGTACTTTATCCTG AAGGATGGTGTCTACGAATGGGGCAACGCTGGTAAACTTAAAATTAAGAAGGGCAGCTCTAAACAGTCC TCCAGCGAGTTAGTTGATTCTGATATTCTGAAAGACAGTTTCGACCTGGCCAGCGAACTTAAAGGGGAA AAATTAATGCTGTACCGGGACCCCAGCGGAAACGTCTTTCCATCCGATAAGTGGATGGCCGCTGGAGTG TTCTTTGGCAAGTTAGAGAGGATTCTCATAAGTAAGCTGACCAACCAATACTCAATCTCCACAATCGAG GATGACTCATCCAAGCAGTCTATGTGA SEQ ATGCCTACACGCACTATCAACCTGAAACTGGTTCTTGGCAAGAATCCAGAGAATGCTACCCTTCGTCGG ID GCACTATTTTCAACGCATAGACTGGTGAATCAGGCTACCAAACGGATTGAAGAGTTCCTCTTGCTTTGTC NO: GGGGGGAAGCATATAGGACGGTGGATAATGAGGGGAAAGAGGCTGAAATTCCGAGACACGCCGTGCAG 161 GAGGAAGCTCTTGCGTTTGCAAAGGCCGCTCAACGGCACAATGGTTGCATCTCTACTTATGAAGACCAG GAAATCCTGGATGTGCTCCGGCAACTGTATGAAAGGCTGGTGCCTTCTGTGAATGAAAATAATGAAGCA GGGGACGCTCAAGCCGCAAACGCGTGGGTGTCGCCACTGATGTCCGCCGAGTCCGAGGGAGGGCTCAG CGTTTACGACAAGGTGCTGGACCCACCCCCAGTGTGGATGAAACTCAAAGAGGAAAAAGCTCCGGGCTG GGAGGCTGCTTCCCAGATCTGGATCCAGTCCGACGAAGGGCAGTCCCTTCTTAACAAGCCTGGTTCGCCC CCGCGGTGGATTAGGAAACTGAGGTCAGGCCAGCCTTGGCAGGACGATTTTGTTAGCGACCAGAAAAAG AAGCAGGACGAGCTGACAAAGGGGAATGCGCCACTGATCAAACAATTAAAGGAAATGGGCTTATTGCC TCTTGTGAATCCCTTTTTTAGACATCTGCTTGACCCGGAGGGGAAGGGGGTGTCACCTTGGGACAGACTC GCTGTTAGGGCCGCTGTCGCTCATTTCATATCATGGGAATCATGGAACCACCGGACACGCGCCGAATAC AATAGTTTGAAGCTGCGGAGGGATGAGTTCGAAGCAGCTTCCGACGAATTCAAGGACGACTTCACGCTG CTTCGGCAGTACGAGGCTAAGAGGCACTCCACACTGAAGAGTATAGCTTTAGCCGATGATTCAAACCCT TATAGGATCGGCGTACGCTCCCTCCGCGCTTGGAACCGCGTCCGCGAGGAGTGGATCGACAAGGGAGCG ACCGAGGAGCAGCGGGTCACCATTCTCAGCAAGTTGCAGACCCAACTAAGGGGCAAATTTGGAGATCCT GACTTGTTCAACTGGCTGGCGCAGGACCGGCACGTGCACCTCTGGAGCCCTAGAGATAGTGTTACCCCA CTGGTTAGGATCAACGCTGTTGACAAAGTATTGCGACGGAGAAAACCGTACGCCTTGATGACTTTTGCC CACCCAAGATTCCACCCTCGGTGGATACTTTACGAAGCCCCAGGGGGCAGCAATCTCCGCCAGTATGCA CTGGATTGTACCGAAAATGCTCTGCACATTACACTGCCTCTGCTGGTTGACGATGCACATGGCACATGGA TTGAGAAAAAAATTAGGGTTCCTCTTGCCCCCAGCGGCCAGATTCAGGACCTGACACTAGAAAAGCTCG AGAAGAAGAAAAATCGTCTCTACTACCGTTCTGGGTTCCAGCAGTTTGCCGGCCTGGCCGGAGGTGCCG AGGTGCTTTTCCATCGACCATACATGGAGCACGATGAGAGGAGCGAGGAGAGCTTATTAGAACGCCCTG GTGCTGTTTGGTTCAAACTCACCTTGGACGTGGCAACCCAGGCCCCTCCAAACTGGTTGGACGGAAAGG GCCGCGTCCGAACGCCCCCCGAGGTTCACCACTTCAAGACAGCCCTCAGTAACAAGTCTAAGCACACAC GGACCCTCCAGCCCGGACTCAGAGTGTTATCCGTGGATCTGGGAATGCGCACCTTCGCCTCTTGCTCCGT ATTTGAGCTGATCGAGGGCAAACCAGAGACTGGCAGAGCGTTCCCTGTGGCCGACGAACGTTCCATGGA TTCACCAAACAAGCTGTGGGCCAAGCACGAAAGATCCTTTAAACTCACGCTCCCCGGCGAAACCCCCAG TCGGAAAGAAGAGGAGGAACGGAGCATTGCAAGAGCCGAAATCTATGCGTTGAAAAGAGATATTCAGA GATTAAAAAGTCTTCTGCGCCTGGGGGAAGAGGATAACGATAATAGACGCGATGCACTTCTTGAGCAAT TTTTCAAGGGCTGGGGCGAGGAAGACGTGGTTCCAGGTCAGGCCTTTCCCCGGAGTCTGTTCCAGGGGC TGGGGGCCGCCCCATTCAGATCCACCCCTGAGTTGTGGAGACAACACTGTCAAACCTATTATGATAAAG CAGAGGCGTGCCTGGCTAAACACATCAGCGATTGGCGCAAGAGAACCAGGCCTAGGCCTACCTCACGTG AGATGTGGTACAAGACACGCTCTTATCACGGCGGAAAGTCAATCTGGATGCTGGAATACCTCGACGCTG TGAGGAAACTGCTCTTATCCTGGAGCCTCAGAGGCCGGACCTACGGGGCTATCAACAGACAGGACACAG CAAGGTTCGGGAGCTTAGCCAGCCGGCTCCTTCACCACATTAACTCACTCAAAGAGGATCGAATAAAGA CCGGAGCCGACTCGATCGTGCAGGCAGCCCGAGGGTACATCCCCCTGCCTCATGGGAAGGGCTGGGAGC AGCGATATGAACCCTGCCAGCTGATCTTGTTTGAGGACCTTGCCCGTTATAGATTTCGCGTTGATAGACC TCGCCGTGAGAATTCTCAGCTGATGCAGTGGAACCACAGAGCGATCGTGGCTGAGACCACTATGCAGGC CGAGCTGTATGGACAGATCGTGGAGAACACCGCCGCAGGGTTCAGTTCTCGGTTTCATGCTGCCACCGG AGCTCCCGGCGTCCGGTGCCGCTTCCTCTTAGAGCGTGATTTTGACAATGACCTCCCAAAGCCCTATCTG CTGAGGGAACTGAGCTGGATGCTGGGGAACACAAAAGTAGAATCGGAGGAGGAGAAGCTACGGCTCCT CTCCGAAAAGATACGTCCAGGCTCTCTGGTACCATGGGACGGAGGAGAGCAGTTCGCGACACTGCATCC TAAGAGACAGACGTTATGTGTGATTCACGCCGATATGAACGCCGCTCAGAATCTGCAGCGAAGATTCTT TGGCCGCTGCGGCGAAGCCTTCAGGCTGGTATGTCAGCCCCACGGGGATGATGTGCTGCGGCTGGCCTC AACCCCTGGGGCTAGACTCTTGGGGGCACTCCAGCAGCTGGAAAATGGCCAAGGGGCTTTCGAACTCGT TCGGGACATGGGCAGCACAAGCCAGATGAACAGATTCGTCATGAAGAGCCTGGGAAAGAAAAAGATCA AACCCTTACAGGACAATAATGGCGACGACGAACTGGAGGACGTGTTGTCCGTGCTGCCAGAGGAAGAC GACACAGGCCGCATCACTGTCTTCCGCGACTCAAGTGGGATATTCTTTCCTTGCAACGTGTGGATTCCGG CCAAACAGTTCTGGCCTGCCGTCAGAGCCATGATTTGGAAAGTGATGGCTAGTCATTCATTGGGATGA SEQ ATGACAAAGCTGAGGCACAGACAAAAGAAGCTTACACACGACTGGGCAGGGAGCAAGAAACGTGAGGT ID CCTTGGGTCAAATGGAAAACTGCAGAACCCCTTGCTCATGCCTGTAAAGAAGGGGCAGGTAACAGAATT NO: TAGAAAAGCATTCTCCGCGTACGCTCGGGCAACTAAGGGGGAAATGACCGATGGACGGAAGAACATGT 162 TCACCCATTCTTTCGAGCCATTCAAAACAAAGCCGTCATTGCACCAATGCGAGCTGGCCGATAAGGCTTA CCAGTCTTTGCATAGTTACCTCCCCGGTTCCCTGGCCCATTTCTTGCTTTCCGCACACGCACTGGGCTTTC GTATTTTCTCTAAATCTGGGGAGGCAACTGCCTTCCAGGCCAGCTCAAAAATCGAGGCCTATGAGTCCA AGCTCGCTTCGGAGCTAGCCTGTGTCGATTTGAGTATCCAGAATTTGACGATTAGTACTCTTTTCAACGC TCTCACAACTTCAGTTCGGGGCAAGGGGGAGGAAACTTCAGCAGATCCCCTTATCGCACGGTTCTACAC TCTCCTGACGGGCAAGCCCCTGAGCCGAGACACACAGGGCCCAGAACGGGACTTGGCAGAGGTCATCTC CAGAAAGATCGCCTCGTCCTTCGGCACATGGAAGGAAATGACTGCCAACCCTCTGCAGAGCCTCCAGTT CTTCGAAGAAGAGCTTCATGCACTAGATGCCAACGTGTCTTTATCTCCAGCTTTTGATGTGTTAATCAAG ATGAATGATCTCCAAGGTGATCTGAAGAACCGTACTATAGTGTTCGACCCAGATGCACCCGTGTTCGAG TACAACGCTGAGGATCCAGCCGATATCATCATAAAGCTGACAGCTCGGTATGCGAAGGAGGCCGTCATC AAGAATCAGAACGTGGGCAATTATGTGAAAAACGCCATTACCACCACTAATGCCAATGGGCTGGGGTGG CTCCTCAATAAAGGGCTTTCACTACTGCCAGTTTCTACTGACGATGAGCTGCTCGAATTCATTGGGGTGG AGAGAAGCCATCCCAGCTGTCACGCGCTGATAGAGCTGATTGCCCAGCTAGAGGCGCCGGAACTGTTTG AGAAGAATGTGTTTAGTGACACCCGTTCCGAGGTTCAGGGTATGATCGACAGTGCAGTGTCGAACCACA TTGCTCGGCTGTCCAGCAGCCGAAACTCCCTGAGCATGGACAGCGAGGAATTGGAACGCTTGATTAAAT CTTTCCAGATTCATACTCCCCATTGTTCTCTGTTCATAGGCGCTCAGTCCTTATCTCAGCAGCTGGAGAGC TTACCTGAGGCGCTGCAGTCCGGAGTGAACAGCGCTGATATCTTATTAGGCAGCACACAGTATATGCTG ACCAACTCTCTCGTTGAAGAGTCAATTGCAACATATCAAAGGACATTAAATAGGATCAATTACCTGAGT GGGGTGGCTGGGCAGATTAACGGTGCTATCAAAAGAAAGGCAATCGACGGCGAAAAAATACACCTGCC TGCCGCCTGGAGTGAGCTCATCTCCTTACCTTTCATTGGACAGCCGGTGATTGATGTGGAGAGCGACCTG GCACACTTAAAAAACCAGTACCAGACCCTGTCCAATGAATTTGACACCCTCATTTCGGCCCTGCAGAAG AACTTCGATTTGAATTTCAACAAAGCACTCCTTAACCGCACGCAGCATTTCGAGGCAATGTGCCGGAGC ACAAAAAAAAATGCTTTATCTAAGCCCGAGATCGTGTCCTACAGAGATCTGCTGGCGCGGCTGACCAGT TGCCTTTATCGAGGCTCGCTGGTTCTCAGAAGGGCGGGAATCGAAGTTCTGAAAAAGCACAAAATCTTT GAGTCGAATAGTGAGCTGAGAGAACACGTCCACGAGCGAAAGCACTTCGTGTTCGTTAGTCCATTGGAC AGAAAGGCAAAAAAACTGTTGCGCCTGACCGATTCCCGCCCTGACTTGCTCCATGTGATCGATGAGATC CTGCAACATGACAATCTGGAGAATAAGGACAGAGAGTCCCTTTGGCTGGTCCGGTCTGGGTACCTCCTT GCTGGTCTGCCGGACCAGCTGAGTTCTTCGTTTATCAATCTCCCCATAATCACGCAAAAGGGCGATCGCC GGCTGATTGACCTGATTCAGTATGACCAGATCAATCGCGATGCTTTCGTAATGTTGGTGACAAGTGCTTT CAAAAGCAATCTCTCTGGGTTGCAGTACCGCGCTAACAAGCAGTCTTTCGTGGTCACCCGCACCCTGTCT CCTTACCTGGGTAGTAAGCTCGTATACGTCCCTAAAGACAAAGATTGGCTGGTCCCATCCCAGATGTTTG AGGGAAGATTCGCCGATATTCTGCAGAGTGACTACATGGTCTGGAAGGATGCCGGACGCCTGTGCGTGA TCGACACTGCCAAACATCTCTCTAACATTAAAAAAAGCGTGTTTAGTAGCGAAGAAGTCCTTGCTTTTCT TCGAGAGCTGCCTCACCGGACCTTCATCCAGACCGAGGTACGGGGGTTAGGAGTGAACGTCGATGGAAT CGCATTTAATAACGGGGATATCCCGAGCTTGAAGACATTCTCGAATTGTGTGCAGGTGAAGGTGAGTAG GACTAATACTAGTCTCGTGCAGACTCTAAACAGGTGGTTCGAGGGTGGCAAAGTGTCACCTCCCTCTATT CAGTTCGAAAGAGCTTACTACAAAAAAGACGATCAGATTCACGAGGACGCAGCCAAGAGAAAGATACG CTTCCAGATGCCAGCAACGGAATTAGTGCACGCCAGCGATGACGCTGGTTGGACCCCCAGCTACCTGCT GGGCATCGACCCCGGTGAGTACGGAATGGGTCTCAGTTTGGTGTCCATCAACAATGGAGAGGTCCTGGA TTCTGGATTCATCCACATTAATTCCCTGATCAATTTCGCGTCCAAAAAAAGCAATCACCAGACCAAAGTA GTCCCCCGCCAGCAGTACAAGTCCCCCTACGCGAATTATCTCGAGCAGTCAAAGGATTCAGCAGCAGGG GATATAGCTCACATTCTGGATCGGCTAATCTACAAATTGAACGCCTTGCCTGTGTTCGAGGCGCTGTCTG GCAACAGTCAGAGTGCTGCTGATCAGGTATGGACCAAAGTTCTATCCTTCTATACATGGGGAGACAACG ACGCACAGAACAGTATACGGAAGCAGCACTGGTTCGGTGCCTCACACTGGGATATTAAGGGGATGCTGC GCCAACCCCCAACCGAAAAAAAACCCAAACCATATATAGCCTTTCCCGGGAGTCAAGTGTCATCCTATG GAAATAGTCAAAGGTGTAGTTGTTGCGGCCGCAATCCCATTGAGCAGTTGCGTGAGATGGCAAAGGACA CGAGTATCAAGGAGCTGAAAATCCGAAATAGTGAGATCCAACTATTCGATGGTACAATCAAGCTGTTTA ACCCCGACCCTTCCACCGTCATCGAGAGGCGGCGGCATAACCTAGGACCCTCACGCATTCCTGTGGCAG ACCGAACTTTCAAGAATATTAGCCCTTCTTCGTTAGAGTTCAAGGAGCTCATTACTATCGTTTCTCGAAG CATCCGCCATAGCCCCGAATTTATTGCTAAGAAACGGGGTATCGGGTCTGAGTACTTTTGTGCTTATTCT GACTGCAACTCCTCACTGAACTCAGAGGCCAATGCCGCGGCCAATGTGGCACAGAAGTTTCAGAAGCAA CTCTTTTTCGAACTCTGA SEQ ATGAAACGTATTCTGAACTCTCTGAAAGTCGCCGCACTGAGGCTGCTGTTTCGAGGAAAGGGCTCAGAG ID CTGGTGAAGACCGTCAAGTACCCTCTGGTTTCGCCCGTCCAGGGTGCTGTGGAAGAACTCGCCGAAGCA NO: ATACGCCACGACAACCTACATTTATTTGGGCAGAAGGAAATCGTAGATCTGATGGAGAAGGACGAGGG 163 CACCCAGGTCTACTCGGTGGTGGACTTTTGGCTCGACACACTCCGTCTAGGGATGTTCTTCAGTCCAAGT GCTAATGCCCTTAAGATCACTCTGGGGAAGTTTAACAGCGACCAAGTTTCCCCTTTCAGGAAGGTTCTGG AGCAGTCCCCTTTCTTTCTCGCGGGTAGACTCAAAGTGGAGCCCGCTGAACGTATCCTCAGCGTGGAGAT CCGCAAGATCGGTAAGAGGGAGAATAGAGTGGAGAACTACGCCGCAGATGTAGAGACTTGTTTTATCG GTCAGCTGTCTAGTGATGAAAAGCAGTCTATCCAGAAGCTCGCTAACGATATCTGGGACTCTAAGGATC ACGAAGAGCAAAGGATGCTTAAGGCGGATTTCTTTGCCATTCCCCTCATCAAAGACCCAAAGGCAGTGA CCGAGGAAGATCCCGAGAATGAAACCGCAGGCAAACAGAAGCCTCTCGAATTATGTGTGTGCTTAGTGC CCGAGTTGTACACCCGCGGGTTCGGTTCAATAGCGGACTTCCTGGTCCAGCGTCTGACACTATTAAGAGA CAAAATGAGCACAGACACAGCAGAAGACTGCCTTGAGTATGTCGGCATAGAGGAGGAGAAGGGTAATG GGATGAACTCGCTGCTGGGGACGTTCCTCAAGAACCTGCAGGGAGACGGGTTCGAACAGATCTTCCAAT TTATGCTCGGCAGTTACGTGGGATGGCAAGGTAAGGAAGACGTCCTACGCGAACGGCTTGATTTGCTAG CGGAGAAGGTTAAAAGACTGCCGAAACCTAAGTTTGCCGGCGAGTGGTCCGGCCATCGGATGTTCCTGC ATGGTCAATTGAAGAGCTGGTCCTCTAACTTTTTCCGCCTGTTTAACGAGACTAGGGAGCTCCTCGAAAG CATAAAATCCGACATCCAACACGCGACCATGTTAATCAGCTACGTCGAAGAGAAAGGGGGATACCACCC ACAACTCTTGTCACAGTACAGGAAACTAATGGAGCAGCTGCCAGCTCTCAGAACAAAGGTGTTAGATCC AGAGATAGAAATGACTCACATGAGCGAGGCGGTAAGGTCGTACATTATGATCCACAAGTCGGTAGCAG GATTTCTGCCTGACTTACTCGAGTCCCTCGATAGGGACAAGGACAGGGAATTCCTGCTGAGTATATTTCC AAGGATCCCCAAAATTGACAAAAAAACTAAGGAAATCGTGGCCTGGGAGCTCCCAGGCGAGCCCGAAG AAGGATACCTGTTCACTGCCAATAATCTTTTTCGCAACTTTCTGGAGAATCCTAAACATGTTCCACGTTTC ATGGCAGAAAGGATCCCGGAAGATTGGACGCGCCTGCGGTCCGCTCCCGTATGGTTTGACGGCATGGTG AAACAATGGCAGAAAGTGGTAAACCAGCTGGTGGAGTCACCTGGAGCATTGTATCAGTTCAATGAAAGC TTTCTCCGACAACGTTTACAGGCAATGCTGACAGTGTATAAGAGAGACCTGCAGACAGAGAAATTCCTT AAGTTGTTGGCTGATGTCTGCAGGCCTCTGGTGGACTTCTTTGGGCTGGGGGGAAACGATATCATCTTCA AAAGCTGCCAGGACCCGAGGAAACAATGGCAAACTGTCATTCCCTTGAGTGTCCCCGCTGATGTGTACA CCGCGTGTGAGGGGCTGGCAATCCGGCTTCGTGAGACATTGGGATTTGAGTGGAAGAACCTTAAGGGCC ATGAAAGGGAGGACTTTCTAAGACTGCACCAGCTTTTAGGGAATCTGCTTTTCTGGATTCGAGATGCCAA ACTGGTGGTGAAATTGGAAGATTGGATGAATAATCCCTGTGTTCAGGAGTACGTTGAGGCTCGTAAGGC CATTGATCTCCCACTGGAGATCTTCGGCTTTGAGGTCCCCATCTTCCTGAACGGATATCTGTTTAGTGAA CTGAGGCAGTTAGAACTGCTGCTCCGCCGTAAGTCGGTTATGACCAGCTATTCGGTTAAGACAACTGGC AGTCCAAACAGGCTTTTCCAGTTAGTCTACCTGCCATTAAATCCTTCCGACCCTGAGAAAAAAAATTCTA ATAACTTTCAGGAACGCCTGGACACCCCCACTGGCTTATCACGTCGCTTCCTGGACCTTACTCTGGACGC CTTCGCCGGCAAGTTGCTGACAGACCCCGTGACTCAAGAGCTTAAAACTATGGCTGGGTTCTACGATCA CCTGTTTGGTTTCAAGCTCCCATGTAAGCTGGCAGCCATGTCTAACCACCCTGGCTCTAGCAGCAAGATG GTCGTGTTGGCCAAACCTAAAAAAGGGGTTGCATCTAATATAGGATTCGAACCAATCCCTGATCCCGCG CACCCCGTATTCCGGGTGAGATCATCATGGCCAGAGCTGAAGTATCTGGAGGGGTTACTGTATCTTCCAG AAGACACTCCACTGACAATAGAGCTCGCAGAGACAAGTGTTAGTTGTCAGAGCGTCAGTAGCGTGGCAT TCGATCTGAAAAATCTGACTACTATCCTTGGACGCGTGGGTGAGTTCCGTGTGACCGCAGACCAGCCTTT TAAGTTGACCCCCATCATCCCTGAGAAGGAGGAGTCCTTCATAGGAAAAACATATCTAGGCCTTGATGC CGGGGAACGCTCAGGCGTAGGGTTCGCTATCGTCACAGTCGACGGGGATGGGTACGAGGTACAGCGCCT GGGGGTGCATGAAGATACACAGCTGATGGCCCTACAGCAGGTGGCCTCTAAAAGCTTGAAGGAGCCGG TGTTCCAGCCGCTCAGAAAGGGTACTTTTCGGCAGCAGGAACGTATTAGAAAATCTCTCAGAGGATGTT ATTGGAACTTCTATCACGCTCTGATGATTAAGTACCGCGCCAAGGTAGTGCACGAAGAGAGCGTGGGCA GTTCCGGCCTGGTTGGGCAGTGGTTACGAGCATTCCAGAAGGACCTCAAGAAAGCCGATGTGTTGCCAA AAAAGGGAGGCAAAAACGGAGTCGATAAGAAAAAGAGAGAGTCTTCTGCACAAGACACATTGTGGGGA GGGGCTTTTAGCAAGAAGGAAGAACAGCAGATAGCTTTCGAAGTCCAAGCTGCTGGTTCTAGCCAGTTC TGCCTGAAGTGCGGATGGTGGTTCCAACTCGGAATGCGTGAGGTTAATCGCGTGCAGGAATCCGGCGTC GTGCTGGATTGGAATCGGAGTATTGTCACATTCCTGATTGAGAGCTCTGGCGAGAAAGTGTATGGGTTCT CCCCTCAGCAACTCGAAAAGGGGTTCAGACCAGACATTGAAACCTTCAAGAAGATGGTTCGGGATTTCA TGCGCCCGCCTATGTTTGACCGGAAGGGTCGCCCAGCAGCTGCCTACGAAAGGTTTGTCTTGGGACGCC GGCATCGGCGGTATAGATTCGACAAGGTTTTTGAAGAACGATTCGGACGATCCGCGCTATTCATTTGCCC GAGGGTTGGCTGTGGCAACTTTGACCACAGCAGCGAGCAGTCAGCCGTAGTGCTGGCTCTAATCGGATA TATTGCCGACAAAGAGGGGATGAGCGGAAAAAAGCTAGTCTACGTGCGTCTGGCAGAACTAATGGCGG AATGGAAATTGAAGAAACTGGAGAGGAGTAGAGTTGAGGAGCAAAGCTCCGCTCAGTGA SEQ ATGGCGGAGTCGAAGCAAATGCAGTGCAGGAAGTGTGGAGCCTCTATGAAGTACGAAGTGATCGGCCT ID CGGGAAGAAAAGCTGCAGATATATGTGTCCCGACTGCGGGAATCACACATCTGCAAGAAAGATTCAGA NO: ATAAGAAGAAAAGGGACAAGAAGTATGGATCTGCCAGTAAAGCACAAAGCCAACGAATCGCAGTTGCA 164 GGGGCCTTATACCCGGATAAAAAGGTTCAGACCATCAAGACTTATAAGTATCCAGCCGACCTGAATGGT GAGGTCCATGACTCAGGGGTGGCCGAAAAAATAGCCCAAGCAATCCAGGAGGATGAAATAGGGCTCCT CGGCCCCTCTTCCGAGTACGCCTGTTGGATCGCTAGCCAGAAACAGAGCGAGCCCTACAGTGTTGTAGA CTTTTGGTTTGACGCTGTGTGCGCCGGAGGCGTGTTCGCCTATTCTGGGGCTAGATTGCTGTCTACCGTCC TGCAGCTATCTGGGGAGGAGAGCGTCCTACGCGCAGCCCTGGCATCCTCCCCTTTTGTCGACGATATCAA TCTGGCACAGGCCGAAAAATTTCTGGCGGTGTCCAGGCGAACCGGCCAAGATAAGCTGGGGAAGCGCA TTGGAGAGTGCTTCGCAGAGGGCCGACTTGAGGCCCTAGGCATCAAGGACCGGATGCGTGAATTTGTCC AGGCTATCGATGTCGCTCAGACCGCTGGGCAGCGTTTTGCCGCGAAACTGAAAATCTTTGGGATTTCTCA GATGCCCGAGGCAAAGCAGTGGAACAATGACAGCGGACTCACCGTGTGCATCCTGCCCGACTATTACGT CCCAGAAGAAAATCGCGCAGATCAGTTGGTCGTCCTGCTAAGACGACTGAGAGAGATAGCATACTGTAT GGGGATCGAAGATGAGGCCGGTTTTGAACATCTTGGAATTGATCCTGGCGCACTATCAAATTTTTCCAAT GGCAATCCTAAACGCGGATTTTTGGGCCGCCTGCTGAACAATGATATTATTGCCTTAGCGAACAACATGT CCGCCATGACGCCTTACTGGGAGGGCAGGAAGGGAGAACTGATTGAAAGATTGGCTTGGCTGAAGCAC CGTGCAGAGGGGCTTTATCTGAAGGAACCGCATTTTGGAAATAGTTGGGCCGACCATAGGTCTAGAATT TTTTCCAGAATAGCCGGGTGGCTTTCTGGGTGCGCTGGGAAGCTAAAGATCGCCAAAGACCAGATCAGC GGAGTGCGTACTGATCTGTTCCTTCTGAAGAGACTGCTGGATGCGGTCCCGCAGTCCGCCCCTTCTCCCG ACTTCATAGCCTCTATCTCTGCCTTGGATCGCTTCCTGGAGGCCGCAGAATCTAGTCAGGATCCTGCCGA ACAGGTGAGGGCCCTATACGCCTTTCATCTGAACGCACCCGCGGTGCGAAGCATCGCCAACAAGGCAGT CCAGCGATCCGACAGCCAAGAATGGCTTATAAAGGAACTGGACGCTGTGGACCACCTGGAGTTTAACAA GGCCTTTCCCTTCTTCTCTGATACGGGAAAGAAGAAAAAGAAAGGGGCTAACTCGAATGGCGCTCCGTC CGAGGAGGAGTACACCGAGACTGAGAGCATCCAGCAGCCCGAGGACGCTGAGCAAGAGGTTAATGGTC AGGAAGGCAACGGGGCCTCGAAGAACCAGAAGAAGTTTCAGAGAATCCCCCGATTCTTCGGCGAGGGG AGTCGCAGCGAGTATCGCATCCTCACTGAAGCCCCGCAGTACTTCGACATGTTCTGTAACAACATGCGG GCCATCTTTATGCAATTAGAATCCCAACCGCGTAAAGCTCCCAGGGATTTTAAGTGTTTCCTGCAGAATC GGCTGCAGAAATTGTATAAGCAGACATTCCTGAACGCTCGATCCAACAAGTGCCGGGCATTACTAGAGT CCGTATTGATTAGTTGGGGAGAGTTTTACACCTACGGGGCTAACGAGAAAAAATTTCGACTGCGTCATG AAGCTTCTGAGCGCTCCTCGGACCCAGATTACGTGGTGCAACAGGCGCTGGAGATCGCTCGGAGGCTGT TTCTCTTCGGCTTTGAGTGGAGGGACTGTAGCGCAGGTGAAAGAGTGGATCTGGTCGAAATACATAAGA AAGCCATATCTTTCCTGTTGGCCATCACTCAGGCTGAGGTGTCTGTGGGCAGCTATAACTGGCTGGGCAA TTCTACCGTGAGTCGGTACCTGTCCGTGGCAGGGACTGATACCCTTTACGGCACCCAGCTGGAAGAATTC TTAAATGCAACCGTGTTATCTCAGATGCGGGGGCTGGCTATCAGGTTATCATCTCAGGAACTGAAGGAT GGATTTGACGTACAGCTGGAGTCTAGTTGCCAGGATAATCTGCAACACTTGCTCGTGTACAGGGCTTCAC GAGACCTTGCCGCCTGCAAGCGCGCTACTTGTCCAGCTGAGTTGGATCCTAAGATTCTGGTACTGCCCGT GGGGGCCTTTATCGCTAGCGTGATGAAAATGATTGAAAGAGGGGATGAGCCTTTAGCTGGAGCTTATCT GAGACACAGACCCCATAGTTTCGGGTGGCAGATCCGCGTTCGAGGTGTGGCAGAGGTGGGAATGGACC AAGGGACCGCCCTGGCGTTCCAGAAACCGACCGAGAGCGAACCCTTCAAGATAAAGCCGTTTTCCGCTC AATACGGCCCCGTTCTATGGCTGAACAGCTCCAGTTATAGCCAGAGCCAGTACCTGGACGGGTTCCTATC ACAGCCCAAGAACTGGAGTATGCGGGTGCTGCCACAGGCCGGCTCAGTGCGGGTAGAACAGCGCGTCG CCTTGATTTGGAATCTCCAGGCCGGAAAGATGAGGCTGGAACGGAGCGGAGCGCGGGCTTTCTTCATGC CCGTCCCATTCAGTTTCCGCCCCAGTGGCAGCGGCGACGAGGCAGTCCTGGCTCCAAATAGGTACCTGG GACTCTTTCCACACAGCGGCGGCATAGAGTACGCTGTGGTCGATGTTCTTGACTCTGCCGGCTTCAAAAT ACTCGAGAGAGGAACAATAGCCGTCAATGGCTTCTCCCAGAAACGAGGAGAAAGACAAGAGGAAGCCC ATCGCGAAAAACAAAGACGCGGTATCTCCGATATTGGGCGCAAGAAGCCAGTCCAGGCCGAAGTCGAT GCGGCCAACGAGCTCCATCGAAAATACACCGATGTTGCTACTCGGCTGGGGTGTCGAATTGTCGTTCAA TGGGCACCCCAACCCAAACCAGGCACTGCGCCGACCGCTCAGACTGTGTACGCTAGGGCCGTGAGGACT GAAGCACCAAGATCCGGCAATCAGGAAGATCACGCCAGGATGAAATCTTCCTGGGGATACACATGGGG TACGTATTGGGAAAAAAGGAAGCCCGAGGACATCCTCGGCATTAGTACCCAGGTGTATTGGACAGGCGG GATCGGCGAGTCCTGCCCGGCTGTCGCCGTCGCGCTATTGGGACACATCAGGGCCACCTCAACCCAGAC TGAATGGGAGAAAGAGGAAGTCGTGTTTGGGCGATTGAAAAAGTTCTTCCCATCCTGA SEQ ATGGAGAAGCGCATCAATAAAATTCGCAAGAAGCTGTCTGCCGATAACGCCACAAAACCAGTTAGTCGA ID AGCGGCCCAATGAAGACCCTGCTAGTTCGAGTGATGACTGATGATCTGAAGAAAAGGCTCGAAAAGCG NO: ACGCAAGAAGCCTGAGGTAATGCCTCAGGTTATAAGTAACAATGCAGCAAACAATCTGCGGATGCTGCT 165 TGACGATTACACAAAGATGAAGGAAGCCATTCTCCAGGTGTATTGGCAGGAGTTCAAGGATGATCACGT AGGCCTGATGTGTAAATTCGCGCAACCTGCAAGCAAGAAGATCGACCAAAACAAGCTGAAACCCGAGA TGGATGAAAAAGGCAATTTAACAACCGCCGGATTCGCTTGTTCCCAGTGTGGGCAGCCACTGTTCGTGT ACAAGTTAGAACAGGTGTCGGAAAAAGGAAAGGCATACACTAACTACTTTGGACGGTGCAATGTTGCA GAACACGAAAAGCTGATACTGCTTGCCCAGCTTAAGCCCGAAAAAGACAGCGACGAAGCGGTGACCTA CAGCCTGGGAAAATTCGGGCAGCGGGCACTGGACTTCTATTCTATCCACGTTACCAAGGAGAGCACCCA CCCAGTGAAGCCGTTGGCCCAAATCGCTGGAAACCGGTACGCCAGCGGACCAGTCGGCAAGGCCCTGTC CGATGCCTGTATGGGCACAATTGCTTCTTTCCTGTCCAAGTACCAGGACATCATAATCGAGCACCAAAAA GTTGTGAAAGGGAATCAGAAACGCCTGGAATCCCTTCGAGAACTGGCCGGCAAGGAGAACCTTGAGTA CCCGTCCGTGACCCTGCCTCCACAGCCACATACCAAAGAGGGCGTAGACGCGTATAATGAGGTCATTGC CCGCGTTCGCATGTGGGTTAATTTAAACCTGTGGCAGAAATTAAAACTAAGCCGAGATGATGCTAAACC GTTACTGAGATTGAAGGGATTCCCTAGCTTTCCTGTGGTGGAGAGAAGGGAAAACGAGGTTGATTGGTG GAATACTATTAATGAGGTGAAAAAGCTTATTGACGCCAAGAGGGATATGGGCAGGGTGTTCTGGAGCGG GGTGACTGCCGAAAAGAGAAATACCATCCTCGAGGGATACAATTACCTCCCCAACGAGAATGATCATAA GAAAAGAGAGGGGAGCTTAGAGAATCCAAAGAAACCTGCAAAGAGGCAATTCGGTGATCTCCTGCTCT ACCTCGAGAAGAAATACGCGGGGGACTGGGGAAAAGTTTTTGACGAAGCCTGGGAGCGCATTGACAAG AAGATCGCCGGGCTGACGTCTCACATTGAACGGGAAGAGGCACGGAATGCAGAGGACGCCCAGTCTAA GGCCGTGCTGACTGACTGGCTGCGCGCAAAGGCCTCCTTCGTGCTCGAACGTCTGAAGGAAATGGATGA GAAAGAGTTTTACGCGTGTGAAATACAGCTGCAGAAGTGGTACGGCGATCTAAGGGGAAATCCCTTCGC AGTGGAAGCCGAGAATAGGGTAGTTGACATCAGTGGGTTCTCCATCGGCAGTGATGGACATTCTATCCA GTATAGAAACCTGCTCGCCTGGAAGTACTTAGAGAACGGCAAGAGAGAGTTCTATCTGCTGATGAACTA CGGGAAAAAAGGTAGAATTCGCTTTACAGATGGCACCGACATAAAGAAGTCCGGAAAGTGGCAAGGCC TCTTATACGGAGGCGGCAAAGCAAAGGTGATAGACTTGACTTTTGACCCTGACGACGAACAGCTGATAA TCTTGCCGCTGGCCTTTGGCACAAGACAAGGTAGGGAATTTATCTGGAATGATCTTCTTTCTCTCGAGAC CGGACTCATCAAGCTCGCAAACGGAAGGGTCATCGAGAAGACAATCTACAATAAAAAGATAGGCCGAG ACGAGCCAGCCCTGTTTGTGGCTTTGACATTTGAGCGGAGAGAGGTCGTAGATCCCAGCAACATCAAAC CCGTGAACCTGATCGGTGTTGACAGGGGCGAGAACATCCCGGCGGTTATCGCACTGACGGATCCAGAAG GATGTCCTCTGCCCGAGTTCAAAGATTCATCGGGAGGGCCAACCGACATTTTGAGGATAGGGGAGGGGT ACAAGGAGAAGCAGCGAGCTATCCAGGCGGCCAAAGAAGTGGAGCAACGAAGAGCTGGTGGTTATTCT CGCAAGTTCGCTTCCAAAAGTCGTAACCTGGCTGACGATATGGTGCGCAATTCTGCCCGTGACCTTTTCT ACCACGCCGTTACACACGACGCCGTGTTAGTGTTTGAAAATCTTAGTCGAGGCTTCGGGCGACAGGGGA AGCGGACCTTTATGACCGAGAGACAGTATACAAAAATGGAGGATTGGCTGACCGCCAAACTGGCGTATG AAGGACTCACATCCAAGACCTATCTCTCAAAAACTTTGGCCCAGTATACATCTAAGACGTGCAGTAACT GTGGCTTCACCATTACCACAGCTGACTACGATGGCATGCTGGTCCGCTTAAAAAAGACATCTGACGGCT GGGCTACTACCCTCAACAATAAAGAGCTCAAAGCCGAAGGACAAATTACCTATTATAACAGGTATAAAA GACAGACTGTCGAGAAGGAGTTGAGCGCGGAGCTGGACCGCCTATCAGAGGAGTCAGGGAACAACGAT ATCTCTAAGTGGACTAAGGGACGCCGAGACGAGGCGTTGTTCTTGCTGAAAAAGCGGTTCTCTCATCGA CCCGTGCAGGAGCAGTTCGTGTGTCTGGACTGCGGCCACGAGGTTCATGCTGATGAGCAAGCTGCTCTA AATATTGCCCGTAGTTGGTTGTTCCTGAACAGCAATTCAACAGAGTTCAAGTCATACAAGAGCGGAAAG CAGCCGTTTGTGGGCGCATGGCAGGCATTTTACAAAAGACGCCTGAAGGAAGTGTGGAAGCCAAACGCC SEQ ATGAAAAGGATTAACAAAATCCGAAGGCGGCTTGTAAAGGATTCTAACACCAAAAAGGCTGGCAAGAC ID GGGGCCCATGAAAACATTACTCGTTAGAGTTATGACCCCCGACCTCAGAGAGCGACTGGAAAATTTACG NO: CAAGAAGCCAGAGAACATACCTCAGCCAATTAGTAATACCTCTCGGGCAAACCTAAACAAGTTGCTTAC 166 TGATTACACGGAGATGAAAAAGGCCATACTGCATGTGTACTGGGAGGAGTTTCAAAAGGACCCTGTCGG GCTAATGAGCAGGGTGGCTCAGCCTGCACCTAAAAACATCGACCAGCGGAAACTCATCCCAGTTAAGGA CGGAAATGAGAGATTGACAAGTTCAGGTTTCGCCTGCTCACAGTGCTGTCAACCGCTGTACGTTTATAAG TTAGAACAAGTGAATGACAAAGGAAAGCCTCACACAAATTATTTTGGCCGGTGTAATGTCTCTGAGCAT GAGCGTCTGATTCTGTTGTCCCCGCATAAACCGGAAGCTAATGACGAGCTCGTAACCTACAGCTTGGGG AAGTTTGGCCAAAGAGCATTGGACTTCTATTCAATCCATGTGACCCGCGAATCCAATCATCCCGTCAAGC CCTTGGAGCAGATAGGGGGCAATAGTTGCGCTTCTGGCCCTGTGGGCAAAGCCCTGTCCGACGCCTGTA TGGGAGCCGTGGCTTCATTCCTGACCAAATATCAGGATATCATCTTGGAGCACCAGAAAGTGATCAAGA AAAATGAAAAAAGGTTAGCAAACCTCAAGGATATTGCAAGCGCTAACGGCTTGGCTTTTCCTAAAATCA CACTTCCACCTCAGCCTCACACAAAGGAAGGCATCGAGGCATACAACAATGTGGTGGCCCAGATCGTCA TCTGGGTTAACTTAAACCTGTGGCAGAAACTTAAAATTGGCAGGGATGAGGCAAAACCCTTACAGCGCC TGAAAGGATTCCCCAGCTTTCCACTGGTGGAGCGCCAGGCTAACGAAGTGGACTGGTGGGATATGGTGT GTAACGTCAAGAAGCTCATCAATGAAAAGAAAGAGGACGGTAAAGTCTTCTGGCAGAACCTCGCCGGTT ACAAACGGCAGGAGGCGCTGTTACCTTATCTGTCGAGTGAAGAGGACCGGAAAAAAGGCAAGAAATTT GCTCGTTATCAGTTTGGTGATTTGCTCCTACATTTGGAGAAGAAGCACGGCGAGGACTGGGGAAAAGTA TACGATGAGGCCTGGGAGAGGATTGACAAAAAGGTGGAGGGACTGTCAAAGCACATCAAGCTCGAAGA AGAGCGCAGAAGCGAGGACGCCCAATCCAAAGCAGCGCTGACTGACTGGCTGCGGGCGAAGGCCAGTT TTGTAATCGAAGGCCTTAAAGAAGCCGACAAGGATGAATTCTGCAGATGCGAATTAAAACTCCAGAAGT GGTACGGCGATCTCCGAGGTAAGCCTTTCGCAATCGAGGCCGAGAATTCCATACTGGACATTAGTGGAT TCAGTAAACAGTATAATTGTGCCTTTATATGGCAGAAGGATGGTGTCAAGAAACTCAACCTGTACCTTAT TATTAATTATTTCAAAGGCGGGAAACTGAGATTTAAGAAGATAAAGCCTGAAGCCTTTGAGGCGAACCG ATTCTACACAGTTATTAACAAGAAATCTGGTGAAATTGTACCCATGGAGGTAAACTTCAACTTCGATGAT CCCAATCTGATTATATTGCCACTAGCTTTTGGCAAGCGGCAGGGTAGGGAATTCATTTGGAACGATTTGC TTTCACTGGAAACAGGGTCCCTTAAGCTGGCAAACGGGAGAGTGATTGAAAAGACATTGTACAATCGGA GGACACGTCAGGATGAACCTGCCCTTTTCGTGGCTCTGACATTCGAGCGCAGGGAGGTTCTGGACTCTA GCAATATCAAGCCAATGAACCTGATCGGCATAGACCGAGGAGAGAATATTCCGGCTGTGATCGCACTCA CCGATCCCGAAGGATGTCCCCTTTCTCGGTTCAAGGACTCCTTAGGCAATCCAACTCATATCCTGAGAAT CGGCGAGTCATACAAGGAGAAGCAGCGAACAATTCAGGCCGCCAAGGAAGTCGAGCAGAGGCGAGCTG GCGGCTACAGCCGTAAATACGCTAGTAAAGCTAAGAACCTGGCCGACGATATGGTGCGCAATACTGCTA GAGACCTGCTGTACTATGCAGTGACGCAGGACGCAATGCTGATATTCGAGAATCTGTCCAGAGGATTCG GAAGGCAGGGCAAGCGGACGTTCATGGCCGAGCGCCAGTATACAAGGATGGAGGATTGGTTAACGGCC AAGCTTGCCTATGAGGGGCTACCTAGTAAGACCTATCTGTCTAAGACGCTGGCTCAATACACCAGTAAG ACCTGCTCAAACTGTGGCTTTACAATCACTTCTGCTGATTATGATAGAGTGCTCGAGAAGCTAAAAAAA ACTGCCACCGGCTGGATGACTACTATTAATGGGAAGGAACTGAAAGTGGAAGGACAGATTACCTATTAT AATCGCTACAAGCGTCAAAACGTCGTCAAGGACCTGTCGGTGGAATTGGACAGACTCAGTGAAGAGTCC GTGAACAATGATATCAGCTCCTGGACAAAAGGGCGCAGTGGGGAGGCACTCAGCTTGCTTAAAAAGAG GTTTTCACATCGGCCGGTCCAGGAGAAATTTGTCTGCCTGAACTGCGGATTCGAGACACACGCCGACGA GCAGGCAGCACTGAACATTGCCAGATCCTGGCTGTTCCTTAGGTCCCAGGAATATAAGAAGTACCAGAC TAACAAAACCACGGGAAACACAGATAAAAGGGCCTTTGTCGAAACTTGGCAATCCTTTTACCGGAAGAA GTTAAAGGAAGTGTGGAAGCCC SEQ ATGGATAAGAAATACTCAATAGGCTTAGCAATCGGCACAAATAGCGTCGGATGGGCGGTGATCACTGAT ID GAATATAAGGTTCCGTCTAAAAAGTTCAAGGTTCTGGGAAATACAGACCGCCACAGTATCAAAAAAAAT NO: CTTATAGGGGCTCTTTTATTTGACAGTGGAGAGACAGCGGAAGCGACTCGTCTCAAACGGACAGCTCGT 167 AGAAGGTATACACGTCGGAAGAATCGTATTTGTTATCTACAGGAGATTTTTTCAAATGAGATGGCGAAA GTAGATGATAGTTTCTTTCATCGACTTGAAGAGTCTTTTTTGGTGGAAGAAGACAAGAAGCATGAACGTC ATCCTATTTTTGGAAATATAGTAGATGAAGTTGCTTATCATGAGAAATATCCAACTATCTATCATCTGCG AAAAAAATTGGTAGATTCTACTGATAAAGCGGATTTGCGCTTAATCTATTTGGCCTTAGCGCATATGATT AAGTTTCGTGGTCATTTTTTGATTGAGGGAGATTTAAATCCTGATAATAGTGATGTGGACAAACTATTTA TCCAGTTGGTACAAACCTACAATCAATTATTTGAAGAAAACCCTATTAACGCAAGTGGAGTAGATGCTA AAGCGATTCTTTCTGCACGATTGAGTAAATCAAGACGATTAGAAAATCTCATTGCTCAGCTCCCCGGTGA GAAGAAAAATGGCTTATTTGGGAATCTCATTGCTTTGTCATTGGGTTTGACCCCTAATTTTAAATCAAAT TTTGATTTGGCAGAAGATGCTAAATTACAGCTTTCAAAAGATACTTACGATGATGATTTAGATAATTTAT TGGCGCAAATTGGAGATCAATATGCTGATTTGTTTTTGGCAGCTAAGAATTTATCAGATGCTATTTTACT TTCAGATATCCTAAGAGTAAATACTGAAATAACTAAGGCTCCCCTATCAGCTTCAATGATTAAACGCTAC GATGAACATCATCAAGACTTGACTCTTTTAAAAGCTTTAGTTCGACAACAACTTCCAGAAAAGTATAAA GAAATCTTTTTTGATCAATCAAAAAACGGATATGCAGGTTATATTGATGGGGGAGCTAGCCAAGAAGAA TTTTATAAATTTATCAAACCAATTTTAGAAAAAATGGATGGTACTGAGGAATTATTGGTGAAACTAAATC GTGAAGATTTGCTGCGCAAGCAACGGACCTTTGACAACGGCTCTATTCCCCATCAAATTCACTTGGGTGA GCTGCATGCTATTTTGAGAAGACAAGAAGACTTTTATCCATTTTTAAAAGACAATCGTGAGAAGATTGA AAAAATCTTGACTTTTCGAATTCCTTATTATGTTGGTCCATTGGCGCGTGGCAATAGTCGTTTTGCATGGA TGACTCGGAAGTCTGAAGAAACAATTACCCCATGGAATTTTGAAGAAGTTGTCGATAAAGGTGCTTCAG CTCAATCATTTATTGAACGCATGACAAACTTTGATAAAAATCTTCCAAATGAAAAAGTACTACCAAAAC ATAGTTTGCTTTATGAGTATTTTACGGTTTATAACGAATTGACAAAGGTCAAATATGTTACTGAAGGAAT GCGAAAACCAGCATTTCTTTCAGGTGAACAGAAGAAAGCCATTGTTGATTTACTCTTCAAAACAAATCG AAAAGTAACCGTTAAGCAATTAAAAGAAGATTATTTCAAAAAAATAGAATGTTTTGATAGTGTTGAAAT TTCAGGAGTTGAAGATAGATTTAATGCTTCATTAGGTACCTACCATGATTTGCTAAAAATTATTAAAGAT AAAGATTTTTTGGATAATGAAGAAAATGAAGATATCTTAGAGGATATTGTTTTAACATTGACCTTATTTG AAGATAGGGAGATGATTGAGGAAAGACTTAAAACATATGCTCACCTCTTTGATGATAAGGTGATGAAAC AGCTTAAACGTCGCCGTTATACTGGTTGGGGACGTTTGTCTCGAAAATTGATTAATGGTATTAGGGATAA GCAATCTGGCAAAACAATATTAGATTTTTTGAAATCAGATGGTTTTGCCAATCGCAATTTTATGCAGCTG ATCCATGATGATAGTTTGACATTTAAAGAAGACATTCAAAAAGCACAAGTGTCTGGACAAGGCGATAGT TTACATGAACATATTGCAAATTTAGCTGGTAGCCCTGCTATTAAAAAAGGTATTTTACAGACTGTAAAAG TTGTTGATGAATTGGTCAAAGTAATGGGGCGGCATAAGCCAGAAAATATCGTTATTGAAATGGCACGTG AAAATCAGACAACTCAAAAGGGCCAGAAAAATTCGCGAGAGCGTATGAAACGAATCGAAGAAGGTATC AAAGAATTAGGAAGTCAGATTCTTAAAGAGCATCCTGTTGAAAATACTCAATTGCAAAATGAAAAGCTC TATCTCTATTATCTCCAAAATGGAAGAGACATGTATGTGGACCAAGAATTAGATATTAATCGTTTAAGTG ATTATGATGTCGATCACATTGTTCCACAAAGTTTCCTTAAAGACGATTCAATAGACAATAAGGTCTTAAC GCGTTCTGATAAAAATCGTGGTAAATCGGATAACGTTCCAAGTGAAGAAGTAGTCAAAAAGATGAAAA ACTATTGGAGACAACTTCTAAACGCCAAGTTAATCACTCAACGTAAGTTTGATAATTTAACGAAAGCTG AACGTGGAGGTTTGAGTGAACTTGATAAAGCTGGTTTTATCAAACGCCAATTGGTTGAAACTCGCCAAA TCACTAAGCATGTGGCACAAATTTTGGATAGTCGCATGAATACTAAATACGATGAAAATGATAAACTTA TTCGAGAGGTTAAAGTGATTACCTTAAAATCTAAATTAGTTTCTGACTTCCGAAAAGATTTCCAATTCTA TAAAGTACGTGAGATTAACAATTACCATCATGCCCATGATGCGTATCTAAATGCCGTCGTTGGAACTGCT TTGATTAAGAAATATCCAAAACTTGAATCGGAGTTTGTCTATGGTGATTATAAAGTTTATGATGTTCGTA AAATGATTGCTAAGTCTGAGCAAGAAATAGGCAAAGCAACCGCAAAATATTTCTTTTACTCTAATATCA TGAACTTCTTCAAAACAGAAATTACACTTGCAAATGGAGAGATTCGCAAACGCCCTCTAATCGAAACTA ATGGGGAAACTGGAGAAATTGTCTGGGATAAAGGGCGAGATTTTGCCACAGTGCGCAAAGTATTGTCCA TGCCCCAAGTCAATATTGTCAAGAAAACAGAAGTACAGACAGGCGGATTCTCCAAGGAGTCAATTTTAC CAAAAAGAAATTCGGACAAGCTTATTGCTCGTAAAAAAGACTGGGATCCAAAAAAATATGGTGGTTTTG ATAGTCCAACGGTAGCTTATTCAGTCCTAGTGGTTGCTAAGGTGGAAAAAGGGAAATCGAAGAAGTTAA AATCCGTTAAAGAGTTACTAGGGATCACAATTATGGAAAGAAGTTCCTTTGAAAAAAATCCGATTGACT TTTTAGAAGCTAAAGGATATAAGGAAGTTAAAAAAGACTTAATCATTAAACTACCTAAATATAGTCTTTT TGAGTTAGAAAACGGTCGTAAACGGATGCTGGCTAGTGCCGGAGAATTACAAAAAGGAAATGAGCTGG CTCTGCCAAGCAAATATGTGAATTTTTTATATTTAGCTAGTCATTATGAAAAGTTGAAGGGTAGTCCAGA AGATAACGAACAAAAACAATTGTTTGTGGAGCAGCATAAGCATTATTTAGATGAGATTATTGAGCAAAT CAGTGAATTTTCTAAGCGTGTTATTTTAGCAGATGCCAATTTAGATAAAGTTCTTAGTGCATATAACAAA CATAGAGACAAACCAATACGTGAACAAGCAGAAAATATTATTCATTTATTTACGTTGACGAATCTTGGA GCTCCCGCTGCTTTTAAATATTTTGATACAACAATTGATCGTAAACGATATACGTCTACAAAAGAAGTTT TAGATGCCACTCTTATCCATCAATCCATCACTGGTCTTTATGAAACACGCATTGATTTGAGTCAGCTAGG AGGTGACTGA SEQ ATGGATAAGAAGTATTCAATTGGACTTGCGATTGGCACTAACAGTGTGGGCTGGGCGGTGATTACAGAC ID GAGTATAAGGTGCCGTCAAAAAAGTTTAAAGTTCTGGGCAACACTGATCGCCATTCCATCAAGAAAAAC NO: CTAATCGGGGCCCTTCTTTTTGATAGTGGCGAAACGGCCGAGGCGACGCGTCTAAAACGTACCGCGCGG 168 CGTCGCTACACCCGACGAAAAAACCGTATTTGTTACCTTCAGGAGATCTTCAGTAACGAAATGGCTAAG GTGGACGATTCATTCTTCCACCGTCTGGAGGAGTCCTTTTTAGTTGAAGAAGACAAGAAGCATGAGCGA CACCCAATTTTTGGTAACATTGTCGACGAAGTCGCCTATCACGAAAAATATCCGACCATTTATCACCTGC GCAAAAAACTGGTCGATAGCACGGATAAAGCGGATCTGCGGCTTATTTACCTGGCGCTTGCCCACATGA TCAAGTTCCGCGGCCACTTCCTGATAGAAGGAGACCTGAACCCGGATAATAGCGATGTAGACAAACTGT TTATTCAGCTGGTCCAGACCTACAACCAGCTGTTTGAAGAAAATCCGATTAATGCGTCAGGCGTGGATG CGAAAGCGATACTGAGTGCCCGCCTGTCGAAATCTCGCCGTCTCGAAAATCTGATTGCACAGCTGCCCG GCGAAAAAAAAAACGGTCTTTTTGGCAATCTGATCGCGCTGTCACTGGGCCTGACACCAAATTTTAAGA GCAACTTCGACCTGGCAGAGGATGCGAAGCTTCAACTGTCGAAGGACACCTATGACGATGATCTGGATA ATCTTCTGGCACAAATCGGTGATCAGTATGCGGATTTATTCCTTGCAGCGAAAAACCTATCTGACGCAAT TCTGTTGAGCGATATCCTCCGCGTCAACACCGAAATCACTAAAGCCCCCCTGTCAGCGTCGATGATTAAA CGTTATGATGAGCACCATCAGGATCTGACCTTGCTAAAGGCGCTGGTGCGACAGCAGCTTCCCGAAAAA TATAAAGAGATCTTTTTTGATCAATCGAAGAATGGTTATGCCGGATACATTGATGGCGGAGCCAGTCAG GAAGAATTTTACAAATTCATCAAACCGATCCTGGAAAAAATGGATGGCACAGAAGAACTGCTTGTGAAA TTGAACCGGGAAGATTTACTGCGCAAACAGCGTACGTTCGACAACGGCTCCATACCCCATCAGATTCAC TTAGGTGAGCTGCATGCAATACTCCGTCGCCAGGAAGATTTTTATCCATTTTTAAAAGACAACCGTGAGA AGATTGAAAAAATTTTAACTTTTCGTATTCCATATTACGTCGGGCCTTTGGCCCGAGGTAACTCTCGATT CGCCTGGATGACGAGAAAAAGCGAGGAGACCATCACTCCGTGGAATTTTGAAGAGGTTGTTGATAAAG GCGCGAGCGCCCAGTCGTTTATCGAACGTATGACCAACTTTGATAAAAATCTGCCGAATGAAAAAGTGC TTCCGAAGCATTCTCTGTTGTATGAATATTTCACTGTGTACAATGAGTTAACGAAAGTGAAATATGTGAC CGAAGGCATGCGGAAACCTGCTTTTCTGTCCGGAGAACAGAAAAAAGCAATTGTGGACCTGCTGTTCAA AACGAACCGGAAAGTAACTGTGAAGCAGCTGAAAGAGGACTACTTCAAAAAAATCGAATGCTTCGACT CAGTAGAGATCTCTGGTGTTGAAGATCGCTTCAACGCGAGTCTGGGAACGTACCATGATTTGTTGAAAA TCATCAAAGATAAAGACTTTCTGGATAACGAAGAGAATGAGGACATTCTTGAAGATATTGTTTTGACAC TGACTCTGTTTGAGGATCGCGAAATGATTGAAGAGCGCCTGAAAACGTATGCCCATTTATTCGATGACA AAGTCATGAAGCAGCTGAAACGTCGCCGCTATACTGGGTGGGGCAGACTTTCACGTAAATTGATCAATG GTATAAGAGACAAACAGAGCGGCAAAACTATCTTAGATTTCCTGAAGAGTGATGGATTTGCCAACCGGA ATTTTATGCAGCTTATACATGATGACTCGCTAACGTTTAAAGAAGACATTCAGAAGGCGCAGGTCAGCG GCCAGGGTGATTCGCTGCATGAACACATTGCAAATCTTGCCGGATCGCCAGCGATCAAAAAAGGCATCC TTCAGACAGTAAAAGTTGTGGATGAACTGGTGAAAGTAATGGGTCGTCACAAGCCAGAAAATATTGTGA TCGAAATGGCCCGGGAAAATCAGACTACTCAAAAAGGTCAGAAAAATTCTCGCGAGCGTATGAAACGT ATTGAAGAAGGCATCAAAGAGCTAGGCAGCCAGATATTAAAGGAACATCCGGTTGAGAACACTCAGCT GCAGAATGAAAAACTGTATCTGTATTATCTTCAGAACGGCCGTGACATGTATGTTGATCAAGAACTGGA TATCAATCGCTTGTCCGATTATGACGTGGATCATATTGTTCCGCAAAGCTTTCTGAAAGACGATTCTATT GACAATAAAGTACTGACACGTTCGGACAAAAACCGTGGTAAAAGCGATAACGTACCGTCGGAAGAAGT TGTTAAGAAAATGAAAAATTATTGGCGCCAACTCCTGAATGCTAAATTGATTACCCAGCGGAAATTTGA TAACTTAACCAAAGCCGAGCGGGGTGGCTTAAGTGAACTGGATAAAGCGGGTTTTATTAAACGCCAACT GGTAGAAACCCGCCAGATAACGAAACATGTAGCTCAAATCCTCGATAGTCGCATGAATACGAAATATGA CGAAAATGATAAATTGATCCGTGAAGTAAAAGTGATTACTCTTAAAAGCAAATTGGTATCTGATTTTCG GAAAGATTTCCAATTCTATAAGGTGAGAGAAATTAACAATTACCATCATGCACATGATGCGTATTTAAA TGCAGTTGTTGGCACCGCCTTAATCAAAAAATATCCGAAATTAGAATCTGAGTTCGTGTATGGTGATTAT AAAGTTTATGATGTTCGAAAAATGATTGCTAAGTCTGAACAGGAAATCGGCAAAGCGACCGCAAAGTAT TTTTTTTATAGCAATATTATGAATTTTTTTAAAACTGAGATTACCCTGGCGAATGGCGAAATTCGCAAAC GTCCTCTGATTGAAACCAATGGCGAAACCGGCGAGATAGTATGGGACAAGGGCCGTGATTTTGCGACCG TCCGGAAAGTCCTGTCAATGCCGCAGGTGAATATTGTCAAGAAAACAGAAGTTCAGACAGGCGGTTTTA GTAAAGAGTCTATTCTGCCCAAACGTAATTCGGATAAATTGATTGCCCGCAAGAAAGATTGGGATCCGA AGAAATATGGTGGATTCGATTCTCCGACGGTCGCCTATAGCGTTCTAGTCGTCGCCAAGGTCGAAAAAG GTAAATCCAAAAAACTGAAATCTGTGAAAGAACTGTTAGGCATTACAATCATGGAACGTAGTAGTTTTG AAAAGAACCCGATCGACTTCCTCGAGGCGAAAGGCTACAAAGAAGTCAAGAAGGATTTGATTATTAAA CTCCCAAAATATTCATTATTTGAGTTAGAAAACGGTAGGAAGCGTATGCTGGCGAGTGCTGGGGAATTA CAGAAAGGGAATGAGTTAGCACTGCCGTCAAAATATGTGAACTTTCTGTATCTGGCCTCCCATTACGAG AAACTGAAAGGTAGCCCGGAAGATAATGAACAGAAACAACTATTTGTCGAGCAACACAAACATTATCT GGATGAAATTATTGAACAGATTAGTGAATTCTCTAAACGTGTTATTTTAGCGGATGCCAACCTTGACAAG GTGCTGAGCGCATATAATAAACACCGTGATAAACCCATTCGTGAACAGGCTGAAAATATCATACATCTG TTCACGTTAACCAACTTGGGAGCTCCTGCCGCTTTTAAATATTTCGATACCACAATTGACCGCAAACGTT ATACGTCTACAAAAGAGGTGCTCGATGCGACCCTGATCCACCAGTCTATTACAGGCCTGTATGAAACTC GTATCGACCTGTCACAACTGGGCGGCGACTGA SEQ ATGGACAAGAAATATTCAATCGGTTTAGCAATAGGAACTAACTCAGTAGGTTGGGCTGTAATTACAGAC ID GAATACAAGGTACCGTCCAAAAAGTTTAAGGTGTTGGGGAACACAGATAGACACTCTATAAAAAAAAA NO: TTTAATAGGCGCTTTACTTTTCGATTCAGGCGAAACTGCAGAAGCGACACGTCTGAAGAGAACCGCTAG 169 ACGTAGATACACGAGGAGAAAGAACAGAATATGTTACCTACAAGAAATTTTTTCTAATGAGATGGCTAA GGTGGATGATTCGTTTTTTCATAGACTCGAAGAATCTTTCTTAGTTGAAGAAGATAAAAAACACGAAAG GCATCCTATCTTTGGAAACATAGTTGATGAGGTGGCTTACCATGAAAAATATCCCACTATATATCACCTT AGAAAAAAGTTGGTTGATTCAACCGACAAAGCGGATCTAAGGTTAATTTACCTCGCGTTGGCTCACATG ATAAAATTTAGAGGACATTTCTTGATCGAAGGTGATTTAAATCCCGATAACTCTGATGTAGATAAACTGT TCATCCAGTTGGTTCAAACATATAATCAGTTGTTCGAAGAGAACCCCATTAACGCATCAGGTGTTGATGC TAAAGCAATCTTATCAGCAAGGTTGAGCAAGAGCAGACGTCTGGAAAACTTGATTGCCCAATTGCCAGG TGAAAAGAAGAACGGTCTTTTTGGAAATTTAATTGCACTTTCACTTGGGTTGACACCGAATTTTAAAAGC AATTTCGACCTCGCTGAGGATGCTAAACTCCAGTTATCTAAGGATACATATGACGATGATTTGGATAATC TATTGGCCCAGATAGGTGATCAGTATGCAGATTTGTTTTTGGCAGCTAAGAATTTATCAGATGCAATTCT ACTGAGCGATATTTTAAGGGTGAATACAGAAATAACTAAAGCACCTTTGTCTGCATCTATGATAAAAAG ATACGATGAACACCATCAAGATCTCACACTATTAAAAGCTTTAGTTAGACAACAATTACCAGAAAAATA TAAAGAAATCTTTTTCGATCAGTCCAAGAACGGATACGCCGGCTATATAGATGGCGGTGCCTCCCAAGA AGAATTTTACAAATTTATCAAACCCATTTTGGAAAAGATGGATGGTACTGAAGAATTATTGGTCAAATTA AACAGGGAAGATTTATTAAGAAAACAAAGGACCTTTGATAATGGTTCTATTCCACACCAAATCCATCTA GGGGAATTACATGCGATTCTTAGAAGACAAGAAGATTTTTATCCATTCTTGAAAGATAACAGGGAAAAG ATAGAGAAAATCTTAACTTTTAGAATTCCCTACTACGTCGGGCCCTTAGCTAGGGGGAATTCTAGATTCG CCTGGATGACACGCAAATCAGAAGAAACAATTACGCCTTGGAATTTTGAAGAAGTTGTTGATAAAGGAG CCTCTGCTCAATCTTTTATTGAACGAATGACCAATTTTGATAAGAATTTACCCAATGAAAAGGTCTTACC CAAACATTCACTCCTATACGAGTACTTTACTGTTTACAATGAGTTGACAAAAGTGAAGTATGTTACCGAG GGTATGCGAAAACCTGCTTTCTTGAGTGGTGAACAAAAGAAGGCCATTGTTGACTTGTTATTCAAAACTA ACAGAAAGGTCACTGTGAAGCAGCTTAAAGAAGATTATTTCAAAAAGATCGAATGTTTCGACTCGGTAG AAATTAGTGGTGTGGAAGATAGATTTAATGCTTCTCTTGGAACATATCATGATCTACTAAAGATCATCAA AGATAAAGATTTCTTGGACAATGAAGAAAATGAAGATATTCTTGAAGACATCGTGTTGACACTTACATT GTTTGAGGACAGAGAAATGATTGAAGAAAGGCTGAAGACCTACGCCCATTTGTTTGATGATAAAGTCAT GAAACAGTTAAAGAGGAGAAGGTATACCGGATGGGGTAGGCTGTCTCGCAAATTGATTAATGGTATTCG TGATAAACAATCGGGTAAAACAATCCTAGATTTCCTGAAGTCCGATGGTTTCGCCAACAGGAATTTTATG CAATTGATTCATGACGATTCTTTGACTTTTAAAGAGGATATTCAGAAAGCACAGGTCTCAGGACAGGGC GATTCACTCCATGAACATATAGCTAACCTGGCTGGCTCCCCTGCTATTAAGAAAGGTATCTTGCAAACCG TCAAAGTAGTAGACGAACTTGTTAAAGTTATGGGAAGACACAAACCTGAAAATATCGTTATTGAAATGG CTCGCGAAAACCAGACAACACAAAAGGGTCAAAAGAATTCGAGAGAGAGAATGAAGCGTATCGAAGA AGGTATTAAAGAACTTGGGTCCCAAATACTTAAAGAACATCCAGTAGAAAACACTCAGCTTCAAAATGA AAAATTATACTTATATTATCTTCAGAATGGCCGCGATATGTATGTTGACCAAGAGTTAGATATAAATAGG TTGTCTGATTACGACGTGGATCATATTGTACCTCAATCTTTTCTAAAAGATGATTCAATTGATAATAAGG TATTAACGAGAAGTGATAAAAATAGAGGTAAATCTGACAACGTGCCAAGCGAAGAGGTGGTGAAGAAA ATGAAAAATTATTGGCGTCAACTGTTGAACGCCAAGTTAATTACGCAGAGAAAGTTTGATAATCTAACA AAAGCTGAAAGAGGAGGCCTATCTGAGTTAGATAAGGCCGGTTTTATCAAACGTCAGTTAGTTGAAACC AGGCAAATCACGAAGCACGTTGCCCAAATTCTAGATTCAAGGATGAATACCAAATACGATGAAAACGAT AAACTGATTCGGGAAGTCAAGGTTATAACTCTAAAAAGCAAACTAGTTTCAGATTTTCGCAAAGATTTTC AATTTTACAAAGTTCGAGAAATCAATAATTATCATCATGCTCACGACGCGTACTTGAACGCGGTCGTTGG TACAGCTTTAATAAAGAAATATCCTAAACTGGAATCGGAATTTGTATATGGGGATTACAAAGTATACGA CGTGAGAAAGATGATCGCTAAATCTGAACAAGAAATTGGGAAAGCAACTGCCAAATATTTTTTTTACAG CAACATAATGAATTTTTTTAAAACGGAAATTACATTGGCAAATGGCGAAATTAGAAAGCGCCCATTGAT AGAGACCAATGGAGAGACTGGGGAAATCGTGTGGGATAAAGGACGTGATTTTGCCACAGTGAGGAAAG TGTTAAGTATGCCACAAGTTAATATTGTAAAAAAGACCGAGGTCCAAACGGGTGGATTTAGCAAAGAAT CAATTTTACCTAAGAGAAATTCAGATAAATTAATTGCCCGCAAAAAGGATTGGGATCCTAAAAAATATG GTGGTTTTGATTCCCCAACAGTTGCTTACTCCGTCCTAGTTGTTGCTAAGGTTGAAAAAGGAAAGTCTAA GAAACTTAAATCCGTAAAAGAGTTACTGGGAATTACAATAATGGAAAGATCCTCTTTCGAAAAGAACCC TATTGACTTCTTGGAGGCGAAAGGTTATAAAGAAGTCAAAAAAGATTTGATCATAAAACTACCAAAGTA TTCTCTATTTGAATTGGAAAACGGCAGAAAAAGGATGTTGGCAAGCGCTGGTGAACTACAAAAGGGTAA CGAATTGGCATTGCCGAGTAAATACGTGAATTTTCTATATTTGGCATCACATTACGAAAAGTTAAAGGG ATCACCCGAGGATAACGAGCAGAAACAACTGTTTGTTGAACAACACAAACATTATCTTGATGAAATTAT AGAACAAATTAGTGAGTTCAGTAAGAGAGTTATTTTAGCCGATGCAAATTTAGACAAAGTTTTATCTGCT TATAACAAACATAGAGATAAGCCTATAAGGGAACAAGCCGAAAATATTATTCATTTGTTTACGTTAACA AATTTAGGGGCACCAGCAGCATTCAAGTACTTCGATACGACTATCGATCGTAAGCGTTACACATCTACC AAAGAAGTTCTTGATGCAACTTTGATTCATCAATCTATAACAGGCTTATATGAAACTAGAATCGATCTGT CACAACTTGGTGGTGACTAA SEQ ATGGACAAGAAGTACTCAATTGGGCTTGCTATCGGCACTAACAGCGTTGGCTGGGCGGTCATCACAGAC ID GAATATAAGGTCCCATCAAAGAAATTCAAAGTCCTTGGCAATACGGACCGACATTCAATCAAGAAGAAC NO: CTGATTGGAGCTCTGCTGTTTGATTCCGGTGAAACCGCCGAGGCAACACGATTGAAACGTACCGCTCGT 170 AGGAGGTATACGCGGCGGAAAAATAGGATCTGCTATCTGCAGGAAATATTTAGCAACGAAATGGCCAA GGTAGACGACAGCTTCTTCCACCGGCTCGAGGAATCTTTCCTCGTGGAAGAAGACAAAAAGCACGAGCG CCACCCCATTTTCGGCAATATCGTGGACGAGGTAGCTTACCATGAAAAGTATCCAACTATTTACCACTTA CGTAAGAAGTTAGTGGACAGCACCGATAAAGCCGACCTTCGCCTGATTTACCTAGCACTTGCACACATG ATTAAGTTCCGAGGCCACTTCTTGATAGAGGGAGACCTGAATCCTGACAATTCCGATGTGGATAAATTGT TCATCCAGCTGGTACAGACATACAATCAGTTGTTTGAGGAAAATCCGATTAATGCCAGTGGCGTGGACG CCAAGGCTATCCTGTCTGCTCGGCTTAGTAAGAGTAGACGCCTGGAAAATCTAATCGCACAGCTGCCCG GCGAAAAGAAAAATGGACTGTTCGGTAATTTGATCGCCCTGAGCCTGGGCCTCACCCCTAACTTTAAGT CTAACTTCGACCTGGCCGAAGATGCTAAGCTCCAGCTGTCCAAAGATACTTACGATGACGATCTCGATA ATCTACTGGCTCAGATCGGGGACCAGTACGCTGACCTGTTTCTAGCTGCCAAGAACCTCAGTGACGCCAT TCTCCTGTCCGATATTCTGAGGGTTAACACTGAAATTACAAAGGCCCCGCTGAGCGCGAGCATGATCAA AAGGTACGACGAGCATCACCAGGACCTCACGCTGCTGAAGGCCTTAGTCAGACAGCAACTGCCCGAAA AGTACAAAGAAATCTTTTTCGACCAATCCAAGAACGGGTACGCCGGCTACATTGATGGCGGGGCTTCAC AAGAGGAGTTTTACAAGTTTATCAAGCCCATCCTGGAGAAAATGGACGGCACTGAAGAACTGCTTGTGA AACTCAATAGGGAAGACTTACTGAGGAAACAGCGCACATTCGATAATGGCTCCATACCCCACCAAATCC ATCTGGGAGAGTTGCATGCCATCTTGCGAAGGCAGGAGGACTTCTACCCCTTTCTTAAGGACAACAGGG AGAAAATCGAGAAAATTCTGACTTTCCGTATCCCCTACTACGTGGGCCCACTTGCTCGCGGAAACTCACG ATTCGCATGGATGACCAGAAAGTCCGAGGAAACAATTACACCCTGGAATTTTGAGGAGGTAGTAGACAA GGGAGCCAGCGCTCAATCTTTCATTGAGAGGATGACGAATTTCGACAAGAACCTTCCAAACGAGAAAGT GCTTCCTAAGCACAGCCTGCTGTATGAGTATTTCACGGTGTACAACGAACTTACGAAGGTCAAGTATGTG ACAGAGGGTATGCGGAAACCTGCTTTTCTGTCTGGTGAACAGAAGAAAGCTATCGTCGATCTCCTGTTTA AAACCAACCGAAAGGTGACGGTGAAACAGTTGAAGGAGGATTACTTCAAGAAGATCGAGTGTTTTGATT CTGTTGAAATTTCTGGGGTCGAGGATAGATTCAACGCCAGCCTGGGCACCTACCATGATTTGCTGAAGAT TATCAAGGATAAGGATTTTCTGGATAATGAGGAGAATGAAGACATTTTGGAGGATATAGTGCTGACCCT CACCCTGTTCGAGGACCGGGAGATGATCGAGGAGAGACTGAAAACATACGCTCACCTGTTTGACGACAA GGTCATGAAGCAGCTTAAGAGACGCCGTTACACAGGCTGGGGAAGATTATCCCGCAAATTAATCAACGG GATACGCGATAAACAAAGTGGCAAGACCATACTCGACTTCCTAAAGAGCGATGGATTCGCAAATCGCAA TTTCATGCAGTTGATCCACGACGATAGCCTGACCTTCAAAGAGGACATTCAGAAAGCGCAGGTGAGTGG TCAAGGGGATTCCCTGCACGAACACATTGCTAACTTGGCTGGATCACCAGCCATTAAGAAAGGCATACT GCAGACCGTTAAAGTGGTAGATGAGCTTGTGAAAGTCATGGGAAGACATAAGCCAGAGAACATAGTGA TCGAAATGGCCAGGGAAAATCAGACCACGCAAAAGGGGCAGAAGAACTCAAGAGAGCGTATGAAGAG GATCGAGGAGGGCATCAAGGAGCTGGGTAGCCAGATCCTTAAAGAGCACCCAGTTGAGAATACCCAGC TGCAGAATGAGAAACTTTATCTCTATTATCTCCAGAACGGAAGGGATATGTATGTCGACCAGGAACTGG ACATCAATCGGCTGAGTGATTATGACGTCGACCACATTGTGCCTCAAAGCTTTCTGAAGGATGATTCCAT CGACAATAAAGTTCTGACCCGGTCTGATAAAAATAGAGGCAAATCCGACAACGTACCTAGCGAAGAAG TCGTCAAAAAAATGAAGAACTATTGGAGGCAGTTGCTGAATGCCAAGCTGATTACACAACGCAAGTTTG ACAATCTCACCAAGGCAGAAAGGGGGGGCCTGTCAGAACTCGACAAAGCAGGTTTCATTAAAAGGCAG CTAGTTGAAACTAGGCAGATTACTAAGCACGTGGCCCAGATCCTCGACTCACGGATGAATACAAAGTAT GATGAGAATGATAAGCTAATCCGGGAGGTGAAGGTGATTACTCTGAAATCTAAGCTGGTGTCAGATTTC AGAAAAGACTTCCAGTTCTACAAAGTCAGAGAGATCAACAATTATCACCATGCCCACGATGCATATCTT AATGCAGTAGTGGGGACAGCTCTGATCAAAAAATATCCTAAACTGGAGTCTGAATTCGTTTATGGTGAC TATAAAGTCTATGACGTCAGAAAAATGATCGCAAAGAGCGAGCAGGAGATAGGGAAGGCCACAGCAAA GTACTTCTTTTACAGTAATATCATGAACTTTTTCAAAACTGAGATTACATTGGCTAACGGCGAGATCCGC AAGCGGCCACTGATAGAGACTAACGGAGAGACAGGGGAGATTGTTTGGGATAAGGGCCGTGACTTCGC CACCGTTAGGAAAGTGCTGTCCATGCCCCAGGTGAACATTGTGAAGAAGACAGAAGTGCAGACGGGTG GGTTCTCAAAAGAGTCTATTCTGCCTAAGCGGAATAGTGACAAACTGATCGCACGTAAAAAGGACTGGG ATCCAAAAAAGTACGGCGGATTCGACAGTCCTACCGTTGCATATTCCGTGCTTGTGGTCGCTAAGGTGG AGAAGGGAAAAAGCAAGAAACTGAAGTCAGTCAAAGAACTACTGGGCATAACGATCATGGAGCGCTCC AGTTTCGAAAAAAACCCAATCGATTTTCTTGAAGCCAAGGGATACAAGGAGGTAAAGAAAGACCTTATC ATTAAGCTGCCTAAGTACAGTCTGTTCGAACTGGAGAATGGGAGGAAGCGCATGCTGGCATCAGCTGGA GAACTCCAAAAAGGGAACGAGTTGGCCCTCCCCTCAAAGTATGTCAATTTTCTCTACCTGGCTTCTCACT ACGAGAAGTTAAAGGGGTCTCCAGAGGATAATGAGCAGAAACAGCTGTTTGTGGAACAGCACAAGCAC TATTTGGACGAAATCATCGAACAAATTTCCGAGTTCAGTAAGAGGGTGATTCTGGCCGACGCAAACCTT GACAAAGTTCTGTCCGCATACAATAAGCACAGAGACAAACCAATCCGCGAGCAAGCCGAGAATATAAT TCACCTTTTCACTCTGACTAATCTGGGGGCCCCCGCAGCATTTAAATATTTCGATACAACAATCGACCGG AAGCGGTATACATCTACTAAGGAAGTCCTCGATGCGACACTGATCCACCAGTCAATTACAGGTTTATAT GAAACAAGAATCGACCTGTCCCAGCTGGGCGGCGACTAG SEQ AAAATTCcatGCAAAATGCTCCGGTTTCATGTCATCAAAATGATGACGTAATTAAGCATTGATAATTGAGATCCCTCTCCCTGACAGGATGATTACATA ID AATAATAGTGACAAAAATAAATTATTTATTTATCCAGAAAATGAATTGGAAAATCAGGAGAGCGTTTTCAATCCTACCTCTGGCGCAGTTGATATGTca NO: aaCAGGTtgccgtcactgcgtcttttactggctcttctcgctaaccaaaccggtaaccccgcttattaaaagcattctgtaacaaagcgggaccaaagc 171 catgacaaaaacgcgtaacaaaagtgtctataatcacggcagaaaagtccacattgattatttgcacggcgtcacactttgctatgccatagcattttt atccataagattagcggatcctacctgacgctttttatcgcaactctctactgtttctccatacccgtttttttgggctagcaccgcctatctcgtgtg agataggcggagatacgaactttaagAAGGAGatataccATGGAACAGGAATATTATCTGGGCTTGGACATGGGCACCGGTTCCGTCGGCTGGGCTGTT ACTGACAGTGAATATCACGTTCTAAGAAAGCATGGTAAGGCATTGTGGGGTGTAAGACTTTTCGAATCTGCTTCCACTGCTGAAGAGCGTAGAATGTTT AGAACGAGTCGACGTAGGCTAGACAGGCGCAATTGGAGAATCGAAATTTTACAAGAAATTTTTGCGGAAGAGATATCTAAGAAAGACCCAGGCTTTTTC CTGAGAATGAAGGAATCTAAGTATTACCCTGAGGATAAAAGAGATATAAATGGTAACTGTCCCGAATTGCCTTACGCATTATTTGTGGACGATGATTTT ACCGATAAGGATTACCATAAAAAGTTCCCAACTATCTACCATTTACGCAAAATGTTAATGAATACAGAGGAAACCCCAGACATAAGACTAGTTTATCTG GCAATACACCATATGATGAAACATAGAGGCCATTTCTTACTTTCCGGGGATATCAACGAAATCAAAGAGTTTGGTACCACATTTAGTAAGTTACTGGAA AACATAAAGAATGAAGAATTGGATTGGAACTTAGAACTCGGAAAAGAAGAATACGCGGTTGTCGAATCTATCCTGAAGGATAATATGCTGAATAGGTCG ACCAAAAAAACTAGGCTGATCAAAGCACTGAAAGCCAAATCTATCTGCGAAAAAGCTGTTTTAAATTTACTTGCTGGTGGCACTGTTAAGTTATCAGAC ATTTTTGGTTTGGAAGAATTGAACGAAACCGAGCGTCCAAAAATTAGTTTCGCTGATAATGGCTACGATGATTACATTGGTGAGGTGGAAAACGAGTTG GGCGAACAATTTTATATTATAGAGACAGCTAAGGCAGTCTATGACTGGGCTGTTTTAGTAGAAATCCTTGGTAAATACACATCTATCTCCGAAGCGAAA GTTGCTACTTACGAAAAGCACAAGTCCGATCTCCAGTTTTTGAAGAAAATTGTCAGGAAATATCTGACTAAGGAAGAATATAAAGATATTTTCGTTAGT ACCTCTGACAAACTGAAAAATTACTCCGCTTACATCGGGATGACCAAGATTAATGGCAAAAAAGTTGATCTGCAAAGCAAAAGGTGTTCGAAGGAAGAA TTTTATGATTTCATTAAAAAGAATGTCTTAAAAAAATTAGAAGGTCAGCCAGAATACGAATATTTGAAAGAAGAACTGGAAAGAGAGACATTCTTACCA AAACAAGTCAACAGAGATAATGGGGTAATTCCATATCAAATTCACCTCTACGAATTAAAAAAAATTTTAGGCAATTTACGCGATAAAATTGACCTTATC AAAGAAAATGAGGATAAGCTGGTTCAACTCTTTGAATTCAGAATACCCTATTATGTGGGCCCACTGAACAAGATTGATGACGGCAAAGAAGGTAAATTC ACATGGGCCGTCCGCAAATCCAATGAAAAAATTTACCCATGGAACTTTGAAAATGTAGTAGATATTGAAGCGTCTGCGGAGAAATTTATTCGAAGAATG ACTAATAAATGCACTTACTTGATGGGAGAGGATGTTCTGCCTAAAGACAGCTTATTATACAGCAAGTACATGGTTCTAAACGAACTTAACAACGTTAAG TTGGACGGTGAGAAATTAAGTGTAGAATTGAAACAAAGATTGTATACTGACGTCTTCTGCAAGTACAGAAAAGTGACAGTTAAAAAAATTAAGAATTAC TTGAAGTGCGAAGGTATAATTTCTGGAAACGTAGAGATTACTGGTATTGATGGTGATTTCAAAGCATCCCTAACAGCTTACCACGATTTCAAGGAAATC CTGACAGGAACTGAACTCGCAAAAAAAGATAAAGAAAACATTATTACTAATATTGTTCTTTTCGGTGATGACAAGAAATTGTTGAAGAAAAGACTGAAT AGACTTTACCCCCAGATTACTCCCAATCAACTTAAGAAAATTTGTGCTTTGTCTTACACAGGATGGGGTCGTTTTTCAAAAAAGTTCTTAGAAGAGATT ACCGCACCTGATCCAGAAACAGGCGAAGTATGGAATATAATTACCGCCTTATGGGAATCGAACAATAATCTTATGCAACTTCTGAGCAATGAATATCGT TTCATGGAAGAAGTTGAGACTTACAACATGGGCAAACAGACGAAGACTTTATCCTATGAAACTGTGGAAAATATGTATGTATCACCTTCTGTCAAGAGA CAAATTTGGCAAACCTTAAAAATTGTCAAAGAATTAGAAAAGGTAATGAAGGAGTCTCCTAAACGTGTGTTTATTGAAATGGCTAGAGAAAAACAAGAG TCAAAAAGAACCGAGTCAAGAAAGAAGCAGTTAATCGATTTATATAAGGCTTGTAAAAACGAAGAGAAAGATTGGGTTAAAGAATTGGGGGACCAAGAG GAACAAAAACTACGGTCGGATAAGTTGTATTTATACTATACGCAAAAGGGACGATGTATGTATTCCGGCGAGGTAATAGAATTGAAGGATTTATGGGAC AATACAAAATATGACATAGACCATATATATCCCCAATCAAAAACGATGGACGATAGCTTGAACAATAGAGTACTCGTGAAAAAAAAATATAATGCGACC AAATCTGATAAGTATCCTCTGAATGAAAATATCAGACATGAAAGAAAGGGGTTCTGGAAGTCCTTGTTAGATGGTGGGTTTATAAGCAAAGAAAAGTAC GAGCGTCTAATAAGAAACACGGAGTTATCGCCAGAAGAACTCGCTGGTTTTATTGAGAGGCAAATCGTGGAAACGAGACAATCTACCAAAGCCGTTGCT GAGATCCTAAAGCAAGTTTTCCCAGAGTCGGAGATTGTCTATGTCAAAGCTGGCACAGTGAGCAGGTTTAGGAAAGACTTCGAACTATTAAAGGTAAGA GAAGTGAACGATTTACATCACGCAAAGGACGCTTACCTAAATATCGTTGTAGGTAACTCATATTATGTTAAATTTACCAAGAACGCCTCTTGGTTTATA AAGGAGAACCCAGGTAGAACATATAACCTGAAAAAGATGTTCACCTCTGGTTGGAATATTGAGAGAAACGGAGAAGTCGCATGGGAAGTTGGTAAGAAA GGGACTATAGTGACAGTAAAGCAAATTATGAACAAAAATAATATCCTCGTTACAAGGCAGGTTCATGAAGCAAAGGGCGGCCTTTTTGACCAACAAATT ATGAAGAAAGGGAAAGGTCAAATTGCAATAAAAGAAACCGATGAGAGACTAGCGTCAATAGAAAAGTATGGTGGCTATAATAAAGCTGCGGGTGCATAC TTTATGCTTGTTGAATCAAAAGACAAGAAAGGTAAGACTATTAGAACTATAGAATTTATACCCCTGTACCTTAAAAACAAAATTGAATCGGATGAGTCA ATCGCGTTAAATTTTCTAGAGAAAGGAAGGGGTTTAAAAGAACCAAAGATCCTGTTAAAAAAGATTAAGATTGACACCTTGTTCGATGTAGATGGATTT AAAATGTGGTTATCTGGCAGAACAGGCGATAGACTTTTGTTTAAGTGCGCTAATCAATTAATTTTGGATGAGAAAATCATTGTCACAATGAAAAAAATA GTTAAGTTTATTCAGAGAAGACAAGAAAACAGGGAGTTGAAATTATCTGATAAAGATGGTATCGACAATGAAGTTTTAATGGAAATCTACAATACATTC GTTGATAAACTTGAAAATACCGTATATCGAATCAGGTTAAGTGAACAAGCCAAAACATTAATTGATAAACAAAAAGAATTTGAAAGGCTATCACTGGAA GACAAATCCTCCACCCTATTTGAAATTTTGCATATATTCCAGTGCCAATCTTCAGCAGCTAATTTAAAAATGATTGGCGGACCTGGGAAAGCCGGCATC CTAGTGATGAACAATAATATCTCCAAGTGTAACAAAATATCAATTATTAACCAATCTCCGACAGGTATTTTTGAAAATGAAATAGACTTGCTTAAGATA TAAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAAATTTATTATATCGCGTTGATTATTGATGCTGTTTTTAGTTTTAACGGCAATTAA TATATGTGTTATTAATTGAATGAATTTTATCATTCATAATAAGTATGTGTAGGATCAAGCTCAGGTTAAATATTCACTCAGGAAGTTATTACTCAGGAA GCAAAGAGGATTACAGAATTATCTCATAACAAGTGTTAAGGGATGTTATTTCC SEQ AATTCAAAGGATAATCAAAC ID NO: 172 SEQ AATCTCTACTCTTTGTAGAT ID NO: 173 SEQ AATTTCTACTGTTGTAGAT ID NO: 174 SEQ AATTTCTACTAGTGTAGAT ID NO: 175 SEQ AATTTCTACTATTGT ID NO: 176 SEQ AATTTCTACTGTTGTAGA ID NO: 177 SEQ AATTTCTACTATTGTA ID NO: 178 SEQ AATTTCTACTTTTGTAGAT ID NO: 179 SEQ AATTTCTACTGTTGTAGAT ID NO: 180 SEQ AATTTCTACTCTTGTAGAT ID NO: 181