Methods and compositions for modulating a genome
12031129 ยท 2024-07-09
Assignee
Inventors
- Jacob Rosenblum Rubens (Cambridge, MA)
- Geoffrey A. von Maltzahn (Somerville, MA)
- Robert James CITORIK (Somerville, MA, US)
- Barrett Ethan Steinberg (Somerville, MA, US)
- Donghui Li (Somerville, MA, US)
- William Edward SALOMON (West Roxbury, MA, US)
Cpc classification
C12N2310/20
CHEMISTRY; METALLURGY
C12N9/22
CHEMISTRY; METALLURGY
C12N15/11
CHEMISTRY; METALLURGY
International classification
C12N15/11
CHEMISTRY; METALLURGY
C12N15/90
CHEMISTRY; METALLURGY
Abstract
Methods and compositions for modulating a target genome are disclosed. The composition may comprise a first RNA encoding a polypeptide comprising a retrotransposase reverse transcriptase domain and a retrotransposase endonuclease domain. The composition may also comprise a second RNA comprising a sequence that binds the polypeptide and a heterologous object sequence. The composition may insert the sequence of the heterologous object sequence into a target DNA.
Claims
1. A composition for modifying DNA, the composition comprising a first engineered RNA and a second engineered RNA, wherein: (a) the first engineered RNA encodes a polypeptide, wherein the polypeptide comprises (i) a reverse transcriptase domain and (ii) a endonuclease domain, wherein the polypeptide comprises an amino acid sequence according to any of SEQ ID NOs: 1016-1021; and (b) the second engineered RNA comprises a template RNA, wherein the template RNA comprises (i) a sequence according to SEQ ID NO: 1140 that binds the encoded polypeptide and (ii) a heterologous object sequence, wherein (a) and (b) are separate RNAs.
2. The composition of claim 1, wherein the template RNA further comprises a sequence comprising at least 20 nucleotides of at least 80% identity to a target DNA.
3. The composition of claim 2, wherein the target DNA is a genomic safe harbor (GSH) site.
4. The composition of claim 1, wherein the template RNA comprises: (iii) at least 10 bases of identity to a target DNA at the 3 end of the template RNA, and (iv) at least 10 bases of identity to the target DNA at the 5 end of the template RNA.
5. The composition of claim 1, wherein the template RNA comprises: (iii) at least 3 bases of identity to a target DNA at the 3 end of the template RNA, and (iv) at least 3 bases of identity to the target DNA at the 5 end of the template RNA.
6. The composition of claim 1, wherein the sequence that binds the polypeptide comprises one or both of a 3 untranslated region and a 5 untranslated region.
7. The composition of claim 1, wherein the heterologous object sequence encodes an enzyme, a membrane protein, a blood factor, an intracellular protein, an extracellular protein, a structural protein, a signaling protein, a regulatory protein, a transport protein, or a motor protein.
8. The composition of claim 1, wherein the heterologous object sequence comprises a non-coding sequence or a regulatory sequence.
9. The composition of claim 1, wherein the heterologous object sequence encodes a therapeutic polypeptide or fragment thereof.
10. The pharmaceutical composition of claim 8, wherein the regulatory sequence is a promoter.
11. The composition of claim 1, wherein the polypeptide further comprises one or both of a nuclear localization signal and a nucleolar localization signal.
12. The composition of claim 1, which comprises only RNA, or comprises more RNA than DNA by an RNA:DNA ratio of at least 10:1, 20:1, 30:1, 40:1, 50:1, 60:1, 70:1, 80:1, 90:1, or 100:1.
13. The composition of claim 1, which comprises no more than 1%, 0.5%, 0.2%, 0.1%, 0.05%, 0.02%, or 0.01% DNA by mass or by molar amount of nucleic acid.
14. The composition of claim 1, which is capable of modifying DNA in the absence of homologous recombination activity.
15. The composition of claim 1, wherein the template RNA does not encode a reverse transcriptase.
16. The composition of claim 1, wherein the template RNA does not encode an endonuclease.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8) FIGS. 8A-8B: Results of MISEQ? and MATLAB? analysis of DNA-mediated R2Tg integration into Hek293T cells. Each graph shows analysis of (
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17)
(18)
(19)
(20)
(21)
(22)
(23)
(24)
(25)
DETAILED DESCRIPTION
(26) This disclosure relates to compositions, systems and methods for targeting, editing, modifying or manipulating a DNA sequence (e.g., inserting a heterologous object DNA sequence into a target site of a mammalian genome) at one or more locations in a DNA sequence in a cell, tissue or subject, e.g., in vivo or in vitro. The object DNA sequence may include, e.g., a coding sequence, a regulatory sequence, a gene expression unit.
(27) More specifically, the disclosure provides retrotransposon-based systems for inserting a sequence of interest into the genome. This disclosure is based, in part, on a bioinformatic analysis to identify retrotransposase sequences and the associated 5 UTR and 3 UTR from a variety of organisms (see Table 3). While not wishing to be bound by theory, in some embodiments, retrotransposases identified in homeothermic (warm blooded) species, like birds, may have improved thermostability relative to some other enzymes that evolved at lower temperatures, and the thermostable retrotransposases may therefore be better suited for use in human cells. The disclosure also provides experimental evidence that several retrotransposases from different species, e.g., different species of animal and/or different species and clade of retrotransposon (e.g., as grouped by reverse transcriptase phylogeny, e.g., as described in Su et al. (2019) RNA; incorporated herein by reference in its entirety), can be used to catalyze DNA insertion into a target site in human cells (see Examples 7 and Example 28).
(28) In some embodiments, systems described herein can have a number of advantages relative to various earlier systems. For instance, the disclosure describes retrotransposases capable of inserting long sequences (e.g., over 3000 nucleotides) of heterologous nucleic acid into a genome (see, e.g.,
(29) GENE WRITER? Genome Editors
(30) Non-long terminal repeat (LTR) retrotransposons are a type of mobile genetic elements that are widespread in eukaryotic genomes. They include two classes: the apurinic/apyrimidinic endonuclease (APE)-type and the restriction enzyme-like endonuclease (RLE)-type. The APE class retrotransposons are comprised of two functional domains: an endonuclease/DNA binding domain, and a reverse transcriptase domain. The RLE class are comprised of three functional domains: a DNA binding domain, a reverse transcription domain, and an endonuclease domain. The reverse transcriptase domain of non-LTR retrotransposon functions by binding an RNA sequence template and reverse transcribing it into the host genome's target DNA. The RNA sequence template has a 3 untranslated region which is specifically bound to the transposase, and a variable 5 region generally having Open Reading Frame(s) (ORF) encoding transposase proteins. The RNA sequence template may also comprise a 5 untranslated region which specifically binds the retrotransposase.
(31) The inventors have found that, surprisingly, the elements of such non-LTR retrotransposons can be functionally modularized and/or modified to target, edit, modify or manipulate a target DNA sequence, e.g., to insert an object (e.g., heterologous) nucleic acid sequence into a target genome, e.g., a mammalian genome, by reverse transcription. Such modularized and modified nucleic acids, polypeptide compositions and systems are described herein and are referred to as GENE WRITER? gene editors. A GENE WRITER? gene editor system comprises: (A) a polypeptide or a nucleic acid encoding a polypeptide, wherein the polypeptide comprises (i) a reverse transcriptase domain, and either (x) an endonuclease domain that contains DNA binding functionality or (y) an endonuclease domain and separate DNA binding domain; and (B) a template RNA comprising (i) a sequence that binds the polypeptide and (ii) a heterologous insert sequence. For example, the GENE WRITER? genome editor protein may comprise a DNA-binding domain, a reverse transcriptase domain, and an endonuclease domain. In other embodiments, the GENE WRITER? genome editor protein may comprise a reverse transcriptase domain and an endonuclease domain. In certain embodiments, the elements of the GENE WRITER? gene editor polypeptide can be derived from sequences of non-LTR retrotransposons, e.g., APE-type or RLE-type retrotransposons or portions or domains thereof. In some embodiments the RLE-type non-LTR retrotransposon is from the R2, NeSL, HERO, R4, or CRE clade. In some embodiments the GENE WRITER? genome editor is derived from R4 element X4_Line, which is found in the human genome. In some embodiments the APE-type non-LTR retrotransposon is from the R1, or Tx1 clade. In some embodiments the GENE WRITER? genome editor is derived from Tx1 element Mare6, which is found in the human genome. The RNA template element of a GENE WRITER? gene editor system is typically heterologous to the polypeptide element and provides an object sequence to be inserted (reverse transcribed) into the host genome. In some embodiments the GENE WRITER? genome editor protein is capable of target primed reverse transcription.
(32) In some embodiments the GENE WRITER? genome editor is combined with a second polypeptide. In some embodiments the second polypeptide is derived from an APE-type non-LTR retrotransposon. In some embodiments the second polypeptide has a zinc knuckle-like motif. In some embodiments the second polypeptide is a homolog of Gag proteins.
(33) Polypeptide component of GENE WRITER? gene editor system
(34) Rt Domain:
(35) In certain aspects of the present invention, the reverse transcriptase domain of the GENE WRITER? system is based on a reverse transcriptase domain of an APE-type or RLE-type non-LTR retrotransposon. A wild-type reverse transcriptase domain of an APE-type or RLE-type non-LTR retrotransposon can be used in a GENE WRITER? system or can be modified (e.g., by insertion, deletion, or substitution of one or more residues) to alter the reverse transcriptase activity for target DNA sequences. In some embodiments the reverse transcriptase is altered from its natural sequence to have altered codon usage, e.g. improved for human cells. In some embodiments the reverse transcriptase domain is a heterologous reverse transcriptase from a different retrovirus, LTR-retrotransposon, or non-LTR retrotransposon. In certain embodiments, a GENE WRITER? system includes a polypeptide that comprises a reverse transcriptase domain of an RLE-type non-LTR retrotransposon from the R2, NeSL, HERO, R4, or CRE clade, or of an APE-type non-LTR retrotransposon from the R1, or Tx1 clade. In certain embodiments, a GENE WRITER? system includes a polypeptide that comprises a reverse transcriptase domain of a retrotransposon listed in Table 1, Table 2, or Table 3. In embodiments, the amino acid sequence of the reverse transcriptase domain of a GENE WRITER? system is at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99% identical to the amino acid sequence of a reverse transcriptase domain of a retrotransposon whose DNA sequence is referenced in Table 1, Table 2, or Table 3. A person having ordinary skill in the art is capable of identifying reverse transcription domains based upon homology to other known reverse transcription domains using routine tools as Basic Local Alignment Search Tool (BLAST). In some embodiments, reverse transcriptase domains are modified, for example by site-specific mutation. In embodiments, the reverse transcriptase domain is engineered to bind a heterologous template RNA.
(36) Endonuclease Domain:
(37) In certain embodiments, the endonuclease/DNA binding domain of an APE-type retrotransposon or the endonuclease domain of an RLE-type retrotransposon can be used or can be modified (e.g., by insertion, deletion, or substitution of one or more residues) in a GENE WRITER? system described herein. In some embodiments the endonuclease domain or endonuclease/DNA binding domain is altered from its natural sequence to have altered codon usage, e.g. improved for human cells. In some embodiments the endonuclease element is a heterologous endonuclease element, such as Fok1 nuclease, a type-II restriction 1-like endonuclease (RLE-type nuclease), or another RLE-type endonuclease (also known as REL). In some embodiments the heterologous endonuclease activity has nickase activity and does not form double stranded breaks. The amino acid sequence of an endonuclease domain of a GENE WRITER? system described herein may be at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99% identical to the amino acid sequence of an endonuclease domain of a retrotransposon whose DNA sequence is referenced in Table 1, 2, or 3. A person having ordinary skill in the art is capable of identifying endounclease domains based upon homology to other known endonuclease domains using tools as Basic Local Alignment Search Tool (BLAST). In certain embodiments, the heterologous endonuclease is Fok1 or a functional fragment thereof. In certain embodiments, the heterologous endonuclease is a Holliday junction resolvase or homolog thereof, such as the Holliday junction resolving enzyme from Sulfolobus solfataricus-Ssol Hje (Govindaraju et al., Nucleic Acids Research 44:7, 2016). In certain embodiments, the heterologous endonuclease is the endonuclease of the large fragment of a spliceosomal protein, such as Prp8 (Mahbub et al., Mobile DNA 8:16, 2017). For example, a GENE WRITER? polypeptide described herein may comprise a reverse transcriptase domain from an APE- or RLE-type retrotransposon and an endonuclease domain that comprises Fok1 or a functional fragment thereof. In still other embodiments, homologous endonuclease domains are modified, for example by site-specific mutation, to alter DNA endonuclease activity. In still other embodiments, endonuclease domains are modified to remove any latent DNA-sequence specificity.
(38) DNA Binding Domain:
(39) In certain aspects, the DNA-binding domain of a GENE WRITER? polypeptide described herein is selected, designed, or constructed for binding to a desired host DNA target sequence. In certain embodiments, the DNA-binding domain of the engineered RLE is a heterologous DNA-binding protein or domain relative to a native retrotransposon sequence. In some embodiments the heterologous DNA binding element is a zinc-finger element or a TAL effector element, e.g., a zinc-finger or TAL polypeptide or functional fragment thereof. In some embodiments the heterologous DNA binding element is a sequence-guided DNA binding element, such as Cas9, Cpf1, or other CRISPR-related protein that has been altered to have no endonuclease activity. In some embodiments the heterologous DNA binding element retains endonuclease activity. In some embodiments the heterologous DNA binding element replaces the endonuclease element of the polypeptide. In specific embodiments, the heterologous DNA-binding domain can be any one or more of Cas9, TAL domain, ZF domain, Myb domain, combinations thereof, or multiples thereof. In certain embodiments, the heterologous DNA-binding domain is a DNA binding domain of a retrotransposon described in Table 1, Table 2, or Table 3. A person having ordinary skill in the art is capable of identifying DNA binding domains based upon homology to other known DNA binding domains using tools as Basic Local Alignment Search Tool (BLAST). In still other embodiments, DNA-binding domains are modified, for example by site-specific mutation, increasing or decreasing DNA-binding elements (for example, number and/or specificity of zinc fingers), etc., to alter DNA-binding specificity and affinity. In some embodiments the DNA binding domain is altered from its natural sequence to have altered codon usage, e.g. improved for human cells
(40) In certain aspects of the present invention, the host DNA-binding site integrated into by the GENE WRITER? system can be in a gene, in an intron, in an exon, an ORF, outside of a coding region of any gene, in a regulatory region of a gene, or outside of a regulatory region of a gene. In other aspects, the engineered RLE may bind to one or more than one host DNA sequence.
(41) In certain embodiments, a GENE WRITER? gene editor system RNA further comprises an intracellular localization sequence, e.g., a nuclear localization sequence. The nuclear localization sequence may be an RNA sequence that promotes the import of the RNA into the nucleus. In certain embodiments the nuclear localization signal is located on the template RNA. In certain embodiments, the retrotransposase polypeptide is encoded on a first RNA, and the template RNA is a second, separate, RNA, and the nuclear localization signal is located on the template RNA and not on an RNA encoding the retrotransposase polypeptide. While not wishing to be bound by theory, in some embodiments, the RNA encoding the retrotransposase is targeted primarily to the cytoplasm to promote its translation, while the template RNA is targeted primarily to the nucleus to promote its retrotransposition into the genome. In some embodiments the nuclear localization signal is at the 3 end, 5 end, or in an internal region of the template RNA. In some embodiments the nuclear localization signal is 3 of the heterologous sequence (e.g., is directly 3 of the heterologous sequence) or is 5 of the heterologous sequence (e.g., is directly 5 of the heterologous sequence). In some embodiments the nuclear localization signal is placed outside of the 5 UTR or outside of the 3 UTR of the template RNA. In some embodiments the nuclear localization signal is placed between the 5 UTR and the 3 UTR, wherein optionally the nuclear localization signal is not transcribed with the transgene (e.g., the nuclear localization signal is an anti-sense orientation or is downstream of a transcriptional termination signal or polyadenylation signal). In some embodiments the nuclear localization sequence is situated inside of an intron. In some embodiments a plurality of the same or different nuclear localization signals are in the RNA, e.g., in the template RNA. In some embodiments the nuclear localization signal is less than 5, 10, 25, 50, 75, 100, 150, 200, 250, 300, 350, 400, 450, 500, 600, 700, 800, 900 or 1000 bp in length. Various RNA nuclear localization sequences can be used. For example, Lubelsky and Ulitsky, Nature 555 (107-111), 2018 describe RNA sequences which drive RNA localization into the nucleus. In some embodiments, the nuclear localization signal is a SINE-derived nuclear RNA localization (SIRLOIN) signal. In some embodiments the nuclear localization signal binds a nuclear-enriched protein. In some embodiments the nuclear localization signal binds the HNRNPK protein. In some embodiments the nuclear localization signal is rich in pyrimidines, e.g., is a C/T rich, C/U rich, C rich, T rich, or U rich region. In some embodiments the nuclear localization signal is derived from a long non-coding RNA. In some embodiments the nuclear localization signal is derived from MALAT1 long non-coding RNA or is the 600 nucleotide M region of MALAT1 (described in Miyagawa et al., RNA 18, (738-751), 2012). In some embodiments the nuclear localization signal is derived from BORG long non-coding RNA or is a AGCCC motif (described in Zhang et al., Molecular and Cellular Biology 34, 2318-2329 (2014). In some embodiments the nuclear localization sequence is described in Shukla et al., The EMBO Journal e98452 (2018). In some embodiments the nuclear localization signal is derived from a non-LTR retrotransposon, an LTR retrotransposon, retrovirus, or an endogenous retrovirus.
(42) In certain embodiments, a GENE WRITER? gene editor system polypeptide further comprises an intracellular localization sequence, e.g., a nuclear localization sequence and/or a nucleolar localization sequence. The nuclear localization sequence and/or nucleolar localization sequence may be amino acid sequences that promote the import of the protein into the nucleus and/or nucleolus, where it can promote integration of heterologous sequence into the genome. In certain embodiments, a GENE WRITER? gene editor system polypeptide (e.g., a retrotransposase, e.g., a polypeptide according to any of Tables 1, 2, or 3 herein) further comprises a nucleolar localization sequence. In certain embodiments, the retrotransposase polypeptide is encoded on a first RNA, and the template RNA is a second, separate, RNA, and the nucleolar localization signal is encoded on the RNA encoding the retrotransposase polypeptide and not on the template RNA. In some embodiments, the nucleolar localization signal is located at the N-terminus, C-terminus, or in an internal region of the polypeptide. In some embodiments, a plurality of the same or different nucleolar localization signals are used. In some embodiments, the nuclear localization signal is less than 5, 10, 25, 50, 75, or 100 amino acids in length. Various polypeptide nucleolar localization signals can be used. For example, Yang et al., Journal of Biomedical Science 22, 33 (2015), describe a nuclear localization signal that also functions as a nucleolar localization signal. In some embodiments, the nucleolar localization signal may also be a nuclear localization signal. In some embodiments, the nucleolar localization signal may overlap with a nuclear localization signal. In some embodiments, the nucleolar localization signal may comprise a stretch of basic residues. In some embodiments, the nucleolar localization signal may be rich in arginine and lysine residues. In some embodiments, the nucleolar localization signal may be derived from a protein that is enriched in the nucleolus. In some embodiments, the nucleolar localization signal may be derived from a protein enriched at ribosomal RNA loci. In some embodiments, the nucleolar localization signal may be derived from a protein that binds rRNA. In some embodiments, the nucleolar localization signal may be derived from MSP58. In some embodiments, the nucleolar localization signal may be a monopartite motif. In some embodiments, the nucleolar localization signal may be a bipartite motif. In some embodiments, the nucleolar localization signal may consist of a multiple monopartite or bipartite motifs. In some embodiments, the nucleolar localization signal may consist of a mix of monopartite and bipartite motifs. In some embodiments, the nucleolar localization signal may be a dual bipartite motif. In some embodiments, the nucleolar localization motif may be a KRASSQALGTIPKRRSSSRFIKRKK (SEQ ID NO: 1530). In some embodiments, the nucleolar localization signal may be derived from nuclear factor-?B-inducing kinase. In some embodiments, the nucleolar localization signal may be an RKKRKKK motif (SEQ ID NO: 1531) (described in Birbach et al., Journal of Cell Science, 117 (3615-3624), 2004).
(43) In some embodiments, a nucleic acid described herein (e.g., an RNA encoding a GENE WRITER? polypeptide, or a DNA encoding the RNA) comprises a microRNA binding site. In some embodiments, the microRNA binding site is used to increase the target-cell specificity of a GENE WRITER? system. For instance, the microRNA binding site can be chosen on the basis that is recognized by a miRNA that is present in a non-target cell type, but that is not present (or is present at a reduced level relative to the non-target cell) in a target cell type. Thus, when the RNA encoding the GENE WRITER? polypeptide is present in a non-target cell, it would be bound by the miRNA, and when the RNA encoding the GENE WRITER? polypeptide is present in a target cell, it would not be bound by the miRNA (or bound but at reduced levels relative to the non-target cell). While not wishing to be bound by theory, binding of the miRNA to the RNA encoding the GENE WRITER? polypeptide may reduce production of the GENE WRITER? polypeptide, e.g., by degrading the mRNA encoding the polypeptide or by interfering with translation. Accordingly, the heterologous object sequence would be inserted into the genome of target cells more efficiently than into the genome of non-target cells. A system having a microRNA binding site in the RNA encoding the GENE WRITER? polypeptide (or encoded in the DNA encoding the RNA) may also be used in combination with a template RNA that is regulated by a second microRNA binding site, e.g., as described herein in the section entitled Template RNA component of GENE WRITER? gene editor system.
(44) TABLE-US-00001 TABLE 1 Table 1: APE-type non-LTR retrotransposon elements Sequence Family Accession Mobile Element Name Organism Dewa AB097143 ORF2 Danio rerio retrotransposon Danio rerio DewaDr1 DNA, complete sequence HeT-A KJ081250 non-LTR Drosophila melanogaster non-LTR Drosophila retrotransposon: retrotransposon HeT-A, partial melanogaster HeT-A sequence Keno AB111948 ORF2 Tetraodon nigroviridis Tetraodon retrotransposon KenoTn1 DNA, nigroviridis partial sequence KenoDr1; AB097144 ORF2 Danio rerio retrotransposon KenoDr1 Danio rerio Keno DNA, complete sequence KenoFr1; AB111947 ORF2 Takifugu rubripes retrotransposon Takifugu Keno KenoFr1 DNA, complete sequence rubripes Kibi AB097139 ORF2 Danio rerio retrotransposon KibiDr2 Danio rerio DNA, complete sequence Kibi AB097138 ORF2 Danio rerio retrotransposon KibiDr1 Danio rerio DNA, complete sequence Kibi AB097137 ORF2 Tetraodon nigroviridis Tetraodon retrotransposon KibiTn1 DNA, nigroviridis complete sequence Kibi AB097136 ORF2 Takifugu rubripes retrotransposon Takifugu KibiFr1 DNA, complete sequence rubripes KoshiTn1 AB097135 ORF2 Tetraodon nigroviridis Tetraodon retrotransposon KoshiTn1 DNA, nigroviridis complete sequence Mutsu AB097142 ORF2 Danio rerio retrotransposon Danio rerio MutsuDr3 DNA, partial sequence Mutsu AB097141 ORF2 Danio rerio retrotransposon Danio rerio MutsuDr2 DNA, partial sequence Mutsu AB097140 ORF2 Danio rerio retrotransposon Danio rerio MutsuDr1 DNA, complete sequence R1 HQ284568 non-LTR Trilocha sp. GAS-2011 isolate TrilSp.6 Trilocha retrotransposon: non-LTR retrotransposon R1-like sp. GAS-2011 R1-like reverse transcriptase gene, partial cds R1 HQ284534 non-LTR Scopula ornata isolate ScoOrn.6 non- Scopula ornata retrotransposon: LTR retrotransposon R1-like reverse R1-like transcriptase gene, partial cds R1 HQ284496 non-LTR Perigonia ilus isolate PerIlus.31 non- Perigonia ilus retrotransposon: LTR retrotransposon R1-like reverse R1-like transcriptase gene, partial cds R1 HQ284489 non-LTR Oxytenis modestia isolate Oxytenis retrotransposon: OxyMod.2_3_4_7_9 non-LTR modestia R1-like retrotransposon R1-like reverse transcriptase gene, partial cds R1 HQ284488 non-LTR Oxytenis modestia isolate OxyMod.1 Oxytenis retrotransposon: non-LTR retrotransposon R1-like modestia R1-like reverse transcriptase gene, partial cds R1 HQ284476 non-LTR Oeneis magna dubia isolate Oeneis magna retrotransposon: OenMag.26 non-LTR retrotransposon dubia R1-like R1-like reverse transcriptase-like gene, partial sequence R1 HQ284437 non-LTR Lymantria dispar isolate LymDis.2 Lymantria retrotransposon: non-LTR retrotransposon R1-like dispar R1-like reverse transcriptase gene, partial cds R1 HQ284435 non-LTR Lymantria dispar isolate LymDis.1 Lymantria retrotransposon: non-LTR retrotransposon R1-like dispar R1-like reverse transcriptase gene, partial cds R1 HQ284432 non-LTR Janiodes laverna isolate JanLav.911 Janiodes retrotransposon: non-LTR retrotransposon R1-like laverna R1-like reverse transcriptase-like gene, partial sequence R1 HQ284431 non-LTR Janiodes laverna isolate JanLav.811 Janiodes retrotransposon: non-LTR retrotransposon R1-like laverna R1-like reverse transcriptase gene, partial cds R1 HQ284430 non-LTR Janiodes laverna isolate JanLav.5 Janiodes retrotransposon: non-LTR retrotransposon R1-like laverna R1-like reverse transcriptase gene, partial cds R1 HQ284428 non-LTR Janiodes laverna isolate JanLav.411 Janiodes retrotransposon: non-LTR retrotransposon R1-like laverna R1-like reverse transcriptase gene, partial cds R1 HQ284426 non-LTR Janiodes laverna isolate JanLav.211 Janiodes retrotransposon: non-LTR retrotransposon R1-like laverna R1-like reverse transcriptase-like gene, partial sequence R1 HQ284421 non-LTR Heteropterus morpheus isolate Heteropterus retrotransposon: HetMor.3 non-LTR retrotransposon morpheus R1-like R1-like reverse transcriptase gene, partial cds R1 HQ284402 non-LTR Erinnyis ello isolate EriEllo.22 non- Erinnyis ello retrotransposon: LTR retrotransposon R1-like reverse R1-like transcriptase-like gene, partial sequence R1 HQ284399 non-LTR Erebia theano isolate EreThe.29 non- Erebia theano retrotransposon: LTR retrotransposon R1-like reverse R1-like transcriptase gene, partial cds R1 HQ284398 non-LTR Erebia theano isolate EreThe.28 non- Erebia theano retrotransposon: LTR retrotransposon R1-like reverse R1-like transcriptase-like gene, partial sequence R1 HQ284397 non-LTR Erebia theano isolate EreThe.27 non- Erebia theano retrotransposon: LTR retrotransposon R1-like reverse R1-like transcriptase-like gene, partial sequence R1 HQ284391 non-LTR Emesis lucinda isolate EmeLuc.23 Emesis lucinda retrotransposon: non-LTR retrotransposon R1-like R1-like reverse transcriptase gene, partial cds R1 HQ284390 non-LTR Emesis lucinda isolate EmeLuc.2 non- Emesis lucinda retrotransposon: LTR retrotransposon R1-like reverse R1-like transcriptase gene, partial cds R1 HQ284364 non-LTR Coenonympha glycerion isolate Coenonympha retrotransposon: CoeGly.9 non-LTR retrotransposon glycerion R1-like R1-like reverse transcriptase gene, partial cds R1 HQ284363 non-LTR Coenonympha glycerion isolate Coenonympha retrotransposon: CoeGly.8 non-LTR retrotransposon glycerion R1-like R1-like reverse transcriptase-like gene, partial sequence R1 HQ284362 non-LTR Coenonympha glycerion isolate Coenonympha retrotransposon: CoeGly.7 non-LTR retrotransposon glycerion R1-like R1-like reverse transcriptase gene, partial cds R1 HQ284361 non-LTR Coenonympha glycerion isolate Coenonympha retrotransposon: CoeGly.5 non-LTR retrotransposon glycerion R1-like R1-like reverse transcriptase-like gene, partial sequence R1 HQ284357 non-LTR Coenonympha glycerion isolate Coenonympha retrotransposon: CoeGly.13 non-LTR retrotransposon glycerion R1-like R1-like reverse transcriptase gene, partial cds R1 HQ284356 non-LTR Coenonympha glycerion isolate Coenonympha retrotransposon: CoeGly.11 non-LTR retrotransposon glycerion R1-like R1-like reverse transcriptase-like gene, partial sequence R1 HQ284350 non-LTR Catocyclotis adelina isolate Catocyclotis retrotransposon: CatAde.18 non-LTR retrotransposon adelina R1-like R1-like reverse transcriptase gene, partial cds R1 HQ284340 non-LTR Caria rhacotis isolate CarRha.11 non- Caria rhacotis retrotransposon: LTR retrotransposon R1-like reverse R1-like transcriptase gene, partial cds R1 HQ284339 non-LTR Caria rhacotis isolate CarRha.1 non- Caria rhacotis retrotransposon: LTR retrotransposon R1-like reverse R1-like transcriptase gene, partial cds R1 HQ284319 non-LTR Archiearis parthenias isolate BrePar.1 Archiearis retrotransposon: non-LTR retrotransposon R1-like parthenias R1-like reverse transcriptase gene, partial cds R1 HQ284318 non-LTR Brangas neora isolate BraNeo.32 Brangas neora retrotransposon: non-LTR retrotransposon R1-like R1-like reverse transcriptase gene, partial cds R1 HQ284292 non-LTR Araschnia levana isolate AraLev.31 Araschnia levana retrotransposon: non-LTR retrotransposon R1-like R1-like reverse transcriptase-like gene, partial sequence R1 HQ284286 non-LTR Araschnia levana isolate AraLev.1 Araschnia levana retrotransposon: non-LTR retrotransposon R1-like R1-like reverse transcriptase gene, partial cds R1 HQ284280 non-LTR Anteros formosus isolate AntForm.34 Anteros retrotransposon: non-LTR retrotransposon R1-like formosus R1-like reverse transcriptase gene, partial cds R1 HQ284279 non-LTR Anteros formosus isolate AntForm.32 Anteros retrotransposon: non-LTR retrotransposon R1-like formosus R1-like reverse transcriptase-like gene, partial sequence R1 HQ284278 non-LTR Anteros formosus isolate AntForm.31 Anteros retrotransposon: non-LTR retrotransposon R1-like formosus R1-like reverse transcriptase-like gene, partial sequence R1 HQ284270 non-LTR Agrotis exclamationis isolate Agrotis retrotransposon: AgrExcl.27 non-LTR retrotransposon exclamationis R1-like R1-like reverse transcriptase gene, partial cds R1 HQ284267 non-LTR Agrius cingulata isolate Agrius retrotransposon: AgrCing.36_39 non-LTR cingulata R1-like retrotransposon R1-like reverse transcriptase gene, partial cds R1 HQ284266 non-LTR Agrius cingulata isolate AgrCing.3 Agrius retrotransposon: non-LTR retrotransposon R1-like cingulata R1-like reverse transcriptase-like gene, partial sequence R1 HQ284263 non-LTR Aglia tau isolate AglTau.8 non-LTR Aglia tau retrotransposon: retrotransposon R1-like reverse R1-like transcriptase gene, partial cds R1 HQ284262 non-LTR Aglia tau isolate AglTau.7 non-LTR Aglia tau retrotransposon: retrotransposon R1-like reverse R1-like transcriptase gene, partial cds R1 DQ836362 MalR1 Maculinea alcon R1-like non-LTR Phengaris alcon retrotransposon R1 reverse transcriptase (RT) pseudogene, partial sequence R1 DQ836391 MnaR1 Maculinea nausithous R1-like non- Phengaris LTR retrotransposon R1 reverse nausithous transcriptase (RT) gene, partial cds R1 KU543683 non-LTR Bactrocera tryoni clone Btry_5404 Bactrocera retrotransposon non-LTR retrotransposon R1, tryoni and non-LTR complete sequence retrovirus reverse transcriptase; Region: RT_nLTR R1 KU543682 non-LTR Bactrocera tryoni clone Btry_5167 Bactrocera retrotransposon non-LTR retrotransposon R1, tryoni and non-LTR complete sequence retrovirus reverse transcriptase; Region: RT_nLTR R1 KU543679 non-LTR Bactrocera tryoni clone Btry_4956 Bactrocera retrotransposon non-LTR retrotransposon R1, tryoni and non-LTR complete sequence retrovirus reverse transcriptase; Region: RT_nLTR R1 KU543678 non-LTR Bactrocera tryoni clone Btry_5979 Bactrocera retrotransposon non-LTR retrotransposon R1, tryoni and non-LTR complete sequence retrovirus reverse transcriptase; Region: RT_nLTR R1 AB078933 ORF1 Papilio xuthus non-LTR Papilio xuthus retrotransposon gene for gag-like protein, partial cds, clone: SARTPx2-2 R1 AB078932 ORF1 Papilio xuthus non-LTR Papilio xuthus retrotransposon gene for gag-like protein, partial cds, clone: SARTPx2-1 R1 AB078936 ORF2 Papilio xuthus non-LTR Papilio xuthus retrotransposon genes for gag-like protein, reverse transcriptase, partial cds, clone: SARTPx4-N18 R1 AB078935 ORF2 Papilio xuthus non-LTR Papilio xuthus retrotransposon genes for gag-like protein, reverse transcriptase, partial and complete cds, clone: SARTPx3-N7 R1 AB078934 ORF2 Papilio xuthus non-LTR Papilio xuthus retrotransposon genes for gag-like protein, reverse transcriptase, partial cds, clone: SARTPx3-N3 R1 AB078931 ORF2 Papilio xuthus non-LTR Papilio xuthus retrotransposon genes for gag-like protein, reverse transcriptase, partial and complete cds, clone: SARTPx1-N14 R1 AB078930 ORF2 Papilio xuthus non-LTR Papilio xuthus retrotransposon genes for gag-like protein, reverse transcriptase, partial and complete cds, clone: SARTPx1-N5 R1 AB078929 ORF2 Papilio xuthus non-LTR Papilio xuthus retrotransposon genes for gag-like protein, reverse transcrpitase, partial and complete cds, clone: SARTPx1-N4 R1 AB078928 ORF2 Papilio xuthus non-LTR Papilio xuthus retrotransposon gene for gag-like protein, reverse transcrpitase, complete and partial cds, clone: SARTPx1-3 R1 KP771712 ORF2; contains Blattella germanica non-LTR Blattella endonuclease, retrotransposon TRAS-like 2, germanica reverse complete sequence transcriptase and RNaseH R1 KP771711 ORF2; contains Blattella germanica non-LTR Blattella endonuclease, retrotransposon TRAS-like 1, germanica reverse complete sequence transcriptase and RNaseH R1 AF015813 R1 ORF Dugesiella sp. retrotransposon R1 Aphonopelma reverse transcriptase gene, partial sp. WDB-1998 cds R1 AF015489 R1 ORF Dugesiella sp. retrotransposon R1 Aphonopelma reverse transcriptase gene, partial sp. WDB-1998 cds R1Bm AB182560 non-LTR Bombyx mori genes for non-LTR Bombyx mori retrotransposon retrotransposon R1Bmks ORF1 R1Bmks ORF2 protein, non-LTR retrotransposon R1Bmks ORF2 protein, complete cds R6 AB090819 ORF2 Anopheles gambiae retrotransposon Anopheles R6Ag3 DNA, complete sequence gambiae R6 AB090818 ORF2 Anopheles gambiae retrotransposon Anopheles R6Ag2 DNA, complete sequence gambiae R6 AB090817 ORF2 Anopheles gambiae retrotransposon Anopheles R6Ag1 DNA, complete sequence gambiae R6 KJ958615 R2 Bacillus rossius non-LTR Bacillus retrotransposon reVIR6, partial rossius sequence R6 KJ958596 R2 Bacillus rossius non-LTR Bacillus retrotransposon reBER6, partial rossius sequence R6 AF352480 transposon: Chironomus circumdatus clone cir6 Chironomus NLRCth1- transposon NLRCth1-like non-LTR circumdatus like non-LTR retrotransposon reverse retrotransposon transcriptase gene, partial cds R6 AF373367 transposon: Clelia rustica clone CR6 non-LTR Paraphimophis non-LTR retrotransposon LINE2 reverse rusticus retrotransposon transcriptase pseudogene, partial LINE2 sequence R7 AB090820 ORF2 Anopheles gambiae retrotransposon Anopheles R7Ag1 DNA, complete sequence gambiae R7 AB090821 ORF2 Anopheles gambiae retrotransposon Anopheles R7Ag2 DNA, complete sequence gambiae R7 KJ958622 R2 Bacillus rossius non-LTR Bacillus retrotransposon trKOR7, partial rossius sequence R7 KJ958616 R2 Bacillus rossius non-LTR Bacillus retrotransposon reVIR7, partial rossius sequence R7 KJ958597 R2 Bacillus rossius non-LTR Bacillus retrotransposon reBER7, partial rossius sequence R7 AF352514 transposon: Chironomus circumdatus clone cir7 Chironomus NLRCth1-like transposon NLRCth1-like non-LTR circumdatus non-LTR retrotransposon reverse retrotransposon transcriptase pseudogene, partial sequence Rt2 AY379084 truncated; Leptocheirus plumulosus Leptocheirus similar to retrotransposon LpRt2, partial plumulosus reverse sequence transcriptase Rt2 MSQRT2RET Anopheles gambiae retrotransposon Anopheles RT2, complete sequence gambiae RTAg4 AB090813 ORF2 Anopheles gambiae retrotransposon Anopheles RTAg4 DNA, complete sequence gambiae TRAS1 BMOTRAS1 DNA binding Bombyx mori gene, complete Bombyx mori domain at sequence of retrotransposon TRAS1 AA1103-1120. TRAS3 JX875955 similar to Acyrthosiphon pisum clone LSR1 non- Acyrthosiphon reverse LTR retrotransposon TRAS3, pisum transcriptases complete sequence Tx1 AJ621359 transposon: Tetraodon nigroviridis non-LTR Tetraodon non-LTR retrotransposon TX1-1_Tet, complete nigroviridis retrotransposon sequence TX1-1_Tet Tx1 AJ621360 transposon: Tetraodon nigroviridis partial non- Tetraodon non-LTR LTR retrotransposon TX1-2_Tet nigroviridis retrotransposon TX1-2_Tet Tx1 AJ621361 transposon: Tetraodon nigroviridis partial non- Tetraodon non-LTR LTR retrotransposon TX1-3_Tet nigroviridis retrotransposon TX1-3_Tet Tx1 AJ621362 transposon: Tetraodon nigroviridis partial non- Tetraodon non-LTR LTR retrotransposon TX1-4_Tet nigroviridis retrotransposon TX1-4_Tet Tx1 DQ118004 transposon: Acipenser ruthenus clone dg194 Acipenser Tx1-like transposon Tx1-like retrotransposon ruthenus retrotransposon Tx1Aru reverse transcriptase-like Tx1Aru gene, partial sequence Tx1 AB097134 ORF2 Takifugu rubripes retrotransposon Takifugu KoshiFr1 DNA, complete sequence rubripes Tx1 AB090816 ORF2 Anopheles gambiae retrotransposon Anopheles MinoAg1 DNA, complete sequence gambiae Tx1 AB090812 ORF2 Anopheles gambiae retrotransposon Anopheles RTAg3 DNA, complete sequence gambiae Waldo AH009917 non-LTR Drosophila melanogaster Waldo-A Drosophila retrotransposon: non-LTR retrotransposon, 5 melanogaster Waldo-A sequence Waldo AH009916 non-LTR Drosophila melanogaster clone CBE9 Drosophila retrotransposon: Waldo-A non-LTR retrotransposon, 5 melanogaster Waldo-A sequence Waldo AH009915 non-LTR Drosophila melanogaster Waldo-A Drosophila retrotransposon: non-LTR retrotransposon, 5 melanogaster Waldo-A sequence Waldo AH009914 non-LTR Drosophila melanogaster Waldo-A Drosophila retrotransposon: non-LTR retrotransposon melanogaster Waldo-A Waldo AH009920 non-LTR Drosophila melanogaster Waldo-B Drosophila retrotransposon: non-LTR retrotransposon, 5 melanogaster Waldo-B sequence Waldo AH009919 non-LTR Drosophila retrotransposon: melanogaster Waldo-B Waldo AH009918 non-LTR Drosophila retrotransposon: melanogaster Waldo-B Waldo AB090815 ORF2 Anopheles gambiae retrotransposon Anopheles WaldoAg2 DNA, complete sequence gambiae Waldo AB090814 ORF2 Anopheles gambiae retrotransposon Anopheles WaldoAg1 DNA, complete sequence gambiae Waldo AB078939 ORF2 Forficula scudderi non-LTR Forficula retrotransposon pseudogene for scudderi reverse transcriptase, clone: WaldoFs1-26 Waldo AB078938 ORF2 Forficula scudderi non-LTR Forficula retrotransposon pseudogene for scudderi reverse transcriptase, clone: WaldoFs1-2 Waldo AB078937 ORF2 Forficula scudderi non-LTR Forficula retrotransposon pseudogene for scudderi reverse transcriptase, clone: WaldoFs1-1
(45) TABLE-US-00002 TABLE 2 Table 2: RLE-type non-LTR retrotransposon elements Family Accession Mobile Element Name/Description Organism CRE EF067892 Colletotrichum cereale Colletotrichum clone 9F8-1558 Ccret3 non-LTR cereale retrotransposon, partial sequence CRE EF067894 Colletotrichum cereale Colletotrichum clone 9F8-2137 Ccret3 non-LTR cereale retrotransposon, partial sequence CRE MG028000 non-LTR Characidium gomesi voucher Characidium retrotransposon: MNRJ20998 non-LTR gomesi Rex3 retrotransposon Rex3, partial sequence CRE KY566213 non-LTR Characidium gomesi non-LTR Characidim retrotransposon: retrotransposon Rex3, partial gomesi Rex3 sequence CRE GU949558 Kalotermes flavicollis Kalotermes isolate Crete non-LTR flavicollis retrotransposon R2, complete sequence; and R2 protein gene, complete cds CRE; CFU19151 poly dA Crithidia fasciculata Crithidia CRE2 tracts in retrotransposon CRE2 in mini- fasciculata 5 and exon gene, putative reverse 3 UTRs transcriptase gene, complete cds CZAR BR000987 pol TPA_inf: Capsaspora owczarzaki Capsaspora DNA, non-LTR retrotransposon owczarzaki CoL4, complete sequence, strain: ATCC 30864 CZAR BR000986 pol TPA_inf: Capsaspora owczarzaki Capsaspora DNA, non-LTR retrotransposon owczarzaki CoL3, complete sequence, strain: ATCC 30864 CZAR BR000985 pol TPA_inf: Capsaspora owczarzaki Capsaspora DNA, non-LTR retrotransposon owczarzaki CoL2, complete sequence, strain: ATCC 30864 CZAR BR000984 pol TPA_inf: Capsaspora owczarzaki Capsaspora DNA, non-LTR retrotransposon owczarzaki CoL1, complete sequence, strain: ATCC 30864 DongAG; AB097127 rt Anopheles gambiae Anopheles Dong retrotransposon DongAg DNA, gambiae partial sequence EhRLE2 AB097128 rt Entamoeba histolytica Entamoeba retrotransposon EhRLE2 DNA, histolytica complete sequence EhRLE3 AB097129 rt Entamoeba histolytica Entamoeba retrotransposon EhRLE3 DNA, histolytica complete sequence Genie AF440196 endonuclease Giardia intestinalis non-LTR Giardia retrotransposon GENIE 1 pol intestinalis polyprotein gene, complete cds Genie BK000097 endonuclease TPA_exp: Giardia intestinalis Giardia non-LTR retrotransposon Genie intestinalis 1A gene, partial sequence Genie BK000095 endonuclease TPA_exp: Giardia intestinalis Giardia non-LTR retrotransposon Genie intestinalis 1 gene, partial sequence Genie BK000096 insertion site TPA_exp: Giardia intestinalis Giardia for non-LTR non-LTR retrotransposon Genie intestinalis retrotransposon 1 target site sequence Genie 1 Genie AY216701 non- Girardia tigrina GENIE Girardia experimental retrotransposon, complete tigrina evidence, no sequence additional details recorded Genie BK000098 similar to TPA_exp: Giardia intestinalis Giardia endonuclease non-LTR retrotransposon Genie intestinalis 2 gene, complete sequence GilD AF433877 (tca)n (SEQ Giardia intestinalis inactive Giardia ID NO: 1532) non-LTR retrotransposon GilD, intestinalis or (tga)n consensus sequence (SEQ ID NO: 1533), n = 2-4 GilM AF433875 poly(dA) Giardia intestinalis non-LTR Giardia tract LINE-like retrotransposon intestinalis GilM, complete sequence Hero AB097132 rt Danio rerio retrotransposon Danio rerio HERODr DNA, complete sequence Hero AB097130 rt Takifugu rubripes Takifugu rubripes retrotransposon HEROFr DNA, complete sequence HEROTn AB097131 rt Tetraodon nigroviridis Tetraodon retrotransposon HEROTn DNA, nigroviridis complete sequence NeSL_3_135_68117 FJ905846 non-LTR Daphnia pulex non-LTR Daphnia pulex retrotransposon: retrotransposon NeSL_3_135_68117 NeSL_3_135_68117, complete sequence NeSL; DQ099731 target site Caenorhabditis briggsae Caenorhabditis NeSL-1 duplication transposon NeSl-1-like non-LTR briggsae retrotransposon NeSL-1Cb reverse transcriptase (pol) gene, complete cds PERERE-9 BN000800 TPA_exp: Schistosoma mansoni Schistosoma Perere-9 non-LTR mansoni retrotransposon R2 AF015814 R2 Limulus polyphemus Limulus retrotransposon R2, complete polyphemus sequence R2 AF090145 R2 Nasonia vitripennis R2 non-LTR Nasonia retrotransposable element vitripennis reverse transcriptase gene, partial cds R2 AF015818 R2 Porcellio scaber Porcellio scaber retrotransposon R2, complete sequence R2 AF015815 R2 Anurida maritima Anurida maritima retrotransposon R2, complete sequence R2 M16558 R2 Bombyx mori rDNA insertion Bombyx mori element R2 (typeII), complete cds R2 AF015819 R2 Forficula auricularia Forficula retrotransposon R2, complete auricularia sequence. R2 EU854578 R2 Triops cancriformis non-LTR Triops retrotransposon R2 reverse cancriformis transcriptase gene, complete cds R2 GU949555 R2 Reticulitermes lucifugus Reticulitermes non-LTR retrotransposon R2, lucifugus complete sequence; and R2 protein gene, complete cds R2 AB097123 rt Ciona intestinalis Ciona intestinalis retrotransposon R2Ci-C DNA, partial sequence R2 AB097124 rt Ciona intestinalis Ciona intestinalis retrotransposon R2Ci-D DNA, partial sequence R2 FJ461304 R2 Rhynchosciara americana Rhynchosciara non-LTR retrotransposon RaR2 americana reverse transcriptase gene, complete cds R2 AB097121 rt Ciona intestinalis Ciona intestinalis retrotransposon R2Ci-A DNA, complete sequence R2 KP657892 R2 Bacillus rossius isolate Bacillus rossius roCAP(full).9 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657890 R2 Bacillus rossius isolate Bacillus rossius roCAP(full).7 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657888 R2 Bacillus rossius isolate Bacillus rossius roCAP(full).5 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657870 R2 Bacillus rossius isolate Bacillus rossius roCAP(full).1 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657833 R2 Bacillus rossius isolate Bacillus rossius roANZ(full).13 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657832 R2 Bacillus rossius isolate Bacillus rossius roANZ(full).12 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657830 R2 Bacillus rossius isolate Bacillus rossius roANZ(full).10 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657807 R2 Bacillus rossius isolate Bacillus rossius roANZ(?101).8 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657806 R2 Bacillus rossius isolate Bacillus rossius roANZ(?101).7 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657805 R2 Bacillus rossius isolate Bacillus rossius roANZ(?101).6 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657802 R2 Bacillus rossius isolate Bacillus rossius roANZ(?101).3 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657799 R2 Bacillus rossius isolate Bacillus rossius roANZ(?101).1 retrotransposon R2Br reverse transcriptase gene, partial cds R2 FJ461304 R2 Rhynchosciara americana Rhynchosciara non-LTR retrotransposon RaR2 americana reverse transcriptase gene, complete cds R2 JQ082370 polyA_signal_sequence Eyprepocnemis plorans non-LTR Eyprepocnemis retrotransposon R2 R2 plorans protein gene, complete cds R2 KJ958672 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reCUR4_deg, partial sequence R2 KJ958671 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reCUR3_deg, partial sequence R2 KJ958670 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reCUR2_deg, partial sequence R2 KJ958669 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reCUR1_deg, partial sequence R2 KJ958668 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reCDF6_deg, partial sequence R2 KJ958667 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reCDF5_deg, partial sequence R2 KJ958666 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reCDF4_deg, partial sequence R2 KJ958665 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reCDF3_deg, partial sequence R2 KJ958664 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reCDF2_deg, partial sequence R2 KJ958663 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reCDF1_deg, partial sequence R2 KJ958662 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reCOM10_deg, partial sequence R2 KJ958661 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reCOM5_deg, partial sequence R2 KJ958660 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reCOM7_deg, partial sequence R2 KJ958659 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reCOM3_deg, partial sequence R2 KJ958658 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reCOM2_deg, partial sequence R2 KJ958657 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reMSN6_deg, partial sequence R2 KJ958656 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reMSN5_deg, partial sequence R2 KJ958655 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reMSN4_deg, partial sequence R2 KJ958654 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reMSN3_deg, partial sequence R2 KJ958653 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reMSN2_deg, partial sequence R2 KJ958652 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reMSN1_deg, partial sequence R2 KJ958651 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon rePAT8_deg, partial sequence R2 KJ958650 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon rePAT7_deg, partial sequence R2 KJ958649 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon rePAT6_deg, partial sequence R2 KJ958648 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon rePAT5_deg, partial sequence R2 KJ958647 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon rePAT4_deg, partial sequence R2 KJ958646 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon rePAT3_deg, partial sequence R2 KJ958645 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon rePAT2_deg, partial sequence R2 KJ958644 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon rePAT1_deg, partial sequence R2 KJ958643 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon rePAT9_deg, partial sequence R2 KJ958642 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon trKOR2_deg, partial sequence R2 KJ958641 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reGAB9_deg, partial sequence R2 KJ958640 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reGAB8_deg, partial sequence R2 KJ958639 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reGAB7_deg, partial sequence R2 KJ958638 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reGAB6_deg, partial sequence R2 KJ958637 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reGAB5_deg, partial sequence R2 KJ958636 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reGAB4_deg, partial sequence R2 KJ958635 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reGAB3_deg, partial sequence R2 KJ958634 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reGAB2_deg, partial sequence R2 KJ958633 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reGAB1_deg, partial sequence R2 KJ958632 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reTDS1_deg, partial sequence R2 KJ958631 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reTDS11_deg, partial sequence R2 KJ958629 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reCOM8, partial sequence R2 KJ958628 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reCOM6, partial sequence R2 KJ958627 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reCOM4, partial sequence R2 KJ958626 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reCOM1, partial sequence R2 KJ958624 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon trKOR10, partial sequence R2 KJ958623 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reGAB10, partial sequence R2 KJ958619 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon trKOR4, partial sequence R2 KJ958618 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon trKOR3, partial sequence R2 KJ958617 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon trKOR1, partial sequence R2 KJ958613 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reVIR4, partial sequence R2 KJ958612 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reVIR3, partial sequence R2 KJ958611 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reVIR2, partial sequence R2 KJ958610 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reVIR1, partial sequence R2 KJ958609 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reTDS17, partial sequence R2 KJ958608 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reTDS16, partial sequence R2 KJ958607 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reTDS15, partial sequence R2 KJ958606 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reTDS14, partial sequence R2 KJ958605 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reTDS13, partial sequence R2 KJ958604 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reTDS12, partial sequence R2 KJ958603 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reTDS7, partial sequence R2 KJ958602 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reTDS6, partial sequence R2 KJ958601 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reTDS5, partial sequence R2 KJ958600 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reTDS4, partial sequence R2 KJ958599 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reTDS3, partial sequence R2 KJ958598 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reTDS2, partial sequence R2 KJ958594 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reBER4, partial sequence R2 KJ958593 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reBER2, partial sequence R2 KJ958592 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reBER1, partial sequence R2 KJ958591 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roFOL7, partial sequence R2 KJ958590 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roFOL6, partial sequence R2 KJ958589 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roFOL5, partial sequence R2 KJ958588 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roFOL4, partial sequence R2 KJ958587 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roFOL3, partial sequence R2 KJ958586 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roFOL2, partial sequence R2 KJ958585 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roFOL1, partial sequence R2 KJ958584 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roANZ10, partial sequence R2 KJ958583 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roANZ9, partial sequence R2 KJ958582 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roANZ8, partial sequence R2 KJ958581 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roANZ7, partial sequence R2 KJ958580 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roANZ6, partial sequence R2 KJ958579 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roANZ5, partial sequence R2 KJ958578 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roANZ4, partial sequence R2 KJ958577 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roANZ3, partial sequence R2 KJ958576 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roANZ2, partial sequence R2 KJ958575 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roANZ1, partial sequence R2 KJ958574 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roCAP9, partial sequence R2 KJ958573 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roCAP8, partial sequence R2 KJ958572 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roCAP7, partial sequence R2 KJ958571 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roCAP6, partial sequence R2 KJ958570 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roCAP5, partial sequence R2 KJ958569 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roCAP4, partial sequence R2 KJ958568 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roCAP3, partial sequence R2 KJ958567 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roCAP2, partial sequence R2 KJ958566 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roCAP1, partial sequence R2 KJ958565 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon roCAP10, partial sequence R2 JN937654 R2 Lepidurus apus lubbocki Lepidurus apus isolate lu8a 28S ribosomal lubbocki RNA gene, partial sequence; and non-LTR retrotransposon psR2Ll, complete sequence R2 JN937653 R2 Lepidurus apus lubbocki Lepidurus apus isolate lu7a 28S ribosomal lubbocki RNA gene, partial sequence; and non-LTR retrotransposon psR2Ll, complete sequence R2 JN937652 R2 Lepidurus apus lubbocki Lepidurus apus isolate lu2a 28S ribosomal lubbocki RNA gene, partial sequence; and non-LTR retrotransposon psR2Ll, complete sequence R2 JN937651 R2 Lepidurus apus lubbocki Lepidurus apus isolate b7c7 28S ribosomal lubbocki RNA gene, partial sequence; and non-LTR retrotransposon psR2Ll, complete sequence R2 JN937650 R2 Lepidurus apus lubbocki Lepidurus apus isolate b6c4 28S ribosomal lubbocki RNA gene, partial sequence; and non-LTR retrotransposon psR2Ll, complete sequence R2 JN937649 R2 Lepidurus apus lubbocki Lepidurus apus isolate lu5a 28S ribosomal lubbocki RNA gene, partial sequence; and non-LTR retrotransposon psR2Ll, complete sequence R2 JN937648 R2 Lepidurus apus lubbocki Lepidurus apus isolate lu1a 28S ribosomal lubbocki RNA gene, partial sequence; and non-LTR retrotransposon psR2Ll, complete sequence R2 JN937647 R2 Lepidurus apus lubbocki Lepidurus apus isolate b6c5 28S ribosomal lubbocki RNA gene, partial sequence; and non-LTR retrotransposon psR2Ll, complete sequence R2 JN937646 R2 Lepidurus apus lubbocki Lepidurus apus isolate b6c6 28S ribosomal lubbocki RNA gene, partial sequence; and non-LTR retrotransposon psR2Ll, complete sequence R2 JN937645 R2 Lepidurus apus lubbocki Lepidurus apus isolate lu4a 28S ribosomal lubbocki RNA gene, partial sequence; and non-LTR retrotransposon psR2Ll, complete sequence R2 JN937644 R2 Lepidurus apus lubbocki Lepidurus apus isolate lu3a 28S ribosomal lubbocki RNA gene, partial sequence; and non-LTR retrotransposon psR2Ll, complete sequence R2 JN937643 R2 Lepidurus apus lubbocki Lepidurus apus isolate b6c3 28S ribosomal lubbocki RNA gene, partial sequence; and non-LTR retrotransposon psR2Ll, complete sequence R2 JN937642 R2 Lepidurus apus lubbocki Lepidurus apus isolate lu6a 28S ribosomal lubbocki RNA gene, partial sequence; and non-LTR retrotransposon psR2Ll, complete sequence R2 JN937641 R2 Lepidurus apus lubbocki Lepidurus apus isolate LM5h2 28S ribosomal lubbocki RNA gene, partial sequence; and non-LTR retrotransposon R2Ll, complete sequence R2 JN937640 R2 Lepidurus apus lubbocki Lepidurus apus isolate LM2h5 28S ribosomal lubbocki RNA gene, partial sequence; and non-LTR retrotransposon R2Ll, complete sequence R2 JN937639 R2 Lepidurus apus lubbocki Lepidurus apus isolate LM2h4 28S ribosomal lubbocki RNA gene, partial sequence; and non-LTR retrotransposon R2Ll, complete sequence R2 JN937638 R2 Lepidurus apus lubbocki Lepidurus apus isolate LM5h5 28S ribosomal lubbocki RNA gene, partial sequence; and non-LTR retrotransposon R2Ll, complete sequence R2 JN937637 R2 Lepidurus apus lubbocki Lepidurus apus isolate LM5h4 28S ribosomal lubbocki RNA gene, partial sequence; and non-LTR retrotransposon R2Ll, complete sequence R2 JN937636 R2 Lepidurus apus lubbocki Lepidurus apus isolate LM5h3 28S ribosomal lubbocki RNA gene, partial sequence; and non-LTR retrotransposon R2Ll, complete sequence R2 JN937635 R2 Lepidurus apus lubbocki Lepidurus apus isolate LM5h1 28S ribosomal lubbocki RNA gene, partial sequence; and non-LTR retrotransposon R2Ll, complete sequence R2 JN937634 R2 Lepidurus apus lubbocki Lepidurus apus isolate LM2h3 28S ribosomal lubbocki RNA gene, partial sequence; and non-LTR retrotransposon R2Ll, complete sequence R2 JN937633 R2 Lepidurus apus lubbocki Lepidurus apus isolate LM2h2 28S ribosomal lubbocki RNA gene, partial sequence; and non-LTR retrotransposon R2Ll, complete sequence R2 JN937632 R2 Lepidurus apus lubbocki Lepidurus apus isolate LM2h1 28S ribosomal lubbocki RNA gene, partial sequence; and non-LTR retrotransposon R2Ll, complete sequence R2 JN937631 R2 Lepidurus arcticus isolate Lepidurus arcticus T6 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2La, complete sequence R2 JN937630 R2 Lepidurus arcticus isolate Lepidurus arcticus T5 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2La, complete sequence R2 JN937629 R2 Lepidurus arcticus isolate Lepidurus arcticus T4 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2La, complete sequence R2 JN937628 R2 Lepidurus arcticus isolate Lepidurus arcticus T3 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2La, complete sequence R2 JN937627 R2 Lepidurus arcticus isolate Lepidurus arcticus T2 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2La, complete sequence R2 JN937626 R2 Lepidurus arcticus isolate Lepidurus arcticus T1 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2La, complete sequence R2 JN937625 R2 Lepidurus arcticus isolate Lepidurus arcticus V4 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2La, complete sequence R2 JN937624 R2 Lepidurus arcticus isolate Lepidurus arcticus V3 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2La, complete sequence R2 JN937623 R2 Lepidurus arcticus isolate Lepidurus arcticus V2 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2La, complete sequence R2 JN937622 R2 Lepidurus arcticus isolate Lepidurus arcticus V1 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2La, complete sequence R2 JN937615 R2 Lepidurus couesii isolate Lepidurus couesii D3a7f 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2LcB, complete sequence R2 JN937614 R2 Lepidurus couesii isolate Lepidurus couesii D3a5f 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2LcB, complete sequence R2 JN937613 R2 Lepidurus couesii isolate Lepidurus couesii D3a4f 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2LcB, complete sequence R2 JN937612 R2 Lepidurus couesii isolate Lepidurus couesii D3a3f 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2LcB, complete sequence R2 JN937611 R2 Lepidurus couesii isolate Lepidurus couesii D3a2f 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2LcB, complete sequence R2 JN937610 R2 Lepidurus couesii isolate Lepidurus couesii D3_8 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2LcB, complete sequence R2 JN937609 R2 Lepidurus couesii isolate Lepidurus couesii D3_7 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2LcB, complete sequence R2 JN937608 R2 Lepidurus couesii isolate Lepidurus couesii D3_6 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2LcB, complete sequence R2 JN937607 R2 Lepidurus couesii isolate Lepidurus couesii D3_5 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2LcB, complete sequence R2 JN937606 R2 Lepidurus couesii isolate Lepidurus couesii D3_4 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2LcB, complete sequence R2 JN937605 R2 Lepidurus couesii isolate Lepidurus couesii D3_3 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2LcB, complete sequence R2 JN937604 R2 Lepidurus couesii isolate Lepidurus couesii D3_2 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2LcB, complete sequence R2 JN937603 R2 Lepidurus couesii isolate Lepidurus couesii D3_1 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2LcB, complete sequence R2 JN937602 R2 Lepidurus couesii isolate Lepidurus couesii C2_5 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2LcA, complete sequence R2 JN937601 R2 Lepidurus couesii isolate Lepidurus couesii LcoC2r1_5 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2LcA, complete sequence R2 JN937600 R2 Lepidurus couesii isolate Lepidurus couesii LcoC2r1_6 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2LcA, complete sequence R2 JN937599 R2 Lepidurus couesii isolate Lepidurus couesii C2_8 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2LcA, complete sequence R2 JN937598 R2 Lepidurus couesii isolate Lepidurus couesii C2_4 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2LcA, complete sequence R2 JN937597 R2 Lepidurus couesii isolate Lepidurus couesii C2_9 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2LcA, complete sequence R2 JN937596 R2 Lepidurus couesii isolate Lepidurus couesii C2_7 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2LcA, complete sequence R2 JN937595 R2 Lepidurus couesii isolate Lepidurus couesii C2_6 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2LcA, complete sequence R2 JN937594 R2 Lepidurus couesii isolate Lepidurus couesii C2_3 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2LcA, complete sequence R2 JN937593 R2 Lepidurus couesii isolate Lepidurus couesii C2_2 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2LcA, complete sequence R2 JN937592 R2 Lepidurus couesii isolate Lepidurus couesii C2_1 28S ribosomal RNA gene, partial sequence; and non-LTR retrotransposon R2LcA, complete sequence R2 AF015822 R2 ORF Tenebrio molitor Tenebrio molitor retrotransposon R2 reverse transcriptase gene, partial cds R2 AF015817 R2 ORF Tenebrio molitor Tenebrio molitor retrotransposon R2 reverse transcriptase gene, partial cds R2 AF015816 R2 ORF Hippodamia convergens Hippodamia retrotransposon R2 reverse convergens transcriptase gene, partial cds R2 KP657866 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roCAP(?714).5 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657865 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roCAP(?714).4 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657863 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roCAP(?714).2 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657862 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roCAP(?714).1 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657861 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roCAP(?1297).9 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657860 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roCAP(?1297).8 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657859 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roCAP(?1297).7 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657858 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roCAP(?1297).6 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657857 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roCAP(?1297).5 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657856 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roCAP(?1297).4 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657855 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roCAP(?1297).3 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657854 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roCAP(?1297).2 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657853 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roCAP(?1297).10 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657852 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roCAP(?1297).1 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657851 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roCAP(?1172).9 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657850 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roCAP(?1172).8 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657849 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roCAP(?1172).7 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657848 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roCAP(?1172).6 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657847 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roCAP(?1172).5 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657846 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roCAP(?1172).4 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657845 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roCAP(?1172).3 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657844 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roCAP(?1172).2 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657843 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roCAP(?1172).10 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657842 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roCAP(?1172).1 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657824 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roANZ(?1062).5 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657823 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roANZ(?1062).4 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657822 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roANZ(?1062).3 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657820 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roANZ(?1062).2 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657816 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roANZ(?1062).16 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657814 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roANZ(?1062).14 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657810 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roANZ(?1062).10 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657809 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br roANZ(?1062).1 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657759 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br rePAT(?1297).8 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657757 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br rePAT(?1297).6 retrotransposon R2Br reverse transcriptase gene, partial cds R2 KP657751 retrotransposon: Bacillus rossius isolate Bacillus rossius R2Br rePAT(?1297).1 retrotransposon R2Br reverse transcriptase gene, partial cds R2 AB097125 rt Ciona savignyi Ciona savignyi retrotransposon R2Cs-D DNA, partial sequence R2 AB097124 rt Ciona intestinalis Ciona intestinalis retrotransposon R2Ci-D DNA, partial sequence R2 AB097123 rt Ciona intestinalis Ciona intestinalis retrotransposon R2Ci-C DNA, partial sequence R2 AB097121 rt Ciona intestinalis Ciona intestinalis retrotransposon R2Ci-A DNA, complete sequence R2 AB201417 rt Triops longicaudatus non-LTR Triops retrotransposon R2Tl gene longicaudatus for reverse transcriptase, partial cds R2 AB201416 rt Procambarus clarkii non-LTR Procambarus retrotransposon R2Pc gene for clarkii reverse transcriptase, partial cds R2 AB201415 rt Hasarius adansoni non-LTR Hasarius adansoni retrotransposon R2Ha gene for reverse transcriptase, partial cds R2 AB201414 rt Metacrinus rotundus non-LTR Metacrinus retrotransposon R2Mr gene for rotundus reverse transcriptase, partial cds R2 AB201413 rt Mauremys reevesii non-LTR Mauremys reevesii retrotransposon R2Cr-B2 gene for reverse transcriptase, partial cds R2 AB201412 rt Mauremys reevesii non-LTR Mauremys reevesii retrotransposon R2Cr-B1 gene for reverse transcriptase, partial cds R2 AB201411 rt Mauremys reevesii non-LTR Mauremys reevesii retrotransposon R2Cr-A gene for reverse transcriptase, partial cds R2 AB201410 rt Oryzias latipes non-LTR Oryzias latipes retrotransposon R2Ol-A gene for reverse transcriptase, partial cds R2 AB201409 rt Tanichthys albonubes non-LTR Tanichthys retrotransposon R2Ta gene albonubes for reverse transcriptase, partial cds R2 AB201408 rt Eptatretus burgeri non-LTR Eptatretus retrotransposon R2Eb gene for burgeri reverse transcriptase, partial cds R2 DQ099732 transposon: Aedes aegypti transposon R2- Aedes aegypti R2-like like non-LTR retrotransposon non-LTR R2Ag reverse transcriptase retrotransposon (pol) gene, partial cds R2Ag R2 DQ099728 transposon: Aedes aegypti transposon R2- Aedes aegypti R2-like like non-LTR retrotransposon non-LTR R2Ag_B reverse transcriptase retrotransposon (pol) gene, partial cds R2Ag_B R2 GU949559 Kalotermes flavicollis Kalotermes isolate Livorno non-LTR flavicollis retrotransposon R2, complete sequence; and R2 protein gene, complete cds R2 GU949557 Reticulitermes balkanensis Reticulitermes non-LTR retrotransposon R2, balkanensis partial sequence; and R2 protein gene, partial cds R2 GU949556 Reticulitermes grassei Reticulitermes non-LTR retrotransposon R2, grassei partial sequence; and R2 protein gene, partial cds R2 GU949554 Reticulitermes urbis non-LTR Reticulitermes retrotransposon R2, complete urbis sequence; and R2 protein gene, complete cds R2 AF412214 Schistosoma japonicum clone Schistosoma S10A non-LTR retrotransposon japonicum SjR2-like, partial sequence R2 AF015685 Drosophila mercatorum R2 Drosophila retrotransposon reverse mercatorum transcriptase domain protein gene, complete cds R2 KJ958674 Bacillus rossius Bacillus rossius retrotransposon R2Br, complete sequence R2 AF015814 R2 Limulus polyphemus Limulus polyphemus retrotransposon R2, complete sequence R2 M16558 R2 Bombyx mori rDNA insertion Bombyx mori element R2 (typeII), complete cds. R2 GQ398057 R9Av Adineta vaga copy 1 non-LTR Adineta vaga retrotransposon R9, complete sequence; and disrupted 28S ribosomal RNA gene, partial sequence R2Bm AB076841 R2 Bombyx mori non-LTR Bombyx mori retrotransposon R2Bm gene for reverse transcriptase, complete cds and 28S rRNA R2Ci-B AB097122 rt Ciona intestinalis Ciona intestinalis retrotransposon R2Ci-B DNA, complete sequence R2Dr AB097126 rt Danio rerio retrotransposon Danio rerio R2Dr DNA, complete sequence R4 AH003588 Parascaris equorum transposon Parascaris equorum non-LTR retrotransposable element R4 reverse transcriptase gene, partial cds R4 ALU29445 R4 Ascaris lumbricoides Ascaris site-specific non-LTR lumbricoides retrotransposable element R4 in 26S rDNA, complete sequence R4 L08889 R4 Dong Bombyx mori reverse Bombyx mori transcriptase gene, complete cds R4 DQ836390 MalR4-5 Maculinea alcon R4-like Phengaris alcon non-LTR retrotransposon R4-5 reverse transcriptase (RT) pseudogene, partial sequence R4 DQ836385 MnaR4-3 Maculinea nausithous R4-like Phengaris non-LTR retrotransposon R4-3 nausithous reverse transcriptase (RT) pseudogene, partial sequence R4 DQ836386 MnaR4-4 Maculinea nausithous R4-like Phengaris non-LTR retrotransposon R4-4 nausithous reverse transcriptase (RT) pseudogene, partial sequence R4 DQ836387 MnaR4-7 Maculinea nausithous R4-like Phengaris non-LTR retrotransposon R4-7 nausithous reverse transcriptase (RT) pseudogene, partial sequence R4 DQ836388 MnaR4-8 Maculinea nausithous R4-like Phengaris non-LTR retrotransposon R4-8 nausithous reverse transcriptase (RT) pseudogene, partial sequence R4 DQ836389 MnaR4-9 Maculinea nausithous R4-like Phengaris non-LTR retrotransposon R4-9 nausithous reverse transcriptase (RT) pseudogene, partial sequence R4 DQ836379 MteR4-1 Maculinea teleius R4-like Phengaris teleius non-LTR retrotransposon R4-1 reverse transcriptase (RT) pseudogene, partial sequence R4 DQ836384 MteR4-10 Maculinea teleius R4-like Phengaris teleius non-LTR retrotransposon R4-10 reverse transcriptase (RT) pseudogene, partial sequence R4 DQ836367 MteR4-2 Maculinea teleius R4-like Phengaris teleius non-LTR retrotransposon R4-2 reverse transcriptase (RT) gene, partial cds R4 DQ836380 MteR4-3 Maculinea teleius R4-like Phengaris teleius non-LTR retrotransposon R4-3 reverse transcriptase (RT) pseudogene, partial sequence R4 DQ836381 MteR4-4 Maculinea teleius R4-like Phengaris teleius non-LTR retrotransposon R4-4 reverse transcriptase (RT) pseudogene, partial sequence R4 DQ836382 MteR4-6 Maculinea teleius R4-like Phengaris teleius non-LTR retrotransposon R4-6 reverse transcriptase (RT) pseudogene, partial sequence R4 DQ836383 MteR4-8 Maculinea teleius R4-like Phengaris teleius non-LTR retrotransposon R4-8 reverse transcriptase (RT) pseudogene, partial sequence R4 DQ836374 transposon: Maculinea alcon R4-like Phengaris alcon R4-like non-LTR retrotransposon R4-1 non-LTR reverse transcriptase (RT) retrotransposon gene, partial cds R4-1 R4 DQ836373 transposon: Maculinea nausithous R4-like Phengaris R4-like non-LTR retrotransposon R4-1 nausithous non-LTR reverse transcriptase (RT) retrotransposon gene, partial cds R4-1 R4 DQ836375 transposon: Maculinea alcon R4-like Phengaris alcon R4-like non-LTR retrotransposon R4-2 non-LTR reverse transcriptase (RT) retrotransposon gene, partial cds R4-2 R4 DQ836376 transposon: Maculinea alcon R4-like Phengaris alcon R4-like non-LTR retrotransposon R4-3 non-LTR reverse transcriptase (RT) retrotransposon gene, partial cds R4-3 R4 DQ836377 transposon: Maculinea alcon R4-like Phengaris alcon R4-like non-LTR retrotransposon R4-4 non-LTR reverse transcriptase (RT) retrotransposon gene, partial cds R4-4 R4 DQ836371 transposon: Maculinea nausithous R4-like Phengaris R4-like non-LTR retrotransposon R4-5 nausithous non-LTR reverse transcriptase (RT) retrotransposon gene, partial cds R4-5 R4 DQ836368 transposon: Maculinea teleius R4-like Phengaris teleius R4-like non-LTR retrotransposon R4-5 non-LTR reverse transcriptase (RT) retrotransposon gene, partial cds R4-5 R4 DQ836378 transposon: Maculinea alcon R4-like Phengaris alcon R4-like non-LTR retrotransposon R4-6 non-LTR reverse transcriptase (RT) retrotransposon gene, partial cds R4-6 R4 DQ836372 transposon: Maculinea nausithous R4-like Phengaris R4-like non-LTR retrotransposon R4-6 nausithous non-LTR reverse transcriptase (RT) retrotransposon gene, partial cds R4-6 R4 DQ836369 transposon: Maculinea teleius R4-like Phengaris teleius R4-like non-LTR retrotransposon R4-7 non-LTR reverse transcriptase (RT) retrotransposon gene, partial cds R4-7 R4 DQ836370 transposon: Maculinea teleius R4-like Phengaris teleius R4-like non-LTR retrotransposon R4-9 non-LTR reverse transcriptase (RT) retrotransposon gene, partial cds R4-9 R4 AF286191 transposon: Xiphophorus maculatus Xiphophorus retrotransposon retrotransposon Rex6 reverse maculatus Rex6 transcriptase pseudogene, partial sequence R5 KJ958673 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reCUR5_deg, partial sequence R5 KJ958620 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon trKOR5, partial sequence R5 KJ958614 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reVIR5, partial sequence R5 KJ958595 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reBER5, partial sequence R5 AJ006560 transposon: Anopheles merus Amer5 non-LTR Anopheles merus Amer5 retrotransposon encoding non-LTR reverse transcriptase, partial retrotransposon R5 AF352479 transposon: Chironomus circumdatus clone Chironomus NLRCth1-like cir5 transposon NLRCth1-like circumdatus non-LTR non-LTR retrotransposon retrotransposon reverse transcriptase gene, partial cds R5 AF352454 transposon: Chironomus alpestris clone Chironomus NLRCth1-like dor50 transposon NLRCth1-like alpestris non-LTR non-LTR retrotransposon retrotransposon reverse transcriptase gene, partial cds R5 AF352404 transposon: Chironomus luridus clone lur5 Chironomus luridus NLRCth1-like transposon NLRCth1-like non-LTR non-LTR retrotransposon retrotransposon reverse transcriptase gene, partial cds R8 FR852798 poly(A) tail Beta vulgaris subsp. Beta vulgaris vulgaris subsp. vulgaris LINE-type retrotransposon Belline2_3 R8 KJ958630 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon reVIR8_deg, partial sequence R8 KJ958621 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon trKOR8, partial sequence R8 KP001560 rex3-RT Iberochondrostoma lusitanicum Iberochondrostoma pseudogene_Contig clone tr8a non-LTR lusitanicum ILU_TR8 retrotransposon Rex3, complete sequence R8 FR852885 right terminal Beta vulgaris subsp. Beta vulgaris repeat vulgaris subsp. vulgaris LINE-type retrotransposon BNR114 (Belline1_114) R8 FR852856 right terminal Beta vulgaris subsp. Beta vulgaris repeat vulgarius subsp. vulgaris LINE-type retrotransposon BNR45 (Belline1_45) R8 FR852844 right terminal Beta vulgaris subsp. Beta vulgaris repeat vulgarius subsp. vulgaris LINE-type retrotransposon BNR22 (Belline1_22) R8 FR852836 right terminal Beta vulgaris subsp. Beta vulgaris repeat vulgarius subsp. vulgaris LINE-type retrotransposon Belline17_6 R8 FR852834 right terminal Beta vulgaris subsp. Beta vulgaris repeat vulgarius subsp. vulgaris LINE-type retrotransposon Belline17_4 R8 FR852831 right terminal Beta vulgaris subsp. Beta vulgaris repeat vulgarius subsp. vulgaris LINE-type retrotransposon Belline17_1 R8 FR852829 right terminal Beta vulgaris subsp. Beta vulgaris repeat vulgarius subsp. vulgaris LINE-type retrotransposon Belline16_2 R8 FR852827 right terminal Beta vulgaris subsp. Beta vulgaris repeat vulgarius subsp. vulgaris LINE-type retrotransposon Belline15_3 R8 FR852819 right terminal Beta vulgaris subsp. Beta vulgaris repeat vulgarius subsp. vulgaris LINE-type retrotransposon Belline12_2 R8 FR852813 right terminal Beta vulgaris subsp. Beta vulgaris repeat vulgarius subsp. vulgaris LINE-type retrotransposon Belline9_5 R8 FR852807 right terminal Beta vulgaris subsp. Beta vulgaris repeat vulgarius subsp. vulgaris LINE-type retrotransposon Belline8_1 R8 FR852806 right terminal Beta vulgaris subsp. Beta vulgaris repeat vulgarius subsp. vulgaris LINE-type retrotransposon Belline7_18 R8 FR852799 right terminal Beta vulgaris subsp. Beta vulgaris repeat vulgarius subsp. vulgaris LINE-type retrotransposon Belline2_4 R8 FR852795 right terminal Beta vulgaris subsp. Beta vulgaris repeat vulgarius subsp. vulgaris LINE-type retrotransposon BNR19 (Belline1_19) R8 AF352481 transposon: Chironomus circumdatus clone Chironomus NLRCth1-like cir8 transposon NLRCth1-like circumdatus non-LTR non-LTR retrotransposon retrotransposon reverse transcriptase gene, partial cds R8; FR852861 right terminal Beta vulgaris subsp. Beta vulgaris R5 repeat vulgaris subsp. vulgaris LINE-type retrotransposon BNR59 (Belline1_59) R8; FR852857 right terminal Beta vulgaris subsp. Beta vulgaris R5 repeat vulgaris subsp. vulgaris LINE-type retrotransposon BNR51 (Belline1_51) R8; FR852866 right terminal Beta vulgaris subsp. Beta vulgaris R7 repeat vulgaris subsp. vulgaris LINE-type retrotransposon BNR76 (Belline1_76) R8; FR852838 right terminal Beta vulgaris subsp. Beta vulgaris R7 repeat vulgaris subsp. vulgaris LINE-type retrotransposon BNR7 (Belline1_7) R8; AF352455 transposon: Chironomus alpestris clone Chironomus R7 NLRCth1-like dor70 note identical sequence alpestris non-LTR found in dor80 transposon retrotransposon NLRCth1-like non-LTR retrotransposon reverse transcriptase gene, partial cds R8; FR852878 right terminal Beta vulgaris subsp. Beta vulgaris R9 repeat vulgaris subsp. vulgaris LINE-type retrotransposon BNR96 (Belline1_96) R9 KJ958625 R2 Bacillus rossius non-LTR Bacillus rossius retrotransposon trKOR9, partial sequence Rex6 AJ293547 en Oreochromis niloticus Rex6 Oreochromis retrotransposon partial en niloticus pseudogene for endonuclease, clone rex6-Oni-3 Rex6 AJ293546 en Oreochromis niloticus Rex6 Oreochromis retrotransposon partial en niloticus pseudogene for endonuclease, clone rex6-Oni-2 Rex6 AJ293545 en Oreochromis niloticus Rex6 Oreochromis retrotransposon partial en niloticus pseudogene for endonuclease, clone rex6-Oni-1 Rex6 AJ293517 en Xiphophorus maculatus Rex6 Xiphophorus retrotransposon partial en maculatus pseudogene for endonuclease, clone Rex6-Xma-6 Rex6 AJ293516 en Xiphophorus maculatus Rex6 Xiphophorus retrotransposon partial en maculatus pseudogene for endonuclease, clone Rex6-Xma-5 Rex6 AJ293515 en Xiphophorus maculatus Rex6 Xiphophorus retrotransposon partial en maculatus pseudogene for endonuclease, clone Rex6-Xma-4 Rex6 AJ293514 en Xiphophorus maculatus Rex6 Xiphophorus retrotransposon partial en maculatus pseudogene for endonuclease, clone Rex6-Xma-3 Rex6 AJ293513 en Xiphophorus maculatus Rex6 Xiphophorus retrotransposon partial en maculatus pseudogene for endonuclease, clone Rex6-Xma-2 Rex6 AJ293512 en Xiphophorus maculatus Rex6 Xiphophorus retrotransposon partial en maculatus pseudogene for endonuclease, clone Rex6-Xma-1 Rex6 AJ293538 en Poecilia formosa Rex6 Poecilia formosa retrotransposon partial en pseudogene for endonuclease, clone rex6-Pfo-6 Rex6 AJ293537 en Poecilia formosa Rex6 Poecilia formosa retrotransposon partial en pseudogene for endonuclease, clone rex6-Pfo-5 Rex6 AJ293536 en Poecilia formosa Rex6 Poecilia formosa retrotransposon partial en pseudogene for endonuclease, clone rex6-Pfo-4 Rex6 AJ293535 en Poecilia formosa Rex6 Poecilia formosa retrotransposon partial en pseudogene for endonuclease, clone rex6-Pfo-3 Rex6 AJ293534 en Poecilia formosa Rex6 Poecilia formosa retrotransposon partial en pseudogene for endonuclease, clone rex6-Pfo-2 Rex6 AJ293533 en Poecilia formosa Rex6 Poecilia formosa retrotransposon partial en pseudogene for endonuclease, clone rex6-Pfo-1 Rex6 AJ293526 en Poeciliopsis gracilis Rex6 Poeciliopsis retrotransposon partial en gracilis pseudogene for endonuclease, clone rex6-Pgr-4 Rex6 AJ293525 en Poeciliopsis gracilis Rex6 Poeciliopsis retrotransposon partial en gracilis pseudogene for endonuclease, clone rex6-Pgr-3 Rex6 AJ293524 en Poeciliopsis gracilis Rex6 Poeciliopsis retrotransposon partial en gracilis pseudogene for endonuclease, clone rex6-Pgr-2 Rex6 AJ293523 en Poeciliopsis gracilis Rex6 Poeciliopsis retrotransposon partial en gracilis pseudogene for endonuclease, clone rex6-Pgr-1 Rex6 AJ293522 en Oryzias latipes Rex6 Oryzias latipes retrotransposon partial en pseudogene for endonuclease, clone rex6-Ola-5 Rex6 AJ293521 en Oryzias latipes Rex6 Oryzias latipes retrotransposon partial en pseudogene for endonuclease, clone Rex6-Ola-4 Rex6 AJ293520 en Oryzias latipes Rex6 Oryzias latipes retrotransposon partial en pseudogene for endonuclease, clone Rex6-Ola-3 Rex6 AJ293519 en Oryzias latipes Rex6 Oryzias latipes retrotransposon partial en pseudogene for endonuclease, clone Rex6-Ola-2 Rex6 AJ293518 en Oryzias latipes Rex6 Oryzias latipes retrotransposon partial en pseudogene for endonuclease, clone Rex6-Ola-1 Rex6 AJ293549 en Cichlasoma labridens Rex6 Herichthys retrotransposon partial en labridens pseudogene for endonuclease, clone rex6-Cla-2 Rex6 AJ293548 en Cichlasoma labridens Rex6 Herichthys retrotransposon partial en labridens pseudogene for endonuclease, clone rex6-Cla-1 Rex6 AJ293544 en Heterandria bimaculata Rex6 Pseudoxiphophorus retrotransposon partial en bimaculatus pseudogene for endonuclease, clone rex6-Hbi-6 Rex6 AJ293543 en Heterandria bimaculata Rex6 Pseudoxiphophorus retrotransposon partial en bimaculatus pseudogene for endonuclease, clone rex6-Hbi-5 Rex6 AJ293542 en Heterandria bimaculata Rex6 Pseudoxiphophorus retrotransposon partial en bimaculatus pseudogene for endonuclease, clone rex6-Hbi-4 Rex6 AJ293541 en Heterandria bimaculata Rex6 Pseudoxiphophorus retrotransposon partial en bimaculatus pseudogene for endonuclease, clone rex6-Hbi-3 Rex6 AJ293540 en Heterandria bimaculata Rex6 Pseudoxiphophorus retrotransposon partial en bimaculatus pseudogene for endonuclease, clone rex6-Hbi-2 Rex6 AJ293539 en Heterandria bimaculata Rex6 Pseudoxiphophorus retrotransposon partial en bimaculatus pseudogene for endonuclease, clone rex6-Hbi-1 Rex6 AJ293532 en Gambusia affinis Rex6 Gambusia affinis retrotransposon partial en pseudogene for endonuclease, clone rex6-Gaf-5 Rex6 AJ293531 en Gambusia affinis Rex6 Gambusia affinis retrotransposon partial en pseudogene for endonuclease, clone rex6-Gaf-5 Rex6 AJ293530 en Gambusia affinis Rex6 Gambusia affinis retrotransposon partial en pseudogene for endonuclease, clone rex6-Gaf-4 Rex6 AJ293529 en Gambusia affinis Rex6 Gambusia affinis retrotransposon partial en pseudogene for endonuclease, clone rex6-Gaf-3 Rex6 AJ293528 en Gambusia affinis Rex6 Gambusia affinis retrotransposon partial en pseudogene for endonuclease, clone rex6-Gaf-2 Rex6 AJ293527 en Gambusia affinis Rex6 Gambusia affinis retrotransposon partial en pseudogene for endonuclease, clone rex6-Gaf-1 Rex6 JX576459 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: i non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576458 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: h non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576457 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: g non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576456 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: f non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576455 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: e non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576454 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: d non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576453 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: c non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576452 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: b non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576451 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: a non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576450 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: z8 non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576449 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: z7 non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576448 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: z6 non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576447 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: z5 non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576446 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: z4 non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576445 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: z3 non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576444 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: z2 non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576443 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: z1 non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576442 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: z non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576441 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: x non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576440 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: v non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576439 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: u non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576438 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: t non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576437 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: s non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576436 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: r non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576435 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: q non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576434 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: p non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576433 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: o non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576432 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: n non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576431 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: m non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576430 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: l non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576429 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: k non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576428 non-LTR Symphysodon discus isolate Symphysodon discus retrotransposon: j non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576427 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: e non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 JX576426 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: d non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 JX576425 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: c non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 JX576424 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: b non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 JX576423 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: a non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 JX576422 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: g non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576421 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: f non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576420 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: e non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576419 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: d non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576418 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: c non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576417 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: b non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576416 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: a non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576415 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: g non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 JX576414 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: f non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 JX576413 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: e non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 JX576412 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: d non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 JX576411 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: c non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 JX576410 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: b non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 JX576409 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: a non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 JX576408 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: h non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576407 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: g non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576406 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: f non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576405 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: e non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576404 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: d non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576403 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: c non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576402 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: b non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 JX576401 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: a non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131853 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: z7 non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131852 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: z6 non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131851 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: z5 non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131850 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: z4 non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131849 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: z3 non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131848 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: z2 non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131847 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: z1 non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131846 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: z non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131845 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: x non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131844 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: v non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131843 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: u non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131842 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: t non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131841 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: s non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131840 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: r non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131839 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: q non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131838 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: p non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131837 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: n non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131836 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: m non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131835 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: l non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131834 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: k non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131833 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: j non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131832 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: i non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131831 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: h non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131830 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: g non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131829 non-LTR Pterophyllum scalare clone Pterophyllum retrotransposon: f non-LTR retrotransposon scalare Rex6 Rex6, partial sequence Rex6 KF131828 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: z6 non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131827 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: z5 non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131826 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: z4 non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131825 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: z3 non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131824 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: z2 non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131823 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: z1 non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131822 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: z non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131821 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: x non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131820 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: v non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131819 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: u non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131818 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: t non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131817 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: s non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131816 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: r non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131815 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: q non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131814 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: p non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131813 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: o non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131812 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: n non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131811 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: m non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131810 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: l non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131809 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: k non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131808 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: j non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131807 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: i non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131806 non-LTR Geophagus proximus clone Geophagus proximus retrotransposon: h non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131805 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: z10 non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131804 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: z9 non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131803 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: z8 non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131802 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: z7 non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131801 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: z6 non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131800 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: z5 non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131799 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: z4 non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131798 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: z3 non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131797 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: z2 non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131796 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: z1 non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131795 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: z non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131794 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: x non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131793 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: v non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131792 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: u non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131791 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: t non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131790 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: s non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131789 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: r non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131788 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: q non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131787 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: p non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131786 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: o non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131785 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: n non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131784 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: m non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131783 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: l non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131782 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: k non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131781 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: j non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131780 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: i non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131779 non-LTR Astronotus ocellatus clone Astronotus retrotransposon: h non-LTR retrotransposon ocellatus Rex6 Rex6, partial sequence Rex6 KF131778 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: z6 non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131777 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: z5 non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131776 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: z4 non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131775 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: z3 non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131774 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: z2 non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131773 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: z1 non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131772 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: z non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131771 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: x non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131770 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: v non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131769 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: u non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131768 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: t non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131767 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: s non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131766 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: r non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131765 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: q non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131764 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: p non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131763 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: o non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131762 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: n non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131761 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: m non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131760 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: l non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131759 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: k non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131758 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: j non-LTR retrotransposon Rex6 Rex6, partial sequence Rex6 KF131757 non-LTR Cichla monoculus clone Cichla monoculus retrotransposon: i non-LTR retrotransposon Rex6 Rex6, partial sequence SLACS JN608782 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-46 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608781 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-45 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608780 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-41 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608779 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-40 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608778 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-38 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608777 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-37 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608776 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-36 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608775 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-35 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608774 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-34 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608773 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-33 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608772 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-32 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608771 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-30 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608770 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-29 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608769 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-28 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608768 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-27 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608767 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-26 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608766 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-25 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608765 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-24 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608764 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-23 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608763 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-22 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608762 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-20 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608761 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-19 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608760 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-18 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608759 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-17 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608758 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-16 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608757 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-13 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608756 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-12 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608755 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-11 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608754 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-10 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608753 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-08 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608752 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-07 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608751 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-06 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608750 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-05 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608749 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-04 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608748 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-03 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608747 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: Y-01 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608746 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-83 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608745 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-81 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608744 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-80 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608743 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-79 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608742 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-78 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608741 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-76 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608740 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-75 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608739 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-74 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608738 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-66 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608737 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-65 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608736 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-62 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608735 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-61 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608734 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-60 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608733 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-59 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608732 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-57 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608731 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-49 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608730 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-41 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608729 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-28 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608728 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-27 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608727 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-23 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608726 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-21 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608725 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-14 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608724 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-11 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608723 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-07 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608722 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-06 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608721 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-05 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608720 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-03 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608719 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: X-01 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608718 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG40 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608717 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG39 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608716 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG38 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608715 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG37 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608714 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG36 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608713 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG35 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608712 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG34 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608711 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG33 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608710 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG32 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608709 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG31 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608708 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG30 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608707 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG29 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608706 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG28 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608705 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG27 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608704 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG26 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608703 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG25 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608702 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG24 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608701 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG23 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608700 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG22 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608699 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG21 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608698 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG20 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608697 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG19 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608696 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG18 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608695 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG17 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608694 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG16 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608693 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG14 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608692 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG13 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608691 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG12 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608690 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG11 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608689 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG10 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608688 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG09 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608687 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG08 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608686 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG07 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608685 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG06 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608684 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG05 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608683 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG04 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608682 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG03 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608681 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG02 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608680 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mG01 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608679 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG40 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608678 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG39 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608677 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG38 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608676 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG37 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608675 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG36 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608674 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG35 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608673 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG34 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608672 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG33 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608671 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG32 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608670 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG31 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608669 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG30 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608668 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG29 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608667 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG28 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608666 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG27 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608665 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG26 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608664 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG25 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608663 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG24 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608662 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG23 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608661 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG22 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608660 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG21 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608659 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG20 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608658 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG19 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608657 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG18 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608656 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG17 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608655 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG16 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608654 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG15 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608653 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG14 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608652 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG13 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608651 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG11 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608650 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG10 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608649 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG09 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608648 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG08 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608647 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG07 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608646 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG06 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608645 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG05 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608644 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG03 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608643 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG02 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608642 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fG01 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608641 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-47 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608640 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-46 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608639 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-45 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608638 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-44 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608637 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-42 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608636 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-41 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608635 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-40 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608634 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-37 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608633 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-36 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608632 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-35 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608631 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-34 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608630 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-33 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608629 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-32 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608628 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-31 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608627 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-26 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608626 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-24 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608625 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-23 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608624 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-22 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608623 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-21 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608622 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-20 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608621 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-19 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608620 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-18 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608619 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-17 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608618 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-16 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608617 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-15 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608616 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-14 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608615 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-13 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608614 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-12 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608613 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-11 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608612 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-10 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608611 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-09 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608610 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-08 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608609 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-06 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608608 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-05 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608607 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-03 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608606 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: A-01 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608236 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mR40 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608235 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mR39 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608234 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mR38 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608233 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mR35 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608232 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mR32 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608231 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mR29 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608230 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mR26 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608229 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mR25 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608228 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mR18 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608227 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mR17 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608226 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mR14 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608225 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mR13 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608224 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mR12 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608223 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mR11 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608222 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mR09 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608221 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mR08 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608220 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mR07 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608219 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mR06 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608218 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mR05 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608217 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mR04 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608216 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mR03 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608215 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mR02 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608214 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mR01 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608213 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL40 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608212 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL39 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608211 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL37 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608210 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL36 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608209 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL35 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608208 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL34 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608207 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL33 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608206 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL32 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608205 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL31 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608204 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL28 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608203 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL27 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608202 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL25 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608201 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL24 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608200 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL23 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608199 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL19 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608198 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL16 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608197 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL13 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608196 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL12 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608195 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL11 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608194 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL10 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608193 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL07 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608192 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL04 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608191 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL03 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608190 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL02 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608189 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mL01 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608188 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF40 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608187 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF39 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608186 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF38 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608185 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF37 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608184 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF36 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608183 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF35 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608182 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF33 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608181 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF32 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608180 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF31 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608179 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF30 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608178 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF29 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608177 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF28 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608176 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF27 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608175 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF26 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608174 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF25 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608173 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF23 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608172 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF22 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608171 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF21 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608170 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF20 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608169 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF19 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608168 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF18 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608167 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF17 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608166 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF16 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608165 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF15 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608164 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF14 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608163 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF13 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608162 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF11 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608161 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF10 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608160 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF09 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608159 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF08 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608158 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF07 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608157 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF06 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608156 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF05 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608155 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF04 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608154 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF03 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608153 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF02 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608152 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: mF01 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608151 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR40 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608150 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR39 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608149 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR38 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608148 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR37 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608147 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR36 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608146 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR35 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608145 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR34 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608144 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR33 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608143 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR32 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608142 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR31 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608141 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR30 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608140 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR29 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608139 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR28 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608138 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR27 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608137 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR26 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608136 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR25 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608135 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR24 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608134 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR23 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608133 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR22 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608132 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR21 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608131 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR20 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608130 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR19 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608129 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR18 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608128 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR17 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608127 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR16 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608126 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR15 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608125 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR14 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608124 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR13 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608123 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR12 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608122 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR11 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608121 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR09 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608120 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR08 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608119 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR07 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608118 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR05 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608117 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR04 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608116 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR03 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608115 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR02 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608114 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fR01 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608113 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL40 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608112 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL39 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608111 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL38 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608110 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL37 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608109 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL36 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608108 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL35 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608107 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL34 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608106 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL33 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608105 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL32 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608104 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL31 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608103 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL30 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608102 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL29 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608101 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL28 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608100 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL27 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608099 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL26 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608098 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL25 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608097 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL24 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608096 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL23 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608095 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL22 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608094 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL20 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608093 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL19 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608092 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL18 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608091 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL17 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608090 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL16 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608089 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL15 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608088 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL14 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608087 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL13 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608086 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL12 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608085 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL11 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608084 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL10 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608083 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL09 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608082 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL07 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608081 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL06 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608080 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL05 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608079 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL04 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608078 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL03 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608077 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fL01 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608076 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF40 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608075 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF39 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608074 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF38 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608073 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF37 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608072 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF36 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608071 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF35 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608070 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF34 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608069 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF33 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608068 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF32 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608067 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF31 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608066 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF30 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608065 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF29 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608064 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF28 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608063 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF27 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608062 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF26 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608061 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF24 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608060 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF23 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608059 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF21 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608058 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF20 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608057 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF18 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608056 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF17 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608055 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF16 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608054 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF15 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608053 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF14 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608052 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF13 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608051 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF12 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608050 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF11 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608049 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF10 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608048 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF09 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608047 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF08 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608046 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF06 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608045 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF05 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608044 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF04 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608043 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF03 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence SLACS JN608042 non-LTR Silene latifolia isolate Silene latifolia retrotransposon: fF02 non-LTR retrotransposon SLACS-like SLACS-like, partial sequence YURECi AB097133 rt Ciona intestinalis Ciona intestinalis retrotransposon YURECi DNA, complete sequence CRE . Cnl1 C. neoformans non-LTR Cryptococcus retrotransposon - consensus. neoformans CRE . CRE-1_ACas CRE non-LTR retrotransposon: Acanthamoeba consensus. castellanii CRE . Cre-1_BM Cre-1_BM non-LTR Bombyx mori retrotransposon - consensus. CRE . CRE-1_CCri Non-LTR retrotransposon from Chondrus crispus the red seaweed: consensus. CRE . Cre-1_FCy Cre-1_FCy non-LTR Fragilariopsis retrotransposon - conceptual cylindrus consensus. CRE . Cre-1_HM Cre-1_HM non-LTR Hydra vulgaris retrotransposon - consensus. CRE . CRE-1_HRo Cre-like non-LTR Helobdella robusta retrotransposon: consensus sequence. CRE . CRE-1_LSa CRE non-LTR retrotransposon: Lactuca sativa consensus. CRE . Cre-1_MB Cre-1_MB non-LTR Monosiga retrotransposon - consensus. brevicollis CRE . Cre-1_NV Cre-1_NV non-LTR Nematostella retrotransposon - consensus. vectensis CRE . CRE-1_PXu Non-LTR retrotransposon from Papilio xuthus Papilio xuthus: consensus. CRE . CRE-10_CCri Non-LTR retrotransposon from Chondrus crispus the red seaweed: consensus. CRE . CRE-11_CCri Non-LTR retrotransposon from Chondrus crispus the red seaweed: consensus. CRE . CRE-12_CCri Non-LTR retrotransposon from Chondrus crispus the red seaweed: consensus. CRE . CRE-13_CCri Non-LTR retrotransposon from Chondrus crispus the red seaweed: consensus. CRE . CRE-14_CCri Non-LTR retrotransposon from Chondrus crispus the red seaweed: consensus. CRE . CRE-15_CCri Non-LTR retrotransposon from Chondrus crispus the red seaweed: consensus. CRE . CRE-16_CCri Non-LTR retrotransposon from Chondrus crispus the red seaweed: consensus. CRE . CRE-17_CCri Non-LTR retrotransposon from Chondrus crispus the red seaweed: consensus. CRE . Cre-2_BM Cre-2_BM non-LTR Bombyx mori retrotransposon - consensus. CRE . CRE-2_CCri Non-LTR retrotransposon from Chondrus crispus the red seaweed: consensus. CRE . CRE-2_HMa CRE non-LTR retrotransposon: Hydra vulgaris consensus. CRE . CRE-2_HRo Cre-like non-LTR Helobdella robusta retrotransposon: consensus sequence. CRE . CRE-2_NV CRE non-LTR retrotransposon: Nematostella consensus. vectensis CRE . CRE-2_PXu Non-LTR retrotransposon from Papilio xuthus Papilio xuthus: consensus. CRE . CRE-3_CCri Non-LTR retrotransposon from Chondrus crispus the red seaweed: consensus. CRE . CRE-3_HRo CRE-like non-LTR Helobdella robusta retrotransposon: consensus sequence. CRE . CRE-3_NV CRE non-LTR retrotransposon: Nematostella consensus. vectensis CRE . CRE-4_CCri Non-LTR retrotransposon from Chondrus crispus the red seaweed: consensus. CRE . CRE-4_HRo CRE-like non-LTR Helobdella robusta retrotransposon: consensus sequence. CRE . CRE-5_CCri Non-LTR retrotransposon from Chondrus crispus the red seaweed: consensus. CRE . CRE-5_HRo CRE-like non-LTR Helobdella robusta retrotransposon: consensus sequence. CRE . CRE-6_CCri Non-LTR retrotransposon from Chondrus crispus the red seaweed: consensus. CRE . CRE-6_HRo CRE-like non-LTR Helobdella robusta retrotransposon: consensus sequence. CRE . CRE-7_CCri Non-LTR retrotransposon from Chondrus crispus the red seaweed: consensus. CRE . CRE-8_CCri Non-LTR retrotransposon from Chondrus crispus the red seaweed: consensus. CRE . CRE-9_CCri Non-LTR retrotransposon from Chondrus crispus the red seaweed: consensus. CRE M33009 CRE1 C. fasciculata retrotransposable Crithidia element (CRE1). fasciculata CRE U19151 CRE2 C. fasciculata retrotransposable Crithidia element (CRE2). fasciculata CRE M62862 CZAR T. cruzi SL-RNA-associated Trypanosoma cruzi non-LTR retrotransposon. R4 . Dong Bombyx mori non-LTR Bombyx mori retrotransposable element. R4 . DONG_FR2 Non-LTR retrotransposon; Takifugu rubripes site-specific LINE; R4/Dong superfamily; DONG_FR2. R4 . Dong-1_AFC Dong/R4-type non-LTR Cichlidae retrotransposon - consensus. R4 . Dong-1_HMM Non-LTR retrotransposon Heliconius family from Heliconius melpomene melpomene melpomene. melpomene R4 . Dong-1_NVe A Dong non-LTR Nematostella retrotransposon family from vectensis Nematostella vectensis. R4 . Dong-1_PPo Non-LTR retrotransposon from Papilio polytes Papilio polytes: consensus. R4 . Dong-1_PXu Non-LTR retrotransposon from Papilio xuthus Papilio xuthus: consensus. R4 . Dong-2_BM Non-LTR retrotransposon - a Bombyx mori consensus. R4 . Dong-2_HMM Non-LTR retrotransposon Heliconius family from Heliconius melpomene melpomene melpomene. melpomene R4 . Dong-2_Lch Dong-like non-LTR Latimeria retrotransposon - consensus. chalumnae R4 . Dong-2_PPo Non-LTR retrotransposon from Papilio polytes Papilio polytes: consensus. R4 . DongAa A Dong non-LTR Aedes aegypti retrotransposon family from Aedes aegypti. R4 AB097127 DongAG Anopheles gambiae non-LTR Anopheles retrotransposon DongAg - a gambiae partial sequence. R4 AB097128 EhRLE2 Entamoeba histolytica Entamoeba retrotransposon EhRLE2, histolytica complete sequence. R4 AB097129 EhRLE3 Entamoeba histolytica Entamoeba retrotransposon EhRLE3, histolytica complete sequence. HERO . HERO-1_AFC Hero-type non-LTR Cichlidae retrotransposon - consensus. HERO . HERO-1_BF Amphioxus HERO-1_BF Branchiostoma autonomous non-LTR floridae Retrotransposon - consensus. HERO . HERO-1_HR A family of HERO non-LTR Helobdella robusta retrotransposons - a consensus sequence. HERO . HERO-1_PP A family of HERO non-LTR Physarum retrotransposons - a polycephalm consensus sequence. HERO AAGJ02121261 HERO-1_SP Sea urchin HERO-1_SP Strongylocentrotus autonomous non-LTR purpuratus Retrotransposon - consensus. HERO . HERO-2_BF Amphioxus HERO-2_BF Branchiostoma autonomous non-LTR floridae Retrotransposon - consensus. HERO . HERO-2_DR HERO-2_DR is a family of HERO Danio rerio non-LTR retrotransposons - a consensus. HERO . HERO-2_HR Non-LTR retrotransposon: Helobdella robusta consensus sequence. HERO 048B05 Hero-2_SPur HERO-type non-ltr Strongylocentrotus retrotransposon from sea urchin. purpuratus HERO . HERO-3_BF HERO-3_BF is a family of HERO Branchiostoma non-LTR retrotransposons - a floridae consensus. HERO . HERO-3_DR HERO-3_DR is a family of HERO Danio rerio non-LTR retrotransposons - a consensus. HERO . HERO-3_HR Non-LTR retrotransposon: Helobdella robusta consensus sequence. HERO . Hero-3_SPur HERO-type non-LTR Strongylocentrotus retrotransposon from sea urchin. purpuratus HERO . HERO-4_DR HERO-4_DR is a family of HERO Danio rerio non-LTR retrotransposons - a consensus. HERO . HERO-4_HR Non-LTR retrotransposon: Helobdella robusta consensus sequence. HERO . HERO-5_HR Non-LTR retrotransposon: Helobdella robusta consensus sequence. HERO . HERO-6_HR Non-LTR retrotransposon: Helobdella robusta consensus sequence. HERO . HERO-7_HR Non-LTR retrotransposon: Helobdella robusta consensus sequence. HERO . HERO-8_HR Non-LTR retrotransposon: Helobdella robusta consensus sequence. HERO . HERO-9_HR Non-LTR retrotransposon: Helobdella robusta consensus sequence. HERO . HERODr HERODr is a family of HERO Danio rerio non-LTR retrotransposons - a consensus. HERO . HEROFr A HERO clade non-LTR Takifugu rubripes Retrotransposon family - consensus. HERO . HEROTn HEROTn or Zebulon non-LTR Tetraodon retrotransposon - a consensus nigroviridis sequence. NeSL . LIN10B_SM Non-LTR retrotransposon; Schmidtea consensus. mediterranea NeSL . LIN11_SM Non-LTR retrotransposon: Schmidtea consensus. mediterranea NeSL . LIN13_SM Non-LTR retrotransposon; Schmidtea consensus. mediterranea NeSL . LIN14_SM Non-LTR retrotransposon; Schmidtea consensus. mediterranea NeSL . LIN15_SM Non-LTR retrotransposon; Schmidtea consensus. mediterranea NeSL . LIN2_SM Non-LTR retrotransposon Schmidtea (consensus). mediterranea NeSL . LIN21_SM Non-LTR retrotransposon; Schmidtea consensus. mediterranea NeSL . LIN23_SM Non-LTR retrotransposon; Schmidtea consensus. mediterranea NeSL . LIN24_SM Non-LTR retrotransposon; Schmidtea consensus. mediterranea NeSL . LIN24B_SM Non-LTR retrotransposon; Schmidtea consensus. mediterranea NeSL . LIN25_SM Non-LTR retrotransposon; Schmidtea consensus. mediterranea NeSL . LIN26_SM Non-LTR retrotransposon; Schmidtea consensus. mediterranea NeSL . LIN3_SM Non-LTR retrotransposon; Schmidtea consensus. mediterranea NeSL . LIN4_SM Non-LTR retrotransposon; Schmidtea consensus. mediterranea NeSL . LIN4b_SM Non-LTR retrotransposon; Schmidtea consensus. mediterranea NeSL . LIN5_SM Non-LTR retrotransposon from Schmidtea Schmidtea mediterranea: mediterranea consensus. NeSL . LIN6_SM Non-LTR retrotransposon from Schmidtea Schmidtea mediterranea: mediterranea consensus. NeSL . LIN7_SM Non-LTR retrotransposon from Schmidtea Schmidtea mediterranea: mediterranea consensus. NeSL . LIN7B_SM Non-LTR retrotransposon; Schmidtea consensus. mediterranea NeSL . LIN9_SM Non-LTR retrotransposon: Schmidtea consensus. mediterranea CRE JQ747487 MoTeR1 Telomere-specific non-LTR Magnaporthe oryzae retrotransposon MoTeR1 from Magnaporthe oryzae. CRE JQ747488 MoTeR2 Telomere-specific non-LTR Magnaporthe oryzae retrotransposon MoTeR2 from Magnaporthe oryzae. NeSL Z82058 NeSL-1 NeSL-1 is a non-LTR Caenorhabditis retrotransposon, complete elegans sequence. NeSL . NeSL-1_C11 A family of NeSL non-LTR Caenorhabditis retrotransposons. tropicalis NeSL . NeSL-1_CA A family of NeSL non-LTR Caenorhabditis retrotransposons. angaria NeSL . NeSL-1_CBre A family of NeSL non-LTR Caenorhabditis retrotransposons - consensus. brenneri NeSL . NeSL-1_CBri A family of NeSL non-LTR Caenorhabditis retrotransposons. briggsae NeSL . NeSL-1_CJap A family of NeSL non-LTR Caenorhabditis retrotransposons - consensus. japonica NeSL . NeSL-1_CRem A family of NeSL non-LTR Caenorhabditis retrotransposons - consensus. remanei NeSL . NeSL-1_SM Non-LTR retrotransposon; Schmidtea consensus. mediterranea NeSL . NeSL-1_TV A family of NeSL non-LTR Trichomonas retrotransposons - consensus. vaginalis NeSL . NeSL-2_CBre A family of NeSL non-LTR Caenorhabditis retrotransposons - consensus. brenneri NeSL . NeSL-2_CRem A family of NeSL non-LTR Caenorhabditis retrotransposons - consensus. remanei NeSL . NeSL-2_SM Non-LTR retrotransposon; Schmidtea consensus. mediterranea NeSL . NeSL-3_CBre A family of NeSL non-LTR Caenorhabditis retrotransposons - consensus. brenneri NeSL . NeSL-3_CRem A family of NeSL non-LTR Caenorhabditis retrotransposons - consensus. remanei NeSL chrUn NeSL-4_CRem A family of NeSL non-LTR Caenorhabditis retrotransposons. remanei NeSL . NeSL-4_SM Non-LTR retrotransposon; Schmidtea consensus. mediterranea R2 BN000800 PERERE-9 Schistosoma mansoni Perere-9 Schistosoma mansoni non-LTR retrotransposon (EST). R4 . Plat_R4 R4 Non-LTR Retrotransposon Ornithorhynchus from Ornithorhynchus. R2 AF015815 R2_AM Anurida maritima Anurida maritima retrotransposon R2, complete sequence. R2 M16558 R2_BM Bombyx mori rDNA insertion Bombyx mori element R2 (type II), complete cds. R2 AB097121 R2CI R2-type LINE. Ciona intestinalis R2 . R2CPB Non-LTR retrotransposon: Chrysemyspicta consensus. bellii R2 . R2_DAn 28S rDNA-specific non-LTR Drosophila retrotransposon R2 in ananassae Drosophila ananassae. R2 X51967 R2_DM LINE-like retrotransposable Drosophila element R2DM. melanogaster R2 . R2_DPe 28S rDNA-specific non-LTR Drosophila retrotransposon R2 in persimilis Drosophila persimilis. R2 . R2_DPs 28S rDNA-specific non-LTR Drosophila retrotransposon R2 in pseudoobscura Drosophila pseudoobscura. R2 . R2_DSe 28S rDNA-specific non-LTR Drosophila retrotransposon R2 in sechellia Drosophila sechellia. R2 . R2_DSi 28S rDNA-specific non-LTR Drosophila retrotransposon R2 in simulans Drosophila simulans. R2 . R2_DYa 28S rDNA-specific non-LTR Drosophila yakuba retrotransposon R2 in Drosophila yakuba. R2 AF015819 R2_FA Forficula auricularia Forficula retrotransposon R2, complete auricularia sequence. R2 AF015816 R2_HC Hippodamia convergens Hippodamia retrotransposon R2 reverse convergens transcriptase gene, partial cds. R2 GU949558 R2_KF 28S rDNA-specific non-LTR Kalotermes retrotransposon R2 from flavicollis Kalotermes flavicollis. R2 AF015814 R2_LP Limulus polyphemus Limulus polyphemus retrotransposon R2, complete sequence. R2 AF015818 R2_PS Porcellio scaber Porcellio scaber retrotransposon R2, complete sequence. R2 GU949555 R2_RL 28S rDNA-specific non-LTR Reticulitermes retrotransposon R2 from lucifugus Reticulitermes lucifugus. R2 GU949554 R2_RU 28S rDNA-specific non-LTR Reticulitermes retrotransposon R2 from urbis Reticulitermes urbis. R2 . R2-1_AAm R2 non-LTR retrotransposon Amblyomma from lone star tick. americanum R2 . R2-1_ACC R2 non-LTR retrotransposon Aquila chrysaetos from golden eagle. canadensis R2 . R2-1_ACh R2 non-LTR retrotransposon Acanthisitta from rifleman. chloris R2 . R2-1_AFo R2 non-LTR retrotransposon Aptenodytes from emperor penguin. forsteri R2 . R2-1_AMi R2-type non-LTR retrotransposon. Alligator mississippiensis R2 . R2-1_AOM R2 non-LTR retrotransposon Apteryx spp. from kiwi. R2 . R2-1_ApA R2 non-LTR retrotransposon Apteryx australis from north island brown kiwi. mantelli R2 . R2-1_APi R2 non-LTR retrotransposon Acyrthosiphon from pea aphid. pisum R2 . R2-1_BRG R2 non-LTR retrotransposon Balearica from East African grey regulorum crowned crane. gibbericeps R2 . R2-1_BTe R2 non-LTR retrotransposon Bombus terrestris from buff-tailed bumblebee. R2 . R2-1_CAnn R2 non-LTR retrotransposon Calypte anna from Anna's hummingbird. R2 . R2-1_CAu R2 non-LTR retrotransposon Cathartes aura from turkey vulture. R2 . R2-1_CBr R2 non-LTR retrotransposon Corvus from American crow. brachyrhynchos R2 . R2-1_CCa R2 non-LTR retrotransposon Antrostomus from chuck-will's-widow. carolinensis R2 . R2-1_CCan R2 non-LTR retrotransposon Cuculus canorus from common cuckoo. R2 . R2-1_CPu R2 non-LTR retrotransposon Calidris pugnax from ruff. R2 . R2-1_Crp Non-LTR retrotransposon. Crocodylus porosus R2 . R2-1_CSt R2 non-LTR retrotransposon Colius striatus from speckled mousebird. R2 . R2-1_CU R2 non-LTR retrotransposon Chlamydotis from MacQueen's bustard. macqueenii R2 . R2-1_CVo R2 non-LTR retrotransposon Charadrius from killdeer. vociferus R2 . R2-1_DWi 28S rDNA-specific non-LTR Drosophila retrotransposon R2 in willistoni Drosophila willistoni. R2 . R2-1_EGa R2 non-LTR retrotransposon Egretta garzetta from little egret. R2 . R2-1_FAl R2 non-LTR retrotransposon Ficedula from collared flycatcher. albicollis R2 . R2-1_FCh R2 non-LTR retrotransposon Falco cherrug from Saker falcon. R2 . R2-1_FPe R2 non-LTR retrotransposon Falco peregrinus from peregrine falcon. R2 . R2-1_GA R2 non-LTR retrotransposon Gasterosteus from three-spined stickleback. aculeatus R2 . R2-1_Gav Non-LTR retrotransposon. Gavialis gangeticus R2 . R2-1_GFo R2 non-LTR retrotransposon Geospiza fortis from medium ground finch. R2 . R2-1_GSt R2 non-LTR retrotransposon Gavia stellata from red-throated loon. R2 . R2-1_HAl R2 non-LTR retrotransposon Haliaeetus from white-tailed eagle. albicilla R2 . R2-1_IS R2 non-LTR retrotransposon Ixodes scapularis from deer tick. R2 . R2-1_LCh R2-type non-LTR Latimeria retrotransposon - consensus. chalumnae R2 . R2-1_LDi R2 non-LTR retrotransposon Leptosomus from cuckoo roller. discolor R2 . R2-1_LSal non-LTR retrotransposon, Lepeophtheirus consensus. salmonis R2 . R2-1_LV R2 non-LTR retrotransposon Lytechinus from green sea urchin. variegatus R2 . R2-1_MDe R2 non-LTR retrotransposon Mayetiola from Hessian fly. destructor R2 . R2-1_MLe R2 non-LTR retrotransposon - Mnemiopsis leidyi consensus. R2 . R2-1_MR R2 non-LTR retrotransposon Megachile from alfalfa leafcutter bee. rotundata R2 . R2-1_MUn R2 non-LTR retrotransposon Melopsittacus fragment from budgerigar. undulatus R2 . R2-1_MUni R2 non-LTR retrotransposon Mesitornis from brown mesite. unicolor R2 . R2-1_MVi R2 non-LTR retrotransposon Manacus vitellinus from golden-collared manakin. R2 . R2-1_NNi R2 non-LTR retrotransposon Nipponia nippon from created ibis. R2 . R2-1_NV Starlet sea anemone R2-1_NV Nematostella autonomous Non-LTR vectensis Retrotransposon - consensus. R2 . R2-1_OHo R2 non-LTR retrotransposon Opisthocomus from hoatzin. hoazin R2 . R2-1_PAd R2 non-LTR retrotransposon Pygoscelis adeliae from Adelie penguin. R2 . R2-1_PBa R2 non-LTR retrotransposon Pogonomyrmex from red harvester ant. barbatus R2 . R2-1_PCar R2 non-LTR retrotransposon Phalacrocorax from great cormorant. carbo R2 . R2-1_PCau R2 non-LTR retrotransposon Priapulus caudatus sequence. R2 . R2-1_PCr R2 non-LTR retrotransposon Podiceps cristatus from great crested grebe. R2 . R2-1_PCri R2 non-LTR retrotransposon Pelecanus crispus from Dalmatian pelican. R2 . R2-1_PGu R2 non-LTR retrotransposon Pterocles from sandgrouse. gutturalis R2 . R2-1_PLe R2 non-LTR retrotransposon Phaethon lepturus from tropicbird. R2 . R2-1_PM R2-1_PM is a family of R2 Petromyzon marinus non-LTR retrotransposons - consensus. R2 . R2-1_PPap R2 non-LTR retrotransposon Phlebotomus from sand fly. papatasi R2 . R2-1_PPu R2 non-LTR retrotransposon Picoides pubescens from downy woodpecker. R2 . R2-1_PRR R2 non-LTR retrotransposon Phoenicopterus from American flamingo. ruber ruber R2 . R2-1_PSi R2 non-LTR retrotransposon Pelodiscus from Chinese soft-shelled sinensis turtle. R2 . R2-1_RMi R2 non-LTR retrotransposon Rhipicephaus from brown tick. microplus R2 . R2-1_RPr R2 non-LTR retrotransposon Rhodnius prolixus sequence. R2 . R2-1_RPu R2 non-LTR retrotransposon Rhipicephalus cDNA sequence from brown tick. pulchellus R2 . R2-1_SCa R2 non-LTR retrotransposon Serinus canaria from Atlantic canary. R2 . R2-1_SK R2 non-LTR retrotransposon Saccoglossus from acorn worm. kowalevskii R2 . R2-1_SM R2-type retrotransposon from Schmidtea Schmidtea mediterranea: mediterranea consensus. R2 . R2-1_SP R2 non-LTR retrotransposon Strongylocentrotus from purple sea urchin. purpuratus R2 AGKD01072455 R2-1_SSa R2-type non-LTR retrotransposon. Salmo salar R2 . R2-1_StC R2 non-LTR retrotransposon Struthiocamelus from ostrich. australis R2 . R2-1_TAl R2 non-LTR retrotransposon Tyto alba from barn owl. R2 . R2-1_TCas R2 non-LTR retrotransposon Tribolium from red flour beetle - consensus castaneum R2 . R2-1_TG A family of R2 non-LTR Taeniopygia retrotransposons - consensus guttata sequence. R2 . R2-1_TGut R2 non-LTR retrotransposon Tinamus guttatus from white-throated tinamou. R2 . R2-1_TSP A family of R2 non-LTR Trichinella retrotransposons in the spiralis Trichinella spiralis genome - a consensus. R2 scaffold_6 R2-1_TUr R2 non-LTR retrotransposon Tetranychus from twospotted spider mite. urticae R2 . R2-1_XM R2 non-LTR retrotransposon Xiphophorus fragment from Southern maculatus platyfish. R2 . R2-1_ZA R2 non-LTR retrotransposon Zonotrichia from white-throated sparrow. albicollis R2 . R2-1_ZLM R2 non-LTR retrotransposon Zosterops from silvereye. lateralis melanops R2 . R2-2_APi R2 non-LTR retrotransposon Acyrthosiphon from pea aphid. pisum R2 . R2-2_CCan R2 non-LTR retrotransposon Cuculus canorus from common cuckoo. R2 . R2-2_CMa R2 non-LTR retrotransposon Chlamydotis from Macqueen's bustard. macqueenii R2 . R2-2_DWi 28S rDNA-specific non-LTR Drosophila retrotransposon R2 in willistoni Drosophila willistoni. R2 . R2-2_HAl R2 non-LTR retrotransposon Haliaeetus from white-tailed eagle. albicilla R2 . R2-2_IS R2 non-LTR retrotransposon Ixodes scapularis from deer tick. R2 . R2-2_MR R2 non-LTR retrotransposon Megachile from alfalfa leafcutter bee. rotundata R2 . R2-2_MUn R2 non-LTR retrotransposon Melopsittacus fragment from budgerigar. undulatus R2 . R2-2_MUni R2 non-LTR retrotransposon Mesitornis from brown mesite. unicolor R2 . R2-2_NNi R2 non-LTR retrotransposon Nipponia nippon from created ibis. R2 . R2-2_NV Starlet sea anemone R2-2_NV Nematostella autonomous Non-LTR vectensis Retrotransposon - consensus. R2 . R2-2_PBa R2 non-LTR retrotransposon Pogonomyrmex from red harvester ant. barbatus R2 . R2-2_PM R2-2_PM is a family of R2 Petromyzon marinus non-LTR retrotransposons - a consensus. R2 . R2-2_RPr R2 non-LTR retrotransposon Rhodnius prolixus sequence. R2 . R2-2_SMed R2 non-LTR retrotransposon Schmidtea from Schmidtea mediterranea: mediterranea consensus. R2 . R2-2_TCas R2 non-LTR retrotransposon Tribolium from red flour beetle. castaneum R2 scaffold_37 R2-2_TUr R2 non-LTR retrotransposon Tetranychus from twospotted spider mite. urticae R2 ABJB010555169 R2-3_IS R2 non-LTR retrotransposon Ixodes scapularis from deer tick. R2 . R2-3_MR R2 non-LTR retrotransposon Megachile from alfalfa leafcutter bee. rotundata R2 . R2-4_MR R2 non-LTR retrotransposon Megachile from alfalfa leafcutter bee. rotundata R2 . R2-5_MR R2 non-LTR retrotransposon Megachile from alfalfa leafcutter bee. rotundata R2 . R2-6_MR R2 non-LTR retrotransposon Megachile from alfalfa leafcutter bee. rotundata R2 . R2-7_MR R2 non-LTR retrotransposon Megachile from alfalfa leafcutter bee. rotundata R2 . R2-8_MR R2 non-LTR retrotransposon Megachile from alfalfa leafcutter bee. rotundata R2 . R2-N1_Gav Non-LTR retrotransposon. Gavialis gangeticus R2 . R2-N2_Gav Non-LTR retrotransposon. Gavialis gangeticus R2 . R2-N2B_Gav Non-LTR retrotransposon. Gavialis gangeticus R2 . R2A_NVi 28S rDNA-specific non-LTR Nasonia retrotransposon R2 in vitripennis Nasonia vitripennis. R2 AF015817 R2A_TM Tenebrio molitor Tenebrio molitor retrotransposon R2 reverse transcriptase gene, partial cds. R2 . R2Amel R2Amel - R2 non-LTR Apis mellifera retrotransposon from the honeybee Apis mellifera. R2 AF015685 R2B_DM Drosophila mercatorum R2 Drosophila retrotransposon reverse mercatorum transcriptase domain protein gene, complete cds. R2 . R2B_NVi 28S rDNA-specific non-LTR Nasonia retrotransposon R2 in vitripennis Nasonia vitripennis. R2 AF015822 R2B_TM Tenebrio molitor Tenebrio molitor retrotransposon R2 reverse transcriptase gene, partial cds. R2 . R2C_NGi 28S rDNA-specific non-LTR Nasonia giraulti retrotransposon R2 in Nasonia giraulti. R2 AB097122 R2Ci-B Ciona intestinalis Ciona intestinalis retrotransposon R2Ci-B, complete sequence. R2 . R2Ci-D Ciona intestinalis Ciona intestinalis retrotransposon R2CiD, complete sequence. R2 AB097121 R2CIA_CI Ciona intestinalis Ciona intestinalis retrotransposon R2Ci-A, complete sequence. R2 AB097125 R2Cs-D Ciona intestinalis Ciona savignyi retrotransposon R2CsD, partial sequence. R2 . R2D_NGi 28S rDNA-specific non-LTR Nasonia giraulti retrotransposon R2 in Nasonia giraulti. R2 NM_001030097 R2Dr R2 non-LTR retrotransposon in Danio rerio the Danio rerio genome - a single copy. R2 . R2E_NLo 28S rDNA-specific non-LTR Nasonia retrotransposon R2 in longicornis Nasonia longicornisi. R2 AB201408 R2Eb R2 non-LTR retrotransposon Eptatretus burgeri from Eptatretus burgeri. R2 AB201415 R2Ha R2 non-LTR retrotransposon Hasarius adansoni from Hasarius adansoni. R2 JN937617 R2La R2-type non-LTR retrotransposon. Lepidurus arcticus R2 . R2LcA R2-type non-LTR retrotransposon. Lepidurus couesii R2 JN937619 R2LcB R2-type non-LTR retrotransposon. Lepidurus couesii R2 . R2LcC R2-type non-LTR retrotransposon. Lepidurus couesii R2 JN937616 R2Ll R2-type non-LTR retrotransposon. Lepidurus apus lubbocki R2 AB201414 R2Mr R2 non-LTR retrotransposon Metacrinus from Metacrinus rotundus. rotundus R2 . R2NS-1_CGi R2-type retrotransposon from Crassostrea gigas Crassostrea gigas. R2 . R2NS-1_CSi R2-type retrotransposon from Clonorchis Clonorchis sinensis: consensus. sinensis R2 . R2NS-1_PMi R2-like non-LTR Patiria miniata retrotransposon from bat star. R2 . R2NS-1_SMed R2-type retrotransposon from Schmidtea Schmidtea mediterranea: mediterranea consensus. R2 . R2Nvec-A R2Nvec-A - R2 non-LTR Nematostella retrotransposon from the vectensis starlet sea anemone Nematostella vectensis. R2 . R2Ol-A R2 non-LTR retrotransposon Oryzias latipes from the medaka Oryzias latipes - consensus. R2 AB201416 R2Pc R2 non-LTR retrotransposon Procambarus from Procambarus clarkii. clarkii R2 . R2Sm-A R2Sm-A - R2 non-LTR Schistosoma retrotransposon from the mansoni bloodfluke Schistosoma mansoni. R2 AB201409 R2Ta R2 non-LTR retrotransposon Tanichthys from Tanichthys albonubes. albonubes R2 EU854578 R2Tc R2-type non-LTR retrotransposon. Triops cancriformis R2 JN937621 R2Tc_it R2-type non-LTR retrotransposon. Triops cancriformis R2 AB201417 R2Tl R2 non-LTR retrotransposon Triops from Triops longicaudatus. longicaudatus R4 U29445 R4_AL Ascaris lumbricoides Ascaris site-specific non-LTR lumbricoides retrotransposable element R4 in 26S rDNA, complete sequence. R4 U29590 R4_HC Haemonchus contortus non-LTR Haemonchus retrotransposon specific to contortus the large subunit rRNA genes of nematodes. R4 . R4_Hmel a R4 element from Heliconius Heliconius melpomene. melpomene R4 . R4-1_AC A family of R4 non-LTR Anolis retrotransposons - consensus carolinensis sequence. R4 . R4-1_ADi R4-type retrotransposon: Acropora consensus. digitifera R4 . R4-1_BM Non-LTR retrotransposon - a Bombyx mori consensus. R4 CADV01008175 R4-1_BX An R4 non-LTR retrotransposon Bursaphelenchus family from Bursaphelenchus xylophilus xylophilus. R4 ABLE03011482 R4-1_CJap An R4 non-LTR retrotransposon Caenorhabditis family from Caenorhabditis japonica japonica. R4 . R4-1_CM Non-LTR retrotransposon from Callorhinchus the elephant shark - consensus. milii R4 . R4-1_CPB Non-LTR retrotransposon: Chrysemyspicta consensus. bellii R4 . R4-1_ED Autonomous non-LTR Entamoeba dispar retrotransposon from the R4 clade - a consensus sequence. R4 . R4-1_HG An R4 non-LTR retrotransposon Heterodera family from glycines Heterodera glycines. R4 . R4-1_HMe Non-LTR retrotransposon family from Heliconius Heliconius melpomene melpomene. melpomene melpomene R4 CABB01003843 R4-1_MI An R4 non-LTR retrotransposon Meloidogyne family from Meloidogyne incognita incognita. R4 . R4-1_PH Non-LTR Retrotransposon, Parhyale consensus. hawaiensis R4 CACX01002001 R4-1_SRa An R4 non-LTR retrotransposon Strongyloides family from Strongyloides ratti. ratti R4 . R4-1_TCa R4-type retrotransposon: Tribolium consensus. castaneum R4 . R4-1B_AC Dong-type non-LTR Anolis retrotransposons - a consensus carolinensis sequence. R4 . R4-2_AS An R4 non-LTR retrotransposon Ascaris suum family from Ascaris suum. R4 CADV01009048 R4-2_BX An R4 non-LTR retrotransposon Bursaphelenchus family from Bursaphelenchus xylophilus xylophilus. R4 ABLA01000389 R4-2_HG An R4 non-LTR retrotransposon Heterodera family from Heterodera glycines glycines. R4 CACX01002006 R4-2_SRa An R4 non-LTR retrotransposon Strongyloides family from Strongyloides ratti. ratti R4 CADV01008832 R4-3_BX An R4 non-LTR retrotransposon Bursaphelenchus family from Bursaphelenchus xylophilus xylophilus. R4 . R4-3_SRa An R4 non-LTR retrotransposon Strongyloides family from Strongyloides ratti. ratti R4 . R4-4_BX An R4 non-LTR retrotransposon Bursaphelenchus family from Bursaphelenchus xylophilus xylophilus. R4 . R4-4_SRa An R4 non-LTR retrotransposon Strongyloides family from Strongyloides ratti ratti. R4 . R4-5_BX An R4 non-LTR retrotransposon Bursaphelenchus family from Bursaphelenchus xylophilus xylophilus. NeSL AY216701 R5 Girardia tigrina R5 Girardia tigrina retrotransposon, complete sequence. NeSL . R5-1_SM A family of planarian NeSL Schmidtea non-LTR retrotransposons - mediterranea consensus. NeSL . R5-2_SM A family of planarian NeSL Schmidtea non-LTR retrotransposons - mediterranea consensus. R2 . R8Hm-A R8Hm-A - 18S rDNA-specific Hydra vulgaris non-LTR retrotransposon from Hydra magnipapillata. R2 . R8Hm-B R8Hm-B - 18S rDNA-specific Hydra vulgaris non-LTR retrotransposon from Hydra magnipapillata. R2 . R9Av R9Av, an rDNA-specific non-LTR Adineta vaga retrotransposon family from rotifer. R2 FJ461304 RaR2 28S rDNA-specific non-LTR Rhynchosciara retrotransposon R2 from americana Rhynchosciara americana. R4 . Rex6 Non-LTR retrotransposon; Takifugu rubripes site-specific LINE; R4/Dong superfamily; REX6; DONG_FR. R4 . Rex6-1_OL A Rex6 non-LTR retrotransposon Oryzias latipes family from Olyzias latipes. CRE X17078 SLACS Trypanosoma brucei DNA for Trypanosoma brucei retrotransposable element SLACS. NeSL . Utopia-1_ACa Utopia-1_ACa is a protozoan Acanthamoeba Utopia non-LTR retrotransposon - castellanii a complete sequence. NeSL scaffold_474 Utopia-1_ACar A family of NeSL non-LTR Anolis retrotransposons. carolinensis NeSL . Utopia-1_AEc A family of Utopia non-LTR Acromyrmex retrotransposons - consensus. echinatior NeSL . Utopia-1_AMi A family of NeSL non-LTR Alligator retrotransposons - consensus. mississippiensis NeSL . Utopia-1_APi A family of Utopia non-LTR Acyrthosiphon retrotransposons - consensus. pisum NeSL . Utopia-1_APl A family of Utopia non-LTR Agrilus retrotransposons. planipennis NeSL . Utopia-1_CFl A family of Utopia non-LTR Camponotus retrotransposons - consensus. floridanus NeSL . Utopia-1_CMy A family of Utopia non-LTR Chelonia mydas retrotransposons - consensus. NeSL . Utopia-1_CPB A family of Utopia non-LTR Chrysemyspicta retrotransposons - consensus. bellii NeSL . Utopia-1_Crp Non-LTR retrotransposon. Crocodylus porosus NeSL . Utopia-1_DPo A family of Utopia non-LTR Dendroctonus ponderosae retrotransposons. NeSL . Utopia-1_DPu A family of Utopia non-LTR Daphnia pulex retrotransposons - consensus. NeSL . Utopia-1_DYak A family of Utopia non-LTR Drosophila yakuba retrotransposons - consensus. NeSL . Utopia-1_EBr A family of Utopia non-LTR Eimeria brunetti retrotransposons - consensus. NeSL . Utopia-1_EMi A family of Utopia non-LTR Eimeria mitis retrotransposons - consensus. NeSL . Utopia-1_ENe A family of Utopia non-LTR Eimeria necatrix retrotransposons - consensus. NeSL . Utopia-1_Gav Non-LTR retrotransposon. Gavialis gangeticus NeSL . Utopia-1_GG1 A family of Utopia non-LTR Ganaspis retrotransposons. NeSL . Utopia-1_HAra A family of Utopia non-LTR Hyaloperonospora retrotransposons. arabidopsidis NeSL . Utopia-1_HG A family of Utopia non-LTR Heterodera retrotransposons. glycines NeSL . Utopia-1_HMM A family of Utopia non-LTR Heliconius retrotransposons. melpomene melpomene NeSL . Utopia-1_HSal A family of Utopia non-LTR Harpegnathos retrotransposons - consensus. saltator NeSL . Utopia-1_IS A family of Utopia non-LTR Ixodes scapularis retrotransposons - consensus. NeSL . Utopia-1_LAl A family of Utopia non-LTR Lasioglossum retrotransposons. albipes NeSL . Utopia-1_LFu A family of Utopia non-LTR Ladona fulva retrotransposons. NeSL AGCV01358106 Utopia-1_LV A family of Utopia non-LTR Lytechinus retrotransposons. variegatus NeSL . Utopia-1_MRo A family of Utopia non-LTR Megachile retrotransposons - consensus. rotundata NeSL . Utopia-1_NVit A family of Utopia non-LTR Nasonia retrotransposons - consensus. vitripennis NeSL . Utopia-1_PAlni NeSL non-LTR retrotransposon Phytophthora alni from Phytophthora alni. NeSL . Utopia-1_PArrh NeSL non-LTR retrotransposon Pythium from Pythium arrhenomanes. arrhenomanes NeSL . Utopia-1_PBa A family of Utopia non-LTR Pogonomyrmex retrotransposons - consensus. barbatus NeSL . Utopia-1_PCa A family of Utopia non-LTR Phytophthora retrotransposons - consensus. capsici NeSL . Utopia-1_PCinn NeSL non-LTR retrotransposon Phytophthora from Phytophthora cinnamomi. cinnamomi NeSL AHJF01004292 Utopia-1_PCu A family of Utopia non-LTR Pseudoperonospora retrotransposons - consensus. cubensis NeSL . Utopia-1_PI A family of NeSL non-LTR Phytophthora retrotransposons - consensus. infestans NeSL . Utopia-1_PInsi NeSL non-LTR retrotransposon Pythium insidiosum from Pythium insidiosum. NeSL . Utopia-1_PKern NeSL non-LTR retrotransposon Phytophthora from Phytophthora kernoviae. kernoviae NeSL . Utopia-1_PLate NeSL non-LTR retrotransposon Phytophthora from Phytophthora lateralis. lateralis NeSL . Utopia-1_PMi A family of Utopia non-LTR Patiria miniata retrotransposons. NeSL . Utopia-1_PPac A family of Utopia non-LTR Pristionchus retrotransposons. pacificus NeSL . Utopia-1_PPini NeSL non-LTR retrotransposon Phytophthora from pinifolia Phytophthora pinifolia. NeSL . Utopia-1_PR A family of Utopia non-LTR Phytophthora retrotransposons - consensus. ramorum NeSL . Utopia-1_PRe A family of Utopia non-LTR Panagrellus retrotransposons - consensus. redivivus NeSL . Utopia-1_PS A family of Utopia non-LTR Phytophthora sojae retrotransposons - consensus. NeSL . Utopia-1_PSi A family of Utopia non-LTR Pelodiscus retrotransposons - consensus. sinensis NeSL . Utopia-1_PT A family of Utopia non-LTR Parasteatoda retrotransposons. tepidariorum NeSL ADOS01001321 Utopia-1_PU A family of Utopia non-LTR Pythium ultimum retrotransposons. NeSL . Utopia-1_PVexa NeSL non-LTR retrotransposon Phytopythium from Phytopythium vexans. aff. vexans NeSL . Utopia-1_SaPa A family of Utopia non-LTR Saprolegnia retrotransposons. parasitica NeSL . Utopia-1_SDicl NeSL non-LTR retrotransposon Saprolegnia from Saprolegnia diclina. diclina NeSL . Utopia-1_SM A family of Utopia non-LTR Strigamia maritima retrotransposons. NeSL AAGJ02140537 Utopia-1_SP A family of Utopia non-LTR Strongylocentrotus retrotransposons. purpuratus NeSL . Utopia-1_TSP A family of Utopia non-LTR Trichinella retrotransposons. spiralis NeSL . Utopia-1B_CPB A family of Utopia non-LTR Chrysemys picta retrotransposons - consensus. bellii NeSL . Utopia-2_APi A family of Utopia non-LTR Acyrthosiphon retrotransposons. pisum NeSL . Utopia-2_CMy A family of Utopia non-LTR Chelonia mydas retrotransposons - consensus. NeSL . Utopia-2_CPB A family of Utopia non-LTR Chrysemys picta retrotransposons - consensus. bellii NeSL . Utopia-2_DPu A family of Utopia non-LTR Daphnia pulex retrotransposons. NeSL . Utopia-2_LFu A family of Utopia non-LTR Ladona fulva retrotransposons. NeSL . Utopia-2_PCa A family of Utopia non-LTR Phytophthora retrotransposons - consensus. capsici NeSL . Utopia-2_PI A family of NeSL non-LTR Phytophthora retrotransposons - consensus. infestans NeSL . Utopia-2_PR A family of Utopia non-LTR Phytophthora retrotransposons - consensus. ramorum NeSL . Utopia-2_PS A family of Utopia non-LTR Phytophthora sojae retrotransposons - consensus. NeSL . Utopia-2_PU A family of Utopia non-LTR Pythium ultimum retrotransposons. NeSL . Utopia-3_CPB A family of Utopia non-LTR Chrysemys picta retrotransposons - consensus. bellii NeSL . Utopia-3_DPu A family of Utopia non-LTR Daphnia pulex retrotransposons. NeSL . Utopia-3_LFu A family of Utopia non-LTR Ladona fulva retrotransposons. NeSL . Utopia-3_PCa A family of Utopia non-LTR Phytophthora retrotransposons - consensus. capsici NeSL . Utopia-3_PI A family of NeSL non-LTR Phytophthora retrotransposons - consensus. infestans NeSL . Utopia-3_PR A family of Utopia non-LTR Phytophthora retrotransposons - consensus. ramorum NeSL . Utopia-4_LFu A family of Utopia non-LTR Ladona fulva retrotransposons. NeSL AATU01001281.1 Utopia-4_PI A family of NeSL non-LTR Phytophthora retrotransposons - a copy. infestans NeSL . Utopia-4_PR A family of Utopia non-LTR Phytophthora retrotransposons - consensus. ramorum NeSL . Utopia-5_LFu A family of Utopia non-LTR Ladona fulva retrotransposons. NeSL . Utopia-5_PI A family of Utopia non-LTR Phytophthora retrotransposons - consensus. infestans NeSL . Utopia-5_PR A family of Utopia non-LTR Phytophthora retrotransposons - consensus. ramorum NeSL . Utopia-6_LFu A family of Utopia non-LTR Ladona fulva retrotransposons. R4 . X4_LINE Conserved LINE element Vertebrata reconstructed from the human genome - consensus. NeSL . YURE_CSa A NeSL non-LTR retrotransposon Ciona savignyi from Ciona savignyi. R2 . YURE-2_Cis YURE non-LTR retrotransposon Ciona savignyi from Ciona savignyi. NeSL . YURECi Ciona intestinalis Ciona intestinalis retrotransposon YURECi.
(46) A skilled artisan can, based on the Accession numbers provided in Tables 1-3 determine the nucleic acid and corresponding polypeptide sequences of each retrotransposon and domains thereof, e.g., by using routine sequence analysis tools as Basic Local Alignment Search Tool (BLAST) or CD-Search for conserved domain analysis. Other sequence analysis tools are known and can be found, e.g., at molbiol-tools.ca, for example, at molbiol-tools.ca/Motifs.htm. SEQ ID NOs 1-112 align with each row in Table 1, and SEQ ID NOs 113-1015 align with the first 903 rows of Table 2.
(47) Tables 1-3 herein provide the sequences of exemplary transposons, including the amino acid sequence of the retrotransposase, and sequences of 5 and 3 untranslated regions to allow the retrotransposase to bind the template RNA, and the full transposon nucleic acid sequence. In some embodiments, a 5 UTR of any of Tables 1-3 allows the retrotransposase to bind the template RNA. In some embodiments, a 3 UTR of any of Tables 1-3 allows the retrotransposase to bind the template RNA. Thus, in some embodiments, a polypeptide for use in any of the systems described herein can be a polypeptide of any of Tables 1-3 herein, or a sequence having at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity thereto. In some embodiments, the system further comprises one or both of a 5 or 3 untranslated region of any of Tables 1-3 herein (or a sequence having at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity thereto), e.g., from the same transposon as the polypeptide referred to in the preceding sentence, as indicated in the same row of the same table. In some embodiments, the system comprises one or both of a 5 or 3 untranslated region of any of Tables 1-3 herein, e.g., a segment of the full transposon sequence that encodes an RNA that is capable of binding a retrotransposase, and/or the sub-sequence provided in the column entitled Predicted 5 UTR or Predicted 3 UTR.
(48) In some embodiments, a polypeptide for use in any of the systems described herein can be a molecular reconstruction or ancestral reconstruction based upon the aligned polypeptide sequence of multiple retrotransposons. In some embodiments, a 5 or 3 untranslated region for use in any of the systems described herein can be a molecular reconstruction based upon the aligned 5 or 3 untranslated region of multiple retrotransposons. A skilled artisan can, based on the Accession numbers provided herein, align polypeptides or nucleic acid sequences, e.g., by using routine sequence analysis tools as Basic Local Alignment Search Tool (BLAST) or CD-Search for conserved domain analysis. Molecular reconstructions can be created based upon sequence consensus, e.g. using approaches described in Ivics et al., Cell 1997, 501-510; Wagstaff et al., Molecular Biology and Evolution 2013, 88-99. In some embodiments, the retrotransposon from which the 5 or 3 untranslated region or polypeptide is derived is a young or a recently active mobile element, as assessed via phylogenetic methods such as those described in Boissinot et al., Molecular Biology and Evolution 2000, 915-928.
(49) Table 3 (below) shows exemplary GENE WRITER? proteins and associated sequences from a variety of retrotransposases, identified using data mining. Column 1 indicates the family to which the retrotransposon belongs. Column 2 lists the element name. Column 3 indicates an accession number, if any. Column 4 lists an organism in which the retrotransposase is found. Column 5 lists the DNA sequence of the retrotransposon. Column 6 lists the predicted 5 untranslated region, and column 7 lists the predicted 3 untranslated region; both are segments of the sequence of column 5 that are predicted to allow the template RNA to bind the retrotransposase of column 8. (It is understood that columns 5-7 show the DNA sequence, and that an RNA sequence according to any of columns 5-7 would typically include uracil rather than thymidine.) Column 8 lists the predicted retrotransposase sequence encoded in the retrotransposon of column 5.
(50) TABLE-US-00003 TABLE3 1. 6. 7. 8. Fam- 2. 3. 4. Predicted Predicted PredictedAmino ily Element Accession Organism 5.DNASequence 5UTR 3UTR AcidSequence R2 R2- Taeniopygia GTCTAGTTACAACTGGGCATCGCTGCAGAGATCGCACCTCCTCGTGGTC GTCTAGTT TTCAGG MASCPKPGPPVSAG 1_TG guttata CCGCTGGTAGCCCTTCGAAGGGTGACTAAGTCGATCTCTGCCCCAGGTA ACAACTGG TTATTTA AMSLESGLTTHSVLAI CGGAGCCGTTGGGACTCACCAGTCCAACGTAACTCCTGCCTAAATTCGG GCATCGCT GATGCT ERGPNSLANSGSDFG TGAAACAAATTCCTCGGTAAAAAGCCCCATGGCTTCTTGCCCGAAACCT GCAGAGAT TAGTTTT GGGLGLPLRLLRVSV GGCCCCCCGGTTTCAGCAGGGGCAATGAGTTTGGAAAGTGGACTGACC CGCACCTC TGTACCT GTQTSRSDWVDLVS ACCCACTCCGTTCTCGCCATCGAACGTGGTCCCAATTCGTTGGCAAATTC CTCGTGGT TTCTTGT WSHPGPTSKSQQVD CGGATCAGACTTTGGGGGGGGGGGTCTGGGGCTACCGTTACGCCTATT CCCGCTGG TTTGTTT LVSLFPKHRVDLLSKN GAGGGTATCGGTCGGCACTCAGACCTCCCGCTCCGACTGGGTAGACCTG TAGCCCTT AGGATT DQVDLVAQFLPSKFP GTGTCCTGGAGCCACCCAGGACCCACGTCTAAGTCCCAGCAGGTTGACC CGAAGGG TTGATA PNLAENDLALLVNLE TGGTGTCTTTATTTCCTAAACACCGGGTTGACCTGTTATCCAAAAACGAC TGACTAAG GTGTTA FYRSDLHVYECVHFA CAGGTAGACCTGGTGGCTCAATTTTTACCATCTAAATTTCCCCCCAATTT TCGATCTC GTATTTT AHWEGLSGLPEVYE GGCAGAAAATGATTTGGCTTTGCTGGTGAACTTAGAGTTCTACAGATCG TGCCCCAG TATATTT QLAPQPCVGETLHSS GATTTGCATGTGTATGAGTGTGTTCATTTTGCTGCACATTGGGAGGGAT GTACGGA TTGTAC LPRDSELFVPEEGSSE TAAGTGGTTTGCCTGAGGTGTATGAACAACTTGCACCACAACCGTGTGT GCCGTTGG GATTGC KESEDAPKTSPPTPG GGGAGAAACTTTACATTCTAGCCTCCCACGAGACAGTGAACTGTTTGTG GACTCACC ATAATG KHGLEQTGEEKVMV CCTGAAGAGGGGAGCAGCGAGAAGGAGAGCGAGGACGCGCCAAAAAC AGTCCAAC TTCTTTT TVPDKNPPCPCCGTR ATCTCCTCCGACGCCTGGGAAACATGGTTTGGAACAGACTGGGGAGGA GTAACTCC TTATACA VNSVLNLIEHLKVSH AAAAGTGATGGTGACTGTTCCTGACAAAAATCCACCTTGTCCTTGCTGTG TGCCTAAA GTTCTGT GKRGVCFRCAKCGKE GTACCCGGGTAAACTCTGTGTTGAATCTGATTGAACATCTGAAAGTGTC TTCGGTGA TTTAATA NSNYHSVVCHFPKCR ACACGGGAAAAGGGGGGTTTGTTTTCGGTGTGCAAAATGTGGAAAGGA AACAAATT AAATAG GPETEKAPAGEWICE AAATAGTAACTATCACAGTGTTGTTTGTCATTTTCCAAAATGCAGGGGTC CCTCGGTA ACGATA VCNRDFTTKIGLGQH CAGAGACGGAGAAAGCCCCAGCTGGGGAGTGGATTTGTGAGGTATGC AAAAGCCC GCTAGA KRLAHPAVRNQERIV AACAGAGATTTTACAACCAAAATTGGCCTGGGACAACACAAGAGATTG C(SEQID GACGTT ASQPKETSNRGAHKR GCACACCCAGCAGTGAGAAATCAGGAAAGGATCGTTGCTTCCCAACCG NO:1140) AGGGCA CWTKEEEELLIRLEAQ AAAGAAACATCAAATAGAGGTGCTCACAAAAGGTGCTGGACAAAGGAG GCCACA FEGNKNINKLIAEHIT GAGGAAGAATTACTAATAAGACTGGAGGCTCAGTTCGAGGGAAACAAA AGCCAG TKTAKQISDKRRLLSR AATATTAATAAGCTTATTGCAGAACACATAACCACCAAAACAGCTAAGC TTAGGT KPAEEPREEPGTCHH AGATCAGTGACAAAAGGCGATTGCTGTCCAGAAAGCCAGCAGAGGAGC AGCGGA TRRAAASLRTEPEMS CACGTGAGGAGCCTGGAACGTGTCATCACACCAGGAGAGCAGCTGCGA TAGTAG HHAQAEDRDNGPG GCCTGAGAACGGAGCCTGAGATGAGTCATCACGCCCAGGCAGAGGACA GTAGGA RRPLPGRAAAGGRT GAGATAATGGACCTGGGAGACGCCCTCTGCCAGGCAGGGCAGCTGCCG ACAGAC MDEIRRHPDKGNGQ GAGGGAGAACAATGGACGAGATAAGACGCCACCCTGATAAGGGCAAC TTTTACT QRPTKQKSEEQLQAY GGACAGCAGAGACCCACCAAGCAAAAATCAGAAGAACAGCTGCAGGCT ATTTCAT YKKTLEERLSAGALNT TACTATAAAAAGACACTAGAGGAACGACTTTCAGCTGGGGCACTTAACA AACGCG FPRAFKQVMEGRDIK CCTTCCCCCGAGCATTCAAGCAGGTAATGGAAGGCCGGGATATAAAGCT TCAATTA LVINQTAQDCFGCLE AGTAATCAATCAGACAGCGCAGGACTGCTTCGGATGCCTGGAATCCATA CCACCT SISQIRTATRDKKDTV AGCCAAATAAGAACGGCAACCCGAGATAAAAAGGACACGGTGACCCGG GATTTG TREKHPKKPFQKWM GAGAAACACCCAAAGAAACCTTTTCAGAAGTGGATGAAGGACAGAGCA GACCAA KDRAIKKGNYLRFQR ATCAAAAAAGGTAATTATCTTCGGTTCCAGCGTTTATTTTATCTTGATAG TTCACG LFYLDRGKLAKIILDDI AGGGAAACTGGCTAAAATCATTTTAGATGATATTGAATGCTTGTCTTGT GGATTT ECLSCDIPLSEIYSVFK GACATACCACTCAGTGAAATTTATTCGGTTTTTAAAACAAGATGGGAAA GTCCAA TRWETTGSFKSLGDF CAACTGGTAGCTTTAAAAGCCTTGGGGACTTTAAAACTTACGGGAAGGC GGTGGA KTYGKADNTAFRELIT TGACAACACTGCCTTCAGAGAATTAATTACGGCTAAAGAAATTGAGAAA CGGGCC AKEIEKNVQEMSKGS AATGTGCAGGAAATGAGCAAAGGCTCGGCTCCCGGTCCAGACGGGATT ACCTTTA APGPDGITLGDVVK ACTCTTGGGGACGTCGTAAAGATGGATCCCGAGTTTTCCCGGACCATGG CTTAACC MDPEFSRTMEIFNL AGATTTTCAATTTATGGTTAACAACTGGTAAAATCCCGGACATGGTGAG CGGAAA WLTTGKIPDMVRGC GGGGTGCAGAACCGTTTTGATTCCAAAATCATCAAAGCCGGATCGTTTG AGGAAC RTVLIPKSSKPDRLKDI AAAGACATTAATAACTGGAGACCTATCACGATCGGTTCCATCTTGCTGA ATATATA NNWRPITIGSILLRLF GACTGTTCTCCAGGATTGTAACAGCTAGGCTGAGCAAAGCGTGCCCCCT ATTTATG SRIVTARLSKACPLNP GAACCCAAGGCAAAGAGGCTTTATCAGAGCGGCGGGATGCTCTGAAAA TGTGTTC RQRGFIRAAGCSENL CTTAAAACTCCTGCAAACTATAATTTGGTCGGCCAAAAGAGAACACAGA GATAAA KLLQTIIWSAKREHRP CCACTGGGTGTTGTATTCGTGGACATCGCCAAGGCTTTTGACACCGTAA (SEQID LGVVFVDIAKAFDTV GCCACCAGCACATCATTCATGCTTTGCAGCAAAGAGAGGTGGATCCCCA NO: SHQHIIHALQQREVD CATCGTCGGTCTGGTGAGCAATATGTACGAGAACATCAGTACGTATATC 1263) PHIVGLVSNMYENIS ACCACAAAGAGGAACACACACACAGACAAAATCCAGATCCGGGTTGGA TYITTKRNTHTDKIQI GTAAAGCAGGGTGACCCGATGTCGCCCCTTTTATTTAACCTGGCAATGG RVGVKQGDPMSPLL ACCCTCTATTATGCAAGCTGGAAGAGAGTGGCAAAGGATACCACCGAG FNLAMDPLLCKLEES GACAGAGCAGCATCACAGCGATGGCATTTGCAGACGATCTGGTTTTGCT GKGYHRGQSSITAM GAGCGACTCCTGGGAAAATATGAATACAAATATTAGCATACTGGAGACC AFADDLVLLSDSWEN TTCTGCAATCTGACCGGTCTCAAAACACAGGGGCAAAAGTGCCACGGCT MNTNISILETFCNLTG TTTACATCAAGCCGACAAAGGACTCTTACACCATCAATGACTGCGCTGCC LKTQGQKCHGFYIKP TGGACTATCAACGGCACACCCCTGAACATGATCGACCCCGGCGAATCTG TKDSYTINDCAAWTI AGAAATACCTCGGCCTGCAGTTTGACCCGTGGATTGGAATAGCAAGGTC NGTPLNMIDPGESEK CGGTCTCTCCACAAAACTAGATTTTTGGCTTCAGCGGATCGATCAAGCAC YLGLQFDPWIGIARS CACTTAAACCTCTGCAGAAAACTGATATTCTCAAAACATACACCATCCCT GLSTKLDFWLQRIDQ CGGCTGATCTACATAGCTGACCACTCAGAAGTGAAAACTGCACTACTCG APLKPLQKTDILKTYTI AAACCCTTGACCAGAAGATCCGGACAGCGGTCAAGGAATGGCTTCACCT PRLIYIADHSEVKTALL ACCTCCGTGCACCTGCGATGCCATCCTGTACTCGAGCACGAGAGACGGC ETLDQKIRTAVKEWL GGTTTGGGCATCACCAAATTGGCAGGACTGATCCCCAGCGTGCAGGCCC HLPPCTCDAILYSSTR GTAGACTGCATCGGATCGCACAGTCATCTGACGATACGATGAAATGCTT DGGLGITKLAGLIPSV CATGGAAAAAGAGAAAATGGAACAGCTGCATAAGAAATTGTGGATTCA QARRLHRIAQSSDDT AGCTGGAGGGGACAGAGAGAACATACCCTCGATTTGGGAAGCACCACC MKCFMEKEKMEQL GTCGAGTGAACCACCAAACAACGTGAGCACAAATTCGGAATGGGAAGC HKKLWIQAGGDRENI ACCGACCCAGAAAGATAAATTTCCAAAGCCTTGCAATTGGAGGAAAAAC PSIWEAPPSSEPPNN GAATTCAAAAAATGGACCAAATTGGCATCCCAAGGCCGCGGAATTGTAA VSTNSEWEAPTQKD ATTTTGAAAGAGACAAAATTAGTAACCATTGGATCCAATACTACAGACG KFPKPCNWRKNEFKK CATACCTCACAGGAAACTCCTCACTGCACTACAACTCAGGGCCAACGTTT WTKLASQGRGIVNFE ACCCCACGAGAGAATTTCTAGCCAGGGGTAGACAAGACCAATACATCAA RDKISNHWIQYYRRI GGCGTGTAGGCACTGCGATGCGGACATTGAATCCTGCGCCCACATCATC PHRKLLTALQLRANV GGCAACTGCCCAGTGACACAGGACGCCCGAATCAAGAGGCACAATTAC YPTREFLARGRQDQY ATCTGCGAACTGCTTCTCGAGGAGGCGAAGAAGAAGGACTGGGTAGTG IKACRHCDADIESCA TTCAAGGAACCGCACATAAGGGATTCCAACAAGGAACTGTACAAACCTG HIIGNCPVTQDARIKR ACCTGATATTTGTGAAGGATGCCCGTGCACTTGTCGTGGATGTGACAGT HNYICELLLEEAKKKD ACGGTATGAAGCAGCCAAATCATCGCTGGAGGAAGCCGCTGCAGAGAA WVVFKEPHIRDSNKE AGTGAGAAAGTACAAACACCTGGAAACGGAAGTAAGACATCTCACGAA LYKPDLIFVKDARALV TGCAAAGGACGTTACTTTTGTGGGCTTTCCCCTAGGAGCGCGGGGGAA VDVTVRYEAAKSSLE ATGGCACCAAGATAACTTTAAACTTTTGACTGAGCTTGGCCTCTCCAAAT EAAAEKVRKYKHLET CGAGGCAAGTGAAAATGGCAGAGACTTTTTCCACAGTAGCGCTCTTTTC EVRHLTNAKDVTFVG ATCTGTGGACATTGTACATATGTTTGCCAGTAGGGCCAGAAAATCTATG FPLGARGKWHQDNF GTTATGTAATTCAGGTTATTTAGATGCTTAGTTTTTGTACCTTTCTTGTTT KLLTELGLSKSRQVK TGTTTAGGATTTTGATAGTGTTAGTATTTTTATATTTTTGTACGATTGCAT MAETFSTVALFSSVDI AATGTTCTTTTTTATACAGTTCTGTTTTAATAAAATAGACGATAGCTAGA VHMFASRARKSMV GACGTTAGGGCAGCCACAAGCCAGTTAGGTAGCGGATAGTAGGTAGGA M(SEQIDNO: ACAGACTTTTACTATTTCATAACGCGTCAATTACCACCTGATTTGGACCA 1016) ATTCACGGGATTTGTCCAAGGTGGACGGGCCACCTTTACTTAACCCGGA AAAGGAACATATATAATTTATGTGTGTTCGATAAA(SEQIDNO:1539) R2 R2- Geospiza AGACTTAAGTGAGTTTGGTTACAACTGGGCATAGCTGCAGAGACCGCG AGACTTAA GGTAGA VGLCPSPGVDGTHQ 1_Gfo fortis CCTCCTCGCGGCCCCGCTGGTAAGCCCTTAACAGGGTGACTAAGTCGGT GTGAGTTT TAATCTT PNDSFQNFGETNFSV CTCTGCCCCAGTCCGGGAGTCGATGGGACTCACCAGCCCAACGATTCCT GGTTACAA TGTATA QVARLVTRNLAPRSV TCCAAAATTTCGGTGAAACAAATTTCTCGGTGCAAGTCGCAAGGCTTGT CTGGGCAT GTGGGG RGNGFGSGMATHPV CACCCGAAACCTAGCCCCCCGGTCGGTCAGGGGCAACGGGTTCGGAAG AGCTGCAG GGGGAT PADESGHESDPFLVG TGGGATGGCCACCCACCCCGTTCCCGCAGACGAATCCGGCCATGAATCT AGACCGCG CTCATGT RSCGQPARLTRQSVG GATCCATTCCTTGTAGGGAGGAGCTGCGGACAACCGGCACGCCTTACTA CCTCCTCG ACCGGG TQTSRDDILPSKTTKL GGCAATCGGTTGGCACCCAGACCTCCCGAGATGATATTTTACCATCTAA CGGCCCCG TTTCTTT TENELDLLVNFSLELY AACCACCAAATTGACAGAGAATGAATTGGACTTGCTGGTGAACTTTTCT CTGGTAAG TATTTGA RSDLQGFVQEGIHFS TTAGAATTGTATAGGTCAGATCTGCAGGGATTTGTGCAGGAGGGGATTC CCCTTAAC TTTTCAA VNREVLEGFPEVYEQ ATTTTTCTGTGAATAGGGAGGTGTTAGAGGGGTTTCCTGAGGTGTATGA AGGGTGA TAAAAC PAPQPAVGDDLNTSL ACAACCTGCACCACAACCGGCAGTAGGGGACGATTTAAACACCAGTCTC CTAA(SEQ AGACGG PPDNNICVLEKGSSE CCACCGGACAATAATATATGCGTACTTGAGAAGGGTAGCAGTGAAGCA IDNO: TAGCTA AVEDGTPEVAHPVPE GTGGAGGATGGCACACCGGAGGTAGCGCACCCCGTGCCTGAAACCCAG 1141) GGTTCG TQGKESPNNIVMVTL GGCAAAGAGTCACCGAATAACATCGTGATGGTAACTCTTCCCAACAAAA CAAGGC PNKNPPCPCCRVRLH ATCCACCATGTCCTTGCTGTAGGGTCAGACTGCATTCAGTACTGGCTCTG AGCCAC SVLALIEHLKGSHGKK ATTGAACATCTTAAGGGGTCGCATGGGAAGAAGAGGGCATGCTTTAGG AAGCCA RACFRCVKCGRENFN TGTGTCAAGTGTGGGAGGGAGAACTTTAACTATCATAGTACTGTTTGTC AAGATA YHSTVCHIAKCKGPK ACATCGCAAAATGCAAGGGACCAAAAGTTGAGAAGGCCCCAGTGGGAG GGTAGG VEKAPVGEWICEVCG AGTGGATCTGTGAGGTATGTGGTAGGGACTTTACAACCAAAATCGGCCT GTGCTC RDFTTKIGLGQHKRL GGGACAACATAAAAGATTGGCACATCCCTTGGTTAGAAACCAAGAAAG ATAGTG AHPLVRNQERIDASQ GATCGATGCTTCCCAACCGAAGGAGACATCAAACAGAGGAGCCCACAA AGTAGG PKETSNRGAHKRCW GAGATGTTGGACAAAAGAGGAGGAGGAGATGCTGATAAAGTTGGAGG GACAGT TKEEEEMLIKLEVQFE TACAGTTCGAGGGACACAGAAACATCAATAAGCTTATCGCGGAACACTT GCCTTTT GHRNINKLIAEHLTTK AACAACTAAAACATCCAAACAGATTAGTGATAAAAGGAGACTATTACCC GATTCA TSKQISDKRRLLPRKQ AGAAAACAATTAACAGATCTAAGTAAGGGAGTGGCTGGACAGAAGGTG CAACGC LTDLSKGVAGQKVLD CTGGACCCAGGACTGAGTCATCAACCCCAGCTGGGGGTAGTTGACAAT GTCAAT PGLSHQPQLGVVDN GGACTTGGTGGGGGTCATCTGCCAGGGGGGCCAGCTGCTGAAGGAAG ACCATCT GLGGGHLPGGPAAE AACAATAGAGCCATTAGGACACCACCTTGATAAGGATAACGGTCACCGG GACACG GRTIEPLGHHLDKDN GAAATCGCTGACCAGCACAAGGCAGGGAGGCTGCAGGCCCATTACCGA GATACC GHREIADQHKAGRL AAGAAGATAAGGAAGCGCCTTTCAGAAGGGATGATTAGCAACTTCCCC CTTACCG QAHYRKKIRKRLSEG GAAGTATTTGAACAACTACTGGACTGCCAGGAAGCACAACCATTGATCA GACTTG MISNFPEVFEQLLDC ATCAAGCAGCGCAGGATTGCTTTGGATGCCTGGATTCAGCAAGCCAGAT TCATGAT QEAQPLINQAAQDC AAGGAAGGCGCTCCGAAAACAGAACACACAGAAAGACCAGGGGGATC CTCCCA FGCLDSASQIRKALRK AACCCAAAAGACCAGCTCAGAAGTGGATGAAAAAAAGAGCAGTTAAGA GACTTG QNTQKDQGDQPKR GGGGTCACTTCCTCCGCTTTCAGAAATTATTTCATCTTGACAGGGGGAA TCCAAG PAQKWMKKRAVKR ATTGGCAAAGATTATTTTGGACGACGTAGAGTGTTTGTCCTGTGATATA GTGGAC GHFLRFQKLFHLDRG CCACCCAGTGAAATTTATTCGGTATTCAAAGCCCGATGGGAAACACCTG GGGCCA KLAKIILDDVECLSCDI GACAGTTTGCTGGCCTTGGGGATTTCGAAATTAATAGGAAGGCGAACA CCTTTAC PPSEIYSVFKARWETP ATAAAGCCTTCAGGGACTTAATTACGGCCAAAGAAATTCTCAAAAATGT TTAACCC GQFAGLGDFEINRKA GCGGGAGATGACCAAGGGCTCGGCCCCAGGTCCAGATGGGATCGCGCT GGAAAA NNKAFRDLITAKEILK TGGGGACATCAGGAAGATGGACCCTGAGTACACCCGGACCGCCGAACT GGAACA NVREMTKGSAPGPD CTTCAACTTATGGTTAACATCTGGTGAGATCCCGGACATGGTGAGGGGG TATATTA GIALGDIRKMDPEYT TGCAGAACTGTGTTAATCCCCAAATCGTCAAAACCGGAACGCCTGAAGG ATTATAT RTAELFNLWLTSGEIP ACATCAATAACTGGAGACCCATCACGATTGGATCCATCTTGCTGAGACTT GTGTTC DMVRGCRTVLIPKSS TTCTCCAGGATCATAACAGCGAGGTTAACAAAGGCGTGCCCCCTCAACC GGAAAA KPERLKDINNWRPITI CTAGGCAAAGAAGCTTCATCAGTGCGGCAGGATGCTCCGAGAACTTGA (SEQID GSILLRLFSRIITARLTK AGCTCCTGCAAACCATAATTCGGACTGCTAAAAATGAACACAGACCACT NO: ACPLNPRQRSFISAA GGGTGTTGTATTCGTGGACATCGCCAAGGCCTTTGACACCGTGAGCCAC 1264) GCSENLKLLQTIIRTA CAACACATCATACATGTATTGCAAAGGAGGAGAGTGGACCCCCACATCA KNEHRPLGVVFVDIA TTGGATTGGTGAAAAATATGTACAAAGACATCAGTACGGTTATCACCAC KAFDTVSHQHIIHVL AAAGAAGAACACATACACGGACAAAATCCAGATCCAGGTTGGAGTGAA QRRRVDPHIIGLVKN GCAAGGTGATCCGCTTTCGCCCCTTCTATTCAACCTGGCGATGGACCCCC MYKDISTVITTKKNTY TGTTGTGCAAGCTGGAAGAACACGGCAAAGGATTCCACCGAGGACAGA TDKIQIQVGVKQGDP GCAAGATAACAGCGATGGCATTCGCTGATGACCTGGTCCTGTTGAGCGA LSPLLFNLAMDPLLCK TTCCTGGGAAGACATGAATGCGAACATCAAGATACTGGAGACCTTCTGC LEEHGKGFHRGQSKI GACCTCACCGGTCTCAAAACACAGGGTCAAAAGTGCCACGGCTTCTACA TAMAFADDLVLLSDS TCAAGCCTACAAAGGACTCTTACACCGTCAACAACTGCGCTGCGTGGAC WEDMNANIKILETFC CATCAATGGCACACCCCTGAACATGATCAACCCCGGGGAATCAGAGAAA DLTGLKTQGQKCHG TACCTCGGCCTGCAGTTTGACCCCTGGGTGGGAATTGCAAAGACCAGCC FYIKPTKDSYTVNNCA TCCCCGAAAAACTGGACTTCTGGCTCGAACGCATTGATCGAGCTCCACT AWTINGTPLNMINP CAAACCATTTCAGAAACTGGACATTCTTAAGACATACACCATACCTCGAC GESEKYLGLQFDPWV TGACCTACGTAGCTGACCACTCAGAGATGAAAGCGGGGGCCCTTGAAG GIAKTSLPEKLDFWLE CCCTTGACCGGACAATTCGATCGGCGGTCAAGGACTGGCTGCACCTACC RIDRAPLKPFQKLDIL TTCGAGCACCTGTGATGCCATCTTGTACACGAGCATGAAGGACGGTGGT KTYTIPRLTYVADHSE TTGGGAGTGACCAAATTGGTGGGACTGATTCCGAGTGTACAAGCCCGG MKAGALEALDRTIRS AGGCTGCACAGGATTGCGCAGTCACCGGAGGAGACGATGAAAGACTTC AVKDWLHLPSSTCDA CTGGAAAAGGCCCAGATGGAGAAGATGTACGAGAAATTGTGGGTCCAA ILYTSMKDGGLGVTK GCTGGAGGGAAAAGAAAGAGGATGCCGTCAATTTGGGAAGCGCTCCC LVGLIPSVQARRLHRI GGAGGTTGTACCATCCATAGACACAGCCACAACTTCGGAGTGGGAAGC AQSPEETMKDFLEKA ACCGAACCCTAAAAGTAAGTACCCTAGACCTTGTAATTGGCGCAGAAAA QMEKMYEKLWVQA GAATTTAAAAAGTGGACTAAATTAATAGCCCAGGGCTGGGGAATTAGG GGKRKRMPSIWEAL TGTTTTAAGGGGGACAAAATTAGTAACAATTGGATTCGACATTATAGAT PEVVPSIDTATTSEW ACATACCTCACAGGAAACTTCTCACTGCCATACAGCTCCGGGCCAGTGT EAPNPKSKYPRPCN GTACCCCACAAGGGAATTTCTCGCGCGGGGGAGGGAAGATAACTGTGT WRRKEFKKWTKLIA TAAGTCTTGTAGGCACTGTGAGGCGGCAGAGGAGTCCTGTGCCCACATC QGWGIRCFKGDKISN ATCGGCATGTGTCCAGTCGTGAGGGATGCCCGAATCAAGAGGCACAAT NWIRHYRYIPHRKLLT CGCATTTGCGAGAGGCTGATGGAGGAGGCGGGGAAGAGGGACTGGAC AIQLRASVYPTREFLA GGTGTTTCAGGAGCCGCACATAAGGGACGTCACCAAGGAACTGTACAA RGREDNCVKSCRHCE ACCGGACTTGATATTCGTGAAAGAAGGCCTTGCACTTGTTGTGGATGTT AAEESCAHIIGMCPV ACAATACGGTTCGAGTCAACCAAGACAACGTTGGAGGAGGCTGCTGCA VRDARIKRHNRICERL GAGAAGGTGAACAAGTACAAACATCTGGAGACCGAAGTACGGAACCTC MEEAGKRDWTVFQE ACCAACGCTAAGGACGTTATCTTTATGGGGTTTCCCCTTGGAGCGCGGG PHIRDVTKELYKPDLI GACAATGGTACAATAAGAACTTTGAACTTTTGGACACTCTTGGCCTCCCC FVKEGLALVVDVTIRF AGATCGAGGCAGGACATTATTGCAAAGACTTTATCCACGGACGCGCTCA ESTKTTLEEAAAEKVN TTTCATCTGTGGACATTATACATATGTTTGCCAGTAGAGGCAGAAGACA KYKHLETEVRNLTNA GCATGCTTAGGGTAGATAATCTTTGTATAGTGGGGGGGGATCTCATGTA KDVIFMGFPLGARG CCGGGTTTCTTTTATTTGATTTTCAATAAAACAGACGGTAGCTAGGTTCG QWYNKNFELLDTLGL CAAGGCAGCCACAAGCCAAAGATAGGTAGGGTGCTCATAGTGAGTAGG PRSRQDIIAKTLSTDA GACAGTGCCTTTTGATTCACAACGCGTCAATACCATCTGACACGGATACC LISSVDIIHMFASRGR CTTACCGGACTTGTCATGATCTCCCAGACTTGTCCAAGGTGGACGGGCC RQHA(SEQIDNO: ACCTTTACTTAACCCGGAAAAGGAACATATATTAATTATATGTGTTCGGA 1386) AAA(SEQIDNO:1540) R2 R2- Zonotrichia CGACTTGAGAAGGTCTGGTTACAACTGGGCATAGCTGCAGAGATCGCG CGACTTGA GTAGTC NKFLGKSRVAYCLKP 1_ZA albicollis CCTCCTCGTGGCCCCGCTGGTAAGCCCTTAACAGGGTGACTAAGTCGAT GAAGGTCT ACATTG GPPVSDRGKEFGSGL CTCTGCCCCAGTCCAGGAGCCGCTGGGTTTCACCAGCCCAGCGATTCCTT GGTTACAA CACTTTC TTHPEPESESGHDPT CCAAATTCGGTGAAACAAATTCCTCGGTAAAAGCCGCGTGGCTTATTGC CTGGGCAT TGTAACT VPNPGPSLGAGEGA CTGAAACCTGGCCCCCCGGTTTCAGACAGGGGCAAAGAGTTCGGAAGT AGCTGCAG TGCACT QPLPLLRVSVGTQTC GGACTGACCACCCACCCCGAACCCGAGAGCGAATCTGGTCATGACCCAA AGATCGCG GGGTGT EEDFITSRPTKLPGIES CTGTCCCAAATCCTGGTCCGTCTCTTGGAGCGGGGGAAGGTGCACAGCC CCTCCTCG GGGATG ELGPLVKFSLEVYRSD ACTACCCTTACTCAGGGTATCGGTGGGCACCCAAACCTGTGAAGAGGAC TGGCCCCG TGGGCC LKGDVQFEGIHFPDN TTTATAACATCTAGACCAACCAAATTACCCGGAATTGAATCAGAATTAGG CTGGTAAG TGGGGT WGVLEGFPEVYEQL CCCGCTGGTGAAGTTTTCTTTAGAGGTTTACAGGTCAGATCTTAAGGGG CCCTTAAC GTGGGT APQPNGGDELNHSL GATGTGCAATTTGAGGGGATTCATTTTCCAGATAATTGGGGGGTACTGG AGGGTGA TATGGG PGDREGDVLEKDSSE AGGGGTTTCCTGAGGTGTACGAACAACTGGCACCACAGCCAAACGGGG CTAAGTCG GTATAT KEKEAAPEALPSVQR GAGACGAGTTAAATCATAGTCTCCCAGGGGACAGGGAGGGGGATGTAC ATCTCTGC ATGTGG ARSEQLPDNIVKVTV TTGAGAAGGATAGCAGCGAAAAGGAGAAGGAGGCTGCACCAGAGGCA CCCAGTCC GATATTC PDKNPPCPCCGVRLN TTGCCCTCAGTGCAAAGGGCCCGCAGTGAACAGTTGCCAGATAACATCG AGGAGCC TGGTGG SVLALIEHLKGSHGRR TAAAGGTGACTGTTCCCGACAAAAATCCACCATGTCCCTGCTGTGGTGT GCTGGGTT GAATGT RVCFRCAKCGRENFN CCGCTTAAACTCAGTGTTAGCTCTGATTGAACATCTGAAGGGCTCACAC TCACCAGC CCATTCA HHSTVCHYAKCKGP GGGAGGAGGAGGGTGTGCTTTAGGTGTGCCAAATGTGGGAGGGAGAA CCAGCGAT CTGTAT QIERPPVGEWICEVC TTTTAACCACCATAGTACTGTTTGTCATTACGCAAAGTGCAAAGGTCCAC TCCTTCCA GCCTATC GRDFTTKIGLGQHKR AGATTGAAAGGCCACCAGTGGGAGAGTGGATCTGTGAGGTATGCGGA AATTCGGT TTTTTAA HMHAMVRNQERID AGGGACTTCACGACCAAAATTGGCCTGGGACAACACAAAAGACATATG GA(SEQ TAAAAA ASQPKETSNRGAHKR CATGCAATGGTGAGAAACCAGGAAAGGATCGATGCTTCCCAACCGAAA IDNO: GACGGT CWTKEEEELLMKLEV GAGACATCAAATCGAGGAGCCCACAAGAGGTGCTGGACGAAGGAGGA 1142) AGCTAG QFENHKNINKLIAEQ GGAAGAACTGCTCATGAAGTTGGAGGTACAGTTTGAGAATCACAAAAA GTTCGC LTTKTAKQISDKRRM CATCAATAAGCTTATCGCAGAGCAATTAACAACTAAAACAGCTAAACAA GAAGCA LLKKGRGTTGNLETE ATTAGTGATAAAAGGAGAATGCTGCTCAAAAAAGGTAGGGGGACAACT GCCACA PGMSHQSQAKVKD GGTAATTTGGAAACAGAGCCTGGGATGAGTCATCAATCGCAGGCAAAA AGCCAA NGLGGDHLPGGPVV GTTAAGGACAATGGACTGGGTGGGGACCATCTGCCGGGAGGACCAGTT TAGCCA DKGTIGKPGQHLDTD GTCGATAAGGGAACAATAGGGAAGCCAGGACAACATCTTGACACAGAT GTTAGG NSHQITAGKKKGGGL AACAGCCATCAAATAACTGCTGGCAAGAAGAAAGGGGGAGGGCTGCA TAGCTC QARYRRRIMKRLAAG GGCTCGTTATAGAAGGAGAATAATGAAACGATTAGCGGCCGGGACAAT ATAGTG TINIFPKVFKELINDQE TAACATCTTCCCCAAAGTGTTTAAAGAACTGATTAACGACCAAGAGGCG GGTAGG ARPLINQTTEDCFGLL AGACCGCTAATCAATCAAACAACAGAAGACTGCTTTGGCCTCTTGGACT TGACAG DSACQIRTALREKGK CTGCATGCCAAATTAGAACGGCACTCCGGGAGAAGGGCAAATCTCAGG GAACCT SQEERPRKQYQKW AGGAACGACCAAGAAAACAGTATCAGAAGTGGATGAAGAAGAGAGCG TTGACTC MKKRAIKRGDYLRFQ ATTAAAAGGGGGGACTATCTCCGCTTCCAGCGATTATTCCATCTAGACA AGAACG RLFHLDRGKLARIILD GGGGGAAACTGGCGAGAATTATCTTGGACAACACTGAGAGCTTGTCTT CGTCCAT NTESLSCDISPSEIYSV GCGATATATCACCCAGTGAAATTTATTCGGTATTCAAGGCCAGATGGGA TAACATC FKARWETPGHFNGL AACACCTGGACACTTCAACGGCCTTGGGGACTTTGAAATTAAAGGGAAG TAGAAC GDFEIKGKANNKAFR GCCAACAACAAAGCCTTCAGGGACTTCATCACGGCTAAAGAAATTGAAA GGACCA DFITAKEIEKNVREMS AGAACGTGCGGGAAATGAGTAAGGGTTCGGCGCCAGGTCCAGATGGG AACTTC KGSAPGPDGIALGDI ATCGCCCTTGGGGACATCAAGAAGATGGATCCCGGGTATTCCCGGACC GGACAT KKMDPGYSRTAELFN GCCGAGCTATTCAACTTGTGGCTGACAGCTGGTGACATCCCGGACATGG GCACCG LWLTAGDIPDMVRG TGAGGGGGTGCAGGACTGTTTTGATCCCGAAATCGACGACACCGGAGC ATTAACC CRTVLIPKSTTPERLK GCCTAAAGGACATCAACAACTGGAGACCCATCACGATTGGTTCCATCTT GGATTT DINNWRPITIGSILLRL GCTAAGGCTGTTCTCCAGGATCATAACGGCGAGGATGACTAAGGCGTG GTCCAA FSRIITARMTKACPLN CCCCCTCAACCCGAGACAGAGAGGCTTCATCAGTGCGCCGGGATGCTCT GGTGGA PRQRGFISAPGCSEN GAGAACCTGAAACTCCTGCAATCTATAATTCGGACTGCCAAAAATGAGC CGGGCC LKLLQSIIRTAKNEHK ACAAGCCGCTGGGTGTTATTTTCGTGGACATTGCTAAGGCTTTTGACACC ACCTTTA PLGVIFVDIAKAFDTV GTGAGCCACCAACACATCATACACGTTTTACAGCAACGGAGGGTTGACC CTTAACC SHQHIIHVLQQRRVD CCCACATTGTTGGACTGGTGAACAATATGTACAAGGACATCAGTACGTA CGGAAA PHIVGLVNNMYKDIS TGTCACCACAAAGAAGAACACACACACGGACAAAATCCAGATCCGGGTT GGGAAC TYVTTKKNTHTDKIQI GGAGTGAAGCAGGGTGACCCACTATCACCCCTTCTATTCAACTTGGCAA ATATATA RVGVKQGDPLSPLLF TGGACCCCCTGTTGTGTAAGCTGGAAGAAAGTGGCAAAGGATTCCATC GTTATAT NLAMDPLLCKLEESG GAGGACAGAGCTCAATAACCGCGATGGCGTTCGCCGACGATCTGGTCTT GTGTTC KGFHRGQSSITAMAF GTTAAGCGACTCCTGGGAGAACATGAAAGAGAACATCAAAATACTGGA GTAATA ADDLVLLSDSWENM GACCTTTTGCAATCTCACCGGTCTCAAAACACAGGGTCAGAAGTGCCAC (SEQID KENIKILETFCNLTGLK GGCTTTTACATCAAGCCTACAAAGGACTCTTACACCATCAACAACTGCCC NO: TQGQKCHGFYIKPTK TGCATGGACCATCAACGGCACACCCCTGAACATGATCAACCCCGGGGAG 1265) DSYTINNCPAWTING TCAGAGAAATACCTCGGCCTGCAGATCGACCCATGGACTGGAGTAGCA TPLNMINPGESEKYL AAATACGATCTCTCCACAAAATTGAAAATATGGCTCGAAAGCATTGACC GLQIDPWTGVAKYD GAGCTCCACTTAAACCTCTGCAAAAATTAGACATCCTCAAAACATACACC LSTKLKIWLESIDRAPL ATTCCTCGACTGACCTACCTGGCTGACCATTCAGAGATGAAAGCAGGGG KPLQKLDILKTYTIPRL CTCTGGAAGCACTCGACCAGCAGATTCGAACAGCGGTCAAAGACTGGC TYLADHSEMKAGALE TGCACCTGCCCTCGTGCACCTGTGATGCCATCTTGTACGTGAGCACGAG ALDQQIRTAVKDWL GGACGGCGGTTTGGGTGTTACCAAGTTGGCGGGACTGATTCCAAGTGT HLPSCTCDAILYVSTR GCAAGCCCGGAGGCTGCATCGCATTGCGCAGTCGCCGGACGAGACGAT DGGLGVTKLAGLIPS GAAGGACTTCCTAGAGAAGGCGCAGATGGAGAAGATGTATGAGAAGTT VQARRLHRIAQSPDE ATGGGTTCAAGCTGGAGGCAAAAAGAAGGGGATGCCGTCAATTTGGGA TMKDFLEKAQMEK GGCCCTACCGATGACTGTACCACCCACTAATACAGGTAATCTTTCGGAG MYEKLWVQAGGKK TGGGAAGCACCGAACCCCAAAAGTAAGTACCCAAAACCTTGTGATTGGA KGMPSIWEALPMTV GAAGGAAAGAGCTTAAAAAGTGGACAAAATTGGAGTCCCAAGGTCGTG PPTNTGNLSEWEAP GAGTCAAAAATTTTAGGAATGATACAATTAGTAACGATTGGATCCAATA NPKSKYPKPCDWRR TTATAGACGCATACCTCACAGGAAACTCCTCACTGCCATACAACTCAGG KELKKWTKLESQGRG GCCAATGTATACCCCACAAGGGAATTTCTCGCGCGGGGGAGGGGTGAT VKNFRNDTISNDWIQ AACTATGTTAAGTTTTGTAGGCACTGTGAAGCGGACCTTGAAACCTGTG YYRRIPHRKLLTAIQL GCCATATCATCGGCTTTTGCCCAGTAACGAAGGACGCCCGAATCAAGAG RANVYPTREFLARGR GCACAATCGCATATGCGACAGGCTTTGCGAGGAGGCAGCTAAGAGGGA GDNYVKFCRHCEADL ATGGGTGGTCTTCAAGGAGCCGCACTTGAGGGATGCCACCACGGAACT ETCGHIIGFCPVTKDA GTTTAAACCGGATGTGATATTCGTGAAAGAGGACCGTGCACTGGTTGTG RIKRHNRICDRLCEEA GATGTGACAGTACGATATGAATCAGCCAAGACAACGCTGGAGGCAGCT AKREWVVFKEPHLR GCTATGGAGAAAGTGGACAAGTACAAACATCTGGAGGCAGAAGTGAA DATTELFKPDVIFVKE GGAACTCACCAACGCAAAGGACGTTGTTTTTATGGGGTTCCCCCTTGGA DRALVVDVTVRYESA GCGCGAGGGAAATTCTACAAAGGGAACTTTAACTTGCTAGAGACTCTTG KTTLEAAAMEKVDKY GCCTCCCAAAAACGAGGCAATTGAGTGTGGCAAAGACTCTATCCACGTA KHLEAEVKELTNAKD CGCGCTCATGTCATCTGTGGACATTGTGCATATGTTTGCCAGTAGATCTA VVFMGFPLGARGKF GGAAACCAAATGTCTAGGTAGTCACATTGCACTTTCTGTAACTTGCACTG YKGNFNLLETLGLPKT GGTGTGGGATGTGGGCCTGGGGTGTGGGTTATGGGGTATATATGTGG RQLSVAKTLSTYALM GATATTCTGGTGGGAATGTCCATTCACTGTATGCCTATCTTTTTAATAAA SSVDIVHMFASRSRK AAGACGGTAGCTAGGTTCGCGAAGCAGCCACAAGCCAATAGCCAGTTA PNV(SEQIDNO: GGTAGCTCATAGTGGGTAGGTGACAGGAACCTTTGACTCAGAACGCGT 1387) CCATTAACATCTAGAACGGACCAAACTTCGGACATGCACCGATTAACCG GATTTGTCCAAGGTGGACGGGCCACCTTTACTTAACCCGGAAAGGGAAC ATATATAGTTATATGTGTTCGTAATA(SEQIDNO:1541) R2 R2Dr AB097126 Danio AATCCCCCCTACCCAATCCCCCCGTCGTGACCTCCAGGCCAGGAATCACG AATCCCCC AAATCC MESTAKGKSYWMA rerio AGCGTACGACAGTGGCCATCCGGCAATGACAATAGCGTGACTAACGAC CTACCCAA CAGCGG RRPVEGATEGSLGRV AATGAGTCAGATCCATGACCCTTGGAGTGGGTTAACCTCCGCCTCTTTAA TCCCCCCG GATACA PFVTRDPKRKPEAKR AAACATGGAAAGTACAGCAAAAGGAAAGTCATACTGGATGGCCCGTCG TCGTGACC GCAAGA TLTHGLGLRECSVVLT CCCAGTAGAAGGTGCCACGGAGGGATCTTTGGGTCGGGTCCCTTTCGTA TCCAGGCC AGGTAT RLIEGRRGRDHTPSG ACGCGAGATCCTAAGCGCAAACCAGAGGCTAAACGAACACTTACGCAT AGGAATCA CGGATC WNAQRGMPNDESS GGCTTAGGACTACGAGAATGCTCGGTTGTCTTGACACGCCTCATCGAGG CGAGCGTA TAATAA VEEPNGPIPSNPIPTG GGCGTCGAGGTCGCGATCACACACCATCAGGATGGAACGCACAGCGCG CGACAGTG GGTTGA TQALPEPMADGEQG GCATGCCAAACGACGAAAGCTCGGTCGAGGAGCCCAATGGGCCGATAC GCCATCCG GCGAGG EHPGVVVTLPLRDLN CATCTAACCCCATACCAACGGGCACCCAAGCCCTGCCTGAACCTATGGC GCAATGAC AGAGGG CPLCGGSASTAVKVQ GGACGGGGAGCAGGGGGAGCACCCGGGAGTGGTGGTGACCCTGCCGC AATAGCGT TGGAGA RHLAFRHGTVPVRFS TCAGGGACTTAAACTGCCCCCTATGTGGCGGGTCGGCGAGCACCGCGG GACTAACG TCCTTTG CESCGKTSPGCHSVL TGAAAGTGCAAAGACACTTGGCATTTCGCCACGGAACAGTGCCGGTTA ACAATGAG GGGGGG CHIPKCRGPTGEPPE GATTCAGCTGTGAATCATGTGGAAAAACTTCTCCGGGTTGCCATTCCGTC TCAGATCC GTCGGG KVVKCEGCSRTFGTR CTCTGTCACATTCCGAAATGTCGCGGACCGACAGGCGAGCCGCCTGAGA ATGACCCT CTAAGTT RACSIHEMHVHSEIR AAGTGGTTAAGTGCGAGGGATGCAGTAGGACGTTTGGCACAAGGAGA TGGAGTG CCCCTCT NRKRIAQDRQEKGTS GCGTGTAGTATACATGAGATGCACGTTCACTCAGAAATCCGCAATAGGA GGTTAACC CGGGTC TDGEGRAGVERADA AAAGAATTGCTCAAGACAGGCAAGAAAAAGGGACCTCGACAGATGGA TCCGCCTC CTCCCAC GEGPSGEGIPPKRPR GAGGGGAGAGCTGGAGTCGAAAGGGCTGACGCTGGGGAAGGTCCCTC TTTAAAAA GGTGAC RARTPREPSEPPANP TGGGGAAGGGATCCCCCCTAAACGTCCCAGACGTGCGAGAACGCCCAG C(SEQID GCTCTAC PILSPQPDLPPGGLRD AGAACCGTCTGAGCCCCCCGCGAATCCGCCGATTCTCTCGCCACAACCC NO:1143) CCCTCCC LLREVASGWVRAAR GATCTGCCCCCAGGAGGCCTCCGGGACCTACTCCGGGAGGTGGCCAGT TCCTCGC DGGTVIDSVLAAWL GGGTGGGTAAGGGCAGCGAGAGACGGAGGTACGGTGATTGACAGCGT TCGTAG DGNDRLPELVDAAT GCTCGCAGCATGGTTGGATGGCAACGATCGGCTCCCTGAGCTGGTTGAC AACCCA QRTLQGLPAGRLARR GCGGCGACGCAAAGGACACTGCAGGGCTTACCTGCAGGGAGGTTGGCC ACGGTG PATFVAPNRRRGRW CGAAGACCCGCAACTTTTGTTGCGCCTAACCGGAGGAGAGGCAGGTGG AACACG GRRLKLLAKRRAYHD GGGCGCCGGCTCAAACTGCTCGCTAAGCGCCGCGCCTACCACGATTGCC GTTGGC CQIRFRKDPARLAANI AAATTCGGTTCCGAAAAGACCCAGCCCGCCTAGCCGCGAACATCCTAGA AGGATG LDGKSETSCPINEQAI CGGCAAAAGCGAAACAAGTTGCCCAATCAATGAGCAAGCGATTCATGA AAGTGA HEHFRNKWANPSPF GCACTTTCGAAACAAATGGGCAAATCCAAGTCCATTTGGTGGGCTGGGA CGTGAG GGLGRFGTENRANN CGATTTGGGACGGAAAACAGGGCCAACAACGCCCACCTCCTCGGGCCA GGGTAA AHLLGPISKSEVQTSL ATCTCCAAAAGCGAGGTCCAAACTAGCCTCCGAAATGCATCGAACGCCT GACATG RNASNASTPGPDGV CCACACCAGGCCCAGACGGCGTTGGGAAAAGGGACATTTCCAACTGGG CGTACG GKRDISNWDPECETL ATCCTGAGTGTGAGACCCTCACTCAGCTGTTTAACATGTGGTGGTTCACA TGAGCG TQLFNMWWFTGVIP GGTGTCATCCCCTCTCGCTTGAAGAAAAGTCGTACGGTGCTTCTGCCCA CGCATTT SRLKKSRTVLLPKSSD AGTCCTCAGACCCAGGAGCGGAGATGGAGATCGGCAACTGGAGACCAA TTGCTGT PGAEMEIGNWRPITI TCACCATCGGGTCGATGGTCTTGCGGCTTTTCACAAGGGTGATCAATAC TCTCTG GSMVLRLFTRVINTR GAGATTAACGGAAGCCTGTCCGTTGCACCCAAGACAGAGAGGGTTTCG GACTGG LTEACPLHPRQRGFR ACGAAGCCCCGGGTGTTCGGAGAACCTGGAAGTACTCGAATGTCTCCTC GTTTCGT RSPGCSENLEVLECLL CGACACTCCAAAGAAAAGCGCAGCCAACTGGCAGTGGTATTCGTCGATT CCCCCTC RHSKEKRSQLAVVFV TTGCACAAGCGTTTGACACCGTCTCTCATGAACACATGCTGTCAGTCCTT ACAACC DFAQAFDTVSHEHM GAGCAGATGAACGTGGATCCCCACATGGTAAATCTGATCCGGGAGATTT ATCACTT LSVLEQMNVDPHMV ACACAAACAGCTGCACAAGTGTCGAGCTAGGCCGGAAAGAGGGACCAG ACACTAT NLIREIYTNSCTSVELG ACATCCCAGTGAGGGTTGGTGTTAAGCAAGGGGATCCTCTGTCCCCGCT AGGGGC RKEGPDIPVRVGVKQ GCTTTTCAACCTGGCTTTGGATCCTCTCATCCAAAGTCTCGAACGCACAG ACAGCG GDPLSPLLFNLALDPL GCAAAGGGTGTGAGGCCGAAGGTCACAAAGTGACAGCTTTAGCGTTCG GCTCCTA IQSLERTGKGCEAEG CGGATGACCTGGCACTGGTTGCGGGCTCGTGGGAGGGAATGGCACACA CCTCCCT HKVTALAFADDLALV ACCTTGCGCTTGTAGACGAATTCTGCCTAACCACCGGCCTCACAGTCCAA CCCTATG AGSWEGMAHNLAL CCCAAAAAGTGCCACAGTTTCATGGTCAGGCCCTGCAGAGGTGCCTTCA ACCCCCC VDEFCLTTGLTVQPK CAGTGAACGACTGCCCCCCATGGGTTCTGGGGGGCAAGGCCCTGCAGC CTTCCCA KCHSFMVRPCRGAF TAACAAACATCGAAAACTCCATCAAATATCTGGGAGTAAAAGTCAATCC TACCGA TVNDCPPWVLGGKA TTGGGCGGGGATTGAAAAGCCTGACCTTACAGTGGCACTAGACCGATG TCCATG LQLTNIENSIKYLGVK GTGCAAGCGCATTGGGAAGTCACTGCTCAAACCCTCACAGAAGGTATAC GCTGTT VNPWAGIEKPDLTVA ATTCTCAATCAGTTTGCCATCCCGCGACTCTTCTACCTGGCTGATCACGG CTAGTCT LDRWCKRIGKSLLKPS TGGGGCCGGCGACGTCATGCTCCAGAACCTGGATGGGACAATCAGGAA GGACCG QKVYILNQFAIPRLFY GGCGGTGAAGAAATGGCTGCATCTTCCACCGTCAACCTGCAACGGGCT AGGGTC LADHGGAGDVMLQ GTTGTATGCCAGGAACTGTAATGGTGGCCTCGGTATATGCAAGCTCACT GGACGG NLDGTIRKAVKKWLH CGGCACATCCCATCAATGCAGGCGAGACGAATGTTCCGCTTGGCCAACT GGCATT LPPSTCNGLLYARNC CATCGGACCCGTTGATGAAGGCCATGATGCGCGGCTCCCGAGTCGAAC TGAAGG NGGLGICKLTRHIPS AGAAATTCAAAAAGGCCTGGATGCGGGCCGGGGGAGAGGAGAGTGCG TAGCTG MQARRMFRLANSSD CTCCCACGGGTGTTCGGGGCGAATCAGTACCAGGAAGGGGAGGAGGT GAATCC PLMKAMMRGSRVE CGCTAACGATCTGGTACCTCGCTGCCCAATGCCGAGCGATTGGAGACTG TCCGCT QKFKKAWMRAGGE GAAGAATTCCAACACTGGATGGGCCTGCCGATCCAGGGTGTGGGTATA GCTGCG ESALPRVFGANQYQE GCCGGCTTCTTCAGAAACAGGGTGGCTAACGGATGGCTCAGGAAGCCG AGCCTG GEEVANDLVPRCPM GCAGGGTTCAAAGAGCGGCACTACATCGCCGCTCTACAACTGCGAGCAT AGGTCG PSDWRLEEFQHWM GTGTATACCCCACCCTCGAATTCCAGCAAAGGGGCAGGAGCAAAGCGG ATGGTT GLPIQGVGIAGFFRN GTGCGGCCTGCAGGCGGTGCTCATCCCGGTTGGAATCCAGCTCTCACAT AGAGGT RVANGWLRKPAGFK CCTCGGCAAATGTCCGGCGGTGCAGGGAGCCAGAATCAGGCGTCATAA GAAATA ERHYIAALQLRACVYP CAAAATATGCGACCTCCTGAAGGCCGAAGCCGAAACCCGGGGTTGGGA CTTGGG TLEFQQRGRSKAGAA GGTACGCCGGGAATGGGCCTTCAGAACTCCGGCTGGGGAACTGAGAAG AGGAGA CRRCSSRLESSSHILG GCTCGACCTGGTACTCATCCTCGGGGATGAGGCATTGGTCATTGACGTC CACAGC KCPAVQGARIRRHNK ACAGTAAGGTACGAGTTCGCTCCGGATACCCTCCAGAATGCCGGAAAG CTCCGG ICDLLKAEAETRGWE GACAAGGTCAGCTACTACGGCCCGCACAAAGAAGCGATCGCTCGGGAG AGAGCC VRREWAFRTPAGELR CTGGGCGTAAGAAGGGTCGACATACATGGGTTTCCGTTGGGTGCACGC CCTCCCG RLDLVLILGDEALVID GGACTTTGGCTCGCCAGCAACTCCAAAGTGCTGGAACTGATGGGATTGA GGTGGT VTVRYEFAPDTLQNA GCAGGGAAAGAGTGAAGGTCTTCTCCAGACTCTTGAGTCGGAGAGTGC CATCAT GKDKVSYYGPHKEAI TCCTGTACTCTATCGACATCATGAGGACATTTTACGCAACCCTGCAATGA GGCAAC ARELGVRRVDIHGFP AAATCCCAGCGGGATACAGCAAGAAGGTATCGGATCTAATAAGGTTGA CGGGTG LGARGLWLASNSKVL GCGAGGAGAGGGTGGAGATCCTTTGGGGGGGGTCGGGCTAAGTTCCC AAACCTT ELMGLSRERVKVFSR CTCTCGGGTCCTCCCACGGTGACGCTCTACCCCTCCCTCCTCGCTCGTAG ACGGTT LLSRRVLLYSIDIMRTF AACCCAACGGTGAACACGGTTGGCAGGATGAAGTGACGTGAGGGGTA TCACTTA YATLQ(SEQIDNO: AGACATGCGTACGTGAGCGCGCATTTTTGCTGTTCTCTGGACTGGGTTTC CGAAAC 1388) GTCCCCCTCACAACCATCACTTACACTATAGGGGCACAGCGGCTCCTACC AGCACC TCCCTCCCTATGACCCCCCCTTCCCATACCGATCCATGGCTGTTCTAGTCT ATAACA GGACCGAGGGTCGGACGGGGCATTTGAAGGTAGCTGGAATCCTCCGCT GCGCCG GCTGCGAGCCTGAGGTCGATGGTTAGAGGTGAAATACTTGGGAGGAGA TAATAG CACAGCCTCCGGAGAGCCCCTCCCGGGTGGTCATCATGGCAACCGGGT CGCACC GAAACCTTACGGTTTCACTTACGAAACAGCACCATAACAGCGCCGTAAT GGTGTG AGCGCACCGGTGTGACTACTGTCCAGTGCTGATATTCTCATCTGGAGAA ACTACT TACAACACGGGTAATGGCAGAGTATTCAAAACCCAAATGTTTACGATCG GTCCAG ACCAACGGAGTCGTTCCCTTGCATCTAGGCCGGACCCGAAACTGCCGTA TGCTGA ATTGCCCGTCCCCAAGGTAGCCTCTTAGAAAACCGAAGCCCGGTCGGGG TATTCTC CGGTGGTTGCGGCGGCGCTGCGGGGGCCTGCTGCTCGGGCGGCGTCG ATCTGG GTGTGCCGCGGTGGTTGCGGTGGTGCGGCGGGGATCTCGGTCCTTGCG AGAATA GTGCCGCTGTGCCGCCGCGGTCGCGTCGGTGGCGCTGGGGTGGTGGCC CAACAC CGAGTGGCGTCGGCGTGCCACTGCCCATAGTCGCCCGCGGGGGCGACC GGGTAA GATCTGGAGGGGCGAGGGGGCTCGCGGGACTTTAACGAGAAACGGAA TGGCAG CGCAACTTCTCGCATCGCTCCCGGGACTTTCCCCCCTCGTTCAGCCGAGG AGTATTC GATGCCAAAAGGCATGAAAGGTAAGTACCATACCGGTCCGCAAAACTCT AAAACC CTTCTGACTCGGTTCTCTGTTGGTTTTCTAGAGTAACAACGAGGTGGAG CAAATG GAGAGGGACATGGCAGGGACTCCCATTCGTGCCAGCGGGTGGGGACA TTTACGA GATCGAAGGAACGGTTCGAGGGCGTAACAGACGAGAGGGAATCCGGT TCGACC CACACATTGATGCCATGCCTAAATAGGCGAGGTTTGTATTTCTACTTTGT AACGGA GGGTTCAGTATAGTCGGAGCATATGGTCGGTTGTCCCGTTGTTTTCACG GTCGTT GCGGGCAAGCGACTATCATGATAAAGTAGAATGGGAGACGGGCTCCCT CCCTTGC GACAAACCCGGAAAGGCGCCCCCCCGTGGTTCGTAGCAGCTGACGGAT ATCTAG CACGCTCGAAGAAAAATGAGTGAGAGGGGACGCCGCAACCAC(SEQID GCCGGA NO:1542) CCCGAA ACTGCC GTAATT GCCCGT CCCCAA GGTAGC CTCTTAG AAAACC GAAGCC CGGTCG GGGCGG TGGTTG CGGCGG CGCTGC GGGGGC CTGCTG CTCGGG CGGCGT CGGTGT GCCGCG GTGGTT GCGGTG GTGCGG CGGGGA TCTCGG TCCTTGC GGTGCC GCTGTG CCGCCG CGGTCG CGTCGG TGGCGC TGGGGT GGTGGC CCGAGT GGCGTC GGCGTG CCACTG CCCATA GTCGCC CGCGGG GGCGAC CGATCT GGAGGG GCGAGG GGGCTC GCGGGA CTTTAAC GAGAAA CGGAAC GCAACT TCTCGCA TCGCTCC CGGGAC TTTCCCC CCTCGTT CAGCCG AGGGAT GCCAAA AGGCAT GAAAGG TAAGTA CCATACC GGTCCG CAAAAC TCTCTTC TGACTC GGTTCT CTGTTG GTTTTCT AGAGTA ACAACG AGGTGG AGGAGA GGGACA TGGCAG GGACTC CCATTCG TGCCAG CGGGTG GGGACA GATCGA AGGAAC GGTTCG AGGGCG TAACAG ACGAGA GGGAAT CCGGTC ACACATT GATGCC ATGCCT AAATAG GCGAGG TTTGTAT TTCTACT TTGTGG GTTCAG TATAGTC GGAGCA TATGGT CGGTTG TCCCGTT GTTTTCA CGGCGG GCAAGC GACTAT CATGAT AAAGTA GAATGG GAGACG GGCTCC CTGACA AACCCG GAAAGG CGCCCC CCCGTG GTTCGT AGCAGC TGACGG ATCACG CTCGAA GAAAAA TGAGTG AGAGGG GACGCC GCAACC AC(SEQ IDNO: 1266) R2 R2- Gasterosteus CATATTGGGGTCTCAGGAGGAGACACAGGGTCTGTTGCGGCTCCGGTA CATATTGG GGAGGG MLRGGVGTPPAGGA 1_GA aculeatus AACGGTACCGGAGTCGGTTAAGCATCGTTTGGGCCCGCCTCCACGTGGT GGTCTCAG GAGTAG GAVGPGMASPGGCS GGTCCGCGGTAACACCAATAGGGTGGCTAAGAGGCCCAGTAATTTCCCC GAGGAGA GTCTCTA VRFSPGGRRLLGHRT GAATTGTCTTCCCCCCCGCGCGGGGGGGACCCCCCTTTAGTGTCGGAGC CACAGGGT CTCTGAC GGLSPSVSWRLKRLS GGTCGCGCCTCCGCGTTTGGGGTGTCGCAGGCGTGAGCCTTCGTCCCCT CTGTTGCG CCGAAG VSLRRWSGPGLLGA TAAGTTCAGACGGTCCCGGCTTCTTGCCGGGCCAACCCCCGGTGCAGCG GCTCCGGT GGCCCC DGAGGGAAVASPRG TTCTCCCATGTTGGATCGGCACCCAGCCCCGGGTGCCATGCGAGTTCAG AAACGGTA CCCGTTT TQVLGSGAGRRWLG ACATTTTGTTTATGTATCGTCTGCGTGGTTGACTTGCTAAGCTCATTTCCT CCGGAGTC CAGACC HGSRGSSPSAARGLR CCTCTCACTGCGTCCCCCCAGGTGCTGATCGGTTGAAGAGGATTCGTCG GGTTAAGC TGATTCT RLTVRLKRLSGGLLSP TTGACCTCGGCGGTGAATTTGGGATTGTATTATACAGGTAGGTATAGAG ATCGTTTG AGGCTA KACRDAEEGSSSSPG GGCGTGCGGATGTTGCGTGGCGGTGTTGGTACTCCCCCGGCTGGGGGA GGCCCGCC CCTGTG FRNPKGLGGRGLTPL GCGGGTGCGGTGGGGCCAGGCATGGCCTCGCCGGGTGGTTGCAGTGTC TCCACGTG CCTAATT GSRRFCRLTVSLNRW CGGTTCAGTCCCGGAGGGAGGCGACTGCTTGGCCACAGGACTGGAGG GTGGTCCG GGGGGG RGSLVKLNASSRASG GTTGAGTCCCTCCGTGTCCTGGAGGCTCAAGCGACTGTCTGTCTCTCTGA CGGTAACA GTCCCA RRTPVKPACDSRAGR GGCGCTGGAGCGGGCCTGGGCTGCTAGGTGCGGATGGTGCGGGGGGA CCAATAGG AAGAGA GSEHAEGGGVSAAP GGCGCTGCGGTGGCCTCCCCCAGGGGTACGCAGGTCCTGGGAAGTGGG GTGGCTAA TGTTGTC MVLRSRRKLTFSVDG GCCGGGCGTCGGTGGCTTGGGCACGGGTCGCGAGGGTCTTCTCCTTCT GAGGCCCA TGTTGTA DSNSGDRARSGSVSA GCGGCCCGGGGGCTAAGGCGGCTGACGGTACGGTTGAAGCGACTCAG GTAATTTC GAAGGG ARPGHLLVDGESASS CGGTGGCCTGTTGTCCCCTAAGGCGTGTCGGGATGCGGAAGAAGGAAG CCCGAATT TTTGCG RSGPAGDARLAGPST CTCCAGCAGCCCAGGGTTCCGGAATCCAAAAGGTCTCGGGGGAAGGGG GTCTTCCC CCACTG RSRRKGCLPPVDFEN GTTGACGCCTCTCGGATCCCGTAGATTTTGTCGGCTGACCGTCTCCCTGA CCCCGCGC ACTGCA PKKRTRLMAKMTNG ATCGCTGGAGGGGCAGTCTGGTGAAGTTGAACGCTAGTAGCAGGGCCT GGGGGGG CGGAAG NPTSHVPCPAPCSNG CCGGCCGGAGGACCCCTGTGAAACCCGCTTGTGACTCTAGAGCCGGAC ACCCCCCT GGTGGG HEGGGRVAVIEGRLP GGGGCTCGGAGCATGCGGAGGGAGGTGGAGTGAGCGCTGCACCTATG TTAGTGTC CCTCGA ELSGSRISGIQPALPV GTGTTGCGCAGTCGGCGTAAGCTCACCTTCTCTGTGGATGGCGACTCTA GGAGCGG CAGGTA ETSFVGQSTGRGAD ACTCCGGGGATAGGGCCCGGAGCGGGTCCGTCTCTGCAGCCCGTCCTG TCGCGCCT GGGGTT GDANANSSPPSPNL GCCACTTGTTGGTGGATGGTGAGAGTGCGTCCTCAAGATCTGGCCCCGC CCGCGTTT ACATGA GGSVGMVPAVRDGT GGGGGATGCCAGGTTGGCGGGGCCTTCTACGCGGAGTAGGAGGAAGG GGGGTGT CTCCGT PPLGRPGEDHSRECA GTTGCCTTCCCCCGGTCGACTTTGAAAACCCGAAGAAGCGCACACGGTT CGCAGGC GCTGCT GGNTPLWMLEDSFR GATGGCTAAGATGACGAATGGTAATCCTACCTCGCACGTCCCTTGCCCT GTGAGCCT CAGCAG CDYCPREFGTRAGRS GCCCCGTGCTCAAATGGGCATGAAGGAGGTGGGCGAGTTGCGGTGATC TCGTCCCC ACCCGC LHMRRAHLAEYDGA GAGGGGCGGCTGCCGGAGTTAAGCGGTAGTAGGATCTCTGGAATACAG TTAAGTTC GCCTCT GFCWGERLSEFAATR CCAGCCCTGCCTGTTGAAACCAGCTTTGTCGGCCAATCGACTGGCCGGG AGACGGTC GAGACC LWSTEETKKLAVFCE GCGCGGACGGCGATGCGAATGCGAATAGTAGCCCGCCTTCTCCTAATCT CCGGCTTC GGGTAG RGVPSPSECRAIAASL GGGCGGCTCGGTTGGGATGGTGCCTGCCGTGCGTGATGGTACCCCGCC TTGCCGGG GGCTAC GAGKTHHQVRSKCR GCTTGGGCGTCCAGGAGAGGATCACTCGCGGGAGTGTGCAGGGGGAA CCAACCCC TTGAAC LVFEAIRRRELLEVAA ATACTCCCCTCTGGATGCTGGAGGACAGTTTCCGGTGTGACTACTGTCCT CGGTGCAG AAGCGA ATERLEKSARRKQPA AGGGAATTCGGCACAAGAGCGGGGCGCTCGTTGCACATGCGCAGGGCT CGTTCTCC CGCCCT VPPAPVHGVRGVLR CACCTGGCCGAGTACGACGGGGCAGGTTTCTGTTGGGGTGAACGTCTC CATGTTGG GGTGTA GLLGKRVPREGGTTG AGTGAATTCGCCGCTACGCGCCTCTGGTCGACGGAGGAAACCAAAAAG ATCGGCAC TGTCCG STSARIVRRDDCRQG CTGGCCGTGTTTTGTGAGAGGGGTGTGCCCTCACCGTCGGAATGCAGA CCAGCCCC TATCCTA AVASASLNLIRRLGRK GCCATTGCAGCCTCTCTGGGCGCAGGAAAAACACATCATCAGGTTAGAT GGGTGCCA ACCTGG ATGRSGRRRVLGRPP CGAAGTGTCGACTGGTGTTCGAGGCCATTCGGCGGCGTGAATTGCTTGA TGCGAGTT TTTGGG RMDVRRSVRMRRM GGTGGCTGCTGCCACGGAGCGTTTGGAGAAAAGCGCTAGGCGGAAGC CAGACATT AAAGCC RRFLYRLARLGWAKL AGCCCGCCGTACCACCGGCACCCGTACACGGAGTGAGAGGGGTCCTGC TTGTTTAT GATACC AMFVLDGQMGASC GGGGCCTACTAGGGAAGCGGGTGCCGAGAGAGGGTGGTACCACAGGC GTATCGTC GGCAAT PVPLVEVSAVFRERW AGCACCTCAGCAAGGATCGTCAGGAGAGACGACTGCCGTCAGGGGGCA TGCGTGGT GCCCGC SIVRAFLGLGQFGGF GTTGCGTCGGCTTCTCTCAATCTGATCAGAAGGCTGGGTCGAAAGGCAA TGACTTGC CACAGG GTADNAGFGKLIDPA CGGGCCGCTCCGGCAGGAGACGGGTCCTTGGACGCCCACCCAGGATGG TAAGCTCA TGTCGC EVRAHLQSIKNRSSP ATGTAAGGCGTAGCGTGAGGATGAGGAGGATGCGCAGGTTCCTCTATC TTTCCTCCT GCACCC GPDGITKVALSKWDP GGTTGGCCCGGCTGGGCTGGGCCAAGTTGGCTATGTTTGTCCTGGACG CTCACTGC CACGGG EGIKLAHMYSTWLVS GACAGATGGGGGCGAGCTGCCCCGTTCCACTCGTCGAAGTGTCGGCGG GTCCCCCC ATGACG AGIPKVFKKCRTTLIP TCTTCCGGGAGAGGTGGAGCATAGTCAGAGCCTTCCTGGGTCTGGGTC AGGTGCTG TATGGG KTGDVSLHGDVGQ AGTTCGGGGGCTTCGGGACTGCCGACAACGCAGGATTTGGGAAGCTGA ATCGGTTG CCCCGG WRPITIASLVLRLYSRI TCGATCCGGCTGAAGTCAGGGCCCATCTCCAGTCCATCAAGAACCGGTC AAGAGGA GGGACC LTERMTVACPSHPRQ TTCCCCGGGCCCGGATGGCATCACCAAGGTGGCGCTGTCCAAATGGGA TTCGTCGT TCATGG RGFIASPGCSENLML CCCCGAAGGGATTAAATTGGCGCACATGTACTCAACATGGTTGGTATCG TGACCTCG ATACTCC LEGCMSLSKAGNGSL GCAGGCATCCCTAAGGTCTTCAAGAAGTGCAGGACGACACTTATCCCAA GCGGTGA ACTGGA AVVFVDFAKAFDTVS AGACCGGGGACGTTAGTCTACATGGTGACGTGGGGCAATGGAGGCCCA ATTTGGGA CTTGCAC HEHLLSVLVQKGLDQ TAACCATTGCGTCCCTGGTCCTGAGACTCTATTCGCGGATCCTGACGGA TTGTATTA AATCCT HMVELIKDSYENSVT AAGGATGACAGTGGCCTGTCCTAGCCACCCGCGCCAGAGGGGCTTCATT TACAGGTA GGTGTA KVHCQEGCSTDIAM GCCTCCCCGGGCTGTTCGGAAAACCTCATGCTGTTGGAAGGTTGCATGA GGTATAGA CTGGAT KVGVKQGDSMSPLL GTCTCAGCAAGGCAGGAAATGGCTCCCTCGCGGTTGTGTTCGTCGACTT GGGCGTG GCAGCG FNLALDPLIQQLEREG TGCGAAGGCCTTCGATACCGTCTCCCACGAGCACCTCCTGAGTGTTCTG CGG(SEQ ACGTTG RGFPVNGKSITAMAF GTGCAGAAAGGCTTGGACCAACACATGGTGGAGTTGATCAAGGACTCC IDNO: GTGACA ADDLAIVSDSWEGM TACGAGAACAGCGTGACCAAGGTGCACTGTCAGGAGGGTTGTTCCACT 1144) TAAGCA RANLDILVDFCELTG GACATCGCCATGAAGGTGGGAGTGAAGCAGGGTGACTCCATGTCCCCT ATCGCT MRTQPSKCHGFLIEK CTCCTCTTTAACCTGGCGCTGGATCCGCTTATCCAGCAACTTGAACGCGA AAGTCG SGSRSYKVNRCEPWL GGGCCGGGGCTTCCCAGTAAATGGGAAGTCCATTACTGCGATGGCATTT GGGTAG LNDTALHMVGPKESI GCGGATGACTTGGCCATAGTGAGTGACTCTTGGGAAGGCATGAGAGCC GGGAGG KYLGVQVNPWTGIF AACCTTGATATCCTGGTGGACTTCTGCGAGCTTACTGGAATGCGAACCC TGGGGA AEDTVAKLRQWVVA AGCCCAGTAAGTGCCACGGGTTCCTGATTGAGAAGAGTGGCAGCAGGT CCTCGG ISKTPLRPLDKVSLLC CGTACAAAGTGAACAGGTGCGAACCGTGGCTGCTGAACGACACAGCTC CACGGC QFAVPRVIFVADHC TTCACATGGTCGGGCCTAAGGAATCAATCAAGTACCTGGGCGTCCAGGT TGTAGG MLSAKALTEMDRSIR GAACCCGTGGACAGGGATCTTCGCTGAGGATACGGTTGCCAAACTACG AACGGG QAVKRWLHLARCTT ACAGTGGGTAGTTGCAATCTCCAAGACGCCTCTACGTCCGCTTGACAAG TGTATG NGLLYSRKSSGGLGIP GTGTCCCTGTTGTGCCAGTTTGCCGTACCGAGGGTCATCTTCGTGGCTG GGCTCC KLSMIVPAMQARRLL ATCACTGCATGCTATCTGCGAAGGCCCTGACAGAAATGGATAGGAGCAT GGCAGC GLSRSKDETVRWMF AAGACAAGCAGTGAAGAGGTGGTTGCACCTGGCCAGGTGTACCACGAA CGTCGT LETTDHVAFERAWLR CGGCCTCCTCTACTCAAGGAAATCCAGCGGTGGTCTGGGTATCCCAAAA CACTCCC AGGSPDEVPELGPDL TTGTCGATGATTGTTCCGGCCATGCAGGCCAGGAGACTCCTGGGCCTGT ATACAA VEGSPAEGNADPVST CCCGTTCTAAGGACGAGACGGTCAGGTGGATGTTTCTGGAGACAACTG CACAGG VRPRKRIVPCDWRQ ATCACGTGGCGTTTGAGAGGGCATGGCTGAGGGCTGGAGGGTCGCCA GGCTGC VEFDRWAGQLVQGK GATGAGGTACCGGAGCTGGGTCCGGATCTGGTGGAGGGCTCCCCTGCG ATCCTG GIRTFEADKISNCWLY GAGGGGAACGCTGACCCTGTCAGCACGGTGAGGCCAAGGAAGCGCAT GTGGCC DYPPNKLKPGDFTAA AGTCCCGTGTGACTGGCGTCAAGTCGAGTTCGACAGATGGGCCGGTCA GGTGCT VQLRANVYPTRELAG ATTGGTGCAGGGAAAAGGGATTCGGACGTTCGAAGCGGACAAGATCA AGTTGG RGRTDTIDVCCRHCG GCAACTGCTGGTTGTACGACTACCCGCCAAACAAGCTGAAGCCTGGGG TTCTGG EAPETCWHILALCPK ATTTTACGGCGGCTGTCCAGCTTAGAGCGAACGTTTACCCGACCCGGGA AAGCCC VKRCRIQRHHKVCQV GCTAGCGGGTCGCGGAAGGACCGATACGATAGATGTCTGTTGTCGACA GCCCGG LVAEAERHGWEVER CTGTGGGGAGGCCCCAGAGACTTGCTGGCACATCCTTGCGCTCTGCCCG GCTGGT EKRWMLPSGECVAP AAGGTTAAGCGGTGCCGTATTCAGAGGCACCACAAGGTGTGCCAGGTC TCGCAG DLICWLDELALIVDVT CTCGTCGCGGAGGCTGAGCGCCATGGATGGGAAGTGGAAAGGGAAAA AAGCAG VRYEFDEESLERARIE GCGCTGGATGCTGCCCTCCGGGGAGTGTGTCGCGCCGGACCTGATCTG GGTGCG KECKYRPLIPVIRASR CTGGTTGGATGAGCTGGCGCTCATTGTCGATGTGACGGTGAGGTACGA CCCAGG VQTKKVTVYGFPLGA GTTCGATGAGGAGTCGCTAGAACGCGCGCGAATCGAGAAGGAATGCAA GTAGGT RGKWPAKNELLLADL GTACCGCCCTCTCATTCCAGTGATCAGGGCGAGCAGAGTTCAGACGAAG TTGGTAT GLSKARTRSFAKLLSR AAGGTGACGGTCTATGGCTTCCCTCTGGGAGCCAGGGGAAAGTGGCCT ATCTGG RVLLHSLDVMRTFM GCTAAGAACGAGCTGCTGCTCGCCGACCTCGGCCTGAGCAAGGCTCGG GTCCGG R(SEQIDNO:1389) ACTCGGAGTTTTGCTAAACTCCTGAGCCGCAGAGTTCTCTTACATTCTCT TGCGAT GGATGTTATGAGGACGTTTATGCGTTAAGGAGGGGAGTAGGTCTCTAC ACCTATC TCTGACCCGAAGGGCCCCCCCGTTTCAGACCTGATTCTAGGCTACCTGTG GATGGG CCTAATTGGGGGGGTCCCAAAGAGATGTTGTCTGTTGTAGAAGGGTTTG CAGCGA CGCCACTGACTGCACGGAAGGGTGGGCCTCGACAGGTAGGGGTTACAT GGGCCG GACTCCGTGCTGCTCAGCAGACCCGCGCCTCTGAGACCGGGTAGGGCT CCTCGT ACTTGAACAAGCGACGCCCTGGTGTATGTCCGTATCCTAACCTGGTTTG GACGCG GGAAAGCCGATACCGGCAATGCCCGCCACAGGTGTCGCGCACCCCACG CTGTGT GGATGACGTATGGGCCCCGGGGGACCTCATGGATACTCCACTGGACTT GGAGCT GCACAATCCTGGTGTACTGGATGCAGCGACGTTGGTGACATAAGCAATC GGAGCC GCTAAGTCGGGGTAGGGGAGGTGGGGACCTCGGCACGGCTGTAGGAA GGCCTG CGGGTGTATGGGCTCCGGCAGCCGTCGTCACTCCCATACAACACAGGG GGTATG GCTGCATCCTGGTGGCCGGTGCTAGTTGGTTCTGGAAGCCCGCCCGGGC AACAGT TGGTTCGCAGAAGCAGGGTGCGCCCAGGGTAGGTTTGGTATATCTGGG TCTTGC TCCGGTGCGATACCTATCGATGGGCAGCGAGGGCCGCCTCGTGACGCG GGATGT CTGTGTGGAGCTGGAGCCGGCCTGGGTATGAACAGTTCTTGCGGATGT GGCGTA GGCGTAGCTAGATAGTACCCGTGGTTGTGGGCGTGGTGTCGACCAAAT GCTAGA GTTGTCCTGTGTGCACATAGGCCAAGGGTTACGTGGGTGGCAGTCAGA TAGTAC AGCACCCGCACCTGGAAGTGATTGCCCCGGGATCCCGGCTCTCTGTGAA CCGTGG GAGCTACCTTGAGGAAAGGTGTTCCGCTGGAACTCAAGACCCTACAGTA TTGTGG GGGGATATCAACTGGCTTTGAGGTGCTGTGATTCCGGAACCAGGGCGA GCGTGG GGGCGAGTACTTAGAGCATGTCCAAAAGCCCGGGGAACGTTCCGGGGG TGTCGA CCTGCTTGGGTCGTTGGACCCACATCCGTAAAACGATGGATCTCGCGTC CCAAAT GGCGCTCGGGAGAACTTCCCGCATGAACGCTGATTGCATGTGAGAACG GTTGTC CCCCCACGGCGGCGGGGCAGGCGCTCCCCCTGGGTGTAAGGCTCGGGG CTGTGT GGGTCACGGCTCCGCTCTAAAAG(SEQIDNO:1543) GCACAT AGGCCA AGGGTT ACGTGG GTGGCA GTCAGA AGCACC CGCACC TGGAAG TGATTG CCCCGG GATCCC GGCTCT CTGTGA AGAGCT ACCTTG AGGAAA GGTGTT CCGCTG GAACTC AAGACC CTACAG TAGGGG ATATCA ACTGGC TTTGAG GTGCTG TGATTCC GGAACC AGGGCG AGGGCG AGTACTT AGAGCA TGTCCA AAAGCC CGGGGA ACGTTCC GGGGGC CTGCTT GGGTCG TTGGAC CCACATC CGTAAA ACGATG GATCTC GCGTCG GCGCTC GGGAGA ACTTCCC GCATGA ACGCTG ATTGCAT GTGAGA ACGCCC CCACGG CGGCGG GGCAGG CGCTCCC CCTGGG TGTAAG GCTCGG GGGGGT CACGGC TCCGCTC TAAAAG (SEQID NO: 1267) R2 R2_BM AB076841 Bombyx GGGCGATACGCATAATTTTAATTTCCCGATTGAAATCCAGTCGTCTTAAT GGGCGAT GCCTTG MMASTALSLMGRC mori CTGGTGACCAGTGGCGCGGTCACCAGTATAGTGCACAGGACGTGAATG ACGCATAA CACAGT NPDGCTRGKHVTAA GCTCCGAGGCTGGCGGAGTCACTCACTATAAGTGTGAGAGACGATGTC TTTTAATTT AGTCCA PMDGPRGPSSLAGT CTGTGCCAAGTATACGTCCAACCCTAACGGGTTAAGTGAAATTAGTTGC CCCGATTG GCGGTA FGWGLAIPAGEPCG TCATAACAGGGACGGTGTACCTGTTTGCTCGTGGCTGGCTATCGAATGG AAATCCAG AGGGTG RVCSPATVGFFPVAK ACGGGACCAATACACCCCCCTGTTAGTAATGGGGTAAGAGAGAGCGGT TCGTCTTA TAGATC KSNKENRPEASGLPL CTGAAACTATGGCCGAAATCACGACGCCCCACTCCTACCCATAACCTGCA ATCTGGTG AGGCCC ESERTGDNPTVRGSA CGTGGTACCGCCGCACATTGACCGATACGGGAGGAGGGGCAGCACTTG ACCAGTGG GTCTGTT GADPVGQDAPGWT AATCACGTAGTCTTGGTGTAGCCATTGCGGGACTACAGCCCTCGTAAGT CGCGGTCA TCTTCCC CQFCERTFSTNRGLG GCCGCCTTAGAACGCAACGGGGCAATAGGTGGGCCGGGGCGCTAGCG CCAGTATA CGGAGC VHKRRAHPVETNTD GGGGGGAGTAATCTCCCCTGTTGGCGTGCACCGCACTGCTCCCACTGGG GTGCACAG TCGCTCC AAPMMVKRRWHGE GGCAGTGTCATCCGGAAACAGGTGGGCCGGGGCGCCACCAGGGGGGA GACGTGAA CTTGGC EIDLLARTEARLLAER GCAATCCCTCCTGATGATGGCGAGCACCGCACTGTCCCTTATGGGACGG TGGCTCCG TTCCCTT GQCSGGDLFGALPGF TGTAACCCGGATGGCTGTACACGTGGTAAACACGTGACAGCAGCCCCG AGGCTGGC ATATTTA GRTLEAIKGQRRREP ATGGACGGACCGCGAGGACCGTCAAGCCTAGCAGGTACCTTCGGGTGG GGAGTCAC ACATCA YRALVQAHLARFGSQ GGCCTTGCGATACCTGCGGGCGAACCCTGTGGTCGGGTTTGCAGCCCG TCACTATA GAAACA PGPSSGGCSAEPDFR GCCACAGTGGGTTTTTTTCCTGTTGCAAAAAAGTCAAATAAAGAAAATA AGTGTGAG GACATT RASGAEEAVEERCAE GACCTGAAGCCTCTGGCCTCCCGCTGGAGTCAGAGAGGACAGGCGATA AGACGATG AAACAT DAAAYDPSAVGQMS ACCCGACTGTGCGGGGTTCCGCCGGCGCAGATCCTGTGGGTCAGGATG TCCTGTGC CTACTG PDAARVLSELLEGAG CGCCTGGTTGGACCTGCCAGTTCTGCGAACGAACCTTTTCGACCAACAG CAAGTATA ATCCAAT RRRACRAMRPKTAG GGGTTTGGGTGTCCACAAGCGTAGAGCCCACCCTGTTGAGACCAATACG CGTCCAAC TTCGCC RRNDLHDDRTASAH GATGCCGCTCCGATGATGGTGAAGCGGCGGTGGCATGGCGAGGAAATC CCTAACGG GGCGTA KTSRQKRRAEYARVQ GACCTCCTCGCTCGCACCGAGGCCAGGTTGCTCGCTGAGCGGGGTCAGT GTTAAGTG CGGCCA ELYKKCRSRAAAEVID GCTCGGGTGGAGACCTCTTTGGCGCGCTTCCAGGGTTTGGAAGAACTCT AAATTAGT CGATCG GACGGVGHSLEEME GGAAGCGATTAAGGGACAACGGCGGAGGGAGCCTTATCGGGCATTGG TGCTCATA GGAGGG TYWRPILERVSDAPG TGCAAGCGCACCTTGCCCGATTTGGTTCCCAGCCGGGTCCCTCGTCGGG ACAGGGA TGGGAA PTPEALHALGRAEW GGGGTGCTCGGCCGAGCCTGACTTCCGGCGGGCTTCTGGAGCTGAGGA CGGTGTAC TCTCGG HGGNRDYTQLWKPI AGCGGTCGAGGAACGATGCGCCGAAGACGCCGCTGCCTATGATCCATC CTGTTTGC GGATCT SVEEIKASRFDWRTS CGCAGTCGGTCAGATGTCGCCCGATGCCGCTCGGGTTCTCTCCGAACTC TCGTGGCT TCCGATC PGPDGIRSGQWRAV CTTGAGGGTGCGGGGAGAAGACGAGCGTGCAGGGCTATGAGACCCAA GGCTATCG CTAATCC PVHLKAEMFNAWM GACTGCAGGGCGGCGAAACGATTTGCACGATGATCGGACAGCTAGTGC AATGGACG ATGATG ARGEIPEILRQCRTVF CCACAAAACCAGTAGACAAAAGCGCAGGGCAGAGTACGCGCGTGTGCA GGACCAAT ATTACG VPKVERPGGPGEYRP GGAACTGTACAAGAAGTGTCGCAGCAGAGCAGCAGCTGAGGTGATCGA ACACCCCC ACCTGA ILIASIPLRHFHSILARR TGGCGCGTGTGGGGGTGTCGGACACTCGCTCGAGGAGATGGAGACCTA CTGTTAGT GTCACT LLACCPPDARQRGFI TTGGCGACCTATCCTCGAGAGAGTGTCCGATGCACCTGGGCCTACACCG AATGGGGT AAAGAC CADGTLENSAVLDAV GAAGCTCTTCACGCCCTAGGGCGTGCGGAGTGGCACGGGGGCAATCGC AAGAGAG GATGGC LGDSRKKLRECHVAV GACTACACCCAGCTGTGGAAGCCGATCTCGGTGGAAGAGATCAAGGCC AGCGGTCT ATGATG LDFAKAFDTVSHEAL TCCCGCTTTGACTGGCGAACTTCGCCGGGCCCGGACGGTATACGTTCGG GAAACTAT ATCCGG VELLRLRGMPEQFCG GTCAGTGGCGTGCGGTTCCTGTGCACTTGAAGGCGGAAATGTTCAATGC GGCCGAA CGATGA YIAHLYDTASTTLAVN ATGGATGGCACGAGGCGAAATACCCGAAATTCTACGGCAGTGCCGAAC ATCACGAC AAA NEMSSPVKVGRGVR CGTCTTTGTACCTAAGGTGGAGAGACCAGGTGGACCGGGGGAATATCG GCCCCACT (SEQID QGDPLSPILFNVVMD ACCGATCTTGATCGCGTCGATTCCCCTGAGACACTTTCACTCCATCTTGG CCTACCCA NO: LILASLPERVGYRLEM CCCGGAGGCTGTTGGCTTGCTGCCCCCCTGATGCACGACAGCGCGGATT TAACCTGC 1268) ELVSALAYADDLVLLA TATCTGCGCCGACGGTACGCTGGAGAATTCCGCAGTACTGGACGCGGT ACGTGGTA GSKVGMQESISAVDC GCTTGGGGATAGCAGGAAGAAGCTGCGGGAATGTCACGTGGCGGTGC CCGCCGCA VGKQMGLRLNCRKS TAGACTTCGCCAAGGCATTTGACACAGTGTCTCACGAGGCACTTGTCGA CATTGACC AVLSMIPDGHRKKH ATTGCTGAGGTTGAGGGGCATGCCCGAACAGTTCTGCGGCTACATTGCT GATACGG HYLTERTFNIGGKPLR CACTTATACGATACGGCGTCCACCACCTTAGCCGTGAACAATGAAATGA GAGGAGG QVSCVERWRYLGVD GCAGCCCTGTGAAAGTGGGACGAGGGGTTCGTCAAGGGGACCCTCTGT GGCAGCAC FEASGCVTLEHSISSA CGCCGATACTCTTCAACGTGGTGATGGACCTCATCCTAGCTTCCCTGCCG TTGAATCA LNNISRAPLKPQQRL GAGAGGGTCGGGTATAGGTTGGAGATGGAACTTGTGTCCGCTCTGGCC CGTAGTCT EILRAHLIPRFQHGFV TATGCTGACGACCTAGTCCTGCTTGCGGGGTCGAAGGTAGGGATGCAG TGGTGTAG LGNISDDRLRMLDVQ GAGTCCATCTCTGCTGTGGACTGTGTTGGTAAGCAGATGGGCCTACGCC CCATTGCG IRKAVGQWLRLPAD TGAATTGCAGGAAGAGCGCGGTTCTGTCTATGATACCGGATGGCCACC GGACTACA VPKAYYHAAVQDGG GCAAGAAGCATCACTACCTGACTGAGCGAACCTTCAATATTGGAGGTAA GCCCTCGT LAIPSVRATIPDLIVRR GCCGCTCAGGCAGGTGAGTTGTGTTGAGCGGTGGCGATATCTTGGTGT AAGTGCCG FGGLDSSPWSVARA CGATTTTGAGGCCTCTGGATGCGTGACATTAGAGCATAGTATCAGTAGT CCTTAGAA AAKSDKIRKKLRWA GCTCTGAATAACATCTCAAGGGCACCTCTCAAACCCCAACAGAGGTTGG CGCAACGG WKQLRRFSRVDSTT AGATTTTGAGAGCTCATCTGATTCCGAGATTCCAGCACGGTTTTGTGCTT GGCAATAG QRPSVRLFWREHLH GGAAACATCTCGGATGACCGATTGAGAATGCTCGATGTCCAAATCCGGA GTGGGCC ASVDGRELRESTRTP AAGCAGTCGGACAGTGGCTAAGGCTACCGGCGGATGTGCCCAAGGCAT GGGGCGC TSTKWIRERCAQITG ACTATCACGCCGCAGTTCAGGACGGCGGCTTAGCGATCCCATCGGTGCG TAGCGGG RDFVQFVHTHINALP AGCGACCATCCCGGACCTCATTGTGAGGCGTTTCGGGGGGCTCGACTCG GGGGAGT SRIRGSRGRRGGGES TCACCATGGTCAGTGGCAAGAGCCGCCGCCAAATCTGATAAGATTCGTA AATCTCCC SLTCRAGCKVRETTA AGAAACTGCGGTGGGCCTGGAAACAGCTCCGCAGGTTCAGCCGTGTTG CTGTTGGC HILQQCHRTHGGRIL ACTCCACAACGCAACGACCATCTGTGCGCTTGTTTTGGCGAGAACATCT GTGCACCG RHNKIVSFVAKAMEE GCACGCATCTGTTGATGGACGCGAACTTCGCGAATCCACACGCACCCCG CACTGCTC NKWTVELEPRLRTSV ACATCCACAAAGTGGATTAGGGAGCGATGCGCGCAGATAACCGGACGG CCACTGGG GLRKPDIIASRDGVG GACTTCGTGCAGTTCGTGCACACTCATATCAACGCCCTCCCATCCCGCAT GGCAGTGT VIVDVQVVSGQRSLD TCGCGGATCGAGAGGGCGTAGAGGTGGGGGTGAGTCTTCGTTGACCTG CATCCGGA ELHREKRNKYGNHG CCGTGCTGGTTGCAAGGTTAGGGAGACGACGGCTCACATCCTACAACA AACAGGTG ELVELVAGRLGLPKAE GTGTCACAGAACACACGGCGGCCGGATTCTACGACACAACAAGATTGTA GGCCGGG CVRATSCTISWRGV TCTTTCGTGGCGAAAGCCATGGAAGAGAACAAGTGGACGGTTGAGCTG GCGCCACC WSLTSYKELRSIIGLRE GAGCCGAGGCTACGAACATCGGTTGGTCTCCGTAAGCCGGATATTATCG AGGGGGG PTLQIVPILALRGSHM CCTCCAGGGATGGTGTCGGAGTGATCGTGGACGTGCAGGTGGTCTCGG AGCAATCC NWTRFNQMTSVMG GCCAGCGATCGCTTGACGAGCTTCACCGTGAGAAACGTAATAAATACGG CTCCTG GGVG(SEQIDNO: GAATCACGGGGAGCTGGTTGAGTTGGTCGCAGGTAGACTAGGACTTCC (SEQID 1390) GAAAGCTGAGTGCGTGCGAGCCACTTCGTGCACGATATCTTGGAGGGG NO:1145) AGTATGGAGCCTGACTTCTTATAAGGAGTTAAGGTCCATAATCGGGCTT CGGGAACCGACACTACAAATCGTTCCGATACTGGCGTTGAGAGGTTCAC ACATGAACTGGACCAGGTTCAATCAGATGACGTCCGTCATGGGGGGCG GCGTTGGTTGAGCCTTGCACAGTAGTCCAGCGGTAAGGGTGTAGATCA GGCCCGTCTGTTTCTTCCCCGGAGCTCGCTCCCTTGGCTTCCCTTATATTT AACATCAGAAACAGACATTAAACATCTACTGATCCAATTTCGCCGGCGT ACGGCCACGATCGGGAGGGTGGGAATCTCGGGGATCTTCCGATCCTAA TCCATGATGATTACGACCTGAGTCACTAAAGACGATGGCATGATGATCC GGCGATGAAAA(SEQIDNO:1544) R2 R8Hm- Hydra TTCAAGTGGATGAAGCTGGGAAGGTAATCTGTAGTTGGTTGAGTTGGTT TTCAAGTG TAAATG MNLLIVTSSIKESDVP A vulgaris GCAGATTACTGCTGTCGATTTTGCTTTCTATTGAAAGCCTGTCTCTACGG GATGAAGC CCAAAA SSGKGGVAVNNITAG GTCCTGAAGCTTGAATTTTGGTAGCTATAGTTTTGTGGGAGGAAAGTGG TGGGAAG GTTGCTT ASGKDTCVIIHPGTD AATTTTGTACCATCTTTTGTCTCTCGTATCTACTATAGTAAATCCGGTCAT GTAATCTG GGGCTA GIWCCTECVEIHNSG GCAGCCTCTACGCGGCGCAACTAGAAACTTGGATCAGTGATCAAGGCTA TAGTTGGT AATGAT KDLKRHLAKRHPSVTI ATGCATGCCGGGTCTCCTCAGATTAGGAGTATAATACAAATCTGACTTC TGAGTTGG ACGTAC SGYKCNLCPFVSERQ ATCACTAAGAGGCTATGGGGCTAACGATCCTATAGTCTCGATGAACCTA TTGCAGAT GCTAGA LSVGTHLRYCRGVKE TTGATTGTTACTAGTAGCATAAAAGAAAGTGACGTACCCTCTAGTGGAA TACTGCTG AAAAGC VVKREFACASCSFSSD AGGGGGGTGTAGCAGTCAATAACATAACAGCAGGAGCTAGTGGAAAA TCGATTTT GACTTG TFSGLQVHMQRKHI GATACGTGCGTGATCATACACCCAGGTACCGATGGTATTTGGTGCTGTA GCTTTCTA CTGCAC AEWNDQLKEKTEFA CTGAGTGTGTAGAGATACATAACAGCGGTAAGGATCTGAAACGACATCT TTGAAAGC GGATGA WTDRELRELAEKELT TGCAAAACGTCACCCGAGTGTAACGATAAGCGGTTACAAATGCAATCTG CTGTCTCT CGGTTC TPSFRYNKIFYAALGT TGTCCATTTGTTAGTGAACGCCAACTAAGTGTGGGGACACATCTGAGGT ACGGGTCC ATCAGA SRTYDAVRKIRYNDR ACTGCAGAGGCGTAAAAGAAGTGGTTAAAAGAGAGTTTGCATGCGCGA TGAAGCTT GCCCGA YKSAIAEMRSQIADA GCTGCTCTTTTTCTTCGGATACGTTCTCAGGACTTCAGGTGCATATGCAA GAATTTTG TATGTG AAAAQERDVERGLV AGAAAGCATATAGCAGAATGGAACGACCAGCTGAAGGAGAAAACGGA GTAGCTAT CATGTC SAHSDRGKEMLPVV GTTTGCTTGGACAGACCGAGAATTGAGGGAGCTTGCTGAGAAGGAACT AGTTTTGT AAGGCG ETKSDIQVNNDIKKDI TACCACTCCTTCCTTCAGGTACAACAAAATTTTCTATGCTGCGCTAGGTA GGGAGGA GCAGGG ELTPNSRQKQTNLAL CCTCCCGGACCTACGACGCTGTGAGGAAAATTCGCTATAATGACAGATA AAGTGGA AGAATC ARPAVIEVEEDLGRQ CAAATCTGCCATTGCTGAAATGCGATCACAGATAGCAGATGCGGCTGCC ATTTTGTA ACTAGT DVKQYLASLRQDDYT GCTGCACAAGAGAGGGATGTAGAGCGGGGTTTAGTTTCAGCACACTCA CCATCTTTT GTAGCT SPAERSIFAYCREETN GACAGAGGAAAAGAAATGCTCCCTGTTGTTGAAACCAAAAGTGATATCC GTCTCTCG GTTCTTT WSATKRQVLKISRTT AAGTAAACAACGATATCAAAAAGGATATTGAATTAACACCGAATTCAAG TATCTACT CCATTAC RGLRQPKKVRPFEFP ACAGAAACAAACTAATCTAGCGCTGGCAAGGCCAGCTGTAATTGAGGT ATAGTAAA GACTTA EGFKPNRNMRKWR GGAGGAAGACTTGGGTAGGCAGGATGTGAAACAATATCTCGCATCCCT TCCGGTCA CGCGGT KYRFLQECYREKRAET GCGCCAAGACGACTACACAAGTCCGGCCGAGCGGTCAATCTTTGCATAC TGCAGCCT TAACGT VSKILDGTFIDEPEEEI TGCAGGGAGGAAACCAATTGGTCTGCGACAAAAAGACAGGTATTAAAG CTACGCGG GGCACG RPELEEVQRMYIDRL ATATCGAGAACTACCAGAGGTTTAAGACAACCTAAGAAGGTTCGTCCAT CGCAACTA ATAGAT EKRTQLDTTKIVQTD TTGAGTTTCCGGAAGGGTTCAAACCTAACAGAAATATGAGAAAGTGGA GAAACTTG TTACACC EVFCLQSYGRITIGEV GAAAGTATAGATTCCTTCAGGAATGCTATAGGGAAAAGAGAGCTGAGA GATCAGTG AGGAAA RDALGASKKDSASGP CTGTTAGCAAGATCCTGGACGGGACTTTTATCGATGAACCGGAGGAAG ATCAAGGC TAATAC DGLLLQDVRRLGPLLL AGATTAGACCAGAGTTAGAGGAAGTACAACGTATGTACATTGACCGGCT TAATGCAT GTGAAG CNIFNMWYLHGIPVE GGAGAAAAGAACTCAGCTGGATACCACGAAGATTGTGCAAACAGACGA GCCGGGTC GGTTCC ENRCRTILLYKSGDRH GGTGTTTTGTCTGCAAAGCTACGGTCGCATTACGATCGGGGAAGTAAGA TCCTCAGA ACCATAT LASNYRPVTIGNMLN GATGCACTCGGTGCAAGCAAGAAGGACTCGGCCTCGGGTCCTGACGGC TTAGGAGT ACTGGA RLYAKIWDKRIRKNV CTGCTTCTACAGGATGTGAGGAGGCTGGGACCACTATTATTGTGTAACA ATAATACA GTTTAG RLHVRQKAFIPVDGC TCTTTAACATGTGGTACTTACATGGGATCCCTGTGGAAGAAAACAGGTG AATCTGAC ATCTATG FENVKTIQCVLQSYR TCGAACAATACTCTTATACAAGAGTGGCGATAGACATCTGGCATCAAAC TTCATCAC AGGGAA KRKLEHNVVFIDLAK TATAGACCTGTGACAATCGGCAACATGCTGAACAGGCTTTACGCCAAAA TAAGAGGC ACATTTG AFDTVLHDSIRKALW TCTGGGACAAACGGATCCGGAAGAACGTGCGTCTTCATGTGAGGCAAA TATGGGGC TAATAA RKGVPSGVVKVVDSL AAGCATTTATCCCGGTGGATGGGTGCTTTGAGAACGTAAAAACGATCCA TAACGATC GTCAGT YAGAVTSISVGKTKTR ATGCGTTCTCCAGTCTTACAGAAAGCGTAAGTTGGAACACAACGTCGTA CTATAGTC CTGGTA SICINSGVKQGCPLSP TTTATTGATCTTGCCAAGGCCTTTGACACGGTCTTGCATGACTCGATAAG TCG(SEQ ACCTGG LLFNLILDELAERIEAT GAAAGCGTTGTGGCGGAAAGGTGTTCCGTCTGGGGTTGTTAAAGTGGT IDNO: CGCCGC GCGLDLDGHVLSSM AGACAGCTTATATGCGGGAGCTGTCACAAGCATAAGTGTTGGAAAAAC 1146) TGTTGA AFADDYVLLAKDSVE GAAAACTCGTTCTATATGTATAAACTCTGGAGTCAAGCAGGGTTGTCCT GTCAAA MNELIRVCSTFFKEK CTGTCACCTCTTCTATTCAACCTAATACTGGATGAACTAGCGGAGAGGAT TTAACTA GLSVNPGKCQSLRVL AGAGGCAACCGGCTGCGGGTTAGATCTTGATGGTCACGTTCTATCATCT TGTCAAT PVKEKKRSMKVLVRP ATGGCCTTTGCTGACGACTACGTGTTGCTAGCGAAGGACTCCGTGGAGA ACTCATT HRWWRIKDQDVDIP TGAACGAGTTGATAAGAGTGTGTAGTACATTCTTCAAAGAGAAAGGCTT AAGTTA SMTYDSLGKYLGVSI ATCTGTAAACCCAGGTAAATGTCAATCGCTAAGAGTTCTTCCCGTAAAG TCGACTT DPTGKIALPIEEWKN GAGAAGAAACGGTCAATGAAGGTCCTTGTTAGACCTCATAGATGGTGG TGATAT WMTKLKECKLKPEQ AGGATAAAAGACCAGGATGTTGACATCCCATCTATGACATATGACAGCT GGCATG KVKILKEVVCSRVNYV TAGGAAAATACCTTGGTGTTTCGATTGACCCAACTGGTAAGATAGCGCT GGGTGA LRMSECGISELRSWT TCCGATTGAGGAGTGGAAGAATTGGATGACCAAGCTAAAAGAGTGTAA TTCCGC RFVRNWAKNIIHLPT GCTCAAGCCCGAGCAGAAAGTTAAAATTCTGAAAGAAGTGGTTTGCTCT GTTATAT WCSSDWIHSIKGLGI CGGGTAAACTACGTTTTGCGGATGTCAGAGTGTGGCATCAGCGAACTTC CAAAGT PDVSKGIVIQRMRAS GGAGTTGGACACGATTTGTAAGGAATTGGGCGAAAAACATCATTCACTT CAAACA EKMSTSEDGIVRVVG ACCCACATGGTGCAGTAGTGACTGGATACACTCGATCAAAGGGTTAGG TGATGA ARLVQKNRVLWEKA CATTCCCGACGTTTCGAAGGGAATTGTCATACAACGTATGAGGGCTTCG TTGCAAT GFEGIELKAARRHCE GAGAAAATGTCTACGTCTGAAGACGGTATAGTCCGCGTGGTTGGTGCA GAGAAA VERLNNIGNITNGVA CGACTTGTTCAGAAGAACAGAGTCTTGTGGGAAAAGGCCGGTTTCGAA CTACCAC LKTIAAVSSVNRYW GGTATCGAACTGAAGGCAGCCAGGAGGCACTGCGAAGTGGAGAGACT GCTTGG MIEDNLKSGNKILVW CAACAACATTGGTAACATTACCAACGGCGTTGCACTCAAAACTATCGCA TCACGTT KAMAGAIPTKINLSR GCAGTCTCCTCGGTAAATCGGTACTGGATGATTGAAGACAACTTGAAAT TGTGAG GVADQTLKKCRRCGL CCGGGAACAAGATTCTCGTTTGGAAAGCAATGGCGGGTGCCATTCCAAC GAGAAC TAETDGHILAGCHTS AAAGATTAACCTTTCGCGGGGCGTAGCAGACCAGACCCTCAAAAAATGT ATCTCAT SDAYSKRHNMLCDKL CGTCGATGCGGTTTAACAGCGGAAACGGATGGACACATCTTGGCTGGA TCAAGC AKELKLNGGPNRRV TGCCATACTAGCAGCGACGCGTACTCAAAACGTCACAACATGCTCTGTG CTCCCG WRERTCFTSTGRRYR ATAAACTCGCCAAAGAGCTCAAACTCAATGGTGGACCAAACAGACGTGT GATGTC PDIIVKDDSKITVIDM GTGGCGCGAGAGGACGTGCTTCACTAGTACAGGCAGGCGATATAGACC GGCACC TCPYEKSEGHLIQCES TGACATTATCGTTAAAGATGACAGTAAAATCACAGTCATCGATATGACTT CGCTGA AKVTKYEPLKLDKYW GTCCGTATGAGAAATCAGAAGGACACCTGATCCAATGTGAAAGTGCGA CATCTTC TRELEGANGIVAEKV AAGTAACTAAATACGAGCCACTCAAGCTAGATAAGTATTGGACTCGAGA TGGCTT ELMGLAIGAIGTIMR ACTCGAGGGAGCAAATGGTATTGTTGCTGAAAAGGTAGAGCTGATGGG ATGAAA STLRKLCELKSGRIVR ATTGGCAATAGGGGCGATCGGCACAATCATGCGTAGTACCCTTCGGAA ATTTTCA RLQMIACNNSAQIIK ACTCTGTGAGTTAAAGTCGGGCAGGATCGTAAGACGTCTACAAATGATT TTAATTT GHLSRATRRNLR GCTTGTAATAATAGCGCCCAAATTATAAAGGGTCACCTGTCAAGGGCGA TTGTAA (SEQIDNO:1391) CTCGGAGGAATTTGCGGTGATAAATGCCAAAAGTTGCTTGGGCTAAATG GTCATG ATACGTACGCTAGAAAAAGCGACTTGCTGCACGGATGACGGTTCATCAG GGCGGC AGCCCGATATGTGCATGTCAAGGCGGCAGGGAGAATCACTAGTGTAGC TTGAAA TGTTCTTTCCATTACGACTTACGCGGTTAACGTGGCACGATAGATTTACA GC(SEQ CCAGGAAATAATACGTGAAGGGTTCCACCATATACTGGAGTTTAGATCT IDNO: ATGAGGGAAACATTTGTAATAAGTCAGTCTGGTAACCTGGCGCCGCTGT 1269) TGAGTCAAATTAACTATGTCAATACTCATTAAGTTATCGACTTTGATATG GCATGGGGTGATTCCGCGTTATATCAAAGTCAAACATGATGATTGCAAT GAGAAACTACCACGCTTGGTCACGTTTGTGAGGAGAACATCTCATTCAA GCCTCCCGGATGTCGGCACCCGCTGACATCTTCTGGCTTATGAAAATTTT CATTAATTTTTGTAAGTCATGGGCGGCTTGAAAGC(SEQIDNO: 1545) R2 R8Hm- Hydra CTTGGGGTCACTGACACATTTTTCGGTAGCCATAGTTTTTTGAGAGGAA CTTGGGGT ATGCCC MSNRITIGDVPSVGK B vulgaris GAGTGGAAGTTTTTCCATGAGTCGTCTCTCGTATAAACTGTGGTAAATCC CACTGACA GAGGTA GGLTVNKQTAGADG GGCCATCCAGCCTCTACGCGGCGCAACTAGAAACTTGGATCAGTGATCA CATTTTTC GTTGGG AEACVVIHPGAKGIW AGGCTAATGGATGACGGGACTCCATGGATAAGGAGATATAAAGATCTT GGTAGCCA ATAATG SSPACLRKFTIGKELR ATTTGAACGCATCTTAAGGGGTTATGGGGCTAACACCCCCTTAATTCTG TAGTTTTTT ATGCAC AHLAQIHKLAPSAVR GTGCACATTTATTGACCGTTATGAGCAATAGAATCACGATAGGTGATGT GAGAGGA AAGCTC YRCNKCPYEGDVQLS ACCCTCGGTAGGAAAGGGGGGTTTAACTGTCAATAAACAAACAGCAGG AGAGTGG GTAAGG VGTHLRYCKGIAGVV AGCTGATGGTGCTGAAGCGTGTGTAGTCATACACCCAGGTGCCAAGGG AAGTTTTT CGACTT EEKKQFACAICNFSSD TATTTGGTCCTCTCCTGCGTGTTTAAGAAAGTTTACGATCGGAAAAGAAC CCATGAGT GCTGCA TFSGLQVHKQRKHV TAAGGGCACATTTGGCTCAAATTCATAAACTTGCACCGAGTGCAGTTCG CGTCTCTC CGTATG VEWNEQLKEKTEFA GTACAGGTGTAATAAGTGTCCGTATGAGGGTGATGTCCAACTCAGTGTG GTATAAAC CCGCTA WTDRELRELAVKEVT GGAACACATCTGAGGTACTGTAAGGGTATTGCGGGAGTGGTGGAGGA TGTGGTAA AACGCT IPFSVVNTETFAVLDI GAAAAAGCAATTCGCTTGCGCGATTTGTAATTTCTCTTCGGATACCTTTT ATCCGGCC TAGCTC TTRTKDAVRKIRYTDR CAGGACTTCAGGTGCATAAGCAAAGAAAGCATGTAGTTGAATGGAACG ATCCAGCC GATGAG YKSILAEVRAQVNAV AGCAGCTGAAAGAGAAAACGGAGTTTGCTTGGACAGACAGGGAACTGC TCTACGCG TGCATG AEEAPQASDESQITLL GGGAGCTGGCGGTTAAGGAAGTAACGATTCCTTTCTCTGTGGTGAATAC GCGCAACT TCAAGA VNTGRGAELQPAVIN GGAGACCTTTGCTGTGCTAGATATTACGACGCGGACTAAGGATGCTGTG AGAAACTT CGGTCG ITDSIELVTDVNEVEM AGGAAAATTCGCTACACGGATAGATACAAATCTATCCTGGCTGAAGTAC GGATCAGT GGAGTA VTSNSTNEEQPINAP GCGCACAAGTTAACGCTGTGGCGGAGGAAGCGCCGCAAGCTAGTGATG GATCAAGG TGATCA VEPAVIEADLGRQDA AGAGTCAAATAACGCTCTTAGTTAACACAGGCAGGGGAGCAGAATTAC CTAATGGA GTGGAG KLYLASLRQSDCTNA AACCTGCTGTGATTAATATAACTGATTCAATTGAATTAGTTACTGATGTC TGACGGG CTGACTT SDRWTLAYCRGEVD AATGAGGTTGAAATGGTAACATCGAATTCAACCAATGAAGAACAGCCTA ACTCCATG TCCAGA WCKTKSRLFKVSRHA TCAACGCGCCGGTGGAACCGGCTGTAATTGAGGCGGACTTGGGAAGAC GATAAGG CAACTC RGLRQPQRVENWEF AGGATGCGAAACTATATCTCGCATCGCTGCGTCAAAGCGATTGCACAAA AGATATAA ACGCGG PEGFRPNRNLRKWR CGCATCTGATCGATGGACCCTTGCGTATTGCAGGGGAGAAGTTGATTGG AGATCTTA ATTCGC KYSFLQSCYRTKKKET TGTAAGACGAAAAGCAGGCTTTTCAAAGTATCAAGACATGCCCGGGGTT TTTGAACG GTGCGG VSKILDGTFKDTPEEEI TAAGACAACCTCAAAGGGTGGAGAATTGGGAGTTTCCAGAGGGATTCA CATCTTAA TGGATA RPELEEVQRVYVDRL GACCTAACAGGAACCTTCGTAAATGGAGGAAGTATTCATTCTTGCAAAG GGGGTTAT CAACAC EVRTQLDTTRTVHID TTGCTATAGAACGAAGAAGAAGGAAACTGTTAGTAAGATTCTTGATGGT GGGGCTA CTGGTA ERFDLVSYGRITIREV ACTTTCAAGGACACACCTGAGGAAGAGATTAGGCCAGAGTTGGAGGAA ACACCCCC TAACAT QDAISASKKDASGGP GTACAACGTGTGTACGTTGACCGGCTAGAGGTAAGAACTCAGCTGGAT TTAATTCT ATGAAG DGLLLQDVKKASPRQ ACCACTAGGACAGTGCATATAGACGAAAGATTCGATTTAGTAAGCTATG GGTGCACA GGTTCC LCIIFNMWYLHGIPV GTCGCATTACGATCAGGGAGGTACAAGACGCAATCAGCGCAAGCAAGA TTTATTGA ATCTAGT VENRCRTILLHKGGE AGGATGCCTCAGGGGGTCCCGACGGCTTGCTCCTACAGGACGTGAAAA CCGTT ACAGGG KHLTSNYRPVTIGNM AGGCGAGCCCACGCCAATTGTGTATCATCTTTAATATGTGGTACTTGCAT (SEQID ATAACG LNRVYAKIWDRRIRK GGAATCCCTGTAGTGGAAAATAGGTGCCGAACAATACTCTTGCATAAGG NO:1147) ATCCAT NLQLHVRQKAFVPLD GTGGCGAGAAGCATCTAACGTCGAACTACCGACCTGTGACGATCGGCA GGGAGC GCFENVKTIQCILQSY ATATGCTGAATAGGGTATACGCTAAGATCTGGGACAGACGGATCAGAA AAACTA RRSRREHNVVFVDLA AAAACCTGCAACTTCATGTGAGACAGAAAGCATTCGTCCCGCTGGATGG ATTAGTT KAFDTILHDSIEKALLR GTGCTTTGAGAATGTAAAAACCATCCAATGCATTCTCCAGTCTTACAGAA GGAGGT KGIPRSVIKVVDSLYA GGAGCAGGCGGGAACACAATGTCGTATTTGTCGATCTTGCAAAAGCGTT AATCCA GAVTSITVGKTKTRPI TGATACGATTTTGCATGATTCGATAGAGAAAGCATTGCTGAGGAAAGGC ACGCCG CINSGVKQGCPLSPLL ATACCGCGAAGTGTGATAAAAGTGGTAGACAGCTTATATGCGGGAGCT CTGTTG FNLVIDELAERLEATG GTCACGAGCATTACGGTTGGGAAAACAAAGACTCGACCTATATGTATAA AGTCAG CGLDLEGHVISSMAF ATTCAGGGGTGAAGCAGGGTTGTCCTCTATCTCCTTTGCTGTTCAATCTA TTTTTAA ADDYVLLAKDSVEM GTAATAGATGAACTAGCGGAGAGGCTGGAGGCAACTGGCTGCGGTCTT CCGCCA NVLMNVCNTFFEEK GATCTGGAAGGTCACGTCATTTCTTCCATGGCTTTTGCTGATGACTACGT GTCAAC GLAVNPAKCQSLRVL GTTGTTGGCGAAAGACTCGGTTGAAATGAACGTGCTAATGAACGTGTG TCTTGTA PVKGKRSMKVLTRTH CAATACGTTCTTTGAGGAGAAGGGTTTAGCTGTAAATCCAGCAAAATGT GGTTAT RWWKINNQDVEIPS CAGTCGTTACGCGTTTTGCCTGTAAAAGGCAAACGGTCCATGAAAGTCC CGGTCT MTYESVGKYLGVMI TTACGAGGACGCATAGATGGTGGAAAATTAATAACCAGGATGTTGAAA TCGGCA DPAGKIALPIEEWKL TCCCATCTATGACATACGAAAGTGTTGGAAAATATCTTGGGGTAATGAT GACCTT WLTRLRECKLKPDQK TGACCCAGCTGGTAAGATTGCTCTTCCGATTGAGGAATGGAAGCTTTGG GGACCG VKVLKEVVCARANYV CTAACTAGGTTAAGGGAGTGTAAGCTCAAACCTGATCAAAAAGTGAAG CCTAGC LRMSGCGICELRKWS GTGCTGAAAGAGGTAGTTTGTGCCCGAGCAAACTATGTTCTCCGGATGT GCCGGC RFVRGWVKSIIHFPA CCGGGTGCGGAATCTGTGAGCTCCGTAAGTGGTCACGATTTGTGAGGG CAACAG WCNSEWMHSSKGL GATGGGTGAAATCCATCATTCACTTCCCCGCATGGTGCAATAGCGAATG TTTGTCG GIPDVVSGIVIQRMR GATGCATTCGAGCAAAGGCTTAGGCATTCCTGATGTAGTGTCAGGAATT TCGACT AAEKMAKSTDGVVR GTCATCCAACGAATGAGAGCTGCGGAAAAAATGGCTAAGTCAACAGAC AACATG VVGARIVQTNRVLW GGAGTAGTCCGAGTTGTCGGGGCCCGCATTGTGCAGACAAATAGAGTT ATGATTT KRAGLAGIELDAARK TTGTGGAAAAGGGCCGGATTAGCAGGCATAGAACTGGATGCCGCCAGG GCGAGA FCEVKRVNKIGNQTN AAGTTCTGTGAGGTTAAGAGGGTGAACAAAATTGGCAATCAAACCAAT GAAACC GGALKTIAESSVSRH GGAGGCGCCCTCAAGACTATAGCAGAGTCCTCGGTGAGCCGGCACTGG CACGCTT WLLEKNIRPGNKILV TTATTGGAAAAGAATATAAGACCTGGAAACAAAATTCTAGTTTGGAAGG TGTCACT WKAMAGVIPTKINLS CAATGGCAGGAGTGATTCCAACAAAGATCAATCTGTCTAGAGGCGTAGC TATGTG RGVADQTLKKCRCC CGACCAGACTCTCAAAAAATGTCGGTGTTGTGGTTTAACAGCAGAAACT AGGATA GLTAETDCHILAGCPT GATTGTCACATCTTGGCCGGATGTCCTACCAGTCGGGATGCGTACTCGA AAATCTC SRDAYSKRHNLLCDK AACGTCATAACTTGCTTTGTGATAAACTCGCCAAAGAGCTAAGACTCAAT TTGTCCA LAKELRLNGGPSRRV GGTGGGCCAAGCAGACGGGTGTGGCGCGAGAGGATGTGTCTCTCTGG TATGATC WRERMCLSGNGRRY GAATGGCAGGCGTTATAAGCCCGATATTGTTGTGAAAGATGATGGTGT CTTTGAA KPDIVVKDDGVITVID AATTACTGTCATCGATATGGCATGTCCGTACGAGAAATCGGAAAGACAC GGGAAC MACPYEKSERHLSQC CTAAGTCAATGCGAAGATGCAAAAGTTGCTAAGTACGAGCCACTAAGG AGCGCT EDAKVAKYEPLRLDR CTTGATAGGAGTTGGACTCAAGAACTTGAGGGGAATAACGGCAGAAGT TTGAGC SWTQELEGNNGRSA GCTAATGAAATATCAGTTGTAGGGATTGCAGTAGGGGCGATTGGAACA TTGCTC NEISVVGIAVGAIGTI ATTACGCGTAAAACCCAGCGGATACTTAGCAAGTTGAAACTGGCCAAGG GGCGTT TRKTQRILSKLKLAKV TCGGAAGACCGTTACAAATAATTGCATGTAATGAAAGCGCCCAAATTAT GGCACC GRPLQIIACNESAQIIR AAGACGACATCTTTCGGGATCGAGACTTAGAAATTTGCGGTGAATGCCC TTTAGTC RHLSGSRLRNLR GAGGTAGTTGGGATAATGATGCACAAGCTCGTAAGGCGACTTGCTGCA TGTAAT (SEQIDNO:1392) CGTATGCCGCTAAACGCTTAGCTCGATGAGTGCATGTCAAGACGGTCGG ATTTTCT GAGTATGATCAGTGGAGCTGACTTTCCAGACAACTCACGCGGATTCGCG TGATATT TGCGGTGGATACAACACCTGGTATAACATATGAAGGGTTCCATCTAGTA ATGGAC CAGGGATAACGATCCATGGGAGCAAACTAATTAGTTGGAGGTAATCCA GAAAAA ACGCCGCTGTTGAGTCAGTTTTTAACCGCCAGTCAACTCTTGTAGGTTAT GGTAGT CGGTCTTCGGCAGACCTTGGACCGCCTAGCGCCGGCCAACAGTTTGTCG ATGGTT TCGACTAACATGATGATTTGCGAGAGAAACCCACGCTTTGTCACTTATGT GCA GAGGATAAAATCTCTTGTCCATATGATCCTTTGAAGGGAACAGCGCTTT (SEQID GAGCTTGCTCGGCGTTGGCACCTTTAGTCTGTAATATTTTCTTGATATTA NO: TGGACGAAAAAGGTAGTATGGTTGCA(SEQIDNO:1546) 1270) R2 R9Av GQ398057 Adineta GAAATAGTTTGCAATGGTAGGTGTATGGCGCCTCTGTGTCTCTCTTTCGC GAAATAGT ACTAGT MNLPIREHAVSVHNI vaga TGGATATAGTTTGACGATTTTGTACCAGGTATCTGTTTCTTGTGAGTTCA TTGCAATG CTCCTTC NKFNYLCQLCSKSYD GCACCAGTTTGAACAGGCTTAGCGATAGACCTTCGAACTTGAAACACTG GTAGGTGT TTCTATT TINSVKAHYVACRRQ TTGTGAAGCTGGCTGGGCCCCTGCAGATTTTCTCGATTAGAACGTGAGT ATGGCGCC AGTCAG KNASSTTAVPTNVIN GTTACGTCCAGAATGACCCACCAGTGGTTAGTTCTACGTTGCCCTGGAA TCTGTGTC TCTAATT NNQLAINTNQVISRN AGGAGAAAAGTTGAGCTAAAATCGCACGGCCTAGTTGTTTATCAAATAG TCTCTTTC AATTTTT PLQCVECLMKQVDF GCACGGTGAGGAACTCTTCTATGTACCCTGACTAAAGTACTCACTTGTGC GCTGGATA CTTACAT YAKDTKALVTHMRTK GCTGGGTTTGCTCCCCCTCGCATTGACTTATCTGATCGCACTACCCACCA TAGTTTGA TCTACAT HAAAYEESKKVATRR AACGAAACATAAACTTAGCTCGTGGTATCAGTCCACAGCGTGTGCAGTC CGATTTTG CTAGTTC VAWSPDEDQILAELE GGATTCAGGGGAGCGTGTTAGTGACAAGCAGGATAATATTAACATAGT TACCAGGT CATTATT VKLKKIQKGQLLSRLV TAATGTTAAGGCGTTCAACATTCCTTATCCAATTGGAAGAGTTGACTGTG ATCTGTTT AAATTG VEYNKCADKSKAPSR AAGTTTGTCATGAAGACATTGGACAAATGAATTTGCCGATTCGAGAGCA CTTGTGAG GTATGA SKDAIRTRRQQHDYK TGCCGTATCTGTACACAATATAAACAAATTTAATTATTTATGCCAGCTAT TTCAGCAC TCAGTG LLLRSLQSQQPPVGS GTTCTAAGTCTTATGATACTATTAATAGTGTTAAAGCTCACTATGTTGCA CAGTTTGA CTATCTC EDSDSDISSSNNNPLT TGCAGAAGACAGAAGAATGCCTCATCCACAACAGCTGTTCCAACCAATG ACAGGCTT TGCTAC TTHNVTPTPDSSNVV TCATCAACAACAACCAACTTGCTATAAATACTAATCAAGTAATATCAAGA AGCGATAG ACTCAAT LLIQKIRESVDSIVKIT AATCCACTTCAGTGCGTTGAGTGTCTAATGAAACAAGTTGATTTCTATGC ACCTTCGA GCTTAAT NLKLNTNMLNAASA TAAAGATACAAAGGCACTAGTCACGCACATGCGTACTAAACATGCTGCT ACTTGAAA CGTATG FINQNNNMDPLELS GCCTACGAGGAATCAAAGAAAGTCGCAACAAGAAGAGTTGCCTGGAGC CACTGTTG TTATTGA MRGIEEDVKAIRDKE CCTGATGAGGATCAAATTCTTGCTGAACTAGAAGTCAAATTGAAAAAGA TGAAGCTG CAGTCT LQKPTRNVPSSTTSR TACAAAAAGGTCAATTACTTAGTCGTCTTGTCGTTGAATATAATAAATGT GCTGGGCC GACACT KPTRNAKRLEKSKKY GCTGATAAATCGAAAGCTCCTTCCAGGTCCAAGGATGCTATTCGTACAA CCTGCAGA TGATTAC GYYQHLYYNNKKKLV GACGCCAACAACATGATTACAAACTATTGCTTCGCTCACTCCAATCTCAA TTTTCTCG TCTTACG AEILDGETSGAKPPP CAACCGCCAGTTGGTAGCGAAGACAGTGACAGTGACATATCTTCTAGTA ATTAGAAC ACATAT MNLVEDYYRNIWSR ATAACAATCCTTTAACAACAACACATAATGTCACTCCAACGCCAGATTCA GTGAGTGT GCACTG STIDDSPVNNIKTVNS TCCAACGTTGTACTACTAATACAAAAGATCCGTGAATCTGTAGATTCCAT TACGTCCA TTTGCTT DSIFAPISRDEIKLALS TGTAAAAATAACGAACCTCAAATTGAACACGAATATGCTGAACGCAGCA GAATGACC CAGAGA NTKKDSAAGPDAVTI AGTGCGTTCATTAATCAAAATAACAACATGGATCCACTTGAACTATCTAT CACCAGTG AACCAC KEAKAIIDNLYVAYNI GCGTGGTATCGAAGAGGATGTGAAGGCAATTCGAGACAAAGAACTTCA GTTAGTTC TGTTCAT WLGVQGIPEQLKLNK GAAACCAACCAGGAACGTTCCTTCTTCAACAACTTCGAGAAAGCCAACT TACGTTGC ATAGTG TILIPKGNSDLSLLKN CGAAATGCCAAAAGGCTTGAGAAATCAAAAAAATATGGCTATTATCAAC CCTGGAAA AAGTTC WRPITISSIILRVYNRL ATCTGTACTATAATAACAAGAAAAAATTAGTAGCGGAAATCCTCGATGG GGAGAAA CTCAGTT LAYRMNKIFKTNDKQ CGAAACAAGTGGTGCTAAGCCACCTCCAATGAACCTGGTTGAAGATTAT AGTTGAGC TTCTGTT VGFKPVNGCGINISW TATAGAAATATTTGGTCACGTTCTACTATTGATGATTCGCCTGTTAACAA TAAAATCG GATATA LHSLLKHARLNKNSIY TATTAAAACCGTTAATAGTGACTCTATATTTGCTCCAATTTCGCGTGATG CACGGCCT TTCTTCT ACLVDVSKAFDSVSH AAATCAAATTAGCATTATCAAATACGAAAAAGGATTCAGCAGCTGGACC AGTTGTTT TTCATTC QSIVRALTMNGAPSL TGACGCTGTAACAATAAAAGAAGCAAAAGCTATTATTGACAATCTTTAT ATCAAATA TCGCTTC LVKLIMDQYTNVNTV GTTGCATATAATATATGGCTAGGTGTTCAAGGAATTCCTGAACAACTGA GGCACGGT TCCTTTT ITCSGSISNKINISSGV AATTGAATAAAACTATCTTAATTCCAAAAGGAAATTCCGATCTTAGTCTA GAGGAACT CTACTGT KQGDPLSSLLFNLVID CTGAAAAACTGGCGACCTATTACAATCTCGTCTATTATCCTAAGAGTATA CTTCTATG GTTCTTT ELFDVIKDQYGYTIDN CAACAGATTATTAGCATACAGAATGAACAAGATCTTTAAAACTAATGAT TACCCTGA TTATCAG IGTTNARCFADDLTLI AAACAAGTTGGATTCAAACCTGTTAATGGTTGTGGTATTAATATATCTTG CTAAAGTA TTTTTTG SSSRMGMNKLLELTT GCTTCACTCTCTCTTGAAGCATGCACGCTTAAACAAAAATTCAATATATG CTCACTTG TGGAAA KFFKERGLNVNPSKC CTTGTCTTGTCGATGTGTCTAAAGCCTTTGATTCTGTGTCACATCAATCAA TGCGCTGG AATTGA MSIGMSKGYKGKKS TAGTAAGAGCTCTCACAATGAATGGTGCACCATCCTTGCTAGTGAAATT GTTTGCTC GAATAA KIESEPLFSITDAQIPM AATAATGGATCAATATACGAATGTAAATACTGTCATCACATGTTCTGGTT CCCCTCGC ATAAAG LGYIDKTTRYLGVNFT CTATATCAAACAAGATAAATATCTCCAGTGGTGTCAAGCAAGGTGACCC ATTGACTT T(SEQID SIGAIDAKRIKKDLQD ACTATCTAGCTTGTTGTTCAATCTGGTTATAGATGAACTGTTCGATGTAA ATCTGATC NO: TLDKLEHLKLKAQCK TAAAGGACCAATATGGTTATACAATTGATAACATTGGCACCACCAATGC GCACTACC 1271) MDLLRTYMIPRFMF ACGATGCTTCGCCGATGATTTAACACTAATATCATCATCTAGAATGGGTA CACCAAAC QLIHTELYPKLLIKMDI TGAATAAATTGCTTGAGCTCACCACGAAATTCTTCAAAGAACGTGGACT GAAACATA LIRKLAKRILHLPISTSS AAATGTAAACCCATCAAAGTGCATGTCTATTGGCATGTCCAAAGGTTAT AACTTAGC EFFYLPFKEGGLQLTS AAAGGAAAGAAGAGTAAAATCGAATCTGAACCACTCTTCTCTATCACCG TCGTGGTA LKEAVGLAKIKLHKKI ATGCTCAGATACCGATGTTGGGCTATATTGATAAGACAACTCGATATCTC TCAGTCCA MSSNDPMLCYLIESQ GGTGTAAATTTCACATCTATTGGTGCCATTGATGCAAAAAGAATCAAAA CAGCGTGT RSRIVEHFMKDLKLG AAGACCTTCAGGACACACTCGATAAGCTTGAACATCTTAAACTCAAAGC GCAGTCGG DSLTLNEMNNIKECF TCAGTGCAAAATGGATCTCTTACGAACTTATATGATACCAAGATTCATGT ATTCAGGG MKEKRISFAQKIHGV TTCAATTAATTCATACTGAGTTATATCCGAAATTGCTTATTAAAATGGAC GAGCGTGT GFEVFSSSPLTNQWI ATCTTAATTAGGAAATTAGCTAAACGAATCCTACATCTGCCCATATCAAC TAGTGACA NGEIKTMTTKTYINSI GAGTAGTGAATTCTTTTACTTACCCTTCAAAGAAGGAGGTCTTCAACTAA AGCAGGAT KLRTNTLETRVTTSRG CCTCACTTAAAGAAGCAGTTGGTTTAGCCAAAATAAAATTACACAAGAA AATATTAA LNIIKTCRRCHVADES GATAATGTCCAGTAATGATCCAATGTTATGCTACTTGATTGAGAGCCAG CATAGTTA LMHVLQCCSSTKGLR AGGAGCCGTATTGTCGAACATTTTATGAAAGACCTTAAACTTGGAGATT ATGTTAAG YSRHHKICAKVANKL CTTTAACATTAAACGAAATGAATAACATCAAAGAGTGCTTCATGAAAGA GCGTTCAA VMNGYGVFREKSYP AAAAAGAATCTCATTTGCTCAAAAAATTCACGGTGTCGGCTTCGAAGTA CATTCCTT DPNNSGSYLRPDIIAV TTCTCATCAAGTCCTTTGACGAACCAATGGATTAATGGCGAAATTAAGA ATCCAATT KNGHVIVLDVTVVYE CAATGACAACTAAAACATACATTAACTCAATTAAACTTAGAACAAATACT GGAAGAG VTGATFINAYQTKINK CTAGAAACTCGGGTAACAACATCTCGGGGACTGAACATCATAAAAACAT TTGACTGT YNAIMVQIEQMFNC GTAGAAGATGCCACGTAGCTGACGAAAGTCTCATGCATGTGCTCCAATG GAAGTTTG VNGELHGLVIGSRGSI TTGCTCTTCTACCAAAGGTTTACGATACTCTCGTCATCACAAAATATGTG TCATGAAG HHSQLHIWHQMGFS CCAAAGTAGCAAATAAATTGGTAATGAATGGTTATGGTGTATTTCGTGA ACATTGGA SIELKYVAIGCMEDSL GAAGAGTTATCCAGATCCAAACAACTCAGGTTCATACCTTCGACCGGAT CAA(SEQ RIMSTFSKAIT(SEQ ATAATTGCAGTAAAAAATGGTCATGTTATTGTTCTTGATGTAACGGTTGT IDNO: IDNO:1393) GTACGAAGTAACTGGTGCTACGTTTATTAATGCCTACCAAACAAAAATA 1148) AATAAATATAATGCGATTATGGTACAAATCGAGCAAATGTTCAATTGTG TTAATGGTGAATTGCATGGTCTAGTAATTGGATCACGTGGTTCAATTCAT CACAGTCAACTCCACATCTGGCATCAAATGGGATTCTCTTCCATAGAACT TAAATATGTGGCTATAGGATGCATGGAGGATTCGCTCAGAATCATGTCC ACATTCTCAAAAGCTATCACATGAACTAGTCTCCTTCTTCTATTAGTCAGT CTAATTAATTTTTCTTACATTCTACATCTAGTTCCATTATTAAATTGGTATG ATCAGTGCTATCTCTGCTACACTCAATGCTTAATCGTATGTTATTGACAG TCTGACACTTGATTACTCTTACGACATATGCACTGTTTGCTTCAGAGAAA CCACTGTTCATATAGTGAAGTTCCTCAGTTTTCTGTTGATATATTCTTCTT TCATTCTCGCTTCTCCTTTTCTACTGTGTTCTTITTATCAGTTTTTTGTGGA AAAATTGAGAATAAATAAAGT(SEQIDNO:1025) R2 R2O1 LC349444 Oryzias CGCACAGGGGACACAGAGCCTGCCCAAGTACCGCTCCCGAGGGAGCGG CGCACAGG GGGGGA MGTDTVYVGQDYPS latipes GAAACGGGGGGGTGACTATCCCCTGGGGTCCGGCGAGAGCGCTGGTCT GGACACA CAGCTG GLSKRVPARLVAGP ACGGACCAGGGGTGGCTGTGGGCAGGCTGCTCCTCAGGCCAGTTGATT GAGCCTGC GGAGTC MLRERSCHAHVFRA AGTTACGCATGGGCTGTACCTCCACGTGGTCCCGCTGGTAACGACTTGT CCAAGTAC TCGGCA GHMWNWRTSLPSG CGGCTAAATCAGCCCGCCCACCATCTGGGATATGGTTGACCGTCTAACC CGCTCCCG TGATTAC RWDQPALEKSRVLTR CCAGTACTCAGGTCACAAACAAAATGGGAACAGATACAGTGTATGTCG AGGGAGC AAATCTT SVATATDPEITSYPGK GCCAGGACTACCCTTCTGGCTTATCAAAACGGGTACCAGCACGGTTAGT GGGAAAC GCGCTG SVSTSTQVQEEDWC GGCGGGACCGATGCTGCGAGAGCGAAGCTGTCACGCCCATGTGTTTAG GGGGGGG CACTCG SRESGWISPGLAPEE GGCTGGACACATGTGGAACTGGCGAACCAGCCTTCCGAGCGGGCGCTG TGACTATC GATGTC PSVVSEITASMVATM GGACCAGCCCGCTTTGGAGAAGTCTCGGGTCCTAACCCGGTCGGTGGC CCCTGGGG GTCCCC RVATEEVVLEPQPEQ GACGGCCACCGACCCCGAAATTACCTCTTACCCAGGAAAGTCCGTATCG TCCGGCGA GTGACG VVTILPEHGRNVPPG ACAAGTACGCAGGTTCAGGAGGAGGACTGGTGTAGCCGGGAGAGCGG GAGCGCTG GACACA LAEQDTASPIEVSVLL GTGGATCTCGCCAGGACTTGCTCCTGAAGAACCCTCGGTGGTGTCCGAA GTCTACGG TTAATCC PDLAENCPLCGVPSG ATTACAGCCTCCATGGTAGCGACAATGAGGGTAGCAACCGAGGAGGTC ACCAGGG GGAAAG GLRLLGKHFAVRHAG GTGCTGGAACCACAGCCTGAACAGGTCGTCACAATACTGCCGGAGCAT GTGGCTGT CGAGTG VPVTYECRKCAWRSP GGTCGAAACGTTCCTCCGGGGCTGGCAGAACAGGACACCGCCAGCCCC GGGCAGG GTGACT NSHSISCHVPKCRGR ATAGAAGTCTCGGTGCTCCTCCCAGACCTCGCTGAGAACTGCCCATTGT CTGCTCCT CGCCTC ARMPSGDPGIACDLC GTGGCGTGCCGAGCGGGGGCCTACGCTTGCTCGGGAAGCATTTTGCTG CAGGCCAG AAG EARFATEVGVAQHK TCCGACATGCGGGGGTGCCTGTAACGTATGAGTGCCGTAAGTGTGCGT TTGATTAG (SEQID RHVHPVEWNKVRLE GGCGGAGCCCCAACAGCCACTCAATCTCGTGTCACGTCCCCAAATGCCG TTACGCAT NO: RRGARGGGIKATKL GGGGCGTGCGCGGATGCCCAGTGGCGATCCAGGGATCGCCTGCGATCT GGGCTGTA 1272) WSVAEVETLIRLIREH CTGTGAAGCCCGGTTTGCCACGGAGGTTGGGGTCGCCCAACACAAGCG CCTCCACG GDSGATYQLIADELG GCACGTTCATCCGGTGGAGTGGAACAAGGTGAGGCTGGAAAGGAGAG TGGTCCCG RGKTAEQVRSKKRLL GTGCGCGCGGAGGGGGAATTAAGGCGACGAAGCTCTGGAGTGTAGCG CTGGTAAC RIDTASNSPDDAEVE GAGGTAGAGACGCTAATCCGGCTCATCCGTGAGCACGGAGATTCAGGT GACTTGTC EERLESLAVRSSSRSP GCCACTTACCAGCTCATTGCCGATGAGCTGGGAAGGGGCAAGACGGCC GGCTAAAT PSLVATRVREAVARG GAACAGGTGAGGAGTAAAAAGAGGCTCCTGCGCATAGATACGGCAAGC CAGCCCGC ESEGGEEIRAIAALIRD AATAGCCCAGATGATGCAGAGGTTGAGGAGGAGAGGTTGGAATCTCTG CCACCATC VDQNPCLIETSASDIIS GCGGTTCGGTCCTCGTCACGGTCACCCCCGAGCCTGGTGGCGACCAGG TGGGATAT KLGRRVDGPKRPRPV GTCAGGGAGGCAGTTGCCAGGGGTGAATCAGAAGGTGGCGAGGAGAT GGTTGACC VREQTQEKGWVRRL CAGGGCTATTGCTGCTCTCATTAGGGACGTAGATCAGAATCCTTGTCTG GTCTAACC ARRKREYREAQYLYS ATTGAAACCTCGGCGTCGGACATCATCTCGAAGCTGGGAAGGAGGGTG CCAGTACT RDQARLAAQILDGAA GATGGGCCCAAGAGACCCAGGCCCGTTGTCAGAGAACAGACCCAAGAG CAGGTCAC SQECALPVDQVYGAF AAGGGATGGGTAAGGCGGCTTGCCCGGCGGAAAAGGGAGTACAGAGA AAACAAA REKWETVGQFHGLG AGCGCAGTACCTGTACTCAAGGGATCAAGCAAGGCTGGCGGCCCAGAT (SEQID EFRTGARADNWEFY CCTCGATGGTGCCGCCAGCCAGGAATGCGCCCTCCCGGTGGACCAGGT NO:1149) SPILAAEVKENLMRM CTACGGAGCGTTCCGTGAGAAATGGGAAACCGTAGGGCAGTTCCACGG ANGTAPGPDRISKKA ACTTGGTGAGTTCCGGACGGGTGCACGCGCAGACAACTGGGAGTTCTA LLDWDPRGEQLARLY CTCTCCAATTCTGGCGGCTGAGGTGAAAGAAAACCTAATGAGAATGGCT TTWLIGGVIPRVFKEC AACGGCACGGCCCCGGGACCAGACAGGATAAGCAAAAAGGCTCTGCTT RTKLLPKSSDPVELQD GACTGGGACCCCCGGGGTGAGCAACTGGCACGGCTGTACACGACGTGG IGGWRPVTIGSMVT CTGATCGGTGGGGTCATACCAAGGGTCTTCAAGGAGTGCAGGACTAAG RLFSRILTMRLTRACP CTGCTACCGAAATCCAGCGACCCGGTCGAGTTGCAGGACATCGGTGGA INPRQRGFLASSSGC TGGAGGCCGGTGACGATTGGGTCGATGGTGACTAGGCTGTTCAGTCGG AENLLIFDEIVRRSRR ATTCTAACGATGAGGCTAACCCGAGCCTGTCCGATCAATCCGAGGCAGC DGGPLAVVFVDFAR GCGGTTTCTTGGCCTCCTCGAGTGGATGCGCGGAAAACCTGTTGATCTT AFDSISHEHILCVLEE TGACGAGATCGTCAGGCGCTCGAGGCGGGACGGGGGGCCGCTGGCAG GGLDRHVIGLIRNSYV TGGTGTTTGTGGACTTTGCGAGGGCCTTTGACTCCATCTCACATGAACAT DCVTRVGCVEGMTP ATCCTGTGTGTTCTCGAAGAAGGCGGGCTTGACAGGCACGTTATCGGGT PIQMKVGVKQGDP TGATCCGAAACTCGTACGTGGATTGCGTGACCAGGGTGGGTTGTGTCG MSPLLFNLAMDPLIH AGGGCATGACACCACCAATACAAATGAAGGTTGGAGTGAAGCAGGGA KLETAGTGLKWGDLS GACCCCATGTCCCCCTTGCTCTTCAACCTGGCTATGGATCCCCTCATCCAT IATLAFADDLVLVSDS AAACTCGAGACGGCCGGAACTGGACTGAAATGGGGCGATCTTTCAATC EEGMGRSLGILEKFC GCCACGCTGGCCTTTGCCGACGATCTGGTGCTGGTGAGTGACTCTGAGG QLTGLRVQPRKCHGF AAGGCATGGGGAGGAGTCTCGGGATTTTGGAGAAGTTTTGCCAACTGA FMDKGVVNGCGTW CTGGGCTGAGGGTTCAGCCCAGGAAGTGTCACGGTTTCTTTATGGACAA EICGSPIHMIPPGESV GGGCGTGGTGAACGGCTGTGGAACCTGGGAAATCTGTGGGTCACCGAT RYLGVQVGPGRGVM CCACATGATTCCCCCGGGGGAATCAGTTCGTTATTTGGGAGTCCAGGTA EPDLIPTVHTWIERIS GGCCCGGGGCGCGGCGTGATGGAACCGGATCTTATCCCTACGGTCCAC EAPLKPSQRMRVLNS ACGTGGATCGAAAGGATCTCGGAGGCTCCTCTAAAGCCCTCACAACGCA FALPRIIYQADLGKVT TGAGGGTTTTGAACTCATTCGCTCTCCCCCGGATAATTTACCAGGCCGAT VTKLAQIDGIVRKAVK CTAGGGAAGGTTACGGTAACCAAATTGGCCCAGATAGATGGGATTGTC KWLHLSPSTCNGLLY CGGAAGGCTGTGAAGAAGTGGCTCCATTTGTCACCATCCACGTGCAATG SRNRDGGLGLLKLER GACTGCTGTATTCACGGAACCGCGACGGTGGTTTGGGCCTCCTAAAGCT LIPSVRTKRIYRMSRS GGAAAGACTAATCCCATCCGTGCGCACGAAGCGTATCTATCGGATGTCC PDIWTRRMTSHSVSK AGGTCTCCGGATATCTGGACACGGCGAATGACCAGCCATTCTGTGTCAA SDWEMLWVQAGGE AATCTGACTGGGAGATGTTGTGGGTCCAAGCGGGAGGTGAGAGGGGC RGSAPVMGAVEAAP AGTGCACCTGTAATGGGTGCCGTGGAGGCTGCCCCGACCGATGTGGAG TDVERSPDYPDWRR AGATCGCCAGACTACCCAGACTGGCGGCGTGAGGAAAACCTGGCATGG EENLAWSALRVQGV TCGGCCCTGCGGGTGCAGGGTGTGGGTGCAGACCAGTTTCGAGGCGAC GADQFRGDRTSSSW AGGACCAGCAGCTCTTGGATCGCCGAGCCCGCTTCGGTTGGGTTCGCGC IAEPASVGFAQRHWL AGCGCCACTGGTTGGCTGCCCTGGCGCTGAGGGCTGGGGTGTATCCGA AALALRAGVYPTREF CTCGGGAGTTTCTGGCTCGGGGTAAGGAAAAGTCAGGAGCAGCTTGCA LARGKEKSGAACRRC GACGCTGCCCGGCCAGGTTGGAATCATGTTCACACATACTTGGGCAATG PARLESCSHILGQCPF TCCGTTCGTTCAGGCGAACAGAATTGCGAGGCACAACAAGGTGTGTGT VQANRIARHNKVCVL GCTCTTGGCCACGGAGGCGGAGAGGTTCGGCTGGACGGTAATAAGGG LATEAERFGWTVIRE AGTTCCGTCTTGAGGACGCCGCTGGCGGTCTCAAGATACCCGACCTGGT FRLEDAAGGLKIPDLV TTGCAAGAAGGCCGACACAGTTCTCATTGTCGACGTGACCGTCCGGTAC CKKADTVLIVDVTVR GAGATGGATGGAGAGACGCTAAAAAGGGCCGCATCGGAGAAGGTGAA YEMDGETLKRAASEK ACACTATCTCCCAGTAGGGCAACAGATAACGGACAAGGTCGGAGGGCG VKHYLPVGQQITDKV TTGCTTTAAAGTCATGGGGTTCCCTGTAGGTGCTAGGGGAAAGTGGCCG GGRCFKVMGFPVGA GCGAGCAACAACACAGTTTTGGCTGAGTTAGGCGTCCCTGCAGGTCGG RGKWPASNNTVLAE ATGAGGACCTTTGCCAGGCTGGTGAGCCGGAGGACTCTTCTTTATTCTTT LGVPAGRMRTFARL GGATATATTGAGGGACTTCATGCGTGAGCCGGCCGGCAGGGGAACTCG VSRRTLLYSLDILRDF GGTTGCTCTCATCCCTGCGGCAACGGGTGCCGCGAATTGAGGGGGACA MREPAGRGTRVALIP GCTGGGAGTCTCGGCATGATTACAAATCTTGCGCTGCACTCGGATGTCG AATGAAN(SEQID TCCCCGTGACGGACACATTAATCCGGAAAGCGAGTGGTGACTCGCCTCA NO:1394) AG(SEQIDNO:1026) R2 R2_LP AF015814 Limulus TGGGAGGAGACCCAAACTATCCTAGGATGGGGCGGAACCGACCATATG TGGGAGG ATTTTGT GIDGYMFGYARASG polyphemus AGCCATATTAACATTGCCCACACTATCCTCTGGAGGTACCTCCTCGTGGT AGACCCAA CTCTTTC STSVSIQSSSMTEGET ACGGCTGGATATAGGTAAATCCTGTAACCAAATCCTCCAACCCGTGAAG ACTATCCT CCCAAT NERATPRASDSSSVSI GAGAACACTAAAACCCATATAGTGGCCTCGCCAACCACTATATGTCCAA AGGATGG GATGTC QSSCVTEGECLPPTD CGGCAGGAGAAGCTATCTCCCGGATGGGAAGGAAAACCCTAAACCGTG GGCGGAA TACTAG NCNPSVENQLPCVTE ATGGGAACTTACCGGCCCCATCAGCTATTGGGTACCCGGTAGGGACTTG CCGACCAT CACGCT GRFERVGSLVTVRLP CAACCCTACCCTGTATTTGCATTTTATAGGGAACCGGTCGGCCCTATATC ATGAGCCA GCCGAA FRKVACDLCSKEFLTY AGAGTAGACCGTTTATTAAATATGGGTGAAAATATTAACAGTAAAAGCT TATTAACA GCTAGA SKFAVHQANFHNSET ATGGTTTGGCGTCCGTGTGGTGCCAGGGCGGCGGCCAAACCCGAGCTA TTGCCCAC TAGATT QACCTYCGKSDGNH CTTGGCACCAACTGGGGATGGTAGCTTCCGAGCGATTCCCTGGCGACGT ACTATCCT GAGGAA HSIACHVPKCPWRRT GGGACCGATCGACGATGGAGTCCAAACATCCGGAATAGAGGAATTGAG CTGGAGGT TCTGCG VTFAANLSNFLCDLC AAATACCTATTCCACCACCGGCTCACATACCCAAGGTGAACCCGGTGCA ACCTCCTC TAATCTG NDSFKTKSGLSQHKR ACTAGAGTACAACCTATCTGTGGCGGTAGGTGCCGAACCACTCAGGTGA GTGGTACG TAATGA HKHPCSRNAERILSLG CGGGCTTGTTTATTGATGTCTCCCTACGAGACACGAATTGTGACAAATCC GCTGGATA TTACGCC VRTPSARPRQVVWS ACTCCGGTGGACAATTACCCGATCTATGAACCTGTTACCGATATTAGACA TAGGTAAA TCATGG EEETRTLREVEVVYSG AGAAAATAAAGAACTGACAACGCCTAGAGCTTCAGGCAGCATGTCTGTA TCCTGTAA GCATCT QKNINVLCAGHLPGK AGTATCCAGTCATCGAGCGTGACTGAGGGCGAAATTGATAATAACTCTG CCAAATCC ATCGGT TSKQVSDKRRDLHRI AAACTGAGGAATTGACGGATATATGTTTGGCTACGCTAGAGCTTCAGGC TCCAACCC AGCGTC RSSNVHGTPTTQSRG AGCACGTCTGTAAGCATCCAGTCATCGAGCATGACTGAGGGCGAAACTA GTGAAGG GACCCT DPVEQVEEYEELDWE ACGAAAGGGCCACGCCTAGAGCTTCAGACAGCTCGTCTGTAAGCATCCA AGAACACT GACGTT GMHPFPDPDSKFCSY GTCATCGTGCGTGACTGAGGGTGAATGTCTACCTCCTACAGACAACTGC AAAACCCA AAATTG LDQLRDQKGLTEPV AACCCGTCTGTAGAGAACCAGTTACCGTGCGTAACTGAGGGTAGGTTTG TATAGTGG GGTAAT WQEIEIVAQEWVEN AACGGGTAGGCTCACTGGTGACGGTGCGTCTGCCCTTCAGAAAGGTGG CCTCGCCA AAGAAA LAHVQSSWNHERTT CATGTGACTTGTGTTCTAAAGAGTTCTTGACATATTCGAAGTTTGCAGTC ACCACTAT TATCGA KQVPENNTPARRPFK CACCAGGCAAACTTCCACAATTCAGAAACTCAGGCATGCTGCACATATT ATGTCCAA (SEQID RRLHRVERYKRFQR GCGGTAAAAGTGATGGCAATCATCACTCTATAGCCTGTCACGTTCCGAA CGGCAGG NO: MYDLQRKRLAEEILD ATGTCCCTGGCGGCGAACTGTTACGTTTGCTGCGAACTTAAGCAATTTCT AGAAGCTA 1273) GREAVTCNLKKEEIK TGTGTGATCTTTGCAATGATAGTTTTAAGACCAAATCAGGGCTTTCGCAA TCTCCCGG DHYDQVYGVSNDRV CATAAGCGTCATAAGCATCCTTGTTCAAGGAATGCTGAACGCATCCTTTC ATGGGAA SLDDCPRPPGANNT TCTTGGAGTCAGGACGCCGTCGGCCCGCCCTCGCCAGGTAGTGTGGTCC GGAAAACC DLLKPFTPTEVMDSL GAAGAAGAAACACGAACCCTCCGGGAAGTGGAAGTAGTGTATTCGGGC CTAAACCG QGMKNGAPGPDKIT CAAAAGAACATTAATGTCCTCTGTGCGGGGCATCTACCTGGTAAGACTT TGATGGGA LPFLQKRLKNGIHVSL CCAAACAGGTCTCGGACAAGCGCCGAGACTTGCACAGGATACGGTCTTC ACTTACCG ANVFNLWQFSGRIPE TAACGTACATGGTACACCCACCACTCAGAGTCGTGGAGATCCTGTTGAA GCCCCATC CMKSNRSVLIPKGKS CAGGTCGAGGAGTACGAGGAGTTGGACTGGGAAGGAATGCATCCTTTT AGCTATTG NLRDVRNWRPITISSI CCCGACCCTGACTCTAAGTTTTGCTCGTACCTTGATCAGCTGAGAGATCA GGTACCCG VLRLYTRILARRLERA GAAGGGACTCACTGAACCGGTATGGCAGGAGATCGAAATCGTGGCACA GTAGGGA VQINPRQRGFVPQA AGAATGGGTAGAAAACCTTGCCCATGTTCAATCGTCTTGGAATCATGAG CTTGCAAC GCRDNIFLLQSAMRR AGAACAACCAAGCAGGTGCCAGAAAACAATACACCTGCACGAAGACCA CCTACCCT AKRKGTLALGLLDLSK TTTAAAAGGCGTCTCCATCGTGTGGAACGTTATAAGCGGTTTCAGAGAA GTATTTGC AFDTVGHKHLLTSLE TGTACGACCTCCAGCGAAAGCGCCTGGCTGAGGAAATACTAGACGGCC ATTTTATA RFAVHPHFVRIVEDM GGGAAGCCGTCACATGTAACCTCAAAAAGGAGGAGATCAAAGACCACT GGGAACC YSGCSTSFRVGSQST ATGATCAGGTCTACGGTGTGTCAAATGATAGAGTTTCTCTAGATGACTG GGTCGGCC RPIVLMRGVKQGDP CCCCAGGCCACCAGGGGCCAATAACACCGACCTCCTGAAACCGTTTACG CTATATCA MSPILFNIALDPLLRQ CCAACCGAAGTGATGGACTCACTTCAGGGTATGAAGAACGGGGCGCCT GAGTAGAC LEEESRGFMFREGQA GGCCCTGATAAGATTACCCTACCGTTCCTCCAAAAACGTCTTAAAAATGG CGTTTATT PVSSLAYADDMALLA CATCCATGTTTCCTTGGCAAATGTGTTTAACCTTTGGCAATTCTCGGGTC AAATATGG KDHASLQSMLGTVD GCATCCCCGAATGCATGAAGTCAAATAGGTCAGTCCTCATCCCGAAAGG GTGAAAAT KFCSGNGLGLNIAKS GAAGAGCAATCTGCGGGATGTCAGAAACTGGCGGCCAATCACAATCTC ATTAACAG AGLLIRGANKTFTVN CTCGATTGTGTTGCGGCTATACACCAGGATCTTGGCACGCCGTCTCGAG TAAAAGCT DCPSWLVNGETLPM CGGGCGGTGCAGATTAATCCCCGACAGCGAGGCTTCGTCCCTCAGGCTG ATGGTTTG IGPEQTYRYLGASICP GGTGTAGGGATAATATATTCCTGCTTCAGTCTGCTATGAGGAGGGCTAA GCGTCCGT WTGINSGPVKPTLEK GCGAAAGGGAACTCTGGCTCTGGGGCTTCTTGACTTGTCGAAGGCATTT GTGGTGCC WIANITESPLKPHQR GACACAGTTGGTCACAAACATCTTCTGACCAGCCTAGAAAGGTTCGCTG AGGGCGG VDILCKYALPRLFYQL TCCACCCGCATTTCGTCCGAATTGTGGAGGACATGTACAGTGGTTGTTC CGGCCAAA ELGTLNFKELKELDS GACGTCCTTTCGAGTAGGCAGCCAGTCTACTCGCCCCATCGTTCTGATGA CCCGAGCT MVKQAVKRWCHLP GAGGCGTCAAACAAGGGGACCCCATGTCTCCTATATTGTTCAACATCGC ACTTGGCA ACTADGLLYSRHRDG TCTGGACCCTCTTCTTCGTCAACTGGAAGAGGAAAGCCGAGGCTTTATG CCAACTGG GLAVVKLESLVPCLKI TTTAGGGAGGGGCAGGCCCCTGTCTCATCTCTAGCATATGCCGATGATA GGATGGTA KTNLRLVHSTDPVISS TGGCACTACTGGCTAAAGATCACGCCAGTCTTCAGTCGATGTTGGGCAC GCTTCCGA LAESDGLVGAIEGIAQ TGTGGATAAATTTTGTTCAGGGAACGGACTTGGCCTTAACATCGCCAAA GCGATTCC KAGLPIPTPDQRSGT AGTGCCGGACTTCTGATTAGGGGAGCGAATAAGACCTTCACTGTCAATG CTGGCGAC YHSNWRDMERRSW ACTGCCCTTCCTGGCTAGTAAATGGTGAAACGCTCCCGATGATCGGTCC GTGGGACC ERLALHGQGVELFKG CGAACAAACTTACCGTTATCTTGGGGCAAGCATCTGTCCGTGGACTGGG GATCGACG SRSANHWLPRPVGM ATAAACAGCGGGCCTGTTAAACCCACCCTGGAGAAATGGATAGCCAATA ATGGAGTC KPHHWVKCLAMRA TCACAGAGTCTCCCCTCAAGCCACATCAGAGGGTCGACATACTCTGTAA CAAACATC NVYPTKRGLSRGNLS GTACGCTTTACCCCGGCTGTTTTACCAACTTGAGCTGGGCACTCTGAATT CGGAATAG KNKDSAKCRGCTSM TCAAAGAACTGAAGGAACTAGACAGCATGGTCAAACAAGCTGTCAAAC AGGAATTG RETLCHLSGQCPKLKS GTTGGTGCCATCTACCTGCCTGTACGGCTGACGGCCTGCTATACTCCCGT AGAAATAC MRIRRHNKICEHLIAE CATCGTGATGGGGGTTTAGCTGTAGTAAAATTAGAGTCTCTTGTCCCTTG CTATTCCA ASFKGWKVLQEPTLV TCTAAAGATCAAGACAAATCTCAGACTAGTGCATTCGACCGACCCCGTC CCACCGGC TDNGERRRPDLIFHR ATATCATCTTTGGCGGAATCCGATGGTTTAGTGGGTGCCATCGAGGGTA TCACATAC DDKAVVVDVTVRYEI TTGCTCAAAAGGCTGGGCTTCCGATCCCTACGCCTGACCAGCGATCTGG CCAAGGTG SKDTLREAYASKVRR AACATATCATTCTAATTGGAGAGATATGGAAAGGAGAAGCTGGGAAAG AACCCGGT YGCLTEQIKDLTGATS GTTGGCCCTGCACGGGCAAGGTGTGGAGCTCTTCAAAGGCTCAAGATCT GCAACTAG VVFHGFPMGARGA GCCAACCACTGGTTGCCTAGGCCAGTTGGTATGAAGCCACACCACTGGG AGTACAAC WFPESSDVMADLNI TGAAGTGTCTGGCAATGAGAGCTAATGTATACCCTACAAAAAGAGGCCT CTATCTGT RSKYFEEFLCRRTILYT CAGTAGAGGGAATCTATCTAAGAACAAAGATTCCGCCAAGTGTCGGGG GGCGGTA LDLLWKSNNEQYLER ATGCACATCAATGAGGGAGACCCTATGTCATCTAAGTGGTCAATGCCCG GGTGCCGA LAP(SEQIDNO: AAATTGAAGTCGATGAGAATAAGGCGCCACAATAAGATCTGTGAGCAC ACCACTCA 1395) TTGATCGCCGAGGCCAGCTTTAAAGGCTGGAAGGTTCTGCAAGAGCCTA GGTGACG CCTTGGTTACAGACAATGGTGAACGTCGGCGACCTGATCTGATCTTCCA GGCTTGTT TCGTGATGATAAAGCGGTGGTTGTTGACGTGACGGTTCGCTACGAAATT TATTGATG TCGAAAGACACGTTGAGAGAAGCTTATGCTTCTAAAGTTCGAAGGTATG TCTCCCTA GATGTTTGACCGAACAAATTAAAGACCTTACAGGGGCTACCTCCGTTGT CGAGACAC TTTTCATGGATTTCCAATGGGTGCCCGCGGTGCCTGGTTTCCTGAAAGCT GAATTGTG CGGACGTGATGGCCGACCTGAACATTCGGTCAAAATATTTTGAAGAGTT ACAAATCC CTTGTGTAGACGCACCATCCTATATACACTGGACTTATTATGGAAATCGA ACTCCGGT ATAACGAACAATATTTAGAAAGGCTTGCACCATAAATTTTGTCTCTTTCC GGACAATT CCAATGATGTCTACTAGCACGCTGCCGAAGCTAGATAGATTGAGGAATC ACCCGATC TGCGTAATCTGTAATGATTACGCCTCATGGGCATCTATCGGTAGCGTCG TATGAACC ACCCTGACGTTAAATTGGGTAATAAGAAATATCGA(SEQIDNO:1027) TGTTACCG ATATTAGA CAAGAAAA TAAAGAAC TGACAACG CCTAGAGC TTCAGGCA GCATGTCT GTAAGTAT CCAGTCAT CGAGCGTG ACTGAGG GCGAAATT GATAATAA CTCTGAAA CTGA(SEQ IDNO: 1150) NeSL NeSL- Z82058 Caenorhabditis GCTCACTTTCTATCGTGTTAACCGTACGTTTACACTCCCAGTGAGTGTAA GCTCACTT CCTCCA MLRRKGRHRMVMV 1 elegans TAAAGGTTATTCGATAGAGGGTGTCTCCCTCTTTCTTGGGTAATTCTTCG TCTATCGT GGGCAC NSVKWQPSAHAEAI GCGGTCCGGGGTCTCTCCCTCGTCTTTTTTTTAAACTTTTCTTTCTCATCC GTTAACCG GCCGCA GTGKSWAPQRSQAS ACTCTTTTGCTCCTTTTTACTAACTCTTGTACTCTATAGTCTTTTCTCATCC TACGTTTA CGCCAA EHGWQSNAMFDPP CCCATCCGCCGTTGGGCAAAGTTTATTTACTTTGTTAAATCCATATTTTAT CACTCCCA AAGTCC NRILFARDSWSLNQS CTCTCTCACCCGTACAGAAAGCGTCTCCTTCTCAAACGCTTTTCTGTACTT GTGAGTGT TGGCAT THLQNQRSGSGLGIR TTTCTTATATTTTCATTAACATATTTTTCCTGTTTATACTAACCTAACCTCC AATAAAGG AACTCT PGQVRNNMVGGGP ATTGTCAATTACTAACTAACTTGTACAACGGATTTCGATGTTGCGCCGAA TTATTCGA GCAAAT HRAGDPKRRVELVSI AAGGACGTCACCGAATGGTTATGGTCAATTCTGTCAAATGGCAACCCAG TAGAGGGT AACATC QGSEVTVRTIYPSDEI TGCACATGCTGAAGCAATTGGAACAGGAAAGTCCTGGGCACCACAGCG GTCTCCCT AAACGT FSCYSKSCDIKTKAGY GTCCCAGGCATCCGAACACGGCTGGCAATCAAATGCAATGTTTGATCCC CTTTCTTG CAATCA GPEDLKHLTRHIKNE CCCAACAGGATTCTCTTCGCCAGAGACTCATGGTCGCTCAACCAATCAAC GGTAATTC ACTCCAC HGLKARWAYQCGLC GCATCTTCAAAATCAAAGGAGCGGATCAGGATTGGGTATAAGACCTGG TTCGGCGG AAACTCT NEKSDPSVSEGHKW TCAGGTAAGGAACAATATGGTGGGGGGTGGGCCTCACAGAGCAGGGG TCCGGGGT CCACTCT MEAHMVAVHQSSA ACCCAAAGCGTCGTGTCGAGCTGGTCAGCATACAAGGAAGCGAAGTGA CTCTCCCT CTTCAA EKRIKSYQKCTGARV CCGTCAGAACAATCTACCCGTCGGATGAAATATTCAGTTGTTACTCCAAA CGTCTTTTT GTCTTCT AEQLQAAAPSLTVPG TCATGTGATATCAAAACAAAAGCTGGCTATGGCCCTGAGGACCTAAAGC TTTAAACT CGGTGC KHKSGSRDAAKDSM ACCTGACTCGTCATATCAAGAACGAGCATGGTCTCAAAGCTCGCTGGGC TTTCTTTCT TTCCAAC TPTKDDDPKTRIYQT ATATCAATGTGGATTGTGCAATGAGAAGTCGGACCCAAGTGTATCGGA CATCCACT ACCACA RSVVKKSTQKTAEPT AGGCCACAAATGGATGGAGGCACACATGGTCGCCGTTCACCAAAGCTC CTTTTGCT ATGGTG DEGSRGPKYASIFQKS TGCGGAAAAAAGGATAAAGTCCTATCAGAAATGCACGGGTGCAAGAGT CCTTTTTAC AAAGCT VKARKSLALLCELSSP TGCAGAACAGCTACAAGCTGCTGCTCCATCGCTTACTGTGCCGGGGAAG TAACTCTT CCTTCAC KPMNPLPTNELTLKE CACAAATCAGGCTCTAGAGACGCTGCCAAAGATTCGATGACACCAACAA GTACTCTA CTTTTCC GNSRELAKEEAPSEGI AGGATGATGACCCGAAAACCAGGATCTATCAGACACGAAGCGTAGTTA TAGTCTTT CTCCAA DDIVIIDLDESEESPPR AAAAGTCGACTCAGAAAACAGCAGAGCCAACAGATGAAGGGTCTAGAG TCTCATCC AATTCTT RKRFNTWCLDHESSR GCCCAAAGTACGCATCCATTTTTCAGAAATCCGTCAAAGCAAGGAAGAG CCCATCCG CCCATGT EAWLDDTAIFWYISY CTTGGCGCTTCTCTGTGAATTAAGCAGCCCTAAGCCTATGAACCCCCTTC CCGTTGGG GGGGAA LCRGSTKYSALDPCL CTACAAATGAGCTAACTCTGAAAGAAGGGAATTCAAGAGAGCTCGCCA CAAAGTTT GTCCTG WSMYKVKGSRYILDR AAGAGGAAGCACCATCTGAAGGTATAGACGACATCGTCATCATCGATCT ATTTACTTT TTCTTGT LESSITYFFPICEEDH GGACGAATCGGAGGAGTCGCCACCCAGAAGGAAACGATTCAACACCTG GTTAAATC AAGCTC WTLLVLKDNSYYYAN GTGTCTGGATCATGAGTCAAGCCGTGAAGCATGGCTGGATGACACAGC CATATTTT TCCGGA SLHQEPRGPVRDFIN AATCTTCTGGTACATCTCCTATCTCTGCAGAGGAAGTACAAAGTACTCAG ATCTCTCT GGCTGC DSKRARKEFKVQVPL CTTTGGACCCATGCCTCTGGAGTATGTACAAAGTCAAAGGCTCAAGATA CACCCGTA AAGAGC QRDSFNCGVHICLM CATTCTTGACCGCTTGGAAAGCTCCATCACATATTTTTTCCCGATATGCG CAGAAAGC AGAAGA TNSIMAGGKWHSEE AGGAGGACCATTGGACACTGTTGGTATTGAAAGACAATTCATACTATTA GTCTCCTT AATTCTT DVRNFRKRLKKTLQE TGCAAACAGTCTGCACCAAGAGCCACGTGGCCCGGTCAGGGACTTCATC CTCAAACG CTTTCTG EGYELYSVNSLGIPFQ AACGACTCAAAACGGGCTCGGAAGGAGTTTAAGGTGCAAGTACCTCTTC CTTTTCTGT ACAAGG APTTEQMDYKETRCK AAAGAGACTCCTTTAACTGTGGAGTGCACATCTGTCTAATGACCAACTC ACTTTTTCT TCAGAA RSYASVLTQISPPAKR GATTATGGCAGGAGGCAAATGGCACTCTGAAGAAGACGTCAGAAACTT TATATTTTC GGAAGT PDCKPDNNIFVPTKD CAGAAAAAGACTGAAGAAGACACTCCAGGAAGAAGGCTATGAGCTTTA ATTAACAT CCTGTTC CAAEGNPQEKGRNE CTCGGTCAATAGTCTGGGTATACCATTCCAAGCCCCAACGACTGAGCAA ATTTTTCCT TTGAGG SPEEINTEHIVVAGKP ATGGACTACAAAGAAACAAGATGCAAAAGAAGCTATGCCAGTGTTCTTA GTTTATAC CGTCCAT ANNISPRCRSTSEML CTCAAATAAGCCCGCCGGCCAAAAGGCCGGACTGCAAACCTGACAACA TAACCTAA CCCGGG FEMVKATTSSGRSSL ACATATTCGTACCAACCAAGGATTGTGCTGCCGAAGGTAACCCGCAGGA CCTCCATT CGTCAT GTMTQDEFIRTSTIAE AAAAGGCCGAAATGAATCTCCTGAAGAGATCAATACGGAACATATCGTC GTCAATTA AGGAGA AVPLMSIKLPPMELP GTCGCAGGAAAACCTGCAAACAACATCAGTCCAAGGTGTCGGAGCACC CTAACTAA GATCAG RKILPPIPPRKPTQTN TCGGAAATGCTGTTTGAGATGGTGAAAGCCACAACCAGCAGTGGAAGA CTTGTACA ATGCAC GGQKGKQQRVPTGK AGCAGCTTGGGCACCATGACGCAGGATGAGTTCATCCGAACCAGCACA ACGGATTT CTTCTAG PDTLNAKVRNWENN ATCGCCGAGGCAGTTCCCCTAATGAGCATAAAACTCCCACCAATGGAGT CG(SEQID CAGGAG QLESYAMEGRSFQRL TGCCAAGGAAAATTCTGCCACCAATTCCCCCCAGAAAACCAACCCAAAC NO:1151) CTAGAA EWLTEVLTASIQKAA CAATGGAGGTCAAAAGGGAAAGCAACAGAGGGTGCCTACAGGAAAAC GGGCTG AGDEGIVDIICKRNPP CAGACACCCTAAATGCTAAAGTCCGGAACTGGTTCAACAACCAACTTGA CCCTGTC LEVAKGEMCTQTEN GTCGTATGCGATGGAGGGTCGCAGCTTCCAACGACTGGAATGGCTGAC TTGAGA KRKTTNNAARIADPI GGAAGTACTCACTGCGTCGATACAAAAAGCAGCAGCAGGTGATGAAGG TCCCCAC QSSKGAGDVKASYW AATAGTTGATATTATTTGCAAACGGAACCCGCCACTTGAAGTTGCGAAG GGGGGT KERARTYNRIIGSKEE GGTGAAATGTGCACCCAGACCGAAAACAAAAGGAAAACGACCAACAAT CAATAG LCKIPIDQLEDFFKKST GCAGCAAGAATTGCGGACCCAATCCAGAGCAGCAAGGGAGCTGGTGAT ACGGGA SRTNVQESIMKEKSS GTGAAGGCATCGTACTGGAAAGAAAGGGCTCGCACTTACAACAGGATT GGGGCT KIPALKIGNWMEKKF ATTGGTAGCAAGGAGGAACTCTGCAAAATTCCCATCGATCAACTGGAGG GCTGGC IGKEVAFALRKTKDTA ATTTCTTCAAGAAATCCACGTCCCGCACCAACGTGCAGGAGTCGATCAT TTTCTCT QGADGLRYHHLQWF GAAGGAGAAAAGCTCCAAAATTCCTGCTCTCAAGATAGGTAACTGGATG TTTTAAG DPSGELLAKVYNECQ GAGAAGAAGTTTATCGGAAAGGAGGTGGCGTTCGCTCTGCGGAAAACA AGGAAG RHRKIPKHWKEAETIL AAAGACACCGCGCAGGGTGCAGACGGACTGCGATACCACCACCTTCAA CACCAA LFKNGDQSKPENWR TGGTTTGATCCCAGTGGTGAGTTATTGGCGAAGGTATATAACGAGTGCC TCCGGA PISLMPVIYKLYSSLW AACGACACAGGAAGATCCCAAAACACTGGAAGGAGGCCGAGACCATCT GATCCTT NRRIRAVPNVLSKCQ TGCTGTTCAAAAATGGAGATCAGTCAAAACCAGAAAACTGGCGCCCAAT AGGGGT RGFQEREGCNESLAIL TAGCCTGATGCCTGTGATCTACAAACTTTACTCCAGTCTGTGGAACCGGA CAAAGG RTAIDVAKGKRRNLA GAATTAGAGCTGTACCAAATGTGTTGAGCAAATGTCAGCGAGGGTTCCA ATTAAA VAWLDLTNAFGSIPH GGAGCGCGAAGGTTGCAATGAGAGTCTAGCAATACTCAGAACAGCAAT AGGCAG ELIEYALTAYGFPQM CGACGTGGCCAAAGGAAAACGAAGAAACCTGGCGGTGGCATGGCTGG CAGGTC VVDVVKDMYQGAS ATCTGACGAACGCGTTTGGATCCATCCCGCACGAATTGATTGAGTACGC CAATTCT MRVKNATEKSDRIPI GCTGACAGCGTATGGATTTCCGCAAATGGTCGTCGATGTGGTCAAAGAT CCTCACT MSGVKQGDPISPTLF ATGTACCAGGGAGCATCAATGAGGGTTAAGAACGCGACGGAAAAAAGC GACTTC NICLETVIRRHLESAN GATCGAATCCCAATAATGTCTGGGGTGAAACAAGGCGATCCCATTTCAC GGTCAG GHQCLKTRIKVLAFA CAACACTTTTCAATATATGCCTGGAAACTGTGATTAGAAGACACCTGGA AGAGGA DDMAILTDSPDQLQ GTCTGCAAATGGTCACCAGTGCCTCAAAACAAGAATTAAGGTACTGGCG GTCCCG RELSKLDNDCTPLNLI TTCGCCGACGACATGGCGATTTTAACGGATTCCCCCGACCAGCTCCAGC CCTTGG FKPAKCASLVIQKGV GAGAACTGTCAAAGCTAGACAATGATTGCACGCCCCTGAATCTTATTTTC AGACCT VRSASIKLKGNAIRCL AAGCCAGCAAAATGTGCATCACTTGTGATCCAAAAAGGAGTTGTGCGG CCCCGG DENTTYKYLGVQTGS AGCGCATCAATTAAGCTTAAAGGAAACGCCATTCGATGCCTTGACGAGA GGAGGT AARISAMDLLEKVTK ACACCACTTACAAATATTTGGGAGTTCAGACGGGTTCGGCAGCAAGAAT TGCTGA ELECVVKSDLTPPQKL TTCAGCAATGGATCTACTGGAGAAAGTCACGAAGGAACTTGAATGCGT AGAGGC DCLKTFTLSKLTYMYG GGTCAAAAGTGACCTGACGCCGCCGCAAAAGCTGGACTGTCTTAAAACA GGAAGC NSIPLITEIKMFANIVI TTCACGCTGTCCAAACTGACATACATGTATGGAAATTCCATACCACTGAT TCCTTCT RGVKVMHRIPVRGS CACGGAGATAAAAATGTTTGCAAATATCGTCATTCGAGGAGTCAAAGTG AGCAAG PLEYIHLPVKDGGLG ATGCATAGAATCCCAGTCCGAGGGTCACCACTGGAGTACATCCATCTTC AGCTAG VACPKTTCMITFLVST CAGTGAAGGATGGAGGGCTTGGTGTAGCATGTCCCAAGACAACCTGCA AGGGAG LKKLWSDDEYIKTLFT TGATTACGTTCCTTGTCTCTACTCTTAAAAAACTCTGGTCAGATGATGAA TTCCCAG SLAEEVVKKESKKSTV TACATCAAAACATTATTCACATCACTGGCGGAAGAAGTAGTAAAGAAAG TCCTGA TMDDIADYLNVEERI AGTCAAAGAAGAGCACAGTCACTATGGATGATATAGCCGACTATCTCAA AACCCTT NRSEFGYNSITRLRDV CGTTGAGGAGAGGATCAATAGGAGCGAATTTGGGTACAATTCCATTAC GCGGTT MRNLAITGDSPLYRL GAGACTGCGGGATGTGATGAGGAACTTGGCCATCACTGGCGACTCCCC GATGAT KMVVKNGKIALLVQ ACTTTACAGGCTGAAAATGGTAGTAAAGAACGGGAAAATCGCTTTGCTC GGAATG ATSESMERIYTEEDAK GTCCAAGCCACAAGCGAAAGCATGGAAAGGATCTACACGGAAGAAGAT GAAGAG KLQRSLKDQVNKALK GCGAAAAAGCTGCAGCGCTCACTGAAGGATCAAGTGAACAAAGCACTC TACTTCG HRFNTTKVVKSKVVR AAACATCGATTCAACACCACCAAAGTAGTGAAAAGCAAAGTCGTCCGAG GTACTG VVQQHPASNRFVTK TCGTGCAACAGCACCCAGCAAGCAACAGGTTTGTCACAAAAGGTGGCA CTCGTT GGNLSLACHRFVHKA ACCTGAGCCTTGCATGTCACCGCTTTGTGCATAAAGCACGTCTGAATCTA GCTCTCT RLNLLACNYNNYDKS CTGGCCTGCAACTACAACAACTACGACAAATCCAAATCAAAAGTCTGTA CTGCGT KSKVCRRCGKDLETQ GGCGTTGTGGGAAGGATCTGGAGACGCAGTGGCACATACTGCAAAACT TTTACTG WHILQNCPFGFSKKI GTCCGTTTGGTTTCTCAAAGAAGATCACTGAGAGGCATGATGCCGTCTT CCGAGG TERHDAVLHKVKTLIE GCACAAGGTCAAAACTCTCATTGAAAGCGGTGGAAAAAAGAATTGGAC GCCGGA SGGKKNWTMKIDEE AATGAAGATTGATGAAGAACTTCCAGGATTCAGCAGACTCCGTCCAGAT TTTGCTC LPGFSRLRPDICLKSP ATCTGCCTCAAAAGCCCTGATGAAAAACAAATCATCTTGGCAGATGTCG GAATCG DEKQIILADVACPYEH CATGCCCATATGAGCATGGAGTAGAAGCGATGGAAAGGAGCTGGCAG CGAAAG GVEAMERSWQAKID GCAAAAATCGACAAATACGAGACGGGATTCGCCCACCTGCGGAAATCG GTCTCA KYETGFAHLRKSGTKL GGAACCAAGCTGACCGTCCTTCCGATTATAATCGGGTCACTTGGATCAT ATCGAC TVLPIIIGSLGSWWKP GGTGGAAACCGACAGGTGACAGTCTCAAGGAATTGGGAATCAAGGGA CATTCAA TGDSLKELGIKGSVIN AGCGTGATCAACAGTGCCATTCCAGAACTCTGTGCTACTGTTCTCGAACA GATGAC SAIPELCATVLEHSKN CAGTAAGAATACGTACTGGAATCACATCTTCGGTGAAGCGTACATACCA GGCTTA TYWNHIFGEAYIPNP AATCCAATGCGAAACGGACACGCAAAACCTGCTGGAAATGGATGGAAA TCTAAG MRNGHAKPAGNGW AAGGAAAGATTGCAGAAGGCCCCTGTGAGGCCTACCAACTAGCCTCCA GTCCGA KKERLQKAPVRPTN GGGCACGCCGCACGCCAAAAGTCCTGGCATAACTCTGCAAATAACATCA AAGCAG (SEQIDNO:1396) AACGTCAATCAACTCCACAAACTCTCCACTCTCTTCAAGTCTTCTCGGTGC TTGGGA TTCCAACACCACAATGGTGAAAGCTCCTTCACCTTTTCCCTCCAAAATTCT GAGTAA TCCCATGTGGGGAAGTCCTGTTCTTGTAAGCTCTCCGGAGGCTGCAAGA CGTGTT GCAGAAGAAATTCTTCTTTCTGACAAGGTCAGAAGGAAGTCCTGTTCTT CTCCTAC GAGGCGTCCATCCCGGGCGTCATAGGAGAGATCAGATGCACCTTCTAG CTTTCAA CAGGAGCTAGAAGGGCTGCCCTGTCTTGAGATCCCCACGGGGGTCAAT GTTGAA AGACGGGAGGGGCTGCTGGCTTTCTCTTTTTAAGAGGAAGCACCAATCC TGGTCG GGAGATCCTTAGGGGTCAAAGGATTAAAAGGCAGCAGGTCCAATTCTC TTTTACT CTCACTGACTTCGGTCAGAGAGGAGTCCCGCCTTGGAGACCTCCCCGGG GTTTGG GAGGTTGCTGAAGAGGCGGAAGCTCCTTCTAGCAAGAGCTAGAGGGA GATAGC GTTCCCAGTCCTGAAACCCTTGCGGTTGATGATGGAATGGAAGAGTACT TGACTT TCGGTACTGCTCGTTGCTCTCTCTGCGTTTTACTGCCGAGGGCCGGATTT GATGCT GCTCGAATCGCGAAAGGTCTCAATCGACCATTCAAGATGACGGCTTATC AGTACG TAAGGTCCGAAAGCAGTTGGGAGAGTAACGTGTTCTCCTACCTTTCAAG CTTCATC TTGAATGGTCGTTTTACTGTTTGGGATAGCTGACTTGATGCTAGTACGCT TGTGGA TCATCTGTGGATGACGCTCCCCAAGCAGTCAAGTAGACTTGAAAGGTGC TGACGC CCTCGCCCTAGTTAGCTCTTAGACCTTATGGGTCGCCATGGTTGTGGACG TCCCCAA GGTATGCTTGCCGGAGCCGAGTCGTGTTTCTTAGAACCAACCTCGACGA GCAGTC GGCGAAAGCTTGCACAAGTTAGCACAATTGTGGTAGGGCCGACTAGAA AAGTAG AATGAGTCCCTTAGGGGGTTACGCCTTGGCGAAAGTGAGGACAATTGG ACTTGA CATTGACGGGTGCTTCGGCACTAGGCAAAGGCGCCACCACACTGTCCAA AAGGTG TCTCTAAAAAGTTCACATTCATCGAAGAACTACCGGAACCAACCACACAT CCCTCGC GTGTTGAAACCTACACGGTGGAAGGGAAAGGAAAGCTTCGCTGGAACG CCTAGTT AAAAGAACGGATAGGTTCCCCTTCTTGATGGCTGTGAGGCTTAGGATGG AGCTCTT ACGGGAAGGCCGTGAGGCCTCAGGGGGTAACTCGGCCAGACGCTAGT AGACCT TGATCTTCGGATCACGACAGCCCTGGCTAAGAGGAACCCTGGATGGAG TATGGG TGTGAAGGATGGGCGGGTAGGGGGTTAAGCCTGTTGACAGACCACCGA TCGCCAT CTGCAGTCACAAAATCAGTGATTATGCGGGTGGACCAATCTGTTGGCGG GGTTGT GTGTTTCCCTCTACCTGACCCCGCAATATGGTATGTACGATCCTCGGATC GGACGG TAAAATTCATAATGGCCCACCACAACCATAAACCTCCCTAGCAGCTGGTG GTATGC GTCCCGATAATTCGGGTTCTTGCCACTACTGCGACCCAGGCTCGCC(SEQ TTGCCG IDNO:1028) GAGCCG AGTCGT GTTTCTT AGAACC AACCTC GACGAG GCGAAA GCTTGC ACAAGT TAGCAC AATTGT GGTAGG GCCGAC TAGAAA ATGAGT CCCTTAG GGGGTT ACGCCTT GGCGAA AGTGAG GACAAT TGGCAT TGACGG GTGCTT CGGCAC TAGGCA AAGGCG CCACCA CACTGTC CAATCTC TAAAAA GTTCAC ATTCATC GAAGAA CTACCG GAACCA ACCACA CATGTG TTGAAA CCTACAC GGTGGA AGGGAA AGGAAA GCTTCG CTGGAA CGAAAA GAACGG ATAGGT TCCCCTT CTTGAT GGCTGT GAGGCT TAGGAT GGACGG GAAGGC CGTGAG GCCTCA GGCGGG TAACTC GGCCAG ACGCTA GTTGAT CTTCGG ATCACG ACAGCC CTGGCT AAGAGG AACCCT GGATGG AGTGTG AAGGAT GGGCGG GTAGGG GGTTAA GCCTGT TGACAG ACCACC GACTGC AGTCAC AAAATC AGTGAT TATGCG GGTGGA CCAATCT GTTGGC GGGTGT TTCCCTC TACCTG ACCCCG CAATAT GGTATG TACGAT CCTCGG ATCTAA AATTCAT AATGGC CCACCA CAACCA TAAACCT CCCTAG CAGCTG GTGGTC CCGATA ATTCGG GTTCTTG CCACTAC TGCGAC CCAGGC TCGCC (SEQID NO:1274) CRE Cnl1 Cryptococcus CCCTCTTAATACCCCATAACACATAACAACCCCCTAATCAACGTTCTCTGC CCCTCTTA TGAGGA MSLQRAKNARGDPG neoformans ACCTTAAACACCACCAACATGTCCCTGCAGAGGGCCAAAAACGCCCGTG ATACCCCA AGAGGA RCNLCSADYRDLKDH GAGATCCTGGTCGGTGCAACCTATGCTCTGCCGACTATAGGGACCTCAA TAACACAT GGTTGG LNKQHSTHFFVPSDL AGATCATCTCAATAAACAACATTCCACCCATTTCTTCGTCCCCTCCGACCT AACAACCC ATTATTT RGSSLVACPRCGTPC CCGTGGCTCTTCCCTAGTCGCTTGCCCTCGCTGCGGCACCCCCTGCTCAG CCTAATCA TTTCTTT SAGTGLSRHQSRYCG CTGGCACTGGTTTATCTCGTCACCAGAGCCGGTATTGCGGTCTCACCGCT ACGTTCTC TCTTTAA LTAPRIRRNRVGNST CCTCGAATCCGCCGAAATCGCGTGGGAAACTCAACAAACACATCTCGCT TGCACCTT TAAGTT NTSRCPPSNTAASPIV GCCCTCCCTCCAATACTGCAGCTTCACCCATCGTTCCTTCGCCTTCCCCAG AAACACCA GTTTATT PSPSPERPSPPQPAE AACGCCCAAGCCCCCCTCAGCCTGCTGAAGTTGTTGCCAGTCTCGAACC CCAAC TAAGTA VVASLEPLSEAEEVLE ATTGTCTGAAGCCGAGGAGGTGCTGGAGGTCGCCCAGGTTGATGCCGA (SEQID GTTTCTT VAQVDAETVDTLEGT GACTGTTGACACGCTGGAAGGGACCCGGAGAGCTCCGGAATCCGTTCC NO:1152) TCATTCG RRAPESVPRSAEEGS GAGATCTGCCGAGGAAGGTAGCACGCGAGTTAGGGAGCTAAACATGAC GGCAAC TRVRELNMTAPEEEH AGCGCCGGAGGAGGAGCATCGTGGGGAGGAGGAGAGTAGTCATACCA CCACAC RGEEESSHTNPTAPA ACCCAACTGCCCCAGCAGGGCTCGAGAACGCGGTGAGCTCAACGCTGG GACAAC GLENAVSSTLGPSPG GGCCTTCCCCTGGGACGTTGCCTTCCTTACTTCCGTCCCAAGAGTGTGCT CCAATA TLPSLLPSQECANERF AACGAAAGATTCCTGTACCTTGCGCACCTGCCTGTTCGGAGCAAGCCTC AATTAA LYLAHLPVRSKPLPN TGCCCAACAACCTAGTTACCGACTTCATGGACGCCGCTGAGCGTTGTGC ACAACG NLVTDFMDAAERCA TCTTGCCTACATTGCACAACCCTCGGACTCTACACTGCTGGCATTTCTCG AAAAAT LAYIAQPSDSTLLAFL CCCTTCCAAAGGTCGGCCTCACCCAGGCGCTCGCTCCAGAACAGCCCCT GCAACC ALPKVGLTQALAPEQ CAGGCCGTCAACCTTCCTTAAGCAGTTCCCGCATATCCCCTGGCCAGAAC TCTATAA PLRPSTFLKQFPHIPW AGCCACCCGCTCGTCGTCCTCCCAGCAATATTCGTCCAGACACCACCAAA CCC(SEQ PEQPPARRPPSNIRP CAAGTCATCAAACTCGTTGAGAATGGGCGCCTAGGTGCGGCAGAGAGG IDNO: DTTKQVIKLVENGRL GTGTTGGAGGAGGATGCTTCAGTAGCCGAACTCGATCAAGGGGTCATC 1275) GAAERVLEEDASVAE GACCAGCTCATCACCAAGCACCCCAAAGGGCCGTCTTGTCCATTCGGCA LDQGVIDQLITKHPK ATGCAGTGGGTCCAACTCCTGGTAAAGCTCCCGACATCGACACCATCCA GPSCPFGNAVGPTP AAAGGCCCTCGACTCCTTCAAGCCCGACACAGCACCCGGCGTTAGTGGC GKAPDIDTIQKALDSF TGGTCAGTCCCTCTCTTGAAGACGGCTGCCAAGAGGGAGCCGGTCAAG KPDTAPGVSGWSVP CAGTTTCTCCAACTCCTCTGCGCCGCCATCGCCAACAACACCGCCCCTGG LLKTAAKREPVKQFL TCGCTCTATGCTCCGCACTTCTCGTCTCATCCCCTTGAAGAAGGACGATG QLLCAAIANNTAPGR GCTCTATCCGACCTATCGCTGTTGGTGAACTTATCTATCGGCTGTGTGCG SMLRTSRLIPLKKDDG AAAGCTCTCATCATCTCGCATTTCCAACCCGACTTCCTCCTCCCGTTCCAG SIRPIAVGELIYRLCAK CTCGGGGTCAAGTCAATCGGTGGTGTAGAGCCGATCGTGAGGCTGACA ALIISHFQPDFLLPFQL GAGAGAGTCTTGGAGGGTTCTGCCGGCGCTGAGTTCTCCTTTTTAGCCT GVKSIGGVEPIVRLTE CGCTCGATGCTTCTAACGCTTTCAACCGTGTAGATAGGGCCGAGATGGC RVLEGSAGAEFSFLAS AGCAGCGGTCAAGACCCATGCGCCGACGCTTTGGAGGACATGCAAATG LDASNAFNRVDRAE GGCCTATGGCGACTCGTCCGACCTTGTGTGTGGTGACAAAATCCTTCAA MAAAVKTHAPTLWR TCCTCTCAAGGTGTTCGACAGGGTGACCCCTTTGGCCCTCTCTTCTTCTC TCKWAYGDSSDLVC GATCACCCTCCGACCAACCTTGAATGCCCTCAGTCAATCGCTAGGTCCGT GDKILQSSQGVRQG CTACGCAAGCACTCGCTTACCTCGATGACATCTACCTCTTCTCAAACGAC DPFGPLFFSITLRPTL TCGCAAGTCCTCAGCAAAACTACCCAATTCCTCGCCGACAAGCAGCACA NALSQSLGPSTQALA TCATCAAGCTCAATGAAAAGAAATGCAAGTTAATCAGCTTCGATGAGAT YLDDIYLFSNDSQVLS CAGGCAGGAGGGCTTCAAGATGCTAGGGACGATGGTAGGAGGTAAGG KTTQFLADKQHIIKLN AGAAGCGAGCGGAGTTTCTGGAAGGCAGGATTCGGAAGGAAATGGCA EKKCKLISFDEIRQEG AAGGTGGGCAAGCTCAAGGATCTTCCACATCAACACGCGCTCCTTCTAT FKMLGTMVGGKEKR TACGTTTCTGCATTCAGCAAAATCTACGACACCTGCAGAGAAGTCTGCG AEFLEGRIRKEMAKV CTCGGACGACCTTGTAGACCTATGGGAGAGGCTGGACACGATGCTATG GKLKDLPHQHALLLL GGAGGAGGTGAAAAGGATGAGGATGAGGCAGCGGGAGGATACGGCG RFCIQQNLRHLQRSL GAAGAGGAGGCTCTAGGGAGATCGTTGACGAAGCTACCAGCGCGACTG RSDDLVDLWERLDT GGCGGACTAGGTCTACTTTCCTTCAAAGATGTAGCCCCCCTTGCTTACCG MLWEEVKRMRMRQ CTCGGCAGCCGAGGCCTCCGACACTCTCCTCGATAACCTAGGTCTCCTTT REDTAEEEALGRSLTK CTTCGCCAGAGGAACCTCCAACTCCGATCCCCCAACGAACTCGATGCGC LPARLGGLGLLSFKDV AGAACTCTGGGAATCGCAACAGGAAGCCATCCTACATAACCTCGGCGAC APLAYRSAAEASDTLL ACTGAACGCAAGCGACTCACCGAGAATGCCTCCAGACTCGGCCGAAGTT DNLGLLSSPEEPPTPI GGTTATCAGTTATCCCTTACCTTCAACCCCTGCGCCTTTCCAATGTCGAG PQRTRCAELWESQQ ATTGCCTCTGGTCTCCATGACCGCACCCTGGTCGGCTCCTCGATCCCTGT EAILHNLGDTERKRLT CTGTCGCTTCTGTGGGTCGGACTCACCTTTGGGTCACGACGAGCTTTGCC ENASRLGRSWLSVIP GCGCCCGCAACCCCTGGACCCAGCGCCGGCACAATGCCATCAACCGCGT YLQPLRLSNVEIASGL CATTTATCAACACCTCAAACAAATTCAAGGTGCCACGGTTGAGATTGAG HDRTLVGSSIPVCRFC CCCCACACGCTGTCGGGACAAAGGAGAAACGACCTTCGGGTCAGAGGT GSDSPLGHDELCRAR TCCAGCGCTCTGGCCTTCACTGACTACGACCTGAAGGTTTACTCCCTCGG NPWTQRRHNAINRV GGACCGAGACGCGAGAAGCACCGTCACACCCTGCGCCCCCAACGGCAA IYQHLKQIQGATVEIE GCTGGCCGACTTCTGCTTGGACCGGTGCGTGAACTGGCTCGACAAGGT PHTLSGQRRNDLRVR GGGTCAGGTCGTCTCTAAGAACGCTCCGAAGGTCACTGGTGGGGTCTTT GSSALAFTDYDLKVYS AAACCAATCATCCTTTCCACTGGTGGCTTGATGAGCAGGAGCACAGCAG LGDRDARSTVTPCAP ACGAATGGAAGGACTGGAGGGACGCGATGCCGGTGGGGGGGTTCGAG NGKLADFCLDRCVN AAAATGGAGAAACGGATTGGTGTCGAGTTAGTAAAGGCAAGGGCGAG WLDKVGQVVSKNAP GACGCTGGTCTTATGAGGAAGAGGAGGTTGGATTATTTTTTCTTTTCTTT KVTGGVFKPIILSTGG AATAAGTTGTTTATTTAAGTAGTTTCTTTCATTCGGGCAACCCACACGAC LMSRSTADEWKDW AACCCAATAAATTAAACAACGAAAAATGCAACCTCTATAACCC(SEQID RDAMPVGGFEKMEK NO:1029) RIGVELVKARARTLVL (SEQIDNO:1397) CRE CRE- Chondrus ACGCCCCCTATCCATTTCTGCCAGCCTCCCATCGGCTCGCCGTCTCCGCA ACGCCCCC TAAGTC MSQPNISSAETPLSQ 12_CCri crispus ACCCCTCTTCCTCGGCTGTACCAGTTCCGCTCCCACAACCTCCCTCGCCAC TATCCATT CTTGAC LPTPVPTPPSPSNPSL AATGAGTCAACCTAATATTTCGTCCGCTGAGACCCCCTTGTCTCAGCTGC TCTGCCAG GCCTGC SLPTVRDLLLCPIRSSH CCACGCCTGTTCCCACCCCGCCTTCTCCCTCCAATCCCTCTCTCTCTCTCCC CCTCCCAT CCCGTG VYSSIPSSCLHSFTML TACTGTGCGTGACCTCCTCCTCTGCCCCATACGCTCCTCCCATGTTTACTC CGGCTCGC ATACAG LIKTVRAASATMTPT ATCCATCCCTTCCTCGTGCTTACACAGTTTCACGATGCTCCTCATCAAGAC CGTCTCCG CATCGG ESHRAFIHLHILPIAVL TGTCCGCGCTGCGTCGGCCACAATGACCCCAACTGAATCMCATCGCGCA CAACCCCT TACCCCT RRSFRGETGWRSRT TTCATTCATCTACACATTCTTCCCATCGCTGTCTTGCGACGCTCGTTCCGT CTTCCTCG AGCATTT GQHHALRQRIRRASS GGAGAAACCGGATGGCGTTCCCGCACAGGGCAACATCATGCCCTCCGC GCTGTACC GAATAA GRHWAALWHEALA CAACGGATTCGCCGCGCCTCCTCGGGTCGGCATTGGGCTGCCTTGTGGC AGTTCCGC AAAA AHQVDLDYRTRHSR ACGAAGCCCTTGCTGCACATCAGGTCGACCTTGACTACCGCACGCGTCA TCCCACAA (SEQID RYQASATSRHRIGRA CAGCCGTCGTTACCAAGCCTCCGCTACATCGCGCCACCGCATCGGCCGT CCTCCCTC NO: MRLAADAQYGRAM GCCATGCGTCTGGCCGCCGATGCCCAATATGGACGTGCAATGTCGGCCC GCCACA 1276) SALKAKPLPDLHAAA TCAAGGCCAAACCGCTGCCCGATCTACATGCTGCCGCCACCCGCGACAC (SEQID TRDTLTALHPPPASP ACTCACCGCGCTTCACCCTCCTCCTGCCAGCCCGGTTCAGCCTCTCTCACC NO:1153) VQPLSPTDLPPVPEIT GACTGACCTCCCCCCGGTCCCTGAAATTACGGAAGGTCAAGTCCTCCGA EGQVLRAARALNPTS GCGGCACGCGCCCTTAATCCCACATCCGCTGCAGGACCGGACCATCTCT AAGPDHLSPRILQLLA CTCCCCGTATCCTGCAGCTCTTAGCCCGCACCACTATCAGCCCGGAAGCT RTTISPEAGVTGLSAL GGGGTTACGGGGTTGTCCGCATTGACGAACCTGGTTCGACGTCTCGCCC TNLVRRLARGDIPDR GAGGTGACATTCCGGATAGAACTGCGCCTCTCCTTGCTGCTGCCACTCT TAPLLAAATLIPLQPR GATCCCCCTCCAACCCCGCCCTCACAAAATACGGCCGATTGCTGTAGGG PHKIRPIAVGQALRRL CAGGCTTTGCGCCGTCTGGTCACGAAGGTCCTTCTGCCCCCCGCCATCCA VTKVLLPPAIQDTRD GGACACCCGCGATCACCTTCTCCCAGAACAGCTCGCCAACTCGGTTGCC HLLPEQLANSVASG TCGGGCATGGACGCAATCGTCCATGACACGCGCATGCTTATGCATCGTC MDAIVHDTRMLMH ACGGTCGAAACCCAGACTACATCATGGTCTCCGTAGACGCGCGGAATGC RHGRNPDYIMVSVD CTTTAACACCTTCTCACGTCAGTCCCTGCTGGATCGTCTCCCTCTGCAGAC ARNAFNTFSRQSLLD TCCTTCCCTCGCCCGTTTTCTCAATCTAATCTATGGCCGCACCGTTCCTGA RLPLQTPSLARFLNLIY TCTCGTGCTGCCCTCTTCTCCGCGGTTTCTGATGAAAAGTCAGGAGGGC GRTVPDLVLPSSPRFL ACCCAACAGGGGGACCCGGCAAGTATGCTCTTATTTTCGCTGGCAATCC MKSQEGTQQGDPAS AGCCGCTCCTGCGTCGTCTCACCCGCGAGTGCCGTCTCGACCTGAACCG MLLFSLAIQPLLRRLT CTGGTACGCGGATGACGGCACTCTGGTCGGGCCAATCTCGGAGGTCAT RECRLDLNRWYADD CAAGGCACTCCGAATTCTTCGTGATGACGGCCCGCAGTCCGGATTTCAC GTLVGPISEVIKALRIL GTCAATATCAACAAGTGCCGGGCATACTGGCCGACCGTAATGCCAGAAA RDDGPQSGFHVNIN AGTTGTCCGAATTGCTCCGTATCTTCCCCCTTCACGTCGAGTGCGGCGAA KCRAYWPTVMPEKL GGCGGTGTCGCCTTGCTGGGTGCCCCGCTCGGCACAGATGCCTTTGTGC SELLRIFPLHVECGEG GCCGACATCTCATGAACAAGGTTCAATCATGCCATGCCTCCCTCAGCCTC GVALLGAPLGTDAFV CTTGATGAAATTCCCGACGCGCGTACGCGGTTTCACCTTCACCGTGTAAC RRHLMNKVQSCHAS AGGCTCGGTATGCAAAGTCGAGCATGTTTTTCGCCTTACACCTCCCCACC LSLLDEIPDARTRFHL TCTCCCTCCCAGCAGCTACAAAATTTGATGAACAACAAATCGCTGCTTAT HRVTGSVCKVEHVFR TCTCGGTTGAATGATGTGGCCGTTTCCACATCCATGGCTACACAAATAG LTPPHLSLPAATKFDE GCCTTCCGTTTCGCCTCGGTGGACACGGCTTCACCCCACTGTCACCATTC QQIAAYSRLNDVAVS ATCCATGCGTCCTACGCTGCCAGTTTAATTGAGGCGGCACCTGTTCGTGT TSMATQIGLPFRLGG GAAGGGCCCACATAACCCCTCCGAGTCGTTTTATCGCCGCATGGCCCGT HGFTPLSPFIHASYAA CGTCATATCGTCCACGTACTAGGGGCCTTGAACCCTGAGGTCCGCACCC SLIEAAPVRVKGPHN GAGGCATTCTTGGGACCCATTCTCCCCTCGGACCATTTGAACCAGAGGC PSESFYRRMARRHIV CCTTTTGTCTAGACCTGAACGCGTACACCACACATTAATTCAGGCTATGC HVLGALNPEVRTRGI AGGGGGCCACTTCTCGACTCTACTGGGAACACACCGCGTGGGACCTTGA LGTHSPLGPFEPEALL CCCTCTCCCTCGCAACCACAGTGCCGCCTCTGTCCGCCGACGTGCCCGGT SRPERVHHTLIQAM ACAATTCCCTCCGTGCTCCGGGCGCCGCGTCGTTTCTATGCAGCCACCCC QGATSRLYWEHTAW TCACTCACTTCTCGAGTCCCTTCTGCGGTGTGGTCCTGTATGCTACGCCG DLDPLPRNHSAASVR GCATCTGGATACACCCGTCTACTGTGACTCTATTCGGCCTCTCATATGTT RRARYNSLRAPGAAS CGCATTGCTGTAAGCCAATGGACGCTCGCGGTGATCATGCTGCAATATG FLCSHPSLTSRVPSAV CCGTCATGGCTTCGGCGTCGTTCACCGTCATAACACTGTACGCAACCTAC WSCMLRRHLDTPVY TCGCCCGTCACGCGTTCCGCGCCGCCGGTCTCTGCTGCGACCTTGAGGT CDSIRPLICSHCCKPM CCCTTCTCTCTTGCCGAATACCGCGAACCGCCCCGCCGATATTCTCGTCC DARGDHAAICRHGF AGCCCGCCCCGCCTCCTTCGGGCGCTCTCCCGGACCGCCCCACTGCGTA GVVHRHNTVRNLLA CGACGTAACCGTTCGTTCCCCCTACTGTCGCTCTACAATGTCTCTCGCTG RHAFRAAGLCCDLEV CGAAAGGCCTCGCGGGTGCAGCGGAAGCTGCTGATTTGGACAAGCTTC PSLLPNTANRPADILV GCGTCCATTCCCGTACAGTGCGTGACGCATTTCACCTCCAGCCTGACTCC QPAPPPSGALPDRPT CCACTCCCTCTACTCGACTGGCACTTTGTCCCGCTCGCATTTGATACTCTC AYDVTVRSPYCRSTM GGCGCGACCAGCTCTCGCACGATGGCAGTCCTTGAGTACCTCGCTCACC SLAAKGLAGAAEAAD GCATTGCCAACCGGACATATTCATCTTACGGGACCGCCAAGATACGTCT LDKLRVHSRTVRDAF ACTACAACGCATCAGTTTCGCTGTTTGGTCCAGTTTGGCCTCTGCCACCC HLQPDSPLPLLDWHF TTTCCCGTATGCCCTATCACGGCGCGGCCCTATCGAGCCCCGCCCAAGT VPLAFDTLGATSSRT GTAAGTCCTTGACGCCTGCCCCGTGATACAGCATCGGTACCCCTAGCATT MAVLEYLAHRIANRT TGAATAAAAAA(SEQIDNO:1030) YSSYGTAKIRLLQRISF AVWSSLASATLSRMP YHGAALSSPAQV (SEQIDNO:1398) CRE CRE- Chondrus CNCCAGCCAMCGATCCCGCCGCCACTCGCMGCCCGGCCGTCTCGACCG CNCCAGCC TAATTCA MAXXPXISPPGAPPA 13_CCri crispus CCACCTCCCCGASGCCCCAGCCCATCATGGCCTSTWMGCCCCWGATWT AMCGATC CCTTCAT PLRYRMLQCPPPLPK CTCCCCCGGGGGCCCCCCCTGCTCCGCTGCGGTACCGGATGTTACAATG CCGCCGCC ATCTGCT XXXXPVPHPMSSPIR TCCACCSCCGCTACCSAAGCSMCASTMGTTKCCGGTCCCCCACCCGATGT ACTCGCM AGTGTC XRLPHRXMRGPPSXT CGTCACCCATACGCCKCCGCCTCCCMCACCGGMCGATGAGGGGACCCC GCCCGGCC TCTGTAA PPRDMHRPHGTPGP CTTCCCSAACCCCTCCCCGAGACATGCACAGACCSCATGGCACCCCCGG GTCTCGAC GCGCAC HSHRXCGRPPXHCTH WCCGCACTCTCACCGATKCTGTGGCCGCCCTCCCMACCACTGCACCCAT CGCCACCT CCCTCAT ASXQPRXAXHXLQXP GCCTCCSCACAACCSCGSMSAGCCCMGCACSCTCTCCAAWGGCCGAAA CCCCGASG GCATTG KLRSPPPHPHVSPLIL CTSCGCAGCCCACCTCCACATCCGCATGTTTCACCCCTCATCCTCTGCAST CCCCAGCC ATAAAA CXGPLPTPMTQPRM GGCCCCCTCCCAACGCCCATGACCCAACCACGCATGAAACGAGCGCTCT CATC(SEQ TTACCCC KRALSXSAKAPPTKRP CCASAAGCGCCAAGGCGCCCCCTACCAAGCGCCCCTCTGCCTCTCAGGG IDNO: CCA SASQGPAASSHDXPR CCCAGCCGCGTCTTCCCATGACKAATGACCSCGGACGCCCCCACCTCMA 1154) (SEQID TPPPXPPRPPPYRFPP CCCCCGCGCCCGCCTCCCTACAGGTTCCCTCCCCCWACTCTCGACCAGCA NO: PTLDQHXFALSXAYP CTTMTTCGCCCTCTCWTMAGCCTACCCCCACCCGMCTCCCMGGCGCCC 1277) HPXPRRPPSPXRXLR ACCCTCMCCTTKCCGTCSGTTGAGGCACTCCTTTCCTCCCCGATTCGGCC HSFPPRFGXQTFSSIP STCAGACATTTTCTTCCATTCCGGGACCGCGCCTTCATAGCACTGTATTAC GPRLHSTVLLLIRLVR TTCTCATCCGCCTCGTCCGCGCCGCTACAGCCGCCAACACTCCCGAAACC AATAANTPETTTLXS ACCACACTGMATTCTTGCACCTTCACCTGCTCCCGACTGCCSTTCTTCGA CTFTCSRLPFFERPSX GAGGCCTTCCGKGGCGAGSCTGGCTGGAGGTCCTCGCGCGGTCAACTTC AXLAGGPRAVNFML ATGCTCTCCGCTTGCKGATACGGAGAGCGTGTACKGGACGAGAGTGGG SACXYGERVXDESGX GWCTCTTATGKAAWGAAGCMCTAGATGCCCACAGCKCCAGGACAGAA SYXXKXHCITSHPPRP TGGCAGCACACGCATGCCCGGCGCCCCTCGCCACCCGTTTCCCCATCGG RRYSRQHSRNHHTXF CACGTGCCGCCCGCGCTATGCGCCTTGCCTCCCAAGCTCAATACGGCCG LHLHLLPTAXLREAFR CGCCATGCGCACATTTACCAACCCCCCTCTAGCTGACCTCAACGACCCGG GEXGWRSSRGQLHA CCACGATGGAGCGGCTCCAAGCCCTTCACCCCACTCCTACCGTGCCCGTC LRLXIRRACTGREWG GTGCCCCTGCCACCCTCCGCACAGCCTCGACCACCCGAAGTCACCGMGG LLXXEALDAHSXRTE AGGCGGTCWTGCGTGCGGTTCGTCGCCTCAATCCGAACTCGGCGGCCG WQHTHARRPSPPVS GCCCTGATCGCATGTCCCCGAAATTGCTTCACCTCCTGGCTCACACTCCC PSARAARAMRLASQ ATAAGCCCAGAAGCGGGCGTCACCGGTCTCTCKGCGCTAACCAACCTCG AQYGRAMRTFTNPP TCAGCCGCCTGGCTCGCGGCTCCCTCCCACCCTGTACGATCCCACTGGCC LADLNDPATMERLQ AGTGCGGCGACACTTCTKCCGTTGCAGCCCCGACCGGGAAAAATACGCC ALHPTPTVPVVPLPPS CGATCGCTATTGGGCAAGCCCTWCGCCGGCTTGTCACAAAAKTMCTTCT AQPRPPEVTXEAVXR TCCTGCCGCCATCGACGACTGTCGGGACCACCTTGCTCCCGAACAAMTG AVRRLNPNSAAGPD GCMAACGGCATACCMAACGGCATTGACGCTATCGTACACGACGCACGC RMSPKLLHLLAHTPIS ATGCTAGTACGACGCCACGGTAACGACCCACACTACKTAATGGTGTCTA PEAGVTGLSALTNLV TTGACGCTTCCAATGCGTTCAATAATTTCTCACGSCAACAAGTCCTCGAC SRLARGSLPPCTIPLA CAGCTGCCCACTCGAGCACCATCGCTCTCACGATATTTGGATATGGTGTA SAATLLPLQPRPGKIR CGCACGCGCCCCCTCCCCCCTCGTCTTGCCTTCATSCCCGCCTACCATACT PIAIGQALRRLVTKXL CCACAGCCGGGAMGGATCACAACAAGGGGACCCTGCAAGCATGCTCCT LPAAIDDCRDHLAPE TTTCTCGCTTGCCCTCCAGCCGCTCACGCGCCTCATTTCACGTGAGTGTG QXANGIPNGIDAIVH AMCTWKTAATGAACCGCTGGTATGCGGACGACGGAACTATCATTGGAC DARMLVRRHGNDP GGATTGACGAAGTTKCCAAAGCCCTTGATATCATCACTAAAGAGGGGCC HYXMVSIDASNAFN CAGGTTCCAATTCTTCCTCAACCCTTCGAAGACACGCGTCTTCTGGCCAA NFSRQQVLDQLPTRA GCAGGCAGCMAGACCTCCTCAGCCCGCTCATGACAGTGGGTCCTCTGC PSLSRYLDMVYARAP GMGTCATCGATGAAGGCGGTGTGGMTCTGCTCGGCGCCCCCATMGGG SPLVLPSXPPTILHSRX TCACCAAGCTWTATGGCACAGTACATTCGGGAAAAWTTGAACACTTGC GSQQGDPASMLLFSL AAAACCGCMCTCGCCCATCTCGACCATATCCCCGAGGCCCGCATGCGCT ALQPLTRLISRECXLX TTCACCTGCATCGGGTGTCTGCTTCTGCATGCCGCTTGCAGCACCTCTTC MNRWYADDGTIIGRI CGGTTGGTCCCCCCGGATTTCGCGWTGCCGTTTGCACAACAATTCGACC DEVXKALDIITKEGPR GTGACCAACTCGMAGCCTATGMGCGCTTTAATAGTGTGACTATGTCGC FQFFLNPSKTRVFWP CAAGAATCGTGCCCAAATACGGCTGCSTTTTTCMCACGGWGGCCACGG SRQXDLLSPLMTVGP CCTCACCTCATTGGCATCTACCATACACGCCTCWTACGCTGCTAGCCTCA LRVIDEGGVXLLGAPI TCGATACCGCTCCAGCACGGCTACAAGGTCCCCACTTTCCCGCCGTCTCT GSPSXMAQYIREXLN CAGTATCAGCGTTTTGCACGAGGCCCGTTGCGGGTCGTTCTTCGAAATTT TCKTALAHLDHIPEAR ACCTTCATTCGTKCAACCCGCACACTTCTCGATGACGGAAWCGGACCTC MRFHLHRVSASACRL GGMTGCCTTGAACCAKCTGCGCTACTGGCGCGACCTGAACGCATACAC QHLFRLVPPDFAXPF ACCTTTCTACTTCAGGCGCAATACAGTGCAGCAGCWAGCTCGTACTGGC AQQFDRDQLXAYXR AAAWACCCCTCTGGGAGTCCTTCCCCAACCCTGGTGATCACAGCGCAGC FNSVTMSPRIVPKYG CTCGCTACGCAAACGAGTACGCTACAACTCCCTGCTTGCCCCWGGGGCC CXFXTXATASPHWHL ACCAGTTTTCTCACTGCACACCCSGCCGCCACCTCTCGGGTCCACAACGC PYTPXTLLASSIPLQH AACTTGGTCCACGATGTTACGTCGGCACCTCGACGCCCCCGTGACCAAC GYKVPTFPPSLSISVL GATTCCATATCGCCGTTGCGATGTKCTCACTGCTCCAAGCCTATGGATGC HEARCGSFFEIYLHSX CCGCGGCGACCACGCGWCCATTKGCAGCCACGGGTTTGGTACGTTGCA NPHTSRCDYVAKNR CCGGCATAACACCGTCAGGAACGTCCTCGCCMGGCAGTTATTCCGMGT AQIRLXFSHGGHGLT CGCTGGCCTCGCCTACTCGCTCGAAGTACCCTTTCTGATTCCCAACACCG SLASTIHASYAASLIDT CCGCCCGTCCCGCAGATATTCTCGTCCAACCACCTCCTCCAGCCCCTGGC APARLQGPHFPAVS CTACCTCCTGACMAACCCACAGCCTATGATGTCACGATTTGTAGCCCTTT QYQRFARGPLRVVLR TCGCCGCGGAATGTTATACCATGCCGCCCGTCACCGCGGCGGAGCCGCC NLPSFVQPAHFSMTE GACGCCGCATCTGTAAGGAAGMGCAAAGCCCTCGAGCGCACTATCCGC XDLGCLEPXALLARPE MACGCTCTCCTTATCGAGGACGACAATCSTCCGCCGCCTCTTGACTGGC RIHTFLLQAQYSAAAS ACTTTCAACCGCTTTCCTTCGACGCWCTGGGMGCCCCCTCTCAGTCTAC SYWQXPLWESFPNP TGTACACGTTATCGAAGATCACGCTAAGCTCATGGCCCTCCGCAACTCGT GDHSAASLRKRVRYN GCACMATTGCAACTGCCAAATCACGCATCCAACAACGCCTCAGCTTTGC SLLAPGATSFLTAHPA TATATGGTCCAGTGCTGCCGCCGCTATCCTCTCTCGCCTACCGACACACG ATSRVHNATWSTML CCGCGGACATCTCATACCCGATAGAAGTATAATTCACCTTCATATCTGCT RRHLDAPVTNDSISPL AGTGTCTCTGTAAGCGCACCCCTCATGCATTGATAAAATTACCCCCCA RCXHCSKPMDARGD (SEQIDNO:1031) HAXIXSHGFGTLHRH NTVRNVLARQLFRVA GLAYSLEVPFLIPNTA ARPADILVQPPPPAP GLPPDXPTAYDVTICS PFRRGMLYHAARHR GGAADAASVRKXKA LERTIRXALLIEDDNX PPPLDWHFQPLSFDA LGAPSQSTVHVIEDH AKLMALRNSCTIATA KSRIQQRLSFAIWSSA AAAILSRLPTHAADIS YPIEV(SEQIDNO: 1399) CRE CRE- Acanthamoeba TAACCCTAACCCTCTCCCTCGGCCCCTCTACCCTAAAGCGCCCTAATCGA TAACCCTA TAAGCC MATTTISRSPSSSSSS 1_ACas castellanii CCGGCGACGCCCTAATCGCTACCCTCTACGCCCTAATCGACTTTGGCGCC ACCCTCTC GCGCGA SSARSRASASTSASVA AAAGCGACTTTCCCCGGCCGATTTTCTTCCTGCCTTTTTCTTTTCTCTCCA CCTCGGCC CGAGGA SIPRLFRDGRFHCPLA AGCGACGCGCCTTTTACTTTGCCGCCGTTCTGTTTTTTCTTTTCTCTTTGC CCTCTACC CGGCCA HCQTRTSTWQDLSA ACTTCGCTTCTACTTCACACCTCCTCCTCCTCCTTCTCGACCCGCGCGGCC CTAAAGCG GGACGA HLTRMHDGDVPRDV TCGAGCGACTTGCTGCAGCGGCTCCCGGCCTCCCCCACGCGGCCTGCTA CCCTAATC CCAGGA AAACGIVQCLHEGCR CTCCCGCTTTCTAGACGCCCCCGGTCTTGCTCTCAGTCTCCCGCATCGAA GACCGGC CGACGG KWFRGAAGLASHRG GCGGTAGTCGGGGTACGTGCTCAAGTGACTCAAGCCTCTTTTCAGCCTC GACGCCCT CGACGG KARHAPPPAPRAALA GGCGCTCTCTCAATCCGCCTCAGTCTTAGCCTTTCAAGTTGCTCGATTAC AATCGCTA CGACCA VAAVPRADSRGRTP GCTCTCGAATCGCTCTCTCTCTCAGTCTCAGTCTCAGTCTCAATCTCGATC CCCTCTAC CCTAGC APTPSVAPPXAGPPP TTGCCTTCGCCTTCGTCTCGACGCCTTGCTCTCGWAATCGCTGCCACTAC GCCCTAAT ACCGCA RAAPRAAPSPLPCPP GTGCCAGCTTTTTCGTGCCTTGTCTTCGTGTCGACCCGGACCGTTTGCAA CGACTTTG CGCGCC ALPHPPPSASPPTSSV GCCCTCGCCTTCGTACCCCGCTCTCGTAGCCGTTCTCATCGCTGAAGCGT GCGCCAAA ACGACA TSPCSPPTTPPSQPSP TCTACGCGCTGGCAGCAAGCCTCGGCCCTAGCTTGTAGCGCCGCCGGTG GCGACTTT TATTGTC DLFSGFANAPTTPSP GCCGCTCGCCAACATGGCTACGACGACCATCTCACGATCCCCTTCGTCTT CCCCGGCC GCGCGC PSTPXSSPAGSPIPAA CTTCTTCTTCTTCTTCCGCTCGCTCGCGGGCGTCAGCTTCCACGTCCGCCT GATTTTCT TGTACA RRFVLPVATPYPAPA CAGTTGCGTCGATACCCCGCCTCTTCCGCGATGGCCGCTTCCACTGCCCT TCCTGCCT GGCGGC PRANRPKLSPVARPF CTCGCCCACTGTCAGACCCGCACGTCCACATGGCAGGACCTCTCCGCGC TTTTCTTTT TAGGTC VPKARAGAIPEASSP ACCTCACACGCATGCACGACGGTGACGTGCCCCGTGACGTCGCCGCCGC CTCTCCAA GAGCCC VTPQDRAVSRREDA CTGCGGCATCGTGCAGTGCCTACACGAGGGCTGCCGCAAGTGGTTTCGC GCGACGC AGCCGA AAAPSSAPGLGLADE GGAGCTGCAGGACTTGCCTCTCATAGGGGCAAGGCCCGTCACGCCCCG GCCTTTTA CCGTTCT HEDDDTYGGDTIALT CCACCAGCCCCCCGCGCCGCCCTCGCCGTCGCCGCCGTGCCCCGCGCGG CTTTGCCG GAGCCT APHAPRETRAPFEFE ATTCTCGCGGTAGGACCCCAGCCCCGACCCCCTCGGTAGCCCCGCCCWA CCGTTCTG CAGTCG ACFLEEEAPATAGDL CGCCGGTCCTCCGCCGCGAGCTGCGCCACGCGCCGCCCCCAGCCCACTG TTTTTTCTT GCTTGA PPYARAFLACPSARL CCGTGCCCGCCTGCGCTCCCGCACCCGCCCCCCTCTGCCTCTCCTCCCAC TTCTCTTTG GCCCCC QEIPRRLKSAWQAAA CTCCAGCGTGACGTCCCCGTGCTCCCCGCCCACGACCCCGCCGTCGCAG CACTTCGC GGCTTC KTIAEAALDCHTAGD CCTTCGCCCGACCTGTTCTCGGGTTTCGCGAACGCGCCTACCACGCCATC TTCTACTTC CCAAGG TQGYNAHLRLFIELPA GCCGCCCTCCACGCCGWCWTCGTCGCCAGCAGGCTCGCCCATCCCGGC ACACCTCC CCTACC RGLAVPTNCRGAART TGCCAGACGCTTCGTCCTGCCTGTGGCCACGCCCTACCCGGCCCCCGCG TCCTCCTC GGGGCG KLQRERLLDIAAGRIP CCGCGTGCTAACAGGCCCAAGCTGTCGCCGGTCGCGCGCCCCTTCGTCC CTTCTCGA GCTCTTT AIPDPPCDAPGADDA CTAAGGCGCGAGCCGGAGCGATACCTGAGGCGTCCTCACCTGTGACGC CCCGCGCG TTCGCCC LRGFPVSGTTAGDVS CTCAGGACCGCGCCGTCTCACGCCGCGAAGACGCCGCCGCCGCCCCTTC GCCTCGAG TGGTTTT NDDDSGGVHDRPAA GTCCGCGCCGGGCCTCGGCCTAGCAGACGAACACGAGGACGATGACAC CGACTTGC TGCCGG TASARQAKRLVEQGL GTACGGCGGTGACACAATCGCGCTCACTGCCCCCCACGCGCCCCGTGAG TGCAGCGG CCTGTTT SSRALRALERGEPAV ACCCGCGCCCCCTTCGAGTTCGAGGCGTGTTTCCTCGAGGAGGAAGCCC CTCCCGGC TTTCTCT ASADTLGRLEALHPP CAGCCACCGCCGGCGACCTCCCGCCTTACGCGCGCGCCTTCCTCGCTTGC CTCCCCCA CCCCCTT NPTDRGLWPGAPKA CCGTCAGCCCGTCTCCAGGAGATCCCGCGCCGCCTCAAGTCCGCGTGGC CGCGGCCT TTCCCCC AIPRVTAKHLAQVAK AGGCCGCCGCCAAGACCATCGCGGAAGCCGCGCTGGATTGCCACACCG GCTACTCC CTTTTCC ELPRGSAPGPSGWTF CGGGCGACACGCAGGGCTACAACGCCCACTTGCGGCTCTTCATCGAGCT CGCTTTCT ATTTGTA ELVQAAIDRQPTGTV GCCGGCCCGCGGGCTGGCAGTGCCCACCAACTGTCGAGGCGCCGCCCG AGACGCCC CTTAGTT AAFLIDMAQRALRGT CACCAAGCTTCAACGAGAGCGCCTGCTCGACATCGCCGCCGGAAGGAT CCGGTCTT TTTCCTT LHWRGLLTASRLVAL CCCGGCCATCCCGGACCCGCCGTGCGACGCCCCGGGCGCCGACGACGC GCTCTCAG CGGCCG KKPDGGVRPIAVGEA CCTACGCGGTTTCCCCGTATCGGGGACGACAGCCGGCGACGTCAGCAA TCTCCCGC CGGCAG LYRVIGRLVLKADRV CGACGACGACAGCGGAGGTGTGCACGATCGGCCCGCGGCCACCGCCAG ATCGAAGC CTTGTTG MSSADATQYVGRHQ CGCCCGGCAAGCCAAACGGCTAGTGGAGCAAGGGCTCTCCTCCCGAGC GGTAGTCG CCCGGC YGVAYPGGVEAPVH CCTCCGTGCCCTTGAACGGGGCGAGCCCGCGGTCGCCTCGGCAGACAC GGGTACGT ATAGTG AVRELHDSGQLRAVV CCTCGGGCGTCTCGAGGCGCTCCACCCGCCTAACCCCACCGACAGAGGA GCTCAAGT TTAATAT SLDWRNAFNSLDRV CTATGGCCCGGCGCCCCGAAGGCCGCGATCCCGCGGGTCACCGCCAAG GACTCAAG GTTTAA HTALLIADRAPALARL CACCTGGCCCAAGTGGCCAAAGAGCTCCCGCGCGGTAGTGCGCCGGGT CCTCTTTTC AAAACG YEWSYREDSVLVLPR CCCTCGGGCTGGACGTTCGAGCTCGTGCAGGCGGCCATCGACCGCCAA AGCCTCGG TGTAAA AFEKAGLPASLLSQA CCCACGGGCACGGTTGCCGCGTTCCTCATCGACATGGCGCAGAGAGCCC CGCTCTCT TAAATA GVRQGDVLGPLFFAI TCCGGGGCACCCTACACTGGCGGGGATTGCTCACCGCCAGCCGCCTTGT CAATCCGC ACTGTTT GAAPVLDEIDAIPYVT CGCGCTGAAGAAGCCCGACGGCGGTGTACGACCCATCGCCGTAGGCGA CTCAGTCT AACCCT PRAYLDDIFVTIPHGV GGCCCTCTATCGCGTCATCGGCCGCCTTGTTCTCAAGGCCGACAGGGTG TAGCCTTT AACCCT TDAATKAAVAATFAT ATGTCGAGCGCCGACGCCACGCAATATGTCGGGCGGCACCAGTATGGC CAAGTTGC AACCCT AEREGAAAGLRLNRC GTGGCCTACCCCGGTGGGGTTGAGGCCCCGGTCCACGCCGTCCGCGAA TCGATTAC AA(SEQ KSAVWAADAEALLP CTGCACGACAGCGGCCAGCTCCGAGCGGTCGTCTCGCTCGACTGGCGTA GCTCTCGA IDNO: PHAAGAREDVESCA ACGCGTTCAACTCGCTCGACCGCGTGCACACGGCCCTGCTCATCGCCGA ATCGCTCT 1278) PVREGLKILGAPVGSP CCGCGCACCCGCTCTCGCGCGACTCTACGAGTGGTCCTACCGTGAGGAC CTCTCTCA AFVAKSLDGIIKRAIG TCAGTCCTCGTGCTGCCGCGCGCGTTCGAAAAGGCGGGGCTGCCGGCC GTCTCAGT TLDLVADAELPLQHK TCCCTGCTCTCCCAGGCCGGCGTGCGCCAGGGCGACGTCCTGGGACCCC CTCAGTCT LVLLRQCVAQIPTFW TCTTCTTCGCCATCGGCGCTGCCCCGGTCCTCGACGAGATCGACGCCATA CAATCTCG ARAVPDAGPALAVW CCGTACGTGACGCCGCGAGCGTACCTCGACGACATCTTCGTCACGATAC ATCTTGCC DTALLRRTGALVGLD CCCACGGTGTCACGGACGCCGCGACCAAGGCCGCCGTCGCTGCCACCTT TTCGCCTT VRDGSLQADIARLPV CGCTACGGCGGAACGCGAAGGCGCGGCCGCTGGCTTGCGGCTCAACCG CGTCTCGA RLGGLGLRSMKDTAP CTGCAAGTCGGCGGTGTGGGCCGCGGACGCAGAAGCCCTCCTTCCCCCC CGCCTTGC RAFVASILFAAALANT CACGCCGCTGGCGCGCGGGAGGACGTCGAGAGCTGCGCACCAGTGCG TCTCGWA RRSELTCSASTARRLR CGAGGGCCTCAAAATCCTCGGCGCGCCCGTGGGCTCGCCCGCCTTCGTC ATCGCTGC AALPELARTDACNDE GCCAAGTCGCTCGACGGCATCATCAAGCGCGCCATCGGCACACTCGACC CACTACGT AAWRRSIARGVFPD TCGTCGCTGACGCCGAGCTACCGCTGCAGCACAAGCTGGTGCTGCTACG GCCAGCTT VDKLGTTQLQRVLQ GCAGTGCGTGGCCCAGATACCCACGTTCTGGGCCCGCGCCGTGCCCGAC TTTCGTGC GMADSKSAHRTRRQ GCAGGCCCGGCCCTCGCCGTCTGGGATACAGCGCTCCTCAGGCGCACG CTTGTCTT VPFLFAAVFEDAATP GGCGCGCTGGTCGGACTTGACGTGCGGGACGGGTCCCTGCAAGCCGAC CGTGTCGA GSGAWLAAIPSDPTL ATCGCGCGCCTGCCTGTCCGCCTGGGCGGTCTCGGCCTCCGTTCAATGA CCCGGACC VLPDAELAEAVRIKLL AGGACACGGCGCCCCGGGCCTTCGTGGCCTCGATCCTGTTCGCCGCCGC GTTTGCAA TTTANAAGVCPACH GCTCGCCAACACGCGCCGATCCGAGCTCACGTGCAGCGCCAGTACGGC GCCCTCGC KTGIDPSHAYTCVSLS CCGACGCCTCAGAGCCGCCCTGCCCGAGCTGGCACGCACCGACGCGTG CTTCGTAC HLRTARHDVVVRRVE CAACGACGAAGCCGCCTGGCGGCGGTCCATCGCCAGGGGAGTCTTCCC CCCGCTCT LACKTEKPVREHVLAI CGACGTGGACAAGCTGGGCACCACACAACTGCAGCGCGTCCTCCAGGG CGTAGCCG PPVAPTDNNNNGDE GATGGCGGACTCCAAGTCCGCCCATCGGACCCGCCGCCAAGTGCCCTTC TTCTCATC DGSPVTTADDNADG CTCTTCGCCGCCGTGTTCGAGGACGCCGCCACGCCGGGATCCGGTGCCT GCTGAAGC HAVATKRRPETRASA GGCTGGCCGCCATACCCTCCGACCCGACCCTCGTCTTGCCAGACGCCGA GTTCTACG RAAAAAATAAAAAAI ACTGGCCGAAGCCGTGCGCATTAAGTTGCTCACGACGACGGCCAATGC CGCTGGCA INDNSLLSDDDDDDD GGCCGGCGTCTGTCCGGCATGCCACAAGACCGGCATCGACCCGTCCCAC GCAAGCCT HDDNCHGEERGEGE GCGTACACGTGCGTTTCGCTATCCCATTTGCGCACAGCACGCCACGACG CGGCCCTA RNVTCPGHYTATPFA TGGTCGTTCGCCGAGTCGAGCTCGCCTGCAAGACCGAGAAGCCGGTCC GCTTGTAG ADDTLDNSDEDNED GCGAACACGTGCTCGCCATCCCCCCCGTCGCGCCCACCGACAACAACAA CGCCGCCG NAHEDDDEDGKDD CAACGGCGACGAGGACGGCAGCCCAGTCACCACCGCCGACGACAACGC GTGGCCGC NDDDVYNNCNSSSS AGACGGCCACGCGGTCGCGACCAAACGTCGCCCCGAGACCCGCGCTTC TCGCCAAC DGDEGGDDLDYEYS AGCCAGAGCCGCCGCCGCCGCCGCCACCGCCGCCGCCGCCGCCGCAAT (SEQID DQSVTRSVDAATGES CATCAACGACAACAGCCTCCTGAGCGACGACGACGACGACGACGACCA NO:1155) PNPERPTTPTRALLRA TGACGACAACTGCCACGGAGAGGAAAGAGGAGAGGGAGAAAGGAAC DLWLPATSTAVDVM GTCACGTGCCCCGGCCACTACACCGCCACACCCTTCGCCGCGGACGACA VAAACRRSRAKAFDR CGCTCGACAACAGCGACGAGGACAACGAGGACAACGCTCACGAAGACG AVSRKAAKYGPAVA ATGACGAAGACGGTAAAGATGACAACGACGACGACGTCTACAACAACT DGSIAKVVPFVVSPF GCAACAGCAGCAGCAGCGACGGGGATGAAGGCGGTGACGACCTGGAC GVLSRPAKAFLKRAM TACGAGTACAGTGACCAGAGCGTCACTCGAAGCGTCGACGCCGCGACG GDTTAAKQAKARLRL GGAGAGAGCCCCAACCCCGAGCGCCCTACCACGCCCACCCGCGCACTAC AVAAVRGTARLSYA TACGCGCGGACCTGTGGCTACCCGCCACCTCCACCGCGGTGGACGTGAT WGACAALIVGGN GGTCGCGGCCGCCTGCCGTCGGTCACGCGCCAAGGCCTTCGACCGAGC (SEQIDNO:1400) CGTCAGCCGCAAGGCCGCGAAATACGGCCCTGCGGTAGCCGACGGCTC GATCGCCAAGGTGGTGCCGTTCGTCGTGTCGCCCTTTGGCGTACTCTCG AGGCCGGCCAAGGCCTTCCTCAAGCGCGCCATGGGCGACACGACGGCG GCCAAACAGGCCAAGGCGCGTCTGCGCCTCGCCGTGGCCGCCGTCCGA GGCACGGCCCGCCTCTCCTACGCCTGGGGCGCCTGCGCCGCCCTCATCG TCGGCGGCAACTAAGCCGCGCGACGAGGACGGCCAGGACGACCAGGA CGACGGCGACGGCGACCACCTAGCACCGCACGCGCCACGACATATTGT CGCGCGCTGTACAGGCGGCTAGGTCGAGCCCAGCCGACCGTTCTGAGC CTCAGTCGGCTTGAGCCCCCGGCTTCCCAAGGCCTACCGGGGCGGCTCT TTTTCGCCCTGGTTTTTGCCGGCCTGTTTTTTCTCTCCCCCTTTTCCCCCCT TTTCCATTTGTACTTAGTTTTTCCTTCGGCCGCGGCAGCTTGTTGCCCGGC ATAGTGTTAATATGTTTAAAAAACGTGTAAATAAATAACTGTTTAACCCT AACCCTAACCCTAA(SEQIDNO:1032) CRE Cre- Fragilari- ATCAATCTAATACTGAAGGCAATACCAAACTCAACCCGAAATCAAAATC ATCAATCT TAGCAC MAPLPWNAATSSPP 1_FCy opsis GTTAGAATCAATATACGACCCCCGCTGCTGTACATGTCCAGCCGGATCTC AATACTGA CACCATC SPVPLTNDKKKDSTLP cylindrus GTTGTAAAGAAGATTGCAGCTGTAAAAAAGTTGGACTTCTTTGTTCTTCC AGGCAATA TATTCAT TATSKNLSKNNNNK TGTGAAGAAGTTGATTGTGGCTGTTCAAATTCTTTTCATAATAAAGAATT CCAAACTC ATCCAC NNNTNRINNIKNND AATGGCTCCACTTCCTTGGAATGCTGCGACATCTTCACCACCGTCACCTG AACCCGAA ACACTG NTNDGSNKINLKLPP TCCCATTAACTAACGATAAGAAAAAGGATTCTACTCTACCTACTGCAACA ATCAAAAT ACCACCT AAVKITNPYKNKKKN TCAAAAAATTTATCCAAAAATAATAATAATAAAAATAATAATACTAATAG CGTTAGAA CCACCTT KKKNNAGKSNPKTN GATTAATAATATTAAAAATAATGATAATACAAATGATGGTTCTAATAAG TCAATATA CACAAC QNPNSSPLSDNDDD ATAAATTTGAAACTGCCCCCGGCAGCTGTTAAAATCACAAATCCTTATAA CGACCCCC TCCACTC DTDSSNITINRRLKFG GAACAAAAAGAAGAACAAGAAGAAGAATAACGCTGGAAAGTCGAACC GCTGCTGT TCAATTC TDDLAPPNPPSNTNT CCAAAACAAACCAAAATCCAAATTCAAGTCCACTTTCGGATAATGATGAT ACATGTCC CCCTGA IGTATAATAATATTT GATGATACTGATAGCTCAAACATCACCATTAATAGACGACTAAAATTTG AGCCGGAT CTACTAA ATAATATTATNTTTT GTACTGATGATTTAGCACCTCCAAACCCACCGTCAAATACTAATACTATT CTCGTTGT GAAATA TTTTNNTTGDNLASN GGTACTGCTACTGCTGCCACCGCTGCCACTGCTACAACTACGGCAACCG AAAGAAG TTTCATG INNNNNNNNSGSNN CAGCGACTGCTACTACTGCTACCAACACTACTACTACTACTACTACTACT ATTGCAGC GTGGTT SNTNNINNTDGNGS AATAACACTACTGGAGACAATCTTGCAAGTAATATTAATAATAATAATAA TGTAAAAA ACATTG NNRPPPRVYTVDPRS TAATAATAATAGTGGCAGTAACAATAGCAACACCAATAATATTAACAAT AGTTGGAC GAGGTA DLPGAEISAANKMLD ACCGATGGCAATGGTAGTAATAATCGCCCACCTCCCCGTGTTTACACAG TTCTTTGTT TCCCAAC EVYGDHVHDNPGSH TCGATCCACGAAGCGACCTTCCCGGTGCAGAAATCTCTGCTGCAAATAA CTTCCTGT ACCAAG LSGLISSSQDQLWQG AATGTTAGATGAAGTATATGGAGACCATGTCCATGACAACCCCGGCTCC GAAGAAG CACAAA YFRRLIPHNQSLYDCP CATCTCAGCGGACTTATTAGCAGCTCTCAAGATCAACTCTGGCAGGGTT TTGATTGT ATGAAC KGKLGKDITNEYSNLF ACTTTCGTCGCCTTATACCTCACAACCAATCTCTCTATGACTGCCCTAAGG GGCTGTTC CCACTA EAIMNGKCNMEKLL GAAAACTGGGTAAGGATATAACGAATGAATATTCAAACTTATTTGAGGC AAATTCTT ACCCTCT VFPVVVLQRRHGVTK AATCATGAATGGCAAGTGTAATATGGAGAAGCTACTCGTGTTTCCAGTA TTCATAAT CATCCTA NADVKRRLLSRLTAW GTGGTTCTACAACGGAGACACGGAGTGACTAAGAATGCCGACGTTAAA AAAGAATT TCCACG KEGKFKYLVEDTHRD CGCCGTCTTCTTAGCCGACTCACCGCTTGGAAGGAAGGCAAATTCAAGT A(SEQID GGGACC LIAKQSKARGDTTPA ATCTTGTTGAAGACACACATCGAGATCTTATTGCCAAACAATCCAAAGCA NO:1156) ACCTTTG HRAKVYSSKLMRGHL AGAGGAGATACAACCCCCGCGCACAGAGCTAAAGTTTACTCAAGTAAAC AGCAGA QSAVNYITDREGGGI TCATGCGTGGACATCTCCAATCAGCCGTCAACTACATCACTGACCGCGA ACACCA LYPYDVDEKSGHTVS AGGAGGGGGCATCCTTTATCCTTATGACGTCGATGAGAAATCAGGCCAT TTCATAT RVLQDKHPSMRDPG ACTGTATCAAGAGTGCTACAGGATAAGCATCCCAGCATGCGTGATCCTG TACAAC PTAMPAYESVPELPT GTCCCACAGCCATGCCTGCCTACGAGTCCGTCCCGGAACTTCCAACACTT CTTTAGC LEITADTVEIVAGKLS GAAATTACAGCTGATACAGTTGAGATAGTCGCTGGAAAGCTCAGTGGT TAGATT GGAGLSGVDSIQLKH GGTGCAGGTCTGAGTGGAGTTGATTCAATACAACTCAAGCACCTCCTCC AAGATA LLLHHGQASQRLRNV TCCATCACGGTCAAGCAAGCCAACGACTGCGCAATGTTTGTGCAAAATT ATTATTT CAKFGRWLANEHPP TGGTAGATGGCTTGCCAACGAGCACCCCCCCTGGGCCTCGTACCGTGCC AGTACA WASYRAMLANRLIAL ATGCTAGCAAATAGGCTTATTGCGCTAGACAAAATGCCCGGAATTCGAC TATTTTA DKMPGIRPVGIGDT CAGTCGGTATAGGTGATACATGGCGTCGTTTCTTCGCCAAACTTGTTCTA TACTATT WRRFFAKLVLAVSM GCAGTCTCTATGTCTTATGCTACTGACTGTTGTGGGTCAGACCAGCTCTG AAAAAA SYATDCCGSDQLCAG TGCCGGACTAAGAGCCGGAGTTGATGGTGCCATACATGGACTATCGGC AAAAAA LRAGVDGAIHGLSA TATGTGGAGGGAGATGGAATCTGAGGAAAACACAGGTTTCGTACTTATT AAAA MWREMESEENTGF GACGCAGACAATGCATTCAATGAGGTCTCACGCATCAACATGTTATGGA (SEQID VLIDADNAFNEVSRI CGATCCGCCACGAATGGCCTGCTGGAGCTCGATTCGCCTTCAACTGCTA NO: NMLWTIRHEWPAG TCGGCACCACAGCCTACTAGTGGTACGGAATCCAGGGGGAAACCCTTC 1279) ARFAFNCYRHHSLLV ACTTTCTTTTCTAAAGAAGGTGTCACACAGGGCGACCCATTTGCGATGAT VRNPGGKPFTFFSKE AGCATATGGTGTCGCTCTCCTACCACTCATCCGCAAACTGAAAGAATTAA GVTQGDPFAMIAYG ATGTATTATTAGTTCAATCTTGGTATGCAGATGATGCTAGCGCAGCTGG VALLPLIRKLKELNVLL CAAATTTGATGAAATACTACGCCTTTTTCAAGATTTATTACGAATGGGAC VQSWYADDASAAGK CTGATTTTGGGTACTTTCCTAATGCATCTAAGAGTATCCTCATCACCCATC FDEILRLFQDLLRMG CCGACAATGTGGTTGCAGCTCACCACTTCTTCAACGAGACCCATGGCCT PDFGYFPNASKSILIT AGGTTTCAAGATCAGCACAGGAAGTCGTTTCCTGGGTGGTTTCATTGGA HPDNVVAAHHFFNE GATACCACAAGTCGAGATGAATACGTATCAACAAAAATCGCCGACTGGA THGLGFKISTGSRFLG TCCACGGCACCAAGGAGCTAGCAGCAGTAGCAAGATTGAAGTATCCAC GFIGDTTSRDEYVSTK ACGCAGCTTACACAGGCATTACCAAGTGTTTGCAGCACAAGTGGAGTTT IADWIHGTKELAAVA TACTCAACGTGTTATTCCTGGCATTGATGACCTCTTCCAACCACTGGAGG RLKYPHAAYTGITKCL ATGAACTCACCAATAATTTGCTCCCCGCCCTATTTGGAGACCCCCCATCC QHKWSFTQRVIPGID ACTATGGATGACAAGCTCAGACTTTTGACCGCTCTGCCAGTCAAACATG DLFQPLEDELTNNLLP CTGGGCTTGCTCTCCCGAATCCAGTTACCTCCTCCGCAACCAACTACAAG ALFGDPPSTMDDKLR AATAGCACTCTTATGAGTTCTCATCTTCTGTTGGCTGTTCAAGGCAAGAT LLTALPVKHAGLALP CAACTTCAGTTTACAGGACCACAGAGATACCTGTCAATCCTCTCTCTCCG NPVTSSATNYKNSTL CGTCCCGAGAGCTCCGACAAACCGAAAATGATTCTTCATTGACCAACCTC MSSHLLLAVQGKINF CTTGCAGCTCTCCCTCCAGCTGCTGCAGGTCAACCAAGCACAACAAGAG SLQDHRDTCQSSLSA CAATCAAGCGTGCTGGGGAAACCGGTCTTTGGCTTACTACTATCCCTAAT SRELRQTENDSSLTNL CACATCAACGGTAACATTCTCGGATGTGACGAATTTATTGATGCTATTCG LAALPPAAAGQPSTT ATTGAGATACCAAAAAGTGCCACACAATCTCCCTGCCAAATGTGATGGC RAIKRAGETGLWLTTI TGTGGCTCTGCATTTGATGTAGGGCACGCGCTCCAATGCAAATCCGGGG PNHINGNILGCDEFID GCCTAATCATTAGACGTCATGATGAACTCAATCTTGAGCTTGCATCTTTA AIRLRYQKVPHNLPA GCAAAGATGGCCTTGAGAGAATCTGCAATACGTGCTGAACCTGAAATCA KCDGCGSAFDVGHA ACCCCAGCGCCTCTATTATGGATTCTCCCACCACCATCACAGCCATCGAC LQCKSGGLIIRRHDEL ACAAACGGAGACCGAGGAGATTTGTTGATCAAGGGCTTTTGGGACAAT NLELASLAKMALRES GGAATGGACGCTATCATCGATGTCAGAATAACAGACACAGATGCCAAAT AIRAEPEINPSASIMD CCTATCGAACAAGAGACCCAAAAAAAGTCCTACAGTCACAAGAGAAGG SPTTITAIDTNGDRG AGAAAAAGAAGAAATACCTCGATCAATGTCTACTCCAACGTCGAGCCTT DLLIKGFWDNGMDA TACCCCTTTTGTTGTCTCTGTGGACGGCCTGATTGGTTACGAGGCCAGCA IIDVRITDTDAKSYRT ATGTGCTAAAGCAATTATCAAAACGTTTAGCAGATAAATGGAATAAGCC RDPKKVLQSQEKEKK TTATTCAGTTACATGTGGAATAGTCCGCTCACGTATCAGCATTGCATGTG KKYLDQCLLQRRAFT CGCGAGCTTCCAATCAATGTCTGAGAGGTTCTCGAATACCATTCAAAAC PFVVSVDGLIGYEAS AATGAGCAGACAAATTCAATGGGAGGACGGTGCAGGCGCCGGCCTCTA NVLKQLSKRLADKW TAGAATTGTCCGCTAGCACCACCATCTATTCATATCCACACACTGACCAC NKPYSVTCGIVRSRISI CTCCACCTTCACAACTCCACTCTCAATTCCCCTGACTACTAAGAAATATTT ACARASNQCLRGSRI CATGGTGGTTACATTGGAGGTATCCCAACACCAAGCACAAAATGAACCC PFKTMSRQIQWEDG ACTAACCCTCTCATCCTATCCACGGGGACCACCTTTGAGCAGAACACCAT AGAGLYRIVR(SEQ TCATATTACAACCTTTAGCTAGATTAAGATAATTATTTAGTACATATTTTA IDNO:1401) TACTATTAAAAAAAAAAAAAAAA(SEQIDNO:1033) CRE Cre- Hydra TTTCTAATGTTACGTGATATGATATGGTTAGTTCATGGTTAGTTTATGTTT TTTCTAAT TAACTTG MNMVSICKRCDRSF 1_HM vulgaris ATGCTTAGTTTATGGAAAATCGTTTATTTATGGCACAATATTGTTTGCTG GTTACGTG TATTTTT TTLKGLNIHKGQCKIF TTTTTAAATTTATGTAACGTGTGCATTTGATGTATATTCTTGAACTTTTTA ATATGATA AAATTG VSNTNKQINNVVNN ATCTGAATTTTTACTTGGTTTAATACGTTTATTATATTCTTCGATTGAGCA TGGTTAGT TTTTATT ELTTPNKNKVEINTIL ATTTATCCTATCAAAGCAATTTATCCTTCGATTCGAGCAATTTATCCTTCG TCATGGTT AGTTT NCDEISVEHYSTNTPY ATTCGAGCAATTTATCCTTCGATTGAGCAATTTATCCTATCAAAATTAGC AGTTTATG (SEQID LPKINICESIIDPNDYL ATATATACTGCAATTTTCAAATAATCTACGAAATAAGTTCACTTACTGAA TTTATGCT NO: WGHMPFSFLLNHVN AATCATTAAGTAAAAGAAGAAAGGAAGAAAAAATAAAAATAAAAAGTA TAGTTTAT 1280) TIYDEIVFYHKNLFKV GTAAATCCTTTCATAACAATAATCATTCTATTATTAAATTTAAAGGAATAT GGAAAATC PSGKGGKMFIEELTF TTTGGTTTTGTACTAAATCATGCGTTCATATTTCACCGAAGAAGGGGGCT GTTTATTT WLKQFNNRTKLNGI GCTATATTTTTGTTTGAAGTTGTTTATCTTAAAACTTTAAACTTGTGTTCA ATGGCACA AMKCFMIVPSLMLQ ACCAACCGTAAACATTAGTTCGCTGTTCGCTCAAATTATCTACAATATAA ATATTGTT KPSIRSKAKEHAECLV AATTTATCAATCTTTTTTCGTTACGGTAAACAATAAACAATAAAATAACT TGCTGTTT RRITLWRNGNFSELM ATAGTTATTTTATTGTTTACCGCATATTGTTTAACTATAGTTAAACAAAGT TTAAATTT REIRYIQSKINTSKKKR ATTTGTTTATGGAACATTACCAGTATCTCTTGTTAAGGTAAACAACAAAA ATGTAACG TFEDISRIFAKLMME CATAGACGGCATCTCTTTTTAAGGTAATTAAGTATACGGCTAATAATAAA TGTGCATT GKVAAALKVLDRESS AATATACAGCTAATAATAAAATCTTCAATGAACATGGTTTCTATATGCAA TGATGTAT GILQCSESVLKELKSK AAGATGTGATCGTAGCTTTACTACCCTTAAGGGACTAAATATTCATAAA ATTCTTGA HPDETPVQDNCLLYG GGTCAATGTAAGATCTTTGTTTCCAATACAAATAAACAAATAAACAATGT ACTTTTTA PLQNTPECLFDSIDEI AGTTAACAATGAATTAACAACACCGAATAAAAACAAGGTGGAAATTAAT ATCTGAAT SIFNSALQTKGSAGPS ACGATATTAAACTGCGATGAGATATCTGTAGAACACTATTCAACCAACA TTTTACTT GMDADLYRRVLCSK CACCTTACTTACCCAAAATAAATATTTGTGAATCTATTATAGATCCCAAC GGTTTAAT CFGPSCKTLREEIATF GACTATCTATGGGGTCATATGCCGTTTAGCTTCCTTCTCAACCATGTCAA ACGTTTAT TKNIATKSYQPDIVQP CACAATATACGATGAAATAGTATTTTACCATAAAAACCTTTTTAAAGTGC TATATTCTT YIACRLIPLDKNPGIRP CATCAGGAAAAGGTGGTAAAATGTTTATAGAAGAACTGACCTTTTGGCT CGATTGAG IGIGEVLRRIVGKTISH AAAACAGTTTAATAATCGAACCAAATTGAATGGAATAGCCATGAAATGT CAATTTAT HCQKEIKEAAGPLQT TTCATGATAGTCCCTTCCCTAATGTTACAGAAGCCCTCAATACGGTCCAA CCTATCAA CAGHGAGAEAAIHA AGCCAAAGAACATGCAGAATGTTTAGTAAGACGAATTACATTATGGAGA AGCAATTT MQKIFHQEDTDGVL AACGGGAACTTTAGTGAATTGATGCGGGAAATTAGATATATTCAGAGCA ATCCTTCG LIDARNAFNCLNRSV AAATTAACACCTCAAAAAAGAAAAGGACATTTGAGGATATCTCAAGGAT ATTCGAGC ALHNIQITCPILAMYL ATTCGCAAAACTAATGATGGAAGGTAAAGTTGCTGCCGCACTGAAGGTT AATTTATC VNTYRKPAKLFIYGGE TTAGATAGAGAGTCATCTGGCATCTTGCAATGCTCGGAAAGTGTATTGA CTTCGATT TIFSKEGTTQGDPLA AAGAATTGAAAAGTAAACACCCAGACGAAACTCCTGTACAAGATAATTG CGAGCAAT MPWYSLSTVTIINTLK TTTACTATACGGCCCGTTACAAAACACTCCAGAATGTTTATTCGATTCAA TTATCCTTC LVIPDVKQVWLADD TTGATGAGATAAGTATATTTAACTCAGCTTTACAGACTAAAGGATCTGCA GATTGAGC ATAAGKLQSLKKWYK GGTCCTTCTGGAATGGATGCAGATCTTTACCGTCGAGTCCTATGCTCAAA AATTTATC CLEDVGGLYGYYVNQ ATGTTTTGGACCCTCTTGTAAGACTCTACGAGAAGAAATAGCAACATTTA CTATCAAA SKCWLIVKSDNQAEE CAAAAAATATTGCAACAAAATCCTACCAACCGGATATAGTTCAACCCTAC ATTAGCAT AKLIFGNSINITTQGK ATTGCATGTCGACTAATTCCCTTAGACAAAAATCCCGGGATTCGCCCCAT ATATACTG RHLGAALGSEAYKKV AGGAATTGGGGAAGTGTTACGTAGGATTGTAGGTAAAACCATTAGCCA CAATTTTC YCEDLVSKWSKELNN CCATTGTCAAAAAGAAATCAAAGAGGCAGCTGGACCACTACAAACTTGC AAATAATC LCEIATTQPQAAYSAF GCAGGACACGGTGCAGGAGCAGAAGCTGCAATACATGCTATGCAAAAG TACGAAAT IKGYRSKFTYFLRTIEA ATATTTCATCAGGAAGATACAGATGGTGTTTTGTTAATCGATGCTAGGA AAGTTCAC FENFVTPVEKILSEKLL ACGCGTTTAACTGCCTAAACCGTTCTGTTGCACTACATAATATACAGATA TTACTGAA PVLFGTDCSIIKENRD ACTTGCCCAATCTTAGCTATGTATTTAGTCAACACTTACCGTAAACCGGC AATCATTA LLALNPSEGGLGICNL AAAATTATTCATCTACGGTGGAGAAACTATTTTTTCGAAAGAAGGCACA AGTAAAAG ITEAKEQHTASKKITN ACGCAGGGCGATCCCCTCGCCATGCCATGGTACTCACTTAGCACTGTGA AAGAAAG LHIKSILDQSDVMKEK CAATCATAAATACATTGAAACTAGTAATTCCTGATGTAAAACAAGTATG GAAGAAA DDFGKTFSEIKTKTN GTTAGCCGATGATGCTACCGCTGCAGGAAAATTACAGTCTTTAAAAAAG AAATAAAA MDKSKKKKEEVKKIH TGGTATAAATGCCTAGAGGATGTCGGTGGTTTGTATGGTTATTATGTAA ATAAAAAG AGLPENLKLLVEQAC ATCAGTCAAAATGCTGGCTAATAGTAAAATCTGATAACCAAGCTGAAGA TAGTAAAT DKGASSWLNTLPIKE AGCTAAACTTATATTTGGCAACTCCATAAATATAACTACTCAGGGAAAAA CCTTTCAT QHLDLNKEEFKDALR GGCACTTAGGAGCTGCACTTGGTTCGGAAGCATACAAAAAAGTGTATTG AACAATAA LRYNVPLANLPSYCA CGAGGATTTAGTAAGTAAATGGTCTAAAGAACTTAACAATCTCTGCGAA TCATTCTA CGEKFDELHAMSCKK ATCGCCACCACGCAACCACAAGCTGCTTATTCAGCTTTTATTAAAGGGTA TTATTAAA GGFVCNRHDNIRDLL CAGATCTAAATTCACTTACTTCTTACGCACAATTGAAGCTTTTGAAAATTT TTTAAAGG TVCLNKVCTDVQAEP CGTAACACCAGTGGAAAAAATTTTATCAGAAAAATTATTACCTGTATTGT AATATTTT HLIPLTNEKFNFKTAN TTGGAACTGATTGTTCTATAATCAAAGAAAATAGGGATTTATTGGCGCT GGTTTTGT TNDEARLDIKAKGFW AAATCCATCGGAAGGAGGACTTGGAATTTGTAACTTAATAACTGAGGCC ACTAAATC RKGETAFFDVRVTHV AAGGAACAGCATACTGCCTCTAAGAAAATAACTAACTTGCACATAAAAT ATGCGTTC NSKSSKKQPTKHIFRR CAATACTCGATCAGTCAGATGTTATGAAAGAAAAAGATGATTTCGGGAA ATATTTCA HEDAKKREYLERVLE AACATTTTCAGAAATAAAAACAAAAACAAATATGGATAAATCTAAAAAA CCGAAGAA VEHGTFTPLIFGTNG AAAAAAGAAGAGGTTAAAAAAATACATGCAGGACTTCCAGAAAACCTT GGGGGCT GFGDECKRFTALLAQ AAACTTCTGGTTGAACAGGCCTGTGACAAAGGTGCCAGCAGCTGGTTAA GCTATATT KLSLKMGERYGAVIN ACACCTTACCAATTAAAGAACAACATCTAGATCTGAATAAGGAAGAGTT TTTGTTTG WLRTRLSMEITRASLL TAAGGACGCACTTAGATTGAGATATAATGTGCCACTTGCCAATTTACCAT AAGTTGTT CLRGSRTPFRHYNTD CCTACTGTGCTTGTGGAGAAAAATTTGACGAGCTACACGCAATGTCATG TATCTTAA DVGLENVQCGLI CAAAAAAGGTGGCTTTGTTTGTAACAGACATGATAACATCAGAGATTTA AACTTTAA (SEQIDNO:1402) TTAACTGTTTGCCTAAATAAAGTTTGTACTGATGTTCAAGCGGAGCCGCA ACTTGTGT TTTAATTCCATTGACAAATGAAAAATTTAATTTCAAAACTGCCAATACCA TCAACCAA ACGACGAAGCTAGATTGGATATAAAAGCAAAAGGGTTTTGGAGAAAAG CCGTAAAC GAGAAACTGCATTTTTTGATGTTAGAGTAACGCACGTAAACTCCAAATC ATTAGTTC CTCCAAAAAACAACCAACAAAACACATATTCCGTAGGCATGAAGATGCA GCTGTTCG AAAAAACGTGAGTATTTAGAACGAGTTCTAGAGGTTGAACACGGGACA CTCAAATT TTTACCCCATTAATTTTTGGTACGAATGGTGGGTTTGGAGACGAATGCA ATCTACAA AACGCTTCACGGCACTACTCGCACAAAAACTGTCCTTAAAAATGGGTGA TATAAAAT GCGGTACGGAGCTGTTATAAATTGGCTAAGGACACGTCTTTCCATGGAG TTATCAAT ATTACTAGAGCCTCCCTACTCTGCTTAAGAGGGTCACGAACCCCATTTAG CTTTTTTCG GCATTATAACACTGACGATGTTGGCCTGGAAAATGTGCAATGTGGACTT TTACGGTA ATTTAACTTGTATTTTTAAATTGTTTTATTAGTTT(SEQIDNO:1034) AACAATAA ACAATAAA ATAACTAT AGTTATTT TATTGTTT ACCGCATA TTGTTTAA CTATAGTT AAACAAAG TATTTGTTT ATGGAACA TTACCAGT ATCTCTTG TTAAGGTA AACAACAA AACATAGA CGGCATCT CTTTTTAA GGTAATTA AGTATACG GCTAATAA TAAAAATA TACAGCTA ATAATAAA ATCTTCA (SEQID NO:1157) CRE CRE- Lactuca ACATTAAATTAGAGAGGTTGATGTTTCAATGGAAGAAGATGAAATTCCA ACATTAAA TGAACT MASSSTSSSDICLCPF 1_LSa sativa AGAAGCTATTTTTGTTGCCCACCAAGTGTTTGATAAAATGTCCAAACTAA TTAGAGAG ATATTTT RSFHCCPNGEVGSKG TTTTTCTCTTGTTGCAGCTTTATTGTTCAAGATAATGTAGTTTGCTTAGTT GTTGATGT ATATATT IXRMISHIKRHHLLTE TGAGCGTTCCTTGTGCACACCAACAGTGTGTTGGTGTGCCATTTCCTTTC TTCAATGG AAAAAA DRKCVLREALSSDVG CTTCCTTTTTAACTATTGCTTCATAGCTTAAGCTTCATCTCGAGGCTTGTT AAGAAGAT A(SEQ LFMAVEETLKAFGQ CTCTTGTATGGCTTCTTCTTCTACAAGTTCGAGTGATATTTGTCTGTGCCC GAAATTCC IDNO: WMCGKCMTLHALS GTTCAGAAGCTTCCATTGTTGCCCAAATGGTGAAGTGGGAAGTAAGGG AAGAAGCT 1281) RYCHHPDGRVXFVT GATTGKCCGTATGATTTCACACATCAAAAGGCATCATCTACTTACTGAAG ATTTTTGTT GADGSSRYIVGILKPS ATCGTAAATGTGTTTTACGTGAAGCTCTTTCTAGTGATGTTGGTTTATTT GCCCACCA TKESVTNALGGLVFD ATGGCGGTGGAAGAAACTTTGAAGGCCTTTGGTCAATGGATGTGTGGG AGTGTTTG VGLLDRVFKEPITTVK AAGTGTATGACTTTGCATGCTCTTAGCCGTTATTGTCATCACCCGGATGG ATAAAATG SIPHSCRLAFSQALKT TCGTGTGAGKTTTGTTACAGGGGCTGACGGCTCGAGTCGTTACATTGTC TCCAAACT ALYKVIAQPGSVDA GGTATTCTAAAGCCGTCTACTAAAGAGTCGGTGACAAATGCTCTTGGAG AATTTTTCT WICLLLLPRCTLQVFR GTTTGGTTTTTGATGTTGGGCTCCTTGATCGTGTTTTTAAAGAGCCTATC CTTGTTGC PKNRQECRSGNRKSL ACTACTGTCAAGAGTATCCCCCATAGTTGTCGCCTTGCTTTCTCTCAGGC AGCTTTAT QQSSILKSLDTWGKE TTTGAAAACTGCTCTTTACAAGGTGATTGCCCAACCTGGCTCGGTTGATG TGTTCAAG DGIRKLVQNMLDNP CATGGATTTGTTTGTTACTTCTTCCTCGCTGCACACTGCAGGTGTTTAGG ATAATGTA EVGAMGQGGGILQK CCCAAAAATAGACAAGAATGTAGGTCTGGGAATAGAAAATCCTTACAAC GTTTGCTT ESTSSNTNIRQCLRKV AAAGCTCCATCCTGAAGTCCTTGGATACATGGGGGAAAGAGGATGGTA AGTTTGAG ADGHFTAAVKVLCSS TCAGGAAGTTAGTTCAAAATATGTTAGACAATCCCGAGGTTGGGGCCAT CGTTCCTT GVAPYNGDTIKALED GGGACAGGGTGGAGGCATCCTTCAGAAGGAGTCTACATCAAGTAACAC GTGCACAC KHPFRPPPSMPSPIIS CAACATCAGGCAGTGTCTCCGTAAGGTTGCAGATGGKCATTTTACCGCA CAACAGTG EPPLVADFDCVFGCIK GCAGTGAAAGTGTTATGCTCATCGGGTGTTGCGCCATATAATGGTGATA TGTTGGTG SFPKGTSCGRDGLRA CTATTAAAGCTTTGGAGGACAAACACCCTTTCAGGCCACCCCCATCCATG TGCCATTT QHXLDALCGEGSAIA CCGAGCCCCATAATTTCTGAACCTCCCCTTGTAGCAGACTTTGACTGTGT CCTTTCCTT TDLIRAITSVVNLWLA ATTTGGTTGCATCAAATCCTTCCCTAAAGGAACTTCWTGCGGGAGAGAT CCTTTTTA GRCPTILAEFVASAPL GGCTTGAGGGCTCAACACWTACTAGATGCCCTTTGTGGAGAAGGGTCT ACTATTGC TPLIKPDNGIRPIAVG GCTATAGCCACAGATCTCATACGTGCTATCACTTCAGTGGTTAATTTATG TTCATAGC TIWRRLVSKVAMKG GTTAGCGGGAAGATGTCCGACCATTTTGGCAGAGTTTGTTGCATCCGCT TTAAGCTT VGKEMAKYLNDFQF CCTCTCACGCCTCTGATTAAACCTGACAACGGGATCCGTCCAATTGCAGT CATCTCGA GVGVSGGAEVVLHS AGGCACTATATGGAGACGTCTGGTTTCCAAGGTTGCCATGAAAGGTGTG GGCTTGTT ANRVLSEHHADGSLA GGTAAAGAAATGGCCAAGTACCTTAATGATTTTCAGTTCGGGGTTGGTG CTCTTGT MLTVDFSNAFNLVD TGTCCGGGGGTGCTGAGGTTGTGTTACACAGTGCCAATAGGGTGTTGA (SEQID RSALLHEVKRMCPSIS GTGAACACCACGCTGATGGGTCTCTTGCAATGCTGACAGTGGATTTCTC NO:1158) LWVNFLYGQAARLYI GAATGCCTTTAACCTGGTGGATAGATCAGCCTTGCTCCACGAGGTTAAG GDQHIWSATGVQQ AGGATGTGCCCTTCTATTTCTTTGTGGGTGAATTTCTTGTACGGGCAAGC GDPLGPLLFALVLHPL AGCGAGACTTTATATAGGAGACCAACATATATGGTCTGCCACTGGGGTG VHKIRDNCKLLLHAW CAGCAAGGCGACCCCTTGGGCCCTCTTCTTTTTGCCCTCGTTTTGCACCC YLDDGTVIGDSEEVA GCTTGTGCACAAGATTAGAGACAATTGTAAGCTCCTTCTCCATGCTTGGT RVLNIIRVNGPGLGLE ATCTAGATGATGGGACTGTCATTGGGGATTCAGAGGAGGTGGCTAGAG LNIKKTEIFWPSCDGR TGTTGAACATTATTCGGGTGAATGGTCCAGGCTTGGGTCTTGAGTTGAA KLRADLFPTDIGRPSL CATCAAGAAAACGGAGATTTTTTGGCCCTCCTGTGATGGTAGGAAGCTT GVKLLGGAVSRDAGF CGTGCCGATTTATTCCCAACGGATATAGGGAGACCTTCTTTGGGGGTGA ISGLAMKRAVNAVDL AGCTCCTTGGGGGGGCTGTTAGCAGAGACGCAGGGTTTATTAGCGGGC MGLLPQLCDPQSELL TGGCCATGAAGAGAGCGGTCAATGCTGTTGATTTGATGGGTCTTCTTCC LLRSCMGIAKLFFGLR ACAACTATGTGACCCGCAGAGTGAGCTCCTTTTGCTTCGATCATGTATGG TCQPVHIEEAALFFDK GCATTGCAAAACTTTTCTTTGGTTTAAGGACATGCCAGCCGGTGCACATA GLRRSIEDMVVCGGP GAAGAGGCAGCTTTGTTCTTTGACAAAGGATTGCGCAGGTCTATCGAGG FFGDIQWRLASLPIRF ATATGGTGGTATGTGGAGGCCCCTTCTTTGGAGACATCCAGTGGCGTCT GGLGLYSAYEVSSYAF GGCTTCCTTACCTATTCGTTTCGGTGGTTTGGGTTTGTACTCGGCATACG VASRAQSWALQDHI AGGTTTCCTCCTACGCATTTGTAGCCTCGAGGGCCCAATCTTGGGCATTA LRDSGICGMDSDYLC CAAGACCACATCTTACGTGACAGTGGCATATGTGGTATGGACTCTGATT AMTRLRDTIPGFDCS ACCTATGTGCTATGACTCGTCTTCGCGATACGATTCCGGGATTCGACTGT GFTNKDTPPKSQKAL AGCGGTTTCACTAATAAGGACACCCCCCCTAAATCCCAAAAAGCATTGG ACALFSKIVKDMEVD CGTGTGCCCTTTTTAGCAAAATCGTCAAAGATATGGAAGTCGACTTCGA FDMTVRQKAVFECL CATGACTGTTAGACAGAAAGCAGTTTTTGAGTGTCTTCGGGCACCTCAT RAPHAQDFLLTIPIDG GCTCAGGATTTTCTGCTAACTATCCCTATTGATGGCCTTGGCCAGCATAT LGQHMSPVEYRTILR GTCTCCTGTGGAGTACCGAACTATCCTTCGTTACCGCCTCATGATTCCTCT YRLMIPLFPIDEICPVC ATTCCCAATTGACGAGATATGCCCAGTTTGCCGCAAGGCATGTTTGGAT RKACLDTFGEHAVHC ACCTTTGGGGAACATGCGGTTCATTGTAGAGAGCTCCCTGGTTTCAAGT RELPGFKYRHDVVRD ACAGACATGATGTGGTTAGGGATGTTCTCTTTGATGCTTGTCGGCGTGC VLFDACRRAGISAKKE TGGTATTTCTGCGAAGAAAGAAGCGCCAGTGAACTTTTTGACGGACCCG APVNFLTDPQDGRST CAGGATGGAAGATCCACACTTAGACCGGCTGACATTTTGGTCTTTGGAT LRPADILVFGWVGGK GGGTAGGAGGGAAGCACGCGTGTGTGGATCTTACTGGGGTCTCTCCTC HACVDLTGVSPLVGL TCGTCGGTTTGAGGAGCGGGGGTTTCACAGCAGGGCATGCCGCTTTGA RSGGFTAGHAALKA AAGCCGCTGCGTGCAAAGTGGCAAAGCACGAGAATGCATGTATAGAAA AACKVAKHENACIEN ATCAACATGTGTTTGTACCTTTTGCATTTGATACATTTGGTTTTCTCGCAC QHVFVPFAFDTFGFL CAGAGGCGGTGGAGCTCCTCAACAGAGTCCAACGGGTCATGCATTCTA APEAVELLNRVQRV ATGTCATATCTCCTAGATCCACAGATGTTGTTTTCAAAAGAATTAGTTTT MHSNVISPRSTDVVF GCCATCCAGAAAGGGCTAGCGGCGCAGCTTGTTGCCCGTTTGCCTTCCA KRISFAIQKGLAAQLV TCGATATGTATTGAACTATATTTTATATATTAAAAAAA(SEQIDNO: ARLPSIDMY(SEQID 1035) NO:1403) CRE Cre- Monosiga CATCTTGGCGTGAACCACGTTGTCAGACAAAATCTGCAACCCCGCTCTTT CATCTTGG TAGGTA MATESGGEDSWTQ 1_MB brevicollis GCGGCCCGCGTTTTGGCGGCGCCCTCGCTCCCACCGTGTCCGCTCGCTT CGTGAACC GGCACC VRGAKRPSAESPPSN GCTCGCTTGCTTGCCCCGCGGACATGGCCACTGAGTCCGGCGGCGAGG ACGTTGTC GTCTCG TTTSPSQTHRSAKHT ATTCTTGGACCCAGGTCCGCGGTGCTAAACGCCCGAGTGCCGAATCACC AGACAAAA GGGGTC KHGSARHDRNHVFP TCCAAGCAACACCACCACCTCGCCTTCCCAAACTCATCGTTCTGCAAAAC TCTGCAAC CCTCTGT DPMTTPLRPHARHS ACACAAAACATGGCAGCGCTCGCCACGACCGTAACCATGTTTTCCCTGA CCCGCTCT GGGGAT VPTARASSHVPSTSP CCCCATGACCACCCCGCTTCGCCCTCACGCCCGCCACTCTGTCCCTACCG TTGCGGCC CCCTGT AAGATESSARAVVPA CCCGTGCCTCGTCTCATGTGCCCTCGACGTCCCCCGCTGCCGGTGCGACC CGCGTTTT GTGCAC AEPVTRTSNGGGEQ GAGTCTTCGGCACGTGCCGTCGTGCCCGCGGCCGAGCCAGTGACCAGG GGCGGCG CTGTCG HPIIGNTSNASPRTPR ACGTCAAACGGCGGCGGGGAGCAACATCCCATCATCGGAAACACTTCC CCCTCGCT CTCCCTA TPSSPRSFAQVAAA AATGCTTCTCCCCGCACCCCTCGCACGCCATCCAGCCCTCGCTCCTTTGCT CCCACCGT GGTGGT MPAAATATSSAPMT CAAGTTGCTGCGGCAATGCCTGCCGCCGCCACTGCCACATCTTCGGCCC GTCCGCTC TCCTCGT EDLSASVPSEPNGSG CTATGACCGAGGATTTGTCAGCATCGGTGCCCTCTGAGCCAAATGGCAG GCTTGCTC TGTGTCT EQQPSPESTGQTHHS CGGGGAGCAACAACCCTCGCCCGAGTCCACAGGGCAGACACATCATTC GCTTGCTT TTTGATG IPNTPSDFLTMSSDES GATTCCTAACACACCATCGGATTTTTTGACCATGTCTTCGGATGAAAGCG GCCCCGCG GCTTGA DSPPRSTALRAPTPIA ACTCCCCTCCTCGCTCCACCGCACTCCGCGCGCCCACCCCTATCGCCCCTC GAC(SEQ CTTGTAT PPAHDGDGDTNGSA CCGCGCATGATGGTGACGGTGACACAAACGGCAGTGCCACGCCTGAGC IDNO: TTTTGTT TPEPLVQSPTPAQM CATTGGTGCAATCACCTACACCCGCTCAAATGGTGCTGCCATATCCATCG 1159) TTAATTT VLPYPSGTQQTHSDP GGTACACAACAAACCCATTCCGATCCCTCTCCGCCCTCTGCTTCACCCCCT TGCTTTA SPPSASPPATTILPAAI GCCACTACCATTTTGCCCGCTGCCATTTCACATCCTGTCGAACACAGTGA ATTTTTG SHPVEHSEHANSAPL GCATGCAAACTCAGCCCCACTTGGCGAAGTCAGTGAGAGTGAAACACA CTGTATT GEVSESETHNTAGEH CAATACAGCGGGCGAACACAGTGAGAGTGAGCAAGATGTTCTTCTCAG TGTGTG SESEQDVLLSDPAPPI CGATCCCGCTCCGCCCATCGCTGCCAACGTGCTGGATGCCCAGCGCAAG GTATTTT AANVLDAQRKVLLKT GTCCTGCTGAAGACATCTGGCCACAGGCAACTCCTCGCCTGCCCATTTG TGCTGA SGHRQLLACPFGLCK GGCTTTGCAAATGCAAGGGGCCCCGCCTTGACCGCAAAGCCTGGGTCA ATTTTTG CKGPRLDRKAWVNH ATCATGTACTACGCGAGCACCCCTACGACGAGCAAGCCACTGATCTGGT TAAGGT VLREHPYDEQATDLV CAAGCAGGTGATGGAGGCCAAATTGGTCGCCCAGTGCAACAAGTGTCA CCTTTGT KQVMEAKLVAQCNK CCTCTTCTTCGAAGCTGCTGGTATCAGTCAACACCGCTCCCGATGTGGTG ATGATG CHLFFEAAGISQHRS CCAATCTGAAGCGAGCGACCGAGGCGTTGTTTCATGCGGCTGGACACG TCTTTGT RCGANLKRATEALFH ACCTGCTTGAGATTATGCGTGGCGCTTGGCCCCAACAGTGTGTAGGGTC CTTCTGT AAGHDLLEIMRGAW TCGCATCAGTGTCTGCGAGCTGCTCAAGCTCGCCCGGCATCCACTGATG GGTCGG PQQCVGSRISVCELLK CAGCGCAGCCGCTACCCATCCAACGCCACCGAGACCAAGCTGATGGCTG TTGTTTT LARHPLMQRSRYPSN CCACCCTGAGCCAGCTGTATTGGTCTGCCGTCCACTCGGACTATACCGCT CCTCAAT ATETKLMAATLSQLY GAAGAGCGAGAGATGTGCTGGGCTTTGATTTTGGCCTTGCCTAGCATGT CCGACG WSAVHSDYTAEERE TGTTGTCTGCTCCCTCGACCGCACTGTCTACGATTGACCTGCGCAATATG TTGTGTC MCWALILALPSMLLS TTTCACGATCGTCTCCGTTGGCTTGTGACGGGGCAACTAGGTCGGGTCG TCGTTG APSTALSTIDLRNMF TGGACGCCATGCGCAAGGCAGTCGCACGCAAGCAGAGCCGTCGAGGAC GATGTG HDRLRWLVTGQLGR AGCTGAACGCCGGCGCGGGCGCCCACCCGAACGACGCAGTCGACCAGA AGCGTG VVDAMRKAVARKQS GCCTCAGGTCGCTCGTCCGCGACCCGGACCTGGCGGACGAAGCCTGGG CCGTGG RRGQLNAGAGAHPN CAAACCACGTCACGAACCGTCTGAACCGAGGTCAGATTGCCAAAGCATT TGTTCTT DAVDQSLRSLVRDPD TGATGCCGACAAGGCTCGTGCCGTGATTGGTAATTCTGAGGTTCAGGCC TGTGTTT LADEAWANHVTNRL GTGCGCGACCTCTTGGTACCGCCCGGGCTGACCCCGTACATTGCTTCGA GTGCTG NRGQIAKAFDADKA CACCCGCCTCCACGTCTACACTGGCACCAGCCACGGCTGTGAGCTCCCC TGATGG RAVIGNSEVQAVRDL AACCGTGTCCTTCACCAAGGGTGAGCTCCCCAAGGCGTTGGCGGCCACC CTTGTA LVPPGLTPYIASTPAS AAGGGTGTCACCGACCCCTATGGTTGGTCTGGTGAGCTTCTTGCCTCCAT GTTGTG TSTLAPATAVSSPTVS CTACCGCATCAAGGAGCACTTCAGTCAAGTCTTGGGCCCACGCCAGGGT ATGTGT FTKGELPKALAATKG TCTACCAGCGACCCGACTGCTCCTTCTGATGGAGACGCGCCTCAGGGCC GACTGC VTDPYGWSGELLASI CCACCACCGCCACTGGAGGTCCTCAGGTTGCCTTGAACAAGATCTTTCAC CTTTTTG YRIKEHFSQVLGPRQ CACATTGCCAACAACACCGTGCCCGAGTCGATTCGACATGCCCTTTGCTC GGTGTC GSTSDPTAPSDGDAP CATCAACTACACTATCCTGGAGAAGGCCAATGGCAAGTTTCGACCCGTG TTGTGTT QGPTTATGGPQVAL GGCACGGATTCCATCTTCAACAAGGTTGTCAACCGCGCTCTGCTCGAGC TGAAAT NKIFHHIANNTVPESI AACAGCAGCCCCATATTGCCCACTTGCTACAGGCCAGTCCAGAGCTGGC GGCCGT RHALCSINYTILEKAN CGTCGGAGTCAAGGACGGCATTTCAGCAGCGGTTGGCATGGCCTTTGG ATCTCTG GKFRPVGTDSIFNKV TGAGCTTCAAGCCTGTGAGTCTACCCCGGGCTGGACCATGCTCTCCCTC GTTATAC VNRALLEQQQPHIAH GATTTCAAGAGTGCCTTCAACTACACCGACCGAGCACGGCTGCACGAGA TTGGTC LLQASPELAVGVKDG TTGTGGCCGACAAGGTCCCTGGCCTCTTGCGCGCCTTTGAACGACACTA GTTTTGT ISAAVGMAFGELQAC TGCTCGACCCACTACCCACTGCATTGTCGACAAGTTCTTCAAGGTGATTG ACGATTT ESTPGWTMLSLDFKS ACATTGATGTTGGCCAAGGCATTGTGCAGGGCAACGAGCTATCGCCCTT TTGTTTC AFNYTDRARLHEIVA CTTCTTTGCCCTGTACTCCTGTGAGGTCCTGGGTCTCCTCGACGCCACCA TATGTG DKVPGLLRAFERHYA CTGACTACCGCTGCAAGGTCATCAAGTACCTCGACGACATTGTACTGAT CGTGAT RPTTHCIVDKFFKVIDI GGGTCCCGCGGAGGACGTGGCGGCCGACGTGGAGATTGTCAAGGCTC TCTTCGC DVGQGIVQGNELSPF GTGCAGAGTCTGCTGGCCTTCATCTGCAGCCCAGTAAGAGCCGCTTCTA GCTTGT FFALYSCEVLGLLDAT CATGCCTCGCCACCATTCGGCTTCCATCACTGCTATCAAGTCTGTATTGC ACTTCTT TDYRCKVIKYLDDIVL CAGATGCCGTGCGCGAGACGGCCAACACGGGCATGACGGTCTTGGGAA GGCATG MGPAEDVAADVEIV CGCCGATTGGCCGTCGCGAGTGGATGAAGAAGCAGCTGAACGACAAG ATAGAA KARAESAGLHLQPSK GCAAAGCACATTGCTGGCAAGCTCAATGACATGCTGACGACCGGTGTCT GCCAAT SRFYMPRHHSASITAI CGCTTCAGGCCCTCCTCACGGCCATGCAGTACGTGCCTAGCCTCATCAAC GAATGT KSVLPDAVRETANTG CACCTCTACACGCTGCCCCCAAGTCTCACGTCGGGCTTGTCCGAGCTCTT GTCTTGT MTVLGTPIGRREWM GAACCGTGCTTGCAAGGACACCTTTGTCAAAGCCTTTTTTGCCAAGGTA TCTCTTG KKQLNDKAKHIAGKL AACCTGTCTGCACCGGCTGGAGCTGAAGGTCATGACGTTACGCTGGAAC TGTTGTT NDMLTTGVSLQALLT AGCTCCTTGAGGCTCGCCTCTTCACACGGGCCAACACCGGGGGCTTTGG TTGCGT AMQYVPSLINHLYTL CCTGCACGACTTGGTTGAGCGCGGTCCGGTGGCTTATGTCTGCAACATG GCCGTC PPSLTSGLSELLNRAC GCCAAGCTGGCCACTCGCTACCCTCGGGTCTACGATCGACTTTTGGAGG GTGATTT KDTFVKAFFAKVNLS ATGCATCGAGGGCTGCCGACTTTGAGGCCCACGTGCAGCGAGCTGGCT TGATGT APAGAEGHDVTLEQ TCCAGATGGCCACGGTCAAGGACGCGGCGACCCAGCGACCAGCTGAGA CGGGGT LLEARLFTRANTGGF TCATTGCCCTCCGCTCCAAGGCGGCACTGGACGACCTGATGGCCAAGTG TGCACA GLHDLVERGPVAYVC CGCGCTGGACCTGCAGCAGGCATATCTGGCCTCACGCGAGTGGGGCGT GCTTTG NMAKLATRYPRVYD CAGCACTGTCTTGACCATGCGGGGTCGGGACAAGTTGCGTCGCTTGAGC CTTTCAG RLLEDASRAADFEAH GACACGACCTTTGCCATTGCGGTCGTGTCCATGATGGGTTTTGGCCTCCA CTCTGA VQRAGFQMATVKD TGAACTCATCAACGTCAAGCCGACGGACAAGTGCCCGCTCTGCAGCAGC GGTTCA AATQRPAEIIALRSKA AAGACACCTCAGCCGCGACTGACCCGCGAGCACCTGCTGACCTGCCGTC AACACC ALDDLMAKCALDLQ CCATCAAGCGTCACAACGCCCTTCGCGACGAGATGGGCCGCCTGCTCAG TAATTTA QAYLASREWGVSTVL GTACGCCACCCTCTCCCATGTCTGGGTGGAAAAGTCTGGCTACAACGCC (SEQID TMRGRDKLRRLSDTT AACGGTCAGAGCTGCCGCATCGACCTGCACTGCCGCAACCCCTTTCCCG NO: FAIAVVSMMGFGLH GCGGTGCTCTGGGCCCAGCTCTGCCCGACCTGGGCATTGACGTGACTGT 1282) ELINVKPTDKCPLCSS GCGCACAGCCCAACCCCCGACCACCTCGCAAGCCTGCATCAAGGTGGGC KTPQPRLTREHLLTCR GCTGCCCTTCGCCGAGCCGAAAAGGAGAAGCGCGACTACTACACCGGT PIKRHNALRDEMGRL TTCAACCATGGAAAAACTCTGATCGTCCCTGCGGCGATGACGACAACCG LRYATLSHVWVEKSG GTGGGTTCGCCTCCTCCTTTGTGGATCTGCTTGGTCAGCTCGCCCGCTGC YNANGQSCRIDLHCR GCCGAGGCCCGTGGTGTGTACCAGCCGGGGCTGGATGAGGCCTTTGTT NPFPGGALGPALPDL CCTCGGTGGAAGGGTCGCTTTGCGGCGCTGGTCCATCAGATGAACGCT GIDVTVRTAQPPTTS GACCACATCCAGCGCCACTTTGGCGGTGTCTGCCTGCGCTCGTCGTAGG QACIKVGAALRRAEK TAGGCACCGTCTCGGGGGTCCCTCTGTGGGGATCCCTGTGTGCACCTGT EKRDYYTGFNHGKTL CGCTCCCTAGGTGGTTCCTCGTTGTGTCTTTTGATGGCTTGACTTGTATTT IVPAAMTTTGGFASS TTGTTTTAATTTTGCTTTAATTTTTGCTGTATTTGTGTGGTATTTTTGCTGA FVDLLGQLARCAEAR ATTTTTGTAAGGTCCTTTGTATGATGTCTTTGTCTTCTGTGGTCGGTTGTT GVYQPGLDEAFVPR TTCCTCAATCCGACGTTGTGTCTCGTTGGATGTGAGCGTGCCGTGGTGTT WKGRFAALVHQMN CTTTGTGTTTGTGCTGTGATGGCTTGTAGTTGTGATGTGTGACTGCCTTT ADHIQRHFGGVCLRS TTGGGTGTCTTGTGTTTGAAATGGCCGTATCTCTGGTTATACTTGGTCGT S(SEQIDNO: TTTGTACGATTTTTGTTTCTATGTGCGTGATTCTTCGCGCTTGTACTTCTT 1404) GGCATGATAGAAGCCAATGAATGTGTCTTGTTCTCTTGTGTTGTTTTGCG TGCCGTCGTGATTTTGATGTCGGGGTTGCACAGCTTTGCTTTCAGCTCTG AGGTTCAAACACCTAATTTA(SEQIDNO:1036) CRE CRE- Hydra AATTTAAAAAAAAAAAATCGTTTATTTATGGCATAATACTGTTTGTAATT AATTTAAA TGAGCT MSSCKVTIPHVCPYC 2_HMa vulgaris TTTGAAAATTCGTGCAACAACTGCAGTTAAATTGAAGAGCTGAAATTTA AAAAAAAA CTTATAA KVELKTICGINRHILKC AGATCTGAGCTTTTCAATCAGAGTTTTTTACCCTAAAACATTAAATTTTAT ATCGTTTA ATTTATA KKNPLQIPSLQKTNTS CATAACAAAAATCGTTCTAATATTATTAAACTTAAAGAAATTCGTTCTTAT TTTATGGC TTATAGC LTLEPNTKVIPSITKQ ATCAAATCTTATTTCAGTGTTTCACAGACGAAGGGTTTTACTAGATTTTT ATAATACT ATTTTGT NDIIIASTSSNNLAFN ATTTTTTCAACTTTTGAATTTGTTTATTATAAAACTGTAAACTAGTGTGCA GTTTGTAA TTTA QKKDYTLTPTYSRKTT ACCAACCGTAAAAAWTAGTTAGCTGTTCACCCAAAATATTATTCAGTAT TTTTTGAA (SEQID PVSILSSMKMTPISITS GAAAATATTTAATCTTCTTTATTCGCAGTAAACAATAAAATATCTAGTTA AATTCGTG NO: HIVRRKLPELPSQTTN AACAAAATATTTCTTAATAATAAAAAACAAAAACTTTTTCTTAACAAGTA CAACAACT 1283) HLFNENFINVPFLPEI CAATGAGCAGCTGCAAAGTTACTATACCTCATGTGTGCCCTTATTGTAAA GCAGTTAA MNHLPVPNNNVM GTAGAACTTAAAACAATATGCGGAATAAACCGTCACATTTTAAAATGCA ATTGAAGA WGVYSYQQFKLFVD AAAAGAATCCTTTACAAATACCCAGTCTACAAAAAACTAATACCTCTTTA GCTGAAAT STYDEIVNYRRNIFNI ACACTCGAACCAAATACTAAAGTAATACCCTCAATTACAAAACAAAACG TTAAGATC PSGKAGKEFIEELTFW ATATTATAATAGCATCCACTTCGTCTAACAACTTAGCGTTTAATCAAAAA TGAGCTTT LRKFNSTSSLNSIALK AAGGACTACACATTAACACCTACATATTCTAGAAAAACGACACCCGTAA TCAATCAG VTMILPNLLLQKPSAK GCATACTGTCTTCTATGAAAATGACACCCATAAGTATAACATCACATATA AGTTTTTT SKSKEHTLCLTRRIDL GTTCGCAGAAAACTACCTGAGCTTCCTTCTCAAACAACAAATCATTTATT ACCCTAAA WKKGDTSLLLKEVRN TAATGAGAATTTTATAAATGTTCCCTTCTTGCCTGAAATAATGAACCATC ACATTAAA IQKKFVNSKXKRSMD TACCAGTTCCAAATAACAATGTCATGTGGGGAGTATACTCATATCAACA TTTTATCAT DISRIFAKLIMEGKITA ATTTAAATTGTTTGTGGATTCTACCTATGATGAGATCGTAAATTACCGAA AACAAAAA ALKFLEKEASSGILPLS GAAATATTTTCAACATTCCATCTGGAAAGGCAGGTAAAGAATTTATAGA TCGTTCTA DNTLKDLKSKHPEPS GGAGCTAACCTTTTGGTTAAGAAAGTTTAATTCCACTTCTAGTTTAAATT ATATTATT RVEDYSLLFGPIDLIPK CAATCGCGCTGAAAGTTACAATGATTTTGCCGAATCTTCTTTTGCAAAAA AAACTTAA CFFDCIDEQLVMKAA CCCTCCGCCAAATCAAAGTCCAAAGAACATACATTATGTTTAACTCGTAG AGAAATTC FATKGSAGPSGMDA GATTGACCTTTGGAAAAAAGGAGATACTAGTTTACTGTTAAAAGAAGTT GTTCTTAT DIYRRILCSKNFIKEGK CGAAATATACAAAAAAAATTTGTAAATTCCAAAAAKAAAAGATCTATGG ATCAAATC ELRKEIAKMTQNLLTE ACGATATATCTAGAATATTTGCCAAATTAATTATGGAAGGCAAAATCACT TTATTTCA TYEPTFLEAFTACRLIP GCAGCGCTGAAATTTTTAGAAAAAGAGGCATCATCCGGCATACTACCAC GTGTTTCA LDKNPGIRPIGVGEVL TATCAGACAACACATTAAAAGACCTTAAAAGCAAACACCCTGAACCCTC CAGACGAA RRIIGKVISWSFNSEIK CCGAGTAGAAGATTATAGCTTACTGTTTGGTCCGATTGATTTAATCCCAA GGGTTTTA EAAGPLQTCAGHGA AATGTTTCTTCGATTGTATTGATGAGCAACTAGTTATGAAAGCAGCATTT CTAGATTT GAEAAVHAMKEIFD GCAACTAAAGGATCTGCTGGACCATCAGGAATGGATGCCGATATTTATC TTATTTTTT NVQTDAILLIDAKNA GCCGCATCTTATGTTCTAAAAACTTCATCAAAGAAGGTAAAGAACTCCG CAACTTTT FNCMNRQVALHNIQ AAAAGAAATTGCTAAAATGACACAAAACTTACTAACAGAAACATATGAA GAATTTGT IICPLISIYLINTYRNPS CCAACATTTCTAGAAGCTTTCACTGCTTGTCGATTAATTCCTCTAGATAAA TTATTATA RLFVAGGKEISSQEGT AATCCAGGTATTAGGCCAATTGGAGTAGGAGAAGTATTAAGGCGTATC AAACTGTA TQGDPLAMPWYSC ATAGGTAAAGTAATTAGCTGGAGTTTCAACAGTGAGATAAAAGAGGCA AACTAGTG NTTIIIEHLLVNYPQV GCCGGGCCATTACAAACATGTGCTGGACATGGGGCAGGAGCCGAAGCG TGCAACCA KQVWLADDAAASGS GCTGTACATGCCATGAAGGAAATATTCGACAATGTGCAAACAGATGCAA ACCGTAAA IANLHSWYQHLIDEG TACTTTTGATTGACGCAAAGAACGCTTTTAATTGTATGAATCGACAAGTC AAWTAGT CKHGYYVNQSKCWLI GCCTTACACAACATCCAGATCATTTGTCCATTAATTTCAATTTACTTAATC TAGCTGTT VKSPSLAENAGIVFGK AATACTTATCGAAATCCATCGAGGCTCTTTGTGGCAGGGGGTAAAGAAA CACCCAAA SVNITTEGQRHLGSVI TATCATCCCAAGAAGGCACAACTCAAGGTGATCCCCTTGCTATGCCATG ATATTATT GSQNFKNKYCTEKVA GTACTCTTGTAACACCACGATTATTATAGAACACTTACTTGTAAATTACC CAGTATGA KWLTELKQLCKVAET CACAAGTTAAGCAGGTGTGGTTAGCAGACGATGCTGCAGCTAGTGGAA AAATATTT QPQAAFIAFTKGFRS GCATTGCAAACTTACATAGCTGGTATCAACACCTTATTGATGAAGGATG AATCTTCT KFTYFLRTIPKFEQYLA TAAACATGGCTACTATGTAAACCAATCTAAATGCTGGTTAATTGTAAAAT TTATTCGC PVDEILSHLLLPTLFGK CCCCCTCGTTAGCAGAGAATGCAGGCATAGTATTTGGTAAATCGGTCAA AGTAAACA DTPFEDHIRKLFTLTP CATAACTACAGAGGGTCAACGACATTTGGGTTCAGTAATAGGTTCGCAA ATAAAATA RDGGLGIPILVEEAPH AATTTTAAGAACAAATATTGCACTGAGAAAGTAGCAAAATGGTTAACCG TCTAGTTA QFLSSVKLTKNLVQQI AGTTAAAACAACTTTGTAAAGTAGCAGAGACGCAACCACAGGCCGCTTT AACAAAAT IDQDKILKTKNSSGNV TATTGCGTTTACAAAAGGATTTCGTTCAAAATTTACATATTTCCTAAGAA ATTTCTTA LEDLEKILTTDRLKHR CTATTCCAAAATTTGAACAATATCTAGCGCCCGTAGACGAAATACTTAGT ATAATAAA KEKIIAVDSMQPDSM CATTTGTTGTTGCCAACTCTTTTTGGAAAAGATACGCCCTTTGAGGATCA AAACAAAA LRNIQQTRSECASTW CATTAGAAAACTTTTTACATTAACTCCTCGAGATGGAGGATTGGGTATAC ACTTTTTCT LNALPLENQGFVLNK CTATACTAGTTGAAGAAGCGCCTCACCAGTTTTTATCATCTGTTAAATTA TAACAAGT EEFRDALCLRYNFDLK ACTAAAAATCTTGTACAGCAAATTATAGATCAAGATAAAATTTTAAAAAC ACA(SEQ NIPRICECGEPFNVTH AAAAAACTCTTCCGGGAATGTTCTAGAAGATCTTGAAAAAATATTAACT IDNO: ALSCKKGGFISSRHD ACTGACAGACTTAAGCATCGCAAAGAGAAAATAATTGCAGTTGATTCAA 1160) NIRNLFTTLLKRVCIN TGCAACCAGATTCAATGTTAAGAAACATACAGCAAACAAGAAGCGAAT VQSEPHLIPLDNENFY GTGCTAGCACCTGGTTAAACGCCTTACCACTAGAAAACCAAGGTTTTGTT FHTANKSNQARLDIK TTAAATAAAGAAGAATTTCGAGATGCACTTTGCTTACGTTATAATTTTGA ANGFWRNGQTAFF CTTGAAAAATATCCCTCGTATTTGTGAATGCGGAGAACCTTTCAATGTAA DVRVTHVNSMSNKN CTCATGCATTGTCGTGTAAGAAAGGTGGCTTCATTTCAAGTCGTCATGAT LDIAAIFRKHEKEKKR AACATAAGAAATTTGTTCACCACATTGCTTAAAAGAGTGTGCATAAATGT EYGERVREVEHGSLT TCAATCCGAACCACACCTCATACCCCTTGATAATGAGAATTTTTATTTCCA PLVFGTNGGMGKEC TACAGCAAATAAAAGTAACCAAGCTCGTTTAGATATTAAAGCAAATGGT HRFVRRLAEKLAEKQ TTTTGGCGAAATGGACAGACAGCATTTTTTGACGTAAGAGTTACGCATG NEKYSVVMTWLRTK TCAATTCCATGAGCAACAAAAATCTAGATATAGCTGCTATATTCAGAAAA LSFEILRSTILCLRGSRT CACGAAAAGGAAAAAAAAAGAGAGTATGGCGAACGAGTTCGTGAAGT PWTKKNDFEIGVDFK CGAACATGGCAGCTTGACACCACTAGTGTTTGGTACAAATGGAGGTATG MDALEARI(SEQID GGTAAAGAATGTCATCGGTTTGTTAGAAGGCTAGCGGAGAAACTAGCG NO:1405) GAAAAACAAAATGAAAAATACTCAGTTGTTATGACATGGTTAAGAACAA AGTTATCTTTTGAGATACTTCGCTCAACTATCCTTTGTTTAAGAGGCTCAA GAACACCCTGGACTAAAAAAAACGATTTTGAAATCGGTGTTGATTTTAA GATGGATGCTCTAGAAGCAAGAATCTGAGCTCTTATAAATTTATATTATA GCATTTTGTTTTA(SEQIDNO:1037) CRE MoTeR1 JQ747487 Magnaporthe CCCGAACCCGAACCCAAACCCAAACCCAAACCCAAACCCAAACCCAAAC CCCGAACC TAATAG MVCPTCNGVYADYN oryzae CCAAACCCAAACCCGGAGGGTTCCCAAGTCGCCTAAACCCGAAGGGTTT CGAACCCA GTAACG DHIRKKHPDERYTAL AGGATATTATTTCGTTTATTAGAATTGGATAATTATTTACCCCTGTTGGA AACCCAAA TCCCTAT QLQPLGLTPCPICKTA CAGGGGGGTTGCAGGGGTTAAATTAAGGTTTTTTATTATTTATGCGCCG CCCAAACC TTTTGTC CKNDLGVKTHLSKIH TTTATTTGTTTACCCCCCCAAATATTATAAAAGCGCGTTCCATCCTCTTAG CAAACCCA TTTGGTT KISGASKISTQPRIRTE GAAAAGCGAAGCTTTTCCTTGTAAAAGTCGCTAGACTTTTACTATAAAA AACCCAAA TTGTTTT NTDNTNSVPTSSFNP GTCGCTAGACTTTTATACCAATCTTTTAACAAAAAGCGTAGCTTTTTGTT CCCAAACC TATCTTT VLPEIQTLTPGLNNSR GCCAATCTATTAAAAAAAGCGGAGCTTTTTTTAACTTTTTCTTTTTTTTTTT CAAACCCG GTTTTTG WADNPRKRRADTPS TTTTTCTTTTTTTTTTTTTTTTTTCTTTTTTTTTTTTTTTTTTTTTTATATATAT GAGGGTTC TTTTTGT PTRGRNTRPRRFSYT TATTATTATTATTATTAGCGGTGGGGCTATTTATGCGCTTTAATTTGTGC CCAAGTCG TTTCGTT DIDLTNDEPADNPRA GGGGCTATTTATGCGCTTTAATTTGTGCGGGGCTATTAATGCGCTTTAAC CCTAAACC TTTGTTT NNPRVNNPRVNNEP TTTACAAATTTTATTTATGCGCTTTAATTGCTGCGGGCCTGTTAATGCGCT CGAAGGG TTGTTTT PSSPNSLPSISEFHTP TTAATTTACAAATTTCATTAATGCGCTTTAACTTTTATATTTACTAATGCG TTTAGGAT CGTTTTT GTLPLTNSNISLKDQH TTATTTATATAATTGCTATTATTATCGTTGCTATTATTATTATTGCTATTAT ATTATTTC GTTTTTT DKITGPILQKPLIQKLI TATCGTTATTATTATTGCAATTTTATTATATAAACCCTCGTTTGTCCCTCG GTTTATTA TTTTTGT EYSKIPIPEHHLHARQ ATTTATCCCGTTTCTTTTCCATCCCATCGCGCGTTTTCGTAAGCTTTGGTT GAATTGGA TTTTGTT AKIFADAANRIAKNFI TTCGTAGGATTTGCTTTCGTAGGCTTTGCTTTCGTAGGCTTTCGTCAGCTT TAATTATTT TTTGTTT QSPTEKTLFNLLILPRI TTACCTGCTTTTATTTTTTCTTTTTCTTTTTATTCCCCCCCCTTTTTTTTACCT ACCCCTGT TTGCCTT FGIGLINGKVTKIMQ GGTTTATTAGCGGTTTACCTGCTTTTATTACCTGGTTCCCCTTTACCTGTT TGGACAG TGTTTTT NFPSQIPPIPKIDFPSE TTATTAGCGGTTTACCTGCTTTTATTACCTGGTTCCCCTTTACCTACTTTAT GGGGGTT GTTTTTA KTDSDPVLNAKKLLE AAGCGGTTTACCTGCTTTTATTACCTGGTTCCCCTTTACCTGTTTTATTAG GCAGGGG TCTTTAT KGYIGRAAKAIIDPTP CGGTTTACCTGCTTTTATTACCTGGTTCCCCTTTACCTGTTTTATTAGCGG TTAAATTA TTTTGTT VAPETPESLNILREKH TTTACCAGCTTTTATTACCTGGTTCCCCTTTACCTACTTTATTAGCGGTTTA AGGTTTTT TTTGTTT PIGQNNPFNTKSQPI CCCGTTTCTATTAGTGGGCATTTATTTCCCGTTTTTATTAGCAGTTAAATT TATTATTTA TTACTTT SGRQITEKAILLAISSI TACCCTTTTAAGGTTATTTACCTGCTTTTATTCACAGGGCACCCCTGTTTT TGCGCCGT GTTTTAT GREKAPGLSGWTRSL TACTAGCAGTTAAATTTACCTTTTTAAGGTTATTTACCTGCTTTTATTCAC TTATTTGTT TTGTTTT LDAAIKIPTQNDVIPA AGGGCACCCCTGTTTTTACCAGCAGTTAAATTTACCTTTTTAAGGTTATTT TACCCCCC ATATTTA LRLLTDMIRQGTAPG ACCTGCTTTTATTAACAACCCTTTATTTTTTCCTATTAACGGGTATTTATTT CAAATATT CCTTTTG RELLCASRLIGLSKPD ACCTGTTTTATTGGAATTCACCCGTTGGACGGCATGGTTTGCCCAACCTG ATAAAAGC ATTTTTT GGVRPIAVGDLLYKIA TAACGGCGTTTACGCCGATTACAACGACCATATCCGGAAAAAACACCCG GCGTTCCA CTATTTT FKAILNTLWSPNCLLP GACGAACGTTATACCGCCCTCCAACTCCAACCATTGGGTTTAACCCCCTG TCCTCTTA TCCCACC YQLGVNSIGGVEPAIF CCCTATATGCAAAACCGCTTGCAAAAACGATTTGGGCGTTAAAACCCAC GGAAAAG CTTATTA TLEEAIMGPNINGIKS CTATCCAAAATCCACAAAATATCCGGTGCATCGAAAATTTCAACCCAACC CGAAGCTT TTATAAC ITSLDLKNAFNSVSRA GCGTATACGAACGGAAAATACGGATAATACCAATTCGGTCCCCACGTCG TTCCTTGT CCCAAC AIASSVAKYAPTFYRS TCGTTTAACCCTGTCCTTCCCGAAATCCAAACGTTAACCCCGGGGTTAAA AAAAGTCG CTACTAA TCWAYNQPSILITEN TAACAGCCGTTGGGCCGATAACCCCAGAAAACGACGGGCCGATACCCC CTAGACTT TATTTTT GSVLASAQGIRQGDP CTCCCCAACACGGGGTCGGAATACACGCCCACGTCGATTTTCATATACG TTACTATA TCTTTTT LGPLLFSLAFRPTLETI GATATCGATTTAACAAACGACGAACCGGCGGATAACCCCAGGGCTAATA AAAGTCGC TCTTTTT QKSLPYTYIAAYLDDV ACCCCAGGGTTAATAACCCCAGGGTTAATAACGAACCCCCCTCCAGCCC TAGACTTT TCTTTTT YILSKTPVKDKIAKIIEK AAATTCGTTACCTTCGATTTCCGAATTTCACACCCCTGGGACCCTACCCCT TATACCAA ACGGTT SPFTLNSAKTTETDID AACCAATTCGAATATATCGTTAAAAGACCAGCACGACAAAATTACCGGC TCTTTTAA TTATTTT TLKTNGLKTLGSFIGP CCTATATTGCAAAAACCGTTAATCCAAAAATTAATCGAATATTCGAAAAT CAAAAAGC CCCGTTT TELRKEFLQNKIQNFE CCCAATCCCAGAACACCACCTCCACGCCAGGCAGGCTAAAATTTTTGCTG GTAGCTTT GTTTTTT SSINALKKLPKQYGLLI ACGCCGCAAATCGAATCGCCAAAAATTTTATACAAAGCCCAACGGAGAA TTGTTGCC CTATTTT LRKSTQLLLRHLLRTL AACATTATTTAATTTACTTATATTACCCCGCATATTCGGTATCGGGTTAAT AATCTATT ATTTGTA NSQDLWELWEKTDK AAACGGAAAAGTAACTAAAATAATGCAAAACTTCCCATCCCAAATACCC AAAAAAA CGACAA LIADFVINLTVTKRKK CCTATTCCAAAAATTGATTTTCCATCCGAAAAAACCGATTCCGACCCGGT GCGGAGCT AACCCTT RPITDFVTPLITLPIKD TTTAAACGCCAAAAAATTATTGGAAAAAGGGTATATTGGCCGTGCGGCA TTTTTTAAC AGCAAA GGFGLLRHNGIAQDI AAGGCTATTATCGATCCAACCCCCGTTGCCCCAGAAACCCCGGAATCGT TTTTTCTTT TAAGCTT YFAAKDLTTEIRHKIQ TAAATATTTTACGGGAAAAACACCCTATTGGCCAAAATAACCCGTTTAAT TTTTTTTTT AGAATA RISNDFPQNQSPTAT ACAAAATCCCAACCAATATCAGGCAGGCAAATTACCGAAAAAGCTATTT TTTTCTTTT TAATAA EILHLLHNGVLADCK TATTAGCTATTTCGTCTATTGGCCGGGAAAAAGCTCCGGGCCTTAGCGG TTTTTTTTT AGCGCG NGLTNAQLNALTEN GTGGACGAGATCGTTATTAGATGCAGCCATTAAAATACCTACCCAAAAC TTTTTCTTT AATTAA ASYLGRKWLNILPIQK GACGTAATTCCGGCTTTACGACTCTTAACGGATATGATTCGCCAGGGTA TTTTTTTTT AA(SEQ SNRLTDWEMAEAVR CCGCACCGGGTAGGGAATTATTATGCGCTTCGCGTTTAATAGGGCTATC TTTTTTTTT IDNO: LRLLAPVKPLTHPCN CAAACCCGACGGCGGCGTACGCCCAATAGCCGTTGGGGACCTATTATAT TATATATA 1284) HCGNRTNINHEDVC AAAATAGCCTTTAAAGCTATTTTAAATACCCTATGGTCCCCAAACTGTTT TTATTATTA KGAVRKYTARHDQIN ATTACCTTACCAATTAGGTGTAAATAGTATAGGTGGCGTCGAACCCGCT TTATTATTA RSFVNSLKSRPEIDVEI ATTTTTACCCTCGAAGAGGCTATAATGGGCCCTAATATTAACGGTATAAA GCGGTGG EPDLNNENNVNNAN ATCGATTACCTCCCTCGATTTAAAAAACGCGTTTAATAGCGTATCCAGGG GGCTATTT TTTENPTPSPNGQN CTGCAATAGCCTCGTCGGTAGCTAAATACGCACCAACTTTCTACCGTTCT ATGCGCTT DTGCLFTTPIRSGTRN ACCTGTTGGGCCTATAACCAACCTTCGATTTTAATAACGGAAAACGGTTC TAATTTGT GQNGLRADFAVING CGTCCTGGCTAGTGCACAAGGTATACGCCAAGGCGATCCGTTAGGCCCG GCGGGGC VSKYYYDVQIVAINKD TTGTTATTCAGCCTTGCTTTTCGACCTACGTTGGAAACGATCCAAAAATC TATTTATG SGNTNPLNTLADAA GCTTCCATATACGTATATAGCGGCTTATTTGGACGACGTTTATATTTTATC CGCTTTAA NNKRRKYQFLDPFFH CAAAACGCCCGTTAAAGATAAAATAGCCAAAATAATCGAAAAAAGCCC TTTGTGCG PIIISAGGLMEKDTAQ GTTTACCCTAAATTCCGCCAAAACGACAGAAACGGATATCGATACGTTA GGGCTATT AYKQIQKLIGPVAAH AAAACCAATGGTTTAAAAACGCTCGGCTCGTTTATTGGACCAACGGAAT AATGCGCT WLDTSISLILLRSRTTA TACGGAAGGAATTTTTGCAAAATAAAATTCAAAATTTCGAATCGTCCATT TTAACTTT AISIAKNRPRA(SEQ AACGCCCTGAAAAAACTCCCTAAACAATACGGATTGCTAATCTTGCGTA ACAAATTT IDNO:1406) AAAGTACACAATTACTTTTACGCCATTTGCTCCGTACTTTAAATTCCCAG TATTTATG GACCTGTGGGAATTATGGGAAAAAACAGATAAATTAATAGCGGATTTC CGCTTTAA GTTATAAATTTAACTGTTACAAAACGGAAAAAACGGCCAATTACGGATT TTGCTGCG TCGTTACGCCGTTAATTACGTTACCTATAAAGGACGGAGGTTTTGGATTA GGCCTGTT TTACGGCATAACGGAATAGCCCAAGATATTTATTTTGCGGCCAAGGATT AATGCGCT TAACAACCGAAATTCGGCACAAAATCCAACGTATATCCAACGATTTTCCA TTAATTTA CAAAATCAAAGCCCTACCGCCACCGAGATTTTGCATTTGTTGCATAACG CAAATTTC GGGTTTTAGCAGATTGCAAAAACGGGTTAACAAACGCCCAATTAAACGC ATTAATGC TTTAACCGAAAACGCTAGTTATTTAGGTCGAAAATGGCTTAACATTTTAC GCTTTAAC CTATCCAAAAATCAAATCGATTAACGGATTGGGAAATGGCTGAAGCCGT TTTTATATT TCGATTAAGATTATTAGCCCCGGTTAAACCGTTAACCCACCCCTGCAACC TACTAATG ATTGCGGAAATCGGACCAATATAAACCACGAGGACGTTTGCAAAGGTG CGTTATTT CCGTACGCAAATATACGGCCCGTCACGACCAAATAAACAGAAGTTTCGT ATATAATT CAATTCGTTAAAAAGTCGACCAGAAATCGACGTCGAAATCGAACCCGAT GCTATTAT TTAAATAACGAAAATAACGTAAATAACGCCAATACAACCACCGAAAATC TATCGTTG CCACCCCTAGCCCCAACGGCCAAAACGATACCGGATGCCTTTTTACAACC CTATTATT CCTATTCGCTCCGGGACCCGTAACGGCCAAAACGGCCTTAGGGCGGATT ATTATTGC TTGCCGTTATTAACGGCGTATCCAAATATTATTACGACGTGCAAATCGTT TATTATTAT GCAATTAATAAGGATTCCGGTAATACAAATCCGTTAAATACGTTAGCAG CGTTATTA ACGCAGCAAATAACAAACGACGTAAATACCAATTTTTGGATCCATTTTTC TTATTGCA CATCCAATTATAATAAGCGCCGGAGGCCTTATGGAAAAGGATACAGCAC ATTTTATTA AGGCGTACAAACAAATCCAAAAATTAATAGGCCCCGTTGCGGCCCATTG TATAAACC GTTGGATACGTCGATTTCGTTAATTTTGTTACGGTCCAGAACGACGGCA CTCGTTTG GCAATTTCTATTGCTAAAAACCGCCCTCGTGCGTAATAGGTAACGTCCCT TCCCTCGA ATTTTTGTCTTTGGTTTTGTTTTTATCTTTGTTTTTGTTTTTGTTTTCGTTTT TTTATCCC TGTTTTTGTTTTCGTTTTTGTTTTTTTTTTTGTTTTTGTTTTTGTTTTTGCCTT GTTTCTTTT TGTTTTTGTTTTTATCTTTATTTTTGTTTTTGTTTTTACTTTGTTTTATTTGTT CCATCCCA TTATATTTACCTTTTGATTTTTTCTATTTTTCCCACCCTTATTATTATAACCC TCGCGCGT CAACCTACTAATATTTTTTCTTTTTTTTTTTTCTTTTTACGGTTTTATTTTC TTTCGTAA CCGTTTGTTTTTTCTATTTTATTTGTACGACAAAACCCTTAGCAAATAAGC GCTTTGGT TTAGAATATAATAAAGCGCGAATTAAAA(SEQIDNO:1038) TTTCGTAG GATTTGCT TTCGTAGG CTTTGCTTT CGTAGGCT TTCGTCAG CTTTTACCT GCTTTTAT TTTTTCTTT TTCTTTTTA TTCCCCCC CCTTTTTTT TACCTGGT TTATTAGC GGTTTACC TGCTTTTA TTACCTGG TTCCCCTTT ACCTGTTT TATTAGCG GTTTACCT GCTTTTAT TACCTGGT TCCCCTTT ACCTACTT TATAAGCG GTTTACCT GCTTTTAT TACCTGGT TCCCCTTT ACCTGTTT TATTAGCG GTTTACCT GCTTTTAT TACCTGGT TCCCCTTT ACCTGTTT TATTAGCG GTTTACCA GCTTTTAT TACCTGGT TCCCCTTT ACCTACTT TATTAGCG GTTTACCC GTTTCTAT TAGTGGGC ATTTATTTC CCGTTTTT ATTAGCAG TTAAATTT ACCCTTTT AAGGTTAT TTACCTGC TTTTATTCA CAGGGCAC CCCTGTTT TTACTAGC AGTTAAAT TTACCTTTT TAAGGTTA TTTACCTG CTTTTATTC ACAGGGC ACCCCTGT TTTTACCA GCAGTTAA ATTTACCT TTTTAAGG TTATTTAC CTGCTTTT ATTAACAA CCCTTTATT TTTTCCTAT TAACGGGT ATTTATTTA CCTGTTTT ATTGGAAT TCACCCGT TGGACGGC (SEQID NO:1161) HERO HERO- Branchiostoma TTTTCAGTCTGGCTCAGCCAGTGACCGCCGGGAAAGTCCGGCTGACTAC TTTTCAGT TGATTA MNAVCVCGKVCKN 2_BF floridae CACGAATAGGGTGGTGACAGCTGGATAGACAGACGACAGCTCGGAAA CTGGCTCA AAGACC QRGLRIHQTKMACLR GACGGCATTGGGGCAGTATGGGTTGGCACCCCTAACTGCATCTCCCCTA GCCAGTGA CGAAAC RVQAEHRSGAVATT GGAGAGCATCCCGCAACACGCTACAAAGAACCACAAAGAGCAATACCC CCGCCGGG ACCCAA VEPVLSASAPGQTEE CCAGGGATGCCCGAGAGGGGGGGAGGATGAGCATCCCATTCGGACGG AAAGTCCG TGACCC DQGPEAPHSARNLR TCCAATCGGTATTGACCCCAGCAAACGGAGAATCGACAATGAATGCAGT GCTGACTA CGGGTT ATPAPPQGRKSDHH CTGTGTGTGTGGCAAGGTATGTAAGAACCAGAGAGGTTTGAGAATCCA CCACGAAT CATCACT RVKWPAANSKEWS CCAAACAAAGATGGCCTGCTTAAGGAGGGTGCAGGCGGAGCACCGCTC AGGGTGG GATGAT QFDEDVDMILESVSR AGGGGCTGTGGCAACCACTGTAGAACCAGTGTTGTCAGCATCAGCCCCT TGACAGCT GTGTCC GSTDQKLQSMCTVI GGTCAGACGGAGGAGGATCAGGGCCCGGAAGCTCCCCACAGTGCCCG GGATAGAC CTGTTC MSMGAERFGTIGQR GAACCTCCGCGCAACGCCTGCCCCTCCACAAGGCAGGAAGTCAGATCAT AGACGACA GCACTA KPTDTMKPNRREVKI CACCGAGTGAAGTGGCCAGCCGCAAACTCCAAGGAGTGGTCGCAGTTT GCTCGGAA CCAGAG RQLRQELKSLRRSFKA GACGAGGACGTTGACATGATCTTGGAGTCGGTGTCAAGAGGTAGTACA AGACGGC TGTATTC STSGEERAALAELTH GACCAAAAGCTTCAGTCCATGTGCACAGTGATTATGTCCATGGGGGCAG ATTGGGGC TAGAG HLREKLRTLRRAEWH AACGATTTGGCACGATTGGGCAGAGGAAACCGACAGACACAATGAAGC AGTATGGG (SEQID KKKGKERARKRSAFIT CAAATCGCCGGGAAGTAAAGATCCGTCAACTGAGGCAGGAGCTAAAGT TTGGCACC NO: NPFGFTKRLLGQKRS CGTTGAGGCGGAGCTTTAAGGCGAGTACGTCGGGAGAGGAGAGAGCT CCTAACTG 1285) GNLTCPVEEINLHLSN GCTCTTGCAGAGCTCACACACCACCTTAGGGAGAAGCTTAGGACCCTCA CATCTCCC TFSDASRDVDLGPCP GAAGGGCAGAGTGGCACAAGAAGAAGGGTAAAGAAAGAGCCCGGAA CTAGGAGA LLVTSPEPEVHFDISE GCGCAGTGCTTTCATCACCAACCCTTTCGGCTTCACCAAGCGACTCCTAG GCATCCCG PTLKEVRETVKAARSS GGCAGAAGAGGAGTGGGAACCTGACCTGCCCAGTCGAGGAGATCAACC CAACACGC SAPGPSGVVYKVYKH TCCACCTCAGCAATACCTTCAGTGATGCCTCGAGAGATGTGGATCTTGG TACAAAGA CPRLVVRLWRILKVV TCCTTGCCCTTTGCTGGTGACTTCACCTGAGCCGGAAGTGCACTTTGACA ACCACAAA WRRGKVAADWRQA TCTCTGAACCAACTCTGAAGGAGGTCAGAGAGACAGTCAAGGCGGCGA GAGCAATA EGVWIPKEEESSKVD GGTCCAGTTCGGCGCCAGGTCCCAGTGGCGTGGTATACAAGGTCTACA CCCCCAGG QFRLISLLSVEGKIFFKI AACATTGCCCACGGCTTGTGGTGCGCCTCTGGAGGATCCTAAAGGTGGT GATGCCCG VAQRLIKYLLDNQYID CTGGCGCAGAGGTAAAGTGGCGGCTGATTGGAGGCAAGCCGAGGGGG AGAGGGG TSVQKGGVPGVPGC TTTGGATCCCAAAGGAAGAGGAGTCAAGTAAGGTAGACCAGTTCCGCT GGGAGGA LEHTGVVTQLIREAKE TAATTTCTCTGCTCAGTGTTGAGGGAAAGATCTTCTTCAAGATTGTGGCC TGAGCATC NRGDLAVLWLDLAN CAGCGTCTAATAAAGTACCTTCTGGACAACCAGTATATTGACACATCTGT CCATTCGG AYGSIPHKLVETALTR GCAGAAGGGGGGAGTTCCTGGTGTCCCAGGATGTCTTGAACACACGGG ACGGTCCA HHVPESIQNLILDYYS CGTAGTGACCCAGCTCATCCGGGAGGCTAAGGAGAACAGAGGGGACTT ATCGGTAT NFWLRAGSSTATSA GGCAGTCTTGTGGCTGGATCTCGCGAATGCGTATGGTTCGATCCCCCAC TGACCCCA WQRLEKGIITGCTISV AAGCTTGTGGAAACAGCACTGACCAGACACCATGTTCCAGAGTCAATTC GCAAACG PLFALAMNMIVKGA AGAACCTCATCTTAGATTACTACAGCAACTTCTGGCTAAGAGCTGGCTCC GAGAATCG EAGCRGPVSRSGTRQ AGTACAGCAACTTCAGCATGGCAACGGTTAGAGAAGGGCATCATTACTG ACA(SEQ PPIRAFMDDLTVMT GATGTACGATTTCAGTGCCCCTCTTTGCACTAGCGATGAACATGATTGTT IDNO: ATVPVCRWLLQGLER AAAGGAGCGGAAGCAGGATGTAGGGGTCCCGTGTCTAGGTCTGGAACC 1162) LITWARMSFKPAKSR AGGCAGCCGCCGATTCGAGCCTTCATGGACGATCTGACGGTGATGACT SLVLKKGKVAERFRFT GCAACAGTCCCGGTGTGTAGATGGCTCCTACAGGGATTAGAGCGTCTCA LGGTQIPTVSEKPVKS TTACATGGGCACGGATGAGTTTCAAGCCGGCCAAGTCAAGATCTCTTGT LGKVFNSSLKDTASV CCTGAAGAAGGGGAAGGTGGCTGAAAGGTTCCGTTTCACCCTGGGAGG QQTRSDLTTWLEGID CACTCAGATTCCCACAGTGTCAGAGAAACCAGTCAAGAGTCTGGGCAAG KTGLPGSFKAWMFQ GTGTTCAACAGCTCTCTGAAGGACACCGCTTCAGTTCAGCAGACTAGGA HGVLPRVLWPLLVYE GTGACCTGACAACGTGGCTCGAGGGAATTGACAAGACAGGGCTACCTG VPMTMVEQLERTISR GTAGCTTCAAGGCCTGGATGTTCCAGCATGGAGTCTTGCCAAGGGTACT FLRKWLGLPRSLSNIA CTGGCCTCTTCTTGTGTACGAGGTGCCGATGACCATGGTGGAGCAACTG LYGRSTKLQLPLSGLT GAGAGAACCATCAGCAGGTTCCTTCGCAAATGGTTGGGGCTCCCGAGG EEFKVTRAREVLMYR TCCTTAAGCAACATTGCCCTGTACGGTAGATCCACCAAGCTGCAGCTTCC DSSDSKVSSAGIHVR CTTGAGTGGCCTGACTGAAGAGTTCAAGGTTACCCGTGCAAGAGAAGT TGRKWKAQEAVDQ GTTGATGTACCGGGACTCCTCAGACTCCAAGGTCTCTTCAGCCGGCATC AEARLRHSVLVGSVA CATGTCAGGACTGGAAGAAAATGGAAGGCACAGGAAGCAGTGGATCA VGRAGLGSCPKPRYD GGCAGAGGCAAGGTTGAGACACAGTGTCCTCGTGGGGTCCGTGGCAGT KVSGKEKRLLIQDEIR AGGACGGGCAGGACTGGGCAGCTGCCCAAAGCCTCGGTACGACAAAGT AGEEEDRRCRMVG CAGCGGGAAGGAGAAGCGTCTACTGATCCAGGATGAGATAAGGGCTG MRKQGAWTRWEH GGGAAGAGGAGGATCGGCGATGCAGGATGGTAGGCATGCGCAAGCAA ADSRKVTWPELCRAE GGTGCGTGGACTAGGTGGGAACATGCTGACTCCCGCAAGGTCACATGG PSRIKFLISSVYDVLPS CCAGAGTTGTGCAGAGCTGAGCCTTCTCGGATCAAATTTCTCATCTCTTC PANLHVWGLAETPS AGTGTACGACGTGCTTCCAAGTCCAGCTAACTTGCATGTCTGGGGCTTG CQLCQRRGTLEHILSC GCAGAGACCCCCTCATGCCAACTCTGTCAGAGGAGAGGTACCCTTGAAC CPKALGEGRYRWRH ACATTCTCAGTTGTTGTCCGAAAGCACTAGGGGAAGGGAGGTACCGCT DQVLRVLADTVSNAI GGCGGCATGACCAGGTTCTTAGGGTGTTGGCAGACACAGTTAGCAACG QSSRSQQPPKKSIVF CCATCCAGAGTAGCAGGAGTCAGCAACCCCCCAAGAAGTCAATTGTCTT VRAGEKTRQQPTSA TGTCAGGGCCGGAGAGAAAACCCGACAACAACCCACTTCCGCAGGTGG GGLLSTARDWQLLV GCTTCTCTCCACTGCTAGAGATTGGCAGCTTCTAGTCGACCTTGGGAGA DLGRQLKFPEHIVATS CAGCTCAAGTTTCCAGAACACATTGTAGCCACGTCACTTCGCCCTGACAT LRPDMVLVSESTRQV GGTACTCGTGTCAGAATCCACCAGACAAGTGGTTCTGCTGGAGCTAACT VLLELTVPWEERISEA GTTCCCTGGGAGGAGCGGATAAGCGAAGCCAACGAGCGGAAGAGGGC NERKRAKYAELVVQS GAAGTATGCCGAACTGGTAGTACAAAGCCAGAGTAATGGGTGGAGAGC QSNGWRARCVPVEV CCGGTGTGTACCAGTGGAGGTTGGTTGCCGGGGTTTCGCAGGGCAGTC GCRGFAGQSLAYVLK TTTGGCTTATGTGTTAAAACTCCTTGGAGTAAGAGGTTTCCGTCTTCGGA LLGVRGFRLRKSIRDIL AATCCATCAGGGATATTCTAGAGGCTGCGGAGAAAGCCTCACGTTGGTT EAAEKASRWLWFRR GTGGTTCCGTAGGGGGGAACCGTGGAAGCCACACGGACACAGGTCGG GEPWKPHGHRSGN GGAATGATCAACCTCGGCTGGGTCGCCCGGGCGAGGGTGTATGGTGAT DQPRLGRPGEGVW TAAAGACCCGAAACACCCAATGACCCCGGGTTCATCACTGATGATGTGT (SEQIDNO:1407) CCCTGTTCGCACTACCAGAGTGTATTCTAGAG(SEQIDNO:1039) HERO HERO- Danio TTCAAGCCTGGCGCAGCCAGTGACTCCTAGGAATAGACTAGGTGGCAA TTCAAGCC TGATCA MTHANEQTTNKIYVT 2_DR rerio CCAAGAATAGTTTGGTCGACTACTGGAGAGACAGTTGACGGCACGGAA TGGCGCAG ACCCCG CICGKLCKNHWGLKI AGACGGCACTTGGGACAGTATGGGTTAGCACCCCAGCCTGTGTCTTTCG CCAGTGAC GCTGGG HQARMKCLEQESKV TGAGAGAGAACCCAAACAAGCTACGGAAAGCCCCACAGAGATATACCC TCCTAGGA TCACCTG QRTGPEPGETQEEPG CCAGGAGATCCCGAGAGGGGGGGAGGATGAGATCTCCAATCGGACGG ATAGACTA GGTGAG PEATHRAKSLHVPEP ATCAAAGGTTAATGACCCATGCAAACGAACAGACGACGAACAAAATAT GGTGGCA AGTGTA QTPSEVVQQRIKWP ATGTGACATGCATTTGCGGAAAGCTGTGTAAGAACCATTGGGGCCTAAA ACCAAGAA TGATGTT PASKGSEWLQFDED AATCCATCAGGCCAGAATGAAATGTTTGGAGCAGGAGAGTAAGGTGCA TAGTTTGG GAGAGA VSNIIQAIAKGDADSR ACGCACAGGTCCTGAACCTGGTGAGACGCAGGAGGAGCCCGGCCCGG TCGACTAC CCCGAA LKTMTTIIFSYALERFG AGGCAACCCACAGAGCCAAGTCCCTCCATGTACCAGAGCCTCAAACTCC TGGAGAG ACACTC CIEKGKTKPTTPYTM AAGCGAAGTAGTTCAACAGCGGATTAAATGGCCCCCAGCCAGCAAAGG ACAGTTGA AATGAT NRRATQIHHLRQELR AAGTGAGTGGCTGCAGTTCGATGAAGATGTGTCCAACATCATTCAAGCC CGGCACG CCCAGG SLKKLYKKATDEEKQP ATAGCCAAAGGAGATGCAGATAGCCGACTCAAAACGATGACTACCATC GAAAGAC ATACATC LAELKNILRKKLMILR ATCTTCAGCTATGCTCTAGAAAGATTCGGTTGCATAGAGAAAGGAAAGA GGCACTTG ACTGAT RAEWHRRRGRERAR CCAAGCCCACCACCCCCTACACTATGAACCGTAGGGCTACCCAGATACA GGACAGTA GATGTG KRAAFITNPFGFTKQL TCACCTGCGTCAGGAGCTTCGCTCCCTCAAGAAACTGTATAAGAAAGCT TGGGTTAG TCCCAA LGDKRSGRLECSIEEV ACGGATGAGGAGAAGCAACCATTAGCGGAGTTGAAAAACATTTTGCGG CACCCCAG ATGCAT NRFIEETVSDPLREQE AAGAAGCTGATGATCCTACGCAGGGCAGAGTGGCATCGGAGACGAGG CCTGTGTC CCATGA LEPNKALISPTPPARE GCGAGAGAGAGCCAGGAAGCGAGCTGCCTTCATCACCAATCCCTTTGG TTTCGTGA GATGTTT FSLRGPSLKEVKEIIKA CTTCACAAAACAGCTGCTCGGGGACAAGCGGAGCGGTCGACTTGAATG GAGAGAA CTTGCAT SRSASTPGPSGIPYLV CTCAATAGAGGAAGTGAATCGCTTCATTGAGGAAACAGTGAGTGATCCA CCCAAACA AA(SEQ YKRCPGLLLHLWKILK CTGAGAGAGCAGGAGCTGGAGCCCAACAAAGCTCTTATCAGCCCCACCC AGCTACGG IDNO: VIWQRGRVAEQWR CTCCAGCAAGAGAGTTCAGTTTGAGGGGGCCAAGTCTGAAGGAGGTCA AAAGCCCC 1286) CAEGVWIPKEENSKN AGGAAATCATTAAGGCATCTCGCTCAGCATCTACTCCAGGCCCTAGTGG ACAGAGAT INQFRIISLLSVEGKVF CATACCTTACCTTGTCTATAAGCGCTGCCCAGGGCTTCTCCTGCATCTGT ATACCCCC FSIVSRRLTEFLLENNY GGAAGATCTTGAAGGTGATTTGGCAACGAGGAAGAGTTGCTGAGCAGT AGGAGATC IDPSVQKGGIPGAPG GGAGGTGTGCCGAGGGAGTGTGGATTCCTAAAGAGGAAAACTCGAAA CCGAGAG CLEHTGVVTQLIREA AACATCAACCAGTTTCGAATCATCTCTCTATTGAGTGTTGAAGGGAAGG GGGGGGA HENRGDLVVLWLDL TGTTTTTCAGCATCGTCTCACGAAGACTGACAGAGTTCCTCCTCGAGAAC GGATGAG ANAYGSIPHKLVELAL AATTATATTGACCCTTCAGTGCAGAAGGGAGGGATTCCTGGAGCTCCCG ATCTCCAA HRHHVPSKIKDLILDY GCTGCTTGGAACACACTGGAGTAGTTACACAACTCATCAGAGAGGCCCA TCGGACGG YNNFKMRVTSGSETS TGAGAACAGAGGGGACTTGGTTGTCTTGTGGTTGGACTTGGCAAATGC ATCAAAGG SWHRIGKGIITGCTIS CTATGGGTCCATACCCCACAAGCTGGTTGAGCTCGCTCTACACCGCCACC TTA(SEQ VILFALAMNMVVKS ACGTTCCTAGTAAGATTAAGGACCTAATTCTGGATTACTACAATAATTTC IDNO: AEVECRGPLTKSGVR AAGATGCGGGTCACATCTGGGTCAGAAACATCAAGCTGGCATCGCATC 1163) QPPIRAYMDDLTITTT GGGAAAGGAATAATAACAGGCTGCACCATCTCAGTTATTCTTTTCGCTCT TVPGSRWILQGLERLI CGCCATGAACATGGTGGTCAAGTCAGCCGAAGTGGAATGCAGAGGGCC AWARMSFKPSKSRS CTTAACTAAGTCAGGTGTGCGACAGCCCCCTATTAGAGCATATATGGAT MVLKKGKVVDKFHF GACCTTACCATCACAACAACAACGGTCCCAGGGAGCAGGTGGATCTTAC SISGSVIPTITEQPVKS AAGGACTTGAGAGACTCATCGCCTGGGCTAGAATGAGTTTTAAGCCCTC LGKLFDSSLKDSAAIQ CAAGTCTAGGTCCATGGTGCTGAAGAAGGGGAAAGTGGTTGACAAGTT KSKKELGAWLAKVDK CCATTTTTCCATCTCAGGAAGTGTCATCCCAACCATCACGGAGCAACCTG SGLPGRFKAWIYQHS TCAAGAGTTTGGGGAAGCTCTTTGACTCCAGCCTAAAAGACTCTGCAGC ILPRVLWPLLIYAVPM CATCCAGAAGTCCAAAAAAGAACTTGGAGCTTGGCTGGCGAAGGTTGA STVESLERKISGFLRK CAAATCCGGCCTGCCTGGTAGATTCAAAGCCTGGATCTATCAGCATTCA WLGLPRSLTSAALYG ATTCTGCCCCGAGTTTTGTGGCCTCTGCTGATCTATGCAGTCCCAATGTC TSNTLQLPFSGLTEEF AACAGTTGAGTCCCTAGAAAGGAAGATCAGTGGCTTTCTTCGAAAATGG MVVRTREALQYRDS TTGGGCCTCCCACGCAGTCTTACCAGTGCTGCACTATACGGGACAAGTA RDGKVSSACIEVRTG ACACCTTGCAGCTACCATTCAGTGGCCTCACAGAGGAATTCATGGTTGT RKWNAGKAVEVAES ACGCACCAGAGAAGCCCTACAGTACAGGGACTCTAGAGATGGCAAGGT RLQQKALVGTVATG GTCATCAGCCTGCATCGAGGTGAGGACAGGCAGGAAATGGAATGCAG RAGLGYFPKTLVSQV GGAAAGCAGTGGAGGTGGCAGAGTCACGCCTGCAACAAAAGGCTCTG KGKERHHLLQGEVRA GTGGGCACTGTAGCGACAGGCAGAGCGGGCTTGGGCTATTTTCCAAAG SVEEERVSRVVGLRQ ACCTTAGTAAGCCAGGTCAAAGGCAAGGAAAGACACCACCTACTCCAG QGAWTRWNTLQRRI GGAGAGGTTCGAGCAAGTGTGGAGGAAGAGAGAGTCAGTAGGGTGGT TWANILQADFQRVR AGGACTCCGGCAGCAGGGAGCATGGACTAGGTGGAATACACTGCAACG FLVQAVYDVLPSPSN TAGGATCACCTGGGCGAACATCTTGCAGGCGGATTTCCAACGTGTCCGT LHVWGKNETPSCLLC TTCCTAGTACAAGCTGTCTACGATGTACTGCCAAGCCCATCAAACCTCCA SGRGSLEHLLSSCPKA CGTTTGGGGAAAGAATGAGACACCTTCCTGCCTTCTTTGCTCTGGAAGA LADGRYRWRHDQVL GGCTCTCTAGAACATCTCCTCAGCAGTTGCCCCAAGGCTCTGGCTGATG KAIAASLASAINTSKN GTCGCTATCGTTGGCGCCATGACCAGGTGCTTAAGGCAATTGCTGCGAG HRAPRKAVHFIKAGE CTTAGCTTCAGCCATTAACACGAGCAAGAACCATCGTGCTCCAAGGAAG KPRALPQLTTGLLHK GCAGTCCACTTCATCAAAGCTGGAGAAAAACCCCGGGCCCTCCCACAAT ASDWQLEVDLGKQL TAACAACAGGCCTCCTTCACAAAGCCTCGGACTGGCAGCTGGAGGTCGA RFPHHIAATRLRPDII CCTGGGAAAACAGCTGAGGTTTCCTCATCACATCGCTGCAACACGTCTC AISEASRQLIILELTVP CGTCCAGACATTATAGCTATCTCAGAAGCTTCAAGACAGCTAATTATTCT WEERIEEANERKRAK GGAGCTTACAGTGCCGTGGGAAGAGCGTATTGAAGAAGCAAATGAGA YQELVEECRERGWRT GGAAGCGCGCTAAGTACCAGGAATTAGTGGAGGAGTGCAGGGAGAGA YYEPIEIGCRGFAGRS GGCTGGAGAACTTACTATGAGCCCATAGAAATTGGATGCAGAGGCTTT LCKVLSRLGITGVAKK GCAGGGCGTTCACTTTGCAAAGTCCTCAGTCGTTTGGGCATTACAGGCG RAIRSASEAAEKATR TGGCGAAGAAAAGGGCCATTCGATCCGCAAGCGAAGCCGCAGAGAAG WLWIKRADPWTAV GCCACAAGGTGGCTGTGGATTAAGAGGGCAGATCCGTGGACTGCTGTT GTQVGT(SEQID GGGACACAAGTCGGGACTTGATCAACCCCGGCTGGGTCACCTGGGTGA NO:1408) GAGTGTATGATGTTGAGAGACCCGAAACACTCAATGATCCCAGGATACA TCACTGATGATGTGTCCCAAATGCATCCATGAGATGTTTCTTGCATAA (SEQIDNO:1040) HERO HERO- Branchiostoma CTGACCAGCAGACGGGAAGCCCGCGACCAACTAGTCTCCGCAAATATTG CTGACCAG TAGAAA MALPAVRSGPASTW 3_BF floridae CACACAGGGCGACCCTATGGAGCTGATTCAGTCAAATTTCCTCTGAGAT CAGACGG CCCACA TLLITLVIVAAKGTDG ATACCGATAACTATCTACAGAAACTGCACAGTTAGTTTGGAAAGAGCTT GAAGCCCG AGGCTG FMSFKLPLLSTDTWS TTCTACTGAAAGACAGCAAAATCCGCCACTTTAGACGAGCGTCAAGACT CGACCAAC AGAAAT GYNNDVKTLLGPLHH GCCCTCCCCATAACCAATATGGCGCTACCTGCTGTACGTTCTGGACCAGC TAGTCTCC GTAGAG ELATNEMSPKLAGEG CAGCACCTGGACACTGTTAATCACGCTGGTCATCGTCGCTGCTAAAGGT GCAAATAT CATCTGT FSDIMCDFMASKPEF ACAGATGGTTTTATGTCTTTTAAACTGCCACTGCTGTCTACTGATACCTG TGCACACA ATGGAC SHTTEESHSEGYISHE GTCTGGGTATAACAATGATGTGAAAACCCTGCTAGGCCCGCTCCACCAC GGGCGAC AATATT PQSLAQVKRLKNKLR GAACTGGCCACAAATGAAATGTCCCCCAAACTAGCTGGGGAGGGATTC CCTATGGA GATGAT KKAFRADATPEDRKA AGTGACATCATGTGCGACTTTATGGCCAGTAAACCAGAGTTCAGCCACA GCTGATTC TGAAAT FRDAIKTYSFMKRQQ CTACCGAAGAAAGTCACTCAGAAGGCTATATAAGCCACGAACCACAGTC AGTCAAAT GTTGTG KRKETTKSAAHQEKE TCTCGCACAAGTAAAACGCCTGAAAAACAAGCTACGTAAGAAGGCATTC TTCCTCTG ATTTTAG YHKNFWKFAGKCAK AGAGCTGACGCAACACCTGAGGATCGAAAGGCTTTCAGAGATGCAATT AGATATAC ATCAAA GQLDIPPVKPAFSVYY AAAACATACTCCTTCATGAAGCGACAACAGAAACGAAAGGAAACTACA CGATAACT TTTAGA ANEYYKNKYSHPTRV AAATCGGCAGCACACCAAGAGAAAGAATATCATAAGAACTTTTGGAAG ATCTACAG AATATG DFNKLLWFPHLPVEE TTTGCCGGAAAATGTGCAAAAGGACAGCTCGATATCCCCCCAGTAAAAC AAACTGCA AAAACC QLPANSFDMSPVRP CGGCATTCTCTGTTTATTATGCAAATGAGTACTACAAAAACAAATACTCA CAGTTAGT GAACTA KDIKAVLSKRCATSAP CACCCAACCCGTGTTGACTTCAACAAACTGCTCTGGTTTCCTCATTTGCC TTGGAAAG AACTAA GPDGIMYGHLKHLP GGTGGAGGAACAACTACCTGCGAACTCTTTTGACATGTCACCTGTCAGG AGCTTTTC ATATAAT ACHLFLSTLFSKLLESG CCGAAAGACATTAAGGCAGTCTTATCCAAACGATGCGCTACATCTGCAC TACTGAAA GTTTTTT DPPTSWSSGNVSLIH CTGGCCCGGACGGGATCATGTATGGCCACCTCAAGCACCTGCCAGCTTG GACAGCAA TTAAAG KDGSPEAAENFRMIC TCACCTGTTCCTTAGTACACTGTTCTCCAAACTGCTTGAGTCCGGAGACC AATCCGCC TAATGA LTSCVSKIFHQILSER CACCGACATCATGGTCATCTGGCAACGTGTCACTTATACACAAGGATGG ACTTTAGA TAAGCA WAKYMTCNDLIDPE TAGTCCAGAAGCTGCCGAAAACTTTCGAATGATCTGCCTTACTTCCTGCG CGAGCGTC ATACCC TQKAFLTGINGCVEH TCTCCAAGATTTTCCACCAAATACTCTCGGAACGATGGGCAAAGTACAT AAGACTGC ACATTGT VQVMREILAHAKKN GACTTGCAATGATCTGATAGACCCAGAAACACAAAAGGCATTCCTGACC CCTCCCCA GCAATA RRTVHITWFDLADAF GGAATCAACGGCTGTGTGGAGCATGTCCAAGTTATGCGGGAGATCTTA TAACCAAT CTATCTA GSVEHELIYYQMERN GCACATGCCAAGAAAAACCGCCGAACAGTCCACATTACATGGTTTGACC (SEQID TGTTATG GFPPIITTYIKNLYSRL TCGCGGATGCCTTTGGTTCTGTAGAACACGAACTGATCTACTACCAGAT NO:1164) TCCTTTG KGKVKGPGWESDPF GGAGAGAAACGGCTTCCCGCCAATTATCACCACGTACATTAAAAACCTG TCCCCCC PFGRGVFQGDNLSPII TATTCTCGCCTGAAAGGGAAAGTGAAGGGTCCAGGCTGGGAAAGTGAT TGCATG FLTVFQPILQHLKGVE CCGTTCCCGTTCGGAAGAGGAGTGTTCCAAGGAGACAACTTGTCACCCA TTTGGTC QQHGYNLNDKHYVT TCATCTTCCTAACGGTGTTCCAGCCTATTCTACAGCATCTCAAGGGAGTA AATAAT LPFADDFCLITTNKRQ GAGCAGCAACATGGCTACAACCTCAATGACAAGCATTATGTTACACTGC GACCAT HQKLITQISSNTKSM CTTTCGCAGACGACTTTTGTCTCATAACCACAAACAAACGACAGCATCAG CGTGTC NLKLKPRKCKSMSIVS AAACTAATTACTCAAATTTCTTCCAACACAAAGTCAATGAACCTAAAGCT CTGGGC GKPSDISFTIDGDPVK AAAACCACGCAAGTGTAAGTCTATGTCTATAGTGAGCGGAAAGCCATCG TCCGTG TTKDAPEKFLGGYITF GACATCAGCTTCACAATAGATGGGGACCCTGTCAAAACGACCAAAGATG TACCTTT LSKTKETYDILAKTIET CACCGGAGAAATTCCTAGGTGGCTACATCACCTTCCTGAGTAAAACAAA CTTTACT TVENINKSAIRNEYKL AGAGACCTATGACATCCTAGCAAAGACAATAGAAACGACTGTTGAAAAC ATGAAT RVYMEYAFPSWRYM ATAAACAAATCAGCGATAAGGAACGAATACAAACTCAGGGTTTACATG AAAGAA LMVHDLTDTQLQKL GAGTACGCCTTCCCATCTTGGAGGTACATGCTGATGGTACACGACCTGA TGATTTT DSIHTKAIKTWLRMQ CAGACACCCAGCTACAAAAACTCGATTCCATCCACACAAAGGCGATCAA ACTAC PSATNAILYNTRGLNF AACATGGCTCAGAATGCAACCTAGTGCAACAAATGCAATTCTGTACAAC (SEQID KSISDLYLEAHALAYS ACAAGGGGTCTCAACTTCAAAAGCATCTCAGACTTGTACCTAGAAGCCC NO: RSVLKADEKVKHALQ ACGCTCTGGCCTACAGTAGGTCAGTCCTCAAAGCAGATGAGAAGGTAA 1287) AKLDRESQWTRKMQ AACACGCTTTACAAGCCAAACTGGACCGCGAATCGCAATGGACTAGGA KWGIGKCHTIHQQAI AAATGCAGAAATGGGGTATTGGAAAGTGTCACACCATCCACCAGCAAG HVAKDSEWTSVRKH CCATCCATGTAGCAAAGGACTCAGAATGGACATCAGTACGCAAACATGT VKQQVTDMRHDVW CAAACAACAAGTCACAGATATGCGTCATGACGTCTGGACTAAACATCAG TKHQENLLQQGQML GAAAACCTTCTACAGCAAGGGCAGATGCTACAACTGCTTGAGGAAGAA QLLEEEKCDLTWRSA AAATGCGACCTGACATGGCGGTCCGCTATGTACAACCTGCCGAGGGGC MYNLPRGILSFAVRA ATCCTCAGTTTCGCTGTGCGTGCCTCCATCGACGCCCTCCCCACACTCTG SIDALPTLCNLTTWG TAACCTGACCACCTGGGGAAAACGTAACACTGACAAATGTAAACTGTGT KRNTDKCKLCGNRET GGCAACCGGGAAACACTCCACCACGTTCTGAACCACTGCGGTGTCGCTC LHHVLNHCGVALQQ TCCAACAAGGACGGTACACATTCCGACACAACTCGGTATTGAAGCACAT GRYTFRHNSVLKHIT AACGGACACCATCATAGAGTCCATTGACACCTCTCGGATCAACGCCACC DTIIESIDTSRINATIYA ATCTATGCGGACATACAAGGTTACACAACTAACGGAGGTACCATCCCGG DIQGYTTNGGTIPVH TCCATACAATACCCACTACCCAGAAACCAGACCTGATCATATATTTACCA TIPTTQKPDLIIYLPEQ GAACAGAAGACCCTCCACATCCATGAACTGACTGTACCCTTTGAAAAGA KTLHIHELTVPFEKNIK ACATCAAAACAAGTCATGACCGAAAGGTCAACAAATACAGCACCCTAGC TSHDRKVNKYSTLAA GGCAGATTTAGAAACTGCTGGCATTTCCGCTACACTAACCTGCTTTGAA DLETAGISATLTCFEV GTCGGATCAAGGGGACTCGTCACGCCAGAGAACAAGACCAGGCTTAGA GSRGLVTPENKTRLR ACACTGTTCAAAATAGTTAAAGCCAAACCACCGAAGACTCTGTTTACTGA TLFKIVKAKPPKTLFT TATAAGCCGCATTGCGATGTTATCGTCATATGCTATTTGGAACTCACGCC DISRIAMLSSYAIWNS ACGAACCGTATTGGGAGTCAGAAACGCTATTGTAGAAACCCACAAGGCT RHEPYWESETLL GAGAAATGTAGAGCATCTGTATGGACAATATTGATGATTGAAATGTTGT (SEQIDNO:1409) GATTTTAGATCAAATTTAGAAATATGAAAACCGAACTAAACTAAATATA ATGTTTTTTTTAAAGTAATGATAAGCAATACCCACATTGTGCAATACTAT CTATGTTATGTCCTTTGTCCCCCCTGCATGTTTGGTCAATAATGACCATCG TGTCCTGGGCTCCGTGTACCTTTCTTTACTATGAATAAAGAATGATTTTA CTAC(SEQIDNO:1041) HERO HERODr Danio AAAGCAGTAGAGATGACGACACATCGCGCAGAAGTTACAACTTCTGGT AAAGCAGT TAGCAT MTTHRAEVTTSGKT rerio AAGACGCAGGAGGAGCCAGGCCCGGAGGCAACCCACAGTGCCCAGAG AGAG(SEQ GCCACTT QEEPGPEATHSAQSL CCTCCTAGTGTCGCCAACACCTGCTGCCGGCCGCTCGCCTGCTACTCAAA IDNO: GGACAC LVSPTPAAGRSPATQ GCTGCCCTCAAGTGACAGCAGCTCATAACAGTCCACAAAGCCCCCAAAG 1165) AGGCCG SCPQVTAAHNSPQSP TCAGCAAGTGGCAGTTACAAGATCTGACTGTGTTCCCTTGGCACAGCCA GGGTCT QSQQVAVTRSDCVP AGAATCCAGTGGCCCCAATCCTCAAAGAAAGCTGAGTGGCTCCAGTTCG GATCAG LAQPRIQWPQSSKKA ACAAGGACGTGAATCAGATCCTGGAAGTGACAGGCAAGGGGGGTGTG CCTCGG EWLQFDKDVNQILEV GACCAGCGACTGTCAACAATGACCACGCTCATAGTGAACATTGCAGCTG TCGGGT TGKGGVDQRLSTMT AGCGATTCGGAACTGTGACACCCAAACCCACTCCATCGACATATACTCCA CGCCTG TLIVNIAAERFGTVTP AGCCACAGAGTAAAGGAAATCAAACGTCTCAGGAAAGAACTTAAGCTA GAGGAG KPTPSTYTPSHRVKEI CTAAAGAGGCAGTACAAGGCAGCAGGGGAAGTAGAAAGAGCGGGCCT GGTGTC KRLRKELKLLKRQYKA AGAAGATCTGAGAGGAATCCTGAGGAAACAGCTCGTGAACCTATGTAG TGTTGC AGEVERAGLEDLRGI GGCAGAGTATCACAGGAAGAGGCGGAGAGAGAGAGCAAGGAAAAGG AAGACC LRKQLVNLCRAEYHR GCAGCATTTTTGGCCAACCCTTTCAAGTTGACCAAGCAGCTCCTTGGCCA CGAAAC KRRRERARKRAAFLA AAAGAGGACTGGCAAACTCACCTGCTCCAAGGAGGCTATCAACAATCAC ACCCTGT NPFKLTKQLLGQKRT CTCAAGGCCACTTATTCTGACCCGAATAGAGAACAACCCCTGGGGCCTT GAGCCC GKLTCSKEAINNHLK GCGGTGCACTGCTGACACCACCTGAGCCCACATCAGAGTTCAACATGAA AGGAAA ATYSDPNREQPLGPC GGAACCCTGCCGGAGTGAAGTAGAGGAAGTGGTGAGGAGAGCAAGGT CAACAC GALLTPPEPTSEFNM CAAGCTCAGCACCAGGCCCAAGCGGAGTGCCTTACAAGGTATATAAGA TGATGA KEPCRSEVEEVVRRA ACTGCCCAAAGCTTCTACACAGGCTCTGGAAGGCCCTGAAAGTCATATG TGTGTC RSSSAPGPSGVPYKV GAGAAGAGGGAAGATTGCCCAGCCATGGAGGTATGCGGAGGGAGTGT CAAGGT YKNCPKLLHRLWKAL ACATCCCAAAAGAGGAGAAGTCGGAGAACATCGACCAGTTTCGAGTCA TGTGCA KVIWRRGKIAQPWR TCTCCTTGCTCAGTGTGGAGAGCAAAATATTCTTCAGCATTGTGGCCAAA TCAGGA YAEGVYIPKEEKSENI AGACTCTCCAACTTCCTATTGAGCAATAAATACATCGACACGTCTATGCA GATGTTT DQFRVISLLSVESKIFF GAAGGGAGGCATACCAGGAGTCCCAGGCTGCCTGGAACACACAGGCGT CTGTAA SIVAKRLSNFLLSNKYI GGTAACTCAGCTCATTAGGGAGGCAAGAGAAGGCAGGGGGGACCTGG C(SEQ DTSMQKGGIPGVPG CTGTGTTGTGGTTGGATCTCACCAATGCCTATGGCTCAATACCCCACAAG IDNO: CLEHTGVVTQLIREA CTGGTGGAGGTCGCACTGGAGAAACATCATGTACCCCAGAAGGTGAAA 1288) REGRGDLAVLWLDLT GACCTCATCATCGACTATTACAGCAAGTTCAGCTTGAGAGTCTCCTCTGG NAYGSIPHKLVEVALE CCAGTTAACATCAGATTGGCACCAGCTTGAGGTAGGAATAATCACTGGT KHHVPQKVKDLIIDYY TGCACCATCTCAGTGACCCTCTTTGCACTGGCAATGAACATGATGGTCAA SKFSLRVSSGQLTSD AGCAGCTGAGACAGAGTGCAGAGGCCCCCTCAGCAAGTCCGGAGTAAG WHQLEVGIITGCTISV GCAACCTCCCATCAGAGCCTTCATGGACGACCTCACAGTGACAACAACG TLFALAMNMMVKA TCGGTACCAGGAGCAAGATGGATCCTCCAAGGGTTGGAGAGGCTCGTG AETECRGPLSKSGVR GCATGGGCACGCATGAGCTTCAAACCTGCAAAATCCAGATCCTTGGTGC QPPIRAFMDDLTVTT TTAGGAAAGGCAAAGTCAGAGATGAGTTCCGCTTCAGGCTGGGACAAC TSVPGARWILQGLER ACCAAATCCCATCAGTCACTGAGAGACCAGTAAAGAGTCTCGGGAAGG LVAWARMSFKPAKS CCTTTAACTGTAGCCTCAATGACAGAGACTCCATCAGGGAAACCAGCAC RSLVLRKGKVRDEFR TGCCATGGAGGCTTGGTTGAAAGCAGTGGATAAATCAGGGCTCCCTGG FRLGQHQIPSVTERP AAGATTTAAGGCTTGGGTTTACCAACATGGAATCCTTCCAAGACTCCTCT VKSLGKAFNCSLNDR GGCCCTTGCTAATCTATGAGGTCCCCATGACTGTGGTTGAAGGTTTTGA DSIRETSTAMEAWLK ACAAAAGGTGAGCAGCTATCTACGCAGATGGCTGGGATTGCCACGCAG AVDKSGLPGRFKAW CCTAAGTAACATCGCTCTGTATGGGAACACCAACAAGCTCAAACTTCCTT VYQHGILPRLLWPLLI TTGGCTCAGTCAGGGAGGAGTTCATTGTGGCACGGACACGAGAACATC YEVPMTVVEGFEQK TGCAGTACTCTGGATCCAGAGATGCGAAAGTGTCCGGGGCAGGGATTG VSSYLRRWLGLPRSLS TCATCAGGACAGGGAGAAAGTGGAGGGCAGCAGAGGCAGTCGAACAA NIALYGNTNKLKLPFG GCGGAAACCCGGCTGAAGCACAAGGCCATCCTGGGGGCAGTAGCACAA SVREEFIVARTREHLQ GGCAGAGCTGGACTTGGGAGCCTAGCAGCAACCCGATACGACTCGGCC YSGSRDAKVSGAGIVI AGTGGGAGGGAGAGGCAGAGGCTGGTGCAGGAGGAGGTGCGTGCTTC RTGRKWRAAEAVEQ AGTTGAGGAGGAGAGAACCAGCAGAGCAGTGGCCATGCGGCAACAAG AETRLKHKAILGAVA GTGCCTGGATGAAGTGGGAGCAGGCGATGGAGCGGAATGTCACCTGG QGRAGLGSLAATRYD AAGGACATCTGGACATGGAACCCCCTGAGAATCAGGTTCTTGATCCAAG SASGRERQRLVQEEV GGGTCTACGACGTTCTTCCCAGCCCATCGAACCTGTACATATGGGGCAG RASVEEERTSRAVAM AGTAGAGACACCTGCATGCCCGCTGTGTTCCAAGCCAGGGACACTAGA RQQGAWMKWEQA ACATATTTTGAGCAGCTGTTCCAAGGCACTAGGTGAAGGTCGGTATCGA MERNVTWKDIWTW TGGAGACACGATCAGGTCCTTAAATCCATTGCTGAGGCAATCAGCAAGG NPLRIRFLIQGVYDVL GGATCAAGGACAGTCGATACCGCCAAGCCACGGCCAAGGTCATTCAGT PSPSNLYIWGRVETP TCATCAAGGAAGGACAAAGGCCAGAGAGAACAGCAAAGAACTGCTCTG ACPLCSKPGTLEHILS CTGGGTTGCTCTCCACGGCCCGAGACTGGGTGATGACAGTTGATCTTGA SCSKALGEGRYRWR GAGGCAGCTAAAGATTCCACCACACATCACCCAGTCTACGTTGAGACCT HDQVLKSIAEAISKGI GACATAATCTTGGTCTCTGAGGCCACAAAGCAATTAATCCTGCTGGAGC KDSRYRQATAKVIQFI TGACGGTGCCCTGGGAGGAGAGGATGGAGGAGGCTCAGGAGAGAAA KEGQRPERTAKNCSA GAGGGGAAAATATCAGGAGCTAGTGGAGCAATGTAGGGCGAATGGAT GLLSTARDWVMTVD GGAGGACCAGGTGCATGCCAGTGGAAGTGGGCAGTAGGGGATTTGCC LERQLKIPPHITQSTLR AGCTACACCCTGAGCAAGGCCTATGGTACACTGGGAATAACAGGCACA PDIILVSEATKQLILLEL AACCGAAGAAGAGCCCTAAGCAACAACGTGGAAGCAGCGGAAAAAGC TVPWEERMEEAQER ATCCAGATGGCTCTGGTTGAAGAGGGGGGAACAGTGGGGGCAGTAGC KRGKYQELVEQCRA ATGCCACTTGGACACAGGCCGGGGTCTGATCAGCCTCGGTCGGGTCGC NGWRTRCMPVEVG CTGGAGGAGGGTGTCTGTTGCAAGACCCGAAACACCCTGTGAGCCCAG SRGFASYTLSKAYGTL GAAACAACACTGATGATGTGTCCAAGGTTGTGCATCAGGAGATGTTTCT GITGTNRRRALSNNV GTAAC(SEQIDNO:1042) EAAEKASRWLWLKR GEQWGQ(SEQID NO:1410) HERO HEROFr Takifugu AGACTAGGTGACAACCAAGAACAGTTWGGTCGACTACTGGAAAGACA AGACTAGG TGATCA MTPAMEMTTTVTCI rubripes GTTGGCAGCTCGGAAAGACGGCACCCGGGACAGTATGGGTTAGCACCC TGACAACC CCCCGG CSKLCKNQRGLKIHQ CAGCCTGTATCTTTCGCGAGAAGGAACCCAAACAAGCTACGGAAAGCCC AAGAACA CTGGGT ARMKCLEREVEVQR TACAGAGAAACACCCCCAGGAGATCCCGAGAGGGGGGGAGGATGAGA GTTWGGT CGCCTG TGPGPGETQEEPGQ TCTCCAATCGGACGGACCTAACGTTAATGACCCCTGCAATGGAAATGAC CGACTACT GGCGAG EATHRSQSLHVPEPP TACGACAGTAACATGTATCTGCAGCAAGCTGTGCAAGAACCAGCGTGGC GGAAAGA GGTGTA NPNRVVQQQRIKWP TTAAAGATCCATCAGGCCAGAATGAAATGTCTGGAGCGGGAGGTTGAG CAGTTGGC TGATGT PANRRSEWLQFDED GTGCAACGCACAGGTCCTGGACCTGGTGAGACGCAGGAGGAGCCCGG AGCTCGGA CGTGAG VSNIIQATAKGDVDS ACAGGAGGCAACCCACAGATCCCAGTCCCTCCACGTACCGGAGCCTCCC AAGACGG ACCCGA RLQAISTIIVSYGSERF AACCCTAACAGAGTAGTTCAACAGCAGCGGATTAAGTGGCCCCCAGCAA CACCCGGG AACACC GRIEKGNTETTSYTM ATAGACGGAGTGAGTGGCTGCAGTTTGATGAGGATGTGTCCAACATCA ACAGTATG CTATGA NRRSFKIHQLRKELRT TCCAAGCCACAGCCAAAGGAGATGTCGACAGCAGACTCCAGGCGATAA GGTTAGCA ACCCAG LKKQFKRAXDGDKQ GTACCATCATCGTCAGCTATGGCTCAGAAAGATTTGGACGGATCGAGAA CCCCAGCC GATACA ALKELYNILRKKLKTLR GGGCAACACTGAGACCACCTCTTACACCATGAACCGCAGGTCCTTTAAG TGTATCTT TCCTGAC RAEWHRRRGRERAR ATACACCAACTGCGCAAGGAGCTGCGAACCCTCAAGAAACAGTTCAAG TCGCGAGA GATGTG KRAAFIANPFRFSKQL AGAGCTKCTGATGGGGACAAGCAAGCTTTAAAAGAGCTGTATAACATCC AGGAACCC TCCCAGT LGDKRSGRLECSREE TGCGGAAGAAGTTGAAAACTCTCCGCAGAGCAGAGTGGCACAGGAGGC AAACAAGC GCATCC VNRFLQNTMSDPLR GCGGGAGAGAGAGAGCAAGGAAGCGAGCAGCCTTCATTGCCAATCCCT TACGGAAA AGGAGA GQDLGPNRALISPAP TCCGGTTTTCTAAACAGCTGCTCGGGGACAAGCGGAGTGGCCGACTTGA GCCCTACA TGTAKCT PSAEFKLAEPSLKEVE GTGCTCAAGGGAGGAAGTGAATCGCTTCCTCCAAAACACCATGAGCGA GAGAAAC TTAAGT EVIKAARSASSPGPSG CCCACTGAGGGGTCAAGACCTAGGACCCAACAGAGCGCTCATCAGCCCT ACCCCCAG (SEQID VPYLVYKRCPEILRHL GCCCCACCATCGGCAGAGTTCAAGCTGGCAGAGCCTAGTTTGAAGGAG GAGATCCC NO: WKALKVIWRRGRVA GTTGAAGAAGTCATCAAGGCAGCCCGTTCTGCATCTTCCCCGGGCCCCA GAGAGGG 1289) DQWRCAEGLWIPKE GTGGTGTACCTTACCTCGTCTACAAGCGCTGTCCAGAAATTCTCCGGCAT GGGGAGG EDSKNINQFRTISLLS CTGTGGAAGGCCTTGAAAGTGATCTGGCGAAGGGGGAGAGTAGCCGA ATGAGATC VEGKVFFSIVSRRLTE CCAGTGGAGGTGTGCTGAGGGACTTTGGATACCCAAGGAGGAGGACTC TCCAATCG FLLKNNYIDTSVQKG GAAAAACATCAACCAGTTTCGGACTATCTCACTACTGAGTGTGGAAGGG GACGGACC GIPGVPGCLEHNGVV AAGGTGTTTTTTAGCATCGTCTCCCGAAGACTGACCGAGTTTCTCCTCAA TAACGTTA TQLIREAHESKGELAV GAACAACTACATCGACACTTCAGTGCAGAAGGGTGGGATCCCTGGAGT (SEQID LWLDLTNAYGSIPHK CCCCGGCTGTCTAGAGCACAATGGTGTAGTCACACAGCTCATCAGAGAG NO:1166) LVELALHLHHVPSKIK GCCCATGAGAGCAAAGGAGAACTAGCGGTTTTGTGGTTGGACCTGACT DLILDYYNNFRLRVTS AACGCCTACGGGTCCATCCCACACAAGCTAGTTGAGCTTGCGCTACACC GSVTSDWHRLEKGII TACACCATGTTCCCAGTAAGATCAAGGACCTGATTCTGGATTACTATAAT TGCTISVVLFVLAMN AACTTCAGGCTCAGGGTCACTTCAGGGTCAGTAACCTCAGACTGGCATC MVVKAAEVECRGPL GCCTTGAGAAAGGAATAATAACAGGCTGTACCATCTCCGTCGTTCTCTTC SRSGVRQPPIRAYMD GTACTGGCGATGAATATGGTGGTAAAGGCGGCTGAGGTGGAGTGCAG DLTVTTTSVPGCRWI AGGGCCTCTATCCAGATCAGGTGTTCGACAGCCCCCCATAAGAGCCTAC LQGLERLILWARMSF ATGGACGACCTTACCGTCACAACAACATCAGTCCCAGGGTGTAGGTGGA KPTKSRSMVLKKGKV TCTTGCAGGGTTTGGAGAGACTCATCCTATGGGCTAGGATGAGTTTTAA VDKFRFSISGTVIPSIT GCCCACCAAGTCAAGGTCCATGGTACTGAAGAAGGGGAAAGTGGTGGA EQPVKSLGKLFDSSLK CAAATTCCGATTCTCAATCTCAGGAACCGTAATTCCATCGATCACGGAGC DTAAIQKSTEELGGW AACCAGTCAAGAGCCTGGGAAAGCTCTTTGACTCCAGCCTGAAGGACAC LTKVDKSGLPGRFKA TGCTGCTATCCAGAAGTCTACGGAAGAGCTTGGAGGGTGGCTCACTAA WIYQYSILPRVLWPLL GGTGGACAAGTCTGGCCTGCCTGGTAGATTTAAAGCCTGGATCTACCAG VYAVPVTTVESFERKI TACTCCATCCTTCCCAGAGTCCTGTGGCCTCTCCTCGTGTATGCAGTCCC SSFLRRWLGLPRSLNS AGTAACAACAGTGGAATCCTTTGAAAGGAAGATCAGCAGCTTTCTGCGC AALYGTSNTLQLPFS AGATGGCTGGGTCTTCCTCGCAGCCTCAACAGCGCTGCACTGTACGGGA GLTEEFKVARTREAL CAAGTAACACCCTGCAGCTACCCTTCAGTGGGCTCACTGAAGAATTTAA QYRDSRDCKVSSAGI GGTGGCACGCACAAGAGAAGCCCTACAGTACAGAGACTCCAGGGACTG EVKTGRKWKAEKAV CAAGGTGTCATCAGCCGGGATTGAGGTGAAGACAGGAAGGAAGTGGA XVAESRLRQKALVGA AGGCAGAAAAGGCAGTGGAKGTGGCTGAGTCACGCCTAAGGCAAAAG VATGRTGLGYFPKTQ GCACTAGTTGGGGCCGTGGCAACAGGAAGAACAGGCTTGGGCTACTTC VSHARGKERNHLLQE CCAAAGACCCAAGTCAGCCATGCCCGGGGCAAAGAGAGAAACCACCTA EVRAGVEEERVGRAV CTTCAGGAGGAGGTCCGAGCAGGCGTGGAGGAAGAGCGAGTGGGTAG GLRQQGAWTRWES GGCAGTGGGACTCCGGCAGCAGGGGGCATGGACAAGGTGGGAGAGC ALQRKVTWSNIMQA GCGTTACAGCGCAAAGTTACCTGGTCAAACATCATGCAGGCAGACTTCC DFHRVRFLVAAVYDA ACCGCGTCCGGTTCCTTGTGGCGGCAGTCTACGATGCCCTCCCCAGCCC LPSPANLHAWGKSET AGCAAACCTCCATGCGTGGGGAAAGAGTGAGACACCCACCTGTTCCCTT PTCSLCSGRGSLEHLL TGCTCCGGAAGAGGCTCCCTGGAACATCTCCTTAGCAGCTGCCCAAAGT SSCPKSLADGRYRWR CCCTGGCTGATGGTCGCTATCGCTGGCGCCACGACCAGGTACTCAAAGC HDQVLKAVAESIALAI AGTGGCTGAGAGCATAGCCTTGGCCATTAGCACCASCAAACACCATCAT STXKHHHAPKKAISFI GCTCCGAAGAAGGCAATCTCCTTCATAAAAGCTGGAGAGAGACCTCGTG KAGERPRAGPQITTG CAGGCCCACAGATAACAACGGGACTCCTCCACACAGCTMCTGATTGGC LLHTAXDWQLHVDL AACTGCACGTTGACCTGGGAAAACAACTGATATTCCCCCAGCACATCGC GKQLIFPQHIATTSLR AACAACGTCTCTACGGCCAGACATGATCATCATCTCAGAGGCTTCGAAA PDMIIISEASKHLIML CACCTGATCATGCTGGAGCTTACAGTGCCCTGGGAAGAGCGGATTGAG ELTVPWEERIEEANE GAAGCCAACGAAAGGAAACGTGCCAAGTATCAGGAGCTGGTGGAGGA RKRAKYQELVEECRG GTGCAGGGGCAGGGGCTGGAGGACCTTCTACGAGCCCATAGAAGTTGG RGWRTFYEPIEVGCR CTGTAGAGGCTTTGCAGGACGCTCCCTCTGCAAAGCCTTTGGCCGACTG GFAGRSLCKAFGRLG GGAGTCACAGGGACAGCCAAAAAGAGGGCCATTAAAKCCGCGAGTGA VTGTAKKRAIKXASE AGCTGCAGAGAGAGCCACGAGGTGGSTGTGGCTKAAAAGGGCAGATCC AAERATRWXWLKRA GTGGGTTGCTACTGGGACACAAGCCGGGTCTTGATCACCCCGGCTGGG DPWVATGTQAGS TCGCCTGGGCGAGGGTGTATGATGTCGTGAGACCCGAAACACCCTATG (SEQIDNO:1411) AACCCAGGATACATCCTGACGATGTGTCCCAGTGCATCCAGGAGATGTA KCTTTAAGT(SEQIDNO:1043) HERO HEROTn Tetraodon AGATTGGTCTGGCTAAGCCAGTGACGTCCAGGAACAGACTGGCTGACG AGATTGGT TGATCA MATTQASVKPTAVA nigroviridis ACCACGAATAGAGTGGTGACAGCTTGGATAGACAGCTGACAGCAGGGA CTGGCTAA CTCCCA TCVCGKICKNPRGLKI AAGACGGCAACCGGGGCAGGAAGGGCTAGCAACCCAGCCTGCATCTTC GCCAGTGA GTCGGG HQTKMGCLASVQPE CGTGAGGAAGAACCCAAAACTTGCTACGAAGAGCCCGAAGCAAAGATA CGTCCAGG TCGCCT QRARFSLSESREVPA CCCCCAGGGGAGCCCGAGAGGGGGGGAGAATGAGCTCCCCAAACGGA AACAGACT GGGTGA RAEPYGPQQPHSPEA CGGATAACATGGCAACGACCCAGGCTAGCGTTAAACCGACAGCGGTTG GGCTGACG GGGGGT LGETQEERGQESPHS CCACATGTGTATGTGGCAAAATCTGCAAAAACCCACGAGGTCTGAAGAT ACCACGAA CTGATG AQNLRAQVAQAPDN CCACCAGACCAAGATGGGGTGCTTGGCAAGTGTGCAACCAGAGCAGCG TAGAGTGG TTGAAA PQHHRRVKWPPASK CGCAAGGTTCAGCCTCAGCGAGTCGCGGGAGGTGCCAGCCAGGGCCGA TGACAGCT GACCCG VSEWQQLDEDLEGIL GCCCTATGGCCCTCAGCAACCGCATTCTCCTGAGGCCCTTGGTGAGACG TGGATAGA AAACCC ESTAKGGVDRKLQT CAGGAGGAGCGGGGCCAGGAGTCACCCCACAGTGCCCAGAACCTCCGT CAGCTGAC CCGATG MTTLVISFATERYGT GCTCAGGTAGCACAAGCGCCAGACAACCCACAACACCACCGGCGGGTT AGCAGGG ACCCCA MEKRAAPEKYTKNR AAGTGGCCCCCAGCCAGCAAAGTGAGCGAGTGGCAGCAGCTTGATGAG AAAGACG GGTACT RAEKISQLRQELRVLK GATTTGGAAGGTATTCTGGAGTCCACCGCAAAAGGTGGAGTAGACAGA GCAACCGG ATCACT KQFKGASEDQKPGLA AAACTCCAAACAATGACCACGCTGGTCATCAGCTTTGCCACCGAGAGAT GGCAGGA GACGAT ELRCTLRKKLLTLRRA ATGGTACAATGGAGAAACGCGCTGCTCCAGAGAAGTACACCAAAAACC AGGGCTA GTGTCC EWHRRRAKERAKKR GCAGGGCAGAAAAGATCTCCCAACTGCGGCAGGAACTTCGGGTCCTGA GCAACCCA AAGACA AAFLANPFGFTKQLL AAAAGCAGTTCAAGGGCGCCAGCGAGGATCAGAAGCCAGGATTGGCA GCCTGCAT TGCATC GQKRSAHLECAKEEV GAGCTTCGTTGCACCCTTAGGAAAAAACTGCTTACCCTTCGCCGAGCAG CTTCCGTG AATAGG DSYLHDTFSDAEREN AGTGGCACCGGAGACGGGCCAAGGAAAGAGCCAAGAAACGCGCTGCA AGGAAGA TGTATTT SLGECRVLISPPEPAC TTTTTAGCCAACCCTTTTGGGTTCACTAAACAACTTTTAGGCCAGAAGCG ACCCAAAA AGAAAT SFNTKAPTWKEIQTV TAGCGCCCACTTGGAATGTGCAAAAGAGGAGGTTGATTCCTACCTCCAC CTTGCTAC C(SEQID VRAARNNSAPGPNG GACACATTCAGTGACGCAGAACGGGAGAACAGCCTAGGCGAATGTAGA GAAGAGC NO: VPYLVYKRCPKLLARL GTGCTGATCAGTCCACCTGAGCCAGCCTGCAGTTTCAACACCAAGGCTC CCGAAGCA 1290) WKILRVIWRRGKVA CAACTTGGAAAGAAATCCAAACTGTGGTCAGGGCTGCAAGAAACAACT AAGATACC HQWRWAEGVWVP CAGCTCCTGGACCCAATGGAGTCCCATATCTGGTGTACAAAAGATGCCC CCCAGGG KEEKSTLIEQFRTISLL CAAACTCCTAGCCCGGCTCTGGAAGATCCTAAGGGTGATCTGGAGAAG GAGCCCGA NVEGKIFFSILSHRLSD GGGGAAGGTCGCCCATCAATGGAGATGGGCGGAAGGGGTGTGGGTTC GAGGGGG FLLKNQYIDSSVQKG CGAAGGAGGAGAAGTCAACCTTGATAGAGCAGTTTAGGACCATCTCACT GGAGAAT GIPGVPGCLEHCGVV GCTCAATGTCGAGGGGAAGATATTCTTTAGTATCCTCTCCCATCGTCTAT GAGCTCCC TQLIREAREGRGSLA CAGACTTCCTCCTTAAGAACCAGTACATCGACTCCTCGGTGCAAAAGGG CAAACGGA VLWLDLANAYGSIPH GGGGATCCCTGGGGTACCAGGGTGTTTAGAACACTGTGGCGTGGTGAC CGGATAAC KLVEMALARHHVPG ACAACTAATTAGGGAGGCGCGCGAAGGGAGAGGTAGCCTGGCCGTACT (SEQID PIKTLIMDYYDSFHLR TTGGCTGGACTTAGCTAACGCTTATGGCTCCATACCCCACAAGCTGGTG NO:1167) VTSGSVTSEWHRLEK GAAATGGCATTAGCGAGGCACCATGTCCCAGGCCCGATCAAGACTCTG GIITGCTISVIIFALAM ATCATGGACTACTATGATAGCTTCCACCTGAGAGTCACGTCAGGCAGTG NMLAKSAEPECRGPI TCACATCTGAATGGCACCGACTAGAGAAAGGGATCATCACTGGATGCAC TKSGIRQPPIRAFMD CATCTCAGTGATAATATTCGCCCTGGCCATGAATATGCTGGCCAAGTCG DLTVTTTSVPGCRWI GCTGAGCCAGAGTGCAGAGGACCCATAACCAAGTCAGGCATTCGCCAG LQGLERLMTWARM CCCCCCATCAGAGCATTCATGGATGATCTGACAGTAACAACAACGTCAG RFKPGKSRSLVLKAG TTCCAGGGTGCCGTTGGATCCTCCAGGGCCTGGAGAGGCTTATGACTTG KVTDRFRFYLGGTQIP GGCCCGTATGCGCTTTAAACCTGGAAAATCTAGGTCCTTAGTCCTGAAG SVSEKPVKSLGKMFD GCAGGGAAGGTGACCGACCGCTTCCGCTTCTACCTGGGAGGCACCCAG GSLKDAASIRETNDQ ATTCCATCAGTCTCTGAGAAACCGGTGAAAAGCCTAGGTAAAATGTTCG LGHWLTLVDKSGLPG ACGGCTCCTTAAAGGATGCCGCTTCCATCAGGGAAACCAATGATCAGCT KFKAWVYQHGILPRI GGGGCACTGGCTGACGTTGGTCGATAAGTCAGGTCTTCCGGGGAAATT LWPLLVYEFPISTVEG CAAGGCATGGGTATACCAGCATGGTATCCTACCTAGGATACTGTGGCCA LERRVSSCLRRWLGL CTGCTGGTGTATGAATTTCCAATTTCCACCGTGGAAGGGCTTGAGAGGA PRSLSSNALYGNNNK GGGTCAGCAGCTGCCTCAGGCGTTGGCTGGGACTACCTAGGAGTCTGA LTLPFSSLAEEFMVTR GCAGCAATGCCCTCTACGGTAACAACAACAAGCTGACACTCCCCTTCAG AREVLQYRESKDPKV CAGCCTGGCAGAGGAATTCATGGTTACCAGAGCTAGGGAAGTTCTCCA ALAGIEVRTGRRWRA GTACAGGGAGTCCAAGGATCCCAAGGTAGCTCTTGCCGGCATTGAGGT QEAVDQAESRLHHK GCGGACTGGCAGAAGGTGGAGGGCTCAGGAGGCAGTGGACCAGGCAG ELVGAVATGRAGLGT AATCTCGGCTGCACCACAAAGAGCTTGTGGGAGCCGTGGCGACTGGCC TPTTHLSRLKGKERR GTGCAGGCCTGGGAACAACACCGACCACCCACCTCAGCAGGCTCAAGG DQVQLEVRASIEEQR GCAAGGAAAGGCGGGATCAGGTCCAACTAGAAGTGAGGGCCAGTATT ASQWVGLRQQGAW GAGGAACAGCGAGCTAGTCAGTGGGTGGGGCTGAGGCAGCAAGGCGC TRWEEAMARKISWP TTGGACTAGGTGGGAAGAGGCCATGGCCAGAAAGATCTCATGGCCTGA ELWRAEPLRIRFLIQS GCTGTGGAGGGCTGAGCCCTTGCGCATCCGCTTCCTTATTCAGTCAGTTT VYDVLPSPSNLFLWG ATGACGTCTTGCCCAGCCCATCAAACCTCTTCCTGTGGGGCAAGGTGGA KVESPSCPLCQGRGT ATCCCCATCATGTCCCTTGTGCCAGGGAAGGGGCACCTTGGAGCACATC LEHILSSCPKALGEGR CTCAGCAGCTGTCCCAAAGCACTTGGAGAGGGTCGCTATCGCTGGCGTC YRWRHDQVLKAIAES ACGACCAGGTGCTGAAGGCAATCGCTGAGTCTATCAGCTCCGCCATGGA ISSAMEYSKRLPLPGR GTACAGCAAGCGCCTACCCTTACCGGGACGCGGAGTTAGGTTTGTCAG GVRFVRAGEQPPPQ GGCCGGTGAACAACCTCCTCCCCAACCAAGGGCCCAACCAGGCCTCCTT PRAQPGLLATARDW GCAACAGCTAGGGACTGGCAACTAAGGGTTGACCTGGGGAAACAATTA QLRVDLGKQLKFPEN AAGTTCCCGGAAAACATCGTAGAAACCAACCTGAGGCCAGACATTGTTC IVETNLRPDIVLHSQS TGCACTCACAGTCGTCCAAGCAAGTTATTTTGCTGGAGCTGACTGTGCCC SKQVILLELTVPWEER TGGGAGGAGAGAATGGAGGAAGCGTATGAAAGGAAGGCAGGGAAGT MEEAYERKAGKYAEL ACGCTGAGCTGGTGGAGGATTGCCGCAGAGCAGGGTGGCGCAGTAGA VEDCRRAGWRSRCL TGCCTGCCTATAGAGGTTGGGGGTAGGGGCTTTGCAGGGAAGTCACTC PIEVGGRGFAGKSLC TGCAAGGCCTTTAGCCTCCTGGGCATCACAGGCATGCGCAGGAGGAAA KAFSLLGITGMRRRK GCCATCTGCGCGGCCTCAGAGGCTGCAGAGAGGGCGTCCAGATGGCTG AICAASEAAERASRW TGGATCCAGCGGGACAAGCCGTGGACGAGCGCTTCTTGGACACAGGCC LWIQRDKPWTSASW GGGAACTGATCACTCCCAGTCGGGTCGCCTGGGTGAGGGGGTCTGATG TQAGN(SEQIDNO: TTGAAAGACCCGAAACCCCCGATGACCCCAGGTACTATCACTGACGATG 1412) TGTCCAAGACATGCATCAATAGGTGTATTTAGAAATC(SEQIDNO: 1044) NeSL LIN9_SM Schmidtea AAACGACATCATGAACGCTTGGCCGCAACAATCCAGTTATCCCTGCGGT AAACGACA TAAAAT MMDSRQLNTPKIRK mediterranea AACATTGTGGAACTCATAAGACAAGTACTAAAAGAAGAATTAGAAAAAT TCATGAAC GGCAAA YQNPKMTNDIMKSY TAGAAGAAAAAATTGAAAATAATTTATTTATAAAATTTAAAAATTTAAAT GCTTGGCC AAGATA NYAVLSDVTPQETTQ AAATTTAAAAATTTAAATTTAAATTTAAATGAAGATAAAAATTTATTTAA GCAACAAT TTTCAAG TTTHLNVDIDNETTQ TCCAATAAATAATCAAGAAAATCAAGAAAATGATGGATTCAAGACAATT CCAGTTAT ATGAAT PKQPLTKSGKPKSKPI AAATACTCCAAAAATAAGAAAATATCAGAACCCAAAAATGACAAACGAC CCCTGCGG TGTGGA AVSYKFKDATFIWDT ATCATGAAAAGCTACAACTACGCGGTTTTGAGCGATGTCACGCCTCAAG TAACATTG CTCATCT TPQTNPPRDCTKLID AAACCACTCAAACAACAACCCACTTAAATGTCGATATAGACAATGAAAC TGGAACTC AAAAAA KTRPRKTIFKKSAFQS CACCCAACCAAAACAGCCACTTACGAAGTCTGGCAAACCAAAATCTAAA ATAAGACA TGACCA YLKKELSNETFVEVKT CCAATTGCGGTATCATACAAATTTAAAGATGCCACCTTCATCTGGGACAC AGTACTAA CCTTGA FLMATHKYRFKDENS TACCCCACAAACAAATCCACCAAGAGATTGCACCAAACTTATTGATAAA AAGAAGA GTCCAA RLLAYRIINRYVMETA ACAAGACCAAGAAAGACCATCTTCAAAAAATCAGCATTTCAAAGCTACC ATTAGAAA ATATGC NEFKETEFDMARFAK TCAAAAAAGAACTGTCCAATGAGACATTTGTGGAAGTAAAAACCTTCCT AATTAGAA CTAGCT FFTIPENWLKHLKPYS CATGGCAACTCACAAATATCGTTTTAAAGACGAAAACTCAAGACTCTTG GAAAAAAT ATCATG TATETSPADRIKVQKL GCATACCGAATAATTAATCGCTATGTCATGGAGACAGCAAATGAATTCA TGAAAATA GTTGCT VDLTCRYPFKTQEEQ AAGAAACCGAATTTGACATGGCTCGCTTTGCCAAATTCTTCACAATCCCA ATTTATTTA GATGGA TSVANFLHFFTQRSII GAGAATTGGTTAAAACATCTAAAACCATACTCTACAGCTACCGAAACAT TAAAATTT AACAGT GISRDYKFQKFIPFMA CACCGGCTGATAGAATAAAAGTACAAAAATTAGTGGATCTCACATGCAG AAAAATTT AAGGCA RKNTRPETTSTMVTT ATACCCATTCAAAACTCAAGAAGAGCAAACAAGTGTAGCAAACTTCCTA AAATAAAT CCTGAT SPTEQNRLPMVIITPL CACTTCTTCACCCAAAGATCAATAATTGGAATCTCAAGAGATTATAAATT TTAAAAAT AGCTAA EEPKSEHRRPEKRGA CCAAAAATTTATACCATTTATGGCAAGAAAAAACACCAGGCCGGAGACA TTAAATTT CTTTTCA SNDTIVLSDEEFPLLK ACCTCCACTATGGTTACGACTTCTCCAACAGAACAAAACAGACTACCAAT AAATTTAA CTGTGA RRTLPTRKSKNPTGA GGTAATAATCACACCACTTGAAGAACCAAAAAGTGAACATCGTAGACCA ATGAAGAT ATATCTT GNVPTETECTDEVKFI GAGAAAAGAGGCGCAAGCAATGACACAATTGTGCTTAGCGACGAAGAG AAAAATTT CAGATA LNNEYQIECKECGKV TTCCCACTACTTAAAAGGAGAACTCTTCCAACCAGAAAATCCAAAAATCC ATTTAATC TTCACA WENVRNGLNHLRQK TACTGGTGCAGGAAATGTACCWACAGAAACCGAATGCACTGATGAAGT CAATAAAT GTGACA HDFPNRTDVMVSCV TAAATTCATCCTCAACAATGAATACCAAATAGAATGTAAAGAGTGTGGA AATCAAGA CGAAAG RCEVPIKGAECVNHIK AAAGTGTGGGAAAACGTACGAAATGGATTAAACCACCTTCGTCAAAAAC AAATCAAG GACACC NHKKDDKEESEAGSL ACGATTTCCCAAACCGAACAGATGTTATGGTATCTTGCGTAAGATGTGA AAA(SEQ ACTAGT VANTQDIPNESSLSQ AGTACCGATCAAAGGAGCAGAATGTGTAAATCACATTAAAAATCACAAA IDNO: AAAAAC AAIEVYLRNILKMKEN AAAGATGACAAAGAAGAAAGTGAAGCSGGGAGTCTTGTGGCTAACACT 1168) CACTAG QERNIQYLEPSTANFL CAAGACATCCCAAATGAAAGTAGCTGACTGTCACAAGCCGCAATCGAAG TTTTTTC INRNLRAFYQNVKIEK TATATCTGAGGAATATTCTGAAAATGAAAGAAAACCAGGAAAGGAATA TGACAC LIGWEQVIWLIHWN TTCAATATCTTGAACCTAGTACTGCGAATTTCCTCATAAATAGGAACCTC CTCTTGC KCHWIVYLANCDSKT AGAGCATTTTATCAAAACGTCAAAATCGAAAAGCTTATCGGATGGGAAC TACAAA SVILDSDNQMTLQQ AAGTCATCTGGCTTATACATTGGAACAAATGTCATTGGATTGTATACCTA CTCTGTA RCNIKAKFDKFLEGTF GCTAATTGCGACTCAAAAACCTCTGTTATCTTGGACTCTGACAACCAAAT AAAATC EEKTVLGTLERKVPQ GACATTACAGCAAAGATGTAACATAAAAGCCAAATTTGACAAATTCCTA AAAAGG QPNNFDCGIYVIQYIS GAAGGTACCTTTGAAGAAAAAACAGTGCTTGGAACCCTAGAAAGAAAA ATCGAT DFLKDPQRIDYHTPD GTTCCTCAGCAACCAAACAACTTCGATTGCGGTATATATGTGATACAATA AGGCCG SKRIRKEIGELILEEMK CATCAGCGACTTTCTTAAAGACCCACAAAGAATAGATTATCATACACCCG CGCTTTC NPASKIKNPNKEIQSL ACTCCAAAAGAATTAGAAAAGAAATAGGAGAATTAATATTAGAAGAAA ACGGTC LQKFRLLQINVNDVF TGAAAAACCCTGCCTCAAAAATCAAAAATCCAAACAAAGAAATACAATC TGTATTC HWFAAEYQKSLPKIR TTTACTCCAAAAATTCAGACTACTGCAAATCAATGTGAATGATGTATTCC GTACTG TKRDGKLNKLSCSYQI ATTGGTTTGCGGCTGAATACCAAAAATCTCTACCGAAGATACGTACCAA AAAATC QRLFGLAPKRAVKEIY AAGAGATGGAAAACTGAATAAACTAAGCTGCTCCTATCAAATCCAAAGA AAGATC FQETSTADLETRVLN TTATTTGGTCTAGCTCCTAAAAGAGCAGTCAAAGAAATATATTTCCAAGA AAGGAA EHFKKDESTMKECKI AACCTCTACAGCAGACTTGGAAACAAGAGTTCTAAATGAACATTTCAAA GCTTTTC KNGNHYQDWITKAQ AAGGATGAATCAACGATGAAAGAATGTAAAATAAAAAATGGAAACCAT CCCTTTT IDNKEILEALKNSTDS TACCAAGACTGGATAACAAAGGCCCAAATTGATAATAAAGAAATATTGG AGTCAA APGEDNIPLRQWIIW AAGCCCTAAAAAACAGTACAGATTCTGCCCCCGGAGAAGATAACATTCC CACCAG NNDGVLFDMFNYIK TCTGAGGCAATGGATAATCTGGAACAACGACGGTGTCCTCTTTGATATG GTTTCTG RTHDIPDMWKNYTT TTTAACTACATCAAAAGGACACACGATATCCCAGATATGTGGAAAAACT TCCTAGT TLLIKPGKSQESNIPA ACACCACAACACTACTTATAAAACCCGGAAAAAGCCAAGAAAGCAACAT TGAGCT NWRPISILPTSYRIFM CCCCGCTAATTGGAGGCCAATATCGATATTGCCAACAAGCTATCGTATAT TCCCTTG KVLNKRVLEWANRG TTATGAAAGTCCTAAATAAAAGAGTACTAGAATGGGCTAATAGAGGAG GGACAT ELISKWQKAVDKAN AACTGATATCAAAATGGCAGAAAGCCGTAGACAAAGCTAATGGATGTG CTGCGT GCDEHSYVIQALIEKA ATGAGCACAGCTATGTCATACAAGCGCTTATCGAAAAAGCAAACAGAA TACCATT NRSYYKNEQCHLAFL GCTACTACAAAAACGAGCAATGTCACCTCGCCTTCTTGGATTTGGCAGA TGACAG DLADAFGSIPFQVIW TGCTTTTGGAAGCATCCCATTCCAAGTAATATGGCATACCCTAAAAAATA ATGTAC HTLKNMGMDEETIN TGGGTATGGATGAGGAAACCATCAACTTGCTCAAAGAAATCTACAAAGA CGCCCC LLKEIYKDCSTKYKCG TTGCTCCACAAAATATAAATGTGGAAAGAATGAGTCAGAAAAGATCAAA AGTCAA KNESEKIKITKGVRQG ATTACGAAAGGAGTCCGACAGGGATGCCCATTGTCGATGACCCTCTTCA ACTCCCC CPLSMTLFSLCIQYLI GCCTCTGTATACAATATCTTATACAAGGCATAGCAGAAAAGAAAAAAGG ACCTGA QGIAEKKKGATIAGQ AGCAACAATTGCAGGTCAAGAAGTTTGCATATTGGCTTATGCGGACGAC CACTGTC EVCILAYADDLVIVAN CTAGTAATTGTTGCAAACACAGCAAAAGACATGCAAATGCTGTTAACAA CTCAAA TAKDMQMLLTTIENL CAATCGAAAATCTGGCAAAACAAGCCGATCTCATATTCAAACCGGCAAA ACAGTT AKQADLIFKPAKCGY ATGTGGATATTACAGAGACCCAAGAGATAAAAAGTCCATGATGAAGAT CAATTG YRDPRDKKSMMKIY ATATGGCAAAGAAATCAGCATAGTAGACGAAAAGAATGTTTACACCTAC CATCCG GKEISIVDEKNVYTYL CTAGGTGTAAGAATCGGTGACACAAAGAAAAAAGACCTAAATGTCAGA AAGATC GVRIGDTKKKDLNVR TTCGAAGAGGTCAAAAAGAAAACGACAGCAATCTTCAAATCGAAATTGC GCAATTT FEEVKKKTTAIFKSKLR GAAGTGACCAAAAACTAGAGGCATACAACATCTTTTGCCAATCAAAATT TTTCACT SDQKLEAYNIFCQSKF TGTGTACATCCTACAAGGCGAAGATATCGCAAAAACCAAAATTGAAACT AAAATA VYILQGEDIAKTKIETY TACGACGAAGAAATCAAGAAAATGATAAAAGAAGATATATTAAAATTAC AATTAA DEEIKKMIKEDILKLQ AAGACAAAAGTCCGTTCACAGACTTCGTTATCTACTCCCCAAGAGAAAA CAAAAG DKSPFTDFVIYSPREK AGGGGGGTTAGGAATAACAAAGATAATAGATGAACAAACAATTCAAAC TTAATTA GGLGITKIIDEQTIQTI TATTAATAGAACGGCAAAACTCCTAAATAGTAGCCATAGAGCAATCCGG TACTGCT NRTAKLLNSSHRAIR GCTATTATTTATGAAGAGCTAATACAAGTAGCTAACCTAAGAGGAGAAA TCATTGA AIIYEELIQVANLRGEK AAGAAATCAACACCATTGAAGAAGCACTAAAATGGTTGGAAGGTACCA GTAAGT EINTIEEALKWLEGTN ACAAATACAAAAAGAACTCCAACGCCAAGACCACCTGGATAACAAGGG AGAAAA KYKKNSNAKTTWITR TTCGGGAGGCCTTTCAAACTCTAGAAAAGAAACACAAAATCAAGGTTAG ACAATC VREAFQTLEKKHKIKV ATTTGTGCCCAAAGAAAACTGCATTGGATATAAAATCAAATGCGACACC (SEQID RFVPKENCIGYKIKCD CAAGAAAAGATAGTGGAGCTTGATAACTCAAAAGAGTTATCAAAAAGC NO: TQEKIVELDNSKELSK TTACACTGGATGATAAAAGAGGCATATTATAAAGAATGGAAAGCCCTAA 1291) SLHWMIKEAYYKEW AATGCCAAGGATATATTATAAGCCTAAAAACCTCCGAATTTATGGAGTG KALKCQGYIISLKTSEF GAAAATGCCCAGAGGCCTTCCGGACCCTGATTGGAGATTCCTAACAAAA MEWKMPRGLPDPD GTAAAGGCAAATATGTTGGACGTAAACATGAAACAAGCCAACCAGGGA WRFLTKVKANMLDV GGAAGGTTGGGAAGCACAAAATGCCGAAAATGTGAAGATAAAGAATC NMKQANQGGRLGS GGCAAGCCATGTTATAAACCACTGTGCCTCAGGTAACTGGAGTAGAGTG TKCRKCEDKESASHVI GAAAAGCACAACCAGGTGCAAAATGAGCTAGCAAAAGAACTGACAAAG NHCASGNWSRVEKH CGGAATATCAGCTTCGAAAAGGACAGCATCCCAAAAGAAACAAAAGAG NQVQNELAKELTKR AGCCTAAGACCAGATTTGGTTATAAGACTCAAAGACAAGATAATGATAG NISFEKDSIPKETKESL TGGACATCAAATGCCCATTTGATGAGGAATCTGCTATCGAGAGTGCCAG RPDLVIRLKDKIMIVD AAACAAGAACATAGACAAATATCGAGAACTGGCCAAAGAGATCCAAGC IKCPFDEESAIESARN AAAAACTGGGTTACAAACAACAGTCTCAACTTTCGTTGTCTGTTCTTTGG KNIDKYRELAKEIQAK GAACCTGGGATAAGAGGAACAACGAGCTCCTACGGCAGATGGGAATAA TGLQTTVSTFVVCSL GATATGAAGAATCCAAAGAGATGAGGATCAATATGATCCAAAAAGCCA GTWDKRNNELLRQ TCCACGGGTCTAGAAAAACCTACGACCACCACAGAAATTTTAACAATGG MGIRYEESKEMRIN TTAAAATGGCAAAAAGATATTTCAAGATGAATTGTGGACTCATCTAAAA MIQKAIHGSRKTYDH AATGACCACCTTGAGTCCAAATATGCCTAGCTATCATGGTTGCTGATGG HRNFNNG(SEQID AAACAGTAAGGCACCTGATAGCTAACTTTTCACTGTGAATATCTTCAGAT NO:1413) ATTCACAGTGACACGAAAGGACACCACTAGTAAAAACCACTAGTTTTTTC TGACACCTCTTGCTACAAACTCTGTAAAAATCAAAAGGATCGATAGGCC GCGCTTTCACGGTCTGTATTCGTACTGAAAATCAAGATCAAGGAAGCTT TTCCCCTTTTAGTCAACACCAGGTTTCTGTCCTAGTTGAGCTTCCCTTGGG ACATCTGCGTTACCATTTGACAGATGTACCGCCCCAGTCAAACTCCCCAC CTGACACTGTCCTCAAAACAGTTCAATTGCATCCGAAGATCGCAATTTTT TCACTAAAATAAATTAACAAAAGTTAATTATACTGCTTCATTGAGTAAGT AGAAAAACAATC(SEQIDNO:1045) NeSL NeSL- Caenor- AAGGACGCTGGTTTAAGGCCGAATTCGTTCGTTCTTTTTCTGGCGGTCTT AAGGACG TAAACC MPLXISDCVHLVSAE 1_CBre habditis GCTTTGAGCTTGGTTTCCGATCCTATGCCCTTGWGCATCAGCGATTGCG CTGGTTTA CACACG GDTMNGRSTCGPLS brenneri TTCATCTGGTCTCGGCCGAAGGAGATACGATGAACGGGAGGTCCACTT AGGCCGA AGAMCT RSSSVVSRSRSSPSPS GTGGGCCATTGTCTCGTTCATCCTCTGTCGTAAGTAGGTCGAGGTCTTCC ATTCGTTC ACGACG VPPHPSPSIGPDTGLS CCTTCCCCCAGTGTTCCCCCCCACCCTTCCCCCAGTATCGGTCCAGATACA GTTCTTTTT CCATAA AGIIGTSRGCSLWLPE GGATTGTCGGCTGGAATCATCGGCACATCGAGAGGATGTAGCCTTTGGT CTGGCGGT GATCAG VDNALSQWLRKGLE TGCCAGAGGTGGACAATGCCTTATCACAGTGGCTGAGAAAAGGGTTGG CTTGCTTT GCATGT RDHEVLVCGFEAAKP AACGAGACCATGAAGTTCTGGTTTGTGGATTTGAGGCAGCAAAGCCACT GAGCTTGG ACGGAT LSLSKARLLRKTPRNT GTCACTTTCCAAAGCTAGACTTCTAAGAAAGACCCCAAGGAACACTGGT TTTCCGAT GTGAAT GVVRHILEFDGRLVH GTGGTTAGGCACATATTAGAATTTGACGGAAGGTTAGTTCATACTAACT CCT(SEQ GAGACT TNCNETECVLSTLXSX GTAACGAGACCGAGTGTGTTCTTTCTACTTTGTKCAGTGAMGWGGCTG IDNO: GATGAA XAVEVVRISLKCEPRE TCGAAGTAGTCAGGATATCTCTCAAATGTGAACCCCGTGAACCCTGTGA 1169) CGGAAT PCEPKCVLSILCSDKIV ACCCAAATGTGTTCTTTCTATTTTATGCAGTGATAAGATAGTCWGGATAT GAGCAC XISFECETREPFPFFXD CATTTGAATGTGAAACWCGTGAACCTTTTCCTTTCTTCMCGGATCGGAA GTGCCC RKFREPIPFVFERMY ATTCAGAGAACCTATTCCTTTCGTTTTCGAGAGGATGTATGACCCAAGA ATAAGA DPRDPIPSFICWMYD GACCCTATTCCTTCATTTATTTGTTGGATGTATGACCTGAGACAAAGGAT TCGGGT LRQRMTPGTLPXNPL GACCCCTGGCACSTTGCCAAGWAATCCCCTTTCTCMAGAGAACAAAGA ATKAAA SXENKDSWGRPAVIK CAGCTGGGGACGCCCAGCTGTCATAAAGAATGAGATAAGATCTATGAG GAWCA NEIRSMRSYLEENVK ATCTTATCTCGAAGAGAATGTGAAGGAAAACCGCCTGAACCTTTTGAGA GAGACG ENRLNLLRRLRGGGE AGGTTAAGAGGTGGTGGTGAAGGAAAGAAGATGATCAGAAAGTTGGT ATCCCTA GKKMIRKLVAEKKSD TGCAGAAAAGAAAAGCGACACAGAGGCTGTCTGCAGGATACTGTACCC MCATCG TEAVCRILYPLDDRYE ACTTGATGATCGTTATGAGTGTTTTGTTGATGGTTGTGAGACAACATCAA GGAAAA CFVDGCETTSTMGY CGATGGGATACGGGTCTAGTGACCTGAAATACATGACCACACACATAAA CACGAG GSSDLKYMTTHIKKE GAAAGAGCATGGTGTGAAAGTCCAATGGACATATGAGTGCTCCCTGTG TTATACT HGVKVQWTYECSLC TAATAAGCAAGCTCCTTTCATGGGGGGAGCTGCGTCCAAGTGGGTTACA GCTTCAC NKQAPFMGGAASK GCGCACATGGCAACAAAGCATACCGAAACGGTGAAGTTGAAGCTCAAA TGAMCT WVTAHMATKHTETV CCAAGCATCTCGACTACTGCCAAGGTTGCTGCGAAGCTAGATGAGATCG CGCTAA KLKLKPSISTTAKVAA CCGTGTCGCTACCCAAACCGAGACAAGTACGTGTATTGAGAGACCCAGA GCTCTCA KLDEIAVSLPKPRQVR TGAAGTGAAAGAGAAGGTTGCAAAACCAACACTTGCTTCCACGAGAGA TAATGA VLRDPDEVKEKVAKP AGAAGTGAAGAGAAATGCNTTGCGAAACATGGCCCCACTAGTCGAACT CCGAAC TLASTREEVKRNALR GAGTTCTCAGAATCAGTTGACWGGAGCCGAAAGACCTGAGGAAACTA TTGTTCG NMAPLVELSSQNQL GTGAAGCTATGCGACTCGAGGAGTGTAGGACTCCAGAGAAGATTGCTG CAACTG TGAERPEETSEAMRL AACTAGAAGGAAAGATACAGACCCGAACAGTGACTAAAAAGCTTAGTG CCTCCTA EECRTPEKIAELEGKI CACTGAAAGAGTCAATGGAAAAGAGAACGAGAGAGGAGAAGGTTGGG ACCGGG QTRTVTKKLSALKES AAACCATCACTTGCTCCAATTCATGAAGAAGTGAAAAAGACTGCAAGAC CGGGTG MEKRTREEKVGKPSL GGAGCTTGGCACCTCTAGTTGAACCGAGTACGTTCACTCATTTGACTGG TGAGAA APIHEEVKKTARRSLA GGCGTCAAGACTTCAGGCTGTTCGTGACGCGTTCTCGAAAGCCAACAAA GGGAGG PLVEPSTFTHLTGASR GACGCTGCGGCGAAAAGAAGGTCTAGCCTGGCGAAACCAGCTAGATTA TCGCCTT LQAVRDAFSKANKD TCAGAGATTATGAATACCACCTTCACGAAGGAGACGGTAAATGAGACG GAGGCG AAAKRRSSLAKPARL AAAGAACCTGTGAATGATACTGACGAGAGTATCGCAACAATCCAGCCAC GACGCA SEIMNTTFTKETVNE AAGTACGTGTCTACCGGTTTAATACATGGTGTCTCGATCATGAAACCAC ATGAGG TKEPVNDTDESIATIQ GAGAGAAGCCTGGTTAACCGGAGAAGTTGTGGATTGGTTCATGGGAAA GATGTG PQVRVYRFNTWCLD AGTGACTGAGAAGAAAGACCAGTACAGAGTGTTTGACTCACTTGTATG TGCAGG HETTREAWLTGEVV GTCAATGTACAAGTTCCATGGTGTAGGGTATGTATTGGATCTGATGAGG TTCCCCC DWFMGKVTEKKDQ GATCCTCTAACATACTTCTTACCAATATGTGAACACGATCACTGGGTTTT TCTTGA YRVFDSLVWSMYKF GCTAGTGATTGATGAGAAAGGAATTTGGTACGGTGACTCGAAAGGTGC GATCCG HGVGYVLDLMRDPL AGAACCGTGTAGAGAAATCGCCAAATTCATAGAAGAGACGAAAAGAGA AAAGTC TYFLPICEHDHWVLL AAGACGAATGTTCCCAGTCCCCGTACCTCTTCAAAGAGACGGAGTGAAC TAAAAG VIDEKGIWYGDSKGA TGTGGTGTACATATATGTCTAATGGTTAAATCCATCGTGAATGGCGAAC TACTAG EPCREIAKFIEETKRER CATGGTACACCGAAGAAGAAGTGAAAGTGTTCAGAAGAAATGTGAAAA ACCGAA RMFPVPVPLQRDGV GAGGTCTGAAAGAATTTGGTTTTGAACTTTATTCTGAAAGGATCGTCTAT AGATCG NCGVHICLMVKSIVN GTCGGAGATGACAGCATAAAAGTGAATGATGAGCATGATGATGACGTG AGGACG GEPWYTEEEVKVFRR GTATTCCTCTCGGAGGAGACGAATAACACTACGTTCACGATCGAGCAAG GACGGG NVKRGLKEFGFELYSE CAGAAGATCCGGCTGAAGAGGATGCCCAGCATCTGGAGAGTCCGGTGA ATGGCC RIVYVGDDSIKVNDE AACCTGTAAAGCTCATGGAGTTGAAAATTCCAAAGATTGAGATAAAGAA GCGAGG HDDDVVFLSEETNNT GAAAGAGATTCGGAGAAAACCGAAACAACAAATCGAAAAGAAAAGAA CACACG TFTIEQAEDPAEEDA AGGTGCCAACAGGGAAACCAGATGAACTGTTGGTCAGAGTGCGATTAT GCGGGT QHLESPVKPVKLMEL GGTTGGAAAGAGAAGTCCAATCATACTTCGACTCTGGAAAGAGATTCCA AACACA KIPKIEIKKKEIRRKPK AAGACTGGAGTGGATATTAGATGTCCTCACGGCTGCGATTCACAAGGCT GCCAGA QQIEKKRKVPTGKPD ACCGCCGGTGATGAGCAAGCAATTGAAAGAATTGAGAAGAGATCACCC TAACCTA ELLVRVRLWLEREVQ CCTTTGGAAGTGGAAGAGGGTGAAATGTCTACACAGACAGAACCAAAG GTAGAT SYFDSGKRFQRLEWI AAAAGAGAAAGAAAAGAAAAGGAGTCAGGTTGTGAAATGAAAGCTTCT CTTCGG LDVLTAAIHKATAGD CACAAGGAGATGTACTTCAAAAACCGCTCCAAAGCGTTCAATGTGATAA ATCTCGT EQAIERIEKRSPPLEVE TTGGAAAAGACTCAAAGCAATGCGAGATTCCAATTGAGACCCTGCAAAA CGGCCT EGEMSTQTEPKKRER GTTCTTTGAGGGAACAACTGCAGAAACGAATGTGCCAGCAGAAGTGCT GGAGAT KEKESGCEMKASHKE GAAAGAGATGGGTTCACGTCTGCCAAAGTTGGAGGCGTTGGACTGGAT ATGTGG MYFKNRSKAFNVIIG GGAAGCTAATTTCATTGAAAGTGAAGTGTCAGATGCGATGAAAAAGAC AACCCT KDSKQCEIPIETLQKF CAAAGACACCGCTCCGGGTGTAGACGGACTACGGTATCACCATTTGAAA GGGAAA FEGTTAETNVPAEVL TGGTTTGATCCAGAGTATAAGATGTTGACACTTCTCTACAATGAATGTAA GGAGAA KEMGSRLPKLEALD GAACCATCGAAAGATTCCAAGTCATTGGAAAGAGGCAGAGACGATTCT AGTTGTT WMEANFIESEVSDA CCTTTATAAAGGAGGTGATGAAACGAGGCCCGACAACTGGAGACCTAT TGTTGG MKKTKDTAPGVDGL AAGTTTGATGCCCACGATCTACAAACTGTATTCTAGTCTTTGGAACCGAA GCTGGC RYHHLKWFDPEYKM GAATTAGATCAGTTGGTGGTGTGATGAGCAAATGTCAACGAGGTTTCCA AAGAGT LTLLYNECKNHRKIPS AGAGAGAGAAGGGTGTAATGAAAGCATAGGAATCCTTAGAACGGCTAT GAAGTT HWKEAETILLYKGGD CGATGTCGCTAAGGGAAAGAGAAGGAACCTGTCAGTTGCATGGTTAGA TGAATG ETRPDNWRPISLMPT CCTTACGAATGCGTTTGGTTCAGTACCCCATGAACTGATAAAAAGTACTC TGAACC IYKLYSSLWNRRIRSV TGGAATCGTATGGATTCCCAGAAATGGTGACAGAGATTGTCATGGATAT ACCGTC GGVMSKCQRGFQER GTACAGAGGTGCATCAATCCGAATCAAGAGCAAGAATGAGAAAAGTGA ATGCAA EGCNESIGILRTAIDV ACAGATTGTTATCAAATCTGGAGTGAAGCAGGGAGACCCCATCTCCCCC CCACTA AKGKRRNLSVAWLD ACGCTGTTCAACATGTGTTTAGAGAATGTGATAAGAAGACATCTGGATA AACCAG LTNAFGSVPHELIKST GTGCTTCGGGACACAGATGCATAAAGACAAAAGTCAAGGTCCTGGCTTT TGGCGA LESYGFPEMVTEIVM TGCAGATGACATGGCCATACTGGCCGAGAACAGAGATCAGTTACAAACT TGCGGG DMYRGASIRIKSKNE GAACTAAACAAGTTGGACAAAGAATGTGAATCACTAAACCTCATTTTCA TGGAGT KSEQIVIKSGVKQGD AACCGGTAAAGTGTGCCAGTTTGATAATTGAGAGAGGAATGGTGAATA CATCAC PISPTLFNMCLENVIR AGAATGCGGAAGTGGTTCTGAGAGGGAAGCCAATCAGAAACCTAGATG AGGAAA RHLDSASGHRCIKTK AGAATGGTTCCTATAAGTACCTGGGAGTACATACAGGAATCGCAACAAG ATGTTTC VKVLAFADDMAILAE AGTTTCAACAATGCAATTGTTGGAAAGTGTCACGAAAGAAATGGATCTA TGTTGCT NRDQLQTELNKLDKE GTGAATCAGAGCGGCATGGCTCCGTTTCAAAAACTAGACTGCCTAAAGA TGACTTA CESLNLIFKPVKCASLI CGTTTGTTTTGCCGAAACTAACGTACATGTATGCGAACGCAATACCTAA TCAGTG IERGMVNKNAEVVL GTTAACGGAGCTTAAGGTCTTTGCAAACTTGACGATGAGAATGGTGAAA TTTGATA RGKPIRNLDENGSYK GAGATTCATGAAATCCCCATCAAGGGATCTCCGTTAGAGTATGTACAGC TCGCCCT YLGVHTGIATRVSTM TACCTCCAAGTCAAGGCGGATTAGGAGTGGCTTGTCCAAAGATAACAGC CAGGCA QLLESVTKEMDLVN GTTGATTACCTTCTTGGTCAACGTCATGAAAAAGCTATGGTCTTCTGACA CAAGTA QSGMAPFQKLDCLK GTTACATCAGAAAACTATACAGGGACTACCTGGATGAAGTCGCAGAGA TGAAGG TFVLPKLTYMYANAIP CGGAGACAGGTATGGAAGAGATGACGAAAGAAGATATTGCAAAATATC CCCCCAC KLTELKVFANLTMRM TGAGTGGTGATGTGCCGATCGACAAGAAGGCGTTCGGTTACAACACMT CCACAT VKEIHEIPIKGSPLEYV TCACAAGAGTAAGAGATGTGTGCAACAGCCTCACTAASATAGKGGGAG AAACTC QLPPSQGGLGVACPK CTCCACTGCACAAGTTAAAGATTGTGGAGAGAGACGGTGACTTTGCCAT CCTAGC ITALITFLVNVMKKL TCTAGTGCAAGCCACCAAAGAAGGAATGGAGAAAATCTTCACCTGTGCT AACTGG WSSDSYIRKLYRDYLD CAGGAGAAGAAACTCCAACAGCTTCTGAAAGCAGAAGTMAACACGGCT TAGTCC EVAETETGMEEMTK CTAGCGCACCGTTTCTTCACCGAGAAACCMGTGAAAAGTGCAGTGATG AGCAAG EDIAKYLSGDVPIDKK AGTGTAATGAGACAGTATCCACAGAGCAATGCCTTTGTGAAGAATGGA CGCTGG AFGYNTFTRVRDVCN AAGAATGTGAGCATTGCTGTCCACTCGTGGATACACAAAGCAAGGTTGA TWCTTG SLTXIXGAPLHKLKIVE ATGCGCTGCATTGCAACTTCAACACGTACGGTGAAAACAAGTCAAAAGT CTACTAT RDGDFAILVQATKEG GTGCCGACGTTGCGGCAAAGACGTGGAAACCCAACTGCACATCCTGCA TGCGCC MEKIFTCAQEKKLQQ GWCATGCGAGTACGGGTTACCAAAGCTAATCAACGAAAGACATGATGC CCAGGC LLKAEVNTALAHRFFT GGTGTTACATGTGGTGAGAAACCTCATCCGCAAAGGCTCAAAGAAAGA TCGCCC EKPVKSAVMSVMRQ CTGGAAGCTAAAGATAGATGAAACTGTGTCAAGTTGTAATCAACTTCGT (SEQID YPQSNAFVKNGKNV CCAGACATCTATATGTGTAGCCCAGATGGGAAAGAGGTCATAATGGCA NO: SIAVHSWIHKARLNA GATGTAACCTGTCCTTATGAATCAGGAATGCAAGCTATGCAAGAGAGTT 1292) LHCNFNTYGENKSKV GGAACCGAAAGGTCACSAAATACGAAGGAGGCTTTAGCCACTTCCAWA CRRCGKDVETQLHIL AGATGGGAAAGAAATTCACAGTGTTGCCAATAGTGGTTGGATCACTGG QXCEYGLPKLINERH GAACGTGGTGGAAACCCACAACGAACAGCTTAGTTCAACTAGGCATAG DAVLHVVRNLIRKGS AGAAAGASACGATAAGAAGAGTGATCCCCGAGCTGTGCTCAATGACCA KKDWKLKIDETVSSC TGGAATACAGTAAGGATGTCTACTGGAACCATATATTCGGGGACACCTT NQLRPDIYMCSPDGK CAGGAAGCCACCAATGAGATTTGGTGTAGAGAAGCCAAAGGGTAATAG EVIMADVTCPYESG TTGGAAGAAGGAAGGCAGCGAGCCGAAAGGTGCTGCTTCCTCCGACTA MQAMQESWNRKVT AACCCACACGAGAMCTACGACGCCATAAGATCAGGCATGTACGGATGT KYEGGFSHFXKMGK GAATGAGACTGATGAACGGAATGAGCACGTGCCCATAAGATCGGGTAT KFTVLPIVVGSLGTW KAAAGAWCAGAGACGATCCCTAMCATCGGGAAAACACGAGTTATACT WKPTTNSLVQLGIEK GCTTCACTGAMCTCGCTAAGCTCTCATAATGACCGAACTTGTTCGCAAC XTIRRVIPELCSMTME TGCCTCCTAACCGGGCGGGTGTGAGAAGGGAGGTCGCCTTGAGGCGGA YSKDVYWNHIFGDTF CGCAATGAGGGATGTGTGCAGGTTCCCCCTCTTGAGATCCGAAAGTCTA RKPPMRFGVEKPKG AAAGTACTAGACCGAAAGATCGAGGACGGACGGGATGGCCGCGAGGC NSWKKEGSEPKGAA ACACGGCGGGTAACACAGCCAGATAACCTAGTAGATCTTCGGATCTCGT SSD(SEQIDNO: CGGCCTGGAGATATGTGGAACCCTGGGAAAGGAGAAAGTTGTTTGTTG 1414) GGCTGGCAAGAGTGAAGTTTGAATGTGAACCACCGTCATGCAACCACTA AACCAGTGGCGATGCGGGTGGAGTCATCACAGGAAAATGTTTCTGTTG CTTGACTTATCAGTGTTTGATATCGCCCTCAGGCACAAGTATGAAGGCCC CCACCCACATAAACTCCCTAGCAACTGGTAGTCCAGCAAGCGCTGGTWC TTGCTACTATTGCGCCCCAGGCTCGCCC(SEQIDNO:1046) NeSL NeSL- Caenor- GCGCCCCGGGTTACATTGTCGGGGCCACCTTTCTCTTGGAGTAGAGTAC GCGCCCCG TAAAAG WRRPAPKQTKNSSL 1_CJap habditis AGTCTACTAATTTTTTGATAAGCTAGTCGGGTCCGAACCACTAGAGTTTG GGTTACAT CCAAAA HHLGHEVKRIARLKP japonica CTTGAAAATGCGTCAAACCAGCATTTTAGAACTCGCCCAAAAGTTCGGC TGTCGGG GCCACG GIFEFHAKPKNSSLHH CCCGACCCCCAAACAAATGGGACCTTCTTGACGATTTTCCCTGAAAATCG GCCACCTT GAGCAT LGHGVKRXARLKPGI GAGGATGGAATGGTCCCCTATTCTTGTAAATAGKACTGTGCAATACCCC TCTCTTGG CGGGAA FEFHAKXKNSSLHHL TTCGTCATCTGTGGGGAACAGATGACACGTGACGTCATCCGTGTAGACG AGTAGAGT AGAAAA GHEVKRIARLKPGIFE TCACGTTTTCCCGTGCCTGCGGGAGCCCCCAATCGAGCAATTTTTGCTCT ACAGTCTA ATGGAA FHAKPKNSSLHHLGH TTTGAGTGTCTGGAACGCTTGAAACCCCAGACAAATCAGGCCCAGTCGT CTAATTTTT AAGGAC EVRRNSRLKPGIFGFY CGGAAAATTTCTTTTTGAAATTTTTTGGCGCCTGCGAAAAAAATTTTTTA TGATAAGC TGAAAA QKSKNSSLHHLGHEV ACCGCCACAAACCCCCGGGAGGCGCGGWTAGGGATATCGATGTCATCG TAGTCGGG CGAGAC RRIARLKPGILEFHAK ACTCGTCGGTGATCTTTGATTTTCTCTCTGCGTCTCCTATTTTGGAACAGT TCCGAACC TGAAAA NRIKSGLKVTFLSDLX CTCGACCAAAAAACCGGGCCTGGCAACCCACCGAATCCGGATGTCGGA ACTAGAGT ATCCCA AHAGALACSRFLAST GGGATTTGGCAAGAAATGTTGGAAATAACGAAATTTCGTTATTTTCAGC TTGCTTGA AACAAA LKTEHCRQKSFKPVG ACAATTGTCAAACCGGCAAGAAAACTGGATGGACAAGACACACAATTTA AAATGCGT ACAAAT FLLHFLKNSSINEVAS CCGGAAATTGTGCTTGTTACGTCGAATTTCCCAATTTTGAAAAAATTCCT CAAACCAG CCAAAA LRNVKKXFLEFFSGKP CGTTCCACTGGTCGGGACGCGAGGTCAGACGATCTGCACGTCTGAAACC CATTTTAG CAAACT IGGMASFSRTKITFFK CAAAATCTTCGGATTTTATGCAGTAGTGGCGCCGCCCGGCTCCCAAACA AACTCGCC GAAAAA LCLKNFVLSAENPPIIR AACAAAAAATTCCTCGTTGCACCATTTGGGGCACGAGGTCAAACGAATT CAAAAGTT AAAAAA QKTNQNKASXVQIA GCACGTTTGAAACCTGGAATTTTTGAATTCCATGCAAAACCAAAAAATTC CGGCCCCG AAAACA RGGHLSDCLPSQKM CTCGTTGCACCATTTGGGGCACGGGGTCAAACGAWTTGCACGTTTGAA ACCCCCAA AAACAA AGVLGRLFLSVQSTLS ACCCGGAATCTTCGAATTCCATGCAAAACMAAAAAATTCCTCGTTGCAC ACAAATGG AAACTG HRPFDTLLRSDDDKR CATTTGGGGCACGAGGTCAAACGAATTGCACGTTTGAAACCCGGAATTT GACCTTCT GACAGA GRKTIKLQFFIKENLV TCGAATTCCATGCAAAGCCAAAAAATTCCTCGTTGCACCATCTGGGGCA TGACGATT CACTGG TPXVARDVKILXKQT CGAGGTCAGACGAAATTCACGTCTGAAACCCGGAATCTTCGGATTCTAT TTCCCTGA AAACAG KNNSGNSDSNSETK CAAAAATCAAAAAATTCCTCGTTGCACCATTTGGGGCACGAGGTCAGAC AAATCGGA TGTCAG NFSKNKVSRQNGPLI GAATTGCACGTTTGAAACCCGGAATTCTCGAATTCCATGCAAAAAACCG GGATGGA GCAAAG GGGNHKKIGENQITR GATAAAATCCGGTCTAAAAGTGACGTTTTTGTCAGATCTTWCTGCTCAC ATGGTCCC TCGCCG TLEIESKSDDNKVLVL GCTGGTGCGTTGGCGTGTAGCAGATTTCTGGCATCGACACTTAAAACGG CTATTCTT ATTATAC RILYPTNDWYKCYSQ AGCACTGCCGACAAAAAAGCTTCAAGCCAGTCGGTTTTCTGCTTCATTTC GTAAATAG TGTTCCA WCQHKSLVGYGAHD CTAAAAAATTCCTCGATCAACGAGGTGGCGTCTCTCCGCAACGTCAAAA KACTGTGC CGCCTTA LKYLTDHIKSTHSKKV AAAWKTTTCTTGAATTTTTCTCAGGAAAACCTATCGGTGGGATGGCTTC AATACCCC AAAGTC EWSYQCSICDAKAEG TTTCAGTAGAACCAAAATAACTTTTTTCAAACTTTGTTTGAAGAATTTTGT TTCGTCAT CCGAAA TGTKAARWITAHMP TTTGTCTGCTGAAAATCCCCCGATAATCCGCCAAAAAACAAACCAAAAC CTGTGGG TGGCGC KVHGIEATHRIKQNS AAAGCGAGCCKTGTCCAAATCGCAAGAGGAGGTCATCTGTCAGACTGTC GAACAGAT AAAACA EKTTNVKTANSLQE TKCCGTCCCAAAAGATGGCAGGAGTTCTCGGACGACTATTTCTCTCGGTT GACACGTG ACCTGA MALSLQKPKNGPKK CAGAGCACTCTCTCGCACCGCCCTTTCGACACGTTATTGCGGAGCGATG ACGTCATC ATCTATC VVMATSTTPEKKISEL ACGACAAAAGAGGGAGGAAAACGATCAAACTCCAGTTTTTTATTAAAGA CGTGTAGA TGAAAG ESKIQTREVAKQLSAL AAATCTGGTCACACCTKTGGTTGCTAGGGACGTGAAAATTTTAAAMAAA CGTCACGT TGCTCCA KESAQKNQQGNKTK CAAACAAAAAACAATTCTGGGAACTCTGATAGCAACAGTGAAACAAAA TTTCCCGT AACCAC NVKSSLKTIAENTNET AACTTCTCTAAAAATAAAGTTTCCAGACAAAATGGCCCATTGATTGGGG GCCTGCGG GCACAA KKISARKSLINYLKPED GCGGTAACCACAAAAAAATCGGAGAAAACCAAATCACACGCACTTTGG GAGCCCCC CTCGGA VLNHIPKEPKPASAKX AAATTGAATCCAAAAGCGATGACAACAAAGTTTTGGTCCTCCGAATACT AATCGAGC GAAAAT GLQELTGAQRLQETR GTACCCAACTAATGATTGGTACAAGTGTTACTCCCAATGGTGCCAACAC AATTTTTG CAGGGA RRFMAGNRRDSIAR AAATCCCTTGTTGGATATGGCGCTCACGATTTAAAATACTTGACAGACCA CTCTTTTG CAAGTT RESLSLGKISNSFKIEL CATAAAGTCCACTCATTCTAAAAAGGTTGAGTGGTCTTATCAGTGTAGTA AGTGTCTG GCTTCAC KNAPEKTTLKKPAVT TTTGTGACGCAAAAGCCGAAGGTACCGGTACAAAAGCWGCTAGATGG GAACGCTT GCAACG QKQNTSQNVSSSTV ATTACAGCCCACATGCCAAAAGTACACGGTATTGAAGCAACACACAGAA GAAACCCC GGCTGG VKENKTGNDVITIDD TTAAACAAAATTCTGAAAAAACAACAAATGTTAAAACTGCGAACAGTCT AGACAAAT GACAGG TETVKRKINTWCLDH CCAGGAAATGGCGCTGTCGCTCCAAAAACCAAAAAATGGTCCGAAAAA CAGGCCCA TACCCCC ESTENAWMADDIIF AGTTGTAATGGCAACTAGTACGACCCCAGAAAAGAAAATCTCTGAACTG GTCGTCGG TCCTGA WYIQKQIEISLDNKKF GAATCAAAAATCCAAACCAGAGAAGTGGCCAAACAATTGAGCGCTCTG AAAATTTC AACCGC KVIDPLIWTTYRIYGV AAGGAGTCAGCTCAAAAAAATCAGCAAGGAAACAAAACAAAAAATGTT TTTTTGAA GAGGTT ECVQDELVGFEKYFF AAATCAAGCTTAAAAACAATTGCTGAAAACACAAATGAAACMAAAAAG ATTTTTTG GAGGAT PICENGHWVLLIIDDK ATWAGCGCTCGAAAGAGCCTTATAAACTATCTGAAACCTGAGGATGTG GCGCCTGC GGACGG RVWYSDSLADKPIEVI CTCAATCACATTCCAAAGGAGCCAAAACCAGCTTCTGCGAAAAKTGGCC GAAAAAA GAAGGC EDLINKLNRTQGKFN TKCAAGAACTGACTGGTGCTCAAAGACTGCAGGAAACMAGAAGAAGG ATTTTTTAA CGCGAG QTVPKQKDGFNCGV TTTATGGCWGGAAACAGAAGAGATTCAATTGCAAGAAGAGAAAGTCTG CCGCCACA GCTTAT HVCLVAKSVITENFW TCTCTCGGCAAAATCTCAAACTCATTTAAAATTGAGCTGAAAAATGCTCC AACCCCCG GGCGGG YTEKDVNDFRKTVKL GGAAAAAACAACTCTTAAAAAACCGGCTGTCACTCAGAAACAAAACACG GGAGGCG TAACTC WLFSEGFELYSEPYK AGTCAGAATGTATCTAGTTCTACGGTTGTAAAAGAGAACAAAACAGGAA CGGWTAG GGTTGG QIQNKNISVNSEKNQ ATGACGTGATCACAATTGATGACACAGAAACTGTTAAAAGAAAAATAAA GGATATCG TGTGCT ISDNEKNWGDKTQT CACTTGGTGTCTCGACCACGAATCCACAGAAAATGCGTGGATGGCTGAC ATGTCATC AGTAGA VNESTLKERDEDIFLL GACATCATATTCTGGTACATCCAGAAACAGATTGAAATCAGTTTGGACA GACTCGTC TGATTTA RPHISVGVALKTEDEK ATAAAAAGTTCAAAGTGATTGATCCACTCATCTGGACCACATATCGAATT GGTGATCT TATCCG NQKAENLKAPQKLK TATGGTGTCGAATGTGTCCAAGATGAACTAGTTGGATTTGAAAAATACT TTGATTTT ACAGCC AIRRLKILKTCLKKLTA TTTTTCCAATCTGTGAAAATGGTCATTGGGTTTTGCTGATTATCGATGAC CTCTCTGC CCAACT VKGKPEETERAAIPNL AAAAGAGTCTGGTACAGTGATTCCCTGGCCGATAAACCAATTGAGGTTA GTCTCCTA AAGAGG MAIKLKTPPKVEPVR TTGAGGACCTCATAAACAAACTAAATCGAACCCAAGGTAAATTTAACCA TTTTGGAA AATCCT RNPEKGENYXKSQP AACGGTTCCAAAACAAAAAGACGGCTTTAATTGTGGAGTTCATGTATGT CAGTCTCG GGGAAA NKKRQIPTGKPDELV CTGGTGGCCAAATCCGTTATCACTGAGAACTTTTGGTACACAGAAAAAG ACCAAAAA GGAAAA KKVREWFEIQFQAYF ACGTTAATGACTTCAGAAAAACTGTCAAGCTTTGGCTTTTCAGTGAAGG ACCGGGCC CTTGAA EDGKSFQRLEWXTGL GTTTGAACTCTATTCAGAGCCGTACAAACAAATCCAAAACAAAAACATTT TGGCAACC AAAGTT LTAAIHKASAGDEQA CCGTTAATTCGGAAAAAAATCAAATCAGTGATAATGAAAAAAATTGGGG CACCGAAT TTTACAG VGKIIKRCPPLEIEEGE TGATAAAACTCAAACTGTGAATGAGAGTACTCTGAAAGAAAGAGATGA CCGGATGT GGCTGG MATQTETKQKPKNQ AGACATCTTTTTGCTCAGACCACACATCAGTGTTGGAGTTGCTCTCAAGA CGGAGGG TAATAG KSTKGANSSSSIREAY CAGAAGACGAGAAAAATCAAAAAGCTGAAAACTTGAAAGCCCCACAAA ATTTGGCA TTCAGC AENRARTFNKIIGKD AACTCTGAAACACGGAAGAATTCCAAGTGGACAAAAACGAGAAACCAG AGAAATGT ACAATT DKCEIPIEKIEKFFENT AATCTCCAAATGCCCAGGAAACTCCAAAAAACGAGCCAAAAATGGTTCC TGGAAATA GTAGTC TSNTNVPTETLARITS AAGTCCGAAGAATTCGGAAAAAGAAATTTCTACTGAGCTTCTGGATGCT ACGAAATT TACTGTC DLPKLEIGSWIEEEFR CAAGAAGCCGGAGAAGAGCTGAAAAGCGATCCGAAGGCTGAAAATCCT TCGTTATT TTGCAA EKEVAEALKKTKDTA GAAAACCTGTCTCAAAAAGCTGACGGCAGTGAAGGGAAAACCGGAAGA TTCAGCAC CCACAA PGVDGLRYHHLSWF GACGGAAAGAGCCGCCATTCCAAACCTCATGGCAATCAAGCTCAAGAC AATTGTCA CAAACC DPKXKLLTKLYNECRE GCCTCCAAAAGTTGAACCTGTAAGAAGAAACCCTGAAAAGGGTGAAAA AACCGGCA AGTGGT HKKIPGHWKEAETVL TTACMAAAAAAGTCAGCCAAACAAAAAGAGACAAATACCAACCGGAAA AGAAAACT TCTGCG LYKGGDETQAENWR ACCGGATGAATTGGTTAAAAAAGTCCGAGAATGGTTTGAAATTCAATTT GGATGGA GGTAGA PISLMPTICKLYSSLW CAAGCATATTTTGAGGACGGAAAATCCTTCCAGAGGTTAGAGTGGWTG CAAGACAC TCAAACT NKRIKSVTGVLSKCQ ACAGGTTTGCTCACGGCTGCAATTCACAAAGCTTCGGCTGGAGATGAGC ACAATTTA ATAATTT RGFQEREGCNESIAIL AAGCTGTGGGMAAAATCATCAAACGTTGTCCACCTCTGGAAATTGAAG CCGGAAAT GTGTGT RTAIEAAKGTKKSLSI AAGGGGAAATGGCTACCCAAACTGAAACAAAACAAAAACCAAAAAACC TGTGCTTG TTTCTTT AWLDLTNAFGSVPH AAAAGAGCACAAAAGGAGCAAATAGTTCCAGCTCAATTCGGGAAGCCT TTACGTCG TACTTGA ESIEATLIAYGFPGMV ATGCTGAAAACCGAGCGAGAACCTTCAACAAAATTATTGGAAAAGACG AATTTCCC CCCGGG TEVIKDMYNGASIRV ACAAATAGTGTGAAATTCCAATTGAAAAAATTGAAAAGTTCTTCGAGAA AATTTTGA CAACAC KTKNEKSKQILIKSGV CACAACTTCAAATACCAATGTTCCAACAGAAACACTAGCGAGGATCACT AAAAATTC ATTATAC KQGDPISPTLFNICLE TCTGATCTTCCAAAACTCGAGATTGGTAGTTGGATTGAAGAAGAGTTCA CTCGTTCC CACGTC SVIXRHLKSADGHKCI GGGAGAAAGAAGTAGCCGAAGCTCTTAAAAAAACAAAGGATACTGCCC ACTGGTCG CACAAG XSNIKLLAFADDMAIL CAGGTGTAGATGGATTACGGTACCATCATCTGAGCTGGTTTGATCCAAA GGACGCG GACGAA SDSKTKLQQELQKM AAKGAAACTGCTCACAAAACTGTACAACGAATGCAGGGAGCACAAGAA AGGTCAGA TTCATAA DDDCTPLNLIFKPAKC AATCCCAGGTCACTGGAAAGAGGCAGAAACTGTACTCCTCTACAAAGG CGATCTGC TGGCCC ASLIIEWGKVQKDQK GGGGGACGAGACGCAGGCCGAGAATTGGCGACCAATCAGTCTCATGCC ACGTCTGA CTCCCTA IKLKGQFIRSLAEQDT AACCATCTGCAAGCTATACTCTAGCCTGTGGAACAAAAGAATAAAATCC AACCCAAA AATAAA YKYLGVQTGIETRVSA GTGACAGGTGTTCTGAGCAAATGCCAAAGGGGTTTTCAAGAAAGAGAG ATCTTCGG CTCCCTA MQLMKKTVSELDKI GGTTGTAATGAAAGCATTGCAATTCTCAGAACCGCTATTGAAGCGGCAA ATTTTATG GCAACT NCSALAXWQKLDAV AAGGAACAAAAAAGAGCCTGTCAATTGCTTGGTTGGACCTTACCAATGC CAGTAG GGTGGT KTFVLPKMTYMYAN ATTTGGCTCAGTTCCACACGAATCGATCGAGGCCACACTAATTGCTTACG (SEQID CCGGCG TVPKLSELKEFANITM GTTTTCCGGGAATGGTAACCGAGGTAATAAAAGACATGTATAATGGCG NO:1170) AAGCCG RAIKVMQNIPVKGSP CATCGATTCGTGTAAAAACAAAAAACGAAAAGAGTAAACAAATCCTGAT GTTCTTG LEYVQLPIGKGGLGV TAAATCGGGTGTAAAACAGGGTGATCCAATCTCACCTACTCTTTTCAACA CCACTAT ACPKTTALITYLVSTM TTTGCCTTGAAAGTGTCATTMGTCGCCACCTAAAAAGCGCGGATGGTCA TGCGCC KKLWSTDDYIRKLHT CAAATGCATCGAMTCAAACATCAAATTATTGGCGTTTGCCGATGACATG CCAGGC DYLKMVAIKETKTKE GCAATCCTGTCAGATTCCAAAACAAAACTCCAACAAGAGTTACAAAAAA TCGCCC VTLEDLASYLSDDKTV TGGATGATGACTGTACACCGCTCAACCTTATCTTCAAACCCGCCAAATGT (SEQID CKKAVGYNSFTRVREI GCAAGTCTGATAATTGAGTGGGGAAAAGTACAAAAAGATCAAAAAATA NO: CKTLSKNKGALLSQLK AAACTAAAAGGTCAATTCATCAGAAGTTTGGCCGAACAAGACACCTACA 1293) IIAKDGKLAILVQAXK AATATCTTGGGGTGCAAACTGGCATCGAAACGCGCGTTTCTGCAATGCA DGKTKIFTHDHVKTL ACTGATGAAAAAAACKGTCAGCGAGCTTGACAAAATAAATTGCTCTGCA QKXLKKEINEALLHRF CTGGCTCMWTGGCAAAAACTGGACGCAGTAAAAACTTTTGTGCTCCCA TTEKRVKSEVVRVVQ AAAATGACGTACATGTATGCAAATACTGTACCGAAACTCTCCGAGCTCA EYPQCNSFVRDGGK AAGAGTTCGCAAATATTACAATGAGAGCAATAAAAGTAATGCAAAACAT VSIGAHRFVHKARLN TCCAGTAAAAGGTTCACCATTGGAGTATGTACAGTTACCCATTGGAAAA LLACNYNTWQDAAT GGTGGACTAGGAGTGGCATGTCCAAAAACAACTGCGTTGATAACCTATC KQCRRCGYEKETQW TGGTTTCAACAATGAAAAAATTGTGGTCCACTGATGACTATATCAGGAA HILSSCPKSMGGKITE ACTACACACAGACTACCTGAAAATGGTGGCCATAAAAGAAACGAAAAC RHDSVLKTVKEMIQT AAAAGAGGTCACACTAGAGGACCTTGCCTCCTACCTAAGTGATGATAAA GSLKNWKLKLDHELP ACCGTCTGCAAAAAAGCGGTTGGTTATAATTCATTCACAAGGGTACGAG GSTRLRPDIYLRSPNG AAATCTGCAAAACGCTATCAAAAAACAAAGGAGCACTGTTAAGCCAACT SEIILGDVTIPYEHGIE AAAAATCATTGCAAAAGATGGAAAGTTGGCTATTCTGGTACAGGCTSTG AMQTAWQKKIEKYE AAAGATGGCAAAACAAAGATTTTCACGCATGACCACGTGAAAACCTTGC EGFKYLRSTGKKLTIV AAAAASTTCTTAAAAAAGAAATAAATGAAGCCCTTCTGCACAGATTCACA PIVVGALGSWWKPT ACTGAAAAAAGAGTGAAAAGCGAAGTGGTGCGAGTGGTCCAAGAGTA TDSLVSLGIDKNTVKR CCCCCAGTGCAACTCCTTTGTCAGAGATGGAGGAAAAGTTAGCATTGGA AIPEICSTVLEYSKNIY GCGCATCGCTTTGTGCACAAAGCCAGGTTGAACCTGCTCGCGTGTAATT WNHIFGDSYQKVPM ACAACACGTGGCAGGATGCAGCCACAAAACAATGCAGAAGGTGTGGAT FFGGEKPKGQSWKK ATGAAAAAGAAACCCAATGGCACATCCTCTCATCTTGCCCAAAAAGTAT VKPPEGKTASNHEPP GGGAGGAAAAATAACTGAAAGACACGATTCTGTGTTAAAAACAGTAAA G(SEQIDNO:1415) AGAGATGATTCAAACTGGATCTCTCAAAAACTGGAAACTAAAACTTGAT CATGAATTGCCAGGATCAACCAGACTTCGCCCGGATATCTATTTGAGAA GCCCAAATGGATCCGAAATAATTCTTGGCGATGTCACAATCCCGTATGA ACACGGAATTGAAGCTATGCAAACAGCATGGCAGAAAAAAATTGAAAA ATATGAAGAGGGCTTCAAATACCTTCGTTCTACCGGCAAAAAACTCACA ATTGTGCCAATTGTGGTCGGAGCACTAGGAAGTTGGTGGAAGCCCACA ACAGACAGTCTTGTCAGTCTGGGAATCGACAAAAATACTGTAAAAAGAG CTATTCCAGAAATTTGCTCTACAGTACTCGAATACAGTAAAAACATTTAC TGGAACCATATATTCGGGGATTCCTACCAAAAAGTACCCATGTTTTTCGG CGGTGAAAAACCAAAGGGGCAAAGTTGGAAGAAAGTGAAGCCTCCTGA AGGCAAAACTGCTTCTAACCATGAGCCTCCAGGTTAAAAGCCAAAAGCC ACGGAGCATCGGGAAAGAAAAATGGAAAAGGACTGAAAACGAGACTG AAAAATCCCAAACAAAACAAATCCAAAACAAACTGAAAAAAAAAAAAA AACAAAACAAAAACTGGACAGACACTGGAAACAGTGTCAGGCAAAGTC GCCGATTATACTGTTCCACGCCTTAAAAGTCCCGAAATGGCGCAAAACA ACCTGAATCTATCTGAAAGTGCTCCAAACCACGCACAACTCGGAGAAAA TCAGGGACAAGTTGCTTCACGCAACGGGCTGGGACAGGTACCCCCTCCT GAAACCGCGAGGTTGAGGATGGACGGGAAGGCCGCGAGGCTTATGGC GGGTAACTCGGTTGGTGTGCTAGTAGATGATTTATATCCGACAGCCCCA ACTAAGAGGAATCCTGGGAAAGGAAAACTTGAAAAAGTTTTTACAGGG CTGGTAATAGTTCAGCACAATTGTAGTCTACTGTCTTGCAACCACAACAA ACCAGTGGTTCTGCGGGTAGATCAAACTATAATTTGTGTGTTTTCTTTTA CTTGACCCGGGCAACACATTATACCACGTCCACAAGGACGAATTCATAA TGGCCCCTCCCTAAATAAACTCCCTAGCAACTGGTGGTCCGGCGAAGCC GGTTCTTGCCACTATTGCGCCCCAGGCTCGCCC(SEQIDNO:1047) NeSL NeSL- Caenor- CGCGAACCAGTCATATGACAGTCTTTATTGATCGCGGTATAGGCGAGCG CGCGAACC TAGCCG MTVFIDRGIGERGQ 1_CRem habditis AGGCCAGATGGCCGTATGTAGCCTCCACCGTTATTTTTCGTTTTCACCTTT AGTCAT ATCGTA MAVCSLHRYFSFSPF remanei TTCCCCCATCCCCCCGTATGTAAATAATGGATCGTTCGGCGAAAATGGCT (SEQID AAAGAA SPIPPYVNNGSFGEN GTGGCACAGACAAATCACTGTTGCCCGTCATAGAAGTTGTTGTTCGTGA NO:1171) ACCGAG GCGTDKSLLPVIEVVV AGTTAAGATAAATTGGTCTGAGAATATTTTGGTAGTAGAGTGTCTGATA CCGTAA REVKINWSENILVVE ATGGTAAAGAGCGGAGAAAGAGTCGTTGTAAAGAGACAAAATCTGGAA CAACAA CLIMVKSGERVVVKR AAAGTTATTCAGAATTTGGCAAGAATCAACTCAACTCTATTTTCCAATCT GCAAAG QNLEKVIQNLARINST AGGAAATCAGATATTTTGCGTAGTACCCAGAATAAAAGACAGTACCAAT TAAACA LFSNLGNQIFCVVPRI AAAGAGCAGGGATACAGGAAAGAGAAGCAAWTGAAATTCCATGTATC AAAGAA KDSTNKEQGYRKEKQ ATTCCGAAGTATAAAATCCCAAGTTCCACCATATTTGAGAGGTGGGGGA AAATCA XKFHVSFRSIKSQVPP GATGTAATGGAAGATACAGAGATAAGAGGTATCAGAAAGTTGGAGCCA ATAAAA YLRGGGDVMEDTEIR GAGGCTCAGTTAGACAGCTCAAAACCGCTGATCTGCAGAGTTCTCTACC AGGAAG GIRKLEPEAQLDSSKP CAACGCAAGGTTATATGTATAAATGTTTTTATCCAAAGTGTAAAGGACA GTTGAC LICRVLYPTQGYMYK TAGTAATGGATCAACAGATCTGAGAAGTCTGAAGAAACACATGGTGGA CTCAGA CFYPKCKGHSNGSTD TAAGCATTTCACGAATATTGAATTTGCATATAAATGTGCTACGTGTATGT CCCCGA LRSLKKHMVDKHFTN TTTTAACGACTGGGAAATCGGCCACAGCGTTAAAATCAATAAAGGCACA GGAGGG IEFAYKCATCMFLTTG TATGGCAAGTCACCACAAGGTAACGATGGAACCCGGTAAAAAGAGTCT AAGAGA KSATALKSIKAHMAS CGTGCAAAAGTTGAATGCCAGACTCGAAGAAGCTGCTCCATCACTTCCA GACACC HHKVTMEPGKKSLV ATGCCGAGAAATCGATCAAAGGTCATACAGTTGACCCCCGAGAAATCGA SAGAAA QKLNARLEEAAPSLP TATCGGAATTGGAGAAAAAGAAGCAAACTCGTTCTGTGGCAAAACAGC AAGAGA MPRNRSKVIQLTPEK TTAGCACACTGAAAGAGTCGGCACAGAAAAAGGAAGAGGAGGTGAAG GACGCA SISELEKKKQTRSVAK ATAGCGGAGGTCAAAAAGAGAGAACCCCGTCTATCAATAATCCCAGAG GAGAAA QLSTLKESAQKKEEEV TCGAATGTCAGGCGAAGTCTGGCGGCAGGACTCGAACAATGTATAAAC AGGAGA KIAEVKKREPRLSIIPE CCTGAGCAATCGGTAGCTCAGAGGATAAGAGAAAAAAGAGAAGAATAC GACACC SNVRRSLAAGLEQCI GCCAAAGCTTCTAGGGAGGCAGCGGCAAAAAGAAGATCGAGTTTGGCA TCTCATA NPEQSVAQRIREKRE ATGAAGCCAGCTAGATTACCAGACAAAGAAAACGAGATTACACTCCAG AGGAGA EYAKASREAAAKRRS GAAACGAAAAAGATCGATGATCCAATCGTTATAGACCTGGAAAAAGAA GGTAGG SLAMKPARLPDKENE TGTATTCTCACTACAGTACTTCAAGTCCCAAGAAACCAGTTCAACTCGTG TCAATCC ITLQETKKIDDPIVIDL GTGTCTAGAGCATGAGACAACGATTGACGCTTGGTTAACGGATGAGGT AAATGT EKECILTTVLQVPRNQ AATACATATGTACATGTGCACAATAACCGAGAATCGAAAATATTTTATG AAACAG FNSWCLEHETTIDA GCAATCGATCCGGTTCTGTGGCCAGTCTATGTGAGAAATGGAGCAGAG AAAAAA WLTDEVIHMYMCTI GATCTACTGAGGCGTACTAGTTGCCCAGGAACATTCTTCTTTCCAATTTG CCAGTG TENRKYFMAIDPVL TGAAAGTAATCATTGGGTTCTATTAGTGATAGAACACGATGTGTATTGG GGGAGG WPVYVRNGAEDLLR TATCTGGATCCGAAAGGCGAGGAACCAAAAGGAAATGTAGAGATTCTT AAAGAA RTSCPGTFFFPICESN TTAGAGTCCATGAAAAGGAAAAGGCAGTACTATGAATTCCCACCACCCT AGACTG HWVLLVIEHDVYWY CACAGAGAGATAATGTGAATTGTGGAGTGCATGTCTGTCTTATGGCAAA ATTTCAC LDPKGEEPKGNVEILL ATCAATAGTAGATGAATGTGGTTATAATTGGTATTCTGAAGAGGACGTA CCACTA ESMKRKRQYYEFPPP AGGTCATTCAGAACCAATATGAAGGACATTCTGAAAAGTAAGGGATAT AAATGA SQRDNVNCGVHVCL GAGTTATGTCCTGAGCCTTATAATAGGCAAAATTTATTAAAAACAGAAA ATTTGG MAKSIVDECGYNWY AACAAAAGGAAGTTATTCTGGAAGAAATGATCGATTCATTCGTTGTAGA AAACAG SEEDVRSFRTNMKDI AGACGATATGACGTTCACAGTGCATCGGGATTCTGATCATGGTGATGAT AATTTG LKSKGYELCPEPYNR GAAGTTGAACATCTGAAGACCATTGAGCAGGAACCTGAAAATGAAATA GAAGAG QNLLKTEKQKEVILEE AGTGAAATTGAGAATGTAGAGGGATCTGTAGACTCAGTCATTCCAAAGT AAAAGA MIDSFVVEDDMTFT TGATGGAAATGAGAGTGCAGACACCTCCAGTGATCAATGAAAAAAGAG GAAAGG VHRDSDHGDDEVEH GTAAAAAGCGAGTATCGGCCAAAGAGAAACCGAGAAAGCAAAAGGAA GAAACC LKTIEQEPENEISEIEN AAAGAGCAAAAAGTGCCAACAGGAAAACCAGATGAGCTGGTTAAAAGA TAAAGA VEGSVDSVIPKLMEM GTAAGAGTATGGTTTGAGAAAGAATTCAAATCGTATGTGGAAGATGGA AAATAG RVQTPPVINEKRGKK AAAAGTTTCCAAAGGTTGGAATGGMTAACAGATGTTCTCACTGCAGCA TTCTCTT RVSAKEKPRKQKEKE ATTCAGAAGGCGTCAGCCGGAGATGAGAAAGCAGTAGAACTGATTGAG GCCAAA QKVPTGKPDELVKRV AAAAGATGTCCACCTTTGGAAWKCGAGGAGGGTGAAATGTGTACCCAG ATTCTGT RVWFEKEFKSYVEDG ACTGAAAAGAAAAAGAAACCAAAAAGTGGTAAAGGGAATGGCGGTCA AGAGGA KSFQRLEWXTDVLTA AGAAAGTATGAAGTCCTTGATGGCCTCATACAGTGAGAACCGAGCCAA ATACTTT AIQKASAGDEKAVELI AACCTACAATAGAATAATTGGTAAGCATTCAAAGCAGTGTGAGATCCCA GTCAAA EKRCPPLEXEEGEMC ATAGCCAAAGTACAAAAGTTCTTTGAAGGGACCACTGCCGAGACAAATG ACATGA TQTEKKKKPKSGKGN TGCCAAAGGAAACACTTAAGGAAATGTGTTCACGCCTCCCGAAAGTTGA TAGAAA GGQESMKSLMASYS AGTGGGAACGTGGATTGAAGGTGAATTCAGTGAAAGTGAAGTGACTGA CCAGTA ENRAKTYNRIIGKHSK AGCATTGAAGAAGACAAAGGACACAGCACCAGGGGTAGATGGATTAA ATCTGG QCEIPIAKVQKFFEGT GGTACCATCACCTGAAATGGTTTGATCCCGAGTTGAAAATGCTGTCACA TACGAA TAETNVPKETLKEMC GATCTATAATGAGTGTAGAGAACACAGAAAAATTCCAAAGCATTGGAA AGACAA SRLPKVEVGTWIEGE AGAGGCAGAGACAATTCTTCTCTATAAGGGAGGAGATGAGTCMAAAM GTAAGA FSESEVTEALKKTKDT CGGATAATTGGAGGCCTATCAGTCTGATGCCAACCATCTATAAACTGTA CCTGAA APGVDGLRYHHLKW TTCTAGTCTCTGGAACAGAAGGATTAGAGCAGTGAAAGGGGTGATGAG CTGACA FDPELKMLSQIYNEC CAAGTGTCAGAGAGGTTTCCAAGAAAGAGAAGGATGTAATGAAAGTAT AGAAGG REHRKIPKHWKEAET CGGAATATTGAGAACAGCCATAGATGTGGCCAAGGGCAAAAAGAGAAA AAGTCA ILLYKGGDESKXDNW CATAGCCGTAGCATGGTTAGATCTCACGAATGCCTTTGGATCAGTACCA GAAAGA RPISLMPTIYKLYSSL CATGAGCTGATAAAAGAAACTCTGGAATCTTACGGATTTCCAGAAATAG AATACC WNRRIRAVKGVMSK TAGTAGACGTCGTAGAAGACATGTATCGAGATGCATCGATCCGTGTGAC GCTCAC CQRGFQEREGCNESI GACGCGAACGGAGAAAAGTGATCAGATTATGATCAAGTCAGGAGTGAA AAAGCC GILRTAIDVAKGKKR GCAGGGAGATCCAATCTCGCCTACTCTCTTCAACATGTGTCTCGAGAGT TGTGAT NIAVAWLDLTNAFGS GTCATCAGAAGGCATCTCGACAGATCAGTCGGCCATCGGTGCCTGAAAA CGATTCT VPHELIKETLESYGFP CAAAAATAAAAGTATTAGCCTTTGCAGACGATATGGCAGTATTAGCAGA CTTACCT EIVVDVVEDMYRDAS AAGTAGTGAACAGTTGCAAAAGGAGTTGACAGCTATGGATGCTGACTG ACTGAA IRVTTRTEKSDQIMIK CTCAGCACTGAATTTGCTATTCAAACCGGCTAAATGTGCAAGTCTGATAT CTTGTTC SGVKQGDPISPTLFN TGGAAAAAGGAATAGTAAACAGGTTAAATGAGGTAGTTTTGAGAGGGA TCTTGG MCLESVIRRHLDRSV AACCGATCAGAAACCTCATGGAAAATGAGACCTACAAGTACTTAGGTGT CCTCGTA GHRCLKTKIKVLAFAD TCAGACAGGTACGGAAACAAGGGTTTCCATAATGGATCATATAACGGA ACCGGC DMAVLAESSEQLQKE AGTGTCAAGGGAGATAGATCTAGTGAATATGAGTCAACTGGCAATGCA TAAAGG LTAMDADCSALNLLF CCAGAAACTAGATATACTCAAAGCCTTCATACTTCCAAAGATGACCTATA GAGAAG KPAKCASLILEKGIVN TGTATCAGAACACGACACCTAAACTGTCAGAACTGAAAGTGTTTGCCAA GAATGT RLNEVVLRGKPIRNL TTTGGTAATGAGGTCAGTGAAGGAATTCCACAACATTCCCCTAAAAGGG TAATTG MENETYKYLGVQTG TCACCGTTGGAGTATGTCCAACTTCCCGTAGGAAAAGGAGGATTAGGA GAGATA TETRVSIMDHITEVSR GTGGCATGTCCAAAGAACACAGCCTTATTAACATTCTTGGTAACCATTAT GACATA EIDLVNMSQLAMHQ GAAAAAGTTATGGTCATCTGATAGCTATATCAGAAAGTIGTATACAGAC AAGATA KLDILKAFILPKMTYM TACCTAGAGGAGGTGGCAAAAGTGGAAATTGGAAAGTTCGAGGTCAAC GGTGGA YQNTTPKLSELKVFA TTGAACGATCTAGCAGAATTCCTAAGTGACGAAAGAGCAGTCGACAGC GTGAAG NLVMRSVKEFHNIPL AAGTTGTTCGGCTTCAATGCGTTCACGAGGGTGAGAGAAGTGGTGAGG GTCCTG KGSPLEYVQLPVGKG AGTCTCTGTAAGAATAAAGATTCTCCACTACATAGTCTGAAAATAATTGA TTCTTGA GLGVACPKNTALLTF AAGAGAAGGGAAACTTGCCATAAGTGTGCAAGCAACCGAAGAAAGTAT AACTAG LVTIMKKLWSSDSYIR TGAGAAAATCTTCACTGAAGACCAGGAAAAGAAGTTAATGTACCTACTG GAGGAA KLYTDYLEEVAKVEIG AAAGGGGAGCTAAATACAGCTCTCCAGCACAGGTTCTTTACTCAAAAGG TGTGGA KFEVNLNDLAEFLSDE TATTCAAAAGTGAAGTAATGAGAGTGGTTCAACAGCATCCACAAAGTAA AAGAGC RAVDSKLFGFNAFTR CAGTTTTGTCAGAAATGGTGGAAAAATGAGTTTTTCGGCTCAAAGATTT AGAAGG VREVVRSLCKNKDSP GTCCACCCAGGAAGACTGAACCAGTTGCCATGTAACTACAACACTTGGG CCGCGA LHSLKIIEREGKLAISV CAAAAGGCCGTACGAAGTTGTGTAGAAGGTGTGCAAAGAATGAAAATG GGCTTT QATEESIEKIFTEDQE AGACACAGTCGCATATACTGCAAGTGTGTGACTACTCAATAGGAAATAT AGACGG KKLMYLLKGELNTAL CATAAAGGAAAGACACGATGCAGTTCTTTATAAGTTTAGAGAACTCATT GTAACT QHRFFTQKVFKSEV AAAAGAGGGTCAAAAGGTCATTGGTTAGAGAGAACTGACCGGACAGTA CAGTCA MRVVQQHPQSNSFV CCAAATACTGGATCACAGCTGAAGCCAGATCTCTATCTGGAAAGCCCAG GTTGCT RNGGKMSFSAQRFV ACGGGAAGCATGTGATACTAGCCGATGTGACAGTTCCATATGAAAGAG AGTGGT HPGRLNQLPCNYNT GCATCGAAGGAATGCAAAAGGCATGGAATGAGAAAATCAACAAGTATA CTTCGG WAKGRTKLCRRCAK CTGATGGATATAAAGAAATATTCAGAAGACAAGGAAAATCCCTAGTAGT ATCCAA NENETQSHILQVCDY GTTACCATTAGTAGTTGGTTCACTGGGAACGTGGTGGAAGCCCACGGA CGGCTT SIGNIIKERHDAVLYK GGAAAGTCTGATCAAACTAGGTGTTGAGAAGACTACAGTAAGAAGGAT CGGACA FRELIKRGSKGHWLE AATACCTGAGACGTGTGGAATGGTGGCTGAATACAGTAAGAACTGCTA TAGTGA RTDRTVPNTGSQLKP TTGGAGACACATCTACGGTGAAAAGTATGTTCAAACTCCAATGATAAAT GGAACC DLYLESPDGKHVILAD GGAGGAAAAAAGCCTGAAGGAAATGATTGGAAAAAGTGTGAAAAAGG CTGGGT VTVPYERGIEGMQKA AATAGAAGTTCCTAAAGTTACTAATTAGCCGATCGTAAAAGAAACCGAG ACGGAG WNEKINKYTDGYKEI CCGTAACAACAAGCAAAGTAAACAAAAGAAAAATCAATAAAAAGGAAG AAGAAA FRRQGKSLVVLPLVV GTTGACCTCAGACCCCGAGGAGGGAAGAGAGACACCSAGAAAAAGAG TGGAAA GSLGTWWKPTEESLI AGACGCAGAGAAAAGGAGAGACACCTCTCATAAGGAGAGGTAGGTCA AGAGAT KLGVEKTTVRRIIPET ATCCAAATGTAAACAGAAAAAACCAGTGGGGAGGAAAGAAAGACTGAT AGGGCG CGMVAEYSKNCYWR TTCACCCACTAAAATGAATTTGGAAACAGAATTTGGAAGAGAAAAGAG GGCAAA HIYGEKYVQTPMING AAAGGGAAACCTAAAGAAAATAGTTCTCTTGCCAAAATTCTGTAGAGGA GGCTAA GKKPEGNDWKKCEK ATACTTTGTCAAAACATGATAGAAACCAGTAATCTGGTACGAAAGACAA GTTCATA GIEVPKVTN(SEQID GTAAGACCTGAACTGACAAGAAGGAAGTCAGAAAGAAATACCGCTCAC CACTGTC NO:1416) AAAGCCTGTGATCGATTCTCTTACCTACTGAACTTGTTCTCTTGGCCTCGT ATGCAA AACCGGCTAAAGGGAGAAGGAATGTTAATTGGAGATAGACATAAAGAT CCACTA AGGTGGAGTGAAGGTCCTGTTCTTGAAACTAGGAGGAATGTGGAAAGA AACCAG GCAGAAGGCCGCGAGGCTTTAGACGGGTAACTCAGTCAGTTGCTAGTG TGGGAT GTCTTCGGATCCAACGGCTTCGGACATAGTGAGGAACCCTGGGTACGG CTGCGG AGAAGAAATGGAAAAGAGATAGGGCGGGCAAAGGCTAAGTTCATACA GTGAAT CTGTCATGCAACCACTAAACCAGTGGGATCTGCGGGTGAATCACTTTCG CACTTTC AAAAGAAGTGAATGGACGTGCTGATGTCTGACTTTAAAGAAGTCTGAA GAAAAG ATTAAAAAAACAGATATAAAGGCCCCTCACTATAAACTCCACAGCAACA AAGTGA GGTGGTCCGGCGAGGCCGGTTCTTGCCACCATTGCACCCCAGGCTCGTC ATGGAC (SEQIDNO:1048) GTGCTG ATGTCT GACTTTA AAGAAG TCTGAA ATTAAA AAAACA GATATA AAGGCC CCTCACT ATAAAC TCCACA GCAACA GGTGGT CCGGCG AGGCCG GTTCTTG CCACCAT TGCACC CCAGGC TCGTC (SEQID NO: 1294) NeSL NeSL- Trichomonas GGGTGAGTAGTCTAGTGGTATGATTCCTGTTTTGGGTACAGGAGGTCCC GGGTGAG TAAGAA MIPVLGTGGPEKLPL 1_TV vaginalis GAGAAGCTTCCACTGCAATCGTACGTGTACTGTGGCAACACAGCTATAA TAGTCTAG GAGATA QSYVYCGNTAITDSF CAGACAGTTTCACGCCAACCGCGAAAACGATTTTGAAGCCTGAGGAACA TGGT(SEQ AGACGA TPTAKTILKPEEQNLD AAATTTAGATATCGTTTTGAAAAATATTGCAGCGTTGAATCCAGAAAATT IDNO: GTGAGA IVLKNIAALNPENYSD ACTCCGACTTAATCAGGAGCCTATCGAAGATGGAGTTCAGATTAGATTA 1172) AGAACA LIRSLSKMEFRLDYPK CCCGAAAGAAATAGAGAATTACTGGATTTCGGAAAAATTATTTAGCCAA GAAGCA EIENYWISEKLFSQSIA TCCATCGCATCATTGCCCATCAGTTTGTTAGTCGCATCCATGTTCTCACCT TAGTAG SLPISLLVASMFSPED GAAGACCGTGACTTGAGTACAGAACCGTTCCACTGTAACGCTGATGGCT GATTGG RDLSTEPFHCNADGC GTAATTTCCATTGTGACAATTGTGAAAGAATGGTTGAACACATCAGAGA CAGAGC NFHCDNCERMVEHI GCACCATAACACTGACCCCATGATCAATACATTTGAAACAACAGAAGAC TTAAGC REHHNTDPMINTFET ACATTTAGAAGAATAACGGCCATCAAAATAGACAAGACAGGCATCGAA GATGTC TEDTFRRITAIKIDKTG GAACTTAACCCTCTAAAATACAGATGCTCGTATTGCGACGAGTTATTCAC ACTCGG IEELNPLKYRCSYCDE CGAAGCAGAAGATCATGCCATCCATATGATTTCACATCTCACAGAAAAA TACGAA LFTEAEDHAIHMISHL TTATCACCAGATATATCTTTCTTTTTCAACGACATTTTACGCCTTTACAAA ACGTGT TEKLSPDISFFFNDILR ACTATCGACAAACCAACAGTACAAAATTTATTTCCAGAAACACAAGTCG ACCAAA LYKTIDKPTVQNLFPE CAATTTTTGACACACTTGAAGAAACAAACAGATTCAGACTTATCGTAGG CACCGG TQVAIFDTLEETNRFR AAGAGAAGCCATAGAAACAATTGAAGAAGCATTCCCTCCAAGTCCACCA ATTCCGT LIVGREAIETIEEAFPP GGAACAGATCGGAAACCATCCATAATCATCACAGACACCTGTCAACTCA GCTAGG SPPGTDRKPSIIITDTC GGTTTGTACCATGCATGGATGAACCACCAAAAGGAGATCTCGGAATTCT AATCAC QLRFVPCMDEPPKG GACTCTACTTTTAAGAGATTTCAGCGCACACAATATCCCGATTAAATCAC AAGCCA DLGILTLLLRDFSAHNI TGAACAATAAGGAACTAATTGCTGATAAAGACATCGATTACAGCCCAGA AAATAA PIKSLNNKELIADKDI TTTTGTCGAAGGAGCTCTAGCCAACGCAGAAGAACATGATACAACGAAC AAGAGA DYSPDFVEGALANAE AGCCAGAACAACAATGGAAGATACATTAACTCAGCCGAAAAACTTACAG CACCAC EHDTTNSQNNNGRY AATTTTTAATACAATGTGAAGACTACTTAACGAACATCAAAACACTTGAA GAAAAT INSAEKLTEFLIQCEDY GACTTAGAACGTTTCTACACAACGATTAAAGACTACAGAGTCAACAAAG TACTCAC LTNIKTLEDLERFYTTI AGGTTATCGCCGAAGATACACCAATCTTTGTATATTTCCTAGTAGAAGAA CCTCCCT KDYRVNKEVIAEDTPI GGGAAATTACCAAAACCAGGTCTTAGATGCCCACTTGAATCATACGAAG CAAACA FVYFLVEEGKLPKPGL GACACGAAGACAAGGCATTCGAATCACTGAGAAAACTTTGCGACCACTT GATAAT RCPLESYEGHEDKAF CAAAGGAGAAATCGCGAAAACGAGCTTTGACCCAAAGGTTCACACCAT AATATTA ESLRKLCDHFKGEIAK AGACATCTGGGTTGAATTTTTGGCCCAAGCCTATGGCACAGGCACGTTT ACCTCCC TSFDPKVHTIDIWVEF GTCTACAAAGATGAAAACGGAAACATCGACCTTGATACGCACGTATTCA ATCCATC LAQAYGTGTFVYKDE AATGCCCTTATGCAGACTGCTCATACACGAACAACGACAGATCAAAACT AGTCCG NGNIDLDTHVFKCPY CATGGACCACATGAAAACGAAGAAACACGCCAAGAACGTATACATCGA TATGGT ADCSYTNNDRSKLM GAGATACGGCTTCTTTTGGGGTATTGTCATAGAAGGAGTCAACCGACCA CTGATA DHMKTKKHAKNVYI AAAGGAATCGTCTACCCGACACTCAAAGACATCAAAGAACACGCTTGTC ACAGAC ERYGFFWGIVIEGVN GCAAATGTCCAGAAGCAGGATGCAACACATATGTAACAGAATTGAGCG TAGCAC RPKGIVYPTLKDIKEH ACATCAAAGAACATCTAAAGAAGAAACATAAGTCTACAACAGCAGGAG CACATCC ACRKCPEAGCNTYVT TAGACGGAGAAATCGCGCACACTGATGCTACATACTGCTGGATTACCAA ATGATA ELSDIKEHLKKKHKST AGAAGAACTCGACGCATTACATGCCGAGAGAGCAAGAGAAAGAGCAG CACTCAT TAGVDGEIAHTDATY AGCAAGTAGACAACACTCCAGTACAACAGATAATTAATGCTGACAACAA TGGAGT CWITKEELDALHAER TGAAGAGAACAACGAGAACCAAGAAGACAACGGAAACAACGAAGAAG GAAAAC ARERAEQVDNTPVQ CAGATGCCCTCGACCCGCCAAATAACACAACAGAGACAGAAGATGAAG CACCAA QIINADNNEENNEN CGGTTCATGCCGTCATCATCAATCCACCAGCAACAGAAGAGGAAGAGGT CAACAA QEDNGNNEEADALD AGCCATCATCGCCGAGGCAAGAAGAAACATTCCAGAACTCCAACAAGC ATCCACC PPNNTTETEDEAVHA AGAAGAGAGAGGCTGCGTTACACCGAAAATGACATCACTCGTCCGATTA TAGACC VIINPPATEEEEVAIIA AAACTATTGAAAGGAGGAGGAGAACTTTTCAACAAGAAACTCACTCCAT AAATCCT EARRNIPELQQAEER TAGCCACAAGATACGCAGCTACAGGAAATACAGAAGCAGACAAAATCA GCCCCA GCVTPKMTSLVRLKL AGGTAGATTACTTGACACTAAAATGCAATGCCGCCTTGAGAGAAATGAT CCTCCAC LKGGGELFNKKLTPL CTACACCAATAACCACAGCGAATCAAAGTTTATGACAGCAGAAAATGGA CCAAGT ATRYAATGNTEADKI GAAGACACAGCACCACCGCCAAGGATATCGGAAGACACAAGAGATCGC AGCTCG KVDYLTLKCNAALRE ATTCAAAAAGCAGCCAATGAAATAAAAGGAACTCTCATCAAAGTAGTCA CTTCGCT MIYTNNHSESKFMT AACACATAAGTCACGCGAGATGCCTCAAAGACAGCACGAGAGACGATG CGCTCA AENGEDTAPPPRISE AACACAATAAATTCGTCGAAATGATTGCAAAAATCAAAAACGATCTCAG CCTAAA DTRDRIQKAANEIKG AGATAACAAATTCGAACAATATAACATTGAAGAAATATTTCAAGGACCG ACTTTGC TLIKVVKHISHARCLK ATCTCCGACCAGAGTATTCTCAACATCGTCAACACGGAGGACAACAACG TCGCTC DSTRDDEHNKFVEMI AATTCATCAAGAAAATGGATTACATTAACCGAATTCTCGGAACACCACA GCTTCG AKIKNDLRDNKFEQY GGATGCATCACCATATGCAAGGAAGAAGTTACAAGCATGTTTCGCCGAT CTCGCTC NIEEIFQGPISDQSILN AACCCAACAAAGACTCTCAGAAACATAATCTTAGCCGACAAAGTTCCAC GTCTTAA IVNTEDNNEFIKKMD AACAATCATTGAAGCCAAGCGAATACCTTGATTACTACGGACCTCAATG CCCTTTC YINRILGTPQDASPYA GGCAAACGAAGCTGAAGGCTACGAAAACTTCCTGCATCATGACTACGCG CGAATA RKKLQACFADNPTKT TTACCGGAGAGATATGGCCAAGTTTTCGCAAACGACTTCCTCGACTTCAT AACACTT LRNIILADKVPQQSLK GACAAACGAATCGAAGATCATCGAAGTAATCCGCAACAAGAATCATTTA ACAATTC PSEYLDYYGPQWAN TCGGCACACGGCCTCGATGGAATTCCGAACTCAGTTTACATGCTATTCCC CCGGCT EAEGYENFLHHDYAL AGTCAGCGCCGCAAAATTCCTCAGTATATTATTCAGATCAATCATCATAT CGCCCC PERYGQVFANDFLDF CAGGTCACATCCCAGACTGCTGGAAGCTCTCCAAGACAGTGATGCTTTT ATTTTTT MTNESKIIEVIRNKNH TAAGAAGGACGACCCATCGTTAGCAAAGAACTGGAGACCAATCGGCAT (SEQID LSAHGLDGIPNSVYM CACGTCATGCACTTACAGAATCTTCATGACTTTAGTCAACAAAGCGTTAC NO: LFPVSAAKFLSILFRSII AGATGATCCCAATGTTCCACGCAATGCAAAAAGGTTTCGTTCGCGGAGC 1295) ISGHIPDCWKLSKTV AACACTGAGTGAGCACATTGCAGTCGCGAACGAAGTCCTTTGCCAATCA MLFKKDDPSLAKNW ACCAGAACACAGTCTGAAATGTTCCAAACAGCAATCGATTTCACGAACG RPIGITSCTYRIFMTLV CTTTCGGCACAGTTCCTCATCAATTGATCTTTGATTCTCTCGAAGCGAAG NKALQMIPMFHAM AAAGTTCCCGATTCGATCATCAATCTGCTCAAGGACCTCTACAAAGGAG QKGFVRGATLSEHIA CAAGAACGGCTATCTATACAAGACATGCACACTCCGAGATAGTTCCGGT VANEVLCQSTRTQSE TCGCAGAGGTGTCATCCAAGGCTGTCCACTCAGTCCAATCCTCTTCAACT MFQTAIDFTNAFGTV GCTGCTTAGATCCTTTATTATATGCAGTCCAGAGGAGACACTTTGAGGA PHQLIFDSLEAKKVPD CGGTTACAGATTCCAAGACAAAGCAGGACAGTATTCAATTGCCATTCAA SIINLLKDLYKGARTAI GCTTACGCTGACGACGTTCTAGTCATCTCTCCAACACATGAAGGAATGC YTRHAHSEIVPVRRG AAAGAATCTTAAACACAGTAGATGAATTCCAGAAAATTGCGAAACTCAA VIQGCPLSPILFNCCL AGTTGCACCACAGAAATGCGTCACACTTGCCAAAACATCCACTGCAATC DPLLYAVQRRHFEDG CAACCTTTCCGCATTGGTCCAGACGAAATCCCAATCAAGACGAGCATGG YRFQDKAGQYSIAIQ ACAACATCACATATCTTGGAATACCAATCTCTGGAACAAAGACATCAAG AYADDVLVISPTHEG ATTTGCAGCTGCAACTGGCATTCTGGAAAAGGTCAAAGCACAGATCAGA MQRILNTVDEFQKIA GTCGTCTTCGCGTCACATCTCGCTCTCTCTCAGAAGATTATCGCTCTCAG KLKVAPQKCVTLAKT AGTCTTCATCTTGCCACAACTTGACTTTTACATGTTCCACAACGTATTCAG STAIQPFRIGPDEIPIK AGTCAATGACTTGAAAGCGACAGATCAGATGATCCGAGGCCTGATCGA TSMDNITYLGIPISGT CAAAGAAGCGCCGACGTCAAACATTCCGGTTTCATTTTTCTACATGCCGA KTSRFAAATGILEKVK AGAACAAAGGCGGCTTTGGACTCGTTAAATTGGAACTTCGCCAGCCTCA AQIRVVFASHLALSQ GCTCGTTCTCACTAAATTTGCGAGGTTATGGTTAAGTCAACAAGCAGAA KIIALRVFILPQLDFYM ACCAAAGCCTTCTTTCACACAATGGCTCAAGAAGAGAAGTCATTCCGCA FHNVFRVNDLKATD AGGTCGTCGAAGACCAAGAAAATGGTTTCTTAGGCATCAAGATGGAAA QMIRGLIDKEAPTSNI ACGGCAAAATTGTCCAGAAGAACGAAAGATCCAAACGCACAAATTGTTT PVSFFYMPKNKGGF CATCACACAGGCGGCTAAAGCAGCAGACAAACTGGAAGTCAGATTCAA GLVKLELRQPQLVLTK AGAATGGGACAAAGGAGGCATACAAGTCAGAGGTGTAGGAGAAAATG FARLWLSQQAETKAF CAACAGACTGGTACCGCTCGAAACACATCGGCCAAATCTCACCCTTAATC FHTMAQEEKSFRKVV GGTCGCGTCATCCAACAGAGGCAGTACGAGGAGTTCAAGAAAGACGAA EDQENGFLGIKMEN ACACACTCACACACTTTCTGCGAACCAGCAGCGCTAGCGGAGTCACACG GKIVQKNERSKRTNC ACATCATGAAGAGACCACAAGCTGTTCCAAACAACCTCTACTCAGCGGC FITQAAKAADKLEVR TATTGCTCTCCGTACAAACACAGCTCCAACCCCAGCAAACATGCACTTCC FKEWDKGGIQVRGV ACAACCCAGAAGTTTTGGCTAATTGTCCATTGTGCGGATGCCAATCCTGC GENATDWYRSKHIG ACTCTCTTCCACACATTGAACATGTGCAGAAACCGTTTCAGTCTATACAA QISPLIGRVIQQRQYE ATGGCGCCACAATATCATATGCGATGACATTTACCAATTCATTCACGATC EFKKDETHSHTFCEP ACTATCCAGGAGTAACCATCAAATGCTCGGCGAGAATTACAAGTGACGG AALAESHDIMKRPQA CTACCAAACAACAGGCCCAGAGCTCGACGACACAGTTAAAGATCTCCTC VPNNLYSAAIALRTN CCAGACCTTGTTGTCTACGATGAAGCGAACAAGATGATCAAGATCATTG TAPTPANMHFHNPE AAGTCACATGCCCTTACGGCACGGACAACAATGTTGGCAACTCTCTTGA VLANCPLCGCQSCTL CGCGGCATACGACAAAAAGGTTAACAAGTATAAGAGCCTTGCTGAACA FHTLNMCRNRFSLYK AACAGAGAGATTATTTAACTGGACCACGACGCTCTCAATTATCGTAGTCT WRHNIICDDIYQFIH CATCACTAGGAGTCATCCCTCTCCGTACAAAACTCGACGCATTGAGAATC DHYPGVTIKCSARITS TCACCTGCAGATCACATACAGCTACTCAAGAGACTTTCGATGCACGCGA DGYQTTGPELDDTVK TAGCTGCGAGTGCTTGCATTGTTTTTGAAAAAGTGCCAGAATTCTTCGGT DLLPDLVVYDEANK ATGCGCTGCCGTCCCCTCCCAGGACGAGTCACAGCTCCCAATGCAGCGA MIKIIEVTCPYGTDNN TCCCACCAAACAACAATGAAAACAATAACGACACAGATCATGGTCAGGA VGNSLDAAYDKKVN GAACCAACAGGCAACCTCTGAAGAGCAACCAACCAACAATGGAAATGC KYKSLAEQTERLFNW TCAAGAAGACAATGGCCAAGGCGAACAAATAAATAATTCAACCGAACA TTTLSIIVVSSLGVIPLR AACTATCTCTGTTGATCAAATCATCGAAGAAGATGCTGAGAACAACGCG TKLDALRISPADHIQL ATAGAACAAGCCTTAGACCAACCCGATGAGGACGAATTCCTTAACTAAG LKRLSMHAIAASACIV AAGAGATAAGACGAGTGAGAAGAACAGAAGCATAGTAGGATTGGCAG FEKVPEFFGMRCRPL AGCTTAAGCGATGTCACTCGGTACGAAACGTGTACCAAACACCGGATTC PGRVTAPNAAIPPNN CGTGCTAGGAATCACAAGCCAAAATAAAAGAGACACCACGAAAATTACT NENNNDTDHGQEN CACCCTCCCTCAAACAGATAATAATATTAACCTCCCATCCATCAGTCCGT QQATSEEQPTNNGN ATGGTCTGATAACAGACTAGCACCACATCCATGATACACTCATTGGAGT AQEDNGQGEQINNS GAAAACCACCAACAACAAATCCACCTAGACCAAATCCTGCCCCACCTCCA TEQTISVDQIIEEDAE CCCAAGTAGCTCGCTTCGCTCGCTCACCTAAAACTTTGCTCGCTCGCTTC NNAIEQALDQPDEDE GCTCGCTCGTCTTAACCCTTTCCGAATAAACACTTACAATTCCCGGCTCG FLN(SEQIDNO: CCCCATTTTTT(SEQIDNO:1049) 1417) NeSL NeSL- Caenor- GACTCGCCTTGGGGAAGGTWTTTCAGGGGKSAATTGCCGMAGGCAAG GACTCGCC TAAACC MRYHXSNXPAXRTS 2_CBre habditis GCAGCCCCCSMMTAGCTTACAAAGTAAGTACMCATTTTCATTTCTTGTG TTGGGGAA GGCTCC DNXWRSIXKDVRRP brenneri AATTCTTTAAACATATTTTTCTTGTTTTTTGATTTCTTTTTTCTCTACCTTCC GGTWTTTC TCTGGG DPSTIEEKSRYNRSIGI CCCAATTCTTCCCCTCATCTTGTGTATACATCCCCCTCCTCCAACCAATCA AGGGGKSA AGGAGG PDSLKXRSSAVRSXSS ATACATTGACCTCTCTCTTCTGTCAAAAAATCAATACTAGTATATTGTCCC ATTGCCG TATGTCA XPPSGPQDVRLXNSP TTGTATAGTATTATTTGACGTCGTCTTTGTATTAGGAGTAGGTAACAAW MAGGCAA GAGGAC SLDDRRRLVDCETTL CTGTGTATGGCTTCAAAAAGCATGCACAAACWCCTGTCAAAAAGTAWT GGCAGCCC ATTCTCC GSYREWTDKPMMG TCCCATCMTGTGAATAGCTCAACGACWKGAAGMCCAATGATATGAGA CCSMMTA GTGGGC KMTYAAVTKRAPPRP TATCATMGCTCCAACMACCCAGCASCCCGCACCTCAGACAATCAMTGG GCTTACAA GGATGG QTGGARLSTNLLADE AGATCAATCCMAAAGGACGTCCGTCGCCCAGATCCGTCAACTATCGAG AGTAAGTA GAGGAG MEIKYRDTNDIRLVID GAGAAAAGCAGGTATAACAGGTCCATAGGTATTCCAGATTCGCTCAAAG CMCATTTT TAGGGT LPNPHLIKCPLCKSCIS AWCGGAGCAGTGCAGTCCGCAGCAKGAGCAGCCMACCTCCGTCAGGT CATTTCTT AACGAC ARGRGANALKYMKR CCACAGGACGTCCGTCTCWCCAATTCGCCATCTCTCGATGATAGGAGAA GTGAATTC CCGTCAT HIADAHHLNADFVYK GGTTAGTWGATTGTGAAACAACACTAGGGTCATACCGCGAATGGACSG TTTAAACA TCTGGA CSRCQEHEPENVCG ATAAACCAATGATGGGAAAGATGACGTATGCGGCAGTGACAAAAAGAG TATTTTTCT TGCCTA AKWIVNHLKRVHGY CGCCCCCAAGACCGCAAACGGGAGGAGCCCGGTTGAGCACCAATCTCC TGTTTTTT AACCAC TLEDAVSTAKPSTRQ TAGCAGATGAAATGGAGATAAAGTATCGAGACACCAATGACATCCGCCT GATTTCTT CACAAT QIANAFNDSAPFIDA TGTCATAGACCTTCCCAATCCCCACCTCATCAAGTGTCCGCTCTGTAAAA TTTTCTCTA CTGTCA RKTSDVPEKKSREAG GCTGCATAAGTGCGCGGGGAAGAGGTGCTAATGCGCTGAAGTACATGA CCTTCCCC AGGCAA LEKFLAPTKSEDTREK AAAGGCACATAGCCGACGCCCACCACCTCAACGCCGACTTCGTCTACAA CAATTCTT AGTGCC TPPSTRKSSESSEASI ATGTAGCAGGTGTCAAGAGCATGAACCAGAAAATGTATGCGGCGCGAA CCCCTCAT CCAAAA QSTIQETLSESSDTLT GTGGATTGTGAATCATCTCAAAAGAGTACATGGCTATACTCTAGAAGAT CTTGTGTA GCACAC VQEIINISSEDEMDEE GCCGTATCCACAGCAAAACCCTCTACAAGGCAGCAGATTGCAAACGCCT TACATCCC GCGTGG PPKRRVNVWALIHE TCAACGACTCTGCTCCATTCATAGACGCCCGGAAAACATCCGATGTGCC CCTCCTCC ATCGGT NGKDAWIDSDLMVI AGAGAAGAAGAGCAGAGAGGCAGGACTTGAGAAGTTCCTGGCCCCTAC AACCAATC TTGGAT FLESRARGYESCSIIDP AAAGTCCGAGGACACAAGGGAAAAAACCCCGCCCTCCACCAGAAAGTC AATACATT GCCGAC LNFICTDMSYLTTIVR CTCTGAAAGTTCAGAGGCATCAATCCAATCGACCATCCAAGAGACTCTTT GACCTCTC TGAGCC RRMEEGYKKIIFPLCA CGGAGTCGTCAGACACATTGACCGTCCAAGAAATAATCAATATCAGCAG TCTTCTGT AGAGGG NDHWTLVTITGSTAT TGAAGATGAAATGGACGAGGAGCCACCGAAACGGCGTGTCAATGTCTG CAAAAAAT CAAAGT FYDPMGNEPTETVKK GGCCTTGATCCATGAGAATGGCAAGGACGCCTGGATAGACTCAGACTT CAATACTA CGAAGG MIDELDLEMQLAPS GATGGTCATATTCCTGGAATCAAGAGCAAGAGGATATGAATCATGCAGC GTATATTG CCGGTA NSPRQRDSWNCGVF ATCATAGACCCTCTGAACTTCATTTGCACTGACATGTCCTATCTGACCAC TCCCTTGT GGCTCC VMKMAEAYIKDTQ AATAGTCAGAAGGCGCATGGAAGAAGGCTACAAGAAAATCATATTTCC ATAGTATT CGGCGG WDLTDVDTDVKTFR ATTATGTGCAAATGACCACTGGACACTGGTCACGATAACAGGTAGCACG ATTTGACG GTTGTC RSLLTELKAKFNIFAE GCCACCTTTTACGATCCAATGGGAAATGAGCCAACTGAGACTGTCAAGA TCGTCTTT CGTCAT DIQTYRPPSRKALTRN AGATGATCGATGAGCTCGACCTTGAAATGCAATTAGCCCCGTCAAACTC GTATTAGG AGTCAG SQSPVVVCHKCSRPA TCCTAGACAGAGAGACTCGTGGAACTGCGGCGTTTTCGTCATGAAAATG AGTAGGTA TGGTGC TPIQDVSRMEVEEAP GCGGAAGCGTACATCAAGGATACGCAATGGGATCTCACGGACGTAGAC ACAAWCT GCCTAC VLVPTPEEPPQEWTF ACGGACGTCAAAACGTTCAGAAGGAGCCTCCTAACAGAGCTCAAAGCA GTGTATGG ACCCAA VGKNRKRGVTSRTP AAGTTCAACATCTTTGCCGAGGATATCCAGACCTACCGGCCACCCTCAA CTTCAAAA CTGCTAT NTSPEAKRPAFPPVP GGAAAGCCTTAACGAGGAACAGCCAATCGCCTGTCGTTGTTTGTCACAA AGCATGCA GACACA LKPSANRWHFPEEET GTGCTCTCGGCCAGCCACACCGATCCAGGATGTGAGCAGAATGGAAGT CAAACWC CAAGGA EKMEVSSADEVKNST GGAAGAAGCGCCAGTGCTGGTACCGACTCCTGAAGAGCCTCCACAGGA CTGTCAAA CAACCC PPKPPKIPNLLAMKIA ATGGACCTTCGTCGGAAAAAACAGAAAGCGTGGTGTGACAAGCCGAAC AAGTAWT AAAATA SPVPLKRGNPSKKHG CCCGAACACGTCGCCGGAAGCCAAGCGACCGGCTTTCCCACCAGTACCC TCCCATCM AATAAG KGHMMNTARKGPT CTCAAACCATCAGCCAACAGATGGCACTTTCCAGAAGAGGAAACTGAAA TGTGAATA CCAAGG KKEMPKGEPANLIVKI AGATGGAGGTCTCAAGTGCCGACGAGGTGAAGAACTCTACCCCTCCAA GCTCAACG CGGCGT RSWFDEQLKMYKDE AACCACCCAAGATACCTAATCTTCTCGCGATGAAAATCGCCAGTCCCGTA ACWKGAA TAGCTTC GSNLQRLTWLSDSLT CCTCTGAAGAGAGGAAACCCGTCAAAGAAGCACGGTAAAGGACACATG GMCCAAT GAGCTA AAIGKAFNGNKYIVD ATGAATACAGCGAGAAAGGGTCCGACAAAGAAAGAGATGCCCAAAGG GAT(SEQ ACAAGC QIIKRNPPPLVEKGA GGAACCAGCGAACCTTATAGTTAAAATCAGAAGCTGGTTTGATGAGCAA IDNO: TCCCCG MSTQTSRKRDEFKPR CTGAAGATGTACAAGGATGAAGGGTCCAATCTACAAAGACTCACATGGT 1173) AGAGGA ERMAQEPNEPLRIQY TATCGGACTCTCTGACCGCCGCCATCGGAAAAGCATTCAATGGCAACAA TGGTTG AKNRQKTFFKIIGKQS ATACATAGTGGACCAAATCATCAAGAGAAACCCACCACCACTTGTTGAA CCACAG EQCTINIETVEQHFRK AAGGGCGCAATGTCAACACAGACAAGCCGAAAGAGAGACGAGTTCAA GGCACC TLKAPVVSENAIKTVC GCCAAGGGAGAGAATGGCCCAAGAGCCCAATGAGCCGCTTCGTATTCA ATCCTG GSIKKVLMPKTIEDPI ATATGCCAAGAATAGGCAAAAAACGTTCTTCAAGATCATTGGGAAACAG GGGAAC SSVEVKSILTKVKDTS TCTGAACAGTGCACCATCAACATTGAGACTGTCGAACAGCACTTCCGAA GACCCG PGTDGVKYSNLRWF AAACACTCAAGGCTCCTGTAGTCTCAGAGAATGCAATTAAAACGGTCTG ATCTTTC DPEGERLAKLFEECRK CGGAAGCATCAAGAAAGTATTGATGCCAAAGACCATAGAAGACCCAAT GGATGC HREIPSHWKEAETILL CTCCTCCGTAGAAGTCAAATCCATCTTGACGAAAGTGAAAGACACGTCA CCAACC PKDCSDEEKKKPEN CCAGGAACAGATGGTGTCAAGTATAGCAATCTACGCTGGTTCGACCCAG ACCGCC WRPIALMATIYKLYS AAGGGGAACGCCTCGCCAAACTGTTCGAAGAATGTCGCAAGCATAGAG AATCTGT AVWSRRISGVQGVIS AAATACCCAGCCATTGGAAGGAGGCGGAGACGATCTTATTGCCAAAGG CAGGCA PCQRGFQSLDGCNES ACTGTTCAGATGAAGAGAAGAAAAAGCCGGAAAATTGGCGTCCCATCG ACGTGC IGILRMCIDTASVLNR CCCTCATGGCAACAATCTATAAGTTGTATTCAGCAGTGTGGAGCAGAAG CCCAAA NLSCSWLDLTNAFGS AATCTCCGGTGTTCAAGGGGTAATTAGCCCGTGCCAAAGAGGCTTTCAG AGCACA VPHELIRRSLESFGYP TCCCTCGACGGATGCAATGAGTCGATCGGAATATTGCGCATGTGTATTG CGTGCG QSVIQIVTDMYKGAT ACACCGCTTCCGTACTCAATAGGAATCTCTCTTGCTCATGGCTTGATCTC GAGCGG MKVKTADQKTQSIKI ACCAACGCCTTCGGGAGCGTTCCTCACGAGCTGATAAGGAGATCCCTAG TTGGAT EAGVKQGDPISPTLF AATCATTCGGATATCCACAATCAGTTATCCAAATCGTGACTGACATGTAC GCCGAC NICLEGIIRMHQMRE AAGGGAGCAACGATGAAAGTCAAAACGGCAGATCAAAAGACGCAAAG TGAGCC KGYDCVGHKVRCLAF CATCAAAATAGAAGCGGGGGTGAAACAAGGAGACCCCATTTCTCCAAC AGAGGG ADDLAILTNNKDEM CCTATTCAATATTTGCCTTGAAGGCATCATCAGGATGCATCAGATGAGA CAAAGT QEVIDKLDADCRSVS GAGAAAGGGTACGATTGTGTCGGGCATAAAGTTCGCTGCCTAGCGTTC CGTAGG LIFKPRKCASLTIVRGA GCCGACGACCTTGCGATTCTAACGAACAACAAAGATGAAATGCAGGAA CCGGTA VDKYAKIRINGDAIRT GTTATCGACAAGTTGGATGCAGACTGTAGAAGCGTGTCGTTGATCTTTA GGCTCC MADRDTYRYLGVKT AACCTAGGAAGTGTGCATCTTTGACTATCGTGAGAGGTGCAGTTGATAA CGGCGG GVGGRASETEALIQV GTATGCAAAGATCAGAATAAATGGAGACGCGATCAGAACAATGGCGGA GTTCTCC VKELQKVHETDLAPH TAGAGACACCTATAGATATCTGGGTGTAAAGACCGGAGTTGGTGGAAG GTCGTA QKLDILKTFLLPRLQH AGCATCGGAAACGGAAGCTTTAATTCAGGTGGTCAAGGAGCTCCAAAA GTCAGT LYRNATPKLSELREFE GGTCCACGAAACCGACCTGGCTCCACATCAAAAACTTGACATCCTGAAG GGTGTG NVVMKSVKRYHNIPI ACGTTCTTACTGCCAAGACTGCAGCATCTCTACAGAAATGCCACTCCTAA CCTACAC KGSPVEYVQIPVKKG ACTGTCAGAGTTGAGAGAGTTCGAGAACGTTGTTATGAAATCAGTGAA CTAACT GLGVLSPRLTCLITFLT ACGGTATCATAACATACCAATAAAGGGCTCGCCTGTGGAATATGTCCAA GCTATG STLCKLWSDDPFISSI ATCCCTGTCAAGAAGGGTGGACTAGGAGTTCTATCTCCTCGACTCACAT ACAAGC HKDALSRITVKAMGL GCCTGATCACTTTCCTTACCTCGACCCTCTGCAAGCTATGGTCCGATGAT GTATAG TTQSATIKETCEYLNT CCATTCATATCTTCTATCCACAAAGACGCACTAAGCAGAATCACAGTGAA GAGGCC RKAVTKGGYSLFCRM AGCGATGGGACTCACCACTCAAAGTGCCACAATAAAAGAGACATGTGA CGGAAA NESLRTLSVIQGAPLK GTACTTAAACACAAGGAAAGCTGTCACGAAAGGAGGATATAGTCTATTC AACAAG SMEFIPVNNEIGIAV TGCCGCATGAATGAATCTCTCCGCACGCTGTCTGTCATCCAAGGTGCTCC CCAAGG QATKDSEIKVFTKADS ACTGAAATCAATGGAATTCATCCCGGTGAACAATGAAATCGGTATAGCG CGGCGT LKLMSKLKDLVRSAM GTACAAGCCACGAAGGATTCCGAGATCAAAGTCTTCACAAAGGCTGACA TAGCTG LKRFLEEKSVKSRVTQ GCCTGAAGCTAATGAGTAAGCTAAAAGATCTGGTCAGATCTGCTATGCT AGAGCT VLQHHPQSNRFVRD CAAACGGTTCCTCGAAGAGAAGAGTGTTAAAAGCAGAGTCACCCAAGT AACAAG GRNCSIAAQRFVHPA ACTCCAACACCACCCACAATCCAATAGATTCGTCAGGGACGGCCGAAAC CTTCTCG RLNLLSCNANTYDVN TGCAGCATAGCAGCWCAGAGATTCGTGCACCCKGCCCGTCTGAACCTCC TGGATG HPKGCRRCQADFES TCTCCTGCAACGCCAACACATACGATGTTAACCATCCAAAAGGCTGCAG GGTGCC QQHILQNCHYSLAG AAGGTGCCAGGCTGACTTCGAGTCTCAGCAGCACATCCTGCAGAACTGT AGAGGG GITQRHDRVMNRIL CACTACAGTTTGGCAGGGGGAATAACCCAGAGACATGACAGAGTCATG CACCATC QEIGNGRKAHYKIMV AACAGGATTCTGCAGGAAATTGGAAACGGGAGAAAAGCTCACTACAAG CTGGTG DMETGATRERPDIIM ATAATGGTGGATATGGAAACCGGCGCCACAAGAGAAAGACCGGATATC GGTGGA EERDGPEVLLADVTV ATCATGGAGGAAAGGGACGGTCCAGAAGTGTTACTAGCCGACGTGACA TGGGGG PYENGVQAVERAWD GTGCCCTACGAGAATGGGGTTCAAGCGGTTGAGAGGGCGTGGGATAA GAGCTT KKIEKYKHFLDYYRKI GAAGATAGAAAAATACAAGCACTTCCTAGATTACTACCGCAAAATCGGA GGGAAC GKKATILPLVVGSLGT AAGAAGGCTACGATTCTTCCCCTAGTAGTCGGTAGCCTAGGAACCTACT GTCCCG YWPDTSHSLKMLGLS GGCCCGACACAAGCCACTCACTGAAGATGCTTGGCCTTTCCGACGGTCA ATCCTTC DGQIRNVIPEICQIAL AATAAGGAATGTTATACCTGAAATCTGCCAAATTGCACTGGAATCCTCCA GGATGC ESSKNIYWKHILGDSY AAAATATCTATTGGAAGCACATTCTCGGTGATAGCTACAAAACGGTGGA CCAGAC KTVEGLFCQRNNKEV GGGACTATTTTGTCAGAGGAATAACAAAGAAGTCCGATTCGAAGGAAA CACCGC RFEGKGEKHHVSQRF AGGTGAAAAACACCACGTGTCACAAAGATTCCAACCTCTGAAATGTGAA AATCTGT QPLKCEKVRTMKSTK AAGGTGCGTACAATGAAAAGCACAAAAGAAGAGGGTAGAAGTAGATC CAAGGC EEGRSRSNAKKGPN GAATGCCAAGAAAGGTCCGAACTGGCGAAGATCAAAAAGCGAATCGGA ACCGTG WRRSKSESDGRSVSK CGGAAGGAGTGTGAGTAAAGGCAGATACTGGCGAGATCCGTCGAACAA CTCCAA GRYWRDPSNKPPHS GCCGCCACACTCGAAGATGACCCAGTCGGCTTTAGCTAAGCGCTAAACC AAGCAC KMTQSALAKR(SEQ GGCTCCTCTGGGAGGAGGTATGTCAGAGGACATTCTCCGTGGGCGGAT ACGCGC IDNO:1418) GGGAGGAGTAGGGTAACGACCCGTCATTCTGGATGCCTAAACCACCAC GGGTTG AATCTGTCAAGGCAAAGTGCCCCAAAAGCACACGCGTGGATCGGTTTG GTTTGG GATGCCGACTGAGCCAGAGGGCAAAGTCGAAGGCCGGTAGGCTCCCG ATGCCG GCGGGTTGTCCGTCATAGTCAGTGGTGCGCCTACACCCAACTGCTATGA ACTGAG CACACAAGGACAACCCAAAATAAATAAGCCAAGGCGGCGTTAGCTTCG CCAGAG AGCTAACAAGCTCCCCGAGAGGATGGTTGCCACAGGGCACCATCCTGG GGCAAA GGAACGACCCGATCTTTCGGATGCCCAACCACCGCCAATCTGTCAGGCA GTCGTA ACGTGCCCCAAAAGCACACGTGCGGAGCGGTTGGATGCCGACTGAGCC GGCCGG AGAGGGCAAAGTCGTAGGCCGGTAGGCTCCCGGCGGGTTCTCCGTCGT TAGGCT AGTCAGTGGTGTGCCTACACCTAACTGCTATGACAAGCGTATAGGAGGC CCCGGC CCGGAAAAACAAGCCAAGGCGGCGTTAGCTGAGAGCTAACAAGCTTCT GGGCTC CGTGGATGGGTGCCAGAGGGCACCATCCTGGTGGGTGGATGGGGGGA TCCGTCA GCTTGGGAACGTCCCGATCCTTCGGATGCCCAGACCACCGCAATCTGTC TAGTCA AAGGCACCGTGCTCCAAAAGCACACGCGCGGGTTGGTTTGGATGCCGA GTGGTG CTGAGCCAGAGGGCAAAGTCGTAGGCCGGTAGGCTCCCGGCGGGCTCT TGCCTTC CCGTCATAGTCAGTGGTGTGCCTTCACCCAACTGCTATGACATGCGTACA ACCCAA GGAGGCCCGGAAAAATAAGCCAAGGCGGCGTTAGCATAGGGCTAACA CTGCTAT AGCTTCTCGTGGATGGGTGCCAGAGGGCACCATCCTGGTGGGTGGATG GACATG GGGGGAGCTTGGGAACGTCCCGATCGTTCGGATGCCCAACCACCGCAA CGTACA TCTGCCAGGCAACGTGCTTCGGAWGGTCATTGGTTCTAGACTTGTAATA GGAGGC GACCATTGGCCGGAAGAGCACACGCGCGGTTGGTTGGATGCCGACCGA CCGGAA GCCTAGAGGGTGCAAACCTGAAGGGCGAGGTCGAAGGCCGTGAGGCT AAATAA CCCGGCGGGAAACTCCGTCATAGTTAGTGGTGTGCCTACACCCGACGAC GCCAAG TATGACACATAGGAGGAATCCTGATCTGATATGATCATGTATATAGGGA GCGGCG GGGCGAAGGTAAATAGTCAGKGTCAAAGTCCACGTGGCAGCTACTCCC TTAGCAT CAGCATAGTAGTGATGCGAGTGGAWCCAACTTTGACACTGATGTTCCCT AGGGCT GAGCCTGACCCATCTGCACAAATCCAACAGTGTATGATGGCCCACACAC AACAAG TGAGGACGAGTATCACTTGTGATACTCAGAGGTGTCCCCCATGATCAAC CTTCTCG CAATATCACAGCTAGCGGACCTACCGTGAGGTAGACCCCCGCCGCTGTA TGGATG GCAGGCTCGCCTC(SEQIDNO:1050) GGTGCC AGAGGG CACCATC CTGGTG GGTGGA TGGGGG GAGCTT GGGAAC GTCCCG ATCGTTC GGATGC CCAACC ACCGCA ATCTGCC AGGCAA CGTGCT TCGGAW GGTCAT TGGTTCT AGACTT GTAATA GACCAT TGGCCG GAAGAG CACACG CGCGGT TGGTTG GATGCC GACCGA GCCTAG AGGGTG CAAACC TGAAGG GCGAGG TCGAAG GCCGTG AGGCTC CCGGCG GGAAAC TCCGTCA TAGTTA GTGGTG TGCCTAC ACCCGA CGACTA TGACAC ATAGGA GGAATC CTGATCT GATATG ATCATGT ATATAG GGAGGG CGAAGG TAAATA GTCAGK GTCAAA GTCCAC GTGGCA GCTACTC CCCAGC ATAGTA GTGATG CGAGTG GAWCCA ACTTTGA CACTGA TGTTCCC TGAGCC TGACCC ATCTGC ACAAAT CCAACA GTGTAT GATGGC CCACAC ACTGAG GACGAG TATCACT TGTGAT ACTCAG AGGTGT CCCCCAT GATCAA CCAATAT CACAGC TAGCGG ACCTACC GTGAGG TAGACC CCCGCC GCTGTA GCAGGC TCGCCTC (SEQID NO: 1296) NeSL NeSL- Caenor- CCAACTCTCATCGTATTAACCTACGGTATTCACTCCTAGTGAGTGTAATA CCAACTCT TGAATA MTNVYLKPVNDNQT 2_CRem habditis AAGGTTAATTACGTTTTCTCTTGCMAGAGAAAAAGAAAATTCGAATCCT CATCGTAT CCGTCA NKTGDNSRNTMSNS remanei TTTTGTGTAACTCACAAACTGACAGAGACCTATCGAATTTCCTTTGTTTC TAACCTAC GATAAG QCEMTWKPVARTYA GTATATAGGAATAGTCACTCTGGACCACGAAGTGGACAGTTGTCGGCG GGTATTCA CCCCCA QAASTNPADDKTVT GACTTCCAGAGTGGAGAGAAAAGGTGTGAAGAGAGGAGGTCTAGAAA CTCCTAGT ACATAA VLGCKYNLLKLGNTP CACTTCGGCTGTCTAGGACCAGTTCCTGAGTGGAAAGAGGAAGGTCTA GAGTGTAA AAATAA QTSKRSPPKPSRGGA GAAACACTTCGGCTGTCTAGGACCAGTTCGTGAGATCTCTCGTGGAGAG TAAAGGTT AAGTCG RISSVYTLTDELEITHR TTGAAAACAGTCAGCTGAGGCTACTGTATTTCTTGATAGCCCCGCCCCCA AATTACGT GCGTTA EEGKITFAIDLPNKNN ATCCCCCTCCCCCCCCCCTCGACAGATTTTTCTGTTTGACCTCCTGGAATT TTTCTCTTG GCTAAC ILCPLCRECTQTRGRG TGCGAGGAGTGCGCGAGAATTTTCGAATTCTTCGCGCGTTTTCTCGAAA CMAGAGA CACTAA SSFTKHMKLHVKEKH TTTTCCAGAAGATTCGAGCGGAGAATCTTCGAGAAAGTGAGCTGAATTT AAAAGAA ACCGGC QLDATFIYKCSMCNE CGCGCGAATTTTCCGCGATTTTCAAATTATCGATTTTTGTCGGAAAATTT AATTCGAA TCCTCAT YEPEKKCGTKWIQTH ATTTTCTGGCAAAATTTGATTGAGTTCACGCGGGAGAGAAGGAATTGTT TCCTTTTTG TGGGGG LQKVHNYKYDESAIV GGAAAAGGGTATTGATTTTTGTGGCGGAGGAAACTCCCACTGAATCAAT TGTAACTC AGAGTA VPVPPNTRQQIANEL AACTCTCAAAGGAGAACTCATCGAACAACCTCGGGTGACCTGAATCTTG ACAAACTG TCATTCC NNAAPFVDIRKPKAA GGCGAAATTTTCGCATTGACACAAGATAAMACAAATTACTGTKGAAAAT ACAGAGAC GGTGCT AVEEKKTENGALLKFL AAATCAGAACAAACTGTCAAAAAGAGAGACAAAAAGTATTGATTAACA CTATCGAA CTCCGTT TKSNKDDQVKSPSXD ACATCATGACAAATGTATATCTTAAGCCTGTGAATGATAACCAGACTAAC TTTCCTTTG TGGGCG IPDAESPEKETQALTI AAAACCGGTGATAATTCTAGAAATACTATGTCAAATAGTCAATGTGAAA TTTCGTAT GTAGGG DPKGNNSPSKSSIRSS TGACGTGGAAACCTGTAGCCAGAACATATGCTCAGGCAGCCAGTACTAA ATAGGAAT AGGAGT QSSASSVCQEIQEIITL CCCGGCCGACGACAAAACGGTGACTGTCCTTGGGTGCAAATACAATCTG AGTCACTC TGGGTA SEDEDPKGARPKPGI CTAAAACTGGGAAATACTCCTCAGACGTCGAAAAGGTCGCCTCCAAAAC TGGACCAC GCGACC NVWSLINETGKDAYI CATCGAGAGGAGGAGCTCGAATCAGCAGTGTGTATACTCTGACTGATG GAAGTGG CGGAAG DTDIMMAFLKMRVE AGCTGGAGATTACGCACAGAGAAGAAGGTAAGATCACATTCGCGATAG ACAGTTGT TATGGA NCDSVNIIDPLNYQF ACCTTCCAAACAAGAATAACATCTTGTGCCCGCTGTGTCGGGAGTGCAC CGGCGGA TGCCCA PARVDLVPLIQRNLE CCAAACCCGTGGGAGAGGGTCCAGTTTTACCAAGCATATGAAACTCCAC CTTCCAGA ACCACC DGKKRVVFPICADEH GTGAAAGAGAAGCACCAACTTGATGCCACGTTCATCTACAAGTGTAGTA GTGGAGA GCAATC WTLLTISNGIAAFYDP TGTGCAACGAGTACGAACCGGAAAAAAAATGCGGTACGAAGTGGATCC GAAAAGG TGATCT TGSRMSSYIEELVNEL AGACCCACCTTCAAAAAGTGCACAACTACAAGTATGACGAGTCTGCAAT TGTGAAGA GGCATT GLIIPKEQDEQPRQR AGTTGTCCCAGTACCACCCAACACAAGACAGCAAATAGCTAATGAGTTG GAGGAGG GTGTTTC DSYNCGVFVMKMAE AACAATGCTGCCCCATTCGTTGACATCAGAAAACCGAAAGCTGCTGCTG TCTAGAAA GGATGG AFIQDTEWEMEEVE TTGAGGAGAAGAAGACTGAAAATGGTGCTCTGTTAAAATTCCTGACCAA CACTTCGG TCTCTGT EDVKNFRRNLLEELK GTCCAATAAGGACGATCAGGTAAAATCCCCATCGGAWGATATTCCAGA CTGTCTAG CTCTAG PNYEIFAEKIKYYNSP TGCGGAAAGCCCTGAAAAAGAAACTCAGGCGCTCACTATCGATCCGAA GACCAGTT ATCTGA GKSFAQSRPTSRSSQ AGGGAACAACTCACCATCAAAAAGCTCAATAAGATCGAGCCAGTCCTCA CCTGAGTG AATAGA CAVCPTCSRSATPM GCTTCCTCCGTTTGTCAAGAAATCCAGGAAATCATCACGTTGAGTGAGG GAAAGAG GCTCTG MDVGNMEVDPVPQ ATGAAGACCCAAAAGGGGCTCGTCCAAAACCAGGAATCAACGTGTGGA GAAGGTCT GCCTGA QQETPKSREPEQDEG GCTTGATAAATGAAACGGGAAAGGATGCATACATTGATACAGATATCAT AGAAACAC AGAACA WKVVGKARKRGVVT GATGGCGTTCTTGAAGATGAGAGTGGAAAACTGTGACTCCGTGAACAT TTCGGCTG CACGCG ERSPNISPEAKRQFTG AATTGATCCACTCAATTACCAGTTTCCCGCGAGAGTGGACCTAGTCCCAC TCTAGGAC CGGACC PEIKVVSPGKFHPLVG TTATCCAGAGGAATCTGGAAGACGGAAAGAAAAGAGTCGTGTTTCCGA CAGTTCGT GGTTGG ETEEMEVTCDSPPTK TCTGTGCAGACGAACACTGGACGCTCTTGACCATCTCGAATGGAATTGC GAGATCTC ATGCCG EPTTEPKVTPSLPAM TGCATTCTATGATCCGACTGGATCGCGAATGAGTAGTTATATTGAAGAG TCGTGGAG ACTCGA KIASPEVTKKQTSKKK TTGGTGAACGAACTTGGACTGATTATCCCAAAGGAACAGGATGAACAG AGTTGAAA TCTGGA GKYGKKKQXTKKAQ CCAAGACAAAGAGACAGCTACAACTGTGGGGTATTTGTGATGAAAATG ACAGTCAG GGGTGC PPKGEPTKKAQPKGE GCGGAAGCCTTCATCCAAGATACCGAATGGGAAATGGAGGAAGTAGAG CTGAGGCT AAACCT PAKLIEQVRTWFDKQ GAAGACGTGAAAAACTTCCGAAGAAATCTCCTTGAAGAACTGAAACCCA ACTGTATT GAAAGG MKSYQEQGSNIQTLT ACTACGAGATATTTGCTGAAAAAATCAAATATTATAACTCTCCGGGAAA TCTTGATA GAAAGT WIADSLTAAIFKANS AAGTTTCGCCCAAAGTCGACCCACAAGTCGAAGCAGCCAGTGTGCCGTC GCCCCGCC TGAAGG GNKYLVDKITARCPP TGTCCGACGTGCTCTCGTTCAGCTACACCGATGATGGATGTAGGAAACA CCCAATCC CCGTGA PLLNEGEMATQTSRR TGGAAGTGGATCCCGTTCCACAGCAACAAGAGACACCGAAGAGTCGCG CCCTCCCC GGCTCC TEAVKPKDRFVKESN AGCCAGAACAAGATGAAGGCTGGAAAGTGGTGGGAAAGGCTAGAAAG CCCCCCTC TGGCGG EPLRIQYAKNRAKTF AGGGGAGTGGTAACTGAACGATCGCCAAACATCTCTCCAGAAGCCAAG GACAGATT GAAACT NVIIGKHSARCEIDIN AGACAATTCACTGGTCCAGAGATCAAAGTCGTCTCACCTGGGAAGTTTC TTTCTGTTT CCGTCAT VVENHFRQTLKAQP ACCCACTTGTGGGCGAAACTGAGGAGATGGAGGTGACGTGTGACAGCC GACCTCCT AGTCAG VTEEALNTVCSGIKKA CACCAACGAAAGAGCCCACTACGGAACCGAAAGTGACTCCAAGCCTGC GGAATTTG TGGTGT KVDPSIEGPISSGEVK CAGCAATGAAAATTGCTAGCCCAGAAGTGACGAAAAAGCAAACGTCAA CGAGGAG GCCAAC AILAKIKDTSPGTDGV AGAAGAAGGGAAAGTATGGCAAAAAGAAACAGSAGACAAAGAAAGCT TGCGCGAG ACCCGA KYSDLKWFDPEGERL CAGCCGCCGAAAGGGGAGCCAACAAAGAAAGCTCAGCCAAAAGGAGA AATTTTCG CGACTA ALLFDECRQHGKIPS ACCGGCAAAGCTCATTGAGCAAGTGAGAACTTGGTTTGATAAACAGATG AATTCTTC TGACAT HWKEAETVLLPKDCT AAATCGTACCAAGAGCAAGGTTCTAACATCCAGACACTGACCTGGATTG GCGCGTTT AGTTGG EEERKKPENWRPISL CCGACTCACTCACTGCCGCCATCTTCAAGGCAAACAGTGGAAACAAGTA TCTCGAAA AGGAAT MATVYKLYSSVWNR TCTGGTAGATAAGATAACTGCAAGATGCCCACCACCATTGCTGAATGAA TTTTCCAG CCTGATC RISSVKGVISDCQRGF GGTGAGATGGCGACGCAGACGAGCAGAAGGACAGAAGCGGTGAAACC AAGATTCG TGATAA QAIDGCNESIGILRM AAAAGATCGATTTGTAAAAGAATCTAACGAGCCGCTCAGAATCCAGTAT AGCGGAG TAATCAT CIDTATVLNRNLSCS GCAAAGAACCGAGCAAAGACCTTCAATGTGATAATTGGGAAACACTCC AATCTTCG TGTTCAT WLDLTNAFGSVPHEL GCACGATGTGAAATTGATATTAACGTCGTGGAAAACCACTTCAGGCAAA AGAAAGT ATAAGG IRRSLAAFGYPESVINI CCCTGAAAGCACAACCAGTAACAGAAGAAGCATTGAATACTGTGTGCA GAGCTGAA GAGGGG ISDMYNGSSMRVKT GTGGAATCAAAAAGGCGAAAGTTGATCCAAGCATTGAAGGTCCGATCT TTTCGCGC GATGGT AEQKTQNIMIEAGVK CGTCAGGAGAAGTGAAAGCGATTCTTGCAAAGATCAAAGATACCTCTCC GAATTTTC AAATAC QGDPISPTLFNICLEGI CGGAACTGATGGAGTGAAGTACAGTGATCTGAAATGGTTCGATCCGGA CGCGATTT CCAGGG IRRHQTRKTGYNCVG AGGTGAACGTTTGGCGTTGTTGTTCGATGAATGTCGACAGCACGGGAA TCAAATTA TCCGAA NDVRCLAFADDLAIL GATTCCGAGCCACTGGAAAGAAGCAGAAACTGTTCTGCTACCAAAAGAT TCGATTTT ACCATC TNNQDEMQDVLNQ TGCACTGAAGAGGAAAGAAAGAAGCCAGAGAATTGGAGACCCATCTCT TGTCGGAA AAAGCA LDKDCRSVALIFKPKK CTAATGGCTACTGTATACAAACTCTACTCCTCAGTCTGGAACAGGAGAA AATTTATTT GCTACT CASLTIKKGSVDQYA TCTCCTCAGTKAAAGGAGTCATCAGTGATTGCCAAAGAGGCTTCCAGGC TCTGGCAA GACCAG RIKIHGMPIRTMSDG GATCGATGGATGCAATGAGTCAATCGGAATTCTGCGGATGTGCATAGA AATTTGAT CATAGT DTYKYLGVQTGNGG CACAGCCACAGTTCTCAACCGAAACCTGTCGTGTTCATGGTTAGACTTGA TGAGTTCA AGTGAT RASESESLTQIAAELQ CGAACGCTTTTGGAAGCGTGCCCCACGAGTTGATCAGAAGATCACTAGC CGCGGGA GAACAC MVHDTDLAPNQKLD CGCATTCGGGTATCCTGAATCAGTCATCAATATAATCAGTGACATGTATA GAGAAGG ATAGAC VLKAFILPRLQHMYR ATGGATCGTCAATGAGAGTCAAGACAGCGGAGCAGAAAACTCAGAACA AATTGTTG CCTGGG NATPKLTELKEFENTV TCATGATTGAAGCTGGAGTTAAGCAAGGTGATCCCATCTCGCCAACTCT GAAAAGG GTTCCCT MKSVKMYHNIPIKGS ATTCAACATCTGTCTTGAAGGCATAATCCGAAGGCATCAGACGAGGAAG GTATTGAT GAACTC PLEYVQIPVKNGGLG ACAGGTTACAACTGCGTTGGAAACGACGTACGTTGCCTGGCATTTGCTG TTTTGTGG GACCCA VMSPRFTCLITFLAST ACGATCTTGCTATCCTTACCAACAACCAGGATGAGATGCAAGATGTGCT CGGAGGA TCTGCAC LFKLWSDDEYISSIHK CAATCAGCTGGACAAGGACTGTCGTAGTGTTGCCCTGATATTTAAGCCA AACTCCCA AAACCC KALSRITAKVMGLKT AAGAAGTGTGCTTCACTGACGATCAAAAAAGGAAGTGTTGATCAGTATG CTGAATCA ACTTTGT QKATLQEQCEYLNTK CAAGAATCAAGATTCATGGAATGCCCATTCGGACTATGTCGGATGGGG ATAACTCT ACAAAT KAITKGGYSLFSRMN ATACCTACAAGTATCTCGGAGTCCAAACCGGAAACGGTGGTAGAGCCTC CAAAGGA GAACCA EAIRTLSVNLGAPLKS GGAATCAGAATCCCTGACTCAGATTGCCGCGGAACTCCAAATGGTCCAT GAACTCAT AACTGA MQFIPENGEIALEVQ GACACAGACCTGGCGCCGAACCAGAAACTTGATGTGCTGAAGGCATTC CGAACAAC TGAAGA ASENSQIKVFSKADS ATCCTGCCGAGACTGCAACATATGTACAGAAACGCCACTCCAAAGCTGA CTCGGGTG GTTTAAT MKLVTKLKDLVKSA CGGAGTTAAAGGAGTTTGAGAACACAGTCATGAAAAGTGTGAAGATGT ACCTGAAT GATTTCT MLKNFLENKKVKSKV ATCACAACATCCCGATTAAAGGATCACCACTCGAATATGTCCAAATTCCA CTTGGGCG TACATCA VQVLQHHPQSNKFV GTAAAGAATGGAGGACTCGGAGTTATGTCTCCCCGATTCACGTGTCTCA AAATTTTC CAGCTA NDGKNXSISSQKFVH TAACGTTCCTGGCGTCCACACTGTTCAAACTGTGGTCAGACGACGAATA GCATTGAC GCGGAC PARLSQLVCNGNSYS CATCTCGTCCATCCACAAAAAGGCGTTGAGTAGAATCACGGCAAAGGTG ACAAGATA CTACCGT KDLPKNCRWCGYEC ATGGGACTGAAGACCCAAAAAGCCACGCTCCAAGAGCAGTGCGAGTAC AMACAAA GAGGTA ESQAHILQHCTYSLSS CTGAACACCAAGAAAGCAATCACGAAAGGAGGTTACAGCCTCTTCTCGC TTACTGTK GACTCC GITQRHDRVLNRILXE GAATGAACGAAGCTATTCGAACGCTCAGTGTCAACCTTGGAGCACCGCT GAAAATAA CGCCGC VIKGRKNNDYYDIMV CAAATCAATGCAATTCATTCCGGAAAATGGCGAAATTGCTTTAGAAGTG ATCAGAAC TGTAGC DTEPGPTRERPDIIMI CAAGCATCAGAAAACTCACAGATCAAAGTATTCTCGAAAGCTGACAGTA AAACTGTC AGGCTC QKDGPEVLLADVTVP TGAAACTGGTGACAAAGCTGAAAGATCTGGTGAAATCGGCGATGCTCA AAAAAGA GCCATT YENGVVAIEAAWDW AGAACTTCTTGGAAAACAAGAAGGTCAAAAGCAAGGTTGTGCAGGTGC GAGACAA G(SEQ KMEKYSHFIDYFARL TTCAACACCACCCACAATCAAACAAATTCGTCAATGATGGAAAGAACWK AAAGTATT IDNO: GKRAVILPLVVGSLGT CAGCATWTCCTCCCAAAAGTTCGTACACCCAGCACGGCTGAGCCAGCTG GATTAACA 1297) YWPDTSNSLRMLGL GTCTGCAACGGGAACAGCTACAGTAAAGACCTTCCGAAAAACTGCAGAT ACATC SDGQIRNLIPDISMIA GGTGCGGCTACGAATGCGAGTCTCAGGCTCACATCCTCCAGCATTGCAC (SEQID LESSKQIYWRHIFGDS ATACAGCCTTTCATCTGGAATCACCCAGAGGCATGACCGTGTCCTGAAC NO:1174) YRIVSDLYCRKDQQEI AGGATCTTGCASGAGGTGATAAAAGGCAGAAAAAACAACGACTACTAT RFGDEPMENVQVSD GACATAATGGTGGATACGGAGCCCGGACCAACCAGAGAGCGTCCAGAT RFQPFKTREREKKSEE ATCATCATGATACAGAAAGATGGTCCGGAAGTCCTACTGGCGGATGTTA EKKRRSKSKKGKTWR CGGTACCATACGAGAATGGAGTTGTTGCGATCGAAGCCGCGTGGGATT GSKKQTDSRQSGKSN GGAAGATGGAGAAGTACAGTCACTTTATTGATTACTTCGCAAGACTGGG QNQGFQRSVGQGVS AAAGAGAGCAGTAATCCTTCCACTAGTGGTTGGAAGTCTTGGGACCTAC R(SEQIDNO:1419) TGGCCAGACACCTCAAACTCTCTGAGAATGCTGGGTCTCTCTGATGGGC AAATCAGAAACCTGATCCCAGACATCTCCATGATTGCTCTAGAGTCTTCC AAACAAATCTACTGGAGGCATATCTTCGGAGATAGCTACAGAATTGTGA GTGATCTATACTGCAGAAAAGACCAGCAGGAGATCAGATTCGGAGATG AACCCATGGAAAATGTTCAAGTCTCAGATCGATTCCAGCCTTTTAAAACA AGAGAGCGTGAGAAGAAATCCGAGGAAGAGAAAAAGAGAAGATCAAA GTCCAAAAAAGGCAAAACTTGGCGAGGATCCAAAAAACAAACTGATTC CCGGCAATCCGGCAAAAGCAATCAGAATCAGGGCTTCCAAAGAAGCGT TGGACAAGGCGTATCACGGTGAATACCGTCAGATAAGCCCCCAACATAA AAATAAAAGTCGGCGTTAGCTAACCACTAAACCGGCTCCTCATTGGGGG AGAGTATCATTCCGGTGCTCTCCGTTTGGGCGGTAGGGAGGAGTTGGG TAGCGACCCGGAAGTATGGATGCCCAACCACCGCAATCTGATCTGGCAT TGTGTTTCGGATGGTCTCTGTCTCTAGATCTGAAATAGAGCTCTGGCCTG AAGAACACACGCGCGGACCGGTTGGATGCCGACTCGATCTGGAGGGTG CAAACCTGAAAGGGAAAGTTGAAGGCCGTGAGGCTCCTGGCGGGAAAC TCCGTCATAGTCAGTGGTGTGCCAACACCCGACGACTATGACATAGTTG GAGGAATCCTGATCTGATAATAATCATTGTTCATATAAGGGAGGGGGAT GGTAAATACCCAGGGTCCGAAACCATCAAAGCAGCTACTGACCAGCATA GTAGTGATGAACACATAGACCCTGGGGTTCCCTGAACTCGACCCATCTG CACAAACCCACTTTGTACAAATGAACCAAACTGATGAAGAGTTTAATGA TTTCTTACATCACAGCTAGCGGACCTACCGTGAGGTAGACTCCCGCCGCT GTAGCAGGCTCGCCATTG(SEQIDNO:1051) NeSL NeSL- chrUn Caenor- CCCTTTTCTATCGTATTAACTACGATAACCGCTCATTTGAGTGTAAAAAA CCCTTTTCT TAACAT MTKTEWSWRHRSRS 4_CRem habditis GGTTCCCCCCTCCTCGCCTGCCTTACCCACGCATCTCTGCCTCTGGGAAG ATCGTATT GCCTTG RSVGIVVKIDTSDYAN remanei GCGGAGGGTCAACTTGCGGGTCTGTGGATTTCCTTTCCTATCCACCGCCC AACTACGA GAAGGC VRVHVAADLSNEDG ATATTCTCTGTCGAAAGCCTACCTAGATCAGCCGGGAGTTTTTCCTATCC TAACCGCT ACCACG HTSHNNGIILPIPMKP CATTCAGGCGATCGCTCAAGGCTGTTTTATCGACACTCCTTCTTGACAAG CATTTGAG CCAAAA SVDRFCQIQYPPRGY TATTTATTTCTTGACAAATTCTATTTTTCCTTTTATCGATTTTCTCTTATTTA TGTAAAAA GTCCTG YVPHPQSQKGHDAK TCGATTCTTGTGAAAATATGACCAAGACCGAATGGTCCTGGCGTCATCG AGGTTCCC GCAACT PSRHWNEEAQPPYY ATCTCGCTCCCGCTCTGTTGGAATCGTTGTGAAAATCGATACAAGCGACT CCCTCCTC GATTTG HNNNHGRRGRSAKP ATGCTAACGTCCGAGTGCATGTCGCGGCGGACCTTTCCAATGAGGATGG GCCTGCCT AATAAT SGRRPPRKPILQEESL CCACACGAGCCACAACAACGGCATCATTCTCCCCATCCCAATGAAGCCC TACCCACG GTATAA AAHPQIPGDTASAVP AGCGTCGATCGATTCTGTCAAATTCAATACCCTCCAAGAGGGTACTATGT CATCTCTG AAGTAA LYSDVVNNENKSQG TCCGCATCCTCAAAGTCAGAAAGGCCATGATGCAAAGCCCTCGCGTCAT CCTCTGGG CTGGAA KPPQGSHRRSGRPGT TGGAATGAAGAGGCACAACCTCCCTACTACCACAACAACAATCATGGGA AAGGCGG CCAAAT KPSVPVGEAEQETNS GAAGGGGGCGTTCGGCAAAACCAAGTGGACGCCGACCCCCACGAAAGC AGGGTCAA GCCCGA RPIAPEPIVKFKHDKH CCATACTTCAGGAAGAGTCCCTGGCAGCGCACCCCCAAATACCCGGGGA CTTGCGGG TAGGTA GWTTVQGSHSSGRP TACTGCGTCAGCGGTCCCACTGTACTCCGACGTCGTCAACAATGAAAAC TCTGTGGA GGGCGG VPKPSVPVVSEANRF AAGAGTCAGGGGAAACCACCGCAAGGGTCCCACAGAAGAAGTGGAAG TTTCCTTTC GAGAAA QLLQEGDFPPLTTSES ACCAGGAACAAAGCCCTCTGTTCCGGTTGGTGAGGCAGAGCAAGAAAC CTATCCAC ATGACC SQEEIKVPNYQRIVSP GAATTCCCGTCCAATTGCTCCAGAACCCATCGTGAAATTCAAACACGATA CGCCCATA TAGAAA IPLPSEEDSKLPTKSNY AACACGGGTGGACTACTGTCCAAGGGTCCCACAGTAGTGGAAGGCCGG TTCTCTGT ACACAA RAPKGRKSRNYKKPQ TACCAAAGCCCTCGGTACCGGTGGTTTCAGAGGCAAATCGGTTCCAGTT CGAAAGCC AGTCCC QQNPKKYQQRLPYQ ACTCCAGGAAGGGGATTTTCCACCCCTTACAACATCCGAATCTTCGCAA TACCTAGA AAGCCC PKVNNAPTDRMAPE GAAGAGATTAAAGTACCGAACTACCAACGAATAGTGTCACCGATTCCTC TCAGCCGG CCGGAT QLKGGGGKTAHNDI TCCCCTCTGAAGAGGATAGTAAGTTGCCGACTAAATCAAATTACAGAGC GAGTTTTT TCGAAA EEMEIEEDTDEKIIQV GCCCAAGGGACGAAAGAGTCGCAACTACAAGAAGCCACAACAACAAAA CCTATCCC GACCTA KRIKIVNKLTPHHFVC TCCGAAGAAATATCAGCAGAGGTTACCCTATCAACCCAAGGTCAACAAT ATTCAGGC TAGGAA MMTYPTDNIYRCFV GCTCCGACGGATCGCATGGCCCCAGAACAACTCAAAGGAGGAGGAGGA GATCGCTC GTCAGT KGCTATSQGGWGAE AAAACCGCCCACAATGACATTGAAGAGATGGAAATTGAGGAAGACACT AAGGCTGT GAATAG DLKYLTVHIRQEHKIK GACGAGAAGATTATCCAAGTGAAACGAATCAAAATCGTCAATAAGCTAA TTTATCGA AGAGAA VEWTYECGICGDLSG CTCCGCATCACTTTGTTTGCATGATGACGTATCCAACCGACAACATCTAT CACTCCTT ATATCA GAGKHISKWIKPHM AGATGCTTCGTCAAAGGTTGCACAGCAACATCACAAGGTGGTTGGGGA CTTGACAA AACAAA RKKHNRDAPTNFKM GCAGAGGACCTTAAGTACCTGACTGTCCATATCAGACAAGAACACAAAA GTATTTAT TCTCACC GSRSSGKPKITELLEES TTAAGGTCGAATGGACGTACGAATGCGGGATATGCGGTGACCTATCGG TTCTTGAC CATTCAC APSCSNPRRKTLNQK GAGGTGCTGGCAAACATATCAGTAAATGGATCAAACCCCATATGAGGA AAATTCTA AAGGAC KTAIITQVTPEKLKTG AGAAACACAATAGAGATGCCCCAACCAATTTCAAGATGGGTTCAAGAA TTTTTCCTT TTACTG YQTRSVTKALSVLKES GTTCAGGTAAACCCAAGATTACTGAACTACTGGAGGAGAGCGCCCCGTC TTATCGAT GTCGAG RQKELEVLREEEKAN TTGCTCGAATCCAAGAAGGAAAACCCTCAACCAGAAGAAGACTGCTATA TTTCTCTTA TAGAAA AKQKSKLHPFFTKAP ATCACGCAAGTCACTCCGGAGAAATTGAAAACGGGCTATCAAACGAGA TTTATCGA ACAAGC HIDGVKPTVRRELSK AGTGTCACGAAGGCTCTCAGCGTCCTGAAAGAGTCACGACAAAAAGAG TTCTTGTG CAAAAC MITPGGEHKGTKIPM CTGGAAGTGTTGAGAGAAGAAGAAAAGGCTAACGCTAAACAAAAGTCT AAAAT ATCAAG VHTKRGLIQKINRKAK AAACTTCATCCTTTCTTCACCAAAGCCCCTCATATAGATGGTGTGAAACC (SEQID CACGAC KAKPMHLDESTIIEAS AACAGTACGGAGAGAACTATCAAAAATGATTACTCCCGGAGGAGAACA NO:1175) GCAAAA QLDVITIDDDDEDDN TAAGGGAACAAAGATACCAATGGTCCACACCAAGCGCGGTCTCATCCAA AGGGGT MTPMRRRFNTWCL AAGATAAACAGAAAAGCTAAAAAGGCTAAACCAATGCATCTTGACGAA AACTTTG DHETTQEAWLTDDVI AGTACCATCATAGAAGCGTCACAGCTCGACGTCATCACTATTGACGACG GGCAAC NWYLKDLCFGNEQY ACGACGAAGACGACAACATGACACCAATGCGAAGAAGATTCAACACTT TAATTAA MLVDPLVWLIYKMG GGTGTCTTGACCACGAGACGACTCAAGAAGCATGGTTAACTGACGACG CGGATA GMAGVEQRFKSKKT TAATCAATTGGTACTTGAAAGACCTATGCTTTGGTAACGAACAATACAT CCTCCGT CLFPICEADHWILLVF GCTCGTAGACCCACTAGTATGGCTGATATACAAGATGGGAGGAATGGC GTATCA DETNLCYANSLGSQP AGGCGTCGAACAAAGGTTCAAAAGCAAGAAGACGTGCCTATTCCCAATC GGCAAA NGQVKNFIQQLNRKL TGCGAAGCTGACCACTGGATTCTTCTTGTATTCGATGAGACCAACTTGTG GCCGCC CSFEKEVPLQKDSVN CTACGCGAATAGTCTTGGATCCCAACCAAACGGACAAGTTAAGAACTTC ACCAAC CGVHVCLIAKSIVNG ATTCAACAACTCAACCGAAAGCTCTGCAGCTTTGAGAAAGAAGTTCCAC AGCAAA QFWYDDSDVRTFRT TTCAGAAAGATAGTGTAAACTGCGGAGTACATGTCTGCCTGATAGCAAA TTACTGC NAKAALKAQGYELFS GTCAATAGTCAATGGACAATTTTGGTACGATGATTCAGACGTTCGAACG CCGATA EAPKQIENPDSSHRE TTTAGAACCAACGCCAAGGCGGCTCTGAAAGCCCAGGGCTACGAGCTCT GGTAGG DIKENSMEMCSESL TCTCGGAAGCACCAAAACAAATCGAAAACCCAGACTCCAGCCACAGAG GCGTGA MIVATPQRSEAPMEL AAGACATCAAGGAGAACAGTATGGAAATGTGTTCGGAATCTTTGATGAT GAAAAT VDTEPSDLESPKSDR CGTTGCGACTCCACAGAGGAGTGAAGCACCTATGGAACTAGTCGACACT GACCTA VVYEDCITALSDVSEP GAGCCTAGTGATCTGGAATCGCCAAAGTCAGACAGAGTAGTCTACGAA CAACCTC RMTPEKSETPEVPVV GACTGCATCACAGCTCTATCTGATGTTTCGGAGCCAAGAATGACTCCAG CAAGAC EERDLDWPKLESPKS AAAAGAGCGAAACTCCAGAGGTGCCAGTGGTGGAAGAAAGAGATCTG CCGAGC DRVVYEDCITDLSDVS GATTGGCCAAAACTGGAATCGCCAAAGTCAGACAGAGTAGTCTATGAA CCACGG EQRMTPEKCETPEAP GACTGCATCACAGATCTGTCTGATGTTTCGGAGCAAAGAATGACTCCAG AATCGA LVVECVELERLPKDLP AAAAGTGCGAAACCCCAGAAGCGCCATTGGTTGTAGAATGTGTTGAGTT AAGACC VTDRSTVVAIPEAVKL GGAAAGGCTACCTAAGGATCTGCCAGTCACAGACAGGTCAACTGTCGT TATAGG EEKSEVVIPRLMELSY GGCAATCCCTGAAGCAGTAAAACTGGAGGAAAAGTCAGAAGTGGTAAT AAGTCA TVPPEPSPVVEYTQP TCCACGGCTCATGGAGTTATCATACACCGTCCCTCCAGAACCCTCTCCAG GTGAAT YTHTHTKPKVKATCQ TGGTTGAATACACCCAACCATACACTCACACTCACACTAAACCAAAGGTC TGATGG MGKKRKVPTGKPDE AAAGCTACATGCCAGATGGGAAAGAAAAGGAAGGTACCAACTGGGAA AAATAC LIQIVRQWFEKEFND ACCAGACGAACTGATTCAGATTGTGAGACAATGGTTTGAGAAAGAATTC AAAACC YVTEGRNFQRLEWLT AACGATTATGTTACGGAAGGACGAAACTTTCAACGACTGGAATGGCTTA AAATTTC NLLTAAIQKASAGDE CGAACTTACTCACCGCCGCAATACAGAAGGCATCAGCTGGTGATGAGG TTCCATT ETIEKIRKRCPPPEVRE AAACAATCGAAAAGATTCGAAAGAGATGCCCACCTCCAGAAGTTAGAG CACAAG NEMSTQTSQRQKPT AAAACGAAATGTCCACTCAGACATCTCAACGTCAAAAGCCTACCACAAC GACTTA TTNQKKRSRNTTQSD GAATCAGAAGAAACGCTCTAGAAACACTACTCAATCGGATACACAAGCC CTGGTC TQANTYWRNRAKTY AACACATACTGGCGAAATCGAGCCAAGACATATAATCAAATCATAGGTC GAGTAG NQIIGQDFKQCDIPIA AAGATTTCAAACAGTGTGACATACCGATCGCGATACTAGAAGAATTCTA AGCACA ILEEFYKKTTSVTNVP TAAAAAGACTACCTCAGTGACCAATGTCCCTCAGGAAACCCTTGTGAAA AGCCAA QETLVKVTSRLPRLDI GTCACCTCAAGACTACCAAGGTTAGACATTGGAAAGTGGATCGAGGATC AATATC GKWIEDPFTEQEVFG CGTTCACGGAACAAGAGGTATTTGGTGCCCTCAAAAAGACAAAAGACA AAGTAT ALKKTKDTAPGTDGL CTGCGCCAGGAACAGATGGGCTCAGATACTATCACCTCCAATGGTTTGA GACGCA RYYHLQWFDPDCKM TCCCGACTGTAAAATGTTGAGTAGCATTTACAATGAATGCCAGCACCAT AAAATG LSSIYNECQHHLKIPA CTGAAAATTCCTGCCCAATGGAAAGAAGCTGAAACAATTCTCCTCTTCAA GGTAAC QWKEAETILLFKSGD AAGTGGCGACGAATCCAAACCAGACAACTGGCGGCCTATAAGTCTCATG CTTGGG ESKPDNWRPISLMPT CCCACCATCTACAAGCTATACTCAAGTCTCTGGAATAGGAGAATACGGA CATCCA IYKLYSSLWNRRIRTV CGGTGAAGGGGATTATGAGCAAGTGCCAACGAGGATTCCAAGAGAGA ATCAAC KGIMSKCQRGFQERE GAAGGTTGCAATGAGAGTATCGGAATACTGCGGAGTGCTATTGATGTG GGATAC GCNESIGILRSAIDVA GCTAAAGGGAAAAGATCCCACCTGTCCGTTGCATGGCTGGACCTCACCA CTCTGC KGKRSHLSVAWLDLT ATGCCTTCGGTTCAGTACCTCACGAGCTGATTGAAAGCACGTTAAGTGC GTATCA NAFGSVPHELIESTLS ATACGGCTTTCCGGAGATGGTTGTACACATTGTCAAGGACATGTATAAA GGCAAA AYGFPEMVVHIVKD GACGCTTCCATAAGAGTCAAGAATAGAACGGAGAAAAGTGAGCAGATT GTCGCC MYKDASIRVKNRTEK ATGATAAAATCTGGGGTAAAACAAGGCGACCCTATCTCACCAACACTAT ACCAAA SEQIMIKSGVKQGDP TCAACATGTGCCTCGAAACGGTGATTAGACGACATCTGAAAGAATCATC CTGTACT ISPTLFNMCLETVIRR AGGTCACAAATGCATTGACACCAGAATCAAGCTTCTTGCATTTGCAGAT ACTCCG HLKESSGHKCIDTRIK GATATGGCCGTTCTAGCAGAATCAAAAGAGCAGCTACAAAAGGAGCTT AAAAAA LLAFADDMAVLAESK ACAGAAATGGATGAAGACTGTACACCTCTCAACCTAATTTTCAAGCCGG CCAAGA EQLQKELTEMDEDCT CGAAGTGTGCAAGTCTCATCATAGAGTTCGGGAAAGTGAGGACCCATG AACATG PLNLIFKPAKCASLIIEF AGCAGATCATGTTGAAGCGAGAGCCGATCCGAAACCTCAATGATGACG ATTTTCC GKVRTHEQIMLKREP GAACATACAAGTATCTGGGAGTGCATACGGGAGCAGATGCAAGGACAT CACTCC IRNLNDDGTYKYLGV CAGAAGAGGAGCTGATCATTTCTGTAACAAAAGAGGTAGACCTTGTCAA GTTAAA HTGADARTSEEELIIS TCGCTCGGCGCTTACGCCACCCCAGAAACTGGACTGTCTTAAGACGTTC GCATCTC VTKEVDLVNRSALTP ACACTCCCAAAGATGACCTACATGTATGCCAACGCCATACCAAAACTTAC AACCAA PQKLDCLKTFTLPKM CGAACTTTCAGCGTTCGCTAACATGGTCATGCGAGGAGTCAAGATAATC GCTAAA TYMYANAIPKLTELSA CACTATATCCCAGTTAGAGGATCTCCTCTTGAATATATTCAAATTCCGAC GCGGTA FANMVMRGVKIIHYI CGGCAAAGGAGGACTTGGAGTTCCATGCCCTAGAATCACGGCATTGATT AGGTTA PVRGSPLEYIQIPTGK ACCTTCCTTGTCTCAACCATGAAGAAACTGTGGTCTGATGATGAATACAT TCATGTC GGLGVPCPRITALITF TCGTAAGCTCTACAACTCTTATCTGAAGAAGGTTGTGGAGGCGGAAACG AAAAGG LVSTMKKLWSDDEYI GGAATAGTGGAGGTCTCCACAAAGGATCTAGCAGAGTACCTCAGCAAC TGTAGC RKLYNSYLKKVVEAET AAGGTACCATCCAGAAAGCACGAATTCGGGTATAACTGCTACTCGAGGA TACAGC GIVEVSTKDLAEYLSN TTCGCGAAGTTTGTAATGGGCTAGCTCTCAACCAAGCTGCCCCTCTCTAC AACCTA KVPSRKHEFGYNCYS AAACTTGAATTCATCGAACAAGACAATGAGTTAGCAGTTGTTGTCCAGC AAGCCC RIREVCNGLALNQAA CGACTGAGGAGAGCAAGGAAAGGATTTTCACTAAAGATCATGTGAAAA GAAAGG PLYKLEFIEQDNELAV AGCTCCAGTCGCTACTGAAAGCCAGCGTGAATGACGCACTGCTACACAG TAGGGC VVQPTEESKERIFTKD ATTCTTGACAACAAAACCCGTCAAAAGTGAAGTGGTACAAGTTCTCCAG CGTATA HVKKLQSLLKASVND CAGCACCCTCAAAGCAACAGCTTCGTCCGAATGGGAGGTAAAGTAAGT AAAAGA ALLHRFLTTKPVKSEV ATATCGGTACATGTATGGATCCACAGGTCACGGTTAAACCAACTAACGT CCTACAC VQVLQQHPQSNSFV GCAATTATAACATCTTTGATCCAAAGCAACCGAAAAACTGCCGGAGGTG CCTCCAA RMGGKVSISVHVWI TGGTTATAAGAACGAGACTCAATGGCACATCCTGCAAGACTGCACATAT GACCTA HRSRLNQLTCNYNIF GGCTGGGCTAAACTTATACGAGAAAGACACGATGCCGTACATCACAAG AACCCA DPKQPKNCRRCGYK GTAGTCACAATGATTTGCGCTGGGGCAAAGAAGAACTGGGGCCGGAAA CGAACT NETQWHILQDCTYG ATCGACCAAGAACTGCCCGGTTTCACTTCACTCCGTCCAGACATTTGTCT CGAACG WAKLIRERHDAVHH GACGAGTCCGGATGGCAAAGAGGTTATCTTTGCGGATGTTTGTGTCCCT ACCTAC KVVTMICAGAKKNW TACTCAAGGACAAGGAACATCGAATTCGCGTGGAAAGAGAAAATCCGA AGGAAG GRKIDQELPGFTSLRP AAGTATACAGAAGGATACAGTCATCTTGTTGCACAAGGAATCAAAGTGA TCCGTG DICLTSPDGKEVIFAD CAGTCCTTCCGATAGCCATAGGATCACTCGGAACTTGGTGGACGCCAAC AATGGA VCVPYSRTRNIEFAW CAACGAAAGTCTCTATCAACTGGGTATCAGCAAGAGCGATATTCGCAGT GAGAAA KEKIRKYTEGYSHLVA GCCATTCCATTACTATGCTCTACTGTGATGGAGTATAGTAAGAACGCCTA TATCTCA QGIKVTVLPIAIGSLG CTGGAATCACATATACGGAAACTCATATACCTCGGTCCCACTGAGATAC CCAAAT TWWTPTNESLYQLGI GGACACCAGAAGCCCGATGGAGACGATTGGAAGAAAGAACTGAGTTGC CTCTTCC SKSDIRSAIPLLCSTV GAACCAGTTCTAGCTCTCCAACAATAACATGCCTTGGAAGGCACCACGC ATTCACA MEYSKNAYWNHIYG CAAAAGTCCTGGCAACTGATTTGAATAATGTATAAAAGTAACTGGAACC AAGGCT NSYTSVPLRYGHQKP AAATGCCCGATAGGTAGGGCGGGAGAAAATGACCTAGAAAACACAAA AACTGG DGDDWKKELSCEPV GTCCCAAGCCCCCGGATTCGAAAGACCTATAGGAAGTCAGTGAATAGA TCAAGT LALQQ(SEQIDNO: GAGAAATATCAAACAAATCTCACCCATTCACAAGGACTTACTGGTCGAG AGAGCA 1420) TAGAAAACAAGCCAAAACATCAAGCACGACGCAAAAAGGGGTAACTTT CAAGCT GGGCAACTAATTAACGGATACCTCCGTGTATCAGGCAAAGCCGCCACCA AAGCCT ACAGCAAATTACTGCCCGATAGGTAGGGCGTGAGAAAATGACCTACAA CCAAGC CCTCCAAGACCCGAGCCCACGGAATCGAAAGACCTATAGGAAGTCAGT ACGAAG GAATTGATGGAAATACAAAACCAAATTTCTTCCATTCACAAGGACTTACT TGATAT GGTCGAGTAGAGCACAAGCCAAAATATCAAGTATGACGCAAAAATGGG GGGTAA TAACCTTGGGCATCCAATCAACGGATACCTCTGCGTATCAGGCAAAGTC TTTAGG GCCACCAAACTGTACTACTCCGAAAAAACCAAGAAACATGATTTTCCCAC CAACCA TCCGTTAAAGCATCTCAACCAAGCTAAAGCGGTAAGGTTATCATGTCAA ATCAAC AAGGTGTAGCTACAGCAACCTAAAGCCCGAAAGGTAGGGCCGTATAAA GGATAC AAGACCTACACCCTCCAAGACCTAAACCCACGAACTCGAACGACCTACA CTCCGT GGAAGTCCGTGAATGGAGAGAAATATCTCACCAAATCTCTTCCATTCAC GTATCA AAAGGCTAACTGGTCAAGTAGAGCACAAGCTAAGCCTCCAAGCACGAA GGCAAA GTGATATGGGTAATTTAGGCAACCAATCAACGGATACCTCCGTGTATCA GTCGCC GGCAAAGTCGCCACAAACACTGTACTACTCCGTTACTCCCAAACACATG ACAAAC GATCTCCTTCTCTCACCAAAAAGCTTTATAACCAAGCTAACGGTGGAAAG ACTGTA GACATCATGTCACGAGGAGTAGCTACAGTAACCTCTCTCTTGAGACTGC CTACTCC AAAGTCGAGGATGGATTGGGAAGGCCGCGAGGCAAAAGGCGGGTAAC GTTACTC TCGGCCAGACGCTAGTGATCTTCGGATCCGACAGCCCTGGCCTTAGAGG CCAAAC AACCCTGGGATAAGGAGCACGACGGGAAGGATGTTCCGCAAGGATTTC ACATGG CCTTCCCATTAGTCAGGGCTGGCAGTTGGTAATATAGCCTTTCTACACAC ATCTCCT CACCGTCTTGCACCCACTAAACCAGTGGGATATGCGGGTGGACTCAATG TCTCTCA TAGAAAGGTGTTCCCACTGCCTGACTCGCCAACTTTATATGTCTTGTCAA CCAAAA CATAATGGCCCCTCACTATAAACTCCCTAGCAACTGGTGGTCCGGCGAA AGCTTTA GCCGGTTCTTGCCACTATTGCGCCCCAGGCTCGCC(SEQIDNO:1052) TAACCA AGCTAA CGGTGG AAAGGA CATCAT GTCACG AGGAGT AGCTAC AGTAAC CTCTCTC TTGAGA CTGCAA AGTCGA GGATGG ATTGGG AAGGCC GCGAGG CAAAAG GCGGGT AACTCG GCCAGA CGCTAG TGATCTT CGGATC CGACAG CCCTGG CCTTAG AGGAAC CCTGGG ATAAGG AGCACG ACGGGA AGGATG TTCCGCA AGGATT TCCCTTC CCATTA GTCAGG GCTGGC AGTTGG TAATATA GCCTTTC TACACA CCACCG TCTTGCA CCCACTA AACCAG TGGGAT ATGCGG GTGGAC TCAATGT AGAAAG GTGTTC CCACTG CCTGACT CGCCAA CTTTATA TGTCTTG TCAACAT AATGGC CCCTCAC TATAAA CTCCCTA GCAACT GGTGGT CCGGCG AAGCCG GTTCTTG CCACTAT TGCGCC CCAGGC TCGCC (SEQID NO: 1298) NeSL NeSL- Schmidtea TTAAATCATTTTTAAATGTGTTTGAATATCTTAAATTATCAAATCATATTA TTAAATCA TGAGTG MNVDLDATIKSIGM 4_SM mediterranea ATATCAATGCTAAAAAAAAATCGTGCKCATCAGGCGCACGAAAATAATG TTTTTAAAT TGCTAC NTKETTYPNSQLRVE GACACAACTCGTCGACCTGCTGTCGACTCACAGAGAACCTCAATTTGGA GTGTTTGA GAGGCA TTPCTSTTIMHASCN AGAATGGGAAGCCTATAATGCTACAATTCCGCCAACCCCTATTTGAATG ATATCTTA GCGCTG TTSTISYSPLPSAVSLP ACAGATAGTCAAATATCAAAAAATATACAAACTGCTGTCAAGCGTGACT AATTATCA GTAATT ESPASSITITTTDDNC CACTTCCTTCCAATCGAAAAATAGGAAKATGTAAGAAACATGAAAGTCA AATCATAT GCATCG DIIETPYPLPQTNGDL AGCTGAAAAACCAATAATATGTCCTAAAATAAAACAATTTGAAAATATG TAATATCA GCGTTG SEILKDIEANKDTTMS CAAAAAATACCTATAAAATCACAGCCGAATAAATTCCCATCCGTTCTAAG ATGCTAAA CAGATTT NKVLDCDSDSGDDR CAGAAACCGCTACGAACTACTGCAAGAATCGGATCAAGTATATTAATTT AAAAAATC GTGTAC DMIIENDRESDMDLF CCCCCCMGGGGGAAATTAATATACTTGTTAKAAAATTAATTTTTTAATAA GTGCKCAT GATAGA SQSLLNTNQSDERRE AAATAAATAAATCGAATAAATATAAAATAAAAATAAATCAAATTAAACTT CAGGCGCA TAAAAA KNLTENAPTEITTEKS TTATTAACAATAAAATCGCAGTAAGTAAATTTCCACTGTTATTAAATTTA CGAAAATA CCAATA YFDIISKASDNTTSKKL AAACAAAATTCCTTTAAAAATGCCTCTCTTTTTCAGTAATAACACCTTTTC ATGGACAC GTAATA LNVKNELTAGLPPMP TTGCTTTTATTACTATTTCTTGTGTACTGTACAAATCGAGCACAGTTATTG AACTCGTC AATGCT PVTNTAKFIRNVRPE CAAATAGGACATAGAAATTCCTTTTTAAGTAAATTTAAATCCATGAGAAA GACCTGCT GAGCCT DIADPTLYRLDSRGKL TAAAATAAAATCCTTTTGATTCAAAGTTTCTATGTTGCTTTCTAATAGAAT GTCGACTC AGCTCG GCRTQYKKPGCGDIA GGTGTAAGCATTAATGGGTCTTGATTTTTATAAATTAAATATATTTAATC ACAGAGA CATATCT VYDYEAIVEHAAFIHT TATTAAATTAATATGTTTTTATTAATTATTAATTTTTATAGTGGGGGGAAA ACCTCAAT AAGCCG IPFNEQNNVDCQPC TTAATATACTTGATCCCAAGAATCAACTGATGATGAAGAATATGTTATTT TTGGAAGA AAAGGC HPKKGKDVHTIVLIKY CAAAATACATACAAGAAGCTGGAAAAAACAAATCAATCGCTACAATGAA ATGGGAA AGCATA ADIFNHIEAHSHVVQ TGTGGATCTCGATGCAACAATTAAAAGTATTGGAATGAACACAAAAGAG GCCTATAA TATATG TAITDNMKTYLRLTKE ACGACCTATCCAAATTCACAACTGCGAGTTGAGACGACTCCCTGTACCTC TGCTACAA AGACAA NXFYCSYRNNKKKNK AACGACTATTATGCATGCATCTTGCAACACAACCAGCACTATATCTTACT TTCCGCCA TTTAAAA CKKAFNLESNMMDI CTCCATTACCATCGGCTGTGTCACTTCCCGAAAGCCCTGCCTCGTCAATC ACCCCTAT AAAAA TEHMKTHTGYSFDX ACAATAACCACAACAGACGATAATTGCGATATTATAGAGACCCCTTACC TTGAATGA (SEQID NLNILCYCGIWKPFTE CATTACCTCAAACAAATGGTGACTTGAGTGAAATATTAAAGGATATAGA CAGATAGT NO: LIAHIKTEHLQEYINSI AGCTAATAAGGACACCACCATGTCGAATAAAGTATTGGACTGTGACTCT CAAATATC 1299) PNKENIHNTTTIVSPL GACAGCGGCGATGATCGGGACATGATAATAGAAAATGACCGAGAATCT AAAAAATA NFAGILASGETQNIP GACATGGACCTGTTTTCGCAATCTTTATTGAACACTAATCAATCTGATGA TACAAACT DEEIIKPRDLPENLAF GAGGAGGGAGAAAAACTTAACAGAAAATGCTCCAACAGAGATTACTAC GCTGTCAA NRNIENELSWSQHLV TGAGAAGAGCTACTTTGATATCATCAGTAAAGCATCTGATAATACAACCT GCGTGACT KAYIFSYAVKTSTIFIN CTAAGAAACTGCTTAATGTAAAAAACGAATTGACTGCTGGACTACCTCCT CACTTCCT PYTCNALIQCNYKTFF ATGCCTCCAGTGACCAATACTGCAAAATTCATTCGAAATGTTCGACCTGA TCCAATCG ETFPFKDFAKWNEIV GGATATTGCAGATCCTACCCTATATCGACTTGACAGCAGGGGAAAGCTT AAAAATAG LPIHNNTSSWSFFFL GGATGCAGAACWCAATACAAAAAACCCGGATGCGGGGACATAGCAGT GAAKATGT NKKKRVAMIIDPTAD ATATGACTATGAGGCGATAGTTGAACATGCCGCATTTATCCACACAATC AAGAAACA DSHTLHFELATDILRTI CCATTTAATGAACAAAATAATGTGGATTGTCAACCATGCCACCCTAAAAA TGAAAGTC LNVQNIFEDLNFPLTE AGGAAAAGATGTCCATACAATAGTTCTGATAAAATATGCAGATATCTTT AAGCTGAA VEYPVCHEANLSAFX AACCATATTGAAGCCCATAGCCACGTTGTGCAAACCGCGATTACAGATA AAACCAAT VCHFLKCLMSDLPIDI ACATGAAAACCTATCTACGTTTAACAAAGGAAAATTTKTTCTACTGCTCA AATATGTC PDIDHMKETMRPIIR TATCGTAACAACAAAAAAAAGAATAAATGCAAAAAGGCTTTTAACCTTG CTAAAATA KYNCAKFPESDVRNY AATCAAACATGATGGACATAACAGAGCACATGAAAACTCATACCGGATA AAACAATT RVLIEDLIYQLNLDTIT CAGTTTCGACMAAAACTTAAACATTCTATGCTATTGTGGTATCTGGAAG TGAAAATA CEEILCEIERINGRLNP CCGTTCACAGAGCTCATTGCCCACATCAAGACTGAGCATTTGCAAGAAT TGCAAAAA KRYFKESKPKTDIIHL ATATTAACTCAATACCAAACAAAGAAAATATCCATAATACTACTACCATA ATACCTAT QKKKSAELLCVKRLKF GTTTCCCCTCTAAACTTTGCTGGGATACTTGCATCTGGCGAAACTCAAAA AAAATCAC QISQKTEIGKIWEND TATCCCCGATGAAGAAATAATTAAACCCAGAGATCTGCCAGAAAATCTT AGCCGAAT DVDHRPPMARFLKT GCCTTCAACCGAAACATCGAAAATGAATTAAGTTGATGGTCGCAGCACT AAATTCCC FASQDCPVSNTSSINL TGGTCAAAGCATATATCTTCTCTTATGCTGTTAAAACATCAACCATCTTCA ATCCGTTC PYYMDTDTDXCTDC TCAATCCTTATACTTGCAATGCTTTGATCCAGTGCAACTACAAAACTTTCT TAAGCAGA ENLSHIMKNLDSSAP TTGAAACCTTCCCTTTCAAAGACTTTGCCAAGTGGAACGAGATAGTCCTG AACCGCTA GMDLITGGDWKKISP CCAATTCACAACAACACTTCTTCTTGGTCCTTCTTCTTCTTGAACAAGAAA CGAACTAC KHELITAICNCILRNKV AAACGAGTTGCGATGATTATCGATCCAACTGCAGATGACAGTCATACCC TGCAAGAA CPEKWKLFRTVLILKP TGCACTTTGAATTGGCTACAGATATCCTAAGGACTATACTTAACGTCCAG TCGGATCA GKMSESFRANSWRP AATATATTTGAGGACTTAAATTTCCCTCTTACTGAGGTCGAATACCCCGT AGTATATT LAIMDTAYRIFTTLLN GTGTCATGAGGCAAACCTTTCCGCATTTTMTGTATGCCACTTTCTTAAAT AATTTCCC NRLLQWIRNGNLISP GTTTAATGTCGGACTTGCCAATTGATATTCCGGATATCGATCACATGAAA CCCMGGG NQKAIGIPDGCAEHN GAGACWATGAGACCAATTATTAGAAAATATAACTGCGCAAAGTTTCCG GGAAATTA ATLHFAIDRAKRCKTE GAGAGTGATGTTAGGAATTACCGCGTACTTATCGAGGACCTGATATACC ATATACTT LHIVWLDIADXFGSLP AATTGAACCTTGACACAATTACTTGTGAGGAAATACTGTGCGAAATCGA GTTAKAAA HDLIWYTLANMGLK AAGAATAAATGGAAGGTTAAATCCCAAAAGATATTTTAAAGAGAGTAA ATTAATTTT NETLTLIKELYKDVKTI ACCAAAGACGGATATAATACATCTGCAAAAGAAAAAGTCGGCGGAACT TTAATAAA FDCQGTLSEPVPITKG CCTCTGTGTTAAAAGATTGAAATTCCAAATCAGTCAGAAAACAGAAATC AATAAATA VKQGCPLSMTLFCLSI GGAAAGATATGGGAAAACGACGATGTGGATCACAGACCGCCTATGGCC AATCGAAT DYILKSILTNYPFLLHD AGATTCTTGAAGACTTTCGCGAGTCAAGACTGCCCCGTTTCGAATACGTC AAATATAA LNISILAYADDLVLLSD ATCCATAAACCTACCTTACTACATGGATACTGATACAGATAMGTGTACT AATAAAAA SYLEIKKSLESTVELAA GATTGTGAAAATTTGTCGCACATCATGAAGAACTTGGATAGCTCGGCAC TAAATCAA FANLKFKPSKSGYLSI CTGGAATGGACCTCATTACAGGTGGAGACTGGAAAAAGATCTCCCCGA ATTAAACT NNVNSDILKLHLYNE AGCATGAACTGATAACAGCAATCTGCAATTGTATACTACGAAATAAGGT TTTATTAA EIPTISENNKYRYLGV CTGCCCAGAGAAATGGAAGCTGTTTAGAACAGTTTTAATCCTAAAACCA CAATAAAA DFSYKRNQDVDGRL GGAAAAATGTCCGAGAGTTTCAGAGCTAACTCATGGAGACCTCTTGCAA TCGCAGTA GSALALTRSLFKSYLH TCATGGACACAGCCTATAGAATCTTTACGACTCTGCTGAATAACCGCCTG AGTAAATT PAQKLNAYKTFIHSKL CTGCAATGGATCAGGAATGGCAACCTCATAAGCCCGAACCAAAAAGCG TCCACTGT IFSLRNCVIGHRILDC ATTGGTATACCGGATGGATGTGCTGAGCATAATGCTACTCTACACTTCG TATTAAAT DRNRVTQGREKQLG CAATTGACCGAGCTAAACGATGTAAAACTGAACTACACATTGTTTGGCT TTAAAACA FDQEIKALLKTMIGD CGATATCGCCGATKCATTTGGTTCGCTGCCTCATGACCTGATCTGGTATA AAATTCCT KFQAXNNYFPYTHCK CACTGGCTAATATGGGTCTGAAGAATGAAACACTAACCTTGATTAAGGA TTAAAAAT LGGLGITSAIDEYLIQS ACTATATAAGGATGTGAAGACTATCTTCGACTGTCAGGGAACCTTGTCC GCCTCTCT ITGITRLFHSSNLSFRK GAACCTGTCCCAATTACTAAAGGAGTTAAACAAGGTTGCCCATTATCAAT TTTTCAGT MLITELAHSRGGKNF GACACTCTTCTGCCTGTCTATTGACTACATTCTAAAGTCAATACTGACTA AATAACAC EAGLKWLNCEVNKA ATTATCCCTTCCTTCTTCATGATCTGAACATCAGTATTTTGGCATATGCTG CTTTTCTTG FPNTSFFVKFQKSALA ATGACTTGGTTCTTCTTTCTGACTCTTATCTAGAAATCAAAAAATCTTTAG CTTTTATTA LKRKFCICVNLKFVED AGAGTACTGTGGAATTGGCAGCMTTTGCCAACCTTAAGTTTAAACCTTC CTATTTCTT NFSLEMTYKKRTSYV GAAGTCTGGATACTTGTCCATCAACAATGTTAACTCCGATATCCTTAAAT GTGTACTG NHQNLSTLSKELHDF TACATCTCTATAATGAGGAGATACCAACGATATCCGAGAATAACAAATA TACAAATC VGLYYAEQXCQMRV CAGATATCTTGGAGTTGACTTCTCTTACAAAAGAAATCAGGATGTTGAT GAGCACA QGHIATAIGDSITAKY GGACGACTTGGGTCTGCACTTGCACTCACCAGATCTCTATTTAAATCATA GTTATTGC LIASDILNDAQYYFLV CTTGCATCCGGCGCAAAAGCTGAATGCTTACAAAACCTTCATCCACTCCA AAATAGGA RARNNLLNLNYNAYR AGCTTATCTTCTCCTTGCGTAATTGCGTGATAGGTCATAGAATCCTCGAC CATAGAAA LKYNIGTKCRLCHLDE TGTGATCGGAATAGAGTTACACAAGGTCGGGAAAAACAGCTGGGCTTT TTCCTTTTT ETQAHXFNHCRAKP GATCAGGAAATCAAGGCACTWCTGAAAACCATGATTGGAGACAAATTT AAGTAAAT NARRVKHENVLVSIV CAGGCAKTAAATAACTACTTTCCTTACACTCACTGCAAGCTGGGGGGAC TTAAATCC AFLEKIGFEIDVEKSPK TTGGTATAACCTCAGCTATTGATGAATATTTGATCCAAAGCATTACCGGA ATGAGAAA YISIPTKLKPDMVIRSK ATAACAAGATTATTTCACTCATCCAACCTCAGCTTCAGAAAAATGCTAAT TAAAATAA RNKDIHVLDLKVPYD CACAGAACTCGCTCATTCTAGAGGAGGGAAAAACTTTGAAGCGGGGCT AATCCTTT SGEGFEKAREDNYVK AAAATGGCTTAACTGTGAAGTTAACAAGGCATTCCCCAACACCTCTTTCT TGATTCAA YKDLAIEIGKAFNQKA TTGTAAAATTCCAAAAATCGGCACTTGCTCTTAAAAGAAAGTTCTGTATA AGTTTCTA TISAVVIGCLGTWDK TGCGTTAACCTTAAATTTGTAGAGGACAATTTCTCACTTGAGATGACCTA TGTTGCTT KNNAALSKIGLTKTEII CAAAAAGCGCACTTCTTATGTAAACCATCAAAACCTCAGCACACTTTCCA TCTAATAG SLARIACPNAVIACYH AAGAACTCCACGACTTCGTGGGCCTTTACTATGCWGAGCAATGWTGTC AATGGTGT IYREHVSFTKSAMAL AAATGAGAGTACAAGGACACATTGCGACTGCGATCGGGGATAGCATAA AAGCATTA PFSLA(SEQIDNO: CAGCTAAATACCTAATAGCTAGTGACATCCTTAACGACGCACAGTACTAC ATGGGTCT 1421) TTCTTGGTACGTGCGAGAAATAATCTTCTGAATCTTAACTACAATGCGTA TGATTTTT TCGACTCAAGTATAATATTGGCACAAAGTGCAGACTTTGCCACCTTGAT ATAAATTA GAAGAAACTCAGGCCCATSTGTTCAATCACTGCCGTGCCAAACCAAACG AATATATT CTAGAAGAGTGAAACACGAAAATGTGCTAGTAAGCATAGTTGCCTTCCT TAATCTAT AGAGAAAATTGGATTTGAGATTGATGTGGAAAAATCACCCAAATATATC TAAATTAA TCAATACCAACAAAGCTGAAACCTGACATGGTAATTAGGTCTAAGAGGA TATGTTTTT ATAAAGATATACATGTCCTAGACCTAAAAGTGCCATATGACTCAGGAGA ATTAATTA AGGCTTTGAAAAAGCGCGGGAAGACAACTATGTTAAATACAAAGATCT TTAATTTTT AGCCATTGAAATTGGAAAGGCATTTAATCAAAAAGCGACTATATCTGCT ATAGTGGG GTGGTGATTGGATGCCTGGGCACATGGGACAAGAAGAACAATGCCGCT GGGAAATT CTTTCCAAAATCGGGTTGACTAAGACCGAGATCATATCTCTTGCCAGGAT AATATACT AGCATGCCCAAATGCGGTAATCGCATGCTATCACATATACCGTGAGCAC TGATCCCA GTCTCATTTACAAAGAGTGCCATGGCCCTCCCCTTCAGCCTTGCATGAGT AGAATCAA GTGCTACGAGGCAGCGCTGGTAATTGCATCGGCGTTGCAGATTTGTGTA CTGATGAT CGATAGATAAAAACCAATAGTAATAAATGCTGAGCCTAGCTCGCATATC GAAGAATA TAAGCCGAAAGGCAGCATATATATGAGACAATTTAAAAAAAAA(SEQID TGTTATTT NO:1053) CAAAATAC ATACAAGA AGCTGGAA AAAACAAA TCAATCGC TACA(SEQ IDNO: 1176) NeSL R5 AY216701 Girardia GTAGGTAACTATGACTGCAAAATAATAATTCTACACCTATTGTTGATAAC GTAGGTAA TGATCC TTGRNLGQWSCYSR tigrina TCATCTCGTGCGCAAACGGAGCATGTTATTTCTAATCATTTCGTCACACA CTATGACT GTGTGT SIQQSNYSFKLSSTEV GGATTCTTCTAATTCTGATAGTAATATTATAGATAGAGATAGGAACCTTG GCAAAATA TTGTGTC GELVEQSPAPLQSPQ TTGATTTAGATGCGTCAATAACTTCTCCTACTATTATACAGCCAGAGGAT ATAATTCT GTATGA FSNNYNNLNINNNLY AGTAAGATATCTGAGGATGAGGACTTCATCTTAGTCAATAGGAAAAAGA ACACCTAT TTGTTTC YSLNTFNQSNNLCCL GCAAAAATAAGAAAAAATCTAAGAAAACAACTGAAAATAAAAATGAAA TGTTGATA CGTGTG VNIEFFPTQHLLGDIV TTCCTATTCAAAAGAGTAAAGATAAGAAAAAGAAGTCTAAAATTAATAC ACTCATCT TGTCTAT NSGCINYMNNYNNF CGAAAAACTAACTGAAAATATTACTACTTCTGAAATACCACTTGAAATTG CGTGCGCA ATTTTTC DNINLYINSNVLSYNN CTCCTTCCATACCTTTACCTTCAGCAAGTACCTCGGGTTCTCAACAACCG AACGGAG TTTTTTA YNHSFLASPYTTNITE GCCAATCCTCCAGAAGACGCTACTCTAAGTGATACGGATCTCTTCCTTAC CATGTTAT TACTTTC HADINMHVQEVNM ACAGGATGATCCCGATAGTCTTATTCTTTCTGGAAGTACTCAACCAACCT TTCTAATC AATTACC QQDNNTQHAITQQV TTGTTGACCTCAACCCTTCACAGCAATCGGAACTTCCTTCAAATACTGAC ATTTCGTC TCGTTGT SLQATSLQHTLDEMI AGCCAAAGATTTGAGGCGGGTGAAACACCCAAAATCATAACTTCTTACA ACACAGGA AATGTT VQFNTAVRLKKKHKV GGGATGACCTTTTCTACTCTACAGTCCTTCACTACAACTCAGATACAGGT TTCTTCTA ATAACTT AKIFRGHNHRKDLPT TACGGTATAAGTGTTGACAATGGGGAGCAGAGGTTTCGAATTCTTGCTA ATTCTGAT CATATG LPAREQYKTKPKLAIR GGAATCTTGTCAGGAAAACCAAGGATAAGTTCCCCTCTTTATATGCTGG AGTAATAT GAATAT EVLHRKTTATSSPSEN ACAAGTAATTAGACACACAGTCTTCTTCAATCACTTCAACCAGGCATACT TATAGATA ATGTAA AIKAFFSSYSRPAELFT ACGCCAATAATATAACTGATAGTAAAGGTAATCTAATTGAGTTTTCTGAT GAGATAG TTTAGTT GQELLESSWFPVHPE GATAAGCCTTTTCAAAGTATACCGACTGACCCAAAAACTGAACTAGAGC GAACCTTG TAGTTTA DDFEFRIPGRDQIAKY AAATTAGGAGAGAGAGACAACATCTAGTTGATAGAGCTCTTAGACATAA TTGATTTA GTTAGTT IKFASKSAAGLDWITY TCAGTTACGGGAAACTTATATTTTAAATAAACTTAATAATAATAATGGGG GATGCGTC TAGTTTA EDIKLGDPSGEILQPIF GGGGTGGCGAACATTTGAAAAGGAAAAAGATCAAAGTCAATACGGATG AATAACTT GTTTAGT EYIVQNNICPSEGKAS ATGTCTCCAGCAATGATGGAGACAGAAAACATAGACGACAGGAAGAAA CTCCTACT TTAGTTT RTIMIPKPGKSDYSDP CCTCGGACAATGGAGCTGCTATTCCCGCTCAATCCAACAATCAAATTACT ATTATACA AGTTAG SSWRPITITSAVYRLL CCTTTAAACTGAGTTCGACTGAAGTGGGAGAGCTCGTTGAGCAATCTCC GCCAGAG TTTAGTT MKYLTWELYNWILL AGCTCCCCTTCAGTCGCCTCAGTTCTCTAACAATTATAACAATCTAAATAT GATAGTAA AGTTTA NQMLSRSQKSLGKFE CAACAACAACTTATACTATAGTCTCAATACTTTTAATCAGTCTAATAACCT GATATCTG GTTAGT GCHDHNAMLNMLI TTGCTGTCTTGTTAATATTGAGTTTTTCCCAACTCAACACCTTCTTGGTGA AGGATGA (SEQID QDVRRQTNPSNPIN TATAGTTAACTCGGGGTGCATAAACTATATGAATAATTATAATAACTTTG GGACTTCA NO: KNKRLYIVFLDFTNAF ATAATATTAATTTATATATTAATAGTAATGTATTATCTTACAATAATTATA TCTTAGTC 1300) GSVPLDTLMYVPQRF ATCACAGTTTTCTCGCTTCCCCATATACTACTAACATCACAGAACATGCA AATAGGAA GLGTSALTLIKNLYLD GACATAAACATGCACGTGCAAGAAGTTAACATGCAGCAAGATAACAAT AAAGAGC NYTNVTCGESKIENV ACACAACATGCTATAACACAACAAGTCTCTCTACAAGCAACATCTCTGCA AAAAATAA KLNKGVKQGCPLSM ACACACGTTGGACGAAATGATAGTCCAGTTTAACACTGCTGTCAGGTTA GAAAAAAT LLFNIFINIIIRAIEAMP AAGAAAAAGCACAAAGTTGCAAAAATCTTTAGGGGACATAATCATCGTA CTAAGAAA DVHGYPLGDMDIRIL AAGACCTTCCAACATTGCCTGCTAGGGAACAGTATAAAACTAAACCGAA ACAACTGA AYADDIALISDSHKDL ACTTGCAATTAGAGAGGTACTTCATCGAAAAACAACAGCTACGTCTTCCC AAATAAAA QEMVYKAEYIGRILGL CTTCTGAAAATGCAATTAAGGCTTTTTTCTCCTCCTACAGCCGTCCAGCT ATGAAATT LFNPSKCALMDIPHD GAACTTTTCACTGGTCAGGAACTTCTTGAATCATCTTGGTTCCCAGTACA CCTATTCA KKRTPPILVNGEMIKC CCCGGAAGATGACTTTGAGTTTAGAATTCCGGGTAGAGACCAAATAGC AAAGAGTA VGKADPYKYLGTFRS GAAATACATCAAGTTTGCTAGTAAATCAGCTGCTGGTCTTGACTGGATC AAGATAAG WFRKLDIKELLQMM ACGTACGAGGATATTAAGTTAGGCGATCCGTCCGGGGAAATTCTCCAAC AAAAAGA MDETKLITESNLHPH CCATTTTTGAATATATAGTACAAAATAACATATGCCCATCCGAGGGGAA AGTCTAAA QKIHAYETFIHSQLPF GGCTAGTAGGACCATTATGATTCCCAAACCGGGAAAAAGTGACTATTCA ATTAATAC HLRHSRIPFSDFITNR GATCCTTCTTCTTGGCGGCCCATTACAATTACCAGCGCAGTATACAGACT CGAAAAAC KTNKTTNNSNDSEKS TCTCATGAAATATCTTACATGGGAGCTGTATAACTGGATTCTTCTTAATC TAACTGAA IQKAYDPESGQLFLN AGATGCTGTCCAGGAGCCAAAAGAGTTTAGGGAAGTTTGAGGGATGTC AATATTAC TFALPSGCAKDFFYIT ATGATCACAACGCAATGTTGAACATGCTCATCCAAGACGTTAGGAGACA TACTTCTG KDAGGPQLTSGLDEY GACCAACCCGTCTAATCCAATCAATAAGAATAAGAGGCTATACATAGTC AAATACCA LIQSIMYIFRLLGSEDP TTCCTAGACTTTACGAATGCTTTCGGGTCGGTTCCGTTAGATACTCTCAT CTTGAAAT TLNSAIKHDLISHLNL GTATGTCCCTCAACGCTTTGGCTTAGGCACCTCTGCTTTAACGCTGATTA TGCTCCTT KGFVNINFSQAISIFN AAAACTTATATCTAGATAACTACACAAATGTAACATGTGGGGAAAGCAA CCATACCT SNFTDRTDHFSHLSR AATAGAAAACGTAAAATTAAATAAAGGGGTTAAGCAAGGCTGCCCTCTA TTACCTTC TEWARLQLARKKLKS TCTATGCTGCTTTTCAACATTTTTATCAATATTATAATTAGGGCAATAGAA AGCAAGTA TLAIQTNVCLINGHLV GCTATGCCAGATGTCCATGGATACCCACTTGGAGATATGGACATCCGGA CCTCGGGT LTLSLENNVLLIDSKEK TACTGGCATATGCTGATGATATTGCTCTAATATCTGACTCCCACAAAGAC TCTCAACA GDVKKIHASLMGFLR CTGCAGGAAATGGTCTACAAGGCGGAATATATCGGTCGGATTCTTGGAC ACCGGCCA LAHLIRLQKHGWSKL TACTCTTCAACCCGTCAAAATGTGCACTTATGGACATTCCGCACGACAAG ATCCTCCA LFSATTHHEILNKRIL AAGAGGACGCCGCCTATCCTCGTCAACGGTGAGATGATCAAGTGTGTTG GAAGACG NGHVPYKIWYFIHRA GAAAGGCCGACCCATACAAATATCTTGGAACCTTTAGATCCTGGTTCCG CTACTCTA RLGLLPTKLFSVSNLC GAAGCTGGATATAAAGGAGCTCCTCCAGATGATGATGGATGAGACTAA AGTGATAC RKCGGKKETMSHAL ACTCATCACCGAGTCAAATCTACATCCTCACCAAAAAATCCACGCGTATG GGATCTCT VNCPMMQTLINERH AGACCTTCATTCACAGCCAGCTCCCATTTCACCTTAGACACAGCCGAATT TCCTTACA DALEISLVQILSSKFQ CCGTTCTCAGACTTCATAACAAACAGAAAAACAAACAAAACAACAAACA CAGGATGA GTVIRQKTYVNELRP ATTCAAACGACTCAGAAAAATCTATACAGAAAGCCTACGATCCGGAATC TCCCGATA DITMESDTQYYLVEV AGGACAATTATTCCTCAACACCTTCGCCCTTCCAAGTGGATGTGCTAAGG GTCTTATT KCPFDTKMSFELRTQ ATTTCTTTTACATTACAAAAGATGCAGGTGGACCTCAACTCACAAGCGG CTTTCTGG QTTDKYNIIIEILEDVH ACTGGATGAGTACTTAATCCAATCAATTATGTACATCTTCCGACTATTGG AAGTACTC PGKEVRLVTFIVGTLG GCAGTGAGGACCCCACCTTAAACTCTGCAATAAAACATGATCTCATTTCC AACCAACC SWGPQNSDFLRDLG CACTTAAATTTAAAGGGTTTTGTAAATATTAATTTTTCTCAAGCCATTTCA TTTGTTGA FSKDEIDQVKTRLML ATCTTTAATTCAAATTTTACCGACCGAACCGATCACTTTTCACATCTTAGC CCTCAACC QNINSSCEQWKRFV CGCACTGAATGGGCAAGACTTCAATTAGCTCGGAAAAAATTGAAGTCAA CTTCACAG QYAPTITPGPIPDAES CCTTAGCCATCCAAACTAATGTCTGTCTGATAAATGGGCATCTTGTCTTA CAATCGGA EDDQGTSDNGPTAA ACTCTTTCGCTAGAAAACAACGTTCTGTTAATTGATAGTAAAGAAAAGG ACTTCCTT TVQGPVIGDEEEELQI GGGATGTCAAGAAGATCCATGCATCCCTCATGGGGTTTCTTAGGTTAGC CAAATACT YDSGLDESSDDEPDP TCACCTTATCAGACTGCAAAAACATGGATGGTCAAAACTGCTCTTCAGT GACAGCCA DDAELLFTIDIEQYLN GCGACCACTCATCACGAAATACTAAATAAGCGTATCTTGAATGGTCACG AAGATTTG SVITD(SEQIDNO: TCCCTTATAAGATTTGGTACTTTATTCATAGGGCGCGGCTGGGGTTGTTG AGGCGGG 1422) CCTACTAAACTCTTTAGTGTTAGTAACCTTTGTAGGAAGTGCGGGGGGA TGAAACAC AGAAAGAGACCATGTCGCATGCTTTGGTCAACTGTCCAATGATGCAGAC CCAAAATC CCTCATTAATGAGAGACATGATGCTCTTGAAATCTCCCTTGTACAAATTC ATAACTTC TTTCTTCTAAATTTCAGGGTACGGTTATAAGGCAAAAGACCTATGTCAAC TTACAGGG GAGTTAAGACCCGATATCACAATGGAATCGGATACCCAATATTATCTTG ATGACCTT TTGAGGTAAAATGCCCCTTTGACACGAAGATGAGTTTTGAATTGAGAAC TTCTACTCT ACAACAAACTACTGATAAATACAACATTATTATTGAAATATTAGAAGATG ACAGTCCT TACACCCTGGGAAGGAGGTGCGCCTTGTTACGTTTATTGTAGGCACCTT TCACTACA AGGCTCATGGGGCCCGCAGAACTCGGACTTTTTGAGAGATCTGGGATTC ACTCAGAT TCCAAAGACGAAATCGACCAGGTGAAGACGCGGCTGATGCTTCAGAAT ACAGGTTA ATCAATTCCTCCTGCGAGCAGTGGAAAAGATTTGTGCAATATGCACCCA CGGTATAA CAATTACACCTGGGCCGATTCCAGACGCGGAGAGCGAGGACGATCAGG GTGTTGAC GGACGAGCGACAATGGGCCAACAGCTGCTACAGTGCAAGGACCGGTGA AATGGGG TTGGCGATGAGGAGGAGGAACTTCAAATCTACGATTCCGGCCTTGACG AGCAGAG AGTCCAGCGATGATGAACCCGACCCAGATGATGCTGAATTACTTTTCAC GTTTCGAA AATTGACATAGAACAATATTTGAATTCTGTGATAACAGACTGATCCGTGT TTCTTGCT GTTTGTGTCGTATGATTGTTTCCGTGTGTGTCTATATTTTTCTTTTTTATAC AGGAATCT TTTCAATTACCTCGTTGTAATGTTATAACTTCATATGGAATATATGTAATT TGTCAGGA TAGTTTAGTTTAGTTAGTTTAGTTTAGTTTAGTTTAGTTTAGTTAGTTTAG AAACCAAG TTAGTTTAGTTAGT(SEQIDNO:1054) GATAAGTT CCCCTCTTT ATATGCTG GACAAGTA ATTAGACA CACAGTCT TCTTCAAT CACTTCAA CCAGGCAT ACTACGCC AATAATAT AACTGATA GTAAAGGT AATCTAAT TGAGTTTT CTGATGAT AAGCCTTT TCAAAGTA TACCGACT GACCCAAA AACTGAAC TAGAGCAA ATTAGGAG AGAGAGA CAACATCT AGTTGATA GAGCTCTT AGACATAA TCAGTTAC GGGAAACT TATATTTTA AATAAACT TAATAATA ATAATGGG GGGGGTG GCGAACAT TTGAAAAG GAAAAAG ATCAAAGT CAATACGG ATGATGTC TCCAGCAA TGATGGAG ACAGAAAA CATAG (SEQID NO:1177) NeSL Utopia- Chrysemys GTTTAATTCCTTCTGATGGACATCTGCAACACCCGTCCTGAAGATGGAGT GTTTAATT TAACCG MESPAXIFEKIDAALX 1B_CPB pictabellii CTCCTGCWKCCATTTTTGAAAAAATTGATGCTGCCTTGMAGATATACTC CCTTCTGA AGACCG IYSAAAXLXXNSLSLSP CGCTGCTGCTGMTTTGGAWGAMAATTCTCTCTCTCTCTCACCTWCAGM TGGACATC CCGACC XXAXXSXXAAPASST TGCASTCWCGTCAMCTSCTGCTGCTCCTGCGTCCTCTACTCCCCAGAAA TGCAACAC AGGGAA PQKTQXKPIPXTTLG ACTCAGWGGAAGCCTATCCCGAAKACCACTCTTGGTGCCTCACGGAAG CCGTCCTG ATAACC ASRKXRTTXKDEXIXX AMCMGGACCACCASCAAGGATGAAAAMATCAGSASCTGGCKGAAGAA AAG(SEQ CACTTCC WXKKAPVDTSXGRX AGCCCCTGTGGATACCTCTKCAGGGAGAMCTAGCACCAGAAGGACAGC IDNO: TTCCCTG STRRTALRDLTSRSXN TCTTCGGGACCTCACATCCAGGAGCAGKAATATCWCAMCAGCTCTTCA 1178) ACGAAC IXXALQEEDPRRTPPX GGAGGAGGACCCCCGGAGAACCCCTCCCWCTTCCCGGGACCAGGATGC CAAGGG SRDQDAERRPAAPEK TGAGCGCCGCCCTGCTGCTCCTGAGAAGGCTGCTACCAGAGGAGCCCCC ACGCAC AATRGAPPTIQDQD CCGACGATCCAGGACCAGGATGCTGATCGCTGCCCTGCTGGGAGGGAT CCCACCC ADRCPAGRDATGGA GCCACCGGAGGAGCCCCCCGACGACCCAGGACCAGGATGCTGANCGCT ATGTACT PRRPRTRMLXAAPLG GCCCCGCTGGGAAGGATGCCGCCGGAGGAGCCCCCCCCGACGACCCGG TATTCGC RMPPEEPPPTTRDQ GACCAGGATGCTGACCGCCGCCCCGCTGCTCCAGAGAGGGATGCTCCM TACACC DADRRPAAPERDAP GAAGGAACCACCTCCTCAACCCCGGACCCCGAAACCACTTACCACCCGC GATACT EGTTSSTPDPETTYHP CTGTCCGGAGGAGGGCCGCTCCAAGGGGAACGCACTCCGSAGCCCWG GACTTG PVRRRAAPRGTHSXA GATCTCGATGCCGCACGCTGTCCTTCCGGGCAAAGAGACATCGTGGCCA GACTCCT XDLDAARCPSGQRDI GTGAGTCCAGCACCCCCCCAGGAGCGACTTCACCTCCTCAAGCTTCTCTG TATACAT VASESSTPPGATSPP CCAGACCSAGAGGAATCACCTGCCGAGTCKGCAGGCACAACAGAGGTC TCCATG QASLPDXEESPAESA CGCCCCACAGAGGGTGAGGCAGGGGAAGACGACTGCATCTACCTCCAG GGTGGC GTTEVRPTEGEAGED TACCCGMTCCCTACAGGCCTCCTCCTCTGCCCCTTCTGCCTCCCCKTCCAT GTACCC DCIYLQYPXPTGLLLC GGAGTCCAGACCCTCGGGGCCCTCAGCAAACACGTTCGKAAGGCCCAC GAGCCC PFCLPXHGVQTLGAL AACAAACGGATTGCCTTCCGGTGTAGCCGCTGCGATGCACCMTTCGAG ACTTATC SKHVRKAHNKRIAFR ACTCAAAAGAAATGCAAGTMCCATCAMGCCACATGCAAGGGACCCCTC CACTGA CSRCDAPFETQKKCK ACAACCGCGAAGGTSAACCCCACTGACACCCTGCGGGTTCCAACCCCGA CACTTTA XHXATCKGPLTTAKV CCCCCACCGACGGTCCAGCTTCAGCACCCCAGCCAGCATCCCCAGAGCC AAAACT NPTDTLRVPTPTPTD ACAGCANGTAAGGGGGGACCAACCGCCAACCGAGGGAAGCGCAACCC CTTGCAC GPASAPQPASPEPQX CCGCCTCGAGGACTGACGATGCCACCAAAAGGACCAGCCCCGCCTCCAG CCCAATC VRGDQPPTEGSATPA AATCCCCACGCTGGACCCTGCCGTGAGGGGGATCACCGCCACCTCSCAG TGGGTC SRTDDATKRTSPASRI GTCAGCGACCTCACCAGATGCCTCAGCGACCTCATAAAAACCATCCGGC TATGCC PTLDPAVRGITATSQ ACAACACGGATACGAGACGGKGCAGCGCTCCCCCACAGGTMACCTCAT GGTTAT VSDLTRCLSDLIKTIRH GCCGCCCTGCCGTAGGAGCAACTAGCACTGCCCCCCAGGCCGCACGGC GCGATA NTDTRRXSAPPQVTS GAGACCCAGCCAACGGAGGAGCCTCCCGCAGCCCCCAGATCCCACGGC TGTATGT CRPAVGATSTAPQA CGGACCCCGCCCCCGGGAGACCCAACACCTCCTCCAAGGTTACCCAACG ATCTCTT ARRDPANGGASRSP AGACTCCGACCGCCAAAAACCCCATGCCCCACCGAGGACCCCCCAGCCG CATCCTT QIPRPDPAPGRPNTS GATACCACCCGCAGAAGAACCAGAACCATCCCCAGCGCTTCCAAACACG GCAACC SKVTQRDSDRQKPH ACCGCGCCCCCACAAAGCCCAACACCGGTGTTTCCAGAACCCCACTCCCT GATACC APPRTPQPDTTRRRT CCCGGAAGATCCAGCGCTGCCTCGGAGACACCGAGAGCTGCCCCCCCCC TGTAATC RTIPSASKHDRAPTKP CACCGCACCAGGACCCCCGCCTCAAGACCCACCTGAACACCGCTCCCCA CCTCATA NTGVSRTPLPPGRSS GTCCGAGGGACAACAAGGCCACAGACTGTCTCCGCAGCACCTGAACCC ACCCAA AASETPRAAPPPPHQ GCGGAGACAACGCAGCAGGAGGAACGACGGCCGCGAGCAGAGGGTCG GCCTGA DPRLKTHLNTAPQSE CCACGCCGTGGCAATCCGCCTGGATGGAGGAGCTGGCAAAGGCTGAGG CCCCAG GQQGHRLSPQHLNP ACTTCGAGACCTTCGACACCCTGATGGACAGACTGACCGCAGAACTGTC ATGTAC RRQRSRRNDGREQR TGCGGAAATTACAGCCAGAAGGAGGGAACCCCAGGAGGCCTCACGGG AGTACC VATPWQSAWMEEL CCACGCGCAGATTCCCCGCGCCGACCCGTAACAACACTGCCAGAGAAG TTCCCTC AKAEDFETFDTLMDR GCAGGAGAGGGGACGTCGGCCGCCGCTACGATCCGGCAGCTGCATCCC TTAACTC LTAELSAEITARRREP GTATTCAGAAACTATACAGGACGAACCGGACGAAAGCCATGAGGGAGA GTGTAT QEASRATRRFPAPTR TCCTCGACGGGACCTCCTCCTACTGTGCCATCCAGCCCGAGAGACTCTAC ATTTAAT NNTAREGRRGDVGR TCCTACTTCAAGGATGTGTTCGACCACGAGGCCCAGACCAACTTGCAAC TTTAAAC RYDPAAASRIQKLYRT GCCCAGAGTGCCTTCTCCCGCTACCCCGGATCAACCTCACGGAGGACCT ATTAACT NRTKAMREILDGTSS GGAGCGAGATTTTTCCCCGCAGGAGGTGCAGGCGAGGCTGATGAGGAC TTAATAA YCAIQPERLYSYFKDV CAAAAACACTGCCCCTGGAAAAGATGGCATCCGCTACCACCTGCTGAAG AATTTTT FDHEAQTNLQRPECL AAGCGAGACCCCGGCTGCCTGGTGCTTGCTGCCATCTTCACCAAATGCA AAA LPLPRINLTEDLERDF AGCAGTTTCATCGCGTTCCCCGCTCCTGGAAAAAGTCCATGACCGTGCTC (SEQID SPQEVQARLMRTKN ATCCACAAAAAAGGCGAGCGAGACGACCCCGGCAACTGGAGGCCCATC NO: TAPGKDGIRYHLLKK TCCCTCTGCTCCACCATCTACAAGCTGTATGCCAGCTGCCTCGCGGCAAG 1301) RDPGCLVLAAIFTKCK GATCACGGACTGGTCAGTGTGCGGGGGCGCCGTCAGCTCAGTACAGAA QFHRVPRSWKKSMT GGGTTTCATGTCCTGCGAGGGATGCTACGAGCACAACTTCCTCCTTCAG VLIHKKGERDDPGN ACGGCCATCCAGGAAGCCAGGAGGTCCAAGAGGCAGTGCGCCGTAGCA WRPISLCSTIYKLYAS TGGCTTGACCTGACCAACGCCTTCGGGTCCATACCCCACCATCACATCTT CLAARITDWSVCGG TGCCACCCTGGGAGAGTTCGGGATGCCAGAAACCTTCATCCAGATCCTC AVSSVQKGFMSCEG CGGGACCTCTACAAGGACTGCACCACCACCATCCGCGCCACGGACGGA CYEHNFLLQTAIQEA GAGACGGACGCCATCCCCATCCGCCGCGGCGTGAAACAAGGATGCCCC RRSKRQCAVAWLDL CTCAGCCCCATCATCTTCAACCTGGCCATGGAACCGCTCATCCGAGCCAT TNAFGSIPHHHIFATL CTCCAGCGGCCCGACCGGCTTCGACCTGCACGGCAAGAAAATCAGCATT GEFGMPETFIQILRDL CTGGCCTACGCGGACGATCTGGCCCTGGTCGCCGACAGCTCGGAAAGC YKDCTTTIRATDGET CTCCAGCAAATGCTCGACGTCACCAGCCAAGCCGCCGAGTGGATGGGC DAIPIRRGVKQGCPLS CTCCGCTTCAACCCCAAAAAGTGTGCCTCCCTCCACGTCGACGGTGGCG PIIFNLAMEPLIRAISS CCAGGGCGCTGGTCCGGCCATCACGATTCCTGATCCAGGGCGAGCCCAT GPTGFDLHGKKISILA GGCCTCCCTCGAAGAGGGAGAGGTATACCAACACCTCGGCACACCCAC YADDLALVADSSESL AGGAGTCCGCGTCCGACAGACCCCCGAAGACACCATCGCGGAGATCCT QQMLDVTSQAAEW GCGAGACGCGGCCCAAATCGACTCCTCCCTGCTCGCCCCCTGGCAAAAG MGLRFNPKKCASLHV ATCAACGCCCTCAATACCTTCCTGATCCCCCGCATCTCCTTTGTCCTCAGG DGGARALVRPSRFLI GGATCGGCCGTAGCCAAGGTGCCCCTGAACAAGGCCGACAGCACCATC QGEPMASLEEGEVY AGGCAGCTGGTGAAGAAGTGGCTCTACCTTCCCCAGAGGGCCAGCACG QHLGTPTGVRVRQT GACATCATCTACATTTCCCACAGGCAGGGCGGCGCCAACGTACCTCGGA PEDTIAEILRDAAQID TGGGTGACCTGTGCGACGTGGCGGTGATGACCCACGCCTTCCGCCTCCT SSLLAPWQKINALNT GACGTGCCCGGACCCGACGGTGAGGAGCATCGCGCAGGAAGCCGTAC FLIPRISFVLRGSAVAK GGGACGTGGTCAGGAAACGCATCGCCAGGGCCCCCTCCGAGCAGGACA VPLNKADSTIRQLVK TCGCCACTTACCTCAGCGGCTCCCTGGAGGCTGAGTTCGGGAGAGAGG KWLYLPQRASTDIIYI GGGGAGACCTGTCCTCTCTCTGGTCCCGCGCCCGCAACGCCTCGAGACG SHRQGGANVPRMG CCTGGGTAAGAGGATCGGCTGCTGCTGGAAGTGGTGCGAGGAGCGCC DLCDVAVMTHAFRLL GGGAGCTGGGAATACTGGTGCCACGCATAAAGACCCCGGACCACACCA TCPDPTVRSIAQEAV TCGTCACCCCGACCGCCAGAGCTATGCTGGAAAGGACCCTGAAAGACG RDVVRKRIARAPSEQ CCATCCGCTGCCACTATGCCGAGAACCTCAAGCGGAAGCCGGACCAGG DIATYLSGSLEAEFGR GCAAGGTGTTCGAGGTGTCCAGCAAGTGGGACGCCAGCAACCACTTCC EGGDLSSLWSRARN TCCCCGGGGGCAGCTTCACCAGGTTCGCCGACTGGCGGTTCGTCCACAG ASRRLGKRIGCCWK GGCCCGACTCAACTGCGTTCCCCTCAACGGAGCCATCCGCCACGGCAAC WCEERRELGILVPRIK CGGGACAAGCGCTGCAGGAAGTGCGGCTACGCAAACGAGACCCTGCCC TPDHTIVTPTARAML CACGTCCTGTGTGGATGCAAACAGCACTCCGGAGCCTGGCGGCACCGC ERTLKDAIRCHYAENL CACAACGCCATCCAGAACCGGCTGGTGAAAGCCATCCCGCCGTCCCTGG KRKPDQGKVFEVSSK GGAAGATCACCCTCGACTCCGCCATCCCCGGGACAGACAGCAGACTGC WDASNHFLPGGSFT GACCCGACATCGTCGTGACGGACGCAGAAAAGAAGAAGGTCCTCATGG RFADWRFVHRARLN TAGACGTCACGGTGCCTTTTGAAAACAGGTCACCGGCCTTCCACGAGGC CVPLNGAIRHGNRDK CCGAGCACGGAAGGCGTTGAAGTACACCCCGCTGGCCGAGACCCTGAG RCRKCGYANETLPHV AGCCCAGGGCTACGAGGTCCAGATACACGCCCTGATCGTGGGAGCCCT LCGCKQHSGAWRHR GGGCTCGTGGGACCCCCACAACGAGCCGGTTCTGAGAGCGTGCGGAGT HNAIQNRLVKAIPPSL CGGTCGACGCTACGCCCGGCTCATGAGACAGCTCATGGTGTCCGACACC GKITLDSAIPGTDSRL ATCAGGTGGTCCAGAGACATTTATACGGAACACATCACAGGACACCGTC RPDIVVTDAEKKKVL AATACCACACTGAGTAACCGAGACCGCCGACCAGGGAAATAACCCACTT MVDVTVPFENRSPA CCTTCCCTGACGAACCAAGGGACGCACCCCACCCATGTACTTATTCGCTA FHEARARKALKYTPL CACCGATACTGACTTGGACTCCTTATACATTCCATGGGTGGCGTACCCGA AETLRAQGYEVQIHA GCCCACTTATCCACTGACACTTTAAAAACTCTTGCACCCCAATCTGGGTC LIVGALGSWDPHNEP TATGCCGGTTATGCGATATGTATGTATCTCTTCATCCTTGCAACCGATAC VLRACGVGRRYARL CTGTAATCCCTCATAACCCAAGCCTGACCCCAGATGTACAGTACCTTCCC MRQLMVSDTIRWSR TCTTAACTCGTGTATATTTAATTTTAAACATTAACTTTAATAAAATTTTTA DIYTEHITGHRQYHTE AA(SEQIDNO:1055) (SEQIDNO:1423) NeSL Utopia- Acanthamoeba CCCGTCAAGGGTGCTCCACGAGATCCCTGTCGCTAGCCGACCGGTTTTA CCCGTCAA TAACAA MAAKSVACPHDGCA 1_ACa castellanii CCACCCCACCCCGCCCGGACAACCACGGACCCTGCTCCGCAGCAGGACC GGGTGCTC CCATGT NKYASEASLRRHIKNK CCACGCACGATGGCCGCTAAATCCGTCGCCTGCCCTCACGATGGATGCG CACGAGAT ATGGTG HATDEEGDETSHSCP CCAACAAGTACGCGTCGGAAGCCTCCCTCCGAAGACACATTAAGAACAA CCCTGTCG AACCAC HCHRPFSTARGLSVH ACACGCTACAGATGAGGAAGGAGATGAGACCTCACACTCCTGTCCCCAC CTAGCCGA ACCTCTC IGKSHRQAPPEPTRP TGCCACCGACCTTTCTCCACCGCCCGCGGGCTCAGCGTCCACATTGGCA CCGGTTTT TCGATCT PPAPAPADPGLDPDP AATCGCACCGTCAGGCCCCCCCTGAGCCGACGCGCCCCCCCCCGGCCCC ACCACCCC TGTATTC GPTVTPPSRDDEDRE GGCCCCTGCCGATCCCGGCCTCGATCCCGACCCCGGCCCCACCGTGACG ACCCCGCC TGTGATT EPDDDPVEIADLSCP CCCCCCAGCCGTGATGACGAAGACCGCGAGGAACCCGACGACGACCCC CGGACAAC GGACAT HCAQALPSAHGLAN GTGGAGATCGCGGACCTAAGCTGCCCTCACTGCGCCCAGGCCCTCCCGT CACGGACC CAGAGT HLRACKDHRVPAPG CGGCCCACGGCCTCGCCAACCACCTTCGCGCCTGCAAGGACCACAGGGT CTGCTCCG TCCTGC APRSGPPSSRYWTAV CCCCGCCCCTGGAGCACCCCGCTCGGGTCCGCCCAGCTCCAGGTACTGG CAGCAGG GAAGGG EHHRYVEAMARFAD ACTGCTGTCGAGCACCACCGCTATGTGGAGGCCATGGCGCGCTTCGCG ACCCCACG ATACACT HPDLLARAAAHIGTR GATCACCCCGACCTACTTGCGCGCGCGGCTGCCCACATCGGGACCCGCA CACG(SEQ CTGCCA TYKQVDSHRTKVIAA CGTACAAACAGGTTGACTCCCACCGCACCAAGGTGATCGCGGCGGAGC IDNO: ATCTCGT EREGRPVRTLDPTM GCGAGGGCCGCCCTGTCCGCACGCTCGACCCCACGATGGACTGGCGCA 1179) GGGTTG DWRMRPYCASTTAR TGCGGCCCTACTGCGCCAGCACCACGGCCCGGTGGCTGGCTGAGCAGG TAATAA WLAEQGRSPVAPRS GGCGTAGCCCAGTAGCGCCCCGCTCGCCCTGCCCCGAGCCCCACGCCCC ATCCAC PCPEPHAPPPAAALL GCCGCCTGCAGCCGCGCTGCTGTACATCCCGGCCACGCCCCCCGCGCCA ACCTTCA YIPATPPAPTPRAPVA ACGCCCCGTGCCCCAGTGGCGCCTCCCAAGCTTGCGCCTCCCGCCGAGA ACA PPKLAPPAESTVPATP GCACCGTGCCCGCCACGCCCGATGGGAATCCGGAGGCGCCAGCACCCC (SEQID DGNPEAPAPPFSAPG CGTTTAGCGCCCCCGGACCTCCCACCCCCAAGGCATTGCCGCCCCCGCCC NO: PPTPKALPPPPPSRR CCGTCCCGCCGCAACCTGCGCCCTCACCTCGTGCCCAAGGATGCTTGGC 1302) NLRPHLVPKDAWQG AGGGGGTCGCCGATGCCGTCGCCCCTGCCGCCTCGCGCCTCCTGCGCAC VADAVAPAASRLLRT GCCCCTTGCGCACCTCTCCACCGAGCAGTGGGCCACGTTCGAAGCCGCC PLAHLSTEQWATFEA CTCGCCGGCCTCGAGGCTACGCTCCACCATGCCGCCCGCAGTGCAGAGG ALAGLEATLHHAARS CGGTGCCCACACGCTGCGCTAGCCGAGCAAGGGAAGACGCCGAGCGCC AEAVPTRCASRARED AACTCCGTGAAGCCCGAAAGACGCGTGAGATCTTTGGCAAGGCCGCTG AERQLREARKTREIFG CCCTCTACGCAGCCGGCAAGGACCCCACTGCCACCATCGAGCGCATCCC KAAALYAAGKDPTAT CCCAGAAGTCCGCCTACACCTGCCAACCCCTGGCTCGGCTGAATGGCCC IERIPPEVRLHLPTPGS GCCAGGGCGGCCGCCGCCCGCAGGGTGATCCGCCGTGCAGTCGCGCGA AEWPARAAAARRVI GCGGACCGGTTGCGCAAGCGCATGGGCATCCTCGATAGCGACCGCGAC RRAVARADRLRKRM CTCCAACGCCTCTTCAACGCTAACCAGAAGAAGGCAGTTCGGCAGATCC GILDSDRDLQRLFNA TCGCCCCGTCCACCAAGGCGCCGCGGTGCCAGCTAGACCCAGCCGCCGT NQKKAVRQILAPSTK CGAGGAGGCCTACATCCAGACCCTCGCCAAGCCGCCGCCGATCGACCCC APRCQLDPAAVEEAY AGCCCCCCGTGGAAGAACTCCGTCCAGTGGCCCCGCCCGCCCACTGCCG IQTLAKPPPIDPSPPW CCGATGACGGAGGCAGCCCCTTCAGCGTCGCCGAGGTCCGGGCCCAGC KNSVQWPRPPTAAD TCCGCCGACTGCCCAACGGGTCCGCCCCAGGGATCGATGGCATACCGTA DGGSPFSVAEVRAQL CGAGGCCTACAAGCGTACGAAACTGGACGCCACGCTCGCCCATGTCTTC RRLPNGSAPGIDGIPY GAGGTCGTGCGGCTGAATGCGCGCCTGCCAGCTCGATGGGATGTGGCG EAYKRTKLDATLAHV CGCACGGTCCTGCTCTACAAGAAAGGCGACCCTAACGACACCGGCAACT FEVVRLNARLPARW GGCGACCGATAAGCCTCCAGGTCACCATCTATAAGATCTTCACGGCCGC DVARTVLLYKKGDPN CCTGTCGAAGCGGCTCATCTCCTGGGCTGGCAAGCACAACACTTTCTCC DTGNWRPISLQVTIY GCATCGCAGAAGGGATTCCTACCGGCCGAAGGCTGCCACGAGCACGCG KIFTAALSKRLISWAG TTTGTCTTGCGAAGCGTGCTTGACGACGCCCGTCGGCACAAGCAGAACG KHNTFSASQKGFLPA TGTACCTTGCCTGGTACGATCTGCGCAACGCCTTCGGATCGGTGTCGCA EGCHEHAFVLRSVLD CGACCTCATCGCCTGGTGCGCTGCCATGTTGGGCCTGCCCCGCTACCTCC DARRHKQNVYLAWY GGGATGCCATCGGCGCAATCTATCGGCACTCAGCGCTCTTCGTCCAAGT DLRNAFGSVSHDLIA TGGGGATCAGGAGACCACCGGCGTCATTCCTATGCGCTGCGGCGTCAA WCAAMLGLPRYLRD GCAGGGCTGCCCTCTCAGCCCCCTCCTCTTCAACCTGTGCGTCGAGCCG AIGAIYRHSALFVQV GCCCTTCGCTGCCTACGCCGCACCACCGGGTACAAGTTCTACGGCACGT GDQETTGVIPMRCG CGATCACCGTCGAGGGCCAGGCCTACGCCGACGACCTGCTCACTGCCGC VKQGCPLSPLLFNLC GCCCTCCGCCTACCATGCGGCCCGGCAGGTGGCCACGATCGAGGAATG VEPALRCLRRTTGYKF GGCCAACTGGGCGGGAGTCTCCTTCGTCGTCCAAGCCCTCTCCCTGGAT YGTSITVEGQAYADD GCGCCGGCCGGCAAGTGTGCCGCCCTCGCGATCAACTTCGAAGGTGGT LLTAAPSAYHAARQV CTAATGCACTCTATCGACCCTGCCCTCAAGGTCCAAGGCGCAGCCATCCC ATIEEWANWAGVSF GGCCATGTCAAGAAACAACGTGTACCGCTACCTCGGAGTACATGTCGGT VVQALSLDAPAGKCA CTCACAGATGCGCTCGGCCAAGCGAACGAGCTCCTCGAGAAGGCCTCA ALAINFEGGLMHSID CGCGATGCACGCACGATCTGTGCCTCTGGCCTCGAACCCTGGCAGAAGG PALKVQGAAIPAMSR TGGTCGCAATCAAGACCTTCATCCTCTCCCGGCTCCCCTTCTTCTTCCACA NNVYRYLGVHVGLT ACGGGAAGATCCAGAGGGGCCGATGCCAGCAATTCGACCGCGAGCTTC DALGQANELLEKASR GAGAAAACCTGCGGGCCGCCCTCCGACTCCCCGTCTGCACCACGAACGC DARTICASGLEPWQK CTTCTTCCATTCCCGCGTGGCCTCAGGCGGCCTTGGCATCCTGCCCATCG VVAIKTFILSRLPFFFH CGGAAGAACAACAAGTCTACCTGGCAGCCCACGTGTTCAAGCTCCTGAC NGKIQRGRCQQFDR TTCGCCAGATCTGTCGATCCGCGCCATCGCCCGACACCAACTTGCCGAG ELRENLRAALRLPVCT GTCACCCACGCGCGACACACCACGCCAGTCCAGGACGGCGAAGCGTCA TNAFFHSRVASGGLG CCCTTCTTCGGATGGCTCATGCGGGGGCAGGAGGTCGCATCAACTACCC ILPIAEEQQVYLAAHV CCTCGGGTGACGTCAGTTCAATCTGGTTCGCAGCTGCAGGCGCCTACTC FKLLTSPDLSIRAIARH GAGGATGGGATGGTCAGTCCGCGATGCACTCCACCCGACGCTGACAGT QLAEVTHARHTTPV TGGTCCGGGCGTCCAATTCGAGGGCCGATTCCAACGTGCCAACGTCATC QDGEASPFFGWLMR CCAGCTCTCCGGGCTAGCGCCTTTTCCCGCCATGCTGTGGAATGGAGTG GQEVASTTPSGDVSS CCCTCCGCACCCAGGGTCGAGCAGCAGCCTACCAACATGCCGTCCACCC IWFAAAGAYSRMG TGCAACGCACCACTGGGTCCACAACAGCGCTGGCCTGACGACCAAGGA WSVRDALHPTLTVG GTACCGATTCGCGATCAAGTGTCGATTGGGTCTCCTGCCGACGCGAGCA PGVQFEGRFQRANVI GCTCCACACCACCGCAATGGGCCAACAGCGTGCAGGGCGTGCTCCTAC PALRASAFSRHAVEW GCCCGCGAGACGGCCAACCATGTTCTCGGACACTGCCCGGCGACCAAG SALRTQGRAAAYQH GCCGAAGTCATCGCGCGCCACAACAGGATATGCCGAGCTCTGGCCCAG AVHPATHHWVHNS GCGGCTGAAGCCTCATGGACGTCTGTCCTTGAAGACGTCCCGATCCCGG AGLTTKEYRFAIKCRL GGGTGGACTCCCCCCTACGACCCGACATCTACTGCTCTCGGCCGGGCCA GLLPTRAAPHHRNGP GTGTGCCATCATCGAGGTCGCGGTCTCCTACGAGGACGCCTTCAACGCT TACRACSYARETANH TCGATGGAGGGCCGGGCGAAGCAGAAGACCGACAAGTACGCTGGCCT VLGHCPATKAEVIAR GGCTGCTACCGTCGAGGAGCAGCTGCGGCTCCAAACCCGGCACGCGGC HNRICRALAQAAEAS TTTCGTGGTGGGCTTCTCTGGCGTCGTGCTCCCAGCCTCGGTAACCGCTA WTSVLEDVPIPGVDS CGGCCACCTCCCTTGATCTCCCCCCCAAAACTTGGAATGTGCTTCTTAAA PLRPDIYCSRPGQCAI CGTTGTGTTGCTGCCTCAATCAAAGGCAGTTACACAGCGTGGAGAAGAT IEVAVSYEDAFNASM TCCGGCGCTCTACTCCATAACAACCATGTATGGTGAACCACACCTCTCTC EGRAKQKTDKYAGLA GATCTTGTATTCTGTGATTGGACATCAGAGTTCCTGCGAAGGGATACAC ATVEEQLRLQTRHAA TCTGCCAATCTCGTGGGTTGTAATAAATCCACACCTTCAACA(SEQID FVVGFSGVVLPASVT NO:1056) ATATSLDLPPKTWNV LLKRCVAASIKGSYTA WRRFRRSTP(SEQID NO:1424) NeSL Utopia- Acromyrmex GGTGCACAACGGATGCATCATACGTGTACCGGAGCATACGGGCTGTCA GGTGCACA TAAATTA VCSVRGCRREDSRRF 1_AEc echinatior CGGCGGCTGCATGCGCGATCTAGCTCGGAGATTTTATTTATTTATTTATT ACGGATGC TTTTGTC YKFKFPLNFVKVPKTI AATTTATTTATTTATTCATCGAGTGTGAGTGTTCGCGTTTTGCCGAGAAG ATCATACG TTTGTCT VIGSAFQKSSVSARS CGATTTTCGTTAAGTGATACGCGCCGCGTTCATAGGTTAGGTGTGCAGT TGTACCGG TGGCCC QNHSRSTRVPKTRQP GTACGTGGCTGTCGTCGCGAAGATTCTCGAAGATTCTACAAGTTCAAAT AGCATACG CCCCTTT RTSNTIGRYTAASAN TTCCGTTAAATTTCGTTAAAGTTCCTAAAACCATCGTGATCGGGAGTGCG GGCTGTCA TTAAACC NYLTVIITGNYTVFAQ TTCCAAAAATCATCAGTTTCGGCTCGTTCGCAAAACCACTCCCGGTCGAC CGGCGGCT AAGCAG WICYRECTWLLSKFV CCGCGTTCCGAAAACCAGACAGCCACGTACAAGCAATACGATTGGTCGG GCATGCGC GAGAGA NFFLTIIGYFFQLRLVV TACACTGCCGCGAGTGCGAATAACTATTTAACTGTTATTATTACTGGAAA GATCTAGC GTGGCC IYEGPVILDTFSNCGS TTACACTGTATTCGCCCAGTGGATCTGTTATCGCGAGTGCACTTGGCTTT TCGGAGAT CAATGC SLFMRGQXSKALLVR TAAGTAAATTTGTTAATTTTTTCTTAACTATAATTGGATACTTTTTCCAATT TTTATTTAT CCAACT LNRSALAMADPQVH GCGGCTCGTCGTTATTTATGAGGGGCCAGTKATCTAAGGCCCTTTTAGTC TTATTTATT ATTATAT YIDYPLPPRVKCVKCF AGACTAAACCGTAGTGCACTCGCCATGGCGGATCCACAAGTGCACTATA AATTTATTT ATTAACT GAEGAGKVKGEYSD TAGACTACCCGCTGCCCCCTAGAGTCAAATGCGTAAAATGTTTCGGTGC ATTTATTC ATTTACT PPHLAKHLKKCHPGD TGAGGGGGCAGGCAAAGTAAAGGGCGAATACAGCGACCCGCCGCATTT ATCGAGTG GTGATA TLNYKCSICDLRGTGK AGCAAAACATCTGAAAAAGTGCCACCCGGGAGACACATTAAATTATAAA TGAGTGTT TTTATTA YPLRDVKAHYAECHV TGCTCAATTTGTGATCTAAGGGGGACCGGTAAATACCCCCTTAGAGATG CGCGTTTT TTTGACT SPAVDAAGPSTRGSL TTAAGGCACATTACGCCGAGTGCCATGTGTCTCCCGCAGTGGATGCGGC GCCGAGA GTTGGG GECSGAGQPTASRA GGGTCCAAGCACTCGCGGCAGCCTCGGCGAGTGCAGTGGTGCGGGTCA AGCGATTT CGGGCC AKATTRLAETVGGTD ACCGACAGCCAGCCGCGCGGCTAAAGCGACCACGCGATTGGCGGAGAC TCGTTAAG CCTCTCT KRRAATSGSRQLTLP GGTTGGGGGTACGGATAAGCGCCGTGCCGCGACATCGGGATCGCGGC TGATACGC GCTGGT FAATPSPSTAAGEAR AACTCACGCTGCCGTTCGCAGCCACCCCATCGCCATCCACAGCAGCCGG GCCGCGTT TTTATTT APRSXSTTPTSRSPSY TGAGGCAAGGGCCCCAAGAAGCGMGTCAACGACACCGACGAGCAGGT CATAGGTT ATATATA AAVTAGPPSMRSTTT CCCCCTCATATGCGGCAGTCACTGCGGGCCCGCCATCGATGAGGAGCAC AG(SEQID TTTTTTA STTARSKTVAKGAAP GACAACTTCCACCACAGCCCGCAGTAAGACTGTCGCGAAAGGCGCCGC NO:1180) CTCGCG NTTTTTTARRSGEAA GCCCAACACMACGACGACGACAACGGCCAGGAGATCCGGCGAGGCCG TACTTTT ATRKPPTTATVSKPR CCGCAACGAGGAAGCCGCCTACGACCGCCACGGTGAGTAAACCGCGTG TGTACTA VLSVETVRLPVDDIQ TGTTGTCGGTAGAAACTGTTAGGTTGCCTGTCGACGACATCCAGCGAGC CTCTATT RAGVQNAAKPARAP AGGCGTGCAAAACGCGGCCAAACCGGCGCGCGCTCCCTCTCGCCCCCC TTTCTTT SRPPQRTSPEAGGPR GCAGAGAACATCACCGGAGGCGGGGGGTCCAAGAACAACGGGCGCAA TTATTTT TTGAKEKCGEGAYKK AGGAGAAATGCGGAGAGGGAGCATACAAGAAGTTGCCCGCAAACAGC AGCTAT LPANSGNPISTRTRR GGCAATCCAATTTCAACCAGGACGAGGCGGGCAACTAGCGTGCCGGTT GCTATTT ATSVPVEKSEGTARR GAGAAGAGCGAAGGCACGGCAAGACGGGAGCGCGTCTCCCCACACCCT TTATCTC ERVSPHPPPKGIDIILS CCTCCCAAAGGAATTGATATCATCCTATCGTCGACATCGGAGGAGGAGG TTTCTTT STSEEEGTPYQPGGV GCACGCCATACCAGCCCGGCGGCGTGGGGAGACTAAGACTAAGGAGG GTCTCTA GRLRLRRKKVTGPPP AAAAAGGTGACCGGACCACCCCCAAAGATGACACCCAGAGAGGGGGT TTTTCTT KMTPREGVVTRARR GGTCACAAGAGCCAGGCGGTCCACCAGCGCTCCCGTCGAGAAGAGTGC TCTTTTT STSAPVEKSALDARLT CTTGGATGCACGCCTGACGGCTCTGGACCGGACATCGTCCAGAGCGAC TTCTTTC ALDRTSSRATGNPTS AGGCAACCCGACGTCGCAAATCGCAGGGGGCCTTTACACCAGTAGAGG CTTTTCT QIAGGLYTSRGQPER CCAACCGGAGAGGACGCCCCCTGCGAGGCTCCCCAGCCTGTCTCCGACC TTTCTTT TPPARLPSLSPTTRGS ACCAGAGGCAGTCCATCGGGGAGCCTAGGCGAGATACGGACACCCATC CTTTTAT PSGSLGEIRTPISPATS TCGCCTGCGACGTCGCTACCGGCAACGCTCACCACTTGCACGGTGACGA TCTTCTT LPATLTTCTVTTTTCG CGACCACCTGTGGAAGCCCCATAACATCCACGGGCTTCACAGGTGGCGT TTATTTA SPITSTGFTGGVGRLI GGGGAGGCTGATAACACCWCCGAGCCTCCCCCAAACGAACATCCTCCC TCTTTTT TPPSLPQTNILPTIGEE GACCATCGGGGAGGAAGGAACGTCACCGTGCGTGGCGGTCGTCACCAC TTCTTTT GTSPCVAVVTTHPRP CCATCCTTAGGCGGCAGGAGCACGTCTCCGCTCATCCTACCAAGGCCAA CTGTTGT TGEDAPCEAPQPVSD CGACACCGGAGCCTGAGCGGGGACAAGAGGAGCGGCGGCTAGAAGGC GGGGCC HQRQSIGEPRRDTDT GCGGCGCAGCCACCTACCACACCCGTCGTCGAGGGGGACAACCAGTGG CTGACC HLACDVATGNAHHL GATGGCCAGTGGACGGTGAGCGTGAGGAGAAGAGCGAGGAGGCAACA GTCCGA HGDDDHLWKPHNIH ACTGAACGATACATCCCCCTCCAACTCCGAGTCCCCGCCAACCGCTGGA GTGTGA GLHRWRGEADNTXE CCATCGCGTTCGCCGCGCATAGCCCCACTATCTGCGCTGATAGCGGCGT ATGCCG PPPNEHPPDHRGGR CGACGAGCCGCCATGAGACCAGCTTAAATCTCAATTGCACGAACGGCAA CGAAAA NVTVRGGRHHPSLG TATTTGCATGGACCGAACTCCGCCCCGTAACATTTTGCCGGTARGGGCG ACAATA GRSTSPLILPRPTTPEP GAGCGCCGTCGCGAGACATCGCCACAGGATCGCGTGGAGGGAGACATC TTATGTT ERGQEERRLEGAAQP GGTTATGGTGCTGGAAAGGTAAGTGCCGAACACCCGAGTGCTCCCGTA TTATACG PTTPVVEGDNQWD AATGTCCGTGGTGTGATGTCTCGAGGGAGAGCAACCGCGTCATCCATCG AGTGTG GQWTVSVRRRARRQ TGCCACCGCGAGCCAACCGTGGGGAGGGCGGTCGGCAGCATCATAGTC CATGTG QLNDTSPSNSESPPT GGCGGCGTCCGGACGCTCCTGTCGGTCAGCCGTCGCGGGATCACCCGG CGTGAT AGPSRSPRIAPLSALI CGCCTGCGACTGTCGCGAGGCAGCGTAGGCGTGAGCGGGTGGCCGCCC ATATTTA AASTSRHETSLNLNC GCGACGCGCTGCTCGATCGGGCTAAGGACGTCGCTACGATTGCGGATC TCTATTT TNGNICMDRTPPRNI TGGAGGCGTTCGCGGCTTCGGTCGCGGCGTTCTTCGGGGAGGATGCAT TATTTTA LPVXAERRRETSPQD CGGCCACTGGTGCTGCAGCCCGCGCTCGCGATCGTTCGGTACGCTCACG TTTATTT RVEGDIGYGAGKVSA GGAGGCGGGTGCGCGTCGGGGGGTGAAGGGAGGTGAGCGTCCGGAG ATTATAA EHPSAPVNVRGVMS AGAGAGGGCGCCGGTAGGCCGGGGTCAGCGCCGGCTGACCCCGGAGC TTTATTG RGRATASSIVPPRAN GTCGGGAGAAGCACGCGGGGACTGGGTGCGCGAGGCCAAACGCTTGC CCGCGC RGEGGRQHHSRRRP AGGCGCTGTACAGGGCGAACCGCCGCAAGGCAGTGCGAGAGGTGCTC GCGCTC DAPVGQPSRDHPAP CAGGGACCTGCCGATCAGTGCCAGGTGCCTAAACGTCAGGTCCAGGAG CTCCGG ATVARQRRRERVAA TACTTCGAGCGGCTGTACAGCGGCGGGGAAGACCTGGCTGGCGCCGGC GACTTTT RDALLDRAKDVATIA GTGGAAGCCGAACGCCCTGACCCCTCGAGTCCGCGTGAGGTATCTGCG ATTCGTT DLEAFAASVAAFFGE GTCCTGGGTCCGCTCGCGGAGCGAGAGGTGGACCGTCGGCTCCGGCGT GACAAT DASATGAAARARDR ATGAATAACTCTGCGCCGGGTCCCGACGGTGTATCCTATCGTGACCTCC ACTGTG SVRSREAGARRGVKG GTGGGGCGGACCGGGGAGCGCGGCTCCTCACGGCGCTCTACAACATCT ATATTTT GERPEREGAGRPGSA GCCTGCGGCTCGAGGCAGTCCCCGCGTCCTGGAAGACCTCCAACACTGT TCTGCKC PADPGASGEARGDW GTTGATACACAAGAAAGGAGACCGGGGCATGTTGGAGAACTGGCGCCC AGGCTG VREAKRLQALYRANR TCTCGCTCTGGGGGACACCGTCCCCAAACTCTTCGCCGCGCTCTTGGCCG GGGGGG RKAVREVLQGPADQ ACCGATTGACCGACTGGGCGGTCACCCGCGGGAAGCTCTGCTCCGCGC CTTGCCC CQVPKRQVQEYFERL AGAAGGGCTTCCTGCGGGACGAGGGGTGCTACGAGCACAACTTCGTCC CCCAGC YSGGEDLAGAGVEAE TGCAGGAGGTCCTGACGCACGCCAAGCGCTCTAAGCGCCAGGCGGTCG CCCTTAG RPDPSSPREVSAVLG TCGCGTGGCTGGACCTGTCCAACGCGTTTGGATCGATCCCGCACGCGAC TTTTAAT PLAEREVDRRLRRM GATCCGCCGCGCGCTTATAAGATCCGCGGTGCCACGGGGTCTCATAGCG TGCCTAT NNSAPGPDGVSYRD ATATGGGACTCCATGTACGATGGTTGCACGACGAGGGTGCGAACCGCC GCGGGG LRGADRGARLLTALY GAGGGTCACACAGCACCCATCCCCATCCGGTCGGGCGTCCGTCAGGGTT GGGGCT NICLRLEAVPASWKT GTCCGCTAAGCCCTATTATCTTCAACCTGGCCATCGACTCGGTCGTCCGT TTTGTCC SNTVLIHKKGDRGML GTGGCGGCCGAGWCGAATGACGGGTATTCCCTCCACGGAAATACCTGG CCCGCA ENWRPLALGDTVPKL TCGGCATTGGCTTACGCGGACGACATCGCACTACTGGCCCAGACGCCCG AATGTA FAALLADRLTDWAVT AGGGGATGGAGAGGATGCTAGCCTCTGTGGAGGCGGAGGCAGCGTCG TATATAT RGKLCSAQKGFLRDE GTGGGGCTGCGGTTCAACCCTGCAAAGTGTGCCACCCTGCACGTCGGTG ATATATT GCYEHNFVLQEVLTH CGGGGAATGGCGGCAGGGTCCTACCGACGTCATTCCAAATCCAGGGGG TAGCGC AKRSKRQAVVAWLD AGACGATCAACCCCCTGGCTCAGGGTGAGTCGTACACCCACCTTGGCGT GCGGCT LSNAFGSIPHATIRRA TCCAACGGGGTTCTCCGTGGACCAGACGCCCTACGCCGCCGTCGGGGA TAGCCG LIRSAVPRGLIAIWDS CATCGTCTCGGACCTGCGCGCTGTCGACCGCTCACTCCTTGCCCCGTGGC CTTTTGT MYDGCTTRVRTAEG AGAAGATAGAAATGCTGGGGACCTTCATCCTATCCAGGCTTGACTTTCT TTGTATT HTAPIPIRSGVRQGC GCTCCGGGGGGCCAGAGTGTTCAAGGGTCCCCTCACGGCCGTGGACCT ACCCCA PLSPIIFNLAIDSVVRV TAACATCCGWAGGCATGTTAAATCCTGGCTTAACCTCCCTCAGCGAGCA GAGGGG AAEXNDGYSLHGNT AGCGCGGAGGGAGTCTACATGCCGCCCCGTTGGGGGGGATGTGGACTC AATTGTC WSALAYADDIALLAQ CTGCCGCTCTCTGACCTCGCCGACGTCCTCACGGTTGCCCACGCGTACCG CCTCTG TPEGMERMLASVEA TATGTTAACGGTGCGCGATGGCGCCGTGAGGGAGTTGGCGTGGGAATC GGGAAA EAASVGLRFNPAKCA GCTGAGGGGAGTGGTTGGGCGCAGGATCGGCCACGCCCCTAGTTGCGA AAAAAT TLHVGAGNGGRVLP GGATATCGCCTCCTTCCTATCCGGCTCGCTGGATGGAAGGATGAGGGGC GATTGG TSFQIQGETINPLAQ GGAGGGGAGGCTTCGCTCTGGTCGAGTGCGCGGAACGCTGCGCTCAGA AAAAAT GESYTHLGVPTGFSV CAGTCCGAGAGGTTGTCCCTGCGTTGGCGGTGGGTCGAGGCCACGGAG AAAGTG DQTPYAAVGDIVSDL GAGATGACGTTGGAGTGTCGAGGGCCCAGGGGGGCAGCGATTAAGAT AGCTAA RAVDRSLLAPWQKIE TCCGCCTGAAGCGCGCGGTCAGGTAGTGAATCGGTTGCGCTCAGCTGTA (SEQID MLGTFILSRLDFLLRG GCAGAGCACTACGCAAGTAGGTTGCTTAGCAAGCCTGATCAGGGTAAG NO: ARVFKGPLTAVDLNI GTCTTCGAGGTGTCGTCGCGGAGCCGAGTGAGCAATCACTTTATCCGCG 1303) RRHVKSWLNLPQRA GCGGCAGCTTCACTCGCTTCGCCGACTGGCGCTTTATCCATAAGGCCCG SAEGVYMPPRWGG GTTAGATGTTCTTCCTCTCAACGGCGCACGACGTTGGGAGGCCAACGAC CGLLPLSDLADVLTVA AAGCGCTGTCGGCGATGCGGTGAGGTATCGGAGACATTACCCCATGTG HAYRMLTVRDGAVR CTCTGTCACTGCGGCATCCACTCCGCCGCGATACAGCTGAGGCACGACG ELAWESLRGVVGRRI CTGTCCTGCACCGCCTTTGGAAGGCCACTCGCCTTCCAGGGGTAGTGCG GHAPSCEDIASFLSGS GGTTAACCAGCGGGTGGAGGGCGTCAGCGACGAATTGGGGGCGCTAC LDGRMRGGGEASL GACCTGATCTCGTGGTCAGGCACGAGCCCTCCAAAAGTGTCGTCATCTG WSSARNAALRQSERL CGACGTCACGGTGCCATTCGAAAACCGCTGGACCGCTTTCGAGGACGCC SLRWRWVEATEEMT AGGGCGAGGAAAATCGCCAAATACTCGCCTCTGGCAGAGGAGCTACAG LECRGPRGAAIKIPPE CGGCGAGGGTACCGTGTCGTCGTGACGGCCTTCGTCGTCGGCGCCCTCG ARGQVVNRLRSAVA GCTCGTGGGATCCGAGGAATGAGGCGGTGTTGAGACTGCTGCGGGTTG EHYASRLLSKPDQGK GCAACCAGTATGCAGCTATGATGCGGCGCCTCATTGTCTCGGATACCAT VFEVSSRSRVSNHFIR TCGCTGGTCACGCGACATATATGTGGAGCATGTGTCCGGCACCCGCCAG GGSFTRFADWRFIHK TACCTGGCTCCTTCCCGTCCCTCTGGGGATCTCGCGACGCCGCCGAGAG ARLDVLPLNGARRW CGGTTCGTCGACGCTGGCTCGCCGAGGAGAGAAGCGCACAGGACGCG EANDKRCRRCGEVSE GCGCGTCGCGGTTCGGATAGTGTGAGTGTCGCGTAAATTATTTTGTCTT TLPHVLCHCGIHSAAI TGTCTTGGCCCCCCCTTTTTAAACCAAGCAGGAGAGAGTGGCCCAATGC QLRHDAVLHRLWKA CCAACTATTATATATTAACTATTTACTGTGATATTTATTATTTGACTGTTG TRLPGVVRVNQRVE GGCGGGCCCCTCTCTGCTGGTTTTATTTATATATATTTTTTACTCGCGTAC GVSDELGALRPDLVV TTTTTGTACTACTCTATTTTTCTTTTTATTTTAGCTATGCTATTTTTATCTCT RHEPSKSVVICDVTVP TTCTTTGTCTCTATTTTCTTTCTTTTTTTCTTTCCTTTTCTTTTCTTTCTTTTAT FENRWTAFEDARAR TCTTCTTTTATTTATCTTTTTTTCTTTTCTGTTGTGGGGCCCTGACCGTCCG KIAKYSPLAEELQRRG AGTGTGAATGCCGCGAAAAACAATATTATGTTTTATACGAGTGTGCATG YRVVVTAFVVGALGS TGCGTGATATATTTATCTATTTTATTTTATTTATTTATTATAATTTATTGCC WDPRNEAVLRLLRV GCGCGCGCTCCTCCGGGACTTTTATTCGTTGACAATACTGTGATATTTTT GNQYAAMMRRLIVS CTGCKCAGGCTGGGGGGGCTTGCCCCCCAGCCCCTTAGTTTTAATTGCCT DTIRWSRDIYVEHVS ATGCGGGGGGGGCTTTTGTCCCCCGCAAATGTATATATATATATATTTA GTRQYLAPSRPSGDL GCGCGCGGCTTAGCCGCTTTTGTTTGTATTACCCCAGAGGGGAATTGTC ATPPRAVRRRWLAEE CCTCTGGGGAAAAAAAATGATTGGAAAAATAAAGTGAGCTAA(SEQID RSAQDAARRGSDSV NO:1057) SVA(SEQIDNO: 1425) NeSL Utopia- Alligator TGCTGGAAAGACGGAGAACCGCTTCCTTTTTCCCTGCGCCTGGCCTGGT TGCTGGAA TGAACC CHHAGLRPGTPNRT 1_AMi mississip- ATTGCAGTACCTCCAGGATTAGCGCCAACTAGTCCGGCAGACTGTCGGA AGACGGA CCCCCTC RRPDQTAPLPDPRG piensis ATACAGCAATAGAAAGWGAGCTGACTAGCAGCTTGCTTTCCTTCCTCCG GAACCGCT TGCACC HPMPPNRRGSRSRP GTGCAGCATGGGTTCTCGTCAGTCMTGACGGGCTAGGGAAGGCGGTG TCCTTTTTC AGATGG EEPSRREPPXPRACQ CTGCCAGTACGTCCGAAAGAGTGCCGGTTGCGCAAGCGACCGCGCCAC CCTGCGCC ACCTTCA GLRVWSPPQQRMPT TCAGGTGAGTAGCCAAGGGTCTTACAGTTCACCGGACCCGAWAACGCG TGGCCTGG CTTCGA PWQTLWLEELSRATT AAAACCCCAACTCGGGCTAGTAGCCGAAGACCTGGGTCCCCCCCMGGT TATTGCAG GAGGAT FKAFEASVARLTEELS CAGAGTAGGCGAACGCCWGKGCTCAGAGGACGGAACGCGGAAAACAC TACCTCCA TCTTCAG AAARPGQPRGGNNR CCCCAGGTCCCAAGGACGCCCTGATCCACTGACAAGAACGCTCGAGGCA GGATTAGC CAATGG PATRRDHRLQPQRR CGCCAGGAGACCCCCAGCTAGGGTGGACCGCCGACTGCAGGTCCGGAG GCCAACTA ACGACC PRRQRYDPAAASRIQ GACCCTCCCAGGAGGGTGGACCAGCGAACCCAAGTTGGCGACGAACCC GTCCGGCA CCGCTCC KLYRANRPKAVREILE TGACGCACCCCCCACGATGTCAGGACCCCGACAGGCGGCGGTGGACCA GACTGTCG ACCCGA GPSAFCQVPRETLFN CTGACCATCGACCGACCCCCAGAGGCAGAGAGACTCTCAGAGCCCGGA GAATACAG AGAGGA YFSRVFNPPAEAAAP ACCCCGGCTGACGAGAGCCGCCTCCCGGCGGAGGACCCCGGAGCCTGA CAATAGAA CCCCCG RPATVEALTPVPPAE GGATGCCCCCCGGATGACGGCGGAGCGCCCCGAGCGACAGCGGACCCC AGWGAGC CGATGA GFEDAFTPQEVEARL TCCGGACCCCCACGGCCCCTCGGTGACGATGGCGGGCCCCGAACGACG TGACTAGC GACTCT KRTRDTAPGRDGIRY ACGACCCCCGGACCCCGGCGGTCCCGAGGACGCCCCCCCCGAGGGTCT AGCTTGCT ATATGG SLLKKRDPGCLVLSVL CCCCACGCTGGTGGAGGAGCCCCGGACCCCCCCGACACCGGACCCCCCC TTCCTTCCT ACTGAG FNRCREFRRTPTTWK ACGGACGACCCAGGCGAAGGCGTAGACATGACAGCACTCACGTTCCTC CCGGTGCA ACACTTT RAMTVLIHKKGDPTD CCCTTCCCCCTCCCGGCGAAGCTGTTCTGCCCGACCTGCCACCCGCCAAG GCATGGGT TTCTTCG PGNWRPIALCSTVAK ACAGTACAGGTCGCACGGCGACATGAACAAGCACCTACGGCGCTTCCAC TCTCGTCA AACCAC LYASCLAARITDWAV CAGCTGCGCCTAGCCTTCTACTGCGCCCTCTGCGGCACCGAGTACGAGG GTCMTGA TTCCTCC TGGAVSRSQKGFMS CCCTGAAGCTCCTGAAGAACCACCAGAAGGGATGCGAGGGCCACGGAG CGGGCTAG ACCATT TEGCYEHNFTLQMAL CCGAGAGGAGACCCGGCACGCTGGTGAGGTCCGCTGCCCCGGCCCGCC GGAAGGC GCGGAC DNARRTRKQCAVA GGACCCAGGCCGCGGTGCGAAGGCCCGCCAGACTGGCCACCCCGCCGA GGTGCTGC CATTGTA WLDISNAFGSVPHRH CAACCCCACCGGACCAGACCTCCAGGGACCACCCGACGGAGAGACCTG CAGTACGT ACGGGT IFGTLRELGLPDGVID CCCCAGTGATGCCACCACGCAGGCCTCCGCCCAGGGACCCCCAACCGGA CCGAAAGA TTGTGT LVRELYHGCTTTVRA CACGACGCCCCGACCAGACAGCCCCCCTCCCAGACCCCCGGGGCCACCC GTGCCGGT GTATCTA TDGETAEIPIRSGVRQ GATGCCCCCGAACCGCCGGGGATCCCGGAGCCGCCCGGAGGAACCGA TGCGCAAG TCTTCTT GCPLSPIIFNLAMEPL GCCGCCGGGAGCCCCCCGKCCCCCGAGCGTGCCAGGGTCTCCGGGTGT CGACCGCG TCTCTCT LRAVAGGPGGLDLY GGAGCCCTCCCCAGCAGAGGATGCCCACCCCATGGCAAACCCTCTGGCT CCACTCAG CAGCGT GQKLSVLAYADDLVL GGAGGAGCTCTCCCGGGCCACCACCTTCAAGGCCTTCGAGGCCTCGGTG GTGAGTAG CGCGAA LAPDATQLQQMLDV GCCCGGCTCACAGAGGAGCTCTCGGCGGCCGCCCGGCCCGGCCAGCCC CCAAGGGT CCCCCTC TSEAARWMGLRFNV CGGGGGGGCAACAACAGACCGGCGACGCGACGGGACCACAGACTGCA CTTACAGT CCTCCCC AKCASLHIDGRQKSR GCCGCAGAGGCGACCCAGGCGCCAGCGCTACGACCCGGCGGCAGCCTC TCACCGGA TTCCCCT VLDSTLTIQGQAMRH CCGGATCCAGAAGCTGTACCGGGCCAACCGTCCCAAGGCGGTGAGAGA CCCGAWA CCCCCTC LRDGEAYCHLGTPTG GATCCTGGAGGGACCCTCGGCCTTCTGCCAGGTCCCCCGGGAGACTCTG ACGCGAAA CCCCCCA HRAKQTPEETINGIV TTCAACTACTTCAGCAGGGTCTTCAACCCCCCAGCAGAAGCCGCCGCCC ACCCCAAC CCCCCG QDAHKLDSSLLAPW CACGCCCTGCGACCGTCGAAGCGCTGACCCCCGTCCCCCCGGCGGAGG TCGGGCTA GGCTTA QKIDAVNTFLIPRVAF GGTTCGAGGATGCCTTCACGCCGCAGGAAGTGGAAGCCCGCCTCAAGA GTAGCCGA GTTGGC VLRGSAVPKTPLKKA GGACCAGGGACACCGCCCCCGGCAGGGACGGCATCAGGTACAGTCTCC AGACCTGG TAACATT DAEIRRLLKKWLHLPL TCAAAAAGCGTGACCCGGGCTGCCTTGTTCTTTCTGTTCTCTTCAACAGG GTCCCCCC GTATCTC RASNEVLHIPYRQGG TGCAGAGAGTTCCGGCGCACGCCCACCACCTGGAAGAGGGCCATGACG CMGGTCA CTGTAA ANVPRMGDLCDIAV GTCCTCATCCACAAGAAGGGAGACCCGACCGACCCGGGCAACTGGAGA GAGTAGG CCTAGTT VTHAFRLLTCPDXTVS CCCATCGCCCTGTGCTCCACCGTCGCCAAGCTGTACGCCAGCTGCCTGG CGAACGCC GCGTTC IIAASALEETARKRIGR CGGCCCGCATCACCGACTGGGCGGTGACCGGCGGGGCCGTCAGCCGGA WGKGCTC CCCTCCT QPTRRDLATFLSGSLE GCCAGAAGGGCTTCATGTCGACGGAGGGCTGCTACGAACACAACTTCA AGAGGAC CACCCCC GEFSRDGGDFASLW CCCTCCAGATGGCCCTGGACAATGCCCGGAGGACCAGGAAGCAGTGCG GGAACGC ATCCCTC SRARNATRRLGKRIG CGGTGGCGTGGCTGGACATCTCCAATGCCTTCGGCTCCGTGCCCCACCG GGAAAAC TATTGTT CAWTWTEERRELGV CCACATCTTCGGCACCCTCCGCGAGCTGGGCCTACCGGACGGCGTCATC ACCCCCAG AGTCCCT SLQPAPHADRVTVTP GACCTGGTGCGAGAGCTCTACCACGGCTGCACCACGACCGTCCGCGCCA GTCCCAAG CGCTCG RTRTFLERFLKDAVR CCGACGGAGAGACCGCGGAGATCCCCATCCGGTCGGGGGTGAGACAG GACGCCCT GGCGCT NKYAGDLRAKPDQG GGCTGCCCCCTCAGCCCCATCATCTTCAACCTGGCCATGGAACCGCTCCT GATCCACT CTGTATT KVFDVTSKWDSSNH TCGAGCCGTGGCAGGCGGCCCCGGCGGGCTCGACCTGTACGGCCAGAA GACAAGA TCCCTAC FMPSGSFTRFADWR GTTGAGCGTCCTGGCCTACGCCGACGACCTCGTTCTCCTCGCCCCCGAC ACGCTCGA CGGCTT FLHRARLNCLPLNGA GCCACCCAGCTGCAGCAGATGCTGGACGTGACGTCCGAGGCGGCCAGG GGCACGCC TGTCATC VRFGHRDKRCRRCG TGGATGGGCCTGCGCTTCAACGTCGCCAAGTGCGCCTCCCTGCACATCG AGGAGAC TTTTTTG YVAETLPHVLCSCKP ACGGSAGGCAGAAGAGCCGCGTCCTGGACTCCACCCTCACGATCCAGG CCCCAGCT GATTCA HARAWQLCHNAVQ GCCAGGCGATGAGGCACCTGCGCGACGGCGAGGCCTACTGCCACCTGG AGGGTGG CAATCCT DRLVRAIPAAAGEISV GGACGCCCACCGGCCACCGGGCCAAGCAGACGCCGGAGGAGACCATCA ACCGCCGA AAACAT NRTVPGCESQMRPD ACGGGATCGTGCAGGACGCCCACAAGCTGGACTCGTCCCTGCTGGCCCC CTGCAGGT CTACTAA IVITNEEAKKVVIVDV CTGGCAGAAGATAGACGCGGTGAACACCTTCCTCATCCCCCGCGTCGCG CCGGAGG TAAAAG TIPFENRRQAFTDAR TTCGTCCTGAGAGGCTCGGCGGTCCCCAAGACCCCCCTCAAGAAGGCG ACCCTCCC TCAATC ARKREKYAPLADILRG GACGCCGAGATCCGGCGGCTGCTCAAGAAGTGGCTGCACCTGCCGCTG AGGAGGG (SEQID RGYDVTVDALIVGTL AGGGCCAGCAATGAGGTCCTGCACATCCCCTACCGGCAGGGAGGTGCC TGGACCAG NO: GAWDPSNESVLHAC AACGTCCCCCGCATGGGAGACCTCTGCGACATCGCGGTGGTCACCCACG CGAACCCA 1304) RVSRRYAKLMRCLM CCTTCCGCCTCCTGACCTGCCCGGACSCGACGGTAAGTATCATCGCCGCC AGTTGGCG VSDTIRWSRDIYVEHI AGCGCCCTCGAGGAGACCGCCCGCAAGAGGATCGGGAGGCAGCCCACC ACGAACCC TGHRQYTDPTRRTAA AGACGTGACTTGGCCACCTTCCTCAGCGGCTCGCTGGAGGGCGAGTTCA TGACGCAC GPDPEGTA(SEQID GCAGAGACGGCGGGGACTTTGCCTCGCTGTGGAGCCGAGCCCGCAACG CCCCCACG NO:1426) CCACGCGCCGCCTCGGGAAGCGCATCGGCTGCGCCTGGACCTGGACCG ATGTCAGG AGGAGCGCCGGGAGCTGGGAGTCTCCCTGCAACCAGCCCCGCACGCCG ACCCCGAC ACCGCGTCACCGTGACGCCCCGCACGAGGACCTTCCTGGAGAGGTTCCT AGGCGGC GAAGGACGCCGTCCGAAACAAGTACGCCGGCGACCTGAGGGCCAAACC GGTGGACC CGACCAGGGCAAGGTCTTCGACGTCACCTCGAAGTGGGACTCCAGCAA ACTGACCA CCACTTCATGCCCAGCGGGAGCTTCACGCGCTTCGCGGACTGGCGCTTC TCGACCGA CTCCACCGCGCCCGCCTCAACTGCCTGCCTCTGAACGGGGCCGTCCGCTT CCCCCAGA CGGCCACCGGGACAAGAGGTGCCGACGGTGCGGCTACGTGGCAGAGA GGCAGAG CCCTCCCCCACGTGCTGTGCAGCTGCAAGCCGCACGCCAGAGCCTGGCA AGACTCTC GCTCTGCCACAACGCTGTCCAGGACCGCCTGGTGAGGGCCATCCCGGCC AGAGCCCG GCAGCGGGGGAGATCTCCGTGAACCGCACCGTCCCGGGCTGCGAGAGC GAACCCCG CAGATGCGCCCCGACATCGTCATCACCAACGAGGAGGCCAAGAAGGTC GCTGACGA GTGATCGTGGACGTGACCATCCCCTTCGAGAACCGGCGCCAAGCCTTCA GAGCCGCC CCGACGCCCGGGCTCGCAAGCGGGAGAAGTACGCCCCGCTGGCCGACA TCCCGGCG TCCTGAGGGGCCGCGGCTACGACGTGACGGTCGACGCGCTCATCGTGG GAGGACCC GAACGCTCGGAGCCTGGGACCCCAGCAACGAGAGCGTCCTGCATGCCT CGGAGCCT GCCGCGTCTCCCGCCGCTACGCCAAGCTGATGCGCTGCCTCATGGTGTC GAGGATG CGACACCATCCGTTGGTCCCGTGACATCTACGTGGAGCACATCACGGGC CCCCCCGG CACCGCCAGTACACCGACCCCACCAGACGAACCGCCGCCGGACCGGAC ATGACGGC CCAGAGGGGACCGCCTGAACCCCCCCTCTGCACCAGATGGACCTTCACT GGAGCGC TCGAGAGGATTCTTCAGCAATGGACGACCCCGCTCCACCCGAAGAGGA CCCGAGCG CCCCCGCGATGAGACTCTATATGGACTGAGACACTTTTTCTTCGAACCAC ACAGCGG TTCCTCCACCATTGCGGACCATTGTAACGGGTTTGTGTGTATCTATCTTCT ACCCCTCC TTCTCTCTCAGCGTCGCGAACCCCCTCCCTCCCCTTCCCCTCCCCCTCCCC GGACCCCC CCCACCCCCGGGCTTAGTTGGCTAACATTGTATCTCCTGTAACCTAGTTG ACGGCCCC CGTTCCCCTCCTCACCCCCATCCCTCTATTGTTAGTCCCTCGCTCGGGCGC TCGGTGAC TCTGTATTTCCCTACCGGCTTTGTCATCTTTTTTGGATTCACAATCCTAAA GATGGCG CATCTACTAATAAAAGTCAATC(SEQIDNO:1058) GGCCCCGA ACGACGAC GACCCCCG GACCCCGG CGGTCCCG AGGACGCC CCCCCCGA GGGTCTCC CCACGCTG GTGGAGG AGCCCCGG ACCCCCCC GACACCGG ACCCCCCC ACGGACG ACCCAGGC GAAGGCG TAGACATG ACAGCACT CACGTTCC TCCCCTTC CCCCTCCC GGCGAAG CTGTTCTG CCCGACCT GCCACCCG CCAAGACA GTACAGGT CGCACGGC GACATGAA CAAGCACC TACGGCGC TTCCACCA GCTGCGCC TAGCCTTC TACTGCGC CCTCTGCG GCACCGAG TACGAGGC CCTGAAGC TCCTGAAG AACCACCA GAAGGGA TGCGAGG GCCACGGA GCCGAGA GGAGACCC GGCACGCT GGTGAGG TCCGCTGC CCCGGCCC GCCGGACC CAGGCCGC GGTGCGA AGGCCCGC CAGACTGG CCACCCCG CCGACAAC CCCACCGG ACCAGACC TCCAGGGA CCACCCGA CGGAGAG ACCTGCCC CAGTGA (SEQID NO:1181) NeSL Utopia- Chelonia CTCTTCTTATGAATACTTGCAACACCTGCACTGAAGATGGATTCTCCGGC CTCTTCTTA TGAGCC MTTKKVLGASTTLQT 1_CMy mydas TGCTATTTTTGAAAAACTGATGCTGCTTTGAAGGTGTATTCTGCTGCTGC TGAATACT GGTACG SSTKGKNSGCSKDPL TACCTTGGAAGGAAATTCTCTCTCTGCTCCTGAGACATCCCCAGCTGCAC TGCAACAC ACATCG RDAVPGRSWILRPAC CGTGTACCACCACCACCACTGCTGCTGCTCCACAGAAGGTTTCTCGGAC CTGCACTG TGCATC RDITTRRNIPPAPQQ AATGACTACAAAGAAGGTCCTAGGTGCCTCCACAACATTACAGACCAGC AAGATGG AACTAT QQPPMESPPTLQLQ AGCACGAAGGGGAAGAACAGTGGCTGCTCAAAGGACCCCCTCCGAGAT ATTCTCCG GAGAAA DALRRPSPTPAAAQV GCTGTTCCAGGAAGATCCTGGATTCTGAGGCCGGCCTGTCGGGACATCA GCTGCTAT GGGACT ADAGGALAALHTIKR CAACCAGAAGGAACATCCCCCCCGCCCCCCAGCAGCAGCAGCCGCCAAT TTTTGAAA GAGAGA GISVDWTSISPKXXQ GGAGAGCCCCCCCACTCTGCAGCTGCAAGATGCTCTCAGGCGACCATCT AACTGATG CTTTTTC RXTSASPDACPASET CCCACCCCCGCAGCTGCCCAGGTCGCTGACGCTGGTGGTGCTCTCGCTG CTGCTTTG CATTGG TQRDXRXLLDARPAG CTCTACACACCATCAAGAGAGGAATCTCCGTAGACTGGACCAGCATCTC AAGGTGTA ACCATAT PLDPTRPHQDEPASD TCCAAAGASCMCCCAGAGGSTCACCAGCGCCTCGCCGGATGCCTGCCCA TTCTGCTG GAACTG TADAAGTPLLQGNE GCCTCAGAAACCACCCAGAGGGACCSCAGGWSCCTGCTGGACGCCCGC CTGCTACC GAACCA DTIYLQYPLAADMLIC CCAGCCGGACCTCTCGACCCTACCCGCCCACACCAGGATGAACCAGCCA TTGGAAGG TAAACTC PICSPPQSFHLLGVVT GCGATACCGCTGATGCTGCTGGAACCCCCCTGCTGCAAGGTAATGAGG AAATTCTC ACTGAA RHLKRCHSKRVAFSC ACACCATCTACCTGCAGTATCCCCTCGCTGCGGACATGCTCATCTGCCCC TCTCTGCT CATTAA ALCSLPFETQKQCKM ATCTGCTCTCCGCCCCAAAGCTTCCACCTCCTCGGTGTCGTCACCAGGCA CCTGAGAC ATCTCAC HQVACRKCLKGTTQS CCTGAAGAGATGCCACAGCAAGCGGGTTGCCTTCAGCTGTGCCCTCTGC ATCCCCAG CAAATG PAPAPSPPAARRPAA AGCCTGCCCTTCGAGACGCAGAAGCAATGCAAGATGCACCAAGTCGCCT CTGCACCG AGGGTA PEPQRRKXTSQAAVK GCAGGAAATGCCTCAAGGGAACMACACAGTCTCCTGCCCCGGCTCCCA TGTACCAC AATCCAT KPAPVARPAERDAAI GCCCTCCTGCTGCACGCCGGCCCGCTGCTCCTGAGCCTCAACGAAGAAA CACCACCA CCTCATC EKVPAASGNITQVLA GSCGACCTCGCAAGCTGCCGTCAAGAAGCCTGCCCCCGTCGCCAGGCCA CTGCTGCT ATCGTAT SRRPVSPSHVAKXIS GCGGAACGGGATGCTGCGATCGAGAAGGTACCTGCTGCCTCGGGGAAC GCTCCACA CCACTCA MLRRLSAASPPVQH ATCACCCAGGTCCTCGCCAGCAGGAGGCCCGTCTCACCCTCTCATGTCG GAAGGTTT TTATACT VPVPRRISAPPRIAAR CCAAGMAGATCTCCATGCTGAGACGACTCAGTGCTGCCTCGCCACCTGT CTCGGACA CCACAC DPVAGRASAAPQTA CCAGCACGTCCCCGTCCCCAGAAGGATCAGCGCCCCACCGCGCATAGCT (SEQID CTGAAC LRTPAAGGASTTPQT GCTCGAGATCCTGTCGCCGGAAGAGCCAGCGCCGCCCCTCAGACCGCCC NO:1182) ATAGCC ALRTPTAGGASAMP TGCGAACTCCAGCCGCCGGAGGAGCCAGCACCACGCCTCAGACCGCCC ATTATAT QTTLPXPRRPDWRN TGCGAACTCCAACCGCCGGAGGAGCCAGTGCCATGCCTCAGACCACCCT GAACAA QPRSHSKAPGLHRQT GCCAGMCCCCAGACGTCCAGACTGGAGGAACCAGCCCCGCAGCCACAG CATACCC DQHGPQVHSAGHCL CAAAGCACCGGGCCTTCATCGCCAGACGGACCAGCACGGCCCCCAAGTC CCATATC REISRSSSNRLGSSHS CATTCTGCGGGACACTGCCTACGGGAGATCTCACGCTCCAGCAGCAACC TCAATGT AAATHRRTGGVPAT GCCTAGGCAGCAGCCACTCATAGAAGGACCGGCGGTGTCCCAGCAACC CTGTACT PEPDRVSPTTSNAXIP CCCGAGCCGGACCGCGTCTCTCCGACCACCAGCAACGCCASCATCCCGC TTGACCC PEIPPQHPTEGNPDP CAGAGATCCCGCCCCAGCACCCAACCGAAGGGAATCCTGACCCACGAG GTTAAC RDRRQADHTAGSEP ATAGACGGCAGGCCGACCATACAGCAGGCTCWGAGCCTGCACCAGAC CTTTTAC APDEVEDXEGQRPM GAGGTCGAGGACCMTGAGGGCCAGCGGCCGATGGTGAGGGCTGCCAC CCCCAAT VRAATPWQTAWTEE NCCGTGGCAGACTGCCTGGACCGAGGAGCTACAAGCGGCAGCTTCCTT CGGGGA LQAAASFDDFDLLVD TGACGACTTCGACCTCCTCGTAGACAGGCTCACCCGAGAACTGTCTGCG TATTGCA RLTRELSAEIAPRRSS GAAATCGCTCCCAGGAGGAGTTCGAACCAGGAGAACGCCCCGCCTGCC GATTAT NQENAPPAHRTPAP CACAGAACGCCTGCTCCGAACCACAACACCACCACCAGGGGAGCCAGA GTATTCC NHNTTTRGARSRDA AGTAGAGACGCCAGCCGCCGCTACGATCCAGCAGCGGCTTCAAGGATC TTACGCC SRRYDPAAASRIQKLY CAAAAGCTGTACCGGGCAAACCGCTCCAAGGCCATGAGGGAGATCCTA ACCCGA RANRSKAMREILDGP GACGGGCCCTCGCCCTACTGCACGATCCCATCTGAGCGTCTCTACAGCT TCCTAAA SPYCTIPSERLYSYFKD ACTTCAAGGATGTATTCGACCGCATAGCCCGGAATGACGCGCAGCGCCC CCGAAT VFDRIARNDAQRPEC AGAGTGCCTCCGCCCCCTGCCCCGTGTCGACGAAGCAGGTGTCCTGGAA TTCGCAC LRPLPRVDEAGVLET ACTGACTWTACGCCCAAGGAAGTGATGGCCAGACTCTCAAAAACAAAA CCCTTGA DXTPKEVMARLSKTK AACACAGCTCCTGGGAAAGACGGCATCCCCTACAGCCTCCTGAAAAAGC TAATCTG NTAPGKDGIPYSLLKK GAGATCCCGGCTGCCTGGTCCTCGCCACGCTCTTCAACCAGTGCAAGCG TACCTTA RDPGCLVLATLFNQC ATTCTGCCGGACTCCCAGCTCCTGGAAGAAGGCCATGACGGTACTGGTG TTCCCTG KRFCRTPSSWKKAM TACAAGAAGGGCGAGCGGGATGACCCCAGCAACTGGAGGCCCATCTCC ATAACC TVLVYKKGERDDPSN CTCTGCTCCACGATGTACAAGCTCTATGCCAGCTGCCTGGCGTCGAGGA AGAAAC WRPISLCSTMYKLYA TCACGGAGTGGTCGGTGAGCGGGGGAGCCATCAGCTCCATCCAGAAAG TTCTATG SCLASRITEWSVSGG GCTTCATGTCCTGCGAGGGCTGCTACGAACACAACTTCGTCCTCCAAACC CTTAAAC AISSIQKGFMSCEGCY ACCATCGAAACGGCCAGAAGGGCGCGGAGGCAGTGCGCGGTAGCGTG TCTGTAC EHNFVLQTTIETARR GCTCGACCTGGCTAACGCCTTTGGGTCCATGCCCCACCACCACATCTTTG CGTTTTT ARRQCAVAWLDLAN CCACGCTCCAGGAGTTTGGGATGCCAGAGAACTTCCTTCGTGTGATCCG TTTTATT AFGSMPHHHIFATLQ AGAGGTGTACGAGGGATGCAGCACCACCATTCGCTCGGTCGAAGGGGA TCAACAT EFGMPENFLRVIREV GACCGCCGAGATCCCGATCCGGAGCGGAGTTAAGCAGGGCTGTCCCCT CATCTTA YEGCSTTIRSVEGETA CAGCCCCATCATCTTTAACCTCGCCATGGAGCCGTTGCTGCGAGCGATCT ATAAAA EIPIRSGVKQGCPLSPI CCAATGGCACAGATGGCTTCAACCTCCACGGTGAGAGGGTGAGCGTCC TTATTAA IFNLAMEPLLRAISNG TGGCTTACGCGGATGACCTGGTCCTGACCGCGGATGACCCAGAGAGCC A(SEQ TDGFNLHGERVSVLA TCCAAGGTATGCTAGATGCCACCAGTCGAGCTGCCGACTGGATGGGGC IDNO: YADDLVLTADDPESL TCCGCTTCAATGCAAAGAAGTGCGCAACTCTCCACATCGACGGCAGCAA 1305) QGMLDATSRAADW AAGGGACTCGGTGCAGACGACGGGGTTCCAGATCCAGGGCGAGCCCGT MGLRFNAKKCATLHI CATCCCCCTGGCAGAGGGGCAGGCGTACCAGCACCTCGGCACGCCGAC DGSKRDSVQTTGFQI GGGTTTCCGTGTCCGGCAGACACCCGAGGACACCATCCAGGAGATCTTG QGEPVIPLAEGQAYQ CAGGATGCCGCCAAGATCGACGCCTCCCTGCTAGCACCGTGGCAGAAG HLGTPTGFRVRQTPE ATAAACGCCCTGAACACCTTCCTGATCCCCCGCATCTCGTTCGTCCTAAG DTIQEILQDAAKIDAS GGGATCCGCCGTGGCGAAGGTACCCCTCAACAAGGCAGACAAGATCGT LLAPWQKINALNTFLI CCGGCAGCTGGTGAAGAAGTGGCTGTTCCTTCCCCAGAGAGCCAGCAA PRISFVLRGSAVAKVP CGAGCTGGTCTACATCGCCCACAGGCATGGCGGTGCCAACGTCCCCCGC LNKADKIVRQLVKKW ATGGGCGACCTGTGTGACATCGCGGTGATCACCCACGCCTTCCGCCTGC LFLPQRASNELVYIAH TGACGTGTCCCGACGCCATGGTAAGGAACATCGCGGCAAACGCCCTCCA RHGGANVPRMGDL TGACGCGACAAAGAAGCGGATCGGCAGAGCCCCCTCCAACCAAGACAT CDIAVITHAFRLLTCP CGCCACCTTCCTGAGCGGTTCCCTGGATGGCGAATTCGGACGGGACGG DAMVRNIAANALHD GCGCGACATCGCTTCACTGTGGTCCCGCGCTCGCAACGCCACGCGTCGC ATKKRIGRAPSNQDI CTGGGGAAGCGCATCGGCTGCCGCTGGGAGTGGTGCGAGGAGCGCCA ATFLSGSLDGEFGRD GGAGCTGGGAGTCCTGGTGCCGCAGATCAGGTCCAACGACAACACCAT GRDIASLWSRARNAT CGTCACCCCGAGCGCCAGGGGCATGCTGGAGAGGACCCTGAAGGCAGC RRLGKRIGCRWEWC CATCCACTCACTGTACGTGGAAACCCTGAAGCGTAAACCGGACCAGGGT EERQELGVLVPQIRS AAAGCCTTCGAACTGACCAGCAAGTGGGACGCCAGCCAACCACTTCCTC NDNTIVTPSARGMLE GCCGGGGGCGGCTTCACCCGTTTCGCCGACTGGCGGTTCATCCACCGTG RTLKAAIHSLYVETLK CCCGGCTCAACTGCGTCCCGCTCAACGGAGCCGTCCGCCACGGGAACCG RKPDQGKAFELTSK AGACAAGCGTTGCAGGAAGTGCGGCTACTCCAACGAGACCCTGCCCCA WDASQPLPRRGRLH CGTCCTGTGCAGCTGCAAGCCCCACTCCAGAGCCTGGCAGCTGCGCCAC PFRRLAVHPPCPAQL AATGCCATCCAGAACCGCCTGGTGAAAGCCATCGCACCGCGCCTGGGG RPAQRSRPPREPRQA GAGGTCGCCGTGAACTGCGCCATCCCCGGTACTGACAGCCAGTTGCGAC LQEVRLLQRDPAPRP CTGACGTGGTAGTCACCGACGAGGCCCAGAAAAAGATCATCCTCGTCG VQLQAPLQSLAAAP ACGTCACGGTCTCCTTTGAGAACAGGACCCCGGCCTTCCGCGAAGCCCG QCHPEPPGESHRTAP AGCTCGTAAGCTGGAAAAATACGCCCCCCTGGCCGACACCCTGAGAGC GGGRRELRHPRYPAS GAAGGGCTACGAGGTGCAGATGGATGCCCTGATCGTCGGAGCCCTGGG GTPANHFLAGGGFT CGCTTGGGACCCCTGCAATGAGCGTGTGCTGCGGACCTGTGGGATCGG RFADWRFIHRARLNC TCGACGCTACGCACGGCTCATGCGGCGCCTCATGGTCTCGGACACCATC VPLNGAVRHGNRDK CGATGGTCCAGGGACATCTACATCGAACACATCACCGGCCACCGACAGT RCRKCGYSNETLPHV ACCAGGAGGTGTGAGCCGGTACGACATCGTGCATCAACTATGAGAAAG LCSCKPHSRAWQLR GGACTGAGAGACTTTTTCCATTGGACCATATGAACTGGAACCATAAACT HNAIQNRLVKAIAPR CACTGAACATTAAATCTCACCAAATGAGGGTAAATCCATCCTCATCATCG LGEVAVNCAIPGTDS TATCCACTCATTATACTCCACACCTGAACATAGCCATTATATGAACAACA QLRPDVVVTDEAQK TACCCCCATATCTCAATGTCTGTACTTTGACCCGTTAACCTTTTACCCCCA KIILVDVTVSFENRTP ATCGGGGATATTGCAGATTATGTATTCCTTACGCCACCCGATCCTAAACC AFREARARKLEKYAPL GAATTTCGCACCCCTTGATAATCTGTACCTTATTCCCTGATAACCAGAAA ADTLRAKGYEVQMD CTTCTATGCTTAAACTCTGTACCGTTTTTTTTTATTTCAACATCATCTTAAT ALIVGALGAWDPCN AAAATTATTAAA(SEQIDNO:1059) ERVLRTCGIGRRYARL MRRLMVSDTIRWSR DIYIEHITGHRQYQEV (SEQIDNO:1427) NeSL Utopia- Chrysemys TTTTTTCTGATGCTTGACTGCAAACACCCATCCAGAAGATGGAATCTCCT TTTTTTCTG TGAGCC MTQDQDADCCPAG 1_CPB pictabellii GCAGCCATTTTTGAAAAAATTGATGCTGCTTTAAAGATATACTCCATTCT ATGCTTGA AGAGTG KDATRGAPPMTQDQ CCTWKTTTGKAAGAAAACTCTTTTTCAGCTTCAGCTATTCTGTCATCGGC CTGCAAAC ACATCG DADRCPAAPERDAP TGCTGCTGTTCCTGCTTCCCAGAAAGCTCAGCMAAAACCTATCCTGAAG ACCCATCC TTCTCCC EGTTSSTPDPKTTYHP ACCWCCCTTGGTGCCTCACGGAAGACCCGGASCACCTGCAAGAACCAA AGAAGAT ACTACG AVRRRAARRGMHLR AACATTAGGAGCTGGCTGAAGAAACCCCCCGTGGATACCTCWGCAGGG GGAATCTC AGAAAG AQDLDAARCPSGQR AGACCTGGSTCCAGMAGGACAKCTCTTCGGGACCTCMCATCSAGGAGC CTGCAGCC GGACCA DNVASESSAPPRATS AAGAATATCTCAACAGCTCTTCAGGAGGGGGACCCCCGGAGAACCCTG ATTTTTGA AGTGAC PPQASLPDPEESPGE CCCGCTTCCCAGAACCAGGATGCTGATCGCCGCCCCACCGGGAAGGAT AAAAATTG CTTCTCC SAGTTEIRPTEGEAGE GCCACCGCAGGAGCCCCCCCAATGACCCAGGACCAGGATGCTGATTGCT ATGCTGCT GTTGGA EDRIYLQYPLPTGLLL GCCCCGCCGGGAAGGATGCCACCAGAGGAGCCCCCCCGATGACCCAGG TTAAAGAT TCATATG CPFCLPVHGVQTLAA ACCAGGATGCTGATCGCTGCCCCGCTGCTCCAGAGAGGGATGCTCCGG ATACTCCA AACTGG LSKHVRKTYNKRIAFR AAGGAACCACCTCCTCAACCCCAGACCCCAAAACTACTTACCACCCGGCT TTCTCCTW AACCAT CSRCDLPFETQKKCKF GTCCGGAGGAGGGCCGCTCGAAGGGGAATGCACCTCAGAGCCCAAGA KTTTGKAA AAACTC HQATCRGPPTTAKV TCTCGATGCCGCACGCTGCCCTTCCGGGCAAAGAGACAACGTGGCCAGT GAAAACTC CCTGAA NPTDILRVPTLTPTDD GAGTCCAGCGCCCCCCCAAGAGCGACTTCACCTCCCCAAGCTTCTCTACC TTTTTCAG CATTAA LASAPQPASPESQQI AGACCCAGAGGAATCACCTGGCGAGTCTGCAGGCACAACAGAGATCCG CTTCAGCT ATCTCAC RGDQPPTEGSVTPAS CCCCACGGAGGGTGAGGCAGGGGAAGAAGACCGCATCTACCTCCAGTA ATTCTGTC CAAATG RTDDATKRTSPVSRIP CCCGCTCCCTACAGGTCTCCTCCTCTGCCCCTTCTGTCTCCCCGTCCATGG ATCGGCTG AGGGTC TLDPAVRGTTATSQV AGTCCAGACCCTCGCGGCCCTCAGCAAACACGTTCGTAAGACCTACAAC CTGCTGTT AATCCAT NNLTRRLSDLIKTIRH AAACGGATTGCTTTCCGGTGTAGCCGCTGCGATCTCCCCTTCGAGACCC CCTGCTTC CCTCATC NTDTRRCSAPPQVTS AAAAGAAATGTAAGTTTCATCAAGCCACGTGCAGGGGACCCCCCACGAC CCAGAAAG ATCATAT CRPAVGATSIVPQAA CGCGAAAGTGAATCCCACTGACATCCTCCGGGTTCCAACCCTGACCCCC CTCAGCM CCACTCA RRDPANGGASRSPQI ACCGATGATCTGGCTTCAGCACCCCAGCCAGCATCCCCAGAGTCACAGC AAAACCTA TTATAM PQPDPAPGRPNTSSK AGATAAGGGGGGACCAACCGCCAACTGAGGGAAGCGTAACCCCCGCCT TCCTGAAG TCCACAC VTQRASDRQKPHAP CGAGGACTGACGATGCCACCAAAAGGACCAGTCCCGTCTCCAGAATCCC ACCWCCCT CCGAAC PRTHQPDAARRRTRT CACGCTGGACCCTGCTGTGAGGGGGACCACCGCCACCTCTCAGGTCAAC TGGTGCCT ACAGCC IPSASKHDRAPTKPST AACCTCACCAGACGCCTCAGCGACCTCATAAAAACCATCCGGCACAACA CACGGAA ACTCTAT GASRTPLPPGRSSAA CGGACACGAGACGCTGCAGCGCTCCCCCACAGGTAACCTCATGCCGCCC GACCCGGA GAACTT SETPRAALPTTPGPPP TGCCGTAGGAGCAACTAGCATCGTCCCCCAGGCTGCACGGCGAGATCC SCACCTGC CATACCC QDPPEHRSTVRGTTR AGCCAACGGAGGAGCCTCCCGTAGCCCCCAGATCCCACAGCCAGACCCC AAGAACCA TCATATC PQTVPAAPEPAETTQ GCCCCCGGGAGACCCAACACCTCCTCCAAGGTTACCCAAAGAGCCTCTG AAACATTA TCAATGT QEERRPRARVATPW ACCGCCAAAAACCCCATGCCCCACCGAGGACCCACCAGCCGGATGCCGC GGAGCTG CTGTACT QSAWMEELAKAEDF CCGCAGAAGAACCAGAACCATCCCCAGCGCTTCCAAACACGACCGCGCC GCTGAAGA TTGACCC ENFDTLMDRLTAELS CCGACAAAGCCCAGCACCGGTGCTTCCAGAACCCCACTCCCTCCCGGAA AACCCCCC ATCAAC AEITARRREPQEAAR GATCCAGTGCTGCCTCGGAGACACCGAGAGCTGCCCTCCCCACCACACC GTGGATAC CTTTTAC ATRRFPAPSRNNTAR AGGACCCCCGCCTCAAGACCCACCTGAACACCGCTCCACAGTCCGAGGG CTCWGCA CCCCAAT EGRRGDVGRRYDPA ACAACAAGGCCGCAAACCGTCCCCGCAGCACCTGAACCTGCAGAGACA GGGAGAC CGGGGA AASRIQKLYRMNRTK ACGCAGCAGGAGGAGCGACGGCCACGAGCGAGGGTCGCCACGCCGTG CTGGSTCC TATTGCA AMREILDGTSSYCAI GCAATCCGCCTGGATGGAGGAGCTGGCAAAGGCTGAGGACTTTGAGAA AGMAGGA GATTAT QPERLYSYFKDVFDH CTTCGACACCCTGATGGACAGACTGACTGCAGAACTGTCTGCGGAAATT CAKCTCTT GTATTCC EAQTNLRRPECLSPLP ACGGCCAGAAGGAGGGAACCCCAGGAAGCCGCACGGGCCACTCGCAG CGGGACCT TCATGCC RIDLTEDLERDFSPQE ATTCCCTGCGCCGAGCCGTAACAACACCGCCAGAGAAGGCAGGAGAGG CMCATCSA ACCTGA VQARLSRTKNTAPGK GGACGTCGGCCGCCGCTACGATCCGGCGGCTGCATCCCGTATTCAGAAA GGAGCAA TCTTAAA DGIRYPLLKKRDPGCL CTATACAGGATGAACCGGACGAAAGCCATGAGGGAGATCCTCGACGGG GAATATCT CCAAAC VLAAIFNKCKQFHRV ACCTCCTCCTACTGTGCCATCCAGCCCGAGAGGCTCTACTCCTACTTCAA CAACAGCT TTTGCAC PRSWKKSMTVLIHKK GGATGTGTTTGATCACGAGGCCCAGACCAACTTGCGACGCCCAGAGTG CTTCAGGA CCTCGAT GXRDDPGNWRPISL CCTTTCCCCGCTACCCCGGATCGACCTCACGGAGGACTTGGAGCGAGAT GGGGGAC AATCTGT CSTIYKLYASCLAARIT TTTTCCCCGCAGGAGGTGCAGGCGAGGCTGTCGAGGACCAAAAACACC CCCCGGAG ATGTTAT DWSVCGGAVSSVQK GCCCCTGGAAAAGATGGCATCCGCTACCCCCTGCTGAAGAAGCGAGAC AACCCTGC TCCCTGA GFMSCEGCYEHNFLL CCCGGCTGCTTGGTGCTCGCTGCCATCTTCAACAAATGCAAGCAGTTCCA CCGCTTCC TAACCA QTAIQEARRSKRQCA TCGCGTTCCCCGCTCCTGGAAAAAGTCCATGACCGTGCTCATCCACAAA CAGAACCA GAAACT VAWLDLTNAFGSIPH AAAGGCGAMCGAGACGACCCCGGCAACTGGAGGCCCATCTCCCTCTGC GGATGCTG TCTATGC HHIFATLGEFGMPET TCCACCATCTACAAGCTGTATGCCAGCTGCCTCGCGGCAAGGATCACAG ATCGCCGC TCAAACT FIQILRDLYKDCTTTIR ACTGGTCAGTGTGCGGGGGCGCCGTCAGCTCAGTGCAGAAGGGTTTCA CCCACCGG CTGTTCA ATDGETDAIPIRRGV TGTCCTGCGAGGGATGCTACGAGCACAACTTCCTCCTTCAGACGGCCAT GAAGGAT CTATTTT KQGCPLSPIIFNLAME CCAGGAGGCCAGGAGGTCCAAGAGGCAGTGCGCAGTAGCATGGCTTG GCCACCGC TTTTAAC PLIRAISSGPTGFDLH ACCTGACCAACGCCTTTGGGTCCATACCCCACCATCACATCTTTGCCACC AGGAGCCC ATCATCT GKKLSILAYADDLVLT CTGGGAGAGTTCGGGATGCCAGAAACCTTCATCCAGATCCTCCGGGACC CCCCA TAATAA ADDPESLQGMLDAT TCTACAAGGACTGCACCACCACCATCCGCGCCACGGACGGAGAGACGG (SEQID AATTTTT SRATDWMGLRFNAK ACGCCATCCCCATCCGCCGCGGCGTGAAACAAGGATGCCCCCTTAGCCC NO:1183) AAATCT KCATLHIDGSKRDSV CATCATCTTCAACCTGGCCATGGAACCGCTCATCCGAGCCATCTCCAGCG GTT(SEQ QTTGFQIQGEPVIPL GCCCGACCGGCTTCGACCTGCACGGCAAGAAACTCAGCATTCTGGCCTA IDNO: AEGQAYQHLGTPTG CGCGGACGATCTGGTCCTGACCGCGGATGACCCAGAGAGCCTCCAAGG 1306) FRVRQTPEDTIQEILQ TATGCTAGATGCCACCAGCCGAGCTACTGACTGGATGGGGCTCCGCTTC DAAKIDASLLAPWQK AATGCGAAGAAGTGCGCAACTCTGCACATTGACGGCAGCAAAAGGGAC INALNTFLIPRISFTLR TCGGTGCAGACAACGGGGTTCCAGATCCAGGGTGAGCCCGTCATCCCCC GSAVAKVPLNKADKII TGGCAGAGGGGCAGGCATACCAGCACCTGGGCACGCCAACAGGGTTCC RKLVKKWLFLPQRAS GTGTCCGGCAGACACCCGAGGACACCATCCAGGAGATCTTGCAGGACG NELVYIAHRHGGANV CCGCCAAGATTGATGCCTCCCTGCTGGCACCGTGGCAGAAGATAAACGC PRMGDLCDVAVITH CCTGAACACCTTCCTGATCCCACGCATCTCGTTCACCCTAAGGGGATCCG AFRLLTCPDATVRNIA CCGTGGCGAAGGTGCCCCTCAACAAGGCAGACAAGATCATCCGGAAGC ANALRDATEKRIGRA TGGTGAAGAAGTGGCTGTTCCTTCCCCAGAGAGCCAGCAACGAGCTGG PSNQDIATFLSGSLD TCTACATCGCCCACAGGCACGGCGGCGCCAACGTCCCCCGCATGGGTGA GEFGRDGRDIASLWS CCTGTGCGACGTCGCGGTGATCACCCACGCCTTCCGCCTGCTGACATGT RTRNATRRLGKRIGC CCCGACGCCACGGTGAGGAACATTGCGGCGAACGCCCTGCGTGATGCG RWEWCEERQELGIR ACAGAGAAGCGGATCGGCAGAGCCCCCTCGAACCAAGACATCGCCACC VPQIRSDDNTIVTPTA TTCCTGAGCGGCTCCCTGGATGGGGAATTCGGACGGGACGGGCGCGAC RGLLERTLKAAIRSLY ATCGCTTCACTGTGGTCCCGCACTCGCAACGCCACGCGTCGCCTGGGGA VETLKRKPDQGKAFE AGCGCATCGGCTGCCGCTGGGAGTGGTGCGAGGAGCGCCAGGAGCTG LTSKWDASNHFLDG GGAATCCGGGTGCCGCAGATCAGGTCCGACGACAACACCATCGTCACC GGFTRFADWRFIHRA CCGACGGCCAGGGGCTTGCTGGAGAGGACTCTGAAGGCCGCCATCCGC RLNCVPLNGAVRHG TCGCTGTACGTGGAAACCCTGAAGCGTAAACCGGACCAGGGTAAAGCC NRDKRCRKCGYPNET TTTGAGTTGACCAGCAAGTGGGACGCCAGCAACCACTTCCTCGACGGG LPHVLCSCKPHSRAW GGCGGCTTCACCCGTTTCGCCGACTGGCGGTTCATCCACCGTGCCCGGC QLRHNAIQNRLVKAI TCAACTGCGTCCCGCTCAACGGAGCCGTCCGCCACGGGAACCGAGACA APRLGEISVNCTIAGT AGCGTTGCAGGAAGTGCGGCTACCCCAACGAGACCCTGCCCCACGTCCT DSQLRPDVVVTDEA GTGCAGCTGCAAACCCCACTCCAGAGCCTGGCAGCTGCGCCACAACGCC QKKIILVDVTVSFENR ATCCAGAACCGCCTGGTGAAAGCCATCGCGCCACGCCTGGGGGAGATC TPAFREARARKLEKY TCCGTGAACTGCACCATCGCCGGTACCGACAGCCAGCTACGACCTGACG APLADTLRAKGYEVQ TGGTCGTCACCGACGAGGCCCAGAAAAAGATCATCCTCGTCGACGTCAC MDALIVGALGAWDP GGTCTCCTTTGAGAACAGGACCCCGGCATTTCGCGAAGCCCGAGCTCGT CNERVLRTCGIGRRY AAGCTGGAAAAGTACGCCCCCCTGGCTGACACCCTGAGAGCGAAGGGC ARLMRRLMVSDAIR TATGAGGTGCAGATGGACGCCCTGATTGTCGGAGCCCTGGGCGCCTGG WSRDIYIEHITGHRQ GACCCCTGCAACGAGCGTGTGCTGCGGACCTGCGGGATCGGTCGACGC YQEA(SEQIDNO: TACGCACGTCTCATGCGGCGCCTCATGGTCTCAGACGCCATCCGATGGT 1428) CCAGGGACATCTACATCGAGCACATCACCGGCCACCGACAGTACCAGGA GGCGTGAGCCAGAGTGACATCGTTCTCCCACTACGAGAAAGGGACCAA GTGACCTTCTCCGTTGGATCATATGAACTGGAACCATAAACTCCCTGAAC ATTAAATCTCACCAAATGAGGGTCAATCCATCCTCATCATCATATCCACT CATTATAMTCCACACCCGAACACAGCCACTCTATGAACTTCATACCCTCA TATCTCAATGTCTGTACTTTGACCCATCAACCTTTTACCCCCAATCGGGG ATATTGCAGATTATGTATTCCTCATGCCACCTGATCTTAAACCAAACTTTG CACCCTCGATAATCTGTATGTTATTCCCTGATAACCAGAAACTTCTATGCT CAAACTCTGTTCACTATTTTTTTTAACATCATCTTAATAAAATTTTTAAATC TGTT(SEQIDNO:1060) NeSL Utopia- Drosophila AAAGTGTAGTTCTTTTCTGTTTTAGTGTAGTGGGAAGTCTGTTTCTTTTTA AAAGTGTA TAAAAA YAPGYEAAQSPCGRE 1_DYak yakuba TTATGTTTTTTACGAAAAAGTCCTGGTCTTTGAAATTCATTGTCTAAATTT GTTCTTTTC ATTAAA PPRDHHRRPRDACG TAAATAAAATTATAAAATTTAAAAAGAAAATTAATTAAAGAAGCGATGA TGTTTTAG ATGCCTT SSHSPEPCLTTPRLLP AATATCTCTGAAATTCAATCAATCAATTAATCATGGCGTCTCAGCGAGTG TGTAGTGG AAAAAT ETVSAEPCDDESQRT CACGTATTTGCCTACCCCTTCGTGGGACCATTCCGGTGCTCCGTATGCAT GAAGTCTG AAATAA RYASPHKQARTLHDA GGATGCGTCCGGGATGCATCCCACTAGGTCGCTGGGCGAATACGGCAC TTTCTTTTT ATATATC EPRDASREHAPSCAE ATACGCTGCGGCATATCAGCACATAACCCGGCGCCACCCACAAGTGGTT ATTATGTT AAAATTT PRCHRCQWTHWKD ATTACATACCGTTGCCGGGTCTGTGGCGCTGATATGCCCCGGGGTATGA TTTTACGA AAAAAA CCPHSTNTTDGPEGT AGCAGCTCAAAGCCCATGTGGCCGCGAGCCACCCCGAGACCACCACAG AAAAGTCC AAAAAC DRCADTITSPATAAC ACGCCCACGGGATGCTTGTGGAAGCAGCCACAGCCCCGAACCCTGCCTT TGGTCTTT GAGGAA PQRSPCPLGSSNGCD ACCACTCCCCGCCTTCTCCCCGAGACAGTCAGCGCTGAGCCCTGTGACG GAAATTCA CAAATA ETAPEKRQPAADLVH ACGAGAGCCAGCGAACGCGGTACGCCTCTCCCCACAAGCAGGCGCGTA TTGTCTAA AACACA TAPFAVLVRAGPFAD CTCTGCACGACGCCGAGCCGCGCGATGCTTCCCGGGAACATGCCCCATC ATTTTAAA AATTCTG LVRAGPFADHHQDD CTGCGCTGAGCCCCGTTGTCACAGGTGCCAGTGGACGCACTGGAAGGA TAAAATTA AAAGAT DPLPHRSGSLGPLCSK CTGTTGCCCCCATTCCACCAACACGACAGATGGCCCCGAGGGCACCGAC TAAAATTT TTATATA QKDPRKTHQHRHSG CGCTGCGCAGACACCATCACCAGCCCCGCAACGGCCGCCTGCCCGCAAC AAAAAGA ATTTAAA QAGNQTHTDIPRAA GTTCCCCCTGCCCCCTTGGGTCAAGTAACGGGTGTGACGAGACGGCTCC AAATTAAT AWATAA PSRRAAICLMANAAA TGAGAAGCGGCAACCAGCCGCCGATCTCGTCCATACCGCCCCGTTCGCC TAAAGAAG ATCGAA TREDLLRAATSLSEM GTCCTCGTCCGTGCCGGCCCGTTCGCCGATCTCGTCCGCGCCGGCCCGT CGATGAAA AATAAA AAANQPTRSPTGGG TCGCCGACCACCACCAGGACGACGACCCCCTCCCGCACCGGTCTGGGAG TATCTCTG TGTTGA EPTSQGRRGPQALA TCTCGGCCCCCTCTGCTCCAAGCAGAAGGACCCCCGAAAGACCCACCAG AAATTCAA AAACAA DAAKRIQQIYRTNIPR CACCGCCACAGCGGGCAGGCCGGGAACCAAACCCACACGGACATACCC TCAATCAA AAAAAA AMRKVLRTLLTAVFS AGAGCCGCCCCCAGCAGAAGGGCCGCAATCTGCCTGATGGCCAATGCC TTAATCAT AATAAT ACLRTGHVPDLCKKS GCGGCCACCAGGGAGGACCTGCTGAGGGCCGCCACCAGCCTTTCCGAA GGCGTCTC AATAAT RTVLIHKKGDRTDLS ATGGCGGCCGCGAACCAGCCTACCCGCTCGCCCACTGGAGGTGGCGAG AGCGAGT AATAAA NWRPLSMGDTIPKLF CCCACCTCACAGGGTAGGCGCGGACCGCAAGCACTGGCAGACGCAGCG GCACGTAT AACACA AAVMADRLTAFLTN AAAAGGATCCAACAAATATACAGGACCAACATACCTCGCGCCATGAGAA TTGCCTAC ATAACA GGRLSEEQKGFLQHE AAGTCCTGAGAACACTGCTCACGGCAGTGTTTAGCGCCTGCCTGAGGAC CCCTTCGT CTCACCC GCHEHNFVLGQVLEE AGGTCATGTCCCCGATCTGTGTAAAAAGTCCAGAACGGTCTTAATCCAC GGGACCAT GGCCTG SRRQGKDLVMGWL AAGAAAGGSGACAGAACTGACCTGTCAAATTGGAGGCCTCTTTCCATGG TCCGGTGC CCCCAG DLSNAFGSIPHATIM GTGACACCATCCCCAAATTGTTCGCAGCCGTCATGGCGGACAGGCTGAC TCCGTATG AGGCAG DAVAGMGIPSRIRTII GGCGTTCCTCACTAACGGAGGAAGGCTCAGCGAGGAGCAGAAGGGCTT CATGGATG GTAAAC HQLATGAATTAKTID CCTCCAGCACGAAGGCTGCCATGAACACAATTTTGTTCTTGGCCAAGTG CGTCCGGG ATTTACT GMSEEIPIEAGVRQG CTGGAGGAGAGCAGACGCCAAGGCAAGGACCTCGTCATGGGCTGGCT ATGCATCC GGCCAT CPASPILFNIAIERVLR GGACCTGTCCAACGCGTTCGGGTCGATTCCGCATGCCACCATCATGGAC CACTAGGT ATGGCT KIKTVNAGYLLYGSRI GCGGTCGCCGGTATGGGGATCCCTTCGAGGATCCGGACCATAATCCACC CGCTGGGC TTTTTTT SPLAYADDLVLIASSP AGCTGGCCACCGGCGCCGCGACCACCGCCAAAACCATTGATGGCATGTC GAATACGG TAA EEMRSLLRAADDAAI GGAAGAGATCCCGATCGAAGCGGGGGTCAGACAGGGCTGCCCAGCCA CACATACG (SEQID EAGLHFNPKKCATLH GCCCAATCCTCTTTAACATCGCAATAGAGCGGGTACTTCGCAAAATCAA CTGCGGCA NO: LTGKKSSRRAVQTGF AACCGTCAACGCGGGGTACCTGCTCTATGGGAGCCGCATTAGCCCGCTG TATCAGCA 1307) LVRGTPIPAMTEGDA GCGTACGCCGATGACCTGGTGCTAATTGCGAGCTCCCCAGAGGAGATG CATAACCC YEYLGIPLGLKKNQTP AGGTCCTTGCTGCGTGCTGCGGACGACGCCGCAATAGAAGCCGGTCTG GGCGCCAC RAAMEAIVGDIAKID CACTTCAACCCCAAGAAGTGCGCGACCCTACACCTCACGGGGAAGAAAT CCACAAGT DSLLAPWQKIDAART CCTCGCGGAGGGCAGTGCAGACCGGCTTCCTCGTCCGTGGCACGCCAAT GGTTATTA FVAPKLDFVLRSGATL ACCGGCCATGACAGAGGGGGATGCCTACGAATACCTCGGCATCCCCCT CATACCGT RAPLRHLDTVIKKHIK GGGTTTAAAAAAAAACCAAACACCCAGGGCAGCGATGGAAGCGATAGT TGCCGGGT KWLYLPQRASAEVVY TGGGGACATAGCCAAGATAGATGACTCGCTGCTCGCCCCGTGGCAAAA CTGTGGCG TPLKKGGAGILPSSILA GATCGACGCGGCCCGCACCTTCGTGGCACCGAAGCTTGACTTCGTGCTA CTGA(SEQ DVLTIAQAHRMVSCP CGAAGTGGCGCCACCTTGCGGGCCCCGCTGCGTCATCTGGATACAGTCA IDNO: GEVVSRIASEGLREAV TTAAAAAACACATTAAAAAATGGCTGTATCTGCCGCAGAGGGCGAGCG 1184) KRKINREPSGDEMAH CGGAGGTAGTATACACCCCGCTGAAGAAAGGTGGAGCGGGCATACTAC FLSGSTLSGETASFGD CTTCATCTATATTGGCTGATGTCCTAACTATCGCCCAGGCTCACCGCATG AGFWSRVRMATKR GTGTCCTGCCCTGGGGAGGTCGTCTCCCGGATTGCAAGTGAGGGCCTG QAVHLGVRWAWRG AGAGAAGCGGTAAAGCGAAAAATAAACCGGGAGCCATCCGGCGACGA GELLVESRGQRNRPV GATGGCCCACTTTCTCTCAGGCTCCACTCTATCCGGGGAGACAGCCAGC ATDSNSRSQLIQRLR TTTGGCGACGCCGGATTCTGGTCGAGGGTGAGGATGGCCACCAAAAGG CAAQDEFLTILINKPD CAAGCTGTGCATCTGGGGGTGCGTTGGGCCTGGAGAGGAGGTGAGCTA QGKVAKLSTLTPVSN CTGGTCGAGAGTAGAGGACAAAGAAACCGACCAGTGGCCACCGACTCG AFIRDGSFTRFADWR AACTCCAGGTCCCAACTCATCCAACGTCTCAGGTGCGCAGCTCAGGATG FIHRARLGVLPLNGAI AGTTCCTGACCATCCTCATAAATAAACCCGACCAGGGGAAGGTGGCGAA RWGSGDKRCRVCGY GCTCTCCACGCTAACCCCAGTCAGCAACGCGTTCATACGCGACGGTAGC QLESVPHVLCHCMH TTTACCAGGTTTGCTGACTGGCGGTTTATCCACAGAGCCCGACTGGGAG HSNAMQQRHNAV TCCTCCCACTCAACGGAGCGATCCGATGGGGCAGCGGCGACAAGCGCT MDRLAKAGSRLGTP GCCGGGTCTGTGGATATCAGCTGGAGAGCGTTCCACACGTGTTGTGCCA RVNCRVEGVAEDMA CTGCATGCACCACTCAAACGCAATGCAGCAGAGGCACAACGCGGTGAT ALRPDLVWRDERSRK GGATCGCCTCGCCAAGGCTGGCTCACGGCTGGGGACCCCCAGGGTGAA IVIVDVTVPFENGAEA CTGCCGCGTGGAAGGGGTCGCCGAGGACATGGCGGCCCTCAGGCCGG FDNARGEKEEKYRPL ACCTGGTATGGCGCGACGAACGGAGCAGAAAAATCGTCATAGTTGACG AEALRAMGYQVKLE TGACTGTTCCGTTCGAGAACGGGGCTGAAGCGTTTGATAACGCGAGGG AFIVGALGSWDPKNE GCGAGAAAGAAGAAAAATACCGCCCCCTAGCTGAAGCCCTGCGCGCCA RVLKTLGVSRFYAGL TGGGATACCAGGTAAAACTGGAGGCATTCATTGTCGGAGCCTTGGGCTC MRRLMVADTIRWSR GTGGGACCCTAAAAACGAAAGGGTCCTTAAGACTTTGGGTGTCTCCAG DIYVEHVSGIRQFTLP GTTTTATGCTGGCCTGATGCGCAGACTGATGGTGGCCGACACCATCAGG SGAPSN(SEQID TGGTCCCGGGACATTTATGTGGAGCATGTATCCGGGATCAGGCAGTTCA NO:1429) CCCTGCCAAGTGGAGCTCCCTCCAACTAAAAAATTAAAATGCCTTAAAA ATAAATAAATATATCAAAATTTAAAAAAAAAAACGAGGAACAAATAAAC ACAAATTCTGAAAGATTTATATAATTTAAAAWATAAATCGAAAATAAAT GTTGAAAACAAAAAAAAAATAATAATAATAATAAAAACACAATAACACT CACCCGGCCTGCCCCAGAGGCAGGTAAACATTTACTGGCCATATGGCTT TTTTTTTAA(SEQIDNO:1061) NeSL Utopia- Gavialis CGCTGGAAAGACGGAGAACCGCTTCTTTTTCCTGCGCCCGGCCTGGTAT CGCTGGAA TGAACC MSGPRQAAADPRPS 1_Gav gangeticus TGCACTTCCTCCAGGACCAGCGCCAACCTAGTCCGGCAGACTGCCGGAA AGACGGA GCCCCC TDPRRQRDSQSPEPR TAATAGCCTCAGAAAGAGAGCTGGCTAGCAGCCCTCTTTTCTTTCCTCCG GAACCGCT CCTCCGC LTRAASRRRTPDPED GTGCAGCGTGGGTTCTTGTCAGTCCTGATGGGCTAGGGAAGGCGGTGC TCTTTTTCC GCCAGA APRTTAEHPERRRTP CGCCAGTACGTCCGAAAGAGCGCCGGTTGCGCGAGCGACCGCGCCGCT TGCGCCCG CGGACC PDPRGPSATTAGPER CAGGCGAGTAGCCCAAGGGTCTTACGGTTCGCCGGACCCGATAACGCG GCCTGGTA TTCACTT RRPPDPGGPEDDPPE AAAGCCCCGACTCGGGCCAGTAGCCGAAGACCNTGGGCCTCCCTCCCCA TTGCACTT CACTCC GLPTLVEEPRTPPTPD GGTCGGAGTAGGCGAACGCCCGTGCTCGGAGGACGGAACGTGGACAA CCTCCAGG GAGAGG PPDGRPRRGCRRGS AACACCCCCAGGTCCCAATGACGCCCTGATCCACTGACAAGAACGCTCG ACCAGCGC ATTCTTC AHVPPLPPPCEAAVP AGGCACNCCAGGAGACCCCCAGCTAGGGCAGACCGCCGACCACGGGTC CAACCTAG GACCAC DLPPAKAVQVAQRH GCGGAGGACCCTCCCAGGAGGGTGGACCAGCGAACCCGAGTCGGCGA TCCGGCAG GGACGA EQTPTALPPAAPSVLL CGAACCCCGACGCACCCCCCCCGCGATGTCGGGACCCCGACAGGCGGC ACTGCCGG CCCCGCT LPLRHRVRGPEAPEE GGCGGACCCCCGGCCATCGACCGACCCCCGGAGGCAGAGAGACTCTCA AATAATAG CCACCC PPQGMPGPRGREET GAGCCCGGAACCCCGGCTGACGAGAGCCGCCTCCCGGCGGAGGACCCC CCTCAGAA GAAGAG RHAGEVRRPTTRAAA GGACCCCGAGGACGCCCCCCGGACGACGGCGGAGCACCCCGAGCGAC AGAGAGCT GACCCC RRPARPAAPPATPPD GGCGGACTCCTCCGGACCCCCGCGGNCCCTCGGCGACGACGGCGGGCC GGCTAGCA CGCGAT QTSGDRPTERPAPAT CCGAGCGGCGACGNCCCCCGGACCCCGGCGGTCCCGAGGACGACCCCC GCCCTCTT GAGACT PPRRSAPRDPRPDVT CCGAGGGCCTCCCCACNCTGGTGGAGGAGCCCCGAACCCCCCCGACAC TTCTTTCCT CTATAC PRPDGPPPGPPGPP CGGACCCCCCCGACGGACGACCCAGGCGAGGGTGCAGACGCGGCAGC CCGGTGCA GGACTG DAPDPPRIPEPPGEP GCTCACGTTCCTCCCCTTCCCCCTCCCTGCGAAGCTGCTGTGCCCGACCT GCGTGGG AGGCAC EPPGALQLPSVPGSP GCCACCCGCCAAGGCAGTACAGGTCGCACAACGACATGAACAAACACC TTCTTGTC TTCCTTC GAETSAQQRMPTPR TACGGCGCTTCCACCAGCTGCGCCTAGCGTTCTACTGCTCCCTCTGCGGC AGTCCTGA GAACCA QALWLEELSRATAFE ACCGAGTACGAGGCCCTGAAGCTCCTGAAGAACCACCACAAGGTATGC TGGGCTAG CTTCCTC AFEASVARLTEELSAA CAGGGCCCCGGGGCCGAGAGGAGACCCGGCACGCTGGTGAGGTCCGC GGAAGGC CACCATT ARPGQPRRGADNGP CGCCCCACGACCCGGGCCGCGGCGCGAAGGCCCGCCAGACCGGCCGCC GGTGCCGC GCGGAC TTRRDHRPQPQRRP CCGCCGGCGACCCCACCGGACCAGACCTCCGGGGACCGCCCGACGGAG CAGTACGT CATTGTA RRQRYDPAAASRIQK AGACCCGCCCCGGCGACGCCACCACGCAGGTCTGCACCCAGGGACCCC CCGAAAGA ACGGGT LYRANRPKAAREILEG CGACCGGACGTGACGCCCCGACCGGACGGCCCCCCTCCCGGACCCCCG GCGCCGGT TTGTGT PSAFCQVPRETLFNYF GGGCCGCCCGACGCCCCCGACCCGCCGAGGATCCCGGAGCCGCCCGGN TGCGCGAG GTATCTA SRVFNPPAEAAAPRP GAGCCCGAGCCGCCGGGAGCCCTCCAGCTCCCGAGCGTGCCGGGGTCT CGACCGCG TCTCCTT ATVEALTPVPPAEGF CCGGGTGCGGAGACCTCCGCACAGCAGAGGATGCCCACCCCGCGGCAA CCGCTCAG TCTCTCT EEAFTPREVEARLKRT GCCCTCTGGCTGGAGGAGCTCTCCCGGGCCACCGCCTTCGAGGCCTTCG GCGAGTA CAGCGT RDTAPGRDGIRYGLL AGGCCTCGGTGGCCCGGCTCACGGAGGAGCTCTCGGCGGCCGCCCGGC GCCCAAGG CGCGAA KKRDPGCLVLSVLFN CCGGCCAGCCCCGGAGGGGCGCCGACAACGGACCGACGACGCGACGA GTCTTACG CCCCCTC RCREFRRTPAAWKR GACCACAGACCGCAGCCGCAGAGGCGACCCAGGCGCCAGCGCTACGAC GTTCGCCG CCCCACC AMTVLIHKKGDPTDP CCGGCGGCAGCCTCCCGGATCCAGAAGCTGTACCGGGCCAACCGCCCC GACCCGAT CCCCACC GNWRPIALCSTVAKL AAGGCGGCGAGAGAGATCCTGGAGGGACCCTCGGCTTTCTGCCAGGTC AACGCGAA CCCGGG YASCLAARITDWAVT CCCCGGGAGACTCTGTTCAACTATTTCAGCAGGGTCTTCAACCCCCCGGC AGCCCCGA CTTAGTT GGAVSRSQKGFMST AGAAGCCGCCGCCCCACGCCCCGCGACCGTCGAAGCGCTGACCCCCGTC CTCGGGCC GGCTAA EGCYEHNFTLQMAL CCCCCGGCAGAGGGGTTCGAGGAGGCCTTCACGCCGCGGGAAGTGGA AGTAGCCG CATTGTA DNARRTRKQCAVA AGCCCGCCTGAAGAGGACCAGGGACACCGCCCCCGGCAGGGACGGCAT AAGACCNT TCTCCTG WLDISNAFGSVPHRR CAGGTACGGTCTCCTNAAGAAACGTGACCCGGGCTGCCTCGTTCTTTCT GGGCCTCC TAACCTA IFGTLRELGLPDGVID GTTCTCTTCAACAGGTGCAGAGAGTTCCGGCGCACGCCCGCCGCCTGGA CTCCCCAG GTCGCG LVRELYHGCTTTVRA AGAGGGCCATGACGGTCCTCATCCACAAGAAGGGAGACCCGACCGACC GTCGGAGT TTCCCCT TDGETAEIPIRSGVRQ CGGGCAACTGGAGACCCATCGCCCTGTGCTCCACCGTGGCCAAGCTGTA AGGCGAA CCTCACC GCPLSPIIFNLAMEPL CGCCAGCTGCCTGGCGGCCCGCATCACCGACTGGGCGGTGACCGGCGG CGCCCGTG CCCATCC LRAVAGGPGGLDLY GGCCGTCAGCCGGAGCCAGAAGGGCTTCATGTCGACGGAGGGCTGCTA CTCGGAGG CTCTATT GQKLSVLAYADDLVL CGAACACAACTTCACCCTCCAGATGGCCCTGGACAACGCCCGGAGGACC ACGGAAC GTTAGT LAPDATQLQQMLDV AGGAAGCAGTGCGCGGTGGCGTGGCTGGACATCTCCAACGCCTTCGGC GTGGACAA CCCTCGC TSEAARWMGLRFNV TCCGTGCCCCACCGCCGCATCTTCGGCACCCTCCGCGAGCTGGGCCTAC AACACCCC TCGGGC AKCASLHIDGRQKSR CGGACGGCGTCATCGACCTGGTGCGAGAGCTCTACCACGGNTGCACCA CAGGTCCC GATCTG VLDSTLTIQGQAMRH CGACCGTCCGCGCCACCGACGGAGAGACCGCGGAGATCCCCATCCGGT AATGACGC TATTTCC LRDGEAYCHLGTPTG CGGGGGTGAGGCAGGGCTGCCCCCTCAGCCCCATCATCTTCAACCTGGC CCTGATCC CTATCG HRAKQTPEETINGIV CATGGAACCGCTCCTTCGAGCCGTGGGGGCGGCCCCGGCGGGCTCGA ACTGACAA GCTTTGT QDAHKLDSSLLAPW CCTGTACGGCCAGAAGTTGAGCGTCCTGGCCTACGCCGACGACCTCGTC GAACGCTC CATCTTT QKIDAANTFLIPRVAF CTCCTCGCCCCCGACGCCACCCAGCTGCAGCAGATGCTGGACGTGACGT GAGGCAC TTTCTGG VLRGSAVPKTPLKKA CCGAGGCGGCCAGGTGGATGGGCCTGCGCTTCAACGTCGCCAAGTGCG NCCAGGA ATTCCCG DAEIRRLLKKWLHLPL CCTCCCTGCACATCGACGGCAGGCAGAAGAGCCGCGTCCTGGACTCCAC GACCCCCA ATCCTAA RASNEVLHIPYRQGG CCTCACGATCCAGGGCCAGGCGATGAGGCACCTGCGCGACGGCGAGGC GCTAGGGC ACATTTA ANVPRMGDLCDIAV CTACTGCCACCTGGGGACGCCCACCGGCCACCGGGCCAAGCAGACGCC AGACCGCC CTAATA VTHAFRLLTCPDATV GGAGGAGACCATCAACGGGATCGTGCAGGACGCCCACAAGCTGGACTC GACCACGG AAAGTC SIIAASALEETARKRIA GTCCCTGCTGGCCCCCTGGCAGAAGATAGACGCGGCGAACACCTTCCTC GTCGCGGA AATCTGT RQPTGRDLATFLSGS ATCCCCCGCGTCGCGTTCGTCCTGAGAGGCTCGGCGGTCCCCAAGACCC GGACCCTC TCTTT LEGEFGRDGGDFASL CCCTCAAGAAGGCGGACGCCGAGATCCGGCGGCTGCTCAAGAAGTGGC CCAGGAG (SEQID WSRARNATRRLGKRI TGCACCTGCCGCTGAGGGCCAGCAACGAGGTCCTGCACATCCCCTACCG GGTGGACC NO: GCAWTWTEECRELG GCAGGGAGGCGCCAACGTCCCCCGCATGGGAGACCTCTGCGACATCGC AGCGAACC 1308) VSLQPAPHADRVTVT GGTGGTCACCCACGCCTTCCGCCTCCTGACCTGCCCGGACGCGACGGTA CGAGTCGG PRTRTFLERFLKDAVR AGTATCATCGCCGCCAGCGCCCTCGAGGAGACCGCCCGCAAGAGGATC CGACGAAC NKYAGDLRAKPDQG GCGAGGCAGCCGACCGGACGNGACTTGGCCACCTTCCTCAGCGGCTCG CCCGACGC KVFDVTSKWDASNH CTGGAGGGCGAGTTCGGCCGAGACGGCGGGGACTTTGCCTCGCTGTGG ACCCCCCC FMPSGSFTRFADWR AGCCGAGCCCGCAACGCCACGCGCCGCCTCGGGAAGCGCATCGGCTGC CGCG(SEQ FLHRARLNCLPLNGA GCCTGGACCTGGACCGAGGAGTGCCGGGAGCTGGGAGTCTCCCTGCAA IDNO: VRFGHRDKRCRRCG CCAGCCCCGCACGCCGACCGCGTCACCGTGACGCCCCGCACGAGGACCT 1185) YAAETLPHVLCSCKP TCCTGGAGAGGTTCCTGAAGGACGCCGTCCGAAACAAGTACGCCGGCG HARAWQLRHNAVQ ACCTGAGGGCCAAACCCGACCAGGGCAAGGTCTTCGACGTCACCTCGA DRLVRAIPAAAGEISV AGTGGGACGCTAGCAACCACTTCATGCCCAGCGGGAGCTTCACGCGCTT NRTVPGCESQMRPD CGCGGACTGGCGCTTCCTCCACCGCGCCCGCCTCAACTGCCTGCCTCTGA IVITNEEAKKVVIVDV ACGGGGCCGTGCGCTTCGGCCACCGGGACAAGAGGTGCCGACGGTGC TIPFENRRQAFTDAR GGCTACGCGGCAGAGACCCTCCCCCACGTGCTGTGCAGCTGCAAGCCG ARKREKYAPLADTLR CACGCCAGAGCCTGGCAGCTCCGCCACAACGCTGTCCAGGACCGCCTG GRGYDVTVDALIVGT GTGAGGGCCATCCCGGCCGCGGCGGGGGAGATCTCCGTGAACCGCACC LGAWDPSNESVLRA GTCCCGGGCTGCGAGAGCCAGATGCGACCCGACATAGTCATCACCAAC CRVSRRYAKLMRCL GAAGAGGCCAAGAAGGTCGTGATCGTGGACGTCACCATCCCCTTCGAG MVSDTIRWSRDIYVE AACCGGCGCCAAGCCTTCACCGACGCCCGGGCTCGCAAGCGGGAGAAG HITGHRQYSDPTRRA TACGCCCCGCTGGCCGACACCCTGAGGGGCCGCGGCTACGACGTGACG AAGPDPEGTA(SEQ GTCGACGCGCTCATCGTGGGAACGCTCGGAGCCTGGGACCCCAGCAAC IDNO:1430) GAGAGCGTCCTGCGTGCCTGCCGCGTCTCCCGCCGCTACGCCAAGCTGA TGCGCTGCCTCATGGTGTCCGACACCATCCGTTGGTCCCGCGACATCTAC GTGGAACACATCACGGGCCACCGCCAGTACTCCGACCCCACCAGACGA GCCGCCGCCGGACCGGACCCGGAGGGGACCGCCTGAACCGCCCCCCCT CCGCGCCAGACGGACCTTCACTTCACTCCGAGAGGATTCTTCGACCACG GACGACCCCGCTCCACCCGAAGAGGACCCCCGCGATGAGACTCTATACG GACTGAGGCACTTCCTTCGAACCACTTCCTCCACCATTGCGGACCATTGT AACGGGTTTGTGTGTATCTATCTCCTTTCTCTCTCAGCGTCGCGAACCCC CTCCCCCACCCCCCACCCCCGGGCTTAGTTGGCTAACATTGTATCTCCTG TAACCTAGTCGCGTTCCCCTCCTCACCCCCATCCCTCTATTGTTAGTCCCT CGCTCGGGCGATCTGTATTTCCCTATCGGCTTTGTCATCTTTTTTCTGGAT TCCCGATCCTAAACATTTACTAATAAAAGTCAATCTGTTCTTT(SEQID NO:1062) NeSL Utopia- AGCV01358106 Lytechinus ATCTACTATCATGTCTTGTCCAAGAGAGGGAAGCGATCACCTCGGTCCT ATCTACTA TGAATA MSCPREGSDHLGPD 1_LV variegatus GATCCTGAGACACCCGCCCTCCATCAGGGTTCTGACATCCGGGTTACCA TC(SEQID GCATTTA PETPALHQGSDIRVT GTTCTCGCCTTCGAGGTTCCCGCGGAAAGAGTTCTCGCCAACCAAGCTC NO:1186) TATTGTG SSRLRGSRGKSSRQP CCGACACCAAGTTCCTGCCAGCGAGGCTTCCGCCACCGCCCAGCAGACT TTCCAAA SSRHQVPASEASATA GCCGCCAACGAGTGCCAGGTGTGTGGATCTTCCTTCGCCACCTCCAGTG CAACAT QQTAANECQVCGSS GACTCCGCCGCCACATGGCCAGGCTTCATCGAGCTGCCTCTGCGGATCC ACTCATT FATSSGLRRHMARLH TGAGGGTGCTGCGCCGGCTTCCATCACAGAGATTTTCGACTACCCCTTG ATTATAT RAASADPEGAAPASI CCTTCCCGGTGGAAATGCTCGGCATGCTCGGAGAACTTTTTCAACCAGC CTAAAC TEIFDYPLPSRWKCSA AGACCCTCAAGCGACACCAGACCAGGCATCATCCAGCTACCACCTTCGC ATTTTTT CSENFFNQQTLKRH GTATGCCTTTCGGTGTTCGTCATGCCGGTCCGAGTTCGACTCAGCACGG TTTCTGT QTRHHPATTFAYAFR AGGGCTGCGAACCATTGGCAGGTCCACAAGAAGGAGCGATCTCAACTC TCCTGAC CSSCRSEFDSARRAA TCTGGCACCGAGCCCCAGGCCTCTTCCCAAGCCAGAGTTAGCATGGCTC AATCTAC NHWQVHKKERSQLS ATTCTCCTCCACCTCTGCCCAACACTTCTTGGGCGGAGCTCGCCTCGAAT GTAAAG GTEPQASSQARVSM CCTGCCGAGATACCTTCCTTCGTCTGGGAGTCTCCTCCCAAGAACCGCCC TCTGCTA AHSPPPLPNTSWAEL CTCGGTTGAGGAGTTCGGTTCGTCTCTGCCAACTGATGTTACGATGATG ACCAAC ASNPAEIPSFVWESP TCTCAAAGTCCTCCACCGCAGGTACAGTCGTCTCCTGTCCCTGCTCTGAC TGGCAT PKNRPSVEEFGSSLPT TCCTCTTTCACCCGCTGCCACTGCCTCCAGTTCTCCTCCAGGGGCTGCAA GATGAA DVTMMSQSPPPQV GGCAGCTGACCCCTCCTACACAGACTAACACCCCAGTCACCCAGAGGGC ATAAGA QSSPVPALTPLSPAAT TCGCCTGCAACCTGAAGCAGACGTCGTACCTGAACTCCCTCCTTCAGTCA TAAAAT ASSSPPGAARQLTPP CCGAGCACCCTGTGTCTGACGCTCAACACTGGGTTGATGCTGTATCCTCT CCCCTTA TQTNTPVTQRARLQ GCATCAGATTGGTCTGAGTTTGAAGCAGTATGTGATCAATTTGTCATCCA CACATTA PEADVVPELPPSVTE CGCTGTTGCTGTTTCCCGTCCCAATCTTGCTCGACCCCAGCAGCAAGATA ATTTCTT HPVSDAQHWVDAV GGCAGAGATCTGGTGACCACCCTCCTAGACAGCAAAGAGGTCAGCATC GTCACA SSASDWSEFEAVCD GACCAACCTTCGATGTCCGTGAGGCAAGTAGAATCCAGAAGCTCTATCG TCATAAT QFVIHAVAVSRPNLA TACCAGCAAGAAAAGAGCCATCAGACACATACTGAAAGAGAAATCACC GCTTTGT RPQQQDRQRSGDH TTCCTTCTCTGGTTCCGAGTCAGACGTCTTAGACTTCTTCCGCGAGGTGT CAAAGC PPRQQRGQHRPTFD ATTCTGCTAAAGAAGTTGACGAGGAAGCAGTTGGTAAACTAGCATCCTC AATGTC VREASRIQKLYRTSKK GCTCTTCGATGTCCCTCAAGGTGATGACTCTGCGACATCTCTGTCTCTGC CTACATA RAIRHILKEKSPSFSGS CCACGTCAGCGAAGGAGATCGGAGCAAGGCTGTCAAGGATGACAAACT ATATCTC ESDVLDFFREVYSAKE CTGCCCCCGGGAAGGATCGCTTGGAGTACAGACACATTCGACGTGCGG GATGTC VDEEAVGKLASSLFD ACGGGTCCTTCAGCATCTCTGAGGCCATCTTTAACAAATGCCTGGCTGA ACCCCA VPQGDDSATSLSLPT AGGTCGGATCCCAGCTCCTTGGAAGACAGCATCTACCATCCTACTTCACA ATTAATT SAKEIGARLSRMTNS AGGCTGGCCCCACGGATGATCCCGCCAACTTCCGCCCAATCGCCTTACA TTACATC APGKDRLEYRHIRRA GTCATGTCTCTACAAGCTTTTTATGGCTGTACTTGCGGACCGGCTGACCA CTTCGG DGSFSISEAIFNKCLA AGTGGGCCTGTGAGAACCAGTACCTCAGCCCCGAGCAGAAGTCCGCTC TAACCTT EGRIPAPWKTASTILL GCCCCTGCGAGGGGTGCTTCGAGCACTCCTTCCTTCTCTCAGCTGCCCTG TATACC HKAGPTDDPANFRPI AAGGACTGCAGGAGAAACCAGAAGACCATCTGCATCGGTTGGTTGGAC GTTGGA ALQSCLYKLFMAVLA CTTAGGAATGCATTTGGAAGCATTCCTCATCCTGTCATCAAGATCGTCCT TCAACAT DRLTKWACENQYLSP GTCCAGTCTGGGTGTCCCTGATTCGCTTGTTACCCTCCTCATGGATGCCT ATATGA EQKSARPCEGCFEHS ACAATGGTGCGTCAACCTCGTTCACGCTGACCGGGGGCCAGACCGACA TTTGTAA FLLSAALKDCRRNQK CCGTACCCATCAGATCAGGGGTGAAGCAAGGCTGCCCGATGTCCCCAAT AACTGTT TICIGWLDLRNAFGSI CCTCTTCAACCTGGCCATCGAACTTATCATCAGGGCAGTCAAGAAGAAT ATTTCTG PHPVIKIVLSSLGVPD GCATCAGACAACCATCTCGGAGTGACTGTCCAGGGCAAGAACCTCTCCA AGTTTTT SLVTLLMDAYNGAST TCCTGGCCTATGCTGATGACCTAGTGCTGCTCAGCCGAGACACTGAAGG TCTATGC SFTLTGGQTDTVPIRS CCTCCAATCCCTCCTTCAAGTTGCTGGCTCTTCTGCATCTACCCTTCAGAT TAATAA GVKQGCPMSPILFNL GCAGTTTAAGCCCCAGAAGTGTGCAACACTCACCCTTGACTGCAAGCGT A(SEQ AIELIIRAVKKNASDN GGTACCAATGTTAGGCAGTCTGCTCACCATATCCAAGGGGCTGCCATCC IDNO: HLGVTVQGKNLSILA CATCGCTTACTGAAGAGGAGCGTTACCGCTACCTTGGGGTCCCCATTGG 1309) YADDLVLLSRDTEGL ACTCCCAAGGCTCACAAGTCTACAGGAGAGCTCTAGGAAGCTGTCGTCA QSLLQVAGSSASTLQ GACATCGAGACCATTTCATCCTCCCTTCTAGCGCCTTGGCAGAAACTGGA MQFKPQKCATLTLDC TGCAATCAAGACCTTTGTGATACCCATCCTTCAGTATACCTTGCGGGCCA KRGTNVRQSAHHIQ CAGAATACCTCAAGTCCGACTTGAAACCCCTACGAGCAGCCATTATCAA GAAIPSLTEEERYRYL GCACGTGAAGAAGATCTGCCATCTTCCGGTAAGATCATCCAACGCCTTC GVPIGLPRLTSLQESS GTCTTCGCGTCCCGACCCTCCGGTGGTTTGGCTTTTGTTGACCCAGGAGT RKLSSDIETISSSLLAP GGATGCCGACATCCTTGTCGTAACCCAAGCCGTCCGTACCCTGGCATCG WQKLDAIKTFVIPILQ GATGATGACACGGTGAGAGCCGTTGCTCTAGGACAGCTGACCTCAGTG YTLRATEYLKSDLKPL GTCCACAGGACGGTCCATTCTGCACCATCAGATGACTGCATCGACAAGT RAAIIKHVKKICHLPV TCTTGTCCGGAAGCAGTGAGGGCCCTCTGGCAAACTCAGGGAACAGTG RSSNAFVFASRPSGG GCCAGGCATCATCTCTTTGGTCACGTACAAGGGCTGCATCCCGTCGCCT LAFVDPGVDADILVV GAAGATAAGGATCGTCGGAGCCTCGAGCGGCGACATCAAGGTCGAGA TQAVRTLASDDDTVR GTGGAGGCCGTGCCATCCCCTCCAAGAAAGTGACTGCAGGTCTCCGCTC AVALGQLTSVVHRTV CGACCATCACAACGAGATGTCAGAGAAGCTGCGGTCCCTTCCAGACCAG HSAPSDDCIDKFLSGS GGCAAGGTAGCCAGGGCACTCTCCTTGGATAGTTTCGCCAACGCAACAT SEGPLANSGNSGQAS CATGGCTGACATCCGGGAGTTTCATCCGATTTTGCGACTGGAGGTTCAT SLWSRTRAASRRLKIR CCACCGTGCTCGCCTAAACTGTCTACCTACCAACGCTGCTGTCCGGCGGT IVGASSGDIKVESGGR GGAAACAGAACGCCAACACCAAGTGCCGAAGGTGTGATCACCAGTTGG AIPSKKVTAGLRSDH AGACGTTGCCCCACATTATTAACAACTGCCGTCCCAACATGGTACCGATC HNEMSEKLRSLPDQ CGCCGGAGGCACAACTCCATCCAAGAACGCCTGGTGAAGGCGATTCAC GKVARALSLDSFANA TACGGTGACATCTACCAGGACCAGCATGTCCCCGGCGACCCGAACCCAA TSWLTSGSFIRFCDW GGGAGCGACCCGACATCACAGTGGTTGAGGGTAACAAGGTGACCATTG RFIHRARLNCLPTNA TCGACATCACCATCCCATTCGACAATGGTCCGGATGCCCTCTCCACGGCT AVRRWKQNANTKCR GCCAACGCCAAAGTCATGAAATATGACACACTGCGACAAGAGCTGGCC RCDHQLETLPHIINNC AGCCGTGGTATGGATGTCGAGGTCCACGCTTTTGTAATCGGCTCCCTTG RPNMVPIRRRHNSIQ GGTCGTGGCACGGGGACAACGAGCGTGTTCTGGGACGTCTAGGTATCT ERLVKAIHYGDIYQD CTCGCCGATACAGAACACTGATGAGGCGCCTGTGCTGCATAGATGCGAT QHVPGDPNPRERPDI CAAAGGCAGCCGGGATATCTATATCGAACATGTTACCGGTCACCGGCAG TVVEGNKVTIVDITIP TATTGAATAGCATTTATATTGTGTTCCAAACAACATACTCATTATTATATC FDNGPDALSTAANA TAAACATTTTTTTTTCTGTTCCTGACAATCTACGTAAAGTCTGCTAACCAA KVMKYDTLRQELASR CTGGCATGATGAAATAAGATAAAATCCCCTTACACATTAATTTCTTGTCA GMDVEVHAFVIGSL CATCATAATGCTTTGTCAAAGCAATGTCCTACATAATATCTCGATGTCAC GSWHGDNERVLGRL CCCAATTAATTTTACATCCTTCGGTAACCTTTATACCGTTGGATCAACATA GISRRYRTLMRRLCCI TATGATTTGTAAAACTGTTATTTCTGAGTTTTTTCTATGCTAATAAA(SEQ DAIKGSRDIYIEHVTG IDNO:1063) HRQY(SEQIDNO: 1431) NeSL Utopia- Nasonia CCATTCCTTCGTACGGGTTTTCGTGCCGGCATAGCCGGGTGGGAGACTC CCATTCCT TAGTGG SGXTGREVKCITVNV 1_NVit vitripennis GCGCGGGGGAGGTCATATCTCACCACCATCCTCGAGCTTTTTGCTGCAC TCGTACGG GGCCAT LMEQQPHTKAIREG CTGATCCGGGMTAACTGGCCGAGAAGTTAAATGTATTACTGTTAATGTA GTTTTCGT AACACC DFIVILLPQSDDETLC CTCATGGAGCAACAACCACACACCAAAGCGATCCGGGAGGGGGACTTC GCCGGCAT TAGGCC CPLCVGRGRYSGKTR ATCGTGATACTCCTCCCGCAAAGTGACGACGAGACACTTTGCTGTCCACT AGCCGGGT CCACAG VECLNRHVKEVHPDL CTGCGTGGGACGCGGACGGTACTCCGGCAAGACGCGGGTTGAGTGTCT GGGAGAC TGTGGC TTTFRCWGCGFAAP CAATAGACATGTCAAAGAGGTGCACCCAGACCTAACCACCACCTTTAGG TCGCGCGG GATGTC GDKKYPRKIVTQHCA TGCTGGGGGTGCGGGTTCGCGGCTCCCGGGGATAAGAAATATCCCCGG GGGAGGT CATGTG TCVPEVSSAPSGRVD AAGATCGTGACCCAGCACTGCGCCACCTGCGTCCCCGAGGTTTCGTCCG CATATCTC TGTCGT GERRVNTRRRLGIAA CGCCCTCGGGCAGAGTGGACGGAGAGAGGAGAGTTAACACCCGCCGTC ACCACCAT CCTTACT ATEASPVRRTRRNGL GGTTGGGTATTGCCGCGGCAACCGAGGCCAGTCCCGTGAGACGTACTC CCTCGAGC TATTTAT ASPPVEQNISQSAAP GTCGTAACGGCCTGGCCTCACCTCCAGTCGAGCAGAACATCAGCCAAAG TTTTTGCT TTATTTG PEPARVPQHPEIVAL CGCGGCTCCTCCGGAACCCGCAAGGGTGCCACAACATCCGGAGATAGT GCACCTGA TCCTGTC GESADDEVFRSPVNS TGCGCTGGGAGAGTCAGCGGATGACGAAGTATTTCGATCTCCGGTGAA (SEQID GAGCTG PPRDWRAAAPQQA CTCTCCCCCAAGAGACTGGAGGGCTGCGGCGCCCCAACAGGCGGCTTC NO:1187) TTTAATC ASSSPXAVPGITAATP CTCCTCTCCTCSGGCGGTGCCCGGTATCACTGCTGCAACCCCGTCCAACA TATTCAT SNTTRTGNGSAXSILA CCACCAGGACAGGGAATGGCTCGGCANACTCCATCCTAGCCGAGCACC ATGTAT EHPIPAPPPTNTTEA CGATACCTGCACCTCCGCCAACCAACACCACGGAGGCGAATGGTCGTGC GTGTTG NGRADIPRSGRAPPP CGACATACCCCGTTCTGGGCGGGCTCCGCCGCCCGGTCNACAAGCCGCC TGTGTG GXQAARRRAPTTEQ CGTAGAAGGGCTCCCACGACAGAACAACGTCGCATTGTGGGACTCCTC TCGTCCC RRIVGLLEAATGREQL GAGGCGGCCACTGGTCGGGAGCAGCTTGAGGAGGCAACGACTCAAGC TCGTAG EEATTQAMLFLARLT TATGCTATTCCTGGCCAGATTGACCGGGCGGCGGCCTGAACCCCGGAAC CAGTTTT GRRPEPRNAIRPGXR GCCATTCGGCCGGGAMCGCGGCAACGTCATCCAGCTCAGGGGGATGTC ATTCCGT QRHPAQGDVQAQA CAAGCGCAGGCGCCTGATCGGATTCASGAGGCCAAGAAACTCCAGCGT CCAACC PDRIXEAKKLQRLYRT CTCTACCGAACATCCAAGAAGCGGGCAGTMCAGAAGATTTTGGCAGGT AGAGGT SKKRAVQKILAGPXM CCTNATATGAACTGCCAAATCGACAAGAATACCATCACAGCCCACTTTGT CGACCA NCQIDKNTITAHFVEL GGAGCTGGCTGCCCGCAGAGACGGTGGAGAGGATTGGCCCGATGTCTT ATATTAA AARRDGGEDWPDVF CGATCGGGAAGAGCCCACGGCGGCGAGCGGCGAGGCCCTTTGTACCCC AATAAG DREEPTAASGEALCT TATCACAAGGGAGGAAGTGTTTCGACGGCTCAAGGGGAGGAACAACAC CATGGC PITREEVFRRLKGRNN CTCACCTGGGCCGGACGGAATCACCTACAGGGACCTCGCCAAGGCCSAT TTGAAG TSPGPDGITYRDLAK CCAGGGGCCCATGTGCTCGCCGCCCTCTATAACASCATCTGGCGCATTG CAGGCC AXPGAHVLAALYNXI AGGCGACGCCTGCCTTGTGGGGGGTGTCCAACACMACCCTKATATACA AAGCGC WRIEATPALWGVSN AGAAAGGCGACGCCATGGACATCTCCAACTGGAGACCGATCTCCTTGG CGTGTT TTLIYKKGDAMDISN GAGATACGGTCCCCAAGCTATTTGCCGCGATCCTGGCCGATAGAATCAA CTAACCC WRPISLGDTVPKLFA AAGGTGGGCCGTGGCCAACGGGCGCTACAGTGCCTCCCAGAAGGGCTT CGTTTTA AILADRIKRWAVANG CTTGGAGTTCGAGGGATGCTATGAACATAACTTCGTGCTCCAGGAGGCC GGGGAA RYSASQKGFLEFEGC ATTCGGGAGGCAAAAGGCGGGAGAAAGGAGTTGGTGGTGGCCTGGCT GTTACTT YEHNFVLQEAIREAK GGACTTAGCCAGTGCCTTTACATCCGTGCCCCACTCCTCTATCCTGCAGG AACCTA GGRKELVVAWLDLA CGCTCGAGGGACATGGCCTACCAAGCAAAGCGAGAAACATTATCTCCTC AAAATA SAFTSVPHSSILQALE GCTCTACACAGGCATGACCACCCGCTTCCACACGGCGGAGGGACCCACC CAACTTT GHGLPSKARNIISSLY GACCCTATTCTCATCCAGTCGGGGGTCAGACAGGGTTGCCCATTAAGCC TCC(SEQ TGMTTRFHTAEGPT CCGACGTGTTCAACCTTACCCTGGAAGTTGTCCTACGGGAAATACAACG IDNO: DPILIQSGVRQGCPLS GACGGGGGAGGGCTACACGATCGAGGGCAGGCGTATTTCCCATCTCGC 1310) PDVFNLTLEVVLREIQ CTACGCCGATGATGTCGCGATCCTGGCTGACTCCCCGGCTGGAATGAGG RTGEGYTIEGRRISHL AGGCTGCTTTTTGCAGCAGAACGGGGAGCTCGGGCGGTCGGCCTCACC AYADDVAILADSPAG TTCAACCCAGCGAAGTGCGCTACACTGCACATAGCGGGACGAGGKGAG MRRLLFAAERGARA GAGGCAGTTAGACCGACTGAGTTCTCCGTTCAAGGGACACCTGTCCGTG VGLTFNPAKCATLHI CCCTTGCTTCCGGGGAGGCCTACGAACATCTTGGCATCCCCACAGGCTA AGRGEEAVRPTEFSV CCAAGTCAGGCAGACCCCAATCAACACCTTGCGGGATCTCCTTGCTGAC QGTPVRALASGEAYE ATCGGCTCCATCGACCGGTCCTTACTGGCGSCCTGGCAGAAGCTGGATG HLGIPTGYQVRQTPI CAGTGGGAACCTTCCTACTGCCCCGGCTTGATTTCACCATGCAGGGCGC NTLRDLLADIGSIDRS CCATATCGACAAGGGCTTCTTGACCGAGGCGGACAAGATCATCAAAAAA LLAXWQKLDAVGTFL GCTGCAAAATCTTGGCTAAGCCTTCCTCAGCGAGCGAGCGCGGAGCTCG LPRLDFTMQGAHIDK TGTTTCTACCTCCCTCGCAAGGGGGAGGGGGTTTGCTCACGGTCGCTCA GFLTEADKIIKKAAKS CAGCTATAAAATGCTGTACTCGAGCGATGTCACCGTGAGCACCATAGCG WLSLPQRASAELVFL GGATCCACACTGAGGAGGACCGTTTCGGAGAGGCTTAAAAAGCGGGCG PPSQGGGGLLTVAHS TCTAACATCGATATCGCCCGCTTCCTCTCCGGAGATCTTGATCTGCCCCG YKMLYSSDVTVSTIA GTCGACGAGCCCGTCGACATTCTGGACCAAGGTTCGGAGCGCGGCCCT GSTLRRTVSERLKKRA GAGAATTAAGACCAAACTCGGGCTGCGCTGGAGCTGGTGTCAGGGGG SNIDIARFLSGDLDLP GGGAAGTACTCCTGATGGCCTGTGGGGATCCACGTGCCCCCGGAACAA RSTSPSTFWTKVRSA GAGTCTCGCCGCAGACAAAACATCTTGTCACCACGAGTCTGCGGCGCTG ALRIKTKLGLRWSWC CCTGAATCGGCACTATGCGGAGAGTCTCCTTGCCAAAAAAGATCAAGGC QGGEVLLMACGDPR AAGGTCTTTGAGGTTACCCGGAGGAGCGGACAATCCAACCACTTCCTCC APGTRVSPQTKHLVT GGAGTGGCAGCTTCACTCGCTTCTGCGACTGGCGCTTTATCCACCGCGC TSLRRCLNRHYAESLL CCGTCTCGACGTTCTCCCCCTCAATGCTGCCAAACGGTGGCAACGAGGG AKKDQGKVFEVTRRS ATGGACAAGAGATGTCGCAGATGCGGGTCAGACCTGGAGACTCTCCCG GQSNHFLRSGSFTRF CATGTGCTGAGTCACTGCGGCCCGCACTCGGCGGCGAGACAAAAGAGG CDWRFIHRARLDVLP CACAACAACATCCAGGACAGGTTGGTGAAGGCAGCCTCGAGATGTCCG LNAAKRWQRGMDK GGGACCATTAGTGTCAACCAGACCGTTGTGGGCGTCCGCGGCCCCGAC RCRRCGSDLETLPHV GCGGCATTAAGACCKGACATTGTCGTTCGCGACGATGTCAACCGCCGCG LSHCGPHSAARQKR TCACCATCGTCGATGTGGCCGTGCCGTTCGAAAACCGCCTCGAGGCGTT HNNIQDRLVKAASRC CGACGGCGTTCGKGAGGCCAAGATTGCCAAATATACACCTTTGGCCAGG PGTISVNQTVVGVRG CAACTCACGGACAGCGGTTACACCGTAACGGTGGAGGCCTTTGTAGTG PDAALRPDIVVRDDV GGCGCCCTCGGCGCGTGGGACCCGCGAAATGAACGGGTCCTGAGCCTC NRRVTIVDVAVPFEN CTCAGCATTAGTCGTTATTATGCGATACTGATGAGGAGGCTCATGGTTA RLEAFDGVREAKIAKY GTGACACAATTCGCTGGTCCCGGGACATCTACGTGGAACATGTGTCGGG TPLARQLTDSGYTVT CATTCGGCAGTACCGGGAATAGTGGGGCCATAACACCTAGGCCCCACA VEAFVVGALGAWDP GTGTGGCGATGTCCATGTGTGTCGTCCTTACTTATTTATTTATTTGTCCTG RNERVLSLLSISRYYAI TCGAGCTGTTTAATCTATTCATATGTATGTGTTGTGTGTGTCGTCCCTCGT LMRRLMVSDTIRWS AGCAGTTTTATTCCGTCCAACCAGAGGTCGACCAATATTAAAATAAGCAT RDIYVEHVSGIRQYRE GGCTTGAAGCAGGCCAAGCGCCGTGTTCTAACCCCGTTTTAGGGGAAGT (SEQIDNO:1432) TACTTAACCTAAAAATACAACTTTTCC(SEQIDNO:1064) NeSL Utopia- Phytophthora GCCCGCCGGTGGAGTAGCCATGTTGGCCACCACCGCCCAAGTCTCCGCC GCCCGCCG TAGACG MVVSRITARLEATPA 1_PCa capsici GCAGCTGCGACTGCTGCTGCTCATCGAGTCGCAGTAGTCGCAGCTGCAC GTGGAGTA GCACAG PRWDPPLPRRVIASRI AAGCACCACCTCCTGCCGGACGCGCCGCCGTGGAGCACCACGCGCGCG GCCATGTT TTCTGG ADRLVPATAPCRSAL CTGAGCCGTACCAAGACCAAGGNTTCCAGGCTCGCGCGCGCGTGGMG GGCCACCA CCCACG NAAFPSPSRDTVTES CTCCCAGCCAGCGACGCGTTCAGCAGCAGGGTCGGCTGCGGCAGCTCG CCGCCCAA TAGGCC FTQEDRQLEPLTRHV ACGCTGCACGGGGACGGCAGCGACGGCAGGCACCAAGACCACGACGA GTCTCCGC GAAAGG DEETKDSELPGRAPT CCAAGCCGTCGCTGCCGTCCCCATGGACGTCGACCAAGGTGCTCGGTGG CGCAGCTG GCCCCA VLDEESKDNDATAGE CCGACAGCGGATCAMCACCCGCTGTCGCCGCCACCGGAGCTCCGAGTC CGACTGCT CCCATGT WLLRFDGACRANPG GGCGGCAAGCGCCGCCGCCTGAACGACGGCGACGACGAAGACATCCGC GCTGCTCA AGGGAA PGGAGAVLFNPSGA GAGCTGGCCGAGCTTCTGCTGCCCGACGAGGAGGAGGCCGACGACCAC TCGAGTCG CCGCCCT ASWTCSHFMPGATE AAACCAGCGCCCAGGTTACCCGCGACCAGCGCTCATCCGGCCTCCGTCC CAGTAGTC CGGGAA TNNTAEYTAFLLGAR TCGCTGTGTACGCGCACAACGCGCAGCGCTTCAACTGCACGTTGTGCGT GCAGCTGC ATCCATT AAADHGATLLRVQG GTACACGGCTGCCAGCTTCGCTGCTCTTACGCGACACAGGGACTCTCGG ACAAGCAC CCGGTG DSQLVLRQVKGIYGA CACCGGCGCGTGACCTTCCTGGACAGGTTCTCGGCGGGTTGCGCGTGC CACCTCCT TTCGACT KSTRLRRLRDAVRAEL GGCAAACCTTTTGCCTCGAGGCTGGCCGCAGCAAGACACGCACAAACG GCCGGAC GAAGAG ARVGQFSLHHIDRQD TGCGCCAGCCTCAGCACCACACTGGTCGCGGTTTCGACGACAGGAGGA GCGCCGCC ATGCTCC NAHADRLANRALDM GCATCAAGCCACACTGTCGTCGGAGCCAACACCACCGTCGCCACGGCG GTGGAGC TTCGCCT KSTLVECATHPGRNA GTCACCGCCGAACCCCCCCTGCTCCACCATCAAGCCTCGGAACTCACTGT ACCACGCG TGACGG CTTTLTTSAAAESPAS GCCCCCCCCCACGTGTGAGTTCCCCCGACGTCGATGTGCAGCTGCACAG CGCGCTGA AGGTAC PPPVGARDTPMADA TCCGCCACAGGAAGATCAGCACGAGGACGCCACCCANCACCCGGAGAG GCCGTACC ATCTCG GEERLADVDDGEVY CACGCAACACCAACCTCCTGAGGCAACCCGCTGGGGTTCGCCGCTCGCG AAGACCAA ACAGTC AAMRLGPGEVPERR CCCACGCTCGTTGCCTCCAGGATTGCTCAGCGACTCGGCGAGCTGGAAC GGNTTCCA GAACTT PRLRLRQLSDEELEAA CTCCACGCTGGGGCCCACCATTACCCCGCGCGATGGTTGTTTCGCGAAT GGCTCGCG CAACTC SEMVERLGAALSAKI TACAGCACGGCTTGAAGCTACACCTGCTCCTCGATGGGATCCGCCACTC CGCGCGTG GCAACA TDAEDWASAEGYITA CCACGCCGCGTCATCGCCTCCCGCATTGCCGACCGGCTTGTACCTGCGA GMGCTCC TATCCG LPYMLYDKLQSYSQA CGGCACCGTGTCGCTCGGCCCTGAACGCTGCATTCCCTTCTCCATCGCGA CAGCCAGC ATACAG PRGPQQPVLTRSPRG GACACGGTGACGGAATCGTTCACCCAGGAGGACAGGCAACTGGAACCC GACGCGTT TTACAA DDRPASSEPNASSTT CTGACGCGGCACGTGGACGAAGAGACGAAGGACAGCGAGCTGCCGGG CAGCAGCA ACCACA GGVASEHQPRRRRR CAGGGCGCCGACTGTACTCGACGAGGAGTCCAAGGACAATGACGCCAC GGGTCGG GTTAGA RGRRKGRRQRRNPR TGCCGGTGAGTGGCTCCTCCGGTTCGACGGAGCTTGTCGAGCGAACCCA CTGCGGCA TATCAG RSGREGATGGHQQH GGCCCAGGCGGAGCAGGTGCCGTGCTATTCAACCCTAGCGGTGCAGCG GCTCGACG ATAGGA KKHKPRPPRETQHHR TCATGGACCTGCTCGCACTTTATGCCGGGCGCCACGGAGACAAACAACA CTGCACGG ACCTTCC EHRLDEALDELHALE CCGCCGAGTACACGGCATTCCTCCTAGGCGCCAGAGCTGCTGCAGATCA GGACGGC TTTAGG RTDPHNRPAIAKARR CGGTGCGACGCTACTTCGCGTCCAGGGGGACAGCCAGCTCGTACTCAG AGCGACG AAGCTA RVGRIRSAINQQLLR GCAGGTCAAGGGCATTTACGGTGCGAAGAGCACCCGTCTACGGAGACT GCAGGCAC ACGGGT HKFDTDEKACVDGIL ACGCGACGCTGTTCGTGCCGAACTGGCTCGGGTGGGGCAATTCTCCCTC CAAGACCA ACACTG STARAERAARAATPS CACCACATCGACAGGCAGGACAATGCGCACGCAGACCGCCTTGCGAAC CGACGACC GATGGT PPASGAPTTTVSAPG AGAGCGCTCGACATGAAGAGCACACTAGTGGAATGCGCAACGCACCCC AAGCCGTC AAATAC AIVTNDDGTCPIPSDK GGCCGGAACGCCTGCACCACGACACTGACCACCAGTGCTGCCGCTGAAT GCTGCCGT ACATAC LWRHFDAVNTPRLD CACCGGCATCACCCCCGCCGGTGGGAGCTCGAGACACGCCGATGGCCG CCCCATGG ATTTC FDAEAPGSAAFRAA ATGCCGGCGAAGAGCGCCTCGCAGACGTTGACGATGGTGAAGTTTACG ACGTCGAC (SEQID MDHLPAATRLLDLLK CGGCGATGCGGCTTGGCCCAGGCGAAGTTCCTGAACGACGGCCACGAC CAAGGTGC NO: EAPSTDEIETQLQHV TCCGCCTCCGCCAGCTGAGCGACGAGGAACTGGAAGCAGCGAGCGAGA TCGGTGGC 1311) KASSSPGLDGVGYDV TGGTGGAGCGACTGGGCGCCGCCTTATCGGCCAAGATCACAGACGCGG CGACAGCG YKRFTIQLLPVLRAAF AAGACTGGGCGAGTGCGGAGGGCTACATCACGGCCCTACCGTACATGC GATCAMC RCCWLYKKVPQSWK TGTACGACAAGCTTCAGTCGTACTCACAGGCACCGCGTGGACCCCAACA ACCCGCTG LGVVRLLHKKGPRED ACCGGTCTTGACCCGGTCCCCGCGCGGCGACGATCGCCCAGCTTCCTCG TCGCCGCC PANWRPICLQQAIYK GAGCCCAACGCATCATCCACGACTGGAGGGGTGGCATCGGAGCACCAA ACCGGAGC LYTGILARRLTRWMD CCGCGGCGCCGCCGACGACGAGGACGACGCAAGGGCCGACGACAGCG TCCGAGTC ANDRHAPGQKGFRA CCGCAACCCGCGCCGATCGGGACGCGAGGGAGCTACAGGTGGGCATCA GGCGGCA VNGCGEHNFLAATLI ACAACACAAGAAACACAAACCACGCCCACCCCGCGAGACGCAGCATCAT AGCGCCGC DNARRKHRPLYEVW CGGGAACACCGTCTCGACGAGGCTCTCGATGAGCTCCACGCCCTCGAGC CGCCTGAA YDFRNAFGSVPFALL GTACCGATCCTCACAACCGGCCAGCAATTGCCAAGGCCCGCCGCCGCGT CGACGGC WDSLQRLGVPPDYV CGGTCGCATCCGGTCTGCCATCAACCAGCAGCTACTCCGCCACAAGTTC GACGACG DMCKGLYNQASFVV GACACGGACGAGAAGGCCTGTGTCGACGGCATCCTGTCTACAGCCCGC AAGACATC GNAVDGSTAPVEQR GCCGAACGCGCTGCACGAGCAGCAACGCCGTCGCCACCAGCATCAGGA CGCGAGCT VGVFQGCPLSPQLFN GCGCCCACCACGACAGTCTCGGCGCCCGGTGCCATCGTGACCAACGATG GGCCGAG AAISPLLYALRRLPDT ATGGCACGTGCCCCATCCCGAGTGACAAGCTTTGGCGACACTTCGACGC CTTCTGCT GVQLSSVDRPGASAY CGTGAACACACCGCGCCTTGACTTCGATGCGGAGGCACCCGGGAGTGC GCCCGACG ADDLKIFSGTKAGITQ GGCCTTCCGAGCGGCGATGGACCACCTGCCCGCGGCGACACGGTTGCT AGGAGGA QHELVATFLRWTGM GGACCTGCTCAAAGAGGCGCCGAGCACGGACGAGATCGAGACGCAACT GGCCGAC QANPAKCRSMGVRR CCAGCACGTGAAGGCCTCCTCCAGCCCCGGCCTCGACGGCGTCGGCTAC GACCACAA NTNGAVEADNVHLE GACGTGTACAAGCGCTTCACGATACAGTTGCTGCCTGTGCTGCGGGCGG ACCAGCGC LDDTPIPSMTHMQS CGTTCCGGTGCTGCTGGCTGTATAAGAAGGTTCCTCAGAGCTGGAAGCT CCAGGTTA YTYLGIGDGFDHVRR CGGCGTGGTGCGCTTGCTGCACAAGAAGGGCCCGCGTGAGGACCCTGC CCCGCGAC RVELAPKLKTLKHDVT GAACTGGAGACCCATCTGTCTCCAGCAAGCCATCTACAAACTATACACG CAGCGCTC ALVESGLAPWQVVK GGCATCCTCGCGCGGCGACTAACCCGGTGGATGGACGCGAATGACCGC ATCCGGCC AVKVYLYPRVEYALR CACGCACCCGGTCAGAAGGGATTCCGAGCGGTGAACGGCTGCGGGGA TCCGTCCT HLRPDDQHLESFDLH ACACAACTTCCTCGCAGCTACGCTCATCGACAACGCCCGCCGCAAGCAT CGCTGTGT LRRGLRHLLRLPKNAT CGGCCACTGTACGAAGTCTGGTACGACTTCCGTAACGCGTTCGGCAGCG ACGCGCAC NEFFYAPVSRGGLGL TGCCGTTCGCGTTGCTGTGGGATTCGCTACAGCGCCTCGGTGTCCCGCC AACGCGCA LPLVELHAALQIAHG CGACTACGTGGACATGTGCAAGGGACTCTACAACCAAGCCTCGTTCGTC GCGCTTCA WQTLHSPDPAIRRVA GTCGGGAACGCAGTGGATGGCAGCACAGCGCCGGTAGAGCAGCGCGT ACTGCACG REQLYQIADARHRLD CGGCGTTTTTCAAGGCTGTCCGCTCAGCCCCCAGCTCTTTAACGCCGCCA TTGTGCGT KDHWPHRREELCELL TCAGCCCACTGCTCTATGCGCTGCGCCGGCTACCAGATACCGGCGTGCA GTACACGG LNGELGTSAHAPPKR GCTCTCCAGTGTTGACCGCCCTGGTGCCTCAGCGTACGCCGACGACCTT CTGCCAGC RNGDIGSLWVDVRK AAGATTTTCAGCGGCACCAAGGCGGGCATCACGCAGCAACACGAGCTG TTCGCTGC NLKTFGLKVATAPAN GTGGCGACGTTCCTGCGGTGGACCGGGATGCAGGCCAACCCTGCCAAG TCTTACGC QETGVPAQPLQLRVP TGCAGGTCCATGGGCGTGCGGCGCAACACCAACGGMGCCGTCGAGGC GACACAG HHAEWLDHGNVLRH AGACAACGTCCACCTCGAGCTCGACGATACGCCGATCCCCTCCATGACA GGACTCTC VKLHIKNLHWQTWC CACATGCAATCCTACACGTACCTGGGGATCGGCGACGGCTTCGATCACG GGCACCG ALSDQGKTARVHGG TGCGCCGCCGCGTCGAGCTCGCTCCCAAGCTGAAGACTCTCAAGCACGA GCGCGTGA VGSAFLTRPRGMWE CGTGACGGCGCTCGTGGAGTCCGGCCTTGCGCCATGGCAGGTGGTGAA CCTTCCTG SDYRFAVAARLNVVD GGCCGTGAAGGTGTACCTCTACCCTCGGGTAGAGTACGCCCTCCGGCAC GACAGGTT TVNTLSRRRLRAHDR CTCCGACCCGATGATCAGCACCTGGAGAGCTTCGACCTCCACCTTCGCC CTCGGCGG CRYPACRWKETLAH GCGGCCTCCGCCACCTGCTTCGCCTTCCCAAGAACGCCACGAACGAATT GTTGCGCG VLNHCPGTMDAVRG CTTCTACGCGCCGGTGTCGCGCGGTGGTTTGGGGCTGCTGCCGCTGGTG TGCGGCAA RHDDALKEIEHTLRAS GAACTGCACGCAGCACTGCAAATCGCGCACGGATGGCAGACGCTCCAC ACCTTTTG SGDRRELRVNQTVP TCACCCGACCCAGCCATCCGTCGCGTCGCGCGCGAGCAGCTCTACCAGA CCTCGAGG GLPGPPLRPDIQVYN TTGCCGACGCGCGGCACCGGCTCGACAAGGACCACTGGCCACACCGGA CTGGCCGC HDKRTVAVVDLAVAF GAGAGGAACTTTGCGAGCTCCTCCTCAACGGGGAGCTCGGGACGTCGG AGCAAGAC DEQPSEDPESSGLAK CCCACGCACCGCCCAAGCGACGCAACGGTGACATCGGCTCTCTGTGGGT ACGCACAA AVQIKKAKYAGIKEHL CGACGTCCGCAAGAACCTCAAGACGTTTGGACTGAAGGTGGCCACAGC ACGTGCGC ENQGWKVHLSAIVY GCCGGCCAACCAGGAGACCGGTGTACCAGCCCAGCCGCTGCAGCTCCG CAGCCTCA GSLGSVAASNHKVYT CGTGCCTCACCACGCCGAGTGGCTGGACCACGGCAACGTGCTGAGGCA GCACCACA EHLGLLKRDAKRLDR CGTGAAGCTACACATCAAGAACCTCCACTGGCAAACCTGGTGCGCCCTA CTGGTCGC QLSSACIQSSRRIWN AGTGACCAAGGGAAGACCGCACGGGTGCACGGCGGTGTTGGCAGCGC GGTTTCGA FHCAKHRARQHEHQ ATTCCTGACCCGACCTCGCGGCATGTGGGAAAGCGACTACCGCTTCGCG CGACAGG APPSQATRGRRVTET GTGGCCGCTCGCCTCAACGTGGTCGACACCGTCAACACACTGTCGCGCC AGGAGCAT GGNPSRTDRR(SEQ GGCGACTCCGCGCCCACGACCGGTGTCGGTACCCGGCCTGCAGGTGGA CAAGCCAC IDNO:1433) AGGAGACTCTGGCACACGTCCTCAACCACTGCCCGGGTACCATGGACGC ACTGTCGT GGTACGTGGCCGGCACGATGACGCGCTCAAGGAAATCGAACACACGCT CGGAGCCA CCGCGCGTCGTCGGGCGACCGCCGCGAGCTGCGTGTTAACCAAACGGT ACACCACC GCCTGGCCTGCCCGGGCCACCCCTGCGCCCCGACATCCAGGTCTACAAC GTCGCCAC CATGACAAACGCACGGTCGCAGTTGTAGACCTGGCTGTGGCGTTCGATG GGCGGTCA AGCAGCCGAGTGAGGACCCAGAGAGCTCGGGGTTGGCCAAGGCAGTC CCGCCGAA CAGATCAAGAAAGCCAAGTACGCGGGCATCAAGGAGCATCTCGAGAAC CCCCCCCT CAGGGGTGGAAGGTTCATCTCTCGGCCATCGTGTACGGCTCGCTTGGCT GCTCCACC CGGTGGCGGCCAGCAACCACAAGGTGTACACTGAACACCTTGGGTTGC ATCAAGCC TCAAGCGCGATGCAAAGCGGCTGGACCGGCAGCTCTCGTCGGCATGCA TCGGAACT TCCAGTCCAGCCGCCGCATCTGGAATTTCCACTGCGCCAAGCACCGAGC CACTGTGC ACGCCAGCACGAGCACCAAGCACCACCAAGCCAAGCGACACGGGGCAG CCCCCCCC GAGGGTGACGGAAACCGGGGGGAATCCGTCGCGCACCGACCGCCGCT ACGTGTGA AGACGGCACAGTTCTGGCCCACGTAGGCCGAAAGGGCCCCACCCATGT GTTCCCCC AGGGAACCGCCCTCGGGAAATCCATTCCGGTGTTCGACTGAAGAGATG GACGTCGA CTCCTTCGCCTTGACGGAGGTACATCTCGACAGTCGAACTTCAACTCGCA TGTGCAGC ACATATCCGATACAGTTACAAACCACAGTTAGATATCAGATAGGAACCT TGCACAGT TCCTTTAGGAAGCTAACGGGTACACTGGATGGTAAATACACATACATTT CCGCCACA C(SEQIDNO:1065) GGAAGATC AGCACGA GGACGCCA CCCANCAC CCGGAGA GCACGCAA CACCAACC TCCTGAGG CAACCCGC TGGGGTTC GCCGCTCG CGCCCACG CTCGTTGC CTCCAGGA TTGCTCAG CGACTCGG CGAGCTGG AACCTCCA CGCTGGG GCCCACCA TTACCCCG CGCG(SEQ IDNO: 1188) NeSL Utopia- Phytophthora AGCTCGGCCTCGCGGCTGCCTTCCCAGGCGCCGCCGACTTCGCGCTCTG AGCTCGGC TAAGCT MLADPAALAAGLAR 1_PI infestans GCGCGGCCCACACGCCGCCGCCGAGCCTCCAAGCGCGCCCGTTGGCTTT CTCGCGGC GGTCAT APPPPSAPQDPSPAF CGCAGACGCAGGGCTCGCGGCGACGGCCCTCGCCAGCCCCCAAGACCC TGCCTTCC CATGAC PAGPAGQNPRAAAP CCCCTACGATGTGGCACCACCCGGCAGGGCGGCCGGCAGGCTGCCCGA CAGGCGCC CGACAG ARVEVHTVVAPPGR CTCGGTATCGCCGGGTGCTACACTCTCAGCCGCCACAGCTCGGGCCTTG GCCGACTT GGCACT AGGMLPDPGLVEEPI GCGGTCCGCCATTGGCCCTTGGAGCTCGACAGCGACAGCAGCGACGAC CGCGCTCT ACCCAG QATYAHDAAQFECA GAGGACGCTCAAGACCCCCACGCCGCCGCCCCAGAACCCCCAGAAGAC GGCGCGG GTAGGG LCPYVAESMAVLVQ GTCGCGAGTGTGCTTGCCCCACCCGGCAGGGCAGGCAGCATGCTCGCC CCCACACG AACCGC HRRSAHRGTRFKDIF GACCCAGCAGCGCTGGCAGCCGGTCTGGCTCGCGCTCCGCCGCCCCCAT CCGCCGCC CCTTAAA TSGCQCSLVFYARIVA CGGCACCCCAGGACCCAAGCCCCGCCTTCCCTGCTGGCCCCGCCGGCCA GAGCCTCC AAACCC ASHAVACARRNQRA GAACCCTCGCGCCGCCGCCCCAGCCCGTGTTGAAGTCCACACAGTCGTG AAGCGCGC AGGAAG VPPAPTPVAPTRPEA GCACCACCCGGCAGGGCGGGCGGCATGCTGCCCGACCCGGGGTTGGTT CCGTTGGC ACACAA TPQPTGYLAAAMTA GAAGAACCTATCCAAGCGACGTACGCGCACGACGCTGCCCAGTTCGAG TTTCGCAG ACACCCT AAAAASSDTVVAAAT TGTGCGTTGTGCCCGTACGTCGCTGAGAGCATGGCAGTGCTCGTTCAGC ACGCAGG CCACTTA NMQSAVPAAAKTTG ACCGACGGTCGGCTCACCGCGGCACCCGCTTCAAGGACATCTTCACCAG GCTCGCGG GTGACA LQLVPPELEPALPQR CGGGTGCCAGTGCAGCCTAGTGTTTTACGCGCGCATTGTTGCCGCCAGC CGACGGCC TACATAT ASCHAGKRRRLNAD CATGCAGTGGCCTGTGCCAGGCGCAACCAGCGAGCCGTGCCACCAGCT CTCGCCAG TTTAGCC EAVTPCTPTARVSPQ CCGACCCCTGTTGCCCCCACGCGGCCCGAGGCTACCCCACAACCTACTG CCCCCAAG TAGATTT TEVAMAPHDAPQD GGTACCTCGCCGCGGCGATGACTGCCGCTGCGGCAGCCGCATCAAGCG ACCCCCCC CAGTTA DTVLQREAAEPQPDP ACACCGTAGTGGCAGCCGCCACCAACATGCAGAGCGCCGTGCCAGCCG TACGATGT CGGAGA AATPGAQVQRVEDT CCGCCAAGACCACTGGGCTTCAGCTCGTCCCCCCCGAGCTGGAGCCTGC GGCACCAC GGTTAC TAAQDDTVQQDHD CCTTCCGCAGCGTGCTTCTTGCCATGCAGGCAAACGCCGTCGGCTCAAT CCGGCAG TAACTG ADTAQVSPPRRTPTR GCCGACGAGGCGGTGACACCGTGTACGCCTACGGCGCGGGTGTCGCCG GGCGGCC GTAAAT WGPRPSSTQEPSPM CAAACAGAAGTGGCGATGGCGCCTCACGACGCACCCCAGGACGACACC GGCAGGCT ACGAAC TGEPAATLAARRPLT GTCTTACAGCGTGAGGCCGCCGAGCCACAGCCAGACCCAGCAGCAACG GCCCGACT ACATATT PAATGTRATRWGPC CCGGGTGCCCAAGTGCAGCGAGTCGAAGACACAACTGCGGCGCAGGAT CGGTATCG CTGTTCT HRAIGAAAIARLVTGL GACACTGTGCAGCAGGACCACGACGCCGACACTGCCCAGGTCTCGCCG CCGGGTGC AATCAG STEPAQPQRRQPPPP CCCCGGCGCACACCCACACGCTGGGGACCAAGACCAAGCTCAACTCAA TACACTCT TGTGAA QEPPSQPEPLAAAAT GAGCCTAGCCCCATGACAGGCGAGCCGGCTGCCACCCTGGCAGCGCGA CAGCCGCC AACTGG AAADIAATVAADIAA AGACCACTGACTCCTGCCGCAACGGGCACTCGTGCCACGCGGTGGGGG ACAGCTCG TTTTCGC AAANAAMDVDGGP CCATGTCACCGCGCGATCGGGGCCGCAGCCATCGCACGACTTGTCACGG GGCCTTGG CTTTTGG AADETWLLRFDGAC GCCTTTCAACGGAGCCAGCGCAGCCTCAGCGACGGCAGCCACCGCCAC CGGTCCGC CGGACT RRNPGPGGAGAALF CACAAGAACCTCCATCGCAACCAGAGCCGCTAGCAGCAGCAGCAACAG CATTGGCC TTTTCAC APSGAVVWTCSHYM CCGCAGCAGACATAGCAGCAACAGTTGCAGCAGACATTGCGGCAGCGG CTTGGAGC TCGCATT PSRSETNNTAEYTALL CAGCGAACGCAGCAATGGACGTGGATGGTGGACCGGCAGCCGACGAG TCGACAGC TTTGGG LGVQSAVHHGASHL ACATGGCTGTTGCGCTTTGACGGCGCATGCCGACGGAACCCAGGACCT GACAGCA CAATCG EVEGDSSLVIAQVKG GGTGGAGCAGGTGCCGCACTCTTTGCGCCCAGCGGTGCCGTGGTGTGG GCGACGAC TCTGCG TFACRNARLRQLRNR ACGTGCTCGCACTACATGCCAAGCCGCTCCGAAACGAACAACACCGCCG GAGGACG GCTAGC VRHALRSVDTHKLRH AGTACACGGCGTTACTGCTCGGCGTACAGAGCGCAGTCCACCACGGTG CTCAAGAC TTGCTA IDRQANAHADRLAN CCTCGCACCTGGAAGTCGAGGGCGACAGCAGCCTGGTAATCGCTCAGG CCCCACGC GCGGCG RALDQRRTSSECGTH TAAAGGGCACCTTCGCATGCAGGAATGCGCGGCTCCGCCAACTCCGCAA CGCCGCCC GACGAG GSCMDSCLAVPTALA CCGAGTGCGTCACGCGCTCCGCTCCGTGGACACGCACAAACTCCGGCAC CAGAACCC CGGTCT AQETPPAAPPSTSAT ATCGACCGCCAGGCCAACGCGCATGCAGATCGCCTTGCCAACCGCGCGC CCAGAAGA CCGGGG PAEGNAMDDIAAEIA TAGACCAGCGGAGGACCAGCAGCGAGTGCGGGACGCATGGCTCGTGC CGTCGCGA GCGTTC ARDEGETFPVLPIGP ATGGACTCCTGCCTCGCCGTACCCACGGCCCTGGCGGCGCAAGAAACGC GTGTGCTT ACCTTTC GSAPERQPRLRLRQL CACCGGCAGCACCACCCAGCACCAGTGCCACTCCAGCCGAAGGTAATGC GCCCCACC CCCCGC SDEERDAAADALQEL CATGGACGACATCGCAGCCGAGATTGCTGCGCGTGACGAGGGCGAGAC CGGCAGG GAGGCC ADTMASKIEDADSW CTTCCCAGTCCTGCCAATTGGCCCAGGCTCAGCCCCAGAACGGCAACCC GCAGGCA AACTAC TSGEGYISSIPERIREV CGACTCCGTCTGCGCCAACTTAGCGATGAAGAGCGCGACGCAGCGGCA GC(SEQID ACCGAT LQPYATAPPQPGRSR GATGCCCTGCAGGAGCTCGCCGATACCATGGCGAGTAAGATCGAGGAT NO:1189) CTTCTCT RQQRRRPPRVTRNQ GCTGACAGCTGGACCTCCGGCGAAGGATACATCAGCTCCATCCCCGAGC ACACTTT REHRLDEALDDMAA GTATCCGAGAGGTACTGCAGCCATACGCTACGGCCCCGCCGCAGCCGG TCTAATT TQQATPRDQRAVRR GACGCTCTCGCCGGCAACAACGCCGTCGACCACCACGGGTGACGCGCA CGCCTCC ARRRVGRVRASMAQ ACCAGCGCGAACATCGACTCGACGAAGCTTTGGACGACATGGCAGCCA GTCTTC QELRHEFAKDESKCV CACAGCAAGCGACCCCGCGAGACCAGCGGGCTGTGCGCCGTGCGCGGC GGTCTT AKILKTASTETAAEDE GACGTGTGGGCCGGGTGCGAGCCTCAATGGCACAACAGGAGCTGCGCC CGGTTG HPETCPIDAATLHAH ACGAGTTTGCCAAGGACGAGTCTAAGTGTGTGGCCAAGATCCTGAAGA TCGGGC FTGVNAPRTDFDYDA CAGCGTCGACAGAGACGGCAGCGGAGGACGAGCACCCGGAGACGTGC TTTTTTC TSGREFRAAMSDLPP CCCATTGATGCAGCTACGCTCCACGCCCATTTCACGGGAGTCAACGCGC TTTTTGA ATVEIDAFDAELTIDE CAAGGACGGACTTCGACTACGATGCCACCAGCGGCCGGGAGTTTCGAG CCAATC VEDQLTRAAKTSSPG CGGCGATGAGCGACCTTCCGCCAGCCACCGTGGAGATTGATGCGTTCG AGAGCG HDGIDYGIYSRFAAQ ACGCGGAGCTGACGATCGACGAGGTGGAGGACCAGCTCACACGCGCG CGCCAT LVPLLHAVFQFCWRH GCCAAGACCTCAAGCCCCGGCCACGACGGCATCGACTACGGCATCTACA GCGCCT RRVPRLWKVGIVRLI GCCGCTTCGCCGCGCAGCTCGTTCCGCTCCTCCACGCGGTGTTCCAGTTT CTTCTG HKKGDPRQPTNWRP TGCTGGAGACACCGTCGTGTGCCAAGACTGTGGAAGGTGGGCATCGTA GCCAAT ICLQPTIYKLYSGLLAH CGCCTCATCCACAAGAAGGGCGACCCAAGGCAGCCGACTAACTGGAGG CAGAGA RLSRWLEGNDRLPM CCGATCTGCCTTCAGCCCACGATCTACAAGCTCTACAGCGGGCTGCTGG CCGGGC AQKGFRAFNGCHEH CGCATCGGCTCTCACGCTGGCTGGAAGGTAACGACCGCCTGCCGATGG CCTGTCC NFMATTLLDQTRRQ CCCAGAAGGGGTTCAGGGCCTTCAACGGGTGCCACGAGCACAATTTCAT TCGGAC HRKLYQVWYDLRNA GGCTACCACGTTGCTGGACCAGACCCGCCGGCAGCACCGAAAGCTGTA AGCGAG FGSLPQQLMWRVLR CCAAGTGTGGTACGACCTTCGCAACGCCTTCGGCTCGCTTCCTCAGCAG GCCTCC HLGVDSGFIDRCRDIY CTCATGTGGCGCGTGCTTCGCCACCTTGGCGTCGACTCAGGCTTCATCG ACGGCC RDSAFVVANAADGA ACCGCTGCAGGGATATCTACAGGGACTCGGCCTTCGTGGTGGCGAACG AGCCAA TDPVRQEVGVYQGC CGGCGGACGGTGCGACGGACCCGGTGCGCCAAGAGGTGGGCGTATAC TCGAGT PLSPLLFVAALVPLVR CAAGGCTGCCCACTCAGCCCGCTCCTGTTCGTCGCGGCGCTGGTGCCTC CTCGGC RLEKLDGVGVPLADG TCGTGCGCCGTCTCGAGAAACTCGACGGCGTTGGTGTGCCACTGGCGG AGCGAC VRPCTTAYADDLKVF ACGGTGTGCGGCCCTGCACCACGGCCTACGCCGATGACCTCAAGGTGTT GCGTCT SDSAAGIRKCHDTVA CAGCGACAGCGCCGCCGGCATCCGCAAGTGCCACGACACAGTGGCTGG TTCTATA GFLAWTGLRANPGK CTTCCTCGCGTGGACAGGGCTGCGCGCTAACCCTGGCAAGTGTGCCAGC GCGCAG CASLAVTTNARGNPT CTCGCAGTGACGACGAACGCCCGGGGCAACCCGACTAGAGACAGCAGC CTGACG RDSSMRLEVHDAAIT ATGCGTCTCGAGGTCCACGACGCCGCCATCACCACGCTCAGCCTCCACG AGGCCG TLSLHESYRYLGVGD AAAGCTACCGCTACCTGGGCGTGGGTGATGGATATGACCACGTGCGCC ATCTGG GYDHVRHRLQLEPKL ATCGTCTCCAGCTCGAGCCAAAACTCAAGCAGCTCAAGCGGGAGGCGG CGGCCC KQLKREAVALLTSGL TGGCGCTGCTGACGTCGGGTTTGGCACCGTGGCAGGTGGTGCGCGCGC CCGATT APWQVVRALKVYVY TCAAGGTGTACGTCTACCCCAAGGTGGAGTACGCGCTGCGCCACCTGCG GGTCCG PKVEYALRHLRPLQS CCCGCTGCAGTCGCAACTGCAGGCGTTCGACCGTGTCGTCTCCAAGGGC ACTTTCG QLQAFDRVVSKGLRH TTGCGACACCTGCTGAGCCTGCCTCGCTCAGCCACCAGCGAGGTGCTGT GCCAAT LLSLPRSATSEVLYAP ACGCCCCGACGTCCAGCGGAGGCCTGGGCCTGCAGCCGCTCGTGGAGC CAGCGA TSSGGLGLQPLVELH TGCACCGCGCCCTACAGCTCGCCCACGCGTGGCAGATGCTGCATTCCAA CGACGA RALQLAHAWQMLH GGACCCAGCCATCCAAGCAGTGGCAAGAGCGCAGGCTTGCCAGGTGGT GGGGGC SKDPAIQAVARAQAC GCGCAAGCGCTACCGCCTGCAAGAAGACCATTGGCGCGGGCGCGATGA AGGGGT QVVRKRYRLQEDHW CGAGCTCGTGCGCAGCTTCCTGAACTCCGAGCTCGCCGCCTCGCCACAC TTACACT RGRDDELVRSFLNSE GCTGAGGTCCTCCGTCGCAACGGTGACATTGCCTCGCTGTGGTCCGACG TTTGCCC LAASPHAEVLRRNGD TGCAGCGGTGGCTGCGCATCTACCACCTGCGGTTCGAGCACTGCGACGA CCGTTTC IASLWSDVQRWLRIY GACAGAGGCCCATGGCCCGCTCAGCTTCCGCGTGCCGCACCACAACAA GACTTC HLRFEHCDETEAHGP GTGGCTTACGCACAAGACTGTGCTGCGGCACGTGAAGCTACACCTGAA AACTTCA LSFRVPHHNKWLTH GATCCGACACCAGACCCGATGGAAGGGCATGGTGGACCAGGGCAAGA GGCCAA KTVLRHVKLHLKIRH CGGTGCGCGTGCACGGTGGGGTGGGCGCCAAGTTCATGACGACCGGA AATGGC QTRWKGMVDQGKT GCGGGCCTGTCGGACGACGACTACCGCTTCGGGGTGAAGGGCCGCCTC GATTTG VRVHGGVGAKFMTT AACCAAGTCGACACGAACTCGGTGCTCAAGCGCAAACGACTGCGGGCA GACCCT GAGLSDDDYRFGVK CACACGACCTGCAGGGACCCAACCTGCTCCTCGGCTGAGACGCTGGCAC CCACGC GRLNQVDTNSVLKRK ATGTACTCAACCACTGCGAGTCCAACATGGACGCCATTCGCCAGCGTCA GCCGTG RLRAHTTCRDPTCSS CGACGACGCCCTGGAGCAGATTGGGTCTAAGATCCGCGGAGCGCTCGA CCACTG AETLAHVLNHCESN CCGCGCCAAGTCTCCCACAGAGCTGCGCCTGAACCAGACTGTACCCGAG CTCGGC MDAIRQRHDDALEQ TACACAGGCGCAGCTCTACGCCCAGACATTGTGCTTAGAAACGTGGCCG ACCGGC IGSKIRGALDRAKSPT CCAAGACGATGGTGATCGCCGACTTGGCGGTCACGTTCGAAGACCAAG GGCGAT ELRLNQTVPEYTGAA CGGCTCGCGCTCGTCACTCCTCGCTGCAGCTCAGCCACGACCACAAGAC TCAGCG LRPDIVLRNVAAKTM TCTCAAGTACCAACCAATCGTGGCGGAGCTGCAGCACAAGGGATGGCG GGTGCA VIADLAVTFEDQAAR AGTGCAGACTGCTGCCATAGTGTACGGCACGCTGGGCTCGGTGCAGCC ACTTCG ARHSSLQLSHDHKTL CAGCAACTTCAAGGCCTACACTGAGAAGTTCAAGCTTCACAAGCGCGAG GGCACG KYQPIVAELQHKGW GCTAGGCAGCTGGATCTCCAGCTGTCTAGCCACTGCATTCGCGCCAGCC TGTGCA RVQTAAIVYGTLGSV ACCGCATCTGGGGCTGGCACTGCCGACAGCACCGCGATCGGCAACGAA ACACAT QPSNFKAYTEKFKLH GCGGCACTGCCTCGCGAGCGTCGCGTGGGTCTGGGGGGGCCCCGCGGC GCAGCG KREARQLDLQLSSHCI GCACATCGCAGGCACCGGCACGGCGATAAGCTGGTCATCATGACCGAC CCCATTG RASHRIWGWHCRQ AGGGCACTACCCAGGTAGGGAACCGCCCTTAAAAAACCCAGGAAGACA CACGCC HRDRQRSGTASRASR CAAACACCCTCCACTTAGTGACATACATATTTTAGCCTAGATTTCAGTTA AAGCGG GSGGAPRRTSQAPA CGGAGAGGTTACTAACTGGTAAATACGAACACATATTCTGTTCTAATCA CATCGC RR(SEQIDNO: GTGTGAAAACTGGTTTTCGCCTTTTGGCGGACTTTTTCACTCGCATTTTT GGGACG 1434) GGGCAATCGTCTGCGGCTAGCTTGCTAGCGGCGGACGAGCGGTCTCCG ACGCCT GGGGCGTTCACCTTTCCCCCGCGAGGCCAACTACACCGATCTTCTCTACA CGGCCG CTTTTCTAATTCGCCTCCGTCTTCGGTCTTCGGTTGTCGGGCTTTTTTCTTT CCCAAG TTGACCAATCAGAGCGCGCCATGCGCCTCTTCTGGCCAATCAGAGACCG CGCAGC GGCCCTGTCCTCGGACAGCGAGGCCTCCACGGCCAGCCAATCGAGTCTC CCCGCC GGCAGCGACGCGTCTTTCTATAGCGCAGCTGACGAGGCCGATCTGGCG CTTCCAG GCCCCCGATTGGTCCGACTTTCGGCCAATCAGCGACGACGAGGGGGCA CACGAC GGGGTTTACACTTTTGCCCCCGTTTCGACTTCAACTTCAGGCCAAAATGG CTCGCG CGATTTGGACCCTCCACGCGCCGTGCCACTGCTCGGCACCGGCGGCGAT CCGTTT TCAGCGGGTGCAACTTCGGGCACGTGTGCAACACATGCAGCGCCCATTG GGCGGA CACGCCAAGCGGCATCGCGGGACGACGCCTCGGCCGCCCAAGCGCAGC TCGCCAT CCCGCCCTTCCAGCACGACCTCGCGCCGTTTGGCGGATCGCCATCAAGA CAAGAC CGTGCGAGAGCCAGGCGGGGTCGGGCAAAATATACTTACTCTAAGTAT GTGCGA GCCCGAATCCCTGCCCTCTCAGGCTGAACGCGGCCCCATACTTGATCTAA GAGCCA GTATGGGAGGATCCCTGGCCTCTCAGGCTGTACGCGAGACCC(SEQID GGCGGG NO:1066) GTCGGG CAAAAT ATACTTA CTCTAA GTATGC CCGAAT CCCTGCC CTCTCAG GCTGAA CGCGGC CCCATAC TTGATCT AAGTAT GGGAGG ATCCCTG GCCTCTC AGGCTG TACGCG AGACCC (SEQID NO: 1312) NeSL Utopia- Patiria CTGATGTGGATACCTTGGAATTACTCAACCGTGTCGGAGTCTTTTGTCTT CTGATGTG GATTAG MCLKSFSSTSGLRRH 1_PMi miniata TTGCGCCCAACACCTCATGGATACCATGCTTGTCGCTGGAGCGACGTTA GATACCTT CGAACA MARLHRQPSPDAST CAAGCGTGAGGGCGCCCTCCATGCCGGACAGCTGGTCTGTGCCATGTG GGAATTAC CTAATAT PSTMTEVFPYPLPKV CTTGAAGTCGTTCTCCTCGACTTCGGGCTTGCGGCGGCACATGGCCCGG TCAACCGT CCTGCC WPCVVCRENFYHNQ CTGCACCGGCAGCCTTCTCCCGACGCCTCCACGCCCTCGACTATGACGG GTCGGAGT MTAGAC TLKRHQKNFHPNVDL AAGTGTTCCCGTACCCCCTTCCGAAGGTGTGGCCGTGCGTGGTGTGCCG CTTTTGTCT GTGATT TTVYQCSVCGQEFVT CGAGAACTTCTATCACAATCAAACCTTGAAGCGTCACCAGAAGAATTTTC TTTGCGCC GCTAAT GRKASFHFKVHRRM ACCCCAATGTTGACCTCACCACCGTCTACCAGTGCTCAGTGTGCGGGCA CAACACCT CCGCAA SASAIPTPSAMPSSP GGAGTTTGTCACCGGCAGGAAAGCATCCTTCCATTTCAAAGTTCATCGG CATGGATA ACCAAC MDLIRGLVGEPLPPS CGGATGTCAGCCAGCGCCATCCCTACCCCGTCTGCAATGCCCTCCTCACC CCATGCTT CGGATC PARTPPPLARYISPAP GATGGATTTGATCCGAGGCCTCGTCGGGGAACCCCTGCCTCCTAGCCCT GTCGCTGG TACAGC RSSFSPPWNPSPPPR GCCCGTACACCGCCACCACTTGCCCGATACATCTCCCCAGCGCCCCGGTC AGCGACGT CTGAAC SPTPLPRPLTPPPRSP TTCTTTTTCACCACCCTGGAATCCCTCCCCACCGCCCCGGTCGCCCACCCC TACAAGCG ACTGAA SPPPRSPTPPPPVTLT ACTGCCACGGCCACTCACTCCACCGCCACGATCACCGAGCCCGCCGCCC TGAGGGC CTTTAAT TAPVTEPAVPVALTT CGGTCGCCAACACCACCTCCCCCGGTCACCCTCACCACCGCCCCGGTCAC GCCCTCCA CTTCACC AQVTEPSAPAVHTAA CGAGCCCGCCGTCCCGGTCGCCCTCACCACCGCCCAGGTCACCGAGCCC TGCCGGAC CATGTC PVTLSNAPVTEPATP TCCGCCCCGGCCGTCCACACCGCCGCCCCGGTCACCCTCTCCAACGCCCC AGCTGGTC ACATCT ATDPATPVTRLHSPV GGTCACCGAGCCCGCCACCCCGGCCACCGATCCTGCCACCCCGGTCACC TGTGCC GGACAC THISCSISFTASHAPYS AGACTCCACTCACCGGTCACTCACATCTCATGCTCGATCTCCTTCACGGC (SEQID TAGGTTT CAAPTSPSVYACSPR CAGCCACGCCCCCTATTCCTGTGCAGCCCCGACATCTCCGTCCGTCTACG NO:1190) TTGCTCT RRQCSSTIAAVCNSE CCTGCTCACCCAGGCGTAGACAATGCAGTTCCACCATCGCAGCAGTCTG GTTTGT ASSGNPCLLALPVHR CAATTGACGTGTGGCTGGCCGAGCTGTCACGCTCGGCTGACTTCGAGTC GTTTTCC HHLPDTSPQRPGLLF TTTTGAGGATGTGTGTGATCGGTTTGTGGAGTTTGCCGCCGCTGAGGGT TGCCTTT HHPGIPPHRPGRPPH CGAAACAACGGACGCCCAGCCCGCCCAGCGCACCAACCGCCCCGTGAC WCMTT CHGHSLHRHDHRAR CGCGGGAACCAGGGACCTCGCCCCCAACGTCCCCCTCGTCCCCACCGCC GGAWCT RPGRQHHLPRSPSPP CCGGGCTTGGCCCGGAGTTTGATGCACAGGAGGCCAGTCGTCTGCAAA TTCGCM PRSPSPPSRSPSPPPR AGCTGTATCGGACCAGCAAGAAGCGCGCCATCAGGACTATCTTGACCG CTGGAA SPSPPPRPSTPPPRSP GCTCTGATGTCCGATACAGCGGCTACCGAGGTCCTCACTAGTGGATTTC TTTATTT SPTPRSPSPPPRPPILP TTTCACTTGAAGTATTTTCCGCTCGTGAGGTCGACACCGACACCATTGCC GTCGCT PRSPDSTHRSLTSHA ACGGATACCAGCCTCCTGTTTCCGAACTCCGCCCAAGCCCGTGAGTCCG TGGATT RSPSRPATPPIPVQPR GCCAAGACCTGCTTCGCCCAGTCACCCAGCGTGAAGTCTCCCTCCGTCTT ATTTTTT HLRPSTPAHPGVDN GGCCGGATGTCCAACTCCGCACCAGGGAAGGACCGGTTGGAGTACCGT TTCTCAC AVPPSQQSAIDVWL CACATCCGCCAGGTTGATGGCGCATTCCGCGTCACCCTGGAAATCTTCA AATTTG AELSRSADFESFEDVC ACCGGTGCCTGCGTGAAAGCCGGGTCCCATCCTCCTGGAAGACAGCGA GATCTAT DRFVEFAAAEGRNN CCACCGTGCTGATCCACAAGAAGGGGGATGCCACCGACCCGGCCAACT TTTCGTT GRPARPAHQPPRDR TCCGCCCCATCGCCCTCCAGTCCTGCCTCTACAAGCTGTTGATGGCGATC TGTTCAC GNQGPRPQRPPRPH TTGTCCGACCGTGTCACCACATGGGCCCTGGACAACGATCTCATCAGCA TTCGAA RPGLGPEFDAQEASR GCTCGCAGAAGAGCGCCCGCCCCGGTGAAGGTTGCTACGAACACACAT CTCTAGC LQKLYRTSKKRAIRTIL TTCTTCTCTCCACCGTCGTGAAGGATGCCCGGCGTAACCAGAAAAACAT TGCCCTT TGSDVRYSGYRGPHP GTACGCCGCCTGGCTGGATTTACAAAATGCATTCGGCAGCATCCCCCAC TCTTCG ALMSDTAATEVLTSG GACGCAATGTTCACCGTGTTAACCTCCATCGGGGCCCCGGAGGGCCTGG GACACT FLSLEVFSAREVDTDT TCTCCCTGGTGCGCGATGTGTACACTGACGCCTCCACCGACTTCGTCACA GAACTTT IATDTSLLFPNSAQAR CCCACCGGCCGCACTGCTGCAGTCCCGATTCACTCAGGCGTGAAGCAGG AATCTTC ESGQDLLRPVTQREV GCTGCCCGATCTCTCCAGTGCTTTTCAACCTGACGCTGGAACTTATTATC GCCATG SLRLGRMSNSAPGK CGAGCAGTGAATGCTTCAGCGACTCGGGACCGCAGTGCGCCGGTGGTC GCTGTC DRLEYRHIRQVDGAF CATGGCCAGGCCGTCCCCATACTTGCGTATGCTGACGACCTGGTCATAC AGTCGC RVTLEIFNRCLRESRV TCAGCCGATCCAGTGACGGCCTGCAGAGCCTCCTGACAACAGCATCCAT CGGTTC PSSWKTATTVLIHKK CATGGCCACCAAGATCCAGCTGAAGTTCAAGCCCGCCAAGTGCGCTTCC ACTTGCT GDATDPANFRPIALQ CTCTCCCTCGAGTGCCGACGGGGGACCAAGGTACGCCCTCTTGAGTTCA GCGGTG SCLYKLLMAILSDRVT ACGTGCAGGACAAGATCATCCCCGCCCTTACAGAGGAGCAGCACTACCG GGATCC TWALDNDLISSSQKS GTACCTTGGTGTCCCCATCGGCCTCTACCGCACCGACGACAGCCTGGAG TGTTGT ARPGEGCYEHTFLLST ACCTTGGTGGCCAAGATGACTGATGACATCCAGCGGATTGACTCCTCAC GATAAT VVKDARRNQKNMY TCCTCGCTCCCTGGCAGAAACTGGATGCCATCCGGACCTTCGTACAGCC CCCCGT AAWLDLQNAFGSIP CTGCCTAGCCTACACCCTGCGGGCAGGGGACTGTGCGAAGAAACACCT GCATTG HDAMFTVLTSIGAPE CAAGAGACTGCGGGGACAGCTGGTGAAGACAGCCCGCAAAGTATGCAA CCCATG GLVSLVRDVYTDAST CCTGCCGACTCGGGCCACCACAAACTACATCTTTGCTGATCGTAGGGCG GATTTAT DFVTPTGRTAAVPIH GGTGGCTTGGGTTTCATCGACCCAAACGTCGATGCCGATATTCAAATCA TTCCGCC SGVKQGCPISPVLFNL TCACGCAGGCCGTACGCATGCTGTCTTCACCCGACGACATCACCAGAGC TTAGTTG TLELIIRAVNASATRD CATCGCCACCGGCCAGCTGTCTTCCGTGGTCCACCGCACCATCCACCGA TTCCTAA RSAPVVHGQAVPILA GCACCGACACAGGAGGAGACGGACGAGTTCTTGTCAGCATCAATGGAG CCTTGG YADDLVILSRSSDGLQ GGGGACTTTGCCAACAGTGGGAACAGCGGCCAGGCCTCTTCATTGTGG ATTTATT SLLTTASIMATKIQLK TCCAGAGCACGTGCGGCGGCCCGGCGACTGAAAGTTACCATCAGCGGC GCTGTG FKPAKCASLSLECRRG AGCCTTTCCGGCTCAGTGATCACCAAGTCAACAGAGAACCGGGAAATG GGTGAT TKVRPLEFNVQDKIIP GCTGCCAAGTCCATCACTACCGCCCTGCGTGCTCAGTCCCGCGCCCACTA GCCCGG ALTEEQHYRYLGVPIG CACCCACCGGCTGCTGTCCCTGCCCGACCAGGGGAAAGTTGGCCAGTCC GTTTTGT LYRTDDSLETLVAKM CTCAACCAAGACCAATACATGAACTCGTCCTCGTGGATGTCCTCCGGTA TTACATC TDDIQRIDSSLLAPW GCTACATCCTGTTCTGTGACTGGAGATTCATCCATCGGGCCCGTCTCAAC GGGATC QKLDAIRTFVQPCLA ACCCTTCCGACCAACGCAACGGCGCAGCGATGGAAACCCAACACCAGTC CCGCTG YTLRAGDCAKKHLKR CCGCCTGTCGCCGTTGCCAGCACCCGCAAGAGACCCTGCCCCACATCCT CGGCTC LRGQLVKTARKVCNL CAACCACTGCCCGCCCAACATGGTACCCATCCGCCGCAGACACAACCTC GGTGTT PTRATTNYIFADRRA GTCCAGCAACGCATCGTGTCTGCGGTACGCCACGGGGGGTGTTCGTC CCATGC GGLGFIDPNVDADIQ GACCAGCACGTCCCTGAGGATCCCAACCCCCGGGAGCGCCCGGACATC GACTGG IITQAVRMLSSPDDIT ACGGTCGTTGAGGGCGACAAAGTCACCATCATCGATGTATGTTGCCCCT CAGCCC RAIATGQLSSVVHRTI TCGACAACGGCCGTGATGCCCTGATGACCGCCGCAGCCGCCAAGGAGA CTTTGTT HRAPTQEETDEFLSA CCAAGTATGCCGACCTGAAGCAGGCGCTGGTGGCTGCGGGGAAAGAC TACTCTS SMEGDFANSGNSGQ GTCGAGGTGTTCGGATTCGCCGTTGGATCGCTAGGCTCCTGGTTACCTA GACTCT ASSLWSRARAAARRL GCAACGAGCGGGCACTTCGACGTCTCGGCATCGCCAAGCGCTTCCGGA ATTCATT KVTISGSLSGSVITKST CTCTGATGCGCAAACTCCTGCGCATCGACGCCATCAAGGGATCCCGGGA GTTGTAT ENREMAAKSITTALR CGTGTACATCGAGCACATGTGTGGCCATCGCCAATACACCGATTAGCGA TTCTTCC AQSRAHYTHRLLSLP ACACTAATATCCTGCCMTAGACGTGATTGCTAATCCGCAAACCAACCGG ACACTG DQGKVGQSLNQDQY ATCTACAGCCTGAACACTGAACTTTAATCTTCACCCATGTCACATCTGGA GCCAGT MNSSSWMSSGSYILF CACTAGGTTTTTGCTCTGTTTGTGTTTTCCTGCCTTTWCMTTGGAWCTTT GATCAC CDWRFIHRARLNTLP CGCMCTGGAATTTATTTGTCGCTTGGATTATTTTTTTTCTCACAATTTGG ACTGCT TNATAQRWKPNTSP ATCTATTTTCGTTTGTTCACTTCGAACTCTAGCTGCCCTTTCTTCGGACAC GTGTTTC ACRRCQHPQETLPHI TGAACTTTAATCTTCGCCATGGCTGTCAGTCGCCGGTTCACTTGCTGCGG CCGGGA LNHCPPNMVPIRRR TGGGATCCTGTTGTGATAATCCCCGTGCATTGCCCATGGATTTATTTCCG AGATAT HNLVQQRIVSAVRH CCTTAGTTGTTCCTAACCTTGGATTTATTGCTGTGGGTGATGCCCGGGTT CCTCTGC GRVFVDQHVPEDPN TTGTTTACATCGGGATCCCGCTGCGGCTCGGTGTTCCATGCGACTGGCA GGTTTTC PRERPDITVVEGDKV GCCCCTTTGTTTACTCTSGACTCTATTCATTGTTGTATTTCTTCCACACTGG ACGCTCT TIIDVCCPFDNGRDAL CCAGTGATCACACTGCTGTGTTTCCCGGGAAGATATCCTCTGCGGTTTTC GGGTGT MTAAAAKETKYADLK ACGCTCTGGGTGTCTCCCCGGGCAACGCACTGGTTGCTTGCTGCGCCAT CTCCCCG QALVAAGKDVEVFG CACCCTTTTCGTTTATATTCATTTTCAGTCTGCCGTTATCTTGGCCAGCGC GGCAAC FAVGSLGSWLPSNER TCATTCTTTTGTGATGGCCGTGGACTGACCCTCTGCGGTTTTCTGCGGTC GCACTG ALRRLGIAKRFRTLM ASCACTCTCGGTGAATGCTGTGCCACCATTTTTCATTTGTTTACTTTTTCA GTTGCTT RKLLRIDAIKGSRDVYI GCTACAATTATCCTGGCCAGCTTTCACTCTTTTGTGATGGCCGTGGACGG GCTGCG EHMCGHRQYT(SEQ ACCCTCTGCTGGTTTTCACGCTCCGGGTTTGTCTGCGGTCAGCACCCTTG CCATCAC IDNO:1435) GTGAGTGCTGGTCGTTTGTTTGCTCATTTTGCTTAGTTCACCATTATCTTT CCTTTTC MTTCTCTTTTGTWTGGTTTTCCTAGCGGTTGTCTGGGAGTTGAGCTGCA GTTTATA GTTGTCTGGTCTTGGTTTTACCCCCATTTGTTTTCTTTTAACGCGGGGCGT TTCATTT ATTGCCTTGACCGGCCGTCTCAGCTTTTCTCCTAGAGCAACCTTCCGTTC TCAGTCT ATCCAACTTTTAGTTTTCTCAGTTCTTGGCCATTCCGGTTGGTTAATTTTT GCCGTT ATTTATACTTAATTTTATGTTTACATTTTCTGGTTGGAGACCATTTTAGCT ATCTTG TGTTTTAATAGCTTTTCTTCTTTAATTAATACCCTCTGCCATTGAGGGTTT GCCAGC TTATTACTATTAATTTTGTTTACTCTTTGTAACTTGTTTGATTGAATATTTT GCTCATT AATAAACCAC(SEQIDNO:1067) CTTTTGT GATGGC CGTGGA CTGACC CTCTGC GGTTTTC TGCGGT CASCACT CTCGGT GAATGC TGTGCC ACCATTT TTCATTT GTTTACT TTTTCAG CTACAAT TATCCTG GCCAGC TTTCACT CTTTTGT GATGGC CGTGGA CGGACC CTCTGCT GGTTTTC ACGCTC CGGGTT TGTCTG CGGTCA GCACCC TTGGTG AGTGCT GGTCGT TTGTTTG CTCATTT TGCTTA GTTCACC ATTATCT TTMTTC TCTTTTG TWTGGT TTTCCTA GCGGTT GTCTGG GAGTTG AGCTGC AGTTGT CTGGTC TTGGTTT TACCCCC ATTTGTT TTCTTTT AACGCG GGGCGT ATTGCCT TGACCG GCCGTC TCAGCTT TTCTCCT AGAGCA ACCTTCC GTTCATC CAACTTT TAGTTTT CTCAGTT CTTGGC CATTCCG GTTGGT TAATTTT TATTTAT ACTTAAT TTTATGT TTACATT TTCTGGT TGGAGA CCATTTT AGCTTG TTTTAAT AGCTTTT CTTCTTT AATTAAT ACCCTCT GCCATT GAGGGT TTTTATT ACTATTA ATTTTGT TTACTCT TTGTAAC TTGTTTG ATTGAA TATTTTA ATAAAC CAC (SEQID NO: 1313) NeSL Utopia- Phytophthora AGACGAGCAACGCGCTGGGGCCCAAGACCTGGCACGAACGACACTGCC AGACGAG TGAAGC MDVDGGPAMPEPW 1_PS sojae CAGGCTGTGAACGACGAGCACGCTGCTAACCCGGCAGCGCACCGGCCT CAACGCGC TGCACA VLRFDGACRRNPGP CTGGGCTCCGCTGCACCGGTTACCGGTGCAACACGGTGGGGTCCACGTC TGGGGCCC AGCGCG GGAGAALFKPCGTV ACGGCGCGATCGGGGCAGCGGCTGTAGCTCGCCTGCTCACAGGCCTAC AAGACCTG AGGCTC AWTCSHYMPNSSET CCACGGCACCAGCACCAGCTACGCGCCGGCCTGCTTCGGCTCGGCGCTG GCACGAAC GACAGC NNTAEYTALLLGVQS CCCAGACCCGCCCGCTCCCCCCGCGGCCACGACGACAGCCCCCGATGCG GACACTGC TGGACC AVHHGASHLEIEGDS ATGGACGTCGACGGAGGGCCGGCTATGCCCGAGCCGTGGGTGCTCCGC CCAGGCTG TTCAGCT HLVVAQVKGTFACR TTCGACGGCGCCTGCCGCCGGAACCCGGGACCTGGTGGAGCTGGTGCC TGAACGAC GTCGAG NPRLRQLRNRVRHAL GCCTTGTTCAAGCCGTGCGGGACCGTGGCGTGGACGTGCTCGCACTACA GAGCACGC CCACTG RAVTSLTLKHIDRKAN TGCCAAACAGCAGCGAGACCAACAACACCGCTGAATACACGGCGCTGC TGCTAACC CATCCG AHADRLANRALDLKR TGCTTGGCGTGCAGAGTGCCGTCCACCACGGCGCCTCGCACCTGGAGAT CGGCAGC CGCCAG SLAECGEHQGAMES CGAAGGTGACAGCCACCTCGTCGTAGCGCAGGTCAAGGGCACCTTCGC GCACCGGC CCACCG CLHMNPAAQRQRE CTGCAGGAACCCGCGCCTCCGCCAGCTGCGCAACCGCGTCCGCCACGCC CTCTGGGC CACCTG QPAPPARPACAPTRA CTCCGCGCCGTGACGTCCTTAACGCTCAAGCACATCGACCGGAAGGCGA TCCGCTGC GGGCTG ESASDHDEDIDAEIAA ACGCGCACGCGGACAGACTGGCAAACCGGGCGCTGGACCTGAAGCGTT ACCGGTTA GTACTG RDGGEAFPTLPIGPG CGCTGGCTGAGTGCGGCGAGCACCAAGGGGCGATGGAGAGCTGCCTCC CCGGTGCA CCGGCG TAPARQPRLRLRQLT ACATGAATCCTGCTGCACAACGCCAGCGCGAGCAACCCGCACCTCCTGC ACACGGTG CCACCG EDEQEAAASALQAM CCGCCCCGCGTGCGCACCCACGCGCGCCGAGTCTGCCAGCGATCATGAC GGGTCCAC CGAAGG AEELACKIEDADSWT GAGGACATCGACGCAGAGATTGCCGCGCGCGACGGCGGCGAGGCGTT GTCACGGC ACAACG SGDGYISAIPSRIRQLL CCCCACGCTGCCTATCGGGCCCGGGACGGCTCCAGCGCGGCAGCCGCG GCGATCGG GAGCGG QPFTAAQPHPRPPLQ CCTTCGACTGCGGCAGCTGACCGAAGACGAGCAGGAAGCGGCGGCGTC GGCAGCG CAACGC QQRQRPPRVTRTQR GGCCCTGCAGGCCATGGCGGAGGAGCTAGCGTGCAAGATCGAAGACG GCTGTAGC CTCGCG EHRLDEALDEMAAV CAGACAGCTGGACCTCGGGCGATGGATACATCAGCGCCATCCCCAGTC TCGCCTGC AGCGCC QQERPTSRSAVRRAR GCATCCGGCAGTTGCTGCAGCCCTTCACAGCAGCCCAGCCCCACCCTCG TCACAGGC GCGTGG RRVGRIRASMRQQQ GCCTCCACTGCAACAGCAGCGACAGCGACCGCCCCGGGTGACGCGCAC CTACCCAC GTCTGG LRHDFARNESKCVED TCAGCGCGAGCATCGACTCGACGAGGCCCTCGACGAGATGGCGGCGGT GGCACCAG GGGGGG ILRAASAETAAEEHPE GCAGCAGGAGCGACCCACCAGCAGATCGGCCGTCCGCCGTGCGCGGCG CACCAGCT CCCGCG TCPIDSGTLHEHFTAV CCGCGTCGGCCGGATCCGCGCCTCGATGCGCCAACAGCAGCTCCGCCAC ACGCGCCG GCGCAC NSPRINFLPDEACGA GACTTCGCTCGCAACGAGAGCAAGTGCGTCGAGGACATTCTGCGTGCT GCCTGCTT ATCGCA LFREAMADVGTPQE GCGTCGGCGGAGACCGCAGCGGAGGAGCACCCGGAGACGTGCCCCAT CGGCTCGG GGCTCG RRSALTDELTMDEVE CGACTCGGGCACGCTCCACGAGCACTTCACGGCGGTGAACTCGCCCCGC CGCTGCCC GGCACG DQLMQAATNSSPGH ATCAACTTCCTACCAGATGAAGCTTGCGGGGCGTTGTTCCGCGAGGCGA AGACCCGC GCGGTA DGVGYDIYKKFAAQL TGGCCGACGTCGGGACACCGCAGGAGCGGCGGAGCGCGCTCACGGAC CCGCTCCC AGCTGG VPLLHAAFQSCWRH GAGCTGACGATGGACGAGGTGGAGGACCAGCTCATGCAAGCGGCCAC CCCGCGGC TCATTTG HRVPALWKVGFVRLI GAACTCCAGCCCCGGCCACGACGGCGTGGGCTACGACATCTACAAGAA CACGACGA ACCAAC HKKGDPNDPANWR GTTCGCGGCGCAGCTGGTCCCCCTCCTCCACGCAGCCTTCCAGTCCTGTT CAGCCCCC AGGGCA PICLQTAIYKLYSGLLA GGCGCCACCACCGGGTGCCTGCCCTGTGGAAGGTGGGGTTCGTCCGCC GATGCG CTACCCA RRLSAYLEANGLLLM TCATCCACAAGAAGGGCGACCCCAACGACCCAGCGAACTGGCGGCCGA (SEQID GGTAGG AQKGFRAYNGCHEH TCTGCCTGCAGACCGCGATCTACAAGCTCTACAGCGGTCTGCTGGCGCG NO:1191) GAACCG NFVATTLLDQTRRM TCGGCTGTCGGCGTACCTGGAGGCGAACGGGCTGCTGCTCATGGCGCA CCCTTCA RRRLYQVWYDLRNA GAAGGGGTTCAGGGCGTACAATGGTTGCCACGAGCACAACTTCGTGGC AAAACC FGSVHQDMLWYVLR AACCACGCTCCTGGACCAAACCCGACGCATGCGCCGCAGGCTGTACCAG CAGGAA LLGVERAFVERCDDIY GTCTGGTACGACCTGCGGAACGCGTTCGGGTCCGTTCACCAGGACATGC GACACA EDSYFVVGNAADGA TGTGGTATGTGCTCCGCCTGCTCGGCGTCGAGCGTGCGTTCGTCGAGCG AACACC TEPVRQEVGVYQGC CTGCGACGACATCTACGAGGACTCCTACTTCGTCGTCGGCAACGCGGCC CTCCCTT PLSPLLFIAALVPLLRA GACGGAGCGACGGAACCAGTGCGGCAGGAGGTGGGCGTGTACCAAGG TAGTGA LEKLDGVGVALADGV CTGTCCGCTCAGCCCCCTCCTGTTCATCGCCGCGCTGGTGCCGTTGCTCC CATGCA RPCTTAYADDLKVFS GAGCATTGGAGAAGCTCGACGGAGTGGGCGTCGCGTTGGCGGACGGG TATTTTA DSAAGITRCHAVVEK GTGCGGCCATGCACCACCGCCTACGCCGACGACCTCAAGGTGTTCAGCG GCCTAC FLEWTVLQANPGKC ACAGCGCGGCCGGCATCACGCGCTGCCACGCTGTCGTTGAGAAGTTCCT ATTTCAG AFLAVTRNARGNPA GGAGTGGACGGTGCTGCAGGCCAATCCAGGCAAGTGCGCCTTCCTGGC TTACGG HDKDMKLSLHDEEVS CGTGACACGCAACGCGCGCGGCAACCCCGCCCACGACAAGGACATGAA AGAGGT SIKLHDSYRYLGVGD GCTGTCGCTCCACGACGAGGAGGTCTCAAGCATCAAGCTGCACGACAG TACTAAC GFDHVRHRLQLEPKL CTACCGCTACCTCGGCGTGGGCGACGGCTTCGATCACGTGCGCCACCGT TGGTAA QQIKREAVALMQSG CTCCAGCTAGAGCCCAAGCTACAGCAGATCAAGCGGGAGGCGGTGGCG ACACGA LAPWQVVKALKTYVY CTGATGCAGTCGGGGTTGGCGCCGTGGCAGGTGGTGAAGGCGCTCAAG ACACAC PKVEYALRHLRPLQS ACCTACGTCTACCCCAAGGTCGAGTACGCGCTCCGCCACCTCCGACCGC AT(SEQ QLQGFDRVVAKGLR TCCAGTCCCAACTGCAGGGCTTCGACCGTGTGGTAGCCAAGGGCCTGCG IDNO: HLLRLPRSATNEVLYA GCACCTGCTACGCCTGCCCCGCTCGGCAACCAACGAGGTGCTCTACGCA 1314) PTSSGGLGLQPLVEM CCAACGTCGAGCGGGGGCTTGGGGCTCCAGCCGCTGGTGGAAATGCAC HRALQIAHAWQML CGCGCGCTGCAGATCGCCCATGCGTGGCAGATGCTGCACTCGAAGGAC HSKDPAIREVARAQV CCAGCGATCCGGGAAGTGGCGCGCGCACAGGTGTGGCAGGTGGCCCG WQVARKRHRLREEH CAAGCGTCACCGCCTCCGCGAAGAACACTGGCGGGAGCGTGACGACGA WRERDDELVRCFLNS GCTCGTGCGGTGCTTCCTCAACTCCGAGCTCGCCGCCTCGCCCCACGCG ELAASPHAEALRRHG GAGGCGCTGCGCCGCCACGGCGACATCGGGTCACTGTGGTCCGACGTA DIGSLWSDVQRWLRI CAGCGGTGGCTGCGCATCTACCACCTCAGCCTGGTGGTGCAGGACGAC YHLSLVVQDDRNGL CGCAACGGCCTGGACCCGCTCGGCCTCCGCGTGCCGCACCACGCCAAGT DPLGLRVPHHAKWL GGCTCGACCACAAGAGCGTGCTGCGGCACGTCAAGTTGCACCTCAAGA DHKSVLRHVKLHLKIR TCCGCCACCAGACCCGATGGAAGGGTCTGGCGGACCAGGGCAAGACG HQTRWKGLADQGKT GTGCGGGCCCACGGCGGTGTCGGTGCCAAGTTCATGTCGACATGGGCG VRAHGGVGAKFMST GGCCTGTCGGACGACGACTACCGCTTCGGCGTGAAGGCCCGCCTCAACC WAGLSDDDYRFGVK AGATCGACACGAACGCCGTGCTCAAGCGCAAGCGGCTGCGGTCGCACA ARLNQIDTNAVLKRK AGACCTGCAGGGACCCGACCTGCTCCTCGGCCGAGACGCTGGCCCATGT RLRSHKTCRDPTCSS CCTCAACCATTGCGAGTCCAACATGGACGCAATCCGCCAGCGTCATGAC AETLAHVLNHCESN GACGCCTTGGAGCAGATCGGGTCCAAGATCCGCAACGCGCTCAAGCGC MDAIRQRHDDALEQ GGCAAGTCGACCGCGGAGCTGCGCCTGAACCAGACGGTACCCGAGTAC IGSKIRNALKRGKSTA ACGGGCGCAGCTCTGCGGCCAGACATCGTACTGCGGATCGTGGCCGCC ELRLNQTVPEYTGAA AAGAAGATGGTGATCGCCGACTTGGCGGTCACGTTCGAAGAGCACGCG LRPDIVLRIVAAKKM GCCGGCGCCCGCCACTCCTCGCTGCAGCTCAGCCACGACCACAAGACAC VIADLAVTFEEHAAG TCAAGTACCAACCGATCGTGGCGGAGCTGCAGCTCAAGGGGTGGCAGG ARHSSLQLSHDHKTL TGCAGACCGCGGCGATCGTGTACGGGTCGCTGGGCTCGGTGCAGCCCA KYQPIVAELQLKGW GCAACTCAACACCTACACGGAAAAGTTGAAGCTGCACAAGCGCGAGGC QVQTAAIVYGSLGSV TCGACAGCTGGACCTTCAGCTGTCGAGCCACTGCATCCGCGCCAGCCAC QPSNSTPTRKS(SEQ CGCACCTGGGGCTGGTACTGCCGGCGCCACCGCGAAGGACAACGGAGC IDNO:1436) GGCAACGCCTCGCGAGCGCCGCGTGGGTCTGGGGGGGGCCCGCGGCG CACATCGCAGGCTCGGGCACGGCGGTAAGCTGGTCATTTGACCAACAG GGCACTACCCAGGTAGGGAACCGCCCTTCAAAAACCCAGGAAGACACA AACACCCTCCCTTTAGTGACATGCATATTTTAGCCTACATTTCAGTTACG GAGAGGTTACTAACTGGTAAACACGAACACACAT(SEQIDNO:1068) NeSL Utopia- ADOS01001321 Pythium GCGGTGTACGCGCACAACGCCGCGCTCTTCGAGTGCACGTTGTGCGCGC GCGGTGTA TGATGC MGTQSARERGAPSA 1_PU ultimum ACACCGCGCGGGATCTCGCCGCGCTCCAGCAGCATCGGCGCTCCGCGC CGCGCACA GGGTCA PHSHTLGPRTPPRPP ACCGCAGCGTCCGCTTTGTGGATCACTTCCACAGCGGATGCGCGTGCGG ACGCCGCG TATTGAC ACSKHGELESAAGGR CGTGAGCTTCCACTCGCGTGCGGCGGCAACCAAGCACGCGCGCGAATG CTCTTCGA CGAAAG DGQCSDGAERERDA TCCAGAGAGCGCGTTCTCGGTCGCCGCCGCCGCGCGCACTGCAGCGGC GTGCACGT GGCACC ERDIRANERDCNGD CAACACCGCAGGTATGTCTCTCGGCGCCGACGAACGCGACCACCTCGCG TGTGCGCG ATCCAC GDGDDADSDSDDRN TCCGTCGGCGCCTTGCATGATGTTGCATCCCCCGCTTTTGGCAACATTTT CACACCGC GTAGGA DARRRSRRPRATATT GCCGGTTGCGTTTGCGACCGCGGCAGACGCATCAAGCGCCACCGTGAT GCGGGATC CACCGC TTSAPTTTTTTTTSAT CGCAGACGCAGCCATGCAGCACAGTGCTGTGCCCTCTGCTGCCGCCCAA TCGCCGCG CCTCAA TSATTPATDSSPWVL TCCCCTCGGCGTGCGCACGTCCCCCCCGTGCCGCGCGCCACCACGACAC CTCCAGCA AAACCC RFDGACRRNPGPGG CATCCGCGCTGCGGATTGGTGGCAAACGCCGCCGCCTGAACGACGACG GCATCGGC AGTTCA AGAALFEPGGAVVW GCGACAACGAAAACAGCGACGGCCGCGACGCCGACATCGAGATGCGC GCTCCGCG GTTTATT TVSHYLPGSETNNTA GCCGACGACACCGACGCACCAGCGCCGACCAACCCCGCGACCAGTGCG CACCGCAG GACACC EYSAMLLGVRSAIHH GCTGCAACGCCCGCGCGCACAGCACCAGCACCAACGGATGCCGCGACC CGTCCGCT CTCCACT GATRLRVEGDSHLAL GCGCGCCGCCGCCACACGCGATGGGGACCCAGAGTGCTCGTGAACGTG TTGTGGAT TAGTGA SQVRGTFACTNRRLR GCGCGCCTTCCGCGCCGCACTCACACACGCTGGGGCCCCGCACTCCGCC CACTTCCA CATGCA KLRNRVQAALRELGD GCGACCACCAGCATGCAGCAAACATGGAGAACTCGAATCAGCTGCTGG CAGCGGAT TATTCAA YRLVHIDRQANAHA TGGCCGCGACGGACAATGCAGCGATGGCGCGGAACGCGAGCGCGACG GCGCGTGC ACCGAT DRLANRALDLRKTKV CCGAGCGCGACATCCGCGCCAACGAGCGCGACTGCAACGGCGACGGCG GGCGTGA ACATATT DCGPHATTTDACVQ ACGGCGACGACGCCGACAGCGACAGCGACGACCGCAACGACGCGCGA GCTTCCAC CGTTAG PAEILAPTARLSSSSSS CGACGCAGCAGACGACCGCGAGCGACGGCGACAACAACGACGAGCGC TCGCGTGC GAGAGG SSSSSSDEPMPGLEEP GCCGACCACGACCACGACCACGACAACGAGCGCAACGACGAGCGCGAC GGCGGCA TTACTAA AADDETDADAEADIA GACGCCGGCGACAGACAGCTCCCCCTGGGTCTTGCGCTTTGACGGCGCA ACCAAGCA CTGGTA MRDGGEIFPTLQIGP TGTCGCCGCAACCCAGGCCCCGGCGGCGCCGGTGCTGCGCTCTTCGAGC CGCGCGCG ATATATC GSAPAQQPRLRLRQL CAGGTGGCGCCGTGGTGTGGACGGTGTCTCACTACCTCCCGGGCAGTG AATGTCCA ACCATTT SDDESEAAARTLEHF AGACCAACAACACCGCCGAGTACTCGGCGATGCTGCTCGGGGTGCGCA GAGAGCG C(SEQID ANDMASKIADADD GCGCCATCCACCACGGCGCGACCCGACTCCGCGTCGAGGGCGACAGCC CGTTCTCG NO: WRSGEGYISAIPVRLR ACCTCGCGCTCTCGCAGGTGCGCGGCACCTTCGCGTGCACGAACCGGCG GTCGCCGC 1315) ELLAPYAVPIRSPPRN TCTGCGCAAGCTGCGCAACCGAGTGCAAGCGGCACTGCGCGAGCTCGG CGCCGCGC ASSRPPRPQSRPPRP CGACTACCGGCTCGTGCACATCGACCGGCAAGCGAACGCGCATGCCGA GCACTGCA PRVTRHQREHRLDEA CCGCCTCGCGAACCGCGCCCTCGACCTCCGCAAGACCAAGGTCGACTGC GCGGCCAA LDDLAAAQRSTSTDQ GGGCCCCACGCAACGACCACCGACGCGTGCGTGCAGCCGGCGGAAATC CACCGCAG RSIRNARRRVGRIRTA CTGGCGCCAACAGCCCGCCTGTCCAGCTCCAGCTCCAGCTCCAGCTCCA GTATGTCT QAQSDLRSQFATNE GCTCCAGCGACGAGCCGATGCCGGGACTCGAGGAGCCAGCTGCGGAC CTCGGCGC RACVESILRAAKPDG GACGAGACCGACGCTGACGCCGAAGCCGACATCGCAATGCGCGATGGA CGACGAAC TEPQASAGTCPIDRA GGCGAGATTTTTCCGACGCTCCAGATCGGCCCCGGCTCGGCGCCAGCAC GCGACCAC TLHAHFAGVNTPRER AACAACCGCGGCTCCGCCTCCGCCAGCTCTCGGACGACGAGTCCGAGG CTCGCGTC FDFDDALGADFRAAL CAGCCGCGCGCACGCTCGAGCACTTCGCCAACGACATGGCGAGCAAGA CGTCGGCG DVLPPPDQAADAFA TTGCGGACGCTGACGACTGGCGTTCCGGGGAGGGCTACATCAGTGCCA CCTTGCAT DELSLGEVEDQLDRV TCCCTGTGCGCCTCCGTGAACTCCTCGCACCATATGCGGTGCCAATCAG GATGTTGC VASSSPGLDGVGYDV GAGCCCTCCACGGAACGCCAGCTCCCGTCCCCCGCGCCCGCAATCTCGT ATCCCCCG FKRFRLQLLPLLHAAY CCGCCTCGCCCGCCGCGAGTGACTCGACACCAGCGCGAACACCGACTG CTTTTGGC QCCWRHRRVPATW GATGAGGCGCTCGACGACCTGGCCGCTGCGCAGCGCTCGACATCGACC AACATTTT KVGLVRLLHKKGDRA GACCAGCGCTCCATCCGCAACGCCCGGCGTCGAGTCGGCCGCATCCGCA GCCGGTTG EPNNWRPICLQQAIY CGGCACAAGCCCAATCCGACCTCCGCAGCCAGTTCGCCACGAACGAGCG CGTTTGCG KVYSGLLARRLSRWL CGCCTGCGTCGAGAGCATCCTCCGCGCCGCCAAACCCGACGGCACCGA ACCGCGGC EANERFTTAQKGFRE GCCTCAAGCAAGCGCGGGCACGTGCCCCATCGACCGTGCGACGCTCCA AGACGCAT FNGCHEHNFVASSLL CGCGCATTTCGCGGGCGTCAACACGCCGCGCGAGCGCTTCGACTTTGAC CAAGCGCC DQTRRLHRKLYAVW GACGCGCTGGGAGCCGACTTTCGCGCCGCGCTCGACGTTCTGCCACCGC ACCGTGAT YDLRNAFGSMPQPL CGGACCAAGCCGCCGATGCGTTCGCCGACGAGCTCTCGCTGGGCGAGG CGCAGACG MWRVLARLGVDTAF TCGAGGACCAGCTCGACCGCGTCGTCGCCTCCTCCAGTCCCGGCCTGGA CAGCCATG LQRCEDIYADSFFVV CGGAGTGGGCTACGACGTGTTCAAGCGCTTTCGGCTGCAGCTCCTGCCG CAGCACAG GNAADGATDPVRQE CTCCTCCACGCTGCATACCAGTGCTGCTGGCGCCACCGCCGCGTGCCTG TGCTGTGC VGVYQGCPLSPLLFIS CAACGTGGAAGGTGGGATTGGTGCGCCTGCTGCACAAGAAAGGCGATC CCTCTGCT ALIPLLRALQRLPGVG GCGCTGAGCCAAACAACTGGCGGCCGATCTGCTTGCAGCAGGCCATCTA GCCGCCCA VPLADGVRPCTTAYA CAAGGTCTACTCTGGGCTGCTGGCGCGCCGACTGTCGCGGTGGCTCGA ATCCCCTC DDLKVFSDSAAGIQQ GGCGAACGAACGCTTCACCACCGCCCAGAAAGGTTTCCGGGAGTTCAA GGCGTGC CHGTVARFLRWTGL CGGGTGCCACGAGCACAATTTCGTGGCATCCTCGCTCCTCGACCAGACG GCACGTCC RANASKCALLPVTTT CGTCGGCTCCATCGGAAGCTGTACGCTGTCTGGTACGACTTGCGGAACG CCCCCGTG ARGNPAIDDTLQLEL CCTTCGGTTCAATGCCGCAGCCCCTGATGTGGCGCGTCCTCGCTCGCCTC CCGCGCGC HGDAIARLTLQSSYAY GGAGTCGACACTGCGTTCCTCCAGCGGTGCGAGGACATCTACGCGGAC CACCACGA LGVGDGFDHVQHRV TCGTTTTTCGTCGTCGGCAACGCAGCCGACGGTGCCACGGACCCCGTCC CACCATCC QLAPKLAELKRDAVA GACAGGAGGTCGGCGTCTACCAAGGATGCCCGCTGAGCCCGCTGCTGT GCGCTGCG LLRSGLAPWQVLKAI TCATCTCAGCATTGATTCCTCTCCTGCGGGCTCTGCAGCGGCTACCAGGC GATTGGTG KVYLYPRIEYALRHLR GTCGGCGTTCCCCTGGCGGACGGAGTCCGTCCGTGTACGACCGCGTAC GCAAACGC PLQSQLEGFDRAVAK GCGGACGACCTCAAGGTGTTCAGCGACAGTGCCGCCGGCATCCAGCAA CGCCGCCT GFRHLLRLPANATNE TGCCACGGCACTGTCGCTCGATTCCTCCGATGGACGGGGCTGCGCGCGA GAACGAC LLYAPVSSGGLGLLPL ACGCGAGCAAGTGTGCGCTGCTGCCAGTGACAACCACTGCGCGCGGCA GACGGCG VELHKALQIAHGWQ ACCCCGCCATCGACGACACGCTGCAGCTCGAGCTGCACGGCGACGCCAT ACAACGAA MLHSKDAAVQAIAR CGCCCGCCTCACCCTCCAATCGAGCTATGCCTACCTTGGGGTGGGTGAT AACAGCGA AQVRQVVQKRYTLD GGCTTCGACCACGTGCAGCACCGCGTGCAGCTCGCGCCCAAGCTCGCA CGGCCGCG ADHWQGRDDELVQ GAGCTGAAGCGCGACGCTGTGGCACTGCTGCGGTCGGGGTTGGCGCCT ACGCCGAC LFLNSELAASPHATIK TGGCAGGTACTGAAGGCCATCAAGGTCTATCTGTACCCGCGCATCGAGT ATCGAGAT RRNGDIGSLWSDVQ ACGCGCTGCGCCACCTCCGACCGCTGCAGTCGCAGCTCGAGGGGTTTGA GCGCGCCG RHLKTLQLRLETREPT TCGCGCGGTAGCGAAGGGGTTCCGACACCTCCTCCGTCTACCCGCGAAC ACGACACC ADAPDSPNGLLHLRV GCCACCAACGAGCTGCTCTACGCGCCTGTGTCGAGCGGCGGCCTGGGG GACGCACC PHHRKWLSHKTVLR CTTCTGCCACTCGTGGAGCTCCACAAAGCTCTGCAGATTGCCCACGGGT AGCGCCGA HMKLHIRLCHKHKW GGCAGATGCTCCACTCGAAAGACGCCGCCGTGCAGGCGATCGCGCGCG CCAACCCC QSMSDQGRTVRAH CACAGGTCCGCCAAGTGGTCCAAAAGCGCTACACGCTGGACGCGGACC GCGACCAG GQAGSHFVSRGVGL ACTGGCAGGGACGAGACGACGAGCTCGTGCAGCTCTTTCTCAACAGTG TGCGGCTG WDADYRFALQARLN AGCTCGCCGCCTCGCCCCACGCGACCATCAAGCGGCGCAACGGAGACA CAACGCCC QLDTNSTLKRRRQRT TTGGTTCGCTCTGGAGCGATGTCCAGCGCCACCTGAAGACGCTGCAGCT GCGCGCAC NATCRAPNCSRTETL CCGGCTCGAGACGCGCGAGCCGACTGCCGACGCGCCTGACTCCCCCAA AGCACCAG AHVLNHCETNMDVI CGGCCTCCTCCACCTCCGTGTGCCGCACCACCGCAAATGGCTCAGCCAC CACCAACG RQRHDGALEQIGAAI AAAACGGTGTTGCGGCACATGAAGCTCCACATCCGCTTGTGCCACAAGC GATGCCGC NAAIKGRRTDTEVRL ACAAGTGGCAGAGCATGAGCGACCAGGGGCGCACGGTGCGAGCGCAC GACCGCGC NQTVPEFNGPAWRP GGACAGGCTGGCAGCCACTTTGTGTCGCGCGGGGTGGGCCTATGGGAC GCCGCCGC DIQVRDARSKTMVIA GCGGACTACCGCTTCGCGCTCCAGGCCCGCCTCAACCAACTCGACACCA CACACGCG DLAITFEDQPNDQSA ACTCGACACTCAAGCGTCGCCGCCAGCGCACCAATGCCACCTGCCGCGC (SEQID SSSLQHSREHKIAKYQ GCCGAACTGCTCGCGCACCGAGACACTGGCGCACGTCCTCAATCATTGC NO:1192) PIAAALERQGWRVH GAAACCAACATGGACGTGATCCGCCAGCGTCACGACGGCGCGCTCGAG TSAIVYGSLRSVHPSN CAGATTGGTGCGGCGATCAACGCTGCCATCAAGGGCCGCCGCACCGAC FTVYTELLGLLKRDAR ACCGAGGTCCGACTCAACCAGACAGTTCCAGAGTTCAACGGACCGGCG RLNTTLSCHCIRSSRR TGGCGGCCCGACATCCAGGTGCGCGACGCGCGCTCCAAGACCATGGTG VWNWHCGQHRAR ATCGCGGACCTCGCGATCACCTTCGAGGACCAGCCAAACGACCAGTCCG QHQRCQEGRAHGS CCAGCTCGTCGCTCCAACACAGCCGCGAACACAAGATCGCGAAATACCA GGNQRAEGGTATT GCCAATCGCGGCGGCACTGGAGCGGCAAGGGTGGCGGGTACACACAA (SEQIDNO:1437) GTGCCATTGTGTACGGGTCGCTACGCTCGGTCCACCCCAGCAACTTCAC CGTGTACACGGAGTTGCTGGGACTGCTCAAGCGCGACGCCCGTCGCCT GAACACAACGCTGTCGTGCCACTGCATCCGCAGCAGCCGACGCGTGTG GAACTGGCACTGCGGCCAGCACAGAGCACGACAGCATCAGCGATGCCA AGAAGGCCGCGCGCACGGGTCCGGGGGGAACCAGCGCGCGGAAGGG GGCACGGCAACAACATGATGCGGGTCATATTGACCGAAAGGGCACCAT CCACGTAGGACACCGCCCTCAAAAACCCAGTTCAGTTTATTGACACCCTC CACTTAGTGACATGCATATTCAAACCGATACATATTCGTTAGGAGAGGT TACTAACTGGTAATATATCACCATTTC(SEQIDNO:1069) NeSL Utopia- Strigamia GGAGTGTTCTTTTCGGAGACGCCGCCTACTTTAGAGGAGAGAATCCCCA GGAGTGTT TGATGG MATVRLKYPYPPEGIL 1_SM maritima CGGGCATCCTCATTTGATCTGATCCATCGAGTATCTGCGAATAGTCGGC CTTTTCGG GAGAGT CGPCAANTNAPQTR GCACTCCTTTTGCCATGATCCCGGGGGTCTCATGGTAAAAAGGTTTGTG AGACGCCG GAGGAA PYSDKSGLAKHLKLY GCACGGCTTAGTTGACGCCCCTCTTCCACGTCACTCGGCCTGCATCGATC CCTACTTT TCTTCTC HKATLVVECRHCGHE GACTGCTCTCTCCTCTTCCTCCCTCCCTCCTCTTCACGCTCTCTTCACGTGA AGAGGAG CACTGT ESDLRKMKKHISTNH CCCCTTCCCATCCCGCCCCTCGGCTTTTGGCAAAGATCTGTGTGTCCTCC AGAATCCC GCAAAA PVAAAAAPTVPPRLG AAAGCACCCATCTACCATTTGCTCGAGTTGCGATTGGTCGAAGCTGCCA CACGGGCA CCATAC PTAPPPPRVILRPRFIP CGCCACTCGTCTACTCTGCCTCTCTGACCCCTTCCCCCCTCTCTCCCTCTGC TCCTCATTT AGTCAG RPRTPSPSSSSSSASS CGCCCTCGCCTCTCGGGTCCCTTCCCATCTCCCTCCTCCCATCCACGTGCT GATCTGAT AAGATG PASSRRSVSLPPASPP CCCTCCTCTCGTCACGTGATCGTTGCGGCTCGACATGGCCACGGTCAGA CCATCGAG CTAACTA VSSASSPAARSGRNS CTCAAGTACCCCTATCCTCCCGAGGGTATCTTGTGTGGACCGTGTGCCGC TATCTGCG CTAGTTT PDSQGTAPVTPIGTV CAACACGAACGCCCCTCAGACGCGGCCATACTCCGATAAGTCGGGGTTG AATAGTCG GATACC RNSPAGSPALSYSTAS GCCAAGCATCTAAAGCTCTATCACAAAGCCACTCTAGTGGTGGAGTGTC GCGCACTC CTGTGC PIASTITTPRHLSPASP GTCACTGTGGTCACGAGGAGTCTGATCTGAGGAAGATGAAGAAGCACA CTTTTGCC CCCCTGC ALSAGPGSLGASPPV TCTCGACGAACCACCCCGTCGCTGCGGCCGCCGCGCCTACCGTACCCCC ATGATCCC AATGTC SPTAATVPPAPPATV TCGCCTGGGTCCCACCGCCCCTCCTCCGCCCCGGGTTATCCTCCGCCCTC GGGGGTC CCGCGT PAVMAATVPFVAAT GCTTCATCCCACGGCCTCGGACTCCCTCTCCTTCCTCCTCCTCGTCATCGG TCATGGTA GTCGTA TVPSVGSSTVPQRPA CGTCGTCTCCGGCCTCTTCCAGAAGGTCTGTATCCCTTCCCCCTGCGAGT AAAAGGTT CCCAAG GPRRPPPFPIDDWIG CCCCCCGTCTCCTCCGCTTCCTCCCCGGCCGCTCGCTCGGGTCGGAATTC TGTGGCAC CCCGGC RIARVSSLPELDAVSR GCCGGACTCGCAGGGGACGGCTCCGGTCACTCCGATCGGTACCGTACG GGCTTAGT TGGCAT LLEDEVVKRRPPDPN GAACTCGCCGGCCGGTAGCCCTGCCCTCAGCTACTCGACGGCCTCCCCG TGACGCCC TGAGAC ARPASLHPTRRPPPP ATCGCCTCTACCATCACGACGCCACGCCATCTCTCGCCTGCGTCCCCCGC CTCTTCCA ACATTA STRPRPCHGTGGTSV TCTCTCTGCCGGGCCTGGCTCTCTCGGCGCGTCTCCGCCCGTCTCTCCCA CGTCACTC GGCTCT SLAALTSSCIREDHRG CGGCCGCTACAGTACCCCCCGCTCCTCCAGCTACCGTACCCGCAGTGAT GGCCTGCA CGCTCCC SLPLLCTSGWDSPWP GGCGGCTACCGTACCCTTCGTCGCCGCGACTACCGTACCCTCCGTCGGC TCGATCGA CCGTAT LSPSPSSHCPSSNPCP TCTAGCACGGTACCCCAGCGTCCCGCCGGCCCTCGACGACCCCCTCCCTT CTGCTCTC ACTCTCA SSSTPPSSLLGPPRHS CCCTATCGACGATTGGATCGGAAGGATCGCCCGAGTGTCATCTCTGCCG TCCTCTTCC TAATTTC HLTWRGSGSTTPSHR GAATTGGACGCCGTCTCAAGGTTGCTCGAGGATGAAGTGGTCAAGAGA TCCCTCCC GTGTAC HCSRARYHHLLWRLP CGCCCCCCTGACCCCAACGCCAGACCCGCCTCACTCCATCCGACACGCC TCCTCTTC GCTAAT YPSSPRLPYPPSALAR GCCCTCCCCCTCCATCTACTCGCCCTCGTCCCTGCCATGGGACCGGCGGA ACGCTCTC CCTACCC YPSVPPALDDPLPSLS ACTTCAGTCTCTCTGGCGGCGCTCACGTCCTCGTGCATTCGCGAAGATCA TTCACGTG TACCCCT TIGSEGSPECHLCRN CCGAGGGAGCCTCCCCCTTTTGTGCACTTCCGGTTGGGACAGTCCATGG ACCCCTTC CCCTTTG WTPSQGCSRMKWS CCACTTTCTCCAAGTCCATCAAGCCACTGCCCATCTTCCAACCCCTGTCCC CCATCCCG ACCACTC RDAPLTPTPDPPHSIR TCTTCCTCCACTCCGCCCTCCTCGCTCCTCGGACCCCCTCGTCACTCCCAT CCCCTCGG ACCCAA HAALPLHLLALVPAM CTCACCTGGAGAGGTTCTGGATCGACTACGCCGAGCCACCGACACTGCT CTTTTGGC CCATGT GPAELQSLWRRSRPR CCCGGGCCAGATACCATCATCTACTCTGAGTGGAGGGCCATCGACCCGA AAAGATCT GTATAG AFAKITEGASPFCALP CTGGACGACTCCTCTCCTCTCTCTTCCAGAAGGTCCAGACCTTCGGGGCG GTGTGTCC CTGTGC VGTVHGHFLQVHQA CCAACCCGATGGAAGGAGAGTACGACCACACTCATCCACAAGGGTGGT TCCAAAGC TGGTGA TAHLPTPVPLPPLRPP GACCACACAGCGATGTCCTCTTGGCGTCCAATCGCCCTCCTCTCCACAGT ACCCATCT TCCCGG RSSDPLVTPISPGEVL GGCCAAGATCTATGGGAGCATCCTGTCTCATCGGCTGACGACCTGGGCC ACCATTTG GGCGGT DRLRRATDTAPGPDT GTCCAGAATGGTCGTCTCTCGCTCTCGCAGAAGGGATTCCTGCCATTTCG CTCGAGTT TATTCAC IIYSEWRAIDPTGRLL AGGATGCCTCGACCAGAACTACCTTGTTCAGTCGTGCCTCCAAGACGCC GCGATTGG TGGTTAT SSLFQKVQTFGAPTR CGCCGCAACAAGAAGACACTCTCACTCGCCTTCCTCGATCTGAAGAATG TCGAAGCT CATATCA WKESTTTLIHKGGDH CCTTTGGGAGCATCCCCCACCTGACCATTCGTCACTCCCTCGAGTGGCTC GCCACGCC TTCTAAA TAMSSWRPIALLSTV GGACTGGCCCCTTCATCGATCGACATTCTTGAAGCGTCATTCCTGGGATC ACTCGTCT ATGATCT AKIYGSILSHRLTTWA GTCGACTCGCGTTCGAACCGAGACTGGACTCACTCCCCCCATCAGCCTC ACTCTGCC TTGATCT VQNGRLSLSQKGFLP GACACTGGAGTGGTCCAAGGAGCTCCCCTCTCCCCCATCCTCTTCAATCT TCTCTGAC CTCAATT FRGCLDQNYLVQSCL AGCCATCGAACCCCTCCTCCGCACGGTCCCCTCTGCCCATTCTGGATTTT CCCTTCCC AACAAC QDARRNKKTLSLAFL CTCTCCATGGCCATGTCGTCTCTGTGGTCGCCTACGCGGATGACCTCGCC CCCTCTCT TAACTTA DLKNAFGSIPHLTIRH ATCCTCGCCCCATCTACTCCCGCTCTACAATCCCAATTGGATGCTATCTCT CCCTCTGC TTTCTGT SLEWLGLAPSSIDILE GGCATGGCAGATTGGGCGGGCCTCTCATTCAATCCGGCCAAGTGCGCG CGCCCTCG TTGTTTC ASFLGSSTRVRTETGL ACCGTCACTCTCACTGGGAAGGATAACTCTCGTGATACGCTGTCCCTGC CCTCTCGG ATTGTTT TPPISLDTGVVQGAP AAGGCTCCCCCGTCCCTTCCATCTCTGACGGCGATGCATACAAGCACCTT GTCCCTTC CACCTC LSPILFNLAIEPLLRTV GGAGTCCCCACTGGCACTACTACCTTCCCCAGTGGCACCGATGCAATCA CCATCTCC GTAAGA PSAHSGFSLHGHVVS AGAAGATGACGACGGATCTACAGGCCATAGACCACTCCGACCTCGCACC CTCCTCCC GGAAGT VVAYADDLAILAPSTP GGCCCAGAAGCTAGATGCCCTCCGCACTTTCATCATGCCCCAGCTCTCCT ATCCACGT TCATTGT ALQSQLDAISGMAD TCCATCTCTCTCACGGCTCAGTCCCCAAAGCCCCCCTCACTCAACTGGAC GCTCCCTC GCGATA WAGLSFNPAKCATV AAGAAGATCAAGCGGGCTGCCAAGCACTGGCTGTTCCTTCCTCAACGAG CTCTCGTC AATCAA TLTGKDNSRDTLSLQ CCTCGAACGAAATACTGTACATGTCCCACTTGCACGGAGGGCAATCCCT ACGTGATC (SEQID GSPVPSISDGDAYKH ACTCCCCCTCTCGGTGCTAGCGGACATCGGACAAGTGACTCACGCCGTC GTTGCGGC NO: LGVPTGTTTFPSGTD GCTCTTCTTCAATCAAGGGATCCAGCCGTGGCTGACCTCGCTCTCCGGAC TCGAC 1316) AIKKMTTDLQAIDHS TTGCCGAGAAGTCGCATCGAAACGGGCAAAGAAGACGGTCAACGGCCC (SEQID DLAPAQKLDALRTFI CGAGTTGGCCCAGTACCTCTCCGGGTCCACTGACGGAATCTACTGCACC NO:1193) MPQLSFHLSHGSVPK CCGACTTCTGACATCCCGAGTCTCTGGACGACTGCCCGAGCTGCAACAC APLTQLDKKIKRAAK GCCGACTCTCCTCCACTCTGCCACTCACGTGGACATCCCCCCTTCCCTCTG HWLFLPQRASNEILY GCGTCCCCTTCCTTTCTATCAATGGGTCCCCCCTCTCCCCATTCCGAGTAC MSHLHGGQSLLPLSV AGTCCACCCTCACCAATGCCATTAGACATAACCACCTCTCCACTCTCATT LADIGQVTHAVALLQ GCCAAACGGGACCAGGGAAATTCCTACCGTACCTCACACGACCCCGACC SRDPAVADLALRTCR CCTCTAACTACTGGGTCAAGGGAGGAGACTTCCTCCGATTCTGTGACTG EVASKRAKKTVNGPE GCGATTCATACACAGAGCCCGACTGAACCTCCTCCCAGTGAACGGGGCG LAQYLSGSTDGIYCTP CGCCGATGGGACGCAAACAGCATCAAGACGTGCCGGCGCTGTGGTGCG TSDIPSLWTTARAAT CCCAATGAGACACTCGCCCACGTGCTCAACGTCTGCCCTGTCGGACTGC RRLSSTLPLTWTSPLP CGGAGATGAAGAAGAGGCACGATGCAATTCATGCCCGCATCAAGAAAG SGVPFLSINGSPLSPF CCCTCCGACCCTCTCCCCATACGGTAGTACACCACGACCGGACAGTTCCC RVQSTLTNAIRHNHL GGATGCGGCCCCCTGAGACCGGACATTCTGAGAATCAGCGAGAGAGAC STLIAKRDQGNSYRTS AAATCTGTTGCCATTGTAGATATCCACGTTCCCTTCGACAATGGCACTGA HDPDPSNYWVKGG TGCGGTTGAGCGAGCTCACGAGACAAAGCGGGCGAAGTACGAACTGAT DFLRFCDWRFIHRAR CCGGCGGCACTACGAACACCAAGGGTATCGAGTCACGTTTGACTCACTA LNLLPVNGARRWDA GTAGTCACAGCGCTTGGTAGACTGTGGCGGGGGAGTGAGGCAGCGCTA NSIKTCRRCGAPNETL CAAGCGCTCCAGATCTCGTCACAGTATAGTAAGCTCCTCCGTAAGTTACT AHVLNVCPVGLPEM CGTGGCTGATGCCATCCACGGCTCGAGAAACGTGTACGCCCATCATATG KKRHDAIHARIKKALR ACCGGTATGGTTATGTGATGGGAGAGTGAGGAATCTTCTCCACTGTGCA PSPHTVVHHDRTVP AAACCATACAGTCAGAAGATGCTAACTACTAGTTTGATACCCTGTGCCCC GCGPLRPDILRISERD CTGCAATGTCCCGCGTGTCGTACCCAAGCCCGGCTGGCATTGAGACACA KSVAIVDIHVPFDNG TTAGGCTCTCGCTCCCCCGTATACTCTCATAATTTCGTGTACGCTAATCCT TDAVERAHETKRAKY ACCCTACCCCTCCCTTTGACCACTCACCCAACCATGTGTATAGCTGTGCT ELIRRHYEHQGYRVT GGTGATCCCGGGGCGGTTATTCACTGGTTATCATATCATTCTAAAATGAT FDSLVVTALGRLWRG CTTTGATCTCTCAATTAACAACTAACTTATTTCTGTTTGTTTCATTGTTTCA SEAALQALQISSQYSK CCTCGTAAGAGGAAGTTCATTGTGCGATAAATCAA(SEQIDNO:1070) LLRKLLVADAIHGSRN VYAHHMTGMVM (SEQIDNO:1438) NeSL Utopia- AAGJ02140537 Strongylo- AAGGCTCAAACCAGGCTGCCAACCAAGCTGCCAGCTTAGGCACCAACCA AAGGCTCA TGACGA MSHSITEVFDYPLPSR 1_SP centrotus AGCTGCCAGTCCAGGCTCCAACCAGGCTGCCCACCAATCTGCCAGCTTA AACCAGGC AATGTTC WKCTVCLENFFNQQ purpuratus GGCTCCAACCAAGCTGCCCACCTAGCTGCCAGCTTAGGCACCAACCAAG TGCCAACC AATATAT TLKRHQARHHQTTSF CTGCCAGCCGAGGATCCAACCAGGCTGCCCACCTAGCTGCCAGCTTAGG AAGCTGCC GACATT LYVFRCSACQAEFDS CTCCAACCAAGCCGCCAGCCCAGGCTCCAACCAGGTTGCCACCCGAAAG AGCTTAGG CTAATGT ARKASNHWQSHKRK TCTGCCAGTTTAGGCACCAACCAAGCTGCCAGCCGAGGCTCCAACCAGG CACCAACC TCATGTA PILSQPAVNEIPSSGL CTGCCACTCGAGGCTCCAACCAGGCTTTCACCCGAAGCATTACCCAGGC AAGCTGCC TTTTTTG DPSPPRSRPPVEVIGS TGCCCGTTTAGGCACAAACCAGGCTGCCAGCCGAGGCTCCAACCAAGCA AGTCCAGG TTTGCTT SFPDDVSMLSEPSTP ACCAGCCCAGGTTCCAATCAAGCTGCCAGCCGAGGCTCCAAGTAAGCTG CTCCAACC GACAAA STSLQMDPEVVHPPS CCAGCCGAGGCTCCAACCAGGCTGCCACTCGAGGCTCCAACCAGGCTTT AGGCTGCC GCTAGA RSISFSPMHLSPTQPA CACCCGAAGCATTACCCAGGCTGCCCGTTTAGGCACAAACCAAGCTGCC CACCAATC TGAATA SPIQIGVEVSFNSSSSL AGCCGAGGCTCCAACCAAGCAACCAGCCCAGGTTCCAATCAAGCTGCCA TGCCAGCT TATTCCC QMDPEVVQPPSPSIS GCCGAGGCTCCAACCAAGCTGCCAGCCGAGGCTCCAACCAGGCTGCCC TAGGCTCC TGTCCTA FSPMHLSPTQPASPI ACCAAGCTGCCAACTTAGGCTCCAACCAAGCTGCCAGCCCAGGCTCCAA AACCAAGC CTACATA QIGVEVSFNSSGSLQ CCAGGTTGCCACCCGAATCTCTGCCAGTTTAGGCACCAACCAAGCTGCC TGCCCACC CATCTCG MDPEVVQPPSPSISF AGCCTAGGCTCCAACCAGGCTGCCACTCGAGGCTCCAACCAGGCTTTCA TAGCTGCC ATGTGC SPMHLSPTQPVSPIQI CCCGAAGCATCACCCAGGCTGCCCTTTTAGGCACAAATCAAGCTGCCAG AGCTTAGG CCTTGCT GIEVSFTSSSPLQMD CCGAGGCTCCAACCAAGCAACCAGCCCAGGTTCCAATGAAGCTACCAGC CACCAACC AGACGA PEVVQPPSPSMSYSP CGAGGCTCCAACCAAGCTGTCAGCCGAGGCTCCAACCAGGCTGCCCACC AAGCTGCC TCCATTG MHLSLTQPDSPIPVDI AAGCTGCCAGCTTAGGTACCAACCAAGCTGCCAGCCCAGGCTCCAACCA AGCCGAG GTAACC DVIPAAEVPLPDIEIPP GGTTGCCACCCGAAACTCTGCCAGTTTAGGCACCAACCAAGCTGCCAGC GATCCAAC ATGATA SPDRHPAAEVPLPDI CGAGGCTCCAACCAGGCTGCCACTCGAGGCTCCAACCAGGCTTTCACCC CAGGCTGC TCAATG EIPPSPDRHPVAEVPL CAAGCATCAACCAGGCTGCCCGTTTAGGCACAAACCAAGCTGCCAGCCG CCACCTAG GATTAA PDIEIPPSPDRHPQSP AGGCTCCAACCAGGCCTCCAACCAAGCTACCAGCCGAAGCTCTGCCAGA CTGCCAGC CACATG PRPVMMEQPVHTPP TTAGGCACCAACCAAGCTGCCACCGTTAGAGGCCCAGAGCCCACCAATC TTAGGCTC ATCTGTA PADTQQANGPQHW CCATAACGTGTGAGATATGTGAAGCTTCCTTCCACACCTCCGCCGGTCTC CAACCAAG AACAAT VTVLANATNWEDFG CGTCGCCACACGGCCAGGCTTCATCGCACCACTGCTGAACACACTGACG CCGCCAGC ATATGA RVCVEFANHAVEAA ACAGCATGAGCCATTCCATCACCGAGGTATTCGACTATCCGTTGCCCTCC CCAGGCTC TTTACAC RSRQDAPQVRPAAQ CGTTGGAAGTGCACAGTGTGCTTAGAGAACTTTTTCAATCAGCAGACCC CAACCAGG AATGTA RQPRRPTRPRQPTFD TCAAGCGCCACCAGGCCCGGCACCACCAAACGACCTCCTTCCTCTACGTC TTGCCACC TTTATTG VREASRLQKLYKRSKK TTTCGATGCTCAGCCTGCCAGGCTGAGTTTGATTCTGCTCGTAAGGCTTC CGAAAGTC TTTCAAT RAVRHILRDDAPSFS TAATCACTGGCAGAGCCACAAGAGGAAACCAATCCTCTCACAACCAGCA TGCCAGTT AAATCT GSNEQLLDYFKEIYAP GTGAATGAAATCCCTTCTTCGGGTCTAGATCCATCCCCTCCAAGGTCTCG TAGGCACC GTTCTTT PEIDENRAQQLAESL TCCACCGGTCGAGGTGATTGGTTCTTCTTTTCCAGATGATGTGTCTATGC AACCAAGC TCACTTT FTDLEEAKESAAALM TGTCAGAACCATCTACCCCCTCTACATCACTACAAATGGACCCAGAGGTG TGCCAGCC AATACA SPISQQEISTRLSRMS GTGCATCCTCCGTCACGATCCATATCCTTTTCGCCCATGCATCTTAGCCCC GAGGCTCC TGAAAC NSAPGKDRLEYRHIR ACTCAGCCAGCCAGCCCGATTCAAATAGGCGTTGAGGTTTCCTTTAACTC AACCAGGC ATGACT QADGACRVTHIMFN TTCCAGCTCACTTCAGATGGACCCAGAGGTCGTGCAGCCCCCGTCACCG TGCCACTC GTGCCT RCLQEHRIPSAWKEA TCCATTTCCTTCTCACCCATGCATCTTAGCCCCACTCAGCCCGCCAGCCCG GAGGCTCC TCTCCAA TTILIHKSGTTDDPAN ATTCAAATAGGCGTTGAGGTTTCCTTTAACTCTTCCGGCTCACTTCAGAT AACCAGGC CTGGAG FRPIALQSCLYKLFMG GGACCCAGAGGTGGTACAGCCCCCGTCACCGTCCATCTCCTTCTCACCCA TTTCACCC ACCTAC ILSDRMTQWACNHN TGCATCTTAGCCCCACCCAACCAGTCAGCCCGATTCAAATAGGCATTGA GAAGCATT ATAATTT LLSPEQKSARPCEGC GGTTTCCTTTACCTCTTCCAGCCCACTTCAGATGGACCCCGAGGTAGTGC ACCCAGGC GTTAAA HEHTFLLSSVIKDTKR AGCCCCCGTCACCCTCCATGTCCTACTCACCCATGCATCTTAGCCTCACTC TGCCCGTT ATGATA NQKTANIAWLDLRN AGCCAGACAGCCCCATCCCAGTAGACATTGATGTCATACCAGCGGCTGA TAGGCACA TAAATAT AFGSIPHQAIHAVLTT GGTCCCCTTGCCTGATATTGAGATTCCTCCTTCTCCGGATAGACATCCAG AACCAGGC TTCGAA IGAPVSLVMLLKDTYT CGGCTGAGGTCCCCTTGCCTGATATTGAGATTCCTCCTTCTCCGGATAGA TGCCAGCC GATGAA GASTSFLSTSGETDPI CATCCAGTGGCTGAGGTCCCCTTGCCTGATATTGAGATTCCTCCTTCTCC GAGGCTCC ATTATTA QIQSGVKQGCPMSAI GGATAGACATCCACAATCGCCTCCACGACCAGTGATGATGGAGCAGCCA AACCAAGC TTAATAA LFNLTIELIIRAVKKKA GTCCATACACCACCGCCGGCCGATACACAGCAGGCTAATGGTCCACAGC AACCAGCC (SEQID TDDGLGLVVHGQRL ATTGGGTGACTGTTCTAGCCAACGCCACCAATTGGGAAGACTTCGGGAG CAGGTTCC NO: SIMAYADDLVLMSKT AGTCTGTGTAGAGTTTGCTAATCATGCTGTAGAAGCAGCCCGGTCTCGG AATCAAGC 1317) PEGLDAILSVASEQAE CAAGATGCACCTCAAGTAAGGCCTGCTGCACAGAGACAGCCTCGCAGA TGCCAGCC TLRLAFKPTKCASLSLS CCGACTAGACCTCGTCAACCCACGTTTGATGTCAGGGAGGCAAGCCGTT GAGGCTCC CRHGTSVLPREYTVQ TGCAGAAGCTCTACAAACGCAGCAAGAAAAGAGCGGTTCGGCACATTC AAGTAAGC GHLMPALDEEEQYR TAAGAGATGACGCCCCCTCCTTCTCTGGCAGCAATGAGCAACTGCTTGA TGCCAGCC YLGVPFGLPRFTNLK CTACTTCAAAGAAATCTATGCTCCTCCCGAGATTGATGAAAATAGAGCA GAGGCTCC DLIGKLKGNIETIASSL CAACAGTTGGCAGAGTCTCTCTTTACCGATCTTGAGGAAGCGAAGGAGT AACCAGGC LAPWQKLDAIKTFVQ CTGCGGCCGCCCTAATGTCTCCTATATCTCAGCAAGAGATCAGTACACG TGCCACTC PGLSFVLRAADYLKSD CCTTTCTCGAATGAGCAATTCTGCCCCTGGAAAGGACCGCCTCGAGTAT GAGGCTCC LRSLKSAITTNVKKIC AGACATATTCGACAAGCTGATGGTGCATGCCGTGTTACCCACATCATGT AACCAGGC QLPLRAANAYIFAAK TCAACAGGTGCCTACAGGAACACCGTATACCATCAGCATGGAAGGAAG TTTCACCC ESGGLAFIDPNVDAD CTACAACCATCCTCATTCACAAGTCCGGTACAACCGATGACCCAGCAAAC GAAGCATT IQVITQAVRVLSSDDE TTCCGCCCGATCGCACTCCAGTCATGCTTGTACAAACTCTTCATGGGCAT ACCCAGGC VVQTIATSQLKSVVH CCTCTCAGACAGAATGACACAGTGGGCGTGCAATCACAACCTTCTCAGC TGCCCGTT RTIHAVPTEEDIDNYL CCAGAGCAGAAGAGCGCTCGCCCATGTGAGGGATGTCATGAACACACC TAGGCACA SGSNEGLLANSGNSG TTTCTTCTGTCGTCTGTCATCAAAGATACCAAGCGAAACCAGAAGACTGC AACCAAGC QASSLWSRTRSAARR CAACATCGCCTGGCTAGATCTTAGGAACGCCTTCGGAAGCATCCCCCAC TGCCAGCC LHLTLRATTSGTVVV CAGGCTATCCATGCAGTCCTGACAACGATAGGAGCACCTGTCAGCCTGG GAGGCTCC NQQADIDHTRDILPA TTATGCTCCTGAAGGACACATACACGGGAGCATCCACATCCTTTCTTAGC AACCAAGC SITRGLRLIQRTTNAE ACGAGTGGAGAGACTGACCCAATACAGATCCAGTCCGGTGTTAAACAA AACCAGCC KLKSLPDQGKVARSL GGATGCCCGATGTCTGCCATCCTCTTCAACCTAACCATTGAACTTATCAT CAGGTTCC SNDPFANGSSWHAT CAGAGCCGTCAAGAAGAAGGCAACGGATGATGGCCTCGGGCTTGTGGT AATCAAGC GKFIRFCDWRFIHRA GCATGGACAACGCCTATCCATCATGGCCTACGCTGACGACCTCGTCCTG TGCCAGCC RLNCLPTNVATKRW ATGAGCAAGACACCAGAAGGCCTGGATGCCATTTTGTCTGTGGCATCCG GAGGCTCC KANANGKNGHQQE AGCAGGCAGAAACCCTACGTCTGGCATTCAAGCCTACCAAGTGTGCCTC AACCAAGC TLPHVLNHCLPNMV CCTCTCACTCTCATGTAGACACGGTACATCTGTCCTTCCACGCGAATACA TGCCAGCC PIRRRHDNIQQRLVT CCGTCCAAGGTCACCTGATGCCAGCTCTCGACGAGGAAGAGCAGTACC GAGGCTCC AIRHGDVFVNQHVP GCTACCTCGGAGTTCCCTTCGGCTTACCTCGTTTCACAAATCTCAAGGAC AACCAGGC GDPNPRERPDITVIE CTGATCGGCAAGCTGAAGGGAAACATCGAGACCATTGCATCATCATTGC TGCCCACC GNKVTVIDISVPFDN TGGCCCCATGGCAAAAGTTGGATGCAATCAAAACCTTCGTCCAACCAGG AAGCTGCC GPNACTTAAQAKVE TCTCAGTTTCGTACTGCGGGCAGCGGACTACTTGAAGTCGGACTTGCGT AACTTAGG KYSALRQALRDMGR TCTCTGAAATCTGCCATCACCACCAACGTCAAAAAGATATGCCAGCTTCC CTCCAACC DVEVHGFIVGALGT CCTTCGGGCGGCCAACGCATACATCTTCGCGGCTAAGGAATCGGGTGGT AAGCTGCC WHQGNERALGRLGV CTAGCCTTCATAGACCCCAACGTAGATGCGGACATCCAAGTCATCACTCA AGCCCAGG SRWYRTLMRKLCCID AGCCGTTCGTGTCCTGTCGTCTGACGATGAAGTCGTTCAGACAATAGCA CTCCAACC AIQASRDIWVEHVTG ACATCCCAACTCAAGTCGGTTGTTCATCGCACAATCCACGCAGTCCCTAC AGGTTGCC HRQYE(SEQIDNO: TGAAGAGGACATCGACAACTACTTGTCTGGCAGCAACGAGGGCCTGCTT ACCCGAAT 1439) GCCAATTCGGGTAATAGCGGCCAAGCTTCTTCTCTGTGGTCTCGAACCA CTCTGCCA GGTCAGCCGCAAGAAGACTTCATCTTACCCTCAGAGCCACCACCAGTGG GTTTAGGC TACAGTTGTTGTCAACCAGCAGGCAGACATCGACCACACAAGGGACATT ACCAACCA CTACCAGCATCAATAACAAGAGGACTGAGGCTCATCCAACGAACCACAA AGCTGCCA ATGCTGAGAAACTCAAGTCCCTACCAGACCAAGGGAAAGTTGCCAGATC GCCTAGGC CCTCAGCAACGATCCATTCGCTAACGGCTCTTCTTGGCATGCAACTGGAA TCCAACCA AGTTCATCCGCTTCTGCGATTGGCGGTTCATTCACCGAGCCAGACTGAA GGCTGCCA CTGCCTCCCCACGAATGTAGCCACGAAGAGGTGGAAGGCAAATGCGAA CTCGAGGC TGGCAAGAATGGCCATCAACAAGAGACACTGCCTCACGTACTGAATCAT TCCAACCA TGCCTACCCAACATGGTCCCCATCAGACGCCGCCATGACAACATCCAAC GGCTTTCA AACGACTGGTAACAGCCATCAGACATGGCGATGTCTTTGTCAATCAACA CCCGAAGC TGTACCGGGAGACCCCAACCCAAGAGAACGCCCAGATATCACTGTTATC ATCACCCA GAGGGAAACAAAGTCACAGTCATCGATATTTCTGTCCCCTTCGACAATG GGCTGCCC GTCCCAACGCATGCACCACAGCTGCACAAGCCAAAGTCGAAAAGTACTC TTTTAGGC TGCCCTCCGTCAAGCCTTGAGAGATATGGGAAGAGACGTCGAAGTCCAT ACAAATCA GGCTTCATAGTTGGTGCCTTAGGAACCTGGCACCAGGGCAACGAACGT AGCTGCCA GCTCTCGGACGCCTGGGTGTCAGTCGCTGGTACAGAACACTGATGAGA GCCGAGG AAGCTGTGCTGCATTGATGCTATCCAAGCTTCCAGAGACATTTGGGTTG CTCCAACC AACATGTCACTGGACATCGACAGTACGAGTGACGAAATGTTCAATATAT AAGCAACC GACATTCTAATGTTCATGTATTTTTTGTTTGCTTGACAAAGCTAGATGAA AGCCCAGG TATATTCCCTGTCCTACTACATACATCTCGATGTGCCCTTGCTAGACGATC TTCCAATG CATTGGTAACCATGATATCAATGGATTAACACATGATCTGTAAACAATAT AAGCTACC ATGATTTACACAATGTATTTATTGTTTCAATAAATCTGTTCTTTTCACTTTA AGCCGAG ATACATGAAACATGACTGTGCCTTCTCCAACTGGAGACCTACATAATTTG GCTCCAAC TTAAAATGATATAAATATTTCGAAGATGAAATTATTATTAATAA(SEQID CAAGCTGT NO:1071) CAGCCGAG GCTCCAAC CAGGCTGC CCACCAAG CTGCCAGC TTAGGTAC CAACCAAG CTGCCAGC CCAGGCTC CAACCAGG TTGCCACC CGAAACTC TGCCAGTT TAGGCACC AACCAAGC TGCCAGCC GAGGCTCC AACCAGGC TGCCACTC GAGGCTCC AACCAGGC TTTCACCC CAAGCATC AACCAGGC TGCCCGTT TAGGCACA AACCAAGC TGCCAGCC GAGGCTCC AACCAGGC CTCCAACC AAGCTACC AGCCGAA GCTCTGCC AGATTAGG CACCAACC AAGCTGCC ACCGTTAG AGGCCCAG AGCCCACC AATCCCAT AACGTGTG AGATATGT GAAGCTTC CTTCCACA CCTCCGCC GGTCTCCG TCGCCACA CGGCCAG GCTTCATC GCACCACT GCTGAACA CACTGACG ACAGC (SEQID NO:1194) NeSL Utopia- Trichinella TTTCTGGTATGAATCCCAAGCGGATTCGTTACGAAATTTGCATAAGTTTT TTTCTGGT TGATCC MCSAKTPALKSGRRR 1_TSP spiralis TGAAAAAATAGGCATTTGGTCGAGTGCTCGCACCACCATTTGTCGCGGG ATGAATCC GTCCCG GKEVNYEGQIVRVER TCGTCCTGATATTGCACTACATTCAGGAACGGCCTATTCCCTTCGGGGAA CAAGCGG AACCAA RRGSRSSTSATDLGT TTGTGTTTTAGGAATTGGAATCGGTTTGGTTACGATCGGTCGAGTGGTT ATTCGTTA CGGAAC RMVTRGRKKLMEAS CGTGAGATCGAGTGACAGCCGGGTGGCAGCGACAATGTGTAGTGCAAA CGAAATTT CACATT VREAGHHGGESAST GACACCAGCCTTGAAGTCGGGTCGCCGTCGCGGAAAAGAGGTCAACTA GCATAAGT GCCGCA VDVDVVESKKITGKT TGAAGGCCAAATTGTACGGGTAGAAAGACGGCGTGGATCTCGCAGTAG TTTTGAAA TGACTTC ARRNRRAPSGDGKR TACGAGCGCAACGGACCTTGGTACGAGGATGGTTACGCGTGGGCGAAA AAATAGGC GATTTC RESCGAECGQAVCG GAAGCTGATGGAGGCGTCGGTTCGTGAAGCGGGTCACCACGGAGGTG ATTTGGTC GCTTTTG NAVADRSEASSPRTP AATCGGCGAGTACTGTGGACGTCGACGTGGTGGAATCGAAGAAAATTA GAGTGCTC CTCTTTT NVSKSGRDKCGQPTI CGGGTAAAACGGCGCGCCGTAATAGAAGAGCGCCAAGCGGTGATGGA GCACCACC TGTATTT KASTPSPPKRKPTTSS AAAAGACGAGAGAGCTGTGGTGCGGAATGTGGCCAAGCCGTATGCGG ATTTGTCG AATTTTG SPRTPCLSKRGARSKI TAACGCTGTTGCGGACCGTAGTGAGGCCAGTTCGCCCCGCACTCCCAAC CGGGTCGT CTATTAA PSTPDTPSTSGGSGK GTTAGTAAATCTGGTCGTGACAAGTGTGGTCAACCCACTATAAAAGCCT CCTGATAT CAATTCA QRVLVSPLLRTEKLPD CCACACCATCACCACCGAAAAGAAAACCGACCACTTCGAGTAGCCCACG TGCACTAC GTTTGTT LEVLQRTEEQVTVRA TACGCCATGCCTCTCCAAGCGCGGAGCGAGATCCAAAATTCCGTCCACG ATTCAGGA AACTGTT TFPIAQAVVCPLGCE CCAGATACACCATCGACTTCTGGCGGTTCAGGTAAGCAACGTGTTTTGG ACGGCCTA TTGTATT KPYTAVRPDGQFAH TGTCGCCGTTGCTTAGGACGGAAAAATTGCCCGACCTCGAAGTTCTACA TTCCCTTC CATTTGA QTLTRHFMRVHNCH AAGAACCGAGGAACAGGTGACTGTACGGGCAACTTTCCCAATCGCCCA GGGGAATT AGATCC SVQWHYRCRNCNTD GGCGGTGGTCTGCCCACTGGGATGCGAGAAACCGTACACGGCCGTGAG GTGTTTTA AAATAA FLPADHRYPLRVVNT ACCAGACGGTCAATTTGCGCACCAAACGCTGACTCGTCATTTCATGCGA GGAATTGG AAC HVRSCVSRWEITRKL GTCCACAATTGTCACAGCGTCCAGTGGCATTACCGGTGTCGCAATTGTA AATCGGTT (SEQID GESEDLHGVRCDLCD ATACGGATTTCTTGCCGGCAGATCACCGGTATCCCCTCCGAGTTGTGAA TGGTTACG NO: YVGVSKRAVGLHRRR CACCCACGTTCGCAGCTGTGTTAGCCGGTGGGAGATAACCCGGAAGTT ATCGGTCG 1318) HANENIMQNTGTAA GGGAGAGAGCGAAGATTTGCATGGAGTTCGTTGCGATCTTTGCGATTAC AGTGGTTC QIEALSKQVGEIRVA GTAGGCGTGTCGAAAAGAGCCGTCGGATTGCATCGCCGCCGGCACGCC GTGAGATC GDYSQFKFGKRVRQY AACGAGAACATTATGCAAAATACGGGTACGGCCGCTCAAATCGAGGCC GAGTGACA VAPTQRRDGLDEEEV TTGAGCAAACAAGTCGGCGAAATACGTGTAGCAGGCGACTATAGTCAG GCCGGGT HEAEEEEVPAESRTIL TTTAAGTTTGGTAAACGGGTTCGGCAATACGTGGCACCAACGCAGAGG GGCAGCG GEPSTATAIGAEEISA CGAGACGGGTTGGACGAAGAGGAGGTTCATGAAGCCGAAGAAGAGGA ACA(SEQ TGPVRADTAAQQMI GGTGCCAGCCGAATCACGTACCATCCTAGGTGAACCCAGCACCGCAACT IDNO: CRIGQWCVWPQDY GCTATAGGAGCTGAGGAGATCTCTGCAACGGGTCCAGTTCGAGCAGAC 1195) HSIPAPQCWTDTLM ACAGCTGCTCAGCAGATGATCTGCCGGATCGGGCAATGGTGTGTCTGG DLMIEQIVLQRYPDG CCGCAGGACTACCATTCGATCCCGGCACCTCAGTGCTGGACGGACACGT AGVSVMSCSAVSAAI TGATGGACCTCATGATTGAACAAATCGTTCTGCAGCGCTATCCCGATGG HHEISAEFAAQVMSS TGCGGGTGTGAGCGTGATGAGCTGCTCAGCAGTAAGCGCAGCCATACA HDASLYCIIPVNVRN CCACGAGATATCTGCAGAGTTTGCCGCTCAGGTCATGTCTTCACATGAT HWQMIVLDVAERVV GCCAGTTTGTATTGCATCATTCCTGTGAATGTACGTAATCATTGGCAGAT HYYCSLREHNTVVLSS GATTGTGCTGGATGTGGCTGAACGAGTAGTCCATTACTACTGTTCTCTCC LLSLVELSGKHTGCTS GCGAGCATAACACTGTTGTTTTATCGTCTCTGTTATCGCTAGTGGAGCTG WKIETHDGAPVQTN AGTGGTAAGCACACTGGCTGCACCTCGTGGAAGATCGAGACTCACGAT AFDCGPFSCLFLKHLL GGCGCGCCGGTGCAAACGAATGCGTTCGATTGTGGTCCGTTTTCGTGCT HGIDMNFGDRESAA TGTTTTTGAAGCACCTGCTGCACGGCATCGACATGAATTTCGGTGATCG LRTDLKFMIDAVSTP GGAGTCGGCGGCTTTGCGCACCGACCTGAAGTTTATGATCGATGCGGTA VVPATDKLKKKPDGS TCCACTCCCGTCGTTCCGGCGACGGATAAGCTAAAGAAAAAACCTGACG ATQLTQFQQKFLSAS GCTCAGCGACTCAGTTGACCCAGTTCCAGCAGAAGTTCCTGAGCGCTTC DNWQSPDVDLQAV TGACAACTGGCAATCTCCTGATGTCGACTTGCAAGCGGTGTATGACGAA YDEVVESIVSGHNEP GTGGTGGAGAGTATTGTGTCCGGACACAATGAACCGAGCTCTCGTAAC SSRNTSRLQKKSPGK ACCAGCCGTTTGCAAAAGAAAAGTCCGGGTAAAGGAGGTAAAGGACAG GGKGQVRRRSAVTR GTTCGCCGACGCAGTGCGGTAACGCGTGACCCGGCATGGTTGAAATCT DPAWLKSASAVQKA GCGTCAGCGGTTCAAAAGGCGTTCAATTCGGCACCGGCCCGTACTGTGA FNSAPARTVNAILRR ATGCAATCCTCCGGCGTCCGAATGCCTGTCCTTCGTTTACAGCCACGCAA PNACPSFTATQVAD GTGGCAGATCATTATTTCAATCTCAGACCGGCGGTTACCTCGCTTGCACC HYFNLRPAVTSLAPE CGAGGTTATTGACATCTTACCTCCGCCCGCCACTGACCACTCGATGCTTG VIDILPPPATDHSMLV TTGCGGAGTTGAGTGAGTCGGAGGTCTGGGAAAAGATGCAGAAAGCA AELSESEVWEKMQK CCCAACTCTGCGCCTGGTGCGGACAGGATCACCATTCGTATGGTGCGGA APNSAPGADRITIRM TGGCAGATCCAGGTGCGATGATTCTCACCAGGTTCTACCGTGCCTGTCT VRMADPGAMILTRF ACTAAGAAAATGGGTGCCCCTGCAATGGAAGCAATCCGTGTGCAAGCTC YRACLLRKWVPLQW CTTTATAAGGACGGGGATAAGGAGCGCCTCGCCAACTGGCGGCCTATC KQSVCKLLYKDGDKE GCCCTGGAGCCCGTGTTACAGCGGGTGCTCAGTGCGGTAGTCGCCAGC RLANWRPIALEPVLQ AGAGTGACGAACTGGGCGAGGGCAAACGGGTTGATTTCCTTGGAAGCC RVLSAVVASRVTNW CAGAAAGGCTTCCAGCCAGCTGATGGTACCTCCGAGCACAATTTCGTCA ARANGLISLEAQKGF TGGAGGTTGCAATTCAAGAGGCAAGACGCACTAATGCACAGCTTGCCAT QPADGTSEHNFVME CAGCTGGTTAGACATCTCCAACGCCTTTGGCACAGTGTCGCACCAATTGT VAIQEARRTNAQLAI TGTTTTCGTTGTTGGAGCGCTATGGGCTGGACCCGACTTTTACCAGTTTT SWLDISNAFGTVSHQ ATCCAGAATTTGTATAAAGACGCGACCATTGTAGTCAAAGGCGCCAATG LLFSLLERYGLDPTFTS GAACCCACGTTACCGCGCGCTGGTCAGTCGGAGTACGCCAAGGTGACC FIQNLYKDATIVVKGA CTTGCAGTGGTATCTTGTTTTGTTTGTTTGTTGAGCCACTCTTGCGGTCA NGTHVTARWSVGVR GTCTTACCATCTTTACCGTGTGAGGCGGAAACGACTGCGGTTAACGTGT QGDPCSGILFCLFVEP TAGGACAGCCGATCACAGCGCTGGCTTACGCTGATGACATCGCGCTGTT LLRSVLPSLPCEAETT CGCCCCATCCATCGGGGTTATGCAACAACAGCTGTGCAAAATCCAGGGC AVNVLGQPITALAYA ATGGCCAGCGCCATGGGCTTTCGGTTCAACCCGAAGAAGTGCGCGTCCC DDIALFAPSIGVMQQ TGTACTTGAATCGTGCAGTAGTGAATGCCGCAACATTCACGATAAGCGG QLCKIQGMASAMGF GGAGGAAATACCTGCTCTCGTACACGGGGACACTTTTCGCTATCTTGGA RFNPKKCASLYLNRA GTAGCCGCTGGGCTGGGTAAACCCCAGACTCCTTTCTCATTACTGCGTG VVNAATFTISGEEIPA AGAACCTCCGAGAAGCGGAACTTATATTCCGGTCAAAGCTGGCTCCGTG LVHGDTFRYLGVAAG GCAGAAGATGGATGCGTACCGGACTTACGTACTGCCACGTTTGACATTT LGKPQTPFSLLRENLR CAGCTTATGATTGCTAAATTTAACAACATCAAGCAATCGGCCGGGCAGT EAELIFRSKLAPWQK ATGACCGCGCAATATTGAGGCTCGTGAAGCGTTGCTTCCAGCTTCCGGT MDAYRTYVLPRLTFQ GGAGACGTCCACCGACTTCATTAGAGCCCCCAGACAGTGTGGTGGTCTG LMIAKFNNIKQSAGQ GGCGTGCCATCATTGCGTGAGTTGTACGCAACCGCAAAGGTTTCGCGG YDRAILRLVKRCFQLP GCACTCAAGATGCTCTGGTCTCCCTGTCGGGTAGTCTCGAGCCTGGCAG VETSTDFIRAPRQCG CCAGTCAGCTTCAGCGAGTTGCGTCCGCGTACTTTGCGAAGAGGTTGAG GLGVPSLRELYATAK AGATGTTGAGGCGGCTGACCTCTCTACCTTCATGAACGCCGCGCGATCT VSRALKMLWSPCRV ACGCCGCTAGATCGGTCAGGGTATCCGACTTGCTTGTGGATGGACGCAC VSSLAASQLQRVASA GCAAGCAGATGTCGTACCTGACAAAGGTAGCCGGCGTTGACTGTTACTT YFAKRLRDVEAADLS CCTCGTTGGCGAGGCTGGGACCTCTTTCTTCATCCGGAATGGCCTCGGT TFMNAARSTPLDRSG CAGACGGTCTCAGTTCTAAGCCCCTTGAGGAAGAACAAAGTTATGAGCG YPTCLWMDARKQM TTTTAGGGGGGGCGATCCAAACTCGCCATCTGGACGCCTGGCTCCAATG SYLTKVAGVDCYFLV CAAGCGCCAAGGTAAGACTGCGAGCTGCATTGTGCTCGACCGGTCATCC GEAGTSFFIRNGLGQ TCCCGTTTTATAACGACCGGACGGTACACCAGCTTTGCAGCTATGCGCTT TVSVLSPLRKNKVMS TGCGTTGCCCGCTCGCCTGGATTTGTTACCTTGCCGGGCGAGATCCTCTA VLGGAIQTRHLDAW TGCGTTCGTACCAGAACTGTCGGCGGTGCGGTTACGACCGAGAGACCCT LQCKRQGKTASCIVL GCCGCACATCCTTCAGCACTGTCGGCAGTTCAGCGCCCCCGCTTACCAG DRSSSRFITTGRYTSF GCCCGGCATGACGCAGTTCAGGGACGCCTGGAAACGGTCATGCGTAGG AAMRFALPARLDLLP CGGTTCCCCAACCTCCGCGTAAACCGGGCTCTACCGGAGATTGGGTCGA CRARSSMRSYQNCR ACAAGCGGCCTGATCTAGTGGTCGTTGACGAGGAAAAGCGGTTAGTGA RCGYDRETLPHILQH TTCTGCTTGATATCGCAATCGTGTTTGAAAACACTGCCGCTGCCTTCGTC CRQFSAPAYQARHD GATGCCCGGACCAGGAAGTGGGCGCACTATGAAAAGGAGATCCTCGCC AVQGRLETVMRRRF TACCGGCTACGAGGATACTCCGTGACGTACGACGCCATCGTCGTCGGTG PNLRVNRALPEIGSN CTCTGGGGACGTGGGACCCGAAGAACGATGCCATCCTGAAAAGGATCG KRPDLVVVDEEKRLVI GTGTCGTCTCCCAAAGGTACCTACGGCTCATGAAGGTTCTCGTGGTGTC LLDIAIVFENTAAAFV CGAGATGTTGGAACATTCTTCACGGATCTACAGAAAACACCTCGGCCTG DARTRKWAHYEKEIL CGAGACCTTCTCCCGGACACCGGCACGAAACGGCGACCGGTTGGCACG AYRLRGYSVTYDAIVV ACGGAGACAGACCCACCCGGTGGTGATCTCCGTCAGAAGAAGCGCAAT GALGTWDPKNDAIL ACCATCAGCGCTAGGGCATCAGGTGGGAAATGCCTGGAACGGCGGTTC KRIGVVSQRYLRLMK ACGTCACCCGTCGGGACGCCCTCGCAGCGTGGTGAGCTGCAGTGCCAA VLVVSEMLEHSSRIYR CCGTGCCCTGGCCCTAGAAGACCCGCCCTCGCTGGAATAGCGCCCAACC KHLGLRDLLPDTGTK CGCCTAGTCTCCAGCCCAGAAAACCACCCCCGCGGCAACACCAGAAGCC RRPVGTTETDPPGGD GGTTACGAAGTCTACAGCGCATTGATCCGTCCCGAACCAACGGAACCAC LRQKKRNTISARASG ATTGCCGCATGACTTCGATTTCGCTTTTGCTCTTTTTGTATTTAATTTTGCT GKCLERRFTSPVGTPS ATTAACAATTCAGTTTGTTAACTGTTTTGTATTCATTTGAAGATCCAAATA QRGELQCQPCPGPR AAAC(SEQIDNO:1072) RPALAGIAPNPPSLQ PRKPPPRQHQKPVTK STAH(SEQIDNO: 1440) NeSL Utopia- Chelonia GGCAGAAACTGCACSTTCTAGAAGACTCACTGCCTATCCTGAGGAAGAC GGCAGAA TGAGCT MLQLRLPTPQTLRLL 2_CMy mydas TACCGCTTTGGAGATGGATTCTACTGCCGCTGCTTCAAAAGAAAATCTTC ACTGCACS GGAGTG HPSQLPQSHSTKRW ATGCTGCTTCGGAGGCTCCAGGACAGASTGAGAAGATGATCGCTCCCTC TTCTAGAA CCGATG SNIYEERARAPKDTTI GCCGATTCCACAGAAACCTTCAACTGCCGCTCGGACCGCTGCTGCTCCA GACTCACT AGAAGC ERVSAASKIPKLDPAK CGGAGCGCCCATCGGGGAACAGCCTCTAAGAGACCCTCAGCGCTTCCA GCCTATCC GCAGTC RRIGAPLQLMQGNSI GGACATCGCAGATGAGCAGCATCGATTGGAAAAGCAGCGCCTCCCTGA TGAGGAA GGGAAA SRQLSASSQYVQHN AGAAAACCCGTGGAGATGCTGCTGCAGGGAWATCTTGCACCTCGAGGA GACTACCG GTAACT AVVRRVSAPPHTGNF CGGCCTCCCAGGACATCGCAGCCAGGACAAACATCATCTCGCCTGCTCT CTTTGGAG GAAATA TGSCRRKLALPHRPP TCAGGASAAGGATGCCMSAAGAACTTMTCCCACCTCCTCMACTGCCCA ATGGATTC CTTTCCT SETQPAESYLHCWTT GGATCCTSATGCTGGTCGTCGTCCTGCTGTGCTGGAAGMKACCCCAACT TACTGCCG CATGGA QRDPTLADQHGLQD GGCGAGACCTCAGAAACCACCCAGMWGGACCKCAACATCTCACTAGAT CTGCTTCA TTGTATT HPTGLPPEGSHCIKD GCCTGCCCAGCCGAACCTCTCCATGCWACTCTCCCAGAGCAACSAGAAC AAAGAAA TTCTAAA PRQNSSTEGQRRSG CATCCAGMGAATCCGCTGATATGACTGAAGCCMATCCAACAGAGGGA ATCTTCAT TGGACA NSQARRIPKRASTAA GAAGGAAAGGAGAATGACTGCATCTATCTCCAGTATCCCCTCCCTACGG GCTGCTTC ACCTACC SKTVLPKRTSAALKSV ACACGCTCCTCTKCCCCTTCCGCTATCCGAGGGTTCCAGTACATTGGCAG GGAGGCTC TAATTCT REDTALVLEDPAKWS TCTCAGCAMACACCTCAAGAGAATCCATASCAAGCGGATCACCTTCCGG CAGGACA CAATTAC SQHREGXRQQANPT TGTGCCCTCTSCGACCTGCCTTTCGAGACGCAGATGAAATGTAAGTCTCA GASTGAGA TGAGGG AVFQPEPAEIEQQPX TCAAGTCACCTGCAAAGGACATCTCGAACTGGAAGAGTCCAACTTTACC AGATGATC ACAATCT VRAATPWQAAWME AGTCTATGTTGCCGCCACCCCATCTCTGCTCCGAAAGCAGAAACACCACA GCTCCCTC CCACTCA ELARTASFXDFDLLVD TGCTCCAGCTCCGGCTTCCGACTCCCCAAACACTCCGGCTACTGCATCCC GCCGATTC TTGATAT RLTKDLSAEIVSGRKG AGCCAACTCCCCCAGAGCCACAGCACGAAGAGGTGGTCCAACATCTATG CACAGAAA ATTTTGC TQENTPTAHRQNQN AGGAAAGGGCCCGGGCGCCCAAGGATACTACCATTGAAAGAGTAAGTG CCTTCAAC TTTCCAC NMREARRRNISRCYD CTGCCTCAAAAATCCCTAAGCTGGACCCTGCCAAGAGGAGGATCGGCG TGCCGCTC AACCAA PAAASRIQKLYRSNR CTCCCTTGCAGCTTATGCAAGGCAACTCCATCTCCAGACAGCTTAGTGCT GGACCGCT ATCTCTG PKAMREILDGPSSYC TCCTCGCAGTATGTCCAGCACAACGCCGTCGTCAGAAGGGTCAGCGCCC GCTGCTCC TACAACT AIPSERLFLYFKGVFD CTCCACACACGGGTAACTTCACGGGATCCTGTCGTCGGAAGCTAGCACT ACGGAGC TTTCATG RVAQNDMQRPECLX ACCTCACAGGCCACCCAGCGAGACCCAGCCAGCGGAGAGCTATCTCCAC GCCCATCG AGTGAT PXPRVDYAEDXEQDF TGCTGGACCACGCAACGGGACCCCACCCTAGCGGACCAGCACGGCCTC GGGAACA GTACCC TSWEVEARLTKTKNT CAAGACCACCCAACGGGATTGCCCCCAGAAGGCTCACACTGCATCAAAG GCCTCTAA GAGTAC APGKDGIRYNFLKKR ACCCCAGACAGAACTCATCCACAGAAGGGCAGCGTCGCTCCGGAAACTC GAGACCCT TTGGATT DPGCLVLTAIFNKCK CCAAGCAAGACGCATCCCTAAAAGGGCCAGTACCGCTGCCTCTAAAACT CAGCGCTT CTAATAT QFRRTPSSWKKSMM GTCCTTCCTAAAAGGACCAGTGCTGCCCTAAAATCCGTAAGGGAAGACA CCAGGACA CTAAACT VLVYKKGKQDNPNT CTGCCCTCGTTCTGGAAGATCCGGCCAAATGGAGTTCTCAACACCGGGA TCGCAGAT GTATTGT RRPISLCSTMYKLYAS GGGAAAWAGGCAACAGGCTAATCCCACAGCAGTTTTCCAGCCTGAGCC GAGCAGC TAAATCT CLAARITDWSVNGG TGCGGAGATTGAGCAGCAGCCCWCGGTGAGGGCCGCCACACCATGGC ATCGATTG ATTCACC AISSIQKGFMSCKGC AAGCCGCTTGGATGGAGGAGTTGGCGAGGACAGCCTCCTTCGAKGACT GAAAAGC TAAATTT YEHNFVLQTAIHMAR TTGACCTCCTTGTAGACAGACTCACTAAAGACCTGTCTGCAGAAATTGTA AGCGCCTC GGGTTA RAWRQCAIAWLDLA TCTGGGAGGAAGGGAACTCAGGAGAACACACCAACTGCKCACAGACAG CCTGAAGA TTGCTG NAFGSMPHQHIFDM AACCAAAACAACATGAGGGAAGCCAGGAGAAGGAACATCAGTCGCTGC AAACCCGT ATTATGT LREFGMPENFLQLVR TATGACCCAGCAGCAGCATCCAGGATCCAGAAGCTCTACCGGTCAAACC GGAGATG ACTCTAT ELYEGCTTTICSMEGE GCCCTAAGGCTATGAGAGAGATCCTCGATGGGCCCTCATCCTACTGCGC CTGCTGCA GTATCAT TPEIPIRSGVKQGCPL AATCCCGTCCGAGAGACTATTCTTGTACTTCAAGGGGGTGTTTGACCGT GGGAWAT ATGACTT SPIVENLAMEPLIRAIS GTAGCTCAGAACGACATGCAGCGTCCAGAGTGCCTTCAKCCTTWACCCC CTTGCACC TTAAAA SGLGGFDLYDNRVNI GGGTTGACTATGCAGAGGACCWGGAACAAGACTTTACATCATGGGAA TCGAGGAC ACAAAC LAYADDLVLIADNPES GTAGAGGCCAGACTTACAAAGACCAAAAACACAGCCCCAGGAAAAGAT GGCCTCCC TTTGTAT LQQMLDITSQAANW GGCATTCGCTACAACTTCCTGAAAAAACGAGACCCCGGTTGCCTGGTAC AGGACATC TTGTGG MGLRFNARKCASLHI TAACTGCCATTTTCAACAAATGCAAACAGTTCCGCCGTACTCCCAGCTCC GCAGCCAG ATAATCT DGSRRDSVQATSFQI TGGAAGAAATCTATGATGGTGCTCGTCTACAAGAAAGGCAAGCAAGAC GACAAACA AAGCAC QGEPMIFLEDGQAY AACCCCAACACTCGGAGACCCATCTCTCTCTGTTCCACCATGTACAAACT TCATCTCG TATACCC QHLGTPTGFRVQQT GTATGCCAGCTGCCTTGCGGCTAGGATCACAGACTGGTCGGTGAACGG CCTGCTCT AGATGT PEDTIAEILRDVARIDS AGGAGCCATCAGCTCCATCCAGAAAGGCTTCATGTCATGCAAAGGCTGC TCAGGASA ACAGAC SLLAPWQKINALNTF TACGAACACAACTTTGTCCTTCAGACTGCCATCCACATGGCCAGGAGGG AGGATGCC ACTCTTT LIPRISFVLRGSAMVK CATGGAGGCAATGTGCCATAGCATGGCTCGACCTGGCTAATGCTTTTGG MSAAGAA TCCCAAC VPLNKADNTIRQLVK ATCGATGCCCCACCAGCACATTTTTGACATGCTGCGAGAGTTCGGGATG CTTMTCCC CTATGTA KWMFLPQRASNELV CCTGAAAACTTTCTCCAACTGGTCCGGGAACTGTATGAAGGCTGCACCA ACCTCCTC TTATATT YISHRQGGANVPRM CCACCATCTGCTCCATGGAAGGAGAGACACCCGAAATTCCTATCCGTAG MACTGCC TTTTTAA GDLCDVAVITHAFRL CGGCGTGAAGCAAGGCTGCCCCCTGAGCCCCATCGTTTTCAACTTAGCC CAGGATCC CATTAG LTCPDAMVRNIAESA ATGGAGCCGCTCATTCGAGCGATCTCCAGCGGTCTAGGCGGTTTCGATC TSATGCTG CTTTAAT LQDAVKKRIARTPSN TGTATGACAACAGAGTGAATATCCTGGCCTACGCAGACGATCTGGTCCT GTCGTCGT AAAATTT QDVATYLSGSLEGEF GATCGCAGACAACCCCGAGAGTCTCCAACAAATGCTTGACATCACCAGC CCTGCTGT TTAAA GRDGGDFASLWTRA CAGGCTGCCAACTGGATGGGACTCCGCTTCAATGCCAGGAAGTGTGCAT GCTGGAA (SEQID RNATRRLEKRIGCHW CCCTGCATATCGATGGCAGTAGAAGGGATTCGGTCCAGGCGACGTCTTT GMKACCC NO: TWCEERQELGVLVP TCAGATCCAGGGTGAACCCATGATCTTCCTCGAGGACGGGCAAGCGTAC CAACTGGC 1319) QVKNTDHTIITPRART CAACATCTCGGCACGCCGACGGGTTTCCGCGTCCAGCAGACACCTGAGG GAGACCTC MLERTLKDAIRCQYV ATACCATCGCGGAGATCCTACGAGATGTGGCCAGGATCGACTCCTCCCT AGAAACCA ENLKRKPDQGKAFEV ACTGGCACCATGGCAGAAGATCAACGCCTTGAACACCTTCCTGATCCCC CCCAGMW TCKWDASNHFLPGG CGTATCTCATTCGTCCTGAGGGGATCTGCCATGGTGAAGGTACCTCTGA GGACCKCA SFTRFADWRFIHRAR ACAAGGCAGACAACACCATCCGGCAGCTGGTGAAGAAGTGGATGTTTC ACATCTCA LNCVPLNGAVRHGN TTCCCCAGAGGGCCAGCAATGAACTGGTGTACATCTCGCACAGGCAGG CTAGATGC RDKRCRKCGYANETL GCGGTGCCAACGTCCCTCGAATGGGTGATCTGTGCGACGTTGCCGTGAT CTGCCCAG PHVLCSCKPHSRAW CACTCATGCCTTCCGCCTTCTGACGTGCCCGGATGCCATGGTGAGGAAC CCGAACCT QLRHNAIQDRLARAI ATCGCGGAGAGTGCTCTGCAGGATGCCGTCAAGAAGCGAATCGCCAGG CTCCATGC PPPVGKVAVNSAIPG ACCCCCTCCAACCAAGATGTTGCCACCTACCTGAGCGGCTCGCTGGAAG WACTCTCC TDSQLRPDIVITNEDR GCGAATTTGGAAGAGACGGGGGAGACTTTGCTTCGCTCTGGACTCGTG CAGAGCAA KKIIMVDVTVPFENR CCCGCAATGCTACGCGACGACTGGAGAAGCGTATCGGCTGCCACTGGA CSAGAACC TPAFHDARARKVEKY CGTGGTGCGAGGAACGCCAGGAGCTGGGAGTCCTGGTGCCACAGGTG ATCCAGM APLAETLRAKGYQVQ AAGAACACAGATCACACTATCATCACTCCGAGAGCTAGAACCATGCTGG GAATCCGC THALIVGALGAWDPS AGAGGACCCTGAAGGATGCCATCCGCTGCCAATACGTGGAAAACCTGA TGATATGA NERVLRECGIGQRYA AACGGAAGCCGGACCAGGGCAAGGCATTCGAGGTGACGTGCAAGTGG CTGAAGCC RLMRQLMVSDAIRW GATGCCAGCAACCACTTCCTCCCCGGGGGCAGCTTCACCCGATTTGCCG MATCCAA SRDIYIEHITGHRQYQ ACTGGAGGTTCATCCACAGGGCCCGGCTCAACTGCGTCCCGCTGAACGG CAGAGGG EG(SEQIDNO: AGCCGTCCGCCACGGGAATCGGGACAAGCGATGCAGGAAGTGCGGCT AGAAGGA 1441) ATGCCAACGAGACGCTACCCCACGTCCTGTGTAGCTGCAAGCCCCATTC AAGGAGA CAGAGCCTGGCAGCTGCGACACAACGCCATCCAAGATCGCCTGGCCAG ATGACTGC AGCCATCCCGCCACCCGTGGGGAAGGTTGCCGTAAACTCCGCCATCCCC ATCTATCT GGAACCGACAGCCAACTGCGACCGGACATCGTCATCACCAACGAGGAC CCAGTATC CGGAAGAAGATCATCATGGTGGACGTCACAGTGCCCTTCGAGAACAGG CCCTCCCT ACCCCAGCCTTCCACGATGCCCGAGCTCGAAAGGTGGAGAAATATGCCC ACGGACAC CTCTGGCCGAAACCTTGAGAGCTAAGGGTTACCAGGTCCAGACACACGC GCTCCTCT GCTGATCGTCGGAGCCTTGGGCGCATGGGACCCCAGTAACGAGCGAGT KCCCCTTC GCTGAGAGAATGCGGAATCGGTCAACGCTACGCTCGGCTGATGCGGCA CGCTATCC ACTCATGGTGTCAGATGCCATCAGGTGGTCGAGGGACATCTACATAGAA GAGGGTTC CACATCACCGGACATCGGCAATACCAGGAGGGATGAGCTGGAGTGCCG CAGTACAT ATGAGAAGCGCAGTCGGGAAAGTAACTGAAATACTTTCCTCATGGATTG TGGCAGTC TATTTTCTAAATGGACAACCTACCTAATTCTCAATTACTGAGGGACAATC TCAGCAM TCCACTCATTGATATATTTTGCTTTCCACAACCAAATCTCTGTACAACTTT ACACCTCA TCATGAGTGATGTACCCGAGTACTTGGATTCTAATATCTAAACTGTATTG AGAGAATC TTAAATCTATTCACCTAAATTTGGGTTATTGCTGATTATGTACTCTATGTA CATASCAA TCATATGACTTTTAAAAACAAACTTTGTATTTGTGGATAATCTAAGCACT GCGGATCA ATACCCAGATGTACAGACACTCTTTTCCCAACCTATGTATTATATTTTTTT CCTTCCGG AACATTAGCTTTAATAAAATTTTTAAA(SEQIDNO:1073) TGTGCCCT CTSCGACC TGCCTTTC GAGACGC AGATGAAA TGTAAGTC TCATCAAG TCACCTGC AAAGGAC ATCTCGAA CTGGAAGA GTCCAACT TTACCAGT CTATGTTG CCGCCACC CCATCTCT GCTCCGAA AGCAGAA ACACCAC (SEQID NO:1196) NeSL Utopia- Phytophthora ACCGCCCAAGTCTCCACCGCAGCTGCGACTGCTGCTGCTCATCGAGCCG ACCGCCCA TAAGCT MQDMEEELLLDVEM 2_PCa capsici CAGTAGTCGCAGCTGTACAAGCACCACCTCCTACCGGACGCGCCGTCGT AGTCTCCA GGTCAT ETETTEPQTSTAXDA GGAGCACCACGCGCGCGCTGAGCCGCNTCAAGACCAAGAATTCCAGGC CCGCAGCT TTGACC TTTTDRPTRWGPHP TCGCGCGCGCGTGGCGCTTCAAGATGGCGACGCGACCAGCAGGAGTGG GCGACTGC GACAGG RAVAAAAIAQLVTGE CCGCTCTACTGACGGCGGAGACGAGGCCAAGACCGAGGACAGCGACGC TGCTGCTC GCACTA XAXPALPSRQDRRPA GGACGCAGCCAAGCGCCAAGCCGCTCAAGCAGCACCGAGACCCCATAC ATCGAGCC CCCAGG PRSHAPTRSRWGPR CCMGGACACCGCGCCGTACGACGGCCACAGGGACTCGGTGCTGGCTGT GCAGTAGT TAGGGA HQAVGAAAIASLATG GTACGCACACAACGCACCTGCATTCACCTGCGCGTTGTGTGTGTACACA CGCAGCTG ACCGCC LPASAAPVSRATKHG GCACGCAACTTCGCCGAGCTGACCAAGCATCGCCATGCAGCGCATCGTC TACAAGCA CTTCAG EGRRRLQTRWGPRV ACACCCGCTTCGTGGATCACTTCCACAGCGGGTGCACGTGTGGCATCGG CCACCTCC AAACCC SIPRAARRPGSRWGP CTTCCAGTCGCGCGCGGCAGCTACGCGACACGCTCAAGCCTGTGCAGAC TACCGGAC AGGAAG PRAAGASGQLPASAS AGCACACACGCCACCGTAGCTGCCTCGCGCGACCCGGCMNCCASTGCSS GCGCCGTC ACACAA GATGQLPEHVEAITT CCSGNGCCGGMAGCGASGAGGAGGACCNCGCACCCCCCGGTCCTCTSS GTGGAGC ACACCCT TPRVASDADEGPTPP TCGCGGCAGCATCTGCAGCTGCAGCAGCCGCATCAAGCGCCACTCCAGC ACCACGCG CCCTTTA DPWILRFDGACRRN TGCAGATACTGCCACCACGCAGAGCGCCGTGCCCATCGCTACTCCTGGG CGCGCTGA GTGACA PGPGGAGAALFKPS CCCCAGTACGCCCCCCACGTGCTGGAGCCACCTCCAGAGCTCCGAGTTT GCCGCNTC TACATAT GAVVWTCSHYMPSS CCGGCAAACGCCGMCGCCTCAACACGCCGATCGACCTGCAGCCGCTGG AAGACCAA TTTAGG NETNNTAEYTALLLG ACGTGGACGCGCTGATGCAGGACATGGAGGAGGAGCTGCTCCTGGAC GAATTCCA CTACATT VQSAVHHGATRLDIE GTGGAGATGGAGACGGAGACGACGGAACCGCAGACCTCGACGGCCAM GGCTCGCG TCAGTTA GDSSLVIAQVKGTFA GGACGCCACCACCACCACGGACAGGCCAACACGATGGGGCCCCCACCC CGCGCGTG CGGAGA CRNAKLRQLRNRVR CAGAGCAGTGGCGGCTGCAGCCATMGCGCAGCTGGTGACCGGCGAGS GCGCTTCA GGTTAC HALRSVEKYTLRHIDR CAGCAGMTCCAGCTCTGCCTTCGAGACAGGACCGCCGCCCAGCACCAC AGATGGC TAACTG KANAHADRLANRAL GATCCCACGCACCCACCCGGTCTCGATGGGGACCGCGCCATCAAGCTGT GACGCGAC GTAAAT DRRSSSSECEPHGSC CGGTGCCGCAGCCATCGCGTCGCTAGCTACAGGACTACCGGCCTCTGCG CAGCAGG AAAAAG MERCCGTDTTPAVQ GCACCCGTGTCTCGCGCAACCAAGCACGGGGAAGGACGACGCCGTCTC AGTGGCCG CACTTT GPTPQAAAAVPVQV CAGACACGATGGGGACCTCGGGTGAGCATCCCACGAGCCGCACGCCGT CTCTACTG (SEQID WPQWQRQTMVAW CCAGGATCGAGGTGGGGACCTCCACGCGCGGCAGGGGCCTCTGGCCAA ACGGCGG NO: TTSHGGRCRDCSTRC CTTCCTGCCAGCGCCTCCGGGGCCACTGGCCAACTTCCGGAGCATGTGG AGACGAG 1320) GRSLPSLAHRPRLSPR AGGCCATCACCACGACACCACGTGTGGCATCGGACGCTGATGAAGGAC GCCAAGAC RQPRLRLRQLSDEER CGACACCGCCGGATCCGTGGATACTCCGCTTCGACGGCGCATGCAGGC CGAGGAC DXAADALQELSDVM GTAACCCCGGACCGGGCGGTGCAGGTGCTGCATTGTTTAAGCCGAGTG AGCGACGC ASKIVDADSWDTGE GCGCCGTGGTGTGGACGTGCTCCCACTACATGCCAAGCAGCAACGAAA GGACGCA GYISSIPERIREVLQPY CCAACAACACCGCCGAGTACACGGCACTGCTGCTGGGCGTACAGAGCG GCCAAGCG TTRPPRPGHQQQQR CGGTCCACCACGGTGCCACTCGCCTAGACATCGAGGGCGACAGCAGCC CCAAGCCG RRPPRVTRNQREHRL TGGTGATTGCCCAGGTCAAGGGCACATTCGCGTGCAGGAACGCGAAAC CTCAAGCA DEALDDMQATQQA TCCGGCAGCTCCGCAACCGCGTCCGGCATGCGCTCCGCTCCGTGGAGAA GCACCGAG APRDQRAIHRARRRV GTACACCCTCCGCCACATCGACCGCAAGGCCAACGCCCACGCAGACCGT ACCCCATA GRVRASMAKQELRQ CTGGCCAACCGCGCGCTGGACCGGCGGTCGAGCAGCAGCGAGTGCGA CCCMGGA AFAKDESKCVSKILAG GCCACACGGCTCGTGCATGGAGCGATGCTGTGGCACCGACACGACACC CACCGCGC ASAETAAEEHVDECP AGCGGTGCAAGGACCGACACCGCAAGCAGCAGCAGCAGTGCCAGTGCA CGTACGAC IDAATLHAHFTGTNA GGTGTGGCCCCAGTGGCAGCGGCAGACGATGGTAGCGTGGACGACGA GGCCACAG PRTDFDYDAACGQEF GCCATGGAGGTCGATGCCGAGATTGCAGCACGAGATGCGGGCGAAGC GGACTCGG RGALDSMQPPTVAT CTTCCCAGTCTTGCCCATCGGCCCAGGCTCAGCCCCAGGCGGCAACCTC TGCTGGCT DAFEEELTIDEVEDQL GGCTTCGACTGCGCCAACTCAGCGACGAGGAACGCGACRMAGCAGCG GTGTACGC TRAAKTSSPGHDGIG GACGCCCTCCAGGAGCTTAGCGACGTCATGGCCAGCAAGATCGTTGAC ACACAACG YDIYSRFAAQLVPLLH GCAGACAGTTGGGACACCGGCGAAGGCTACATCAGCTCCATACCGGAG CACCTGCA AAYQFCWLHRRVPA CGCATCCGAGAGGTGCTGCAGCCGTACACCACGCGACCACCACGGCCT TTCACCTG LWKLGIVRLIHKKGD GGGCACCAGCAGCAGCAACGGCGACGGCCACCACGTGTGACACGCAAC CGCGTTGT PMQPTNWRPICLQP CAGCGCGAACACCGGCTAGACGAAGCGCTGGACGACATGCAAGCCACG GTGTGTAC AIYKIYSGLLARRLSR CAGCAAGCAGCACCGCGGGACCAACGGGCGATACATCGAGCGCGGCGT ACAGCACG WMEQNQRLPMAQ CGCGTCGGGCGTGTCCGGGCATCGATGGCCAAGCAAGAGCTACGGCAG CAACTTCG KGFRAFNGCHEHNF GCGTTCGCCAAGGACGAGTCCAAGTGTGTGTCCAAGATATTGGCCGGT CCGAGCTG VATTLLDQTRRSHRR GCCTCGGCGGAGACAGCGGCGGAAGAGCACGTCGACGAGTGCCCGAT ACCAAGCA LYQVWYDLRNAFGS TGACGCCGCCACGCTGCATGCCCACTTCACAGGCACCAATGCGCCACGG TCGCCATG LPQQLMWSVLRHLG ACGGACTTCGACTACGACGCTGCATGCGGTCAAGAATTCCGAGGAGCG CAGCGCAT VDASFIARCKNIYQDS CTGGACAGCATGCAGCCGCCGACCGTTGCGACCGACGCGTTCGAGGAA CGTCACAC AFVVANAVDGATDP GAGTTGACGATCGACGAGGTGGAGGACCAGCTTACACGCGCGGCCAAG CCGCTTCG VRQEVGVYQGCPLSP ACCTCCAGCCCGGGACATGACGGCATCGGCTACGACATCTACAGTCGCT TGGATCAC LLFISALVPLIRRLEKL TCGCCGCGCAGCTCGTGCCACTGCTCCACGCCGCGTACCAGTTCTGCTG TTCCACAG DGVGVPLAEGVRPC GCTGCATCGGCGCGTGCCAGCCCTGTGGAAGTTGGGCATCGTCCGCCTC CGGGTGCA ATAYADDIKVFSDSA ATCCACAAGAAGGGCGACCCGATGCAGCCAACGAACTGGCGACCGATC CGTGTGGC AGIRKCHDAVTRFLE TGTCTGCAGCCCGCGATTTACAAGATCTACAGCGGACTGCTGGCGCGTC ATCGGCTT WTGLRANPGKCASL GGCTGTCGCGCTGGATGGAGCAGAACCAGCGACTCCCCATGGCACAGA CCAGTCGC AVTTNARGNPVRDD AGGGGTTCCGAGCGTTCAATGGTTGCCACGAACACAATTTCGTGGCAAC GCGCGGC GVHLELQGEVIAPLSL CACGCTCCTGGATCAGACGCGCCGGTCGCACCGCAGGCTGTACCAAGTC AGCTACGC HDSYRYLGVGDGFD TGGTACGACCTGCGGAACGCGTTCGGGTCACTGCCGCAGCAGCTCATGT GACACGCT HVRHRLQLEPKLQQI GGAGCGTGCTCCGCCACCTCGGCGTCGACGCAAGCTTCATCGCCCGGTG CAAGCCTG KREAVALMQSGLAG CAAGAACATCTACCAAGACTCGGCTTTCGTGGTGGCGAACGCGGTGGA TGCAGACA WQVVKALKTFVYPK CGGCGCGACGGATCCTGTCCGCCAGGAGGTGGGCGTGTACCAAGGTTG GCACACAC VEYALRHLRPLQSQL TCCCCTGAGCCCCCTCCTCTTCATCTCGGCGCTTGTGCCGCTCATTCGCCG GCCACCGT QGFDRAVVRGLRHLL CCTCGAGAAGCTCGACGGCGTAGGGGTGCCGTTGGCCGAGGGTGTACG AGCTGCCT RLPQSATTEFFYTPTS ACCGTGTGCCACCGCCTACGCCGACGACATCAAGGTGTTCAGTGACAGC CGCGCGAC GGGLGLQSLVEMHQ GCCGCTGGCATCCGTAAATGCCACGACGCTGTCACTCGTTTCCTCGAGT CCGGCMN ALQVAHAWQMLHS GGACAGGCCTGCGGGCAAACCCCGGCAAGTGTGCCAGCCTAGCGGTGA CCASTGCS KDAAVVAVAKEQVC CGACAAACGCCCGCGGCAACCCGGTCCGCGACGACGGCGTGCACCTAG SCCSGNGC QVARKRYRLQEEHW AGCTGCAAGGCGAAGTCATCGCCCCCCTCAGTCTCCACGATAGCTATCG CGGMAGC RGRGDELVRLFLNSE CTACCTGGGTGTGGGCGATGGCTTCGACCATGTGCGCCACCGCCTCCAG GASGAGG LAASPFADCLRRNGD CTTGAACCCAAGCTCCAACAGATCAAGCGAGAGGCGGTGGCGCTGATG AGGACCN IGSLWTDVQRTLRLH CAGTCCGGACTGGCGGGGTGGCAGGTGGTGAAGGCGCTCAAGACCTTC CGCACCCC HLSLTAQDDRDGQD GTGTACCCCAAGGTCGAGTACGCGCTGCGCCACCTACGACCACTCCAGT CCGGTCCT PLALRVPHHTKWLD CCCAGCTCCAAGGCTTCGACCGCGCCGTCGTCCGTGGCCTGCGCCACCT CTSSTCGC HKTVLRHVKLHMKIR ACTGCGCCTCCCCCAGTCCGCCACCACCGAGTTCTTCTACACACCAACGT GGCAGCAT HQTRWKGLVDQGKT CTGGAGGTGGGCTGGGGCTGCAGTCCCTCGTGGAGATGCATCAGGCGC CTGCAGCT VRVHGGLGAKFVST TGCAGGTGGCGCATGCCTGGCAGATGCTCCATTCCAAGGACGCGGCCG GCAGCAGC GAGLSDDAYRFGVK TCGTGGCCGTTGCGAAGGAGCAGGTCTGCCAGGTAGCACGCAAGCGCT CGCATCAA ARLNQVDTNAVLKR ACCGCCTGCAGGAAGAACATTGGCGCGGGCGTGGCGACGAGCTCGTGA GCGCCACT KRLRSSKTCRDPTCSS GGCTCTTCCTCAACTCCGAGCTCGCCGCCTCGCCCTTCGCTGACTGCCTC CCAGCTGC AETLAHALNHCASN CGCCGCAACGGCGACATCGGATCCCTCTGGACGGACGTGCAGCGCACG AGATACTG MDAIRQRHDDALEQ CTCCGCCTCCATCACCTCAGCCTGACGGCTCAAGACGACAGAGATGGCC CCACCACG IGSKIRGALERAKSTT AAGACCCGCTTGCCCTTCGCGTGCCGCACCACACCAAGTGGCTCGACCA CAGAGCGC ELRLNQTVPEYTGAA CAAGACTGTGCTGCGGCACGTGAAGTTGCACATGAAGATCCGCCACCA CGTGCCCA LRPDIVLRNVAAKKM GACCCGATGGAAGGGGCTAGTGGACCAGGGCAAGACGGTACGCGTGC TCGCTACT VIADLAVTFEDHAAG ACGGTGGCCTCGGTGCGAAGTTCGTGTCGACCGGGGGGGGCTGTCGG CCTGGGCC ARHSSLQLSHDHKTL ACGACGCATACCGGTTCGGCGTGAAGGCCCGCCTCAACCAGGTCGACA CCAGTACG KYQPIVAELRVQGW CGAACGCCGTGCTCAAGCGCAAGCGGCTGCGGTCCAGCAAGACCTGCA CCCCCCAC QVQTAAIVYGSLGSV GAGACCCGACCTGCTCCTCGGCCGAGACGCTGGCTCATGCTCTCAATCA GTGCTGGA QPSNFKTYTEKLKLHK CTGCGCGTCCAACATGGACGCCATCCGCCAGCGTCATGACGACGCCTTA GCCACCTC REARQLDLQLSSHCI GAGCAGATCGGGTCCAAGATCCGCGGCGCGCTTGAGCGAGCCAAGTCG CAGAGCTC QASHRIWGWHCRR ACCACAGAGCTGCGCCTGAACCAGACGGTGCCTGAGTATACAGGCGCA CGAGTTTC HREGQRSGNTSRAS GCTCTGCGACCCGACATCGTCCTGCGGAACGTGGCCGCCAAGAAGATG CGGCAAAC RGSGGTPRRTSQVRA GTGATCGCCGACTTGGCAGTCACGTTCGAGGACCACGCGGCCGGCGCT GCCGMCG RR(SEQIDNO: CGTCACTCCTCGCTGCAGCTCAGCCACGATCACAAGACACTCAAGTACC CCTCAACA 1442) AGCCAATCGTGGCAGAGCTGCGGGTGCAGGGGTGGCAGGTGCAGACC CGCCGATC GCGGCGATTGTGTACGGGTCGCTGGGCTCGGTGCAGCCCAGCAACTTC GACCTGCA AAGACGTACACCGAGAAGTTGAAGTTGCACAAGCGCGAGGCTCGACAG GCCGCTGG CTGGATCTTCAACTGTCGAGCCACTGCATTCAAGCCAGCCACCGCATCTG ACGTGGAC GGGCTGGCATTGCCGGCGCCATCGCGAAGGACAACGAAGCGGCAATAC GCGCTG CTCGCGAGCATCGCGTGGGTCTGGGGGGACCCCGCGGCGCACATCGCA (SEQID GGTAAGGGCACGGCGATAAGCTGGTCATTTGACCGACAGGGCACTACC NO:1197) CAGGTAGGGAACCGCCCTTCAGAAACCCAGGAAGACACAAACACCCTC CCTTTAGTGACATACATATTTTAGGCTACATTTCAGTTACGGAGAGGTTA CTAACTGGTAAATAAAAAGCACTTT(SEQIDNO:1074) NeSL Utopia- Phytophthora GCTCGGCCTCGCGGCTGCCTTCCCAGGCGCCGCCGACTTCGCGCTCTGG GCTCGGCC TAGGCG MLADPAALAAGLAR 2_PI infestans CGCGGCCCACACGCCGCCGCCGAGCCTCCAAGCGCGCCCGTTGGCTTCC TCGCGGCT GAAACC APPPPSAPQDPSPAF GCAGACGCAGGGCTCGCGGCGACGGCCCTCGCCAGCCCCCAAGACCCC GCCTTCCC AGGCCC PAGPAGQNPRAAAP CCCTACGATGTGGCACCACCCGGCAGGGCGGCCGGCAGGCTGCCCGAC AGGCGCC AAGACG ARVEVHTVVAPPGR TCGGTATCGCCGGGTGCTACACTCTCAGCCGCCACAGCTCGGGCCTTGG GCCGACTT GCCGAC AGGMLPDPGLVDSS CGGTCCGCCATTGGCCCTTGGAGCTCGACAGCGACAGCAGCGACGACG CGCGCTCT AGGGCC PAAATAATPAPVAAT AGGACGCTCAAGACCCCCACGCCGCCGCCCCAGGACCCCCCGCAGACGT GGCGCGG CCACCC ATTARAAARVAVEH CGCGAGTGTGCTTGCCCCACCCGGCAAGGCAGGCAGCATGCTCGCCGA CCCACACG AGGTAG HAHAEPNQEHLPMA CCCAGCAGCGCTGGCAGCCGGTCTGGCTCGCGCTCCGCCGCCCCCATCG CCGCCGCC GGAACC RVLVEPMQVDECSSC GCACCCCAGGACCCAAGCCCCGCCTTCCCTGCTGGCCCCGCCGGCCAGA GAGCCTCC GCCCTA DRSTLTADDGSGDD ACCCTCGCGCCGCCGCCCCAGCCCGTGTTGAAGTCCACACAGTCGTGGC AAGCGCGC GAAACC VAAPSSMLSNDVAA ACCACCCGGCAGGGCGGGCGGCATGCTGCCCGACCCGGGGTTGGTTGA CCGTTGGC CATTTCG PMDVDSGTSCPPTL CTCAAGCCCAGCAGCGGCTACCGCAGCTACTCCAGCTCCGGTAGCTGCT TTCCGCAG GTGGTC QQPLQRPRALHVGS ACTGCTACAACTGCTCGCGCTGCTGCTCGCGTCGCCGTGGAGCACCACG ACGCAGG GACTCG KRRRLDADDGEEAH CGCACGCTGAGCCGAACCAAGAACATCTACCGATGGCTCGCGTGCTCGT GCTCGCGG AAGGCC QLQEEEEAGIHAPAL GGAGCCCATGCAGGTCGACGAGTGCAGCAGCTGCGACCGCAGCACTTT CGACGGCC TTACCTA RLSAASAQPASVLAV GACAGCCGACGACGGGAGCGGTGACGACGTTGCAGCCCCAAGCTCCAT CTCGCCAG TTTTTTC YTHNASRFDCTLCAY GCTCAGCAACGACGTCGCCGCCCCCATGGACGTTGACTCCGGCACCAGT CCCCCAAG CTTAGA TAGSFASLLTHRNSR TGCCCTCCAACGTTGCAGCAGCCGCTCCAGCGACCGCGTGCGCTGCACG ACCCCCCC CATTCAA HRRTAFLDRFSAGCA TAGGCAGCAAGCGCCGCCGACTCGACGCAGATGACGGTGAAGAAGCTC TACGATGT TTAGGT CGVPFASRLAAARHA ACCAGCTGCAAGAAGAGGAGGAGGCTGGCATTCATGCCCCAGCGCTCA GGCACCAC AGCGAC QACASLSSAPSAEASS GGCTATCCGCGGCCAGCGCTCAACCGGCCTCCGTCCTCGCCGTGTACAC CCGGCAG CAAATT AAGTSSPTADGADST GCACAACGCGTCGCGGTTCGACTGCACGCTGTGCGCGTACACGGCCGG GGCGGCC ACAAAT VSAVAHAEPGLPHH CAGCTTCGCGTCACTTCTGACGCACAGGAACTCTCGGCACCGCCGCACC GGCAGGCT TTGGTA NDTELTASPPLVSSSD GCATTCCTCGACCGATTCTCGGCGGGTTGCGCGTGTGGCGTGCCATTCG GCCCGACT ACGAGT VEVQATKTEATDNR CCTCGAGGCTGGCTGCAGCAAGACACGCGCAAGCGTGCGCCAGCCTCT CGGTATCG AAGCCA WGAPLPRVLVASRIA CGAGCGCACCATCGGCCGAGGCTTCGTCGGCAGCAGGAACATCAAGCC CCGGGTGC AATGGT GRLAQVPPPRWGPP CCACTGCCGACGGAGCCGACTCCACCGTCTCAGCCGTGGCTCACGCTGA TACACTCT AATACA LPRTTIAARIATRLAA GCCCGGCCTGCCCCACCACAACGACACGGAGCTCACAGCGTCCCCCCCG CAGCCGCC CAAAAC TPAPRWDPPLPRSLV CTGGTGAGCTCCTCCGATGTTGAAGTGCAGGCAACAAAGACCGAAGCC ACAGCTCG TTTTCTG VSRIAARLLPALPDAP ACCGACAACCGCTGGGGCGCCCCGCTTCCGCGCGTGCTGGTCGCGTCTC GGCCTTGG TTCTAAT ACEEEAKDSDTMDW GAATCGCCGGGCGCCTCGCTCAGGTGCCACCCCCACGCTGGGGCCCCCC CGGTCCGC CAGTGT APTWTNEETKESEPH GCTGCCGCGCACCACCATCGCAGCCCGCATTGCGACGAGGCTCGCAGCC CATTGGCC GAAAAC DEAPGQVDEETIDDA ACGCCTGCCCCCCGCTGGGACCCACCGCTGCCACGAAGCCTGGTTGTGT CTTGGAGC TGGTTTT DGEWLLRFDGACRA CACGCATCGCCGCCCGCCTGCTACCCGCGCTACCAGACGCGCCTGCATG TCGACAGC CGCCTTT NPGPGGAGAALFKP CGAGGAAGAGGCCAAGGACAGCGACACGATGGACTGGGCGCCAACTT GACAGCA TGGCGG SGPVVWTCSHYDPS GGACAAACGAGGAGACCAAGGAAAGCGAGCCACATGACGAAGCGCCA GCGACGAC ACTTTTT TTATNNTAEYTALLL GGGCAGGTCGACGAGGAGACCATCGACGACGCCGACGGCGAGTGGCT GAGGACG CACTCG GARAAADHGVTKLRI TTTGCGCTTCGACGGTGCTTGCCGTGCAAACCCAGGCCCAGGCGGCGCC CTCAAGAC CATTTTT EGDSTLVIQQVRGIF GGCGCAGCTCTCTTCAAGCCCAGCGGCCCCGTAGTGTGGACTTGTTCCC CCCCACGC GGGCAA ATRSTRLRALRNKVKL ACTACGACCCGAGCACCACGGCAACGAACAACACAGCAGAGTACACTG CGCCGCCC TCGTCT ELARVGSFSLHHIDR CGTTGCTGCTCGGGGCCAGGGCCGCCGCCGACCACGGCGTCACGAAGC CAGGACCC GCGGCT QANGHADRLANAGL TGCGCATCGAGGGCGACAGCACGCTGGTCATCCAGCAAGTGCGGGGCA CCCGCAGA AGCTTG DRRRTKLECSVHPDG TCTTCGCGACGAGAAGCACGCGGCTCCGAGCACTGCGCAACAAGGTCA CGTCGCGA CTAGCG RGCTNTSVATAAPTA AGCTCGAGCTCGCGCGCGTGGGGAGCTTCTCGCTCCACCACATCGACCG GTGTGCTT GCGGAC PAAPLPSARPPASTA GCAGGCGAACGGTCACGCCGACCGCCTTGCCAACGCCGGCCTCGACCG GCCCCACC GAGCGG APSPDDDHSDQGDI ACGCCGCACCAAGCTCGAGTGCAGTGTGCACCCCGATGGCAGGGGCTG CGGCAAG TCTCCG DDGEVYAAMCISPD CACCAACACCTCCGTGGCGACCGCAGCGCCTACTGCTCCTGCAGCGCCC GCAGGCA GGGGCG AVPHRRPRLRLRRLT TTGCCCTCAGCCCGGCCTCCTGCATCTACGGCCGCTCCGTCTCCTGATGA GC(SEQID TTCACCT DEESEEAGNVVERLA CGACCACAGCGACCAAGGCGACATCGACGACGGCGAGGTGTACGCCGC NO:1198) TTCCCCC ASLAAKIADAPDWET AATGTGCATCAGCCCCGATGCTGTGCCGCACCGCCGACCGCGCCTCCGC GCGAGG AEGYITALPYALYDKL CTACGGCGCTTGACGGACGAAGAGTCCGAGGAGGCTGGCAACGTCGTG CCAACT QPYSQSQHQPPRQQ GAGAGGTTGGCAGCGTCGCTGGCGGCGAAGATTGCAGATGCGCCAGA ACACCG QQQQRQRPRQQQQ CTGGGAGACGGCGGAGGGCTACATCACGGCCCTACCGTACGCCCTCTA ATCTTCT TRQRRQRRCKRGGG CGACAAGCTGCAGCCGTACTCCCAGTCCCAGCACCAACCTCCACGTCAA CTACACT SQHRQRKTRRRRPPR CAGCAGCAGCAGCAGCGGCAACGCCCACGGCAGCAGCAGCAAACACG TTTCTAA VTRHHREHRIDEALD GCAGCGTCGACAGCGGAGATGCAAGCGCGGCGGTGGCAGCCAACACA TTCGCCT DLHALESRRPQDRTA GGCAGCGGAAGACGCGCCGCCGACGGCCCCCACGGGTGACACGCCACC CCGTCTT ISKARRRVGRIRSALD ACCGCGAGCACCGGATCGATGAAGCCCTCGACGACCTCCACGCGCTCG CGGTCT QHQLRHRFDTDEKA AGAGCAGGCGCCCTCAGGACAGGACGGCCATCAGCAAGGCCAGGCGTC TCGGCT CVDGILAAARDKDRA GAGTCGGTCGCATCCGCTCCGCCCTTGACCAGCACCAATTGCGCCACCG GTCGGA ASVTTTAQTAAPPHS CTTCGACACCGACGAGAAGGCCTGCGTGGACGGCATCCTGGCCGCCGC TTTTTTT APASAPSSAVDDGIC GCGCGACAAGGACAGGGCGGCATCAGTGACGACAACAGCGCAAACAG CTTTTTG PIPGDLLHAFFTDVN CGGCTCCGCCTCATTCTGCACCAGCTTCAGCTCCTTCGTCGGCGGTGGAC ACCAAT TPRTEFDADSPIGARF GACGGAATCTGCCCGATTCCGGGAGACTTGCTCCATGCCTTCTTCACAG CAGAGC REALAQLPAAIAATEL ACGTGAACACGCCGCGCACGGAGTTCGACGCCGACTCGCCCATCGGAG GCGCCA LMEPPSPDEVEDQL CCCGCTTTCGGGAGGCGCTTGCTCAGCTGCCCGCTGCCATCGCTGCCAC TGCGAC QRVRGTSSPGLDGV GGAGCTACTCATGGAGCCGCCGTCGCCGGACGAGGTTGAGGACCAGTT TCTTCTG GYDVYKTFTQQLLPA GCAACGTGTACGGGGTACGTCCAGCCCCGGCTTGGACGGAGTGGGGTA GCCAAT LHAAFSRCWTDQRV CGACGTCTACAAGACCTTCACACAGCAACTCCTACCGGCCCTGCACGCC CAGAGA PQSWKLGVVRLLFKK GCTTTCTCCAGGTGTTGGACTGATCAGCGCGTCCCACAGAGCTGGAAGC CCGGGC GDRQDPANWRPICL TCGGAGTCGTGCGGCTCCTGTTCAAGAAAGGAGATCGGCAGGACCCCG CCTGTCC QQAVYKLYAGILAHR CCAACTGGAGGCCGATCTGCCTCCAGCAAGCCGTCTACAAGCTGTACGC TCGGAC FTRWLDANTRHADA CGGTATCCTGGCCCACCGGTTCACACGCTGGCTCGACGCCAACACCCGG AGCGAG QKGFRAVNGCGEHN CACGCTGACGCACAGAAGGGATTCCGCGCGGTTAACGGCTGCGGGGAA GCCTCC FLAATLTDNARRRRR CACAATTTCCTCGCAGCCACGCTGACGGACAACGCCCGCCGCCGACGCC ACGGCC ELHVVWYDIKNAFGS GCGAGCTGCACGTGGTCTGGTACGACATCAAGAACGCGTTCGGCAGTG AGCCAA VPHELLWEVLRRMG TGCCCCACGAACTGCTGTGGGAGGTGCTGAGGCGCATGGGAGTGCCGG TCAAGT VPAQFIACCQGIYDA CGCAGTTCATCGCGTGCTGCCAAGGGATCTACGACGCGGCAGCTTTCAC CTCGGC AAFTVGNAADGTTA GGTCGGGAACGCAGCAGACGGCACGACGGCCCCAATTCAGCTGCGCTT AGCGAC PIQLRLGVFQGCPLSP GGGCGTGTTTCAGGGATGTCCCTTGAGCCCCCACCTCTTCACGGCGGTG GCGTCT HLFTAVISPLLHALKR ATCAGCCCACTCCTCCACGCACTCAAGCGGCTCCCTGGCACGGGGGTAC TTCTATA LPGTGVQLSAVDRPG AGCTCTCAGCCGTCGACCGGCCAGGTGCGTCAGCATATGCCGACGACCT GCGCAG ASAYADDLKVFSDTK CAAGGTCTTCAGTGACACCAAGGATGGCATCACGCGGCAGCACCAGCT CTGACG DGITRQHQLVTDFLR CGTCACCGACTTCCTGCGCTGGACAGGCATGGTGGCAAACCCGTCCAAG AGGCCG WTGMVANPSKCST TGCAGCACCATGTCTGTCCAGAGGGACAACCGCGGCGTTTTGAAGACC ATCTGG MSVQRDNRGVLKTA GCCAACCTCACGCTCCAGCTGGACGGCGCCCAAATCCCAGCACTCGGGA CGGCCC NLTLQLDGAQIPALG TGACCGAGGCGTACGCGTACCTGGGCATAGGCGACGGATTCGACCACG CCGATT MTEAYAYLGIGDGFD TACGCCGCCGCGTCGAGTTGGCCCCCAAGCTCCGCGAACTGAAGGCAG GGTCCG HVRRRVELAPKLREL ACACGACGGCGCTGATGCAGTCGGGGCTGGCGCCGTGGCAAGTIGTGA ACTTTCG KADTTALMQSGLAP AGGCCCTCAAGGTTTACATCTACCCACGCGTGGAATATGCCCTCCGCCA GCCAAT WQVVKALKVYIYPRV CCTCCGACCGTTTCAGCAGCAGCTCCAAGGCTTTGACCGACACCTCGCG CAGCGA EYALRHLRPFQQQLQ CGCGGGCTGCGTCACCTTCTCCGCCTGCCAACATCAGCCACCACCGAGT CGACGA GFDRHLARGLRHLLR TCCTGTATGCACCAACGTCGCGAGGAGGTTTGGGGCTGCTACCGCTTAC GGGGGC LPTSATTEFLYAPTSR AGAGGTTCACGGTGCCCTTCAGATTGCGCACGCCTGGCAGACCCTGCAC AGGGGT GGLGLLPLTEVHGAL TCGCCGGACCCCGCCATCCGCCGCATTGCCCGCGTGCAGCTTCGCCAAG TTACACT QIAHAWQTLHSPDP TCGCCGACGCCCGGCACCGCTTGGACGCCGAGCACTGGAAGGAGCGGG TTTGCCC AIRRIARVQLRQVAD GGGAGGAGCTGTGCGAGCGGCTGCTAAACTCGCAGCTCGGGACATCGG CCGTTTC ARHRLDAEHWKERG CCCACGCGCCTCCCAAACGACGCAATTGCGACATCGGCTCGCTGTGGGT GACTTC EELCERLLNSQLGTSA CGATGTCCAGCGCCACCTCCGCTCTCTCGGCCTACAGCTACAGACAGCC AACTTCA HAPPKRRNCDIGSLW CCCGCGGACACGCACACTGGCGCACCAGCTCAGCCTCTCCAGCTTCGCG GGCCAA VDVQRHLRSLGLQLQ TGCCGCACCACGACAAGTGGCTGACGCACAAGGACGTGCTGCGGCACG AATGGC TAPADTHTGAPAQPL TGAAGCTGCACATCAAGAATAACCACTGGCACCGCTGGACATCCATGCG GATTTC QLRVPHHDKWLTHK GGACCAAGGGAAGACGGCCCGCGCCCACGGCGGTGAAGGCAGCGGCT GACCCT DVLRHVKLHIKNNH TCCTTACGCAGCCGCGAGGCATGTGGGAAGCCGACTACCGGTTCGCGG CCACGC WHRWTSMRDQGKT TGGCGGGTCGTCTTAACCAAGTGGACACGTACAGCGTACTGAAGCGCC GCCGTG ARAHGGEGSGFLTQ GGCGCCTGCGCTCGCACGACAGATGCCGCCAGCCAGGATGCCATCGTG CCACTG PRGMWEADYRFAV CCGAAACACTGGCTCACGTGCTAAACCACTGTCCCGGCACGATGGACGC CTCGGC AGRLNQVDTYSVLKR GGTCCGAGGGCGGCACGACGGAGCGCTCAAGCGCATCGAGCGTGAGC ACCGGC RRLRSHDRCRQPGC TCCACGCGTCGGCCACGGACCGCCGGGACCGCGTCGAGCTCCGAGTGA GGCGAT HRAETLAHVLNHCPG ACCAGACCGTGCCGTCACTCGCCGGCCCCGCGTTACGGCCCGATCTCCA TCAGCG TMDAVRGRHDGALK GCTCTACAACCACACCAAGAAGACGGTGGCGGTGGTCGACCTGGCCGT GGTGCA RIERELHASATDRRD GGCGTTTGAGGAGCAGGCGAGTGACGACGCGAGCAGCTCGGCGCTGT ACTTCG RVELRVNQTVPSLAG CCCTTATCGCCAGCCACAAGCGAGCCAAGTACGACCGCATCAAGCGACA GGCACG PALRPDLQLYNHTKK CCTCGAGCGCCAAGGATGGAAGGTACACCTCTCGGCGCTCGTGTACGG TGTGCA TVAVVDLAVAFEEQA GTCGCTTGGGGCGGTCGCTAGTGGCAACTACCAGGTGTACACCACACAC ACACAT SDDASSSALSLIASHK CTGGGGCTACTCAAGCGCGATGCAAAGCGGCTGGACCGGCAGCTGTCT GCAGCG RAKYDRIKRHLERQG GTCGAATGCATCCAGTCCAGCCGCCGCATCTGGAATCTACACTGCAGCC CCCATTG WKVHLSALVYGSLGA AGCACCGGACTCGCCAACACCAGGCGAGGCCCAGCCAAGGACCAAGAG CACGCC VASGNYQVYTTHLGL GCAGCCGGGCGACGGAGACCGGGGGGACTCCGTCCCAGACAAGCCGC AAGCGG LKRDAKRLDRQLSVE CGCTAGGCGGAAACCAGGCCCAAGACGGCCGACAGGGCCCCACCCAGG CATCGC CIQSSRRIWNLHCSQ TAGGGAACCGCCCTAGAAACCCATTTCGGTGGTCGACTCGAAGGCCTTA GGGACG HRTRQHQARPSQGP CCTATTTTTTCCTTAGACATTCAATTAGGTAGCGACCAAATTACAAATTTG ACGCCT RGSRATETGGTPSQT GTAACGAGTAAGCCAAATGGTAATACACAAAACTTTTCTGTTCTAATCAG CGGCCG SRR(SEQIDNO: TGTGAAAACTGGTTTTCGCCTTTTGGCGGACTTTTTCACTCGCATTTTTG CTCAAG 1443) GGCAATCGTCTGCGGCTAGCTTGCTAGCGGCGGACGAGCGGTCTCCGG CGCAGC GGGCGTTCACCTTTCCCCCGCGAGGCCAACTACACCGATCTTCTCTACAC CCCGCC TTTTCTAATTCGCCTCCGTCTTCGGTCTTCGGCTGTCGGATTTTTTTCTTTT CTTCCAG TGACCAATCAGAGCGCGCCATGCGACTCTTCTGGCCAATCAGAGACCGG CACGAC GCCCTGTCCTCGGACAGCGAGGCCTCCACGGCCAGCCAATCAAGTCTCG CTCGCG GCAGCGACGCGTCTTTCTATAGCGCAGCTGACGAGGCCGATCTGGCGG CCGTTT CCCCCGATTGGTCCGACTTTCGGCCAATCAGCGACGACGAGGGGGCAG GGCGGA GGGTTTACACTTTTGCCCCCGTTTCGACTTCAACTTCAGGCCAAAATGGC TCGCCAT GATTTCGACCCTCCACGCGCCGTGCCACTGCTCGGCACCGGCGGCGATT CAAGAC CAGCGGGTGCAACTTCGGGCACGTGTGCAACACATGCAGCGCCCATTG GTGCGA CACGCCAAGCGGCATCGCGGGACGACGCCTCGGCCGCTCAAGCGCAGC GAGCCA CCCGCCCTTCCAGCACGACCTCGCGCCGTTTGGCGGATCGCCATCAAGA GGCGGG CGTGCGAGAGCCAGGCGGGGTCGGGCAAAATATACTTACTCTAAGTAT GTCGGG GCCCGAATCCCTGCCCTCTCAGGCTGAACGCGGCCCCATACTTGATCTAA CAAAAT GTATGGGAGGGATCCCTGGCCTCTCAGGCTGTACGCGAGACCCGTACG ATACTTA GCCGAATCCCCTGGCCTCTCAGCCTGTACGCGGGGC(SEQIDNO: CTCTAA 1075) GTATGC CCGAAT CCCTGCC CTCTCAG GCTGAA CGCGGC CCCATAC TTGATCT AAGTAT GGGAGG GATCCCT GGCCTC TCAGGC TGTACG CGAGAC CCGTAC GGCCGA ATCCCCT GGCCTC TCAGCCT GTACGC GGGGC (SEQID NO: 1321) NeSL Utopia- Phytophthora TCAAGCCCCGCCGCCAAGCCCAGCTGCGGCTGTTGCCGCCCCTCCAGCA TCAAGCCC TGAGCA MSGDVVSSDGSSRT 2_PR ramorum GCAGCAGTCGCAGCTGAACCTACAGCCCCTCCTGCTGATCGCGCCGCCG CGCCGCCA CCTTGG TDASGDGDDGAGSS TGGAGCCCCGCGCGCGCGCCGAGCCGCCCCAAGAACAAGCACCCCCAG AGCCCAGC GTTGCT DAAGDVGVVAMDV CTAGCGCGCGCGTGGAGCCCATGTCGGGCGACGTGGTCAGCAGCGACG TGCGGCTG CAAGCG DQGARRQQPPWQR GCAGCAGCAGGACGACGGACGCGAGCGGCGACGGCGACGACGGCGCA TTGCCGCC TGATGC VGGKRRRLNDVDDE GGCAGCAGCGATGCAGCAGGCGACGTCGGCGTCGTCGCCATGGACGTT CCTCCAGC GAAGCG DTRELAELLLEEEDEA GACCAAGGTGCCCGCCGACAGCAGCCGCCATGGCAGCGCGTCGGCGGC AGCAGCA GCTGGA GDHAPAPRLSAASAR AAGCGTCGACGTCTCAACGACGTCGACGACGAAGACACCCGCGAGCTG GTCGCAGC CCGGCA PASVLSVYAHNAQRF GCCGAGCTCCTGCTCGAGGAGGAGGACGAGGCTGGGGACCACGCTCCA TGAACCTA GCTCTC QCTLCTYTAASFASLK GCGCCTAGGCTATCCGCGGCCAGCGCTCGCCCGGCCTCCGTCCTCTCCG CAGCCCCT GGTGGC RHRDSRHRRTAFLDR TGTACGCGCACAATGCGCAGCGCTTCCAGTGCACGCTGTGCACGTACAC CCTGCTGA GTGCAT FSAGCACGAPFASRL GGCCGCCAGCTTCGCGTCGCTCAAGCGACACCGGGACTCCCGGCACCG TCGCGCCG CCAGTC AAANHAHACASLNR CCGCACCGCCTTCCTCGACCGGTTCTCGGCGGGATGTGCGTGCGGTGCG CCGTGGAG CAGCCG TLSVAATPAAGELSPT CCCTTCGCGTCGAGGCTGGCTGCTGCGAACCACGCACACGCGTGCGCCA CCCCGCGC CCGCAT AGAANATVKAATVT GCCTCAACCGCACCTTATCGGTAGCGGCTACGCCGGCAGCAGGGGAAC GCGCGCCG CTGGAA PDSPRQDPPELAASP TGAGCCCCACTGCTGGTGCAGCCAACGCCACCGTCAAGGCGGCCACCGT AGCCGCCC CCTGCA PLASSPDVAVQAAD CACGCCCGACTCCCCCCGCCAAGACCCCCCGGAGCTCGCAGCGTCCCCC CAAGAACA CTGCAG MQAPTRWDPPLPRT CCGCTAGCGAGCTCCCCCGACGTTGCTGTGCAGGCCGCCGACATGCAG AGCACCCC CCAGCA LVATRVASRLTDLTPP GCCCCGACTCGTTGGGACCCGCCCCTCCCCCGCACGCTGGTGGCGACAC CAGCTAGC CCGCGC RWGPPLPRATVVSRI GGGTCGCCTCCCGCCTCACCGACCTGACACCGCCGCGGTGGGGACCGC GCGCGCGT GCGCCA AARLEAAPTPRWGP CGCTGCCTCGAGCCACGGTGGTGTCTCGCATCGCGGCGAGACTCGAGG GGAGCCC GCACCA PLPRVVVASRIAERLA CCGCACCAACGCCCCGTTGGGGCCCTCCACTTCCACGCGTCGTGGTTGC (SEQID AGCACC PPELAADDETKDGEE GTCCMGGATCGCCGAACGCTTAGCTCCGCCCGAGCTGGCAGCTGACGA NO:1199) AGGGGG DQSFTEPVAAARSXG CGAGACCAAGGACGGCGAGGAAGACCAGAGCTTTACGGAACCTGTGGC AAGTCG GEDANGEWLLRFDG TGCCGCCCGGTCMGAKGGCGGCGAGGACGCGAACGGCGAGTGGCTGC GGCGGC ACRANPGPGGAGAA TTCGCTTCGACGGCGCCTGCCGAGCCAACCCAGGCCCCGGCGGTGCGG GGAGAC LFKPSGPVVWTCSHY GCGCAGCGCTGTTCAAGCCCAGCGGCCCGGTGGTGTGGACCTGCTCCC CGGGGG MPSSSETNNTAEYTA ACTATATGCCGAGCAGCAGCGAGACCAACAACACGGCCGAGTACACGG GACTCC LLLGMRAAADHGAT CTCTGCTGCTCGGCATGCGGGCTGCGGCGGACCATGGCGCTACGCGCG GCCGCG RVHVEGDSTLVIQQV TGCACGTCGAAGGTGACAGCACCCTGGTGATCCAGCAAGTGCGGGGCA CACCGG RGIFATRSTRLRGLRK TCTTCGCTACCCGTAGCACCCGGCTCCGTGGACTGCGCAAGAGCGTGAA CCGCCG SVKAEMARMEHVTL GGCGGAGATGGCGCGGATGGAGCATGTCACGCTCCACCACATCGACCG CTAGAC HHIDRQANGHADRL CCAAGCCAACGGCCATGCCGACCGCCTCGCCAACGCCGCCCTCGACCGC GGCACA ANAALDRRKTKLECG CGCAAGACCAAGCTGGAGTGCGGACTCCACCCTGACGGCCAAGGCTGC CAGGCC LHPDGQGCSSTAATT TCCAGCACCGCCGCGACGACGGCGGTCCCCTCTGTGGTGCCGGACCGG CACAGC AVPSVVPDRPPSSTA CCTCCCTCGTCAACCGCTGCCGCGCCAACGCCAAGCGCCGAACCCGACG GGCCGA AAPTPSAEPDETEQG AGACCGAGCAGGGCGACATCGACGACGGCGAGGTCTACGCCGCCATGT CAGGGC DIDDGEVYAAMCIGP GCATTGGCCCGGACTCCATACCGGAGCGCCGACCACGCCTCCGCCTACG CACACC DSIPERRPRLRLRQLS GCAGCTGTCCGAGACCGAGGAGGAGGAGGCGGGTGCCATTGTGGAGA CAGGTA ETEEEEAGAIVERLAA GGCTGGCGGCAACGTTGGCGGGCAAGATCACGGACGCGAGTGACTGG GGGAAC TLAGKITDASDWATA GCGACGGCGGAGGGCTACATCACTGCCCTCCCCTACACTCTATACGACA CGCCCTC EGYITALPYTLYDKLQ AACTGCAGCCGTTCGCCCAGCACCGCCACCAGCCTCGCCCACAGCACCG AAACCC PFAQHRHQPRPQHR GCAGCAACCCCAGCGCGACCCACCCCTCGGCACCCACGACGGGGACCA CGCCGG QQPQRDPPLGTHDG CGGCCAGCCGAGCACCAGCCGCAGCCGTCGCCGTCGCCGCCGCGCCAA TACATTA DHGQPSTSRSRRRRR GGATCGCTTACGCCGACGCCCACCGCGTATTACCCGCCACCACCGTGAG TGGTCC RAKDRLRRRPPRITR CACCGCCTCGACGAGGCTCTGGACGACCTGCGGGCGGTGGAGCACGCC GACACC HHREHRLDEALDDLR AGTCCCCACGACCGGCCGGCCGTCGCTCGGGCCCGGCGGCGCGTTGGC TATGAG AVEHASPHDRPAVA CGCGTCAACTCCGCCATTGCCCAGCAACAGCTGCGACACAAGTTTGACA GTGCAA RARRRVGRVNSAIAQ AGGACGAGAAGGCGTGCGTGGATGGAATCTTGGCCGCGGCCCGAGCTT CCTGTAC QQLRHKFDKDEKAC CTCGCGGGTTAGCAACCCCATCGGCCTCAGCCTCGAGGCATCCACCGCC ACAAGT VDGILAAARASRGLA GGTACCCTCCACCGCTGCTGATGACGGCTCCTGCCCCATCCCCAGCGAT TACACA TPSASASRHPPPVPST GAGCTCCACGCCTTCTTCACGGCCGTGAACACGCCGGCGGGCACCTTCG CCACAT AADDGSCPIPSDELH AGCCCATGGCACCAGTGGGTGCCCCGTTCCGCTCCGCCGTCGCCCACCT AGCGAC AFFTAVNTPAGTFEP CCCCGCCGCGACATCGCAGCCTGAGTTGCTCTCGGACGCGCCGACCACG TACCAG MAPVGAPFRSAVAH GACGACATCGAGGACCAACTGCAGCGTGCCCGTGGCAGCTCCAGCCCT GTATTTA LPAATSQPELLSDAPT GGCCTGGACGGCGTGGGGTACGACATTTACAAGGCGTTCGCGGCCCAG CTACCTG TDDIEDQLQRARGSS CTCCTGCCGGCCCTGCACGCAGCATTCGCGTGCTGCTGGCGCCACAAGC GAAGCC SPGLDGVGYDIYKAF AAGTCCCGCAGAGCTGGAAGGTGGGGGTGGTGCGCCTCCTGTTCAAGA AAGGAT AAQLLPALHAAFACC AGGGGGAGCGAACCGAGCCGGCGAACTGGCGCCCGATCTGCCTGCAG TAACCG WRHKQVPQSWKVG CAGGCAATCTACAAGCTGTACGCCGGGGTGCTAGCTCGCCGACTCACCC GTCGGT VVRLLFKKGERTEPA GCTGGCTGGACGCCAACGGCCGCCACGCAGACACGCAGAAGGGATTCC AATACA NWRPICLQQAIYKLY GGGCGATGAACGGCTGCGGGGAACACAATTTCCTCGCAGCCACGCTGG CATAACT AGVLARRLTRWLDA TCGACCAGGCCCGCCGCAAGCGACGAGAGCTCCACGTCGTGTGGTACG TT(SEQ NGRHADTQKGFRA ACTTCGCCAACGCCTTCGGCAGCGTGCCACATGACCTGCTGTGGGAGGC IDNO: MNGCGEHNFLAATL GCTCGAGCGGCAGGGCGTCCCCAGCCCCTTCATTGCTTGCTGCCGGGGC 1322) VDQARRKRRELHVV CTCTACGCGGATGCCGTCTTCACAGTCGGCAACGCAGCGGACGGCACCA WYDFANAFGSVPHD CGGCACCGATCGCGCTACGGGTGGGGGTCTTCCAGGGATGTCCGCTGA LLWEALERQGVPSPF GCCCCCACCTCTTCACGGCCGCGATCGCCCCGCTGCTCCACGCGCTCAA IACCRGLYADAVFTV GCGCTTACCAGACACGGGCGTACAGCTGTCTCGTGTCGACTGCCTCGGT GNAADGTTAPIALRV GCCTCCGCGTACGCGGACGACCTCAAGATATTCAGCGGCACCGAGGGC GVFQGCPLSPHLFTA GGGACCAAGAGACAGCACGCGCTCGTGGCGGACTTCCTGCGCTGGACC AIAPLLHALKRLPDTG GGCATGCGCGCGAATCCGGCGAAGTGCTGCACCATGTCGGTCCAGCGC VQLSRVDCLGASAYA GACACACGTGGAGTGCTCAAGGCCTGCAACCTCGGACTGCAGCTGGAT DDLKIFSGTEGGTKR GGCGCCCCGATCCCCGCGCTGACCATGAGCGCCTCGTACGCCTACCTGG QHALVADFLRWTG GGATCGGGGATGGCTTCGACCATGTGCGCCGCCGCATCGAGCTCGCGC MRANPAKCCTMSV CCAAACTCCAGGAGCTCAAGCACGATGCGACGGCGCTGCTGCAGTCCG QRDTRGVLKACNLGL GACTGGCCCCATGGCAGGTAGTGAAGGCGGTGAAGGTGTACCTCTACC QLDGAPIPALTMSAS CTCGGGTGGAGTACGCCCTTCGCCACCTTCGCCCCTTCCACCAGCAGCTG YAYLGIGDGFDHVRR GAGGGCTTCGACAGACACCTCGTGCGCGGCCTCCGCCACCTCCTCCGCC RIELAPKLQELKHDAT TGCCCGCCAACGCCACGACGGCATTCTTCTACGCGCCGGTGTCGCGCGG ALLQSGLAPWQVVK CGGGCTGGGCCTGCTGCCCCTGACGGAGCTGCACGCGGCGCTACAAGT AVKVYLYPRVEYALR CGCGCACGGATGGCAGATGCTGAACTCGAAGGACCCTGCCATCCGACG HLRPFHQQLEGFDR CATCGCGCGCGTGCAGCTCCGGCAAATCGCCGACGCGCGCCACCGGAT HLVRGLRHLLRLPAN CGACGCTCAAGCGTGGCAAGACCGGGAGGAAGAGCTGGCGCAGCTCCT ATTAFFYAPVSRGGL CCTGAACTCGCAGCTCGGGGCGTCCACCGGCGCACCCCCGAAGCGCCG GLLPLTELHAALQVA CAACGGCGACATCGGCTCGCTCTGGGTGGACGTCCAGCGCCACCTCCGC HGWQMLNSKDPAIR CATCTCAGCCTCAAGCTGGAGACCGCACCGGCGTGTGCGGAGACCGGC RIARVQLRQIADARH ACCGCCGCGGCCATGCTGCAGCTTCGCGTGCCGCACCACGACAAGTGG RIDAQAWQDREEEL CTAGACCACAAGACTGTGCTGCGGCACGTGAAGCTGCACTACAAGAAC AQLLLNSQLGASTGA AAGCACTGGGCCAGGTGGGCGGCCATGMGSGACCAAGGCAAGACGGC PPKRRNGDIGSLWV ACGCACGCATGGGGGCGCGGGGAGCGGCTTCCTCACCCGGCCACGAG DVQRHLRHLSLKLET GCATGTGGGAAGCCGACTACCGCTTCGCGGTGGCCGCCCGCCTGAACC APACAETGTAAAML AGCTCGACACGCACAGCGTSCTGAAGCGCCGGCGCCTCCGGSMSCACG QLRVPHHDKWLDHK ACCGCTGTCGACAGCCGGGCTGCACCCAGGGSGGAGACGCTGGCGCAC TVLRHVKLHYKNKH GTGCTCAACCACTGCGCCGGCACCATGGACGCGGTCCGCGGCCGCCAC WARWAAMXDQGK GACGACGCCCTCAAGCACATCGAGCGCGCGCTGCACGCGTCGTCGCCG TARTHGGAGSGFLTR GGCGGCCAGGACCGCGTCGAGCTCCGGGTCAACCAGACGGTGCCGTCG PRGMWEADYRFAV CTCGCCGGCCCCGCGCTACGGCCCGACCTCCAGCTGTACAACCACACCA AARLNQLDTHSVLKR AGAAGACGGTGGCGGTGGTCGACCTGGCCGTGGCGTTCGAGGAGCAG RRLRXHDRCRQPGCT GCGAGCGACGACCCGGAGAGCTCGGCGCTGGCACGCATCGCCGCACAC QGGDAGARAQPLRR AAGCGCGCGAAGTATGCCGGCGTCAAGCGACACCTCGAGCGCCAAGGG HHGRGPRPPRRRPQ TGGAAGGTCCACCTCTCGGCGCTCGTGTACGGCTCGCTCGGCGCGGTGC AHRARAARVVAGRP CGGCCGGCAACCACAAGGTGCTTACTGAGCACCTTGGGTTGCTCAAGCG GPRRAPGQPDGAVA TGATGCGAAGCGGCTGGACCGGCAGCTCTCGGTGGCGTGCATCCAGTC RRPRATARPPAVQP CAGCCGCCGCATCTGGAACCTGCACTGCAGCCAGCACCGCGCGCGCCA HQEDGGGGRPGRG GCACCAAGCACCAGGGGGAAGTCGGGCGGCGGAGACCGGGGGGACTC VRGAGERRPGELGA CGCCGCGCACCGGCCGCCGCTAGACGGCACACAGGCCCACAGCGGCCG GTHRRTQAREVCRR ACAGGGCCACACCCAGGTAGGGAACCGCCCTCAAACCCCGCCGGTACA QATPRAPRVEGPPLG TTATGGTCCGACACCTATGAGGTGCAACCTGTACACAAGTTACACACCA ARVRLARRGAGRQP CATAGCGACTACCAGGTATTTACTACCTGGAAGCCAAGGATTAACCGGT QGAY(SEQIDNO: CGGTAATACACATAACTTT(SEQIDNO:1076) 1444) NeSL Utopia- Pythium TGACTGGTGTTTGATCACGATCAATGAGGTGATTAACATGAGCCGGAGC TGACTGGT TAAGCG MDYDDSEFFDAICIP 2_PU ultimum AGGCCCCTTACACGCTGGTGCTGTAATGGTTCAGGAATGCTCTTATGAG GTTTGATC GGGGGT DEDADVLDDGDEGD TAACTCCACAGTATAATTTTTGTTAGCGGGGGTGAGCGCGTGCGCCGCC ACGATCAA CCAGAC EGGNDDESSEPLPLAI CCCACCTTTCTCTTTTCGTTTGTATTTGGTTCCTCTGGAGCGACTTCCGTG TGAGGTGA CCCACA TNAPSAPLHATMLC GCTTTTGTCGCGCGCTATCGCGCCCGGCATGAGGATCGTCGCTCCTGAC TTAACATG AGAGAG GTVTQPWLLRFDGA GCCATGTGCGCCAGTATATCGGAGCCCCGTGCTCCGCTTGGGTGTGTGT AGCCGGA AAGCAG CRRNPGPAGAGATL TCCGCCACGACGCCACGCGATTTGCGTGCACGGCATGCGCTGGCGACTT GCAGGCCC GAATCA TRPNGIILWTHYRYIP CCGCTCAGTGCGCGAACTCGCGGTGCATCGTCGACGTGCGCACCGGTCT CTTACACG TGGTCC DKTATNNVAEYEALL CTGGCCTTCCAGGACTGGTATGAGTCGTCGTGCGCGTGCAGAGCGCTCT CTGGTGCT GCATGG DGLRCAAHHGVKHL TCACCGCCCGTCTCGACGCGTTCGTTCATTCGAGCCGTTGCGACCACAAT GTAATGGT ACCAGT RIEGDSNLVIEQVKGI GCGAACCAATCAGCGAACGCCGTCCCCTCGTCCGCCGCCTCGACAACGA TCAGGAAT AGGGCA FACSTSLRPRRDQVR GAGCGGACGCGGATGCGACGGCCCCCCCTCCCGCTATCTCTCTTCCGCG GCTCTTAT CGCTCC EILRHFETYSFRHIDR GACGGGCTTCCCCCGCCCCTGTCCTAGTTGCGCCGCTCTCCTTCTGGATG GAGTAACT ACAAAG ALNRQADRLANQAL CGGCAACTTGGACGACCCACGTGCACGCGTGCGCTGCCGCCAGTGCGA CCACAGTA GTTATC DLLKTVSVCALSQTR CGGCGTTGATCCAGCGTCATCTACACGCCGTTCCTCCCGTATCGACAGCA TAATTTTT GCCCTC VQDDTGAAHGCWH ACCGATCTCAACGGCGATGATAGCCAACCTCCAGACCCGCGTGCCAACA GTTAGCGG AAACCC WTPPDASPTDDASTS TCCAACAGCTTCCAGCCATCCATTCGACAACGACAACGACAACGACAAC GGGTGAG ATCACA ILTQDVPVPMDIDDY GACGACGACCCGCTGCTCGACGACGTGCTCATGGACTACGACGACTCG CGCGTGCG CGAAGG DPDEPMNAADDPVS GAGTTTTTCGACGCCATCTGCATTCCAGACGAAGACGCTGACGTGCTCG CCGCCCCC ATCTAA INAEREGGTVYPVLRL ACGATGGTGACGAAGGTGACGAAGGTGGTAACGACGACGAGTCGTCC ACCTTTCT AAAGAA GPNVVPERQKRLQIP GAGCCACTGCCTCTAGCGATCACGAACGCTCCAAGCGCTCCCTTGCACG CTTTTCGTT AGCAAC WLPPREMQKLEKKIE CGACCATGCTCTGCGGCACAGTTACACAGCCATGGCTCCTGCGCTTCGA TGTATTTG AATCGA VLGETFASRIRDAPD CGGCGCGTGCCGACGCAACCCAGGACCCGCCGGCGCCGGCGCCACGCT GTTCCTCT AATAGT WFSAEGYITALTSELA CACGCGGCCCAATGGTATCATCCTATGGACGCACTACAGGTATATCCCG GGAGCGA AAATAG TLIRQSTAATTGPNA GACAAGACGGCGACCAACAACGTTGCCGAGTACGAAGCGCTGCTTGAC CTTCCGTG TAAATA ARPCERTISKEKRRAR GGGCTGCGCTGCGCGGCGCACCACGGCGTCAAACATCTGCGTATTGAA GCTTTTGT GCTTAG RTTPLQRALAEAKHE GGTGACTCCAACCTCGTCATCGAGCAAGTCAAAGGCATCTTCGCGTGCT CGCGCGCT AAAGGT LQIIQPDASRKSVRKA CCACATCGCTGCGCCCGCGCCGCGACCAAGTCCGCGAGATCCTGCGCCA ATCGCGCC CAACAT ARRVKRISQAQQRH CTTCGAGACCTACAGCTTCCGCCACATCGACCGCGCACTTAATCGACAA CGGCATGA GCGAAA DLRRLFSTNERRCVEK GCCGACCGCCTGGCCAACCAGGCACTCGATCTCCTCAAGACGGTCTCCG GGATCGTC TGCATG ILRDPPVGPSSTSSSL TGTGCGCGCTCTCTCAAACTCGCGTCCAAGACGATACCGGTGCCGCACA GCTCCTGA AGGAAC PATDDDRCTIDPADL TGGGTGCTGGCACTGGACGCCGCCAGATGCGTCGCCAACCGACGATGC CGCCATGT GTAAGA FAYFQTQATAPTNFD ATCCACGTCGATACTCACGCAGGACGTTCCAGTCCCCATGGATATCGAT GCGCCAGT TGGTAA FDDEGGELFRSVLDE GATTACGACCCCGACGAGCCCATGAATGCGGCCGATGATCCAGTCAGC ATATCGGA TACATTT LPRADQEVHLLEDEIT ATTAATGCTGAACGAGAAGGTGGTACTGTCTACCCAGTGCTTCGCCTCG GCCCCGTG TTATA RDEIEDQLSRISKSTA GTCCCAACGTCGTCCCCGAACGACAAAAGAGGTTGCAAATCCCGTGGCT CTCCGCTT (SEQID PGLDGITNAVYVRFK CCCGCCGCGTGAGATGCAAAAACTCGAAAAGAAGATCGAAGTGCTGGG GGGTGTGT NO: LQLLDALQAAFNAC GGAGACTTTTGCGTCGCGCATTCGTGACGCGCCCGATTGGTTCTCTGCA GTTCCGCC 1323) WRYNRVPSMWKAA GAAGGCTACATCACGGCTCTAACGTCCGAGCTTGCAACGCTCATTCGCC ACGACGCC FVRLIYKKGNRAVPS AATCAACGGCGGCGACGACGGGACCGAATGCAGCACGACCCTGCGAGC ACGCGATT NWRPICLQQTVYKLY GCACCATCAGCAAGGAAAAGCGGCGCGCGCGAAGAACGACACCGTTAC TGCGTGCA TAILASRLQRWMDA AACGCGCGCTCGCGGAGGCAAAGCACGAGCTCCAGATCATTCAGCCTG CGGCATGC NARFTMSQKGFRAF ATGCCTCGCGCAAGAGCGTGAGGAAGGCTGCACGACGAGTCAAACGGA GCTGGCGA NGCHEHNFVATCLH TCTCCCAAGCGCAGCAGCGCCACGACCTGCGTCGCCTCTTCTCGACCAA CTTCCGCT DQTRRLRKKLAIVWY CGAGCGTCGTTGTGTCGAGAAAATCCTTCGTGATCCCCCAGTGGGGCCA CAGTGCGC DLRNAFGSLPHEYLW TCGAGTACCAGCTCCAGCTTGCCAGCCACCGACGATGACCGCTGTACCA GAACTCGC RVLARLGMPPQFVA TTGACCCAGCCGACCTGTTCGCCTACTTCCAGACGCAAGCCACGGCCCC GGTGCATC RVRQLYADASFTVES AACGAACTTCGACTTTGACGACGAAGGCGGCGAACTGTTCCGCTCGGTG GTCGACGT RDGTTDPVQLERGVY CTCGACGAACTGCCGCGCGCCGACCAAGAGGTTCACCTTCTTGAGGACG GCGCACCG QGCPLSPYLFIAALIPL AGATCACTCGGGACGAGATCGAAGACCAGCTCTCGCGCATCTCGAAATC GTCTCTGG VRALHKLKDQHGIVL GACGGCACCGGGACTCGACGGCATCACGAACGCTGTGTACGTGCGCTT CCTTCCAG APGVTDCVSAYADDI CAAGCTACAGCTGCTTGATGCGCTACAAGCTGCATTCAATGCATGTTGG GACTGGTA KIFARSGTGAKALHEI CGATACAATCGTGTTCCCAGCATGTGGAAGGCGGCGTTTGTGCGTCTGA TGAGTCGT VVRFLSWTNMAANP TCTACAAAAAGGGCAATCGCGCTGTGCCAAGCAATTGGCGGCCGATCT CGTGCGCG AKCALMVTDGARGG GCCTACAGCAGACAGTGTACAAACTATACACAGCCATCCTCGCCTCGCG TGCAGAGC DDTDASMTLSIEGETI ACTGCAGAGATGGATGGACGCGAACGCGCGGTTCACGATGTCGCAAAA GCTCTTCA PRLTGKEGYVYLGVE AGGATTTAGAGCGTTTAACGGTTGCCACGAACACAATTTTGTGGCTACC CCGCCCGT DGLAHERRATCLRDS TGTCTACACGACCAAACACGACGCCTTCGAAAGAAGCTTGCGATCGTGT CTCGACGC LKAASADVVRLLRSD GGTATGATCTCCGGAACGCATTCGGCTCGCTCCCGCATGAGTATTTGTG GTTCGTTC LAPWQIVRAIKSHVL GCGAGTACTGGCGCGCCTTGGCATGCCACCGCAGTTTGTCGCAAGAGT ATTCGAGC SRFDYVLRHLRPFLSL GAGGCAGCTGTACGCGGATGCAAGCTTCACCGTGGAATCCCGTGACGG CGTTGCGA FDGFDKMLVRGIKRL GACGACAGACCCCGTCCAGCTGGAGCGTGGCGTATATCAAGGGTGTCC CCACAATG CQLPQTATSEFLFSPT TCTGAGTCCATACTTGTTTATCGCCGCACTCATTCCGCTGGTGCGCGCAC CGAACCAA SAGGLGFLPLKELFAA TCCACAAGCTCAAAGACCAGCACGGCATCGTGCTGGCTCCCGGCGTCAC TCAGCGAA LQIVHALQMLHSKDA CGACTGCGTCTCGGCTTATGCCGACGACATCAAGATTTTCGCTCGATCG CGCCGTCC NVRAIARHQALQVV GGAACCGGCGCCAAAGCCCTCCACGAGATCGTGGTGCGATTTCTGTCCT CCTCGTCC RKRYALQSDHWSDR GGACGAACATGGCAGCGAACCCGGCCAAGTGCGCGCTGATGGTGACCG GCCGCCTC EEELLEEFFNGTLERS ATGGCGCTCGTGGCGGTGACGACACCGATGCGAGCATGACGCTATCTA GACAACGA PFALAKKVSGDIASL TCGAAGGTGAGACGATCCCACGCCTCACCGGCAAAGAAGGCTACGTGT GAGCGGA WTDVRVNLTKYGLKF ATCTCGGAGTGGAAGATGGGCTCGCGCATGAGCGCAGAGCAACGTGCC CGCGGATG GEAHGRRLQPLVSHT TGCGCGACTCGCTCAAGGCAGCGAGCGCGGATGTCGTCCGCCTGCTTCG CGACGGCC DKQLAPQQWASAIK CAGCGACCTCGCACCTTGGCAAATCGTGCGTGCGATCAAGTCGCACGTG CCCCCTCC THMRLRHLKRWTTL CTCTCGCGTTTCGATTATGTACTCCGACACTTGCGCCCGTTCCTTAGTCTC CGCTATCT VDQGKTARMHERIG TTTGACGGCTTCGACAAGATGCTGGTGCGCGGAATCAAGAGGCTGTGC CTCTTCCG SAFLTRPSGVYDASY CAGCTGCCGCAAACAGCCACGTCCGAGTTTTTGTTCTCGCCCACCTCGGC CGGACGG AFAVRARLNQVDTRS GGGTGGCCTCGGCTTCCTGCCACTGAAGGAGCTTTTTGCGGCGCTGCAG GCTTCCCC ALKRKRIVNNSRCRV ATTGTGCACGCACTTCAAATGCTGCACTCGAAGGACGCCAACGTCCGTG CGCCCCTG SGCSELETLAHVLNH CCATTGCACGGCACCAAGCGCTGCAAGTGGTGCGTAAGCGCTACGCGC TCCTAGTT CRFGSDSIRARHAETL TGCAGTCTGACCATTGGAGCGACCGCGAAGAGGAACTTCTCGAAGAGT GCGCCGCT LLIKTTMERELTRPGR TCTTTAACGGCACGCTGGAGCGCTCGCCCTTCGCCCTCGCAAAGAAAGT CTCCTTCT QHQRLLVDATVPEAR GTCAGGGGACATCGCGTCACTCTGGACGGATGTACGAGTGAATCTCACC GGATGCG DPVPSNDAAESNIGA AAGTACGGACTCAAATTCGGCGAGGCGCACGGACGGCGACTGCAGCCG GCAACTTG MAPSISHLRPDIQLYD CTCGTCAGTCACACTGACAAGCAGCTTGCACCTCAACAATGGGCTAGTG GACGACCC NKTMEAVIIDLAVAF CAATCAAGACGCACATGCGCCTGCGCCACCTCAAGCGATGGACCACCCT ACGTGCAC EDQSTDDAASSSSFA GGTCGACCAGGGTAAGACAGCCCGGATGCATGAGCGCATCGGAAGTGC GCGTGCGC RVKGVKTKKYEVIKQ GTTTCTGACAAGGCCGTCTGGAGTGTACGACGCCAGCTATGCGTTCGCG TGCCGCCA FLEYKGYTVHVAALV GTTCGAGCAAGACTAAACCAAGTCGATACGCGCAGCGCATTGAAGCGC GTGCGACG YGSLGSVDTGNFAVY AAGCGCATCGTGAACAATAGCCGCTGCCGTGTGAGTGGATGCAGCGAG GCGTTGAT TERLGLRKGAVRRLE CTGGAAACGCTCGCCCACGTACTGAATCACTGCCGATTCGGGTCAGACT CCAGCGTC CSLSARHINFAHRM CGATACGAGCACGACACGCAGAGACACTCTTGCTGATCAAGACGACCAT ATCTACAC WRRHAIAHTTGLRLI GGAACGTGAGCTCACGCGGCCAGGGCGGCAGCACCAGCGCCTCCTGGT GCCGTTCC GTNSVQQQGVQRA CGACGCGACAGTACCCGAAGCTCGTGATCCCGTACCAAGTAATGACGCT TCCCGTAT PAEKQQHQRPVQRP GCTGAGAGCAACATTGGCGCAATGGCACCCAGCATCTCCCACTTGCGCC CGACAGCA SRAQAPRDQPSQQQ CCGACATTCAACTCTACGACAACAAGACAATGGAAGCCGTGATCATCGA ACCGATCT QSQQSFQQQSQQS CCTCGCGGTTGCGTTCGAAGACCAGTCAACGGACGACGCCGCATCCAGT CAACGGCG QQSQQSQQSQQSR TCCAGTTTCGCGCGCGTCAAGGGTGTGAAGACAAAGAAATATGAGGTA ATGATAGC QRHAPTPTPVPVPVP ATCAAGCAATTTCTGGAATACAAAGGCTACACTGTGCATGTAGCAGCGC CAACCTCC VPVSTPTLTPTPTPTR TAGTCTATGGTTCACTAGGCTCCGTTGACACCGGCAACTTTGCCGTGTAT AGACCCGC RPKPAPSSTQPQQG ACGGAACGTCTCGGTCTCCGCAAGGGCGCAGTGCGGCGACTGGAGTGC GTGCCAAC APAQRRQQREQKKQ TCGCTCTCGGCGCGCCACATCAACTTCGCGCACCGTATGTGGCGACGCC ATCCAACA PACRRRHATAVRETP ACGCGATCGCCCACACGACCGGTTTGCGCCTTATCGGCACGAACAGCGT GCTTCCAG PAAPTAARTATPTAR GCAACAGCAGGGTGTGCAGCGCGCCCCGGCAGAGAAGCAACAACATCA CCATCCAT PTATTRSTSTTRSTAT GCGACCAGTGCAGCGACCCAGCCGCGCGCAAGCTCCACGTGATCAGCC TCGACAAC ATSTTRSSAPTSRPSA ATCGCAGCAACAGCAGTCCCAGCAGTCGTTCCAGCAGCAGTCCCAGCAG GACAACGA PRPRSSAPRPRSSAPT TCCCAGCAGTCCCAGCAGTCCCAGCAGTCCCAGCAGTCGCGTCAACGCC CAACGACA TRSAAPTTRSAAPTAI ACGCCCCAACTCCTACCCCAGTCCCAGTCCCAGTCCCAGTTCCAGTATCG ACGACGAC ATTSVYKSRRTTNAIS ACACCGACACTAACACCGACACCGACACCGACACGCCGGCCCAAGCCG GACCCGCT GATTRRSASSKRAPM GCTCCGTCCTCGACTCAACCCCAGCAAGGAGCTCCAGCACAGCGACGAC GCTCGACG QPRTALTPTQQQQR AACAACGAGAGCAGAAAAAGCAACCTGCGTGCCGCCGTCGACACGCAA ACGTGCTC Q(SEQIDNO:1445) CGGCAGTGAGAGAGACACCGCCAGCAGCTCCGACAGCAGCTCGGACAG (SEQID CAACTCCGACAGCGAGACCGACAGCGACAACGAGATCAACATCGACAA NO:1200) CGAGATCAACAGCGACAGCGACATCGACAACGAGATCATCAGCACCGA CATCGAGACCATCAGCACCGAGACCGAGATCATCAGCACCGAGACCGA GATCATCAGCACCGACAACGAGATCAGCAGCTCCGACAACGAGATCAG CTGCACCGACAGCGATAGCGACAACCAGCGTTTACAAAAGTCGACGCAC GACCAACGCCATCTCTGGCGCAACGACGCGACGCAGTGCGTCATCCAA GCGTGCGCCAATGCAGCCCCGAACTGCATTGACGCCAACACAACAACAA CAACGACAGTAAGCGGGGGGTCCAGACCCCACAAGAGAGAAGCAGGA ATCATGGTCCGCATGGACCAGTAGGGCACGCTCCACAAAGGTTATCGCC CTCAAACCCATCACACGAAGGATCTAAAAAGAAAGCAACAATCGAAATA GTAAATAGTAAATAGCTTAGAAAGGTCAACATGCGAAATGCATGAGGA ACGTAAGATGGTAATACATTTTTATA(SEQIDNO:1077) NeSL Utopia- Phytophthora GTACGGCCGAATCCCTGGCCTCTCAGCCTGTACGCGGGGCTATACTTGG GTACGGCC TAGACG MPRSLASEPVHSSAS 3_PI infestans TCTAAGTATGCCCCGGTCGCTGGCCTCTGAGCCCGTACACAGCTCGGCC GAATCCCT GCAAAG RLPSQAPPTSRSGAA TCGCGGCTGCCTTCCCAGGCGCCGCCGACTTCGCGCTCTGGCGCGGCCC GGCCTCTC TTCTGG HTPPPSLQARPLASA ACACGCCGCCGCCGAGCCTCCAAGCGCGCCCGTTGGCTTCCGCAGACGC AGCCTGTA CCCGCG DAGLAATALASPQDP AGGGCTCGCGGCGACGGCCCTCGCCAGCCCCCAAGACCCCCCCTACGAT CGCGGGG TAGGCC PYDVAPPGRAAGRLP GTGGCACCACCCGGCAGGGCGGCCGGCAGGCTGCCCGACTCGGTATCG CTATACTT GAAAGG DSVSPGATLSAATAR CCGGGTGCTACACTCTCAGCCGCCACAGCTCGGGCCTTGGCGGTCCGCC GGTCTAAG GCCCCG ALAVRHWPLELDSDS ATTGGCCCTTGGAGCTCGACAGCGACAGCAGCGACGACGAGGACGCTC T(SEQID CCCAGG SDDEDAQDPHAAAP AAGACCCCCACGCCGCCGCCCCAGGACCCCCCGCAGACGTCGCGAGTG NO:1201) TAGGGT GPPADVASVLAPPGR TGCTTGCCCCACCCGGCAGGGCAGGCAGCATGCTCGCCGACCCAGCAG AACGCC AGSMLADPAALAAG CGCTGGCAGCCGGTCTGGCTCGCGCTCCGCCGCCCCCATCGGCACCCCA CTCGGG LARAPPPPSAPQDPS GGACCCAAGCCCCGCCCTCCCTGCTGGCCCCGCCGGCCAGAACCCTCGC AAAACC PALPAGPAGQNPRA GCCGCCGCCCCAGCCCGTGTTGAAGTCCACACAGTCGTGGCACCACCCG ATTCTG AAPARVEVHTVVAPP GCAGGGCGGGCGGCATGCTGCCCGACCCGGGGTTGGTTGACTCAAGCC GTGTTT GRAGGMLPDPGLVD CAGCAGCGGCTACCGCAGCTACTCCAGCTCCGGTAGCTGCTACTGCTAC GGCTGT SSPAAATAATPAPVA AACTGCTCGCGTCGCCGTGGAGCACCACGCGCACGCTGAGCCGAACCA TTTTCAA ATATTARVAVEHHA AGAACATCTACCGATGGCTCGCGTGCTCGTGGAGCCCATGCAGGTCGAC CAGTCG HAEPNQEHLPMARV GAGTGCAGCAGCTGCGACCGCAGCACTTTGACAGCCGACGACGGGAGC AACTTCA LVEPMQVDECSSCD GGTGACGACGTTGCAGCCCCAAGCTCCATGCTCAGCAACGACGTCGCCG ACTCGG RSTLTADDGSGDDV CCCCCATGGACGTTGACTCCGGCACCAGTTGCCCTCCAACGTTGCAGCA AGCATA AAPSSMLSNDVAAP GCCGCTCCAGCGACCGCGTGCGCTGCACGTAGGCAGCAAGCGCCGCCG TCAGAT MDVDSGTSCPPTLQ ACTCAACGCAGATGACGGTGAAGAAGCTCACCAGCTGCAAGAAGAGGA ACACTTA QPLQRPRALHVGSKR GGAGGCTGGCATTCATGCCCCAGCGCTCAGGCTATCCGCGGCCAGCGCT CTCACAC RRLNADDGEEAHQL CAACCGGCCTCCGTCCTTGCCGTGTACACGCACAACGCGTCGCGGTTCG ATTTAG QEEEEAGIHAPALRLS ACTGCACGCTGTGCGCGTACACGGCCGGCAGCTTCGCATCACTGAAGAC ATATCA AASAQPASVLAVYTH GCACAGGAACTCTCGGCACCGCCGCACCGCATTCCTCGACCGATTCTCG GATAGG NASRFDCTLCAYTAG GCGGGTTGCGCGTGTGGCGTGCCCTTCGCCTCGAGGCTGGCTGCAGCA GAACCT SFASLKTHRNSRHRR AGACACGCGCAAGCGTGCGCCAGCCTCTCGAGCGCACCATTGGCCGAG TTATTAG TAFLDRFSAGCACGV GCTTCGTCGGCAGCAGGAGCATCAAGCCACACTGTCGACGGAGCCGAC GGAGAT PFASRLAAARHAQAC TCCACCGTCTCAGCCGCGGGTCACGCTGAGCCCGACCTGCCCCGCCACA AACGGG ASLSSAPLAEASSAAG ACGCCACGGAGCTCACAGCGTCCCCCCCGCTGGTGAGCTCCACCGATGT TACACC ASSHTVDGADSTVSA TGAAGTGCAGGCAACAGAGACCGAGGCCACCGAGAACCGCTGGGGCA GGATGG AGHAEPDLPRHNAT CTCCACTTCCGCGCGTGCTGGTCGCGTCTAGGATCGCCGGGCGCCTCGC TAAATAT ELTASPPLVSSTDVEV TCAGGTGCCACCCCCACGCTGGGGCCCCCCGCTACCGCGCACCACCATC ACAAAA QATETEATENRWGT GCGGGCCGCATTGCGACGAGGCTCGCCGCCACACCTGCCCCCCGCTGG CCTTCTC PLPRVLVASRIAGRLA AGCCCACCGCTTCCACGAAGCCTGGTTGCGTCACGCATCGCCGGCCGCC TGTTCTA QVPPPRWGPPLPRT TGCTACCCGCTCTGCCAGACGCGCCTGCATGCGAGGACGAGGCCAAGG ATCAGT TIAGRIATRLAATPAP ACAGCGACGAAATGGACTGGGAGGCGAGCGAGCCGCACGTCGAAGCG GTGAAA RWSPPLPRSLVASRIA CCAGGGCCGGTCGACGAGGAGACCATCGACGACGCCGACGGCGAGTG ACTGGT GRLLPALPDAPACED GCTCCTGCGCTTTGACGGTGCCTGCCGTGCCAACCCAGGCCCAGGCGGC TTTCGCC EAKDSDEMDWEASE GCCGGCGCAGCTCTCTTCAAGCCAAGCGGTCCCGTAGTGTGGACTTGTT TTTTGGC PHVEAPGPVDEETID CCCACTACGACCCGAGCACCACGGCAACGAACAACACAGCAGAGTACA GGACTT DADGEWLLRFDGAC CTGCGTTGCTTCTCGGGGCCAGGGCCGCCGCCGACCACGGCGTCACGA TTTCACT RANPGPGGAGAALF AGCTGCGCGTCGAGGGCGACAGCACGCTGGTCATCCAGCAAGTGCGGG CGCATTT KPSGPVVWTCSHYD GCATCTTCGCGACGAGAAGCACGCGGCTCCGAGCACTGCGCAACAAGG TTGGGC PSTTATNNTAEYTALL TCAAGCTCGAGCTCGCGCGCGTGGGGAGCTTCTCGCTCCACCACATCGA AATCGT LGARAAADHGVTKL CCGGCAGGCGAACGGTCACGCCGACCGCCTTGCCAACGCCGGCCTCGA CTGCGG RVEGDSTLVIQQVRG CCGACGCCGCACCCAGCTCGAGTGCAGTGTGCACCCCGATGGTAGGGG CTAGCTT IFATRSTRLRALRNKV CTGCACCAACACCTCCGTAGCGACCGCAGCGCCTACAGCTTCTGCAGCA GCTAGC KLELARVGSFSLHHID CCCTCGACCCCAACCCGGCCTCCTGCAACTACGGCCGCTCCGTTTCACAG GGCGGA RQANGHADRLANAG CGACCAAGGCCACATCGACGAGGACGACGAACGTCGTGCCGACATCGA CGAGCG LDRRRTQLECSVHPD CGACGGTGAAATCTACGCCCCAATGACGCTCGGGCCAGACGAAGTGCC GTCTCC GRGCTNTSVATAAPT TGCTCGGCGACCGCGCCTCCGACTCCGCCAACTCAGCGACGAAGAACTC GGGGGC ASAAPSTPTRPPATT GAGGCCGCTGGTGCCATCGTGGAAAGGCTCAGCGCATCGTTGTCAGCC GTTCACC AAPFHSDQGHIDED AAGATCACCGACGCAGAAGACTGGGGGACGGCAGAAGGCTACATTACG TTTCCCC DERRADIDDGEIYAP GCCCTGCCTCACCTGCTGTACGACAAGCTCCTACCGTACTCGCGGACAG CGCGAG MTLGPDEVPARRPRL CGCCCCGTCATCAGCGGCCACCCCGGCCATCGCGCAACCAGCAAGACCA GCCAAC RLRQLSDEELEAAGAI CCCGCAGCCGCGACGCGACCAACCGCAGCGCAACGTGGACGAACAGCA TACACC VERLSASLSAKITDAE GCATGCGGAGTCACAGCAGGGTGAAGACCAGCGACAGCAGCAGCCGC GATCTTC DWGTAEGYITALPHL CGACGCGCCGCCGACGTCGGCGAGGGAAGCGTAGAGGCCGCCGACAG TCTACAC LYDKLLPYSRTAPRH CGACGACACCCTCGCCAGCCAGGACAGAGTACCAGAGCCTCCCAGCAA TTTTCTA QRPPRPSRNQQDHP AGCCGGCAGCGCCGACCCCGACCCCCGCGCGTTACCCGCCACCACCGCG ATTCGCC QPRRDQPQRNVDEQ AGCACCGCATCGACGAGGCGTTGGATGAGCTCCATACACTGGAGCGGG TCCGTCT QHAESQQGEDQRQ CGCGGCCACAGGACCGATCGGCGATTGACAAGGCGAGACGGCGCGTC TCGGTC QQPPTRRRRRRGKR GGACGTGTCCGCGGGGCCATCAACCAACACCTGCTGCGCCACCGGTTTG TTCGGTT RGRRQRRHPRQPGQ ACACCGACGAGAAGGCGTGCGTGGCGGACATCTTGGAGAAAGCGCAC GTCGGG STRASQQSRQRRPRP GCGGCGCGAGCGGCACGTACGGCACAGGCGGCGGGCGCAGCAACGTC CTTTTTT PRVTRHHREHRIDEA TACAGGAGGAGCAGCGACAGCACCAACACAGCAAGCAGCCACGTCAGC CTTTTTG LDELHTLERARPQDR CCTTGGTGATGCCGACGACGGAACCTGCCCCATCCTGGCCGACGAGCTC ACCAAT SAIDKARRRVGRVRG TGGCAGTACTTCACCGGCACCAACACGCCGCGCTGGGAGTTCAACCCCG CAGAGC AINQHLLRHRFDTDE CGACACCCGTCGGCGAGGCGTTCCGTACGGCGATGGCAAGGCTTCCTCC GCGCCA KACVADILEKAHAAR GGCGACGAGGCTCCGCGAACTGCTGACAGAAGCCCCGACTGCGGACGA TGCGAC AARTAQAAGAATST AATCGAGACCCAGCTACAGCACGTGCGGGGATCCTCGAGCCCAGGACT TCTTCTG GGAATAPTQQAATS CGATGGCATCGGCTACGACGTCTACCAGCGCTTCGCACAGCAACTGCTC GCCAAT ALGDADDGTCPILAD CCGGTGCTCACCGCTTCGTTCAAGCGATGCTGGACAGCCAAGATGGTGC CAGAGA ELWQYFTGTNTPRW CACAGAGCTGGCAGGTGGGGGTCGTGCGACTGCTGTACAAGAAGGGC CCGGGC EFNPATPVGEAFRTA GCGCATGACGACCCGGCAAACTGGCGGCCCATTTGCCTCCAGCAAGCCA CCTGTCC MARLPPATRLRELLT TCTACAAGCTTTACACAGGTGTACTTGCTCGGCGACTCGTGCGGTGGCT TCGGAC EAPTADEIETQLQHV GGACGTGAACGACAGGCACGCACCAGGACAGAAGGGGTTCAGGGCAG AGCGAG RGSSSPGLDGIGYDV TGAACGGCTGCGGGGAGCACAACTTCCTCGCAGCCACGCTCATCGACCA GCCTCC YQRFAQQLLPVLTAS AGCACGTCGCAAGCGCCGATCCCTATACGAGGTATGGTACGACTTCAGG ACGGCC FKRCWTAKMVPQS AACGCCTTCGGCAGCGTGCCATTCCAGCTGCTGTGGGACTCGCTGCAGC AGCCAA WQVGVVRLLYKKGA GCCTCGGCGCTCCCGCAGACTTCATCGACATGTGCAAGGGGCTGTACCA TCAAGT HDDPANWRPICLQQ CCAGGCCGCCTTCGTCATCGGGAACGCCGCCGACGGACCGACAGCAGC CTCGGC AIYKLYTGVLARRLVR AATCCGGCAGCAGGTTGGCGTCTTCCAAGGGTGCCCACTGAGCCCCCAG AGCGAC WLDVNDRHAPGQK CTCTTCAACGTCGCGATCAGCCCGCTGCTCTTCGCGCTCCGCAGGCTGCC GCGTCT GFRAVNGCGEHNFL GGAGACCGGGGTGCAGCTGTCCGGCGACGATCGTGTTGGAGTTTCGGC TTCTATA AATLIDQARRKRRSLY TTATGCGGACGACCTCAAGACCTTCAGCAGCACCAAGGCGGGCGCCAC GCGCAG EVWYDFRNAFGSVP CAAGCAACACGAGCTCGTCGCCGCGTTCCTGGCTTGGACTGGCATGAAA CTGACG FQLLWDSLQRLGAP GCCAACGCAGCCAAGTGCAGCAGCATGGGCGTACGGCGCAACAGCAAC AGGCCG ADFIDMCKGLYHQA GGCGCGACAGAGGCGGACAATCTGGACCTCGCGCTCGACGGCACTCCG ATCTGG AFVIGNAADGPTAAI ATCCCCAGCATGACTCACATGCAATCGTACACCTACCTGGGGATCGGAG CGGCCC RQQVGVFQGCPLSP ACGGTTTCGACCATGTGCACCGCCGCATCGAGCTCGCGCCCAAGCTGAA CCGATT QLFNVAISPLLFALRR GACGCTCAAGCAGGACACGACGGCGCTACTCGAGTCCGGCCTCGCGCC GGTCCG LPETGVQLSGDDRVG TTGGCAGGTGGTGAAGGCCGTGAAGGTTTACCTTTACCCTCGTGTCGAG ACTTTCG VSAYADDLKTFSSTKA TACGCGCTTCGGCATCTGAGACCCGAAGACCAGCTGCTGGAGAGCTTCG GCCAAT GATKQHELVAAFLA ACCTTCACCTCCGCGCCGGCCTTCGCCACCTGCTCCGCCTGCCCAAGAAC CAGCGA WTGMKANAAKCSS GCCAACAACGACTTCTTCTACTCGCCGGTCTCGCGCGGTGGCTTGGGAC CGACGA MGVRRNSNGATEAD TCCTGCCGCTGGTGGAGCTGCACGCGGCGCTACAAATCGCGCACGGGT GGGGGC NLDLALDGTPIPSMT GGCAGATGCTTAACTCGACGGACCCAGCCACCCGACGCATCGCGCGCG AGGGGT HMQSYTYLGIGDGF AGCAGCTCCACCAGATTGCGGATGCAAGGCACCGCCTTGACAAGGCAC TTACACT DHVHRRIELAPKLKTL ACTGGAAAGAGCGTGGCGATGAGCTCTGCCAGCTCTTCCTTAACTTGGA TTTGCCC KQDTTALLESGLAPW CCTCGGGACGTCGGCCCACGCACCGCCCAAACGCCGCAACTGCGACATC CCGTTTC QVVKAVKVYLYPRVE GGCTCGCTGTGGGTCGACGTCCGCAAGAACCTGCAGGCCTTTGGGCTCA GGCTTC YALRHLRPEDQLLESF AGCTCGAGACCGCCCCGGCCGACGCTGAGTCCGGCACACCCGCCCTGCC AACTTCA DLHLRAGLRHLLRLPK GCTGCAGCTTCGCGTGCCACACCACGAGAAGTGGTTGACGCACCGCGA GGCCAA NANNDFFYSPVSRG CGTGCTGCGGCACGTGAAGCAGCACCTGAAGAACAAACATTGGCGGGC AATGGC GLGLLPLVELHAALQI GTGGTGTGCCTTCCAAGACCAAGGCAGGACGGCACGGGCACACGGTGG GATTTG AHGWQMLNSTDPA GGTCGGCAGCAGCTTCATCACCCGCCCCCGTGGCATGTGGGAGAGCGA GACCCT TRRIAREQLHQIADA CTACCGCTTTGCGGTGGCCGCTCGCCTCAATATGGTGGACACGAGTGCC CCACGC RHRLDKAHWKERGD ACTCTCGCGCGCCGACGCCTCCGCGCCCATGACCGCTGCCGCTACCCGG GCCGTG ELCQLFLNLDLGTSA GCTGCCGCTGGAAGGAGTCACTGGAGCACGTGCTCAACCACTGTCCGG CCACTG HAPPKRRNCDIGSLW GCACGATGGACGCCGTGCGTGGCCGCCACGACGGTGTGCTCCGAGAGA CTCGGC VDVRKNLQAFGLKLE TTGAGCACGCGCTCCGTGCACCATCGGGAGCACGCAGAGAGCTGCGCG ACCGGC TAPADAESGTPALPL TGAACCAGACGGTTCCGGGCCTACCCGGCCCCGCACTACGCCCCGACAT GGCGAT QLRVPHHEKWLTHR CCAGGTCTACAACCACGACCAACGCACGGTCGCAGTGGTGGACCTGGC TCAGCG DVLRHVKQHLKNKH GGTGGCCTTCGACAGGCAGGACCGCGACGACCCAGAGACCTCGGGCCT GGTGCA WRAWCAFQDQGRT GGCCAAAGCCGCGGCGGAGAAGAAGGCCAAGTACACTGGCATCCAGC ACTTCG ARAHGGVGSSFITRP GACACCTCGAGCGCCAAGGGTGGAAGGTCCACCTTTCCGCGCTCGTCTA GGCACG RGMWESDYRFAVAA CGGGTCACTGGGCTCGGTGGCCCCCAACAACTACAAAGTCTACACGGA TGTGCA RLNMVDTSATLARRR GCACCTGGGGCTCCTCAAGCGTGACGCCAAGCGGCTGGACCGGACGCT ACACAT LRAHDRCRYPGCRW GTCGGTCGCGTGCATCCAGTCCAGCCGCCGCATCTGGAACCTGCACTGC GCAGCG KESLEHVLNHCPGTM GCCAAGCATCGGGCGCGGCAGCACCAGACGCCATCCCAGTCAAGGGGC CCCATTG DAVRGRHDGVLREIE AGGCGGGTGACGGAGACCGGGGGGGCTCCGTCGCGCACCGATCGCCG CACGCC HALRAPSGARRELRV CTAGACGGCAAAGTTCTGGCCCGCGTAGGCCGAAAGGGCCCCGCCCAG AAGCGG NQTVPGLPGPALRPD GTAGGGTAACGCCCTCGGGAAAACCATTCTGGTGTTTGGCTGTTTTTCA CATCGC IQVYNHDQRTVAVV ACAGTCGAACTTCAACTCGGAGCATATCAGATACACTTACTCACACATTT GGGACG DLAVAFDRQDRDDP AGATATCAGATAGGGAACCTTTATTAGGGAGATAACGGGTACACCGGA ACGCCT ETSGLAKAAAEKKAK TGGTAAATATACAAAACCTTCTCTGTTCTAATCAGTGTGAAAACTGGTTT CGGCCG YTGIQRHLERQGWK TCGCCTTTTGGCGGACTTTTTCACTCGCATTTTTGGGCAATCGTCTGCGG CTCAAG VHLSALVYGSLGSVA CTAGCTTGCTAGCGGCGGACGAGCGGTCTCCGGGGGCGTTCACCTTTCC CGCAGC PNNYKVYTEHLGLLK CCCGCGAGGCCAACTACACCGATCTTCTCTACACTTTTCTAATTCGCCTCC CCCGCC RDAKRLDRTLSVACI GTCTTCGGTCTTCGGTTGTCGGGCTTTTTTCTTTTTGACCAATCAGAGCG CTTCCAG QSSRRIWNLHCAKH CGCCATGCGACTCTTCTGGCCAATCAGAGACCGGGCCCTGTCCTCGGAC CACGAC RARQHQTPSQSRGR AGCGAGGCCTCCACGGCCAGCCAATCAAGTCTCGGCAGCGACGCGTCTT CTCGCG RVTETGGAPSRTDRR TCTATAGCGCAGCTGACGAGGCCGATCTGGCGGCCCCCGATTGGTCCGA CCGTTT (SEQIDNO:1446) CTTTCGGCCAATCAGCGACGACGAGGGGGCAGGGGTTTACACTTTTGCC GGCGGA CCCGTTTCGGCTTCAACTTCAGGCCAAAATGGCGATTTGGACCCTCCACG TCGCCAT CGCCGTGCCACTGCTCGGCACCGGCGGCGATTCAGCGGGTGCAACTTC CAAGAC GGGCACGTGTGCAACACATGCAGCGCCCATTGCACGCCAAGCGGCATC GTGCGA GCGGGACGACGCCTCGGCCGCTCAAGCGCAGCCCCGCCCTTCCAGCAC GAGCCA GACCTCGCGCCGTTTGGCGGATCGCCATCAAGACGTGCGAGAGCCAGG GGCGGG CGGGGTCGGGCAAAATATACTTACTCTAAGTATGCCCGAATCCCTGCCC GTCGGG TCTCAGGCTGTACGCGGCCCCATACTTGACCTAAGTATGGGAGGATCCC CAAAAT TGGCCTCTCAGGCTGTACGCGRGGACCAAGTACAGCCGAATCCCTGGCC ATACTTA TCTCAGCCTGTACGCGGGGCTATACTTGGTCTAAGTATGCCCCGGTCGC CTCTAA TGGCCTCTGAGCCCGTACGC(SEQIDNO:1078) GTATGC CCGAAT CCCTGCC CTCTCAG GCTGTA CGCGGC CCCATAC TTGACCT AAGTAT GGGAGG ATCCCTG GCCTCTC AGGCTG TACGCG RGGACC AAGTAC AGCCGA ATCCCTG GCCTCTC AGCCTG TACGCG GGGCTA TACTTG GTCTAA GTATGC CCCGGT CGCTGG CCTCTGA GCCCGT ACGC (SEQID NO: 1324) NeSL Utopia- Phytophthora TGCGCGGCAGACCAAGACGCGCAGCCAACAACAGACCGTGCAGCGTGC TGCGCGGC TAGACG MQDQVDAEQQARN 3_PR ramorum GGGTGGAAAGCGCCGCCGCCTGAACGCTGGTGACGATGAAGACCAGC AGACCAAG GCAACA RWGPPLPRPLVASRV GAGAGCTGGCCGAGCTCCTGCTCGTGGACGAGGACAAGGCTGGCGCCG ACGCGCAG CTCTGG AARLGEVPPPRWGP AACACCCCGCGCTCAGGCTGCCCACGGCCAGCGCTCATCCGGCCTCCGT CCAACAAC CCCGCG PLPRGVVVSRIAARLE CCTCTCCGTGTACGCGCACGCTGCAACTCGCTTCGACTGCACGCTGTGCA AGACCGTG TCGGCC AVPVPRGGPPLPRSF CGTACACGGCTGCCAGCCTCGCTTCGCTCAAGCGCCACCGCTCGTCTCG CAGCGTGC GAAAGG VATRIADRLAPPSPDL GCACCGACGCACGGCCTTCCTCGACAAGTTCTTGGCGGGCTGCGCGTGC GGGTGGA GCCCCA SLLDEEMKESEPPDP GGCACGCCCTTCGCATCGAGGTTGGCCGCAGCCAGACACGCGCAAGCG AAGCGCCG CCCACG THHSADEDSTDAETA TGCGCCAACCTCTGCACCACCTCGGCGACGACTTCGACGGCAGCAAAGG CCGCCTGA TAGGGA DAVMEPAFVSDPPT CATCAAGCCCCACTGCTGCCGGAGGCAGACCCACCGTCCGTGCAGTGGT ACGCTGGT ACCGCC ATPREWRLQFDGAC CACCGCCGCGCCCGACCTGCCCCGCCAGTATCCCTCGGAGCTCGTTGCG GACGATGA CTCGGG RGGPNPGGAGALLY TCCCCCCCGCAGCCGAGCTCCACCAACGTTGCAATGCAGGACCAGGTCG AGACCAGC AAACCC NPEGAVVWTGSHY ACGCAGAGCAGCAGGCGCGAAATCGCTGGGGCCCACCGCTCCCTCGAC GAGAGCT AGTCTG MPGAKETNNSAEYT CGCTGGTGGCCTCCCGGGTCGCAGCCCGACTCGGCGAGGTACCGCCGC GGCCGAG GTGTTC ALLIGARAAADHGAR CGCGATGGGGCCCACCACTTCCGCGAGGGGTGGTGGTGTCCCGTATTG CTCCTGCT GGCCAA QLRIEGDSLLVIRQVK CGGCCCGGCTGGAAGCCGTCCCAGTCCCACGCGGGGGACCCCCGCTCC CGTGGACG GAAATG GLYATKSTRLRQLRN CACGCAGCTTCGTCGCGACCAGGATCGCAGACCGCCTCGCTCCCCCCTC AGGACAA CACCAC AVRHELARVGQHSL CCCGGACCTGTCGCTGCTGGACGAGGAGATGAAGGAGAGCGAGCCGC GGCTGGC CACCAC HHIDRQGNAFADRL CTGACCCGACGCACCACTCAGCGGACGAGGATTCCACGGACGCAGAGA GCCGAACA GGCGGA ANRALDLKSDKVECK CGGCGGACGCAGTGATGGAGCCCGCGTTCGTCAGCGACCCACCCACGG CCCCGCGC GGTGCA EHPVAGACTTCMGS CGACACCCCGGGAGTGGCGCCTCCAATTCGATGGAGCTTGCCGTGGCG TCAGGCTG TTTCGAC PSAGPPATPPPTTADI GGCCCAACCCCGGTGGTGCTGGCGCACTGCTCTACAACCCCGAGGGTG CCCACGGC AGTCGA EMADAGSDDELRAD CTGTCGTGTGGACGGGTTCGCACTACATGCCGGGCGCCAAGGAGACGA CAGCGCTC ACTTCAA IDDGEVYAPMRLEPG ACAACTCGGCCGAGTACACGGCCCTCCTGATCGGCGCCCGCGCAGCAG ATCCGGCC CCCGCC VIPTRRSRLRLRQLTD CAGACCACGGTGCACGACAGCTGCGCATCGAGGGCGACAGCCTGCTGG TCCGTCCT ACATATC DEMEAAGEVVERLS TCATTCGTCAAGTCAAGGGATTGTACGCCACCAAGAGCACTCGGCTGCG CTCCGTGT GGATAT AGLSAKIADADDWE ACAGCTCCGGAACGCCGTTCGCCACGAGCTGGCTCGGGTGGGGCAGCA ACGCGCAC AGTTAC TAEGYITALPYMLYD CTCGCTCCACCACATCGACCGGCAAGGCAACGCGTTCGCGGACCGCCTC GCTGCAAC AGCTCT KLQQYTQVRHGTAR GCCAACCGGGCGCTCGACCTGAAGAGCGACAAAGTCGAGTGCAAGGAA TCGCTTCG AGTTAG SPAPHPQRRDVQGQ CACCCCGTGGCCGGCGCGTGCACGACATGCATGGGTTCGCCCTCGGCG ACTGCACG ACATCG VETHREPRHETIGQP GGCCCGCCAGCGACACCGCCCCCTACCACTGCCGACATCGAGATGGCG CTGTGCAC GATAGG DQPGEPSPTRRRRRG GACGCGGGGAGCGACGACGAGCTCCGAGCAGACATCGACGACGGTGA GTACACGG AACTTCT KRKGRRQRRHPRRT AGTGTATGCTCCGATGCGCTTAGAGCCCGGCGTGATCCCGACCCGCCGC CTGCCAGC TAGAAA NCGGGGRQQRKQR TCGCGCCTCCGCCTTCGCCAGCTGACCGACGATGAGATGGAGGCAGCT CTCGCTTC ATTAAC HPRPPRGTRHHREH GGTGAGGTAGTCGAGCGCTTGAGTGCGGGCCTGTCGGCCAAGATCGCG GCTCAAGC GGGTAT RIDEAIDELHALERAR GACGCCGACGACTGGGAGACGGCGGAAGGCTACATCACAGCCCTCCCG GCCACCGC ACCGGA PQARPAIAKARRRVG TACATGCTCTACGACAAGCTGCAGCAGTACACTCAGGTGCGGCATGGCA TCGTCTCG TGGTAA RIRSAIDQQLLRHRFD CGGCTCGTTCACCAGCGCCCCACCCACAGCGCCGAGACGTCCAAGGCCA GCACCGAC ATAAAA TAEKECVDGILAAAR GGTGGAAACCCACCGGGAGCCGCGGCACGAGACCATTGGTCAGCCAGA GCACGGCC TAAAAA TARDARTTVRAAAAT CCAGCCAGGTGAGCCGTCACCGACGCGCCGCCGCCGCCGAGGAAAGCG TTCCTCGA CTTC GTTATPETAVTSGTE TAAAGGCCGCCGCCAGCGACGTCATCCCCGACGGACCAATTGTGGTGG CAAGTTCT (SEQID QQDDNGTCPIPSEVL TGGTGGCCGTCAGCAGCGGAAGCAACGGCATCCACGCCCACCGCGCGG TGGCGGG NO: WRHFDSVNTPQRDF GACACGCCACCACAGGGAGCACCGCATCGACGAAGCCATCGACGAACT CTGCGCGT 1325) DPEAPEGAAFRSAM GCACGCGCTGGAGCGTGCCCGTCCCCAAGCTCGCCCAGCCATTGCCAAA GCGGCAC ARLPAATRFMELLKE GCTCGCCGCCGAGTTGGCCGGATCCGATCGGCGATCGATCAGCAGCTTC GCCCTTCG EPSTDGIEVQLQHAS TGCGCCACCGCTTCGACACCGCCGAGAAGGAGTGCGTGGATGGCATCT CATCGAGG STSSPGLDGVGYDVY TGGCAGCAGCTCGCACGGCACGCGACGCACGGACAACAGTGAGGGCG TTGGCCGC KRFASQLLPVLKAAFK GCCGCGGCAACAGGGACGACAGCGACTCCAGAGACGGCAGTGACGAG AGCCAGAC CCWTHKQVPQSWK CGGTACGGAGCAGCAGGACGATAACGGCACCTGCCCCATCCCGAGCGA ACGCGCAA LGVVRLLYKKGDRED GGTGCTGTGGAGACACTTCGACAGCGTCAATACGCCCCAGAGGGACTT GCGTGCGC PANWRPICLQQAIYK CGACCCGGAGGCTCCCGAGGGTGCTGCGTTCCGTTCTGCGATGGCCCG CAACCTCT IYTGVLARRLTRWQD GCTGCCAGCAGCCACGCGATTCATGGAGCTGCTGAAGGAAGAGCCGTC GCACCACC ANDRHAPGQKGFRP CACGGACGGCATCGAGGTCCAGCTGCAGCACGCGAGCAGCACCTCCAG TCGGCGAC VNGCGEHNFLAAML CCCGGGCCTCGACGGCGTCGGCTACGACGTGTACAAGCGCTTCGCGTC GACTTCGA IDHARRKHRPLYEVW GCAGCTGCTGCCGGTGCTGAAGGCGGCTTTCAAGTGCTGCTGGACGCA CGGCAGCA YDFRNAFGSVPLGLL CAAGCAGGTGCCCCAGAGCTGGAAGCTCGGCGTGGTGCGCCTCCTGTA AAGGCATC WDALERTGVPAEYIA CAAGAAGGGCGACCGTGAAGACCCGGCGAACTGGAGGCCCATCTGCCT AAGCCCCA AVQGLYDHAAFMVG CCAGCAAGCCATCTACAAGATCTACACGGGGGTGCTCGCGCGGCGCCTC CTGCTGCC NAVDGSTAPILQRVG ACCCGCTGGCAGGACGCCAACGACCGCCACGCCCCGGGCCAGAAGGGC GGAGGCA VFQGCPLSPPLFSAAI TTCCGGCCGGTGAATGGCTGCGGGGAACACAACTTCCTCGCGGCCATGC GACCCACC SPLLHALQLLPSSGVQ TCATCGACCACGCGCGCCGCAAGCACCGGCCACTGTACGAGGTGTGGT GTCCGTGC LSGDDRPGVSAYAD ACGACTTCCGGAACGCTTTCGGAAGCGTACCCCTCGGGCTGCTGTGGGA AGTGGTCA DLKTFSGTKAGVTEQ CGCCCTGGAGCGGACAGGCGTTCCCGCCGAGTACATCGCTGCCGTCCA CCGCCGCG HELVAMFLRWTGM GGGCTTGTACGACCACGCAGCGTTCATGGTGGGCAACGCGGTGGACGG CCCGACCT AANPAKCRSMGVRR CAGCACGGCGCCGATCCTGCAGCGTGTCGGAGTGTTCCAGGGGTGCCC GCCCCGCC NGNGAIEADHLELAL GCTGAGCCCACCGCTGTTCTCGGCGGCGATCAGCCCCCTCCTCCACGCG AGTATCCC DDTPIPTLTHLQSYTY TTGCAGCTACTGCCCAGCTCAGGAGTACAGCTCTCGGGTGACGACCGCC TCGGAGCT LGIGDGFDHVRRRVA CGGGTGTCTCCGCGTACGCCGACGACCTCAAGACCTTCAGCGGCACCAA CGTTGCGT LAPKLKLLKQDATAL GGCGGGCGTCACCGAGCAGCACGAGCTAGTGGCCATGTTCCTGCGATG CCCCCCCG MESGLAPWQVVKA GACCGGCATGGCGGCGAACCCGGCCAAGTGCAGGTCGATGGGCGTGC CAGCCGAG VKGYLYPRVEYALRH GCCGCAACGGCAACGGCGCCATCGAGGCGGACCACCTGGAGCTGGCGC CTCCACCA LRPDDQLLESFDLHLR TGGACGACACGCCGATCCCTACGTTGACGCACCTGCAGTCGTACACGTA ACGTTGCA RGLRHLLRLPKSANN CCTGGGGATCGGCGACGGCTTCGACCACGTGCGCCGCCGCGTCGCGCT (SEQID DFVYAPVSRGGLGFL CGCGCCCAAGCTCAAGTTGCTCAAGCAGGACGCGACGGCGCTGATGGA NO:1202) PLVELHAALQIAHGW GTCCGGCCTCGCGCCGTGGCAGGTGGTGAAGGCCGTGAAGGGGTACCT QMINSPDPAIRRIAR CTACCCTCGGGTGGAGTACGCCCTCCGGCACCTGCGACCCGACGACCAG EQLHQVADARHRLD CTCCTGGAGAGCTTCGACCTCCACCTCCGCCGCGGCCTCCGCCACCTGCT KDHWKQRGDELCEL CCGCCTCCCCAAGAGCGCCAACAACGACTTCGTGTACGCGCCGGTGTCG LLNGELGTSAHAPPK CGCGGGGGCCTGGGCTTCCTGCCGCTGGTGGAGCTGCACGCGGCGCTG RRNGDIGSLWVDVR CAGATTGCGCACGGGTGGCAGATGATCAACTCGCCCGACCCAGCCATCC KNLKAFGLKLATAPA GGCGCATCGCGCGCGAGCAGCTTCACCAGGTTGCCGACGCGCGGCACC DPESGAPAKPLQLCV GGCTCGACAAGGACCACTGGAAGCAGCGCGGTGACGAGCTGTGCGAG PHHAEWLDHRNVLR CTCCTCCTCAACGGGGAGCTCGGGACGTCGGCCCACGCGCCACCCAAAC HVKQHMKNKRWRA GGCGCAACGGCGACATTGGCTCGCTGTGGGTCGACGTCCGCAAGAATC WCSHVDQGRTARA TCAAGGCCTTCGGGCTGAAGCTCGCAACCGCGCCCGCCGACCCGGAGT HGGVGSGFLTRPRG CCGGCGCACCTGCCAAGCCGCTGCAGCTTTGCGTGCCGCACCACGCTGA MWESDYRFAVAARL GTGGCTGGACCACCGCAACGTGCTGCGGCACGTGAAGCAGCACATGAA NMLDTVNVLARRRL GAACAAGCGCTGGCGGGCCTGGTGCAGCCACGTGGACCAGGGGCGGA RAHDRCRHPGCRWK CGGCGCGCGCGCACGGAGGCGTCGGCAGCGGTTTCCTGACGAGACCGC ETLAHVLNHCPGTM GGGGCATGTGGGAGAGCGACTACCGCTTCGCGGTGGCCGCTCGCCTCA DSIRGRHDDALKEIER ACATGCTTGACACGGTGAACGTGCTCGCGCGCAGACGCCTCCGCGCCCA TLHASSGDRQGRTEL CGACCGTTGCCGTCACCCGGGCTGTCGGTGGAAGGAGACGTTGGCACA RTNQTVPGLAGPALR CGTCCTCAACCACTGCCCGGGTACGATGGACAGCATCCGCGGCCGCCAC PDLQVYNHDQRTVA GACGACGCGCTCAAGGAAATCGAGCGCACGCTCCACGCGTCGTCGGGC VVDLAIAFDEQPRDD GACCGCCAGGGCCGCACCGAGCTCCGCACGAACCAGACCGTGCCGGGC PESSGLAKAAAEKKA CTGGCCGGGCCAGCTCTGCGCCCCGACCTCCAGGTCTACAACCACGACC KYAGIKRHLERQGW AACGCACGGTCGCGGTGGTGGACCTGGCCATCGCGTTCGACGAGCAGC KVHLSALVYGSLGSV CGCGCGACGACCCAGAGAGCTCCGGGCTGGCGAAGGCGGCGGCCGAG APSNYKVYTEHLGLLK AAGAAGGCCAAGTATGCGGGCATCAAGCGACACCTCGAGCGCCAAGGG RDAKRLDRQLSVACI TGGAAGGTCCACCTCTCGGCGCTCGTGTACGGCTCGCTGGGCTCGGTGG QSSRRIWNLHCAQH CACCCAGCAACTACAAGGTGTACACCGAGCACCTGGGGCTGCTCAAGC RARQHQDQPAPRGR GGGACGCCAAGCGGCTGGACCGGCAGCTGTCGGTGGCGTGCATCCAGT RVTETGGTPSRTDRR CCAGCCGCCGCATCTGGAACCTGCACTGCGCCCAGCACCGCGCCAGGCA (SEQIDNO:1447) GCACCAAGACCAACCAGCCCCAAGGGGCAGGCGGGTGACGGAGACCG GGGGGACTCCGTCGCGCACCGATCGCCGCTAGACGGCAACACTCTGGC CCGCGTCGGCCGAAAGGGCCCCACCCACGTAGGGAACCGCCCTCGGGA AACCCAGTCTGGTGTTCGGCCAAGAAATGCACCACCACCACGGCGGAG GTGCATTTCGACAGTCGAACTTCAACCCGCCACATATCGGATATAGTTAC AGCTCTAGTTAGACATCGGATAGGAACTTCTTAGAAAATTAACGGGTAT ACCGGATGGTAAATAAAATAAAAACTTC(SEQIDNO:1079) NeSL Utopia- AATU01001281.1 Phytophthora AGCTCGGCCTCGCGGCTGCCTTCCCAGGCGCCGCCGACTTCGCGCTCTG AGCTCGGC TAAACG MLADPAALAAGLAR 4_PI infestans GCGCGGCCCACACGCCGCCGCCGAGCCTCCAAGCGCGCCCGTTGGCTTT CTCGCGGC GGTCAC APPPPSAPQDPSPAF CGCAGACGCAGGGCTCGCGGCGACGGCCCTCGCCAGCCCCCAAGACCC TGCCTTCC TTGACC PAGPAGQNPRAAAP CCCCTACGATGTGGCACCACCCGGCAGGGCGGCCGGCAGGCTGCCCGA CAGGCGCC GACAGG ARVEVHTVVAPPGR CTCGGTATCGCCGGGTGCTACACTCTCAGCCGCTACAGCTCGGGCCTTG GCCGACTT GCACCA AGGMLPDPGLVEEPI GCGGTCCGCTATTGGCCCTTGGAGCTCGACGGCGACAGCAGCGACGAC CGCGCTCT CCCAGG QATYAHDAAQFECA GAGGACGCTCAAGACCCCCACGCCGCCGCCCCAGAACCCCCAGAAGAC GGCGCGG TAGGGA LCPYVAESMAVLVQ GTCGCGAGTGTGCTTGCCCCACCCGGCAGGGCAGGCAGCATGCTCGCC CCCACACG ACCGCC HRRSAHRGTRFKDIF GACCCAGCAGCGCTGGCAGCCGGTCTGGCTCGCGCTCCGCCGCCCCCAT CCGCCGCC CTTTAAA TSGCQCSLVFYARIVA CGGCACCCCAGGACCCAAGCCCCGCCTTCCCTGCTGGTCCCGCCGGCCA GAGCCTCC ACCCAG ASHAVACARRNQRA GAACCCTCGCGCCGCCGCCCCAGCCCGTGTTGAAGTCCACACAGTCGTG AAGCGCGC GAAGAC VPPAPTPVAPTRPEA GCACCACCCGGCAGGGCGGGCGGCATGCTGCCCGACCCGGGGTTGGTT CCGTTGGC ACAAAC TPQPTGYLAAAMTA GAAGAACCTATCCAAGCGACGTACGCGCACGACGCTGCCCAGTTCGAG TTTCGCAG ACCCTCC AAAAASSDTVVAAAT TGTGCGTTGTGCCCGTACGTCGCTGAGAGCATGGCAGTGCTCGTTCAGC ACGCAGG ACATAG NMQSAVPAAAKTTG ACCGACGGTCGGCTCACCGCGGCACCCGCTTCAAGGACATCTTCACCAG GCTCGCGG TGACAT LQLVPPELEPALPQR CGGGTGCCAGTGCAGCCTAGTGTTTTACGCGCGCATTGTTGCCGCCAGC CGACGGCC ACATATT ASCHAGKRRRLNAD CATGCAGTGGCCTGTGCCAGGCGCAACCAGCGAGCCGTGCCACCAGCT CTCGCCAG TTAGCCT EAVTPCTPTARVSPQ CCGACCCCTGTTGCCCCCACGCGGCCCGAGGCTACCCCACAACCTACTG CCCCCAAG AGATTTC TEVAMAPHDAPQD GGTACCTCGCCGCGGCGATGACTGCCGCTGCGGCAGCCGCATCAAGCG ACCCCCCC AGTTAC DTVLQREAAEPQPDP ACACCGTAGTGGCAGCCGCCACCAACATGCAGAGCGCCGTGCCAGCCG TACGATGT GGAGAG AATQGAQVQRVEDT CCGCCAAGACTACTGGGCTTCAGCTCGTCCCCCCCGAGCTGGAGCCTGC GGCACCAC GTTACTA TAAQDDTVQQDHD CCTTCCGCAGCGTGCTTCTTGCCATGCAGGCAAACGCCGTCGGCTCAAT CCGGCAG ACTGGT ADTAQVSPPRRTPTR GCCGACGAGGCAGTGACACCGTGTACGCCTACGGCGCGGGTGTCGCCG GGCGGCC ACATAA WGPRPSSTQEPSPM CAAACAGAAGTGGCGATGGCGCCTCACGACGCACCCCAGGACGACACC GGCAGGCT AATTAC TGEPAATLAARRPLT GTCCTACAGCGTGAGGCCGCCGAGCCACAGCCAGACCCAGCAGCAACG GCCCGACT ACATTCT PAATGTRATRWGPC CAGGGTGCCCAAGTGCAGCGAGTCGAAGACACAACTGCGGCGCAGGAT CGGTATCG GTTCTAA HRAIGAAAIARLVTGL GACACTGTGCAGCAGGACCACGACGCCGACACTGCCCAGGTCTCGCCG CCGGGTGC TCAGTG PTEPAQPQRRQPPPP CCCCGGCGCACACCGACACGCTGGGGACCAAGACCAAGCTCAACTCAA TACACTCT TGAAAA QEPPLQPEPQAAAAT GAGCCTAGCCCCATGACAGGCGAGCCGGCTGCCACCCTGGCAGCGCGA CAGCCGCT CTGGTTT VAADIAATVAADIAA AGACCACTGACTCCTGCCGCAACGGGCACTCGTGCCACGCGGTGGGGG ACAGCTCG TCGCCTT AAANAAMDVDGGP CCATGTCACCGCGCGATCGGGGCCGCAGCCATCGCACGACTTGTCACGG GGCCTTGG TTGGCG AADETWLLRFDGAC GCCTTCCAACGGAGCCAGCGCAGCCTCAGCGACGGCAGCCACCGCCAC CGGTCCGC GACTTTT RRNPGPGGAGAALF CACAAGAACCTCCTTTGCAGCCAGAGCCACAAGCAGCAGCAGCAACAGT TATTGGCC TCACTCG APSGAVVWTCSHFM CGCAGCAGACATCGCAGCAACAGTAGCAGCAGACATTGCGGCAGCAGC CTTGGAGC CATTTTT PSRSETNNTAEYTALL AGCGAACGCAGCAATGGACGTGGATGGTGGACCGGCAGCCGACGAGA TCGACGGC GGGCAA LGAQSAVHHGAKRL CATGGCTGTTGCGCTTTGACGGCGCATGCCGACGGAACCCAGGACCTG GACAGCA TCGTCT NIEGDSHLILSQVRGA GTGGAGCAGGTGCCGCACTCTTTGCGCCCAGCGGTGCCGTAGTATGGA GCGACGAC GCGGCT FACNNKRLRSLRNRV CGTGCTCGCACTTCATGCCGAGCCGCTCCGAAACGAACAACACCGCCGA GAGGACG AGCTTG QASLRQLDWYRLQHI GTACACGGCGCTACTGCTCGGCGCACAGAGCGCAGTCCACCACGGCGC CTCAAGAC CTAGCG DRKANQHADRLANR CAAACGCTTGAATATCGAGGGCGATAGCCACCTGATCCTCTCGCAAGTG CCCCACGC GCGGAC ALDLRRTVTECGPHA CGCGGTGCCTTCGCGTGCAACAACAAGCGGTTGCGGTCTCTCCGAAACC CGCCGCCC GAGCGG ETRNRCFQTPQPLVE GAGTCCAGGCCTCTCTTCGGCAGCTCGACTGGTACCGCTTGCAGCACAT CAGAACCC TCTCCG PGETHCVPGSDEVLA CGACCGCAAGGCCAACCAGCATGCCGACCGCCTCGCGAACCGCGCCCT CCAGAAGA GGGGCG ANTAMEDATAVPTE GGACCTGCGCCGCACGGTGACCGAGTGCGGCCCACACGCGGAGACTCG CGTCGCGA TACACCT DDEAEVAARDGGEV CAATCGGTGCTTTCAGACACCGCAGCCCCTCGTCGAGCCCGGCGAGACG GTGTGCTT TTCCCCC FPTIAIGPDSAPAKQP CATTGTGTACCGGGGTCTGATGAGGTCCTCGCTGCTAATACCGCGATGG GCCCCACC GCGAGG RLRLKKLDEDDFDAA AGGATGCGACCGCTGTGCCTACCGAAGACGACGAGGCTGAGGTTGCAG CGGCAGG CCAACT AAAVTRVSEELASKIV CGCGAGACGGGGGTGAGGTGTTTCCAACCATCGCCATCGGGCCTGACT GCAGGCA ACACCG DAGDWTSGEGYISAI CCGCCCCTGCCAAGCAACCACGGCTTCGCCTCAAGAAGCTGGACGAAGA GC(SEQID ATCTTCT PERLRAALRPFALPTQ CGATTTCGATGCGGCTGCCGCCGCCGTGACGCGTGTGTCCGAGGAGCT NO:1203) CTACACT PARPQPREPRMQQP GGCAAGCAAGATCGTGGACGCTGGCGACTGGACATCAGGTGAAGGCTA TTTCTAA PRRPPRVTRDHLEHR CATCAGCGCAATTCCTGAACGACTTCGAGCAGCGCTCCGGCCATTTGCC TTCGCCT LDEALDTMENVQRS CTTCCGACGCAGCCAGCACGACCACAGCCCCGGGAGCCTCGTATGCAGC TCGTCTT TPQNQKAVRRARRR AGCCACCCCGCCGGCCACCCCGCGTCACACGCGACCACCTGGAGCACAG CGGTCT VGRLRSAMDRTRLRK GCTTGACGAGGCGCTCGACACCATGGAGAATGTGCAGCGCTCCACGCC TCGGCT KFATHERECVAEILRR GCAAAACCAGAAGGCAGTGCGCAGGGCCCGGCGCCGTGTCGGGCGGC GTCGGA ASTEEAANPSQEKCPI TTCGTTCGGCGATGGACCGGACCAGACTGCGGAAGAAGTTCGCCACAC TTTTTTT DRATLHEYFTATSTQ ACGAACGGGAGTGCGTGGCCGAGATCCTGCGACGTGCTTCAACGGAGG CTTTTTG RTPFDYDSAKGTEFR AGGCGGCGAACCCCAGTCAGGAGAAATGTCCTATCGACCGCGCCACCC ACCAAT TFLEVMSTPSHETSAL TCCACGAGTACTTTACCGCCACCAGCACCCAGCGGACACCATTCGACTAC CAGAGT TAEPTLDEIEDQLAH GACAGCGCCAAAGGCACAGAGTTCCGCACGTTCCTGGAGGTGATGTCG GCGCCA VKAGSSPGHDGVGY ACGCCTAGCCACGAGACAAGCGCGTTGACCGCAGAGCCTACGCTGGAC TGCGAC DVYRRFQVQLLPLLH GAGATCGAAGACCAGCTCGCCCACGTCAAAGCCGGCTCAAGTCCAGGG TCTTCTG AAFRFCWRHRRVPA CATGATGGGGTCGGGTACGACGTGTACCGCCGCTTCCAGGTTCAGCTGC GCCAAT LWKVGFVRLLHKKG TGCCCCTCCTGCATGCAGCCTTCCGGTTTTGCTGGCGCCACCGCCGCGTG CAGAGA DPQQPNNWRPICLQ CCTGCCCTGTGGAAGGTCGGCTTCGTTCGCCTGCTGCACAAGAAAGGTG CCGGGC TAIYKLYSGLLARRLSK ACCCGCAGCAGCCAAACAACTGGCGACCAATCTGCCTACAGACCGCCAT CCTGTCC FLEANELLPMAQKGF CTACAAGCTCTACAGCGGCCTCCTCGCCCGTCGACTGTCCAAGTTCCTGG TCGGAC RAFNGCHEHNFVAT AAGCCAACGAGCTGTTGCCCATGGCACAGAAGGGCTTCCGAGCGTTCA AGCGAG TLLDQTRRMHRRLY ACGGTTGCCACGAGCACAACTTCGTGGCAACCACGCTCCTGGACCAGAC GCCTCC QVWYDLRNAFGSLP GCGCCGCATGCATCGCAGGCTGTACCAGGTCTGGTACGACCTGCGCAAT ACGGCC QQLMWGVLRQLGV GCTTTCGGGTCGCTGCCGCAGCAGCTCATGTGGGGCGTCTTACGCCAGC AGCCAA TEEFVARCSGIYEDSY TTGGGGTGACAGAGGAGTTTGTGGCTCGCTGCAGTGGCATCTACGAGG TCAAGT FVVGNASDGATEPV ACTCATACTTCGTGGTCGGCAACGCGAGCGACGGAGCGACGGAGCCAG CTCGGC RQEVGVYQGCPLSPL TGCGCCAGGAGGTGGGAGTATACCAGGGCTGCCCGCTCAGCCCCCTCCT AGCGAC LFITALVPLLRALENQ CTTTATCACTGCGCTCGTGCCACTCCTCCGGGCGCTAGAGAACCAGGAT GCGTCT DGVGVPLADGVRPC GGCGTTGGCGTCCCGCTCGCAGACGGAGTACGCCCGTGCGCCACCGCG TTCTATA ATAYADDIKVFCDSA TACGCGGACGACATCAAGGTCTTCTGTGACAGCGCGACGGGCATCCAG GCGCAG TGIQRCHALVTRFLE CGCTGCCACGCCCTCGTGACGCGCTTCCTGGAGTGGACTGGGCTGCAG CTGACG WTGLQANPAKCAFL GCCAACCCGGCCAAGTGCGCGTTCCTGCCGGTGACACGCAGCCAGCAC AGGCCG PVTRSQHSNPTRDRD AGCAACCCCACACGCGACCGAGACATCGAGCTTCGCATCCACGGCGAG ATCTGG IELRIHGEAIATLGLQE GCCATCGCCACCCTCGGACTTCAAGAAAGCTACCGGTACCTGGGGGTTG CGGCCC SYRYLGVGDGFDHVR GCGACGGCTTCGATCACGTGCGACACCGCCTCCAGCTCGAACCCAAGCT CCGATT HRLQLEPKLKQIKREA GAAGCAGATCAAGCGAGAGGCGGTGGCACTGCTGCACTCGGAGCTGGT GGTCCG VALLHSELVPWQILK GCCGTGGCAGATTCTCAAGGCTCTGAAGGTGTACATCTACCCGAAGGTA ACTTTCG ALKVYIYPKVEYALRH GAGTACGCCCTCAGACACCTGCGGCCGCTCAAGTCACAGCTTCAGGGCT GCCAAT LRPLKSQLQGFDSAIV TCGACAGCGCCATCGTGCGCGGCCTGCGCCACCTGCTCCGCCTGCCGGA CAGCGA RGLRHLLRLPENSHD GAACAGCCACGACGGCCTCTTCTTCTCACCCACCTCGGCTGGGGGACTG CGACGA GLFFSPTSAGGLGLLS GGACTGCTGTCGCTCGTCGAGCTGCACGAAGCGCTGCAGGTGGCACAC GGGGGC LVELHEALQVAHAW GCGTGGCAAATGCTCCACTCCAAGGACCCGGCCATCCGTGCCATCGCGC AGGAGT QMLHSKDPAIRAIAR GCACGCAGGTGGGACAAGTGGCGCGCAAGCGGTTCAAGCTGGTCGAG TTACACT TQVGQVARKRFKLVE GAGCACTGGCGCGGACGGGAAGACGACCTCGCACAGCGCTTCCTCAAC TTTGCCC EHWRGREDDLAQRF ACGGAGCTCGCCGCCTCGCCCCACGCCACGGAGACACGCCGCAACGGC CCGTTTC LNTELAASPHATETR GACATCGGTTCCCTATGGAACGACGTCCGCGACACGCTGCAGACCCTCG GACATC RNGDIGSLWNDVRD GACTCAAGTTCGCCGCAGGCGACGAGGAGGAAGCACCTGGACTACTGC AACTTCA TLQTLGLKFAAGDEE AGCTCCGCGTGCCGCACCACACCAAGTGGCTCAGCCACTCGACGGTGTT GGCCAA EAPGLLQLRVPHHTK GAGGCACGTGAAGCTGCACATGAAACTGCGCCGCATGGACACGTGGAA AATGGC WLSHSTVLRHVKLH GTCCAAGGTCAGTCAGGGCACGACGGTGCGAGAGCACGGCGGCGTGG GATTTC MKLRRMDTWKSKVS GCTCACGCTTCATCACGGCGGGTGCAGGCCTGTCCGACGCCGAGTATCG GACCCT QGTTVREHGGVGSR CTTCGCTATTGCGCCCCGCGCCCACCTGATTGATACCAACAGCACCCTCA CCACGC FITAGAGLSDAEYRFA AGCGCCGTCGACTCCGGGCCAACGACACCTGCCGGGCTCCGGGATGCT GCCGTG IAPRAHLIDTNSTLKR CCTACACGGAACCCCCCGCCCACATCCTGAACAAATGCAGTCCCAACAT CCACTG RRLRANDTCRAPGCS GGACGCGATCAGGAAACGCCACGACGATGCTCTCGAGCGTATCGCTGA CTCGGC YTEPPAHILNKCSPN CGCCCTGCGGCGCAAGGTGGAGAAGTCCGGCGGCAGGCTCGAGGTGG ACCGGC MDAIRKRHDDALERI CCATCAACAAGACGGTACCGGAGTACGACGGCGCGGCCCTGCGCCCCG GGCGAT ADALRRKVEKSGGRL ACATCGTACTCCGGAACACGGAGACCAAGCGTGCCATCATCGCCGACCT TCAGCG EVAINKTVPEYDGAA GGCGATCACCCACGAGAACCAGCCGACCGACGCGACCACGTCTTCAGC GGTGCA LRPDIVLRNTETKRAII GCTACAGCAAAGCCGTGACAACAAGATTACCAAGTACCAGACGGTCGC ACTTCG ADLAITHENQPTDAT GGCGGCAATGATGCGCGCCGGTTGGCGGGTTCGCGTGACGGGCATCGT GGCACG TSSALQQSRDNKITKY CTACGGCTCGCTCGGCTCAGTCCTGCCGAGCAACTTCAAGGTCTACACC TGTGCA QTVAAAMMRAGW GAGCTGCTGGCGCTGCTCAAGCGGGACGCCCGCCGCCTTAACCGCCAG ACACAT RVRVTGIVYGSLGSVL CTCTCCAGCCACTGCATCCGGGCCAGCGCGCGGATATGGAGCGCGCACT GCAGCG PSNFKVYTELLALLKR GCCGACGACACCGTGAGCGGCAACGAAGCGGCAATGCCTCGCGAGCGT CCCATTG DARRLNRQLSSHCIR CGCGTGGGTCCGGGGGGGCCCCGCGGCGCACATCGCAGGCATCGGCA CACGCC ASARIWSAHCRRHRE CGGCGGTAAACGGGTCACTTGACCGACAGGGCACCACCCAGGTAGGGA AAGCGG RQRSGNASRASRGS ACCGCCCTTTAAAACCCAGGAAGACACAAACACCCTCCACATAGTGACA CATCGC GGAPRRTSQASARR TACATATTTTAGCCTAGATTTCAGTTACGGAGAGGTTACTAACTGGTACA GGGACG (SEQIDNO:1448) TAAAATTACACATTCTGTTCTAATCAGTGTGAAAACTGGTTTTCGCCTTTT ACGCCT GGCGGACTTTTTCACTCGCATTTTTGGGCAATCGTCTGCGGCTAGCTTGC CGGCCG TAGCGGCGGACGAGCGGTCTCCGGGGGCGTACACCTTTCCCCCGCGAG CTCAAG GCCAACTACACCGATCTTCTCTACACTTTTCTAATTCGCCTTCGTCTTCGG CGCAGC TCTTCGGCTGTCGGATTTTTTTCTTTTTGACCAATCAGAGTGCGCCATGC CCCGCC GACTCTTCTGGCCAATCAGAGACCGGGCCCTGTCCTCGGACAGCGAGGC CTTCCAG CTCCACGGCCAGCCAATCAAGTCTCGGCAGCGACGCGTCTTTCTATAGC CACGAC GCAGCTGACGAGGCCGATCTGGCGGCCCCCGATTGGTCCGACTTTCGGC CTCGCG CAATCAGCGACGACGAGGGGGCAGGAGTTTACACTTTTGCCCCCGTTTC CCGTTT GACATCAACTTCAGGCCAAAATGGCGATTTCGACCCTCCACGCGCCGTG GGCGGA CCACTGCTCGGCACCGGCGGCGATTCAGCGGGTGCAACTTCGGGCACG TCGCCAT TGTGCAACACATGCAGCGCCCATTGCACGCCAAGCGGCATCGCGGGAC CAAGAC GACGCCTCGGCCGCTCAAGCGCAGCCCCGCCCTTCCAGCACGACCTCGC GTGCGA GCCGTTTGGCGGATCGCCATCAAGACGTGCGAGAGCCAGGCGGGGTCG GAGCCA GGCAAAATATACTTACTCTAAGTATGCCCGAATCCCTGCCCTCTCAGGCT GGCGGG GTACGCGGCCCCATACTTGATCTAAGTATGGGAGGATCCCTGGCCTCTC GTCGGG AGGCTGTACGCGAGACCCGTACGGCCGAATCCCTGGCTTCTCAGCCTGT CAAAAT ACGCGGGGCTATACTTGGTCTAAGTATGCCCCGGTCGCTGGCCTCTGAG ATACTTA CCCGTACACA(SEQIDNO:1080) CTCTAA GTATGC CCGAAT CCCTGCC CTCTCAG GCTGTA CGCGGC CCCATAC TTGATCT AAGTAT GGGAGG ATCCCTG GCCTCTC AGGCTG TACGCG AGACCC GTACGG CCGAAT CCCTGG CTTCTCA GCCTGT ACGCGG GGCTAT ACTTGG TCTAAGT ATGCCC CGGTCG CTGGCC TCTGAG CCCGTA CACA (SEQID NO: 1326) NeSL Utopia- Phytophthora CCAGGAATCACCCCCGCCGCCCCCAAGCGCTCCGCCGCCGAGCCCAGCT CCAGGAAT TAGACG MSGDVVSSDGSSRT 4_PR ramorum GCGGCTGCTGCCGCCCCACTGGTGACGGCAGTCGCAGCTGCCCCTGCCA CACCCCCG GCACAC TDASGDGDDGAGSS CCTCTCCTGTTGGCCGCGCCGCCGTGGAGCCCCGCGCGCGCGCCGAGCC CCGCCCCC AGGCCC DAAGDVGVVAMDV GCCCCAAGAACAAGCACCCCCAGCTAGCGCGCGCGTGGAGCCCATGTC AAGCGCTC ACAGCG DQGARRQQPPWQR GGGCGACGTGGTCAGCAGCGACGGCAGCAGCAGGACGACGGACGCGA CGCCGCCG GCCGAC VGGKRRRLNDVDDE GCGGCGACGGCGACGACGGCGCAGGCAGCAGCGATGCAGCAGGCGAT AGCCCAGC AGGGCC DTRELAELLLEEEDEA GTCGGCGTCGTCGCCATGGATGTTGACCAAGGTGCCCGCCGACAGCAG TGCGGCTG ACACCC GDHAPAPRLSAASAR CCGCCATGGCAGCGCGTCGGCGGCAAGCGTCGACGTCTCAACGACGTC CTGCCGCC AGGTAG PASVLSVYAHNAQRF GACGACGAAGACACCCGCGAGCTGGCCGAGCTCCTGCTCGAGGAGGA CCACTGGT GGAACC QCTLCTYTAASFASLK GGACGAGGCTGGGGACCACGCTCCAGCGCCTAGGCTATCCGCGGCCAG GACGGCA GCCTTCA RHRDSRHRRTAFLDR CGCTCGCCCGGCCTCCGTCCTCTCCGTGTACGCGCACAATGCGCAGCGC GTCGCAGC AACCCG FSAGCACGVPFASRL TTCCAGTGCACGCTGTGCACGTACACGGCCGCCAGCTTCGCGTCGCTCA TGCCCCTG GCCGGT AAANHAHACDSLNR AGCGACACCGGGACTCCCGGCACCGCCGCACCGCCTTCCTCGACCGGTT CCACCTCT ACATTAT TFSVAAAPAAGELSP CTCGGCGGGATGTGCGTGCGGCGTGCCCTTCGCGTCGAGGCTGGCTGC CCTGTTGG GGTCCG TAGAANATVKAATV TGCGAACCACGCACACGCGTGCGACAGCCTCAACCGCACCTTCTCGGTA CCGCGCCG ACACCT TPDSPRQDPPKLAAT GCGGCTGCGCCGGCAGCAGGGGAATTGAGCCCCACTGCTGGTGCAGCC CCGTGGAG ATGAGG PPLASSALVVDPDHA AACGCCACTGTCAAGGCGGCCACTGTCACGCCCGACTCCCCCCGCCAAG CCCCGCGC TGCAAC EQQARERWGPPLPR ACCCCCCGAAGCTCGCAGCGACCCCCCCGCTAGCGAGCTCCGCTCTTGT GCGCGCCG CGGTAC TLVAGRVAARLSEVP CGTCGACCCGGACCACGCCGAGCAACAAGCCCGCGAGCGCTGGGGCCC AGCCGCCC ACAAGT APRWGPPLPRGVVA GCCCCTACCCAGGACGCTGGTGGCAGGCCGTGTTGCAGCCCGCCTCAGC CAAGAACA TACACA FRIGHRVLPPEMTSD GAGGTTCCAGCTCCCCGCTGGGGACCCCCACTGCCACGCGGCGTGGTG AGCACCCC CCACAT EETKDDSSVQDGDR GCATTCCGGATCGGCCACCGCGTGCTCCCGCCGGAGATGACGAGCGAC CAGCTAGC AGCGAC QDYPVAAMDVDSG GAGGAGACGAAGGACGACTCGAGTGTCCAGGACGGCGACCGCCAGGA GCGCGCGT TACCAG MSGEWLLRFDGACR CTACCCGGTGGCCGCGATGGACGTGGACAGCGGCATGAGCGGCGAGT GGAGCCC GTATTTA ANPGPGGAGAALSQ GGCTGCTGCGCTTCGACGGCGCCTGCCGCGCCAACCCTGGCCCAGGTG (SEQID CTACCTG PDGSVVWTCSHYMP GCGCCGGCGCAGCGCTGTCCCAGCCCGACGGCTCCGTTGTATGGACTTG NO:1204) GAAGCC SSSETNNTAEYTALLL CTCCCACTACATGCCGAGCAGCAGCGAAACCAACAACACCGCCGAATAC AAGGAT GTRAAADHGTTTLRV ACGGCGCTGCTGCTCGGTACGCGAGCCGCCGCCGACCACGGAACCACG TAACCG EGDSTLVIQQVRGIF ACCCTGCGAGTGGAGGGCGACAGCACCTTGGTGATCCAGCAGGTGCGT GTCGGT ATRSVTLRHLRDQVK GGCATCTTTGCCACGCGGAGCGTCACTCTCCGCCACCTCCGCGACCAAG AATACA LELARVGRFSLHHIDR TCAAGCTGGAACTCGCCCGGGTGGGGAGATTTTCGCTCCACCACATTGA CATAACT QANAHADRLANRAL TCGCCAAGCCAATGCCCACGCCGACCGCCTCGCCAACCGAGCCCTCGAC TT(SEQ DLRRTVSECGVHPDG CTCCGGCGGACAGTGTCGGAGTGCGGCGTCCACCCCGACGGGAACGGC IDNO: NGCTPTAIDDRPLAP TGCACGCCCACGGCCATCGACGACCGACCCCTAGCGCCGACTCAGCAGC 1327) TQQPPDAPPPPPAA CACCTGACGCCCCGCCACCGCCGCCCGCCGCCGACATTGAGATGGAGG DIEMEDPDDEDLADI ACCCGGACGACGAGGACCTCGCCGACATCGACGACGGCGAGGTCTACG DDGEVYAAMRVGP CGGCCATGCGCGTTGGCCCGAATGCCACTCCGCAGCGACGACGCCGTG NATPQRRRRGRSGT GACGCTCAGGCACCGCGAAGAAGCACCGCCGCCAGCGCCCACCCCGGG AKKHRRQRPPRVTR TGACCCGACACCACCGCGAACACCGCCTCGACGAGGCACTCGATGACCT HHREHRLDEALDDLH CCATGCAGTGGAGCGCTCCACCCCCAGCGACCGCACCACCGTCCGCCGG AVERSTPSDRTTVRR GCACGACGCCGCGTTGGACGCGTCAATTCGGCCATCGAACAGCAGCGC ARRRVGRVNSAIEQQ CTCCGCCACCGCTTCGACACGGACGAGAAGGCATGTGTCACCGACATCC RLRHRFDTDEKACVT TGGCGAAGGCGTGCGCGACGCGAGAGGCAGCGAGGACCACGGCCTCC DILAKACATREAARTT GGCGGTGATCCACCTGCAGGCCCGGCGACGCCAGCGGCAGGCAGCGC ASGGDPPAGPATPA AGACGACGGCACCTGCCCCATCTTGGGAGAAGAACTCTGGCGGTTTTTC AGSADDGTCPILGEE GACAGCGTCAACACGCCGCGCCAAGAATTCGCCCCGGACGCCCCGGTG LWRFFDSVNTPRQEF GGTGCCGCCTTCCGCTCGGCGTTGGCACGCCTACCGGCCGCGACCAGTT APDAPVGAAFRSALA GCAAGGAGCTGCTGACTGCGGCGCCGTCGGCCGGCGAAGTGGAAGAC RLPAATSCKELLTAAP CAGCTACAACACGTCCGTGGCGCCTCCAGCCCTGGACTCGACGGCGTCG SAGEVEDQLQHVRG GGTACGACGTGTACCAGCACTTCGCGGCGCAGCTCCTGCCGGCACTGAC ASSPGLDGVGYDVY CGCGGCATTCAAGGCATGCTGGACGGCGAAGAGAGTGCCTCAGAGCTG QHFAAQLLPALTAAF GAAGCTCGGGGTGGTGCGGCTGCTCCACAAGAAGGGCGCGCGCGAGG KACWTAKRVPQSWK ACCCGGCAAACTGGAGGCCGATCTGCCTCCAGCAAGCCATTTACAAGCT LGVVRLLHKKGARED CTACACGGGACTCCTCGCGCGGCGGCTGGTGCGTTGGCTAGACGCCAA PANWRPICLQQAIYK CGACCGCCACGCTCCAGGCCAGAAGGGCTTCCGAGCGGTGAACGGCTG LYTGLLARRLVRWLD CGGGGAGCACAACTTCCTCGCAGCCACGCTCGTCGACCAAGCCCGCCGC ANDRHAPGQKGFRA AAGCGCCGCACCCTCTTCGAAGTCTGGTACGATTTTCGCAACGCGTTCG VNGCGEHNFLAATLV GCAGCGTGCCATTCGCGCTGCTGTGGGACGCGCTGGCGCGGCTCGGCG DQARRKRRTLFEVW TGCCCGACGACTACGTTACCATGTGCAAGGGGCTCTACGAGAGCGCGG YDFRNAFGSVPFALL CGTTTGTCGTCGGGAACGCTATCGACGGCACGACGGACCCAATTGCGCT WDALARLGVPDDYV GCGGGTGGGCGTCTTCCAAGGGTGCCCACTGAGCCCCCAACTCTTCAAC TMCKGLYESAAFVVG GCTGCCATTAGCCCGCTGCTCTTCGCGTTGCAGCGGCTGCCGGCCACGG NAIDGTTDPIALRVG GCGTGCAGCTGTCGGGCGATGACTGCCCTGGTGCCTCGGCTTATGCCGA VFQGCPLSPQLFNAA CGACCTCAAGATTTTCAGTGGCACCGAGGACGGCATCAAGCGACAGCA ISPLLFALQRLPATGV CGCGCTCGTGGCCGACTTCTTGCGCTGGACCGGCATGGCGGCGAACCCC QLSGDDCPGASAYA AACAAGTGCTGCACCATGTCGGTCCAGCGCGACGGACGCGGCGTGCTC DDLKIFSGTEDGIKRQ AAGACTGACGACCTCCAGCTAGACCTGGCTGGCACGCCGATCCCCGCGC HALVADFLRWTGMA TGTCCATGAGCGCTTCGTACACCTACCTGGGGATCGGCGATGGATTCGA ANPNKCCTMSVQRD TCACGTGCGCCGCCGCGTCGAGCTCGCTCCAGCACTCAAGCAGCTTAAG GRGVLKTDDLQLDLA GACGACGCGACGACGCTGCTGCAGTCGGGCCTCGCCCCGTGGCAGGTG GTPIPALSMSASYTYL GTGAAGGCGGTGAAGACGTACCTCTACCCCCGGGTGGAGTACGCTCTC GIGDGFDHVRRRVEL CGCCACCTTCGCCCCTTCCAACAGCAGCTTGAGGGCTTCGACCGGCACC APALKQLKDDATTLL TCGCGCGCGGCCTCCGCCACCTCCTGCGACTGCCGGGCAACGCCACGGC QSGLAPWQVVKAVK GGAGTGCTTCTACGCTCCGGTGTCGCGCGGCGGGCTGGGCCTGCTGCC TYLYPRVEYALRHLRP ACTGACGGAGCTGCACGCGGCGCTCCAAGTCGCGCACGGATGGCAGCT FQQQLEGFDRHLAR GCTGAATTCGAAGGACCCTGCCATCCGGCGCATCGCGCGCGTGCAGCTC GLRHLLRLPGNATAE CGGCAGATCGCCGACGCGCGCCACCGTATCGACAGCCGCGCATGGGAG CFYAPVSRGGLGLLPL GGGCGCGACGAGGAGTTGTGCGAGCTGCTGCTCAACTCGCAGCTCGGG TELHAALQVAHGWQ ACGTCGCCCGACGCGCCACCGAAGCGTCGGAACGGCGACATCGGTTCT LLNSKDPAIRRIARVQ CTTTGGGTCGACGTCCAGCGCCACCTCCGCACGCTCGGCCTCAAGTTCG LRQIADARHRIDSRA CGACCGCGCCGGCCTGCGCGGACGCCGGCTCTGCGGCGACCACGTTGC WEGRDEELCELLLNS AGCTCCGCGTGCCGCACCACGATAAGTGGTTGGACCATCGCACTGTCCT QLGTSPDAPPKRRN GCGGCACGTGAAACTGCACGTGAAGCACCGGCACTGGTCCAAGTGGGC GDIGSLWVDVQRHL GGCCATGCGCGACCAAGGCAAGACGGCCCGCGCCCATGGCGGCGCGG RTLGLKFATAPACAD GGAGCGGCTTCCTCACCCGGCCACGAGGCATGTGGGAAGCCGACTACC AGSAATTLQLRVPHH GCTTCGCGGTGGCCGCCCGCCTGAACCAGCTCGACACGCACAGCGTGCT DKWLDHRTVLRHVK GAAGCGCCGGCGTCTCCGAGCCCACGACCACTGCAGGCAGCCGGGCTG LHVKHRHWSKWAA CTCCAGGGCGGAGACGCTGGCGCACGTGCTCAACCACTGCGCCGGCAC MRDQGKTARAHGG CATGGACGCGGTCCGCGGCCGCCACGACGACGCCCTCAAGCACATCGA AGSGFLTRPRGMWE GCGCGCGCTGCACGCGTCGTCGCCGGGCGGCCAGGACCGCGTCGAGCT ADYRFAVAARLNQL CCGGGTGAACCAGACGGTGCCGTCGCTCGCCGGCCCCGCGCTACGGCC DTHSVLKRRRLRAHD CGACCTCCAGCTGTACAACCACACCAAGAAGATGGTGGCGGTGGTCGA HCRQPGCSRAETLAH CCTGGCCGTGGCGTTCGAGGAGCAGGCGAGCGACGACCCGGAGAGCT VLNHCAGTMDAVRG CGGCGCTGGCACGCATCGCCGCACACAAGCGCGCGAAGTATGCCGGCG RHDDALKHIERALHA TCAAGCGACACCTCGAGCGCCAAGGGTGGAAGGTCCACCTCTCGGCGC SSPGGQDRVELRVN TCGTGTACGGCTCGCTCGGCGCGGTGCCGGCCGGCAACCACAAGGTGC QTVPSLAGPALRPDL TTACTGAGCACCTTGGGTTGCTCAAGCGTGATGCGAAGCGGCTGGACC QLYNHTKKMVAVVD GGCAGCTCTCGGTGGCGTGCATCCAGTCCAGCCGCCGCATCTGGAACCT LAVAFEEQASDDPES GCACTGCAGCCAGCACCGCGCGCGCCAGCACCAAGCACCAGGGGGAAG SALARIAAHKRAKYA TCGGGCGGCGGAGACCGGGGGGACTCCGCCGCGCACCGGCCGCCGCT GVKRHLERQGWKVH AGACGGCACACAGGCCCACAGCGGCCGACAGGGCCACACCCAGGTAG LSALVYGSLGAVPAG GGAACCGCCTTCAAACCCGGCCGGTACATTATGGTCCGACACCTATGAG NHKVLTEHLGLLKRD GTGCAACCGGTACACAAGTTACACACCACATAGCGACTACCAGGTATTT AKRLDRQLSVACIQS ACTACCTGGAAGCCAAGGATTAACCGGTCGGTAATACACATAACTTT SRRIWNLHCSQHRA (SEQIDNO:1081) RQHQAPGGSRAAET GGTPPRTGRR(SEQ IDNO:1449) NeSL Utopia- Phytophthora CTCAAGCCTAGCAGCGGCTACCGCAGCTACTCCAGCTCCGGTAGCTGCT CTCAAGCC TGACGC MARVLVEPMQVDEC 5_PI infestans ACTGCTACAACTGCTCGCGCTGCTGCTCGCGTCGCCGTGGAGCACCACG TAGCAGCG ACCGTG SSCDRSTLTADDGSG CGCACGCTGAACCGAACCAAGAACATCTACCGATGGCTCGCGTGCTCGT GCTACCGC ACATAG DDVAAPSSLNSNDV GGAGCCCATGCAGGTCGACGAGTGCAGCAGCTGCGACCGCAGCACTTT AGCTACTC TGCGGC AAPMDVDSGTDCPP GACAGCCGACGACGGGAGCGGTGACGACGTTGCAGCCCCAAGCTCCTT CAGCTCCG ACGTGA ALQQPPQRPRALHV GAACAGCAACGACGTCGCCGCCCCCATGGACGTTGACTCCGGCACTGAT GTAGCTGC AGATGC GSKRRRLDADDEEEA TGCCCTCCAGCGTTGCAACAGCCGCCCCAGCGACCGCGTGCGCTGCACG TACTGCTA ACATGA RQLQEEEEAGIHAPA TAGGCAGCAAGCGCCGCCGACTCGACGCAGATGATGAGGAAGAAGCTC CAACTGCT AGCTCC LRLSAASAQPASVLA GCCAGCTGCAAGAAGAGGAGGAGGCTGGCATCCATGCCCCAGCGCTCA CGCGCTGC GACACT VYTHNASRFDCTLYA GGCTATCCGCGGCCAGCGCTCAACCGGCCTCCGTCCTCGCCGTGTACAC TGCTCGCG GGGCCA YTAGSFASLKTHRNS GCACAACGCGTCGCGGTTCGACTGCACGCTGTACGCGTACACGGCCGG TCGCCGTG AGTGGG RHRRTAFLDRFSAGC CAGCTTCGCATCACTGAAAACGCACAGGAACTCTCGGCACCGCCGCACC GAGCACCA CGGCCA ACGVPFASRLAAARH GCATTCCTCGACCGATTCTCGGCGGGTTGCGCGTGTGGCGTGCCCTTCG CGCGCACG TGCGCG AQACASLSSAPLAEA CCTCGAGGCTGGCTGCAGCAAGACACGCGCAAGCGTGCGCCAGCCTCT CTGAACCG ACCAAG SSAAGASSHTVDEAD CGAGCGCACCATTGGCCGAGGCTTCGTCGGCAGCAGGAGCATCAAGCC AACCAAGA GCAAGA STVSAAGHTEPDLPR ACACTGTCGACGAAGCCGACTCCACCGTCTCAGCCGCGGGTCACACCGA ACATCTAC CAGCTC HNATELTASPPLVSSP GCCCGACCTGCCCCGCCACAACGCCACGGAGCTCACAGCGTCCCCCCCG CG(SEQID GTGCAC DVEVQAPETEATENR CTGGTGAGCTCCCCCGATGTTGAAGTGCAGGCACCAGAGACAGAGGCC NO:1205) ATGGTG WGTPLPRVLVASRIA ACCGAGAACCGCTGGGGCACTCCACTTCCGCGCGTGCTGGTCGCGTCTA GGGTTG GRLAQVPPPRWGPP GGATCGCCGGGCGCCTCGCTCAGGTGCCACCCCCACGCTGGGGCCCCCC GTAGTG LPRTTIAGRIATRLAA GCTACCGCGCACTACCATCGCGGGCCGCATTGCGACGAGGCTCGCCGCC GCTTCCT TPAPRWDPPLPRSLV ACACCTGCCCCCCGCTGGGACCCACCGCTGCCACGAAGCCTGGTTGTGT CACACG VSRIAARLLPALPDAP CACGCATCGCCGCCCGCCTGCTACCCGCGCTACCAGACGCGCCTGCATG GCCGCG ACEEEAKDSDTMDW CGAGGAAGAGGCCAAGGACAGCGACACGATGGACTGGGCGCCAACTT AGGCCT APTWTNEETKDSEPH GGACAAACGAGGAGACCAAGGACAGCGAGCCGCATGACGAAGCGCCA GTGGGA DEAPGQVDEETIDDA GGGCAGGTCGACGAGGAGACCATCGACGACGCCGACGGCGAGTGGCT AGCCGA DGEWLLRFDGACRA TTTGCGCTTCGACGGTGCTTGCCGTGCAAACCCAGGCCCAGGCGGCGCC CTACCG NPGPGGAGAALFKP GGCGCAGCTCTCTTCAAGCCCAGCGGCCCCGTAGTGTGGACTTGTTCCC GTTCGC SGPVVWTCSHYDPS ACTACGACCCGAGCACCACGGCAACGAACAACACAGCAGAGTACACTG GGTGGC TTATNNTAEYTALLL CGTTGCTGCTCGGGGCCAGGGCTGCCGCCGACCACGGCGTCACGAAGC CGGCCG GARAAADHGVTKLR TGCGCGTCGAGGGCGACAGCACGCTGGTCATCCAGCAAGTGCGGGGCA CTTAAAC VEGDSTLVIQQVRGIF TCTTCGCGACGAGAAGCACGCGGCTCCGAGCACTGCGCAACAAGGTCA CAGGTA ATRSTRLRALRNKVKL AGCTCGAGCTCGCGCGCGTGGGGAGCTTCTCGCTCCACCACATCGACCG GACACG ELARVGSFSLHHIDR GCAGGCGAACGGCCACGCCGACCGCCTCGCCAACGCCGGCCTCGACCG CACAGT QANGHADRLANAGL ACGCCGCACCAAGCTCGAGTGCAGTGTGCACCCCGATGGCAGGGGCTG GTCCTCA DRRRTKLECSVHPDG CACCAACACCTCTGTGGCGACCGCAGCGCCTACTGCTCCTGCAGCGCCC AGCGCC RGCTNTSVATAAPTA TTGCCCCCAGCCCGGCCTCCTGCAACTACAGCCGCTCCGTCTCATGATGA GGCGCC PAAPLPPARPPATTA CGACCACAGCGTTCAAGGCGACATCGACGATGGCGAAGTATACGCTGC TCCGAG APSHDDDHSVQGDI AATGTGCATCGGCCCCGATGCGGTGCCGCACCGCCGACCGCGCCTCCGC CACATG DDGEVYAAMCIGPD CTACGGCACTTGACGGACGAGGAGTCCGAGGAGGCTGGCGACGTCGTG ACAGGT AVPHRRPRLRLRHLT GAGAGGCTGGCAGCGTCGCTGGCGGCGAAGATTGCAGACGCGCCCGA GCAGAC DEESEEAGDVVERLA CTGGGAGACGGCGGAGGGCTACATCACAGCCCTACCGTACGCTCTCTAC ACCCAG ASLAAKIADAPDWET GACAAGCTGCAGCCGTACTCCCAGGCCCAGCCTCAGCCTCCAAGCCAAC GATGCA AEGYITALPYALYDKL AGCAGCAGCAGCAGCAGCAGCGACCACGGCAGCAGCAGCAAACACGG CGCGCT QPYSQAQPQPPSQQ CAGCGTCGACAGCGGAGAGGCAAGCGCGGCGGTGGCAGCCAACGCAG CCGAGA QQQQQQRPRQQQ GCAGCGGAAGACGCGCCGCCGACGGCCCCCGCGGGTGACACGCCACCA CGCTGG QTRQRRQRRGKRGG CCGCGAGCACCGGATCGATGAAGCCCTCGACGACCTCCACGCGATCGA CGCATG GSQRRQRKTRRRRPP GAGCAGGCGCCCTCAAGACAGGACGGCCATCAGCAAGGCCAGGCGTCG TGCTTAA RVTRHHREHRIDEAL AGTCGGCCGCATCCGCTCCGCCCTTGACCAGCACCAATTGCGCCACCGC CCACTG DDLHAIESRRPQDRT TTCGACACCGATGAGAAGGTGTGCGTGGACGCCATCCTGGCTGGGGCG CGACGG AISKARRRVGRIRSAL CGAGCGAGCCAAGGAGCGACAACGGCACCACCGTCGGCAACCACGGAC AACCAT DQHQLRHRFDTDEK CCCCCAGCTCCGATGGACGACAGCAGGTGCCCCATTCCCGGCGACGATC GGACGC VCVDAILAGARASQG TATGGAGGTTTTTCGACAGCGTTAACACGCCACGGCGGTCGTTCGACGC AGTCCG ATTAPPSATTDPPAP AGAGGCACCAGACGGAGCTGCCTTCCGCGAAGCGATGGCGTGCTTGCC TGGCCG MDDSRCPIPGDDLW CGCCGCCACGAGAGCCCAGGAACTGCTGACGGAGGCGCCTACGGTGGA GCCATG RFFDSVNTPRRSFDA CGAGGTTGAAGACCAGATTCAGCACGCCCGCGCCAGCTCCAGCCCCGG ACGCCG EAPDGAAFREAMAC CCTCGACGGCGTTGGTTACGACATCTACAAGCAGTTCGCCGCCCACCTC CACTCA LPAATRAQELLTEAPT CTGCCAGCACTGCACACAGCGTTCGTGTGCTGCTGGAACCACAAGCGG AGATTA VDEVEDQIQHARASS GTGCCTCAGAGCTGGAAGCTCGGCGTGGTGCGCCTTCTCCACAAGAAA TTGAGC SPGLDGVGYDIYKQF GGCGATCGACAGGATCCTGCCAACTGGCGGCCGATCTGCTTGCAGCAG GTGCGC AAHLLPALHTAFVCC ACCATCTACAAGCTCTATGCCGGCATCCTGTCACGCCGGTTCGTGCGCTG TCCTCGC WNHKRVPQSWKLG GCTGGACGCAAACGCTCGTCACGCAGAGGCGCAGAAAGGATTCCGGGC ATCGTC VVRLLHKKGDRQDP GATGAACGGCTGCGGGGAGCACAACTTCCTCGCGGCCACGCTCGTCGA GGCCGA ANWRPICLQQTIYKL CCACGCCCGCCGCAAGCGCAAGGAGCTGCACGTCGTGTGGTACGACCT CCAGCA YAGILSRRFVRWLDA TGCCAACGCCTTCGGCAGTGTGCCACACGACTTGCTATGGGAAACGCTG GGACCG NARHAEAQKGFRAM GCGCGCCAAGGCGTACCGCCCACGTTCGTGGACTGCTGCCGCGGGATC TGCTGA NGCGEHNFLAATLV TACAGTGATGCAGCCTTCACCATCGGGAACGCCGCCGACGGCACCACA GCTCCG DHARRKRKELHVVW GCGCCCATCAGGCTTCGCGTGGGAGTATTCCAGGGCTGCCCGCTCAGCC CGTGAA YDLANAFGSVPHDLL CCCACCTCTTCACCGCCGCAATCGCCCCGCTGCTCCATGCGCTGAAGAG CCAGAC WETLARQGVPPTFV GCTGCCGGTCACTGGAGTGCAACTCACCGGCGTGGACCGTCCCGGAGC CGTGCC DCCRGIYSDAAFTIG TGCTGCCTACGCGGACGACCTCAAGACGTTCAGTAGCTCCGTGGACGGA GTCACTC NAADGTTAPIRLRVG ATCAAGAGGCAGCACGAGCTCGTGGCGACCTTCCTACGATGGACCGGC GCCGGC VFQGCPLSPHLFTAAI ATGGCGGCGAACCTTTCCAAGTGCAGCGCCATGTCGGTCCAGCGCGAC CCCGCG APLLHALKRLPVTGV AGCCGGGGCGTGCTCAAGACAGGTGACCTATGCCTCAAGCTCAACGAC CTACGG QLTGVDRPGAAAYA GCAGAGATCCCAGCGCTGAGCATGACGGCCTCCTACGCGTACCTCGGCA CCCGAC DDLKTFSSSVDGIKR TCGGGGACGGCTTCGACCACGTGCGCCGCCGCCTCGAGCTCGCGCCAAT CTTCAGC QHELVATFLRWTGM GATGAAGCAGCTCAAGCACGACGCGACGGCGCTGATGCAGTCGGGGCT TCTACAA AANLSKCSAMSVQR GGCCTCATGGCAAGTGGTGAAGGCGGTGAAGGTATACCTCTACCCCCG CCACAC DSRGVLKTGDLCLKL CATAGAGTACGCCCTTCGCCACCTTCGCCCGTTCAAACAGCAATTAGAA CAAGAA NDAEIPALSMTASYA GCTTTCGACGAGCACCTCCGCCGCGGGCTCCGCCACCTGCTGCGACTAC GACGGT YLGIGDGFDHVRRRL CGACGAACGCCACCAGCGCCTTCTTCTCAGCGCCGACATCGCGCGGCGG GGCGGT ELAPMMKQLKHDAT GCTGGGCCTTCTGCCGCTGACGGAGCTGCACGCGGCGCTTCAGATCGCC GGTCGA ALMQSGLASWQVVK CATGGGTGGCAGATACTCAACTCCCCAGACGGCGCCACCCAGAGGATT CCTGGC AVKVYLYPRIEYALRH GCCCGCGAGCAGCTCCGGGAGATTCCAGACGCGCGCCACCGCCTGGAC CGTGGC LRPFKQQLEAFDEHL ACTGCCCACTGGAGGAACCGCGACGCAGAGCTGTGCGAGCTGCTGCTC GTTGAG RRGLRHLLRLPTNATS AACTCGCAGCTCGGACAGTCCAGCCACGCCCCACCCAAGCGGCGCAACT GAGCAG AFFSAPTSRGGLGLLP GCGACATCGGATCCCTTTGGATCGACATTCGGCGCCAGCTCGGCACGCT GCGAGT LTELHAALQIAHGW CGGGCTCAAATTCGAGACCGCGCCTGGCCGACGCAGTCACCAACCAGC GACGAC QILNSPDGATQRIAR CCGCCCAGCCATTGCAGCTTTCGCGTGCCGCACCACGACAAGTGGCTGA GCGAGT EQLREIPDARHRLDT CGCACCGTGACATAGTGCGGCACGTGAAGATGCACATGAAGCTCCGAC AGCTCG AHWRNRDAELCELLL ACTGGGCCAAGTGGGCGGCCATGCGCGACCAAGGCAAGACAGCTCGTG GCACTG NSQLGQSSHAPPKRR CACATGGTGGGGTTGGTAGTGGCTTCCTCACACGGCCGCGAGGCCTGT TCCCGG NCDIGSLWIDIRRQL GGGAAGCCGACTACCGGTTCGCGGTGGCCGGCCGCTTAAACCAGGTAG ATCGCC GTLGLKFETAPGRRS ACACGCACAGTGTCCTCAAGCGCCGGCGCCTCCGAGCACATGACAGGT AACCAC HQPARPAIAAFACRT GCAGACACCCAGGATGCACGCGCTCCGAGACGCTGGCGCATGTGCTTA AAGCGA TTSG(SEQIDNO: ACCACTGCGACGGAACCATGGACGCAGTCCGTGGCCGGCCATGACGCC GCCAAG 1450) GCACTCAAGATTATTGAGCGTGCGCTCCTCGCATCGTCGGCCGACCAGC TACGAC AGGACCGTGCTGAGCTCCGCGTGAACCAGACCGTGCCGTCACTCGCCG CACATC GCCCCGCGCTACGGCCCGACCTTCAGCTCTACAACCACACCAAGAAGAC AAGCTA GGTGGCGGTGGTCGACCTGGCCGTGGCGTTGAGGAGCAGGCGAGTGA CACCTC CGACGCGAGTAGCTCGGCACTGTCCCGGATCGCCAACCACAAGCGAGC GAGCGC CAAGTACGACCACATCAAGCTACACCTCGAGCGCCAAGGATGGAAGGT CAAGGA ACACCTCTCGGCACTCGTGTACGGGTCGCTTGGGGCGGTCGCTAGTGGC TGGAAG AACTACCAGGTGTACACCACACACCTGGGGCTACTCAAGCGCGATGCAA GTACAC AGCGGCTGGACCGGCAGCTGTCTGCCTAATGCATCCAGTCCAGCCGCCG CTCTCG CATCTAGAATCTACACTGCAGCCAGCACCGGACTCGCCAACACCAGGCG GCACTC AGGCCCAGCCAAGGACCAAGAGGCAGCCGGGCGACGGAGACCGGGGG GTGTAC GACTCCGTCCCAGACAAGCCGCCGCTAGGCGGAAACCAGGCCCAAGAC GGGTCG GGCCGACAGGGCCCCACCCAGGTAGGGAACCGCCCTAGAAACCCATTT CTTGGG CGGTGGTCGACTCGCAAGCCTTACCTATATTTTAGACGTAGCGACCAAA GCGGTC TTACAAATTTGGTAACGAGTAAGCCAAATGGTAATACACAAAACTTT GCTAGT (SEQIDNO:1082) GGCAAC TACCAG GTGTAC ACCACA CACCTG GGGCTA CTCAAG CGCGAT GCAAAG CGGCTG GACCGG CAGCTG TCTGCCT AATGCA TCCAGTC CAGCCG CCGCAT CTAGAA TCTACAC TGCAGC CAGCAC CGGACT CGCCAA CACCAG GCGAGG CCCAGC CAAGGA CCAAGA GGCAGC CGGGCG ACGGAG ACCGGG GGGACT CCGTCCC AGACAA GCCGCC GCTAGG CGGAAA CCAGGC CCAAGA CGGCCG ACAGGG CCCCACC CAGGTA GGGAAC CGCCCT AGAAAC CCATTTC GGTGGT CGACTC GCAAGC CTTACCT ATATTTT AGACGT AGCGAC CAAATT ACAAAT TTGGTA ACGAGT AAGCCA AATGGT AATACA CAAAAC TTT(SEQ IDNO: 1328) NeSL Utopia- Phytophthora CCAGGCATCACCCCCGCCGCCCCCAAGCGCTCCGCCGCCGAGCGCAGCT CCAGGCAT TGATGC MSGDVVSSDGSSRT 5_PR ramorum GCGGCTGCTGCCGCCCCACTGGTGACGGCAGTCGCAGCTGCCCCTGCCA CACCCCCG CCGCCA TDVSGDGDDGADGA CCTCTCCTGTTGGCCGCGCCGCCGTGGAGCCCCGCGCGCGCGCCGAGCC CCGCCCCC ACTACA GSSDAAGDVGVVA GCCCCAAGAACAAGCACCCCCAGCTAGCGCGCGCGCGGAGCCCATGTC AAGCGCTC AGGTGC MDVDQGARRQRPP GGGCGACGTGGTCAGCAGCGACGGCAGCAGCAGGACGACGGACGTGA CGCCGCCG TTACTGA WQRVGGKRRRLND GCGGCGACGGCGACGACGGCGCCGACGGCGCAGGCAGCAGCGATGCA AGCGCAGC GCACCTT VDDEDTRELAELLLEE GCAGGCGATGTCGGCGTCGTCGCCATGGATGTTGACCAAGGTGCCCGC TGCGGCTG GGGCTG EDEVGAQAPALRLFA CGACAGCGGCCGCCATGGCAGCGCGTCGGCGGCAAGCGTCGACGTCTC CTGCCGCC CTCAAG ASAHPASVLSVYAHN AACGACGTCGACGACGAAGACACCCGCGAGCTGGCAGAGCTCCTGCTC CCACTGGT CGTGAT AQRFVCTLCAYTAAS GAGGAGGAGGACGAGGTTGGGGCCCAAGCACCAGCGCTCAGGCTATT GACGGCA GCGAAG FASLKRHRDSRHRRV CGCGGCCAGCGCTCACCCGGCCTCCGTCCTCTCCGTGTACGCACACAAT GTCGCAGC CGGCTG SFVDKFSAGCACGTP GCGCAGCGCTTCGTGTGCACGCTGTGCGCGTACACGGCCGCCAGCTTCG TGCCCCTG GACCGG FGSRLAAARHAQAC CGTCGCTCAAGCGACACCGGGACTCACGACACCGGCGCGTGTCCTTCGT CCACCTCT CAGCTG ASLSIPRTVTAPAAAG GGACAAGTTCTCGGCGGGCTGCGCGTGCGGCACGCCCTTCGGCTCGAG CCTGTTGG TCGGTG DLSPTATGANATASA GCTGGCAGCGGCGAGACATGCGCAAGCGTGCGCCAGCCTCTCGATCCC CCGCGCCG GCGTGC AATSPDLPRPASPELA ACGCACGGTAACAGCTCCGGCGGCAGCAGGAGATTTGAGCCCCACTGC CCGTGGAG ATCCAG ASPPQTSPFDVAIQA CACGGGAGCCAACGCCACCGCCTCGGCGGCCGCAACTTCGCCCGATTTG CCCCGCGC TCCAGC DAAEQTAWTRWDP CCCCGCCCAGCTTCCCCGGAGCTCGCTGCGTCCCCCCCGCAGACGAGCC GCGCGCCG CGCCGC PLTRAAVAARVASRL CCTTTGACGTTGCCATACAGGCAGACGCTGCAGAGCAGACAGCGTGGA AGCCGCCC ATCTGG AVVPAPRWGPPLSR CGCGCTGGGACCCTCCGCTCACCCGCGCTGCGGTGGCAGCCCGTGTCGC CAAGAACA AACCTG TLVASRIAARLDAQTS CTCCCGCCTCGCCGTGGTTCCCGCGCCGCGATGGGGGCCGCCGCTGTCA AGCACCCC CACTGC RWGPPLPPAMVASR AGGACGCTGGTGGCCTCCAGGATTGCAGCACGCCTGGACGCGCAGACG CAGCTAGC GCGCAG IASRLAAMPAPRWG TCACGATGGGGACCGCCCCTGCCACCGGCGATGGTGGCGTCCCGGATT GCGCGCGC CATCGA PPLLRTVIASRIADRLL GCGTCGAGACTCGCCGCCATGCCGGCACCCCGCTGGGGCCCGCCGCTG GGAGCCC GCACGG PPELAADEETKDDDV CTGCGGACCGTGATCGCATCCCGCATCGCCGACCGACTGCTCCCACCGG (SEQID CAGCAC HMDNAASVDVDEES AGTTGGCGGCCGACGAGGAGACGAAGGATGACGACGTCCACATGGAC NO:1206) CAGGGC EVADVVMTDHDGE AACGCGGCCAGCGTGGACGTCGACGAGGAGTCCGAGGTCGCCGACGT CAAGCG WLLRFDGACRANPG GGTCATGACCGACCACGACGGCGAATGGCTGTTGCGCTTCGACGGCGC CCAAGG PGGAGAALFKPSGPV CTGTCGTGCCAACCCTGGCCCAGGCGGCGCCGGAGCCGCTCTGTTCAAG GGCAGT VWACSRYMPSSSAT CCCAGCGGCCCCGTAGTGTGGGCATGCTCGCGCTACATGCCGAGCAGC CGGGCG NNTAEYTALLLGARA AGCGCGACCAACAACACGGCCGAGTACACGGCGCTGCTGCTCGGCGCT GCGGAG AADHGATHLRVEGD AGGGCCGCTGCCGACCATGGAGCCACGCACCTGCGAGTGGAAGGCGAC ACCGGG STLVIQQVRGIFAARS AGTACGCTTGTGATCCAGCAGGTGCGTGGCATCTTCGCAGCGCGCAGCA GGGACT TRMRALRNQVQSEL CCCGGATGAGGGCGCTGCGCAACCAGGTGCAGTCCGAGCTCGCGCGGG CCGCCA ARVGSFSLHHIDRQD TGGGCAGCTTCTCGCTCCACCACATCGACAGGCAAGACAACGCACACGC CAGAGC NAHADRLANRALDL GGACCGCCTTGCCAATCGCGCGCTGGACCTGCGCCGGACCGTGATAGA GGCCGC RRTVIECGIHCDGVG GTGCGGCATCCACTGTGATGGTGTTGGCTGCACCGCCACCACGACGGA CGCTAA CTATTTEVQSSSAPEI GGTACAGTCATCATCAGCGCCCGAGATACCGACCCGGCCGGTTGCTGAC GCGGAC PTRPVADDHDEHEV GACCATGATGAACATGAAGTTGTTGATGTTGTTGATGTTTGTGGAGTTT ATCGGG VDVVDVCGVCGVCG GTGGAGTTTGTGGCGACCGTGGAACTTGTGGAGTCTGTGACGTTAGTG CCCGTA DRGTCGVCDVSGDI GAGATATCGATGATGGCGAGGTCTACGCAGCCATGCGTACGGGACCTG GCGGCC DDGEVYAAMRTGPD ACGCTGTTCCTGCGCGCCGACCACGCCTCCGCTTGCGGAAACTCACTGA GACAGG AVPARRPRLRLRKLT CGAAGAGCAAGAAGAGGCAGGCACGCTGGCGGAACGGCTTGGAGCGA GCCACA DEEQEEAGTLAERLG CGCTGGCGGCCAAGATTGCAGACGCACGCGACTGGGAGTCGGCGGAG CCCATGT ATLAAKIADARDWES GGATACATCACAGCCCTCCCTTATCTACTCTACGACAAATTGTTGCCGTA AGGGAA AEGYITALPYLLYDKLL CTCCCAGGGCCCAGCTCGCTCCTTGCCGGTCCGCCAGCACCAACGGCAG CCGCCCT PYSQGPARSLPVRQH CAGCAGCAGCCGGACGGACAATTCCAGCGACCCACCCAGTCCCGATCTG CTAAAC QRQQQQPDGQFQR CGGCTCGTCGACAGCGACGCCAGCGCCACCGGGCACGTCGCCGACCGC CCGCCC PTQSRSAARRQRRQ CACGCGTGACTCGTCACCACCGTGAACACCGACTAGACGAAGCGCTTGA GGTACA RHRARRRPPRVTRH CGACCTCCACGCGGTGGAGAGAGCTACGCCCAGCGACCGCCGCTCCATT TTATGGT HREHRLDEALDDLHA CGCCGAGCCCGCCGCCGGGTCGGCCGCGTTAATTCGGCCGTCGAACAA CCGACA VERATPSDRRSIRRAR CAACGGCTCCGGCACCACTTCGATACGAACGAGAAGGGCTGCGTGGAG CCTATG RRVGRVNSAVEQQR ATCCTACTGGCCAAGGCGCGTGCACAGAGGTCGACGACGGTGGCCCGG AGGTGC LRHHFDTNEKGCVEI ACGGCAGTTGGAGAGCCGAACAGCGGTGCGGCCGAGGACGACGGCAC AACCGG LLAKARAQRSTTVAR GTGCCCCATCCCGAGCGAGCGCCTTCACCGCCATTTCACCGAGGTGAAC TACACA TAVGEPNSGAAEDD ACGCCCGGGTCCAGCTTTGATGCGATGGCCCCCGTGGGTGCACCGTTCC AGTTAC GTCPIPSERLHRHFTE GTGCGGCACTTGCGCACCTCCCCGCGGCCACAGAGGCCTCGGAGCTGCT ACACCA VNTPGSSFDAMAPV GACAGAGGCGCCGACGCCCGATGAGATCGAGGATCAGCTGCAGCGAG CATAGC GAPFRAALAHLPAAT CCAAGGGAACGACCAGCCCTGGCCTGGACGGAGTGGGCTACGACGTCT GACTAC EASELLTEAPTPDEIE ACAAGGCCTTCTCCACCCAGTTACTGCCGGTGCTGCACGCGGCATTCCA CAGGTA DQLQRAKGTTSPGLD GTGCTGCTGGCAGCACCACCGTGTGCCGCAGAGCAGGAAGCAAGGCAT TTTACTA GVGYDVYKAFSTQLL CGTGAGGCTCCTGTACAAAAAGGGCCCGCGCGAAGACCCCGCGAACTG CCTGGA PVLHAAFQCCWQHH GAGACCTATCTGCCTCCAGCAAGTGATCTACAAGACCTACGCCGGGGTC AGCCAA RVPQSRKQGIVRLLY CTCGCGCGGAGGTTTACGCGCTGGTTGGCTGCAAACGGCCGCCATGCC GGATTA KKGPREDPANWRPI GACGCCCAGAAGGGGTTCCGCACTGTAAATGGCTGCGGTGAGCACAAT ACCGGT CLQQVIYKTYAGVLA TTTCTCGCGTCGACGCTGATCGATCATGCGCGCCGCAGCCGCAGAGAGC CGGTAA RRFTRWLAANGRHA TCCACATGGTCTGGTACGACCTGAAGAACGCCTTCGGCAGCGTGCCCCA TACACAT DAQKGFRTVNGCGE GGAGTTGCTGTGGGAGGTGCTGCAGCGCATGGGTGTCCCGCCAGCCTT AACTTT HNFLASTLIDHARRSR TGTGGAGGTCTGCCAGGGGCTGTACCAGGATGCTGCCTTCACCGTCGGC (SEQID RELHMVWYDLKNAF AACGCTGCTGACGGCCCGACCGATCTCGTGCGGCAGCTCGTGGGCGTG NO: GSVPQELLWEVLQR TTTCAAGGATGCCCGCTGAGCCCCCACCTGTTCACTGCGGCGATCAGCC 1329) MGVPPAFVEVCQGL CGCTCCTCCACGCGCTGGACCGACTCAAGGACACGGGCGTGCGGCTGT YQDAAFTVGNAADG CAGCGGACGACCGTCCCGGTGCCTCGGCCTACGCCGACGACCTCAAGAT PTDLVRQLVGVFQG CTTCAGTGGCACCGCGGACGGCGTTAAGCGACAGCACGCGCTCGTGGC CPLSPHLFTAAISPLLH CGACTTCCTGCGCTGGACCGGCATGGTGGCGAACCCCAACAAGTGCTGC ALDRLKDTGVRLSAD ACCATGTCGGTCCAGCGCGACGGACGCGGCGTGCTGAAGGCCTGCGAC DRPGASAYADDLKIF CTGGAGCTGCAGCTGGACGGAGCCCGGATCCCCTCGCTGATCATGAAC SGTADGVKRQHALV GCGTCGTACGCGTACCTGGGGACAGGCGACGGCTTTGACCACGTGCGC ADFLRWTGMVANP CGCCGCATCGAGCTCGTCCCGGCGCTCATGCAGCTCAAGGACGATGCG NKCCTMSVQRDGRG ACGGCACTGCTCCAGTCGGGGCTGGCGCCGTGTCAGGTGGTGAAGGCG VLKACDLELQLDGARI GTGAAGACTTACCTCTTCCCGCGGGTGGAGTACGCACTCCGCCACCTCC PSLIMNASYAYLGTG GCCCCTTTCAGCAGCAACTGGAAGGCTTTAATCGCCACCTCGTTTGCGG DGFDHVRRRIELVPA CCTTCGCCACCTGCTCCGACTTCCGGTCAGCGCCACGACCAGCTTCTTCT LMQLKDDATALLQS TCGTGCCGGTGTCGCGCGGCGGGCTAGGCCTGTTGCCACTGACGGAGC GLAPCQVVKAVKTYL TGCACGCAGCTCCGGCTGATCGCCGACCGGCGCCACCGCCTAGACCCCG FPRVEYALRHLRPFQ ACCACTGGAAGGAGCGGGAGGAGAAAATATGCGAGCTGCGGATCAAC QQLEGFNRHLVCGLR TCGCAGCTCGGGACGTCGGCCCACGCCCCACCCAAGCGCCGCAACGGC HLLRLPVSATTSFFFV GACATCGGCTCGTTGTGGGTCGACGTCCAGCGCCACCTCCGAGGACTCG PVSRGGLGLLPLTELH GCCTCAAGCTAGAGACCGCGCCGGCGTGCGCGGAGACCGGCACCGCG AAPADRRPAPPPRPR GAGGCGACGCTGCAGCTTCGCGTGCCGCACCACGAGAAGTGGCTGGAC PLEGAGGENMRAAD CACCGCACGGTCCTGCGGCAGGTGAAGCTGCACATGAAGAACAAGCAT QLAARDVGPRPTQA TGGCAGCGGTGGGCCGGGATGAAGGACCAAGGCAAGACCGCCCGCAC PQRRHRLVVGRRPA CCATGGTGGTGCAGGGAGCGGCTTCCTCACGCGGCCTCGCGGCCTATG PPPRTRPQARDRAG GGAGACCGACTACCGCTTTGCAGTGGCGGGTCGGCTCAACCAGCTGGA VRGDRHRGGDAAAS CACGCACAGCGTGCTGAAGCGCCGGCGCCTCCGCGCGCACGACCAGTG RAAPREVAGPPHGP TCGACAGCCGGGCTGCTCCCGTGCCGAGACACTGGCACACGTGCTGAAT AAGEAAHEEQALAA CACTGCGCCGGCACCATGGACGCGGTCCGCGGCCGCCACGACGATGCG VGRDEGPRQDRPHP CTCAAGACCATCGAGCGCGCGCTCATCGCGTCGTCCGGGCGGCCAGGA WWCRERLPHAASRP CCGCGCCGAGCTCCGGGTCAACCAGACCGTGCCCTCGCTCGCCGGCCCC MGDRLPLCSGGSAQ GCGCTGCGGCCCGACCTCCAGCTCTTCAACCACACCAAGAAGACGGTGG PAGHAQRAEAPAPP CGGTGGTCGACCTGGCCGTGGCGTTCGAGGAGCAGGCGAGCGACGAC RARPVSTAGLLPCRD ACCGAGAGCTCTGCGCTGGCGCGCATCGCCGCGCACAAGCGAGCCAAG TGTRAESLRRHHGRG TACGCCGGCATCAAGCGGCACCTCGAGCGCCAAGGGTGGAAGGTCCAC PRPPRRCAQDHRAR CTCTCGGCGCTCGTGTACGGCTCGTTGGGCGCGGTGATGCCCGCCAACT AHRVVRAARTAPSSG ACAAGGTGCTTACTGAGCACCTTGGGCTGCTCAAGCGTGATGCGAAGC STRPCPRSPAPRCGP GGCTGGACCGGCAGCTGTCGGTGGCGTGCATCCAGTCCAGCCGCCGCA TSSSSTTPRRRWRWS TCTGGAACCTGCACTGCGCGCAGCATCGAGCACGGCAGCACCAGGGCC TWPWRSRSRRATTP AAGCGCCAAGGGGCAGTCGGGCGGCGGAGACCGGGGGGACTCCGCCA RALRWRASPRTSEPS CAGAGCGGCCGCCGCTAAGCGGACATCGGGCCCGTAGCGGCCGACAG TPASSGTSSAKGGRS GGCCACACCCATGTAGGGAACCGCCCTCTAAACCCGCCCGGTACATTAT TSRRSCTARWAR GGTCCGACACCTATGAGGTGCAACCGGTACACAAGTTACACACCACATA (SEQIDNO:1451) GCGACTACCAGGTATTTACTACCTGGAAGCCAAGGATTAACCGGTCGGT AATACACATAACTTT(SEQIDNO:1083) NeSL YURECi Ciona ATCAACCCACTACTACACCCTCTACAAAGAACCCACTACAGCAAGATCTT ATCAACCC TAGCCA MATSSSSVSSGNVQT intestinalis GGCGACCAACTACACCTGCAGCCTCCCGTCGACTCTTCACCACCTGCGCA ACTACTAC AAAGAT EVRCVYHGKGDLLLE CCTCCCTCCGACTCCAAGCGGCAGACAGTGGCACCACCAGCACTCCAAG ACCCTCTA TTGGTG CPVAHCPSIHPTVATI GGTCCACAGCCAGAAGAGTGCACCCCCCACTGCTAGGAAGAGCTAGGC CAAAGAAC TTCTTGG TKHLKKHHTPQFEQI CATTGCCCGCTGTGGGCCTTGTGCTGGCTGTCTGCCGCTGATTGGCACG CCACTACA CGAGCT TTKNLTITYTCSQCSF TTAGGGAGTGTGCGCTGAGCACGATAGGGAGGCTGCTGAAGCCGAAG GCAAGATC GCAGTC STTGLTQHHISKHYKT AACCCGCGCCGGGGACTTTAGGACTTTAACTAAAATCTGTAACACTAAA TTGGCGAC CCGGCA CKGVGAVQEGNKGR ATTGGAATACTGGATTACTATTGGATTACTACTGGACAATAACCTGATCT CAACTACA GAGTAG FCCPACGTRWALLCK ACAACAACCAATTGGATTACAGAACCAAATTACAATCTCAACTACAAAC CCTGCAGC GCGAGA ARHHFNNVHFEYDT AAAAAGTGAGTCCCCGGCCGGGTTTCTATATTACCCAATTACTGTTTTTA CTCCCGTC TCGTGA PPIAAFSGTPYKLKKR CAATTTTAAAATTTTATAGTATTTTACTAATTTTTCCGCCCCGCTAGCACT GACTCTTC GCTCTAC KFTIINKALTYSCPLPL TAAATTGGCACCCCCCCCCCCCSTCCAAAAAAAAATAGATAACCCTCCAC ACCACCTG ACCGCTT NQLLCPLWSCSLTILN CACCTAAACCCCGGTCACCCCCTCAGAACTAATTGGAACCCGGATCTTTT CGCACCTC CACGAT KPLSSVQQETAHGD CCCTCTATCAAATTTCCGGTTCAGTTTGGCTTAATTACCTAAAACCCGTCT CCTCCGAC TTGACG GSQGQSYVPTQLRQ TTATTTTATCCCTTTATTGCCCCTTAAACCAAATTATTCAAATTGAAATCC TCCAAGCG AGTTCTT VLRARCHCGNPPIGK GGCCAAATTGCCTTGCTGAACCACCATTTTTGTTTTTCTTATAAAAGTAA GCAGACA CTTGAG GHWASCQGKRPLSS ATTTTTCAACAACCCAAAATCATAAAATTAAAACTGTTATCTGATTAGTCT GTGGCACC TCGATCC PKGGRSSPTPPANLT AAATTCTGTTATTGACAAAAACCTCCATACCAACCTTAATCTTATCATTAT ACCAGCAC CCAACG LHFLNYLPFQLPSQSS CTTCACCCAGACTGCTCAATCTAGTTCTTTTTTCAATCAATAAGGAGGTG TCCAAGGG GCAACG SPQSSTLDPTACKAR TTTTCTTTGGTCTGTTGTTTTATAGTTTACCGTTATAATAGTCTGTTGTCCT TCCACAGC CACTCCA VPIPSFLRGDCEVTFFI ATAGTTTTCCGTGGTGTATTTCTGGTCTACTATAGTTGGTGCCTCTAAAA CAGAAGA AATTTAA IPSVNFYRPYLSYPLR CATTATTATACCGAAGAGTGCTAGCGATTTCTATTATTAACTCACCCACC GTGCACCC CGACCG MFWRNRTSSGHCSL AGTACTGCTGTGCATCCACGCAGCACCACGCCTGCCATCACGTACAGTG CCCACTGC CATGGC HRVVRGFVRERLVPH CGCTTGCTGCCTTGTGTTTTGTTTTGCAGACACCACTGGACGATAGATGG TAGGAAG GACGCC PRSKSPARTPLEFLCE CGACTTCGTCTTCCTCCGTTTCATCGGGGAATGTTCAGACCGAGGTGCGT AGCTAGGC ACTGAT FRLAGVFPDPGKVAS TGCGTCTACCACGGAAAAGGAGACTTGTTGCTGGAATGCCCAGTTGCCC CATTGCCC CTCCCA LRPVPAPLTLCLSPPV ATTGCCCTAGCATACACCCCACAGTTGCCACAATCACAAAACACTTAAAA GCTGTGG GTCATCC AGPMISCEDHSAPPS AAACATCACACACCTCAATTCGAACAGATAACCACTAAAAATCTCACAAT GCCTTGTG GTGCTC VRSSSPIPNSPASVSS AACATATACATGCAGTCAATGTAGTTTCTCCACCACCGGTCTCACACAAC CTGGCTGT CGAGGG VEAHLSDLLDKVSSG ACCACATATCCAAACACTACAAAACATGCAAAGGCGTGGGCGCTGTTCA CTGCCGCT CACTTAA ELRPLSPTLPSSGFFG GGAAGGCAACAAGGGTCGATTCTGCTGCCCTGCATGCGGTACCAGGTG GATTGGCA GGGGAA PLLPPTPPPRPTPSAE GGCTCTTTTGTGTAAGGCACGGCACCATTTTAATAATGTCCACTTCGAGT CGTTAGGG CCGATTC KASPSGLSYLPCREVK ATGACACCCCTCCAATTGCCGCTTTCTCTGGCACACCCTATAAACTAAAA AGTGTGCG ACTGAT IASIARPSPASQRVGC AAACGGAAATTTACAATTATTAATAAGGCCCTAACATATTCATGCCCTCT CTGAGCAC GCTGCA DADRTGPSLNPNYQ CCCCCTCAACCAACTACTTTGCCCTTTGTGGTCGTGCTCCCTCACGATCTT GATAGGG AGTTGG QTSPPSTPSFSPIVRP AAATAAACCTCTTTCAAGCGTGCAGCAGGAAACGGCACATGGCGACGG AGGCTGCT CGGGGG PKFPRSGAKVNSKSK CTCACAAGGCCAGAGTTATGTTCCAACTCAGTTGCGGCAGGTGCTCAGA GAAGCCG CGCACT PPGVRPRRAKPIEPG GCCCGTTGCCACTGCGGAAACCCTCCAATTGGAAAGGGCCACTGGGCAT AAGAACCC CACCAA TESASPVDVDTISSSV CTTGCCAAGGGAAGCGACCACTCTCTTCACCTAAGGGGGGCGCAGTTC GCGCCGG TGGAAG QEPCTPENRTPEFFYE GCCCACTCCACCTGCTAATCTAACCCTCCACTTCCTTAACTATCTCCCCTT GGACTTTA ATGAGG RKWLVSILNIHEREGS CCAACTTCCCTCACAAAGCTCCTCCCCCCAAAGTAGCACCCTCGACCCTA GGACTTTA GAGCCC NFFQFNRDLEYWTQ CGGCCTGTAAAGCTCGAGTCCCCATCCCCTCTTTCCTACGCGGCGACTGT ACTAAAAT AACGAT LLSGSQKGGRAKRAS GAAGTCACCTTTTTCATCATCCCCTCCGTTAATTTTTATAGACCCTACCTC CTGTAACA GCCCCG YNRGAANQAMKNR TCCTACCCCCTCAGGATGTTCTGGCGTAACCGAACATCCTCCGGTCACTG CTAAAATT AACTAC DSGRKDFDPRPVAG TTCCCTACACCGGGTCGTTCGAGGCTTTGTCCGAGAGCGACTCGTCCCCC GGAATACT CGCACC HSSGGGTELGSRPRY ATCCTCGGTCGAAATCGCCGGCGAGGACTCCCCTTGAATTCCTCTGCGA GGATTACT GTCCCC PKGARLRADFWRD GTTCCGGCTCGCGGGGGTCTTCCCCGATCCTGGGAAGGTCGCGTCCCTT ATTGGATT AGCACC MKGTVRKLLDGSNG CGACCCGTACCAGCTCCCCTTACGCTTTGCCTTTCTCCACCAGTCGCTGG ACTACTGG GTGTCC ERRCGIPLDIIERKFRQ TCCAATGATTTCCTGTGAGGACCACAGCGCCCCCCCTTCCGTTCGCTCGA ACAATAAC TTGTGA VSMPGWIDHRRYAA GCTCCCCCATTCCTAACTCCCCAGCCTCCGTAAGCTCCGTGGAGGCGCAC CTGATCTA GTAATT GASPSLVTQAETDVA TTGTCTGACTTGTTAGACAAAGTGTCTTCCGGAGAACTGCGGCCGCTGT CAACAACC ATTTTCA ITSEEVEAVLSGLNVQ CCCCCACCCTGCCTTCTTCAGGCTTTTTTGGCCCTCTGTTGCCACCCACTC AATTGGAT CTAATCC SAPGSDGLSYRFWK CCCCTCCCCGACCTACCCCCTCGGCCGAAAAGGCATCTCCTTCAGGCCTT TACAGAAC AGGGAC GLDPSGRLLSCLFEIV TCCTACCTACCCTGCAGAGAAGTAAAAATCGCGTCCATTGCACGGCCAT CAAATTAC GGGGCC RRHGRIPGAWPTCS CCCCTGCCTCACAGAGGGTCGGCTGCGACGCGGACCGAACAGGGCCTT AATCTCAA ATGAAC VILLCKDAQGDVQDV CGCTGAACCCTAATTATCAGCAAACCTCCCCCCCGTCTACTCCGAGTTTC CTACAAAC TGAACT GNWRPITICRTLYKLY TCCCCCATAGTTAGACCCCCTAAATTCCCCCGGTCGGGCGCCAAAGTAA AAAAAGTG ATGTCT AAVIARRIQTWAKQ ATTCCAAGTCCAAACCCCCCGGCGTCCGACCACGCCGTGCCAAGCCAAT AGTCCCCG GCACGC GGVLSRLQKGFMPV TGAGCCAGGGACCGAGTCGGCATCACCCGTTGATGTCGACACTATTAGT GCCGGGTT TGCCCC EGVFEHVFMLDTVLS AGTTCGGTTCAGGAACCTTGTACTCCAGAAAACAGAACCCCGGAATTTT TCTATATT GCCTAG DAKLRRKNLLAVFLD TTTATGAGCGAAAGTGGCTCGTGAGTATTTTAAATATTCACGAGCGGGA ACCCAATT CCTCGG VRNAFGSVRHECLLK GGGATCCAATTTTTTTCAATTTAACCGGGATCTGGAGTATTGGACGCAG ACTGTTTT CCACAA VLRHFDAPHYLVELV TTGCTTTCAGGCTCCCAGAAGGGAGGACGCGCGAAGCGTGCGTCCTAC TACAATTT ATAAAT RDIYTGATCRVRSSV AATCGTGGTGCGGCGAACCAAGCGATGAAAAACCGCGATTCCGGCCGT TAAAATTT AGTCAA GETGDIPCDRGVRQ AAGGATTTCGATCCCCGGCCGGTAGCAGGCCACAGTAGTGGAGGTGGC TATAGTAT GCCGGG GYPLSGILFNLVTEVLI ACCGAGCTCGGTTCGCGGCCACGATACCCTAAGGGCGCACGTCTTCGTG TTTACTAA GCGTAC PGLSAGNDGYRMAC CCGACTTCTGGAGGGACATGAAAGGGACGGTCCGTAAGTTACTTGATG TTTTTCCG TACCAAT LGGKLTQVLAYADDL GTTCGAACGGTGAGCGTCGCTGCGGGATTCCCCTCGATATCATAGAGCG CCCCGCTA TGATTC VVVTENRDQMLRQL GAAATTCCGGCAAGTATCCATGCCCGGCTGGATAGACCACAGACGGTA GCACTTAA GGATTT GVCEEFGRWAGLAF CGCGGCCGGCGCGTCCCCGTCCCTAGTAACCCAAGCTGAAACGGATGTT ATTGGCAC GGCGGG NQRKCGLIGWRTLR GCCATCACCAGTGAGGAAGTTGAGGCGGTCCTCAGTGGCCTAAACGTTC CCCCCCCC GCCCAT GGRRVALEDPLLLNG AGTCCGCTCCGGGCTCGGATGGCCTCTCCTATCGATTTTGGAAGGGCCT CCCSTCCA CACGCG VEIPLLRPGEHYKYLG CGATCCTAGTGGTCGATTGTTGTCATGCCTTTTCGAAATCGTTAGGAGGC AAAAAAAA CCTCCGC AMTGVMSVPRTGS ATGGTCGTATTCCGGGCGCGTGGCCGACTTGTAGCGTCATCCTGCTCTG TAGATAAC CCCCAC QLIKDFRARLQRLFTS TAAAGATGCTCAGGGTGACGTTCAGGACGTGGGAAACTGGAGACCGAT CCTCCACC ACACAA FLTPHQKLIALKRFLLP CACCATCTGCCGCACTCTGTACAAATTGTACGCGGCTGTGATCGCCAGA ACCTAAAC AAACAC SLSFHLRVRPIARSELI CGTATTCAGACTTGGGCGAAACAGGGGGGGGTGCTCTCGCGGCTCCAG CCCGGTCA TTTTAAA ALDRRVRECLRVAFR AAAGGGTTCATGCCAGTTGAGGGTGTGTTTGAGCACGTCTTCATGTTGG CCCCCTCA CTCTTCG LTKPSCQAVFHTPTD ATACCGTCCTTTCCGACGCCAAGCTGAGGCGCAAGAATCTTCTTGCCGTC GAACTAAT GTTCCCC MDGLGVPSVCSESSI TTCCTTGACGTTCGGAATGCGTTCGGAAGCGTCCGGCATGAATGCTTGC TGGAACCC AACCAC LTIAQGFKVLTSPDGT TCAAAGTTCTTCGGCACTTCGATGCTCCCCATTACCTGGTTGAACTCGTG GGATCTTT TACAAC VSATASARVKLYAAK AGGGATATTTACACCGGTGCGACCTGCCGTGTTAGATCTTCGGTTGGTG TCCCTCTA AAAGCG FGGLTEAGPSDWAR AGACCGGTGATATACCCTGTGATCGAGGTGTCAGGCAGGGCTATCCCCT TCAAATTT AGCGGC YLSGDDVNGNSTRKP GTCTGGCATTCTTTTCAACCTGGTAACGGAGGTGCTAATACCGGGGCTC CCGGTTCA CCCTCTT GANLPSGLWTRVRC AGTGCCGGGAATGATGGCTACCGGATGGCTTGCTTAGGTGGTAAGCTC GTTTGGCT AGATCC ASRQLGAVWRVCPE ACTCAAGTCCTGGCCTACGCTGACGATCTGGTGGTAGTCACGGAAAACC TAATTACC AATTTTA NGITVRVRNSVITSRD GTGATCAGATGCTACGTCAGCTGGGTGTGTGTGAGGAATTCGGCCGTT TAAAACCC AAATTTT RRKLIRSFHDCSNQQ GGGCAGGCCTTGCATTCAATCAACGGAAGTGTGGCCTGATCGGCTGGA GTCTTTAT AAATCA WKEQWMQHPNQE GGACGTTGCGGGGAGGGAGGAGGGTCGCACTGGAGGATCCTCTGCTC TTTATCCCT GTGCAC KTAAAHMAYADANR CTTAACGGAGTAGAGATCCCCCTGTTGCGGCCCGGTGAACACTATAAAT TTATTGCC TCAACTT WVKQPSVMEPHTYF ACCTTGGCGCCATGACTGGCGTCATGAGTGTTCCCCGCACCGGATCCCA CCTTAAAC TTTTACG FALRARLNLLPTRVSR GCTGATCAAGGACTTCCGTGCAAGGCTCCAACGACTGTTTACATCGTTCC CAAATTAT TGTTGT AIYSRDQHPDILCRRC TCACACCACACCAGAAGCTGATCGCTCTGAAACGGTTTCTGCTGCCAAG TCAAATTG GTTTTTT GASVESLPHVLNHCP TCTCAGCTTTCATCTGAGGGTTCGTCCGATTGCGCGGTCTGAGCTGATCG AAATCCGG GTATTTT PNMSIILGRHNLVLQ CGCTCGACCGTCGCGTGCGCGAGTGCCTTAGGGTGGCGTTTCGCTTGAC CCAAATTG TTCCCAC EVLNAVDKTQFKEIS CAAGCCCTCGTGTCAGGCTGTGTTTCATACACCGACTGACATGGATGGT CCTTGCTG AGATTTT VDRTVPEHMSETGE CTGGGTGTCCCGTCGGTATGCAGTGAGAGCAGCATACTGACGATCGCCC AACCACCA GTATTTT ALRPDIVARRNDGSV AGGGGTTCAAGGTGTTAACGTCACCCGACGGCACAGTGTCCGCGACGG TTTTTGTTT TATATTA VVVDVACPFDQKAN CTTCGGCGCGCGTGAAGCTCTACGCCGCTAAATTTGGCGGCCTCACTGA TTCTTATA TATTTTA FDEAAKRKLLKYDKLC AGCAGGTCCTTCGGACTGGGCGCGTTATCTATCTGGTGATGACGTCAAC AAAGTAAA TATACAC CNIAASTGKPVECHSI GGGAATAGCACTCGCAAACCCGGCGCTAACCTGCCTTCTGGTCTGTGGA TTTTTCAA AACACT VVGSLGSLAEGLSTSL CTCGCGTCCGCTGTGCCAGCAGGCAGTTAGGTGCTGTGTGGCGCGTTTG CAACCCAA ATTTTTA RALGITDFARSKLVAC CCCGGAAAACGGCATCACTGTCCGGGTGCGTAATTCTGTGATTACCTCC AATCATAA TACACTA HQG(SEQIDNO: CGAGACCGACGGAAGCTCATCAGATCCTTCCACGATTGCTCTAATCAGC AATTAAAA CCTTGCA 1452) AGTGGAAGGAGCAGTGGATGCAGCACCCCAACCAGGAGAAGACCGCT CTGTTATC CTGTCCC GCAGCGCATATGGCTTATGCCGATGCGAACCGCTGGGTGAAGCAGCCG TGATTAGT ACTTTTT TCGGTTATGGAACCTCATACATACTTTTTTGCATTGAGGGCGCGCTTGAA CTAAATTC GTAATT CCTGTTACCCACTAGGGTTTCACGAGCTATTTATAGCAGGGACCAACAC TGTTATTG ATCACCT CCGGACATTTTATGCCGACGTTGTGGTGCGTCAGTTGAGTCCCTCCCGC ACAAAAAC TTTACCT ATGTACTAAACCATTGTCCTCCCAATATGAGCATAATACTCGGGAGGCA CTCCATAC TTTATGC TAATTTGGTACTGCAGGAGGTTCTGAACGCGGTAGACAAAACCCAGTTC CAACCTTA CGCTCG AAAGAGATATCCGTTGACCGAACGGTCCCCGAGCACATGTCGGAGACA ATCTTATC CTGAGC GGTGAGGCCTTGCGGCCTGACATAGTGGCACGGCGGAATGACGGTTCC ATTATCTT TCCTTGC GTCGTGGTCGTCGACGTGGCTTGCCCGTTCGATCAAAAGGCGAACTTTG CACCCAGA ACGGAT ATGAGGCGGCAAAACGGAAGCTCCTAAAGTACGACAAGCTTTGTTGCA CTGCTCAA GACCAG ATATCGCTGCCTCCACAGGTAAACCTGTGGAGTGTCATTCGATAGTCGT TCTAGTTC ACAAAC GGGCTCCTTGGGGAGCCTCGCGGAGGGTCTGTCCACCTCCCTCCGGGCT TTTTTTCAA TTTTATA TTGGGAATAACCGATTTTGCTCGCTCTAAGCTTGTCGCTTGCCATCAAGG TCAATAAG AAATTAT GTAGCCAAAAGATTTGGTGTTCTTGGCGAGCTGCAGTCCCGGCAGAGTA GAGGTGTT AACATT GGCGAGATCGTGAGCTCTACACCGCTTCACGATTTGACGAGTTCTTCTTG TTCTTTGG GTTTTAA AGTCGATCCCCAACGGCAACGCACTCCAAATTTAACGACCGCATGGCGA TCTGTTGT TTGTCG CGCCACTGATCTCCCAGTCATCCGTGCTCCGAGGGCACTTAAGGGGAAC TTTATAGT CGGGGT CGATTCACTGATGCTGCAAGTTGGCGGGGGCGCACTCACCAATGGAAG TTACCGTT ATCAGT ATGAGGGAGCCCAACGATGCCCCGAACTACCGCACCGTCCCCAGCACCG ATAATAGT GGCGCC TGTCCTTGTGAGTAATTATTTTCACTAATCCAGGGACGGGGCCATGAACT CTGTTGTC CCCTTGC GAACTATGTCTGCACGCTGCCCCGCCTAGCCTCGGCCACAAATAAATAG CTATAGTT GGCCGT TCAAGCCGGGGCGTACTACCAATTGATTCGGATTTGGCGGGGCCCATCA TTCCGTGG GAAAGC CGCGCCTCCGCCCCCACACACAAAAACACTTTTAAACTCTTCGGTTCCCC TGTATTTC CCTTTTC AACCACTACAACAAAGCGAGCGGCCCCTCTTAGATCCAATTTTAAAATTT TGGTCTAC ACCTTAA TAAATCAGTGCACTCAACTTTTTTACGTGTTGTGTTTTTTGTATTTTTTCCC TATAGTTG TTCGCCC ACAGATTTTGTATTTTTATATTATATTTTATATACACAACACTATTTTTATA GTGCCTCT CTTAGTC CACTACCTTGCACTGTCCCACTTTTTGTAATTATCACCTTTTACCTTTTATG AAAACATT CAAATTT CCGCTCGCTGAGCTCCTTGCACGGATGACCAGACAAACTTTTATAAAATT ATTATACC TTCCCCA ATAACATTGTTTTAATTGTCGCGGGGTATCAGTGGCGCCCCCTTGCGGC GAAGAGT TTGCCT CGTGAAAGCCCTTTTCACCTTAATTCGCCCCTTAGTCCAAATTTTTCCCCA GCTAGCGA GTAAAA TTGCCTGTAAAAGTGCGTTGCCGTCGACTCGAACTCGTCCTTTTTACCGC TTTCTATTA GTGCGT CTCTCTGTTACTGAACTATGACCAGCTTGGCTGTTGAAGTCGGCTTTAAT TTAACTCA TGCCGT TCGCCGGCTTCACATTATATTTTTTTGTTGGGTAATCTGTTTTTTTTCATAT CCCACCAG CGACTC ATAAAACCATTCGGCTTAAAATCACCAATCCCCATCATCCACCCTGGTCA TACTGCTG GAACTC CCCATTGAAACATCTCTTTTTAGTCAATTATTTTTTCAGATTACCCCTCCCA TGCATCCA GTCCTTT AATTGTCATATAGTTTAACACCCCGTTTGCCAAGTTGGTCTTTTCCCCATC CGCAGCAC TTACCGC CCCTTCTGTCCTTCCGGTAATCCCATAACATTTTATTCTTTAATTCGGCCC CACGCCTG CTCTCTG CATAAATAGTGTAATCTTAAGACGAAGTCCCCAGAGGCCCGGTCCCCCC CCATCACG TTACTGA CTTCTTTGTCATGGACCGGGCCAGCCCCCCTGTTACCACAACAACCCACC TACAGTGC ACTATG ATTTTATTCTTTCTTTCTTTTTATTAGTATTTATTTATATTGCCGCCCAACT GCTTGCTG ACCAGC GTCGTGTCGTGGCGCAGGGGGGTTCCCTCTGTCGGCCCAGTGGACGAC CCTTGTGT TTGGCT CGTCTAAAAAACAGGCACAGGCAGCAGGCACTTATCACCCGGTACCTCC TTTGTTTT GTTGAA GGGTACCGGAGGCTGGTTTTCAGCGCACTGTACGTGATGGCCAATTTTA GCAGACAC GTCGGC TTCATTGCATTTTATCCGCGTCGTGGTGTTTGCGTGGATGCATTAATAAA CACTGGAC TTTAATT AGATGAAATTCC(SEQIDNO:1084) GATAG CGCCGG (SEQID CTTCACA NO:1207) TTATATT TTTTTGT TGGGTA ATCTGTT TTTTTTC ATATATA AAACCA TTCGGC TTAAAAT CACCAA TCCCCAT CATCCAC CCTGGT CACCCAT TGAAAC ATCTCTT TTTAGTC AATTATT TTTTCAG ATTACCC CTCCCAA ATTGTCA TATAGTT TAACAC CCCGTTT GCCAAG TTGGTCT TTTCCCC ATCCCCT TCTGTCC TTCCGG TAATCCC ATAACA TTTTATT CTTTAAT TCGGCC CCATAA ATAGTG TAATCTT AAGACG AAGTCC CCAGAG GCCCGG TCCCCCC CTTCTTT GTCATG GACCGG GCCAGC CCCCCTG TTACCAC AACAAC CCACCAT TTTATTC TTTCTTT CTTTTTA TTAGTAT TTATTTA TATTGCC GCCCAA CTGTCG TGTCGT GGCGCA GGGGGG TTCCCTC TGTCGG CCCAGT GGACGA CCGTCTA AAAAAC AGGCAC AGGCAG CAGGCA CTTATCA CCCGGT ACCTCC GGGTAC CGGAGG CTGGTTT TCAGCG CACTGT ACGTGA TGGCCA ATTTTAT TCATTGC ATTTTAT CCGCGT CGTGGT GTTTGC GTGGAT GCATTA ATAAAA GATGAA ATTCC (SEQID NO: 1330) R2 PERERE- BN000800 Schistosoma ATCTCACGTTTTAATTTATTTTTGAACTACTGCAGTCTGAGTGCTTCTAAC ATCTCACG TAACGG MPVSTGAETDITSSLP 9 mansoni GACCCGAAGGCTCAGAAACTACCCACTTCTTGAACTGCTACTTTTTGCTG TTTTAATTT CTGAAC IPASSIVSPNYTLPDSS TTTATCCACAACAACAGTTGTGATTCTATTCTCCANATATTCCTTGTGCTT ATTTTTGA GAATAG STCLICFAIFPTHNILLS TTGTCAACATTATTCTATACCAACTGTACCACCTACTTCTTCATCTCACGT ACTACTGC CCCCCTT HATAIHHISCPPTPVQ TTTAATTCTGGTCTTATTTTCTCATCATTAGTCACGGAGAGGGCCTATGA AGTCTGAG CACTCTT DGSQQMSCVLCAAA ACGGTCCGTGACGCGAAATTCTATCCGCGATTTCGACCTCTCCTGCTAGT TGCTTCTA AGACAT FSSNRGLTQHIRHRHI GGTCCCCGAAGTACGGTTCCTCTGGCCTGTCAGTTGTGTTAAAACTATAT ACGACCCG TCCCCCA SEYNELIRQRIAVQPT AATAACGATGCCGGTCTCAACCGGCGCAGAAACTGACATAACCTCTTCT AAGGCTCA CTGTTGT SRIWSPFDDASLLSIA TTGCCTATTCCTGCATCCTCAATCGTCTCGCCAAACTACACACTCCCTGAT GAAACTAC TGCTTAT NHEAHRFPTKNDLC TCCTCTTCAACCTGCCTTATATGTTTCGCTATCTTCCCCACCCACAACATA CCACTTCT CTTCATG QHISTILTRRTAEAVK CTCCTCTCCCATGCCACTGCAATCCACCATATTTCTTGTCCTCCTACTCCA TGAACTGC CTCTTGT RRLLHLQWSRSPTAI GTGCAAGACGGTTCTCAGCAGATGTCTTGTGTTCTTTGCGCCGCCGCTTT TACTTTTT GTTAATT TTSSNNHTITDIPNTE TTCATCTAACAGGGGACTAACACAACACATTCGCCACCGGCACATCTCC GCTGTTTA GACTGC ARYIFPVDLDEHPPLS GAATATAACGAACTAATCAGACAACGAATTGCAGTGCAGCCGACGTCTC TCCACAAC TCTCTTC DATTPNASTHPLPELL GCATATGGTCACCATTCGATGATGCTTCTTTACTATCCATCGCTAACCAT AACAGTTG TGGGTT VILTPLPSPTRLQNISE GAAGCCCATAGATTCCCCACGAAGAATGACCTATGCCAACATATCAGCA TGATTCTA GACGTC SQTSHESNKNSMHT CCATACTAACACGCAGGACGGCAGAAGCCGTCAAACGCCGACTCCTCCA TTCTCCAN TGATTGT PPTYACDPDETLGAT CCTACAGTGGTCGAGATCACCAACAGCGATTACTACCTCTTCGAATAATC ATATTCCT CTCTCTC PSSTIPSCFHSYQDPL ACACAATCACAGACATCCCCAATACCGAGGCCCGATATATTTTTCCGGTA TGTGCTTT TCTTTCC AEQRGKLLRASASLL GACCTAGACGAACATCCACCATTGTCTGATGCCACAACCCCCAACGCATC TGTCAACA ATATTGC QSSCTRIRSSSLLAFL GACACATCCACTCCCAGAACTCCTTGTCATCTTGACACCGCTTCCATCCCC TTATTCTAT TTGCTCT QNESTLMDEEHVSTF GACTAGACTACAAAACATATCCGAATCACAGACCTCCCATGAATCTAATA ACCAACTG GCCCGC LNSHAEFVFPRTWTP AGAACTCAATGCATACACCGCCAACGTATGCCTGCGATCCGGATGAGAC TACCACCT TTACTTC SRPKHPSHAPANVSR ACTAGGGGCTACTCCCTCATCAACTATTCCCTCATGCTTCCACAGTTATCA ACTTCTTC CAATAG KKRRKIEYAHIQRLFH GGACCCCCTAGCTGAACAAAGAGGCAAACTCCTGAGGGCATCCGCCAG ATCTCACG TTGTCAT HRPKDASNTVLDGR CCTACTACAAAGCAGTTGTACTCGCATACGGTCCTCCAGCCTGCTCGCCT TTTTAATTC ATTATGT WRNPYVANHSMIPD TCCTCCAAAACGAATCCACATTAATGGACGAGGAACACGTGTCCACCTT TGGTCTTA CTTTGTT FDCFWTTVFTKTNSP CCTCAATAGTCATGCAGAATTCGTCTTCCCTAGAACATGGACCCCATCCC TTTTCTCAT TACTTGC DSREITPIIPMTPSLID GACCCAAACACCCCTCCCACGCCCCAGCTAATGTTTCTAGGAAGAAAAG CATTAGTC CATGTCT PILPSDVTWALKEMH GAGGAAAATAGAGTACGCACACATCCAGAGACTCTTCCACCACCGTCCC ACGGAGA AACGAC GTAGGIDRLTSYDLM AAAGATGCCTCCAACACCGTTCTAGACGGTCGGTGGAGAAACCCCTATG GGGCCTAT AATTACT RFGKNGLAGYLNML TCGCAAACCATTCAATGATTCCAGACTTCGACTGCTTCTGGACAACAGTC GAACGGTC TTATCTA LALAYLPTNLSTARVT TTTACTAAAACAAATTCCCCAGACAGCCGGGAGATTACTCCAATCATCCC CGTGACGC CCTTAGT FVPKSSSPVSPEDFRP TATGACTCCCTCTCTCATTGACCCGATCCTCCCCTCTGACGTCACATGGG GAAATTCT TTGTCCT ISVAPVATRCLHKILA CGCTGAAAGAAATGCATGGCACGGCCGGTGGGATTGATCGTCTGACAT ATCCGCGA CTTGGTT KRWMPLFPQERLQF CGTACGATCTGATGAGATTCGGGAAGAATGGTCTTGCTGGATATCTCAA TTTCGACC TCGATT AFLNRDGCFEAVNLL CATGCTACTCGCTCTTGCATACCTTCCCACTAATCTTTCAACAGCACGGG TCTCCTGC GCCTTCA HSVIRHVHTRHTGAS TAACTTTCGTCCCCAAGTCATCAAGTCCTGTGTCACCTGAGGACTTCCGT TAGTGGTC TATGTTC FALLDISRAFDTVSHD CCCATCAGTGTCGCTCCAGTAGCCACTAGGTGCCTGCACAAAATTCTAG CCCGAAGT ATGGCG SIIRAAKRYGAPELLC CTAAGAGATGGATGCCGCTCTTTCCACAGGAACGACTTCAGTTCGCTTTC ACGGTTCC GAATCT RYLNNYYRRSTSCVN CTAAACCGAGATGGATGCTTTGAAGCAGTTAATCTTCTGCACTCGGTCAT TCTGGCCT GATGTTT RTELHPTCGVKQGDP ACGGCACGTCCACACCCGCCATACAGGAGCATCCTTCGCCCTGCTCGAC GTCAGTTG ATAATG LSPLLFIMVLDEVLEG ATATCACGGGCCTTTGACACTGTATCACATGACTCCATCATCAGAGCGG TGTTAAAA ACTATTC LDPMTHLTVDGESLN CGAAAAGATATGGGGCACCTGAACTGTTATGCCGCTACCTCAATAACTA CTATATAA CTATTAC YIAYADDLVVFAPNA TTACCGACGTTCAACCAGCTGCGTCAACCGCACTGAATTGCATCCTACGT TAACG CACCACT ELLQRKLDRISILLHEA GTGGGGTGAAGCAAGGAGACCCCCTGTCGCCACTCCTCTTCATCATGGT (SEQID ACAACT GWSVNPEKSRTLDLI TCTCGACGAAGTACTGGAAGGTCTAGATCCAATGACCCACCTAACAGTT NO:1208) ACTATTA SGGHSKITALSQTEFT GATGGAGAGAGCTTGAACTACATAGCTTATGCTGACGATCTCGTAGTTT TTATTTT IAGMRIPPLSAADTF TCGCTCCAAATGCGGAACTCCTTCAACGGAAACTCGATCGGATCTCCAT CATTACT DYLGIKFNFKGRCPV ACTTCTACACGAGGCTGGATGGTCGGTTAACCCTGAAAAAAGCCGGACC ATTAAC AHIDLLNNYLTEISCA CTGGACCTAATCTCTGGTGGCCATTCCAAAATCACAGCGCTCTCTCAGAC ATTATTA PLKPQQRMKILKDNL AGAATTCACCATCGCGGGGATGCGTATACCACCGCTTTCTGCCGCCGAC TAAACA LPRLLYPLTLGIVHLKT ACCTTCGACTACCTGGGTATCAAATTCAACTTCAAGGGCCGATGCCCAG TTATTAC LKSMDRNIHTAIRKW TGGCCCATATTGACTTATTGAACAACTACCTCACGGAAATATCGTGCGCT TATTATT LRLPSDTPLAYFHSPV CCACTTAAGCCGCAGCAGCGCATGAAGATCTTGAAAGATAATCTACTCC ATTATTA AAGGLGILHLSSSVPF CTCGACTCCTATACCCCCTGACTCTAGGAATAGTACACCTGAAAACCCTG CTATTAT HRRKRLETLLSSPNRL AAGTCAATGGACCGAAATATCCACACGGCCATAAGGAAATGGTTGCGG TACTTCT LHKLPTSPTLASYSHL CTACCCTCCGACACCCCGCTAGCATATTTTCACTCACCCGTCGCTGCCGG ACAATT SQLPVRIGHETVTSRE AGGCCTAGGGATCCTCCATCTGTCCTCATCGGTTCCATTCCACCGTCGAA AATATTA EASNSWVRRLHSSCD AACGTCTAGAAACCCTCCTATCTTCACCGAACCGCCTACTGCACAAGTTG TGGCTA GKGLLLAPLSTESHA CCAACTTCCCCAACACTAGCTTCTTATTCACACCTTAGTCAACTGCCAGTT CTCCTCT WLRYPQSIFPSVYINA CGAATTGGGCACGAGACCGTAACGTCTAGAGAAGAGGCTTCCAACAGC CAGCAC VKLRGGLLSTKVRRSR TGGGTGAGACGATTACATTCGTCCTGCGACGGGAAGGGACTACTCCTA ACCAAT GGRVTNGLNCRGGC GCACCACTAAGCACCGAGTCCCATGCATGGCTGCGCTACCCCCAGTCTA AAAATA AHHETIHHILQHCALT TTTTTCCATCTGTTTACATCAACGCCGTTAAATTACGAGGTGGCTTACTAT TCAATCA HDIRCKRHNELCNLV CCACCAAAGTCAGGAGATCTCGCGGAGGTAGAGTGACGAATGGCCTGA AACATCT AKKLRRQKIHFLQEPC ACTGTCGAGGCGGTTGCGCCCATCATGAAACGATCCACCACATTCTGCA CAATTAT IPLEKTYCKPDFIIIRDS ACATTGCGCGCTCACCCACGACATCAGATGCAAACGCCATAACGAACTA ATCCACC IAYVLDVTVSDDGNT TGCAACCTTGTGGCAAAGAAACTGCGTAGGCAAAAAATCCATTTCTTAC TATTAAA HASRLLKISKYGNERT AGGAGCCCTGCATTCCTCTAGAAAAAACCTACTGCAAACCTGATTTTATA CTCTCTC VASIKRFLTSSGYIITS ATTATACGTGACTCAATTGCTTATGTTCTAGACGTCACTGTATCGGACGA TATTTCC VRQTPVVLTFRGILDR CGGAAACACCCACGCCAGCCGCCTGTTAAAAATATCAAAATACGGCAAT CCTGAG ASSQSLRRLCFSSRDL GAGCGAACCGTCGCGTCGATCAAGCGTTTCCTCACATCCAGTGGATATA TTATAAA GDLCLSAIQGSIKIYN TCATTACCAGTGTTCGACAAACACCAGTCGTCCTTACATTCAGAGGTATT CTTACAA TYMRGT(SEQID CTGGATAGAGCAAGTTCACAATCCCTACGACGCCTATGTTTTTCATCCCG TTCAGTC NO:1453) TGACCTCGGTGACCTTTGCCTGAGTGCGATTCAAGGCTCAATTAAAATAT TAACCG ATAATACCTATATGAGAGGAACCTAACGGCTGAACGAATAGCCCCCTTC AATATCT ACTCTTAGACATTCCCCCACTGTTGTTGCTTATCTTCATGCTCTTGTGTTA CTCTTTT ATTGACTGCTCTCTTCTGGGTTGACGTCTGATTGTCTCTCTCTCTTTCCAT ACAAAT ATTGCTTGCTCTGCCCGCTTACTTCCAATAGTTGTCATATTATGTCTTTGT CTTAAGT TTACTTGCCATGTCTAACGACAATTACTTTATCTACCTTAGTTTGTCCTCTT ATGTAA GGTTTCGATTGCCTTCATATGTTCATGGCGGAATCTGATGTTTATAATGA TTTTGTG CTATTCCTATTACCACCACTACAACTACTATTATTATTTTCATTACTATTAA CCAAAC CATTATTATAAACATTATTACTATTATTATTATTACTATTATTACTTCTACA CCATTTG ATTAATATTATGGCTACTCCTCTCAGCACACCAATAAAATATCAATCAAA GGTCTG CATCTCAATTATATCCACCTATTAAACTCTCTCTATTTCCCCTGAGTTATA TACAATT AACTTACAATTCAGTCTAACCGAATATCTCTCTTTTACAAATCTTAAGTAT TGATACT GTAATTTTGTGCCAAACCCATTTGGGTCTGTACAATTTGATACTTAAAAA TAAAAA TAAATGTTATTAGCC(SEQIDNO:1085) TAAATG TTATTAG CC(SEQ IDNO: 1331) R2 R2- Bombus TCAATAGTTACTCGGGGGCAGGCGGGATATTGGTCTTGCCTTGCCCAAG TCAATAGT TGAAAA IAKFDNNTNSASDAA 1_BTe terrestris TCACACTCCTACCTCCTCGTGGTACCGCCGGTAACACGCGCACGTCCACA TACTCGGG CACGAT PLSPGGAVADLSASE TCAGCGAGGGGCGTACTCCCCCGGATGTGGCGGCGCGTGGCTAAACGG GGCAGGC AACGAT GTTDNDQAMSPAM AGTGTGGCGACGAAGGAGCGAAAGACTAACAACTATAACGGTCTTCCG GGGATATT TATGAA SLXTVPLVGNRVACP TAACGGCTACTTGGAGCCGTGAATAATGGAGCCTATATTAAACCCTGGA GGTCTTGC TCAAAA XCEKREANLFFLNLSD ACTCGTTCCTTCGTTCTGTTGACGACTGGAACGGCAAAGGACATGATTT CTTGCCCA TAAGAA LDRHLTQHHPDAPIX GGATAACAATTGGAAACTTAATATCGAATTCACAATTAAATTGCAAAATT AGTCACAC AGTAAA WSCIDCAKCFPKLHG CGATAATAATACGAATAGTGCCTCGGACGCAGCGCCTCTCAGTCCCGGA TCCTACCT CATCCCA ARCHIPKCGGASSQA GGCGCTGTGGCGGATTTATCCGCTTCAGAGGGGACTACGGATAACGAC CCTCGTGG GAAATT RTGEFQCEACPMSF CAGGCTATGTCCCCGGCTATGAGTTTGATNACGGTGCCGCTCGTGGGCA TACCGCCG GTCTAC GSRRGLSTHERHAHP ATCGGGTCGCCTGTCCAGNATGTGAGAAAAGGGAGGCTAACCTTTTCTT GTAACACG GTCTTAT AVRNIKRRGADPPEE CTTAAATCTGTCGGACCTGGACAGGCACCTAACACAACACCACCCGGAT CGCACGTC TTGTTAT NTKSWKVEXVARLK GCCCCGATCTNTTGGTCNTGTATCGACTGCGCGAAATGCTTCCCGAAGC CACATCAG CTATTTA GLWEIFKDHKYPNKE TTCATGGGGCTAGATGCCACATACCTAAGTGTGGTGGCGCTAGCAGCCA CGAGGGG TTTGTTT ISKFLTTKTVDXXKYQ GGCCAGGACAGGNGAGTTCCAGTGCGAGGCCTGTCCCATGAGCTTCGG CGTACTCC A(SEQ RKKLNLIGXESPQEAT GTCGCGCAGAGGGCTATCCACCCACGAACGGCATGCGCACCCTGCCGTC CCCGGATG IDNO: SLATEGGCDLVSSGN AGAAATATTAAAAGAAGGGGAGCGGACCCCCCAGAAGAAAATACAAAA TGGCGGC 1332) ASFGSPVGRNENEEE TCTTGGAAAGTAGAANNGGTAGCACGCTTGAAGGGGCTATGGGAAATA GCGTGGCT LIHEWKLSLKNEINKP TTCAAAGACCATAAATATCCCAACAAAGAAATCAGCAAATTCCTCACCAC AAACGGA TEVPPILKEVYNRLML NAAAACGGTNGATCANATNAAATATCAAAGAAAAAAGCTAAACTTAAT GTGTGGC IWEEHQDDRDSLTES TGGTGNGGAAAGCCCCCAAGAGGCTACCTCACTGGCAACAGAGGGAG GACGAAG LDHFIRTALYELINKIN GGTGCGATCTCGTTAGTTCAGGCAATGCCAGTTTTGGCTCGCCTGTAGG GAGCGAA KNQTDLKTKRAAKTK CCGCAACGAGAACGAAGAAGAGCTCATCCACGAATGGAAGCTCTCGCT AGACTAAC SPKNNRNSRKRFSYA AAAAAACGAAATAAATAAGCCAACCGAGGTGCCCCCTATTTTAAAGGAG AACTATAA RCQELFHECPRRLAD GTTTACAATCGGCTGATGTTAATCTGGGAAGAGCACCAAGATGATCGAG CGGTCTTC AVVNNDQAYLEPAR ACTCTCTAACGGAGAGTCTCGATCACTTTATACGCACAGCTCTATACGAG CGTAACGG QPPGSEEVRGLYEKL CTTATAAACAAAATAAATAAAAACCAAACGGATTTAAAAACTAAAAGAG CTACTTGG WGQVGSTYVPAPVT CAGCTAAAACAAAAAGCCCAAAAAATAATAGAAACTCCAGGAAGCGAT AGCCGTGA RVPKLSLSEIFPPIAAE TCTCATACGCTCGTTGCCAGGAGTTATTTCACGAATGCCCAAGGAGACT ATAATGGA DVGERIGKIRKKAAA GGCTGATGCCGTGGTCAATAACGATCAGGCATATCTCGAACCAGCCAG GCCTATAT GPDGLQRDHLTIPGL GCAACCTCCCGGATCAGAGGAAGTGAGGGGGCTCTACGAAAAGCTGTG TAAACCCT PIIMAKIYNILVYCSYF GGGACAAGTGGGTTCCACGTACGTCCCGGCTCCAGTTACGAGAGTCCCC GGAACTCG PSAWKENRTTLIPKIN AAACTATCCCTATCCGAGATCTTCCCGCCGATAGCGGCTGAGGATGTGG TTCCTTCG KPCSLVENWRPITISP GGGAGAGAATCGGCAAAATAAGAAAGAAAGCTGCGGCAGGACCGGAT TTCTGTTG ILGRIFSSIIDGRIRRGT GGATTACAAAGAGATCACCTAACTATCCCCGGCCTGCCTATCATAATGG ACGACTGG VLNMRQKGFTSENG CGAAAATCTATAATATACTCGTATATTGTTCTTATTTTCCCTCCGCATGGA AACGGCAA CKINIELLNSALNYSKR AGGAGAATAGAACAACTCTTATTCCAAAAATAAATAAGCCATGCAGCCT AGGACATG NSGGIFTIVDISKVFD GGTCGAGAACTGGCGACCAATTACTATTAGTCCGATTCTAGGCCGAATC ATTTGGAT TVPHAALKPCLAKKG TTCTCCTCCATTATCGACGGGAGGATAAGAAGAGGCACTGTATTAAATA AACAATTG VPALIVDLIDEMYKN TGAGGCAGAAGGGCTTTACATCTGAAAACGGATGTAAAATTAACATCGA GAAACTTA VKTTIKTKDGGVEIMI ATTATTAAATTCTGCCTTAAACTATAGTAAAAGAAATAGCGGCGGGATA ATATCGAA RRGVKQGDPLSPLLF TTCACTATCGTGGATATCTCGAAAGTTTTTGATACAGTGCCTCATGCAGC TTCACAAT NLCLEPLLEEIEEQAS CTTAAAACCTTGTCTGGCAAAAAAGGGTGTGCCCGCCCTTATCGTCGATT TAA(SEQ GINVSEHRKVSVLAF TGATCGACGAAATGTATAAAAACGTAAAAACTACTATAAAAACTAAAGA IDNO: ADDIVLLGADAREAQ TGGCGGGGTCGAGATTATGATCCGCCGAGGAGTTAAGCAAGGCGACCC 1209) HQINVLTDYLQSLM CCTATCGCCCTTACTCTTCAATTTATGCTTGGAGCCACTGCTGGAAGAGA MNLSIEKCQTFEVVA TCGAGGAGCAAGCCAGTGGAATAAATGTTAGCGAACATCGTAAAGTAT KKDTWFIKEPGLKIG CAGTTCTGGCCTTCGCAGACGATATCGTTCTACTTGGAGCGGACGCGAG NQIMPTVDPDEAFKY GGAAGCGCAACACCAAATAAACGTACTTACCGACTACCTGCAAAGCCTC LGAKIGPWKGVHCG ATGATGAATCTATCAATTGAAAAATGCCAAACCTTCGAGGTTGTGGCCA VIVPELLSVVKRVRKL AAAAGGATACCTGGTTCATTAAGGAACCTGGACTTAAAATCGGGAATCA SLKPGQKVELLTKYIF AATAATGCCTACTGTAGATCCCGACGAGGCTTTCAAATACCTGGGCGCC PRYIYHLLVSPPSDTV AAAATTGGTCCCTGGAAAGGCGTTCATTGTGGTGTAATTGTTCCAGAAC LKLLDSEVRQEVKTIL TTCTGAGCGTGGTGAAAAGGGTGAGGAAACTCTCCCTCAAGCCGGGCC HLVPSTATGFFYTPKA AGAAGGTGGAACTTCTAACTAAATACATCTTCCCCCGCTACATTTACCAT CGGIGIPRFEHIIKLGT CTACTTGTAAGTCCGCCAAGCGATACCGTCCTTAAACTACTAGACAGCG LKSAIKIANSIDPAVA AGGTCAGACAAGAAGTTAAAACTATCTTACATCTTGTGCCTTCCACTGCC GLIDDAAIKKLKQTA ACAGGCTTCTTTTACACTCCAAAGGCCTGTGGAGGAATAGGAATACCGA NSLRINWPASLEDIEK GGTTTGAACATATAATCAAACTCGGTACCCTTAAAAGTGCAATAAAAAT ARKRLRKEHISQWAD CGCAAACTCGATCGATCCAGCGGTCGCTGGCCTAATCGATGATGCAGCC LKCQGQGVPDFIKNK ATAAAAAAGTTAAAGCAAACGGCCAACTCCTTGCGGATCAATTGGCCAG TGNLWLEDHSLLKPS CCTCCTTGGAGGATATTGAAAAAGCTCGTAAACGTCTAAGGAAAGAGCA RLIDALRLRTNTFGTR TATCAGCCAATGGGCTGATTTAAAATGCCAGGGACAAGGCGTTCCTGAT SVLARADKNIDVTCR TTTATCAAAAATAAAACTGGCAACCTGTGGCTTGAGGACCATAGTCTAC RCRAQPETLGHILGLC TCAAGCCATCGAGACTCATCGATGCCCTTCGATTAAGAACAAATACCTTT QHTKGLRIKRHDEVK GGTACAAGATCGGTGCTGGCACGGGCCGACAAAAATATTGATGTAACA SLLEGRLKSKKNNEVF TGTCGAAGATGTCGTGCCCAGCCCGAGACCCTTGGACACATACTCGGGC VEPTIKAGGSLFKPDL TATGTCAGCACACTAAAGGCTTAAGAATCAAAAGGCACGATGAGGTTAA VIKNGERVLVVDVTV ATCCCTCCTCGAAGGAAGATTAAAAAGTAAAAAGAACAACGAAGTATTT RYENKNYLALAEKEKI GTAGAGCCGACGATTAAGGCCGGAGGCAGTTTATTTAAACCAGACCTC EKYRPCLRALKEIFNA GTAATTAAAAACGGGGAAAGGGTTCTCGTGGTCGACGTAACTGTCCGCT KGGEILPVVLGSRGTI ACGAGAACAAAAACTACCTGGCCTTAGCCGAAAAAGAAAAGATAGAGA TPNTEKVLKRLGIAN AGTATCGGCCATGCCTAAGGGCATTAAAAGAAATATTTAACGCCAAAGG NDIKTILLNVLRSSIEL AGGAGAGATTCTTCCGGTGGTCTTGGGTAGTAGAGGCACTATCACGCCT CNIFIDD(SEQID AACACCGAGAAAGTCCTCAAGCGATTGGGAATCGCTAACAATGATATAA NO:1454) AGACAATACTCTTAAATGTATTAAGAAGCTCGATAGAGCTGTGTAATAT ATTCATTGATGACTGAAAACACGATAACGATTATGAATCAAAATAAGAA AGTAAACATCCCAGAAATTGTCTACGTCTTATTTGTTATCTATTTATTTGT TTA(SEQIDNO:1086) R2 R2- Crocodylus AGGCGTCTCCTTTAAGGGCAACGGTCTGGTTACGCGGTCGCAGCAGKCT AGGCGTCT TGAATC VPPGAEARGRYHHP 1_Crp porosus TGCKCCAGGTACCTCCWCGTGGTTCCCGCCGGGTGCCGMAGMCCCAG CCTTTAAG CCACTCT RXEXARQGEPPSXRV GSCTGTCGGTAGCTCGATCCTGGCACAGTAMGGCCAGGGGAGTKCTTC GGCAACG GGGGAC FLVXLPDSNPPCPICG CTTGCTGCWGGTGCCCCACAAGCGTKTGGCAGSMCMCCTGCTTCWTC GTCTGGTT CCCCAA DHVXXXSVLALHCVE GCAAMAATMASAGKGTSMTCAGTAGTCGGCCCCGCCGCTAGCCAAAA ACGCGGTC AAATTA GHXWAXVQYQCTH CTGTTCGCCACSCAGTTACAGATGGTGCCTGTGCTGACCKGKKGCCCCGT GCAGCAGK GAAAAA CGILCHIPRCQGRVXE GGGCTCGGWGGTGCAGGGGWGGCCGGCCCATGGCTGGGCCAGACST CTTGCKCC CCCAAA XTGKDXXCPECPASF GGGCCKTGGAGCCCGCTCCCAMCCCAGAGTTCCCCCTCAAMGCCGCCA AGGTACCT ACAGTT DEKAGLSQHKRHTVT GGKCAGMAMCAKCCGGGGAGKGMCACGGCCCCGGCCGTGAGTCCCC CCWCGTG GTGTTTA XSXERVAGXLLRAXLR CCCGGTGCWGAGGCCCGGGGCCGATACCACCACCCCCGGTGMGAGM GTTCCCGC AGTGTG HGCWSVEEEETLTRL GCGCCCGCCAGGGTGAACCACCCTCCAMCAGGGTTTTCCTGGTGASGCT CGGGTGCC TTCTTGT DAMFXGARNINQLIA GCCTGACTCAAACCCKCCGTGCCCGATTTGCGGGGACCACGTGSGTWA GMAGMCC TTGTCCC AEXVSKMXKQISDK ACWCTCCGTGCTGGCCCTCCACTGCGTGGAGGGCCACGWGTGGGCGS CAGGSCTG TTTGGCT WRXLXLXPEQTTXGG ATGTGCAGTACCAGTGCACCCATTGTGGCATCCTCTGCCACATCCCACGT TCGGTAGC TCACCTC XAESASVVXXESMTP TGCCAGGGGAGGGTGCMKGAGCWGACCGGGAAGGACTKGGMTTGTC TCGATCCT CAAGTT EMEAQSPAXPPGKIR CTGAGTGCCCTGCCTCCTTCGATGAGAAAGCWGGCCTATCGCAGCACA GGCACAGT GCGATC KIFTGQDGHAGGXA AGCGGCACACGGTAACACAMAGTGWKGAACGGGTTGCAGGKWACCT AMGGCCA CCCCATC WENQEDFHWTRWA GTTGAGGGCTKGTTTAAGGCACGGGTGCTGGTCGGTGGAAGAAGAGG GGGGAGT TCCCCTG RRWLKRGQXLSDKV AAACTCTCACCCGCCTWGACGCGATGTTCCSTGGCGCCCGGAACATCAA KCTTCCTT CGCTGT QEVLGXWVEGQPRI CCAGCTGATCGCTGCTGAGMTGGTWTCGAAGATGCSTAAGCAGATCAG GCTGCWG CTTTCTG XAWVDXVSLDVLTLF CGACAAATGGAGGTWGCTCKSTCTGWGTCCTGAGCAGACCACATWGG GTGCCCCA AATGAC LGVPPGPQRAPSKKG GTGGTGAKGCTGAGTCKGCCTCCGTGGTGSAGGWWGAGTCCATGACT CAAGCGTK CAGTGG PXEGGKPTSWMNKR CCGGAGATGGAAGCCCAAAGCCCAGCTGAMCCGCCTGGGAAAATCAG TGGCAGS TGTTGA AIKWGTFLRYQHLFG GAAGATTTTCACTGGACAAGATGGGCACGCCGGTGGCTGAAGAGGGG MCMCCTG GGCTGG ANRKLLAAXILDGAX GCAGGKTCTGTCAGACAAGGTGCAAGAAGTCCTGGGCKCGTGGGTGGA CTTCWTCG TGTGAC RNQXTLLLEEVXQXY GGGTCAACCCAGGATTCMTGCCTGGGTCGACKCMGTCTCCCTTGATGT CAAMAAT CTCGGT XGKWEAEPPFEGLG TTTGACTTTGTTCTTGGGGGTGCCCCCGGGACCGCAGAGGGCTCCGAGC MASAGKG CACCTCC RFGXXRDVDSFAFEA AAGAAGGGGCCCAMGGAGGGTGGCAAACCWACGTCCTGGATGAACA TSMTCAGT AAGGCC LITXEEAVKHMMXM AACGTGCCATCAAATGGGGCACATTCCTTCGGTACCAGCACCTGTTTGG AGTCGGCC AAGTGC AXXSAPGPDKLTLRD TGCCAACAGGAAGCTCCTGGCGGCCMTCATCCTGGATGGTGCTGASCG CCGCCGCT CCTGGC LRRADPEGDALAELF TAATCAGTGSACCCTCCTGCTWGAGGAGGTCTTKCAGGMCTACCKAGG AGCCAAAA CCCGAG SLWXITGVVPDRLKE AAAGTGGGAGGCCGAGCCGCCTTTCGAGGGTCTTGGACGGTTTGGGST CTGTTCGC TAGGAC XQXVLIPKAVDFEKLR CCAMMGGGATGTGGATAGCTTCGCATTCGAGGCCCTGATCACTCKGGA CACSCAGT CAGGTG QLGNWRPITIXSIVLQ GGAGGCTGTCAAGCACATGATGGMAATGGCCWMSAAMTCGGCTCCA TACAGATG GCCCAG LXSRVLTARLTAACPI GGTCCGGATAAGCTCACCCTGAGAGATCTGCGCCGCGCKGACCCCGAG GTGCCTGT CTTGCT XPHQQGFISAPXCAE GGAGATGCTCTTGCCGAACTGTTCAGCCTGTGGMTGATCACTGGCGTG GCTGACCK GGGCAC NLKXPELIXRKVKXDR GTCCCGGACAGACTCAAGGAGTKTCAGTSTGTGCTGATACCAAAGGCAG GKKGCCCC CCGTCA RPLGVAFVDXARAFD TGGACTTTGAGAAGCTGAGGCAGCTGGGCAACTGGCGCCCCATCACTAT GTGGGCTC CCGCCC SVSHDXISWVLKAKG CGSGTCCATTGTKTTGCAGCTATTMTCACGGGTGCTGACCGCACGGCTC GGWGGTG AGGGCA VDQHIVNLIEDSYQK ACCGCAGCWTGCCCCATAASCCCCCACCAGCAGGGKTTCATMTCGGCA CAGGGGW AATGGA VTMRVQVFSGSTPPI CCKKGGTGTGCCGAAAACCTGAAGMTGCCTGAGCTCATCTTSCGSAAGG GGCCGGCC AGGGAT SIKXGVKQGDPMSPL TGAAGCAMGACAGGCGGCCGCTGGGGGTGGCATTCGTGGATMTAGCS CATGGCTG CTATCCT LFNIAMDPLIXKLKTV AGGGCSTTCGACTCGGTATCGCACGACMACATTTCWTGGGTCCTAAAG GGCCAGAC GACCAC RQGVKVGSASLTTLA GCCAAAGGGGTGGACCAACACATCGTGAATCTCATWGAAGATTCATAC STGGGCCK TACCAG FADGLXLLXDSWEG CAGAAGGTTACCATGAGAGTACAAGTGTTCAGTGGTTCCACCCCWCCC TGGAGCCC GCTAAG MQHNITTSXTPGRAC ATCAGCATCAAAWCCGGGGTCAAGCAAGGCGACCCGATGTCMCCCCTS GCTCCCA TGTGGT NTTSRHPRGLLQPHG TTGTTCAACATTGCGATGGACCCCCTGATAGSCAAGCTCAAGACAGTCA MCCCAGA GCTGCC PTSATXKMXGVLLES GGCAGGGAGTGAAAGTCGGGAGTGCTTCCCTGACCACCCTGGCATTTG GTTCCCCC TAGCCT XMRLLYGEQLRGLED CGGATGGCCTCGWMCTGCTGAKTGACTCCTGGGAGGGCATGCAACAC TCAAMGC GCCGTA XRPXXHDAXARRAD AACATCACGACATCCTAGAGGACTTCTGCAACCTCACGGGCCTACGAGT CGCCAGGK AGGTCA TISGLEXRSLGWDXQ GCAACCCAMAAAATGKCAGGGGTTCTTCTTGAATCCGAMATGCGACTC CAGMAMC AGCGCC TRFGYATXLLAREDG CTWTACGGTGAACAACTGCGMGGCCTGGAAGATWGCCGGCCGTGMS AKCCGGG CTGCTG DCXAQTNAEALCWX ATCACGATGCTSGGGCCAGGCGAGCCGACACGATATCTGGGCTTGAAK GAGKGMC CCGCTC SXPFPGCAXRPXYAN GTCGGTCCCTGGGTTGGGATWGGCAAACCAGATTTGGGTACGCAACTA ACGGCCCC GGGTAK XGWVASEALDSMSR AKCTCCTGGCTCGAGAGGATGGGGACTGCCWCGCTCAAACCAATGCAG GGCCGTGA CAGTCCT RXVKEWFHLPACTDX AAGCTCTTTGCTGGTSCAGTATKCCATTCCCAGGCTGCGCTMTCAGGCC (SEQID CGTTCAC LLXSRHRDGGLGLLRL GWTTTATGCGAACMCCGGTTGGGTGGCCTCGGAGGCTCTGGATTCCAT NO:1210) TCGTCCC ARXXLAAXVRRPIRV GAGCCGAAGAAAKGTGAAGGAGTGGTTCCACCTACCCGCCTGTACCGA TCCTAGT ATSSDEVTRKVSYAC CKGCCTCCTCCASTCCCGCCACCGGGATGGGGGCCTKGGGTTGCTGCGC ACCCTCC GISDEVERLXLAXGG CTGGCGAGAGMCWTTCTGGCAGCGSAAGTGAGGAGGCCGATTCGTGT GCMTCT DXSNVPRFEDPXAPK AGCCACCTCATCTGATGAAGTCACGCGGAAAGTATCCTACGCATGTGGG GCGCTT SXXVQGPHEAAQET ATAAGTGATGAAGTGGAGCGKCTCTGKTTGGCMAGWGGGGGAGACAT CTGCTAT PRVVRTQAIPWPSN STCCAATGTCCCGAGGTTCGAGGATCCTGAWGCCCCGAAGTCTSCAKGG CCACTGT WRAEEHSKWAQLSC GTGCAGGGCCCCCATGAGGCTGCCCAGGAGACACCACGTGTTGTCCGG GGCCGG QGERVELFCNDPVSN ACACAGGCGATCCCCTGGCCCTCCAACTGGAGAGCCGAGGAACACTCCA WGATGC GWINSRGQLAERLW AATGGGCCCAACTCAGCTGTCAGGGAGAGAGGGTGGAGTTATTCTGCA CGAGG IMALKLRSNIYPTREF ATGATCCAGTCAGCAATGGCTGGATCAACAGTCGGGGACAACTGGCGG WTGSW LGRGQAGTNIGCRH AACGCCTCTGGATCATGGCCTTAAAACTGCGTTCGAACATTTATCCCACC MCAWC CTHPRETLGHILGICP AGGGAGTTTCTTGGGAGAGGCCAGGCAGGTACCAACATAGGTTGCCGG CTCGAC AMQEARILRHNKLCK CACTGTACACACCCTCGTGAAACATTAGGGCATATCCTGGGCATATGTC ACCTCCA ILAAEGKNCEWTVYY CTGCCATGCAGGAGGCACGGATCCTCCGGCACAACAAGCTGTGCAAGA GGGCCA EPHLHNAAGELRKPD TCCTGGCAGCTGAGGGCAAAAACTGTGAGTGGACAGTGTATTATGAGC AGCGCC LIFVRDGTALVVDITV CGCACCTACATAACGCAGCTGGGGAACTGCGTAAACCTGACCTCATTTT TTGGCC WYEGGPATLLSTTAE TGTCCGAGACGGTACCGCTCTGGTTGTGGATATCACTGTGTGGTATGAA MCAGGT KATKYLDLNTQIQELT GGGGGCCCCGCAACCCTCCTGTCCACCACTGCAGAAAAGGCCACAAAAT AGGACC GAEQVTYFGFPIGAR ACCTGGATCTGAACACACAGATCCAGGAGCTCACAGGGGCTGAGCAGG TGGCAC GKWHADNWRVLSEL TCACCTACTTTGGCTTCCCTATAGGAGCCAGGGGAAAGTGGCATGCTGA CTGCCC GLSNSRKERVTRLLS CAATTGGCGAGTACTGTCTGAACTGGGATTGTCCAACTCCCGGAAGGAG AGGGGG WRALLGSVDMVNIF CGGGTCACACGGCTCCTGTCGTGGCGAGCACTGCTCGGGTCAGTGGAC CCAGAC VSKHRQENLLDEHCT ATGGTGAACATCTTTGTATCTAAACACAGGCAGGAAAACCTATTGGATG ACTGCCT PAEQVVSSYAS(SEQ AGCACTGCACCCCAGCTGAGCAAGTIGTGAGTTCTTATGCGAGTTGAAT GCGGCA IDNO:1455) CCCACTCTGGGGACCCCCAAAAATTAGAAAAACCCAAAACAGTTGTGTT AGGGAA TAAGTGTGTTCTTGTTTGTCCCTTTGGCTTCACCTCCAAGTTGCGATCCCC AGGGAG CATCTCCCCTGCGCTGTCTTTCTGAATGACCAGTGGTGTTGAGGCTGGTG CCGTCCC TGACCTCGGTCACCTCCAAGGCCAAGTGCCCTGGCCCCGAGTAGGACCA TGACCG GGTGGCCCAGCTTGCTGGGCACCCGTCACCGCCCAGGGCAAATGGAAG TTACCG GGATCTATCCTGACCACTACCAGGCTAAGTGTGGTGCTGCCTAGCCTGC GGCTTG CGTAAGGTCAAGCGCCCTGCTGCCGCTCGGGTAKCAGTCCTCGTTCACT ATGGTG CGTCCCTCCTAGTACCCTCCGCMTCTGCGCTTCTGCTATCCACTGTGGCC CTGGCT GGWGATGCCGAGGWTGSWMCAWCCTCGACACCTCCAGGGCCAAGC AGCCCG GCCTTGGCCMCAGGTAGGACCTGGCACCTGCCCAGGGGGCCAGACACT CCGTAT GCCTGCGGCAAGGGAAAGGGAGCCGTCCCTGACCGTTACCGGGCTTGA GTCAAG TGGTGCTGGCTAGCCCGCCGTATGTCAAGCACTCCACAGCTGCTCGAGT CACTCCA TCGCTGGCTTCACCTTCATCCCWCCTAGTGCCTTCCGCCTCTGCGCTATT CAGCTG TTCGTCCCGACTCGTACCTCCCCACCTCTGCGCTACMGCTATCCCTGMA CTCGAG AGGACCAAGTGGGCAGGGGSGTTCGCCCCCCGCCCGCAGGAGGCTCGG TTCGCT CGTATCCGTGGCKTCWTGCCTCCACCGTCTTGTGCCGCTAGAGGGGTAC GGCTTC CTCMGAGACCGGCGCAACACGACCTTGACGSTTAGACAGTAGGGTGM ACCTTCA AACAKCCCTGCTGCAGGCSTGAAGGGCCAAACGGCTGTGCCATGAGAG TCCCWC GGGAACCTTGAAGACCGGGSCAGTCAGCCAGTTAGTCAGTTGGGCGAA CTAGTG ACAATCCCAGCTGCAGGCCCAMMAGGGCTGTCAGGTGAGGGGGTATC CCTTCCG CCCAWCCACCCCCCGCCGCCGACTACGGAGGCAKGAAGTCCCTAGTGA CCTCTGC CTTCKGACCCCCACGTCTTGTGCCGGGAGAGGGGAACCTTGAAGATCG GCTATTT GGGCAAGCCGCACTTGATAGTTAGCCAGTCKGGTGAAACAATCCCAGCT TCGTCCC GCGGGTCCGAAAGGGCCGACTWCCAGGCGAGGGGGGCCTGCGGAAA GACTCG MCCCCCTCCATGGTACGGAGKTCTGGCGTCCTMACCGACWCCTTGCCA TACCTCC CCAACGTCTTGTGCCGGGAGAGGGGAACCTTGAAGATCGGGSCAAGCT CCACCTC GCACTTGATGGTTAGTCAGTCGGGTGAAATAATCCCAGCWGCCCCCGC TGCGCT TGTGACTGCTAAGMCWGGTCCCCAAGGGGCATGAGGCATSTGCGCTG ACMGCT AGCCGGSAGGGGTGACACMCGGCGATCGGCGCAGCACAKASTGAAGG ATCCCTG GAGGCACTTGCTGAGACTGCTTCTGAGGCCCCAGACTTGGGGTGGTGC MAAGG AGCCTTGTCTGGGGTATGGTACAGCACCCTACTGCTCCCTTTGGKCAGC ACCAAG AGAATTCGTCCCGACCTCTTACCCACCCGAGTCTGCGCCTTGTTCCGCTA TGGGCA TCCTGCATCTCCGATCCACCTCGCTGTCTCCCCGCTGCGCTGCTTTTCTCT GGGGSG CAAGTGGGTTAAATCTTGTCATGATTACCTCCCACGTTTCCGCTCAAGGG TTCGCCC CAATGCCCAAMATGACGGGGATCGCTGGTGCATGGCAGTCATGAGACC CCCGCC ATCCGGACCCTCCGGTGGTCGCTATAGTCATTTTKTGTTGCATGGGGCAT CGCAGG SCTGAGTCACTTAACCGAAAGACTCWAAATAACTCAAAAGAGGKAMCC AGGCTC TCTGSGGTTCGGTAAA(SEQIDNO:1087) GGCGTA TCCGTG GCKTCW TGCCTCC ACCGTCT TGTGCC GCTAGA GGGGTA CCTCMG AGACCG GCGCAA CACGAC CTTGAC GSTTAG ACAGTA GGGTG MAACAK CCCTGCT GCAGGC STGAAG GGCCAA ACGGCT GTGCCA TGAGAG GGGAAC CTTGAA GACCGG GSCAGT CAGCCA GTTAGT CAGTTG GGCGAA ACAATC CCAGCT GCAGGC CCAMM AGGGCT GTCAGG TGAGGG GGTATC CCCAWC CACCCCC CGCCGC CGACTA CGGAGG CAKGAA GTCCCTA GTGACT TCKGAC CCCCAC GTCTTGT GCCGGG AGAGGG GAACCT TGAAGA TCGGGG CAAGCC GCACTT GATAGT TAGCCA GTCKGG TGAAAC AATCCC AGCTGC GGGTCC GAAAGG GCCGAC TWCCAG GCGAGG GGGGCC TGCGGA AAMCCC CCTCCAT GGTACG GAGKTC TGGCGT CCTMAC CGACWC CTTGCCA CCAACG TCTTGTG CCGGGA GAGGGG AACCTT GAAGAT CGGGSC AAGCTG CACTTG ATGGTT AGTCAG TCGGGT GAAATA ATCCCA GCWGCC CCCGCT GTGACT GCTAAG MCWGG TCCCCAA GGGGCA TGAGGC ATSTGC GCTGAG CCGGSA GGGGTG ACACMC GGCGAT CGGCGC AGCACA KASTGA AGGGAG GCACTT GCTGAG ACTGCTT CTGAGG CCCCAG ACTTGG GGTGGT GCAGCC TTGTCTG GGGTAT GGTACA GCACCC TACTGCT CCCTTTG GKCAGC AGAATT CGTCCC GACCTCT TACCCAC CCGAGT CTGCGC CTTGTTC CGCTATC CTGCATC TCCGATC CACCTC GCTGTC TCCCCGC TGCGCT GCTTTTC TCTCAA GTGGGT TAAATCT TGTCAT GATTAC CTCCCAC GTTTCC GCTCAA GGGCAA TGCCCA AMATGA CGGGGA TCGCTG GTGCAT GGCAGT CATGAG ACCATCC GGACCC TCCGGT GGTCGC TATAGTC ATTTTKT GTTGCA TGGGGC ATSCTGA GTCACTT AACCGA AAGACT CWAAAT AACTCA AAAGAG GKAMCC TCTGSG GTTCGG TAAA (SEQID NO: 1333) R2 R2- Drosophila GAAGCTGGGTCGGATGAGCGCAGAAGGGGTGTTCTTTGGAACACTGTA GAAGCTG TAGATG FERRSNSWGYRPLEP 1_DWi willistoni ATTCATAAGTCGTAAGTCTGATCAAGTCGACTCGAAACCTCCTCGTGGT GGTCGGAT TACTAAC RSVGTESNNNSPRSN GTTTCCTGGGTGCTGTTGAGTTCCTAGTCTCTAGGTTCTTTTCAGTAGCT GAGCGCA CTCTAGC ITITSATSRPGDQPRE AATTCGAGCGGCGAAGCAACTCTTGGGGTTACCGGCCCCTTGAGCCAA GAAGGGG TTTTCTT AIAVVNLAGEIPCAVC GAAGCGTTGGTACAGAATCAAATAATAATAGTCCTCGGAGCAATATCAC TGTTCTTT ATACTTT GRLFNTRRGLGVHM TATCACTTCAGCGACTTCACGTCCTGGAGACCAACCGAGAGAGGCTATA GGAACACT TGCCTG SHQHKDELDTQRQR GCAGTGGTAAATCTCGCGGGAGAGATTCCCTGTGCAGTATGCGGGCGC GTAATTCA CTACCTT EDVKLRWSEEEAWM CTCTTCAATACTAGAAGGGGGCTCGGTGTACACATGTCACATCAACACA TAAGTCGT GGCATT MARKEVELEASGNLR AAGACGAACTAGATACGCAACGTCAGCGTGAAGATGTAAAACTCCGAT AAGTCTGA ACATCTA FPNKKLAEVFTHRSSE GGAGCGAGGAAGAAGCGTGGATGATGGCGAGAAAGGAGGTGGAGCT TCAAGTCG AAAAGG AIKCFRKRGEYKAKLE CGAAGCAAGTGGTAATTTGAGATTTCCTAATAAGAAGCTAGCGGAAGTA ACTCGAAA TACAAA QIRGQSTPTPEALDSI TTTACTCACCGTAGCTCCGAAGCAATTAAATGTTTTCGGAAGAGGGGTG CCTCCTCG CATCGC TSQPRPSLLERNHQV AATATAAGGCAAAACTGGAGCAGATCAGAGGGCAATCTACTCCCACCCC TGGTGTTT ATTGTCA SSSEAQPINPSEEQSN AGAAGCGTTGGACTCTATTACCTCACAGCCTCGCCCTAGTTTACTCGAGC CCTGGGTG TAAAGA WEIMRILQGYRPVEC GAAACCACCAAGTATCATCGTCGGAAGCGCAACCAATCAATCCATCAGA CTGTTGAG GGTGGT SPRWRAQVLQTIVD AGAACAGTCGAACTGGGAAATCATGCGGATACTACAGGGCTATCGCCC TTCCTAGT TTTAGTA RAQAVGKETTLQCLS CGTAGAATGTAGTCCCCGGTGGAGAGCCCAGGTCTTGCAAACTATCGTA CTCTAGGT CGTAGG NYLLEVFPLPNEPHTI GATAGGGCGCAGGCCGTAGGGAAGGAAACCACTCTCCAATGCTTATCC TCTTTTCA CGCTGT GRSNLRRPRTRRQLR AACTATCTCCTGGAAGTATTTCCATTACCAAACGAACCACACACCATCGG GTAGCTAA GGGACT QQEYAQVQRRWDK TCGGAGCAATTTGCGAAGACCTCGAACTAGGAGACAGTTAAGACAACA (SEQID TCATTGT NTGRCIKSLLDGTDES AGAGTACGCACAGGTTCAGCGTCGTTGGGATAAGAATACTGGGAGATG NO:1211) CCCGGT VMPNQEIMEPYWK CATTAAATCCTTGCTTGATGGAACAGATGAGTCGGTTATGCCAAACCAA GATGCA QVMTNPSTCSCDNT GAGATAATGGAACCCTATTGGAAACAAGTAATGACGAATCCCAGCACAT GTGAAT RFRMEHSLETVWSAI GCTCTTGCGATAACACAAGATTCCGTATGGAACATTCGCTTGAGACGGT CGTGCA TPRDLRENKLKLSSAP TTGGTCAGCGATAACGCCACGCGACCTGAGGGAAAATAAGTTAAAGTT TACGAG GPDGITPRTARSVPL GTCAAGTGCTCCGGGTCCTGACGGTATCACTCCAAGAACAGCCAGGAGT ATTGTCC GIMLRIMNLILWCGK GTACCCTTAGGCATTATGCTACGCATAATGAACCTGATTCTCTGGTGCGG AGTAGT IPFSTRLARTIFIPKTVT CAAAATACCATTCTCTACCCGACTGGCCAGAACTATCTTCATTCCGAAGA TGGTTG ANRPQDFRPITVPSV CTGTGACGGCAAATCGACCGCAAGACTTTCGTCCAATAACAGTCCCCTC CTCGTAT LVRQLNAVLASRLAS GGTTTTGGTCAGGCAATTAAACGCTGTTCTGGCTTCTCGATTGGCTTCTA CTTTAGA KVNWDPRQRGFLPT AAGTCAACTGGGATCCAAGGCAGCGCGGTTTCCTACCTACCGATGGGTG AGATTTC DGCADNATLVDLILR TGCTGATAATGCGACGTTGGTTGATCTCATTTTGCGGGAGCACCATAAA CTTCCTC EHHKRWKSCYLATV CGGTGGAAGTCATGTTACCTTGCGACGGTGGATGTCAGCAAGGCTTTTG GGCGAT DVSKAFDSVSHQAIIK ACTCAGTATCACACCAGGCCATTATCAAGACTTTACAGGCCTATGGTGCT CAAAAA TLQAYGAPTNFVSFIE CCAACAAACTTTGTCAGCTTCATAGAAGAACAGTATAAGGGCGGCGGA AAAAAA EQYKGGGTSLNGAG ACCTCCCTCAATGGGGCAGGATGGAGTTCAGAGGTGTTTATACCCGCGC AAAAAA WSSEVFIPARGVKQG GGGGCGTTAAGCAAGGTGACCCTCTGTCTCCACTATTATTTAATCTTATC AAAAAA DPLSPLLFNLIIDRLLR ATTGATAGATTACTTAGGTCCTACCCCAGAGAGATTGGTGCCAAAGTCG (SEQID SYPREIGAKVGNTMT GAAATACCATGACAAGCGCGGCAGCGTTCGCGGATGATCTGGTGCTATT NO: SAAAFADDLVLFAET TGCGGAAACTCCGATGGGGCTTCAAACATTGTTGGATACCACGGTAGGC 1334) PMGLQTLLDTTVGFL TTCCTAGCCTCCGTGGGACTCTCCCTTAATGCTGATAAGTGCTTCACTGT ASVGLSLNADKCFTV CAGTATAAAGGGGCAAGCCAAGCAGAAGTGTACTGTCGTAGAACGACG SIKGQAKQKCTVVER GAGCTTTTGTGTAGGTGAGCGCGAGTGTCCTTCATTGAAGCGTACTGAA RSFCVGERECPSLKRT GAGTGGAAGTATTTAGGTATCCGGTTCACTGCGGATGGGCGGGCTCGG EEWKYLGIRFTADGR TATAGTCCAGCAGACGACCTCGGTCCGAAGCTGTTAAGATTAACAAGAG ARYSPADDLGPKLLRL CCCCTCTGAAACCACAACAGAAGTTATTTGCACTTAGGACTGTCCTTATC TRAPLKPQQKLFALR CCACAACTCTATCACCAACTAACACTTGGGAGTGTGATGATAGGCGTCC TVLIPQLYHQLTLGSV TAAGAAAATGTGACAGATTGGTACGGCAATTCGTAAGGAGATGGTTAG MIGVLRKCDRLVRQF ATCTCCCACTGGATGTACCAGTTGCTTACTTTCACGCCCCCCACACTTGT VRRWLDLPLDVPVAY GGGGGTCTCGGGATTCCGTCAATTAGATGGATAGCACCGATGCTGCGTC FHAPHTCGGLGIPSIR TGAAGCGATTGAGCAATATTAAATGGCCCCACCTCGAACAATCCGAGGT WIAPMLRLKRLSNIK AGCTAGCTCTTTCATTGACGACGAATTGCAAAGGGCTCGAGATAGATTA WPHLEQSEVASSFID AAGGCGGAAAATGTGCAGCTGTGTTCGCGTCCAGAGATTGACTCGTATT DELQRARDRLKAENV TCGCAAATAGATTGTACATGTCTGTTGATGGTTGCGGTCTCCGTGAAGC QLCSRPEIDSYFANRL AGGTCATTATGGCCCGCAACATGGATGGGTGAGTCAGCCCACGCGCTT YMSVDGCGLREAGH GCTAACAGGAAAGGAATATTTGCACGGTGTCAAATTGCGGATAAATGCC YGPQHGWVSQPTRL CTACCCTCGAAGTCTCGTACGACGAGGGGAAGGCACGAATTGGAGAGA LTGKEYLHGVKLRINA CGGTGTCGTGCAGGATGTGATGCTCCCGAGACAACAAACCACATCTTGC LPSKSRTTRGRHELER AAAAATGCTATCGTACGCATGGGAGGCGGGTAGCTAGACACAACAGCG RCRAGCDAPETTNHI TAGTAAATGCCGTCAAGCGGGGACTTGAACGGAAAGGCTGCGTTGTCC LQKCYRTHGRRVAR ATGTCGAACCAAGTCTGCAATGCGACTCGGGCTTAAATAAACCGGACCT HNSVVNAVKRGLER GGTGGGAATCCGACAGAATCACATTTATGTGATAGACGTTCAGGTTGTG KGCVVHVEPSLQCDS ACAGACGGACATTCCTTAGACCAAGCGCACCAGCGCAAGGTCGAAAGG GLNKPDLVGIRQNHI TACGACAGAGCTGACATAAGATCACAAATGCGGCGATTTTTCGGAGCGA YVIDVQVVTDGHSLD CAGGTGAAATCGAGTTTCATTCCGTTACACTCAACTGGAGAGGAATCTG QAHQRKVERYDRAD GAGTGGTCAGTCGGTAAAACGATTGATTGCAAAGGATCTCCTCATCGCT IRSQMRRFFGATGEI GAAGATACCAAACTCATCAGCGTCAGAGCAGTAAATGGCGGAGTGACG EFHSVTLNWRGIWS TCCTTCAAATATTTCATGTATTGTGCTGGGTATACTCGAAGCTAGATGTA GQSVKRLIAKDLLIAE CTAACCTCTAGCTTTTCTTATACTTTTGCCTGCTACCTTGGCATTACATCTA DTKLISVRAVNGGVT AAAAGGTACAAACATCGCATTGTCATAAAGAGGTGGTTTTAGTACGTAG SFKYFMYCAGYTRS GCGCTGTGGGACTTCATTGTCCCGGTGATGCAGTGAATCGTGCATACGA (SEQIDNO:1456) GATTGTCCAGTAGTTGGTTGCTCGTATCTTTAGAAGATTTCCTTCCTCGG CGATCAAAAAAAAAAAAAAAAAAAAAAA(SEQIDNO:1088) R2 R2- Gavialis AGGCATCTCCTTKAAGGGTAATGGTCTGGTTACATGGTCATAGCAGGTT AGGCATCT TGAGAA PAAPRAWGAVEAGP 1_Gav gangeticus TGTGTCAGGTACCTCCCAGTGGTTCCCGCCGGGTGSCAMAGCCCCAGG CCTTKAAG GTTGCG WPGRTRAVEPAPSP GCTGTCGGTAGCTCGATCCTGGTACAGTACGGCCAAGGGAGTTCTTCCT GGTAATGG AGTTCTT ESSPSEAARAAPAGE TGCTGTCGGGTGCCTCGCAAGCACKTGGCAGCCCCAATCGCTTCATTGC TCTGGTTA ATGCAA GHGPGHESPSVQRP GAAAAACACAAACGTCCTAAGGGGATGATCAGCTAGTCAGTTCTGCCGC CATGGTCA GTTGAA EADTTAPGVSAPTRE TAGCCAAAACTGTTTGCCACCCAGTTACAGATAGCGTCTGTGCTGACCA TAGCAGGT TACCACT GEPPSTRVFLVRLPDS GCTGCCCCGCGGGCTTGGGGTGCAGTGGAGGCCGGCCCGTGGCCAGG TTGTGTCA CTKGKG NPPCPICRDHVGKPS CCGGACACGGGCCGTGGAGCCTGCTCCCAGTCCAGAGAGTTCCCCCTCG GGTACCTC ACCCCA ALALHCVESHAWAD GAAGCTGCCAGGGCAGCACCAGCCGGGGAGGGCCACGGCCCCGGTCA CCAGTGGT AAAAAW VQYQCTHCKKVSAN TGAGTCCCCCTCGGTGCAGAGGCCTGAGGCCGATACCACTGCCCCTGGT TCCCGCCG WAAACC KHSILCHIPCCQGRVP GTGAGCGCGCCCACCAGGGAGGGTGAACCACCCTCCACCAGGGTTTTC GGTGSCA CCAAAA EWTGKDWACPECPA CTGGTGAGGCTGCCTGATTCAAACCCGCCGTGCCCAATTTGCAGGGACC MAGCCCC CAGTTG SFNKKVGLSQHKRHV ATGTGGGTAAACCCTCCGCGCTGGCCCTCCACTGCGTGGAGAGCCATGC AGGGCTGT TGTTTAA HPVTRNVERVAGSLS GTGGGCGGATGTGCAATACCAGTGCACCCATTGCAAAAAGGTCAGTGC CGGTAGCT GTGTGT RAGLRPQTRRGCWS TAACAAACACAGCATCCTCTGCCACATCCCATGTTGCCAGGGGAGGGTG CGATCCTG TCTTGTT VEEEETLTCLDAMFR CCCGAGTGGACCGGGAAGGACTGGGCTTGTCCTGAGTGCCCTGCCTCCT GTACAGTA CGTCCCT GARNINQLIAAEMVT TCAATAAGAAAGTTGGCCTATCGCAGCACAAGCGGCACGTGCATCCGGT CGGCCAAG TTGGCTT KMPKQISDKRRQLGL AACACGTAATGTGGAACGGGTTGCAGGGAGCCTATCGAGGGCTGGTTT GGAGTTCT CACCTCS CPEQTTLGGDAESTS AAGGCCCCAGACCAGACGCGGGTGCTGGTCGGTGGAAGAAGAGGAAA TCCTTGCT AAGTTG VVEEESMTPEMETQ CTCTCACCTGCCTAGACGCGATGTTCCGTGGTGCCCGGAACATCAACCA GTCGGGT CGATCC SPINPPGKIRKILAQR GCTGATCGCTGCTGAAATGGTAACGAAGATGCCTAAGCAGATCAGTGA GCCTCGCA CCCCATC ARRWLKKGQGLSDK CAAACGGAGGCAGCTCGGTCTGTGTCCTGAGCAGACCACATTGGGTGG AGCACKTG TCCCCTG VREVLGAWVEGQPR TGATGCTGAATCGACCTCCGTGGTGGAGGAAGAGTCCATGACTCCGGA GCAGCCCC CGCTGC IHAWVDSVSLDVLTL GATGGAAACCCAAAGCCCAATTAACCCGCCTGGGAAAATCAGGAAGAT AATCGCTT CTTTCAG FLGVPSGPQRAPNKK CCTGGCCCAGAGGGCACGCCGGTGGCTGAAGAAGGGGCAGGGTCTGT CATTGCGA AACGGC RPKEGGKPTSWMNK CGGACAAGGTGCGAGAAGTCCTGGGCGCATGGGTGGAGGGTCAACCC AAAACACA CGGTGG CAVKWGTFLRYQHLF AGGATTCATGCCTGGGTCGACTCAGTCTCCCTTGATGTTTTGACTTTGTT AACGTCCT TGTCGA GANRKLLVAIVLDGA CTTGGGGGTGCCCTCAGGACCGCAGAGGGCTCCGAACAAGAAGAGGCC AAGGGGA GGCTGG DRNQCTLLLEEVFQA CAAGGAGGGTGGCAAACCAACGTCCTGGATGAACAAATGTGCCGTCAA TGATCAGC CGCGAC YREKWGLEEVLRAYR ATGGGGCACATTCCTTCGGTACCAGCACCTGTTTGGTGCCAACAGGAAG TAGTCAGT CTCGGT GKWEVESSFEGLGRF CTCCTGGTGGCSATCGTCCTGGACGGCGCTGACCGTAATCAGTGCACCC TCTGCCGC CACCTCC GVRRDADNFAFKALI TCCTGCTAGAGGAGGTCTTCCAGGCCTACCGAGAAAAGTGGGGGCTAG TAGCCAAA AAGGCC TPEEVVKHMMAMA AGGAAGTCCTTCGGGCCTACCGAGGAAAGTGGGAGGTAGAGTCATCTT ACTGTTTG AAGTGC SKSAPGPDKLTLRDLR TTGAGGGTCTCGGACGGTTCGGGGTCCGCCGGGATGCGGATAACTTCG CCACCCAG CCTGGC RADPEGDALAELFSL CATTCAAGGCCCTGATCACTCCTGAGGAGGTTGTCAAACACATGATGGC TTACAGAT CCCGAG WLITGTVPDGLKECR AATGGCCTCGAAATCGGCTCCAGGTCCGGATAAGCTCACCCTGAGAGAT AGCGTCTG TAGGAC SVLIPKTVDREKLGQL CTGCGCCGCGCTGACCCCGAGGGAGATGCTCTTGCCGAACTCTTCAGCC TGCTGA TGAGTG GNWRPITIGSIVLRLF TGTGGCTGATTACCGGCACGGTCCCGGACGGACTCAAGGAGTGTCGGT (SEQID GCCCAG SRVLTARLAAACPINP CTGTGTTGATACCCAAAACGGTGGACCGGGAGAAGTTGGGCCAGCTGG NO:1212) CTCGCT RQRGFIAAPGCAENL GCAACTGGCGCCCTATCACGATTGGGTCCATCGTTTTACGGCTATTCTCA GGGCAC KVLELLLRKRKRDRQP CGAGTGCTAACCGCACGGCTCGCCGCAGCATGTCCCATCAACCCCCGTC CCGTCA LGVVFVDLARAFDSV AGAGGGGTTTCATAGCGGCGCCGGGGTGCGCCGAGAACCTGAAGGTG CCATCTG SHDHISWVLKAKGV CTTGAGCTTCTCTTGCGGAAGAGGAAGCGAGACAGGCAGCCGTTGGGT GGGCAA DEHIVNLIEDSYQKVT GTGGTATTTGTGGATCTAGCGAGGGCGTTCGATTCGGTGTCACACGATC ATGGAA TRVQVFNGVTPPISIK ACATTTCTTGGGTCCTAAAGGCCAAAGGGGTGGACGAACACATCGTGA GGGATC TGVKQGDPMSPLLF ATCTCATCGAAGATTCTTACCAGAAGGTTACCACGAGAGTACAAGTGTT TGTCCT NIAMDPLIAKLETDG CAATGGCGTCACCCCTCCTATCAGCATCAAAACCGGGGTTAAGCAAGGC GACCAC QGVKVGSASLTTLAF GACCCGATGTCCCCCCTCTTGTTCAACATTGCGATGGACCCCCTGATAGC TACCAG ADDLVLLSDSWEGM GAAGCTCGAGACAGACGGACAGGGAGTAAAAGTCGGGAGTGCCTCCCT GCTAAG LKNISILEDFCNLTGLR GACCACCCTGGCCTTCGCGGATGATCTCGTCCTGCTTAGCGACTCTTGG TGTGGT VQPKKCQGFFLNPTC GAGGGCATGCTGAAGAACATCAGCATCCTAGAGGACTTCTGCAACCTCA GCGGCC DSFTVNNCEAWKIA CGGGCCTACGAGTGCAACCCAAAAAATGTCAGGGGTTCTTCTTGAATCC TAGCCT GREITMLGPGESTRY GACATGCGACTCCTTTACGGTGAACAACTGCGAGGCCTGGAAGATAGC GCCGTA LGLNVGPWVGIDKP CGGCCGTGAGATCACGATGCTCGGACCAGGCGAGTCGACACGATATCT AGGTCA DLGTQLSSWLERIGT GGGCTTGAATGTCGGTCCTTGGGTTGGGATCGACAAACCAGATTTGGGT AGCGCC APLKPMQKLSLLVQY ACGCAACTAAGCTCCTGGCTCGAGAGGATAGGGACTGCTCCACTCAAAC CTGCTG AIPRLNYQADYAGIG CGATGCAGAAGCTCTCTTTGCTGGTGCAGTATGCCATACCCAGGCTGAA CCACTCA RVALEALDSMNRRK CTATCAGGCCGATTACGCGGGCATCGGCAGGGTGGCCTTGGAGGCTCT GGTATC VKEWFHLPACTSDGL GGATTCTATGAACCGGAGAAAGGTAAAGGAGTGGTTCCATCTTCCCGCC AGTCCTC LHSRHRDGGLGLPRL TGTACCTCGGACGGTCTCCTCCACTCCCGTCACCGTGACGGGGGTCTTG GTTCACT AKAIPEAQVRRLIRVA GGTTACCGCGTCTGGCGAAAGCCATTCCGGAAGCGCAAGTGAGGAGGC TGTCCCT TSSDEVTRKVSYACGI TGATCCGCGTAGCCACTTCATCTGATGAAGTCACGCGGAAAGTATCCTA CCTAGT SDEVERLWLARGGD CGCGTGTGGGATAAGTGACGAAGTGGAGCGGCTCTGGTTGGCGAGGG ACCCTCT MSSVPRFEDPEAPRS GTGGGGACATGTCCAGTGTACCGAGGTTCGAGGATCCTGAGGCCCCGA GCCTCT PGVQGPCEAAQEIPS GGTCTCCGGGGGTGCAGGGCCCCTGCGAGGCTGCCCAGGAGATTCCGA GCTCTTT VVRKLAIPRPSNWRS GCGTAGTCCGGAAGCTTGCGATCCCCCGGCCCTCCAACTGGAGATCCAA TGCTATC KKHSKWAQLSCQGE GAAACACTCCAAATGGGCCCAACTCAGCTGTCAGGGAGAGGGGATGGA CACTAT GMELFCNDPVSNG GTTATTCTGCAATGATCCAGTCAGCAATGGCTGGAACAACAGTCGGGGA GGCCAG WNNSRGQLAEHLQI CAACTGGCGGAACACCTCCAGATCGTGGCCTTAAAACTGCGTTCAAACA TGATGTT VALKLRSNIYPTREFL TTTATCCCACCAGGGAGTTTCTTGGGAGAAGCCAGGCAAGTACCAATGT GAGGTT GRSQASTNVGCRHC AGGTTGCCGGCATTGTACACACCCTCATGAAACACTAGGGCATATCTTG GGTGCA THPHETLGHILGICPA GGCATATGCCCTGCCGTGCAGGAGGCACGGATCATCCGGCATAACAAG TCCTTG VQEARIIRHNKLCKIL CTGTGCAAGATCCTAGCAGCTGAGGGCAAAAAGTGTGAGTGGACAGTG GTCACCT AAEGKKCEWTVYYEL TATTATGAACTGCAACTGCTTAACGCTGCAGGGGAACTGTGTAAACCTG CCAGGG QLLNAAGELCKPDLIF ACCTCATTTTTGTCCGAGACGGTACCGMTCTGGTTGTGAATGTCACTGT CCAAGC VRDGTXLVVNVTVG GGGGTACGAAGGGGGCCCCGCAWCCCTCCTATCCACCGCTGCAGAAAA GCCTTG YEGGPAXLLSTAAEK GGCCACAAAATACCTGGATMTGAACGCACAGATCCAGGAGCTCACAGG GCCACA ATKYLDXNAQIQELT GGCTGAGCAGGTCACCTACTTTGGCTTCCCTATWGGAGCCAGGGGAAA GGTAGG GAEQVTYFGFPIGAR GTGGCATGCTGACAACTGKCGAGTACTGTCTGAACTGGGATTGTCCAAC ACCTGG GKWHADNXRVLSEL TCCCGGAAGGAGCGGGTCGCACGGCTCCTGTMGTGGCGAGCACTGCTC CACCTG GLSNSRKERVARLLX GGGTCAGTGGACATGGTGAACATCTTTGCATCTAAGCACAGGCAGGAA CCCAGG WRALLGSVDMVNIF AACCTATCGGATGAMGCACTGAGCCCCAGCTGAGAAGTTGCGAGTTCT GGGCCA ASKHRQENLSDXALS TATGCAAGTTGAATACCACTCTKGKGACCCCAAAAAAWWAAACCCCAA GACACT PS(SEQIDNO: AACAGTTGTGTTTAAGTGTGTTCTTGTTCGTCCCTTTGGCTTCACCTCSAA GCCTGT 1457) GTTGCGATCCCCCCATCTCCCCTGCGCTGCCTTTCAGAACGGCCGGTGGT GGCAAG GTCGAGGCTGGCGCGACCTCGGTCACCTCCAAGGCCAAGTGCCCTGGC GGAAAG CCCGAGTAGGACTGAGTGGCCCAGCTCGCTGGGCACCCGTCACCATCTG GGAGCC GGGCAAATGGAAGGGATCTGTCCTGACCACTACCAGGCTAAGTGTGGT GTCCCT GCGGCCTAGCCTGCCGTAAGGTCAAGCGCCCTGCTGCCACTCAGGTATC GACCGT AGTCCTCGTTCACTTGTCCCTCCTAGTACCCTCTGCCTCTGCTCTTTTGCT TACCAG ATCCACTATGGCCAGTGATGTTGAGGTTGGTGCATCCTTGGTCACCTCCA GCTTGA GGGCCAAGCGCCTTGGCCACAGGTAGGACCTGGCACCTGCCCAGGGGG GATGGT CCAGACACTGCCTGTGGCAAGGGAAAGGGAGCCGTCCCTGACCGTTAC GCTGGC CAGGCTTGAGATGGTGCTGGCTAGCCCACCATATGTCAAGCACTCCACA TAGCCC GCTGCTTGAGTTTGTTGGCTTCACCTTCATCCCACCTAGTGTCTTCTGCCT ACCATAT CTGCACTATTTTCATCCCAACTCGTACCTCCCCATCTCTGCGCTCCTGCTA GTCAAG TCCCTGCAAGGACCAAGTAGGCAGGGGGGTTCATCCCCCTACCTGCAG CACTCCA GAGACTCAGCATATCCATGACTTCTTGCCTCCACCGTCTTGTGGCGCTAG CAGCTG AGGGGTACCTCAGAGACCGGCACAACATGACCTTGACGGTTAGACAGT CTTGAG AGGGTCAAACAACCCTGCTGCAGGCCCAAAGGGCCAACAGCTGTGCCA TTTGTTG CGAGAGGGGAACCTTGAAGACTGGGGCAGTCTGACCATGCTGGTTAGT GCTTCAC CAGTTGGGTGAAATAATCCCAGCTGCAGGCCCAAAAGGGCTGACAGTC CTTCATC AGGTGAGGGGGTATCTCCATCTGCTCCCCACTGCCAACTACGGAGGCAT CCACCTA GAAGTCCGTAGTGACTTCTGACCCCCACGTCTTGTGCCATGAGAAGGGA GTGTCTT ACCTTGAAGATTGGGACAAACCGCACTTGAAAGTTACTCAGCCGGGTGA CTGCCTC AAATAAGTCCCAGTTGCGGGCCCCTCGGGGCTGACAGTCAGGTGAGGA TGCACT GGGCTGCAAAGCCCATCTCCTGACTCCAGAGGCCTGGCGTCCTAACCGA ATTTTCA CTTCTTGCCACCAATGTCTTGCGCCAGGAGAGGGCAACCTTGAAGATCG TCCCAAC GGGCAAGCCGCACTTGATAGTTAGCCAGTCGAGTGAAACAATCTCAGCT TCGTACC GCGGGTCCGAAAGGACTGACTTCCAGGCGAGGGGGGGGCCTGCGGAA TCCCCAT AACCCCCTCCATGGTACGGAGGTCTGGCATCCTAACCGACACCTTGCCA CTCTGC CCAATGTCTTGTGCCAGGAGAGGGGAACCTTGAAGACTGGGGCAAGCC GCTCCT GCAGTTGATGGTTAGTCAGTCGGGTGAAATAATCCCGGCTGCACCCTGC GCTATCC TGTGACTGCTAAGCCCGGTCCCCAAGGGGCATGAGGCATGTGCGCTGA CTGCAA GACGGGAGGGGTGACATCTGGCGATCAGCACAGCACAGACTGAAGGG GGACCA AGGCACTTGCCGAGAATGCTTCTGAGGCCCCAGACTTGGGGTGGTGCA AGTAGG GCTTTGTCTCGTGTATAGTACAGCACCCTACTGCTCCCTTTGGGCAGCAG CAGGGG AATTTGTCCTGACCTCTTACCCACCCGAGTCTGCGCTTTTGTTCCACCTCG GGTTCA CTGTCTCCCTGCTGTGCTGTTTTTCTCTCAAGTGGGTTAAATCTCAACATG TCCCCCT ATTATCTCCCACGTTTCCGCTCAAGGGCAATGCCCAACATGACGGAGAT ACCTGC CGTTGGTGCATGGTAGTCACGAGACCATCCGGACCCTCCAGTGGTCGCT AGGAGA ATAGTCATTTTGTGTTGCATGGGGCATGCTGAGTCACTTAACCGAAAGA CTCAGC CTGTAAATAACTCAAAAGAGGTACCCTCCGGGGTTCGGTAAA(SEQID ATATCCA NO:1089) TGACTTC TTGCCTC CACCGT CTTGTG GCGCTA GAGGGG TACCTCA GAGACC GGCACA ACATGA CCTTGAC GGTTAG ACAGTA GGGTCA AACAAC CCTGCT GCAGGC CCAAAG GGCCAA CAGCTG TGCCAC GAGAGG GGAACC TTGAAG ACTGGG GCAGTC TGACCA TGCTGG TTAGTCA GTTGGG TGAAAT AATCCC AGCTGC AGGCCC AAAAGG GCTGAC AGTCAG GTGAGG GGGTAT CTCCATC TGCTCCC CACTGC CAACTA CGGAGG CATGAA GTCCGT AGTGAC TTCTGAC CCCCAC GTCTTGT GCCATG AGAAGG GAACCT TGAAGA TTGGGA CAAACC GCACTT GAAAGT TACTCA GCCGGG TGAAAA TAAGTC CCAGTT GCGGGC CCCTCG GGGCTG ACAGTC AGGTGA GGAGGG CTGCAA AGCCCA TCTCCTG ACTCCA GAGGCC TGGCGT CCTAACC GACTTCT TGCCAC CAATGT CTTGCG CCAGGA GAGGGC AACCTT GAAGAT CGGGGC AAGCCG CACTTG ATAGTT AGCCAG TCGAGT GAAACA ATCTCA GCTGCG GGTCCG AAAGGA CTGACTT CCAGGC GAGGGG GGGGCC TGCGGA AAACCC CCTCCAT GGTACG GAGGTC TGGCAT CCTAACC GACACC TTGCCAC CAATGT CTTGTG CCAGGA GAGGGG AACCTT GAAGAC TGGGGC AAGCCG CAGTTG ATGGTT AGTCAG TCGGGT GAAATA ATCCCG GCTGCA CCCTGCT GTGACT GCTAAG CCCGGT CCCCAA GGGGCA TGAGGC ATGTGC GCTGAG ACGGGA GGGGTG ACATCT GGCGAT CAGCAC AGCACA GACTGA AGGGAG GCACTT GCCGAG AATGCTT CTGAGG CCCCAG ACTTGG GGTGGT GCAGCT TTGTCTC GTGTAT AGTACA GCACCC TACTGCT CCCTTTG GGCAGC AGAATT TGTCCT GACCTCT TACCCAC CCGAGT CTGCGC TTTTGTT CCACCTC GCTGTC TCCCTGC TGTGCT GTTTTTC TCTCAA GTGGGT TAAATCT CAACAT GATTATC TCCCAC GTTTCC GCTCAA GGGCAA TGCCCA ACATGA CGGAGA TCGTTG GTGCAT GGTAGT CACGAG ACCATCC GGACCC TCCAGT GGTCGC TATAGTC ATTTTGT GTTGCA TGGGGC ATGCTG AGTCAC TTAACC GAAAGA CTGTAA ATAACTC AAAAGA GGTACC CTCCGG GGTTCG GTAAA (SEQID NO: 1335) R2 R2- Ixodes GTTCCAAAGGAAGGCACTCCTTTGGTTCGTGATGAGATGTTCATGGTGC GTTCCAAA TAGTGT MQCTSRLADAPRFA 1_IS scapularis TTGCCTAGCTGGAGAAATCCGACTCACACCTGCACGTGGTCCCTGCCGC GGAAGGC GACGGA RVGVEGEGVGASGN CTGCCAGTATGCCGAGGAAACGGGTGCAACTTAATCCGTGGATACTGGT ACTCCTTT GTCCTCA GTDAQLWYGCTGCD AGCAACGTGAGCAACGGTACGGTCCTTCGCGGACCACCCTGGGCGTTC GGTTCGTG AGCCCC EAFSSLRGLRIHAAQK GGGTTGCCAGCCCGTTCGCCCGAAATATCTTGGCCCTGAAACTAAAAGA ATGAGATG CACAAG KHGNQDGLLRLPAG AAAATGCAATGCACCAGCCGACTGGCTGATGCACCAAGATTTGCCCGAG TTCATGGT TGCCTG RPRKRRVGKSTTAGA TGGGCGTCGAGGGTGAAGGTGTCGGTGCGTCTGGTAACGGCACTGATG GCTTGCCT CCAGGT SDRVTTDPVPAPVPE CGCAGTTATGGTATGGCTGCACGGGCTGTGACGAAGCCTTCTCGTCCCT AGCTGGA GGCAGG SPGLLPGLPGPSLPGC GCGAGGACTGAGAATTCATGCGGCCCAAAAGAAACATGGAAACCAAGA GAAATCCG AAAGGG SDLPPGVLPGGWSA TGGCCTTCTCCGCCTGCCGGCGGGACGGCCCCGAAAACGACGAGTGGG ACTCACAC CAACTA SPGPLSWPPSLDAGP GAAGAGCACCACAGCGGGTGCTTCGGACCGGGTGACCACGGATCCAGT CTGCACGT CTGGTG LPGPSRVSPGPSRPSP GCCTGCTCCAGTTCCTGAATCTCCTGGGCTGCTGCCTGGGCTACCTGGAC GGTCCCTG AGCGAC GKPTGPPSLDAGPLP CATCGCTGCCTGGGTGCTCGGACCTGCCGCCTGGGGTGCTGCCTGGAG CCGCCTGC CCAAGC GPSRVSPGPSRPSPG GGTGGTCTGCATCACCTGGGCCTCTCTCCTGGCCTCCTTCCCTGGATGCC CAGTATGC AAGGCG KPPGTPEPLPGSPGG GGGCCTCTGCCTGGACCTTCAAGAGTATCACCTGGACCTTCAAGACCTTC CGAGGAA GAGCCA RRGVSPGQPGSRTD GCCCGGGAAGCCGACCGGGCCTCCTTCCCTGGATGCCGGGCCTCTGCCT ACGGGTGC AGACCA PSSSAGAGHFVCPQC GGACCTTCAAGAGTATCACCTGGACCTTCAAGACCTTCGCCCGGGAAGC AACTTAAT AGCTGG SRAFSSKIGMSQHQK CGCCCGGGACGCCTGAGCCGCTGCCTGGATCTCCTGGCGGTCGGCGCG CCGTGGAT AGCCAA HAHLEEYNAGINITRT GGGTGTCCCCGGGACAGCCCGGGTCACGGACCGACCCCTCAAGCTCTG ACTGGTAG GAGCAA KARWDPEETYLLARL CTGGCGCCGGACACTTCGTATGCCCGCAGTGCAGCAGAGCCTTCTCAAG CAACGTGA CTCCAG EATLNPDHKNINQTL CAAGATTGGCATGTCTCAACACCAAAAACATGCCCACCTCGAAGAATAC GCAACGGT GAGGCA HAALPRGSCRTLESIK AACGCGGGCATTAACATCACCCGTACCAAAGCCCGGTGGGACCCCGAG ACGGTCCT GGGGTG AHRKQAAYRDLVTSL GAGACCTATCTTCTGGCCCGCCTGGAGGCCACCCTCAACCCAGACCACA TCGCGGAC GATATC RSARESSEAQHVPDR AGAACATCAACCAGACGCTGCACGCCGCGCTGCCCCGCGGTTCCTGTCG CACCCTGG AAGAGC PLETPEPQTPANPQR AACCCTGGAGAGCATCAAGGCCCACCGAAAGCAGGCGGCTTACAGGGA GCGTTCGG AACCCC DSKQAVIEALQSLIGR CCTGGTGACGAGCCTGCGGTCAGCCAGGGAGAGCAGCGAGGCGCAGC GTTGCCAG AAGGGA APPGSFQGARLWDI ACGTTCCGGACCGGCCCCTGGAGACCCCGGAGCCCCAGACACCAGCGA CCCGTTCG CACAGA ARQATRGTNILPLLN ACCCTCAAAGAGACTCGAAGCAGGCAGTCATCGAAGCGCTGCAATCCCT CCCGAAAT CCACGG SYLRDVFTLPTKPTRK CATCGGCCGAGCACCACCAGGCTCCTTCCAGGGAGCGCGTCTCTGGGAC ATCTTGGC GCAACT KPAVRPARSRRKQKK ATCGCGAGGCAAGCCACAAGGGGGACGAACATCCTCCCACTCCTGAAC CCTGAAAC ACTGGT QEYARTQDLFRKKQS AGCTACCTGAGGGATGTTTTCACCCTCCCCACAAAGCCAACAAGAAAGA TAAAAGAA GAGCGC DCARAVLDGPTSSSV AACCTGCAGTGCGGCCCGCCCGGAGCCGCAGAAAACAAAAGAAACAAG AA(SEQID CCAAGA PGTGAFLQTWREIM AGTATGCCAGAACACAAGATCTATTCAGGAAAAAGCAGTCCGACTGTGC NO:1213) CAGGGG TGPSPALEAPPLPTR CAGGGCGGTCCTGGACGGCCCCACGTCGTCATCGGTCCCTGGAACGGG TGGATA GEVDLFFPATAQEIQ CGCCTTCCTGCAAACCTGGCGAGAGATCATGACGGGGCCCAGCCCTGCA TTAAGA SAEIAVNSAAGPDGF CTCGAGGCACCGCCTCTACCTACCCGGGGGGAAGTCGACCTGTTCTTCC ACAGCC SARLLKSVPALLLRVM CGGCGACGGCGCAAGAGATCCAGAGCGCTGAGATAGCCGTCAACTCGG CCACAA VNLLLLVRRVPAALR CTGCTGGACCCGACGGGTTCTCAGCCCGTCTCCTCAAGTCCGTCCCGGC AGTGTT DARTTFIPKVPDAVD CCTCCTCCTAAGGGTCATGGTTAACCTTCTGCTCCTCGTCCGACGTGTCC ACCTATA PSQFRPITVASVLQRL CGGCGGCCCTCCGGGACGCGAGAACGACCTTCATCCCGAAGGTCCCCG TTAACA LHRILAKRALEAIPLNF ATGCAGTGGACCCCTCCCAATTTCGCCCAATAACGGTGGCCTCCGTTCTC ATAAAG RQRAFQPVDGCAEN CAGCGCCTACTACATCGCATCCTGGCCAAGAGGGCGCTGGAGGCCATTC TTGAAG IWLLSTALNEARTRR CCCTCAACTTTCGACAAAGAGCCTTTCAGCCGGTGGATGGCTGTGCCGA CCTCAAC RPLHMASVDLTKAFD GAATATATGGCTGCTGTCCACCGCGCTCAACGAGGCAAGAACCAGACG CACGCA RVTTDAILRGARRAG GCGCCCGCTACACATGGCGAGCGTCGACCTAACCAAGGCATTCGACCG TTGCGG LSGEFIGYLKELYTTSR GGTCACCACGGATGCCATCCTGAGGGGCGCAAGGCGCGCCGGGCTGTC GTTAGA TLLQFQGESLLVEPTT CGGGGAGTTCATCGGATACCTGAAGGAGCTCTACACAACATCCAGGAC TGGCGT GVRQGDPLSPILFNL CCTCCTGCAGTTCCAGGGAGAGAGCCTGCTTGTCGAACCCACGACCGGC GGCTTG VLDEYLSSLDPDISFVS GTGCGACAGGGCGACCCACTGTCGCCCATCCTCTTCAACCTGGTCCTGG GCCCGC GDLRLDAMAFADDLI ACGAGTACCTCTCCTCCCTGGACCCGGACATCTCCTTCGTCTCGGGCGAC CGCCAT VFASTPAGLQDRLDA TTGCGCCTCGATGCGATGGCATTCGCTGACGACTTGATCGTCTTTGCCTC GATGAG LVEFFDPRGLRVNVK AACCCCAGCCGGCCTGCAGGATCGGCTCGATGCCCTCGTCGAGTTCTTC CTGGAA KSFTLSLQPGRDKKV GACCCAAGGGGGCTCAGGGTGAACGTGAAGAAGAGCTTCACGCTATCG CCCTCCA KVVCDQIFTIGGTPLP CTGCAGCCGGGACGAGACAAGAAGGTCAAGGTGGTGTGTGACCAGATC CCTGGT ASKVATPWRYLGMT TTCACCATCGGAGGAACCCCACTCCCAGCCTCCAAGGTCGCAACCCCTT GGGCCG FTPQGSINKGTSEQL GGCGCTACCTGGGGATGACCTTCACCCCCCAGGGCTCAATCAACAAGG CACGAG DLLLTRTSKAPLKPQQ GCACCAGCGAGCAGTTGGACCTACTGCTCACGAGAACCAGTAAGGCCC ACCACC RLVVLRNYLLPRLYHR CCCTCAAGCCACAACAGAGGCTGGTGGTCTTAAGAAACTACCTGCTCCC GGCTCT LVLGPWSAALLLKM GAGGCTATACCATCGCCTCGTGCTTGGACCTTGGTCGGCCGCCCTCCTAC TTCTACT DTTIRGAIRRWMDLP TGAAGATGGACACCACCATTCGAGGAGCCATTAGACGCTGGATGGATC AAGGCC HDTPLGFFHAPVTEG TCCCGCACGACACACCGCTGGGTTTCTTCCACGCCCCAGTAACGGAGGG GGTCTC GLGINSLRASIPAMVL AGGCCTAGGAATCAACTCCCTGCGAGCATCAATTCCAGCCATGGTGCTC CGTGAC QRLDGLHFSTHPGAE CAACGGCTGGATGGACTTCACTTCAGCACGCATCCCGGAGCTGAGGTCG TGCGGT VAIQLPFLTGLHRRAE CCATCCAGCTGCCGTTCCTGACAGGACTCCATCGAAGAGCGGAAGCGG TGGGAT AAAQYQGQRLLSKA CGGCCCAATACCAGGGACAGAGACTACTGTCCAAAGCGGACGTCCACC AAACTC DVHRMWSARLHGS GGATGTGGAGCGCAAGACTCCACGGGAGCTGCGACGGAAGACCCCTTC CAAGCA CDGRPLRESKRVPAA GGGAGTCCAAGAGAGTGCCGGCTGCCCATCGTTGGGCCGCGGAAGGC CTGAGC HRWAAEGTRLLSGR ACCAGACTACTCTCGGGAAGGGACTTCATCTCGATCACGAAACTCAAGA GGTAAA DFISITKLKINALPTLE TAAACGCGCTACCTACACTCGAGCGCACCAGCCGGGGCCAGCACAAGG AAAAAA RTSRGQHKDIQCRA ACATCCAGTGCAGAGCTGGCTGCCAGGCTGTTGAATCCCTGGGCCACGT AAAAAA GCQAVESLGHVLQA CCTACAAGCTTGCCATCGAGGACACCGTGGCCGAATCCGGCGGCATGAT AAAAAA CHRGHRGRIRRHDNI AACATTGCCCGCTACGTCTGCGGCCGACTGACCCAGATTGGCTGGGCGG AAAAAA ARYVCGRLTQIGWA TGAAGTGGGAGCCCCACTACTCTGTCGCTGGAAGGACCCTCAAACCTGA (SEQID VKWEPHYSVAGRTL CATCGTTGCCCATCGTGGAGCCGAGACTGTCGTGCTCGACGCCCAGGTC NO: KPDIVAHRGAETVVL GTCGGCACCAGCATGCGACTGGGCTTCCACCACGCTCAAAAGAAAGAA 1336) DAQVVGTSMRLGFH AAGTACTCTCTCCCAGACCTCCTCCACCAAGTCTGCGAGGGACGGAGAG HAQKKEKYSLPDLLH ACGCAGCCCGGGTGTCAACAATCACCCTCAATTTTCGAGGTGTTTGGGC QVCEGRRDAARVSTI ACCTGAGAGCGCCCAGGACCTGAAGTCCCTGGGCCTGACGGACAACGA TLNFRGVWAPESAQ CCTAAAGCTTCTCACCGTCCGCTGCCTCCAGGGCGGCGCGCAGTGTTTC DLKSLGLTDNDLKLLT CGGCTGCACCGCCGAATGACCACCGTGGTGAAGGCCACGGGCGATGAA VRCLQGGAQCFRLH GCCAACGCCCTCCCCGCCCATTCGGGCTTGCCGCCAACACAGCTTGGTG RRMTTVVKATGDEA GCCGAACCCTGGGTCCCTCTGCCCACAATCAGAGTGCAAGAACTACTTA NALPAHSGLPPTQLG GTGTGACGGAGTCCTCAAGCCCCCACAAGTGCCTGCCAGGTGGCAGGA GRTLGPSAHNQSART AAGGGCAACTACTGGTGAGCGACCCAAGCAAGGCGGAGCCAAGACCA T(SEQIDNO:1458) AGCTGGAGCCAAGAGCAACTCCAGGAGGCAGGGGTGGATATCAAGAG CAACCCCAAGGGACACAGACCACGGGCAACTACTGGTGAGCGCCCAAG ACAGGGGTGGATATTAAGAACAGCCCCACAAAGTGTTACCTATATTAAC AATAAAGTTGAAGCCTCAACCACGCATTGCGGGTTAGATGGCGTGGCTT GGCCCGCCGCCATGATGAGCTGGAACCCTCCACCTGGTGGGCCGCACG AGACCACCGGCTCTTTCTACTAAGGCCGGTCTCCGTGACTGCGGTTGGG ATAAACTCCAAGCACTGAGCGGTAAAAAAAAAAAAAAAAAAAAAAAAA AA(SEQIDNO:1090) R2 R2- Mnemiopsis TGGGGGCCCCTTGGACTTGCTCCCTGGGGCAGGACACCAGTGAAAGGA TGGGGGC TAGTGA MSNTSHSKLNLKMD 1_MLe leidyi GATCCTCAAGACAGGACAGAGAGACAGGCACACCAACCCTTCGAACCT CCCTTGGA CGAGAA NKLKTSLETPSGVRA GAGGAGACACCCAGATCTGGTTACCCCATTCCTGTAACCATGGTTGCTCT CTTGCTCC AAGTCG DSIITRVRTSSNRGEH CCGTGCGCTGTTAAGGAAACCCAGCCTAGTACCCTCGGGGAAAGGTTG CTGGGGCA CTTTATC SNGVTYPRCEQGVA CGTAGTACTTAGCAGTGTGCGGATCAAACCTCTACCGGTCTCTCTAGCG GGACACCA CTTACAT PLDTHGGICDAPPQV ATGAAAGTTTCTCCGACTGGAACTTGAGGGACTGGCTAGCCCAGCTAGC GTGAAAG AACAGT TVPATETDKQKKCEY CTGTAGAGCAATGCGTATACGATGCCTTGGCGACAATGGCGACCGCTGC GAGATCCT GTGATA CEFTYLKPRQIGTHM TTAGCAGACGGAGGTTAGTGAAAGGGCGACTTGCTGTTCATAGTCACGT CAAGACAG GTCATCC RKRHPQEWNDIKRT GAGTCGTCTAGAAACTGCACCGATGCTGCCCTCTGTTCCAGGAGAAGGA GACAGAG TTATCAC KFLSEKRQKRWLDED CAGTGGAGGACTAAATCGTAGCGCGAGCGGTGTTATGTCGAACACTAG AGACAGG AACTGT FELLCIGQEEYLVLSSI CCACTCGAAACTTAACCTCAAAATGGATAATAAACTCAAGACTAGTCTCG CACACCAA CCTGGC GKQGKGINQYIQTKY AGACTCCCTCCGGAGTCCGAGCAGATTCGATAATAACGCGGGTGAGGA CCCTTCGA GAACAA FPTLSTDAIKSQRKSR CCAGCAGTAACCGTGGTGAACACAGCAATGGTGTCACGTATCCCCGGTG ACCTGAGG AATGGC RFSEYSEKRSRELQPC CGAACAGGGGGTTGCGCCTCTGGATACGCATGGGGGGATATGTGACGC AGACACCC AGAGGG NTSSDPEELPNEAVT TCCTCCACAGGTGACGGTTCCCGCCACTGAGACGGATAAACAGAAAAAA AGATCTGG ATAGTA ENSPLSFDPLDRDVV TGTGAATACTGCGAGTTCACATACCTGAAACCGAGACAGATAGGAACCC TTACCCCA CTCGGT KKISSKDHGDQILLVQ ACATGAGAAAACGGCATCCCCAGGAATGGAACGACATCAAACGGACCA TTCCTGTA CCAACC EHLINGRYQEANTLA AATTCCTGAGCGAAAAGAGACAAAAACGTTGGCTTGACGAAGACTTCG ACCATGGT AGAAGG KAIFEKLSGKFPNLKT AGCTTCTCTGCATAGGCCAAGAGGAATACCTTGTGCTATCCTCGATTGG TGCTCTCC AAGCCC GDHRPGKQQTARKV CAAACAGGGCAAAGGGATCAACCAGTATATTCAAACCAAGTACTTCCCA GTGCGCTG ACACGA GKKRVRGSGKKLSPS ACCCTTAGTACAGATGCTATTAAATCTCAAAGGAAATCCAGACGGTTTTC TTAAGGAA TGCCAA KQNRRELYAIVQKQ AGAGTACAGCGAAAAGAGATCACGTGAACTCCAACCTTGCAATACCAGC ACCCAGCC ACTTGCT WRTKKRSKVINQILT AGCGACCCCGAAGAACTTCCGAATGAAGCGGTCACCGAAAACAGTCCG TAGTACCC GTAACC GNLNKEQSYTHTPD CTCTCTTTTGATCCACTTGACCGAGATGTTGTAAAGAAAATAAGCTCCAA TCGGGGA CACGTG QLAQFWSTLFGRVSP AGATCACGGCGATCAGATCCTCTTAGTACAGGAGCACCTCATCAACGGA AAGGTTGC AGCGGA RDDRPINHRRSVIPEL CGGTATCAAGAAGCGAACACGTTGGCCAAAGCTATATTCGAGAAGCTCT GTAGTACT AAACAT DKPLSVEEVEAALKG CCGGTAAATTCCCGAACCTGAAGACCGGCGACCATCGGCCTGGTAAACA TAGCAGTG CCTCCG AKDAATGIDGVPISH ACAAACAGCAAGGAAAGTCGGTAAGAAAAGAGTTAGAGGAAGCGGTA TGCGGATC AGTATA LKHLGSAALTILYNGL AGAAGCTATCCCCCTCTAAACAGAATAGACGCGAGCTATACGCGATAGT AAACCTCT GTATGA YVTGSIPDPWKRART ACAAAAGCAGTGGCGCACTAAGAAAAGATCTAAGGTGATCAACCAGAT ACCGGTCT TGGAAG ILIPKSNPPASPGDYR ACTGACAGGTAATCTAAACAAAGAACAATCTTATACACATACACCCGAT CTCTAGCG GATACA PISISSYFYRIYTSAISK CAGCTAGCGCAATTCTGGAGTACGCTGTTTGGCAGAGTCAGTCCGAGA ATGAAAGT GGATGT RLASAVSLDDRQKGF GACGATCGACCGATTAACCACCGGCGGTCAGTTATCCCGGAGCTAGACA TTCTCCGA GGACCG IKEDGIRDNLSLIDTLI AACCCCTCTCTGTTGAGGAGGTCGAAGCTGCGCTCAAAGGGGCCAAAG CTGGAACT CTCTAG NETKAGSKSLFMTF ATGCTGCAACAGGCATTGACGGAGTCCCAATCTCCCACCTGAAGCACCT TGAGGGA GCGGCG MDVKKAFDSVSHYAI AGGCAGTGCAGCTCTAACAATCTTGTACAACGGATTGTATGTGACCGGG CTGGCTAG GACGGT ARSLEWAGVPDGM TCGATTCCTGACCCCTGGAAAAGAGCAAGAACCATACTGATCCCAAAGT CCCAGCTA GTAGAC RSVIADLYQDCTTDIC CAAATCCACCAGCTTCACCTGGGGATTATAGACCTATCTCGATCAGCTCG GCCTGTAG GGCGAC GRSVKVTRGVKQGD TATTTCTATAGAATCTACACGAGCGCGATCAGCAAAAGACTTGCTTCTGC AGCAATGC CTTACCA PLSSTLFNLVIEMVM CGTCAGCCTTGACGACCGTCAGAAAGGATTCATCAAAGAAGATGGAAT GTATACGA TAGCTG SNVPERLGIQFQGHR ACGAGACAACTTATCATTGATTGATACTCTCATCAATGAGACGAAAGCA TGCCTTGG CGACTG LFYLAFADDLVLLTRG GGCTCGAAATCCCTATTCATGACCTTCATGGACGTCAAGAAAGCTTTCG CGACAATG CTGTGC PTANQKLVSLVHEQL ACTCCGTGTCGCATTACGCAATTGCCAGATCTCTAGAATGGGCTGGCGT GCGACCGC GATCCG ARVGLELHPGKCKSIA CCCCGATGGAATGAGAAGTGTCATAGCCGATCTATACCAGGACTGCACC TGCTTAGC GAACGG IMADPKRKTTFVDQ ACTGATATCTGCGGCAGATCCGTCAAAGTGACCAGAGGCGTGAAGCAG AGACGGA GCCTTTC GSSVLIGGEPVSSLGP GGAGACCCCCTAAGCTCCACTCTGTTCAACCTGGTAATAGAAATGGTGA GGTTAGTG TCACTTA QEWYKYLGIKLGSGG TGTCAAACGTCCCCGAACGTTTGGGAATTCAATTCCAGGGACACAGACT AAAGGGC CCTCAA MPQGIYRDQLADLLA GTTCTATCTGGCTTTTGCAGACGACCTAGTCCTCCTGACGAGAGGACCC GACTTGCT GCACCG KTDSAPLKPQQRLYIL ACAGCTAACCAGAAGCTAGTATCCTTAGTCCATGAACAACTAGCAAGAG GTTCATAG TGTCGG RSHILPKFNHRLMFE TTGGACTTGAGCTACATCCGGGAAAGTGCAAGTCAATCGCTATCATGGC TCACGTGA ATCGCG RVTCQTLEGLDKLIRT TGACCCCAAGAGGAAAACCACTTTCGTAGATCAGGGATCCAGTGTTCTC GTCGTCTA GGGTGG HVRKWLKLPKDTPG ATTGGAGGCGAACCCGTCTCTTCGCTCGGTCCGCAAGAATGGTACAAAT GAAACTGC GCGCGG PAFYADKGSGGLGLI ACCTTGGGATTAAACTGGGATCAGGAGGCATGCCCCAGGGTATTTACCG ACCGATGC GATTCCT TLRYRVPLLKLRRHKK AGACCAACTAGCAGACCTTCTAGCGAAGACAGACAGTGCCCCCCTGAAA TGCCCTCT GCTGGG MADSPDPVIRLIPNA CCCCAACAAAGATTGTATATACTCAGATCGCATATACTACCGAAGTTCAA GTTCCAGG AAGTTG EPTISLLARWTKMCS CCATAGACTAATGTTCGAGAGAGTAACGTGCCAAACACTAGAGGGTCTT AGAAGGA TCTGCAC LYGKQYQHKSELSKII GACAAACTGATCAGGACCCACGTCAGAAAGTGGTTGAAATTACCCAAA CAGTGGA CCAAGT RDKYWTMCDGKGLR GACACGCCTGGTCCAGCCTTCTACGCTGATAAAGGTTCAGGCGGACTCG GGACTAAA CGTCAG TEVPPDTAKKTLSLLF GACTGATAACTCTCAGATACCGAGTTCCCCTCCTAAAACTTAGGAGACAC TCGTAGCG CTAACT EDRTPLKPGQLIGAIG AAAAAGATGGCGGACTCACCTGATCCCGTTATCAGGTTGATACCAAACG CGAGCGGT GTCGGT VRLNTLGTPARNNRA CGGAACCGACCATCTCCTTGTTAGCTAGATGGACCAAGATGTGCTCCCT GTT(SEQ ACTTATA KGYSPEANICDKCPG CTATGGCAAACAGTACCAGCACAAGTCCGAACTCTCTAAGATAATTAGA IDNO: CGCACC NRQATLGHISQTCPA GATAAATACTGGACAATGTGCGACGGGAAAGGCTTGAGGACAGAGGT 1214) CAGAGC THGRRVKRHDKIVNR GCCACCAGATACGGCTAAGAAAACACTAAGCCTGCTATTCGAGGACCG CATGCTC IAKALKERGSVKNILT GACTCCACTCAAACCAGGACAACTAATTGGAGCTATCGGGGTCCGCCTA TTACCA EPHLRHDKLPLRKPD AATACCCTTGGTACCCCCGCTCGTAACAATCGAGCTAAAGGCTACAGTC GGCCAC LIVHTEKSVEIIDVQV CTGAGGCTAATATTTGTGACAAATGTCCAGGCAACAGGCAGGCCACTCT TTGACG VADQGISRHEDEDQ TGGCCACATCTCGCAAACTTGTCCGGCGACTCACGGGAGGAGAGTCAA GCGCTA QKKIVKYDVDGYKRA ACGACACGACAAGATTGTGAACAGAATCGCAAAAGCTCTAAAAGAACG CACCGG AYKMLGIDYGSIPCN AGGCTCCGTCAAGAACATACTTACCGAACCCCACCTCAGGCACGATAAA TCTGTA VSAFTITWRGNLAPH CTGCCCCTACGGAAACCCGACCTAATAGTACATACGGAAAAGTCGGTCG GGAGGG SLKLASRLQFSPVLKYI AAATCATTGATGTCCAGGTCGTTGCAGACCAGGGCATCTCGAGGCACGA TATCTCA VADSLVDTWGAFLI AGATGAAGACCAACAAAAGAAGATCGTCAAATATGACGTGGATGGATA GCGACT WGKTS(SEQIDNO: CAAGAGAGCGGCCTACAAAATGCTAGGGATTGACTACGGGAGCATCCC TGGACA 1459) CTGCAATGTAAGTGCGTTCACTATCACCTGGCGAGGAAACCTTGCCCCC AAATGG CATTCACTCAAACTGGCGAGTCGTCTCCAGTTTTCACCCGTGCTCAAATA ATCTTGA TATCGTGGCTGACTCCCTAGTTGACACATGGGGTGCGTTCCTGATATGG CTGCCT GGAAAGACGTCTTAGTGACGAGAAAAGTCGCTTTATCCTTACATAACAG GAGGGC TGTGATAGTCATCCTTATCACAACTGTCCTGGCGAACAAAATGGCAGAG GCTTATC GGATAGTACTCGGTCCAACCAGAAGGAAGCCCACACGATGCCAAACTT GAGTGA GCTGTAACCCACGTGAGCGGAAAACATCCTCCGAGTATAGTATGATGGA CCCAGA AGGATACAGGATGTGGACCGCTCTAGGCGGCGGACGGTGTAGACGGC GTAAGC GACCTTACCATAGCTGCGACTGCTGTGCGATCCGGAACGGGCCTTTCTC TGGTAG ACTTACCTCAAGCACCGTGTCGGATCGCGGGGTGGGCGCGGGATTCCT GAAGAA GCTGGGAAGTTGTCTGCACCCAAGTCGTCAGCTAACTGTCGGTACTTAT TCTTGCA ACGCACCCAGAGCCATGCTCTTACCAGGCCACTTGACGGCGCTACACCG GTTGAG GTCTGTAGGAGGGTATCTCAGCGACTTGGACAAAATGGATCTTGACTGC AGGGCT CTGAGGGCGCTTATCGAGTGACCCAGAGTAAGCTGGTAGGAAGAATCT AGTAGG TGCAGTTGAGAGGGCTAGTAGGGCCAGCCTCTGCTCGTCAACTGTCCTA GCCAGC GGCGTTAACGTGTGCCTCCCTATGAGGTAACCGTGAAGTATTACTTGTTT CTCTGCT CCCTGTAGAGACCATATAACTAGCTGAGAAACGCCCTGCGGGTTAATAT CGTCAA ACAGTTACCGCCGTCCTGCATCCCCGAGTTGTGACCAGCGTACAGCCAT CTGTCCT GTACCACGGCACTTCCAGCATTCTCTGGTCTAAGAATGTATCAGCTGGCC AGGCGT CGCCGAAAGATAGAGCGACCCCGCCTCTATCACTAGAGTAAGCAGGCA TAACGT CGCGAAATATATGCAGGACGCCTTGGCTCAGTAGCTCTGGTACTGAGTA GTGCCT TAGCATGGCTGAACGCCCCTTTAAGTTCGGTGGCCCAGAGACTCTCCGA CCCTATG GACTTCCCAGTCCTGGAGAGGGTACACCTTTACCAGAACTTTCTGTGGTC AGGTAA GGTTG(SEQIDNO:1091) CCGTGA AGTATT ACTTGTT TCCCTGT AGAGAC CATATA ACTAGC TGAGAA ACGCCC TGCGGG TTAATAT ACAGTT ACCGCC GTCCTG CATCCCC GAGTTG TGACCA GCGTAC AGCCAT GTACCA CGGCAC TTCCAGC ATTCTCT GGTCTA AGAATG TATCAG CTGGCC CGCCGA AAGATA GAGCGA CCCCGC CTCTATC ACTAGA GTAAGC AGGCAC GCGAAA TATATGC AGGACG CCTTGG CTCAGT AGCTCT GGTACT GAGTAT AGCATG GCTGAA CGCCCCT TTAAGTT CGGTGG CCCAGA GACTCTC CGAGAC TTCCCAG TCCTGG AGAGGG TACACCT TTACCA GAACTTT CTGTGG TCGGTT G(SEQ IDNO: 1337) R2 R2- Petromyzon CTATTAATGGGATGAAGAAGGGGGACACGAGTTTGTGTGTGCATCCAG CTATTAAT TAATTTA MNERLTDELTTEFILS 1_PM marinus TTTCCATGGTGCATGCAGGAGTGGTGGTTTAAATGGCGAGACTCTACAG GGGATGA AGGTAA DMFLWDYPCTDQN GGCTTCCATGGCTACACGGGATGCAAGGCATCAGACATTTTGGCACAG AGAAGGG AATCGT KCYPCNLVFLDHRTW GCAATCCTTTTGGTCTCTACCGCAATCATGTCTTAGACCTCAGTAGCGAC GGACACG GGGATT SSHMARVHPHANKT CACTACAACCACAGTGGTGACTGCTGTTGAGTGAAGGACGACTGAGCG AGTTTGTG GTTTTGA YKCRICNRTADSIHKI CTGGATAACAACTTTCTTGCGTGGCCCAACATCGAAGCAACCACTTCGG TGTGCATC TGGCAA ASHYGRTCKSLIGKTN AGCTGGCACAAGGCAAGAGGGCAGCCCAAGGTGTGAATCATCTCAACT CAGTTTCC TCTGCCT AITTTIDETLFSCLHCS TCACTGCAGGAAGAAATGCTGTGCAAGGATGAGTGTGAACGACACCAA ATGGTGCA AGTCGC RGFTTKTGLGVHTRR CGGGATTGTTGCTGACCAGGAGGTGCCAACCAAATTTGAATGGATTGAC TGCAGGA GGCCTT THPTEHEAILQQNTP TTTGGGCCTGGTTTCTCCTGCGTGTATTGCACGGAAAAACAAGTGGCTA GTGGTGGT CCATTTT GRKVRWGEEEVEIM CACGTGTGGCCGTCGTGTCCTGGGGTTTCGCAACACAACTCCACAAGAT TTAAATGG GGGTAG AHKEAQQKDEDINM CGACAACTATGAGGATGACAATGTACTTAAAGAACAAAGAGACTGACG CGAGACTC GCAGCA NQLIQNSVMPHRTLE CCAAAGGGGATTTAAACCGCCAAATCGTACATTGGGTCTCACTACAATT TACAGGGC GACCCA AIKGKRRNIKYKELVR TTTTTACGTGTATTTATTTTCCTAAGTGTCTGTACTTGCCATTCTTCGCTGC TTCCATGG TCTATAT TLKETTYKVENQCLV TTTTTCTGCATTAATTGCATATCGTATGCAAATAAGCGAATTAACCACCA CTACACGG AACAAA NLVLPTTSEITTTPSE CCGTGCAACTATATGCAGATGTTACAGCTGAGCCCTCTATCATACCGGT GATGCAAG CTACTTT GDQPAIRAEKEQSPT GTACTAATCTGGTATGGTGTTGGCATGCTATGCTTGCGTAACGACCTTTG GCATCAGA GCCTTTC AAEDLQVIINDLKSQ CTGATTGGTTCAGTCGGCTGATGGTGGGTTCAGGCGAAACATTTGTATA CATTTTGG ATAGGG NFSHNQALLLLNSHV TTGGTTTAATCAAACCGAAACACTAAAATTTTGAACACAGTTTTCCATTA CACAGGCA GTACCC EKFLNRSKPIKRKDHV CACCAGTTGTATTGCTAGAAGTGCAAATCGAAGGAGTCAATTTTGACCG ATCCTTTT GACCCT NQQEIDENRHRRQS ACGATTAGCTGCCGATGTGCGGTGAAAAAGCTGATCACAATAGCATACA GGTCTCTA ACCAAC KQTKYRRYQYLYHTN CTTGGGCCGACAACCCCGTGTGCTATAAACGTAAGTCGCGAATTATAAA CCGCAATC TTTCGG KKALLDEITSDRSGPSI GAAAACAAACCGGACGGACTACTCGGTGACGAACTAACATCGCTCATG ATGTCTTA TTTCGG YPTEESIRGTFVTLFES AATGAGCGATTAACAGACGAGCTGACTACGGAATTTATCCTTTCGGACA GACCTCAG GGAAGT NSPPDNIPSKLKNDQ TGTTTTTATGGGACTACCCATGCACAGATCAGAACAAATGTTATCCATGC TAGCGACC AAAAGA SCIDIVKAITLDELIKTL AATCTTGTTTTCCTAGACCACAGAACTTGGTCATCACATATGGCACGGGT ACTACAAC AA(SEQ AIMKDKSPGQDNITL ACATCCACATGCAAACAAAACGTATAAATGTCGAATTTGTAATCGCACA CACAGTGG IDNO: SDLRTLPIKYLLDILNII GCAGATAGCATACACAAGATAGCGTCACACTACGGAAGAACTTGCAAA TGACTGCT 1338) LYIQDIPQIWKQHRT AGTTTAATAGGTAAAACTAATGCTATAACCACCACAATTGATGAAACACT GTTGAGTG RLIPKTKEELEKPSNW ATTTAGTTGTTTACATTGCAGCAGAGGTTTTACTACGAAAACAGGTTTAG AAGGACG RPITISSIVIRLLHKILSY GGGTACATACTAGACGAACTCATCCGACAGAACATGAGGCTATACTACA ACTGAGCG RLGQQLKLNYRQKAF GCAAAACACACCAGGAAGGAAAGTTAGATGGGGAGAAGAAGAGGTAG CTGGATAA LPVDGCFENSALLHFI AAATTATGGCCCATAAAGAAGCCCAACAGAAGGATGAGGACATAAACA CAACTTTC IHNARQKHENTQIVSI TGAATCAACTAATACAGAACTCAGTTATGCCACACAGAACGCTAGAAGC TTGCGTGG DLSKAFDSVSHESIIR GATTAAAGGGAAGCGGAGAAATATCAAGTATAAGGAATTGGTAAGGAC CCCAACAT ALNRFNLSKESITYLT TTTGAAAGAAACTACCTATAAGGTAGAAAATCAATGCCTTGTTAACTTAG CGAAGCAA NIYKCNLTDIVFGSTI TTTTACCGACAACATCGGAAATAACAACTACACCTTCGGAAGGAGATCA CCACTTCG MRNINLKRGVKQGD GCCAGCAATAAGGGCCGAAAAAGAACAATCACCGACAGCAGCTGAGGA GAGCTGGC PLSPLLFNMIMDELL TCTTCAGGTCATAATTAACGATCTAAAGAGCCAGAATTTTAGCCACAATC ACAAGGCA DNLPTYIGVNVGNQ AGGCGTTACTGCTACTCAATTCTCATGTAGAAAAGTTTTTAAATCGAAGT AGAGGGC KVNSMMFADDLILF AAACCAATTAAAAGGAAAGATCACGTAAACCAACAGGAGATAGATGAG AGCCCAAG AETECGMNKLLDITT AATAGGCATCGAAGACAATCAAAGCAAACTAAATACAGGAGATATCAA GTGTGAAT KFLDDRHLKININKCN TATTTATACCATACGAACAAGAAAGCTCTATTAGACGAGATTACTTCAGA CATCTCAA SLRFIKYGKqKTFSVA TAGATCGGGGCCAAGTATATACCCAACTGAGGAAAGCATACGGGGAAC CTTCACTG TTSSYFINNEPINPVS ATTCGTTACTTTATTCGAGTCAAACTCTCCTCCAGATAATATACCCTCTAA CAGGAAG YVKGFKYLGIEFDPRG ATTAAAAAACGACCAATCCTGCATCGATATCGTAAAAGCAATCACCTTA AAATGCTG KRSISCNLLAAMLNKL GATGAGTTGATTAAGACCCTAGCAATTATGAAGGATAAGTCACCTGGAC TGCAAGGA TRAPLKPEKKVYLINN AGGACAACATTACTCTTAGCGATCTTAGGACTTTACCAATAAAATATTTA TGAGTGTG NLIPRIIHQLVLGKVT CTAGATATCTTAAATATCATCCTTTACATACAGGATATACCACAAATATG AACGACAC KGLLMSLDSEIRKTVK GAAACAGCACAGAACAAGACTTATCCCGAAAACTAAAGAGGAATTAGA CAACGGG LLLRLPHDTPDSFFYT AAAACCCTCAAATTGGAGACCCATAACCATCTCATCAATTGTAATTAGGC ATTGTTGC SVSNGGMGIRNLCD TATTACATAAAATTTTAAGTTATCGTCTAGGACAGCAATTAAAGCTTAAT TGACCAGG SVALSIINRHNKLITSD TACAGGCAGAAAGCATTCCTCCCGGTAGACGGATGTTTCGAAAATAGTG AGGTGCCA DLVIRALSQQSYTIAT CATTACTACACTTCATCATACACAACGCCAGGCAAAAGCACGAAAACAC ACCAAATT LKQAHIIAGSKFPSKS GCAAATAGTGTCAATAGACCTCAGTAAGGCATTCGATTCTGTCAGCCAC TGAATGGA LNQNKWSNKLYQTT GAATCGATTATTAGAGCCTTAAACCGATTTAACTTATCAAAGGAATCCAT TTGACTTT DGRGLVYCQSQTEN AACGTACTTGACCAACATCTATAAGTGTAATCTAACTGATATTGTATTTG GGGCCTG NSWITGNHRTIKSYN GATCGACAATAATGCGTAACATAAATCTAAAAAGAGGCGTAAAGCAAG GTTTCTCC YIDMVKLRINALPTKS GAGATCCACTTTCACCGTTACTATTTAACATGATTATGGATGAATTATTA TGCGTGTA RCNRGTLETKQCRFK GATAACTTGCCGACATATATAGGAGTTAATGTAGGAAATCAGAAAGTAA TTGCACGG CRSINNQISEETLAHIL ATTCTATGATGTTTGCAGACGACCTTATCCTATTTGCAGAAACGGAATGT AAAAACAA QKCDRSHYSRIARHD GGCATGAATAAACTCTTAGATATAACTACTAAATTCCTCGATGACAGACA GTGGCTAC SLVQFLATAAQKLN CTTGAAAATAAATATAAACAAATGCAATTCGTTAAGATTTATCAAGTACG ACGTGTGG WEVIKEPTLPSDTNK GCAAACAGAAGACATTTAGTGTTGCAACGACATCATCGTACTTTATAAAT CCGTCGTG AKPDLILVRDSHVLIV AACGAACCCATTAATCCGGTATCATATGTAAAGGGATTCAAATATCTAG TCCTGGGG DVAVPWESRSLAHA GCATTGAATTTGACCCAAGGGGAAAACGATCTATAAGCTGTAACCTGCT TTTCGCAA YDFKVKKYATDKKM CGCAGCAATGTTAAACAAACTGACCAGAGCACCGTTAAAGCCAGAAAA CACAACTC QAYLKTIYPEKEIRTEA GAAAGTATATTTAATCAATAACAATTTAATACCTCGTATTATTCATCAATT CACAAGAT LIISARGGWCALNN GGTCCTCGGAAAAGTTACCAAGGGTTTATTGATGTCACTTGATTCTGAA CGACAACT MVTKKVGLSSAWVK ATTAGGAAAACAGTAAAGCTTCTGCTCAGGTTGCCACACGATACGCCCG ATGAGGAT LALIKVMEGSVKIWR ACAGTTTCTTTTATACATCAGTATCCAACGGAGGAATGGGTATAAGAAA GACAATGT SWSKG(SEQIDNO: TTTATGCGACTCAGTTGCACTATCTATAATAAACAGACACAACAAATTGA ACTTAAAG 1460) TAACTTCAGATGATCTAGTAATAAGAGCATTATCACAACAATCATACACT AACAAAGA ATTGCAACGTTAAAACAGGCCCATATCATTGCAGGCTCCAAATTTCCTTC GACTGACG GAAATCTTTAAATCAGAACAAATGGTCAAATAAACTATATCAAACAACA CCAAAGG GATGGTCGGGGGTTGGTATACTGCCAATCTCAAACAGAAAACAATTCAT GGATTTAA GGATAACAGGGAATCATAGAACAATAAAATCGTATAATTACATAGACAT ACCGCCAA GGTTAAACTAAGGATTAATGCACTACCGACTAAATCGAGATGCAATCGA ATCGTACA GGGACGTTAGAGACCAAGCAATGTAGATTTAAATGTCGAAGTATTAACA TTGGGTCT ACCAAATTTCAGAGGAAACATTGGCACATATCTTGCAAAAGTGTGATCG CACTACAA CAGTCATTATTCAAGAATCGCAAGGCATGATTCTTTGGTGCAATTTCTGG TTTTTTTAC CAACGGCCGCACAAAAACTAAACTGGGAAGTGATCAAAGAACCCACTTT GTGTATTT ACCGAGCGATACAAATAAGGCAAAACCGGACTTAATTTTAGTAAGAGAC ATTTTCCT TCTCATGTCTTGATAGTAGATGTGGCAGTTCCGTGGGAGTCTCGATCATT AAGTGTCT GGCACATGCATACGATTTTAAGGTGAAAAAATACGCTACTGACAAAAAA GTACTTGC ATGCAAGCATATTTAAAAACTATATATCCGGAAAAAGAAATTAGAACGG CATTCTTC AGGCTTTAATCATATCTGCACGTGGGGGCTGGTGCGCTTTAAATAATAT GCTGCTTT GGTAACAAAAAAGGTGGGATTGTCAAGTGCATGGGTAAAATTAGCATT TTCTGCAT GATCAAGGTCATGGAGGGTTCCGTAAAGATATGGCGCTCTTGGAGCAA TAATTGCA AGGATAATTTAAGGTAAAATCGTGGGATTGTTTTGATGGCAATCTGCCT TATCGTAT AGTCGCGGCCTTCCATTTTGGGTAGGCAGCAGACCCATCTATATAACAA GCAAATAA ACTACTTTGCCTTTCATAGGGGTACCCGACCCTACCAACTTTCGGGGAAG GCGAATTA TAAAAGAAA(SEQIDNO:1092) ACCACCAC CGTGCAAC TATATGCA GATGTTAC AGCTGAGC CCTCTATC ATACCGGT GTACTAAT CTGGTATG GTGTTGGC ATGCTATG CTTGCGTA ACGACCTT TGCTGATT GGTTCAGT CGGCTGAT GGTGGGTT CAGGCGA AACATTTG TATATTGG TTTAATCA AACCGAAA CACTAAAA TTTTGAAC ACAGTTTT CCATTACA CCAGTTGT ATTGCTAG AAGTGCAA ATCGAAGG AGTCAATT TTGACCGA CGATTAGC TGCCGATG TGCGGTGA AAAAGCTG ATCACAAT AGCATACA CTTGGGCC GACAACCC CGTGTGCT ATAAACGT AAGTCGCG AATTATAA AGAAAACA AACCGGAC GGACTACT CGGTGACG AACTAACA TCGCTC (SEQID NO:1215) R2 R2- Schmidtea CAGTGCTATTCGAATGTCAATGTGAAGAAATTCAACTAAGCTCTGGTTA CAGTGCTA GGCCAC MKKVLNNETEKLPGS 1_SM mediterranea ACGGCGGGAGTAACTATGACTCTCTTAAGGAATTAAGAATTTACCTGCC TTCGAATG GCGCGT NLTFMCGFCDREFDT KTAKTAAAARTGAAATCMGTTGTTCATWGCAAGTGGTATTGTACACCTT TCAATGTG CGTCCTT ARGRGVHESRGHLV CCCGCGGTGCTAGTCGTTTAAAACTAAGTTACAAACCACGAGGGGCGTC AAGAAATT GTTTGAT ERDAAVQSRVKAVV CTGACGGACTGSAAAAGCATTGAGRGTCMTGAAGAGAGGCTCTTATTG CAACTAAG CACTAG SKKYYYSNEEDVALA TACGAATCTCTTCAACGATCGAAGTCTGGACCGATATGAGAACTAATAC CTCTGGTT TGGATC KMQLXHADLAKSEXL ATTAGTTGACAGGTGAAAAATACTGTTGATTACTTAGTTCTCAGTCATGT AACGGCG AACCTTC EAMYLALGKGRTREA GGTATATTGCCAGTCAATTACTACATWAATATTAGTGTGGCTCTCAAAG GGAGTAAC GACTCC IEQHIRKSLRYKGVLE GAACACGATTGRTCGGCAGTCCAATGCGCGACTGGCGGGCTTGTTGTTT TATGACTC CCGGAA EQRKLLETARGNVRQ GCATTTGTTACCGGCTACTTGAAAAGGTTATATATAGCAGACGCTTAAA TCTTAAGG CTGTGG NNVGVPASNATKNL GCGCGACTGTAATTTACATYTCATTGCCCAGTATTTGTCTTTTGTCAGATT AATTAAGA GAGTGG QRFLESLPLGTNRREE TAGCAAAATTTCATATTTTGTTAATTACCTTAACTGGTTAAACGATCCCAT ATTTACCT CGGAAG RLDRIIRSNSIESQRLE AATTGCTTGCAATTATTATAAAGTAATTCAGGTAAAAATTACATATCTGG GCCKTAKT AAAGGC LIHYCNDMCQDFVQ CTGATCCTGCCAGTAGTCATTTTACTTCCGCCGCGCTATAAAACAGTTTA AAAARTGA CAGAGG LDCQXNPINAIRRRN AAAACTGAATAGGAATCAAAAAGAACATGGCAAGCGACTATATGTAAC AATCMGTT ATGTCCT PKRLSKKQLKRAKFSA TGGGCATTCAACATTCCCTATTACATATGGTGGTGCCTGGGGTCTGTTTT GTTCATW GAAACC LQRLWIRDRKAAAQL ATATAATGGGTACCCGGGAAGTGGATCTGTATCACCAGTCATGGTGCCA GCAAGTG ATATATT VLKDKLDSLLSNKEDS TATCTTTKGATAAAGATACAGTTTAAAACTGCGATGATACTAATAGAGAT GTATTGTA TATTTAT KDLGSYWQQVFERE CCTCTTAGACCTTCGTAAAGAAGTGGGGATTGATGACATTAGCATTGGA CACCTTCC AGAAGT SELDRRPIPQVVENEE AGAATTAAATCTCCAAGGAAATGGAGTAACTTCAATGAAGTCCCACAAC CGCGGTGC TTTACTT LNSPVLEKEVEWAVK CCCGTTGAAGGGCTGGGTTCGAGTATCGAGAGAAAACTCTAAATTCTCT TAGTCGTT CATCCTA NIKKSTAAGPDGLTA TCGGTTMTGTCCAACGGAGGGGACATTACTGTAAAATATCCTCTAAAAA TAAAACTA TTTACGT LALKKIPYSELVKLFNII CAACTATGAAAAAAGTCTTAAATAATGAAACCGAAAAATTACCGGGAAG AGTTACAA ATTTCAG LLVGFLPDVLKNSRTI TAATCTAACTTTCATGTGCGGGTTCTGTGATCGGGAATTCGACACGGCW ACCACGAG TATGAA LIPEVDNPQGGGDYR AGGGGCAGAGGAGTGCACGAGAGTAGAGGTCATTTAGTTGAGCGGGA GGGCGTCC AATGAG PISINSVLTRTLNKILA TGCGGCGGTTCAGAGCAGAGTGAAAGCCGTGGTGAGTAAAAAGTATTA TGACGGAC TAAAGT KRVSEGDFGINGQKG TTATAGTAACGAAGAGGATGTGGCATTAGCGAAAATGCAGTTASAGCAT TGSAAAAG TCTCGAC FKSVDGCLENLATVE GCAGATCTGGCCAAAAGCGAACWATTAGAAGCCATGTATCTTGCATTG CATTGAGR TCGATG SILADARMKKNKLAV GGAAAGGGAAGAACTCGTGAAGCCATAGAGCAACACATAAGGAAATC GTCMTGA AGTTGG VFLDMSKAFDSVNH GTTACGTTATAAGGGGGTCCTTGAAGAACAGCGAAAGCTCCTTGAGAC AGAGAGG GGGCAA ESIVRAGEIKGYPKLL AGCAAGGGGAAATGTTCGGCAAAATAACGTGGGTGTGCCAGCTAGCAA CTCTTATT CCATTG MTYVKECYNDATTN TGCCACTAAAAATCTGCAAAGATTCTTAGAATCGTTACCCTTGGGAACG GTACGAAT GGGGTC VAGVTAKFNRGVKQ AATAGGCGCGAGGAACGATTGGATAGGATTATCCGATCTAACTCGATC CTCTTCAA CTGAAG GDPLSPALFNNVIDL GAAAGCCAAAGACTTGAATTGATCCACTATTGTAACGATATGTGTCAAG CGATCGAA AGAGGC AIERVSGTGIGYNMG ACTTTGTGCAACTAGACTGTCAAARGAACCCCATCAATGCTATAAGGCG GTCTGGAC TCTCACT GKKYSVVAYADDLVL CAGAAATCCGAAAAGACTATCGAAAAAGCAGTTGAAGAGAGCTAAATT CGATATGA GTAAAA FGESREGLQIALTALL TAGCGCTCTTCAACGGCTTTGGATAAGGGATCGGAAAGCTGCTGCACAG GAACTAAT AATCTCT EELKLNGLTPNPAKS TTAGTGTTGAAGGATAAGCTTGATAGTTTGCTCAGCAATAAAGAGGATT ACATTAGT TCGTGT ASLTFERSGPHWFAS CCAAGGATTTGGGATCGTATTGGCAACAGGTCTTCGAACGTGAGTCCGA TGACAGGT CTGTTTA TDTVTALGDQIPAM ATTAGACCGCAGACCCATACCACAAGTGGTGGAAAACGAAGAGTTAAA GAAAAATA TTCCTAG GNIETYKYLGIKFNSC TTCCCCGGTATTAGAGAAGGAAGTAGAGTGGGCTGTCAAAAACATTAA CTGTTGAT GCACNT GVVKGSLPGIYTKKLE GAAGTCCACTGCCGCAGGACCAGACGGGTTAACGGCACTTGCCTTGAA TACTTAGT GCTGCA LISKAPLKPQQRLAM GAAAATACCGTATTCCGAGCTAGTCAAACTATTTAATATAATACTGTTGG TCTCAGTC TTATGA LTDFLIPGVLHQAVF TGGGATTCCTACCTGATGTATTAAAAAATAGTAGAACTATCCTAATACCC ATGTGGTA AGCGG GQTNAGDLRSLDKRT GAAGTGGATAATCCCCAAGGGGGYGGTGATTATAGACCGATTTCGATC TATTGCCA WGAAA RRAVRSWCHLPSDTS AATTCAGTGCTCACTAGAACACTAAATAAGATCCTAGCGAAACGAGTCT GTCAATTA GTAAAG TAFIHAKAKDGGMGI CGGAAGGTGATTTTGGTATCAATGGTCAAAAAGGATTCAAAAGTGTAG CTACATW TTAGAG PSIRAEVQFGKLDRF ATGGTTGTCTAGAGAATCTAGCAACAGTTGAATCAATTTTGGCCGATGC AATATTAG CTGAGA GKLPNVKDERSKVLA TAGAATGAAGAAAAATAAGCTTGCGGTAGTATTCTTGGATATGAGTAAA TGTGGCTC GATAGG DNAHIKKKMLEKLGV GCTTTTGATTCTGTAAACCACGAGAGTATTGTTAGAGCTGGAGAAATCA TCAAAGGA TACTTGC GIPIKGVRCKNKLEFY AAGGTTATCCCAAACTATTAATGACGTATGTGAAGGAGTGTTATAACGA ACACGATT TGCATTA NKMREELIKSNDGIG CGCTACTACGAACGTCGCAGGTGTAACAGCCAAGTTTAACCGGGGAGTT GRTCGGCA TGAAGC LKEASLVPSANTWLK AAACAAGGTGATCCTCTTTCGCCGGCGTTATTTAATAACGTAATAGACCT GTCCAATG GGAGAA LSDLHMSGRTFVGCL TGCAATTGAGCGAGTTTCCGGTACTGGAATTGGATATAATATGGGCGGT CGCGACTG AGGCCT KTRGNLMATVTRTSR AAGAAGTATTCTGTGGTCGCGTATGCTGACGATCTTGTTCTATTCGGTGA GCGGGCTT CGAATA GGQNPGIELNCKKGC GTCGAGAGAGGGGTTGCAAATAGCCCTGACGGCGCTACTGGAGGAATT GTTGTTTG AATAGG QYQGSLNHIVQKCPV AAAGCTAAACGGTCTGACACCCAATCCAGCGAAAAGTGCTTCGTTGACA CATTTGTT GTGTTA VKGLRIKRHDEVVKY TTTGAGAGATCAGGGCCACATTGGTTCGCAAGTACCGATACCGTTACAG ACCGGCTA GAGTTA VEEITKKAGWSATME CACTAGGAGATCAAATACCGGCCATGGGTAACATCGAAACCTATAAATA CTTGAAAA TTGATG PIIPFEGSHRKPDLVL CTTAGGAATCAAGTTTAATTCGTGCGGAGTGGTTAAAGGGAGTCTACCC GGTTATAT GAGAGT VRGDLGKVVDIQIVS GGGATATACACCAAGAAATTGGAGTTAATCTCTAAGGCTCCTTTGAAGC ATAGCAGA ATACTA DHCGLDEKNSCKIGK CGCAACAGCGGTTAGCTATGCTAACCGATTTCTTGATTCCTGGAGTATTA CGCTTAAA GTAAGC YDNDIIRNYVRGLGP CACCAAGCCGTGTTTGGACAGACAAATGCAGGGGACCTGCGCAGTCTC GCGCGACT TTAAGCT SRVEVAAITLNWRGV GATAAACGGACGAGAAGAGCGGTGAGATCTTGGTGTCACCTTCCCTCTG GTAATTTA GCGCCT WSRDSFNLIKRLGMT ATACGTCAACGGCATTCATCCATGCTAAAGCTAAGGATGGAGGAATGG CATYTCAT CGCGCG EMDAKIISMRVLAST GAATTCCATCTATAAGAGCCGAAGTCCAATTTGGAAAATTGGATAGATT TGCCCAGT GTGCCC AKMFKTCKKVLEPVC TGGAAAGTTACCCAATGTCAAAGATGAACGATCKAAAGTTTTGGCCGAT ATTTGTCT AAAAAT RTKTADCDGYGPEET AATGCTCACATTAAAAAGAAGATGTTAGAGAAATTGGGAGTGGGGATC TTTGTCAG ATACTTA SARPCHELNLKESSGT CCAATCAAAGGAGTGCGTTGTAAGAACAAACTCGAGTTCTACAACAAGA ATTTAGCA ATGAGA *(SEQIDNO:1461) TGCGTGAAGAGTTGATTAAGTCGAATGATGGCATTGGCTTAAAAGAGG AAATTTCA GCAATA CATCTTTGGTTCCCTCTGCGAACACTTGGTTAAAACTGAGTGACCTACAT TATTTTGTT ACTCAA ATGAGCGGTCGCACATTTGTAGGATGTCTTAAAACACGGGGAAACCTTA AATTACCT GGNGAG TGGCTACCGTAACAAGAACTAGCAGAGGAGGTCAGAACCCGGGTATAG TAACTGGT TTTAATT AGTTGAACTGTAAAAAGGGATGCCAATATCAGGGAAGTCTGAACCATAT TAAACGAT CATATG AGTCCAAAAGTGCCCAGTAGTGAAAGGGTTGAGGATAAAACGACATGA CCCATAAT CGCATG TGAAGTCGTTAAGTATGTGGAAGAGATCACGAAAAAGGCTGGATGGTC TGCTTGCA CGGCAC TGCAACAATGGAACCAATAATTCCGTTCGAGGGGTCACACCGAAAACCG ATTATTAT CAAGGT GATTTAGTTCTGGTCCGGGGCGATCTCGGGAAAGTTGTGGATATTCAAA AAAGTAAT GCTGAA TAGTTTCTGATCACTGCGGTCTGGATGAAAAAAATAGTTGTAAGATCGG TCAGGTAA TGGCAT TAAGTATGATAATGACATCATACGAAATTATGTAAGGGGGCTAGGACCA AAATTACA CGATTA TCGAGAGTGGAGGTAGCGGCGATAACCCTTAATTGGCGAGGCGTGTGG TATCTGGC AACCTCT AGTAGAGACTCATTCAATCTCATTAAGAGATTAGGAATGACAGAAATGG TGATCCTG CCTGTT ATGCGAAAATCATTTCTATGAGAGTGCTGGCGAGCACTGCGAAGATGTT CCAGTAGT GTAGAA TAAGACGTGTAAAAAAGTGTTAGAGCCCGTTTGCAGAACCAAGACCGCT CATTTTAC GCAGGT GATTGTGATGGTTACGGACCGGAGGAAACCTCCGCCCGACCATGTCATG TTCCGCCG CATAAA AGCTTAACTTGAARGAGAGTTCGGGGACGTAAGGCCACGCGCGTCGTC CGCTATAA TGGAGG CTTGTTTGATCACTAGTGGATCAACCTTCGACTCCCCGGAACTGTGGGA AACAGTTT RGGGCA GTGGCGGAAGAAAGGCCAGAGGATGTCCTGAAACCATATATTTATTTAT AAAAACTG ACCACT AGAAGTTTTACTTCATCCTATTTACGTATTTCAGTATGAAAATGAGTAAA AATAGGAA GAAACT GTTCTCGACTCGATGAGTTGGGGGCAACCATTGGGGGTCCTGAAGAGA TCAAAAAG TATGAG GGCTCTCACTGTAAAAAATCTCTTCGTGTCTGTTTATTCCTAGGCACNTG AACATGGC CCAGAA CTGCATTATGAAGCGGWGAAAGTAAAGTTAGAGCTGAGAGATAGGTA AAGCGACT GAAGCT CTTGCTGCATTATGAAGCGGAGAAAGGCCTCGAATAAATAGGGTGTTA ATATGTAA TAACTAC GAGTTATTGATGGAGAGTATACTAGTAAGCTTAAGCTGCGCCTCGCGCG CTGGGCAT AWAAGT GTGCCCAAAAATATACTTAATGAGAGCAATAACTCAAGGNGAGTTTAAT TCAACATT TTTAGG TCATATGCGCATGCGGCACCAAGGTGCTGAATGGCATCGATTAAACCTC CCCTATTA CAATTAC TCCTGTTGTAGAAGCAGGTCATAAATGGAGGRGGGCAACCACTGAAAC CATATGGT TGAACG TTATGAGCCAGAAGAAGCTTAACTACAWAAGTTTTAGGCAATTACTGAA GGTGCCTG GAGTTA CGGAGTTAACTGTTAGTTAACACTACCATGTAGTIGTTTATAAAGCAAAT GGGTCTGT ACTGTTA ATCAGGTTTCAGTCTATATACTAAAAGTATTTTTTGATACCGTGGTATAT TTTATATA GTTAAC AGGCAACTAGTTAGGAAATAGTAAGGGATGACGCATTGTCTCTCTTTAT ATGGGTAC ACTACC GGTACTGAGGAAACTTATCGACTCGCGAGGGATGAAACCCGTAAACCG CCGGGAA ATGTAG ATCGATYTAGCCTATAAGTACCAGCGACAGTTAAACCATCTTACGCGAG GTGGATCT TTGTTTA GGGTAAAACCTGAGGACCGATTATGGTATAACTTCTCAAGATTAGCACA GTATCACC TAAAGC AAATGCGAGTGCAACTTGAGGAGGAGGATTTGAGTGTTAATTCATAAT AGTCATGG AAATAT GTACTAATCTAATTAAACTGTGACGGGAATTGCAGCTTCGGCTGTAATT TGCCATAT CAGGTT ACTTTGAGGCCTATCACGGATTGTAAGGAACATATTGACACCGTAAGTC CTTTKGAT TCAGTCT TAACGTGTTCCCGATTTCCAACCAGGTCATATGAAGGGCTGCCCTTGATA AAAGATAC ATATACT AGGCGGATTTGACCCAATTCTTCATATGAGAGGCTTATTCCAGCCTTCCC AGTTTAAA AAAAGT GTAGTACCGTGAGGTTTTCCCGCCTCGAACGGAACAATGTTGCAGGGTA ACTGCGAT ATTTTTT ATTAAGTACATCGGGCTATATMGCGATATTTAACGTTTTA(SEQIDNO: GATACTAA GATACC 1093) TAGAGATC GTGGTA CTCTTAGA TATAGG CCTTCGTA CAACTA AAGAAGT GTTAGG GGGGATT AAATAG GATGACAT TAAGGG TAGCATTG ATGACG GAAGAATT CATTGTC AAATCTCC TCTCTTT AAGGAAAT ATGGTA GGAGTAAC CTGAGG TTCAATGA AAACTT AGTCCCAC ATCGAC AACCCCGT TCGCGA TGAAGGG GGGATG CTGGGTTC AAACCC GAGTATCG GTAAAC AGAGAAA CGATCG ACTCTAAA ATYTAGC TTCTCTTC CTATAA GGTTMTG GTACCA TCCAACGG GCGACA AGGGGAC GTTAAA ATTACTGT CCATCTT AAAATATC ACGCGA CTCTAAAA GGGGTA ACAACT AAACCT (SEQID GAGGAC NO:1216) CGATTAT GGTATA ACTTCTC AAGATT AGCACA AAATGC GAGTGC AACTTG AGGAGG AGGATT TGAGTG TTAATTC ATAATG TACTAAT CTAATTA AACTGT GACGGG AATTGC AGCTTC GGCTGT AATTACT TTGAGG CCTATCA CGGATT GTAAGG AACATA TTGACA CCGTAA GTCTAA CGTGTT CCCGATT TCCAACC AGGTCA TATGAA GGGCTG CCCTTGA TAAGGC GGATTT GACCCA ATTCTTC ATATGA GAGGCT TATTCCA GCCTTCC CGTAGT ACCGTG AGGTTTT CCCGCCT CGAACG GAACAA TGTTGC AGGGTA ATTAAG TACATC GGGCTA TATMGC GATATTT AACGTTT TA(SEQ IDNO: 1339) R2 R2- Strongylo- TCTCGCGACGCGTTCTTCTGCCTGATGAAGTCACGTCAGGTAATAGACTT TCTCGCGA TAAACCT MENSFAWEGTSSAE 1_SP centrotus AGAAGGTTGATGAGCGTTCCTCTCCTGGACCGGGGGTGAGGATGTGTT CGCGTTCT TGCCTCC GRTTVEDSPSSSDDF purpuratus CTACTGAATCATCGTTCCGGTTGTGAGGTCCGCTGCAAATAGGCCTTGG TCTGCCTG CCGGGC VSNVGFKVAKADPTV GGTGGTCTACCTCCGGGTCGCTACTCCTTTTGAGCTAGTCGATCCAGGT ATGAAGTC CCCCCTC WEEANMSEDNTIIED GAGAGTCGGGGAAGCCCACTTAGGTGGGCCAGCTAAGCAGATCACCCC ACGTCAGG AGTGAC PPSSSDDFANNVGFK CCCAGCACGACAGTGCGCTGTAATATAGCCTGTTGAGAGTGCACCCATT TAATAGAC TAAGAC VTRADPTAWEEASTS TATACTGTTATAAAAGTATAAATGGTTCTTAGCTAACCTATCCAATGATT TTAGAAGG AACTTTC TETEDLPSSSNFIDNV TTTATTGTTATGTATAATACTGACCACTTTAGTCTAGTGTCATGTAAGGA TTGATGAG ACCGTA ETQIDMAGPTAWED TCCGAGACAATAGCCTATGTCTCTAGTCTAGCAATAACACAAGAGGGAT CGTTCCTC ATAATC ADSNEDNIDEGTPN AATCCACCTCCCTACTCCAGCCACCACCTTTACCTTTCTTATCTTCCATATA TCCTGGAC ATATATT NINNNLAIVRGRADA GATGCCAACCAGACGGGTCGATGGGCAGTGAGAAACAAGAATGGGGA CGGGGGT TGTATAC YACSCCERNFISLKAI TAATTTGGAATTTGACATTCTTTCCTAAAGCAGACCTGAAGGGTTAAGA GAGGATGT CATGTAT GTHLKETHNKKVVFE GTCACAAGCGGAGTCCGGACTGTCTTTATAAACAGACAATATTTTCTTAT GTTCTACT TAATCTA CAKCQHTFVKAHGL TTTACCACCTATATGGGGATATTTCCCAACTCGTAATATAGGGCCCACTG GAATCATC GGTAAC ACHVPKCKEDADTP TCAAGTGGCTGGTATAAGTTACCCTGTGGGTAACAAATAAATTCAAACA GTTCCGGT AATTGA MLNRLLHGCGECGL AAGAATGGAAAACTCCTTTGCGTGGGAGGGTACCAGTTCTGCTGAAGG TGTGAGGT AAGTAA AFNTRRGLSQHERHR CAGAACAACAGTTGAGGATTCACCCTCAAGCAGTGACGATTTTGTTAGC CCGCTGCA CATTGA HPSACLSTRRRSRLD AATGTCGGATTCAAAGTAGCTAAGGCGGATCCCACTGTGTGGGAGGAA AATAGGCC ACCGTA GIARKKSLRNRRDIW GCCAATATGTCAGAAGACAATACAATAATTGAGGACCCACCATCGAGTA TTGGGGTG TTCATAC TNDEIRLLKQLMIQYE GCGACGACTTTGCTAACAATGTCGGCTTCAAAGTAACTAGGGCAGATCC GTCTACCT TTTAATG HAKKINIKIAEHFNHK CACTGCGTGGGAGGAAGCCAGTACATCTACAGAGACTGAGGATTTACC CCGGGTCG AGTACA NAKQVMHKRRSLRE CTCGAGCAGTAATTTTATTGACAATGTCGAGACCCAAATAGATATGGCG CTACTCCT ATAGTG KDMALGAPHDAPPP GGCCCCACTGCGTGGGAGGACGCCGATAGCAATGAAGATAACATAGAT TTTGAGCT AAGGGA LAEEPIIEVVEGAREEL GAGGGCACACCCAATAATATTAATAATAATTTAGCAATAGTCAGAGGGA AGTCGATC ACTGTAT EQAPVDVILPDLEALT GGGCGGATGCCTATGCATGCTCGTGTTGCGAACGGAACTTCATTTCTCT CAGGTGA ATTACAT VNDGRGGGSPVLTE CAAGGCAATCGGTACTCACCTCAAGGAAACCCATAACAAAAAGGTTGTC GAGTCGG ACCTCG GGESTRDMEENGGT TTTGAATGTGCCAAATGTCAACACACCTTCGTTAAAGCACACGGACTAG GGAAGCCC AGATAG DTRSPSPREERAGST CGTGCCACGTCCCCAAATGCAAAGAGGACGCCGACACACCTATGCTTAA ACTTAGGT AGGTTTT PWERGWQPRVDRG TAGACTGCTCCACGGCTGTGGAGAGTGCGGTCTTGCATTCAATACGAGA GGGCCAG TGTACCT RGEYKGYGGERGDR AGAGGGCTGTCTCAACACGAGCGCCATAGGCACCCGTCGGCGTGTCTA CTAAGCAG TAAGGG HTVSLSQRGESGVDP AGTACTAGGAGAAGGTCACGGCTAGATGGTATAGCTAGGAAAAAATCA ATCACCCC TTCGTG LTPGGVVEDYDDSYL TTGAGGAATAGGAGGGATATTTGGACGAATGATGAGATCCGTCTACTC CCCAGCAC AGAATC EDYFPGWDEDEHM AAACAACTCATGATACAGTACGAGCATGCTAAAAAGATAAACATTAAGA GACAGTGC CATACT HIIGRLDLSDESEQGE TAGCGGAACATTTCAACCATAAGAATGCAAAACAAGTCATGCATAAACG GCTGTAAT GCATAA AAVSPRLGSFGDLLEE TAGATCCTTAAGGGAAAAGGACATGGCCTTAGGTGCTCCACATGATGCA ATAGCCTG AGGGGT VNAMEGKDNLSEAL CCTCCGCCATTGGCAGAAGAACCGATAATAGAAGTAGTTGAGGGTGCG TTGAGAGT TAACTTG AETLGLVLHEGHRVE AGAGAAGAGTTAGAACAGGCTCCTGTAGATGTAATCCTTCCAGATCTTG GCACCCAT TAACTCA YIKEKMNINVKQMA AAGCACTCACGGTTAATGATGGGAGAGGGGGTGGCAGCCCCGTGTTGA TTATACTG ACCCCA TEILAHGANKGNPKR CCGAGGGAGGGGAGAGTACAAGGGATATGGAGGAGAACGGGGGGAC TTATAAAA GGGAGA RKEAVAAKRNPGTRL CGACACACGGTCTCCCTCTCCCAGAGAGGAGAGAGCGGGGTCGACCCC GTATAAAT CAAGCC DRAQRDNQAKAKAK CTGACTCCGGGGGGAGTGGTAGAAGATTATGATGACTCGTACTTAGAA GGTTCTTA GACAAA EKKRIFSETQTQYKKN GACTACTTCCCTGGGTGGGATGAGGATGAGCACATGCATATTATTGGGA GCTAACCT TTCGGC PHRLVEKLLDGKGDE GGCTCGACCTAAGTGACGAGTCGGAACAAGGGGAGGCTGCTGTCAGTC ATCCAATG ATTACTA RCSVSLEVIQRTYMN CAAGACTAGGCTCCTTCGGAGACCTATTAGAAGAAGTTAATGCAATGGA ATTTTTATT TGTGAG RFSRESKEVDIGAYVD AGGCAAGGATAACCTGTCTGAGGCACTGGCCGAAACCCTGGGCCTTGT GTTATGTA GGTCAT PETVEDNQGIVDPISK GTTACACGAAGGTCATAGAGTAGAATACATTAAAGAGAAAATGAACAT TAATACTG AAGTGT AEIERAISTTKKGSAP AAATGTTAAACAAATGGCAACAGAAATATTAGCCCACGGGGCAAATAA ACCACTTT TAAAGG GPDGVTYDALKAYG GGGAAACCCCAAACGGAGAAAGGAGGCGGTCGCCGCCAAGCGGAATC AGTCTAGT ACCCATT NCQLYLLIMYNTWLA CCGGTACGAGATTAGATCGAGCCCAGAGGGACAACCAAGCGAAGGCCA GTCATGTA GTATAT MGKVPSEAKTYRSILI AGGCCAAGGAGAAAAAGAGAATCTTCTCCGAAACACAGACCCAATATA AGGATCCG AACTTGT PKGQGDPMDINNFR AGAAGAATCCCCACAGACTCGTGGAAAAACTTCTTGATGGGAAGGGTG AGACAATA AAATGT PLTLANVISRLYSKILT ACGAGCGGTGCTCTGTCAGCCTTGAAGTGATCCAAAGGACCTACATGAA GCCTATGT ACAGAG RRLDGAVSVCPRQR CAGGTTCTCAAGAGAAAGCAAAGAGGTTGACATAGGAGCCTATGTTGA CTCTAGTC AATTCG GFTHKASIEDNTLILR CCCCGAGACTGTGGAGGATAACCAAGGGATAGTGGACCCCATCAGCAA TAGCAATA AGTTGA ELIMKSKRNKECLAV GGCTGAAATAGAAAGGGCCATCTCTACCACGAAAAAGGGATCCGCACC ACACAAGA GTTCGA VLLDLAKAFDTVSHD TGGACCGGACGGGGTCACCTATGATGCCCTAAAGGCGTATGGAAACTG GGGATAAT AGAATA LIIKALRRHRVHEHLIS TCAATTATACTTATTAATAATGTACAACACATGGCTGGCCATGGGGAAG CCACCTCC AACAAA VIMDLYEGGTTSFTT GTCCCATCAGAGGCAAAAACCTACAGGTCAATTCTAATCCCAAAAGGTC CTACTCCA CAAAAA DEGTTCPIAIRSEVKQ AAGGGGACCCCATGGACATCAACAATTTTCGTCCATTAACACTCGCCAAT GCCACCAC AGAAAA GDSLSPVLFNLALDPL GTAATCAGCAGATTATATTCCAAAATCTTGACCAGGAGGCTGGACGGG CTTTACCTT CAAAGA LATLEQRGKGVEIGG GCGGTAAGTGTCTGTCCGAGGCAAAGAGGCTTCACTCACAAAGCTAGT TCTTATCTT GAATTC HTFVSLAYADDTALV ATTGAAGACAATACACTCATCCTACGGGAACTCATCATGAAGAGTAAGA CCATATAG GTTCGT SSSHLDMTANLDITV GAAATAAAGAATGCCTTGCAGTGGTACTTCTTGACTTAGCCAAGGCTTT ATGCCAAC CCATAG EYLNATGLSLNVRKC CGACACCGTTTCCCACGATCTCATCATCAAGGCCCTGAGGAGGCATCGG CAGACGG AAGCGG QGFLLTPINKSFLVNE GTGCATGAACATCTCATTAGCGTCATAATGGACCTGTACGAAGGGGGA GTCGATGG AAAAAG AESWVVEREAIPWV ACTACATCTTTCACGACAGATGAGGGCACCACCTGCCCCATCGCGATCC GCAGTGA CGCAAA EPGDTAKYLGVQVG GGAGCGAAGTTAAGCAAGGAGACTCTTTGTCGCCGGTGTTATTCAACTT GAAACAA AAGGCG PWSRPWPSIQPVIKR GGCTCTCGACCCGCTACTAGCTACACTCGAGCAAAGAGGGAAAGGTGT GAATGGG TCCTAAC LTAYCESIDKAALKPR TGAGATCGGGGGACACACGTTCGTCAGCCTGGCCTATGCGGATGACAC GATAATTT CTCACG QRIHILTTYIAPRIAFEI GGCACTCGTCTCGTCCAGCCACCTGGACATGACCGCAAACCTAGACATA GGAATTTG GTCTAA AEGGYSTLVDCRGGI ACTGTAGAATACCTCAATGCAACGGGACTGAGTCTCAATGTGAGGAAAT ACATTCTT CCCAGA QYTRIREVDMTIRNY GTCAAGGCTTCTTACTCACCCCCATCAATAAGTCCTTCTTAGTCAATGAA TCCTAAAG AACATTT VRKWLFLPACLSNSF GCCGAGAGCTGGGTAGTGGAACGGGAAGCTATCCCGTGGGTTGAACCT CAGACCTG AAGGGG LYTRRGEGGLGLVSF GGGGACACAGCTAAATATCTGGGGGTACAAGTGGGTCCGTGGTCACGT AAGGGTTA GGAAGG YDYVPTERMRKLVRV CCGTGGCCATCCATTCAACCAGTAATTAAACGCCTCACAGCCTACTGTGA AGAGTCAC AGCTCA CDSEDPVIAGAAASL GAGCATCGACAAAGCGGCGTTAAAGCCAAGACAGAGAATACACATTCT AAGCGGA TTACTCC GLRERAAKISAQTGL AACGACCTATATTGCCCCTAGGATTGCCTTTGAGATTGCAGAGGGGGGA GTCCGGAC AATCCA PVPVKPKGAHNAWR TACAGTACACTCGTATAAGGGAGGTAGACATGACAATCAGAAACTACGT TGTCTTTA AAACCA KVQKKKWKAQPTQG CAGGAAATGGCTATTCTTGCCCGCCTGCCTTTCAAACTCCTTCCTTTACAC TAAACAGA CGTTCCC KGVSCYQHRLGNKW AAGGAGAGGAGAGGGTGGTCTTGGGCTGGTAAGCTTTTACGACTACGT CAATATTT TAGCCC LGAPSFLTENDYIWAI ACCGACTGAACGTATGAGGAAGCTAGTTAGGGTGTGTGATTCTGAAGA TCTTATTTT AAAAGC KLRTNLVPTREAMGR TCCGGTTATAGCAGGGGCAGCAGCCTCGCTAGGGTTAAGGGAGAGAGC ACCACCTA GCGCTT GIIGRNQVECRHCHT TGCGAAGATAAGCGCTCAGACAGGGTTGCCAGTTCCAGTTAAACCCAAA TATGGGGA CAATGA TIETMGHISGYCQMV GGGGCACACAATGCATGGAGAAAAGTCCAGAAGAAAAAGTGGAAAGC TATTTCCC GGGAGC LDIRLIRHNRICKALIK TCAACCCACCCAGGGGAAGGGAGTTAGTTGTTACCAACACAGGTTGGG AACTCGTA CGATATT AATATGLRVTEEPRIV CAACAAATGGTTAGGTGCCCCCAGCTTCCTCACTGAAAATGACTACATTT ATATAGGG GGAGGA GTDGKNYLPDLIFSA GGGCCATTAAACTGAGGACAAACCTCGTCCCTACTAGGGAAGCTATGG CCCACTGT CAGAGT GAGEPCYVVDPTVV GGAGAGGTATCATAGGGAGGAATCAAGTGGAGTGCAGACACTGCCATA CAAGTGGC TCATCAA WDDDPKNLREAWR CAACAATAGAAACAATGGGACACATTTCTGGATACTGCCAGATGGTCCT TGGTATAA AAGCCA GKVRKYTPIIPAVEA GGATATTCGTTTGATTAGACACAACCGTATATGCAAAGCCCTGATCAAG GTTACCCT CCTGTA MLHPSSVQIFGFVCG GCTGCTACGGCTACAGGACTCAGAGTTACGGAAGAACCACGAATAGTG GTGGGTAA AGTAGG ARGTWCPMNDDIAK GGTACGGATGGGAAAAACTATTTGCCAGATCTTATTTTCAGTGCGGGAG CAAATAAA CCCATTT IVGLKNSGISRTLQIVL CTGGGGAGCCATGTTATGTAGTTGATCCAACAGTGGTGTGGGATGATG TTCAAACA CTGCCA CDTIRMVKAFMAR ATCCCAAGAACCTGAGGGAGGCCTGGAGGGGCAAGGTGAGGAAATAC AAGA(SEQ AGGACA (SEQIDNO:1462) ACACCAATCATTCCGGCGGTTGAAGCGATGCTTCACCCATCATCAGTTCA IDNO: TGCGCA AATATTTGGATTTGTATGTGGCGCAAGGGGAACATGGTGCCCAATGAAT 1217) AAAGGA GATGACATAGCCAAGATTGTCGGTTTAAAGAACTCAGGAATTAGCAGG AGCAGA ACACTACAAATAGTCCTTTGCGACACCATAAGAATGGTCAAGGCCTTCAT TTATCAA GGCCAGATAAACCTTGCCTCCCCGGGCCCCCCTCAGTGACTAAGACAAC ACCAGT TTTCACCGTAATAATCATATATTTGTATACCATGTATTAATCTAGGTAACA CAAACA ATTGAAAGTAACATTGAACCGTATTCATACTTTAATGAGTACAATAGTGA AGCACA AGGGAACTGTATATTACATACCTCGAGATAGAGGTTTTTGTACCTTAAG AACATT GGTTCGTGAGAATCCATACTGCATAAAGGGGTTAACTTGTAACTCAACC GGGGGG CCAGGGAGACAAGCCGACAAATTCGGCATTACTATGTGAGGGTCATAA ATGGGA GTGTTAAAGGACCCATTGTATATAACTTGTAAATGTACAGAGAATTCGA TAACCCC GTTGAGTTCGAAGAATAAACAAACAAAAAAGAAAACAAAGAGAATTCG GGAAAG TTCGTCCATAGAAGCGGAAAAAGCGCAAAAAGGCGTCCTAACCTCACG AGAGGG GTCTAACCCAGAAACATTTAAGGGGGGAAGGAGCTCATTACTCCAATCC ATCTTTA AAAACCACGTTCCCTAGCCCAAAAGCGCGCTTCAATGAGGGAGCCGATA GATAGT TTGGAGGACAGAGTTCATCAAAAGCCACCTGTAAGTAGGCCCATTTCTG GGATGG CCAAGGACATGCGCAAAAGGAAGCAGATTATCAAACCAGTCAAACAAG AAGGGG CACAAACATTGGGGGGATGGGATAACCCCGGAAAGAGAGGGATCTTTA TGGGAC GATAGTGGATGGAAGGGGTGGGACGTTGAATCCAACCATGCCGTTTTT GTTGAA ATGTTCCCGATAAAGAAGGATAAGGTCACTCCAGCCTGACACACAAAGT TCCAACC GGGGTAAAAGAACTCCGCTCGTACGGACTCCAAATAGAA(SEQIDNO: ATGCCG 1094) TTTTTAT GTTCCC GATAAA GAAGGA TAAGGT CACTCCA GCCTGA CACACA AAGTGG GGTAAA AGAACT CCGCTC GTACGG ACTCCA AATAGA A(SEQ IDNO: 1340) R2 R2- AGKD01072455 Salmo AATCTTTAACCCCGGACTCTTGGGGTTCTTACGACTCTGTATGAGGAACA AATCTTTA TAGATC MSGKRIVEMSGCDE 1_SSa salar GTCGAAGAGAGGGCGCTACCAATCCAAGTATATGTCCCAAGAGGGCTG ACCCCGGA CGTTTGT KICQNKHCLKRRWA GGACAGGGTGGAAGAGTGCACCTCGCGATCTGGGGCAGGGAAGGAAT CTCTTGGG TATGATT WISGPKGETSPPRKR GGAGAGAAGTCGAAGAAGGCTTGTAGAGAAGGGGCTCTCCTAGATCCT GTTCTTAC GGAGGG GTCENVSFQDKSHAS AACCTGTATGACGCCCGTAAAACGGTGACCCCAGTAGCGAATAAAGGA GACTCTGT AGCCTG DPDPLKAPEAREDAG GGCAGGTGACAATAGAGGGCAGGGCCGACTTCCCAGGTTTACATTGTT ATGAGGA CCGAGT SVAPQWVGEIKTPSL GTACTTGTCAACATAAAGAGGTGTCTCAATAGTTTGAATCAACAAGGGA ACAGTCGA GGTATG TSRDGVSEVVLPPQP GAGGAATACCGACCTGCTCCCTTGGGGCGGGGGTACTGGTCTTAGCCC AGAGAGG AGCGCT VHAEGVSPASDSKDK GGTTCCCCGCAAGTTTCCTTTGCCTGGGATGTGCCTGACTGGCTCCATCC GCGCTACC CCAACT ATKITLLISLPVCDLRC CCTTTCCCCATTAGGCACGGCTAGATGACGCACCGATGGGCGGGTGTGT AATCCAAG ATTGAA GRCERPLETVGKAVR AGGTCGCTACCGAAGGGGACTGGGGGTGTCCGGTGAACCAGGACTTCC TATATGTC CCCATAT HFAVAHPTVSVVFKC CAAAATGGTCTCACATTTTTAAGCGGCTTGAGTATCGCCCAGTATCCTCG CCAAGAG GATTCCC QKCEKSSKNSHSISCH CGCGGCACTGGGAACCCAGTCAACCGCTCTGTGCCCCGGCGCAGGCGG GGCTGGG GAGGCC IPKCKGMTETRTDVE GGGTTTAATGTCTCCCCGGCTTCACCGGCGCTTCGGCGACGACGCAGAG ACAGGGT TGGCCA GDHGCDHCQEKFTT GAGCACCCGGAGGCCCCCATGAACTTAAACCAACCTATCTTGAAATATG GGAAGAG GACGCC AMGLTQHKRHRHIV GCCTCTCGTTCGGGTGAAGGGCAGGTGGGAAGAGAGGGCTGCCTCACG TGCACCTC TAGATG QYCKEKEGEMTARR ATAAACACCTAGTCAATAGCCAGTCGGGAAAAATGTGGAATGTTAGGA GCGATCTG CCTGCC KGEVEAVKWSEWEE CAGGGAGGTAAGGGAGGCGGCTTTTTCGTAAGGCTCCTTCAACCCCTAC GGGCAGG ACAATT SEVARLSDGLAGLKM CTGTAGTCCACCTATTGCAGGTGTTGACAACATGCAAGATGACCTGCCT GAAGGAA GAACGC INRRIADSLGTGKTAE CGTTACGGGTCGCGTATCATTGCTACAGGTCGTGTGCCGCTTCTAAGAG TGGAGAG AGCCCT QVRQKRRRMRPEKV GATAGTAAGGAGAGGTTATAGGGAGGTCCTGTTAGGGCTTCCTCAACC AAGTCGAA AGCTTG RCDKPKEAKDKSNLIK CCTCTCTATGCGATTCCTTACAGGAGTGGATCGAGAAGTCCCGGACGTA GAAGGCTT CTAGGA MLSIPSATPTPQTGLK ATACACCCTGGAGGTAAGGGAGTGGCCTTCTAGTAGGGCTGCTTCAACC GTAGAGA GATCCA GFLLGELNGVATKGE CCTCTGATGGGAGTGTACCGGGAACCTCGACTTGTAAGCACAGGTTAGT AGGGGCTC TAGGAA VQIGGVTLSLRGVEQ ATGGGAGCAGAAGGGGGAGCCGTAATGGGCTTCTCTTTCACCTGCTTAC TCCTAGAT CTGGCC DSALLNTSALELQRLL ATAATACCTGTGGTGCATGTATCTAGGTCTTGGCGGGAGAGTACTGAGA CCTAACCT TATGGG GGRAGSANPLSLQRE GACAAGGTTGAGACCCCAAGATTGGGTCTCCCTAGCCTCTATAGCTGCG GTATGACG GCGTCA RETTLPSERRKTKQGE ACTCTTAGCGGGGATATGGAGTAACATGTACCAAGGGAGTGAATAATA CCCGTAAA TGACGG YRRVQKMFRSNEKKI AAGGAATTGACGGGGTACAAGTGACTGTTGGCCCGAATCCTAGCCACTT ACGGTGAC TTGAAG AKYILDGNGDGEAAS GATGACCAGGGATATATTACAGACTGAGAGCGAATCTAGAGACGTGAG CCCAGTAG TTCCTCC PPLEIALAFKSRWEEV AATAAAGTGAGAATTGGTTGAGCGAATACAGAGGAAGATGAGCGGAA CGAATAAA ATAGCG ETFHGLGQFYSRGEA AGAGAATTGTTGAGATGAGCGGTTGCGATGAGAAAATCTGCCAAAATA GGAGGCA TGCTTG DGVVFRSLISMSEVC AGCATTGCTTGAAGCGCAGATGGGCGTGGATTTCCGGCCCAAAGGGGG GGTGACAA GGAGGG ENLGAIKNNTAAGPD AGACATCTCCTCCTCGCAAGAGAGGAACTTGCGAAAACGTATCCTTCCA TAGAGGG GACGAC GITKPALLEWDPTGA GGATAAATCTCATGCCTCGGACCCAGATCCTCTCAAAGCCCCGGAAGCG CAGGGCC AATGAC KLAAIFSIWLTSGTLP AGAGAGGACGCGGGTTCGGTGGCGCCTCAGTGGGTTGGCGAAATAAA GACTTCCC GAGTCA GPFKKCRTTLIPKTDD GACACCTAGCCTAACATCGCGGGATGGGGTTAGTGAAGTCGTGCTGCC AGGTTTAC TGACGT PILLTQVAGWRPLTIG ACCACAGCCAGTTCATGCAGAGGGAGTATCCCCAGCCAGCGACTCAAA ATTGTTGT ACCGAG SVVLRLYSRILTHRLER GGACAAGGCTACGAAGATCACCCTGCTGATATCACTCCCCGTCTGTGAT ACTTGTCA AGAACC ACPINPRQRGFISSPG TTGAGATGCGGACGATGTGAAAGACCATTAGAGACCGTTGGGAAGGCG ACATAAAG CCAACC CSENLMILGGLIKRS GTGAGACACTTTGCTGTGGCTCATCCGACGGTGTCGGTGGTTTTTAAGT AGGTGTCT CAGGTT WAKGERLAVVLVDF GTCAAAAGTGTGAGAAGAGCAGCAAGAATAGCCACTCCATCTCCTGTCA CAATAGTT GGGGGA ARAFDSVSHSHILEIL TATCCCTAAATGTAAGGGAATGACAGAGACCCGGACGGATGTGGAAGG TGAATCAA GAGAGC RQRGLDEHIIGIVGDS TGATCACGGCTGTGATCATTGCCAGGAGAAGTTTACAACGGCTATGGG CAAGGGA CAGCAA YTDVTTTITVSGEQSP GCTGACTCAACACAAGAGACACAGACACATCGTCCAGTATTGTAAGGAG GAGGAAT GAGCGG PIDMRVGVKQGDP AAGGAGGGGGAGATGACAGCAAGAAGGAAGGGTGAAGTCGAAGCAG ACCGACCT AGATGC MSPLLFNLALDPMID TCAAATGGAGCGAATGGGAAGAAAGTGAGGTGGCAAGGTTGAGCGAT GCTCCCTT TTGGTAT TLERYGLGYRMGEQ GGACTGGCTGGGCTAAAAATGATCAACAGGCGAATCGCAGATAGCCTG GGGGCGG ACCAAG QITALAFADDLVLVS GGGACTGGAAAAACTGCGGAACAGGTGAGGCAGAAAAGACGTAGAAT GGGTACTG CTAGCA DSWEGMACNIRILEE GAGACCTGAGAAGGTACGGTGTGACAAACCTAAGGAGGCAAAAGATA GTCTTAGC GAGAGA FCRLTGLRIQPRKCH AGAGCAATCTTATCAAAATGCTGTCCATACCGAGTGCAACACCAACACC CCGGTTCC GGGTTG GFLIQKIQRARSVNLC CCAAACTGGGCTCAAAGGATTCCTCCTTGGAGAACTAAATGGGGTTGCT CCGCAAGT AAGAGG KPWIVCGEELHMVG ACCAAAGGTGAAGTACAGATTGGGGGAGTTACGTTGTCCCTAAGGGGG TTCCTTTG ATGACT PEESVSYLGMKVSP GTAGAGCAAGACTCAGCCTTGCTCAATACGTCGGCCCTTGAACTGCAAC CCTGGGAT ACTGGG WHGIMEPDPVERLC GATTGTTGGGAGGGAGAGCGGGGAGCGCGAATCCGTTAAGCTTACAG GTGCCTGA CTCAGA NWISSIGRSPLKPSQK AGAGAACGTGAGACCACATTGCCAAGTGAAAGAAGGAAAACGAAACA CTGGCTCC GTCATCT VRMLNVYAAPRMTY AGGAGAGTACCGGAGGGTACAAAAGATGTTTCGAAGCAATGAAAAGA ATCCCCTT CACCCTA QADHGGLGPIVLNVL AAATTGCTAAATACATTCTAGATGGCAATGGAGATGGAGAAGCGGCCTC TCCCCATT AAAGGC DGMIRKAVKVWLHL CCCGCCACTCGAGATCGCTTTGGCGTTCAAAAGTAGGTGGGAGGAAGT AGGCACG GGTGGG PLCTCDGLLYSRCQD GGAAACATTTCATGGGCTTGGCCAGTTCTACTCGAGGGGGGAAGCGGA GCTAGATG GCATCG GGLGIVKLACQIPSIQ TGGTGTGGTCTTCCGGTCACTTATCTCAATGAGCGAAGTATGTGAAAAC ACGCACCG GTTGAA ARRVYRLWHSKEAIT CTGGGGGCAATTAAAAACAACACAGCAGCTGGGCCAGATGGGATAACA ATGGGCG CACCTAC RVVTRRTVEAEEYRG AAACCGGCATTGCTTGAATGGGATCCCACTGGTGCGAAACTGGCCGCTA GGTGTGTA CCATACC MWLRAGGSEAGLPP TCTTTTCCATATGGTTGACATCGGGCACCCTGCCTGGGCCTTTTAAGAAG GGTCGCTA GGGATG LEDREEGAVQCTDTA TGCAGAACAACCTTAATACCCAAGACCGATGACCCGATTTTACTCACCCA CCGAAGG GGAGGT GSVKPKNPVIPDWR GGTGGCTGGGTGGAGGCCTCTCACTATCGGGTCGGTGGTTTTGAGGCT GGACTGG GGTAGG RAEFLKWQNLTAQG ATACTCTCGCATCCTGACACACAGGCTGGAGCGGGCGTGTCCCATTAAC GGGTGTCC CCGAAA VGVQVFGGDKNSNH CCGCGCCAGAGAGGATTCATTTCCTCACCTGGGTGTTCGGAAAACTTGA GGTGAACC AAGAAC WMANPETLGSKERH TGATCTTGGGAGGTCTAATCAAGAGAAGTTGGGCGAAGGGCGAGAGG AGGACTTC AGGAAG YIAGLQLRANVYPTR CTGGCGGTAGTGTTGGTTGACTTTGCGCGTGCGTTTGACTCTGTGAGCC CCAAAATG ATGGTG EALSRGRPDLPKVCR ACTCGCACATCTTGGAGATACTCAGACAGAGAGGGCTTGATGAACATAT GTCTCACA GAGTAA QCLAGTESCAHILGQ CATCGGAATCGTAGGTGACTCGTACACCGATGTAACGACCACAATTACA TTTTTAAG GTTGAG CPAVKDSRIRRHHKL GTCAGTGGGGAGCAGTCCCCTCCCATTGACATGAGGGTAGGGGTTAAG CGGCTTGA AGCGGT CDLLASEAESAGWTV CAGGGAGACCCGATGTCTCCGCTACTGTTTAATCTAGCCCTGGACCCAA GTATCGCC TGCTCG IREMCCRTRAGALRR TGATCGACACCCTTGAACGCTACGGCTTGGGGTACAGGATGGGCGAGC CAGTATCC GGAAGT PDLVFVKTGFALVVD AGCAGATCACGGCCCTAGCTTTTGCTGACGATCTGGTTCTGGTGAGCGA TCGCGCGG TATGTTG VTVRYEMAYDTLMG CTCGTGGGAGGGCATGGCGTGCAATATCCGTATTCTGGAAGAATTTTGT CACTGGGA TGATAA AAAEKVARYTPITPYV CGACTGACTGGGCTGAGGATTCAGCCTAGGAAGTGTCATGGTTTCCTCA ACCCAGTC CTCCATT AMTLKARRVKVFGFP TACAGAAGATTCAGAGGGCGAGATCGGTAAACCTCTGTAAGCCCTGGA AACCGCTC AAGGCC LGARGKWPGSNDRL TAGTGTGTGGTGAAGAACTACATATGGTCGGGCCGGAAGAGTCGGTTT TGTGCCCC GGTGGG LKAMGVGGGRRKQL CCTACCTGGGTATGAAGGTGAGCCCATGGCATGGCATTATGGAGCCAG GGCGCAG CATGGT AKLFSRRALLYSLDVL ATCCTGTCGAACGACTCTGTAACTGGATCAGTTCGATTGGGCGGTCACC GCGGGGG GCGGAT RDFYRAEGETGDLDD GCTGAAGCCTTCTCAGAAGGTGAGGATGTTGAATGTTTATGCTGCCCCG TTTAATGT AATGGA ESVDDHL(SEQID AGGATGACTTATCAGGCGGATCATGGGGGCTGGGGCCAATTGTCCTG CTCCCCGG AACTAT NO:1463) AATGTACTCGATGGGATGATCAGGAAAGCAGTGAAGGTGTGGCTACAC CTTCACCG AAAAAC CTTCCGCTGTGTACCTGTGATGGGCTACTTTACTCTAGATGCCAGGACG GCGCTTCG AATAAA GTGGACTGGGCATAGTAAAATTGGCTTGTCAAATCCCTTCTATCCAAGCT GCGACGAC AAGAAA AGAAGGGTCTACCGCCTGTGGCATTCTAAGGAAGCCATAACGCGGGTA GCAGAGG GACCAA GTCACCCGAAGGACGGTCGAAGCGGAAGAGTACCGTGGGATGTGGCT AGCACCCG AAAAAT GAGAGCCGGTGGGAGTGAGGCAGGTTTGCCTCCCCTGGAAGATAGGG GAGGCCCC GTTCTGT AAGAAGGTGCTGTACAGTGTACAGACACTGCCGGTTCGGTGAAGCCGA CATGAACT TATGAT AAAACCCAGTCATTCCCGATTGGAGGCGAGCTGAGTTCCTCAAGTGGCA TAAACCAA GCCTTAC AAACCTGACAGCGCAAGGGGTTGGAGTGCAGGTCTTTGGCGGTGACAA CCTATCTT ACATGT AAACAGCAATCACTGGATGGCGAATCCGGAAACGTTGGGATCGAAAGA GAAATATG CTGGGA GCGCCACTATATTGCAGGTCTACAGTTGAGGGCCAATGTATATCCAACT GCCTCTCG GACCCC CGCGAGGCACTGTCCAGGGGCAGGCCGGACTTACCTAAAGTCTGCCGG TTCGGGTG ATAAGG CAATGCCTAGCAGGAACTGAGTCTTGCGCGCATATTCTCGGGCAGTGTC AAGGGCA GTCTCCC CTGCAGTGAAGGATAGCCGCATCAGGCGGCATCATAAACTGTGTGACCT GGTGGGA CTTATAC GCTAGCAAGTGAAGCCGAAAGCGCCGGATGGACCGTCATCAGAGAAAT AGAGAGG TTCACTG GTGTTGCAGAACTCGTGCCGGAGCTTTGCGGCGTCCAGACCTGGTGTTT GCTGCCTC GGAAAC GTGAAAACCGGTTTTGCTTTGGTGGTGGATGTTACTGTGCGGTACGAGA ACGATAAA CCCATA TGGCCTATGATACGCTCATGGGTGCGGCTGCCGAGAAAGTTGCTCGGTA CACCTAGT AGGGTA CACCCCAATTACTCCATATGTTGCGATGACCCTGAAGGCAAGGAGAGTC CAATAGCC TCCCCCT AAGGTGTTTGGCTTTCCACTGGGAGCCCGAGGCAAATGGCCGGGAAGC AGTCGGG ATATTTA AACGACCGGCTGCTGAAGGCTATGGGTGTTGGTGGCGGCAGGAGGAA AAAAATGT CTGGGA ACAGCTGGCCAAGTTGTTTAGCAGGAGGGCGCTCCTGTACTCCTTAGAT GGAATGTT GACCCC GTCCTTAGGGACTTCTACCGGGCGGAGGGAGAAACGGGGGACTTGGAT AGGACAG ATAAGG GATGAGAGCGTGGATGATCATCTATAGATCCGTTTGTTATGATTGGAGG GGAGGTA GTCTCCC GAGCCTGCCGAGTGGTATGAGCGCTCCAACTATTGAACCCATATGATTC AGGGAGG CCTATA CCGAGGCCTGGCCAGACGCCTAGATGCCTGCCACAATTGAACGCAGCCC CGGCTTTT GATGTA TAGCTTGCTAGGAGATCCATAGGAACTGGCCTATGGGGCGTCATGACG TCGTAAGG GAGCGT GTTGAAGTTCCTCCATAGCGTGCTTGGGAGGGGACGACAATGACGAGT CTCCTTCA AAGGGG CATGACGTACCGAGAGAACCCCAACCCAGGTTGGGGGAGAGAGCCAGC ACCCCTAC TCTCCAA AAGAGCGGAGATGCTTGGTATACCAAGCTAGCAGAGAGAGGGTTGAA CTGTAGTC AGTACC GAGGATGACTACTGGGCTCAGAGTCATCTCACCCTAAAAGGCGGTGGG CACCTATT GGCCGA GCATCGGTTGAACACCTACCCATACCGGGATGGGAGGTGGTAGGCCGA GCAGGTGT TATGGC AAAAGAACAGGAAGATGGTGGAGTAAGTTGAGAGCGGTTGCTCGGGA TGACAACA CTTATG AGTTATGTTGTGATAACTCCATTAAGGCCGGTGGGCATGGTGCGGATAA TGCAAGAT GCAAAC TGGAAACTATAAAAACAATAAAAAGAAAGACCAAAAAAATGTTCTGTTA GACCTGCC TCTGGT TGATGCCTTACACATGTCTGGGAGACCCCATAAGGGTCTCCCCTTATACT TCGTTACG GGTAGG TCACTGGGAAACCCCATAAGGGTATCCCCCTATATTTACTGGGAGACCC GGTCGCGT GACAAG CATAAGGGTCTCCCCCTATAGATGTAGAGCGTAAGGGGTCTCCAAAGTA ATCATTGC GAGGTA CCGGCCGATATGGCCTTATGGCAAACTCTGGTGGTAGGGACAAGGAGG TACAGGTC AGGGCA TAAGGGCAGTGCCAACCCCTACTTGATCGGGACCATCCAGGGAATGCCA GTGTGCCG GTGCCA TCCTCCCGCGAAGGTGATGTGGTGAGGTAAGGGGGGAGCCCGTCTTCG CTTCTAAG ACCCCTA AGTTTCCCCAACCCCTACCCACAGGTGAGAGGAGGAGAAGAGGAATCT AGGATAGT CTTGATC GTCCCCAACGGGAGGAGGGTGAGGTGTAAGGGGGAGACCTTCTAGTA AAGGAGA GGGACC GGGTCTTCTCAGTCGCCTGACGTCCTGACTGTGGGGTGGATCAGTACCC GGTTATAG ATCCAG TACAGGTGAGACCGGTGAGGTAAGGGTGTGGCCCTCTTGAGGGCTGCG GGAGGTCC GGAATG CCAACCCCTACTCGAGGTAACCTGAGGGAGTGGTGGAATGGCGGCATG TGTTAGGG CCATCCT TTAGTGCTGGGACTTGATTGCGAGGGTTTAATGAGAGTGGCCTGCTGA CTTCCTCA CCCGCG GAGCAACACTTGTGGTGCTTAAAGCGGGGCGGCCCATGACCACCGTGA ACCCCTCT AAGGTG GATAGGACACTGCACAGTGCAGCCATGAGGTTCCTGGAGGATGATGCG CTATGCGA ATGTGG ATGAGGTGGGGGCCTCATCAGCCCCTCCTGGCAGGGCGTCGGCCAGGG TTCCTTAC TGAGGT AAACTAAATGTCTCTAGCATGTCAGTGCAGTGAGGTAAGGGGAGAGCA AGGAGTG AAGGGG CTCTAGTAGGGCTCTTCCAACCCCTACCTGTAGGTCACCTGGTCCAGGTG GATCGAGA GGAGCC TCGATGATGTGAAAACAAGAGCTACTTTGGTACCGGTCTGTTGCAAAAA AGTCCCGG CGTCTTC GGGTTCTGCAGAGGACGACGGCTATCCCTATCGGGAGGGAATAGTCGG ACGTAATA GAGTTT TCCCAGGTAGTGGAAAATGGGGCTTTCCACTGAGCATGAAAATGTGGT CACCCTGG CCCCAA AGAGGTTGCGTCCAACCCAATGATTTGCAGCAGAGCTCTTGGACACGAA AGGTAAG CCCCTAC GTCTGTATAGTCCCATGCAGGCAGCCAACCAGAGAATGGTGGCAAGAC GGAGTGG CCACAG CCCAGCTCCGTATGGGAGGGGAGGGCCAAGATATACGGAACGGCTGCT CCTTCTAG GTGAGA AAAGCGTTCTGCCGGTGTCAGTCTAATCACAGACAGCTGTGACGAAACA TAGGGCTG GGAGGA AAGTATGGGTTCCGACATGCTTGGTCAGCTCTTAGCCGCAAGGCTTAAA CTTCAACC GAAGAG TCGAACGCAGCCCGCCGAGAGTGAACATTAAACGGGGATGGAATGTGT CCTCTGAT GAATCT CTAGCGGTTACGTACTACCAGGGCTCAGGTTCGCCTGAGCCGAGGCTCT GGGAGTG GTCCCC ACACGTCATGGTGGGAGTTCTCCCCACGCTCGTGAGGGCATGTAGTGG TACCGGGA AACGGG GATGGCATGTGGCGGACCATCAGCTGGCACTACCAGGCCTCGGGCTTG ACCTCGAC AGGAGG CCCGAGTGCGGGACCTCACACATTGTAGGTGTGCTTGTCCCCCCTACGTT TTGTAAGC GTGAGG CGAAGACTTGAGGCGGAGAATACTCATAGGCCCCACGGCAAAGGGACA ACAGGTTA TGTAAG CAACACGGAGGCTTGTGTCCGACGAGCCGTGGACTCCTATAGACAGCCC GTATGGGA GGGGAG GGGATATCACTGGGCACGCTCATACTGAAGAAATTCGATGAACCGGGC GCAGAAG ACCTTCT CTACCGGAGCAAATGCACTCTAATCGCCTTTGTGGGCGACTGTGGCCCC GGGGAGC AGTAGG CTCATGCGAGTGAGGAATATCATAAACTGCAATGGTTCAAAAAGTGATT CGTAATGG GTCTTCT CCTATGGCTCGTCGGGGAGGGCTGACTGGGGCAAGCAAATGATTGAAA GCTTCTCT CAGTCG GGGGAAGAACCTTTTTCAACTGTTTCTTGCCAAGCCCGGTTGATGGTGG TTCACCTG CCTGAC CGCTAGTAATTGCGACGGGAAAATGCGGTTTAAGTCTCCGAAGTAGTGC CTTACATA GTCCTG GTAGCACCGGATGTCGACCGGGTGTAAAAGCCCTTCGTAAAGTCCCTGG ATACCTGT ACTGTG GGAGGTCAGTCCTGGGGCTACTGATGCGCAGTATGTAATTCGCAGAAT GGTGCATG GGGTGG AGGGCCATCGATACCGCCTGCGTGACTCGACTGGGTTTCCACTTGAGGA TATCTAGG ATCAGT TATCCGACCGTAGCGTGCACCCTCTTGTAGTTGCGCCGGAAACGGCTGT TCTTGGCG ACCCTAC GTTCCCTCACGTATGTGAGGAAACTCAACAATGTGAGTGGGTAAACGGC GGAGAGT AGGTGA GGGACGAACTATGGCTCTCGT(SEQIDNO:1095) ACTGAGAG GACCGG ACAAGGTT TGAGGT GAGACCCC AAGGGT AAGATTGG GTGGCC GTCTCCCT CTCTTGA AGCCTCTA GGGCTG TAGCTGCG CGCCAA ACTCTTAG CCCCTAC CGGGGAT TCGAGG ATGGAGTA TAACCT ACATGTAC GAGGGA CAAGGGA GTGGTG GTGAATAA GAATGG TAAAGGAA CGGCAT TTGACGGG GTTAGT GTACAAGT GCTGGG GACTGTTG ACTTGAT GCCCGAAT TGCGAG CCTAGCCA GGTTTA CTTGATGA ATGAGA CCAGGGAT GTGGCC ATATTACA TGCTGA GACTGAGA GAGCAA GCGAATCT CACTTGT AGAGACGT GGTGCT GAGAATAA TAAAGC AGTGAGA GGGGCG ATTGGTTG GCCCAT AGCGAATA GACCAC CAGAGGA CGTGAG AG(SEQID ATAGGA NO:1218) CACTGC ACAGTG CAGCCA TGAGGT TCCTGG AGGATG ATGCGA TGAGGT GGGGGC CTCATCA GCCCCTC CTGGCA GGGCGT CGGCCA GGGAAA CTAAAT GTCTCTA GCATGT CAGTGC AGTGAG GTAAGG GGAGAG CACTCTA GTAGGG CTCTTCC AACCCCT ACCTGT AGGTCA CCTGGT CCAGGT GTCGAT GATGTG AAAACA AGAGCT ACTTTG GTACCG GTCTGTT GCAAAA AGGGTT CTGCAG AGGACG ACGGCT ATCCCTA TCGGGA GGGAAT AGTCGG TCCCAG GTAGTG GAAAAT GGGGCT TTCCACT GAGCAT GAAAAT GTGGTA GAGGTT GCGTCC AACCCA ATGATTT GCAGCA GAGCTC TTGGAC ACGAAG TCTGTAT AGTCCC ATGCAG GCAGCC AACCAG AGAATG GTGGCA AGACCC CAGCTC CGTATG GGAGGG GAGGGC CAAGAT ATACGG AACGGC TGCTAA AGCGTT CTGCCG GTGTCA GTCTAAT CACAGA CAGCTG TGACGA AACAAA GTATGG GTTCCG ACATGC TTGGTC AGCTCTT AGCCGC AAGGCT TAAATC GAACGC AGCCCG CCGAGA GTGAAC ATTAAA CGGGGA TGGAAT GTGTCT AGCGGT TACGTA CTACCA GGGCTC AGGTTC GCCTGA GCCGAG GCTCTAC ACGTCA TGGTGG GAGTTC TCCCCAC GCTCGT GAGGGC ATGTAG TGGGAT GGCATG TGGCGG ACCATC AGCTGG CACTACC AGGCCT CGGGCT TGCCCG AGTGCG GGACCT CACACA TTGTAG GTGTGC TTGTCCC CCCTAC GTTCGA AGACTT GAGGCG GAGAAT ACTCATA GGCCCC ACGGCA AAGGGA CACAAC ACGGAG GCTTGT GTCCGA CGAGCC GTGGAC TCCTATA GACAGC CCGGGA TATCACT GGGCAC GCTCAT ACTGAA GAAATT CGATGA ACCGGG CCTACC GGAGCA AATGCA CTCTAAT CGCCTTT GTGGGC GACTGT GGCCCC CTCATGC GAGTGA GGAATA TCATAA ACTGCA ATGGTT CAAAAA GTGATT CCTATG GCTCGT CGGGGA GGGCTG ACTGGG GCAAGC AAATGA TTGAAA GGGGAA GAACCT TTTTCAA CTGTTTC TTGCCA AGCCCG GTTGAT GGTGGC GCTAGT AATTGC GACGGG AAAATG CGGTTT AAGTCT CCGAAG TAGTGC GTAGCA CCGGAT GTCGAC CGGGTG TAAAAG CCCTTCG TAAAGT CCCTGG GGAGGT CAGTCCT GGGGCT ACTGAT GCGCAG TATGTA ATTCGC AGAATA GGGCCA TCGATA CCGCCT GCGTGA CTCGACT GGGTTT CCACTTG AGGATA TCCGAC CGTAGC GTGCAC CCTCTTG TAGTTG CGCCGG AAACGG CTGTGTT CCCTCAC GTATGT GAGGAA ACTCAA CAATGT GAGTGG GTAAAC GGCGGG ACGAAC TATGGC TCTCGT (SEQID NO: 1341) R2 R2- Tribolium AGTCATAGAGCCAGAACCTCCTCGTGGTCCCGCTGGGCACAGGGATTAA AGTCATAG TGAAAA MSRRPGKSNEPPVRS 1_TCas castaneum TTTTTCTGTGGCAAATTTGACTGGCTTCAGAGAGCGTTTTTCGAAGTGG AGCCAGAA GAGTGG RAMGLTTLSGTKTSN WCTGTGTGACTGCGTTCCCCCCTTAGTTGCTATWTCCGCTKMGATTAAC CCTCCTCG CAGTTG SGAQGPSTSAPMQN ATCTCACCTCGACGTWTAAGATCATTATGTCGAGGCGCCCGGGGAAGT TGGTCCCG TGGAGA MAGGFVCDCGRSYA CTAATGAGCCCCCGGTTCGTAGTAGGGCTATGGGTTTAACCACTTTATCT CTGGGCAC CTTCCTT LKTSLARHKKECGKN GGCACCAAGACATCAAATTCTGGTGCCCAGGGGCCCTCTACCAGTGCCC AGGGATTA CGGACG NAECRWCGTRFNTL CCATGCAGAACATGGCTGGGGGTTTTGTTTGTGACTGCGGCCGGAGCTA ATTTTTCT TCGGGT AGTRQHERKAHFVQ TGCCTTGAAGACATCTCTGGCCCGACACAAGAAGGAGTGCGGAAAGAA GTGGCAAA CGGATT YQSDLAKALPQPESE CAATGCGGAGTGTCGATGGTGCGGGACAAGGTTTAACACCTTGGCTGG TTTGACTG TTCGGA LMEKIAIVEARSXNGI CACTCGTCAGCATGAGCGAAAGGCTCACTTTGTCCAGTATCAGTCTGAT GCTTCAGA CGCCAG FYKEMMASTGLTHQ CTGGCAAAGGCGCTCCCCCAACCTGAGTCGGAGCTTATGGAGAAAATC GAGCGTTT GGTACC QVRSRREKPEYKGFL GCCATTGTGGAGGCCAGGTCCAKTAACGGCATCTTCTATAAGGAGATGA TTCGAAGT TCCACC ERARRSLAQTNIRAG TGGCCTCCACGGGTCTGACCCACCAACAGGTCAGATCAAGGAGAGAGA GGWCTGT GCTGGG SISPASTXAGSLESAS AACCCGAATATAAGGGCTTCCTGGAGCGAGCACGCAGGAGCCTGGCCC GTGACTGC TTCACAA PKAGCSSSASPGPTT AGACCAATATCAGGGCTGGCTCCATCTCACCAGCGAGTACGAWCGCAG GTTCCCCC ACTAGG RSRAPTKGVPXRSSN GCTCTTTGGAGTCTGCGTCTCCGAAAGCGGGGTGTAGTAGCAGTGCCTC CTTAGTTG CGAACA SARIVVEAQVHTRAP GCCCGGACCGACCACCAGGTCCAGGGCACCTACCAAGGGTGTGCCCCW CTATWTCC TCTGCC PNTGETEVALRESRR ACGAAGCTCTAATTCGGCGAGAATTGTGGTGGAGGCCCAAGTCCATACC GCTKMGA GATACC TVPRLGXNPSRPCGIS CGGGCGCCACCCAATACCGGGGAAACGGAAGTGGCTCTTCGTGAGTCA TTAACATC CTCTTTA PLMAIAIDEDSVLGG CGCCGAACCGTGCCCCGGCTCGGGCMTAATCCATCTAGGCCCTGTGGT TCACCTCG GGTCAT LRVQAGPSPTAVHSV ATCTCTCCTTTGATGGCGATTGCCATAGATGAAGACTCTGTGTTGGGCG ACGTWTA AGGACC EAFPGTSSMTPMET GGCTACGAGTGCAGGCCGGGCCAAGTCCCACCGCGGTGCACTCCGTCG AGATCATT ACATGT DRVHNKSGIDPILEH AAGCTTTCCCCGGAACGTCCTCGATGACCCCGATGGAGACCGATCGGGT (SEQID CTCTGCA NGTRQVRREESSTRE ACATAATAAGTCCGGTATAGACCCCATTCTCGAACATAATGGGACACGG NO:1219) CGAGAT DPVEQWSPNYPKTP CAGGTTCGAAGGGAGGAGTCTTCCACACGGGAAGACCCCGTCGAGCAA TAACCC VTMPNITTTADAXXT TGGTCTCCCAACTATCCTAAAACGCCTGTCACTATGCCCAACATTACAAC A(SEQ SYNRTPQTLPGNRRR TACTGCTGATGCTWCGTKCACCTCGTACAACCGAACACCTCAGACACTC IDNO: RSRSLPPVQRKSASD CCTGGAAACCGGAGGCGACGCAGTCGCAGTCTACCACCAGTGCAACGG 1342) XLESVDSLGPWAVFL AAATCCGCTTCCGACGAWCTCGAGTCCGTTGATTCGCTGGGGCCCTGG QDQVDAGSLSGNDS GCGGTCTTCTTGCAAGATCAAGTAGACGCCGGGTCACTAAGCGGCAAC LADLVRVALTKSDRG GACTCGCTGGCGGATCTGGTGAGGGTGGCTCTCACTAAGTCCGATCGC VLNDAVNRYLAQRA GGTGTCCTTAATGATGCCGTGAATCGGTATCTGGCCCAAAGGGCGGAG ESLRIRKRGSKGKRKS AGCTTGAGGATACGTAAGCGAGGCAGTAAGGGCAAAAGAAAGAGTAA KTGRHYGQTTSGSG GACTGGTCGCCACTATGGCCAAACAACATCTGGCAGTGGTCAAAGAGCT QRAALFKKHQDLFLK GCGCTCTTTAAAAAGCACCAGGACCTCTTTCTGAAAAACAGGCGAGGAT NRRGLAETILSGKEDF TGGCGGAGACTATCTTGTCGGGGAAGGAGGATTTTGGTCCTAGACCTG GPRPEPPVTSVEEFY AACCTCCCGTAACAAGTGTGGAAGAGTTCTACGGCGGCATTTTCGAGTC GGIFESPSPPDNEPFE GCCTTCTCCTCCCGATAACGAACCCTTCGAAGTCCGTGCTACTGGCGTCG VRATGVEDPPPLTSP AAGACCCCCCCCCACTTACATCACCATGGACGAAATAAAGGCGGCAAGA WTKRTLRSPCYWRR GCCGGTTGGCAGATTTCAGCGCCTGGATCTGACCAAATACCGGTCGCGG RPPPTYITMDEIKAAR CCGTCAAAACTATGTCCGAGCTGGAACTGGCGATACTGTTCAATATTAT AGWQISAPGSDQIPV ATTGTTCAGAAACGTCCAGCCTTCAGCCTGGGGGGTACTTCGTACGACG AAVKTMSELELAILFN CTGGTTCCCAAAGATGGGGATCTTAGAAATCCGGCCAACTGGCGGCCCA IILFRNVQPSAWGVL TTACWATCAGTTCTGCCCTGCAGCGTCTTTTGCATAGAGTATTGGCGGC RTTLVPKDGDLRNPA CCGTCTTTCGAAGTTGGTCTCGTTGAGTTCGTCTCAGCGGGGCTTTACCG NWRPITISSALQRLLH AGATTGACGGTACTCTAGCGAATGCGCTGATACTGCATGAATATTTGCA RVLAARLSKLVSLSSS GTATCGCAGGCAGACTGGCCGCACCTACCASGTCGTGTCTCTGGACGTC QRGFTEIDGTLANALI AGAAAGGCGTTTGACACGGTTTCCCACTGCTCTGTTAGTAGAGCCCTGG LHEYLQYRRQTGRTY GTCGTTTTGGTATCCCCTCTGTTATCAGGGAATACATCCTGGCCACATTT XVVSLDVRKAFDTVS GGCGCACAGACGACCATAAAATGTGGGAGCGTTACGACAAGACCGATA HCSVSRALGRFGIPSV AGGATGCTCCGTGGAGTGAGACAGGGTGACCCTCTTAGCCCGGTCCTAT IREYILATFGAQTTIKC TCAATCTTGTGATGGATGAGCTTCTGGAAAAGGTCAATGAGAAATACGA GSVTTRPIRMLRGVR GGGCGGCTCACTGCAAAGCGGCGAACGCTGTGCCATCATGGCGTTTGC QGDPLSPVLFNLVM TGACGACCTCATCCTGATTGCTGACCGAGATCAGGATGTACCCGCAATG DELLEKVNEKYEGGSL TTCGATGATGTTTCGACATTTCTCGAGCGGCGCGGGATGTCTGTCAACC QSGERCAIMAFADDL CTGCCAAGTGCAGAGCGTTGATTGCGGGCGCGGTGAGCGGGAGAAGT ILIADRDQDVPAMFD GTTGTGAGGACAGGCAGCTCTTATAAGATCCATAACACGCCGATTCCAA DVSTFLERRGMSVNP ATGTCGACGCCCTAGATGCATTTAAGTATCTGGGGCTAGAATTTGGCCA AKCRALIAGAVSGRS CAAAGGCGTAGAGCGCCCCACAATCCACAACCTCTCCGTCTGGCTTAAC VVRTGSSYKIHNTPIP AACCTCAGACGAGCCCCTCTCAAACCGGACCAAAAATGCTTATTCATCCG NVDALDAFKYLGLEF GCAATATGTCATCCCTCGATTGCTCTACGGCATGCAAAATCCTCAAGTTA GHKGVERPTIHNLSV CCTCTAGGGTACTTCGAGAAGCTGACCGTCTGATAAGGAGACACTTAAA WLNNLRRAPLKPDQ GACATACTACCATCTTAATGTCCATACTCCAGATTCTCTTATTCACGCGA KCLFIRQYVIPRLLYG GTGTGAGTGACGGAGGTCTGGGTATAATGGAATTGCGTAAAGCAATTC MQNPQVTSRVLREA CAAGAATTTTCCTAGGACGGCTAGTTAAGCTTCTCAACAAAAACAAAGA DRLIRRHLKTYYHLNV TTCGGTCTTAAGTTCGGTCCTGCAGTCTAACCGTGTGAGAACACTAATG HTPDSLIHASVSDGG GGCAAATTATCAACCATGGCGGGTGAAGTACCGGAGTCGACGTTTTGG LGIMELRKAIPRIFLG AGAAATCAGATTGCCTCGGGTCCCCTTTCGAAGGGTCTCGAACAGGCGG RLVKLLNKNKDSVLSS CCGAGGACTCCGCAAGTCGACTCTGGATCAGCGAGAAACCTTCCGGTTG VLQSNRVRTLMGKLS GTCGGGTAGAGATCATGTCCGTGCAGTCCAACTTCGCACGGGCAACCTC TMAGEVPESTFWRN CCGACCAAAGCCATTCCATCGGTGCCGGTTGGTCAGAGACGCTGCAGAC QIASGPLSKGLEQAA ACGGATGTGCCTGCGATGAATCGATATCGCATGTGCTGCAGATGTGCCC EDSASRLWISEKPSG CCTGACGCACGCTGATCGGATTCGCAGGCACGATGAAGTGGTTAAGAA WSGRDHVRAVQLRT AGTAGCCCGCCATTGCACCTCACGCGGTTGGACGGTGGAAGTCGAGCC GNLPTKAIPSVPVGQ ACACATTCGCTCTAGGTGTGGTAGGCTTTTTAAGCCTGACCTGGCAGTCC RRCRHGCACDESISH ATCAGCCAGGCGGCGCCATTGTGATAGCGGATGTCCAGGTATCATGGG VLQMCPLTHADRIRR ATTCCGAAAGCTTGACGGTACCTTACGAACGCAAGCGAGCTAAGTACGA HDEVVKKVARHCTSR CGTGCCCCAATTCCATCAGGCGGCACAACATGCATGGCCGGGTAAGGC GWTVEVEPHIRSRCG ACTTACGTTTGCCCCTGTAATAGTGGGCGCCCGGGGCATATGGCCCCGG RLFKPDLAVHQPGGA ATAAACAACGACCGCTCTGCGGCGTTGCAGATTCCTCCGGTCGTGCGAC IVIADVQVSWDSESL GAGCTTGCGTAAATTCGGTTGTCAAGTGGGGGTCGTCGATCCATGCTAC TVPYERKRAKYDVPQ CTTTATGAGGTCAGTATGGGCAAATCGACTCAACCCCCGACCCTTAAGG FHQAAQHAWPGKA GCTTGAAAAGAGTGGCAGTTGTGGAGACTTCCTTCGGACGTCGGGTCG LTFAPVIVGARGIWP GATTTTCGGACGCCAGGGTACCTCCACCGCTGGGTTCACAAACTAGGCG RINNDRSAALQIPPV AACATCTGCCGATACCCTCTTTAGGTCATAGGACCACATGTCTCTGCACG VRRACVNSVVKWGS AGATTAACCCA(SEQIDNO:1096) SIHATFMRSVWANR LNPRPLRA(SEQID NO:1464) R2 R2- Tinamus CTGGGGACCGTGGTTACAACCCGGGCTTAGCTGCAGAGACAGTACCTCC CTGGGGAC TAGGGG MGSWIVNFVSVATQ 1_TGut guttatus CCGTGGTTCCCGCCGGACCCCGTAACATCGGGTGACTGAATCTGTCTCT CGTGGTTA GCTTGG TGEFPVDTARRAPVP GCCCCGGGAGTAGTTCCTCCTTGCCCTATTGACCAGCGGTCGCCGGCTG CAACCCGG CATTTCT VTSYPESECHXPLPLT CTCAATAGTATTCTAGGCGTGAAATATAGCGATAGTCCTAGTGGTTGTCT GCTTAGCT CATTGCC FCNSDVTIWGGVRPE TACTGGGCCATAGCCCCTTGCTTCAGGGGTCATTCGCGAAGTCTCTCAG GCAGAGA TGCTCCT PVDCLGDLPEXYDAL GAGAACTGGGGGTGGTGTTCTTCTGGGTATAGCTAAACCCCCTAGACTG CAGTACCT GAAAGG PGVAGPREXVGGSP TGTCCGATCCATGGGGTCCTGGATCGTGAATTTCGTTTCGGTGGCGACT CCCCGTGG ATATGG PGEGVRSPGIASPSG CAGACGGGAGAATTCCCTGTGGATACGGCCAGGAGGGCACCTGTGCCG TTCCCGCC GTCCTG TAVQHDFGSPILVPG GTAACATCATACCCTGAGTCGGAATGCCACNTACCGTTGCCCCTGACATT GGACCCCG CGTCGC AEAAEVSTPVVKVPQ TTGTAACTCGGATGTGACTATTTGGGGAGGGGTTCGCCCTGAACCGGTG TAACATCG GTGGTA DHPACPCCGTRVVK GACTGCTTGGGWGATCTTCCRGAGGYGTATGATGCACTCCCAGGGGTG GGTGACTG GGCAGA VTALSEHLRRAHGRK GCTGGGCCTCGGGAAYCGGTGGGTGGGAGCCCGCCGGGRGAAGGGGT AATCTGTC CCCATTC RVLFQCSRCGRMNE CAGGTCGCCAGGGATTGCGTCRCCCTCTGGTACTGCGGTCCAACATGAT TCTGCCCC GTCCGA KHHSIACHFPKCRGP TTTGGGAGTCCCATCCTCGTACCGGGAGCCGAAGCCGCCGAGGTNTCTA GGGAGTA GTAGGG PVEEGPLGAPEWCCE CCCCGGTAGTGAAGGTTCCNCAAGACCATCCAGCATGTCCNTGCTGTGG GTTCCTCC GGCTTG ECGQKFNTKSGLSQH TACGAGGGTGGTGAAAGTAACGGCGTTGTCAGAACATCTTAGGAGGGC TTGCCCTA GCAGTN KRSVHPLTRNVERIEA CCACGGTCGGAAACGGGTCCTATTTCAGTGCTCCCGATGCGGGAGGAT TTGACCAG TCCATTG ARPKGKGKRGAHKG GAATGAGAAACATCATAGCATCGCGTGCCATTTCCCGAAGTGCCGGGG CGGTCGCC CCTGTG CWTEAEVAQLIELEG GCCCCCAGTTGAGGAGGGTCCCCTGGGTGCACCCGAGTGGTGCTGTGA GGCTGCTC CCCGAA RFKNQRFINKLIAEHL GGAGTGCGGGCAGAAATTTAACACCAAAAGCGGCCTGTCTCAGCACAA AATAGTAT AGGACG PSKSAKQISDKRRQL AAGATCTGTGCACCCACTTACGAGGAATGTGGAACGGATAGAGGCAGC TCTAGGCG TGGGTC AAATKTSSPEKRVTSS TCGTCCGAAAGGAAAAGGGAAGCGTGGTGCCCACAAAGGCTGTTGGAC TGAAATAT ATCTGG TSGESSPEVEKVEGIK CGAGGCGGAGGTGGCCCAGCTGATTGAACTGGAGGGGAGATTCAAGA AGCGATAG TCTGTCT REYRRRVGEWLCAG ACCAGCGATTTATCAACAAGCTGATCGCGGAGCATTTGCCATCGAAATC TCCTAGTG GCCTAC SLXDQTSFQKILEDVE GGCGAAGCAGATCAGCGATAAGAGAAGGCAGCTGGCGGCAGCGACCA GTTGTCTT ACCTCTC SGSEIVTGPLEELASF AGACATCGTCGCCCGAGAAGAGGGTAACGTCATCAACGAGTGGGGAGT ACTGGGCC TAGACTT ARGKLAAARVRHHR CCTCCCCTGAAGTGGAAAAGGTGGAGGGTATCAAGAGAGAATATAGAA ATAGCCCC GTAACA KHPAEAVPAREEQR GGCGTGTTGGAGAGTGGTTGTGCGCTGGGTCCCTGMAGGACCAGACN TTGCTTCA TCTAGTC WMKRRVGRRGLYLR TCGTTCCAGAAGATCTTGGAAGATGTGGAGAGCGGCTCCGAGATTGTC GGGGTCAT TGTCAA FQRLFALDRRKLAGII ACCGGTCCGCTGGAGGAACTGGCCTCCTTTGCGAGGGGGAAGCTCGCG TCGCGAAG CAAGAT LDDVESIKCPLPMEE GCAGCTAGAGTGCGACATCATCGTAAGCACCCAGCTGAGGCCGTGCCT TCTCTCAG CAAAAT VADVFRRRWEEVAP GCGCGAGAGGAGCAGAGGTGGATGAAGCGCAGGGTGGGTCGTCGGG GAGAACTG TCTTCAC FTGSGSFRSLGKADN GCTTGTACCTCAGGTTCCAGCGGCTTTTTGCNTTGGATCGCAGRAAGCTT GGGGTGG ACAGAC GAFKPMISAKEVMK GCTGGGATCATCCTCGACGATGTCGAGTCCATCAAGTGCCCCCTTCCGA TGTTCTTCT GACCGA NVXEMSRRSAXGPD TGGAAGAAGTCGCTGACGTCTTCAGGAGAAGGTGGGAGGAGGTCGCC GGGTATAG GCTTGC GLSLRDLMKIDPQGS CCCTTTACCGGCTCGGGCTCATTCCGAAGTTTGGGGAAGGCTGACAACG CTAAACCC TCAGTCT RMAELFNLWLLAGR GTGCCTTCAAGCCCATGATCTCCGCYAAGGAGGTCATGAAGAACGTCRC CCTAGACT TCCTGTA VPDQVKAGRTVLIPK GGAGATGTCTCGACGCTCCGCGCYGGGWCCCGAYGGCCTCTCCCTGCG GTGTCCGA CCCGCA SADPGKIGNIDNWR GGATCTGATGAAGATTGATCCCCAGGGCAGCCGCATGGCTGAATTGTTC TCC(SEQ GAATTTT PITIGSVXLRMFSRILS AACCTGTGGCTGTTGGCAGGACGGGTCCCGGACCAAGTGAAGGCGGGC IDNO: GCTCTT ARLRRACPINRRQRG CGAACGGTCCTGATCCCNAAGTCGGCCGATCCCGGGAAGATCGGGAAC 1220) GCTCTCC FIAAPGCSENLKLLQA ATTGACAACTGGCGGCCCATCACCATCGGGTCCGTTATNCTCAGAATGT TTTGGCT LIKSAKRDHRTLGVVF TCTCTCGGATTTTGAGCGCTAGACTGCGGCGAGCATGCCCCATTAATAG GTGTCC VDLAKAFDSVNHQHI AAGGCAAAGGGGGTTTATAGCAGCCCCTGGCTGCTCGGAAAATCTGAA TGGACG FQVLVQKGVDGHIID GCTTCTCCAGGCGCTCATCAAGAGCGCGAAACGAGATCATAGGACCCTT TGGGAC ILRDLYTNAGTYLESG GGAGTCGTGTTTGTCGATTTGGCTAAGGCCTTCGACTCCGTGAACCACC TATTCCA SQRSGFIKILRGVKQG AGCATATCTTCCAAGTCCTGGTCCAGAAGGGTGTCGATGGGCATATTAT TCTCGTC DPLSPILFNLALDPLLC CGACATCCTAAGAGACCTGTATACCAACGCTGGAACGTATCTGGAGTCA CCAAAT RLEDRGLGYKYGDQ GGTTCCCAGCGATCGGGATTTATTAAGATCCTCAGGGGAGTGAAACAR GCCGCG QIXSLAFADDLALLSD GGGGACCCACTCTCTCCCATCCTGTTCAATCTTGCATTAGATCCTTTGCTG TCCAATT SWEGMQQSIRVVEE TGCCGCCTGGAAGATCGGGGCCTCGGTTATAAGTATGGAGACCAACAA ATACCG FCQRTGLRVQAPKC ATAWCATCGTTGGCATTTGCGGATGATCTCGCCCTGCTCAGCGACTCTT GATTTG HGFLIRPTKESYTIND GGGAGGGCATGCAGCAGAGTATTCGGGTGGTAGAGGAATTTTGTCAAC ACAAAG CDPWTIADMQLDMI GGACCGGGCTGCGGGTTCAAGCGCCGAAATGCCACGGGTTTTTGATCA CGGACG DPGSSEKYLGLGIDP GGCCAACTAAAGAGTCATATACCATCAATGACTGTGACCCGTGGACGAT GCCCGC WIGLSRPELSEVLTR TGCAGATATGCAATTGGATATGATCGATCCGGGCAGTTCCGAGAAGTAT TTTATAA WVKNIGGAPLKPLQ CTTGGCCTAGGGATAGACCCATGGATTGGTCTATCGAGACCGGAACTGT GCCGGA KVDILRSYALPRLLFIA CCGAGGTGCTGACCCGCTGGGTGAAGAACATCGGGGGCGCCCCTTTGA AAAGGT DHAGLSATCLHSLDL AGCCACTCCAGAAGGTGGACATCTTGAGGAGCTACGCCCTNCCAAGGC GCCTTG SIRSAVKGWLHLPPS TGCTGTTCATTGCGGATCACGCAGGCCTGAGCGCCACCTGTTTGCATTCC TAAAATT TCDAIIYVSYKDGGLG CTGGACCTTTCGATAAGATCTGCCGTCAAGGGCTGGTTACATCTACCGC GCAAGG LPRLASLIPNVQARRL CTAGTACGTGTGACGCTATTATTTACGTCAGCTACAAGGACGGCGGGCT TTCATTA VRIAQSEDDVIRSVVL GGGTCTTCCCCGTCTGGCGAGCCTAATTCCAAATGTACAGGCTCGCAGG AATAG QEGIQEEIRKVWISA TTGGTGCGGATCGCCCAATCGGAGGATGATGTCATCAGGAGTGTGGTA (SEQID GGRPEKVPSVTGEFP CTCCAGGAGGGTATCCAGGAGGAGATCCGGAAGGTCTGGATCTCGGCT NO: VMEAQAADEALSEW GGGGGGCGACCGGAGAAGGTTCCATCTGTGACGGGGGAGTTCCCAGT 1343) ERRAPRTIYPIPCKWR GATGGAGGCTCAGGCGGCTGACGAGGCCCTATCCGAGTGGGAGAGGC KREMENWTNLKSQG GAGCTCCACGAACCATCTATCCCATTCCCTGTAAGTGGAGGAAGAGAGA HGIRNFENDRISND AATGGAGAATTGGACCAATCTAAAATCGCAAGGCCACGGGATTCGGAA WLLHYGRIPHRKLITA TTTTGAAAATGACCGAATCAGTAATGATTGGCTCCTGCATTATGGCCGC IQLRANVYPTREFLAR ATTCCCCACCGCAAACTAATAACAGCTATCCAGTTGCGGGCCAATGTCTA GLGEGAPRGCRHCP TCCCACWCGGGAGTTTTTGGCCCGCGGCCTGGGCGAGGGCGCACCCAG AEWESCSHIIGYCPA GGGATGTAGGCACTGTCCCGCGGAGTGGGAATCTTGTTCCCACATAATT VQEARIKRHNDICGV GGCTACTGCCCGGCTGTCCAGGAGGCCAGGATCAAAAGGCATAATGAC LAEEARKLGWVIFIEP ATCTGTGGTGTGCTGGCTGAAGAGGCNAGAAAGCTGGGATGGGTGATA HLRDNTNELFKPDLV TTTATAGAGCCCCATCTCAGAGATAACACCAATGAGCTCTTCAAGCCAG LVKGSCAKVVDVTIR ATTTGGTTTTGGTGAAGGGATCCTGTGCGAAGGTAGTGGATGTAACCAT YESGLTTLSDAAAEKA CCGCTACGAGAGTGGGTTAACCACCTTGAGTGACGCCGCGGCAGAAAA RKYQHLAGEVRALTS GGCTAGGAAGTATCAACATCTGGCAGGGGAGGTGCGGGCCCTAACATC ATTVDFLGFPIGARG GGCCACTACTGTAGACTTCCTGGGTTTCCCTATTGGCGCTAGAGGGAAG KWYVGNNGLLSDLG TGGTACGTTGGTAATAATGGACTCCTTTCCGACCTTGGGTTCTCCACTAG FSTSRVVRIARALSKK CCGTGTAGTGCGGATAGCGAGGGCCCTCTCTAAAAAGGCTCTCCTATCG ALLSSVDIIHIFASRAR TCCGTGGACATTATACATATTTTTGCGTCTCGCGCTAGACAGGCCCAAAC QAQTSE(SEQID GTCCGAGTAGGGGGCTTGGCATTTCTCATTGCCTGCTCCTGAAAGGATA NO:1465) TGGGTCCTGCGTCGCGTGGTAGGCAGACCCATTCGTCCGAGTAGGGGG CTTGGCAGTNTCCATTGCCTGTGCCCGAAAGGACGTGGGTCATCTGGTC TGTCTGCCTACACCTCTCTAGACTTGTAACATCTAGTCTGTCAACAAGAT CAAAATTCTTCACACAGACGACCGAGCTTGCTCAGTCTTCCTGTACCCGC AGAATTTTGCTCTTGCTCTCCTTTGGCTGTGTCCTGGACGTGGGACTATT CCATCTCGTCCCAAATGCCGCGTCCAATTATACCGGATTTGACAAAGCG GACGGCCCGCTTTATAAGCCGGAAAAGGTGCCTTGTAAAATTGCAAGGT TCATTAAATAG(SEQIDNO:1097) R2 R2- Trichinella CTCCTGACTAACCTGATTTCGTCCGTGCGGCGGCGTTTTCTTTTCGCTCTC CTCCTGAC TGAGGT MSNRLANTAAAGGV 1_TSP spiralis CGCTCGTCGAAATTTGCTGTAGTTGATTCGCTTTTCTTTGCGTTTTCTTCT TAACCTGA TTTTGTT PEKTSGTLDIPGQPSS ACTTTCGCAGTTTTTTCTGCATTGCCACGATGTCAAACCGCCTTGCCAAT TTTCGTCC TTCTTTT SGEKRAISYPGPFGC ACTGCTGCGGCTGGTGGGGTTCCAGAGAAAACCTCGGGAACTTTAGAC GTGCGGC TTCCTTT NSCSFTSTTWLSLELH ATTCCTGGCCAACCCTCTTCATCCGGTGAAAAGCGTGCGATCTCTTACCC GGCGTTTT TACCATT FKSVHNIRDFVFLCSK TGGTCCATTCGGTTGCAATTCGTGTTCGTTTACGAGTACGACTTGGCTCT CTTTTCGC CTTGTTC CKKSWPSINSVASHY CATTGGAATTGCATTTTAAAAGCGTCCATAATATTCGTGACTTCGTCTTC TCTCCGCT CATTGTT PRCKGSVKAAVVPTS CTCTGCTCTAAATGTAAAAAAAGCTGGCCATCGATCAACTCCGTAGCTA CGTCGAAA GTTATTT LANTCTTCGSSFGTFS GCCATTACCCTCGGTGCAAAGGTAGCGTCAAGGCTGCAGTTGTTCCTAC TTTGCTGT GCTTTAA GLQLHRKRAHPDVF ATCTTTGGCGAATACGTGCACCACGTGCGGCTCAAGCTTCGGTACTTTCA AGTTGATT TCCTGTA AASCSKKTKARWSN GTGGTCTTCAACTCCATCGGAAAAGAGCACATCCGGACGTTTTTGCTGC CGCTTTTC TTTTACC DEFTLLARLEAGLDPA TTCTTGTAGCAAAAAAACGAAGGCGCGTTGGTCTAACGACGAATTTACC TTTGCGTT GCCGGC CKNINQVLAERLMEY CTTCTGGCGAGACTCGAAGCAGGTCTGGATCCAGCCTGTAAAAACATTA TTCTTCTAC AATTCCA NITRGVEMIKGQRRK ACCAAGTACTAGCGGAAAGGTTAATGGAGTATAACATCACCAGAGGCG TTTCGCAG TTGTTAT DQYKALVRQLRSNSE TAGAAATGATAAAAGGCCAACGTAGAAAAGATCAGTACAAAGCGCTCG TTTTTTCTG TATTACT TQQCVGLAGSMDSN TTCGTCAACTCCGGTCAAATTCTGAAACACAGCAATGTGTAGGTTTAGCC CATTGCCA GTTACT VPANDTSSSVASEVSI GGAAGTATGGATTCGAACGTACCGGCCAACGATACATCGTCTTCCGTTG CG(SEQID GTTATTA TYPEYGAVMSCDLIK CATCAGAGGTCAGCATTACGTACCCTGAGTACGGGGCCGTGATGTCGTG NO:1221) TTGTTAC EATGMAIVDINELQS CGACCTAATTAAAGAAGCGACTGGTATGGCCATAGTTGACATCAACGAG TATTGTT NLRKAFLSGRKLPMK TTGCAAAGCAACTTACGAAAAGCCTTCTTGTCCGGCCGCAAGCTTCCCAT TTTACTT FHGARETAQKKMAN GAAGTTCCATGGAGCGCGTGAAACCGCCCAGAAGAAAATGGCCAACCC TTACTTA PRVAKFKRFQRLFRS CCGTGTTGCGAAATTCAAGCGTTTCCAACGGTTGTTTCGAAGCAACAGG CTACTGT NRRKLASHIFDKASLE AGGAAACTGGCCAGCCACATCTTCGACAAAGCCTCACTGGAGCAATTCG TATTATA QFGGSIDEASDHLEK GTGGCAGCATCGATGAGGCATCTGACCATTTAGAAAAGTTCCTCTCCCG CTTTAAT FLSRPRLESDSYSVISG GCCAAGATTGGAGTCCGATTCTTATTCCGTGATAAGCGGTGATAAGTCA TCGTTAA DKSIGVAHPILAEEVE ATCGGAGTTGCACATCCAATTTTGGCCGAGGAGGTGGAATTGGAATTAA CTTACGT LELKASRPTAVGPDGI AAGCCTCCCGACCAACCGCTGTTGGTCCGGATGGAATTGCACTGGAAGA TATTGTT ALEDIKKLNTYDIASLF CATTAAAAAACTCAATACTTACGACATAGCCAGTCTTTTCAACCTCTGGC ACCACT NLWLKAGDLPASVK TAAAAGCTGGCGACCTACCCGCATCGGTGAAAGCCAGTAGAACCATCTT ACTTACT ASRTIFLPKSDGTTDIS TTTGCCCAAAAGCGACGGCACCACCGACATATCGAACTGTCGGCCAATC TTGCTCT NCRPITIASAMYRLFS ACAATCGCATCCGCCATGTATCGGCTGTTCAGCAGAATAATAACGCGAC CTCGCA RIITRRLAARLELNVR GTCTGGCAGCCAGGTTGGAATTGAACGTGCGGCAAAAAGCGTTCCGGC AACGTT QKAFRPEMNGVFEN CTGAAATGAACGGCGTATTCGAGAACTCCGCCATTTTATACGCCCTCATC CGTTGTT SAILYALIKDAKVRSRE AAGGATGCTAAGGTCAGGTCAAGGGAAATTTGCGTAACTACGCTCGAC GTTTCTT ICVTTLDLAKAFDTVP CTTGCCAAGGCCTTTGACACGGTGCCCCACTCACGCATTTTACGAGCCCT TTGGAC HSRILRALRKNNVDP GAGGAAAAATAATGTCGACCCGGAATCCGTCGACCTGATTTCGAAAATG CAGGTT ESVDLISKMLTGTTYA TTAACGGGTACGACTTATGCAGAAATAAAAGGGCTCCAGGGCAAACTTA TAGAGA EIKGLQGKLIPIRNGV TACCCATTCGCAATGGAGTCAGGCAAGGTGACCCCTTGTCGCCCCTATT AATCGC RQGDPLSPLLFSLFID ATTTAGTCTATTTATAGACGAGATAATAGGTCGCCTACAAGCCTGCGGC ACGCAC EIIGRLQACGPAYDFH CCTGCCTACGATTTCCATGGCGAAAAAATTTGCATCCTGGCTTTCGCCGA AGCGGA GEKICILAFADDLTLV TGATCTGACGCTGGTGGCTGACAGCGCAGCTGGTATGAAGATCCTTCTA ACTGGA ADSAAGMKILLKAAC AAAGCGGCTTGTGACTTCCTGGAGGAATCTGGAATGTCACTTAATGCAG CCGCTTA DFLEESGMSLNAEKC AGAAATGCCGCACTCTCTGTATTACAAGATCTCCCCGAAGCCGCAAGAC AGCCAG RTLCITRSPRSRKTFV TTTCGTCAACCCAGCTGCCAAATTCATCATCAGCGATTGGAAAACGGGT AAATAG NPAAKFIISDWKTGIS ATCAGCTCAGAAATCCCCTCCCTGTGTGCGACGGACACCTTTCGTTTCCT TAAAGT SEIPSLCATDTFRFLG GGGGCACACCTTCGATGGAGAAGGAAAGATCCACATCGATACGGAGGA AACAA HTFDGEGKIHIDTEEI AATTCGATCCATGCTCAAATCGGTGAAGTCAGCTCCACTGAAACCGGAA (SEQID RSMLKSVKSAPLKPE CAGAAGGTGGCTTTGATACGGTCACACCTTCTTCCCCGCCTTCAGTTCCT NO: QKVALIRSHLLPRLQF GTTTTCTACAGCTGAAGCTGACAGCCGGAAAGCCTGGTTGATCGATTCC 1344) LFSTAEADSRKAWLI ATCATCAGGGGGTGTGTGAAGGAGATCTTGCACTCAGTGAAAGCTGGT DSIIRGCVKEILHSVKA ATGTGCACTGATATCTTTTACATACCCTCTAGAGACGGTGGAATGGGAT GMCTDIFYIPSRDGG TTACTTCCCTCGGGGAGTTTTCTCTTTTCAGCAGGCAGAAGGCACTCGCC MGFTSLGEFSLFSRQ AAGATGGCTGGATCGTCGGACCCCCTCTCGAAACGGGTTGCTGAATTCT KALAKMAGSSDPLSK TCATCGAAAGGTGGAACATCGCCCGTGACCCGAAAGTCATTGAAGCTGC RVAEFFIERWNIARD TCGGCGCGTCTACCAGAAAAAACGGTACCAACGCTTTTTCCAGACGTAC PKVIEAARRVYQKKR CAGAGCGGTGGATGGAATGAATTTTCGGGAAACACTATTGGGAACGCC YQRFFQTYQSGGWN TGGTTGACAAACGGCCGTGCCCGCGGAAGAAATTTCATAATGGCTGTG EFSGNTIGNAWLTN AAATTCCGTTCCAACACCGCAGCCACCCGGGCCGAAAACCTACGAGGCC GRARGRNFIMAVKF GCCCCGGCACGAAAGAATGCCGGTTTTGCAAGAGTGCCACCGAAACTTT RSNTAATRAENLRGR GGCACACATTTGCCAGAGGTGTCCGGCAAATCACGGCTTGGTTATCCAG PGTKECRFCKSATETL CGCCATGACGCAGTCGTAACATTCCTGGGGGAAGTGGCGCGGAAGGAA AHICQRCPANHGLVI GGTTACCAGGTCATGATAGAGCCTAAGGTGTCAACCCCGGTCGGCGCG QRHDAVVTFLGEVA CTCAAGCCCGACCTCCTACTCATCAAAGCCGACACTGCATTCATTGTGGA RKEGYQVMIEPKVST TGTAGGCATTGCATGGGAAGGTGGACGCCCACTAAAGCTGGTCAACAA PVGALKPDLLLIKADT AATGAAATGTGACAAGTACAAAACTGCCATCCCGGCAATTTTGGAAACA AFIVDVGIAWEGGRP TTTCACGTTGGCCATGCTGAGACGTACGGCGTTATTCTGGGCAGCCGCG LKLVNKMKCDKYKTA GATGCTGGCTCAAGAGCAACGACAAGGCGTTGGCATCAATTGGGCTCA IPAILETFHVGHAETY ATATCACACGGAAGATGAAAGAACACCTGAGCTGGTTGACGTTTGAAAT GVILGSRGCWLKSND TATATTTATAACTCAAATAAGCCGGATTTATAACTCATTCATGAAAAAAT KALASIGLNITRKMKE GAGGTTTTTGTTTTCTTTTTTCCTTTTACCATTCTTGTTCCATTGTTGTTATT HLSWLTFEIIFITQISRI TGCTTTAATCCTGTATTTTACCGCCGGCAATTCCATTGTTATTATTACTGT YNSFMKK(SEQID TACTGTTATTATTGTTACTATTGTTTTTACTTTTACTTACTACTGTTATTAT NO:1466) ACTTTAATTCGTTAACTTACGTTATTGTTACCACTACTTACTTTGCTCTCTC GCAAACGTTCGTTGTTGTTTCTTTTGGACCAGGTTTAGAGAAATCGCACG CACAGCGGAACTGGACCGCTTAAGCCAGAAATAGTAAAGTAACAA (SEQIDNO:1098) R2 R2- scaffold_6 Tetranychus CTCTCTTATTTTAACATATTCGATGTACTCGTACATTGAATATGCTTTTATT CTCTCTTAT TGATGT MCILGGLTSHSREGG 1_TUr urticae TTTTTTCAAAGTTTTTTGGGTGCATACCCCTGGAAAATTCTGAGATGTAT TTTAACAT ATCCCTT LSRGSSQLKTVKPQN AAATCTCCCATCAGCTTTGGCTGAAACGTTGGCTAAGTTTTGTAGGTTGT ATTCGATG CAATAT EEDNGTTQLKAGSA TTGCCCCCTACTACTTAGTCGCAAATGGTATTTGCTAACAGTTGTTAAAT TACTCGTA ATTGTA DSFPRPSGDLNPEEP TGTTACATTTACAAGTCCTATCCAGTGCCTCCTCGTGGCGCTACCCGGTA CATTGAAT ATCCTCA LSIDICPVCFRQMKSY ACACTTAGAGTAATCTGAGTGGCTAAACTGGAAGGGCGGAAAATGCAA ATGCTTTT TTCGTCC LGVRVHMQKMHLEE ACAGGCGGTTGGTAGATGCTTCGGCATTTTGCCAAAAATCCACGGCTTT ATTTTTTTT CTATCCT YNASIPDPVVSHTRW TTAGCCCAACAACATCAGGGTGATGGACCCGCCAGCTTGTGGTCAGGAT CAAAGTTT TTCATTT SDEEAAQLAFTEAKIE CCCATCCATGAATAAAGCATGGCTCTGCTTCTGGTGCATCCTCAACGGG TTTGGGTG GATAAA VDKLLPRGKGINKFLL ATCGGCTTCGGCTGGATGTAAGTCTTGCGGAGGCATGTGCATCTTGGGT CATACCCC AGAAAC ELLPGRTLESIKSHRK GGCCTAACAAGCCATTCAAGAGAAGGAGGGCTCTCGCGAGGGAGTTCT TGGAAAAT TTTGTTG RQSHKDLVRKYVKEF CAACTTAAGACCGTTAAGCCACAAAACGAAGAAGATAATGGAACTACCC TCTGAGAT CTCCTTT VDTLAADNDDDTIIC AGTTGAAGGCTGGGAGTGCGGACTCTTTCCCAAGGCCAAGTGGTGATTT GTATAAAT TAATAG QDNGDIFNDPIVGAT GAACCCAGAGGAACCACTGAGCATTGATATCTGTCCAGTTTGTTTCAGA CTCCCATC TTGGTC DSQSETETVADPAEF CAGATGAAGAGTTATTTAGGCGTTAGAGTGCATATGCAGAAGATGCACT AGCTTTGG CCTCCTG KTFIELADDPTKPKVV TAGAAGAATATAATGCGTCTATTCCGGACCCAGTCGTTTCCCACACTAG CTGAAACG TCCCTTT AKLRNLIKDKPKSEIL GTGGTCCGATGAAGAGGCAGCACAATTAGCTTTTACTGAAGCTAAGATT TTGGCTAA TCTGGA GSDILVRILRRTLHGL GAGGTCGACAAGCTTTTACCTCGTGGCAAAGGCATAAATAAGTTCCTGT GTTTTGTA ACCTGG PVEDELDQYLEVYFT TAGAGCTTTTGCCTGGTCGAACCTTAGAGTCTATCAAGTCTCACAGAAA GGTTGTTT TTGTATC GKIKQRRSKTQTALSK AAGACAATCGCATAAAGACTTGGTTCGCAAATATGTAAAGGAGTTCGTT GCCCCCTA GATTATT KQIKQRDYGRLQELY GATACTTTAGCTGCAGATAACGATGACGATACCATCATCTGTCAAGATA CTACTTAG GAAAGT SRSRKRCANEILNPTS ATGGTGACATTTTTAATGACCCCATCGTTGGAGCAACTGATTCTCAGTCT TCGCAAAT TGCAAT MSGGFGHQELSEFW GAGACTGAAACTGTGGCTGATCCTGCTGAATTCAAGACTTTTATTGAGTT GGTATTTG AAACGG TKTFGPDEQPTLGEV AGCAGACGATCCCACAAAACCAAAAGTCGTAGCAAAACTCAGAAACTTG CTAACAGT ATTTAA EIIPKENCWWDIFSPI ATAAAAGATAAACCAAAGAGCGAGATCCTTGGAAGTGACATTCTTGTAC TGTTAAAT (SEQID SSDEIKASYPSIGKAA GAATCCTGCGAAGAACTCTTCATGGCTTGCCTGTTGAGGACGAGTTGGA TGTTACAT NO: GPDNFSAYQLRKVPV CCAGTACTTGGAGGTTTACTTCACTGGGAAGATAAAGCAGAGACGTTCT TTACAAGT 1345) WHLECLYNIFAFYKDI AAGACACAAACTGCCTTATCTAAGAAACAGATTAAGCAGAGAGATTATG CCTATCCA PSRLKDAKTILIPKKD GACGTCTTCAGGAGTTATACTCCCGGTCTAGAAAAAGGTGTGCAAATGA GTGCCTCC NAESPGDFRPITLSSII GATCTTGAACCCTACTTCTATGAGTGGTGGTTTTGGTCATCAGGAGCTAT TCGTGGCG TRHFHKILATRVNNF CCGAATTCTGGACAAAGACGTTCGGACCAGATGAGCAACCTACTCTGGG CTACCCGG VRFHPMQRGFIQSD CGAGGTGGAGATCATTCCGAAGGAGAACTGTTGGTGGGATATTTTCTCG TAACACTT GCLENTALIQTVIREA CCAATATCTTCAGACGAAATCAAAGCCTCATACCCCAGTATAGGGAAAG AGAGTAAT KVRRKQVHITFCDVR CTGCTGGTCCAGATAATTTCTCTGCTTATCAACTTAGAAAAGTTCCGGTT CTGAGTGG KAFDSVRYDSIIAAIA TGGCATCTGGAATGTCTCTATAATATCTTTGCTTTCTACAAGGACATACC CTAAACTG KKGAPGSFIMYLSNL GAGTAGACTTAAAGACGCGAAGACAATTTTGATACCAAAGAAGGATAA GAAGGGC YRGNKTTLLTAGGET CGCTGAAAGTCCAGGTGATTTCCGCCCTATCACCTTGAGCTCAATTATCA GGAAAAT RITPTRGVRQGDPLS CTCGACATTTTCACAAGATTCTAGCGACGAGAGTGAATAATTTTGTTCGA GCAAACAG PILFNCVMDQILTALP TTCCATCCTATGCAAAGAGGCTTCATTCAATCTGATGGTTGTTTGGAGAA GCGGTTG SRTGFTLSAGDESVN CACTGCGCTCATACAGACGGTCATTAGAGAAGCCAAAGTGCGTAGGAA GTAGATGC VNCLAFADDIILISKTK GCAAGTGCATATCACCTTCTGTGATGTGCGAAAAGCGTTTGACAGCGTC TTCGGCAT NGHQELLDVTQRILK CGGTATGACTCGATCATAGCAGCTATTGCAAAGAAAGGTGCACCAGGA TTTGCCAA ENGLDLNPDKCCSLS AGTTTCATCATGTACCTTAGTAACTTATATAGAGGAAATAAGACTACGCT AAATCCAC LIPHSKTKKIKVVRAD TTTGACTGCTGGCGGTGAAACAAGAATAACTCCAACACGGGGCGTTAG GGCTTTTT FVVNGVKVRSMSIG ACAAGGAGATCCGTTATCTCCGATACTTTTTAATTGCGTGATGGACCAG AGCCCAAC DSTCYLGVSINVTGQ ATTCTCACGGCTCTTCCTTCGAGAACTGGTTTTACGTTATCTGCTGGTGA AACATCAG VAPVKMYQALCEKL CGAATCAGTTAACGTTAACTGTTTGGCGTTTGCTGATGATATAATTTTAA GGTGATG DSAAIKPHQRLYILKH TCTCAAAGACCAAGAATGGACATCAAGAACTCCTTGATGTAACACAAAG GACCCGCC FVITKMFHPLILSTIAA AATTTTAAAGGAGAATGGCCTCGATCTCAATCCTGATAAGTGCTGTTCG AGCTTGTG HKIKNLDLISRRYVRK CTAAGCTTGATACCGCACAGCAAAACAAAGAAAATCAAAGTGGTTCGTG GTCAGGAT WLHLPHDCGSGMIH CTGACTTCGTGGTGAACGGAGTCAAAGTGCGATCAATGAGTATTGGTG CCCATCCA AKVSDGGLGVPLLFR ATTCAACTTGTTACCTAGGTGTCTCAATAAATGTAACTGGCCAAGTAGCA TGAATAAA TIADLKVRRKEKLQV CCAGTTAAAATGTATCAGGCTCTGTGTGAGAAACTTGATAGTGCTGCTA GCATGGCT HENPIFRILAKLSTVSK TTAAGCCTCATCAGAGACTGTACATATTGAAGCATTTCGTAATAACGAA CTGCTTCT ELENCKKIASKTTDIQ GATGTTTCACCCGCTGATTTTGTCAACTATTGCGGCACACAAGATAAAG GGTGCATC EKTFKEMLATYDGLS AATCTGGATTTGATTAGCAGACGTTATGTCAGGAAGTGGTTACACTTAC CTCAACGG LKEARAVPEVHKWV CTCATGATTGCGGATCTGGTATGATTCATGCCAAAGTAAGCGATGGAGG GATCGGCT DSYDKRYKFAGRDFV ATTGGGTGTTCCATTACTGTTCAGGACAATTGCTGATCTCAAAGTTCGGA TCGGCTGG QVIQARFNALPTRSR GAAAAGAAAAGCTGCAGGTCCATGAAAACCCAATATTTAGAATATTGGC ATGTAAGT VWRGRGADEKSLRC AAAGCTATCTACAGTAAGCAAGGAGTTGGAAAATTGTAAAAAGATAGC CTTGCGGA RAGCNARETLNHVS TTCCAAGACGACAGACATTCAAGAAAAGACTTTCAAGGAAATGCTTGCG GGC(SEQ QSCFRTHRVRTARHD ACTTATGATGGTTTGTCATTGAAAGAAGCTCGAGCGGTACCTGAGGTGC IDNO: KILDFICERLDVVGVK ACAAATGGGTGGACAGCTATGATAAGCGTTATAAGTTTGCAGGCCGAG 1222) YVREKPISFPGKKLIPD ATTTCGTTCAAGTTATACAGGCTCGTTTCAATGCGTTACCAACGAGATCT LIVENTDQALVLDLQI CGAGTATGGCGGGGCAGAGGTGCTGATGAAAAATCGTTACGTTGCAGA VGDNSELPLDERGKN GCAGGCTGCAACGCTAGGGAAACGTTGAATCACGTTTCACAATCTTGCT KVIKYNCSEMQELYK TCAGGACGCATCGAGTACGAACAGCTAGACACGACAAAATCCTGGATTT RKKKTLAVKALTLHYK TATTTGTGAAAGGCTGGACGTAGTCGGTGTTAAATACGTACGAGAAAA GLMAPETSNILRSFG GCCAATCAGCTTTCCTGGCAAAAAGTTGATTCCTGATTTGATTGTGGAAA FKSKDLEKMAYMALF ACACGGACCAAGCTCTTGTGCTCGATCTCCAGATTGTTGGCGATAACAG GTVAAWGIFNRSTET TGAACTTCCACTTGATGAAAGAGGCAAGAACAAGGTTATCAAATACAAT MRSVANWPRPEEL TGTTCGGAGATGCAGGAATTGTACAAGAGGAAGAAAAAGACTTTGGCG (SEQIDNO:1467) GTTAAGGCACTCACGCTACATTATAAAGGGTTGATGGCACCGGAAACAA GCAATATCCTAAGGTCATTCGGATTCAAGTCTAAGGATTTGGAAAAGAT GGCGTATATGGCGTTGTTTGGTACGGTGGCTGCTTGGGGCATTTTCAAT CGCTCGACGGAGACAATGCGTTCAGTCGCTAACTGGCCTAGACCTGAG GAATTATGATGTATCCCTTCAATATATTGTAATCCTCATTCGTCCCTATCC TTTCATTTGATAAAAGAAACTTTGTTGCTCCTTTTAATAGTTGGTCCCTCC TGTCCCTTTTCTGGAACCTGGTTGTATCGATTATTGAAAGTTGCAATAAA CGGATTTAA(SEQIDNO:1099) R2 R2- Drosophila GAAGCTGGGAAGCTGGGTCGGATGAGCGCAGAAGGGGTGTTCTTCGG GAAGCTG TAGATG FERRSNSWGYQNLE 2_DWi willistoni AGCACTGTAATTCATAAGTCGTAAGTCTGATCAAGTCGACTCGGAACCT GGAAGCT TACTAAC PSNVGQDMNTVPRI CTTCGTGGTGTTTCCTGGGTGCTGTTGAGTTCCTAGTCTCTAGGTTCTCT GGGTCGG CTCTAGT NNTTTTPATSRPGDQ CCAGTAGCTAATTCGAGCGGCGAAGCAACTCTTGGGGTTACCAGAACCT ATGAGCGC TTCTCTA PREAIAVVNLAGEIPC TGAGCCAAGTAACGTTGGTCAAGATATGAATACTGTGCCTCGGATAAAC AGAAGGG TACTTTT AVCGRLFNTRRGFGV AACACAACAACAACTCCAGCGACTTCCCGTCCTGGAGACCAACCGAGAG GTGTTCTT GCCTGC HMSHQHKDELDTQR AGGCTATAGCAGTGGTAAATCTCGCGGGAGAGATTCCCTGTGCAGTAT CGGAGCAC TACCTTG QREDVKLRWSEEEA GTGGGCGCCTCTTCAATACTAGAAGGGGGTTTGGGGTTCATATGTCACA TGTAATTC GCATTA WMMARKEVELEAS TCAACACAAAGACGAACTAGATACGCAACGTCAGCGTGAAGATGTAAA ATAAGTCG CATCTAA GNLRFPNKKLAEVFT ACTCCGATGGAGTGAGGAAGAAGCGTGGATGATGGCGAGAAAGGAGG TAAGTCTG AAAGGT HRSSEAIKCFRKRGEY TGGAGCTCGAAGCAAGTGGTAATTTGAGATTTCCTAATAAGAAGCTAGC ATCAAGTC ACAAAC KAKLEQIRGQSTPTPE GGAAGTATTTACTCACCGTAGCTCCGAAGCAATTAAATGTTTTCGGAAG GACTCGGA ATCGCA ALDSITSQPRPSLLER AGGGGTGAATATAAGGCAAAACTGGAGCAGATCAGAGGGCAATCTACT ACCTCTTC TTGGCA NHQVSSSEAQPINPS CCCACCCCAGAAGCGTTGGACTCAATTACCTCACAGCCTCGCCCTAGTTT GTGGTGTT AAAAGA EEQSNWEIMRILQGY ACTCGAGCGAAACCACCAAGTATCATCGTCGGAAGCGCAACCAATCAAT TCCTGGGT GGTGGT RPVECSPRWRAQVL CCATCAGAAGAACAGTCGAACTGGGAAATCATGCGGATACTACAGGGC GCTGTTGA TTTAGTA QTIVDRAQAVGKETT TATCGCCCCGTAGAATGTAGTCCCCGGTGGAGAGCCCAGGTCTTGCAAA GTTCCTAG CATAGG LQCLSNYLLEVFPLPN CTATCGTAGATAGGGCGCAGGCCGTAGGGAAGGAAACCACTCTCCAAT TCTCTAGG CGCTGT EPHTIGRSNLRRPRTR GCTTATCCAACTATCTCCTGGAAGTATTTCCATTACCAAACGAACCACAC TTCTCTCC GGGACT RQLRQQEYAQVQRR ACCATCGGTCGGAGCAATTTGCGAAGACCTCGAACTAGGAGACAGTTA AGTAGCTA TCATTGT WDKNTGRCIKSLLDG AGACAACAAGAGTACGCACAGGTTCAGCGTCGTTGGGATAAGAATACT A(SEQID CCCGAT TDESVMPNQEIMEP GGGAGATGCATTAAATCCTTGCTTGATGGAACAGATGAGTCGGTTATGC NO:1223) GATGCA YWKQVMTNPSTCSC CAAACCAAGAGATAATGGAACCCTATTGGAAACAAGTAATGACGAATCC GCGAAT ENTRFRMEHSLETV CAGCACATGCTCTTGCGAAAACACAAGATTCCGTATGGAACATTCGCTT CGTGCA WSAITPRDLRENKLK GAGACGGTTTGGTCAGCGATAACGCCACGCGACCTGAGGGAAAATAAG TACGAG LSSAPGPDGITPRTAR TTAAAGTTGTCAAGTGCTCCGGGTCCTGACGGTATCACTCCAAGAACAG ATTGTCC SVPLGIMLRIMNLIL CCAGGAGTGTACCCTTAGGCATTATGCTACGCATAATGAACCTGATTCTC AGTAGT WCGKIPFSTRLARTIFI TGGTGCGGCAAAATACCATTCTCTACCCGACTGGCCAGAACTATCTTCAT TGGTTG PKTVTANRPQDFRPI TCCGAAGACTGTGACGGCAAATCGACCGCAAGACTTTCGTCCAATAACA CTCGTAT TVPSVLVRQLNAVLA GTCCCCTCGGTTTTGGTCAGGCAATTAAACGCTGTTCTGGCTTCTCGATT CTTTAGA SRLASKVNWDPRQR GGCTTCTAAAGTCAACTGGGATCCAAGGCAGCGCGGTTTCCTACCTACC AGATTTC GFLPTDGCADNATLV GATGGGTGTGCTGATAATGCGACGTTGGTTGATCTCATTTTGCGGGAGC CTTCCTC DLILREHHKRWKSCY ACCATAAACGGTGGAAGTCATGTTACCTTGCGACGGTGGATGTCAGCAA GGCGAT LATVDVSKAFDLVSH GGCTTTTGACTTAGTATCACACCAGGCCATTATCAAGACTTTACAGGCCT CAAAAN QAIIKTLQAYGAPTNF ATGGTGCTCCAACAAACTTTGTTAGCTTTATAGAAGAACAGTATAAGGG AAAAAA VSFIEEQYKGGGTSL CGGCGGAACCTCCCTCAATGGGGCAGGATGGAGTTCAGAGGTGTTTAT AAAAAA NGAGWSSEVFIPAR ACCCGCGCGGGGCGTTAAGCAAGGTGACCCTCTGTCTCCACTATTATTT AAAAAA GVKQGDPLSPLLFNLI AATCTTATCATTGATAGATTACTTAGGTCCTACCCCAGAGAGATTGGTGC (SEQID IDRLLRSYPREIGAKV CAAAGTCGGAAATACCATGACAAGCGCGGCAGCGTTCGCGGATGATCT NO: GNTMTSAAAFADDL GGTGCTATTTGCGGAAACTCCGATGGGGCAAACATTGTTGGATACCACG 1346) VLFAETPMGQTLLDT CTAGGCTTCCTAGCCTCCGTGGGACTCTCCCTTAATGCTGATAAGTGTTT TLGFLASVGLSLNAD CACTGTCAGTATAAAGGGGCAAGCCAAGCAGAAGTGTACTGTCGTAGA KCFTVSIKGQAKQKC ACGACGGAGCTTTTGTGTAGGGGAGCGCGAGTGTCCTTCATTGAAGCG TVVERRSFCVGEREC TACTGAAGAGTGGAAGTATTTAGGTATCCGGTTCACTGCGGATGGGCG PSLKRTEEWKYLGIRF GGCTCAATATAGTCCAGCAGACGACCTCGGTCCGAAGCTGTTAAGATTA TADGRAQYSPADDL ACAAGAGCCCCTCTGAAACCACAACAGAAGTTATTTGCACATAGGACTG GPKLLRLTRAPLKPQ TCCTTATCCCACAACTCTATCACCAACTAACACTTGGGAGTGTGATGATA QKLFAHRTVLIPQLYH GGCGTCCTAGGAAAATGTGACAGATTGGTACGGCAATTCGTAAGGAGA QLTLGSVMIGVLGKC TGGTTAGATCTCCCACTGGATGTACCAGTTGCGTACTTTCACGCCCCCCA DRLVRQFVRRWLDL CACTTGTGGGGGTCTCGGGATTCCGTCAATTAGATGGATAGCACCGATG PLDVPVAYFHAPHTC CTTCGTCTGAAGCGATTGAGCAATATTAAATGGCCCCACCTCGAACAAT GGLGIPSIRWIAPML CCGAGGTAGCTAGCTCTTTCATTGACGACGAATTGCAAAGGGCTCGAGA RLKRLSNIKWPHLEQ TAGATTAAAGGCGGAAAATGTGCAGCGGTGTTCGCGTCCAGAGATTGA SEVASSFIDDELQRAR CTCGTATTTCGCAAATAGGTTGTACATGTCTGTTGATGGTTGCGGTCTCC DRLKAENVQRCSRPE GTGAAGCAGGTCATTATGGCCCGCAACATGGATGGGTGAGTCAGCCCA IDSYFANRLYMSVDG CGCGCTTGCTAACAGGAAAGGAATATTTGCACGGTGTCAAATTGCGGAT CGLREAGHYGPQHG AAATGCCCTACCCTCGAAGTCTCGTACGACGAGGGGAAGGCACGAATT WVSQPTRLLTGKEYL GGAGAGACGGTGTCGTGCAGGATGTGATGCTCCCGAGACAACAAACCA HGVKLRINALPSKSRT CATCTTGCAAAAATGCTATCGTACGCATGGGAGGCGGGTAGCTAGACA TRGRHELERRCRAGC CAACAGCGTAGTAAATGCCGTCAAGCGGGGACTTGAACGGAAAGGCTG DAPETTNHILQKCYR CGTTGCCCATGTCGAACCAAGTCTGCAATGCGACTCGGGCTTAAATAAA THGRRVARHNSVVN CCGGACCTGGTGGGAATCCGACAGAATCACATTTATGTGATAGACGTTC AVKRGLERKGCVAH AGGTTGTGACAGACGGACATTCCTTAGACCAAGCGCACCAGCGCAAGG VEPSLQCDSGLNKPD TCGAAAGGTACGACAGAGCTGACATAAGATCACAAATGCGGCGATTTTT LVGIRQNHIYVIDVQ CGGAGTGACAGGTGAAATCGAGTTTCATTCCGTTACACTCAACTGGAGA VVTDGHSLDQAHQR GGAATCTGGAGTGGTCAGTCGGTAAAACGATTGATTGCAAAAGATCTCC KVERYDRADIRSQMR TCATCGCTGAAGATACCAAACTCATCAGCGTCAGAGCAGTAAACGGCGG RFFGVTGEIEFHSVTL AGTGACGTCTTTTAAATATTTCATGTATTGTGCTGGGTATACTCGAAGCT NWRGIWSGQSVKRL AGATGTACTAACCTCTAGTTTCTCTATACTTTTGCCTGCTACCTTGGCATT IAKDLLIAEDTKLISVR ACATCTAAAAAGGTACAAACATCGCATTGGCAAAAAGAGGTGGTTTTAG AVNGGVTSFKYFMY TACATAGGCGCTGTGGGACTTCATTGTCCCGATGATGCAGCGAATCGTG CAGYTRS(SEQID CATACGAGATTGTCCAGTAGTTGGTTGCTCGTATCTTTAGAAGATTTCCT NO:1468) TCCTCGGCGATCAAAANAAAAAAAAAAAAAAAAAA(SEQIDNO:1100) R2 R2- Petromyzon CGGTGCGTTCCCTTGGGTAAGGAACACGAGTCTTAGTGGCCTTGACCTC CGGTGCGT TGAGAA RPQTKLMTDKLKFSS 2_PM marinus CACGTGGTCCCGCTGGTAACATCATCTCTTGATGATGGCTAACAAGGCT TCCCTTGG TAATGA QLARGLAKQRAMDG AATGCACCCATTCCATCTCCTATCTCGCATGGAGGCCGCTATGCGTGATT GTAAGGA GGTGCT ARVGDPPITVRPTET ACTAGAGGCCACAAACAAAACTTATGACTGACAAACTTAAGTTCTCAAG ACACGAGT AACCTCC DLCNTEGSWGRRPM CCAGTTGGCACGAGGCCTGGCAAAACAACGTGCTATGGATGGCGCTCG CTTAGTGG CTGGGC KLLFVSVSTQTQNED GGTTGGCGATCCACCCATTACAGTTAGACCCACAGAAACCGATCTGTGC CCTTGACC CTGACC ALWASDVAKPMASR AACACTGAGGGTTCATGGGGACGCCGTCCTATGAAACTCTTGTTTGTCT TCCACGTG AAACCC SALKMTSIPSMTFHN CGGTGTCAACCCAGACACAGAATGAAGATGCCCTCTGGGCATCTGATGT GTCCCGCT AGAACA SSLEKEEEMNYDFYE TGCTAAACCTATGGCGTCTAGGTCGGCGCTAAAAATGACGAGTATACCT GGTAACAT CATCACT QIKSLVESDDSSDDFT TCCATGACCTTCCATAACTCGTCCTTGGAAAAGGAAGAGGAGATGAACT CATCTCTT GGCCAA EDDEDVEESFLDISAE ACGATTTTTACGAACAGATTAAAAGTCTAGTTGAGTCGGATGACTCTTCA GATGATGG GATGAT EPVLGKFPIDTKGTIT GATGACTTTACAGAGGATGATGAGGATGTGGAGGAGTCCTTCCTCGAC CTAACAAG TTCCCGC VVLPSLEYICVICKQH ATATCGGCTGAGGAACCCGTGTTGGGAAAGTTTCCCATTGACACCAAGG GCTAATGC AGCACG MGKASELVAHFNIKH GAACTATCACGGTTGTACTGCCTAGTCTGGAGTATATATGCGTTATCTGT ACCCATTC TTGCTTT RDIPLVFKCAKCDKT AAACAACACATGGGCAAAGCATCCGAACTTGTTGCACATTTTAACATCA CATCTCCT TCTCTCT NSNHRSIACHAPKCG AACACAGAGACATTCCTCTGGTGTTTAAGTGCGCTAAATGTGACAAAAC ATCTCGCA CGATAC GIKLTEESLPMVCECC CAACTCAAACCACCGATCGATTGCCTGTCATGCCCCCAAATGTGGGGGA TGGAGGCC CCGAGA QARFATLSGLSQHKR ATAAAGTTAACTGAGGAAAGCTTACCAATGGTCTGTGAATGTTGCCAGG GCTATGCG ATGTTCT HAHPVTRNEERIKDG CACGCTTTGCGACTCTAAGTGGCCTTTCGCAGCACAAGAGGCATGCTCA TGATTACT GCGGAA IKGTSQRGVHRSCWS CCCAGTCACCCGTAATGAGGAAAGGATTAAAGATGGTATAAAGGGTAC AG(SEQID CTACGC LKEVEQLALLELQFQ CTCGCAGAGAGGGGTACACCGTAGCTGCTGGTCTTTGAAGGAAGTAGA NO:1224) AGTCTAT GKKNINKIIAEALGTK ACAGCTGGCCCTTCTAGAGTTGCAGTTTCAGGGGAAAAAGAATATCAAT GAACAG TNKQVSDKRRDLSKK AAGATCATTGCTGAAGCGCTTGGGACTAAGACCAACAAGCAAGTCTCTG TCACAG TGAPMSDSLHFSSRP ACAAGAGACGGGACCTAAGTAAAAAGACAGGGGCCCCCATGTCAGACA ACAACC LETLSPPPNVTTGTSS GCTTACATTTTTCTTCTAGGCCTCTTGAGACATTGTCTCCCCCACCAAATG TCTGATC ILAQAAERLTNENSG TAACAACGGGGACTTCATCCATACTCGCTCAAGCAGCTGAGCGGCTTAC CAAG TLEKPAMEAIKAWLN GAATGAGAATTCTGGGACCCTGGAAAAGCCTGCAATGGAGGCAATAAA (SEQID GEGQHDALVETATAL GGCTTGGCTTAACGGCGAGGGCCAACATGATGCCCTCGTAGAAACTGC NO: MLCPMRLVKNKGKR CACAGCATTGATGCTTTGTCCGATGAGATTGGTGAAAAACAAAGGCAAA 1347) SKPENDIIKPRILPTRS CGTTCAAAACCCGAGAACGACATTATTAAACCCAGGATATTACCCACAC WMKKRAEKRGSFM GATCTTGGATGAAGAAGAGAGCGGAAAAACGAGGAAGCTTCATGAAG KHQKLFFKNRSLLASL CACCAGAAGCTCTTCTTTAAGAACCGCTCTCTTCTTGCGTCCTTAGTCCTG VLDGTERHECRIPNA GATGGCACTGAACGTCATGAATGCCGAATCCCGAACGCAGATGTATATC DVYRFYCEKWEKVLP GTTTTTACTGCGAAAAATGGGAGAAGGTGTTGCCATTCAATGGCCTGGG FNGLGQFKSSGVAN CCAATTTAAGTCATCAGGTGTTGCAAATAACGAATACTTTGAGCCCCTAA NEYFEPLISVEEVQTA TTTCGGTGGAGGAAGTTCAGACTGCCATACGGGCCATTAAACCAACGTC IRAIKPTSAAGPDGLT AGCAGCTGGGCCAGATGGCCTAACAAGGGCTGCAATCTGTGCTGCCGA RAAICAADPEGRTLT CCCCGAGGGTCGGACACTGACAGCCCTATTTAATGCATGGATGATTACA ALFNAWMITGIIPKEL GGAATTATTCCCAAAGAGTTGAAAAAGAATAGGACGATTCTTATTCCTA KKNRTILIPKVMDDE AGGTTATGGACGATGAAAAGCTGAAAGAATTGGGGAACTGGAGACCAA KLKELGNWRPITIGS TAACGATTGGTTCAATGATTCTGAGATTATTTTCCAGAATAATGACTGCA MILRLFSRIMTARLAR CGTCTTGCTCGTGCTTGTCCCTTAAACCCAAGGCAGCGTGGTTTTATAGC ACPLNPRQRGFIAAS GGCATCTGGCTGCTCTGAAAATCTTAAGGTGCTACAGGACCTTATGAGA GCSENLKVLQDLMR CACGCTAAGAAATTGCACAGGCCGTTGGCTGTCATGTTCATCGACATAG HAKKLHRPLAVMFID CGAAAGCTTTTGACTCGGTTTCGCATGCTCATATTTTATGGGTGTTAAGG IAKAFDSVSHAHILW CACAAGAAAGTAGATGAACACGTGGTGGGCATCATCCAGAACGCCTAC VLRHKKVDEHVVGII GATCGGTGTACGACCTCGTTCAAAAGCAATGGCGAGTCGACTCGAGAA QNAYDRCTTSFKSNG ATTAGCATACGTGTTGGTGTCAAACAGGGTGACCCCATGTCACCCCTGC ESTREISIRVGVKQGD TCTTCAATCTTGCCATGGACCCTTTGATATGCACCCTAGAGTCACACGGA PMSPLLFNLAMDPLI GTTGGGTACTCCATTGATACCGACCACGTGACAGCTCTTGCGTTTGCTGA CTLESHGVGYSIDTD TGATTTGGTGTTGGTGAGCGAATCTTGGGTTGGTATGGCCGCCAATCTA HVTALAFADDLVLVS GCGATCTTGGAATCATTTTGTGGGCTATCGGGATTGGAGGTTCAGGCCA ESWVGMAANLAILE GAAAGTGCCAGGGCTTCATGATAAGCCCAACCAAAGATTCATATACGGT SFCGLSGLEVQARKC GAACAACTGCGACCCATGGACTATCAAAAATAAAGATGTCCATATGATC QGFMISPTKDSYTVN CAACCTGATGAATCAACGAAATACCTTGGTCTAAAAATTTGCCCTTGGAC NCDPWTIKNKDVHM TGGCATTATACGGTCGGATCTACATGTTCAACTAAAGACACGGATCTCG IQPDESTKYLGLKICP AAAATCGATGAGGCGCCTCTGAAACCGACTCAGAAGGTCGAACTCCTCA WTGIIRSDLHVQLKT ATGCCTACGCCCTACCCAGATTATTGTACCCTGCTGACCACTCGGACTGC RISKIDEAPLKPTQKV AAGCAATCAACTCTCCGTGTGTTGGACCAAGAAATAATAAAGGCGGTAA ELLNAYALPRLLYPAD AAGGATGGCTCCATCTTCCCGCGTCAACCTGTGACGGGCTGTTGTACGC HSDCKQSTLRVLDQE CAGAGCCCGAGACGGAGGCCTTGCCATCTTGAAACTGGAAAATGCAATT IIKAVKGWLHLPASTC CCTTCGGTTCAAGTTAGAAGGCTGCAACGTATTGCAAACTCCTCTGACG DGLLYARARDGGLAI CTATCGCTCGAAACATTGCGTCCTCGCAGGGTGTGGAGGAAGAGTACC LKLENAIPSVQVRRLQ GAAGTCTGTGGGTACGGGCAGGGGGTGACAGCGAAGCAATCCCAACG RIANSSDAIARNIASS TTCTTTCTCAGGGGTTCGGAATCAAAAGAGCCCGTGTATCCGAGACCCT QGVEEEYRSLWVRA GCGATTGGAGGAAACGCGAATCTCGGAGACGGTGCGAAAAGCCGGTTC GGDSEAIPTFFLRGSE AAGGAAGGGGCATTGTAAACTTTGCGCAAGATAGAATCAGTAATGCAT SKEPVYPRPCDWRKR GGTTGGGGCCACGGTGCGGCTTTAAACAATGCTTCTTTATCGCAGCATT ESRRRCEKPVQGRGI ACAATTAAGGGCAAATATTTACCCAACAAGAGAAAGCATAAACAGAGG VNFAQDRISNAWLG CAGAGATGGTGCCTCACGGTCCTGCAGGAAATGCTCTGCCAGGCTGGA PRCGFKQCFFIAALQL GTCTCTCTCGCACATTCTTGGTCAATGTCCCGCAGTACAAAAATTCAAGG RANIYPTRESINRGRD ATTGCGCGACGCAATAAGATCAGCGACATTCTAGCTGACGAAGCGGCG GASRSCRKCSARLESL AGACTGGGCTGGTGGGTGTACAAAGAGCCACGGTTCACATCTGAAGCC SHILGQCPAVQKFKD GGAGAGCTAAGGAAACCTGCCCTTGTGTTTGCCAAAGGTGAGGAAGCG CATQTEAEMVPHGP CTTGTTATTGATGTCACCGTCCGGTTTGAGCTCTCGAGGAAAACCTCATC AGNALPGWSLSRTFL AGAGGCTGCCTCGCACCAAGTTGCGTACTACACCCCCCCTTGTGATCAA VNVPQYKNSRIARRN GTCAAAGTGCTGACGAAGGCAAGCAATGTCACATTCTTTGGATTCCAGG KISDILADEAARLGW TTGGGGCAAGAGGGAAAGTGGCCCCTTGAGAATAATGAGGTGCTAACC WVYKEPRFTSEAGEL TCCCTGGGCCTGACCAAACCCAGAACACATCACTGGCCAAGATGATTTC RKPALVFAKGEEALVI CCGCAGCACGTTGCTTTTCTCTCTCGATACCCGAGAATGTTCTGCGGAAC DVTVRFELSRKTSSEA TACGCAGTCTATGAACAGTCACAGACAACCTCTGATCCAAG(SEQID ASHQVAYYTPPCDQ NO:1101) VKVLTKASNVTFFGF QVGARGKVAP(SEQ IDNO:1469) R2 R2- Schmidtea AGTCATAGGGTGAACTGCAATTCTGACACGATGACCGAGCTGTGTCAGT AGTCATAG TAGAAG RKELVTIKNLFEESGA 2_SMed mediterranea TTGCAGCTAGTCGCTAAAGACTCGATCAGTCCGCCAAGTGAGGTGGCCG GGTGAACT GGAAAC TAPAPVPLEVAVEVH GGTATCTGCAGCACTAGAGCCACTGGTATCAAGAGCAGAGATACGCGA GCAATTCT AAAGGA QSSSVPEITDESTTTQ GTGGAAGTTGAGTACGACTACCTTCACGGGGTCCTCCTGATAACCACAG GACACGAT AAAACG EGSYSEPPIHRCENC TGGACTGTGGGAACTAAATGTGTGCTCAGCGTTCCCTACTTTCTCGTAG GACCGAGC AAATGA GREFRTRAGVQQHR GGTAAAGGGTATGATAACCCAGAGAATATCCCATGGGAGATATCCATG TGTGTCAG CTGGAA RKAHTNEFMEEKEKA GAAAAAGCACCACGTTAGACAATCCGATGGTCTAACTCGGCTCCGAGG TTTGCAGC ACTATG APTKKLRWTDEEKEIL GGCTAACTATCCCAAAGGGCTTAAAGAAAAGAACTGGTTACAATCAAAA TAGTCGCT AAGGAT IESEIKIIKEGSLKEQHE ATCTATTTGAAGAGTCAGGTGCCACCGCGCCTGCACCAGTTCCCCTAGA AAAGACTC ATAGCT INKILASRMPGRSQD AGTTGCAGTTGAGGTACACCAGTCATCGAGTGTTCCCGAGATAACCGAC GATCAGTC GAAAGC GIAKIRQKQEHKAEIQ GAATCTACGACTACTCAGGAAGGAAGCTATTCAGAACCACCGATACACC CGCCAAGT CGCAAG RRLHGTVTTNETRGN GATGTGAGAACTGTGGAAGAGAATTCAGAACAAGAGCAGGAGTTCAAC GAGGTGG GAAGGC RTSEITEPIRSLPINTK AGCACCGGAGAAAAGCTCACACCAACGAGTTTATGGAGGAGAAAGAGA CCGGGTAT TAAGTC TWSEDEMKRMLAEE AGGCAGCTCCAACCAAGAAGCTTCGATGGACAGACGAAGAAAAAGAGA CTGCAGCA CTGAAA VKLRTKNEKDINKKLA TCCTCATCGAAAGCGAGATAAAGATCATCAAGGAAGGATCCCTTAAAGA CTAGAGCC CCGATCT EIFPNRTMGSIKSKRT ACAACACGAAATAAACAAGATTCTAGCTTCAAGAATGCCTGGAAGATCA ACTGGTAT ACATCTT KDKDYQDLVKLTMQ CAAGACGGAATAGCCAAAATTCGCCAGAAACAAGAGCACAAGGCTGAA CAAGAGCA CGATCC TISENPDNETDENTS ATACAGAGAAGGCTACATGGGACCGTCACTACCAACGAAACCAGAGGA GAGATACG CAAGAG NTENNSTDAEKEVKN AACAGAACCAGCGAAATTACCGAACCAATAAGAAGCTTACCAATAAACA CGAGTGG GAACTG YLNMLLLTINEEEWL CCAAAACCTGGAGCGAAGACGAAATGAAAAGAATGCTAGCCGAAGAA AAGTTGAG TGGGTT TSTLKEAATLALQGKK GTGAAGTTGAGAACCAAGAATGAAAAGGATATCAACAAGAAGCTAGCA TACGACTA AAGCTT TEASEKLNEYASKTLF GAAATATTCCCGAACAGAACAATGGGGTCCATAAAGAGCAAAAGGACG CCTTCACG GAGCCG PGLKITNQTRKREKKI AAAGATAAGGACTACCAGGATTTGGTAAAGCTAACAATGCAAACAATCA GGGTCCTC ACGGAA SKRETRRQEYAEIQKL GCGAGAACCCAGACAATGAAACAGACTTCAATACCAGCAACACGGAAA CTGATAAC AAAGCG YKKNISSAAEKAINGK ACAACAGCACTGATGCAGAGAAAGAAGTGAAGAACTACCTCAATATGC CACAGTGG AATGCA WSIKPEEEYHNNKDL TACTACTGACCATCAACGAGGAAGAATGGTTGACATCCACTCTGAAGGA ACTGTGGG TGTTAG IKAWKPILEAPPFSDC AGCAGCAACACTAGCACTACAAGGAAAGAAAACTGAAGCATCTGAAAA AACTAAAT ACGACG RPIENIKEMDYALMEI GCTTAACGAGTATGCAAGCAAAACGCTGTTCCCTGGATTGAAAATAACT GTGTGCTC AGGTAC STAEIFLAIRAMGKTA AATCAGACGAGGAAACGAGAAAAGAAAATATCAAAAAGAGAAACCAG AGCGTTCC AGTCAC PGPDGIKYSKLKKNIQ AAGGCAAGAATACGCTGAAATACAGAAACTCTACAAAAAGAATATTTCA CTACTTTCT CTCCTCG SMAILFNTCLLTSFLP AGCGCAGCCGAAAAAGCAATCAATGGGAAATGGTCCATAAAACCCGAA CGTAGGGT TGGTATT LPLKIARTILIPKQENP GAAGAATACCACAACAATAAGGATCTCATTAAAGCATGGAAACCAATAC AAAGGGT TGGCGG GILDYRPLTIASVVTR TAGAGGCACCTCCATTCAGTGACTGTAGGCCCATCGAAAACATCAAGGA ATGATAAC GCAATG VFHSILAKKLDNNAQ AATGGACTACGCTTTAATGGAGATCAGCACGGCAGAAATCTTCCTCGCA CCAGAGAA CTCACTA LSQRQKGFRKCDGV ATAAGAGCCATGGGGAAGACTGCACCAGGACCTGATGGCATTAAATAT TATCCCAT AATTAA AENIVILETILTNSRSE TCAAAGCTCAAGAAAAATATCCAATCAATGGCAATATTATTCAACACATG GGGAGAT CTGTGA KRPLCMAFVDLRKAF CCTACTAACGAGCTTCCTGCCACTCCCATTGAAGATAGCAAGGACAATCC ATCCATGG GTAGCT DSVGHESIIRGAKRV TGATACCTAAACAAGAGAATCCAGGTATCCTTGACTATCGACCACTAAC AAAAAGCA GAGAAC GVPPMLLEYISSSYQ AATAGCCTCAGTGGTGACAAGAGTGTTCCACAGTATCCTTGCGAAGAAG CCACGTTA TGTATGT NASTNLFGEILNSRR CTCGACAACAATGCCCAATTAAGTCAACGACAAAAAGGATTTCGAAAAT GACAATCC GTATCAT GVRQGDPLSPILFNF GTGATGGAGTTGCGGAAAATATAGTAATACTCGAAACTATATTAACCAA GATGGTCT GAAAAA VIDEALENLNRNIGYL CAGCAGAAGTGAAAAGAGACCGCTCTGTATGGCCTTCGTTGACTTAAGA AACTCGGC AAAA LKEEKVSCLAFADDIV AAAGCATTCGATTCTGTGGGACATGAGTCTATCATCAGAGGAGCAAAAA TCCGAGGG (SEQID LIAETKGGLENHIEKLL GAGTTGGAGTGCCACCAATGTTGCTCGAGTACATTTCKTCAAGTTACCA GCTAACTA NO: EKLNGAGLELNASKC GAATGCGTCTACTAACTTGTTCGGCGAAATACTCAACTCGAGAAGAGGA TCCCAAAG 1348) ATLMVMKNGKEKST GTCAGGCAAGGCGACCCTCTGAGCCCTATTCTCTTCAATTTTGTTATCGA GGCTTAA YISTKAIKIKENDIPTM TGAAGCTCTAGAAAACCTCAACAGGAATATTGGATATCTACTGAAGGAA (SEQID KATETYKYLGLQMGF GAAAAAGTGAGTTGCCTAGCTTTCGCGGACGACATAGTCCTGATAGCTG NO:1225) KAREQNANEVITEGL AGACAAAAGGAGGCCTAGAGAATCATATCGAGAAACTATTAGAGAAGC ENITRAPLKPQQRIHI TGAATGGGGCCGGTCTCGAGTTGAACGCCTCGAAATGCGCAACACTGA LRDFLIPRLIHKLVLGR TGGTGATGAAAAACGGAAAGGAAAAATCAACGTATATATCAACAAAAG VAKKSLKRIDQNIRKK CAATCAAAATCAAAGAAAATGACATTCCAACAATGAAAGCCACAGAAAC VRNWLHLPKDTTAA GTACAAATATCTCGGATTGCAAATGGGTTTCAAAGCTAGAGAACAGAAT FIHADAGDGGLGVP GCTAATGAGGTTATTACAGAAGGACTGGAGAATATAACAAGAGCACCA ALEHTIPLLKRERITNL CTGAAGCCCCAGCAGAGGATACATATCCTACGAGACTTCCTTATACCAA RKSNDPVTKECLRME GATTAATACACAAATTGGTATTAGGAAGAGTGGCCAAGAAGTCATTAAA YTKQVLGKWSRPTKI AAGAATCGACCAGAATATAAGAAAGAAAGTGAGGAATTGGCTACATCT GETLATNKSQLKEAF CCCTAAAGACACGACAGCAGCATTCATACACGCTGATGCAGGAGATGG RKQMLITLDGKGLKD AGGGCTTGGAGTACCAGCGTTAGAACACACAATTCCTCTACTGAAAAGA HHETPTIHKWIRRGE GAAAGAATAACTAATCTAAGAAAATCCAATGATCCAGTTACCAAAGAAT NMTGKQFITAVKIRG GCCTGAGAATGGAGTACACCAAACAAGTACTGGGAAAATGGAGTAGAC NLVATKSRNSRGRPE CAACTAAAATTGGAGAAACTCTGGCTACCAACAAAAGCCAACTAAAAGA QEKLCEAQCGRPDSL AGCATTCAGAAAACAGATGTTAATAACGCTAGATGGAAAAGGGCTAAA GHILQGCWRTHGM AGATCACCACGAAACGCCCACTATCCACAAATGGATCAGAAGAGGAGA RVERHNNICRRIKAI GAACATGACCGGCAAACAGTTTATCACAGCAGTTAAAATAAGAGGAAA MKGKESEVVEEPRLQ CCTTGTGGCAACTAAGTCAAGAAATAGCAGAGGGAGACCCGAACAAGA TNEGLRKPDLLICHKG GAAACTATGTGAAGCTCAATGTGGACGACCAGATAGCCTGGGACACAT KIIICDAQVVADSSNC ATTACAAGGTTGCTGGAGAACACATGGTATGCGAGTGGAAAGGCATAA SLESENQRKIDYYKKD CAACATTTGTCGCAGAATTAAAGCAATAATGAAAGGAAAGGAAAGCGA SVVSEARKLIGRVDE AGTAGTCGAAGAACCAAGACTACAAACGAATGAAGGTCTCAGAAAGCC DIIIMAVTFNWRGAI TGACTTACTGATCTGTCACAAAGGTAAAATAATAATATGCGATGCACAA SKTSIRDLDMLLDIKS GTAGTGGCAGATAGCTCGAACTGCAGTCTTGAAAGCGAAAACCAAAGA KEVIKMSRKIIRDNSI AAGATAGATTACTACAAGAAAGATTCAGTAGTATCAGAAGCAAGAAAA MVEMHRNRTEKRR CTTATCGGACGTGTCGACGAAGATATAATTATAATGGCAGTGACCTTTA (SEQIDNO:1470) ACTGGAGAGGAGCCATCTCAAAAACATCAATAAGAGATTTGGACATGCT TCTAGATATAAAATCAAAAGAAGTAATTAAAATGTCAAGGAAAATAATC AGAGATAATAGCATCATGGTGGAGATGCACAGAAACCGGACTGAGAAA AGGAGATAGAAGGGAAACAAAGGAAAAACGAAATGACTGGAAACTAT GAAGGATATAGCTGAAAGCCGCAAGGAAGGCTAAGTCCTGAAACCGAT CTACATCTTCGATCCCAAGAGGAACTGTGGGTTAAGCTTGAGCCGACGG AAAAAGCGAATGCATGTTAGACGACGAGGTACAGTCACCTCCTCGTGGT ATTTGGCGGGCAATGCTCACTAAATTAACTGTGAGTAGCTGAGAACTGT ATGTGTATCATGAAAAAAAAA(SEQIDNO:1102) R2 R2- Tribolium TGGAAGACCCCGCCCATGAGGCTTGGAGAGTGTGATCCTGATCACACTT TGGAAGAC TGATGC MKSRSFRRIGDCAAG 2_TCas castaneum GAAAAGTTATGCTGAGTACGTCGTGAGAGTCGGTAACTGTCCCAGGAT CCCGCCCA TCCTTTG SSRRGVRLTGKAGRE GGTCTGGGATAGGCTAAACCTCAGCAGGGGAAAGTTGTAGGGGCCTGC TGAGGCTT GTTTTAC GRFAASPHLSPRYLA CACCCCTACACTTTTATAGATATGGCATTCGATACCTCAAATAGAGCCTC GGAGAGT CATCTGT GSVSGNVPSVPPGP GGACTTGGAGGAGCATGGTTCCCCTCCTCCTCGTACTAGACCTGGAACC GTGATCCT GGGGGC GLGAGAPAFAAGRN AACGGTCTTGACAACCCCATTGGACCTACGGGAGCGGACCATGCTCATG GATCACAC ATCGGT ADGGPAQNPCPYCA GACATGGATTCCGAAGACGAAGCGGGGGAACACGGACCCCCCGCCGAT TTGAAAAG CCTCAC RSFTTANGRGLHIRR AATGCTCACTTAACGTCAGGCGAACCCATCGAAATCATCTTGATGTTACC TTATGCTG GGTTTC AHPDEANNAIDIERI CTTTCAAAGCAGGTCATGCGGCATATGTCTCAATGCCGGAAAGGGTGGC AGTACGTC CTCGGG HARWSHEETAMMA TCCCGGGCGTAACGGACATCTATCGCTGATGGAASCTAGCTCGCTCTTG GTGAGAGT TTTCCTA RLEAGAIQRGGVRF AAGAACGGCGGACGGGGACTTGGAATCGTGGTGTGGTTCTGATGTAAG CGGTAACT TTGTTTT MNQFLVPRMPGRTL TCCTGAAATTATGGCGTGATGGCCCGCCCGCCCGACCGGAGGGACTTA GTCCCAGG TCCTAAA EAVKSKRRDATYKAL GAACCCCCTTCCGCGAGGGTCCTGTCTGTAGGTCCWCCATCTCCGTAAA ATGGTCTG CCCGAC VQRFLQAPQINLPEL ACGAGTTGGAGGAAACCGCAGACGGGGATGAAGAGTCGTTCGTTCCGT GGATAGG AAGGAG RDGDAPRQPDPQQE CGCATCGGGGATTGCGCAGCTGGGTCAAGTCGGCGGGGTGTTCGGCTA CTAAACCT CCCTTTG NPPEPPSFDGAIRGA ACGGGCAAGGCTGGGCGTGAGGGGAGGTTTGCAGCTTCCCCTCACTTA CAGCAGG GCCCTCC VADLVGGVDWQRL AGCCCACGTTATCTTGCAGGCTCGGTTTCAGGTAATGTTCCTTCAGTTCC GGAAAGTT TCCTTAA GFQGDRLCNIARRAC CCCCGGCCCCGGCCTGGGGGCCGGGGCTCCTGCATTTGCTGCTGGTCG GTAGGGG ACACCTC DGGDVSGQLLGWLR GAATGCCGATGGGGGGCCCGCCCAAAATCCGTGCCCGTATTGTGCACG CCTGCCAC TCCTTCA DVFPVKRVSTRGDQS GTCGTTTACTACCGCCAATGGCCGCGGGTTGCACATACGGAGGGCGCAT CCCTACAC TCCTSTT DLDVDGALVSRRTAR CCAGACGAGGCTAATAACGCCATAGACATCGAACGCATTCATGCCCGCT TTTTATAG AGTCCA TREYARVQELYRKEP GGTCACACGAGGAGACAGCCATGATGGCCAGGCTTGAGGCCGGTGCCA ATATGGCA TTCCGG KACLARILGDRREGA TCCAGCGAGGGGGTGTAAGGTTTATGAACCAGTTTCTGGTTCCGCGCAT TTCGATAC CTAAAA NRAPNRDPAFIDFW GCCGGGCAGGACTCTGGAAGCGGTGAAAAGTAAGCGCAGAGATGCGA CTCAAATA TGATGA RGVFSEASAEVEGW CTTACAAAGCACTGGTCCAGAGGTTTCTGCAGGCGCCGCAAATAAACCT GAGCCTCG AGACCG AEEVSDHGELARRV CCCCGAACTGCGTGATGGTGATGCGCCTCGTCAGCCTGATCCTCAACAG GACTTGGA AGGAGT WDPISVEEVGRSRVR GAAAACCCTCCCGAGCCCCCTTCTTTCGATGGGGCGATCAGGGGCGCTG GGAGCAT GTCACTC NGAAPGPDGIAVSV TCGCCGACCTTGTCGGAGGGGTCGACTGGCAGCGCCTCGGGTTTCAGG GGTTCCCC TCTTGG WNKLPPEAAALLFNV GAGACCGTCTCTGTAACATCGCGAGACGGGCTTGCGATGGGGGGGATG TCCTCCTC CGGGGT LLLGRCLPAELTRTRT TTTCGGGCCAGTTGCTGGGCTGGCTGCGGGATGTCTTCCCCGTCAAGAG GTACTAGA TAACCC VFIPKTDAPRTPADYR GGTTAGTACTAGAGGGGATCAGAGCGACCTAGATGTCGACGGCGCATT CCTGGAAC GTCCAA PISIASVVARHFHRVL GGTGAGCAGGCGCACGGCGAGGACAAGGGAGTACGCTCGGGTGCAGG CAACGGTC GTGTAA SARVQRIPDLFTKYQ AGCTGTACCGGAAGGAACCTAAGGCGTGTTTGGCTCGAATCCTTGGTGA TTGACAAC ATGTGA RGFLSGVDGIADNLS TCGTCGAGAGGGGGCCAATCGGGCTCCTAATCGCGATCCCGCATTCATC CCCATTGG CCTCGCC VLDTMLTMSRRCCK GATTTTTGGCGAGGCGTCTTTTCGGAGGCGTCCGCTGAGGTCGAGGGA ACCTACGG ATTCGG HLHLAALDVSKAFDT TGGGCGGAGGAGGTCTCCGACCATGGTGAGCTGGCACGCCGGGTTTGG GAGCGGA GCTCTG VSHFAIVRACRSIFGS GACCCCATCTCTGTCGAGGAGGTGGGAAGGTCTCGGGTCCGGAATGGT CCATGCTC ATA AETVLEEGGRRHFVQ GCAGCGCCGGGGCCCGACGGAATAGCGGTTTCCGTGTGGAACAAGCTT ATGGACAT (SEQID VRXGVRQXDPLSPLL CCGCCGGAGGCCGCTGCCCTTCTTTTCAATGTGCTGCTGTTGGGGAGAT GGATTCCG NO: FNLVLDRALKRLSTDV GTCTGCCTGCAGAACTCACCCGCACGCGAACGGTCTTTATTCCAAAGAC AAGACGA 1349) GFRLTDATKVTALAF CGACGCACCGCGAACACCTGCAGACTATCGGCCGATTTCGATCGCGTCG AGCGGGG ADDVVLCATTARGL GTAGTCGCACGGCACTTCCACCGTGTCTTATCCGCACGTGTTCAACGCAT GAACACG QTNLDVLEAELRLAG TCCCGACTTGTTCACAAAGTATCAAAGGGGCTTCTTAAGCGGCGTGGAC GACCCCCC LLLNPNKCQALSLVAS GGGATCGCAGATAATCTCTCGGTCCTTGACACAATGCTGACGATGTCCC GCCGATAA GRDHKVKLVTKPTFK GGAGATGCTGCAAGCATTTGCACTTGGCCGCGTTAGACGTGTCAAAAGC TGCTCACT VGQNTIHQVDASSI ATTTGACACTGTCTCTCACTTTGCCATAGTGCGTGCTTGCCGGTCCATATT TAACGTCA WKYLGIQFRGSGMC TGGGAGTGCGGAGACCGTGTTGGAGGAGGGCGGGAGGAGGCACTTCG GGCGAACC GCGSEGVAAGLKRIT TGCAAGTGCGGYGTGGGGTGAGGCAGKGAGACCCACTCTCTCCTCTCCT CATCGAAA CAPLKPQQRMHLLR CTTTAACCTGGTCTTGGACCGGGCTCTGAAGAGGCTGTCCACCGATGTG TCATCTTG VFFLPKFYHAWTFGR GGCTTTCGTCTAACGGATGCGACCAAAGTTACGGCACTCGCTTTTGCGG ATGTTACC LNAGVLRRLDVVVRT ACGACGTCGTTCTGTGCGCGACCACCGCTAGAGGACTCCAGACCAATCT CTTTCAAA SVRTWLRLPHDIPVG GGATGTTCTCGAAGCGGAGCTTCGCCTAGCCGGTCTCTTGTTGAACCCC GCAGGTCA YFHAPTKSGGLGIPQ AACAAGTGTCAGGCCTTATCCTTGGTGGCTTCCGGACGGGACCACAAAG TGCGGCAT LSRFIPFLRLKRFDRLG TGAAGCTCGTGACAAAACCCACCTTTAAGGTGGGACAAAACACGATTCA ATGTCTCA RSAVDYVRECAFTDI CCAAGTGGATGCGTCCAGTATTTGGAAGTACCTGGGAATCCAGTTTAGG ATGCCGGA ADRKIRWCRERLSGI GGTAGTGGGATGTGCGGATGCGGTTCGGAGGGTGTGGCTGCTGGGTT AAGGGTG VDQVAGGRDALDAY GAAGCGGATCACTTGTGCTCCGCTGAAGCCTCAGCAGAGGATGCACCT GCTCCCGG WTAQLHQSVDGRAL GCTGCGAGTCTTTTTCTTGCCCAAGTTTTATCATGCTTGGACATTTGGGA GCGTAACG RESASVASSTQWLRC GACTCAATGCGGGTGTACTGCGCCGTCTGGACGTTGTGGTCCGAACTTC GACATCTA STRAIPASDWLHYTA GGTTCGGACCTGGCTACGCCTCCCCCACGATATACCGGTGGGATACTTT TCGCTGAT VHIGALPSRVRTSRG CACGCTCCTACGAAGTCCGGGGGGCTGGGGATCCCGCAGTTGTCACGA GGAASCTA RRGGQDVSCRGGCL TTCATCCCGTTCTTGCGATTAAAGCGGTTCGACCGGTTGGGGCGTTCCG GCTCGCTC LDETPAHCIQVCHRT CGGTTGACTATGTCCGAGAGTGTGCATTTACGGACATTGCGGATCGAAA TTGAAGAA HGGRVLRHDAIAKRI GATCCGATGGTGCCGTGAGCGCCTCTCGGGCATAGTGGACCAAGTCGC CGGCGGA SADLMELGWIVTREV GGGTGGGCGGGACGCGCTTGATGCATATTGGACAGCGCAGCTTCATCA CGGGGACT SFRTTAGVFRPDMV GTCGGTGGACGGAAGAGCACTAAGGGAGTCCGCTTCGGTGGCTTCCAG TGGAATCG AVKEGVTVILDVQIVS CACGCAGTGGCTCCGCTGTAGCACACGTGCAATTCCGGCCTCTGACTGG TGGTGTGG PAPTLDEAHRRKVAK CTGCATTACACGGCTGTCCATATTGGCGCCCTACCATCGCGTGTTCGCAC TTCTGATG YRDRADLARYLAEAA ATCGCGTGGGAGGCGGGGCGGCCAGGATGTCTCTTGTCGGGGTGGAT TAAGTCCT VARGRAPPANIRFAS GTCTTCTGGACGAAACTCCGGCACACTGCATCCAGGTTTGCCACCGCAC GAAATTAT ATISWRGVWSAESV CCACGGGGGGCGGGTGTTGCGACACGACGCCATCGCCAAGCGCATTTC GGCGTGAT GSLRELGLSARHFDR TGCCGACCTGATGGAACTTGGATGGATCGTAACGAGGGAGGTTTCCTTC GGCCCGCC YTTMALCGSWRNW CGGACGACTGCGGGTGTTTTCCGGCCGGACATGGTCGCGGTAAAGGAG CGCCCGAC VRFNASTASRMGRG GGTGTTACCGTGATCTTGGACGTGCAGATTGTTTCACCGGCCCCGACAC CGGAGGG RGDASPRRHENQQ TGGATGAGGCCCATCGGCGAAAGGTTGCCAAGTATCGGGACAGGGCTG ACTTAGAA (SEQIDNO:1471) ACCTGGCCAGGTACTTGGCTGAGGCTGCGGTTGCTCGTGGGCGTGCGC CCCCCTTC CGCCTGCGAATATACGATTTGCCAGCGCGACCATCTCTTGGCGAGGCGT CGCGAGG TTGGAGTGCTGAGTCGGTGGGGTCCCTGCGCGAGCTGGGGCTCTCCGC GTCCTGTC TCGGCACTTCGATCGCTATACAACGATGGCACTCTGCGGTTCATGGAGA TGTAGGTC AATTGGGTGCGGTTCAATGCGTCCACGGCCTCTCGGATGGGTCGTGGAC CWCCATCT GCGGCGACGCGAGTCCTCGGAGGCACGAAAATCAGCAGTGATGCTCCT CCGTAAAA TTGGTTTTACCATCTGTGGGGGCATCGGTCCTCACGGTTTCCTCGGGTTT CGAGTTGG CCTATTGTTTTTCCTAAACCCGACAAGGAGCCCTTTGGCCCTCCTCCTTAA AGGAAACC ACACCTCTCCTTCATCCTSTTAGTCCATTCCGGCTAAAATGATGAAGACC GCAGACG GAGGAGTGTCACTCTCTTGGGGGGTTAACCCGTCCAAGTGTAAATGTG GGG(SEQ ACCTCGCCATTCGGGCTCTGATA(SEQIDNO:1103) IDNO: 1226) R2 R2- Megachile TCTAGTTAGCAAGCGGCCCCCTCTAATCGGGTCCCGCCACGAGTACCTTC TCTAGTTA TAAATG SGPATSTFGETKSRLC 7_MR rotundata GGGGAAACCAAGAGCCGCCTCTGCGAACCTACCTCCGCATTGGGTTGCC GCAAGCG TCGAAC EPTSALGCRPGAVVI GGCCCGGTGCGGTTGTAATTCAGTGGGCCCAAATTCACAAGGAAAAGA GCCCCCTC CGAATTT QWAQIHKEKRKRIVG GAAAAAGAATTGTCGGTTGGCCCTTAGGGCATCTCGGATCACCGACCTC TAA(SEQ TGGGTA WPLGHLGSPTSLKLR ACTGAAACTTCGACACCCGCGCCTGCAAGCGAAGCGCATAGTTCCAGTG IDNO: ACGTGC HPRLQAKRIVPVLAEL CTTGCGGAGTTGATGCAATGTTTGTGTGCTCGACATGTCAGCGGCCGTT 1227) ACCCCA MQCLCARHVSGRSP CGCCACAAAAATCGGCTTGGGCGTTCACGTTAGAAGAGCGCACGAGTG CCATCCT QKSAWAFTLEERTSA CTTGCGGAGTTGATGCAATGTTTGTGTGCTCGACATGTCAAAGGTCGTT TAATCG CGVDAMFVCSTCQR CGCCACAAAAATCGGCTTGGGCGTTCACGTTAGAAGAGCGCACGTGGA GCAGCA SFATKIGLGVHVRRA AGTCGCAAACGCTGCCATATCAGTGGAGCGCGTTAAGGATCGTTGGTC CGCAAT HVEVANAAISVERVK GGAGGAGGAGCGAAGAATAATGGCTGCCGTCGAGGTTCGAGGAGTAC AAAGCC DRWSEEERRIMAAV TCTCTGGGGCTCGTTTTATAAACGAGTACATTATGAGTCATCTCCAGACC GTGGGC EVRGVLSGARFINEYI AGCCGGACCCTCGAATCTGTCAAAGGAACTCGGAAAAACCCCAAATACA AGTGGT MSHLQTSRTLESVKG AGGAACTTGTCGCAACCTTACTGGAAGAGGCACGAACCAGTGTGCGGG TTTAGTG TRKNPKYKELVATLLE AGGAGAGTCCTCGTAGTGCGGTCAACGATAGCGCCACCCAACCATCAG GGTAGT EARTSVREESPRSAV GCCCCTCTGATACACGAAGCCTGCGCACCGAACACCTGTTCACTGAAAG CATTAG NDSATQPSGPSDTRS CACCGAGCCTTTCGAACATCGGATCCGAGAGTTGATCGGGGATCTTGAG GAGTCC LRTEHLFTESTEPFEH GGAGTGACCGACTTCAGAGCTGAGTTGCTGGTTTCCATTGCGGAACAGC CACAGT RIRELIGDLEGVTDFR AGCTTCAAGGCGATGAGGTGGCGGAATCCCTGACTCGGTGGCTTGGCG ACCCAG AELLVSIAEQQLQGD AGGTTTTTAAACCCGAGAACCAACAACAACAGGTTCAGCGTAAGCGTCG CGAACA EVAESLTRWLGEVFK GCGACAAAGAAAGGCCCCTGTCAGTGGGCAGCTGCCAAAGTGGCGCGA TCTTAGT PENQQQQVQRKRRR GAGGCGAAGGGATTATGCAGCGATGCAGACTCTCTTCCATCGGAACCCC GGGTCT QRKAPVSGQLPKWR AGTCTTGCTGCTGGTCGTGTGCTGGACGGCAAGAACGAGTCCCGGCCCC GCGTAA ERRRDYAAMQTLFH CGGATCTGCCAGAAATGACAGCGTTTTGGGAACCGATACTCACCGAGCA ACGCAT RNPSLAAGRVLDGK ATCTGCCGAACATCGGGCAGTTGGGCCGGCTTCCGAGAAATCGGAGTT TTCCACT NESRPPDLPEMTAF GTGCTCAGTATGGGGTCCAGTAGAGAAAGAAGAACTTCTCTCCTCCGTG GCCTATC WEPILTEQSAEHRAV CCGCCGTTGGACACGGCTGTTGGCCCTGACGGTGTCACCGCCAGACAAT CTCCGG GPASEKSELCSVWGP GGAGAGCGGTTTTACCGGCCGTTAGAGCTCTCTTGTATAACATCATCCTC GAAAAA VEKEELLSSVPPLDTA AAGAGAGGCTCTTTCCCCGCCTCGATGCTGGAAAGCAGAACCGTGTTCC AAAAAA VGPDGVTARQWRA TGCCGAAGAAGCAACATAGTGTCAATCCTGCTGACTTTCGACCCATCAG AAAAAA VLPAVRALLYNIILKR CATAGCGTCGGTCGTCGTCCGTCAATTACACAAGATCTTGGCCATGCGC AAAAAA GSFPASMLESRTVFL CTTCGCCGTACGAACCTCGTTGACGAAAGGCAACGATGCATGGATGAC AAAAAA PKKQHSVNPADFRPI GGATGTGCCGAGAACATCACGGTGTTGGCCTCGCTGCTGGACGATGCG AAAAAA SIASVVVRQLHKILA AGGCATGGTCTGAAGGAGCTGCATCTGGTTTCGTTGGATTGTGCTAAGG AAAAAA MRLRRTNLVDERQR CGTTCGACAGCGTTAGTCACCATGCGATCGACGCAACACTAAAAGAATG AAAA CMDDGCAENITVLAS CGGTCTGCCCGCGGGATTTGTGCAATACATCTCCCGCACATATTCCGACA (SEQID LLDDARHGLKELHLV GTTCCACCAGGCTCGAGGTCGGAAGGAATCGGTCGGAGCCAATAAAGA NO: SLDCAKAFDSVSHHA CCAACCGGGGAGTGCGGCAGGGTGATCCCCTGTCAACCCTCATCTTCTG 1350) IDATLKECGLPAGFV TCTGTGCTTCGATAGAGTCGCGCGTACCCTCTCACCACACATTGGGTATG QYISRTYSDSSTRLEV ACCTTAATAACACCCGCATCAGTACTCTGCTTTACGCTGATGATGCTTTC GRNRSEPIKTNRGVR CTTGTCTCGACGACTGCTCCGGGGATGAACATCCTGCTCCGGAGTGTGG QGDPLSTLIFCLCFDR AGGAAAGTGCCGGTGAGGTGGGGCTTTCTTTTAACACCTCGAAATGCTC VARTLSPHIGYDLNN AGCCCTCTCCTTAATTCCATCTGGAAAGGAAAAGAAAATGAAGGTCGGA TRISTLLYADDAFLVS ACGACTCCGACGTTCAAAACGTCGCAGGGTTTTATTACCCAGATAACGC TTAPGMNILLRSVEE CCTCGCAAGAGTGGAGGTATCTAGGAGTCGACTTCCAATACTCCGGACC SAGEVGLSFNTSKCS GAAAAAAGCAAGTCGGAGTCTCAAAATCGAGCTGGAACGCATTTCGAA ALSLIPSGKEKKMKV GGCCCCCCTCAAGCCGCAACAGCGGCTTTTAATCCTCAGAGTCTACCTGC GTTPTFKTSQGFITQI TGCCGCGATATTACCATCATCTCGTACTCTCCAGGACAACCCTTGGTCAC TPSQEWRYLGVDFQ CTTAGGGGCCTGGACCTGCAAGTGCGCGCGGCTGTTCGTAGGTGGCTG YSGPKKASRSLKIELE TCTCTGCCGCGCGACATCCCCATTGCCTACTTCCACACAACAGCGAAGG RISKAPLKPQQRLLILR AAGGGGGTCTGGGTCTTCCGGCGTTTGAGACGTCCATTCCATGCCTGAT VYLLPRYYHHLVLSRT GTTAGCCCGCTTGAGGTCGATGGAAACCTCGACTTGCAAGGCGGCTAG TLGHLRGLDLQVRAA GGCGGCGGTTCAAGGCTTCTGGGTGCAGAAACGCATCCATTGGGCTAC VRRWLSLPRDIPIAYF GGCCGCTCTCACTAAGAACGGAGAAGCCCTAACATGCAAGGCGGACGT HTTAKEGGLGLPAFE CGATCGGTGGTGGGCCAGTAGGCTCCACAAGTCAGTGGATGGTCGCGA TSIPCLMLARLRSME ACTGCGTGAGTGCTCAGGCGTTGGCTCGTCCAGCACCTGGGTAAATTCT TSTCKAARAAVQGF GCGCTTAACATCACCGGCAGAGATTACGTGCAGTATCATCACGTCCGCA WVQKRIHWATAALT TCAATTCTCTCCCGACTCGCATTAGGACATCTCGGGGTGTGCGAAGAGA KNGEALTCKADVDR GGGGATGGAGGTCACGTGCCGCGCTGGGTGTCAGGTAACTGAGACCGC WWASRLHKSVDGRE GGCGCATGTCATCCAGTCTTGCCACCGCACACATGGAGGTCGGATTCTT LRECSGVGSSSTWVN CGACACAATGCCGTGTGCAAAGTGCTGGCGTCCGGCCTTCGCGACAAG SALNITGRDYVQYHH GGATGGGAGGTGCGTGAAGAGCCAAAACTGCGAACGCGACAGGGTCT VRINSLPTRIRTSRGV GAGGAAGCCGGACATAGTGGCGATAAAGGACGGCGTCGCAAGGGTGA RREGMEVTCRAGCQ TAGACGCACAGGTCGTGAGTGGATCTGGACCGCTTGACGAGGCGCATG VTETAAHVIQSCHRT AGACCAAGCGCAAGTATTATTCCGACAATGGCGACGTAACTGCTGCGAT HGGRILRHNAVCKVL TGCTCGTGAGTGCAACATTGCTCCATCGAATGTTGCATACTCGTCATGCA ASGLRDKGWEVREE CTATCTCTTGGAGGGGTGTGTGGTCGCCTCGGTCCGCTGCAGACCTACT PKLRTRQGLRKPDIV ACAGGTGGGTCTGTCCAAAAAGCTCTTGGGTTTCATCACCCTAAGGGTC AIKDGVARVIDAQVV TTGAGAGGCAGTCATCTAAATTGGACACGGTGGAACAAAATGACAACG SGSGPLDEAHETKRK ATGCGCGTACACCATCAGCGAACAGGTATCGGGTAAATGTCGAACCGA YYSDNGDVTAAIARE ATTTTGGGTAACGTGCACCCCACCATCCTTAATCGGCAGCACGCAATAA CNIAPSNVAYSSCTIS AGCCGTGGGCAGTGGTTTTAGTGGGTAGTCATTAGGAGTCCCACAGTAC WRGVWSPRSAADLL CCAGCGAACATCTTAGTGGGTCTGCGTAAACGCATTTCCACTGCCTATCC QVGLSKKLLGFITLRV TCCGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA LRGSHLNWTRWNK AAAA(SEQIDNO:1104) MTTMRVHHQRTGIG (SEQIDNO:1472) R2 R2Amel Apis TGGTAATCAAATGCCTCGCTATTTTAGTAGCGGTAGCGCTCCGCCCGCG TGGTAATC TGATCG MSSNEEGASDTGAP mellifera CAGGAACCATTGACGCCGCCGTAGTGTGGGTGATTTTATATCCAACCAA AAATGCCT TTAAAA GPGVPVADVSAADG TCACGTCAACTACGATCATTTGTAATCACCGACGGTACTTGGTAGGGGT CGCTATTT GTAAAA RATYDDHGMSTDYE ACCACATGGGCATTCTTGCTCATTCCACAACGCCGCCTCCATCATGGCAA TAGTAGCG ATCTATT KQTIELPLNGQIQCL CAATTTAAAATATATATAAATTCTTAAGGTTTGACCGTATTCATATATATA GTAGCGCT TATTTAT WCHIEGRNQRFLQE TATATATATTTAATATTACAACCATAAATCTTATATCGAGCCTTCTATTTG CCGCCCGC TTTTATT SQYLKHKDTQHPKGE GTCTCAAAAGCAATACGTTGTCAGATCTTGTAGAACATCAGGAGTGAGC GCAGGAA CCTATAT IIWRCAACQKEFEKL GGTGCGCTGTGGTATCCGTGCTTTGTGCCGCGGCGACAAACCAATACGC CCATTGAC TATAAC HGCRCHLPKCKGRKE TGCTGCCTGTCGCAAAGCAATACGCTGCTGATTCTCGGATGCGGGTGTC GCCGCCGT ACATTAT AKGVAKFKCDSCEES GACGGTCACGCAAAGCGATACGCTGGTGGGGTTTCAAAACAATACGGC AGTGTGG TTATTTA FLTQRGLSMHELHRH GCTGGTGCTAAAAAGCATTATGCCGCTAACGGCTGGATTGTCGATCGCC GTGATTTT TTTACTT PAIRNLKRTQGTSRG GCTGCGGGGGCTAGTGGCGCACCCAGAGAGGTGCGACGCGCAAGCAT ATATCCAA ATTGTTT NTRPINRASVWSKEE TGGTTCTGTGCGAAGCGGAGTTCTTGAGAGTAATGGTTGCTGGGGGCA CCAATCAC TAAAGA TDLLIKLNERYKHLKQ CAAAGCGCAACATATAGCCTCTTATGCCTCAAGTCGTAGTTCGTACCTCC GTCAACTA TGACGA PNVALKEYFPDKTLK ACGTGGTCCCGCTGGAATGCCTATCGACTCCTCCCCGGAGGATCATAGA CGATCATT AGCCGC QISDKRRLLPVQEPE GTTCGAAACCGGCTACGGCGAGGCAAGGGCGGTGAGGTGCACACCGA TGTAATCA AAGGCC DVATTDETGPPPSDS TGGGGAGCAGCGACCCCACCTACCCTTAGCTAAGAGAGCAGGCGATCC CCGACGGT AATCCA SEESIYESATEDEGGG GCCAACTGTCAGCACGAAATAAACTAATCATATGTATACGAGGGAGAAT ACTTGGTA AATTTAA DMQQTAPNDSWKE TTACAACGGGTACCTTGTGCCCGAACCGCCTGTAGGTATCACCTACAGG GGGGTACC CAAAAG PFIQSIRTNHLEEEDSL TGTTAAAATGAATCTGATAGCTGGCGGATCGTCGACCCTCTTTGATGGC ACATGGGC AACGAG RKVEEAIERMAMNE TCTGCGCCAACGACTGGAAAGAATAGGAACGGAAGTCTAATGGAAGGA ATTCTTGC ACTACT GVTEQEVGTLLEQFV AAGTGTCGGGAGCACTATAAATTCCCAAAGAAGAAAAGAAAAGAAAAA TCATTCCA GGTCGA DSLTQSPTTERKGSR AAATAAAAAACCCAAATTAAATGTCGAGTAACGAAGAAGGAGCCTCGG CAACGCCG CATTAA RKSQKTTKRKTTHNN ACACGGGGGCCCCCGGCCCTGGGGTCCCCGTGGCGGACGTGTCCGCTG CCTCCATC AAAGAC RKKFLYAKHQELYKKS CGGATGGAAGGGCCACTTATGACGACCATGGCATGTCCACGGACTACG ATGGCAAC GAAGCA PRRLLELALSGESSSG AGAAACAAACCATCGAACTGCCCCTGAACGGGCAGATACAATGCTTATG AATTTAAA GCTGCC REVVNLPEADSVGPL GTGCCACATTGAAGGAAGGAACCAAAGATTCCTCCAGGAAAGCCAATA ATATATAT AGCTGA YKSLWGQIGPEKTHR CTTAAAACACAAAGACACGCAACACCCTAAAGGCGAAATAATATGGAG AAATTCTT TAAACA NQPMCNNIDMSEI GTGCGCTGCATGCCAGAAAGAATTCGAGAAACTCCATGGCTGCAGGTG AAGGTTTG ACAGAG WTPIALESLVEKFKKI TCACTTACCCAAGTGTAAGGGACGCAAAGAAGCCAAGGGTGTTGCCAA ACCGTATT CCCGTCT KSDTAAGADQIKKFH ATTTAAATGCGACTCCTGCGAGGAATCGTTCCTAACGCAAAGAGGATTG CATATATA CGGCCT LRKKGALHVFAKLCN TCTATGCACGAGCTACATAGACATCCAGCGATTAGGAACTTGAAAAGAA TATATATA TTACACC LLMLHRIYPAQWKT CACAAGGCACCAGTCGGGGAAATACCAGACCAATCAACAGAGCCTCGG TATTTAAT GAGCGG NRTTLIPKPGKSAEEV TCTGGTCCAAAGAGGAAACGGACCTCTTGATAAAACTTAACGAGCGCTA ATTACAAC TGCAAG ENWRPITIGSLLGRIY CAAACACTTAAAACAGCCGAACGTAGCGCTCAAAGAATATTTCCCCGAC CATAAATC TCCTGAC SAMIDRKLRSKIKQHI AAGACACTAAAACAAATCAGCGACAAAAGAAGGCTCTTGCCCGTTCAG TTATATCG GTACTAT RQKGFTQEDGCKNN GAACCCGAAGACGTGGCCACAACTGATGAAACGGGACCTCCTCCTTCCG AGCCTTCT TGTACG IAILSSALTKMKEDSG ACTCATCGGAAGAGAGCATATACGAATCGGCCACGGAGGACGAAGGA ATTTGGTC TCTAGG GIITIIDISKAFDTVPH GGAGGAGATATGCAACAAACGGCTCCAAACGATAGCTGGAAGGAGCC TCAAAAGC GCGCGG GEISQSLMNKGVPSP GTTTATACAAAGTATAAGAACAAACCACCTCGAAGAGGAAGACTCCCTT AATACGTT GGCAGA ICEYIQKMYIGCKTIIY CGAAAGGTGGAAGAAGCCATCGAAAGAATGGCTATGAATGAAGGGGT GTCAGATC TTCTACC CRDKKTLPVDILRGVK AACTGAACAAGAGGTGGGCACCCTTCTTGAACAATTTGTCGACTCCCTA TTGTAGAA GTGTAG QGDPLSPLLFNLIIDPI ACTCAATCCCCAACAACGGAAAGAAAGGGGAGCCGACGTAAGAGTCAA CATCAGGA AATCTG IGTLDETTEGIKLENE AAGACTACAAAAAGAAAGACCACCCATAACAATAGAAAAAAGTTCTTAT GTGAGCG GGGCGA NISVLAFADDLVLLAK ATGCCAAACACCAGGAGCTCTATAAAAAGAGCCCACGAAGGCTTCTGG GTGCGCTG CGCCTCC DKETADKQNRLINEY AGTTGGCGTTATCGGGTGAGTCTAGCAGTGGCAGAGAAGTGGTTAATC TGGTATCC GCGAGG LDDLKMKVSAEKCTT TCCCTGAGGCCGACTCAGTGGGTCCACTATATAAAAGTCTATGGGGCCA GTGCTTTG CACTCCC FEIKRQNKTWFLGDP AATAGGCCCGGAAAAAACTCACAGAAACCAACCTATGTGCAACAATATC TGCCGCGG TGGACA QLTLGQQRIPYADPE GATATGAGCGAAATTTGGACTCCAATCGCCCTGGAGAGCCTTGTCGAAA CGACAAAC ACGTAC AAIKYLGTNFNPWRG AATTCAAAAAGATAAAGTCCGACACCGCAGCCGGCGCGGACCAGATAA CAATACGC GCTAAA LCKTSIKEIIDAARTVK AGAAATTCCACCTGAGAAAGAAAGGGGCACTACACGTATTCGCCAAACT TGCTGCCT GCGTAC QLKLKPHQKINLIRTY GTGTAACCTCCTCATGCTGCACCGAATATACCCAGCACAGTGGAAAACC GTCGCAAA GGCTAA LLPRYIHKLVANPPPL AACCGAACCACGCTTATTCCCAAACCGGGGAAGAGCGCGGAAGAGGTT GCAATACG GTGCGC GTLDLIDKELKTIIKEIL GAGAACTGGAGACCAATCACCATCGGGTCTCTGCTGGGAAGAATATATT CTGCTGAT CTCCCG HLHPSTTDGLIYTDKS CGGCTATGATCGACCGTAAATTACGGTCGAAAATAAAGCAGCACATAAG TCTCGGAT AAAGGG HGGLGIQRVANIVKL ACAGAAGGGGTTTACACAGGAGGATGGCTGTAAAAATAATATAGCCAT GCGGGTG TCCCCGT AKLKHSILMTRSEDN TCTCAGTAGTGCCTTAACCAAAATGAAAGAGGACTCAGGTGGAATCATA TCGACGGT TCCTAAT AVKIALNGQEGMVK ACCATAATAGACATTTCCAAAGCCTTCGACACGGTTCCCCACGGCGAAA CACGCAAA TTTTCCG RYATSIGLQWPCGIE TAAGCCAAAGTCTGATGAACAAAGGAGTCCCATCGCCCATATGCGAATA GCGATACG AGCCCG EIEETRKKLKRADTNK CATTCAAAAAATGTACATAGGTTGTAAAACTATTATATATTGCAGAGACA CTGGTGG CGGGCA WKTLISQGQGIKEFF AGAAAACACTGCCAGTGGACATACTGAGAGGAGTCAAACAGGGAGACC GGTTTCAA GATCTC GDKTGNAWLYNPE CGCTATCGCCACTGCTTTTCAACTTGATAATAGATCCCATAATAGGGACA AACAATAC GTGGCA MLRPSRYLDALKLRT CTGGACGAGACGACGGAGGGCATTAAATTAGAAAACGAGAACATTTCA GGCGCTG GTGACG NTYGTKAALHRAKRD GTTCTCGCCTTCGCCGACGACCTTGTCCTTTTGGCGAAAGACAAAGAAA GTGCTAAA CTAGAA IDINCRRCGVQVETL CAGCCGATAAGCAAAATCGGCTCATCAATGAATATCTGGACGACCTGAA AAGCATTA AGTTAA GHILGLCTHTKNKRIK AATGAAAGTATCCGCCGAAAAATGCACAACCTTCGAAATCAAACGGCAG TGCCGCTA GTCCGC RHDEICDLIAKNVSKE AACAAAACGTGGTTCCTAGGAGACCCACAGTTGACGTTGGGTCAGCAAC ACGGCTGG GGACAT YVIFREPEVEVNGDR GTATCCCGTATGCCGACCCAGAAGCAGCAATCAAATACCTAGGAACCAA ATTGTCGA ATAAAA RKPDMVIKDHDKVY CTTCAATCCATGGAGAGGGTTGTGCAAAACCTCGATAAAAGAAATCATC TCGCCGCT TTACAG VVDVTVRYENNDSL GATGCGGCTAGAACTGTCAAACAGCTGAAACTTAAGCCGCATCAAAAAA GCGGGGG CCTTAAA NKAYKEKENKYKETA TCAACCTTATAAGAACCTACCTCTTGCCAAGATACATACATAAATTGGTG CTAGTGGC TAATGA EIMRRDLKAKESRVL GCAAATCCTCCCCCTCTGGGGACTCTAGACCTAATCGATAAAGAGCTCA GCACCCAG ACCCCA PVVIGSRGAVPRATIE AAACTATAATAAAGGAAATATTGCACCTCCATCCGTCCACCACGGACGG AGAGGTG CGAAGG NLKVLGLQTKHALTA ACTAATATACACTGATAAGAGCCATGGCGGTCTAGGGATCCAGCGGGT CGACGCGC AGGTAT SLIALRSSIEMANEFL GGCGAACATAGTCAAGCTGGCCAAACTAAAACATAGTATACTAATGACA AAGCATTG CCTCGA DYDHTT(SEQID AGGTCAGAGGATAATGCCGTCAAGATAGCACTTAACGGGCAAGAGGGA GTTCTGTG AATTCC NO:1473) ATGGTGAAAAGATACGCCACGTCCATAGGCCTACAATGGCCATGTGGA CGAAGCG GCCACG ATAGAAGAAATCGAGGAAACGCGTAAAAAACTCAAGAGGGCGGATAC GAGTTCTT ATCCTTC AAACAAATGGAAAACTTTAATTTCGCAAGGACAAGGCATAAAAGAATTT GAGAGTA TGATCG TTCGGGGATAAAACCGGGAATGCCTGGTTATACAACCCCGAAATGCTGA ATGGTTGC TAGGCG GACCGTCTCGATACCTGGACGCACTAAAACTGAGAACAAATACATATGG TGGGGGC CAAAAC CACAAAAGCAGCACTCCACAGAGCGAAAAGAGACATAGACATAAACTG ACAAAGCG A(SEQ TCGGAGATGCGGCGTCCAGGTGGAAACCCTAGGACATATATTGGGACT CAACATAT IDNO: ATGCACCCATACAAAGAACAAAAGAATAAAAAGACACGACGAAATCTG AGCCTCTT 1351) CGATCTTATCGCAAAGAATGTCTCTAAAGAATACGTGATATTTAGGGAA ATGCCTCA CCAGAAGTAGAGGTAAACGGTGACAGACGTAAACCAGATATGGTCATA AGTCGTAG AAAGACCATGACAAGGTATACGTCGTGGACGTCACCGTAAGATACGAA TTCGTACC AACAATGATTCCCTAAACAAGGCCTACAAAGAAAAAGAAAACAAATACA TCCACGTG AAGAGACAGCGGAAATTATGAGAAGAGACTTGAAAGCAAAAGAGAGC GTCCCGCT AGAGTTCTGCCAGTGGTTATCGGGAGCAGAGGGGCAGTGCCCCGAGCC GGAATGCC ACTATAGAAAATCTAAAAGTCCTAGGGCTTCAAACAAAACATGCCCTGA TATCGACT CGGCTTCGCTCATAGCCCTCCGATCGTCGATCGAAATGGCAAATGAATT CCTCCCCG CCTGGACTACGATCACACTACGTGATCGTTAAAAGTAAAAATCTATTTAT GAGGATCA TTATTTTTATTCCTATATTATAACACATTATTTATTTATTTACTTATTGTTTT TAGAGTTC AAAGATGACGAAGCCGCAAGGCCAATCCAAATTTAACAAAAGAACGAG GAAACCG ACTACTGGTCGACATTAAAAAGACGAAGCAGCTGCCAGCTGATAAACAA GCTACGGC CAGAGCCCGTCTCGGCCTTTACACCGAGCGGTGCAAGTCCTGACGTACT GAGGCAA ATTGTACGTCTAGGGCGCGGGGCAGATTCTACCGTGTAGAATCTGGGG GGGCGGT CGACGCCTCCGCGAGGCACTCCCTGGACAACGTACGCTAAAGCGTACG GAGGTGC GCTAAGTGCGCCTCCCGAAAGGGTCCCCGTTCCTAATTTTTCCGAGCCCG ACACCGAT CGGGCAGATCTCGTGGCAGTGACGCTAGAAAGTTAAGTCCGCGGACAT GGGGAGC ATAAAATTACAGCCTTAAATAATGAACCCCACGAAGGAGGTATCCTCGA AGCGACCC AATTCCGCCACGATCCTTCTGATCGTAGGCGCAAAACA(SEQIDNO: CACCTACC 1105) CTTAGCTA AGAGAGC AGGCGATC CGCCAACT GTCAGCAC GAAATAAA CTAATCAT ATGTATAC GAGGGAG AATTTACA ACGGGTAC CTTGTGCC CGAACCGC CTGTAGGT ATCACCTA CAGGTGTT AAAATGAA TCTGATAG CTGGCGGA TCGTCGAC CCTCTTTG ATGGCTCT GCGCCAAC GACTGGAA AGAATAG GAACGGA AGTCTAAT GGAAGGA AAGTGTCG GGAGCACT ATAAATTC CCAAAGAA GAAAAGA AAAGAAA AAAAATAA AAAACCCA AATTAA (SEQID NO:1228) R2 R2B_NVi Nasonia GACTAGACTATGGGTTCAGTCAGTCCCAAATAGCCGATCCTGGCGCGTC GACTAGAC TGACCT TFAPTHPMVRSGPC vitripennis CGGCAGTAATGCCACGTATGAGTCGGTTACCCATCTCTAAACGCGTAGA TATGGGTT GAACAA RKTKRPGSDYRESLI GGTGGGGAGCTAAAGGCCAGGCGGTTTACCCGACGTCGAATTTCTCCA CAGTCAGT AACGTG MDSGNNVASEPRGA GGTCTGTGTCAGTCGACGGAATAAAGGTACTACAACATCTACTATCTAT CCCAAATA TTGTCTT VDVTSAAPIGAELNA CGGGATCGGAAGACGCCTTACAGCGTTTTCCGATTTTTGCTCTTTGAGCA GCCGATCC GTCTTGT EPCEGRNQRREAALS TTTTTCTTCAAATTGCGATAACCGACCCGATCACGCGGGGCTTTGACAAA TGGCGCGT CTAAAA AQTRRRNXARRARN GCAATGCGTGGTCGGTAAGATGGTTGCAATCTTTTCCACCTCGTTTCTTT CCGGCAGT CTATTTA AQQADEPGDDEEIET TACGGAACGAAAGCAATGCGTGTGGGGAACGTTAAAAACTCCCTTCATG AATGCCAC TTCGAA HGPLTIRTXEPMEIVA CATCCCAGGATTTATCCTGCTTACTGCAAAGCAATGCGTGTGGAGCGAC GTATGAGT ATAAGG IAKNPQACPKCLQGG TTTACCACGAGTCGCTCCACCGCAAAGCAATGCGTATCGCGCAAAAGCA CGGTTACC GGAGGC TQLLCMGSWELSRHI ATGCGTGTGGGGGACTTGTCAAAGATCCCCCGCCGCAAAGCAATGCGT CATCTCTA TAACTG NKEHPSVDVTWVCG GTCGGCACCACGTAGAGCAAAGCGTGTAGGCAGACTTTGTCAAAAGTA AACGCGTA CCTGCA ACQRRCTTLRSWSC GTTCTGCCGCAAAGCAATGCGTGTGGAGATCTTCGCCGGTGAAAGCAAT GAGGTGG AGTTGA HVLHCKGRQEPKDLP ACGTGTGGGCGAACTTAGACCTTCGCGCCCACACATCCCATGGTGAGGA GGAGCTAA ACGCGA FKCEHCSLSFDSQIGL GCGGCCCATGTCGAAAGACTAAGCGCCCCGGTAGTGACTACCGTGAAA AGGCCAG AAGTTA SQHERHVHPEVRND GTCTAATAATGGACAGCGGAAATAACGTTGCCTCGGAGCCGAGGGGAG GCGGTTTA GACCTTC KRAAEANKPKGKSGR CTGTGGATGTGACCTCAGCAGCTCCAATCGGGGCGGAGTTAAACGCCG CCCGACGT CCACCTA RPSIWSDEDLLLIRELE AACCTTGCGAAGGTCGCAACCAAAGGAGGGAGGCTGCCTTAAGTGCTC CGAATTTC AAGCCC SEYHGARNINEKIAE AAACACGCCGGCGAAATNCGGCCCGCCGAGCTCGGAATGCCCAACAGG TCCAGGTC AAAAGT HFPDRTGRQVSDAR CTGACGAGCCCGGCGATGATGAGGAAATAGAGACACACGGGCCTCTAA TGTGTCAG GATCGG RRKDYAALRGRGGP CTATCCGGACGNCGGAGCCGATGGAGATTGTCGCAATAGCGAAAAACC TCGACGGA GGAATG QGPAEGVEAIEEVDE CACAGGCCTGTCCCAAATGCCTGCAGGGAGGTACCCAACTTCTCTGCAT ATAAAGGT AATCCG GEIPEGEELVATDGA GGGCAGCTGGGAACTAAGCAGGCACATTAATAAAGAACATCCGTCAGT ACTACAAC CGGGTG ALESGPPENGGSAPA CGACGTGACCTGGGTGTGCGGTGCTTGTCAAAGGCGCTGCACAACGCT ATCTACTA ACCCCA EQVNAPALESSSQQ CAGGTCGTGGAGCTGTCATGTTCTNCACTGTAAGGGGCGACAAGAACC TCTATCGG GAGTTG DRECSPAVGSDEQIE AAAGGATCTGCCGTTCAAATGTGAGCATTGCAGCTTGTCGTTTGACTCG GATCGGAA GGTAAA DSSDDDEFSDALGEIS CAAATCGGACTCTCTCAGCACGAGAGGCATGTCCATCCAGAGGTGCGA GACGCCTT CCCTTGA LPEPLSVERTTISPPPR AACGATAAGCGCGCGGCAGAGGCCAATAAGCCAAAGGGCAAGAGTGG ACAGCGTT AACGTT DDWKGPMRWEICN CCGTAGGCCTTCTATATGGTCCGACGAAGACTTGCTGCTCATCCGGGAA TTCCGATT GGAGAA ASEEAGSYANWVTG TTAGAGAGCGAATACCACGGAGCTCGAAATATCAATGAAAAAATAGCT TTTGCTCTT GCGGAA LQELVRNNALSEIGLD GAACATTTCCCGGATAGAACAGGCAGACAGGTGTCGGACGCCCGGAGG TGAGCATT GAGAGT SLYDQLIQIMRHPSD CGTAAGGATTATGCCGCGCTCCGAGGGAGAGGAGGCCCGCAAGGCCCA TTTCTTCA CCCGCC DNEQDRLQLNARGP GCAGAAGGAGTCGAGGCCATCGAAGAGGTAGACGAAGGCGAAATCCC AATTGCGA ACCGAG PRRGHRKNRRRRRLT TGAGGGGGAAGAGCTGGTCGCCACCGATGGCGCTGCGTTGGAAAGCG TAACCGAC CATCGA AADRKRFAFARCQDL GCCCCCCGGAGAATGGAGGGAGCGCACCCGCAGAACAAGTCAATGCGC CCGATCAC GTGCTG WNNNPKKLAELVIA CCGCGCTGGAAAGCAGTTCTCAGCAAGATCGAGAGTGCAGCCCGGCAG GCGGGGC CGGCGC NDLSILQRRQAPGRT TGGGGTCCGATGAACAAATCGAGGACAGCAGTGACGACGACGAATTCA TTTGACAA CCGAAT ETQTLYNELWGRVG GCGACGCATTAGGAGAAATATCACTCCCAGAACCTCTCTCGGTTGAACG AGCAATGC GAAACC PNIEAPRRTEDPIPVS CACAACAATCTCACCACCTCCTCGAGATGACTGGAAAGGCCCAATGAGG GTGGTCG GATCGC RIFTPITPQEIMGRIRR TGGGAGATTTGCAATGCGAGCGAAGAAGCCGGAAGTTACGCGAACTGG GTAAGATG GGATGG IKNDSAAGPDGVTKD GTGACTGGACTGCAGGAGCTAGTCAGGAACAATGCGCTGAGTGAAATA GTTGCAAT TGCAAG DLRGRGVSIALSKLFN GGACTAGACTCCCTGTATGACCAGCTCATCCAGATTATGCGGCACCCTTC CTTTTCCA TCGTAG SILLAGYYPKAWREN CGATGACAACGAACAGGATCGCCTTCAATTGAACGCTAGAGGCCCCCCA CCTCGTTT GACGGG RTTLLPKPEKDPADV CGAAGGGGCCACCGCAAGAACCGACGGCGCCGTCGTCTCACGGCCGCT CTTTTACG GCACGA KNWRPITISSMVSRV GATCGAAAGCGGTTTGCCTTTGCCAGGTGCCAAGATCTTTGGAACAACA GAACGAA CCTAAG YSGLLDQRVRAVIKQ ACCCAAAGAAGCTAGCCGAGTTAGTGATTGCCAATGACCTGTCCATTCT AGCAATGC CCTCTGT CDRQKGFTEENGCFS CCAAAGGCGCCAAGCGCCAGGTAGAACGGAGACACAGACTCTGTACAA GTGTGGG CACGGC NIQLLDDAVSNAKKA CGAGCTGTGGGGGAGGGTCGGACCTAATATCGAAGCGCCAAGGCGCA GAACGTTA GGCGAA GGVITILDVSKAFDTV CCGAAGACCCGATACCCGTATCGAGGATCTTCACTCCGATCACTCCCCAA AAAACTCC GCCAGG PHAVIQGCLEKKGIPE GAGATAATGGGCAGAATCAGGCGAATCAAAAACGACTCGGCAGCGGGT CTTCATGC AATCAC TVAAYISSMYRDCST CCTGACGGGGTAACGAAGGACGACCTGAGAGGAAGAGGAGTCAGCAT ATCCCAGG CATGCA AIRTRSGDVKIGMKR AGCCCTCTCCAAGCTGTTCAACTCGATCCTGCTAGCGGGTTACTACCCAA ATTTATCC AAGGTG GVKQGDPLSPLIFNL AGGCATGGAGAGAGAACAGAACAACCCTTCTGCCGAAGCCAGAAAAAG TGCTTACT TGAACT VLEPLLERLQETSGVE ATCCTGCTGACGTTAAGAACTGGCGGCCCATTACCATCAGCTCAATGGT GCAAAGCA GGGGCG IEGMNLSCAAFADDI TAGTCGAGTCTACTCAGGCTTGCTTGACCAGCGAGTGAGGGCCGTCATT ATGCGTGT GATACC VCFANTAPEAGRQLR AAGCAGTGTGATCGGCAGAAAGGATTCACGGAGGAAAATGGCTGTTTC GGAGCGA TCCACG MVADYLGRLDMSLS AGCAACATACAGTTGTTGGATGACGCCGTATCGAACGCAAAGAAAGCG CTTTACCA GGGTTT VSKCIAVEYVPHRKT GGCGGTGTCATTACTATCTTGGATGTTTCGAAAGCATTCGACACTGTCCC CGAGTCGC CCCTGG WYTKNPGLEVNGNA GCATGCCGTGATCCAAGGGTGCTTGGAGAAAAAAGGAATCCCCGAAAC TCCACCGC GCATCG VPSISPSETFKYLGAK CGTGGCCGCCTATATCTCGAGCATGTATCGCGACTGCTCCACTGCAATCC AAAGCAAT CGCGAG VSPWKGLLEGFESDA GAACGAGGAGCGGGGACGTAAAGATTGGAATGAAGAGAGGAGTCAAG GCGTATCG CGATGG FREVISRVQRLPLKP CAGGGGGATCCCCTGTCACCTCTCATTTTCAATCTGGTTCTCGAACCTCT CGCAAAAG CCAAAG MQKVDLLQMYIFPR ATTAGAACGATTGCAAGAGACAAGTGGAGTGGAAATCGAAGGCATGAA CAATGCGT TCCGCTT YTYGLITSPPAKAVLK TCTCTCGTGCGCGGCTTTCGCAGACGACATAGTATGTTTTGCGAATACA GTGGGGG TCTCAGC TIDRIIRTRIKEILHLPE GCCCCCGAGGCGGGAAGGCAGCTACGGATGGTGGCGGATTATCTGGG ACTTGTCA TACAAA SVSSSFLYTPRKQGGL CCGACTCGATATGAGTCTTTCAGTGTCAAAGTGTATAGCTGTAGAGTAT AAGATCCC ACAAAA GLLEVEKMVLIAALR GTCCCCCACAGGAAGACCTGGTACACTAAAAACCCAGGCCTCGAGGTG CCGCCGCA ATGGTA NGLRARQSHDPVTR AACGGTAATGCCGTTCCGAGCATCTCACCTAGTGAGACGTTCAAGTACC AAGCAATG TGAGAC AAMNSNAADDRLKS TCGGGGCAAAGGTCTCTCCCTGGAAGGGGCTGCTCGAAGGCTTCGAAT CGTGTCGG TTCGTTA YADALRLHWPLTTKE CTGACGCGTTCAGGGAAGTCATATCCCGCGTCCAAAGACTGCCGTTGAA CACCACGT ACACTA LDTYKYQLRLSYAQK GCCCATGCAAAAGGTGGACCTTCTACAGATGTATATCTTTCCGAGGTAC AGAGCAA ATTTTTC WAEQKWQGQGVEE ACCTATGGGTTGATAACATCGCCTCCGGCGAAGGCAGTCTTAAAGACTA AGCGTGTA CGAGCC FAQDPVGNSWLQRY TCGACCGGATCATAAGAACGAGAATCAAGGAGATCCTCCACCTGCCAGA GGCAGACT TAGCAG DLLPASRYIDAIKLRT ATCGGTAAGTAGCAGTTTTCTCTACACGCCGAGGAAGCAGGGTGGATT TTGTCAAA GCTCCCT NTYPTRALMKIIDGR GGGGCTCCTTGAAGTGGAGAAGATGGTGCTGATAGCCGCTCTTCGGAA AGTAGTTC TGACAA VDSSCRKCQGSSETL CGGCTTGAGAGCCCGTCAATCCCACGATCCGGTCACACGCGCGGCCATG TGCCGCAA CGCTTAT GHILGRCRYTKDKRIS AACTCGAACGCAGCCGACGATCGGCTAAAATCTTACGCCGATGCTCTAA AGCAATGC GAATCT RHNEIKDLLKARLAK GACTACACTGGCCACTAACAACCAAGGAGCTAGATACTTATAAGTATCA GTGTGGA GGAAAA NHQVMDEPQITVRG GCTTCGCCTGAGCTATGCCCAGAAATGGGCTGAGCAAAAATGGCAAGG GATCTTCG GGACAC QRFKPDLVVKTNEGR CCAGGGGGTCGAGGAGTTCGCACAAGATCCCGTCGGCAACTCATGGCT CCGGTGAA AAAGTG VHVIDVTVRYEHRTY GCAGCGCTATGATCTTCTGCCCGCGTCAAGGTACATCGATGCTATCAAG AGCAATAC GAAAAA LDEGRTEKIGKYRQIL CTACGAACAAACACGTACCCGACGCGAGCACTAATGAAGATCATAGATG GTGTGGG GCGCTG STLRRDLHSNAEEVIP GACGTGTTGATAGCTCTTGCCGAAAGTGCCAAGGCAGTAGCGAGACCC CGAACTTA ATGGTG IVIGSRGAIPRETRKAL TTGGTCATATACTTGGCAGATGCCGGTATACTAAGGATAAGCGAATAAG G(SEQID GACAAA SKLGIGKSDWLTISLIA CCGGCACAATGAAATCAAAGACCTCCTCAAGGCTCGTCTAGCCAAAAAC NO:1229) AGTCAG LRSSLEIVNAFMDD CATCAGGTTATGGACGAGCCGCAGATAACGGTCCGAGGCCAGAGGTTT TTGAGA (SEQIDNO:1474) AAACCCGATCTCGTCGTGAAAACGAATGAGGGAAGGGTGCACGTAATC CTTGATA GACGTAACTGTCCGCTACGAGCACAGAACCTATCTGGATGAGGGCCGT TCAGTT ACTGAGAAAATTGGCAAATATCGCCAAATTCTCAGCACGCTTCGGCGAG GTTTTGA ATCTGCACTCGAACGCCGAGGAGGTCATTCCAATTGTAATCGGGTCGAG CTAAGA AGGTGCAATTCCAAGGGAAACGCGGAAAGCCCTCTCGAAACTCGGAAT ATTTTAT TGGCAAGAGTGATTGGCTTACGATCTCACTAATAGCGCTGCGTAGCTCG TATCGTT CTAGAGATCGTCAACGCCTTCATGGATGACTGACCTGAACAAAACGTGT GACTTTT TGTCTTGTCTTGTCTAAAACTATTTATTCGAAATAAGGGGAGGCTAACTG AAATATT CCTGCAAGTTGAACGCGAAAGTTAGACCTTCCCACCTAAAGCCCAAAAG TTATTAT TGATCGGGGAATGAATCCGCGGGTGACCCCAGAGTTGGGTAAACCCTT TGACTG GAAACGTTGGAGAAGCGGAAGAGAGTCCCGCCACCGAGCATCGAGTGC TTAATAT TGCGGCGCCCGAATGAAACCGATCGCGGATGGTGCAAGTCGTAGGACG ACTGAC GGGCACGACCTAAGCCTCTGTCACGGCGGCGAAGCCAGGAATCACCAT TTGGGA GCAAAGGTGTGAACTGGGGCGGATACCTCCACGGGGTTTCCCTGGGCA CCAAGT TCGCGCGAGCGATGGCCAAAGTCCGCTTTCTCAGCTACAAAACAAAAAT CATCTCT GGTATGAGACTTCGTTAACACTAATTTTTCCGAGCCTAGCAGGCTCCCTT GTTACCC GACAACGCTTATGAATCTGGAAAAGGACACAAAGTGGAAAAAGCGCTG GGTACC ATGGTGGACAAAAGTCAGTTGAGACTTGATATCAGTTGTTTTGACTAAG GGTTCC AATTTTATTATCGTTGACTTTTAAATATTTTATTATTGACTGTTAATATACT TGTCATC GACTTGGGACCAAGTCATCTCTGTTACCCGGTACCGGTTCCTGTCATCAA AAACCG ACCGGAAAGTCCGTCCCACGTAATGTGGTAGACGCAGGAG(SEQID GAAAGT NO:1106) CCGTCCC ACGTAA TGTGGT AGACGC AGGAG (SEQID NO: 1352) R2 R2Ci-B AB097122 Ciona CGACGGTGAACCACCTTGTCGCGGTGTAAGAGCTTTAGTGTCTCGAACA CGACGGTG TGACAG MGEWPWVSWSLTV intestinalis AGAAATAGCTTGTGTGCTGTCCTTCTGGGCGGTGCACATACTTCTTAACC AACCACCT TAATAT LVEKWRPFTILQPYP TCCCGAGGCCATGCCGGCGGGGGCTTTAGCCCCCGGCAGGTTTTACCAT TGTCGCGG GAAAAC MPGQLRVDVYLPRK GCCGGACGGGTTCGAGAGGTAGAGGCCAAACTAAGAGTTCACCAGCAG TGTAAGAG ATCACAT TSYLMDKNIYENTTS ACTTCGCACGCGGCTGGCCACTGGCCGAAGTTTAAACAACAGGGCCGC CTTTAGTG CTGACC PGGGPLCGEKTHRSD ATCTTCCCAAACTCAATATATGGTGTTAAGTGAACCGTGCCGATGGGTG TCTCGAAC GGCACA VIIPPPGFAPSTDTAS AATGGCCATGGGTAAGTTGGTCTCTAACGGTCCTTGTGGAGAAGTGGC AAGAAATA GAATCA NTLGENVDASATTSS GACCCTTCACAATACTTCAGCCTTACCCTATGCCCGGGCAACTTAGGGTG GCTTGTGT CCATGC ANPLSQEPGWCESCS GACGTATATCTCCCGCGGAAAACATCTTATCTCATGGATAAAAATATATA GCTGTCCT CGTAAT KLFKSQRGLRVHQRS CGAAAACACTACTTCTCCAGGAGGGGGACCGCTTTGCGGTGAGAAGAC TCTGGGCG GCACCC KHPELYHSQNQPLPR GCATCGTTCCGATGTGATCATCCCTCCACCGGGGTTCGCCCCGTCGACG GTGCACAT AACTAA SKARWSDEEMVIFA GACACTGCTTCCAATACATTAGGGGAAAATGTGGATGCATCTGCAACCA ACTTCTTA GGATTC REEIANRKIRFINQHL CTAGCAGTGCCAACCCATTAAGTCAGGAACCAGGATGGTGCGAATCTTG ACCTCCCG CAATGG HKVFPHRTLESIKGLR TTCCAAGCTGTTCAAATCACAAAGAGGACTTAGAGTCCACCAAAGATCA AGGCCATG GTAAAA GKNVRYARIMADLE AAACATCCAGAGTTGTACCATTCTCAGAATCAACCGCTTCCGCGATCTAA CCGGCGG AAAAAA AEMTSQPEAATSLCT AGCACGATGGAGTGATGAGGAAATGGTTATCTTTGCTCGCGAAGAAAT GGGCTTTA AAAAAA ETSENLASSNVLPQT TGCAAATAGGAAGATCCGCTTTATCAATCAGCACTTACACAAAGTGTTTC GCCCCCGG AAAAAA RGWAENLVENIDTA CACACAGAACGCTAGAGTCAATTAAAGGGTTGAGGGGAAAAAACGTAA CAGGTTTT AAAAAA HLANLGPLSQFEPGK GATACGCTCGCATCATGGCTGACCTTGAGGCGGAAATGACCTCTCAGCC ACCATGCC AAAAAA PSSSTKEAINTEYND CGAAGCAGCCACAAGCTTGTGCACCGAAACATCAGAAAATCTCGCTTCT GGACGGG AAAAAA WISKWLPSGAAHRE AGCAATGTTCTGCCCCAAACAAGAGGATGGGCTGAAAATCTCGTCGAAA TTCGAGAG A(SEQ RRANPPSTKLNARAT ATATTGACACAGCACATCTGGCAAACCTTGGGCCGTTATCCCAGTTTGA GTAGAGG IDNO: RRLQYSRIQNLYKLNR ACCTGGTAAACCAAGCAGCAGTACAAAAGAGGCTATCAACACCGAGTA CCAAACTA 1353) SACAQEVLSGAWKV CAACGACTGGATCAGCAAATGGCTTCCCTCTGGTGCTGCTCATAGAGAA AGAGTTCA QSGELNLKEVQPFW AGAAGAGCAAATCCACCCTCAACTAAGCTAAATGCTAGAGCAACAAGG CCAGCAGA EKMFRKESAKDRRKP AGGCTACAATACAGTAGGATCCAGAATCTCTACAAATTAAATCGTTCTGC CTTCGCAC KPTGEVLWGLMEPL CTGCGCCCAAGAAGTATTAAGCGGGGCTTGGAAGGTCCAGTCGGGCGA GCGGCTG TIAEVGSTLKSTTPSA ACTGAACTTGAAAGAGGTCCAACCGTTTTGGGAAAAAATGTTCCGGAAA GCCACTGG PGPDKLTLDGVKRIPI GAATCCGCCAAGGACAGGCGCAAGCCCAAGCCCACTGGTGAAGTACTC CCGAAGTT AELVSHYNLWLYAGY TGGGGACTGATGGAGCCTCTAACAATTGCCGAGGTGGGATCCACTCTG TAAACAAC QPEGLREGITTLIPKIK AAGTCAACGACACCATCAGCCCCAGGACCCGACAAGTTAACACTAGATG AGGGCCG GTRDPAKLRPITVSSF GCGTAAAGAGGATACCCATTGCTGAACTAGTATCCCATTACAACCTATG CATCTTCC ICRIFHRCLAQRMETS GTTGTACGCTGGCTACCAACCAGAGGGACTGCGCGAAGGAATAACCAC CAAACTCA LPLGERQKAFRKVDG CCTAATTCCGAAAATTAAGGGTACGCGTGACCCGGCTAAGCTACGCCCA ATATATGG ICHNIWSLRSLIHNSK ATTACAGTGTCTAGCTTTATTTGCCGCATCTTCCACCGATGTTTGGCACA TGTTAAGT DNLKELNITFLDVRKA ACGAATGGAAACCTCTCTTCCATTGGGCGAAAGACAGAAAGCTTTCCGC GAACCGTG FDSISHKSLGIAAARL AAGGTCGATGGCATCTGTCATAATATTTGGTCCCTAAGGTCTCTAATACA CCG(SEQ GLPPPLITYISNLYPNC CAATAGCAAAGACAACTTAAAAGAACTCAATATAACATTTTTGGACGTA IDNO: STKLKVNGKISKPIEV CGAAAAGCATTTGACTCCATCTCGCATAAATCACTGGGCATAGCTGCAG 1230) RRGVRQGDPLSPLLF CCCGCCTTGGACTTCCCCCACCCTTGATAACATACATCTCCAACCTTTACC NAVMDWALSELDPR CAAACTGTTCTACCAAATTAAAAGTAAACGGAAAAATATCAAAACCAAT VGVQIGEQRINHLAF TGAAGTTCGTCGAGGAGTCCGACAAGGAGACCCACTCTCCCCGCTACTG ADDIILVSSTKIGMVS TTTAATGCAGTAATGGATTGGGCCCTATCTGAACTGGACCCAAGAGTGG SINTLSRHLAKSGLEIS GAGTACAAATTGGGGAACAACGAATAAACCATCTAGCATTCGCCGATGA AGKEGKSASMAIVV TATTATTTTGGTCTCTTCAACAAAGATCGGAATGGTGTCCTCCATCAATA DGKKKMWTVDPLPR CGTTGAGTCGCCATCTGGCAAAATCAGGCCTCGAAATTAGCGCTGGGAA FKVNSQKIPALSITQQ AGAAGGAAAATCAGCTAGTATGGCCATCGTTGTTGATGGAAAGAAAAA YKYLGINIDAQGARN GATGTGGACAGTAGATCCCCTTCCTAGGTTCAAAGTAAACAGCCAGAAA DAARILTEGLAELSRA ATTCCGGCACTGAGTATTACACAACAGTATAAGTATCTGGGGATAAACA PLKPQQRLYLLRVHLL TAGATGCCCAAGGGGCAAGAAATGATGCAGCTCGAATCCTTACAGAGG PKLQHGLVLSSCAKR GTCTAGCAGAATTATCTCGTGCCCCACTGAAGCCACAACAGCGACTATA ALTYLDKSVRSAIRR TCTGCTTCGGGTGCATCTTCTTCCGAAACTGCAACACGGACTGGTTTTAA WLTLPKDTPTAFYHA GTAGTTGTGCAAAAAGAGCGCTAACATACCTGGACAAAAGTGTAAGGT KACDGGLGITRLEHTI CTGCAATAAGAAGATGGCTTACACTTCCGAAGGACACACCAACAGCCTT PILKRNRMMKLTLSE TTACCATGCGAAAGCATGTGACGGTGGACTAGGCATTACACGTTTGGAA DPVIMELVKLTYFTNL CACACAATACCAATACTCAAAAGAAATAGAATGATGAAGTTGACATTGT LHKYSNVKLLNSWPV CAGAGGATCCAGTGATCATGGAACTTGTAAAATTGACATACTTTACAAA TDKDSLARAEASMLH TTTACTACACAAATACTCAAATGTGAAGCTGCTTAACTCCTGGCCAGTAA TSVDGRGLSNCSDVP CAGACAAAGACAGTCTGGCCCGTGCCGAAGCCTCCATGCTTCACACCTC RQSDWVTNGASLLS TGTGGATGGGCGTGGACTAAGCAATTGCAGTGATGTCCCGCGCCAAAG GRDFIGAIKVRGNLLP TGACTGGGTCACAAATGGCGCGTCGCTACTTAGTGGTCGTGACTTTATT TKVSAARGRQREITC GGAGCAATAAAAGTGAGGGGCAACCTCTTGCCAACAAAGGTTAGTGCG DCCRRPESLGHILQTC GCACGTGGACGGCAACGGGAGATAACATGCGACTGCTGCAGACGACCG PRTWGPRISRHDSLL GAGTCACTGGGCCACATCCTGCAGACCTGCCCTAGAACATGGGGACCG KRVRNQACLKNWTP AGAATCTCCAGGCATGACTCCCTGTTGAAGCGAGTTCGGAATCAGGCAT IIEPSIPTNIGLRRPDL GCTTAAAAAACTGGACCCCGATAATAGAGCCATCGATACCAACCAACAT VLAKGNIAFLVDATV AGGACTTCGTCGGCCAGACCTCGTTTTGGCCAAAGGTAACATCGCCTTT VADNANMQLQHEA CTTGTAGATGCCACCGTGGTTGCAGACAATGCCAACATGCAGTTGCAAC KVEKYNNSDIKEWIK ACGAAGCAAAAGTGGAAAAGTACAACAACTCAGATATAAAGGAATGGA VHCPGVDEVRVTSLT TAAAAGTACACTGCCCAGGGGTTGATGAAGTGAGGGTAACGTCACTAA ANWRGCLYGGSASF CTGCGAACTGGCGCGGTTGCTTGTATGGAGGTTCAGCATCCTTCCTAAC LTEDLGLPKAELSLLS AGAAGACCTGGGACTGCCTAAAGCAGAGTTAAGTCTGCTCAGTGCAAA AKINEKGYYLWCAHY GATCAATGAAAAAGGATACTACCTTTGGTGTGCTCACTACAGGGGAACA RGTARLWNRPLRS GCAAGGTTGTGGAATCGACCGTTAAGGTCCTGACAGTAATATGAAAACA (SEQIDNO:1475) TCACATCTGACCGGCACAGAATCACCATGCCGTAATGCACCCAACTAAG GATTCCAATGGGTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA AAAAAA(SEQIDNO:1107) R2 R2C_NGi Nasonia CGGGTTCCCCCGACTTCGGCTTGCCGTGGTCTGGGGCTCACTGCTTTTTG CGGGTTCC TAGCGG WVTSPRRPRYVGPQ giraulti TGGAGTCATGGTTACATGGTGACCCTGGTTCCTCGCACCCCCGCTGGAA CCCGACTT ACTGGA KKKASDGNDGRAAA ACTATCTGGGGAGGCCATGATTGGGTAACGATAAAGGTCCTGGTCGTG CGGCTTGC CTGTCT RAEPTNPGGPDRAD TCCTCCTGAGATAGGCTGAATGGGTCACTAAGTGGCACCTAATGGGTTA CGTGGTCT GGAGGA DDEGDVKFWCEFPG CCAGCCCTAGGCGGCCGAGATACGTTGGTCCACAGAAAAAGAAAGCCT GGGGCTCA GTGTTTA CDRFFMTRSGRGLH CGGATGGAAATGACGGACGAGCCGCTGCTCGTGCCGAACCAACGAATC CTGCTTTTT ACTCGG HKKGHPDWNDQRN CGGGCGGACCAGACCGCGCTGACGACGACGAAGGGGATGTTAAATTCT GTGGAGTC GTTCTCA LAGKQHRKEIWSEEE GGTGTGAATTTCCAGGTTGTGATCGCTTCTTTATGACCAGGAGCGGTAG ATGGTTAC TGGGAA RLLLAKKEAELAISGA AGGCCTCCATCACAAGAAAGGCCACCCTGATTGGAATGATCAGAGAAA ATGGTGAC CCCGAC RFINVELRDFTARSLD CCTGGCCGGAAAGCAACACCGAAAAGAGATATGGTCGGAGGAAGAAC CCTGGTTC AACGTT AIKGQRKRPDYKILVE GTCTCCTGCTTGCCAAAAAAGAGGCGGAGCTTGCCATCAGTGGAGCTA CTCGCACC GTTATCT KFVRELRVRGIRQGV GGTTTATTAACGTAGAGTTGCGTGATTTTACAGCGCGCTCCCTAGACGCT CCCGCTGG TGTATG ASRSQQARAMAVA ATCAAGGGCCAGCGAAAGAGACCCGACTATAAGATCTTAGTCGAGAAA AAACTATC ACAATTC GAPAATSSGAPPVAT TTTGTCAGGGAGTTAAGGGTTAGAGGCATTCGTCAAGGAGTGGCCTCG TGGGGAG ATAAAA QPPPSGRVLRSQVVE CGGAGTCAACAAGCTCGCGCGATGGCGGTGGCAGGAGCTCCTGCAGCG GCCATGAT AAAAAA APAMEIPVAESEGDS ACGTCCTCGGGGGCACCACCTGTCGCGACTCAACCACCACCATCAGGTC TGGGTAAC AAAAAA SGDELFEDVEPVRLS GCGTACTAAGATCTCAGGTCGTTGAAGCACCAGCGATGGAGATCCCCGT GATAAAG AAAAAA DLPPDRFTIYFAGLEIP GGCAGAGTCGGAAGGTGACTCCTCGGGGGACGAGCTGTTTGAGGATGT GTCCTGGT AAAAAA GTEDIYAHRLHTICL CGAGCCCGTGCGATTGTCCGACCTACCCCCTGACAGGTTTACGATATACT CGTGTCCT (SEQID MTTWRTKEEVRLEL TTGCTGGGCTTGAAATACCCGGCACCGAAGATATATATGCCCACAGGCT CCTGAGAT NO: GLFLKDLFPSKGSQER CCATACCATCTGCCTGATGACAACGTGGCGAACCAAAGAAGAGGTAAG AGGCTGAA 1354) PERTNLPDPRNRIER ATTAGAACTTGGCCTCTTTTTGAAAGATTTGTTCCCGAGTAAGGGCAGTC TGGGTCAC RRGEYKKCQDLWRR AAGAACGCCCGGAGAGAACCAACCTGCCGGACCCGAGAAATCGGATCG TAAGTGGC NKSTCVQRILKEDLSQ AGAGGCGGAGGGGGGAGTACAAAAAATGCCAGGATCTATGGCGACGA ACCTAA GECLPRELMEPFWN AATAAGTCAACCTGTGTTCAGCGGATCCTTAAAGAGGATCTTTCGCAGG (SEQID ATFTQNPGTAPVLPP GTGAATGTTTGCCTCGAGAGCTGATGGAGCCCTTTTGGAATGCGACTTT NO:1231) PTEVYSSVWEPIRPE CACCCAGAATCCTGGCACGGCTCCGGTGCTCCCTCCTCCCACGGAGGTT NIKGNYPPQNTAAGI TATTCTAGTGTTTGGGAGCCCATTCGGCCCGAGAATATCAAGGGCAACT DGLTVGDLKGVSRE ATCCGCCGCAGAACACCGCGGCAGGGATAGACGGACTGACAGTGGGT MLARIFNLFMWCGK GACCTGAAAGGGGTGTCGCGGGAGATGCTGGCCAGAATTTTTAACTTAT LPEHLCASRTILLPKKP TCATGTGGTGCGGCAAACTGCCAGAGCACCTTTGTGCCTCACGCACAAT GAKVPGEFRPITVTSV TCTCCTGCCCAAGAAACCTGGGGCGAAAGTCCCCGGCGAATTCAGGCCT LIRTFHKVLAERLKVV ATCACCGTGACATCCGTCCTCATCCGGACCTTTCACAAGGTTCTGGCCGA PLDPRQRGFRESDGC AAGACTGAAGGTTGTCCCTCTTGACCCCCGCCAAAGAGGCTTCAGAGAG AENVMLLDMTIRYH TCCGATGGATGTGCAGAGAACGTGATGCTACTGGACATGACCATCCGGT HERRRKMFLALLDM ACCACCACGAGCGGCGCAGAAAGATGTTCTTGGCCCTGCTAGACATGG AKAFDSVSFESMREV CTAAGGCATTTGACTCGGTCTCTTTCGAGTCCATGCGGGAGGTTTTGACT LTTKGIPTPFIEYFMT ACTAAAGGCATACCAACGCCATTTATTGAGTATTTTATGACGCACTTGGA HLEDSFTVLQHGNW GGATAGTTTTACTGTTCTTCAGCATGGTAACTGGCAATCGGGGAAAATC QSGKIHPTCGVKQG CACCCAACATGTGGTGTGAAGCAAGGCGATCCACTGTCTCCGCCTATCT DPLSPPIFNFIMDEM TCAACTTCATCATGGATGAAATGTTGAAGAGGTTGCCTAAGGAAATCGG LKRLPKEIGVNLDGLF GGTTAACTTGGACGGGTTATTTGTTAATGCTATGGCATTTGCGGATGAC VNAMAFADDLSLVA CTGAGCCTTGTTGCCAATACCGAACAAGGTCTGCAGATCCTCATAGATG NTEQGLQILIDEATSF AAGCTACTTCCTTTCTGGGGCTCTGTGGACTCCGCGCCAATCCCAATAAG LGLCGLRANPNKCVT TGCGTCACCCTAGCAATTAAGACCATCCCGAAGGAGAAAAAGACGGCC LAIKTIPKEKKTAIDPS ATTGACCCCTCATCACATTTTAGGATAGGTAATGCGGTGATCCCCTCGTT SHFRIGNAVIPSLKRT GAAGAGGACAGATGAGTGGGTGTACTTAGGGATCAAATTTAATTCAAA DEWVYLGIKFNSNGR TGGTCGCCTTATCTCTGACGCAAAACCCAAGCTCATAAAAGATCTTGAG LISDAKPKLIKDLELLT CTACTAACTAAGGCACCACTCAAACCACAGCAAAGACTGTGGGCGCTTA KAPLKPQQRLWALK AGGTGATTGTCATTCCGGGCATCCTTTACAGAGGTACCCTGGGGAGCAG VIVIPGILYRGTLGSST CACCGCAGGCTACCTACGCTCTCTTGACTGTGTAATAAGGGCCTATGTTC AGYLRSLDCVIRAYVR GGCGATGGCTACGTCTCCCTGGAGATTGCCCGAATGGGTATTTTCATGC RWLRLPGDCPNGYF AGCGGTTGCGGATGGAGGGCTGGGAGTTCACCCCATACGATACAAGGC HAAVADGGLGVHPI GATGGTAGATCGCCTTGCCCGGCTCCGAAAATTAGAGAAATCCGCGTAC RYKAMVDRLARLRKL ATCACGGGGCCTGAAGCCGCACGTTATCTTCAAAGACAAGTTTCTATCG EKSAYITGPEAARYLQ CCGAAAATAGGCTCCGAGATGGGGCCAACCGCATTATGAGTGATGCGA RQVSIAENRLRDGAN GTATGCTAAGGGAGTTCCTTCGGGAGCTTCTGTACAAGTCCTTTGATGG RIMSDASMLREFLRE TCGTCCCCTGGAAAATTCCAGCAAAGTACCAGGTCAGCACCGCTGGGTC LLYKSFDGRPLENSSK GAGGAGCCAACCCGTTTCCTATCCGGGGCGGACTATATGAATTGTATAC VPGQHRWVEEPTRF GTGCGAGGATCGCAGCTCTTCCGACTGCAGCCAGGTGTGCTAGGGGAC LSGADYMNCIRARIA GTCTCAAAGACAAGCATTGCCGGGCAGGTTGCGGAAATGTGGAGACGC ALPTAARCARGRLKD TTAACCACGTCTTGCAATTCTGCCACCGTACCCATGGCACTCGCATTGGA KHCRAGCGNVETLN CGCCATGATGCGGTTGTAAAGTATGTTGTAGGAGGACTCAAGAAGAGG HVLQFCHRTHGTRIG GGGTACGCAGTGAAAGAAGAGCCGAAAATCGTCTTACAGGATGTGGTG RHDAVVKYVVGGLK TACAAACCTGATATGGTTGCGACCAAGGAAGGGAAAACACTCATTCTGG KRGYAVKEEPKIVLQ ACGCTCAGGTTCTAGGCGACCAGCGTGATATGAGACTGGCACATGAAG DVVYKPDMVATKEG ATAAGCTCCGTAAGTATGGGGCCCCAGAATTTAAACGAAAGATCAGGA KTLILDAQVLGDQRD GTGAGACGGGGTCGGCAACCATTAAGTCCTTGTCGGTTACATTGAGCTG MRLAHEDKLRKYGA GCGAGGGTTGTGGGGACCTGACTCAGTAAAGGGGCTCCTCGAAGAGG PEFKRKIRSETGSATIK GAGTGATTCTAAAGAAGGACCTTAAGATCTTGTCCACAAGAGTACTTAT SLSVTLSWRGLWGP AGGAGCTTTGGCAGGCTGGAGAAGGTTTAATGAAAGGACGAGTATGGC DSVKGLLEEGVILKKD AACATCTGGAAGAAGAGAAGAGGTTACAACAAGGATGGTGAGAAGGT LKILSTRVLIGALAGW GGAAGAGAAGAGAGAGGGTCGGTGTTGGTTAGCGGACTGGACTGTCT RRFNERTSMATSGRR GGAGGAGTGTTTAACTCGGGTTCTCATGGGAACCCGACAACGTTGTTAT EEVTTRMVRRWKRR CTTGTATGACAATTCATAAAAAAAAAAAAAAAAAAAAAAAAAAAA(SEQ ERVGVG(SEQID IDNO:1108) NO:1476) R2 R2La JN937617 Lepidurus GGGGTAGCAATTGATCGATTCCCGCTTCCTCGTGGCGCTACCCTGGGTA GGGGTAG TGATAA MSGKSSKPRTVSSGS arcticus ATACTATGAGGAATTGATCACACCGTAGCGAACGTCATCAGTCACAGCT CAATTGAT TCGCTCC SSQETPPSGSNACDI GCACGAATCCAGATAGAAATATAACAGACGAGTAATTCTTTTAAAAGCT CGATTCCC ATCCTGC CGKCFMKPVGLSRV CGTCAGTAATCTTCCCAGATGTCTGGAAAGAGCAGCAAACCGAGAACTG GCTTCCTC AACTAA HPSQYHARLEKNQPK TATCCAGTGGATCCAGTTCCCAAGAGACTCCTCCAAGTGGGTCCAACGC GTGGCGCT TTATGA AKKFRWTDEDLYFLA ATGTGATATATGCGGGAAATGCTTCATGAAACCGGTCGGTTTGTCCAGA ACCCTGGG ATGCAA KKEAELLHLGSIKFVN GTCCATCCTAGTCAATATCATGCACGCTTAGAAAAGAACCAACCTAAAG TAATACTA ATCTGTT KELAEFFPEKSVDQIR CGAAGAAGTTCAGATGGACTGATGAGGATTTGTATTTTTTAGCGAAAAA TGAGGAAT AAGTGA GQRRSETYKQQVLSI AGAAGCGGAACTTTTACACTTGGGCAGCATTAAATTTGTGAATAAAGAA TGATCACA CATTAGT HSELLKLQTVADSPPP CTGGCAGAATTCTTTCCGGAGAAATCTGTCGACCAAATCAGAGGCCAAA CCGTAGCG GATACTT SRIPAKEVSAWLDFFL GACGTTCTGAGACGTACAAACAGCAAGTTTTATCTATCCATTCTGAACTG AACGTCAT ACCTGA ALPKTKNKFSEDKLD CTCAAGTTGCAGACTGTAGCCGATTCTCCTCCCCCATCGAGAATCCCGGC CAGTCACA TACTTAC QLIRTAQDGTLILDDL GAAAGAAGTCTCCGCTTGGCTTGACTTTTTTCTGGCCCTTCCCAAAACCA GCTGCACG CCTGGT DLYLREVLVQPTSQG AGAATAAATTTAGTGAAGACAAGTTAGACCAACTCATCCGTACCGCTCA AATCCAGA ATTTATT EKQAKLLPPPKSSREK AGATGGTACTCTCATCCTGGATGATCTAGATTTATATCTTAGAGAAGTTC TAGAAATA TGACCT RDREYARAQNLYRK TAGTTCAACCCACTAGTCAGGGAGAGAAACAAGCGAAGTTGCTCCCACC TAACAGAC ATACTTA NKTACVNAILDGNKK CCCCAAATCCAGTAGAGAGAAAAGAGACCGCGAATATGCCCGTGCCCA GAGTAATT CCCTGG CENKIPDIDDFWKTIF GAATTTATATCGGAAGAATAAGACCGCTTGTGTGAACGCCATCCTTGAT CTTTTAAA TATCTAC ESHSPPDAEPVCYVV GGTAACAAGAAGTGCGAAAATAAAATACCTGACATTGACGATTTCTGGA AGCTCGTC CTGACA DEEPTNIWSWISFFE AGACCATATTTGAGTCCCATTCTCCCCCAGATGCAGAACCGGTGTGTTAT AGTAATCT TATATTT MNHNYPDSSTSPGP GTCGTTGACGAGGAGCCGACAAATATTTGGAGTTGGATTTCATTTTTTG TCCCAG ATCTAAC DGVTARMLRSIPARV AGATGAATCATAACTACCCCGATTCGTCTACCTCACCCGGTCCTGATGGC (SEQID CACCTAC LNKLLNLLLFIEDLPAV GTGACTGCTCGCATGCTCCGTTCCATCCCAGCACGTGTCCTAAATAAATT NO:1232) CTATGAT FKCHRTVLIPKIDNPT GTTAAACCTTTTACTCTTTATTGAAGACTTACCTGCAGTGTTCAAGTGCC GACTCC SPGEFRPITISSIVVRQ ATCGTACCGTGTTGATACCGAAAATCGACAATCCGACATCACCAGGCGA CGCGGA LNKIIAARVSEGVPIN ATTTCGTCCCATCACGATTTCCTCCATAGTTGTTCGGCAGCTGAACAAAA AACTCTC PRQKAFRQIDGCAEN TCATCGCTGCTCGCGTCTCAGAGGGCGTTCCCATAAATCCTCGCCAGAA ACTTACC VFLLDFILRDAKTKIKS GGCCTTCCGGCAAATTGATGGATGCGCTGAGAATGTTTTCCTTTTGGATT TTATTAC LSLATVDIKKAFDSVS TCATTCTTCGTGACGCAAAAACGAAAATTAAGTCCCTTTCGTTGGCTACT CCACTTG HHSIFRAIRGARCPEN GTCGACATTAAGAAAGCATTTGATAGTGTGTCACATCATTCCATCTTCCG GTCTTTT LVNYIQNSYSGCTTQI AGCGATTAGAGGAGCGAGATGCCCTGAGAATCTAGTGAATTATATCCA ATTTTCT SVGGSISASKIPMNR GAACAGCTACTCGGGTTGTACGACCCAAATTTCTGTTGGCGGCTCTATAT CGTTCCT GVKQGDPLSPVLFNL CAGCATCAAAAATCCCGATGAACCGTGGTGTCAAACAAGGGGACCCACT TATTACT VINEIIRKLPASIGYPIN ATCTCCTGTCCTCTTTAATTTAGTGATAAACGAAATTATTCGCAAACTGCC TTGTTCC SELSINCIAYADDLILV GGCCTCGATTGGATACCCAATCAATAGTGAGCTTTCGATAAACTGTATT TTTGGT TNTREGLKLLLGLLNE GCTTATGCGGATGACCTGATACTAGTTACAAATACTAGAGAGGGCCTGA GTAGGG ELPKRGLELNASKCFG AGCTACTACTCGGCCTATTGAATGAAGAGCTCCCTAAGCGAGGATTAGA TTCTCTG LSLTALGKLKKTHLCT ATTGAATGCATCTAAGTGTTTCGGCTTATCCTTAACAGCACTCGGGAAAT GTTTTTG SDQLDLHGTLIKNLTA TGAAGAAGACTCACCTGTGTACCAGTGACCAACTCGATTTACATGGGAC GAACGG EESWVYLGVPFSHIG TCTAATTAAGAATCTTACCGCTGAGGAATCCTGGGTTTATTTGGGTGTTC CTTCCTT RSKSFSPDLEALLNKL CTTTCTCACACATTGGGCGCTCTAAGAGCTTCAGTCCAGATCTTGAAGCT AGCCGG QKSPLKLQQKLFALR CTTTTAAATAAACTCCAGAAATCGCCGCTTAAACTGCAACAAAAGCTTTT AATTTTG VYLIPRLLHGLVLSRV TGCTTTGCGAGTTTATCTAATTCCTAGACTGCTCCACGGTTTAGTCCTATC TCTGAT AIGELKIMDKLILKHL AAGAGTGGCCATTGGCGAATTAAAAATAATGGATAAGTTAATTTTGAAA GTATCTT RVWLRLPKDTPLGFF CACTTACGCGTATGGCTTCGTTTACCGAAAGATACCCCTTTGGGATTTTT GCTTGT YSPVKLGGLGIKNLRT CTACTCACCGGTGAAATTGGGCGGTTTGGGGATTAAAAACTTACGTACA GTCCTT NVLKCRKQRIERMLV AACGTTTTGAAGTGCAGGAAACAACGCATTGAGCGAATGCTAGTCTCCC GAAATA SPDDVVRLVAESEIFL CTGATGATGTGGTGCGCTTGGTTGCAGAGTCGGAGATTTTCCTGAAAGA TACGAC KETDKLKDLLTINGM GACGGACAAGTTAAAGGACCTACTTACTATTAACGGAATGTGCCTAGAT CCAGGC CLDXRNVPRTGKNN TANCGGAATGTGCCTAGAACGGGAAAAAATAACAAGTTTTGGAGTGAA TTGCGT KFWSERLYTSFDGKT AGGCTATATACGTCATTTGATGGTAAAACACTTGCCTATAGCGAGTACTT CATTTAG LAYSEYFTQGGGWIR TACTCAAGGCGGCGGCTGGATCCGTGAAGATAAGATCCTTCAACCTGCC GCTCTG EDKILQPAHVFAECIK CATGTTTTCGCTGAATGCATCAAGTTACGTATTAATGCTTTACCCACTAA GGAAA LRINALPTKSRVAHG ATCTAGAGTTGCACATGGCCGTCCAACTAAAGATCGTTCCTGTCGTGCC (SEQID RPTKDRSCRAGCLDV GGATGTCTTGACGTTCAAAAAGTGCCAACTATCGAAACAATCAATCATA NO: QKVPTIETINHIAQVC TAGCACAAGTTTGTCCTCGGACGCATGGTGCAAGAATAAAAAGACACG 1355) PRTHGARIKRHDRLV ATCGGCTCGTTCAATTCCTTTCTCTCAACCTTAGGAAAAATCCAAAAAGA QFLSLNLRKNPKRNV AATGTTTTAGTGGAGTACAATTTCCGTACAGTCGCTGGTATCCGTAAACC LVEYNFRTVAGIRKP GGATATCATTGTTATTGAAGATACCCGTGCAGTTATCCTTGACGTCCAAG DIIVIEDTRAVILDVQ TTGTTGGTGATTCGTCCAATTTGGAGATGGAATACCTGGAGAAATCCAG VVGDSSNLEMEYLEK GAAATATTCTAATGACGCTAACTTTATCAATGCGCTTCAAAAACTTTATC SRKYSNDANFINALQ CAACGGTCACTAACCTAACTTTTCATGCAGTTACCTTCAACAATAGAGGC KLYPTVTNLTFHAVTF CTGATTGCCAAATCCACAGTGGCGGCCCTCCGGATGCTCGGCGTCCCAC NNRGLIAKSTVAALR CTAGATGCATAATGATCCTTTGTGTGATTAGCCTTGAGAAGACCCTGGA MLGVPPRCIMILCVIS AGTCTGGAGGATGTTTAACCAATCAACTGCTTCCGCTCGGAAATGATAA LEKTLEVWRMFNQS TCGCTCCATCCTGCAACTAATTATGAATGCAAATCTGTTAAGTGACATTA TASARK(SEQIDNO: GTGATACTTACCTGATACTTACCCTGGTATTTATTTGACCTATACTTACCC 1477) TGGTATCTACCTGACATATATTTATCTAACCACCTACCTATGATGACTCCC GCGGAAACTCTCACTTACCTTATTACCCACTTGGTCTTTTATTTTCTCGTT CCTTATTACTTTGTTCCTTTGGTGTAGGGTTCTCTGGTTTTTGGAACGGCT TCCTTAGCCGGAATTTTGTCTGATGTATCTTGCTTGTGTCCTTGAAATATA CGACCCAGGCTTGCGTCATTTAGGCTCTGGGAAA(SEQIDNO:1109) R2 R2LcA Lepidurus TTTGGGGTAGCAATTGATCGATTCCCGCCTCCTCGTGGCGCTACCCCGG TTTGGGGT TAGTGC MSEESRPKQTASKRG couesii GATAGCCTCAAAGAAATTTGACGGTAAAGCAAAGAGGAATTGATCACC AGCAATTG TTGAGT AAVEKTMMSGTYVC CAAGGCAGTACATCGGCCTTCCTGCAGGAGCTCTGATAAAGATATTAGT ATCGATTC GATGCC TLCGRSFEKSVGLSLH GAGTTATTCTGTTGAAGCTCGCTATTTCATTCCCCTGATGTCAGAGGAAA CCGCCTCC TATCCTT TNRMHPEAYNKLKE GCAGACCAAAGCAGACGGCCTCCAAGCGTGGAGCCGCCGTGGAGAAG TCGTGGCG TCTTTGA AKKPVLKKARWSEEE ACTATGATGAGTGGGACCTACGTGTGTACTTTATGTGGTAGGAGTTTTG CTACCCCG TTAACTC VFLLAQKEAELSFIGG AGAAGTCGGTGGGTTTGTCACTCCACACGAACCGGATGCACCCTGAAGC GGATAGCC TTACCAT IKFMNIELHKIFPEREL TTACAATAAGCTGAAGGAGGCTAAGAAACCTGTTTTGAAGAAGGCGCG TCAAAGAA ATACTTA EGIKGQRKNPTYKAQ ATGGTCAGAGGAGGAGGTTTTCCTTCTAGCGCAGAAGGAAGCGGAGCT ATTTGACG CCAGTTC VVSLLAEIRESKANDS CTCATTCATTGGAGGAATTAAGTTCATGAACATCGAGCTGCATAAAATTT GTAAAGCA TTACCCG SSSSSSSSSCDSASLGI TTCCAGAGCGTGAGTTGGAGGGAATCAAAGGCCAAAGAAAGAACCCAA AAGAGGA TACTTAC SNWLEFLLALPKTSN CGTACAAAGCGCAAGTCGTTAGCTTATTAGCTGAGATCCGAGAGTCTAA ATTGATCA CCTGTAT QFQEGRLDRLISDAL GGCCAATGATTCCTCTTCATCTTCCTCATCCTCCTCCTCTTGTGATTCGGC CCCAAGGC ACTTACC RGVDVLENLDAYLLE CTCTTTGGGCATTAGCAATTGGCTTGAATTTCTGCTAGCTCTCCCGAAGA AGTACATC TGTGTG VFAKPMAQNPCPKP CCTCCAACCAATTCCAGGAAGGTAGGCTAGATAGGCTAATCTCTGATGC GGCCTTCC CGTACCT PPPAKNSRERRDREY CCTGCGGGGTGTTGATGTGCTAGAAAACCTTGATGCTTACCTTCTCGAG TGCAGGA GTGTAC SRVQNFYKKNRSACI GTTTTCGCTAAGCCCATGGCGCAAAATCCTTGTCCAAAGCCCCCACCCCC GCTCTGAT TTGTCCT NSILDGNTRSQNVIP TGCGAAGAACAGTCGTGAGAGAAGGGATAGAGAATACTCTAGAGTTCA AAAGATAT TTAGCC GLTKFWTETFEKNSP GAATTTTTATAAGAAGAACCGTTCAGCGTGCATTAATTCCATTTTAGACG TAGTGAGT GCCTTG PDDEAPDQFVADEP GGAACACCCGGTCTCAAAATGTGATACCTGGCTTGACGAAGTTTTGGAC TATTCTGT TGTTTTT RDMYKWITFYEMSQ CGAGACGTTTGAGAAGAACTCGCCACCTGATGATGAAGCCCCTGACCA TGAAGCTC ACCATT DYLDSSTAPGVDGFS GTTCGTTGCGGACGAGCCTAGAGACATGTATAAGTGGATAACTTTTTAT GCTATTTC GGTACT AKQLRSMSPRVLNKI GAGATGTCTCAAGATTATCTTGATTCTTCCACTGCTCCTGGAGTGGATGG ATTCCCCT TACCTTG LNLLLLSENLPNSFKM TTTCTCTGCCAAACAGCTCCGGTCCATGTCTCCCAGGGTGCTAAATAAGA G(SEQID TGTGGT HKTVLIPKIDDPKSPG TATTGAACCTTTTACTTCTTAGTGAGAACTTACCTAATTCGTTTAAGATGC NO:1233) TGCCCG DFRPITISPVLARLLNK ATAAGACAGTTCTTATACCGAAGATTGATGATCCAAAGAGTCCTGGGGA ATACTTA ILAARLSKLVPISQRQ TTTTAGACCGATCACGATTTCCCCTGTCCTTGCTCGCCTCCTTAACAAGAT CCTTGTA KAFLPVDGCGENIFLL TCTCGCTGCTAGATTGAGCAAACTAGTCCCCATTAGTCAGCGTCAAAAG TTTGCCT DYILRSSKKSSKSVAM GCTTTTTTGCCTGTTGATGGCTGTGGGGAGAATATTTTCCTTTTGGATTA TGTAATT AVLDVKKAFDSVSHH TATTCTTCGTAGCTCTAAGAAAAGTAGTAAATCAGTGGCGATGGCAGTT CTGCAT SILRALNEAKCPINFIN TTGGATGTTAAAAAGGCATTTGATAGTGTATCGCATCACTCCATCCTTAG GATATTT FVRNSYDGCTTKLTC AGCCCTTAATGAGGCCAAGTGTCCTATTAACTTTATTAACTTCGTCAGGA ATTGTGT GGTSFPDSVRMNRG ATAGTTATGATGGATGCACTACCAAACTTACTTGTGGGGGTACTTCCTTT AGGTTC VKQGDPLSPVLFNLII CCGGACTCGGTACGCATGAACCGAGGAGTGAAACAGGGAGATCCCCTA CTGATG DSAIRKLPDSIGYVIR TCGCCTGTGCTCTTTAATTTGATCATTGATAGTGCAATTAGGAAGTTACC CTTACCT DGLKINCLAYADDLIL TGACTCTATCGGCTATGTGATAAGGGATGGCTTAAAGATTAATTGCCTG GATTTGT VASSRAGLKTLLNIVA GCATACGCTGATGACCTGATTTTGGTAGCTTCAAGTCGTGCTGGCCTTAA CCCCCTC EHLSLRGLDLNAAKC GACCTTACTGAATATTGTTGCTGAGCATCTCTCGCTACGCGGGTTGGATT ATCATCT HGLSIIASGKAKTTYV TGAATGCTGCTAAATGTCATGGTCTGTCAATTATCGCCAGTGGTAAGGC TTAGTTT SAADSLDLDGQPIKN CAAAACTACTTACGTTAGTGCTGCGGATTCCCTAGATCTTGACGGTCAAC CGTTCTA LGVLDTWTYLGIPFS CTATAAAGAATTTGGGAGTGCTGGACACGTGGACCTACCTCGGGATTCC TTTCACT HLGRAEKVSPDLTNL ATTTTCCCATCTGGGTCGAGCTGAGAAAGTTTCTCCTGATTTGACCAACT CCATTAT LNKLQKAPLKLQQKL TACTTAACAAACTTCAGAAAGCTCCCCTCAAACTCCAGCAGAAGTTATAT GGAGTT YAVRNFVIPRALHGLI GCTGTACGTAACTTTGTTATCCCTAGAGCACTCCATGGCCTCATTCTATC CCGTTT LSKTNLKELNTLDRAI GAAGACAAACCTAAAAGAATTAAACACGCTTGATAGAGCAATTAGAGTT GTTTTTT RVFLRTLLYLPKDTPL TTTCTACGTACCTTGCTTTACTTACCTAAAGATACGCCACTGGGATTTTTC GGTGGA GFFHSPIKSGGLGITC CATTCCCCTATCAAGTCCGGAGGACTTGGGATTACGTGTTTTAGAACATC GGTACA FRTSVLKCRLQRIAR TGTCCTTAAGTGCCGGCTACAGAGAATAGCCCGAATGCGTAGTTCCTGT GCACCC MRSSCDGVIQAVAES GATGGAGTTATACAGGCAGTGGCGGAATCTGATATCTTCGCTGATGAAT TTTAAGC DIFADEYAKLRDLIRIN ATGCCAAGTTGCGGGACCTTATTAGGATAAACGGCAATGTGCTCGACAC TGGAAT GNVLDTTESIKRYWA GACCGAGAGTATTAAGAGGTATTGGGCTCAACGTCTTCACTCTTCGGTA TGAGTG QRLHSSVDGKTLAY GATGGCAAGACTTTGGCCTACATGGATTATTTCCCTCAGGGTAATCTTTG AGTTTAT MDYFPQGNLWMSE GATGTCAGAGGATAAAGTGAGCCAGCGATCTTACGTCTTTGCTGACTGT GTACTTT DKVSQRSYVFADCVK GTCAAGTTACGGATTAATGCCATCCCAACTCGTGTCCGTGTTTCTCGTGG GGATGG LRINAIPTRVRVSRGR TCGACCTAATAAGGAAATGTGCTGTAGAGCGAAATGTTTTGATTCCCAG TTGTAAT PNKEMCCRAKCFDS AGAATGCCAGCGTTTGAATCCTTGAACCACATAACGCAAGTGTGTCCGC AAACTA QRMPAFESLNHITQV GCACTCATGGCTCGCGAATCCAGCGGCATGATAAGATAGCCAAGTTCCT CCCGGA CPRTHGSRIQRHDKI TTTTAAGAACCTGAACAATTGCCCATCTCGCTCGGTTCTGTATGAACCCC GGCAT AKFLFKNLNNCPSRS ATTTTGTGACGGTCGATGGCCTGCGGAAGCCTGACATTATCATTTACGA (SEQID VLYEPHFVTVDGLRK TGATAGCCACATGGTGGTTCTCGATGTCCAGGTGGTTTCCGACTCTGCTA NO: PDIIIYDDSHMVVLDV ACCTGGAGAAGGAATTTGAGTGCAAGGCGAAGAAATATGCCAATGATG 1356) QVVSDSANLEKEFEC TTGCACTTCGATCTGCAATGCTAATTAAATACCCATTCATTAAGTCATTTT KAKKYANDVALRSA CTTTCGTTGCAGCTACCTACAACAACCGTGGTCTGATAGCTAAATCCTCT MLIKYPFIKSFSFVAA GTCCAGGTCCTTCGCCAGCTGGGTCTCTCGCCGCGCTCCATAATGGTCTC TYNNRGLIAKSSVQV CATCCTCATCTGCCTTGAGGGTACTTTGGAAACATGGCGCATCTTCAATC LRQLGLSPRSIMVSILI AGTCTACAATGAATGCCCATTAGTGCTTGAGTGATGCCTATCCTTTCTTT CLEGTLETWRIFNQS GATTAACTCTTACCATATACTTACCAGTTCTTACCCGTACTTACCCTGTAT TMNAH(SEQIDNO: ACTTACCTGTGTGCGTACCTGTGTACTTGTCCTTTAGCCGCCTTGTGTTTT 1478) TACCATTGGTACTTACCTTGTGTGGTTGCCCGATACTTACCTTGTATTTGC CTTGTAATTCTGCATGATATTTATTGTGTAGGTTCCTGATGCTTACCTGAT TTGTCCCCCTCATCATCTTTAGTTTCGTTCTATTTCACTCCATTATGGAGTT CCGTTTGTTTTTTGGTGGAGGTACAGCACCCTTTAAGCTGGAATTGAGT GAGTTTATGTACTTTGGATGGTTGTAATAAACTACCCGGAGGCAT(SEQ IDNO:1110) R2 R2LcB JN937619 Lepidurus TTTTGGGGTAGCAATTGATCGATTCCCGCCTCCTCGTGGCGCTACCCTGG TTTTGGGG TGATAA MSGKSSKPRTVSSGS couesii GATAACCTCAAAGAAATTTGACGGTAAAGCTAAGAGGAATTGATCACAC TAGCAATT TCGCTCC SSQETPPSGSNACDI CGTGACGAATATCATCAGTCACAGCTGCACGAATCCAGATAGATATATA GATCGATT ATCCTGC CGKCFMKPVGLSLH ACAGGCGAGTAATTCTTTTCGAAGCTCGTCAGTAATCTTCCCAGATGTCT CCCGCCTC AACTAA MSKVHPTQYHARLE GGAAAGAGCAGCAAACCGAGAACTGTATCCAGCGGATCCAGTTCCCAA CTCGTGGC TTATGA KNQPKAKKFRWTDE GAGACTCCTCCAAGTGGGTCCAACGCATGTGATATATGCGGGAAGTGCT GCTACCCT ATGCAA DLYFLAKKEAELLLLG TCATGAAACCAGTTGGTTTATCACTACACATGTCCAAAGTCCATCCTACT GGGATAAC ATCTGTT GIKFMNKELAEFFPE CAGTACCATGCACGCTTGGAAAAGAATCAACCGAAAGCCAAGAAATTTA CTCAAAGA AAGTGA KSVDQIKGQRRSETY GATGGACGGATGAAGATTTGTATTTTTTAGCGAAGAAGGAAGCTGAAC AATTTGAC CATTAGT KQQVVSIHSELLKLQ TTTTACTTTTGGGTGGCATTAAGTTCATGAACAAGGAACTGGCAGAATT GGTAAAGC GATACTT AVADSPPPSRIPAKEV CTTTCCGGAGAAATCTGTTGACCAAATCAAAGGCCAAAGACGTTCCGAG TAAGAGG ACCTGA SAWLDFLLALPKTKN ACATACAAACAGCAAGTTGTATCAATTCATTCTGAACTGCTTAAGTTGCA AATTGATC TACTTAC KFSEDKLDQLIRTAQE GGCTGTAGCTGATTCTCCTCCACCATCGAGAATTCCAGCGAAAGAAGTC ACACCGTG CCTGGT GTPVLNDLDLYLREVL TCGGCCTGGCTTGATTTCCTTCTGGCCCTTCCCAAAACCAAGAATAAATT ACGAATAT ATTTATT VQPTRQGERQAKPL TAGTGAGGACAAGTTAGACCAACTTATACGTACCGCGCAAGAAGGCACT CATCAGTC TGACCT PPPKSSREKRDREYA CCGGTCCTAAACGATCTAGATTTATATCTAAGAGAAGTTCTAGTTCAGCC ACAGCTGC ATACTTA RVQNFYRKNKTACV CACAAGACAGGGAGAGAGACAGGCGAAGCCTCTCCCACCACCCAAATC ACGAATCC CCCTGG NAILDGNKKCENKIP CAGTCGAGAGAAAAGAGACAGAGAATATGCTCGTGTCCAGAATTTTTAT AGATAGAT TATCTAC DIDEFWKAIFESQSPP CGCAAGAATAAAACCGCATGTGTGAACGCCATCCTTGATGGAAATAAGA ATATAACA CTGACA DAEPVSYVVDEEPKN AGTGCGAAAACAAAATACCTGACATTGACGAATTCTGGAAGGCCATATT GGCGAGT TATATTT IWSWISFFEMNRNY TGAATCTCAATCTCCTCCTGATGCAGAACCAGTGAGTTACGTTGTAGAC AATTCTTTT ATCTAAC PDTSTSPGPDGVTAR GAGGAACCGAAGAATATTTGGAGTTGGATTTCGTTCTTCGAAATGAATC CGAAGCTC CACCTAC MLRSIPARVLNKLLNL GTAACTACCCTGATACGTCGACCTCACCCGGTCCAGACGGCGTGACGGC GTCAGTAA CTATGAT LLFIEDLPAVFKCHRT TCGTATGCTCCGTTCCATCCCTGCGCGTGTCCTAAATAAATTGTTAAACC TCTTCCCA GACTCC VLIPKVDNPALPGEFR TATTACTCTTTATTGAAGACTTACCTGCAGTGTTCAAGTGCCATCGTACC G(SEQID CGCGGA PITISSIIVRQLNKIIAA GTTCTGATACCTAAAGTTGACAATCCAGCATTACCGGGAGAATTTCGGC NO:1234) AACTCTC RVSEGVPINPRQKAF CTATCACGATTTCTTCCATAATTGTTCGGCAGCTGAATAAAATCATCGCT ACTTACC RQIDGCAENVFLLDFI GCTCGCGTCTCAGAGGGCGTTCCCATAAATCCTCGTCAGAAGGCCTTTC TTATTAC LRDAKTKIKSLSLATV GGCAAATAGATGGATGTGCTGAGAATGTTTTCCTTTTAGATTTCATTCTT CCACTTG DIKKAFDSVSHHSIFR CGTGACGCAAAAACGAAAATTAAGTCCCTTTCGTTGGCTACTGTCGACA GTCTTTT AIRGARCPENLVNYI TTAAGAAAGCATTTGACAGTGTGTCACATCATTCCATTTTCCGAGCGATT ATTTTCT QNSYSGCTTQISVGG AGAGGAGCGAGATGCCCTGAGAATTTGGTGAATTATATCCAGAACAGC CGTTCCT SISTTKILMNRGVKQ TATTCGGGTTGTACGACCCAAATTTCCGTAGGCGGCTCTATATCTACAAC TATTACT GDPLSPVLFNLVINEII AAAAATCCTGATGAACCGTGGTGTAAAACAAGGGGACCCACTATCTCCT TTGTTCC RKLPASIGYPINSELSI GTCCTCTTTAACTTAGTGATAAACGAAATTATTCGCAAGCTGCCGGCCTC TTTGGT NCIAYADDLILVANTR GATTGGATATCCAATCAATAGTGAGCTCTCGATAAACTGTATTGCCTATG GTAGGG EGLKLLLNLLNEELPK CGGATGACCTGATACTAGTTGCAAATACTAGAGAGGGTCTGAAGCTACT TTCTCTG RGLELNASKCFGLSLT ACTCAACCTATTGAATGAAGAGCTCCCTAAGCGAGGATTAGAATTGAAT GTTTTTG ALGKLKKTHLCTSDQ GCATCTAAGTGTTTCGGCTTATCCTTAACAGCACTCGGGAAATTGAAGA GAACGG LDLHGTLIKNLTAEES AGACTCACCTGTGTACCAGTGACCAACTCGATTTACATGGGACTCTAATT CTTCCTT WVYLGVPFSHIGRSK AAGAATCTTACCGCTGAGGAATCCTGGGTTTATTTGGGTGTTCCTTTCTC AGCCGG SFSPDLEALLNKLQKS ACACATTGGGCGCTCTAAGAGCTTCAGTCCAGATCTTGAAGCTCTTTTAA AATTTTG PLKLQQKLFALRVYLI ATAAACTCCAGAAATCGCCGCTTAAACTGCAACAAAAGCTTTTTGCTTTG TCTGAT PRLLHGLVLSRVAIGE CGAGTTTATCTAATTCCTAGACTGCTCCACGGTTTAGTCCTATCAAGAGT GTATCTT LKIMDKLILKHLRVWL GGCCATTGGCGAATTAAAAATAATGGATAAGTTAATTTTGAAACACTTA GCTTGT RLPKDTPLGFFYSPVK CGCGTATGGCTTCGTTTACCGAAAGATACCCCTTTGGGATTTTTCTACTC GTCCTT LGGLGIKNLRTNVLKC ACCGGTGAAATTGGGCGGTTTGGGGATTAAAAACTTACGTACAAACGTT GAAATA RKQRIERMLVSPDDV TTGAAGTGCAGGAAACAACGCATTGAGCGAATGCTAGTCTCCCCTGATG TACGAC VRLVAESEIFLKETDKL ATGTGGTGCGCTTGGTTGCAGAGTCGGAGATTTTCCTGAAAGAGACGG CCAGGC KDLLTINGMCLDXRN ACAAGTTAAAGGACCTACTTACTATTAACGGAATGTGCCTAGATTANCG TTGCGT VPRTGKNNKFWSER GAATGTGCCTAGAACGGGAAAAAATAACAAGTTTTGGAGTGAAAGGCT CATTTAG LYTSFDGKTLAYSEYF ATATACGTCATTTGATGGTAAAACACTTGCCTATAGCGAGTACTTTACTC GCTCTG TQGGGWIREDKILQP AAGGCGGCGGCTGGATCCGTGAAGATAAGATCCTTCAACCTGCCCATGT GGAAA AHVFAECIKLRINALP TTTCGCTGAATGCATCAAGTTACGTATTAATGCTTTACCCACTAAATCTA (SEQID TKSRVAHGRPTKDRS GAGTTGCACATGGCCGTCCAACTAAAGATCGTTCCTGTCGTGCCGGATG NO: CRAGCLDVQKVPAIE TCTTGACGTTCAAAAAGTGCCAGCTATCGAAACAATCAATCATATAGCAC 1357) TINHIAQVCPRTHGA AAGTTTGTCCTCGGACGCATGGTGCAAGAATAAAAAGACACGATCGGCT RIKRHDRLVQFLSLNL CGTTCAATTCCTTTCTCTCAACCTTAGGAAAAATCCAAAAAGAAATGTTT RKNPKRNVLVEYNFR TAGTGGAGTACAATTTCCGTACAGTCGCTGGTATCCGTAAACCGGATAT TVAGIRKPDIIVIEDTR CATTGTTATTGAAGATACCCGTGCAGCTATCCTTGACGTCCAAGTTGTTG AAILDVQVVGDSSNL GTGATTCGTCCAATTTGGAGATGGAATACCTGGAGAAATCCAGGAAATA EMEYLEKSRKYSNDA TTCTAATGACGCTACTTTATCAATGCGTATCAATGCGCTTCAAAAACTTT TLSMRINALQKLYPT ATCCAACGGTCACTAGCCTAACTTTTCATGCAGTTACCTTCAACAATAGA VTSLTFHAVTFNNRG GGCCTGATTGCCAAATCCACAGTGGCGGCCCTCCGGATGCTCGGCGTCC LIAKSTVAALRMLGV CACCTAGATGCATAATGATCCTTTGTGTGATTAGCCTTGAGAAGACCCTG PPRCIMILCVISLEKTL GAAGTCTGGAGGATGTTTAACCAATCAACTGCTTCCGCTCGGAAATGAT EVWRMFNQSTASAR AATCGCTCCATCCTGCAACTAATTATGAATGCAAATCTGTTAAGTGACAT K(SEQIDNO:1479) TAGTGATACTTACCTGATACTTACCCTGGTATTTATTTGACCTATACTTAC CCTGGTATCTACCTGACATATATTTATCTAACCACCTACCTATGATGACTC CCGCGGAAACTCTCACTTACCTTATTACCCACTTGGTCTTTTATTTTCTCG TTCCTTATTACTTTGTTCCTTTGGTGTAGGGTTCTCTGGTTTTTGGAACGG CTTCCTTAGCCGGAATTTTGTCTGATGTATCTTGCTTGTGTCCTTGAAATA TACGACCCAGGCTTGCGTCATTTAGGCTCTGGGAAA(SEQIDNO: 1111) R2 R2Nvec-A Nematostella GGTTGGGGCCTTCTCGTGGCGGAGTCGTGAGTAAGGGGTATAGGGGTA GGTTGGG TGATGG MLRGTGNMNDKRD vectensis AGGGACACCACGGACCGAGAACGGTTACCGCTCAAGGCGAGTGGTGG GCCTTCTC TGGGTT GSATADPTSALLGAV AAGGCATAAAATCGTAACGCCGCCCTCCGACCTGCTCCTGAAACTAATG GTGGCGG ACTCGC GDGSLVCNLCGLACK CCAACCAACTGACTGTGGGGCTAACCTCCCCAGAGTCAGGATGCTGAG AGTCGTGA CTCTGT SRGGLSIHRRSKHAT GGGTACTGGGAATATGAATGACAAGAGGGACGGATCGGCCACTGCTGA GTAAGGG GTAACA VYHAERQPAPRAKA TCCTACCTCAGCGCTTTTGGGGGCTGTCGGTGATGGCAGCCTGGTATGC GTATAGGG GGCAAA RWTNDEMILVARKQ AATCTGTGTGGCCTTGCTTGCAAGTCTCGTGGGGGGCTCTCGATTCATC GTAAGGG TGAAAG IASEKSRCSAVVEGM GAAGGAGCAAGCATGCCACGGTGTACCATGCGGAAAGGCAGCCCGCTC ACACCACG CTGCGC REAVPHRTFDAVKSL CGCGGGCTAAGGCGAGATGGACGAATGATGAAATGATACTTGTGGCCA GACCGAG AAGCAG KTKNRNYTRILEQIRA GAAAACAAATAGCTTCGGAAAAGAGTAGGTGTAGCGCAGTAGTTGAAG AACGGTTA TCGATG ECSEEEVIESGVLKDR GGATGCGAGAAGCAGTTCCACATAGAACTTTTGACGCCGTCAAAAGTCT CCGCTCAA AGCCAA TENVCVQTTSNVPGS CAAAACCAAGAATAGGAACTACACACGGATTCTGGAGCAGATTCGCGCT GGCGAGT AGCCGC AGRAASVELEGNIQV GAATGCTCCGAGGAAGAGGTGATTGAATCAGGAGTCCTTAAGGATAGG GGTGGAA ACAGCC GHQLAQKTMAGNN ACGGAGAACGTGTGTGTGCAAACAACATCTAACGTTCCCGGTTCGGCAG GGCATAAA CCCGAC SRKQPANHTNWAEF GACGGGCAGCATCGGTGGAGCTAGAGGGTAATATCCAGGTGGGCCATC ATCGTAAC TGGGGT NIEEGNITLRKSKRKA AACTCGCCCAAAAAACCATGGCAGGCAATAATAGTAGGAAACAGCCTG GCCGCCCT ACAGGG NGMPDATHRPGPPT CCAATCATACCAACTGGGCTGAGTTCAATATTGAGGAGGGAAATATTAC CCGACCTG CAGCCC VDSLKHPVCLLQGAA CCTCCGAAAAAGCAAAAGAAAGGCTAATGGTATGCCCGATGCGACCCA CTCCTGAA TGGGCT DKRDEPHTVEQLYYN CCGCCCCGGCCCACCGACTGTCGACAGTCTTAAGCACCCAGTATGCTTG ACTAATGC ATGCCC IEEGMPLAEEQQWS TTGCAGGGTGCAGCCGACAAAAGGGACGAACCGCACACAGTGGAACAA CAACCAAC GAAGTC EKLFDAIDSSLLSVEVE TTGTATTATAACATTGAGGAAGGAATGCCTCTTGCCGAAGAGCAACAGT TGACTGTG TTTTGAC LGRIVPGCPDEETRQ GGTCAGAAAAGCTTTTCGATGCAATCGACTCATCTTTGTTGTCTGTCGAG GGGCTAAC AGGTCC LIDREFLDFIHSYSREK GTCGAGCTGGGGAGAATCGTTCCTGGTTGCCCCGACGAAGAGACCCGA CTCCCCAG AAACTTC PPQRGLAKSKPPPKG CAGCTCATTGATAGAGAATTCTTGGATTTTATCCATTCCTATTCGAGGGA AGTCAGG ACCCTTG PKSLRRQQYRQLQRL AAAGCCGCCACAAAGAGGTCTCGCTAAGTCCAAACCACCACCCAAAGG (SEQID CCGCCA WDKNRSAAAEQALT ACCAAAGAGCCTCAGACGCCAGCAATATCGGCAACTCCAGAGATTGTG NO:1235) GTAGGG GKWQEVRTAAGVPL GGACAAAAACCGATCCGCAGCAGCCGAACAGGCCCTAACAGGTAAATG CACCAG SLMEVPWREIFETPS GCAAGAAGTTCGGACTGCTGCTGGGGTCCCATTATCGTTGATGGAGGTC GGCCAG TTDVREPAPAGPVL CCGTGGCGGGAGATTTTCGAAACACCATCTACGACGGACGTCCGAGAG GTGCAG WQLLRPVTIAEVEDA CCGGCCCCAGCTGGGCCGGTTCTGTGGCAGCTTCTCAGACCAGTGACTA GTGGGC ISSKKSASGPDGVPC TCGCAGAAGTAGAGGACGCAATTTCCTCCAAGAAATCAGCTAGCGGTCC GCTTTGT AALQTMGAASLAAH AGATGGAGTGCCTTGTGCCGCCCTTCAAACCATGGGCGCCGCCTCCCTA TCTATTT FNLWLLAGTQPKRLT GCAGCTCACTTTAACTTGTGGCTGTTGGCCGGTACTCAGCCCAAACGCCT GGTTTC ECRTIFVPKEVNTHLP AACAGAATGCCGAACAATATTTGTGCCTAAGGAGGTAAATACTCACCTG GCTTAAT LHHRPITIGSVVVRLF CCCCTCCATCATAGGCCAATCACTATCGGCTCGGTGGTTGTAAGACTCTT TTTTGTT HQILGKPMEAVLPLG CCACCAGATTCTGGGAAAGCCAATGGAAGCAGTACTGCCATTGGGGTC AAATTTT SGQRGFRKGDGICQ GGGCCAGCGCGGTTTTCGAAAGGGTGACGGAATATGCCAGAACATCTG TCCCGT NIWLLHTLIRRSTDLL GCTCCTCCATACTCTAATCAGACGCAGCACTGATTTGCTGAGGCCACTCA GCGCCC RPLKLVFLDVKKAFDS AGCTGGTCTTCTTGGATGTGAAAAAGGCCTTCGATTCGGTTAGTCACGA ACCCTTT VSHESLLIAAKRLGVP ATCCCTTTTAATTGCGGCGAAGCGTCTCGGAGTGCCCGGGCCGCTCCTA TTAACCC GPLLTYINELYSRSETV ACGTACATCAATGAACTGTACTCTCGTTCGGAGACAGTCTTTGAAGTTG TTTTAAC FEVGGESSGSVKVSQ GGGGCGAGAGCAGTGGCTCTGTTAAAGTATCCCAAGGGGTAAAGCAAG ACCAATT GVKQGDPLSSTLFNC GGGATCCATTGAGTTCGACGCTCTTCAACTGCGTCATTGACTGGGCGGT TTTTTTA VIDWAVSDLDPHIGV CTCAGATCTCGACCCCCACATAGGCGTCTTACTAGGGGAGTCTCGAGTC CAACCCT LLGESRVSFLAYADDL TCCTTTCTAGCTTACGCTGATGACCTGGTTCTCCTATCGGAGACCGAGGC CTACTCA VLLSETEAALTSQLNS TGCACTGACAAGCCAGCTGAACTCAATTGAAAAGAGTCTAGCGCACTGT ATAATCC IEKSLAHCGLKLSTGD GGTCTCAAGCTCAGTACGGGGGACAGTGGTAAATCTGCTTCGCTGAACA AAATGA SGKSASLNIVIDGKAK TCGTGATTGATGGTAAAGCTAAGCGGTGGGTTGTAAACCCTACCCCGTT ATAAAA RWVVNPTPFLRASG CCTACGTGCGTCTGGTGGCGAAATTCGATCTCTCGTCGCAAATGAGACG GCGGCA GEIRSLVANETYKYLG TACAAGTACTTGGGCATTAACATCGGAGCCCAGGGCGTGAAAGCTGCT TAACAG INIGAQGVKAAEYNA GAATACAATGCATTCAAAGAAGCACTCGATAACTTGTCTCGAGCTCCACT GTGAAC FKEALDNLSRAPLKP TAAGCCGCAGCAGAGGTTATTCCTGCTGAAGACCTATCTGTTGCCGCAA AC(SEQ QQRLFLLKTYLLPQLH CTACACCACTCACTGGTATTGTCAAGGACAACTGGCAAACTATTGAACA IDNO: HSLVLSRTTGKLLNSL GCCTCGATGCTCTCGTAAGAAAAGCTGTACGGGGATGGCTTAAATTGCC 1358) DALVRKAVRGWLKL ACACGACACTCATAGGGCCTTCTTTTATGCTCACCAGGCCGACGGGGGA PHDTHRAFFYAHQA CTCAATGTCCCAAGTCTGTACCATCTAATACCTCTCCTAAGGAGGAGTAG DGGLNVPSLYHLIPLL GTATGAACGCTTGACGAGGGTCGAAGACCCTGAGATAAGGGAAGTGTC RRSRYERLTRVEDPEI CCGAACCGACTACTTCAAAAGAGTCTTGGGTGCCGCGGCGGCGGCCAC REVSRTDYFKRVLGA CACGGTTGCAGGGCATCGAATTGATTCCAAAGTAACGTTGCGCCTAGCT AAAATTVAGHRIDSK TGGCGCGAGGCCCTTTACGCTTCAGCTGACGGGGGGGGTCTCTCCCAAT VTLRLAWREALYASA GCCCGTTGGTACCCGAGGTTCACTCCTGGGTTACCGATGTTTCCGGTCTT DGRGLSQCPLVPEVH CAGACCGGATCGCAGTACATCTCTGCTGTTCGCTTAAGGGGGGCCTTGC SWVTDVSGLQTGSQ TTCCCACGGCGGTAAGGAAATCGAGGGGTCGCGGGGGTGTTAACTCGA YISAVRLRGALLPTAV ATTGCGACTGTTGCGGTCGTGGACAACCAGAGTTTCTTGGGCACGTTCT RKSRGRGGVNSNCD CCAGACGTGTCCTCGGACCTGGGGGTCTCGCATCAGTCGTCACAACCAC CCGRGQPEFLGHVL GTCCTCTCCCTTATCGCTAAAGCGTGCCGTTCGCGAAGATGGCAGGTGC QTCPRTWGSRISRHN TGGAAGAGCCGATCATCCAGACTCCTGCGCAGTTGCTCAAGCCAGACTT HVLSLIAKACRSRRW GGTGATCTGGAACCATCAAGCGGCGTACGTCGTTGACGTTAGCGTACCG QVLEEPIIQTPAQLLK GGTGATAACACCCCGCTCAGTACGTGTCACAACAGGAAAGTCGCATATT PDLVIWNHQAAYVV ACTCTGGTGAGAGCGTTCGTGAATGGGTGCGTTCCAAGACTGGGCATA DVSVPGDNTPLSTCH ATCCTACAGTGTCCTCTGTAGTGATTAACTGGCGCGGTGCCATGGCAAA NRKVAYYSGESVRE GGAGTCCTACCGCCTTTTGACGAAAGACCTTCGTCTAGCGAAGACCCTG WVRSKTGHNPTVSS CCTCGTCTGCTGGTACTAAGAGTTCTGGAAGGCGGCCATGGAATCTGGC VVINWRGAMAKESY TCAATTTCCATCGGTCGACCTTCGCCGTGGGAGTAACTTGATGGTGGGT RLLTKDLRLAKTLPRL TACTCGCCTCTGTGTAACAGGCAAATGAAAGCTGCGCAAGCAGTCGATG LVLRVLEGGHGIWLN AGCCAAAGCCGCACAGCCCCCGACTGGGGTACAGGGCAGCCCTGGGCT FHRSTFAVGVT(SEQ ATGCCCGAAGTCTTTTGACAGGTCCAAACTTCACCCTTGCCGCCAGTAG IDNO:1480) GGCACCAGGGCCAGGTGCAGGTGGGCGCTTTGTTCTATTTGGTTTCGCT TAATTTTTGTTAAATTTTTCCCGTGCGCCCACCCTTTTTAACCCTTTTAACA CCAATTTTTTTTACAACCCTCTACTCAATAATCCAAATGAATAAAAGCGG CATAACAGGTGAACAC(SEQIDNO:1112) R2 R2Sm-A Schistosoma ATGTTTTAATTTATTTTTGAACTACTACTGTCTGAGTGCTTCTTACAACCT ATGTTTTA TAGCCC MPVSTGAETDITSSLP mansoni GAAGGCTCAGAAACTACCCACTTTTTGCTGTTTATCCACAACAACAGTTG ATTTATTTT CCTTCAC IPASSIVSPNYTLPDSS TGAATCTATTCTCCAAATATTCCTTGTGCTTTTGTCAACATTATTCTATACC TGAACTAC TCTTAGA STCLICFAIFPTHNILLS AACTGTACCACCTACTTCTTCATCTCACGTTTTAATTCTGGTCTAATTTTCT TACTGTCT CATTCCC HATAIHHISCPPTPVQ CATCATTAGTCACGGAGAGGGCCTATGAACGGTCCGTGACGCGAAATTC GAGTGCTT CCACTGT DGSQQMSCVLCAAA AATCCACGAATTCGTCCTCTTCTGCTAGTGGTCCCCGAAATACGGTTCCT CTTACAAC TGTTGCT FSSNRGLTQHIRHRHI CTGGCCTGTCAGTTGTGTTAAAACTATATAATAACGATGCCGGTCTCAAC CTGAAGGC TATCTTC SEYNELIRQRIAVQPT CGGCGCAGAAACTGACATAACCTCTTCTTTGCCTATTCCTGCATCCTCAA TCAGAAAC ATGTTTT SRIWSPFDDASLLSIA TCGTCTCGCCAAACTACACACTCCCTGATTCCTCTTCAACCTGCCTTATAT TACCCACT TGTGTTA NHEAHRFPTKNDLY GTTTCGCTATCTTCCCCACCCACAACATACTCCTCTCCCATGCCACTGCAA TTTTGCTG ATTGACT QHISTVLTRRTAEAVK TCCACCATATTTCTTGTCCTCCTACTCCAGTGCAAGACGGTTCTCAGCAG TTTATCCA GCTCTCT RRLLHLQWSRSPTAI ATGTCTTGTGTTCTTTGCGCCGCCGCTTTTTCATCTAACAGGGGACTAAC CAACAACA TCTGGG TTSSNNHTTTDIPNTE ACAACACATTCGCCACCGGCACATCTCCGAATATAACGAACTAATCAGA GTTGTGAA TTGATGT ARYIFPVDLDEHPPLS CAACGAATTGCAGTGCAGCCGACGTCTCGCATATGGTCACCATTCGATG TCTATTCTC CTGATT DATTPDASTHPLPELL ATGCTTCTCTACTATCAATCGCTAACCATGAAGCCCATAGATTCCCCACG CAAATATT GTCTCTC VILTPLPSPTRLQNISE AAGAATGACTTATACCAACACATCAGCACTGTATTAACACGCAGGACGG CCTTGTGC TCTCTTT SQTSHESNRNSMHT CAGAGGCCGTCAAACGCCGACTCCTCCACCTACAGTGGTCCAGATCACC TTTTGTCA CCATATT PPTYACDSDESLGVT CACAGCGATTACTACCTCTTCGAATAATCACACAACCACAGACATCCCCA ACATTATT GCTTGC PSSTIPSCFHSYRDPL ATACCGAGGCCCGATATATTTTTCCGGTAGACCTAGACGAACATCCACC CTATACCA TCTCCCC AEQRSKLLRASASLLQ ATTGTCTGATGCCACAACCCCCGACGCATCGACACATCCACTCCCAGAAC ACTGTACC GCTTACT SSCTRIRSSSLLAFLQN TCCTTGTCATCTTGACACCGCTTCCATCCCCGACTAGACTACAAAACATA ACCTACTT TCCAATA ASTLMDEEHVSTFLN TCCGAATCACAGACCTCCCATGAATCCAATAGGAACTCAATGCATACACC CTTCATCT GTTGTC SHGEFVFPRTWTPSR GCCAACGTATGCCTGCGATTCGGATGAGTCACTAGGGGTTACTCCCTCA CACGTTTT ATATTAT PKHPSHAPANVSRKK TCAACTATCCCCTCATGCTTCCACAGTTATCGGGACCCCCTAGCTGAACA AATTCTGG GTCTTTG RRKIEYAHIQTLFHHR AAGAAGCAAACTCCTGAGGGCATCCGCCAGCCTACTACAAAGCAGTTGT TCTAATTTT TTTACTT PKDAANTVLDGRWR ACTCGCATACGGTCCTCCAGCCTGCTCGCCTTCCTCCAAAACGCATCCAC CTCATCAT GCCATG NPYVANHSMIPDFD ATTAATGGACGAGGAACACGTGTCCACCTTCCTCAATAGTCATGGAGAA TAGTCACG TCTAAC CFWTTVFTKTNSPDS TTCGTCTTCCCTAGAACATGGACCCCATCCCGACCCAAACACCCCTCCCA GAGAGGG GACAAT REITPIIPMTPSLIDPIL CGCCCCAGCTAATGTTTCTAGGAAGAAAAGGAGGAAAATAGAGTACGC CCTATGAA TACTTTA PSDVTWALKEMHGT ACACATCCAGACACTCTTCCACCACCGTCCCAAAGATGCCGCCAACACCG CGGTCCGT TCTACCT AGGIDRLTSYDLMRF TTCTAGACGGTCGGTGGAGAAACCCCTATGTCGCAAACCATTCAATGAT GACGCGA TAGTTG GKNGLAGYLNMLLA TCCAGACTTCGACTGCTTCTGGACAACAGTCTTTACTAAAACAAATTCCC AATTCAAT GTCCTCT LAYLPTNLSTARVTFV CAGACAGCCGGGAGATTACTCCAATCATCCCTATGACTCCCTCTCTCATT CCACGAAT TGGTTT PKSSSPVSPEDFRPIS GACCCGATCCTCCCCTCTGACGTCACATGGGCGCTGAAAGAAATGCATG TCGTCCTC GGTTGC VAPVATRCLHKILAKR GCACGGCCGGTGGGATTGATCGTCTAACATCGTACGATCTGATGAGATT TTCTGCTA CTTCATG WMPLFPQERLQFAF CGGGAAGAATGGTCTTGCTGGATATCTCAACATGCTACTCGCTCTTGCAT GTGGTCCC TGTTCAT LNRDGCFEAVNLLHS ACCTTCCCACTAATCTCTCAACAGCACGGGTAACTTTCGTCCCCAAGTCA CGAAATAC GGCGGA VIRHVHTRHAGASFA TCAAGTCCTGTGTCACCTGAGGACTTCCGTCCCATCAGTGTCGCTCCAGT GGTTCCTC ATCTGAT LLDISRAFDTVSHDSII AGCCACTAGGTGCCTGCACAAAATTCTAGCAAAGAGATGGATGCCGCTC TGGCCTGT GTTTATA RAAKRYGAPELLCRYL TTTCCACAGGAACGACTTCAGTTCGCTTTCCTAAACCGAGATGGATGCTT CAGTTGTG ATGACT NNYYRRSTSCVNRTE TGAAGCAGTTAATCTTCTGCACTCGGTCATACGGCACGTCCACACCCGCC TTAAAACT ATTCCTA LHPTCGVKQGDPLSP ATGCAGGAGCATCCTTCGCCCTGCTCGACATATCACGGGCCTTTGACACT ATATAATA CTACCAC LLFIMVLDELLEGLDP GTATCACATGACTCCATCATCAGAGCGGCGAAAAGATATGGGGCACCTG ACG(SEQ CATTACA MTHLTVDGESLNYIA AACTGTTATGCCGCTACCTCAATAACTATTACCGACGTTCAACCAGCTGC IDNO: ACTATTA YADDLVVFAPNAELL GTCAACCGCACTGAATTGCATCCTACGTGTGGGGTGAAGCAAGGAGAC 1236) TTATTAT QRKLDRISLLLHEAG CCCCTGTCGCCACTCCTCTTCATCATGGTTCTCGACGAATTACTGGAAGG CACTATT WSINPEKSRTLDLISG TCTAGATCCAATGACCCACCTAACAGTTGATGGAGAGAGCTTGAACTAC ATTAAC GHSKITALSQTEFTIA ATAGCTTATGCTGACGATCTCGTAGTTTTCGCTCCAAATGCAGAACTCCT ATTATTA GMRIPPLSAADTEDY TCAACGGAAACTCGATCGGATCTCCCTACTTCTACACGAGGCTGGATGG TTACTTC LGIKSNFKGRCPVAHI TCGATTAACCCTGAAAAAAGCCGGACCCTGGACCTAATCTCTGGTGGCC TACAATT DLLNNYLTEISCAPLK ATTCCAAAATCACAGCGCTCTCTCAGACAGAATTCACCATCGCGGGGAT AGTATT PQQRMKILKDNLLPR GCGTATACCACCGCTTTCCGCCGCCGACACCTTCGACTATCTGGGTATCA ATGGCT LLYPLTLGIVHLKTLKS AATCCAACTTCAAGGGCCGATGCCCAGTGGCCCATATTGACTTATTGAA ACTCCTT MDRNIHTAIRKWLRL CAACTACCTCACGGAAATATCGTGCGCTCCACTTAAGCCGCAGCAGCGC TCAGCA PSDTPLAYFHSPVAA ATGAAGATCTTGAAAGATAATCTACTCCCTCGACTCCTCTACCCCCTGAC CACCAA GGLGILHLSSSVPFHR TCTAGGAATAGTACACCTGAAAACCCTGAAGTCAATGGACCGAAATATC TAAAAT RKRLETLLSSPNRLLH CACACGGCCATAAGGAAATGGTTGCGGCTACCCTCCGACACCCCGCTAG CTCAATC KLPTSPTLASYSHLSQ CATATTTTCACTCACCCGTCGCTGCCGGAGGCCTAGGGATCCTCCATCTG AAACAT LPVRIGHETVTSREEA TCCTCATCGGTTCCATTCCACCGTCGAAAACGTCTAGAAACCCTCCTATC CTCACTT SNSWVRRLHSSCDG TTCACCGAACCGCCTACTGCACAAGTTGCCAACTTCCCCAACACTAGCTT ATTAAA KGLLLAPLSTESHAW CTTATTCACACCTTAGTCAACTGCCAGTTCGAATTGGGCACGAGACCGTA CTCTCTA LRYPQSIFPSVYINAV ACGTCTAGAGAAGAGGCTTCCAACAGCTGGGTGAGACGATTACATTCGT TTTCCCC KLRGGLLSTKVRRSR CCTGCGACGGGAAGGGACTACTCCTAGCACCACTAAGCACCGAGTCCCA TTCGTTA GGRVTNGLNCRGGC TGCATGGCTGCGCTACCCCCAGTCTATTTTTCCAAGTGTTTACATCAACG TAAACTT AHHETIHHILQHCALT CCGTTAAATTACGAGGTGGCTTACTATCCACCAAAGTCAGGAGATCTCG ACAATTC HDIRCKRHNELCNLV CGGAGGTAGAGTGACGAATGGCCTGAACTGTCGAGGCGGTTGCGCCCA AGTTTA AKKLRRQKIHFLQEPC TCATGAAACAATCCACCACATTCTGCAACATTGCGCGCTCACCCATGACA ACCGAA IPLEKTYCKPDFIIIRDS TCAGATGCAAACGCCATAACGAACTATGCAACCTTGTGGCAAAGAAACT TATCTCT IAYVLDVTVSDDGNT GCGTAGGCAAAAAATCCATTTCTTACAGGAGCCCTGCATTCCTCTAGAA CTTTTAC HASRLLKISKYGNERT AAAACTTACTGCAAACCTGATTTTATAATTATACGTGATTCAATTGCTTAT AAATCTT VASIKRFLTSSGYIITS GTTCTAGACGTCACTGTATCGGACGACGGAAACACCCACGCCAGCCGCC AAGTAT VRQTPVLTFRGILERA TGTTAAAAATATCAAAATACGGCAATGAGCGAACCGTCGCATCGATCAA GTAATTT SSQSLRRLCFSSRDLG GCGATTCCTCACATCCAGTGGATATATCATTACCAGTGTTCGACAAACAC TGTGCC DLCLSAIQGSIKIYNTY CAGTCCTTACATTCAGAGGTATTCTGGAGAGAGCAAGTTCACAATCCCT AAGCCC MRGTQRLNE(SEQ ACGACGCCTATGTTTTTCGTCCCGTGACCTCGGTGACCTTTGCCTGAGTG ATTTGG IDNO:1481) CGATTCAAGGCTCAATTAAAATATATAATACCTATATGAGAGGAACCCA GTCTGT ACGGCTGAACGAATAGCCCCCTTCACTCTTAGACATTCCCCCACTGTTGT ACAATTT TGCTTATCTTCATGTTTTTGTGTTAATTGACTGCTCTCTTCTGGGTTGATG GATACTT TCTGATTGTCTCTCTCTCTTTCCATATTGCTTGCTCTCCCCGCTTACTTCCA AAAAAT ATAGTTGTCATATTATGTCTTTGTTTACTTGCCATGTCTAACGACAATTAC AAATGT TTTATCTACCTTAGTTGGTCCTCTTGGTTTGGTTGCCTTCATGTGTTCATG TAT(SEQ GCGGAATCTGATGTTTATAATGACTATTCCTACTACCACCATTACAACTA IDNO: TTATTATTATCACTATTATTAACATTATTATTACTTCTACAATTAGTATTAT 1359) GGCTACTCCTTTCAGCACACCAATAAAATCTCAATCAAACATCTCACTTAT TAAACTCTCTATTTCCCCTTCGTTATAAACTTACAATTCAGTTTAACCGAA TATCTCTCTTTTACAAATCTTAAGTATGTAATTTTGTGCCAAGCCCATTTG GGTCTGTACAATTTGATACTTAAAAATAAATGTTAT(SEQIDNO:1113) R2 R2Tc EU854578 Triops TTTTTGGTCTGGCATTTGATCGTTTCCGCCTCCTCGTGGCGCCAGACTGG TTTTTGGT TAGATG MSQKRRPEKAVPDE cancriformis GTAAGCTGATTTATCAGTGAGCTAAGAGAAACGATCACCGCAGGAGTCC CTGGCATT ACTGCC GATAHDVAQPDKSK ATCTACCTGCGCTGCACGTGATTTCATTGTGCACTTGGCGAGTTATCCCT TGATCGTT CTACCCC CSVCGETFKGPASVT TGTGGAGCTCGCTCGTTAGCTTTTGAGATGTCTCAAAAGCGCAGACCGG TCCGCCTC TTTGCTG MHMVKKHPVEFNEL AGAAAGCAGTCCCCGACGAGGGAGCGACTGCTCATGATGTTGCGCAAC CTCGTGGC CCGAAG KMAKKPVPKKVRWS CGGATAAGAGTAAGTGTTCTGTTTGTGGGGAGACTTTTAAGGGTCCTGC GCCAGACT AACTACT EEEIFQLARTEAELTL TTCCGTGACAATGCACATGGTAAAGAAGCACCCAGTCGAATTTAATGAG GGGTAAG GAAGAC QGVRFINVELQKIFPA CTTAAGATGGCCAAGAAGCCTGTTCCGAAGAAAGTACGATGGTCTGAG CTGATTTA TATTGA REIEGIKGQRKLAKYK GAGGAGATCTTCCAACTTGCCAGGACAGAGGCCGAACTTACCCTTCAGG TCAGTGAG ACTAAC ELVKDQLDEIGRAPN GAGTTCGGTTTATTAATGTCGAGCTGCAGAAGATCTTCCCAGCACGCGA CTAAGAGA CAACTG PPEQEIGEDVPSPFKA GATCGAGGGCATCAAAGGCCAGAGGAAACTGGCTAAGTACAAAGAGCT AACGATCA TGTTAA WLELLLALPKTPNDFL CGTTAAAGATCAGCTGGACGAAATCGGCCGTGCTCCTAACCCCCCTGAG CCGCAGGA GTAAGA EHKLDNIIVQALKEDV CAAGAGATCGGAGAGGATGTGCCTAGCCCGTTCAAGGCATGGCTGGAA GTCCATCT ACTAAT NSDQVFNDLNSYLKL CTTCTGCTTGCGCTCCCCAAGACCCCCAATGATTTTCTAGAACACAAACT ACCTGCGC GCCTCTT ILEPSGRAKSVPGEIIH TGACAACATTATTGTTCAAGCGCTGAAGGAAGACGTGAACTCCGATCAG TGCACGTG TTTCCCT GDPSGSAKTSVTKAP GTATTCAATGACCTTAACTCCTATTTAAAGCTGATTCTTGAGCCGAGTGG ATTTCATT GCATGT KPATVSSSRKKRRDA TAGAGCCAAGTCAGTTCCCGGGGAAATTATACACGGTGATCCCAGTGG GTGCACTT ATCCCCT EFARIQRLYRKNRTSC GTCTGCTAAGACCTCAGTAACCAAGGCTCCTAAGCCTGCTACAGTTTCTT GGCGAGTT GATCAG INTILDGNTREHEAPK CTAGCAGGAAAAAACGCAGAGATGCCGAGTTTGCTAGGATCCAGCGTC ATCCCTTG TGACTTA NMEGFWREIFERESP TCTATCGGAAAAATCGGACATCATGTATAAATACAATCCTTGATGGCAAT TGGAGCTC TTTTCTT DDPDDPDIFLEEEAS ACAAGAGAGCATGAGGCCCCTAAGAACATGGAAGGATTTTGGAGGGAA GCTCGTTA TTCCTGT DIWKYISFYEMCNLY ATATTCGAGAGGGAATCCCCGGATGACCCCGATGATCCCGACATTTTCC GCTTTTGA TGCGCC PPPSTAPGPDGFSSK TTGAGGAAGAAGCCAGTGATATTTGGAAGTACATCTCTTTCTATGAGAT G(SEQID CTTTGTT DLRRMTPRVLNKILN GTGCAACCTTTATCCGCCTCCCTCCACTGCTCCTGGCCCCGATGGGTTTT NO:1237) TAGTTAT LLLHLRDLPQILKSHR CCTCTAAGGACCTCCGGCGGATGACGCCAAGAGTCCTCAACAAAATTTT TTCCTTT TVLIPKTDLPTKPGDF AAACCTACTCCTTCATTTGAGAGATCTTCCTCAGATCCTGAAATCCCATC AATTACT RPITISNILVRHLNKIL GAACTGTACTAATACCAAAAACCGATTTGCCTACTAAACCCGGGGACTTT AGAATT ANRVSHLIPINERQK CGTCCCATAACGATCTCAAATATACTAGTCAGACATCTTAATAAGATTCT ATCTTTT AFLPIDGCAENIFTLD GGCTAATAGAGTGAGCCACCTTATACCAATCAATGAGAGGCAGAAAGC CGTTCTC FILHHARTKIKSLSMA TTTCCTACCAATTGATGGCTGCGCAGAAAATATTTTCACTTTGGACTTCA CGTCTA ILDISKAFDSVSHHSIF TACTCCATCATGCGCGGACCAAAATTAAATCGCTGTCAATGGCCATTCTA ATTGCTT RALREARCPIGFIKFIE GATATTAGTAAGGCATTTGATAGCGTCTCGCATCATTCCATCTTCCGCGC TTTCGGT NCYGGCFTKLFCGGV TTTGCGAGAGGCTCGATGCCCCATTGGTTTTATTAAATTCATTGAGAATT GTAGGA KYPSEVSMNRGVKQ GTTATGGTGGATGTTTTACCAAACTTTTCTGTGGTGGCGTGAAATACCCG ACGGCT GDPLSPVLFNLVIDGL AGTGAGGTTTCTATGAACCGAGGAGTAAAGCAGGGGGATCCGCTGTCA ACCTAA IRQIPSALGFNVSDQ CCAGTCCTATTTAACTTAGTCATTGATGGTCTGATCAGGCAAATTCCATC AGCTGG VKVSCIAYADDLILIAT CGCTCTTGGATTCAACGTAAGTGATCAGGTGAAAGTCAGCTGTATCGCC AAGTGG TRAGLKTLLDLTNSYL TACGCTGATGACCTGATCCTTATTGCGACCACAAGGGCTGGCCTGAAGA GAAGTG AKRGLSLNPDKCSAL CCCTGCTTGACTTGACTAACTCCTACCTAGCAAAACGGGGACTTTCGCTC TTTTCAA SIVASGKQKLVYIASS AACCCGGACAAATGCTCGGCCCTATCGATTGTGGCATCCGGCAAGCAGA TGTACTT EHFDLAGQKMRNLN AGCTTGTATATATTGCGTCTTCCGAGCATTTTGATCTCGCAGGTCAGAAA TGTGATT VGDSWRYLGIQFSHL ATGCGCAATTTAAACGTCGGTGACTCTTGGAGATACCTGGGTATTCAAT ATAGAA GRAEKVTPDLTCLINR TCTCGCATCTCGGTAGAGCCGAAAAGGTCACTCCAGACCTAACATGTCT ATATAT LQKAPLKLQQKLYAL AATTAATAGACTTCAAAAGGCCCCTTTAAAGCTTCAACAGAAGCTTTATG GACCCG RIYLIPRLIHGLTLSKT CTCTCCGAATCTATCTTATACCCAGGCTTATCCATGGCCTCACCCTCTCTA AGGTGC NLGELKTLDKLIRKYIR AGACCAACCTCGGAGAGCTTAAAACGCTGGACAAGCTAATCCGCAAGT ATTGTTT AWLHLPDDTPMGYF ACATCCGCGCCTGGCTCCATCTCCCAGATGACACTCCGATGGGCTATTTC GGCATT YTPLKAGGLGLPSLRL TATACCCCCTTAAAAGCTGGTGGACTTGGCCTTCCTTCACTCCGTCTAGT TCCTCGA VILNNRLERILRMKAS GATCCTTAACAATCGACTTGAACGGATACTAAGGATGAAGGCTTCTCAA GAAA QDIIVRTIAESETLGVE GACATTATTGTCCGCACTATTGCCGAATCGGAGACGTTGGGGGTCGAGA (SEQID IRKLHDLLSIDGTILDT TTCGAAAGCTTCACGATCTGTTGTCCATTGATGGAACAATTCTGGACACG NO: SVKIHSFWAERLYSSY AGCGTTAAGATCCATAGCTTCTGGGCAGAAAGACTCTACTCGTCCTATG 1360) DGKCLCNSANFPPG ATGGTAAATGTCTTTGCAATTCTGCAAATTTTCCCCCCGGAAACAAGTGG NKWIGEDSLNQRSHI ATCGGTGAAGATTCTCTGAACCAACGCTCGCATATCTTCGCTGATTGCCT FADCLKLRINALPTRS AAAGCTCCGGATAAATGCGCTCCCAACACGTTCCAGAACAGCACGTGGA RTARGRPLKDKPCRA AGGCCACTAAAAGATAAACCATGTAGAGCTGGATGTCGGAACAGTGAT GCRNSDGVKVIETLN GGTGTAAAGGTCATTGAGACACTGAATCACATTACACAGGTGTGTGAG HITQVCERTHGARVK AGAACCCACGGAGCCCGCGTCAAGCGGCATGATCGCCTAGTGGACTTC RHDRLVDFAVKGLQ GCGGTGAAAGGACTACAAAGACCCCATCGTGTGGTTCTAAAGGAGCCA RPHRVVLKEPHYKTV CATTACAAGACAGTAAACGGTGTTCGGAAACCGGACATCGTCATTAAAA NGVRKPDIVIKIPDHT TCCCCGACCACACATACATTTGTGATTTCCAAGTTGTATCTGATACCTCTT YICDFQVVSDTSCLEL GTTTAGAGTTGGAGTTCCGGAAGAAAGCCCTCAAGTATGCGGAGGACA EFRKKALKYAEDKGL AAGGCCTCTGCGATCAACTTACCCGCGACCATCCTGGGGAGCTCTCATT CDQLTRDHPGELSFT CACCGCAATTACTTTCAATACAAGAGGGCTTATAGCCAAGTCTTCAGTCA AITFNTRGLIAKSSVT CCGCCCTCCGGAAATTGGGGATGCCACCCAGATCTATCATGACCCTCCA ALRKLGMPPRSIMTL GAAAATATGTATGGAGGGTTCTCTGGAGATCTGGAGGATTTTTAACCAG QKICMEGSLEIWRIF ACTACTGCTATGGCCCGCAATTAGATGACTGCCCTACCCCTTTGCTGCCG NQTTAMARN(SEQ AAGAACTACTGAAGACTATTGAACTAACCAACTGTGTTAAGTAAGAACT IDNO:1482) AATGCCTCTTTTTCCCTGCATGTATCCCCTGATCAGTGACTTATTTTCTTTT CCTGTTGCGCCCTTTGTTTAGTTATTTCCTTTAATTACTAGAATTATCTTTT CGTTCTCCGTCTAATTGCTTTTTCGGTGTAGGAACGGCTACCTAAAGCTG GAAGTGGGAAGTGTTTTCAATGTACTTTGTGATTATAGAAATATATGAC CCGAGGTGCATTGTTTGGCATTTCCTCGAGAAA(SEQIDNO:1114) R2 R2_DAn Drosophila AGAATATGGATTTGATTGTGCAGAGGGGGTGCTATACCGTAACTCGTAA AGAATATG TAGCCA FERRKDPWGYRPPG ananassae GCCATGCAATCAGATCAAGTCGACTCAAAACCTCCTCGTGGTATTCTCTG GATTTGAT ATGCAC TLKQIGATENNEPRN GGTGCCAGTATTTACTGGTAGCTGATTTGAGCGGCGAAAGGATCCTTGG TGTGCAGA GGGTTC LNRFVRGESTASSLES GGTTACCGGCCCCCTGGAACCTTAAAACAAATTGGTGCAACTGAAAATA GGGGGTG CAGATT TQFGTSAEVNLAGRV ACGAGCCTCGGAACCTAAATCGTTTTGTAAGAGGAGAATCCACGGCTTC CTATACCG AAGCTT PCTICEMTFSSKRGLG CAGCCTGGAGAGCACACAATTTGGAACCAGTGCAGAGGTTAACCTTGCA TAACTCGT GCTGCC VHMSHRHKDDLDA GGGAGGGTGCCCTGTACGATATGTGAAATGACGTTCAGCTCCAAGAGG AAGCCATG GAAGCA QRLRVDKKARWSEE GGTTTGGGCGTTCACATGTCACATCGGCACAAAGACGATCTTGATGCAC CAATCAGA TACCATC ETLMMARKEVELAA AACGTCTTCGTGTCGATAAAAAGGCAAGGTGGTCAGAGGAAGAAACCT TCAAGTCG AAAATC SGVRFLNKKLAEIFTH TGATGATGGCGAGAAAGGAGGTTGAGCTTGCAGCAAGTGGTGTACGAT ACTCAAAA GGCATA RSADAISSYRKRSEYK TTCTTAATAAGAAGCTAGCGGAGATTTTCACCCACCGCAGTGCCGATGC CCTCCTCG AAATTC AKLEQIRGQSVPTPE GATATCTTCGTATCGGAAGAGGAGTGAGTACAAGGCAAAACTAGAGCA TGGTATTC GCTTAAT AEEINTTQRRPSNSE GATAAGGGGGCAATCCGTTCCCACCCCAGAAGCAGAAGAAATCAACAC TCTGGGTG AAAGGA QNRRVPRSEGGPIAP CACACAGCGCCGCCCTAGTAATAGCGAGCAAAACCGACGAGTACCAAG CCAGTATT GGTGGT TEQTNNEILRVLQGL ATCAGAAGGGGGACCAATCGCACCAACCGAACAGACGAACAACGAAAT TACTGGTA TTTAGTA APVVCLPRWRAEVL CCTTAGGGTACTACAGGGTCTAGCACCTGTAGTATGCTTACCCCGGTGG GCTGA CGTAGG QNIVDNAQVSGQET AGAGCCGAGGTCCTGCAAAATATCGTAGATAATGCGCAGGTCTCGGGA (SEQID CGTCCC TLQSLSSYLMEIFPPR CAGGAAACCACTCTCCAAAGTTTATCCAGTTATCTCATGGAAATTTTTCC NO:1238) GGGACT NEPHILTRPRTEPRN GCCACGGAATGAACCGCACATTCTGACGAGGCCCCGAACGGAGCCTCG TGTCTC MRQRRRQQYARVQ AAATATGAGACAACGTAGAAGGCAGCAGTACGCGAGGGTTCAGCGTAA GGGATG RNWDKHPGRCIKSLL CTGGGATAAACATCCGGGGCGATGCATAAAGTCCCTACTGGAGGAAGA AATCGT EEDDESVMPNQEV TGATGAGTCGGTGATGCCAAACCAGGAGGTCATGGAGCCATATTGGAG GCATGC MEPYWRRVMTQPS ACGGGTAATGACTCAGCCTAGCTCAAGCTCGATAAAACGCGACATGTTT GTATAA SSSIKRDMFNMEHSL AACATGGAGCATTCACTCGAGAGGGTATGGTCCGCTGTGAACCAGCGC TTGGGA ERVWSAVNQRDLRA GATCTTAGGGCCACAAAAGTCAAATTATCTAGTTCTCCAGGCCCGGACG TCGATA TKVKLSSSPGPDGITP GGATCACTCCAAAAACTGCCAGGAGTGTCCCCGAAGGCATTATGCTTCG ACAAAT KTARSVPEGIMLRIM CATAATGAACTTGATCCTCTGGTGCGGGAATTTGCCGTACTCTATCCGTC ACCAAC NLILWCGNLPYSIRLA TGGCCCGAACCATCTTCATTCCGAAGAAGGCGACGGCAAATCAACCGCA TAAGTT RTIFIPKKATANQPQ AGACTATCGTCCTATTTCAGTCCCCTCGGTTATAGTTAGGCAACTAAATG ATTACTA DYRPISVPSVIVRQLN CCATTTTGGCTTCCCGGTTGAGCGCAGCCATCAACTGGGACACGCGTCA ATATATC AILASRLSAAINWDT GCGAGGGTTCCTACCTACCGATGGGTGTGCTGATAATACGACGATTGTT GAAATA RQRGFLPTDGCADN GATTTAGTTTTGAGGGAACATCATAAGCGATTTAAATCGTGCTACATCG CATAAA TTIVDLVLREHHKRFK GGACCCTCGATGTTAGTAAGGCCTTTGATGCTGTAGCTCACGAAGCGGT TATCCCG SCYIGTLDVSKAFDAV CTACAACACATTGGCTTCATATGGTGCCCCGAAAGGCTTCATCAACTACT TCCTTAC AHEAVYNTLASYGAP TACGGAAGGCGTACGAGGGCGGCGGCACAATGCTCGCTGGGAACGGG GTATCTT KGFINYLRKAYEGGG TGGGTTTCAGAGGCGTTCATTCCTGCCCGAGGAGTGAAGCAGGGTGAC TGAAGA TMLAGNGWVSEAFI CCTCTGTCTCCCATACTATTCAACTTGGTCATTGACCGGTTGCTTAGGTCC TTTCCAT PARGVKQGDPLSPIL TTACCCAGTGAGATTGGTGCCAAAGTCGGAAATGCCATGACAAACGCA CCTCAG FNLVIDRLLRSLPSEIG GCAGCATTCGCAGATGATATAGTCCTTTTTGCGGAAACTCCGATGGGGC CGAACA AKVGNAMTNAAAFA TTCAGAAATTGTTGGACACCACCGTTTGTTTCCTTTCCTCGGTGGGTCTC AAAAAA DDIVLFAETPMGLQK ACCCTTAATACTGATAAATGTTTCACGGTCAGTATTAAGGGGCAAGCCA AAAAAA LLDTTVCFLSSVGLTL AACAAAAGTGTACCGTCGTCGAACGGCGAAGCTTCTTGATTGGCGGGC A(SEQ NTDKCFTVSIKGQAK GCGAGTGTCCTTCATTGAAGCGTACTGACGAGTGGAAGTACTTAGGGAT IDNO: QKCTVVERRSFLIGG TAAATTCACTGCGGAGGGGCGGGCCCGGTACGATCCAGCAGAGGACCT 1361) RECPSLKRTDEWKYL CGGTCCAAAGCTGTTGAGATTGACTCGGGCCCCCCTGAAACCACAACAG GIKFTAEGRARYDPA AAGTTATTTGCCCTTCGGACCGTCCTTATCCCACAACTCTATCACAAGCT EDLGPKLLRLTRAPLK GACCCTTGGGAGTGTGACGATAGGCGTTCTGAAGAAATTTGACAAATTA PQQKLFALRTVLIPQL GTTCGATATACCGCACGGAAGTGGTTGGGGCTTCCGGTGGACGTACCA YHKLTLGSVTIGVLKK GTTTCTTTTTTCCATGCCCCCCACAAGAGTGGGGGTCTCGGGTTACCATC FDKLVRYTARKWLGL TCTAAGATGGACAGCACCAATGCTTCGACTAAAGCGATTGAGCAACATA PVDVPVSFFHAPHKS AAATGGCCTCACCTCGAGCGATCCGAGGTAGCCAGCTCTTTCGTGGAGG GGLGLPSLRWTAPM AAGAAATGCGGAGGGCCCGGGATAGGCTTCAGGCTGGAAGTGAAGAA LRLKRLSNIKWPHLER CTGTTAACCCGTTCGCAGGTAGATTCGTACTTGGCAAATAGATTGCACAT SEVASSFVEEEMRRA GTCTGTTGATGGTTGCGGGCTCCGTGAAGCAGAGCGTTTTGCTCCGCAA RDRLQAGSEELLTRS CACGGGTGGGTTAGTCAGCCCACGCGTTTGCTAACAGGAAAGGAATAT QVDSYLANRLHMSV ACTGATGGAATCAAACTGCGGATAAATGCCCTACCCTCAAGGTCTCGTA DGCGLREAERFAPQ CTACGAGGGGAAGGCACGAATTGGAGAGACGGTGCCGTGCAGGATGT HGWVSQPTRLLTGK GATGCTCCCGAAACAACAAATCACATCTTGCAGCAATGCTATAGAACTC EYTDGIKLRINALPSR ACGGGAGGAGGATAGCTCGGCACAACGGCGTAGTAAATTTTCTCAAGC SRTTRGRHELERRCR GGGGACTTGAGCGAAGAGGCTGCGTCGTTCATGTTGAACCAAGTCTGC AGCDAPETTNHILQQ AGGGCGAAACCGGACTGAATAAACCTGACCTGGTGGCTATCCGACAAA CYRTHGRRIARHNGV ATCGCATTTATGTGATTGACACTCAGATTGTGACTGACGGACATTCTCTC VNFLKRGLERRGCVV GACCAAGCGCACCAGCGTAAGGTCGGGAAGTACGATACACCGGACATA HVEPSLQGETGLNKP CGGACGAATTTGCGGAGATCTTTCGGTGCCTTTGACATTGAGTTCCATTC DLVAIRQNRIYVIDTQ CGCCACTGTGAACTGGAGGGGAATATGGAGTGGTCAATCAGTAAAACG IVTDGHSLDQAHQRK GTTGATCGCTTCAGACCTCCTCAGCTCTGGTGATAGCAATATCATCAGTG VGKYDTPDIRTNLRR TCCGGGTAATCAGTGGTGGTCTCTGGAGCTGGCGGCAGTTCATGTATCT SFGAFDIEFHSATVN GTCGGGGTACACTCGCGATTGGACTTAGCCAATGCACGGGTTCCAGATT WRGIWSGQSVKRLI AAGCTTGCTGCCGAAGCATACCATCAAAATCGGCATAAAATTCGCTTAA ASDLLSSGDSNIISVR TAAAGGAGGTGGTTTTAGTACGTAGGCGTCCCGGGACTTGTCTCGGGAT VISGGLWSWRQFMY GAATCGTGCATGCGTATAATTGGGATCGATAACAAATACCAACTAAGTT LSGYTRDWT(SEQID ATTACTAATATATCGAAATACATAAATATCCCGTCCTTACGTATCTTTGAA NO:1483) GATTTCCATCCTCAGCGAACAAAAAAAAAAAAAA(SEQIDNO:1115) R2 R2_DM X51967 Drosophila TTGGGGATCATGGGGTATTTGAGAGCAGAGGGGGAGTATTCTTCTGTA TTGGGGAT TAGCTA MTTRPSVDIFPEDQY melanogaster ATTCGTAAGTCATATCATATGATGTGCGGAAGGGGAATTTTACTCTGTA CATGGGGT AATCGTT EPNAAATLSRVPCTV ACTCACAAGTCTCTCCTTTACTCAAGTCGACTCAAAACCTCCTCGTGGTG ATTTGAGA TGGTTC CGRSFNSKRGLGVH GTCCCGGTAATGCTAAACTCGTTTAGCAGCTAATTTGAGCGGAAAAACT GCAGAGG AAAACA MRSRHPDELDEERRR TTTCCGATGGGCTGGTTCCCCAGAGGAAATTTATTCATATTGGAACTACA GGGAGTAT TTTGCTT VDIKARWSDEEKWM AGCACAAATAACGAGCCTCGGATACCTTTACACAATCTGATGACGACCC TCTTCTGT GCTGTC MARKEVELTANGCK GACCCTCCGTGGATATCTTCCCGGAGGACCAATATGAACCAAACGCAGC AATTCGTA TTGGCA HINKQLAVYFANRSV GGCTACTCTATCTAGGGTTCCCTGCACAGTATGTGGCCGGTCCTTTAACA AGTCATAT TAACATC EAIKKLRQRGDYKEKI GCAAGAGAGGACTCGGTGTTCACATGCGATCTCGGCACCCAGACGAAC CATATGAT AATAAA EQIRGQSALAPEVAN TTGATGAAGAACGTCGACGTGTCGATATAAAGGCAAGATGGAGTGATG GTGCGGA GGCATA LTIRRRPSRSEQDHQ AAGAGAAGTGGATGATGGCGAGAAAGGAGGTTGAGCTCACAGCAAAT AGGGGAA AACATC VTTSETTPITPFEQSN GGATGTAAACACATAAACAAGCAACTAGCGGTGTATTTTGCAAACCGCA TTTTACTCT GCAAAA REILRTLRGYSPVECH GCGTCGAAGCCATCAAAAAGCTAAGACAGAGGGGCGATTATAAGGAGA GTAACTCA TAATGG SKWRAQELQTIIDRA AAATAGAGCAGATAAGAGGGCAATCAGCTCTCGCCCCGGAAGTTGCTA CAAGTCTC TTATAAT HLEGKETTLQCLSLYL ATCTAACCATAAGGCGCCGCCCTAGTAGAAGTGAGCAAGACCACCAAGT TCCTTTACT TAAATG LGIFPAQGVRHTLTR AACAACATCGGAAACAACTCCAATCACTCCCTTCGAACAGTCGAACAGG CAAGTCGA GCTATG PPRRPRNRRESRRQQ GAAATTTTGCGGACACTACGCGGGTATAGCCCCGTAGAATGCCATTCCA CTCAAAAC AGGATG YAVVQRNWDKHKG AATGGAGAGCCCAAGAGTTGCAAACTATCATTGATAGGGCACATCTCGA CTCCTCGT GTTTTAG RCIKSLLNGTDESVM GGGAAAGGAAACCACTCTCCAATGCTTATCGCTATATCTCCTGGGAATTT GGTGGTCC TACGTA PSQEIMVPYWREVM TTCCGGCACAGGGTGTACGACACACACTGACGAGACCTCCTCGGAGACC CGGTAATG GGCGTT TQPSPSSCSGEVIQM TCGGAACAGGAGAGAAAGCAGAAGGCAGCAGTACGCTGTCGTCCAGC CTAAACTC GCGGAA DHSLERVWSAITEQD GTAACTGGGATAAGCATAAAGGAAGATGCATCAAGTCCTTGCTAAATG GTTTAGCA CTTCGG LRASRVSLSSSPGPDG GAACTGATGAGTCGGTAATGCCAAGCCAAGAAATAATGGTTCCCTACTG GCTAATTT TTCATAT ITPKSAREVPSGIMLR GAGAGAAGTAATGACTCAGCCTAGCCCAAGCTCTTGCAGTGGAGAAGT GAGCGGA AGAGCA IMNLILWCGNLPHSI GATACAAATGGATCACTCGCTTGAGAGGGTATGGTCTGCTATTACAGAG AAAACTTT ATGAAT RLARTVFIPKTVTAKR CAGGACCTTCGGGCGTCAAGAGTCTCATTATCCTCGTCTCCGGGGCCTG TCCGATGG CGTGCA PQDFRPISVPSVLVR ACGGGATAACTCCAAAATCTGCCAGGGAGGTGCCGTCAGGTATTATGCT GCTGGTTC TGCTAG QLNAILATRLNSSIN GCGCATAATGAACCTAATTCTATGGTGCGGTAATCTACCACACTCCATAC CCCAGAGG GAAAAC WDPRQRGFLPTDGC GACTGGCCAGAACCGTCTTCATCCCGAAGACGGTGACGGCGAAGCGAC AAATTTAT TGACCA ADNATIVDLVLRHSH CGCAAGACTTTCGTCCAATATCAGTGCCTTCAGTCCTGGTAAGACAGCTA TCATATTG CACACA KHFRSCYIANLDVSKA AATGCAATATTGGCAACCCGGTTGAACTCATCAATCAATTGGGACCCGC GAACTACA GTGTTG FDSLSHASIYDTLRAY GCCAGCGGGGCTTCTTACCAACCGACGGATGCGCCGATAATGCGACGA AGCACAAA GCAGAC GAPKGFVDYVQNTY TAGTCGACTTAGTCTTGAGGCATAGCCATAAGCACTTTAGATCTTGCTAC TAACGAGC CTAGTAT EGGGTSLNGDGWSS ATCGCAAATTTAGATGTAAGCAAGGCATTTGATTCTCTATCACATGCATC CTCGGATA CTTTCGA EEFVPARGVKQGDPL TATATACGACACCTTACGTGCTTATGGTGCGCCAAAGGGCTTCGTTGACT CCTTTACA AGATTTC SPILFNLVMDRLLRTL ACGTACAGAACACGTACGAGGGCGGTGGTACCAGTCTCAATGGGGACG CAATCTG CATACCT PSEIGAKVGNAITNA GTTGGAGTTCAGAGGAATTCGTCCCTGCTAGAGGAGTGAAGCAGGGTG (SEQID CCGCGA AAFADDLVLFAETRM ACCCTTTGTCTCCTATTCTATTTAACTTGGTAATGGACAGGTTACTTAGAA NO:1239) TCAAAA GLQVLLDKTLDFLSIV CCTTACCCAGCGAAATTGGTGCCAAAGTCGGAAATGCCATTACTAACGC AAAAAA GLKLNADKCFTVGIK GGCCGCGTTTGCAGATGATTTGGTACTATTTGCGGAAACTCGGATGGGG AAAAAA GQPKQKCTVLEAQSF CTTCAAGTATTGTTGGACAAGACGTTGGATTTTCTATCTATCGTCGGCCT AAAAAA YVGSSEIPSLKRTDEW CAAACTTAATGCCGACAAATGTTTTACCGTTGGCATTAAGGGCCAGCCG (SEQID KYLGINFTATGRVRC AAACAGAAGTGTACCGTGTTAGAGGCACAGAGCTTCTACGTAGGCTCG NO: NPAEDIGPKLQRLTK AGTGAGATTCCATCACTGAAGCGCACGGACGAGTGGAAGTACTTAGGC 1362) APLKPQQRLFALRTV ATCAACTTCACTGCAACCGGGAGGGTTCGATGCAATCCGGCCGAGGAC LIPQLYHKLALGSVAI ATTGGTCCAAAGCTACAAAGATTGACAAAGGCCCCCCTCAAACCACAAC GVLRKTDKLIRYYVRR AGAGGTTGTTCGCCCTTCGGACTGTCCTTATCCCACAGCTCTACCACAAG WLNLPLDVPIAFVHA TTAGCCCTTGGGAGTGTGGCGATAGGCGTCCTAAGAAAAACTGATAAA PPKSGGLGIPSLRWV CTTATAAGATATTATGTGCGAAGATGGCTAAATCTTCCGCTGGATGTGC APMLRLRRLSNIKWP CGATAGCATTTGTTCATGCACCCCCAAAAAGTGGAGGTCTCGGAATTCC HLTQNEVASSFLEAE ATCACTAAGATGGGTAGCTCCAATGTTAAGGCTAAGACGCTTGAGTAAC KQRARDRLLAEQNEL ATTAAATGGCCTCACCTCACGCAAAACGAGGTAGCCAGCTCTTTCCTCG LSRPAIEKYWANKLYL AAGCCGAAAAACAACGGGCCCGAGATAGATTATTAGCTGAACAAAATG SVDGSGLREGGHYG AACTGTTATCGCGTCCGGCAATAGAAAAATATTGGGCGAACAAGTTGTA PQHGWVSQPTRLLT CCTCTCAGTTGATGGTAGCGGACTCCGTGAAGGCGGCCATTATGGCCCG GKEYMDGIRLRINAL CAACACGGGTGGGTTAGTCAACCCACGCGTTTATTAACAGGAAAGGAAT PTKSRTTRGRHELER ATATGGACGGTATTCGTCTGCGGATAAATGCCCTACCCACAAAGTCTCG QCRAGCDAPETTNHI TACTACAAGGGGAAGGCACGAATTGGAACGACAGTGTCGTGCAGGATG MQKCYRSHGRRVAR TGATGCTCCCGAAACAACAAACCACATAATGCAAAAATGCTACCGCTCG HNCVVNRIKRGLEER CATGGGAGGCGGGTAGCTAGACACAACTGCGTAGTAAATCGAATCAAG GCVVIVEPSLQCESGL CGGGGACTTGAGGAGAGAGGCTGCGTGGTCATTGTTGAACCAAGTCTG NKPDLVALRQNHIDV CAGTGCGAATCCGGCCTTAATAAACCAGACCTGGTGGCACTACGACAAA IDTQIVTDGHSMDD ATCACATTGATGTGATCGACACACAAATTGTGACAGACGGACACTCTAT AHQRKINRYDRPDIR GGATGATGCGCACCAGCGCAAAATCAATAGATACGACAGACCGGACAT TELRRRFEAAGDIEFH ACGAACTGAATTGCGTCGCAGATTCGAAGCCGCAGGTGACATTGAATTC SATLNWRGIWSGQS CATTCTGCCACCCTGAACTGGAGGGGGATCTGGAGTGGTCAATCCGTTA VKRLIAKGLLSKYDSHI AAAGATTGATAGCAAAGGGTCTCCTCAGCAAATATGATAGTCATATCAT ISVQVMRGSLGCFKQ TAGCGTCCAGGTTATGAGAGGCAGTCTCGGTTGTTTTAAACAGTTCATG FMYLSGFSRDWT TACCTGAGCGGGTTTTCCCGAGATTGGACTTAGCTAAATCGTTTGGTTCA (SEQIDNO:1484) AAACATTTGCTTGCTGTCTTGGCATAACATCAATAAAGGCATAAACATCG CAAAATAATGGTTATAATTAAATGGCTATGAGGATGGTTTTAGTACGTA GGCGTTGCGGAACTTCGGTTCATATAGAGCAATGAATCGTGCATGCTAG GAAAACTGACCACACACAGTGTTGGCAGACCTAGTATCTTTCGAAGATT TCCATACCTCCGCGATCAAAAAAAAAAAAAAAAAAAAAA(SEQIDNO: 1116) R2 R2_DPe Drosophila AAGATATGGATCTGAATAATAGCGTAGAAGGGGAGTCATTCCGTAATTC AAGATATG TAGCCT SSFGLIVTNLNSETVL persimilis GTAAATCGTAAAAATCAGATCAAGTTGATTCAAGACCTCCTCGTGGTAT GATCTGAA ATACACT WGCQPLGQFSLIGT CTTCTGGATGCTATTAGACTGAAGTTCTTTTGGTCTAATAGTAACTAACT TAATAGCG ATGTTG NMQNTTPRIINTNSL TGAACAGCGAAACAGTCCTATGGGGTTGCCAGCCCCTTGGACAGTTCAG TAGAAGG GAGAGA TNQIPTVSSLGAQSE TTTGATTGGCACTAATATGCAAAATACAACGCCTCGGATAATAAACACTA GGAGTCAT AGACGC HSAQVNPNSGYQCTI ATTCGTTGACGAACCAAATCCCTACGGTCTCTAGCCTAGGGGCCCAATCT TCCGTAAT TTGCTAC CESSFRSKSGLGVHM GAACATAGTGCACAGGTTAACCCAAACAGTGGTTACCAATGCACGATAT TCGTAAAT CTAGGC SRRHKDEFDQLRLRT GTGAATCGTCTTTCCGTAGCAAAAGCGGACTAGGCGTTCACATGTCACG CGTAAAAA AAAATG DRKAQWSEEELSM TCGGCACAAGGACGAGTTTGATCAACTTCGTCTGCGTACCGACCGTAAG TCAGATCA TGAAAT MARKEIELAANGERY GCACAATGGAGTGAGGAAGAGTTGAGTATGATGGCAAGAAAAGAGAT AGTTGATT TAGGTA LNKKLAEVFTNRSVD CGAGCTCGCAGCAAATGGAGAAAGATATCTAAATAAGAAGCTAGCGGA CAAGACCT TAAACA AIKKCRQRERYKTKIE AGTATTTACGAACCGTAGCGTCGACGCTATCAAGAAATGTCGACAGAGG CCTCGTGG TCGTGG QLKGQAVPLPEALES GAGAGATATAAGACCAAAATCGAACAGCTAAAGGGTCAAGCTGTTCCTC TATCTTCT TTGTAA ETIQRRPSIRERDLLV TCCCAGAAGCATTAGAATCTGAAACCATACAGCGCCGCCCTAGTATACG GGATGCTA AACTTG TPPNTLGTTPTELSNS CGAGCGAGATCTCCTAGTAACGCCACCTAACACTCTCGGAACCACTCCA TTAGACTG AGGTGG EILAVLQGYPPVVCN ACCGAACTGTCGAACAGTGAAATCCTGGCAGTACTACAGGGGTACCCAC A(SEQID GTTTTTA DQWRVEVLQSIVDG CTGTAGTATGCAATGACCAATGGAGAGTTGAGGTTTTGCAATCCATCGT NO:1240) GTACGT AQASGKEITLQRLSTY AGATGGTGCGCAGGCCTCGGGTAAGGAAATTACTCTTCAGCGCTTGTCT ATGCGT LMEVFPSQNDRPIQT ACTTACCTTATGGAAGTATTTCCCTCACAGAATGACCGCCCCATTCAAAC GATTACT RPPRRPRNRRQGRR GAGACCTCCACGGAGACCTCGTAATAGGAGACAAGGTAGGAGACAGCA TCGTAAT QQYALTQRNWDKH GTACGCCTTAACACAGCGTAACTGGGACAAGCACAAAGGTCGTTGTATA CATGAA KGRCIKAILDGTEGTA AAAGCCATTTTGGATGGAACTGAGGGGACAGCAACTATGCCAAGTCAA TCGTGC TMPSQGIMGSYWR GGTATCATGGGGTCCTATTGGAGACAAGTCATGACACAAACAAGCCCAA ATGCTA QVMTQTSPTYSGTN CATATAGTGGTACGAACACCACGTTCCGGACGGAACACCCACTTGAAGG GTGGGG TTFRTEHPLEGVWSP GGTTTGGTCCCCGATAACACTAGGGGACCTAAGGGTACACAGAGTGTC TTTGGC ITLGDLRVHRVSLTKS ATTGACGAAATCTCCAGGACCTGATGGAATTACTCCAAGAACTGTCAGG CTCCACT PGPDGITPRTVRSIPS AGTATTCCGTCAGGAGTTATGCTTCGCATAATGAACCTGATACTTTGGTG AGTATCT GVMLRIMNLILWCG CGGAAAGTTGCCTGTCTCCATCCGACAGGCACGAACCATCTTCATTCCG TTGAAG KLPVSIRQARTIFIPKV AAGGTGGGGAATGCTTCTCGACCGCAAGACTTTCGTCCAATTACGGTAC ATTTTCC GNASRPQDFRPITVQ AATCTGTTATGGTAAGGATTTTAAATGCCATTTTGGCTTCCCGGTTGACC TTCCTCA SVMVRILNAILASRLT TCATCAGTCGACTGGGATCCGCGTCAGCGAGGTTTCCTTCCAACCGACG GCGATC SSVDWDPRQRGFLP GATGTGCCGATAATACGACGATAGTCGACTTAATCTTAAGGGATCACCA AAAAAA TDGCADNTTIVDLILR TAAACGTTGTAAATCACTTTATATCGCAACTTTAGATATAAGCAAAGCAT AA(SEQ DHHKRCKSLYIATLDI TTGACTCGGTGTCTCATGCAGCAGTTAGCGCCACTCTAACTGCATATGGT IDNO: SKAFDSVSHAAVSAT GCCCCTAAAGAATTCGTTGACTACGTACAAAATTCGTACGAGGTCTGTG 1363) LTAYGAPKEFVDYVQ GCACAACGCTCAATGGGGACGGATGGAGATCAGAGGAATTCATACCTG NSYEVCGTTLNGDG CTCGAGGTGTCAGACAGGGTGACCCGCTATCTCCCATAATATTCAACTT WRSEEFIPARGVRQG GATCATCGATCAGTTGCTTAGGTCCTACCCCAATGAGATTGGTGCCACA DPLSPIIFNLIIDQLLRS ATCGGTGATCACACAACAAACGCGGCCGCGTTCGCAGATGATATTGTCT YPNEIGATIGDHTTN TATTTGCGGAAACTCGTTTAGGCCTTCAAACAATGCTAGACACGACTGTC AAAFADDIVLFAETRL GATTTTCTATCTTCAGTCGGGCTTACCCTTAACTCGGATAAATGTTTTACA GLQTMLDTTVDFLSS GTTGGAATAAAGGGGCAACCGAAACAGAAGTGTACTGTGGTCATCCCA VGLTLNSDKCFTVGIK GAGACCTTCCGTATCGGTTCGCGCTCGTGTCCTGCATTGAAGCGCACAG GQPKQKCTVVIPETF ACGAGTGGAAGTATTTAGGCATAACATTCACTGCACAAGGGAGGACCA RIGSRSCPALKRTDE GGTACAGTCCAGCCGACGACCTAGGTCCGAAGCTGTTGAGGCTGACAA WKYLGITFTAQGRTR GGTCCCCCCTAAAACCACAACAGAAATTGTTCGCACTCAGAACAGTTCTT YSPADDLGPKLLRLTR ATCCCACAACTTTACCATAAGCTGACCCTAGGTAGTGTGATGATAGGTG SPLKPQQKLFALRTVL TTCTGAGAAAGTGTGACATACTGGTACGTTCGACCGTAAGGAAGTGGTT IPQLYHKLTLGSVMIG AGGGCTTCCTCTGGACGTGTCAACTGCATTCTTCCATGCTCCTCATACTT VLRKCDILVRSTVRK ATGGGGGCCTCGGAATCCCTTCAGTTCGTTGGGTAGCGCCAATGCTACG WLGLPLDVSTAFFHA TATGAAAAGATTGAGCAATATTAAGTGGGCCCACCTCGCGCAATCCGAG PHTYGGLGIPSVRWV GCGGCCAGCTCATTTCTTACCGACGAATTGAATAAGGCCCGGGGTAGAA APMLRMKRLSNIKW CTCTGGCTGGACTGAATGAGTTGACATCGCGTACAGAGATCGAAACGTA AHLAQSEAASSFLTD CTGGGCGAACAGGTTGTATATGTCTGTTGATGGTCGCGGCTTACGTGAA ELNKARGRTLAGLNE GCGGGACTTTTTCGTCCCCAACACGGCTGGGTGTGTCAGCCCACGCGTT LTSRTEIETYWANRLY TGCTAACAGGTCAAGATTACCGAAACAGTATCAAGCTGCGAATAAATGC MSVDGRGLREAGLF CCTACCATCGAGGTCTCGTACCACGAGGGGCAGAAATGAATTGGAACG RPQHGWVCQPTRLL GCAATGTCGTGCAGGTTGTGATGCTCCCGAAACAACAAACCACATCCTG TGQDYRNSIKLRINAL CAGAATTGTTACCGTACGCATGGGAGGCGGGTAGCAAGACATAACTGT PSRSRTTRGRNELER GTAGTCAATAACCTTAAGAGGATTCTTGAGGAGAAGGGCCACACAGTA QCRAGCDAPETTNHI CACGTCGAACCAAGTTTGCAGCTGGAAACCTCGGTAAGTAAACCAGACC LQNCYRTHGRRVAR TGGTGTGTATCCGTGACAATCACGCTTGCGTGATTGATGCGCAGATTAT HNCVVNNLKRILEEK AACGGATGGACTGTTTCTCGACGATGTGCACCATCGCAAAGTTGAGAAA GHTVHVEPSLQLETS TATAAAAGACCGGAAGTTATATCTGCACTGCGGAGAGAATTCGGAGTG VSKPDLVCIRDNHAC TCGGGCAACGTCGAAGTCCTATCCGCGACGTTAAACTGGCGTGGGATCT VIDAQIITDGLFLDDV GGAGCAATCAATCCGTTAGAAGATTGATAGCAAAGGGTCTCATCTCATC HHRKVEKYKRPEVIS CGGTGACAGCAATGTCATTAGCGCCAGAGTGGTAACAGGCGGACTATA ALRREFGVSGNVEVL TTGCTTCAGACAGTTCATGTATCTCGCAGGTTACACTCGAGATTGGACTT SATLNWRGIWSNQS AGCCTATACACTATGTTGGAGAGAAGACGCTTGCTACCTAGGCAAAATG VRRLIAKGLISSGDSN TGAAATTAGGTATAAACATCGTGGTTGTAAAACTTGAGGTGGGTTTTTA VISARVVTGGLYCFR GTACGTATGCGTGATTACTTCGTAATCATGAATCGTGCATGCTAGTGGG QFMYLAGYTRDWT GTTTGGCCTCCACTAGTATCTTTGAAGATTTTCCTTCCTCAGCGATCAAA (SEQIDNO:1485) AAAAA(SEQIDNO:1117) R2 R2_DPs Drosophila CAATTGGAAAGATATGGGTCTGAATAATAGCGTAGAAGGGGAGTCATT CAATTGGA TAGCCT SSFGLIVTNLNSETVL pseudoobscura CCGTAATTCGTAAATCGTAAAAATCAGATCAAGTTGATTCAAGACCTCCT AAGATATG ATACACT WGCQPLGQFSLIGT CGTGGTATCTTCTGGATGCTATTAGACTGAAGTTCTTTTGGTCTAATAGT GGTCTGAA ATGTTG NMQNTTPRIINTNSL AACTAACTTGAACAGCGAAACAGTCCTATGGGGCTGCCAGCCCCTTGGA TAATAGCG GAGAGA TNQIPTVSSLGAQSE CAGTTCAGTTTGATTGGCACTAATATGCAAAATACAACGCCTCGGATAA TAGAAGG AGACGC HSAQVNPNSGYQCTI TAAACACTAATTCGTTGACGAACCAAATCCCTACGGTCTCTAGCCTAGG GGAGTCAT TTGCTAC CESSFRSKSGLGVHM GGCCCAATCTGAACATAGTGCACAGGTTAACCCAAACAGTGGTTACCAA TCCGTAAT CTAGGC SRRHKDEFDQLRLRT TGCACGATATGTGAATCGTCTTTCCGTAGCAAAAGCGGACTAGGCGTTC TCGTAAAT ATAATG DRKAQWSEEELSM ACATGTCACGTCGGCACAAGGACGAGTTTGATCAACTTCGTCTGCGTAC CGTAAAAA TGAAAT MARKEIELAANGERY CGACCGTAAGGCACAATGGAGTGAGGAAGAGTTGAGTATGATGGCAA TCAGATCA TAGGTA LNKKLAEVFTNRSVD GAAAAGAGATCGAGCTCGCAGCAAATGGAGAAAGATATCTAAATAAGA AGTTGATT TAAACA AIKKCRQRERYKTKIE AGCTAGCGGAAGTATTTACGAACCGTAGCGTCGACGCTATCAAGAAATG CAAGACCT TCGTGG QLKGQAVPLPEALES TCGACAGAGGGAGAGATATAAGACCAAAATCGAACAGCTAAAGGGTCA CCTCGTGG TTGTAA ETIQRRPSIRERDLLV AGCTGTTCCTCTCCCAGAAGCATTAGAATCTGAAACCATACAGCGCCGC TATCTTCT AACTTG TPPNTLGTTPTELSNR CCTAGTATACGCGAGCGAGATCTCCTAGTAACGCCACCTAACACTCTCG GGATGCTA AGGTGG EILAVLQGYPPVVCN GAACCACTCCAACCGAACTGTCGAACAGGGAAATCCTGGCAGTACTACA TTAGACTG GTTTTTA DQWRVEVLQSIVDG GGGGTACCCACCTGTAGTATGCAATGACCAATGGAGAGTTGAGGTTTTG A(SEQID GTACGT AQASGKEITLQRLSTY CAATCCATCGTAGATGGTGCGCAGGCCTCGGGTAAGGAAATTACTCTTC NO:1241) ATGCGT LMEVFPSQNDRPIQT AGCGCTTGTCTACTTACCTTATGGAAGTATTTCCCTCACAGAATGACCGC GATTACT RPPRRPRNRRQGRR CCCATTCAAACGAGACCTCCACGGAGACCTCGTAATAGGAGACAAGGTA TCGTAAT QQYALTQRNWDKH GGAGACAGCAGTACGCCTTAACACAGCGTAACTGGGACAAGCACAAAG CATGAA KGRCIKAILDGTEGTA GTCGTTGTATAAAAGCCATTTTGGATGGAACTGAGGGGACAGCAACTAT TCGTGC TMPSQGIMGSYWR GCCAAGTCAAGGTATCATGGGGTCCTATTGGAGACAAGTCATGACACAA ATGCTA QVMTQTSPTYSGTN ACAAGCCCAACATATAGTGGTACGAACACCACGTTCCGGACGGAACACC GTGGGG TTFRTEHPLEGVWSP CACTTGAAGGGGTTTGGTCCCCGATAACACTAGGGGACCTAAGGGTAC TTTGGC ITLGDLRVHRVSLTKS ACAGAGTGTCATTGACGAAATCTCCAGGACCTGATGGGATTACTCCAAG CTCCACT PGPDGITPRTVRSIPS AACTGTCAGGAGTATTCCGTCAGGAGTTATGCTTCGCATAATGAACCTG AGTATCT GVMLRIMNLILWCG ATACTTTGGTGCGGAAAGTTGCCTGTCTCCATCCGACAGGCACGAACCA TTGAAG KLPVSIRQARTIFIPKV TCTTCATTCCGAAGGTGGGGAATGCTTCTCGACCGCAAGACTTTCGTCCA ATTTTCC GNASRPQDFRPITVQ ATTACGGTACAATCTGTTATGGTAAGGATTTTAAATGCCATTTTGGCTTC TTCCTCA SVMVRILNAILASRLT CCGGTTGACCTCATCAGTCGACTGGGATCCGCGTCAGCGAGGTTTCCTT GCGATC SSVDWDPRQRGFLP CCAACCGACGGATGTGCCGATAATACGACGATAGTCGACTTAATCTTAA AAAAAA TDGCADNTTIVDLILR GGGATCACCATAAACGTTGTAAATCACTTTATATCGCAACTTTAGATATA AAAAAA DHHKRCKSLYIATLDI AGCAAAGCATTTGACTCGGTGTCTCATGCAGCAGTTAGCGCCACTCTAA AAAAAA SKAFDSVSHAAVSAT CTGCATATGGTGCCCCTAAAGAATTCGTTGACTACGTACAAAATTCGTAC AAAAAA LTAYGAPKEFVDYVQ GAGGTCTGTGGCACAACGCTCAATGGGGACGGATGGAGATCAGAGGA (SEQID NSYEVCGTTLNGDG ATTCATACCTGCTCGAGGTGTCAGACAGGGTGACCCGCTATCTCCCATA NO: WRSEEFIPARGVRQG ATATTCAACTTGATCATCGATCAGTTGCTTAGGTCCTACCCCAATGAGAT 1364) DPLSPIIFNLIIDQLLRS TGGTGCCACAATCGGTGATCACACAACAAACGCGGCCGCGTTCGCAGAT YPNEIGATIGDHTTN GATATTGTCTTATTTGCGGAAACTCGTTTAGGCCTTCAAACAATGCTAGA AAAFADDIVLFAETRL CACGACTGTCGATTTTCTATCTTCAGTCGGGCTTACCCTTAACTCGGATA GLQTMLDTTVDFLSS AATGTTTTACAGTTGGAATAAAGGGGCAACCGAAACAGAAGTGTACTGT VGLTLNSDKCFTVGIK GGTCATCCCAGAGACCTTCCGTATCGGTTCGCGCTCGTGTCCTGCATTGA GQPKQKCTVVIPETF AGCGCACAGACGAGTGGAAGTATTTAGGCATAACATTCACTGCACAAG RIGSRSCPALKRTDE GGAGGACCAGGTACAGTCCAGCCGACGACCTAGGTCCGAAGCTGTTGA WKYLGITFTAQGRTR GGCTGACAAGGTCCCCCCTAAAACCACAACAGAAATTGTTCGCACTCAG YSPADDLGPKLLRLTR AACAGTTCTTATCCCACAACTTTACCATAAGCTGACCCTAGGTAGTGTGA SPLKPQQKLFALRTVL TGATAGGTGTTCTGAGGAAGTGTGACATACTGGTACGTTCGACCGTAAG IPQLYHKLTLGSVMIG GAAGTGGTTAGGGCTTCCTCTGGACGTGTCAACTGCATTCTTCCATGCTC VLRKCDILVRSTVRK CTCATATTTATGGGGGCCTCGGAATCCCTTCAGTTCGTTGGGTAGCGCC WLGLPLDVSTAFFHA AATGCTACGTATGAAAAGATTGAGCAATATTAAGTGGGCCCACCTCGCG PHIYGGLGIPSVRWV CAATCCGAGGCGGCCAGCTCATTTCTTACCGACGAATTGAATAAGGCCC APMLRMKRLSNIKW GGGGTAGAACTCTGGCTGGACTGAATGAGTTGACATCGCGTTCGGAGA AHLAQSEAASSFLTD TCGAAACGTACTGGGCGAACAGGTTGTATATGTCTGTTGATGGTCGCGG ELNKARGRTLAGLNE CTTACGTGAAGCGGGACTTTTTCGTCCCCAACACGGCTGGGTGTGTCAG LTSRSEIETYWANRLY CCCACGCGTTTGCTAACAGGTCAAGATTACCGAAACGGTATCAAGCTGC MSVDGRGLREAGLF GAATAAATGCCCTACCATCGAGGTCTCGTACCACGAGGGGCAGAAATG RPQHGWVCQPTRLL AATTGGAACGGCAATGTCGTGCAGGTTGTGATGCTCCCGAAACAACAAA TGQDYRNGIKLRINA CCACATCCTGCAGAATTGTTACCGTACGCATGGGAGGCGGGTAGCAAG LPSRSRTTRGRNELER ACATAACTGTGTAGTCAATAACCTTAAGAGGATTCTTGAGGAGAAGGGC QCRAGCDAPETTNHI CACACAGTACACGTCGAACCAAGTTTGCAGCTGGAAACCTCGGTAAGTA LQNCYRTHGRRVAR AACCAGACCTGGTGTGTATCCGTGACAATCACGCTTGCGTGATTGATGC HNCVVNNLKRILEEK GCAGATTATAACGGATGGACTGTTTCTCGACGATGTGCACCATCGCAAA GHTVHVEPSLQLETS GTTGAGAAATATAAAAGACCGGAAGTTATATCTGCACTGCGGAGAGAA VSKPDLVCIRDNHAC TTCGGAGTGTCGGGCAACGTCGAAGTCCTATCCGCGACGTTAAACTGGC VIDAQIITDGLFLDDV GTGGGATCTGGAGCAATCAATCCGTTAGAAGATTGATAGCAAAGGGTC HHRKVEKYKRPEVIS TCATCTCATCCGGTGACAGCAATGTCATTAGCGCCAGAGTGGTAACAGG ALRREFGVSGNVEVL CGGACTATATTGCTTCAGACAGTTCATGTATCTCGCAGGTTACACTCGAG SATLNWRGIWSNQS ATTGGACTTAGCCTATACACTATGTTGGAGAGAAGACGCTTGCTACCTA VRRLIAKGLISSGDSN GGCATAATGTGAAATTAGGTATAAACATCGTGGTTGTAAAACTTGAGGT VISARVVTGGLYCFR GGGTTTTTAGTACGTATGCGTGATTACTTCGTAATCATGAATCGTGCATG QFMYLAGYTRDWT CTAGTGGGGTTTGGCCTCCACTAGTATCTTTGAAGATTTTCCTTCCTCAG (SEQIDNO:1486) CGATCAAAAAAAAAAAAAAAAAAAAAAAA(SEQIDNO:1118) R2 R2_DSe Drosophila GGGATCAGGGGTAATTGCGAGCAGAGGGGGAGTATTTTTCTGTAATTC GGGATCA TAGCTA FERQNFSDGLVPQRK sechellia GTAAGTCATATCATATGGTGTGCGGAAGGGGAATTTTACTCTGTAACTC GGGGTAAT AAACGT FIHIGTTNRNNEPRS ACAAGTCTCTCCTTTACTCAAGTCGACTCAAAACCTCCTCGTGGTGGTCC TGCGAGCA TTGGTTC NLRNLMTTRPSVDIF CCGGTAATGCTAAACTTGTTTAGCAGCTAATTTGAGCGGCAAAACTTTTC GAGGGGG AAAACA PEDQYEPNAAATLSR CGATGGGTTGGTTCCCCAGAGGAAATTTATTCATATTGGAACTACAAAT AGTATTTT TTTGCTT VPCTVCGRSFNSKRG AGAAATAACGAGCCTCGGAGCAATTTACGCAATCTGATGACGACCCGAC TCTGTAAT GCTGTC LGVHMRSRHPDELD CCTCCGTGGACATCTTCCCGGAGGACCAATATGAACCAAACGCAGCGGC TCGTAAGT TTGGCA EERRRVDIKARWSEE TACTCTATCTAGGGTACCCTGCACAGTATGTGGCCGGTCCTTTAACAGCA CATATCAT TAACATC EKWMMARKEVELTA AGAGAGGACTCGGTGTTCACATGCGATCTCGGCACCCAGACGAACTTG ATGGTGTG AATAAA NGHKHINKQLAVYFA ATGAAGAACGTCGACGTGTCGATATAAAGGCAAGATGGAGTGAGGAA CGGAAGG GGCATA NRSVEAIKKLRQRGD GAGAAGTGGATGATGGCGAGAAAGGAGGTTGAGCTCACAGCAAATGG GGAATTTT AACATC YKEKIEQIRRQSALVP ACATAAACACATAAACAAGCAACTAGCGGTGTATTTTGCAAACCGCAGC ACTCTGTA GCAAAT EVANLTIRRRPSRSEQ GTCGAAGCCATCAAAAAGCTAAGACAAAGGGGCGATTATAAGGAGAAA ACTCACAA AATGGT NHQVTTSETTPITPFE ATAGAGCAGATAAGAAGGCAATCAGCTCTCGTCCCGGAAGTTGCAAATC GTCTCTCC AATATAT QSNREILRTLRGYSPV TAACCATAAGGCGCCGCCCTAGTAGAAGTGAGCAAAACCACCAAGTAA TTTACTCA AAATTG ECHSKWRAQELQTII CAACATCAGAAACAACTCCAATCACACCCTTCGAACAGTCGAACAGGGA AGTCGACT GCTATG DRAELEGKETTLQCLS AATTTTGCGGACACTGCGTGGGTATAGCCCCGTAGAATGCCATTCCAAA CAAAACCT AGGATG LYLLGIFPAQGVRHTL TGGAGAGCCCAAGAGCTACAAACGATCATTGACAGGGCAGAGCTCGAG CCTCGTGG GTTTTAG TRPPRRPRNRRESRR GGAAAGGAAACCACTCTCCAATGCTTATCGCTATATCTCCTAGGAATTTT TGGTCCCC TACGTA QQYAVVQRNWDKH TCCGGCACAGGGTGTACGTCACACTCTGACGAGACCTCCTCGGAGGCCT GGTAATGC GGCGTT KGRCIKSLLNGTDESV CGAAATAGGAGAGAAAGCAGAAGGCAGCAGTATGCTGTCGTCCAGCGT TAAACTTG GCGGAA MPSQEVMVPYWRE AACTGGGATAAGCATAAAGGAAGATGCATCAAGTCCTTGCTAAATGGA TTTAGCAG CTTCGG VMTQPSPSSCSREVI ACTGATGAGTCGGTAATGCCAAGCCAAGAAGTAATGGTTCCTTACTGGA CTAA(SEQ TTCAGAT QMDHSLERVWSAIT GAGAAGTAATGACTCAGCCTAGCCCAAGCTCTTGCAGTAGAGAAGTGAT IDNO: AGAGCA EHDLRASRISLSSSPG ACAAATGGATCACTCGCTTGAGAGGGTTTGGTCTGCTATTACAGAGCAC 1242) ATGAAT PDGITPKTAREVPSGI GACCTTCGGGCATCAAGAATCTCATTATCTTCATCTCCGGGGCCTGACG CGTGCA MLRIMNLILWCGNL GGATAACTCCAAAAACAGCCAGGGAGGTGCCGTCAGGTATTATGTTGC TGCTAG PHSIRLARTVFIPKTVT GAATAATGAACCTAATTCTATGGTGCGGCAATCTACCACACTCTATCCGA GAAACT AKRPQDFRPISVPSVL CTGGCCAGAACCGTCTTCATCCCGAAGACGGTGACGGCGAAGCGACCG GAAGTG VRQLNAILATRLNSSI CAAGACTTTCGTCCAATATCGGTGCCTTCAGTCCTGGTAAGACAGCTAA TTGACA NWDPRQRGFLPTDG ATGCAATATTGGCAACCCGGTTGAACTCATCAATCAATTGGGACCCGCG GACCTA CADNATIVDLVLRHS CCAGCGGGGCTTCTTACCTACCGACGGATGTGCCGATAACGCGACGATA GTATCTT HKHFRSCYIANLDVS GTTGACTTAGTCTTGAGGCATAGCCATAAGCACTTTAGATCTTGCTACAT TCGATA KAFDSLSHASIYDTLR TGCAAATTTAGATGTAAGCAAGGCATTCGATTCTTTATCGCATGCATCTA GATTTCC AYGAPKGFVDYVQN TATATGACACCTTACGTGCTTATGGTGCGCCAAAGGGCTTCGTTGACTAC ATACCTC TYEGGGTSLNGDGW GTACAGAATACGTACGAGGGTGGCGGTACCAGTCTCAATGGGGACGGT CGCGAT SSEEFVPARGVKQGD TGGAGTTCAGAGGAATTCGTCCCTGCTAGAGGAGTGAAGCAGGGTGAC CAAAAA PLSPILFNLVMDRLLR CCTTTGTCTCCTATTCTATTTAACTTGGTAATGGACAGGTTACTTAGAAAC AAAAAA NLPSEIGARVGNAIT CTACCCAGCGAAATTGGTGCCAGAGTCGGAAATGCCATTACTAACGCGG AAAAAA NAAAFADDLVLFAET CCGCGTTTGCAGATGATTTGGTACTATTTGCTGAAACTCGAATGGGACTT AAAA RMGLQVLLDRTLDFL CAAGTATTGTTGGACAGAACGTTGGATTTTCTATCTCTCGTCGGCCTCAA (SEQID SLVGLKLNADKCFTV ACTTAATGCCGACAAATGTTTTACCGTTGGCATTAAGGGCCAGCCGAAA NO: GIKGQPKQKCTVLEA CAGAAGTGTACCGTGCTAGAGGCACAGAGCTTTTACGTAGGCTCAAGG 1365) QSFYVGSREIPSLKRT GAGATTCCATCATTGAAGCGAACGGACGAGTGGAAGTACTTAGGCATC DEWKYLGINFTATGR AACTTCACTGCAACCGGGAGGGTTCGATGCAATCCGGCCGAGGACATT VRCNPAEDIGPKLQR GGTCCGAAGCTACAAAGATTGACAAAGGCCCCCCTCAAACCACAACAGA LTKAPLKPQQRMFAL GGATGTTCGCCCTTAGGACTGTCCTTATCCCACAGCTCTATCACAAGTTA RTVLIPQLYHKLALGS GCCCTTGGGAGTGTGGCGATAGGCATCCTACGAAAAACTGACAAACTA VAIGILRKTDKLIRYYV ATACGATACTATGTGCGAAGATGGCTAAATCTTCCGCTGGATATACCGA RRWLNLPLDIPIAFIH TAGCATTCATTCACGCACCCCCAAAAAGTGGAGGTCTCGGAATTCCATC APPKSGGLGIPSLRW ACTTAGATGGGTAGCTCCAATGTTAAGGCTAAGACGTTTGAGTAATATT VAPMLRLRRLSNIKW AAATGGCCTCACCTCACGCAAAACGAGGTAGCCAGCTCTTTCCTCGAAG PHLTQNEVASSFLEA CAGAAAAACAACGGGCCCGAGATAGATTATTAGCAGAACAAAATGAAT EKQRARDRLLAEQNE TGTTATCGCGTCCGGCAATAGAAAAATATTGGGCGAACAAATTGTACCT LLSRPAIEKYWANKLY CTCAGTTGATGGTAGCGGACTCCGTGAAGCAGGCCATTGGGGCCCGCA LSVDGSGLREAGHW ACACGGGTGGGTTAATCAACCCACGCGTTTACTAACAGGAAAGGAATAT GPQHGWVNQPTRLL ATAGACGGTATTCGTCTGCGGATAAATGCCCTACCCACGAAGTCTCGTA TGKEYIDGIRLRINALP CTACAAGGGGAAGGCACGAATTGGAACGACAGTGTCGTGCAGGATGTG TKSRTTRGRHELERQ ACGCTCCCGAAACAACAAACCATATTATGCAAAAATGTTACCGATCGCA CRAGCDAPETTNHI TGGGAGGCGCATAGCTAGACACAACTGCGTAGTAAATCGAATCAAGCG MQKCYRSHGRRIAR GGGACTTGAGGAGAGAGGCTGCGTAGTCATTGTTGAACCAAGTCTGCA HNCVVNRIKRGLEER GTGCGAATCCGGTCTTAATAAACCAGACCTGGTGGCACTACGACAAAAT GCVVIVEPSLQCESGL CACATTGATGTGATCGACATACAAATTGTGACAGACGGACACTCTATGG NKPDLVALRQNHIDV ATGATGCACACCAGCGCAAAATCAATAGATACGACAGACCGGACATAC IDIQIVTDGHSMDDA GAACTGAGTTGCGTCGCAGATTCGAAGCCGCAGGTGACATTGAATTCCA HQRKINRYDRPDIRT TTCTGCCACCCTGAACTGGAGGGGGATCTGGAGTGGTCAATCCGTTAAA ELRRRFEAAGDIEFHS AGATTGATAGCGAAGGGTCTCCTCAGCAAATATGATAGTCATATCATTA ATLNWRGIWSGQSV GCGTCCAGGTTATGAGAGGCAGTCTCGGTTGTTTTAAACAGTTCATGTA KRLIAKGLLSKYDSHII CCTGAGCGGGTTTTCCCGAGATTGGACTTAGCTAAAACGTTTGGTTCAA SVQVMRGSLGCFKQ AACATTTGCTTGCTGTCTTGGCATAACATCAATAAAGGCATAAACATCGC FMYLSGFSRDWT AAATAATGGTAATATATAAATTGGCTATGAGGATGGTTTTAGTACGTAG (SEQIDNO:1487) GCGTTGCGGAACTTCGGTTCAGATAGAGCAATGAATCGTGCATGCTAG GAAACTGAAGTGTTGACAGACCTAGTATCTTTCGATAGATTTCCATACCT CCGCGATCAAAAAAAAAAAAAAAAAAAAA(SEQIDNO:1119) R2 R2_DSi Drosophila GGGATCTGGGGTAATTGCGAGCAGAGGGGGAGTATTTTTCTGTAATTC GGGATCTG TAGCTA TCLAANLSGKNFSDG simulans GTAAGTCATATCATATGGTGTGCGGAAGGGGAATTTTACTCTGTAACTC GGGTAATT AAACGT LVTQRKFTHIGTTNT ACAAGTCTCTCCTTTACTCAAGTCGACTCAAAACCTCCTCGTGGTGGTCC GCGAGCA TTGGTTC NNEPRISLHNLMTTR CCGGTAATGCTAAACTTGTTTAGCAGCTAATTTGAGCGGCAAAAACTTTT GAGGGGG AAAACA PSVDIFPEDQYEPNA CCGATGGGCTGGTTACCCAGAGGAAATTTACTCATATTGGAACTACGAA AGTATTTT TTTGCTT AATLSRVPCTVCGRS CACAAATAACGAGCCTCGGATATCTTTACACAATCTGATGACGACCCGA TCTGTAAT GCTGTC FNSKRGLGVHMRSR CCCTCCGTGGATATCTTCCCGGAGGACCAATATGAACCAAACGCAGCGG TCGTAAGT TTGGCA HPDELDEERRRVDIK CTACTCTATCTAGGGTTCCCTGCACAGTATGTGGCCGGTCCTTTAACAGC CATATCAT TAACATC ARWSEEEKWMMAR AAGAGAGGACTCGGTGTTCACATGCGATCTCGGCACCCAGACGAACTT ATGGTGTG AATAAA KEVELTANGHKHMN GATGAAGAACGTCGACGTGTCGATATAAAGGCAAGGTGGAGTGAGGA CGGAAGG GGCATA KQLAVYFANRSVEAI AGAGAAGTGGATGATGGCGAGAAAGGAGGTTGAGCTCACAGCAAATG GGAATTTT AACATC KKLRQRGDYKEKIEQI GACATAAACACATGAACAAGCAACTAGCGGTGTATTTTGCAAACCGCAG ACTCTGTA GCAAAA RGQSALVPEVANLTI CGTCGAAGCCATCAAAAAGCTAAGACAGAGGGGCGATTATAAGGAGAA ACTCACAA TAATGG RRRPSRSEQNHQVTT AATAGAGCAGATAAGAGGGCAATCTGCTCTCGTCCCGGAAGTTGCAAAT GTCTCTCC TTATATA SETTPITPFEQSNREIL CTAACCATAAGGCGCCGCCCTAGTAGAAGTGAGCAAAACCACCAAGTA TTTACTCA TAAATG RTLRGYSPVECHSKW ACAACATCAGAAACAACTCCAATCACTCCCTTCGAACAGTCGAACAGGG AGTCGACT GCTATG RAQELQTIIDRAELEG AAATTTTGCGGACACTGCGTGGGTATAGCCCCGTAGAATGCCATTCCAA CAAAACCT AGGATG KETTLQCLSLYLLGIFP ATGGAGAGCCCAAGAGCTACAAACGATCATTGACAGGGCAGAGCTCGA CCTCGTGG GTTTTAG AQGVRHTLTRPPRRP GGGAAAGGAAACCACTCTCCAATGCTTATCGCTATATCTCCTGGGAATTT TGGTCCCC TACGTA RNRRESRRQQYAVV TTCCGGCACAGGGTGTACGACACACGCTGACGAGACCTCCTCGGAGAC GGTAATGC GGCGTT QRNWDKHKGRCIKS CTCGGAATAGGAGAGAAAGCAGAAGGCAGCAGTATGCTGTCGTCCAGC TAA(SEQ GCGGAA LLNGTDESVMPSQEV GTAACTGGGATAAGCATAAAGGAAGATGCATCAAGTCCTTGCTAAATG IDNO: CTTCGG MVPYWREVMTQPS GAACTGATGAGTCGGTAATGCCAAGCCAAGAAGTAATGGTTCCCTACTG 1243) TTCAGAT PSSCSGEVIQMDHSL GAGAGAAGTAATGACTCAGCCTAGCCCAAGCTCTTGCAGTGGAGAAGT AGAGCA ERVWSAITEHDLRAS GATACAAATGGATCACTCGCTTGAGAGGGTTTGGTCTGCTATTACGGAG ATGAAT RISLSSSPGPDGITPKS CATGACCTTCGGGCGTCAAGAATCTCATTATCTTCATCTCCGGGGCCTGA CGTGCA AREVPSGIMLRIMNLI CGGGATAACTCCAAAATCTGCCAGGGAGGTGCCGTCAGGTATTATGTTG TGCTAG LWCGNLPHSIRLART CGAATAATGAACCTAATTCTATGGTGCGGTAATCTACCACACTCTATCCG GAAAAC VFIPKTVTAKRPQDF ACTGGCCAGAACCGTCTTCATCCCGAAAACGGTGACGGCGAAGCGACC TGACCA RPISVPSVLVRQLNAI GCAAGACTTTCGTCCAATATCGGTGCCTTCAGTCCTGGTAAGACAGCTA CACGCA LATRLNSSINWDPRQ AATGCCATATTGGCAACCCGGTTGAACTCATCAATCAATTGGGACCCGC GTGTTG RGFLPTDGCADNATI GCCAGCGGGGCTTCTTACCTACCGACGGATGTGCCGATAATGCGACGAT GCAGCC VDLVLRHSHKHFRSC AGTTGACTTAGTCTTGAGGCATAGCCATAAGCACTTTAGATCTTGCTACA CTAGTAT YIANLDVSKAFDSLSH TAGCTAATTTAGATGTAAGCAAGGCATTCGATTCTTTATCGCATGCATCT CTTTCGA ASIYDTLRAYGAPKGF ATATATGACACCTTACGTGCTTATGGTGCGCCAAAGGGCTTCGTTGACT TAGATTT VDYVQNTYEGGGTSL ACGTACAGAATACGTACGAGGGTGGCGGTACCAGTCTCAATGGGGACG CCATACC NGDGWSSEEFVPAR GTTGGAGTTCAGAGGAATTCGTCCCTGCTAGAGGAGTGAAGCAGGGTG TCCGCG GVKQGDPLSPILFNL ACCCTTTGTCTCCTATTCTATTTAACTTGGTAATGGACAGGTTACTTAGAA ATCAAA VMDRLLRNLPSEIGA ACCTACCCAGCGAAATTGGTGCCAAAGTCGGAAATGCCATTACTAACGC AAAAAA KVGNAITNAAAFAD GGCCGCGTTTGCAGATGATTTGGTACTATTTGCTGAAACTCGAATGGGA AAAAAA DLVLFAETRMGLQVL CTTCAAGTATTGTTGGACAAAACGTTGGATTTTCTATCTCTCGTCGGCCT AAAAAA LDKTLDFLSLVGLKLN CAAACTTAATGCCGACAAATGTTTTACCGTTGGCATTAAGGGCCAGCCG AA(SEQ ADKCFTVGIKGQPKQ AAACAGAAGTGTACCGTGCTAGAGGCACAGAGCTTCTACGTAGGCTCG IDNO: KCTVLEAQSFYVGSR AGGGAGATTCCATCACTGAAGCGAACGGACGAGTGGAAGTACTTAGGC 1366) EIPSLKRTDEWKYLGI ATCAACTTCACTGCAACTGGGAGGGTTCGATGCAATCCGGCCGAGGAC NFTATGRVRCNPAE ATTGGTCCAAAGCTACAAAGATTGACAAAGGCCCCCCTCAAACCACAAC DIGPKLQRLTKAPLKP AGAGGATGTTCGCCCTTAGGACTGTCCTTATCCCACAGCTCTATCACAAG QQRMFALRTVLIPQL TTAGCCCTTGGGAGTGTGGCGATAGGCGTCCTACGAAAAACTGACAAAT YHKLALGSVAIGVLRK TAATAAGATATTATGTGCGAAGATGGCTAAATCTTCCGCTGGATGTGCC TDKLIRYYVRRWLNL GATAGCATTCATTCATGCACCCCCAAAAAGTGGAGGTCTCGGAATTCCA PLDVPIAFIHAPPKSG TCACTAAGATGGGTAGCTCCAATGTTAAGGCTAAGACGATTGAGTAATA GLGIPSLRWVAPMLR TTAAATGGCCTCACCTCACGCAAAACGAGGTAGCCAGCTCTTTCCTCGA LRRLSNIKWPHLTQN AGCCGAAAAACAACGGGCCCGAGATAGATTATTAGCAGAACAAAATGA EVASSFLEAEKQRAR ATTGTTATCGCGTCCGGCAATAGAAAAATATTGGGCGAACAAATTGTAC DRLLAEQNELLSRPAI CTCTCAGTTGATGGCAGCGGACTCCGTGAAGCGGGCCATTGGGGACCG EKYWANKLYLSVDGS CAACACGGGTGGGTTAATCAACCCACGCGTTTACTAACAGGAAAGGAAT GLREAGHWGPQHG ATATAGACGGTATTCGTCTGCGGATAAATGCCCTACCCACGAAGTCTCG WVNQPTRLLTGKEYI TACTACAAGGGGAAGGCACGAATTGGAACGACAGTGTCGTGCAGGATG DGIRLRINALPTKSRT TGACGCTCCCGAAACAACAAACCACATAATGCAAAAATGTTACCGATCG TRGRHELERQCRAGC CATGGGAGGCGCGTAGCTAGACACAACTGCGTAGTAAATCGAATCAAG DAPETTNHIMQKCY CGGGGACTTGAGGAGAGAGGCTGCGTGGTCATTGTTGAACCAAGTCTG RSHGRRVARHNCVV CAGTGCGAATCCGGCCTTAATAAACCGGACCTGGTGGCACTACGACAA NRIKRGLEERGCVVIV GATCACATTGATGTGATCGACATACAAATTGTGACAGACGGACACTCTA EPSLQCESGLNKPDL TGGATGATGCACACCAGCGCAAAATCAATAGATACGACAGACCGGACA VALRQDHIDVIDIQIV TACGAACTGAATTGCGTCGCAGATTCGAAGCCGCAGGTGACATTGAATT TDGHSMDDAHQRKI CCATTCTGCCACCCTGAACTGGAGGGGGATCTGGAGTGGTCAATCCGTT NRYDRPDIRTELRRRF AAAAGATTGATAGCAAAGGGTCTCCTCAGCAAATATGATAGTCATATCA EAAGDIEFHSATLNW TTAGCGTCCAGGTTATGAGAGGCAGTCTCGGTTGTTTTAAACAGTTCAT RGIWSGQSVKRLIAK GTACCTGAGCGGGTTTTCCCGAGATTGGACTTAGCTAAAACGTTTGGTT GLLSKYDSHIISVQVM CAAAACATTTGCTTGCTGTCTTGGCATAACATCAATAAAGGCATAAACAT RGSLGCFKQFMYLSG CGCAAAATAATGGTTATATATAAATGGCTATGAGGATGGTTTTAGTACG FSRDWT(SEQID TAGGCGTTGCGGAACTTCGGTTCAGATAGAGCAATGAATCGTGCATGCT NO:1488) AGGAAAACTGACCACACGCAGTGTTGGCAGCCCTAGTATCTTTCGATAG ATTTCCATACCTCCGCGATCAAAAAAAAAAAAAAAAAAAAAAA(SEQID NO:1120) R2 R2_DYa Drosophila GGGAACATGGGGTAAAGGTGAGTAGAGGGGGAGTATTTTTTATACTCT GGGAACAT TAGCTTA FERRIFPKGLVPLTKD yakuba GCAACTCATAAGTCTTGCCTTTACTCAAGTCGACTCAAAACCTCCTCGTG GGGGTAA AAACGT NHIGTTNLQNEPRIFT GTGTTTCCCGGTAATGTTAAACTTGTTTAGCAGCTAATTTGAGCGGCGA AGGTGAGT TTGGTTC NDLLTTRPSVDHVPE ATCTTTCCGAAAGGGTTGGTTCCCCTGACGAAGGATAATCATATTGGTA AGAGGGG ACATAC DQYEPNAAATLSRVP CCACAAATTTACAAAACGAGCCTCGGATATTTACTAATGATCTGTTGACG GAGTATTT ATCTGCC CTVCDRSFNSKRGLG ACCCGACCCTCCGTGGATCACGTCCCGGAGGACCAATATGAACCAAACG TTTATACT TGCTGC VHMRSRHPDELDEE CAGCGGCTACTCTATCAAGGGTTCCCTGCACAGTATGTGACCGGTCCTTT CTGCAACT CTTGGC RRRVDIKARWSEEEK AACAGTAAGAGAGGACTCGGTGTTCACATGCGATCTCGGCACCCAGAC CATAAGTC ACAATA WMMARKEVELMAN GAACTTGATGAAGAACGTCGACGTGTCGATATAAAAGCAAGGTGGAGT TTGCCTTT TCAAAA GFKHINKQLAVYFAN GAGGAAGAGAAGTGGATGATGGCGAGAAAGGAGGTCGAGCTCATGGC ACTCAAGT AGGCAT RSVEAIKKLRQRGDY AAATGGTTTTAAACACATAAACAAGCAACTAGCGGTGTATTTTGCAAAC CGACTCAA AAACAT KEKIEQIRGQSALAPE CGTAGCGTCGAAGCCATTAAAAAGCTGAGACAGAGGGGCGATTATAAG AACCTCCT CGCACA VANLTIRRRPSRSEQ GAGAAAATAGAGCAGATAAGAGGGCAATCCGCTCTCGCCCCAGAAGTT CGTGGTGT TATTGGT DHQVPTSEASPITPLE GCTAATCTAACCATAAGGCGCCGCCCTAGTAGAAGTGAGCAAGACCACC TTCCCGGT TATTTAC QSNREILRTLRGYSPV AAGTACCAACGTCAGAAGCATCTCCAATCACTCCGCTCGAACAGTCGAA AATGTTAA GGCTAT VCPSKWRAQELQTII CAGGGAAATTTTGCGGACGCTGCGTGGGTATAGCCCCGTAGTATGCCCT ACTTGTTT GAGGAT DRAEFEGKETTLQCLS TCCAAATGGAGAGCCCAAGAACTACAAACTATCATTGATAGGGCGGAAT AGCAGCTA GGTTTTA LYLQGIFPVQGVRHT TTGAGGGAAAGGAAACCACTCTCCAATGCTTATCGCTCTACCTCCAGGG A(SEQID GTACGT LTRPPRRPRNRRESR AATTTTTCCGGTACAGGGTGTACGACACACGCTGACGAGGCCTCCTCGG NO:1244) AGGCGT RQQYAVIQRNWDKH AGACCTCGGAATAGGAGGGAAAGCAGAAGGCAGCAGTACGCTGTCATC TGCGGA KGRCIKSLLNGTDESV CAGCGAAACTGGGATAAGCATAAAGGAAGGTGCATTAAGTCCCTGCTT ACTTCG MPSREFMEPYWREV AATGGAACTGATGAATCGGTAATGCCAAGCCGAGAATTTATGGAGCCCT GTTCGG MTQPSPSSCNGEVIR ACTGGAGAGAAGTAATGACTCAGCCTAGCCCAAGCTCTTGCAATGGAG ATAGAG TDHSLETVWSAITEQ AAGTGATTCGTACGGATCACTCGCTTGAGACGGTATGGTCTGCAATAAC CAATGA DLRASRVSLSSSPGPD GGAACAAGACCTTAGGGCATCAAGAGTTTCATTATCTTCATCTCCGGGG ATCGTG GVTPKTAREVPSGIM CCTGACGGGGTAACTCCAAAAACTGCCAGGGAGGTGCCGTCAGGTATT CATGCT LRIMNLILWCGNLPH ATGCTACGCATAATGAACCTAATTCTATGGTGCGGTAACTTACCTCATTC AGGAAC SIRLARTIFIPKTVTAK TATCCGACTGGCCAGAACCATCTTCATCCCGAAGACGGTGACGGCAAAG TGACCA RPQDFRPISVPSVLVR CGACCGCAAGACTTTCGTCCAATATCGGTGCCTTCCGTCCTGGTAAGAC AATAAC QLNAILATRLTSSIDW AGCTAAATGCCATCTTGGCAACCCGATTGACCTCATCAATCGATTGGGA AGCAGC DPRQRGFSPTDGCA CCCGCGCCAGCGGGGCTTCTCACCAACCGACGGTTGCGCCGATAATGCG CCTAGT DNATIVDLVLRHSHK ACGATAGTTGACTTAGTCCTGAGGCATAGCCATAAGTACTTTAAATCTTG ATCTTTC YFKSCYIANLDVSKAF CTACATCGCCAACTTAGATGTTAGCAAGGCATTTGACTCATTGTCACATG GAAGAT DSLSHAAIYGTLRAYG CAGCAATATATGGGACATTACGAGCTTATGGTGCGCCGAAGGGTTTTGT TTCCATA APKGFVDYVQKTYEG TGACTATGTACAGAAGACGTACGAGGGAGGTGGTATCAGTCTCAACGG CCTTTGC GGISLNGEGWCSEEF GGAAGGTTGGTGTTCAGAGGAATTCGTGCCTGCTAGAGGAGTGAAGCA GATCAA VPARGVKQGDPLSPI GGGCGACCCTTTGTCCCCCATTCTATTTAACTTGGTCATCGACCGGTTAC AAAAAA LFNLVIDRLLRALPSEI TTAGAGCCCTACCTAGCGAGATTGGTACCAAGGTCGGAAATGCCATGAT AAAAAA GTKVGNAMINAAAF AAACGCTGCTGCATTTGCAGATGATTTGGTACTATTTGCGGAAACTCGG AAAAA ADDLVLFAETRMGL ATGGGACTTCAAACTTTGTTGGACAAGACTGTGGACTTTTTATCCACCGT (SEQID QTLLDKTVDFLSTVGL CGGCCTTAAACTTAATGCTGATAAGTGCTTTACTGTCGGTATTAAGGGA NO: KLNADKCFTVGIKGQ CAGCCGAAACAGAAGTGTACTGTGCTAGAGGCACAGAGCTTCTGCGTA 1367) PKQKCTVLEAQSFCV GGCTCGAGAGAGATTCCAACACTGAAGCGTACTGACGAGTGGAAGTAT GSREIPTLKRTDEWK CTCGGTATACATTTCACTGCAAGTGGGAGGGTTCGATGCAATCCGGCAG YLGIHFTASGRVRCN AGGACATTGGTCCAAAGCTACAAAGATTGTCAGAGGCCCCCCTTAAGCC PAEDIGPKLQRLSEAP ACAACAGAGGTTGTTCGCCCTTCGGACTGTCCTGATCCCACAACTCTATC LKPQQRLFALRTVLIP ACAAGTTATCCCTTGGGAGTGTGACGATAGGCGTCTTACGAAAGACTGA QLYHKLSLGSVTIGVL CAAGCTAATACGTTTCTATGTGCGAAGATGGCTAAATCTTCCGTCGGAT RKTDKLIRFYVRRWL GTGCCGATAGCATTCGTTCATGCCCCCCCAAAATGTGGGGGTCTCGGAA NLPSDVPIAFVHAPP TTCCATCACTAAGATGGGTAGCACCAATGTTACGACTTAGACGATTGAG KCGGLGIPSLRWVAP CAACATAAAATGGCCCCACCTCGTACAATCCGAGGAAGCCAGCTCCTTC MLRLRRLSNIKWPHL ATCGAAGCGGAAAAACAAAGGGCCCGAGGTAGATTGATAGCTGAACAA VQSEEASSFIEAEKQR AATGAATTGTTATCGCGTCCGGCAATAGAAAAGTATTGGGCGAACAGG ARGRLIAEQNELLSRP TTGTACCTCTCCGTTGATGGTGGCGGACTCCGTGAAGCGGGCCACTATG AIEKYWANRLYLSVD GTCCCCAACACGGGTGGGTTAGTCAGCCCACGCGTTTACTAACAGGAAA GGGLREAGHYGPQH GGAATATTTAGACGGTATTCGGCTGCGGATAAATGCCCTACCCACAAAG GWVSQPTRLLTGKEY TCTCGCACTACGAGGGGAAGGCACGAATTGGAGAGGCAGTGTCGTGCA LDGIRLRINALPTKSR GGATGTGATGCTCCCGAAACAACAAACCACATTATGCAAAAATGCTATC TTRGRHELERQCRAG GTTCGCATGGGAGGCGAGTAGCTAGACACAACTGCGTAGTAAATCGAA CDAPETTNHIMQKCY TCAAGCGGGGACTTGAGGAGAGAGGCTGCGTGGTCATTGCCGAACCAA RSHGRRVARHNCVV GTCTGCAGTGCGAATCCGGCCTTAATAAACCGGACCTGGTGGTACTCCG NRIKRGLEERGCVVIA ACAAAATCACATTGATGTGATTGACGTGCAGGTTGTGACAGACGGACAT EPSLQCESGLNKPDL TCTATGGACGAAGCGCATCAGCGCAAAATCAATAGATACGACAGACCG VVLRQNHIDVIDVQV GACATACGAACTGAATTGCGTCGCCGATTCGAAGCCGCAGGTGACATC VTDGHSMDEAHQRK GAGTTTCATTCTGCCACCCTAAACTGGAGGGGGATTTGGAGTGGTCAAT INRYDRPDIRTELRRR CCGTAAAACGATTGATTGCGAAGGGTCTCCTCAGCAAATATGATAGTCA FEAAGDIEFHSATLN TATCATTAGCGTCCAGGTTATGAGAGGCAGTCTTGGCTGTTTTAGACAG WRGIWSGQSVKRLI TTCATGTACCTTAGCGGGTTTTCTCGTGATTGGACGTAGCTTAAAACGTT AKGLLSKYDSHIISVQ TGGTTCACATACATCTGCCTGCTGCCTTGGCACAATATCAAAAAGGCATA VMRGSLGCFRQFMY AACATCGCACATATTGGTTATTTACGGCTATGAGGATGGTTTTAGTACGT LSGFSRDWT(SEQID AGGCGTTGCGGAACTTCGGTTCGGATAGAGCAATGAATCGTGCATGCT NO:1489) AGGAACTGACCAAATAACAGCAGCCCTAGTATCTTTCGAAGATTTCCAT ACCTTTGCGATCAAAAAAAAAAAAAAAAAAA(SEQIDNO:1121) R2 R2_KF GU949558 Kalotermes GAGAGGTCACTGTTGCTGATCAGCTGGACACCTAGCTGACTGCGTGCAT GAGAGGT TAATGTC MEVPLTSSLGSGATQ flavicollis GCGTGCACGCCGCGCGCCGCCTCCTCGTGGTACCGCTGGTAACGCAGG CACTGTTG CCTTTTG APGTPELLGEHTVER GTCACTGAGACTACCTGCGGCTAAAGGCGCGCGCGAGGGATTGTAAGT CTGATCAG GCTTGC PGLDQGHSYGLLMD CCACCACCTCCACGTGGTTCCCCGCGGGCAACGGCATAGTTCATCTGCC CTGGACAC CCCCACC DVELPVRLPFFGPLIC GGCCAAGCGTGCCGTTCCCTCGATCCTCCTCTGATTAGGGATAGGAGGG CTAGCTGA TGCTTAA PGCRTLLTSEETISSH GGGCGGTGGCTCCCGCCGACGACCTGGCAAACACCTTGGACACGCTAA CTGCGTGC AGGAAC HRRVHPDARTRWVC GATAATAGGCCGTCCTCCGGGCCGGCCGAATCATAAGCACACAATGGA ATGCGTGC TGGCAG YGCDSPFMTYRAIKC AGTCCCTCTAACATCAAGCCTCGGATCTGGGGCAACCCAAGCTCCCGGG ACGCCGCG GAAAGA HLPKCSGRKVVTGDH ACCCCAGAATTATTAGGGGAGCACACCGTTGAACGACCTGGCCTTGATC CGCCGCCT GAGTGA ICNGCTKRFESQRGLS AAGGACATTCTTATGGTTTGTTAATGGATGACGTAGAGCTGCCGGTGAG CCTCGTGG TCCGTG LHKRRAHPGLRNEE GTTACCATTTTTTGGGCCTCTCATATGCCCAGGATGCAGGACGCTGCTCA TACCGCTG CCATAG MLEPPVRAERRPNA CCTCTGAGGAAACGATTAGCAGCCATCATAGGAGGGTCCACCCTGATGC GTAACGCA AAATAT HKSSIWSIDEIRILEQY CCGGACGAGGTGGGTCTGCTATGGTTGTGATTCCCCATTTATGACATAC GGGTCACT GGTTAT EAAYVGDLHINMKIA CGGGCCATTAAATGTCACTTACCTAAATGCTCTGGCAGGAAGGTAGTGA GAGACTAC CCGGGG AHLPFKTNKQVSNYR CAGGGGATCACATCTGTAATGGCTGCACAAAGCGGTTTGAGTCTCAGA CTGCGGCT CAAGTC NDRRKKSRTATDASQ GAGGTCTCTCACTGCATAAAAGGAGAGCCCATCCGGGTCTTCGGAACG AAAGGCG ACTAGC QGLGPNDGNRGIVP AGGAAATGCTTGAGCCTCCTGTGAGAGCTGAGCGACGCCCTAACGCAC CGCGCGA AATATG SGQSSPLFLEGSDAE ATAAATCCTCAATATGGAGCATTGATGAGATTCGTATCCTGGAGCAATA GGGATTGT GGACTT GDEDVFNVLVPPTLG TGAGGCTGCCTACGTCGGCGATCTCCACATCAACATGAAGATAGCTGCT AAGTCCAC CTCCGG GLEPAGQVHSLSEGE CACCTGCCGTTCAAAACCAATAAACAAGTGAGCAACTATCGGAACGATC CACCTCCA GTCCGT TSPLVGEADPCFMG GCCGTAAAAAGTCCAGAACGGCGACTGACGCGAGCCAACAGGGACTTG CGTGGTTC GCGGTC GTPSAGEASGSTLLG GTCCCAATGACGGCAATAGAGGAATTGTCCCCAGCGGGCAGTCATCCCC CCCGCGGG CTTCCAA PDPTPADGYSLVRKD GTTATTTTTAGAGGGCTCTGACGCCGAGGGGGACGAGGATGTTTTTAAT CAACGGCA CATGAG LQLSVQTSPLLAVGS GTCCTAGTGCCGCCCACCCTAGGGGGTCTTGAACCTGCTGGACAGGTTC TAGTTCAT CTGGAC VGTESVQFERGVLSC ATTCACTCTCTGAGGGGGAAACTTCGCCGCTGGTCGGCGAAGCGGATCC CTGCCGGC GTAGTC GTPPEFLHPEQFAHC CTGTTTTATGGGTGGCACGCCGTCTGCCGGAGAGGCCTCGGGCTCAACC CAAGCGTG CACTCTA ANNDPVLNASEEQV TTGTTGGGTCCGGATCCCACCCCGGCAGATGGATATTCTCTGGTCAGGA CCGTTCCC TGACTT HAPLGEEANDLPDN AAGACTTACAGCTCAGTGTGCAGACATCCCCGTTATTAGCAGTGGGCTC TCGATCCT GAACGA NHPSELGVDPEDPTC TGTCGGCACTGAGTCTGTGCAATTTGAACGTGGGGTGTTATCGTGCGGT CCTCTGAT TACGGG SPATEQVQPSSEEEA ACGCCACCAGAATTTTTACACCCGGAACAGTTTGCACATTGTGCAAACA TAGGGATA GGCGTA DDPFAQFKAWRRRV ATGACCCGGTTTTGAACGCGTCAGAGGAGCAGGTGCACGCCCCACTAG GGAGGGG TCTCCCC ASYALKIETGVLPAQV GGGAAGAAGCCAATGACCTCCCTGACAACAACCATCCGTCGGAGCTGG GGCGGTG CGGAAG DDLLRRLRDGDTQSK GGGTTGACCCCGAAGACCCGACGTGTAGTCCCGCAACGGAACAGGTCC GCTCCCGC AGGTCG VTCEEVEEVVLSLTRT AACCCTCTTCTGAGGAAGAAGCCGATGACCCGTTTGCGCAATTCAAGGC CGACGACC CCTAGG ILGGTAPKKRVEGRT GTGGCGAAGAAGGGTGGCTTCCTATGCCTTGAAGATCGAGACCGGGGT TGGCAAAC CGACTT KWTYKSRTNHEARK TCTTCCAGCCCAAGTAGATGACCTCCTTCGGAGACTGAGGGATGGGGAC ACCTTGGA AGAA RIMYARCQDLYRRRP ACACAGTCTAAGGTAACGTGCGAGGAGGTGGAGGAAGTGGTTTTATCC CACGCTAA (SEQID QRPVERAVGYQAEES CTCACTAGAACGATACTGGGTGGCACGGCACCCAAAAAGCGCGTTGAG GATAATAG NO: LLDNQDERPSHGAFE GGTCGAACGAAGTGGACGTACAAAAGTCGAACCAACCATGAGGCTCGC GCCGTCCT 1368) TFYTGLWGKSGQCNI AAGCGTATAATGTACGCTAGATGCCAGGACTTATATCGCAGGCGGCCTC CCGGGCCG TMPPGVPRHTGHVL AGCGACCTGTGGAGAGGGCCGTCGGATACCAGGCGGAGGAATCTCTTC GCCGAATC REVTPKDIYSRLRKLK TTGACAACCAAGATGAGAGACCATCCCACGGAGCCTTTGAAACCTTCTA ATAAGCAC KDYAPGPDGVTKLKV TACTGGGCTCTGGGGTAAGTCCGGTCAATGCAATATCACAATGCCTCCT ACA(SEQ QSMGAYPSLLAKVY GGTGTGCCTCGGCACACCGGGCATGTACTGAGGGAAGTTACGCCAAAG IDNO: NLVMLTGYFSSCWK GACATCTACTCGAGGCTGAGGAAACTAAAGAAAGACTATGCACCCGGC 1245) EHKTSLIPKDRGSPM CCGGATGGCGTAACTAAATTGAAGGTGCAGAGCATGGGAGCATATCCC DVSNWRPITIGSLLSR TCACTTCTTGCAAAGGTATATAATCTGGTGATGCTAACCGGTTACTTCTC IYTGLIERRLRTVSDIH CAGTTGTTGGAAGGAACATAAGACCTCTCTAATTCCTAAGGATAGAGGA QRQVGFMPVNGCA AGCCCAATGGATGTCAGTAATTGGAGACCGATTACGATTGGCTCCTTAC ANLFIFDECIRQAKKE TCAGTCGGATTTATACGGGTCTCATAGAACGGAGGCTGAGGACAGTTTC GTIVGSLIDVAKAFDT TGACATCCACCAAAGGCAGGTAGGGTTTATGCCCGTCAATGGTTGTGCC VPHEAILRALSSQGV GCCAACTTGTTTATATTTGATGAATGCATCCGACAAGCCAAAAAAGAGG DEHTMAHIRDMYSG GGACAATTGTCGGCAGTCTTATAGACGTCGCCAAAGCATTTGATACCGT IRTRINGKGSDIPLVR ACCCCATGAAGCCATCCTACGGGCGTTGAGCTCGCAAGGTGTAGATGA GVKQGDPLSPMLFN GCACACGATGGCTCACATAAGAGACATGTATTCCGGTATCCGTACGCGG MVMDPLIRDLQRKG ATCAACGGCAAAGGGAGCGATATCCCTCTGGTGAGAGGAGTGAAGCAA FRIGGHEIGALAFAD GGTGATCCTCTATCCCCAATGCTCTTTAATATGGTGATGGACCCTCTCAT DIVLLADSIDGAQDH ACGAGACCTACAGAGGAAGGGCTTCAGAATTGGCGGCCACGAGATAGG VDQVGRYMNKLGM GGCGTTGGCCTTTGCGGATGACATCGTCTTGTTGGCGGACTCCATTGAT TLNPRKSSSFLITAMR GGGGCCCAGGATCATGTAGATCAGGTTGGGCGCTACATGAATAAACTG KTWICRDPGLSIGETK GGCATGACCCTCAATCCACGAAAATCGTCCTCCTTCCTCATCACTGCGAT VPGARPSSALKYLGV GAGGAAGACCTGGATATGTAGGGACCCGGGCTTGTCAATAGGGGAGA NYTLSEGLESGALIDK CTAAGGTGCCTGGCGCCAGACCCTCCTCTGCACTTAAATATCTCGGAGT LMQAVNRARGLALK AAACTACACACTCTCGGAAGGCCTGGAGAGTGGTGCCTTGATAGACAA PLQKVNLILERIIPKFL GCTCATGCAGGCCGTTAACAGAGCACGAGGTTTGGCCCTTAAACCGCTG YGIILGGPSLTRLHAA CAAAAGGTCAACCTCATACTTGAACGAATAATCCCCAAATTTTTGTATGG DKCVRMAVKEILHLH TATCATCCTTGGCGGGCCTAGTTTGACTAGGCTGCATGCAGCCGACAAG PSTTDHVLYARKKDG TGTGTCAGGATGGCTGTAAAGGAGATCCTGCACCTCCATCCTAGTACAA GMGIPRLAHLVRLAS CAGACCATGTACTATATGCCAGAAAAAAGGATGGGGGAATGGGGATTC LRSGLALLASGDVAV CCCGACTGGCTCACCTTGTCCGCCTAGCCAGTCTCAGGTCGGGGTTAGC QAAGMAGDLEGRCK TCTGTTAGCGAGTGGGGACGTTGCCGTTCAAGCAGCGGGGATGGCAGG KVANDLRLNWPVTL CGACCTGGAAGGCCGCTGCAAGAAAGTCGCAAATGATTTGCGGCTTAA RDVVRASNKFKSQES TTGGCCGGTGACGCTGAGAGATGTGGTGAGAGCCAGCAATAAATTTAA KDWERLASQGHGVK ATCTCAAGAATCAAAAGATTGGGAGCGTCTGGCCTCCCAGGGTCACGG DFRNDRLGNCWLYD AGTCAAGGACTTCCGGAATGACCGTCTTGGGAATTGCTGGCTCTATGAT PTVLSSSRYTDALRLR CCGACGGTGTTGTCCAGCAGCAGATACACTGATGCTCTCAGGCTGAGGA TNTFGVNVALRRADK CTAACACGTTCGGCGTCAACGTTGCCCTAAGGAGGGCCGACAAGGACC DLEVNCRRCHGKPET TTGAGGTCAACTGTCGGCGATGCCACGGGAAACCTGAAACCTTAGGGC LGHVLGECVAGKGM ACGTGCTCGGAGAGTGTGTGGCAGGTAAGGGTATGAGGATACAACGAC RIQRHDKMAAFVAT ACGACAAGATGGCGGCCTTTGTGGCTACAAAATGTGAAGAAAAGGGGT KCEEKGYQTTREQLF ACCAGACGACCCGCGAACAGCTATTCTCCATCGAGCAGGGAAAACTCAA SIEQGKLKPDLVVIDG ACCTGACCTCGTCGTAATAGACGGGGAAAGGGCTCTCATTGTCGACGTA ERALIVDVTVRFESG ACAGTCCGCTTTGAGAGTGGGAACGCCCTCTCACGGGGAGCCTCTGAG NALSRGASEKIEKYQP AAGATCGAAAAGTATCAGCCTTTGGCTGACTATTTTGTCTCTCAGGGGG LADYFVSQGAVREAN CTGTTCGAGAAGCCAACGTGTTACCGATTGTTGTCGGTTCCAGGGGAGC VLPIVVGSRGAITQAT AATCACCCAGGCCACATTGAAGTCCCTAGCTACGCTAGGGTTAGATGTG LKSLATLGLDVERVGK GAACGTGTTGGCAAGTACCTCGCCATCTGTGCCGTTGCCTCCAGCGTTG YLAICAVASSVEIACM AGATCGCATGCATGCACCTGGATTACACATAATGTCCCTTTTGGCTTGCC HLDYT(SEQIDNO: CCCACCTGCTTAAAGGAACTGGCAGGAAAGAGAGTGATCCGTGCCATA 1490) GAAATATGGTTATCCGGGGCAAGTCACTAGCAATATGGGACTTCTCCGG GTCCGTGCGGTCCTTCCAACATGAGCTGGACGTAGTCCACTCTATGACTT GAACGATACGGGGGCGTATCTCCCCCGGAAGAGGTCGCCTAGGCGACT TAGAA(SEQIDNO:1122) R2 R2_RL GU949555 Reticul- CTCACTGCTGTCATGCCATTGTTAATGCGTTGGGTGATGGCGGGTGATG CTCACTGC TGAGTT MMADYNNSVDHAL itermes GGAGACGAGTTACAGCAGAGCTGGCTTCACGGGGGGGGCTGTAGTTGC TGTCATGC AGATAT EDNTRLIFARDAVLA lucifugus CCGAACCAGTCTGCTTCTGAGTGCCTCCACGTGGCCTCGCTGGAAACGT CATTGTTA ATATGA RVCGPFDNLECGLCG CGGAGCTGCTTACGGCTAACCCGTAGACAACTCCGGCGGCCAAACTCAG ATGCGTTG CGACTG VLLTSLQGVREHCHR AAAACGGCCTTACTACCAGGGTCATCCCTCGCGGCGGCGCTAAGACCTT GGTGATG ATTCTTC AHHNLDLTFQCTKCD ACTGTATGTACTACGGACTACAGTTGAGTAGCGCGGGAGAGCACTTGAT GCGGGTG CCTAACT KGFSSYRGICCHFSKC GTGAGGGTAGCAACTTGTTGCTACTTCGACTGTCTCCCTGAAGATTTCCG ATGGGAG AAAATA IGARISVSEGPLSCSEC AAGGGGTGCCGCAAGTCCATGGGGGCGGTCCTGCTGTGGGAGTTATCT ACGAGTTA TAATTTG EREFDSKRALSTHER TGCACCCGGGAAGCCCTCAGTGGATAAATACTCCAAAATGGCTTAGCCA CAGCAGA GAATGT HMHPGIRNAKRLKD CCTCACGTGGAGAGAGGGGAATTAGGGGTTTACTCCTAAGGGCCGATG GCTGGCTT GCATTA FNPRGGGGKTIHGN CCTCCAATCGGATTTCCCCGGGCAACGGCTATTATCAGCCGGCTAAGTTC CACGGGG CTTATAC TKWTEEEVQLLVSLS GGACCCTCATCTACTGATGGGATACTCTCTCTCCCCCCGCCTGCGTGATT GGGGCTG TTTATTG KRFEGYKSINKEISLIL AGTTGGGAATGCACGCGGGCGGGTGATTCTGACAAGCCCAGCCCAAAA TAGTTGCC TATTTAT TSKTCKQISDKRRYLN GTTCTTACCACGACGTACCCGGGGGTGTCACCCGAACTAATATATTTTCC CGAACCAG TGACAT LHNGNGGLAAAEAV GCCGATCGGTTCGCTGTGGGGGGATGTCGGGGCCTGGTGGCCGGGAA TCTGCTTC CCGTAG LEFCEDSHPEVTESD CCGTCAGCCGCTCTTTGAGTGTCCTGCGTAACCCGGAGGCGGTGACGTC TGAGTGCC TAGTCT GAVLSEIMDEECHQS GAAGGGCTAGGCTAGCACGGTCCAGACCCCTGAACGCCTGGGGAGACG TCCACGTG GTTTAGT SVTMRSSIVHGDIGR GGCCCACCACCTGAGTAGGAGTGGGTCCTTTACCTCATTTGAGGTGTCT GCCTCGCT GGATTT EVQGKELVRIPPDNS CCTCGTTCTTGTAGGGGGCGAAGGACGAGGAATGCATCCGTCCCTCCAG GGAAACGT AGATCG VMGNCVVLLRKLAT GATGCTGGTGGTTTCCGTCTGGTGGGCTCATGCTGAAACGCGCAGGTGC CGGAGCTG TTAACCA DKRSDLDLSKDKELRI CGATTTGTCGAAGAGGACATATGGGGTAACCCATAGAACCTAGGCAGG CTTACGGC TTCTGTG DIEKATKANRESADG AGTAATCCCTTAGCTGGGGGGGTCAGCTGGTGGGTCCTGTCAATTTATC TAACCCGT ACGTCG RVIQSVAGNEIDPDT CTCCCCTCCATGCCAGAGCCGGTCCGAGGTTAGAGACGGACTCATTTTC AGACAACT CTGATG FQWKELLLGQVRGF TCCTTTTTATATGTCATGATGGCCGATTATAATAATTCTGTTGATCATGCA CCGGCGGC TCTATAT PRVDENSELFDLDDK CTTGAGGATAATACTAGATTGATATTTGCGCGTGACGCAGTACTTGCGA CAAACTCA GTACCA LTKELSSDSPVWNDN GAGTATGTGGACCTTTTGATAACTTGGAGTGCGGGTTGTGTGGGGTGCT GAAAACG GTGGCA CELIVSDLCQVLCKKK CCTGACGTCCCTTCAGGGCGTTCGAGAGCATTGCCACCGGGCGCACCAT GCCTTACT GAATGC YELGRQHHVRKGKR AACTTGGATTTGACTTTTCAATGTACTAAATGCGATAAAGGGTTTTCCTC ACCAGGGT TGACTA HRGIHHKREKFRECQ GTATAGGGGAATTTGTTGTCATTTTTCAAAGTGTATAGGAGCTCGTATAT CATCCCTC AGAACA KIFRKSPRKLAEYLYR CTGTTTCAGAGGGCCCCCTTAGTTGTTCAGAATGTGAAAGGGAATTTGA GCGGCGG AAATAA DKDLSHISKDASTPQ CTCTAAAAGAGCTCTTTCAACCCACGAGAGGCATATGCACCCAGGGATT CGCTAAGA TTTTAAA GIEQYYSQLWGEPEL AGGAATGCTAAGAGGCTCAAAGATTTTAATCCGCGGGGGGGGGGTGGT CCTTACTG CAATAG LESNTIEEKLPSSSLFD AAAACTATTCACGGCAATACTAAGTGGACTGAAGAGGAGGTCCAGTTG TATGTACT ACAGCA CLPPITPEEVEGRIHKI CTTGTATCACTTAGCAAGAGATTTGAGGGGTATAAGTCTATCAATAAAG ACGGACTA CTAAGA RPSSAPGLDGVRKIH AAATTAGCCTTATTTTAACATCAAAAACTTGTAAACAAATTTCTGATAAA CAGTTGAG AATTCTG LVGKGITLVLVKLYNL CGTCGCTATTTAAATTTGCATAATGGAAATGGGGGGCTAGCTGCAGCGG TAGCGCGG CAAACT LFLTGGYPECWKRNR AAGCTGTATTAGAATTTTGTGAAGATAGTCACCCTGAAGTTACTGAGAG GAGAGCA GGAACG TVFIPKIGKDLSEVGG CGATGGGGCGGTGCTTTCTGAAATCATGGATGAGGAATGTCATCAGTCA CTTGATGT TGGCCC WRPLTIGSLLARMYS TCGGTAACTATGAGGTCAAGCATCGTACATGGCGACATTGGTCGTGAG GAGGGTA ACGGTA AFLERRIRRVTSLSLS GTTCAAGGGAAGGAATTGGTCCGGATCCCGCCTGATAATAGTGTAATG GCAACTTG GGCCAA QRGFTNIQGCHVNLT GGGAATTGTGTTGTTCTACTTAGGAAACTTGCTACTGACAAAAGAAGTG TTGCTACT TATACC ILKEGIRQAKVKNGG ATTTGGACTTGAGTAAGGATAAAGAGTTGCGCATTGACATTGAGAAGG TCGACTGT GGAAAG VIVSVDIEKAFDTIPHS CGACGAAAGCTAACAGAGAAAGCGCAGACGGTAGGGTAATTCAATCTG CTCCCTGA GGAAAA VIFSRLASQGVPPLLR TGGCTGGTAATGAGATCGATCCCGATACTTTTCAATGGAAAGAATTGCT AGATTTCC TGACAC KIISNMYKDVYTVIEG GCTTGGGCAGGTACGGGGGTTTCCACGAGTTGATGAGAACTCTGAATT GAAGGGG ATCCCCC QCIPIKRGVKQGDPL ATTTGATTTGGATGACAAATTGACTAAGGAATTATCTTCCGACAGCCCCG TGCCGCAA CTTAAA SPLLFNIAIDPVLRSLE TGTGGAATGACAACTGCGAACTTATTGTGTCGGACTTATGTCAGGTCTT GTCCATGG (SEQID EFQGGLPLGNSAIKIL ATGCAAAAAGAAATATGAGTTAGGTAGGCAGCATCACGTTAGGAAAGG GGGCGGT NO: AFADDIILGASSAGQ AAAGAGGCATCGGGGTATACATCATAAGAGGGAGAAATTTAGGGAATG CCTGCTGT 1369) AQQMVDMLGIGLTS CCAGAAGATATTTAGAAAGTCCCCCAGAAAGCTTGCCGAATATCTCTAC GGGAGTTA CGLGVSHRKCFGFQI CGTGATAAGGACCTGTCTCATATCAGTAAAGACGCTAGCACTCCGCAGG TCTTGCAC VNKNKTWTIVDPMI GTATAGAGCAATACTATTCTCAGTTATGGGGGGAACCTGAATTACTAGA CCGGGAA TLNGSSLPFSGPEDRL ATCCAACACAATTGAAGAAAAGTTACCGTCCAGTTCCTTATTTGACTGTC GCCCTCAG PYLGVDINPWDRKSR TTCCCCCCATCACGCCGGAGGAGGTTGAGGGAAGAATCCATAAGATTC TGGATAAA YDAGQRLISAAKRGS GACCGTCTTCCGCCCCAGGGTTGGATGGGGTACGCAAGATCCATCTTGT TACTCCAA QLSLKPQQKINLITAF CGGTAAAGGGATAACCCTGGTCCTTGTTAAACTCTACAACTTGCTATTTT AATGGCTT LLPKFLYILIEDPPSPA TGACTGGGGGATATCCTGAGTGTTGGAAGCGGAACAGGACCGTCTTTA AGCCACCT YLKSIDHDLRQIYKNIL TACCGAAGATTGGGAAAGATCTGAGTGAGGTGGGAGGATGGAGACCG CACGTGGA HLPNCVSTAFMYSPK TTGACAATTGGTTCCCTATTAGCTCGGATGTATTCAGCCTTCCTAGAGAG GAGAGGG RDGGLGLPRLSCLVPL GAGGATTAGGAGGGTGACCTCCTTGTCCTTGTCGCAGAGGGGCTTTACG GAATTAGG AHLKAGIKLGSLQDSL AATATCCAAGGTTGTCACGTTAATTTGACTATCCTCAAAGAAGGCATTCG GGTTTACT VREITTSDRFVRTMG ACAGGCCAAAGTCAAGAACGGGGGAGTTATAGTCTCAGTAGACATAGA CCTAAGGG SVAHSLWASWSLTL GAAGGCTTTTGACACAATCCCTCATTCTGTAATTTTCTCCCGCTTGGCGTC CCGATGCC QDIYKLKSALKRREAK TCAAGGTGTTCCTCCCCTTCTTAGGAAAATTATTTCCAATATGTATAAGG TCCAATCG AWESCVSQGQGAA ATGTCTACACTGTGATTGAGGGGCAGTGTATTCCAATAAAGAGAGGAG GATTTCCC QFRGDSIGNNWLHN TTAAACAAGGGGATCCTCTTTCTCCACTACTTTTCAATATTGCCATCGATC CGGGCAAC PGTYRPGQYIEALKLR CTGTGTTGCGCAGTTTGGAGGAGTTTCAGGGGGGTCTCCCACTTGGTAA GGCTATTA ANLTGVRVNLKRSGY CTCGGCTATTAAGATCTTGGCCTTTGCTGATGATATTATATTAGGAGCTA TCAGCCGG NVPITCRFCKDIPETQ GTTCGGCGGGTCAGGCGCAGCAGATGGTGGATATGTTGGGAATAGGAC CTAAGTTC AHVLGLCPKTKGMRI TTACTTCGTGTGGGTTGGGAGTATCCCATAGAAAATGTTTTGGATTTCAA GGACCCTC QRHDSIVNRVRDKLK ATCGTGAATAAAAACAAGACCTGGACAATTGTTGACCCGATGATCACAC ATCTACTG TKSPVALMHEQNFT TCAATGGTTCTTCATTACCCTTCTCAGGGCCAGAGGACAGGTTGCCTTAT ATGGGATA VEEGQVFKPDIVTILG CTCGGGGTGGATATAAACCCATGGGATAGGAAATCACGGTATGACGCT CTCTCTCTC EVGYVIDVTVRYDDR GGACAGAGACTTATTTCGGCGGCCAAACGAGGCTCTCAACTATCCCTTA CCCCCGCC DYIKDASVEKIRKYEA AGCCTCAGCAAAAGATAAATTTAATAACAGCTTTTCTTTTGCCAAAGTTT TGCGTGAT LKGYLKDLYPQLNKV CTGTACATTCTGATTGAAGATCCGCCTTCGCCTGCCTATCTGAAATCCAT TAGTTGGG EVLPLVFGSRGAVPG CGATCATGACCTGAGGCAGATATATAAGAATATTTTGCACCTCCCTAACT AATGCACG STVHNMGLLGFTKRE GTGTGTCAACCGCCTTTATGTACTCTCCCAAGAGGGATGGTGGTTTGGG CGGGCGG MVHISRKVIADSLIIS GCTACCGCGTCTGTCCTGTCTAGTTCCACTTGCCCATTTAAAAGCCGGGA GTGATTCT NFLEVY(SEQIDNO: TCAAGCTCGGTTCCCTCCAAGATTCTTTGGTTAGGGAAATTACCACCTCA GACAAGCC 1491) GACAGGTTTGTACGTACAATGGGTTCGGTAGCTCACTCCCTTTGGGCAT CAGCCCAA CATGGTCTCTAACGTTGCAGGATATCTATAAACTTAAGTCCGCACTGAAA AAGTTCTT CGTAGGGAAGCTAAGGCATGGGAGAGTTGTGTGTCTCAGGGACAAGG ACCACGAC AGCAGCGCAATTTCGCGGGGACTCAATTGGAAACAACTGGCTCCATAAT GTACCCGG CCGGGAACCTATCGACCTGGACAGTATATCGAAGCACTTAAGCTTCGTG GGGTGTCA CTAATTTAACAGGGGTTAGAGTCAACTTGAAGCGTTCAGGCTATAATGT CCCGAACT TCCAATAACTTGTAGGTTCTGTAAGGATATACCTGAAACGCAGGCTCAT AATATATT GTTCTTGGACTTTGTCCAAAGACCAAGGGAATGAGAATACAAAGGCAC TTCCGCCG GATTCAATTGTGAATAGGGTGCGAGATAAACTTAAGACTAAGTCGCCCG ATCGGTTC TTGCTCTCATGCATGAGCAAAACTTCACCGTTGAGGAAGGCCAGGTGTT GCTGTGG CAAACCAGATATAGTGACGATTTTGGGAGAGGTGGGTTACGTAATTGAT GGGGATG GTGACAGTACGGTATGACGATAGAGATTACATCAAGGATGCCTCTGTG TCGGGGCC GAAAAAATCAGGAAATACGAGGCCCTTAAAGGGTACCTGAAGGATTTA TGGTGGCC TATCCACAGCTGAATAAGGTAGAGGTTTTGCCTCTTGTCTTTGGTAGCAG GGGAACC AGGGGCCGTGCCAGGTAGTACTGTTCACAACATGGGATTGTTAGGTTTT GTCAGCCG ACGAAACGTGAAATGGTCCACATCTCTCGCAAAGTTATAGCCGACTCAC CTCTTTGA TGATAATCTCTAATTTTTTAGAGGTTTATTGAGTTAGATATATATGACGA GTGTCCTG CTGATTCTTCCCTAACTAAAATATAATTTGGAATGTGCATTACTTATACTT CGTAACCC TATTGTATTTATTGACATCCGTAGTAGTCTGTTTAGTGGATTTAGATCGT GGAGGCG TAACCATTCTGTGACGTCGCTGATGTCTATATGTACCAGTGGCAGAATG GTGACGTC CTGACTAAGAACAAAATAATTTTAAACAATAGACAGCACTAAGAAATTC GAAGGGC TGCAAACTGGAACGTGGCCCACGGTAGGCCAATATACCGGAAAGGGAA TAGGCTAG AATGACACATCCCCCCTTAAA(SEQIDNO:1123) CACGGTCC AGACCCCT GAACGCCT GGGGAGA CGGGCCCA CCACCTGA GTAGGAGT GGGTCCTT TACCTCAT TTGAGGTG TCTCCTCG TTCTTGTA GGGGGCG AAGGACG AGGAATGC ATCCGTCC CTCCAGGA TGCTGGTG GTTTCCGT CTGGTGG GCTCATGC TGAAACGC GCAGGTGC CGATTTGT CGAAGAG GACATATG GGGTAACC CATAGAAC CTAGGCAG GAGTAATC CCTTAGCT GGGGGGG TCAGCTGG TGGGTCCT GTCAATTT ATCCTCCC CTCCATGC CAGAGCCG GTCCGAGG TTAGAGAC GGACTCAT TTTCTCCTT TTTATATG TC(SEQID NO:1246) R2 R2_RU GU949554 Reticul- CACACTGCTGTCATGCCATTGTTAATTCGTTGGGTGATGGCGGGTGATG CACACTGC TGAGTT MMADYNNSVDHAL itermes GGAGACGAGTTACAGCAGAGCTGGCTTCACGGGGGGGCTGTAGTGGC TGTCATGC AGATAT EDDTRFIFARDSVLAR urbis CCGAACCAGTCTGCTTCTGAGTGCCCCCAAGTGGCCTCGCTGGAAACGT CATTGTTA ATATAA VCGHFDNLKCELCGV CGGAGCTGCTTACGGTTAACCCGTAGACAACTCCGGCGGCCAAACTCAG ATTCGTTG CGACTG LLTSLQGVREHCHRS AAAACGGCCTTACTACCAGGGCCATCCCTTGCGGTTGCGCTAAGACCTT GGTGATG ATTCTTC HHNLDLTFQCTKCDK ACTGTATGTACTACGGACTACAGTTGAGTAGCGCGGGAGAGCACTCGA GCGGGTG CCTAACT GFSSYRGICCHFSKCR TGTGAGGGTAGCAACTTGTTGCTTCTTCGACTGTCTCCCTGAAGAGTTCC ATGGGAG AAAATA GARISVSEGPLSCSEC GAAGAGGTGCCGCAAGTCCATGGGGGCGACCCGGCTGTGGGAGTTATC ACGAGTTA TAATTTG ERKFDSKRALSTHER CTGCACCCGGGAAGCCCTCAGTGGATAAATACTCCAAAATGGCTTAGCC CAGCAGA GAATGT HMHPGIRNAKRLKD ACCTCACGTGGAGAGAGGGGAATTAGGGGTTTACTCCTAAGGGCCGAT GCTGGCTT GCATTA FNPRGGGKTIHGNTK GCCTCCAATCGGATTTCCCAGGGCAACGGCTATTATCAGCCGGCTAAGT CACGGGG CTTATAC WTEEEVQLLVSLSKR TCGGACCCTCATCTACCGATGGGATACTCTCTCTCCCCCCGCCTGCGTGA GGGCTGTA TTTATTG FEGYKSINKEISLILTSK TTAATTGGGAATGCACGAGGGCGGGTGATTCTGACAAGCCCAGCCCAA GTGGCCCG TATTTAT TCKQISDKRRYLNLH AAGTTCTTACCACGACGTACCCGGGGGTGTCACCCGAACTAATATATTTT AACCAGTC TGACAT NGNGGLAAAEAVLV CCACCGATCGGTTCGCTGTGGGGGGATGTCGGGGCCTGGTGGCCAGGA TGCTTCTG CCGTAG FCDDSHLEVTDSDGA ACCGTCAGCCGCTCTTTGAGTGTCCTGCGTAACCCGGAGGCGGTGACGT AGTGCCCC TAGTCT VLSEIMDEEYYQSSLT CGAAGGGCTAGGCTAGCACGGTCCAGACCCCTGAACGCCTGGGGAGAC CAAGTGGC GTTTAGT MRSSIVHGDIGREVQ GGGCCCACCACCTGTGTAGGAGTGGGTCCTTTACCTCATTTGAGGTGTC CTCGCTGG GGATTT GKDLVRIPPDNSVM TCCTCGGTCTAGTAGGGGGCGAAGGACGAGGGATGCATCCGTCCCTCC AAACGTCG AGATCG GNCVVLLRKLATEKR AGGATGCTGGTGGTTTCCGTCTGGTGGGCTCATGCTGAAACCCGCAGGT GAGCTGCT TTAACCA SDLDLSKDKELRIDIEK GCCGATTTGTCGAAGAGGACATATGGGGTAACCCATAGAACCTAGGCA TACGGTTA TTCTGTG ATKANRESADGRVIQ GGAGTAATCCCTTAGCTGGGGGGGTCAGCTGGTGGGTCCTGTCAATTTA ACCCGTAG ACGTCG SVADNEIDPDTFQW TCCTCCCCTCCATGCCAGAGCCGGTCCGAGGTTAGAGACGGACTCATTT ACAACTCC CTGATG KELLLGQVRGFPRVD TCTCCTTTTTATATGTCATGATGGCCGATTATAATAATTCTGTTGATCATG GGCGGCC TCTACAT ENSELFDLDDKLTSEL CACTTGAGGATGATACTAGATTTATATTTGCGCGTGATTCAGTACTTGCG AAACTCAG GTACCA SSDSPVWNDNCELIV AGAGTATGTGGACATTTCGATAACTTGAAGTGTGAGTTATGTGGAGTGC AAAACGGC GTGGCA SDLCHVLCKNKYELG TCCTTACGTCCCTTCAGGGCGTTCGGGAGCATTGCCACCGGTCGCACCA CTTACTAC GAATGC RQHHVRKGKRHRGI TAACTTGGATTTGACTTTTCAATGTACTAAATGCGATAAAGGGTTTTCCT CAGGGCCA TGACTA HHKREKFRECQKIFR CGTATAGGGGAATTTGTTGTCATTTTTCGAAGTGTAGAGGAGCTCGTAT TCCCTTGC AGAACA KSPRKLAEYLYRDKDL ATCTGTTTCAGAGGGCCCCCTTAGTTGTTCAGAATGTGAGAGGAAATTT GGTTGCGC AAATAA SHISKDVSTPQGIEQY GACTCTAAAAGAGCTCTTTCAACCCACGAGAGGCATATGCACCCAGGGA TAAGACCT TTTTAAA YSQLWGKPELLESNT TTAGGAATGCTAAGAGGCTCAAAGATTTTAATCCGCGGGGGGGTGGTA TACTGTAT CAATAG TEEMLPSSSLFDCLPP AAACTATTCACGGCAATACTAAGTGGACCGAAGAGGAGGTCCAATTGCT GTACTACG ACAGCA ITPEEVEGRIHKIRPSS TGTATCACTTAGCAAGAGATTTGAAGGGTATAAGTCTATCAATAAAGAA GACTACAG CTAAGA APGLDGVGKIHLVGK ATTAGCCTTATTTTAACATCAAAAACTTGTAAACAAATTTCTGATAAACG TTGAGTAG AATTCTG GITLVLAKLYNLLFLTG TCGCTATTTAAATTTGCATAATGGAAATGGGGGGCTAGCGGCAGCGGA CGCGGGA CAAACT GYPECWKRNRTVFIP AGCTGTATTAGTATTTTGTGATGATAGTCACCTTGAAGTTACTGACAGCG GAGCACTC GGAACG KIGKDLSEVGGWRPL ATGGGGCGGTGCTTTCTGAAATCATGGATGAGGAATATTATCAGTCATC GATGTGAG TGGCCC TIGSLLARMYSAFLER GCTAACTATGAGGTCAAGCATCGTACATGGCGACATTGGTCGTGAGGTT GGTAGCAA ACGGTA RIRRVTSLSSSQRGFT CAAGGGAAGGATTTGGTCCGGATCCCGCCTGATAATAGTGTAATGGGG CTTGTTGC GGCCAA NIQGCHVNLTILKEGI AATTGTGTTGTTCTACTTAGAAAACTTGCTACTGAAAAAAGAAGTGATTT TTCTTCGA TATACC RQAKVKNGGVIVSV GGACTTGAGTAAGGATAAAGAGTTGCGCATTGACATTGAGAAGGCGAC CTGTCTCC GGAAAG DIEKAFDTIPHSVIFSR GAAAGCTAACAGAGAAAGCGCAGATGGTAGGGTAATTCAATCTGTGGC CTGAAGAG GGAAAA LASQGVPPLLRKIISN TGATAATGAGATCGATCCCGATACTTTTCAATGGAAAGAATTGCTTCTTG TTCCGAAG TGACAA MYKDVYTVIEGQCIPI GGCAGGTACGGGGGTTTCCACGAGTTGATGAGAACTCTGAATTATTTGA AGGTGCCG ATCCCCC KRGVKQGDPLFPLLF TTTGGATGACAAATTGACTAGTGAATTATCTTCCGACAGCCCCGTGTGG CAAGTCCA CTTAAA NIAIDPVLRSLEEFQG AATGACAACTGCGAACTTATTGTGTCGGACTTATGTCACGTCTTATGCAA TGGGGGC (SEQID GLPLGNSAIKILAFDD AAATAAATATGAGTTAGGTAGGCAGCATCACGTTAGGAAAGGAAAGAG GACCCGGC NO: DIILGASSAGQAQQ GCATCGGGGTATACATCATAAGAGGGAGAAATTTAGGGAATGCCAGAA TGTGGGA 1370) MVDMLGIGLTSCGL GATATTTAGAAAGTCCCCCAGAAAGCTTGCCGAATATCTCTACCGCGAT GTTATCCT GVSHRKCFGFQIVNK AAGGACCTGTCTCATATCAGTAAAGACGTTAGCACTCCGCAGGGTATAG GCACCCGG NKTWAIVDPMITLN AGCAATACTATTCTCAGTTATGGGGGAAACCTGAATTACTAGAATCCAA GAAGCCCT GSSLPFSGPEDRLPYL CACAACTGAAGAAATGTTACCTTCCAGTTCCTTATTTGACTGTCTTCCTCC CAGTGGAT GVDTNPWDRKSRYD CATCACGCCGGAGGAGGTTGAGGGGAGAATCCATAAGATTCGACCGTC AAATACTC AGQRLISAAKRGSQL TTCCGCCCCAGGGTTGGATGGGGTAGGCAAGATCCATCTTGTCGGTAAA CAAAATGG SLKPQQKINLITTFLLP GGGATAACCCTGGTCCTTGCTAAACTCTACAACTTGCTATTTTTGACTGG CTTAGCCA KFLYILIEDPPSPAYLK GGGATATCCTGAGTGTTGGAAGCGGAACAGGACCGTCTTCATACCGAA CCTCACGT SIDHDLRQIYKNILHL GATCGGGAAAGATCTGAGTGAGGTGGGAGGATGGAGACCGCTGACAA GGAGAGA PNCVSTAFMYSPKRD TTGGCTCCCTATTAGCTCGGATGTATTCAGCCTTCCTAGAGAGGAGGATT GGGGAATT GGLGLPRLSCLVPLA AGAAGGGTGACTTCCTTGTCCTCGTCACAGAGGGGCTTTACGAATATTC AGGGGTTT HIKAGIKLGSLQDSLV AAGGTTGTCACGTTAATTTGACTATCCTCAAAGAAGGCATTCGACAGGC ACTCCTAA REITTSDRFVRTMGS CAAAGTTAAGAACGGGGGAGTTATAGTTTCAGTAGACATAGAGAAGGC GGGCCGAT VSHSIGASWPLTLQD TTTTGACACAATCCCCCATTCTGTAATTTTTTCCCGCTTGGCGTCTCAAGG GCCTCCAA IYKLKSALKRREAKA TGTTCCTCCCCTTCTTAGGAAAATTATTTCCAATATGTATAAGGATGTTTA TCGGATTT WESCVSQGQGAAQF CACCGTGATTGAGGGACAGTGTATTCCAATAAAGAGAGGAGTTAAACA CCCAGGGC RGDSIGNNWLHNPG AGGGGATCCTCTTTTTCCACTACTTTTCAATATTGCCATCGATCCTGTGTT AACGGCTA TFRPGQYIEALKLRAN GCGCAGTTTGGAGGAGTTTCAGGGGGGTCTCCCACTTGGTAACTCGGCT TTATCAGC STGVRVNLKRSGYNV ATTAAGATCTTGGCCTTTGATGATGATATTATATTAGGAGCTAGTTCGGC CGGCTAAG PITCRFCKDIPETQAH GGGTCAGGCGCAGCAGATGGTGGATATGTTGGGAATAGGACTTACTTC TTCGGACC VLGLCPKTKGMRILR GTGTGGGTTGGGAGTATCCCATAGAAAATGTTTTGGATTTCAAATCGTG CTCATCTA HDSIVNRVRDKLKTK AATAAAAACAAGACCTGGGCAATTGTTGACCCAATGATCACACTCAATG CCGATGGG SPVALMHEQNFTVE GTTCTTCATTACCCTTCTCAGGGCCAGAAGACAGGCTGCCTTATCTCGGG ATACTCTC EGQVFKPDIVTILGEV GTGGATACAAACCCATGGGATAGGAAATCACGGTATGACGCTGGACAG TCTCCCCC GYVIDVTVRYEDRDYI AGACTTATTTCGGCTGCCAAACGAGGCTCTCAACTATCCCTTAAGCCTCA CGCCTGCG KDASVEKIRKYEALKG GCAAAAGATAAATTTAATAACAACTTTTCTTTTGCCGAAGTTTTTGTACA TGATTAAT YLKDLYPQLNKVEVLP TTCTGATTGAAGATCCGCCTTCGCCTGCCTATCTGAAATCCATCGATCAT TGGGAATG LVFGSRGAVPGSTVH GACCTGAGGCAGATATATAAGAATATTTTGCACCTCCCTAACTGCGTGTC CACGAGG NMGLLGFTKREMVH AACCGCCTTTATGTACTCTCCCAAGAGGGATGGTGGCTTGGGGCTACCG GCGGGTG ISRKVITDSLIIISNFLE CGTCTGTCCTGTCTTGTTCCTCTTGCCCATATAAAAGCCGGGATCAAGCT ATTCTGAC VY(SEQIDNO: TGGTTCCCTCCAAGATTCTTTGGTTAGGGAAATTACCACCTCAGACAGGT AAGCCCAG 1492) TTGTACGTACGATGGGTTCGGTATCTCACTCCATTGGGGCATCATGGCCT CCCAAAAG CTAACGTTGCAGGATATCTATAAACTTAAGTCCGCACTGAAACGTAGGG TTCTTACC AAGCTAAGGCATGGGAGAGTTGTGTGTCTCAGGGACAAGGAGCAGCG ACGACGTA CAATTTCGAGGGGACTCAATTGGAAACAACTGGCTCCATAATCCGGGAA CCCGGGG CCTTTCGACCTGGACAGTATATCGAAGCACTTAAGCTTCGTGCTAATTCA GTGTCACC ACAGGGGTTAGAGTCAACTTGAAGCGTTCAGGCTATAATGTTCCAATAA CGAACTAA CTTGTAGGTTCTGTAAGGATATACCTGAAACGCAGGCTCATGTTCTTGG TATATTTTC ACTTTGTCCAAAGACCAAGGGAATGAGAATACTAAGGCACGACTCAATT CACCGATC GTGAATAGGGTGCGAGATAAACTTAAGACTAAGTCGCCCGTTGCTCTCA GGTTCGCT TGCATGAGCAAAATTTCACTGTTGAGGAAGGCCAGGTGTTCAAACCAGA GTGGGGG TATAGTGACGATTTTGGGAGAGGTGGGTTACGTAATTGATGTGACGGT GATGTCGG ACGGTATGAAGATAGAGATTACATCAAGGATGCCTCTGTGGAAAAAATC GGCCTGGT AGGAAATACGAGGCCCTTAAAGGGTACTTGAAGGATCTGTATCCACAGC GGCCAGG TGAATAAGGTAGAGGTATTGCCTCTTGTCTTTGGTAGCAGAGGGGCCGT AACCGTCA GCCAGGTAGTACTGTTCACAACATGGGATTGTTAGGTTTTACGAAACGT GCCGCTCT GAAATGGTCCACATCTCTCGCAAAGTTATTACCGACTCACTGATAATAAT TTGAGTGT CTCTAATTTTTTAGAGGTTTATTGAGTTAGATATATATAACGACTGATTCT CCTGCGTA TCCCTAACTAAAATATAATTTGGAATGTGCATTACTTATACTTTATTGTAT ACCCGGAG TTATTGACATCCGTAGTAGTCTGTTTAGTGGATTTAGATCGTTAACCATT GCGGTGAC CTGTGACGTCGCTGATGTCTACATGTACCAGTGGCAGAATGCTGACTAA GTCGAAG GAACAAAATAATTTTAAACAATAGACAGCACTAAGAAATTCTGCAAACT GGCTAGGC GGAACGTGGCCCACGGTAGGCCAATATACCGGAAAGGGAAAATGACAA TAGCACGG ATCCCCCCTTAAA(SEQIDNO:1124) TCCAGACC CCTGAACG CCTGGGGA GACGGGC CCACCACC TGTGTAGG AGTGGGTC CTTTACCT CATTTGAG GTGTCTCC TCGGTCTA GTAGGGG GCGAAGG ACGAGGG ATGCATCC GTCCCTCC AGGATGCT GGTGGTTT CCGTCTGG TGGGCTCA TGCTGAAA CCCGCAGG TGCCGATT TGTCGAAG AGGACATA TGGGGTAA CCCATAGA ACCTAGGC AGGAGTA ATCCCTTA GCTGGGG GGGTCAGC TGGTGGGT CCTGTCAA TTTATCCTC CCCTCCAT GCCAGAGC CGGTCCGA GGTTAGAG ACGGACTC ATTTTCTCC TTTTTATAT GTC(SEQ IDNO: 1247) R2 RaR2 FJ461304 Rhyncho- CAGAACGTGGAGAAACGGAATAACTACCCAGATCCGTTGGTTAACCGG CAGAACGT TAGAAA MSNYNETNTSGGDN sciara TGGCAAAGTTAATCAAGGTTGCCATAGGCTTAATAACCCTATGGAAATG GGAGAAA TGTGTG PRMATQTTGSLSSGP americana TTTCCACACACCTCCACGTGGTGCCTGCCGGAAATTGTTCTAGGGTGAA CGGAATAA CGATAA INQHTCELCCRTFGT CAAGCTAAGTIGTGAGAAACGGGCTCCACCACAATATGGAGCCTGCCA CTACCCAG GGTGTG RAGLGQHVRKTHPIE GGGCGCGAGACTCAGGACTCTCCATGTACAAAGTGGTTAGTTGCAAAA ATCCGTTG AATAGA SNQSINVERKKRRWS AGAGTGCGCCTAGCATGACTGATAATTTTTCACTGAAATAACGTTGAAC GTTAACCG AGGGTT PEEIRRMANMEAQA TTTATCTGTGTCATGTGCACAACACTATGGTGTCTGATCAAGCACCATCA GTGGCAAA CACCAA TINNIKHLTQYLATYL GTGGTGGACCTGCTAATGTATTAGTAGAACGTGTCCAGGCGATAATGCA GTTAATCA GAGGGA PQRTLNAIKGRRRDA CACACGGCCTCCGGGCCCATCGCTTTTTTTGAGATTCCCTAGAAACTTCA AGGTTGCC GACCTA EYKELVTGIIANLRSN GTGTGTGCGACAACTGTATAACCCATAAGGGATGGACAAAGGTTATACT ATAGGCTT GTTTGG SSTQQTNQVCNESE AGGGGGTAAAAACCCTAATCGGCTAATGGCAAATGGGATGTAGAAATG AATAACCC ACCTCA MSQRSKILQSIRESVR CCAAAGATTACTCGCACCGAATAATGGTGGCCGAAAAGCGGGTAATCG TATGGAAA GAAATG DLRSRRNKYAKALQE AATGAAATGGTAATGCTGTAGCGGAAACATGATCACATTCTGTGACAGT TGTTTCCA GGGTCA LGEAALCGKMLNEE AAACCATTAGACCTAGGGGGAACTATGATTAACAAGATACCAGCTTACA CACACCTC TAGGAG QLIHCIKSMFNTAKC TGGAAGCAATGAAATGTAAGTCACAGTAGTGATAAGTGGTGAAGAGTC CACGTGGT TGATAG PKGPRFRKTATHSGT TTGTAATCACCGTAACTAGGCCAGGTTCTGGGGATGCCATGAACTTAGG GCCTGCCG GTTGTA NKQQRQQRYARVQ GGGAGTATGGTTAGCAGATCTACCAGCTAACACTATTACTGATAATATG GAAATTGT AAGCCG KLYKMNRKVAAKMV TAAGCCGCAGTAGCGCTAAGTGGTGTACAGATTTGCAACCACCGTAACT TCTAGGGT TTGGGG LEETDKIQIKLPDHDP AAGTTCTGTTTCGATGGACTAGGGGGAATCATGATTAACAAGATACCAA GAACAAGC AATCCG MFKFWESEFKEGEG CTTACATGGAAGTTATGAAATGTAAGTCACAGTAGTGATAAGTGGTGAA TAAGTTGT GCTACA MPERMPKDLKESPD GAGTCTTGTAATCACCGTAACTAGGCCAGATGTAGTCAAAGCACATGTT GAGAAAC CATGGT LKAIWDPVTEEEVRK TAGGGGGAACAAGGTTAACATTGGTAAAAGACCAATGCAACCTCCGTA GGGCTCCA ATCTCA AKVANNTAAGPDGI ACTAAACGTGAAGGCACAAAACTAAAAGTCCAGTGGATGACAGGTGAG CCACAATA GGAGCC QPKSWNRISLKYKTLI GTCATCACTGGACCCAAATGTTTTAAGCTCATCATAACAACACGGTGAA TGGAGCCT ATTCATG YNLLLYYEKVPHKLKV AAATCCAGCATTTATTTGCCTGATTGAGTAGCTTCCACACTATTCCAAAG GCCAGGG CGCTGA SRTVFIPKKKDGSSDP CCGAACCTATCTGGGTTTTTCTTGAAAGGCCGTATAGGGCATATGTCGA CGCGAGAC TCTCATT GEFRPLTICSVVLRGF GAAATAAGTCCAAGGTGAGGTAGTGTGGCCCTGTACCCAGGGGTAAGG TCAGGACT AAGGCG NKILVQRLVSLYKYDE TACTATACGGTCGAGTGGCTCAGTAGGCCTAACTAGCCACTGAGTCACT CTCCATGT TAATAA RQTAYLPIDGVGTNI ATAATGACTAGTTATGAGCAACTACAATGAAACAAACACGAGTGGTGG ACAAAGTG ACTGTG HVLAAILNDSNTKLSE GGATAACCCTCGGATGGCCACTCAAACCACTGGATCGCTATCCAGTGGA GTTAGTTG AAACAG LHVALLDITKAFNRLH CCAATTAACCAGCACACATGTGAACTATGTTGTAGGACATTTGGTACAC CAAAAAGA ATCCTG HTSIIKSLVGKGFPYG GTGCCGGATTGGGACAACATGTGAGAAAAACCCATCCAATTGAGTCCAA GTGCGCCT ATAATG FITFIRRMYTGLQTM CCAAAGCATTAATGTAGAACGAAAGAAAAGAAGATGGTCACCTGAAGA AGCATGAC CCGTGC MQFEGHCKMTQVN GATAAGACGTATGGCGAATATGGAAGCCCAAGCTACCATAAACAACATC TGATAATT TACCAA RGVYQGDPLSGPIFLL AAACACTTAACTCAATACCTTGCAACTTACCTTCCTCAAAGAACTTTGAA TTTCACTG ATGATG AIEKGLQALDKEVGY CGCTATTAAAGGAAGGAGAAGAGATGCTGAGTATAAAGAACTGGTGAC AAATAACG TAACGA DIGDVRVNAGAYAD CGGCATTATAGCAAATCTTCGGAGTAATTCCAGTACACAACAAACCAAT TTGAACTT GGCGGA DTDLVAGTRLGLQD CAGGTCTGTAATGAAAGTGAAATGTCACAACGCAGTAAGATACTGCAAA TATCTGTG AATAAA NINRFSSTIKQVGLEV GTATACGAGAAAGCGTACGTGACTTGAGAAGTAGACGTAATAAATATG TCATGTGC ATTAATC NPRKSMTLSLVPSGK CGAAAGCATTACAAGAGCTTGGGGAAGCAGCATTGTGCGGTAAAATGT ACAACACT TGGGGC EKKMKVETGKPFRA TGAACGAAGAACAGCTAATACATTGTATAAAGTCAATGTTCAACACCGC ATGGTGTC GTTCTG NDVPLKELSINDFWR TAAGTGTCCGAAAGGACCAAGATTTCGCAAAACTGCAACCCACAGTGGA TGATCAAG CGGAAT YLGISYTNEGPERLSL ACAAATAAGCAACAACGGCAACAACGATATGCTAGGGTGCAGAAGTTG CACCATCA GACTAC TIEQDLERLTKAPLKP TACAAAATGAACCGAAAGGTTGCCGCAAAAATGGTGTTAGAAGAAACG GTGGTGG TAAATAT QQRIHMLNAYVIPKY GATAAGATACAGATAAAGTTACCGGACCATGATCCGATGTTCAAGTTTT ACCTGCTA AGCGAT QDKLVLSKTTAKGLK GGGAAAGTGAATTTAAGGAGGGCGAAGGCATGCCGGAAAGGATGCCC ATGTATTA GCTATAT RTDRQIRQYVRRWL AAGGATCTGAAGGAATCGCCCGACTTAAAAGCGATATGGGATCCGGTA GTAGAACG ATACAA KLPHDVPIAYLHAPV ACTGAAGAGGAAGTTAGAAAGGCAAAAGTGGCGAATAACACCGCGGC TGTCCAGG ACGACT KSGGLNIPCLQYWIPL AGGTCCTGATGGAATTCAGCCGAAATCATGGAACAGGATAAGTTTAAAA CGATAATG GATGGT LRVNRVNKITESQRS TATAAGACGTTAATATACAACTTACTCCTGTACTATGAAAAAGTTCCACA CACACACG AACACC VLAAVGKTALLTSTVY TAAGCTTAAGGTGTCACGAACTGTGTTTATTCCTAAGAAAAAAGATGGA GCCTCCGG GGCCTT KCNQSLATLGGNPT TCCAGTGATCCTGGTGAATTCCGACCTCTTACAATATGTTCAGTGGTCCT GCCCATCG A(SEQ MLAYRTYWEKELYAK TAGAGGTTTCAATAAGATCCTTGTGCAGAGGCTTGTATCCCTATATAAAT CTTTTTTTG IDNO: VDGKDLQNARDDKA ATGATGAAAGGCAAACAGCTTATTTGCCTATAGATGGTGTGGGCACGA AGATTCCC 1371) STRWNGMLHSDISG ACATACACGTGCTTGCGGCGATACTCAATGACTCGAACACAAAACTGAG TAGAAACT EDYLNYHKLRTNSVP TGAACTGCATGTAGCGCTATTGGACATTACGAAAGCATTCAATCGCCTA TCAGTGTG TKVRTARGRPQKETS CATCACACATCGATAATCAAATCCTTAGTGGGAAAGGGGTTCCCTTATG TGCGACAA CRGGCKSTETLQHVV GATTTATTACTTTCATCAGAAGAATGTATACGGGCTTACAAACGATGAT CTGTATAA QQCHRTHGGRTLRH GCAATTTGAGGGCCACTGCAAAATGACACAAGTGAACAGAGGAGTCTA CCCATAAG DRIVGLLQHELRRDY CCAAGGAGACCCTTTAAGCGGGCCAATCTTTTTGTTGGCTATTGAAAAA GGATGGA NVLAKQELKTGIGLR GGCTTACAAGCTCTTGATAAGGAAGTTGGCTATGACATAGGGGATGTG CAAAGGTT KPDLVLIKDDTAHIVD AGAGTCAATGCAGGTGCTTACGCAGATGATACAGACTTGGTTGCAGGA ATACTAGG VQVARCSKLNESHVR ACGAGACTTGGCTTGCAAGACAACATAAACAGGTTTTCAAGTACTATTA GGGTAAA KRSKYDKKEIEVEVKS AACAGGTTGGACTTGAGGTAAATCCAAGAAAGTCAATGACATTGTCTCT AACCCTAA RYRVSKVMYEACTIS GGTACCTTCGGGGAAAGAGAAGAAAATGAAAGTAGAAACTGGCAAACC TCGGCTAA YKGIWDKQSVMSM TTTCAGAGCGAACGATGTTCCTCTGAAGGAACTGTCAATTAATGATTTTT TGGCAAAT RRLGVSEYCLFKIVTS GGAGATACCTGGGCATAAGCTATACGAATGAAGGGCCTGAAAGACTGA GGGATGTA TLRGTWLCWKRFN GTCTTACAATAGAGCAAGATCTAGAACGCCTAACTAAAGCTCCATTAAA GAAATGCC MITSVRS(SEQID GCCACAACAACGGATACATATGCTTAACGCATATGTAATACCTAAATATC AAAGATTA NO:1493) AAGACAAACTTGTCTTGAGTAAAACCACGGCAAAAGGGCTGAAGCGAA CTCGCACC CAGATAGGCAGATAAGGCAATATGTAAGACGATGGCTAAAGTTACCAC GAATAATG ATGATGTTCCAATTGCATATTTGCATGCACCAGTTAAATCTGGTGGCCTA GTGGCCGA AACATTCCATGCTTGCAATACTGGATACCATTACTTAGAGTAAACAGAGT AAAGCGG CAATAAGATTACAGAAAGTCAACGCAGTGTACTTGCAGCGGTTGGTAAA GTAATCGA ACAGCCCTGCTAACGTCAACTGTTTATAAATGTAATCAGTCTCTAGCAAC ATGAAATG ATTGGGTGGAAACCCAACAATGCTTGCATATCGAACGTATTGGGAAAAA GTAATGCT GAGCTATATGCAAAGGTCGATGGGAAAGACCTACAAAATGCGCGGGAT GTAGCGG GATAAGGCTTCGACGAGGTGGAATGGCATGTTGCATAGTGACATTAGT AAACATGA GGAGAAGACTATTTGAATTATCATAAGCTTCGAACAAATAGTGTACCCA TCACATTC CTAAAGTAAGAACGGCAAGAGGACGTCCTCAAAAGGAAACGTCTTGCA TGTGACAG GAGGTGGTTGTAAAAGCACGGAAACGCTGCAACATGTAGTTCAGCAGT TAAACCAT GTCATAGAACACATGGGGGAAGAACTCTAAGACATGACAGGATTGTGG TAGACCTA GTCTTCTCCAACATGAACTACGCCGAGACTATAATGTCTTAGCTAAACAA GGGGGAA GAACTAAAAACGGGCATAGGGTTGCGCAAACCGGATTTAGTTCTTATAA CTATGATT AAGATGACACTGCACACATCGTTGACGTGCAAGTCGCAAGGTGTAGTAA AACAAGAT ACTGAATGAAAGCCATGTAAGGAAGCGATCCAAATACGACAAGAAAGA ACCAGCTT AATAGAGGTAGAAGTAAAAAGTAGATATCGTGTAAGTAAGGTGATGTA ACATGGAA TGAAGCTTGCACGATATCATACAAAGGTATTTGGGATAAGCAGAGCGT GCAATGAA GATGAGCATGAGAAGACTTGGAGTAAGTGAATACTGTCTTTTCAAGATT ATGTAAGT GTCACATCAACACTAAGAGGAACATGGCTTTGTTGGAAACGGTTCAATA CACAGTAG TGATCACTAGTGTTCGATCATAGAAATGTGTGCGATAAGGTGTGAATAG TGATAAGT AAGGGTTCACCAAGAGGGAGACCTAGTTTGGACCTCAGAAATGGGGTC GGTGAAG ATAGGAGTGATAGGTTGTAAAGCCGTTGGGGAATCCGGCTACACATGG AGTCTTGT TATCTCAGGAGCCATTCATGCGCTGATCTCATTAAGGCGTAATAAACTGT AATCACCG GAAACAGATCCTGATAATGCCGTGCTACCAAATGATGTAACGAGGCGG TAACTAGG AAATAAAATTAATCTGGGGCGTTCTGCGGAATGACTACTAAATATAGCG CCAGGTTC ATGCTATATATACAAACGACTGATGGTAACACCGGCCTTA(SEQIDNO: TGGGGAT 1125) GCCATGAA CTTAGGGG GAGTATGG TTAGCAGA TCTACCAG CTAACACT ATTACTGA TAATATGT AAGCCGCA GTAGCGCT AAGTGGTG TACAGATT TGCAACCA CCGTAACT AAGTTCTG TTTCGATG GACTAGG GGGAATCA TGATTAAC AAGATACC AACTTACA TGGAAGTT ATGAAATG TAAGTCAC AGTAGTGA TAAGTGGT GAAGAGTC TTGTAATC ACCGTAAC TAGGCCAG ATGTAGTC AAAGCACA TGTTTAGG GGGAACA AGGTTAAC ATTGGTAA AAGACCAA TGCAACCT CCGTAACT AAACGTGA AGGCACAA AACTAAAA GTCCAGTG GATGACAG GTGAGGTC ATCACTGG ACCCAAAT GTTTTAAG CTCATCAT AACAACAC GGTGAAA AATCCAGC ATTTATTT GCCTGATT GAGTAGCT TCCACACT ATTCCAAA GCCGAACC TATCTGGG TTTTTCTTG AAAGGCC GTATAGGG CATATGTC GAGAAATA AGTCCAAG GTGAGGTA GTGTGGCC CTGTACCC AGGGGTA AGGTACTA TACGGTCG AGTGGCTC AGTAGGCC TAACTAGC CACTGAGT CACTATAA TGACTAGT T(SEQID NO:1248) R2 YURE- Ciona CCAAAATTACTTCCAGCACCTCCACAGCAGACGAACGAAGAAAGAAGAC CCAAAATT TAAGAT MAGHKITMSEGKLLE 2_Cis savignyi TTACGAAATAAGAATAAGCAGTTTAAAGACGAAGCAGACGAACACCAC ACTTCCAG CCGCGG VAVRYGGVRNVSYE TCCACCAACGACGCTCTGCAGCTACACCACCACCATCTCGCCGCAACAA CACCTCCA CTGTGG CPVPDCTKTFSQANN GAAGAATTTTCCGCTGCTCTCCACTCTGCTCCAACACCACCTCTCCTGCTC CAGCAGAC CGCCGA LIRHLNNFGNTKHRA TGTGGACTGCTGCCTTGCTGCTGGACCAACCTCTACCCGAAGGAACCCT GAACGAA ACGAGC HNFTYFFTCEKCKIQI TCGAACCCAGCAGCAAGGTACGTGTCACCACCTCTCCACAGAGCCAAGG GAAAGAA ACCTGC HSNTKHNISNHYKQC CCAGAGTGGATAGAGCAGCGGCCTCTACAACCAAGTTCTCCACTCCGAC GACTTACG CCATTCT CATGGGPSCETGQYF GACAAAACACCTGCCTCGTGGCCAGAGTCCTGCCGAAGAAGAATCTTCG AAATAAGA TCTTGTA CPACEQAGLGNLESA ACCCCAACACCGTCTCGGCAGGGCTCCAAGCAACGACATCAGCCACGAG ATAAGCAG GGGACT LRHFQSSHPEFNLPP TGCCCACCGGAGTGGACGCGGCGAACCCGAGGACCGCTGTCGCCAAGA TTTAAAGA TTTTCAC RSQFSKSHPNSYTLSL ACAATAACATCACCGCCTGCAGCAGCTACCAAACAAAGGTTAGTCTCCT CGAAGCA CCTCACT KPKDHLMKILYSGPLT ACCTCACCTACAAACTCGTCTGAATAGACGCCCCCGCGTGGGAGCTAAC GACGAACA CCCCCCA PGQLVCPIKICLRSSA CTTGGTAGCTAGCTGCAGTGCCCTGCGACAGCGGCCTCGAAGAGCTGC CCACTCCA ATAGTTT ARLFHDVSKLRKHML AGAGCGTCAGCCTGCTTCGACCTCGCTTGTTCTCATTTCAACCTACTCCG CCAACGAC TTTTTCG VDHNRTLVYETTCGK TCCTGTGGAATCAAAAGAGCCCCACTAACAATTACATTCATAAAATCTAG GCTCTGCA TTTTTTC CLRPVDTSKNMRKTT CAAGAACGAAGAGAAGCGACCAACTTTTAATCCATAACTTTTAGATCTTT GCTACACC GTTTTTT SHFEKCSGESFISSPSP TTATTTATTACTGTTTTTAAGCCCTAAGCATATTGCCTTTTTTTAGATCTTA ACCACCAT CACCCCC IPQKTYKLDLPSTSTP TAATTATAAAAATAGATTCAAAGTTAACACCACCAGGCCGCTACAGAGC CTCGCCGC ACCCCA PPRKSPKLQPYKPIRT ATTTTATTTAATCAATTTGTTACCGACCTCCTGCTGCTTCTTTTTACTTTCT AACAAGAA CTCGCCT FKNPLTKSSQSKSDN CCAGACTACTACCCGGATACAACCCTTGGAAACGAGAGGAATGGCCGG GAATTTTC CTGGGC PPKPTPFFSPRTLERS TCACAAGATCACAATGTCGGAGGGGAAGCTGCTGGAGGTTGCTGTACG CGCTGCTC TGCACA ASWPALSEVVDPLPK GTATGGTGGTGTGAGGAATGTCTCGTACGAGTGTCCAGTCCCGGATTGC TCCACTCT TCCCACA LKEKHPSLPCALDKCP ACAAAAACCTTCTCACAGGCAAATAACTTGATACGGCATTTAAATAACTT GCTCCAAC CGTAGG PSPRIKPSTLVPPCHT TGGCAATACAAAACACCGAGCACACAATTTCACCTATTTTTTCACCTGTG ACCACCTC GACCTG ANNSPKPTSPESPSTL AGAAGTGCAAAATCCAAATTCACAGCAACACAAAACATAATATTTCAAA TCCTGCTC TTTATAT KPLPRPIRPSKPLEDW TCATTACAAACAGTGTTGTGCAACCGGCGGCGGACCCTCGTGCGAGAC TGTGGACT TATTTGC LTVRSVGPDREIVLNI GGGTCAATACTTCTGCCCTGCCTGCGAGCAAGCAGGGCTGGGGAACCT GCTGCCTT CTTTTAT GPRPRPGPAAGSRTT GGAGTCGGCACTCCGCCATTTTCAATCCTCCCACCCTGAATTTAATCTAC GCTGCTGG ATGTAC SPPSTAPAKRVAANP CCCCCCGGTCTCAATTTTCAAAATCTCATCCCAATAGTTATACCCTATCCT ACCAACCT CACTTTT IAAPLSGEPGATLDC TAAAACCTAAAGACCACCTTATGAAAATACTTTACAGTGGACCACTGACT CTACCCGA TAAATAT GQTGRKVQPPKKRP CCGGGGCAATTGGTATGCCCCATTAAGATCTGTCTCAGGTCCTCAGCTG AGGAACCC ATTTTTG TESAGSLPPPAEPAT CACGTCTTTTCCATGACGTGTCCAAGCTGAGGAAGCACATGTTGGTTGA TTCGAACC TACCCCA DLLTGREGLARLVEEY TCACAACCGAACCCTTGTGTACGAGACAACATGTGGCAAATGCTTGCGG CAGCAGCA CAAGAT HLSGDFGAFCRDLER CCTGTCGACACCTCGAAAAATATGCGGAAAACGACGTCGCACTTCGAAA AGGTACGT GCTTTTC WTALSSTNRRPKPRR AGTGTTCTGGCGAATCCTTTATCTCATCCCCCTCACCCATACCCCAAAAG GTCACCAC GCCAAA GRYNRGAAARATRN ACTTATAAACTCGACTTACCCTCAACCAGCACCCCTCCCCCCCGTAAATCT CTCTCCAC AAAAAA RGRDDRQDPQDRD CCCAAATTACAACCCTACAAACCAATTCGAACCTTTAAGAACCCCCTCAC AGAGCCAA AAATTTT DQGGPGPVTCGRPQ CAAGTCCAGTCAATCCAAATCCGACAACCCACCCAAACCTACCCCTTTTT GGCCAGA TGTATCA RYKRAAALRSAFGRD TCTCGCCAAGAACACTTGAGAGGTCGGCGTCGTGGCCCGCATTGAGCG GTGGATAG CATTTTT MKATVRRIIDGERGD AGGTTGTCGACCCACTGCCCAAGCTCAAGGAAAAACACCCCTCGCTGCC AGCAGCG ATATTTT ARCEIDPKTIEGRFRD ATGCGCCCTGGATAAGTGCCCACCCTCACCCCGGATCAAACCCTCGACC GCCTCTAC GTAAAA ELSPPVREGPECSLPP CTAGTCCCTCCCTGCCATACAGCAAACAATAGTCCCAAACCAACTTCACC AACCAAGT CACAGA WMAEAQAGEHAPS CGAATCACCCTCTACCCTAAAACCCTTACCCCGCCCCATCCGACCATCTA TCTCCACT TTTTTAT NDSQPGDAYDGPIT AACCCTTGGAGGATTGGTTAACTGTGCGGAGCGTGGGACCGGATCGGG CCGACGAC AAACTTT ALEVEMVLSTLNVGS AGATCGTCCTGAACATCGGTCCGAGACCCCGACCCGGCCCCGCAGCAG AAAACACC GCACTA APGSDGLSYGFWRA GCTCCAGGACAACATCTCCCCCGTCAACGGCACCCGCGAAAAGGGTTGC TGCCTCGT TTTTTAT LDPKGLVLSELFEVCR TGCAAACCCAATCGCAGCACCCTTGAGCGGGGAGCCGGGTGCGACCCT GGCCAGA ATAAAC IERRVPGPWKSSRVT TGATTGCGGCCAGACCGGGCGAAAGGTTCAGCCGCCAAAAAAGAGACC GTCCTGCC TTCGCAC LICKDAEGDLDDLGN GACAGAGAGTGCTGGGTCCCTCCCTCCTCCGGCCGAGCCAGCCACCGAT GAAGAAG TTATTTA WRPISICQTVYKIYAA CTCCTTACAGGGAGAGAGGGGCTGGCGAGGCTGGTTGAGGAATATCAT AATCTTCG AAATGA VLARRLQSWALDGG CTGTCGGGGGACTTTGGCGCTTTCTGTCGGGATCTCGAGCGATGGACG ACCCCAAC ATCGCA VISRSQKGFMPFEGV GCGCTTTCGTCGACCAACCGAAGGCCAAAGCCGAGGCGCGGCAGGTAC ACCGTCTC TCTTTTT YEHVFLLDSVVADAR AACCGCGGTGCGGCTGCCCGTGCCACCAGGAACCGGGGCCGAGATGAT GGCAGGG TATATAC ATRRSLAVCWLDLRN CGTCAAGATCCACAAGATCGAGACGATCAAGGAGGCCCCGGTCCGGTG CTCCAAGC ACCAAC AFGSVDHTTIVEALSR ACATGCGGGCGGCCGCAAAGATACAAGCGGGCCGCCGCGCTGAGGTC AACGACAT ACAAAC FGAPAGLVEMISDIYT GGCCTTCGGGAGGGACATGAAGGCGACCGTCCGTCGCATCATAGACGG CAGCCACG AGGATG GGSCRIRTRAGFTPDI AGAACGGGGCGACGCTCGCTGTGAAATCGACCCCAAAACCATCGAGGG AGTGCCCA TGCAGC PVGRGVRQGCPLSGI TCGATTTCGCGACGAGCTGTCGCCGCCGGTTCGGGAAGGGCCGGAGTG CCGGAGTG TCAGGG IFNLVMEVLLRGVEA CTCGTTGCCCCCTTGGATGGCTGAAGCCCAGGCAGGTGAGCATGCGCCA GACGCGG GAACCA NNACGYRLSCAGGA TCGAATGATAGCCAACCCGGCGATGCATACGATGGGCCAATCACGGCG CGAACCCG ATCCTGC SVRVLAYADDVALVG CTGGAGGTGGAGATGGTCCTCAGTACCCTGAACGTCGGCTCTGCCCCGG AGGACCGC GTCCCTC SSRAEXKIQLGVCERF GATCGGACGGCCTCTCCTATGGATTTTGGAGAGCTCTAGACCCGAAAGG TGTCGCCA CTAGCG AAWAGFSFNNKKCA ACTCGTCCTTTCGGAGCTCTTCGAAGTCTGTAGGATCGAGCGCCGGGTC AGAACAAT GCGGGA AMVLKHQRGGRRLL CCGGGGCCATGGAAGAGCAGCCGGGTCACCCTCATCTGCAAAGATGCA AACATCAC GGGGCG DSAPLRLCGEEVAILG GAGGGTGACCTCGACGATTTGGGGAACTGGCGGCCCATCTCCATCTGCC CGCCTGCA CCCACCT PDSFYKYLGAHTGYG AGACGGTGTATAAGATCTACGCCGCCGTGTTGGCCCGTCGACTGCAGA GCAGCTAC ACCCCC RQTGGQLVDRVERQ GCTGGGCGCTTGACGGTGGCGTCATCTCCCGAAGTCAAAAGGGCTTCAT CAAACAAA ACGCTC VVRLFTSFLTPTQKLS GCCGTTCGAGGGGGTATACGAGCATGTATTCCTCCTCGACTCGGTCGTC GGTTAGTC CTCTTGA ALKRIVLPAMSFHLR GCTGACGCGCGCGCGACCCGGAGGTCGCTGGCCGTGTGCTGGCTNGAC TCCTACCT GCACCA VRPCAEGHLRRLDNT CTCCGGAACGCGTTCGGCAGCGTCGACCACACCACCATAGTCGAGGCGT CACCTACA ACAGGG VRRCVKTALRLPKGS TGTCTCGCTTCGGCGCCCCCGCGGGCCTCGTGGAGATGATATCGGACAT AACTCGTC ACTCCCT CRAFFHTSPDAGGLG CTATACGGGTGGGTCCTGCCGAATTAGAACTCGTGCGGGATTCACCCCG TGAATAGA TCCGGA ITSVVAECDILTVTQA GACATCCCTGTCGGTCGTGGAGTGCGACAGGGGTGTCCCTTGTCCGGTA CGCCCCCG GCCCCT FKMLSSPDHLVSLVA TCATCTTCAACCTCGTGATGGAGGTCCTGCTGCGGGGCGTCGAAGCGAA CGTGGGA GCACCC KGRLGMHAARMGR CAACGCTTGTGGATACCGGCTCTCCTGCGCCGGCGGCGCGTCCGTCAGG GCTAACCT TCAACTT SETASACAMADYLSG GTGCTCGCGTACGCCGACGATGTGGCTCTGGTGGGCTCTTCCAGGGCCG TGGTAGCT TTCTTAT DSVMGHXSWKTGYR AGATNAAGATCCAGCTGGGTGTGTGCGAGCGGTTTGCCGCCTGGGCCG AGCTGCAG TTTTAAA MPADLWTATRAASR GTTTTTCTTTTAACAACAAGAAGTGCGCCGCCATGGTACTGAAACATCA TGCCCTGC AAAAAA RLSLRFSPQPQGEFG GAGAGGGGGTCGGAGGCTCTTGGACTCGGCGCCTCTTCGCCTTTGCGG GACAGCG AAATCA LESGTFKIAPRERRSL TGAAGAGGTGGCGATCCTGGGCCCCGACTCCTTCTACAAGTACCTGGGG GCCTCGAA TATATAT TRRLHHRQNLWWR GCGCATACCGGCTACGGGCGGCAAACGGGTGGACAGCTTGTCGATCGA GAGCTGCA TGATCTT NQWAALPNQGKTV GTCGAAAGGCAGGTCGTCAGGCTGTTCACCTCGTTCCTCACCCCCACCC GAGCGTCA GACGAC AAHSAYAASNNWVK AAAAACTCTCGGCCCTGAAGAGAATAGTCCTGCCCGCCATGAGCTTCCA GCCTGCTT GGGGGC GPSSLAPQALFFGLK TCTCCGGGTCCGGCCCTGCGCCGAGGGGCATCTCCGGCGCCTTGACAAC CGACCTCG TACATTC ARLNQMPTRSVKAC ACAGTGAGGCGCTGTGTGAAGACAGCGTTGCGCCTGCCGAAGGGGTCG CTTGTTCT AGCCCC YSRAPNYDKSCRRCG TGTCGGGCTTTTTTCCACACGTCCCCCGACGCAGGGGGCCTGGGGATCA CATTTCAA CAAAAA AEVETLPHVLNHCPK CGTCGGTTGTCGCCGAGTGCGACATACTGACGGTGACCCAGGCCTTCAA CCTACTCC CCCACCC SMKSILERHDSVLAE GATGCTGTCTTCACCGGACCATCTCGTCTCGCTAGTTGCCAAGGGCCGTC GTCCTGTG ACCATCC VLAAIPRGTFASVDV TGGGGATGCACGCCGCGCGCATGGGCCGGTCCGAGACGGCGTCCGCTT GAATCAAA CCAACG DRTSREHFRRVGEAL GCGCCATGGCGGACTACCTGAGCGGGGACTCGGTAATGGGGCACANGT AGAGCCCC AGTGCC RPDIVARRHDGSVVV CGTGGAAAACCGGATACAGGATGCCGGCCGATCTCTGGACAGCCACTC ACTAACAA GGGGCA ADVTCPFESCASALD GAGCTGCCAGCCGGCGCCTGTCCCTCCGGTTTTCCCCGCAGCCCCAAGG TTACATTC TTGAAG TAAARKIEKYDQLCA CGAATTCGGCCTCGAGTCGGGCACTTTCAAGATCGCCCCCAGGGAGCG ATAAAATC AGCTCC NLRQLYRKPVESHAL GCGCTCCTTGACCCGAAGGCTGCACCACAGGCAAAACCTGTGGTGGCG TAGCAAGA GGCACA VVGSLGSWGRTNNT GAACCAGTGGGCGGCCCTTCCCAACCAAGGGAAGACCGTCGCCGCCCA ACGAAGA ATTAGC ALAALGIRGAVRSRL CTCCGCCTACGCAGCCTCCAACAACTGGGTCAAGGGCCCGTCCTCCCTG GAAGCGA ACTTAG AKQLVNLSVEGSHNI GCCCCTCAGGCCCTGTTCTTCGGCCTTAAAGCGCGGCTGAACCAGATGC CCAACTTT CTTATTT WLRWSGGIPKDLVR CAACGCGCTCGGTCAAGGCCTGCTACTCAAGGGCGCCGAACTACGACA TAATCCAT ATTATTT (SEQIDNO:1494) AGTCCTGTCGTAGGTGTGGCGCGGAGGTGGAGACCCTCCCGCACGTGC AACTTTTA TTTGTCA TGAATCATTGCCCCAAGTCCATGAAATCGATCTTGGAGCGGCATGATTC GATCTTTT ACATTTT GGTGCTTGCGGAGGTCCTCGCCGCCATCCCTCGCGGCACATTCGCCAGT TATTTATTA TGTTTTT GTCGACGTCGACAGGACGTCCCGAGAACATTTCCGGCGAGTGGGCGAA CTGTTTTT TCAAATT GCTCTGCGACCCGACATAGTCGCCCGTCGACATGACGGATCCGTCGTGG AAGCCCTA TTTTCAC TCGCAGACGTGACGTGCCCATTCGAGTCCTGCGCGTCGGCCCTCGATAC AGCATATT CCCTCAC GGCGGCCGCGCGAAAAATCGAAAAATACGACCAGCTGTGTGCGAACTT GCCTTTTTT CCCCACC GCGGCAGTTATACCGTAAGCCCGTCGAGTCGCACGCACTGGTCGTAGGT TAGATCTT CTAATA TCCTTGGGCAGCTGGGGCAGGACCAACAACACTGCTCTGGCTGCACTCG ATAATTAT GGTCCC GAATCCGAGGCGCGGTTCGCTCGAGGTTGGCCAAGCAATTGGTCAACC AAAAATAG TCGGGC TCAGCGTCGAGGGTAGCCACAACATCTGGCTACGCTGGTCCGGCGGCA ATTCAAAG TTGGGC TCCCAAAGGACCTGGTCAGATAAGATCCGCGGCTGTGGCGCCGAACGA TTAACACC CCCTTTT GCACCTGCCCATTCTTCTTGTAGGGACTTTTTCACCCTCACTCCCCCCAAT ACCAGGCC TCGTGC AGTTTTTTTTCGTTTTTTCGTTTTTTCACCCCCACCCCACTCGCCTCTGGGC GCTACAGA TCGAGA TGCACATCCCACACGTAGGGACCTGTTTATATTATTTGCCTTTTATATGTA GCATTTTA AGCGTC CCACTTTTTAAATATATTTTTGTACCCCACAAGATGCTTTTCGCCAAAAAA TTTAATCA ACATCG AAAAAATTTTTGTATCACATTTTTATATTTTGTAAAACACAGATTTTTATA ATTTGTTA CCCCACT AACTTTGCACTATTTTTATATAAACTTCGCACTTATTTAAAATGAATCGCA CCGACCTC GACCAC TCTTTTTTATATACACCAACACAAACAGGATGTGCAGCTCAGGGGAACC CTGCTGCT GACCTTC AATCCTGCGTCCCTCCTAGCGGCGGGAGGGGCGCCCACCTACCCCCACG TCTTTTTAC CCCGAC CTCCTCTTGAGCACCAACAGGGACTCCCTTCCGGAGCCCCTGCACCCTCA TTTCTCCA ATTGGA ACTTTTCTTATTTTTAAAAAAAAAAAATCATATATATTGATCTTGACGACG GACTACTA GTCCTT GGGGCTACATTCAGCCCCCAAAAACCCACCCACCATCCCCAACGAGTGC CCCGGATA GGCGTC CGGGGCATTGAAGAGCTCCGGCACAATTAGCACTTAGCTTATTTATTATT CAACCCTT TCCCAG TTTTGTCAACATTTTTGTTTTTTCAAATTTTTTCACCCCTCACCCCCACCCT GGAAACG GTCGAA AATAGGTCCCTCGGGCTTGGGCCCCTTTTTCGTGCTCGAGAAGCGTCAC AGAGGA ACAGTC ATCGCCCCACTGACCACGACCTTCCCCGACATTGGAGTCCTTGGCGTCTC (SEQID CCAAGT CCAGGTCGAAACAGTCCCAAGTGATAGCACCTAATGCTCGACTTGTTTC NO:1249) GATAGC GGCCTGGGCCGCCGAGGATTCCCAGAACGACCATTCTTCTAAATAATAT ACCTAAT TTATATTTCAGAATAAAACTATATATATATCGTTGGGGGACTTGTCCCG GCTCGA CCTCGATACCGAGTGCTGCAGAGCGGCAAAATAAAGAAGAAACCGACG CTTGTTT TCGCTCTGCAGCCAAGGACCACCCAAACTCAAGCCAGCACCGTCGACAA CGGCCT CCAACATCCTCAAGTCGGCGGTTGCTGGAACAACTCATAACATCTTCAA GGGCCG NATAAATTATCACCCTGTGCAGCAGGAGGCCGTGCTTTTAAAACTACTCT CCGAGG GTAGTGGCTCATGATAATATTTCGCTCCTTTTTTGCCCCGTGTAAACTTA ATTCCCA GTNGATGCGAATAAAATCAGTTGAATCA(SEQIDNO:1126) GAACGA CCATTCT TCTAAAT AATATTT ATATTTC AGAATA AAACTA TATATAT ATCGTT GGCGGG ACTTGTC CCGCCTC GATACC GAGTGC TGCAGA GCGGCA AAATAA AGAAGA AACCGA CGTCGC TCTGCA GCCAAG GACCAC CCAAAC TCAAGC CAGCAC CGTCGA CAACCA ACATCCT CAAGTC GGCGGT TGCTGG AACAAC TCATAAC ATCTTCA ANATAA ATTATCA CCCTGT GCAGCA GGAGGC CGTGCT TTTAAAA CTACTCT GTAGTG GCTCAT GATAAT ATTTCGC TCCTTTT TTGCCCC GTGTAA ACTTAGT NGATGC GAATAA AATCAG TTGAATC A(SEQ IDNO: 1372) R4 Dong Bombyx GCTAGCTCCCTAAAATCCTACCTTACGTCCGAGGCGAACATCTGTCCACG GCTAGCTC TAAAAA MLRRGRIFLPASTKA mori TGGGGAGCGGAAACGCGTACTATCGAAACTTACGCGGCTAACAAGGTA CCTAAAAT CTAGCA GKTRGRMKWSREV AAGGTAACCCATTAATATGGAGACAAGACTAAAAAGAAAATTGAGAGG CCTACCTT TAATTAT NLFIMRTYYYVTKLET GCCGCTTCCCGGGGGCGATCGCCGGGGCACACCTGGAGCTGGCGCTGG ACGTCCGA TAACTCA DLTIYRKKLHEHFSLK GTGTTCCAGCATAGGATCCGTCAGCGGCGGAGAGTTGAGCAGGCGGGC GGCGAAC TAACTA YPNVIISQQRISDQKR TCTCGCCGGGGATTTACAACCCGAAAATGCTACAGCCGCCAACAGAAAT ATCTGTCC ATGTAT AIERNKLLSQETLDRL CACGATGTAGAAAGTAGGAGCAATAGCCCATGTGAACCTTACAGCCCG ACGTGGG ATTACTT KEEVRKQLEDEQTNN AGTACCGGTTCATACAACCCCTCGGTACAATCATCACCATCATCCTCGGG GAGCGGA GGCCAA VENEKLNSETYSHEYT TCATAGAGGCTCACCAACGTCGACTATGGATGTTGCGCAGGGGGCGAA AACGCGTA AAGCCC TLTPQTILTKKTQQHT TCTTTTTACCTGCGTCCACCAAAGCTGGCAAAACAAGGGGCCGCATGAA CTATCGAA GTATAT NIISSTQTSHSSTQTE ATGGAGCAGAGAAGTAAACCTATTTATCATGCGCACCTACTACTACGTC ACTTACGC ACAGTT SITLLLENEVDILNTNP ACTAAATTAGAAACTGATCTGACTATTTACAGAAAAAAGCTGCATGAAC GGCTAACA CCACCG TEGATQTQEVKDKFE ATTTTTCACTAAAATACCCTAATGTAATAATTTCACAGCAAAGAATATCT AGGTAAA GCTCTG TNLTMYSGMDPKAR GACCAAAAAAGGGCAATAGAAAGAAATAAACTACTATCTCAAGAAACC GGTAACCC TCGACA PPLPKLKYSSKLNELIR CTAGACCGACTAAAAGAGGAAGTGAGAAAACAGCTCGAAGACGAACAA ATTAATAT GACTGA LFNNDILVDYISPDTQ ACCAATAATGTTGAAAATGAAAAATTAAATTCAGAAACATATTCACACG GGAGACA ACTGAG LSDVHTLTYCTAVTIS AATATACGACACTTACTCCACAAACTATTTTAACGAAGAAAACACAGCA AGACTAAA AAAGGG EQLKYKIIAIEGNARH ACATACAAATATTATCTCTAGCACACAGACCTCTCATTCATCCACTCAAAC AAGAAAAT GAAACA KKNFKPPWQQRLEK CGAATCAATAACTCTGCTGTTAGAAAATGAAGTCGATATTTTAAATACTA TGAGAGG TATGGA DIAKLRADIGKLTQYI ACCCCACAGAAGGGGCAACACAGACACAAGAAGTTAAAGATAAATTTG GCCGCTTC AATAAT NNNRSKKVVQSVEQI AAACGAATTTAACAATGTACTCAGGCATGGACCCTAAAGCAAGGCCGCC CCGGGGG AATAA FKNTKIHTSHENGNK ATTGCCTAAACTTAAATATAGCTCTAAGCTAAATGAACTGATACGTCTAT CGATCGCC (SEQID KSQEFLDTLKQKLALK TTAATAATGACATACTTGTAGATTACATCTCACCAGACACACAACTATCA GGGGCAC NO: AHRLKRYNNSQKRK GATGTACATACATTAACATATTGCACCGCCGTAACTATTTCGGAACAACT ACCTGGAG 1373) NENTIFLTNEKLFYRN AAAATATAAAATTATAGCAATAGAAGGAAACGCGAGACATAAAAAAAA CTGGCGCT LIKPKTDRDNSNIDIP CTTCAAACCACCGTGGCAACAAAGATTGGAGAAGGATATAGCAAAATT GGGTGTTC TAEQLEMYWARLWE GAGAGCAGATATTGGTAAACTGACCCAATACATAAATAATAATAGATCT CAGCATAG NSAKHNDKANWITE AAAAAAGTAGTCCAAAGCGTTGAACAAATATTTAAAAACACTAAAATAC GATCCGTC EKERWDTIEEMQFD ACACATCACACGAAAATGGCAATAAAAAATCTCAAGAATTTTTAGACAC AGCGGCG DVTEEEITTITARLHN ACTGAAACAAAAATTAGCTCTAAAAGCCCACAGACTAAAAAGATATAAC GAGAGTTG WKSPGIDKIHNFWFK AACTCACAGAAACGAAAGAATGAGAACACTATATTTCTAACAAATGAAA AGCAGGC KLICLHKTIAKNLTDIIS AACTATTCTACAGAAACCTAATAAAGCCGAAAACTGATCGAGACAATAG GGGCTCTC GNQSIPEFIATGITYM TAATATAGATATACCAACAGCAGAACAATTGGAAATGTATTGGGCTAGG GCCGGGG IPKGDFSIEASQYRPIT TTATGGGAAAATAGTGCAAAACATAATGACAAAGCAAATTGGATTACTG ATTTACAA CLPTIYKILTTVITKKIN AAGAAAAAGAAAGATGGGATACAATAGAAGAAATGCAATTTGATGACG CCCGAAAA SHIEHNNILAEEQKG TGACAGAGGAAGAAATAACTACTATAACAGCTAGACTACATAACTGGAA TGCTACAG CRRGHMGCKEQLIID ATCCCCAGGTATAGATAAAATCCATAATTTTTGGTTTAAAAAACTAATTT CCGCCAAC STIMKHATTKNRNLH GCTTACACAAAACAATAGCCAAAAATCTAACAGATATTATCTCTGGAAAT AGAAATCA CTYIDYKKAFDSIPHS CAAAGTATTCCCGAATTCATAGCGACAGGAATCACTTATATGATACCAA CGATGTAG WLIQVLEIYKINPIIISF AAGGTGACTTCTCTATAGAAGCATCCCAATATCGACCAATTACATGCCTT AAAGTAG LRNIMTHWQTTLKLK CCGACTATTTACAAAATTTTAACAACAGTTATTACAAAGAAAATAAATTC GAGCAATA NPPNFVTTRQIAIKKG ACATATAGAACACAATAATATCTTAGCTGAAGAACAGAAAGGGTGTAG GCCCATGT IYQGDSLSPLWFCLAL ACGAGGCCACATGGGCTGCAAGGAACAGCTAATTATAGACTCAACCATC GAACCTTA NPLSHQLHNDRAGY ATGAAACACGCCACCACAAAAAATAGAAATTTACACTGTACATATATTG CAGCCCGA RIKQQDNTETIISHLIY ACTACAAAAAAGCTTTTGATAGCATCCCACATTCATGGCTGATCCAAGTC GTACCGGT MDDIKLYAKNDKEM CTAGAAATCTACAAAATTAACCCTATAATAATAAGCTTCCTACGCAATAT TCATACAA KKLIDTTTIFSNDISM CATGACACATTGGCAAACCACACTTAAATTAAAAAACCCTCCTAATTTTG CCCCTCGG QFGLDKCKTVHIIKGK TAACAACACGACAAATAGCCATAAAAAAGGGTATTTACCAAGGTGATTC TACAATCA VQPGDYTIDDTNTIT TCTCAGCCCTTTGTGGTTTTGCCTCGCCTTGAACCCACTATCCCATCAATT TCACCATC AMEPSDLYKYLGFQQ GCATAATGACCGGGCGGGATACCGCATTAAACAACAAGATAACACCGA ATCCTCGG LKGLDHITIKQSLTSEY AACAATAATATCACACCTGATTTATATGGACGACATTAAATTATACGCAA GTCATAGA KKRINAICKTKLSGKH AAAATGACAAAGAAATGAAAAAGTTAATAGATACTACCACGATATTCAG GGCTCACC LIKALNTYAIPILTYSF CAACGACATCAGTATGCAATTTGGACTTGATAAATGTAAAACCGTACAT AACGTCGA GIIKWSKTDIEQIERIT ATAATAAAAGGAAAAGTCCAACCCGGTGATTATACAATAGATGACACAA CTATGG RTTLTKHNNLHPKSAI ACACAATAACGGCGATGGAACCAAGTGACCTTTATAAATATCTGGGCTT (SEQID ERLTIKRQDGGRGMI TCAGCAGCTCAAAGGACTCGATCATATAACAATAAAGCAATCATTAACTT NO:1250) DIWHLWRKQIHSLKT CAGAGTACAAAAAACGTATCAATGCCATTTGCAAAACGAAATTATCTGG FFYIKSDLSEIHRAIAQ AAAACATCTTATAAAAGCACTGAACACCTATGCAATACCCATTCTAACCT NDNNYTPLNLKQKEL ACTCATTTGGAATAATAAAATGGAGCAAAACTGACATAGAACAAATAGA IDNTENLRNRNPQKD ACGCATAACAAGGACTACATTAACAAAACACAATAATCTTCATCCAAAAT MEENWKKKALHGR CTGCAATAGAAAGATTGACAATTAAAAGACAAGACGGGGGTAGAGGCA HPHDLSQSHIDSKAS TGATAGATATTTGGCATCTATGGCGTAAACAAATACACAGCTTAAAAAC NMWLKTGSLFPETE ATTTTTCTACATAAAATCAGATTTAAGTGAAATTCACAGAGCCATAGCAC GFLIAIQDQVINTKNY AAAATGATAACAACTACACACCGCTAAATCTCAAACAAAAAGAACTAAT RKYIIKDPTIRDDKCR AGATAATACAGAAAACCTAAGAAATAGAAACCCACAAAAAGACATGGA KCNTQPETIQHITGA AGAAAACTGGAAGAAAAAAGCGCTACATGGACGACACCCTCATGACCT CSTLTQTDYTHRHNQ AAGCCAATCTCACATAGACAGCAAGGCATCAAACATGTGGCTCAAAACA LANIIHQQLALKHKLI GGAAGTCTGTTCCCCGAAACGGAAGGATTTTTAATTGCCATACAGGACC QNTNTPYYNYKPQT AAGTAATAAACACAAAAAATTACAGAAAATATATTATTAAAGATCCCAC VLENDSCKLYYDRAIL TATTAGAGACGATAAATGCCGCAAATGCAACACCCAGCCAGAAACCATA TDRTIHYNRPDITLQD CAGCACATAACTGGAGCATGTTCAACCCTTACACAGACAGATTACACTC KNNKVTYIIDIAVPNT ACAGACACAACCAACTTGCTAATATTATCCATCAACAACTGGCTCTCAAA HNIQKTFTEKMTKYT CATAAATTGATACAAAATACAAACACACCGTACTACAATTATAAACCACA ELKEEIVRIWKAKKAY AACCGTTCTTGAAAATGACTCCTGTAAACTTTATTATGATCGCGCTATTCT IVPIIISTTGVVPNHIH TACCGATAGGACGATTCACTACAATAGACCGGATATCACTTTACAAGAT NSLKLLDLKDNIFISLQ AAAAACAATAAAGTCACCTACATTATTGACATTGCAGTCCCGAATACCCA KAAILNTCRIVRKFM CAACATTCAAAAAACGTTTACAGAAAAGATGACAAAATACACAGAACTT QLEENQTYYTQ(SEQ AAAGAAGAAATAGTTAGAATTTGGAAACAGAAAAAAGCATACATAGTC IDNO:1495) CCAATAATAATCTCAACCACTGGAGTTGTCCCAAACCACATCCACAACAG CTTAAAGCTTCTAGATTTAAAAGATAACATATTTATTTCACTACAAAAGG CAGCTATCCTAAATACATGCAGAATAGTGAGAAAGTTCATGCAGCTTGA AGAAAACCAAACTTACTACACGCAATAAAAACTAGCATAATTATTAACTC ATAACTAATGTATATTACTTGGCCAAAAGCCCGTATATACAGTTCCACCG GCTCTGTCGACAGACTGAACTGAGAAAGGGGAAACATATGGAAATAAT AATAA(SEQIDNO:1127) R4 DongAG AB097127 Anopheles GAAGGCTAACCACAATAATGGAGACACGATCTATGAGAAAAAGGACCA GAAGGCTA TAACATC METRSMRKRTTRLPE gambiae CACGATTGCCCGAGGAGGGAGCCCCTACTGGAGCTGGGCCTGGGACGG ACCACAAT CGGTGC EGAPTGAGPGTGDR GAGACAGAGCAAGCATCCAGCGGCTGGAAGATGAAATGGTGCAAGAG A(SEQID AAACTC ASIQRLEDEMVQERS CGTTCTTTCAGTCAACGGGCTCTGCCCGTACCACGAACGCAAAACAGAA NO:1251) ATTAAC FSQRALPVPRTQNRN ACGGCAGCCCTATAAATCACCAAGGCAATGCTGCATCTGCTAATGTTGC ATTAAG GSPINHQGNAASAN TGTGGCTGATAGACAACAGTCACTCATTTTGGCAGGAGGCCGACGGCA AAAAGA VAVADRQQSLILAGG GAGGATTATGTGGACGAGAGAGATGAACCATTACGTGATCCGTTGCTA GAGAGG RRQRIMWTREMNH CTATGTTTACACGAGGATGGAGACGGACATGCCCGGCAGAGTGAAGAT AGAAAT YVIRCYYVYTRMETD GCTGGGTATGTTCAATGACCGTTTCCCTCGGTTTGCGCATCAGCTTGACC GAGAAT MPGRVKMLGMFND TGAGCAAGTTGTATATACGGCAGCGAGCTATTATTTTGCCTGAGGAACT GAGATT RFPRFAHQLDLSKLYI GGAGTTCATCAAGCTGGAGGTGCGGAGGGAATTTGGAGAGGAAGAGG CATTCAC RQRAIILPEELEFIKLE CAGGCTGGCGCGAGTCGAGTAGGATTTCTGCTAGGCTTAATACAATAGA CTTTGG VRREFGEEEAGWRES CCAAAATACATCAAGGGCGAGTGAGGATCGTGATTTGGATGAACCCAC CATTTGA SRISARLNTIDQNTSR CGCTCCAGGATTGTCAGTGGATATCCAACACCAGATGGCCACAGCTGTT ATAGCC ASEDRDLDEPTAPGL ACGCAGTTCCATGGGACAGACCCCTTGTCTCGTCACCGACTACCAAAAC CGGGGT SVDIQHQMATAVTQ TGCATTATTCTTACCGCCTGAAAACAGCAGTAAGCATCATAAACCAAGAT AGGTGA FHGTDPLSRHRLPKL GTTCTACCTCAGTATTTGGATAGCGTAGGGAGCATTGAGGATCTGCAGT AAAGTT HYSYRLKTAVSIINQD TAATTGTGTATTCGGCTGCGGTGGCTGTTGTACGAACGCTATGGTTGCG CCCAGC VLPQYLDSVGSIEDLQ GACCTATCCGCAAGGAGACAGCGAAGGTCGACCATGCTCCAAGGCTGA ATATTGC LIVYSAAVAVVRTLW AAAGCCCGCCTGGATGCGACGTCTAGAAAACCGGATCAACGCAACACG TGAGAA LRTYPQGDSEGRPCS GACAAAGATTGGTCGAATGCAGGAATATCAACGGGGAAATTCATCTAT GTGACA KAEKPAWMRRLENR GAAGGTGGTACGTCAGATTGCTGAAATGGTTAAACCTAAAGAACTACG AAATTC INATRTKIGRMQEYQ AGACCTCACTGATGCCAACATAACGGAGGTACTCGACATCCATTTACAA GGATAA RGNSSMKVVRQIAE CGGTTGAGTGCCCTTGCAAAACGATTACGACGTTATGCTGAATGCTCGA TAATAAT MVKPKELRDLTDANI AGCGGAAAGAACAAAATCGAATGTTCAACATTAACGAGAGAGAATTTT AATAAT TEVLDIHLQRLSALAK ACAACTGGATCCGAAATGATAAGCCCAATTTTAGAGAAGGGCTCCCGGA AATAAT RLRRYAECSKRKEQN TATTGGCGACTTTACACAGTTTTGGGCCAATCTATGTGAGAAACCTGTCC AATAAT RMFNINEREFYNWIR AACACAACAGCGAAGGAATGAGGTTAGCAGAAGATGAGCGCTTCAGTG AATAAT NDKPNFREGLPDIGD ATGGTATCGAAGACATGCCCGTGCTAGTTGTGAATGCTCAAGACATACG AATAAT FTQFWANLCEKPVQ TGAGGCAACGCAGTACACCAGGAATGGAGCTGCACCAGGACCCGATTT AATAAT HNSEGMRLAEDERF TGTATACAATTTTTGGTATAAAAAGCTAATCACAATCCATGAGCAGATAG AATAAT SDGIEDMPVLVVNA CGGCATGCTTCAATACGGTGTTGGAAGATTCGAGAAAACTACCAAAATT ATGCAT QDIREATQYTRNGAA TATCACCGGGGGAGTTACTTACTTTCTACCAAAAGATCAAAACACAAAA AATAAT PGPDFVYNFWYKKLI AATCCTGCGAAGTATAGACCACTTACCTGTCTTTCTAACTTAAACAAAGT A(SEQ TIHEQIAACFNTVLED GCTGTCGTCAGTGATAACGCAGAAAGTGAAAGATCATTGCGATACCAAC IDNO: SRKLPKFITGGVTYFL AACGTAATGACCGAAGAACAGACAGGACGTCGAAAAAACACGCAAGGC 1374) PKDQNTKNPAKYRPL TGTAAAGACCAGGTCATTATTGATGCAGTCATTGTTGGTCAAGCAGCCA TCLSNLNKVLSSVITQ AGAAACAAAGAAATCTGGATATGGCATACATCGATTACAAGAAGGCGT KVKDHCDTNNVMTE ATGATTCAGTACCCCATTCATACCTTCTTAAGGTACTCCAGTTGTACAAA EQTGRRKNTQGCKD GTAGACGGGAATGTCATCAAGCTGATGCAGCACGCGATGGGTATGTGG QVIIDAVIVGQAAKK AGTACATCTCTACACGTTACCGACGGAAAAGTTGTACTACGGTCAAGAT QRNLDMAYIDYKKAY CACTCAATATCAGGAGGGGTATTTTCCAAGGTGACACCTTTAGTACGCT DSVPHSYLLKVLQLYK GTGGTTTTGTCTAGCTATGAACCCGCTTAGCAGAACACTCAACCAGCAA VDGNVIKLMQHAM TGCAACTTTGGGTATTTACTCAAAAGTGAAGAAATAAGCACGAGAATCA GMWSTSLHVTDGKV CCCACACCTTCTTTATGGATGACTTGAAGCTGTTCGCAGAAACAGTACA VLRSRSLNIRRGIFQG GAAGATGCACCACCTGTTGAAGAACGTGCAGGGATTCAGCAACGACAT DTFSTLWFCLAMNPL TAAAATGGAATTTGGTATCGGTAAATGTCGATCAATTCATCTACACCGA SRTLNQQCNFGYLLK GGTCAAGTATTGGATGCCGATAGCTTCCGTGCCAACGAACAAGAGGAA SEEISTRITHTFFMDD ATCCGCCACATGGTTCAAGGTGAAACTTACAAGTTCCTCGGTTTCCTGCA LKLFAETVQKMHHLL GCTGAGGGGTATTCACTATGCAGTGATCAAGAAAGAGCTACAGGACAA KNVQGFSNDIKMEF GTTCTTACATCGTGTTAGCTGTATCCTGAAGAGCTTTTTGTCAGTCGGCA GIGKCRSIHLHRGQV ACAAGGTGAAAGCAATAAACACATTTGCGGTGGCTCTGTTGACCTACAG LDADSFRANEQEEIR CTTTGGAGTAATGAAATGGTCTAATACTGACTTGGAAGCGTTGGAGCGA HMVQGETYKFLGFL ACAATTCGTGTGGTTTCCACTAAGCACCAAATGCGTCACCCAAAAGCGT QLRGIHYAVIKKELQD CCGTCGAGAGAGTAATCCTGCCACGAAAAATAGGAGGGGTAGGAATCA KFLHRVSCILKSFLSV TTGATATTCAGGCACTTTGTATTTCTCAGATCCATCAGCTGCGAAGTTAC GNKVKAINTFAVALL TTCGTGGAAAGCCAAAACCGACATGAATTATACCGCACTGTGTATAAAG TYSFGVMKWSNTDL CAGATCACGGATTAAGCGCCCTGCATCTAGCGCAGCAAGATTACCAGCT EALERTIRVVSTKHQ GAATTGCAACATAAAAACCGTCGATGGAAAAGGCGCAACGTGGAAACA MRHPKASVERVILPR GAAGGAGTTACATGGGACGCACACCCATCAACTGAATCTGGAACATATC KIGGVGIIDIQALCISQ GACAAAGTGTCATCTAGCACTTGGCTTGTGAGGTGTGACCTTTTCTGTG IHQLRSYFVESQNRH AGACAGAAGGTTTCATGGTAGCCATCCAAGACCGGGTAATTGCGACGT ELYRTVYKADHGLSA GGAACTATCGGCGGTGTATATTGCGTGAAGACGTGGAGGACCGATGCA LHLAQQDYQLNCNIK GAAAGTGCAACTCAGGAGGAGAATCGATTGAGCATGTCATTGCCGGCT TVDGKGATWKQKEL GTCCAGTGCTAGCTGGGTCAGCGTATCTCGATCGCCACAACGACGTTGC HGTHTHQLNLEHIDK CAAAATTGTTCACCAGCAGCTTGCACTGAGGCACAAGTTGGTAGAGCGA VSSSTWLVRCDLFCE TTTTTACCCTGCTACCGATACCTCCCAGATCCGGTCCAGGAAAATGATTG TEGFMVAIQDRVIAT CATAAAGCTGTATTGGGATCGCGAAATTATAACGGACATCCTCATCCGT WNYRRCILREDVEDR GCCAATAGGCCAGACATCTTAGTCTACGAGAAAAGAAAGAAACGAGCG CRKCNSGGESIEHVIA ACCATCGACATCGACATTGCTGTAACGTTAGACCATAATGTTCAGACAAC GCPVLAGSAYLDRHN ATTTTCCACCAAGGTGATGAAGTATCATGATCTGGCAGAGGAGTTGAAG DVAKIVHQQLALRHK CAGACGTGGTATCTGGAGGATATCCGCATTGTTCCGGTAATCATCTCGG LVERFLPCYRYLPDPV CGACCGGAATTGTACCTATGGCCCTCTTACGTTCCCTGGACGAGCTCGA QENDCIKLYWDREIIT ACTGCAGAGAGAACTACCCAGGATTCAGAAGGCGGTGATTCTTCGAAC DILIRANRPDILVYEKR ATGTAGCACTTTAAGAAGGTTCCTGAATCCCTATAACTAACATCCGGTGC KKRATIDIDIAVTLDH AAACTCATTAACATTAAGAAAAGAGAGAGGAGAAATGAGAATGAGATT NVQTTFSTKVMKYH CATTCACCTTTGGCATTTGAATAGCCCGGGGTAGGTGAAAAGTTCCCAG DLAEELKQTWYLEDI CATATTGCTGAGAAGTGACAAAATTCGGATAATAATAATAATAATAATA RIVPVIISATGIVPMA ATAATAATAATAATAATAATAATAATAATAATATGCATAATAATA(SEQ LLRSLDELELQRELPRI IDNO:1128) QKAVILRTCSTLRRFL NPYN(SEQIDNO: 1496) R4 R4- CADV01008175 Bursa- GGGATCCTGGGTTCCTACTACCTCGCTCCACCTCCTCGCGATGGATCCTG GGGATCCT TAAGAA MTCNNAVVFPPADG 1_BX phelenchus GGGAAGTCTCCGGACTGAGCTAAGAGAGCGTTAAAGTAGAGGGTGAC GGGTTCCT AAGCAT NPAGTADRNFAIRFP xylophilus GGCGTAAGTACCTCCAAGTTGCGGTGGAGCGGAACATCTACTCTTCGGA ACTACCTC GAAATA SSEPPGPSGIRPSEPL GAGAGGGGAAGCTCTATGGCGGCGTTAGAAAGGTTGGACTACGGCAA GCTCCACC ATAAGA DGRTGIGDVEHAQA CGCCAGGGAGATGGGGAAGGTTCATCAGGTGATACTAGTTCGCTACTG TCCTCGCG AATCAG GNGGFLVDVLEYKEA TCATTCGATGTATCCGGAACCATACTCGCCAAGTTGTGAACTATGTGAA ATGGATCC ATAAGA HRYGSKCEFCYVQTK AGTCTGGATCCAATCCAAGACCACGGGGCGCAATTAAAAGGTGTGAAG TGGGGAA ATAACA GTVCSKPRTDAWLK CAGCTTGCTGGTGATCACCACGGTGGTACCTACACCGGCGGGGGAACA GTCTCCGG AGAATA CEILFLLHHAYTANQ TCTTTAATGCCGAGATGACCGCGCGAATAATAGGGAGCCGGCAAAAGC ACTGAGCT CTAATA NKSIELAESAFRRAGI CGAGCGTAAGTGGAAAGGATACAGAAATTTGCTCAAAAGTATACCAAC AAGAGAG AGTATA TRRSKATIAKRWSLIQ CCGCCACTTATAAAAGACCGAGAAAAGGGTACCACTAGAGCTCATTATA CGTTAAAG TCATGTA RGKGTDYKEYWDEY AAACATCTATGACATGTAATAATGCTGTGGTCTTTCCGCCAGCCGATGGT TAGAGGGT ACTATG FEKFRYECNPTPIVRR AACCCCGCTGGGACTGCGGATAGGAATTTTGCTATTAGGTTCCCCTCCA GACGGCGT ACAAAA KRNRLAAGLQSPSSV GTGAACCTCCAGGACCCTCTGGAATACGACCTTCTGAGCCTCTGGACGG AAGTACCT AGAACG PNGYEFERKRTCETPL TAGAACCGGGATTGGCGACGTTGAGCACGCCCAGGCGGGAAATGGAG CCAAGTTG CACCAA DTKASSLPLICNLLTGI GATTCCTGGTTGATGTTCTGGAATATAAAGAAGCACATCGCTACGGCTC CGGTGGA TAAGAA VGVENVEENMSVEC CAAGTGTGAGTTTTGTTATGTGCAAACCAAAGGTACCGTGTGCTCTAAG GCGGAAC CATGCTT TEPKELSGTANSSVP CCTCGGACCGATGCGTGGTTGAAATGCGAGATTCTGTTCTTGCTTCATCA ATCTACTC GAGTGG GLAEGVYERRHNNV TGCCTATACGGCTAATCAGAACAAGAGTATTGAACTCGCCGAGAGTGCT TTCGGAGA CCAGCT NEPAAGCPQDVPVA TTTCGGCGAGCAGGAATTACCCGACGTAGCAAGGCTACCATCGCGAAA GAGGGGA CTGCAG NNLIDSPTTNDRLEA AGATGGTCCTTAATACAGCGAGGTAAAGGCACCGACTATAAGGAATACT AGCTCTAT GCAAAA EFKAQLDRAERSYMR GGGACGAGTACTTCGAAAAATTCAGATACGAGTGCAACCCCACACCTAT GGCGGCG GTCGAA RRLPRLKNLSPDERM AGTACGGCGCAAACGAAACAGGTTAGCCGCTGGGTTACAGTCGCCCAG TTAGAAAG TTTGGA WIGTTVERLRLETVSE CAGCGTGCCGAACGGATATGAATTTGAGCGGAAAAGGACTTGTGAAAC GTTGGACT ACAGCC PVCEQWRLANAGLY GCCTCTTGATACCAAGGCTTCAAGTCTTCCACTAATCTGTAACTTGCTAA ACGGCAAC GGTAAT AAIRSIAVMRPLDAA CAGGAATTGTCGGAGTTGAGAATGTTGAGGAAAATATGAGTGTAGAAT GCCAGGG GGAAGA REAHKTWLLNMKM GCACTGAGCCGAAAGAGCTAAGTGGAACTGCGAACTCTTCTGTTCCGG AGATGGG CCTGCA TERKLRQQIGWVETT GTTTGGCAGAAGGAGTATATGAAAGACGACATAACAATGTCAACGAGC GAAGGTTC ACAAAC RRTKNEARTERQEIV CAGCGGCTGGGTGTCCCCAAGATGTCCCAGTCGCAAACAACTTGATCGA ATCAGGTG GTGGGG YRKVAKLRRERFPEM TTCCCCAACTACTAACGATCGATTAGAAGCTGAGTTTAAGGCACAGCTG ATACTAGT TAGCAG DLDSVSVHLKRKLELL GATAGAGCGGAACGGTCTTATATGAGACGAAGGCTACCTAGGCTGAAG TCGCTACT GCAATA KGRIQVRTAERLRRD AACCTATCCCCTGACGAAAGGATGTGGATTGGAACAACAGTCGAGCGC GTCATTCG TGTAACT TREAAGPYGKTALRG CTAAGACTCGAGACTGTGAGCGAACCTGTATGTGAACAGTGGCGGCTT ATGTATCC ATGACA QGFAPNVKDATQY GCCAACGCTGGGCTATATGCAGCTATCCGGTCAATCGCGGTAATGCGCC GGAACCAT GACCAA WSGLAQPSGQKCSE CCCTGGACGCCGCCCGCGAAGCCCATAAAACTTGGCTTCTAAATATGAA ACTCGCCA AACTCC NSAILSDWKELVECN AATGACGGAAAGGAAACTAAGACAGCAGATTGGCTGGGTCGAGACGA AGTTGTGA GAAACT LSSLPDQMEPLVVQG CGCGCCGTACTAAGAACGAGGCACGTACTGAAAGACAAGAAATTGTCT ACTATGTG CTGGTA ISRASPWKSPGPDGI ATAGAAAAGTGGCCAAACTGCGAAGGGAAAGATTCCCCGAGATGGACC AAAGTCTG ATGAGC FNYYWRQDFIVDWL TGGACAGTGTGTCAGTCCACCTCAAACGCAAGCTTGAGTTACTTAAAGG GATCCAAT CCGTGC KQLMLDSLRTGHYP GCGGATTCAAGTGAGAACCGCAGAGAGGTTGCGAAGGGATACCAGAG CCAAGACC CCCCCA WKLSSGRTVLLYKDG AGGCCGCTGGTCCATATGGTAAGACTGCTCTTCGGGGTCAAGGGTTTGC ACGGGGC AGCATG DPTKAENYRPITCLN CCCGAATGTAAAAGACGCGACTCAGTACTGGTCTGGATTGGCCCAGCCA GCAATTAA TGGTCT GCFKMINSVVSEVILK TCCGGACAAAAATGTTCTGAGAACTCGGCGATTCTCTCAGATTGGAAAG AAGGTGTG CGTTCG RVENTIALPIEQMAL AACTCGTCGAGTGTAATTTAAGCAGCTTGCCGGATCAGATGGAGCCGCT AAGCAGCT ATGTAG RRKVWACVESQIWD GGTAGTCCAAGGCATAAGTCGGGCTAGTCCATGGAAATCTCCTGGACC TGCTGGTG TTAGGA QIKQRKLSDRTQKCK GGACGGTATTTTTAACTACTATTGGCGACAAGACTTTATTGTGGACTGGT ATCACCAC ACAGTT VAWVDFSKAYDSLN TAAAGCAGCTAATGCTGGATTCCTTGAGGACCGGGCACTATCCGTGGAA GGTGGTAC CTCTTTA HDAIKFVIGVLKLPTG ACTTTCAAGTGGACGCACGGTGCTCCTGTACAAAGACGGGGATCCGACC CTACACCG ACCCGT INNYLLDSMQNWST AAAGCCGAGAACTACCGACCTATCACCTGCCTAAACGGATGCTTCAAAA GCGGGGG GATGAT HLELKSSGKVVRGPS TGATCAACTCGGTTGTTTCTGAAGTTATCCTGAAACGCGTCGAAAATAC AACATCTT TACGCC YPIKRGVLQGDSLSPT GATAGCTCTCCCGATAGAGCAAATGGCTTTACGACGTAAGGTATGGGCA TAATGCCG CTGTCTT LFVVVTSIIVRHIKTIES TGTGTTGAGTCTCAGATATGGGATCAGATCAAACAGCGGAAGCTGTCTG AGATGACC AAATGG SDIQMYMDDIKLYGK ATCGAACACAGAAATGCAAGGTAGCGTGGGTAGATTTCTCAAAGGCTTA GCGCGAAT CAGGTG DQETLTRLIKELQTVS TGACAGTCTTAACCATGATGCTATAAAGTTCGTCATAGGAGTTCTAAAG AATAGGG CCACCA NKLGLCMNLKKCAIL CTTCCGACAGGTATTAACAACTACCTGCTTGACAGTATGCAGAATTGGA AGCCGGCA AATACC GDDLPEEINGIEHLKE GCACGCACCTCGAGCTTAAAAGCTCTGGAAAGGTAGTACGAGGCCCTA AAAGCCGA GAACAC SYKYLGVPQREITQV GTTACCCGATTAAAAGGGGCGTTCTTCAAGGCGATAGTCTAAGCCCAAC GCGTAAGT TCGTTG RATMAALEKKILTEV ACTGTTTGTCGTGGTGACCAGTATAATAGTACGTCATATCAAGACGATC GGAAAGG AGGTAT DTSLGAAELSYRQRIS GAAAGTAGTGATATACAAATGTATATGGATGATATCAAGTTGTATGGCA ATACAGAA GGTGGT RVNSKIAPLVRFVVQ AAGATCAGGAAACTCTGACACGGCTAATAAAAGAGCTGCAGACGGTAT ATTTGCTC CCGAAT SMLVTPRDVLKVYNR CGAACAAGTTAGGTCTATGTATGAACCTTAAGAAATGCGCAATTCTCGG AAAAGTAT GTGAAG LGGIDVEIRRRLVKYEI CGACGACCTCCCAGAGGAAATCAACGGGATTGAGCACCTTAAGGAATC ACCAACCC CTGGGA RYKKSNVARLYLDRK ATATAAGTACTTAGGTGTTCCCCAACGGGAAATCACCCAAGTGCGTGCC GCCACTTA GTACAA VGGIGFVNLCRIMVE ACCATGGCCGCACTTGAAAAGAAAATACTAACTGAAGTAGATACTAGTC TAAAAGAC TTTGGTA AVAARAVYCRLAPSF TTGGAGCAGCCGAACTAAGTTACCGCCAACGGATATCGCGTGTTAACTC CGAGAAA CGAGAG NEFQDFLAEQNTSPI TAAGATCGCGCCGTTGGTGAGGTTCGTTGTGCAGAGTATGCTGGTCACG AGGGTACC CACCAG TAAQTILDKCGINIELS CCACGCGATGTGCTTAAAGTGTACAATAGGCTTGGAGGCATTGATGTCG ACTAGAGC CGCCCC TSTLGDVKKIVRNHY AAATTCGAAGAAGACTCGTCAAGTACGAGATCAGATACAAGAAAAGTA TCATTATA CGATCT HELWLTAWKNTGLY ATGTAGCCCGGCTCTACCTTGATAGAAAAGTTGGCGGCATTGGTTTCGT AAACATCT AAGTGA KRWENDHVDIKRSSL AAACCTCTGCCGGATAATGGTCGAGGCGGTTGCAGCCAGAGCTGTCTAC (SEQID TGACGC WINRGNLSANNARI TGCAGACTTGCTCCCTCGTTTAATGAGTTCCAAGACTTCCTTGCAGAACA NO:1252) ATGCGT GIGIQDNSIFCRGFVG GAATACCAGCCCAATCACTGCTGCACAGACAATTTTAGATAAGTGCGGC CGGAAC NKCDTKYCRLCGDGI ATTAATATTGAGCTGAGTACATCTACTCTGGGAGATGTAAAGAAAATAG AATGAA ESVSHIVTGCPTHRT TGCGAAACCACTATCACGAGCTTTGGCTCACGGCATGGAAGAACACTGG GACGGC NLYIERHDCVARNVY GCTCTATAAACGTTGGGAGAACGACCATGTAGACATTAAAAGATCTAGT TGGCAA AYLAIRYGIPVPHYTQ TTATGGATCAATCGAGGTAACCTAAGTGCGAATAACGCTCGTATTGGAA ACATTCA RVKTIEKNGDQSVEL TTGGCATTCAAGACAACAGTATCTTTTGTCGAGGCTTTGTTGGGAACAA GGAGTC YWNYKFPCTRALEAC GTGTGATACGAAATATTGCCGATTATGTGGGGATGGCATAGAATCCGTA GCAAA RPDIVLIDKVSKRTHII AGCCATATTGTAACTGGTTGTCCGACTCACCGTACAAACCTGTACATAGA (SEQID EVAVSWRGRLQEMV GCGGCATGACTGTGTTGCAAGAAATGTTTACGCATACCTCGCCATAAGG NO: DRKVYKYTVNGEYEA TATGGAATCCCTGTGCCACACTATACGCAGCGAGTCAAGACAATAGAGA 1375) DGSSRGWNIVRELN AAAACGGCGATCAGTCTGTAGAGCTATACTGGAATTACAAGTTCCCATG DQYGFPVEVYTLVIG CACCCGTGCACTGGAAGCTTGCCGTCCGGATATTGTTCTCATAGATAAG AGGEILPCTVKDVERL GTTAGTAAGCGAACTCATATAATTGAGGTGGCTGTCTCATGGCGGGGAC TGGAATDNLIERMER GGTTACAGGAAATGGTCGATAGGAAAGTGTACAAGTATACGGTAAACG SAVLGSCRIIKRHLAL GAGAATACGAAGCCGATGGCTCTTCCCGTGGGTGGAATATCGTCCGTG (SEQIDNO:1497) AACTAAACGATCAGTACGGCTTTCCTGTCGAGGTATATACGCTAGTGAT TGGTGCTGGAGGCGAAATCCTACCATGCACGGTTAAAGATGTAGAACG GCTCACCGGTGGAGCGGCTACTGATAACCTCATTGAGCGTATGGAGCG GAGCGCAGTCCTTGGCTCATGTCGTATCATTAAAAGGCATCTGGCACTC TAAGAAAAGCATGAAATAATAAGAAATCAGATAAGAATAACAAGAATA CTAATAAGTATATCATGTAACTATGACAAAAAGAACGCACCAATAAGAA CATGCTTGAGTGGCCAGCTCTGCAGGCAAAAGTCGAATTTGGAACAGCC GGTAATGGAAGACCTGCAACAAACGTGGGGTAGCAGGCAATATGTAAC TATGACAGACCAAAACTCCGAAACTCTGGTAATGAGCCCGTGCCCCCCA AGCATGTGGTCTCGTTCGATGTAGTTAGGAACAGTTCTCTTTAACCCGTG ATGATTACGCCCTGTCTTAAATGGCAGGTGCCACCAAATACCGAACACT CGTTGAGGTATGGTGGTCCGAATGTGAAGCTGGGAGTACAATTTGGTA CGAGAGCACCAGCGCCCCCGATCTAAGTGATGACGCATGCGTCGGAAC AATGAAGACGGCTGGCAAACATTCAGGAGTCGCAAA(SEQIDNO: 1129) R4 R4- Heterodera TGGCGATACTCGGAACCTCCGGGGAGCCTGGTAGGAGTTGGCCTACAG TGGCGATA TGAGGA MISCDLERETLTQMA 1_HG glycines GTCGCGAAAGTCCCTAGGTGCTGCACGGGTTGCGCTAATCCGAGGGCG CTCGGAAC CTCANA LFRARSDKTPTHAGIP CTGGGTTACCTTCCCATCGGCCAAAAACGTCTGGGCCTTCTTAGCCGCG CTCCGGGG ATTGAC APDEVREGGCGQNR GGTCCAGTATTTCTGTTGAGCCTGACAGTTCTTCCCGGATATGGCGAAA AGCCTGGT AATACA TNPAAPRGKAAAIQR GATTACAGGGCGGTATTTCGTGAAACCTAAAAAWGGTCGGGCCGAATG AGGAGTTG CCTCAG QNGITIPIXACAQSGL GCACGGACAGACTCACTTCGGAGTGAGCTCGGGGGCATCCGTGTGTTA GCCTACAG A(SEQ VRTQRVQQWSAVEE CCCCGCTGCACCACGCCGAAGCTGTCATAGCGAGCCCGAAGGGGAATG GTCGCGAA IDNO: SALKDVVVRNTDDR GCCATGGAGACTCCAGCCTCACCCTGTAACTCGAACCTAAGTCCAGGCC AGTCCCTA 1376) GLINWAKGVLPEWQ CCTTTCTGGTGTTGGCTCGCACTGGTTAGGAACACGACAGTCTCGTGTA GGTGCTGC RLCQLNPTMYMARS ATCCCACACGCACCGAAAGCCCTAGCCCTCGTAGGCGAAGGCTGCGTTG ACGGGTTG SPSLSNKWASLRRTH CTGGTTTCAGAACTGTGAGCWAGTGGGGTTCGGATGGCCGAGTGTACC CGCTAATC VGPGCPSKEGSGPSQ AACCTGCTTGCTAACAGGTAGCATAGAGTAATATGCTAGTAAGCAGGGC CGAGGGC DLSDVKIQPARLAHD ACGAGAAAGGGCCGTAAAGGCTCCGGCTATGAAGGACCTTGCGACCAC GCTGGGTT TVAELPQRTVPCGTD GCGTGTGTCTCCCACGTGCGGATTCTTGAAGCCAGAGTCTTGCACTGCG ACCTTCCC GHGVIDSDETETALA CGCAGGATGGAGCCTGTGCAACTCCTCCCTCGCTGATCGCAGGAGTGG ATCGGCCA EVSRSSPFGEREPLDL AGGATCACCACTCTTTTTACCTTGCTAGCTTGGGGTACCACCTTGAGCTG AAAACGTC GATERITRKRLRNAV GGGCCGGCCTTGCTAGCTTGGGGTACGACCCTTTGAGCTGGGGCTGGC TGGGCCTT RDVVPPRKRRVPSTP CTTACTAGCTTGGCGCGCCACCCTTGGAGCTGGGGTGGCGCAAGATCAC CTTAGCCG SRKEQDLVPEVDGPA TTGTATACGGTCTAACCAATACATTTGAAAAGCGATCAAAGCGAAATGA CGGGTCCA PTDVLTHPPTESEPEP TATCCTGTGATTTGGAAAGAGAAACCTTAACCCAAATGGCTCTCTTTAGA GTATTTCT MLDPLSLVQLVRPQL GCCCGCAGTGACAAGACGCCGACCCACGCCGGGATACCAGCCCCTGAT GTTGAGCC GRAMGWAAEEMEL GAGGTCCGAGAGGGAGGGTGCGGCCAAAACCGAACCAATCCCGCGGC TGACAGTT GNVVMDVELKREFN CCCCCGTGGTAAGGCGGCAGCGATTCAAAGGCAAAACGGCATAACTAT CTTCCCGG REVRRVGRTPPDQM TCCTATAAAWGCCTGCGCTCAATCCGGCCTAGTCCGGACCCAGAGGGTT ATATGGCG YKRGAGPPLPQKREP CAGCAGTGGTCAGCCGTGGAGGAGTCCGCCTTAAAGGATGTGGTGGTG AAAGATTA ERVALLEQLIAARVER CGAAACACAGACGACCGGGGGCTCATTAATTGGGCCAAAGGTGTGCTC CAGGGCG GINRGLDWFLELNVA CCTGAATGGCAAAGATTGTGCCAGCTCAACCCGACCATGTACATGGCTC GTATTTCG VFAAARVLSRRERVE GATCGTCTCCATCGCTGTCCAATAAGTGGGCATCTCTGCGGCGGACCCA TGAAACCT TLADRLHINDSATLSE TGTTGGTCCTGGATGCCCTTCGAAAGAAGGCTCCGGGCCCTCACAAGAC AAAAAWG VSRRRAKAERKLRCA CTTTCAGATGTAAAAATCCAACCAGCCAGGTTGGCCCATGATACGGTGG GTCGGGCC REQPWMSRRIRXLG CCGAGCTACCTCAACGTACTGTCCCTTGCGGCACAGACGGTCATGGTGT GAATGGCA VRVERLKQLADLVRQ AATCGATTCCGATGAGACCGAGACTGCCTTAGCWGAGGTATCCCGGTC CGGACAG RIAGRGNRSSYEGPR GAGTCCCTTTGGAGAACGCGAGCCGCTCGACTTAGGGGCGACAGAGAG ACTCACTT RRFRLRPSLRSVTEAP GATCACGCGTAAGCGGCTGCGCAACGCCGTTCGCGATGTGGTTCCACCT CGGAGTG VNPPLNGNEVYTFW CGGAAACGCAGGGTTCCATCCACCCCTTCACGTAAGGAACAAGATTTGG AGCTCGGG HSLWAQSLRANTDD TTCCAGAGGTGGACGGGCCTGCACCTACGGATGTGTTGACACACCCACC GGCATCCG CQLREFKNQLSAARH CACAGAATCCGAACCAGAACCGATGCTTGACCCACTCTCACTTGTCCAAC TGTGTTAC TDLTSVGTSSLVQMF TGGTAAGGCCGCAATTAGGCCGGGCAATGGGATGGGCAGCCGAGGAG CCCGCTGC SAALRKMKKGKAPG ATGGAACTCGGGAATGTCGTGATGGACGTGGAGCTGAAGAGGGAGTTC ACCACGCC PDGIRAAWWGVFRR AACCGGGAGGTGAGGCGTGTCGGAAGAACGCCTCCCGACCAAATGTAT GAAGCTGT IAPYVATWVVRVIRG AAGCGTGGCGCAGGTCCACCACTCCCACAGAAACGAGAGCCCGAACGA CATAGCGA AEPVANWICNGLTVL GTAGCCCTACTCGAGCAACTCATTGCGGCGCGCGTGGAGCGTGGCATC GCCCGAAG LPKSSDNADPSNYRPI AACAGGGGACTTGATTGGTTCCTTGAACTGAATGTCGCGGTCTTTGCAG GGGAATG TCLNTCYKLFTAVIAQ CCGCTAGGGTTCTGAGCCGCCGGGAGAGAGTGGAAACTCTTGCTGATC GCCATGGA ITASYVDVLGGLPRQ GGCTGCACATCAACGATTCTGCCACTCTGTCCGAAGTATCCAGACGGCG GACTCCAG QVALRKGVWGTSVS AGCTAAAGCCGAAAGGAAACTCCGGTGTGCMCGGGAACAACCATGGA CCTCACCC LMIDALTVADARRAK TGAGCAGGCGGATCAGGAWACTTGGTGTGAGGGTCGAGAGACTCAAA TGTAACTC RPLGVCWFDFKKAF CAACTAGCTGACCTGGTCCGGCAAAGAATAGCTGGGAGAGGTAATAGA GAACCTAA DSVPHNLIRWILRVIG TCGTCTTACGAAGGCCCTAGGCGGAGATTCCGACTAAGACCCTCTTTAC GTCCAGGC LPPVILSVIVSVMDQ GTTCTGTTACTGAAGCACCTGTCAATCCACCCCTTAACGGGAATGAAGTT CCCTTTCT WATRLKIGGKVMPK TACACGTTCTGGCACAGCCTTTGGGCGCAGAGCCTCCGTGCTAACACGG GGTGTTGG TIPVRTGVFQGDTLS ATGACTGTCAACTGAGGGAATTCAAAAACCAGCTTTCTGCTGCGCGACA CTCGCACT PLLFCLSVWPISFALD CACGGATCTGACGAGCGTCGGAACCAGCTCATTGGTTCAGATGTTCAGC GGTTAGGA QFPQYQFRCANHLQ GCGGCCCTTCGGAAAATGAAGAAGGGCAAAGCTCCCGGTCCTGATGGT ACACGACA QGFSVGHVFYMDDL ATCCGTGCCGCGTGGTGGGGAGTGTTCCGGCGAATTGCACCGTATGTG GTCTCGTG KCYCPDREVLTAVIQ GCTACGTGGGTGGTCCGAGTAATACGWGGGGCAGAACCAGTAGCTAA TAATCCCA QVQKSASALGLTIHY CTGGATCTGCAACGGGCTCACTGTCCTGCTGCCCAAGAGTAGTGACAAT CACGCACC KKSAWLDQDGGKSG GCCGACCCTAGTAACTACCGGCCAATCACATGCCTAAATACMTGCTACA GAAAGCCC KAVLGVPXLVGTYKY AACTCTTCACTGCCGTTATCGCACAAATAACCGCAAGTTACGTTGATGTT TAGCCCTC LGMHERFMIVSKDSL CTTGGTGGCCTACCTAGACAACAAGTAGCACTGCGAAAAGGCGTTTGG GTAGGCG ESVRGKFMGRLKTL GGCACCTCTGTGTCCCTGATGATCGATGCGCTAACGGTGGCTGACGCGA AAGGCTGC WTSKLTFGQAMLGT GAAGAGCTAAACGCCCACTTGGAGTCTGCTGGTTCGACTTCAAGAAAGC GTTGCTGG KSXCMPVVRYVLQNL GTTTGACTCAGTACCGCACAACTTAATTCGCTGGATCCTGAGAGTCATCG TTTCAGAA FLPKSEFNQTRLVLRE GACTCCCCCCGGTCATCTTGTCTGTCATTGTCTCTGTGATGGATCAATGG CTGTGAGC WDRQIRDLLDECNIR GCAACGAGGCTMAAGATAGGTGGAAAGGTCATGCCCAAGACCATCCC WAGTGGG QVFRSKTELYVSREEG GGTGCGAACCGGCGTATTCCAAGGGGACACCCTATCGCCCCTGCTKTTC GTTCGGAT GWGLPSMEDALEEE TGCCTATCKGTKTGGCCAATMTCGTTCGCCCTAGACCAATTTCCTCAGTA GGCCGAGT VVTKLAILVARQETEP TCAGTTTCGTTGTGCAAACCATCTTCAGCAGGGCTTTAGTGTCGGACAC GTACCAAC LFRVCEALERKRCPTP GTGTTTTACATGGACGACCTGAAATGCTATTGCCCAGACAGGGAAGTGT CTGCTTGC LSLGLQILKDWGVGV TGACAGCAGTTATTCAGCAGGTGCAGAAGAGCGCTTCTGCCCTCGGTCT TAACAGGT ELQGRTLLLNGNTVG GACGATCCACTACAAGAAATCTGCGTGGCTCGACCAGGACGGCGGGAA AGCATAGA PSQATRKLTGELVLR AAGTGGCAAAGCCGTCTTGGGTGTACCAGWACTTGTCGGCACTTACAA GTAATATG REAERLSRWRSKVKP GTACCTGGGAATGCACGAGAGATTCATGATTGTAAGCAAAGACAGTCTT CTAGTAAG GCGMTGGAWRDVP GAGTCAGTCAGGGGAAAGTTCATGGGGCGGCTTAAAACCCTGTGGACC CAGGGCAC GIDVHLSNRWLVKG TCGAAACTGACATTCGGGCAGGCAATGCTTGGCACAAAGAGCMTCTGC GAGAAAG ALSPTVVSNSLAIRAN ATGCCAGTCGTGCGCTACGTGCTGCAGAATCTATTCCTACCAAAATCGG GGCCGTAA TVILRGSGGGYTKGTL AATTCAACCAGACCCGTCTGGTCCTGAGAGAGTGGGATCGTCAGATCAG AGGCTCCG LRCRGCGNTGETRRH AGACCTACTGGATGAATGTAATATCCGGCAGGTTTTCCGAAGCAAGACG GCTATGAA IVSACSLGRQKGAAS GAGTTATATGTCAGCCGCGAAGAGGGTGGCTGGGGTCTTCCTAGCATG GGACCTTG RRHDNVCRILVRAIC GAAGACGCCCTCGAGGAAGAGGTAGTCACAAAACTTGCCATACTTGTTG CGACCACG HKLNIEPPNSANFPH CAAGACAAGAAACTGAGCCGTTGTTCCGGGTGTGTGAAGCTCTTGAAA CGTGTGTC VVVLEGSGAKMWID GGAAAAGGTGCCCCACCCCCCTGTCCCTAGGATTGCAAATACTGAAGGA TCCCACGT FPFVVPHKIRHTRPDI CTGGGGAGTTGGGGTAGAGTTACAGGGCCGCACATTGCTGCTGAACGG GCGGATTC VVLFEWNGVRRLSVI GAACACAGTGGGCCCATCTCAGGCAACCAGAAAACTCACTGGAGAACT TTGAAGCC EVAVSDVANMQTQ GGTCCTGAGGAGGGAAGCGGAACGGCTCAGTAGATGGAGAAGTAAAG AGAGTCTT HIRKSHRYGTNSTEPF TGAAACCTGGCTGTGGCATGACTGGAGGGGCATGGCGGGATGTGCCCG GCACTGCG VAGVTPTYRNDCLAA GAATCGATGTCCATCTTAGCAATAGGTGGCTCGTCAAAGGTGCACTGTC CGCAGGAT QLRAKFKAQQVDVIP TCCGACCGTTGTCTCAAACAGCCTTGCCATACGAGCAAACACGGTGATT GGAGCCTG IIVGTTGETLDGEFGR CTGAGAGGGAGTGGCGGGGGCTACACCAAAGGTACCCTCCTCCGGTGC TGCAACTC IRKGLPMLTKLQMPR AGAGGATGTGGTAACACGGGAGAAACACGCAGACACATTGTGTCAGCA CTCCCTCG LWSEIQRAVILGSYRI TGCTCCCTGGGAAGGCAGAAGGGGGCTGCCTCCCGCAGGCACGACAAC CTGATCGC LVEHLALPKGGA GTCTGCAGGATATTGGTGCGAGCCATCTGCCACAAACTCAACATTGAAC AGGAGTG (SEQIDNO:1498) CGCCCAACTCAGCCAACTTCCCCCATGTCGTTGTGCTGGAAGGCAGTGG GAGGATCA TGCTAAGATGTGGATAGACTTTCCCTTCGTGGTGCCACATAAGATCCGC CCACTCTT CACACACGACCTGACATCGTCGTCTTGTTTGAGTGGAACGGGGTCAGAC TTTACCTT GTCTTAGTGTCATCGAAGTGGCTGTTAGCGATGTGGCAAATATGCAAAC GCTAGCTT ACAACACATCCGCAAATCCCACAGATATGGCACCAACTCGACTGAACCC GGGGTACC TTCGTCGCGGGGGTCACGCCTACGTATAGGAATGACTGCCTGGCAGCG ACCTTGAG CAGTTGCGAGCGAAATTTAAAGCCCAGCAAGTTGATGTGATCCCGATCA CTGGGGCC TTGTCGGGACCACTGGTGAAACATTGGATGGCGAGTTTGGGCGGATCC GGCCTTGC GAAAGGGTTTGCCTATGCTCACTAAACTGCAGATGCCAAGGCTGTGGTC TAGCTTGG AGAAATCCAGCGGGCGGTGATCCTCGGATCATATCGAATTTTGGTTGAA GGTACGAC CATCTTGCGCTACCAAAGGGGGGCGCTTGAGGACTCANAATTGACAAT CCTTTGAG ACACCTCAGA(SEQIDNO:1130) CTGGGGCT GGCCTTAC TAGCTTGG CGCGCCAC CCTTGGAG CTGGGGT GGCGCAA GATCACTT GTATACGG TCTAACCA ATACATTT GAAAAGC GATCAAAG CGAA(SEQ IDNO: 1253) R4 R4- Parhyale CCGACCGCCAGCGGGATAACTGGCAAACCCTGTCTCGACCACCGGCCCG CCGACCGC TGAGCC MKMSHNRDTPSNG 1_PH hawaiensis TGAATCCATCGGGGCGTATGAGTCTGACACAGGGGGGTGTTTAAGGTG CAGCGGG TTAGGT VKGTSVRLGTSLVRSP ACCCTGTTGCGAGGAAATGCGCAGCAAAAGCCGGATGAGCCTTAGAAC ATAACTGG CGCGGG VGEAGAVRERGTHP ATCGAAGGCCAACGACAACCTGCCGAAAGACTGGCGACTAATCCAGTC CAAACCCT ATGTGA SESVSQDSDASVNAT AACCTCCTGTAGGTCACCGGCTGGTCATGCTGAATCTCAGCTTAACCAG GTCTCGAC CCCGGC GEGSVREQAPLSPPG GCGTACGAACTTACAGTTGGAGGGTCGAGCACCCCTGATGGCTGAAAA CACCGGCC GCCAGA AEEATVPTQRRTRHK GGACCATCAAAGTCGAAGGTAGCCAACGAGAAGACCAACGGCTGATTC CGTGAATC GTGTAG WSREDRVVLWECFV AGGCGGAAGAGTCAACTCGTTGAATGCGTTCGACAGCTTGGGGTAGAT CATCGGGG AGCTGA ASKREGPGYLKRLKQ GGAACTCCTAAGCCCTGAAAGGCAGTCCATCTTCGCAGACGCTAAGGTG CGTATGAG ACATCG LWDERGIPGNFPQA CCCCGCCGCCTGAGGGTTATCAGGCCCCGCCGCCTGAGGGTTACCAGGC TCTGACAC CTCAACC SLSGQIRQICSKNLLS AACCATGAAAATGAGTCATAATAGAGATACCCCTTCTAACGGCGTGAAG AGGGGGG GATCCA EEERLQIAARMEAQV GGTACCTCAGTTAGACTGGGTACCTCGCTCGTCCGGTCTCCTGTGGGTG TGTTTAAG ATTTGG ASPSADEPARQVPTR AGGCAGGGGCCGTTCGTGAACGCGGTACTCACCCGTCGGAATCGGTTA GTGACCCT GTCGTG PVTPPRSPPVEPARR GCCAGGATTCCGACGCTAGTGTGAATGCTACTGGAGAGGGGAGTGTGA GTTGCGAG AAATCC PSIPSEETPDLGAVPS GGGAGCAGGCTCCCCTTTCTCCGCCGGGGGCGGAGGAGGCTACTGTCC GAAATGCG CCTCGAT EIDSADPNRSPSRGP CGACGCAACGCCGCACGCGCCACAAGTGGTCCAGGGAAGACCGAGTTG CAGCAAAA AATAAT RHLPAHNMSQSESE TGCTGTGGGAGTGCTTCGTCGCCAGCAAGCGCGAGGGGCCAGGCTATC GCCGGATG AATAAT DDVTDPDVSDQQRS TCAAGCGCCTCAAGCAGCTGTGGGACGAGAGAGGAATCCCGGGCAATT AGCCTTAG AA(SEQ DSLEPRDLLRNSSVES TCCCACAGGCTTCTCTTAGTGGCCAAATCCGGCAAATCTGCAGCAAAAA AACATCGA IDNO: TPGHPNQELSDTLLS CCTCCTCAGCGAAGAGGAACGGCTTCAAATCGCTGCGCGGATGGAAGC AGGCCAAC 1377) NYVPSEIDSDDPNQS TCAAGTTGCCTCTCCATCCGCTGACGAACCTGCACGCCAGGTTCCAACCC GACAACCT PRRGPRHLPAHDMS GACCTGTGACCCCGCCTAGGTCCCCACCTGTGGAACCTGCTAGGCGGCC GCCGAAA LSDSMDEETEPDLSD CTCCATTCCATCGGAGGAAACCCCCGACTTAGGCGCTGTCCCGTCTGAG GACTGGCG QQRSDLLELRDLLRN ATCGACTCTGCCGACCCCAACCGGTCGCCCTCACGTGGGCCTAGGCACT ACTAATCC SSVETTPKGHPSLRHL TACCGGCCCACAATATGAGCCAGAGCGAGTCAGAAGACGATGTGACAG AGTCAACC PEPKIRAAAFRVNSVI ATCCCGACGTGTCTGACCAGCAGCGGAGTGACTCGCTGGAGCCACGGG TCCTGTAG GKIHTNNITETNALIK ACTTACTCCGAAACTCCAGCGTAGAATCAACGCCAGGTCACCCTAACCA GTCACCGG AGADLAVRILEVQPR GGAACTTAGTGACACCCTACTGTCGAACTACGTCCCATCTGAGATCGAC CTGGTCAT PQRTQRKKDPPWKH TCGGACGACCCCAACCAGTCGCCGCGACGTGGGCCTCGGCACTTACCG GCTGAATC RLEKNIAEIRKHLSWI GCCCACGATATGAGCCTGAGCGACTCGATGGACGAAGAGACAGAACCT TCAGCTTA SEWRRGNLHDEEKK GACCTATCCGACCAGCAGCGGAGTGACTTGCTGGAGCTGCGGGACTTA ACCAGGCG TLLESRYRCLEVGLTN CTCCGGAACTCCAGCGTAGAAACGACGCCAAAAGGTCATCCTAGCCTAC TACGAACT LEDTLKQRLSAKRSKV GACATCTACCGGAACCGAAGATCCGAGCAGCTGCTTTCCGCGTCAACTC TACAGTTG RRFEARVAGFHQNQ AGTGATTGGTAAAATCCACACCAACAACATCACTGAGACAAACGCCCTC GAGGGTC LFNTNQKRLYQTLRG ATCAAAGCTGGAGCAGACCTCGCAGTGAGGATCCTTGAGGTACAACCC GAGCACCC EETSSDSPNAEESIRF CGACCGCAACGGACCCAGCGGAAAAAGGATCCTCCTTGGAAGCACCGC CTGATGGC WSDIWSKEVRHNNT CTCGAGAAGAACATTGCTGAGATCAGGAAACATCTCAGCTGGATCTCCG TGAAAAG AEWLHDVKEKNVAA AATGGCGCAGAGGAAATCTCCACGACGAGGAAAAGAAGACGCTGCTCG GACCATCA DPDLTITSQQLKKQLS AATCGAGGTACCGGTGCCTCGAAGTCGGCCTAACCAACTTAGAAGATAC AAGTCGAA KTKNWKAPGPDMV CTTAAAGCAGCGGCTATCTGCTAAAAGGAGCAAAGTTAGACGCTTCGAG GGTAGCCA QGYWIKTFTSLHSRIA GCCAGAGTAGCTGGATTCCATCAGAACCAGCTATTCAACACCAACCAGA ACGAGAA AQLNHCLQRGTVPT AACGCCTCTATCAAACCCTTAGAGGAGAAGAGACGAGCTCTGACTCACC GACCAACG WMTTGKTVLIQKDK CAACGCAGAAGAGAGCATCCGATTCTGGAGTGACATCTGGAGCAAAGA GCTGATTC AKGTEVSNYRPITCLP GGTACGGCACAACAATACCGCAGAATGGCTCCACGACGTCAAGGAGAA AGGCGGA LMWKVLTGIIYERVY GAACGTCGCAGCAGATCCAGACCTTACAATCACCAGTCAACAGCTCAAA AGAGTCAA QHLDSKKLLPDEQKG AAGCAACTGAGCAAGACCAAGAACTGGAAAGCACCCGGACCAGATATG CTCGTTGA CRRNTRGTKDHLLVD GTGCAAGGCTACTGGATCAAAACTTTCACATCGCTGCACTCCAGAATTG ATGCGTTC KLLTKDARSKKKNLS CTGCACAGCTCAACCACTGCCTGCAACGCGGAACCGTGCCCACCTGGAT GACAGCTT MAWVDYKKAFDMV GACCACAGGGAAGACGGTCCTAATCCAGAAGGACAAGGCAAAGGGCA GGGGTAG PHSWILECLDIYGIAG CGGAGGTCAGCAATTACCGCCCAATCACGTGCCTCCCCCTTATGTGGAA ATGGAACT NIRNLIATTMPNWKT GGTGCTGACGGGCATAATCTACGAGAGAGTCTACCAACATCTAGACTCA CCTAAGCC QLTSANKHLGEVSIKR AAGAAGCTCCTACCCGACGAACAGAAAGGATGCCGCAGGAACACCAGG CTGAAAGG GIFQGDSLSPLLFVLT GGAACGAAGGACCATTTGCTCGTCGACAAGCTCCTCACGAAAGATGCCA CAGTCCAT MIPLSETLNKAGQGY GGAGCAAAAAGAAAAACCTAAGCATGGCTTGGGTTGATTACAAAAAAG CTTCGCAG NYSRTMKLNHLLYM CCTTCGACATGGTCCCTCACTCCTGGATCCTGGAATGCCTGGACATCTAC ACGCTAAG DDLKLYAKSKDQVEQ GGCATTGCCGGCAACATCCGCAATCTGATCGCGACGACCATGCCCAACT GTGCCCCG LLNIVHQYSQDIKMQ GGAAGACCCAGCTCACAAGCGCCAATAAACACCTCGGCGAGGTGAGCA CCGCCTGA FGVSKCGVLNIERGE TCAAGCGGGGCATCTTCCAGGGCGACTCCCTATCTCCACTGCTCTTTGTC GGGTTATC VTASEGITIEEGTIKDI CTGACCATGATCCCCCTCTCCGAGACCCTGAACAAGGCCGGCCAGGGCT AGGCCCCG EEAGYKYLGVMEYNT ACAACTACAGTCGTACCATGAAGCTCAACCACCTGCTTTACATGGACGAT CCGCCTGA ILHRTMKDSIRKEYLT CTGAAGCTCTACGCCAAATCAAAGGACCAAGTAGAGCAGCTCCTTAATA GGGTTACC RLRLILKSHLNGGNTI TAGTCCATCAGTACTCCCAAGACATCAAGATGCAGTTCGGTGTTAGCAA AGGCAACC KAINTWAVPVVRYSA ATGCGGAGTCCTTAACATCGAAAGGGGAGAGGTTACTGCGTCCGAGGG (SEQID GIINWTKKDCTDMDI AATCACCATCGAAGAAGGTACCATCAAGGATATCGAGGAGGCAGGTTA NO:1254) KTRKLMTIYRALHPRS CAAGTACCTGGGTGTCATGGAGTATAACACCATTCTCCACAGAACGATG CVDRLYINRREGGRG AAAGATTCCATTCGGAAGGAGTACCTCACCCGACTCCGGCTCATCCTGA LISVEDCVEAEKRALS AATCACACCTCAATGGGGGCAACACCATCAAGGCCATCAACACCTGGGC QHFRESDDPWARCL CGTCCCCGTAGTCCGATATTCTGCTGGCATTATTAACTGGACCAAGAAA VEAKLLKETETADQF GACTGCACCGACATGGACATCAAAACCAGGAAGCTCATGACCATCTATC KERRRLDRTNKWKS GAGCACTGCACCCAAGATCGTGCGTAGACAGACTCTACATTAACAGAAG MKMSGQYLEAVQD AGAGGGGGGCCGTGGCCTCATCAGCGTCGAGGACTGCGTCGAAGCGG KIVPDSWNWLLRGEL AAAAACGCGCTCTCTCCCAACACTTCAGAGAGAGCGACGACCCGTGGGC KRETEGTILAAQEQA ACGCTGCTTAGTCGAGGCGAAGTTGCTGAAAGAGACCGAAACGGCTGA LRTRYIQNKIDKRNVP CCAATTCAAAGAGCGACGGCGCCTCGATCGGACTAACAAATGGAAGAG STCRICRSSDETINHVI CATGAAGATGTCCGGCCAGTACCTCGAAGCTGTTCAGGACAAAATCGTA SECGVLAQKEYKRRH CCAGACTCTTGGAACTGGCTTCTGAGGGGTGAGCTGAAACGGGAAACA DKVARHLHWTLLRIH GAAGGCACCATCCTGGCAGCTCAAGAGCAAGCCCTCAGAACCAGGTAC NFPVSERWYEHEPAP ATCCAAAACAAGATCGACAAGAGAAACGTCCCTAGTACCTGTAGAATTT VVENEAVQIYWDKR GCAGGAGCTCGGATGAAACCATCAACCACGTGATAAGCGAATGCGGAG METDRVLHANRPDI TGCTGGCGCAAAAGGAGTACAAGCGCAGGCACGATAAAGTTGCCCGAC VVKDKQEKSAKLIDISI ACCTTCACTGGACCCTCCTGCGGATCCATAACTTCCCGGTCAGTGAGAG PFDSRIVDKEAEKKEK GTGGTACGAGCACGAACCCGCGCCCGTGGTTGAAAACGAAGCCGTGCA YRDLAIELQRLWQM GATCTACTGGGACAAGAGGATGGAAACTGATCGAGTACTTCACGCCAA KVDVVPVVIGALGA CCGACCGGACATCGTCGTCAAAGACAAACAGGAGAAGTCGGCCAAACT MSKNLKTALRELKCG GATCGACATTTCAATCCCCTTCGACTCCAGAATCGTGGACAAGGAAGCG HLHPGTLQKSALLGT GAGAAGAAGGAAAAGTACCGTGACCTTGCCATCGAACTCCAGAGACTG AHIIRKVL(SEQID TGGCAGATGAAGGTGGACGTCGTTCCAGTGGTCATCGGAGCTCTGGGG NO:1499) GCAATGTCCAAGAATCTCAAGACTGCCCTGCGAGAGCTGAAGTGCGGC CATCTCCACCCAGGAACCCTCCAGAAGAGCGCTCTGCTTGGAACAGCGC ACATCATTCGCAAAGTGCTCTGAGCCTTAGGTCGCGGGATGTGACCCGG CGCCAGAGTGTAGAGCTGAACATCGCTCAACCGATCCAATTTGGGTCGT GAAATCCCCTCGATAATAATAATAATAA(SEQIDNO:1131) R4 R4- CADV01009048 Bursa- GAGGATCCTGGGTTCCTACTACCCTGCTCCATCTCCTCGCGATGGATCCT GAGGATCC TAACAA MVFNNCKPKHLCPAI 2_BX phelenchus TGGGGAAGTCTCCGGACTGAGCTAAGAGAGCGTTAAAGTAGAGGGTG TGGGTTCC GTGTAA RPTGQQETNGGSEG xylophilus GCGGCGTAGTGACTTCCAAGTTGCGATGGGGCGGAACATCTACTCTTCT TACTACCC TAAAAA TAEPTAGPSRPAVSE GAGAGAGGGAAAGCCCTATGGCGGCGGTAGAAAGGTTGGGCTACGGC TGCTCCAT CCACCC DAAQPVPLFEEGEYI AACACTTGCCATGATCAGATTCGATCAAAATTAGCCTCTGGGGCTGGCA CTCCTCGC ATGCGT RAHRDKTCPYCEVL ACCCTACAACGGATTGTAAACTGAACTATGCTATGCAAAATGAAAATAA GATGGATC AGTAAA WIGARSSKARSDSW AAAATGGGGGCTTTACAATCTAAGGTGTTGGCAGATCACGAAAACTGCC CTTGGGGA CCGATC PLCQILYLMKRNDDL CGTCGATGAGGGTGAGATAACATCCTACGGAACAGCCCCTGCTGGACC AGTCTCCG AATTATC RGQRTRYPLLESSLRA AAACCAAATCATCCACAAATTGGAGGGTTTTCTTGGTAGTTTCCCTTGGC GACTGAGC TAGCAA AGIARTKFAIIKCIRNV ACGTCTTGTTTCATAAGCCAGAATAAAAACGATACCATACAGACATGAG TAAGAGA AATCGC LRDRYVPNGPYSEH GCTGGGTACCTGCCAGCCGCGACACGGAAAACCGGTAGGTGCAACCGG GCGTTAAA AGGTCA WKIYRANSGEVPQG AGACAGCTAGGGGAAAGAAAATAGTAAAGTGTCGAAAACAAAACAGG GTAGAGG GAAGAC ATITKGKRSARVAGL TAGCCCCTGGCTAGAGGGAATGGGACATTGTCCGATTAGGTTGCCTTGA GTGGCGG CAAAGA PSPSQSGHHTKRIQA CCCAATGAAAGCCAACCGGGTTTACATTATCGGTCTACCAAGGGCAATG CGTAGTGA ACCGAC GTGIETETTVTETNTT ACCAGAGAAACTGCGACTATGCCGTAACCCTTCGCAGATTGCCGATGAG CTTCCAAG CCAGAG PEVSHEHRDPCGEPE AACCATCGATCGTAAGTCGAAGCCAAGCGGATTGACCAGTGGAGGGTT TTGCGATG GAATAG TSAANVDKVTELTED ATCCCGACAACAGCATGGTATTCAATAATTGTAAACCTAAACACCTGTGC GGGCGGA GACCAG GSETRGTANVANGG CCTGCGATCCGTCCGACAGGGCAACAGGAAACGAACGGTGGTAGCGAG ACATCTAC AGCTGA VSVSDPGRKRQSSSQ GGCACTGCTGAGCCTACCGCTGGTCCTTCGAGACCGGCAGTGAGCGAG TCTTCTGA AACTCTC NRGNIETTNPELVG GATGCAGCCCAACCGGTGCCACTGTTCGAAGAGGGCGAATATATAAGG GAGAGGG AGATAC MWEDMFGVQLDGA GCTCACCGCGATAAGACTTGTCCTTATTGCGAAGTCCTGTGGATAGGAG AAAGCCCT GCCAAC MRTTERPRLPKLKHL CGCGGTCTTCAAAGGCCCGCTCCGACAGCTGGCCATTGTGCCAAATCTT ATGGCGGC GGTCCT SEPERLWIRAKLEQA GTACTTAATGAAGCGCAACGACGACCTCAGAGGTCAAAGGACGAGGTA GGTAGAA AATAAA WLQCVSYDVEQQW TCCACTGCTCGAGAGTTCTTTGAGAGCAGCCGGAATTGCCCGTACTAAG AGGTTGG ACGTCG LNANAVLYAAIRSVA TTTGCGATTATCAAGTGTATTCGGAACGTGTTGAGAGACCGGTATGTGC GCTACGGC TTAAGT ASRPCKEAREAQKT CCAACGGCCCATACTCTGAGCATTGGAAAATTTACCGCGCGAATAGCGG AACACTTG AAAGCA WLDNKKKDEAKLRRL GGAAGTTCCTCAGGGAGCGACCATAACAAAAGGGAAGAGGTCCGCCAG CCATGATC TCGTTAA IGRISSVHSMPKGDR AGTTGCGGGGCTTCCCTCCCCAAGCCAGAGCGGTCACCACACGAAACG AGATTCGA GTACAA TPREKKLVKNITKLKN AATACAAGCTGGAACTGGAATTGAAACGGAAACTACGGTGACTGAGAC TCAAAATT AACAAA THYPDMDWGGLLN TAACACAACACCCGAGGTTTCGCATGAACACAGAGATCCCTGCGGGGA AGCCTCTG AGCACT HFKVKLSQLKEKISVR ACCAGAGACGAGTGCTGCTAATGTAGATAAGGTAACGGAATTGACGGA GGGCTGG GTAAAC VAEHKRKVNRNAAG AGATGGGTCTGAAACCAGAGGAACTGCGAACGTCGCTAACGGTGGGGT CAACCCTA GCGAGG QYGKSVAGSAGLAP TTCTGTTAGCGATCCTGGTCGTAAAAGGCAGAGTTCAAGCCAAAATAGA CAACGGAT CCCCCTC DVVSATAYWSGLAQ GGAAATATCGAAACGACTAATCCAGAATTAGTCGGTATGTGGGAAGAC TGTAAACT TTTGCCA PGPKKFKASSPIFQT ATGTTTGGAGTGCAGCTTGATGGAGCGATGCGCACAACAGAGCGACCA GAACTATG AACTCC WKDDVAKNLNTEPV AGGCTACCCAAACTGAAGCACCTGTCCGAACCCGAGAGGTTATGGATCA CTATGCAA GGTAAT LLYPIIKECIRKPSPWK GGGCGAAGTTGGAACAGGCTTGGCTGCAGTGCGTTAGCTATGACGTCG AATGAAAA CCTCGTA APGPDGIYNYYWQQ AACAGCAATGGCTTAACGCCAACGCGGTTCTATACGCCGCCATCAGGTC TAAAAAAT GTACGG EFVAQWIQTLVKRTL AGTCGCGGCTAGCAGACCTTGTAAAGAGGCCCGCGAAGCTCAAAAAAC GGGGGCT TGCTTTT DIGRFPTALMCGRTV ATGGCTGGATAATAAAAAGAAGGATGAGGCGAAATTGAGGCGGCTGAT TTACAATC TCCCGCT LLFKSGDKSMPQNYR CGGGAGAATCAGCTCTGTCCACAGTATGCCGAAAGGCGACCGTACGCC TAAGGTGT TCACGA PITCLNGCFKITNAVL ACGAGAAAAGAAACTCGTGAAGAATATAACTAAGTTGAAAAACACCCA TGGCAGAT ATCAGA TKVILQRVQDTCALP CTATCCAGACATGGATTGGGGTGGACTTCTGAACCATTTTAAGGTAAAG CACGAAAA ACGCTG REQMALKPKVWSC TTGTCGCAACTTAAAGAAAAGATTAGCGTGCGGGTGGCGGAGCATAAG CTGCCCGT CCAGAT MEAQLRDQALQSEI CGGAAAGTTAATCGTAACGCAGCAGGACAGTACGGCAAAAGTGTGGCC CGATGAG CTTGTCC GDDCKTAWIDFSKAY GGCAGCGCAGGGTTAGCGCCGGATGTCGTTTCCGCGACGGCATACTGG GGTGAGAT GACATG DSLDHDALRFVIQTIA TCAGGTCTTGCACAACCTGGGCCAAAGAAATTTAAAGCAAGCTCGCCGA AACATCCT GGCCGT LPAGMEEYLLKSLDS TCTTCCAGACTTGGAAAGACGACGTAGCGAAGAACTTGAACACTGAGCC ACGGAACA AGGGTT WRTQLVLSDAGKVV GGTACTGCTGTATCCGATCATAAAAGAATGCATCAGAAAGCCAAGCCCC GCCCCTGC TGGGAG SGKPYPIKRGVLQGD TGGAAAGCACCAGGCCCTGACGGGATATACAACTATTACTGGCAGCAA TGGACCAA TACCAG SLSPALFVLTTSPIVAH GAGTTTGTGGCTCAGTGGATCCAAACCCTTGTGAAGAGAACCCTGGATA ACCAAATC CGTGGG LQRTCPTGRIQLYMD TAGGGCGGTTCCCGACTGCACTTATGTGTGGGAGGACGGTACTACTCTT ATCCACAA GCGGAG DIKLYGKTESDLCMLI CAAAAGCGGTGACAAATCTATGCCTCAGAACTATCGACCCATTACATGC ATTGGAGG AGCGTA KETQRVANKLGLNIN CTGAATGGATGCTTTAAGATTACCAATGCGGTATTGACGAAAGTAATCC GTTTTCTT CCTGGG LKKCALFGKSIKQSIA TTCAACGGGTTCAAGACACTTGCGCTCTTCCCAGAGAACAAATGGCTTT GGTAGTTT TACACT GFDPLGDRTYKYLGIP GAAACCAAAGGTGTGGTCGTGCATGGAAGCGCAGTTGCGCGACCAGGC CCCTTGGC GCATAA QRDVADIKQAYDELK ACTTCAGTCTGAGATCGGTGACGATTGTAAGACTGCATGGATCGATTTT ACGTCTTG TCGGGT AKTVQTIGETMACDY TCCAAAGCATACGATAGTCTGGACCACGACGCCTTAAGATTTGTAATCC TTTCATAA CTCAGA LTTRQVINRLNSKIPP AAACGATTGCACTGCCGGCAGGGATGGAAGAGTACCTTCTGAAATCCCT GCCAGAAT AAACCA VVRFVTQSALCSAPM GGATAGTTGGAGAACGCAACTTGTGCTATCTGATGCAGGTAAGGTTGTC AAAAACGA CCTATG TRGLYNKITELDNVSR TCCGGAAAGCCCTACCCAATTAAAAGGGGAGTGCTCCAAGGAGACAGT TACCATAC GTTTATT AELRKVLIYKATNVSR CTGAGTCCCGCGCTATTCGTCTTAACGACGAGCCCAATTGTCGCTCACCT AGACATGA ATTCTGT FYLATKEGGFGYASL TCAGAGAACTTGTCCCACGGGTCGTATACAGTTGTATATGGATGATATT GGCTGGG CTCCCAT QQVFVEAVVSRAIYC AAACTCTATGGCAAGACAGAGTCGGACCTGTGTATGTTAATAAAAGAAA TACCTGCC CTGCAG LRAPSLCDIREFILSKF CTCAACGAGTCGCAAACAAATTGGGCTTAAACATTAATTTGAAAAAATG AGCCGCGA GGTAGC DPVKVARIALARSKID TGCCCTCTTTGGTAAAAGTATAAAACAGAGTATTGCCGGGTTTGACCCG CACGGAAA TTTTCGT MDIERMDMASATRT CTCGGCGACAGAACATACAAATATCTCGGCATTCCCCAAAGGGATGTTG ACCGGTAG TGGGCC IRQHYQAKWKTLFQ CAGATATCAAACAGGCATATGACGAACTCAAAGCTAAAACGGTGCAGA GTGCAACC ATAGGA QSKLYQKWVQHKIDI CCATCGGCGAAACGATGGCGTGTGACTATCTAACAACAAGGCAAGTTAT GGAGACA GCCTAG PNSSRWLQRGEISPR CAACAGATTAAATTCGAAGATACCGCCTGTAGTCAGATTTGTCACACAAT GCTAGGG GGGCAA NCRIAVAVQDNTLLC CTGCGCTGTGCAGCGCGCCTATGACCAGAGGATTATATAATAAAATCAC GAAAGAA GAGTGC RGFVGSKDPNKQCRL GGAATTGGATAACGTGAGTAGAGCAGAGTTAAGGAAAGTTCTTATCTAT AATAGTAA ATGTAG CNAGIETASHIVTECS AAAGCGACCAATGTCTCTCGGTTCTACCTGGCCACGAAAGAAGGGGGA AGTGTCGA TCTTCAA THRVHMYIERHDSV TTCGGCTATGCAAGCCTGCAGCAGGTATTTGTTGAAGCTGTTGTGAGTA AAACAAAA CGGCCA ARNIYAVLAKNCGF GAGCGATCTACTGCCTGAGAGCTCCCAGTCTGTGTGACATACGGGAGTT CAGGTAGC TGCCAG WIPHYSQKIPTVKITK CATATTGAGTAAGTTCGACCCTGTCAAGGTTGCTCGCATTGCGTTGGCA CCCTGGCT GGAAAC SYELYWNYKFPCTQA AGGAGTAAAATTGACATGGACATCGAACGGATGGACATGGCCTCGGCC AGAGGGA TTGTGA LEACRPDIVLIDRAKK ACGCGTACCATCAGACAACATTATCAGGCAAAGTGGAAAACTCTGTTCC ATGGGACA GAGGTG RILVVEVAVSYVTRLE AACAATCGAAACTTTACCAAAAGTGGGTGCAGCATAAGATCGATATACC TTGTCCGA AGGGAT QMTQRKLYKYGVNG CAACTCGAGTAGGTGGTTGCAGAGGGGGGAAATCTCGCCGCGAAACTG TTAGGTTG AACTAG EYQADGETRGWNIC TCGAATCGCGGTAGCTGTTCAAGATAACACTTTACTGTGCAGAGGATTT CCTTGACC CATCAG RELVQKYNMRIDLCI GTGGGGAGTAAGGACCCAAACAAACAATGTCGGCTATGCAATGCTGGC CAATGAAA ATAATAT VVIGACGEILPCMVK ATTGAGACCGCTAGTCACATCGTGACTGAATGTAGTACACACCGGGTTC GCCAACCG CAGTCA EIEKISKVSGRQLLERC ACATGTATATCGAACGTCACGACTCTGTGGCGCGAAATATATATGCTGT GGTTTACA TGAAAT QRSAVLGTVRTVRRH CCTGGCAAAAAATTGCGGTTTTTGGATTCCTCATTACTCACAGAAAATCC TTATCGGT TAGTAA LAN(SEQIDNO: CAACAGTAAAGATTACGAAGTCATATGAGTTGTATTGGAATTACAAGTT CTACCAAG CAACCA 1500) CCCGTGCACGCAAGCACTGGAGGCTTGTCGTCCCGATATCGTTCTCATT GGCAATGA ACGTTC GACAGAGCGAAGAAAAGAATCCTCGTGGTGGAAGTGGCGGTCTCGTAT CCAGAGAA ACCGTC GTCACACGGCTAGAGCAGATGACGCAGAGGAAACTCTATAAATACGGG ACTGCGAC GTTGGC GTGAACGGCGAGTACCAGGCTGACGGGGAAACCCGTGGCTGGAACATT TATGCCGT AAAACA TGTCGTGAACTTGTACAAAAGTACAATATGAGAATTGATCTCTGCATAG AACCCTTC CCGACT TCGTAATTGGGGCATGTGGAGAAATACTCCCTTGCATGGTTAAAGAAAT GCAGATTG AACGAT TGAAAAAATATCCAAGGTCTCTGGTCGGCAGCTCCTAGAACGGTGCCAG CCGATGAG GCTAGT CGTAGCGCTGTACTTGGAACTGTGCGGACCGTAAGGAGACACTTGGCA AACCATCG TAGAAA AACTAACAAGTGTAATAAAAACCACCCATGCGTAGTAAACCGATCAATT ATCGTAAG GAGTCG ATCTAGCAAAATCGCAGGTCAGAAGACCAAAGAACCGACCCAGAGGAA TCGAAGCC GGTCTT TAGGACCAGAGCTGAAACTCTCAGATACGCCAACGGTCCTAATAAAACG AAGCGGAT CCCAAA TCGTTAAGTAAAGCATCGTTAAGTACAAAACAAAAGCACTGTAAACGCG TGACCAGT GTTAGG AGGCCCCCTCTTTGCCAAACTCCGGTAATCCTCGTAGTACGGTGCTTTTT GGAGGGT TGCTTG CCCGCTTCACGAATCAGAACGCTGCCAGATCTTGTCCGACATGGGCCGT TATCCCGA CACCGA AGGGTTTGGGAGTACCAGCGTGGGGCGGAGAGCGTACCTGGGTACACT CAACAGC AGCCGA GCATAATCGGGTCTCAGAAAACCACCTATGGTTTATTATTCTGTCTCCCA (SEQID TCCGCTC TCTGCAGGGTAGCTTTTCGTTGGGCCATAGGAGCCTAGGGGCAAGAGT NO:1255) TACCCAC GCATGTAGTCTTCAACGGCCATGCCAGGGAAACTTGTGAGAGGTGAGG AGCTCT GATAACTAGCATCAGATAATATCAGTCATGAAATTAGTAACAACCAACG GCCCAG TTCACCGTCGTTGGCAAAACACCGACTAACGATGCTAGTTAGAAAGAGT CGTT CGGGTCTTCCCAAAGTTAGGTGCTTGCACCGAAGCCGATCCGCTCTACC (SEQID CACAGCTCTGCCCAGCGTT(SEQIDNO:1132) NO: 1378) R4 R4- ABLA01000389 Heterodera TGTGGCGATACTCGGAACCTCCGGGGAGCCTGGTAGGAGTTGGCCTAC TGTGGCGA TAGTCG MRKSFLQHIPELSSHI 2_HG glycines AGGTCGCGAAAGTCCCTAGGTGCTGCACGGGTTGCGCTAATCCGAGCC TACTCGGA TAGCCC AMSVPARNYPKMCS CCTTCCGGTGTTGGCTAGCACTGGAGGGGGGGGGGTCCAAAGAATACG ACCTCCGG AGATGT LAQGSSGTLSHNGKG TGAAGCGGACCCTGTGTTTTCTCTGACCTCATACGGGTATATCCCGATGA GGAGCCTG CGACAG VAMHRCPSDDCAGK TATCTATACAGTGTTCATTTTCATTTCTTAACCTGCTTTTTCCTCAAGAGA GTAGGAGT CCCCTAC DPPQRGSQKGNLRS AATATGACCACATTCGTTTGCCATTTCGAAGGTGCGACTTTTGAGGGAG TGGCCTAC ACCGGT VRWTPSEEKAVFEY TTCTCAGGGACATCAAGCTGTTCATGGACGGCTTGACGGCTCGTGGCGG AGGTCGCG AGATGG WSRLEQHAMLNGSE CACGCAGCCGCGCGGGCCAGTTAGGGCTGGTGCCAATGAGGCAGCGA AAAGTCCC AAGGAG ARGTCAITRSQFLIH AGAAGTCTCTGAAACGCCAAAAGCAGCGGGAGAAGAGAGAGAAAGCG TAGGTGCT GTATGT WDGERESRSLSDGV GGGACGGGTTAGAGATAATAGCATGATTGACGGCGAAGCAATACTTTG GCACGGGT ATCTAA PEYPMRTERAYYERV ACTTCCAGGTACGGAGGACGGCGGCAAAAAGCAAAAAGGGGACGAAA TGCGCTAA GCCCAC RLLRQRGWQWDCA CCGATGGCATTGCCAAAGGGGCCGAAACCATTGAGCTGGACGCTGATG TCCGAGCC GGCAAG NECLVIGQCAEPCRK GGCCGTCTGCCGCGCAAAAGGAGGGGGGAGCCCCTCCGTCTGGGAAG CCTTCCGG CCACCA PNVVAIKADKGMKR AAGGTGCGGCGCCATTCGTCCAACAAACCGGAATAGATCTTGATGATCT TGTTGGCT GTGGAA SLVKGKLLSLPHVMG CAGCAATTGATCATATTGATCTCTTTTTCCAGTTGTTGCGATAAATTATCG AGCACTGG ACGGTG EINQVSVQVEVPLPS TGCATTATTTCTTCGATTTCTCCAAAGCTTAACTCCTTTCCTAATACCTACT AGGGGGG ACCACCT VPASVPQVEGVESKG CATGTATACGTTAGTAGGCATGTTTTATGCAGGTAAAGATAGACCTGGT GGGGTCCA CTGTTC FTETEPSNKPSLEGNP GCCCCCGGGCGACTTGGGATGTGGATGATGGTTGGCGGAGAGTTCTGA AAGAATAC GCGGAA AEEGLRKPERVNVPV TGACGCCGTAGTTCCGGAGGAACAGTTCCTACTAGTGCCAGCCTGGGCT GTGAAGC ATGCCC HGIISDSERKDLKDRF AGTGGAGCTGGTCTGGCGAGTGCTTTGCTCGTCAGATAAGGGGTGGTT GGACCCTG CGTAGA WSAYKTAKRSVGFRP GGGGTGGTACCTCTGGAATGATGTGTCGGATAAGCACTCTGCCTTATAA TGTTTTCTC GTATAT ALKIEPNRVNRAQW AGCTGTCGTTCTGCCCAGGTTTCTCCTAACCAGGTTGAACTTGTAATGAG TGACCTCA GAAGGT EVLDSCVVEVLKKRET CCTTTGGGTATCTGGTCGGGGTCTGCGCGGAGATAGCTCGTGAGGAGC TACGGGTA CGGAGC SNGYRGCVLRHLNV TGTTATTACATTATTGGTTGAACGCTTTTGGGTCGTACTGCACAGATAAA TATCCCGA ATTGAA AVYAAGYVLAEGNKE TATGAGAAAAAGTTTTTTACAACATATACCTGAGCTTTCAAGTCATATTG TGATATCT GCACAA RRQVIRRQSAEWLLR CTATGTCTGTGCCTGCGAGAAATTATCCCAAAATGTGCTCTTTAGCACAG ATACAGTG GTGAGA QKSEINNIRRHIGWIT GGTTCTTCCGGCACATTAAGTCATAATGGGAAGGGTGTCGCAATGCACC TTCATTTTC ACCCTG DELTRRRTGKNPTSR GGTGCCCCAGCGATGATTGTGCTGGTAAAGATCCCCCTCAGCGGGGGT ATTTCTTA GAAGTA QLSNFAWLQRRYQV CTCAAAAAGGGAATTTGAGGAGTGTGCGATGGACTCCCAGCGAGGAGA ACCTGCTT TGGTGG LGKPVRETRDLEVQR AAGCTGTCTTCGAGTATTGGTCACGGTTGGAACAGCATGCCATGTTAAA TTTCCTCA TGAGCT ERLVSRLRLAQDRINS TGGTAGCGAAGCCCGGGGGACCTGTGCCATAACTAGAAGTCAGTTTCTC AGAGAAAT GGTAAC SMDREERVRKRMLP ATTCATTGGGATGGCGAGAGAGAGTCAAGATCCTTGTCGGATGGTGTG ATGACCAC CTCGAA LRRKLEEPLGDSKLDT CCCGAGTACCCAATGAGGACGGAAAGGGCGTACTATGAAAGGGTCCGA ATTCGTTT TTCCTCC KQARTFWASLIGERK CTCTTGCGACAACGCGGATGGCAATGGGATTGTGCTAATGAATGCTTAG GCCATTTC TATGGG EFGKIPELENWAEEV TAATTGGCCAATGTGCAGAGCCATGCCGTAAGCCAAACGTGGTAGCGA GAAGGTG TGCTTG RSKVTDGQGFASDH TTAAGGCTGATAAGGGTATGAAGAGATCGCTGGTTAAGGGAAAGCTTT CGACTTTT CGCCCG VDQTVWKKILGKAR TGTCCCTGCCCCATGTAATGGGCGAAATAAACCAAGTGAGCGTGCAGGT GAGGGAG TAGGTC PLKAPGPDGIPNLLW TGAGGTGCCTTTGCCAAGTGTACCCGCCAGCGTGCCCCAGGTGGAAGG TTCTCAGG ATTTGTG KRLPSANQALFKWL AGTTGAAAGCAAAGGTTTTACTGAAACGGAACCGTCGAATAAACCTTCC GACATCAA TATGTA MGIKRKQLSVPSWLT CTGGAAGGGAATCCGGCGGAAGAAGGATTAAGGAAACCGGAGCGAGT GCTGTTCA GGGATG KGRVVLLPKGGDPVD AAACGTACCTGTTCATGGGATAATCAGTGATAGTGAACGGAAAGACCTG TGGACGGC GAAGGA PANYRPIACLNTQYK AAAGACCGGTTTTGGAGCGCCTATAAAACGGCCAAGAGGTCTGTGGGT TTGACGGC ATCTCG LVTGMVTAWVSEHL TTTAGACCGGCTCTAAAAATCGAACCGAATAGGGTTAACCGTGCCCAGT TCGTGGCG ATAGCG TTYSILPIEQRAMVSG GGGAAGTACTGGATTCCTGCGTTGTTGAGGTGCTCAAAAAGAGGGAAA GCACGCAG CAAATC TWGCTHAMVIDRAI CCTCCAATGGGTACCGCGGTTGCGTCCTCCGGCATTTGAACGTTGCAGT CCGCGCGG GGGATA TSYAEATGLPLYVGFV TTATGCGGCGGGATACGTGCTGGCGGAAGGCAACAAGGAAAGGAGGC GCCAGTTA CCGCAC DFAKAFDSVSQPWIR AGGTGATCAGACGCCAGTCGGCGGAATGGTTGCTTCGCCAAAAGTCCG GGGCTGG TGGCGA YALKVAGVHKRIRCLI AAATCAACAACATCCGTAGACACATCGGGTGGATTACCGATGAGTTAAC TGCCAATG TCATGCT GILMKCWSVRYEVFK ACGGAGGAGGACGGGAAAGAACCCAACATCCAGACAGTTGAGTAATTT AGGCAGC GCCAGT SGRVLRSAPLAVKNG TGCATGGTTGCAAAGGCGATACCAGGTTTTGGGTAAACCCGTGAGGGA GAAGAAG GGGCCC VLQGDTLSPLLFCLSV AACCCGGGACTTGGAAGTCCAACGAGAAAGACTTGTAAGTCGTCTTCGA TCTCTGAA TGCTGA AVVSSAVGSLFDFEV TTGGCTCAGGACCGAATTAACTCTTCCATGGATCGTGAAGAGCGAGTTC ACGCCAAA GTTGGG TIPGRGVMQQQNHL GTAAACGAATGTTACCCCTGCGGAGAAAGTTGGAAGAACCTTTGGGTG AGCAGCG CGGGTT FYMDDFKGFAPSEAS ACTCTAAACTTGACACAAAGCAGGCACGAACATTTTGGGCCAGCCTGAT GGAGAAG AGCAAC LTRMLVTLERTASAL TGGAGAGCGAAAGGAATTTGGAAAAATCCCGGAGCTTGAAAACTGGGC AGAGAGA CCGTCC GLKINKRKCALVHPR CGAAGAAGTGAGGAGCAAAGTCACCGATGGACAAGGTTTCGCAAGTGA AAGCGGG AGGGAC ERENEETGSDIPVLGL CCATGTGGATCAAACTGTTTGGAAAAAGATACTGGGTAAGGCCAGGCC GACGGGTT GCAGAT RDTYKYLGIEERFGIV ACTGAAGGCGCCCGGACCAGATGGAATCCCCAATTTACTTTGGAAACGG AGAGATAA GCCAAA FEDAWDRVRTKMFE TTACCATCCGCAAACCAGGCGTTATTTAAGTGGTTAATGGGGATCAAGC TAGCATGA AGCCTG RMRTLLCTEHTFGEL GGAAACAGCTCAGTGTCCCCAGTTGGCTCACAAAAGGAAGGGTGGTCC TTGACGGC GTGACA RAAFASTIAPVARYLF TCCTACCCAAAGGAGGTGACCCGGTAGACCCGGCCAACTACCGACCAAT GAAGCAAT CTCTAAA LNVIVGGPSWSETLT CGCTTGTTTGAACACGCAGTACAAACTAGTGACTGGTATGGTGACGGCA ACTTTGAC GGGAGT KAKDMDLRIRRLLWE TGGGTATCGGAACACCTGACCACATACTCTATTTTGCCGATTGAACAAC TTCCAGGT CTACGG RRDNEPGWRFKHCS GAGCGATGGTGTCTGGCACTTGGGGGTGTACCCATGCTATGGTGATTG ACGGAGG CGACGA ADRLYLRVQYGGLGF ACAGAGCAATTACGTCTTATGCAGAGGCCACTGGACTACCGCTTTATGT ACGGCGG GAACAT VSVEDTLSESIIYCWA AGGCTTTGTGGACTTTGCGAAGGCTTTCGACTCAGTTAGTCAACCGTGG CAAAAAGC CCCCTAA YVQCRPELELARELFG ATCAGGTACGCTCTAAAGGTCGCTGGAGTTCACAAGCGAATCCGATGTC AAAAAGG GTC TLNRSARSGIKQSIAK TTATTGGAATCCTTATGAAATGTTGGAGCGTAAGGTATGAAGTGTTCAA GGACGAA (SEQID GARKVFRSYALLSKNS GAGCGGGAGAGTGTTAAGGAGTGCGCCTTTGGCAGTTAAGAACGGAGT ACCGATGG NO: AQRVSDLDGDASPG GTTACAAGGAGACACGTTGAGCCCGCTGCTCTTTTGTCTCTCAGTAGCC CATTGCCA 1379) FRVGEMIFMEPTRG GTTGTTTCTTCTGCAGTAGGCAGCCTCTTTGATTTTGAGGTCACCATACC AAGGGGC ARAIVKILRKENDSRR TGGACGAGGAGTGATGCAGCAGCAGAACCACTTGTTCTATATGGATGA CGAAACCA LAAWKGRPMGGRV CTTTAAAGGGTTTGCTCCTAGTGAAGCGTCCTTGACTAGGATGTTAGTA TTGAGCTG VSLPELDQVHSYHWL ACATTGGAGCGAACCGCTTCAGCCCTCGGGTTGAAAATCAATAAGAGG GACGCTGA IRARIGRRSFRDCIAA AAATGCGCTCTGGTACACCCCCGTGAAAGAGAGAACGAGGAAACTGGA TGGGCCGT QEGQLKARELMCPHI AGTGATATCCCGGTCCTTGGTCTACGAGACACTTATAAATACCTTGGAAT CTGCCGCG NAKAKWCRRCGDG CGAAGAGAGATTTGGTATCGTTTTCGAGGATGCGTGGGACCGAGTCCG CAAAAGG RVETEQHILSGCAWS TACCAAGATGTTTGAGAGAATGCGCACCTTGCTATGCACGGAACATACC AGGGGGG RTGTMLDRHNGVVR TTCGGGGAGTTGCGTGCGGCCTTCGCTAGTACGATCGCCCCAGTAGCGA AGCCCCTC QVHTALCRKYGLPVS GGTACCTCTTTCTCAATGTAATCGTTGGTGGGCCTAGTTGGAGTGAAAC CGTCTGGG SHVVPLHAVIENEHA CCTGACGAAAGCTAAGGACATGGACCTGCGTATCCGACGCCTCCTGTGG AAGAAGG KILYDVALHTSPAGVL GAAAGAAGAGATAACGAGCCGGGATGGCGGTTTAAGCATTGTAGTGCT TGCGGCGC PREDGSTSYTGLRSTR GACCGCTTGTACCTCCGCGTACAGTACGGTGGACTTGGCTTCGTGTCCG CATTCGTC PDMVIFDKKARTILIV TAGAGGACACTTTGTCTGAGTCTATCATTTACTGTTGGGCTTACGTGCAA CAACAAAC EISVPWRENLVKQELI TGTAGGCCCGAGTTGGAGCTGGCTCGTGAACTTTTTGGAACCTTAAACA CGGAATAG KWRKYAINSMIEPLE GGAGTGCTAGAAGCGGTATAAAACAGAGCATTGCGAAAGGAGCGCGT ATCTTGAT LAEAEIPGPNLKHAL AAAGTGTTCCGAAGTTATGCCCTTCTCTCGAAAAATTCCGCCCAGCGAGT GATCTCAG GLAYGTSFPTVKVVPI ATCTGACCTTGATGGGGATGCCTCCCCAGGATTCCGAGTAGGGGAAAT CAATTGAT VVGSCGEVLPNITKRL GATATTCATGGAACCAACTCGGGGTGCCAGAGCAATCGTAAAAATCCTC CATATTGA SELGIPKRGIPSLLESI CGTAAGGAGAATGATTCGCGACGCTTGGCTGCCTGGAAAGGTAGACCT TCTCTTTTT QRAAIIGSGHVIRAHL ATGGGAGGTCGGGTCGTGTCATTGCCAGAGCTCGACCAGGTTCATAGCT CCAGTTGT SVPRSESET(SEQID ACCACTGGTTGATCCGTGCGAGGATTGGCAGGCGTTCGTTCCGAGACTG TGCGATAA NO:1501) CATTGCAGCACAGGAGGGACAGTTGAAAGCGCGGGAGCTTATGTGTCC ATTATCGT ACATATAAATGCGAAAGCAAAGTGGTGCCGACGATGTGGGGACGGTCG GCATTATT TGTTGAAACCGAGCAGCATATCCTTAGTGGTTGTGCTTGGTCACGGACT TCTTCGAT GGAACTATGTTAGATCGACATAACGGTGTTGTACGGCAAGTACACACG TTCTCCAA GCTTTATGCAGGAAATATGGACTGCCCGTTTCTTCTCATGTCGTTCCTTT AGCTTAAC GCATGCGGTAATCGAGAATGAACATGCCAAAATCCTGTACGATGTCGCG TCCTTTCCT TTGCATACTTCGCCTGCTGGCGTCCTCCCTAGGGAGGATGGGAGTACGA AATACCTA GTTATACGGGTCTGCGTAGCACCAGGCCCGATATGGTAATATTCGATAA CTCATGTA AAAAGCACGTACGATCCTTATTGTGGAAATTTCGGTGCCATGGAGAGAG TACGTTAG AACCTGGTTAAACAGGAGCTCATAAAGTGGCGGAAATATGCCATAAATT TAGGCATG CTATGATCGAACCATTGGAACTAGCGGAAGCTGAAATACCGGGACCTAA TTTTATGC CCTGAAACATGCGTTAGGCCTTGCGTACGGTACGAGTTTCCCAACAGTG AGGTAAA AAGGTGGTCCCGATAGTAGTTGGGTCGTGCGGCGAGGTATTACCAAAT GATAGACC ATAACGAAGAGGCTGAGTGAACTTGGGATACCTAAACGAGGAATCCCG TGGTGCCC AGTTTGCTTGAGTCGATACAGCGTGCTGCGATAATTGGTTCTGGACATG CCGGGCG TTATCCGGGCGCACTTGAGTGTACCTAGGAGCGAGTCCGAGACTTAGTC ACTTGGGA GTAGCCCAGATGTCGACAGCCCCTACACCGGTAGATGGAAGGAGGTAT TGTGGATG GTATCTAAGCCCACGGCAAGCCACCAGTGGAAACGGTGACCACCTCTGT ATGGTTGG TCGCGGAAATGCCCCGTAGAGTATATGAAGGTCGGAGCATTGAAGCAC CGGAGAG AAGTGAGAACCCTGGAAGTATGGTGGTGAGCTGGTAACCTCGAATTCCT TTCTGATG CCTATGGGTGCTTGCGCCCGTAGGTCATTTGTGTATGTAGGGATGGAAG ACGCCGTA GAATCTCGATAGCGCAAATCGGGATACCGCACTGGCGATCATGCTGCCA GTTCCGGA GTGGGCCCTGCTGAGTTGGGCGGGTTAGCAACCCGTCCAGGGACGCAG GGAACAGT ATGCCAAAAGCCTGGTGACACTCTAAAGGGAGTCTACGGCGACGAGAA TCCTACTA CATCCCCTAAGTC(SEQIDNO:1133) GTGCCAGC CTGGGCTA GTGGAGCT GGTCTGGC GAGTGCTT TGCTCGTC AGATAAG GGGTGGTT GGGGTGG TACCTCTG GAATGATG TGTCGGAT AAGCACTC TGCCTTAT AAAGCTGT CGTTCTGC CCAGGTTT CTCCTAAC CAGGTTGA ACTTGTAA TGAGCCTT TGGGTATC TGGTCGG GGTCTGCG CGGAGATA GCTCGTGA GGAGCTGT TATTACAT TATTGGTT GAACGCTT TTGGGTCG TACTGCAC AGATAAAT (SEQID NO:1256) R4 R4- CACX01002006 Strongyloides CGTCAGAAGAGCAGGTGTTTTTCAAAGCAAAGACTTATTCTACGAAGGG CGTCAGAA TAATTA MQKFSVPKDSSQIFL 2_SRa ratti GAAAGATGATCAAACATGCAGATTTGGCTGCAATGAAATAGAATCAAAC GAGCAGG GATTTAT VDSILNKHICSTKNKV TACCATGTGGTGACATCTTGCAAATACCATTCATACACAAGAAGACACG TGTTTTTC ATGTGC KDVIKRRSIIKTLICAA ATATGGTTGTGTACAATATATTAAAAAGACTAAAAGAAAAGTATAAGTT AAAGCAAA CACTTTA GLTLRKLVCGKLGNN AGATGGTGATTTACAATATGGTAGATCAGTATTTAATGGGAAGAAGGA GACTTATT CCTCAA KYNSKINQLWKKERK AAAAATTAGGATCAGATCTGGCCATAAGTTTTTGACTGAACAACCGTTA CTACGAAG AGATAT IINCIEDLKHLIETNKR GTAAACAATAAACCGGATATTGTTATAACGATGAAAGAAGGGAAAAAA GGGAAAG ATTAATA RHNFGKRLRNAKVSP AAGATAACGTACATACTAGAGATGTCGATACCACATATCTAGAACTTGA ATGATCAA CTAACTT SEMLKDYYNKLRYIK AGATGCAAGAAAGAATAAAGTATAAAAAATATTGTGTGAGCTCCATGGT ACATGCAG AATTATA NEITACLDEHKKAILR AAAAATAACGAATGATAATGTTGATTCGATAGCTCGTGATTTTAATCTAC ATTTGGCT ATTTATT TKFKLTPSIKIISNIQN TAAATTTAATGGAAAGAAAAGAAAGGTGTAAGATAAAGTTTGGATCATT GCAATGAA ATGAAT HNDEEAELPKEEEFV TGTATTTGGCTGTTATGGAGAATATGTCTTGACTGAATCTGCTCTAAGAA ATAGAATC TATAATA KYYKELFTNKDGDDK CACAGAAAATCCTGATTGAATTGGGATTCTCCAAAAAAGAGATGGATAG AAACTACC TAATAA ETPHLDNWLKKFSKT CTTGATCAAAGAGTGTTCTTACAGTTAAATGAATGAGACAGCAAGAATT ATGTGGTG GTTTTAA LIVDWTINDKEILEAL ATTATGAAACATCTAGAAGGCGAAAAGAATCAAGATGAAATCAGCCATT ACATCTTG AATAAA KYCGNFKAPGSDMV AATAACTTTTATTTACCCAATTATTTATTGCGTTAACTTGATTATATGATA CAAATACC A(SEQ MKVCYKWFKSAQNY TTTATTGTATATTTATTGTTTAATTTAGTTTCATAGTACAAATAAAAGAAT ATTCATAC IDNO: LIRWIKSTWYGEYTIN ATACTACGATTCTTTACTTTTAAGTTCTACGAGAACGTTGTTTTAGAATAT ACAAGAA 1380) KKDTNAVTFMIWKR TTTAATAATATTTCTACTACAATTAAGTAATTAGAAGAAATCAACGAAAG GACACGAT DGKPKNDVKSYRPIS CAGCTAAACTTACTCGCAAAATTCGTTGATCGAGGCTGGAATGGCACCA ATGGTTGT CLNCDFKLLNKLIANK ACTAATATACTAACCAACAATAGAAAAAAAAAGAAGAACTTCAGTGGAC GTACAATA IYESIEKILPINQMAVI TTCAAATAAAATAAAATTATTGGTAGAATTGTATGATAAAACTGAGAAA TATTAAAA KNKHGTCEALLLYKSL ATTTGACTAAAACAAAAAGATTAGAGCAAATATGTCATCATTTTCCTAAT AGACTAAA VQSMKFRRTKDVKEI CATACAATTAAAGCGATGATGACGAAATTGAGAGAATTTGAAAGAGAA AGAAAAGT WCSWIDFSKCYDSIS AAAAGGGAAAATTGTGAAATTAAAATGAGGGAAAATGAGGAAAAACCT ATAAGTTA HKCLKKMIQSIKAPPII GAGGAAAAATTAAATTTTGACAACTATGAAGAAGCAAAATTGAAAAGA GATGGTGA HKLILDGIDSWNISIC GGAATTAAGTGTAAAAAAGAAGTAAAACCAATTGTTATTGAAAACAAG TTTACAAT NGKNISKTKIPVKSGI GATTAGGAATTTTTGAAAACGGAAAAATTATTTCTCAAGTTTGTCAACAC ATGGTAGA LQGKVASSLYFVLLTG AATCAATTACCAAAGAAAGAAGGGTAAATCAACCAGAATGCAAAAATTT TCAGTATT EISYALNKEEQVPIETI AGTGTTCCAAAGGACTCTTCTCAAATATTCTTAGTGGATAGTATCCTAAA TAATGGGA TPSNTLKINHISFIDDY CAAACATATTTGCTCAACCAAAAACAAAGTTAAGGATGTCATTAAGAGG AGAAGGA QLYATSQKKVEKLTIK AGAAGTATTATTAAGACATTAATATGTGCAGCTGGTTTAACTCTTAGAAA AAAAATTA LREIAEEMNLKLNPQ GTTAGTTTGTGGGAAACTTGGAAATAATAAATATAATTCCAAGATAAAT GGATCAGA KCGIYGTDDLGKRLM CAACTTTGGAAAAAAGAAAGGAAAATTATTAATTGTATAGAAGATTTAA TCTGGCCA LKESSLNFPYTSEYKYL AGCATTTGATAGAAACGAATAAACGTAGGCACAATTTTGGGAAAAGATT TAAGTTTT GLVENSLDLKDINIQL AAGGAACGCCAAAGTGAGCCCTTCTGAGATGTTGAAGGATTACTACAAC TGACTGAA FKDKILSKYSTIFESRL AAGTTAAGATATATTAAAAATGAAATTACGGCATGCTTGGATGAACACA CAACCGTT TTHQKRKVFNSTISPC AAAAAGCAATTTTAAGAACAAAATTTAAACTAACACCAAGCATTAAGAT AGTAAACA AAYYLGNLITNKCSIQ CATCTCAAATATTCAAAATCATAATGATGAAGAGGCTGAACTCCCAAAG ATAAACCG ELLNECKKFDQMVR GAAGAAGAATTCGTCAAATACTATAAAGAACTATTTACAAACAAAGATG GATATTGT NQLVNQNIKKLQVS GGGATGATAAAGAAACACCACATCTTGACAATTGGTTGAAGAAGTTTAG TATAACGA NSRIYLPKEYNSLGLN TAAGACATTAATTGTAGACTGGACTATAAATGACAAGGAGATTCTTGAA TGAAAGAA EIEIEVAANIIRKACYIK GCTTTAAAATATTGTGGTAATTTCAAAGCACCGGGATCTGACATGGTAA GGGAAAA KRETLRGVDKLYIAM TGAAAGTTTGTTACAAATGGTTCAAATCTGCACAAAATTATCTAATCAGA AAAAGATA SKNGHRNTLSDALYIT TGGATCAAATCAACTTGGTATGGTGAATACACAATAAATAAGAAAGATA ACGTACAT KKYSNFQINWNIMG CAAATGCAGTCACATTCATGATTTGGAAAAGAGATGGTAAACCAAAAAA ACTAGAGA MVKDQNNILLDAKKI TGATGTCAAATCATATAGACCGATTTCTTGCTTGAACTGTGATTTCAAGT TGTCGATA IENIKEKRRNLWLEH TACTAAACAAACTTATTGCAAACAAGATTTATGAATCAATAGAGAAAATT CCACATAT WKKGNMTYANEAIK TTGCCTATAAATCAAATGGCAGTAATTAAGAATAAACATGGAACATGTG CTAGAACT KEFHLPDLNIDSKYL AAGCCCTACTCCTATACAAATCATTGGTTCAATCAATGAAATTTAGAAGA TGAAGATG MLCYAGSEEQIIYNG ACGAAAGATGTGAAAGAAATATGGTGTTCATGGATTGACTTCAGCAAAT CAAGAAA HVSLVNQSSPSSRLC GTTATGACTCAATAAGTCATAAGTGTTTGAAAAAAATGATACAAAGTAT GAATAAAG RKCNKLEETSYHVAS TAAGGCTCCACCAATTATTCACAAACTCATTTTAGATGGAATTGATAGCT TATAAAAA VCEFHKKNLHLMRH GGAACATATCAATTTGTAATGGGAAAAATATTTCCAAAACTAAAATACCT ATATTGTG NSAVYHIITELCRIMK GTTAAATCTGGTATTCTTCAAGGTAAAGTGGCTAGCTCTTTATATTTTGT TGAGCTCC VKCTLRYPEASGIIKS ATTATTGACTGGTGAAATATCTTATGCTTTGAACAAGGAAGAACAAGTA ATGGTAAA GNMKIAAGVKYTFG CCAATTGAAACGATTACACCAAGCAACACTTTGAAAATCAACCATATCA AATAACGA TAKIYHNKPDLVWYT GTTTTATAGATGATTATCAACTGTATGCTACATCACAAAAGAAAGTTGAA ATGATAAT PEVIYVIEVSISSLKNA AAACTAACTATTAAATTGAGGGAAATAGCAGAAGAGATGAACCTGAAG GTTGATTC KSQMKMKTARYAVN CTGAATCCACAAAAATGTGGCATATATGGCACAGATGATCTGGGAAAG GATAGCTC STKKLENFAALNNLK AGACTAATGCTGAAAGAAAGTTCATTAAACTTTCCATATACTTCCGAGTA GTGATTTT KGENFVEILSHKANFK CAAGTATCTTGGTCTGGTAGAAAATTCATTGGACTTAAAGGATATCAAT AATCTACT RVHFMPLVFCTFGEI ATCCAACTGTTTAAAGATAAAATACTATCTAAATACTCAACTATTTTTGAA AAATTTAA PKETMKYLEKLNFSN TCAAGACTCACTACACACCAGAAAAGAAAAGTATTTAATAGTACAATAT TGGAAAG EKIKTIASPIARYTGRT CTCCATGCGCAGCATATTACCTAGGTAATTTAATAACCAATAAATGTTCA AAAAGAA LKAHFTN(SEQID ATTCAAGAACTATTGAATGAATGTAAGAAATTCGATCAAATGGTCAGAA AGGTGTAA NO:1502) ATCAGTTAGTGAACCAAAACATCAAAAAGCTTCAAGTATCAAATAGTAG GATAAAGT AATCTACTTACCTAAAGAATACAACTCCCTTGGATTAAACGAAATAGAAA TTGGATCA TTGAAGTAGCAGCCAATATAATCAGGAAAGCTTGTTACATTAAAAAAAG TTTGTATTT AGAAACTCTTAGAGGAGTAGACAAACTATATATTGCAATGTCAAAAAAT GGCTGTTA GGTCACAGAAACACTCTTAGTGATGCTTTATATATAACTAAAAAATACTC TGGAGAAT TAACTTTCAAATAAATTGGAATATTATGGGTATGGTGAAAGATCAAAAC ATGTCTTG AATATTCTTCTGGATGCGAAGAAAATTATTGAAAATATCAAAGAGAAAA ACTGAATC GGAGGAACCTATGGCTGGAACATTGGAAAAAAGGAAACATGACATATG TGCTCTAA CAAATGAAGCAATCAAAAAAGAATTCCATCTCCCGGACCTAAACATTGA GAACACAG CTCCAAATACCTCATGTTGTGCTATGCTGGTTCAGAGGAACAAATCATTT AAAATCCT ATAATGGACACGTAAGTTTGGTTAATCAATCCTCACCGTCTAGTAGATTA GATTGAAT TGTAGAAAGTGTAACAAATTGGAAGAAACTTCATACCATGTGGCTAGTG TGGGATTC TTTGTGAATTCCATAAAAAGAATCTACATCTAATGCGTCACAATAGTGCA TCCAAAAA GTTTACCACATAATAACTGAACTCTGTAGAATCATGAAAGTCAAATGTAC AGAGATG GCTAAGATATCCAGAAGCATCAGGAATAATAAAAAGTGGAAACATGAA GATAGCTT GATTGCTGCTGGAGTAAAATACACTTTTGGAACAGCAAAAATATACCAT GATCAAAG AATAAGCCTGACTTGGTATGGTATACTCCAGAGGTAATTTATGTTATAG AGTGTTCT AAGTAAGTATTTCATCACTGAAGAACGCCAAATCTCAAATGAAAATGAA TACAGTTA GACTGCTAGATATGCAGTAAACTCAACAAAGAAATTGGAAAATTTTGCT AATGAATG GCTCTGAATAATTTAAAGAAGGGAGAAAACTTTGTTGAAATATTAAGTC AGACAGCA ATAAAGCAAATTTCAAAAGAGTCCATTTCATGCCCCTAGTTTTCTGCACT AGAATTAT TTTGGTGAAATACCTAAAGAAACAATGAAATACCTGGAAAAACTCAACT TATGAAAC TTTCCAATGAAAAGATTAAAACAATTGCTTCACCAATCGCACGTTACACT ATCTAGAA GGAAGAACTCTCAAAGCACATTTTACTAATTAATTAGATTTATATGTGCC GGCGAAA ACTTTACCTCAAAGATATATTAATACTAACTTAATTATAATTTATTATGAA AGAATCAA TTATAATATAATAAGTTTTAAAATAAAA(SEQIDNO:1134) GATGAAAT CAGCCATT AATAACTT TTATTTAC CCAATTAT TTATTGCG TTAACTTG ATTATATG ATATTTATT GTATATTT ATTGTTTA ATTTAGTT TCATAGTA CAAATAAA AGAATATA CTACGATT CTTTACTTT TAAGTTCT ACGAGAAC GTTGTTTT AGAATATT TTAATAAT ATTTCTAC TACAATTA AGTAATTA GAAGAAAT CAACGAAA GCAGCTAA ACTTACTC GCAAAATT CGTTGATC GAGGCTG GAATGGCA CCAACTAA TATACTAA CCAACAAT AGAAAAA AAAAGAA GAACTTCA GTGGACTT CAAATAAA ATAAAATT ATTGGTAG AATTGTAT GATAAAAC TGAGAAAA TTTGACTA AAACAAAA AGATTAGA GCAAATAT GTCATCAT TTTCCTAA TCATACAA TTAAAGCG ATGATGAC GAAATTGA GAGAATTT GAAAGAG AAAAAAG GGAAAATT GTGAAATT AAAATGAG GGAAAAT GAGGAAA AACCTGAG GAAAAATT AAATTTTG ACAACTAT GAAGAAG CAAAATTG AAAAGAG GAATTAAG TGTAAAAA AGAAGTAA AACCAATT GTTATTGA AAACAAG GATTAGGA ATTTTTGA AAACGGA AAAATTAT TTCTCAAG TTTGTCAA CACAATCA ATTACCAA AGAAAGA AGGGTAA ATCAACCA GA(SEQID NO:1257) R4 R4- CADV01008832 Bursa- GGGTCCTCGGTTCTTACTACCGTGCTCCACCTCCTCGCGATGGACCCTGG GGGTCCTC TGAGAC MISRSQADRPVEGQ 3_BX phelenchus GGTAGGCCTCCGGGCTGAGCTAAGCAGAGCATTAAAGTAAAGAGTGAC GGTTCTTA CACCCAT PVTAMSFHNLEPNN xylophilus GGCGCAGTTGCTTCCAAGTTGCGGTGGGGCGGAACATCTACTCTTCTGA CTACCGTG GCGCAG LYPENLRPTGSQDAN GAGAGGGGAAGCCCTATGGCGGCGTTAGAAAGGTTGGACTGCGGCAA CTCCACCT AGTATC RGVADIAEEVTGPSG CACTAGCCATGATCAGATTCGATCAAAATAGCCTCTGGGGCTGGCGACC CCTCGCGA CGAATC LVTNEEAARAPPLFV CTATAACGGATTGTAAACTGAACTATGCTAACCTGTCAGTAAAGACAGA TGGACCCT AGTGAA EGEYKRAHCGGGKC ACGGGGGCTTTGCAATCTAAGGTGTTGGCAGACCACTAAAACTGCCCTT GGGGTAG AGTCCA HYCRVLWIGARSSKA TGATGAGGGTGAGATAACATCCTACGGAACAGCCCCTGCTGATCCAAAC GCCTCCGG AGTTTCA RTDSWNLCEILFLINK AAAATCATCCACAAATCGGAGGGTTTTCTTGGTAGTTTCCCTTGGCACGT GCTGAGCT GGACAG CMELGNVRRIYSPLE CTTGTTTCAAAAGCCAGAATATAAAACAATACCGAAGTACAGAATGGCT AAGCAGA AAACGT SSLKEAGINRTRHAIV GGAACCTAACAGCCATGGCACAGAAAACCGGTGGGTGCACACCGGAGA GCATTAAA CAGATA KCRLAVMRDRFVDN CAGCAGGGGAAAGATTGCAAAGTGCCGTAAATAAAGATGGTAGCTCTC GTAAAGA AGTCCA APYSEHWRLYNACA TGACCTGAGGGAATGGGACACTGTCCGATTAATGCCTTATCCCGAAGAC GTGACGGC AGAGAA ENRAVVVPMDSATT GGTCAAGGTTCTATCTTATCGGTCTCCAAAGGGCTCTGGCCATAGAAGC GCAGTTGC ACGAGA VKKRTARQAGLESPS TGCGAGGAAGCCGTAACCCTACGCAGATTGCCGCCGTGCATGATAAGTC TTCCAAGT AAACAA QIGVAGKRVHEAETG GTAGCCAAGCGGATCGACCAGTGGAGGGCCAACCCGTAACAGCCATGA TGCGGTG GTTCAA TDRINAVIETNTTPLE GTTTCCATAATTTAGAACCAAACAACCTGTACCCAGAAAATCTTCGTCCT GGGCGGA GTATGC DIDLSPETPEGLAELP ACTGGGTCACAGGACGCGAATCGCGGAGTTGCGGACATTGCTGAGGAG ACATCTAC AAGAGT STVEIMELTEDGSRSR GTGACTGGTCCTTCGGGGCTGGTTACTAACGAGGAGGCAGCGCGAGCA TCTTCTGA TAATCA GTANDADGGVSISDP CCGCCGCTTTTCGTAGAGGGCGAATATAAAAGGGCACACTGCGGTGGG GAGAGGG ATAAGA LRNRPSSSQESRNVP GGCAAATGTCACTACTGCAGGGTCCTGTGGATAGGTGCGAGGTCTTCAA GAAGCCCT GAGTAC EQVDPDGELVWESL AGGCCCGTACCGACAGTTGGAATCTCTGTGAGATTCTATTTTTGATTAAC ATGGCGGC CGTAAA YGAQLRGAMRTTDR AAATGCATGGAACTTGGAAACGTGCGTCGGATTTACAGTCCTCTCGAGA GTTAGAAA TGTATG PRLPKLTKFSAAEQL GTTCTTTGAAAGAGGCTGGAATCAACCGAACACGTCATGCAATAGTTAA GGTTGGAC ACCCCCC WIKSKVEKARLECVSY GTGTCGACTAGCTGTCATGCGTGATCGATTTGTGGATAACGCCCCATATT TGCGGCAA CCTTTGC GIEQQWLRASAVLYA CTGAGCATTGGCGTCTGTATAACGCGTGTGCAGAAAATAGGGCGGTGG CACTAGCC CAAGTC TIKTVAACRPYNKAR TCGTCCCGATGGACTCGGCCACAACGGTGAAGAAAAGAACTGCTAGAC ATGATCAG GACAAC EAHKVWLENKRAEE AAGCGGGCTTGGAATCACCTTCGCAAATCGGGGTCGCTGGCAAGCGAG ATTCGATC TGTCAT KRVRRIIGRIETVRTM TCCACGAAGCTGAAACTGGAACTGATCGGATTAACGCGGTGATTGAAAC AAAATAGC GCAGGT PKGKRTDKQIRLARKI GAACACAACTCCGCTAGAGGATATTGACTTGAGTCCAGAGACCCCAGAA CTCTGGGG GTCTCTC NRLKRVSFPEMDWH GGGCTTGCAGAGTTGCCTAGTACTGTAGAGATAATGGAATTGACGGAA CTGGCGAC TTTTCAC GFLNHFKAKLDLLKKL GACGGGTCTAGATCCAGAGGGACTGCGAACGACGCTGATGGCGGGGTT CCTATAAC CCGCCA ISVRVAEHERKISRKIA TCCATCAGCGATCCACTACGAAATCGCCCAAGCTCAAGTCAGGAAAGCA GGATTGTA TATGGA GTYGKSVSGQSGFTP GAAATGTCCCGGAGCAAGTAGACCCGGATGGGGAACTTGTATGGGAGA AACTGAAC CCAAAC DVVAATTFWSGLAQ GTCTATACGGAGCACAGTTGAGGGGAGCGATGCGCACAACTGACAGGC TATGCTAA GCTATCC PGPKKFKKNSLIFQT CGAGGTTGCCAAAACTTACGAAATTCTCTGCAGCCGAACAGCTGTGGAT CCTGTCAG AGCCTC WKDSVVENMNTEP TAAATCGAAGGTTGAGAAAGCTCGGCTGGAGTGCGTTAGCTACGGCAT TAAAGACA GCTCAG VLLHPLIIECMNKPSP TGAACAGCAATGGCTCCGTGCCAGCGCAGTTCTATATGCCACGATCAAG GAACGGG AAGAGC FKATGPDGIFNSYWR ACTGTAGCGGCTTGTCGTCCGTACAACAAGGCCCGCGAAGCTCACAAG GGCTTTGC CTTAGG QGFIANWVKSLIQRT GTATGGCTCGAGAACAAACGTGCTGAAGAAAAGAGAGTACGAAGGATT AATCTAAG GCTGGG IQTGEFPASLMCGRT ATTGGGCGAATTGAGACTGTACGCACCATGCCGAAAGGCAAACGAACT GTGTTGGC GAGTAC VLLYKNGDTAKPENY GACAAGCAGATCCGCCTTGCGCGGAAGATTAACAGGTTGAAACGTGTC AGACCACT CACATG RPITCLNGCFKMTNA AGCTTCCCAGAGATGGACTGGCACGGATTCCTGAATCACTTTAAAGCTA AAAACTGC TGGCGG VITKVIVQRVQDTCA AGTTAGATCTTCTTAAAAAGTTGATCTCAGTTCGGGTGGCAGAACATGA CCTTTGAT AAACTG LPGEQMALKPKVWA GCGGAAGATTAGTCGGAAGATAGCAGGTACATACGGCAAAAGTGTATC GAGGGTG AATCTG CMEAQLRDQALQSEI CGGTCAGTCAGGGTTTACGCCGGATGTCGTTGCCGCAACGACTTTCTGG AGATAACA GATGCG GNDCKTAWIDFSKAY TCAGGCCTTGCACAGCCTGGACCAAAGAAATTTAAAAAGAACTCGCTGA TCCTACGG ATGCAT DSLDHDAIRFVIETLA TCTTCCAGACTTGGAAAGACAGTGTAGTTGAGAATATGAACACTGAGCC AACAGCCC ACCGGG LPDGMEKYLLKSLES AGTACTGCTACACCCGCTGATCATAGAATGTATGAATAAGCCCAGTCCG CTGCTGAT AGCGCA WKTKLVLSNRGKVAT TTCAAGGCCACTGGCCCTGACGGGATTTTCAACTCCTATTGGAGGCAAG CCAAACAA GCGAAA GRPYKIKRGVLQGDS GATTCATTGCGAATTGGGTTAAGTCGCTAATCCAGCGAACTATTCAAAC AATCATCC TCACTTA LSPALFVIATSPIVSHL AGGAGAGTTCCCGGCCTCACTTATGTGTGGTCGTACGGTGCTCCTGTAT ACAAATCG ACGCTG KRVCPSGRIQLYMDD AAGAATGGTGACACTGCTAAGCCGGAGAATTATCGACCTATCACATGCC GAGGGTTT GTGCAC IKLYGKSETELRMLIKE TGAATGGTTGCTTCAAGATGACTAATGCTGTTATCACGAAAGTAATCGTT TCTTGGTA CTCCTGC VQKVANKLGLQMNL CAACGGGTTCAAGACACCTGTGCTCTCCCGGGAGAACAGATGGCCCTG GTTTCCCT TATCGTA KKCSTYGAGLTESIAG AAGCCGAAGGTGTGGGCGTGCATGGAAGCGCAGTTGCGCGACCAGGC TGGCACGT GTACTTC FDPLGDRAYKYLGVP TCTACAGTCTGAGATCGGTAACGATTGTAAGACTGCGTGGATCGATTTT CTTGTTTC CTAGAT QRSVADTNLAFGELE TCAAAAGCTTATGACAGTCTGGACCACGACGCGATCCGATTCGTGATAG AAAAGCCA AGATGA GKVIRSIEETMACEYL AAACACTTGCTTTACCGGACGGAATGGAGAAATATCTCCTGAAGTCGCT GAATATAA GTAGGG TMRQVVTRLNSVIGP AGAGAGTTGGAAGACAAAACTGGTGCTTAGTAATCGAGGTAAAGTTGC AACAATAC TGGGCT LVRFVAQSVLTSQAK CACTGGGCGACCTTACAAGATTAAGAGAGGTGTTCTGCAAGGTGACAG CGAAGTAC AAAGGT VSWIYNKISDLDSKIR TCTTAGTCCGGCCCTATTTGTCATTGCGACAAGTCCTATCGTCTCCCATCT AGAATGGC AGTCGT AKLAQTGLRYKKSNV AAAGCGTGTTTGCCCATCCGGCCGTATACAATTGTATATGGATGACATT TGGAACCT CTTCAAT ARLYLSKSKNGIGLVN AAACTGTATGGCAAATCAGAGACCGAATTACGTATGCTCATTAAAGAAG AACAGCCA GGCAAT VQQVLVEALVSRAIY TTCAAAAGGTAGCCAACAAATTGGGGCTCCAGATGAATCTAAAAAAATG TGGCACAG ACCGAG CLRAPSLVEIREHILTA TAGTACCTATGGTGCTGGCCTAACAGAGAGTATCGCCGGGTTTGACCCA AAAACCGG GGGTCT EFDPVGAARTVLRRS CTCGGCGACAGAGCATACAAGTATCTCGGAGTTCCCCAGCGGAGCGTC TGGGTGCA CGTGAC RIQLEIERVEMASAIS GCAGATACTAATTTGGCCTTTGGTGAGCTTGAAGGCAAGGTAATACGGT CACCGGAG AGGTGA AIKTNYQARWMTKF CCATCGAGGAAACGATGGCGTGCGAATACTTGACAATGCGGCAGGTGG ACAGCAG GGGATA TQSKLYQKWVHHDI TCACAAGATTGAACTCTGTGATCGGTCCTCTTGTCCGGTTTGTAGCACAG GGGAAAG ACTAGT DLANSNLWLERGEIS TCCGTACTAACCAGTCAGGCGAAAGTAAGCTGGATATATAACAAAATAT ATTGCAAA ATCAGA PQNARIAVAAQDNT CGGATCTGGATAGCAAGATAAGAGCGAAATTAGCTCAAACAGGCCTAA GTGCCGTA TAATATC LLCRGFVGNRESEKQ GGTACAAGAAAAGCAACGTAGCAAGACTCTACCTAAGCAAGAGTAAAA AATAAAGA AGTCAT CRMCNMGIETCSHIL ATGGCATCGGTTTGGTGAATGTCCAGCAGGTATTAGTTGAAGCATTGGT TGGTAGCT GAAATT TECSYHRAHMYIERH GAGCCGAGCCATCTATTGTCTGAGAGCCCCGAGTCTCGTTGAGATTCGG CTCTGACC AGCAAC DSVARNIYAVLAKDH GAGCATATACTAACAGCAGAGTTCGACCCTGTAGGAGCTGCTCGTACCG TGAGGGA AACCAA GLWIPHYSQPVSSVT TGCTGCGGAGGAGTAGGATCCAACTGGAGATCGAGCGCGTCGAAATGG ATGGGACA CGTCAC KTPTCELYWNYKFPC CTTCGGCTATATCGGCGATTAAAACAAATTACCAGGCAAGATGGATGAC CTGTCCGA CGTCGT TRALEACRPDIVLIDR GAAGTTCACCCAATCGAAACTATACCAGAAGTGGGTTCACCATGACATC TTAATGCC TGGCAA AKRTILIVEVAVSYVT GATCTAGCCAACTCAAACTTATGGTTAGAGAGGGGCGAAATCTCTCCGC TTATCCCG AACACC RLKQMVSRKVYKYG AAAACGCAAGGATTGCGGTAGCGGCTCAAGATAATACGTTACTGTGTC AAGACGGT GACTAA VNGEKGADGESRG GCGGGTTTGTAGGTAATCGGGAGTCAGAAAAACAGTGCCGCATGTGTA CAAGGTTC CGATGC WNMIRELSEVYNMK ACATGGGCATTGAGACCTGCAGTCATATCCTGACTGAATGTAGTTATCA TATCTTAT TAGTTA VNLCAVVIGASGEVL CCGGGCGCATATGTATATAGAACGTCACGATTCTGTGGCAAGGAATATC CGGTCTCC GAAAGA PCTVKAIQSISSKTSSR TATGCTGTCCTGGCAAAAGACCACGGTCTCTGGATTCCGCATTACTCGCA AAAGGGCT GTCGGG QLLERCQRSAVLGST GCCCGTGAGTTCGGTAACAAAGACGCCGACGTGTGAACTTTACTGGAAT CTGGCCAT TCTTCCC RVVKRHLAEFH(SEQ TACAAGTTTCCATGTACCCGTGCATTGGAAGCTTGCCGTCCAGATATCGT AGAAGCTG AAAGTT IDNO:1503) TCTCATCGATCGCGCGAAAAGAACTATCCTCATAGTGGAAGTTGCGGTG CGAGGAA AGGTGC TCGTATGTCACACGGCTAAAGCAGATGGTGTCACGAAAAGTGTATAAAT GCCGTAAC TTGCACC ATGGAGTAAACGGTGAAAAGGGAGCCGACGGCGAAAGCCGCGGCTGG CCTACGCA GAAGCC AATATGATCCGTGAACTATCTGAAGTTTATAATATGAAAGTGAACCTTTG GATTGCCG GATCCG CGCAGTCGTGATTGGGGCGAGTGGGGAAGTTCTCCCTTGTACGGTCAA CCGTGC CTCTACC AGCCATTCAGAGCATATCTAGCAAGACCTCCAGCCGACAGCTCCTAGAG (SEQID CACAGC CGGTGCCAGCGTAGCGCTGTCCTAGGGTCGACGAGGGTTGTCAAACGA NO:1258) TCTGCCC CATCTTGCAGAATTCCATTGAGACCACCCATGCGCAGAGTATCCGAATC AGCGTT AGTGAAAGTCCAAGTTTCAGGACAGAAACGTCAGATAAGTCCAAGAGA (SEQID AACGAGAAAACAAGTTCAAGTATGCAAGAGTTAATCAATAAGAGAGTA NO: CCGTAAATGTATGACCCCCCCCTTTGCCAAGTCGACAACTGTCATGCAG 1381) GTGTCTCTCTTTTCACCCGCCATATGGACCAAACGCTATCCAGCCTCGCT CAGAAGAGCCTTAGGGCTGGGGAGTACCACATGTGGCGGAAACTGAAT CTGGATGCGATGCATACCGGGAGCGCAGCGAAATCACTTAACGCTGGT GCACCTCCTGCTATCGTAGTACTTCCTAGATAGATGAGTAGGGTGGGCT AAAGGTAGTCGTCTTCAATGGCAATACCGAGGGGTCTCGTGACAGGTG AGGGATAACTAGTATCAGATAATATCAGTCATGAAATTAGCAACAACCA ACGTCACCGTCGTTGGCAAAACACCGACTAACGATGCTAGTTAGAAAGA GTCGGGTCTTCCCAAAGTTAGGTGCTTGCACCGAAGCCGATCCGCTCTA CCCACAGCTCTGCCCAGCGTT(SEQIDNO:1135) R4 R4- Bursa- TGCCAGCGGTGTTGATTAGGTCCAAGTTCTTTGGCCAAAGATCCGCCCT TGCCAGCG TAATTA MEILWEDLRLKIEDR 4_BX phelenchus CGGTTTAGCAGTACCGAACGAGTATACCTTCAAGTGGTGGCACTGAATT GTGTTGAT GGCAGT YGVTLPQRSASSLKN xylophilus AGACTGAATACTCTGAACTGTAGACTTTTGTGCAACTGTGTATGGTGTG TAGGTCCA GCTCCTA QYPKVILRGLPDSGLP GAAGACTTGTTTGTACCACTATCAGCTTTATTGGGGCTCGTTACTGTTTC AGTTCTTT GCGGAG WAGVQVNDTGQVV ATACAGGTAGATGTCCCCTTTAGAGATTTCCCTGCAGTTTGCGCTCCAAG GGCCAAA TGCCGT VVDHAEAATLRGSSP TCGCTAGCCTCTTGCGTGTAGTCAAAGGAATACATTCGCCGTCGGTGAC GATCCGCC GAAGTG AAVDGEAEEPVVPPL AGGGCTATACCCGGCGACTACGGACTTGTTTATTACGTAGTGCAGCCTC CTCGGTTT GTGTCA PAAEVVESAADAAVP GTTTAAGACGAATGTGAAAAGAAGGTGTGATTACTAAGCGTTATGAGTC AGCAGTAC GTACCC DPQSEIVADQGVETR GGGTTATCTGGAAACTCCGCCCCGCCGCAATGGCTTCAAGGGTCAAACT CGAACGA GTCGTG PVENPPANSRETETE GCTAACATTTTAAACCAATATTTGGCAGCAGCAGACTATGATGTCAGAG GTATACCT TGGAAA PVEVEPYLEGQYKFF TGCCGGGGGTCTATCGATATAAGACTGAAAGCGATAGACGTGGAGTGA TCAAGTGG GCCCAG VSKILGKSMWRKPIK AAGGATTCCCTTTGTTAAGAGAATCGGTAGAATTCACTTTTTACTTATTC TGGCACTG GAGGGT YPRRVPETLWRQAN AAGAACTTAACAGCAACAAGCACTCGCGAGGATTACCGCCCCAGATTCG AATTAGAC TAGTAC ELIERSIRQGEVSIQSL GTCGGCGGTACTTCACCTGCTTTCTTCCACTTTCGGAATCTGGCATCCTG TGAATACT CGACAG NCMVYAAGCAVKSS GCTTTCAGTGTGGTGATGGCCGGCTTGAGTTTTCTTGAGTCGTGCGAGT CTGAACTG TGGGAA LDKKDQEAKRRESE GCCTCATCTGGGACGTCCGGACCGATTGATGGAGTCTGCAGTGGACGA TAGACTTT ACCCGC WYACRKAEIKALERY GGACTTGATGGACCGTAACCATAGTATATCCCTCATGCGTCTTCTCGACT TGTGCAAC TGCACG LNFIDLELKRRSASRP CGAGGGGGTAGCTTGCACTACCCACCCTTCTCTTCTCCGATTGGGATTTA TGTGTATG CAACCT LTSRQRQNLGVLITKY GACCTAGCCCTCTGGTGTGTCTCGACCCGCGATATCAGATTCCTGAATCG GTGTGGAA AAAGAC GRARVRSGVRLSELQ ACTGTGAGAAATGTCTACGCGCAAAGATCGACCCATTGCCACCCGGCTA GACTTGTT ATTTGCC AMLRDALVGIRKCM TGTGGATCGGGCTCTTGACTGCTTATCTCCGGCTTTAATCGCTTGAGGAA TGTACCAC CTTCGG AKRSADKKRKQGKFV AGGGGGGTGTCGCCCGAAAGGGTTACGCGATCATCCGATTCCTACCGT TATCAGCT GGGAGA PIQRYLEPSSAEPRLS AAAAACGTAGTTGAAGTAGGATTGAACCTGAGTACCAAGTGAAAAGAG TTATTGGG GGTATG PDTVRAYWNDIVGS TGGCTATAATGCTCCATGGTATACCCTAGGGATAACCGTGTCGAGCGCA GCTCGTTA AGACCA SQQSTSDSTIQDWSS CTGCTCAATACCCTGATTTGTTAGTGTAGGTATGTCTAGTGGCTGCCTAG CTGTTTCA CACTAG NLSVPSQELNASKIM GCAGATTCGCCTATTCACTACGTAAAATCTGGGTGAGTACCATAAGAAC TACAGGTA TCATGG GWWRAAVSKSKPN CTCCTGTAGGGCCAGGAGTCAAACTAGTCCAGGTTCGGTATGCTTCGGT GATGTCCC TTGCTTG KAAGPDGIPGVLWK ATACTCTCTCACGGGCTAGTCACCTAAGAGTTAAGAACCGCCTTTTTCTG CTTTAGAG CGCAAG RFRSASEWVCTWLY CACTGTGAATAGAAAAAGAAGGGCGGGAGAATATACGGGCGAGGTAA ATTTCCCT CATGAC RLLQKRRIITPRWLSV GCACGTCGGAACGGGGTGTGCCCAACTCGTACCGTACGCCCGGACAAG GCAGTTTG TCATTCA GRVVLLPKKGPLEDP GACCATCTTTCGTACCCCGTTCACCCGGCACAAGTCCGATTGTCTCTCCC CGCTCCAA TTGTAA ANYRPIACLNTVYKLI GAGAGGCGTCGGCGGAGGTTGGCTAACGCCGTCTCTCCAACCATCGGT GTCGCTAG GTTCGT TSVVEMAVREQIQA TTGGGTTTAAGTAACGCCCCAGTGGCCGGTAGCCAAACTGATGGAGGC CCTCTTGC ATTATG CPGLVPYEQIANRKG AGAAATGCCGGTCCGTTGCTAAATGCGGGAACGGACCAAAACGCCGGT GTGTAGTC AGGCCT VWGCTHASIVDRMI GTGGTGGATACAGGTGGAAGAGTCTTTTGGTCTACGCAAGAAAAGACC AAAGGAAT CGCGGA TGASREGKGGGFPDL AGGCTCAAGTACGAATACGACCGTCTTCGTGAACGACGCCGTCGAGAC ACATTCGC ATGCGA RVLFYDCKKAFDSVN AAAGTTCGAAATCCTACGGTCGATAAGGCCATGGAAATCCTTTGGGAAG CGTCGGTG AAGTCC RDHMFAVLRVANVN ATCTCCGGTTGAAAATTGAAGATAGGTACGGTGTCACATTGCCGCAAAG ACAGGGCT AAGAGC VKVVHLLHTLSQQW AAGCGCAAGCTCTTTGAAAAACCAGTACCCCAAAGTTATTCTGCGGGGG ATACCCGG GCCAAC CVRYELRRNNRVERS CTCCCAGATTCGGGCCTGCCTTGGGCGGGGGTTCAGGTAAATGATACTG CGACTACG TTGTCTC SPLRVKRGLLQGDTL GACAAGTCGTAGTCGTAGACCATGCTGAGGCGGCTACCCTTCGGGGGT GACTTGTT TTTGGG SPTWFCLCMAPISASI CCAGCCCAGCAGCGGTCGACGGGGAGGCCGAAGAACCGGTAGTACCCC TATTACGT AGGTTC KTLNPGPTLRPNMG CGTTACCCGCGGCAGAGGTCGTTGAATCGGCTGCTGACGCTGCGGTGC AGTGCAGC CTCCGG NGRNRGQVAIQVSH CCGATCCGCAAAGCGAGATCGTGGCTGACCAGGGTGTGGAGACACGCC CTCGTTTA GTTCCTT VFYMDDLKVYCPRV CTGTCGAGAACCCCCCCGCTAATAGCCGGGAGACTGAAACGGAACCCG AGACGAAT AGGGTT ADQRRMEQNIPQLF TAGAGGTAGAGCCTTACCTCGAAGGGCAGTATAAATTCTTCGTTTCTAA GTGAAAA GGCGTA GEIGLSINASKSAAAA GATCCTTGGGAAATCGATGTGGCGGAAGCCTATCAAATACCCGAGGAG GAAGGTGT CCATGCT AVGRYVESELPVLGT AGTCCCGGAAACCCTGTGGCGTCAAGCCAATGAGTTGATTGAGAGATC GATTACTA TCGTGT KDEYKYLGIESGFVVN GATTCGACAGGGGGAAGTGTCGATTCAGAGCCTGAACTGCATGGTTTAT AGCGTTAT GAGACC EVAALDRMQAVLLN GCTGCTGGGTGCGCTGTTAAGTCTTCGCTGGACAAGAAGGATCAGGAA GAGTCGG TAGGCC RVEAILSVKEHTVGQ GCTAAGCGGCGTGAATCCGAGTGGTATGCGTGTAGGAAGGCCGAGATT GTTATCTG GCTTGG RRDAIRAKAIPGGAYI AAGGCACTCGAGAGGTATCTCAATTTCATTGACCTGGAGTTAAAGCGTC GAAACTCC ACCTAG LGHIILSDLDPRGAAE GCTCTGCGTCGCGGCCGTTGACATCAAGGCAAAGGCAGAACTTGGGTG GCCCCGCC AGTTAA RMRRLDIEIRRLVKSA TGCTCATCACGAAGTATGGGCGTGCTCGGGTTAGGTCTGGTGTCCGGCT GCAATGGC GCTGTT GILHDKCSTARIHLSC CTCGGAATTGCAGGCAATGTTACGAGACGCCTTAGTGGGAATCCGAAA TTCAAGGG GCATGT EQGGLAWPSMERAY GTGTATGGCGAAGCGCTCGGCAGACAAGAAACGGAAACAGGGCAAATT TCAAACTG TTGAGG YVAVAYSASYLLTSQ TGTTCCTATTCAACGTTACTTAGAGCCTAGCAGTGCGGAGCCTCGACTCT CTAACATT ATGCCTT DETISRARDYFVSGRL CTCCAGACACGGTTCGTGCATATTGGAATGACATTGTGGGATCTTCCCA TTAAACCA AGGCGG SNKFTVYKHLTSIVDS ACAGTCAACGTCAGATTCGACGATTCAAGACTGGTCCTCCAATTTAAGC ATATTTGG ACGCCA LGLSVELPDPNGLPT GTTCCGAGTCAAGAGTTGAACGCAAGTAAGATCATGGGGTGGTGGAGA CAGCAGCA TG(SEQ GQPSVLARTIARAIDA GCCGCCGTTTCGAAGAGCAAGCCCAACAAAGCTGCTGGTCCTGATGGTA GACTATGA IDNO: KLEAQWKETLLTYQR TTCCCGGGGTACTGTGGAAACGCTTCAGGTCGGCTAGTGAGTGGGTTTG TGTCAGAG 1382) AGRVERADPTVVDH TACCTGGTTATATCGCTTATTACAGAAGCGAAGGATAATCACGCCAAGA TGCCGGG ANSYHWLRKAWINE TGGTTGTCTGTTGGTAGAGTCGTATTGTTACCAAAGAAAGGACCTCTCG GGTCTATC KAYQHAVSVMEGTL AGGATCCCGCCAACTATAGACCCATCGCGTGCCTGAACACTGTATATAA GATATAAG LEGVNPHGVLTMCR GTTGATAACCTCGGTAGTAGAAATGGCAGTTCGAGAGCAGATCCAGGC ACTGAAAG ACKAPSASIAHIITGC TTGTCCTGGACTAGTCCCATACGAGCAGATAGCTAACCGTAAAGGCGTA CGATAGAC AELRKSHMKVRHDG TGGGGCTGTACGCATGCGAGTATCGTAGACCGTATGATAACCGGTGCA GTGGAGT VTRWLYNALTEVDGS AGCCGAGAAGGTAAGGGCGGTGGGTTTCCGGACCTACGCGTCCTATTTT GAAAGGA LPKFHYTQQIPAEMR ACGATTGTAAAAAGGCCTTCGACAGCGTAAACCGAGATCATATGTTCGC TTCCCTTT GERLTVRYDSDIVTP TGTTTTGAGAGTAGCAAACGTGAACGTTAAGGTCGTGCATCTACTCCAC GTTAAGAG NKPRHNRPDLVVFDS ACACTATCTCAGCAATGGTGTGTCCGATACGAGCTCCGTCGAAACAACC AATCGGTA TRKVIYIVEVSVTWLS GTGTGGAGAGAAGTAGCCCGCTGAGGGTGAAAAGAGGACTGCTGCAA GAATTCAC VLQKQYDNKLNRYA GGCGACACTCTCTCTCCTACATGGTTTTGCCTATGCATGGCTCCAATATC TTTTTACTT VNSNHEFSESIPYPPG AGCCTCTATTAAAACCCTTAACCCTGGCCCAACCCTTCGGCCGAACATGG ATTCAAGA VNLANEIRVLYPQFT GGAATGGGCGAAATAGAGGGCAGGTGGCGATCCAAGTTAGTCATGTGT ACTTAACA GGVKVFPMIISPTGE TCTACATGGATGACTTGAAAGTGTATTGCCCCCGCGTAGCCGATCAACG GCAACAAG VHMQFVPHLAELLE CAGAATGGAACAGAATATTCCACAGTTGTTCGGTGAAATTGGACTAAGC CACTCGCG NPNIPRILEKIQRSVVL ATAAATGCGAGTAAGAGCGCAGCAGCCGCAGCCGTGGGAAGATACGTT AGGATTAC GTDYIIRSYFAM(SEQ GAATCCGAGTTGCCAGTTCTTGGGACGAAGGATGAGTACAAGTACCTG CGCCCCAG IDNO:1504) GGAATAGAAAGCGGTTTTGTTGTAAATGAGGTGGCCGCCCTTGACAGA ATTCGGTC ATGCAGGCCGTTCTGCTAAACCGTGTAGAAGCGATTCTATCGGTTAAAG GGCGGTAC AACATACTGTTGGTCAGAGGCGCGACGCTATACGAGCGAAAGCGATCC TTCACCTG CTGGGGGGGCCTATATCCTGGGTCACATCATTTTGAGTGACTTGGACCC CTTTCTTCC CAGGGGTGCTGCTGAGCGCATGCGTCGGTTGGATATCGAGATTCGGCG ACTTTCGG TTTAGTCAAATCAGCTGGTATTCTTCACGACAAGTGTAGTACAGCGCGT AATCTGGC ATTCACCTTAGCTGTGAACAGGGAGGTCTAGCTTGGCCGTCGATGGAGA ATCCTGGC GAGCTTATTATGTAGCAGTTGCATATTCAGCCTCCTATCTTCTGACAAGT TTTCAGTG CAAGACGAAACTATCAGTCGTGCCAGAGACTACTTCGTCTCGGGTCGAT TGGTGATG TGAGTAACAAGTTTACTGTTTACAAGCACCTTACGAGCATCGTAGACTCC GCCGGCTT CTTGGTTTATCGGTAGAACTGCCGGATCCGAATGGACTTCCTACCGGTC GAGTTTTC AACCGTCCGTCCTGGCTAGAACTATTGCGAGAGCGATAGATGCCAAGTT TTGAGTCG GGAAGCGCAGTGGAAGGAAACTTTGCTAACTTACCAGCGCGCAGGGCG TGCGAGTG AGTCGAAAGGGCAGATCCGACGGTCGTGGACCATGCAAACTCATACCA CCTCATCT TTGGTTGAGAAAAGCATGGATAAACGAGAAAGCGTACCAACATGCCGT GGGACGTC GAGTGTTATGGAGGGAACTCTTTTGGAGGGAGTGAACCCTCACGGTGT CGGACCGA GTTGACCATGTGTAGAGCGTGTAAAGCTCCAAGTGCAAGCATTGCTCAC TTGATGGA ATCATTACTGGCTGCGCCGAATTGCGCAAAAGCCATATGAAGGTGAGG GTCTGCAG CATGATGGAGTTACTCGCTGGCTATACAATGCTCTGACTGAGGTAGATG TGGACGA GTAGCCTACCGAAGTTCCACTATACGCAACAGATCCCGGCAGAGATGAG GGACTTGA GGGTGAGCGACTCACTGTCCGTTACGACTCGGACATAGTTACTCCCAAT TGGACCGT AAACCTCGTCACAACAGACCGGATCTTGTTGTCTTCGACTCTACTAGAAA AACCATAG GGTGATTTATATAGTAGAGGTCAGTGTAACGTGGTTGAGTGTGTTGCAA TATATCCC AAGCAATACGATAACAAGCTCAACAGGTATGCAGTGAACAGTAATCAC TCATGCGT GAGTTCAGTGAGAGCATACCATATCCGCCTGGTGTGAACCTTGCTAACG CTTCTCGA AGATAAGGGTTTTGTATCCTCAGTTTACTGGGGGTGTGAAGGTATTCCC CTCGAGGG TATGATTATCTCGCCAACAGGGGAAGTACACATGCAATTTGTGCCCCAC GGTAGCTT CTTGCAGAGTTACTCGAAAATCCGAACATTCCTCGGATTCTAGAGAAAA GCACTACC TTCAACGCTCGGTTGTGCTTGGTACAGATTACATTATCCGGTCATACTTC CACCCTTC GCCATGTAATTAGGCAGTGCTCCTAGCGGAGTGCCGTGAAGTGGTGTC TCTTCTCC AGTACCCGTCGTGTGGAAAGCCCAGGAGGGTTAGTACCGACAGTGGGA GATTGGGA AACCCGCTGCACGCAACCTAAAGACATTTGCCCTTCGGGGGAGAGGTAT TTTAGACC GAGACCACACTAGTCATGGTTGCTTGCGCAAGCATGACTCATTCATTGT TAGCCCTC AAGTTCGTATTATGAGGCCTCGCGGAATGCGAAAGTCCAAGAGCGCCA TGGTGTGT ACTTGTCTCTTTGGGAGGTTCCTCCGGGTTCCTTAGGGTTGGCGTACCAT CTCGACCC GCTTCGTGTGAGACCTAGGCCGCTTGGACCTAGAGTTAAGCTGTTGCAT GCGATATC GTTTGAGGATGCCTTAGGCGGACGCCATG(SEQIDNO:1136) AGATTCCT GAATCGAC TGTGAGAA ATGTCTAC GCGCAAA GATCGACC CATTGCCA CCCGGCTA TGTGGATC GGGCTCTT GACTGCTT ATCTCCGG CTTTAATC GCTTGAGG AAAGGGG GGTGTCGC CCGAAAG GGTTACGC GATCATCC GATTCCTA CCGTAAAA ACGTAGTT GAAGTAG GATTGAAC CTGAGTAC CAAGTGAA AAGAGTG GCTATAAT GCTCCATG GTATACCC TAGGGATA ACCGTGTC GAGCGCAC TGCTCAAT ACCCTGAT TTGTTAGT GTAGGTAT GTCTAGTG GCTGCCTA GGCAGATT CGCCTATT CACTACGT AAAATCTG GGTGAGTA CCATAAGA ACCTCCTG TAGGGCCA GGAGTCAA ACTAGTCC AGGTTCGG TATGCTTC GGTATACT CTCTCACG GGCTAGTC ACCTAAGA GTTAAGAA CCGCCTTT TTCTGCAC TGTGAATA GAAAAAG AAGGGCG GGAGAAT ATACGGGC GAGGTAA GCACGTCG GAACGGG GTGTGCCC AACTCGTA CCGTACGC CCGGACAA GGACCATC TTTCGTAC CCCGTTCA CCCGGCAC AAGTCCGA TTGTCTCT CCCGAGAG GCGTCGGC GGAGGTT GGCTAACG CCGTCTCT CCAACCAT CGGTTTGG GTTTAAGT AACGCCCC AGTGGCCG GTAGCCAA ACTGATGG AGGCAGA AATGCCGG TCCGTTGC TAAATGCG GGAACGG ACCAAAAC GCCGGTGT GGTGGATA CAGGTGG AAGAGTCT TTTGGTCT ACGCAAGA AAAGACCA GGCTCAAG TACGAATA CGACCGTC TTCGTGAA CGACGCCG TCGAGACA AAGTTCGA AATCCTAC GGTCGATA AGGCC (SEQID NO:1259) R4 R4_AL U29445 Ascaris GGGGCCGGTGGGTTTACTCACTTCTGACCCACCACCAACGGAACGAGG GGGGCCG TAGTCG MPCSTNSFFERGTPE lumbricoides GAAAGCAGAGCTGGGGCCCTCTTCCGATTGGCATGGAACCGACCTCCAC GTGGGTTT CTAAGG PHREPISGTDSSESLG GTGGTGGCCCTGGGCAACGGAATTCAAGAGAGGATTTAATCCTCTCTAT ACTCACTT GGTCCG MGTHRSPRLNDDEVI CATTTGCAAGATGGATGAGATCGAGGTATCCGGCAAACAGGTTCCAAG CTGACCCA GAAATG NGPKGHESDPVHVV TGAGCACCTTTCCCATAGCTGGGAATATGGGTTAGGCGTCCTCTGACAT CCACCAAC GTCCGG RAPRTLHPRRLELPIG ATAAGAGGAATCAGACTCGTTCGCGCCCGGTCATTAACATCGATCAGCG GGAACGA TCCTGC VNNLGEASQLRQDS GGAGGGCCGGACTGAAGTAAATTTCCTGTTGGCCCGAGTGCAGGTGGA GGGAAAG GCTACC AIAEEAQLESTENHD GCTCGGACCCGAAAAACGATCCCTAAGAGGACCACAACCCGAAGGGAT CAGAGCTG CGGTTC GRRPPLRGGRKLWS GGACGCAGTCGCCCCGGCACGTTTGGTGTTGGTTATCCTGGAGTGTTGT GGGCCCTC TGGTAG EKEIATLRRLCEAYGN GGGACGAATAGCTATGCCTTGTTCAACTAACTCTTTTTTTGAACGAGGG TTCCGATT CACGTTC RQVCWKEVQRKFAD ACCCCTGAGCCCCATCGTGAGCCAATCTCAGGGACTGACTCTTCGGAGT GGCATGG AAGCGC FHEERTVAALATKW CCTTAGGTATGGGTACCCACCGTTCACCTCGGCTAAATGATGACGAGGT AACCGACC TCAATC GALKRPRAPMVGAP GATAAACGGCCCCAAGGGTCACGAAAGTGACCCGGTTCATGTAGTGCG TCCACGTG GCCTGC PTPDHDPERGPAGE TGCGCCTCGAACGCTACACCCGAGGCGATTGGAACTCCCAATCGGAGTA GTGGCCCT CTTGTA GDGGTTSQENVPTD AATAATCTCGGAGAAGCCTCTCAACTACGGCAAGATTCAGCAATAGCTG GGGCAAC GGCAGT DPIPANGPTEGKESD AGGAAGCCCAATTGGAGAGCACCGAGAACCATGATGGACGTCGCCCTC GGAATTCA CCATCTG VRPAVACRCTEPEEQ CTTTGAGAGGCGGCAGGAAGCTCTGGAGTGAAAAGGAGATCGCCACAT AGAGAGG TGGAAG LMESDVRPPAVVRLA TAAGGAGATTGTGTGAGGCCTATGGAAATAGGCAGGTATGCTGGAAAG ATTTAATC TCGCGC DPEQHTMKSGVKPV AGGTTCAGAGGAAGTTTGCGGACTTCCACGAGGAAAGAACAGTGGCTG CTCTCTAT TCTTGAT ALDGSADLEERPKEK CGTTAGCCACGAAGTGGGGGGCTCTTAAACGCCCGAGAGCTCCGATGG CATTTGCA ACAGAT DIEQMGVDFEGEPRF TTGGTGCGCCACCGACTCCTGATCATGATCCAGAGCGTGGACCAGCGG AGATGGAT GTGGAC RAFRKAFYGYFRWA GCGAGGGAGATGGAGGTACAACCTCGCAAGAGAATGTGCCTACCGATG GAGATCGA GGATGG VNSFDREPVKRVRRD ATCCAATACCCGCGAACGGCCCGACAGAGGGCAAGGAATCGGATGTGA GGTATCCG AAGCAG CPKVFYAYADYLIATG GACCAGCCGTGGCCTGTAGGTGCACGGAACCGGAGGAGCAACTGATG GCAAACAG ATGATA SSKALGPNQSRIGRL GAGTCGGATGTGAGACCGCCTGCAGTCGTAAGACTTGCAGATCCGGAG GTTCCAAG GAGCCG NGLVYAAARTIHQF CAGCATACGATGAAGTCGGGTGTGAAACCGGTTGCACTTGACGGGTCA TGAGCACC GTGACG WREEVGHRQQGEK GCGGATCTGGAGGAGCGTCCGAAAGAGAAGGATATCGAGCAGATGGG TTTCCCAT GCCCTA GWYTKTKATREDLQ TGTTGACTTTGAAGGGGAACCGAGGTTTCGTGCCTTTCGGAAAGCCTTT AGCTGGG CTAGCC MLISMMESELARRKE TACGGGTATTTCAGATGGGCTGTCAACTCGTTTGATAGGGAACCTGTCA AATATGGG AAACGC KRKPGAKELENIHKLV AGCGAGTTAGGCGGGACTGTCCCAAGGTATTTTACGCTTATGCGGATTA TTAGGCGT (SEQID ARLGTRSTSGIVRRLE CCTTATCGCAACCGGAAGTTCTAAGGCGTTGGGACCGAATCAATCCAGG CCTCTGAC NO: MTRQRLKLLEDRISLH ATTGGTCGCTTGAACGGATTGGTCTATGCGGCAGCTAGAACAATCCACC ATATAAGA 1383) EQEKRRKRLRKQFAE AATTCTGGAGAGAAGAGGTAGGCCATCGTCAGCAAGGCGAGAAGGGA GGAATCAG TPSLKLLTKGAKDRG TGGTACACGAAAACCAAGGCAACCCGTGAAGACCTTCAGATGCTCATCT ACTCGTTC DTMVTMKSVMDFW CTATGATGGAATCTGAACTTGCAAGAAGAAAGGAGAAGAGGAAGCCCG GCGCCCGG RPIIGRRVTSNPDQL GCGCGAAAGAGCTGGAGAATATCCATAAGCTAGTTGCAAGACTTGGAA TCATTAAC QVLRDWRDEQKKAY CGCGCAGCACATCGGGCATCGTCAGAAGGCTGGAGATGACAAGACAGA ATCGATCA PADLDLEKADLEEKYE GGCTCAAACTGTTGGAAGACAGAATAAGTTTGCATGAGCAAGAGAAGA GCGGGAG GAIRRIQPWKAPGPD GGCGGAAACGATTGCGCAAGCAATTTGCGGAAACCCCTTCTCTAAAATT GGCCGGA GLHAHWWKALPSAK ACTCACAAAAGGAGCCAAGGATAGGGGCGATACGATGGTAACCATGAA CTGAAGTA RLLGELVVDWLTTGK ATCTGTAATGGACTTCTGGAGACCAATAATTGGCAGACGAGTTACCTCC AATTTCCT VTTGWMCRGRTILIP AATCCGGACCAATTGCAAGTCTTGAGAGACTGGAGAGATGAGCAGAAG GTTGGCCC KKGDRGDPSNYRPIT AAGGCTTATCCGGCAGACCTGGATTTAGAAAAGGCCGATCTTGAGGAG GAGTGCA CLNTCYKVLTSVMNS AAATATGAGGGAGCAATCAGGAGAATCCAACCGTGGAAAGCTCCCGGT GGTGGAG VILSHLSRGEALPMN CCCGACGGATTACATGCCCACTGGTGGAAAGCTTTACCGTCGGCCAAGA CTCGGACC QRAMRKREWGCTH GGCTACTGGGTGAACTGGTGGTTGATTGGCTGACAACAGGTAAGGTTA CGAAAAAC AMVLDRAMVMDA CCACTGGCTGGATGTGCCGAGGGAGGACAATCCTGATCCCTAAGAAGG GATCCCTA MAQKKHSLSVAWLD GTGATAGGGGCGACCCTTCTAATTACCGACCTATAACATGTTTAAATACA AGAGGAC YRKAYDSVSHEYIRW TGTTATAAGGTGCTAACATCGGTAATGAATTCAGTCATTCTGAGTCACCT CACAACCC AINSVNIPRSVQLTLK GAGCAGAGGCGAAGCTTTACCAATGAACCAGCGAGCAATGCGGAAACG GAAGGGA RLMSDWETRFESTQ CGAGTGGGGTTGCACCCACGCTATGGTCCTTGACAGGGCCATGGTAAT TGGACGCA CRPKLRSDKMKVLN GGATGCAATGGCTCAAAAGAAACACTCATTAAGTGTGGCCTGGCTTGAC GTCGCCCC GIFQGDSLSPTLFVLCI TATCGTAAAGCATACGATAGTGTGTCGCATGAATATATTCGCTGGGCGA GGCACGTT APISYALNKGVGQCQ TTAACTCCGTGAATATACCCCGGAGTGTTCAGCTGACGCTCAAGAGGCT TGGTGTTG SSSGWSAGYGFEIGH CATGAGTGACTGGGAGACACGCTTTGAGTCGACGCAATGCCGGCCGAA GTTATCCT QFYMDDLKLYARTPA GTTAAGGTCTGACAAAATGAAAGTGCTGAATGGCATCTTTCAAGGTGAC GGAGTGTT MLDSQIQVVSEVSEA TCGTTATCACCAACCCTTTTTGTATTATGCATAGCACCTATCAGCTACGCA GTGGGAC MGLHLNLSKCAKAH CTCAATAAGGGTGTCGGCCAGTGTCAATCCTCATCCGGCTGGAGTGCAG GAATAGCT YAPHGAGGAQEAVE GTTACGGTTTTGAGATTGGACATCAGTTCTATATGGACGATCTTAAACTG (SEQID GAEGSRKGEIPILGLR TACGCTAGGACGCCTGCGATGCTAGACTCCCAAATCCAGGTGGTGTCTG NO:1260) STYKYLGVEQRLLPM AGGTGTCGGAAGCAATGGGACTCCATTTGAATTTGAGTAAATGTGCGA EVALKEFEDKFMDRA AAGCACATTATGCTCCGCATGGGGGGGGCGGAGCGCAAGAAGCTGTGG ETIFASELTWGQMAT AAGGTGCAGAAGGATCGAGGAAGGGAGAAATCCCGATACTCGGGCTTC AYNTIAIAGLRYVYSN GAAGCACCTATAAATATCTTGGAGTGGAACAACGACTCCTTCCGATGGA TNGASPKLLEALKRA AGTAGCCCTCAAAGAGTTCGAGGATAAGTTTATGGATCGAGCAGAAAC ATLDTRIRDLLRRHKC AATCTTCGCTAGCGAACTCACATGGGGGCAGATGGCCACAGCGTATAAT RFRNSFVERLYIPREC ACTATAGCTATCGCTGGTCTACGATACGTCTATAGTAATACAAATGGAG GGYGLKSVEDTLRESI CATCACCAAAGCTTCTAGAAGCCCTGAAAAGGGCGGCCACCTTAGACAC LATWSYIATNPHLAG GCGGATAAGAGATCTTTTGAGGCGACATAAATGTCGGTTTCGAAATAGC QQYFFERLAARGKRT TTTGTCGAGAGGCTGTATATCCCTAGAGAATGCGGTGGATACGGGTTAA PMADGVKILLDLGVE AATCAGTGGAGGATACGCTGCGAGAGAGTATCCTCGCTACGTGGAGTT PQVDLKRRTVTVDGI ACATCGCCACGAACCCGCATTTGGCTGGACAACAGTATTTCTTCGAGAG VFEDPTKLHRYLVGKL GCTTGCAGCAAGAGGCAAGCGCACCCCGATGGCGGACGGGGTAAAGA LKARTEARIRRWKEA TATTGTTGGATCTGGGAGTGGAACCCCAGGTGGACTTGAAGCGAAGGA SLAGRLVNDTSIDMR CGGTGACCGTCGACGGTATAGTCTTCGAAGACCCGACCAAGCTTCATCG LSCLWMKKGFVSAR ATACCTGGTGGGAAAGCTCTTAAAAGCAAGAACTGAGGCGAGGATTCG NLRDALAVQEGSLLT AAGATGGAAGGAAGCCAGCTTAGCTGGACGGTTGGTAAATGACACAAG RACPALKGKGGQEV TATTGATATGCGACTATCATGCTTGTGGATGAAAAAAGGTTTTGTGAGT CRCCHAAPETAEHIT GCGAGGAACCTCAGAGATGCGCTTGCTGTGCAAGAGGGGAGTTTGCTT SACRYWLPSLYVERH ACTAGAGCATGCCCTGCTCTAAAGGGTAAAGGCGGCCAAGAAGTTTGC DSVARNLYYVICCRY CGGTGTTGCCATGCAGCGCCGGAAACTGCAGAGCACATAACATCAGCCT GITPVHYSNRVSPLSE GCCGCTATTGGCTTCCAAGTCTCTACGTTGAGAGACATGACTCGGTAGC NSQCRVLWNMDM AAGGAACCTCTATTACGTCATATGCTGCCGCTACGGCATAACACCGGTG QTRTPMKHRKPDIVV CATTACTCAAATAGGGTATCACCGCTCTCGGAGAATAGCCAATGCCGCG FDLKREKILMFEVSIA TTCTTTGGAACATGGATATGCAGACTCGGACGCCAATGAAGCATCGAAA HASGLLKQREIKINRY GCCTGATATAGTCGTCTTTGATCTCAAGAGGGAAAAGATCCTCATGTTC TVNSEELPDETITPYP GAAGTTTCGATAGCCCATGCCAGCGGGTTATTGAAACAGCGGGAAATC PGPNLAADLAATYG AAGATCAATCGGTATACGGTGAACTCCGAAGAGTTGCCTGATGAGACCA WQVEFAPVVVGTCG TAACACCGTATCCGCCTGGGCCGAATTTGGCCGCTGACCTCGCTGCCAC EHVPAVKEDLQRTLD CTATGGTTGGCAGGTTGAATTTGCCCCAGTGGTGGTTGGCACGTGTGGT LKPHQVEALLERISRS GAGCACGTACCAGCCGTCAAAGAGGACCTGCAAAGAACGTTGGATCTA AVIGTARVVRAHLAC AAACCTCATCAAGTCGAAGCCCTTCTTGAAAGGATATCCCGATCGGCGG S(SEQIDNO:1505) TGATCGGAACGGCTAGAGTAGTCCGAGCACACCTCGCCTGCTCCTAGTC GCTAAGGGGTCCGGAAATGGTCCGGTCCTGCGCTACCCGGTTCTGGTA GCACGTTCAAGCGCTCAATCGCCTGCCTTGTAGGCAGTCCATCTGTGGA AGTCGCGCTCTTGATACAGATGTGGACGGATGGAAGCAGATGATAGAG CCGGTGACGGCCCTACTAGCCAAACGC(SEQIDNO:1137) R4 R4_Hmel Heliconius ATAAATAATAATAATAATAATAATAATAAGCCCCCTAAAATCCAACCATA ATAAATAA TAACTTA ITYTANMALVTLFME melpomene CGTCCGAGTCGAACATCTGATTCTCGTGGGGGGCGGACACGTGAAAAT TAATAATA TTGTCA NMENKRYNLRPLPG AAATTACTTACACGGCTAACATGGCTCTGGTAACCCTTTTTATGGAGAAT ATAATAAT GAATTC GRRGASGANAGCHS ATGGAAAATAAAAGATATAATTTGCGGCCGCTACCCGGGGGTCGCCGG AATAAGCC CTTACTA MRTVGDGGLSRRVP GGCGCATCTGGAGCTAACGCTGGATGCCACAGCATGCGAACCGTTGGC CCCTAAAA GTAATA LEKNVTAEQSSSPLTS GACGGGGGACTGAGCAGGCGGGTTCCCCTCGAAAAAAATGTTACAGCC TCCAACCA ATAATTA SSSHSPVSSIPSPSSTR GAACAATCATCCTCACCACTCACTTCTTCTTCCTCTCATTCACCGGTATCA TACGTCCG TCGCTG TLLNSPNSSPTSSHSS AGCATACCCTCACCATCCTCAACTCGCACCCTATTAAATTCCCCTAATTCA AGTCGAAC AAAATC LVIRSADVVQEALAN TCACCAACATCATCACATTCATCACTGGTTATAAGGTCAGCAGATGTTGT ATCTGATT TCCACCC YPAPTAGSIRARKKW GCAGGAGGCACTTGCCAATTACCCTGCACCCACTGCGGGCTCTATTCGT CTCGTGGG AAATATT TDIMNRYIWRTYLIIT GCGCGAAAAAAATGGACCGACATAATGAACCGTTATATTTGGCGCACAT GGGCGGA GCTTGG KCETTLLNNYLEPLHQ ATCTTATTATAACAAAATGCGAAACCACGTTATTGAATAATTATTTAGAA CACGTGAA CTATATG EFSSKFPEMQVTRQR CCCCTACATCAAGAATTTAGTAGTAAATTTCCCGAAATGCAGGTAACTAG AATAA CTCGCA IGDQRRAIIRNKLLSD ACAACGTATAGGAGATCAACGTAGAGCAATAATAAGAAATAAACTACTA (SEQID ATTTTTG DTLAQILIEVKELLQIG TCAGATGATACCTTAGCTCAAATTTTAATTGAAGTAAAAGAACTATTACA NO:1261) GTTAAC DQPLTQNNIHSTQLS AATCGGTGATCAACCACTTACACAAAATAACATTCATTCAACTCAACTTT GTACCC HSNTRIKWSNELNEE CACACTCAAACACTAGAATAAAATGGTCAAACGAACTTAATGAAGAAAT CAATGA IVKCYFEVTLLEVNKT TGTCAAATGTTACTTTGAAGTCACCCTACTTGAAGTTAATAAGACTTCTT TTTGGG SYRKNLYSLFISRNPH ATCGTAAAAACCTGTACAGTTTATTTATATCTAGAAATCCACACCTATCA AGAACA LSHLTEQRIADQRRLI CACCTTACAGAACAAAGAATAGCTGACCAAAGACGACTAATATTTATGA AAAATG FMNKSVHNDRIIELK ACAAAAGTGTACACAACGATAGAATTATTGAATTAAAAAGAGAAGTAG GTAAAA REVEIKLANSNSLTKN AAATTAAATTAGCTAACAGTAATTCTTTAACAAAAAATATTACAGAGTCA CTATAAT ITESNSPSSQTNEIND AATTCTCCTAGCAGTCAAACAAATGAAATAAATGATTCTGCATATGTACA AATAAT SAYVQSNLQPVEPLD AAGTAATTTACAACCCGTTGAACCACTAGACCAACATTGTATAAATAGAC AATTATA QHCINRHNLIEKHYV ACAATTTAATAGAAAAACATTATGTTGAACAAGAATTTAATAATGCTTTA TTAATAT EQEFNNALIQFNNTN ATTCAATTTAACAATACAAATCCTGAAACCAGACCCTATATACCAAGACA (SEQID PETRPYIPRQKSSRKF AAAGTCATCGCGTAAATTTTCCCAAATTGTCAGTTTCTTAAATAGTGAAG NO: SQIVSFLNSEVLPKHL TACTTCCAAAACATTTAAATAACGAATTAGATTTCAATGCTCTTCATAACA 1384) NNELDFNALHNIIYTA TTATATATACMGCTGYCTACACTGCATCTCTGTGTAATGGAACAAAATTC XYTASLCNGTKFSFID AGTTTTATTGATAATTACCGACCTCGTAACAGCAAGCCATCGTGGCAAA NYRPRNSKPSWQRR GAAGACTTGAAAGTAGAATAGACAAATATAGATTGCAAATAGGTAGAC LESRIDKYRLQIGRLT TTACACAATATATATCAGGAAATAGGAACCGTAAAATATTAAAAACTGT QYISGNRNRKILKTVE AGAAGAAATAAAAACACAATATAAAATTCATTCACATCACGAAGAACCT EIKTQYKIHSHHEEPN AATACAGAATTACCACATTTCTTAGACACACTAAAACAAAAATTAAATGC TELPHFLDTLKQKLNA AACTTCGAACAGACTTAGAAGATATTTGACATGTACAAAACGGAAGCAA TSNRLRRYLTCTKRK CAAAATAATACATTCGTTAATAACGAAAAACATTTTTATAGAACTCTATC QQNNTFVNNEKHFY TAGCACAAACCAAAATACCACAACACAATTACWAGAACATCCAACGGA RTLSSTNQNTTTQLX AAATAATTTGCAACAATACTGGGCCAATATTTGGGAAACTTCAATAGAA EHPTENNLQQYWAN CAYAATGCAGACGCAGAATGGTTAAATAAAATACCTGACKTAGAAATAA IWETSIEHNADAEWL ATKTAATGAAATTCAAAGATATATCCATAGAAACATTCAATCAAATCATC NKIPDXEINXMKFKDI CAGCGAACACACAATTGGAAGGCACCGGGAACCGATAATATACACAAC SIETFNQIIQRTHNW TAYTGGTACAAAAAGCTAACATGCACTCATTCACTTCTTCTTAAACACAT KAPGTDNIHNYWYK AAACCAGTTCATACAATCACCTTGCACACTACCTTTATTTATAACAAACG KLTCTHSLLLKHINQFI GAATAACGTATATGTTACCGAAAGGCTTAGATCCCACAAACCCAGCTAA QSPCTLPLFITNGITY CTATAGGCCAATAACATGCCTCCAAACGATATATAAAATTATCACTGCGT MLPKGLDPTNPANY GTATTACTGACATAATTTATAAACATATCGATCAAAATAACATATTAGCT RPITCLQTIYKIITACIT GAACAACAGAAAGGCTGTAGGAAAAATAGTCAAGGGTGTAAAGAGCA DIIYKHIDQNNILAEQ GCTTACTATAGACGCAATAGTTATGAAACAGGCACATAATAAAAATAHT QKGCRKNSQGCKEQ AATACRATGTATATAGACTACAGAAAAGCATTTGATTCAGTTCCACACA LTIDAIVMKQAHNKN GTTGGTTACTATATATTTTAAAAAAGTACAAGATTCACCCCATTCTAATA XNTMYIDYRKAFDSV ACATTTTTAAGCTCCGTYATGCTCTCATGGAAAACTAGATTAAAATTGAT PHSWLLYILKKYKIHPI TAACAATAATGAAACACTGATTACTGATTGGATTAAAATACAACGAGGC LITFLSSVMLSWKTRL ATCTTTCAAGGTGATGCGCTGAGTCCATTATGGTTCTGTCTAGCGTTAAA KLINNNETLITDWIKI CCCTCTCTCTGAATTGCTTAATAATACCAATACAGGATTTAAACTAAAAC QRGIFQGDALSPLWF ATAATAACACATATCATATAATTTCACATTTAATGTACATGGACGATATC CLALNPLSELLNNTNT AAACTATACGCCAGTAATAATAAAGAACTAAAAATACTAGCCGACCTCA GFKLKHNNTYHIISHL CACAATCCTTCTCTACAGACATACGCATGGAATTTGGAATTGAAAAATGC MYMDDIKLYASNNK AAGGTACATTCAATTAAACGTGGCAAATCACAACAAAATACATATATAC ELKILADLTQSFSTDIR TTAACACTGGTGAACAAATTGAATCAATGGACGAAAATAGCACGTATAA MEFGIEKCKVHSIKR ATACCTTGGATTTCAACAAGCTAAGCAAATACAACAAAAACAAACCAAG GKSQQNTYILNTGEQ ATAGAACTAACCAATAAATTTAAATTTCGTTTGAATCAAATTTTAAGATC IESMDENSTYKYLGF ACAATTAAACTCACGTAATATTATAAAAGCCATCAACACATACGCAATAC QQAKQIQQKQTKIEL CGATCCTTACTTACTCTTTTGCTATAATAAACTGGTCTCAAACTGATTTAT TNKFKFRLNQILRSQL CGAATCTGCAACGTATAATAAATACACACATGACTACACATAGGAAACA NSRNIIKAINTYAIPILT TCACCCTAAATCATGTATCCAGCGTCTAACTATATCACGTCTGGATGGAG YSFAIINWSQTDLSNL GACGAGGTCTCATTGATATTCGTAACCTACATAATAATCTAGTGACTAAA QRIINTHMTTHRKHH TTCCGAAATTACTTTTACGCTAAGGCTGAAATTAGTGAACTACACAAATT PKSCIQRLTISRLDGG CATAGTTAATATAGATAATAAATATACACCACTTAATTTAAACGATAGAA RGLIDIRNLHNNLVTK ATATACAACTAAATCAAACTTTGATAACTAAGCAGCAAAAAATAGAAGC FRNYFYAKAEISELHK ATGGTCTTTAAAGTCACTACACGGTAGACATCTAGCCGATCTTAGCCAA FIVNIDNKYTPLNLND ACTCACGTCGACAAAGTTGCGTCGAACGAATGGCTACGACGGGGAGAT RNIQLNQTLITKQQKI CTCTTCCCAGAGACGGAAGCCTTCATGATGGCTATTCAGGACCAAGTAA EAWSLKSLHGRHLAD TAGACACTAGGAACTATCAAAAACACATCATAAAAAGACCAAATATGGT LSQTHVDKVASNEW AAACGATTTATGCAGAAGATGCTATTCTAGTCCTGAAACAATCCAACAC LRRGDLFPETEAFM ATTACCGGAGCCTGTAAAACCATAGTACAAACCGATTACAAACATAGAC MAIQDQVIDTRNYQ ATGATCAGGTCGCTGCAATTATTCATCAACACCTAGCATTTAAACATTCT KHIIKRPNMVNDLCR CTCATCACACAAGCACAAAAAACTCCCTATTACAAATATTCACCTCAGGC RCYSSPETIQHITGAC TATTTTAGAATCTACAAATTTTAAGCTGTATTGGGATCGTACAATCATCA KTIVQTDYKHRHDQV CAGATAAAACCGTACACTACAATAGACCGGATATTTTACTTCATGATAAA AAIIHQHLAFKHSLIT GTWAAAAWTTCTGTATACTTGATAGACATAGCCATTCCAAATACACACA QAQKTPYYKYSPQAI ATCTTGCATCCACTTTTTCAAACAAAATAGATAAGTATACAGATTTAACC LESTNFKLYWDRTIIT ATAGAGCTCAAAAGTCAATGGAAAGTCCAATCCGTCACTACAGTACCAA DKTVHYNRPDILLHD TAGTTCTTTCAACAACCGGAGTAGTACCTCATACGCTACACACGAGTCTG KVKXSVYLIDIAIPNTH GAAACTTTGGGTATTCACCGGTTAAGTTATATTCTTTTACAGAAAGCCGC NLASTFSNKIDKYTDL AATCTTAAATACATGCCGTATAGTTAGGAAGTTTTTATCATCAAATAATT TIELKSQWKVQSVTT AACTTATTGTCAGAATTCCTTACTAGTAATAATAATTATCGCTGAAAATC VPIVLSTTGVVPHTLH TCCACCCAAATATTGCTTGGCTATATGCTCGCAATTTTTGGTTAACGTAC TSLETLGIHRLSYILLQ CCCAATGATTTGGGAGAACAAAAATGGTAAAACTATAATAATAATAATT KAAILNTCRIVRKFLSS ATATTAATAT(SEQIDNO:1138) NN(SEQIDNO: 1506) R4 Rex6 Takifugu TTCTATGCGCCTTATGCGACTGGATAGGCCAGTGGTTTACGCCGCTGAC TTCTATGC TAGAGG MSGTXTDRVIPARTS rubripes TTTGGTGCGGAAGGTTGTCGGTTCGAATCCAGGCGAGCCCTTAGGCAA GCCTTATG ACCCGA PGSTRSASGVGEPGP GGCTCCTTACGCATATATGCCTACACCTCGGTATGAGCGGAACMWTAA CGACTGGA GTCTGA PDVKLATGTRHSWS CTGATAGAGTCATACCGGCTCGGACGTCGCCCGGGTCAACAAGGTCCG TAGGCCAG AGGAAG RAENVVLMECYYGS CGTCAGGGGTCGGGGAACCAGGGCCACCTGATGTCAAATTGGCTACTG TGGTTTAC GAGGCA NPSERGYMQRMWE GAACAAGGCATTCATGGAGCAGGGCAGAGAACGTGGTACTGATGGAG GCCGCTGA CCGCCC KWVLRNPTSSLTKKQ TGTTACTACGGGAGTAACCCCAGCGAGAGAGGGTACATGCAGAGGATG CTTTGGTG AGGAGG LLAQCSNIRNKKLLSQ TGGGAGAAATGGGTGCTTCGAAACCCCACATCCTCACTGACTAAGAAGC CGGAAGG GCGAGG LEIDEARRCASPTVQI AGCTCTTAGCTCAGTGTTCGAATATCCGCAATAAGAAGCTGCTATCACA TTGTCGGT AAGAGA CYGKGEPGRQVSXG GCTAGAGATTGACGAGGCACGGCGNTGTGCCTCACCCACGGTACAAAT TCGAATCC TTTTTTT VISSSPPNIEIGYKAP ATGCTACGGCAAGGGGGAGCCAGGACGCCAGGTCAGCNTGGGGGTGA AGGCGAG TTATATA MTDGLGTRAADLRE TTTCATCATCACCCCCCAATATCGAGATTGGGTACAAAGCCCCAATGACA CCCTTAGG TATATAT RIMKSWGNSTTSLPR GACGGACTCGGCACAAGGGCAGCTGACCTGAGAGAGAGAATCATGAA CAAGGCTC ATATATA LTHKVPDQSLLEDM GTCCTGGGGGAACTCAACTACCTCCCTGCCAAGGCTAACACACAAAGTA CTTACGCA TA(SEQ NTALSTIPTTTITETN CCAGACCAATCTCTACTGGAGGACATGAACACGGCACTGTCAACCATCC TATATGCC IDNO: QLMYAAATVILQML CTACCACTACCATCACTGAGACCAATCAGCTGATGTATGCAGCGGCAAC TACACCTC 1385) GYKMKSMNSQKEQ AGTAATCCTACAGATGCTTGGCTATAAGATGAAGAGCATGAACAGCCAG GGT(SEQ MAPWRRRLEAKIMA AAGGAGCAAATGGCCCCATGGAGGAGAAGGCTAGAGGCAAAAATCAT IDNO: TRREVSLLTELSRGVN GGCGACACGGAGAGAAGTCAGCCTTCTAACAGAGCTGAGTAGAGGCGT 1262) LRTEXPKKYNKLSTTE GAATCTAAGGACAGAGCNGCCCAAGAAATATAACAAACTGTCCACAACT ALETAKQRLTALATRL GAGGCACTGGAGACTGCTAAGCAAAGGCTCACAGCCCTGGCTACCCGA KRYTREVEARRINKVF CTAAAGAGGTACACAAGAGAAGTAGAGGCAAGGAGAATAAACAAGGT STNPAKVYSQWQGN GTTCTCCACCAATCCAGCNAAGGTCTACTCTCAATGGCAGGGCAACAAG KMTTDPPRAETEQY ATGACAACAGACCCCCCCAGGGCTGAGACTGAGCAATACTGGAAGAGT WKSIWEKEATHNTX ATCTGGGAGAAAGAGGCAACGCACAACACTANTGCCCAATGGCTGCAA AQWLQDLQTEHSQL GACCTACAGACAGAGCACAGCCAACTTCCAGAACAAGACCCAGTAGTCA PEQDPVVITLADIQTR TCACCTTAGCAGACATCCAAACAAGAGTGTCCAAAATGAAGAGCTGGAC VSKMKSWTAPGPDK AGCACCAGGGCCCGATAAGATCCACGCCTACTGGCTTAAGAAGCTGACT IHAYWLKKLTALHERL GCACTCCATGAACGCCTGGCAGCACAGATGAACCAGCTGCTAACATCAG AAQMNQLLTSGNHP GGAACCACCCAGAGTGGCTAACCCAGGGCCGGACAGTCCTCATAATGA EWLTQGRTVLIMKD AGGACCCCCAGAAGGGCACAATACCATCCAACTACCGGCCCATAACCTG PQKGTIPSNYRPITCL CCTCAGCACCACATGGAAGCTCCTATCAGGCATCATAGCGGCTAAGATC STTWKLLSGIIAAKISR AGTAGGCACATGGATCAATACATGAGCAGAGCACAGAAAGGCATAGGC HMDQYMSRAQKGI AACAACACCAGAGGTGCCAAGCACCAGCTACTGGTCGACAGGGCAATC GNNTRGAKHQLLVD GCCCAGGACTGTAGGACGCGGCACACCAACCTGTGCACTGCCTGGATT RAIAQDCRTRHTNLC GATTACAAGAAAGCCTATGACTCAATGCCGCACACATGGATACTGGAGT TAWIDYKKAYDSMP GCCTAAAGCTGTATAACATCAACAGGACACTAAGAGAGTTCATCCAGAA HTWILECLKLYNINRT CTCCATGAAGCTGTGGAACACGACTCTGGAGGCCAACTCAAAGCCAATT LREFIQNSMKLWNTT GCGCGGGTGAGCATCAGATGCGGCATATATCAAGGAGATGCCCTGTCC LEANSKPIARVSIRCGI CCCCTGCTGTTCTGCATAGGCCTAAACCCCCTCAGCCAGATCATCACAAA YQGDALSPLLFCIGLN GAGTGGCTATGGGTACCAGTTCCGAAGTGGAACAACCGTCAGCCACCTC PLSQIITKSGYGYQFR CTCTACATGGATGACATCAAGCTGTATGCCAAGAACGAGCGTGACATCG SGTTVSHLLYMDDIK ACTCCCTGATTCACCTCACTAGGATCTACAGCAAAGACATCGGGATGTC LYAKNERDIDSLIHLT ATTCGGACTAGATAAATGTGGGCGGATGATATCTAGAAGAGGAAAGGT RIYSKDIGMSFGLDKC GATCGCAACTGATGGGGTTGAACTACCTGAAGGGAACATCACAGATGT GRMISRRGKVIATDG GCAGGACAGTTACAAATACCTGGGGATCCCGCAGGCAAATGGTAACCA VELPEGNITDVQDSY TGAGGAGGCAGCTAGGAGGTCAGCCACAGCCAAATACCTACAGAGGTT KYLGIPQANGNHEEA AAGGCAGGTCCTGAAAAGTCAGCTGAATGGTAAGAATAAGATCCAGGC ARRSATAKYLQRLRQ CATAAACACCTACGCCCTGCCAGTAATCAGATACCCTGCTGGCATAATAC VLKSQLNGKNKIQAI CCTGGCCACTGGAAGAGATACAAGCCACTGACATCAAGACAAGGAAGC NTYALPVIRYPAGIIP TGAATGGTAAGCATAAGATCCAGGCCATAAACACCTACGCCCTGCCAGT WPLEEIQATDIKTRKL AATCAGATACCCTGCTGGCATAATACCCTGGCCACTGGAAGAGATACAA NGKHKIQAINTYALP GCCACTGACATCAAGACAAGGAAGCTCCTCACCATGCACGGAGGGTTTC VIRYPAGIIPWPLEEI ACCCTAAGTCCAGCGTCCTGAGGCTGTACACGAAGCGAAAGGAAGGGG QATDIKTRKLLTMHG GCCGAGGACTAGTAAGTGTCCGAACTACTGTCCAGGAGGAAACAACAA GFHPKSSVLRLYTKRK GCCTCCGAGAATACATCAAGAAGATGGCCCCCACTGACCGACTGCTGAG EGGRGLVSVRTTVQE TGAATGCCTCAGGCAACAAAAGCCCACCAAGGAGGAGGAGCCTGAGG ETTSLREYIKKMAPTD GGCTATCATGGAAGGACAAGCCCCTGCATGGNATGTACCACCGACAAA RLLSECLRQQKPTKEE TTGAGGAAGTGGCTGATATCGAGAAAACATACCAGTGGCTGGAAAAGG EPEGLSWKDKPLHG CCGGACTGAAAGACAGCACAGAGGCACTACTCATGGCTGCACAAGAAC MYHRQIEEVADIEKT AGGCCCTGAGCACCAGAGCAATAGAGGCCAGGGTCTACCATACCAGAC YQWLEKAGLKDSTEA AAGACCCCAGGTGCAGGCTGTGTGGAGACGCCCCTGAGACAGTCCAGC LLMAAQEQALSTRAI ACATCACAGCAGGGTGCAAGATGTTGGCAGGCAAGGCATACATGGAGC EARVYHTRQDPRCRL GGCATAACCAGGTGGCTGGCATAGTGTACAGGAACATCTGCACTGAGT CGDAPETVQHITAGC ATGGACTGGAGGTCCCAGGGTCCAGGTGGGAGACACCCCCGAAAGTGC KMLAGKAYMERHN TGGAGAACAAGCAGGCCAAGATCCTGTGGGACTTCCAGATCCAGACTG QVAGIVYRNICTEYGL ACAAGATGGTGGTGGCCAACCAGCCTGACATAGTGGTGGTGGATAAAC EVPGSRWETPPKVLE ACCAGAAGACAGTGGTGGTGATAGATGTAGCAATCCCAAGTGATAGCA NKQAKILWDFQIQTD ACATCAGGAAGAAGGAACACGAGAAGCTGGAGAAGTACCAAGGGCTG KMVVANQPDIVVVD AAGGAGGAGATGGAGAGAATGTGGGGCATGAAGGCAACAGTGGTCCC KHQKTVVVIDVAIPS AGTAGTGATTGGGACACTAGGGGCAGTAACACCCAAGCTGAGTAGATG DSNIRKKEHEKLEKYQ GCTCCAACAGATACCAGGAACCACATCAGAGATCTCTGTCCAGAAGAGC GLKEEMERMWGMK GCAGTCCTAGGAACAGCTAAGATCCTGCGCAGAACCCTCAGACTCCCAG ATVVPVVIGTLGAVT GCCTCTGGTAGAGGACCCGAGTCTGAAGGAAGGAGGCACCGCCCAGG PKLSRWLQQIPGTTS AGGGCGAGGAAGAGATTTTTTTTTATATATATATATATATATATA(SEQ EISVQKSAVLGTAKIL IDNO:1139) RRTLRLPGLW(SEQ IDNO:1507)
Gene Writers, e.g. Thermostable GENE WRITER? Genome Editor Polypeptides
(51) While not wishing to be bound by theory, in some embodiments, retrotransposases that evolved in cold environments may not function as well at human body temperature. This application provides a number of thermostable GENE WRITER? genome editor polypeptides, including proteins derived from avian retrotransposases. Exemplary avian transposase sequences in Table 3 include those of Taeniopygia guttata (zebra finch; transposon name R2-1_TG), Geospiza fortis (medium ground finch; transposon name R2-1_Gfo), Zonotrichia albicollis (white-throated sparrow; transposon name R2-1_ZA), and Tinamus guttatus (white-throated tinamou; transposon name R2-1_TGut).
(52) Thermostability may be measured, e.g., by testing the ability of a GENE WRITER? to polymerize DNA in vitro at a high temperature (e.g., 37? C.) and a low temperature (e.g., 25? C.). Suitable conditions for assaying in vitro DNA polymerization activity (e.g., processivity) are described, e.g., in Bibillo and Eickbush, High Processivity of the Reverse Transcriptase from a Non-long Terminal Repeat Retrotransposon (2002) JBC 277, 34836-34845. In some embodiments, the thermostable GENE WRITER? polypeptide has an activity, e.g., a DNA polymerization activity, at 37? ? C. that is no less than 70%, 75%, 80%, 85%, 90%, or 95% of its activity at 25? C. under otherwise similar conditions.
(53) In some embodiments, a GENE WRITER? polypeptide (e.g., a sequence of Table 1, 2, or 3 or a sequence having at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity thereto) is stable in a subject chosen from a mammal (e.g., human) or a bird. In some embodiments, a GENE WRITER? polypeptide described herein is functional at 37? C. In some embodiments, a GENE WRITER? polypeptide described herein has greater activity at 37? C. than it does at a lower temperature, e.g., at 30? C., 25? C., or 20? C. In some embodiments, a GENE WRITER? polypeptide described herein has greater activity in a human cell than in a zebrafish cell.
(54) In some embodiments, a GENE WRITER? polypeptide is active in a human cell cultured at 37? C., e.g., using an assay of Example 6 or Example 7 herein.
(55) In some embodiments, the assay comprises steps of: (1) introducing HEK293T cells into one or more wells of 6.4 mm diameter, at 10,000 cells/well, (2) incubating the cells at 37? ? C. for 24 hr, (3) providing a transfection mixture comprising 0.5 ?l if FuGENE? HD transfection reagent and 80 ng DNA (wherein the DNA is a plasmid comprising, in order, (a) CMV promoter, (b) 100 bp of sequence homologous to the 100 bp upstream of the target site, (c) sequence encoding a 5 untranslated region that binds the GENE WRITER? protein, (d) sequence encoding the GENE WRITER? protein, (e) sequence encoding a 3 untranslated region that binds the GENE WRITER? protein (f) 100 bp of sequence homologous to the 100 bp downstream of the target site, and (g) BGH polyadenylation sequence) and 10 ?l Opti-MEM and incubating for 15 min at room temperature, (4) adding the transfection mixture to the cells, (5) incubating the cells for 3 days, and (6) assaying integration of the exogenous sequence into a target locus (e.g., rDNA) in the cell genome, e.g., wherein one or more of the preceding steps are performed as described in Example 6 herein.
(56) In some embodiments, the GENE WRITER? polypeptide results in insertion of the heterologous object sequence (e.g., the GFP gene) into the target locus (e.g., rDNA) at an average copy number of at least 0.01, 0.025, 0.05, 0.075, 0.1, 0.15, 0.2, 0.25, 0.3, 0.4, 0.5, 0.75, 1, 1.25, 1.5, 1.75, 2, 2.5, 3, 4, or 5 copies per genome. In some embodiments, a cell described herein (e.g., a cell comprising a heterologous sequence at a target insertion site) comprises the heterologous object sequence at an average copy number of at least 0.01, 0.025, 0.05, 0.075, 0.1, 0.15, 0.2, 0.25, 0.3, 0.4, 0.5, 0.75, 1, 1.25, 1.5, 1.75, 2, 2.5, 3, 4, or 5 copies per genome.
(57) In some embodiments, a GENE WRITER? causes integration of a sequence in a target RNA with relatively few truncation events at the terminus. For instance, in some embodiments, a GENE WRITER? protein (e.g., of SEQ ID NO: 1016) results in about 25-100%, 50-100%, 60-100%, 70-100%, 75-95%, 80%-90%, or 86.17% of integrants into the target site being non-truncated, as measured by an assay described herein, e.g., an assay of Example 6 and
(58) In some embodiments, a system or method described herein results in insertion of the heterologous object sequence only at one target site in the genome of the target cell. Insertion can be measured, e.g., using a threshold of above 1%, 1.5%, 2%, 2.5%, 3%, 3.5%, 4%, 4.5%, 5%, e.g., as described in Example 8. In some embodiments, a system or method described herein results in insertion of the heterologous object sequence wherein less than 1%, 1.5%, 2%, 2.5%, 3%, 3.5%, 4%, 4.5%, 5%, 10%, 20%, 30%, 40%, or 50% of insertions are at a site other than the target site, e.g., using an assay described herein, e.g., an assay of Example 8.
(59) In some embodiments, a system or method described herein results in scarless insertion of the heterologous object sequence, while in some embodiments, the target site can show deletions or duplications of endogenous DNA as a result of insertion of the heterologous sequence. The mechanisms of different retrotransposons could result in different patterns of duplications or deletions in the host genome occurring during retrotransposition at the target site. In some embodiments, the system results in a scarless insertion, with no duplications or deletions in the surrounding genomic DNA. In some embodiments, the system results in a deletion of less than 1, 2, 3, 4, 5, 10, 50, or 100 bp of genomic DNA upstream of the insertion. In some embodiments, the system results in a deletion of less than 1, 2, 3, 4, 5, 10, 50, or 100 bp of genomic DNA downstream of the insertion. In some embodiments, the system results in a duplication of less than 1, 2, 3, 4, 5, 10, 50, or 100 bp of genomic DNA upstream of the insertion. In some embodiments, the system results in a duplication of less than 1, 2, 3, 4, 5, 10, 50, or 100 bp of genomic DNA downstream of the insertion.
(60) In some embodiments, a GENE WRITER? described herein, or a DNA-binding domain thereof, binds to its target site specifically, e.g., as measured using an assay of Example 21. In some embodiments, the GENE WRITER? or DNA-binding domain thereof binds to its target site more strongly than to any other binding site in the human genome. For example, in some embodiments, in an assay of Example 21, the target site represents more than 50%, 60%, 70%, 80%, 90%, or 95% of binding events of the GENE WRITER? or DNA-binding domain thereof to human genomic DNA.
(61) Genetically Engineered, e.g., Dimerized GENE WRITER? Genome Editor Polypeptides
(62) Some non-LTR retrotransposons utilize two subunits to complete retrotransposition (Christensen et al PNAS 2006). In some embodiments, a retrotransposase described herein comprises two connected subunits as a single polypeptide. For instance, two wild-type retrotransposases could be joined with a linker to form a covalently dimerized protein (see
(63) Based on mechanism, not all functions are required from both retrotransposase subunits. In some embodiments, the fusion protein may consist of a fully functional subunit and a second subunit lacking one or more functional domains. In some embodiments, one subunit may lack reverse transcriptase functionality. In some embodiments, one subunit may lack the reverse transcriptase domain. In some embodiments, one subunit may possess only endonuclease activity. In some embodiments, one subunit may possess only an endonuclease domain. In some embodiments, the two subunits comprising the single polypeptide may provide complimentary functions.
(64) In some embodiments, one subunit may lack endonuclease functionality. In some embodiments, one subunit may lack the endonuclease domain. In some embodiments, one subunit may possess only reverse transcriptase activity. In some embodiments, one subunit may possess only a reverse transcriptase domain. In some embodiments, one subunit may possess only DNA-dependent DNA synthesis functionality.
(65) Linkers:
(66) In some embodiments, domains of the compositions and systems described herein (e.g., the endonuclease and reverse transcriptase domains of a polypeptide or the DNA binding domain and reverse transcriptase domains of a polypeptide) may be joined by a linker. A composition described herein comprising a linker element has the general form S1-L-S2, wherein S1 and S2 may be the same or different and represent two domain moieties (e.g., each a polypeptide or nucleic acid domain) associated with one another by the linker. In some embodiments, a linker may connect two polypeptides. In some embodiments, a linker may connect two nucleic acid molecules. In some embodiments, a linker may connect a polypeptide and a nucleic acid molecule. A linker may be a chemical bond, e.g., one or more covalent bonds or non-covalent bonds. A linker may be flexible, rigid, and/or cleavable. In some embodiments, the linker is a peptide linker. Generally, a peptide linker is at least 2, 3, 4, 5, 6, 7, 8, 9, 10 or more amino acids in length, e.g., 2-50 amino acids in length, 2-30 amino acids in length.
(67) The most commonly used flexible linkers have sequences consisting primarily of stretches of Gly and Ser residues (GS linker). Flexible linkers may be useful for joining domains that require a certain degree of movement or interaction and may include small, non-polar (e.g. Gly) or polar (e.g. Ser or Thr) amino acids. Incorporation of Ser or Thr can also maintain the stability of the linker in aqueous solutions by forming hydrogen bonds with the water molecules, and therefore reduce unfavorable interactions between the linker and the other moieties. Examples of such linkers include those having the structure [GGS].sup.?1 or [GGGS].sup.?1 (SEQ ID NO: 1536). Rigid linkers are useful to keep a fixed distance between domains and to maintain their independent functions. Rigid linkers may also be useful when a spatial separation of the domains is critical to preserve the stability or bioactivity of one or more components in the agent. Rigid linkers may have an alpha helix-structure or Pro-rich sequence, (XP)n, with X designating any amino acid, preferably Ala, Lys, or Glu. Cleavable linkers may release free functional domains in vivo. In some embodiments, linkers may be cleaved under specific conditions, such as the presence of reducing reagents or proteases. In vivo cleavable linkers may utilize the reversible nature of a disulfide bond. One example includes a thrombin-sensitive sequence (e.g., PRS) between the two Cys residues. In vitro thrombin treatment of CPRSC (SEQ ID NO: 1537) results in the cleavage of the thrombin-sensitive sequence, while the reversible disulfide linkage remains intact. Such linkers are known and described, e.g., in Chen et al. 2013. Fusion Protein Linkers: Property, Design and Functionality. Adv Drug Deliv Rev. 65(10): 1357-1369. In vivo cleavage of linkers in compositions described herein may also be carried out by proteases that are expressed in vivo under pathological conditions (e.g. cancer or inflammation), in specific cells or tissues, or constrained within certain cellular compartments. The specificity of many proteases offers slower cleavage of the linker in constrained compartments.
(68) In some embodiments the amino acid linkers are (or are homologous to) the endogenous amino acids that exist between such domains in a native polypeptide. In some embodiments the endogenous amino acids that exist between such domains are substituted but the length is unchanged from the natural length. In some embodiments, additional amino acid residues are added to the naturally existing amino acid residues between domains.
(69) In some embodiments, the amino acid linkers are designed computationally or screened to maximize protein function (Anad et al., FEBS Letters, 587:19, 2013).
(70) Template RNA Component of GENE WRITER? Gene Editor System
(71) TheGENE WRITER? systems described herein can transcribe an RNA sequence template into host target DNA sites by target-primed reverse transcription. By writing DNA sequence(s) via reverse transcription of the RNA sequence template directly into the host genome, the GENE WRITER? system can insert an object sequence into a target genome without the need for exogenous DNA sequences to be introduced into the host cell (unlike, for example, CRISPR systems), as well as eliminate an exogenous DNA insertion step. Therefore, the GENE WRITER? system provides a platform for the use of customized RNA sequence templates containing object sequences, e.g., sequences comprising heterologous gene coding and/or function information.
(72) In some embodiments the template RNA encodes a GENE WRITER? protein in cis with a heterologous object sequence. Various cis constructs were described, for example, in Kuroki-Kami et al (2019) Mobile DNA 10:23 (incorporated by reference herein in its entirety), and can be used in combination with any of the embodiments described herein. For instance, in some embodiments, the template RNA comprises a heterologous object sequence, a sequence encoding a GENE WRITER? protein (e.g., a protein comprising (i) a reverse transcriptase domain and (ii) an endonuclease domain, e.g., as described herein), a 5 untranslated region, and a 3 untranslated region. The components may be included in various orders. In some embodiments, the GENE WRITER? protein and heterologous object sequence are encoded in different directions (sense vs. anti-sense), e.g., using an arrangement shown in
(73) The nucleic acid encoding the GENE WRITER? polypeptide may, in some instances, be 5 of the heterologous object sequence. For example, in some embodiments, the template RNA comprises, from 5 to 3, a 5 untranslated region, a sense-encoded GENE WRITER? polypeptide, a sense-encoded heterologous object sequence, and 3 untranslated region. In some embodiments, the template RNA comprises, from 5 to 3, a 5 untranslated region, a sense-encoded GENE WRITER? polypeptide, anti-sense-encoded heterologous object sequence, and 3 untranslated region.
(74) In some embodiments, the RNA further comprises homology to the DNA target site.
(75) It is understood that, when a template RNA is described as comprising an open reading frame or the reverse complement thereof, in some embodiments the template RNA must be converted into double stranded DNA (e.g., through reverse transcription) before the open reading frame can be transcribed and translated.
(76) In certain embodiments, customized RNA sequence template can be identified, designed, engineered and constructed to contain sequences altering or specifying host genome function, for example by introducing a heterologous coding region into a genome; affecting or causing exon structure/alternative splicing; causing disruption of an endogenous gene; causing transcriptional activation of an endogenous gene; causing epigenetic regulation of an endogenous DNA; causing up- or down-regulation of operably liked genes, etc. In certain embodiments, a customized RNA sequence template can be engineered to contain sequences coding for exons and/or transgenes, provide for binding sites to transcription factor activators, repressors, enhancers, etc., and combinations of thereof. In other embodiments, the coding sequence can be further customized with splice acceptor sites, poly-A tails. In certain embodiments the RNA sequence can contain sequences coding for an RNA sequence template homologous to the RLE transposase, be engineered to contain heterologous coding sequences, or combinations thereof.
(77) The template RNA may have some homology to the target DNA. In some embodiments the template RNA has at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 175, 200 or more bases of exact homology to the target DNA at the 3 end of the RNA. In some embodiments the template RNA has at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 175, 180, or 200 or more bases of at least 50%, 60%, 70%, 80%, 85%, 90%, 95%, 97%, 98%, 99% or 100% homology to the target DNA, e.g., at the 5 end of the template RNA. In some embodiments the template RNA has a 3 untranslated region derived from a non-LTR retrotransposon, e.g. a non-LTR retrotransposons described herein. In some embodiments the template RNA has a 3 region of at least 10, 15, 20, 25, 30, 40, 50, 60, 80, 100, 120, 140, 160, 180, 200 or more bases of at least 50%, 60%, 70%, 80%, 85%, 90%, 95%, 97%, 98%, 99% or 100% homology to the 3 sequence of a non-LTR retrotransposon, e.g., a non-LTR retrotransposon described herein, e.g. a non-LTR retrotransposon in Table 1, 2, or 3. In some embodiments the template RNA has a 5 untranslated region derived from a non-LTR retrotransposon, e.g. a non-LTR retrotransposons described herein. In some embodiments the template RNA has a 5 region of at least 10, 15, 20, 25, 30, 40, 50, 60, 80, 100, 120, 140, 160, 180, or 200 or more bases of at least 40%, 50%, 60%, 70%, 80%, 90%, 95% or greater homology to the 5 sequence of a non-LTR retrotransposon, e.g., a non-LTR retrotransposon described herein, e.g. a non-LTR retrotransposon described in Table 2 or 3.
(78) The template RNA component of a GENE WRITER? genome editing system described herein typically is able to bind the GENE WRITER? genome editing protein of the system. In some embodiments the template RNA has a 3 region that is capable of binding a GENE WRITER? genome editing protein. The binding region, e.g., 3 region, may be a structured RNA region, e.g., having at least 1, 2 or 3 hairpin loops, capable of binding the GENE WRITER? genome editing protein of the system.
(79) The template RNA component of a GENE WRITER? genome editing system described herein typically is able to bind the GENE WRITER? genome editing protein of the system. In some embodiments the template RNA has a 5 region that is capable of binding a GENE WRITER? genome editing protein. The binding region, e.g., 5 region, may be a structured RNA region, e.g., having at least 1, 2 or 3 hairpin loops, capable of binding the GENE WRITER? genome editing protein of the system. In some embodiments, the 5 untranslated region comprises a pseudoknot, e.g., a pseudoknot that is capable of binding to the GENE WRITER? protein.
(80) In some embodiments, the template RNA (e.g., an untranslated region of the hairpin RNA, e.g., a 5 untranslated region) comprises a stem-loop sequence. In some embodiments, the template RNA (e.g., an untranslated region of the hairpin RNA, e.g., a 5 untranslated region) comprises a hairpin. In some embodiments, the template RNA (e.g., an untranslated region of the hairpin RNA, e.g., a 5 untranslated region) comprises a helix. In some embodiments, the template RNA (e.g., an untranslated region of the hairpin RNA, e.g., a 5 untranslated region) comprises a psuedoknot. In some embodiments the template RNA comprises a ribozyme. In some embodiments the ribozyme is similar to an hepatitis delta virus (HDV) ribozyme, e.g., has a secondary structure like that of the HDV ribozyme and/or has one or more activities of the HDV ribozyme, e.g., a self-cleavage activity. See, e.g., Eickbush et al., Molecular and Cellular Biology, 2010, 3142-3150.
(81) In some embodiments, the template RNA (e.g., an untranslated region of the hairpin RNA, e.g., a 3 untranslated region) comprises one or more stem-loops or helices. Exemplary structures of R2 3 UTRs are shown, for example, in Ruschak et al. Secondary structure models of the 3 untranslated regions of diverse R2 RNAs RNA. 2004 June; 10(6): 978-987, e.g., at
(82) In some embodiments, a template RNA described herein comprises a sequence that is capable of binding to a GENE WRITER? protein described herein. For instance, in some embodiments, the template RNA comprises an MS2 RNA sequence capable of binding to an MS2 coat protein sequence in the GENE WRITER? protein. In some embodiments, the template RNA comprises an RNA sequence capable of binding to a B-box sequence. In some embodiments, the template RNA comprises an RNA sequence (e.g., a crRNA sequence and/or tracrRNA sequence) capable of binding to a dCas sequence in the GENE WRITER? protein. In some embodiments, in addition to or in place of a UTR, the template RNA is linked (e.g., covalently) to a non-RNA UTR, e.g., a protein or small molecule.
(83) In some embodiments the template RNA has a poly-A tail at the 3 end. In some embodiments the template RNA does not have a poly-A tail at the 3 end.
(84) In some embodiments the template RNA has a 5 region of at least 10, 15, 20, 25, 30, 40, 50, 60, 80, 100, 120, 140, 160, 180, 200 or more bases of at least 40%, 50%, 60%, 70%, 80%, 90%, 95% or greater homology to the 5 sequence of a non-LTR retrotransposon, e.g., a non-LTR retrotransposon described herein.
(85) The template RNA of the system typically comprises an object sequence for insertion into a target DNA. The object sequence may be coding or non-coding.
(86) In some embodiments a system or method described herein comprises a single template RNA. In some embodiments a system or method described herein comprises a plurality of template RNAs.
(87) In some embodiments, the object sequence may contain an open reading frame. In some embodiments the template RNA has a Kozak sequence. In some embodiments the template RNA has an internal ribosome entry site. In some embodiments the template RNA has a self-cleaving peptide such as a T2A or P2A site. In some embodiments the template RNA has a start codon. In some embodiments the template RNA has a splice acceptor site. In some embodiments the template RNA has a splice donor site. In some embodiments the template RNA has a microRNA binding site downstream of the stop codon. In some embodiments the template RNA has a polyA tail downstream of the stop codon of an open reading frame. In some embodiments the template RNA comprises one or more exons. In some embodiments the template RNA comprises one or more introns. In some embodiments the template RNA comprises a eukaryotic transcriptional terminator. In some embodiments the template RNA comprises an enhanced translation element or a translation enhancing element. In some embodiments the RNA comprises the human T-cell leukemia virus (HTLV-1) R region. In some embodiments the RNA comprises a posttranscriptional regulatory element that enhances nuclear export, such as that of Hepatitis B Virus (HPRE) or Woodchuck Hepatitis Virus (WPRE). In some embodiments, in the template RNA, the heterologous object sequence encodes a polypeptide and is coded in an antisense direction with respect to the 5 and 3 UTR. In some embodiments, in the template RNA, the heterologous object sequence encodes a polypeptide and is coded in a sense direction with respect to the 5 and 3 UTR.
(88) In some embodiments, a nucleic acid described herein (e.g., a template RNA or a DNA encoding a template RNA) comprises a microRNA binding site. In some embodiments, the microRNA binding site is used to increase the target-cell specificity of a GENE WRITER? system. For instance, the microRNA binding site can be chosen on the basis that is recognized by a miRNA that is present in a non-target cell type, but that is not present (or is present at a reduced level relative to the non-target cell) in a target cell type. Thus, when the template RNA is present in a non-target cell, it would be bound by the miRNA, and when the template RNA is present in a target cell, it would not be bound by the miRNA (or bound but at reduced levels relative to the non-target cell). While not wishing to be bound by theory, binding of the miRNA to the template RNA may interfere with insertion of the heterologous object sequence into the genome. Accordingly, the heterologous object sequence would be inserted into the genome of target cells more efficiently than into the genome of non-target cells. A system having a microRNA binding site in the template RNA (or DNA encoding it) may also be used in combination with a nucleic acid encoding a GENE WRITER? polypeptide, wherein expression of the GENE WRITER? polypeptide is regulated by a second microRNA binding site, e.g., as described herein, e.g., in the section entitled Polypeptide component of GENE WRITER? gene editor system.
(89) In some embodiments, the object sequence may contain a non-coding sequence. For example, the template RNA may comprise a promoter or enhancer sequence. In some embodiments the template RNA comprises a tissue specific promoter or enhancer, each of which may be unidirectional or bidirectional. In some embodiments the promoter is an RNA polymerase I promoter, RNA polymerase II promoter, or RNA polymerase III promoter. In some embodiments the promoter comprises a TATA element. In some embodiments the promoter comprises a B recognition element. In some embodiments the promoter has one or more binding sites for transcription factors. In some embodiments the non-coding sequence is transcribed in an antisense-direction with respect to the 5 and 3 UTR. In some the non-coding sequence is transcribed in a sense direction with respect to the 5 and 3 UTR.
(90) In some embodiments, a nucleic acid described herein (e.g., a template RNA or a DNA encoding a template RNA) comprises a promoter sequence, e.g., a tissue specific promoter sequence. In some embodiments, the tissue-specific promoter is used to increase the target-cell specificity of a GENE WRITER? system. For instance, the promoter can be chosen on the basis that it is active in a target cell type but not active in (or active at a lower level in) a non-target cell type. Thus, even if the promoter integrated into the genome of a non-target cell, it would not drive expression (or only drive low level expression) of an integrated gene. A system having a tissue-specific promoter sequence in the template RNA may also be used in combination with a microRNA binding site, e.g., in the template RNA or a nucleic acid encoding a GENE WRITER? protein, e.g., as described herein. A system having a tissue-specific promoter sequence in the template RNA may also be used in combination with a DNA encoding a GENE WRITER? polypeptide, driven by a tissue-specific promoter, e.g., to achieve higher levels of GENE WRITER? protein in target cells than in non-target cells.
(91) In some embodiments the template RNA comprises a microRNA sequence, a siRNA sequence, a guide RNA sequence, a piwi RNA sequence.
(92) In some embodiments the template RNA comprises a site that coordinates epigenetic modification. In some embodiments the template RNA comprises an element that inhibits, e.g., prevents, epigenetic silencing. In some embodiments the template RNA comprises a chromatin insulator. For example, the template RNA comprises a CTCF site or a site targeted for DNA methylation.
(93) In order to promote higher level or more stable gene expression, the template RNA may include features that prevent or inhibit gene silencing. In some embodiments, these features prevent or inhibit DNA methylation. In some embodiments, these features promote DNA demethylation. In some embodiments, these features prevent or inhibit histone deacetylation. In some embodiments, these features prevent or inhibit histone methylation. In some embodiments, these features promote histone acetylation. In some embodiments, these features promote histone demethylation. In some embodiments, multiple features may be incorporated into the template RNA to promote one or more of these modifications. CpG dinculeotides are subject to methylation by host methyl transferases. In some embodiments, the template RNA is depleted of CpG dinucleotides, e.g., does not comprise CpG nucleotides or comprises a reduced number of CpG dinucleotides compared to a corresponding unaltered sequence. In some embodiments, the promoter driving transgene expression from integrated DNA is depleted of CpG dinucleotides.
(94) In some embodiments the template RNA comprises a gene expression unit composed of at least one regulatory region operably linked to an effector sequence. The effector sequence may be a sequence that is transcribed into RNA (e.g., a coding sequence or a non-coding sequence such as a sequence encoding a micro RNA).
(95) In some embodiments the object sequence of the template RNA is inserted into a target genome in an endogenous intron. In some embodiments the object sequence of the template RNA is inserted into a target genome and thereby acts as a new exon. In some embodiments the insertion of the object sequence into the target genome results in replacement of a natural exon or the skipping of a natural exon.
(96) In some embodiments the object sequence of the template RNA is inserted into the target genome in a genomic safe harbor site, such as AAVS1, CCR5, or ROSA26. In some embodiment the object sequence of the template RNA is added to the genome in an intergenic or intragenic region. In some embodiments the object sequence of the template RNA is added to the genome 5 or 3 within 0.1 kb, 0.25 kb, 0.5 kb, 0.75, kb, 1 kb, 2 kb, 3 kb, 4 kb, 5 kb, 7.5 kb, 10 kb, 15 kb, 20 kb, 25 kb, 50, 75 kb, or 100 kb of an endogenous active gene. In some embodiments the object sequence of the template RNA is added to the genome 5 or 3 within 0.1 kb, 0.25 kb, 0.5 kb, 0.75, kb, 1 kb, 2 kb, 3 kb, 4 kb, 5 kb, 7.5 kb, 10 kb, 15 kb, 20 kb, 25 kb, 50, 75 kb, or 100 kb of an endogenous promoter or enhancer. In some embodiments the object sequence of the template RNA can be, e.g., 50-50,000 base pairs (e.g., between 50-40,000 bp, between 500-30,000 bp between 500-20,000 bp, between 100-15,000 bp, between 500-10,000 bp, between 50-10,000 bp, between 50-5,000 bp. In some embodiments, the heterologous object sequence is less than 1,000, 1,300, 1500, 2,000, 3,000, 4,000, 5,000, or 7,500 nucleotides in length.
(97) In some embodiments the genomic safe harbor site is a NATURAL HARBOR? site. In some embodiments the NATURAL HARBOR? site is ribosomal DNA (rDNA). In some embodiments the NATURAL HARBOR? site is 5S rDNA, 18S rDNA, 5.8S rDNA, or 28S rDNA. In some embodiments the NATURAL HARBOR? site is the Mutsu site in 5S rDNA. In some embodiments the NATURAL HARBOR? site is the R2 site, the R5 site, the R6 site, the R4 site, the R1 site, the R9 site, or the RT site in 28S rDNA. In some embodiments the NATURAL HARBOR? site is the R8 site or the R7 site in 18S rDNA. In some embodiments the NATURAL HARBOR? site is DNA encoding transfer RNA (tRNA). In some embodiments the NATURAL HARBOR? site is DNA encoding tRNA-Asp or tRNA-Glu. In some embodiments the NATURAL HARBOR? site is DNA encoding spliceosomal RNA. In some embodiments the NATURAL HARBOR? site is DNA encoding small nuclear RNA (snRNA) such as U2 snRNA.
(98) Thus, in some aspects, the present disclosure provides a method of inserting a heterologous object sequence into a NATURAL HARBOR? site. In some embodiments, the method comprises using a GENE WRITER? system described herein, e.g., using a polypeptide of any of Tables 1-3 or a polypeptide having sequence similarity thereto, e.g., at least 80%, 85%, 90%, or 95% identity thereto. In some embodiments, the method comprises using an enzyme, e.g., a retrotransposase, to insert the heterologous object sequence into the NATURAL HARBOR? site. In some aspects, the present disclosure provides a host human cell comprising a heterologous object sequence (e.g., a sequence encoding a therapeutic polypeptide) situated at a NATURAL HARBOR? site in the genome of the cell. In some embodiments, the NATURAL HARBOR? site is a site described in Table 4 below. In some embodiments, the heterologous object sequence is inserted within 20, 50, 100, 150, 200, 250, 500, or 1000 base pairs of a sequence shown in Table 4. In some embodiments, the heterologous object sequence is inserted within 0.1 kb, 0.25 kb, 0.5 kb, 0.75, kb, 1 kb, 2 kb, 3 kb, 4 kb, 5 kb, 7.5 kb, 10 kb, 15 kb, 20 kb, 25 kb, 50, 75 kb, or 100 kb of a sequence shown in Table 4. In some embodiments, the heterologous object sequence is inserted into a site having at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity to a sequence shown in Table 4. In some embodiments, the heterologous object sequence is inserted within 20, 50, 100, 150, 200, 250, 500, or 1000 base pairs, or within 0.1 kb, 0.25 kb, 0.5 kb, 0.75, kb, 1 kb, 2 kb, 3 kb, 4 kb, 5 kb, 7.5 kb, 10 kb, 15 kb, 20 kb, 25 kb, 50, 75 kb, or 100 kb, of a site having at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identity to a sequence shown in Table 4. In some embodiments, the heterologous object sequence is inserted within a gene indicated in Column 5 of Table 4, or within 20, 50, 100, 150, 200, 250, 500, or 1000 base pairs, or within 0.1 kb, 0.25 kb, 0.5 kb, 0.75, kb, 1 kb, 2 kb, 3 kb, 4 kb, 5 kb, 7.5 kb, 10 kb, 15 kb, 20 kb, 25 kb, 50, 75 kb, or 100 kb, of the gene.
(99) TABLE-US-00004 TABLE4 NaturalHarbor? sites.Column1indicatesaretrotransposon thatinsertsintotheNaturalHarbor? site.Column2 indicatesthegeneattheNaturalHarbor? site.Columns3 and4showexemplaryhumangenomesequence5 and3 ofthe insertionsite(forexample,250bp).Columns5and6list theexamplegenesymbolandcorrespondingGeneID. Example Target Target Gene Example Site Gene 5 flankingsequence 3 flankingsequence Symbol GeneID R2 28SrDNA CCGGTCCCCCCCGCCGGGTCC GTAGCCAAATGCCTCGTCATC RNA28SN1 106632264 GCCCCCGGGGCCGCGGTTCC TAATTAGTGACGCGCATGAAT GCGCGGCGCCTCGCCTCGGC GGATGAACGAGATTCCCACT CGGCGCCTAGCAGCCGACTT GTCCCTACCTACTATCCAGCG AGAACTGGTGCGGACCAGGG AAACCACAGCCAAGGGAACG GAATCCGACTGTTTAATTAAA GGCTTGGCGGAATCAGCGGG ACAAAGCATCGCGAAGGCCC GAAAGAAGACCCTGTTGAGC GCGGCGGGTGTTGACGCGAT TTGACTCTAGTCTGGCACGGT GTGATTTCTGCCCAGTGCTCT GAAGAGACATGAGAGGTGTA GAATGTCAAAGTGAAGAAAT GAATAAGTGGGAGGCCCCCG TCAATGAAGCGCGGGTAAAC GCGCCCCCCCGGTGTCCCCGC GGCGGGAGTAACTATGACTC GAGGGGCCCGGGGCGGGGT TCTTAAG(SEQIDNO:1508) CCGCCG(SEQIDNO:1513) R4 28SrDNA GCGGTTCCGCGCGGCGCCTC CGCATGAATGGATGAACGAG RNA28SN1 106632264 GCCTCGGCCGGCGCCTAGCA ATTCCCACTGTCCCTACCTACT GCCGACTTAGAACTGGTGCG ATCCAGCGAAACCACAGCCA GACCAGGGGAATCCGACTGT AGGGAACGGGCTTGGCGGA TTAATTAAAACAAAGCATCGC ATCAGCGGGGAAAGAAGACC GAAGGCCCGCGGCGGGTGTT CTGTTGAGCTTGACTCTAGTC GACGCGATGTGATTTCTGCCC TGGCACGGTGAAGAGACATG AGTGCTCTGAATGTCAAAGT AGAGGTGTAGAATAAGTGGG GAAGAAATTCAATGAAGCGC AGGCCCCCGGCGCCCCCCCG GGGTAAACGGCGGGAGTAAC GTGTCCCCGCGAGGGGCCCG TATGACTCTCTTAAGGTAGCC GGGCGGGGTCCGCCGGCCCT AAATGCCTCGTCATCTAATTA GCGGGCCGCCGGTGAAATAC GTGACG(SEQIDNO:1509) CACTACTC(SEQIDNO: 1514) R5 28SrDNA TCCCCCCCGCCGGGTCCGCCC CCAAATGCCTCGTCATCTAAT RNA28SN1 106632264 CCGGGGCCGCGGTTCCGCGC TAGTGACGCGCATGAATGGA GGCGCCTCGCCTCGGCCGGC TGAACGAGATTCCCACTGTCC GCCTAGCAGCCGACTTAGAA CTACCTACTATCCAGCGAAAC CTGGTGCGGACCAGGGGAAT CACAGCCAAGGGAACGGGCT CCGACTGTTTAATTAAAACAA TGGCGGAATCAGCGGGGAAA AGCATCGCGAAGGCCCGCGG GAAGACCCTGTTGAGCTTGA CGGGTGTTGACGCGATGTGA CTCTAGTCTGGCACGGTGAA TTTCTGCCCAGTGCTCTGAAT GAGACATGAGAGGTGTAGAA GTCAAAGTGAAGAAATTCAA TAAGTGGGAGGCCCCCGGCG TGAAGCGCGGGTAAACGGCG CCCCCCCGGTGTCCCCGCGAG GGAGTAACTATGACTCTCTTA GGGCCCGGGGCGGGGTCCG AGGTAG(SEQIDNO:1510) CCGGCCC(SEQIDNO:1515) R9 28SrDNA CGGCGCGCTCGCCGGCCGAG TAGCTGGTTCCCTCCGAAGTT RNA28SN1 106632264 GTGGGATCCCGAGGCCTCTC TCCCTCAGGATAGCTGGCGCT CAGTCCGCCGAGGGCGCACC CTCGCAGACCCGACGCACCCC ACCGGCCCGTCTCGCCCGCCG CGCCACGCAGTTTTATCCGGT CGCCGGGGAGGTGGAGCAC AAAGCGAATGATTAGAGGTC GAGCGCACGTGTTAGGACCC TTGGGGCCGAAACGATCTCA GAAAGATGGTGAACTATGCC ACCTATTCTCAAACTTTAAAT TGGGCAGGGCGAAGCCAGA GGGTAAGAAGCCCGGCTCGC GGAAACTCTGGTGGAGGTCC TGGCGTGGAGCCGGGCGTGG GTAGCGGTCCTGACGTGCAA AATGCGAGTGCCTAGTGGGC ATCGGTCGTCCGACCTGGGT CACTTTTGGTAAGCAGAACTG ATAGGGGCGAAAGACTAATC GCGCTGCGGGATGAACCGAA GAACCATCTAG(SEQIDNO: CGCC(SEQIDNO:1516) 1511) R8 18SrDNA GCATTCGTATTGCGCCGCTAG TGAAACTTAAAGGAATTGAC RNA18SN1 106631781 AGGTGAAATTCTTGGACCGG GGAAGGGCACCACCAGGAGT CGCAAGACGGACCAGAGCGA GGAGCCTGCGGCTTAATTTG AAGCATTTGCCAAGAATGTTT ACTCAACACGGGAAACCTCA TCATTAATCAAGAACGAAAGT CCCGGCCCGGACACGGACAG CGGAGGTTCGAAGACGATCA GATTGACAGATTGATAGCTCT GATACCGTCGTAGTTCCGACC TTCTCGATTCCGTGGGTGGTG ATAAACGATGCCGACCGGCG GTGCATGGCCGTTCTTAGTTG ATGCGGCGGCGTTATTCCCAT GTGGAGCGATTTGTCTGGTT GACCCGCCGGGCAGCTTCCG AATTCCGATAACGAACGAGA GGAAACCAAAGTCTTTGGGT CTCTGGCATGCTAACTAGTTA TCCGGGGGGAGTATGGTTGC CGCGACCCCCGAGCGGTCGG AAAGC(SEQIDNO:1512) CGTCCC(SEQIDNO:1517) R4-2_SRa tRNA-Asp TRD-GTC1-1 100189207 LIN25_SM tRNA-Glu TRE-CTC1-1 100189384 R1 28SrDNA TAGCAGCCGACTTAGAACTG ACCTACTATCCAGCGAAACCA RNA28SN1 106632264 GTGCGGACCAGGGGAATCCG CAGCCAAGGGAACGGGCTTG ACTGTTTAATTAAAACAAAGC GCGGAATCAGCGGGGAAAG ATCGCGAAGGCCCGCGGCGG AAGACCCTGTTGAGCTTGACT GTGTTGACGCGATGTGATTTC CTAGTCTGGCACGGTGAAGA TGCCCAGTGCTCTGAATGTCA GACATGAGAGGTGTAGAATA AAGTGAAGAAATTCAATGAA AGTGGGAGGCCCCCGGCGCC GCGCGGGTAAACGGCGGGA CCCCCGGTGTCCCCGCGAGG GTAACTATGACTCTCTTAAGG GGCCCGGGGCGGGGTCCGCC TAGCCAAATGCCTCGTCATCT GGCCCTGCGGGCCGCCGGTG AATTAGTGACGCGCATGAAT AAATACCACTACTCTGATCGT GGATGAACGAGATTCCCACT TTTTTCACTGACCCGGTGAGG GTCCCT(SEQIDNO:1518) CGGGGGG(SEQIDNO: 1524) R6 28SrDNA CCCCCCGCCGGGTCCGCCCCC AAATGCCTCGTCATCTAATTA RNA28SN1 106632264 GGGGCCGCGGTTCCGCGCGG GTGACGCGCATGAATGGATG CGCCTCGCCTCGGCCGGCGC AACGAGATTCCCACTGTCCCT CTAGCAGCCGACTTAGAACT ACCTACTATCCAGCGAAACCA GGTGCGGACCAGGGGAATCC CAGCCAAGGGAACGGGCTTG GACTGTTTAATTAAAACAAAG GCGGAATCAGCGGGGAAAG CATCGCGAAGGCCCGCGGCG AAGACCCTGTTGAGCTTGACT GGTGTTGACGCGATGTGATT CTAGTCTGGCACGGTGAAGA TCTGCCCAGTGCTCTGAATGT GACATGAGAGGTGTAGAATA CAAAGTGAAGAAATTCAATG AGTGGGAGGCCCCCGGCGCC AAGCGCGGGTAAACGGCGG CCCCCGGTGTCCCCGCGAGG GAGTAACTATGACTCTCTTAA GGCCCGGGGCGGGGTCCGCC GGTAGCC(SEQIDNO:1519) GGCCCTG(SEQIDNO:1525) R7 18SrDNA GCGCAAGACGGACCAGAGCG GGAGCCTGCGGCTTAATTTG RNA18SN1 106631781 AAAGCATTTGCCAAGAATGTT ACTCAACACGGGAAACCTCA TTCATTAATCAAGAACGAAAG CCCGGCCCGGACACGGACAG TCGGAGGTTCGAAGACGATC GATTGACAGATTGATAGCTCT AGATACCGTCGTAGTTCCGAC TTCTCGATTCCGTGGGTGGTG CATAAACGATGCCGACCGGC GTGCATGGCCGTTCTTAGTTG GATGCGGCGGCGTTATTCCC GTGGAGCGATTTGTCTGGTT ATGACCCGCCGGGCAGCTTC AATTCCGATAACGAACGAGA CGGGAAACCAAAGTCTTTGG CTCTGGCATGCTAACTAGTTA GTTCCGGGGGGAGTATGGTT CGCGACCCCCGAGCGGTCGG GCAAAGCTGAAACTTAAAGG CGTCCCCCAACTTCTTAGAGG AATTGACGGAAGGGCACCAC GACAAGTGGCGTTCAGCCAC CAGGAGT(SEQIDNO:1520) CCGAG(SEQIDNO:1526) RT 28SrDNA GGCCGGGCGCGACCCGCTCC AACTGGCTTGTGGCGGCCAA RNA28SN1 106632264 GGGGACAGTGCCAGGTGGG GCGTTCATAGCGACGTCGCTT GAGTTTGACTGGGGCGGTAC TTTGATCCTTCGATGTCGGCT ACCTGTCAAACGGTAACGCA CTTCCTATCATTGTGAAGCAG GGTGTCCTAAGGCGAGCTCA AATTCACCAAGCGTTGGATTG GGGAGGACAGAAACCTCCCG TTCACCCACTAATAGGGAACG TGGAGCAGAAGGGCAAAAG TGAGCTGGGTTTAGACCGTC CTCGCTTGATCTTGATTTTCA GTGAGACAGGTTAGTTTTACC GTACGAATACAGACCGTGAA CTACTGATGATGTGTTGTTGC AGCGGGGCCTCACGATCCTTC CATGGTAATCCTGCTCAGTAC TGACCTTTTGGGTTTTAAGCA GAGAGGAACCGCAGGTTCAG GGAGGTGTCAGAAAAGTTAC ACATTTGGTGTATGTGCTTGG CACAGGGAT(SEQIDNO: C(SEQIDNO:1527) 1521) Mutsu 5SrDNA GTCTACGGCCATACCACCC TGAACGCGCCCGATCTCGTCT RNA5S1 100169751 (SEQIDNO:1522) GATCTCGGAAGCTAAGCAGG GTCGGGCCTGGTTAGTACTT GGATGGGAGACCGCCTGGGA ATACCGGGTGCTGTAGGCTTT (SEQIDNO:1528) Utopia/ U2snRNA ATCGCTTCTCGGCCTTTTGGC TCTGTTCTTATCAGTTTAATAT RNU2-1 6066 Keno TAAGATCAAGTGTAGTA(SEQ CTGATACGTCCTCTATCCGAG IDNO:1523) GACAATATATTAAATGGATTT TTGGAGCAGGGAGATGGAAT AGGAGCTTGCTCCGTCCACTC CACGCATCGACCTGGTATTGC AGTACCTCCAGGAACGGTGC ACCC(SEQIDNO:1529)
(100) In some embodiments, a system or method described herein results in insertion of a heterologous sequence into a target site in the human genome. In some embodiments, the target site in the human genome has sequence similarity to the corresponding target site of the corresponding wild-type retrotransposase (e.g., the retrotransposase from which the GENE WRITER? was derived) in the genome of the organism to which it is native. For instance, in some embodiments, the identity between the 40 nucleotides of human genome sequence centered at the insertion site and the 40 nucleotides of native organism genome sequence centered at the insertion site is less than 99.5%, 99%, 98%, 97%, 96%, 95%, 90%, 85%, 80%, 75%, 70%, 60%, or 50%, or is between 50-60%, 60-70%, 70-80%, 80-90%, or 90-100%. In some embodiments, the identity between the 100 nucleotides of human genome sequence centered at the insertion site and the 100 nucleotides of native organism genome sequence centered at the insertion site is less than 99.5%, 99%, 98%, 97%, 96%, 95%, 90%, 85%, 80%, 75%, 70%, 60%, or 50%, or is between 50-60%, 60-70%, 70-80%, 80-90%, or 90-100%. In some embodiments, the identity between the 500 nucleotides of human genome sequence centered at the insertion site and the 500 nucleotides of native organism genome sequence centered at the insertion site is less than 99.5%, 99%, 98%, 97%, 96%, 95%, 90%, 85%, 80%, 75%, 70%, 60%, or 50%, or is between 50-60%, 60-70%, 70-80%, 80-90%, or 90-100%.
(101) Production of Compositions and Systems
(102) As will be appreciated by one of skill, methods of designing and constructing nucleic acid constructs and proteins or polypeptides (such as the systems, constructs and polypeptides described herein) are routine in the art. Generally, recombinant methods may be used. See, in general, Smales & James (Eds.), Therapeutic Proteins: Methods and Protocols (Methods in Molecular Biology), Humana Press (2005); and Crommelin, Sindelar & Meibohm (Eds.), Pharmaceutical Biotechnology: Fundamentals and Applications, Springer (2013). Methods of designing, preparing, evaluating, purifying and manipulating nucleic acid compositions are described in Green and Sambrook (Eds.), Molecular Cloning: A Laboratory Manual (Fourth Edition), Cold Spring Harbor Laboratory Press (2012).
(103) Exemplary methods for producing a therapeutic pharmaceutical protein or polypeptide described herein involve expression in mammalian cells, although recombinant proteins can also be produced using insect cells, yeast, bacteria, or other cells under control of appropriate promoters. Mammalian expression vectors may comprise non-transcribed elements such as an origin of replication, a suitable promoter, and other 5 or 3 flanking non-transcribed sequences, and 5 or 3 non-translated sequences such as necessary ribosome binding sites, a polyadenylation site, splice donor and acceptor sites, and termination sequences. DNA sequences derived from the SV40 viral genome, for example, SV40 origin, early promoter, splice, and polyadenylation sites may be used to provide other genetic elements required for expression of a heterologous DNA sequence. Appropriate cloning and expression vectors for use with bacterial, fungal, yeast, and mammalian cellular hosts are described in Green & Sambrook, Molecular Cloning: A Laboratory Manual (Fourth Edition), Cold Spring Harbor Laboratory Press (2012).
(104) Various mammalian cell culture systems can be employed to express and manufacture recombinant protein. Examples of mammalian expression systems include CHO, COS, HEK293, HeLA, and BHK cell lines. Processes of host cell culture for production of protein therapeutics are described in Zhou and Kantardjieff (Eds.), Mammalian Cell Cultures for Biologics Manufacturing (Advances in Biochemical Engineering/Biotechnology), Springer (2014). Compositions described herein may include a vector, such as a viral vector, e.g., a lentiviral vector, encoding a recombinant protein. In some embodiments, a vector, e.g., a viral vector, may comprise a nucleic acid encoding a recombinant protein.
(105) Purification of protein therapeutics is described in Franks, Protein Biotechnology: Isolation, Characterization, and Stabilization, Humana Press (2013); and in Cutler, Protein Purification Protocols (Methods in Molecular Biology), Humana Press (2010).
(106) Applications
(107) By integrating coding genes into a RNA sequence template, the GENE WRITER? system can address therapeutic needs, for example, by providing expression of a therapeutic transgene in individuals with loss-of-function mutations, by replacing gain-of-function mutations with normal transgenes, by providing regulatory sequences to eliminate gain-of-function mutation expression, and/or by controlling the expression of operably linked genes, transgenes and systems thereof. In certain embodiments, the RNA sequence template encodes a promotor region specific to the therapeutic needs of the host cell, for example a tissue specific promotor or enhancer. In still other embodiments, a promotor can be operably linked to a coding sequence.
(108) In embodiments, the GENE WRITER? gene editor system can provide therapeutic transgenes expressing, e.g., replacement blood factors or replacement enzymes, e.g., lysosomal enzymes. For example, the compositions, systems and methods described herein are useful to express, in a target human genome, agalsidase alpha or beta for treatment of Fabry Disease; imiglucerase, taliglucerase alfa, velaglucerase alfa, or alglucerase for Gaucher Disease; sebelipase alpha for lysosomal acid lipase deficiency (Wolman disease/CESD); laronidase, idursulfase, elosulfase alpha, or galsulfase for mucopolysaccharidoses; alglucosidase alpha for Pompe disease. For example, the compositions, systems and methods described herein are useful to express, in a target human genome factor I, II, V, VII, X, XI, XII or XIII for blood factor deficiencies.
(109) In some embodiments, the heterologous object sequence encodes an intracellular protein (e.g., a cytoplasmic protein, a nuclear protein, an organellar protein such as a mitochondrial protein or lysosomal protein, or a membrane protein). In some embodiments, the heterologous object sequence encodes a membrane protein, e.g., a membrane protein other than a CAR, and/or an endogenous human membrane protein. In some embodiments, the heterologous object sequence encodes an extracellular protein. In some embodiments, the heterologous object sequence encodes an enzyme, a structural protein, a signaling protein, a regulatory protein, a transport protein, a sensory protein, a motor protein, a defense protein, or a storage protein.
(110) Administration
(111) The composition and systems described herein may be used in vitro or in vivo. In some embodiments the system or components of the system are delivered to cells (e.g., mammalian cells, e.g., human cells), e.g., in vitro or in vivo. In some embodiments, the cells are eukaryotic cells, e.g., cells of a multicellular organism, e.g., an animal, e.g., a mammal (e.g., human, swine, bovine) a bird (e.g., poultry, such as chicken, turkey, or duck), or a fish. In some embodiments, the cells are non-human animal cells (e.g., a laboratory animal, a livestock animal, or a companion animal). In some embodiments, the cell is a stem cell (e.g., a hematopoietic stem cell), a fibroblast, or a T cell. In some embodiments, the cell is a non-dividing cell, e.g., a non-dividing fibroblast or non-dividing T cell. In some embodiments, the cell is an HSC and p53 is not upregulated or is upregulated by less than 10%, 5%, 2%, or 1%, e.g., as determined according to the method described in Example 30. The skilled artisan will understand that the components of the GENE WRITER? system may be delivered in the form of polypeptide, nucleic acid (e.g., DNA, RNA), and combinations thereof.
(112) For instance, delivery can use any of the following combinations for delivering the retrotransposase (e.g., as DNA encoding the retrotransposase protein, as RNA encoding the retrotransposase protein, or as the protein itself) and the template RNA (e.g., as DNA encoding the RNA, or as RNA): 1. Retrotransposase DNA+template DNA 2. Retrotransposase RNA+template DNA 3. Retrotransposase DNA+template RNA 4. Retrotransposase RNA+template RNA 5. Retrotransposase protein+template DNA 6. Retrotransposase protein+template RNA 7. Retrotransposase virus+template virus 8. Retrotransposase virus+template DNA 9. Retrotransposase virus+template RNA 10. Retrotransposase DNA+template virus 11. Retrotransposase RNA+template virus 12. Retrotransposase protein+template virus
(113) As indicated above, in some embodiments, the DNA or RNA that encodes the retrotransposase protein is delivered using a virus, and in some embodiments, the template RNA (or the DNA encoding the template RNA) is delivered using a virus.
(114) In one embodiments the system and/or components of the system are delivered as nucleic acid. For example, the GENE WRITER? polypeptide may be delivered in the form of a DNA or RNA encoding the polypeptide, and the template RNA may be delivered in the form of RNA or its complementary DNA to be transcribed into RNA. In some embodiments the system or components of the system are delivered on 1, 2, 3, 4, or more distinct nucleic acid molecules. In some embodiments the system or components of the system are delivered as a combination of DNA and RNA. In some embodiments the system or components of the system are delivered as a combination of DNA and protein. In some embodiments the system or components of the system are delivered as a combination of RNA and protein. In some embodiments the GENE WRITER? genome editor polypeptide is delivered as a protein.
(115) In some embodiments the system or components of the system are delivered to cells, e.g. mammalian cells or human cells, using a vector. The vector may be, e.g., a plasmid or a virus. In some embodiments delivery is in vivo, in vitro, ex vivo, or in situ. In some embodiments the virus is an adeno associated virus (AAV), a lentivirus, an adenovirus. In some embodiments the system or components of the system are delivered to cells with a viral-like particle or a virosome. In some embodiments the delivery uses more than one virus, viral-like particle or virosome.
(116) In one embodiment, the compositions and systems described herein can be formulated in liposomes or other similar vesicles. Liposomes are spherical vesicle structures composed of a uni- or multilamellar lipid bilayer surrounding internal aqueous compartments and a relatively impermeable outer lipophilic phospholipid bilayer. Liposomes may be anionic, neutral or cationic. Liposomes are biocompatible, nontoxic, can deliver both hydrophilic and lipophilic drug molecules, protect their cargo from degradation by plasma enzymes, and transport their load across biological membranes and the blood brain barrier (BBB) (see, e.g., Spuch and Navarro, Journal of Drug Delivery, vol. 2011, Article ID 469679, 12 pages, 2011 for review).
(117) Vesicles can be made from several different types of lipids; however, phospholipids are most commonly used to generate liposomes as drug carriers. Methods for preparation of multilamellar vesicle lipids are known in the art (see for example U.S. Pat. No. 6,693,086, the teachings of which relating to multilamellar vesicle lipid preparation are incorporated herein by reference). Although vesicle formation can be spontaneous when a lipid film is mixed with an aqueous solution, it can also be expedited by applying force in the form of shaking by using a homogenizer, sonicator, or an extrusion apparatus (see, e.g., Spuch and Navarro, Journal of Drug Delivery, vol. 2011, Article ID 469679, 12 pages, 2011 for review). Extruded lipids can be prepared by extruding through filters of decreasing size, as described in Templeton et al., Nature Biotech, 15:647-652, 1997, the teachings of which relating to extruded lipid preparation are incorporated herein by reference.
(118) Lipid nanoparticles are another example of a carrier that provides a biocompatible and biodegradable delivery system for the pharmaceutical compositions described herein. Nanostructured lipid carriers (NLCs) are modified solid lipid nanoparticles (SLNs) that retain the characteristics of the SLN, improve drug stability and loading capacity, and prevent drug leakage. Polymer nanoparticles (PNPs) are an important component of drug delivery. These nanoparticles can effectively direct drug delivery to specific targets and improve drug stability and controlled drug release. Lipid-polymer nanoparticles (PLNs), a new type of carrier that combines liposomes and polymers, may also be employed. These nanoparticles possess the complementary advantages of PNPs and liposomes. A PLN is composed of a core-shell structure; the polymer core provides a stable structure, and the phospholipid shell offers good biocompatibility. As such, the two components increase the drug encapsulation efficiency rate, facilitate surface modification, and prevent leakage of water-soluble drugs. For a review, see, e.g., Li et al. 2017, Nanomaterials 7, 122.
(119) Exosomes can also be used as drug delivery vehicles for the compositions and systems described herein. For a review, see Ha et al. July 2016. Acta Pharmaceutica Sinica B. Volume 6, Issue 4, Pages 287-296.
(120) A GENE WRITER? system can be introduced into cells, tissues and multicellular organisms. In some embodiments the system or components of the system are delivered to the cells via mechanical means or physical means.
(121) Formulation of protein therapeutics is described in Meyer (Ed.), Therapeutic Protein Drug Products: Practical Approaches to formulation in the Laboratory, Manufacturing, and the Clinic, Woodhead Publishing Series (2012).
(122) All publications, patent applications, patents, and other publications and references (e.g., sequence database reference numbers) cited herein are incorporated by reference in their entirety. For example, all GENBANK?, UNIGENE?, and ENTREZ? sequences referred to herein, e.g., in any Table herein, are incorporated by reference. Unless otherwise specified, the sequence accession numbers specified herein, including in any Table herein, refer to the database entries current as of Aug. 27, 2018. When one gene or protein references a plurality of sequence accession numbers, all of the sequence variants are encompassed.
EXAMPLES
(123) The invention is further illustrated by the following examples. The examples are provided for illustrative purposes only and are not to be construed as limiting the scope or content of the invention in any way.
Example 1: Delivery of a GENE WRITER? System to Mammalian Cells
(124) This example describes a GENE WRITER? genome editing system delivered to a mammalian cell for site-specific insertion of exogenous DNA into a mammalian cell genome.
(125) In this example, the polypeptide component of the GENE WRITER? system is the R2Bm protein from Bombyx mori and the template RNA component is RNA for the R2Bm retrotransposase from Bombyx mori containing a mutation in the reverse transcriptase domain that renders the retrotransposase inactive.
(126) HEK293T cells are transfected with the following test agents: 1. Scrambled RNA control 2. RNA coding for the polypeptide described above 3. Template RNA described above 4. Combination of 2 and 3
(127) After transfection, HEK293T cells are cultured for at least 4 days and then assayed for site-specific genome editing. Genomic DNA is isolated from each group of HEK293 cells. PCR is conducted with primers that flank the R2Bm integration site in 28s rRNA genes. The PCR product is run on an agarose gel to measure the length of the amplified DNA.
(128) A PCR product of the expected length, indicative of a successful GENE WRITING? genome editing event that inserts the sequence for the mutated R2Bm retrotransposase into the target genome, is observed only in cells that were transfected with the complete GENE WRITER? system of group 4 above.
Example 2: Site-Specific Targeted Delivery of a GENE WRITER? System into Insect Cells
(129) This example describes a GENE WRITER? genome editing system delivered to an insect cell at a specific target site of the genome.
(130) In this example, the polypeptide component of the GENE WRITER? system is derived from R2Bm of Bombyx mori, which is modified by replacing its DNA binding domain in the amino terminus of the polypeptide with a heterologous zinc-finger DNA binding domain. The zinc finger DNA binding domain is known to bind to DNA in the BmBLOS2 loci of B. mori cells (Takasu et al., insect Biochemistry and Molecular Biology 40(10): 759-765, 2010). The template RNA is RNA for the R2Bm retrotransposase from Bombyx mori containing a mutation in the reverse transcriptase domain that renders the retrotransposase inactive. Furthermore, the template RNA is modified at the 5 end to have 180 bases of homology to the target DNA site.
(131) B. mori insect cell lines are transfected with the following test agents: 1. Scrambled RNA control 2. RNA coding for polypeptide component described above 3. Template RNA described above 4. Combination of 2 and 3
(132) After transfection, the cells are cultured for at least 4 days and assayed for site-specific GENE WRITING? genome editing. Genomic DNA is isolated from the cells and PCR is conducted with primers that flank the target integration site in the genome. The PCR product is run on an agarose gel to measure the length of DNA. A PCR product of the expected length, indicative of a successful GENE WRITING? genome editing event that inserts the sequence for the mutated R2Bm retrotransposase into the target insect cell genome, is observed only in cells that were transfected with the complete GENE WRITER? system of group 4 above.
Example 3: Site-Specific Targeted Delivery of a GENE WRITER? System into Mammalian Cells
(133) This example describes a GENE WRITER? genome editor system used to insert a heterologous sequence into a specific site of the mammalian genome.
(134) In this example, the polypeptide of the system is the R2Bm protein from Bombyx mori and the template RNA component is RNA coding for the GFP protein and flanked at the 5 end by the 5 UTR and at the 3 end by the 3 UTR of the R2Bm retrotransposase from Bombyx mori. The GFP gene has an internal ribosomal entry site upstream of its start codon and a polyA tail downstream of its stop codon.
(135) HEK293 cells are transfected with the following test agents: 1. Scrambled RNA control 2. RNA coding for the polypeptide described above 3. Template RNA coding for GFP described above 4. Combination of 2 and 3
(136) After transfection, HEK293 cells are cultured for at least 4 days and then assayed for a site-specific GENE WRITING? genome editing event. Genomic DNA is isolated from the HEK293 cells and PCR is conducted with primers that flank the R2Bm integration site in 28s rRNA genes. The PCR product is run on an agarose gel to measure the length of DNA. A PCR product of the expected length, indicative of a successful GENE WRITING? genome editing event, is detected in cells transfected with the test agent of group 4 (complete GENE WRITER? system). This result demonstrates that a GENE WRITING? genome editing system can insert a novel transgene into the mammalian cell genome.
(137) The transfected cells are cultured for a further 10 days, and after multiple cell culture passages are assayed for GFP expression via flow cytometry. The percent of cells that are GFP positive from each cell population are calculated. GFP positive cells are detected in the population of HEK293 cells that were transfected with the test agent of group 4 (complete GENE WRITER? system). This result demonstrates that the novel transgene written into the mammalian cell genome is expressed.
Example 4: Targeted Delivery of a Gene Expression Unit into Mammalian Cells Using a GENE WRITER? System
(138) This example describes the making and using of a GENE WRITER? genome editor to insert a heterologous gene expression unit into the mammalian genome.
(139) In this example, the polypeptide of the GENE WRITER? system is derived from the R2Bm polypeptide of Bombyx mori as modified by replacing its DNA binding domain in the amino terminus of the polypeptide with a heterologous zinc-finger DNA binding domain. The zinc finger DNA binding domain is known to bind to DNA in the AAVS1 locus of human cells (Hockemeyer et al., Nature Biotechnology 27(9): 851-857, 2009). The template RNA comprises a gene expression unit. A gene expression unit comprises at least one regulatory sequence operably linked to at least one coding sequence. In this example, the regulatory sequences include the CMV promoter and enhancer, an enhanced translation element, and a WPRE. The coding sequence is the GFP open reading frame. The gene expression unit is flanked at the 5 end by 180 bases of homology to the target DNA site and at the 3 end by the 3 UTR of the R2Bm retrotransposase from Bombyx mori.
(140) HEK293 cells are transfected with the following test agents: 1. Scrambled control RNA 2. RNA coding for the polypeptide component described above 3. Template RNA comprising the gene expression unit (as described above) 4. The complete GENE WRITER? system comprising both (2) and (3)
(141) After transfection, HEK293 cells are cultured for at least 4 days and assayed for site-specific GENE WRITING? genome editing. Genomic DNA is isolated from the HEK293 cells and PCR is conducted with primers that flank the target integration site in the genome. The PCR product is run on an agarose gel to measure the length of DNA. A PCR product of the expected length, indicative of a successful GENE WRITING? genome editing event, is detected in cells transfected with the test agent of group 4 (complete GENE WRITER? system).
(142) The transfected cells are cultured for a further 10 days, and after multiple cell culture passages are assayed for GFP expression via flow cytometry. The percent of cells that are GFP positive from each cell population are calculated. GFP positive cells are detected in the population of HEK293 cells that were transfected with group 4 test agent, demonstrating that a gene expression unit added into the mammalian cell genome via GENE WRITING? genome editing is expressed.
Example 5: Targeted Delivery of a Gene Expression Unit into an Intronic Region of Mammalian Cells Using a GENE WRITER? System
(143) This example describes the making and use of a GENE WRITING? genome editing system to add a heterologous sequence into an intronic region to act as a splice acceptor for an upstream exon.
(144) The target integration site is the first intron of the albumin locus. Splicing into the first intron a new exon containing a splice acceptor site at the 5 end and a polyA tail at the 3 end will result in a mature mRNA containing the first natural exon of the albumin locus spliced to the new exon. Because the first exon of albumin is removed upon protein processing, the cell expressing the newly formed gene unit will secrete a mature protein comprising only the new exon.
(145) In this example, the GENE WRITER? genome editor polypeptide is derived from the R2Bm GENE WRITER? genome editor of Bombyx mori as modified by replacing the DNA binding domain in the amino terminus of the polypeptide with a heterologous zinc-finger DNA binding domain. The zinc finger DNA binding domain is known to bind tightly to the albumin locus in the first intron as described in Sarma et al., Blood 126, 15: 1777-1784, 2015. The template RNA is RNA coding for EPO with a splice acceptor site immediately 5 to the first amino acid of mature EPO (the start codon and signal peptide is removed) and a 3 polyA tail downstream of the stop codon. The EPO RNA is further flanked at the 5 end by 180 bases of homology to target DNA site and at the 3 end by the 3 UTR of the R2Bm retrotransposase from Bombyx mori.
(146) HEK293 cells are transfected with the following test agents: 1. Scrambled control RNA 2. RNA coding for the polypeptide described above 3. Template RNA comprising the EPO splice acceptor described above 4. The complete GENE WRITER? system comprising both (2) and (3)
(147) After transfection, HEK293 cells are cultured for at least 4 days and assayed for site-specific GENE WRITING? genome editing and appropriate mRNA processing. Genomic DNA is isolated from the HEK293 cells. Reverse transcription-PCR is conducted to measure the mature mRNA containing the first natural exon of the albumin locus and the new exon. The RT-PCR reaction is conducted with forward primers that bind to the first natural exon of the albumin locus and with reverse primers that bind to EPO. The RT-PCR product is run on an agarose gel to measure the length of DNA. A PCR product of the expected length is detected in cells transfected with the test agent of group 4, indicative of a successful GENE WRITING? genome editing event and a successful splice event. This result demonstrates that a GENE WRITING? genome editing system can add a heterologous sequence encoding a gene into an intronic region to act as a splice acceptor for the upstream exon.
(148) The transfected cells are cultured for a further 10 days, and after multiple cell culture passages are assayed for EPO secretion in the cell supernatant. The amount of EPO in the supernatant is measured via an EPO ELISA kit. EPO is detected in HEK293 cells that were transfected with the test agent of group 4, demonstrating that a heterologous sequence can be added into an intronic region via GENE WRITING? genome editing, to act as a splice acceptor for the upstream exon and is actively expressed.
Example 6: Targeted Delivery of R2Tg Retrotransposon to Mammalian Cells
(149) This example describes targeted integration of the R2Tg retrotransposon element (see first row of Table 3 herein) to mammalian cells via DNA or RNA delivery.
(150) R2Tg is an endogenous retrotransposon from the zebra finch (Taenopygia guttata). Because non-LTR R2 elements are not present in the human genome and are thought to be highly site-specific, the ability of R2Tg to accurately and efficiently integrate itself into the human genome would demonstrate the capability to perform genomic targeted integration and possibly enable human gene therapy.
(151) In the DNA delivery method, plasmid harboring R2Tg (PLV014) was designed and synthesized such that the R2Tg element was codon optimized and flanked by its native un-translated regions (UTRs), with or without further flanking by 100 bp homology to the rDNA target locus. The R2Tg element expression was driven by the mammalian CMV promoter. Further, a 1 bp deletion mutant (678*) having a frameshift in the coding sequence of the retrotransposase was constructed as an inactivated control (frameshift mutant). Each plasmid was introduced into HEK393T cells via FuGENE? HD transfection reagent. HEK293T cells were seeded in 96-well plate, 10,000 cells/well 24 hr before transfection. On the transfection day, 0.5 ?l transfection reagent and 80 ng DNA was mixed in 10 ?l Opti-MEM and incubated for 15 min at room temperature. Then the transfection mixture was added to the medium of the seeded cells. 3 days after transfection, genomic DNA was extracted for retrotransposition assays.
(152) Next, integration of the R2Tg transposase into the human genome was assessed. Based on homology to the finch genome, a putative integration site in human rDNA was tested. Advanced MISEQ? and ddPCR assays were used to assess integration.
(153) Bias in MISEQ? library construction was eliminated by introducing random unique molecular indices (UMIs) into initial PCRs (
(154) ddPCR was next performed to confirm integration and assess integration efficiency. A Taqman probe was designed to the 3UTR portion of the R2Tg element. A forward primer was synthesized to bind directly upstream of the probe, and a reverse primer was synthesized to bind the rDNA. Therefore, amplification of the expected product across the integration junction degrades the probe and creates a fluorescent signal. ddPCR was performed on several replicate experiments of the above plasmids to determine the average copy number of the R2Tg integration event. The results of ddPCR copy number analysis (in comparison to reference gene RPP30) are shown in
(155) In the RNA delivery method, R2Tg RNA (RNAV019) was designed such that the R2Tg element was codon optimized and flanked by its native untranslated regions (UTRs). More specifically, the construct includes, in order: a T7 promoter, a 5 28S target homology region 100 nucleotides in length, a R2Tg wild-type 5 UTR, the R2Tg codon-optimized coding sequence, a R2Tg wild type 3 UTR, and a 3 28S target homology region 100 nucleotides in length. The 100 bp 28S homology sequences were added outside of the UTRs to enhance the integration. R2Tg RNA was synthesized, and cap and polyA tail were added. The R2Tg element transcription was driven by the T7 promoter. The RNA was introduced into HEK393T cells via Lipofectamine? RNAiMAX or TransIT?-mRNA transfection reagent with series of RNA dosages. HEK293T cells were seeded in 96-well plate 24 hr before transfection. On the transfection day, transfection reagent and RNA were mixed in 10 ?l Opti-MEM, and the transfection mixture was added to the medium of the seeded cells. 3 days after transfection, genomic DNA was extracted to measure retrotransposition efficiency using ddPCR with the same design as the DNA delivery.
(156) The results of ddPCR copy number analysis (normalized to reference gene RPP30) are shown in
Example 7: Targeted Delivery of a Heterologous Object Sequence Using R2Tg Retrotransposon to Mammalian Cells
(157) This example describes the delivery of a transgene to human cells by utilizing the R2Tg retrotransposon system with multiple delivery machineries, including RNA-mediated delivery of a heterologous object sequence to human cells by utilizing the R2Tg retrotransposon system.
(158) R2 proteins recognize their template RNA structure in untranslated regions (UTRs) of each element to form ribonucleoprotein particles, which serve as the intermediates of downstream integration into a host genome. Therefore, the decoupling of UTRs from their native context and the introduction of UTRs into alternate exogenous sequence was engineered to deliver into the genome a desired nucleic acid using R2Tg machinery.
(159) Trans-transgene integration was tested by constructing 1) R2Tg coding sequence and 2) transgene cassette flanked by R2Tg UTR sequences and 100 bp homology to 28S rDNA into separate driver and transgene plasmids, respectively.
(160) Similar to the trans-transgene delivery with plasmids, RNA delivery was performed by constructing an amplicon of the coding sequence of R2Tg preceded by the T7 promoter sequence. The constructed amplicons contained the experimental R2Tg element as well as the 1 bp deletion frameshift mutant control. Separately, an amplicon was constructed that contained exogenous sequence coding for GFP and an EGF1-alpha reporter that was flanked regions sufficient to drive integration into the genome by R2Tg. More specifically, the construct included: a T7 promoter driving transcription of the RNA, wherein the RNA comprises, from 5 to 3, (a) a 5 28S homology region of 10 nt in length, (b) a 5 untranslated region, (c) an anti-sense TKpA polyA sequence, (d) an anti-sense heterologous object sequence that encodes GFP, (e) an anti-sense Kozak sequence, (f) an anti-sense EF1 alpha promoter, (g) a 3 untranslated region that binds the GENE WRITER? protein, and (h) a 3 28S homology region of 10 nt in length. Each RNA was transcribed via the New England Biolabs HiScribe T7 ARCA kit and purified via Zymo RNA clean and concentrator.
(161) The resulting heterologous object RNA and R2Tg RNA (either the experiment R2Tg element or frameshift mutant) were introduced into human HEK293T cells via TransIT?-mRNA Transfection Kit at 1:1 molar ratio. HEK293T cells were seeded in 96-well plate, 40,000 cells/well 24 hr before transfection. On the transfection day, 1 ?l transfection reagent and 500 ng total RNA was mixed in 10 ?l Opti-MEM and incubated for 5 min at room temperature. Then the transfection mixture was added to the medium of the seeded cells. 3 days after transfection, genomic DNA was extracted for PCR assays.
(162) Nested PCR was performed by with a first 30 rounds of PCR across the 3 end of the expected transgene-rDNA junction, followed by 20 additional rounds of PCR amplification using an inner primer set. One of three replicates of nested PCR performed on genomic DNA extracted from cells treated with the wild-type transposase reaction produced a PCR product of the expected size (approximately 596 bp). In contrast, no PCR product was observed in genomic DNA extracted from cells treated with the frameshift-inactivated R2Tg mutant control, or no-transfection control. The PCR product was gel-purified via Zero Blunt? TOPO? PCR Cloning Kit, and the resulting colonies were Sanger sequenced. Each individual PCR product sequence was then aligned to the expected integration sequence. The fraction of PCR product sequences that align to the expected integrated heterologous object sequence is shown in
Example 8: Targeted Delivery of R2Tg Retrotransposon to Mammalian Cells
(163) This example describes targeted integration of the R2Tg retrotransposon element to mammalian cells via DNA delivery.
(164) Plasmid harboring R2Tg (PLV014) and control plasmid were designed and synthesized as described above in Example 6. Each plasmid was introduced into HEK393T cells via FuGENE? HD transfection reagent. HEK293T cells were seeded in 96-well plate, 10,000 cells/well 24 hr before transfection. On the transfection day, 0.5 ?l transfection reagent and 80 ng DNA was mixed in 10 ?l Opti-MEM and incubated for 15 min at room temperature. Then the transfection mixture was added to the medium of the seeded cells. 3 days after transfection, genomic DNA was extracted for retrotransposition assays or cells were frozen and underwent targeted locus amplification.
(165) Target locus amplification was performed against hg38 reference human genome and the rDNA locus sequence hsu13369 (GENBANK?: U13369.1). Two independent primer sets were used to perform targeted locus amplification. Analysis with both primer sets showed that the 28S rDNA locus sequence is the only integration site detected above a 1% threshold. Thus, integration of the R2Tg transposon in mammalian cells is specific to this target site.
Example 9: Improved Trans RNA-Templated Integration into Mammalian Cells by RNA Refolding or Ratio of Driver to Template RNA
(166) RNA templates are designed as in previous examples. Two RNAs consisting of a driver and a transgene payload are delivered to mammalian cells. To better promote folding, denaturing the payload RNA by heating to 95 C and cooling to room temperature are performed to encourage proper secondary structure formation. In some embodiments, cooling the RNA to room temperate will increase integration efficiency.
(167) The molar ratio of transgene to driver is also varied to evaluate suitable stoichiometry of components. Integration is analyzed via ddPCR and sequencing. In some embodiments, a higher ratio of driver to transgene is used. In some embodiments, a higher ratio of transgene to driver is used.
(168) Previous examples with cis transgene integration are similarly assayed for stoichiometry of driver to payload. Integration is analyzed via ddPCR and sequencing. In some embodiments, a higher ratio of driver transcription or translation to transgene transcription will result in higher integration efficiency. In some embodiments, a higher ratio of transgene transcription to driver transcription and translation will result in higher integration efficiency.
Example 10: Hybrid Capture Assay
(169) A hybrid capture experiment was performed to obtain an unbiased view of the specificity of retrotransposon integration into a target site. Retrotransposon experiments were performed as in previous examples by integrating R2Tg flanked by its native UTRs and 100 bp of homology to either side of the expected R2 rDNA target. The rDNA target site had two flanking sets of 100 nucleotides identity to the corresponding native target site. The retrotransposon was delivered to human 293T cells via plasmid or mRNA. Genomic DNA was extracted after 72 hours. After extraction, each genomic DNA sample was subjected to hybrid capture according to protocol with a custom probe set (Twist). Biotinylated probes were designed such that ?120 bp probes spanned both strands of the R2Tg coding sequence and UTRs. First, a next-generation library was created by fragmentation of the genomic DNA and ligation of sequencing adapters according to a protocol from Twist (available on the world wide web at: twistbioscience.com/ngs_protocol_custompanel_hybridcap). Next, probes were hybridized to genomic DNA libraries and the enriched samples were amplified. Final libraries were sequenced on the MISEQ? using 300 bp paired-end reads. Custom MATLAB? scripts were used to analyze reads. The resulting analysis is shown in
Example 11: Long-Read PacBio Analysis
(170) Long-range PCR amplification can be performed to measure integration of the desired full-length sequence into the target site in the human genome and to measure whether mutations are introduced during insertion. Retrotransposon integration experiments are performed as described in previous examples. In one example, PCR amplification is used to generate amplicons by designing one primer targeting the genomic integration site and one primer targeting the integrant sequence. In this example, these primers are designed to maximize the length of the amplified genomic locus fused with the integrant sequence. By pooling amplicons spanning both ends of the integrant and performing long-read next-generation sequencing, the fidelity of each integration is be evaluated.
(171) In another example, hybrid capture is performed as described in a previous example but with a larger target library length during initial library generation. The resulting library is then subjected to long-read next-generation sequencing.
(172) In some embodiments, long-read next generation sequencing will show that there are less than 10%, 5%, 2%, 1%, 0.5%, 0.2%, or 0.1% SNPs in the integrated DNA across samples. In some embodiments, long-read next generation sequencing will show that less than 10%, 5%, 2%, or 1% of integrated DNA has a SNP. In some embodiments, long-read next generation sequencing will show that less than 10%, 5%, 2%, or 1% of integrated DNA has an internal deletion. In some embodiments, long-read next generation sequencing will show that less than 10%, 5%, 2%, 1%, 0.5%, 0.2%, or 0.1% of total integrated DNA across the population is deleted. In some embodiments, long-read next generation sequencing will show that less than 10%, 5%, 2%, or 1% of integrated DNA is truncated.
Example 12: Experiment with Different Homology Lengths and Point Mutations in Homology
(173) In this example, experiments are designed to characterize suitable lengths and starting positions of homology to the target site for efficient retrotransposon integration. Also, the homology is used to support the mechanism of integration being reverse transcription-driven.
(174) A series of SNPs were introduced within the 100 bp downstream homology of R2Tg plasmids by modifying plasmid PLV014. The design of the SNPs is listed in
(175) This example also describes the evaluation different homology regions to the target site to identify shorter regions that promote efficient integration into the genome. This example describes two approaches. First, different windows of 100 bp of homology to the target site are tested, starting from bp 1-100 3 of the target site, then testing 2-101 3 of the target site, 3-103 3 of the target site, and so on, through bp 30-131 3 of the target site. Second, shorter lengths of homology to the target site sufficient for DNA integration are tested, starting with bp 0-100 3 of the target site, then testing 0-95 3 of the target site, 0-90 3 of the target site, etc. through bp 0-10 3 of the target site. After the transfection of each plasmid into 293T cells, ddPCR is used to measure the retrotransposition efficiency.
(176) In this example, different UTR regions with different lengths are evaluated to identify shorter sequences for efficient integration into the genome. The 3UTR is tested by dividing this 325 bp sequence into 3 regions, 1-100 bp, 101-200 bp, and 201-325 bp. Constructs of R2Tg containing each truncated 3UTR are generated to test the integration efficiency respectively.
Example 13: Assess Whether p53 or Other Repair Pathways are Upregulated
(177) This example describes an evaluation of the effect of exogenous R2Tg retrotranspositon on gene expression, especially tumor suppressor and DNA repair genes. An R2Tg expressing plasmid is delivered to multiple cancer cell lines, including 293T, MCF-7, and T47D. After confirmation of integration in each cell line, RNA-seq is conducted to assess the effect on gene expression profile. Gene set enrichment analysis is then applied to evaluate whether any DNA repair pathways are upregulated after retrotransposition. MCF-7 and T47D are breast cancer cell lines with wild type and mutant p53 respectively, which are be used to evaluate the relationship between p53 and retrotranspositon specifically. In some embodiments, p53 is not upregulated when a retrotransposon GENE WRITER? integrates into the genome. In some embodiments, no DNA repair genes are upregulated when a retrotransposon GENE WRITER? integrates into the genome. In some embodiments, no tumor suppressor genes are upregulated when a retrotransposon GENE WRITER? integrates into the genome.
Example 14: Retrotransposition in Presence of DNA Repair Inhibitors
(178) In this example, experiments will test the effect of different DNA repair pathways on R2Tg retrotransposition via the application of DNA repair pathway inhibitors or DNA repair pathway deficient cell lines. When applying DNA repair pathway inhibitors, PrestoBlue cell viability assay is performed first to determine the toxicity of the inhibitors and whether any normalization should be applied for following assays. SCR7 is an inhibitor for NHEJ, which is applied at a series of dilutions during R2Tg delivery. PARP protein is a nuclear enzyme that binds as homodimers to both single- and double-strand breaks. Thus, its inhibitors are be used in the test of relevant DNA repair pathways, including homologous recombination repair pathway and base excision repair pathway. The experiment procedure is the same with that of SCR7. Cell lines with deficient core proteins of nucleotide excision repair (NER) pathway are used to test the effect of NER on R2Tg retrotransposition. After the delivery of R2Tg element into the cell, ddPCR is be used to evaluate the retrotransposition in the context of inhibition of DNA repair pathways. Sequencing analysis is also be performed to evaluate whether certain DNA repair pathway plays a role in the alteration of integration junction. In some embodiments, R2Tg integration into the genome will not be decreased by the knockdown of any DNA repair pathways, suggesting that R2Tg does not rely on the host cell pathways for DNA integration.
Example 15: Retrotransposition in Fibroblasts and in T Cells
(179) In this example, the previously performed R2Tg retrotransposition analysis of 293T cells is repeated in non-dividing cells, including fibroblast and T cells. Compared to 293T cells, non-dividing cells are sometimes more difficult to transfect with lipid reagent. Thus, nucleofection is used for the delivery of R2Tg element. The subsequent retrotransposition assay for integrating efficiency and sequencing analysis will be performed as described herein for 293T cells. In some embodiments, R2Tg integrates into the genome of fibroblasts and T cells.
Example 16: Single Cell ddPCR
(180) In this example, a quantitative assay is used to determine the frequency of targeted genome integration at single cell level, and that information can be compared to the copy number of targeted genome integration per genome quantified from genomic DNA.
(181) Approximately 5000 transfected cells will be collected and mixed with ddPCR reaction mixture before distributing into about 20,000 droplets, with the aim of each droplet containing one cell or no cells. ddPCR assays including 5UTR and 3UTR assays will be performed as described above to determine the frequency of R2 or transgene integration at single cell level. A control experiment will be performed in parallel using genomic DNA harvested from the same number of cells to determine the targeted genome integration efficiency per genome. In some embodiments, the frequency of targeted genome integration at the single cell level is calculated to be 1-80%, e.g., 25%, wherein the indicated percentage of cells have one or more copies of the transgene integrated into the desired locus.
Example 17: Single Cell Analysis Via Colony Isolation
(182) In this example, a quantitative assay is used to determine genome integration copy number in cell colonies derived from single cell.
(183) Single cell colonies will be isolated by colony picking up or by limited dilution and cultured in a 96 well format. When the cells reach >80% confluency, half of the cells will be frozen for backup and genomic DNA from the other half of the cells will be harvested for ddPCR. Optimized ddPCR assays including 5UTR and 3UTR assays will be performed as described previously to determine the frequency of R2 or transgene integration. At least 96 colonies will be screened for each R2 element with appropriate controls. The total number of colonies to be screened will be determined by single cell ddPCR data if applicable or the first set of single cell colony screen data. In some embodiments, the frequency of targeted genome integration at the single cell level will be calculated to be 1-80%, e.g., 25%, wherein the indicated percentage of cells have a single copy of the transgene integrated into the desired locus. The assay can also be used to determine the percentage of colonies that have more than one copy of the transgene integrated into the desired locus.
Example 18: DNA Binding Affinity and/or Re-Targeting
(184) The DNA targeting module of wild-type R2 is made of a cysteine-histidine zinc finger and c-Myb transcription factor binding motifs. This N-terminal module can be substituted with different DNA binding modules such as DNA binding protein(s) (e.g., transcription factors), zinc finger(s) (e.g., natural or designed motifs), and/or nucleic acid guided, catalytically inactive endonucleases (e.g., Cas9 bound with a guide RNA (e.g., sgRNA) to form a Cas9-RNP). This DNA binding module is swapped for the naturally occurring module and, in some cases placed with a flexible linker attaching it to the RNA binding/RT module. Additionally, in some constructions, this new DNA binding module is placed in tandem with the same and/or different DNA binding modules. Furthermore, some constructions may split the GENE WRITER? protein where one protein molecule contains the RNA binding module and the other protein contains the RT and endonuclease modules. In some embodiments, swapping of the DNA module increases specificity and/or affinity to a genomic location and in some cases allows for the specific targeting of new genomic locations.
Example 19: Assays to Measure DNA Binding Affinity
(185) DNA binding activity of GENE WRITER? genome editor polypeptides described herein (and DNA binding domains for the same) can be tested, e.g., as described in this example. DNA binding modules are purified by recombinantly expressing them in cells (e.g., E. coli) or they are expressed in a cell-free reactions of transcription and translation (e.g., T7 RNA polymerase+wheat germ extract). The purified DNA binding module(s) is tested for binding affinity by measuring the Kd in a binding assay (e.g., EMSA, Fluorescence anisotropy, dual-filter binding, FRET, SPR, or thermophoresis (temperature related intensity change). The protein (DNA binding module) is labeled and/or the DNA molecule is labeled with a molecule that is compatible with the above binding assays (e.g., dye, radioisotope (for example, Protein: .sup.35S-methionine, maleimide dye, DNA: .sup.32P end or internal label, DNA with a linked amine reacted with NHS-ester dye). The molecules are measured by changing their concentrations and fitting to a binding curve which calculates the binding affinity. In some assays, the nucleic acid sequence specificity is tested by mutational analysis of the DNA sequence or mutation to the DNA binding module by amino acid changes or alterations to protein-nucleic acid complex (e.g., Cas9-RNP DNA binding module). In some embodiments, increasing the Kd of the DNA binding module will decrease off-target insertions and, in some cases, will increase the activity of on-target sites by increasing the dwell time of the R2-RNA complex at the specific genomic location.
Example 20: Assays to Determine Global Specificity De Novo
(186) The DNA binding module is expressed in cells (e.g., animal cells, e.g., human cells) as the DNA binding module alone, in the context of the full-length retrotransposon R2, or a control without retrotransposase. The expression of the module or retrotransposon is delivered to cells using conventional methods of delivering DNA, RNA, or protein. The complex is crosslinked (e.g., using chemical or UV light) or is not crosslinked. The cells are lysed and treated with DNase I so that only the bound DNA is protected from degradation. DNA is extracted, NGS library preparation of DNA fragments and de novo binding sites are identified, analogous to ChIP-seq or DIG-seq. In some embodiments, potential off-target sites are identified that can be followed-up to remove false-positives. In other embodiments this assay confirms the in vitro assay on the specificity of the DNA binding module to bind at its intended site and not at others.
(187) An orthogonal assay to identify DNA binding sites in high-throughput uses the method described by Boyle et al, PNAS 2017 where the DNA binding domain is tested in a cell-free setting to determine specificity along with systematic analysis of sequence mutants related to the new DNA binding module.
Example 21: Modularity of RNA Molecule
(188) The RNA molecule binds to the R2 protein via interactions found in the reverse transcriptase module, designated as a sub-module RNA binding. The protein recognizes specific structures in the 5 and/or 3 UTRs to interact with the RNA. In some embodiments, swapping of the UTR modules increases protein interactions, changes the protein specificity to bind the UTR, stabilizes against nucleases, and/or improves cellular tolerance (e.g., leads to a reduced innate immune response). In other embodiments, addition and/or swapping of the RNA binding module of the R2 protein is compatible with the use of different sequence or ligands that are linked to the transgene and/or element module of the RNA. In some embodiments, combinations of new ligands in place of the UTRs will have better affinity to the RNA binding domain of R2 and lead to better insertion efficiency. In some embodiments, the changes to the sequence of the UTRs or changes to the base modifications of the UTRs will increase the secondary structure stability that leads to better interaction with the RNA binding module.
Example 22: Assays to Measure RNA Binding Affinity to New Sequences
(189) New UTR modules are tested in a binding assay. In the case of new RNAs, they are synthesized either by cell-free in vitro transcription using a synthetic DNA template or by chemical synthesis of the RNA in full-length or chemical synthesis of pieces that are ligated together to form a single RNA molecule. The binding affinity of the purified UTRs are measured in a binding assay (e.g., EMSA, Fluorescence anisotropy, dual-filter binding, FRET, SPR, or thermophoresis (temperature related intensity change)). The UTR module and/or RNA binding module/RT module is detected with or without a label which is described above for labeling RNAs. Measurement of the molecules at different concentrations is performed to determine a binding affinity. In some embodiments, alterations to/swapping of the 5 and/or 3 UTR binding module and/or changes to the RNA binding/RT module will lead to better interactions than the wild-type R2 protein or UTR. In some embodiments, the increased interaction will lead to an increase in the efficiency of retro-transposition and in some cases increases specificity of the R2 protein to interact with the RNA.
Example 23: Alternative UTRs
(190) While not wishing to be bound by theory, in some embodiments the UTRs act as a handle for the R2 protein to interact with the RNA which it uses as a template for RT in concert with it binding a genomic location, nicking the DNA with its endonuclease module, and then using the bound RNA as a template for RT insertion at the cleavage site in the DNA. For the UTR to keep the template in close proximity to the RT module, then the UTR modules can be substituted with different ligands that would bind to a specific RNA binding module engineered into the R2 protein. Thus, in some embodiments, the alternative non-RNA UTR is either a protein, small molecule, or other chemical entity that is attached covalently, through protein-protein interaction, small molecule-protein interaction, or through hybridization. In some embodiments the RNA binding module binds specifically to a ligand that is not RNA that is attached to the transgene module RNA that increases the efficiency, stability, and/or rate of retro-transposition.
Example 24: Assays to Measure the Activity of UTR Constructs
(191) Binding assays to measure affinity of R2 protein with engineered UTRs are performed as described above, e.g., for a protein-nucleic acid interaction. In cases of protein-protein or protein-small molecule interactions the assay uses a label on the RNA transgene module where the UTR module is attached.
Example 25: Targeted Genomic Integration
(192) In this example, GENE WRITING? technology is delivered to target cells and to non-target cells, and new DNA is integrated into the genome in target cells at a higher frequency than in non-target cells. As described in more detail below, this approach takes advantage of the non-target cell having an endogenous miRNA that the target cell does not have (or has at a lower level). The endogenous miRNA is used to reduce DNA integration in the non-target cell.
(193) The polypeptide used is the R2Tg protein and the template RNA component is RNA coding for the GFP protein and flanked at the 5 end by the 5 UTR and at the 3 end by the 3 UTR of the R2Tg retrotransposase. The 5 UTR is flanked by 100 bp of homology to the 5 of R2Tg 28s rDNA target site and the 3 UTR is flanked by 100 bp of homology to the 3 of R2Tg 28s rDNA target site. The GFP gene is facing in the antisense direction with regard to the 5 and 3 UTRs and has its own promoter and polyadenylation signal.
(194) The template RNA further comprises a microRNA recognition sequence. This microRNA recognition sequence is bound by microRNAs in the non-target cells, leading to the inhibition (e.g., degradation) of the template RNA prior to genomic integration.
(195) In this example the target cells are hepatocytes and the non-target cells are macrophages from the hematopoietic lineage. The target cells and non-target cells are cultured separately. The template RNA and retrotransposase protein can be delivered to cells as described herein, e.g., as RNA or using viral vectors (e.g. adeno-associated viral vectors), wherein the template RNA is transcribed from viral vector DNA.
(196) Three days after treating the cells, GFP expression and genomic integration are assayed.
(197) GFP expression is assayed via flow cytometry. In some embodiments, GFP expression will be higher in the hepatocyte population than in the macrophage population.
(198) Genomic integration (in terms of copy number per cell normalized to a reference gene) is assayed via droplet digital PCR using methods described herein. In some embodiments, genomic integration will be higher in the hepatocyte population than in the macrophage population.
Example 26: Testing Modularity of the DNA Binding Domain
(199) In this example, a series of experiments were performed to test the activity of various mutant retrotransposases, as well as gaining structural knowledge about these proteins. This experiments tested flexible linkers in different locations and lengths, in order to determine if the DNA binding domain (DBD) was modular. These experiments also provide support for being able to separate the DBD from the rest of R2Tg and replacing it with any DNA targeting protein sequence. This example thus supports an understanding that the transposases described herein can withstand the tested levels of sequence divergence at a plurality of locations (e.g., in the predicted ?1 RNA binding motif, in an alpha helix, and in a coil region located C-terminal to the predicted c-myb DNA binding motif, e.g., as described below) identified by structural modeling, while maintaining function.
(200) Briefly, the two linkers (Linker A: SGSETPGTSESATPES (SEQ ID NO: 1023), and Linker B: GGGS (SEQ ID NO: 1024)) were inserted into 3 locations, noted herein as versions v1, v2, and v3. v1 was located at the N-terminal side of an alpha helical region of R2Tg that preceded the predicted ?1 RNA binding motif, v2 was located at the C-terminal side of an alpha helical region of R2Tg that preceded the predicted ?1 RNA binding motif, and v3 was located C-terminal to a random coil region that came after the predicted c-myb DNA binding motif of R2Tg. For each of v1, v2, and v3, one of linkers A or B were added by PCR to a DNA plasmid that expressed R2Tg, thereby yielding sequences v1A (v1+ linker A), v1B (v1+ linker B), v1C (v1+ linker C), v2A (v2+ linker A), v2B (v2+ linker B), and v2C (v2+ linker C), as shown in Table 5 below. The insertion of the linkers was verified by Sanger sequencing and the DNA plasmids were purified for transfection.
(201) TABLE-US-00005 TABLE5 AminoacidsequencesofR2Tg mutantswithlinkersintheDNA bindingdomain(DBD) R2Tg SEQ Mutant+ ID Linker AminoAcidSequence NO R2Tg MASCPKPGPPVSAGAMSLESGLTTHSVLAIERGPNS 1017 withDBD LANSGSDFGGGGLGLPLRLLRVSVGTQTSRSDWVDL Linker VSWSHPGPTSKSQQVDLVSLFPKHRVDLLSKNDQVD v1A LVAQFLPSKFPPNLAENDLALLVNLEFYRSDLHVYE CVHFAAHWEGLSGLPEVYEQLAPQPCVGETLHSSLP RDSELFVPEEGSSEKESEDAPKTSPPTPGKHGLEQT GEEKVMVTVPDKNPPCPCCGTRVNSVLNLIEHLKVS HGKRGVCFRCAKCGKENSNYHSVVCHFPKCRGPETE KAPAGEWICEVCNRDFTTKIGLGQHKRLAHPAVRNQ ERIVASQPKETSNRGAHKRCWTKEEEELLIRLEAQF EGNKNINKLIAEHITTKTAKQISDKRRLLSRKPAEE PREEPGTCHHTRRAAASLRTEPEMSHHAQAEDRDNG PGRRPLPGRAAAGGRTMDEIRRHPDKGNGQQRPTKQ KSEEQLQAYYKKTLEERLSAGALNTFPRAFKQVMEG RDIKLVINQTAQDSGSETPGTSESATPESCFGCLES ISQIRTATRDKKDTVTREKHPKKPFQKWMKDRAIKK GNYLRFQRLFYLDRGKLAKIILDDIECLSCDIPLSE IYSVFKTRWETTGSFKSLGDFKTYGKADNTAFRELI TAKEIEKNVQEMSKGSAPGPDGITLGDVVKMDPEFS RTMEIFNLWLTTGKIPDMVRGCRTVLIPKSSKPDRL KDINNWRPITIGSILLRLFSRIVTARLSKACPLNPR QRGFIRAAGCSENLKLLQTIIWSAKREHRPLGVVFV DIAKAFDTVSHQHIIHALQQREVDPHIVGLVSNMYE NISTYITTKRNTHTDKIQIRVGVKQGDPMSPLLFNL AMDPLLCKLEESGKGYHRGQSSITAMAFADDLVLLS DSWENMNTNISILETFCNLTGLKTQGQKCHGFYIKP TKDSYTINDCAAWTINGTPLNMIDPGESEKYLGLQF DPWIGIARSGLSTKLDFWLQRIDQAPLKPLQKTDIL KTYTIPRLIYIADHSEVKTALLETLDQKIRTAVKEW LHLPPCTCDAILYSSTRDGGLGITKLAGLIPSVQAR RLHRIAQSSDDTMKCFMEKEKMEQLHKKLWIQAGGD RENIPSIWEAPPSSEPPNNVSTNSEWEAPTQKDKFP KPCNWRKNEFKKWTKLASQGRGIVNFERDKISNHWI QYYRRIPHRKLLTALQLRANVYPTREFLARGRQDQY IKACRHCDADIESCAHIIGNCPVTQDARIKRHNYIC ELLLEEAKKKDWVVFKEPHIRDSNKELYKPDLIFVK DARALVVDVTVRYEAAKSSLEEAAAEKVRKYKHLET EVRHLTNAKDVTFVGFPLGARGKWHQDNFKLLTELG LSKSRQVKMAETFSTVALFSSVDIVHMFASRARKSM VM R2Tg MASCPKPGPPVSAGAMSLESGLTTHSVLAIERGPNS 1018 withDBD LANSGSDFGGGGLGLPLRLLRVSVGTQTSRSDWVDL Linker VSWSHPGPTSKSQQVDLVSLFPKHRVDLLSKNDQVD v1B LVAQFLPSKFPPNLAENDLALLVNLEFYRSDLHVYE CVHFAAHWEGLSGLPEVYEQLAPQPCVGETLHSSLP RDSELFVPEEGSSEKESEDAPKTSPPTPGKHGLEQT GEEKVMVTVPDKNPPCPCCGTRVNSVLNLIEHLKVS HGKRGVCFRCAKCGKENSNYHSVVCHFPKCRGPETE KAPAGEWICEVCNRDFTTKIGLGQHKRLAHPAVRNQ ERIVASQPKETSNRGAHKRCWTKEEEELLIRLEAQF EGNKNINKLIAEHITTKTAKQISDKRRLLSRKPAEE PREEPGTCHHTRRAAASLRTEPEMSHHAQAEDRDNG PGRRPLPGRAAAGGRTMDEIRRHPDKGNGQQRPTKQ KSEEQLQAYYKKTLEERLSAGALNTFPRAFKQVMEG RDIKLVINQTAQDGGGSCFGCLESISQIRTATRDKK DTVTREKHPKKPFQKWMKDRAIKKGNYLRFQRLFYL DRGKLAKIILDDIECLSCDIPLSEIYSVFKTRWETT GSFKSLGDFKTYGKADNTAFRELITAKEIEKNVQEM SKGSAPGPDGITLGDVVKMDPEFSRTMEIFNLWLTT GKIPDMVRGCRTVLIPKSSKPDRLKDINNWRPITIG SILLRLFSRIVTARLSKACPLNPRQRGFIRAAGCSE NLKLLQTIIWSAKREHRPLGVVFVDIAKAFDTVSHQ HIIHALQQREVDPHIVGLVSNMYENISTYITTKRNT HTDKIQIRVGVKQGDPMSPLLFNLAMDPLLCKLEES GKGYHRGQSSITAMAFADDLVLLSDSWENMNTNISI LETFCNLTGLKTQGQKCHGFYIKPTKDSYTINDCAA WTINGTPLNMIDPGESEKYLGLQFDPWIGIARSGLS TKLDFWLQRIDQAPLKPLQKTDILKTYTIPRLIYIA DHSEVKTALLETLDQKIRTAVKEWLHLPPCTCDAIL YSSTRDGGLGITKLAGLIPSVQARRLHRIAQSSDDT MKCFMEKEKMEQLHKKLWIQAGGDRENIPSIWEAPP SSEPPNNVSTNSEWEAPTQKDKFPKPCNWRKNEFKK WTKLASQGRGIVNFERDKISNHWIQYYRRIPHRKLL TALQLRANVYPTREFLARGRQDQYIKACRHCDADIE SCAHIIGNCPVTQDARIKRHNYICELLLEEAKKKDW VVFKEPHIRDSNKELYKPDLIFVKDARALVVDVTVR YEAAKSSLEEAAAEKVRKYKHLETEVRHLTNAKDVT FVGFPLGARGKWHQDNFKLLTELGLSKSRQVKMAET FSTVALFSSVDIVHMFASRARKSMVM R2Tg MASCPKPGPPVSAGAMSLESGLTTHSVLAIERGPNS 1019 withDBD LANSGSDFGGGGLGLPLRLLRVSVGTQTSRSDWVDL Linker VSWSHPGPTSKSQQVDLVSLFPKHRVDLLSKNDQVD v2A LVAQFLPSKFPPNLAENDLALLVNLEFYRSDLHVYE CVHFAAHWEGLSGLPEVYEQLAPQPCVGETLHSSLP RDSELFVPEEGSSEKESEDAPKTSPPTPGKHGLEQT GEEKVMVTVPDKNPPCPCCGTRVNSVLNLIEHLKVS HGKRGVCFRCAKCGKENSNYHSVVCHFPKCRGPETE KAPAGEWICEVCNRDFTTKIGLGQHKRLAHPAVRNQ ERIVASQPKETSNRGAHKRCWTKEEEELLIRLEAQF EGNKNINKLIAEHITTKTAKQISDKRRLLSRKPAEE PREEPGTCHHTRRAAASLRTEPEMSHHAQAEDRDNG PGRRPLPGRAAAGGRTMDEIRRHPDKGNGQQRPTKQ KSEEQLQAYYKKTLEERLSAGALNTFPRAFKQVMEG RDIKLVINQTAQDCFGCLESISQIRSGSETPGTSES ATPESTATRDKKDTVTREKHPKKPFQKWMKDRAIKK GNYLRFQRLFYLDRGKLAKIILDDIECLSCDIPLSE IYSVFKTRWETTGSFKSLGDFKTYGKADNTAFRELI TAKEIEKNVQEMSKGSAPGPDGITLGDVVKMDPEFS RTMEIFNLWLTTGKIPDMVRGCRTVLIPKSSKPDRL KDINNWRPITIGSILLRLFSRIVTARLSKACPLNPR QRGFIRAAGCSENLKLLQTIIWSAKREHRPLGVVFV DIAKAFDTVSHQHIIHALQQREVDPHIVGLVSNMYE NISTYITTKRNTHTDKIQIRVGVKQGDPMSPLLFNL AMDPLLCKLEESGKGYHRGQSSITAMAFADDLVLLS DSWENMNTNISILETFCNLTGLKTQGQKCHGFYIKP TKDSYTINDCAAWTINGTPLNMIDPGESEKYLGLQF DPWIGIARSGLSTKLDFWLQRIDQAPLKPLQKTDIL KTYTIPRLIYIADHSEVKTALLETLDQKIRTAVKEW LHLPPCTCDAILYSSTRDGGLGITKLAGLIPSVQAR RLHRIAQSSDDTMKCFMEKEKMEQLHKKLWIQAGGD RENIPSIWEAPPSSEPPNNVSTNSEWEAPTQKDKFP KPCNWRKNEFKKWTKLASQGRGIVNFERDKISNHWI QYYRRIPHRKLLTALQLRANVYPTREFLARGRQDQY IKACRHCDADIESCAHIIGNCPVTQDARIKRHNYIC ELLLEEAKKKDWVVFKEPHIRDSNKELYKPDLIFVK DARALVVDVTVRYEAAKSSLEEAAAEKVRKYKHLET EVRHLTNAKDVTFVGFPLGARGKWHQDNFKLLTELG LSKSRQVKMAETFSTVALFSSVDIVHMFASRARKSM VM R2Tg MASCPKPGPPVSAGAMSLESGLTTHSVLAIERGPNS 1020 withDBD LANSGSDFGGGGLGLPLRLLRVSVGTQTSRSDWVDL Linker VSWSHPGPTSKSQQVDLVSLFPKHRVDLLSKNDQVD v2B LVAQFLPSKFPPNLAENDLALLVNLEFYRSDLHVYE CVHFAAHWEGLSGLPEVYEQLAPQPCVGETLHSSLP RDSELFVPEEGSSEKESEDAPKTSPPTPGKHGLEQT GEEKVMVTVPDKNPPCPCCGTRVNSVLNLIEHLKVS HGKRGVCFRCAKCGKENSNYHSVVCHFPKCRGPETE KAPAGEWICEVCNRDFTTKIGLGQHKRLAHPAVRNQ ERIVASQPKETSNRGAHKRCWTKEEEELLIRLEAQF EGNKNINKLIAEHITTKTAKQISDKRRLLSRKPAEE PREEPGTCHHTRRAAASLRTEPEMSHHAQAEDRDNG PGRRPLPGRAAAGGRTMDEIRRHPDKGNGQQRPTKQ KSEEQLQAYYKKTLEERLSAGALNTFPRAFKQVMEG RDIKLVINQTAQDCFGCLESISQIRGGGSTATRDKK DTVTREKHPKKPFQKWMKDRAIKKGNYLRFQRLFYL DRGKLAKIILDDIECLSCDIPLSEIYSVFKTRWETT GSFKSLGDFKTYGKADNTAFRELITAKEIEKNVQEM SKGSAPGPDGITLGDVVKMDPEFSRTMEIFNLWLTT GKIPDMVRGCRTVLIPKSSKPDRLKDINNWRPITIG SILLRLFSRIVTARLSKACPLNPRQRGFIRAAGCSE NLKLLQTIIWSAKREHRPLGVVFVDIAKAFDTVSHQ HIIHALQQREVDPHIVGLVSNMYENISTYITTKRNT HTDKIQIRVGVKQGDPMSPLLFNLAMDPLLCKLEES GKGYHRGQSSITAMAFADDLVLLSDSWENMNTNISI LETFCNLTGLKTQGQKCHGFYIKPTKDSYTINDCAA WTINGTPLNMIDPGESEKYLGLQFDPWIGIARSGLS TKLDFWLQRIDQAPLKPLQKTDILKTYTIPRLIYIA DHSEVKTALLETLDQKIRTAVKEWLHLPPCTCDAIL YSSTRDGGLGITKLAGLIPSVQARRLHRIAQSSDDT MKCFMEKEKMEQLHKKLWIQAGGDRENIPSIWEAPP SSEPPNNVSTNSEWEAPTQKDKFPKPCNWRKNEFKK WTKLASQGRGIVNFERDKISNHWIQYYRRIPHRKLL TALQLRANVYPTREFLARGRQDQYIKACRHCDADIE SCAHIIGNCPVTQDARIKRHNYICELLLEEAKKKDW VVFKEPHIRDSNKELYKPDLIFVKDARALVVDVTVR YEAAKSSLEEAAAEKVRKYKHLETEVRHLTNAKDVT FVGFPLGARGKWHQDNFKLLTELGLSKSRQVKMAET FSTVALFSSVDIVHMFASRARKSMVM R2Tg MASCPKPGPPVSAGAMSLESGLTTHSVLAIERGPNS 1021 withDBD LANSGSDFGGGGLGLPLRLLRVSVGTQTSRSDWVDL Linker VSWSHPGPTSKSQQVDLVSLFPKHRVDLLSKNDQVD v3A LVAQFLPSKFPPNLAENDLALLVNLEFYRSDLHVYE CVHFAAHWEGLSGLPEVYEQLAPQPCVGETLHSSLP RDSELFVPEEGSSEKESEDAPKTSPPTPGKHGLEQT GEEKVMVTVPDKNPPCPCCGTRVNSVLNLIEHLKVS HGKRGVCFRCAKCGKENSNYHSVVCHFPKCRGPETE KAPAGEWICEVCNRDFTTKIGLGQHKRLAHPAVRNQ ERIVASQPKETSNRGAHKRCWTKEEEELLIRLEAQF EGNKNINKLIAEHITTKTAKQISDKRRLLSRKPAEE PREEPGTCHHTRRAASGSETPGTSESATPESASLRT EPEMSHHAQAEDRDNGPGRRPLPGRAAAGGRTMDEI RRHPDKGNGQQRPTKQKSEEQLQAYYKKTLEERLSA GALNTFPRAFKQVMEGRDIKLVINQTAQDCFGCLES ISQIRTATRDKKDTVTREKHPKKPFQKWMKDRAIKK GNYLRFQRLFYLDRGKLAKIILDDIECLSCDIPLSE IYSVFKTRWETTGSFKSLGDFKTYGKADNTAFRELI TAKEIEKNVQEMSKGSAPGPDGITLGDVVKMDPEFS RTMEIFNLWLTTGKIPDMVRGCRTVLIPKSSKPDRL KDINNWRPITIGSILLRLFSRIVTARLSKACPLNPR QRGFIRAAGCSENLKLLQTIIWSAKREHRPLGVVFV DIAKAFDTVSHQHIIHALQQREVDPHIVGLVSNMYE NISTYITTKRNTHTDKIQIRVGVKQGDPMSPLLFNL AMDPLLCKLEESGKGYHRGQSSITAMAFADDLVLLS DSWENMNTNISILETFCNLTGLKTQGQKCHGFYIKP TKDSYTINDCAAWTINGTPLNMIDPGESEKYLGLQF DPWIGIARSGLSTKLDFWLQRIDQAPLKPLQKTDIL KTYTIPRLIYIADHSEVKTALLETLDQKIRTAVKEW LHLPPCTCDAILYSSTRDGGLGITKLAGLIPSVQAR RLHRIAQSSDDTMKCFMEKEKMEQLHKKLWIQAGGD RENIPSIWEAPPSSEPPNNVSTNSEWEAPTQKDKFP KPCNWRKNEFKKWTKLASQGRGIVNFERDKISNHWI QYYRRIPHRKLLTALQLRANVYPTREFLARGRQDQY IKACRHCDADIESCAHIIGNCPVTQDARIKRHNYIC ELLLEEAKKKDWVVFKEPHIRDSNKELYKPDLIFVK DARALVVDVTVRYEAAKSSLEEAAAEKVRKYKHLET EVRHLTNAKDVTFVGFPLGARGKWHQDNFKLLTELG LSKSRQVKMAETFSTVALFSSVDIVHMFASRARKSM VM R2Tg MASCPKPGPPVSAGAMSLESGLTTHSVLAIERGPNS 1022 withDBD LANSGSDFGGGGLGLPLRLLRVSVGTQTSRSDWVDL Linker VSWSHPGPTSKSQQVDLVSLFPKHRVDLLSKNDQVD v3B LVAQFLPSKFPPNLAENDLALLVNLEFYRSDLHVYE CVHFAAHWEGLSGLPEVYEQLAPQPCVGETLHSSLP RDSELFVPEEGSSEKESEDAPKTSPPTPGKHGLEQT GEEKVMVTVPDKNPPCPCCGTRVNSVLNLIEHLKVS HGKRGVCFRCAKCGKENSNYHSVVCHFPKCRGPETE KAPAGEWICEVCNRDFTTKIGLGQHKRLAHPAVRNQ ERIVASQPKETSNRGAHKRCWTKEEEELLIRLEAQF EGNKNINKLIAEHITTKTAKQISDKRRLLSRKPAEE PREEPGTCHHTRRAAGGGSASLRTEPEMSHHAQAED RDNGPGRRPLPGRAAAGGRTMDEIRRHPDKGNGQQR PTKQKSEEQLQAYYKKTLEERLSAGALNTFPRAFKQ VMEGRDIKLVINQTAQDCFGCLESISQIRTATRDKK DTVTREKHPKKPFQKWMKDRAIKKGNYLRFQRLFYL DRGKLAKIILDDIECLSCDIPLSEIYSVFKTRWETT GSFKSLGDFKTYGKADNTAFRELITAKEIEKNVQEM SKGSAPGPDGITLGDVVKMDPEFSRTMEIFNLWLTT GKIPDMVRGCRTVLIPKSSKPDRLKDINNWRPITIG SILLRLFSRIVTARLSKACPLNPRQRGFIRAAGCSE NLKLLQTIIWSAKREHRPLGVVFVDIAKAFDTVSHQ HIIHALQQREVDPHIVGLVSNMYENISTYITTKRNT HTDKIQIRVGVKQGDPMSPLLFNLAMDPLLCKLEES GKGYHRGQSSITAMAFADDLVLLSDSWENMNTNISI LETFCNLTGLKTQGQKCHGFYIKPTKDSYTINDCAA WTINGTPLNMIDPGESEKYLGLQFDPWIGIARSGLS TKLDFWLQRIDQAPLKPLQKTDILKTYTIPRLIYIA DHSEVKTALLETLDQKIRTAVKEWLHLPPCTCDAIL YSSTRDGGLGITKLAGLIPSVQARRLHRIAQSSDDT MKCFMEKEKMEQLHKKLWIQAGGDRENIPSIWEAPP SSEPPNNVSTNSEWEAPTQKDKFPKPCNWRKNEFKK WTKLASQGRGIVNFERDKISNHWIQYYRRIPHRKLL TALQLRANVYPTREFLARGRQDQYIKACRHCDADIE SCAHIIGNCPVTQDARIKRHNYICELLLEEAKKKDW VVFKEPHIRDSNKELYKPDLIFVKDARALVVDVTVR YEAAKSSLEEAAAEKVRKYKHLETEVRHLTNAKDVT FVGFPLGARGKWHQDNFKLLTELGLSKSRQVKMAET FSTVALFSSVDIVHMFASRARKSMVM
(202) HEK293T cells were plated in 96-well plates and grown overnight at 37? C., 5% CO2. The HEK293T cells were transfected with plasmids that expressed R2Tg (wild-type), R2 endonuclease mutant, and linker mutants. The transfection was carried out using the Fugene HD transfection reagent according to the manufacturer recommendations, where each well received 80 ng of plasmid DNA and 0.5 ?L of transfection reagent. All transfections were performed in duplicate and the cells were incubated for 72 h prior to genomic DNA extraction.
(203) Activity of the mutants was measured by a ddPCR assay that quantified the copy number of R2Tg integration per genome. The 5 and 3 junctions were quantified by generating two different amplicons at each end.
(204) v3 (near the c-myb binding motif in the DBD) decreased integration activity with either linker A or B. v1 (N-terminal to the alpha helix preceding the ?1 RNA binding motif) had comparable activity to the wild-type when used with linker A (16 AA) versus the shorter linker B (4 AA). This could be related to amino acid selection, length, or three-dimensional structure. v2 (C-terminal to the alpha helix preceding the ?1 RNA binding motif) did not tolerate linker A; however, linker B had activity that was comparable and slightly better than the wild-type. v1 and v2 may therefore be considered preferred locations to add a linker that can separate R2Tg's DNA binding domain and the rest of the protein.
Example 27: Long-Read Sequencing to Determine Integration Fidelity
(205) Retrotransposon integration experiments were performed as described in previous examples. In one example, PCR amplification was used to generate amplicons by designing one primer targeting the genomic integration site and one primer targeting the integrant sequence. In this example, these primers were designed to maximize the length of the amplified genomic locus fused with the integrant sequence. By pooling amplicons spanning both ends of the integrant and performing long-read next-generation sequencing, the fidelity of each integration was evaluated.
(206) A cis construct of R2Tg was integrated into 293T cells via plasmid transfection as described herein. Amplicons spanning each end of the integrations were generated with flanking randomized UMIs to control for PCR bias. These amplicons were sequenced with PacBio next-generation sequencing. The resulting sequences were collapsed to remove reads with identical UMIs. By aligning unique reads, a coverage plot was constructed as shown in
(207) In another example, hybrid capture may be performed as described in a previous example but with a larger target library length during initial library generation. The resulting library can then be subjected to long-read next-generation sequencing.
Example 28: Targeted Delivery of R2Gfo and R4Al Retrotransposon to Mammalian Cells
(208) This example describes targeted integration of the R2Gfo and R4Al retrotransposon elements to mammalian cells via DNA delivery.
(209) In one example, we assayed the full R2 element R2-1_GFo (Repbase; Kojima et al PLOS One 11, e0163496 (2015)) from the medium ground finch, Geospiza fortis (R2GFo). In another example, we assayed the full R4 element R4_AL (Repbase; Burke et al Nucleic Acids Res. 23, 4628-34 (1995)) from the large roundworm, Ascaris lumbricoides (R4Al). Because non-LTR R2 and R4 elements are not present in the human genome and are thought to be highly site-specific, the ability of retrotransposons to accurately and efficiently integrate itself into the human genome would demonstrate the capability to perform genomic targeted integration.
(210) Plasmids harboring R2Gfo (PLV033) or R4Al (PLV462) were designed for cis integration of the R2Gfo or R4Al elements as in previous examples. Plasmids were synthesized such that the wildtype element was flanked by its native un-translated regions (UTRs) and 100 bp of homology to its rDNA target (
(211) ddPCR was performed to confirm integration and assess integration efficiency. A Taqman probe was designed to the 3UTR portion of each element. A forward primer was synthesized to bind directly upstream of the probe, and a reverse primer was synthesized to bind the rDNA. Thus, amplification of the expected product across the integration junction would degrade the probe and create a fluorescent signal. The results of the ddPCR copy number analysis (in comparison to reference gene RPP30) are shown in
Example 29: Integration of Retrotransposons into Human Fibroblasts
(212) This example describes the cis integration of R2Tg into human fibroblasts. Briefly, a plasmid designed to integrate R2Tg in cis was synthesized such that R2Tg was flanked by its native UTRs and homologous sequence to its rDNA target as in previous examples. 0.5 ?g PLV014 (wild-type) and PLV072 (EN mutant) plasmids were transfected into 100,000 human dermal fibroblasts isolated from neonatal foreskin (HDFn, C0045C, ThermoFisher Scientific) respectively using the Neon transfection system. Two programs were performed, each in duplicate. The setting for Program 1 was 1700V pulse voltage, 20 ms pulse width, and 1 pulse number. The setting for Program 2 was 1400V pulse voltage, 20 ms pulse width, and 2 pulse number. Both programs achieved 95% transfection efficiency measured using plasmid encoding the EGFP. Three days post transfection, genomic DNA was extracted for the ddPCR assay. ddPCR was performed to confirm integration and assess integration efficiency. A Taqman probe was designed to the 3 UTR portion of the R2Tg element. A forward primer was synthesized to bind directly upstream of the probe, and a reverse primer was synthesized to bind the rDNA. Thus, amplification of the expected product across the integration junction would degrade the probe and create a fluorescent signal. The results of ddPCR copy number analysis (in comparison to reference gene RPP30) are shown in
Example 30: Evaluation of DNA Damage Response Upon Retrotransposon Transfection
(213) DNA damage (e.g., resulting from DSB formation or replication fork collapse) leads to the activation of p53, which among many other transcriptional responses, leads to the upregulation of p21, resulting in cell cycle arrest or apoptosis. Genome editing using CRISRP/Cas9 has been shown to activate p53 and p21, which is a potential safety and efficacy problem for CRISPR/based therapeutics. To establish whether R2Tg delivery to the cell leads to activation of p53 and p21, U2OS cells were seeded at a density of 4?10.sup.4 cells/well and transfected 24 hours later using the Fugene HD and Lipofectamine reagents with either 500 ng of R2Tg-WT plasmid or 500 ng of R2Tg-EN (a variant of R2Tg with a mutation in the endonuclease (EN) domain, rendering R2Tg inactive). To control for transfection efficiency, U2OS cells were also transfected with a plasmid expressing GFP. Lastly, as a positive control for p53 and p21 activation, U2OS cells were treated with one of the DNA damage-inducing agents etoposide (20 ?M) or bleomycin (10 ?g/ml). The U2OS cells were collected 24 hours after transfection/treatment. Protein lysates were prepared in RIPA buffer and run on an SDS-PAGE gel, followed by transfer to nitrocellulose, followed by probing with antibodies against p53 and p21, as well as Actin and Vinculin. As shown in