HIGH YIELDS OF ISOMELEZITOSE FROM SUCROSE BY ENGINEERED GLUCANSUCRASES
20180320149 ยท 2018-11-08
Inventors
Cpc classification
C12P19/18
CHEMISTRY; METALLURGY
C12P19/00
CHEMISTRY; METALLURGY
International classification
Abstract
Provided herein are compositions and methods for the synthesis of the trisaccharide, isomelezitose, using genetically modified glucansucrase enzymes from representative microorganisms, including lactic acid bacteria such as Leuconostoc mesenteroides. Various modified enzymes are detailed, increasing isomelezitose yields and provide the foundation for large-scale production of isomelezitose for food, industrial and biomedical applications.
Claims
1. A modified glucansucrase enzyme comprising a domain B motif, wherein the leucine residue equivalent to L441 of SEQ ID NO:5 in the domain B motif is substituted with an amino acid other than leucine and wherein the modified enzyme produces at least twice as much isomelezitose from sucrose as compared to the unmodified glucansucrase enzyme.
2. The modified glucansucrase of claim 1, wherein the substituting amino acid is proline.
3. The modified enzyme of claim 1, wherein the modified enzyme comprises an amino acid sequence that is at least 90% identical to SEQ ID NO:5 or SEQ ID NO:6, wherein the leucine residue at position 441 of SEQ ID NO:5 or position 400 of SEQ ID NO:6 is substituted with an amino acid other than leucine.
4. The modified enzyme of claim 3, wherein the substituting amino acid is arginine, asparagine, aspartic acid, glutamine, glutamic acid, glycine, isoleucine, lysine, proline, serine, threonine, or valine.
5. The modified enzyme of claim 1, wherein the modified enzyme comprises an amino acid sequence that is at least 90% identical to SEQ ID NO:8 or SEQ ID NO:9, wherein the leucine residue at position 459 of SEQ ID NO:8 or position 417 of SEQ ID NO:9 is substituted with an amino acid other than leucine.
6. The modified enzyme of claim 5, wherein the substituting amino acid is proline.
7. The modified enzyme of claim 1, wherein the modified enzyme comprises an amino acid sequence that is at least 90% identical to SEQ ID NO:2 or SEQ ID NO:3, wherein the leucine residue at position 544 of SEQ ID NO:2 or position 505 of SEQ ID NO:3 is substituted with an amino acid other than leucine.
8. The modified enzyme of claim 7, wherein the substituting amino acid is glutamic acid, proline, or serine.
9. The modified enzyme of claim 1, wherein the modified enzyme comprises an amino acid sequence that is at least 90% identical to SEQ ID NO:11 or SEQ ID NO:12, wherein the leucine residue at position 350 of SEQ ID NO:11 or position 312 of SEQ ID NO:12 is substituted with an amino acid other than leucine.
10. The modified enzyme of claim 9, wherein the substituting amino acid is arginine, glutamic acid, proline, or serine.
11. The modified enzyme of claim 1, wherein the modified enzyme comprises an amino acid sequence that is at least 90% identical to SEQ ID NO:14 or SEQ ID NO:15, wherein the leucine residue at position 417 of SEQ ID NO:14 or position 380 of SEQ ID NO:15 is substituted with an amino acid other than leucine.
12. The modified enzyme of claim 11, wherein the substituting amino acid is proline.
13. A DNA molecule encoding a modified enzyme of any of claims 1-12.
14. A host cell comprising the DNA molecule of claim 13.
15. A method of producing isomelezitose comprising the steps of contacting a modified enzyme of any of claims 1-12 with a solution comprising a carbohydrate source, and allowing the modified enzyme to convert at least a portion of the carbohydrate source to isomelezitose.
16. The method of claim 15, further comprising the step of expressing the modified enzyme in a recombinant host cell.
17. The method of claim 16, further comprising the step of purifying the modified enzyme prior to contacting it with the carbohydrate source.
18. The method of claim 15, wherein the carbohydrate source comprises sucrose.
19. The method of claim 18, wherein the sucrose is in aqueous solution and is at a concentration of about 1.0 M.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0010] The novel features of the invention are set forth with particularity in the claims. Features and advantages of the present invention are referred to in the following detailed description, and the accompanying drawings of which:
[0011]
[0012]
[0013]
[0014]
[0015]
[0016]
DETAILED DESCRIPTION OF THE INVENTION
[0017] Provided herein are modified enzymes (glucansucrases) containing mutations in a highly conserved leucine residue within a conserved motif. These modified enzymes are capable of producing elevated levels of isomelezitose from sucrose as compared to unmodified (wild-type) enzymes. In some embodiments, such modified enzymes are exposed to sucrose solutions and allowed to produce isomelezitose. In preferred embodiments, the modified enzymes of the present invention are produced by recombinant cells and at least partially purified before exposure to sucrose solutions.
[0018] Preferred embodiments of the present invention are shown and described herein. It will be obvious to those skilled in the art that such embodiments are provided by way of example only. Numerous variations, changes, and substitutions will occur to those skilled in the art without departing from the invention. Various alternatives to the embodiments of the invention described herein may be employed in practicing the invention. It is intended that the included claims define the scope of the invention and that methods and structures within the scope of these claims and their equivalents are covered thereby.
[0019] Technical and scientific terms used herein have the meanings commonly understood by one of ordinary skill in the art to which the instant invention pertains, unless otherwise defined. Reference is made herein to various materials and methodologies known to those of skill in the art. Standard reference works setting forth the general principles of recombinant DNA technology include Sambrook et al., Molecular Cloning: A Laboratory Manual, 2d ed., Cold Spring Harbor Laboratory Press, Plainview, N.Y., 1989; Kaufman et al., eds., Handbook of Molecular and Cellular Methods in Biology and Medicine, CRC Press, Boca Raton, 1995; and McPherson, ed., Directed Mutagenesis: A Practical Approach, IRL Press, Oxford, 1991. Standard reference literature teaching general methodologies and principles of fungal genetics useful for selected aspects of the invention include: Sherman et al. Laboratory Course Manual Methods in Yeast Genetics, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 1986 and Guthrie et al., Guide to Yeast Genetics and Molecular Biology, Academic, New York, 1991.
[0020] Any suitable materials and/or methods known to those of skill can be utilized in carrying out the instant invention. Materials and/or methods for practicing the instant invention are described. Materials, reagents and the like to which reference is made in the following description and examples are obtainable from commercial sources, unless otherwise noted. This invention teaches methods and describes tools for producing genetically altered host cells producing genetically modified glucansucrases from lactic acid bacteria, including Leuconostoc mesenteroides.
[0021] As used in the specification and claims, use of the singular a, an, and the include plural references unless the context clearly dictates otherwise.
[0022] The terms isolated, purified, or biologically pure as used herein, refer to material that is substantially or essentially free from components that normally accompany the referenced material in its native state.
[0023] The term about is defined as plus or minus ten percent of a recited value. For example, about 1.0 g means 0.9 g to 1.1 g and all values within that range, whether specifically stated or not.
[0024] The term equivalent amino acid, and grammatical variations thereof, refers to the same highly conserved amino acid residue in a conserved protein domain, regardless of its numerical position in a given amino acid sequence. For example, all leucine residues indicated by the asterisk and bold font in
[0025] As described herein, a single amino acid residue substitution can be indicated as follows: the original amino acid residue (expressed as a single-letter abbreviation), followed by the position of the original amino acid residue (i.e., a numerical expression), followed by the new amino acid residue (expressed as a single-letter abbreviation) to be inserted in place of the original amino acid residue. For example, L441G means that the original leucine (L) residue at position 441 is to be replaced by the new glycine (G) residue. For multiple substitutions (e.g., double-substitutions, triple-substitutions, and quadruple-substitutions), the various substitutions are separated by either a slash (/) or by a space.
[0026] Modified enzymes of the present invention also include enzymes with high identity or homology to a reference sequence. For example, proteins having 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to any of SEQ. ID. NOs: 2, 3, 5, 6, 8, 9, 11, 12, 14 or 15 are provided herein. As a practical matter, whether any particular amino acid sequence having a percentage identity to a given amino acid sequence can be determined conventionally using known computer programs to find the best segment of homology between two sequences. When using sequence alignment program to determine whether a particular sequence is, for instance, 96% identical to a reference sequence according to the present invention, the parameters are set, of course, such that the percentage of identity is calculated over the full length of the reference peptide sequence and that gaps in homology of up to 4% of the total number of amino acids in the reference sequence are allowed.
[0027] Molecular Biological Methods
[0028] An isolated nucleic acid is a nucleic acid the structure of which is not identical to that of any naturally occurring nucleic acid. The term therefore covers, for example, (a) a DNA which has the sequence of part of a naturally occurring genomic DNA molecule but is not flanked by both of the coding or noncoding sequences that flank that part of the molecule in the genome of the organism in which it naturally occurs; (b) a nucleic acid incorporated into a vector or into the genomic DNA of a prokaryote or eukaryote in a manner such that the resulting molecule is not identical to any naturally occurring vector or genomic DNA; (c) a separate molecule such as a cDNA, a genomic fragment, a fragment produced by polymerase chain reaction (PCR), or a restriction fragment; and (d) a recombinant nucleotide sequence that is part of a hybrid gene, i.e., a gene encoding a fusion protein. Specifically excluded from this definition are nucleic acids present in mixtures of (i) DNA molecules, (ii) transformed or transfected cells, and (iii) cell clones, e.g., as these occur in a DNA library such as a cDNA or genomic DNA library.
[0029] The term recombinant nucleic acids refers to polynucleotides which are made by the combination of two otherwise separated segments of sequence accomplished by the artificial manipulation of isolated segments of polynucleotides by genetic engineering techniques or by chemical synthesis. In so doing one may join together polynucleotide segments of desired functions to generate a desired combination of functions.
[0030] Recombinant host cells, in the present context, are those which have been genetically modified to contain an isolated nucleic molecule of the instant invention. The nucleic acid can be introduced by any means known to the art which is appropriate for the particular type of cell, including without limitation, transformation, lipofection, electroporation or any other methodology known by those skilled in the art.
[0031] In practicing some embodiments of the invention disclosed herein, it can be useful to modify the DNA of a strain of lactic acid bacteria, or another target organism that produces a glucansucrase to be modified. In many embodiments, such modification involves replacing an innate gene with an artificially modified version, such that a modified protein is produced when the modified gene is expressed. Alternately, isolated nucleic acids encoding any of the proteins of the present invention can be inserted into the genome of any desired host cell. Such modifications that result in the change of one or more amino acids from a wild-type sequence can be achieved using any technique known to those of skill in the art.
[0032] Alternately, expression plasmids containing a modified gene of interest can be introduced in a host from which the gene was not originally derived (e.g., expressing a L. mesenteroides gene in Escherichia coli). Where a recombinant nucleic acid is intended for expression, cloning, or replication of a particular sequence, DNA constructs prepared for introduction into a prokaryotic or eukaryotic host can comprise a replication system (i.e. vector) recognized by the host, including the intended DNA fragment encoding the desired polypeptide, and can also include transcription and translational initiation regulatory sequences operably linked to the polypeptide-encoding segment. Expression systems (expression vectors) may include, for example, an origin of replication or autonomously replicating sequence (ARS) and expression control sequences, a promoter, an enhancer and necessary processing information sites, such as ribosome-binding sites, RNA splice sites, polyadenylation sites, transcriptional terminator sequences, and mRNA stabilizing sequences. Signal peptides may also be included where appropriate from secreted polypeptides of the same or related species, which allow the protein to cross and/or lodge in cell membranes or be secreted from the cell.
[0033] Vectors and other nucleic acids introduced into a host cell will likely contain a selectable marker, that is, a gene encoding a protein necessary for the survival or growth of a host cell transformed with the nucleic acid. Although such a marker gene can be carried on another polynucleotide sequence co-introduced into the host cell, it is most often contained on the transforming nucleic acid. Only those host cells into which the marker gene has been introduced will survive and/or grow under selective conditions. Typical selection genes encode proteins that (a) confer resistance to antibiotics or other toxic substances, e.g., hygromycin, ampicillin, neomycin, methotrexate, etc.; (b) complement auxotrophic deficiencies; or (c) supply critical nutrients not available from complex media. The choice of the proper selectable marker will depend on the host cell and appropriate markers for different hosts are well known in the art.
[0034] Screening and molecular analysis of recombinant strains of the present invention can be performed utilizing nucleic acid hybridization techniques. Hybridization procedures are useful for identifying polynucleotides, such as those modified using the techniques described herein, with sufficient homology to the subject regulatory sequences to be useful as taught herein. The particular hybridization techniques are not essential to the subject invention. As improvements are made in hybridization techniques, they can be readily applied by one of skill in the art. Hybridization probes can be labeled with any appropriate label known to those of skill in the art. Hybridization conditions and washing conditions, for example temperature and salt concentration, can be altered to change the stringency of the detection threshold. See, e.g., Sambrook et al. (1989) vide infra or Ausubel et al. (1995) Current Protocols in Molecular Biology, John Wiley & Sons, NY, N.Y., for further guidance on hybridization conditions.
[0035] Additionally, screening and molecular analysis of genetically altered strains, as well as creation of desired isolated nucleic acids can be performed using Polymerase Chain Reaction (PCR). PCR is a repetitive, enzymatic, primed synthesis of a nucleic acid sequence. This procedure is well known and commonly used by those skilled in this art (see Mullis, U.S. Pat. Nos. 4,683,195, 4,683,202, and 4,800,159; Saiki et al. (1985) Science 230:1350-1354). PCR is based on the enzymatic amplification of a DNA fragment of interest that is flanked by two oligonucleotide primers that hybridize to opposite strands of the target sequence. The primers are oriented with the 3 ends pointing towards each other. Repeated cycles of heat denaturation of the template, annealing of the primers to their complementary sequences, and extension of the annealed primers with a DNA polymerase result in the amplification of the segment defined by the 5 ends of the PCR primers. Since the extension product of each primer can serve as a template for the other primer, each cycle essentially doubles the amount of DNA template produced in the previous cycle. This results in the exponential accumulation of the specific target fragment, up to several million-fold in a few hours. By using a thermostable DNA polymerase such as the Taq polymerase, which is isolated from the thermophilic bacterium Thermus aquaticus, the amplification process can be completely automated. Other enzymes which can be used are known to those skilled in the art.
[0036] Hybridization-based screening of genetically altered strains typically utilizes homologous nucleic acid probes with homology to a target nucleic acid to be detected. The extent of homology between a probe and a target nucleic acid can be varied according to the particular application. Homology (level of sequence identity) can be 50%-100%. In some instances, such homology is greater than 80%, greater than 85%, greater than 90%, or greater than 95%. The degree of homology or identity needed for any intended use of the sequence(s) is readily identified by one of skill in the art. As used herein percent sequence identity of two nucleic acids is determined using the algorithm of Karlin and Altschul (1990) Proc. Natl. Acad. Sci. USA 87:2264-2268, modified as in Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-5877. Such an algorithm is incorporated into the NBLAST and XBLAST programs of Altschul et al. (1990) J. Mol. Biol. 215:402-410. BLAST nucleotide searches are performed with the NBLAST program, score=100, wordlength=12, to obtain nucleotide sequences with the desired percent sequence identity. To obtain gapped alignments for comparison purposes, Gapped BLAST is used as described in Altschul et al. (1997) Nucl. Acids. Res. 25:3389-3402. When utilizing BLAST and Gapped BLAST programs, the default parameters of the respective programs (NBLAST and XBLAST) are used. See www.ncbi.nih.gov. Additional tools, such as Lipman-Pearson alignment can also be utilized. (Pearson & Lipman, Proc. Nat'l. Acad. Sci. U.S.A., (1988) 85:2444-8).
[0037] Glucansucrases
[0038] Glucansucrases (from Glycoside-Hydrolase (GH)-family 70 (EC. 2.4.1.5)) are extracellular enzymes produced by lactic acid bacteria of the genera Leuconostoc, Streptococcus, or Lactobacillus (Monsan et al., Int'l. Dairy J., (2001) 11:675-85). Glucansucrases are a type of glucosyltransferase that catalyzes the transfer of D-glucopyranosyl units from sucrose to form -glucan chains. These enzymes are capable of catalyzing the synthesis of several different polymeric -glucosidic linkages that affect molecular mass, branching, and solubility of the polysaccharide. In general, -glucans containing mostly (1.fwdarw.6) linkages are water-soluble (e.g., dextran), while those made primarily of (1.fwdarw.3) linkages are water-insoluble. The enzymes of the GH-family 70 are diverse, being able to synthesize all the types of glucosidic linkages, namely -1,2; -1,3; -1,4; or -1,6 glucosidic bonds. Thus, depending on the enzyme specificity, a wide range of glucans can be produced, varying in terms of size, structure, degree of branches and spatial arrangements.
[0039] Despite these divergent capabilities, enzymes of the GH-family 70 have highly conserved components. They are characterized by having the same general structure consisting of a signal sequence, a variable region at the N-terminus, a conserved catalytic domain, and a C-terminal domain typically comprised of a series of homologous repeating units (Moulis et al., J. Biol. Chem., (2006) 281:31254-67). The catalytic domain is predicted to be organized in a (/).sub.8-barrel (MacGregor et al., FEBS Lett. (1996) 378:263-6).
[0040] One such conserved motif (domain B) in these enzymes is demonstrated in
[0041] As demonstrated herein via the various exemplary glucansucrases modified as described, this leucine residue (i.e., the equivalent leucine to L441 from SEQ ID NO:5), can be modified to improve production of isomelezitose. This leucine residue can be identified in most glucansucrases as the second amino acid position in the following protein motif (domain B motif) presented in PROSITE pattern format: [HQW]-L-Q-[NG]-G-[FAY]-[LV]-X-[YF]-X-[ND]. A few exceptions among this diverse enzyme group that lack an equivalent leucine residue at this position include reuteran-producing glucansucrase (e.g., reuteransucrase GtfA from L. reuteri (Kralj, et al., Appl. Environ. Microbiol., (2002) 68:4283-91) and the catalytic domain 2 of (1.fwdarw.2) synthesizing glucansucrases (e.g., DsrE from L. mesenteroides (Bozonnet, et al., J. Bacteriol., (2002)184:5753-61), which contain a phenylalanine in this position.
[0042] Table 1 provides wild-type nucleic acid and amino acid sequences of the various glucansucrases mutated and analyzed as described herein.
TABLE-US-00001 TABLE1 Sequencesofwild-typeenzymes SEQ Type Sequence IDNO: asrcoding ATGAAACAACAAGAAACAGTTACCCGTAAAAAACTTTATAAATCCGG 1 sequence TAAGGTTTGGGTTGCAGCAGCTACTGCATTTGCGGTATTGGGGGTTTC (L.citreum) AACTGTAACAACAGTCCATGCGGATACAAATTCGAATGTCGCTGTTA AGCAAATAAATAATACAGGAACCAATGATTCTGGCGAAAAAAAGGT ACCGGTTCCATCAACTAATAATGATAGTTTGAAGCAAGGAACAGATG GTTTTTGGTATGATTCAGACGGCAATCGTGTCGATCAGAAGACCAATC AGATTCTGCTTACTGCGGAACAACTTAAAAAAAATAACGAAAAAAAT TTATCAGTAATCAGTGATGATACATCAAAAAAAGATGATGAAAATAT TTCTAAGCAGACCAAAATTGCTAATCAACAAACAGTAGATACTGCTA AAGGCCTGACTACCAGTAATTTATCTGATCCCATCACTGGGGGTCACT ATGAAAATCACAATGGCTACTTTGTTTATATAGATGCTTCAGGAAAAC AAGTAACAGGTTTGCAAAATATTGATGGTAATTTACAATATTTTGATG ACAATGGATATCAAGTCAAGGGATCCTTCCGAGATGTCAACGGCAAG CATATCTATTTTGATTCAGTAACAGGGAAAGCTAGTTCAAATGTTGAT ATTGTTAACGGTAAAGCTCAAGGATATGATGCGCAAGGCAACCAATT AAAGAAAAGTTATGTCGCCGATAGTTCTGGGCAAACTTACTATTTTGA TGGTAATGGCCAACCGTTAATCGGCTTGCAAACAATTGATGGGAACC TACAATATTTTAACCAACAAGGGGTTCAAATAAAGGGTGGTTTCCAA GATGTTAACAATAAACGTATTTATTTTGCACCAAACACAGGTAATGCC GTTGCCAATACTGAAATAATTAACGGTAAATTACAGGGGCGTGACGC AAATGGTAACCAGGTAAAGAATGCATTTAGTAAAGATGTTGCAGGAA ATACATTTTATTTTGACGCAAACGGTGTGATGTTAACAGGGTTGCAAA CTATTTCAGGAAAGACATATTATCTTGATGAACAAGGACACCTGAGA AAAAATTACGCGGGAACATTCAATAATCAGTTTATGTACTTCGATGCT GATACAGGTGCGGGTAAAACAGCGATTGAATATCAATTTGATCAAGG ATTGGTATCACAAAGTAATGAAAATACTCCTCACAATGCCGCAAAGT CTTATGATAAAAGTAGTTTTGAAAATGTTGATGGTTACTTAACAGCAG ATACATGGTATCGTCCAACCGATATTTTAAAAAATGGAGATACTTGG ACGGCATCTACCGAAACTGATATGCGTCCGCTTTTAATGACATGGTGG CCTGACAAACAAACACAAGCAAATTACTTGAATTTTATGTCTAGTAA AGGACTTGGTATAACGACCACTTATACAGCAGCTACGTCACAAAAAA CACTAAATGACGCAGCCTTTGTTATTCAAACAGCAATTGAACAACAA ATATCTTTGAAAAAAAGTACTGAGTGGTTACGTGATGCAATTGATAGT TTTGTGAAGACGCAAGCTAATTGGAATAAGCAAACAGAAGATGAAGC TTTCGATGGTTTGCAGTGGCTTCAAGGGGGATTCCTAGCTTATCAAGA TGATTCACATCGGACGCCGAATACTGATTCAGGAAATAACAGAAAAC TAGGACGTCAACCAATTAATATCGATGGTTCGAAAGATACAACTGAT GGTAAAGGCTCTGAATTCTTATTAGCTAACGATATTGACAACTCAAAT CCGATTGTTCAAGCTGAGCAATTAAACTGGCTACACTATTTAATGAAT TTTGGTAGTATTACAGGTAATAATGACAATGCGAATTTTGATGGCATT CGTGTAGATGCTGTTGATAATGTTGATGCTGATTTACTAAAAATAGCT GGCGATTATTTTAAAGCTCTATATGGTACAGATAAAAGCGACGCCAA TGCCAATAAGCATTTGTCTATTTTAGAAGACTGGAACGGTAAAGATCC TCAGTATGTTAATCAACAGGGCAATGCGCAATTAACAATGGATTACA CAGTTACTTCACAGTTTGGCAATTCTCTAACACATGGCGCCAACAACA GGAGTAACATGTGGTATTTCTTAGATACTGGCTATTATCTTAATGGAG ATCTTAATAAGAAGATAGTAGATAAGAACCGTCCAAATTCTGGCACT TTGGTTAACAGAATTGCTAATTCAGGTGATACAAAAGTTATTCCAAAT TATAGTTTTGTTAGAGCACATGATTACGATGCTCAAGATCCAATTAGA AAAGCCATGATTGATCATGGTATTATTAAAAACATGCAGGATACTTTC ACTTTTGACCAACTGGCTCAGGGAATGGAATTCTACTATAAAGATCA AGAGAATCCGTCTGGTTTCAAAAAGTATAACGATTATAACTTACCTAG TGCTTATGCAATGTTGTTGACTAATAAGGATACTGTACCTCGTGTCTA TTATGGAGATATGTACCTCGAAGGCGGGCAATATATGGAAAAAGGGA CGATTTACAATCCTGTCATTTCAGCGTTGCTCAAAGCTAGAATAAAAT ATGTTTCTGGTGGGCAAACAATGGCTACCGATAGTTCTGGAAAAGAC CTTAAAGATGGCGAAACTGATTTGTTAACAAGTGTTCGATTTGGTAAA GGAATTATGACATCAGATCAAACCACAACACAAGACAATAGCCAAGA TTATAAAAATCGAGGCATCGGTGTCATTGTTGGTAATAACCCTGACCT TAAGTTGAACAATGATAAGACCATTACCTTGCATATGGGAAAGGCGC ATAAGAATCAACTTTACCGTGCCTTAGTATTATCAAATGACTCAGGAA TTGATGTTTATGATAGTGATGATAAAGCACCAACTTTGAGAACAAAT GACAACGGTGACTTGATTTTCCATAAGACAAATACGTTTGTGAAGCA AGATGGAACTATTATAAATTACGAAATGAAGGGATCATTAAATGCTT TAATTTCAGGTTATTTAGGTGTCTGGGTGCCAGTTGGAGCTAGTGATT CACAAGATGCTCGTACAGTGGCAACTGAGTCATCATCAAGTAATGAT GGTTCTGTATTCCATTCAAATGCTGCATTAGATTCTAATGTTATATATG AAGGCTTTTCAAACTTTCAAGCGATGCCGACTTCTCCTGAGCAAAGTA CAAATGTTGTTATTGCAACAAAGGCTAACTTATTTAAAGAATTAGGTA TTACTAGTTTTGAGTTAGCACCTCAATATAGGTCTAGTGGTGACACTA ATTACGGTGGCATGTCATTCTTAGATTCTTTCTTAAATAATGGTTATGC ATTTACCGATAGATATGATTTAGGCTTTAACAAAGCAGACGGGAATC CTAACCCAACAAAGTATGGAACAGATCAAGATTTACGTAATGCAATA GAGGCATTACACAAAAACGGCATGCAGGCTATAGCTGATTGGGTTCC TGACCAAATATATGCTTTACCAGGAAAGGAAGTTGTTACCGCTACTA GAGTAGACGAACGGGGAAATCAACTAAAAGACACAGATTTTGTCAAC TTACTCTATGTTGCTAATACTAAAAGTAGTGGTGTGGATTATCAGGCA AAGTATGGCGGCGAATTTTTAGATAAATTAAGAGAAGAGTACCCATC GTTATTCAAACAGAACCAAGTATCGACAGGTCAGCCAATTGATGCTT CTACAAAAATTAAGCAATGGTCAGCTAAATATATGAATGGGACCAAT ATTTTACATCGAGGTGCTTATTATGTTTTGAAAGACTGGGCTACTAAC CAGTATTTTAACATTGCAAAAACGAATGAAGTATTTTTGCCACTACAG TTGCAGAATAAAGATGCGCAAACTGGTTTCATTAGTGATGCCTCCGGT GTAAAATATTACTCAATTAGTGGTTATCAAGCAAAAGATACTTTTATT GAAGATGGTAATGGGAATTGGTATTACTTTGATAAAGATGGTTACAT GGTGCGTTCGCAGCAAGGAGAAAATCCTATAAGAACAGTCGAAACTA GTGTCAACACACGAAACGGTAATTATTACTTTATGCCAAATGGTGTCG AGTTGCGCAAAGGCTTTGGAACGGATAATAGTGGTAATGTCTATTATT TTGATGATCAAGGTAAGATGGTGAGAGATAAATACATTAACGATGAT GCTAATAATTTTTATCACTTAAATGTTGATGGGACTATGTCTCGAGGA CTATTTAAATTTGATTCTGATACTCTACAGTATTTTGCTAGTAATGGTG TCCAAATAAAAGATAGTTATGCGAAGGATAGTAAAGGCAATAAATAT TATTTTGACTCAGCTACAGGAAATAACGATACTGGGAAAGCCCAAAC TTGGGATGGTAATGGCTACTATATTACTATTGATTCTGATGCGAACAA TACAATTGGGGTTAACACAGACTACACTGCCTACATCACTAGCTCGCT GCGCGAAGATGGCTTATTTGCTAACGCACCTTACGGTGTTGTAACAAA AGACCAAAATGGTAACGATCTTAAGTGGCAGTATATTAACCATACGA AACAGTACGAAGGGCAACAAGTGCAAGTCACGCGTCAATACACAGA CAGTAAGGGAGTCAGCTGGAACTTAATTACCTTTGCTGGTGGTGATTT ACAAGGACAAAGGCTTTGGGTGGATAGTCGTGCGTTAACTATGACAC CATTTAAAACGATGAACCAAATAAGCTTCATTAGTTATGCTAACCGCA ATGATGGGTTGTTTTTGAATGCGCCATACCAAGTCAAGGGGTATCAAT TAGCTGGGATGTCCAACCAATACAAGGGCCAACAAGTGACCATTGCT GGGGTGGCGAACGTTTCTGGAAAAGACTGGAGTCTGATTAGTTTTAA TGGGACACAGTACTGGATTGATAGTCAGGCATTGAATACCAATTTCA CACATGACATGAACCAAAAGGTCTTTGTCAATACAACTAGTAATCTTG ATGGGTTATTCTTAAATGCGCCATACCGTCAACCGGGTTATAAGTTAG CCGGTTTGGCTAAAAATTACAACAACCAAACGGTTACTGTTAGTCAA CAGTACTTTGATGATCAAGGCACGGTCTGGAGTCAGGTTGTCCTTGGG GGTCAGACGGTCTGGGTTGATAACCATGCATTGGCACAGATGCAAGT TAGTGATACAGACCAACAGCTCTATGTGAATAGCAATGGTCGGAATG ATGGGTTATTCTTGAATGCGCCATATCGTGGTCAAGGGTCACAACTGA TAGGCATGACGGCAGATTATAATGGGCAACATGTACAAGTGACCAAG CAAGGGCAAGATGCCTATGGTGCACAATGGCGTCTTATTACGCTAAA TAATCAACAGGTCTGGGTTGATAGTCGCGCTTTGAGCACAACAATCAT GCAAGCCATGAATGATAATATGTATGTAAATAGCAGCCAACGGACAG ATGGCTTGTGGTTAAACGCACCTTATACGATGAGTGGGGCTAAATGG GCTGGTGATACACGTTCAGCTAATGGGCGCTATGTCCATATTTCAAAA GCTTATTCAAACGAAGTCGGCAATACATATTACTTGACGAATTTGAAT GGTCAAAGCACATGGATTGACAAGCGGGCGTTTACTGTGACCTTCGA TCAGGTGGTGGCATTAAATGCAACGATTGTGGCACGCCAACGACCAG ATGGGATGTTTAAGACAGCACCATATGGTGAAGCGGGGGCGCAGTTT GTCGATTATGTGACAAACTATAACCAGCAAACCGTGCCAGTAACAAA GCAACATTCAGATGCTCAGGGGAATCAATGGTACTTAGCGACAGTGA ATGGGACACAATACTGGATTGATCAACGGTCATTTTCACCAGTAGTA ACGAAGGTGGTTGATTATCAAGCTAAGATTGTGCCACGGACAACACG TGATGGTGTGTTTAGTGGCGCACCCTATGGGGAAGTGAATGCTAAGC TAGTTAACATGGCAACTGCGTATCAAAATCAAGTTGTCCATGCGACA GGGGAATATACGAATGCTTCAGGGATCACATGGAGTCAGTTCGCGTT AAGCGGGCAAGAAGACAAGCTATGGATTGATAAGCGTGCTTTGCAAG CTTAA Asrprotein MKQQETVTRKKLYKSGKVWVAAATAFAVLGVSTVTTVHADTNSNV 2 (L.citreum) AVKQINNTGTNDSGEKKVPVPSTNNDSLKQGTDGFWYDSDGNRVDQKT NQILLTAEQLKKNNEKNLSVISDDTSKKDDENISKQTKIANQQTVDTAKG LTTSNLSDPITGGHYENHNGYFVYIDASGKQVTGLQNIDGNLQYFDDNG YQVKGSFRDVNGKHIYFDSVTGKASSNVDIVNGKAQGYDAQGNQLKKS YVADSSGQTYYFDGNGQPLIGLQTIDGNLQYFNQQGVQIKGGFQDVNNK RIYFAPNTGNAVANTEIINGKLQGRDANGNQVKNAFSKDVAGNTFYFDA NGVMLTGLQTISGKTYYLDEQGHLRKNYAGTFNNQFMYFDADTGAGKT AIEYQFDQGLVSQSNENTPHNAAKSYDKSSFENVDGYLTADTWYRPTDI LKNGDTWTASTETDMRPLLMTWWPDKQTQANYLNFMSSKGLGITTTYT AATSQKTLNDAAFVIQTAIEQQISLKKSTEWLRDAIDSFVKTQANWNKQ TEDEAFDGLQWLQGGFLAYQDDSHRTPNTDSGNNRKLGRQPINIDGSKD TTDGKGSEFLLANDIDNSNPIVQAEQLNWLHYLMNFGSITGNNDNANFD GIRVDAVDNVDADLLKIAGDYFKALYGTDKSDANANKHLSILEDWNGK DPQYVNQQGNAQLTMDYTVTSQFGNSLTHGANNRSNMWYFLDTGYYL NGDLNKKIVDKNRPNSGTLVNRIANSGDTKVIPNYSFVRAHDYDAQDPI RKAMIDHGIIKNMQDTFTFDQLAQGMEFYYKDQENPSGFKKYNDYNLPS AYAMLLTNKDTVPRVYYGDMYLEGGQYMEKGTIYNPVISALLKARIKY VSGGQTMATDSSGKDLKDGETDLLTSVRFGKGIMTSDQTTTQDNSQDY KNRGIGVIVGNNPDLKLNNDKTITLHMGKAHKNQLYRALVLSNDSGIDV YDSDDKAPTLRTNDNGDLIFHKTNTFVKQDGTIINYEMKGSLNALISGYL GVWVPVGASDSQDARTVATESSSSNDGSVFHSNAALDSNVIYEGFSNFQ AMPTSPEQSTNVVIATKANLFKELGITSFELAPQYRSSGDTNYGGMSFLD SFLNNGYAFTDRYDLGFNKADGNPNPTKYGTDQDLRNAIEALHKNGMQ AIADWVPDQIYALPGKEVVTATRVDERGNQLKDTDFVNLLYVANTKSS GVDYQAKYGGEFLDKLREEYPSLFKQNQVSTGQPIDASTKIKQWSAKY MNGTNILHRGAYYVLKDWATNQYFNIAKTNEVFLPLQLQNKDAQTGFIS DASGVKYYSISGYQAKDTFIEDGNGNWYYFDKDGYMVRSQQGENPIRT VETSVNTRNGNYYFMPNGVELRKGFGTDNSGNVYYFDDQGKMVRDKY INDDANNFYHLNVDGTMSRGLFKFDSDTLQYFASNGVQIKDSYAKDSKG NKYYFDSATGNNDTGKAQTWDGNGYYITIDSDANNTIGVNTDYTAYITS SLREDGLFANAPYGVVTKDQNGNDLKWQYINHTKQYEGQQVQVTRQY TDSKGVSWNLITFAGGDLQGQRLWVDSRALTMTPFKTMNQISFISYANR NDGLFLNAPYQVKGYQLAGMSNQYKGQQVTIAGVANVSGKDWSLISFN GTQYWIDSQALNTNFTHDMNQKVFVNTTSNLDGLFLNAPYRQPGYKLA GLAKNYNNQTVTVSQQYFDDQGTVWSQVVLGGQTVWVDNHALAQMQ VSDTDQQLYVNSNGRNDGLFLNAPYRGQGSQLIGMTADYNGQHVQVT KQGQDAYGAQWRLITLNNQQVWVDSRALSTTIMQAMNDNMYVNSSQR TDGLWLNAPYTMSGAKWAGDTRSANGRYVHISKAYSNEVGNTYYLTN LNGQSTWIDKRAFTVTFDQVVALNATIVARQRPDGMFKTAPYGEAGAQ FVDYVTNYNQQTVPVTKQHSDAQGNQWYLATVNGTQYWIDQRSFSPV VTKVVDYQAKIVPRTTRDGVFSGAPYGEVNAKLVNMATAYQNQVVHA TGEYTNASGITWSQFALSGQEDKLWIDKRALQA Asrmature DTNSNVAVKQINNTGTNDSGEKKVPVPSTNNDSLKQGTDGFWYDSDGN 3 protein(L. RVDQKTNQILLTAEQLKKNNEKNLSVISDDTSKKDDENISKQTKIANQQT citreum) VDTAKGLTTSNLSDPITGGHYENHNGYFVYIDASGKQVTGLQNIDGNLQ YFDDNGYQVKGSFRDVNGKHIYFDSVTGKASSNVDIVNGKAQGYDAQG NQLKKSYVADSSGQTYYFDGNGQPLIGLQTIDGNLQYFNQQGVQIKGGF QDVNNKRIYFAPNTGNAVANTEIINGKLQGRDANGNQVKNAFSKDVAG NTFYFDANGVMLTGLQTISGKTYYLDEQGHLRKNYAGTFNNQFMYFDA DTGAGKTAIEYQFDQGLVSQSNENTPHNAAKSYDKSSFENVDGYLTADT WYRPTDILKNGDTWTASTETDMRPLLMTWWPDKQTQANYLNFMSSKG LGITTTYTAATSQKTLNDAAFVIQTAIEQQISLKKSTEWLRDAIDSFVKTQ ANWNKQTEDEAFDGLQWLQGGFLAYQDDSHRTPNTDSGNNRKLGRQPI NIDGSKDTTDGKGSEFLLANDIDNSNPIVQAEQLNWLHYLMNFGSITGNN DNANFDGIRVDAVDNVDADLLKIAGDYFKALYGTDKSDANANKHLSIL EDWNGKDPQYVNQQGNAQLTMDYTVTSQFGNSLTHGANNRSNMWYF LDTGYYLNGDLNKKIVDKNRPNSGTLVNRIANSGDTKVIPNYSFVRAHD YDAQDPIRKAMIDHGIIKNMQDTFTFDQLAQGMEFYYKDQENPSGFKKY NDYNLPSAYAMLLTNKDTVPRVYYGDMYLEGGQYMEKGTIYNPVISAL LKARIKYVSGGQTMATDSSGKDLKDGETDLLTSVRFGKGIMTSDQTTTQ DNSQDYKNRGIGVIVGNNPDLKLNNDKTITLHMGKAHKNQLYRALVLS NDSGIDVYDSDDKAPTLRTNDNGDLIFHKTNTFVKQDGTIINYEMKGSLN ALISGYLGVWVPVGASDSQDARTVATESSSSNDGSVFHSNAALDSNVIY EGFSNFQAMPTSPEQSTNVVIATKANLFKELGITSFELAPQYRSSGDTNYG GMSFLDSFLNNGYAFTDRYDLGFNKADGNPNPTKYGTDQDLRNAIEAL HKNGMQAIADWVPDQIYALPGKEVVTATRVDERGNQLKDTDFVNLLYV ANTKSSGVDYQAKYGGEFLDKLREEYPSLFKQNQVSTGQPIDASTKIKQ WSAKYMNGTNILHRGAYYVLKDWATNQYFNIAKTNEVFLPLQLQNKD AQTGFISDASGVKYYSISGYQAKDTFIEDGNGNWYYFDKDGYMVRSQQ GENPIRTVETSVNTRNGNYYFMPNGVELRKGFGTDNSGNVYYFDDQGK MVRDKYINDDANNFYHLNVDGTMSRGLFKFDSDTLQYFASNGVQIKDS YAKDSKGNKYYFDSATGNNDTGKAQTWDGNGYYITIDSDANNTIGVNT DYTAYITSSLREDGLFANAPYGVVTKDQNGNDLKWQYINHTKQYEGQQ VQVTRQYTDSKGVSWNLITFAGGDLQGQRLWVDSRALTMTPFKTMNQI SFISYANRNDGLFLNAPYQVKGYQLAGMSNQYKGQQVTIAGVANVSGK DWSLISFNGTQYWIDSQALNTNFTHDMNQKVFVNTTSNLDGLFLNAPYR QPGYKLAGLAKNYNNQTVTVSQQYFDDQGTVWSQVVLGGQTVWVDN HALAQMQVSDTDQQLYVNSNGRNDGLFLNAPYRGQGSQLIGMTADYN GQHVQVTKQGQDAYGAQWRLITLNNQQVWVDSRALSTTIMQAMNDN MYVNSSQRTDGLWLNAPYTMSGAKWAGDTRSANGRYVHISKAYSNEV GNTYYLTNLNGQSTWIDKRAFTVTFDQVVALNATIVARQRPDGMFKTA PYGEAGAQFVDYVTNYNQQTVPVTKQHSDAQGNQWYLATVNGTQYWI DQRSFSPVVTKVVDYQAKIVPRTTRDGVFSGAPYGEVNAKLVNMATAY QNQVVHATGEYTNASGITWSQFALSGQEDKLWIDKRALQA dsrIcoding ATGAGAAATAGAAATGCAACAAGCGTTTTCCGGAAAAAGATGTATAA 4 sequence(L. ATCTGGGAAAATGTTAGTCATTGCAGGGAGTGTTTCAATAATTGGTGT mesenteroides) TACCAGTTTTATTCAACAAGCACAAGCTGATGTTTCACAAAACAATGG GGTAGTAGTGGCCACGGCAGTCGATCAATCGAATTTGGATGCGACTA CGTCTGACAAATCAATCACAACAGATGATAAAGCTGCAACAACAGCA GCTACATCAACAGATGATAAGGCTACAACAACAGTAGCTACATCAAC AGATGATAAGGATACAACAACAGCAGCTACATCAACAGATGATAAG GCTACAACAACAGTAGCTACATCAACAGATGATAAGGCTACAACAAC AGCAGCTACATCAACAGATGATAAAGCTGCAACAACAGCAGCTACAT CAACGGATGATAAAGCTGCAACAACAGCAGCTACATCAACGGATGAT AAAGCTGCAACAACAGCAGATACATCAACAGATGATAAAGCTGCAAC AACAGCAGCTACATCAACAGATGATAAGGCTACAACAACAGCAGCTA CATCAACAGATGATAAAACAGCAACAACAGTCGGCACATCTGATAAT AACAATTCAGCTACAGCGAGCGATAAAGATGTAAGTTCATCGGCACA AAAAAGTCAAACGATTGATAACAATTCGAAGACGGCCGATACTACTG CAGCATTAGAAGCTAGTTCAAAGAATCTGAAAACGATTGATGGCAAA ACATATTATTACGACGATGATGATCAAGTAAAAAAGAACTTTGCTAC CGTAATTGATGGTAAGGTACTTTATTTTGATAAAGAGACTGGCGCATT AGCTGATACAAATGACTATCAATTTTTAGAAGGATTGACTAGTGAAA ATAATACTTATACGGAGCATAATGCCTCAGTTGGTACATCTTCTGATA GTTATACAAACGTTGACGGGTACCTAACAGCCGACAGTTGGTACAGG CCTAAGGACATATTAGTCAACGGTCAAAACTGGGAATCATCAAAGGA TGACGATTTACGACCATTGTTAATGACTTGGTGGCCAGATAAGGCAA CACAAGTAAACTATTTGAATGCGATGAAGTATTTAGATGCCACTGAA ACGGAAACTGTTTATACTTCAGATGACAGTCAAGACGCTTTGAACAA AGCAGCACAGAACATTCAAGTGAAAATTGAAGAAAAAATTAGTCAA GAAGGCCAAACACAATGGCTAAAGGATGATATTTCAAAATTTGTTGA TAGCCAATCAAATTGGAATATTGCTAGTGAATCAAAAGGAACTGATC ATTTGCAAGGTGGTGCATTGTTGTATGTCAATAGTGATAAAACACCAG ATGCCAATTCTGATTATCGATTACTTAATCGCACACCAACAAATCAAA CAGGCACGCCTTTGTATACGACAGATCCAACTCAAGGTGGTTATGACT TCCTCTTGGCCAATGATGTGGATAATTCAAACCCAGTTGTTCAAGCAG AACAACTAAATTGGATGTATTACTTGTTAAACTTTGGATCAATTACTA ATAACGATGCAGATGCTAACTTTGATAGTATTCGAGTAGATGCTGTTG ATAACGTTGATGCCGACTTATTGCAAATTGCAGCTGATTATTTCAAGG CAGCATATGGCGTCGATAAGAGTGATGCAATTTCGAATCAACATGTTT CCATTCTTGAAGATTGGAGTGACAATGATGCTGAATATGTGAAAGAC AATGGCGACAATCAATTGTCAATGGATAATAAATTGCGTTTGTCATTA AAATACTCACTCACTATGCCAGCAGTCGATCAATATGGTAATAAAAG AAGTGGATTAGAACCTTTTTTGACAAATAGTTTAGTTGATCGTACAAA TGATTCGACAGATAATACCGCACAACCAAATTATTCTTTTGTTCGTGC ACATGATAGTGAAGTACAAACAGTTATTGCTGAAATTATTAAACAAA GAATTGATCCGGATTCTGATGGCTTATCACCAACGATGGACCAATTAA CAGAAGCGTTTAAAATTTATAATGCTGATCAGTTGAAAACGGATAAA GAATTCACACAATATAACATTCCAAGTACTTATGCCACAATACTAACG AATAAAGATACAGTGCCACGTGTGTACTATGGTGATATGTATACAGA TGATGGTCAATACATGGCAACAAAGTCACTTTATTACGATGCAATTGA TACTTTGCTGAAGTCTCGTATCAAGTATGTTTCTGGCGGGCAAACAAT GTCTATGAAATATATGCAAGGTGATAGTAGTATGGCTGCTGACAGTT ATAGAGGCATTTTGACATCAGTTCGTTATGGTAATGGTGCCATGACTG CTACCGATGCAGGGACAAATGAAACACGTACGCAAGGTATTGCAGTA ATTGAAAGTAATAACCCAGATTTGAAGTTGAGCAGTACAGATCAAGT AGTTGTAGATATGGGCATAGCGCACAAAAACCAGGCTTATCGTCCTG CTTTGTTAACAACTAAAGATGGCATAGATACTTATGTATCTGATAGTG ATGTCTCACAAAGCTTAATAAGATATACAAATAGTAATGGGCAACTT ATTTTCAATAGTTCAGATATTGTTGGTACAGCAAATCCACAAGTTTCT GGATACTTGGCTGTCTGGGTACCCGTTGGTGCTTCAGATACTCAAGAT GCGCGAACTGAAAGTAGTACAGCAACAACTGCTGATGGACAAACATT ACATTCAAATGCCGCACTTGATTCTCAAGTTATTTATGAAAGTTTCTC TAACTTCCAATCTACACCAACAACAGAAGCTGAATATGCTAATGTGC AAATTGCAAACAATACTGATTTATACAAGAGTTGGGGAATTACGAAC TTCGAGTTTCCACCACAATATCGTTCAAGTACGGATAGTAGTTTCTTA GATTCAATTATTCAAAATGGTTATGCATTTACTGATCGTTATGATCTT GGATTCAATACACCAACGAAGTATGGTACTGTAGATCAACTCCGTAC AGCTATTAAAGCTTTGCATGCGACAGGTATCAAGGCAATGGCAGATT GGGTACCAGACCAGATTTATAATTTGACAGGTAAAGAAGTGGTTGCG GTACAACGTGTCAACAACTCAGGAATCTATAATCAAGATTCTGTAATT AATAAAACATTATATGCTTCACAAACCGTTGGTGGCGGAGAATATCA GGCACTATATGGTGGAGAGTTCCTTGATGAAATCAAGAAATTGTACC CTTCTCTATTCGAAAAAAATCAAATTTCAACCGGCGTACCAATGGATG CTAGTGAAAAGATAAAAGAATGGTCCGCTAAGTACTTTAACGGTACT AACATTCAAGGTCGTGGTGCTTACTATGTCCTTAAGGACTGGGCTACA AATGAGTACTTCAAGGTAAGCACTTCAAGCAACAGCAGTGTATTTTTG CCAAAGCAGTTGACGAATGAAGAATCAAACACTGGATTTATTTCAAC TGATGGTGGGATGACATATTATTCTACAAGTGGATACCAGGCAAAAG ATACATTCATCCAAGATGACAAATCTAATTGGTATTACTTTGACAAGA ATGGTTATATGACATATGGTTTCCAGACAGTCAATGATAATAATTATT ACTTCTTGCCTAATGGTATTGAATTACAAGATGCTATCTTAGAAGATA GTAAAGGAAATGTTTATTATTTCAATCAATATGGCAAACAAGCTGTTG ATGGATACTACATGTTGGCTAATAAAACTTGGCGTTACTTTGACAAAA ATGGTGTTATGGCTAATGCTGGCTTAACAACCGTGACTGTTGATGGGC AGGAGCATATCCAATACTTTGATAAGAACGGTATTCAGGTCAAAGGG ACTTCCGTGAAAGATGCAGACGGAAAGCTACGCTACTTTGACACTGA TTCTGGTGATATGGTGACGAACCGCTTTGGTGAAAACACAGATGGTA CATGGTCATACTTTGGTGCTGACGGTATCGCTGTAACTGGTGCACAGA CAATTAGTGGGCAAAAATTGTTCTTTGATGCCGACGGACAACAGATT AAAGGTAAGGAAGCGACTGATAAAAAAGGCAAAGTGCATTATTATG ATGCTAATTCTGGTGAAATGATCACTAATCGTTTTGAAAAGTTATCAG ATGGATCATGGGCGTACTTTAATAAAAAAGGTAACATCGTAACCGGC GCACAAGTCATTAATGGTCAACATTTGTTCTTTGAAAGCAACGGTAAC CAAGTTAAGGGTCGTGAATACACGGCTACTGATGGGAAGATGCGCTA CTATGATGCAGATTCTGGTGATATGGTGACGAATCGCTTTGAACGAAT ATCAGACGGATCATGGGCATATTTTGGTGCTAATGGTGTTGCTGTAAC TGGGGAACAAAATATAAATGGACAACAACTGTATTTTGATGCCAATG GTCATCAAGTTAAGGGAGCCGCAGTAACACAAGCTGACGGTAGCCAA AAATATTATGACGCAAATTCTGGAGAGATGATTAAAAGCTAA DsrIprotein MRNRNATSVFRKKMYKSGKMLVIAGSVSIIGVTSFIQQAQADVSQNN 5 (L. GVVVATAVDQSNLDATTSDKSITTDDKAATTAATSTDDKATTTVATSTD mesenteroides) DKDTTTAATSTDDKATTTVATSTDDKATTTAATSTDDKAATTAATSTDD KAATTAATSTDDKAATTADTSTDDKAATTAATSTDDKATTTAATSTDD KTATTVGTSDNNNSATASDKDVSSSAQKSQTIDNNSKTADTTAALEASS KNLKTIDGKTYYYDDDDQVKKNFATVIDGKVLYFDKETGALADTNDYQ FLEGLTSENNTYTEHNASVGTSSDSYTNVDGYLTADSWYRPKDILVNGQ NWESSKDDDLRPLLMTWWPDKATQVNYLNAMKYLDATETETVYTSDD SQDALNKAAQNIQVKIEEKISQEGQTQWLKDDISKFVDSQSNWNIASESK GTDHLQGGALLYVNSDKTPDANSDYRLLNRTPTNQTGTPLYTTDPTQGG YDFLLANDVDNSNPVVQAEQLNWMYYLLNFGSITNNDADANFDSIRVD AVDNVDADLLQIAADYFKAAYGVDKSDAISNQHVSILEDWSDNDAEYV KDNGDNQLSMDNKLRLSLKYSLTMPAVDQYGNKRSGLEPFLTNSLVDR TNDSTDNTAQPNYSFVRAHDSEVQTVIAEIIKQRIDPDSDGLSPTMDQLTE AFKIYNADQLKTDKEFTQYNIPSTYATILTNKDTVPRVYYGDMYTDDGQ YMATKSLYYDAIDTLLKSRIKYVSGGQTMSMKYMQGDSSMAADSYRGI LTSVRYGNGAMTATDAGTNETRTQGIAVIESNNPDLKLSSTDQVVVDMG IAHKNQAYRPALLTTKDGIDTYVSDSDVSQSLIRYTNSNGQLIFNSSDIVG TANPQVSGYLAVWVPVGASDTQDARTESSTATTADGQTLHSNAALDSQ VIYESFSNFQSTPTTEAEYANVQIANNTDLYKSWGITNFEFPPQYRSSTDS SFLDSIIQNGYAFTDRYDLGFNTPTKYGTVDQLRTAIKALHATGIKAMAD WVPDQIYNLTGKEVVAVQRVNNSGIYNQDSVINKTLYASQTVGGGEYQ ALYGGEFLDEIKKLYPSLFEKNQISTGVPMDASEKIKEWSAKYFNGTNIQ GRGAYYVLKDWATNEYFKVSTSSNSSVFLPKQLTNEESNTGFISTDGGM TYYSTSGYQAKDTFIQDDKSNWYYFDKNGYMTYGFQTVNDNNYYFLPN GIELQDAILEDSKGNVYYFNQYGKQAVDGYYMLANKTWRYFDKNGVM ANAGLTTVTVDGQEHIQYFDKNGIQVKGTSVKDADGKLRYFDTDSGDM VTNRFGENTDGTWSYFGADGIAVTGAQTISGQKLFFDADGQQIKGKEAT DKKGKVHYYDANSGEMITNRFEKLSDGSWAYFNKKGNIVTGAQVINGQ HLFFESNGNQVKGREYTATDGKMRYYDADSGDMVTNRFERISDGSWAY FGANGVAVTGEQNINGQQLYFDANGHQVKGAAVTQADGSQKYYDANS GEMIKS DsrImature DVSQNNGVVVATAVDQSNLDATTSDKSITTDDKAATTAATSTDDKATT 6 protein(L. TVATSTDDKDTTTAATSTDDKATTTVATSTDDKATTTAATSTDDKAATT mesenteroides) AATSTDDKAATTAATSTDDKAATTADTSTDDKAATTAATSTDDKATTT AATSTDDKTATTVGTSDNNNSATASDKDVSSSAQKSQTIDNNSKTADTT AALEASSKNLKTIDGKTYYYDDDDQVKKNFATVIDGKVLYFDKETGAL ADTNDYQFLEGLTSENNTYTEHNASVGTSSDSYTNVDGYLTADSWYRP KDILVNGQNWESSKDDDLRPLLMTWWPDKATQVNYLNAMKYLDATET ETVYTSDDSQDALNKAAQNIQVKIEEKISQEGQTQWLKDDISKFVDSQSN WNIASESKGTDHLQGGALLYVNSDKTPDANSDYRLLNRTPTNQTGTPLY TTDPTQGGYDFLLANDVDNSNPVVQAEQLNWMYYLLNFGSITNNDADA NFDSIRVDAVDNVDADLLQIAADYFKAAYGVDKSDAISNQHVSILEDWS DNDAEYVKDNGDNQLSMDNKLRLSLKYSLTMPAVDQYGNKRSGLEPFL TNSLVDRTNDSTDNTAQPNYSFVRAHDSEVQTVIAEIIKQRIDPDSDGLSP TMDQLTEAFKIYNADQLKTDKEFTQYNIPSTYATILTNKDTVPRVYYGD MYTDDGQYMATKSLYYDAIDTLLKSRIKYVSGGQTMSMKYMQGDSSM AADSYRGILTSVRYGNGAMTATDAGTNETRTQGIAVIESNNPDLKLSSTD QVVVDMGIAHKNQAYRPALLTTKDGIDTYVSDSDVSQSLIRYTNSNGQL IFNSSDIVGTANPQVSGYLAVWVPVGASDTQDARTESSTATTADGQTLH SNAALDSQVIYESFSNFQSTPTTEAEYANVQIANNTDLYKSWGITNFEFPP QYRSSTDSSFLDSIIQNGYAFTDRYDLGFNTPTKYGTVDQLRTAIKALHA TGIKAMADWVPDQIYNLTGKEVVAVQRVNNSGIYNQDSVINKTLYASQ TVGGGEYQALYGGEFLDEIKKLYPSLFEKNQISTGVPMDASEKIKEWSAK YFNGTNIQGRGAYYVLKDWATNEYFKVSTSSNSSVFLPKQLTNEESNTG FISTDGGMTYYSTSGYQAKDTFIQDDKSNWYYFDKNGYMTYGFQTVND NNYYFLPNGIELQDAILEDSKGNVYYFNQYGKQAVDGYYMLANKTWR YFDKNGVMANAGLTTVTVDGQEHIQYFDKNGIQVKGTSVKDADGKLR YFDTDSGDMVTNRFGENTDGTWSYFGADGIAVTGAQTISGQKLFFDAD GQQIKGKEATDKKGKVHYYDANSGEMITNRFEKLSDGSWAYFNKKGNI VTGAQVINGQHLFFESNGNQVKGREYTATDGKMRYYDADSGDMVTNR FERISDGSWAYFGANGVAVTGEQNINGQQLYFDANGHQVKGAAVTQAD GSQKYYDANSGEMIKS dsrScoding ATGCCATTTACAGAAAAAGTAATGCGGAAAAAGCTTTATAAAGTTGG 7 sequence(L. GAAAAGTTGGGTAGTTGGTGGGGTTTGTGCTTTTGCATTAACCGCCTC mesenteroides) ATTTGCTTTAGCAACACCAAGTGTTTTGGGAGACAGTAGTGTACCTGA TGTGAGTGCGAATAACGTTCAATCTGCTTCAGATAATACAACGGATA CGCAGCAGAACACTACGGTTACCGAAGAAAATGATAAAGTACAGTCT GCAGCTACTAATGATAATGTAACAACAGCTGCAAGCGACACAACGCA ATCTGCTGATAATAATGTGACAGAAAAACAGTCAGATGATCATGCAC TTGATAATGAAAAAGTCGATAACAAACAAGATGAAGTCGCTCAAACC AATGTTACTAGCAAAGATGAGGAATCAGCAGTTGCTTCAACTGACAC TGATCCTGCTGAAACGACAACTGACGAAACACAACAAGTTAGCGGCA AGTACGTTGAAAAAGACGGTAGTTGGTATTATTATTTTGATGATGGCA AAAATGCTAAAGGTTTATCAACGATAGACAACAATATTCAATATTTTG ACGAGAGTGGTAAACAAGTCAAAGGACAGTATGTCACAATTGATAAT CAAACATATTATTTTGATAAGGACTCAGGTGATGAGTTAACTGGTCTG CAAAGCATTGATGGGAACATAGTTGCTTTTAACGATGAAGGGCAACA AATTTTTAATCAATATTACCAATCTGAAAATGGTACAACATACTATTT TGATGATAAAGGACATGCTGCTACCGGTATTAAGAATATCGAAGGCA AAAATTATTATTTTGATAATCTTGGGCAACTAAAAAAAGGCTTCTCTG GTGTGATTGATGGTCAAATAATGACATTTGATCAGGAAACAGGGCAA GAAGTTTCTAACACAACTTCTGAAATAAAAGAAGGTTTGACGACACA AAACACGGATTATAGCGAACATAATGCAGCCCACGGTACGGATGCTG AGGACTTTGAAAATATTGACGGCTATTTAACAGCTAGTTCATGGTATC GTCCAACAGATATTTTACGTAACGGAACAGACTGGGAACCTTCTACA GATACAGATTTCAGACCAATATTGTCAGTGTGGTGGCCAGATAAGAA CACCCAGGTCAACTATTTAAATTACATGGCTGATTTAGGGTTTATCAG TAATGCGGACAGTTTTGAAACTGGGGATAGCCAAAGCTTATTAAATG AAGCAAGTAACTATGTTCAAAAATCAATTGAAATGAAAATTAGTGCG CAACAAAGTACAGAGTGGTTAAAGGATGCAATGGCGGCCTTCATTGT CACGCAACCACAGTGGAATGAAACTAGTGAAGATATGAGCAATGACC ATTTACAAAATGGCGCATTAACTTATGTCAACAGTCCACTGACACCTG ATGCTAATTCAAACTTTAGACTACTTAATCGGACACCAACAAACCAG ACTGGTGAACAAGCGTATAATTTAGATAATTCAAAAGGTGGTTTTGA ATTGTTGTTAGCCAATGACGTTGATAATTCAAACCCTGTAGTACAAGC AGAACAATTGAATTGGTTATATTATTTAATGAATTTTGGTACGATTAC GGCCAACGACGCGGATGCTAATTTTGATGGTATTCGTGTAGATGCAGT CGACAATGTGGATGCTGATTTGTTACAAATTGCTGCCGATTATTTCAA ACTAGCTTACGGTGTTGATCAAAATGATGCTACTGCTAATCAGCATCT TTCAATTTTGGAAGATTGGAGTCACAATGATCCTTTGTATGTAACAGA TCAAGGAAGCAATCAATTAACCATGGATGATTATGTGCACACACAAT TAATCTGGTCTCTAACAAAATCATCTGACATACGAGGTACAATGCAG CGCTTCGTGGATTATTATATGGTTGATCGATCTAATGATAGTACAGAA AACGAAGCCATTCCTAATTACAGCTTTGTACGTGCACACGACAGCGA AGTGCAAACGGTTATTGCCCAAATTGTTTCCGATTTGTATCCTGATGT TGAAAATAGTTTAGCACCAACAACAGAACAATTGGCAGCTGCTTTCA AAGTATACAATGAAGATGAAAAATTAGCAGACAAAAAGTACACACA ATATAATATGGCTAGTGCTTATGCGATGTTGCTAACCAATAAGGATAC TGTTCCTCGTGTCTATTATGGCGATTTATATACAGATGATGGTCAATA TATGGCAACAAAGTCACCATACTATGATGCGATTAACACTTTGCTGAA GGCTAGAGTCCAATATGTTGCTGGTGGCCAATCGATGTCCGTTGATAG TAATGACGTGTTAACAAGTGTTCGCTATGGTAAAGATGCCATGACGG CTTCTGACACTGGAACATCTGAGACGCGTACTGAAGGTGTTGGGGTC ATCGTCAGCAACAACGCGGAACTACAATTAGAGGATGGGCATACAGT CACATTGCACATGGGGGCAGCTCATAAGAACCAAGCTTATCGTGCTTT GTTATCAACAACTGCAGATGGATTAGCTTATTATGATACTGATGAAAA TGCACCTGTGGCGTACACAGATGCTAACGGCGATTTGATTTTTACGAA TGAATCAATTTATGGTGTACAAAATCCACAAGTTTCTGGTTACTTGGC AGTTTGGGTTCCGGTAGGTGCGCAACAAGATCAAGATGCACGAACGG CCTCTGATACAACAACAAACACGAGTGATAAAGTGTTCCATTCAAAC GCTGCTCTTGATTCTCAAGTCATCTACGAAGGTTTCTCAAACTTCCAA GCATTTGCTACAGACAGCAGTGAATATACAAACGTAGTCATCGCTCA GAATGCGGACCAATTTAAGCAATGGGGTGTGACAAGCTTCCAATTGG CACCACAATATCGTTCAAGTACAGATACAAGTTTCTTGGATTCAATTA TTCAAAACGGGTATGCATTCACGGATCGTTATGACTTAGGTTATGGCA CACCGACAAAATATGGAACTGCTGATCAGTTGCGCGATGCTATTAAA GCCTTACATGCTAGCGGTATTCAAGCCATTGCCGATTGGGTGCCGGAC CAAATTTATAATTTGCCAGAGCAAGAATTAGCTACTGTCACAAGAAC AAATTCATTTGGAGATGACGATACAGATTCTGATATTGACAATGCCTT ATATGTTGTACAAAGTCGTGGGGGTGGTCAATATCAAGAGATGTATG GTGGTGCCTTCTTAGAAGAGTTACAGGCACTCTATCCATCCCTATTTA AAGTGAATCAAATCTCAACTGGCGTTCCAATTGATGGCAGTGTAAAG ATTACTGAGTGGGCGGCTAAGTACTTCAATGGCTCTAACATCCAAGGT AAAGGTGCTGGATACGTATTGAAAGATATGGGTTCTAATAAGTATTTT AAGGTCGTTTCGAACACTGAAGATGGTGACTACTTACCAAAACAGTT AACTAATGATCTGTCAGAAACTGGCTTTACACACGATGATAAAGGAA TCATCTATTATACATTAAGTGGTTATCGTGCCCAAAATGCATTTATTC AAGATGATGATAATAACTATTACTATTTTGATAAAACAGGTCATTTAG TAACAGGTTTGCAAAAGATTAATAACCATACCTACTTCTTCTTACCTA ATGGTATCGAACTGGTCAAGAGCTTCTTACAAAACGAAGATGGTACA ATTGTTTATTTCGATAAGAAAGGTCATCAAGTTTTTGACCAATATATA ACTGATCAAAATGGAAATGCGTATTACTTTGATGATGCTGGTGTAATG CTTAAATCAGGGCTTGCAACGATTGATGGACATCAACAGTATTTTGAT CAAAATGGTGTGCAGGTTAAGGATAAGTTTGTGATTGGCACTGATGG TTATAAGTATTACTTTGAACCAGGTAGTGGTAACTTAGCTATCCTACG TTATGTGCAAAACAGTAAGAATCAATGGTTCTATTTTGATGGTAATGG CCATGCTGTCACTGGTTTCCAAACAATTAATGGTAAGAAACAATATTT CTATAATGATGGTCATCAAAGTAAAGGTGAATTCATTGATGCAGACG GTGATACTTTCTATACGAGTGCCACTGATGGTCGCCTAGTAACTGGTG TTCAGAAGATTAATGGTATTACCTATGCTTTTGATAACACAGGAAATT TGATCACAAATCAGTATTATCAATTAGCAGATGGTAAATATATGTTGT TAGATGATAGTGGTCGTGCGAAAACAGGGTTTGTATTGCAAGATGGT GTACTAAGATACTTCGATCAAAACGGTGAGCAAGTGAAAGATGCTAT CATTGTGGATCCAGATACTAACTTGAGTTATTATTTCAATGCAACACA AGGTGTCGCTGTAAAAAATGATTATTTCGAGTATCAAGGTAATTGGTA TTTAACAGATGCTAATTATCAACTTATCAAAGGTTTTAAAGCAGTTGA CGACAGCTTACAACATTTTGATGAAGTCACTGGTGTACAAACAAAAG ATAGTGCTTTAATAAGTGCTCAGGGTAAGGTTTACCAATTTGATAATA ATGGAAATGCTGTGTCAGCATAA DsrSprotein MPFTEKVMRKKLYKVGKSWVVGGVCAFALTASFALATPSVLGDSSV 8 (L. PDVSANNVQSASDNTTDTQQNTTVTEENDKVQSAATNDNVTTAASDTT mesenteroides) QSADNNVTEKQSDDHALDNEKVDNKQDEVAQTNVTSKDEESAVASTDT DPAETTTDETQQVSGKYVEKDGSWYYYFDDGKNAKGLSTIDNNIQYFD ESGKQVKGQYVTIDNQTYYFDKDSGDELTGLQSIDGNIVAFNDEGQQIFN QYYQSENGTTYYFDDKGHAATGIKNIEGKNYYFDNLGQLKKGFSGVIDG QIMTFDQETGQEVSNTTSEIKEGLTTQNTDYSEHNAAHGTDAEDFENIDG YLTASSWYRPTDILRNGTDWEPSTDTDFRPILSVWWPDKNTQVNYLNY MADLGFISNADSFETGDSQSLLNEASNYVQKSIEMKISAQQSTEWLKDA MAAFIVTQPQWNETSEDMSNDHLQNGALTYVNSPLTPDANSNFRLLNRT PTNQTGEQAYNLDNSKGGFELLLANDVDNSNPVVQAEQLNWLYYLMN FGTITANDADANFDGIRVDAVDNVDADLLQIAADYFKLAYGVDQNDAT ANQHLSILEDWSHNDPLYVTDQGSNQLTMDDYVHTQLIWSLTKSSDIRG TMQRFVDYYMVDRSNDSTENEAIPNYSFVRAHDSEVQTVIAQIVSDLYP DVENSLAPTTEQLAAAFKVYNEDEKLADKKYTQYNMASAYAMLLTNK DTVPRVYYGDLYTDDGQYMATKSPYYDAINTLLKARVQYVAGGQSMS VDSNDVLTSVRYGKDAMTASDTGTSETRTEGVGVIVSNNAELQLEDGHT VTLHMGAAHKNQAYRALLSTTADGLAYYDTDENAPVAYTDANGDLIFT NESIYGVQNPQVSGYLAVWVPVGAQQDQDARTASDTTTNTSDKVFHSN AALDSQVIYEGFSNFQAFATDSSEYTNVVIAQNADQFKQWGVTSFQLAP QYRSSTDTSFLDSIIQNGYAFTDRYDLGYGTPTKYGTADQLRDAIKALHA SGIQAIADWVPDQIYNLPEQELATVTRTNSFGDDDTDSDIDNALYVVQSR GGGQYQEMYGGAFLEELQALYPSLFKVNQISTGVPIDGSVKITEWAAKY FNGSNIQGKGAGYVLKDMGSNKYFKVVSNTEDGDYLPKQLTNDLSETG FTHDDKGIIYYTLSGYRAQNAFIQDDDNNYYYFDKTGHLVTGLQKINNH TYFFLPNGIELVKSFLQNEDGTIVYFDKKGHQVFDQYITDQNGNAYYFD DAGVMLKSGLATIDGHQQYFDQNGVQVKDKFVIGTDGYKYYFEPGSGN LAILRYVQNSKNQWFYFDGNGHAVTGFQTINGKKQYFYNDGHQSKGEFI DADGDTFYTSATDGRLVTGVQKINGITYAFDNTGNLITNQYYQLADGKY MLLDDSGRAKTGFVLQDGVLRYFDQNGEQVKDAIIVDPDTNLSYYFNAT QGVAVKNDYFEYQGNWYLTDANYQLIKGFKAVDDSLQHFDEVTGVQT KDSALISAQGKVYQFDNNGNAVSA DsrSmature DSSVPDVSANNVQSASDNTTDTQQNTTVTEENDKVQSAATNDNVTTAA 9 protein(L. SDTTQSADNNVTEKQSDDHALDNEKVDNKQDEVAQTNVTSKDEESAVA mesenteroides) STDTDPAETTTDETQQVSGKYVEKDGSWYYYFDDGKNAKGLSTIDNNIQ YFDESGKQVKGQYVTIDNQTYYFDKDSGDELTGLQSIDGNIVAFNDEGQ QIFNQYYQSENGTTYYFDDKGHAATGIKNIEGKNYYFDNLGQLKKGFSG VIDGQIMTFDQETGQEVSNTTSEIKEGLTTQNTDYSEHNAAHGTDAEDFE NIDGYLTASSWYRPTDILRNGTDWEPSTDTDFRPILSVWWPDKNTQVNY LNYMADLGFISNADSFETGDSQSLLNEASNYVQKSIEMKISAQQSTEWLK DAMAAFIVTQPQWNETSEDMSNDHLQNGALTYVNSPLTPDANSNFRLL NRTPTNQTGEQAYNLDNSKGGFELLLANDVDNSNPVVQAEQLNWLYYL MNFGTITANDADANFDGIRVDAVDNVDADLLQIAADYFKLAYGVDQND ATANQHLSILEDWSHNDPLYVTDQGSNQLTMDDYVHTQLIWSLTKSSDI RGTMQRFVDYYMVDRSNDSTENEAIPNYSFVRAHDSEVQTVIAQIVSDL YPDVENSLAPTTEQLAAAFKVYNEDEKLADKKYTQYNMASAYAMLLT NKDTVPRVYYGDLYTDDGQYMATKSPYYDAINTLLKARVQYVAGGQS MSVDSNDVLTSVRYGKDAMTASDTGTSETRTEGVGVIVSNNAELQLED GHTVTLHMGAAHKNQAYRALLSTTADGLAYYDTDENAPVAYTDANGD LIFTNESIYGVQNPQVSGYLAVWVPVGAQQDQDARTASDTTTNTSDKVF HSNAALDSQVIYEGFSNFQAFATDSSEYTNVVIAQNADQFKQWGVTSFQ LAPQYRSSTDTSFLDSIIQNGYAFTDRYDLGYGTPTKYGTADQLRDAIKA LHASGIQAIADWVPDQIYNLPEQELATVTRTNSFGDDDTDSDIDNALYVV QSRGGGQYQEMYGGAFLEELQALYPSLFKVNQISTGVPIDGSVKITEWA AKYFNGSNIQGKGAGYVLKDMGSNKYFKVVSNTEDGDYLPKQLTNDLS ETGFTHDDKGIIYYTLSGYRAQNAFIQDDDNNYYYFDKTGHLVTGLQKI NNHTYFFLPNGIELVKSFLQNEDGTIVYFDKKGHQVFDQYITDQNGNAY YFDDAGVMLKSGLATIDGHQQYFDQNGVQVKDKFVIGTDGYKYYFEPG SGNLAILRYVQNSKNQWFYFDGNGHAVTGFQTINGKKQYFYNDGHQSK GEFIDADGDTFYTSATDGRLVTGVQKINGITYAFDNTGNLITNQYYQLAD GKYMLLDDSGRAKTGFVLQDGVLRYFDQNGEQVKDAIIVDPDTNLSYY FNATQGVAVKNDYFEYQGNWYLTDANYQLIKGFKAVDDSLQHFDEVT GVQTKDSALISAQGKVYQFDNNGNAVSA gtfIcoding ATGGAGAAGAATGTACGTTTTAAGATGCATAAGGTGAAAAAGAGATG 10 sequence(S. GGTAACCCTCTCTGTCGCATCTGCCACCATGTTGGCATCAGCCCTTGG sobrinus) TGCTTCAGTAGCTAGTGCGGATACAGACACTGCTAGTGATGATAGCA ACCAAACCGTGGTAACTGGTGACCAGACTACTAACAATCAAGCCACT GACCAGACTTCTATTGCAGCAACAGCTACATCAGAACAGTCTGCTTCA ACTGATGCAGCAACAGATCAAGCATCAGCAGCAGAGCAAACTCAAG GAACAACAGCTAGCACAGACACGGCAGCTCAAACAACCACAAATGCT AATGAAGCTAAGTGGGTTCCGACTGAAAATGAGAACCAAGGTTTTAC AGATGAGATGTTAGCAGAAGCCAAGAATGTGGCTACTGCTGAATCTG ATTCAATTCCATCAGACTTGGCTAAAATGTCAAATGTTAAGCAGGTTG ACGGTAAATATTATTACTACGACCAAGATGGCAATGTTAAGAAAAAC TTTGCTGTCAGCGTTGGTGATAAGATTTATTACTTTGATGAAACTGGC GCTTACAAGGACACTAGCAAGGTTGATGCCGACAAGTCCAGTTCAGC TGTAAGTCAAAATGCAACAATATTTGCAGCTAATAACCGTGCCTACA GCACCTCAGCTAAAAATTTTGAAGCCGTTGATAACTACCTGACAGCTG ACTCTTGGTATCGTCCAAAATCAATCCTGAAAGACGGAAAAACTTGG ACAGAATCTGGCAAAGATGACTTCCGCCCGCTTCTCATGGCTTGGTGG CCTGATACCGAAACCAAACGTAACTACGTTAATTACATGAACAAGGT TGTTGGTATTGATAAGACCTATACCGCTGAAACCAGCCAAGCTGATTT AACGGCAGCAGCAGAATTGGTTCAAGCTCGTATTGAACAAAAAATTA CAAGTGAAAATAACACTAAGTGGCTCCGTGAGGCGATTTCTGCCTTTG TGAAAACTCAGCCGCAATGGAATGGTGAAAGCGAAAAGCCTTACGAT GATCACTTGCAAAATGGTGCTCTTCTCTTTGACAATCAAACTGATTTA ACACCAGATACGCAATCGAACTATCGTTTGCTCAATCGCACACCAACT AACCAAACTGGTTCCTTGGATTCTCGTTTCACCTATAACCCAAATGAC CCACTGGGCGGCTATGATTTCCTTTTAGCCAACGATGTTGATAATTCC AATCCAGTCGTGCAAGCGGAACAACTCAACTGGCTGCACTACCTGCT CAACTTTGGCTCTATCTATGCCAATGATGCAGATGCCAATTTTGACTC AATCCGTGTAGATGCGGTTGATAATGTTGATGCTGACCTTCTGCAAAT CTCTAGTGATTACCTTAAGGCAGCTTACGGTATCGATAAAAACAACA AAAATGCTAATAACCACGTTTCTATCGTAGAAGCATGGAGCGACAAC GATACCCCTTATCTCCATGATGATGGCGACAACCTCATGAACATGGAC AACAAGTTCCGTTTGTCCATGCTTTGGTCTTTAGCTAAGCCAACCGAT GTTCGTTCTGGTTTGAATCCTTTGATCCACAACAGTCTGGTTGACCGT GAAGTGGATGACCGTGAAGTTGAAACCGTTCCAAGTTACAGCTTTGC TCGGGCTCATGATAGTGAAGTTCAGGATATCATTCGTGATATTATTAA GGCTGAGATTAATCCAAATTCATTTGGTTATTCATTCACCCAAGAAGA AATTGATCAAGCTTTCAAGATTTACAACGAAGATCTCAAGAAGACTG ATAAAAAATACACTCACTACAATGTGCCGCTTTCTTATACCTTGCTTC TGACTAACAAGGGTTCGATTCCTCGCGTCTATTATGGAGATATGTTCA CCGATGATGGTCAATACATGGCCAACAAGACTGTGAACTACGATGCT ATCGAATCTCTGCTGAAAGCCCGTATGAAGTACGTTGCTGGTGGTCAG GCTATGCAGAATTACCAAATCGGTAATGGCGAAATCTTGACTTCTGTC CGTTATGGTAAGGGTGCCCTTAAACAAAGCGATAAGGGTGATGCGAC AACTCGTACGTCAGGTGTCGGCGTTGTTATGGGAAACCAACCCAACTT TAGCTTGGATGGAAAGGTTGTAGCCCTCAACATGGGTGCTGCCCACG CTAACCAAGAATACCGTGCTCTTATGGTATCAACTAAAGACGGTGTTG CAACCTATGCTACAGATGCTGATGCTAGCAAGGCTGGTCTGGTTAAG CGCACAGATGAAAATGGTTACCTCTACTTCTTGAACGACGATCTCAAG GGGGTTGCTAACCCTCAGGTTTCTGGTTTCCTTCAAGTCTGGGTACCA GTGGGAGCAGCAGATGACCAAGATATTCGTGTAGCAGCTAGCGATAC AGCAAGTACCGATGGAAAATCACTCCATCAAGATGCTGCCATGGACT CTCGCGTCATGTTTGAAGGTTTCTCTAACTTCCAATCTTTTGCGACAA AAGAAGAAGAGTATACCAATGTTGTCATTGCTAACAATGTTGATAAA TTTGTTTCATGGGGAATCACTGACTTTGAAATGGCTCCTCAGTATGTC TCATCTACTGACGGTCAGTTCCTTGATTCTGTCATTCAAAATGGTTAT GCCTTTACCGACCGTTATGACTTGGGTATGTCTAAAGCAAACAAGTAT GGTACAGCCGACCAATTGGTTAAGGCTATCAAGGCTCTCCATGCTAA GGGCCTGAAGGTTATGGCAGACTGGGTTCCAGACCAAATGTACACCT TCCCTAAACAAGAAGTGGTCACTGTTACTCGGACAGATAAGTTTGGC AAACCAATCGCAGGAAGCCAAATTAATCACAGTCTCTACGTAACAGA TACAAAGAGCTCTGGTGATGACTATCAAGCTAAATACGGCGGTGCCT TCCTTGACGAATTAAAGGAAAAATATCCAGAACTCTTCACCAAGAAG CAAATGTCTACTGGTCAGGCGATTGATCCATCTGTTAAGATTAAACAA TGGTCTGCTAAGTACTTTAATGGAAGTAATATTCTTGGCCGGGGTGCC GATTATGTCCTCAGCGACCAAGTCAGCAACAAGTACTTCAACGTTGCC AGCGATACACTCTTCTTACCAAGCAGCTTACTCGGCAAGGTCGTAGA GTCTGGTATTCGTTATGATGGTAAGGGTTATATTTATAACTCAAGTGC AACTGGTGACCAAGTCAAAGCAAGCTTCATTACCGAAGCAGGCAATC TATACTACTTCGGTAAAGACGGTTATATGGTGACTGGCGCTCAAACCA TTAATGGTGCTAACTATTTCTTCCTTGAAAATGGTACGGCTCTTCGCA ACACTATTTATACAGATGCTCAAGGCAATAGCCATTACTACGCAAAT GACGGTAAACGCTATGAAAATGGTTACCAACAATTTGGTAATGACTG GCGTTACTTCAAGGACGGTAACATGGCTGTTGGCTTGACAACTGTTGA TGGCAATGTTCAATACTTTGATAAAGATGGTGTTCAAGCTAAGGATA AGATTATTGTCACCCGTGATGGTAAGGTTCGTTACTTTGACCAACATA ATGGAAATGCTGTAACCAATACCTTCATCGCTGACAAGACTGGTCACT GGTACTATCTAGGTAAAGATGGTGTCGCTGTTACCGGTGCTCAAACCG TTGGGAAACAAAAACTTTACTTTGAAGCAAACGGTCAACAAGTTAAG GGTGACTTCGTAACTTCTGACGAAGGTAAACTTTACTTCTACGATGTC GATTCAGGTGACATGTGGACTGATACCTTCATTGAAGATAAGGCAGG CAATTGGTTCTACCTTGGTAAAGATGGTGCAGCTGTGACTGGTGCTCA AACTATTCGTGGCCAAAAACTTTACTTCAAGGCTAACGGCCAACAAG TCAAGGGAGATATCGTCAAGGGTACTGATGGTAAGATCCGTTACTAC GACGCTAAATCTGGTGAACAAGTCTTCAACAAGACTGTTAAGGCCGC TGATGGCAAGACCTATGTTATCGGAAATGATGGTGTTGCAGTTGATCC AAGCGTTGTCAAAGGACAAACCTTCAAGGATGCTTCAGGTGCTCTTC GTTTCTATAACCTCAAAGGACAACTGGTAACAGGCAGCGGTTGGTAT GAAACTGCAAATCACGATTGGGTTTATATCCAATCTGGTAAAGCCTTG ACTGGGGAACAGACCATCAATGGTCAACATCTTTACTTCAAGAAAGA TGGACATCAAGTCAAAGGACAACTGGTAACAGGAACTGATGGTAAGG TTCGCTATTATGATGCAAATTCAGGCGACCAAGCCTTCAACAAGTCTG TAACAGTTAACGGTAAGACTTACTACTTCGGTAATGATGGCACTGCTC AAACAGCGGGAAACCCTAAGGGACAAACCTTCAAAGATGGTTCAGAT ATCCGCTTTTACAGCATGGAAGGCCAATTAGTGACTGGCAGTGGTTG GTACTCAAACGCACAAGGTCAGTGGCTTTATGTCAAAAATGGTAAAG TCTTGACAGGCCTGCAAACAGTTGGTAGCCAACGTGTTTACTTTGACG AAAATGGTATTCAAGCTAAAGGTAAAGCAGTAAGGACTTCCGACGGT AAGATACGCTACTTCGATGAAAATTCAGGTAGCATGATTACCAACCA ATGGAAAGAGGTTAACGGTCGATATTATTACTTCGGTAATGATGGCG CAGCTATCTACCGTGGCTGGAACTAA Gtflprotein MEKNVRFKMHKVKKRWVTLSVASATMLASALGASVASADTDTASD 11 (S.sobrinus) DSNQTVVTGDQTTNNQATDQTSIAATATSEQSASTDAATDQASAAEQTQ GTTASTDTAAQTTTNANEAKWVPTENENQGFTDEMLAEAKNVATAESD SIPSDLAKMSNVKQVDGKYYYYDQDGNVKKNFAVSVGDKIYYFDETGA YKDTSKVDADKSSSAVSQNATIFAANNRAYSTSAKNFEAVDNYLTADS WYRPKSILKDGKTWTESGKDDFRPLLMAWWPDTETKRNYVNYMNKVV GIDKTYTAETSQADLTAAAELVQARIEQKITSENNTKWLREAISAFVKTQ PQWNGESEKPYDDHLQNGALLFDNQTDLTPDTQSNYRLLNRTPTNQTGS LDSRFTYNPNDPLGGYDFLLANDVDNSNPVVQAEQLNWLHYLLNFGSIY ANDADANFDSIRVDAVDNVDADLLQISSDYLKAAYGIDKNNKNANNHV SIVEAWSDNDTPYLHDDGDNLMNMDNKFRLSMLWSLAKPTDVRSGLNPLI HNSLVDREVDDREVETVPSYSFARAHDSEVQDIIRDIIKAEINPNSFGYS FTQEEIDQAFKIYNEDLKKTDKKYTHYNVPLSYTLLLTNKGSIPRVYYGD MFTDDGQYMANKTVNYDAIESLLKARMKYVAGGQAMQNYQIGNGEIL TSVRYGKGALKQSDKGDATTRTSGVGVVMGNQPNFSLDGKVVALNMG AAHANQEYRALMVSTKDGVATYATDADASKAGLVKRTDENGYLYFLN DDLKGVANPQVSGFLQVWVPVGAADDQDIRVAASDTASTDGKSLHQD AAMDSRVMFEGFSNFQSFATKEEEYTNVVIANNVDKFVSWGITDFEMAP QYVSSTDGQFLDSVIQNGYAFTDRYDLGMSKANKYGTADQLVKAIKAL HAKGLKVMADWVPDQMYTFPKQEVVTVTRTDKFGKPIAGSQINHSLYV TDTKSSGDDYQAKYGGAFLDELKEKYPELFTKKQMSTGQAIDPSVKIKQ WSAKYFNGSNILGRGADYVLSDQVSNKYFNVASDTLFLPSSLLGKVVES GIRYDGKGYIYNSSATGDQVKASFITEAGNLYYFGKDGYMVTGAQTING ANYFFLENGTALRNTIYTDAQGNSHYYANDGKRYENGYQQFGNDWRY FKDGNMAVGLTTVDGNVQYFDKDGVQAKDKIIVTRDGKVRYFDQHNG NAVTNTFIADKTGHWYYLGKDGVAVTGAQTVGKQKLYFEANGQQVKG DFVTSDEGKLYFYDVDSGDMWTDTFIEDKAGNWFYLGKDGAAVTGAQ TIRGQKLYFKANGQQVKGDIVKGTDGKIRYYDAKSGEQVFNKTVKAAD GKTYVIGNDGVAVDPSVVKGQTFKDASGALRFYNLKGQLVTGSGWYET ANHDWVYIQSGKALTGEQTINGQHLYFKKDGHQVKGQLVTGTDGKVR YYDANSGDQAFNKSVTVNGKTYYFGNDGTAQTAGNPKGQTFKDGSDIR FYSMEGQLVTGSGWYSNAQGQWLYVKNGKVLTGLQTVGSQRVYFDEN GIQAKGKAVRTSDGKIRYFDENSGSMITNQWKEVNGRYYYFGNDGAAI YRGWN GtfImature DTDTASDDSNQTVVTGDQTTNNQATDQTSIAATATSEQSASTDAATDQA 12 protein(S. SAAEQTQGTTASTDTAAQTTTNANEAKWVPTENENQGFTDEMLAEAKN sobrinus) VATAESDSIPSDLAKMSNVKQVDGKYYYYDQDGNVKKNFAVSVGDKIY YFDETGAYKDTSKVDADKSSSAVSQNATIFAANNRAYSTSAKNFEAVDN YLTADSWYRPKSILKDGKTWTESGKDDFRPLLMAWWPDTETKRNYVN YMNKVVGIDKTYTAETSQADLTAAAELVQARIEQKITSENNTKWLREAI SAFVKTQPQWNGESEKPYDDHLQNGALLFDNQTDLTPDTQSNYRLLNR TPTNQTGSLDSRFTYNPNDPLGGYDFLLANDVDNSNPVVQAEQLNWLH YLLNFGSIYANDADANFDSIRVDAVDNVDADLLQISSDYLKAAYGIDKN NKNANNHVSIVEAWSDNDTPYLHDDGDNLMNMDNKFRLSMLWSLAKP TDVRSGLNPLIHNSLVDREVDDREVETVPSYSFARAHDSEVQDIIRDIIKA EINPNSFGYSFTQEEIDQAFKIYNEDLKKTDKKYTHYNVPLSYTLLLTNK GSIPRVYYGDMFTDDGQYMANKTVNYDAIESLLKARMKYVAGGQAMQ NYQIGNGEILTSVRYGKGALKQSDKGDATTRTSGVGVVMGNQPNFSLD GKVVALNMGAAHANQEYRALMVSTKDGVATYATDADASKAGLVKRT DENGYLYFLNDDLKGVANPQVSGFLQVWVPVGAADDQDIRVAASDTAS TDGKSLHQDAAMDSRVMFEGFSNFQSFATKEEEYTNVVIANNVDKFVS WGITDFEMAPQYVSSTDGQFLDSVIQNGYAFTDRYDLGMSKANKYGTA DQLVKAIKALHAKGLKVMADWVPDQMYTFPKQEVVTVTRTDKFGKPIA GSQINHSLYVTDTKSSGDDYQAKYGGAFLDELKEKYPELFTKKQMSTGQ AIDPSVKIKQWSAKYFNGSNILGRGADYVLSDQVSNKYFNVASDTLFLPS SLLGKVVESGIRYDGKGYIYNSSATGDQVKASFITEAGNLYYFGKDGYM VTGAQTINGANYFFLENGTALRNTIYTDAQGNSHYYANDGKRYENGYQ QFGNDWRYFKDGNMAVGLTTVDGNVQYFDKDGVQAKDKIIVTRDGKV RYFDQHNGNAVTNTFIADKTGHWYYLGKDGVAVTGAQTVGKQKLYFE ANGQQVKGDFVTSDEGKLYFYDVDSGDMWTDTFIEDKAGNWFYLGKD GAAVTGAQTIRGQKLYFKANGQQVKGDIVKGTDGKIRYYDAKSGEQVF NKTVKAADGKTYVIGNDGVAVDPSVVKGQTFKDASGALRFYNLKGQL VTGSGWYETANHDWVYIQSGKALTGEQTINGQHLYFKKDGHQVKGQL VTGTDGKVRYYDANSGDQAFNKSVTVNGKTYYFGNDGTAQTAGNPKG QTFKDGSDIRFYSMEGQLVTGSGWYSNAQGQWLYVKNGKVLTGLQTV GSQRVYFDENGIQAKGKAVRTSDGKIRYFDENSGSMITNQWKEVNGRY YYFGNDGAAIYRGWN gtfGcoding ATGGGAGAGAAAGTCGTGGCGAGAAAGAAGCTTTATAAGGCGAAAA 13 sequence(L. AAAGTTGGGTGGTAGCTGGTTTGACTACTGCCTTTTTGATGGTGAATC pseudo- AAGCCAGTGTAAGCGCTGATCAAAATGTAAATGATACATCGGTCACA mesenteroides) ACAACAACGCAGGATGTCACAACAGATCAGGACACTGGTATTGACGC ATCTGTAACGACGACAGTTAGTCCAAATTTGGATGATACTCAAGTTGA TAACACCAATATTCAGACGTCAACAGATCAAAAAGATGATTCAAAAG GCACCACGCAAACAGTTGAAACGGACGTTACAACGAATAGTCAATCA ACAGACACAACAGCAGTGACAGCTCAAACGAATCAAACAGAAACAA TACAAAATAGTGATGCGACAACTGAAACAGGATTAGTGACAGTTAAT AATCAAGTCAGATACGTTAATCCTGATGGCACAGTTTTGAAAGGCGC ATACAAAACAATTAATGGTAATACCTATTATTTTGATGATAATAGTGG TGACGCACTGATAGGAATACATAAAATTGGAGAATCAATTAAGGGAT TTGGTCTTACTGGTGTCCAAGTCAAAGGAGATTACTTAACGGCAGTCA ACGGTGACAAATATTACTTTGATTCTGACGGTAATACGGTGTCTGGCG TGCAGCAAATTAATGGCAAGACCTATTATTTTGACAGCACTGGTAAAT TAATGAAGGGCTACACAGCAGTCTTGAATGGTGTCGTAACTTTCTTCA ATAGCACAACTGGTGAAGCAGATAATACTGATGCCTCAACCATTAAA ACTGGCGTTACAATCGACAACTCGGATTACACAGCTCATAATGCTGCC TATGATAATACAGCCGCCAGCTTTGATAATATCAATGGCTATCTGACG GCAGAAAGTTGGTACAGACCTAAAGAAATATTGGAAAATGGTGAGTC ATGGCGGCCATCTACTGCTGAGGATAAACGTCCCATTTTAATCACTTG GCAACCGGATATTGTGACCGAGGTCAATTATCTCAACATGATGTCTGC AAATGGTTTGCTCTCGATTAACGCACCATTTACAACTGCTAGTGACCT TGCCATTATGAATGATGCTGTCAGAGCTGTTCAAAAGAATATTGAAAT ACGGATTAGCCAAGAAAAATCAACTGATTGGTTAAAAGCGTTGATGA CTCAGTTTATTAATACACAACCGCAGTGGAATGAGGTGAGTGAATCA CCAAGCAATGATCACCTACAAGGCGGTGCATTAACGTATGTCAATAG TCCATTGACGCCAGATGCCAATTCTAATTTTCGTTTGCTTAATCGGAC CCCGACTAATCAATCTGGCACAACGCGTTATGATACTGACAAATCTG AAGGTGGTTTTGAATTATTATTAGCTAATGATGTTGATAATTCAAACC CAGTAGTTCAAGCTGAGCAACTTAACTGGTTGTACTATTTAATGAATT TTGGCTCAATTACAGCTAATGATCCAACGGCTAATTTTGATGGTATTA GAGTTGATGCTGTTGATAACGTAGACGCTGACTTGTTGCAAATTGCAT CGGATTACTTTAAATTAGCGTATGGCACTAGTTTATCTGATACAAATG CTAACCAACATTTATCAATTTTGGAAGATTGGTCTGCTAATGATGCGG AATACATGTCAAAAACGGGTAGTAATCAATTGACAATGGACACGTAT ACGCAGCAACAATTACTCTTTTCATTGACAAAACAAGTTGGTAATCGT GCTGACATGCGACGCTTCCTAGAATACTTTATGATTAATCGTGCCAAC GATTCAACCGAAAATATTGCGACACCAAATTACTCATTTGTTCGTGCA CATGACAGTGAAGTTCAAACGGTCATTGCTACGATAATTAAAGATTT ACATCCTGATGTTGTGAATAGTCTTGCGCCAACTCAAGCACAATTAGA AGAGGCATTTGCCGTGTATAACGCTGATATGAATCGGGTGGATAAAC AATATACCCAATACAATATGCCAAGTGCTTATGCCATGCTTTTGACCA ATAAAGATACGATTCCACGTGTATATTATGGTGATTTATACACAGATG ATGGTGAGTATATGGGTACGCAAACACCCTATTATGATGCTATCGTTA ATCTATTGCAGTCTCGCGTTAAATATGTTGCAGGTGGACAATCCATGG CGGTTGATCAACATGATATTTTAACAAGTGTGCGTTATGGCAAAAATT TGGCTGACGCTAATGCGACATCAGATGATTTAACCAGTATTAACTCAG GCATAGGTGTTATTGTTTCTAATAATCCCAATCTTTCGTTGGCGTCTGG TGAAACCGTCGTGCTCCATATGGGCATTGCACACGCTAATCAAGTTTA TCGTGAGATACTTGAGACAACCGACAACGGTATTGCAAATAATACCG ATATTTTTAAAACAACAGACAGTAATGGTGACTTGATTTTCACAGCTT CTGAAATTCATGGGTATAGTAATGTTCAAGTATCAGGCTTTTTATCAG TTTGGGCGCCTAAAGATGCTACGGATAATCAAGATGTACGTACTGCT GCTAGTGAATCGACTTCTAGTGATGGCAATACGCTTCATTCAAATGCT GCCTTAGATTCCAACATAATTTATGAAGGCTTTTCAAACTTTCAATCC ACACCTCAGTCAGAAAGTGAATTTGCAAAGGTCAAAATAGCTGCTAA TGTTAATCTGTTCAAATCTTGGGGTGTCACCAGTTTTCAAATGGCACC TCAATATCGCTCGAGCACCGATACAAGCTTTTTGGATTCCATTATTCA AAATGGTTATGCCTTCACTGACCGTTACGATTTGGGATTTGAAACACC AACGAAGTATGGGACGGACCAGCAATTGCGTGATGCAATTAAAGCAT TGCATGCTAATGGTATACAAGCAATGGCTGACTTTGTGCCAGACCAG ATTTATAATTTGCCTCAAACAGAACTGGTTTCTGTATCACGCACCGAT AGTCTTGGTAATCAGTCAGCCAATTCAAATGCAGCCAATGTATTGTAT GTATCTCATACAGTTGGTGGTGGTGAATATCAAAGCAAGTATGGGGG CGAATTTTTAGCGCTTATTAAGTCTAAATATCCAAGCTTGTTTAAAAC AATTCAGGTTTCGACAGGACTACCAATTGATGATTCAACTAAGATTAA AGAGTGGTCGGCAAAATACTTTAATGGTTCAAATATTCAAGGACGTG GTTTTGGATATGTGCTATCTGATGGTGGCACGCAGAATTACTTTAAAG TGATTTCGAACAGTACAGATGATGACTTTTTGCCTAATCAGCTGACTG GACAACCCACAATGACAGGCTTTGAACAAACAAGTAAGGGTATTGTA TATTACTCTAAGAGTGGTATTCAGGCTAAAAATCAATTCGTCAAAGAT GATGTTTCTGGTAATTACTACTATTTCAATAAGAATGGTCTGATGACA ATTGGCAGTAAGACGATCAATGGTAAAAACTATATGTTCTTGCCAAA CGGCGTAGAGTTACGAGGATCCTTTTTACAAACGGCGGATGGGACCG TCAATTACTATGCGACTAATGGGGCACAGGTTAAGGACGCCTATGTG ACTGACACAGAAGGTAATAGTTATTACTTTGATGGTGATGGGGAAAT GGTAACGGGTGCTTATACAGTTGATGGACATGCGCAATATTTTGATGT GAATGGTGTTCAAACCAAAGGGGCTATTATTACACTTGACGGTGTGC AACGCTATTATCAAGCTGGGAACGGTAATTTGGCAACGAATCAATAT GTCAGTTACAACAACAGCTGGTACTATGCCAACGCCAAGGGCGAGTT AGTGACTGGTGTTCAAAGTATTAATGGTAACGTTCAATATTTTGCCAG CAATGGGCAACAAATTAAAGGTCAAATTGTTGTGACTGGTAATCAGA AAAGTTATTACGATGCAAACACTGGAAATCTTATCAGAAATGATTTTT TGACACCGGATCAAGGTAAAACTTGGTATTATGCCGATCAAGATGGT AATCTTGTGGTAGGTGTACGGAATATTAATGGACACAATCAATATTTT GATGATAATGGGATACAAATCAAAGACCAAATCATATCAAATGATGG GCAACAATATTATTATCAAGGTGGTAATGGTGATTTAGTCACAAATCG ATATATCAGTTACAATGATAGTTGGTATTACGCCGACGCAACAGGTGT TCTTGTAACAGGTCAACAAATTATCAACGGTGAAACGCAATACTTTA GGACAGATGGTCGCCAAGTCAAGGGCCAAATTATTGCTGATGGTGAT AAACAGCATTATTACGACGCATATTCAGGCAATTTGGTTAAAAATAA TTTTGTCACAGTCGACCAAGGTAAAACTTGGTATTATGCTGATCAAGA TGGGAACCTCTCTTTGGTTGCCCAATAA GtfGprotein MGEKVVARKKLYKAKKSWVVAGLTTAFLMVNQASVSADQNVNDTS 14 (L. VTTTTQDVTTDQDTGIDASVTTTVSPNLDDTQVDNTNIQTSTDQKDDSK pseudo- GTTQTVETDVTTNSQSTDTTAVTAQTNQTETIQNSDATTETGLVTVNNQ mesenteroides) VRYVNPDGTVLKGAYKTINGNTYYFDDNSGDALIGIHKIGESIKGFGLTG VQVKGDYLTAVNGDKYYFDSDGNTVSGVQQINGKTYYFDSTGKLMKG YTAVLNGVVTFFNSTTGEADNTDASTIKTGVTIDNSDYTAHNAAYDNTA ASFDNINGYLTAESWYRPKEILENGESWRPSTAEDKRPILITWQPDIVTEV NYLNMMSANGLLSINAPFTTASDLAIMNDAVRAVQKNIEIRISQEKSTDW LKALMTQFINTQPQWNEVSESPSNDHLQGGALTYVNSPLTPDANSNFRL LNRTPTNQSGTTRYDTDKSEGGFELLLANDVDNSNPVVQAEQLNWLYY LMNFGSITANDPTANFDGIRVDAVDNVDADLLQIASDYFKLAYGTSLSD TNANQHLSILEDWSANDAEYMSKTGSNQLTMDTYTQQQLLFSLTKQVG NRADMRRFLEYFMINRANDSTENIATPNYSFVRAHDSEVQTVIATIIKDL HPDVVNSLAPTQAQLEEAFAVYNADMNRVDKQYTQYNMPSAYAMLLT NKDTIPRVYYGDLYTDDGEYMGTQTPYYDAIVNLLQSRVKYVAGGQSM AVDQHDILTSVRYGKNLADANATSDDLTSINSGIGVIVSNNPNLSLASGE TVVLHMGIAHANQVYREILETTDNGIANNTDIFKTTDSNGDLIFTASEIHG YSNVQVSGFLSVWAPKDATDNQDVRTAASESTSSDGNTLHSNAALDSNI IYEGFSNFQSTPQSESEFAKVKIAANVNLFKSWGVTSFQMAPQYRSSTDT SFLDSIIQNGYAFTDRYDLGFETPTKYGTDQQLRDAIKALHANGIQAMAD FVPDQIYNLPQTELVSVSRTDSLGNQSANSNAANVLYVSHTVGGGEYQS KYGGEFLALIKSKYPSLFKTIQVSTGLPIDDSTKIKEWSAKYFNGSNIQGR GFGYVLSDGGTQNYFKVISNSTDDDFLPNQLTGQPTMTGFEQTSKGIVY YSKSGIQAKNQFVKDDVSGNYYYFNKNGLMTIGSKTINGKNYMFLPNG VELRGSFLQTADGTVNYYATNGAQVKDAYVTDTEGNSYYFDGDGEMV TGAYTVDGHAQYFDVNGVQTKGAIITLDGVQRYYQAGNGNLATNQYV SYNNSWYYANAKGELVTGVQSINGNVQYFASNGQQIKGQIVVTGNQKS YYDANTGNLIRNDFLTPDQGKTWYYADQDGNLVVGVRNINGHNQYFD DNGIQIKDQIISNDGQQYYYQGGNGDLVTNRYISYNDSWYYADATGVLV TGQQIINGETQYFRTDGRQVKGQIIADGDKQHYYDAYSGNLVKNNFVTV DQGKTWYYADQDGNLSLVAQ GtfGmature DQNVNDTSVTTTTQDVTTDQDTGIDASVTTTVSPNLDDTQVDNTNIQTS 15 protein(L. TDQKDDSKGTTQTVETDVTTNSQSTDTTAVTAQTNQTETIQNSDATTET pseudo- GLVTVNNQVRYVNPDGTVLKGAYKTINGNTYYFDDNSGDALIGIHKIGE mesenteroides) SIKGFGLTGVQVKGDYLTAVNGDKYYFDSDGNTVSGVQQINGKTYYFD STGKLMKGYTAVLNGVVTFFNSTTGEADNTDASTIKTGVTIDNSDYTAH NAAYDNTAASFDNINGYLTAESWYRPKEILENGESWRPSTAEDKRPILIT WQPDIVTEVNYLNMMSANGLLSINAPFTTASDLAIMNDAVRAVQKNIEI RISQEKSTDWLKALMTQFINTQPQWNEVSESPSNDHLQGGALTYVNSPL TPDANSNFRLLNRTPTNQSGTTRYDTDKSEGGFELLLANDVDNSNPVVQ AEQLNWLYYLMNFGSITANDPTANFDGIRVDAVDNVDADLLQIASDYFK LAYGTSLSDTNANQHLSILEDWSANDAEYMSKTGSNQLTMDTYTQQQL LFSLTKQVGNRADMRRFLEYFMINRANDSTENIATPNYSFVRAHDSEVQ TVIATIIKDLHPDVVNSLAPTQAQLEEAFAVYNADMNRVDKQYTQYNMP SAYAMLLTNKDTIPRVYYGDLYTDDGEYMGTQTPYYDAIVNLLQSRVK YVAGGQSMAVDQHDILTSVRYGKNLADANATSDDLTSINSGIGVIVSNN PNLSLASGETVVLHMGIAHANQVYREILETTDNGIANNTDIFKTTDSNGD LIFTASEIHGYSNVQVSGFLSVWAPKDATDNQDVRTAASESTSSDGNTLH SNAALDSNIIYEGFSNFQSTPQSESEFAKVKIAANVNLFKSWGVTSFQMA PQYRSSTDTSFLDSIIQNGYAFTDRYDLGFETPTKYGTDQQLRDAIKALH ANGIQAMADFVPDQIYNLPQTELVSVSRTDSLGNQSANSNAANVLYVSH TVGGGEYQSKYGGEFLALIKSKYPSLFKTIQVSTGLPIDDSTKIKEWSAKY FNGSNIQGRGFGYVLSDGGTQNYFKVISNSTDDDFLPNQLTGQPTMTGFE QTSKGIVYYSKSGIQAKNQFVKDDVSGNYYYFNKNGLMTIGSKTINGKN YMFLPNGVELRGSFLQTADGTVNYYATNGAQVKDAYVTDTEGNSYYFD GDGEMVTGAYTVDGHAQYFDVNGVQTKGAIITLDGVQRYYQAGNGNL ATNQYVSYNNSWYYANAKGELVTGVQSINGNVQYFASNGQQIKGQIVV TGNQKSYYDANTGNLIRNDFLTPDQGKTWYYADQDGNLVVGVRNING HNQYFDDNGIQIKDQIISNDGQQYYYQGGNGDLVTNRYISYNDSWYYA DATGVLVTGQQIINGETQYFRTDGRQVKGQIIADGDKQHYYDAYSGNLV KNNFVTVDQGKTWYYADQDGNLSLVAQ
[0043] In many embodiments, the modified glucansucrase enzymes of the present invention are at least partially purified from a recombinant host cell, or its growth medium. A purified protein or polypeptide of the mutant enzymes of the present invention can be obtained by several methods. The purified protein or polypeptide of the modified glucansucrase of the present invention is preferably produced in pure form (preferably at least about 80%, more preferably 90%, pure) by conventional techniques well known in the art. Typically, the purified protein or polypeptide of the modified glucansucrase of the present invention is secreted into the growth medium of recombinant host cells. Alternatively, the purified protein or polypeptide of the glucansucrase of the present invention is produced but not secreted into growth medium. In such cases, to isolate the protein or polypeptide of the mutant glucansucrase, the host cell carrying a recombinant plasmid is propagated, lysed by any method known in the art (e.g., sonication, heat, or chemical treatment), and the homogenate is centrifuged to remove cell debris. The supernatant is then subjected to immobilized affinity chromatography depending on the affinity tag (e.g., hexahistidine, maltose binding protein, or glutathione-S-transferase). Depending on the application requirements, affinity tags may be removed from the enzyme by enzymatic cleavage and further purified to homogeneity. Alternatively, traditional protein purification methods involving, but not limited to, sequential ammonium precipitation, ion exchange chromatography, hydrophobic interaction chromatography and gel filtration may be used in the purification of the mutant glucansucrase.
[0044] Oligosaccharides
[0045] Disclosed herein are variant glucansucrase enzymes that produce various products. In general, the variant enzymes produce some amount of at least one oligosaccharide. Of pertinence to the present application, isomelizitose (-D-glucopyranosyl-(1.fwdarw.6)--D-fructofuranosyl-(21)--D-glucopyranoside) production, is increased by many variants disclosed herein.
[0046] Although several uses for isomelezitose have been proposed, including prebiotics (Grl et al., supra) and pharmaceutical excipients (Backstrom et al, 1999), applications are currently limited due to the high cost and relative scarcity of this compound. Isomelezitose has been isolated in small amounts from several enzymatic reaction mixtures (Chiba et al., supra; Fujii et al., supra; Inohara-Ochiai et al., supra), but only one instance of a high-yielding synthesis has been reported (Grl et al., supra). In that example, the yield was reportedly over 70% from sucrose, but that number was calculated from the amount of sucrose consumed, not from the total sucrose added to the reaction. Because their method gave undesirable side products if the reaction was allowed to proceed to completion, it was halted when only a fraction of the sucrose was consumed. If their yield were calculated on the basis of the amount of sucrose present in the starting mixture, the result would actually be closer to 20-25% yield. This contrasts with yields from several variant enzymes described herein, which are on the order of 40-60% yield from the total sucrose added to the reaction mixture, all of which is consumed in the reaction.
[0047] Thus, one aspect of the present invention is the production of isomelezitose and other products using the modified proteins provided herein. Such modified proteins (purified or unpurified) can be exposed to one or more carbohydrate sources as a method of converting the carbohydrate(s) to a desired product (e.g., isomelezitose). In preferred embodiments, the carbohydrate source is sucrose in an aqueous solution, with or without additional components. In such embodiments, the sucrose can be at any desired molar concentration, such as 0.1M, 0.2M, 0.3M, 0.4M, 0.5M, 0.6M, 0.7M, 0.8M, 0.9M, 1.0M, 1.1M, 1.2M, 1.3M, 1.4M, 1.5M, 1.6M, 1.7M, 1.8M, 1.9M, 2.0M, or higher. Such reactions can be performed in reaction solutions under any conditions at which the enzyme(s) exhibit catalytic activity. Standard reaction variables such as pH, temperature and ionic concentration can be readily modified by one of skill in the art. Preferably, reactions are performed at or below 40 C., and more preferably between 20 C. and 35 C. Preferably, the pH of such reactions is between 3.5 and 8.5. Preferably, reactions are performed in the presence of any desirable ion or salt, such as Ca.sup.+2, Mg.sup.+2, Na.sup.+, K.sup.+, etc.
[0048] Having generally described this invention, the same will be better understood by reference to certain specific examples, which are included herein to further illustrate the invention and are not intended to limit the scope of the invention as defined by the claims.
EXAMPLES
Example 1
[0049] Modifications of Glucansucrases
[0050] The glucansucrase gene, dsrI, from L. mesenteroides NRRL B-1118 (SEQ ID NO: 4) was previously cloned and expressed in E. coli using a small ubiquinone-like modifier (SUMO) fusion tag to improve solubility (Ct and Skory (2012), supra). After removal of the SUMO tag with SUMO protease 1, the purified enzyme is expected to have the same amino acid sequence as the mature full-length protein (SEQ ID NO: 6) without the native dsrI signal peptide (see, Table 1, underlined section of SEQ ID NO: 5), which is normally removed during secretion in the wild-type L. mesenteroides host. The lysine residue at position 441 and the threonine residue at position 654 of the full length DsrI protein (SEQ ID NO:5) are equivalent to the lysine and threonine residues at positions 400 and 613 (respectively) of the mature DsrI protein (SEQ ID NO:6) and these residues are typically referred to as L441 or T654. Thus, text describing a mutation of L441 in the full-length DsrI also refers to a mutation of L400 in the mature DsrI protein (lacking the signal sequence). Mutations for L441 substitutions (
[0051] The dextransucrase gene, dsrS, from L. mesenteroides NRRL B-1118 (SEQ ID NO: 7) was previously cloned and expressed in E. coli using a similar SUMO fusion tag (Ct and Skory (2015), supra). After removal of the SUMO tag with SUMO protease 1, the purified enzyme is expected to have the same amino acid sequence as the mature full-length protein (SEQ ID NO: 9) without the native DsrS signal peptide (see, Table 1, underlined section of SEQ ID NO: 8), which is normally removed during secretion in the wild-type L. mesenteroides host. The lysine residue at position 459 of the full length DsrS protein (SEQ ID NO:8) is equivalent to the lysine residue at position 417 of the mature DsrS protein (SEQ ID NO:9) and this residue is typically referred to as L459. Thus, text describing a mutation of L459 in the full-length DsrS also refers to a mutation of L417 in the mature DsrS protein (lacking the signal sequence). Mutations for L459P substitutions (
[0052] The glucosyltransferase gene, gtfI, from Streptococcus sobrinus NRRL B-14554 (SEQ ID NO: 10) was PCR amplified and then used for Gibson assembly with PCR-amplified pE-SUMOpro Kan (Life Sensors, Malvern, Pa.). Cleavage of the purified protein from E. coli containing the resultant plasmid pGtfI.SUMO with SUMO protease 1 should yield enzyme with the same amino acid sequence as the secreted GtfI protein (SEQ ID NO: 12) from the wild-type S. sobrinus host, without the native signal peptide (see, Table 1, underlined section of SEQ ID NO:11). The lysine residue at position 350 of the full length GtfI protein (SEQ ID NO:11) is equivalent to the lysine residue at position 312 of the mature GtfI protein (SEQ ID NO:12) and this residue is typically referred to as L350, but indicates the same lysine residue. Thus, text describing a mutation of L350 in full-length GtfI also refers to a mutation of L312 in the mature GtfI protein (lacking the signal sequence). Mutations for L350 substitutions (
[0053] The alternansucrase gene, asr, from Leuconostoc citreum NRRL B-1355 (SEQ ID NO:1) was PCR amplified and used for Gibson assembly with the same pESUMO.Gib PCR fragment previously used. In order to improve solubility of the recombinant enzyme, the sequence was also cloned into the same L. lactis expression plasmid previously described. This was accomplished by Gibson assembly of the PCR-amplified asr gene to remove the predicted signal peptide (Arguello-Morales, et al., FEMS Microbiol. Lett., (2000) 182:81-5; see, Table 1, underlined section of SEQ ID NO:2) and the vector portion of pDsrI.3535.usp45 previously used. The resultant plasmid, pAsr.3535.usp45, was further altered by modifying the codon for L544 (
[0054] The glucosyltransferase gene responsible for isomelezitose production in L. pseudomesenteroides NRRL B-1297, previously classified as L. mesenteroides (Ct and Skory, Carbohydr. Res., (2017) 439:57-60), was identified by genomic sequencing using the Illumina Nextera XT DNA Library Preparation Kit and MiSeq Reagent Kit v3. A single gene (SEQ ID NO:13), gtfG, having similarity to other dextransucrases was PCR amplified to eliminate the predicted signal peptide (see, Table 1, underlined section of SEQ ID NO:15) and then used for Gibson assembly with the same pESUMO.Gib PCR fragment previously used. The lysine residue at position 417 of the full length GftG protein (SEQ ID NO:14) is equivalent to the lysine residue at position 380 of the mature GftG protein (SEQ ID NO:15) and this residue is typically referred to as L417, but indicates the same lysine residue. Thus, text describing a mutation of L417 in full-length GftG protein can also refer to a mutation of L380 in the mature GftG protein (lacking the signal sequence). A L417P substitution was introduced into the resultant plasmid, pGtfG.SUMO as previously described. All plasmid modifications were confirmed by sequencing prior to utilization for enzyme studies.
Example 2
[0055] Analytical Methods
[0056] Glucansucrase activity was measured in one of two ways. Glucan formation was measured directly by monitoring the incorporation of .sup.14C-glucose into methanol-insoluble glucan using a modification of the technique first described by Germaine, et al. (J. Dent. Res., (1974) 53:1355-60; Ct and Skory, (2012), supra). Alternatively, glucansucrase activity was determined indirectly by measuring the accumulation of fructose released under the same reaction conditions using the Megazyme D-Glucose/D-Fructose Assay Kit with a modified microplate protocol from those previously described (Vettori et al., Carbohydr. Res. (2011) 346:1077-82). Samples were removed at timed intervals throughout the enzyme reaction and then immediately diluted 20 in Megazyme Buffer #1 and heat denatured at 80 C. for 10 minutes. The activity of DsrI is almost non-existent in Buffer #1 and the enzyme is quickly heat inactivated at this temperature. Precipitated protein for the cooled sample was then removed by centrifugation and the remaining supernatant was then sequentially analyzed for glucose and fructose according to the manufacturer's recommendations. Formation of NADPH with this assay kit was monitored at OD.sub.340 using a Biotek Synergy2 microplate reader to ensure that all conversion reactions were complete. The rate of fructose accumulation is representative of the initial rate of glucan biosynthesis. For most glucansucrases, the activities measured by both methods are nearly identical. However, for the DsrI L441 mutants, the amount of polysaccharide synthesized was significantly less than the amount of fructose released, as most of the glucosyl transfer reaction yielded isomelezitose, rather than glucan.
[0057] Reactions were also monitored chromatographically. Thin-layer chromatography was carried out using silica gel 60 plates with three solvent ascents of acetonitrile-water 4:1 (v/v). Sugars were made visible using N-(1-naphthyl) ethylenediamine dihydrochloride in 3% (v/v) sulfuric acid in methanol (Bounias, M., Anal. Biochem., (1980) 106:291-95). HPLC was performed using a Waters HPLC system with refractive index detector, fitted with a Regis Spherisorb S5NH column, 5 m particle size, 4.6 mm25 cm, eluted with acetonitrile-water 4:1 (v/v) at room temperature.
[0058] Isomelezitose was identified by chromatographic mobility, MALDI-TOFS, and .sup.1H and .sup.13C-NMR as previously described Ct et al., (2008), supra; Ct & Skory, (2017), supra).
Example 3
[0059] Reactions
[0060] All enzyme reactions were carried out at room temperature in 20 mM pH 5.5 sodium acetate buffer containing 2 mM calcium chloride and 1.5 mM sodium azide as a preservative. Sucrose concentration was varied between approximately 50 mM and 2.8M, depending on the experiment.
[0061] To compare the products from DsrI variants, 1 mL of an enzyme preparation was mixed with 6 mL of 1M sucrose (2 g sucrose). When all sucrose had been consumed, as determined by thin-layer chromatography, the water-insoluble glucan was removed by centrifugation, washed three times with water, once with 50% ethanol, once with absolute ethanol, and dried in vacuo at 50 C. Weights were recorded. The water-soluble portion of each reaction mixture was mixed with four volumes of ethanol and chilled at 18 C. for several hours. The ethanol-precipitated polysaccharide was redissolved in water, precipitated a second time, and dried and weighed as above. The 80%-ethanol soluble fraction containing the residual oligosaccharides from the reaction mixtures was evaporated under a stream of nitrogen at 60 C. to remove the ethanol. The resulting aqueous samples were chromatographed over a 3 cm57 cm column of Dowex Monosphere 99CA/320 ion exchange resin, which is a strong cation-exchange resin in the Ca.sup.2+ form. Isomelezitose and higher oligosaccharides were eluted immediately with water, whereas leucrose and fructose were retained and eluted in later fractions. The total yield of isomelezitose plus higher oligosaccharides was measured using the phenol-sulfuric acid method (DuBois et al., Anal. Chem., (1956) 28:350-6), using maltose as a standard. The oligosaccharide fraction was further analyzed by TLC as described above and the isomelezitose content determined densitometrically by scanning the TLC plate in reflectance mode on a desktop scanner (Epson Perfection V200 Photo) in black-and-white photographic mode. The image was saved as a 300 dpi jpeg file, which was subsequently analyzed densitometrically, using Un-Scan-It software version 6.1 (Silk Scientific, Orem, Utah).
[0062] To calculate yield of isomelezitose from sucrose in a large scale reaction, 11 mL of L441E DsrI was incubated with 100 g of sucrose (0.3 moles) in 120 ml of buffer at room temperature (22 C.) until all of the sucrose had been consumed (40 hours). The entire reaction mixture was then chromatographed over BioGel P-2, eluting with water. Fractions containing isomelezitose, as determined by TLC, were combined and freeze-dried in vacuo at 50 C. overnight. Conditions were similar for analysis of DsrS L459P.
Example 4
[0063] Activity of DsrI Variants
[0064] Initial .sup.14C-based glucansucrase assays of DsrI L441E enzyme preparations indicated much lower levels of glucan synthesis than the parent isolate and TLC analysis of the reactions showed that sucrose was being consumed at a rate comparable to wild-type DsrI. Fructose accumulation assays subsequently confirmed much higher activity than radioassays indicated. In one example, wild type DsrI showed 0.26 U/mL by radioassay and 0.38 U/mL based on the rate of fructose accumulation. Thin-layer chromatography of the wild-type enzyme reaction revealed that most of the difference could be accounted for by formation of leucrose. However, for the L441E mutant form of DsrI, radioassay measured only 0.07 U/mL of glucan synthase activity, but fructose accumulation analysis indicated 0.38 U/mL, similar to the wild-type. Furthermore, TLC actually showed less leucrose formation by L441E than by wild-type enzyme. Instead, the main products were fructose and an oligosaccharide with nearly the same chromatographic mobility as raffinose, suggesting it was a trisaccharide of similar structure. The unknown saccharide was isolated by gel-filtration chromatography over Bio-Gel P-2. NMR analysis was carried out as previously described (Ct et al., (2008), supra), and the resultant spectra matched previously published spectra for isomelezitose (Ct et al., (2008), supra; Inohara-Ochiai et al., supra; Shi et al., supra).
[0065] To determine the optimum sucrose concentration for maximum isomelezitose yields, a series of reactions was set up using 0.1 mL of L441E DsrI (0.18 U/mL glucan synthase activity) and 0.4 mL of sucrose solution of varying concentrations. When sucrose was completely consumed, as determined by TLC, the reaction mixtures were analyzed by HPLC. Fructose, isomelezitose, and leucrose plus isomaltulose (DP2) concentrations were measured. The results do not show any large effect of sucrose concentration on the relative ratios of each product (
[0066] Several other amino acid substituents at L441 were also investigated by TLC of reaction mixtures after complete utilization of sucrose. Leucine (native enzyme), phenylalanine (L441F), tyrosine (L441Y), and tryptophan (L441W) made little or no isomelezitose. Those producing large amounts of isomelezitose were proline (L441P), glycine (L441P), serine (L441S), threonine (L441T), arginine (L441R), aspartate (L441D), glutamate (L441E), glutamine (L441Q), and valine (L441V). Intermediate amounts of isomelezitose were produced by L441 variants I (isoleucine), K (lysine) and N (asparagine). The reaction products of each of these are shown in the thin-layer chromatogram in
[0067] After removal of fructose by Dowex chromatography, the oligosaccharide fraction was measured for total carbohydrate concentration (DuBois et al., supra). The bar graph in
[0068] There were also higher oligosaccharides formed in most of the reactions, with degrees of polymerization (DP) ranging from tetrasaccharides (DP4) upwards to DP14, as measured by MALDI-TOFS and thin-layer chromatography. Treatment with endodextranase eliminated most of the higher (DP>4) oligosaccharides, indicating that they contained predominantly (1.fwdarw.6)-linked D-glucopyranosyl residues. These are apparently acceptor products arising from glucosylation of isomelezitose. These soluble compounds may be considered higher DP oligosaccharides or very low-MW polysaccharides related to dextran.
[0069] The yield of isomelezitose from sucrose in a large scale reaction with L441E DsrI was 43 g (0.085 moles) isomelezitose from 100 g sucrose (0.3 moles), for a yield of 57%. The same reaction with L441P resulted in an isomelezitose yield of 51%.
Example 5
[0070] Activity of DsrS Variants
[0071] The wild type dextransucrase enzyme, DsrS, from L. mesenteroides NRRL B-1118 produces a water-soluble dextran, with predominantly -1.fwdarw.6 linkages (Ct and Skory (2015), supra). Substitution of leucine 459 with a proline residue resulted in a mutant enzyme (L459P) that produced isomelezitose in yields comparable to those of L441E and L441P, but with slightly lower amounts of higher DP oligosaccharides. When using the DsrS L459P mutant enzyme secreted by L. lactis, the average yield on a 1 g scale was 40% (SD5) from sucrose.
Example 6
[0072] Activity of Asr Variants
[0073] L. citreum alternansucrase (Asr) synthesizes an alternating -1.fwdarw.3, -1.fwdarw.6-linked D-glucan (Ct, G. L. (2002) Alternan. Chapter 13 in Biopolymers, Vol. 5. A. Steinbchel, Ed. Wiley-VCH, Weinheim, Germany. Pp. 323-350). Mutant enzymes were created by substitution of leucine 544 with glutamic acid, proline, arginine, or serine. All four were expressed from L. lactis extracellularly. Yields varied according to the amino acid substituent. Whereas the wild-type enzyme gave a 2.5% yield of isomelezitose, variant L544R gave a 1.9% yield, variant L544E gave a 6.8% yield, and variant L544S gave a 9.5% yield. The only alternansucrase variant tested that gave drastically higher yields of isomelezitose was L544P, which gave a 23% yield from sucrose.
Example 7
[0074] Activity of GtfI Variants
[0075] S. sobrinus 6715 is a cariogenic lactic acid bacterium that, like L. mesenteroides NRRL B-1118, produces both water-soluble and water-insoluble glucans (Hamada & Slade, Microbiol. Rev., (1980) 44:331-84). The GtfI enzyme is responsible for the synthesis of water-insoluble glucan similar to that of L. mesenteroides NRRL B-1118 in some respects (Shimamura et al., FEBS Lett., (1983) 157:79-84). Like other glucansucrases, it made isomelezitose in low yields (1.2% of theoretical maximum yield from sucrose) (Ct & Skory, (2017), supra). Mutant versions of GtfI produced enhanced amounts of isomelezitose, but the yields were much lower than those produced by the DsrI and DsrS mutants described herein. Respectively, the yields for GtfI mutants L350E, L350P, L350R and L350S as expressed from E. coli were 4.2%, 6.5%, 4.0% and 5.0%. It appeared that the lower yields were due in part to the formation of larger quantities of the higher DP oligosaccharides.
Example 8
[0076] Activity of GtfG Variants
[0077] Two glucansucrases that potentially could be responsible for isomelezitose production were identified from the L. pseudomesenteroides genome. One of them shared a 70% protein sequence identity by Lipman Pearson alignment with DsrE from L. mesenteroides NRRL B-1299, which catalyses the synthesis of -1,6 and -1,2 linkages from sucrose (Bozonnet et al., J. Bacteriol. (2002) 184:5753-61). The other protein, GtfG, was between 95-98% identical to a relatively new uncharacterized clade of glucosyltransferase from L. pseudomesenteroides (Pedersen et al., Genome Announc., (2014) 2:e00484-14; Frantzen et al., Front. Microbiol., (2017) 8:132). These proteins most closely align, 52-56% identity, with other glucansucrases that produce predominately soluble dextran where the majority of the glucosyl linkages are -1,6. Variant GtfG L419P produced enhanced levels of isomelezitose relative to the unmodified enzyme. An overall yield of 10% based on sucrose was isolated chromatographically, whereas the wild type protein is not able to produce detectable levels of isomelezitose. Also observed was a series of oligosaccharides with chromatographic mobility similar to those observed for the other LxxxP variants described above.
Example 9
[0078] Summary of Results
[0079] We previously cloned DsrI from L. mesenteroides strain NRRL B-1118 (Ct & Skory, (2012), supra) that synthesizes a water-insoluble glucan, and demonstrated that amino acid substitutions within the active site of the enzyme at threonine residue 654 exhibit altered linkage specificity with respect to the ratios of (1.fwdarw.3) and (1.fwdarw.6) D-glucopyranosyl linkages (Ct & Skory, (2014), supra). Several of those strains produced higher proportions of (1.fwdarw.3) linkages, but also gave lower yields of glucan. In an attempt to increase glucan yields, we subsequently decided to focus on amino acid substitution at leucine 441 with DsrI. The corresponding residue, Leu940, in a L. reuteri glucansucrase GTF180-AN, which produces a water-soluble dextran, was shown to be involved in acceptor substrate binding and is crucial to linkage specificity and glucan yields with this enzyme (Meng et al., Appl. Microbiol. Biotechnol., (2015) 99:5885-94; Meng et al., J. Biol. Chem. (2014) 289:32773-82). All amino acid substitutions in Leu940 resulted in an increased percentage of (1.fwdarw.6), with a subsequent decrease in (1.fwdarw.3) linkages. However, L940E and L940F substitutions also significantly shifted reaction specificity from oligosaccharide to polysaccharide synthesis (Meng et al., (2014), supra).
[0080] We initially focused on the equivalent L940E substitution in DsrI because this particular GTF180-N mutant had the highest (1.fwdarw.3) polysaccharide productivity compared to the other substitutions. When we performed the L441E substitution with DsrI (as described above), the resulting mutant form of the enzyme produced very little insoluble glucan. Instead, and unexpectedly, it produced isomelezitose in high yields. This contrasts with L940 substitutions in GTF180-N that produced linear isomalto-oligosaccharides or very complex oligosaccharide mixtures, none of which were identified as isomelezitose (Meng et al., (2014), supra).
[0081] Isomelezitose has previously been described as minor product in reactions of alternansucrase using fructose as an acceptor Ct et al., (2008), supra) and Weisella dextransucrase using lactose as acceptor (Shi et al., supra). More recently, it was reported that isomelizitose is produced in trace amounts by a number of glucansucrases when sucrose is the only substrate added (Ct & Skory, (2017), supra). However, it was surprising to find that so many of the L441 variants of DsrI investigated produced high levels of isomelezitose. Besides the wild-type enzyme, the only other L441 variants studied which produced little or no isomelezitose were from the large aromatic amino acid substituents tryptophan, tyrosine and phenylalanine. Furthermore, these three variants also made little or no water-insoluble glucan (
[0082] While the invention has been described with reference to details of the illustrated embodiments, these details are not intended to limit the scope of the invention as defined in the appended claims. The embodiment of the invention in which exclusive property or privilege is claimed is defined as follows: