Methods for the improved formation of acarbose

12351847 · 2025-07-08

    Abstract

    The present invention relates to Actinomycetales strains for the improved formation of acarbose. Provided are Actinomycetales strains which are engineered to overexpress dTDP-D-glucose-4,6-dehydratase (AcbB) and/or uridyltransferase (GtaB). Also provided are Actinomycetales strains which are engineered to have a reduced or absent expression of the small carbohydrate binding protein (Cgt) and/or a reduced or absent expression of genes which are essential for carotenoid synthesis. Also provided are tools, methods and means to generate these strains.

    Claims

    1. A method to engineer an Actinoplanes strain for the improved production of acarbose, the method comprising engineering the Actinoplanes strain by deleting the gene encoding extracellular small carbohydrate binding protein Cgt according to SEQ ID No. 20.

    2. An Actinoplanes strain for the improved production of acarbose, wherein the Actinoplanes strain is genetically engineered by deleting the gene encoding the extracellular small carbohydrate binding protein Cgt according to SEQ ID No. 20.

    Description

    BRIEF DESCRIPTION OF THE DRAWINGS

    (1) FIG. 1. Model of the biosynthesis of acarviosyl-maltose in Actinoplanes sp. SE50/110. Shown are the eleven steps of acarbose biosynthesis from 2-epi-5-epi-valiolone (Zhang (2002)).

    (2) FIG. 2. The acarbose biosynthesis gene cluster and gene disposition in the genome of Actinoplanes sp. SE50/110 (GenBank: LT827010.1) (Schaffert, et al. 2019).

    (3) FIG. 3. Promoter screening on protein and transcript level in Actinoplanes sp. SE50/110 strains, cf. Table 1. Shown are the normalized glucuronidase activities on the left side (absolute values) and the relative transcript amounts of the gusA gene determined by RT-qPCR on the right side. For the glucuronidase assay, the slope of the absorption curves of indigo was calculated by linear regression and normalized by the cell dry weights. The normalized activities were tested for significant differences compared to pGUS in a two-sided t-test (p-values: P.sub.2475: 0.8889, P.sub.efp: 3.048e-07, P.sub.cdaR: 8.967e-07, P.sub.rpst: 1.296e-08, P.sub.rpsJ: 0.0003677, P.sub.cgt: 2.183e-06, P.sub.tipA: 0.0001651, P.sub.apm: 0.0001078, P.sub.ermE*: 0.007406, P.sub.katE: 0.002577, P.sub.moeEs: 0.001809, P.sub.gapDH: 0.0005821, P.sub.act: 0.02042). The relative transcript amounts of the gusA gene were analyzed in relation to the pGUS vector (set to 1). For the act promoter, no RNA could be isolated due to severe growth deficiencies. For the remaining promoters, a significant increase in the relative transcript amount was measured (p-values of a two-sided t-test: P.sub.2475: 0.0001133, P.sub.efp: 4.871e-05, P.sub.cdaR: 0.002509, P.sub.rps: 9.928e-06, P.sub.rpsJ: 1.167e-08, P.sub.cgt: 5.911e-08, P.sub.tipA: 7.158e-06, P.sub.apm: 4.596e-05, P.sub.ermE*: 0.0009364, P.sub.katE: 0.0001373, P.sub.moeEs: 0.0002518, P.sub.gapDH: 4.207e-06). Significance levels of the calculated p-values are shown by asterisks: * p-value<α=5%, ** p-value<α=1%, *** p-value<α=0.1%. Figure published in Schaffert et al. (2019).
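    The normalization described for the glucuronidase assay (slope of the absorption curve by linear regression, divided by the cell dry weight) and the asterisk notation can be sketched as follows. This is a minimal illustration only: the function names, the time and absorbance values, and the units are hypothetical and are not taken from the specification; only the thresholds (5%, 1%, 0.1%) come from the caption.

```python
def slope(times, absorbances):
    """Ordinary least-squares slope of absorbance versus time."""
    n = len(times)
    if n < 2 or n != len(absorbances):
        raise ValueError("need two equally long series with at least 2 points")
    mean_t = sum(times) / n
    mean_a = sum(absorbances) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    sxy = sum((t - mean_t) * (a - mean_a) for t, a in zip(times, absorbances))
    return sxy / sxx


def normalized_activity(times, absorbances, cell_dry_weight):
    """Slope of the absorption curve divided by the cell dry weight."""
    return slope(times, absorbances) / cell_dry_weight


def significance_stars(p_value):
    """Map a p-value to the asterisk notation used in the captions."""
    if p_value < 0.001:   # alpha = 0.1%
        return "***"
    if p_value < 0.01:    # alpha = 1%
        return "**"
    if p_value < 0.05:    # alpha = 5%
        return "*"
    return ""             # not significant


# Hypothetical absorbance readings (arbitrary units) over time (arbitrary units)
example_times = [0, 1, 2, 3]
example_absorbances = [0.1, 0.3, 0.5, 0.7]
example_activity = normalized_activity(example_times, example_absorbances, 2.0)
```

    Applied to the p-values listed in the caption, `significance_stars` gives no asterisk for P.sub.2475 (0.8889) and three asterisks for P.sub.efp (3.048e-07).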

    (4) FIG. 4. Strategies for improved acarbose production. Three different strategies are provided to improve the acarbose production: 1. Increasing the gene dose of acarbose biosynthesis genes, 2. Deployment of precursors of acarbose biosynthesis and 3. Reducing the metabolic burden by gene deletion. Shown are the target genes evaluated in this work. Furthermore, an overexpression system had to be implemented for the overexpression of single genes.

    (5) FIG. 5. Chemical structure of acarbose. Acarbose is a cyclitol-containing aminoglycoside composed of a pseudodisaccharide (valienaminyl-4-amino-4,6-dideoxyglucose), called acarviose, and maltose. Both are connected by an α-1,4-glycosidic bond. Figure published in Wolf 2017.

    (6) FIG. 6. Vector map of the novel cloning system pSETT4 (cf. SEQ ID No. 110, SEQ ID No. 111). A promoter, such as the strong promoter of the gene gapDH from Eggerthella lenta or the tipA promoter, is cloned in front of an expression cassette, e.g. the lacZ cassette. The lacZ cassette is flanked by a recognition site of a restriction enzyme, e.g. BsaI. The restriction site enables exchange of lacZ for the gene of interest by Gibson Assembly, restriction/ligation cloning or Golden Gate cloning. For termination, T4 terminators are introduced before and after the cloning site. Behind the cloning site, two antiparallel oriented T4 terminators are intended to prevent read-through from both directions. For exchange of the promoter sequence, further restriction sites, e.g. NdeI and KpnI restriction sites, were introduced. Furthermore, the vector comprises the integrase gene int and the attachment site attP of the phage φC31, the origin of transfer (incP) and relaxosome gene traJ, the high-copy-number ColE1/pMB1/pBR322/pUC origin of replication and a resistance gene (here: the apramycin resistance gene aac(3)IV (apmR)).

    (7) FIG. 7. Scheme of the novel deletion system and the processes during the homologous recombination (first and second crossover). Selection of vector integration is performed by use of either apramycin or kanamycin (first crossover, resistance mediated by apmR or kanR). Counterselection is performed by use of 5-fluorouracil (second crossover, sensitivity mediated by codA).

    (8) FIG. 8. Workflow of novel deletion system using homologous recombination.

    (9) FIG. 9. BlastP analysis of the amino acid sequence of Cgt leads to the identification of 17 other proteins consisting of a singular CBM-20 domain. The protein tree was created and visualized on the basis of multiple sequence alignment performed by BlastP (Altschul et al. 1990). The protein tree shows the distance of the 18 singular CBM-20 domain proteins, identified by the NCBI accession number and their hosts. In brackets the sequence identity and positives of BlastP analysis are shown in percentages.

    (10) FIG. 10. Growth of the wild type of Actinoplanes sp. SE50/110 in minimal medium supplemented with different carbon sources (in equal C-molar amounts). Shown are the cell dry weights of at least three biological replicates and the standard deviation (n.sub.glc=3, n.sub.mal=5, n.sub.cel=4, n.sub.lac=3, n.sub.ara=5, n.sub.starch=5).

    (11) FIG. 11. A. Relative transcript amounts of cgt in Actinoplanes sp. SE50/110 grown on minimal medium supplemented with starch, C-Pur, glucose, galactose, cellobiose, or lactose as carbon source, compared to a culture grown on maltose minimal medium. A two-sided t-test showed significant differential expression of the cgt gene on the carbon sources glucose (p-value=0.002848), galactose (p-value=0.002945) and lactose (p-value=0.00114) compared to maltose. B. Relative transcript amount of cgt in Actinoplanes sp. SE50/110 grown on maltose minimal medium supplemented with 44.40 g·L.sup.-1 maltose compared to a culture grown on 72.06 g·L.sup.-1 maltose. A two-sided t-test showed significantly reduced expression of cgt in the medium containing the reduced amount of maltose (p-value=0.04141).

    (12) FIG. 12. Growth of the wild type and the deletion mutant Δcgt of Actinoplanes sp. SE50/110 in minimal medium supplemented with different carbon sources. Shown are the cell dry weights and the standard deviation over time (wild type: n.sub.glc=3, n.sub.mal=5, n.sub.cel=4, n.sub.lac=3, n.sub.ara=5, n.sub.starch=5; Δcgt: n.sub.glc=2, n.sub.mal=5, n.sub.cel=4, n.sub.lac=4, n.sub.ara=5, n.sub.starch=5).

    (13) FIG. 13. Final cell dry weights obtained in cultivations of the wild type and the Δcgt mutant in minimal media supplemented with six different carbon sources. The error bars denote standard deviations.

    (14) FIG. 14. Growth of Δcgt and the wild type under limited amounts of starch as carbon source. Medium was supplemented with 1 g·L.sup.-1, 2 g·L.sup.-1, 3 g·L.sup.-1, 4 g·L.sup.-1 and 5 g·L.sup.-1 starch and cultivation was performed in the RoboLector system of m2p-labs. Shown are the backscatter signals in a bar diagram and the standard deviation of at least three biological replicates. No restraint on growth was observed for Δcgt. For 1 g·L.sup.-1, growth was even found to be significantly enhanced (p-value of a two-sided t-test: 0.006141, n.sub.wt=3, n.sub.Δcgt=4).

    (15) FIG. 15. Final cell dry weights of a pH screening experiment in maltose minimal medium. Wild type and Δcgt mutant of Actinoplanes sp. SE50/110 were grown in 1 mL reaction volume in 48-well FlowerPlates in the RoboLector system of m2p-labs. At pH values ranging from 4 to 7, no significant differences in final cell dry weights were observed (tested by a two-sided t-test, n.sub.wt=3, n.sub.Δcgt=4).

    (16) FIG. 16. Osmolarity tolerance screening in the RoboLector system of m2p-labs: Final cell dry weights in maltose minimal medium with maltose monohydrate concentrations ranging between 3.6 and 108.1 g·L.sup.-1. No significant growth differences were observed (tested by a two-sided t-test, n.sub.wt=3, n.sub.Δcgt=4).

    (17) FIG. 17. Osmolarity tolerance screening in the RoboLector system of m2p-labs: Final cell dry weights of an osmolarity-screening experiment in maltose minimal medium. The different osmolarities were achieved by addition of inositol in concentrations ranging from 0 mM to 280 mM. No significant growth differences between the wild type and Δcgt were observed (tested by a two-sided t-test, n.sub.wt=3, n.sub.Δcgt=4).

    (18) FIG. 18. Growth and acarbose production of Actinoplanes sp. SE50/110 wild type and Δcgt mutant in the complex medium NBS supplemented with 11.0 g·L.sup.-1 maltose monohydrate or 10.0 g·L.sup.-1 glucose monohydrate. No differential growth was detected. During the growth phase, a significantly increased acarbose concentration was measured in Δcgt (significance of t-test after 49 h of cultivation: p-value=0.006778, n.sub.wt-acb=3, n.sub.Δcgt-acb=3, n.sub.wt-cdwGlc=4, n.sub.Δcgt-cdwGlc=3, n.sub.wt-cdwMal=4, n.sub.Δcgt-cdwMal=4).

    (19) FIG. 19. A. Final yield coefficient of acarbose with reference to the cell dry weight in a bar chart. Error bars were calculated by Gaussian error propagation. B. Cell dry weights and acarbose concentration in the supernatant during cultivation in maltose minimal medium (n.sub.cdw=5, n.sub.acb=4).
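    As an illustration of the error bars mentioned for the yield coefficient, the following sketch applies first-order Gaussian error propagation to a quotient. It assumes that the yield coefficient is the acarbose concentration divided by the cell dry weight and that both errors are uncorrelated; the specification does not state the exact formula used, and the function name and numbers are hypothetical.

```python
import math


def yield_coefficient(acarbose, acarbose_sd, cdw, cdw_sd):
    """Return (Y, sd_Y) for Y = acarbose / cdw.

    First-order Gaussian error propagation for a quotient of two
    uncorrelated quantities (an assumption of this sketch):
        sd_Y = Y * sqrt((sd_acarbose / acarbose)^2 + (sd_cdw / cdw)^2)
    """
    if acarbose == 0 or cdw == 0:
        raise ValueError("acarbose and cdw must be non-zero")
    y = acarbose / cdw
    relative_error = math.sqrt((acarbose_sd / acarbose) ** 2 + (cdw_sd / cdw) ** 2)
    return y, y * relative_error


# Hypothetical values: 0.9 +/- 0.09 g/L acarbose, 3.0 +/- 0.3 g/L cell dry weight
example_yield, example_yield_sd = yield_coefficient(0.9, 0.09, 3.0, 0.3)
```

    With two relative errors of 10% each, the propagated relative error of the quotient is about 14%, which is why the error bar of a yield coefficient is typically wider than that of either measurement alone.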

    (20) FIG. 20. Relative transcript amounts of the genes acbZ, acbW, acbV, acbA, acbB, acbE and acbD of the Δcgt mutant compared to the wild type of Actinoplanes sp. SE50/110 grown on maltose minimal medium (n=3-6).

    (21) FIG. 21. Reconstruction of the carotenogenesis in Actinoplanes sp. SE50/110. Shown are putative homologous genes in Actinoplanes sp. SE50/110 identified by BLASTX analysis against the NCBI database. Reconstruction was performed with the help of the Kyoto Encyclopedia of Genes and Genomes (Kanehisa et al. (2014)). A. Methylerythritol phosphate (MEP) pathway for the biosynthesis of the isoprenoid precursors isopentenyl pyrophosphate (IPP) and dimethylallyl pyrophosphate (DMAPP), which is also known as the alternative to the mevalonate pathway. B-C: Carotenogenesis. B. Formation of lycopene from isoprenoid precursors. C. Synthesis of the glycosylated carotenoid sioxanthin in Salinispora tropica CNB-440 (FIG. 1 of Richter et al. (2015)). D. Genomic organization of the identified genes in Actinoplanes sp. SE50/110. Gene cluster 2b displays homologies to the sioxanthin gene cluster from Salinispora tropica CNB-440 according to analysis by antiSMASH, a rapid genome-wide identification tool for the annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genomes (Weber et al., 2015).

    (22) FIG. 22. Growth, acarbose and pigment formation of Actinoplanes sp. SE50/110 exposed to and shielded from light. A. Cultivation of the wild type Actinoplanes sp. SE50/110 in maltose minimal medium exposed to or shielded from bulb light (22-44 μE, 1 μE=μmol.sub.photons m.sup.-2 s.sup.-1). Shown are the cell dry weights of five biological replicates and the acarbose concentration in the supernatant of three biological replicates. B. Pellets and supernatants at final cultivation time. C. Growth and pigment formation in solid culture on SFM agar plates exposed to or hidden from natural light.

    (23) FIG. 23. Position of a gene encoding a MerR-regulator in terpene cluster 1 and its disposition in the genome of Actinoplanes sp. SE50/110 (cf. FIG. 21 and Table E12). The genes of the cluster encode a MerR-like transcriptional regulator (ACSP50_0145), an isopentenyl-diphosphate delta-isomerase (idi, ACSP50_0146), a phytoene dehydrogenase (crtI, ACSP50_0147), a polyprenyl synthetase (crtE, ACSP50_0148), a phytoene synthase (crtB, ACSP50_0149), a deoxyribodipyrimidine photo-lyase (ACSP50_0150) and a pyridine nucleotide-disulfide oxidoreductase (ACSP50_0151).

    (24) FIG. 24. Growth, acarbose and pigment formation of Actinoplanes sp. SE50/110 and the deletion mutant ΔmerR exposed to and shielded from light. A. Cultivation of the wild type and the deletion mutant ΔmerR of Actinoplanes sp. SE50/110 in maltose minimal medium exposed to or shielded from bulb light (22-44 μE, 1 μE=μmol.sub.photons m.sup.-2 s.sup.-1). Shown are the cell dry weights of at least four biological replicates and the acarbose concentration in the supernatant of three biological replicates. B. Pellets and supernatants at final cultivation time. C. Growth and pigment formation on solid media (SFM agar plates). D. Maximal acarbose concentrations (p-values of a two-sided t-test: wt dark vs. wt light: 0.003975, wt dark vs. ΔmerR dark: 0.09711, wt dark vs. ΔmerR light: 0.007043, ΔmerR dark vs. wt light: 0.02081, ΔmerR dark vs. ΔmerR light: 0.0002131). E. Relative transcript amounts of the genes crtE (ACSP50_0148), crtB (ACSP50_0149), crtI (ACSP50_0147), idi (ACSP50_0146) and merR (ACSP50_0145) in the deletion mutant compared to the wild type (set to a value of 1) when cultivated under dark conditions (p-values of a two-sided t-test: crtE: 0.04245, crtB: 0.01017, crtI: 0.07162, idi: 0.004366). Asterisks indicate the significance level: * p-value<α=5%, ** p-value<α=1%, *** p-value<α=0.1%.

    (25) FIG. 25. A ratio/intensity plot of differentially transcribed genes in Actinoplanes sp. SE50/110 exposed to light compared to a cultivation grown in the dark. The ratio (log.sub.2(fold-change)) is plotted against the mean average intensity of a microarray experiment. Darker dots represent genes with significant differential transcription levels in the culture exposed to light compared to a culture hidden from light.
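    The two axes of such a ratio/intensity plot can be sketched as follows. The ratio is the log.sub.2 fold-change named in the caption; for the intensity axis, the common microarray convention (mean of the two log.sub.2 signal intensities) is assumed here, since the caption does not define it. The function name and signal values are hypothetical.

```python
import math


def ratio_intensity(signal_light, signal_dark):
    """Return (ratio, intensity) for one gene.

    ratio     = log2(signal_light / signal_dark), the log2 fold-change
    intensity = mean of the two log2 signals (assumed convention)
    """
    if signal_light <= 0 or signal_dark <= 0:
        raise ValueError("signals must be positive")
    ratio = math.log2(signal_light / signal_dark)
    intensity = 0.5 * (math.log2(signal_light) + math.log2(signal_dark))
    return ratio, intensity


# Hypothetical spot intensities: 800 (light) versus 200 (dark)
example_ratio, example_intensity = ratio_intensity(800.0, 200.0)
```

    A gene with a fourfold higher signal in the light-exposed culture therefore has a ratio of 2, and a gene with equal signals in both conditions lies on the horizontal axis at a ratio of 0.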

    (26) FIG. 26. ReadXplorer (Hilker et al. 2016; Hilker et al. 2014) view showing the TSS of putative antisense promoters behind the gene of interest in the pSET152 vector system. TSS were determined by sequencing of a pooled primary transcript library. Shown, as an example, are the stacked reads mapped to the integrated vector mutant pGUS::Papm:gusA. Two TSS (surrounded by boxes) are localized behind the gene of interest in antisense orientation (A). These TSS can be assigned to sequence motifs (B) on the vector backbone, which are putatively recognized as promoter sequences by the σA/RNA polymerase complex. Conserved nucleotides of the -10 and -35 hexamers are highlighted. TG dimers are shown in bold black letters, if present. The distance between the hexamers is denoted s1; the distance between the -10 motif and the TSS is denoted s2.

    (27) FIG. 27. Growth and acarbose production of acbB overexpression strains in maltose minimal medium. Shown are two independent cultivations (A and B). The sampling times for RNA-isolation are indicated by t.sub.1 (early growth phase) and t.sub.2 (linear growth phase).

    (28) FIG. 28. Yield coefficient of acbB overexpression mutants in maltose minimal medium. The mutant with acbB transcribed under control of the heterologous tipA promoter displayed an enhanced yield coefficient (approx. 50%), whereas only minor differences were observed for the construct with the gapDH promoter. Errors were calculated by Gaussian error propagation. All differences were tested for significance by a two-sided t-test (abbreviations assigned in the picture). Asterisks indicate the significance level: * p-value<α=5%, ** p-value<α=1%, *** p-value<α=0.1%.

    (29) FIG. 29. Analysis of intracellular metabolites of acbB-overexpression mutants by LC-MS. Shown are the normalized peak areas of the masses m/z=545 [MH.sup.+] (A), glucose-1P and galactose-1P (m/z=259 [MH.sup.+], B), glucose-6P (m/z=259 [MH.sup.+], C) and UDP-glucose (m/z=565 [MH.sup.+], D). Significant differences compared to the empty vector control were observed for the normalized peak areas of UDP-glucose (p-values of a two-sided t-test: Ptip: 0.01068, Pgap: 0.001356) and of the mass m/z=545 [MH.sup.+] (p-value of a two-sided t-test: Ptip: 0.0412).

    (30) FIG. 30. Relative transcript amounts of the genes acbB, acbA and acbV in acbB-overexpression mutants in the initial growth phase. Shown are the means and standard deviations of at least three biological replicates. The differences to the empty vector control (set to a value of 1) were tested by a two-sided t-test (p-values from left to right corresponding to pSETT4gap: acbB, pSETT4tip: acbB, pSETT4: P.sub.acbB: acbB, pSET152: P.sub.acbB: acbB; acbB: 4.332e-05, 4.561e-06, 0.3511, 0.7082; acbA: 0.3384, 0.0001164, 0.5967, 0.4246; acbV: 0.3033, 0.0423, 0.73, 0.4687). Asterisks indicate the significance level: * p-value<α=5%, ** p-value<α=1%, *** p-value<α=0.1%.

    (31) FIG. 31. Relative transcript amounts of the gene acbB in acbB-overexpression mutants in the linear growth phase. Shown are the means and standard deviations of at least three biological replicates. The RT-qPCR indicates significant differences in gene expression compared to the empty vector control (set to a value of 1), which was tested by a two-sided t-test (p-values from left to right corresponding to pSETT4gap: acbB, pSETT4tip: acbB, pSETT4: P.sub.acbB: acbB, pSET152: P.sub.acbB: acbB; acbB: 0.02217, 0.02771, 0.03895, 0.1582). Asterisks indicate the significance level: * p-value<α=5%, ** p-value<α=1%, *** p-value<α=0.1%.

    (32) FIG. 32. Growth and acarbose production of a gtaB overexpression mutant in maltose minimal medium. The sampling time for RNA-isolation is indicated by an arrow.

    (33) FIG. 33. Relative transcript amount of gtaB in an overexpression mutant. The RT-qPCR indicates a significant increase of gtaB expression compared to the empty vector control (set to a value of 1) (p-value of a two-sided t-test: 0.01295). Asterisks indicate the significance level: * p-value<α=5%, ** p-value<α=1%.

    (34) FIG. 34. Analysis of the intracellular metabolites of a gtaB-overexpression mutant by LC-MS. Shown are the peak areas of the masses m/z=545 [MH.sup.+](A), glucose-1P and galactose-1P (m/z=259 [MH.sup.+], B), glucose-6P (m/z=259 [MH.sup.+], C) and UDP-glucose (m/z=565 [MH.sup.+], D) in an overexpression strain of the gene gtaB. Significant differences compared to the empty vector control were observed for the normalized peak areas of the mass m/z=545 [MH.sup.+](p-value of a two-sided t-test: 0.01531). All other peak areas are not significantly different according to a two-sided t-test.

    BRIEF DESCRIPTION OF THE SEQUENCE IDS

    (35) The Sequence Listing associated with this application is filed in electronic format and hereby incorporated by reference into the specification in its entirety.

    (36) TABLE-US-00002 SEQ ID No. Type Name Sequence 1 DNA >acbA GTGCGCGGAATATTGCTGGCCGGGGGAACCGGCTCACGGCTTCGACC (ACSP50_ GGTGACCTGGGCGGTTTCCAAACAACTGATGCCGGTCTATGACAAACC 3609) GATGATCTACTATCCGCTGGCCACGCTCGTCAGCTGCGGGATCCGGG AGATCCTGGTCATCACGACCGAGACCGAGGCCGCCCAGTTCCAGCGG TTGCTGGGTGACGGCTCGCAGTGGGGCCTGCGTCTGGAGTTCGCCGT GCAGCAGCGCCCCGGGGGCATCGCCGAGGCCTTCCTCATCGGCGAG GAGTTCCTGGCCGGTGGGCCGGTGGCGCTCATGCTCGGCGACAACCT GCTGCACGGGGTGGACTTCCGCCCCTGCGTGCAGCGGGCACGCGAG ACGGCCGGTGGGCACGTCTTCGGAGTGGCGGTGGCCGACCCGTCGG CCTACGGGGTGGTCGAGTTCGACGCCGCCGGGCGGGTGCTGTCCATC GAGGAGAAACCGGTCCGTCCCCGCTCGCCGTACGCGGTTCCCGGCTT CTACCTCTACGACGCCGATGTGGTCGAGACGGCCCGGTCGCTGCGGC CCAGCGCCCGCGGGGAGCTGGAGATCACCGAGGTCAACCAGGCCTA CCTGCGGCGCGGCGCACTCTCGGTGACGCTGCTGGGTCGGGGCGCG GTCTGGCTCGACACCGGCACCCTGGCCGACTGCATGCGCGCGGTCGA CTACGTGCGCGCCATCGACGAGGGCCAGGGCATCAAGATCGGCTGTG TGGAGGAGGCGGCCTGGCGGGCCGGTTTCCTCGACACCGCGCAGCT GCGTGCCCTCGCCGAGCCGTTGATGAGCAGCGGCTACGGACAGTACC TGCTGGCTCTGACCGGCGACGGGCTCAGCCGTACCCCGCAGTGGCC GGCCTTGACCGCCGCCGCCGGGTGA 2 DNA >acbB ATGAAAATCTTGGTCACCGGCGGAGCCGGCTTTATCGGGTCCCATTTT (ACSP50_ GTAACTTCCCTGATCAGTGGCGACATTGCCACACCACAACCCGTGACG 3608) CAGGTTACGGTCGTCGACAAACTGGGTTACGGAGGCAATCTCAGAAAT CTCGCCGAAGCGTCGGCGGACCCTCGTTTCAGCTTCGTTCGGGGCGA CATCTGTGACGAAGGTCTAATCGAGGGGCTGATGGCGCGGCACGACA CCGTGGCGCACTTCGCCGCCGAGACCCACGTCGACCGCTCGGTGGTC GCCTCCGGCCCCTTCGTGGCCAGCAACCTGGTCGGCACTCAGGTGCT ACTGGACGCCGCGCTACGCCACCATATCGGCCGCTTCCTGCATGTTTC CACCGACGAGGTGTACGGGTCGATCGACACCGGCTCGTGGGCCGAG GGCCATCCGCTGGCGCCCAACTCGCCGTACGCCGCGAGCAAAGCCG GGTCCGACCTCCTCGCTCTGGCCTACCACCAGACGCACGGGATGGAC GTCGTGGTGACCCGCTGCTCGAACAACTACGGGCCCCGGCAATTCCC GGAGAAAATGATTCCGCTGTTCGTCACCAGGCTGCTCGACGGGCTCG ACGTACCGGTCTACGGCGACGGCCGCAACATCCGCGACTGGCTCCAC GTCAGCGACCATTGCCGCGGTCTCGCCCTGGCCCTGGGTGCCGGCC GGGCAGGCGAGGTCTATCACATCGGCGGTGGGTGGGAGGCGACGAA TCTCGAATTGACCGAGATCCTCCTCGAGGCGTGCGGCGCCCCGGCTT CGCGCATATCTTTCGTGACCGATCGCAAAGGTCACGACCGGCGCTATT CTCTCGACTATTCGAAAATCGCCGGGGAACTCGGTTACCGGCCGCGG 
GTCGATTTCACCGACGGCATCGCGGAAACGGTCGCGTGGTATCGCGC CAACCGTTCCTGGTGGACCTGA 3 DNA >acbC GTGAGTGGTGTCGAGACGGTAGGGGTGCACGCGGATGCGCACCGCG (ACSP50_ ACTCGTGGCAGGTGCGGGCCCAGAAGCAGATCACCTACGAGGTGCGC 3607) TTCCGGGACGACGTGTTCGGGCTGGACTCCACCGACCTGCTGGAGGC CGGGGCGGACGGGGCCGGTTCACGGCGGCGGTTCGTGGTGGTGGAC AGCGCCGTCGACGCCTTGTACGGGTCCCGGATCCGGGAGTACTTCAC CCATCACGGCATCGATCATTCGATCCTGGTGATGCGGGTGGGCGAGA CGGTCAAGGACTTCGACACGGCGGGCCGCATCGTCGCCGCGATGGAC GCCTTCGGACTGGCCCGCCGCCGGGAGCCGATGATCGTCGTCGGTG GTGGGGTGCTGATGGACGTGGCCGGTCTGGTGGCCAGCCTCTACCGG CGCGGCACGCCGTTCCTGCGGGTGCCGACGACACTGGTCGGACTGAT CGACGCGGGTGTCGGCGCGAAGACCGGGGTCAACTTCAACGGCCACA AGAACCGGCTGGGTACGTACGCCCCGGCTGATCTGACCCTGCTGGAC CGCCGCTTCCTGGCCACCCTGGACCGGCGCCACCTCAGCAACGGGCT CGCCGAGATGCTCAAGATCGCGCTGATCAAGGATGCCGAGCTGTTCC AGCTGCTGGAGCGGCACGGGCGGGTCCTGATCGAGGAACGGTTCCA GGGCCGTACCGGAACCGGTGACCGGGCCGCCGTCCGGGCCCTGCGC GCGGCCACCCATGGCATGCTGGAGGAACTCGGCCCCAATCTGTGGGA GAGCCGGCTGGAACGCAGTGTCGACTACGGGCACACGTTCAGCCCGA CCATCGAGATGCGCGCGCTGCCGGCTCTGCTGCACGGCGAGGCCGT GTGTGTGGACATGGCGCTGACCACGGTGCTGGCGTACCGGCGGGGT CTGCTCGACGTCGCGCAGCGGGACCGGATCTTCGCGGTGATGACCGC CCTGGGCCTGCCGACCTGGCATCCGCTGCTCACGCCGGAGGTGCTGG AGGCGGCGTTGCAGGACACCGTCCGGCACCGGGACGGGTGGCAGCG GCTGCCACTGCCGGTGGGGATCGGGGGTGTCACGTTCGTCAACGACG TGACGGCCGCCGAGCTGCAGGCCGCCGCGCTGATGCAGCACCGGCT CGCCGAGGACGCCCTGCTGCTGCGCGCCTAG 4 DNA >acbS ATGCACATCATCGAGACGTACTTCGAATGCGGCGGCTTCGACCACCGG (ACSP50_ TTCATCCAGGGCGGCACCTCGGTCTATCTCTGGCAGCTGTCGCGTGG 3596) CCTGGCCGACCTGGGACACCGGGTCTCCATCGTCACACCGGCGCACG GCCGCCTGGACGATCTGCGCCGGCTGCACGAGGTCGAGGACCTGCC CGGCACCGACGAGTACGAACTGCCGCTGGTGCTCGACCCGCGCGTGT GGGGCGAACGGTTCCCGGCCCAGATGGACATCGCCCTGCGGACCAC CGCGCATCGGATCCGGCTGGCGGGCGTGGACCTGTACTTCCTCTCCA ACGAACTGCTCGATCAGTTGCCGGACCGGTTCTATCCCCCGTACGAGA GCAAGGGGGTTGATCTGGTCTTCTTCAAGCCGCTCGCCTATCAGGTGG CGGCCATCCGGTTCATCAGGTCGCACTTCGGTGACCAGCGCGCGATC GTGCACGCACACGAGCCGTTCTACCACTACCTGATGCCGGCCGCCTT CGCCGCGGACCCGGCCAAACACGTGGTCAGCACGGTGCAGAGCAACA TGCCGATCAACAAGTCGGTGTACCGGGCCGAGGTGGCGCGGCTGCTC 
GGCTTCCTCGGCGCCCCGAACGCGCTGCCCGCCGACGATCCGGCCG GCAGCCGTTCGCCGCACACCGTGGCGATGAGCCAGTACCAGCAGCTG ACCCACCTGCACTACGAATACCCGCCGGACCACGTGCGGGTCTACGA CCTGGTGGCCGAGCACGCCGACCGGATCGACTTCCTGTCGCCGGGG CACCGCGACTACTACACCTGCTTCGCCGACACCCCGTTCGCGCAGCT GTTCGCCACCCTGCCGGTGTCGCGGACGGTACGGCGCAACGCGGAC AAGACGTTCGTCGGCGGCTGCGCCGTCGGTGACGAGTGGGTGACCG GCGAGCTGCCCCCGGTCGACCGGGAGAAGGTGCTGGCCGGGCTCGG CCTGGACCCGGACCTGCCGGCCTTCTACCACAACGCCCGGTACGCGG TCAACCACAAGGGGCAGGTCGAGCTGATCCGGGCCGTCGACCGGGTG CTGAGCGGCGGCGTGCGGGCCAGCTTCATCGTGCGCTGCCTCAGCGA CGCCGGGATCGCCGACCCGCTCTTCCACGAGGTGGTGGCCCGCCAC CCGGGCCGGGTGAATCTGGAGTGGCACCGGGTGCCGGAGGACCAGC TGCGGGAGTACGCCCGAGCCGCGGACTTCTGTCTCTTCCCGTCCAAG TTCGAGATGGACACCTTCCTGATCGCCCAGGGTGAGGCGATGGCTGC CGGTGCGGTACCGATCGCCACCGCCCAGCTGGGGATGGCGCACTTCG GTCACGTCGCCGACCCGCTGACCGGGCCGGACGCGGCGACGGCCAC CGGATTCGCCGTCAACCGCTCGTTCGCCGAGGACGATCCGCTGCTGG TCCAGGGCCTGACCGAGCAGATCCGCCGGGCCGTCACGCTCTGGAAC GAGCAGCCCGGCCAGTACCGCCGGTTGTCCGCCAACGCCGTCGCCC GGGCCCGCGAGTTCACCTGGCGGCGGGCGGCCCAGGCGCACGAGGC CGCGTTCGCCGGGGTGTGGGCCGGCCGTACCCCCCGCCTGCCGGTC GGTGACCTGCTGCGGTTCGGCTGGTTCGACGAGCTGCCCGCGGACGC CTGGACGCTGCACCGCGACGAGATCGCGGAGGTGGCCCTGGCCCAC GGCGACGCCGACGCCTACCTGCGCTGCCGGCCCGACGACCTCGACG CCCTGGCGGCACTCTTCGAGCGGGCCTGGGCCCGGGCCGACTTCCC GGCCTGCGCGCGGACCGTAGAGCTGGCCGAGGAGCACCGGCAGGAG CGGGTGCCGCAGTGGCGGGCCCGGCTCGCCGGCCGCGGCCGCATC GACCGCGACGGTCGGCTGCACTACCGTCCGCCGTCCGCCGAACGGG TCGAACTGGTCTTGCCCGACCTGGCCGAACCCCTGCGCGGAACGGTC ACCGTGACCGCGATGGCTCCGACCGGCGACACCTTCACCGGACAGCT GCCGGCCGGAACCCGGCGTGCCGACCTGCTGCTCACCCTCAGTGACG GGCGCACCGTCTGGGACGAGGTGACGGCATGA 5 DNA >acbW ATGCCCGGGTACGCCCGGCATGCCCGGCCGGACGGCACGACCGGCA (ACSP50_ TGATCGTCGCCGAGCACCTCAGCAAGCACTTCAAGCGCTACCGGCGC 3593) GAGCCGGGTCTGCGGGGCAGCCTGCGAACCATGTTCTCGGCCCGGTA CGACGTGGTCCGGGCCGTCGACGACATCAGCTTCGAGGTCCCGTCCG GTGTCAAGATCGCCTACATCGGGGCGAACGGCGCGGGCAAGTCCACC ACGATCAAACTCCTGACCGGCATCATGCGCCCGACCACCGGGCGGGT CCGGGTCGACGGCCTCGACCCGCACCGGCAGCGCACCCGGGTCGCC GGCCGGATCGGCGTGGTCTTCGGCCAGCGCAGCCAGCTCTGGTGGG 
ATCTGCCGGTCCTCGACTCGTTCCGCATCCTGCGGCACGTCTACGAG GTGCCGCAGGCGGTGTACGACCGGAACATGCGCCTGTTCCGGGACCG GCTGGACCTCGGCGCCCTCGGCAACACCCCGGTCCGCCAGCTGAGC CTGGGCCAGCGCATGCGGGCCGAGATCGCCGCCTCGCTGCTGCACG ACCCGGCCGTGGTCTTCCTCGACGAACCCACCATCGGCCTGGACCTG GTCCTCAAGCAGGCGGTCCGGGACCTGATCAACCACATCCACGCCGA ACTGGGCACCACGGTCATGCTGACCAGCCACGACATCGGCGACATCA CCAGCATCTGCGATCAGGCGCTGGTCGTGGACCGCGGGACGATCGTC CACCAGGGAACGATGCGGGACCTGCTGCGGTCGGTGGACACCCGGG CGGTCACCTTCGAGTACGCCGCCGGCAGCGTCTCCGAGGCCGCCGC GCTGCGCATCATCACCGAAGGACTGCCCGAGGTGGACGCCACTCCGG CCGAGTCCGGCCGGATCCGGGTCGAGTTCCCGGTGGACCGCTGGTC GGCCCGGCAGGTGATCGCCTTCCTGCTGGACCGGTTCGACCTGAGCG ACGTGCTGGTGCCGGACGCCGATCTGGAGACACTGCTGCGCCGCATC TACGCCGGGTCGCGCCCGGAGCCGGTCACCGCCGGGGACGGCGCAT GA 6 DNA >acbX ATGATCCGCGCCGCGCGCCGGTACGCGCCGTTCGCCCTCGCCGGACT (ACSP50_ GCACGCCGTCACCCGTTACCGCTCGACCATCGTCCTGAGCGCACTCA 3592) CGGCGGCTGCGGCCACCTCGTTGCAGGTGTTCCTGTGGCGAGCCGTC TACGCCGGCGGACCGGCACCGGCCGGCCTCCCGTTCGCACAGCTCA CCTCGTACATCGTGCTCGCGCAGGTGCTCGGGATGCTGCACACCAAC CGGATCGACGAGATGATCGCCGGCGAGGTGTACCGCGGGGACATCG CGGTCTCCCTGGTACGCCCGGCGAACTACGCGCTCAGCTGTCTGGCG GTGAACCTGCCGACCGCCGCGCTCAGTGCGCTGCTGGCCGGCGCCC CGGTGCTCGCCGGTTTCGCGATGTTCGCGTCGCTGCCCGCTCCCCCG CCCGCCAACCTGCTGCTGTTCGCCGTCGCGCTGCTGCTCTCGGTGAT CCTCGCCTTCGAGATCAACTTCCTGGTGGGTCTCGCCGCCTTCGTCAC GACCAACACCTGGGGCATCCGTACGATCAAGAACGCGCTCGTCGCCT TCCTGGCCGGCCAGGTCGTCCCGCTCGCGCTGTTCCCGGACGGCGTG GCCCGGCTGCTGCGGCTGCTGCCGTTCCAGGGCCTGATCGACAGCCC GTTGCGGCTGCTGCTCGGCGGCTACTCCGGCGGTTCCGGCGCCGCT GCCATCCTCGGTGTCCAGGCGCTCTGGGCGGTACTGCTGTACGGCGT GCTGGCCCTGGCCTGGAACCGGTCGCTGCGCAGGGTGGAGGTGCTC GGCGGATGA 7 DNA >acbY atgaccgtctccacggcgcgccggtacctgcgcctcacggcggtgctgtgcggggcgagcctgcaccg (ACSP50_ gctcaccgcgtaccggatggacttcctcatcggggcggccagcttcgtcatccggatcgcctgccagatc 3591) gccctgatcggggtgatcttccagtacgttccggcgctcggcggctggacccgccagcaggcgctgttcc tgctcgggttctccctgctgccccgcgggctggaccggctcttcaccgaccagctgtggatcctggcctgg cagctggtgcgcaccggcgacttcttccgctacctgatccggccggtgaacccgttctacgcgctgctgtc 
cgaacggttcctctatccggacgggttcggggagctggccaccggcatcgccatcgtggtcaccgcggc cgggacgatggacctgcacctgaccgtggcacagtggctgctgttgctgcccctggtcctcggcggcgc cctgatccacaccttcctcaaggcgttcctggcctccctgtcgttctggatgaccagcagcctcaacgtgat ggtggcggtcaaccagctcagcgagttcaccgcgtacccgctcaacctctaccacccggtgctgcgcgg ggtgctcacctgggtgctgccgttcgcgttcaccgcctacctaccggtgcgctacctgctcaccggggacg ccgggccgctgctgtggatgctgccggtcaccacgctcaccgtcctgctggggtacggcaccttccggct cgggctgcggcgctacgagatgcccggcagctga 8 DNA >gtaB ATGACGACGAACGCGCAAGGGTCGGGCAAGCGCGCGGTGAAAGCAG galU TGATTCCGGCGGCCGGCCTAGCCACGCGTTTCCTGCCTGCCACCAAA (ACSP50_ GCCGTTCCGAAAGAGCTGCTGCCGGTCGTCGACCGGCCGGTCCTGCA 7820) GTACATCGTCGAGGAGGCCGCCGCGGCCGGCATCACCGACGTGCTG CTGGTGACCGGGCGTGGCAAGACCTCGATGGTCGACCACTTCGACCG TCGCCCCGACGTGGAGCAGCGGCTGGAGGAGAAGGGCGACACCGAG CGGCTCGCCGCCGTCCGGCGCACCAGTGAGCTGGCCGACATCTACAC CTGCCGACAGGGGGAGCCGCTCGGCCTCGGCCATGCCGTCGGGACC GCCGCCTCGCACGTCGGGGACAACCCGTTCGCGGTGCTGCTCGGGG ACGAGTTCGTCGAGGAGGGCAGCCCGCTGCTGCCCGACATGCTCGAC CTGCAGGCCCGCACCGGCGGCATCGTGCTCGCCTTCATCGAGGTCAC CCCGGAGGAGACGTCGCGCTACGGGATCGCCTCGGTGCGGGAGTCC GACCTGGGCGAGGGCGTGGTCGAGGTGACCGGCCTGGTGGAGAAGC CGTCGCCGGAGGAGGCGCCGAGCAACCTTGCCGTGGTGGGGCGGTA CGTGCTGCCTGGCAGGATCTTCGAGACGATCGCCGGCACCAAGCCGG GCAGCGGGGGCGAGATCCAGCTGACCGACGCGATGGCGACGCTGCT GGCCGAGGGCACCCCGGTGCACGGCATCGTCTACCGCGGTGTCCGG TACGACACCGGCCAGCCGCTGGGCTACCTGCAGACCGTCGTCCAGCT CGCGGCTCAGCGTCCCGACCTGGGTGCCGAGTTCCGGGCCTGGCTCA CCGACTTCGTCGGTGGTCAGAAGGGATGA 9 DNA >cgt ATGAATCGCACCACCGTTCGGGCCGGCGTGCTGGCCACCGCCCTGAT (ACSP50_ CAGCGGCGTGCTCGGGGTGGCCGGCCCGGCGCTCGCCGCCCCGGTC 5024) ACCGACGCGGCGCCGGTCGCCGCCGCCGGCACCGCCGTCGCGCCGA TCGCCGCGACCTTCAACGTGACCGCCGGGTTCACCAGCTGGGGTCAG AACGTCTACGTCGTCGGCAGCATCCCGGCGCTCGGCTCCTGGGACGT CTCCAAGGCGGTGCCGCTGACCACCACGAGCAGCGCCTTCCCGACCT GGACCGGGAGCGTGGCGCTGCCGGCGAACACGTACACCGAGTTCCA GTACGTGGTGAAGAACGCCGACGGCAGCGTCGCCCGCTGGGAGAAG GGTTTCCAGCAGAACCGCACCACGATCACCCCGCCGACCGGCACCTA CGTCACGCACGACACCTTCGGCGCGTACTGA 10 DNA >crtl ATGATGAAACCCCCCACCCCCTGGAGCCGCGGCGTGCGCACTGTTAC (ACSP50_ 
CGGACCCACCGATCGTGTCGTGATAGTGGGGGCCGGCCTGGCCGGC 0147) CTCTCCTGCGCCTTGCACCTGGCCGCAGCCGGGCGGCAGGTCACCGT CGTCGAGCGGGAGCCGGTGCCGGGCGGCCGCGCCGGGCGCCTCTC GGTCGGCGGATACGACTTCGACACCGGCCCGACCGTGCTGACCATGC CGGAACTGATCGCCGAGCCGCTCGCCGCGGTCGGCGAGAATCTCTCC GACTGGCTGGAGCTGACCCCGCTCGACCCGGCCTACCGGGCGTACTA CCCGGACGGCTCCACGCTGGACGTCCGCACCGACACCACCCGGATG GCGGCCGAGATCGCCCAGGTCTGCGGCGCCCGCGAGGCCGACGGCT ACCTGCGGTTCGTCGACTACACCCGGCGGCTCTGGCAGCTGGAACGG GACCACTTCATCGACCGGAACCTGGACAGTCCGCTCGACCTGCTCAAC CTCAACCTGCTGAAGCTGCTCGGGATGGGCGCTTTCGGTCGCCTGCA GCCGAAGATCAACGAGTTCTTCCGCGATCCGCGGACCCAGCGGATCT TCTCGTTCCAGGCGATGTACGCCGGTCTCGCCCCGCACGACGCGATG GCCATCTACGCGGTGATCGCCTACCTCGACTCGGTCGCCGGGGTGTA CTACCCCAAGGGCGGCATGCACGCCGTCCCCAAGGCGCTGGCCGGC GCCGCCGAGAAGCACGGGGTCACCTTCCGTTACGACACGACGGTCGA GCGGGTGCTCACCCAGCACGGCCGGGCGACCGGGGTGGTGACCGTC GGCGGGGACGTGATCGAGGCGGACACCGTCGTACTCAATCCCGACCT GCCCATCGCGTACCGCGACCTGCTGCCTGCCCGGAACAGCCGCAACC TGCGCTTTTCGCCCTCCTGCGTGGTACTCCACATCGGATCGTCACAGC GGTATTCGAAGATCGCACACCACAACATCCACTTTGGTACGACGTGGC GCCGCACCTTCGACGAAGTGATCAACCGTGGGCTGCTGATGAGCGAC CCGTCACTGCTGGTCACCAATCCCACGCACACCGACCCCTCTGCCGC GCCCGACGGCAAACAGACCTACTACGTGCTGGCGCCCGCCCCGAACC TCGTCTCCGGTCCGATGAACTGGCGCGGCGGCCTCGCCGAACGGTAT GCCGACGAGCTGCTGCGTACCCTGGAGCAGCGCGGCTACATCGGCTT CCGGGACGGGGTCGAGGTCGAACGGATCATCACGCCGGCCGACTGG GCCGACGACGGGATGGCGGCCGGCACGCCGTTCGCCGCCGCGCACA CCTTCGCCCAGACCGGCCCGTTCCGGCCGGCGAACCTGCACCCCACG CTGCCGAACGTGGTCTTCACCGGTTCGGGCACACAACCCGGGGTCGG CGTGCCGATGGTGCTCATCTCCGGGAAGCTGGCCGCGAGCCGGATCA CACAGGGAGCCTCATGA 11 DNA >merR GTGGCCGGTGAGGCGTTGAGCGCCGAGATCCCCACCTCGCCGGGCA (ACSP50_ GCTCGGTCGCCTCCTCGCACGACATCCCGGCCACAGCCGGTCCCGGC 0145) GCCGTCCGGACCGGCCCGGTGGCTGCCGCGCCCGGTGGCCCGAGCG ATACGCCCCTGACCGACGCGACAGCTGCCGCGTCGGGTGCCGCGGA CGACGCCTCCCGGGCCCGCCCGGCGACCGCCACGGACGACGCCTCC CGCACCGGCCCGGCGACCGCCGCGACGGATTCTCCGGACGACGCCG TCCGGACCGGCGTGGCAGATGCCGCGCCGGCCGGGCGGGCGGGCG ATGTGGCGTTGAGTGCCGGGGCGGCCGCGCGGCGGCTGGGAGTGGC GGTCACGACCCTGCGCACCTGGCACCAGCGGTACGGGCTCGGGCCG 
AGCCGGCACGAGCCCGGACATCACCGGCGGTACACCGCCGAGGACA TGGACCGGCTGCAGGTGATGCAGCGGCTCACCACTCAGGGCGTGGC GCCCGCCGAGGCCGCCGCCTGGGCGCGGTCCAGGCCCCTCACCCCA CCGGAGCCCGGCGCGGCGCTGTACGACCCCACCGCCGTGGCGTCGC CACCCACCCCGGCCGCTCCCGGACAGCCCCCGGTCGGCCCCGCCGG CCGGGGCACCCGCCCGACCCGCGGACCGGCCCCGGCCGCTCGCGG GCTGACCCGGGCCGCGATGCGGCTCGACGTGCGCGGCATGCGCGAC ATCCTCTGCAGCACGCTGCACGACCGCGGCGTGATACCCGCCTGGAC CGAGGTGATGGTCCCGGCTCTGGCCGCGATCGGCGACCGGTACGAG GCCACTCGGCGTTTCGTCGAGGTCGAACACCTGCTGTCGCGCGCCGT CACCGAAATCCTCGCCTCGGTCCCACACCCCGCCGGCTCTCCCCGGG TGCTGCTCGCCGCCGCCGACGAGGAACAGCACACACTGCCCCTGGAG GCCCTGGCCGCCGCCCTGGCCGAGGGAGGCGTGCCGAGCCGTCTGT TCGGCGCCCGGGTGCCGTCACAGGCCCTGCTGGACGCCATCGCCCG CACCGGCCCGGCTGCCGTCGTGCTCTGGTCGCAGCGCCCGGCCACC GGCATCGTCACCCAGCTGACCCGGGTCCGCGACATCCCGCACCCGCC GCTGGTCATCGCCGCCGCCGGCCCCGGCTGGCCGCATGACCTGCCTT CCGGGATCACCCGCCTGACCGGCCTCACCGAGGCCGTCCACCTGCTC GCCACGGTCTAG 12 PRT >AcbA MRGILLAGGTGSRLRPVTWAVSKQLMPVYDKPMIYYPLATLVSCGIREILVI (ACSP50_ TTETEAAQFQRLLGDGSQWGLRLEFAVQQRPGGIAEAFLIGEEFLAGGPV 3609) ALMLGDNLLHGVDFRPCVQRARETAGGHVFGVAVADPSAYGVVEFDAAG RVLSIEEKPVRPRSPYAVPGFYLYDADVVETARSLRPSARGELEITEVNQA YLRRGALSVTLLGRGAVWLDTGTLADCMRAVDYVRAIDEGQGIKIGCVEE AAWRAGFLDTAQLRALAEPLMSSGYGQYLLALTGDGLSRTPQWPALTAA AG 13 PRT >AcbB MKILVTGGAGFIGSHFVTSLISGDIATPQPVTQVTVVDKLGYGGNLRNLAEA (ACSP50_ SADPRFSFVRGDICDEGLIEGLMARHDTVAHFAAETHVDRSVVASGPFVA 3608) SNLVGTQVLLDAALRHHIGRFLHVSTDEVYGSIDTGSWAEGHPLAPNSPY AASKAGSDLLALAYHQTHGMDVVVTRCSNNYGPRQFPEKMIPLFVTRLLD GLDVPVYGDGRNIRDWLHVSDHCRGLALALGAGRAGEVYHIGGGWEATN LELTEILLEACGAPASRISFVTDRKGHDRRYSLDYSKIAGELGYRPRVDFTD GIAETVAWYRANRSWWT 14 PRT >AcbC MSGVETVGVHADAHRDSWQVRAQKQITYEVRFRDDVFGLDSTDLLEAGA (ACSP50_ DGAGSRRRFVVVDSAVDALYGSRIREYFTHHGIDHSILVMRVGETVKDFD 3607) TAGRIVAAMDAFGLARRREPMIVVGGGVLMDVAGLVASLYRRGTPFLRVP TTLVGLIDAGVGAKTGVNFNGHKNRLGTYAPADLTLLDRRFLATLDRRHLS NGLAEMLKIALIKDAELFQLLERHGRVLIEERFQGRTGTGDRAAVRALRAA THGMLEELGPNLWESRLERSVDYGHTFSPTIEMRALPALLHGEAVCVDMA LTTVLAYRRGLLDVAQRDRIFAVMTALGLPTWHPLLTPEVLEAALQDTVRH RDGWQRLPLPVGIGGVTFVNDVTAAELQAAALMQHRLAEDALLLRA 15 PRT 
>AcbS MHIIETYFECGGFDHRFIQGGTSVYLWQLSRGLADLGHRVSIVTPAHGRLD (ACSP50_ DLRRLHEVEDLPGTDEYELPLVLDPRVWGERFPAQMDIALRTTAHRIRLAG 3596) VDLYFLSNELLDQLPDRFYPPYESKGVDLVFFKPLAYQVAAIRFIRSHFGD QRAIVHAHEPFYHYLMPAAFAADPAKHVVSTVQSNMPINKSVYRAEVARL LGFLGAPNALPADDPAGSRSPHTVAMSQYQQLTHLHYEYPPDHVRVYDL VAEHADRIDFLSPGHRDYYTCFADTPFAQLFATLPVSRTVRRNADKTFVG GCAVGDEWVTGELPPVDREKVLAGLGLDPDLPAFYHNARYAVNHKGQVE LIRAVDRVLSGGVRASFIVRCLSDAGIADPLFHEVVARHPGRVNLEWHRVP EDQLREYARAADFCLFPSKFEMDTFLIAQGEAMAAGAVPIATAQLGMAHF GHVADPLTGPDAATATGFAVNRSFAEDDPLLVQGLTEQIRRAVTLWNEQP GQYRRLSANAVARAREFTWRRAAQAHEAAFAGVWAGRTPRLPVGDLLR FGWFDELPADAWTLHRDEIAEVALAHGDADAYLRCRPDDLDALAALFERA WARADFPACARTVELAEEHRQERVPQWRARLAGRGRIDRDGRLHYRPP SAERVELVLPDLAEPLRGTVTVTAMAPTGDTFTGQLPAGTRRADLLLTLSD GRTVWDEVTA 16 PRT >AcbW MPGYARHARPDGTTGMIVAEHLSKHFKRYRREPGLRGSLRTMFSARYDV (ACSP50_ VRAVDDISFEVPSGVKIAYIGANGAGKSTTIKLLTGIMRPTTGRVRVDGLDP 3593) HRQRTRVAGRIGVVFGQRSQLWWDLPVLDSFRILRHVYEVPQAVYDRNM RLFRDRLDLGALGNTPVRQLSLGQRMRAEIAASLLHDPAVVFLDEPTIGLD LVLKQAVRDLINHIHAELGTTVMLTSHDIGDITSICDQALVVDRGTIVHQGT MRDLLRSVDTRAVTFEYAAGSVSEAAALRIITEGLPEVDATPAESGRIRVEF PVDRWSARQVIAFLLDRFDLSDVLVPDADLETLLRRIYAGSRPEPVTAGDG A 17 PRT >AcbX MIRAARRYAPFALAGLHAVTRYRSTIVLSALTAAAATSLQVFLWRAVYAGG (ACSP50_ PAPAGLPFAQLTSYIVLAQVLGMLHTNRIDEMIAGEVYRGDIAVSLVRPANY 3592) ALSCLAVNLPTAALSALLAGAPVLAGFAMFASLPAPPPANLLLFAVALLLSVI LAFEINFLVGLAAFVTTNTWGIRTIKNALVAFLAGQWPLALFPDGVARLLRL LPFQGLIDSPLRLLLGGYSGGSGAAAILGVQALWAVLLYGVLALAWNRSLR RVEVLGG 18 PRT >AcbY MTVSTARRYLRLTAVLCGASLHRLTAYRMDFLIGAASFVIRIACQIALIGVIF (ACSP50_ QYVPALGGWTRQQALFLLGFSLLPRGLDRLFTDQLWILAWQLVRTGDFFR 3591) YLIRPVNPFYALLSERFLYPDGFGELATGIAIVVTAAGTMDLHLTVAQWLLL LPLVLGGALIHTFLKAFLASLSFWMTSSLNVMVAVNQLSEFTAYPLNLYHP VLRGVLTWVLPFAFTAYLPVRYLLTGDAGPLLWMLPVTTLTVLLGYGTFRL GLRRYEMPGS 19 PRT >GtaB MTTNAQGSGKRAVKAVIPAAGLATRFLPATKAVPKELLPWDRPVLQYIVE GalU EAAAAGITDVLLVTGRGKTSMVDHFDRRPDVEQRLEEKGDTERLAAVRRT (ACSP50_ SELADIYTCRQGEPLGLGHAVGTAASHVGDNPFAVLLGDEFVEEGSPLLP 7820) DMLDLQARTGGIVLAFIEVTPEETSRYGIASVRESDLGEGWEVTGLVEKP 
SPEEAPSNLAVVGRYVLPGRIFETIAGTKPGSGGEIQLTDAMATLLAEGTP VHGIVYRGVRYDTGQPLGYLQTWQLAAQRPDLGAEFRAWLTDFVGGQK G 20 PRT >Cgt MNRTTVRAGVLATALISGVLGVAGPALAAPVTDAAPVAAAGTAVAPIAATF (ACSP50_ NVTAGFTSWGQNVYVVGSIPALGSWDVSKAVPLTTTSSAFPTWTGSVALP 5024) ANTYTEFQYVVKNADGSVARWEKGFQQNRTTITPPTGTYVTHDTFGAY 21 PRT >CrtI MMKPPTPWSRGVRTVTGPTDRVVIVGAGLAGLSCALHLAAAGRQVTVVE (ACSP50_ REPVPGGRAGRLSVGGYDFDTGPTVLTMPELIAEPLAAVGENLSDWLELT 0147) PLDPAYRAYYPDGSTLDVRTDTTRMAAEIAQVCGAREADGYLRFVDYTRR LWQLERDHFIDRNLDSPLDLLNLNLLKLLGMGAFGRLQPKINEFFRDPRTQ RIFSFQAMYAGLAPHDAMAIYAVIAYLDSVAGVYYPKGGMHAVPKALAGA AEKHGVTFRYDTTVERVLTQHGRATGWTVGGDVIEADTWLNPDLPIAYR DLLPARNSRNLRFSPSCVVLHIGSSQRYSKIAHHNIHFGTTWRRTFDEVIN RGLLMSDPSLLVTNPTHTDPSAAPDGKQTYYVLAPAPNLVSGPMNWRGG LAERYADELLRTLEQRGYIGFRDGVEVERIITPADWADDGMAAGTPFAAA HTFAQTGPFRPANLHPTLPNVVFTGSGTQPGVGVPMVLISGKLAASRITQ GAS 22 PRT >MerR MAGEALSAEIPTSPGSSVASSHDIPATAGPGAVRTGPVAAAPGGPSDTPL (ACSP50_ TDATAAASGAADDASRARPATATDDASRTGPATAATDSPDDAVRTGVAD 0145) AAPAGRAGDVALSAGAAARRLGVAVTTLRTWHQRYGLGPSRHEPGHHR RYTAEDMDRLQVMQRLTTQGVAPAEAAAWARSRPLTPPEPGAALYDPTA VASPPTPAAPGQPPVGPAGRGTRPTRGPAPAARGLTRAAMRLDVRGMR DILCSTLHDRGVIPAWTEVMVPALAAIGDRYEATRRFVEVEHLLSRAVTEIL ASVPHPAGSPRVLLAAADEEQHTLPLEALAAALAEGGVPSRLFGARVPSQ ALLDAIARTGPAAVVLWSQRPATGIVTQLTRVRDIPHPPLVIAAAGPGWPH DLPSGITRLTGLTEAVHLLATV 23 DNA >dxs ATGAGCGACTCCCCTTCGACCCCGGCCGGCCTGCTGGCGAGCGTCAC (ACSP50_ CGGTCCCGGTGCTCTCAAGCGACTGTCCGCGGAGCAGCTGACCCTGC 7096) TCGCGGCCGAGATCCGTGACTTCCTCGTGGCCAAGGTGTCGAAGACC GGGGGGCACCTCGGACCGAACCTGGGCGTGGTCGAGATGACCCTCG CCATGCACCGGGTCTTCGACTCGCCGCGCGACAAGATCCTCTTCGACA CCGGCCACCAGGCGTACGTGCACAAGATCGTCACCGGCCGGCAGGAC GGTTTCGACCTGCTCCGCCAGCGGGGTGGCCTGACCGGCTACCCGAG CCAGGCGGAGAGCGAGCACGACCTCATCGAGAACTCGCACGCCTCCA CCGCGTTGTCCTACGCCGACGGCCTGGCCAAGGCGTTCGCGCTGCGC GGCGAGGACCGGCACGTGGTGGCCGTGGTCGGCGACGGCGCGCTCA CCGGCGGCATGTGCTGGGAGGCGCTCAACAACATCGCCGCCACGAAG AACAGGCTGGTCATCGTCGTCAACGACAACGGTCGGTCGTACGCGCC GACGATCGGCGGCCTGGCCGACCACCTCTCCACGCTGCGGCTCAACC CCGGCTACGAGAAGGTGCTCGACCTGGTCAAGGACGCGCTCGGCTCG 
ACCCCGCTGGTCGGAAAGCCGGTCTTCGAGGTGCTGCACGCGGTCAA GCGCGGGATCAAGGACGCGGTCAGCCCGCAGCCGATGTTCGAGGAC CTCGGCCTGAAGTACATCGGGCCGGTCGACGGTCACGACCAGCAGGC GATGGAGTCCGCGCTGCGCCGGGCCAAGGGGTTCAACGCGCCGGTG ATCGTGCACGCGGTGACCCGCAAGGGCTACGGCTACCGTCCCGCCGA GCAGGACGAGGCGGACTGCCTGCACGGCCCGGGCGCCTTCGACCCG CAGACCGGCGCGCTCACCGCCAAGCCGTCGCTCAAGTGGACCAAGGT CTTCGCCGAGGAGCTGGTGAAGATCGCCGACGAACGCCCCGACGTGG TGGGCATCACGGCCGCCATGGCCGAGCCGACCGGCATCGCCGCTCTC GCCAAGAAGTACCCCGACCGGGCGTACGACGTGGGCATCGCCGAGCA GCACGCCGCGACCAGCGCCGCGGGCCTGGCGATGGGCGGCCTGCAC CCGGTGGTGGCGGTCTACGCCACCTTCCTGAACCGCGCTTTCGACCA GGTGCTGCTGGACGTCGCGATGCATCGGCTGCCGGTGACCTTCGTGC TGGACCGGGCCGGCATCACCGGGCCGGACGGCCCCAGCCACTACGG CATCTGGGACATGAGTGTCTTCGGCGCCGTCCCCGGCCTGCGCATCG CCGCCCCGCGGGACGCCGCCACCCTGCGCGAGGAACTGCGCGAGGC GGTCGCGGTCGACGACGGCCCGACCATCGTGCGGTTCCCGACCGGT GCGGTCGCCGCGGACACCCCGGCGGTGCGCCGGGTCGGTCAGGTCG ACGTGCTGCGCGAGGCGGAGAAGAAGGACATCCTGCTGGTCGCGGTC GGCTCGTTCGTCGGCCTCGGGCTGGACGCCGCCGAGCGGCTCGCCG AGCAGGGGTACGGCGTGACCGTGGTCGACCCGCGCTGGGTGCGCCC GGTGCCGATCGAGCTGACCGGCCTGGCCGCCCAGCACCGCCTGGTG GTGACCCTGGAGGACGGGATCCGCGCCGGTGGTGTCGGTGACGCGG TGGCCGCCGCGCTGCGCGACGCCGGGGTGCACGTGCCGCTGCGCGA TTTCGGCGTGCCGGCCGGTTTCCACCCGCACGGCACCCGGGCCGAGA TCCTCGCCTCGCTGGGTCTGACCGCGCAGGACGTCGCGCGGGACGT GACCGGCTGGGTGTCCGGCCTGGACGCCGGCACGTCGGTGGCGGCC CCGGCGATCTGA 24 DNA >ispG GTGACCGCGATCAGTCTCGGAATGCCGGCCGTCCCCCCGCCGCCGCT (ACSP50_ GGCCCCGCGCCGCCAGAGCCGGCAGATCAACGTCGGAGGAGTCCTG 7248) GTCGGCGGGGGCGCCCCGGTCAGCGTCCAGTCGATGACCACCACCC TCACCTCCGACGTCAACGCGACCCTGCAGCAGATCGCCGAGCTGACC GCGGCCGGCTGCCAGATCGTCCGGGTCGCCGTGCCGTCCCAGGACG ACGTCGAGGCGCTGCCGGCGATCGCCAAGAAGTCGCAGATCCCGGTG ATCGCCGACATCCACTTCCAGCCCAAGTACGTGTTCGCCGCGATCGAC GCGGGCTGCGCGGCGGTCCGGGTCAATCCGGGCAACATCCGCCAGT TCGACGACAAGGTCAAGGAGATCGCCCGGGCCGCGTCCGACGCCGG CGTGCCGATCCGGATCGGGGTCAACGCCGGCTCGCTCGACAAGCGG CTTCTCGAGAAATACGGCAAGGCCACCGCCGAGGCGCTGGTGGAGTC GGCGCTCTGGGAGTGCTCGCTGTTCGAGGAGCACGGTTTCCGGGACA TCAAGATCTCGGTCAAACACAACGATCCGGTCGTGATGATCCGCGCCT ACCGTCAGCTCGCCGAGCAGTGCGACTACCCGCTGCACCTGGGCGTG 
ACCGAGGCCGGGCCGGCCTTCCAGGGCACGATCAAGTCGGCGGTGG CGTTCGGCGCGCTGCTCGCCGAGGGGATCGGCGACACCATCCGGGT CTCGCTGTCCGCGCCGCCGGTCGAGGAGATCAAGGTCGGGCAGCAG ATCCTGGAGTCGCTCGGCCTGCGCGAACGCGGCCTGGAGATCGTCTC CTGCCCGTCCTGCGGGCGGGCCCAGGTCGACGTCTACACGCTGGCC GAGCAGGTGACCGCGGCGCTCGACGGGTTCCCGGTGCCGCTGCGAG TGGCCGTGATGGGCTGCGTCGTGAACGGGCCCGGGGAGGCTCGCGA GGCCGACCTCGGGGTCGCCTCCGGCAACGGCAAGGGGCAGATCTTC GTCAAGGGCAAGGTGATCAAGACGGTGCCGGAGGCGGTGATCGTCGA GACGCTGGTCGAGGAGGCGCTGCGGCTCGCCGACGAGATGGGCGCG GAGCTGCCCGACGAGCTGCGCGAGCTGCTGCCCGGTCCCACGGTCA CCGTGCACTAG 25 DNA >dxr ATGCGTGAGCTTGTGCTGCTGGGGTCGACCGGGTCCATCGGCACCCA (ACSP50_ GGCCATCGATATCGTCCGGCGCAACCCGGAGCTGTTCCGGGTGGTCG 7250) CGATCGGGGCCGGGGGTGGCAACGTCGCGTTGCTCGCGGCGCAGGC GCTGGAGCTGGGCGTCGAGGTGGTCGGGGTGGCCCGGGCCTCGGTC GTGCAGGATCTGCAGCTGGCCTTCTACGCCGAGGCGCAGAAGCGTGG CTGGTCGTCCGGCGACTTCAAACTGCCGAAGATCGTGGCCGGGCCGG ACGCGATGACCGAGCTGGCCCGCTGGCCGTGTGACGTCGTTCTCAAC GGGGTGGTCGGCAGCCTCGGCCTGGCGCCGACCCTGGCCGCTCTGG AGTCCGGGCGGATCCTTGCGCTGGCCAACAAGGAGTCGCTGGTCGCC GGCGGCCCGCTGGTCCGGCGGATCGCCAAGGACGGGCAGATCGTCC CGGTCGACTCGGAGCATTCGGCGCTGGCCCAGTGCCTGCGCGGCGG GCGGGCCGCGGAGGTGCGCCGGCTGGTGCTGACCGCCAGCGGGGG AGCCTTCCGCGGGCGGCGGCGCGCGGAGCTGACGAACGTCACCCCC GAGGAGGCGCTCAAGCACCCGACCTGGGACATGGGGCCGGTCGTCA CGATCAACTCGGCGACCATGGTGAACAAGGCGCTGGAAGTGATCGAG GCGCACGAGCTGTTCGGCGTGCCGTACGACGACATCGCGGTGATGGT GCACCCGCAGTCGGTGCTGCATTCGCTGGTCGAGTTCACCGACGGCT CGACGCTGGCCCAGGCCAGCCCGCCGGACATGCGGCTGCCGATCGC GCTGGCGCTGGCCTGGCCGGACCGGGTGCCGGGGGCGGCCGCCGC GGTGGACTGGACGCTGGCGCACAACTGGGAGCTGCGACCGCTGGAC GACGAGGCGTTCCCGGCGGTCGAGCTGGCCAAGGCGGCCGGCCGGT ACGGTCGCTGCCGTCCGGCGATCTTCAACGCCGCCAACGAGGAGTGT GTGGCCGCTTTCGCCGCCGGTCGGCTACCTTTCTTGGGCATCGTCGA CACCCTGGAACGGGTGCTCGCGGCGGCCCCGGATTTCGCGGAGCCG AGTACCGTCGATGACGTGCTGGCCGCAGAATCCTGGGCGCGTGCCCA GGCACAGCGGACGATCGCGACTGTGGCTGAAGGAGCCTGA 26 DNA >ispH GTGTTGCTCGCCAAGCCGCGTGGTTACTGCGCCGGTGTCGACCGCGC (ACSP50_ CGTGCAGACCGTCGAGGAGGCGCTGAAACTCTACGGCGCCCCGGTCT 7707) ACGTGCGTAAGCAGATCGTGCACAACAAGCACGTGGTCAGCACGCTG 
GAGGCCCGCGGCGCGATCTTCGTCGAGGAGAACTACGAGGTGCCCGA GGGCGCCACCGTGGTGTTCTCCGCGCACGGCGTCGCCCCCGAGGTG CACGACCAGGCCCGCGAGCGCCGGCTCAAGGCGATCGACGCGACCT GCCCGCTGGTCACCAAGGTGCACCACGAGGCGAAACGGTTCGCCGCC GAGGACTACGACATCCTGCTGATCGGTCACGAGGGGCACGAGGAGGT CATCGGCACCTCCGGCGAGGCCCCGGCGCACATCCAGCTCGTCGACG GCCCCGACGACGTGGCGAACGTCGTCGTCCGCGACCCGGCCAAGGT CGTCTGGCTGTCGCAGACCACGCTGTCGGTGGACGAGACGATGGAGA CGGTGGCCCGGCTCAAGACCCGGCTGCCGCTGCTGCAGTCGCCGCC CAGCGACGACATCTGCTACGCCACCTCGAACCGGCAGCACGTGATCA AGGAGATCGCGCCGGAGTGCGACGTGGTGATCGTGGTCGGCTCGACC AACTCGTCGAACTCGGTCCGCCTGGTCGAGGTCGCCCTCGGTGCCGG CGCCCGGGCCGGTCACCTCGTCGACTACGCCGCCGAGATCCAGGAC GAGTGGCTGGCCGGCGCCACCACGGTCGGTGTCTCCTCCGGCGCCA GCGTGCCGGACGAGCTGGTGATGGAGGTGCTGGCGCACCTCGCGGA GCGTGGCTTCGGCGAGGTCACCGAGTTCACCACGGCCGAGGAGCGG CTCACCTTCTCCCTCCCGCAGGAGCTCCGCAAGGACATGAAGGCCGC CGAGGCGGCCCGGGCCGCTGCCGCCGGCTGA 27 DNA >ispE ATGACCGAGGCGTGGGGTCCGGACGACGACGAGCCGCGCCCGTACA (ACSP50_ GCGGCCCGGTCAAGGTCCGCGTGCCGGCCAAAATCAACCTGCACCTC 7802) GCGGTCGGCCCGCTGCGACCCGACGGCTACCACGAGCTGAACACCGT CTACCACGCCATCTCGCTGTTCGACGAGATCACCGCCCGGCACGGCG ACACCCTCACCCTCACCATGGAGGGCGAGGGCACCGGCGACCTCGCC CTCGACGAGACCAACCTGATCATCCGCGCCGCCCGCGCCCTGGCCGC CCGCGCCCGCGTCCCCGCCTACGCCCGGCTGCACCTGCGCAAGAGC ATCCCGCTCGCCGGCGGCCTGGCCGGCGGCAGCGCCGACGCCGCCG CCACCCTGATCGCCTGCGACCTGCTCTGGGGCCTCGGCATGAGCCGC GACGAGCTCGCCGAGGTCGGCGCCCAACTCGGCTCCGACATCCCCTT CCTGCTGCACGGCGGCACCGCCCTCGGCACCGGCCACGGCGAGGCG GTCAGCCCCATCCTGGCCCGCCCCACCACCTGGCACTGGACCGTCGC CATCGCCGACGGCGGCCTGGCCACCCCCGCCGTCTACCGCGAGCTC GACACCCTGCGCGCCGGCACCTGGCCACCCACTCCGCTCGGCAGCG CCGACACCCTGATGGCCGCCCTGCGCCAGCGCAACCCGGAAATCCTC GGCGCCGCCCTCGGCAACGACCTGCAACCGGCCGCCCTCGCCCTGC GCCCCCAGCTCGCCGACGTGCTCAAAGCCGGCACCGAGGCCGGCGC CCTCGCCGGCCTCGTCTCCGGCTCCGGCCCCACCTGCGTCTTCCTCG CCGCCGACGCCACACACGCCCAGGAGATCGCCGACAGCCTCACCGAA GCCGGCGTCTGCCGGGCCGCGGTCACCGCCCGCGGACCCCAGCCCG GCGCGCGGGTAATCTAG 28 DNA >ispF GTGATCATTCCGCGGGTGGGTATCGGCACGGACGTGCACGCATTCGA (ACSP50_ CGCTGACCGGGCCTGCTGGGTGGCCGGGCTGGAGTGGCCGGGGGAG 8046) 
CCGGGGCTGGCCGGGCACTCGGACGCGGACGTGGTGGCCCACGCGG CCTGTGACGCGCTGCTGTCGGCGGCCGGGCTCGGGGATCTGGGGGG CAACTTCGGGACGAGCCGGCCGGAGTGGGCCGGGGCAGCCGGGGTC ACGCTGCTCGCCGAGACGGCGCGGCTGGTCCGGGCGGCCGGGTTCG CGATCGGCAACGTGTCGGTGCAGGTGATCGGGAACCGGCCGAAGATC GGGAAGCGGCGGGCCGAGGCCGAGAAGGTGCTCTCCGCGGCGGTGG GGGCGCCGGTCACCGTGTCCGGGACCACATCCGACGGGCTGGGGCT CACCGGGCGTGGTGAGGGGCTGGCCGGAGTCGCGGTGGCGATGGTC TACACGGAGAACGCTCTTCCGGCCTGA 29 DNA >ispD GTGATCGCCGACCGCGACGTGACCGCGCAGCTCAATGCTCGCGGTGA (ACSP50_ CGTCGCGGTCGTCGTTCCGGCGGCGGGGGCGGGTCTCCGGCTCGGC 8047) CCGGGCGGCCCGAAAGCTCTGCGTCTGCTCGACGGCGAGCCGCTGC TCGTGCACGCGGTCCGGCGGTTGGCCGCGGCCGCGCCGGTCCGCAT GATCGTGGTGGCCGCTCCGCCCGCCGAGGTCGACGCGGTGTCCGCG CTCCTCGCCCCGGTGGCCCCGGTCACCGTCGTGCCCGGCGGCGCCG AACGCCAGGAATCGGTCGCCGCGGCACTCGCGGTCGTTCCGCCGGAC GTTCCGATCGTTCTGGTCCACGACGCGGCTCGATGCCTCACCCCGCC CTCGGTTACGGAGCGTGTCGCCGCCGCTGTCCGGGACGGTGCCGAC GCGGTGATCCCGGTCCTGCCGGTCGTCGACACGATCAAAGAGGTCGC GGCCGATGCCACCGTTCTCGGCACGGTCGACCGTTCCGTGCTGCGTG CGGTACAGACTCCGCAAGGCTTCCGCGCCTCGGTGCTGCGCGCCGCT CACCGGGCCGCCGCCGACTCACACACCGACGACGCCGGTGCCGTCG AGAAGCTCGGCATCCCGGTCCTGTGCGTCCCGGGCTCCGACCTCGCG CTCAAGATCACCCGGCCGATCGATCTGGCGCTCGCCACGCACCTCCT GGCCCTGCCGGACCCGGACGCCCCTACCGCCTGA 30 DNA >idi ATGAGCAGCATCGGTCACCTCAACCGTGAAGATCATCTCGTCGAGCTC (ACSP50_ GTCAACGAGGAGGGGCAGCCGCTCGGGTCGGCCACCGTCTCCGACG 0146) CCCACCTCTCGCCGGGTGCGCTGCACCGGGCCTTCTCGGTCTTCCTC ACCGACGATGAGGGCCGGGTGCTGCTCCAGCAGCGGGCCGCGGCCA AAACCCGCTTCCCGCTCCGCTGGGGCAACACCTGCTGCGGCCACCCC GCGCCCGGCGAGCCGGTCACGGTCGCCGCGGCGCGGCGTCTCACCG AGGAATTGGCGGTACGTGACGTCACGCTGACCGAGATCGGCGTGTAC ACCTACCGCGCGACCGACCCGGTCACCGGCCGGGTGGAGCACGAATA CGACCACGTGCTGATCGGCGCCCTGCCGGACGGCGTCGTGCCACACC CCGATCCGGCGGAGATCGCCACGCTGCGCTGGGCCTCGCTGCCCGG GCTGCGCACCGGGTTGACGGAGTCCCCCGAGCTGTACGCGCCCTGG CTCCCCGGGGTGTTCGAGATTCTCACGGAGCGGTCGGGTGTCCTTTC CACGGAGCGGTCGGGTGGCCGGTGA 31 DNA >crtE GTGGCCAATGACACCCTCGAGGGAAATCGCCTTGCCGCGATACCCCG IdsA GCAGTCCGTCTCTCACACTGGGCTGGTCGGTGCAGTCGAGGGGACGC (ACSP50_ TCGCCGACTTCCTCGCCTCCCAGATCGCCTCTCTCGACGCCGTCGACC 0148) 
CATCGCTCGGTGGCTTCGGCCGCACCGCCCGTGACCTGGTGATGGCC GGCGGCAAACGGCTGCGGCCGACGTTCGCGTACTGGGGCTGGCGCG GCGTCGCCGGGCCGGCCGCGGACGCCGAGACGCTGCTGCCCGCGCT CGGCGCGCTGGAGCTGATGCACACCTTCGCGCTCGTCCACGACGACG TGATGGACGACTCGTCCACCCGCCGCGGCCGGCCCACCGCCCACCG GATCTTCGCGGCCCAGCACGGCGGCCGGTTCGGCACGTCGGCCGCG ATCCTGGTCGGCGACCTCTGCCTGGTCTGGGCCGACCAGCTGTTGGC CCGCACCCCGGTGCCGGCGGCCACCCTGCTTGCAGTCCGCGCGCATT ACGACCGGATGCGGATCGAGGCGGTCGCCGGGCAGTATCTGGACGTC CTCGGTGAGACCGATCCGGCGTCCTGGTCGGTGGAGCGCGCACTGCT GGTCGCCCGGCACAAGACCGCCAGCTACACCGTGCAGCGGCCGCTC GACTTCGGCCTGGCCCTGGCCGGGGTCGAGGACGTGGAGGTCGCCG AGGCGTACCGGACCTACGGCATCGCCGTCGGCGAGGCCTTCCAGCTG CGCGACGACCTGCTCGGTGTCTACGGCGACCCGGCGGTGACCGGCA AACCGGTCAGCGACGACCTGCGCACCGGCAAACCGACCGCACTGCTG ATGCTGGCCCGTCGGATGGCCACCCCCGGCCAGCTGGCCGAGCTGG AGTCGGCGGAGATCGAGCGCAAGGCGCAGGTCGTCGCCGAGACCGG CGCCCCGGCCCGGGTCGAGGAGATGATCCGTGCCCGGGTCACCGAA GGACTGACCGCGCTGGCCTCGGCGCCGATCGACGCCGAGGCCCGTG CCACCCTGATCGAGCTGGCCACCGTGGCGACGCAGCGCCCGGCATGA 32 DNA >crtB ATGGAAACCGATCTGGCCGCCGCCTATGAGCGGTGCCGTGAGCTACA (ACSP50_ CCGAGAGCACGGACGCACGTACTACCTGGCGACCCGGTTACTACCGG 0149) CCTGGAAGCGCCGGCATGTGCACGCTCTGTATGGATTCACCCGGTTC GCCGACGAGATCGTCGACCGCACCGAGGCGCAACCACCCGCCGAGC GCGCCGCCGAGCTGGCCACCTGGTCCGCCGGATTCCTCGCCGGACT GCGCGGCGAGCCGGTCGACGACCCGCTGCTCCCGGCCGTGCTGCAC ACCATCGCGGTCTTCGGGCTCGACCTGGAGGACTTCGCGAAGTTCCT GCGCAGCATGGAGATGGACCTCACCGTCACCGGCTACCGCACCTACG ACGACCTGCTCGACTACATGGAGGGCTCGGCCGCCGTGATCGGCACC ATGATGCTGCCGATCCTGGGCTCCACCGACCCGGCCGCCGCCCGCGA ACCGGCCCGCCAGCTCGGCTTCGCCTTCCAGCTCACCAACTTCATCCG GGACGTCGCCGAGGACCTCGCGCGGGACCGGATCTACCTGCCCGAG GAGCACCTCGCCGAGTTCGGTGTGACCCGCGCCGACCTGGCCGCCG GCGTCGCCACCCCGGCGATCCGCGCGCTCATCCGGGCCGAGGTGGA CCGCGCCCGTGAGCACTACGCGGCCGCCGCCCCCGGCATCCCGCTG CTCGAACGCACCTCGCAGGCCTGCATGCGGACCGCCTTCCAGCTGTA CGGCGGGATCCTGGACGAGATCGAGGCGGCCGACTACGACGTGTTCG CCCGGCGGGTCACGGTGCCGAACCGGCGCCGGGCCGCGGTCGCCGT CCGCAGCCTGCTCACCCGGCCCGGCACCCCGGTCGAACTGGCGGCC TGA 33 DNA >ACSP50_ ATGGGCGCCCGCGTCGCGCTGTTCACCCGCGACCTGCGGATCCACGA 0150 
CAACCCGCTGCTCAGCGGGCCCGACCCGGTGGTGCCGCTGTTCGTCC TCGACCCACGGCTGAGCGGCCTCTCGGCCAACCGCAGCCGCTTTCTC CACCAGAGCCTGGCCGACCTGCGGAACAGTCTCCGCGAGCGTGGCG CCGACCTGGTGATCCGGGAGGGCGACCCGGTGGCCGAGACCATCGC GGTCGCCTCCGAGGTGGACGCGTCGACGATCACGGTGGCCGCCGAC GTGACCGGTTACGCCCAGCGGCGCGAGCGGCGGCTGCGGGACGAGC GATTCCGGGTGAAGACGGTGCCGAGCGTCACGGTGCTGCCGCCCGGT ACGGTCCGGCCGGGCGGGGGAGGCGAGTCGTACCGCGTGTTCACGC CGTACTTCAAAGCCTGGGAGAAAGCTGGGTGGCGCGCACCCTCCGCA ACGCCGGGGAAGGTCGCGATGCCGGCCGGCATCGCGCCGGGAAGGC TCCCCGAGATGCCCGCCGGCGACTCACCGGACGCCGTCGCCGGTGG CGAGACCGAGGGCCGCCGCCGGCTCCAGGCCTGGCAGAAAGAAATG GCGCGGTACGCCGAGGACCACGACGACATGGCCGCCGACAACACCA GCCGGCTCAGCGCCTACCTCCGGTTCGGCTGCCTGTCGCCGCTCGAA CTGGCGCTGGCCGCGAAAGCCGACGACTCTCCCGGCGCCCAGGCCT ACCTGCGGCAACTGTGCTGGCGGGACTTCTACTACCAGGTCACCGCG ACCTTCCCGGAGATCTCCACCCGGCCGCTGCGGGAGAAGGCGGACCA GAACTGGCGATACGACGACGACGCGCTGCGTCACTGGCAGGACGGCC TGACCGGGGTGCCGATCGTCGACGCCGGCATGCGCCAGCTCCGCGC GGAGGGCTGGATGCACAACCGGGCCCGGCTGATCACCGCCGCGTTC CTCACCAAACACCTGGGCATCGACTGGCGGCCCGGGCTGCAATGGTT CTTCCGCTGGCTGCTCGACGGCGACGTGCCGAACAACTCCGGCAACT GGCAGTGGACCGCCGGCACCGGCAACGACACCCGGCCCTATCGCAG GTTCAATCCCATTCGCCAAGCGCAGCGATTCGATGCGCAGGGCGTGTA CGTTCGGCGCTACGTACCGGAGTTGAAAGACATCGACGGTGTCACGG TGCATCAGCCGTGGCGACTGCCGGAATCGGTACGCCGCGGGCTCGAC TATCCCGGACCGTTGGAGTCACATCGGGACGAGGCGGTCTGGCTGCG CGACTGA 34 DNA >ACSP50_ ATGTCTGAAGCGCGGCAAGTGGACGTGGTGGTCGTCGGGCTCGGTGT 0151 CGGCGGCGAGGAGGTCGCCGGTCGCCTGGCCGCGGCCGGCCTGAG CGTGATCGGCGTCGAACACCGACTGGTCGGTGGCGAATGCCCGTACT GGGGATGCATCCCCACCAAGATCATGGTCCGCGCCGGGAACGCGCTG GCCGAGGCCCGCCGGATCCCCGGCCTCGCCGGGACGTCCACGGTGC GGGCCGACTGGGCGCCGGTCGCCAAACGGATCCGCGACGAGGCCAC CGACGACTGGAACGACAAGGTCGCCGTCGAGCGGTTCACCGGTAAGG GCGGAACGTTCGTCCGGGGCACGGCCGAACTGACCGGTCCCGGTCA GGTCCGGGTCGGGGACCAGGAATTCGCCGCTTCGCGCGGCGTGGTC ATCGCCACCGGCACCGCCGCTGTGGTCCCACCCATCGAGGGCCTGTC CGGTACGCCGTTCTGGACGAACCGTGAGGCCGTGGAAGCGGCGGCC CTGCCCGCATCGATGCTGGTGCTCGGCGGCGGGGCGATCGGGTGCG AGCTGGCCCAGGCGTACGCCCGGTTCGGCGTGCAGGTGACGGTCATC GAGGGCTCACCCCGGGTGCTGGCCATGGAGGAACCGGAGTCGTCCG 
AGGTGGCGGCCGCCGCCCTGACCGCCGACGGGGTCCGGATCGTCAC CGGGGTGCGCGCGCAGAAGGTCGCCCACGACGACGGGTTCCACGTG ACCCTCTCCGACGGCAGCGTGCTGGCCGGCGAGAAGCTGCTGGTCGC GACCGGGCGGGCGGCCCGGCTCGGCGGGCTCGGGCTGGACCGGGT GGGGCTGGACCCGTCGGCTCGATTCCTGGCCACCGATGACCGGCTGC GCGCCGGCGAGGGCATCTGGGCGGTGGGGGACGTGACCGGGAACG GGGCGTTCACCCACATGGCGATGTACGAGGCGGACATCGCGGTGCGG GACATCCTGGGGCAGGGCGGCCCGGGAGCCGACTACCGGGCGCGGC CGCGGGTGACCTTCCTCGACCCGGAGATCGGGGCGGTGGGGATGAC CGAGCAGCAGGCCCGGGACGCCGGCCTCGAGGTGCGGGTGGGGTAC GTGCCGCTGAACCAGACCTCGCGAGGGTTCATCCACGGGCCGGGGAA CGAGGGATTCCTCAAACTTGTCGCGGACGGGGAGCGGGGAGTGCTGG TCGGCGGGACGACCGCCGGGCAGTCCGGTGGCGAGATGATCGGGGC GGTGGCGGTGGCGGTGCACGCCGAGGTGCCGGTGTCGACGTTGCTC AGCCAGATCTGGGCGTACCCGACGTTTCATCGGGGGCTGGGGCAGGC GCTTCAGTCGCTGGCCTGA 35 DNA >ACSP50_ GTGAGCGAACCCGTCATCACCGAACCGGCTGCCTGGATCAACCTGCC 1631 CGACCTGTCCGAGAGGCTGGACGTGTCGATCAGCAAGGTGCACCAGA TGATCAGAGACGGCGACCTGCTCGCGGTCCGCCGCGACGGCATCCGC GTGGTGCCCGCCGAACTGGTGGCCAACGCCACCGTCCTCAAGCATCT GCCCGGTGTGCTGAACGTGCTCCGCGACGCCGGGTACAACGACGAAG AGGCCTTCCGGTGGCTCTACGCCGAGGACGCCGAGGTCGGCGGCAG CGCCGCGATCGCGCTCGGCGGTCAGCAGGCGCGCGAGATCAAGCGC CGCGCGCAGGCCCTCGGCTTCTGA 36 DNA >ACSP50_ ATGAGGCATTTGTCGTACGTCGCGGTGCTGGCCGGATGCCTGGCCGG 1632 GGCGCTGTGGCTGGAACCGATCCTGCGGGTCAACGTGCTGCGCCGGT GGCGTCGGCTGCTGCTGGCCGTGCTGCCGATGGCGGTCGTCTTCACC CTGTGGGACCTGGCGGCGATCGCGGCCGGCCACTGGCACTTCGACC CGGCCCAGATCACCGGCGTCTACCTCGGCGGCGGGCTGCCCCTCGA CGAGGTGCTGTTCTTCCTGGTGGTGCCGGTCTGCGCGATCCTCGGCT TCGAGGCCGTGCGGGCCGTGCTGCGACGTCCGGCGGGGGACGAGTG A 37 DNA >ACSP50_ GTGACCTACACCACCGCTGCGGTGCTCGGCGTGCTGGCCGCCCTCAC 1633 GCTCGACGTGCTGATCCTGCGGACCCGGCTCGTCGGGCGACTGGTGT TCTGGGCCACGTACCCCATCATCTTCGTCTTTCAGTTGATCTCGAACG GCATTCTGACCGGGCGCGACATCGTGATGTACGACCCGGCCGCGATC CTCGGCCCGCGGCTCGTCCACGCCCCGGTCGAGGACCTGCTGTTCGG TTTCGCCCTGGTGCTCGGCACGCTGTCGCTGTGGGTGGCGCTGGGCC GGCGCGGCATCCAGCGCACCCCGCGAGCCGGGTCTAGACGGACCGA CGAGTAG 38 DNA >crtE GTGACGAACTCCCCGCTCGACGAGGCCGGTCTGCGGTCGCGTGTCGA fps2 CAAGGCGCTGACCGTGTTCCTGGCCGGGCAGCGTGACCGGCTGCTG (ACSP50_ GCGATCGACCCGGCGCTGGCCGAGATGTCCGCCACGGTCTCCGAGTT 
1634) CGTGCTGGGCGGCGGGAAGCGGCTGCGGCCGGCATTCGCCTACTGG GGTTTCCGCGGGGCCGGCGGCGCCGACTCGGACGCCGTGGTGGCGG CCGTCGCCGCGCTGGAGCTGGTGCAGGCCAGCGCGCTGATCCACGA CGATCTGATGGACCGCTCGGACACCCGGCGCGGGGTGCCGTCGGTG CACCGTCGGTTCGAGAAACTGCACGCCGGCGAGGGCTGGCGGGGCA GCGCGGCCGGGTTCGGCGACTGCGCCGCGGTGCTGCTCGGCGACCT GGCCCTGGTCTGGTCGGACGAGCTGCTGCACACCTCGGGGATGGCG GTGGCCGACGTGCAACGGGCCCGCCCGATCTTCGACGGGATGCGCA CCGAGGTGACCGTCGGGCAGTACCTGGACGTGCTCACCCAGGCGACC GGCGACACGTCGCTGGAGCGGGCCGGCAAGGTGGCCGTCTACAAGG CCGCGAAATACACCGTGGAGCGTCCGCTGCTGCTGGGCGCGGCGCT GGCCGGAGCGGCCCCCGGGGTGCACGCGGCGTACTCGGCGTTCGGC CTGCCGCTGGGCGAGGCGTTCCAGCTGCGCGACGACGTGCTGGGCG TGTTCGGCGACCCGGAGCGGACCGGCAAGCCGGCCGGCGACGACCT GCGCGAGGGCAAGCGCACCTATCTGGTCGCGGCCGCCTTCGGCGCG CTGGACGCGGCCGGGCGGGCCGAACTGGACGCCGCGCTCGGCGACC CCGGCCTGGACGAGGCCGGGGTGGCCCGGCTGCGCACGGTCATCCG GGACAGCGGTGCGCTGGCCGCGACCGAGGCCCGGATCGACGAGCTG ATGACCGCGTCGATCGGCGCGCTGGACGCGGCACCGATCGATCAGGA CGCCCGGGAGGTGCTGCGCCGGCTGGCCGACGCGGCTACTCGTCGG TCCGTCTAG 39 DNA >ACSP50_ GTGTCTCTCGGACTTCCCTCCCGGCTGCCCGGCACCCCGTCGATCGG 1635 CGACCTGGTCCGCGGCGCGGCGCCGACGTTCTCCTTCGAGTTCTTCC CCGCCGAAGACACCGGACGGGGAGCGGCTGCTCTGGCAGGCCATCCG GGAGCTGGAGTCGCTGCGGCCCAGCTTCGTCTCGATCACCTACGGGG CCGGCGGCACCACCCGGGAGACCACGGTCGCGGTCACCGAGCGGGT CGCCACCGAGACCACGCTGCTGCCGCTGGCCCACCTCACCGCGGTCG ACCACTCAGTGGCCGACCTGCGCAACGTGATCGGCCGGCTGGCCGGC GCCGGGATCCGCAACGTGCTGGCGCTGCGCGGCGACCCGCCGGGCG ACCCGATGGGCGAGTGGGTCCGGCACCCGGACGGCGTCGGTTACGC CGACGAGCTGGTCCGGCTGATCCGCGAGTCCGGCGACTTCAGCGTCG GGGTGGCCGCCTTCCCGCACAAACACCCCCGGTCGGCCGGCGTCAA GGACGACACCCGCAACTTCGTCCGCAAGTGCCGGGCCGGTGCCGACT ACGCGATCACCCAGATGTTCTTCGACGCCGACGAATATCTGCGGCTGC GCGACCGGGTGGTGGCCGCCGGCTGTCACACCCCGATCGTGGCCGG CGTGATGCCGGTGACCCGGATGGCCACCATCGCGCGCTCCACCCAGC TCTCCGGCGCGCCCTTCCCGCCGGCGCTGCTGCGCGACTTCGAGCG GGTCGCCGGCGACGACGCGGCGGTGCGCGAGCTGGGCATCGAGACG TGCGCGGCGATGTGCGCCCGGTTGCTGCGGGAGGGTGTGCCGGGCA TCCACTTCATCACCATGAACCGGTCCACCGCCACCCGCGAGGTCTGG CAGCGGCTGGCCCCCGCGGAAGTCGCCGCGTCGGCGTGA 40 DNA >ACSP50_ GTGCAGCTGCAACAACTCCGGTACTTCCTGGCGGTGGTGGAGACCCG 1650 
GCATTTCACCCAAGCAGCGGACATTCTGGGCGTCTCGCAACCTACCTT GAGTAAGCAGATTCACACCCTTGAGATGTCACTCGGAGCCCCGCTGTT CGAGCGGATGCGCGGTGCGGTGACCCTGACCGTCGCCGGCGAGACA TTGCTGCCGATGGCCCAGCGGATCGTCGCCGACGCCGACGCGGCCC GCGACGCCGTGCAGGACATCGTCGGTCTGCGCCGCGGCGAGGTGCG CCTGGGTGCCACCCCGAGCCTGTGCTCCTCGCTGGTCCCGGCCGTGT TGCGCACCTTCCGCGCCGACCACCCGGGGGTCAAGCTGCACATCAGT GAGGGCAGCTCGCACGACCTGACCGCCGGCCTGCTGGCGCACACCC TGGATCTGGCCCTGATCGTGCAGCCCGAGCACGGCGTCGATCCGGCC CTGGTGGCCATCGAGCTGCTGCGCGAGAGCCTGGTGGTGGCCTCGGT CGCGGCCGGCCCGCCGCCCACCGTGGGCCGCCAACTGGAGCTCTCC GAGCTGCGCCACACCCCGATGGTGATGTTCCGCGAGGGCTACGACAT CCGTGAGGTCACCCTGCACGCCTGCGAGCGGGCCGGCTTCGCGCCG AAGTTCGCGGTCGAGGGTGGTGAGATGGACGCGGTGCTCGCCTTCGT CGAGGCCGGCCTCGGGGTCGCCCTGGTGCCCAGCATGGTGCTCGCC AACCGGCCGCTGCTGCGGGCCACCCCGCTCGCGCCGCCGGGGATGC GCCGGACCATCGCGCTCGCCCAGCGCCGTGCCGCGGTGCTGCCGCA TGCCGCGGCCGCGCTGCGTGAGGTGGTGCTCGACCACATCGGCTCG GGCCGGCTGCCGTTCGGCGTGCGCGCCCTGGAGAGACCGTCCACTTA G 41 DNA >ACSP50_ ATGGGCGAGTTCCACGACCCGCGACTCGTCGAGGTCTACGACGCCGA 1651 ATGTCCCTGGGGCTGGGACGACGACTTCTTCATGGCCGTGCTCGCCG AACGCTCCGCGCACCGGGTCGCCGACCTGGGGTGCGGCACCGGCCG GCTGGCCATCGCGATGGCCGCGGCCGGGCACGAGGTGATCGCGATC GACCCGGCGCCGGCCGCCCTGGCCGCGGCCCGCCGCAAGCCGGGC GGCACCCGGGTGCGCTGGCTGCAGGGCTCGGCCGAGCGGCTCGCCC CGCGCTCGCTCGACGCCGCGTTCATGACCGGTCACGTCGCCCAGTCC TTCGTCGACGACGAGGAATGGGACACCGTGCTCCGCGGGCTGCGCCG GGCGCTGGTCCCGGAGGGACGGCTGGTCTTCGACAGCCGGGACCCG GACGACCGGCCGTGGCAGCAGTGGAACCCGCAGGATTCGTGGCGCA CCGTGGTGCTCGACGACGGGAGGGTGGTGGAGGCGTGGAGCGAGGC CGAGCAGGTCGGGCTGAACACCGTGCGCGTCACCGGGCGCTACCGG TTCGCCGACGGAGGGGAACTGGCGAACTCGGCGACCCTGCGTTTCCG GACCGAGCCGGAGCTGCGCGACTCACTGCGCGAGGCGGGCTTCCGG GTCGAGCGGATCTACGGCGGCTGGGGGCGCGAGCCGGTGGGTCTGA GCGGCGACGGCGAGTTCATCGTGATCGCGGTCGCGACGCCCCGGCT GATGTCCTGA 42 DNA >ACSP50_ ATGCCCGAGAACGAGTGGCCCGACGACCCCCGCCCGCCCGACCAGG 1652 GCGAGTGGAGCCAGCCGCATCACGAGCCGCCACCCGGCCGTGGCCG CGCCCTGCTGGCCGCCGCGGTGGTGGTGCTGGTCCTGCTGGCCGCC GGCGGCATCGCCTGGCGTCTGATGAGCAGCCGCGGCGCTACGCCGG TGGCGCAGCCCACCGCGCCCGCCCCGACGCCCACCGCGCAGACCGC GCCACCCTGCCCACAGCCGCGCCTGCGGGTCGCCGCCGCGCCGGAG 
ATCGCCCCGGTGATCCAGCAGGCCGCCGCCGCACTCAGCCAGCCCG GCCAGCGCTGCTCCGAGGTGCTGGTGCAGGCCGCCGAGCCGGGCGC CGCGCTGACCGGCAAGCCGGACGTCTGGGTGCCGTCCAGCAGCGTG TGGCTGGCCCTGGCCAAAAGCCGCGGCGACGTCTACACCACGCAGGG CGCGTCGCTGGCCTGGTCGCCGCTGGTGATCGCCGGGCCGGAGTCG ATCGCCAGCCTGTTCGCGCCGAACGGGGTCACCTCCTGGTCCGGCCT GGTCCAGGGCACCATCCAGAAACGGGTGCCGGCGGTCCGGATGCCC GATCCGACGCTGACCACGACCGGACTGCTCAGCGTCTACGCGGTGGG CCAGGCCACGGTCAAGGCCAACCCGGACGCCGGGATCGCCCAGTTG CAGGCGCTCACCCTGCGCAGCCGGCTGGAGAACGCGGCCGCCGACC CGGCGGAACTGTTCGCGCAGATGGGCAAGCAGACCGACGCGGCCAC GGCGATCTACCAGGTCGGGGTCTTCCCGACCACCGAGCAGCAGCTGC TGACCTATCAGAAGAGTCAGCACGACGTCCGGCTGTCCGGCTCGGCG CCCGCCGACGGCCAGATCGACGCCGACTATCCGTACGCGGTCCGCAA GGGCGCCCCGGCCGACCTGGTCGAGAGCCTTCGCGAGGCGATCACC CCGGACGCGCTGACGACGGCCGGATTCCGGGCCACCGCGACCAAGA ACGCGCTGCGCCTGCCGGCCCCGGCCGTGCTCGCCGGGGCGGCCCG GCAGTGGTCGGCGTACAAGTCGGTGGCCTTCCAGGTGCTGCTGCTGA TCGACGCGTCCGGCTCGATGAACGAGAAGATCACCGACCGGGCCGGC CGCAGCGTCACCAAGGCCGCGCTGCTGCGCGAGTCCGGGACCAGCG CGGCCCAGCTCTTCGGTGACGACACCAGCCTCGGCCTGTGGTTCTTC GGCACCCCGACGGCGGACAGCCCGGCGCACACCGAGGAGGTGCCGT TCGGCCCGGTCATCGCCACCGTCGACGGCAAGAGCCGCCGTGACCTG CTGGCCGCCAAGATCGGCGAGTACCGGCCGGTGGCGAACGCCGGGA CCCCGCTCTACCAGAGCGTGCTGGACGGCGTCGCCGAGATGCGCGG CCGGGCCAAGCCGGACACGGCGACCGTGGTGGTGGTCCTCACCGAC GGCTCGGACGGCGGCACGAAGTACCGGATGTCCAACGCGGACTTCCT GAAGAAGCTGACCGCCGGTGCCGACCCCGCCAAGCCGGTGCCGGTG ATCGCCGTCGGTTACGGCCCGGCCGCGAACGCCACCGCCCTGCAGG CCATGGCCAAGGCCACCGGTGGCCAGGCGGTCACCGTCAAGAACCCG GCCGACCTGGCCGCCGGCATCGCCCAGGCCTTCCTCGCCGCACACAC CCACTAG 43 DNA >crtD ATGAGCGACATCGTGGTGGTCGGGGCTGGGGTCGGCGGGCTGGCCG (ACSP50_ CGGCGATCCGGCTGGCCGAGGCGGGGCATCGGGTCAGCATCCATGA 1653) GCGGTCCGGCGTGGTCGGCGGCAAGCTGGCGGCATACGAGAGGGAC GGCTACCGGTTCGACACCGGCCCCAGCCTGCTCACCCTGCCGGACGT GTTCACCGGCCTCGGTCTGGACCTGCGCCCGGAGCCGCTGGACCCG GTGGTGCGGCACTTCTTCCCGGACGGCACGGTGCTGGACTCGTCGTC GGACCACGAGACCTTCCTGGCCCGGATCACCGACGCGCTGGGCGGT GCCGCGGCGCGCGACTGGGACCGGTTCTGGCGCCGTGCCGAGCGGA TCTGGCACGCCTCCTGGGAGTCGGTGCTGCGCCGCCCGGTGACCGC GGCGTCGCTGGCCCGGCTGTCCTGGCGGCTCGGTGACCTGGCCGCG 
ATCGCTCCCGGCCGGTCACTGCGGTCGCTGGGCCGCCGCTATCTGCG CGACCCGCGGCTGCGGATGCTGCTGGACCGCTATGCGACGTATTCGG GCGCGGATCCGCGGCGGGCGCCGGCGGCGCTGGCCGCGATCCCCTA CGCCGAGCTGGCGTTCGGCGGGTGGTATCTGCCGGGTGGGCTGGTC ACCCTCGCGGAGGCGCTGCTCGCCCGATGCGAGAAACTGGGCGTACG GGTGCATCTGCACTCACCGGTCGCCTCGATCGCCACGACCGGCGCCC GGGTGTCCGGGGTCCGGCTGGGGGACGGGACCCGCCTCGCGGCGG ACGTCGTCGTCTCCAACGTGGACGCCGTCACGCTCTACCGGGATCTG CTGCCCAGTCCGAAACCGCTGGCCCGCCTCGCCGACCGGAGCCTGG CCGGATTCGTGCTGCTGCTCGCGGTGCGGGGCGAGACTCCGCGGCT GGCGCACCACAACGTGTTCTTCCCGCGGGACTACGACGCCGAGTTCG ACGCGGTCTTCGGGGGGCCGGGGCGGCGGGCGCGGCCGGCCGGCG ACCCGACCGTCTTCGTCACCCGGGCCGCGGATCCGGCGGTGCGCCC GGCCGGCGACGAGGCGTGGTTCGTGCTGGTCAACGCGGCGCCACAC GGCACCTCGTGGTCCACCGTGGACTGGCTGCGGGCGGGGCTGGCCG ACGCGTACCGGGATCGGGTCCTCGAGGTCCTGGCGGGGCGCGGTCT CGACGTACGCGATCGGCTGATCTTCGCCGAGACCCGGACCCCGGCGG ATCTGGCGGCGTCGGCCGCAGCGCCGGGCGGAGCGATCTACGGCAC CGCCGGCGGCCTGGTCCGGCCGGCGAACCGCGCGCCGGTCGACGG GTTGTTCCTGGTCGGCGGCTCGACGCATCCCGGCGGCGGGCTGCCG ATGGTCACCCTCTCCGCCGAGATCGTCGCGGGCATGATCGGATCGAA CTGA 44 DNA >cruC ATGATCGTCGCCTGGCTGATCCTGCCGCCGCTGCTGCTGATCACCGC (ACSP50_ ACACACCGCCGTCAACGCGCTGCTGCTGCGCCGCCCGCGCCGGGCG 1654) GCGACCAGCACCGAACGGGTCGCCGTCCTGCTCCCGCTGCGCGACG AGGCCACCCGGGTCACCCCGTGCCTGCGCGCCCTGCTCGCCCAGCG CGGCGTCGCCGATCTCACCGTGCACGTGCTCGACGACGGCTCCACCG ACGGCACCGCGGACGTGGTCCGGGCGGTCGCCGGCGACCGGGTCCG GCTGCACACCGGCACTCCGCCGCCGCCCGGCTGGCTCGGCAAACCG GCCGCCTGCCAACGGCTCGCCGACCTGGCCGGGGACGTGGACGTGC TGGTCTTCGTCGACGCCGACGTGGTGCTCGCGCCGGACGCGGTGGC CGGGGCCGTCGATCTGCTGCGCCGGGCCGGAGCGGACCTGCTCAGC CCGTACCCGAAGATCGTCGGTGCCGGCCGGCTGGTCCAGCCGCTGCT GCAGTGGTCCTGGCTGAGTTTCCTGCCACTGCGCGCGATGGAACGCT CGGCGCGGCCGTCGCTGGCCGCCGCCGGTGGCCAGTGGCTGGTGCT GGACCGGGCCGGTTACCGGCGAGCCGGTGGCCACGCCGCGGTGCGC GGCGAGATCCTGGAGGACATCGCGCTGGCCCGCGCGGTCAAACGGG CCGGCGGGCGGATCGCCCTGGCCGACGGTTCCGGCCTGGCCACCTG CCGGATGTACGAGTCCTGGGACGAGCTCGCCGACGGATACGCCAAAT CGCTGTGGGCGTCATTGGGGTCCGCGGCCGGCGCGACCGCCGTCAC GCTCCTGCTGATTCTGCTGTACGTGGTGCCACCCCTGCTGGCGCCCTT CGCCCCGCTTCCGGCGGTGCTCGGCTACCTGCTCGGCGTGACCGGC 
CGGATGATCGCCGCCAGGGCCACCGGCGGCCGCGTCCTGCCCGGCA CGCTGGCCCATCCGGTCTCCATCGTCCTGTTCGGCTACCTGATCGCCC GCTCCTTCCGGCTGCGCCGGGCCGGCCGCCTGGCCTGGCGCGGCCG CCCGGTGCCCTGA 45 DNA >cruF GTGTCTCCCCGTCATCTGCCCTGGGGCCTGCTCGGGGCGCTCGTGCT (ACSP50_ CGCCCAGATCTGCTATCCGCTCACCGAGGGTGACACCCGGGCCGGGC 1655) TGACCGTGCTCACCGTGCTGCTCGGCGTCGCGTTCTCGCTGAGCCAC GCGCTGCTCACCCGGGGCCCCCGGGCGCTCACGGCGCTGCTGTCGA CCGCCACCCTGGGCGGGTTCGCGGTGGAGGCGATCGGGGTGGCCAC CGGTTTCCCGTTCGGTTCCTACGAGTACTCCGGGCGTCTCGGTCCGC GCCTGCTCGGCGTACCGCTGATCATCCCGCTGGCCTGGACCTGGATG GCCTGGCCGGCCTGGCTCGCCGCGCTGCGGGTGACCCGGCGGCGGC TCCCCCGGATCCTGGTCGCCGGGGCCGGCCTGGCCGCCTGGGACGT CTTCCTCGACCCGCAGATGGTCGCCGAGGACTACTGGCGGTGGCGGC ACCCGGTGCCCGCGCTGCCCGGCGTGCCCGGTGTGCCGCTCGGCAA CTACCTGGGCTGGCTCGGCTTCGCGCTGCTGCTGATGACCGCGCTGG CCGCCGTCGCCGGCCGGGCCGCCGACCGGCCGCTGTCCGCCGACCG GCCGGCGCTCGCCCTGTGGATCTGGACGTACGCCTCGTCGGTGCTCG CCCACGCCGTCTTCCTGTCGCTGCCGGCGTCCGCGGCGTGGGGCGC GCTGATCATGGGCGCCGCGGTCCTCCCGCTGCTCGCCCGGCTGCGC GCACCCGCATGA 46 DNA >ACSP50_ ATGAGGCTTGTGGCGTGGCAGCCGGACGACCTGCTGCGGCGGCTCG 1656 ACGACGTGGTCGGGGTCTACGGCGAGGCGATGGGCTACCGCCAGGA GCTGCTGCAGACCCGCCGGGGATACATCGGGTCGCACGTGCGCCGG CCCGGGTTCCGGGCGGTGGCCACGCTGACCACCGAGGGCCGGCTGA TGGGCTTCGGATACGGCTACACCTCCGCCGCCGGCCAGTGGTGGCAC GACCAGGTCCGGTTCGCTCTCGGCGAGGACGACCGCCGGCAGTGGC TGACCGACTGCTTCGAGGTGGTCGAGCTGCACGTGCGCCCGGCCGCG CAGGGCCACGGGGTGGGCGCCCGGCAGCTGCGCGCGCTGCTGGCCA TGGCCAAAGGCCGCACCGTGCTGCTGTCCACTCCGGAGGCCGACGAG CAGGCGTCCCGCGCCTGGCGGCTGTACCGGCGGTACGGCTTCGCCG ACGTGCTGCGGCACTTCTACTTCCCGGGTGACGAGCGGGCCTTCGCG GTCCTCGGCCGCGAGCTGCCGCTGGCCGAGCGTCCGCTCGAGGACG CACCGGGCATCGCCGGCGCCTGA 47 DNA >ACSP50_ ATGACGCACGTCGCCCTGCACGTCTGGCGGGTGCCGCGCAGCGCCG 1657 TCGGCTCGGCCATGCTGCGCATGGCCTTCGCGCGGCGCCATCTGGCC GGTCTGCGGTTCGGCAAGTTCCTCGGCACCGGCACCGGCACCGGCTT CGGTCCCGGCGACACCGATCTCACCCGGTGGGCGGCGATCACGGTCA GTGATGCGCCGGTACGTTTCCCCGTCTGGGAGCGGATCGCCGTCAAC GGCGCCCGGATCGATCTGGAGCCACTGATCAGCCGGGGCACCTGGG CCGGCCGTACCCCGTTCGAGCCCACCGGCCGCCGCCCGGACGGTCC GGTGCTCGCGCTCACCCGGGCCCGGCTGCGGCCGGCTCGCGCGCTG 
ACCTTCTGGCGGGCGGTCCCGGCGGTGGTGCGCGAGGTGCACCGGG CGCCCGGGCTGCTCGCCCGGTTCGGCGTCGGCGAGGCGCCGATCGG CTGGCAGGGCACCGTCACCGTGTGGCGGGACGCGGCGGATCTCGTC GCGTTCGCGTACCGTCAGCCGGAGCATCGCGCGGCGATCGCCCGGA CCCCGGCCGACCGCTGGTACGCCGAGGAGTTGTTCGCCCGGTTCGCG GTGCTCGGGATCAGCGGTGACCGGTCCGTGCTGGGCTGGACCGCCG ACGAAGGGGAACGGGCGGAAGCATGA 48 DNA >ACSP50_ ATGACACAGACCATCGTGATCACCGGGGCCAGCTCCGGGGTCGGGCT 1658 GGCCGCCGCCGAGCAGCTCGCCGCCCGCGGTGACGAGGTGGTGCTG GTCGGCCGCGACCCGGGCCGGCTCGACGCGGCCGTGCAGCGGGTCC GGGAGGCCGGCGGCGGCCGCGCGCCCCGGCACTTCCGGGCCGACTT CGAACGGCTCGACGACGTGCGGGAGCTCGCCGCCGGGCTGCTGGCC GAGCTGCCCCGGATCGACGTGCTGGCCAACAACGCCGGCGGGATCAT CAAGCGGCCCCGGCAGACGGTGGACGGCCACGAGGCCACCATCCAG GGCAACCACCTGGCCCCGTTCCTGCTCACCCACCTGCTGCGGGAGCG GCTGACCGGGGGCCGGGTGGTGAACACCGCCTCGGCGGCACACGTG CAGGGCCGGCCCGGCACCCGGTTCACCGACGACCCGAAGTCGTACA GTCCGTGGCGCTCCTACGGGGCGAGCAAGGCGGCCAACATCCTGTTC GCCGCCGAGGCCGCCCGCCGCTGGCCGGACGTGTGCAGCGTCTCGT TCCACCCCGGTGTGGTGCGCACCAACTTCGGGGAGGGCCGGCTGATC CGGCTGTTCTACCGGTACGCGCCCGGCCTGGTCACCCCGGAGGCCG CCGGCGAGCTGCTGACCTGGCTGTGCACCACCCCGGCCGGGGAGCT GGAGAACGGCGCCTACTACGTCAAGCGTCAGGTGACCCGGCCGGCC GCGCACGCCCGCGACCCGCGGCTGGCCGCCGAGTTGTGGGACGCCA GCCTGACCGCGACCGGCCTCGCCGGATGA 49 DNA >crtE GTGATCGACGACTTCCTCAGCGCGCAACGCGACGTGCTGGCCGAGGT (ACSP50_ CAGCGACGACTGCGCGCCGCTGGAACGCTACGTGGCCGACCTGATGG 3873) GCGGCGGCAAACGACTCCGGCCGGCGTTCTGCTACTGGGCGTGGCG GGCGGCCGGCGCCCCCGACGGCCCGGGCATCGTGGCGGCCGCGAC ATCCCTGGAGTTCCTGCAGGCCGCCGCGCTGATCCACGACGACATCA TGGACGATTCGGACACCCGTCGCGGCGCCCCGGCGGTGCACCGCAG ACTGGCGGCCCTGCACTCCGGCGGCCGCTGGGCCGGGGACGCCGAC CACTTCGGGCTGTCCGCCGCCGTGCTCGCCGGCGACCTGTGCCTGAC CTGGAGCGACGCGTTGTATTCGGGCAGCGGCCTGCACCCGTCCGCGC TGGCCCGGGGCCGGCCGGTCTTCGACCGGATGCGCACCCAGCTGAT GGGCGGCCAGTATCTGGACCTGCTGGACCAGGCGCGGCCGTCCCGG GGCGGCGTCGACCGGGCGCGCCGGGTGGTGCACTTCAAGAGCGCCA AGTACACCGTCGAACATCCGCTGCTGCTCGGCGCCCGGCTCGCCGGC GCGGACGACGATCTGCTCGCCCGGTTGTCCGCGTTCGGTCTGCCGCT GGGCGAGGCGTTCCAGCTGCGCGACGACCTGCTCGGGGTCTTCGGC GACGCGGCGCAGACCGGCAAACCCACCGGCGACGACCTGCGCGAGG GAAAGCGCACCACGCTGGTCATCCTGGCCGCGGACCGCGCCACCGCA 
CCCCAGCAGGCCGCCCTCACCGCGCTGCTCGGCGATCGCGGCCTGA CCGGGGCCGGCGTCGACACCCTCCGGCAGATCATCGTGGACACCGGT GCCCGGGCCGAGGTCGAGCGGATGATCGAGCAACTGCTGGCGACGA GTCTCGGCGTGCTCAGCGGCACGCCCGTCGACGAGGCGGCCCGCTC GGTGCTGCTCGCCCTCGCCGAGGCGGCGACCGCCCGCAGCTCCTGA 50 DNA >ACSP50_ ATGGTGAGCACAGTGATCGCCTCGGGGCCCACCGGCCTGGGCACCTC 1950 CGCGGCCCGTCTCTTCGGTCGGGTGGACCGGGACGAGCCGGAGCTC TTCTGCCCGGCGCCGCTGCGCGACGACCGGGCGCTGGGGGAGCGGG TCAACGACGCCGTGGTCCAGTGGGCCGAGAAGGCCGGCATCTACCCC GGCCGGCTGGACAAGCTGCGCGGGGCGAACTTCGGCCGCTTCATGAT GCTCGCCCACCCGGCCACCAGCGATCCCGACCGGCTGCTCGCCGCG ACGAAGTGTCTGGTCGCCGAGTGGGCGGCGGACGACTACTACGTCGA CGAGGTGTCCCTGGGCGCGGATCCGATGGTGGTCGGCTCGCGGCTG GCCAACCTCTACTCGGTGGTCGACCCGGCCTCGCTGACCCCGCGCTA TCAGGCCGACTTCGAGAAGCATCACCGCCTGCAGCCGATCTCGGTGG CGTTCCGCACCGCGATGGAACACCTGGCCGAGTACGCCTCGGTCACC CAACTGGCCCGGTTCCAGCACCAGATGGCGATCCTGTTCGTCGCCTG GTCGCAGGAGGCCGACTGGCACGCCAACCGGCGCACCCCGCCGGTC TGGGAGTATCTGGTGCAGCGGCACCTGAACAGCTATCTGCCGCCGAT GATCCTGGTCGACGTGCTGGCCGGGTACGAGCTGTCGCCGGCCGAGT TCTTCGATCCGCGGGTCCGCGCGGCGTTCACCACCGCAGGCAACGCC GCCGTGCTGGTCAACGACCTCTACTCGGGCAGGAACGAGTCCGAGAC CGATCACAACCTGCCGACCGTGCTGGTGTCCGGGGAGCGGCTCACGC CGCGGGCCGCGGTCCGGCGCACCGTGGAGATCCACAACGAGTTGAT GCACACCTTCGTGACCTCGGCCGCGTCGTTGAGCGCGTCCGGCTCGC CGCAGCTGCGCCGGTTTCTCGCGGACACCTGGGCCTGGCTGGGCGG AAGTCGCGAGTGGCACGCCACGAGCGGCCGCTACCACTCATCCAACT GA 51 DNA >ACSP50_ ATGACGACCACCGCACCGACTCCCGCCCACCTCGCCGGCAACTTCGC 5522 GCCCGTCACCGGGGAGACCACCACGCTCGACCTGCCGGTCACCGGC GCCGTCCCGGCCGAACTGACCGGGTGGTATCTGCGCAACGGGCCCAA CCCCCACCACGGGACCTCGGCGCACTGGTTTCTCGGCGACGGCATGG TGCACGGCGTCCGCCTCGATCACGGCCGGGCCACCTGGTACCGCAAC CGCTGGGTGCGGACCCGGGTGCTGACCGACGACGCCCGCGCCTACG GCCCGGACGGCACCCGCGACCTCACCGCCGGCCCGGCGAACACCAA CGTCGTGCGCCACGGCGGACGACTGCTGGCGCTGGTCGAGTCCGCG CTTCCGTACGAGATCACCACCGACCTGGAGACCGTCGGCCCCTACGA CTTCGGCGGCCGCCTGCACACCCCGATGACCGCCCACCCCAAGGTCT GTCCCACCACCGGGGAGATGCACTTCTTCGGCTACGGCGGACTCGAG CCGCCCTACCTCACCTACCACCGCGCCGGCGCGGACGGCCGGCTGT CGCTCAGCCGCCCGATCGACGTCCCCGCGCACACGATGATGCACGAC TTCAGCCTCACCGCGGCCCACGTGATCTTCATGGACCTGCCGGTGCT 
GTTCAGCCTGGACGGGGCGCGGACCGGCGGCATGCCGTACCGGTGG GACGACACCTACCAGGCGCGCCTGGGCGTGCTGCGGCGCGACGCCC CGCAGGGGGAGGTCCGCTGGTACACCATCGATCCCGGATACGTCTTC CACACCCTGAACGCCCACGACGACGGCGACCGGATCGTCATGCACGT CGTCCGCCACGAGCACGCGTACCGCCCGGGGCAGCCCGCCGCCGCA CCGGACCTCTGGCGCTGGACCATCGACCAGCGCACCGGCCGGGTCG CCGAGGAACGGCTGGACGACGAAGCGGTCGAGTTCCCCCGCATCGAC GATCGGCGCACCGGGCAGCCGGCCCGTTACGGCTTCGCCGTGACCG ACAACGTTCCCCGCCGGCTCGCCGACGTCAGCGCCGTCATCCGCTAC GACCTGCACACCGGCTCGACCACCCGGCACCGCCTGCCGACCGGGC AGGTACCCGGGGAGGCGGTCTTCGTGCCGGCCGGCGGCGCCCCCGC CGGATCGGCCGACGGCTGGCTGCTGACGTTCGCCTACGACCCGGGG CGCGACGCCAGCGATCTGATCATCATCGACGCCACCGACCTCGCCGC CCCGCCGCTGGCCCGGATCCACCTGCCGCACCGGGTGCCGTTCGGC TTCCACGGCAACTGGCTGCCCGACCACGACCGCGCAGAATAG 52 PRT >Dxs MSDSPSTPAGLLASVTGPGALKRLSAEQLTLLAAEIRDFLVAKVSKTGGHL (ACSP50_ GPNLGVVEMTLAMHRVFDSPRDKILFDTGHQAYVHKIVTGRQDGFDLLRQ 7096) RGGLTGYPSQAESEHDLIENSHASTALSYADGLAKAFALRGEDRHVVAVV GDGALTGGMCWEALNNIAATKNRLVIVVNDNGRSYAPTIGGLADHLSTLRL NPGYEKVLDLVKDALGSTPLVGKPVFEVLHAVKRGIKDAVSPQPMFEDLG LKYIGPVDGHDQQAMESALRRAKGFNAPVIVHAVTRKGYGYRPAEQDEA DCLHGPGAFDPQTGALTAKPSLKWTKVFAEELVKIADERPDVVGITAAMA EPTGIAALAKKYPDRAYDVGIAEQHAATSAAGLAMGGLHPVVAVYATFLNR AFDQVLLDVAMHRLPVTFVLDRAGITGPDGPSHYGIWDMSVFGAVPGLRI AAPRDAATLREELREAVAVDDGPTIVRFPTGAVAADTPAVRRVGQVDVLR EAEKKDILLVAVGSFVGLGLDAAERLAEQGYGVTVVDPRWVRPVPIELTGL AAQHRLVVTLEDGIRAGGVGDAVAAALRDAGVHVPLRDFGVPAGFHPHG TRAEILASLGLTAQDVARDVTGWVSGLDAGTSVAAPAI 53 PRT >IspG MTAISLGMPAVPPPPLAPRRQSRQINVGGVLVGGGAPVSVQSMTTTLTSD (ACSP50_ VNATLQQIAELTAAGCQIVRVAVPSQDDVEALPAIAKKSQIPVIADIHFQPKY 7248) VFAAIDAGCAAVRVNPGNIRQFDDKVKEIARAASDAGVPIRIGVNAGSLDK RLLEKYGKATAEALVESALWECSLFEEHGFRDIKISVKHNDPVVMIRAYRQ LAEQCDYPLHLGVTEAGPAFQGTIKSAVAFGALLAEGIGDTIRVSLSAPPVE EIKVGQQILESLGLRERGLEIVSCPSCGRAQVDVYTLAEQVTAALDGFPVP LRVAVMGCVVNGPGEAREADLGVASGNGKGQIFVKGKVIKTVPEAVIVET LVEEALRLADEMGAELPDELRELLPGPTVTVH 54 PRT >Dxr MRELVLLGSTGSIGTQAIDIVRRNPELFRVVAIGAGGGNVALLAAQALELGV (ACSP50_ EVVGVARASVVQDLQLAFYAEAQKRGWSSGDFKLPKIVAGPDAMTELAR 7250) WPCDVVLNGVVGSLGLAPTLAALESGRILALANKESLVAGGPLVRRIAKDG 
QIVPVDSEHSALAQCLRGGRAAEVRRLVLTASGGAFRGRRRAELTNVTPE EALKHPTWDMGPWTINSATMVNKALEVIEAHELFGVPYDDIAVMVHPQS VLHSLVEFTDGSTLAQASPPDMRLPIALALAWPDRVPGAAAAVDWTLAHN WELRPLDDEAFPAVELAKAAGRYGRCRPAIFNAANEECVAAFAAGRLPFL GIVDTLERVLAAAPDFAEPSTVDDVLAAESWARAQAQRTIATVAEGA 55 PRT >IspH MLLAKPRGYCAGVDRAVQTVEEALKLYGAPVYVRKQIVHNKHVVSTLEAR (ACSP50_ GAIFVEENYEVPEGATWFSAHGVAPEVHDQARERRLKAIDATCPLVTKVH 7707) HEAKRFAAEDYDILLIGHEGHEEVIGTSGEAPAHIQLVDGPDDVANWVRD PAKVVWLSQTTLSVDETMETVARLKTRLPLLQSPPSDDICYATSNRQHVIK EIAPECDVVIVVGSTNSSNSVRLVEVALGAGARAGHLVDYAAEIQDEWLAG ATTVGVSSGASVPDELVMEVLAHLAERGFGEVTEFTTAEERLTFSLPQEL RKDMKAAEAARAAAAG 56 PRT >IspE MTEAWGPDDDEPRPYSGPVKVRVPAKINLHLAVGPLRPDGYHELNTVYH (ACSP50_ AISLFDEITARHGDTLTLTMEGEGTGDLALDETNLIIRAARALAARARVPAY 7802) ARLHLRKSIPLAGGLAGGSADAAATLIACDLLWGLGMSRDELAEVGAQLG SDIPFLLHGGTALGTGHGEAVSPILARPTTWHWTVAIADGGLATPAVYREL DTLRAGTWPPTPLGSADTLMAALRQRNPEILGAALGNDLQPAALALRPQL ADVLKAGTEAGALAGLVSGSGPTCVFLAADATHAQEIADSLTEAGVCRAA VTARGPQPGARVI 57 PRT >IspF MIIPRVGIGTDVHAFDADRACWVAGLEWPGEPGLAGHSDADVVAHAACD (ACSP50_ ALLSAAGLGDLGGNFGTSRPEWAGAAGVTLLAETARLVRAAGFAIGNVSV 8046) QVIGNRPKIGKRRAEAEKVLSAAVGAPVTVSGTTSDGLGLTGRGEGLAGV AVAMVYTENALPA 58 PRT >IspD MIADRDVTAQLNARGDVAVVVPAAGAGLRLGPGGPKALRLLDGEPLLVHA (ACSP50_ VRRLAAAAPVRMIVVAAPPAEVDAVSALLAPVAPVTWPGGAERQESVAA 8047) ALAVVPPDVPIVLVHDAARCLTPPSVTERVAAAVRDGADAVIPVLPVVDTIK EVAADATVLGTVDRSVLRAVQTPQGFRASVLRAAHRAAADSHTDDAGAV EKLGIPVLCVPGSDLALKITRPIDLALATHLLALPDPDAPTA 59 PRT >Idi MSSIGHLNREDHLVELVNEEGQPLGSATVSDAHLSPGALHRAFSVFLTDD (ACSP50_ EGRVLLQQRAAAKTRFPLRWGNTCCGHPAPGEPVTVAAARRLTEELAVR 0146) DVTLTEIGVYTYRATDPVTGRVEHEYDHVLIGALPDGVVPHPDPAEIATLR WASLPGLRTGLTESPELYAPWLPGVFEILTERSGVLSTERSGGR 60 PRT >CrtE MANDTLEGNRLAAIPRQSVSHTGLVGAVEGTLADFLASQIASLDAVDPSLG IdsA GFGRTARDLVMAGGKRLRPTFAYWGWRGVAGPAADAETLLPALGALELM (ACSP50_ HTFALVHDDVMDDSSTRRGRPTAHRIFAAQHGGRFGTSAAILVGDLCLVW 0148) ADQLLARTPVPAATLLAVRAHYDRMRIEAVAGQYLDVLGETDPASWSVER ALLVARHKTASYTVQRPLDFGLALAGVEDVEVAEAYRTYGIAVGEAFQLRD DLLGVYGDPAVTGKPVSDDLRTGKPTALLMLARRMATPGQLAELESAEIE 
RKAQWAETGAPARVEEMIRARVTEGLTALASAPIDAEARATLIELATVATQ RPA 61 PRT >CrtB METDLAAAYERCRELHREHGRTYYLATRLLPAWKRRHVHALYGFTRFAD (ACSP50_ EIVDRTEAQPPAERAAELATWSAGFLAGLRGEPVDDPLLPAVLHTIAVFGL 0149) DLEDFAKFLRSMEMDLTVTGYRTYDDLLDYMEGSAAVIGTMMLPILGSTD PAAAREPARQLGFAFQLTNFIRDVAEDLARDRIYLPEEHLAEFGVTRADLA AGVATPAIRALIRAEVDRAREHYAAAAPGIPLLERTSQACMRTAFQLYGGIL DEIEAADYDVFARRVTVPNRRRAAVAVRSLLTRPGTPVELAA 62 PRT >ACSP50_ MGARVALFTRDLRIHDNPLLSGPDPVVPLFVLDPRLSGLSANRSRFLHQSL 0150 ADLRNSLRERGADLVIREGDPVAETIAVASEVDASTITVAADVTGYAQRRE RRLRDERFRVKTVPSVTVLPPGTVRPGGGGESYRVFTPYFKAWEKAGWR APSATPGKVAMPAGIAPGRLPEMPAGDSPDAVAGGETEGRRRLQAWQK EMARYAEDHDDMAADNTSRLSAYLRFGCLSPLELALAAKADDSPGAQAYL RQLCWRDFYYQVTATFPEISTRPLREKADQNWRYDDDALRHWQDGLTG VPIVDAGMRQLRAEGWMHNRARLITAAFLTKHLGIDWRPGLQWFFRWLL DGDVPNNSGNWQWTAGTGNDTRPYRRFNPIRQAQRFDAQGVYVRRYVP ELKDIDGVTVHQPWRLPESVRRGLDYPGPLESHRDEAVWLRD 63 PRT >ACSP50_ MSEARQVDVVVVGLGVGGEEVAGRLAAAGLSVIGVEHRLVGGECPYWG 0151 CIPTKIMVRAGNALAEARRIPGLAGTSTVRADWAPVAKRIRDEATDDWND KVAVERFTGKGGTFVRGTAELTGPGQVRVGDQEFAASRGVVIATGTAAV VPPIEGLSGTPFWTNREAVEAAALPASMLVLGGGAIGCELAQAYARFGVQ VTVIEGSPRVLAMEEPESSEVAAAALTADGVRIVTGVRAQKVAHDDGFHV TLSDGSVLAGEKLLVATGRAARLGGLGLDRVGLDPSARFLATDDRLRAGE GIWAVGDVTGNGAFTHMAMYEADIAVRDILGQGGPGADYRARPRVTFLD PEIGAVGMTEQQARDAGLEVRVGYVPLNQTSRGFIHGPGNEGFLKLVAD GERGVLVGGTTAGQSGGEMIGAVAVAVHAEVPVSTLLSQIWAYPTFHRGL GQALQSLA 64 PRT >ACSP50_ MSEPVITEPAAWINLPDLSERLDVSISKVHQMIRDGDLLAVRRDGIRVVPAE 1631 LVANATVLKHLPGVLNVLRDAGYNDEEAFRWLYAEDAEVGGSAAIALGGQ QAREIKRRAQALGF 65 PRT >ACSP50_ MRHLSYVAVLAGCLAGALWLEPILRVNVLRRWRRLLLAVLPMAVVFTLWD 1632 LAAIAAGHWHFDPAQITGVYLGGGLPLDEVLFFLVVPVCAILGFEAVRAVLR RPAGDE 66 PRT >ACSP50_ MTYTTAAVLGVLAALTLDVLILRTRLVGRLVFWATYPIIFVFQLISNGILTGRD 1633 IVMYDPAAILGPRLVHAPVEDLLFGFALVLGTLSLWVALGRRGIQRTPRAG SRRTDE 67 PRT >CrtE MTNSPLDEAGLRSRVDKALTVFLAGQRDRLLAIDPALAEMSATVSEFVLG fps2 GGKRLRPAFAYWGFRGAGGADSDAVVAAVAALELVQASALIHDDLMDRS (ACSP50_ DTRRGVPSVHRRFEKLHAGEGWRGSAAGFGDCAAVLLGDLALVWSDELL 1634) HTSGMAVADVQRARPIFDGMRTEVTVGQYLDVLTQATGDTSLERAGKVA 
VYKAAKYTVERPLLLGAALAGAAPGVHAAYSAFGLPLGEAFQLRDDVLGV FGDPERTGKPAGDDLREGKRTYLVAAAFGALDAAGRAELDAALGDPGLD EAGVARLRTVIRDSGALAATEARIDELMTASIGALDAAPIDQDAREVLRRLA DAATRRSV 68 PRT >ACSP50_ MSLGLPSRLPGTPSIGDLVRGAAPTFSFEFFPPKTPDGERLLWQAIRELES 1635 LRPSFVSITYGAGGTTRETTVAVTERVATETTLLPLAHLTAVDHSVADLRN VIGRLAGAGIRNVLALRGDPPGDPMGEWVRHPDGVGYADELVRLIRESGD FSVGVAAFPHKHPRSAGVKDDTRNFVRKCRAGADYAITQMFFDADEYLRL RDRVVAAGCHTPIVAGVMPVTRMATIARSTQLSGAPFPPALLRDFERVAG DDAAVRELGIETCAAMCARLLREGVPGIHFITMNRSTATREVWQRLAPAE VAASA 69 PRT >ACSP50_ MQLQQLRYFLAWETRHFTQAADILGVSQPTLSKQIHTLEMSLGAPLFERM 1650 RGAVTLTVAGETLLPMAQRIVADADAARDAVQDIVGLRRGEVRLGATPSL CSSLVPAVLRTFRADHPGVKLHISEGSSHDLTAGLLAHTLDLALIVQPEHG VDPALVAIELLRESLVVASVAAGPPPTVGRQLELSELRHTPMVMFREGYDI REVTLHACERAGFAPKFAVEGGEMDAVLAFVEAGLGVALVPSMVLANRPL LRATPLAPPGMRRTIALAQRRAAVLPHAAAALREVVLDHIGSGRLPFGVRA LERPST 70 PRT >ACSP50_ MGEFHDPRLVEVYDAECPWGWDDDFFMAVLAERSAHRVADLGCGTGRL 1651 AIAMAAAGHEVIAIDPAPAALAAARRKPGGTRVRWLQGSAERLAPRSLDA AFMTGHVAQSFVDDEEWDTVLRGLRRALVPEGRLVFDSRDPDDRPWQQ WNPQDSWRTVVLDDGRVVEAWSEAEQVGLNTVRVTGRYRFADGGELAN SATLRFRTEPELRDSLREAGFRVERIYGGWGREPVGLSGDGEFIVIAVATP RLMS 71 PRT >ACSP50_ MPENEWPDDPRPPDQGEWSQPHHEPPPGRGRALLAAAVVVLVLLAAGGI 1652 AWRLMSSRGATPVAQPTAPAPTPTAQTAPPCPQPRLRVAAAPEIAPVIQQ AAAALSQPGQRCSEVLVQAAEPGAALTGKPDVWVPSSSVWLALAKSRGD VYTTQGASLAWSPLVIAGPESIASLFAPNGVTSWSGLVQGTIQKRVPAVR MPDPTLTTTGLLSVYAVGQATVKANPDAGIAQLQALTLRSRLENAAADPAE LFAQMGKQTDAATAIYQVGVFPTTEQQLLTYQKSQHDVRLSGSAPADGQI DADYPYAVRKGAPADLVESLREAITPDALTTAGFRATATKNALRLPAPAVL AGAARQWSAYKSVAFQVLLLIDASGSMNEKITDRAGRSVTKAALLRESGT SAAQLFGDDTSLGLWFFGTPTADSPAHTEEVPFGPVIATVDGKSRRDLLA AKIGEYRPVANAGTPLYQSVLDGVAEMRGRAKPDTATWVVLTDGSDGG TKYRMSNADFLKKLTAGADPAKPVPVIAVGYGPAANATALQAMAKATGGQ AVTVKNPADLAAGIAQAFLAAHTH 72 PRT >CrtD MSDIVWGAGVGGLAAAIRLAEAGHRVSIHERSGWGGKLAAYERDGYRF (ACSP50_ DTGPSLLTLPDVFTGLGLDLRPEPLDPVVRHFFPDGTVLDSSSDHETFLAR 1653) ITDALGGAAARDWDRFWRRAERIWHASWESVLRRPVTAASLARLSWRLG DLAAIAPGRSLRSLGRRYLRDPRLRMLLDRYATYSGADPRRAPAALAAIPY AELAFGGWYLPGGLVTLAEALLARCEKLGVRVHLHSPVASIATTGARVSG 
VRLGDGTRLAADVVVSNVDAVTLYRDLLPSPKPLARLADRSLAGFVLLLAV RGETPRLAHHNVFFPRDYDAEFDAVFGGPGRRARPAGDPTVFVTRAADP AVRPAGDEAWFVLVNAAPHGTSWSTVDWLRAGLADAYRDRVLEVLAGR GLDVRDRLIFAETRTPADLAASAAAPGGAIYGTAGGLVRPANRAPVDGLFL VGGSTHPGGGLPMVTLSAEIVAGMIGSN 73 PRT >CruC MIVAWLILPPLLLITAHTAVNALLLRRPRRAATSTERVAVLLPLRDEATRVTP (ACSP50_ CLRALLAQRGVADLTVHVLDDGSTDGTADVVRAVAGDRVRLHTGTPPPP 1654) GWLGKPAACQRLADLAGDVDVLVFVDADWLAPDAVAGAVDLLRRAGAD LLSPYPKIVGAGRLVQPLLQWSWLSFLPLRAMERSARPSLAAAGGQWLVL DRAGYRRAGGHAAVRGEILEDIALARAVKRAGGRIALADGSGLATCRMYE SWDELADGYAKSLWASLGSAAGATAVTLLLILLYVVPPLLAPFAPLPAVLG YLLGVTGRMIAARATGGRVLPGTLAHPVSIVLFGYLIARSFRLRRAGRLAW RGRPVP 74 PRT >CruF MSPRHLPWGLLGALVLAQICYPLTEGDTRAGLTVLTVLLGVAFSLSHALLT (ACSP50_ RGPRALTALLSTATLGGFAVEAIGVATGFPFGSYEYSGRLGPRLLGVPLIIP 1655) LAWTWMAWPAWLAALRVTRRRLPRILVAGAGLAAWDVFLDPQMVAEDY WRWRHPVPALPGVPGVPLGNYLGWLGFALLLMTALAAVAGRAADRPLSA DRPALALWIWTYASSVLAHAVFLSLPASAAWGALIMGAAVLPLLARLRAPA 75 PRT >ACSP50_ MRLVAWQPDDLLRRLDDVVGVYGEAMGYRQELLQTRRGYIGSHVRRPG 1656 FRAVATLTTEGRLMGFGYGYTSAAGQWWHDQVRFALGEDDRRQWLTDC FEVVELHVRPAAQGHGVGARQLRALLAMAKGRTVLLSTPEADEQASRAW RLYRRYGFADVLRHFYFPGDERAFAVLGRELPLAERPLEDAPGIAGA 76 PRT >ACSP50_ MTHVALHVWRVPRSAVGSAMLRMAFARRHLAGLRFGKFLGTGTGTGFG 1657 PGDTDLTRWAAITVSDAPVRFPVWERIAVNGARIDLEPLISRGTWAGRTPF EPTGRRPDGPVLALTRARLRPARALTFWRAVPAVVREVHRAPGLLARFGV GEAPIGWQGTVTVWRDAADLVAFAYRQPEHRAAIARTPADRWYAEELFA RFAVLGISGDRSVLGWTADEGERAEA 77 PRT >ACSP50_ MTQTIVITGASSGVGLAAAEQLAARGDEVVLVGRDPGRLDAAVQRVREAG 1658 GGRAPRHFRADFERLDDVRELAAGLLAELPRIDVLANNAGGIIKRPRQTVD GHEATIQGNHLAPFLLTHLLRERLTGGRWNTASAAHVQGRPGTRFTDDP KSYSPWRSYGASKAANILFAAEAARRWPDVCSVSFHPGVVRTNFGEGRLI RLFYRYAPGLVTPEAAGELLTWLCTTPAGELENGAYYVKRQVTRPAAHAR DPRLAAELWDASLTATGLAG 78 PRT >CrtE MIDDFLSAQRDVLAEVSDDCAPLERYVADLMGGGKRLRPAFCYWAWRAA (ACSP50_ GAPDGPGIVAAATSLEFLQAAALIHDDIMDDSDTRRGAPAVHRRLAALHSG 3873) GRWAGDADHFGLSAAVLAGDLCLTWSDALYSGSGLHPSALARGRPVFDR MRTQLMGGQYLDLLDQARPSRGGVDRARRVVHFKSAKYTVEHPLLLGAR LAGADDDLLARLSAFGLPLGEAFQLRDDLLGVFGDAAQTGKPTGDDLREG 
KRTTLVILAADRATAPQQAALTALLGDRGLTGAGVDTLRQIIVDTGARAEVE RMIEQLLATSLGVLSGTPVDEAARSVLLALAEAATARSS 79 PRT >ACSP50_ MVSTVIASGPTGLGTSAARLFGRVDRDEPELFCPAPLRDDRALGERVNDA 1950 WQWAEKAGIYPGRLDKLRGANFGRFMMLAHPATSDPDRLLAATKCLVAE WAADDYYVDEVSLGADPMVVGSRLANLYSVVDPASLTPRYQADFEKHHR LQPISVAFRTAMEHLAEYASVTQLARFQHQMAILFVAWSQEADWHANRRT PPVWEYLVQRHLNSYLPPMILVDVLAGYELSPAEFFDPRVRAAFTTAGNA AVLVNDLYSGRNESETDHNLPTVLVSGERLTPRAAVRRTVEIHNELMHTFV TSAASLSASGSPQLRRFLADTWAWLGGSREWHATSGRYHSSN 80 PRT >ACSP50_ MTTTAPTPAHLAGNFAPVTGETTTLDLPVTGAVPAELTGWYLRNGPNPHH 5522 GTSAHWFLGDGMVHGVRLDHGRATWYRNRWVRTRVLTDDARAYGPDG TRDLTAGPANTNVVRHGGRLLALVESALPYEITTDLETVGPYDFGGRLHTP MTAHPKVCPTTGEMHFFGYGGLEPPYLTYHRAGADGRLSLSRPIDVPAHT MMHDFSLTAAHVIFMDLPVLFSLDGARTGGMPYRWDDTYQARLGVLRRD APQGEVRWYTIDPGYVFHTLNAHDDGDRIVMHVVRHEHAYRPGQPAAAP DLWRWTIDQRTGRVAEERLDDEAVEFPRIDDRRTGQPARYGFAVTDNVP RRLADVSAVIRYDLHTGSTTRHRLPTGQVPGEAVFVPAGGAPAGSADGW LLTFAYDPGRDASDLIIIDATDLAAPPLARIHLPHRVPFGFHGNWLPDHDRA E 81 DNA >tipA atccctagaacgtccgggcttgcacctcacgtcacgtgaggaggcagcgtggacggcgtggtaccaag promoter cttattggcactagtcgagcaacggaggtattccg 82 DNA >gapDH gtactggccgatgctgggagaagcgcgctgctgtacggcgcgcaccgggtgcggagcccctcggcga promoter gcggtgtgaaacttctgtgaatggcctgttcggttgctttttttatacggctgccagataaggcttgcagcat ctgggcggctaccgctatgatcggggcgttcctgcaattcttagtgcgagtatctgaaaggggatacgc 83 DNA >lacZ? 
taatgtgagttagctcactcattaggcaccccaggctttacactttatgcttccggctcgtatgttgtgtgga promoter attgtgagcggataacaatttcacacaggaaacagctatgacatgattacgaattcgatatcgcgcgcggcc andgene gcggatcctctagagtcgacctgcagcccaagcttggcactggccgtcgttttacaacgtcgtgactggg aaaaccctggcgttacccaacttaatcgccttgcagcacatccccctttcgccagctggcgtaatagcga agaggcccgcaccgatcgcccttcccaacagttgcgcagcctgaatggcgaatggcgcctgatgcggt attttctccttacgcatctgtgcggtatttcacaccgcataaattccccaatgtcaagcacttccggaatcggg agcgcggccgatgcaaagtgccgatcaacataa 84 DNA >T4 aagctttatgcttgtaaaccgttttgtgaaaaaatttttaaaataaaaaaggggacctctagggtccccaatt terminator aattagtaatataatctattaaaggtcattcaaaaggtcatcca 85 DNA >PhiC gtggacacgtacgcgggtgcttacgaccgtcagtcgcgcgagcgcgagaattcgagcgcagcaagcc 31 cagcgacacagcgtagcgccaacgaagacaaggcggccgaccttcagcgcgaagtcgagcgcgac integrase gggggccggttcaggttcgtcgggcatttcagcgaagcgccgggcacgtcggcgttcgggacggcgga gene gcgcccggagttcgaacgcatcctgaacgaatgccgcgccgggcggctcaacatgatcattgtctatga cgtgtcgcgcttctcgcgcctgaaggtcatggacgcgattccgattgtctcggaattgctcgccctgggcgt gacgattgtttccactcaggaaggcgtcttccggcagggaaacgtcatggacctgattcacctgattatgc ggctcgacgcgtcgcacaaagaatcttcgctgaagtcggcgaagattctcgacacgaagaaccttcag cgcgaattgggcgggtacgtcggcgggaaggcgccttacggcttcgagcttgtttcggagacgaagga gatcacgcgcaacggccgaatggtcaatgtcgtcatcaacaagcttgcgcactcgaccactccccttac cggacccttcgagttcgagcccgacgtaatccggtggtggtggcgtgagatcaagacgcacaaacacc ttcccttcaagccgggcagtcaagccgccattcacccgggcagcatcacggggctttgtaagcgcatgg acgctgacgccgtgccgacccggggcgagacgattgggaagaagaccgcttcaagcgcctgggacc cggcaaccgttatgcgaatccttcgggacccgcgtattgcgggcttcgccgctgaggtgatctacaagaa gaagccggacggcacgccgaccacgaagattgagggttaccgcattcagcgcgacccgatcacgctc cggccggtcgagcttgattgcggaccgatcatcgagcccgctgagtggtatgagcttcaggcgtggttgg acggcagggggcgcggcaaggggctttcccgggggcaagccattctgtccgccatggacaagctgta ctgcgagtgtggcgccgtcatgacttcgaagcgcggggaagaatcgatcaaggactcttaccgctgccg tcgccggaaggtggtcgacccgtccgcacctgggcagcacgaaggcacgtgcaacgtcagcatggcg gcactcgacaagttcgttgcggaacgcatcttcaacaagatcaggcacgccgaaggcgacgaagaga 
cgttggcgcttctgtgggaagccgcccgacgcttcggcaagctcactgaggcgcctgagaagagcggc gaacgggcgaaccttgttgcggagcgcgccgacgccctgaacgcccttgaagagctgtacgaagacc gcgcggcaggcgcgtacgacggacccgttggcaggaagcacttccggaagcaacaggcagcgctg acgctccggcagcaaggggcggaagagcggcttgccgaacttgaagccgccgaagccccgaagctt ccccttgaccaatggttccccgaagacgccgacgctgacccgaccggccctaagtcgtggtgggggcg cgcgtcagtagacgacaagcgcgtgttcgtcgggctcttcgtagacaagatcgttgtcacgaagtcgact acgggcagggggcagggaacgcccatcgagaagcgcgcttcgatcacgtgggcgaagccgccgac cgacgacgacgaagacgacgcccaggacggcacggaagacgtagcggcgtag 86 DNA >PhiC cccaggtcagaagcggttttcgggagtagtgccccaactggggtaacctttgagttctctcagttgggggc 31 gtagggtcgccgacatgacacaaggggtt attachment site 87 DNA >incP ccggccagcctcgcagagcaggattcccgttgagcaccgccaggtgcgaataagggacagtgaaga aggaacacccgctcgcgggtgggcctacttcacctatcctgccc 88 DNA >traJ atggctgatgaaaccaagccaaccaggaagggcagcccacctatcaaggtgtactgccttccagacg aacgaagagcgattgaggaaaaggcggcggcggccggcatgagcctgtcggcctacctgctggccgt cggccagggctacaaaatcacgggcgtcgtggactatgagcacgtccgcgagctggcccgcatcaat ggcgacctgggccgcctgggcggcctgctgaaactctggctcaccgacgacccgcgcacggcgcggt tcggtgatgccacgatcctcgccctgctggcgaagatcgaagagaagcaggacgagcttggcaaggtc atgatgggcgtggtccgcccgagggcagagccatga 89 DNA >ColE ttgagatcctttttttctgcgcgtaatctgctgcttgcaaacaaaaaaaccaccgctaccagcggtggtttgtt 1/pMB tgccggatcaagagctaccaactctttttccgaaggtaactggcttcagcagagcgcagataccaaatact 1/pBR gttcttctagtgtagccgtagttaggccaccacttcaagaactctgtagcaccgcctacatacctcgctctgc 322/ taatcctgttaccagtggctgctgccagtggcgataagtcgtgtcttaccgggttggactcaagacgatagt pUCori taccggataaggcgcagcggtcgggctgaacggggggttcgtgcacacagcccagcttggagcgaac gacctacaccgaactgagatacctacagcgtgagctatgagaaagcgccacgcttcccgaagggaga aaggcggacaggtatccggtaagcggcagggtcggaacaggagagcgcacgagggagcttccagg gggaaacgcctggtatctttatagtcctgtcgggtttcgccacctctgacttgagcgtcgatttttgtgatgct cgtcaggggggcggagcctatggaaa 90 DNA >aac(3) gtgcaatacgaatggcgaaaagccgagctcatcggtcagcttctcaaccttggggttacccccggcggt IV gtgctgctggtccacagctccttccgtagcgtccggcccctcgaagatgggccacttggactgatcgagg 
ccctgcgtgctgcgctgggtccgggagggacgctcgtcatgccctcgtggtcaggtctggacgacgagc cgttcgatcctgccacgtcgcccgttacaccggaccttggagttgtctctgacacattctggcgcctgccaa atgtaaagcgcagcgcccatccatttgcctttgcggcagcggggccacaggcagagcagatcatctctg atccattgcccctgccacctcactcgcctgcaagcccggtcgcccgtgtccatgaactcgatgggcaggt acttctcctcggcgtgggacacgatgccaacacgacgctgcatcttgccgagttgatggcaaaggttccct atggggtgccgagacactgcaccattcttcaggatggcaagttggtacgcgtcgattatctcgagaatga ccactgctgtgagcgctttgccttggcggacaggtggctcaaggagaagagccttcagaaggaaggtcc agtcggtcatgcctttgctcggttgatccgctcccgcgacattgtggcgacagccctgggtcaactgggcc gagatccgttgatcttcctgcatccgccagaggcgggatgcgaagaatgcgatgccgctcgccagtcga ttggctga 91 DNA >cgt GCCCGGCCCTGTCGAGCTGACGGCTGTCCCGCGGCCTCGTCATCGGT promoter GCTGTCGAGCAGGCTGTCGCCTGGTAGGAAGATTGCCATGGTCCAGA TGGACCCCCTCAGCGCACGTCCCGATGGACGACGTTCCGTCTTGTCG ACGACTCCGAGCCGCCCGACCCACCGGGCCTGAGCGCGCCCGATCA CGGCTCCCCGGCCTGACGGGTTCTGCACCTCCGGCGGCTTTCCCGAG GACGGCGTGGTGGTCGGTGACGGCTGCTGGACCTCCTCCGGTGGGC AAGCGTTTCGGTGAGGTGGGCAGCCCGGCTGCGGGCACATCGGGGG CGGAGAGACGCTTAGGTTTATTGCAAGTTCTTTCTTCGGTGGCGCGGC GTGTCATCAGCAGCCGATTGTGGCATTCTGGTGACGCATTGACGCAGG TCACAGATTTGTTGGGATAGGCAACGAACAATTCCTAAATCGCCTATTC GGACAAATAGGCTTGACCTGACGACGCTGTCCCACCACTGTGGATGAC GCCTACCGCGCAAGTTCTGGAAGTACTTGCAATCAGCGGTGAGGATCA TCAAAGGGGACTGTC 92 DNA >efp TGGAGCACATCTGCCGGTAGACCCGATTCGCCCTCACCAGCGAATCG promoter CCGGTAAAGTGGTTCGGTCAACGATTCGAGTCAAGATCAAGGCAGGAC ATGGCTTCCACCAACGACCT 93 DNA >rpsJ ATTGCGGGTTGTCGCCGGTGAGAGCCGGTGACAACCCCCACCGGTGA promoter CCCCGATTAGCAATGCTGCGTTCAATCGGGCATACTAGTCAGGTTGCG TCCGCGCGGGGTGGGTGGCTGGCGTTCGTCAGCCGCCCACCCTCGC CGGGTGTCCGGGTGTGTTTCCAGCCGCCCGGCGCCCTCAGATCCCCG CGATCGCGTTCGTCCCCGGCAAGATCGGGGATGGAGGCCGAAAGCTG AGTGCCCAGCACTCTGTGACGAGGCGCGACACGCCCGACCGCGGGG GTCGGACAACGCAGGATCAACGGTCCTGCGGGCATGTGGGGGCCACC GCCTCCGCACGTAGCGGCATCGAGAGAAGGAAACAGAAGCCACC 94 DNA >katE ATCTCGGGCTCGGTAGGCATCAGGCACTCGTTTCGTCGGGCTCTCGT promoter GACAGTGACCTTGATACTGGAGGGGTACGACAAAACCGGGACCGCCA CCGACGTCCGGACCGACCCGATCGTCGGCCACGAACAGGGCCGGAT 
GGTCGTCGTGACGCGTCCGCGAGACGCCGTCCGGGCCGGGCCGATG CTCGGCCGGACCGTTTGCCGGGGTTCATGCGGGGTATCCGCCATCCG ATCACATACCCTTATCGAGGAGTTTGTCCGG 95 DNA >moeE5 AGGGCGCCACCAGCTGGAGCCCCATCCCCGCGGGGACCAGGAGGGC promoter GAGCAGCGCCACGGCGGTCCGTTCACCGCGCAGGTAGCGGACAAAC GTGGAGAGATGCCGCAACGGACTGTCTGCCAACGCGCCCCTCCCCCG TTCGCCCGGCGGCGAGCGGCCAGCATAAAGTCCTGTGCGCCTCCTTG TGAATGACGCCTCGTCAACGGCGGCCGGAGCACGCCCTTTCTGCGGG AAGCCGATAGCGGACGCCGCTCCGGGAGGGGGCGAAGCACACCATT GCTCGTGATTGACGCATGCTGTTAGACTCCCCACGTCTCTTGGTCCGG ACATGCGTTTCTCAACGCCGAAAGCCTGGTCAACCGCACTTTCGGCAC CGCACAGTCCCACGGCGTCCGAGCGGTCGCGCGAGTCGGCCCGGTC GAGCCAGAGGCAGCCACACGAACGTGCACCGCAATGCACCGCCTTGA TC 96 DNA >apm CTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGAT promoter CCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGC AGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTT TTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGAT TTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTGGTTC ATGTGCAGCTCCATCAGCAAAAGGGGATGATAAGTTTATCACCACCGA CTATTTGCAACAGTGCCGTTGATCGTGCTATGATCGACTGAGC 97 DNA >cdaR GGGCCCCCCCTCGAGGTCGACGGTATCGATAAGCTTGATTTTTCTGGA promoter CAACCTGGACGCCGAGACCTCGGCGGCGACCCTCGCGCTGTTCCTCA CGGCCCTGTGGGCGCTGGCCGTCATCGCCCGCCCCTACACCTGGTGG CGCGTGCTGCTGGTGCTGACCATGGCGGTGGGCTTCGCCGTGGTGCT GGTGGTGCCCTACCTCCAGGAGTTCTTCCAGCTGAAGCTGGTCGGCG TCACCGCGCCGTGGGCGGCGGTCGCCTGTGCGGCGGTCGCCGGGCT GGTGCTGGAGTTGGTGTGGGCACGTATGCGGCGTCGTCTCGACGCCG ACTGAGCCCACCGGGCGGTCGACCCCCGTACCGCCCGGTGAAGAGG AGGGGACGCCCGGTCCGTGCCGGGCGTCCCCTCCGTCTTTGTGCGCC CCCCGCCGACCGGAACGGCACGATCCGGCCAAACCTGCGCAGCGGT GCGGCCGGAGGAGCCGCTTCCGGGCCGTTCGACGGGCGGCCCGCCA CGGGACCGGAACGAGCCCCGGCATCCGCCGCCACCAGCGGATTTCAC ATTCCTTACGCAATCGGCGGCGAGAGCGACCGGCAGGTAACCTCGGG GCTGAATCCAGGCCATCGGGGAATAGCAAACGGCGCACTGACGAAAG CAAGGGCAGAGACCTGCCGAAAGTTGAGTGTTGGATTCAAAGAAGATC CGTATTATTCCGACTGCAGGCAGGGGGGAGCCGGCTACGAAGGAAAA GTTCCGCAGGTCAGATTGGGCCGGGTCGCAGGCAGCGCCGCACCGG CAACCACGACCGCGACTTTCGTCGACGCACCCCCTCGCACCGCCGCC CGGCCACCGGTCCGGGCGCACGACCCGAAGGGAAGTGAGGCTCACG CACGGACCAGCAGCTCCTGACGCAGCGACCCGGACCCGGAGGTGAG 
TGACATGACGACGAGGCCCCGACCAGCGGTGAACCCTGCTGACCCGG CCGTAACGAAGTCTTCATGCCCGTGGCACCCGACGGCTTCGGAGAGT TTCGGCACGCAGACATCAGCACAACTTGACGCGGGGGTATCAAGAGG TCATGGATCTTCGGTACC 98 DNA >ermE* GCGGTTGATCGGCGATCGCAGGTGCACGCGGTCGATCTTGACGGCTG promoter GCGAGAGGTGCGGGGAGGATCTGACCGACGCGGTCCACACGTGGCA CCG CGATGCTGTTGTGGGCACAATCGTGCCGGTTGGTAGGATCCAGCGAG CA 99 DNA >rpsL TGAGCACGTCCGCGAGCTGGCCCTGCAGGCGGAAGTCAGGTAGACAC promoter GACTTCCGCTAGTCCTTGCAAGGTCTGCTGACGTGAGGCGGGGCGGT CGTTTTTGACCGCCCTGCCTTCGTCATGTAGGCTCGCTCGCTGTGCCT GGCGTGTCATCAGACGCCCAGGTCCCGGTGCCGTGAGGCCCGGGCC ATCGAGCCGGTGGTACGTGGCTGCGGTCCCCTTGTGAGGGCTGCGCG CCGTGTGCTGTCCGGCGCGCACAGCCTTGAATCCACCCGCGGGGGCC GGCCGGTCTCCGTGAGCTCGAGTAGACGACGGAGACGTA 100 DNA >ACSP50_ GTGGCGACTCCCACGCAGTCCGAGATCCGCGAGGAAGAGCACGAAGA 1949 GCAGCGGCAGAGCCTGAGCACGGCGGCGGCCCGCAACCTCACGACC ACCACCAAGACCGCGCCGCAGATGCAGGAGATCACTTCGCGATGGCT GCTCCGTAAGCTTCCCTGGGTTCAGGTCGCCGGTGGGGCGTATCGGG TGAACCGGCGGATGACTTATCGGATCGGCGACGGCCGGCTGAGCTTC ACCAACGTCGGTGCGCAGGTCCGGGTCGTCCCGGCCGAGCTGCGGG AACTCTCGGTGCTCAGCGAGTTCGACGACGCGGACGTGCTGGCCGCC ATGGCCGACAAGTTCGTGCAGCAGGAGTACCAGCCCGGTCAGGTGAT CGTCGAGTTCGGCTCGGTCGCCGACCACGTGTACGTGATCGCGCACG GCAAGGTGAACAAGGTCGGCGTCGGCAACTACGGCGACCCGGTCAAC CTGGGGGTGCTCGCCGACGGGGAGGCGTTCGGCGAGAAGTCGCTCA CCGACGAGGAGCGGATCTGGGACTACACCGCCAAGGCGATGACCGC GGTGACCCTGCTGGCCATGCCGCGCTCGGCGTTCACCGCGCTGCTCG GCCAGAGTGACCACCTGCGCACGCACGTCGAGCAGTTCCGGGCCAAG AACCGCCGGCCGCAGAACAAGCACGGCGAGGCGGAGATCTCGGTGG CCGCCGGGCACACCGGCGAACCGAAGCTGGACGGCACGTACGTCGA CTACGAGCTGACGCCGCGCGAATACGAGCTGAGCGTCGCGCAGACCG TGCTGCGCGTGCACACCCGGGTCGCCGACCTCTACAACGAGCCGATG AACCAGGTGGAGCAGCAGCTCCGGCTGACCGTCGAGGCGCTGCGCG AGCGTCAGGAATACGAAATGATCAACAACCGCGAGTTCGGCCTGCTGC ACAACGCCGACCTGCGGCAGCGCATCCACACCCGGGGCGGCCCGCC CACCCCGGACGACCTCGACGAGCTGCTCAGCATGCGGCGCGGCACCA GGATGTTCGTGGCCCACCCGCAGGCGGTCGCCGCGTTCGGCCGGGA GTGCACCAAGCGGGGCATCTATCCACCGATGCTGGAACAGGACGGCG GCACCTTCCTGTCCTGGCGCGGGGTCCCGATCCTGCCGTGCGGCAAG ATCCCGGTGACCGAGACGCACACCACCTCGATCCTGGCGATGCGCAC CGGGGAGAGCGACCAGGGTGTGGTCGGGCTGCACCAGACCGGGATC 
CCGGACGAGTACGAGCCGAGCCTGTCCGTGCGGTTCATGGGGATCAG CGAGCAGGCGATCATGTCGTACCTGGTGAGCGCGTACTACTCGGCCG CGGTGCTGGTGCCGGACGCGCTGGGCATCCTGGACCACGTCGAGCT GTCCCACTGA 101 DNA >ACSP50_ ATGACAAGTGCTGTTGCTTCGCCACTGCGGACCGACTTCGAGCGCTCG 1951 GTCGCCAGCTACTGGAACACCAACCGGGCCGACCCGGTCAACCTGCG CCTCGGCGAGGTCGACGGGCTGTACCACCACCACTACGGCGTCGGCG AGCCCGACCTCAGCGTGCTGGACGGCCCGGCCGACACCCGCGAGCA GCGGATCATCGCCGAGCTGCACCGGCTGGAGAACGCCCAGGCCGAC CTGCTGCTCGACCACCTCGGCCCGATCCGGCCGGGCGACGCGCTGCT CGACGGCGGGTCCGGCCGCGGCGGCACCAGCATCATGGCCAACGCG CGGTTCGGCTGCCGGGTCGACGGGGTGTCCATCTCGGAATACCAGGT GGGTTTCGCCAACGAGCAGGCCGCTCAGCGCGGCGTCGCCGACAGG GTGCGCTTCCACTTCCGCAACATGCTGGACTCCGGATTCGCGACCGG GTCACGGCAGGCGATCTGGACGAACGAGACGACGATGTACGTCGACC TGTTCGACCTGTACGCGGAGTTCGCCCGGATGCTCGGCTTCGGCGGC CGCTACGTGTGCATCACCGGTTGCGCCAACGACGTGACCGGCCGGCG CTCCAAGGCGGTCAACAGGATCAACGAGCACTACACCTGTGACATCCA CCCGCGCAGCGACTACTTCAAGGCGCTCGCCGCCCACGATCTCGTGC CGATCGCCGTCACCGACCTGACCGCGGCCACCATCCCGTACTGGGAG CTGCGCGCCCGGTCCGAGGTGGCGACCGGGATCGAACAGGCTTTCCT CACGGCGTACTCAGAAGGCAGTTTCCACTACCTTCTGATCGCCGCCGA TCGGGTCTGA 102 DNA >ACSP50_ ATGGCCCTGCCGATCGAGGACTACGCGATCATCGCCGACACCCAGAC 1952 CGCGGCCCTGGTCGGTCGCAACGGATCGATCGACTGGCTCTGCGTGC CCCGCTTCGACTCCGGCGCGATCTTCGCGGCGCTGCTCGGCGAGGC GGAGAACGGCCACTGGACCATCGCACCGTCCGGCGAGGTGGTCACCA CCCGCCGCCGCTACCGGGACGACACGCTGGTGTTGGAGACGGAGTTC GAGACGGCCGGCGGCGTCGCCCGGTTGATCGACTTCATGCCGCCGC GCACCGACTCGCCGTCCGTCATCCGGATCGTCGAGGGCGTCCGCGG GCAGGTGGACTTCGGCATGGAGCTGCGGCTGCGCTTCGACTATGGAC ACGTCGTGCCATGGGTCTACCGCGAGGGTGGGGCGCTCGTCGCGGT CGCCGGTCCGGACGCGGCCTGGTTGCGCACCGACGTGCCGACCCGG GGCGAGAATCTGACCACCAAAGCCGATTTCCGGGTACGGGCGGGGGA ACGCGCCGCCTTCACCCTGACCTGGCGCCCGTCGCATCTGCCCTCGC CCGCCCCGCTGGACCCGGCCCACGAGCTCGGCGTGACCGAGGGTTA CTGGCGCGGCTGGGTGTCCGCCTGCACGTACGAGGGGGAGTGGCGG GACGCCGTCGTCCGATCGCTGCTCACTCTGAAAGCCCTCACCTACGCA CCCACCGGCGGCATTGTCGCGGCCGCCACCACCAGCCTCCCGGAGAA ACTCGGCGGCGTCCGCAACTGGGACTACCGCTTCTGCTGGCTCCGCG ACGCCACCATCACCCTGCAGTCGCTGCTCTTCTCCGGTTTCCAGAGTG AGGCGATCGCCTGGCGCAAATGGCTGCTGCGCGCGATCGCCGGCAAC 
CCCGCCGAGCTGCAGATCATGTACGGCGTCGCCGGCGAACGCCGCCT CGACGAGTATCTGGCCGACTGGCTCACCGGCTACGACGGCAACCCGG TCCGGATCGGCAACGCCGCCGCCGAGCAGTTCCAGTTGGACGTGTAC GGCGAGGTGATGGACGCCCTGCATCAGGGCCGCCGGGCCGGCCTCA AAGCCGACGACCCGTCCTGGGGCCTGCAGGTCAAACTGATGGAGTTC GTCGAGGAGCACTGGCAGGACCCGGACGAGGGCATCTGGGAGGTCC GCGGCGGCCCCCGCCAGTTCACCCACTCCAAACTGATGGCCTGGGTC GCCGCCGACCGCGCCGTCAAGGCCGTCGAGGAGTTCGGCCTGGACG GCCCCGCCGACCGCTGGCGCCGCCTGCGCGACGAGATCCGTCAGGA CATCCTGGACAAGGGTTACGACCCGGTCCGCAAGACCTTCACCCAGTA CTACGGCTCCGATGAGCTCGACGCCGCGATGCTGATGGTCCCCCTGG TCGGCTTCCTCCCCGGGGATGACGAACGCGTCGCCGGCACGGTCGCC GCCATCGAGCAACACCTGCTGGTCGACGGTTTCGTCCAGCGGTACAC CCAACATCCGGACGCCGACGTCGACGGCCTTCCCCCGGGCGAGGGC GCGTTCCTGGCCTGCACGTTCTGGCTGGCCGACAACTACGCGCTGAT GGGTCGCCACGACGAGGCCCGGGAGACGTTCGCCCGCCTGCTGGCC CTGCGCAACGACGTGGGTCTGCTCGCCGAGGAGTACGACACCACCAC CGGCCGCCTGGTCGGCAACTTCCCTCAGGCCTTCAGTCACGTCCCGC TGATCGACACGGCCCGGACCTTGACCAGCGCGCTGGCGCCGACCGA GGCCCGGGCCTCGGAGGGCCTCAGGTAG 103 DNA >ACSP50_ atgcgtacggtgattcgtgggatcgtggtgttggcgctggtggccgggggtggcgccggcatggtggggc 1953 ccgccggagcggcgccggcggtgacgttcaagaactgcactgagctgaacaagaagtacaagcacg gggtcggcaagcggggcgccgaggacagggtgagcgggtccaccaagccggtcaccaccttctccgt gaacaacgatctctatgcggcgaacaagaggctggaccgtgacaaggacgggatcgcctgcgagaa gcggtga 104 PRT >ACSP50_ MATPTQSEIREEEHEEQRQSLSTAAARNLTTTTKTAPQMQEITSRWLLRKL 1949 PWVQVAGGAYRVNRRMTYRIGDGRLSFTNVGAQVRVVPAELRELSVLSE FDDADVLAAMADKFVQQEYQPGQVIVEFGSVADHVYVIAHGKVNKVGVG NYGDPVNLGVLADGEAFGEKSLTDEERIWDYTAKAMTAVTLLAMPRSAFT ALLGQSDHLRTHVEQFRAKNRRPQNKHGEAEISVAAGHTGEPKLDGTYV DYELTPREYELSVAQTVLRVHTRVADLYNEPMNQVEQQLRLTVEALRERQ EYEMINNREFGLLHNADLRQRIHTRGGPPTPDDLDELLSMRRGTRMFVAH PQAVAAFGRECTKRGIYPPMLEQDGGTFLSWRGVPILPCGKIPVTETHTTS ILAMRTGESDQGVVGLHQTGIPDEYEPSLSVRFMGISEQAIMSYLVSAYYS AAVLVPDALGILDHVELSH 105 PRT >ACSP50_ MTSAVASPLRTDFERSVASYWNTNRADPVNLRLGEVDGLYHHHYGVGEP 1951 DLSVLDGPADTREQRIIAELHRLENAQADLLLDHLGPIRPGDALLDGGSGR GGTSIMANARFGCRVDGVSISEYQVGFANEQAAQRGVADRVRFHFRNML DSGFATGSRQAIWTNETTMYVDLFDLYAEFARMLGFGGRYVCITGCANDV 
TGRRSKAVNRINEHYTCDIHPRSDYFKALAAHDLVPIAVTDLTAATIPYWEL RARSEVATGIEQAFLTAYSEGSFHYLLIAADRV 106 PRT >ACSP50_ MALPIEDYAIIADTQTAALVGRNGSIDWLCVPRFDSGAIFAALLGEAENGH 1952 WTIAPSGEVVTTRRRYRDDTLVLETEFETAGGVARLIDFMPPRTDSPSVIRI VEGVRGQVDFGMELRLRFDYGHVVPWVYREGGALVAVAGPDAAWLRTD VPTRGENLTTKADFRVRAGERAAFTLTWRPSHLPSPAPLDPAHELGVTEG YWRGWVSACTYEGEWRDAVVRSLLTLKALTYAPTGGIVAAATTSLPEKLG GVRNWDYRFCWLRDATITLQSLLFSGFQSEAIAWRKWLLRAIAGNPAELQI MYGVAGERRLDEYLADWLTGYDGNPVRIGNAAAEQFQLDVYGEVMDALH QGRRAGLKADDPSWGLQVKLMEFVEEHWQDPDEGIWEVRGGPRQFTHS KLMAWVAADRAVKAVEEFGLDGPADRWRRLRDEIRQDILDKGYDPVRKT FTQYYGSDELDAAMLMVPLVGFLPGDDERVAGTVAAIEQHLLVDGFVQRY TQHPDADVDGLPPGEGAFLACTFWLADNYALMGRHDEARETFARLLALR NDVGLLAEEYDTTTGRLVGNFPQAFSHVPLIDTARTLTSALAPTEARASEG LR 107 PRT >ACSP50_ MRTVIRGIVVLALVAGGGAGMVGPAGAAPAVTFKNCTELNKKYKHGVGKR 1953 GAEDRVSGSTKPVTTFSVNNDLYAANKRLDRDKDGIACEKR 108 DNA >anti- CACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATG sense TTGTGTGG 1(put. Anti- sense promoters) 109 DNA >anti- ACGCGGTCGAACACGCGGTGGTACATGTCCAGCCACGCGCACTGGTA sense CTCTTCGGAC 2(put. 
Anti- sense promoters) 110 DNA >pSET aagcgcggggaagaatcgatcaaggactcttaccgctgccgtcgccggaaggtggtcgacccgtccg T4gap cacctgggcagcacgaaggcacgtgcaacgtcagcatggcggcactcgacaagttcgttgcggaacg catcttcaacaagatcaggcacgccgaaggcgacgaagagacgttggcgcttctgtgggaagccgcc cgacgcttcggcaagctcactgaggcgcctgagaagagcggcgaacgggcgaaccttgttgcggagc gcgccgacgccctgaacgcccttgaagagctgtacgaagaccgcgcggcaggcgcgtacgacggac ccgttggcaggaagcacttccggaagcaacaggcagcgctgacgctccggcagcaaggggcggaa gagcggcttgccgaacttgaagccgccgaagccccgaagcttccccttgaccaatggttccccgaaga cgccgacgctgacccgaccggccctaagtcgtggtgggggcgcgcgtcagtagacgacaagcgcgtg ttcgtcgggctcttcgtagacaagatcgttgtcacgaagtcgactacgggcagggggcagggaacgccc atcgagaagcgcgcttcgatcacgtgggcgaagccgccgaccgacgacgacgaagacgacgccca ggacggcacggaagacgtagcggcgtagcgagacacccgggaagcctgatctacgtctgtcgagaa gtttctgatcgaaaagttcgacagcgtctccgacctgatgcagctctcgcagggcgaagaatctcgtgcttt cagcttcgatgtaggagggcgtggatatgtcctgcgggtaaatagctgcgccgatggttCTCTGTCG TCGCTGACGTCTGTAGTCTAGCCTCATTATGATTGTACGCTATTCAGGG ATTGACTGATACCGGAAGACATCTCAAATGAAGTGGTCAAGCTTTATGC TTGTAAACCGTTTTGTGAAAAAATTTTTAAAATAAAAAAGGGGACCTCTA GGGTCCCCAATTAATTAGTAATATAATCTATTAAAGGTCATTCAAAAGGT CATCCAAGCTTGGCTGTTTTGGCGGATGAGAGAAGATTTTCAGCCTGA TACAGATTAAATCAGAACGCAGAAGCGGTCTGATAAAACAGAATTTGCC TGGCGGCAGTAGCGCGGTGGTCCCACCTGACCCCATGCCGAACTCAG AAGTGAAACGCCGTAGCGCCGATGGTAGTGTGGCCCATGCGAGAGTA CATATGGTACTGGCCGATGCTGGGAGAAGCGCGCTGCTGTACGGCGC GCACCGGGTGCGGAGCCCCTCGGCGAGCGGTGTGAAACTTCTGTGAA TGGCCTGTTCGGTTGCTTTTTTTATACGGCTGCCAGATAAGGCTTGCAG CATCTGGGCGGCTACCGCTATGATCGGGGCGTTCCTGCAATTCTTAGT GCGAGTATCTGAAAGGGGATACGCATGGTACCGAGACCTTATGTTGAT CGGCACTTTGCATCGGCCGCGCTCCCGATTCCGGAAGTGCTTGACATT GGGGAATTTATGCGGTGTGAAATACCGCACAGATGCGTAAGGAGAAAA TACCGCATCAGGCGCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAA GGGCGATCGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGG GGGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCA GTCACGACGTTGTAAAACGACGGCCAGTGCCAAGCTTGGGCTGCAGG TCGACTCTAGAGGATCCGCGGCCGCGCGCGATATCGAATTCGTAATCA TGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACA 
CAACATACGAGCCGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAATG AGTGAGCTAACTCACATTAATTGCGTTGCGCGGTCTCGGCGTTTCGTG CCGCGTGATTTTCCGCCAAAAACTTTAACGAACGTTCGTTATAATGGTG TCATGACCTTCACGACGAAGTACTAAAATTGGCCCGAATCATCAGCTAA GCTTTATGCTTGTAAACCGTTTTGTGAAAAAATTTTTAAAATAAAAAAGG GGACCTCTAGGGTCCCCAATTAATTAGTAATATAATCTATTAAAGGTCA TTCAAAAGGTCATCCACCTCACTTCGGTGAATCGAAGCGCGGCATCAG GGTTACTTTTTGGATACCTGAGACATTCGTCGCTTCCGGGTATGCGCT CTATGTGACGGTCTTTTGGCGCACAAATGCTCAGCACCATTTAAATTAG ACCGACTCCAGATCTGTAAGGTCCAACAAAACCCATCGTAGTCCTTAG ACTTGGCACACTTACACCTGCAGTGGATGACCTTTTGAATGACCTTTAA TAGATTATATTACTAATTAATTGGGGACCCTAGAGGTCCCCTTTTTTATT TTAAAAATTTTTTCACAAAACGGTTTACAAGCATAAAGCTTGCCACGCA GACGACAGCCCACGCTGACCGATCTACCTGAACGGCGACCATCTGTG TGGTACTGGGGCGGAGAGATAACTACGGTGCCGCTTACCGGgctcactca aaggcggtaatacggttatccacagaatcaggggataacgcaggaaagaacatgtgagcaaaaggc cagcaaaaggccaggaaccgtaaaaaggccgcgttgctggcgtttttccataggctccgcccccctgac gagcatcacaaaaatcgacgctcaagtcagaggtggcgaaacccgacaggactataaagataccag gcgtttccccctggaagctccctcgtgcgctctcctgttccgaccctgccgcttaccggatacctgtccgcctt tctcccttcgggaagcgtggcgctttctcatagctcacgctgtaggtatctcagttcggtgtaggtcgttcgct ccaagctgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttatccggtaactatcgtctt gagtccaacccggtaagacacgacttatcgccactggcagcagccactggtaacaggattagcagag cgaggtatgtaggcggtgctacagagttcttgaagtggtggcctaactacggctacactagaagaacagt atttggtatctgcgctctgctgaagccagttaccttcggaaaaagagttggtagctcttgatccggcaaaca aaccaccgctggtagcggtggtttttttgtttgcaagcagcagattacgcgcagaaaaaaaggatctcaa gaagatcctttgatcttttctacggggtctgacgctcagtggaacgaaaactcacgttaagggattttggtca tgagattatcaaaaaggatcttcacctagatccttttggttcatgtgcagctccatcagcaaaaggggatga taagtttatcaccaccgactatttgcaacagtgccgttgatcgtgctatgatcgactgatgtcatcagcggtg gagtgcaatgtcgtgcaatacgaatggcgaaaagccgagctcatcggtcagcttctcaaccttggggtta cccccggcggtgtgctgctggtccacagctccttccgtagcgtccggcccctcgaagatgggccacttgg actgatcgaggccctgcgtgctgcgctgggtccgggagggacgctcgtcatgccctcgtggtcaggtctg gacgacgagccgttcgatcctgccacgtcgcccgttacaccggaccttggagttgtctctgacacattctg 
gcgcctgccaaatgtaaagcgcagcgcccatccatttgcctttgcggcagcggggccacaggcagagc agatcatctctgatccattgcccctgccacctcactcgcctgcaagcccggtcgcccgtgtccatgaactc gatgggcaggtacttctcctcggcgtgggacacgatgccaacacgacgctgcatcttgccgagttgatgg caaaggttccctatggggtgccgagacactgcaccattcttcaggatggcaagttggtacgcgtcgattat ctcgagaatgaccactgctgtgagcgctttgccttggcggacaggtggctcaaggagaagagccttcag aaggaaggtccagtcggtcatgcctttgctcggttgatccgctcccgcgacattgtggcgacagccctgg gtcaactgggccgagatccgttgatcttcctgcatccgccagaggcgggatgcgaagaatgcgatgccg ctcgccagtcgattggctgagctcatgagcggagaacgagatgacgttggaggggcaaggtcgcgctg attgctggggcaacacgtggagcggatcggggattgtctttcttcagctcgctgatgatatgctgacgctca atgccgtttggcctccgactaacgaaaatcccgcatttggacggctgatccgattggcacggcggacgg cgaatggcggagcagacgctcgtccgggggcaatgagatatgaaaaagcctgaactcaccgcgacgt atcgggccctggccagctagctagagtcgacctgcaggtccccggggatcggtcttgccttgctcgtcggt gatgtacttcaccagctccgcgaagtcgctcttcttgatggagcgcatggggacgtgcttggcaatcacgc gcaccccccggccgttttagcggctaaaaaagtcatggctctgccctcgggcggaccacgcccatcatg accttgccaagctcgtcctgcttctcttcgatcttcgccagcagggcgaggatcgtggcatcaccgaaccg cgccgtgcgcgggtcgtcggtgagccagagtttcagcaggccgcccaggcggcccaggtcgccattga tgcgggccagctcgcggacgtgctcatagtccacgacgcccgtgattttgtagccctggccgacggcca gcaggtaggccgacaggctcatgccggccgccgccgccttttcctcaatcgctcttcgttcgtctggaagg cagtacaccttgataggtgggctgcccttcctggttggcttggtttcatcagccatccgcttgccctcatctg ttacgccggcggtagccggccagcctcgcagagcaggattcccgttgagcaccgccaggtgcgaataag ggacagtgaagaaggaacacccgctcgcgggtgggcctacttcacctatcctgcccggctgacgccgtt ggatacaccaaggaaagtctacacgaaccctttggcaaaatcctgtatatcgtgcgaaaaaggatggat ataccgaaaaaatcgctataatgaccccgaagcagggttatgcagcggaaaagatccgtcgacctgca ggcatgcaagctctagcgattccagacgtcccgaaggcgtggcgcggcttccccgtgccggagcaatc gccctgggtgggttacacgacgcccctctatggcccgtactgacggacacaccgaagccccggcggc aaccctcagcggatgccccggggcttcacgttttcccaggtcagaagcggttttcgggagtagtgcccca actggggtaacctttgagttctctcagttgggggcgtagggtcgccgacatgacacaaggggttgtgacc ggggtggacacgtacgcgggtgcttacgaccgtcagtcgcgcgagcgcgagaattcgagcgcagcaa 
gcccagcgacacagcgtagcgccaacgaagacaaggcggccgaccttcagcgcgaagtcgagcgc gacgggggccggttcaggttcgtcgggcatttcagcgaagcgccgggcacgtcggcgttcgggacggc ggagcgcccggagttcgaacgcatcctgaacgaatgccgcgccgggcggctcaacatgatcattgtct atgacgtgtcgcgcttctcgcgcctgaaggtcatggacgcgattccgattgtctcggaattgctcgccctgg gcgtgacgattgtttccactcaggaaggcgtcttccggcagggaaacgtcatggacctgattcacctgatt atgcggctcgacgcgtcgcacaaagaatcttcgctgaagtcggcgaagattctcgacacgaagaacctt cagcgcgaattgggcgggtacgtcggcgggaaggcgccttacggcttcgagcttgtttcggagacgaa ggagatcacgcgcaacggccgaatggtcaatgtcgtcatcaacaagcttgcgcactcgaccactcccct taccggacccttcgagttcgagcccgacgtaatccggtggtggtggcgtgagatcaagacgcacaaac accttcccttcaagccgggcagtcaagccgccattcacccgggcagcatcacggggctttgtaagcgca tggacgctgacgccgtgccgacccggggcgagacgattgggaagaagaccgcttcaagcgcctggg acccggcaaccgttatgcgaatccttcgggacccgcgtattgcgggcttcgccgctgaggtgatctacaa gaagaagccggacggcacgccgaccacgaagattgagggttaccgcattcagcgcgacccgatcac gctccggccggtcgagcttgattgcggaccgatcatcgagcccgctgagtggtatgagcttcaggcgtgg ttggacggcagggggcgcggcaaggggctttcccgggggcaagccattctgtccgccatggacaagct gtactgcgagtgtggcgccgtcatgacttcg 111 DNA >pSET aagcgcggggaagaatcgatcaaggactcttaccgctgccgtcgccggaaggtggtcgacccgtccg T4tip cacctgggcagcacgaaggcacgtgcaacgtcagcatggcggcactcgacaagttcgttgcggaacg catcttcaacaagatcaggcacgccgaaggcgacgaagagacgttggcgcttctgtgggaagccgcc cgacgcttcggcaagctcactgaggcgcctgagaagagcggcgaacgggcgaaccttgttgcggagc gcgccgacgccctgaacgcccttgaagagctgtacgaagaccgcgcggcaggcgcgtacgacggac ccgttggcaggaagcacttccggaagcaacaggcagcgctgacgctccggcagcaaggggcggaa gagcggcttgccgaacttgaagccgccgaagccccgaagcttccccttgaccaatggttccccgaaga cgccgacgctgacccgaccggccctaagtcgtggtgggggcgcgcgtcagtagacgacaagcgcgtg ttcgtcgggctcttcgtagacaagatcgttgtcacgaagtcgactacgggcagggggcagggaacgccc atcgagaagcgcgcttcgatcacgtgggcgaagccgccgaccgacgacgacgaagacgacgccca ggacggcacggaagacgtagcggcgtagcgagacacccgggaagcctgatctacgtctgtcgagaa gtttctgatcgaaaagttcgacagcgtctccgacctgatgcagctctcgcagggcgaagaatctcgtgcttt cagcttcgatgtaggagggcgtggatatgtcctgcgggtaaatagctgcgccgatggttCTCTGTCG 
TCGCTGACGTCTGTAGTCTAGCCTCATTATGATTGTACGCTATTCAGGG ATTGACTGATACCGGAAGACATCTCAAATGAAGTGGTCAAGCTTTATGC TTGTAAACCGTTTTGTGAAAAAATTTTTAAAATAAAAAAGGGGACCTCTA GGGTCCCCAATTAATTAGTAATATAATCTATTAAAGGTCATTCAAAAGGT CATCCAAGCTTGGCTGTTTTGGCGGATGAGAGAAGATTTTCAGCCTGA TACAGATTAAATCAGAACGCAGAAGCGGTCTGATAAAACAGAATTTGCC TGGCGGCAGTAGCGCGGTGGTCCCACCTGACCCCATGCCGAACTCAG AAGTGAAACGCCGTAGCGCCGATGGTAGTGTGGCCCATGCGAGAGTA CAATCCCTAGAACGTCCGGGCTTGCACCTCACGTCACGTGAGGAGGC AGCGTGGACGGCGTGGTACCAAGCTTATTGGCACTAGTCGAGCAACG GAGGTATTCCGATGGTACCGAGACCTTATGTTGATCGGCACTTTGCAT CGGCCGCGCTCCCGATTCCGGAAGTGCTTGACATTGGGGAATTTATGC GGTGTGAAATACCGCACAGATGCGTAAGGAGAAAATACCGCATCAGGC GCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTG CGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGC AAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACGACGTTG TAAAACGACGGCCAGTGCCAAGCTTGGGCTGCAGGTCGACTCTAGAG GATCCGCGGCCGCGCGCGATATCGAATTCGTAATCATGTCATAGCTGT TTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACACAACATACGAGC CGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAATGAGTGAGCTAACT CACATTAATTGCGTTGCGCGGTCTCGGCGTTTCGTGCCGCGTGATTTT CCGCCAAAAACTTTAACGAACGTTCGTTATAATGGTGTCATGACCTTCA CGACGAAGTACTAAAATTGGCCCGAATCATCAGCTAAGCTTTATGCTTG TAAACCGTTTTGTGAAAAAATTTTTAAAATAAAAAAGGGGACCTCTAGG GTCCCCAATTAATTAGTAATATAATCTATTAAAGGTCATTCAAAAGGTCA TCCACCTCACTTCGGTGAATCGAAGCGCGGCATCAGGGTTACTTTTTG GATACCTGAGACATTCGTCGCTTCCGGGTATGCGCTCTATGTGACGGT CTTTTGGCGCACAAATGCTCAGCACCATTTAAATTAGACCGACTCCAGA TCTGTAAGGTCCAACAAAACCCATCGTAGTCCTTAGACTTGGCACACTT ACACCTGCAGTGGATGACCTTTTGAATGACCTTTAATAGATTATATTACT AATTAATTGGGGACCCTAGAGGTCCCCTTTTTTATTTTAAAAATTTTTTC ACAAAACGGTTTACAAGCATAAAGCTTGCCACGCAGACGACAGCCCAC GCTGACCGATCTACCTGAACGGCGACCATCTGTGTGGTACTGGGGCG GAGAGATAACTACGGTGCCGCTTACCGGgctcactcaaaggcggtaatacggttatc cacagaatcaggggataacgcaggaaagaacatgtgagcaaaaggccagcaaaaggccaggaac cgtaaaaaggccgcgttgctggcgtttttccataggctccgcccccctgacgagcatcacaaaaatcgac gctcaagtcagaggtggcgaaacccgacaggactataaagataccaggcgtttccccctggaagctcc ctcgtgcgctctcctgttccgaccctgccgcttaccggatacctgtccgcctttctcccttcgggaagcgtgg 
cgctttctcatagctcacgctgtaggtatctcagttcggtgtaggtcgttcgctccaagctgggctgtgtgcac gaaccccccgttcagcccgaccgctgcgccttatccggtaactatcgtcttgagtccaacccggtaagac acgacttatcgccactggcagcagccactggtaacaggattagcagagcgaggtatgtaggcggtgct acagagttcttgaagtggtggcctaactacggctacactagaagaacagtatttggtatctgcgctctgctg aagccagttaccttcggaaaaagagttggtagctcttgatccggcaaacaaaccaccgctggtagcggt ggtttttttgtttgcaagcagcagattacgcgcagaaaaaaaggatctcaagaagatcctttgatcttttctac ggggtctgacgctcagtggaacgaaaactcacgttaagggattttggtcatgagattatcaaaaaggatc ttcacctagatccttttggttcatgtgcagctccatcagcaaaaggggatgataagtttatcaccaccgacta tttgcaacagtgccgttgatcgtgctatgatcgactgatgtcatcagcggtggagtgcaatgtcgtgcaata cgaatggcgaaaagccgagctcatcggtcagcttctcaaccttggggttacccccggcggtgtgctgctg gtccacagctccttccgtagcgtccggcccctcgaagatgggccacttggactgatcgaggccctgcgtg ctgcgctgggtccgggagggacgctcgtcatgccctcgtggtcaggtctggacgacgagccgttcgatcc tgccacgtcgcccgttacaccggaccttggagttgtctctgacacattctggcgcctgccaaatgtaaagc gcagcgcccatccatttgcctttgcggcagcggggccacaggcagagcagatcatctctgatccattgcc cctgccacctcactcgcctgcaagcccggtcgcccgtgtccatgaactcgatgggcaggtacttctcctcg gcgtgggacacgatgccaacacgacgctgcatcttgccgagttgatggcaaaggttccctatggggtgc cgagacactgcaccattcttcaggatggcaagttggtacgcgtcgattatctcgagaatgaccactgctgt gagcgctttgccttggcggacaggtggctcaaggagaagagccttcagaaggaaggtccagtcggtca tgcctttgctcggttgatccgctcccgcgacattgtggcgacagccctgggtcaactgggccgagatccgtt gatcttcctgcatccgccagaggcgggatgcgaagaatgcgatgccgctcgccagtcgattggctgagc tcatgagcggagaacgagatgacgttggaggggcaaggtcgcgctgattgctggggcaacacgtgga gcggatcggggattgtctttcttcagctcgctgatgatatgctgacgctcaatgccgtttggcctccgactaa cgaaaatcccgcatttggacggctgatccgattggcacggcggacggcgaatggcggagcagacgct cgtccgggggcaatgagatatgaaaaagcctgaactcaccgcgacgtatcgggccctggccagctag ctagagtcgacctgcaggtccccggggatcggtcttgccttgctcgtcggtgatgtacttcaccagctccgc gaagtcgctcttcttgatggagcgcatggggacgtgcttggcaatcacgcgcaccccccggccgttttagc ggctaaaaaagtcatggctctgccctcgggcggaccacgcccatcatgaccttgccaagctcgtcctgct tctcttcgatcttcgccagcagggcgaggatcgtggcatcaccgaaccgcgccgtgcgcgggtcgtcggt 
gagccagagtttcagcaggccgcccaggcggcccaggtcgccattgatgcgggccagctcgcggacg tgctcatagtccacgacgcccgtgattttgtagccctggccgacggccagcaggtaggccgacaggctc atgccggccgccgccgccttttcctcaatcgctcttcgttcgtctggaaggcagtacaccttgataggtggg ctgcccttcctggttggcttggtttcatcagccatccgcttgccctcatctgttacgccggcggtagccggcca gcctcgcagagcaggattcccgttgagcaccgccaggtgcgaataagggacagtgaagaaggaaca cccgctcgcgggtgggcctacttcacctatcctgcccggctgacgccgttggatacaccaaggaaagtct acacgaaccctttggcaaaatcctgtatatcgtgcgaaaaaggatggatataccgaaaaaatcgctata atgaccccgaagcagggttatgcagcggaaaagatccgtcgacctgcaggcatgcaagctctagcgat tccagacgtcccgaaggcgtggcgcggcttccccgtgccggagcaatcgccctgggtgggttacacga cgcccctctatggcccgtactgacggacacaccgaagccccggcggcaaccctcagcggatgccccg gggcttcacgttttcccaggtcagaagcggttttcgggagtagtgccccaactggggtaacctttgagttctc tcagttgggggcgtagggtcgccgacatgacacaaggggttgtgaccggggtggacacgtacgcgggt gcttacgaccgtcagtcgcgcgagcgcgagaattcgagcgcagcaagcccagcgacacagcgtagc gccaacgaagacaaggcggccgaccttcagcgcgaagtcgagcgcgacgggggccggttcaggttc gtcgggcatttcagcgaagcgccgggcacgtcggcgttcgggacggcggagcgcccggagttcgaac gcatcctgaacgaatgccgcgccgggcggctcaacatgatcattgtctatgacgtgtcgcgcttctcgcgc ctgaaggtcatggacgcgattccgattgtctcggaattgctcgccctgggcgtgacgattgtttccactcag gaaggcgtcttccggcagggaaacgtcatggacctgattcacctgattatgcggctcgacgcgtcgcac aaagaatcttcgctgaagtcggcgaagattctcgacacgaagaaccttcagcgcgaattgggcgggta cgtcggcgggaaggcgccttacggcttcgagcttgtttcggagacgaaggagatcacgcgcaacggcc gaatggtcaatgtcgtcatcaacaagcttgcgcactcgaccactccccttaccggacccttcgagttcgag cccgacgtaatccggtggtggtggcgtgagatcaagacgcacaaacaccttcccttcaagccgggcag tcaagccgccattcacccgggcagcatcacggggctttgtaagcgcatggacgctgacgccgtgccga cccggggcgagacgattgggaagaagaccgcttcaagcgcctgggacccggcaaccgttatgcgaat ccttcgggacccgcgtattgcgggcttcgccgctgaggtgatctacaagaagaagccggacggcacgc cgaccacgaagattgagggttaccgcattcagcgcgacccgatcacgctccggccggtcgagcttgatt gcggaccgatcatcgagcccgctgagtggtatgagcttcaggcgtggttggacggcagggggcgcggc aaggggctttcccgggggcaagccattctgtccgccatggacaagctgtactgcgagtgtggcgccgtc atgacttcg

    DETAILED DESCRIPTION

    Definitions

    (37) Unless otherwise defined, all scientific and technical terms used in the description, Figures and claims have their ordinary meaning as commonly understood by one of ordinary skill in the art. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety. In case of conflict, the present specification, including definitions, will prevail. If two or more documents incorporated by reference include conflicting and/or inconsistent disclosure with respect to each other, then the document having the later effective date shall control. The materials, methods, and examples are illustrative only and not intended to be limiting. Unless stated otherwise, the following terms used in this document, including the description and claims, have the definitions given below.

    (38) The terms "comprising", "including", "containing", "having" etc. shall be read expansively or open-ended and without limitation. Singular forms such as "a", "an" or "the" include plural references unless the context clearly indicates otherwise. Unless otherwise indicated, the term "at least" preceding a series of elements is to be understood to refer to every element in the series. The terms "at least one" and "at least one of" include, for example, one, two, three, four, five, six, seven, eight, nine, ten or more elements.

    (39) It is furthermore understood that slight variations above and below a stated range can be used to achieve substantially the same results as a value within the range. Also, unless indicated otherwise, the disclosure of ranges is intended as a continuous range including every value between the minimum and maximum values.

    (40) Where protein or amino acid sequences are provided throughout the application, it is also understood by the skilled person that single or multiple amino acids may be exchanged for amino acids with similar properties to achieve substantially the same effect, i.e. an equivalent result. The skilled person furthermore knows that a defined protein or amino acid sequence may be encoded by various nucleic acid sequences. For a given amino acid sequence as defined herein, each of the countable nucleic acid sequences encoding the specific amino acid sequence shall be deemed to be disclosed herein. Where nucleic acid sequences are provided throughout the application, it is furthermore understood that silent mutations may be introduced.
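
    The statement that a given amino acid sequence is encoded by a countable set of nucleic acid sequences can be illustrated with a short calculation. The following is a minimal sketch, not part of the disclosed methods: it assumes the standard genetic code (sense-codon assignments only, stop codons ignored), and the function name and table are hypothetical helpers introduced for illustration.

```python
# Number of synonymous sense codons per amino acid in the standard genetic code.
# Assumption: the standard code applies; the values sum to the 61 sense codons.
CODON_DEGENERACY = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2,
    "Q": 2, "E": 2, "G": 4, "H": 2, "I": 3,
    "L": 6, "K": 2, "M": 1, "F": 2, "P": 4,
    "S": 6, "T": 4, "W": 1, "Y": 2, "V": 4,
}


def count_encoding_sequences(protein: str) -> int:
    """Return the number of distinct coding sequences for a protein.

    The count is the product of the codon degeneracies of all residues,
    because each residue can be encoded independently of the others.
    """
    total = 1
    for residue in protein.upper():
        if residue not in CODON_DEGENERACY:
            raise ValueError(f"Unknown amino acid code: {residue!r}")
        total *= CODON_DEGENERACY[residue]
    return total


if __name__ == "__main__":
    # Hypothetical short peptide, used only to show the calculation.
    print(count_encoding_sequences("MLAR"))
```

    The count grows multiplicatively with sequence length, which is why the set of encoding sequences is finite but very large for a full-length protein.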

    (41) O-{4,6-dideoxy-4-[1S-(1,4,6/5)-4,5,6-trihydroxy-3-hydroxymethyl-2-cyclohexen-1-yl]-amino-α-D-glucopyranosyl}-(1->4)-O-α-D-glucopyranosyl-(1->4)-D-glucopyranose, or acarbose, is a cyclitol-containing aminoglycoside composed of a pseudodisaccharide and an α-1,4-glycosidically bound maltose (Wehmeier and Piepersberg 2009). The pseudodisaccharide, named acarviose, is built from an unsaturated C7-aminocyclitol, also referred to as valienol or valienamine, which is connected to C4 of a 4,6-dideoxy-D-glucose by a nitrogen bond (cf. FIG. 5) (Wehmeier and Piepersberg 2009). This N-glycosidic bond cannot be hydrolyzed by foreign α-1,4-glucoside hydrolases, leading to an almost irreversible inhibitory effect (Wehmeier and Piepersberg 2009; Brayer et al. 2000).

    (42) Overexpression of a gene product or protein as described herein refers to an increase in expression compared to the wild type or a specified reference strain. Preferably, the reference strain or control is the strain which has not been engineered for the specific overexpression of the respective gene(s) or protein(s). For example, the control does not comprise a vector comprising an expression cassette for the respective gene product or protein. For example, the overexpression of the gene product may be an increase during the early growth phase, during the linear growth phase, during the stationary phase or an increase during any other time. Preferably, overexpression is an increase of the gene product or protein by a factor of at least 1.5 or at least a factor of 2 compared to the control. With regard to transcript amounts and if not defined otherwise herein, strong overexpression refers to a log2(fold change) > 6. With regard to transcript amounts and if not defined otherwise herein, weak overexpression refers to a log2(fold change) < 2. With regard to transcript amounts and if not defined otherwise herein, medium strong overexpression refers to a log2(fold change) ≥ 2 and ≤ 6.
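
    The transcript-level categories defined above can be written as a simple threshold check. This is a minimal sketch under stated assumptions: the medium strong band is read as 2 to 6 inclusive, a log2(fold change) of zero or below is treated as "not overexpressed" (this guard is not stated in the text), and the function name is hypothetical.

```python
def classify_overexpression(log2_fold_change: float) -> str:
    """Classify transcript overexpression relative to the control strain.

    Thresholds follow the definitions in the text: strong > 6, weak < 2,
    medium strong otherwise. Assumption: values <= 0 mean no overexpression.
    """
    if log2_fold_change <= 0:
        return "not overexpressed"
    if log2_fold_change > 6:
        return "strong"
    if log2_fold_change < 2:
        return "weak"
    return "medium strong"


if __name__ == "__main__":
    # Hypothetical log2(fold change) values, used only for illustration.
    for value in (0.5, 2.0, 4.3, 6.0, 7.1):
        print(value, classify_overexpression(value))
```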

    (43) The expression of a gene product or protein as described herein is absent or reduced if the respective gene has been deleted or mutated in such a way that its gene product is not expressed at all or is expressed in a significantly decreased amount (e.g. less than 0.75 fold or less than 0.5 fold). The expression of a gene product or protein as described herein is also considered absent or reduced if the gene product or protein has lost functionality, e.g. in a transient or permanent way, e.g. by mutation or knockdown. Methods to monitor the amount and/or activity of a gene product or protein are known in the art and are also described herein in an exemplary way. In general, suitable methods to obtain an absent or reduced expression of a gene product are methods that alter the genetic sequence or elements of gene expression (e.g. by deletion or point mutations) and/or methods that negatively affect the transcription and translation of a gene or the activity or half-life of the gene product (protein).

    (44) If not specified otherwise, the symbol Δ refers to a deletion mutant, i.e. a mutant wherein a specific gene sequence has been at least partially deleted.

    (45) The early growth phase is the time in which the Actinoplanes strain adapts to the medium and in which the cell dry weight is below 3 g·L⁻¹. After adaptation to the environment, the culture metabolizes the nutrients supplied by the medium and starts to grow. Since Actinoplanes grows as a spherical mycelium, which can only expand at the outside of the sphere, the cells in the middle are shielded from nutrients and have only limited space for cell division. Therefore, only the cells in the outer layer of the spherical mycelium divide. As a result, growth of Actinoplanes is linear and not exponential, in contrast to other bacteria, which grow as single cells. The growth phase is called linear growth phase for Actinoplanes spp. and starts at a cell dry weight of 3 g·L⁻¹. The stationary phase is defined as the growth phase in which the cells reach the capacity limits (of space and nutrients) or in which growth decreases due to the formation of inhibitory by-products or other chemical and physical factors such as changes in the osmolarity or pH. The stationary phase is the growth phase in which the number of dying cells equals the number of dividing cells. This phase usually starts at a cell dry weight of 16-18 g·L⁻¹ in maltose minimal medium.
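
    As a rough illustration of the growth phase definitions above, the cell dry weight can be mapped to a phase label. This is only a sketch: the stationary phase is defined in the text by growth behavior rather than by a fixed cell dry weight, so the default onset of 16 g/L (the lower end of the stated 16-18 g/L range for maltose minimal medium) is an assumption, and the function and parameter names are hypothetical.

```python
def growth_phase(cell_dry_weight: float, stationary_onset: float = 16.0) -> str:
    """Return a growth phase label for a cell dry weight given in g/L.

    Below 3 g/L: early growth phase. From 3 g/L up to the assumed
    stationary onset: linear growth phase. At or above it: stationary phase.
    """
    if cell_dry_weight < 0:
        raise ValueError("cell dry weight must not be negative")
    if cell_dry_weight < 3.0:
        return "early"
    if cell_dry_weight < stationary_onset:
        return "linear"
    return "stationary"


if __name__ == "__main__":
    # Hypothetical cell dry weights in g/L, used only for illustration.
    for cdw in (1.2, 3.0, 9.5, 17.0):
        print(cdw, growth_phase(cdw))
```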

    (46) The term vector, as used herein, refers to a nucleic acid molecule capable of propagating a nucleic acid molecule to which it is linked.

    (47) The term expression cassette, as used herein, refers to a nucleic acid molecule comprising at least a gene for expression and a regulatory sequence, such as a promoter.

    (48) A promoter is a nucleic acid sequence which leads to initiation of transcription of a particular gene.

    (49) A strong promoter as defined herein is a promoter which leads to a normalized glucuronidase activity of at least 5·10⁻⁴ [L·g⁻¹·min⁻¹] in the glucuronidase assay, and/or which leads to a 350-fold relative transcription (in log2(fold change)) of the gusA gene compared to the promoterless pGUS control vector. A detailed description of a method for characterizing the strength of a promoter is provided within the examples and in (Schaffert, et al. 2019). Examples include the promoters of apm: 9.2·10⁻⁴ [L·g⁻¹·min⁻¹] and log2(fold change)=360.78; ermE*: 9.7·10⁻⁴ [L·g⁻¹·min⁻¹] and log2(fold change)=291.03; katE: 5.1·10⁻⁴ [L·g⁻¹·min⁻¹] and log2(fold change)=342.51; moeE5: 9.7·10⁻⁴ [L·g⁻¹·min⁻¹] and log2(fold change)=329.32; gapDH: 11.5·10⁻⁴ [L·g⁻¹·min⁻¹] and log2(fold change)=931.45; and actP: 22.9·10⁻⁴ [L·g⁻¹·min⁻¹].

    (50) A medium strong promoter is defined as a promoter which leads to a normalized glucuronidase activity of at least 1·10⁻⁴ [L·g⁻¹·min⁻¹] in the glucuronidase assay, and/or which leads to a 10-fold relative transcription (in log2(fold change)) of the gusA gene compared to the promoterless pGUS control vector. Examples include the promoters of efp: 3.1·10⁻⁴ [L·g⁻¹·min⁻¹] and log2(fold change)=53.08; cdaR: 3.1·10⁻⁴ [L·g⁻¹·min⁻¹] and log2(fold change)=86.82; rpsL: 3.5·10⁻⁴ [L·g⁻¹·min⁻¹] and log2(fold change)=98.53; rpsJ: 3.7·10⁻⁴ [L·g⁻¹·min⁻¹] and log2(fold change)=123.97; cgt: 2.5·10⁻⁴ [L·g⁻¹·min⁻¹] and log2(fold change)=347.29; and tipA: 4.2·10⁻⁴ [L·g⁻¹·min⁻¹] and log2(fold change)=191.

    (51) In some cases, the medium strong promoter leads to a normalized glucuronidase activity of at least 1·10⁻⁴ [L·g⁻¹·min⁻¹] and maximal 5·10⁻⁴ [L·g⁻¹·min⁻¹] in the glucuronidase assay.

    (52) A weak promoter is defined as a promoter which leads to a normalized glucuronidase activity of below 1·10⁻⁴ [L·g⁻¹·min⁻¹], and/or which leads to a relative transcription of below 10-fold compared to the promoterless pGUS control vector.
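
    The promoter strength classes defined above can be sketched as a threshold check on the normalized glucuronidase activity. This is a minimal sketch under stated assumptions: the thresholds are read as 5·10⁻⁴ and 1·10⁻⁴ L·g⁻¹·min⁻¹ (negative exponents, consistent with the example values given for the medium strong promoters), only the activity criterion is used and the alternative transcript criterion is ignored, and an activity of exactly 5·10⁻⁴ is assigned to the strong class. The names are hypothetical.

```python
# Assumed thresholds for the normalized glucuronidase activity in L/(g*min).
STRONG_MIN = 5e-4
MEDIUM_MIN = 1e-4


def classify_promoter(activity: float) -> str:
    """Classify a promoter by its normalized glucuronidase activity."""
    if activity >= STRONG_MIN:
        return "strong"
    if activity >= MEDIUM_MIN:
        return "medium strong"
    return "weak"


# Activities taken from the examples listed in the text (assumed exponent -4).
EXAMPLE_ACTIVITIES = {
    "apm": 9.2e-4,
    "katE": 5.1e-4,
    "gapDH": 11.5e-4,
    "efp": 3.1e-4,
    "cgt": 2.5e-4,
    "tipA": 4.2e-4,
}

if __name__ == "__main__":
    for name, activity in EXAMPLE_ACTIVITIES.items():
        print(name, classify_promoter(activity))
```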

    (53) The term Cgt (ACSP50_5024, previously: ACPL_5091) refers to the extracellular small carbohydrate binding protein, previously described as cyclomaltodextrin glucanotransferase due to the high similarity to the C-terminal domain of cyclodextrin glycosyltransferases, obtained from Actinoplanes sp., e.g. strain ATCC 31044/CBS 674.73/SE50/110. Cgt protein is encoded by the gene cgt. Sequence(s) are described herein (SEQ ID No. 20) or are accessible via UniProt Identifier G8S155 (G8S155_ACTS5). Different isoforms and variants may exist for the different strains and are all comprised by the term. Where a specific mutation can be exchanged without changing the described catalytic properties of the initial sequence, it is clear that the sequence having such a functionally silent mutation is equivalent with regard to the initial sequence. In addition, the protein may furthermore be subject to various modifications, e.g. synthetic or naturally occurring modifications.

    (54) The term AcbB (ACSP50_3608, previously ACPL_3681) refers to dTDP-D-glucose-4,6-dehydratase obtained from Actinoplanes sp., e.g. strain ATCC 31044/CBS 674.73/SE50/110, which is probably involved in the biosynthesis of the acarviose moiety of acarbose. AcbB protein is encoded by the gene acbB. Sequence(s) are described herein (SEQ ID No. 13) or are accessible via UniProt Identifier Q9ZAE8 (RMLB_ACTS5). Different isoforms and variants may exist for the different strains and are all comprised by the term. Where a specific mutation can be exchanged without changing the described catalytic properties of the initial sequence, it is clear that the sequence having such a functionally silent mutation is equivalent with regard to the initial sequence. In addition, the protein may furthermore be subject to various modifications, e.g. synthetic or naturally occurring modifications.

    (55) The term GtaB, also GalU (ACSP50_7820, previously ACPL_7811), refers to UTP-glucose-1-phosphate uridylyltransferase obtained from Actinoplanes sp., e.g. strain ATCC 31044/CBS 674.73/SE50/110. GtaB seems to catalyze the interconversion of glucose-1P and UDP-glucose and might be involved in the precursor supply for acarbose. GtaB protein is encoded by the gene gtaB. Sequence(s) are described herein (SEQ ID No. 19) or are accessible via UniProt Identifier G8S608 (ACPL_7811). Different isoforms and variants may exist for the different strains and are all comprised by the term. Where a specific mutation can be exchanged without changing the described catalytic properties of the initial sequence, it is clear that the sequence having such a functionally silent mutation is equivalent with regard to the initial sequence. In addition, the protein may furthermore be subject to various modifications, e.g. synthetic or naturally occurring modifications.

    (56) As defined herein, a gene which is essential for carotenoid synthesis is a gene which is positively required for the synthesis of a carotenoid. Actinoplanes are known to produce a variety of soluble pigments including yellow, orange and pink pigments of the class carotenoids. In Actinoplanes, the set of genes which are essential for carotenoid synthesis includes genes from the MEP/DOXP pathway, genes of terpene cluster 1, genes of terpene cluster 2a, genes of terpene cluster 2b and genes of camphene-like monoterpene biosynthesis terpene cluster 3. Genes of the MEP/DOXP pathway comprise i. 1-deoxy-D-xylulose-5-phosphate synthase gene dxs (ACSP50_7096, SEQ ID No. 23), ii. 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase gene ispG (ACSP50_7248, SEQ ID No. 24), iii. 1-deoxy-D-xylulose-5-phosphate reductoisomerase gene dxr (ACSP50_7250, SEQ ID No. 25), iv. 4-hydroxy-3-methylbut-2-enyl diphosphate reductase gene ispH (ACSP50_7707, SEQ ID No. 26), v. 4-(cytidine 5'-diphospho)-2-C-methyl-D-erythritol kinase gene ispE (ACSP50_7802, SEQ ID No. 27), vi. 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase gene ispF (ACSP50_8046, SEQ ID No. 28), and/or vii. 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase gene ispD (ACSP50_8047, SEQ ID No. 29).
    Genes of Terpene Cluster 1 Comprise i. isopentenyl-diphosphate delta-isomerase gene idi (ACSP50_0146, SEQ ID No. 30), ii. zeta-phytoene desaturase gene crtI (ACSP50_0147, SEQ ID No. 10), iii. polyprenyl synthetase gene crtE/IdsA (ACSP50_0148, SEQ ID No. 31), iv. phytoene synthase gene crtB (ACSP50_0149, SEQ ID No. 32), v. deoxyribodipyrimidine photo-lyase gene (ACSP50_0150, SEQ ID No. 33), or vi. pyridine nucleotide-disulfide oxidoreductase gene (ACSP50_0151, SEQ ID No. 34).
    Genes of Terpene Cluster 2a Comprise i. transcriptional regulator gene (ACSP50_1631, SEQ ID No. 35), ii. lycopene cyclase gene (ACSP50_1632, SEQ ID No. 36), iii. lycopene cyclase gene (ACSP50_1633, SEQ ID No. 37), iv. polyprenyl synthetase (farnesyl pyrophosphate synthetase 2) gene fps2/crtE (ACSP50_1634, SEQ ID No. 38), and v. methylenetetrahydrofolate reductase (NADPH) gene (ACSP50_1635, SEQ ID No. 39).
    Genes of Terpene Cluster 2b Comprise i. LysR-family transcriptional regulator gene (ACSP50_1650, SEQ ID No. 40), ii. methyltransferase type 11 gene (ACSP50_1651, SEQ ID No. 41), iii. CDP-alcohol phosphatidyltransferase gene pgsA (ACSP50_1652, SEQ ID No. 42), iv. zeta-phytoene desaturase (crtI-family) gene crtD (ACSP50_1653, SEQ ID No. 43), v. glycosyl transferase gene cruC (ACSP50_1654, SEQ ID No. 44), vi. hypothetical protein (putative membrane protein) gene cruF (ACSP50_1655, SEQ ID No. 45), vii. GCN5 family acetyltransferase gene (ACSP50_1656, SEQ ID No. 46), viii. monooxygenase gene (ACSP50_1657, SEQ ID No. 47), and ix. short-chain dehydrogenase gene (ACSP50_1658, SEQ ID No. 48).

    (57) Another gene which is essential for carotenoid synthesis is polyprenyl synthetase gene crtE (ACSP50_3873, SEQ ID No. 49).

    (58) Genes of Camphene-Like Monoterpene Biosynthesis Terpene Cluster 3 Comprise

    (59) i. transcriptional regulator (Crp/Fnr family) gene eshA (ACSP50_1949, SEQ ID No. 104), ii. camphene synthase gene (ACSP50_1950, SEQ ID No. 50), iii. methyltransferase (SAM-dependent) type 11 gene (ACSP50_1951, SEQ ID No. 105), iv. glycosyl-hydrolase gene (ACSP50_1952, SEQ ID No. 106), and v. oxidoreductase/aldo/ketoreductase gene (ACSP50_1953, SEQ ID No. 107).

    EMBODIMENTS

    (60) While the Actinomycetales strain Actinoplanes sp. SE50/110 was used as a model strain for the current invention, it is clear to the skilled person that the general mechanisms and findings can be applied to other acarbose-producing strains, such as those strains which are currently used for the commercial production of acarbose. According to some embodiments, the Actinomycetales strain is a Micromonosporaceae strain. According to some embodiments, the Actinomycetales strain is an Actinoplanes strain. According to some embodiments, the Actinomycetales strain is Actinoplanes SE50 (ATCC 31042, CBS 961.70) (Frommer et al. 1973), Actinoplanes sp. SE50/110 (ATCC 31044, CBS 674.73) or an Actinoplanes strain derived therefrom. In some embodiments, the Actinomycetales strain is an Actinoplanes strain which is commercially used for acarbose production. In some embodiments, the Actinomycetales strain is an Actinoplanes strain which is commercially used for acarbose production, such as SN223-29-47, C445-P47, SN12755-38, SC3687-18-43, SC7177-40-17 or SN19910-37-21 as disclosed e.g. in EP 2601209 B1 and CN103298828 B, or a strain derived therefrom.

    (61) Improvement of acarbose production refers to an increase in yield of acarbose over a specific time (either in total or relative to cell growth) and/or improvement of the purity of the acarbose, e.g. the decrease of side-products and/or acarbose analogs such as component C. Cultivation of the Actinoplanes strain can occur as known in the art or as described herein. In some embodiments, cultivation of the Actinoplanes strain occurs in maltose minimal medium. According to a first aspect of the current invention, there is provided a method to engineer an Actinomycetales strain, such as an Actinoplanes strain, for the improved production of acarbose.

    (62) According to some first embodiments according to the first aspect, the method according to the first aspect comprises engineering the Actinomycetales strain for absent or reduced expression of extracellular small carbohydrate binding protein Cgt (SEQ ID No. 20).

    (63) Surprisingly, deletion of the gene encoding carbohydrate binding protein Cgt (SEQ ID No. 20) resulted in an improved production of acarbose. An increase of the final acarbose yield between 8.3 and 16.6% was achieved in three independent shake flask cultivations (cf. example "Δcgt displays improved acarbose formation on maltose minimal medium", FIG. 18, FIG. 19, Table E10, Table E11).

    (64) Furthermore, in comparison with the wild type, the gene deletion mutant Δcgt displayed no apparent growth phenotype in screening experiments testing for different carbon sources or under carbon-limited conditions (cf. examples "Analysis of cgt expression during growth on different carbon sources", "Δcgt on different carbon sources or under carbon-limited conditions", FIG. 12, FIG. 13, FIG. 14), or under pH and osmolyte stress (cf. example "Δcgt has no impact on osmolarity- or pH-tolerance", FIG. 15, FIG. 16, FIG. 17). The inventors could furthermore show that deletion of cgt had no negative impact on the expression of acarbose biosynthesis genes (cf. example "Δcgt has no impact on the expression of acarbose biosynthesis genes", FIG. 20).

    (65) Without being bound by theory, Cgt was found to be highly expressed in Actinoplanes sp. SE50/110 according to comprehensive studies of the extracellular proteome (Wendler et al. 2013; Ortseifen 2016) and transcriptome (Schwientek et al. 2013). Its gene product is exported into the extracellular space, making up about 8% of the whole secreted proteome. The inventors have analyzed the distribution of CBM-20 single-domain proteins in the prokaryotic world by BlastP analysis. Interestingly, singular CBM-20 domain proteins were found in only 17 other species (cf. example "Distribution of single-domain CBM-20 proteins in the eubacterial world"). Most of these are found in species of the order Actinomycetales, for example in all strains of the genus Actinoplanes. Without being bound by theory, deletion or reduced expression of cgt frees energy and resources, such as ATP and amino acids. These resources may then be redirected to the acarbose biosynthesis, acarbose being a growth-associated product.

    (66) According to some embodiments according to the first aspect, the method comprises deletion or mutation of the gene encoding extracellular small carbohydrate binding protein Cgt (SEQ ID No. 20). The establishment of an intergeneric conjugation system (Gren et al. 2016) and the CRISPR/Cas9 technique (Wolf et al. 2016) allow genome editing in Actinoplanes sp. SE50/110. In some embodiments according to the first aspect, engineering the Actinomycetales strain for absent or reduced expression may occur using the CRISPR/Cas9 technique. In some embodiments, engineering the Actinomycetales strain for absent or reduced expression may occur as described by (Wolf et al. 2016). In some embodiments, engineering the Actinomycetales strain for absent or reduced expression may occur as described herein, e.g. as described in the example "Deletion of the gene cgt by CRISPR/Cas9 technique" or "Deletion system based on homologous recombination and counterselection with the cytosine deaminase CodA". For example, the inventors have successfully established a novel deletion system by homologous recombination, which uses an integrase-free vector backbone and CodA for counterselection, as described by Zhao et al. (2017).

    (67) According to some second embodiments according to the first aspect, the method according to the first aspect comprises engineering the Actinomycetales strain for absent or reduced expression of at least one gene which is essential for carotenoid synthesis. In some embodiments, the carotenoid is the orange pigment of Actinoplanes or a derivative thereof. In some different or the same embodiments, the carotenoid is a C40-carotenoid.

    (68) Engineering the Actinomycetales strain for absent or reduced expression may occur as described previously for the current aspect. According to some embodiments according to the first aspect, the method comprises deletion or mutation of the gene which is essential for carotenoid synthesis.

    (69) Actinoplanes are known to produce a variety of soluble pigments including yellow, orange and pink pigments of the class carotenoids (Parenti and Coronelli 1979). The inventors observed that strong pigmentation was associated with acarbose production losses. This was confirmed by comparing growth and acarbose yields of cultures exposed to and shielded from light (cf. example "Light-dependent carotenoid-formation and oxidative stress reduce acarbose production in Actinoplanes sp. SE50/110", FIG. 22). When cultures were exposed to bulb light, carotenoid formation was induced, while acarbose production and growth of Actinoplanes sp. SE50/110 were strongly reduced (FIG. 22). In total, a loss of 39% of the final acarbose concentration was monitored.

    (70) From these findings it is not only plausible that the produced pigments are not essential (e.g. in a technical setup for commercial acarbose production) but also that reducing or depleting the carotenoid synthesis in Actinoplanes can be used to improve the acarbose formation. To this end, the method according to the first aspect comprises reducing or depleting the expression of at least one gene which is essential for carotenoid synthesis.

    (71) The inventors could furthermore reconstruct the carotenogenesis in Actinoplanes sp. SE50/110 (cf. example "Analysis of the functional relevance of carotenoid formation", FIG. 21). The set of genes which are essential for carotenoid synthesis in Actinoplanes includes genes from the MEP/DOXP pathway, genes of terpene cluster 1, genes of terpene cluster 2a, genes of terpene cluster 2b and genes of camphene-like monoterpene biosynthesis terpene cluster 3.

    (72) According to some embodiments according to the current aspect and embodiments, the at least one gene essential for carotenoid synthesis is a gene of the MEP/DOXP pathway, such as i. 1-deoxy-D-xylulose-5-phosphate synthase gene dxs (ACSP50_7096, SEQ ID No. 23), ii. 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase gene ispG (ACSP50_7248, SEQ ID No. 24), iii. 1-deoxy-D-xylulose-5-phosphate reductoisomerase gene dxr (ACSP50_7250, SEQ ID No. 25), iv. 4-hydroxy-3-methylbut-2-enyl diphosphate reductase gene ispH (ACSP50_7707, SEQ ID No. 26), v. 4-(cytidine 5'-diphospho)-2-C-methyl-D-erythritol kinase gene ispE (ACSP50_7802, SEQ ID No. 27), vi. 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase gene ispF (ACSP50_8046, SEQ ID No. 28), and/or vii. 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase gene ispD (ACSP50_8047, SEQ ID No. 29).

    (73) According to some embodiments according to the current aspect and embodiments, the at least one gene essential for carotenoid synthesis is a gene of terpene cluster 1, such as i. isopentenyl-diphosphate delta-isomerase gene idi (ACSP50_0146, SEQ ID No. 30), ii. zeta-phytoene desaturase gene crtI (ACSP50_0147, SEQ ID No. 10), iii. polyprenyl synthetase gene crtE/IdsA (ACSP50_0148, SEQ ID No. 31), iv. phytoene synthase gene crtB (ACSP50_0149, SEQ ID No. 32), v. deoxyribodipyrimidine photo-lyase gene (ACSP50_0150, SEQ ID No. 33), or vi. pyridine nucleotide-disulfide oxidoreductase gene (ACSP50_0151, SEQ ID No. 34).

    (74) According to some embodiments according to the current aspect and embodiments, the at least one gene essential for carotenoid synthesis is zeta-phytoene desaturase gene crtI (ACSP50_0147, SEQ ID No. 10). As discussed before, carotenoid formation is dispensable under laboratory conditions. In order to improve acarbose production, switching off the competing carotenoid biosynthesis pathway, in particular by deletion of the central gene crtI, can be used for strain development.

    (75) According to some embodiments according to the current aspect and embodiments, the at least one gene essential for carotenoid synthesis is a gene of terpene cluster 2a, such as i. transcriptional regulator gene (ACSP50_1631, SEQ ID No. 35), ii. lycopene cyclase gene (ACSP50_1632, SEQ ID No. 36), iii. lycopene cyclase gene (ACSP50_1633, SEQ ID No. 37), iv. polyprenyl synthetase (farnesyl pyrophosphate synthetase 2) gene fps2/crtE (ACSP50_1634, SEQ ID No. 38), or v. methylenetetrahydrofolate reductase (NADPH) gene (ACSP50_1635, SEQ ID No. 39).

    (76) According to some embodiments according to the current aspect and embodiments, the at least one gene essential for carotenoid synthesis is a gene of terpene cluster 2b, such as i. LysR-family transcriptional regulator gene (ACSP50_1650, SEQ ID No. 40), ii. methyltransferase type 11 gene (ACSP50_1651, SEQ ID No. 41), iii. CDP-alcohol phosphatidyltransferase gene pgsA (ACSP50_1652, SEQ ID No. 42), iv. zeta-phytoene desaturase (crtI-family) gene crtD (ACSP50_1653, SEQ ID No. 43), v. glycosyl transferase gene cruC (ACSP50_1654, SEQ ID No. 44), vi. hypothetical protein (putative membrane protein) gene cruF (ACSP50_1655, SEQ ID No. 45), vii. GCN5 family acetyltransferase gene (ACSP50_1656, SEQ ID No. 46), viii. monooxygenase gene (ACSP50_1657, SEQ ID No. 47), or ix. short-chain dehydrogenase gene (ACSP50_1658, SEQ ID No. 48).

    (77) According to some embodiments according to the current aspect and embodiments, the at least one gene essential for carotenoid synthesis is polyprenyl synthetase gene crtE (ACSP50_3873, SEQ ID No. 49).

    (78) According to some embodiments according to the current aspect and embodiments, the at least one gene essential for carotenoid synthesis is a gene of camphene-like monoterpene biosynthesis terpene cluster 3, such as i. transcriptional regulator (Crp/Fnr family) gene eshA (ACSP50_1949, SEQ ID No. 104), ii. camphene synthase gene (ACSP50_1950, SEQ ID No. 50), iii. methyltransferase (SAM-dependent) type 11 gene (ACSP50_1951, SEQ ID No. 105), iv. glycosyl-hydrolase gene (ACSP50_1952, SEQ ID No. 106), or v. oxidoreductase/aldo/ketoreductase (ACSP50_1953, SEQ ID No. 107).

    (79) Since carotenoids influence the fluidity of membranes, lack of carotenoids, and in particular of the C40-carotenoid, can also affect the surface and mycelial structure of Actinoplanes sp. SE50/110. With regard to production, break-up of mycelial lumps is advantageous because it increases the mycelial surface and the number of biochemically available cells.

    (80) According to some further embodiments, the method according to the first aspect comprises engineering the Actinomycetales strain for overexpression of MerR-/HTH-transcriptional regulator gene merR (ACSP50_0145, SEQ ID No. 11). Engineering the Actinomycetales strain for overexpression may occur as described elsewhere herein.

    (81) Besides the mentioned genes which are essential for carotenoid synthesis, the inventors surprisingly identified a transcriptional repressor of carotenoid synthesis among the genes of terpene cluster 1: ACSP50_0145 (SEQ ID No. 11, MerR-/HTH-transcriptional regulator gene merR), cf. example "Deletion of merR in SE50/110 induces carotenoid formation without exposure to light", FIG. 24. By CRISPR/Cas9 deletion of the corresponding gene in SE50/110, carotenoid formation was strongly induced without exposure to light (FIGS. 24B and C). Consistent with this, the acarbose production was found to be decreased. When illuminated, both the wild type and ΔmerR are strongly pigmented and the final acarbose concentrations were similar for both strains, reaching approx. 0.52 g·L.sup.-1 (FIGS. 24B and D). This corresponds to a reduction of acarbose formation of approx. 38% compared to the wild type under dark conditions (reaching 0.83 g·L.sup.-1). This is in accordance with the previous growth experiments of the wild type. Under dark conditions, ΔmerR produces approx. 15% less acarbose than the wild type (0.70 g·L.sup.-1) (FIG. 24D). Without being bound by theory, these production losses are assumed to be caused by the waste of resources by carotenoid formation in the deletion mutant (FIG. 24C). In conclusion, the production losses under light conditions (38-39%) might be assigned to further light-induced stress in both the deletion mutant and the wild type.
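
    The relative production losses quoted above can be re-derived from the stated final concentrations. The following is a minimal sketch; the function and constant names are illustrative and not part of the specification, and because the concentrations are themselves rounded, the results only approximate the quoted percentages.

```python
def percent_reduction(reference: float, value: float) -> float:
    """Relative reduction of `value` versus `reference`, in percent."""
    if reference <= 0:
        raise ValueError("reference must be positive")
    return (reference - value) / reference * 100.0


# Final acarbose concentrations as quoted in the text (g per litre).
WILD_TYPE_DARK = 0.83   # wild type, dark conditions
ILLUMINATED = 0.52      # wild type and deletion mutant, illuminated
MUTANT_DARK = 0.70      # deletion mutant, dark conditions

light_loss = percent_reduction(WILD_TYPE_DARK, ILLUMINATED)
dark_loss = percent_reduction(WILD_TYPE_DARK, MUTANT_DARK)

if __name__ == "__main__":
    print(f"loss under light: {light_loss:.1f}%")
    print(f"loss of deletion mutant in the dark: {dark_loss:.1f}%")
```

    Both values fall close to the "approx. 38%" and "approx. 15%" given in the text; the small differences are consistent with rounding of the reported concentrations.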

    (82) According to some third embodiments according to the first aspect the method comprises engineering the Actinomycetales strain for overexpression of dTDP-D-glucose-4,6-dehydratase AcbB (SEQ ID No. 13).

    (83) According to the current invention it was surprisingly found that overexpression of the acb gene encoding the dTDP-D-glucose-4,6-dehydratase AcbB increased the final acarbose concentration significantly by approx. 50%. This was particularly surprising, because overexpression of other genes of the Acb cluster such as AcbC did not lead to an improved formation of acarbose. Furthermore, the observed increase was superior to the increase observed for overexpression of the complete Acb cluster as described by Zhao et al. (Zhao, Xie, et al. 2017). According to some embodiments, the method does not comprise engineering the Actinomycetales strain for overexpression of other genes of the Acb cluster, except for AcbA.

    (84) The dTDP-D-glucose-4,6-dehydratase AcbB seems to be involved in the generation of an activated amino sugar from D-glucose-1P which is a feeding pathway of the acarbose biosynthesis (FIG. 1): Without being bound by theory, increased AcbB activity was surprisingly found to also improve the supply of the modified precursor.

    (85) Overexpression of AcbB as described herein refers to an increase in expression for AcbB compared to the wild type or a specified reference strain/control. For example, the overexpression of the gene product may be an increase during the early growth phase, during the linear growth phase, during the stationary phase or an increase during any other time. Preferably, as described herein, overexpression of AcbB refers to an increase of AcbB transcript and/or protein by a factor of at least 1.5 or at least a factor of 2 compared to the control. With regard to AcbB transcript amounts, and if not defined otherwise herein, strong overexpression refers to a log 2 (fold change)>6. With regard to AcbB transcript amounts and if not defined otherwise herein, medium strong overexpression refers to a log 2 (fold change)≥2 and ≤6.
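
    The transcript-level categories above can be expressed as a simple threshold rule. The sketch below is illustrative only: the function name and the label for values below the medium strong range are assumptions, and the medium strong range is read here as inclusive of 2 and 6.

```python
def classify_overexpression(log2_fold_change: float) -> str:
    """Label a transcript-level log2 (fold change) relative to the control.

    Assumption: "medium strong" is read as 2 <= value <= 6 and
    "strong" as value > 6, following the wording of the description.
    """
    if log2_fold_change > 6:
        return "strong"
    if log2_fold_change >= 2:
        return "medium strong"
    return "below medium strong"


if __name__ == "__main__":
    # Example values quoted elsewhere in the description.
    for value in (1.87, 2.64, 4.06, 6.54):
        print(value, "->", classify_overexpression(value))
```

    Under this reading, a value of exactly 6 is still medium strong, and only values above 6 are labelled strong.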

    (86) According to some embodiments the overexpression of dTDP-D-glucose-4,6-dehydratase AcbB (SEQ ID No. 13) is the increase of the expression of AcbB transcript and/or protein by a factor of a log 2 (fold change) of at least 1.5 or at least 2, during the early growth phase, during the linear growth phase, during the stationary phase or an increase during any other time.

    (87) According to some embodiments the overexpression of dTDP-D-glucose-4,6-dehydratase AcbB (SEQ ID No. 13) is the increase of the expression of AcbB transcript and/or protein by a log 2 (fold change)>2 and ≤6 during the early growth phase, during the linear growth phase, during the stationary phase or an increase during any other time, such as during the early growth phase and/or during the linear growth phase.

    (88) According to some embodiments the overexpression of dTDP-D-glucose-4,6-dehydratase AcbB (SEQ ID No. 13) is the increase of the expression of AcbB transcript and/or protein by a log 2 (fold change)>3 and <5 during the early growth phase, during the linear growth phase, during the stationary phase or an increase during any other time.

    (89) According to some embodiments the overexpression of dTDP-D-glucose-4,6-dehydratase AcbB (SEQ ID No. 13) is the increase of the expression of AcbB transcript and/or protein by a log 2 (fold change)>6 during the early growth phase, during the linear growth phase, during the stationary phase or an increase during any other time.

    (90) In overexpression mutants with expression vectors comprising heterologous promoters, the relative transcription of acbB decreased from 4.06- to 3.33-fold (log 2 (fold change)) between the two sampling times in pSETT4tip::acbB (medium strong promoter) and from 6.54- to 2.05-fold in pSETT4gap::acbB (strong promoter) (cf. example "Medium overexpression of acbB leads to improved acarbose formation").
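
    Because the values above are given as log 2 (fold change), it can help to convert them to linear fold changes. The following minimal sketch does this for the four quoted values; the helper name and the dictionary layout are illustrative assumptions.

```python
def log2_fc_to_fold(log2_fold_change: float) -> float:
    """Convert a log2 (fold change) into a linear fold change."""
    return 2.0 ** log2_fold_change


# log2 (fold change) of acbB at the two sampling times, as quoted in the text.
SAMPLES = {
    "pSETT4tip::acbB": (4.06, 3.33),
    "pSETT4gap::acbB": (6.54, 2.05),
}

if __name__ == "__main__":
    for construct, (first, second) in SAMPLES.items():
        print(
            f"{construct}: {log2_fc_to_fold(first):.1f}-fold -> "
            f"{log2_fc_to_fold(second):.1f}-fold"
        )
```

    On the linear scale the drop for the gapDH construct is far steeper than for the tipA construct, since each unit of log 2 (fold change) corresponds to a doubling.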

    (91) According to some embodiments, engineering the Actinomycetales strain for overexpression of a gene according to the first aspect may occur by any method known in the art or described herein.

    (92) As described within the example "Medium overexpression of acbB leads to improved acarbose formation", two pSETT4-based overexpression mutants were created, in which acbB is transcribed under control of the medium strong tipA-promoter or the strong gapDH-promoter. The native promoter was used in both the pSET152- and the pSETT4-vector background as control. In particular, the mutant with acbB transcribed under control of the heterologous tipA-promoter displayed enhanced acarbose production compared to the control strains (FIG. 27, FIG. 28). The yield coefficient was increased by 48.6% and 51.9% compared to the empty vector control. With the strong gapDH-promoter, the acarbose yield coefficient was found to be only slightly increased (FIG. 28).

    (93) According to some embodiments, engineering the Actinomycetales strain for overexpression of a gene according to the first aspect may occur by introducing a vector comprising an expression cassette for AcbB (SEQ ID No. 13) into the Actinomycetales strain. In some embodiments, the expression vector is derived from pSET152. In some embodiments, the expression vector is derived from pSETT4. A vector is derived from another vector if it comprises at least one, two, three or four elements of the second vector.

    (94) According to some embodiments, engineering the Actinomycetales strain for overexpression of a gene according to the first aspect may occur by introducing a vector comprising an expression cassette for AcbB (SEQ ID No. 13) into the Actinomycetales strain. In some of these or other embodiments the expression cassette is under the control of a medium strong promoter, as characterized by a normalized glucuronidase activity of at least 1×10.sup.4, preferably between 1×10.sup.4 and 5×10.sup.4 [L·g.sup.-1·min.sup.-1] in a glucuronidase assay, e.g. as described elsewhere herein. In some embodiments said promoter is selected from efp promoter (SEQ ID No. 92), cdaR promoter (SEQ ID No. 97), rpsL promoter (SEQ ID No. 99), rpsJ promoter (SEQ ID No. 93), cgt promoter (SEQ ID No. 91), or tipA promoter (SEQ ID No. 81). In some embodiments the promoter is the tipA promoter (SEQ ID No. 81). Excellent results for acarbose production were obtained with pSETT4tip::acbB, cf. FIG. 27, FIG. 28.

    (95) In some embodiments the expression cassette is under the control of a strong promoter, as characterized by a normalized glucuronidase activity of at least 5×10.sup.5 [L·g.sup.-1·min.sup.-1] in a glucuronidase assay, e.g. as described elsewhere herein. In some embodiments said promoter is selected from apm promoter (SEQ ID No. 96), ermE* promoter (SEQ ID No. 98), katE promoter (SEQ ID No. 94), moeE5 promoter (SEQ ID No. 95) or gapDH promoter (SEQ ID No. 82).

    (96) According to some embodiments, the method according to the first aspect comprises engineering the Actinomycetales strain for medium overexpression of dTDP-D-glucose-4,6-dehydratase AcbB (SEQ ID No. 13) and optionally AcbA (SEQ ID No. 12). In some embodiments, which are also compatible with all other embodiments described herein if not explicitly stated otherwise, genetic engineering does not result in an increase of transcript and/or protein by a log 2 (fold change)≥2 for Acb genes other than AcbB and AcbA. In some embodiments, which are also compatible with all other embodiments described herein, genetic engineering does not result in an increase of transcript and/or protein by a log 2 (fold change)≥2 for AcbC.

    (97) Upon overexpression of AcbB, further genes of the acb gene cluster were not significantly affected, e.g. in the early growth phase, as shown for acbA and acbV (FIG. 30). The only exception is a slightly higher transcript abundance of acbA in pSETT4tip::acbB (log 2 (fold change)=1.87).

    (98) According to some embodiments, the method according to the first aspect comprises engineering the Actinomycetales strain for overexpression of AcbB (SEQ ID No. 13) and AcbS (ACSP50_3596) and/or AcbI (ACSP50_3599).

    (99) By (additional) overexpression of AcbS and/or AcbI, the transfer reaction of the amino sugar to the cyclitol precursor can be strengthened. According to the current model (see FIG. 1), this reaction is catalyzed by AcbS (ACSP50_3596) or AcbI (ACSP50_3599).

    (100) According to some embodiments, the method according to the first aspect comprises engineering the Actinomycetales strain for overexpression of AcbB (SEQ ID No. 13) and AcbCUJ (AcbC (ACSP50_3607) and/or AcbU (ACSP50_3595) and/or AcbJ (ACSP50_3600)) and/or AcbSI (AcbS (ACSP50_3596) and/or AcbI (ACSP50_3599)). Without being bound by theory, this combination can plausibly reinforce both acarbose synthesis strands.

    (101) According to some fourth embodiments according to the first aspect, the method comprises engineering the Actinomycetales strain for overexpression of UDP-glucose-1P uridyltransferase GtaB (SEQ ID No. 19).

    (102) By medium overexpression of gtaB, an increase of 8.5% of the final acarbose concentration was observed, cf. example Medium overexpression of gtaB leads to improved acarbose formation, FIG. 32, FIG. 33. Interestingly, the acarbose formation is particularly increased in the late linear to stationary growth phase (FIG. 32). Without being bound by theory, this may result from the improved deployment of the precursor glucose-1P (cf. FIG. 34).

    (103) Overexpression of GtaB (SEQ ID No. 19) as described herein refers to an increase in expression for GtaB transcript and/or protein compared to the wild type or a specified reference strain/control. For example, the overexpression of the gene product may be an increase during the early growth phase and/or during the linear growth phase and/or during the stationary phase, and/or an increase during any other time.

    (104) Preferably, overexpression is an increase of GtaB transcript and/or protein by a factor of at least 1.5 or at least a factor of 2 compared to the control. With regard to GtaB transcript amounts and if not defined otherwise herein, strong overexpression refers to a log 2 (fold change)>6. With regard to GtaB transcript amounts and if not defined otherwise herein, medium strong overexpression refers to a log 2 (fold change)≥2 and ≤6.

    (105) According to some embodiments the overexpression of UDP-glucose-1P uridyltransferase GtaB is the increase of the expression of GtaB by a factor of a log 2 (fold change) of at least 1.5 or at least 2 during the early growth phase and/or during the linear growth phase and/or during the stationary phase, and/or an increase during any other time.

    (106) In one of the overexpression mutants described herein, the relative transcript amount of the gene gtaB is 2.64-fold increased (log 2 (fold change)) (FIG. 33).

    (107) According to some embodiments the overexpression of UDP-glucose-1P uridyltransferase GtaB is the increase of the expression of GtaB transcript and/or protein by a log 2 (fold change)≥2 and ≤6 during the early growth phase and/or during the linear growth phase and/or during the stationary phase, and/or an increase during any other time. According to some embodiments the overexpression of UDP-glucose-1P uridyltransferase GtaB is the increase of the expression of GtaB by a log 2 (fold change)>3 and <5 during the early growth phase and/or during the linear growth phase and/or during the stationary phase, and/or an increase during any other time. According to some embodiments the overexpression of UDP-glucose-1P uridyltransferase GtaB is the increase of the expression of GtaB by a log 2 (fold change)>6 during the early growth phase and/or during the linear growth phase and/or during the stationary phase.

    (108) According to some embodiments, engineering the Actinomycetales strain for overexpression of a gene according to the first aspect may occur by introducing a vector comprising an expression cassette for GtaB (SEQ ID No. 19) into the Actinomycetales strain. In some embodiments, the expression vector is derived from pSET152. In some embodiments, the expression vector is derived from pSETT4. A vector is derived from another vector if it comprises at least one, two, three or four elements of the second vector.

    (109) According to some embodiments, engineering the Actinomycetales strain for overexpression of a gene according to the first aspect may occur by introducing a vector comprising an expression cassette for GtaB (SEQ ID No. 19) into the Actinomycetales strain.

    (110) In some of these or other embodiments the expression cassette is under the control of a medium strong promoter, as characterized by a normalized glucuronidase activity of between 1×10.sup.4 and 5×10.sup.5 [L·g.sup.-1·min.sup.-1] in a glucuronidase assay, e.g. as described elsewhere herein. In some embodiments said promoter is selected from efp promoter (SEQ ID No. 92), cdaR promoter (SEQ ID No. 97), rpsL promoter (SEQ ID No. 99), rpsJ promoter (SEQ ID No. 93), cgt promoter (SEQ ID No. 91), or tipA promoter (SEQ ID No. 81). In some embodiments the promoter is the tipA promoter (SEQ ID No. 81). Good results for acarbose production were obtained for example with pSETT4tip::gtaB, cf. FIG. 32, FIG. 33.

    (111) In some embodiments the expression cassette is under the control of a strong promoter, as characterized by a normalized glucuronidase activity of at least 5×10.sup.5 [L·g.sup.-1·min.sup.-1] in a glucuronidase assay, e.g. as described elsewhere herein. In some embodiments said promoter is selected from apm promoter (SEQ ID No. 96), ermE* promoter (SEQ ID No. 98), katE promoter (SEQ ID No. 94), moeE5 promoter (SEQ ID No. 95) or gapDH promoter (SEQ ID No. 82).

    (112) According to some further or the same embodiments of the first aspect, the method comprises engineering the Actinomycetales strain for medium overexpression of dTDP-D-glucose-4,6-dehydratase AcbB (SEQ ID No. 13) and GtaB (SEQ ID No. 19).

    (113) It was surprisingly found that overexpression of GtaB triggers improved acarbose formation. By medium overexpression of acbB (e.g. by use of the tipA-promoter), a positive effect on acarbose production was observed yielding approx. 50% more acarbose in two independent cultivations.

    (114) Thus, acarbose biosynthesis was improved by overexpression of a single acb gene, acbB. Furthermore, by medium overexpression of gtaB, an increase of 8.5% of the final acarbose concentration was observed. It is plausible that, by a combined overexpression of acbB and gtaB, the flux through the amino sugar biosynthesis is improved, leading to a further enhancement of acarbose production.

    (115) Without being bound by theory, strong overexpression of AcbB induced only smaller increases of acarbose production compared to medium strong overexpression of AcbB. This may be due to an imbalance in glucose-phosphate metabolism occurring upon massive overexpression of AcbB. Overexpression of gtaB might cure this imbalance, and combined overexpression of both acbB and gtaB therefore plausibly leads to a further increase in acarbose production.

    (116) Interestingly, a significantly decreased amount of the mass m/z=545 [MH.sup.+] was found in pSETT4tip::gtaB (approx. decrease of 48%), which might correspond to dTDP-4-keto-6-deoxy-D-glucose, the proposed product of AcbB. This may indicate that the flow through the synthesis strand is more balanced, since the accumulation of this metabolite is reduced in comparison to the empty vector control and AcbB-overexpression mutants (FIG. 34).

    (117) According to some embodiments, the method according to the first aspect comprises engineering the Actinomycetales strain (i) for absent or reduced expression of extracellular small carbohydrate binding protein Cgt (SEQ ID No. 20) and/or, (ii) for absent or reduced expression of at least one gene involved in carotenoid synthesis, and/or, (iii) for overexpression of dTDP-D-glucose-4,6-dehydratase AcbB (SEQ ID No. 13), and/or (iv) for overexpression of UDP-glucose-1P uridyltransferase GtaB (SEQ ID No. 19).

    (118) According to some embodiments, the method according to the first aspect further comprises engineering the Actinomycetales strain for absent or reduced expression of treY.

    (119) According to some embodiments, the method according to the first aspect further comprises (i) deletion or mutation of the gene encoding extracellular small carbohydrate binding protein Cgt (SEQ ID No. 20) and/or, (ii) deletion or mutation of at least one gene involved in carotenoid synthesis and/or, (iii) introducing a vector comprising an expression cassette for AcbB (SEQ ID No. 13) into the Actinomycetales strain and/or (iv) introducing a vector comprising an expression cassette for GtaB (SEQ ID No. 19) into the Actinomycetales strain.

    (120) According to some embodiments, the expression cassette according to (iii) and/or (iv) is under the control of a medium strong promoter, as characterized by a normalized glucuronidase activity of between 1×10.sup.4 and 5×10.sup.5 [L·g.sup.-1·min.sup.-1] in a glucuronidase assay.

    (121) According to a second aspect there is provided an Actinomycetales strain, such as an Actinoplanes strain, for the production of acarbose. According to some embodiments the Actinomycetales strain is a strain generated by a method according to the first aspect. According to some other embodiments the Actinomycetales strain is genetically engineered for absent or reduced expression of extracellular small carbohydrate binding protein Cgt (SEQ ID No. 20). According to some embodiments the Actinomycetales strain is a cgt mutant. A cgt mutant is a variant of an Actinomycetales strain wherein the gene Cgt (SEQ ID No. 20) has been at least partially deleted or inverted.

    (122) According to some of these or other embodiments the Actinomycetales strain is genetically engineered for absent or reduced expression of at least one gene which is essential for carotenoid synthesis. According to some embodiments the at least one gene which is essential for carotenoid synthesis has been at least partially deleted or inverted. According to some of these embodiments the at least one gene which is essential for carotenoid synthesis comprises at least one gene selected from any of a. the genes of the MEP/DOXP pathway, such as i. 1-deoxy-D-xylulose-5-phosphate synthase gene dxs (ACSP50_7096, SEQ ID No. 23), ii. 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase gene ispG (ACSP50_7248, SEQ ID No. 24), iii. 1-deoxy-D-xylulose-5-phosphate reductoisomerase gene dxr (ACSP50_7250, SEQ ID No. 25), iv. 4-hydroxy-3-methylbut-2-enyl diphosphate reductase gene ispH (ACSP50_7707, SEQ ID No. 26), v. 4-(cytidine 5′-diphospho)-2-C-methyl-D-erythritol kinase gene ispE (ACSP50_7802, SEQ ID No. 27), vi. 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase gene ispF (ACSP50_8046, SEQ ID No. 28), and/or vii. 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase gene ispD (ACSP50_8047, SEQ ID No. 29), b. the genes of terpene cluster 1, such as i. isopentenyl-diphosphate delta-isomerase gene idi (ACSP50_0146, SEQ ID No. 30), ii. zeta-phytoene desaturase gene crtI (ACSP50_0147, SEQ ID No. 10), iii. polyprenyl synthetase gene crtE/IdsA (ACSP50_0148, SEQ ID No. 31), iv. phytoene synthase gene crtB (ACSP50_0149, SEQ ID No. 32), v. deoxyribodipyrimidine photo-lyase gene (ACSP50_0150, SEQ ID No. 33), or vi. pyridine nucleotide-disulfide oxidoreductase gene (ACSP50_0151, SEQ ID No. 34), c. the genes of terpene cluster 2a, such as i. transcriptional regulator gene (ACSP50_1631, SEQ ID No. 35), ii. lycopene cyclase gene (ACSP50_1632, SEQ ID No. 36), iii. lycopene cyclase gene (ACSP50_1633, SEQ ID No. 37), iv. polyprenyl synthetase (farnesyl pyrophosphate synthetase 2) gene fps2/crtE (ACSP50_1634, SEQ ID No. 38), or v. methylenetetrahydrofolate reductase (NADPH) gene (ACSP50_1635, SEQ ID No. 39), d. the genes of terpene cluster 2b, such as i. LysR-family transcriptional regulator gene (ACSP50_1650, SEQ ID No. 40), ii. methyltransferase type 11 gene (ACSP50_1651, SEQ ID No. 41), iii. CDP-alcoholphosphatidyltransferase pgsA (ACSP50_1652, SEQ ID No. 42), iv. zeta-phytoene desaturase (crtI-family) gene crtD (ACSP50_1653, SEQ ID No. 43), v. glycosyl transferase gene cruC (ACSP50_1654, SEQ ID No. 44), vi. hypothetical protein (put. membrane prot.) gene cruF (ACSP50_1655, SEQ ID No. 45), vii. GCN5 family acetyltransferase gene (ACSP50_1656, SEQ ID No. 46), viii. monooxygenase gene (ACSP50_1657, SEQ ID No. 47), ix. short-chain dehydrogenase gene (ACSP50_1658, SEQ ID No. 48), e. polyprenyl synthetase gene crtE (ACSP50_3873, SEQ ID No. 49), or f. the genes for camphene-like monoterpene biosynthesis terpene cluster 3, such as i. transcriptional regulator (Crp/Fnr family) gene eshA (ACSP50_1949, SEQ ID No. 104), ii. camphene synthase gene (ACSP50_1950, SEQ ID No. 50), iii. methyltransferase (SAM-dependent) type 11 gene (ACSP50_1951, SEQ ID No. 105), iv. glycosyl-hydrolase gene (ACSP50_1952, SEQ ID No. 106), v. oxidoreductase/aldo/ketoreductase (ACSP50_1953, SEQ ID No. 107).

    (123) According to some of these or other embodiments the Actinomycetales strain is genetically engineered for overexpression of MerR-/HTH-transcriptional regulator gene merR (ACSP50_0145, SEQ ID No. 11).

    (124) According to some of these or other embodiments the Actinomycetales strain is genetically engineered for overexpression of dTDP-D-glucose-4,6-dehydratase AcbB (SEQ ID No. 13).

    (125) As described elsewhere herein, overexpression of AcbB refers to an increase of AcbB by a factor of at least 1.5 or at least a factor of 2 compared to the control. Preferably, the control is the strain which has not been engineered for the specific overexpression of dTDP-D-glucose-4,6-dehydratase AcbB (SEQ ID No. 13). For example, the control does not comprise a vector comprising an expression cassette for AcbB.

    (126) For example, the overexpression of the gene product may be an increase during the early growth phase, during the linear growth phase, during the stationary phase or an increase during any other time.

    (127) According to some embodiments the overexpression of dTDP-D-glucose-4,6-dehydratase AcbB (SEQ ID No. 13) is the increase of the expression of AcbB transcript and/or protein by a factor of a log 2 (fold change) of at least 1.5 or at least 2, during the early growth phase, during the linear growth phase, during the stationary phase or an increase during any other time.

    (128) According to some embodiments the overexpression of dTDP-D-glucose-4,6-dehydratase AcbB (SEQ ID No. 13) is the increase of the expression of AcbB transcript and/or protein by a log 2 (fold change)≥2 and ≤6 during the early growth phase, during the linear growth phase, during the stationary phase or an increase during any other time, such as during the early growth phase and/or during the linear growth phase.

    (129) According to some embodiments the overexpression of dTDP-D-glucose-4,6-dehydratase AcbB (SEQ ID No. 13) is the increase of the expression of AcbB transcript and/or protein by a log 2 (fold change)>3 and <5 during the early growth phase, during the linear growth phase, during the stationary phase or an increase during any other time.

    (130) According to some embodiments the overexpression of dTDP-D-glucose-4,6-dehydratase AcbB (SEQ ID No. 13) is the increase of the expression of AcbB transcript and/or protein by a log 2 (fold change)>6 during the early growth phase, during the linear growth phase, during the stationary phase or an increase during any other time.

    (131) According to some embodiments the Actinomycetales strain genetically engineered for overexpression of dTDP-D-glucose-4,6-dehydratase AcbB (SEQ ID No. 13) comprises a vector for overexpression of AcbB. According to some of these embodiments, the vector is a vector as described herein, preferably according to an aspect described herein.

    (132) According to some embodiments the Actinomycetales strain genetically engineered for overexpression of dTDP-D-glucose-4,6-dehydratase AcbB (SEQ ID No. 13) comprises an expression cassette for AcbB (SEQ ID No. 13) under the control of a medium strong promoter.

    (133) According to some embodiments the Actinomycetales strain genetically engineered for overexpression of dTDP-D-glucose-4,6-dehydratase AcbB (SEQ ID No. 13) comprises an expression cassette for AcbB (SEQ ID No. 13) under the control of a strong promoter. Preferably, the promoter is not the native promoter of AcbB.

    (134) According to some of these or other embodiments the Actinomycetales strain is genetically engineered for overexpression of UDP-glucose-1P uridyltransferase GtaB (SEQ ID No. 19).

    (135) Overexpression of GtaB (SEQ ID No. 19) as described elsewhere herein refers to an increase in expression for GtaB compared to the wild type or a specified reference strain/control. Preferably, the control is the strain which has not been engineered for the specific overexpression of GtaB (SEQ ID No. 19). For example, the control does not comprise a vector comprising an expression cassette for GtaB (SEQ ID No. 19). For example, the overexpression of the gene product may be an increase during the early growth phase and/or during the linear growth phase and/or during the stationary phase, and/or an increase during any other time.

    (136) According to some embodiments the overexpression of GtaB is the increase of the expression of GtaB transcript and/or protein by a factor of a log 2 (fold change) of at least 1.5, or at least 2, during the early growth phase, during the linear growth phase, during the stationary phase or an increase during any other time.

    (137) According to some embodiments the overexpression of GtaB is the increase of the expression of GtaB transcript and/or protein by a log 2 (fold change)≥2 and ≤6 during the early growth phase, during the linear growth phase, during the stationary phase or an increase during any other time, such as during the early growth phase and/or during the linear growth phase.

    (138) According to some embodiments the overexpression of GtaB is the increase of the expression of GtaB transcript and/or protein by a log 2 (fold change)>3 and <5 during the early growth phase, during the linear growth phase, during the stationary phase or an increase during any other time.

    (139) According to some embodiments the overexpression of GtaB is the increase of the expression of GtaB transcript and/or protein by a log 2 (fold change)>6 during the early growth phase, during the linear growth phase, during the stationary phase or an increase during any other time.

    (140) According to some embodiments the Actinomycetales strain genetically engineered for overexpression of GtaB comprises a vector for overexpression of GtaB. According to some of these embodiments, the vector is a vector as described herein, preferably according to an aspect described herein.

    (141) According to some embodiments the Actinomycetales strain genetically engineered for overexpression of GtaB (SEQ ID No. 19) comprises an expression cassette for GtaB (SEQ ID No. 19) under the control of a medium strong promoter.

    (142) According to some embodiments the Actinomycetales strain genetically engineered for overexpression of GtaB (SEQ ID No. 19) comprises an expression cassette for GtaB (SEQ ID No. 19) under the control of a strong promoter. Preferably, the promoter is not the native promoter of GtaB.

    (143) According to a third aspect there is provided an Actinomycetales strain, such as an Actinoplanes strain, for use in the production of acarbose.

    (144) According to some embodiments there is provided a method for the production of acarbose, wherein the method comprises the use of an Actinomycetales strain according to the second aspect.

    (145) For genetic engineering of Actinoplanes, an expression system is required for the overexpression of singular or multiple genes. According to a fourth aspect there is provided an expression vector for Actinoplanes.

    (146) According to some embodiments, the vector according to the fourth aspect comprises a medium strong promoter characterized by a normalized glucuronidase activity of at least 1×10.sup.4 [L·g.sup.-1·min.sup.-1] in a glucuronidase assay. In some embodiments, the medium strong promoter is selected from efp according to SEQ ID No. 92, cdaR according to SEQ ID No. 97, rpsL according to SEQ ID No. 99, rpsJ according to SEQ ID No. 93, cgt according to SEQ ID No. 91, or tipA according to SEQ ID No. 81.

    (147) According to some embodiments, the vector according to the fourth aspect comprises a strong promoter characterized by a normalized glucuronidase activity of at least 5×10.sup.4 [L·g.sup.-1·min.sup.-1] in a glucuronidase assay. In some embodiments, the strong promoter is selected from apm according to SEQ ID No. 96, ermE* according to SEQ ID No. 98, katE according to SEQ ID No. 94, moeE5 according to SEQ ID No. 95 or gapDH according to SEQ ID No. 82.
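
    For orientation, the promoters named above and their SEQ ID numbers can be collected in a small lookup table. This is a minimal sketch: the data structure, class name and function name are illustrative assumptions, while the promoter names, strength classes and SEQ ID numbers are taken from the description.

```python
from typing import NamedTuple


class Promoter(NamedTuple):
    strength: str    # "medium strong" or "strong", as grouped in the text
    seq_id_no: int   # SEQ ID No. given in the description


PROMOTERS = {
    "efp": Promoter("medium strong", 92),
    "cdaR": Promoter("medium strong", 97),
    "rpsL": Promoter("medium strong", 99),
    "rpsJ": Promoter("medium strong", 93),
    "cgt": Promoter("medium strong", 91),
    "tipA": Promoter("medium strong", 81),
    "apm": Promoter("strong", 96),
    "ermE*": Promoter("strong", 98),
    "katE": Promoter("strong", 94),
    "moeE5": Promoter("strong", 95),
    "gapDH": Promoter("strong", 82),
}


def promoters_with_strength(strength: str) -> list[str]:
    """Return the promoter names of one strength class, sorted by name."""
    return sorted(
        name for name, promoter in PROMOTERS.items()
        if promoter.strength == strength
    )


if __name__ == "__main__":
    print(promoters_with_strength("medium strong"))
    print(promoters_with_strength("strong"))
```

    The table deliberately stores only the class labels used in the text, not numerical activity thresholds.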

    (148) To find further suitable promoters that allow medium to strong gene expression, a promoter screening can be carried out by use of the screening system of Horbal et al. (2013) and Myronovskyi et al. (2011), which is based on the reporter GusA cloned in a pSET152-vector system, cf. FIG. 3, Table 1.
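
    As described for FIG. 3, the normalized glucuronidase activity is obtained from the slope of the absorption curve, calculated by linear regression and normalized by the cell dry weight. The following is a minimal sketch of that calculation only; the function names, the input layout and the example numbers are hypothetical, and any unit conversion used in the actual assay is not reproduced here.

```python
from typing import Sequence


def regression_slope(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Ordinary least-squares slope of ys against xs."""
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need at least two paired measurements")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


def normalized_activity(
    times_min: Sequence[float],
    absorbances: Sequence[float],
    cell_dry_weight_g: float,
) -> float:
    """Slope of the absorption curve divided by the cell dry weight."""
    if cell_dry_weight_g <= 0:
        raise ValueError("cell dry weight must be positive")
    return regression_slope(times_min, absorbances) / cell_dry_weight_g


if __name__ == "__main__":
    # Hypothetical readings, for illustration only.
    times = [0.0, 1.0, 2.0, 3.0]
    readings = [0.00, 0.10, 0.20, 0.30]
    print(normalized_activity(times, readings, cell_dry_weight_g=0.5))
```

    Promoters can then be compared by this normalized value against the empty pGUS vector, as done in FIG. 3.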

    (149) In some embodiments the vector according to the first aspect comprises an expression cassette. Preferably the vector comprises an expression cassette for AcbB (SEQ ID No. 13) and/or an expression cassette for GtaB (SEQ ID No. 19) and/or an expression cassette for MerR. In some embodiments, the expression cassette may furthermore comprise a lacZα gene under control of the lac promoter. The lacZα gene encodes a catalytic domain of a β-galactosidase that enables quick selection of the integration of a target sequence by blue/white selection in the cloning strain Escherichia coli DH5αMCR (NC_017638.1) (Grant et al. 1990).

    (150) Without being bound by theory, the vector according to the current aspect comprises elements for vector replication, transfer, maintenance and selection. In some embodiments, at least one of these elements is derived from pSET152.

    (151) In some embodiments, the vector according to the current aspect comprises parts of the sequence of the pSET152 vector of Bierman et al. (1992).

    (152) Preferably, the vector does not comprise putative antisense promoters according to SEQ ID No. 108 and/or SEQ ID No. 109. These antisense promoters were identified by the inventors by sequencing of a 5′-primary transcript library and impair suitability of the vector pSET152. In brief, identification occurred by sequencing of an enriched primary transcript library. The two putative promoters were identified behind the gene of interest in antisense orientation (FIG. 26). These two pseudo-promoters were removed in order to prevent antisense transcription.

    (153) Furthermore, a T4-terminator was introduced behind the expression cassette in opposite orientation to prevent further putative antisense reads (cf. e.g. FIG. 6). In some embodiments, the vector comprises at least one T4-terminator (derived from the bacteriophage T4). T4-terminators can block transcription efficiently and prevent read-through from the integrase gene into the gene of interest. In some embodiments, the vector comprises a T4-terminator behind the expression cassette in opposite orientation to prevent further putative antisense reads. For example, the vector may comprise at least one T4-terminator before and/or at least one T4-terminator after the expression cassette. In some embodiments, the vector may comprise three terminators, one before and two after the expression cassette.

    (154) In some embodiments the vector comprises the φC31 integrase gene int. In some of these embodiments the φC31 integrase gene int is derived from pSET152. In some embodiments, the vector according to the first aspect furthermore comprises the attachment site attP. The integrase encoded by the φC31 integrase gene int mediates the integration of the vector into the host chromosome at a distinct genomic location by catalyzing the targeted and unidirectional recombination of two attachment sites: attP, localized on the vector, and attB, localized in the host chromosome in the gene ACSP50_6589 (former: ACPL_6602) (te Poele et al. 2008; Gren et al. 2016). Without being bound by theory, after integration, the vector is flanked by the attachment sites left (attL) and right (attR), which are derived from attP-attB-recombination (te Poele, Bolhuis and Dijkhuizen 2008).

    (155) In some embodiments the vector comprises an origin of transfer such as the origin of transfer (incP) and/or a relaxosome gene such as the relaxosome gene traJ. In some of these embodiments the origin of transfer such as the origin of transfer (incP) and/or the relaxosome gene, such as traJ are derived from pSET152. The origin of transfer and the relaxosome gene enable the transfer of the plasmid from the donor strain (e.g. Escherichia coli ET12567/pUZ8002 (Kieser et al. 2000)).

    (156) In some embodiments the vector according to the first aspect comprises an origin of replication such as the high-copy-number ColE1/pMB1/pBR322/pUC origin of replication (ori). In some of these embodiments the origin of replication such as the high-copy-number ColE1/pMB1/pBR322/pUC origin of replication (ori) is derived from pSET152. The origin of replication such as the high-copy-number ColE1/pMB1/pBR322/pUC origin of replication (ori) enables replication of the plasmid in the cloning strain (Escherichia coli DH5αMCR) and donor strain (Escherichia coli ET12567/pUZ8002).

    (157) In some embodiments the vector according to the first aspect comprises at least one resistance marker such as a resistance marker mediating apramycin resistance (aac(3)IV, apmR). Resistance markers mediating apramycin resistance (aac(3)IV, apmR) can be used for selection.

    (158) According to some embodiments according to the fourth aspect the expression vector comprises at least one element of pSET152, such as (a) the φC31 integrase gene int according to SEQ ID No. 85, (b) the origin of transfer (incP) according to SEQ ID No. 87, (c) the relaxosome gene traJ according to SEQ ID No. 88, or (d) the high-copy-number ColE1/pMB1/pBR322/pUC origin of replication according to SEQ ID No. 89, and furthermore does not comprise putative antisense promoters according to SEQ ID No. 108 and SEQ ID No. 109.

    (159) According to some embodiments according to the fourth aspect the expression vector comprises (a) the φC31 integrase gene int according to SEQ ID No. 85, and (b) the origin of transfer (incP) according to SEQ ID No. 87, and (c) the relaxosome gene traJ according to SEQ ID No. 88, and (d) an origin of replication, such as the high-copy-number ColE1/pMB1/pBR322/pUC origin of replication (ori) according to SEQ ID No. 89 and (e) optionally at least one resistance marker, such as a resistance marker mediating apramycin resistance, such as aac(3)IV according to SEQ ID No. 90, apmR, and (f) optionally at least one T4-terminator, and (g) optionally, wherein the vector does not comprise putative antisense promoters according to SEQ ID No. 108 and/or SEQ ID No. 109.

    (160) According to some embodiments, the vector comprises the sequence according to SEQ ID No. 110 or SEQ ID No. 111. According to some embodiments, the vector comprises the sequence according to SEQ ID No. 110 or SEQ ID No. 111, or a fragment thereof.

    (161) In some embodiments, the vector features an easy cloning mechanism that allows integration of different promoters. In this way, the system can be quickly adapted to further species, e.g. production strains of acarbose.

    EXAMPLES

    (162) General Tools and Methods

    (163) Strains and Plasmids

    (164) All strains used in this work are listed in Table E1. Recombinant strains used or created in this work are listed in Table E2, Table E3 and Table E4 (plasmid-based expression systems in Table E2, deletion and integration constructs cloned and stored in E. coli DH5αMCR in Table E3, deletion and integration mutants of Actinoplanes sp. SE50/110 in Table E4).

    (165) TABLE-US-00003 TABLE E1 Culture collection of microorganisms.
    strain | strain collection | NCBI reference sequence | reference
    Actinoplanes sp. SE50/110 | ATCC31044, CBS 674.73 | NZ_LT827010.1 | (Wolf et al. 2017b; Frommer et al. 1979; Parenti and Coronelli 1979)
    Escherichia coli DH5αMCR | Mcr-deficient derivative of E. coli DH1 | NC_017638.1 | (Grant et al. 1990)
    Escherichia coli ET12567/pUZ8002 | - | - | (Kieser et al. 2000)
    Streptomyces lividans TK23 | plasmid-free derivative of S. lividans 66 | NZ_CP009124.1 (TK24 as representative genome) | (Kieser et al. 2000)
    Streptomyces coelicolor A3(2) M145 | ATCC BAA-471D-5, plasmid-free derivative of S. coelicolor A3(2) ATCC BAA-471 | NC_003888.3 | (Bentley et al. 2002; Dyson and Schrempf 1987)
    Streptomyces glaucescens GLA.O | DSM40922 | NZ_CP009438.1 | (Ortseifen et al. 2015)

    (166) TABLE-US-00004 TABLE E2 Replicative and integrative vector systems.
    vector name | promoter and insert | E. coli DH5αMCR | Actinoplanes sp. SE50/110 | source
    pSETT4 constructs (backbone created in this work)
    pSETT4gap | P.sub.gapDH | Ec112 | - | this work
    pSETT4tip | P.sub.tipA | Ec117 | - | this work
    pSETT4::P.sub.acbB:acbB | P.sub.acbB, acbB (ACSP50_3608) | Ec120 | Ac152 | this work
    pSETT4tip::acbB | P.sub.tipA, acbB (ACSP50_3608) | Ec119 | Ac153 | this work
    pSETT4gap::acbB | P.sub.gapDH, acbB (ACSP50_3608) | Ec118 | Ac154 | this work
    pSETT4tip::gtaB | P.sub.tipA, gtaB (ACSP50_7820) | Ec115 | Ac150 | this work

    (167) TABLE-US-00005 TABLE E3 Vector systems for targeted deletion and integration based on pCRISPomyces-2 of Cobb et al. (2015).
    vector name | insert | E. coli DH5αMCR | source
    pCRISPomyces-2::sp:cgt_flanks | flanks for deletion of the gene cgt (ACSP50_5024) | Ec018 | this work
    pCRISPomyces-2::sp1:merR_flanks | flanks for deletion of the gene merR (ACSP50_0145) | Ec109 | this work

    (168) TABLE-US-00006 TABLE E4 Deletion and integration mutants obtained in Actinoplanes ssp. by CRISPR/Cas9-technique.
    strain | description | Actinoplanes sp. SE50/110 | source
    Actinoplanes sp. SE50/110 Δcgt | deletion mutant of the gene cgt (ACSP50_5024) | Ac064 | this work
    Actinoplanes sp. SE50/110 ΔmerR | deletion mutant of the gene merR (ACSP50_0145) | Ac146 | this work
    Media and Cultivation Conditions

    (169) Unless otherwise specified, all chemicals and media components were obtained from Carl Roth GmbH & Co. KG (Karlsruhe, Germany), Sigma-Aldrich (St. Louis, USA), SERVA Electrophoresis GmbH (Heidelberg, Germany) or VWR International (Pennsylvania, USA).

    (170) Preparation of Glycerol Stocks of Actinoplanes Sp. SE50/110

    (171) For preparation of glycerol stocks, Actinoplanes sp. SE50/110 (ATCC 31044) was grown in the complex medium NBS (11 g.Math.L.sup.−1 glucose.Math.1H.sub.2O, 4 g.Math.L.sup.−1 peptone, 4 g.Math.L.sup.−1 yeast extract, 1 g.Math.L.sup.−1 MgSO.sub.4.Math.7H.sub.2O, 2 g.Math.L.sup.−1 KH.sub.2PO.sub.4, 4 g.Math.L.sup.−1 K.sub.2HPO.sub.4) and mixed 2:3 with sterile 86% (v/v) glycerol. Glycerol stocks were stored at −80 °C.

    (172) Growth on Solid Media and Preparation of Spore Solutions

    (173) For spore formation, 200-300 µL of a glycerol stock were grown on agar plates of soy flour medium (SFM-agar) (20 g.Math.L.sup.−1 soy flour (SOBO® Naturkost (Cologne, Germany)), 20 g.Math.L.sup.−1 D-mannitol, 20 g.Math.L.sup.−1 Bacto agar (Becton-Dickinson, Heidelberg, Germany), 167 µL 10 N NaOH in tap water). Spores could be harvested after 5-7 days of incubation at 28 °C by washing them off in 3 mL ddH.sub.2O with a cotton swab, as described by Wolf et al. (2016).

    (174) Preparation of Minimal Medium

    (175) Maltose minimal medium (72.06 g.Math.L.sup.−1 maltose.Math.1H.sub.2O, 5 g.Math.L.sup.−1 (NH.sub.4).sub.2SO.sub.4, 0.184 g.Math.L.sup.−1 FeCl.sub.2.Math.4H.sub.2O, 5.7 g.Math.L.sup.−1 Na.sub.3C.sub.6H.sub.5O.sub.7.Math.2H.sub.2O, 1 g.Math.L.sup.−1 MgCl.sub.2.Math.6H.sub.2O, 2 g.Math.L.sup.−1 CaCl.sub.2.Math.2H.sub.2O, trace elements (final concentration: 1 µM CuCl.sub.2, 50 µM ZnCl.sub.2, 7.5 µM MnCl.sub.2 dissolved in 1 M HCl) and phosphate buffer consisting of 5 g.Math.L.sup.−1 each K.sub.2HPO.sub.4 and KH.sub.2PO.sub.4 in ddH.sub.2O) was prepared and filter sterilized following the protocol of Wendler et al. (2013).

    (176) For substitution of the carbon source maltose, 79.2 g.Math.L.sup.−1 glucose.Math.1H.sub.2O, 72.0 g.Math.L.sup.−1 C-pur (Cerestar 01908, Cerestar GmbH, Krefeld, Germany), 71.9 g.Math.L.sup.−1 galactose, 68.4 g.Math.L.sup.−1 cellobiose, 71.9 g.Math.L.sup.−1 D-arabinose or 72.0 g.Math.L.sup.−1 D-lactose were used, respectively, instead of maltose-monohydrate. Mixtures of maltose and glucose were prepared in the ratio of 90:10, 80:20 and 50:50 (v/v).

    (177) For the starch medium, a 4% (w/v) opalescent solution of soluble starch from Acros Organics (part of Thermo Fisher Scientific, Geel, Belgium) was generated. For this, sterile water was preheated to 90 °C in a water bath and the weighed portion of starch was added with stirring. Afterwards, the residual media components were added. To allow comparison to the starch-cultivation, a maltose minimal medium was created with comparable C-molarity (here net weight of 44.4 g.Math.L.sup.−1 maltose.Math.1H.sub.2O). Media of different pH and osmolarity were created by addition of correcting agents (HCl or NaOH), by varying the concentration of the carbon source maltose, or by addition of inositol, which is not metabolized according to our study (data not shown).
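
    The notion of comparable C-molarity can be checked with a short calculation. The sketch below is illustrative only: the molar masses are standard rounded values that do not appear in the text, lactose is assumed to be weighed as the monohydrate, starch is approximated as a polymer of anhydroglucose units, and the function names are hypothetical. Under these assumptions the listed carbon sources all come out close to 2.4 mol carbon per litre, and 4% (w/v) starch comes out close to 44.4 g.Math.L.sup.−1 maltose monohydrate.

```python
# Each entry: (mass concentration in g/L from the text, molar mass in g/mol, carbon atoms).
# Molar masses are standard rounded values; lactose is ASSUMED to be the monohydrate.
CARBON_SOURCES = {
    "maltose monohydrate": (72.06, 360.31, 12),
    "glucose monohydrate": (79.2, 198.17, 6),
    "galactose": (71.9, 180.16, 6),
    "cellobiose": (68.4, 342.30, 12),
    "D-arabinose": (71.9, 150.13, 5),
    "lactose (monohydrate assumed)": (72.0, 360.31, 12),
}


def c_molarity(grams_per_litre, molar_mass, carbon_atoms):
    """Moles of carbon atoms per litre of medium."""
    return grams_per_litre / molar_mass * carbon_atoms


def starch_c_molarity(grams_per_litre):
    """Approximate starch as anhydroglucose units (C6H10O5, 162.14 g/mol)."""
    return c_molarity(grams_per_litre, 162.14, 6)


for name, (conc, mass, carbons) in CARBON_SOURCES.items():
    print(f"{name}: {c_molarity(conc, mass, carbons):.3f} mol C per litre")

# 4% (w/v) corresponds to 40 g/L.
print(f"4% (w/v) starch: {starch_c_molarity(40.0):.3f} mol C per litre")
print(f"44.4 g/L maltose monohydrate: {c_molarity(44.4, 360.31, 12):.3f} mol C per litre")
```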

    (178) Furthermore, minimal media with 1 g.Math.L.sup.−1, 2 g.Math.L.sup.−1, 3 g.Math.L.sup.−1, 4 g.Math.L.sup.−1 and 5 g.Math.L.sup.−1 soluble starch from Acros Organics were created for cultivation under limited carbon-source.

    (179) The pH and osmolarity of all media were determined by the pH-meter Calimatic of Knick GmbH (Berlin, Germany) and the Osmomat 3000 of Gonotec GmbH (Berlin, Germany) according to the manufacturer's instructions.

    (180) Shake Flask Cultivation

    (181) Cultivations were performed in 250 mL Corning Erlenmeyer baffled cell culture flasks at 28 °C and 140 rpm for seven days in the GFL shake-incubators 3032 or 3033 (Burgwedel, Germany). For inoculation of 50 mL medium, 1 mL spore solution of an OD=3-5 was used. Cell dry weights were determined as described by Wolf et al. (2017a). The supernatant was stored at −20 °C for later analysis.

    (182) Miniaturized Cultivation in the BioLector System of m2p-Labs GmbH (Baesweiler, Germany)

    (183) Comparative growth experiments were performed in a 1 mL reaction volume in a 48-well FlowerPlate covered by a gas-permeable sealing foil (m2p-labs GmbH, Baesweiler, Germany) and incubated for 1 week at 28 °C and 800 rpm in the RoboLector of m2p-labs. Growth was recorded by the backscatter signal. For determination of final cell dry weights, 800 µL of each well was sampled in a weighed reaction tube, centrifuged (14,000 g, 2 min), washed with deionized water and dried for 1 day at 60-70 °C. The supernatant was stored at −20 °C for later analyses.

    (184) Recombinant DNA Work

    (185) Unless otherwise specified, plasmid construction and assembly was performed by Gibson Assembly (Gibson et al. 2009). Fragments were amplified by PCR (Phusion High-Fidelity PCR Master Mix with GC Buffer, NEB, Ipswich, MA, USA) in the Eppendorf thermocycler vapo.protect (Hamburg, Germany) and treated with DpnI (Thermo Fisher Scientific, Waltham, MA, USA), when necessary. Purification of PCR products and gel extracts was performed by use of the NucleoSpin Gel and PCR Clean-up kit (Macherey-Nagel, Düren, Germany). Equimolar amounts of the DNA fragments were added to the Gibson Assembly Master Mix in a ratio of 1:4. The master mix consists of 0.64 µL T5 Exonuclease (10 U.Math.µL.sup.−1, NEB, Ipswich, MA, USA), 20 µL Phusion High-Fidelity DNA Polymerase (2 U.Math.µL.sup.−1, Thermo Fisher Scientific, US), 160 µL Taq DNA Ligase (NEB, Ipswich, MA, USA), 699.36 µL aqua distilled and 320 µL isothermal reaction buffer (25% PEG-8000, 1 mL 1 M Tris-HCl, 100 µL 1 M MgCl.sub.2, 100 µL 1 M DTT, 20 µL each 1 mM dNTP, 200 µL NAD). The sample was incubated at 50 °C for at least 1 h and subsequently transferred to Escherichia coli DH5αMCR by chemical transformation according to a protocol of Beyer et al. (2015). Selection of E. coli was performed on Luria/Miller broth medium with 15 g.Math.L.sup.−1 agar-agar (Carl Roth GmbH & Co. KG, Karlsruhe, Germany) and 50 mg.Math.L.sup.−1 apramycin-sulfate. Positive colonies were tested by PCR and gel-electrophoresis as well as by Sanger sequencing by our in-house sequencing core facility.
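
    The component volumes of the master mix add up to 1200 µL, which makes it straightforward to scale the recipe to another batch size. The following is a minimal sketch of that arithmetic; the dictionary keys, the function names and the 300 µL batch size are hypothetical and chosen for illustration only.

```python
import math

# Component volumes in microlitres, as listed in the text for one batch of master mix.
MASTER_MIX_UL = {
    "T5 exonuclease": 0.64,
    "Phusion DNA polymerase": 20.0,
    "Taq DNA ligase": 160.0,
    "distilled water": 699.36,
    "isothermal reaction buffer": 320.0,
}


def total_volume(mix):
    """Sum of all component volumes in microlitres."""
    return sum(mix.values())


def scale_mix(mix, target_total_ul):
    """Scale every component so that the batch adds up to target_total_ul."""
    factor = target_total_ul / total_volume(mix)
    return {name: volume * factor for name, volume in mix.items()}


print(f"total: {total_volume(MASTER_MIX_UL):.2f} uL")
# Hypothetical smaller batch of 300 uL (batch size chosen for illustration only).
for name, volume in scale_mix(MASTER_MIX_UL, 300.0).items():
    print(f"{name}: {volume:.2f} uL")
```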

    (186) Construction of Plasmids for the gusA Reporter System

    (187) For the construction of plasmids for the gusA reporter system see Schaffert et al. (2019).

    (188) Construction of the Novel pSETT4 Expression System

    (189) For cloning of the novel pSETT4 expression system, the pSET152 vector of Bierman et al. (1992) was used as template. The vector backbone was linearized by PCR (Table E5).

    (190) The cloning cassette, consisting of the gapDH-promoter, a lacZ-gene under control of the lac-promoter and several restriction sites flanked by three T4-terminators, was ordered as string DNA at Integrated DNA Technologies (Iowa, USA).

    (191) Due to the complex structure, the cassette was ordered in three parts and assembled by GeneSOEing (Horton 1995) by use of the primers in Table E5. Finally, backbone and insert were assembled by Gibson Assembly (Gibson et al. 2009). The novel vector system was named pSETT4gap.

    (192) For exchange of the gapDH-promoter by the tipA-promoter, pSETT4gap was digested with NdeI and KpnI and treated with shrimp alkaline phosphatase following the instructions of the supplier. All enzymes were purchased from Thermo Fisher Scientific (Waltham, MA, USA). The tipA-promoter was amplified from pSETGUS (Myronovskyi et al. 2011) by use of the primers tipA_GAF and tipA_GAR and assembled with the linearized backbone by Gibson assembly (Gibson et al. 2009). The vector was named pSETT4tip (cf. FIG. 6).

    (193) TABLE-US-00007 TABLE E5 Gibson Assembly primer for assembly of the novel expression system pSETT4gap and pSETT4tip.
    fragment | template | size (bp) | primer sequence (5′-3′)
    pSET152_lin | pSET152 | 5114 | CTACGGTGCCGCTTACCGGgctcactcaaaggcggtaatacgg
     |  |  | CAGACGTCAGCGACGACAGAGaaccatcggcgcagctatttac
    genesoeing_for | IDT-order 1 and 2 | 1473 | CTCTGTCGTCGCTGACGTCTG
    genesoeing_1r |  |  | CAGATCTGGAGTCGGTCTAATTT
    genesoeing_2f | IDT-order 2 and 3 | 878 | AGGGTTTTCCCAGTCACGACG
    genesoeing_rev |  |  | CCGGTAAGCGGCACCGTAG
    tipA_GAF | pSETGUS | 146 | GTGGCCCATGCGAGAGTACAATCCCTAGAACGTCCGGG
    tipA_GAR |  |  | TCAACATAAGGTCTCGGTACCATCGGAATACCTCCGTTGCT
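
    The primers in Table E5 can be checked for the overlaps that an isothermal assembly relies on. The following is a minimal sketch under two assumptions that the text does not state: that the upper-case 5′ part of each backbone primer is the assembly overhang, and that the primer sequences wrapped across table lines are to be joined without gaps. The helper names are hypothetical.

```python
from itertools import takewhile


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence, preserving letter case."""
    table = str.maketrans("ACGTacgt", "TGCAtgca")
    return seq.translate(table)[::-1]


def overhang(primer: str) -> str:
    """Leading upper-case part of a primer (assumed to be the assembly overhang)."""
    return "".join(takewhile(str.isupper, primer))


# Primer sequences from Table E5 (pSET152_lin pair and the outer cassette primers).
backbone_primer_1 = "CTACGGTGCCGCTTACCGGgctcactcaaaggcggtaatacgg"
backbone_primer_2 = "CAGACGTCAGCGACGACAGAGaaccatcggcgcagctatttac"
genesoeing_for = "CTCTGTCGTCGCTGACGTCTG"
genesoeing_rev = "CCGGTAAGCGGCACCGTAG"

# Each backbone overhang should be the reverse complement of one cassette end.
print(overhang(backbone_primer_1) == reverse_complement(genesoeing_rev))
print(overhang(backbone_primer_2) == reverse_complement(genesoeing_for))
```

    Under these assumptions both comparisons hold, i.e. the linearized backbone carries ends that match the two outer primers of the cloning cassette.
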
    Overexpression of Single Genes in the Novel pSETT4 Expression System

    (194) For the overexpression of single genes, the insert was amplified by PCR (Table E6). The vector (pSETT4gap or pSETT4tip) was digested with BsaI (NEB, Ipswich, MA, USA) and assembled with the insert by Gibson Assembly (Gibson et al. 2009). For expression of the acbB gene under control of the native promoter, the vector backbone pSETT4gap was digested with BsaI and NdeI, leading to the linearization of the vector under removal of the promoter. The gene of interest and the native promoter were amplified by use of the primers in Table E6 and assembled with the vector backbone by Gibson Assembly (Gibson et al. 2009).

    (195) TABLE-US-00008 TABLE E6 Primer for amplification of inserts for Golden Gate cloning and restriction cloning into the pSETT4gap and pSETT4tip vector system.
    fragment | template | size (bp) | primer sequence (5′-3′)
    acbB for pSETT4gap | gDNA | 1008 | GAGTATCTGAAAGGGGATACGCATGAAAATCTTGGTCACCGGCGGAGC
     |  |  | GGCGGAAAATCACGCGGCACGAATCAGGTCCACCAGGAACGGTTGGC
    acbB for pSETT4tip | gDNA | 1006 | CGAGCAACGGAGGTATTCCGATGAAAATCTTGGTCACCGGCGGAGC
     |  |  | GGCGGAAAATCACGCGGCACGAATCAGGTCCACCAGGAACGGTTGGC
    P.sub.acbB:acbB for pSETT4 | gDNA | 1136 | GGCCCATGCGAGAGTACATAGCCAGCCTTTCATGATATATCTC
     |  |  | AATCACGCGGCACGAAACGCACCGGATCCATGTTGTGTGG
    gtaB for pSETT4tip | gDNA | 950 | GCAACGGAGGTATTCCGATGACGACGAACGCGCAAGGG
     |  |  | GGAAAATCACGCGGCACGAAGTCATCCCTTCTGACCACCGACG
    Construction of pCRISPomyces-2 Deletion and Integration Vectors

    (196) For the construction of deletion and integration mutants by CRISPR/Cas9 technique, the plasmid pCRISPomyces-2 (Cobb et al. 2015) was used according to a protocol of Wolf et al. (2016). The spacer and its reverse complement were ordered at metabion GmbH (Steinkirchen, Germany) or Sigma-Aldrich (Taufkirchen, Germany) as oligonucleotides with overlap (Table E7).

    (197) The oligonucleotides were annealed to a double-strand and assembled with the plasmid by Golden Gate Assembly (Engler et al. 2008) according to the protocol of Cobb et al. (2015). For repair of the Cas9-induced double-strand break, a DNA template was cloned into the vector backbone by Gibson Assembly (Gibson et al. 2009). As DNA template, flanking sequences up- and downstream of the target gene (approximately 1 kb each) were amplified by PCR (Table E8) from genomic DNA.

    (198) TABLE-US-00009 TABLE E7 Spacer and the reverse complement used in a Golden Gate Assembly with pCRISPomyces-2.
    gene | oligo 1 (5′-3′) | oligo 2 (5′-3′)
    cgt (ACSP50_5024) | acgcAGCGTCGCCCGCTGGGAGAA | aaacTTCTCCCAGCGGGCGACGCT
    merR (ACSP50_0145) | acgcGACCGGGGGCTGTCCGGGAG | aaacCTCCCGGACAGCCCCCGGTC
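
    The oligonucleotide pairs in Table E7 follow a simple pattern: oligo 1 is the spacer with a 5′ acgc overhang, and oligo 2 is the reverse complement of the spacer with a 5′ aaac overhang. The following is a minimal sketch that reproduces both pairs from the spacer sequences; the function names are hypothetical, and the overhangs are simply read off the table rather than taken from a stated design rule.

```python
def reverse_complement(seq: str) -> str:
    """Reverse complement of an upper-case DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def golden_gate_oligos(spacer: str, fwd_overhang: str = "acgc", rev_overhang: str = "aaac"):
    """Return (oligo 1, oligo 2) for a spacer, using the overhangs seen in Table E7."""
    spacer = spacer.upper()
    return fwd_overhang + spacer, rev_overhang + reverse_complement(spacer)


# Spacer sequences taken from the upper-case part of oligo 1 in Table E7.
SPACERS = {
    "cgt (ACSP50_5024)": "AGCGTCGCCCGCTGGGAGAA",
    "merR (ACSP50_0145)": "GACCGGGGGCTGTCCGGGAG",
}

for gene, spacer in SPACERS.items():
    oligo_1, oligo_2 = golden_gate_oligos(spacer)
    print(gene, oligo_1, oligo_2)
```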

    (199) TABLE-US-00010 TABLE E8 Gibson Assembly primer for pCRISPomyces-2 deletion and integration vectors.
    gene | primers for flank 1 (5′-3′) | size (bp) | primers for flank 2 (5′-3′) | size (bp)
    cgt (ACSP50_5024) | tcggttgccgccgggcgttttttatCCGGTACCCTGCTCCTCGTC | 1101 | gtatctgagccatatccctcGACCTGCGTCAATGCGTCAC | 982
     | gtgacgcattgacgcaggtcGAGGGATATGGCTCAGATAC |  | gcggcctttttacggttcctggcctACCTGACCCTGCTGAAATGG |
    merR (ACSP50_0145) | tcggttgccgccgggcgttttttatCTCCGGGCGCCGACCGGCAC | 1115 | gcaggtggacggcctcggtgATCTCGGCGCTCAACGCCTC | 1129
     | gaggcgttgagcgccgagatCACCGAGGCCGTCCACCTGC |  | gcggcctttttacggttcctggcctCGGCAAACAGACCTACTACG |
    Deletion of the Gene Cgt by CRISPR/Cas9 Technique

    (200) For the construction of a cgt (ACSP50_5024) deletion mutant by CRISPR/Cas9 technique (clustered regularly interspaced short palindromic repeats/CRISPR-associated endonuclease 9), the plasmid pCRISPomyces-2 was used (Cobb et al. 2015). The spacer sequence was selected according to Wolf et al. (2016) and ordered as oligonucleotides together with its reverse complement at metabion GmbH (Steinkirchen, Germany) (spacer_1: 5′-acgcAGCGTCGCCCGCTGGGAGAA-3′, spacer_2: 5′-aaacTTCTCCCAGCGGGCGACGCT-3′). The oligonucleotides were annealed to a double-strand and assembled with the plasmid by Golden Gate Assembly (Engler et al. 2008) by use of BsaI (NEB, Ipswich, MA, USA) according to the protocol of Cobb et al. (2015). For repair of the Cas9-induced double-strand break, a deoxyribonucleic acid (DNA) template was cloned into the XbaI-linearized vector by Gibson Assembly (Gibson et al. 2009). As DNA template, flanking sequences up- and downstream of the target gene (approximately 1 kb each) were amplified by polymerase chain reaction (PCR) with the Phusion High-Fidelity PCR Master Mix with GC Buffer (NEB, Ipswich, MA, USA) (primer sequences: cgt_flank1_fw: 5′-tcggttgccgccgggcgttttttatCCGGTACCCTGCTCCTCGTC-3′, cgt_flank1_rv: 5′-gtgacgcattgacgcaggtcGAGGGATATGGCTCAGATAC-3′, cgt_flank2_fw: 5′-gtatctgagccatatccctcGACCTGCGTCAATGCGTCAC-3′, cgt_flank2_rv: 5′-gcggcctttttacggttcctggcctACCTGACCCTGCTGAAATGG-3′). For Gibson Assembly, the DNA fragments (flank_1: 1101 bp and flank_2: 982 bp) were mixed in equimolar amounts and added in a ratio of 1:4 to the Gibson Assembly Master Mix consisting of 0.64 µL T5 Exonuclease (10 U/µL, NEB, Ipswich, MA, USA), 20 µL Phusion High-Fidelity DNA Polymerase (2 U/µL, Thermo Fisher Scientific, US), 160 µL Taq DNA Ligase (40 U/µL, NEB, Ipswich, MA, USA), 699.36 µL aqua distilled and 320 µL isothermal reaction buffer (25% PEG-8000, 1 mL 1 M Tris-HCl, 100 µL 1 M MgCl.sub.2, 100 µL 1 M DTT, 20 µL each 1 mM dNTP, 200 µL NAD). After incubation at 50 °C for at least 1 h, the reaction mix was transferred to Escherichia coli DH5αMCR by chemical transformation according to a protocol of Beyer et al. (2015). Growth and selection of E. coli was performed by plating them on Luria/Miller broth (LB-media) with 15 g.Math.L.sup.−1 agar-agar Kobe I (both: Carl Roth GmbH & Co. KG, Karlsruhe, Germany) supplemented with 50 mg.Math.L.sup.−1 apramycin-sulfate. Plates were incubated for 10-14 h at 37 °C. Apramycin-resistant colonies were tested first by PCR and gel-electrophoresis and second by Sanger sequencing by our in-house sequencing core facility (primer sequences for PCR: for: 5′-GGCGTTCCTGCAATTCTTAG-3′, rev: 5′-TCGCCACCTCTGACTTGAGC-3′; walking primers for sequencing: w1: 5′-CGCTGATCTTCAGCTTCC-3′, w2: 5′-GCCTTCACCTTCCATCTG-3′, w3: 5′-TCGGGAAAGCCGCCGGAG-3′).
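
    Mixing two fragments in equimolar amounts means that the DNA mass has to scale with fragment length. The following is a minimal sketch of that conversion for the two flanks; it assumes the common approximation of 650 g.Math.mol.sup.−1 per base pair of double-stranded DNA, which is not given in the text, and the target amount of 0.1 pmol as well as the function name are hypothetical.

```python
# Approximate average molar mass of one base pair of double-stranded DNA (assumption).
GRAMS_PER_MOL_PER_BP = 650.0


def nanograms_for_pmol(length_bp: int, pmol: float) -> float:
    """Mass in ng of a dsDNA fragment of length_bp that corresponds to pmol picomoles."""
    return pmol * length_bp * GRAMS_PER_MOL_PER_BP / 1000.0


# Fragment lengths from the text; 0.1 pmol per fragment is a hypothetical target amount.
FLANKS_BP = {"flank_1": 1101, "flank_2": 982}
for name, length_bp in FLANKS_BP.items():
    print(f"{name}: {nanograms_for_pmol(length_bp, 0.1):.1f} ng for 0.1 pmol")
```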

    (201) Conjugal Transfer to Actinoplanes Sp. SE50/110

    (202) Competent Actinoplanes sp. SE50/110 cells were prepared from a freshly grown NBS-culture (see above). Cells were washed twice in 10% (w/v) ice-cold sucrose and twice in ice-cold 15% (v/v) glycerol. Finally, the cells were taken up in 15% (v/v) ice-cold glycerol (by addition of approximately the four-fold volume of the cell pellet), aliquoted to 100 µL in reaction tubes and snap-frozen in liquid nitrogen. The competent Actinoplanes cells were stored at −80 °C.

    (203) For conjugation, Escherichia coli ET12567/pUZ8002 (Kieser et al. 2000) was used. After transfer of the desired construct into E. coli ET12567/pUZ8002 according to Beyer et al. (2015) and selection on LB agar plates supplemented with 50 mg.Math.L.sup.−1 apramycin-sulfate, 50 mg.Math.L.sup.−1 kanamycin-sulfate and 15 mg.Math.L.sup.−1 chloramphenicol, cells were grown in liquid culture (LB-medium with the same supplements) and harvested at an optical density of 0.4-0.6. The cells were washed twice in ice-cold LB medium and mixed with competent cells of Actinoplanes sp. SE50/110. The cell suspension was plated on SFM agar plates. After 20-24 h of incubation at 28 °C, 1 mL 500 mg.Math.L.sup.−1 apramycin-sulfate dissolved in ddH.sub.2O was distributed on the plate with a sterile swab. First exconjugants of Actinoplanes sp. SE50/110 could be observed after 1 week. Exconjugants were transferred to an SFM agar plate supplemented with 50 mg.Math.L.sup.−1 apramycin-sulfate. Streaking was repeated several times to purify Actinoplanes exconjugants from E. coli. To expedite this process, 50 mg.Math.L.sup.−1 fosfomycin or trimethoprim can be added to the medium to remove the donor strain.

    (204) Plasmid Curing to Obtain Marker-Free CRISPR/Cas9 Deletion/Integration Mutants of Actinoplanes Sp. SE50/110

    (205) Plasmid curing was performed according to the protocol of Wolf et al. (2016) by cultivation in the complex medium NBS at elevated temperatures. Colonies were tested for the presence of the plasmid by parallel streaking on apramycin-containing and apramycin-free SFM plates. Apramycin-sensitive exconjugants were tested for the deletion by PCR (primer sequence data not shown). The PCR fragment was excised from the gel and sequenced by our in-house Sanger sequencing core facility.

    (206) Additionally, genomic DNA of the deletion or integration mutant was sequenced by the Oxford Nanopore technique (Oxford, UK) to exclude off-target effects. For this, genomic DNA of an NBS-grown culture was isolated with the NucleoSpin Microbial DNA kit (Macherey-Nagel, Düren, Germany). A library was prepared with help of the 1D Genomic DNA by ligation-kit (Oxford Nanopore, Oxford, UK).

    (207) Deletion System Based on Homologous Recombination and Counterselection with the Cytosine Deaminase CodA.

    (208) Vector integration into genes of the acb gene cluster has occurred by use of the replicative vector pKC1139. Based on this observation, a novel deletion system using homologous recombination was developed and tested by example of the gene cgt (ACSP50_5024).

    (209) A vector backbone with origin of transfer (incP) and relaxosome gene traJ was used to allow conjugation into Actinoplanes sp. SE50/110. In this work, two different antibiotic resistance markers mediating apramycin and kanamycin resistance were tested for selection: aph(3′)II (kanR, kanamycin) and aac(3)IV (apmR, apramycin). Furthermore, the high-copy-number ColE1/pMB1/pBR322/pUC origin of replication was integrated to allow replication in the donor strain E. coli. The ori, the oriT (incP), the tra gene and the resistance cassettes were taken from pRT802 or pRT801 (Gregory et al. 2003). Since the novel deletion system contains neither a replicon for replication in Actinoplanes sp. SE50/110 nor an integrase gene with attachment site, the vector can only be maintained in Actinoplanes sp. SE50/110 when integrated into the genome by homologous recombination (FIG. 7). For this, homologous sequences of 2 kb, which flank the gene cgt, were integrated. After conjugal transfer into Actinoplanes sp. SE50/110, mutants in which the first crossover has taken place can be selected by apramycin or kanamycin resistance. To force disintegration of the vector backbone (second crossover), 5-fluorocytosine (5-FC) is added, which is converted into the toxic product 5-fluorouracil (5-FU) by the cytosine deaminase CodA. In this work, codA(s) is used, which is codon-optimized for Streptomyces ssp. (Dubeau et al. 2009). After the second crossover, either the genotype of the wild type or the genotype of the deletion mutant is present.

    (210) The novel deletion system was successfully tested for the gene cgt, which was shown by colony PCR and ONT-sequencing. The proportion of deletion mutants after successful second crossover was between 25% and 32%. The workflow is illustrated in FIG. 8.

    (211) Analytical Methods

    (212) Acarbose Quantification from the Supernatant by High Performance Liquid Chromatography (HPLC)

    (213) Supernatants of maltose-grown cultures of Actinoplanes ssp. were centrifuged (20,000 g, 2 min), mixed 1:5 with methanol by vortexing and centrifuged again to remove the precipitate (20,000 g, 2 min). The samples were transferred to HPLC vials and analyzed in the HPLC system 1100 series of Agilent (G1312A Binary Pump Serial #DE43616357, G1329A ALS autosampler Serial #DE43613/10, G1315A diode-array detector (DAD) Serial #DE72002469). As stationary phase the Hypersil APS-2 column (125×4 mm, 3 µm particle size) of Thermo Fisher Scientific Inc. (Waltham, MA, USA) was used, heated to 40 °C. As mobile phase an isocratic flow of 1 mL.Math.min.sup.−1 68% acetonitrile (solvent B) and 32% phosphate buffer (0.62 g.Math.L.sup.−1 KH.sub.2PO.sub.4 and 0.38 g.Math.L.sup.−1 Na.sub.2HPO.sub.4.Math.2H.sub.2O) (solvent A) was applied. 40 µL of each sample was injected and separated in a 10 min run. Detection of acarbose was carried out with a DAD detector at 210 nm (Reference 360 nm) and quantified from the peak areas of a calibration curve.
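
    Quantification from a calibration curve can be illustrated with an ordinary least-squares line through standards of known concentration. The following is a minimal sketch and not the evaluation actually used: the standard concentrations and peak areas are invented for illustration, a linear detector response is assumed, and the dilution factor is left as a parameter because the text does not state whether the standards were diluted in the same way as the samples.

```python
def fit_line(x, y):
    """Ordinary least-squares fit y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def concentration_from_area(area, slope, intercept, dilution_factor=1.0):
    """Invert the calibration line and correct for an optional sample dilution."""
    return (area - intercept) / slope * dilution_factor


# Hypothetical standards: concentration in g/L and peak area in arbitrary units.
standard_conc = [0.1, 0.25, 0.5, 1.0]
standard_area = [200.0, 500.0, 1000.0, 2000.0]

slope, intercept = fit_line(standard_conc, standard_area)
print(f"slope={slope:.1f}, intercept={intercept:.1f}")
print(f"sample: {concentration_from_area(1500.0, slope, intercept):.3f} g/L")
```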

    (214) Liquid Chromatography-Mass Spectrometry (LC-MS)

    (215) Sample Preparation for Analysis of Intracellular Metabolites

    (216) Triplicates of Actinoplanes sp. SE50/110 strains were grown in maltose minimal medium for at least 4 days. 10 mL of the culture were quickly filtrated through filtering paper by a Büchner funnel and washed with 2.63 g.Math.L.sup.−1 NaCl solution. Cells were transferred into pre-weighed round bottom screw-cap tubes, snap-frozen in liquid nitrogen and stored at −80 °C. Cells were dried overnight in the Centrifugal Evaporator (SpeedVac) of Thermo Fisher Scientific (Waltham, MA, USA). 4 mg dried cells were transferred into a fresh 2 mL screw-cap tube containing approximately 500 µL of a mixture of zirconia/silica micro beads of the sizes 0.1 mm, 0.05 mm and 0.01 mm (Bio Spec Products Inc., Bartlesville, USA). 700 µL 80% MeOH was added to the cells and beads. Cell disruption was carried out in a homogenizer (FastPrep FP120, Thermo Fisher Scientific, Waltham, MA, USA) for three times 30 s at speed setting 6.5. Samples were cooled for 5 min on ice in between. The cell suspension was centrifuged for 5 min at 13,000 g and 4 °C. 500 µL of the supernatant was transferred into HPLC vials, dried under nitrogen flow and taken up in 50 µL distilled water.

    (217) Sample Preparation for Analysis of Extracellular Acarviosyl-Metabolites

    (218) The sample preparation was conducted according to a protocol described by Ortseifen (2016). Sugars and pseudo-sugars were enriched from 10 mL of the supernatant by solid phase extraction using the Chromabond Easy columns (Macherey-Nagel, Düren, Germany, REF 730753). The columns were equilibrated with 3 mL methanol and afterwards washed with 3 mL distilled water before loading of the sample. Unspecifically bound metabolites were rinsed off with 3 mL 95% (v/v) methanol. Elution was conducted in 3 mL methanol.

    (219) LC-ESI-MS of Intracellular and Extracellular Metabolites

    (220) For LC-MS, the LaChromUltra (Hitachi Europe Ltd., UK) HPLC system coupled to a microTOF-Q hybrid quadrupole/time-of-flight mass spectrometer (Bruker Daltonics, Bremen, Germany) was used, which was equipped with an electrospray ionization (ESI) source.

    (221) For the analysis of intracellular metabolites, 2 µL of the sample was separated with the SeQuant ZIC-PHILIC 5 µm Polymeric column (150×2.1 mm) (Merck, Darmstadt, Germany). Eluent A (20 mM NH.sub.4HCO.sub.3, pH 9.3, adjusted with aqueous ammonia solution) and eluent B (acetonitrile) were applied at a flow rate of 0.2 mL.Math.min.sup.−1 by use of the following gradient: 0 min B: 90%, 30 min B: 25%, 37.5 min B: 25%, 40.0 min B: 80%.
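
    The gradient for intracellular metabolites is given as a list of time points with the corresponding share of eluent B. The following is a minimal sketch that returns the share of eluent B at an arbitrary time; it assumes linear ramps between the listed set points, which the text does not state explicitly, and the function name is hypothetical.

```python
# Gradient set points from the text: (time in min, percent eluent B).
GRADIENT = [(0.0, 90.0), (30.0, 25.0), (37.5, 25.0), (40.0, 80.0)]


def percent_b(time_min: float) -> float:
    """Percent eluent B at time_min, assuming linear ramps between set points."""
    if time_min < GRADIENT[0][0] or time_min > GRADIENT[-1][0]:
        raise ValueError("time outside the programmed gradient")
    for (t0, b0), (t1, b1) in zip(GRADIENT, GRADIENT[1:]):
        if t0 <= time_min <= t1:
            return b0 + (b1 - b0) * (time_min - t0) / (t1 - t0)
    raise AssertionError("unreachable for a sorted gradient table")


for t in (0.0, 15.0, 35.0, 38.75, 40.0):
    print(f"{t:5.2f} min: {percent_b(t):.1f}% B")
```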

    (222) As standards for the peak identification, 2 µL of 10 µM of UDP-glucose, glucose-1-phosphate, galactose-1-phosphate, glucose-6-phosphate and dTDP-glucose were injected.

    (223) For the analysis of extracellular acarviosyl-metabolites, 10 µL of the sample was separated with the Cogent Diamond Hydride HPLC column (MicroSolv Technology Corporation; 150 mm × 2.1 mm; 3 µm particle size). Eluent A (50% (v/v) acetonitrile, 50% (v/v) H.sub.2O and 0.1% (v/v) formic acid) and eluent B (90% (v/v) acetonitrile, 10% (v/v) H.sub.2O and 0.1% (v/v) formic acid) were applied at a flow rate of 0.4 mL.Math.min.sup.−1 by use of the following gradient: 0 min B: 100%, 8 min B: 0%, 13 min B: 0%, 15.5 min B: 100%, 18 min B: 100%.

    (224) The ESI source was operated in the negative ionization mode for analysis of intracellular metabolites and in the positive ionization mode for analysis of extracellular acarviosyl-metabolites. The temperature of the dry gas and the capillary was set to 180 °C. The scan range of the MS was set to 200-1,000 m/z (intracellular metabolites) and 50-3,000 m/z (extracellular acarviosyl-metabolites), respectively.

    (225) The peak areas of specific masses were integrated using the software Compass (Bruker Daltonics, Bremen, Germany). Peaks were normalized to the weighed amount of dried cells (intracellular metabolites) or to the cell dry weight at sampling time (extracellular acarviosyl-metabolites).

    (226) Extraction and Analysis of Carotenoids

    (227) Extraction

    (228) Cell pellets from Actinoplanes sp. SE50/110 were transferred into a 2 mL screw-cap tube with approximately 500 µL of a mixture of zirconia/silica micro beads of the sizes 0.1 mm, 0.05 mm and 0.01 mm (Bio Spec Products Inc., Bartlesville, USA). 1 mL acetone or methanol was added as extracting solvent. Cell disruption was carried out in a homogenizer (FastPrep FP120, Thermo Fisher Scientific, Waltham, MA, USA) for three times 45 s at speed setting 6.5. Samples were cooled for 5 min on ice in between. The homogenized cell suspension was centrifuged for 20 min at 13,000 × g and 4 °C. The supernatants were transferred into glass vials. For HPLC analysis, the acetone and methanol extracts were mixed in a ratio of 7:3 and transferred into a new glass vial.

    (229) Thin Layer Chromatography (TLC) and Spectral Analysis

    (230) 50 µL of the extracted carotenoids were applied in 5 µL steps onto a silica gel matrix (HPTLC-HL, Cat. 58077, Analtech Inc., Newark, USA) and incubated in a TLC chamber filled with 100 mL petroleum, 11 mL isopropanol and 50 µL water. The run was carried out in darkness. After drying of the TLC plate, bands were scraped off with a scalpel and transferred into a new tube. After addition of 1 mL ethanol, the absorption spectrum was analyzed using the Genesys 10S UV-Vis spectrophotometer of Thermo Fisher Scientific (Waltham, MA, USA).

    (231) HPLC Analysis of Carotenoids with Absorbance Scan

    (232) Carotenoids were separated by reversed-phase HPLC according to Henke et al. (2017) and Heider et al. (2014) using the Agilent 1200 series HPLC system (Agilent Technologies GmbH & Co. KG, Böblingen, Germany) including a diode array detector (DAD) for the UV-Vis spectrum. A sample volume of 20 µL was applied at a flow rate of 0.5 mL.Math.min.sup.−1. As stationary phase, a pre-column (10 × 4 mm MultoHigh 100 RP18-5) and a main column (ProntoSIL 200-5 C30, 250 × 4 mm) from CS ChromatographieService GmbH (Langerwehe, Germany) were used, as described before (Heider et al. 2014; Henke et al. 2017).

    (233) The following gradient was applied: 0 min A: 100%, 32 min A: 75%, 47 min A: 0%, 70 min A: 0%, 75 min A: 100%. Eluent A consisted of 0.1 M ammonium acetate in deionized water and methanol in the ratio of 15:85 (v/v). Eluent B consisted of a mixture of methanol, acetonitrile and acetone in the ratio of 44:43:13 (v/v). Detection of carotenoids was conducted at 470 nm. Additionally, wavelength scans between 360 nm and 700 nm were performed each second during the run.

    (234) Assays

    (235) Promoter Screening Experiment by Spectrophotometric Measurement of the Glucuronidase Activity

    (236) Two different types of glucuronidase assay were carried out: one with protein raw extract and one with entire cells. The protocols described by Horbal et al. (2013) and Siegl et al. (2013) were adapted to Actinoplanes sp. SE50/110. The substrate 5-bromo-4-chloro-3-indolyl-β-D-glucuronide (X-Gluc, AppliChem GmbH, Darmstadt, Germany) was chosen, as the substrate p-nitrophenyl-β-D-glucuronide turned out to dissociate under our assay conditions.

    (237) Growth Conditions and Sample Preparation

    (238) Actinoplanes mutants carrying promoter constructs with the gusA gene were cultivated for one week in maltose minimal medium, as described above. The assays were conducted during growth phase. 500 µL of each culture was sampled for an assay with entire cells. 1 mL was sampled for an assay with protein raw extract and transferred to a screw cap tube containing zirconia/silica micro beads (Bio Spec Products Inc., Bartlesville, USA) of the sizes 0.1 mm and 0.05 mm. Cells were disrupted in a homogenizer (FastPrep FP120, Thermo Fisher Scientific, Waltham, MA, USA) for two times 30 s at speed setting 6.5 with 5 min on ice in between. After centrifugation, the lysate was transferred to a new reaction tube and centrifuged. The supernatant was used for a cell-free assay. Total protein quantification was carried out by a Bradford assay (see above).

    (239) Glucuronidase (Gus) Assay

    (240) The gus assay was performed in a black microtiter plate (96 well PS F-bottom µCLEAR, black, med. binding, Greiner Bio-One, Kremsmünster, Austria, REF 655096). 100 µL of each sample (either cell suspension or lysate) was pipetted into three wells, of which one served as negative control and two as technical replicates. The gus buffer (50 mM phosphate buffer pH 7.0 (5.136 g.Math.L.sup.−1 Na.sub.2HPO.sub.4.Math.2H.sub.2O, 3.299 g.Math.L.sup.−1 NaH.sub.2PO.sub.4.Math.2H.sub.2O) with 5 mM DTT and 0.1% Triton-X-100) was supplemented with 2 mM substrate X-Gluc (stock solution: 0.2 M in DMF). 100 µL was added to 100 µL of the sample. For the negative control, 100 µL gus buffer without substrate was added. In addition to the individual negative control of each sample, medium and substrate controls were also prepared.
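As a plausibility check of the buffer recipe above, the two phosphate salt weights can be converted to molar concentrations. This is a minimal sketch assuming both salts are the dihydrates and using standard molar masses (177.99 and 156.01 g/mol, values not given in the text); the variable names are illustrative.

```python
# Standard molar masses in g/mol (assumption: both salts are dihydrates).
M_NA2HPO4_2H2O = 177.99
M_NAH2PO4_2H2O = 156.01

# Weights per litre as stated in the buffer recipe.
dibasic_mM = 5.136 / M_NA2HPO4_2H2O * 1000.0
monobasic_mM = 3.299 / M_NAH2PO4_2H2O * 1000.0
total_phosphate_mM = dibasic_mM + monobasic_mM

# Mixing 100 µL buffer (2 mM X-Gluc) with 100 µL sample halves the
# substrate concentration in the well.
xgluc_in_well_mM = 2.0 * 100.0 / (100.0 + 100.0)
```

Under these assumptions the two salts sum to approximately 50 mM total phosphate, in line with the stated buffer strength, and the substrate concentration in the well is 1 mM.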

    (241) The microtiter plate was measured in a pre-warmed Tecan reader Infinite M200 (Ref 30016056, Tecan Group AG, Männedorf, Switzerland) (37 °C) for 3 hours (assay with entire cells) or for 2 hours (assay with lysate). The absorption maxima of indigo were measured at 610 and 660 nm. After subtracting the absorption values of all controls, the slope of each absorption curve was calculated by linear regression and normalized either to cell dry weight (assay with entire cells) or to whole protein amount (assay with lysate). The normalized slope was used to compare the β-glucuronidase activities in the different mutants.
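The evaluation described above (control subtraction, linear regression, normalization) can be sketched as follows. This is a simplified illustration, not the evaluation script used by the inventors: it subtracts a single matched control trace, fits an ordinary least-squares slope over all time points, and divides by the biomass or protein amount. The function names are hypothetical.

```python
def regression_slope(times, values):
    """Ordinary least-squares slope of values over times."""
    n = len(times)
    mean_t = sum(times) / n
    mean_v = sum(values) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    sxy = sum((t - mean_t) * (v - mean_v) for t, v in zip(times, values))
    return sxy / sxx


def normalized_activity(times_min, sample_abs, control_abs, normalizer):
    """Slope of the control-corrected absorption curve per unit of
    cell dry weight (whole-cell assay) or total protein (lysate assay)."""
    corrected = [s - c for s, c in zip(sample_abs, control_abs)]
    return regression_slope(times_min, corrected) / normalizer


# Hypothetical readings, for illustration only.
example = normalized_activity(
    times_min=[0, 10, 20, 30],
    sample_abs=[0.10, 0.20, 0.30, 0.40],
    control_abs=[0.05, 0.05, 0.05, 0.05],
    normalizer=2.0,
)
```

In practice the regression would be restricted to the linear part of the curve; that selection step is omitted here.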

    (242) Screening Experiments in the Biolog OmniLog Phenotypic Microarray System

    (243) Pre-screening experiments were performed in the Biolog OmniLog Identification System (Hayward, CA, USA) to evaluate respiration on different carbon sources (panel PM1 and PM2). Actinoplanes sp. SE50/110 wild type and the deletion mutant Δcgt were grown on SFM agar plates, as described elsewhere herein. Cells were harvested with a sterile swab and diluted in the inoculating fluid IF-0a for PM1 and PM2. The turbidity of the cell suspension was adjusted to 80% transmittance in the turbidimeter of Biolog, according to the manufacturer's protocol. 2.32 mL of the cell suspension was added to 20 mL IF-0a, 0.24 mL 0.5 M MgCl.sub.2, 0.24 mL 0.5 M Na.sub.2SO.sub.4, 0.24 mL 1.5 M NH.sub.4Cl, 0.24 mL 1.0 M Na.sub.3PO.sub.4, 0.24 mL distilled water, 0.24 mL Biolog redox dye mix G, and 0.24 mL metal ion cocktail (5.0 mM each: ZnCl.sub.2.Math.7H.sub.2O, FeCl.sub.2.Math.6H.sub.2O, MnCl.sub.2.Math.4H.sub.2O, CaCl.sub.2.Math.2H.sub.2O), according to the manufacturer's protocol. The PM panels were inoculated with 100 µL per well of the prepared solution and incubated for 1 week in the OmniLog system (Model 71000 Serial #406) at 28-30 °C. Data evaluation was carried out with the manufacturer's software (Kinetic Analysis, Biolog and Omnilog 2.3, Biolog).
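The volumes of the inoculation mix listed above can be tallied to check that one batch covers both panels. The sketch below assumes standard 96-well PM plates and one batch shared between PM1 and PM2 (neither is stated explicitly in the text); the dictionary keys are only labels.

```python
# Component volumes in mL, as listed in the protocol.
components_ml = {
    "cell suspension": 2.32,
    "IF-0a": 20.0,
    "MgCl2 (0.5 M)": 0.24,
    "Na2SO4 (0.5 M)": 0.24,
    "NH4Cl (1.5 M)": 0.24,
    "Na3PO4 (1.0 M)": 0.24,
    "distilled water": 0.24,
    "redox dye mix G": 0.24,
    "metal ion cocktail": 0.24,
}

total_ml = round(sum(components_ml.values()), 2)

# Assumption: 96 wells per plate, 100 µL per well, two plates (PM1, PM2).
required_ml = round(2 * 96 * 0.100, 2)
surplus_ml = round(total_ml - required_ml, 2)
```

Under these assumptions the mix amounts to 24 mL, of which 19.2 mL are needed for two plates.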

    (244) RNA Work

    (245) Sampling and RNA Isolation

    (246) For transcriptome analysis, 2 × 1 mL culture were taken during growth phase, separated from the supernatant by centrifugation (10 s) and snap-frozen in liquid nitrogen. Pellets were stored at −80 °C until further processing.

    (247) For isolation of ribonucleic acid (RNA), frozen cell pellets were resuspended in 500 µL LB-buffer (NucleoSpin RNA Plus, Macherey-Nagel, Düren, Germany) and transferred to 2 mL lysing matrix tubes (0.1 mm spherical silica beads, MP Biomedicals, Santa Ana, California, USA). Cell disruption was carried out in a homogenizer (FastPrep FP120, Thermo Fisher Scientific, Waltham, MA, USA) for three times 20 s at speed setting 6.5 with 5 min on ice in between. Subsequently, the cell suspension was centrifuged for 5 min at 13,000 × g and 4 °C. The supernatant was used for RNA extraction using the NucleoSpin RNA Plus kit in combination with the rDNase Set (Macherey-Nagel, Düren, Germany) for an on-column DNA digestion. After clean-up and elution according to the manufacturer's protocol, the DNA digestion was repeated (in solution) and the sample was cleaned up again using the same kit. The sample was tested for residual DNA with two primer pairs that bind to the genomic DNA of Actinoplanes sp. SE50/110 and amplify small fragments of approximately 200-300 nt. DNA digestion and RNA clean-up were repeated, if necessary. The quantity of RNA was analyzed with the NanoDrop 1000 spectrometer (Peqlab, Erlangen, Germany).

    (248) Reverse Transcription Quantitative PCR

    (249) Reverse transcription quantitative PCR was carried out according to the protocol of Wolf et al. (2017a) using the SensiFast SYBR No-Rox One-Step kit (Bioline, London, UK) and 96 well lightcycler plates (Sarstedt, Nümbrecht, Germany) in a LightCycler 96 System of Roche (Mannheim, Germany). The relative RNA amount was normalized on total RNA (100 ng) and calculated as 2.sup.−ΔCq. ΔCq is the difference of the mean Cq in the mutant strain compared to the control strain. The primers in Table E9 were used for determination of the relative transcription of a gene.
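The relative quantification described above can be sketched in a few lines. This assumes the conventional form in which the relative amount is two raised to the negative difference of the mean Cq values (mutant minus control), with equal total RNA input and an amplification efficiency of 2; the function name is illustrative.

```python
from statistics import mean


def relative_transcript_amount(cq_mutant, cq_control):
    """Relative transcript amount of the mutant versus the control.

    Assumes equal total RNA input (here 100 ng) and perfect doubling
    per cycle; delta_cq is mean Cq(mutant) - mean Cq(control).
    """
    delta_cq = mean(cq_mutant) - mean(cq_control)
    return 2.0 ** (-delta_cq)


# Hypothetical Cq values: the mutant crosses the threshold two cycles
# later than the control, i.e. one quarter of the transcript amount.
example = relative_transcript_amount([22.0, 22.0, 22.0], [20.0, 20.0, 20.0])
```

A value above 1 indicates more transcript in the mutant, a value below 1 less.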

    (250) TABLE-US-00011 TABLE E9 Primers used in RT-qPCR experiments.
    Columns: genetic locus; forward primer (5′-3′); reverse primer (5′-3′); fragment size (bp)
    merR (ACSP50_0145) GAGCGATACGCCCCTGACC GGTGATGTCCGGGCTCGTG 309
    idi (ACSP50_0146) GCCTTCTCGGTCTTCCTCAC CGCCAATTCCTCGGTGAGA 168 C
    crtI (ACSP50_0147) CTCTCGGTCGGCGGATAC GAGCCGTCCGGGTAGTACG 158 C
    crtE (ACSP50_0148) TTCCTCGCCTCCCAGATCG CGCGAAGGTGTGCATCAG 210
    crtB (ACSP50_0149) CATGTGCACGCTCTGTATG AAGACCGCGATGGTGTGCA 185 G G
    acbZ (ACSP50_3590) CGGCAATTCGCTGTTCAGT TGTGCTTGACGGTGTCCAT 167 G C
    acbY (ACSP50_3591) TCCGAACGGTTCCTCTATCC AACTCGCTGAGCTGGTTGA 239 C
    acbX (ACSP50_3592) TCGGGATGCTGCACACCAA CGACGCGAACATCGCGAAA 191 C C
    acbW (ACSP50_3593) GGTGTACGACCGGAACATG GTTCGGCGTGGATGTGGTT 224 C G
    acbV (ACSP50_3594) GCTTCCACGGCAAGACGAT GCGCTCACGTTGGGTTTCT 196 G C
    acbS (ACSP50_3596) GTTGCCGGACCGGTTCTAT CCCGGTACACCGACTTGTT 248 C G
    acbQ (ACSP50_3601) TGCTGGCGCAGATCTACTC AGCCGCAGATACATCGGGT 211 C C
    acbK (ACSP50_3602) CGAGGTCTACGCCTTCAAC AGAGGAAGCCGGACACGAA 248 G C
    acbC (ACSP50_3607) GATCGCGCTGATCAAGGAT CTGAACGTGTGCCCGTAGT 213 G C
    acbB (ACSP50_3608) GTCGACAAACTGGGTTACG GTCCAGTAGCACCTGAGTG 231 G
    acbA (ACSP50_3609) TCATGCTCGGCGACAACCT GACCGGTTTCTCCTCGATG 173 G G
    acbE (ACSP50_3610) GCGCGGCATGAAGATCTAC CGGACGGCTTCTCGAAGAA 218 C C
    acbD (ACSP50_3611) ACGCCAACTACTGGATGGA TCGAGCGGTTGGTGTAGAA 231 C G
    cgt (ACSP50_5024) CACCACGTACTGGAACTC GCGACCTTCAACGTGAC 192
    gtaB/galU (ACSP50_7820) CTCGCCTTCATCGAGGTCA GGCGATCGTCTCGAAGATC 192 C C
    gusA ACGCGGACATCCGCAACTA CCCTGGTGCTCCATCACTT 157 C C
    Whole Genome Oligonucleotide Microarray

    (251) Whole genome oligonucleotide microarrays were performed according to a protocol of Wolf et al. (2017a), who adapted the hybridization procedure to the high G+C content of Actinoplanes sp. SE50/110.

    (252) RNA of triplicates was isolated and pooled in equimolar amounts (total amount of 5 µg pooled RNA in 12 µL). For cDNA synthesis, labeling and microarray hybridization, the Two-Color Microarray-Based Prokaryote Analysis FairPlay III Labeling kit (Version 1.4, Agilent Technologies, Santa Clara, CA, USA) was used according to the manufacturer's instructions with practical adjustments described by Wolf et al. (2017a). The Amersham CyDye mono-reactive dye packs (GE Healthcare, Little Chalfont, UK) were utilized for labeling. A custom whole genome oligonucleotide microarray representing the coding sequence of Actinoplanes sp. SE50/110 was used, which was designed by Wolf et al. (2017a) (4 × 44K format, 43,803 features representing 8,238 genes and 1,417 control spots, supplier: Agilent Technologies, Santa Clara, CA, USA). All microarray-specific reagents and devices, including hybridization oven and scanner, were from Agilent Technologies (Santa Clara, CA, USA). The Agilent Feature Extraction Software Version 10.7.3.1 (Agilent Technologies, Santa Clara, CA, USA) was used for feature extraction (protocol GE2_107_Sep09). Subsequent data analysis, including LOWESS normalization and statistical analysis, was performed using the microarray and gene expression (MAGE)-compliant system EMMA 2 (Dondrup et al. 2009). A p-value of 0.05 was used as a cut-off for significance. The M-value cut-offs for a false discovery rate of 0.01 were determined as 1.1 and −1.1 according to previous yellow experiments performed by Wolf et al. (2017a).
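The two cut-offs named above can be combined into a simple filter. This is a minimal sketch, assuming symmetric M-value cut-offs of +1.1 and −1.1 (M being the log2 expression ratio) and a strict p-value threshold of 0.05; it does not reproduce the EMMA 2 workflow. The `Feature` type and the gene names are hypothetical.

```python
from typing import List, NamedTuple


class Feature(NamedTuple):
    gene: str
    m_value: float  # log2 ratio mutant/wild type after normalization
    p_value: float


def differentially_transcribed(
    features: List[Feature], m_cutoff: float = 1.1, p_cutoff: float = 0.05
) -> List[Feature]:
    """Keep features that pass both the p-value and the M-value cut-off."""
    return [
        f for f in features
        if f.p_value < p_cutoff and abs(f.m_value) > m_cutoff
    ]


# Hypothetical input, for illustration only.
example_features = [
    Feature("geneA", 1.5, 0.01),    # passes both cut-offs
    Feature("geneB", -2.0, 0.20),   # fails the p-value cut-off
    Feature("geneC", 0.5, 0.001),   # fails the M-value cut-off
    Feature("geneD", -1.3, 0.04),   # passes both cut-offs
]
hits = [f.gene for f in differentially_transcribed(example_features)]
```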

    (253) Analysis of the Functional Relevance of Cgt

    (254) Distribution of Single-Domain CBM-20 Proteins in the Eubacterial World

    (255) The inventors have analyzed the distribution of CBM-20 single-domain proteins in the prokaryotic world by BlastP analysis.

    (256) In brief, the distribution of singular CBM-20-domain proteins was analyzed by BlastP analyses using the NCBI non-redundant protein database (Altschul et al. 2005; Altschul et al. 1990). As CBM-20 domains occur in a variety of different proteins and enzymes, data filtering had to be performed: Of the initial 3,316 BlastP hits, all of eukaryotic origin and all enzymes with function-specific annotation or sizes above 350 amino acids were excluded. The domain structures of the remaining 80 BlastP hits were analyzed (Marchler-Bauer et al. 2017; Marchler-Bauer and Bryant 2004; Marchler-Bauer et al. 2015; Marchler-Bauer et al. 2010). Most of these, 53 proteins in total, contain two CBM-20 domains traversed by a higher domain described as glyco-hydro-77-superfamily 4-alpha-glucanotransferase. Ten contain different additional domains: five contain alpha-amylase inhibitor domains, two contain CBM-25 or CBM-26 binding domains at the N-terminus, two contain N-terminal domains of the IPT superfamily with probable regulatory function, and one contains a DUF1393 domain, which was described to occur in several alpha-amylases (information taken from the NCBI database). These candidates were also excluded. Only 18 candidates (including Cgt from Actinoplanes sp. SE50/110) displayed a singular CBM-20 domain. A protein tree was created by Blast tree view 1.17.5 of the NCBI database (NCBI database) on the basis of a multiple sequence alignment performed by BlastP (Altschul et al. 1990; Altschul et al. 2005).

    (257) Interestingly, singular CBM-20 domain-proteins were found in only 17 other species (FIG. 9). Most of these are found in species of the order Actinomycetales, for example in all strains of the genus Actinoplanes. The majority of the 17 species were originally isolated from soil and environmental samples, namely A. missouriensis (Parenti and Coronelli 1979), A. utahensis (described by Parenti and Coronelli (1979) and first isolated by Couch (1963)), A. teichomyceticus (Wink et al. 2006), Streptomyces sp. 94 (Chu et al. 1996), Streptomyces sp. OK885 (isolated from roots, Tennessee, USA, information taken from GenBank (Benson et al. 2013) of the NCBI (NCBI database)), Streptosporangium roseum (Nolan et al. 2010), Streptosporangium sclerotialus (syn. Chainia antibiotica) (Thirumalachar 1955), Cellulomonas sp. B6 (Piccinni et al. 2016), Paenibacillus sp. P22 (Hanak et al. 2014), and Clostridium sp. DMHC 10, which was isolated from the sludge of a distillery waste treatment plant (Kamalaskar et al. 2010). CBM-20 proteins also occur in Streptomyces sp. DI166, for which the sampling site has not been reported, and in multiple species of the family Pseudomonadaceae. These belong to genera that are known to include soil-inhabiting members.

    (258) Strains carrying singular CBM-20 proteins without direct connection to soil or environmental habitats occur only occasionally, such as in single isolates of the human pathogens Chlamydia trachomatis (Thomson et al. 2008) and Mycobacterium abscessus (Ryan and Byrd 2018; Moore and Frerichs 1953).

    (259) Confirmation of the Starch Binding Function by an In Vitro Assay

    (260) CBM-20 domains are described to have a starch binding function, which the inventors wanted to test by an in vitro assay. As the small carbohydrate binding protein Cgt is highly expressed and enriched in the extracellular space due to an N-terminal signal peptide (Wendler et al. 2015a), the protein could be directly concentrated from the supernatant by filtration. A starch binding assay was performed with starch from potato in different concentrations. Both the starch fraction and the supernatant were analyzed by SDS-PAGE. In all starch fractions (ranging from 1 to 10% (w/v) of starch), a protein band at about 15 kDa was detected, which was clearly identified as Cgt by MALDI-TOF-MS. In contrast, the supernatant fractions were almost completely depleted of Cgt. Residual Cgt was found in the supernatant, indicating that the added starch was completely saturated with Cgt. In the negative control without starch, most of the Cgt remained in the supernatant fraction. Besides Cgt, another small extracellular protein of unknown function, ACSP50_6253, was identified by the starch binding assay (data not shown).

    (261) Analysis of Cgt Expression During Growth on Different Carbon Sources

    (262) The gene cgt has been reported to be differentially expressed in the presence of different carbon sources, as determined by transcriptome and proteome analysis on glucose and maltose (Schwientek et al. 2013; Wendler et al. 2015a; Ortseifen 2016). The inventors have tested the effects of several carbon sources on the expression of the cgt gene by measuring the transcript amounts by reverse transcription quantitative PCR (RT-qPCR). For this purpose, the wild type strain of Actinoplanes sp. SE50/110 was grown on minimal medium supplemented with maltose, glucose, starch, galactose, cellobiose, lactose and C-Pur (Cerestar 01908) (FIG. 10). The latter is a sugar-containing product from the degradation of starch mainly consisting of maltose and maltotriose. All carbon sources were supplemented in equivalent C-molar amounts. The only exception was starch: due to its low solubility, a 4% (w/v) opalescent solution of soluble starch from Acros Organics was prepared. For comparison, a maltose minimal medium with a reduced amount of maltose was prepared (here: 44.40 g.Math.L.sup.−1 maltose monohydrate), in which the C-molarity should approximate the one in the starch medium.

    (263) For most tested carbon sources, the transcription of the cgt gene was similar or just slightly and not significantly reduced compared to a maltose grown culture (FIG. 11A). Differential transcription was observed for galactose to a minor extent (3.4-fold less transcribed, log 2 (fold-change)=0.291). A significant reduction of cgt transcript was measured for the carbon sources glucose (142-fold less transcribed, log 2 (fold-change)=0.007) and lactose (62-fold less transcribed, log 2 (fold-change)=0.016). When cells were grown on maltose minimal medium with a reduced amount of maltose (here: 44.4 g.Math.L.sup.−1 instead of 72.06 g.Math.L.sup.−1), a 2.9-fold decreased transcription of the cgt gene was observed (log 2 (fold-change)=0.345) (FIG. 11B).
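The numbers given in parentheses above can be related to the stated fold reductions by simple arithmetic. The sketch below rests on one reading of the figures, not on a statement in the text: the parenthetical values match the relative transcript amount (the reciprocal of the fold reduction) rather than its base-2 logarithm. Function and variable names are illustrative.

```python
import math


def relative_amount(fold_reduction):
    """Relative transcript amount versus the maltose reference."""
    return 1.0 / fold_reduction


def log2_fold_change(fold_reduction):
    """Base-2 logarithm of the relative transcript amount."""
    return math.log2(relative_amount(fold_reduction))


# Fold reductions as stated in the text.
stated = {"glucose": 142.0, "lactose": 62.0, "reduced maltose": 2.9}
relative = {name: round(relative_amount(f), 3) for name, f in stated.items()}
```

For galactose, the reciprocal of the rounded 3.4-fold value is about 0.294, close to the stated 0.291; the small gap is presumably due to rounding of the fold value.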

    (264) Analysis of Gene Deletion Mutant Δcgt

    (265) Δcgt on Different Carbon Sources or Under Carbon-Limited Conditions

    (266) The differential transcription profile of cgt in dependence of the carbon source indicated a function within sugar metabolism, as has been presumed before (Ortseifen 2016). Ortseifen (2016) suggested that Cgt is responsible for the retention of carbon as energy source in the context of the carbophore model. Growth of the wild type and the CRISPR/Cas9 deletion mutant Δcgt was tested on different carbon sources in liquid culture.

    (267) Beforehand, a pre-screening experiment was performed in the OmniLog Phenotypic Microarray System (Biolog Inc., Hayward, United States of America), which allows fast phenotypic screening by measurement of cellular respiration activity on a total of 190 different carbon sources in multi-well plates. Of these, Actinoplanes displayed respiration on 103 carbon sources. Except for arabinose and lactose, no differential respiration profile was observed for Δcgt on the remaining 101 carbon sources. In order to validate these results on the level of growth, the carbon sources arabinose and lactose were additionally tested in a shake flask cultivation. The standard laboratory sugars maltose and glucose, the complex carbon source starch and the disaccharide cellobiose were also tested, to imitate natural carbon sources of the soil habitat. No growth impairment was observed for Δcgt (FIG. 12 and FIG. 13).

    (268) Furthermore, growth under carbon-limited conditions (here: 1 g.Math.L.sup.−1, 2 g.Math.L.sup.−1, 3 g.Math.L.sup.−1, 4 g.Math.L.sup.−1 and 5 g.Math.L.sup.−1 starch) was tested in the RoboLector system of m2p-labs. No growth disadvantages for the Δcgt mutant were observed under carbon source limitation compared to the wild type (FIG. 14).

    (269) Cgt has No Impact on Osmolarity- or pH-Tolerance

    (270) Cgt has been proposed to form surface layers through multimerization (Wendler et al. 2015a). This might suggest a potential role in protection against environmental changes, such as drought, pH and osmolarity.

    (271) A pH screening was performed on solid media as well as in liquid culture in the RoboLector system. For screening on solid media, SFM agar plates of pH ranging from pH 4 to 11 (in steps of 1) were prepared and droplets of a dilution series of spores of the wild type and the deletion mutant Δcgt were applied. Both mutant and wild type were able to grow from pH 5 to 11. No differences in growth or spore formation on agar plates were observed.

    (272) Since an effect on drought tolerance is difficult to assess, the inventors analyzed the colony and spore formation on the surface of the bacterial lawn and found no differences between the wild type and Δcgt.

    (273) For pH screening in liquid culture, maltose minimal medium of pH ranging from 4 to 7 was prepared. Higher pH values could not be tested in liquid culture, as medium components tend to precipitate. Both strains grew from pH 4.5 to 7 (FIG. 15). Regarding the final cell dry weights, no differences were observed.

    (274) For osmolarity screening, maltose minimal medium was prepared with different concentrations of maltose ranging from 3.6 to 108.1 g.Math.L.sup.−1 maltose monohydrate and osmolarity ranging from 323.5 to 681.0 mOsmol.Math.kg.sup.−1 (Table E10). No significant growth differences were observed between the wild type and the deletion mutant Δcgt (FIG. 16).

    (275) Inositol was also tested as osmolyte, since it is not consumed by Actinoplanes. Here, osmolarity ranged from 388.5 to 695.0 mOsmol.Math.kg.sup.−1, but no growth differences were observed (FIG. 17).

    (276) Lower osmolarities between 159 and 190 mOsmol.Math.kg.sup.−1 were tested using the complex medium NBS (FIG. 19, Table E10). Again, no significant differences in growth were observed between the wild type strain and the deletion mutant Δcgt.

    (277) TABLE-US-00012 TABLE E10 Tabular summary of screening experiments. Final cell dry weights, final acarbose concentrations, pH and osmolarity of different minimal media used in this work for screening of osmolarity and pH. Different osmolarities in the media used for pH screening are caused by addition of correcting agents.
    Columns, in order: maltose × 1H.sub.2O (g.Math.L.sup.−1); glucose × 1H.sub.2O (g.Math.L.sup.−1); inositol (mM); osmolarity (mOsmol.Math.kg.sup.−1); pH; final cell dry weight (g.Math.L.sup.−1) of wild type and Δcgt; final acarbose concentration (g.Math.L.sup.−1) of wild type and Δcgt; significance (p-value). Empty cells are omitted.
    pH Screening
    72.06   629.0   4.0   1.17 ± 0.95   2.09 ± 0.26   not detectable
    72.06   630.0   4.5   5.50 ± 0.66   6.94 ± 1.07   0.27 ± 0.18   0.38 ± 0.05   *0.02216
    72.06   606.0   5.0   9.67 ± 1.28   11.34 ± 0.80   0.47 ± 0.003   0.65 ± 0.03   *0.00136
    72.06   587.0   5.5   11.04 ± 0.71   12.13 ± 0.42   0.66 ± 0.03   0.80 ± 0.02   *0.00124
    72.06   569.0   6.0   12.67 ± 0.38   12.97 ± 0.47   0.97 ± 0.17   0.89 ± 0.01
    72.06   569.0   6.5   13.79 ± 1.84   12.83 ± 0.76   1.14 ± 0.13   1.06 ± 0.04
    72.06   563.0   7.0   13.86 ± 0.57   12.50 ± 1.53   0.94 ± 0.10   0.85 ± 0.06
    Osmo-Screening 1
    3.6   323.5   6.5   1.75 ± 0.38   1.97 ± 0.28   not detectable
    14.41   361.5   6.4   6.21 ± 0.19   6.25 ± 0.10   not detectable
    36.03   420.5   6.4   13.29 ± 0.26   12.66 ± 0.41   0.33 ± 0.04   0.39 ± 0.08
    57.65   485.0   6.4   14.00 ± 0.45   14.19 ± 0.30   0.49 ± 0.10   0.61 ± 0.08
    72.06   531.0   6.4   14.83 ± 0.47   15.17 ± 0.38   0.58 ± 0.05   0.65 ± 0.06
    86.47   605.0   6.4   14.79 ± 0.63   16.91 ± 0.77   0.67 ± 0.04   0.69 ± 0.01
    108.09   681.0   6.3   16.21 ± 1.20   17.91 ± 0.33   0.56 ± 0.03   0.66 ± 0.02   *0.01432
    Osmo-
    20   0   388.5   6.4   11.88 ± 0.22   12.25 ± 0.25   not detectable
    20   1   406.5   6.4   12.83 ± 0.47   12.75 ± 0.98   not detectable
    20   80   469.5   6.4   12.75 ± 0.45   12.06 ± 0.95   not detectable
    20   140   550.0   6.4   15.13 ± 0.25   13.16 ± 0.75   not detectable
    20   180   580.0   6.4   14.63 ± 0.45   14.03 ± 1.15   not detectable
    20   220   610.3   6.4   14.79 ± 0.44   13.13 ± 1.76   not detectable
    20   280   695.0   6.3   15.33 ± 0.51   14.00 ± 1.45   not detectable
    NBS-   0   10   0   190   6.9   4.35 ± 0.22   3.93 ± 0.47   acarbose is not main component
    NBS-   11   0   159   6.9   4.51 ± 0.22   3.87 ± 0.27   0.38 ± 0.06   0.42 ± 0.04   *0.006778
    Δcgt Displays Improved Acarbose Formation on Maltose Minimal Medium

    (278) Although no distinct growth phenotype could be observed under the tested conditions, lack of the highly expressed Cgt protein seems to save metabolic resources of the cell, such as ATP and amino acids. These might be used for cellular growth or other anabolic processes. In the experiments, Δcgt did not display significant growth advantages. However, remarkably higher final acarbose concentrations were detected for the deletion mutant Δcgt compared to the wild type (Table E10). For the cultivation in complex medium, this was most striking during the growth phase (FIG. 18).

    (279) The improved acarbose-producing phenotype was validated by three independent shake flask cultivations in maltose minimal medium (FIG. 19 and Table E11). Quantification of acarbose from the supernatant displayed an enhanced acarbose yield coefficient of the deletion mutant compared to the wild type. The differences in the final acarbose yields were significant (tested by a two-sided t-test, p-value=0.04608). In Δcgt, the final acarbose concentration was increased by 8.3 to 16.6% (cf. Table E11).

    (280) TABLE-US-00013 TABLE E11 Tabular summary of growth experiments under acarbose producing conditions. Final acarbose concentrations, final cell dry weights and the number of replicates n of three independent cultivations of Δcgt and the wild type in maltose minimal medium.
                                            No. 1              No. 2              No. 3
    acarbose (g.Math.L.sup.−1)        WT    0.76 (+/− 0.07)    0.87 (+/− 0.04)    0.69 (+/− 0.02)
                                      n     5                  4                  3
                                      Δcgt  0.88 (+/− 0.05)    0.99 (+/− 0.01)    0.74 (+/− 0.03)
                                      n     6                  4                  3
                                      %     116.6              114.2              108.3
    cell dry weight (g.Math.L.sup.−1) WT    14.72 (+/− 1.71)   15.49 (+/− 0.65)   15.30 (+/− 1.25)
                                      n     7                  5                  5
                                      Δcgt  16.12 (+/− 0.89)   14.99 (+/− 0.94)   13.85 (+/− 1.46)
                                      n     6                  5                  5
                                      %     109.5              96.8               90.5
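The percentage rows of Table E11 express the mutant mean relative to the wild type mean of the same cultivation. A minimal sketch of that calculation, using the tabulated cell dry weights, is given below; the function name is illustrative.

```python
def percent_of_wild_type(mutant_mean, wild_type_mean):
    """Mutant value expressed as a percentage of the wild type value."""
    return round(100.0 * mutant_mean / wild_type_mean, 1)


# Final cell dry weights (g/L) of cultivations No. 1 to No. 3 from Table E11.
cdw_wild_type = [14.72, 15.49, 15.30]
cdw_mutant = [16.12, 14.99, 13.85]

cdw_percent = [
    percent_of_wild_type(m, w) for m, w in zip(cdw_mutant, cdw_wild_type)
]
```

Applied to the tabulated acarbose means, the same calculation gives values slightly below the listed percentages (for example about 115.8 instead of 116.6 for cultivation No. 1), presumably because the listed percentages were derived from unrounded means.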
    Cgt has No Impact on the Expression of Acarbose Biosynthesis Genes

    (281) The finding that the deletion of the highly expressed gene cgt has no negative impact on growth or viability of the organism under various conditions, but results in an enhanced acarbose producing phenotype, was surprising. For this reason, and to rule out a direct impact on the regulation of acarbose biosynthesis (acb) genes, RT-qPCR of representative acb genes was performed. For this, wild type and Δcgt were grown on maltose minimal medium and RNA was isolated from samples of the early growth phase. The relative transcript amounts of the acarbose biosynthesis cluster genes acbZ, acbW, acbV, acbA, acbB, acbD and acbE were calculated for Δcgt in comparison to the wild type (FIG. 20). The gene acbV is the first of several polycistronically transcribed genes within the main operon of the acarbose biosynthesis gene cluster (Wolf et al. 2017b). The monocistronically transcribed genes acbD and acbE, encoding proteins of the extracellular acarbose metabolism, have been shown to be strongly regulated by the acarbose regulator AcrC (Wolf et al. 2017a). The genes acbA, acbB and acbZ are also monocistronically transcribed and are annotated as enzymes of the acarbose biosynthesis (acbAB) and its extracellular metabolism (acbZ), respectively. The gene acbW is the first gene of the acbWXY operon, putatively encoding an ABC transporter. For all selected transcripts, no significant change in relative transcript levels was measured in the deletion mutant Δcgt compared to the wild type (FIG. 20).

    Discussion

    (282) The connection of carbohydrate metabolism and acarbose biosynthesis is of high interest. Recent research has pointed out the importance of carbon utilization in the context of the biosynthesis of acarbose and further acarviosyl metabolites in the wild type (Wendler et al. 2014).

    (283) In this context, the starch binding protein Cgt is striking. Its gene is one of the most strongly expressed genes in Actinoplanes sp. SE50/110 (Schwientek et al. 2013), and the protein makes up about 8% of the whole secreted proteome (unpublished data of the inventors). The gene product is exported into the extracellular space (Wendler et al. 2013). Excess production and export mean high costs for the cell: for the translational process alone, 4 ATP are required per peptide bond (Campbell and Reece 2011; Purves 2006), i.e. not including additional costs for RNA synthesis, amino acid production, protein folding and export. The inventors therefore concluded that Cgt has a significant role in Actinoplanes sp. SE50/110 physiology. Two different functions of Cgt are proposed and analyzed herein: a role within the sugar metabolism and a role as surface protein. Due to the starch binding domain, Ortseifen (2016) suggested that Cgt might be involved in binding and retention of energy sources in the context of the carbophore model (Wehmeier 2003). Evidence was also given here by RT-qPCR, which displayed differential expression of the gene cgt in glucose-, galactose- and lactose-grown cultures compared to cultures grown on maltose, higher maltodextrins and cellobiose. This is in accordance with differential proteome analyses on the carbon sources maltose and glucose (Wendler et al. 2015a; Wendler et al. 2015b). These results indicate a carbon-dependent expression of cgt. It would be exciting to elucidate the regulatory mechanism. However, it remains to be considered that over 900 genes are putatively involved in transcriptional regulation in Actinoplanes sp. SE50/110, of which 697 are annotated as transcriptional regulators according to the annotation of Wolf et al. (2017b) (GenBank: LT827010.1).

    (284) A sugar-dependent expression of cgt might indicate a function within the utilization of maltose, higher maltodextrins and, potentially, also cellobiose. However, our studies of the deletion mutant Δcgt have not unveiled phenotypical differences regarding the carbon utilization. This was tested for a total of 105 different carbon sources, of which 103 were analyzed in the OmniLog screening system and six in liquid culture.

    (285) As the function of Cgt might be negligible under excess of carbon source but indispensable when growing under conditions with limited carbon source, the inventors have tested growth of the deletion mutant Δcgt and the wild type on minimal medium with low concentrations of starch. Starch was chosen as carbon source because of the starch binding activity of Cgt, which was confirmed here in a starch binding assay. Nevertheless, no growth phenotype of the mutant could be observed under limited carbon source conditions.

    (286) Another function within the sugar metabolism could consist in binding of insoluble crystalline substrates, which might lead to structural changes that increase substrate accessibility and enhance the activity of other hydrolyzing enzymes like amylases. Such mechanisms have already been described in the soil bacteria Serratia marcescens for chitinolysis (Vaaje-Kolstad et al. 2005) and Thermobifida fusca for cellulolysis (Moser et al. 2008). In the genome of Actinoplanes sp. SE50/110, several genes are encoded with putative α-glycosidic function, of which three, the α-amylases/pullulanases AcbE, AcbZ and PulA, were shown to accumulate in the extracellular space (Wendler et al. 2015a). Additionally, another small extracellular protein of unknown function and starch binding capability (ACSP50_6253) was identified in a starch binding assay. By heterologous expression of extracellular amylases and enzyme assays in presence and absence of both Cgt and ACSP50_6253, a supporting function during starch degradation might be detected in future experiments.

    (287) Apart from the sugar metabolism, a function as surface layer protein is also conceivable, which is supported by the fact that Cgt forms multimers (Ortseifen 2016; Wendler et al. 2013). Wendler et al. (2015a) identified two transmembrane domains in the Cgt protein, of which one is involved in translocation by the Sec pathway as part of the leader peptide and the second is assumed to be required for multimerization. Although Cgt is not likely to be physically anchored in the membrane (Wendler et al. 2015a), Cgt proteins may remain as multimers in the mesh of the mycelium due to the reduced fluid flow. In this context, the starch binding domain might also serve as an anchor.

    (288) In the role as putative surface protein, the inventors initially assumed a protective function in the context of pH stress, osmolyte stress or drought. However, the screening experiments showed that the deletion of the cgt gene did not lead to significant growth inhibition at different pH values in liquid culture. The screening experiments on solid media likewise gave no indication that Cgt might have a protective function in case of pH stress or drought.

    (289) Hints for a putative function in the context of osmoregulation were given by reverse transcription quantitative PCR of the wild type grown on different amounts of maltose. Here, the inventors observed a 2.9-fold reduced transcription of the gene cgt when growing on 44.4 g·L⁻¹ maltose compared to 72 g·L⁻¹, which might be an effect of osmolarity. The inventors analyzed growth of the deletion mutant Δcgt in several screening experiments in liquid culture with media ranging from 159 to 681 mOsmol·kg⁻¹. Under all tested conditions, no differences in growth and viability were observed for the deletion mutant Δcgt compared to the wild type.

    (290) As, surprisingly, no apparent physiological impact was observed upon deletion of the cgt gene, neither in the utilization of different carbon sources in excess or in limitation nor under different pH or osmolyte conditions, it is possible that the function of Cgt only becomes apparent in its natural environment and in possible competition with other soil organisms. Interestingly, the inventors found similar independent singular CBM-20 domain proteins in 17 other prokaryotic species, most of which belong to the order Actinomycetales. Although rare, this at least displays a certain distribution and shows that Cgt is not a strain-specific protein. Most of the species harboring single-domain CBM-20 proteins were associated with soil habitats. Together with the fact that cgt is highly expressed in Actinoplanes sp. SE50/110, this supports the hypothesis that proteins like Cgt fulfill a crucial function in bacteria living within this habitat. A function of Cgt could be tested in future by co-cultivations in direct contact with other microbial competitors.

    (291) While it was surprising that Cgt turned out to be dispensable under the tested laboratory conditions, the inventors observed a positive phenotype regarding acarbose production. An increase of the acarbose yield between 8.3 and 16.6% was achieved by deletion of cgt. Although the final product yields differ slightly between batch cultivations, the Δcgt mutant always performed significantly better. This was shown over a time period of several months (data not shown) in three independent shake flask cultivations and several micro-scale cultivations performed in maltose minimal medium. Thus, the improved production was robust over long time periods and in different cultivation settings.

    (292) The inventors assume that this is due to the metabolic burden of cgt expression in the wild type, the removal of which relieves energy and frees resources in Δcgt. These resources are probably redirected to the acarbose biosynthesis, since acarbose is a growth-associated product. A direct regulatory effect of the cgt deletion on the expression of the acb genes was not observed.

    (293) Analysis of the Functional Relevance of Carotenoid Formation

    (294) Light-Dependent Carotenoid Formation and Oxidative Stress Reduce Acarbose Production in Actinoplanes sp. SE50/110

    (295) Actinoplanes are known to produce a variety of soluble pigments including yellow, orange and pink pigments of the class carotenoids (Parenti and Coronelli 1979). The pigment of Actinoplanes sp. SE50/110 is orange. Its formation is intensified when cultures are exposed to light. Since the pigment was likewise found in the supernatant, it seems to be soluble in aqueous solutions. After cell extraction and separation by thin layer chromatography, spectral analysis displays absorption maxima at 450, 475 and 505-510 nm, which was confirmed by an absorbance scan performed during HPLC separation. Consistent with these findings, in silico reconstruction shows that Actinoplanes sp. SE50/110 has the full genetic equipment to produce a C40-carotenoid with similarity to sioxanthin from Salinispora tropica CNB-440 (Richter et al. 2015; Wolf et al. 2017b) (FIG. 21 and Table E12).

    (296) TABLE-US-00014 TABLE E12 Reconstruction of the carotenoid synthesis in Actinoplanes sp. SE50/110. Two terpene synthesis gene clusters were identified by antiSMASH analysis (Blin et al. 2017; Weber et al. 2015), which could be assigned to the formation of a C40-carotenoid with similarity to the sioxanthin gene cluster from Salinispora tropica CNB-440 (Richter et al. 2015) (terpene cluster 1-2). Furthermore, a camphene-like monoterpene gene cluster (terpene cluster 3), all genes of the MEP/DOXP-pathway and a gene coding for the degradation of lycopene were identified by BLASTP analysis (Altschul et al. 2005) and KEGG (Kanehisa et al. 2014).

    locus tag | name | annotation | homol. gene in S. tropica CNB-440 | identity / positives

    genes of MEP/DOXP pathway
    ACSP50_7096 | dxs | 1-deoxy-D-xylulose-5-phosphate synthase | - | -
    ACSP50_7248 | ispG | 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase | - | -
    ACSP50_7250 | dxr | 1-deoxy-D-xylulose-5-phosphate reductoisomerase | - | -
    ACSP50_7707 | ispH | 4-hydroxy-3-methylbut-2-enyl diphosphate reductase | - | -
    ACSP50_7802 | ispE | 4-(cytidine 5'-diphospho)-2-C-methyl-D-erythritol kinase | - | -
    ACSP50_8046 | ispF | 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase | - | -
    ACSP50_8047 | ispD | 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase | - | -

    genes proposed for the biosynthesis of a glycosylated C40-carotenoid with similarity to sioxanthin
    terpene cluster 1
    ACSP50_0145 | merR | MerR-/HTH-transcriptional regulator | merR (STROP_4437) | 55% / 65%
    ACSP50_0146 | idi | isopentenyl-diphosphate delta-isomerase | idi (STROP_4438) | 62% / 66%
    ACSP50_0147 | crtI | zeta-phytoene desaturase | crtI (STROP_4439) | 76% / 84%
    ACSP50_0148 | crtE/IdsA | polyprenyl synthetase | crtE (STROP_4440) | 60% / 67%
    ACSP50_0149 | crtB | phytoene synthase | crtB (STROP_4441) | 73% / 80%
    ACSP50_0150 | - | deoxyribodipyrimidine photo-lyase | not found | -
    ACSP50_0151 | - | pyridine nucleotide-disulfide oxidoreductase | STROP_4442 | 58% / 70%
    terpene cluster 2a
    ACSP50_1631 | - | transcriptional regulator | STROP_3252 | 70% / 83%
    ACSP50_1632 | - | lycopene cyclase | not found | -
    ACSP50_1633 | - | lycopene cyclase | not found | -
    ACSP50_1634 | fps2/crtE | polyprenyl synthetase (farnesyl pyrophosphate synthetase 2) | crtE (STROP_3251) | 69% / 78%
    ACSP50_1635 | - | methylenetetrahydrofolate reductase (NADPH) | STROP_3250 | 77% / 87%
    terpene cluster 2b
    ACSP50_1650 | - | LysR-family transcriptional regulator | STROP_1711 | 58% / 70%
    ACSP50_1651 | - | methyltransferase type 11 | not found | -
    ACSP50_1652 | - | CDP-alcoholphosphatidyltransferase pgsA | STROP_3594 | 27% / 39%
    ACSP50_1653 | crtD | zeta-phytoene desaturase (crtI-family) | crtD (STROP_3248) | 66% / 73%
    ACSP50_1654 | cruC | glycosyl transferase | cruC (STROP_3247) | 63% / 71%
    ACSP50_1655 | cruF | hypothetical protein (put. membrane prot.) | cruF (STROP_3246) | 64% / 73%
    ACSP50_1656 | - | GCN5 family acetyltransferase | STROP_3245 | 75% / 82%
    ACSP50_1657 | - | monooxygenase | crtA (STROP_3244) | 65% / 61%
    ACSP50_1658 | - | short-chain dehydrogenase | STROP_0722 | 53% / 64%
    ACSP50_3873 | crtE-hom. | polyprenyl synthetase | crtE (STROP_3251) | 54% / 65%

    put. genes for camphene-like monoterpene biosynthesis
    terpene cluster 3
    ACSP50_1949 | eshA | transcriptional regulator (Crp/Fnr family) | - | -
    ACSP50_1950 | - | camphene synthase | - | -
    ACSP50_1951 | - | methyltransferase (SAM-dependent) type 11 | - | -
    ACSP50_1952 | - | glycosyl-hydrolase | - | -
    ACSP50_1953 | - | oxidoreductase/aldo/ketoreductase | - | -

    degradation of lycopene
    ACSP50_5522 | ccd | carotenoid oxygenase/carotenoid cleavage dioxygenase (RPE65 superfamily) | - | -

    (297) The genes of the C40-carotenoid biosynthesis are organized in three gene clusters: terpene cluster 1, 2a and 2b (cf. FIG. 21D).

    (298) In contrast to S. tropica, homologues of crtY and crtU, encoding a cyclase and a desaturase, could not be identified in Actinoplanes sp. SE50/110 (Wolf et al. 2017b). Instead, two cyclases of the CarR-domain superfamily were found in this work. They are localized in the terpene cluster 2b (FIG. 21). CarR-domain cyclases are common in fungal, archaeal and bacterial genomes (information taken from the CDD search of the NCBI (Marchler-Bauer et al. 2017)). Since the pigment of SE50/110 is orange-colored, a terminal cyclization of the red-colored precursor lycopene is highly likely and might be catalyzed by one or both CarR-domain cyclases. Similar to S. tropica, the carotenoid gene cluster of SE50/110 contains a glycosyltransferase CruC (FIG. 21, Table E12). This strongly indicates a glycosylated carotenoid, which is in accordance with the observation that the pigment seems to have polar characteristics, since it was found in the supernatant (FIG. 22B).

    (299) Comparative genome analysis with the software platform EDGAR 2.0 (Blom et al. 2016) displays similar terpene cluster arrangements in related species of the genus Actinoplanes, whereas a different organization was found in Streptomyces (data not shown). Accordingly, the gene arrangements found in SE50/110 and CNB-440 (Richter et al. 2015; Wolf et al. 2017b) seem to be characteristic for the family Micromonosporaceae.

    (300) In addition, genes for the synthesis of the building blocks IPP and DMAPP via the MEP/DOXP-pathway (Table E12), a gene coding for a camphene-like monoterpene synthase (terpene cluster 3, Table E12) as well as a carotenoid cleavage dioxygenase (ACSP50_5522, Table E12) were found in the genome of SE50/110. The latter two might be involved in the formation of odorous substances (Yamada et al. 2015). The inventors observed that strong pigmentation was associated with production losses. This was confirmed by comparing growth and acarbose yields of cultures exposed to and shielded from light (FIG. 22). While carotenoid formation was induced, acarbose production and growth of Actinoplanes sp. SE50/110 were strongly reduced when exposed to bulb light (36 W, Osram 830U) with an intensity of 22-44 µE (1 µE = 1 µmol photons m⁻² s⁻¹). In total, a loss of 39% of the final acarbose concentration was monitored.

    (301) Deletion of merR in SE50/110 Induces Carotenoid Formation without Exposure to Light

    (302) Since natural or bulb light was able to induce carotenoid formation (FIG. 22B, C), this study searched for possible regulatory genes in SE50/110. A MerR-regulator was found within terpene cluster 1 (ACSP50_0145, FIG. 23).

    (303) The MerR-family mainly consists of activators, which are able to respond to environmental stimuli like oxidative stress, heavy metals or antibiotics (Brown et al. 2003). Indeed, several members of the MerR-family have been described as light-dependent activators or repressors of the carotenoid biosynthesis in non-photosynthetic bacteria, e.g. LitR in the related actinomycete S. coelicolor (Takano et al. 2005; Takano et al. 2006), in the Gram-negative Thermus thermophilus HB27 (Takano et al. 2011) and in the Gram-positive Bacillus megaterium QM B1551 (Takano et al. 2015). Here, cobalamin (vitamin B12) acts as a cofactor that mediates light sensitivity, since it is able to absorb ultraviolet and blue light: by either binding covalently to the regulator or dissociating after light excitation, it is able to modulate the conformation and activity of the regulator (van der Horst et al. 2007). The mechanisms of regulation and the binding sites are quite different: whereas in T. thermophilus and B. megaterium the promoter regions of litR/crtB (Takano et al. 2011) or litR and crtI (Takano et al. 2015) are repressed in the dark and relieved after illumination, LitR in S. coelicolor seems to be an essential light-induced transcriptional activator of the adjacently located litS, which encodes an ECF sigma factor and directs the transcription of the carotenoid biosynthesis genes (Takano et al. 2005). A gene encoding an ECF sigma factor does not occur within the gene cluster of SE50/110. In the Gram-negative bacterium Myxococcus xanthus, a B12-dependent MerR regulator is part of a complex regulatory cascade including eight further regulatory genes (Fontes et al. 2003; Galbis-Martínez et al. 2012). However, no homologues of the regulatory network from M. xanthus were identified in the genome of SE50/110 by BLASTP-analysis (data not shown).

    (304) The MerR-family regulator ACSP50_0145 of Actinoplanes sp. SE50/110 contains an N-terminal HTH-motif and a C-terminal B12-binding domain (according to BLASTP-analysis and CDD-search (Marchler-Bauer et al. 2015; Marchler-Bauer et al. 2010; Altschul et al. 2005)). The position of the HTH-domain suggests a function as transcriptional repressor (Pérez-Rueda and Collado-Vides 2000).

    (305) By CRISPR/Cas9 deletion of the corresponding gene in SE50/110, the carotenoid formation was strongly induced without exposure to light (FIG. 24B, C). This confirms a function as transcriptional repressor.

    (306) However, it has to be noted that the repressor/operator system is leaky, since the typical orange color is also produced in the wild type without exposure to light. Consistent with this, the transcription of the genes crtEBI and idi (ACSP50_0146-0149) was only doubled in ΔmerR compared to the wild type under dark conditions (FIG. 24E). These differences were significant for crtE, crtB and idi. No effects on the transcription of acb genes were observed.

    (307) In the context of this work, the question was examined whether pigment formation in ΔmerR influences the formation of the fine chemical acarbose. Again, higher carotenoid formation was associated with lower acarbose formation (FIG. 24A, D). When illuminated, both the wild type and ΔmerR are strongly pigmented, and the final acarbose concentrations were similar for both strains, reaching approx. 0.52 g·L⁻¹ (FIG. 24B, D). This corresponds to a reduction of acarbose formation of approx. 38% compared to the wild type under dark conditions (reaching 0.83 g·L⁻¹). This is in accordance with the previous growth experiments of the wild type as described herein.

    (308) Under dark conditions, ΔmerR produces approximately 15% less acarbose (0.70 g·L⁻¹) than the wild type (FIG. 24D). It is suggested that these production losses are due to the waste of resources by carotenoid formation in the deletion mutant (FIG. 24C). In conclusion, the production losses under light conditions (38-39%) might result from further light-induced stress in both the deletion mutant and the wild type.
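
    The relative losses quoted above can be recomputed from the reported final concentrations. This is a minimal sketch with illustrative variable names; because the published concentrations are rounded, the recomputed values (about 37% and 16%) only approximate the reported 38% and 15%.

```python
def percent_reduction(reference: float, value: float) -> float:
    """Return the reduction of `value` relative to `reference`, in percent."""
    if reference <= 0:
        raise ValueError("reference must be positive")
    return (reference - value) / reference * 100.0


# Final acarbose concentrations in g/L as reported in the text (rounded values).
WILD_TYPE_DARK = 0.83
ILLUMINATED = 0.52      # wild type and deletion mutant, both illuminated
MERR_MUTANT_DARK = 0.70

loss_light = percent_reduction(WILD_TYPE_DARK, ILLUMINATED)
loss_mutant_dark = percent_reduction(WILD_TYPE_DARK, MERR_MUTANT_DARK)

if __name__ == "__main__":
    print(f"loss under illumination: {loss_light:.1f} %")
    print(f"loss in the merR deletion mutant (dark): {loss_mutant_dark:.1f} %")
```

    The gap between the two figures is the share of the loss that cannot be explained by derepressed carotenoid formation alone, which is the basis for attributing the remainder to further light-induced stress.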

    (309) Comparative transcriptome analyses of the wild type cultivated under dark and light conditions using the microarray technique display a complex response on transcript level affecting various genes (cf. FIG. 25). Several of the differentially expressed genes indicate a cellular response to combat oxidative stress. Oxidative stress is caused by reactive oxygen species (ROS), which are formed by energy transfer (leading to singlet oxygen) or electron transfer (leading to superoxide, hydrogen peroxide and hydroxyl radicals) (Ziegelhoffer and Donohue 2009). At high concentrations, ROS are toxic and cause protein and membrane oxidation and DNA damage (Ziegelhoffer and Donohue 2009; Gout 2019).

    (310) In SE50/110, the tyrosinase MelC (ACSP50_4950, previously: ACPL_5017), a photo-protector involved in the formation of the brown pigment eumelanin (Wolf et al. 2016), and genes of the riboflavin biosynthesis (ACSP50_6437-40) are more strongly transcribed when exposed to light (FIG. 25). Riboflavin is a water-soluble photo-oxidative sensitizer absorbing at 374 and 445 nm (Silva et al. 1999; Kim et al. 1993). It is the precursor of flavin mononucleotide (FMN) and flavin adenine dinucleotide (FAD). These are cofactors of proteins that are involved in cellular redox metabolism, light-sensing, DNA repair and further functions (reviewed in García-Angulo (2017)). Thus, riboflavin and its derivatives are important micro-nutrients that enable the cells to overcome oxidative stress (Chen et al. 2013).

    (311) Consistent with this, several flavin-dependent oxygenases are also more strongly transcribed when exposed to light. One of them is annotated as taurine dioxygenase, whose substrate is a degradation product of cysteine. Sulfur-containing amino acids like cysteine belong to the group of low molecular weight thiols (LMW thiols), which are able to scavenge ROS and function as redox buffers (Gout 2019). Correspondingly, further genes probably involved in cysteine and methionine metabolism and transport are more strongly transcribed in cells exposed to light. Remarkably, several transcriptional regulator genes and a gene encoding the sigma factor SigE (ACSP50_0558) are more strongly transcribed, too (FIG. 25). SigE was associated with the oxidative stress response in the photosynthetic bacterium Rhodobacter sphaeroides (reviewed in Ziegelhoffer and Donohue (2009)) and with the envelope stress response in the related species S. coelicolor (Hutchings et al. 2006) and C. glutamicum (Park et al. 2008). It is therefore possible that SigE is involved in the oxidative stress response in SE50/110.

    (312) Interestingly, genes of the carotenoid biosynthesis and of the regulator MerR are not significantly more strongly transcribed in the wild type exposed to light compared to the wild type shielded from light. This is noteworthy, since a clear effect of light on carotenoid formation can be observed in the wild type. Since the carotenoid synthesis takes place both in the dark and in the light and the enhancement of relative transcript amounts is quite moderate in the regulator mutant (see above), the effects on transcript level might be inconspicuous. It is assumed that further regulation of carotenoid synthesis on protein level or metabolome level might exist, e.g. by degradation of carotenoids or terpenoid precursors by the carotenoid cleavage dioxygenase (ACSP50_5522). However, according to the results obtained from the microarray of the wild type, the crt gene expression does not seem to be a primary target of the global oxidative stress response, similar to findings from Rhodobacter sphaeroides (reviewed in Ziegelhoffer and Donohue (2009)).

    (313) Taken together, illumination triggers an oxidative stress response and seems to have an important impact on the distribution of metabolic resources towards growth, carotenoid and acarbose formation. The regulation of carotenoid biosynthesis seems to be decoupled from the global response to oxidative stress, which needs further investigation. With a view to directing the metabolic fluxes towards the production of acarbose, it is desirable to gain a better understanding of these processes in future. The sigma factor SigE might be responsible for the oxidative stress response, since it is more highly transcribed when exposed to light.

    (314) Apart from light stress, this work demonstrates that a large portion of the production losses can be directly assigned to carotenoid formation. Carotenoids of non-photosynthetic bacteria are assumed to function as photo-protectors (Lee and Schmidt-Dannert 2002), since they have been shown to protect from photodynamic killing (Mathews and Sistrom 1959). As the influence of light can be excluded by simple structural measures, carotenoid formation is assumed to be dispensable under laboratory conditions. In order to improve acarbose production, switching off the competing carotenoid biosynthesis pathway, e.g. by deletion of the central gene crtI, can be used for strain development. Since carotenoids influence the fluidity of membranes (Gruszecki and Strzałka 2005), lack of the C40-carotenoid can also affect the surface and mycelial structure of Actinoplanes sp. SE50/110. With regard to production, a break-up of mycelial lumps is advantageous to increase the mycelial surface and the number of biochemically available cells.

    (315) Overexpression of acbB and gtaB

    (316) Expression vector pSETT4 was tested for the genes acbB and gtaB. Both genes are probably involved in the amino sugar synthesis, a feeding branch of acarbose biosynthesis: AcbB catalyzes the dehydration of dTDP-D-glucose to dTDP-4-keto-6-deoxy-D-glucose, and GtaB is assumed to be involved in the supply of the precursor glucose-1P. Interestingly, both proteins display increased protein amounts in the cytosol of acarbose producers.

    (317) pSETT4gap and pSETT4tip Vectors for Overexpression of Single Genes

    (318) A novel cloning system was implemented that allows easy cloning and overexpression of single genes in Actinoplanes strains such as Actinoplanes sp. SE50/110. For this, the strong promoter of the gene gapDH from Eggerthella lenta was cloned in front of a lacZ-cassette in a pSET152-backbone. The gene lacZ is transcribed under control of the lac-promoter and flanked by recognition sites of the restriction enzyme BsaI, which enables exchange of lacZ by the gene of interest by Gibson Assembly (Gibson et al. 2009), restriction/ligation cloning or Golden Gate cloning (Engler et al. 2008). As strong expression requires strong termination, T4-terminators were introduced before and after the cloning site of the novel expression system. T4-terminators have already been successfully used in the pGUS-cloning system developed by Myronovskyi et al. (2011). Whole track RNAseq analysis of a pGUS-integration mutant performed herein showed that the T4-terminators block transcription efficiently and prevent read-through from the integrase gene into the gene of interest. As shown by a pre-experiment, T4-terminators do not have any side effects on the transcription of acb genes when introduced into Actinoplanes sp. SE50/110 via pSET152-integration.

    (319) In addition, by sequencing of an enriched primary transcript library derived from the promoter-screening experiment, two putative promoters were identified behind the gene of interest in antisense orientation (FIG. 26). These two pseudo-promoters were removed in the novel expression system in order to prevent antisense transcription. Furthermore, an additional (third) T4-terminator was introduced behind the cloning site in opposite orientation to prevent further putative antisense reads.

    (320) To allow exchange of the promoter sequence, NdeI and KpnI restriction sites were introduced. In this work, the strong gapDH-promoter was exchanged for the medium-strong tipA-promoter from S. lividans. This showed that the system can be easily modified, e.g. to adjust it for other species of the order Actinomycetales. The vectors (named pSETT4gap and pSETT4tip) were tested for strong and medium-strong overexpression of the genes acbB and gtaB.
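
    When a gene of interest or a replacement promoter is prepared for such a vector, it is common practice to check the insert for internal recognition sites of the enzymes used for cloning. The following sketch illustrates such a check for BsaI (GGTCTC), NdeI (CATATG) and KpnI (GGTACC). The function names and the example sequence are hypothetical and are not part of the vector system described above.

```python
# Recognition sequences (5'->3') of the enzymes named in the text.
SITES = {"BsaI": "GGTCTC", "NdeI": "CATATG", "KpnI": "GGTACC"}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_sites(seq: str, motif: str) -> list[int]:
    """Return sorted 0-based top-strand start positions of `motif` on either strand."""
    seq = seq.upper()
    hits = set()
    # A non-palindromic site such as BsaI must also be searched as reverse complement.
    for pattern in {motif.upper(), reverse_complement(motif)}:
        start = seq.find(pattern)
        while start != -1:
            hits.add(start)
            start = seq.find(pattern, start + 1)
    return sorted(hits)


def internal_sites(seq: str) -> dict[str, list[int]]:
    """Map each enzyme name to the positions of its recognition site in `seq`."""
    return {name: find_sites(seq, motif) for name, motif in SITES.items()}


if __name__ == "__main__":
    # Hypothetical insert, constructed for illustration only.
    example_insert = "AAGGTCTCAACATATGTTGAGACCAA"
    for enzyme, positions in internal_sites(example_insert).items():
        print(enzyme, positions)
```

    An insert with an empty position list for the relevant enzyme can be used directly; otherwise the internal site would have to be removed, for example by a silent mutation, or a different cloning method such as Gibson Assembly would be chosen.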

    (321) Medium Overexpression of acbB Leads to Improved Acarbose Formation

    (322) The dTDP-D-glucose-4,6-dehydratase AcbB seems to be involved in the generation of an activated amino sugar from D-glucose-1P, a feeding pathway of the acarbose biosynthesis (FIG. 1). Increased AcbB-activity was found to improve the supply of the modified precursor. In brief, two overexpression mutants were created based on the expression vector pSETT4 described elsewhere herein. In these mutants, acbB is transcribed under control of the medium-strong tipA-promoter or the strong gapDH-promoter. As previously published (Schaffert, et al. 2019), expression vectors using the native promoter did not lead to a significant overexpression of the genes of the acb gene cluster. The native promoter was therefore used as control in both the pSET152- and the pSETT4-vector background. Growth and acarbose formation were monitored in two shake flask cultivations in maltose minimal medium (FIG. 27).

    (323) The mutant with acbB transcribed under control of the heterologous tipA-promoter displayed enhanced acarbose production compared to the control strains: the yield coefficient was increased by 48.6 and 51.9% compared to the empty vector control in two independent cultivations (FIG. 28). With the strong gapDH-promoter, the acarbose yield coefficient was only slightly increased (FIG. 28).

    (324) In pSETT4tip::acbB, the normalized peak areas of phosphorylated glucose/galactose and UDP-glucose were similar or even slightly increased compared to the empty vector control (FIG. 29). Therefore, the supply of activated glucose moieties seems to be ensured. In this mutant, increased amounts of the mass m/z=545 [M-H⁺] were found (FIG. 29, approx. 41%). Without being bound by theory, this intermediate accumulates upon medium AcbB-overexpression, e.g. using pSETT4tip::acbB.

    (325) At the beginning of the growth phase, enhanced expression of acbB was observed in the expected range: the strongest overexpression was achieved by use of the gapDH-promoter (log2(fold-change)=6.54), followed by use of the tipA-promoter (log2(fold-change)=4.06) (FIG. 30). Usage of the native promoter does not lead to a significant increase of relative transcript amounts of acbB. This was tested in both the pSET152- and pSETT4-vector background (FIG. 30). Further genes of the acb gene cluster were not significantly affected, as shown for acbA and acbV (FIG. 30). The only exception is a slightly higher transcript abundance of acbA in pSETT4tip::acbB (log2(fold-change)=1.87).

    (326) Remarkably, the transcription profile in the linear growth phase differs from the early growth phase: here, only a much smaller increase of transcript amounts was reached by use of the gapDH-promoter (log2(fold-change)=2.05), whereas by use of the tipA-promoter the overexpression of acbB was maintained, but to a lesser extent than before (log2(fold-change)=3.33) (FIG. 30, FIG. 31). In the overexpression mutants with heterologous promoters, the relative transcription of acbB (log2(fold-change)) decreases between the two sampling times from 4.06 to 3.33 in pSETT4tip::acbB and from 6.54 to 2.05 in pSETT4gap::acbB. It is assumed that, whereas the transcription of the chromosomal acbB-copy is down-regulated in these mutants, the transcription of the vector copy is maintained by the heterologous promoters. The differences in acbB-transcription at the different sampling times furthermore suggest that the down-regulation of acb gene transcription occurs more strongly or earlier in pSETT4gap::acbB compared to pSETT4tip::acbB. Overexpression of acbB (pSETT4gap::acbB and pSETT4tip::acbB) thus seems to decelerate during the linear growth phase.
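
    For orientation, log2(fold-change) values can be converted to linear ratios. The sketch below assumes that the values quoted above are base-2 logarithms of the transcript ratio relative to the control, as labeled; the function and dictionary names are illustrative only.

```python
def log2fc_to_fold(log2fc: float) -> float:
    """Convert a log2(fold-change) value into a linear fold change."""
    return 2.0 ** log2fc


# log2(fold-change) of acbB as quoted in the text: (early phase, linear phase).
ACBB_LOG2FC = {
    "pSETT4gap::acbB": (6.54, 2.05),
    "pSETT4tip::acbB": (4.06, 3.33),
}

if __name__ == "__main__":
    for strain, (early, linear) in ACBB_LOG2FC.items():
        # Ratio between sampling times: below 1 means overexpression weakened.
        retained = log2fc_to_fold(linear - early)
        print(
            f"{strain}: early {log2fc_to_fold(early):.1f}-fold, "
            f"linear {log2fc_to_fold(linear):.1f}-fold, "
            f"retained fraction {retained:.2f}"
        )
```

    Under this assumption, the gapDH construct loses most of its initial overexpression between the two sampling times, while the tipA construct retains a larger fraction, which matches the qualitative description given above.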

    (327) In summary, medium overexpression of acbB by usage of the tipA-promoter in particular seems to be beneficial for acarbose production, whereas strong overexpression by use of the gapDH-promoter seems to have only a smaller effect on acarbose formation. Further improvement in acarbose formation may be achieved by varying the expression level of acbB, e.g. by using alternative promoters from the promoter screening or by introducing multiple gene copies. This work thus demonstrates that medium overexpression of AcbB increases the acarbose yields, possibly due to improved amino sugar supply: a positive effect on acarbose production was observed, yielding roughly 50% more acarbose in two independent cultivations. Therefore, an improvement of the acarbose biosynthesis by overexpression of single acb genes was achieved.

    (328) Medium Overexpression of gtaB Leads to Improved Acarbose Formation

    (329) GtaB is supposed to catalyze the interconversion of UDP-glucose and glucose-1P. It was surprisingly found that overexpression of GtaB promotes acarbose formation. Without being bound by theory, this may occur by improved supply of the precursor glucose-1P. As shown by a shake flask cultivation in maltose minimal medium (FIG. 32), the final yield coefficient for acarbose of overexpression mutants carrying gtaB in pSETT4tip is increased by 8.56%. Interestingly, the acarbose formation is particularly increased in the late linear to stationary growth phase. In the overexpression mutant, the relative transcript amount of the gene gtaB is increased by a log2(fold-change) of 2.64 (FIG. 33).

    (330) Since the metabolism of activated sugars is connected or redirected to other metabolic pathways, they are not supposed to accumulate. However, as shown in previous experiments, the supply can be seriously disturbed. Analysis of the intracellular metabolome displays similar amounts of phosphorylated hexoses and/or UDP-glucose (FIG. 34). Therefore, the pool of activated C6-sugars is not significantly affected by overexpression of gtaB.

    (331) Interestingly, a significantly decreased amount of the mass m/z=545 [M-H⁺] was found in pSETT4tip::gtaB (decrease of approx. 48%), which might correspond to dTDP-4-keto-6-deoxy-D-glucose, the proposed product of AcbB. This may indicate that the flux through the synthesis branch is more balanced, since the accumulation of this metabolite is reduced in comparison to the empty vector control and the AcbB-overexpression mutants (FIG. 34). Taken together, the introduction of a second gene copy of gtaB has a positive effect on the acarbose production, although the impact of gtaB-overexpression on the distribution of cellular resources remains unclear. Transfer of this construct to producer strains of Actinoplanes can result in an increase of the beneficial effect, as here the demand for the precursor is higher compared to the wild type. Since strong overexpression of AcbB leads to an imbalance in glucose-phosphate metabolism, combined overexpression of acbB and gtaB would plausibly further improve acarbose production beyond the observed effect for the single overexpressions.
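
    The tentative assignment of m/z=545 to dTDP-4-keto-6-deoxy-D-glucose can be checked with a short mass calculation. The sketch assumes the elemental composition C16H24N2O15P2 for the neutral free acid (dTDP-D-glucose, C16H26N2O16P2, minus one water) and detection as the singly deprotonated ion; neither assumption is stated in this disclosure, and the function names are illustrative.

```python
# Monoisotopic masses of the most abundant isotopes (u).
MONOISOTOPIC = {
    "C": 12.0,
    "H": 1.00782503,
    "N": 14.00307401,
    "O": 15.99491462,
    "P": 30.97376200,
}
PROTON_MASS = 1.00727647

# Assumed composition of dTDP-4-keto-6-deoxy-D-glucose (neutral free acid).
DTDP_4_KETO_6_DEOXY_GLUCOSE = {"C": 16, "H": 24, "N": 2, "O": 15, "P": 2}


def monoisotopic_mass(formula: dict[str, int]) -> float:
    """Sum the monoisotopic masses of all atoms in `formula`."""
    return sum(MONOISOTOPIC[element] * count for element, count in formula.items())


def mz_deprotonated(formula: dict[str, int]) -> float:
    """Return m/z of the singly deprotonated ion (neutral mass minus one proton)."""
    return monoisotopic_mass(formula) - PROTON_MASS


if __name__ == "__main__":
    print(f"neutral mass: {monoisotopic_mass(DTDP_4_KETO_6_DEOXY_GLUCOSE):.4f}")
    print(f"m/z after loss of one proton: {mz_deprotonated(DTDP_4_KETO_6_DEOXY_GLUCOSE):.4f}")
```

    Under these assumptions the calculated value rounds to 545, which is consistent with the proposed assignment but does not by itself exclude isobaric metabolites.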