Biosynthesis of human milk oligosaccharides in engineered bacteria
11028419 · 2021-06-08
Assignee
Inventors
- Massimo Merighi (Somerville, MA, US)
- John M. McCoy (Reading, MA)
- Matthew Ian Heidtman (Brighton, MA, US)
Cpc classification
C12N9/00
CHEMISTRY; METALLURGY
C12P19/18
CHEMISTRY; METALLURGY
C07H13/04
CHEMISTRY; METALLURGY
Y02P20/52
GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
C12P19/00
CHEMISTRY; METALLURGY
Y02A50/30
GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
C12Y204/01065
CHEMISTRY; METALLURGY
C12N9/2471
CHEMISTRY; METALLURGY
C12Y306/03018
CHEMISTRY; METALLURGY
C12N15/70
CHEMISTRY; METALLURGY
C12P19/26
CHEMISTRY; METALLURGY
C12Y302/01023
CHEMISTRY; METALLURGY
C07H3/06
CHEMISTRY; METALLURGY
International classification
C12P19/18
CHEMISTRY; METALLURGY
C12P19/26
CHEMISTRY; METALLURGY
C12N15/70
CHEMISTRY; METALLURGY
C07H13/04
CHEMISTRY; METALLURGY
C12N9/00
CHEMISTRY; METALLURGY
C07H3/06
CHEMISTRY; METALLURGY
Abstract
The invention provides compositions and methods for engineering bacteria to produce fucosylated oligosaccharides, and the use thereof in the prevention or treatment of infection.
Claims
1. A method for producing a fucosylated oligosaccharide in an E.coli bacterium, comprising (a) providing an E.coli bacterium comprising a β-galactosidase activity between 0.05 and 5 units and a mutation in a lacA gene, wherein said E.coli bacterium further comprises an exogenous lactose-accepting fucosyltransferase gene; (b) culturing said E.coli bacterium in the presence of lactose; and (c) retrieving a fucosylated oligosaccharide from said E.coli bacterium or from a culture supernatant of said E.coli bacterium, wherein a functional β-galactosidase gene is inserted into an endogenous gene of said E. coli bacterium.
2. The method of claim 1, wherein said mutation prevents the formation of intracellular acetyl-lactose.
3. The method of claim 1, wherein said E.coli bacterium further comprises a defective colanic acid synthesis pathway.
4. The method of claim 1, wherein said defective colanic acid synthesis pathway comprises a mutation in a colanic acid synthesis gene.
5. The method of claim 4, wherein said colanic acid synthesis gene comprises a wcaJ, wzxC, wcaD, wza, wzb, or wzc gene.
6. The method of claim 1, further comprising an inactivating mutation in a Ion gene.
7. The method of claim 6, wherein said bacterium comprises a functional promoter-less wild-type E. coli lacZ.sup.+ gene inserted into said Ion gene.
8. The method of claim 1, wherein said bacterium further comprises an exogenous E. coli rcsA or E. coli rcsB gene.
9. The method of claim 1, wherein said exogenous lactose-accepting fucosyltransferase gene comprises a Bacteroides fragilis wcfW gene or a Helicobacter pylori 26695 futA gene.
10. The method of claim 1, wherein said bacterium comprises both an exogenous fucosyltransferase gene encoding α(1,2) fucosyltransferase and an exogenous fucosyltransferase gene encoding α(1,3) fucosyltransferase.
11. The method of claim 1, wherein said bacterium comprises an endogenous lacY gene.
12. The method of claim 1, wherein said bacterium comprises a lactose permease gene.
13. The method of claim 12, wherein said lactose permease gene comprises an exogenous lactose permease gene.
14. The method of claim 1, wherein said E.coli bacterium comprises a deletion of an endogenous β-galactosidase gene.
15. The method of claim 1, wherein said bacterium accumulates an increased cytoplasmic lactose level, wherein the increased intracellular lactose level is at least 10% more than the level in a corresponding wild type bacterium.
16. The method of claim 1, wherein said fucosylated oligosaccharide comprises 2′-fucosyllactose, 3-fucosyllactose, or lactodifucotetraose.
17. The method of claim 1, wherein culturing said bacterium in step (b) does not comprise an antibiotic.
18. The method of claim 1, wherein said bacterium comprises a plasmid expression vector.
19. The method of claim 1, wherein said bacterium comprises an exogenous fucosyltransferase gene encoding α(1,2) fucosyltransferase or an exogenous fucosyltransferase gene encoding α(1,3) fucosyltransferase.
20. The method of claim 1, wherein said exogenous lactose-accepting fucosyltransferase gene comprises an α(1,3) fucosyltransferase gene from Helicobacter pylori , H. hepaticus, H. bilis, Campylobacter jejuni, or a species of Bacteroides.
21. The method of claim 1, wherein said exogenous lactose-accepting fucosyltransferase gene comprises a E. coli strain O128:B12 wbsJ gene, a Helicobacter pylori 26695 futC gene, a H. hepaticus Hh0072 gene, a H. pylori 11639 FucTa gene, or a H. pylori UA948 FucTa gene.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17)
(18)
(19)
(20)
DETAILED DESCRIPTION OF THE INVENTION
(21) Human milk glycans, which comprise both oligosaccharides (HMOS) and their glycoconjugates, play significant roles in the protection and development of human infants, and in particular the infant gastrointestinal (GI) tract. Milk oligosaccharides found in various mammals differ greatly, and their composition in humans is unique (Hamosh M., 2001 Pediatr Clin North Am, 48:69-86; Newburg D. S., 2001 Adv Exp Med Biol, 501:3-10). Moreover, glycan levels in human milk change throughout lactation and also vary widely among individuals (Morrow A. L. et al., 2004 J Pediatr, 145:297-303; Chaturvedi P et al., 2001 Glycobiology, 11:365-372). Previously, a full exploration of the roles of HMOS was limited by the inability to adequately characterize and measure these compounds. In recent years sensitive and reproducible quantitative methods for the analysis of both neutral and acidic HMOS have been developed (Erney, R., Hilty, M., Pickering, L., Ruiz-Palacios, G., and Prieto, P. (2001) Adv Exp Med Biol 501, 285-297. Bao, Y., and Newburg, D. S. (2008) Electrophoresis 29, 2508-2515). Approximately 200 distinct oligosaccharides have been identified in human milk, and combinations of a small number of simple epitopes are responsible for this diversity (Newburg D. S., 1999 Curr Med Chem, 6:117-127; Ninonuevo M. et al., 2006 J Agric Food Chem, 54:7471-74801). HMOS are composed of 5 monosaccharides: D-glucose (Glc), D-galactose (Gal), N-acetylglucosamine (GlcNAc), L-fucose (Fuc), and sialic acid (N-acetyl neuraminic acid, Neu5Ac, NANA). HMOS are usually divided into two groups according to their chemical structures: neutral compounds containing Glc, Gal, GlcNAc, and Fuc, linked to a lactose (Galβ1-4Glc) core, and acidic compounds including the same sugars, and often the same core structures, plus NANA (Charlwood J. et al., 1999 Anal Biochem, 273:261-277; Martifn-Sosa et al., 2003 J Dairy Sci, 86:52-59; Parkkinen J. and Finne J., 1987 Methods Enzymol, 138:289-300; Shen Z. et al., 2001 J Chromatogr A, 921:315-321). Approximately 70-80% of oligosaccharides in human milk are fucosylated, and their synthetic pathways are believed to proceed in a manner similar to those pathways shown in
(22) Interestingly, HMOS as a class, survive transit through the intestine of infants very efficiently, a function of their being poorly transported across the gut wall and of their resistance to digestion by human gut enzymes (Chaturvedi, P., Warren, C. D., Buescher, C. R., Pickering, L. K. & Newburg, D. S. Adv Exp Med Biol 501, 315-323 (2001)). One consequence of this survival in the gut is that HMOS are able to function as prebiotics, i.e. they are available to serve as an abundant carbon source for the growth of resident gut commensal microorganisms (Ward, R. E., Nifionuevo, M., Mills, D. A., Lebrilla, C. B., and German, J. B. (2007) Mol Nutr Food Res 51, 1398-1405). Recently, there is burgeoning interest in the role of diet and dietary prebiotic agents in determining the composition of the gut microflora, and in understanding the linkage between the gut microflora and human health (Roberfroid, M., Gibson, G. R., Hoyles, L, McCartney, A. L., Rastall, R., Rowland, I., Wolvers, D., Watzl, B., Szajewska, H., Stahl, B., Guarner, F., Respondek, F., Whelan, K., Coxam, V., Davicco, M. J., Léotoing, L., Wittrant, Y., Delzenne, N. M., Cani, P. D., Neyrinck, A. M., and Meheust, A. (2010) Br J Nutr 104 Suppl 2, S1-63).
(23) A number of human milk glycans possess structural homology to cell receptors for enteropathogens, and serve roles in pathogen defense by acting as molecular receptor “decoys”. For example, pathogenic strains of Campylobacter bind specifically to glycans in human milk containing the H-2 epitope, i.e., 2′-fucosyl-N-acetyllactosamine or 2′-fucosyllactose (2′-FL); Campylobacter binding and infectivity are inhibited by 2′-FL and other glycans containing this H-2 epitope (Ruiz-Palacios, G. M., Cervantes, L. E., Ramos, P., Chavez-Munguia, B., and Newburg, D. S. (2003) J Biol Chem 278, 14112-14120). Similarly, some diarrheagenic E. coli pathogens are strongly inhibited in vivo by HMOS containing 2′-linked fucose moieties. Several major strains of human caliciviruses, especially the noroviruses, also bind to 2′-linked fucosylated glycans, and this binding is inhibited by human milk 2′-linked fucosylated glycans. Consumption of human milk that has high levels of these 2′-linked fucosyloligosaccharides has been associated with lower risk of norovirus, Campylobacter, ST of E. coli-associated diarrhea, and moderate-to-severe diarrhea of all causes in a Mexican cohort of breastfeeding children (Newburg D. S. et al., 2004 Glycobiology, 14:253-263; Newburg D. S. et al., 1998 Lancet, 351:1160-1164). Several pathogens are also known to utilize sialylated glycans as their host receptors, such as influenza (Couceiro, J. N., Paulson, J. C. & Baum, L G. Virus Res 29, 155-165 (1993)), parainfluenza (Amonsen, M., Smith, D. F., Cummings, R. D. & Air, G. M. J Virol 81, 8341-8345 (2007), and rotoviruses (Kuhlenschmidt, T. B., Hanafin, W. P., Gelberg, H. B. & Kuhlenschmidt, M. S. Adv Exp Med Biol 473, 309-317 (1999)). The sialyl-Lewis X epitope is used by Helicobacter pylori (Mahdavi, J., Sondén, B., Hurtig, M., Olfat, F. O., et al. Science 297, 573-578 (2002)), Pseudomonas aeruginosa (Scharfman, A., Delmotte, P., Beau, J., Lamblin, G., et al. Glycoconj J 17, 735-740 (2000)), and some strains of noroviruses (Rydell, G. E., Nilsson, J., Rodriguez-Diaz, J., Ruvoën-Clouet, N., et al. Glycobiology 19, 309-320 (2009)).
(24) While studies suggest that human milk glycans could be used as prebiotics and as antimicrobial anti-adhesion agents, the difficulty and expense of producing adequate quantities of these agents of a quality suitable for human consumption has limited their full-scale testing and perceived utility. What has been needed is a suitable method for producing the appropriate glycans in sufficient quantities at reasonable cost. Prior to the invention described herein, there were attempts to use several distinct synthetic approaches for glycan synthesis. Novel chemical approaches can synthesize oligosaccharides (Flowers, H. M. Methods Enzymol 50, 93-121 (1978); Seeberger, P. H. Chem Commun (Camb) 1115-1121 (2003)), but reactants for these methods are expensive and potentially toxic (Koeller, K. M. & Wong, C. H. Chem Rev 100, 4465-4494 (2000)). Enzymes expressed from engineered organisms (Albermann, C., Piepersberg, W. & Wehmeier, U. F. Carbohydr Res 334, 97-103 (2001); Bettler, E., Samain, E., Chazalet, V., Bosso, C., et al. Glycoconj J 16, 205-212 (1999); Johnson, K. F. Glycoconj J 16, 141-146 (1999); Palcic, M. M. Curr Opin Biotechnol 10, 616-624 (1999); Wymer, N. & Toone, E. J. Curr Opin Chem Biol 4, 110-119 (2000)) provide a precise and efficient synthesis (Palcic, M. M. Curr Opin Biotechnol 10, 616-624 (1999)); Crout, D. H. & Vic, G. Curr Opin Chem Biol 2, 98-111 (1998)), but the high cost of the reactants, especially the sugar nucleotides, limits their utility for low-cost, large-scale production. Microbes have been genetically engineered to express the glycosyltransferases needed to synthesize oligosaccharides from the bacteria's innate pool of nucleotide sugars (Endo, T., Koizumi, S., Tabata, K., Kakita, S. & Ozaki, A. Carbohydr Res 330, 439-443 (2001); Endo, T., Koizumi, S., Tabata, K. & Ozaki, A. Appl Microbiol Biotechnol 53, 257-261 (2000); Endo, T. & Koizumi, S. Curr Opin Struct Biol 10, 536-541 (2000); Endo, T., Koizumi, S., Tabata, K., Kakita, S. & Ozaki, A. Carbohydr Res 316, 179-183 (1999); Koizumi, S., Endo, T., Tabata, K. & Ozaki, A. Nat Biotechnol 16, 847-850 (1998)). However, low overall product yields and high process complexity have limited the commercial utility of these approaches.
(25) Prior to the invention described herein, which enables the inexpensive production of large quantities of neutral and acidic HMOS, it had not been possible to fully investigate the ability of this class of molecule to inhibit pathogen binding, or indeed to explore their full range of potential additional functions.
(26) Prior to the invention described herein, chemical syntheses of HMOS were possible, but were limited by stereo-specificity issues, precursor availability, product impurities, and high overall cost (Flowers, H. M. Methods Enzymol 50, 93-121 (1978); Seeberger, P. H. Chem Commun (Camb) 1115-1121 (2003); Koeller, K. M. & Wong, C. H. Chem Rev 100, 4465-4494 (2000)). Also, prior to the invention described herein, in vitro enzymatic syntheses were also possible, but were limited by a requirement for expensive nucleotide-sugar precursors. The invention overcomes the shortcomings of these previous attempts by providing new strategies to inexpensively manufacture large quantities of human milk oligosaccharides for use as dietary supplements. The invention described herein makes use of an engineered bacterium E. coli (or other bacteria) engineered to produce 2′-FL, 3FL, LDFT, or sialylated fucosyl-oligosaccharides in commercially viable levels, for example the methods described herein enable the production of 2′-fucosylactose at >50 g/L in bioreactors.
Example 1. Engineering of E. coli to Generate Host Strains for the Production of Fucosylated Human Milk Oligosaccharides
(27) The E. coli K12 prototroph W3110 was chosen as the parent background for fucosylated HMOS biosynthesis. This strain had previously been modified at the ampC locus by the introduction of a tryptophan-inducible P.sub.trpB-cI+ repressor construct (McCoy, J. & Lavallie, E. Current protocols in molecular biology/edited by Frederick M. Ausubel . . . [et al.](2001)), enabling economical production of recombinant proteins from the phage λP.sub.L promoter (Sanger, F., Coulson, A. R., Hong, G. F., Hill, D. F. & Petersen, G. B. J Mol Biol 162, 729-773 (1982)) through induction with millimolar concentrations of tryptophan (Mieschendahl, M., Petri, T. & Hänggi, U. Nature Biotechnology 4, 802-808 (1986)). The strain GI724, an E. coli W3110 derivative containing the tryptophan-inducible P.sub.trpB-cI+ repressor construct in ampC, was used at the basis for further E. coli strain manipulations (
(28) Biosynthesis of fucosylated HMOS requires the generation of an enhanced cellular pool of both lactose and GDP-fucose (
(29) Subsequently, the ability of the host E. coli strain to synthesize colanic acid, an extracellular capsular polysaccharide, was eliminated in strain E205 (
(30) A thyA (thymidylate synthase) mutation was introduced into strain E205 to produce strain E214 (
(31) One strategy for GDP-fucose production is to enhance the bacterial cell's natural synthesis capacity. For example, this is enhancement is accomplished by inactivating enzymes involved in GDP-fucose consumption, and/or by overexpressing a positive regulator protein, RcsA, in the colanic acid (a fucose-containing exopolysaccharide) synthesis pathway. Collectively, this metabolic engineering strategy re-directs the flux of GDP-fucose destined for colanic acid synthesis to oligosaccharide synthesis (
Glucose.fwdarw.Glc-6-P.fwdarw.Fru-6-P.fwdarw..sup.1Man-6-P.fwdarw..sup.2Man-1-P.fwdarw.GDP-Man-.sup.4.5GDP-Fuc .sup.6Colanic acid.
(32) Specifically, the magnitude of the cytoplasmic GDP-fucose pool in strain E214 is enhanced by over-expressing the E. coli positive transcriptional regulator of colanic acid biosynthesis, RscA (Gottesman, S. & Stout, V. Mol Microbiol 5, 1599-1606 (1991)). This over-expression of RcsA is achieved by incorporating a wild-type rcsA gene, including its promoter region, onto a multicopy plasmid vector and transforming the vector into the E. coli host, e.g. into E214. This vector typically also carries additional genes, in particular one or two fucosyltransferase genes under the control of the pL promoter, and thyA and beta-lactamase genes for plasmid selection and maintenance. pG175 (SEQ ID NO: 1 and
(33) A third means to increase the intracellular GDP-fucose pool may also be employed. Colanic acid biosynthesis is increased following the introduction of a null mutation into the E. coli ion gene. Lon is an ATP-dependant intracellular protease that is responsible for degrading RcsA, mentioned above as a positive transcriptional regulator of colanic acid biosynthesis in E. coli (Gottesman, S. & Stout, V. Mol Microbiol 5, 1599-1606 (1991)). In a ion null background, RcsA is stabilized, RcsA levels increase, the genes responsible for GDP-fucose synthesis in E. coli are up-regulated, and intracellular GDP-fucose concentrations are enhanced. The ion gene was almost entirely deleted and replaced by an inserted functional, wild-type, but promoter-less E. coli lacZ.sup.+ gene (Δlon::(kan, lacZ.sup.+) in strain E214 to produce strain E390. λRed recombineering was used to perform the construction.
(34) The production host strain, E390 incorporates all the above genetic modifications and has the following genotype:
ampC::(P.sub.trpBλcI.sup.+),P.sub.lacI.sub.
(35) An additional modification of E390 that is useful for increasing the cytoplasmic pool of free lactose (and hence the final yield of 2′-FL) is the incorporation of a lacA mutation. LacA is a lactose acetyltransferase that is only active when high levels of lactose accumulate in the E. coli cytoplasm. High intracellular osmolarity (e.g., caused by a high intracellular lactose pool) can inhibit bacterial growth, and E. coli has evolved a mechanism for protecting itself from high intra cellular osmolarity caused by lactose by “tagging” excess intracellular lactose with an acetyl group using LacA, and then actively expelling the acetyl-lactose from the cell (Danchin, A. Bioessays 31, 769-773 (2009)). Production of acetyl-lactose in E. coli engineered to produce 2′-FL or other human milk oligosaccharides is therefore undesirable: it reduces overall yield. Moreover, acetyl-lactose is a side product that complicates oligosaccharide purification schemes. The incorporation of a lacA mutation resolves these problems. Strain E403 (
(36) The production host strain, E403 incorporates all the above genetic modifications and has the following genotype:
ampC::(P.sub.trpBλcI.sup.+),P.sub.lacI.sub.
Example 2. 2′-FL Production at Small Scale
(37) Various alternative α(1,2) fucosyltransferases are able to utilize lactose as a sugar acceptor and are available for the purpose of 2′-FL synthesis when expressed under appropriate culture conditions in E. coli E214, E390 or E403. For example the plasmid pG175 (ColE1, thyA+, bla+, P.sub.L2-wbsJ, rcsA+) (SEQ ID NO: 1,
(38) The addition of tryptophan to the lactose-containing growth medium of cultures of any one of the strains E214, E390 or E403, when transformed with any one of the plasmids pG171, pG175 or pG180 leads, for each particular strain/plasmid combination, to activation of the host E. coli tryptophan utilization repressor TrpR, subsequent repression of P.sub.trpB and a consequent decrease in cytoplasmic cI levels, which results in a de-repression of P.sub.L, expression of futC, wbsJ or wcfW, respectively, and production of 2′-FL.
(39) The DNA sequence of the Bacteroides fragilis strain NCTC 9343 wcfW gene (protein coding sequence) is set forth below (SEQ ID NO: 4).
Example 3. 2′-FL Production in the Bioreactor
(40) 2′-FL can be produced in the bioreactor by any one of the host E. coli strains E214, E390 or E403, when transformed with any one of the plasmids pG171, pG175 or pG180. Growth of the transformed strain is performed in a minimal medium in a bioreactor, 10 L working volume, with control of dissolved oxygen, pH, lactose substrate, antifoam and nutrient levels. Minimal “FERM” medium is used in the bioreactor, which is detailed below.
(41) Ferm (10 liters): Minimal medium comprising:
(42) 40 g (NH.sub.4).sub.2HPO.sub.4 100 g KH.sub.2PO.sub.4 10 g MgSO.sub.4.7H.sub.2O 40 g NaOH Trace elements: 1.3 g NTA 0.5 g FeSO.sub.4.7H.sub.2O 0.09 g MnCl.sub.2.4H.sub.2O 0.09 g ZnSO.sub.4.7H.sub.2O 0.01 g CoCl.sub.2.6H.sub.2O 0.01 g CuCl.sub.2.2H.sub.2O 0.02 g H.sub.3BO.sub.3 0.01 g Na.sub.2MoO.sub.4.2H.sub.2O (pH 6.8) Water to 10 liters DF204 antifoam (0.1 ml/L) 150 g glycerol (initial batch growth), followed by fed batch mode with a 90% glycerol-1% MgSO.sub.4-1× trace elements feed, at various rates for various times.
(43) Production cell densities of A.sub.600>100 are routinely achieved in these bioreactor runs. Briefly, a small bacterial culture is grown overnight in “FERM”—in the absence of either antibiotic or exogenous thymidine. The overnight culture (@˜2 A.sub.600) is used to inoculate a bioreactor (10 L working volume, containing “FERM”) to an initial cell density of ˜0.2 A.sub.600. Biomass is built up in batch mode at 30° C. until the glycerol is exhausted (A.sub.600˜20), and then a fed batch phase is initiated utilizing glycerol as the limiting carbon source. At A.sub.600˜ 30, 0.2 g/L tryptophan is added to induce α(1,2) fucosyltransferase synthesis. An initial bolus of lactose is also added at this time. 5 hr later, a continuous slow feed of lactose is started in parallel to the glycerol feed. These conditions are continued for 48 hr (2′-FL production phase). At the end of this period, both the lactose and glycerol feeds are terminated, and the residual glycerol and lactose are consumed over a final fermentation period, prior to harvest. 2′-FL accumulates in the spent fermentation medium at concentrations as much as 30 times higher than in the cytoplasm. The specific yield in the spent medium varies between 10 and 50 g/L, depending on precise growth and induction conditions.
Example 4. 2′-Fucosyllactose Purification
(44) 2′-FL purification from E. coli fermentation broth is accomplished though five steps:
(45) 1. Clarification
(46) Fermentation broth is harvested and cells removed by sedimentation in a preparative centrifuge at 6000×g for 30 min. Each bioreactor run yields about 5-7 L of partially clarified supernatant. Clarified supernatants have a brown/orange coloration attributed to a fraction of caramelized sugars produced during the course of the fermentation, particularly by side-reactions promoted by the ammonium ions present in the fermentation medium.
(47) 2. Product Capture on Coarse Carbon
(48) A column packed with coarse carbon (Calgon 12×40 TR) of ˜1000 ml volume (dimension 5 cm diameter×60 cm length) is equilibrated with 1 column volume (CV) of water and loaded with clarified culture supernatant at a flow rate of 40 ml/min. This column has a total capacity of about 120 g of sugar (lactose). Following loading and sugar capture, the column is washed with 1.5 CV of water, then eluted with 2.5 CV of 50% ethanol or 25% isopropanol (lower concentrations of ethanol at this step (25-30%) may be sufficient for product elution). This solvent elution step releases about 95% of the total bound sugars on the column and a small portion of the color bodies (caramels). In this first step capture of the maximal amount of sugar is the primary objective. Resolution of contaminants is not an objective. The column can be regenerated with a 5 CV wash with water.
(49) 3. Evaporation
(50) A volume of 2.5 L of ethanol or isopropanol eluate from the capture column is rotary-evaporated at 56 C and a sugar syrup in water is generated (this typically is a yellow-brown color). Alternative methods that could be used for this step include lyophilization or spray-drying.
(51) 4. Flash Chromatography on Fine Carbon and Ion Exchange Media
(52) A column (GE Healthcare HiScale50/40, 5×40 cm, max pressure 20 bar) connected to a Biotage Isolera One FLASH Chromatography System is packed with 750 ml of a Darco Activated Carbon G60 (100-mesh): Celite 535 (coarse) 1:1 mixture (both column packings obtained from Sigma). The column is equilibrated with 5 CV of water and loaded with sugar from step 3 (10-50 g, depending on the ratio of 2′-FL to contaminating lactose), using either a celite loading cartridge or direct injection. The column is connected to an evaporative light scattering (ELSD) detector to detect peaks of eluting sugars during the chromatography. A four-step gradient of isopropanol, ethanol or methanol is run in order to separate 2′-FL from monosaccharides (if present), lactose and color bodies. e.g., for B=ethanol: Step 1, 2.5 CV 0% B; Step 2, 4 CV 10% B (elutes monosaccharides and lactose contaminants); step 3, 4 CV 25% B (Elutes 2′-FL); step 4, 5 CV 50% B (elutes some of the color bodies and partially regenerates the column). Additional column regeneration is achieved using methanol @ 50% and isopropanol @ 50%. Fractions corresponding to sugar peaks are collected automatically in 120-ml bottles, pooled and directed to step 5. In certain purification runs from longer-than-normal fermentations, passage of the 2′-FL-containing fraction through anion-exchange and cation exchange columns can remove excess protein/DNA/caramel body contaminants. Resins tested successfully for this purpose are Dowex 22 and Toyopearl Mono-Q, for the anion exchanger, and Dowex 88 for the cation exchanger. Mixed bed Dowex resins have proved unsuitable as they tend to adsorb sugars at high affinity via hydrophobic interactions.
(53) 5. Evaporation/Lyophilization
(54) 3.0 L of 25% B solvent fractions is rotary-evaporated at 56 C until dry. Clumps of solid sugar are re-dissolved in a minimum amount of water, the solution frozen, and then lyophilized. A white, crystalline, sweet powder (2′-FL) is obtained at the end of the process. 2′-FL purity obtained lies between 95 and 99%.
(55) Sugars are routinely analyzed for purity by spotting 1 μl aliquots on aluminum-backed silica G60 Thin Layer Chromatography plates (10×20 cm; Macherey-Nagel). A mixture of LDFT (Rf=0.18), 2′-FL (Rf=0.24), lactose (Rf=0.30), trehalose (Rf=0.32), acetyl-lactose (Rf=0.39) and fucose (Rf=0.48) (5 g/L concentration for each sugar) is run alongside as standards. The plates are developed in a 50% butanol:25% acetic acid:25% water solvent until the front is within 1 cm from the top. Improved sugar resolution can be obtained by performing two sequential runs, drying the plate between runs. Sugar spots are visualized by spraying with α-naphtol in a sulfuric acid-ethanol solution (2.4 g α-naphtol in 83% (v/v) ethanol, 10.5% (v/v) sulfuric acid) and heating at 120 C for a few minutes. High molecular weight contaminants (DNA, protein, caramels) remain at the origin, or form smears with Rfs lower than LDFT.
Example 5. 3FL Production
(56) Any one of E. coli host strains E214, E390 or E403, when transformed with a plasmid expressing an α(1,3)fucosyltransferase capable of using lactose as the sugar acceptor substrate, will produce the human milk oligosaccharide product, 3-fucosyllactose (3FL).
(57) 3FL biosynthesis is performed as described above for 2′-FL, either at small scale in culture tubes and culture flasks, or in a bioreactor (10 L working volume) utilizing control of dissolved oxygen, pH, lactose substrate, antifoam and carbon:nitrogen balance. Cell densities of A.sub.600˜100 are reached in the bioreacter, and specific 3FL yields of up to 3 g/L have been achieved. Approximately half of the 3FL produced is found in the culture supernatant, and half inside the cells. Purification of 3FL from E. coli culture supernatants is achieved using an almost identical procedure to that described above for 2′-FL. The only substantive difference being that 3FL elutes from carbon columns at lower alcohol concentrations than does 2′-FL.
Example 6. The Simultaneous Production of Human Milk Oligosaccharides 2′-Fucosyllactose (2′-FL), 3-Fucosyllactose (3FL), and Lactodifucohexaose (LDFT) in E. coli
(58) E. coli strains E214, E390 and E403 accumulate cytoplasmic pools of both lactose and GDP-fucose, as discussed above, and when transformed with plasmids expressing either an α(1,2) fucosyltransferase or an α(1,3) fucosyltransferase can synthesize the human milk oligosaccharides 2′-FL or 3FL respectively. The tetrasaccharide lactodifucotetrose (LDFT) is another major fucosylated oligosaccharide found in human milk, and contains both α(1,2)- and α(1,3)-linked fucose residues. pG177 (
Example 7. 3′-SL Synthesis in the E. coli Cytoplasm
(59) The first step in the production of 3′-sialyllactose (3′-SL) in E. coli is generation of a host background strain that accumulates cytoplasmic pools of both lactose and CMP-Neu5Ac (CMP-sialic acid). Accumulation of cytoplasmic lactose is achieved through growth on lactose and inactivation of the endogenous E. coli β-galactosidase gene (lacZ), being careful to minimize polarity effects on lacY, the lac permease. This accumulation of a lactose pool has already been accomplished and is described above in E. coli hosts engineered for 2′-FL, 3FL and LDFT production.
(60) Specifically, a scheme to generate a cytoplasmic CMP-Neu5Ac pool, modified from methods known in the art, (e.g., Ringenberg, M., Lichtensteiger, C. & Vimr, E. Glycobiology 11, 533-539 (2001); Fierfort, N. & Samain, E. J Biotechnol 134, 261-265 (2008)), is shown in
(61) Next, since E. coli K12 lacks a de novo sialic acid synthesis pathway, sialic acid synthetic capability is introduced through the provision of three recombinant enzymes; a UDP-GlcNAc 2-epimerase (e.g., neuC), a Neu5Ac synthase (e.g., neuB) and a CMP-Neu5Ac synthetase (e.g., neuA). Equivalent genes from C. jejuni, E. coli K1, H. influenzae or from N. meningitides can be utilized (interchangeably) for this purpose.
(62) The addition of sialic acid to the 3′ position of lactose to generate 3′-sialyllactose is then achieved utilizing a bacterial-type α(2,3)sialyltransferase, and numerous candidate genes have been described, including those from N. meningitidis and N. gonorrhoeae (Gilbert, M., Watson, D. C., Cunningham, A. M., Jennings, M. P., et al. J Biol Chem 271, 28271-28276 (1996); Gilbert, M., Cunningham, A. M., Watson, D. C., Martin, A., et al. Eur J Biochem 249, 187-194 (1997)). The Neisseria enzymes are already known to use lactose as an acceptor sugar. The recombinant N. meningitidis enzyme generates 3′-sialyllactose in engineered E. coli (Fierfort, N. & Samain, E. J Biotechnol 134, 261-265 (2008)).
Example 8. The Production of Human Milk Oligosaccharide 3′-Sialyl-3-Fucosyllactose (3′-S3FL) in E. coli
(63) Prior to the invention described herein, it was unpredictable that a combination of any particular fucosyltransferase gene and any particular sialyl-transferase gene in the same bacterial strain could produce 3′-S3FL. Described below are results demonstrating that the combination of a fucosyltransferase gene and a sialyl-transferase gene in the same Lac.sup.+ E. coli strain resulted in the production of 3′-S3FL. These unexpected results are likely due to the surprisingly relaxed substrate specificity of the particular fucosyltransferase and sialyl-transferase enzymes utilzied.
(64) Humans synthesize the sialyl-Lewis X epitope utilizing different combinations of six α(1,3)fucosyl- and six α(2,3)sialyl-transferases encoded in the human genome (de Vries, T., Knegtel, R. M., Holmes, E. H. & Macher, B. A. Glycobiology 11, 119R-128R (2001); Taniguchi, A. Curr Drug Targets 9, 310-316 (2008)). These sugar transferases differ not only in their tissue expression patterns, but also in their acceptor specificities. For example, human myeloid-type α(1,3) fucosyltransferase (FUT IV) will fucosylate Type 2 (Galβ1->4Glc/GlcNAc) chain-based acceptors, but only if they are non-sialylated. In contrast “plasma-type” α(1,3) fucosyltansferase (FUT VI) will utilize Type 2 acceptors whether or not they are sialylated, and the promiscuous “Lewis” α(1,3/4) fucosyltransferase (FUT III), found in breast and kidney, will act on sialylated and non-sialylated Type 1 (Galβ1->3GlcNAc) and Type 2 acceptors (Easton, E. W., Schiphorst, W. E., van Drunen, E., van der Schoot, C. E. & van den Eijnden, D. H. Blood 81, 2978-2986 (1993)). A similar situation exists for the family of human □ α(2,3)sialyl-transferases, with different enzymes exhibiting major differences in acceptor specificity (Legaigneur, P., Breton, C., El Battari, A., Guillemot, J. C., et al. J Biol Chem 276, 21608-21617 (2001); Jeanneau, C., Chazalet, V., Augé, C., Soumpasis, D. M., et al. J Biol Chem 279, 13461-13468 (2004)). This diversity in acceptor specificity highlights a key issue in the synthesis of 3′-sialyl-3-fucosyllactose (3′-S3FL) in E. coli, i.e., to identify a suitable combination of fucosyl- and sialyl-transferases capable of acting cooperatively to synthesize 3′-S3FL (utilizing lactose as the initial acceptor sugar). However, since human and all other eukaryotic fucosyl- and sialyl-transferases are secreted proteins located in the lumen of the golgi, they are poorly suited for the task of 3′-S3FL biosynthesis in the bacterial cytoplasm.
(65) Several bacterial pathogens are known to incorporate fucosylated and/or sialylated sugars into their cell envelopes, typically for reasons of host mimicry and immune evasion. For example; both Neisseria meningitides and Campylobacter jejuni are able to incorporate sialic acid through 2,3-linkages to galactose moieties in their capsular lipooligosaccharide (LOS) (Tsai, C. M., Kao, G. & Zhu, P. I Infection and Immunity 70, 407 (2002); Gilbert, M., Brisson, J. R., Karwaski, M. F., Michniewicz, J., et al. J Biol Chem 275, 3896-3906 (2000)), and some strains of E. coli incorporate α(1,2) fucose groups into lipopolysaccharide (LPS) (Li, M., Liu, X. W., Shao, J., Shen, J., et al. Biochemistry 47, 378-387 (2008); Li, M., Shen, J., Liu, X., Shao, J., et al. Biochemistry 47, 11590-11597 (2008)). Certain strains of Helicobacter pylori are able not only to incorporate α(2,3)-sialyl-groups, but also α(1,2)-, α(1,3)-, and α(1,4)-fucosyl-groups into LPS, and thus can display a broad range of human Lewis-type epitopes on their cell surface (Moran, A. P. Carbohydr Res 343, 1952-1965 (2008)). Most bacterial sialyl- and fucosyl-transferases operate in the cytoplasm, i.e., they are better suited to the methods described herein than are eukaryotic golgi-localized sugar transferases.
(66) Strains of E. coli engineered to express the transferases described above accumulate a cytoplasmic pool of lactose, as well as an additional pool of either the nucleotide sugar GDP-fucose, or the nucleotide sugar CMP-Neu5Ac (CMP-sialic acid). Addition of these sugars to the lactose acceptor is performed in these engineered hosts using candidate recombinant α(1,3)-fucosyl- or α(2,3)-sialyl-transferases, generating 3-fucosyllactose and 3′-sialyllactose respectively. Finally, the two synthetic capabilities are combined into a single E. coli strain to produce 3′-S3FL.
(67) An E. coli strain that accumulates cytoplasmic pools of both lactose and GDP-fucose has been developed. This strain, when transformed with a plasmid over-expressing an α(1,2)fucosyltransferase, produces 2′-fucosyllactose (2′-FL) at levels of ˜10-50 g/L of bacterial culture medium. A substitution of the α(1,2) fucosyltransferase in this host with an appropriate α(1,3) fucosyltransferase leads to the production of 3-fucosyllactose (3FL). The bacterial α(1,3) fucosyltransferase then works in conjunction with a bacterial α(2,3)sialyltransferase to make the desired product, 3′-S3FL.
(68) An α(1,3)fucosyltransferase (Hh0072) isolated from Helicobacter hepaticus exhibits activity towards both non-sialylated and sialylated Type 2 oligosaccharide acceptor substrates (Zhang, L., Lau, K., Cheng, J., Yu, H., et al. Glycobiology (2010)). This enzyme is cloned, expressed, and evaluated to measure utilization of a lactose acceptor and to evaluate production of 3FL in the context of the current GDP-fucose-producing E. coli host. Hh0072 is also tested in concert with various bacterial α(2,3)sialyltransferases for its competence in 3′-S3FL synthesis. As alternatives to Hh0072, there are two characterized homologous bacterial-type 3-fucosyltransferases identified in Helicobacter pylori, “11639 FucTa” (Ge, Z., Chan, N. W., Palcic, M. M. & Taylor, D. E. J Biol Chem 272, 21357-21363 (1997); Martin, S. L., Edbrooke, M. R., Hodgman, T. C., van den Eijnden, D. H. & Bird, M. I. J Biol Chem 272, 21349-21356 (1997)) and “UA948 FucTa” (Rasko, D. A., Wang, G., Palcic, M. M. & Taylor, D. E. J Biol Chem 275, 4988-4994 (2000)). These two paralogs exhibit differing acceptor specificities, “11639 FucTa” utilizes only Type 2 acceptors and is a strict α(1,3)-fucosyltransferase, whereas “UA948 FucTa” has relaxed acceptor specificity (utilizing both Type1 and Type 2 acceptors) and is able to generate both α(1,3)- and α(1,4)-fucosyl linkages. The precise molecular basis of this difference in specificity was determined (Ma, B., Lau, L. H., Palcic, M. M., Hazes, B. & Taylor, D. E. J Biol Chem 280, 36848-36856 (2005)), and characterization of several additional α(1,3)-fucosyltransferase paralogs from a variety of additional H. pylori strains revealed significant strain-to-strain acceptor specificity diversity.
(69) In addition to the enzymes from H. pylori and H. hepaticus, other bacterial α(1,3)-fucosyltransferases are optionally used. For example, close homologs of Hh0072 are found in H. bilis (HRAG_01092 gene, sequence accession EEO24035), and in C. jejuni (C1336_000250319 gene, sequence accession EFC31050).
(70) Described below is 3′-S3FL synthesis in E. coli. The first step towards this is to combine into a single E. coli strain the 3-fucosyllactose synthetic ability, outlined above, with the ability to make 3′-sialyllactose, also outlined above. All of the chromosomal genetic modifications discussed above are introduced into a new host strain, which will then simultaneously accumulate cytoplasmic pools of the 3 specific precursors; lactose, GDP-fucose and CMP-Neu5Ac. This “combined” strain background is then used to host simultaneous production of an α(1,3)fucosyltransferase with an α(2,3)sialyltransferase, with gene expression driven either off two compatible multicopy plasmids or with both enzyme genes positioned on the same plasmid as an artificial operon. Acceptor specificities for some of the bacterial α(1,3)fucosyltransferases and α(2,3)sialyltransferases, particularly with respect to fucosylation of 3′-sialyllactose and sialylation of 3-fucosyllactose and different combinations of α(1,3)fucosyltransferase and α(2,3)sialyltransferase enzymes are evaluated. Production levels and ratios of 3′-SL, 3FL and 3′-S3FL are monitored, e.g., by TLC, with confirmation of identity by NMR and accurate quantitation either by calibrated mass spectrometry utilizing specific ion monitoring, or by capillary electrophoresis (Bao, Y., Zhu, L. & Newburg, D. S. Simultaneous quantification of sialyloligosaccharides from human milk by capillary electrophoresis. Anal Biochem 370, 206-214 (2007)).
(71) The sequences corresponding to the SEQ ID NOs described herein are provided below.
(72) The sequence of PG175 is set forth below (SEQ ID NO: 1):
(73) TABLE-US-00001 TCGCGCGTTTCGGTGATGACGGTGAAAACCTCTGACACATGCAGCTCCCGGAGACGGTCACAGCTTGTCTGTAAG CGGATGCCGGGAGCAGACAAGCCCGTCAGGGCGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCTGGCTTAACTATG CGGCATCAGAGCAGATTGTACTGAGAGTGCACCATATATGCGGTGTGAAATACCGCACAGATGCGTAAGGAGAAA ATACCGCATCAGGCGCCTCCTCAACCTGTATATTCGTAAACCACGCCCAATGGGAGCTGTCTCAGGTTTGTTCCT GATTGGTTACGGCGCGTTTCGCATCATTGTTGAGTTTTTCCGCCAGCCCGACGCGCAGTTTACCGGTGCCTGGGT GCAGTACATCAGCATGGGGCAAATTCTTTCCATCCCGATGATTGTCGCGGGTGTGATCATGATGGTCTGGGCATA TCGTCGCAGCCCACAGCAACACGTTTCCTGAGGAACCATGAAACAGTATTTAGAACTGATGCAAAAAGTGCTCGA CGAAGGCACACAGAAAAACGACCGTACCGGAACCGGAACGCTTTCCATTTTTGGTCATCAGATGCGTTTTAACCT GCAAGATGGATTCCCGCTGGTGACAACTAAACGTTGCCACCTGCGTTCCATCATCCATGAACTGCTGTGGTTTCT GCAGGGCGACACTAACATTGCTTATCTACACGAAAACAATGTCACCATCTGGGACGAATGGGCCGATGAAAACGG CGACCTCGGGCCAGTGTATGGTAAACAGTGGCGCGCCTGGCCAACGCCAGATGGTCGTCATATTGACCAGATCAC TACGGTACTGAACCAGCTGAAAAACGACCCGGATTCGCGCCGCATTATTGTTTCAGCGTGGAACGTAGGCGAACT GGATAAAATGGCGCTGGCACCGTGCCATGCATTCTTCCAGTTCTATGTGGCAGACGGCAAACTCTCTTGCCAGCT TTATCAGCGCTCCTGTGACGTCTTCCTCGGCCTGCCGTTCAACATTGCCAGCTACGCGTTATTGGTGCATATGAT GGCGCAGCAGTGCGATCTGGAAGTGGGTGATTTTGTCTGGACCGGTGGCGACACGCATCTGTACAGCAACCATAT GGATCAAACTCATCTGCAATTAAGCCGCGAACCGCGTCCGCTGCCGAAGTTGATTATCAAACGTAAACCCGAATC CATCTTCGACTACCGTTTCGAAGACTTTGAGATTGAAGGCTACGATCCGCATCCGGGCATTAAAGCGCCGGTGGC TATCTAATTACGAAACATCCTGCCAGAGCCGACGCCAGTGTGCGTCGGTTTTTTTACCCTCCGTTAAATTCTTCG AGACGCCTTCCCGAAGGCGCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTT CGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGT CACGACGTTGTAAAACGACGGCCAGTGCCAAGCTTTCTTTAATGAAGCAGGGCATCAGGACGGTATCTTTGTGGA GAAAGCAGAGTAATCTTATTCAGCCTGACTGGTGGGAAACCACCAGTCAGAATGTGTTAGCGCATGTTGACAAAA ATACCATTAGTCACATTATCCGTCAGTCGGACGACATGGTAGATAACCTGTTTATTATGCGTTTTGATCTTACGT TTAATATTACCTTTATGCGATGAAACGGTCTTGGCTTTGATATTCATTTGGTCAGAGATTTGAATGGTTCCCTGA CCTGCCATCCACATTCGCAACATACTCGATTCGGTTCGGCTCAATGATAACGTCGGCATATTTAAAAACGAGGTT ATCGTTGTCTCTTTTTTCAGAATATCGCCAAGGATATCGTCGAGAGATTCCGGTTTAATCGATTTAGAACTGATC AATAAATTTTTTCTGACCAATAGATATTCATCAAAATGAACATTGGCAATTGCCATAAAAACGATAAATAACGTA TTGGGATGTTGATTAATGATGAGCTTGATACGCTGACTGTTAGAAGCATCGTGGATGAAACAGTCCTCATTAATA AACACCACTGAAGGGCGCTGTGAATCACAAGCTATGGCAAGGTCATCAACGGTTTCAATGTCGTTGATTTCTCTT TTTTTAACCCCTCTACTCAACAGATACCCGGTTAAACCTAGTCGGGTGTAACTACATAAATCCATAATAATCGTT GACATGGCATACCCTCACTCAATGCGTAACGATAATTCCCCTTACCTGAATATTTCATCATGACTAAACGGAACA ACATGGGTCACCTAATGCGCCACTCTCGCGATTTTTCAGGCGGACTTACTATCCCGTAAAGTGTTGTATAATTTG CCTGGAATTGTCTTAAAGTAAAGTAAATGTTGCGATATGTGAGTGAGCTTAAAACAAATATTTCGCTGCAGGAGT ATCCTGGAAGATGTTCGTAGAAGCTTACTGCTCACAAGAAAAAAGGCACGTCATCTGACGTGCCTTTTTTATTTG TACTACCCTGTACGATTACTGCAGCTCGAGTTATTATAATTTTACCCACGATTCGGGAATAATATCATGTTTAAT ATCTTTCTTAAACCATTTACTCGGAGCAATTACTGTTTTATTTTTATTTTCATTTAACCAAGCAGCCCACCAACT GAAAGAACTATTTGAAATTATATTATTTTTACATTTACTCATAAGCAGCATATCTAATTCAACATGATAAGCATC ACCTTGAACAAAACATATTTGATTATTAAAAAATATATTTTCCCTGCACCACTTTATATCATCAGAAAAAATGAA GAGAAGGGTTTTTTTATTAATAACACCTTTATTCATCAAATAATCAATGGCACGTTCAAAATATTTTTCACTACA TGTGCCATGAGTTTCATTTGCTATTTTACTGGAAACATAATCACCTCTTCTAATATGTAATGAACAAGTATCATT TTCTTTAATTAAATTAAGCAATTCATTTTGATAACTATTAAACTTGGTTTTAGGTTGAAATTCCTTTATCAACTC ATGCCTAAATTCCTTAAAATATTTTTCAGTTTGAAAATAACCGACGATTTTTTTATTTATACTTTTGGTATCAAT ATCTGGATCATACTCTAAACTTTTCTCAACGTAATGCTTTCTGAACATTCCTTTTTTCATGAAATGTGGGATTTT TTCGGAAAATAAGTATTTTTCAAATGGCCATGCTTTTTTTACAAATTCTGAACTACAAGATAATTCAACTAATCT TAATGGATGAGTTTTATATTTTACTGCATCAGATATATCAACAGTCAAATTTTGATGAGTTCTTTTTGCAATAGC AAATGCAGTTGCATACTGAAACATTTGATTACCAAGACCACCAATAATTTTAACTTCCATATGTATATCTCCTTC TTCTAGAATTCTAAAAATTGATTGAATGTATGCAAATAAATGCATACACCATAGGTGTGGTTTAATTTGATGCCC TTTTTCAGGGCTGGAATGTGTAAGAGCGGGGTTATTTATGCTGTTGTTTTTTTGTTACTCGGGAAGGGCTTTACC TCTTCCGCATAAACGCTTCCATCAGCGTTTATAGTTAAAAAAATCTTTCGGAACTGGTTTTGCGCTTACCCCAAC CAACAGGGGATTTGCTGCTTTCCATTGAGCCTGTTTCTCTGCGCGACGTTCGCGGCGGCGTGTTTGTGCATCCAT CTGGATTCTCCTGTCAGTTAGCTTTGGTGGTGTGTGGCAGTTGTAGTCCTGAACGAAAACCCCCCGCGATTGGCA CATTGGCAGCTAATCCGGAATCGCACTTACGGCCAATGCTTCGTTTCGTATCACACACCCCAAAGCCTTCTGCTT TGAATGCTGCCCTTCTTCAGGGCTTAATTTTTAAGAGCGTCACCTTCATGGTGGTCAGTGCGTCCTGCTGATGTG CTCAGTATCACCGCCAGTGGTATTTATGTCAACACCGCCAGAGATAATTTATCACCGCAGATGGTTATCTGTATG TTTTTTATATGAATTTATTTTTTGCAGGGGGGCATTGTTTGGTAGGTGAGAGATCAATTCTGCATTAATGAATCG GCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGG TCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACG CAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCC ATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTAT AAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACC TGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGG TCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATC GTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGA GGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTA TCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTG GTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCT TTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGA TCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTG ACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGAC TCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACC CACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAA CTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGC GCAACGTTGTTGCCATTGCTACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTT CCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCG TTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGC CATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGA GTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAA AACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCAC CCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAA AAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATC AGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACAT TTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACCATTATTATCATGACATTAACCTATAAAAATAGGCGTATCA CGAGGCCCTTTCGTC
(74) The sequence of pG176 is set forth below (SEQ ID NO: 2):
(75) TABLE-US-00002 TCGCGCGTTTCGGTGATGACGGTGAAAACCTCTGACACATGCAGCTCCCGGAGACGGTCACAGCTTGTCTGTAAG CGGATGCCGGGAGCAGACAAGCCCGTCAGGGCGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCTGGCTTAACTATG CGGCATCAGAGCAGATTGTACTGAGAGTGCACCATATATGCGGTGTGAAATACCGCACAGATGCGTAAGGAGAAA ATACCGCATCAGGCGCCATGAAACAGTATTTAGAACTGATGCAAAAAGTGCTCGACGAAGGCACACAGAAAAACG ACCGTACCGGAACCGGAACGCTTTCCATTTTTGGTCATCAGATGCGTTTTAACCTGCAAGATGGATTCCCGCTGG TGACAACTAAACGTTGCCACCTGCGTTCCATCATCCATGAACTGCTGTGGTTTCTGCAGGGCGACACTAACATTG CTTATCTACACGAAAACAATGTCACCATCTGGGACGAATGGGCCGATGAAAACGGCGACCTCGGGCCAGTGTATG GTAAACAGTGGCGCGCCTGGCCAACGCCAGATGGTCGTCATATTGACCAGATCACTACGGTACTGAACCAGCTGA AAAACGACCCGGATTCGCGCCGCATTATTGTTTCAGCGTGGAACGTAGGCGAACTGGATAAAATGGCGCTGGCAC CGTGCCATGCATTCTTCCAGTTCTATGTGGCAGACGGCAAACTCTCTTGCCAGCTTTATCAGCGCTCCTGTGACG TCTTCCTCGGCCTGCCGTTCAACATTGCCAGCTACGCGTTATTGGTGCATATGATGGCGCAGCAGTGCGATCTGG AAGTGGGTGATTTTGTCTGGACCGGTGGCGACACGCATCTGTACAGCAACCATATGGATCAAACTCATCTGCAAT TAAGCCGCGAACCGCGTCCGCTGCCGAAGTTGATTATCAAACGTAAACCCGAATCCATCTTCGACTACCGTTTCG AAGACTTTGAGATTGAAGGCTACGATCCGCATCCGGGCATTAAAGCGCCGGTGGCTATCTAAGGCGCCATTCGCC ATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGG ATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACGACGTTGTAAAACGACGGCCAGTGC CAAGCTTTCTTTAATGAAGCAGGGCATCAGGACGGTATCTTTGTGGAGAAAGCAGAGTAATCTTATTCAGCCTGA CTGGTGGGAAACCACCAGTCAGAATGTGTTAGCGCATGTTGACAAAAATACCATTAGTCACATTATCCGTCAGTC GGACGACATGGTAGATAACCTGTTTATTATGCGTTTTGATCTTACGTTTAATATTACCTTTATGCGATGAAACGG TCTTGGCTTTGATATTCATTTGGTCAGAGATTTGAATGGTTCCCTGACCTGCCATCCACATTCGCAACATACTCG ATTCGGTTCGGCTCAATGATAACGTCGGCATATTTAAAAACGAGGTTATCGTTGTCTCTTTTTTCAGAATATCGC CAAGGATATCGTCGAGAGATTCCGGTTTAATCGATTTAGAACTGATCAATAAATTTTTTCTGACCAATAGATATT CATCAAAATGAACATTGGCAATTGCCATAAAAACGATAAATAACGTATTGGGATGTTGATTAATGATGAGCTTGA TACGCTGACTGTTAGAAGCATCGTGGATGAAACAGTCCTCATTAATAAACACCACTGAAGGGCGCTGTGAATCAC AAGCTATGGCAAGGTCATCAACGGTTTCAATGTCGTTGATTTCTCTTTTTTTAACCCCTCTACTCAACAGATACC CGGTTAAACCTAGTCGGGTGTAACTACATAAATCCATAATAATCGTTGACATGGCATACCCTCACTCAATGCGTA ACGATAATTCCCCTTACCTGAATATTTCATCATGACTAAACGGAACAACATGGGTCACCTAATGCGCCACTCTCG CGATTTTTCAGGCGGACTTACTATCCCGTAAAGTGTTGTATAATTTGCCTGGAATTGTCTTAAAGTAAAGTAAAT GTTGCGATATGTGAGTGAGCTTAAAACAAATATTTCGCTGCAGGAGTATCCTGGAAGATGTTCGTAGAAGCTTAC TGCTCACAAGAAAAAAGGCACGTCATCTGACGTGCCTTTTTTATTTGTACTACCCTGTACGATTACTGCAGCTCG AGTTAATTCAAATCTTCTTCAGAAATCAATTTTTGTTCCAAACCCAATTTTTTAACCAACTTTCTCACCGCGCGC AACAAAGGCAAGGATTTTTGATAAGCTTTGCGATAGATTTTAAAAGTGGTGTTTTGAGAGAGTTCTAATAAAGGC GAAGCGTTTTGTAAAAGCCGGTCATAATTAACCCTCAAATCATCATAATTAACCCTCAAATCATCAATGGATACT AACGGCTTATGCAGATCGTACTCCCACATGAAAGATGTTGAGAATTTGTGATAAATCGTATCGTTTTCTAAAATC GTTTTAAAAAAATCTAGGATTTTTTTAAAACTCAAATCTTGGTAAAAGTAAGCTTTCCCATCAAGGGTGTTTAAA GGGTTTTCATAGAGCATGTCTAAATAAGCGTTTGGGTGCGTGTGCAGGTATTTGATATAATCAATCGCTTCATCA AAGTTGTTGAAATCATGCACATTCACAAAACTTTTAGGGTTAAAATCTTTCGCCACGCTGGGACTCCCCCAATAA ATAGGAATGGTATGGCTAAAATACGCATCAAGGATTTTTTCGGTTACATAGCCATAACCTTGCGAGTTTTCAAAA CAGAGATTGAACTTGTATTGGCTTAAAAACTCGCTTTTGTTTCCAACCTTATAGCCTAAAGTGTTTCTCACACTT CCTCCCCCAGTAACTGGCTCTATGGAATTTAGAGCGTCATAAAAAGCGTTCCTCATAGGAGCGTTAGCGTTGCTC GCTACAAAACTGGCAAACCCTCTTTTTAAAAGATCGCTCTCATCATTCACTACTGCGCACAAATTAGGGTGGTTT TCTTTAAAATGATGAGAGGGTTTTTTTAAAGCATAAAGGCTGTTGTCTTTGAGTTTGTAGGGCGCAGTGGTGTCA TTAACAAGCTCGGCTTTATAGTGCAAATGGGCATAATACAAAGGCATTCTCAAATAACGATCATTAAAATCCAAT TCATCAAAGCCTATGGCGTAATCAAAGAGGTTGAAATTAGGTGATTCGTTTTCACCGGTGTAAAACACTCGTTTA GTGTTTTGATAAGATAAAATCTTTCTAGCCGCTCCAAGAGGATTGCTAAAAACTAGATCTGAAAATTCATTGGGG TTTTGGTGGAGGGTGATTGCGTAGCGTTGGCTTAGGATAAAATAAAGAACGCTCTTTTTAAATTCTTTAATTTCT TCATCTCCCCACCAATTCGCCACAGCGATTTTTAGGGGGGGGGGGGGAGATTTAGAGGCCATTTTTTCAATGGAA GCGCTTTCTATAAAGGCGTCTAATAGGGGTTGGAACATATGTATATCTCCTTCTTGAATTCTAAAAATTGATTGA ATGTATGCAAATAAATGCATACACCATAGGTGTGGTTTAATTTGATGCCCTTTTTCAGGGCTGGAATGTGTAAGA GCGGGGTTATTTATGCTGTTGTTTTTTTGTTACTCGGGAAGGGCTTTACCTCTTCCGCATAAACGCTTCCATCAG CGTTTATAGTTAAAAAAATCTTTCGGAACTGGTTTTGCGCTTACCCCAACCAACAGGGGATTTGCTGCTTTCCAT TGAGCCTGTTTCTCTGCGCGACGTTCGCGGCGGCGTGTTTGTGCATCCATCTGGATTCTCCTGTCAGTTAGCTTT GGTGGTGTGTGGCAGTTGTAGTCCTGAACGAAAACCCCCCGCGATTGGCACATTGGCAGCTAATCCGGAATCGCA CTTACGGCCAATGCTTCGTTTCGTATCACACACCCCAAAGCCTTCTGCTTTGAATGCTGCCCTTCTTCAGGGCTT AATTTTTAAGAGCGTCACCTTCATGGTGGTCAGTGCGTCCTGCTGATGTGCTCAGTATCACCGCCAGTGGTATTT ATGTCAACACCGCCAGAGATAATTTATCACCGCAGATGGTTATCTGTATGTTTTTTATATGAATTTATTTTTTGC AGGGGGGCATTGTTTGGTAGGTGAGAGATCAATTCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTT GCGTATTGGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATC AGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGG CCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCA TCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGG AAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAG CGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGT GCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACA CGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTT CTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTAC CTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAA GCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTG GAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTA AAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGA GGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGAT ACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATC AGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTAT TAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGG CATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATG ATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGT GTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGAC TGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACG GGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTC AAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTAC TTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAA ATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATA CATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGT CTAAGAAACCATTATTATCATGACATTAACCTATAAAAATAGGCGTATCACGAGGCCCTTTCGTC
(76) The sequence of pG177 is set forth below (SEQ ID NO: 3):
(77) TABLE-US-00003 TCGCGCGTTTCGGTGATGACGGTGAAAACCTCTGACACATGCAGCTCCCGGAGACGGTCACAGCTTGTCTGTAAG CGGATGCCGGGAGCAGACAAGCCCGTCAGGGCGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCTGGCTTAACTATG CGGCATCAGAGCAGATTGTACTGAGAGTGCACCATATATGCGGTGTGAAATACCGCACAGATGCGTAAGGAGAAA ATACCGCATCAGGCGCCATGAAACAGTATTTAGAACTGATGCAAAAAGTGCTCGACGAAGGCACACAGAAAAACG ACCGTACCGGAACCGGAACGCTTTCCATTTTTGGTCATCAGATGCGTTTTAACCTGCAAGATGGATTCCCGCTGG TGACAACTAAACGTTGCCACCTGCGTTCCATCATCCATGAACTGCTGTGGTTTCTGCAGGGCGACACTAACATTG CTTATCTACACGAAAACAATGTCACCATCTGGGACGAATGGGCCGATGAAAACGGCGACCTCGGGCCAGTGTATG GTAAACAGTGGCGCGCCTGGCCAACGCCAGATGGTCGTCATATTGACCAGATCACTACGGTACTGAACCAGCTGA AAAACGACCCGGATTCGCGCCGCATTATTGTTTCAGCGTGGAACGTAGGCGAACTGGATAAAATGGCGCTGGCAC CGTGCCATGCATTCTTCCAGTTCTATGTGGCAGACGGCAAACTCTCTTGCCAGCTTTATCAGCGCTCCTGTGACG TCTTCCTCGGCCTGCCGTTCAACATTGCCAGCTACGCGTTATTGGTGCATATGATGGCGCAGCAGTGCGATCTGG AAGTGGGTGATTTTGTCTGGACCGGTGGCGACACGCATCTGTACAGCAACCATATGGATCAAACTCATCTGCAAT TAAGCCGCGAACCGCGTCCGCTGCCGAAGTTGATTATCAAACGTAAACCCGAATCCATCTTCGACTACCGTTTCG AAGACTTTGAGATTGAAGGCTACGATCCGCATCCGGGCATTAAAGCGCCGGTGGCTATCTAAGGCGCCATTCGCC ATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGG ATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACGACGTTGTAAAACGACGGCCAGTGC CAAGCTTTCTTTAATGAAGCAGGGCATCAGGACGGTATCTTTGTGGAGAAAGCAGAGTAATCTTATTCAGCCTGA CTGGTGGGAAACCACCAGTCAGAATGTGTTAGCGCATGTTGACAAAAATACCATTAGTCACATTATCCGTCAGTC GGACGACATGGTAGATAACCTGTTTATTATGCGTTTTGATCTTACGTTTAATATTACCTTTATGCGATGAAACGG TCTTGGCTTTGATATTCATTTGGTCAGAGATTTGAATGGTTCCCTGACCTGCCATCCACATTCGCAACATACTCG ATTCGGTTCGGCTCAATGATAACGTCGGCATATTTAAAAACGAGGTTATCGTTGTCTCTTTTTTCAGAATATCGC CAAGGATATCGTCGAGAGATTCCGGTTTAATCGATTTAGAACTGATCAATAAATTTTTTCTGACCAATAGATATT CATCAAAATGAACATTGGCAATTGCCATAAAAACGATAAATAACGTATTGGGATGTTGATTAATGATGAGCTTGA TACGCTGACTGTTAGAAGCATCGTGGATGAAACAGTCCTCATTAATAAACACCACTGAAGGGCGCTGTGAATCAC AAGCTATGGCAAGGTCATCAACGGTTTCAATGTCGTTGATTTCTCTTTTTTTAACCCCTCTACTCAACAGATACC CGGTTAAACCTAGTCGGGTGTAACTACATAAATCCATAATAATCGTTGACATGGCATACCCTCACTCAATGCGTA ACGATAATTCCCCTTACCTGAATATTTCATCATGACTAAACGGAACAACATGGGTCACCTAATGCGCCACTCTCG CGATTTTTCAGGCGGACTTACTATCCCGTAAAGTGTTGTATAATTTGCCTGGAATTGTCTTAAAGTAAAGTAAAT GTTGCGATATGTGAGTGAGCTTAAAACAAATATTTCGCTGCAGGAGTATCCTGGAAGATGTTCGTAGAAGCTTAC TGCTCACAAGAAAAAAGGCACGTCATCTGACGTGCCTTTTTTATTTGTACTACCCTGTACGATTACTGCAGCTCG AGTTAATTCAAATCTTCTTCAGAAATCAATTTTTGTTCAGCGTTATACTTTTGGGATTTTACCTCAAAATGGGAT TCTATTTTCACCCACTCCTTACAAAGGATATTCTCATGCCCAAAAAGCCAGTGTTTGGGGCCAATAATGATTTTT TCTGGATTTTCTATCAAATAGGCCGCCCACCAGCTATAAGTGCTATTAGCGATAATGCCATGCTGACAAGATTGC ATGAGCAGCATGTCCCAATACGCCTCTTCTTCTTTATCCCTAGTGGTCATGTCCATAAAAGGGTAGCCAAGATCA AGATTTTGCGTGAATTCTAAGTCTTCGCAAAACACAAAAAGCTCCATGTTTGGCACGCGCTTTGCCATATACTCA AGCGCCTTTTTTTGATAGTCAATACCAAGCTGACAGCCAATCCCCACATAATCCCCTCTTCTTATATGCACAAAC ACGCTGTTTTTAGCGGCTAAAATCAAAGAAAGCTTGCACTGATATTCTTCCTCTTTTTTATTATTATTCTTATTA TTTTCGGGTGGTGGTGGTAGAGTGAAGGTTTGCTTGATTAAAGGGGATATAGCATCAAAGTATCGTGGATCTTGG AAATAGCCAAAAAAATAAGTCAAGCGGCTTGGCTTTAGCAATTTAGGCTCGTATTCAAAAACGATTTCTTGACTC ACCCTATCAAATCCCATGCATTTGAGCGCGTCTCTTACTAGCTTGGGGAGGTGTTGCATTTTAGCTATAGCGATT TCTTTCGCGCTCGCATAGGGCAAATCAATAGGGAAAAGTTCTAATTGCATTTTCCTATCGCTCCAATCAAAAGAA GTGATATCTAACAGCACAGGCGTATTAGAGTGTTTTTGCAAACTTTTAGCGAAAGCGTATTGAAACATTTGATTC CCAAGCCCTCCGCAAATTTGCACCACCTTAAAAGCCATATGTATATCTCCTTCTTGCTCGAGTTAATTCAAATCT TCTTCAGAAATCAATTTTTGTTCCAAACCCAATTTTTTAACCAACTTTCTCACCGCGCGCAACAAAGGCAAGGAT TTTTGATAAGCTTTGCGATAGATTTTAAAAGTGGTGTTTTGAGAGAGTTCTAATAAAGGCGAAGCGTTTTGTAAA AGCCGGTCATAATTAACCCTCAAATCATCATAATTAACCCTCAAATCATCAATGGATACTAACGGCTTATGCAGA TCGTACTCCCACATGAAAGATGTTGAGAATTTGTGATAAATCGTATCGTTTTCTAAAATCGTTTTAAAAAAATCT AGGATTTTTTTAAAACTCAAATCTTGGTAAAAGTAAGCTTTCCCATCAAGGGTGTTTAAAGGGTTTTCATAGAGC ATGTCTAAATAAGCGTTTGGGTGCGTGTGCAGGTATTTGATATAATCAATCGCTTCATCAAAGTTGTTGAAATCA TGCACATTCACAAAACTTTTAGGGTTAAAATCTTTCGCCACGCTGGGACTCCCCCAATAAATAGGAATGGTATGG CTAAAATACGCATCAAGGATTTTTTCGGTTACATAGCCATAACCTTGCGAGTTTTCAAAACAGAGATTGAACTTG TATTGGCTTAAAAACTCGCTTTTGTTTCCAACCTTATAGCCTAAAGTGTTTCTCACACTTCCTCCCCCAGTAACT GGCTCTATGGAATTTAGAGCGTCATAAAAAGCGTTCCTCATAGGAGCGTTAGCGTTGCTCGCTACAAAACTGGCA AACCCTCTTTTTAAAAGATCGCTCTCATCATTCACTACTGCGCACAAATTAGGGTGGTTTTCTTTAAAATGATGA GAGGGTTTTTTTAAAGCATAAAGGCTGTTGTCTTTGAGTTTGTAGGGCGCAGTGGTGTCATTAACAAGCTCGGCT TTATAGTGCAAATGGGCATAATACAAAGGCATTCTCAAATAACGATCATTAAAATCCAATTCATCAAAGCCTATG GCGTAATCAAAGAGGTTGAAATTAGGTGATTCGTTTTCACCGGTGTAAAACACTCGTTTAGTGTTTTGATAAGAT AAAATCTTTCTAGCCGCTCCAAGAGGATTGCTAAAAACTAGATCTGAAAATTCATTGGGGTTTTGGTGGAGGGTG ATTGCGTAGCGTTGGCTTAGGATAAAATAAAGAACGCTCTTTTTAAATTCTTTAATTTCTTCATCTCCCCACCAA TTCGCCACAGCGATTTTTAGGGGGGGGGGGGGAGATTTAGAGGCCATTTTTTCAATGGAAGCGCTTTCTATAAAG GCGTCTAATAGGGGTTGGAACATATGTATATCTCCTTCTTGAATTCTAAAAATTGATTGAATGTATGCAAATAAA TGCATACACCATAGGTGTGGTTTAATTTGATGCCCTTTTTCAGGGCTGGAATGTGTAAGAGCGGGGTTATTTATG CTGTTGTTTTTTTGTTACTCGGGAAGGGCTTTACCTCTTCCGCATAAACGCTTCCATCAGCGTTTATAGTTAAAA AAATCTTTCGGAACTGGTTTTGCGCTTACCCCAACCAACAGGGGATTTGCTGCTTTCCATTGAGCCTGTTTCTCT GCGCGACGTTCGCGGCGGCGTGTTTGTGCATCCATCTGGATTCTCCTGTCAGTTAGCTTTGGTGGTGTGTGGCAG TTGTAGTCCTGAACGAAAACCCCCCGCGATTGGCACATTGGCAGCTAATCCGGAATCGCACTTACGGCCAATGCT TCGTTTCGTATCACACACCCCAAAGCCTTCTGCTTTGAATGCTGCCCTTCTTCAGGGCTTAATTTTTAAGAGCGT CACCTTCATGGTGGTCAGTGCGTCCTGCTGATGTGCTCAGTATCACCGCCAGTGGTATTTATGTCAACACCGCCA GAGATAATTTATCACCGCAGATGGTTATCTGTATGTTTTTTATATGAATTTATTTTTTGCAGGGGGGCATTGTTT GGTAGGTGAGAGATCAATTCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTC TTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGC GGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAG GAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACG CTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCG CTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCA TAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGT TCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACT GGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCC TAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGT TGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCG CAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACG TTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAA ATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGC GATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACC ATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCC AGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGA AGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCGTGGTGTCACG CTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTG CAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGT TATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAAC CAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCC ACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCT GTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTC TGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCAT ACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTAT TTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACCATTAT TATCATGACATTAACCTATAAAAATAGGCGTATCACGAGGCCCTTTCGTC
(78) The sequence of Bacteroides fragilis NCTC 9343 wcfW CDS DNA is set for the below (SEQ ID NO: 4):
(79) TABLE-US-00004 ATGATTGTATCATCTTTGCGAGGAGGATTGGGGAATCAAATGTTTATTTA CGCTATGGTGAAGGCCATGGCATTAAGAAACAATGTACCATTCGCTTTTA ATTTGACTACTGATTTTGCAAATGATGAAGTTTATAAAAGGAAACTTTTA TTATCATATTTTGCATTAGACTTGCCTGAAAATAAAAAATTAACATTTGA TTTTTCATATGGGAATTATTATAGAAGGCTAAGTCGTAATTTAGGTTGTC ATATACTTCATCCATCATATCGTTATATTTGCGAAGAGCGCCCTCCCCAC TTTGAATCAAGGTTAATTAGTTCTAAGATTACAAATGCTTTTCTGGAAGG ATATTGGCAGTCAGAAAAATATTTTCTTGATTATAAACAAGAGATAAAAG AGGACTTTGTAATACAAAAAAAATTAGAATACACATCGTATTTGGAATTG GAAGAAATAAAATTGCTAGATAAGAATGCCATAATGATTGGGGTTAGACG GTATCAGGAAAGTGATGTAGCTCCTGGTGGAGTGTTAGAAGATGATTACT ATAAATGTGCTATGGATATTATGGCATCAAAAGTTACTTCTCCTGTTTTC TTTTGTTTTTCACAAGATTTAGAATGGGTTGAAAAACATCTAGCGGGAAA ATATCCTGTTCGTTTGATAAGTAAAAAGGAGGATGATAGTGGTACTATAG ATGATATGTTTCTAATGATGCATTTTCGTAATTATATAATATCGAATAGC TCTTTTTACTGGTGGGGAGCATGGCTTTCGAAATATGATGATAAGCTGGT GATTGCTCCAGGTAATTTTATAAATAAGGATTCTGTACCAGAATCTTGGT TTAAATTGAATGTAAGATAA
(80) The sequence of pG171 is set forth below (SEQ ID NO: 5):
(81) TABLE-US-00005 TCGCGCGTTTCGGTGATGACGGTGAAAACCTCTGACACATGCAGCTCCCGGAGACGGTCACAGCTTGTCTGTAAG CGGATGCCGGGAGCAGACAAGCCCGTCAGGGCGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCTGGCTTAACTATG CGGCATCAGAGCAGATTGTACTGAGAGTGCACCATATATGCGGTGTGAAATACCGCACAGATGCGTAAGGAGAAA ATACCGCATCAGGCGCCTCCTCAACCTGTATATTCGTAAACCACGCCCAATGGGAGCTGTCTCAGGTTTGTTCCT GATTGGTTACGGCGCGTTTCGCATCATTGTTGAGTTTTTCCGCCAGCCCGACGCGCAGTTTACCGGTGCCTGGGT GCAGTACATCAGCATGGGGCAAATTCTTTCCATCCCGATGATTGTCGCGGGTGTGATCATGATGGTCTGGGCATA TCGTCGCAGCCCACAGCAACACGTTTCCTGAGGAACCATGAAACAGTATTTAGAACTGATGCAAAAAGTGCTCGA CGAAGGCACACAGAAAAACGACCGTACCGGAACCGGAACGCTTTCCATTTTTGGTCATCAGATGCGTTTTAACCT GCAAGATGGATTCCCGCTGGTGACAACTAAACGTTGCCACCTGCGTTCCATCATCCATGAACTGCTGTGGTTTCT GCAGGGCGACACTAACATTGCTTATCTACACGAAAACAATGTCACCATCTGGGACGAATGGGCCGATGAAAACGG CGACCTCGGGCCAGTGTATGGTAAACAGTGGCGCGCCTGGCCAACGCCAGATGGTCGTCATATTGACCAGATCAC TACGGTACTGAACCAGCTGAAAAACGACCCGGATTCGCGCCGCATTATTGTTTCAGCGTGGAACGTAGGCGAACT GGATAAAATGGCGCTGGCACCGTGCCATGCATTCTTCCAGTTCTATGTGGCAGACGGCAAACTCTCTTGCCAGCT TTATCAGCGCTCCTGTGACGTCTTCCTCGGCCTGCCGTTCAACATTGCCAGCTACGCGTTATTGGTGCATATGAT GGCGCAGCAGTGCGATCTGGAAGTGGGTGATTTTGTCTGGACCGGTGGCGACACGCATCTGTACAGCAACCATAT GGATCAAACTCATCTGCAATTAAGCCGCGAACCGCGTCCGCTGCCGAAGTTGATTATCAAACGTAAACCCGAATC CATCTTCGACTACCGTTTCGAAGACTTTGAGATTGAAGGCTACGATCCGCATCCGGGCATTAAAGCGCCGGTGGC TATCTAATTACGAAACATCCTGCCAGAGCCGACGCCAGTGTGCGTCGGTTTTTTTACCCTCCGTTAAATTCTTCG AGACGCCTTCCCGAAGGCGCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTT CGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGT CACGACGTTGTAAAACGACGGCCAGTGCCAAGCTTTCTTTAATGAAGCAGGGCATCAGGACGGTATCTTTGTGGA GAAAGCAGAGTAATCTTATTCAGCCTGACTGGTGGGAAACCACCAGTCAGAATGTGTTAGCGCATGTTGACAAAA ATACCATTAGTCACATTATCCGTCAGTCGGACGACATGGTAGATAACCTGTTTATTATGCGTTTTGATCTTACGT TTAATATTACCTTTATGCGATGAAACGGTCTTGGCTTTGATATTCATTTGGTCAGAGATTTGAATGGTTCCCTGA CCTGCCATCCACATTCGCAACATACTCGATTCGGTTCGGCTCAATGATAACGTCGGCATATTTAAAAACGAGGTT ATCGTTGTCTCTTTTTTCAGAATATCGCCAAGGATATCGTCGAGAGATTCCGGTTTAATCGATTTAGAACTGATC AATAAATTTTTTCTGACCAATAGATATTCATCAAAATGAACATTGGCAATTGCCATAAAAACGATAAATAACGTA TTGGGATGTTGATTAATGATGAGCTTGATACGCTGACTGTTAGAAGCATCGTGGATGAAACAGTCCTCATTAATA AACACCACTGAAGGGCGCTGTGAATCACAAGCTATGGCAAGGTCATCAACGGTTTCAATGTCGTTGATTTCTCTT TTTTTAACCCCTCTACTCAACAGATACCCGGTTAAACCTAGTCGGGTGTAACTACATAAATCCATAATAATCGTT GACATGGCATACCCTCACTCAATGCGTAACGATAATTCCCCTTACCTGAATATTTCATCATGACTAAACGGAACA ACATGGGTCACCTAATGCGCCACTCTCGCGATTTTTCAGGCGGACTTACTATCCCGTAAAGTGTTGTATAATTTG CCTGGAATTGTCTTAAAGTAAAGTAAATGTTGCGATATGTGAGTGAGCTTAAAACAAATATTTCGCTGCAGGAGT ATCCTGGAAGATGTTCGTAGAAGCTTACTGCTCACAAGAAAAAAGGCACGTCATCTGACGTGCCTTTTTTATTTG TACTACCCTGTACGATTACTGCAGCTCGAGTTTAATTCAAATCTTCTTCAGAAATCAATTTTTGTTCAGCGTTAT ACTTTTGGGATTTTACCTCAAAATGGGATTCTATTTTCACCCACTCCTTACAAAGGATATTCTCATGCCCAAAAA GCCAGTGTTTGGGGCCAATAATGATTTTTTCTGGATTTTCTATCAAATAGGCCGCCCACCAGCTATAAGTGCTAT TAGCGATAATGCCATGCTGACAAGATTGCATGAGCAGCATGTCCCAATACGCCTCTTCTTCTTTATCCCTAGTGG TCATGTCCATAAAAGGGTAGCCAAGATCAAGATTTTGCGTGAATTCTAAGTCTTCGCAAAACACAAAAAGCTCCA TGTTTGGCACGCGCTTTGCCATATACTCAAGCGCCTTTTTTTGATAGTCAATACCAAGCTGACAGCCAATCCCCA CATAATCCCCTCTTCTTATATGCACAAACACGCTGTTTTTAGCGGCTAAAATCAAAGAAAGCTTGCACTGATATT CTTCCTCTTTTTTATTATTATTCTTATTATTTTCGGGTGGTGGTGGTAGAGTGAAGGTTTGCTTGATTAAAGGGG ATATAGCATCAAAGTATCGTGGATCTTGGAAATAGCCAAAAAAATAAGTCAAGCGGCTTGGCTTTAGCAATTTAG GCTCGTATTCAAAAACGATTTCTTGACTCACCCTATCAAATCCCATGCATTTGAGCGCGTCTCTTACTAGCTTGG GGAGGTGTTGCATTTTAGCTATAGCGATTTCTTTCGCGCTCGCATAGGGCAAATCAATAGGGAAAAGTTCTAATT GCATTTTCCTATCGCTCCAATCAAAAGAAGTGATATCTAACAGCACAGGCGTATTAGAGTGTTTTTGCAAACTTT TAGCGAAAGCGTATTGAAACATTTGATTCCCAAGCCCTCCGCAAATTTGCACCACCTTAAAAGCCATATGTATAT CTCCTTCTTGAATTCTAAAAATTGATTGAATGTATGCAAATAAATGCATACACCATAGGTGTGGTTTAATTTGAT GCCCTTTTTCAGGGCTGGAATGTGTAAGAGCGGGGTTATTTATGCTGTTGTTTTTTTGTTACTCGGGAAGGGCTT TACCTCTTCCGCATAAACGCTTCCATCAGCGTTTATAGTTAAAAAAATCTTTCGGAACTGGTTTTGCGCTTACCC CAACCAACAGGGGATTTGCTGCTTTCCATTGAGCCTGTTTCTCTGCGCGACGTTCGCGGCGGCGTGTTTGTGCAT CCATCTGGATTCTCCTGTCAGTTAGCTTTGGTGGTGTGTGGCAGTTGTAGTCCTGAACGAAAACCCCCCGCGATT GGCACATTGGCAGCTAATCCGGAATCGCACTTACGGCCAATGCTTCGTTTCGTATCACACACCCCAAAGCCTTCT GCTTTGAATGCTGCCCTTCTTCAGGGCTTAATTTTTAAGAGCGTCACCTTCATGGTGGTCAGTGCGTCCTGCTGA TGTGCTCAGTATCACCGCCAGTGGTATTTATGTCAACACCGCCAGAGATAATTTATCACCGCAGATGGTTATCTG TATGTTTTTTATATGAATTTATTTTTTGCAGGGGGGCATTGTTTGGTAGGTGAGAGATCAATTCTGCATTAATGA ATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGC TCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGAT AACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTT TTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGA CTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGA TACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTG TAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAAC TATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGA GCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTT GGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACC GCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTG ATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAA AGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGG TCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCC TGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGA GACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCT GCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGT TTGCGCAACGTTGTTGCCATTGCTACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCC GGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCG ATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTC ATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGA CCGAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATT GGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGT GCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCC GCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATT TATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGC ACATTTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACCATTATTATCATGACATTAACCTATAAAAATAGGCGT ATCACGAGGCCCTTTCGTC
(82) The sequence of pG180 is set forth below (SEQ ID NO: 6):
(83) TABLE-US-00006 TCGCGCGTTTCGGTGATGACGGTGAAAACCTCTGACACATGCAGCTCCCGGAGACGGTCACAGCTTGTCTGTAAG CGGATGCCGGGAGCAGACAAGCCCGTCAGGGCGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCTGGCTTAACTATG CGGCATCAGAGCAGATTGTACTGAGAGTGCACCATATATGCGGTGTGAAATACCGCACAGATGCGTAAGGAGAAA ATACCGCATCAGGCGCCTCCTCAACCTGTATATTCGTAAACCACGCCCAATGGGAGCTGTCTCAGGTTTGTTCCT GATTGGTTACGGCGCGTTTCGCATCATTGTTGAGTTTTTCCGCCAGCCCGACGCGCAGTTTACCGGTGCCTGGGT GCAGTACATCAGCATGGGGCAAATTCTTTCCATCCCGATGATTGTCGCGGGTGTGATCATGATGGTCTGGGCATA TCGTCGCAGCCCACAGCAACACGTTTCCTGAGGAACCATGAAACAGTATTTAGAACTGATGCAAAAAGTGCTCGA CGAAGGCACACAGAAAAACGACCGTACCGGAACCGGAACGCTTTCCATTTTTGGTCATCAGATGCGTTTTAACCT GCAAGATGGATTCCCGCTGGTGACAACTAAACGTTGCCACCTGCGTTCCATCATCCATGAACTGCTGTGGTTTCT GCAGGGCGACACTAACATTGCTTATCTACACGAAAACAATGTCACCATCTGGGACGAATGGGCCGATGAAAACGG CGACCTCGGGCCAGTGTATGGTAAACAGTGGCGCGCCTGGCCAACGCCAGATGGTCGTCATATTGACCAGATCAC TACGGTACTGAACCAGCTGAAAAACGACCCGGATTCGCGCCGCATTATTGTTTCAGCGTGGAACGTAGGCGAACT GGATAAAATGGCGCTGGCACCGTGCCATGCATTCTTCCAGTTCTATGTGGCAGACGGCAAACTCTCTTGCCAGCT TTATCAGCGCTCCTGTGACGTCTTCCTCGGCCTGCCGTTCAACATTGCCAGCTACGCGTTATTGGTGCATATGAT GGCGCAGCAGTGCGATCTGGAAGTGGGTGATTTTGTCTGGACCGGTGGCGACACGCATCTGTACAGCAACCATAT GGATCAAACTCATCTGCAATTAAGCCGCGAACCGCGTCCGCTGCCGAAGTTGATTATCAAACGTAAACCCGAATC CATCTTCGACTACCGTTTCGAAGACTTTGAGATTGAAGGCTACGATCCGCATCCGGGCATTAAAGCGCCGGTGGC TATCTAATTACGAAACATCCTGCCAGAGCCGACGCCAGTGTGCGTCGGTTTTTTTACCCTCCGTTAAATTCTTCG AGACGCCTTCCCGAAGGCGCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTT CGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGT CACGACGTTGTAAAACGACGGCCAGTGCCAAGCTTTCTTTAATGAAGCAGGGCATCAGGACGGTATCTTTGTGGA GAAAGCAGAGTAATCTTATTCAGCCTGACTGGTGGGAAACCACCAGTCAGAATGTGTTAGCGCATGTTGACAAAA ATACCATTAGTCACATTATCCGTCAGTCGGACGACATGGTAGATAACCTGTTTATTATGCGTTTTGATCTTACGT TTAATATTACCTTTATGCGATGAAACGGTCTTGGCTTTGATATTCATTTGGTCAGAGATTTGAATGGTTCCCTGA CCTGCCATCCACATTCGCAACATACTCGATTCGGTTCGGCTCAATGATAACGTCGGCATATTTAAAAACGAGGTT ATCGTTGTCTCTTTTTTCAGAATATCGCCAAGGATATCGTCGAGAGATTCCGGTTTAATCGATTTAGAACTGATC AATAAATTTTTTCTGACCAATAGATATTCATCAAAATGAACATTGGCAATTGCCATAAAAACGATAAATAACGTA TTGGGATGTTGATTAATGATGAGCTTGATACGCTGACTGTTAGAAGCATCGTGGATGAAACAGTCCTCATTAATA AACACCACTGAAGGGCGCTGTGAATCACAAGCTATGGCAAGGTCATCAACGGTTTCAATGTCGTTGATTTCTCTT TTTTTAACCCCTCTACTCAACAGATACCCGGTTAAACCTAGTCGGGTGTAACTACATAAATCCATAATAATCGTT GACATGGCATACCCTCACTCAATGCGTAACGATAATTCCCCTTACCTGAATATTTCATCATGACTAAACGGAACA ACATGGGTCACCTAATGCGCCACTCTCGCGATTTTTCAGGCGGACTTACTATCCCGTAAAGTGTTGTATAATTTG CCTGGAATTGTCTTAAAGTAAAGTAAATGTTGCGATATGTGAGTGAGCTTAAAACAAATATTTCGCTGCAGGAGT ATCCTGGAAGATGTTCGTAGAAGCTTACTGCTCACAAGAAAAAAGGCACGTCATCTGACGTGCCTTTTTTATTTG TACTACCCTGTACGATTACTGCAGCTCGAGTTTAATTCAAATCTTCTTCAGAAATCAATTTTTGTTCTCTTACAT TCAATTTAAACCAAGATTCTGGTACAGAATCCTTATTTATAAAATTACCTGGAGCAATCACCAGCTTATCATCAT ATTTCGAAAGCCATGCTCCCCACCAGTAAAAAGAGCTATTCGATATTATATAATTACGAAAATGCATCATTAGAA ACATATCATCTATAGTACCACTATCATCCTCCTTTTTACTTATCAAACGAACAGGATATTTTCCCGCTAGATGTT TTTCAACCCATTCTAAATCTTGTGAAAAACAAAAGAAAACAGGAGAAGTAACTTTTGATGCCATAATATCCATAG CACATTTATAGTAATCATCTTCTAACACTCCACCAGGAGCTACATCACTTTCCTGATACCGTCTAACCCCAATCA TTATGGCATTCTTATCTAGCAATTTTATTTCTTCCAATTCCAAATACGATGTGTATTCTAATTTTTTTTGTATTA CAAAGTCCTCTTTTATCTCTTGTTTATAATCAAGAAAATATTTTTCTGACTGCCAATATCCTTCCAGAAAAGCAT TTGTAATCTTAGAACTAATTAACCTTGATTCAAAGTGGGGAGGGCGCTCTTCGCAAATATAACGATATGATGGAT GAAGTATATGACAACCTAAATTACGACTTAGCCTTCTATAATAATTCCCATATGAAAAATCAAATGTTAATTTTT TATTTTCAGGCAAGTCTAATGCAAAATATGATAATAAAAGTTTCCTTTTATAAACTTCATCATTTGCAAAATCAG TAGTCAAATTAAAAGCGAATGGTACATTGTTTCTTAATGCCATGGCCTTCACCATAGCGTAAATAAACATTTGAT TCCCCAATCCTCCTCGCAAAGATGATACAATCATATGTATATCTCCTTCTTGTCTAGAATTCTAAAAATTGATTG AATGTATGCAAATAAATGCATACACCATAGGTGTGGTTTAATTTGATGCCCTTTTTCAGGGCTGGAATGTGTAAG AGCGGGGTTATTTATGCTGTTGTTTTTTTGTTACTCGGGAAGGGCTTTACCTCTTCCGCATAAACGCTTCCATCA GCGTTTATAGTTAAAAAAATCTTTCGGAACTGGTTTTGCGCTTACCCCAACCAACAGGGGATTTGCTGCTTTCCA TTGAGCCTGTTTCTCTGCGCGACGTTCGCGGCGGCGTGTTTGTGCATCCATCTGGATTCTCCTGTCAGTTAGCTT TGGTGGTGTGTGGCAGTTGTAGTCCTGAACGAAAACCCCCCGCGATTGGCACATTGGCAGCTAATCCGGAATCGC ACTTACGGCCAATGCTTCGTTTCGTATCACACACCCCAAAGCCTTCTGCTTTGAATGCTGCCCTTCTTCAGGGCT TAATTTTTAAGAGCGTCACCTTCATGGTGGTCAGTGCGTCCTGCTGATGTGCTCAGTATCACCGCCAGTGGTATT TATGTCAACACCGCCAGAGATAATTTATCACCGCAGATGGTTATCTGTATGTTTTTTATATGAATTTATTTTTTG CAGGGGGGCATTGTTTGGTAGGTGAGAGATCAATTCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTT TGCGTATTGGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTAT CAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAG GCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGC ATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTG GAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAA GCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTG TGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGAC ACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGT TCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTA CCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCA AGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGT GGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATT AAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTG AGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGA TACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTAT CAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTA TTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAG GCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACAT GATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAG TGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGA CTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATAC GGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCT CAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTA CTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGA AATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGAT ACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACG TCTAAGAAACCATTATTATCATGACATTAACCTATAAAAATAGGCGTATCACGAGGCCCTTTCGTC
(84) The sequence of W3110 deltalon::Kan::lacZwithRBS Escherichia coli str. K-12 substr. W3110 is set forth below (SEQ ID NO: 7):
(85) TABLE-US-00007 GTCCATGGAAGACGTCGAAAAAGTGGTTATCGACGAGTCGGTAATTGATGGTCAAAGCAAACCGTTGCTGATTTA TGGCAAGCCGGAAGCGCAACAGGCATCTGGTGAATAATTAACCATTCCCATACAATTAGTTAACCAAAAAGGGGG GATTTTATCTCCCCTTTAATTTTTCCTCTATTCTCGGCGTTGAATGTGGGGGAAACATCCCCATATACTGACGTA CATGTTAATAGATGGCGTGAAGCACAGTCGTGTCATCTGATTACCTGGCGGAAATTAAACTAAGAGAGAGCTCTA TGATTCCGGGGATCCGTCGACCTGCAGTTCGAAGTTCCTATTCTCTAGAAAGTATAGGAACTTCAGAGCGCTTTT GAAGCTCACGCTGCCGCAAGCACTCAGGGCGCAAGGGCTGCTAAAGGAAGCGGAACACGTAGAAAGCCAGTCCGC AGAAACGGTGCTGACCCCGGATGAATGTCAGCTACTGGGCTATCTGGACAAGGGAAAACGCAAGCGCAAAGAGAA AGCAGGTAGCTTGCAGTGGGCTTACATGGCGATAGCTAGACTGGGCGGTTTTATGGACAGCAAGCGAACCGGAAT TGCCAGCTGGGGCGCCCTCTGGTAAGGTTGGGAAGCCCTGCAAAGTAAACTGGATGGCTTTCTTGCCGCCAAGGA TCTGATGGCGCAGGGGATCAAGATCTGATCAAGAGACAGGATGAGGATCGTTTCGCATGATTGAACAAGATGGAT TGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGACTGGGCACAACAGACAATCGGCTGCT CTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAAGACCGACCTGTCCGGTGCCC TGAATGAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCCTTGCGCAGCTGTGCTCG ACGTTGTCACTGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCCTGTCATCTCACC TTGCTCCTGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTACCTGCC CATTCGACCACCAAGCGAAACATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGATG ATCTGGACGAAGAGCATCAGGGGCTCGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCG AGGATCTCGTCGTGACCCATGGCGATGCCTGCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCA TCGACTGTGGCCGGCTGGGTGTGGCGGACCGCTATCAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGC TTGGCGGCGAATGGGCTGACCGCTTCCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCT ATCGCCTTCTTGACGAGTTCTTCTAATAAGGGGATCTTGAAGTTCCTATTCCGAAGTTCCTATTCTCTAGAAAGT ATAGGAACTTCGAAGCAGCTCCAGCCTACATAAAGCGGCCGCTTATTTTTGACACCAGACCAACTGGTAATGGTA GCGACCGGCGCTCAGCTGGAATTCCGCCGATACTGACGGGCTCCAGGAGTCGTCGCCACCAATCCCCATATGGAA ACCGTCGATATTCAGCCATGTGCCTTCTTCCGCGTGCAGCAGATGGCGATGGCTGGTTTCCATCAGTTGCTGTTG ACTGTAGCGGCTGATGTTGAACTGGAAGTCGCCGCGCCACTGGTGTGGGCCATAATTCAATTCGCGCGTCCCGCA GCGCAGACCGTTTTCGCTCGGGAAGACGTACGGGGTATACATGTCTGACAATGGCAGATCCCAGCGGTCAAAACA GGCGGCAGTAAGGCGGTCGGGATAGTTTTCTTGCGGCCCTAATCCGAGCCAGTTTACCCGCTCTGCTACCTGCGC CAGCTGGCAGTTCAGGCCAATCCGCGCCGGATGCGGTGTATCGCTCGCCACTTCAACATCAACGGTAATCGCCAT TTGACCACTACCATCAATCCGGTAGGTTTTCCGGCTGATAAATAAGGTTTTCCCCTGATGCTGCCACGCGTGAGC GGTCGTAATCAGCACCGCATCAGCAAGTGTATCTGCCGTGCACTGCAACAACGCTGCTTCGGCCTGGTAATGGCC CGCCGCCTTCCAGCGTTCGACCCAGGCGTTAGGGTCAATGCGGGTCGCTTCACTTACGCCAATGTCGTTATCCAG CGGTGCACGGGTGAACTGATCGCGCAGCGGCGTCAGCAGTTGTTTTTTATCGCCAATCCACATCTGTGAAAGAAA GCCTGACTGGCGGTTAAATTGCCAACGCTTATTACCCAGCTCGATGCAAAAATCCATTTCGCTGGTGGTCAGATG CGGGATGGCGTGGGACGCGGCGGGGAGCGTCACACTGAGGTTTTCCGCCAGACGCCACTGCTGCCAGGCGCTGAT GTGCCCGGCTTCTGACCATGCGGTCGCGTTCGGTTGCACTACGCGTACTGTGAGCCAGAGTTGCCCGGCGCTCTC CGGCTGCGGTAGTTCAGGCAGTTCAATCAACTGTTTACCTTGTGGAGCGACATCCAGAGGCACTTCACCGCTTGC CAGCGGCTTACCATCCAGCGCCACCATCCAGTGCAGGAGCTCGTTATCGCTATGACGGAACAGGTATTCGCTGGT CACTTCGATGGTTTGCCCGGATAAACGGAACTGGAAAAACTGCTGCTGGTGTTTTGCTTCCGTCAGCGCTGGATG CGGCGTGCGGTCGGCAAAGACCAGACCGTTCATACAGAACTGGCGATCGTTCGGCGTATCGCCAAAATCACCGCC GTAAGCCGACCACGGGTTGCCGTTTTCATCATATTTAATCAGCGACTGATCCACCCAGTCCCAGACGAAGCCGCC CTGTAAACGGGGATACTGACGAAACGCCTGCCAGTATTTAGCGAAACCGCCAAGACTGTTACCCATCGCGTGGGC GTATTCGCAAAGGATCAGCGGGCGCGTCTCTCCAGGTAGCGAAAGCCATTTTTTGATGGACCATTTCGGCACAGC CGGGAAGGGCTGGTCTTCATCCACGCGCGCGTACATCGGGCAAATAATATCGGTGGCCGTGGTGTCGGCTCCGCC GCCTTCATACTGCACCGGGCGGGAAGGATCGACAGATTTGATCCAGCGATACAGCGCGTCGTGATTAGCGCCGTG GCCTGATTCATTCCCCAGCGACCAGATGATCACACTCGGGTGATTACGATCGCGCTGCACCATTCGCGTTACGCG TTCGCTCATCGCCGGTAGCCAGCGCGGATCATCGGTCAGACGATTCATTGGCACCATGCCGTGGGTTTCAATATT GGCTTCATCCACCACATACAGGCCGTAGCGGTCGCACAGCGTGTACCACAGCGGATGGTTCGGATAATGCGAACA GCGCACGGCGTTAAAGTTGTTCTGCTTCATCAGCAGGATATCCTGCACCATCGTCTGCTCATCCATGACCTGACC ATGCAGAGGATGATGCTCGTGACGGTTAACGCCTCGAATCAGCAACGGCTTGCCGTTCAGCAGCAGCAGACCATT TTCAATCCGCACCTCGCGGAAACCGACATCGCAGGCTTCTGCTTCAATCAGCGTGCCGTCGGCGGTGTGCAGTTC AACCACCGCACGATAGAGATTCGGGATTTCGGCGCTCCACAGTTTCGGGTTTTCGACGTTCAGACGTAGTGTGAC GCGATCGGCATAACCACCACGCTCATCGATAATTTCACCGCCGAAAGGCGCGGTGCCGCTGGCGACCTGCGTTTC ACCCTGCCATAAAGAAACTGTTACCCGTAGGTAGTCACGCAACTCGCCGCACATCTGAACTTCAGCCTCCAGTAC AGCGCGGCTGAAATCATCATTAAAGCGAGTGGCAACATGGAAATCGCTGATTTGTGTAGTCGGTTTATGCAGCAA CGAGACGTCACGGAAAATGCCGCTCATCCGCCACATATCCTGATCTTCCAGATAACTGCCGTCACTCCAGCGCAG CACCATCACCGCGAGGCGGTTTTCTCCGGCGCGTAAAAATGCGCTCAGGTCAAATTCAGACGGCAAACGACTGTC CTGGCCGTAACCGACCCAGCGCCCGTTGCACCACAGATGAAACGCCGAGTTAACGCCATCAAAAATAATTCGCGT CTGGCCTTCCTGTAGCCAGCTTTCATCAACATTAAATGTGAGCGAGTAACAACCCGTCGGATTCTCCGTGGGAAC AAACGGCGGATTGACCGTAATGGGATAGGTCACGTTGGTGTAGATGGGCGCATCGTAACCGTGCATCTGCCAGTT TGAGGGGACGACGACAGTATCGGCCTCAGGAAGATCGCACTCCAGCCAGCTTTCCGGCACCGCTTCTGGTGCCGG AAACCAGGCAAAGCGCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCT ATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACG ACGTTGTAAAACGACGGCCAGTGAATCCGTAATCATGGTCATAGTAGGTTTCCTCAGGTTGTGACTGCAAAATAG TGACCTCGCGCAAAATGCACTAATAAAAACAGGGCTGGCAGGCTAATTCGGGCTTGCCAGCCTTTTTTTGTCTCG CTAAGTTAGATGGCGGATCGGGCTTGCCCTTATTAAGGGGTGTTGTAAGGGGATGGCTGGCCTGATATAACTGCT GCGCGTTCGTACCTTGAAGGATTCAAGTGCGATATAAATTATAAAGAGGAAGAGAAGAGTGAATAAATCTCAATT GATCGACAAGATTGCTGCAGGGGCTGATATCTCTAAAGCTGCGGCTGGCCGTGCGTTAGATGCTATTATTGCTTC CGTAACTGAATCTCTGAAAGAAGG
(86) The sequence of pG186 is set forth below (SEQ ID NO: 8):
(87) TABLE-US-00008 TCGCGCGTTTCGGTGATGACGGTGAAAACCTCTGACACATGCAGCTCCCGGAGACGGTCACAGCTTGTCTGTAAG CGGATGCCGGGAGCAGACAAGCCCGTCAGGGCGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCTGGCTTAACTATG CGGCATCAGAGCAGATTGTACTGAGAGTGCACCATATATGCGGTGTGAAATACCGCACAGATGCGTAAGGAGAAA ATACCGCATCAGGCGCCTCCTCAACCTGTATATTCGTAAACCACGCCCAATGGGAGCTGTCTCAGGTTTGTTCCT GATTGGTTACGGCGCGTTTCGCATCATTGTTGAGTTTTTCCGCCAGCCCGACGCGCAGTTTACCGGTGCCTGGGT GCAGTACATCAGCATGGGGCAAATTCTTTCCATCCCGATGATTGTCGCGGGTGTGATCATGATGGTCTGGGCATA TCGTCGCAGCCCACAGCAACACGTTTCCTGAGGAACCATGAAACAGTATTTAGAACTGATGCAAAAAGTGCTCGA CGAAGGCACACAGAAAAACGACCGTACCGGAACCGGAACGCTTTCCATTTTTGGTCATCAGATGCGTTTTAACCT GCAAGATGGATTCCCGCTGGTGACAACTAAACGTTGCCACCTGCGTTCCATCATCCATGAACTGCTGTGGTTTCT GCAGGGCGACACTAACATTGCTTATCTACACGAAAACAATGTCACCATCTGGGACGAATGGGCCGATGAAAACGG CGACCTCGGGCCAGTGTATGGTAAACAGTGGCGCGCCTGGCCAACGCCAGATGGTCGTCATATTGACCAGATCAC TACGGTACTGAACCAGCTGAAAAACGACCCGGATTCGCGCCGCATTATTGTTTCAGCGTGGAACGTAGGCGAACT GGATAAAATGGCGCTGGCACCGTGCCATGCATTCTTCCAGTTCTATGTGGCAGACGGCAAACTCTCTTGCCAGCT TTATCAGCGCTCCTGTGACGTCTTCCTCGGCCTGCCGTTCAACATTGCCAGCTACGCGTTATTGGTGCATATGAT GGCGCAGCAGTGCGATCTGGAAGTGGGTGATTTTGTCTGGACCGGTGGCGACACGCATCTGTACAGCAACCATAT GGATCAAACTCATCTGCAATTAAGCCGCGAACCGCGTCCGCTGCCGAAGTTGATTATCAAACGTAAACCCGAATC CATCTTCGACTACCGTTTCGAAGACTTTGAGATTGAAGGCTACGATCCGCATCCGGGCATTAAAGCGCCGGTGGC TATCTAATTACGAAACATCCTGCCAGAGCCGACGCCAGTGTGCGTCGGTTTTTTTACCCTCCGTTAAATTCTTCG AGACGCCTTCCCGAAGGCGCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTT CGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGT CACGACGTTGTAAAACGACGGCCAGTGCCAAGCTTTCTTTAATGAAGCAGGGCATCAGGACGGTATCTTTGTGGA GAAAGCAGAGTAATCTTATTCAGCCTGACTGGTGGGAAACCACCAGTCAGAATGTGTTAGCGCATGTTGACAAAA ATACCATTAGTCACATTATCCGTCAGTCGGACGACATGGTAGATAACCTGTTTATTATGCGTTTTGATCTTACGT TTAATATTACCTTTATGCGATGAAACGGTCTTGGCTTTGATATTCATTTGGTCAGAGATTTGAATGGTTCCCTGA CCTGCCATCCACATTCGCAACATACTCGATTCGGTTCGGCTCAATGATAACGTCGGCATATTTAAAAACGAGGTT ATCGTTGTCTCTTTTTTCAGAATATCGCCAAGGATATCGTCGAGAGATTCCGGTTTAATCGATTTAGAACTGATC AATAAATTTTTTCTGACCAATAGATATTCATCAAAATGAACATTGGCAATTGCCATAAAAACGATAAATAACGTA TTGGGATGTTGATTAATGATGAGCTTGATACGCTGACTGTTAGAAGCATCGTGGATGAAACAGTCCTCATTAATA AACACCACTGAAGGGCGCTGTGAATCACAAGCTATGGCAAGGTCATCAACGGTTTCAATGTCGTTGATTTCTCTT TTTTTAACCCCTCTACTCAACAGATACCCGGTTAAACCTAGTCGGGTGTAACTACATAAATCCATAATAATCGTT GACATGGCATACCCTCACTCAATGCGTAACGATAATTCCCCTTACCTGAATATTTCATCATGACTAAACGGAACA ACATGGGTCACCTAATGCGCCACTCTCGCGATTTTTCAGGCGGACTTACTATCCCGTAAAGTGTTGTATAATTTG CCTGGAATTGTCTTAAAGTAAAGTAAATGTTGCGATATGTGAGTGAGCTTAAAACAAATATTTCGCTGCAGGAGT ATCCTGGAAGATGTTCGTAGAAGCTTACTGCTCACAAGAAAAAAGGCACGTCATCTGACGTGCCTTTTTTATTTG TACTACCCTGTACGATTACTGCAGCTCGAGTTAGTCTTTATCTGCCGGACTTAAGGTCACTGAAGAGAGATAATT CAGCAGGGCGATATCGTTCTCGACACCCAGCTTCATCATCGCAGATTTCTTCTGGCTACTGATGGTTTTAATACT GCGGTTCAGCTTTTTAGCGATCTCGGTCACCAGGAAGCCTTCCGCAAACAGGCGCAGAACTTCACTCTCTTTTGG CGAGAGACGCTTGTCACCGTAACCACCAGCACTGATTTTTTCCAACAGGCGAGAAACGCTTTCCGGGGTAAATTT CTTCCCTTTCTGCAGCGCGGCGAGAGCTTTCGGCAGATCGGTCGGTGCACCTTGTTTCAGCACGATCCCTTCGAT ATCCAGATCCAATACCGCACTAAGAATCGCCGGGTTGTTGTTCATAGTCAGAACAATGATCGACAGGCTTGGGAA ATGGCGCTTGATGTACTTGATTAAGGTAATGCCATCGCCGTACTTATCGCCAGGCATGGAGAGATCGGTAATCAA CACATGCGCATCCAGTTTCGGCAGGTTGTTGATCAGTGCTGTAGAGTCTTCAAATTCGCCGACAACATTCACCCA CTCAATTTGCTCAAGTGATTTGCGAATACCGAACAAGACTATCGGATGGTCATCGGCAATAATTACGTTCATATT GTTCATTGTATATCTCCTTCTTCTCGAGTTTAATTCAAATCTTCTTCAGAAATCAATTTTTGTTCAGCGTTATAC TTTTGGGATTTTACCTCAAAATGGGATTCTATTTTCACCCACTCCTTACAAAGGATATTCTCATGCCCAAAAAGC CAGTGTTTGGGGCCAATAATGATTTTTTCTGGATTTTCTATCAAATAGGCCGCCCACCAGCTATAAGTGCTATTA GCGATAATGCCATGCTGACAAGATTGCATGAGCAGCATGTCCCAATACGCCTCTTCTTCTTTATCCCTAGTGGTC ATGTCCATAAAAGGGTAGCCAAGATCAAGATTTTGCGTGAATTCTAAGTCTTCGCAAAACACAAAAAGCTCCATG TTTGGCACGCGCTTTGCCATATACTCAAGCGCCTTTTTTTGATAGTCAATACCAAGCTGACAGCCAATCCCCACA TAATCCCCTCTTCTTATATGCACAAACACGCTGTTTTTAGCGGCTAAAATCAAAGAAAGCTTGCACTGATATTCT TCCTCTTTTTTATTATTATTCTTATTATTTTCGGGTGGTGGTGGTAGAGTGAAGGTTTGCTTGATTAAAGGGGAT ATAGCATCAAAGTATCGTGGATCTTGGAAATAGCCAAAAAAATAAGTCAAGCGGCTTGGCTTTAGCAATTTAGGC TCGTATTCAAAAACGATTTCTTGACTCACCCTATCAAATCCCATGCATTTGAGCGCGTCTCTTACTAGCTTGGGG AGGTGTTGCATTTTAGCTATAGCGATTTCTTTCGCGCTCGCATAGGGCAAATCAATAGGGAAAAGTTCTAATTGC ATTTTCCTATCGCTCCAATCAAAAGAAGTGATATCTAACAGCACAGGCGTATTAGAGTGTTTTTGCAAACTTTTA GCGAAAGCGTATTGAAACATTTGATTCCCAAGCCCTCCGCAAATTTGCACCACCTTAAAAGCCATATGTATATCT CCTTCTTGAATTCTAAAAATTGATTGAATGTATGCAAATAAATGCATACACCATAGGTGTGGTTTAATTTGATGC CCTTTTTCAGGGCTGGAATGTGTAAGAGCGGGGTTATTTATGCTGTTGTTTTTTTGTTACTCGGGAAGGGCTTTA CCTCTTCCGCATAAACGCTTCCATCAGCGTTTATAGTTAAAAAAATCTTTCGGAACTGGTTTTGCGCTTACCCCA ACCAACAGGGGATTTGCTGCTTTCCATTGAGCCTGTTTCTCTGCGCGACGTTCGCGGCGGCGTGTTTGTGCATCC ATCTGGATTCTCCTGTCAGTTAGCTTTGGTGGTGTGTGGCAGTTGTAGTCCTGAACGAAAACCCCCCGCGATTGG CACATTGGCAGCTAATCCGGAATCGCACTTACGGCCAATGCTTCGTTTCGTATCACACACCCCAAAGCCTTCTGC TTTGAATGCTGCCCTTCTTCAGGGCTTAATTTTTAAGAGCGTCACCTTCATGGTGGTCAGTGCGTCCTGCTGATG TGCTCAGTATCACCGCCAGTGGTATTTATGTCAACACCGCCAGAGATAATTTATCACCGCAGATGGTTATCTGTA TGTTTTTTATATGAATTTATTTTTTGCAGGGGGGCATTGTTTGGTAGGTGAGAGATCAATTCTGCATTAATGAAT CGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTC GGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAA CGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTT CCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACT ATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATA CCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTA GGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTA TCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGC GAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGG TATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGC TGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGAT CTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAG GATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTC TGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTG ACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGA CCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGC AACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTT GCGCAACGTTGTTGCCATTGCTACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGG TTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGAT CGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCAT GCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACC GAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGG AAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGC ACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGC AAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTA TCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCAC ATTTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACCATTATTATCATGACATTAACCTATAAAAATAGGCGTAT CACGAGGCCCTTTCGTC
Other Embodiments
(88) While the invention has been described in conjunction with the detailed description thereof, the foregoing description is intended to illustrate and not limit the scope of the invention, which is defined by the scope of the appended claims. Other aspects, advantages, and modifications are within the scope of the following claims.
(89) The patent and scientific literature referred to herein establishes the knowledge that is available to those with skill in the art. All United States patents and published or unpublished United States patent applications cited herein are incorporated by reference. All published foreign patents and patent applications cited herein are hereby incorporated by reference.
(90) Genbank and NCBI submissions indicated by accession number cited herein are hereby incorporated by reference. All other published references, documents, manuscripts and scientific literature cited herein are hereby incorporated by reference.
(91) While this invention has been particularly shown and described with references to preferred embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the scope of the invention encompassed by the appended claims.