Universal Platform for Genetic Code Expansion
20200407705 ยท 2020-12-31
Inventors
Cpc classification
International classification
Abstract
Methods and compositions are described for selecting and identifying orthogonal aminoacyl synthetase-tRNA pairs and their use to incorporate unnatural amino acids in a site-specific manner in proteins. Specifically described is a novel E.coli tyrptophanyl synthetase-tRNA pair that functions as both an opal and amber suppressor and that incorporates tryptophan analogs into proteins.
Claims
1. A genetically engineered E. coli bacterial strain, wherein the endogenous E. coli tryptophanyl-tRNA synthetase (trpS) and the tryptophanyl tRNA (trpT) genes are deleted and functionally replaced with an orthogonal eukaryotic tryptophanyl-tRNA synthetase-tRNA gene pair.
2. The genetically engineered E. coli bacterial strain of claim 1 wherein the strain is ATMW1 and ATMW1 is derived from EcNR1 having the following genotype: EcNR1 pUltraG-ScW40CCA trpS::ZeoR trpT::GentR galK RED::galK.
3. The genetically engineered E. coli bacterial strain of claim 1 wherein the strain is derived from BL21(De3) having the following genotype: BL21 (DE3) pUltraG-ScW40CCA trpS::ZeoR trpT::GentR.
4. A method of producing orthogonal aminoacyl synthetase-tRNA pairs for incorporating unnatural amino acids into specific sites in proteins produced in E. coli and mammalian cells, the method comprising: a. functionally replacing an endogenous aminoacyl synthetase-tRNA pair in an E. coli host cell with a counter-part aminoacyl synthetase-tRNA pair orthogonal to E. coli, resulting in an altered translational machinery (ATM) E. coli and liberating the endogenous E. coli aminoacyl synthetase-tRNA pair, wherein the liberated E. coli aminoacyl synthetase-tRNA pair is orthogonal to the ATM E. coli and mammalian cells; b. reintroducing the liberated E. coli aminoacyl synthetase-tRNA pair into the ATM E. coli cell as a nonsense suppressor under conditions suitable for genetically selecting and identifying a variant E. coli aminoacyl synthetase that preferentially aminoacylates a tRNA with an unnatural amino acid over a natural amino acid, thereby producing an orthogonal aminoacyl synthetase-tRNA pair for incorporating unnatural amino acids into specific sites in proteins produced in E. coli and mammalian cells.
5. The method of claim 4 wherein the ATM E. coli is ATMW1 or BL21 (DE3) pUltraG-ScW40CCA trpS::ZeoR trpT::GentR.
6. A cell comprising a variant E. coli tryptophanyl-tRNA synthetase (EcTrp-RS), wherein the variant EcTrp-RS preferentially aminoacylates an E. coli tryptophanyl tRNA (Ec-tRNA.sup.Trp) with a tryptophan analog over the naturally-occurring tryptophan amino acid, wherein the variant EcTrp-RS comprises: the amino acid sequence of SEQ ID NO: 91, or an amino acid sequence with at least 90% sequence identity with the full-length SEQ ID NO:91, wherein the variant E. coli EcTrp-RS is mutated, relative to SEQ ID NO:91, such that the serine at position 8 is replaced with alanine; the valine at position 144 is replaced with either serine, glycine or alanine; and the valine at position 146 is replaced with either alanine, isoleucine or cysteine; and wherein the cell is E.coli and the E. coli strain is ATMW1 or BL21 (DE3) pUltraG-ScW40CCA trpS::ZeoR trpT::GentR.
7. The cell of claim 6, wherein the Ec-tRNA.sup.Trp comprises the polynucleotide sequence SEQ ID NO: 1 or SEQ ID NO: 3, or a homologous bacteria-derived tRNA comprising at least about 80% sequence identity with SEQ ID NO: 1 or 3, wherein the tRNA has an anti-codon loop comprising a sequence that specifically binds to a selector sequence of an mRNA selected from the group consisting of an amber codon or an opal codon.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0029] In the accompanying drawings, reference characters refer to the same parts throughout the different views. The drawings are not necessarily to scale; emphasis has instead been placed upon illustrating the principles of the invention. The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawings(s) will be provided by the Office upon request and payment of the necessary fee. Of the drawings:
[0030]
[0031]
[0032]
[0033]
[0034]
[0035]
[0036]
[0037]
[0038]
[0039]
[0040]
[0041]
[0042]
[0043]
[0044]
[0045]
[0046]
[0047]
[0048]
[0049]
[0050]
[0051]
[0052]
[0053]
[0054]
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
[0055] The above and other features of the invention including various novel details of construction and combinations of parts, and other advantages, will now be more particularly described with reference to the accompanying drawings and pointed out in the claims. It will be understood that the particular method and device embodying the invention are shown by way of illustration and not as a limitation of the invention. The principles and features of this invention may be employed in various and numerous embodiments without departing from the scope of the invention.
[0056] As used herein, the term and/or includes any and all combinations of one or more of the associated listed items. Further, the singular forms and the articles a, an and the are intended to include the plural forms as well, unless expressly stated otherwise. It will be further understood that the terms: includes, comprises, including and/or comprising, when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
[0057] The present disclosure provides a universal platform for genetic code expansion and involves a bacterial tryptophanyl-tRNA/tryptophanyl-tRNA synthetase pair for site-specific incorporation of tryptophan analogs into proteins expressed in E. Coli and eukaryotic cells. The current invention discloses an alternative strategy which takes advantage of an E. coli strain, where one of its native tRNA/aaRS pairs is functionally replaced with a eukaryotic/archaeal counterpart (
[0058] The feasibility of substituting a tRNA/aaRS pair in E. coli with an evolutionarily distant counterpart has previously been demonstrated. However, the resulting strains often exhibit growth defect, presumably due to the suboptimal interaction of the heterologous tRNA/aaRS with the translational apparatus of E. coli. Moreover, whether variants of the liberated tRNA/aaRS pair with altered substrate specificity can be developed using the corresponding ATM strain as the selection host remains unknown. Optimizing the performance of the substituting tRNA/aaRS pair in E. coli using directed evolution may allow it to functionally replace its endogenous counterpart more efficiently, circumventing the growth defect associated with such substitution. A number of heterologous tRNA/aaRS pairs have already been engineered for efficient suppression of nonsense codons in E. coli, providing a pool of potential candidates.
[0059] The endogenous tryptophanyl (Trp)-tRNA/aaRS pair in E. coli was targeted for functional replacement with a eukaryotic/archaeal counterpart. A S. cerevisiae derived tryptophanyl-tRNA/TrpRS pair has already been optimized in E. coli for highly efficient nonsense suppression, providing a great candidate for substituting its endogenous counterpart. Additionally, the unique active site of TrpRS should enable the introduction of structurally novel UAAs in the eukaryotic, as well as the bacterial genetic code. To verify if the E. coli Trp-tRNA/aaRS pair is indeed suitable for eukaryotic genetic code expansion, HEK293T cells were co-expressed with the TAG-suppressing E. coli Trp-tRNA (tRNAEcTrpCUA) and an enhanced green fluorescent protein (EGFP) reporter harboring a stop codon at a permissive site (EGFP-39-TAG), with or without the cognate synthetase. Robust EGFP expression was only observed in the presence of the EcTrpRS (
[0060] To allow the deletion of the endogenous tryptophanyl pair from the E. coli genome, a plasmid (pUltra-ScW40CCA) was first generated expressing the aforementioned engineered ScTrpRS/tRNAScTrpCCA pair derived from yeast (
[0061] With ATMW1 in hand, the possibility of reintroducing the liberated EcTrpRS/tRNAEcTrp pair for TAG suppression into this strain as a TAG suppressor was investigated. Surprisingly, when the tRNAEcTrpCUA was expressed in ATMW1 along with a superfolder GFP reporter (sfGFP-151-TAG), robust protein expression was observed in the absence of EcTrpRS, indicating cross-reactivity of this tRNA with another E. coli aaRS (
[0062] Circumventing this issue is envisioned by generating a TGA suppressor EctRNATrpUCA that avoids introducing the middle U-residue in the anticodon. Unfortunately, termination at the TGA stop codon in E. coli is often leakya result of non-specific suppression by the endogenous tryptophanyl tRNAmaking it a suboptimal choice for genetic code expansion. However, in the ATMW1 strainwhere the endogenous tryptophanyl pair was replaced with the yeast counterpartTGA did not exhibit such leaky behavior (
[0063] Establishment of an efficient, orthogonal opal suppressing EcTrpRS/tRNAEcTrpUCA pair in the ATMW1 E. coli opens up the possibility of altering its substrate specificity using the facile E. coli-based selection system. The existing reporter plasmids used for this double-sieved selection scheme, which enables either enrichment (positive selection) or depletion (negative selection) of aaRS variants based on their ability to charge its amber suppressing cognate tRNACUA, were mutated to generate variants that would allow selection based on opal suppression instead. Based on the crystal structure (
[0064] The aforementioned library of EcTrpRS variants was subjected to three rounds of selections (positive selection in the presence of 1 mM 5HTP, negative selection in the absence of the UAA, then another round of positive selection) in the ATMW1 strain, and 96 colonies from the surviving pool were individually screened for conditional survival under the positive selection conditions (40 g/mL chloramphenicol) in the presence of 5HTP. Sequence analysis of four of the most successful mutants show significant sequence-convergence, where Phe7 and Pro145 are conserved, Ser8 is mutated to Ala, Val144 changes to a small amino acid (Gly/Ser/Ala), and Val146 is mutated to different small/hydrophobic amino acids (
[0065] Table 1 below shows whole protein LCMS measurements.
TABLE-US-00001 Reporter UAA aaRS Expected mass Observed mass Note sfGFP- None None No expr. 27562 pBK system in 151-TAG ATMW1 sfGFP- W EcWRSwt 27620 27562* pBK/tac system in 151TAG ATMW1 (Glnx-rxtive) sfGFP- W EcWRSwt 27620 27621 pBK/tac system in 151-TGA ATMW1 sfGFP- 5HTP Hit 14 27636 27637 pBK/tac system in 151-TGA ATMW1 sfGFP- 5MTP Hit 14 27652 27652 pBK/tac system in 151-TGA ATMW1 sfGFP- 5BrW Hit 14 27700 27699 pBK/tac system in 151-TGA ATMW1 sfGFP- 5AzW Hit 14 27663 27660 pBK/tac system in 151-TGA ATMW1 sfGFP- 5PropW Hit 14 27676 27674 pBK/tac system in 151-TGA ATMW1 sfGFP- 5HTP Hit 9 27636 27637 pBK/tac system in 151-TGA ATMW1 sfGFP- 5AmW Hit 9 27636 27635 pBK/tac system in 151-TGA ATMW1 EGFPwt Y None 29683 29683 Pacbac1 system in HEK293T EGFP- BocK MbPy1 29748 29748 Pacbac1 system in 39-TAG HEK293T EGFP- AzK MbPy1 29761 29761 Pacbac1 system in 39-TAG HEK293T EGFP- W EcWRSwt 29707 29708 Pacbac1 system in 39-TAG HEK293T EGFP- 5HTP Hit 14 29723 29724 Pacbac1 system in 39-TAG HEK293T EGFP- 5MTP Hit 14 29736 29736 Pacbac1 system in 39-TAG HEK293T EGFP- 5BrW Hit 14 29786 29785 Pacbac1 system in 39-TAG HEK293T EGFP- 5AzW Hit 14 29748 29747 Pacbac1 system in 39-TAG HEK293T EGFP- 5PropW Hit 14 29761 29760 Pacbac1 system in 39-TAG HEK293T EGFP- 5HTP Hit 9 29723 29724 Pacbac1 system in 39-TAG HEK293T EGFP- 5AmW Hit 9 29722 29721 Pacbac1 system in 39-TAG HEK293T sfGFP- 5AzW Hit 14 28368 28366 Labeled with DBCO- 151-TAG biotin EGFP- 5AzW Hit 14 30453 30453 Labeled with DBCO- 39-TAG biotin
[0066] Since the negative selection step in the aforementioned scheme only discriminates against natural amino acids, but not other UAAs, the isolated mutants are sometimes capable of polyspecificity: The ability to charge a number of structurally similar UAAs, while discriminating against the 20 canonical amino acids. The isolated EcTrpRS mutants were screened for polyspecificity towards other 5-substituted tryptophan derivatives (
[0067] In conclusion, the present invention demonstrates here the feasibility of functionally replacing an endogenous tRNA/aaRS in E. coli with an E. coli-optimized counterpart from a different domain of life with no growth-penalty, and that the resulting strain can be used as a selection host for evolving variants of the liberated tRNA/aaRS pair for charging UAAs. In addition to E. coli, these variants also enable genetic code expansion in eukaryotes. Since bacterial tRNA/aaRS pairs are generally orthogonal in eukaryotic cells, this approach holds the potential to provide additional universal tRNA/aaRS platforms. The present invention also introduces a new tryptophanyl-tRNA synthetase/tRNA platform, the utility of which was illustrated by introducing several new UAA additions to the genetic code of E. coli as well as eukaryotes, including 5AzW and 5PrW which enable selective bioconjugation reactions. Access to new universal tRNA/aaRS pairs will augment the structural diversity of genetically encoded UAA toolbox, and facilitate the development of powerful new technology involving simultaneous incorporation of multiple UAAs into a polypeptide in both E. coli and eukaryotes.
[0068] A number of tRNA/aminoacyl-tRNA synthetase (aaRS) pair has been developed to date to site specifically incorporate novel unnatural amino acids into proteins. The E. coli tryptophanyl-tRNA synthetase/tRNA pair developed in the present invention is unique and is a novel system because it can be used to incorporate new unnatural amino acids into proteins both in an engineered E. coli (the strain created where the endogenous tryptophanyl tRNA/aaRS pair was replaced with a counterpart from yeast) as well as in eukaryotic cells. This tRNA/aaRS pair has been engineered to enable site specific incorporation of six new unnatural tryptophan analogs into proteins expressed in both E. coli as well as eukaryotic cells.
[0069] The novel system described in the present invention has a number of potential applications. This includes site-specific bioconjugation using 5-azidotryptophan and 5-propargyltryptophan: These two unnatural amino acids can be incorporated into proteins expressed in both E. coli and eukaryotic cells (e.g., mammalian cells), and will allow site-specific bioconjugation using bioorthogonal azide-alkyne click reactions. This reaction can be used to site-specifically attach onto proteins a variety of entities such as drugs (for antibody-drug conjugation), attachment of biophysical probes (such as fluorophores, PET probe, etc.), polyethylene glycol (to improve pharmacokinetic properties of therapeutic proteins), onto recombinantly expressed proteins.
[0070] The same strategy can also be used to label the capsid proteins of human viruses, which can be subsequently labeled with either probes to study its infection process, or attach synthetic receptor binding agents that target specific cell-surface receptors to generate cell-specific viral vectors. The site-specific conjugation strategy can also be used to attach relevant proteins on surfaces (e.g., sensor chips, electrodes, etc.) with precise control over its orientation and site of attachment.
[0071] Another application involves 5-azidotryptophan, which is an aryl-azide, which upon irradiation forms a highly reactive nitrene intermediate. Its incorporation into proteins will allow light-induced capture of weak protein-protein interactions by the formation of a stable covalent linkage between the reactive nitrene intermediate and various residues from the interacting protein. This can be a powerful tool to interrogate weak or transient protein-protein interactions. Additionally tryptophan residues are frequently found at the interface of protein-protein interactions, making 5-azidotyrptophan an ideal candidate to capture such interactions.
[0072] The tryptophanyl-tRNA synthetase/tRNA pair of the present invention can be further engineered to incorporate new tryptophan analogs such as fluorinated-tryptophan (NMR as well as fluorescence probe), nitrated tryptophans (these residues form naturally as post-translational modifications, but their physiological relevance remain poorly characterized, since such modified proteins cannot be homogeneously produced; however, the ability to specifically charge these nitrated amino acid into specific sites of proteins using the technology of the present invention will circumvent this problem). The large active site of the tryptophanyl-tRNA synthetase can also be engineered to charge other unnatural amino acids with large side chains, such as fluorophores.
[0073] A variety of tryptophan analogs can be incorporated site-specifically into proteins expressed in E. coli or eukaryotic cells, enabling the study of the roles of functionally important tryptophan residues. 5-azidotryptophan, 5-propargytryptophan, which can be incorporated site-specifically into any protein expressed in bacteria or eukaryotic cells using our platform, will allow selective conjugation of various molecules to these proteins for applications such as PEGylation, generation of site-specific antibody-drug conjugates. 5-azidotryptophan which can be incorporated site-specifically into any protein expressed in bacteria or eukaryotic cells using the platform of the current invention, can be used as a photo-affinity probe to capture weak or transient protein-protein interactions within a living cell.
[0074] The present invention has many commercial applications and could be useful to manufacturers of research kits, and to companies pursuing antibody-drug conjugate or other protein modification (such as PEGylation). Without further elaboration, it is believed that one skilled in the art can, based on the above description, utilize the present invention to its fullest extent. The following specific embodiments and examples are, therefore, to be construed as merely illustrative, and not limitative of the remainder of the disclosure in any way whatsoever.
[0075] Without further elaboration, it is believed that one skilled in the art can, based on the above description, utilize the present invention to its fullest extent. The following specific embodiments and examples are, therefore, to be construed as merely illustrative, and not limitative of the remainder of the disclosure in any way whatsoever.
Materials and Methods
[0076] For cloning and plasmid propagation, the DH10b (Life Technologies) strain of E. coil was used. Polymerase chain reaction (PCR) was performed using the Phusion Hot Start II DNA polymerase (Fisher Scientific) using manufacturer's protocol. For purification of DNA (plasmid as well as PCR products, etc.) spin columns from Epoch Life Science were used. Restriction enzymes and T4 DNA ligase were obtained from New England BioLabs (NEB). DNA oligomers for PCR were purchased from Integrated DNA Technologies (IDT). Verification of cloned DNA by Sanger sequencing was performed by Eton Biosciences. Antibiotics, isopropyl -D-1-thiogalactopyranoside (IPTG), and L-arabinose were purchased from Sigma-Aldrich or Fisher Scientific. Components of media were obtained from Fisher Scientific. Bacteria were grown on LB or M63 agar plates.sup.17 and LB liquid medium with the following antibiotic concentrations unless otherwise mentioned: 95 g/mL spectinomycin, 20 g/mL chloramphenicol, and 100 g/mL ampicillin, 30 g/mL kanamycin, 15 g/mL zeocin, 12 g/mL tetracycline, 10 g/mL gentamycin.
[0077] Sequences of the oligonucleotides used herein are shown in Table 2, below SEQ ID NOS: 6-74, respectively.
TABLE-US-00002 SEQID NO: 6 TrpRS.Z.ab- ATCAGTCTATAAATGACCTTCTGCCCGCATTAGGGCTTC F CGCATAGCGAAAATCAGGAATCGAAAAAGGTGTTGACA ATTAATCATCGGC 7 TrpRS.Z.ab- TGTAGGCCGGATAAGGCGTTCACGCCGCATCCGGCATGA R ACAAAGCGCAATTTGCCAGCAATAGTGAAAGCTTGCAA ATTAAAGCCTTCG 8 TrpRS150F GTCGGCGACTCACGCAATGATATTCAGGCGGC 9 TrpRS150R AGCGAGATGTGGAAACGGCGAGGCACTTCAC 10 Zeo-iR CTGGTCCTGGACCGCGCTGATGAAC 11 TrpRS_sqiR ATCCTGGCGCACGGTGATCGCGTGTTG 12 trpTKO.Gent-F CAGTCGGTTAGAATACCTGCCTGTCACGCAGGGGGTCGC GGGTTCGAGTCCCGTCCGTTCCGCCACCCTAATTACGCA CACCGTGGAAAC 13 trpTKO.Gent-R CGGTAGAAGGATTTACTTCGGAGAGGGTTATTCAGATA AAAAAAATCCTTAGCTTTCGCTAAGGATGATTTCCCGGG AAGCCGATCTCG 14 trpTGsqF GGGGTCTCCCCATGCGAGAGTAGGGAAC 15 trpTGsqR CCGTTGTCGATAGCACAACACTTTCACGGCC 16 galK.90del CGCGCAGTCAGCGATATCCATTTTCGCGAATCCGGAGTG TAAGAACGCGCAGTCAGCGATATCCATTTTCGCGAATCC GGAGTGTAAGAA 17 galK_KO_verf_F TGGCAGAGACCCAGCG 18 galK_KO_verf_R ACCCCAGTCCATCAGCG 19 dLambda.galK-F GCTATGAAATAGAAAAATGAATCCGTTGAAGCCTGCTTT TTTATACTAACTTGAGCGAAACGGGAAGCCTGTTGACAA TTAATCATCGGC 20 dlambda.galK GCCGCGTTGATTTTCTCCTGCCAGCTCATAATGCTGCCGC dterm-R GTTGTAATATTCAGCACTGTCCTGCTCCTT 21 dlambda.sqF GGTTTGATCAGAAGGACGTTGATCGGGCGG 22 dlambda.sqR TTCAGATACTGGCGATCATCCGCCACCAG 23 dLambda.sqiR AGCCCATTGATAGTTTTCATGTGCGACAATGGGCG 24 EcWRS_mut7_8-F GAATCCCATATGATGACTAAGCCCATCGTTNBTNSTGGC GCACAGCCCTCAGGTGAATTG 25 libEcWRS- TACGCTTTGAGGAATCCCATATGATGACTAAGCCCATCG NdeI-F 26 EcWRS1_mut- CAGATTAGTTTGATACAGCAGGATGTCCGCTGCCATC VPViR 27 EcWRS1_mut144-6_F GATGGCAGCGGACATCCTGCTGTATCAAACTAATCTGnnk nnknnkGGTGAAGACCAGAAACAGCACCTCGAACTGAGC 28 EcWRS_NcoI_PstI_termR agcgtttgaaactgcagccatggtaccTTACGGCTTCGCCACAAAACCA ATCGC 29 proK-F GTTAGCCTGCAGGTAATTCCGCTTCGCAACATGTGAG 30 TrpHNcoI-R GGCCGCCATGGCAAATTCGACCCTG 31 Trp40CCA- GCAACCAGGCGCTTTGGAGGCGCCAGCTCTACCCTGAG iR 32 Trp40CCA- AGCTGGCGCCTCCAAAGCGCCTGGTTGCAGGTTC iF 33 SmR-R CGCGCGCAGATCAGTTGGAAGAATTTGTCCACTACGTG 34 PNP-SpeI-F ATAATGGACTAGTGCGCTTGTTTCGGC 35 pNP-BAB- CTCCTTAGATCTTCCTAGGTGGATCCACCATTCC R 36 pEvoICmRSpeI-F AATAATACTAGTGTTGATACCGGGAAGCCCTGG 37 pEvol_CmRPstI-R AATAATCTGCAGCGAATTTCTGCCATTCATCCGCTTATTA TCAC 38 CmR-TGA-7 GCTAAGGAAGCTAAAATGGAGAAAAAAATCACTTGATA TACCACCGTTGATATATCCCAATGGC 39 CmR-TGA-84 GCAATGAAAGACGGTGAGCTGGTGTGATGGGATAGTGT TCACCCTTGTTACACC 40 CmR-TGAT-98 CCCTTGTTACACCGTTTTCCATGAGTGATCTGAAACGTTT TCATCGCTCTGGAG 41 pRep-KpnI- AATAATaggtaccGTTCTGTTGCCCGTCTCACTGGTG tR-F 42 pRep- AATAATAcatatgCCTAGGTGGCAGGGGCGGAGAGACTC EcWtR- NdeI/AvrII- R. 43 EcW-TGA-MSDM GTTCAATTGGTAGAGCACCGGTCTTCAAAACCGGGTGTT GGGAGTTCGAG 44 T7F1 CAGGTTCGCAGCGTCAGCCGGAATGGTACCG 45 T7R3 GCGCCCGACAGCCTTCCAGTTCCTGTGAGAAATCAAGCC GGAAGCCGTAGCGTAC 46 T7F3 GTACGCTACGGCTTCCGGCTTGATTTCTCACAGAACTG GAAGGCTGTCGGGCGC 47 T7R4b CCATGACCATGATTACCGTGCACTGAAATACCATTAACA TTGCTAAGAACG 48 T7F4 CGTTCTTAGCAATGTTAATGGTATTUCAGTGCACGGTAA TCATGGTCATGG 49 T7R5 CGAAGGCGAAGCGGCATGCATAATGTGCCTGTCAAATGG 50 T7_mut- AATAATcctgcaggCTACTCAGGAGAGCGTTCACCGAC SbfI-F 51 T7_mut- AATAATCCTGCAGGGCGGCCGCTACGGGAGGGCTTACC NotI/SbfI-R ATCTGG 52 Barnase TTTAACTTTAAGAAGGAGATATACATATGGCATgaGTTAT MSDM3 CAACACGTTTGACGGGGTTG TGA 53 Barnase GTGGCATCAAAAGGGAACCTTGCATGAGTCGCTCCGGG MSDM45 GAAAAGCATC TGA 54 EcWtR- TTAGCCTGCAGTGTGCTTCTCAAATGCCTGAGGCCAGTT PstI-F TGCTC 55 EcWtR- GCGCCCCGCATTTAGGGGCGTAGTTCAATTGGTAGAGCA prok-oF CCGGTC 56 EcWtR- ACTACGCCCCTAAATGCGGGGCGCATCTTACTGCGC proK-oR 57 EcWtR- ATATATGGTACCAAAAAATGGCAGGGGCGGAGAGACTCG KpnI-R 58 GFPflip- TCGATCCCGCGAAATTAATACGACTCACTATAG NotI-F+ 59 sfGFP- ATATACTGCAGCGCCAAGCTAGCTTGGATTCTCACCAAT T7+lam- AAAAAACGC PstI-R 60 MjYtRdelF TGGCAGGGGCGGAGAGACTCGAACTCC 61 MjYtRdeloR CGAGTCTCTCCGCCCCTGCCA AATTCGAAAAGCCTGCTCAACGAGCAGG 62 EcWtR GTTCAATTGGTAGAGCACCGGTCTTCAAAACCGGGTGTT TGA GGGAGTTCGAG MSDM 63 sfGFP CTCGAGTACAACTTTAACTCACACAATGTATGAATCACG (pEvol) GCAGACAAACAAAAGAATGG TGA151 64 EcWRS1.FA.NotI-F AATAATAgcggccgcATGACTAAGCCCATCGTTTTTGCTGGC GCAC 65 EcWRS- AATAATAgcggccgcTTACGGCTTCGCCACAAAACCAATCGC NotI-R 66 pUltraII- GGTGCCCTTAAACGCCTGGTTGC tRsqR 67 EcTrpRS- AAtAAAcatatgATGACTAAGCCCATCGTTTTTAGTGGCGCAC NdeI-F 68 EcTrpRS- TTATTCTGCAGTTACGGCTTCGCCACAAAACCAATCGC PstI-R 69 TrpRS-REcoRI ATTATTGAATTCTTACGGCTTCGCCACAA 70 TrpRS-F- AATAAATGCTAGCATGACTAAGCCCATC NheI 71 U6-R AATTATTGCTAGCAAAAAATGGCAGGGGCG tRNAtrp* AvrII 72 tRNAtrp* AATTATTGCTAGCAAAAAATGGCAGGGGCG Nhe-R 73 EcWRS_mamNheI-F aataataGCTAGCgccaccATGACTAAGCCCATCGTT 74 EcWRS_mamEcoRI-R AATAATAgaattcTTACGGCTTCGCCACAAAACCAATCGC
[0078] Statistical methods. For all expression analysis (sfGFP in bacteria or EGFP in HEK293T cells), mean of three independent experiments were reported, and error bars represent standard deviation. For the growth rate analysis, each data point represent the mean O.D.600 of three independent cultures of the same strain (error bars represent S.D.). In our experience, a mean of three experiments provides adequate levels of accuracy for these experiments.
[0079] Strains, cell lines. The EcNR1 strain was a gift from Prof. George M. Church. The HEK293T cell-line was obtained from ATCC, and propagated without further confirmation. Cell lines are regularly monitored for Mycoplasma contamination. Even though HEK293T is listed under misidentified cell lines in ICLAC database, we used it for our expression analyses as a representative mammalian cell-line. Given the wide-spread use of this cell-line as a model mammalian expression host, and since our conclusions does not rely on its specific identity (beyond a representative mammalian cell-line), we believe that the use of this strain is justified.
[0080] Lambda-Red recombination. All strains were derived from EcNR1.sup.16. This strain contains temperature inducible lambda-recombinase genes (Exo, Beta, Gam) and a constitutive ampicillin resistance gene disrupting the bioA/bioB genomic locus. Strains were grown in 10 mL LB at 30 C. to 0.5 OD.sub.600 and then induced for 15 minutes in a 42 C. water bath (250 rpm). The cell pellet was then washed twice with 10 mL ddH.sub.2O by centrifuging at 4500 xg. Cells were resuspended in residual ddH.sub.2O (250 L) and 50 L aliquots were electroporated with 50 ng dsDNA or 2 M 90 by oligo containing 45-77 by genomic homology, depending on the desired recombination. Electroporated cells were recovered for 6 hours and plated on either LB or minimal media. Single colonies from selection plates were re-streaked and subjected to colony PCR using KAPA-2G polymerase (Kapa Biosystems), following manufacturer's instructions, to verify desired recombination.
[0081] Building ATMW1. EcNR1 was transformed with pUltraG_ScW40.sub.CCA. To remove the E. coli tryptophanyl-tRNA synthetase (trpS) from this strain, the gene encoding zeocin resistance (ShBle) driven by the EM-7 promoter and the CYC1 transcription terminator was PCR amplified using primers TrpRS.Z.ab-F and TrpRS.Z.ab-R to generate the PCR product trpS::Zeo.sup.R. 50 ng of the trpS::ZeoR PCR cassette was transformed in the recombination following the aforementioned protocol, and the resulting strains were plated on LB-Agar plates supplemented with Zeocin. The resulting colonies were screened via colony PCR using TrpRS150F+TrpRS150R, TrpRS150F+Zeo-iR, and TrpRS150F+TrpRS_sqiR, as well as sequencing these colony PCR products. This strain was named EZ4.
[0082] To replace the E. coli trp-tRNA.sub.CCA (trpT), the trpT::Gent.sup.R PCR cassette was amplified using trpTKO.Gent-F and trpTKO.Gent-R (965 bp). 50 ng of trpT::Gent.sup.R PCR cassette was transformed into EZ4, induced as previously described. Resulting gentamycin resistant colonies were screened for the desired recombination using colony PCR primers trpT GsqF and trpT GsqR, as well as sequencing of the PCR product. The resulting strain was named EZG4.
[0083] 2 M 90 by oligo, galK.90 del, was used to delete galK from the endogenous genomic location. Following transformation, cells were recovered for 6 hours, washed twice with M9 minimal media at 5000 xg for 5 min and 100 L of a 10.sup.4 dilution was plated on M63 minimal media supplemented with glycerol and 2-deoxygalactose to select for successful galK deletion. Colony PCR was used to verify the deletion of galK using galK_KO_verf-F/R. The resulting strain was named G4.
[0084] galK.PCR cassette was amplified containing the endogenous promoter dlambda.GalK-F and dlambda.galK dterm-R (1348 bp) and was used to remove the -RED genes from the G strain. Following transformation of 50 ng of this PCR product into strain G, successful insertion of the galK.PCR cassette into the -RED site was selected by plating the cells on M63 plates containing galactose as the sole carbon source for three days..sup.17 Surviving colonies were screened by colony PCR using dlambda.sqF with dlambda.sqR or dLambda.sqiR for the desired deletion. This final strain was named ATMW1 (EcNR1 trpS::Zeo.sup.R trpT::Gent.sup.R galK -RED::galK.
[0085] Essentially the same procedure as described above for producing ATMW1 was used to produce BL21(DE3).
[0086] Growth Comparison. 5 mL starter cultures of EcNR1G, EcNR1G+pUltraG_ScW40.sub.CCA, and ATMW1 strains were grown for 16 hrs in LB with all strain-dependent antibiotics. For each strain, the starter culture was diluted to an initial OD.sub.600 of 0.01 in three identical cultures of 80 mL LB with no antibiotics and allowed to grow in 250 mL sterile Erlenmeyer flasks at 30 C., with shaking (250 rpm). Growth was monitored every 30 min by measuring OD.sub.600 in a 10 mm cuvette.
[0087] Assessment of tRNA/aaRS activity using a chloramphenicol reporter. Overnight cultures of ATMW1 harboring pRepAC-EcW-TAG or pRepJI-EcW-TGA, with or without pBK-EcWRSwt, were diluted to an OD.sub.600 of 0.1 and 3 L was spot plated on LB agar plates supplemented with kanamycin (+pBK plates), spectinomycin, tetracycline, and varying chloramphenicol concentrations. Growth was analyzed after 48 hrs of incubation at 37 C.
[0088] Assessment of tRNA/aaRS activity using a sfGFP151 reporter. EcNR1 or ATMW1 harboring pEvolT5-EcW sfGFP151 (TAG or TGA) with or without pBK-EcWRSwt, pRK-EcWRS-h14, or pBK-EcWRS-h9 were grown overnight in LB. The starter cultures were diluted in LB supplemented with required antibiotics to 0.05 OD.sub.600. Cultures were grown at 30 C. or 37 C. (30 C. when comparing to progenitor strain EcNR1) until 0.55 OD.sub.600, at which point the sfGFP expression was induced with a final concentration of 1 mM IPTG. Unnatural amino acids (UAA) were added during induction to a final concentration of 1 mM. Cultures were grown for an additional 17-20 hours at 37 C. with shaking. To evaluate sfGFP expression, cells from 150 L of the cultures were pelleted at 5000 xg, resuspended in 150 L PBS, and transferred to a 96-well clear-bottom assay plate. Fluorescence was measured by using a SpectraMAX M5 (Molecular Devices) (Ex. 488 nm; Em. 534 nm). Fluorescence for each sample was normalized using its OD.sub.600.
[0089] Protein purification. To maximize the yield of UAA-modified protein expression, a different plasmid combination was used: EcTrpRS-h14 and -h9 was cloned into a pEvoltac plasmid that expresses them from a strong tacI promoter, while the tRNA.sup.EcTrp.sub.UCA is expressed from the proK promoter. The sfGFP reporter gene (sfGFP-151-TGA or wild type sfGFP) was expressed from pET22b-T5lac plasmid driven by the strong t5.lac promoter. Overnight expression cultures were centrifuged and resuspended in lysis buffer: B-PER Bacterial Protein Extraction Reagent (Thermo Scientific)+1 Halt Protease Inhibitor Cocktail (Thermo Scientific)+0.01% Pierce Universal Nuclease (Thermo Scientific). After 30 min incubation at room temperature, the lysate was clarified by centrifuging at 22,000 xg for 5 min. The C-terminally polyhistidine tagged soluble sfGFP in the supernatant was purified using a HisPur Ni-NTA resin (Thermo Scientific) following manufacturer's protocol. Protein purity was confirmed by SDS-PAGE and purified protein molecular weight was confirmed by ESI-MS (Agilent Technologies 1260 Infinity ESI-TOF).
[0090] Construction of the ECWRS-5HTP pBK library. Overlap extension was used to introduce degenerate codons, creating the five-residue tryptophanyl tRNA-synthetase library pBK-EcWRS1.5 (786,432 diversity): F7-NBT, S8-NST, V144-NNK, P145-NST, V146-NNK. Using Phusion HSII (Fisher Scientific) and manufacturer's protocol, EcWRS1_mut7-8-F+EcWRS1_mut-VPViR and EcWRS1_mut-VPViR and EcWRS1_mut144-6_F+EcWRS_NcoI_PstI_termR were used to PCR amplify the N-term and C-term of the EcWRS PCR product, respectively. The N-term and C-term PCR products were joined together by overlap extension PCR using the following terminal primers: libEcWRS-Ndel-F and EcWRS_NcoI_PstI_termR. These inserts were digested with NdeI/NcoI (NEB) and ligated by T4 DNA Ligase (NEB) into pBK vector cut with the same restriction enzymes. The ligation mixture was ethanol precipitated with Yeast-tRNA (Ambion) and transformed into electrocompetent DH10b cells. The library was covered using 10.sup.7 distinct colony forming units.
[0091] Selection for Synthetase Charging 5HTP. ATMW1 was co-transformed with the pBK-EcWRS1.5b library and the positive selection reporter plasmid pRepJI EcW TGA. The reporter plasmid harbors a lpp-promoted E. coil tRNA.sup.Trp.sub.UCA, a CAT reporter modified to contain TGA codons (Q98TGA), an arabinose-inducible T7 RNA polymerase harboring two TGA nonsense codons (at positions 8 and 114), and a wild-type GFPuv reporter expressed from a t7 promoter. Suppression of CAT leads to chloramphenicol resistance, and suppression of T7 RNA polymerase drives expression of a t7-promoted GFPuv. 9.210.sup.7 cfu (colony forming units) were plated on LB+0.5Spec/0.5Tet/Kan/0.02% arabinose+chloramphenicol (25, 35, 45 g/mL) in the presence of 1 mM 5HTP for 36 hrs at 37 C.
[0092] Colonies from the 35 and 45 g/mL chloramphenicol positive selection plates were harvested, and the pBK plasmids harboring mutant EcTrpRS were purified by miniprep and gel purification. These were co-transformed into ATMW1 harboring the negative selection plasmid pNegJ2-EcW (araBad-Barnase harboring two TGA codons at 3 and 45). 310.sup.7 cfu were plated on LB+Amp/0.5Kan/0.02% arabinose and incubated for 12 hrs at 37 C. Cells were harvested and library pBK plasmid was purified by miniprep/gel purification.
[0093] Isolated pBK plasmids from the negative selection were transformed again into ATMW1 pRepJI-EcW TGA, and 10.sup.6 cfu were plated on LB+0.5Spec/Tet/Kan0.02% arabinose+chloramphenicol (30, 40, 50 g/mL) in the presence or absence of 1 mM 5HTP for 18 hrs, 37 C., which revealed significantly higher number of colonies in the presence of the UAA. 96 colonies were picked into a 1 mL LB supplemented with Spec/Tet/Kan in deep 96 well polypropylene plates and grown overnight. The resulting cultures were diluted to 0.01 OD.sub.600 and 3 L of each was spot plated on LB/Agar plates supplemented with Spec/Tet/Kan, and chloramphenicol (50, 60 g/mL) in the presence or absence of 5HTP. Four clones exhibiting the most prominent UAA dependent growth were picked and sequenced.
[0094] Assessing activity of tRNA.sup.EcTrP.sub.CUA and synthetase hits in HEK293T. Dulbecco's modified Eagle's medium (high glucose DMEM) supplemented with 10% fetal bovine serum (FBS) and Penicillin/Streptomycin (0.5) was used to culture HEK293T cells. Cells were incubated in a humidified incubator at 37 C. in the presence of 5% CO.sub.2. HEK293T were seeded at a density of 600,000 per well for a 12-well plate one day prior to transfection and transfected at 70% confluence. Polyethylenimine (PEI, Sigma) and DNA were mixed at a ratio of 4 L PEI (1 mg/mL) to 1 g DNA in DMEM. After 20 min incubation, 100 L of this mixture was used to transfect one single well in a 12-well plate. For these transfections, 500 ng of pAcBac EGFP39* U6-EcWtR TAG was transfected in the presence or absence of pAcBac-TrpRS (wt, h14, or h9) U6-EcWtR TAG. UAAs were added to the culture medium to a final concentration of 1 mM at the time of transfection. Fluorescence images were taken at 48 hrs post-transfection using a Zeiss Axio Observer fluorescence microscope.
[0095] To obtain EGFP39*-expression data, cells were harvested, washed once with PBS buffer (5000 xg), and lysed with CelLytic-M lysis buffer (Sigma) supplemented with 1 Halt protease inhibitor and 0.01% Pierce universal nuclease. 50 L lysis buffer was used for each well of a 12-well plate, and was allowed to incubate for 20 min. After incubation, the lysate was clarified by centrifuging at 22000 xg for 5 minutes and was transferred to a clear bottom 96-well assay plate. Fluorescence was measured using a SpectraMAX M5 (Molecular Devices) (Ex. 488 nm; Em. 534 nm).
[0096] Isolation of EGFP-39-TAG reporters from HEK293T. HEK293T cells were cultured as previously described. One day prior to transfection, cells were seeded at a density of 8-million cells/10 cm dish. 50 L PEI MAX (Polysciences) was mixed with 10 g total DNA (5 g reporter, 5 g tRNA/aaRS plasmid) with 180 L DMEM (no FBS), incubated for 20 min, and added evenly to the dish at 90% confluence. Desired UAAs were supplemented at a final concentration of 1 mM and cells were allowed to express the desired protein for 48 hours.
[0097] Cells from a 10 cm dish were harvested and lysed with 600 L CelLytic M lysis buffer (Sigma, 1 Halt protease inhibitor, 0.01% Pierce universal nuclease). Lysate was clarified as described and purified via Ni.sup.2+-NTA, following manufacturer's protocol. Purified protein was analyzed by SDS-PAGE and molecular weight was confirmed by ESI-MS (Agilent Technologies 1260 Infinity ESI-TOF).
[0098] Click-labeling of 5AzW and 5PrW residues. Purified proteins containing 5AzW were incubated with or without 20 M DBCO-Cy5 (Sigma) for 1 hr in the dark at room temp. Proteins samples were resolved by SDS-PAGE gel and imaged using Cy5 specific settings on a Chemidoc MP Imaging System (Bio-Rad). The SDS-PAGE gels were then coomassie stained and imaged.
[0099] 5PrW containing proteins were labeled using the Click-iT Plus Alexa Fluor Picolyl Azide kit (Life Technologies) with a modified protocol. The following were mixed in order: 1 g protein (4 L), 3 L 10 buffer additive, 0.3 L 100 mM CuSO.sub.4, 0.3 L copper protectant, 1.2 L Alexa 488 PCA (50 M, 2 M final), 21.2 L 1 Click-iT reaction buffer, to a final volume of 30 L. Samples were incubated for 40 min in dark and subsequently resolved by SDS-PAGE and imaged using the Chemidoc MP Imaging System (Bio-Rad) with Alexa488 specific settings.
[0100] Tryptic Digestion and LC-MS/MS Analysis of reporter proteins. 12 g purified reporter protein was treated with DMSO (0.2 M) and precipitated with 5 L 100% solution of trichloracetic acid. Sample was frozen at 80 C. overnight. Thawed sample was centrifuged at 15000 rpm for 10 min, room temp. Supernatant was removed and pellets were vortexed to resuspend in 500 L cold acetone. Samples were then centrifuged at 5000 rpm for 10 min. Acetone was then removed and pellet was allowed to air dry. Pellet was resuspended in 30 L 8 M urea in PBS, followed by 70 L 100 mM ammonium bicarbonate and then 1.5 L 1 M DTT was added. Samples were incubated at 65 C. for 15 minutes. After incubation, 2.5 L of 500 mM iodoacetamide in PBS was added and the sample was left at room temperature for 30 minutes. Following incubation, 120 L PBS was added to each sample and vortexed rapidly. 4 L of trypsin was added to samples, followed by 2.5 L 100 mM CaCl.sub.2. Samples were then agitated for 37 C. overnight. Trypsin was then quenched with 10 L formic acid and pelleted at 15,000 rpm for 20 min. Supernatants were stored at 20 C.
[0101] Samples were subsequently analyzed by LC MS/MS using a LTQ Orbitrap XL mass spectrometer (ThermoFisher) coupled to an EASY-nLC 1000 nanoLC (ThermoFisher). 10 L samples were loaded onto 100 m fused silica column with a 5 m tip packed with 10 cm of Aqua C18 reverse phase resin (Phenomenex) using the EASY-nLC 1000 autosampler. The digests were eluted using a gradient 0-100% Buffer B in Buffer A (Buffer A: 95% water, 5% acetonitrile, 0.1% formic acid; Buffer B; 20% water, 80% acetonitrile. 0.1% formic acid). The flow rate through the column was set to 400 nL/min and the spray voltage was set to 3.5 kV. One fuL1 MS scan (FTMS) (400-1800 MW) was followed by 7 data dependent scans (ITMS) of the nth most intense ion from the imported mass list with dynamic exclusion. The tandem MS data were searched using the SEQUEST algorithm using a concatenated target/decoy variant of the human IPI databases. A static modification of +57.02146 on cysteine was specified to account for iodoacetamide alkylation. SEQUEST output files were filtered using DTASelect 2.0.
Construction of Various Plasmids:
[0102] Complementation plasmid pUltraG_ScW40.sub.CCA. Previously reported pUltra Hit14, containing a tacI promoted wild type yeast tryptophanyl-tRNA synthetase and proK promoted evolved yeast tRNA amber suppressor,.sup.14 was used to generate additional pUltra derivatives. To generate pUltra_ScW40.sub.CUA, a previously evolved tRNA 40A was amplified from 40A gblock (IDT) using primers proK-F and TrpH NcoI-R..sup.15 The fill-length prok-tRNA PCR product was cloned into pUltra Hit14 via SbfI/NcoI (NEB), producing pUltra ScW40.sub.CUA.
[0103] To build pUltraG_ScW40.sub.CCA, a complementation plasmid with a constitutively active synthetase, the tacI promoter region was removed by digesting the pUltra_ScW40.sub.CUA vector with XbaI/SbfI, and a glnS-ScWRS cassette was amplified from an pEVOL.sup.8 derived plasmid, pEvol ScW, using glnS-XbaI-F+glnS-SbfI-R and subsequently cloned using the same set of restriction sites. The 40A tRNA anticodon was then mutated to CCA using overlap extension PCR. The two PCR products were amplified with proK-F+Trp40CCA-iR and Trp40CCA-iF+SmR-R, overlap extended, and cloned into pUltraG_ScW40.sub.CUA using SbfI/NcoI, producing the final complementation plasmid pUltraG_ScW40.sub.CCA.
[0104] Construction of selection plasmids. To generate the TAG positive selection plasmid, pRep-ScW14.sup.14 was digested with SpeI/BgIII. The lpp-promoted E. coli tRNA.sup.Trp.sub.CUA was amplified from gblock I-EcWtR TAG with pNP-SpeI-F+pNP-BAB-R. The PCR product was digested with SpeI/BamHI, and cloned into the SpeI/BgIII sites of the pRep vector creating pRepAC-EcWtR-TAG.
[0105] In order to create a selection plasmid based on TGA suppression, a smaller pRep-Cm3b.sup.32 was used as a starting template. In order to delete the chloramphenicol-UPP fusion protein, chloramphenicol acetyl transferase was amplified from pEvol.sup.33 with pEvol CmR SpeI-F+pEvol CmR PstI-R, digested with SpeI/PstI, and ligated into the same sites of pRep-Cm3b, creating pRep-Cm3J-wt. TGA sites were introduced into the CAT gene by replacing G7, 184, or Q98 via site-directed mutagenesis using primers CmR-TGA-7, CmR-TGA-84, or CmR-TGAT-98, building three different reporters. The lpp-promoted E. coli tRNA.sup.Trp.sub.CUA was amplified from pRepAC-EcWtR-TAG with pRep-KpnI-tR-F and pRep-EcWtR-NdeI/AvrII-R and cloned via KpnI/AvrII into these pRep-Cm3J plasmids, creating three different pRep-Cm3J-#TGA-EcWtR TAG. The anti-codon of the tRNA was mutated to TGA using site-directed mutagenesis with primer EcWtR-TGA-MSDM. While all three reporter plasmids exhibited desired phenotypes upon TGA suppression, pRep-Cm3J-98TGA-EcWtR was used for the subsequent cloning steps.
[0106] To add the T7 RNA polymerase-GFPuv reporter system into pRep-Cm3J-98TGA-EcWtR, these elements were amplified from pRepAC-EcWtR-TAG using multiple overlap extension PCR reactions to change the two TAG nonsense codons in the T7RP gene to TGA: Three PCR products were first amplified using T7F1+T7R3, T7F3+T7R4b, T7F4+T7R5, which were overlapped to produce T7-araC PCR cassette. This cassette was first cloned back into pRepAC-EcWtR-TAG via KpnI/NsiI, and then the entire GFP-T7araC cassette was amplified from the resulting plasmid with T7_mut-SbfI-F+T7_mut-NotI/SbfI-R, digested with SbfI, and ligated into the PstI site of pRep-Cm3J-98TGA-EcWtR vector producing pRepJI-EcW.
[0107] The negative selection plasmid was built by modifying an existing pNeg plasmid..sup.3,4 The two barnase suppression sites (3TAG, 45TAG) were mutated to TGA with site-directed mutagenesis using primers Barnase MSDM 3 TGA and Barnase MSDM 45 TGA, creating plasmid pNegJ2. The E. coli tRNA.sup.Trp.sub.CUA was amplified from pRep-Cm3J-98TGA-EcWtR with pRep_NegtR-EcoRI-R and pRep_KpnI-tR-F and then cloned into pNegJ2, replacing the preexisting tRNA, creating pNegJ2-EcW.
[0108] Construction of bacterial suppression plasmids. Since ATMW1 uses the pUltraG plasmid to express the yeast tryptophanyl pair, the suppression plasmids cannot use the CloDF13 origin of replication, or the spectinomycin marker. The previously described pEvoltac MjY plasmid,.sup.8 which harbors a compatible p15a origin of replication and a chloramphenicol resistance marker, was used as the template to generate these plasmids. The plasmids pEvolT5 EcW sfGFP151 (TAG or TGA) were built to allow the rapid evaluation of various EcTrpRS mutants isolated from the selection scheme (encoded in pBK vector). Initially, a t5.lac-promoted sfGFP-151-TAG was amplified from pET22b-T5-sfGFP151TAG using GFPflip-NotI-F+sfGFP-T7+lam-PstI-R and subsequently cloned into pEvoltac MjY using EcoNI/PstI, to generate pEvolT5 MjY sfGFP151TAG.
[0109] A proK-promoted E. coli tRNA.sup.Trp.sub.CUA was assembled by overlap extension as follows. The proK promoter was amplified from pUltraG ScW40 using EcWtR-PstI-F+EcWtR proK-oR, which was overlapped with tRNA.sup.EcTrp.sub.CUA amplified with EcWtR-proK-oF and EcWtR-KpnI-R. This PCR product was cloned into pEvoltac MjY sfGFP151TAG using PstI/KpnI, producing pEvoltac MjY EcWtR sfGFP151TAG. The MjY tRNA was removed by using Polymerase Incomplete Primer Extension (PIPE) cloning with primers MjYtR-del-oF and MjYtR-del-oR, producing the final plasmid pEvolT5-EcW-sfGFP151TAG. Additionally, site-directed mutagenesis was used to build the TGA reporter pEvolT5-EcW-sfGFP151TGA with primers sfGFP(pEvol)TGA151 and EcWtR TGA MSDM.
[0110] For more efficient protein expression using evolved tRNA/aaRS pairs, plasmids containing the proK promoted tRNA.sup.EcTrp.sub.UCA and tacI promoted EcWRS-h14 or -h9 were assembled. The tRNA in pEvoltac MjY was first replaced by amplifying the EcTrp-tRNA.sub.UCA from pEvoltac-EcW-sfGFP151TGA with EcWtR PstI-F and pUltraII-tRsqR, and subsequently cloning into PstI/SphI to generate pEvoltac-EcW-MjYRS. EcWRS-h14 and -h9 were then amplified from their respective pBK plasmids using EcWRS1.FA.NotI-F and EcWRS-NotI-R, digested with NotI, and cloned into NotI-digested pEvoltac-EcW-MjYRS-pAcF, producing pEvoltac-EcW-TGA-h14 or h9.
[0111] pBK MjYRS.sup.4 was used as a template to introduce tryptophanyl-tRNA synthetase variants. Top10 genomic DNA was purified using previously described protocols,.sup.34 and used as the template to amplify the EcTrpRS using EcTrpRS-NdeI-F and EcTrpRS-Pstl-R. The PCR product was digested with NdeI/PstI, producing pBK EcWRS.
[0112] Construction of mammalian suppression plasmids. Previously reported pAcBac1 was used to generate mammalian reporter and suppression plasmids..sup.30 To build pAcBac1 TrpRS, EcTrpRS was amplified from Top10 genomic DNA with TrpRS-F-NheI+TrpRS-R-EcoRI and subsequently cloned into pAcBacl via Nhel/EcoRI. U6-EcWtR Gblock was used as a template and amplified with tRNAtrp*-NheI-R+tRNAtrp* Nhe-R to produce the human U6 promoted. E. coli tRNA.sup.EcTrp.sub.CUA PCR product. This PCR product was then digested with AvrII/NheI and cloned into the AvrII site in the pAcBac variants, resulting in pAcBac1-TrpRS-U6EcWtR-TAG and pAcBac1-EGFP39*-U6EcWtR-TAG. EcWRS-h14 and -h9 synthetase variants were cloned into the pAcBac1-TrpRS-U6EcWtR-TAG plasmid via NheI/EcoRI digestion after amplification with EcWRS_mamNheI-F+EcWRS_mamEcoRI-R to make pAcBac1-EcWRS-h14-U6EcWtR-TAG and pAcBac1-EcWRS-h9-U6EcWtR-TAG.
[0113] Unnatural amino acids. 5HTP and 5MTP were purchased from Fisher Scientific, 5BrW and 5AmW were purchased from Chem-Impex International (Wood Dale, Ill.). AzK was purchased from Sirius Fine Chemicals. 5AzW was synthesized as previously described..sup.35
[0114] Yield of sfGFP reporters incorporating various UAAs expressed in ATMW1 are shown in Table 3, below.
TABLE-US-00003 TABLE 3 Yield UAA aaRS (mg/L) 5HTP h14 92 5MTP h14 89 5BrW h14 25 5AzW h14 80 5PrW h14 61 5AmW h9 68 sfGFPwt N/A 140
REFERENCES
[0115] 1. Chin, J. W. Expanding and reprogramming the genetic code of cells and animals. Annu. Rev. Biochem. 83, 379-408 (2014).
[0116] 2. Dumas, A., Lercher, L., Spicer, C. D. & Davis, B. G. Designing logical codon reassignmentExpanding the chemistry in biology. Chem. Sci. 6, 50-69 (2015).
[0117] 3. Liu. C. C. & Schultz, P. G. Adding new chemistries to the genetic code, Annu. Rev. Biochem. 79, 413-444 (2010).
[0118] 4. Wang, Brock, A., Herberich, B. &. Schultz, P. G. Expanding the genetic code of Escherichia coli, Science 292, 498-500 (2001).
[0119] 5. Chin, J. W. et al. An expanded eukaryotic genetic code. Science 301, 964-967 (2003).
[0120] 6. Wan, W., Tharp, J. M. & Liu, W. R. Pyrrolysyl-tRNA synthetase: an ordinary enzyme but an outstanding genetic code expansion tool. Biochim. Biophys. Acta 1844, 1059-1070 (2014).
[0121] 7. Anderson, J. C. et al. An expanded genetic code with a functional quadruplet codon. Proc. Natl. Acad. Sci. USA 101, 7566-7571 (2004).
[0122] 8. Chatterjee, A., Sun, S. B., Furman, J. L., Xiao, H. & Schultz, P. G. A versatile platform for single- and multiple-unnatural amino acid mutagenesis in Escherichia coli. Biochemistry 52, 1828-1837 (2013).
[0123] 9. Neumann, H., Wang, K., Davis, L., Garcia-Alai, M, & Chin, J. W. Encoding multiple unnatural amino acids via evolution of a quadruplet-decoding ribosome. Nature 464, 441-444 (2010).
[0124] 10. Wan, W. et al. A facile system for genetic incorporation of two different noncanonical amino acids into one protein in Escherichia coli. Angew. Chem. Int. Ed. 49, 3211-3214 (2010).
[0125] 11. Xiao, H. et al. Genetic incorporation of multiple unnatural amino acids into proteins in mammalian cells. Angew. Chem. Int. Ed. 52, 14080-14083 (2013).
[0126] 12. Iraha, F. et al. Functional replacement of the endogenous tyrosyl-tRNA synthetase-tRNATyr pair by the archaeal tyrosine pair in Escherichia coli for genetic code expansion. Nucleic Acids Res. 38, 3682-3691 (2010).
[0127] 13. Chatterjee, A., Xiao, H. & Schultz, P. G. Evolution of multiple, mutually orthogonal prolyl-tRNA synthetase/tRNA pairs for unnatural amino acid mutagenesis in Escherichia coli. Proc. Natl. Acad. Sci. USA 109, 14841-14846 (2012).
[0128] 14. Chatterjee, A., Xiao, H., Yang, P. Y., Soundararajan, G. & Schultz, P. G. A tryptophanyl-tRNA synthetase/tRNA pair for unnatural amino acid mutagenesis in E. coli. Angew. Chem. Int. Ed. 52, 5106-5109 (2013).
[0129] 15. Ellefson, J. W. et al. Directed evolution of genetic parts and circuits by compartmentalized partnered replication. Nat. Biotechnol. 32, 97-101 (2014).
[0130] 16. Wang, H. H. et al. Programming cells by multiplex genome engineering and accelerated evolution. Nature 460, 894-898 (2009).
[0131] 17. Warming, S., Costantino, N., Court, D. L., Jenkins, N. A. & Copeland, N. G. Simple and highly efficient BAC recombineering using galK selection. Nucleic Acids Res. 33, e36 (2005).
[0132] 18. Soll, L. & Berg, P. Recessive lethal nonsense suppressor in Escherichia coli which inserts glutamine. Nature 223, 1340-1342 (1969).
[0133] 19. Jahn, M., Rogers, M. J. & Soll, D. Anticodon and acceptor stem nucleotides in tRNA(Gln) are major recognition elements for E. coli glutaminyl-tRNA synthetase. Nature 352, 258-260 (1991).
[0134] 20. Rogers, M. J., Adachi, T., Inokuchi, H. & Soll, D. Switching tRNA(Gln) identity from glutamine to tryptophan. Proc. Natl. Acad. Sci. USA 89, 3463-3467 (1992).
[0135] 21. Kopelowitz, J., Hampe, C., Goldman, R., Reches, M. & Engelberg-Kulka, H. Influence of codon context on UGA suppression and readthrough. J. Mol. Biol. 225, 261-269 (1992).
[0136] 22. O'Donoghue, P. et al. Near-cognate suppression of amber, opal and quadruplet codons competes with aminoacyl-tRNAPyl for genetic code expansion. FEBS Lett. 586, 3931-3937 (2012).
[0137] 23. Santoro, S. W., Wang, L., Herberich, B., King, D. S. & Schultz, P. G. An efficient system for the evolution of aminoacyl-tRNA synthetase specificity. Nat. Biotechnol. 20, 1044-1048 (2002).
[0138] 24. Zhang, Z. et al. Selective incorporation of 5-hydroxytryptophan into proteins in mammalian cells. Proc. Natl. Acad. Sci. USA 101, 8882-8887 (2004).
[0139] 25. Antonczak, A. K. et al. Importance of single molecular determinants in the fidelity of expanded genetic codes. Proc. Natl. Acad. Sci. USA 108, 1320-1325 (2011).
[0140] 26. Cooley, R. B., Karplus, P. A. & Mehl, R. A. Gleaning unexpected fruits from hard-won synthetases: probing principles of permissivity in non-canonical amino acid-tRNA synthetases. ChemBioChem 15, 1810-1819 (2014).
[0141] 27. Young, D. D. et al. An evolved aminoacyl-tRNA synthetase with atypical polysubstrate specificity. Biochemistry 50, 1894-1900 (2011).
[0142] 28. Prather, N. E., Murgola, E. J. & Mims, B. H. Primary structure of an unusual glycine tRNA UGA suppressor, Nucleic Acids Res. 9, 6421-6428 (1981).
[0143] 29. Raftery, L. A., Egan, J. B., Cline, S. W. & Yarus, M. Defined set of cloned termination suppressors: in vivo activity of isogenetic UAG, UAA, and UGA suppressor tRNAs. J. Bacteriol. 158, 849-859 (1984).
[0144] 30. Chatterjee, A., Xiao, H., Bollong, M., Ai, H. W. & Schultz, P. G. Efficient viral delivery system for unnatural amino acid mutagenesis in mammalian cells. Proc. Natl. Acad. Sci. USA 110, 11803-11808 (2013).
[0145] 31. Guo, J., Melancon, C. E., 3rd, Lee, H. S., Groff, D. & Schultz, P. G. Evolution of amber suppressor tRNAs for efficient bacterial production of proteins containing nonnatural amino acids. Angew. Chem. Int. Ed. 48, 9148-9151 (2009).
[0146] 32. Melancon, C. E., 3rd & Schultz, P. G. One plasmid selection system for the rapid evolution of aminoacyl-tRNA synthetases. Bioorg. Med. Chem. Lett. 19, 3845-3847 (2009).
[0147] 33. Young, T. S., Ahmad, I., Yin, J. A. & Schultz, P. G. An enhanced system for unnatural amino acid mutagenesis in E. coli. J. Mol. Biol. 395, 361-374 (2010).
[0148] 34. Syn, C. K. & Swarup, S. A scalable protocol for the isolation of large-sized genomic DNA within an hour from several bacteria. Anal. Biochem. 278, 86-90 (2000).
[0149] 35. Li, M. &. Johnson, M. E. An efficient synthesis of 5-azidotryptophan. Tetrahedron Lett. 35, 6255-6258 (1994).
[0150] While this invention has been particularly shown and described with references to preferred embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the scope of the invention encompassed by the appended claims.