RNA-DIRECTED DNA CLEAVAGE BY THE Cas9-crRNA COMPLEX
20230272394 · 2023-08-31
Assignee
Inventors
- Virginijus Siksnys (Vilnius, LT)
- Giedrius Gasiunas (Birzu r., LT)
- Tautvydas Karvelis (Vilniaus r., LT)
- Arvydas Lubys (Vilnius, LT)
- Lolita Zaliauskiene (Vilnius, LT)
- Monika Gasiuniene (Vilnius, LT)
- Anja SMITH (Lafyette, CO, US)
Cpc classification
C12N2310/20
CHEMISTRY; METALLURGY
C12Q2522/101
CHEMISTRY; METALLURGY
C12Q2525/121
CHEMISTRY; METALLURGY
C12Q2525/151
CHEMISTRY; METALLURGY
C12N9/22
CHEMISTRY; METALLURGY
C12N2800/80
CHEMISTRY; METALLURGY
C12N15/113
CHEMISTRY; METALLURGY
C12Q1/6811
CHEMISTRY; METALLURGY
C12Q1/6811
CHEMISTRY; METALLURGY
C12P19/34
CHEMISTRY; METALLURGY
C12Q2525/151
CHEMISTRY; METALLURGY
C12Q2522/101
CHEMISTRY; METALLURGY
C12Q2525/121
CHEMISTRY; METALLURGY
International classification
C12N15/113
CHEMISTRY; METALLURGY
C12N15/10
CHEMISTRY; METALLURGY
C12Q1/6811
CHEMISTRY; METALLURGY
C12P19/34
CHEMISTRY; METALLURGY
C12N9/22
CHEMISTRY; METALLURGY
Abstract
Isolation or in vitro assembly of the Cas9-crRNA complex of the Streptococcus thermophilus CRISPR3/Cas system and use for cleavage of DNA bearing a nucleotide sequence complementary to the crRNA and a proto-spacer adjacent motif. Methods for site-specific modification of a target DNA molecule in vitro or in vivo using an RNA-guided DNA endonuclease comprising RNA sequences and at least one of an RuvC active site motif and an HNH active site motif; for conversion of Cas9 polypeptide into a nickase cleaving one strand of double-stranded DNA by inactivating one of the active sites (RuvC or HNH) in the polypeptide by at least one point mutation; for assembly of active polypeptide-polyribonucleotides complex in vivo or in vitro; and for re-programming a Cas9-crRNA complex specificity in vitro and using a cassette containing a single repeat-spacer-repeat unit.
Claims
1-49. (canceled)
50. A method for site-specific modification of a target DNA molecule, the method comprising: expressing a polynucleotide encoding a tracrRNA and an engineered crRNA to produce the tracrRNA and the engineered crRNA, wherein the engineered crRNA has a spacer sequence complementary to a nucleotide sequence of the target DNA molecule; expressing a polynucleotide encoding a Cas9 protein heterologous to the tracrRNA and the engineered crRNA to produce the Cas9 protein; and combining the tracrRNA and the engineered crRNA with the Cas9 protein to form a Cas9-crRNA complex; wherein the Cas9-crRNA complex is reprogrammed to cleave the target DNA molecule, the Cas9-crRNA complex being heterologous to the target DNA molecule.
51. The method of claim 50, further comprising preparing the polynucleotide encoding the tracrRNA and the engineered crRNA by chemical synthesis or in-vitro transcription.
52. The method of claim 50, wherein the spacer sequence of the engineered crRNA comprises at least 20 nucleotides.
53. The method of claim 50, wherein the Cas9 protein comprises a mutation in a RuvC active site motif or a HNH active site motif of the Cas9 protein.
54. The method of claim 50, wherein the polynucleotide encoding the tracrRNA and the engineered crRNA is a plasmid DNA.
55. The method of claim 50, further comprising contacting the Cas9-crRNA complex with the target DNA molecule to cleave the target DNA molecule.
56. The method of claim 55, wherein the target DNA molecule is double stranded.
57. A method for site-specific modification of a target DNA molecule, the method comprising: expressing a polynucleotide encoding an engineered crRNA to produce the engineered crRNA, wherein the engineered crRNA has a spacer sequence complementary to a nucleotide sequence of the target DNA molecule; expressing a polynucleotide encoding a Cas9 protein that is heterologous to the engineered crRNA to produce the Cas9 protein; and combining the engineered crRNA with the Cas9 protein and a tracrRNA to form a Cas9-crRNA complex, wherein the Cas9-crRNA complex is reprogrammed to cleave the target DNA molecule.
58. The method of claim 57, further comprising contacting the Cas9-crRNA complex with the target DNA molecule.
59. The method of claim 58, wherein the target DNA molecule is double stranded, and wherein the Cas9-crRNA complex modifies the target DNA molecule by site-specific double stranded cleavage of the target DNA molecule.
60. The method of claim 58, wherein the target DNA molecule includes a protospacer-adjacent motif.
61. The method of claim 57, wherein the Cas9 protein is a nickase.
62. The method of claim 57, wherein the Cas9-crRNA complex is formed in the absence of RNaseIII.
63. A method for site-specific modification of a target DNA molecule, the method comprising: re-programming a sequence of a polynucleotide to encode an engineered crRNA capable of site-specifically binding to a nucleotide sequence of the target DNA molecule; expressing the polynucleotide to produce the engineered crRNA; and combining the engineered crRNA with a tracrRNA, and a Cas9 protein heterologous to the engineered crRNA to form a Cas9-crRNA complex, wherein the Cas9-crRNA complex has engineered specificity towards the target DNA molecule.
64. The method of claim 63, further comprising contacting the Cas9-crRNA complex with the target DNA molecule to cleave the target DNA molecule, wherein the Cas9-crRNA complex modifies the target DNA molecule by site-specific cleavage of the target DNA molecule.
65. The method of claim 64, wherein the target DNA molecule is double stranded and includes a protospacer-adjacent motif.
66. The method of claim 63, wherein the spacer sequence of the engineered crRNA comprises at least 20 nucleotides.
67. The method of claim 63, wherein the polynucleotide encoding the engineered crRNA is prepared by chemical synthesis or in-vitro transcription.
68. The method of claim 63, wherein the Cas9-crRNA complex is formed in the absence of RNaseIII.
Description
BRIEF DESCRIPTION OF THE FIGURES
[0012]
[0013]
[0014]
[0015]
[0016]
[0017]
[0018]
[0019]
[0020]
[0021]
[0022]
[0023]
[0024]
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
[0033]
[0034]
[0035]
[0036]
[0037]
[0038]
[0039]
[0040]
[0041]
[0042] The following non-limiting examples further describe the methods, compositions, uses, and embodiments.
DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS
Example 1
[0043] In this example, we have isolated the Cas9-crRNA complex of S. thermophilus CRISPR3/Cas system and demonstrate that it cuts in a PAM dependent manner both synthetic oligodeoxynucleotide and plasmid DNA bearing a nucleotide sequence complementary to the crRNA. Furthermore, we provide experimental evidence that PAM is recognized in the context of double-stranded DNA and is critical for in vitro DNA binding and cleavage. Finally, we show that RuvC and HNH-motifs of Cas9 contribute to the cleavage of opposite DNA strands. Taken together, our data demonstrate that Cas9-crRNA complex functions as RNA-guided endonuclease which uses RNA module for the target site recognition and employs two separate active sites in the protein module for DNA cleavage. These findings pave the way for engineering of programmable Cas9-crRNA complexes as universal RNA-guided endonucleases.
[0044] Materials and Methods
[0045] DNA manipulations. Genomic DNA of Streptococcus thermophilus DGCC7710 strain was used as a template in PCR reactions to clone cas9. To generate a pASKIBA3-Cas9 plasmid which was used for the expression of the C-terminal Strep-tagged Cas9 protein variant, PCR fragment amplified with following primers: 5′-ACGTCTCAAATGTTGTTTAATAAGTGTATAATAATTTC-3′ (SEQ ID NO: 21) and 5′-ACGTCTCCGCGCTACCCTCTCCTAGTTTG-3′ (SEQ ID NO: 22) was cloned into the pASK-IBA3 expression vector via Esp3I sites. To generate a pBAD-Cas9 plasmid which was used for the expression of the C-terminal 6×His-tagged Cas9 protein variant (“6×His” disclosed as SEQ ID NO: 23), PCR fragment amplified with the following primer pair: 5′-ACGTCTCACATGACTAAGCCATACTCAATTGGAC-3′ (SEQ ID NO: 24) and 5′-ACTCGAGACCCTCTCCTAGTTTGGCAA-3′ (SEQ ID NO: 25) was cloned into the pBAD24-Chis expression vector via NcoI and XhoI sites. Full sequencing of cas9 gene in pASKIBA3-Cas9 and pBAD-Cas9 plasmids revealed no difference with the original cas9 sequence. To obtain plasmids pCas9(−)SP1 (
[0046] Expression and purification of Cas9 protein and Cas9-crRNA complex. (His)6-tagged (“(His)6” disclosed as SEQ ID NO: 23) version of Cas9 protein was expressed and purified using a scheme described for the Cas3 protein from S. thermophilus CRISPR4/Cas system (Sinkunas et al., 2011. EMBO J 30:1335-42). For purification of the Cas9-crRNA complex, Strep-tagged version of the Cas9 protein was expressed in E. coli RR1 strain, bearing pCas9(−)SP1 plasmid (
[0047] Northern blot analysis. Cas9-bound RNA was isolated from Strep-Tactin purified Cas9, co-expressed with pCas9(−)SP1 plasmid using the miRNeasy Mini kit (Qiagen). Northern blots were performed by running RNA on a 10% polyacrylamide gel with 7 M urea in 20 mM MOPS/NaOH pH 8 buffer. The RNA was transferred to a SensiBlot™ Plus Nylon Membrane (Fermentas) by semi-dry blotting using a Trans-blot SD (Bio-Rad). RNA was cross-linked to the membrane with 0.16 M l-ethyl-3-(3-dimethylaminopropyl) carbodiimide (EDC) (Pierce)/0.13 M 1-methylimidazole (Sigma) pH 8 at 60° C. for 1 h. The membrane was pre-hybridized with 2×SSC buffer containing 1% SDS and 0.1 mg/ml denatured DNA from fish testes (Ambion) for 1 h at 40° C. Blots were probed for 12 h with a .sup.32P-5′-labelled 42 nt anti-crRNA DNA oligonucleotide containing 20 nt of spacer1 and 22 nt of the repeat sequence (5′-TCGAAACAACACAGCTCTAAAACTGTCCTCTTCCTCTTTAGC-3′ (SEQ ID NO: 28)). The blots were washed 3× for 15 min with 0.2×SSC buffer containing 0.2% SDS, and were visualized using phosphorimaging. A 42 nt synthetic oligoribonucleotide (5′-CGCUAAAGAGGAAGAGGACAGUUUUAGAGCUGUGUUGUUUCG-3′ (SEQ ID NO: 7)) and 84 nt DNA oligonucleotide.
[0048] Oligonucleotide substrates. All oligonucleotide substrates used in this study are given in Table 1. Oligodeoxyribonucleotides were purchased from Metabion (Martinsried, Germany). The 5′-ends of oligonucleotides were radiolabelled using PNK (Fermentas) and [γ-33P]ATP (Hartmann Analytic). Duplexes were made by annealing two oligonucleotides with complementary sequences (SP1, SP1-Δp, SP2). Radioactive label was introduced at the 5′ end of individual DNA strand prior to the annealing with unlabelled strand.
[0049] Reactions with oligonucleotide substrates. Reactions were typically carried out by adding 2 nM of Cas9-crRNA complex to 1 nM labeled oligonucleotide in 10 mM Tris-HCl (pH 7.5 at 37° C.), 10 mM NaCl, 0.1 mg/ml BSA and 10 mM MgCl2 at 37° C. Aliquots were removed at timed intervals and quenched with loading dye (95% v/v formamide, 0.01% bromphenol blue, 25 mM EDTA, pH 9.0) and subjected to denaturing gel electrophoresis through 20% polyacrylamide followed by a FLA-5100 phosphorimager (Fujilm) detection.
[0050] Reactions with plasmid substrates. Reactions on pUC18 plasmid and its derivatives (Sapranauskas et al., 2011. Nucleic Acids Res 39:9275-82) were conducted at 37° C. in the buffer used for reactions on oligonucleotide substrates. Reaction mixtures typically contained 2.5 nM supercoiled plasmid and 2 nM of Cas9-crRNA complex. The reactions were initiated by adding protein to the mixture of the other components. Aliquots were removed at timed intervals and quenched with phenol/chloroform. The aqueous phase was mixed with loading dye solution (0.01% bromphenol blue and 75 mM EDTA in 50% v/v glycerol) and analyzed by electrophoresis through agarose.
[0051] Plasmid cleavage position determination. To achieve complete cleavage of plasmid substrate, 8 nM of Cas9-crRNA complex was incubated with 2.5 nM of supercoiled plasmid in the reaction buffer at 37° C. for 10 min. Reaction products were purified and concentrated using GeneJET PCR Purification Kit (Fermentas). Spacer1 surrounding region of Cas9 linearized and nicked plasmids were directly sequenced with the following primers: 5′-ccgcatcaggcgccattcgcc-3′ (SEQ ID NO: 29) (sequencing of (+)strand) and 5′-gcgaggaagcggaagagcgccc-3′ (SEQ ID NO: 30) (sequencing of (−)strand).
[0052] Binding assay. Increasing amounts of protein-crRNA complex were mixed with 0.5 nM of 33P-labeled double-stranded and single-stranded DNA substrates (Table 1) in the binding buffer (40 mM Tris-acetate, pH 8.3 at 25 C, 0.1 EDTA, 0.1 mg/ml BSA, 10% v/v glycerol) and incubated for 15 min at room temperature. Free DNA and protein-DNA complexes were separated on the non-denaturing 8% polyacrylamide gel (ratio of acrylamide/N,N′-methylenebisacrylamide 29:1) using 40 mM Tris-acetate (pH 8.3) supplemented with 0.1 mM EDTA as the running buffer. Electrophoresis was run at room temperature for 3 h at 6 V/cm.
[0053] Mutagenesis. The mutants D31A and N891A were obtained by the site-directed mutagenesis as previously described (Tamulaitis et al., 2007. Nucleic Acids Res 35:4792-9). Sequencing of the entire gene for each mutant confirmed that only the designed mutation had been introduced.
TABLE-US-00001 TABLE 1 Oligonucleotide substrates. Proto-spacer sequence is underlined, PAM is on bold. Oligo- nucleotide Sequence Specification SP1 5′-GCTCGAATTGAAATTCTAAACGCTAAAGAGGAAGAGGACATGGTGAATTCGTAAT-3′ 55 bp oligoduplex (SEQ ID NO: 3′-CGAGCTTAACTTTAAGATTTGCGATTTCTCCTTCTCCTGTACCACTTAAGCATTA-5′ substrate containing 31) proto-spacer1 and PAM SP1-pA (SEQ) 5′-GCTCGAATTGAAATTCTAAACGCTAAAGAGGAAGAGGACAAATTCGTAAT-3′ 50 bp oligoduplex ID NO: 32) 3′-CGAGCTTAACTTTAAGATTTGCGATTTCTCCTTCTCCTGTTTAAGCATTA-5′ substrate containing proto-spacer2 SP2 5′-GCTCGAATTGTACTGCTGTATTAGCTTGGTTGTTGGTTTGTGGTGAATTCGTAAT-3′ 55 bp oligoduplex (SEQ ID NO: 3′-CGAGCTTAACATGACGACATAATCGAACCAACAACCAAACACCACTTAAGCATTA-5′ substrate containing 33) proto-spacer2 and PAM (oligodublex without proto-spacer1) s (+) SP1 5′-ATTACGAATTCACCATGTCCTCTTCCTCTTTAGCGTTTAGAATTTCAATTCGAGC-3′ 55 nt ssDNA oligo- (SEQ ID NO: nucleotide substrate 34) (+) strand of SP1 oligoduplex s (+) SP1-pΔ 5′-ATTACGAATTTGTCCTCTTCCTCTTTAGCGTTTAGAATTTCAATTCGAGC-3′ 50 nt ssDNA oligo- (SEQ ID NO: nucleotide substrate 35) (+) strand of SP1- pa oligoduplex s (+) SP2 5′-ATTACGAATTCACCACAAACCAACAACCAAGCTAATACAGCAGTACAATTCGAGC-3′ 55 nt ssDNA oligo- (SEQ ID NO: nucleotides ubstrate, 36) (+) strand of SP2 oligoduplex s (-) SP1 5′-GCTCGAATTGAAATTCTAAACGCTAAAGAGGAAGAGGACATGGTGAATTCGTAAT-3′ 55 nt ssDNA oligo- (SEQ ID NO: nucleotide substrate, 37) (-) strand of SP1 oligoduplx SP1-20 (SEQ 5′-GCTCGAATTGCGCTAAAGAGGAAGAGGACATGGTGAATTCGTAAT-3′ 45 nt oligoduplex ID NO: 38) 3′-CGAGCTTAACGCGATTTCTCCTTCTCCTGTACCACTTAAGCATTA-5′ substrate containing 20 nt of proto- spacer1 and PAM SPN (SEQ ID 5′-GCTCGAATTGCCACCCAGCAAAATTCGGTTTTCTGGCTGATGGTGAATTCGTAAT-3′ 55 bp oligoduplex NO: 39) 3′-CGAGCTTAACGGTGGGTCGTTTTAAGCCAAAAGACCGACTACCACTTAAGCATTA-5′ substrate containing proto-spacerN and PAM
[0054] Results
[0055] Expression and purification of the Cas9-crRNA complex. The cas9 gene from the CRISR3 system of S. thermophilus DGCC7710 strain was cloned into the pASK-IBA3 vector to produce a construct encoding a Cas9 protein fusion containing a C-terminal Strep(II)-tag (
[0056] Cas9 protein co-purifies with crRNA. CRISPR3/Cas system of S. thermophilus belongs to the Type IIA subtype (former Nmeni or CASS4) of CRISPR/Cas systems (Makarova et al., 2011. Nat Rev Microbiol 9:467-77). It has been shown that in the Type IIA CRISPR/Cas system of Streptococcus pyogenes trans-encoded small RNA (tracrRNA) and bacterial RNaseIII are involved in the generation of crRNA (Deltcheva et al., 2011. Nature 471:602-7). Streptococcus pyogenes crRNA is only 42 nt in length and has no “5′-handle” which is conserved in crRNA's from Type I and III CRISPR systems (Hale et al., 2009. Cell 139:945-56; Jore et al., 2011. Nat Struct Mol Biol 18:529-36). According to the northern blot analysis crRNA of similar length is generated in the S. thermophilus LMD-9 CRISPR3/Cas system (Makarova et al., 2011. Nat Rev Microbiol 9:467-77), which is almost identical to the CRISPR3/Cas system of DGCC7710 strain (
[0057] Cas9 protein cleaves double-stranded DNA within a proto-spacer. To test in vitro activity of purified Cas9-crRNA complex we first used the SP1 oligoduplex (Table 1) containing the proto-spacer sequence identical to spacer SP1 in the CRISPR3 array, the PAM sequence 5′-TGGTG-3′ downstream of the proto-spacer, and 10 nt flanking sequences from pSP1 plasmid (Sapranauskas et al., 2011. Nucleic Acids Res 39:9275-82) (
[0058] To test whether the Cas9-crRNA complex can locate the proto-spacer and cut DNA in vitro in long DNA substrates mimicking in vivo invading foreign DNA we analyzed cleavage of pSP1 plasmid (Sapranauskas et al., 2011. Nucleic Acids Res 39:9275-82) (
[0059] The length of the spacer in the CRISPR3 region of S. thermophilus is 30 nt. According to the data provided in the
[0060] PAM is required for DNA binding and cleavage by Cas9-crRNA. Plasmids carrying a proto-spacer but not PAM (pSP1-pΔ) or multiple PAM's but no proto-spacer (pUC18) are resistant for Cas9-crRNA cleavage (
[0061] Consistent with the plasmid cleavage experiments, oligoduplexes which have only proto-spacer, but not PAM are not cut by Cas9-crRNA (
[0062] To test if PAM is important for DNA binding by the Cas9-crRNA complex, electrophoretic mobility shift experiments were performed. To avoid cleavage, binding experiments were performed in the absence of Mg2+ ions which are necessary for cleavage. Cas9-crRNA showed different binding patterns for double-stranded and single-stranded oligonucleotides. In the case of the SP1 oligoduplex a low mobility complex is observed already at 1 nM concentration (
[0063] On the other hand, single-stranded oligonucleotides ((+)strand) are bound by Cas9-crRNA with the same affinity independently of the PAM presence (
[0064] Since some Type III CRISPR systems provide RNA rather than DNA interference, we have studied RNA binding and cleavage by the Cas9-crRNA complex. The Cas9-crRNA did not cleave specifically either single-stranded RNA, or double-stranded RNA bearing a proto-spacer and PAM (
[0065] Mutagenesis of Cas9 protein RuvC and HNH motifs. Plasmid transformation experiments indicate that RuvC and HNH motifs (
[0066] To determine whether mutant proteins exhibit a strand preference, we analysed D31A and N891A mutant cleavage of the SP1 oligoduplex. RuvC active site mutant (D31A) cut (+) strand of oligoduplex at the same position as wt Cas9-crRNA protein. while the (−)strand stayed intact (
[0067] To test whether mutations altered DNA-binding affinity of mutant protein-crRNA complexes, DNA binding was studied using the electrophoretic mobility shift assay. Both mutant protein-crRNA complexes bound oligoduplex SP1 with the same affinity as wild type protein (
[0068] Discussion
[0069] Cas9-crRNA complex of CRISPR3/Cas system of S. thermophilus is crRNA-guided endonuclease. This work demonstrates that Cas9-crRNA complex of CRISPR3/Cas system of S. thermophilus is crRNA-directed endonuclease which cuts both DNA strands in the presence of Mg2+-ions within a protospacer 4 nt downstream of the PAM sequence to produce blunt end cleavage products. Sequence specificity of the Cas9-crRNA complex is dictated by the 42 nt crRNA which include ˜20 nt fragment complementary to the proto-spacer sequence in the target DNA. In this respect the mature crRNA in the Cas9 complex of CRISPR3/Cas system of S. thermophilus is similar to crRNA of Streptococcus pyogenes which has a 3-handle of repeat sequence but lacks part of the spacer sequence and 5′-handle corresponding to the repeat fragment (Deltcheva et al, 2011). Therefore, crRNA present in the Cas9-crRNA complex of CRISPR3/Cas system of S. thermophilus is complementary only to the part of the proto-spacer sequence distal to PAM. Not surprisingly, truncation of the 3-end of the proto-spacer sequence by 10 nucleotides has no effect on Cas9-crRNA cleavage of synthetic oligoduplexes or plasmid DNA (
[0070] The cleavage machinery of Cas9-crRNA complex resides in the Cas9 protein which provides two active sites for the phosphodiester bond cleavage. The RuvC- and HNH-like active sites of Cas9 protein are located on different domains and act independently on individual DNA strands. Alanine replacement of the active site residues in the RuvC- and HNH-motifs transforms Cas9-crRNA complex into a strand-specific nicking endonucleases similar to the nicking enzymes (Chan et al., 2011. Nucleic Acids Res 39:1-18). Consistent with in vivo studies. a functional activity of the Cas9-crRNA complex in vitro is absolutely dependent on the presence of the proto-spacer adjacent motif NGGNG upstream of the proto-spacer sequence. Data presented in the
[0071] Mechanism of DNA interference in the Type II systems. Our results establish a simple model for the mechanism of double-stranded DNA cleavage by Cas9-crRNA complex in the S. thermophilus CRISPR3/Cas system (
[0072] Comparison to other RNA interference complexes. The mechanism proposed here for the double-stranded DNA cleavage by the Cas9-crRNA complex differs significantly from that for the Type I-E (former E. coli or CASS2) system (Jore et al., 2011. Nat Struct Mol Biol 18:529-36). In the E. coli system crRNA and Cas proteins assemble into a large ribonucleoprotein complex named Cascade that facilitates target recognition by enhancing sequence-specific hybridization between the CRISPR RNA and complementary target sequences (Jore et al., 2011. Nat Struct Mol Biol 18:529-36). Target recognition is dependent on PAM and governed by the “seed” crRNA sequence located at the 5′-end of the spacer region (Semenova et al., 2011. Proc Natl Acad Sci USA 108:10098-103). However, while Cascade-crRNA complex alone is able to bind double-stranded DNA containing PAM and proto-spacer, it requires an accessory Cas3 protein for DNA cleavage. Cas3 is a single-stranded DNA nuclease and helicase which is able to cleave single-stranded DNA producing multiple cuts (Sinkunas et al., 2011. EMBO J 30:1335-42). The mechanistic details of the Cas3 action on a proper biological substrate (e.g., Cascade-crRNA bound to the double-stranded DNA in the R-loop like complex) have yet to be established. However, it has been demonstrated recently that Cas3 of M. jannaschii alone is able to cut both DNA strands in the synthetic substrate mimicking R-loop (Beloglazova et al., 2011. EMBO J 30:616-27). It is proposed that Cas3 may follow similar mechanism for DNA cleavage in the presence of Cascade-crRNA complex. Thus, current data clearly show that mechanistic details of the interference step for the Type I-E system differs from that of CRISPR3 system both by the catalytic machinery and mechanism and complexity.
[0073] In the III-B subtype CRISPR systems present in many archea and some bacteria, Cas module RAMP (Cmr) proteins and cRNA assemble into the effector complex that targets invading RNA (Hale et al., 2009. Cell 139:945-56; Hale et al., 2012. Mol Cell 45:292-302). In Pyrococcus furiosus RNA silencing complex comprised of six Cmr1-6 proteins and crRNA binds to the target RNA and cuts it at fixed distance in respect to 3′-end the psiRNA. The cleavage activity depends on Mg2+-ions however individual Cmr protein(-s) responsible for target RNA cleavage has yet to be identified. The effector complex of Sulfolobus solfataricus comprised of seven Cmr1-7 proteins and crRNA cuts invading RNA in an endonucleolytic reaction at UA dinucleotides (Zhang et al., 2012. Mol Cell 45: 303-13). Importantly, both Cmr-crRNA complexes perform RNA cleavage in a PAM independent manner.
[0074] The data provided here show that Cas9-crRNA complex of CRISPR3 system is so far the most simple DNA interference system comprised of a single Cas9 protein bound to the crRNA molecule. The simple modular organization of the Cas9-crRNA complex where specificity for DNA target is encoded by the crRNA and cleavage machinery is brought by the Cas protein provides a versatile platform for engineering of universal RNA-guided DNA endonucleases.
Example 2
[0075] In Vitro Assembly of Cas9-crRNA Complex from 4 Components
[0076] In this example we demonstrate that the catalytically active Cas9-crRNA complex can be assembled in vitro by mixing 4 individual components: the C-terminal (His)6-tagged variant of Cas9 protein (“(His)6” disclosed as SEQ ID NO: 23), tracrRNA transcript (SEQ ID NO: 5), CRISPR RNA transcript (SEQ ID NO: 8) and E. coli RNAseIII (Abgene). Cas9 protein is first pre-incubated with tracrRNA and CRISPR RNA transcripts, followed by the subsequent incubation with RNAselII to generate a catalytically competent Cas9-crRNA complex which is used for the site-specific DNA cleavage.
[0077] More specifically, RNA fragments required for complex assembly were produced by in vitro transcription (TranscriptAid™ T7 High Yield Transcription Kit, Fermentas) of PCR-generated fragment containing a T7 promoter at the proximal end of RNA coding sequence. PCR-generated DNA fragments encoding CRISPR RNA and tracrRNA were produced using pCas9(−)SP1 plasmid as a template with a following primer pair: 5′-taatacgactcactataGggtagaaaagatatcctacgagg-3′ (SEQ ID NO: 40)/5′-CAACAACCAAGCTAATACAGCAG-3′ (SEQ ID NO: 41) and 5′-aaaaacaccgaatcggtgccac-3′ (SEQ ID NO: 42)/5′-taatacgactcactataGggTAATAATAATTGTGGTTTGAAACCATTC-3′ (SEQ ID NO: 43) (T7 RNA polymerase promoter underlined, transcription start shown in bold). The 150 nt CRISPR RNA transcript is comprised of 102 nt Repeat-Spacer1-Repeat sequences flanked by the 23 nt upstream and 25 nt downstream regions required for primer annealing. The 105 nt transcript of tracrRNA is comprised of a 38 nt stretch partially complimentary to the S. thermophilus DCGG7710 CRISPR3 repeat sequence fragment (anti-repeat sequence), flanked by the 16 nt upstream and 51 nt downstream region. RNA fragments produced by in vitro transcription were purified using RNeasy MinElute Cleanup Kit (Qiagen).
[0078] For in vitro assembly of catalytically competent Cas9-crRNA complex, the (His)6-tagged Cas9 protein (“(His)6” disclosed as SEQ ID NO: 23) was mixed with CRISPR RNA and tracrRNA transcripts at 1:0.5:1 molar ratio and pre-incubated in a buffer containing 10 mM Tris-HCl (pH 7.5 at 37° C.), 100 mM NaCl at 37° C. for 30 min followed by addition of RNAselII (Ambion), MgCl2 and DTT and subsequent incubation for additional 30 min. The final concentrations of the components in the assembly mix were the following: 100 nM of (His)6-tagged Cas9 protein (“(His)6” disclosed as SEQ ID NO: 23), 50 nM of CRISPR RNA, 100 nM of tracrRNA, 50 nM RNAselII, 10 mM MgCl2 and 1 mM DTT.
[0079] Below we provide experimental evidences that in vitro assembled Cas9-crRNA complex guided by the crRNA sequence cleaves DNA at the specific site to generate blunt ends. In this respect Cas9-crRNA complex can be used an alternative for a restriction endonuclease or meganuclease for the site-specific DNA cleavage in vitro. The sequence specificity of the complex is dictated by the crRNA sequence which can be engineered to address a desirable DNA target.
[0080] First, the DNA cleavage activity of the in vitro assembled Cas9-crRNA complex was assayed on the plasmid substrates pSP1 and pUC18. The pSP1 plasmid contained a proto-spacer1 sequence flanked by the 5′-GGNG-3′PAM sequence. Proto-spacer1 sequence was not present in pUC18. Reactions on pUC18 and pSP1 plasmids (Sapranauskas et al., 2011. Nucleic Acids Res 39:9275-82) were conducted at 37° C. in the 10 mM Tris HCl (pH 7.5 at 37° C.), 50 mM NaCl, 0.05 mg/ml BSA, 0.5 mM DTT and 10 mM MgCl2. Reaction mixtures typically contained 3.0 nM of supercoiled plasmid DNA. The reactions were initiated by mixing 50 μl volumes of Cas9-crRNA complex and plasmid DNA (1:1 v/v ratio) in a reaction buffer. Aliquots were removed at timed intervals and quenched with phenol/chloroform. The aqueous phase was mixed with loading dye solution (0.01% bromphenol blue and 75 mM EDTA in 50% v/v glycerol) and reaction products analyzed by electrophoresis through agarose (
[0081] Next, the cleavage activity of the in vitro assembled Cas9-crRNA complex was assayed on a synthetic 55 bp oligodeoxynucleotide duplex SP1 containing a proto-spacer sequence matching to the spacer sequence of crRNA (
Example 3
[0082] In Vitro Assembly of Cas9-crRNA Complex from 3 Components
[0083] In this example we demonstrate that active Cas9-crRNA complex can be assembled in vitro by mixing 3 individual components: the C-terminal (His)6-tagged variant of Cas9 protein (“(His)6” disclosed as SEQ ID NO: 23), tracrRNA transcript provided in Example 1 (SEQ ID NO: 5 and SEQ ID NO: 6), and CRISPR RNA transcript (SEQ ID NO: 8) provided in Example 1 or synthetic crRNA (SEQ ID NO: 8) which corresponds to the putative crRNA of CRISPR3/Cas system of S. thermophilus DGCC7710 strain. Synthetic 42 nt oligoribonucleotide is comprised of 20 nt of identical to the spacer1 of CRISPR3 region at the 5′ terminus and 22 nt of repeat sequence at the 3′ end. More specifically, tracrRNA and CRISPR RNA transcripts were obtained as described in Example 1. To generate the Cas9-crRNA complex the (His)6-tagged Cas9 protein (“(His)6” disclosed as SEQ ID NO: 23) was mixed with tracrRNA and CRISPR RNA transcript, or 42 nt synthetic crRNA, at 1:0.5:1 molar ratio and incubated in a buffer containing 10 mM Tris-HCl (pH 7.5 at 37° C.). 100 mM NaCl at 37° C. for 1 h. The final concentrations of the components in the assembly mix were the following: 100 nM of (His)6-tagged Cas9 protein (“(His)6” disclosed as SEQ ID NO: 23), 50 nM of CRISPR RNA or 42 nt synthetic crRNA, 100 nM of tracrRNA.
[0084] Below we provide experimental evidences that in vitro assembled Cas9-crRNA complex guided by the crRNA sequence cleaves DNA at the specific site to generate blunt ends. In this respect Cas9-crRNA complex can be used an alternative for a restriction endonuclease or meganuclease for the site-specific DNA cleavage in vitro. The sequence specificity of the complex is dictated by the crRNA sequence which can be engineered to address a desirable DNA target.
[0085] First, the DNA cleavage activity of the in vitro assembled Cas9-crRNA complex was assayed on the plasmid substrates pSP1 and pUC18. The pSP1 plasmid contained a proto-spacer1 sequence flanked by the 5′-GGNG-3′PAM sequence. Proto-spacer1 sequence was not present in pUC18. Reactions on plasmid substrates (Sapranauskas et al., 2011. Nucleic Acids Res 39:9275-82) were conducted at 37° C. in the 10 mM Tris-HCl (pH 7.5 at 37° C.), 50 mM NaCl, 0.05 mg/ml BSA, 0.5 mM of DTT and 10 mM MgCl2. Reaction mixtures typically contained 3.0 nM of supercoiled plasmid DNA. The reactions were initiated by mixing 50 μl volumes of Cas9-crRNA complex and plasmid DNA (1:1 v/v ratio) in a reaction buffer. Aliquots were removed at timed intervals and quenched with phenol/chloroform. The aqueous phase was mixed with loading dye solution (0.01% bromphenol blue and 75 mM EDTA in 50% v/v glycerol) and reaction products analyzed by electrophoresis through agarose (
[0086] Next, the cleavage activity of the in vitro assembled Cas9-crRNA complex was assayed on a synthetic 55 bp oligodeoxynucleotide duplex SP1 containing a a proto-spacer sequence matching to the spacer sequence of crRNA (
Example 4
[0087] Interchangeable Spacer Cassette for the Re-Programing of the Cas9-crRNA Complex Specificity.
[0088] In this example we describe an interchangeable spacer cassette which allows to produce crRNA carrying a nucleotide sequence against any desirable DNA target to be used for assembly of the Cas9-crRNA complex described in Examples 1 and 2 (
[0089] As proof of the principle demonstration, we used an interchangeable spacer cassette to generate crRNA1 and crRNA2 which were engineered to target pUC18 plasmid at proto-spacer1 and proto-spacer2, respectively, incorporated crRNA1 and crRNA2 into Cas9 complex as described in the Example 1 and used these complexes for the cleavage of pUC18 plasmid. The proto-spacer N1 is located near the SapI restriction endonuclease site, while the proto-spacer N2 is in the vicinity of AatII site. The distance between SapI and AatII restriction sites is 775 bp, while the distance between the putative Cas9-crRNA complex cleavage sites located in the spacers N1 and N2 is 612 bp (
Example 5
[0090] Cloning Procedure Using Cas9-crRNA Complex.
[0091] In this example we demonstrate that Cas9-crRNA complex may be used to prepare a vector for cloning procedure. First we demonstrated that cleavage products obtained by the Cas9-crRNA complex can be re-ligated by DNA ligase. We purified linear pSP1 cleavage product from agarose gel and re-ligated it using DNA ligase. After transformation of E. coli cells by the ligation mix, five individual clones were selected from resulting transformants, plasmid DNA was purified and subjected to sequencing. Sequence analysis revealed that the DNA sequence of the pSP1 plasmid in the locus that was cleaved by Cas9-RNA complex and re-ligated was identical to the sequence of the non-treated plasmid. E. coli transformation by the ligation mix in the absence of T4 DNA ligase did not produce transformants indicating that no traces of supercoiled plasmid are co-purified with the linear reaction product. This result illustrates, that the DNA ends generated by the Cas9 cleavage are substrates for T4 DNA ligase, and therefore must contain a phosphate at the 5′ terminus and a free OH group at the 3′ terminus (Lehman, 1974).
[0092] Next we analyzed cleavage of pUC18 plasmid with Cas9 complex loaded with crRNA1 and crRNA2 described in Example 5 (
[0093] More specifically, the 2 μg pUC18 was incubated with the mix of separately assembled Cas9-RNA complexes (250 nM each) containing different crRNAs for 1 hour at 37° C. in 100 μl reaction volume (10 mM Tris-HCl (pH 7.5 at 37° C.), 100 mM NaCl, 1 mM DTT and 10 mM MgCl.sub.2). Obtained vector fragment was purified from agarose gel using GeneJET gel extraction Kit (Thermo Fisher scientific) and divided in to two equal parts. One part of pre-cleaved vector was dephosphorylated with the FastAP alkaline phosphatase while another part was untreated. 1282 bp insert containing a promoter and a tetracycline resistance gene was obtained from the pACYC184 plasmid by PCR. After purification using the GeneJET PCR Purification Kit (Thermo Fisher scientific), a solution containing the PCR fragment was divided in to two parts. One part was phosphorylated with T4 polynucleotide kinase (Thermo Fisher scientific) while another part remained untreated. Untreated vector was ligated with the untreated PCR fragment, while a dephosphorylated vector was ligated with a phosphorylated fragment using the T4 DNA ligase (Thermo Fisher scientific). Clones were selected on a media supplemented with 100 μg/ml of Ap and 25 μg/ml Tc.
Example 6
[0094] Cleavage of Long DNA Substrates by Cas9 crRNA Complex.
[0095] In this example we demonstrate that Cas9-crRNA may be addressed to cleave targets in long DNA molecules, including phage Δ, E. coli and human genomic DNAs.
[0096] More specifically, we addressed Cas9-RNA complex to cleave specific sites in A bacteriophage (48 kb), E. coli BL-21 strain (4.6 Mb) and human (3.2 Gb) genomic DNAs. Cas9-crRNA complex was assembled as described in Examples 2 and 3. We used 42 nt long synthetic crRNAs, 150 nt pre-crRNAs and tracrRNAs synthesized using in vitro transcription from templates generated as described in Example 4.
[0097] A DNA cleavage reactions were initiated by mixing A DNA (Thermo Fisher Scientific) with assembled Cas9-RNA complex (1:1 v/v ratio) and incubating at 37° C. Final reaction mixture contained 2 μg A DNA, 50 nM Cas9-RNA complex, 10 mM Tris-HCl (pH 7.5 at 37° C.), 100 mM NaCl, 1 mM DTT and 10 mM MgCl.sub.2 in 100 μl reaction volume. Aliquots were removed at timed intervals and quenched with phenol/chloroform. The aqueous phase was mixed with 3× loading dye solution (0.01% bromphenol blue and 75 mM EDTA in 50% v/v glycerol) and reaction products analyzed by electrophoresis through agarose gels and ethidium bromide staining. The analysis of linear A phage genomic DNA cleavage products in agarose gel confirmed that ˜40 bp length DNA is efficiently cleaved at a single site (
[0098] DNA from E. coli BL21 (DE3) strain was isolated using the Genomic DNA purification kit (Thermo Fisher Scientific). For cleavage assay, E. coli genomic DNA was combined with assembled Cas9-RNA complex (1:1 v/v ratio) and incubated for 3 hours at 37° C. Final reaction mixture contained 30 μg genomic DNA, 1 μM Cas9-RNA complex, 10 mM Tris-HCl (pH 7.5 at 37° C.), 100 mM NaCl, 1 mM DTT and 10 mM MgCl.sub.2 in 300 μl reaction volume. Following incubation, 30 μl of FastDigest PstI (Thermo Fisher Scientific) was added and the reaction mix was incubated for additional 16 hours at 37° C. The reaction was terminated by heating the reaction mixture for 30 min at 55° C. with Proteinase K (0.5 mg/ml; Thermo Fisher Scientific) and SDS (0.5%, w/v) followed by 30 min incubation at room temperature with RNase A (0.25 mg/ml; Thermo Fisher Scientific). After phenol/chloroform extraction, DNA was precipitated by isopropanol and dissolved in TE buffer (10 mM Tris-HCl, pH 8.0 and 1 mM EDTA). 10 μg of DNA was mixed with 3× loading dye solution (0.01% bromphenol blue and 75 mM EDTA in 50% v/v glycerol) and electrophoresed on 1% agarose gel.
[0099] To analyse Cas9-crRNA cleavage products of E. coli genomic DNA, we designed a probe against DNA fragment containing a Cas9-RNA complex target (a proto-spacer) (
[0100] The probe was designed to target DNA fragment containing a target (a proto-spacer) for the Cas9-RNA complex (
[0101] To analyze Cas9-crRNA cleavage products of human genomic DNA we used DNA extracted from human brain. Human genomic DNA was combined with assembled Cas9-crRNA complex (1:1 v/v ratio) and incubated for 30 min at 37° C. Final reaction mixture contained 1 μg genomic DNA, 100 nM Cas9, 10 mM Tris-HCl (pH 7.5 at 37° C.), 100 mM NaCl. 1 mM DTT and 10 mM MgCl.sub.2 in 100 μl reaction volume. Cas9-crRNA-HS1 (SeqID #13) and Cas9-crRNA-HS2 (SeqID #14) complexes were assembled to target RASGEF1C or ARL15 loci, respectively. Cleavage products were analyzed using qPCR (
Example 7
[0102] Evidence for Gene Editing of a Reporter Plasmid in Mammalian Cells after Transfection of Cas9/RNA Complexes.
[0103] A reporter plasmid was constructed to monitor double-strand break repair either through non-homologous end-joining (NHEJ) or homologous recombination (HR). The plasmid contained GFP with an intron and flanking the eGFP sequences are 5′ and 3′ sequences of RFP as well as sites of homology (
[0104] The crRNA targeting used 42 nucleotide RNA molecules, as described above, having 22 nucleotides that are the repeat sequence, and 20 nucleotides (spacer sequence) are for the specific target. As described above, the target DNA needs the S. thermophilus motif or PAM which is “NGGNG” downstream of the protospacer in the target. GFP was not “engineered” to contain this PAM motif; several target sequences within eGFP naturally occur with the PAM sequence and crRNAs were designed to target the adjacent spacer sequences. RFP was a marker for homologous recombination after a double strand break in eGFP was created by Cas9/RNA.
[0105]
[0106] S. thermophilus Cas9 protein, purified from E. coli, was complexed with in vitro-transcribed tracrRNA and synthetic unmodified crRNA targeting either sequence A (intronic) or sequence C (coding) of eGFP. For transfection, the Cas9/RNA complexes (either targeting A or C) were incubated with the transfection reagent TurboFECT and the reporter plasmid DNA was also incubated with TurboFECT in separate tubes and they were both added to CHO-K1 cells. The percentage of eGFP-positive cells was determined by flow cytometry. As shown in
[0107] In addition to analyzing the percentage of eGFP-positive cells, transfected cells were also visualized by fluorescent microscopy to monitor the appearance of RFP-positive cells, an indication of repair of Cas9-mediated double strand break by HR rather than NHEJ. As seen in
Example 8
[0108] Cas9/RNA Complexes Made Using Synthetic Unmodified tracrRNAs and crRNAs are Functional In Vitro.
[0109] The experiments described in Example 7 above used Cas9/RNA complexes comprised of purified Cas9, synthetic crRNAs, and in vitro-transcribed tracrRNA. To determine whether Cas9/RNA complexes were functional when made using fully synthetic RNA components (crRNA and tracrRNA), unmodified S. thermophilus tracrRNAs (both endogenous 89-mer and a shorter 74-mer version that is expected to maintain functionality) were synthesized. The unmodified synthetic crRNAs were generated against target E (see
[0110] As seen in
TABLE-US-00002 Sequences SEQ ID NO: 1 WT Cas9 S. thermophilus DGCC7710 CRISPR3-Cas strain One letter: mlfnkciiisinldfsnkekcmtkpysigldigtnsvgwavitdnykvpskkmkvlgntskkyikknllgvllfdsgitaegrrlkrtarrrytrrr nrilylqeifstematlddaffqrlddsflvpddkrdskypifgnlveekvyhdefptiyhlrkyladstkkadlrlvylalahmikyrghfliegef nsknndiqknfqdfldtynaifesdlslenskqleeivkdkisklekkdrilklfpgeknsgifseflklivgnqadfrkcfnldekaslhfskesy dedletllgyigddysdvflkakklydaillsgfltvtdneteaplssamikrynehkedlallkeyirnislktynevfkddtkngyagyidgktn qedfyvylknllaefegadyflekidredflrkqrtfdngsipyqihlqemraildkqakfypflaknkeriekiltfripyyvgplargnsdfaws irkrnekitpwnfedvidkessaeafinrmtsfdlylpeekvlpkhsllyetfnvyneltkvrfiaesmrdyqfldskqkkdivrlyfkdkrkvtd kdiieylhaiygydgielkgiekqfnsslstyhdllniindkeflddssneaiieeiihtltifedremikqrlskfenifdksvlkklsrrhytgwgkl saklingirdeksgntildyliddgisnrnfmqlihddalsfkkkiqkaqiigdedkgnikevvkslpgspaikkgilqsikivdelvkvmggrk pesivvemarenqytnqgksnsqqrlkrlekslkelgskilkenipaklskidnnalqndrlylyylqngkdmytgddldidrlsnydidhiip qaflkdnsidnkvlvssasnrgksddfpslevvkkrktfwyqllksklisqrkfdnltkaerggllpedkagfiqrqlvetrqitkhvarlldekfn nkkdennravrtvkiitlkstlvsqfrkdfelykvreindfhhahdaylnaviasallkkypklepefvygdypkynsfrerksatekvyfysni mnifkksisladgrvierplievneetgesvwnkesdlatvrrvlsypqvnvvkkveeqnhgldrgkpkglfnanlsskpkpnsnenlvgak eyldpkkyggyagisnsfavlvkgtiekgakkkitnvlefqgisildrinyrkdklnfIlekgykdieliielpkyslfelsdgsrrmlasilstnnkr geihkgnqiflsqkfvkllyhakrisntinenhrkyvenhkkefeelfyyilefnenyvgakkngkllnsafqswqnhsidelcssfigptgser kglfeltsrgsaadfeflgvkipryrdytpssllkdatlihqsvtglyetridlaklgeg Three letters: MetLeuPheAsnLysCysIleIleIleSerIleAsnLeuAspPheSerAsnLysGluLys ValIleThrAspAsnTyrLysValProSerLysLysMetLysValLeuGlyAsnThrSer LysLysTyrIleLysLysAsnLeuLeuGlyValLeuLeuPheAspSerGlyIleThrAla GluGlyArgArgLeuLysArgThrAlaArgArgArgTyrThrArgArgArgAsnArgIle LeuTyrLeuGlnGluIlePheSerThrGluMetAlaThrLeuAspAspAlaPhePheGln ArgLouAspAspSerPhcLcuValProAspAspLysArgAspSerLysTyrProIlcPhe GlyAsnLeuValGluGluLysValTyrHisAspGluPheProThrIleTyrHisLeuArg LysTyrLeuAlaAspSerThrLysLysAlaAspLeuArgLeuValTyrLeuAlaLeuAla HisMetIleLysTyrArgGlyHisPheLeuIleGluGlyGluPheAsnSerLysAsnAsn AspIleGlnLysAsnPheGlnAspPheLeuAspThrTyrAsnAlaIlePheGluSerAsp LeuSerLeuGluAsnSerLysGlnLeuGluGluIleValLysAspLysIleSerLysLeu GluLysLysAspArgIleLeuLysLeuPheProGlyGluLysAsnSerGlyIlePheSer GluPheLeuLysLeuIleValGlyAsnGlnAlaAspPheArgLysCysPheAsnLeuAsp GluLysAlaSerLeuHisPheSerLysGluSerTyrAspGluAspLeuGluThrLeuLeu GlyTyrIleGlyAspAspTyrSerAspValPheLeuLysAlaLysLysLeuTyrAspAla IlcLeuLeuSerGlyPheLcuThrValThrAspAsnGluThrGluAlaProLcuScrSer AlaMetIleLysArgTyrAsnGluHisLysGluAspLeuAlaLeuLeuLysGluTyrIle ArgAsnIleSerLeuLysThrTyrAsnGluValPheLysAspAspThrLysAsnGlyTyr AlaGlyTyrIleAspGlyLysThrAsnGlnGluAspPheTyrValTyrLeuLysAsnLeu LeuAlaGluPheGluGlyAlaAspTyrPheLeuGluLysIleAspArgGluAspPheLeu ArgLysGlnArgThrPheAspAsnGlySerIleProTyrGlnIleHisLeuGlnGluMet ArgAlaIleLeuAspLysGlnAlaLysPheTyrProPheLeuAlaLysAsnLysGluArg IleGluLysIleLeuThrPheArgIleProTyrTyrValGlyProLeuAlaArgGlyAsn SerAspPheAlaTrpSerIleArgLysArgAsnGluLysIleThrProTrpAsnPheGlu AspValIleAspLysGluSerSerAlaGluAlaPheIleAsnArgMetThrSerPheAsp LeuTyrLeuProGluGluLysValLeuProLysHisSerLeuLeuTyrGluThrPheAsn ValTyrAsnGluLeuThrLysValArgPheIleAlaGluSerMetArgAspTyrGlnPhe LeuAspSerLysGlnLysLysAspIleValArgLeuTyrPheLysAspLysArgLysVal ThrAspLysAspIleIleGluTyrLeuHisAlaIleTyrGlyTyrAspGlyIleGluLeu LysGlyIleGluLysGlnPheAsnSerSerLeuSerThrTyrHisAspLeuLeuAsnIle IleAsnAspLysGluPheLeuAspAspSerSerAsnGluAlaIleIleGluGluIleIle HisThrLeuThrIlePheGluAspArgGluMetIleLysGlnArgLeuSerLysPheGlu AsnIlePheAspLysSerValLeuLysLysLeuSerArgArgHisTyrThrGlyTrpGly LysLeuSerAlaLysLeuIleAsnGlyIleArgAspGluLysSerGlyAsnThrIleLeu AspTyrLeuIleAspAspGlyIleSerAsnArgAsnPheMetGlnLeuIleHisAspAsp AlaLeuSerPheLysLysLysIleGlnLysAlaGlnIleIleGlyAspGluAspLysGly AsnIlcLysGluValValLysScrLcuProGlySerProAlaIleLysLysGlyIlcLeu GlnSerIleLysIleValAspGluLeuValLysValMetGlyGlyArgLysProGluSer IleValValGluMetAlaArgGluAsnGlnTyrThrAsnGlnGlyLysSerAsnSerGln GlnArgLeuLysArgLeuGluLysSerLeuLysGluLeuGlySerLysIleLeuLysGlu AsnIleProAlaLysLeuSerLysIleAspAsnAsnAlaLeuGlnAsnAspArgLeuTyr LeuTyrTyrLeuGlnAsnGlyLysAspMetTyrThrGlyAspAspLcuAspIlcAspArg LeuSerAsnTyrAspIleAspHisIleIleProGlnAlaPheLeuLysAspAsnSerIle AspAsnLysValLeuValSerSerAlaSerAsnArgGlyLysSerAspAspPheProSer LeuGluValValLysLysArgLysThrPheTrpTyrGlnLeuLeuLysSerLysLeuIle SerGlnArgLysPheAspAsnLeuThrLysAlaGluArgGlyGlyLeuLeuProGluAsp LysAlaGlyPheIleGlnArgGlnLeuValGluThrArgGlnIleThrLysHisValAla ArgLeuLeuAspGluLysPheAsnAsnLysLysAspGluAsnAsnArgAlaValArgThr ValLysIleIleThrLeuLysSerThrLeuValSerGlnPheArgLysAspPheGluLeu TyrLysValArgGluIleAsnAspPheHisHisAlaHisAspAlaTyrLeuAsnAla Val IleAlaSerAlaLeuLeuLysLysTyrProLysLeuGluProGluPheValTyrGlyAsp TyrProLysTyrAsnScrPhcArgGluArgLysScrAlaThrGluLysValTyrPhcTyr SerAsnIleMetAsnIlePheLysLysSerIleSerLeuAlaAspGlyArgValIleGlu ArgProLeuIleGluValAsnGluGluThrGlyGluSerValTrpAsnLysGluSerAsp LeuAlaThrValArgArgValLeuSerTyrProGlnValAsn ValValLysLysValGlu GluGlnAsnHisGlyLeuAspArgGlyLysProLysGlyLeuPheAsnAlaAsnLeuSer SerLysProLysProAsnSerAsnGluAsnLeuValGlyAlaLysGluTyrLeuAspPro LysLysTyrGlyGlyTyrAlaGlyIleSerAsnSerPheAlaValLeuValLysGlyThr IleGluLysGlyAlaLysLysLysIleThrAsnValLeuGluPheGlnGlyIleSerIle LeuAspArgIleAsnTyrArgLysAspLysLeuAsnPheLeuLeuGluLysGlyTyrLys AspIleGluLeuIleIleGluLeuProLysTyrSerLeuPheGluLeuSerAspGlySer ArgArgMetLeuAlaSerIleLeuSerThrAsnAsnLysArgGlyGluIleHisLysGly AsnGlnIlePheLeuSerGinLysPheValLysLeuLeuTyrHisAlaLysArgIleSer AsnThrIleAsnGluAsnHisArgLysTyrValGluAsnHisLysLysGluPheGluGlu LeuPheTyrTyrIleLeuGluPheAsnGluAsnTyrValGlyAlaLysLysAsnGlyLys LeuLeuAsnSerAlaPheGlnSerTrpGlnAsnHisSerIleAspGluLeuCysSerSer PheIleGlyProThrGlySerGluArgLysGlyLeuPheGluLeuThrSerArgGlySer AlaAlaAspPheGluPheLeuGly ValLysIleProArgTyrArgAspTyrThrProSer SerLeuLeuLysAspAlaThrLeuIleHisGlnSerValThrGlyLeuTyrGluThrArg IleAspLeuAlaLysLeuGlyGluGly SEQ ID NO: 2 D31A mutant One letter: mlfnkciiisinldfsnkckcmtkpysiglaigtnsvgwavitdnykvpskkmkvlgntskkyikknllgvllfdsgitaegrrlkrtarrrytrrr nrilylqeifstematlddaffqrlddsflvpddkrdskypifgnlveekvyhdefptiyhlrkyladstkkadlrlvylalahmikyrghfliegef nsknndiqknfqdfldtynaifesdlslenskqleeivkdkisklekkdrilklfpgeknsgifseflklivgnqadfrkcfnldekaslhfskesy dedletllgyigddysdvflkakklydaillsgfltvtdneteaplssamikrynehkedlallkeyirnislktynevfkddtkngyagyidgktn qedfyvylknllaefegadyflekidredflrkqrtfdngsipyqihlqemraildkqakfypflaknkeriekiltfripyyvgplargnsdfaws irkrnekitpwnfedvidkessaeafinrmtsfdlylpeekvlpkhsllyetfnvyneltkvrfiaesmrdyqfldskqkkdivrlyfkdkrkvtd kdiieylhaiygydgielkgiekqfnsslstyhdllniindkeflddssneaiieeiihtltifedremikqrlskfenifdksvlkklsrrhytgwgkl saklingirdeksgntildyliddgisnrfmqlihddalsfkkkiqkaqiigdedkgnikevvkslpgspaikkgilqsikivdelvkvmggrk pesivvemarenqytnqgksnsqqrlkrlekslkelgskilkenipaklskidnnalqndrlylyylqngkdmytgddldidrlsnydidhiip qaflkdnsidnkvlvssasnrgksddfpslevvkkrktfwyqllksklisqrkfdnltkaerggllpedkagfiqrqlvetrqitkhvarlldekfn nkkdennravrtvkiitlkstlvsqfrkdfclykvrcindfhhahdaylnaviasallkkypklcpcfvygdypkynsfrerksatckvyfysni mnifkksisladgrvierplievneetgesvwnkesdlatvrrvlsypqvnvvkkveeqnhgldrgkpkglfnanlsskpkpnsnenlvgak eyldpkkyggyagisnsfavlvkgtiekgakkkitnvlefqgisildrinyrkdklnfIlekgykdieliielpkyslfelsdgsrrmlasilstnnkr geihkgnqiflsqkfvkllyhakrisntinenhrkyvenhkkefeelfyyilefnenyvgakkngkllnsafqswqnhsidelcssfigptgser kglfeltsrgsaadfeflgvkipryrdytpssllkdatlihqsvtglyetridlaklgeg Three letters: MetLeuPheAsnLysCysIleIleIleSerIleAsnLeuAspPheSerAsnLysGluLys CysMetThrLysProTyrSerIleGlyLeuAlaIleGlyThrAsnSerValGlyTrpAla ValIleThrAspAsnTyrLysValProSerLysLysMetLysValLeuGlyAsnThrSer LysLysTyrIleLysLysAsnLeuLeuGlyValLeuLeuPheAspSerGlyIleThrAla GluGlyArgArgLeuLysArgThrAlaArgArgArgTyrThrArgArgArgAsnArgIle LeuTyrLeuGlnGluIlePheSerThrGluMetAlaThrLeuAspAspAlaPhePheGln ArgLeuAspAspSerPheLeuValProAspAspLysArgAspSerLysTyrProIlePhe GlyAsnLeuValGluGluLysValTyrHisAspGluPheProThrIleTyrHisLeuArg LysTyrLeuAlaAspSerThrLysLysAlaAspLeuArgLeuValTyrLeuAlaLeuAla HisMetIleLysTyrArgGlyHisPheLeuIleGluGlyGluPheAsnSerLysAsnAsn AspIleGlnLysAsnPheGlnAspPheLeuAspThrTyrAsnAlaIlePheGluSerAsp LeuSerLeuGluAsnSerLysGlnLeuGluGluIleValLysAspLysIleSerLysLeu GluLysLysAspArgIleLeuLysLeuPheProGlyGluLysAsnSerGlyIlePheSer GluPheLeuLysLeuIleValGlyAsnGlnAlaAspPheArgLysCysPheAsnLeuAsp GluLysAlaScrLeuHisPhcScrLysGluSerTyrAspGluAspLcuGluThrLcuLcu GlyTyrIleGlyAspAspTyrSerAspValPheLeuLysAlaLysLysLeuTyrAspAla IleLeuLeuSerGlyPheLeuThrValThrAspAsnGluThrGluAlaProLeuSerSer AlaMetIleLysArgTyrAsnGluHisLysGluAspLeuAlaLeuLeuLysGluTyrIle ArgAsnIleSerLeuLysThrTyrAsnGluValPheLysAspAspThrLysAsnGlyTyr AlaGlyTyrIlcAspGlyLysThrAsnGlnGluAspPhcTyrValTyrLcuLysAsnLcu LeuAlaGluPheGluGlyAlaAspTyrPheLeuGluLysIleAspArgGluAspPheLeu ArgLysGlnArgThrPheAspAsnGlySerIleProTyrGlnIleHisLeuGlnGluMet ArgAlaIleLeuAspLysGlnAlaLysPheTyrProPheLeuAlaLysAsnLysGluArg IleGluLysIleLeuThrPheArgIleProTyrTyrValGlyProLeuAlaArgGlyAsn SerAspPheAlaTrpSerIleArgLysArgAsnGluLysIleThrProTrpAsnPheGlu AspValIleAspLysGluSerSerAlaGluAlaPheIleAsnArgMetThrSerPheAsp LeuTyrLeuProGluGluLysValLeuProLysHisSerLeuLeuTyrGluThrPheAsn ValTyrAsnGluLeuThrLysValArgPheIleAlaGluSerMetArgAspTyrGlnPhe LeuAspSerLysGlnLysLysAspIleValArgLeuTyrPheLysAspLysArgLysVal ThrAspLysAspIlcIlcGluTyrLcuHisAlaIlcTyrGlyTyrAspGlyIlcGluLcu LysGlyIleGluLysGlnPheAsnSerSerLeuSerThrTyrHisAspLeuLeuAsnIle IleAsnAspLysGluPheLeuAspAspSerSerAsnGluAlaIleIleGluGluIleIle HisThrLeuThrIlePheGluAspArgGluMetIleLysGlnArgLeuSerLysPheGlu AsnIlePheAspLysSerValLeuLysLysLeuSerArgArgHisTyrThrGlyTrpGly LysLeuSerAlaLysLeuIleAsnGlyIleArgAspGluLysSerGlyAsnThrIleLeu AspTyrLeuIleAspAspGlyIleSerAsnArgAsnPheMetGinLeuIleHisAspAsp AlaLeuSerPheLysLysLysIleGlnLysAlaGlnIleIleGlyAspGluAspLysGly AsnIleLysGluValValLysSerLeuProGlySerProAlaIleLysLysGlyIleLeu GlnSerIleLysIleValAspGluLeuValLysValMetGlyGlyArgLysProGluSer IleValValGluMetAlaArgGluAsnGlnTyrThrAsnGlnGlyLysSerAsnSerGln GlnArgLeuLysArgLeuGluLysSerLeuLysGluLeuGlySerLysIleLeuLysGlu AsnIleProAlaLysLeuSerLysIleAspAsnAsnAlaLeuGlnAsnAspArgLeuTyr LeuTyrTyrLeuGlnAsnGlyLysAspMetTyrThrGlyAspAspLeuAspIleAspArg LeuSerAsnTyrAspIleAspHisIleIleProGlnAlaPheLeuLysAspAsnSerIle AspAsnLys ValLeuValSerSerAlaSerAsnArgGlyLysSerAspAspPheProSer LeuGluValValLysLysArgLysThrPheTrpTyrGlnLeuLeuLysSerLysLeuIle SerGlnArgLysPheAspAsnLeuThrLysAlaGluArgGlyGlyLeuLeuProGluAsp LysAlaGlyPheIleGlnArgGlnLeuValGluThrArgGlnIleThrLysHisValAla ArgLeuLeuAspGluLysPheAsnAsnLysLysAspGluAsnAsnArgAlaValArgThr ValLysIleIleThrLeuLysSerThrLeuValSerGlnPheArgLysAspPheGluLeu TyrLysValArgGlullcAsnAspPhcHisHisAlaHisAspAlaTyrLcuAsnAlaVal IleAlaSerAlaLeuLeuLysLysTyrProLysLeuGluProGluPheValTyrGlyAsp TyrProLysTyrAsnSerPheArgGluArgLysSerAlaThrGluLysValTyrPheTyr SerAsnIleMetAsnIlePheLysLysSerIleSerLeuAlaAspGlyArgValIleGlu ArgProLeuIleGluValAsnGluGluThrGlyGluSerValTrpAsnLysGluSerAsp LeuAlaThrValArgArgValLcuScrTyrProGlnValAsnValValLysLysValGlu GluGlnAsnHisGlyLeuAspArgGlyLysProLysGlyLeuPheAsnAlaAsnLeuSer SerLysProLysProAsnSerAsnGluAsnLeuValGlyAlaLysGluTyrLeuAspPro LysLysTyrGlyGlyTyrAlaGlyIleSerAsnSerPheAlaValLeuValLysGlyThr IleGluLysGlyAlaLysLysLysIleThrAsnValLeuGluPheGlnGlyIleSerIle LeuAspArgIleAsnTyrArgLysAspLysLeuAsnPheLeuLeuGluLysGlyTyrLys AspIleGluLeuIleIleGluLeuProLysTyrSerLeuPheGluLeuSerAspGlySer ArgArgMetLeuAlaSerIleLeuSerThrAsnAsnLysArgGlyGluIleHisLysGly AsnGlnIlePheLeuSerGlnLysPheValLysLeuLeuTyrHisAlaLysArgIleSer AsnThrIleAsnGluAsnHisArgLysTyrValGluAsnHisLysLysGluPheGluGlu LeuPhcTyrTyrIlcLeuGluPheAsnGluAsnTyrValGlyAlaLysLysAsnGlyLys LeuLeuAsnSerAlaPheGlnSerTrpGlnAsnHisSerIleAspGluLeuCysSerSer PheIleGlyProThrGlySerGluArgLysGlyLeuPheGluLeuThrSerArgGlySer AlaAlaAspPheGluPheLeuGlyValLysIleProArgTyrArgAspTyrThrProSer SerLeuLeuLysAspAlaThrLeuIleHisGlnSerValThrGlyLeuTyrGluThrArg IleAspLeuAlaLysLeuGlyGluGly SEQ ID NO: 3 N891A mutant One letter: mlfnkciiisinldfsnkekcmtkpysigldigtnsvgwavitdnykvpskkmkvlgntskkyikknllgvllfdsgitaegrrlkrtarrrytrrr nrilylqeifstematlddaffqrlddsflvpddkrdskypifgnlveekvyhdefptiyhlrkyladstkkadlrlvylalahmikyrghfliegef nsknndiqknfqdfldtynaifesdlslenskqleeivkdkisklekkdrilklfpgeknsgifseflklivgnqadfrkcfnldekaslhfskesy dedletllgyigddysdvflkakklydaillsgfltvtdneteaplssamikrynehkedlallkeyirnislktynevfkddtkngyagyidgktn qedfyvylknllaefegadyflekidredflrkqrtfdngsipyqihlqemraildkqakfypflaknkeriekiltfripyyvgplargnsdfaws irkrnekitpwnfedvidkessaeafinrmtsfdlylpeekvlpkhsllyetfnvyneltkvrfiaesmrdyqfldskqkkdivrlyfkdkrkvtd kdiieylhaiygydgielkgiekqfnsslstyhdllniindkeflddssneaiieeiihtltifedremikqrlskfenifdksvlkklsrrhytgwgkl saklingirdeksgntildyliddgisnrnfmqlihddalsfkkkiqkaqiigdedkgnikevvkslpgspaikkgilqsikivdelvkvmggrk pcsivvcmarenqytnqgksnsqqrlkrlekslkelgskilkenipaklskidnnalqndrlylyylqngkdmytgddldidrlsnydidhiip qaflkdnsidnkvlvssasargksddfpslevvkkrktfwyqllksklisqrkfdnltkaerggllpedkagfiqrqlvetrqitkhvarlldekfn nkkdennravrtvkiitlkstlvsqfrkdfelykvreindfhhahdaylnaviasallkkypklepefvygdypkynsfrerksatekvyfysni mnifkksisladgrvierplievneetgesvwnkesdlatvrrvlsypqvnvvkkveeqnhgldrgkpkglfnanlsskpkpnsnenlvgak eyldpkkyggyagisnsfavlvkgtiekgakkkitnvlefqgisildrinyrkdklnfIlekgykdieliielpkyslfelsdgsrrmlasilstnnkr kglfeltsrgsaadfeflgvkipryrdytpssllkdatlihqsvtglyetridlaklgeg Three letters: MetLeuPheAsnLysCysIleIleIleSerIleAsnLeuAspPheSerAsnLysGluLys CysMetThrLysProTyrSerIleGlyLeuAspIleGlyThrAsnSerValGlyTrpAla ValIleThrAspAsnTyrLysValProSerLysLysMetLysValLeuGlyAsnThrSer LysLysTyrIleLysLysAsnLeuLeuGlyValLeuLeuPheAspSerGlyIleThrAla GluGlyArgArgLeuLysArgThrAlaArgArgArgTyrThrArgArgArgAsnArgIle LeuTyrLeuGlnGluIlePheSerThrGluMetAlaThrLeuAspAspAlaPhePheGln ArgLcuAspAspScrPheLcuValProAspAspLysArgAspScrLysTyrProIlcPhe GlyAsnLeuValGluGluLysValTyrHisAspGluPheProThrIleTyrHisLeuArg LysTyrLeuAlaAspSerThrLysLysAlaAspLeuArgLeuValTyrLeuAlaLeuAla HisMetIleLysTyrArgGlyHisPheLeuIleGluGlyGluPheAsnSerLysAsnAsn AspIleGlnLysAsnPheGlnAspPheLeuAspThrTyrAsnAlaIlePheGluSerAsp LeuSerLeuGluAsnSerLysGlnLeuGluGluIleValLysAspLysIleSerLysLeu GluLysLysAspArgIleLeuLysLeuPheProGlyGluLysAsnSerGlyIlePheSer GluPheLeuLysLeuIleValGlyAsnGlnAlaAspPheArgLysCysPheAsnLeuAsp GluLysAlaSerLeuHisPheSerLysGluSerTyrAspGluAspLeuGluThrLeuLeu GlyTyrIleGlyAspAspTyrSerAspValPheLeuLysAlaLysLysLeuTyrAspAla IleLeuLeuSerGlyPheLeuThrValThrAspAsnGluThrGluAlaProLeuSerSer AlaMetIleLysArgTyrAsnGluHisLysGluAspLeuAlaLeuLeuLysGluTyrIle ArgAsnIleSerLeuLysThrTyrAsnGluValPheLysAspAspThrLysAsnGlyTyr AlaGlyTyrIleAspGlyLysThrAsnGlnGluAspPheTyrValTyrLeuLysAsnLeu LeuAlaGluPheGluGlyAlaAspTyrPheLeuGluLysIleAspArgGluAspPheLeu ArgLysGlnArgThrPheAspAsnGlySerIleProTyrGlnIleHisLeuGlnGluMet ArgAlaIleLeuAspLysGlnAlaLysPheTyrProPheLeuAlaLysAsnLysGluArg IleGluLysIleLeuThrPheArgIleProTyrTyrValGlyProLeuAlaArgGlyAsn SerAspPheAlaTrpSerIleArgLysArgAsnGluLysIleThrProTrpAsnPheGlu AspValIleAspLysGluSerSerAlaGluAlaPheIleAsnArgMetThrSerPheAsp LeuTyrLeuProGluGluLysValLeuProLysHisSerLeuLeuTyrGluThrPheAsn ValTyrAsnGluLcuThrLysValArgPhcIlcAlaGluScrMctArgAspTyrGlnPhc LeuAspSerLysGlnLysLysAspIleValArgLeuTyrPheLysAspLysArgLysVal ThrAspLysAspIleIleGluTyrLeuHisAlaIleTyrGlyTyrAspGlyIleGluLeu LysGlyIleGluLysGlnPheAsnSerSerLeuSerThrTyrHisAspLeuLeuAsnIle IleAsnAspLysGluPheLeuAspAspSerSerAsnGluAlaIleIleGluGluIleIle HisThrLcuThrIlcPhcGluAspArgGluMctIlcLysGlnArgLcuScrLysPhcGlu AsnIlePheAspLysSerValLeuLysLysLeuSerArgArgHisTyrThrGlyTrpGly LysLeuSerAlaLysLeuIleAsnGlyIleArgAspGluLysSerGlyAsnThrIleLeu AspTyrLeuIleAspAspGlyIleSerAsnArgAsnPheMetGlnLeuIleHisAspAsp AlaLeuSerPheLysLysLysIleGlnLysAlaGlnIleIleGlyAspGluAspLysGly AsnIleLysGluValValLysSerLeuProGlySerProAlaIleLysLysGlyIleLeu GlnSerIleLysIleValAspGluLeuValLysValMetGlyGlyArgLysProGluSer IleValValGluMetAlaArgGluAsnGlnTyrThrAsnGlnGlyLysSerAsnSerGln GlnArgLeuLysArgLeuGluLysSerLeuLysGluLeuGlySerLysIleLeuLysGlu AsnIleProAlaLysLeuSerLysIleAspAsnAsnAlaLeuGlnAsnAspArgLeuTyr LeuTyrTyrLeuGlnAsnGlyLysAspMctTyrThrGlyAspAspLeuAspIlcAspArg LeuSerAsnTyrAspIleAspHisIleIleProGInAlaPheLeuLysAspAsnSerIle AspAsnLysValLeuValSerSerAlaSerAlaArgGlyLysSerAspAspPheProSer LeuGhiValValLysLysArgLysThrPheTrpTyrGlnLeuLeuLysSerLysLeuIle SerGlnArgLysPheAspAsnLeuThrLysAlaGluArgGlyGlyLeuLeuProGluAsp LysAlaGlyPheIleGlnArgGlnLeuValGluThrArgGlnIleThrLysHisValAla ArgLeuLeuAspGluLysPheAsnAsnLysLysAspGluAsnAsnArgAlaValArgThr ValLysIleIleThrLeuLysSerThrLeuValSerGlnPheArgLysAspPheGluLeu TyrLysValArgGluIleAsnAspPheHisHisAlaHisAspAlaTyrLeuAsnAla Val IleAlaSerAlaLeuLeuLysLysTyrProLysLeuGluProGluPheValTyrGlyAsp TyrProLysTyrAsnSerPheArgGluArgLysSerAlaThrGluLysValTyrPheTyr SerAsnIleMetAsnIlePheLysLysSerIleSerLeuAlaAspGlyArgValIleGlu ArgProLeuIleGluValAsnGluGluThrGlyGluSerValTrpAsnLysGluSerAsp LeuAlaThrValArgArgValLeuSerTyrProGlnValAsnValValLysLysValGlu GluGlnAsnHisGlyLeuAspArgGlyLysProLysGlyLeuPheAsnAlaAsnLeuSer SerLysProLysProAsnSerAsnGluAsnLeuValGlyAlaLysGluTyrLeuAspPro LysLysTyrGlyGlyTyrAlaGlyIleSerAsnSerPheAlaValLeuValLysGlyThr IleGluLysGlyAlaLysLysLysIleThrAsnValLeuGluPheGlnGlyIleSerIle LeuAspArgIleAsnTyrArgLysAspLysLeuAsnPheLeuLeuGluLysGlyTyrLys AspIleGluLeuIleIleGluLeuProLysTyrSerLeuPheGluLeuSerAspGlySer ArgArgMetLeuAlaSerIleLeuSerThrAsnAsnLysArgGlyGluIleHisLysGly AsnGlnIlcPhcLeuSerGlnLysPhcValLysLeuLcuTyrHisAlaLysArgIleSer AsnThrIleAsnGluAsnHisArgLysTyrValGluAsnHisLysLysGluPheGluGlu LeuPheTyrTyrIleLeuGluPheAsnGluAsnTyrValGlyAlaLysLysAsnGlyLys LeuLeuAsnSerAlaPheGlnSerTrpGlnAsnHisSerIleAspGluLeuCysSerSer PheIleGlyProThrGlySerGluArgLysGlyLeuPheGluLeuThrSerArgGlySer AlaAlaAspPheGluPheLcuGlyValLysIlcProArgTyrArgAspTyrThrProSer SerLeuLeuLysAspAlaThrLeuIleHisGlnSerValThrGlyLeuTyrGluThrArg IleAspLeuAlaLysLeuGlyGluGly SEQ ID NO: 4 H868A mutant One letter mlfnkciiisinldfsnkckcmtkpysigldigtnsvgwavitdnykvpskkmkvlgntskkyikknllgvllfdsgitacgrrlkrtarrrytrrr nrilylqeifstematlddaffqrlddsflvpddkrdskypifgnlveekvyhdefptiyhlrkyladstkkadlrlvylalahmikyrghfliegef nsknndiqknfqdfldtynaifesdlslenskqleeivkdkisklekkdrilklfpgeknsgifseflklivgnqadfrkcfnldekaslhfskesy dedletllgyigddysdvflkakklydaillsgfltvtdneteaplssamikrynehkedlallkeyirnislktynevfkddtkngyagyidgktn qedfyvylknllaefegadyflekidredflrkqrtfdngsipyqihlqemraildkqakfypflaknkeriekiltfripyyvgplargnsdfaws irkrnekitpwnfedvidkessaeafinrmtsfdlylpeekvlpkhsllyetfnvyneltkvrfiaesmrdyqfldskqkkdivrlyfkdkrkvtd kdiieylhaiygydgielkgiekqfnsslstyhdllniindkeflddssneaiieeiihtltifedremikqrlskfenifdksvlkklsrrhytgwgkl saklingirdeksgntildyliddgisnrnfmqlihddalsfkkkiqkaqiigdedkgnikevvkslpgspaikkgilqsikivdelvkvmggrk pesivvemarenqytnqgksnsqqrlkrlekslkelgskilkenipaklskidnnalqndrlylyylqngkdmytgddldidrlsnydidaiipq aflkdnsidnkvlvssasnrgksddfpslevvkkrktfwyqllksklisqrkfdnltkaerggllpedkagfiqrqlvetrqitkhvarlldekfnn kkdennravrtvkiitlkstlvsqfrkdfelykvreindfhhahdaylnaviasallkkypklepefvygdypkynsfrerksatekvyfysnim nifkksisladgrvierplievneetgesvwnkesdlatvrrvlsypqvnvvkkveeqnhgldrgkpkglfnanlsskpkpnsnenlvgakey ldpkkyggyagisnsfavlvkgtiekgakkkitnvlefqgisildrinyrkdklnfIlekgykdieliielpkyslfelsdgsrrmlasilstnnkrge ihkgnqiflsqkfvkllyhakrisntinenhrkyvenhkkefeelfyyilefnenyvgakkngkllnsafqswqnhsidelcssfigptgserkgl feltsrgsaadfeflgvkipryrdytpssllkdatlihqsvtglyetridlaklgeg Three letters: MetLeuPheAsnLysCysIleIleIleSerIleAsnLeuAspPheSerAsnLysGluLys CysMetThrLysProTyrSerIleGlyLeuAspIleGlyThrAsnSerValGlyTrpAla ValIleThrAspAsnTyrLysValProSerLysLysMetLysValLeuGlyAsnThrSer LysLysTyrIleLysLysAsnLeuLeuGlyValLeuLeuPheAspSerGlyIleThrAla GluGlyArgArgLcuLysArgThrAlaArgArgArgTyrThrArgArgArgAsnArgIlc LeuTyrLeuGlnGluIlePheSerThrGluMetAlaThrLeuAspAspAlaPhePheGln ArgLeuAspAspSerPheLeuValProAspAspLysArgAspSerLysTyrProIlePhe GlyAsnLeuValGluGluLysValTyrHisAspGluPheProThrIleTyrHisLeuArg LysTyrLeuAlaAspSerThrLysLysAlaAspLeuArgLeuValTyrLeuAlaLeuAla HisMetIleLysTyrArgGlyHisPhcLcuIlcGluGlyGluPheAsnSerLysAsnAsn AspIleGlnLysAsnPheGlnAspPheLeuAspThrTyrAsnAlaIlePheGluSerAsp LeuSerLeuGluAsnSerLysGlnLeuGluGluIleValLysAspLysIleSerLysLeu GluLysLysAspArgIleLeuLysLeuPheProGlyGluLysAsnSerGlyIlePheSer GluPheLeuLysLeuIleValGlyAsnGlnAlaAspPheArgLysCysPheAsnLeuAsp GluLysAlaSerLeuHisPheSerLysGluSerTyrAspGluAspLeuGluThrLeuLeu GlyTyrIleGlyAspAspTyrSerAspValPheLeuLysAlaLysLysLeuTyrAspAla IleLeuLeuSerGlyPheLeuThrValThrAspAsnGluThrGluAlaProLeuSerSer AlaMetIleLysArgTyrAsnGluHisLysGluAspLeuAlaLeuLeuLysGluTyrIle ArgAsnIleSerLeuLysThrTyrAsnGluValPheLysAspAspThrLysAsnGlyTyr AlaGlyTyrIlcAspGlyLysThrAsnGlnGluAspPhcTyrValTyrLcuLysAsnLcu LeuAlaGluPheGluGlyAlaAspTyrPheLeuGluLysIleAspArgGluAspPheLeu ArgLysGlnArgThrPheAspAsnGlySerIleProTyrGlnIleHisLeuGlnGluMet ArgAlaIleLeuAspLysGlnAlaLysPheTyrProPheLeuAlaLysAsnLysGluArg IleGluLysIleLeuThrPheArgIleProTyrTyrValGlyProLeuAlaArgGlyAsn SerAspPheAlaTrpSerIleArgLysArgAsnGluLysIleThrProTrpAsnPheGlu AspValIleAspLysGluSerSerAlaGluAlaPheIleAsnArgMetThrSerPheAsp LeuTyrLeuProGluGluLysValLeuProLysHisSerLeuLeuTyrGluThrPheAsn ValTyrAsnGluLeuThrLysValArgPheIleAlaGluSerMetArgAspTyrGlnPhe LeuAspSerLysGlnLysLysAspIleValArgLeuTyrPheLysAspLysArgLysVal ThrAspLysAspIleIleGluTyrLeuHisAlaIleTyrGlyTyrAspGlyIleGluLeu LysGlyIleGluLysGlnPheAsnSerSerLeuSerThrTyrHisAspLeuLeuAsnIle IleAsnAspLysGluPheLeuAspAspSerSerAsnGluAlaIleIleGluGluIleIle HisThrLeuThrIlePheGluAspArgGluMetIleLysGlnArgLeuSerLysPheGlu AsnIlePheAspLysSerValLeuLysLysLeuSerArgArgHisTyrThrGlyTrpGly LysLeuSerAlaLysLeuIleAsnGlyIleArgAspGluLysSerGlyAsnThrIleLeu AspTyrLeuIleAspAspGlyIleSerAsnArgAsnPheMetGlnLeuIleHisAspAsp AlaLeuSerPheLysLysLysIleGlnLysAlaGlnIleIleGlyAspGluAspLysGly AsnIleLysGluValValLysSerLeuProGlySerProAlaIleLysLysGlyIleLeu GlnSerIleLysIleValAspGluLeuValLysValMetGlyGlyArgLysProGluSer IleValValGluMetAlaArgGluAsnGlnTyrThrAsnGlnGlyLysSerAsnSerGln GlnArgLcuLysArgLcuGluLysScrLcuLysGluLcuGlySerLysIlcLeuLysGlu AsnIleProAlaLysLeuSerLysIleAspAsnAsnAlaLeuGlnAsnAspArgLeuTyr LeuTyrTyrLeuGlnAsnGlyLysAspMetTyrThrGlyAspAspLeuAspIleAspArg LeuSerAsnTyrAspIleAspAlaIleIleProGlnAlaPheLeuLysAspAsnSerIle AspAsnLysValLeuValSerSerAlaSerAsnArgGlyLysSerAspAspPheProSer LeuGluValValLysLysArgLysThrPheTrpTyrGlnLcuLcuLysSerLysLeuIle SerGlnArgLysPheAspAsnLeuThrLysAlaGluArgGlyGlyLeuLeuProGluAsp LysAlaGlyPheIleGlnArgGlnLeuValGluThrArgGlnIleThrLysHisValAla ArgLeuLeuAspGluLysPheAsnAsnLysLysAspGluAsnAsnArgAlaValArgThr ValLysIleIleThrLeuLysSerThrLeuValSerGlnPheArgLysAspPheGluLeu TyrLysValArgGluIleAsnAspPheHisHisAlaHisAspAlaTyrLeuAsnAlaVal IleAlaSerAlaLeuLeuLysLysTyrProLysLeuGluProGluPheValTyrGlyAsp TyrProLysTyrAsnSerPheArgGluArgLysSerAlaThrGluLysValTyrPheTyr SerAsnIleMetAsnIlePheLysLysSerIleSerLeuAlaAspGlyArgValIleGlu ArgProLeuIleGluValAsnGluGluThrGlyGluSerValTrpAsnLysGluSerAsp LeuAlaThrValArgArgValLcuSerTyrProGlnValAsnValValLysLysValGlu GluGlnAsnHisGlyLeuAspArgGlyLysProLysGlyLeuPheAsnAlaAsnLeuSer SerLysProLysProAsnSerAsnGluAsnLeuValGlyAlaLysGluTyrLeuAspPro LysLysTyrGlyGlyTyrAlaGlyIleSerAsnSerPheAlaValLeuValLysGlyThr IleGluLysGlyAlaLysLysLysIleThrAsnValLeuGluPheGlnGlyIleSerIle LeuAspArgIleAsnTyrArgLysAspLysLeuAsnPheLeuLeuGluLysGlyTyrLys AspIleGluLeuIleIleGluLeuProLysTyrSerLeuPheGluLeuSerAspGlySer ArgArgMetLeuAlaSerIleLeuSerThrAsnAsnLysArgGlyGluIleHisLysGly AsnGlnIlePheLeuSerGlnLysPheValLysLeuLeuTyrHisAlaLysArgIleSer AsnThrIleAsnGluAsnHisArgLysTyrValGluAsnHisLysLysGluPheGluGlu LeuPheTyrTyrIleLeuGluPheAsnGluAsnTyrValGlyAlaLysLysAsnGlyLys LeuLeuAsnSerAlaPheGlnSerTrpGlnAsnHisSerIleAspGluLeuCysSerSer PheIleGlyProThrGlySerGluArgLysGlyLeuPheGluLeuThrSerArgGlySer AlaAlaAspPheGluPheLeuGlyValLysIleProArgTyrArgAspTyrThrProSer SerLeuLeuLysAspAlaThrLeuIleHisGlnSerValThrGlyLeuTyrGluThrArg IleAspLeuAlaLysLeuGlyGluGly SEQ ID NO: 5 Tra-crRNA, Unmature (102 nt): uaauaauaauugugguuugaaaccauucgaaacaacacagcgaguuaaaauaaggcuuaguccguacucaacuugaaaagguggcac cgauucgguguuuuu SEQ ID NO: 6 Mature 78 nt tracrRNA: gggcgaaacaacacagcgaguuaaaauaaggcuuaguccguacucaacuugaaaagguggcaccgauucgguguuuuu Shorter variants: gggcgaaacaacacagcgaguuaaaauaaggcuuaguccguacucaacuugaaaagguggcaccgauucggug (SEQ ID NO: 44) gggcgaaacaacacagcgaguuaaaauaaggcuuaguccguacucaacuugaaaagguggcaccgauu (SEQ ID NO: 45) gggcgaaacaacacagcgaguuaaaauaaggcuuaguccguacucaacuugaaaagguggcac (SEQ ID NO: 46) gggcgaaacaacacagcgaguuaaaauaaggcuuaguccguacucaacuugaaaaggu (SEQ ID NO: 47) gggcgaaacaacacagcgaguuaaaauaaggcuuaguccguacucaacuugaa (SEQ ID NO: 48) gggcgaaacaacacagcgaguuaaaauaaggcuuaguccguacucaac (SEQ ID NO: 49) SEQ ID NO: 7 42 nt crRNA from spacer 1: 5′-CGCUAAAGAGGAAGAGGACAGUUUUAGAGCUGUGUUGUUUCG-3′ SEQ ID NO: 8 150 nt pre-crRNA 5′- ggguagaaaagauauccuacgagguuuuagagcuguguuguuucgaaugguuccaaaacaaauucuaa acgcuaaagaggaagaggacaguuuuagagcuguguuguuucgaaugguuccaaaacuacugcuguau uagcuugguuguug-3′ SEQ ID NO: 9 crRNA1 5′- ggguagaaaagauauccuacgagguuuuagagcuguguuguuucgaaugguuccaaaacTGTCATGA TAATAATGGTTTCTTAGACGTCguuuuagagcuguguuguuucgaaugguuccaaaacuacugcug uauuagcuugguuguug-3′ SEQ ID NO: 10 crRNA2 5′- ggguagaaaagauauccuacgagguuuuagagcuguguuguuucgaaugguuccaaaacacgagccg gaagcataaagtgtaaagcctgguuuuagagcuguguuguuucgaaugguuccaaaacuacugcug uauuagcuugguuguug-3′ SEQ ID NO: 11 Anti-A phage CRISPR RNA 5′- ggguagaaaagauauccuacgagguuuuagagcuguguuguuucgaaugguuccaaaactcaaggga gaatagaggctctogttgcattguuuuagagcuguguuguuucgaaugguuccaaaacuacugcug uauuagcuugguuguug-3′ SEQ ID NO: 12 Anti E. coli CRISPR RNA 5′- ggguagaaaagauauccuacgagguuuuagagcuguguuguuucgaaugguuccaaaaccgggaggg aagctgcatgatgcgatgttatguuuuagagcuguguuguuucgaaugguuccaaaacuacugcug uauuagcuugguuguug-3′ SEQ ID NO: 13 crRNA-HS1 5′-GCUCCCGGGGCUCGAUGAAGGUUUUAGAGCUGUGUUGUUUCG-3′ SEQ ID NO: 14 crRNA-HS2 UGAAUCGUGAAAUCUGCUCAGUUUUAGAGCUGUGUUGUUUCG
[0111] The application contains a Sequence Listing which has been submitted in ASCII format and is hereby incorporated by reference in its entirety. The ASCII copy, created on Mar. 20, 2013, is named 078981_6_SL.txt and is 64.4 kilobytes in size.
[0112] The embodiments shown and described in the specification are only specific embodiments of inventors who are skilled in the art and are not limiting in any way. Therefore, various changes, modifications, or alterations to those embodiments may be made without departing from the spirit of the invention in the scope of the following claims. The references cited are expressly incorporated by reference herein in their entirety.