PROTEIN HAVING NUCLEASE ACTIVITY, FUSION PROTEINS AND USES THEREOF
20220002757 · 2022-01-06
Inventors
Cpc classification
A01K67/0275
HUMAN NECESSITIES
C12N9/22
CHEMISTRY; METALLURGY
C12N2800/80
CHEMISTRY; METALLURGY
A01K2207/05
HUMAN NECESSITIES
A01K2217/07
HUMAN NECESSITIES
C12N2999/007
CHEMISTRY; METALLURGY
C07K2319/80
CHEMISTRY; METALLURGY
C12N15/8509
CHEMISTRY; METALLURGY
C12Y301/21004
CHEMISTRY; METALLURGY
International classification
C12N15/90
CHEMISTRY; METALLURGY
Abstract
The present invention relates to a nucleic acid molecule encoding (I) a polypeptide having the activity of an endonuclease, which is (a) a nucleic acid molecule encoding a polypeptide comprising or consisting of the amino acid sequence of SEQ ID NO: 1; (b) a nucleic acid molecule comprising or consisting of the nucleotide sequence of SEQ ID NO: 2; (c) a nucleic acid molecule encoding an endonuclease, the amino acid sequence of which is at least 70% identical to the amino acid sequence of SEQ ID NO: 1; (d) a nucleic acid molecule comprising or consisting of a nucleotide sequence which is at least 50% identical to the nucleotide sequence of SEQ ID NO: 2; (e) a nucleic acid molecule which is degenerate with respect to the nucleic acid molecule of (d); or (f) a nucleic acid molecule corresponding to the nucleic acid molecule of any one of (a) to (e) wherein T is replaced by U; (II) a fragment of the polypeptide of (I) having the activity of an endonuclease. Also, the present invention relates to a vector comprising the nucleic acid molecule and a protein encoded by said nucleic acid molecule. Further, the invention relates to a method of modifying the genome of a eukaryotic cell and a method of producing a non-human vertebrate or mammal.
Claims
1. A nucleic acid molecule encoding (I) a polypeptide having the activity of an endonuclease, which is selected from the group consisting of: (a) a nucleic acid molecule encoding a polypeptide comprising or consisting of the amino acid sequence of SEQ ID NO: 1; (b) a nucleic acid molecule comprising or consisting of the nucleotide sequence of SEQ ID NO: 2; (c) a nucleic acid molecule encoding an endonuclease, the amino acid sequence of which is at least 70% identical to the amino acid sequence of SEQ ID NO: 1; (d) a nucleic acid molecule comprising or consisting of a nucleotide sequence which is at least 50% identical to the nucleotide sequence of SEQ ID NO: 2; (e) a nucleic acid molecule which is degenerate with respect to the nucleic acid molecule of (d); and (f) a nucleic acid molecule corresponding to the nucleic acid molecule of any one of (a) to (e) wherein T is replaced by U; or (II) a fragment of the polypeptide of (I) having the activity of an endonuclease.
2. The nucleic acid molecule of claim 1, wherein in (I)(c) in said amino acid sequence having at least 70% sequence identity to SEQ ID NO: 1 the amino acid residues P66, D67, D84 and/or K86 of SEQ ID NO: 1 are not modified.
3. The nucleic acid molecule of claim 1 further encoding a DNA-binding domain.
4. The nucleic acid molecule of claim 3, wherein the DNA-binding domain is a TAL effector motif of a TAL effector protein.
5. A vector comprising the nucleic acid molecule of claim 1.
6. A host cell comprising the nucleic acid molecule of claim 1.
7. A protein or fusion protein having the activity of an endonuclease encoded by the nucleic acid molecule of claim 1.
8. A method of modifying a target sequence in the genome of a eukaryotic cell, the method comprising the step of: (a) introducing into said cell the nucleic acid molecule of claim 1, a vector of comprising the nucleic acid molecule of claim 1 or a protein or fusion protein having the activity of an endonuclease encoded by the nucleic acid molecule of claim 1.
9. The method of claim 8, wherein the modification of said target sequence is by homologous recombination with a donor nucleic acid sequence, further comprising the step: (b) introducing a nucleic acid molecule into said cell, wherein said nucleic acid molecule comprises said donor nucleic acid sequence, wherein said donor DNA sequence is flanked upstream by a first flanking element and downstream by a second flanking element, wherein said first and second flanking element are different and wherein each of said first and second flanking element are homologous to a continuous DNA sequence on either side of the double-strand break introduced in (a) of claim 8 within said target sequence in the genome of said eukaryotic cell.
10. The method of claim 8, wherein said cell is analysed for successful modification of said target sequence in the genome.
11. The method of claim 8, wherein the cell is selected from the group consisting of a mammalian or vertebrate cell, a plant cell or a fungal cell.
12. The method of claim 8, wherein the cell is an oocyte.
13. A method of producing a non-human vertebrate or mammal carrying a modified target sequence in its genome, the method comprising transferring a cell produced by the method of claim 9 into a pseudo pregnant female host.
14. The method of claim 8, wherein the cell is selected from the group consisting of rodents, dogs, felides, primates, rabbits, pigs, cows, chickens, turkeys, pheasants, ducks, geese, quails, ostriches, emus, cassowaries and zebrafish.
15. A method of producing a protein or fusion protein having the activity of an endonuclease encoded by the nucleic acid molecule of claim 1 comprising the steps of: (a) culturing a host cell comprising the nucleic acid molecule of claim 1 and (b) isolating the produced protein or fusion protein.
16. A host cell comprising the vector of claim 5.
17. A protein or fusion protein having the activity of an endonuclease encoded by the vector of claim 5.
Description
[0137] The figures show:
[0138]
[0139] The figure shows the structure and function of TAL-Nuclease fusion proteins, consisting of a sequence-specific DNA-binding domain and a nonspecific DNA cleavage (nuclease) domain. The DNA-binding domain can be assembled from the four types of 34 amino acid TAL peptide elements that exhibit binding specificity against one of the DNA nucleotides through the amino acid positions 12 and 13 (NI-A; HD-C; NG-T; NN-G). Upon binding of the TAL element domain to the selected target DNA sequence, the nuclease domain of the fusion protein comes into close contact to the DNA double-strand but does not cleave the DNA as a nuclease monomer. Only upon the binding of a second TAL-Nuclease fusion protein to a second DNA target sequence located downstream of the binding site of the first fusion protein, the DNA double strand is cleaved through cooperation of the two nuclease domains that are in close contact.
[0140]
[0141] The figure shows a pair of TAL-nuclease fusion proteins that bind up- and downstream of a selected target site within a genomic target gene. Upon the creation of a DNA double-strand break within the target site two competing DNA repair mechanisms are strongly activated in cells: i) by homologous recombination, in the presence of an externally introduced gene targeting vector that comprises two homology regions to the target gene and a predesigned genetic modification/mutation, the preplanned modification is copied from the targeting vector into the genome; by this route any targeted gene modification (e.g. knock-out, knock-in) can be placed into the genome, ii) by the non-homologous end joining repair pathway (NHEJ) the free DNA ends are closed by ligation without a repair template; by this route a variable number of nucleotides is frequently lost (knife symbol) before end ligation and results frequently into a knockout allele of the target gene.
[0142]
[0143] A: For the generation of genetic modifications in mammalian cell lines TAL-nuclease expression vectors can be transfected, together with or without a specific gene targeting vector, into cultured cells. Upon nuclease expression and DNA repair a fraction of the treated cells contains the desired genetic alteration. These cells can be isolated and further cultured as a pure genetically modified cell line. B: Upon the microinjection of TAL-nuclease mRNA, together with or without a specific gene targeting vector, into fertilized mammalian oocytes (zygotes, isolated from wildtype female e.g. mice) a knockout (KO) or Knockin (KI) allele can be directly introduced into the genome of the one-cell embryo. Pseudopregnant females deliver live offspring from microinjected oocytes. The offspring is genotyped for the presence of the induced genetic modification. Positive animals are selected for further breeding to establish a gene targeted strain.
[0144]
[0145] The Tal nuclease expression vector pCAG-Tal-nuclease contains a CAG promoter region and a transcriptional unit comprising, upstream of a central pair of BsmBI restriction sites, an ATG start codon (arrow), a nuclear localisation sequence (NLS), a FLAG Tag sequence (FLAG), a linker sequence, a segment coding for 110 amino acids of the Tal protein AvrBs3 (AvrN) and its invariable N-terminal Tal repeat (r0.5). Downstream of the BsmBl sites the transcriptional unit contains an invariable C-terminal Tal repeat (rx.5), a segment coding for 44 amino acids derived from the Tal protein AvrBs3, a PmeI and MluI restriction site for the insertion of nuclease coding regions and a polyadenylation signal sequence (pA). DNA segments coding for TAL repeat elements can be inserted into the BsmBI sites of pCAG-Tal-nuclease for the expression of variable TAL-nuclease fusion proteins. To create ArtTal1-nuclease expression vectors the ArtTal1 array of TAL repeat elements, recognizing the specified 12 bp target sequence, was inserted into the BsmBl sites of pCAG-TAL-nuclease. Each 34 amino acid Tal repeat is drawn as a square indicating the repeat's amino acid code at positions 12/13 that confers binding to one of the DNA nucleotides of the target sequence (NI>A, NG>T, HD>C, NN>G) shown above. Next, synthetic nuclease domain coding regions were inserted into the PmeI and MluI sites of pCAG-ArtTal1-nuclease to obtain the expression vectors: A: pCAG-ArtTal1-Alw including the nuclease domain of the AlwI restriction endonuclease, B: pCAG-ArtTal1-CleDORF including the nuclease domain of the CIeDORF gene, C: pCAG-ArtTal1-Clo051 including the nuclease domain of the Clo051 gene, D: pCAG-ArtTal1-Mly including the nuclease domain of the MlyI restriction endonuclease, E: pCAG-ArtTal1-Pept071 including the nuclease domain of the Pept071 gene, F: pCAG-ArtTal1-Sbf including the nuclease domain of the SbfI restriction endonuclease, G: pCAG-ArtTal1-SdaI including the nuclease domain of the SdaI restriction endonuclease, H: pCAG-ArtTal1-Sst including the nuclease domain of the StsI restriction endonuclease, and I: pCAG-ArtTal1-Fok including the nuclease domain of the FokI restriction endonuclease
[0146]
[0147] Sequence of the 587 amino acid Clo051 protein in the single letter code. Indicated are the methionine at position 1 (M1), the tyrosine at position 587 (Y587) and the 199 residue nuclease domain between position E389 and Y587. Further highlighted are the positions D455, D472 and K474 that are characteristic for the conserved active site of the ‘PD-(D/E)XK’ superfamily of enzymes interacting with DNA.
[0148]
[0149] The tertiary structure of the Clo051 protein was predicted from its amino acid sequence (
[0150]
[0151] A: TAL-nuclease reporter plasmids contain a CMV promoter region, a 400 bp sequence coding for the N-terminal segment of β-galactosidase and a stop codon. This unit is followed by a TAL binding target region consisting of two inverse oriented recognition sequences (underlined), separated by a 15 bp spacer region (NNN.), for the ArtTal1 array (a), the TalRab1 array (b), the TalRab2 array (c), or a hybrid binding region composed of one ArtTal1 and one TalRab2 recognition sequence (d). The TAL-nuclease target region is followed by the complete coding region for β-galactosidase and a polyadenylation signal (pA). To test for nuclease activity against the target sequence a TAL-nuclease expression vector (
[0152]
[0153] To test for the nuclease activity of TAL-nuclease domain fusion proteins, expression vectors for the ArtTal1-AlwI, -CIeDORF, -Clo051, -MlyI, -FokI, -Pept071, -SbfI, -SdaI, and -StsI proteins (
[0154]
[0155] To test for the specificity of the ArtTal1-Clo051 nuclease against the predesigned target sequence in comparison to unrelated DNA sequences, the pCAG-ArtTal1-Clo051 expression vector was cotransfected with the corresponding ArtTal1-reporter plasmid or with the TalRab1 or TalRab2 reporter plasmids (
[0156]
[0157] A: To test for the cooperativity of the Clo051 nuclease domains of a pair of TAL-Clo051 fusion proteins, expression vectors for the ArtTal1-Clo051 or TalRab2-Clo051 fusion proteins were cotransfected with the corresponding ArtTal1- or TalRab2-reporter plasmid (
[0158]
[0159] TAL nucleases recognizing a target sequence within exon 1 of the mouse Rab38 gene. The trinucleotide representing codon 19 is underlined. Indicated is each of a 14 nucleotide sequence that is recognised by one the indicated TAL-Clo051 fusion proteins RabChtTal1- and RabChtTal2-Clo051. The two 14 bp target sequences are flanking a central 15 bp spacer sequence that is cleaved by the Clo051 nuclease domains.
[0160]
[0161] Within exon 1 of the wildtype Rab38 gene (Rab38 WT) the position of the binding sites for the TAL nuclease pair RabChtTal1- and RabChtTal2-Clo051 are indicated. The Rab38-cht targeting vector contains a 942 bp 5′-homology region and a 2788 bp 3′-homology region flanking the Rab38 TAL recognition sites. Within exon1 two nucleotide changes within codon 19 (Gta) of Rab38 create a chocolate (cht) missense mutation coding for valine (Val) instead of the wildtype (WT) glycine (Gly), and remove a BsaJI restriction site. In each of the adjacent Rab38 TAL recognition sites several silent mutations were introduced to prevent the binding of Rab38 TAL proteins to the targeting vector. The induction of a double-strand break within the wildtype Rab38 gene by the RabChtTal protein pair stimulates homologous recombination with the Rab38-cht targeting vector and integrates the chocolate missense and the silent mutations into the genome.
[0162]
[0163] The figure shows the primary sequence of the Clo051 nuclease domain between the positions E389 and Y587. Indicated is the distribution of the positively charged arginine (R) and lysine (K) residues (filled squares) and of negatively charged glutamate (E) and aspartate (D) residues (open circles). Triangles indicate the positions 5423 and R446. These residues constitute a three-dimensional framework of charges within the Clo051 domain that determines the unique tertiary structure of this nuclease, as modelled in the structure of
[0164]
[0165] HEK293 cells harboring genomic integrated copies of the pCMV-Rab-Reporter(hygro) reporter construct were transfected with pBluescript or pCAG.ArtTal1-Clo051. Specific nuclease activity against the reporter's target sequence leads to homologous recombination and the expression of β-galactosidase. Two days after transfection the cell populations were fixed and the fraction of β-galactosidase expressing cells was determined by histochemical X-Gal staining. A: X-Gal stained reporter cell culture upon transfection with pBluescript. B: X-Gal stained reporter cell culture upon transfection with pCAG-ArtTal1-Clo051 nuclease expression vector.
[0166] The examples illustrate the invention:
EXAMPLE 1
Construction of Expression and Reporter Vectors for Tal Nucleases and Detection of Specific Nuclease Activity
[0167] Construction of TAL-Nuclease Expression Vectors
[0168] For the expression of TAL-nucleases in mammalian cells we designed the generic expression vector pCAG-TAL-nuclease (SEQ ID NO: 3) (
[0169] To generate TAL-nuclease vectors for expression in mammalian cells we inserted a synthetic DNA segment with the coding region of an array of 12 Tal repeats, designated ArtTal1 (SEQ ID NO: 6), into the BsmBI sites of pCAG-TAL-nuclease, to derive the plasmid pCAG-ArtTal1-nuclease (SEQ ID NO: 7). The TAL element array ArtTal1 recognises the artificial DNA target sequence 5′-ATTCTGGGACGT-3′ (SEQ ID NO: 62) (
[0170] Next, we constructed fusion proteins of the ArtTal1 DNA binding domain with protein domains derived from known or putative nucleases and tested whether these TAL-nuclease fusion proteins are able to induce a double-strand break next to the DNA bound by the TAL recognition region. For this purpose we inserted synthetic DNA segments comprising the coding regions of eight putative nuclease domains and the known nuclease domain of FokI (SEQ ID NO: 10), into the PmeI and MluI sites of the pCAG-ArtTal1-nuclease plasmid. Among the eight putative nuclease domains we selected domains from the five known restriction enzymes AlwI (SEQ ID NO: 11), MlyI (SEQ ID NO: 12), SbfI (SEQ ID NO: 13), SdaI (SEQ ID NO: 14) and StsI (SEQ ID NO: 15). In addition, we selected putative nuclease domains of three yet uncharacterised, hypothetical microbial genes, designated here as ‘CIeDORF’ (SEQ ID NO: 16) (NCBI Reference Sequence: ZP_02080987.1, derived from the genome of Clostridium leptum DSM753), ‘Clo051 (SEQ ID NO: 17) (NCBI Reference Sequence: ZP_05132802.1, derived from the genome of Clostridium spec. 7_2_43FAA) and ‘Pept071’ (SEQ ID NO: 18) (NCBI Reference Sequence: ZP_07399918.1, derived from the genome of Peptomphilus duerdenii ATCC BAA-1640). These proteins were selected by characteristic sequence features that are compatible with the conserved active site of the ‘PD-(D/E)XK’ superfamily of enzymes (Kosinski, J., et al. (2005). BMC Bioinformatics, 6,172) interacting with DNA (see
[0171] In particular, the 587 residue Clo051 protein can be classified as a member of the PD-(D/E)XK protein family by the location of the amino acid pairs P454/D455 (PD motif) and D472/K474 (DXK motif) (
[0172] For the expression of these protein domains in mammalian cells we used synthetic coding regions optimised according to the mammalian codon usage and inserted segments comprising the putative nuclease domains of AlwI (SEQ ID NO: 19), CleDORF (SEQ ID NO: 20), Clo051 (SEQ ID NO: 1), MlyI (SEQ ID NO: 21), Pept071 (SEQ ID NO: 22), SbfI (SEQ ID NO: 23), SdaI (SEQ ID NO: 24), StsI (SEQ ID NO: 25) and the known nuclease domain of FokI (SEQ ID NO: 26) into the PmeI and MluI sites of the pCAG-ArtTal1-nuclease plasmid, to derive the expression vectors pCAG-ArtTal1-AlwI (SEQ ID NO: 27) (
[0173] Construction of TAL Nuclease Reporter Plasmids
[0174] To determine the activity and specificity of TAL nuclease domain fusion proteins in mammalian cells we constructed TAL nuclease reporter plasmids that contain two copies of a TAL DNA target sequence in inverse orientation, separated by a 15 nucleotide spacer region (
[0175] The TAL nuclease reporter plasmids contain a CMV promoter region, a 400 bp sequence coding for the N-terminal segment of β-galactosidase and a stop codon. This unit is followed by the TAL nuclease target region (consisting of two inverse oriented recognition sequences separated by a 15 bp spacer region) for ArtTal1-fusion proteins in the plasmid ArtTal1-reporter (SEQ ID NO: 45)(
[0176] Within these reporter plasmids the TAL nuclease target regions are followed by the complete coding region for β-galactosidase and a polyadenylation signal (pA). To test for nuclease activity against the specific target sequence a TAL nuclease expression vector (
[0177] Measurement of TAL-Nuclease Activity and Specificity in Human 293 Cells
[0178] To determine the activity and specificity of TAL nucleases in mammalian cells, we electroporated one million HEK 293 cells (ATCC #CRL-1573) (Graham F L, Smiley J, Russell W C, Nairn R., J. Gen. Virol. 36, 59-74, 1977) with 5 μg plasmid DNA of one of the TAL nuclease expression vectors (
[0179] As shown in
[0180] Since in repeated assays TAL fusions with the Clo051 domain appeared more active as compared to fusions with the FokI nuclease domain, we believe that the Clo051 domain is most suited for the construction of highly active TAL-nucleases.
[0181] In order to define whether the ArtTal1-Clo051 nuclease specifically recognizes its target sequence within the ArtTal1-reporter plasmid (
[0182] Next, we characterized whether the Clo051 nuclease domain induces recombinogenic double-strand breaks as a monomer, or whether the interaction of two nuclease domains as dimer is required. For this purpose we constructed the hybrid reporter plasmid ArtTal1/TalRab2-reporter (SEQ ID NO: 48) (
[0183] Next, we studied whether two Clo051 nuclease domains, that are fused to different arrays of TAL DNA binding elements, are also able to interact and to induce double-strand breaks. For this purpose the expression vectors pCAG-ArtTal1-Clo051 and pCAG-TalRab2-Clo051 were cotransfected together with the ArtTal1/TalRab2-reporter plasmid and the results compared to the cotransfection of pCAG-ArtTal1-Clo051 together with the ArtTal1/TalRab2-reporter. As shown in
EXAMPLE 2
Targeting of the Mouse Rab38 Gene in ES Cells and Zygotes with TAL-Clo051 Nucleases
[0184] Construction of Rab38 Specific TAL-Clo051 Nucleases and a Targeting Vector
[0185] To demonstrate the functionality of TAL effector DNA-binding domain—nuclease fusion proteins in mammalian cells we designed a pair of fusion proteins that recognizes a DNA target sequence within the mouse Rab38 gene (
[0186] The mouse Rab38 gene encodes the RAB38 protein that is a member of a family of proteins known to play a crucial role in vesicular trafficking. In chocolate (cht) mutant mice a single nucleotide exchange at position 146 (G>T mutation) within the first exon of Rab38 leads to the replacement of glycine by valine at codon 19 (Loftus, S. K., et al., Proc Natl Acad Sci USA, 2002. 99(7): p. 4471-6). This amino acid replacement is located within the conserved GTP binding domain of RAB38 and impairs the sorting of the tyrosinase-related protein 1 (TYRP1) into the melanosomes of Rab38.sup.cht/Rab38.sup.cht melanocytes. TYRP1 is a melanosomal membrane glycoprotein, which functions both as a 5,6-Dihydroxyindol-2-carbonic-acid oxidase enzyme to produce melanin and as a provider of structural stability to tyrosinase in the melanogenic enzyme complex. TYRP1 is believed to transit from the trans-Golgi network to stage II melanosomes by means of clathrin-coated vesicles. The reduced amount of correctly located TYRP1 leads to an impairment of pigment production and the change of fur color from black to a chocolate-like brown color in Rab38.sup.cht/Rab38.sup.cht mice. Since mutations of genes needed for melanocyte function are known to cause oculocutaneous albinism (OCD), such as Hermansky-Pudlak syndrome in man, the Rab38 gene is a candidate locus in OCD patients.
[0187] We aimed to introduce a phenocopy of the chocolate mutation at codon 19 of Rab38 using a pair of TAL-nucleases (RabChtTal1- and RabChtTal2-Clo051) that each recognise a 14 bp target sequence located up- and downstream of a central 15 bp spacer sequence within exon 1 of the Rab38 gene (
[0188] For the modification of the Rab38 gene by homologous recombination in fertilised oocytes we constructed the gene targeting vector pRab38-chtTAL (
[0189] For the modification of the Rab38 gene by homologous recombination in mouse ES cells we modified the gene targeting vector pRab38-chtTAL (
[0190] Targeting of the Rab38 Gene in ES Cells and Zygotes
[0191] To demonstrate the utility of the RabChtTal1- and RabChtTal2-Clo051 proteins for gene targeting in mammalian cells (
[0192] For targeting in ES cells we transfected IDG3.2 ES cells (Hitz, C. et al. Nucleic Acids Res. 35, e90, 2007) with linearised pRab38-chtTAL-neo targeting vector together with or without the TAL-nuclease expression plasmids pCAG-RabChtTal1- and pCAG-RabChtTal2-Clo051. The transfection, selection, expansion and genotyping of neomycin resistant ES cell clones was performed according to standard gene targeting procedures as described ((Nagy A, Gertsenstein M, Vintersten K, Behringer R., 2003. Manipulating the Mouse Embryo. Cold Spring Harbour, N.Y.: Cold Spring Harbour Laboratory Press). The analysis of resistant ES cell clones revealed that the expression of the TAL-nucleases lead to a significantly increased rate of homologous recombination at the Rab38 gene in ES cells.
[0193] For microinjection into fertilised mouse oocytes the circular pRab38-chtTAL vector DNA was mixed with in vitro transcribed mRNA coding for RabChtTal1- and RabChtTal2-Clo051 proteins in injection buffer as described (Meyer, M., et al., Proc Natl Acad Sci USA. 107(34): p. 15022-6). TAL-nuclease mRNA is prepared from the linearised expression plasmids pCAG-RabChtTA11- and pCAG-RabChtTal2-Clo051
[0194] by in vitro transcription from the T7 promoter using the mMessage mMachine kit (Ambion) according to the manufacturers instructions. The mRNA is further modified by the addition of a poly-A tail using the Poly(A) tailing kit and purified with MegaClear columns from Ambion. Finally the mRNA is precipitated and resolved in injection buffer.
[0195] To isolate fertilised oocytes, males of the C57BL/6 strain are mated to super-ovulated females of the FVB strain. For super-ovulation three-week old FVB females are treated with 2.5 IU pregnant mares serum (PMS) 2 days before mating and with 2.5 IU Human chorionic gonadotropin (hCG) at the day of mating. Fertilised oocytes are isolated from the oviducts of plug positive females and microinjected in M2 medium (Sigma-Aldrich Inc Cat. No. M7167) with the TAL-nuclease mRNA and pRab38-chtTAL targeting vector preparation into one pronucleus and the cytoplasm following standard procedures (Nagy A, Gertsenstein M, Vintersten K, Behringer R., 2003. Manipulating the Mouse Embryo. Cold Spring Harbour, N.Y.: Cold Spring Harbour Laboratory Press).
[0196] Upon microinjection the TAL-nuclease mRNAs are translated into proteins that induce a double-strand break at one or both Rab38 alleles in one or more cells of the developing embryo. This event stimulates the recombination of the pRab38-chtTAL targeting vector with a Rab38 allele via the homology regions present in the vector and leads to the site-specific insertion of the mutant codon 19 into the genome, resulting into a Rab38cIn allele bearing the chocolate mutation (
[0197] In one such experiment, mice derived from microinjected zygotes were analysed by a Rab38 PCR assay. Among this group most mice exhibited two alleles of the normal Rab38 wildtype genotype, whereas some individuals harboured one allele of the preplanned Rab38 chocolate mutation, as indicated by the absence of the BsaJl restriction site in exon 1
[0198] Taken together, it was possible to introduce a preplanned modification into the coding region of the Rab38 gene by TAL-Clo051 nuclease-assisted homologous recombination in mouse ES cells and fertilised oocytes.
EXAMPLE 3
Isolation of Hyperactive Clo051 Nuclease Mutants
[0199] As shown in
[0200] Such amino acid replacements may be made by trial and error or may follow specific hypotheses on the structural and functional impact on the Clo051 nuclease domain. Alternatively, a large number of randomly mutagenised variants of the Clo051 nuclease domain coding region can be assembled in a library by mutagenic PCR. This library of mutant molecules can be tested for the presence of hyperactive nuclease variants by a phenotypic screening assay in yeast, mammalian or E. coli cells that is coupled to a functional nuclease readout, e.g. as described for the improvement of the FLP recombinase (Buchholz et al., Nat. Biotechnol. 16, 657-62, 1998).
[0201] Such a functional screen for improved nuclease variants can result into the replacement of e.g. the residue 423 from a serine to a proline and of the residue 446 from an arginine to a glutamate. Such variant molecules can prove a superior nuclease activity as compared to the Clo051 wildtype form.
EXAMPLE 4
Clo051 Nuclease Induced Recombination of Genomic Substrates in Human Cells
[0202] The action of Clo051 nuclease was further tested in human HEK293 cells on a genomic integrated reporter construct. For this purpose the ArtTal1 reporter plasmid (
[0203] Next, one million reporter cells were transfected with 5 μg plasmid DNA of the Tal nuclease expression vector pCAG-ArtTal1-Clo051 (