CENH3 DELETION MUTANTS
20200340009 ยท 2020-10-29
Inventors
Cpc classification
C12N2310/20
CHEMISTRY; METALLURGY
C12N15/8261
CHEMISTRY; METALLURGY
C12N9/22
CHEMISTRY; METALLURGY
A01H1/08
HUMAN NECESSITIES
C12N2800/80
CHEMISTRY; METALLURGY
C12N15/113
CHEMISTRY; METALLURGY
Y02A40/146
GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
International classification
C12N15/82
CHEMISTRY; METALLURGY
C12N15/113
CHEMISTRY; METALLURGY
Abstract
Disclosed are methods of creating a haploid inducing plant by editing a CENH3 gene of a plant such that the CENH3 gene encodes a CENH3 polypeptide with a two or more contiguous amino acid deletion relative to wild-type CENH3, wherein said haploid inducing plant, when crossed with a second plant, results in haploid progeny. Also provided is a method of creating a haploid inducing plant by editing a CENH3 gene of a plant such that the CENH3 gene encodes a CENH3 polypeptide with a two or more contiguous amino acid insertion relative to wild-type CENH3, wherein said haploid inducing plant, when crossed with a second plant, results in haploid progeny.
Claims
1. A method of creating a haploid inducing plant, the method comprising, editing a CENH3 gene of a plant such that the CENH3 gene encodes a CENH3 polypeptide with a two or more contiguous amino acid deletion relative to wild-type CENH3, wherein said haploid inducing plant, when crossed with a second plant, results in haploid progeny.
2. The method of claim 1, wherein the CENH3 polypeptide has an eleven amino acid deletion relative to wild-type CENH3.
3. The method of claim 1, wherein the CENH3 polypeptide has a 2-15 (e.g., 2-12) contiguous amino acid deletion relative to wild-type CENH3.
4. The method of claim 1, wherein the deletion is in a alpha-N helix domain of the CENH3 polypeptide.
5. The method of claim 1, wherein the CENH3 polypeptide comprises a sequence at least 90% identical to SEQ ID NO:1-50 or 101-126.
6. The method of claim 1, wherein the CENH3 polypeptide comprises any of SEQ ID NO: 101, 110, 116-117, or 126-144.
7. The method of claim 1, wherein the plant is a tomato or potato plant.
8. The method of claim 1, wherein the editing occurs in situ in the plant.
9. The method of claim 1, wherein the editing comprises introducing into the plant a Cas protein or Cpf1 protein and a guide RNA targeting a CENH3-coding sequence, thereby inducing the two or more contiguous amino acid deletion.
10. A method of creating a haploid inducing plant, the method comprising, editing a CENH3 gene of a plant such that the CENH3 gene encodes a CENH3 polypeptide with a two or more contiguous amino acid insertion relative to wild-type CENH3, wherein said haploid inducing plant, when crossed with a second plant, results in haploid progeny.
11. The method of claim 10, wherein the CENH3 polypeptide has a 2-15 contiguous amino acid insertion relative to wild-type CENH3.
12. The method of claim 10, wherein the insertion is in an alpha-N helix domain of the CENH3 polypeptide.
13. The method of claim 10, wherein the CENH3 polypeptide comprises a sequence at least 90% identical to SEQ ID NO:1-50 or 101-126.
14. The method of claim 10, wherein the CENH3 polypeptide comprises any of SEQ ID NO: 101, 110, 116-117, or 126-144.
15. The method of claim 10, wherein the plant is a tomato or potato plant.
16. The method of claim 10, wherein the editing occurs in situ in the plant.
17. The method of claim 10, wherein the editing comprises introducing into the plant a Cas protein or Cpf1 protein and a guide RNA targeting a CENH3-coding sequence, thereby inducing the two or more contiguous amino acid insertion.
18. A haploid-inducing plant expressing a mutant CENH3 polypeptide encoded by a CENH3 coding sequence, wherein the CENH3 coding sequence comprises an in-frame deletion or insertion of 6 or more contiguous nucleotides, relative to wildtype CENH3.
19. The haploid-inducing plant of claim 18, wherein the in-frame deletion comprises 6-42 contiguous nucleotides of the wildtype CENH3 gene.
20-24. (canceled)
21. A method of making progeny with reduced chromosome content, the method comprising crossing the plant of claim 18 to a plant having a ploidy; and selecting progeny from the cross that have half the ploidy.
22-28. (canceled)
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0030]
[0031]
[0032]
[0033]
[0034]
[0035]
[0036]
[0037]
DETAILED DESCRIPTION OF THE INVENTION
[0038] Endogenous Centromeric histone H3 (CENH3) proteins are a well characterized class of proteins that are variants of histone H3 proteins. These specialized proteins, which are specifically associated with the centromere, are essential for proper formation and function of the kinetochore, a multiprotein complex that assembles at centromeres and links the chromosome to spindle microtubules during mitosis and meiosis. Cells that are deficient in CENH3 fail to localize kinetochore proteins and show strong chromosome segregation defects.
[0039] CENH3 proteins are characterized by a N-terminal variable tail domain and a C-terminal conserved histone fold domain made up of three -helical regions connected by loop sections. The CENH3 histone fold domain is conserved between CENH3 proteins from different species. See, e.g., Torras-Llort et al., EMBO J. 28:2337-48 (2009). In contrast, the N-terminal tail domains of CENH3 are highly variable even between closely related species. Histone tail domains (including CENH3 tail domains) are flexible and unstructured, as shown by their lack of strong electron density in the structure of the nucleosome determined by X-ray crystallography (Luger et al., Nature 389(6648):251-60 (1997)). Additional structural and functional features of CENH3 proteins can be found in, e.g., Cooper et al., Mol Biol Evol. 21(9):1712-8 (2004); Malik et al., Nat Struct Biol. 10(11):882-91 (2003); Black et al., Curr Opin Cell Biol. 20(1):91-100 (2008); and Torras-Llort et al., EMBO J. 28:2337-48 (2009).
[0040] CENH3 proteins are widely found throughout eukaryotes, and a large number of CENH3 proteins have been identified. See, e.g. SEQ ID NOs: 1-50. It will be appreciated that the above list is not intended to be exhaustive and that additional CENH3 sequences are available from genomic studies or can be identified from genomic databases or by well-known laboratory techniques. For example, where a particular plant or other organism species CENH3 is not readily available from a database, one can identify and clone the organism's CENH3 gene sequence using primers, which are optionally degenerate, based on conserved regions of other known CENH3 proteins.
[0041] The inventors have discovered that introduction of nucleotide deletions or insertions in a number divisible by three (e.g., 6 and 33) in a wildtype CENH3 coding sequence results in a viable CENH3 allele, which when homozygous in a plant and crossed with a wildtype diploid plant, results in haploid progeny. See, e.g., SEQ ID Nos: 101, 110, 116-117, or 126-144. Accordingly, methods are provided for introducing deletions or insertions of six or more nucleotides from a CENH3 coding sequence to delete nucleotides in a contiguous multiple of three to cause deletion or insertion of two or more amino acids, e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 or more amino acids. In other embodiments, methods are provided for introducing deletions of three nucleotides from a CENH3 coding sequence to cause deletion of one amino acid. In other embodiments, methods are provided for introducing one or more nucleotide to a coding seqyence to introduce one or more amino acid addition to the CENH3 protein sequence. Also provided are plants comprising introduced nucleotide deletions as discussed above or elsewhere herein. Methods of crossing such plants with a parent plant to generate a progeny plant having half the chromosomes of the parent plant are also provided.
[0042] Deletions or insertions in the CENH3 polypeptides can occur at various locations. In some embodiments, the deletion is in or at least part of the deletion includes one or more amino acid from the histone-fold domain. For example, the deletion or insertion can include deletion of one or more (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 or more) contiguous amino acid in any of the alpha-N helix domain, alpha-1 helix domain, alpha-2 helix domain, or alpha-2 helix domain, and/or an intervening amino acid as occurs in the respective (e.g., most closely aligned and/or from which the deleted sequence has been derived) wildtype CENH3 polypeptide. These domains are shown for representative sequences in
[0043] The CENH3 histone fold domain is conserved between CENH3 proteins from different species. The CENH3 histone fold domain can be distinguished by three -helical regions connected by loop sections. While it will be appreciated that the exact location of the histone fold domain will vary in CENH3 proteins from other species, it will be found at the carboxyl terminus of an endogenous (wildtype) CENH3 protein. Thus, in some embodiments, a CENH3 protein can be identified in an endogenous protein as having a carboxyl terminal domain substantially similar (e.g., at least 30%, 40%, 50%, 60%, 70%, 85%, 90%, 95% or more identity) to any of SEQ ID NO:s 55-100.
[0044] The border between the tail domain and the histone fold domain of CENH3 proteins is at, within, or near (i.e., within 5, 10, 15, 20, or 25 amino acids from the P of) the conserved PGTVAL sequence (SEQ ID NO: 146). The PGTVAL sequence (SEQ ID NO: 146) is approximately 81 amino acids from the N terminus of the Arabidopsis CENH3 protein, though the distance from the N terminus of different endogenous CENH3 proteins varies. See, for example, the sequence listing.
[0045] Deletions as described herein (for example but not limited to those corresponding to the above-described positions) can be introduced into a CENH3 coding sequence from any species. In some embodiments the CENH3 polypeptide has one of the deletions described herein and is substantially identical to any one of SEQ ID NOs:1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50. In some embodiments, the CENH3 that has an introduced deletion is from a species of plant of the genus Abelmoschus, Allium, Apium, Amaranthus, Arachis, Arabidopsis, Asparagus, Atropa, Avena, Benincasa, Beta, Brassica, Cannabis, Capsella, Cica, Cichorium, Citrus, Citrullus, Capsicum, Carthamus, Cocos, Coea, Cucumis, Cucurbita, Cynasa, Daucus, Diplotaxis, Dioscorea, Elais, Eruca, Foeniculum, Fragaria, Glycine, Gossypium, Helianthus, Heterocallis, Hordeum, Hyoscyamus, Ipomea, Lactuca, Lagenaria, Lepidium, Linum, Loliur, Luffa, Luula, Lycopersicon, Malus, Manihot, Majorana, Medicago, Momodica, Musa, Nicotiana, Olea, Oryza, Panicum, Pastinaca, Pennisetum, Persea, Petroselinium, Phaseolus, Physalis, Pinus, Pisum, Populus, Pyrus, Prunus, Raphanus, Saccharum, Secale, Senecio, Sesamum, Sinapis, Solanum, Sorghum, Spinacia, Theobroma, Trichosantes, Trigonella, Triticum, Turritis, Valerianelle, Vitis, Vigna, or Zea. For example, the CENH3 deletion can be in a tomato, potato, rice, Arabidopsis or other plant CENH3 and can be expressed in the same species or a different species of plant. The resulting deleted CENH3 polypeptide can be expressed in the same plant species from which the CENH3 polypeptide was derived or the CENH3 polypeptide having the deletion can be expressed in a different species.
[0046] Mutation methods that introduce DNA deletions, as well as site-directed mutagenesis can be used to generate the deletions described herein as desired. Methods for introducing genetic deletions into plant genes and selecting plants with desired traits are well known and can be used to introduce deletions into or to knock out the CENH3 gene. For instance, seeds or other plant material can be treated with a mutagenic insertional polynucleotide (e.g., transposon, T-DNA, etc.) or chemical substance, according to standard techniques. Chemical substances that cause deletions include, but are not limited to, bleomycin and nalidixic acid. Alternatively, ionizing radiation from sources such as, X-rays or gamma rays can be used. Plants having a mutated or knocked-out CENH3 gene can be identified, for example, by phenotype or by molecular techniques, including but not limited to TILLING methods. See, e.g., Comai, L. & Henikoff. S. The Plant Journal 45, 684-694 (2006).
[0047] CENH3 polypeptides having deletions as described herein can also be constructed in vitro by mutating the DNA sequences that encode the corresponding wild-type CENH3 polypeptide (e.g., a wild-type CENH3 polypeptide of any of SEQ ID NOs:1-50), such as by using site-directed or random mutagenesis. Nucleic acid molecules encoding the wild-type CENH3 polypeptide can be mutated in vitro to have one or more deletions by a variety of polymerase chain reaction (PCR) techniques. See, e.g., PCR Strategies (M. A. Innis, D. H. Gelfand, and J. J. Sninsky eds., 1995, Academic Press, San Diego, Calif.) at Chapter 14; PCR Protocols: A Guide to Methods and Applications (M. A. Innis, D. H. Gelfand, J. J. Sninsky, and T. J. White eds., Academic Press, NY, 1990).
[0048] As a non-limiting example, mutagenesis may be accomplished using site-directed mutagenesis, in which deletions are made to a DNA template. Kits for site-directed mutagenesis are commercially available, such as the QuikChange Site-Directed Mutagenesis Kit (Stratagene). Briefly, a DNA template to be mutagenized is amplified by PCR according to the manufacturer's instructions using a high-fidelity DNA polymerase (e.g., Pfu Turbo) and oligonucleotide primers containing the desired mutation (e.g., deletion). Incorporation of the oligonucleotides generates a mutated plasmid, which can then be transformed into suitable cells (e.g., bacterial or yeast cells) for subsequent screening to confirm mutagenesis of the DNA.
[0049] Other mutation induction systems, such as genome editing methods, can be used to target deletions in CENH3 (Lozano-Juste, J., and Cutler, S. R (2014) Trends in Plant Science 19, 284-287). The sequence-specific introduction of a double stranded DNA break (DSB) in a genome leads to the recruitment of DNA repair factors at the breakage site, which then repair lesion by either the error-prone non-homologous end joining (NHEJ) or homologous recombination (HR) pathways. NHEJ repairs the breaks, but is imprecise and often creates diverse mutations at and around the DSB. In cells in which the HR machinery repairs the DSB, sequences with homology flanking the DSB, including exogenously supplied sequences, can be incorporated at the region of the DSB. DSBs can therefore be leveraged by geneticists to increase the frequency of mutations at defined sites, however intrinsic differences between the relative roles of HR and NHEJ can affect the mutation types at a targets locus. A number of technologies have been developed to create DSBs at specific sites including synthetic zinc finger nucleases (ZFNs), transcription activator-like endonucleases (TALENs) and most recently the clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated protein 9 (Cas9) system. This system is based on a bacterial immune system against invading bacteriophages in which a complex of 2 small RNAs, the CRISPR-RNA (crRNA) and the trans-activating crRNA (tracrRNA) directs a nuclease (Cas9) to a specific DNA sequence complementary to the crRNA. In other embodiments, Cpf-1 or other Class 2 CRISPR proteins or CRISPR-associated protein (CAS) CRISPR-associated protein (e.g., other Class 1 CRISPR proteins) from other bacteria, for example, can be similarly used. Using any of these systems, one can create DSBs at pre-determined sites in cells expressing the genome editing constructs. In order for homologous recombination to occur, a DNA cassette homologous to the targeted site must be provided, preferably at a high concentration so that homologous recombination is favored or NHEJ. Multiple strategies are conceivable for realizing this, including template delivery using agrobacterium mediated transformation or particle bombardment of DNA templates, and one recently described method uses a modified viral genome to provide the double stranded DNA template. For example, Baltes et al. 2014 (Baltes, N. J., et al. (2014) Plant Cell 26, 151-163) recently demonstrated that an engineered geminivirus that was introduced into plant cells using Agrobacterium mediated transformation could be engineered to produce DNA recombination templates in cells where a ZFN was co-expressed.
[0050] In the CRISPR/Cas9 bacterial antiviral and transcriptional regulatory system, a complex of two small RNAsthe CRISPR-RNA (crRNA) and the trans-activating crRNA (tracrRNA)directs the nuclease (Cas9) to a specific DNA sequence complementary to the crRNA (Jinek, M., et al. Science 337, 816-821 (2012)). Binding of these RNAs to Cas9 involves specific sequences and secondary structures in the RNA. The two RNA components can be simplified into a single element, the single guide-RNA (sgRNA), which is transcribed from a cassette containing a target sequence defined by the user (Jinek, M., et al. Science 337, 816-821 (2012)). This system has been used for genome editing in humans, zebrafish, Drosophila, mice, nematodes, bacteria, yeast, and plants (Hsu, P. D., et al., Cell 157, 1262-1278 (2014)). In this system the nuclease creates double stranded breaks at the target region programmed by the sgRNA. These can be repaired by non-homologous recombination, which often yields inactivating mutations. The breaks can also be repaired by homologous recombination, which enables the system to be used for gene targeted gene replacement (Li, J.-F., et al. Nat. Biotechnol. 31, 688-691, 2013; Shan, Q., et al. Nat. Biotechnol. 31, 686-688, 2013). The CENH3 mutations described in this application can be introduced into plants using the CAS9/CRISPR or other CRISPR system.
[0051] Accordingly, in some embodiments, instead of generating a transgenic plant, a native CENH3 coding sequence in a plant or plant cell can be altered in situ to generate a plant or plant cell carrying a polynucleotide encoding a CENH3 polypeptide having one or more deletion as described herein. The CRISPR/Cas system has been modified for use in prokaryotic and eukaryotic systems for genome editing and transcriptional regulation. The CRISPR/Cas system refers to a widespread class of bacterial systems for defense against foreign nucleic acid. CRISPR/Cas systems are found in a wide range of eubacterial and archaeal organisms. CRISPR/Cas systems include type I, II, and III sub-types. Wild-type type II CRISPR/Cas systems utilize the RNA-mediated nuclease, Cas9 in complex with guide and activating RNA to recognize and cleave foreign nucleic acid. Cas9 homologs are found in a wide variety of eubacteria, including, but not limited to bacteria of the following taxonomic groups: Actinobacteria, Aquficae, Bacteroidetes-Chlorobi, Chlamydiae-Verrucomicrobia, Chofiexi, Cyanobacteria, Firmicutes, Proteobacteria, Spirochaetes, and Thermotogae. An exemplary Cas9 protein is the Streptococcus pyogenes Cas9 protein. Additional Cas9 proteins and homologs thereof are described in, e.g., Chylinksi, et al., RNA Biol. 2013 May 1; 10(5): 726-737; Nat. Rev. Microbiol. 2011 June; 9(6): 467-477; Hou, et al., Proc Natl Acad Sci USA. 2013 Sep. 24; 110(39):15644-9; Sampson et al., Nature. 2013 May 9; 497(7448):254-7; and Jinek, et al., Science. 2012 Aug. 17; 337(6096):816-21. In some embodimemts, a Cms1 nuclease is used. See, e.g., Begemann, Matthew B., et al., bioRxiv (2017): 192799. Other exemplary nucleases include, for example, TALE nucleases (TALENs), zinc-finger proteins (ZFPs), zinc-finger nucleases (ZFNs), DNA-guided polypeptides such as Natronobacterium gregoryi Argonaute (NgAgo).
[0052] The present disclosure also provides for nucleic acids, including isolated nucleic acids, nucleic acid expression cassettes, and expression vectors, that encode the CENH3 polypeptides having one or more deletion as described herein. Also provided are cells comprising the nucleic acids.
[0053] Once a polynucleotide encoding a CENH3 polypeptide having the deletion(s) is obtained, in some embodiments, it can also be used to prepare an expression cassette for expressing the resulting modified CENH3 polypeptide in a transgenic plant, directed by a promoter, which can be endogenous (e.g., a CENH3 promoter) or heterologous. Expression of the CENH3 polynucleotides encoding the polypeptide having the deletion(s) in a genetic background that otherwise does not express other CENH3 proteins, is useful, for example, to make a haploid inducer plant.
[0054] Any of a number of means can be used to drive CENH3 (having a deletion as described herein) activity or expression in plants. In some embodiments, to use a polynucleotide sequence for a CENH3 polypeptide having a deletion in the above techniques, recombinant DNA vectors suitable for transformation of plant cells are prepared. Techniques for transforming a wide variety of higher plant species are well known and described in the technical and scientific literature. See, e.g., Weising et al. Ann. Rev. Genet. 22:421-477 (1988). A DNA sequence coding for the CENH3 polypeptide having a deletion can be combined with transcriptional and translational initiation regulatory sequences which will direct the transcription of the sequence from the gene in the intended tissues of the transformed plant.
[0055] For example, a plant promoter fragment may be employed to direct expression of the CENH3 polynucleotide having a deletion in all tissues of a regenerated plant. Such promoters are referred to herein as constitutive promoters and are active under most environmental conditions and states of development or cell differentiation. Examples of constitutive promoters include the cauliflower mosaic virus (CaMV) 35S transcription initiation region, the 1- or 2-promoter derived from T-DNA of Agrobacterium tumafaciens, and other transcription initiation regions from various plant genes known to those of skill.
[0056] Alternatively, the plant promoter may direct expression of the CENH3 protein having a deletion in a specific tissue (tissue-specific promoters) or may be otherwise under more precise environmental control (inducible promoters).
[0057] If proper protein expression is desired, a polyadenylation region at the 3-end of the coding region should be included. The polyadenylation region can be derived from a naturally occurring CENH3 gene, from a variety of other plant genes, or from T-DNA.
[0058] In some embodiments, the vector comprising the sequences (e.g., promoters or CENH3 coding regions) comprises a marker gene that confers a selectable phenotype on plant cells. For example, the marker may encode biocide resistance, particularly antibiotic resistance, such as resistance to kanamycin, G418, bleomycin, hygromycin, or herbicide resistance, such as resistance to chlorosluforon or Basta.
[0059] In some embodiments, the CENH3 nucleic acid sequence having a deletion is expressed recombinantly in plant cells. A variety of different expression constructs, such as expression cassettes and vectors suitable for transformation of plant cells, can be prepared. Techniques for transforming a wide variety of higher plant species are well known and described in the technical and scientific literature. See, e.g., Weising et al. Ann. Rev. Genet. 22:421-477 (1988). A DNA sequence coding for a CENH3 protein can be combined with cis-acting (promoter) and trans-acting (enhancer) transcriptional regulatory sequences to direct the timing, tissue type and levels of transcription in the intended tissues of the transformed plant. Translational control elements can also be used.
[0060] Embodiments of the present disclosure also provide for a mutated CENH3 nucleic acid operably linked to a promoter which, in some embodiments, is capable of driving the transcription of the CENH3 coding sequence having a deletion in plants. The promoter can be, e.g., derived from plant or viral sources. The promoter can be, e.g., constitutively active, inducible, or tissue specific. In construction of recombinant expression cassettes, vectors, transgenics, of the invention, different promoters can be chosen and employed to differentially direct gene expression, e.g., in some or all tissues of a plant or animal.
[0061] When generating transgenic plants, it will be desirable to ultimately generate a plant that expresses the CENH3 polypeptide having a deletion but does not express wildtype CENH3. In some embodiments, one can generate a CENH3 mutation in an endogenous gene that reduces or eliminates CENH3 activity or expression, e.g., generating a CENH3 gene knockout. In these embodiments, one can generate an organism heterozygous for the gene knockout or mutation and introduce an expression cassette for expression of the heterologous corresponding mutated kinetochore complex protein into the organism. Progeny from the heterozygote can then be selected that are homozygous for the mutation or knockout but that comprises the recombinantly expressed heterologous mutated kinetochore complex protein. Accordingly, in some embodiments, plants, plant cells or other organisms are provided in which one or both endogenous CENH3 alleles are knocked out or mutated to significantly or essentially completely lack CENH3 activity, i.e., sufficient to induce embryo lethality without a complementary expression of a mutated CENH3 protein as described herein. In plants having more than a diploid set of chromosomes (e.g. tetraploids), all alleles can be inactivated, mutated, or knocked out.
[0062] Alternatively, one can introduce the expression cassette encoding a CENH3 protein having a deletion into an organism with an intact set of endogenous CENH3 alleles and then silence the endogenous CENH3 gene. As an example, an siRNA or microRNA can be introduced or expressed in the organism that reduces or eliminates expression of the endogenous CENH3.
[0063] The silencing siRNA or other silencing agent can be selected to silence the endogenous CENH3 gene but not substantially interfere with expression of the CENH3 protein having a deletion. In situations where endogenous CENH3 is to be inactivated, this can be achieved, for example, by targeting the siRNA to the N-terminal tail coding section, or untranslated portions, or the CENH3 mRNA, depending on the structure of the mutated kinetochore complex protein. Alternatively, the CENH3 protein transgene having a deletion can be designed with novel codon usage, such that it lacks sequence homology with the endogenous CENH3 protein gene and with the silencing siRNA.
[0064] Also provided are host cell(s) comprising a nucleic acid encoding a CENH3 polypeptide having a deletion as described herein. As discussed above, the cell can comprise an endogenous CENH3 gene that has been mutated to contain the nucleic acid encoding the CENH3 polypeptide having a deletion, or the nucleic acid can be heterologous to the cell (for example, the nucleic acid could be transformed into the cell). In the latter case, the nucleic acid can be part of a heterologous expression cassette (e.g., comprising a promoter operably linked to the coding sequence). Exemplary host cells include, for example, prokaryotic (e.g., including but not limited to E. coli) cells or eukaryotic cells, and can for example plant, fungal, yeast, mammalian, insect, or other cells. Also provided as discussed above are plants comprising a nucleic acid encoding a CENH3 polypeptide having a deletion as described herein.
[0065] Crossing a plant that expresses a CENH3 polypeptide having a deletion as described herein, and that does not express a wildtype CENH3 polypeptide, either as a pollen or ovule parent, to a diploid plant that expresses an endogenous CENH3 polypeptide will result in at least some progeny (e.g., at least 0.1%, 0.5%, 1%, 5%, 10%, 20% or more) that are haploid and comprise only chromosomes from the plant that expresses the endogenous CENH3 polypeptide. Thus, the present disclosure allows for the generation of haploid plants having all of its chromosomes from a plant of interest (i.e., the plant expressing the endogenous CENH3 polypeptide) by crossing the plant of interest with a plant expressing the mutated CENH3 polypeptide and collecting and/or selecting the resulting haploid seed. The methods can similarly be used to generate plants with higher number of chromosomes to generate progeny with half the number of chromosomes, e.g., crossing a plant that expresses a CENH3 polypeptide having a deletion as described herein, and that does not express a wildtype CENH3 polypeptide to a tetraploid plant will generate some progeny that have half the chromosomes of the tetraploid plant (e.g., diploid plants).
[0066] As noted above, the plant expressing a wild type (e.g., endogenous) CENH3 protein can be crossed as either the male or female parent. An aspect of the method is that it allows for generation of a plant (or other organism) having only a male parent's nuclear chromosomes and a female parent's cytoplasm with associated mitochondria and plastids, when the mutated CENH3 polypeptide parent is the female parent.
[0067] Once generated, haploid plants can be used for a variety of useful endeavors, including but not limited to the generation of doubled haploid plants, which comprise an exact duplicate copy of chromosomes. Such doubled haploid plants are of particular use to speed plant breeding, for example. A wide variety of methods are known for generating doubled haploid organisms from haploid organisms.
[0068] Somatic haploid cells, haploid embryos, haploid seeds, or haploid plants produced from haploid seeds can be treated with a chromosome doubling agent. Homozygous double haploid plants can be regenerated from haploid cells by contacting the haploid cells, including but not limited to haploid callus, with chromosome doubling agents, such as colchicine, anti-microtubule herbicides, or nitrous oxide to create homozygous doubled haploid cells.
[0069] Methods of chromosome doubling are disclosed in, for example, U.S. Pat. Nos. 5,770,788; 7,135,615, and US Patent Publication No. 2004/0210959 and 2005/0289673; Antoine-Michard, S. et al., Plant Cell, Tissue Organ Cult., Dordrecht, the Netherlands, Kluwer Academic Publishers 48(3):203-207 (1997); Kato, A., Maize Genetics Cooperation Newsletter 1997, 36-37; and Wan, Y. et al., Trends Genetics 77: 889-892 (1989). Wan, Y. et al., Trends Genetics 81: 205-211 (1991), the disclosures of which are incorporated herein by reference. Methods can involve, for example, contacting the haploid cell with nitrous oxide, anti-microtubule herbicides, or colchicine. Optionally, the haploids can be transformed with a heterologous gene of interest, if desired.
[0070] Double haploid plants can be further crossed to other plants to generate F1, F2, or subsequent generations of plants with desired traits.
EXAMPLES
[0071] CENH3 is a histone 3 variant that determines, epigenetically, the location of centromeres. Centromeres are the attachment sites for the kinetochore, which is required for the separation of sister chromatids to opposite poles of the cell during mitosis. CENH3 is therefore an essential protein. The protein's structure can be divided into the highly conserved histone fold domain (HFD) and the highly variable N-terminal tail. It is hypothesized that defective (or weak) alleles of CENH3 cannot compete with wild-type alleles for kinetochore components (and reloading of centromeric components) during the first few mitotic divisions of embryogenesis. This results in proper segregation of sister chromatids derived from the wild-type parent, but loss of chromosomes derived from the mutant parent.
[0072] The conservation of the histone fold domain of CENH3 among eukaryotes is illustrated in
Details:
[0073] a) Tomatoes were transformed with a variety of T-DNA constructs. The most significant of these, pMR303, carries a CRISPR targeting the region encoding the alpha-N helix of the native CENH3, plus a chimeric CENH3 transgene termed citrine:tailswap (
Two alleles identified in tomato were:
TABLE-US-00001 6-1(2AA)(SEQIDNO:101,6-1)RYRP{GT}VAL(SEQIDNO:147)>RYRPVAL(SEQ IDNO:148)(boldedandbracketedaminoacidsinbracketsweredeletedand boldedandunderlinedaminoacidswereaddedduetochangeinthecodon). (SEQIDNO:101) MARTKHLAKRSRTTSAAPSATPSTPSRKSPRSAPATSVQKPKQKKRYRPVALREIRHFQK TWDLLIPAAPFIRLVREISHFYAPGVTRWQAEALIAIQEAAEDFLVHLFEDAMLCAIHAK RVTLMKKDFELARRLGGKGQPW* 12(4AA)(SEQIDNO:110,12-3)KKR{YRPGT}VAL(SEQIDNO:149)>KKRSVAL (SEQIDNO:150) (SEQIDNO:110) MARTKHLAKRSRTTSAAPSATPSTPSRKSPRSAPATSVQKPKQKKRSVALREIRHFQKT VDLLIPAAPFIRLVREISHFYAPGVTRWQAEALIAIQEAAEDFLVHLFEDAMLCAIHAKR VTLMKKDFELARRLGGKGQPW*
[0074] b) Hairy roots transformed with pMR303 often produced homozygous roots carrying in-frame deletions (e.g., 3 bp, 6 bp, 12 bp) and more rarely larger deletions like a 33 bp deletion mutation at CENH3. All of these in-frame deletions are predicted to produce CENH3 proteins with internal deletions within the highly-conserved alpha-N helix (
[0075] c) In order to test the functionality of these deletion alleles, Arabidopsis CENH3 alleles with the same deletions were synthesized and transformed into an Arabidopsis CENH3+/heterozygote. cenh3/ homozygotes were identified among the T1 transformants. This result indicates that both the 6 bp and the 33 bp deletions express a functional CENH3. The plants are fertile on self-pollination. Outcrossing the deletion mutants by wild-type pollen, in contrast, results in high seed lethality, and production of paternal haploids (assayed as expression of a recessive marker derived from the pollen donor). The two amino acid deletion produces 25% haploids (among surviving seeds), while the eleven amino acid deletion produces 16% haploid progeny (among surviving seeds). Thus CRISPR-induced deletions in the alpha-N helix can result in haploid inducers.
[0076] We have demonstrated that, in Arabidopsis, in-frame deletions in the alpha-N helix of CENH3 can induce haploids on outcrossing by wild-type pollen, using transgenic CENH3 variants synthesized in the lab and transformed into CENH3 KO lines.
[0077] We have shown that [0078] 1) in-frame mutations can routinely be generated by CRISPR mutagenesis using a variety of guides; and [0079] 2) a variety of CRISPR-induced in-frame mutations in CENH3 can result in haploid-inducing plants.
[0080] For example we employed 5 guide RNAs distributed across the CENH3 gene of the model plant Arabidopsis to generate in-frame deletions, additions, and amino acid changes.
[0081] The guide RNAs were cloned into a Cas9-expressing vector and the resulting constructs were used to transform WT Col-0 Arabidopsis plants. T1 plants were screened and transgenic plants were genotyped for mutations in CENH3. A list of T2 or T3 mutants obtained as viable homozygotes is provided below. This viability demonstrates that a wide range of changes can be accommodated by CENH3.
From construct CenH3 G1-392 we obtained:
TABLE-US-00002 392#2-3 is (+28-4)/(+28-4)(+8AA) GPTTTPT (SEQ ID NO: 151 > GPTAGPISNLKFTPT (SEQ ID NO: 152) in the N-terminal tail. Bolded and underlined sequence was added to the wild type CENH3 sequence. The mutant's protein's sequence is therfore: (SEQIDNO:127) MARTKHRVTRSQPRNQTDAAGASSSQAAGPTAGPISNLKFTPTRRGGEGGDNTQQTNP TTSPATGTRRGAKRSRQAMPRGSQKKSYRYRPGTVALKEIRHFQKQTNLLIPAASFIREV RSITHMLAPPQINRWTAEALVALQEAAEDYLVGLFSDSMLCAIHARRVTLMRKDFELAR RLGGKGRPW* 392#2-4is95/95(3AA)GPTT{TPT}RR(SEQIDNO:153)>GPTTRR(SEQID NO:154)intheN-terminaltail.Boldedandbracketedsequencewasdeleted fromthewildtypeCENH3squence. Themutant'sprotein'ssequenceistherfore: (SEQIDNO:144) MARTKHRVTRSQPRNQTDAAGASSSQAAGPTTRRGGEGGDNTQQTNPTTSPATGTRRG AKRSRQAMPRGSQKKSYRYRPGTVALKEIRHFQKQTNLLIPAASFIREVRSITHMLAPPQ INRWTAEALVALQEAAEDYLVGLFSDSMLCAIHARRVTLMRKDFELARRLGGKGRPW* 392#5-2is77/7777bpofintroncausingthe9bplefttobeadded(+3AA) totheN-terminaltailGPTTTPT(SEQIDNO:151)>GPTTKLKTPT(SEQIDNO:155) Themutant'sprotein'ssequenceistherfore: (SEQIDNO:128) MARTKHRVTRSQPRNQTDAAGASSSQAAGPTTKLKTPTRRGGEGGDNTQQTNPTTSPA TGTRRGAKRSRQAMPRGSQKKSYRYPGTVALKEIRHFQKQTNLLIPAASFIREVRSITH MLAPPQINRWTAEALVALQEAAEDYLVGLFSDSMLCAIHARRVTLMRKDFELARRLGG KGRPW* 392#9-2is(+8-)/(+8/-2)(-1+3AA)GPT{T}TPT(SEQIDNO:156)>GPTIELTPT (SEQIDNO:157)intheN-terminaltail Themutant'sprotein'ssequenceistherfore: (SEQIDNO:129) MARTKHRVTRSQPRNQTDAAGASSSQAAGPTIELTPTRRGGEGGDNTQQTNPTTSPAT GTRRGAKRSRQAMPRGSQKKSYRYRPGTVALKEIRHFQKQTNLLIPAASFIREVRSITHM LAPPQINRWTAEALVALQEAAEDYLVGLFSDSMLCAIHARRVTLMRKDFELARRLGGK GRPW*
From construct CenH3 G2-393 we obtained:
[0082] 393#3-1 is (20/20) has a deletion probably resulting in a splicing defect. This mutation is viable as a homozygote.
TABLE-US-00003 393#3-3is(+3/+3)(+1AA)KRSRQA(SEQIDNO:158)>KRSTRQA(SEQIDNO: 159)intheN-terminaltail Themutant'sprotein'ssequenceistherfore: (SEQIDNO:130) MARTKHRVTRSQPRNQTDAAGASSSQAAGPTTTPTRRGGEGGDNTQQTNPTTSPATGT RRGAKRSTRQAMPRGSQKKSYRYRPGTVALKEIRHFQKQTNLLIPAASFIREVRSITHML APPQINRWTAEALVALQEAAEDYLVGLFSDSMLCAIHARRVTLMRKDFELARRLGGKG RPW*
From construct CenH3 G1G2-401 we obtained:
TABLE-US-00004 401#1-2is367/367(37AA) GPT{TTPTRRGGEGGDNTQQTNPTTSPATGTRRGAKRSRQA}MPR(SEQIDNO:160) >GPTMPR(SEQIDNO:161)intheN-terminaltail Themutant'sprotein'ssequenceistherfore: (SEQIDNO:131) MARTKHRVTRSQPRNQTDAAGASSSQAAGPTMPRGSQKKSYRYRPGTVALKEIRHFQK QTNLLIPAASFIREVRSITHMLAPPQINRWTAEALVALQEAAEDYLVGLFSDSMLCAIHA RRVTLMRKDFELARRLGGKGRPW* 401#2-2is(+15-8)/(+15-8)inG2thatcausesplicingchangeof1AA+7AA andalso9/9inG2(3AA) Themutant'sprotein'ssequenceistherfore: (SEQIDNO:132) MARTKHRVTRSQPRNQTDAAGASSSQAAGPTIVMFLPFSTPTRRGGEGGDNTQQTNPTT SPATGTRRGAKRSRQAMPRGSQKKSYRYRPGTVALKEIRHFQKQTNLLIPAASFIREVRS ITHMLAPPQINRWTAEALVALQEAAEDYLVGLFSDSMLCAIHARRVTLMRKDFELARRL GGKGRPW* 401#3-2is355/355(33AA) GPTT{TPTRRGGEGGDNTQQTNPTTSPATGTRRGAKRS}RQA(SEQIDNO:162)> GPTTRQA(SEQIDNO:163) Themutant'sprotein'ssequenceistherfore: (SEQIDNO:133) MARTKHRVTRSQPRNQTDAAGASSSQAAGPTTRQAMPRGSQKKSYRYRPGTVALKEIR HFQKQTNLLIPAASFIREVRSITHMLAPPQINRWTAEALVALQEAAEDYLVGLFSDSMLC AIHARRVTLMRKDFELARRLGGKGRPW*
From construct CenH3 G1G2-355 we got:
TABLE-US-00005 58#8is6/6(2AA)inG2AKR{SR}QAM(SEQIDNO:164)>AKRQAM(SEQ IDNO:165) Themutant'sprotein'ssequenceistherefore: (SEQIDNO:134) MARTKHRVTRSQPRNQTDAAGASSSQAAGPTTTPTRRGGEGGDNTQQTNPTTSPATGT RRGAKRQAMPRGSQKKSYRYRPGTVALKEIRHFQKQTNLLIPAASFIREVRSITHMLAPP QINRWTAEALVALQEAAEDYLVGLFSDSMLCAIHARRVTLMRKDFELARRLGGKGRP W* 37#1is408/408(54) RNQT{DAAGASSSQAAGPTTTPTRRGGEGGDNTQQTNPTTSPATGTRRGAKRSRQA MPR)GSQ(SEQIDNO:166)>RNQTGSQ(SEQIDNO:167) Themutant'sprotein'ssequenceistherefore: (SEQIDNO:135) MARTKHRVTRSQPRNQTGSQKKSYRYRPGTVALKEIRHFQKQTNLLIPAASFIREVRSIT HMLAPPQINRWTAEALVALQEAAEDYLVGLFSDSMLCAIHARRVTLMRKDFELARRLG GKGRPW*
From construct CenH3 G3-376 we got: PGP-26DNA
TABLE-US-00006 376#4-5is(+2-11)/(+2-11)(3AA)KKS{YRYR}PGT(SEQIDNO:168)> KKSMPGT(SEQIDNO:169) Themutant'sprotein'ssequenceistherefore: (SEQIDNO:136) MARTKHRVTRSQPRNQTDAAGASSSQAAGPTTTPTRRGGEGGDNTQQTNPTTSPATGT RRGAKRSRQAMPRGSQKKSMPGTVALKEIRHFQKQTNLLIPAASFIREVRSITHMLAPPQ INRWTAEALVALQEAAEDYLVGLFSDSMLCAIHARRVTLMRKDFELARRLGGKGRPW*
From construct CeH3 G5-388 we got:
TABLE-US-00007 388#5-1is(+9-3)/(+9-3)(+2AA)EIRH{FQ}KQTNL(SEQIDNO:170)> EIRHCVIKKQTNL(SEQIDNO:171) Themutant'sprotein'ssequenceistherefore: (SEQIDNO:137) MARTKHRVTRSQPRNQTDAAGASSSQAAGPTTTPTRRGGEGGDNTQQTNPTTSPATGT RRGAKRSRQAMPRGSQKKSYRYRPGTVALKEIRHCVIKKQTNLLIPAASFIREVRSITHM LAPPQINRWTAEALVALQEAAEDYLVGLFSDSMLCAIHARRVTLMRKDFELARRLGGK GRPW*
From construct CenH3 G8-391 we obtained:
TABLE-US-00008 391#2-1is3/3(1AA)GGK{K}GRPW*(SEQIDNO:172)>GGGRPW*(SEQID NO:173) Themutant'sprotein'ssequenceistherefore: (SEQIDNO:138) MARTKHRVTRSQPRNQTDAAGASSSQAAGPTTTPTRRGGEGGDNTQQTNPTTSPATGT RRGAKRSRQAMPRGSQKKSYRYRPGTVALKEIRHFQKQTNLLIPAASFIREVRSITHMLA PPQINRWTAEALVALQEAAEDYLVGLFSDSMLCAIHARRVTLMRKDFELARRLGGGRP W* 391#3-1is9/(3AA)GG{KGR}PW*(SEQIDNO:174)>GGPW*(SEQIDNO: 175) Themutant'sprotein'ssequenceistherefore: (SEQIDNO:139) MARTKHRVTRSQPRNQTDAAGASSSQAAGPTTTPTRRGGEGGDNTQQTNPTTSPATGT RRGAKRSRQAMPRGSQKKSYRYRPGTVALKEIRHFQKQTNLLIPAASFIREVRSITHMLA PPQINRWTAEALVALQEAAEDYLVGLFSDSMLCAIHARRVTLMRKDFELARRLGGPW* 391#5-3is(19/19(remove6AA+Stop>adding14newAA) RLG{GKGRPW*}(SEEQIDNO:176)>RLGDRKLTHYSHLLHCK*(SEQIDNO:177) Themutant'sprotein'ssequenceistherefore: (SEQIDNO:140) MARTKHRVTRSQPRNQTDAAGASSSQAAGPTTTPTRRGGEGGDNTQQTNPTTSPATGT RRGAKRSRQAMPRGSQKKSYRYRPGTVALKEIRHFQKQTNLLIPAASFIREVRSITHMLA PPQINRWTAEALVALQEAAEDYLVGLFSDSMLCAIHARRVTLMRKDFELARRLGDRKL THYSHLLHCK* 391#5-5is3/3(1AA)G{G}KGRPW*(SEQIDNO:178)>GKGRPW*(SEQID NO:179) Themutant'sprotein'ssequenceistherefore: (SEQIDNO:141) MARTKHRVTRSQPRNQTDAAGASSSQAAGPTTTPTRRGGEGGDNTQQTNPTTSPATGT RRGAKRSRQAMPRGSQKKSYRYRPGTVALKEIRHFQKQTNLLIPAASRIREVRSITHMLA PPQINRWTAEALVALQEAAEDYLVGLFSDSMLCAIHARRVTLMRKDFELARRLGKGRP W* 391#6-1is18/18(5AA)GG{KGRPW*}(SEQIDNO:180)>GG* Themutant'sprotein'ssequenceistherefore: (SEQIDNO:142) MARTKHRVTRSQPRNQTDAAGASSSQAAGPTTTPTRRGGEGGDNTQQTNPTTSPATGT RRGAKRSRQAMPRGSQKKSYRYRPGTVALKEIRHFQKQTNLLIPAASFIREVRSITHMLA PPQINRWTAEALVALQEAAEDYLVGLFSDSMLCAIHARRVTLMRKDFELARRLGG*
[0083] Thus we have shown that all guides tested could produce in-frame deletions and additions.
[0084] To determine whether these in-frame deletion/addition/substitution lines are haploid inducers, we crossed them by Landsberg erecta glabrous (Ler gl1-1 CE-H3) pollen. Haploid induction was assayed as elimination of maternal (cenh3 mutant derived) chromosomes leading to the production of paternal haploids, which exhibit both of the
recessive erecta and glabrous phenotypes (Kuppu 2015). In this work we scored the frequency of gl (trichomeless) progeny derived from a cross of the mutant CENH3 homozygote by pollen from Ler gl.
[0085] The mutant 388#5 (carrying a (+82)/(+82) bp in-frame addition in the -N-Helix of the HFD that resulted in a change of two AA and an addition of 2 more AA (EIRH{FQ}KQTNL (SEQ ID NO: 170)>EIRHCVIKKQTNL (SEQ ID NO: 171))), was crossed by the tester pollen (Ler gl1-1 CENH3). Among the offspring 8.6% (7 out of 81) were trichomeless, consistent with loss of the dominant maternal marker gl1 (the marker er was not tested in any of these experiments).
[0086] The mutant 376#4 (carrying a 9/9 bp in-frame deletion in the Tail-HFD junction, resulting in a change of 1 AA and deletion of 3 more AA (KKS{YRYR} (SEQ ID NO: 168)>KKSMPGT (SEQ ID NO: 169)), when crossed with the tester pollen produced 3% (4 out of 133) trichomeless offspring.
[0087] The mutant 58#8 (carring a 6/6 bp in-frame deletion in the tail domain resulting in a deletion of 2 AA (AKR{SR}QAM (SEQ ID NO: 164)>AKRQAM (SEQ ID NO: 165)), when crossed with the tester pollen produced 0.5% (2 out of 376) trichomeless offspring.
[0088] The mutant 392#2-3 (carrying a (+284)/(+284) bp in-frame addition in the tail domain that results in an addition of 8
TABLE-US-00009 AA(GPTTFPT(SEQIDNO:151)>GPTAGPISNLKFTPT (SEQIDNO:152)),
when crossed with the tester pollen, produced 0.3%(1 out of 328) trichomeless offspring.
[0089] In addition, in order to test this method in other crops we designed a construct with a gRNA to target the -N-Helix of the HFD in tomato. See
[0090] From transformation events we identified 3 plants carrying in-frame deletions of either
TABLE-US-00010 6bp(RYRP{GT}VAL(SEQIDNO:147)>RYRPVAL(SEQIDNO:148))orA9bp RY{RPG}TVAL(SEQIDNO:181)>RYTVAL(SEQIDNO:182). A6-1(A2AA)(thisisthesamealleleasSEQIDNO:101,A6-1)RYRP{GT}VAL (SEQIDNO:147)>RYRPVAL(SEQIDNO:148) (SEQIDNO:101) MARTKHLAKRSRTTSAAPSATPSTPSRKSPRSAPATSVQKPKQKKRYRPVALREIRHFQK TWDLLIPAAPFIRLVREISHFYAPGVTRWQAEALIAIQEAAEDFLVHLFEDAMLCAIHAK RVTLMKKDFELARRLGGKGQPW* A9-1 (SEQIDNO:143) MARTKHLAKRSRTTSAAPSATPSTPSRKSPRSAPATSVQKPKQKKRYTVALREIRHFQKT WDLKIPAAPFIRLVREISHFYAPGVTRWQAEALIAIQEAAEDFLVHLFEDAMLCAIHAKR VTLMKKDFELARRLGGKGQPW*
[0091] We have also derived a tomato line homozygous for allele 6-1 without the citrine:tailswap and found that it is viable and fertile. This result suggests that these in-frame deletions can produce a CENH3 that is both mitotically and meioticaly functional.
[0092] In summary, we found, that all of our guides could produce mutations that result in amino acid in-frame indels at the target site. Our results indicate that the in-frame indels in the HFD has a stronger effect on the ability to induce haploids than indels in the N-terminal tail, but indels in either domain are capable of generating a haploid-inducing allele.
[0093] It is understood that the examples and embodiments described herein are for illustrative purposes only and that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be included within the spirit and purview of this application and scope of the appended claims. All publications, patents, and patent applications cited herein are hereby incorporated by reference in their entirety for all purposes.