GENETICALLY MODIFIED PLANTS WITH ENHANCED DROUGHT TOLERANCE
20220145320 · 2022-05-12
Inventors
- Jonathan Napier (Hertfordshire, GB)
- Smita KURUP (Hertfordshire, GB)
- Michaela Matthes (Hertfordshire, GB)
Cpc classification
C12N15/8261
CHEMISTRY; METALLURGY
Y02A40/146
GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
International classification
Abstract
This invention relates to genetically modified plants with improved drought tolerance and to genetically modified cells thereof. These plants may have thicker leaves, rounder leaf shape or higher chlorophyll content. The plants may be modified to increase the expression of a plant calmodulin-binding protein IQD22, IQD23 or IQD24. The modified plants are expected to have improved yields under variable field conditions. The invention also relates to a method of genetically modifying a plant to have these characteristics. The invention further relates to isolated nucleic acids and polynucleotide constructs for the genetic engineering of such plants and their use.
Claims
1-46. (canceled)
47. A method of modifying a plant to increase yield compared to a corresponding unmodified plant, and/or whereby at least some of the leaves of the modified plant are: (i) thicker; and/or (ii) rounder in shape; and/or (iii) contain more chlorophyll per wet or dry weight, compared to a corresponding unmodified plant, comprising genetically modifying the plant to increase expression of a plant calmodulin-binding protein (IQD) compared to the unmodified plant grown under same or comparative conditions.
48. A method as claimed in claim 47, wherein the IQD protein is IQD22, IQD23 or IQD24.
49. A method as claimed in claim 47, wherein (a) the IQD protein comprises a conserved 67 amino acid domain (IQ67) having an amino acid sequence with at least 50% identity to FRGYLARRALRALKGLVRLQALVRG [SEQ ID NO: 10] or at least 50% identity to EE#AA#IQX#FRGYLARRALRALKGLVRLQALVRGX#VR+QA##TL+CMQALVR#Q AXVRARR#+# [SEQ ID NO: 11]; and or (b) wherein the protein further comprises the amino acid sequence: (i) HAIAVAAATAAVAEAA [SEQ ID NO: 15]; or (ii) (D/R/S/N)(K/Q/D/N/R/T)(H/R)(A/S)(I/V/M/S)(A/H)V(A/T)(A/E/F)(A/P)T(A/E)(A/H/V)(V/A) [SEQ ID NO: 16]; preferably DKHAIAVAAATAAV [SEQ ID NO: 17]; optionally further comprising the amino acid sequence: V(A/Q)(E/N/D/K)(A/N)(A/R)(V/Q/I/L)(A/T/N)(A/I)A(Q/E/V/N/H/R/N)AAA(A/V/E)VV(R/K/E/S/L/T)(L/F)(T/N)X [SEQ ID NO: 19], wherein X is any amino acid; optionally wherein this sequence is V(A/Q)(E/N/D/K)(A/N)(A/R)(V/Q/I/L)(A/T/N)(A/I)A(Q/E/V/N/H/R/N)AAA(A/V/E)VVRL TX [SEQ ID NO: 20] or V(A/Q)(E/N/D/K)(A/N)(A/R)(V/Q/I/L)(A/T/N)(A/I)A(Q/E/V/N/H/R/N)AAA(A/V/E) VVRLTS [SEQ ID NO: 21] V(A/Q)(E/N/D/K)(A/N)(A/R)(V/Q/I/L)(A/T/N)(A/I)A(Q/E/V/N/H/R/N)AAAEVVR LTX [SEQ ID NO: 22], and/or wherein the protein further comprises the amino acid sequence: (R/K)(R/K/T)W(S/G)F [SEQ ID NO: 33] or (R/K)(R/K/T)WSF [SEQ ID NO: 34]; preferably wherein this sequence listing is RRWSF [SEQ ID NO: 35] or RKWSF [SEQ ID NO: 36].
50. A method as claimed in claim 49, wherein the protein (a) further comprises the amino acid sequence: G(Y/D)HP(N/S)YMA(N/C)TES(Y/S)(K/R)(V/A)RS(Q/A)SAP(K/R)(Q/S)R [SEQ ID NO. 23]; and/or (b) further comprises the amino acid sequence TABLE-US-00019 [SEQ ID NO: 24] L(H/Y)SATSRSKRSAFTASSIAPSDCT(Q/K)SCC(Y/D)(A/G)DHPS YMACTESSRAKARSAPKSRPQL(Y/F)YE(Q/R)(S/P)SSKRFG(Y/F) VD(L/V)PYCGD(S/T)(R/K)SGPQK(V/G)SALHTSFMNKAYPGSGRL DRLGMPIGYRY.
51. A method as claimed in claim 47, wherein the protein has an amino acid sequence of SEQ ID NO: 1 or an amino acid sequence of at least 30% identity therewith.
52. A method as claimed in claim 47, wherein the protein has an amino acid sequence of SEQ ID NO. 4 or an amino acid sequence of at least 30% identity therewith.
53. A method as claimed in claim 47, wherein the protein has an amino acid sequence of SEQ ID NO: 7 or an amino acid sequence of at least 30% identity therewith.
54. A genetically modified plant cell having (a) an increased amount of a IQD protein as defined in claim 47 when compared to a corresponding unmodified plant cell or (b) having at least one polynucleotide sequence encoding an IQD protein as defined in claim 47 in addition to any naturally occurring homolog(s) of said protein(s) in a corresponding unmodified plant cell.
55. A genetically modified plant cell as claimed in claim 54, wherein the increased amount of IQD protein is at least 2-fold; preferably at least 7-fold, compared to the corresponding unmodified plant cell.
56. A genetically modified plant cell as claimed in claim 54, wherein the at least one additional polynucleotide sequence is under the control of a native promoter for the IQD protein.
57. A genetically modified plant cell as claimed in claim 54, wherein the at least one additional polynucleotide sequence is under the control of an heterologous promoter; optionally a constitutive promoter.
58. A genetically modified plant cell as claimed in claim 54, wherein each polynucleotide sequence is inserted into a different genomic locus.
59. A plant comprising or consisting of genetically modified plant cells as claimed in claim 54, wherein at least some of the leaves of the plant are (i) rounder in shape; and/or (ii) thicker; and/or (iii) contain more than chlorophyll per wet or dry weight, compared to a corresponding unmodified plant.
60. A plant as claimed in claim 59, wherein the at least some leaves have a palisade layer comprising more cells and/or more elongated cells compared to a corresponding unmodified plant.
61. A plant as claimed in claim 59, wherein epinastic leaf curling in a corresponding unmodified plant is abolished.
62. A plant as claimed in claim 59, wherein a stem of the plant is thicker compared a corresponding unmodified plant.
63. A plant as claimed in claim 59, wherein transpiration rate is reduced compared to an unmodified plant kept under the same conditions.
64. A plant as claimed in claim 59, wherein wilting is delayed when water is withheld compared to unmodified plant kept under the same conditions; and/or wherein a wilted plant recovers after re-watering compared to an unmodified plant kept under the same conditions which does not recover.
65. A plant as claimed in claim 59, wherein yield of the plant is increased under drought conditions compared to an unmodified plant kept under the same drought conditions, e.g. wherein the yield is total biomass, organ or part biomass, leaf biomass, yield of fruit or yield of seed.
66. A plant comprising or consisting of genetically modified plant cells as claimed in claim 54, wherein at least some of the leaves of the plant have a rate of photosynthesis which is 10% or greater than the photosynthesis rate of a control unmodified plant under the same conditions.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0068]
[0069]
[0070]
[0071]
[0072]
[0073]
[0074]
[0075]
[0076]
[0077]
[0078]
[0079]
[0080]
[0081]
[0082]
[0083]
[0084]
[0085]
[0086]
[0087]
[0088]
[0089]
[0090]
[0091]
[0092]
[0093]
[0094]
[0095]
[0096]
[0097]
[0098]
[0099]
[0100]
[0101]
[0102]
[0103]
[0104]
[0105]
[0106]
[0107]
[0108]
[0109]
[0110]
[0111]
[0112]
[0113]
[0114]
[0115]
[0116]
[0117]
[0118]
[0119]
[0120]
[0121]
[0122]
[0123]
[0124]
DETAILED DESCRIPTION
[0125] Camelina is a genus within the flowering plant family Brassicaceae. Camelina is an emerging biofuel crop, in particular Camelina sativa. The plant is useful as a biofuel feedstock due in part to its drought tolerance and minimal requirements for supplemental nitrogen and other agricultural inputs (Gehringer et al. (2006) Genome 49(12): 1555-63; Gugel and Falk (2006) Canadian Journal of Plant Science 86(4): 1047-1058). C. sativa is of course in the same family and so relatively close genetically and developmentally and in many ways to Arabidopsis thaliana as well as the common oilseed crop Brassica napus (canola). The inventors were looking to further increase the drought tolerance of Camelina sativa in various ways, and in doing so discovered unexpectedly that if IQD genes are overexpressed in Arabidopsis or Camelina plants, then the plants have an altered morphology which and significantly improved drought tolerance compared to a corresponding unmodified control plant allowed to grow and develop under the same regime. More particularly the inventors have found that if IQD 22, IQD 23 or IQD 24 genes are overexpressed in modified Arabidopsis or Camelina plants, then the plants have an altered morphology which and significantly improved drought tolerance compared to an unmodified control plant. The morphology and architecture of the modified overexpressing plants is altered, manifest as a significant increase in the thickness of leaves and stem. This also affects biomass in the altered plants. There are also marked differences in physiology, that is to say in terms of transpiration and drought tolerance, and in certain instances in photosynthetic efficiency. There are also advantages gained in terms of seed yield and productivity.
[0126] Comparing the IQD 22 overexpressing Arabidopsis plants in particular with control (wt) plants under conditions of drought, wilting of the IQD 22 overexpressor lines is considerably delayed. Re-watering of droughted plants leads to a much quicker and better recovery of the IQD 22 overexpressors.
[0127] Naturally, an important industrial application of this invention is the engineering of more drought tolerant crops and the making available of such engineered germplasm for use in connection with the generation of new plant varieties. The inventors consider that the enhancements of drought tolerance in plants, particularly in preferred aspects, of the Brassicaceae, may be achieved by overexpression of IQD genes; preferably IQD 22 and/or IQD 23 and/or IQD 24 genes; more preferably IQD 22.
[0128] As used herein, the terms “increase”, “improve” or “enhance” are used interchangeably. Also, the terms “reduce” or “decrease” are used interchangeably.
[0129] As described herein, when the amino acid sequences of any IQD protein are defined herein with respect to a reference sequence, then homologues (i.e. sequence variants) are also defined with reference to a percentage of sequence identity. In the broadest aspect of the invention these variant sequences are at least 30% identical to a reference sequence. The proteins of the IQD family are unusual in that there is a very wide degree of variation between plant species. However, there are motifs, as described by Steffen, A. et al., (2005) Ibid. where there is much narrower degree of variation between plant species. The inventors define certain consensus sequences of IQD 22, 23 and 24 or combinations therefore and which may be used in combination with the reference sequence and at least 30% identity variants to further define the IQD polypeptides which are overexpressed in accordance with the invention.
[0130] Amino acid motifs which are intended to assist in defining IQD 22 and/or IQD 23 and/or IQD24 polypeptide sequences, whether or not together with SEQ ID NO: 1 or SEQ ID NO:4 or SEQ ID NO: 7, are as follows: RRWSF [SEQ ID NO: 35]; DKHAIAVAAATAAVAEAA [SEQ ID NO: 26]; WRLT; KIQS; FRGYLA [SEQ ID NO: 29]; RALRALK [SEQ ID NO: 30] or ENSPQ [SEQ ID NO: 54], whether alone or in any combination.
[0131] Amino acid motifs which are intended to assist in defining IQD22 and/or IQD 23 polypeptide sequences, whether or not together with SEQ ID NO: 1 or SEQ ID NO:4, are as follows: KRRWSF; VVRLTS or VRGHI, whether alone or in any combination.
[0132] Amino acid motifs which are intended to assist in defining IQD23 and/or IQD 24 polypeptide sequences, whether or not together with SEQ ID NO: 4 or SEQ ID NO:7, are as follows: MGFFGRLFGSKK [SEQ ID NO: 38]; DKHAIAVAAATAAVAEAAL [SEQ ID NO: 56]; AAAEVVRLT [SEQ ID NO: 39]; RSNRRW [SEQ ID NO: 40]; AAMKIQSAFRGYLARRALRALKALVKLQALV [SEQ ID NO: 41]; GHIVRKQTADMLRRMQTLVRLQ [SEQ ID NO: 42]; ARARASRSSH [SEQ ID NO: 43]; DKILEVDTWKPH [SEQ ID NO: 44]; ESPRKR [SEQ ID NO: 45]; TPFTP [SEQ ID NO: 46]; YYSGYHPNYMANTESYKAKVRSQSAP [SEQ ID NO: 47]; SGYKRS [SEQ ID NO: 48]; or QGQYYYYT [SEQ ID NO: 49].
[0133] Amino acid motifs which are intended to assist in defining IQD22 and/or IQD 24 polypeptide sequences are as follows: RRWSF [SEQ ID NO: 35]; VVRLT [SEQ ID NO: 55]; FRGYLA [SEQ ID NO: 29]; RALRALK [SEQ ID NO: 30] or ENSPQL [SEQ ID NO: 53].
[0134] Accordingly, the invention—having regard to any reference IQD sequence, i.e. SEQ ID NO: 1, SEQ ID NO: 4 or SEQ ID NO: 7—also includes any variant of such reference sequence which is at least 31%, at least 32%, at least 33%, at least 34%, at least 35%, at least 36%, at least 37%, at least 38%, at least 39%, at least 40%, at least 41%, at least at least 42%, at least 43%, at least 44%, at least 45%, at least 46%, at least 47%, at least at least 48%, at least 49%, at least 50%, at least 51%, at least 52%, at least 53%, at least at least 54%, at least 55%, at least 56%, at least 57%, at least 58%, at least 59%, at least 60%, at least 61%, at least 62%, at least 63%, at least 64%, at least 65%, at least 66%, at least 67%, at least 68%, at least 69%, at least 70%, at least 71%, at least 72%, at least at least 73%, at least 74%, at least 75%, at least 76%, at least 77%, at least 78%, at least 79%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity thereto. Sequence identity may be determined using a global alignment algorithm known in the art, such as the Needleman Wunsch algorithm in the program GAP (GCG Wisconsin Package, Accelrys). Other methods of sequence comparison and analysis may be used; e.g Bestfit (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University Research Park, 575 Science Drive Madison, Wis. 53711).
[0135] When using a sequence alignment program to determine whether a particular sequence has for instance 95% identity with a reference sequence of the present invention, the parameters are preferably adjusted so that the percentage of identity is calculated over the entire length of the reference sequence and homology gaps of up to 5% of the total number of the nucleotides in the reference sequence are permitted. Usually these may be default settings in commonly used software packages. Preferably therefore when a sequence has a percent identity to any one of the SEQ ID NOs as detailed herein, this refers to a sequence which has the stated percent identity over the entire length of the SEQ ID NO referred to.
[0136] The degree of sequence identity of polynucleotides of the invention may, instead of being expressed as a percentage identity to reference sequence, may instead be defined in terms of hybridization to a polynucleotide of reference sequence. Hybridization of such sequences may be carried out under stringent conditions. By “stringent conditions” or “stringent hybridization conditions” is intended conditions under which a probe will hybridize to its target sequence to a detectably greater degree than to other sequences (e.g., at least 2-fold over background). Stringent conditions are sequence dependent and will be different in different circumstances. By controlling the stringency of the hybridization and/or washing conditions, target sequences that are 100% complementary to the probe can be identified (homologous probing). Alternatively, stringency conditions can be adjusted to allow some mismatching in sequences so that lower degrees of similarity are detected (heterologous probing). Generally, a probe is less than about 1000 nucleotides in length, preferably less than 500 nucleotides in length.
[0137] Typically, stringent conditions will be those in which the salt concentration is less than about 1.5 M Na.sup.+ ion, typically about 0.01 to 1.0 M Na.sup.+ ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30° C. for short probes (e.g., 10 to 50 nucleotides) and at least about 60° C. for long probes (e.g., greater than 50 nucleotides). Duration of hybridization is generally less than about 24 hours, usually about 4 to 12. Stringent conditions may also be achieved with the addition of destabilizing agents such as formamide.
[0138] When employed in the invention herein, “transgenic”, “transgene” or “recombinant” means with regard to, for example, a nucleic acid sequence, an expression cassette, gene construct or a vector comprising the nucleic acid sequence or an organism transformed with the nucleic acid sequences, expression cassettes or vectors according to the invention, all those constructions brought about by recombinant methods in which either (a) the nucleic acid sequences encoding proteins useful in the methods of the invention, or (b) genetic control sequence(s) which is operably linked with the nucleic acid sequence according to the invention, for example a promoter, or (c) a) and b) are not located in their natural genetic environment or have been modified by recombinant methods, it being possible for the modification to take the form of, for example, a substitution, addition, deletion, inversion or insertion of one or more nucleotide residues. The natural genetic environment is understood as meaning the natural genomic or chromosomal locus in the original plant or the presence in a genomic library. In the case of a genomic library, the natural genetic environment of the nucleic acid sequence is preferably retained, at least in part. The environment flanks the nucleic acid sequence at least on one side and has a sequence length of at least 50 bp, preferably at least 500 bp, especially preferably at least 1000 bp, most preferably at least 5000 bp. A naturally occurring expression cassette—for example the naturally occurring combination of the natural promoter of the nucleic acid sequences with the corresponding nucleic acid sequence encoding a polypeptide useful in the methods of the present invention, as defined above—becomes a transgenic expression cassette when this expression cassette is modified by non-natural, synthetic (“artificial”) methods such as, for example, mutagenic treatment. Suitable methods are described, for example, in U.S. Pat. No. 5,565,350 or WO 00/15815 both incorporated herein by reference.
[0139] Where the invention may provide a transgenic plant, the nucleic acids used in the method of the invention are not at their natural locus in the genome of said plant, it being possible for the nucleic acids to be expressed homologously or heterologously. Thus, the plant expresses a transgene. However, as mentioned, in certain embodiments, transgenic may means that, while the nucleic acids according to the different embodiments of the invention are at their natural position in the genome of a plant, the sequence has been modified with regard to the natural sequence, and/or that the regulatory sequences of the natural sequences have been modified, for example by mutagenesis.
[0140] Transgenic is preferably understood as meaning the expression of the nucleic acids according to the invention at an unnatural locus in the genome, i.e. homologous or, preferably, heterologous expression of the nucleic acids takes place. According to the invention, the transgene is stably integrated into the plant and the plant is preferably homozygous for the transgene.
[0141] Modified plant cells, modified plants or parts thereof as defined herein may be stably transformed with additional genetic material. Such additional genetic material is preferably under the control of at least one regulatory sequence, but a multiplicity of control points may be built in, whether using native or modified regulatory sequences.
[0142] Transformation of plants is now a routine technique in many species. Advantageously, any of several transformation methods may be used to introduce the gene of interest into a suitable ancestor cell. The methods described for the transformation and regeneration of plants from plant tissues or plant cells may be utilized for transient or for stable transformation. Transformation methods include the use of liposomes, electroporation, chemicals that increase free DNA uptake, injection of the DNA directly into the plant, particle gun bombardment, transformation using viruses or pollen and microprojection. Methods may be selected from the calcium/polyethylene glycol method for protoplasts, electroporation of protoplasts, microinjection into plant material, DNA or RNA-coated particle bombardment, infection with (non-integrative) viruses and the like. Transgenic plants, including transgenic crop plants, are preferably produced via Agrobacterium tumefaciens mediated transformation.
[0143] Transformation methods are well known in the art. Thus, according to the various aspects of the invention, an heterologous nucleic acid is introduced into a plant and expressed as a transgene. The nucleic acid sequence is introduced into said plant through a process called transformation. The term “introduction” or “transformation” as referred to herein encompasses the transfer of an exogenous polynucleotide into a host cell, irrespective of the method used for transfer. Plant tissue capable of subsequent clonal propagation, whether by organogenesis or embryogenesis, may be transformed with a genetic construct of the present invention and a whole plant regenerated therefrom. The particular tissue chosen will vary depending on the clonal propagation systems available for, and best suited to, the particular species being transformed. Exemplary tissue targets include leaf disks, pollen, embryos, cotyledons, hypocotyls, megagametophytes, callus tissue, existing meristematic tissue (e.g., apical meristem, axillary buds, and root meristems), and induced meristem tissue (e.g., cotyledon meristem and hypocotyl meristem). The polynucleotide may be transiently or stably introduced into a host cell and may be maintained non-integrated, for example, as a plasmid. Alternatively, it may be integrated into the host genome. The resulting transformed plant cell may then be used to regenerate a transformed plant in a manner well known in the art.
[0144] To select transformed plants, plant material obtained in the transformation is, as a rule, subjected to selective conditions so that transformed plants can be distinguished from untransformed plants. For example, seeds obtained in the above-described manner can be planted and, after an initial growing period, subjected to a suitable selection by spraying. A further possibility is growing the seeds, if appropriate after sterilization, on agar plates using a suitable selection agent so that only the transformed seeds can grow into plants. Alternatively, the transformed plants are screened for the presence of a selectable marker such as the ones described above. Following DNA transfer and regeneration, putatively transformed plants may also be evaluated, for instance using Southern analysis, for the presence of the gene of interest, copy number and/or genomic organisation. Alternatively or additionally, expression levels of the newly introduced DNA may be monitored using Northern and/or Western analysis, both techniques being well known in the art.
[0145] The generated transformed plants may be propagated by a variety of means, such as by clonal propagation or classical breeding techniques. For example, a first generation (or T.sub.1) transformed plant may be selfed and homozygous second-generation (or T.sub.2) transformants selected, and the T.sub.2 plants may then further be propagated through classical breeding techniques. The generated transformed organisms may take a variety of forms. For example, they may be chimeras of transformed cells and non-transformed cells; clonal transformants (e.g., all cells transformed to contain the expression cassette); grafts of transformed and untransformed tissues (e.g., in plants, a transformed rootstock grafted to an untransformed scion).
[0146] The regulatory sequence may be a promoter; optionally an inducible promoter, preferably then one which may be induced by an external stress condition. In the alternative, a constitutive promoter may be employed, e.g. cauliflower mosaic 35S.
[0147] The regulatory sequence may optionally be tissue specific. The term “regulatory element” as used herein may be considered interchangeably with “control sequence” and “promoter” and all terms are to be taken in a broad context to refer to regulatory nucleic acid sequences capable of effecting expression of the sequences to which they are ligated. The term “promoter” typically refers to a nucleic acid control sequence located upstream from the transcriptional start of a gene and which is involved binding of RNA polymerase and other proteins, thereby directing transcription of an operably linked nucleic acid. Encompassed by the aforementioned terms are transcriptional regulatory sequences derived from a classical eukaryotic genomic gene (including the TATA box which is required for accurate transcription initiation, with or without a CCAAT box sequence) and additional regulatory elements (i.e. upstream activating sequences, enhancers and silencers) which alter gene expression in response to developmental and/or external stimuli, or in a tissue-specific manner. Also included within the term is a transcriptional regulatory sequence of a classical prokaryotic gene, in which case it may include a −35 box sequence and/or −10 box transcriptional regulatory sequences.
[0148] The term “regulatory element” also encompasses a synthetic fusion molecule or derivative that confers, activates or enhances expression of a nucleic acid molecule in a cell, tissue or organ.
[0149] A “plant promoter” comprises regulatory elements, which mediate the expression of a coding sequence segment in plant cells. Accordingly, a plant promoter need not be of plant origin, but may originate from viruses or micro-organisms, for example from viruses which attack plant cells. The “plant promoter” can also originate from a plant cell, e.g. from the plant which is transformed with the nucleic acid sequence to be expressed. This also applies to other “plant” regulatory signals, such as “plant” terminators. The promoters upstream of the nucleotide sequences useful in the methods of the present invention can be modified by one or more nucleotide substitution(s), insertion(s) and/or deletion(s) without interfering with the functionality or activity of either the promoters, the open reading frame (ORF) or the 3′-regulatory region such as terminators or other 3′ regulatory regions which are located away from the ORF. Also possible is that the activity of the promoters is increased by modification of their sequence, or that they are replaced completely by more active promoters, even promoters from heterologous organisms. For expression in plants, the nucleic acid molecule is, as described above, preferably linked operably to or comprises a suitable promoter which expresses the gene at the right point in time and with the required spatial expression pattern. For the identification of functionally equivalent promoters, the promoter strength and/or expression pattern of a candidate promoter may be analysed for example by operably linking the promoter to a reporter gene and assaying the expression level and pattern of the reporter gene in various tissues of the plant. Suitable well-known reporter genes are known to the skilled person and include for example beta-glucuronidase or beta-galactosidase.
[0150] The term “operably linked” as used herein refers to a functional linkage between the promoter sequence and the gene of interest, such that the promoter sequence is able to initiate transcription of the gene of interest.
[0151] For example, the nucleic acid sequence may be expressed using a promoter that drives overexpression. Overexpression according to the invention means that the transgene is expressed at a level that is higher than expression of endogenous counterparts driven by their endogenous promoters. For example, overexpression may be carried out using a strong promoter, such as a constitutive promoter. A “constitutive promoter” refers to a promoter that is transcriptionally active during most, but not necessarily all, phases of growth and development and under most environmental conditions, in at least one cell, tissue or organ. Examples of constitutive promoters include the cauliflower mosaic virus promoter (CaMV35S or 19S), rice actin promoter, maize ubiquitin promoter, rubisco small subunit, maize or alfalfa H3 histone, OCS, SAD1 or 2, GOS2 or any promoter that gives enhanced expression. Alternatively, enhanced or increased expression can be achieved by using transcription or translation enhancers or activators and may incorporate enhancers into the gene to further increase expression. Furthermore, an inducible expression system may be used, where expression is driven by a promoter induced by environmental stress conditions (for example the pepper pathogen-induced membrane protein gene CaPIMPI or promoters that comprise the dehydration-responsive element (DRE), the promoter of the sunflower HD-Zip protein genes Hahbl or Hahb4, which is inducible by water stress, high salt concentrations and ABA or a chemically inducible promoter (such as steroid- or ethanol-inducible promoter system). The promoter may also be tissue-specific. The types of promoters listed above are described in the art. Other suitable promoters and inducible systems are also known to a person of average skill.
[0152] A green tissue-specific promoter may be used. For example, a green tissue-specific promoter may be selected from the maize orthophosphate kinase promoter, maize phosphoenolpyruvate carboxylase promoter, rice phosphoenolpyruvate carboxylase promoter, rice small subunit rubisco promoter, rice beta expansin EXBO9 promoter, pigeonpea small subunit rubisco promoter or pea RBS3A promoter.
[0153] The promoter may be a constitutive or strong promoter. In a preferred embodiment, the regulatory sequence is an inducible promoter or a stress inducible promoter. The stress inducible promoter is selected from the following non limiting list: the HaHB1 promoter, RD29A (which drives drought inducible expression of DREB1A), the maize rabl7 drought-inducible promoter, P5CS1 (which drives drought inducible expression of the proline biosynthetic enzyme P5CS1), ABA- and drought-inducible promoters of Arabidopsis clade A PP2Cs (ABM, ABI2, HAB1, PP2CA, HA11, HAI2 and HAI3) or their corresponding crop orthologues.
[0154] Altered plants in accordance with the invention advantageously may provide better yield characteristics. These may be designed into the alterations being made. Yield characteristics, also known as yield traits may comprise one or more of the following non-limitative list of features: biomass, seed yield, seed/grain size, starch content of grain, greenness index, increased growth rate. The term “yield” in general means a measurable produce of economic value, typically related to a specified crop, to an area, and to a period of time. Individual plant parts directly contribute to yield based on their number, size and/or weight, or the actual yield is the yield per square meter for a crop and year, which is determined by dividing total production (includes both harvested and appraised production) by planted square metres. The term “yield” of a plant may relate to vegetative biomass (root and/or shoot biomass), to reproductive organs, and/or to propagules (such as seeds) of that plant. Thus, according to the invention, yield comprises one or more of and can be measured by assessing one or more of: increased seed yield per plant, increased seed filling rate, increased number of filled seeds, increased harvest index, increased viability/germination efficiency, increased number or size of seeds/capsules/pods, increased growth, increased biomass or grain fill. Preferably, increased yield comprises an increased number of grains/seeds/capsules/pods, increased biomass, increased growth. Yield is usually measured relative to a control plant.
[0155] The term “plant” as used herein encompasses whole plants, ancestors and progeny of the plants and plant parts, including seeds, fruit, shoots, stems, leaves, roots (including tubers), flowers, and tissues and organs, wherein each of the aforementioned comprise the gene/nucleic acid of interest. The term “plant” also encompasses plant cells, suspension cultures, callus tissue, embryos, meristematic regions, gametophytes, sporophytes, pollen and microspores, again wherein each of the aforementioned comprises the gene/nucleic acid of interest. When referred to herein, a “plant part” may be green tissue, for example a leaf.
[0156] A monocot plant may, for example, be selected from the families Arecaceae, Amaryllidaceae or Poaceae. For example, the plant may be a cereal crop, such as wheat, rice, barley, maize, oat, sorghum, rye, millet, buckwheat, turf grass, Italian rye grass, sugarcane or Festuca species, or a crop such as onion, leek, yam or banana.
[0157] A dicot plant may be selected from the families including, but not limited to Asteraceae, Brassicaceae (e.g. Brassica napus), Chenopodiaceae, Cucurbitaceae, Leguminosae (Caesalpiniaceae, Aesalpiniaceae Mimosaceae, Papilionaceae or Fabaceae), Malvaceae, Rosaceae or Solanaceae. For example, the plant may be selected from lettuce, sunflower, Arabidopsis, broccoli, spinach, water melon, squash, cabbage, tomato, potato, yam, capsicum, tobacco, cotton, okra, apple, rose, strawberry, alfalfa, bean, soybean, field (fava) bean, pea, lentil, peanut, chickpea, apricots, pears, peach, grape vine, bell pepper, chilli or citrus species. In one embodiment, the plant is oilseed rape.
[0158] Also included are biofuel and bioenergy crops such as rape/canola, sugar cane, sweet sorghum, Panicum virgatum (switchgrass), linseed, lupin and willow, poplar, poplar hybrids, Miscanthus or gymnosperms, such as loblolly pine. Also included are crops for silage (maize), grazing or fodder (grasses, clover, sanfoin, alfalfa), fibres (e.g. cotton, flax), building materials (e.g. pine, oak), pulping (e.g. poplar), feeder stocks for the chemical industry (e.g. high erucic acid oil seed rape, linseed) and for amenity purposes (e.g. turf grasses for golf courses), ornamentals for public and private gardens (e.g. snapdragon, petunia, roses, geranium, Nicotiana sp.) and plants and cut flowers for the home (African violets, Begonias, chrysanthemums, geraniums, Coleus spider plants, Dracaena, rubber plant)
[0159] Other useful and preferred plants are maize, rice, wheat, oilseed rape/canola, sorghum, soybean, sunflower, alfalfa, potato, tomato, tobacco, grape, barley, pea, bean, field bean, lettuce, cotton, sugar cane, sugar beet, broccoli or other vegetable brassicas or poplar.
[0160] The invention also extends to harvestable parts of a plant of the invention as described above such as, but not limited to seeds, leaves, fruits, flowers, stems, roots, rhizomes, tubers and bulbs. The invention furthermore relates to products derived, preferably directly derived, from a harvestable part of such a plant, such as dry pellets or powders, oil, fat and fatty acids, starch or proteins. The invention also relates to food products and food supplements comprising the plant of the invention or parts thereof.
[0161] To the extent that any experimental or practical techniques needed are not provided in the specific examples, they are available as a matter of common general knowledge in the art to a person of ordinary skill. For example, with reference to (1) J. Sambrook, E. F. Fritsch, and T. Maniatis, 1989, “Molecular Cloning: A Laboratory Manual” Second Edition, Books 1-3, Cold Spring Harbor Laboratory Press; (2) J. M. Polak and James O'D. McGee, 1990, “In Situ Hybridization: Principles and Practice” Oxford University Press; (3) B. Roe, J. Crabtree, and A. Kahn, 1996, “DNA Isolation and Sequencing: Essential Techniques”, John Wiley & Sons; (4) Ausubel, F. M. et al. (1995 and periodic supplements “Current Protocols in Molecular Biology” chapters 9, 13, and 16, John Wiley & Sons, New York, N.Y.); (5) J. Lilley and J. E. Dahlberg, 1992, Methods of Enzymology: “DNA Structure Part A: Synthesis and Physical Analysis of DNA Methods in Enzymology” Academic Press; (6) M. J. Gait (Editor), 1984, “Oligonucleotide Synthesis: A Practical Approach” IRL Press; and (7) E. M. Shevach and W. Strober, 1992 and periodic supplements, “Current Protocols in Immunology”, John Wiley & Sons, New York, N.Y. Each of these general texts is herein incorporated by reference.
Example 1: Generation of IQD22 (At4q23060) Overexpression (OE) Lines in Arabidopsis thaliana Col-0
[0162] Arabidopsis lines were generated which overexpress IQD22 under the constitutive CaMVS35 promoter.
[0163] The preparation and construction of the genetic constructs and the transformation and culturing up and reproduction of transgenic plants was as follows:
[0164] The clone containing the IQD22 cDNA was ordered from NASC. After confirmation of the full length cDNA sequence, the cDNA was amplified with Primer NcoI F: ccatggGAAAAG CGTCACGGTG GTT [SEQ ID NO: 57]; and Primer EcoRV R: gatatc tcagtacctatacccaattggcat [SEQ ID NO: 58]; and subcloned into the TOPO vector (Invitrogen). The fragment containing the IQD22 fragment was excised using the restriction enzymes NcoI and EcoRV and cloned into the vector pJD330 which had been digested with NcoI/SmaI, thereby inserting it between the S35 promoter and the nos terminator. The fragment containing S35:IQD22:nos was excised from pJD330 using BamHI/BglII and cloned into the binary transformation vector BIN19. Transformation of Arabidopsis thaliana col-0 was performed using the floral dip method (Clough S J and Bent A F, Plant J. 1998 December; 16, 735-43). Plants containing the transgene were selected on kanamycin plates (generation T1) and taken forward to homozygosity (T3). Experiments were performed with homozygous T3 plants.
[0165] Out of twelve positive transgenic plants, four plants (30%) showed a noticeable phenotype with rounder, thicker and darker green leaves (see
[0166]
[0167] This alteration in growth is directly correlated with the expression level of the transgene as confirmed by Northern blotting as shown in
Example 2: Characterisation of the Effects of Constitutive IQD22 OE in Arabidopsis on Whole Plant Morphology
[0168] The photographs of
[0169] Measurements of the overall inflorescence height of 49 days old plants showed that stem height in the OE lines was reduced to 53% compared to wild type (see table 1 below). There was no change in internode number but there was a large reduction in internode length.
[0170] The stem diameter measured 1 cm above the rosette was increased by 60% in the IQD22 OE lines compared to WT (see table 1 below). Resin embedded stem cross sections shown in
TABLE-US-00009 TABLE 1 Morphology of inflorescence stems of wild type and IQD22 overexpressing lines Fat1 and Fat5. Main inflorescence stem Wild type Fat1 Fat5 Height (cm) 36.1 ± 3.4 19.2 ± 2.4 18.8 ± 2.2 Cauline branch number 4.9 ± 0.9 5.2 ± 0.7 5.2 ± 0.7 Diameter (mm) 1.68 ± 0.26 2.8 ± 0.4 2.7 ± 0.3 First internode 4.0 ± 1.6 2.5 ± 1.4 2.6 ± 1.2 Second internode 3.3 ± 0.7 1.4 ± 0.8 1.1 ± 0.5 Third internode 3.4 ± 0.9 1.5 ± 0.8 1.4 ± 0.7 Data are mean values ± standard deviation (WT n = 14, Fat1 n = 24, Fat5 n = 20). Plants are 49 days old.
[0171] The rosette phenotype obtained when IQD22 is overexpressed in Arabidopsis has significantly altered leaf shape, not being as elongated as WT and rounder than WT. As can be seen in
[0172] The IQD22 OE plant leaves are much thicker than WT. When they are bent they tend to snap and appear juicy—in a sense it is like the plant has become more of a succulent phenotype. Indeed, as shown in
[0173]
[0174]
[0175]
[0176]
[0177]
[0178] The IQD22 OE lines senesce later than the WT (not measured, observed visually). Leaf shape of the IQD22 OE lines is altered. The length of the leaf blades of OE is reduced to 70-60% of WT and petioles showed a length reduction of 15-33% compared to WT. Leaf width in the OE lines is also reduced to 80% of WT at 33 days.
[0179] The height of the OE plants are 50% of WT. However, when considering the cells in the stem and in particular the interfascicular fibres (between the vein areas), these are shortened, as can be observed in
[0180] The effects on the stem of OE plants is interesting. The stem is generally shorter and shows significant increase in cross sectional diameter being 60% bigger, but this is due to an increase in the number of cells rather than size of cells.
[0181]
[0182] For overexpressing plants where IQD22 is under the control of native promoter, and when grown under low white light conditions (60 μmol mm-2 sec-2)
[0183] The increased leaf thickness between wt and IQD22 overexpressing lines is reflected in an increase of leaf mass per unit area in the latter (see
Example 3: IQD22 is Expressed in the Palisades in Arabidopsis
[0184] In order to investigate where IQD22 is expressed in the plant, 1.6 kb of 5′ upstream sequence of the IQD22 gene, comprising the promoter sequence, were amplified and fused to the reporter gene GUS. With this construct, stable transgenic Arabidopsis lines were generated. In 7 day old seedlings GUS expression was detected in the hypocotyl and in what at first impression looked like vasculature (
Example 4: IQD22 Expression is Localised to Microtubules within the Palisade Cells in Arabidopsis
[0185] The same IQD22 promoter fragment used for the GUS reporter construct was cloned in front of IQD22 which had been translationally fused at its 3′ end to the fluorescent protein mCherry (see
[0186] With this construct stable transgenic Arabidopsis lines were generated and analysed. The fluorescent fusion protein was associated with the cytoskeleton of the palisade cells (
Example 5: Expression of IQD22 Under its Own Promoter Alters Leaf Shape and Results in Flattened Leaf Laminae Cells in Arabidopsis
[0187] The transgenic plants which were used for the localisation studies described in the previous section also showed an altered phenotype. Compared to the elongated leaf shape of the wildtype the leaves were much rounder and also felt thicker to the touch. In addition, the typical epinastic curling of the leaves of the wildtype is completely abolished in the lines expressing IQD22 under the control of its own native promoter (see
Example 6: Response of WT and IQD22 OE Arabidopsis Lines to Drought
[0188] It is known that plants adapted to growth in arid habitats frequently are characterised by thicker leaves. As high IQD22 OE Arabidopsis lines had much thicker leaves and stems than WT, it was investigated whether the OE lines also had an advantage over the WT to cope with water stress. Two independent transgenic lines were chosen (‘fat1’ and ‘fat5’). Pots were filled with equal amounts of compost and plants grown under a normal watering regime for 4 weeks. Before starting to withhold water, pots were weighed again and brought to an equal weight by adding water. In addition, sellotape was placed over the compost in order to minimise water loss through evaporation from the compost surface. During the period of the progressive drying of the pots, pots were weighed daily and pictures were taken to record the response of the different lines. The experiment was repeated twice.
[0189] Recording the daily weight loss of the pots, the first observation made was that water loss of the pots containing the IQD22 OE lines was much reduced when compared to WT as they consistently weighed in heavier than the pots containing the WT plants (see
[0190]
[0191] Secondly, as the series of pictures in
[0192] After 11 days of no watering, pots were thoroughly saturated with water and a normal watering regime was resumed for a couple of days. As can be seen in
Example 7: Gene Expression in Arabidopsis WT and IQD22 OE Plants
[0193] The varying ability of the WT and IQD22 OE plants to cope with progressive drought was investigated by looking at gene expression. RNAseq was performed with RNA obtained from leaf samples at different time points during the imposition of the stress.
[0194] As can be seen in
[0195] As the PCA plot in
Example 8: Measurement of Abaxial Leaf Cell Characteristics on WT and IQD22 OE Arabidopsis Plants
[0196] Looking at
Example 9: Other Miscellaneous Morphological and Cellular Observations as Between WT and IQD22 OE Arabidopsis Plants
[0197]
[0198]
[0199]
Example 10: Measurement of Stomatal Conductance on WT and IQD22 OE Arabidopsis Plants
[0200] To test whether differences in gas exchange between the WT and IQD22 OE lines could be detected measurements with the LI-6400XT Portable Photosynthesis System were performed. Data was collected at PAR (photosynthetic active radiation) levels of 500, 250, 125, 60 and 30 (μmol (photons) m.sup.−2 s.sup.−1) with 500 μmol m.sup.−2 s.sup.−1 representing saturating levels of PAR. Conductance and photosynthesis rate was measured at each light level and plotted against each other. The method of residual maximum likelihood (REML) was used to fit a multivariate linear mixed model to the photosynthesis rate and conductance data combined, testing (F-tests) for the main effects and interactions between the factors of light level and genotype, and accounting for the structure of the variance-covariance matrix underlying the data from the two variables. The plots thus obtained for IQD22 OE lines ‘fat1’ ‘fat5’ and ‘fat4’ are significantly different to WT with regard to conductance and show at the same photosynthetic rate a reduced conductance (see
Example 11: Measurement of Stomatal Numbers in WT and IQD22 OE Arabidopsis Plants
[0201] In order to investigate whether a reduced number of stomata could explain the observed decrease in water loss of the IQD22 overexpressing lines, stomata numbers per mm.sup.−2 and stomatal index were determined. Although a higher number of stomata were counted per mm.sup.−2 leaf area for the IQD22 overexpressing lines, when stomatal index was measured no differences for those lines compared to wt were observed (
Example 12: Generation of IQD23 and IQD24 OE Arabidopsis Lines
[0202] In Arabidopsis thaliana the two closest homologues of IQD22 are IQD23 (At5g62070) and IQD24 (At5g07240) (
[0203] Both IQD23 and IQD24 were cloned and tested whether they confer similar traits to Arabidopsis as IQD22 when overexpressed.
[0204] As is the case for IQD22, transgenic plants with thicker leaves were obtained for the S35:IQD23 and S35:IQD24 OE constructs. In both cases, not all the transformed plants displayed the phenotype of increased leaf thickness. This trait was shown to be related to expression levels of the transgenes. IQD23 and IQD24 OE also resulted in increases in leaf thickness and gave drought resistance compared to WT plants.
Example 13: Response of WT and IQD23 OE Arabidopsis Lines to Drought
[0205] The drought experiment was set up as already described in Example 6 for IQD22 OE lines. After 4 weeks of growth under well-watered conditions in a long day cabinet (16 hours of daylight) watering was stopped and the response of the plants to progressive drought recorded.
Example 14: Response of WT and IQD24 OE Arabidopsis Lines to Drought
[0206] Transgenic IQD24 OE were generated as described. As the case for IQD22 and IQD23, the IQD24 OE plants show increased leaf and stem thickness which is dependent on the expression level of the IQD24 transgene. The percentage of plants with a thicker phenotype obtained after the transformation was however lower than that for IQD23 and IQD22. The increase in thickness of leaves and stem is the trait that confers the ability to cope with water stress and so it is fully expected that the IQD24 OE lines also to do better under water scarce conditions compared to WT plants.
Example 15: Effect of IQD22 OE on Seed Yield Compared to WT in Arabidopsis
[0207] In order to establish whether IQD22 overexpression has an effect on seed yield, 12 plants each for wt and IQD22 OE lines ‘fat1’, ‘fat4’ and ‘fat5’ were grown to maturity and traits affecting seed yield recorded. Measurements of the length of the siliques revealed that in the IQD22 overexpressing lines silique length was in average ca 40% shorter than in wt (
[0208] As a consequence of the reduced silique length, the number of seeds obtained per silique is also reduced in the IQD22 overexpressing lines (see
[0209] Total seed yield was reduced in the IQD22 overexpressing lines (see
[0210] Where the WT and IQD22 OE lines were subjected to a severe drought stress during their growing period however, the seed yield obtained from transgenic OE lines was much higher than the one obtained from WT (see
Example 16: Generation of IQD22 (At4g23060) Overexpression (OE) Lines in Camelina Sativa ‘Celine’
[0211] The same construct as used in Arabidopsis Example 1 was used in this example Camelina sativa ‘Celine’. This is shown schematically in
[0212] Transformants were grouped into ‘strong’ or ‘intermediate’ transgenic lines depending on the strength of the phenotype shown. Both are fertile. As shown in
[0213] The ‘intermediate’ type plants are between WT and the ‘strong’ OE phenotype, having longer, thinner (i.e. more lanceolate) leaves than the strong ones. The plants are of normal height but slightly later flowering than WT. The seed pod shape was the same as that observed for the ‘strong’ lines.
[0214]
Example 17: Subjecting WT and IQD22 OE Camelina Lines to Drought Stress
[0215] This experiment was carried out in order to establish whether IQD22 OE confers the same enhanced ability to cope with water stress to Camelina as was observed for A. thaliana. Five plants displaying either the ‘strong’ or the ‘intermediate’ phenotype were planted out together with five wildtype plants. Before planting, it was ensured that all pots contained equal amounts of soil and had the same weight. After 4 weeks of growth, when flowers were about to open, the pot weights were brought again to equal weights with water and subsequently watering was stopped and the response of the plants to decreasing water availability was observed.
[0216] As reported for Arabidopsis, soil-drying occurred at a much slower speed in the case of the transgenic lines, suggesting a reduced transpiration rate for these plants. As is shown in
[0217] After 11 days without watering, WT Camelina showed distinctive stress symptoms such as severely wilted and yellowing leaves and drying of the inflorescences with shriveled flowers whereas the two transgenic lines still had green leaves (although they started to lose their turgor) and upright inflorescences. However, the ‘intermediate’ OE line showed stronger water-stress induced symptoms than the ‘strong’ line.
[0218] In both transgenic lines the water stress resulted in stunted inflorescences as the distance between seed pods was reduced when compared the growth habit of well-watered plants, as can be seen in
[0219] Watering of the plants was resumed after 11 days without water (after the pictures in
Example 18: The Effect of IQD22 OE in Camelina on Seed Yield Compared to WT Plants
[0220] The results here are based on data obtained from 5 plants for wt, ‘intermediate’ and ‘strong’ phenotype plants respectively. Fully ripened seeds obtained from each plant were harvested and weighed. This was done for plants grown under optimal water conditions and also for the droughted plants described in Example 17.
[0221] As can be seen in
[0222] However, when plants were subjected to the severe water stress of withholding water for 11 days, after which normal watering was resumed, the IQD22-overexpressing lines performed much better than the wt plants (see
[0223] The reduction of seed yield observed in the well-watered transgenic lines compared to wt was not due to a reduction of seed number per seed pod (see
Example 19: The Native IQD22 Promoter Drives Palisade Cell Specific Expression in Leaves of Arabidopsis thaliana
[0224] Transgenic A. thaliana plants are produced according to procedures described in Example 1, but the expression construct use comprises a native IQD22 promoter driving expression of a GUS reporter gene. The construct is shown in
Example 20: IQD22 Overexpression in Poplar
[0225] Wild type hybrid poplar (Populus tremula X Populus alba Institut National de la Recherche Agronomique (INRA) clone 7171-B) was used. Transgenic poplar plants were produced according to the procedures described in Example 1 using an expression construct comprising an S35 constitutive promoter, the Arabidopsis IQD22 gene and the nos termination sequence. The expression construct is shown in
[0226] What the inventors have found with the transgenic plants made so far, is that when IQD22 is expressed constitutively (e.g. with a S35 promoter) the particular phenotype is related to the relative level of overexpression of the transgene. In practice, this means that some transgenic lines with high levels of expression show what is described herein as the ‘fat’ phenotype. However, when IQD22 is overexpressed using a native promoter, all of the lines analysed so far have leaves with increased cell layers in the leaf mesophyll and elongated palisade cells.
Genetic Resources
[0227] The following genetic resources were used in the making of the invention:
[0228] Seeds of Arabidopsis thaliana col-0 were obtained from the seed collection at Rothamsted Research, Rothamsted Research, West Common, Harpenden, Hertfordshire, AL5 2JQ, United Kingdom.
[0229] Seeds of Camelina sativa ‘Celine’ were obtained the seed collection at Rothamsted Research, Rothamsted Research, West Common, Harpenden, Hertfordshire, AL5 2JQ, United Kingdom.
[0230] Wild type hybrid poplar (Populus tremula X Populus alba) Institut National de la Recherche Agronomique (INRA) clone 7171-B was obtained from the University of Malaga, Spain.
Nucleotide and Amino Acid Sequences
[0231] Arabidopsis thaliana IQD22 protein (AT4G23060.1)
TABLE-US-00010 [SEQ ID NO: 1] 1 MGKASRWFRS LFGVKKPDPG YPDLSVETPS RSTSSNLKRR WSFVKSKREK 51 ESTPINQVPH TPSLPNSTPP PPSHHQSSPR RRRKQKPMWE DEGSEDSDKH 101 AIAVAAATAA VAEAAVAAAN AAAAVVRLTS TSGRSTRSPV KARFSDGFDD 151 VVAHGSKFYG HGRDSCELAV IKIQSIFRGY LAKRALRALK GLVRLQAIVR 201 GHIERKRMSV HLRRMHALVR AQARVRATRV IVTPESSSSQ SNNTKSSHFQ 251 NPGPPTPEKL EHSISSRSSK LAHSHLFKRN GSKASDNNRL YPAHRETFSA 301 TDEEEKILQI DRKHISSYTR RNRPDMFYSS HLILDNAGLS EPVFATPFSP 351 SSSHEEITSQ FCTAENSPQL YSATSRSKRS AFTASSIAPS DCTKSCCDGD 401 HPSYMACTES SRAKARSASA PKSRPQLFYE RPSSKRFGFV DLPYCGDTKS 451 GPQKGSALHT SFMNKAYPGS GRLDRLGMPI GYRY
Arabidopsis thaliana IQD22 full length genomic DNA. Underlined sequence portion is the coding region. The start codon is marked in bold:
TABLE-US-00011 [SEQ ID NO: 2] 1 ATTTTACTTC CTCAGATTCA CATGACCAAA TCATGTAACC ATTTTCCAAA 51 TAAAATCTTT ACATTTAGAT TTAGATTCAG AGGAATTGAA TTAGCCTCAT 101 CATAATGTAT GATACTACAT ACTACACTAG TGACACGGAA ATTACACGAC 151 GAAAATAAAC AATGTGAAGA ATAACGAAAT TTCCCGGGAA AAGAGAGATA 201 GAGAGTGAGA CACACGCGCG AGTGATGCGT GTGGTAGTAA ATAGAACACT 251 GTTTGATGAT ACTGCTGCGA CTACTTAACT CTTATTACAA AGCTCTCTTT 301 TTGTTGTCTC TCTCTTGCTC TCTCTGCAAA ACTCCGGCGA GAAGAGAACG 351 TTGTCGTTTC ATTCGTATCA AAGTCTTCAT CAGCA ATGGG AAAAGCGTCA 401 CGGTGGTTTA GGAGTCTATT CGGAGTTAAG AAACCCGACC CGGGTTATCC 451 GGATCTATCC GTCGAGACGC CTTCTCGGTC AACTTCCTCT AATCTCAAAC 501 GCCGATGGAG TTTCGTCAAA TCCAAACGAG AAAAAGAAAG TACACCGATT 551 AATCAAGTTC CTCATACTCC ATCGCTACCG AATTCGACGC CTCCACCGCC 601 GTCTCACCAC CAATCGTCGC CGAGACGACG GAGAAAACAA AAGCCGATGT 651 GGGAGGATGA GGGAAGTGAA GATTCGGACA AGCATGCTAT TGCGGTGGCT 701 GCCGCGACTG CTGCGGTTGC TGAAGCTGCA GTCGCCGCCG CTAATGCTGC 751 TGCTGCGGTC GTCAGGCTGA CGAGCACAAG TGGGAGGTCG ACTCGAAGTC 801 CTGTTAAGGC ACGGTTTAGC GACGGATTCG ACGACGTGGT GGCGCATGGT 851 AGCAAGTTTT ATGGACACGG CCGTGACAGT TGTGAACTTG CGGTGATTAA 901 GATACAATCT ATATTTCGCG GATACTTG GT AATTTTCTTT CTCTAAAATT 951 TATTTCACTC GTTTTATAAA TTACTTTATG AAGCCAAAAA TAGAAATATA 1001 CATGAATTGC ATTTTTTTAG TCAAAAGTTT ATAATTTTTG ATTAAACGAA 1051 AGCTGAATGA ATACATTTGT GTTCATCATT TACACGATTT TGCATAATTA 1101 TATTTTTAAT TAGGATTAGA AAAGTATAAA ATGGTTATAG ATTTGTTAAA 1151 CATTCACGGG GATTATCGTC CAGATAACAT TTTTCGATGT CGTAAACCTT 1201 CTGCTGCTAC CACTTCCCTA GACGAGTGAT AGGACAATTA ACCTTCCATA 1251 TTCTCATTTA ATTATTTTAA TATCTTAGCA GAGCACTTGA ATATTCTTTG 1301 TAAAGTAATT TGCTTTGTGA ATACCAAATC AAATTGTATT TGCTTTTTCC 1351 TTAATTTTAT ATTCACTGGT TAGAAATTTG TTTTTCAATT TAACCGACTC 1401 AAAAATTGAT TCCGGTCCGA ACAGTATAGA TGATTCTCAA CAAGTTTAAT 1451 GCTAAGTTTA ATATTATATC CATTCTTCCA ATTAATTCGA TCCTCAACAA 1501 GTTTAATGCT AAGTTGTTTT CTTTTTTTTT TTTCTTTTAG TTTCCCCATT 1551 TGGTTACGAG TAAATGAAAG ATTTGTATAA AAATACTTAG TGGAAAATCA 1601 GTTAACATCA ACCAATACGA AGAAGTGGTC GTACTCTTGT TCTATTGGTT 1651 AGTGAATGTT ATTCACTGGA CCACTGTATA AATAACATGG TGAATGCTGG 1701 CTGCTATAAT TAGTCTATAT TTCAGTTCAT TCTCTATATG TTATAGTCTC 1751 TATGTAGTGA ATATACTATG TCTATTAATG TTTCGATGCA TGCCTATTGA 1801 AACTTTATAC TTTATAGCTT ATGGCGAATC TTACAAAAAA AAAATTTTGG 1851 AGGGGTTTTA AGATAATAGT GTGTAAGACA ATTCACACCC AAATGTGTAC 1901 TCATCGACAT AAGTCGCATT GTTCATTAAT AGTCGATTCA TCTTTTCTCT 1951 CGTAGAGAGT GACTATACAT GTAAATGTTT TTTTTATATA TGTCCCTCTA 2001 TTTAATTTTG TTGAATTCCT CGTGAATATT TCAAAAGTTC AATTTCTTTA 2051 AAAGGTCAAT ATGTTTTCTT TCTTTTTCCC TTTCTTTTGG CGCCAACTTA 2101 TATCAAAAGT AATGCTTTTT CAAATTTCAA TATGCTTTTG TGGTCCTGTC 2151 CATAATGAGA GCTTGAGCTA GTCAAGACTC TTAATGTTTT TTTAAGAGTT 2201 GATACTTAGA ATATGATCTT GTGATTATTG GCTGGCTTAT CTCCATTTCT 2251 AAGGCAATTT GTTATTGTTT GATTATTTTA ATTGTAG GCA AAGAGAGCGT 2301 TAAGGGCACT CAAGGGTTTG GTTAGGCTTC AAGCGATAGT TAGAGGCCAT 2351 ATCGAAAGAA AGAGAATGTC AGTCCATCTG CGCAGGATGC ACGCTTTGGT 2401 TCGAGCTCAG GCTCGTGTGC GTGCCACTCG GGTTATTGTC ACGCCTGAAT 2451 CTTCTTCTTC TCAATCCAAC AATACCAAAT CTTCTCACTT CCAAAACCCT 2501 GTAAGCCTTG TAAAATTCCC TATGATTGTT CCTTTTAACC TAAAACATGT 2551 CTTTTCGTTC AAGGTCCCTT TTTAACAACT CCTCTGTTTT TCTTTAAG GG 2601 TCCACCAACT CCGGAAAAAC TCGAGCATTC GATCTCTTCT CGCAGCTCCA 2651 AACTCGCTCA TTCTCATCTT TTCAAG GTAT TACATTTTCA GTTGCTGTTT 2701 GTTTCTTCTG TTTTTGTTGC TTGTGGTCCC ATTAGTTCAA AATTTCAGAG 2751 ACTATACAAG TTACTGACAT TGCTTTATGT TCAAAACAAA TCCAATCTCA 2801 AGAATTGGTT TATAGGCTTC TCATTGTTCC TTTGTTAGAC TTATCCATTT 2851 CACATTCCAT TTAAACTAAG AGATTGGATT AAATTTTCAG AGGAATGGTT 2901 CGAAGGCAAG CGACAACAAC AGACTGTACC CTGCTCACAG GGAGACATTC 2951 TCAGCCACAG ACGAAGAAGA AAAGATTCTT CAAATCGACA GGAAACACAT 3001 CAGTTCTTAC ACAAGACGCA ACAGACCAGA CATGTTCTAC TCATCCCACC 3051 TCATCCTAGA CAATGCTGGC CTGTCTGAAC CAGTTTTCGC CACGCCTTTT 3101 AGCCCGTCCT CGTCGCATGA AGAGATTACA AGCCAGTTTT GCACTGCAGA 3151 GAACAGTCCT CAGTTATACT CAGCTACTTC TAGAAGCAAA CGCAGTGCTT 3201 TCACCGCTAG TTCTATAGCA CCGAGCGATT GCACAAAAAG CTGCTGTGAT 3251 GGTGACCATC CAAGCTACAT GGCTTGTACA GAGTCCTCTA GGGCTAAGGC 3301 TAGGTCCGCT AGTGCCCCGA AGTCTCGACC ACAGTTATTT TACGAGCGGC 3351 CTTCATCAAA ACGGTTTGGA TTTGTTGATT TGCCGTACTG TGGTGATACA 3401 AAGTCCGGTC CCCAGAAAGG CTCTGCTCTG CATACTAGTT TTATGAACAA 3451 GGCTTATCCC GGTTCAGGTC GGTTGGACCG TCTCGGGATG CCAATTGGGT 3501 ATAGGTACTG A GAACGTTGT TGGACCCTTT AACCTGGTTT TGTTGTAGAT 3551 CAGTGGATTG AGCTTTGCTT CTAATTTTCT TTGTTTGTAG TTTGGAGCTA 3601 AGTGGATTTA ATCTAAAATG CAAATAGAAC TTGCTTAATC AAAATGTTTG 3651 CATCTCTATA AAGCATTGTC ATTTGCCTAG ACTTTAGTTG TTATCAAAAA 3701 TTTTATGAAT ATGAAGGAAA CCAAGAGCTA TAACAGATAT ATTTGGTGTA 3751 ATAACAAAAA CGTCTAACAC TAAAAAAATT GACAAGAAAA TGTCATGTCC 3801 ATTGGGATTT AGAGGAG
Arabidopsis thaliana IQD22 Full Length cDNA:
TABLE-US-00012 [SEQ ID NO: 3] 1 ATTTTACTTC CTCAGATTCA CATGACCAAA TCATGTAACC ATTTTCCAAA 51 TAAAATCTTT ACATTTAGAT TTAGATTCAG AGGAATTGAA TTAGCCTCAT 101 CATAATGTAT GATACTACAT ACTACACTAG TGACACGGAA ATTACACGAC 151 GAAAATAAAC AATGTGAAGA ATAACGAAAT TTCCCGGGAA AAGAGAGATA 201 GAGAGTGAGA CACACGCGCG AGTGATGCGT GTGGTAGTAA ATAGAACACT 251 GTTTGATGAT ACTGCTGCGA CTACTTAACT CTTATTACAA AGCTCTCTTT 301 TTGTTGTCTC TCTCTTGCTC TCTCTGCAAA ACTCCGGCGA GAAGAGAACG 351 TTGTCGTTTC ATTCGTATCA AAGTCTTCAT CAGCAATGGG AAAAGCGTCA 401 CGGTGGTTTA GGAGTCTATT CGGAGTTAAG AAACCCGACC CGGGTTATCC 451 GGATCTATCC GTCGAGACGC CTTCTCGGTC AACTTCCTCT AATCTCAAAC 501 GCCGATGGAG TTTCGTCAAA TCCAAACGAG AAAAAGAAAG TACACCGATT 551 AATCAAGTTC CTCATACTCC ATCGCTACCG AATTCGACGC CTCCACCGCC 601 GTCTCACCAC CAATCGTCGC CGAGACGACG GAGAAAACAA AAGCCGATGT 651 GGGAGGATGA GGGAAGTGAA GATTCGGACA AGCATGCTAT TGCGGTGGCT 701 GCCGCGACTG CTGCGGTTGC TGAAGCTGCA GTCGCCGCCG CTAATGCTGC 751 TGCTGCGGTC GTCAGGCTGA CGAGCACAAG TGGGAGGTCG ACTCGAAGTC 801 CTGTTAAGGC ACGGTTTAGC GACGGATTCG ACGACGTGGT GGCGCATGGT 851 AGCAAGTTTT ATGGACACGG CCGTGACAGT TGTGAACTTG CGGTGATTAA 901 GATACAATCT ATATTTCGCG GATACTTGGC AAAGAGAGCG TTAAGGGCAC 951 TCAAGGGTTT GGTTAGGCTT CAAGCGATAG TTAGAGGCCA TATCGAAAGA 1001 AAGAGAATGT CAGTCCATCT GCGCAGGATG CACGCTTTGG TTCGAGCTCA 1051 GGCTCGTGTG CGTGCCACTC GGGTTATTGT CACGCCTGAA TCTTCTTCTT 1101 CTCAATCCAA CAATACCAAA TCTTCTCACT TCCAAAACCC TGGTCCACCA 1151 ACTCCGGAAA AACTCGAGCA TTCGATCTCT TCTCGCAGCT CCAAACTCGC 1201 TCATTCTCAT CTTTTCAAGA GGAATGGTTC GAAGGCAAGC GACAACAACA 1251 GACTGTACCC TGCTCACAGG GAGACATTCT CAGCCACAGA CGAAGAAGAA 1301 AAGATTCTTC AAATCGACAG GAAACACATC AGTTCTTACA CAAGACGCAA 1351 CAGACCAGAC ATGTTCTACT CATCCCACCT CATCCTAGAC AATGCTGGCC 1401 TGTCTGAACC AGTTTTCGCC ACGCCTTTTA GCCCGTCCTC GTCGCATGAA 1451 GAGATTACAA GCCAGTTTTG CACTGCAGAG AACAGTCCTC AGTTATACTC 1501 AGCTACTTCT AGAAGCAAAC GCAGTGCTTT CACCGCTAGT TCTATAGCAC 1551 CGAGCGATTG CACAAAAAGC TGCTGTGATG GTGACCATCC AAGCTACATG 1601 GCTTGTACAG AGTCCTCTAG GGCTAAGGCT AGGTCCGCTA GTGCCCCGAA 1651 GTCTCGACCA CAGTTATTTT ACGAGCGGCC TTCATCAAAA CGGTTTGGAT 1701 TTGTTGATTT GCCGTACTGT GGTGATACAA AGTCCGGTCC CCAGAAAGGC 1751 TCTGCTCTGC ATACTAGTTT TATGAACAAG GCTTATCCCG GTTCAGGTCG 1801 GTTGGACCGT CTCGGGATGC CAATTGGGTA TAGGTACTGA GAACGTTGTT 1851 GGACCCTTTA ACCTGGTTTT GTTGTAGATC AGTGGATTGA GCTTTGCTTC 1901 TAATTTTCTT TGTTTGTAGT TTGGAGCTAA GTGGATTTAA TCTAAAATGC 1951 AAATAGAACT TGCTTAATCA AAATGTTTGC ATCTCTATAA AGCATTGTCA 2001 TTTGCCTAGA CTTTAGTTGT TATCAAAAAT TTTATGAATA TGAAGGAAAC 2051 CAAGAGCTAT AACAGATATA TTTGGTGTAA TAACAAAAAC GTCTAACACT 2101 AAAAAAATTG ACAAGAAAAT GTCATGTCCA TTGGGATTTA GAGGAG
Arabidopsis thaliana IQD23 Protein (AT5G62070.1)
TABLE-US-00013 [SEQ ID NO: 4] 1 MGFFGRLFGS KKKSDKAASS RDKRRWSFTT RSSNSSKRAP AVTSASVVEQ 51 NGLDADKHAI AVAAATAAVA EAALTAAHAA AEVVRLTSGN GGRNVGGGGN 101 SSVFQIGRSN RRWAQENIAA MKIQSAFRGY LARRALRALK ALVKLQALVR 151 GHIVRKQTAD MLRRMQTLVR LQSQARARAS RSSHSSASFH SSTALLFPSS 201 SSSPRSLHTR CVSNAEVSSL DHRGGSKRLD WQAEESENGD KILEVDTWKP 251 HYHPKPLRSE RNNESPRKRQ QSLLGPRSTE NSPQVGSSGS RRRTPFTPTS 301 RSEYSWGCNN YYYSGYHPNY MANTESYKAK VRSQSAPKQR VEVSNETSGY 351 KRSVQGQYYY YTAVEEESLD VGSAGYYGGG GGDSDRLNRN QSAKSRMHSS 401 FLV
Arabidopsis thaliana IQD23 Full Length Genomic DNA:
TABLE-US-00014 [SEQ ID NO: 5] 1 AAAATAAAAA AGCCTTTTAA AATCTCGTGT AACTAACTCG CTGTATGCCG 51 GCGCGTGGAA ACGTACTCTG CAATCTATAT ATAAGCCGTT AAACTCTATT 101 TAGCTTCTCT ATAGACAAAA GTTTATGTTT TGTTTTTCCT CTAACACAAA 151 AAACAATCTC AGATATTAAA TAAAGTGTTA ATCAACCGGA GAACTCTCCG 201 ACAAAACCTC GCCGACGATT TTCCGACGAT GAAAGATGGG CTTTTTCGGG 251 AGACTGTTCG GGAGTAAAAA AAAGTCTGAT AAAGCTGCTT CGTCGAGAGA 301 TAAACGGAGG TGGAGCTTCA CCACCAGATC TTCAAATTCC AGCAAGAGAG 351 CTCCGGCGGT GACGTCGGCT TCGGTGGTTG AGCAAAATGG TTTGGATGCT 401 GACAAACATG CGATAGCTGT GGCTGCTGCG ACAGCCGCGG TGGCGGAAGC 451 AGCTCTTACT GCTGCTCATG CGGCAGCTGA AGTCGTGAGA CTAACAAGCG 501 GAAACGGCGG GAGAAACGTC GGTGGCGGTG GAAACTCGTC GGTGTTTCAA 551 ATAGGAAGAA GTAACCGTCG GTGGGCTCAG GAGAATATCG CGGCGATGAA 601 GATTCAATCC GCTTTTCGTG GTTATCTGGT TAGTTTACTA ATTTCTCAAA 651 CGTTTTAGCT GATTAATTCA AGAATGATTC AAAATGTTGA ATTTTGAAAC 701 TGTTCGAAAA TTTCATAAAA CAATTTACAT AATGTGCAAT AACTGTTGAA 751 ATTCGGATTT TTTTTCTTCT CATTCACAAT TTCAAATTTT GAAAATGGAA 801 ACAATGGGTA AAACTTGTAT TGTTTTGTGT GGGGTTTGGT CCCAAAAGAA 851 ACTGACTATA GAAGAAGAAG AAGAAGACAA ACAAATGTGA AATGAAGAAG 901 GTTCTTTGAA ACTTGTACAC TCTGCCGACA CTATTTTTAG TGGTTATCAA 951 TAGTTCCAGT TTTTTTTTTT ATCTTCTTCT GTTAGTTTAT CACTCAAAAT 1001 TTTGATGAAA GATCTGATCA GAATGTCGAA ATGAACTTAG TTTAATGTTG 1051 GTGTGAATCA TTTGATTTTA GTTGTCAGAT GGGCCAAGTT TTTTTTTGTT 1101 CCACGTTATG TTGTGCTTTA TAATTCTGAA CCTTTATTGA AATTAACAAG 1151 ACTAATTAGT ATTTTACAGA TGGGTTCTTT TTTGCTTTAG GACAGCAACT 1201 TTGAAGTTGA AATCTTTAGC CCAAAAACTT TTTCAAGTTG CTTATTTATC 1251 TTCTCTTTAC TTACAAAGAT TCATTTTGAC CATGTTTTTT ACTCTTTTTC 1301 AAATTCCTAT CTGTGAATCT TGGAGTTTTT TTTGTTTTTG CTAATTTGCG 1351 TTAAATTATT CCAAGATCTT ATGTGTGTTG TCGAATCTTG ACCAATGTGT 1401 AAAGAAGAAC TCGTGTGTGT TTGTGTGTGT ACATTTGACT TTGAGACATA 1451 GTTATGAGAA TGGTGGGCTC TGTTGTAATT ACTAAATCTA GTTTTTGGTA 1501 CAAGTGGAAG TGGTTGAGAA ACTAAAAGCT AAATAGAATA ATTTCACAAA 1551 CAACTAATCC CTTTACTACT CTTTTTCTGT GTGTTTTAAT GTTAGTTGGC 1601 AAAACGATGA TTATGTCTTC AACAAATGAT TGAACTCGCC TTGTCTTTGT 1651 TATTCACTTT CAAGCAAACT CAAATTTTGA TTATTCATGG TGGAGCCAAT 1701 TTTTTACTCC AAGTTCTTTC TATATGGAGT AAATATTCAT TTTTTGAACA 1751 TAAATTAAAG ACTCTATATG AAGTTTATTG GTTTGCTTTG GCTACTTTGG 1801 AAAATTTTCA GGCGAGGAGA GCATTACGAG CACTAAAGGC ATTAGTGAAG 1851 CTTCAAGCAT TAGTGAGAGG ACACATAGTG AGAAAACAAA CTGCAGATAT 1901 GCTTAGAAGG ATGCAGACTC TTGTTCGTCT TCAATCTCAA GCTCGTGCTC 1951 GAGCCTCTCG TTCTTCTCAC TCCTCTGCTT CTTTCCACTC CTCCACCGCT 2001 CTTTTGTTCC CATCTTCCTC GTCTTCTCCA CGTTCTCTTC ACACGCGCTG 2051 CGTTTCAAAC GCTGAAGTCA GCTCTCTTGA CCACCGTGGA GGCTCTAAGC 2101 GGTTAGATTG GCAGGCGGAG GAAAGCGAAA ATGGAGACAA GATCCTAGAA 2151 GTGGATACTT GGAAGCCTCA TTATCATCCC AAACCGTTAC GTTCAGAGAG 2201 AAACAATGAG TCTCCGAGGA AACGACAACA ATCTTTGTTG GGTCCGAGAA 2251 GTACAGAGAA TAGTCCTCAA GTTGGTTCTA GTGGGTCAAG AAGAAGAACT 2301 CCTTTTACGC CGACGTCAAG AAGCGAGTAT TCTTGGGGAT GTAATAACTA 2351 TTACTACTCG GGTTATCACC CGAATTACAT GGCTAACACT GAGTCTTATA 2401 AAGCTAAAGT TCGGTCACAA AGTGCGCCGA AACAGAGAGT TGAGGTCTCT 2451 AATGAGACCA GTGGCTACAA GAGATCTGTT CAGGGACAGT ATTACTACTA 2501 CACAGCGGTA GAAGAAGAGA GTTTGGATGT TGGAAGCGCT GGTTACTACG 2551 GAGGAGGAGG AGGCGATTCT GATCGATTGA ATCGGAACCA AAGTGCGAAA 2601 TCGAGGATGC ATTCTTCGTT TCTTGTTTAG ATTGTGATTC TCTTTCTCTT 2651 CTTTTTTTTG GTTTGAGTTT GGTAATTTTC CAGAAGAGAT AAGTATTAAC 2701 ACTGGATGTG TAAATTGTTG TTAACAAGTT TCGATTGCTT GCAAAAGAAG 2751 GAACACCACT CTGTTCTCTA TCTTTGTTGA TTAGTGTTCA ATTAAGATTA 2801 ATCAAACTGA GTCAAACCCC ACAATTTA
Arabidopsis thaliana IQD23 Full Length cDNA
TABLE-US-00015 [SEQ ID NO: 6] 1 AAAATAAAAA AGCCTTTTAA AATCTCGTGT AACTAACTCG CTGTATGCCG 51 GCGCGTGGAA ACGTACTCTG CAATCTATAT ATAAGCCGTT AAACTCTATT 101 TAGCTTCTCT ATAGACAAAA GTTTATGTTT TGTTTTTCCT CTAACACAAA 151 AAACAATCTC AGATATTAAA TAAAGTGTTA ATCAACCGGA GAACTCTCCG 201 ACAAAACCTC GCCGACGATT TTCCGACGAT GAAAGATGGG CTTTTTCGGG 251 AGACTGTTCG GGAGTAAAAA AAAGTCTGAT AAAGCTGCTT CGTCGAGAGA 301 TAAACGGAGG TGGAGCTTCA CCACCAGATC TTCAAATTCC AGCAAGAGAG 351 CTCCGGCGGT GACGTCGGCT TCGGTGGTTG AGCAAAATGG TTTGGATGCT 401 GACAAACATG CGATAGCTGT GGCTGCTGCG ACAGCCGCGG TGGCGGAAGC 451 AGCTCTTACT GCTGCTCATG CGGCAGCTGA AGTCGTGAGA CTAACAAGCG 501 GAAACGGCGG GAGAAACGTC GGTGGCGGTG GAAACTCGTC GGTGTTTCAA 551 ATAGGAAGAA GTAACCGTCG GTGGGCTCAG GAGAATATCG CGGCGATGAA 601 GATTCAATCC GCTTTTCGTG GTTATCTGGC GAGGAGAGCA TTACGAGCAC 651 TAAAGGCATT AGTGAAGCTT CAAGCATTAG TGAGAGGACA CATAGTGAGA 701 AAACAAACTG CAGATATGCT TAGAAGGATG CAGACTCTTG TTCGTCTTCA 751 ATCTCAAGCT CGTGCTCGAG CCTCTCGTTC TTCTCACTCC TCTGCTTCTT 801 TCCACTCCTC CACCGCTCTT TTGTTCCCAT CTTCCTCGTC TTCTCCACGT 851 TCTCTTCACA CGCGCTGCGT TTCAAACGCT GAAGTCAGCT CTCTTGACCA 901 CCGTGGAGGC TCTAAGCGGT TAGATTGGCA GGCGGAGGAA AGCGAAAATG 951 GAGACAAGAT CCTAGAAGTG GATACTTGGA AGCCTCATTA TCATCCCAAA 1001 CCGTTACGTT CAGAGAGAAA CAATGAGTCT CCGAGGAAAC GACAACAATC 1051 TTTGTTGGGT CCGAGAAGTA CAGAGAATAG TCCTCAAGTT GGTTCTAGTG 1101 GGTCAAGAAG AAGAACTCCT TTTACGCCGA CGTCAAGAAG CGAGTATTCT 1151 TGGGGATGTA ATAACTATTA CTACTCGGGT TATCACCCGA ATTACATGGC 1201 TAACACTGAG TCTTATAAAG CTAAAGTTCG GTCACAAAGT GCGCCGAAAC 1251 AGAGAGTTGA GGTCTCTAAT GAGACCAGTG GCTACAAGAG ATCTGTTCAG 1301 GGACAGTATT ACTACTACAC AGCGGTAGAA GAAGAGAGTT TGGATGTTGG 1351 AAGCGCTGGT TACTACGGAG GAGGAGGAGG CGATTCTGAT CGATTGAATC 1401 GGAACCAAAG TGCGAAATCG AGGATGCATT CTTCGTTTCT TGTTTAGATT 1451 GTGATTCTCT TTCTCTTCTT TTTTTTGGTT TGAGTTTGGT AATTTTCCAG 1501 AAGAGATAAG TATTAACACT GGATGTGTAA ATTGTTGTTA ACAAGTTTCG 1551 ATTGCTTGCA AAAGAAGGAA CACCACTCTG TTCTCTATCT TTGTTGATTA 1601 GTGTTCAATT AAGATTAATC AAACTGAGTC AAACCCCACA ATTTA
Arabidopsis thaliana IQD24 Protein (AT5G07240.1)
TABLE-US-00016 [SEQ ID NO: 7] 1 MGFFGRLFGS KKQEKATPNR RRWSFATRSS HPENDSSSHS SKRRGDEDVL 51 NGDKHAIAVA AATAAVAEAA LAAARAAAEV VRLTNGGRNS SVKQISRSNR 101 RWSQEYKAAM KIQSAFRGYL ARRALRALKA LVKLQALVKG HIVRKQTADM 151 LRRMQTLVRL QARARASRSS HVSDSSHPPT LMIPSSPQSF HARCVSEAEY 201 SKVIAMDHHH NNHRSPMGSS RLLDQWRTEE SLWSAPKYNE DDDKILEVDT 251 WKPHFRESPR KRGSLVVPTS VENSPQLRSR TGSSSGGSRR KTPFTPARSE 301 YEYYSGYHPN YMANTESYKA KVRSQSAPRQ RLQDLPSESG YKRSIQGQYY 351 YYTPAAERSF DQRSDNGIAG YRGVSDGLDR NQSDKSKMYT SFFSSNPLFF 401 Q
Arabidopsis thaliana IQD24 Full Length DNA
TABLE-US-00017 [SEQ ID NO: 8] 1 TTCTCCATTC CCAATTGTCT CTTCTTTTCT TTTTGTACTT GTCAAAAACA 51 AAAAGAACAA CAAAAAAAAT CTCAACCGTA GAAAATTCCG ACAAGAGTTC 101 AGTTCATACA ATGAACTAAG TATGGGTTTC TTTGGAAGAC TGTTCGGAAG 151 TAAGAAGCAA GAAAAGGCAA CACCGAACAG ACGAAGATGG AGCTTCGCTA 201 CTAGATCCTC ACATCCCGAG AATGATTCGT CTTCTCATTC AAGCAAGAGA 251 CGTGGGGATG AAGATGTCTT AAACGGCGAC AAGCATGCGA TAGCCGTCGC 301 GGCTGCTACA GCTGCAGTGG CTGAAGCCGC ACTCGCTGCT GCTCGTGCGG 351 CGGCGGAAGT CGTGAGACTC ACCAATGGTG GTAGAAACTC GTCGGTAAAA 401 CAAATCAGTC GGAGTAATCG TCGGTGGTCT CAAGAGTATA AAGCAGCTAT 451 GAAGATTCAA TCCGCTTTTC GTGGCTACTT GGTGAGTTAA TTACTTCTCT 501 AAGTTTAATC TTAATACTCA ACTTCATAAT CTATAATTTT GTTTGAAAGA 551 AATTAAGTTT TTGAGCCAAA GTGTGGAATA GCTGTCTGAT GTTGATTTTT 601 CTACTTAAAC CCAAATTGAA ATATTTACTA AACAAAAAAA AGAAAGAATG 651 AGAATAATTT GTATTTATTT GTATTGGTCC CCGAAAAAGA CTAGGAAAGG 701 AGACAAATGA ATGAATAATG ATACCAAGAA ACTTGTTTAG AGCCGATACG 751 ATTCTCTACC GAGTACCGAC ACTATTTAAT GGTACTACTC TAGAATTGTA 801 TAGATAAAAT TTGGTCCCAA AAAATATGCA AATATTATCT CAAACAAAAT 851 CTTACCTAAC TAGATTTAGC TAAGGGCTTA TTCTTTGTAA TAAATTAATT 901 TTTACATTAA ACCAAGTCAG CAATAAAGAA TAACAAGGAG AGGGCCAAGT 951 TCTTTATTTT ATCTACTTTC TTTTTAGTTT CCAAAGTTCC AAAGTGTATT 1001 TTACTATAAA GGTGAAACTA CAGATTGGTT TTTAAGCTTT TTGAGCTTTA 1051 AAGTTGCATC ACCACTGTTA CTATCAAAGT TGCAACCTAC CTTTTTCATA 1101 ATTTGACAGA TAGGTTATGT AGTTTAGACA CATAATTTAG AGATTCTCTA 1151 ATCTTATTGC TTCGTATATA ACAAATAAGG TGTGTATCAT TTAGAAGAAA 1201 ATGATAGTGG TTGGGAAACT ATGAGTAATT TAATAAGGAG TATTAGTGTA 1251 TTACTTTGTG GTTTGAATTA AGTGTGTAAC CAAGTTTTCT TTAATTCAGG 1301 CGAGGAGGGC GTTGAGAGCA CTGAAGGCAT TAGTGAAGCT TCAAGCGTTG 1351 GTGAAGGGAC ACATAGTAAG GAAACAAACG GCTGATATGC TGCGTCGAAT 1401 GCAAACGCTG GTTCGGCTCC AAGCACGAGC TAGAGCTTCG CGTTCTTCTC 1451 ACGTTTCTGA CTCTTCCCAT CCGCCAACAC TAATGATTCC ATCTTCCCCA 1501 CAATCTTTCC ATGCACGATG CGTTTCAGAG GCTGAGTACA GTAAAGTCAT 1551 TGCCATGGAT CACCACCACA ACAACCACCG TTCACCGATG GGTTCAAGCC 1601 GGTTATTAGA CCAATGGAGG ACAGAGGAAA GTCTATGGAG CGCACCAAAG 1651 TACAATGAAG ATGATGACAA AATCCTAGAA GTCGACACTT GGAAGCCTCA 1701 CTTCAGAGAG TCACCAAGGA AAAGAGGATC TCTAGTGGTT CCTACAAGTG 1751 TGGAGAACAG TCCACAATTA AGGTCTAGAA CAGGAAGCAG CAGTGGTGGT 1801 TCAAGGAGAA AAACTCCCTT CACGCCTGCG AGAAGCGAGT ACGAGTACTA 1851 CTCTGGGTAT CACCCTAACT ACATGGCTAA CACTGAGTCT TACAAAGCAA 1901 AAGTCCGATC ACAAAGCGCA CCAAGACAGA GACTACAAGA TTTACCTTCA 1951 GAGAGTGGTT ACAAGAGGTC TATACAGGGA CAGTATTACT ACTACACGCC 2001 TGCTGCAGAG CGATCGTTTG ATCAGCGTTC GGATAACGGG ATCGCGGGTT 2051 ACAGAGGAGT TTCTGATGGG TTAGATCGAA ACCAAAGTGA CAAATCGAAG 2101 ATGTACACTT CGTTTTTCAG TTCTAATCCT CTTTTCTTTC AATAGTCGAG 2151 AAAGGATGAA AAAAGTGAGT GGAATGTGTA AAATTAGATT TCGACACACG 2201 AGTACAGAGA CAGCCAGTGA TCAATCTGTG TTTTGTACTA TTTTCTAATT 2251 GACTGTATCC AACAAGGGTC CATTCTTGTC TGATAAAAAA ACTTCAATAA 2301 TTTGAAGTGA TGTCAAGTCA AGACGTGGGA ATCACCACTT AAAGCAATGA 2351 AAATTGATTG ATATAACCTT TCATATTAAA CT
Arabidopsis thaliana IQD24 cDNA
TABLE-US-00018 [SEQ ID NO: 9] 1 TTCTCCATTC CCAATTGTCT CTTCTTTTCT TTTTGTACTT GTCAAAAACA 51 AAAAGAACAA CAAAAAAAAT CTCAACCGTA GAAAATTCCG ACAAGAGTTC 101 AGTTCATACA ATGAACTAAG TATGGGTTTC TTTGGAAGAC TGTTCGGAAG 151 TAAGAAGCAA GAAAAGGCAA CACCGAACAG ACGAAGATGG AGCTTCGCTA 201 CTAGATCCTC ACATCCCGAG AATGATTCGT CTTCTCATTC AAGCAAGAGA 251 CGTGGGGATG AAGATGTCTT AAACGGCGAC AAGCATGCGA TAGCCGTCGC 301 GGCTGCTACA GCTGCAGTGG CTGAAGCCGC ACTCGCTGCT GCTCGTGCGG 351 CGGCGGAAGT CGTGAGACTC ACCAATGGTG GTAGAAACTC GTCGGTAAAA 401 CAAATCAGTC GGAGTAATCG TCGGTGGTCT CAAGAGTATA AAGCAGCTAT 451 GAAGATTCAA TCCGCTTTTC GTGGCTACTT GGCGAGGAGG GCGTTGAGAG 501 CACTGAAGGC ATTAGTGAAG CTTCAAGCGT TGGTGAAGGG ACACATAGTA 551 AGGAAACAAA CGGCTGATAT GCTGCGTCGA ATGCAAACGC TGGTTCGGCT 601 CCAAGCACGA GCTAGAGCTT CGCGTTCTTC TCACGTTTCT GACTCTTCCC 651 ATCCGCCAAC ACTAATGATT CCATCTTCCC CACAATCTTT CCATGCACGA 701 TGCGTTTCAG AGGCTGAGTA CAGTAAAGTC ATTGCCATGG ATCACCACCA 751 CAACAACCAC CGTTCACCGA TGGGTTCAAG CCGGTTATTA GACCAATGGA 801 GGACAGAGGA AAGTCTATGG AGCGCACCAA AGTACAATGA AGATGATGAC 851 AAAATCCTAG AAGTCGACAC TTGGAAGCCT CACTTCAGAG AGTCACCAAG 901 GAAAAGAGGA TCTCTAGTGG TTCCTACAAG TGTGGAGAAC AGTCCACAAT 951 TAAGGTCTAG AACAGGAAGC AGCAGTGGTG GTTCAAGGAG AAAAACTCCC 1001 TTCACGCCTG CGAGAAGCGA GTACGAGTAC TACTCTGGGT ATCACCCTAA 1051 CTACATGGCT AACACTGAGT CTTACAAAGC AAAAGTCCGA TCACAAAGCG 1101 CACCAAGACA GAGACTACAA GATTTACCTT CAGAGAGTGG TTACAAGAGG 1151 TCTATACAGG GACAGTATTA CTACTACACG CCTGCTGCAG AGCGATCGTT 1201 TGATCAGCGT TCGGATAACG GGATCGCGGG TTACAGAGGA GTTTCTGATG 1251 GGTTAGATCG AAACCAAAGT GACAAATCGA AGATGTACAC TTCGTTTTTC 1301 AGTTCTAATC CTCTTTTCTT TCAATAGTCG AGAAAGGATG AAAAAAGTGA 1351 GTGGAATGTG TAAAATTAGA TTTCGACACA CGAGTACAGA GACAGCCAGT 1401 GATCAATCTG TGTTTTGTAC TATTTTCTAA TTGACTGTAT CCAACAAGGG 1451 TCCATTCTTG TCTGATAAAA AAACTTCAAT AATTTGAAGT GATGTCAAGT 1501 CAAGACGTGG GAATCACCAC TTAAAGCAAT GAAAATTGAT TGATATAACC 1551 TTTCATATTA AACT IQ67 Domain consensus portion [SEQ ID NO: 10] FRGYLARRALRALKGLVRLQALVRG IQ67 Domain consensus [SEQ ID NO: 11] EE#AA#+IQX#FRGYLARRALRALKGLVRLQALVRGX#VR+QA##TL+CMQALVR#QAXVR ARR#+# >AT4G23060.1 IQD22 [SEQ ID NO: 12] atgggaaaagcgtcacggtggtttaggagtctattcggagttaagaaacccgacccgggt tatccggatctatccgtcgagacgccttctcggtcaacttcctctaatctcaaacgccga tggagtttcgtcaaatccaaacgagaaaaagaaagtacaccgattaatcaagttcctcat actccatcgctaccgaattcgacgcctccaccgccgtctcaccaccaatcgtcgccgaga cgacggagaaaacaaaagccgatgtgggaggatgagggaagtgaagattcggacaagcat gctattgcggtggctgccgcgactgctgcggttgctgaagctgcagtcgccgccgctaat gctgctgctgcggtcgtcaggctgacgagcacaagtgggaggtcgactcgaagtcctgtt aaggcacggtttagcgacggattcgacgacgtggtggcgcatggtagcaagttttatgga cacggccgtgacagttgtgaacttgcggtgattaagatacaatctatatttcgcggatac ttggcaaagagagcgttaagggcactcaagggtttggttaggcttcaagcgatagttaga ggccatatcgaaagaaagagaatgtcagtccatctgcgcaggatgcacgctttggttcga gctcaggctcgtgtgcgtgccactcgggttattgtcacgcctgaatcttcttcttctcaa tccaacaataccaaatcttctcacttccaaaaccctggtccaccaactccggaaaaactc gagcattcgatctcttctcgcagctccaaactcgctcattctcatcttttcaagaggaat ggttcgaaggcaagcgacaacaacagactgtaccctgctcacagggagacattctcagcc acagacgaagaagaaaagattcttcaaatcgacaggaaacacatcagttcttacacaaga cgcaacagaccagacatgttctactcatcccacctcatcctagacaatgctggcctgtct gaaccagttttcgccacgccttttagcccgtcctcgtcgcatgaagagattacaagccag ttttgcactgcagagaacagtcctcagttatactcagctacttctagaagcaaacgcagt gctttcaccgctagttctatagcaccgagcgattgcacaaaaagctgctgtgatggtgac catccaagctacatggcttgtacagagtcctctagggctaaggctaggtccgctagtgcc ccgaagtctcgaccacagttattttacgagcggccttcatcaaaacggtttggatttgtt gatttgccgtactgtggtgatacaaagtccggtccccagaaaggctctgctctgcatact agttttatgaacaaggcttatcccggttcaggtcggttggaccgtctcgggatgccaatt gggtataggtactga >AT5G62070.1 IQD23 [SEQ ID NO: 13] atgggctttttcgggagactgttcgggagtaaaaaaaagtctgataaagctgcttcgtcg agagataaacggaggtggagcttcaccaccagatcttcaaattccagcaagagagctccg gcggtgacgtcggcttcggtggttgagcaaaatggtttggatgctgacaaacatgcgata gctgtggctgctgcgacagccgcggtggcggaagcagctcttactgctgctcatgcggca gctgaagtcgtgagactaacaagcggaaacggcgggagaaacgtcggtggcggtggaaac tcgtcggtgtttcaaataggaagaagtaaccgtcggtgggctcaggagaatatcgcggcg atgaagattcaatccgcttttcgtggttatctggcgaggagagcattacgagcactaaag gcattagtgaagcttcaagcattagtgagaggacacatagtgagaaaacaaactgcagat atgcttagaaggatgcagactcttgttcgtcttcaatctcaagctcgtgctcgagcctct cgttcttctcactcctctgcttctttccactcctccaccgctcttttgttcccatcttcc tcgtcttctccacgttctcttcacacgcgctgcgtttcaaacgctgaagtcagctctctt gaccaccgtggaggctctaagcggttagattggcaggcggaggaaagcgaaaatggagac aagatcctagaagtggatacttggaagcctcattatcatcccaaaccgttacgttcagag agaaacaatgagtctccgaggaaacgacaacaatctttgttgggtccgagaagtacagag aatagtcctcaagttggttctagtgggtcaagaagaagaactccttttacgccgacgtca agaagcgagtattcttggggatgtaataactattactactcgggttatcacccgaattac atggctaacactgagtcttataaagctaaagttcggtcacaaagtgcgccgaaacagaga gttgaggtctctaatgagaccagtggctacaagagatctgttcagggacagtattactac tacacagcggtagaagaagagagtttggatgttggaagcgctggttactacggaggagga ggaggcgattctgatcgattgaatcggaaccaaagtgcgaaatcgaggatgcattcttcg tttcttgtttag AT5G07240.2 IQD24 [SEQ ID NO: 14] atgtatcacactttagcttctccattcccaattgtctcttcttttctttttgtacttgtc aaaaacaaaaagaacaacaaaaaaaatctcaaccgtagaaaattccgacaagagttcagt tcatacaatgaactaagtatgggtttctttggaagactgttcggaagtaagaagcaagaa aaggcaacaccgaacagacgaagatggagcttcgctactagatcctcacatcccgagaat gattcgtcttctcattcaagcaagagacgtggggatgaagatgtcttaaacggcgacaag catgcgatagccgtcgcggctgctacagctgcagtggctgaagccgcactcgctgctgct cgtgcggcggcggaagtcgtgagactcaccaatggtggtagaaactcgtcggtaaaacaa atcagtcggagtaatcgtcggtggtctcaagagtataaagcagctatgaagattcaatcc gcttttcgtggctacttggcgaggagggcgttgagagcactgaaggcattagtgaagctt caagcgttggtgaagggacacatagtaaggaaacaaacggctgatatgctgcgtcgaatg caaacgctggttcggctccaagcacgagctagagcttcgcgttcttctcacgtttctgac tcttcccatccgccaacactaatgattccatcttccccacaatctttccatgcacgatgc gtttcagaggctgagtacagtaaagtcattgccatggatcaccaccacaacaaccaccgt tcaccgatgggttcaagccggttattagaccaatggaggacagaggaaagtctatggagc gcaccaaagtacaatgaagatgatgacaaaatcctagaagtcgacacttggaagcctcac ttcagagagtcaccaaggaaaagaggatctctagtggttcctacaagtgtggagaacagt ccacaattaaggtctagaacaggaagcagcagtggtggttcaaggagaaaaactcccttc acgcctgcgagaagcgagtacgagtactactctgggtatcaccctaactacatggctaac actgagtcttacaaagcaaaagtccgatcacaaagcgcaccaagacagagactacaagat ttaccttcagagagtggttacaagaggtctatacagggacagtattactactacacgcct gctgcagagcgatcgtttgatcagcgttcggataacgggatcgcgggttacagaggagtt tctgatgggttagatcgaaaccaaagtgacaaatcgaagatgtacacttcgtttttcagt tctaatcctcttttctttcaatag Sequence of At4g23060 promoter used for construct of expressing IQD22 under the control of its native promoter [SEQ ID NO: 127] ctttgccaacgaatgttagttgtgggggaaacaacaattccgctggtaattttgactgaattaaaaaattagagatagaatgcaa ggagacacgtacgaacgagttcatacacgagaagcagttgaccagagtcaacaaacctgtaaaaagctaaagaaccacc gtccactcgcagtctactatatacgactaactaagattcatgcaggctcaagtttttgtacttgtatgacacgtggctagatcgtaa cggccgtttagactgatgtctcttcgtctctccgagtctttgtcctgtctttttctcattgcatgagacataccgaagattccagtggatt aattgtcggtccatttcttttttcctttttactactgacttcgagttagattctctaaagtagtacacgtgaaccataggattcgtttatgatt acataaaacctttgttagtagtgtttgttcatttcttaattctggttatacatttgaaacatatgcaaatttatcactcgacttattaatcct aagttcgtaactaagttctttttggttaattagcttatatacttctcaagatttctgattctgacttatgctataaagtatatcttcgataagt tgagtcgcatctctctcctcggaaaatattcttgtattttggacaaatttctcgtgtattgttttgatatttttgtggtcatgttgttttgtgaatt agtatataagtagataaactacgtagttttcattcgtacaactatataaccaatcgtattcgacgaaataaatagtcagaaaatatt ctatttgttatgcaataatatctcactatacagtccaaaatattaacctccatataaatctgatagtaacaacaaacaattttttctcttt gtcctataacatacataatacatttatgtttgaaagacaagagaagaacaaaaacaagaacccacaaatgtcaaaatagtga aagaattggattccgacttcgatttcccattacacaaaacgacgtagcaacggacggcgctagctattacgttaagagacgaa aaacggagtagggaccaacaaaaaaacgtgatcatggatggattattaacgatagacattggattacggcgatggacccat aaaactgacgtcatccgttatggggtccgtagtgtggtcctctaaaaagtattgacgctttttgtcatctctctgttctttaggactcgt cacaaattttacttcctcagattcacatgaccaaatcatgtaaccattttccaaataaaatctttacatttagatttagattcagagga attgaattagcctcatcataatgtatgatactacatactacactagtgacacggaaattacacgacgaaaataaacaatgtgaa gaataacgaaatttcccgggaaaagagagatagagagtgagacacacgcgcgagtgatgcgtgtggtagtaaatagaac actgtttgatgatactgctgcgactacttaactcttattacaaagctctctttttgttgtctctctcttgctctctctgcaaaactccggcg agaagagaacgttgtcgtttcattcgt start of gene (start codon underlined): [SEQ ID NO: 128] atcaaagtcttcatcagcaATGGGAAAAGCGTCACGG
[0232] Throughout the description and claims of this specification, the words “comprise” and “contain” and variations of them mean “including but not limited to”, and they are not intended to (and do not) exclude other moieties, additives, components, integers or steps. Throughout the description and claims of this specification, the singular encompasses the plural unless the context otherwise requires. In particular, where the indefinite article is used, the specification is to be understood as contemplating plurality as well as singularity, unless the context requires otherwise.
[0233] Features, integers, characteristics, compounds, chemical moieties or groups described in conjunction with a particular aspect, embodiment or example of the invention are to be understood to be applicable to any other aspect, embodiment or example described herein unless incompatible therewith. All of the features disclosed in this specification (including any accompanying claims, abstract and drawings), and/or all of the steps of any method or process so disclosed, may be combined in any combination, except combinations where at least some of such features and/or steps are mutually exclusive. The invention is not restricted to the details of any foregoing embodiments. The invention extends to any novel one, or any novel combination, of the features disclosed in this specification (including any accompanying claims, abstract and drawings), or to any novel one, or any novel combination, of the steps of any method or process so disclosed.
[0234] The readers attention is directed to all papers and documents which are filed concurrently with or previous to this specification in connection with this application and which are open to public inspection with this specification, and the contents of all such papers and documents are incorporated herein by reference.