ARGONAUTE PROTEIN FROM EUKARYOTES AND APPLICATION THEREOF
20230235306 · 2023-07-27
Inventors
- Lixin Ma (Wuhan, CN)
- Ruyi He (Wuhan, CN)
- Baotong Sun (Wuhan, CN)
- Fei Wang (Wuhan, CN)
- Yaping Wang (Wuhan, CN)
- Zhongchen Li (Wuhan, CN)
- Guangbo Yan (Wuhan, CN)
Cpc classification
C12N2310/20
CHEMISTRY; METALLURGY
C12N15/111
CHEMISTRY; METALLURGY
C12N9/22
CHEMISTRY; METALLURGY
C12N2800/80
CHEMISTRY; METALLURGY
C12N15/11
CHEMISTRY; METALLURGY
C12N15/113
CHEMISTRY; METALLURGY
International classification
C12N9/22
CHEMISTRY; METALLURGY
C12N15/11
CHEMISTRY; METALLURGY
C12N15/113
CHEMISTRY; METALLURGY
Abstract
An Argonaute protein from eukaryotes and an application thereof are provided. An amino acid sequence of the Argonaute protein is shown in SEQ ID NO: 1 or has at least 50% sequence identity with the sequence shown in SEQ ID NO: 1. The specific cleavage activity of the eukaryotic Argonaute protein on DNA is first proved, and an experimental proof for the study of interaction between the eukaryotic Argonaute protein and DNA is provided. In addition, polypeptides, nucleic acids, expression vectors, compositions, kits, and methods used therein can carry out site-specific operation on intracellular and extracellular genetic materials and can be effectively applied in many fields of biotechnology, providing a new tool for gene editing, modification, and molecular detection of Argonaute polypeptides based on eukaryotic sources.
Claims
1. An application of a eukaryotic Argonaute (eAgo) complex, comprising: specifically cleaving a target nucleic acid in vitro by using the eAgo complex; wherein the eAgo complex is formed by a combination of an Argonaute protein and a guide molecule; wherein the Argonaute protein has a nuclease activity, and an amino acid sequence of the Argonaute protein is shown in SEQ ID NO: 1; wherein a nucleotide sequence of a nucleic acid molecule encoding the Argonaute protein is shown in SEQ ID NO: 3; and wherein the guide molecule is one of a guide ssDNA and a guide RNA selected from a group consisting of a 5′-terminal phosphorylated guide RNA, a 5′-terminal hydroxylated guide RNA, a 5′-terminal phosphorylated guide ssDNA, and a 5′-terminal hydroxylated guide ssDNA; and a length of the guide ssDNA is in a range of 12 to 30 nucleotides.
2. The application according to claim 1, specifically comprising: making the eAgo complex contact with the target nucleic acid to specifically cleave the target nucleic acid by the eAgo complex, wherein the target nucleic acid contains a nucleotide sequence complementary to at least 12 bases of the guide molecule.
3. The application according to claim 1, wherein the Argonaute protein has the nuclease activity at a temperature in a range of 10~65 Celsius degree (°C).
4. The application according to claim 1, wherein the Argonaute protein has the nuclease activity in a solution of bivalent metal cations, and the bivalent metal cations are at least one selected from a group consisting of Fe.sup.2+, Co.sup.2+, Ni.sup.2+, Cu.sup.2+, Zn.sup.2+, Mg.sup.2+, Mn.sup.2+, and Ca.sup.2+.
5. The application according to claim 4, wherein the divalent metal cations are at least one of Mn.sup.2+ and Mg.sup.2+.
6. The application according to claim 1, wherein the nuclease activity of the Argonaute protein has at least one of single-base specificity and double-base specificity.
Description
BRIEF DESCRIPTION OF DRAWINGS
[0050] In order to illustrate technical solutions of embodiments of the disclosure more clearly, the following will briefly introduce the drawings required to be used in the embodiments of the disclosure. Apparently, the drawings described below are only some embodiments of the disclosure. For those skilled in the art, other drawings can be obtained from these drawings without creative work.
[0051]
[0052]
[0053]
[0054]
[0055]
[0056]
[0057]
[0058]
[0059]
[0060]
[0061]
[0062]
[0063]
[0064]
DETAILED DESCRIPTION OF EMBODIMENTS
[0065] In order to make purposes, technical solutions and advantages of the disclosure clearer, the disclosure will be further described in detail in combination with embodiments and drawings. It should be understood that the specific embodiments described herein are only used to explain the disclosure and not to limit the disclosure.
[0066] The disclosure provides an eAgo protein from eukaryotes, and the eAgo protein is any of the following:
[0067] i) a protein with an amino acid sequence as shown in SEQ ID NO: 1. The protein is derived from Thermomycetes thermophilus eukaryotes, named TteAgo protein. A sequence of a nucleic acid molecule encoding the TteAgo protein is shown in SEQ ID NO: 3.
[0068] ii) a protein with at least 50% sequence identity with the amino acid sequence as shown in SEQ ID NO: 1 and the same function as the protein with the amino acid sequence as shown in SEQ ID NO: 1; preferably, at least 80% sequence identity with the amino acid sequence as shown in SEQ ID NO: 1; more preferably, at least 90% sequence identity with the amino acid sequence as shown in SEQ ID NO: 1; and most preferably, at least 95% sequence identity with the amino acid sequence as shown in SEQ ID NO: 1. A sequence of nucleic acid molecule encoding this type of protein is: a polynucleotide sequence hybridizing with the DNA molecule shown in SEQ ID NO: 3 under a strict condition, or a nucleotide sequence having at least 50%, at least 80%, at least 90% or at least 95% sequence identity with the sequence shown in SEQ ID NO: 3.
[0069] In an embodiment, the eAgo protein has binding activity to a guide RNA (also referred to as gRNA) and a guide single stranded DNA (ssDNA), and has nuclease activity to a target RNA (also referred to as tRNA) and a target DNA (also referred to as tDNA). Therefore, when the guide RNA or the guide ssDNA having most of pairing with the sequence of the target RNA or the target DNA binds to the eAgo protein to form an eAgo complex (also referred to as eAgo-guide complex), and when the eAgo-guide complex binds to the target RNA or the target DNA, site-specific cleavage of the target DNA or the target RNA can occur.
[0070] The guide molecule can be 5′-terminal phosphorylated RNA and/or 5′-terminal phosphorylated ssDNA, or hydroxylated RNA and/or hydroxylated ssDNA. The guide molecule can contain 5′-terminal-triphosphate.
[0071] In an embodiment, a length of the guide ssDNA is 12 to 30 nucleotides, preferably 15 to 20 nucleotides, such as 16, 17 or 18 nucleotides.
[0072] In an embodiment, the eAgo protein has nuclease activity in a temperature range of 25-65° C.; advantageously and preferably, the eAgo protein of the disclosure has the nuclease activity at 37° C.
[0073] In an embodiment, the nuclease activity of the eAgo protein requires the presence of cations, which are any one or any combination selected from the group consisting of Fe.sup.2+, Co.sup.2+, Ni.sup.2+, Cu.sup.2+, Zn.sup.2+, Mg.sup.2+, Mn.sup.2+, and Ca.sup.2+. Preferably, the cations are Mn.sup.2+ and Mg.sup.2+. A concentration range of the cations can vary from about 0.01 millimoles per liter (mM) to about 2000 mM; more preferably, the concentration range is from about 0.05 mM to about 20 mM.
[0074] In some embodiments, the N-terminal and/or C-terminal of the eAgo protein have multiple nuclear localization sequences (NLS).
[0075] In some embodiments, the target RNA has no advanced structure. In other embodiments, the target RNA has an advanced structure. Other possible target RNAs include a double-stranded RNA, an RNA transcribed in vitro, a viral genome RNA, a messenger RNA (mRNA) and other RNA in the cell.
[0076] Specifically, a length of the eAgo protein in the disclosure is 1082 amino acids as shown in SEQ ID NO: 1, or a longer or shorter continuous fragment of amino acids. The number of amino acids (longer or shorter) may be 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, ... (consecutive digits), and/or 1082.
[0077] The above continuous amino acids are defined as functional fragments that include or are less than the total length of the eAgo protein (1082 amino acids) described in the disclosure, but retain the formation of the eAgo-guide complex with the guide molecule and the site-specific cleavage activity on the target RNA and/or the target DNA.
[0078] The eAgo protein and the eAgo complex with nuclease activity can specifically cleave the target RNA or the target DNA in vivo or in vitro, and in vivo is intracellular.
[0079] The disclosure also provides a method of genetic material for site-specific modification in cells, specifically: introducing an expression vector containing a polynucleotide sequence encoding the eAgo protein into the cell, and simultaneously or not simultaneously introducing one or more guide RNA and/or guide ssDNA, so as to express the eAgo protein in the cell.
[0080] In some embodiments, the site-specific modification method occurs in isolated cells. In other embodiments, the method in the disclosure may occur in situ cells, which may be living tissues, organs or animals including humans.
[0081] The eAgo protein in the site-specific modification method may be encoded by an expression vector. In other such methods, one or more eAgo proteins may be encoded by two expression vectors. In some embodiments, one expression vector can encode all eAgos proteins.
[0082] The expression vector in the site-specific modification method may be contained in a viral vector, such as a lentivirus vector or a retrovirus vector.
[0083] The site-specific modification method may be used in eukaryotic cells.
[0084] The kit provided by the disclosure include the following three types: [0085] kit 1, including: the eAgo protein described in the disclosure, and a guide RNA and/or a guide ssDNA; [0086] kit 2, including: an expression vector containing a polynucleotide sequence encoding the eAgo protein, and the guide RNA and/or the guide ssDNA; and [0087] kit 3, including: a virus vector containing the expression vector, and a virus vector encoding the guide RNA and/or the guide ssDNA.
[0088] When the eAgo protein of the disclosure has the binding activity to the guide RNA or the guide ssDNA, but has no nuclease activity to the target DNA and the target RNA, the guide RNA or the guide ssDNA having most of pairing with the target RNA or the target DNA binds to the eAgo protein to form the eAgo-guide complex, and when the eAgo-guide complex binds to the target RNA or the target DNA, site-specificity of the target RNA or the target DNA is blocked.
[0089] The eAgo protein without nuclease activity can be prepared as follows: a new nuclease activity is created by mutating one or more amino acid residues essential for the catalytic activity of the eAgo protein, especially the loss of endonuclease activity. That is to say, at least one amino acid in the evolutionarily conserved amino acid quadruplet (i.e., DEDD) is mutated. Therefore, the mutation may be a single amino acid change in any one or more of the following amino acid sequences of the TteAgo protein: [0090] FVGYDVTHP, shown in SEQ ID NO: 27; [0091] KSRVEQVGGK, shown in SEQ ID NO: 28; [0092] VIFRDGVSE, shown in SEQ ID NO: 29; and [0093] AYYADLVAA, shown in SEQ ID NO: 30.
[0094] More specifically, the amino acid change is a single change at one or more of the highlighted residues. Preferably, a single mutation is a non-conservative substitution, such as from D (i.e., aspartic acid) to A (i.e., alanine), or from E (i.e., glutamic acid) to A. Therefore, any substitution other than D to E or E to D is possible.
[0095] In addition to substitution, one or more highlighted residues can be simply deleted. In an embodiment, one or more amino acids in the amino acid sequence can be deleted continuously or discontinuously, or one or more sequence motifs can be deleted as a whole. Any combination of the above changes can be made, for example, a non-conservative change in one motif and an absence of the other three motifs. The structural features of nuclease-deficient eAgo protein in the disclosure can include any structural changes as defined above for the eAgo protein with nuclease activity, such as the sequence identity range compared with the reference sequence, the composition of the eAgo protein in terms of amino acid domain and the total length in terms of amino acid. The definition of the guide is similar to the guide used in the eAgo protein with nuclease activity in the disclosure.
[0096] For the eAgo complex without nuclease activity, this means that there is an advantageous method to block specific sites in the target DNA or the RNA through specific sequence recognition. The target can be single-stranded and double-stranded. Such site-specific blocking provides an accurate means of blocking target gene transcription, or blocking, disrupting or interfering with specific sites involved in gene expression regulation.
[0097] Therefore, the disclosure provides a method for site-specific targeted blocking of target nucleic acid in cells, including the following steps: mixing an eAgo protein without nuclease activity with a guide RNA or a guide ssDNA to form an eAgo complex; transferring the eAgo complex into the cells (such as through transformation, transfection, fiber injection, etc.), and the guide sequence is substantially complementary to the nucleotide sequence contained in the target nucleic acid.
[0098] Based on this, the method for site-specific targeted blocking of target nucleic acid in cells can also adopt the following steps: transfecting, transforming or transducing the cells with the expression vector containing a nucleic acid molecule for encoding the eAgo protein without nuclease activity; transfecting, transforming or transducing a first guide RNA sequence or a first guide ssDNA sequence and a second guide ssRNA sequence or a second guide ssDNA sequence; in which at least one guide molecule sequence is substantially complementary to the nucleotide sequence contained in the target nucleic acid, and the eAgo protein generated by expression in the cell and the guide molecule form the eAgo complex capable of blocking specific sites.
[0099] In an embodiment, the method for site-specific blocking of target polynucleotides using the eAgo protein without nuclease activity can be targeted to destroy gene expression and/or the control elements of the gene expression, such as promoters or enhancers.
[0100] Among the various methods for site-specific blocking of the target DNA or the target RNA, particularly preferred or optional aspects refer to the eAgo protein without nuclease activity defined in the disclosure.
Embodiment 1 Expression and Purification of TteAgo Protein
[0101] The pET28a-CL7-TteAgo plasmid is transformed into Escherichia coli BL21(DE3), and a single colony is inoculated into a Luria-Bertani (LB) liquid medium containing 50 micrograms per milliliter (.Math.g/mL) kanamycin and cultured in a shake flask at 37° C. and 220 revolutions per minute (rpm). When the optical density at 600 nanometers (OD600) reaches 0.8, the bacteria are moved to a shaker at 18° C. and induced by isopropylthio-β-galactoside (IPTG) overnight. The bacteria are collected by centrifugation at 6000 rpm for 10 minutes (min), washed with Buffer A (including 20 mM Tris-HCl pH 7.4, 500 mM NaCl, and 10 mM imidazole), suspended in the Buffer A, added phenylmethanesulfonyl fluoride (PMSF) at a final concentration of 1 mM, and disrupted under high pressure. Then, the supernatant is collected by centrifugation at 18000 rpm for 30 min. After the supernatant is filtered, nickel-nitrilotriacetic acid (Ni-NTA) purification is performed. An amino acid sequence of CL7-TteAgo fusion protein is shown in SEQ ID NO: 2, and a polynucleotide sequence of CL7-TteAgo fusion protein is shown in SEQ ID NO: 4.
[0102] A column is washed with the Buffer A containing 10 mM imidazole (added in three times) for 10 column volumes, then the column is washed with 200 mM imidazole for 5 column volumes, and samples are taken for sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) detection. 200 mM imidazole eluant (containing high purity target protein) is collected and incubated with activated agarose beads coupled with Im7 protein. The fusion expressed TteAgo-CL7 fusion protein can be specifically combined with the Im7 protein, specifically combined on the agarose beads, impurity proteins are removed by repeated elution (10 times) of high salt (1 m NaCl) and low salt (100 mM NaCl), and then the pure target protein is obtained by cleavage with 3C protease. The purified protein is collected and identified the purity with SDS-PAGE, and ultrafiltered to Buffer B (including 20 mM Tris-HCl pH 7.4, 500 mM NaCl, and 1 mM TCEP). The protein is divided into small parts and stored at -80° C. after quick freezing with liquid nitrogen.
[0103]
Embodiment 2 Cleavage Activity of TteAgo Protein
[0104] In order to evaluate which combinations of guide RNA/DNA and target RNA/DNA can be cleaved by the TteAgo protein, the activity of all possible combinations is determined in this embodiment.
[0105] The cleavage experiments are carried out at 37° C. with a molar ratio of 5:2:1 (TteAgo: guide: target). 1 uM TteAgo is mixed with 400 nM guide (i.e., guide molecule, e.g., RNA/DNA) in a reaction buffer containing 10 mM HEPES-NaOH (pH 7.5), 100 mM NaCl, 5 mM MnC12 and 5% glycerol, and incubated at 37° C. for 10 min for guide loading. The target nucleic acid (i.e., target RNA/DNA) is added to a final concentration of 200 nM. After reaction at 37° C. for 1 hour, the sample is mixed with 2x RNA loading dyes ( 95% formamide, 18 mM EDTA, 0.025% SDS and 0.025% bromophenol blue) and heated at 95° C. for 5 min to terminate the reaction. The cleavage products are analyzed by 20% denatured Tris-borate EDTA-polyacrylamide gel electrophoresis (TBE-PAGE), stained by SYBR.sup.® Gold (Invitrogen), and visualized by GelDoc™ XR+(Bio-Rad).
[0106]
[0107] In addition, the first and third amino acids D of the quadruplet DEDD catalyzed by the TteAgo are mutated into amino acid A, and the double mutant DM is recorded as TteAgo-DM. As shown in
Embodiment 3 Influence of Length of Guide Molecule on Cleavage Effect
[0108] Referring to the experimental method in the embodiment 2, the guide RNA or guide DNA with different length binds to the TteAgo to verify its activity of cleaving the target RNA or the target DNA.
[0109] The detection results are shown in
Embodiment 4 Influence of Metal Ions on Cleavage Effect
Influence of Metal Ion Type
[0110] Referring to the experimental method in the embodiment 2, different divalent metal ions, including Fe.sup.2+, Co.sup.2+, Ni.sup.2+, Cu.sup.2+, Zn.sup.2+, Mg.sup.2+, Mn.sup.2+, Ca.sup.2+, are used in a reaction buffer to verify the effect of cations on the activity of cleaving the target RNA or the target DNA.
[0111] The detection results are as shown in
Influence of Metal Ion Concentration
[0112] Mn.sup.2+ and Mg.sup.2+ are selected to find the concentration range of Mn.sup.2+ or Mg.sup.2+ in which the TteAgo shows guide-guided cleavage of the target RNA within 15 min.
[0113] The detection results are shown in
Embodiment 5 Influence of Temperature on Cleavage Effect
[0114] Referring to the experimental method in the embodiment 2, the temperature range at which the TteAgo displays guide-guided cleavage of the target RNA within 15 min is found. As shown in
[0115] Using the same method, the temperature range at which the TteAgo displays guide-guided cleavage of the target DNA within 60 min is found. As shown in
Embodiment 6
[0116] Referring to the experimental method in the embodiment 2, the influence of single-base or double-base mutation of the guide molecule on the TteAgo cleaving the target RNA or the target DNA is as follows.
Influence of Single-Base and/or Double-Base Mutations in Guide RNA on TteAgo Cleaving the Target RNA
[0117] The guide RNAs with single-base or double-base mutation are synthetized (m1 as shown in SEQ ID NO: 31, m2 as shown in SEQ ID NO: 32, m3 as shown in SEQ ID NO: 33, m4 as shown in SEQ ID NO: 34, m5 as shown in SEQ ID NO: 35, m6 as shown in SEQ ID NO: 36, m7 as shown in SEQ ID NO: 37, m8 as shown in SEQ ID NO: 38, m9 as shown in SEQ ID NO: 39, m10 as shown in SEQ ID NO: 40, m11 as shown in SEQ ID NO: 41, m12 as shown in SEQ ID NO: 42, m13 as shown in SEQ ID NO: 43, m14 as shown in SEQ ID NO: 44, m15 as shown in SEQ ID NO: 45, m16 as shown in SEQ ID NO: 46, m17 as shown in SEQ ID NO: 47, m18 as shown in SEQ ID NO: 48, m7m8 as shown in SEQ ID NO: 49, m8m9 as shown in SEQ ID NO: 50, m9m10 as shown in SEQ ID NO: 51, m10m11 as shown in SEQ ID NO: 52, m11m12 as shown in SEQ ID NO: 53, m12m13 as shown in SEQ ID NO: 54, and m13m14 as shown in SEQ ID NO: 55, specific mutations are shown in
Influence of Single-Base Mutation in Guide DNA on TteAgo Cleaving The Target RNA
[0118] The guide DNAs with single-base mutation are synthetized (m1 as shown in SEQ ID NO: 56, m2 as shown in SEQ ID NO: 57, m3 as shown in SEQ ID NO: 58, m4 as shown in SEQ ID NO: 59, m5 as shown in SEQ ID NO: 60, m6 as shown in SEQ ID NO: 61, m7 as shown in SEQ ID NO: 62, m8 as shown in SEQ ID NO: 63, m9 as shown in SEQ ID NO: 64, m10 as shown in SEQ ID NO: 65, m11 as shown in SEQ ID NO: 66, m12 as shown in SEQ ID NO: 67, m13 as shown in SEQ ID NO: 68, m14 as shown in SEQ ID NO: 69, m15 as shown in SEQ ID NO: 70, m16 as shown in SEQ ID NO: 71, m17 as shown in SEQ ID NO: 72, and m18 as shown in SEQ ID NO: 73, specific mutations are shown in
Influence of Single-Base Mutation in Guide RNA on TteAgo Cleaving Target DNA
[0119] The guide RNAs with single-base mutation are synthetized (m1, m2, m3, m4, m5, m6, m7, m8, m9, m10, m11, m12, m13, m14, m15, m16, m17, m18, specific mutations are shown in
[0120] In summary, the eukaryotic Argonaute protein provided by the disclosure has binding activity to the guide RNA and the guide ssDNA, and has nuclease activity to both the target RNA and the target DNA, and the eAgo protein of the disclosure can carry out site-specific modification on intracellular and extracellular genetic material. Therefore, it can be effectively applied in many fields of biotechnology, such as nucleic acid detection, gene editing and gene modification.
[0121] The above description is only specific embodiments of the disclosure, but the scope of protection of the disclosure is not limited thereto. Any modification, equivalent substitution and improvement made by any person skilled in the art within the technical scope disclosed by the disclosure within the spirit and principles of the disclosure shall be included by the scope of protection of the disclosure.