SPLIT SINGLE-BASE GENE EDITING SYSTEMS AND APPLICATION THEREOF
20200347407 ยท 2020-11-05
Inventors
- Dali LI (Shanghai, CN)
- Xiaohui ZHANG (Shanghai, CN)
- Liren WANG (Shanghai, CN)
- Biyun ZHU (Shanghai, CN)
- Liang Chen (Shanghai, CN)
- Mingyao Liu (Shanghai, CN)
Cpc classification
C12N9/78
CHEMISTRY; METALLURGY
C12N15/864
CHEMISTRY; METALLURGY
C12N9/22
CHEMISTRY; METALLURGY
C07K2319/80
CHEMISTRY; METALLURGY
C07K2319/92
CHEMISTRY; METALLURGY
C12N15/113
CHEMISTRY; METALLURGY
C12N15/86
CHEMISTRY; METALLURGY
International classification
C12N15/86
CHEMISTRY; METALLURGY
C12N15/113
CHEMISTRY; METALLURGY
C12N9/22
CHEMISTRY; METALLURGY
Abstract
Provided are two split single-base gene editing systems. Intein-mediated split BE3, saKKH-BE3 and ABE7.10 are separately developed by using the existing protein structure information of spCas9 and saCas9 and splitting methods thereof, and have targeted gene mutation efficiency equivalent to that of unsplit BE3, saKKH-BE3 and ABE7.10 working system, thereby making possible package into an AAV for delivery.
Claims
1. A nucleic acid construct combination comprising a first nucleic acid construct and a second nucleic acid construct, wherein the first nucleic acid construct has a structure of Formula I from 5-3:
P1-X1-X2-X3-Z-X4(I); wherein P1 is a first promoter sequence; X1 is a coding sequence of cytosine deaminase or a coding sequence of adenosine deaminase; X2 is an optional linker sequence; X3 is a coding sequence of the N-terminal fragment of Cas9 nuclease or a coding sequence of the N-terminal fragment of SaKKH nuclease (Cas9n-N or SSaKKH-N); Z is a coding sequence of the N-terminal fragment of a first fusion peptide; X4 is a polyA sequence; the second nucleic acid construct has a structure of Formula II from 5-3:
P2-Z-Y1-Y2-Y3(II); wherein P2 is a second promoter sequence; Z is a coding sequence of the C-terminal fragment of a first fusion peptide; Y1 is a coding sequence of the C-terminal fragment of Cas9 nuclease or a coding sequence of the C-terminal fragment of SSaKKH nuclease (Cas9n-C or SSaKKH-C); Y2 is a coding sequence of UGI or none; Y3 is a polyA sequence; and each - is independently a bond or a nucleotide linker sequence.
2. The nucleic acid construct combination of claim 1, wherein the first promoter and the second promoter are each independently selected from the group consisting of: a CAG promoter, CMV promoter, and a combination thereof.
3. The nucleic acid construct combination of claim 1, wherein the Cas9 nuclease is selected from the group consisting of Cas9, Cas9n, and a combination thereof.
4. The nucleic acid construct combination of claim 1, wherein the origin of the X3 element is selected from the group consisting of Streptococcus pyogenes, Staphylococcus aureus, and a combination thereof.
5. The nucleic acid construct combination of claim 1, wherein the length of the first nucleic acid construct is 4.7 kb, preferably, 4.5 kb, and more preferably, 3.0-4.5 kb.
6. The nucleic acid construct combination of claim 1, wherein the length of the second nucleic acid construct is 4.7 kb, preferably, 4.5 kb, and more preferably, 3.0-4.5 kb.
7. A vector combination, comprising a first vector and a second vector, the first vector contains a first nucleic acid construct, the second vector contains the second nucleic acid construct, the first nucleic acid construct and the second nucleic acid construct are as defined in claim 1.
8. A genetically engineered cell wherein the cell is transformed by the construct of claim 1.
9. A method for gene editing in a cell, comprising the steps of: (i) providing a cell and a first vector and a second vector, wherein the first vector contains a first nucleic acid construct, the second vector contains a second nucleic acid construct, the first nucleic acid construct and the second nucleic acid construct are as defined in claim 1, and both the first vector and the second vector are viral vectors; (ii) infecting the cell with the viral vector, thereby performing gene editing in the cell.
10. A kit, comprising: (a1) a first container, and a first vector located in the first container; and (b1) a second container, and a second vector located in the second container.
11. A genetically engineered cell wherein the cell is transformed or transfected by the vector combination of claim 7.
Description
DESCRIPTION OF DRAWINGS
[0075]
[0076]
SaCas9(D10A)(2-739) represents amino acids 2-739 of SpCas9 with a nick of D10A mutation; SaCas9(D10A)(740-1053 represents amino acids 740-1053 of SpCas9 with a nick of D10A mutation; UGI: uracil glycosidase inhibitor; BGH: PolyA sequence.
[0077]
[0078]
[0079]
[0080]
[0081]
[0082]
[0083]
DETAILED DESCRIPTION OF INVENTION
[0084] After extensive and intensive research, the inventors has first time found that intein-mediated resolution of BE3, SaKKH-BE3 and ABE7.10 were developed respectively using the existing protein structure information of spCas9 and saCas9 and their splits methods, and the split BE3, SaKKH, ABE7.10 has a higher target gene mutation efficiency, which is comparable to the target gene mutation efficiency of the unsplit BE3, SaKKH, ABE7.10 working system, providing the possibility of packaging into AAV for delivery, and promoting its wide application in gene editing, gene therapy and clinical. On this basis, the inventor has completed the present invention.
[0085] BE3 Working System
[0086] BE3, that is Base editor 3, which is formed by fusion of cytosine deaminase and spCas9 (spCas9n) with a D10A mutation derived from Streptococcus pyogenes. It uses NGG as PAM and recognizes and specifically binds DNA and a single base mutation from C to T is achieved at positions 16-19 upstream of NGG (
[0087] SaKKH-BE3 Working System
[0088] SaKKH is formed by fusion of cytosine deaminase and SaCas9 with a D10A mutation and saCas9 with E782K/N968K/R1015H mutation from Staphylococcus aureus with NNNRRT (R=A or G) as PAM and recognize and specifically bind DNA and realize C to T single base mutation at positions 11-16 upstream of PAM (
[0089] ABE7.10 Working System
[0090] ABE7.10, which is formed by fusion of adenosine deaminase with spCas9 (spCas9n) with a D10A mutation derived from Streptococcus pyogenes. It uses NGG as PAM and recognizes and specifically binds DNA and realizes a single-base mutation from A to G at positions 16-19 upstream of NGG (
[0091] Self-Cleaving Protein
[0092] Self-cleaving protein 2A is a class of 18-22 amino acid peptides. When it connects two or more proteins, its translated protein product can be cleaved between glycine and proline(Asp-Val/Ile-Glu-X-Asn-Pro-Gly-Pro) at highly conserved C-terminal of 2A. Therefore, the protein products at both ends of 2A can be functioned independently. The self-cleaving protein used in this invention is T2A derived from tetrahymenops -larvae virus (Thosea asignavirus). There are also F2A from foot-and-mouth disease virus, E2A from horse rhinitis virus, and P2A from swine Jieshen 1 virus.
[0093] In a preferred embodiment, the self-cleaving protein is a 2A sequence. The 2A sequence is from the virus and is a short peptide of 18-22 amino acids. It expresses multiple proteins in an open reading frame through self-splicing, and the self-splicing efficiency is almost 100%. Commonly used are T2A, P2A, F2A, E2A.
[0094] Fusion Peptide
[0095] In the present invention, the fusion peptide is not particularly limited, and a preferred fusion peptide is selected from the group consisting of intein, FRB/FKBP, DmC/FKBP, ABI/PYL, and a combination thereof.
[0096] In a preferred embodiment, the fusion peptide is an intein. Intein is a naturally occurring intermediate sequence, which can catalyze protein splicing reaction to make inteins and flanking peptides into tandems with natural peptide bonds. Inteins control enzyme activity at specific times based on splicing reactions and coupling of peptides from different sources.
[0097] Construct Combination of the Present Invention
[0098] The invention provides a nucleic acid construct combination, including a first construct and a second construct. It uses the existing protein structure information of spCas9 or saCas9 and their split methods, intein-mediated split BE3, SaKKH-BE3 and the split BE3, SaKKH-BE3 and ABE7.10 were developed respectively. Among them, the split BE3 and SaKKH-BE3 nucleic acid construct can achieve the C mutation to T in the endogenous gene target, which is equivalent to the targeted gene mutation efficiency of the unsplit BE3 and SaKKH-BE3 working systems. Splitting the ABE7.10 nucleic acid construct can achieve the A mutation to G in the endogenous gene target, compared with the unsplit ABE7.10, the efficiency is slightly lower. The split-type BE3, SaKKH-BE3, and ABE7.10 provide the possibility of being packaged into AAV for delivery, and promote its wide application in base editing, gene therapy and clinical. The construct combination of the invention is as described in the first aspect of the invention.
[0099] The various elements used in the construct combination of the invention are known in the field, so those technician in the field can use conventional methods, such as PCR method, fully artificial chemical synthesis method, and enzyme digestion method to obtain the corresponding elements. These elements were ligased together by well-known DNA ligation techniques and then the construct combination of the invention is formed.
[0100] Specifically, the construction, verification process and application of the construct combination in the invention are as follows:
[0101] 1. According to the human EMX1, RNF2, FANCF gene, BE3, SaKKH-BE3 and ABE7.10 target sequence were designed respectively (
[0102] 2. The above plasmids in sequence was constructed.
[0103] 3. Comparing the fixed-point editing efficiency of C to T for BE3 and BE3-N/BE3-C, SaKKH-BE3 and SaKKH-BE3-N/SaKKH-BE3-C on human endogenous gene targets (
[0104] That is, the following combination was co-transfected with 293T,
[0105] BE3: U6-sgRNA-EF1-GFP=750 ng:250 ng,
[0106] BE3-N: BE3-C: U6-sgRNA-EF1-GFP=375 ng:375 ng:250 ng
[0107] SaKKH-BE3: U6-sgRNA-EF1-GFP=750 ng:250 ng,
[0108] SaKKH-BE3-N: SaKKH-BE3-C: U6-sgRNA-EF1-GFP=375 ng:375 ng:250 ng
[0109] ABE7.10: U6-sgRNA-EF1-GFP=750 ng:250 ng,
[0110] ABE7.10-N: ABE7.10-C: U6-sgRNA-EF1-GFP=375 ng:375 ng:250 ng
[0111] 72 h after transfection, GFP positive cells were sorted by flow cytometry. According to the instructions of the HITOM kit (Nuohe Zhiyuan item number: PT026), after performing PCR using the first round of PCR identification primers (Table-2), and then the second round of PCR library construction is completed, high-throughput sequencing was performed, comparing the editing efficiency of endogenous targets for BE3 and split BE3, SaKKH-BE3 and split SaKKH-BE3, ABE7.10 and split ABE7.10 (
[0112] The results show that BE3 and split BE3, SaKKH-BE3 and split SaKKH-BE3, ABE7.10 and split ABE7.10 have a considerable editing efficiency at the endogenous target.
TABLE-US-00001 TABLE1 oligodesign SEQID name sequence NO.: EMX1site1-up CACCGAAGGACGGCGGCACCGGCGG 5 EMX1site1-dn AAACCCGCCGGTGCCGCCGTCCTT 6 EMX1site2-up CACCGACTACGTGGTGGGCGCCGAG 7 EMX1site2-dn AAACCTCGGCGCCCACCACGTAGT 8 RNF2site1-up CACCGGTCATCTTAGTCATTACCTG 9 RNF2site1-dn AAACCAGGTAATGACTAAGATGAC 10 EMX1site3-up CACCGCGGATGCACGGTCAGCGCGG 11 EMX1site3-dn AAACCCGCGCTGACCGTGCATCCG 12 FANCFsite1-up CACCGGCCGTCTCCAAGGTGAAAGC 13 FANCFsite1-dn AAACGCTTTCACCTTGGAGACGGC 14
TABLE-US-00002 TABLE2 targetidentificationprimers SEQID name sequence NO.: EMX1site1- GGAGTGAGTACGGTGTGCTCCCGCGGCT 15 HITOM-F GCGACCA EMX1site1- GAGTTGGATGCTGGATGGGGGTGAGGGT 16 HITOM-R AGTTGAGCGCC EMX1site2- GGAGTGAGTACGGTGTGCCCCTTCGCAC 17 HITOM-F GCAAGCCCAAG EMX1site2- GAGTTGGATGCTGGATGGCGGGGGTGAT 18 HITOM-R TACCTGCGTCTCG RNF2site1- GGAGTGAGTACGGTGTGCTATTTCCAGC 19 HITOM-F AATGTCTCAGGCT RNF2site1- GAGTTGGATGCTGGATGGGTTTTCATGT 20 HITOM-R TCTAAAAATGTATCCCA EMX1site3- GGAGTGAGTACGGTGTGCCTTCGTGAGT 21 HITOM-F GGCTTCCCTGCC EMX1site3- GAGTTGGATGCTGGATGGGAAGAAGGAG 22 HITOM-R TGCGGGGGCTG FANCFsite1- GGAGTGAGTACGGTGTGCTCGACCAATA 23 HITOM-F GCATTGCAGAGAGGC FANCFsite1- GAGTTGGATGCTGGATGGGCTGACGTAG 24 HITOM-R GTAGTGCTTGAGACC
[0113] The main advantages of the present invention include:
[0114] (1) In the present invention, for the first time, using the existing protein structure information of spCas9 and saCas9 and the split methods, intein-mediated resolution of BE3, SaKKH-BE3, and ABE7.10 were developed, respectively. Split-BE3, SaKKH-BE3 and ABE7.10 have a significant targeted gene mutation efficiency, which is comparable to the targeted gene mutation efficiency of the unsplit BE3, KKH working system, which provides the possibility of packaging into AAV for delivery and promoting its gene widely used in base editing, gene therapy and clinic.
[0115] (2) The split single-base gene editing system of the present invention splits BE3, SaKKH-BE3, ABE7.10 into N-terminal and C-terminal, its length is less than packaging limit (4.7 Kb) of the adeno-associated virus (AAV), which can be packaged and delivered with AAV, further expanding the scope of application of the base editor.
[0116] The present invention will be further described below in conjunction with specific case. These cases are not only used to illustrate the present invention, but also expande the scope of the present invention. The experimental methods unless specifically stated generally follow conventional conditions such as Sambrook et al. Molecular cloning: the conditions described in the laboratory manual (New York: Cold Spring Harbor Laboratory Press, 1989), or according to the conditions the manufacturer recommended. Unless otherwise stated, percentages and parts are calculated by weight. The experimental materials and reagents involved in the present invention can be obtained from commercially available channels without special instructions.
Example 1 Comparison of the Mutation Efficiency of Targeting Endogenous Gene Using Unsplit and Split BE3, SaKKH-BE3, ABE7.10
[0117] 1. Design and Construction of Split-Type Split BE3, SaKKH-BE3, ABE7.10 Plasmid Vector and sgRNA
1.1 Design of Split BE3, SaKKH-BE3, ABE7.10 Plasmid
[0118] The BE3 and ABE7.10 splits were all fused with SpCas9 (D10A). According to the structure information of SpCas9, splitting occurs between amino acids 573 and 574 of SpCas9; and SaKKH-BE3 is formed by a fusion of SaCas9, According to the structure information of SaCas9, splitting occurs between amino acids 739 and 740 of SpCas9.
1.2 Design Principle of sgRNA:
[0119] According to the working principle of CRISPR/Cas9 and BE3, KKH, the human EMX1, RNF2, FANCF target sequences were designed (
In CRISPR/Cas9, spCas9 recognizes PAM (NGG) and SaCas9 recognizes PAM (NNNRRT); and at the same time, 20 bases of complementary paired sgRNA are required for targeted binding. The sgRNA uses U6 as the promoter and requires G as the transcription start site. At the same time, U6-SpsgRNA-EF1-GFP and U6-SasgRNA-EF1-GFP are ligased into the target site by BbsI enzyme digestion sites. CACCG was needed to be added at 5 end of sgRNA oligo-up and AAAC was needed to be added at 5 end of sgRNA oligo-up, the specific sequence is designed as follows.
1.3 Construction of the Plasmid in 1.2
[0120] Synthesis of sgRNA Oligo.
1.3.1 Dissolving oligo in pure water to a final concentration of 100 M.
1.3.2 Annealing. 10 L of each of the two complementary oligos were mixed, and put in a boiling water bath for 5 min, then cooled naturally to room temperature, about 2 hours.
1.3.3 ligased. U6-SpsgRNA-EF1-GFP (BbsI) and U6-SasgRNA-EF1-GFP (BbsI) were digested with BbsI and the vectors and annealed oligo were ligated and reacted according to the following reaction system.
TABLE-US-00003 Take Biyuntian's Rapid DNA ligation kit (D7006) as an example. annealed oligo 1 L precut vector 1 L (10-50 ng) T4 DNA Ligase 1 L 10x T4 DNA Ligase buffer 1 L H2O add to 10 L
[0121] After ligased at room temperature for 60 min, 5 L mixture was transformed into 50 L of competent bacteria, coated with kanamycin-resistant plates, and incubated overnight at 37 C.
1.3.4 Two clones was picked from the overnight culture plate, inoculated in a 4-5 mL medium, shaked at 37 C., incubated overnight at 220 r/min.
1.3.5 After shaking culture overnight, the plasmids were extracted and verified by hU6 sequencing, sequencing the correct plasmid.
2. Comparison of Targeted Endogenous Gene Mutation Efficiency of Unsplit and Split BE3, SaKKH-BE3 and ABE7.10.
2.1 Plasmid Transfection
[0122] Day 1 Seeding a 24-Well Plate with 293T Cells
2.1.1 The HEK293T cells were digested and seeded a 24-well plate at 2.010.sup.5 cell/well.
[0123] Note: After the cells were recovered, they generally need to be passaged twice before being used for transfection experiments.
Day 2 Transfection
[0124] 2.1.2 The state of cells in each well was observed.
[0125] Note: The cell density before transfection should be 80%-95%, and the condition is normal. 2.1.3 To ensure the accuracy of the data and the repeatability of the experiment, the plasmid was diluted with sterile water. Diluting the plasmid concentration of each group to be consistent, or ensure that the volume of plasmid samples between the groups was the same.
[0126] The group settings were as follows:
[0127] Blank: blank control, including only cultured cells and culture medium;
[0128] The treatment groups were:
[0129] That is the combination of the following was used to transfect 293T,
[0130] BE3: U6-sgRNA-EF1-GFP=750 ng:250 ng,
[0131] BE3-N: BE3-C: U6-sgRNA-EF1-GFP=375 ng:375 ng:250 ng
[0132] SaKKH-BE3: U6-sgRNA-EF1-GFP=750 ng:250 ng,
[0133] SaKKH-BE3-N:SaKKH-BE3-C:U6-sgRNA-EF1-GFP=375 ng:375 ng:250 ng
[0134] ABE7.10: U6-sgRNA-EF1-GFP=750 ng:250 ng,
[0135] ABE7.10-N: ABE7.10-C: U6-sgRNA-EF1-GFP=375 ng:375 ng:250 ng
2.1.4 Adding DMEM (no serum, no antibiotics) to the 1.5 mL EP tube.
2.1.5 Adding the DNA plasmid to the EP tube in step (4) and mixing it well.
2.1.6 Adding PEI to the EP tube in the previous step, mixing it well, and letting stand at room temperature for 20 minutes.
2.1.7 Adding the transfection mix to the 24-well plate. Gently tapping the 24-well plate to mix well.
2.1.8 After culturing at 37 C., 5% CO.sub.2 for 72 h, GFP-positive cells were sorted by flow cytometry.
Day 5
[0136] 2.1.9 After 72 h, GFP positive cells were sorted by flow cytometry.
2.1.10 Extracting the genomic DNA of the sorted GFP-positive cells with Tiangen Cell Genome Extraction Kit (DP304), and performing PCR using the first round of PCR identification primers (Table-2) according to the instructions of the HITOM kit (Nuohe Zhiyuan Item No.: PT026). After the second round of PCR library construction is completed, high-throughput sequencing were performed. Comparing the editing efficiency of endogenous targets for BE3 and split BE3, SaKKH-BE3 and split SaKKH-BE3, ABE7.10 and split ABE7.10 (
[0137] The results show that BE3 and split BE3, SaKKH-BE3 and split SaKKH-BE3, ABE7.10 and split ABE7.10 have a considerable editing efficiency at each endogenous target.
Comparative Example 1
[0138] The method of Example 1 was used, except that co-transformation of BE3-N terminal or C terminal with the sgRNA-expressing plasmid, and co-transformation of SaKKH-BE3-N terminal or C terminal with the sgRNA-expressing plasmid and co-transformation of ABE7.10-N terminal or C terminal with the sgRNA-expressing plasmid.
[0139] The result shows that none of the endogenous gene mutations are detected.
Comparative Example 2
[0140] The method of Example 1 was used, except that there was no corresponding intein at BE3-N, C-terminal, no corresponding intein at SaKKH-BE3-N, C-terminal, ABE7.10-N or C-terminal, and they were co-transformed with sgRNA plasmid.
[0141] The result shows that none of the endogenous gene mutations are detected.
Comparative Example 3
[0142] The method of Example 1 was used, except that BE3-C and SaKKH-BE3-C were not fused with glycosidase inhibitors and co-transformed with sgRNA plasmids.
[0143] The result shows that the endogenous gene mutation efficiency is lower, about 10%, and there are also lower unexpected mutations, namely mutation of cytosine to adenine, guanine.
[0144] All literatures mentioned in the present application are incorporated by reference herein, as though individually incorporated by reference. Additionally, it should be understood that after reading the above teaching, many variations and modifications may be made by the skilled in the art, and these equivalents also fall within the scope as defined by the appended claims.