Gene knockout method
11624077 · 2023-04-11
Assignee
Inventors
- Wensheng Wei (Beijing, CN)
- Yiou Chen (Beijing, CN)
- Yuexin Zhou (Beijing, CN)
- Hongmin Zhang (Beijing, CN)
- Pengfei YUAN (Beijing, CN)
- Yuan Liu (Beijing, CN)
Cpc classification
C12N2310/20
CHEMISTRY; METALLURGY
C12N15/111
CHEMISTRY; METALLURGY
C12N9/22
CHEMISTRY; METALLURGY
G01N21/6428
PHYSICS
C12N15/90
CHEMISTRY; METALLURGY
International classification
C12N15/90
CHEMISTRY; METALLURGY
C12N15/11
CHEMISTRY; METALLURGY
C12N9/22
CHEMISTRY; METALLURGY
Abstract
A donor construct and a gene knockout method, as well as a system and kit for the gene knockout are provided. The donor construct is a linear donor DNA or can be cleaved in a cell to produce the linear donor DNA. The gene knockout method uses a marker gene contained in the donor construct to enrich cells in which a gene is knocked out, thereby improving the efficiency of generating the gene knockout by a sequence-specific nuclease.
Claims
1. A universal donor construct being a linear donor DNA or being cleavable in a cell to produce a linear donor DNA, wherein the linear donor DNA, from its middle to both ends, sequentially comprises: an expression cassette; a sequence extension consisting of a reverse termination codon located at the 5′-end of the expression cassette and a sequence extension consisting of a forward termination codon located at the 3′-end of the expression cassette; a universal target sequence located at the 5′-end and/or 3′-end, comprising a target site cleavable by a Cas9 nuclease; and protective sequences located at both ends; wherein the expression cassette comprises a promoter-driven marker gene; and wherein the universal target sequence is absent in a cell genome to be subjected to a gene knockout.
2. The universal donor construct of claim 1, which is a linear donor DNA.
3. The universal donor construct of claim 1, wherein the protective sequence is 5-30 bp.
4. A method for generating a gene knockout in a cell, comprising the steps of: (1) introducing into the cell: (a) a Cas9 nuclease; (b) a gRNA that recognizes a specific target sequence in a cell genome; (c) a universal donor construct, wherein the universal donor construct is a linear donor DNA or is cleavable in a cell to produce a linear donor DNA, and the linear donor DNA, from the middle to both ends, sequentially comprises: an expression cassette; a sequence extension consisting of a reverse termination codon located at the 5′-end of the expression cassette and a sequence extension consisting of a forward termination codon located at the 3′-end of the expression cassette; a universal target sequence located at the 5′-end and/or 3′-end, comprising a target site cleavable by a Cas9 nuclease; and protective sequences located at both ends; wherein the expression cassette comprises a promoter-driven marker gene; and wherein the universal target sequence is absent in the cell genome to be subjected to a gene knockout; and (d) a gRNA that recognizes the universal target sequence contained in the linear donor DNA; (2) inserting the linear donor DNA into a specific target site in the cell genome by non-homologous end joining; and (3) screening cells positive for the marker expression.
5. The method of claim 4, wherein the universal donor construct is a linear donor DNA.
6. The method of claim 4, wherein the gRNA that recognizes the specific target sites in the cell genome is one kind of gRNA, or more than one kind of gRNA that recognize different target sites in the cell genome.
7. The method of claim 4, wherein the marker gene is an antibiotic resistance gene or a fluorescent protein gene.
8. A system or kit for a gene knockout, comprising: (1) a Cas9 nuclease or a vector or cell capable of expressing the Cas9 nuclease; (2) a gRNA that recognizes a specific target sequence in a cell genome; (3) a universal donor construct, wherein the universal donor construct is a linear donor DNA or is cleavable in a cell to produce a linear donor DNA, and the linear donor DNA, from the middle to both ends, sequentially comprises: an expression cassette; a sequence extension consisting of a reverse termination codon located at the 5′-end of the expression cassette and a sequence extension consisting of a forward termination codon located at the 3′-end of the expression cassette; a universal target sequence located at the 5′-end and/or 3′-end, comprising a target site cleavable by a Cas9 nuclease; and protective sequences located at both ends; wherein the expression cassette comprises a promoter-driven marker gene; and wherein the universal target sequence is absent in the cell genome to be subjected to a gene knockout; and (4) a gRNA that recognizes the universal target sequence in the linear donor DNA.
9. The system or kit of claim 8, wherein the universal donor construct is a linear donor DNA.
10. The system or kit of claim 9, wherein the marker gene is an antibiotic resistance gene or a fluorescent protein gene.
11. The universal donor construct of claim 2, which is a double-stranded linear donor DNA.
12. The universal donor construct of claim 2, wherein the linear donor DNA only has the universal target sequence at the 5′-end or the 3′-end.
13. The universal donor construct of claim 2, wherein the marker gene is an antibiotic resistance gene or a fluorescent protein gene.
14. The universal donor construct of claim 3, wherein the protective sequence is 20 bp, in length.
15. The universal donor construct of claim 3, wherein the universal target sequence comprises 5′-GTACGGGGCGATCATCCACA-3′ or 5′-AATCGACTCGAACTTCGTGT-3′.
16. The method of claim 5, wherein the universal donor construct is a double-stranded linear donor DNA.
17. The method of claim 5, wherein the linear donor DNA only has the universal target sequence at the 5′-end or the 3′-end.
18. The method of claim 6, wherein the gRNA that recognizes the specific target sequence in the cell genome is an sgRNA, and/or the gRNA that recognizes the universal target sequence in the linear donor DNA is an sgRNA.
19. The method of claim 18, wherein the sgRNA that recognizes the specific target site in the cell genome and the sgRNA that recognizes the universal target sequence in the linear donor DNA are located in the same vector; or the sgRNA that recognizes the specific target site in the cell genome and the sgRNA that recognizes the universal target sequence in the linear donor DNA are located in different vectors.
20. The method of claim 7, wherein the cells are screened by drug resistance, or the cells are screened by a FACS method.
21. The method of claim 7, wherein the protective sequence is 5-30 bp in length.
22. The method of claim 21, wherein the protective sequence is 20 bp in length.
23. The method of claim 7, wherein the universal target sequence comprises 5′-GTACGGGGCGATCATCCACA-3′ or 5′-AATCGACTCGAACTTCGTGT-3′.
24. The system or kit of claim 9, wherein the universal donor construct is a double-stranded linear donor DNA.
25. The system or kit of claim 9, wherein the linear donor DNA only has the universal target sequence at the 5′-end or the 3′ end.
26. The system or kit of claim 9, wherein the gRNA that recognizes the specific target site in the cell genome is one kind of gRNA, or more than one kind of gRNA that recognize different target sites in the cell genome.
27. The system or kit of claim 9, wherein the gRNA that recognizes the specific target sequence in the cell genome is an sgRNA, and/or the gRNA that recognizes the universal target sequence in the linear donor DNA is an sgRNA.
28. The system or kit of claim 27, wherein the sgRNA that recognizes the specific target sequence in the cell genome and the sgRNA that recognizes the universal target sequence in the linear donor DNA are located in the same vector; or the sgRNA that recognizes the specific target sequence in the cell genome and the sgRNA that recognizes the universal target sequence in the linear donor DNA are located in different vectors.
29. The system or kit of claim 10, wherein the protective sequence is 20 bp in length.
30. The system or kit of claim 10, wherein the universal target sequence comprises 5′-GTACGGGGCGATCATCCACA-3′ or 5′-AATCGACTCGAACTTCGTGT-3′.
31. The universal donor construct of claim 2, wherein the linear donor DNA has the universal target sequence at both of the 5′-end and the 3′-end.
32. The method of claim 5, wherein the linear donor DNA has the universal target sequences at both of the 5′-end and the 3′-end.
33. The system or kit of claim 9, wherein the linear donor DNA has the universal target sequences at both of the 5′-end and the 3′-end.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
DETAILED DESCRIPTION OF THE INVENTION
(16) The present invention provides a novel donor construct and a gene knockout method. The method uses a linear donor DNA to improve the efficiency of generating a gene knockout by a sequence-specific nuclease. The linear donor DNA of the present invention comprises at least one target site that can be cleaved by a sequence-specific nuclease. The target site comprised in the linear donor DNA is designed according to the target site in the cell genome, so that a sequence-specific nuclease that can cleave the target site in the cell genome can also cleave the target site comprised in the linear donor DNA. After a sequence-specific nuclease and a donor construct are introduced into a cell, double-strand breaks (DSBs) are generated at a specific target site in the cell by the sequence-specific nuclease, and at the same time the sequence-specific nuclease cleaves at least one target site contained in the linear donor DNA. This allows the linear donor DNA to be inserted into the cleaved target site in the cell genome by the non-homologous end joining (NHEJ) pathway with a higher efficiency. Subsequent selection of cells through a marker can effectively enrich cells in which a gene is knocked out by cleavage at the specific target site of the genome, thereby greatly improving the efficiency of generating the gene knockout by the sequence-specific nuclease.
(17) The target site comprised in the linear donor DNA is designed according to the target site in the cell genome, and the linear donor DNA obtained is a specific linear donor. When gene knockouts are required at different target sites in the cell genome, it is necessary to construct a matched linear donor DNA according to the sequence of the target site. Therefore, in order to further optimize the present invention, the inventors further provide a universal linear donor DNA in the present invention. The universal linear donor DNA comprises a universal target sequence that can be cleaved by a sequence-specific nuclease. The universal target sequence is absent in the cell genome to be subjected to a gene knockout, i.e., there is no sequence, which is identical to the universal target sequence and cleavable by the sequence-specific nuclease, in the cell genome to be subjected to a gene knockout. In this case, after a sequence-specific nuclease and a universal linear donor DNA are introduced into a cell, a sequence-specific nuclease generates double-strand breaks (DSBs) at a specific target site in the cell, and the universal target sequence contained in the universal linear donor DNA is also cleaved by the sequence-specific nuclease through a universal gRNA that recognizes the target sequence. At this time, the linear donor DNA can still be inserted into the cleaved target site in the cell genome by the non-homologous end joining (NHEJ) pathway with a higher efficiency. Subsequent selection of cells through a marker can effectively enrich cells in which a gene is knocked out by cleavage at the specific target site of the genome, and can also greatly improve the efficiency of generating a gene knockout by the sequence-specific nuclease. The target sequence in the universal linear donor DNA is not relevant to the gene to be knocked out, and it can be used as a universal donor for the knockout of different target genes in different cells, and can improve the efficiency of generating a gene knockout by the sequence-specific nuclease. A universal linear donor DNA is particularly useful in the case of gene knockout using the Cas9/CRISPR system which targets a target sequence using a gRNA (preferably an sgRNA). When gene knockout is performed, it is only necessary to construct a gRNA for a specific target site in the cell genome, without the need to specifically construct a matched linear donor DNA, i.e., a universal linear donor DNA and a gRNA targeting the universal linear donor DNA can be directly used, thereby reducing the operation complexity and improving the efficiency.
(18) It has been reported that if one of the homologous alleles is modified, the mutation frequency of the target alleles is usually higher [25, 26]. Thus, while not wishing to be bound by theory, the inventors speculate that if a donor can be inserted at a specific site in one of the target alleles, and clones that express a marker gene contained in the donor are selected, it may be possible to enrich rare events where all the alleles are modified.
(19) In the present invention, the “gene knockout” is to realize the loss of gene functions through genome editing. The gene knockout effect that is usually pursued is simultaneous knockout of two alleles, at which time the corresponding protein loses its functions and a gene knockout cell line is obtained. If only one allele is knocked out, the protein can also play its partial role, i.e., the protein functions are only down-regulated. Cells with both alleles knocked out can be enriched effectively by using the linear donor DNA and the method of the present invention.
(20) The donor construct of the present invention is a double-stranded DNA. The donor construct of the present invention may itself be a linear donor DNA. Alternatively, the donor construct of the present invention may be a circular DNA molecule comprising a linear donor DNA, and when introduced into a cell, it is cleaved in the cell to produce the linear donor DNA. A method for cleaving a circular donor construct in a cell to produce a linear donor DNA is well known in the art. For instance, the circular construct can further comprise cleavage sites for another sequence-specific nuclease upstream of the 5′-end and downstream of the 3′-end of the linear donor DNA.
(21) The method of the present invention may further comprise introducing to the cell another sequence-specific nuclease, which cleaves a sequence upstream of the 5′-end and downstream of the 3′-end of the linear donor DNA in the circular construct in the cell, thereby producing the linear donor DNA.
(22) In the linear donor DNA of the present invention, the “reverse termination codon” means the codon oriented in the opposite direction to the reading frame of the expression cassette. The “forward termination codon” means the codon oriented in the same direction as the reading frame of the expression cassette. The role of termination codons is that, regardless of whether the linear donor is inserted into the genome forward or backward, both the triplet termination codons can terminate endogenous and exogenous gene expression.
(23) The “protective sequence” in the linear donor DNA of the present invention can be any sequence, and preferably the protective sequence is different from the target sequence in the same linear donor DNA. The protective sequence can be 5-30 bp, preferably 20 bp, in length. The role of the protective sequence is to protect the target sequence in the linear donor DNA from being cleaved by an enzyme (e.g., an exonuclease) in the cell.
(24) The “marker gene” described herein refers to any marker gene whose expression can be selected or enriched, i.e., when the marker gene is expressed in a cell, cells expressing the marker gene can be selected and enriched in a certain manner. The marker gene useful in the present invention includes, but is not limited to, a fluorescent protein gene that can be sorted by FACS after expression, or a resistance gene that can be screened by an antibiotic, or a protein gene that can be recognized by a corresponding antibody and screened by immunostaining or magnetic beads adsorption after expression. The resistance gene useful in the present invention includes, but is not limited to, resistance genes against Blasticidin, Geneticin (G-418), Hygromycin B, Mycophenolic Acid, Puromycin, Zeocin or Neomycin. The fluorescent protein gene useful in the present invention includes, but is not limited to, genes of Cyan Fluorescent Protein, Green Fluorescent Protein, Yellow Fluorescent Protein, Orange Fluorescent Protein, Red Fluorescent Protein, Far-Red Fluorescent Protein, or Switchable Fluorescent Proteins.
(25) Examples of the sequence-specific nuclease comprises a zinc finger nuclease (ZFN). The zinc finger nuclease is a non-naturally occurring and artificially modified endonuclease, which is composed of a zinc finger protein domain and a non-specific endonuclease domain. The zinc finger protein domain comprises a series of Cys2-His2 zinc finger proteins in series. Each zinc finger protein recognizes and binds to a specific base triplet on the DNA strand in the 3′ to 5′ direction and a base in the 5′ to 3′ direction. Multiple zinc finger proteins can be connected in series to form a zinc finger protein group, which recognizes a stretch of specific base sequence with a strong specificity. The non-specific endonuclease linked to the zinc finger protein group is derived from the DNA cleavage domain consisting of 96 amino acid residues at the carboxyl terminus of Fold. Each Fokl monomer is linked to a zinc finger protein group to form a ZFN that recognizes a specific site. When two recognition sites are at an appropriate distance (6-8 bp), two monomeric ZFNs interact with each other to produce an enzymatic digestion function, so as to achieve the purpose of site-specific DNA cleavage. 8-10 zinc finger domains are designed for the target sequence. By linking these zinc finger domains to DNA nucleases, double-strand breaks (DSBs) of the target sequences can be produced, and the DSB repair mechanism can be thus induced to conduct directional modification of specific sites in the genome.
(26) Another example of the sequence-specific nuclease comprises a transcription activator-like effector nuclease (TALEN). The transcription activator-like effector nuclease is mainly composed of a Fok I endonuclease domain and a DNA binding domain of the TALE protein. The TALE protein contains multiple peptide segment repeats, each of which comprises 33-35 amino acids, and each peptide segment recognizes one base. Like ZFNs, TALENs can also cleave DNA target sequences to form DSBs, thereby activating DNA damage repair mechanisms and performing site-specific modification of the genome.
(27) Another example of the sequence-specific nuclease system useful in the present invention comprises the Cas9/CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) system. The Cas9/CRISPR system utilizes RNA-directed DNA binding for sequence-specific cleavage of a target DNA, in which a crRNA (CRISPR-derived RNA) binds to tracrRNA (trans-activating RNA) by base pairing to form a tracrRNA/crRNA complex, which directs the nuclease Cas9 protein to cleave the double-stranded DNA at a specific position in the target sequence that is paired with the crRNA. The target sequence paired with the crRNA is usually a sequence of about 20 nucleotides located upstream of the genomic PAM (protospacer adjacent motif) site (NNG).
(28) The Cas9 protein cleaves the target site by means of a guide RNA. The term “guide RNA” is also known as gRNA (guide RNA). A gRNA typically comprises a nucleotide on the crRNA complementary to the target sequence and an RNA scaffold formed by base pairing of the crRNA and the tracrRNA, and is capable of recognizing the target sequence paired with the crRNA. The gRNA can form a complex with the Cas9 protein and guide the Cas9 protein to the target sequence for cleaving the target site therein.
(29) The gRNA is commonly used in the form of an sgRNA (single guide RNA). The sgRNA, also known as a “single-stranded guide RNA”, is an RNA strand formed by fusing the crRNA with the trancrRNA.
(30) Another example of the sequence-specific nuclease system useful in the present invention comprises an NgAgo nuclease and its gDNA. An NgAgo nuclease can bind to a single-stranded guide DNA (gDNA) phosphorylated at the 5′-end to cleave the target sequence complementary to the gDNA, thus producing DNA double-strand breaks.
(31) The linear donor DNA of the present invention may have a target sequence only at one end, or may have target sequences at both ends, respectively. The target sequences at both ends of the linear donor DNA can be different. When a gene knockout is required to be produced by cleavage at two different target sites in the cell genome, two linear donor DNAs can be provided, and each linear donor DNA comprises a corresponding target sequence, respectively; or alternatively, a linear donor DNA can be provided, each end of which comprises a corresponding target sequence. When a gene knockout is required to be produced by cleavage at multiple different target sites in the cell genome, an appropriate number of linear donor DNAs can be provided, and one or both ends of each linear donor DNA comprises one of the multiple different corresponding target sequences, respectively. For instance, linear donor DNAs can be provided in the same number as the number of the target sites, and each linear donor DNA comprises a corresponding target sequence, respectively. Alternatively, linear donor DNAs may be provided in an number less than the number of the target sites, wherein both ends of all or part of the linear donor DNAs comprise one of the multiple different corresponding target sequences, respectively, and each of other linear donor DNAs comprises one of the other corresponding target sequences, respectively.
(32) For a universal linear donor DNA comprising a universal target sequence, the universal target sequence may be contained at either end or both ends. The target sequence of such universal linear donor DNA is independent of the target sites to be cleaved in the cell genome, and is thus universally applicable to the case of generating a gene knockout by the cleavage of any one target site, any two target sites, or any more target sites in the cell genome.
(33) The “universal target sequence” of the present invention refers to a sequence that can be cleaved by a sequence-specific nuclease. However, the universal target sequence is absent in the cell genome to be subjected to a gene knockout, in other words, there is no sequence, which is identical to the universal target sequence and cleavable by the sequence-specific nuclease, in the cell genome to be subjected to a gene knockout. The universal target sequence is different from the target sequence that is present in the cell genome and cleavable by the same sequence-specific nuclease. The linear donor DNA comprising the universal target site is not specific to any target site in the cell genome, and is thus universally applicable to the gene knockout of any gene in the cell, without the need to construct a specific linear donor DNA for the gene to be knocked out and the target site in the gene.
(34) The sequence-specific nuclease can be introduced into a cell in the form of a protein or its coding nucleic acid sequence (e.g., an mRNA or a cDNA). A nucleic acid encoding the sequence-specific nuclease can be introduced into a cell by inclusion in a plasmid or viral vector, e.g., introduced into a cell by transfection. A nucleic acid encoding the sequence-specific nuclease can also be delivered directly to a cell by electroporation, liposome, microinjection, or other means.
(35) The donor construct can be delivered by any method suitable for introducing a nucleic acid into a cell, e.g., introduced into a cell by transfection.
(36) In the cases of producing gene knockouts using the Cas9/CRISPR system and the NgAgo nuclease, an sgRNA or a gDNA should also be introduced into a cell. The sgRNA or gDNA can be delivered by any method suitable for introducing an RNA or a DNA into a cell. The sgRNA can be introduced into a cell in the form of an isolated RNA. The isolated sgRNA can be prepared by in vitro transcription using any in vitro transcription system known in the art. The sgRNA can also be introduced into a cell by a vector comprising an sgRNA coding sequence and a promoter. The vector may be a viral vector or a plasmid. The means for introduction into a cell can be transfection.
(37) Two or more sgRNAs for different respective target sites can be introduced into a cell to direct cleavage by Cas9 at two or more different target sites in the cell genome to produce gene knockouts. The two or more sgRNAs may be comprised in different vectors, or may be contained in the same vector, such as a vector comprising a pair of gRNAs (paired gRNAs), or a vector comprising more sgRNAs.
(38) In the method of the present invention, when two or more sgRNAs for different respective target sites are introduced into a cell, linear donor DNAs comprising target sequences recognized by these sgRNAs are simultaneously introduced. Since the linear donor DNA may comprise a target sequence only at the 5′-end or 3′-end, or may also comprise target sequences at both ends, respectively, the number of sgRNAs and the number of linear donor DNAs can be different, i.e., it is possible that one sgRNA corresponds to one linear donor DNA, or two sgRNAs correspond to two linear donor DNAs.
(39) When the Cas9/CRISPR system and the universal linear donor DNA of the present invention are used for a gene knockout, in addition to introducing the universal linear donor DNA and the Cas9 nuclease into a cell, an sgRNA for a specific target sequence in the cell genome and an sgRNA for a universal target sequence on the universal linear donor DNA are also introduced into the cell, so as to direct the Cas9 to cleave the specific target sequence in the cell genome and the universal target sequence on the universal linear donor DNA. An sgRNA for a specific target sequence in the cell genome and an sgRNA for a universal target sequence on the universal linear donor DNA may be conprised in different vectors, or may be comprised in the same vector.
(40) An sgRNA for a specific target sequence in the cell genome may be one sgRNA or more sgRNAs, such as two, three, or more gRNAs. The more than one sgRNA may target different specific target sites in the cell genome respectively, so as to achieve simultaneous cleavage on different target sites in the cell genome. When these different target sites are located in different genes respectively, the knockout of multiple genes, such as of two, three or more genes, can be achieved. Specifically, when the multi-gene knockout is performed, a plurality of sgRNAs for a plurality of respective specific target sites in the cell genome and an sgRNA for a universal target sequence on the universal linear donor DNA may be introduced into the cell, so as to direct the Cas9 to cleave the plurality of specific target sites in the cell genome and the universal target sequence on the universal linear donor DNA. The plurality of specific target sequences are located on different genes respectively, thereby achieving the multi-gene knockout. The plurality of sgRNAs for the plurality of respective specific target sites in the cell genome may be comprised in different vectors, or may be comprised in the same vector. Any one or more sgRNAs of the plurality of sgRNAs for the plurality of respective specific target sites in the cell genome, and an sgRNA for a universal target sequence on the universal linear donor DNA may be comprised in different vectors, or may be comprised in the same vector.
(41) According to the present invention, the universal target sequence on the universal donor construct DNA is preferably 5′-GTACGGGGCGATCATCCACA-3′ (SEQ ID NO:1) or 5′-AATCGACTCGAACTTCGTGT-3′ (SEQ ID NO:2).
(42) Preferably, in the present invention, in the case of producing a gene knockout using the Cas9/CRISPR system, a Cas9, an sgRNA and a linear donor DNA can be introduced into a cell simultaneously; or for instance, a Cas9 can be first introduced into a cell, and then an sgRNA and a linear donor DNA are introduced into the cell. In some embodiments, the cell is co-transfected with a Cas9-containing vector, an sgRNA-containing vector, and a linear donor DNA. In some other embodiments, a Cas9 and an sgRNA are assembled in vitro into a protein-RNA complex, which is used to co-transfect the cell with a linear donor DNA. In some other embodiments, a Cas9 and an sgRNA are stably expressed in the cell by lentivirus, and the cell is transfected with a linear donor DNA. In other embodiments, a Cas9 is first stably expressed in the cell, and the cell is then co-transfected with an sgRNA-containing vector and a linear donor DNA.
(43) In the system or kit provided by the present invention for a gene knockout, a sequence-specific nuclease may be in the form of a protein or its coding nucleic acid sequence (e.g., an mRNA or a cDNA), such as in the form of a plasmid or viral vector comprising a nucleic acid encoding the sequence-specific nuclease. In the case of using the Cas9/CRISPR system, an sgRNA may be in the form of an isolated RNA, or in the form of a vector comprising an sgRNA coding sequence and a promoter, such as a viral vector or a plasmid vector.
(44) The cell described herein can be any eukaryotic cell, such as an isolated animal cell, e.g., a totipotent cell, a pluripotent cell, an adult stem cell, a fertilized egg, a somatic cell, or the like. In some embodiments, the cell is a vertebrate cell. In some embodiments, the cell is a mammalian cell. In some embodiments, the cell is a human cell. In some embodiments, the cell is a cell from a cow, a goat, a sheep, a cat, a dog, a horse, a rodent, fish, and a primate. In some embodiments, the rodent comprises mice, rats, and rabbits.
(45) The method of the present invention can be used to perform targeted gene knockout in a single gene or multiple genes in a cell, such as two, three, four, five or more targeted gene knockouts. Targeted gene knockout for multiple genes can be performed simultaneously or successively. For instance, sequence-specific nucleases or sequence-specific nuclease systems for two or more target genes can be introduced into a cell and then cells are subjected to enrichment screening. Alternatively, a sequence-specific nuclease(s) or a sequence-specific nuclease system(s) for one or more target genes can be first introduced into a cell and cells are subjected to enrichment screening, and then a sequence-specific nuclease(s) or a sequence-specific nuclease system(s) for other target genes can be introduced into the cell and the cells are subjected to enrichment screening. Different marker/marker genes can be used for different target genes. For instance, in the case of producing a gene knockout using the Cas9/CRISPR system, two or more sgRNAs for different respective target sites can be introduced into a cell, and a linear donor DNA(s) comprising target sequences recognized by these sgRNAs is(are) simultaneously introduced, as previously mentioned. When these different target sites are located in different genes, knockout of multiple genes can be achieved. Target sequences in an sgRNA and a linear donor DNA can also be designed by using the consensus sequence of two or more genes in the cell genome. At this point, an sgRNA recognizing a single specific target site in the cell genome can be introduced into a cell, and a linear donor DNA comprising a target sequence recognized by the sgRNA is simultaneously introduced, wherein the target sequence recognized by the sgRNA is the consensus sequence of two or more genes in the cell genome, in the condition that the consensus sequence has no more than one base difference from the sequences in any of the two or more genes at positions corresponding to the consensus sequence. A two-base difference may disrupt recognition by an sgRNA, as demonstrated in Example 7.
(46) The target gene edited with the linear donor DNA of the present invention targets is not particularly limited, as long as double-strand breaks can be produced on it by the Cas9/CRISPR system. The target gene may be an exon, an intron or a regulatory sequence, or any combination thereof.
(47) The term “comprise” or “contain”, as used in the present invention, indicates “include, but are not limited to”, “consist essentially of” or “consist of”.
(48) The present invention is further illustrated in conjunction with the following examples and the accompanying drawings, which are used for illustration purposes only and are not intended to limit the scope of the present invention. If not specially stated, the examples are all conducted in accordance with normal experimental conditions, such as those described in Sambrook J & Russell DW, Molecular cloning: a laboratory manual, 2001, or in accordance with the instructions provided by the manufacturers.
Example 1. Enrichment of ANTXR1 Gene Knockout Events in HeLa Cells Using Linear Donor DNAs 1. Design of sgRNAs
(49) Two sgRNAs targeting the first exon of the ANTXR1 gene in HeLa cells were designed, and their efficiency in producing deletions or insertion mutations (Indels) at target sites was verified by T7E1 assay. The verification results are shown in Table 1. The target sequence that sgRNA1.sub.ANTXR1 targets is referred to as sg1 in this example, and the target sequence that sgRNA2.sub.ANTXR1 targets is referred to as sg2 in this example.
(50) TABLE-US-00001 TABLE 1 The sgRNAs targeting the first exon of the ANTXR1 gene in HeLa cells Mean Indels ± SEQ s.d. sgRNA Target sequence (PAM) (5′ to 3′) ID NO: (%, n = 3) sgRNA1.sub.ANTXR1 AGCGGAGAGCCCTCGGCAT(CGG) 3 19.97 ± 1.84 sgRNA2.sub.ANTXR1 TGCTCATCTGCGCCGGGCAA(GGG) 4 16.83 ± 2.05
(51) 2. Construction of Linear Donor DNAs
(52) A total of two linear donor DNAs (DonorANTXR1-sg2 and DonorANTXR1-pg) were constructed, the structures of which are shown in
(53) Donor.sub.ANTXR1-sg2 comprises, from 5′-end to 3′- end: a 20-bp protective sequence, sg2, a reverse termination codon, CMV promoter-driven puromycin-resistant gene, a forward termination codon, and a 20-bp protective sequence, respectively.
(54) Donor.sub.ANTXR1-pg comprises, from 5′-end to 3′-end: a 20-bp protective sequence, sg 1, a reverse termination codon, CMV promoter-driven puromycin-resistant gene, a forward termination codon, sg2, and a 20-bp protective sequence, respectively.
(55) A linear donor DNA (Donor.sub.no cut) as a control comprises, from 5′-end to 3′-end: a 20-bp protective sequence, a 20-bp random sequence, a reverse termination codon, CMV promoter-driven puromycin-resistant gene, a forward termination codon, and a 20-bp protective sequence, respectively. The random sequence is different from sg1 or sg2.
(56) 3. Transfection
(57) HeLa cells were co-transfected with a Cas9-expressing plasmid, sgRNA2.sub.ANTXR1 or pgRNA.sub.ANTXR1, and the corresponding donors. As a control, HeLa cells were transfected with linear donor DNAs (Donor.sub.ANTXR1-sg2, Donor.sub.ANTXR1-pg, and Donor.sub.no cut) alone. To the cells puromycin was added for resistance screening. A pooled population and single clones were obtained and stained with MTT (3-(4,5-dimethylthiazol-2-yl)-2,5-diphenyltertazolium bromide). The results are shown in
(58) A number of puromycin-resistant (puro+) cell clones were obtained from samples receiving sgRNA2.sub.ANTXR1 and its corresponding donor Donor.sub.ANTXR1-sg2, and from samples receiving pgRNA.sub.ANTXR1 and its corresponding donor Donor.sub.ANTXR1-pg. Only a few puromycin-resistant clones were produced by transfection with donors alone, probably because integration of linear donors into the chromosome was rare and random. In addition, co-transfection using the control donor Donor.sub.no cut with the Cas9-expressing plasmid and sgRNA2.sub.ANTXR1 also failed to produce a significant number of puro+clones (see the rightmost panel in
(59) 4. Verification of Gene Knockout Efficiency
(60) In addition, HeLa cells were co-transfected using a Cas9-expressing plasmid and sgRNA2.sub.ANTXR1 or pgRNA.sub.ANTXR1, with or without the corresponding donors. A pooled population and single clones were obtained by screening with puromycin (1 μg/ml). The plasmid expressing the puromycin-resistant gene is used for co-transfection instead, when the corresponding donors are not added. As the ANTXR1 gene knockout in HeLa cells results in resistance of the cells to chimeric anthrax toxin (PA/LFnDTA) [17], the pooled population and single clones obtained by puromycin screening were treated with PA/LFnDTA (PA: 150 ng/ml; and LFnDTA: 100 ng/ml) to compare the effect of linear donor DNAs on the ANTXR1 knockout efficiency. Images of different cells after being treated with PA/LFnDTA are shown in
(61) TABLE-US-00002 TABLE 2 Primers for amplifying the linear donor- integrated ANTXR1 site in HeLa cells Primer pair Sequence L1/R1 5′-AAGCGGAGGACAGGATTGGG-3′ (SEQ ID NO: 5) /5′-CCTCTGTGGCCCTGGAGATG-3′ (SEQ ID NO: 6)
Example 2. Enrichment of HBEGF Gene Knockout Events in HeLa Cells Using a Linear Donor DNA
(62) Since a donor with a single- or double-cleavage site is capable of greatly improving the selection of cells with a modification at a target site, for convenience, in this example, only a single-cleavage donor was adopted.
(63) 1. Design of sgRNAs
(64) Two sgRNAs targeting the HBEGF gene in HeLa cells were designed, and their efficiency in producing Indels at target sites was verified by T7E1 assay. The verification results are shown in Table 3. The target sequence that sgRNA1.sub.HBEGY targets is referred to as sg 1 in this example, and the target sequence that sgRNA2.sub.HBEGF targets is referred to as sg2 in this example.
(65) TABLE-US-00003 TABLE 3 The sgRNAs Targeting the HBEGF Gene in HeLa Cells Mean Indels ± s.d. sgRNA Target sequence (PAM) (5′ to 3′) SEQ ID NO: (%, n = 3) sgRNA1.sub.HBEGF GACTGGCGAGAGCCTGGAG(CGG) 7 32.03 ± 7.13 sgRNA2.sub.HBEGF CGGACCAGCTGCTACCCCT(AGG) 8 14.27 ± 0.90
(66) 2. Construction of Linear Donor DNAs
(67) A linear donor DNA (Donor.sub.HBEGF-sg1) was constructed, the structure of which is shown in
(68) Donor.sub.HBEGY-sg1 comprises, from 5′-end to 3′- end: a 20-bp protective sequence, sg 1, a reverse termination codon, CMV promoter-driven puromycin-resistant gene, a forward termination codon, and a 20-bp protective sequence, respectively.
(69) 3. Transfection
(70) HeLa cells were co-transfected with a Cas9-expressing plasmid, sgRNA1.sub.HBEGF, and its corresponding donor Donor.sub.HBEGF-sg1. As a control, HeLa cells were transfected with donor Donor.sub.HBEGY-sg1 alone. Puromycin was added to the cells for resistance screening. A pooled population and single clones were obtained and stained with MTT. The results are shown in
(71) Similar to the results of Example 1, only the donor plus sgRNA obtained a large number of puro+clones. This result again demonstrates that the donor insertion depends on specific sgRNA/Cas9-mediated DSBs.
(72) 4. Verification of Gene Knockout Efficiency
(73) In addition, HeLa cells were co-transfected using a Cas9-expressing plasmid and sgRNA1.sub.HBEGF, with or without its corresponding donor Donor.sub.HBEGF-sg1. A pooled population and single clones were obtained by screening with puromycin (1 μg/ml). The plasmid expressing the puromycin-resistant gene is used for co-transfection insdead, when the corresponding donor is not added. As the HBEGF gene encodes a diphtheria toxin (DT) receptor, knocking this gene out in HeLa cells would result in resistance of the cells to DT [17], the pooled population and single clones obtained by puromycin screening were treated with DT (40 ng/ml) to compare the effect of the linear donor DNA on the HBEGF knockout efficiency. Images of different cells after being treated with DT are shown in
Example 3. Enrichment of HBEGF Gene Knockout Events in HEK293T Cells Using a Linear Donor DNA
(74) 1. Design of an sgRNA and construction of a linear donor DNA
(75) An sgRNA2.sub.HBEGF targeting the HBEGF gene in HEK293T cells was designed and a linear donor DNA (Donor.sub.HBEGF-sg2) was constructed. The donor comprises, from 5′-end to 3′-end: a 20-bp protective sequence, sg2, a reverse termination codon, CMV promoter-driven EGFP gene, a forward termination codon, and a 20-bp protective sequence, respectively, as shown in
(76) 2. Verification of Gene Knockout Efficiency
(77) HEK293T cells were co-transfected using a Cas9-expressing plasmid and sgRNA2.sub.HBEGF, with or without its corresponding donor Donor.sub.HBEGF-sg2. Cells were screened by FACS. The group added the donor was screened for EGFP-positive cells by FACS, while the group didn't add the donor was screened for mCherry-positive cells by FACS. FACS-selected cells were treated with DT (40 ng/ml) to compare the effects of the linear donor DNAs on the HBEGF knockout efficiency. Images of different cells after being treated with DT are shown in
Example 4. Enrichment of ANTXR1 Gene Knockout Events in HeLa.SUB.oc .Cells Using Linear Donor DNAs
(78) 1. Establishment of HeLa.sub.oc cell line
(79) The HeLa.sub.oc cell line stably expressing Cas9 was established according to the existing method [17].
(80) 2. Design of sgRNAs and construction of linear donor DNAs
(81) Two sgRNAs (sgRNA1.sub.ANTXR1 and sgRNA2.sub.ANTXR1) targeting the ANTXR1 gene in HeLa.sub.oc cells were designed, and three linear donor DNAs (Donor.sub.ANTXR1-sg1, Donor.sub.ANTXR1-sg2 and Donor.sub.ANTXR1-pg) were constructed, as shown in
(82) Donor.sub.ANTXR1-sg1 comprises, from 5′-end to 3′- end: a 20-bp protective sequence, sg 1, a reverse termination codon, CMV promoter-driven puromycin-resistant gene, a forward termination codon, and a 20-bp protective sequence, respectively.
(83) Donor.sub.ANTXR1-sg2 comprises, from 5′-end to 3′-end: a 20-bp protective sequence, sg2, a reverse termination codon, CMV promoter-driven puromycin-resistant gene, a forward termination codon, and a 20-bp protective sequence, respectively.
(84) Donor.sub.ANTXR1-pg comprises, from 5′-end to 3′-end: a 20-bp protective sequence, sg 1, a reverse termination codon, CMV promoter-driven puromycin-resistant gene, a forward termination codon, sg2, and a 20-bp protective sequence, respectively.
(85) 3. Transfection
(86) HeLa.sub.oc cells were co-transfected with a Cas9-expressing plasmid, sgRNA1.sub.ANTXR1 or sgRNA2.sub.ANTXR1 or pgRNA.sub.ANTXR1, and the corresponding donors. As a control, HeLa.sub.oc cells were transfected with linear donor DNAs (Donor.sub.ANTXR1-SG1, Donor.sub.ANTXR1-sg2, and Donor.sub.ANTXR1-pg) alone. To the cells puromycin was added for resistance screening, and the cells were stained with MTT. The results are shown in
(87) Similar to the results in HeLa cells, only the donors plus sgRNAs obtained a large number of puro+clones.
(88) 4. Verification of Gene Knockout Efficiency
(89) In addition, HeLa.sub.oc cells were co-transfected using a Cas9-expressing plasmid and sgRNA1.sub.ANTXR1 or sgRNA2.sub.ANTXR1 or pgRNA.sub.ANTXR1, with or without the corresponding donors, and screened with puromycin (1 μg/ml). The cells obtained by screening were treated with PA/LFnDTA. Images of different cells treated with PA/LFnDTA are shown in
(90) For puro+single clones, PCR verification was performed on the integration site of donor Donor.sub.ANTXR1-sg1 in the ANTXR1 gene. It was found that most of the puro+clones contain donor inserts at the sgRNA-targeting sites (
(91) A PCR fragment of about 500 bp (the length corresponding to the wild type ANTXR1 gene) and a PCR fragment of about 1.8 kb (the length corresponding to the wild type ANTXR1 gene plus a donor insert) were subjected to genome sequencing. The results are shown in Table 4.
(92) TABLE-US-00004 TABLE 4 Genomic sequencing results of a PCR fragment of about 500 bp and a PCR fragment of about 1.8 kb Target site: ANTXR1 (Chr 2, Hela.sub.oc) Sequencing results of the Sequencing results of the PCR band PCR band (about 500 bp) of the (about 1.8 kb) of the same size as the same size as the wild type wild type after the donor insertion Category Wild type | Mutant type | Mutation rate Wild type | Mutant type | Mutation rate sgRNA.sub.1 15 | 0 | 0% — | — | — sgRNA.sub.1/Donor.sub.1 1 | 14 | 93% 15 | 15 | 100% sgRNA2 13 | 2 | 15% — | — | — sgRNA.sub.2/Donor.sub.2 5 | 10 | 67% 15 | 15 | 100% pgRNA.sub.1-2 10 | 5 | 33% — | — | — pgRNA.sub.1-2/Donor.sub.1-2 1 | 14 | 93% 15 | 15 | 100%
(93) As can be seen from the PCR verification results (
(94) 5. Effect of a Donor on the Off-Target Effect of the CRISPR/Cas System
(95) To examine whether the use of an external donor would affect the off-target effect of the CRISPR/Cas system, a whole genome integration site was found by splinkerette PCR analysis [35-37].
(96) In this example, the off-target insertions in single clones and pooled clones were verified by splinkerette PCR analysis after puromycin selection. If a correct donor insertion in the ANTXR1 gene is present, amplification with primers Splink2/R1 and Splink2/R2 will result in 711- and 927-bp products, respectively (see
(97) For splinkerette PCR analysis, we randomly selected 10 single clones with donor insertions and 3 puro+pooled clones targeting ANTXR1 in HeLa.sub.oc cells. Based on the splinkerette PCR results (
Example 5. Enrichment of HBEGF Gene Knockout Events in HeLa.SUB.oc .Cells Using Linear Donor DNAs
(98) 1. Design of sgRNAs and construction of linear donor DNAs
(99) Two sgRNAs (sgRNA1.sub.HBEGF and sgRNA2.sub.HBEGF) targeting the HBEGF gene in HeLa.sub.oc cells were designed, and three linear donor DNAs (Donor.sub.HBEGF-sg1, Donor.sub.HBEGF-sg2 and Donor.sub.HBEGF-pg) were constructed, as shown in
(100) Donor.sub.HBEGF-sg1 comprises, from 5′-end to 3′- end: a 20-bp protective sequence, sg 1, a reverse termination codon, CMV promoter-driven puromycin-resistant gene, a forward termination codon, and a 20-bp protective sequence, respectively.
(101) Donor.sub.HBEGF-sg2 comprises, from 5′-end to 3′-end: a 20-bp protective sequence, sg2, a reverse termination codon, CMV promoter-driven puromycin-resistant gene, a forward termination codon, and a 20-bp protective sequence, respectively.
(102) Donor.sub.HBEGF-pg comprises, from 5′-end to 3′-end: a 20-bp protective sequence, sg 1, a reverse termination codon, CMV promoter-driven puromycin-resistant gene, a forward termination codon, sg2, and a 20-bp protective sequence, respectively.
(103) 3. Transfection
(104) HeLa.sub.oc cells were co-transfected with a Cas9-expressing plasmid, sgRNA1.sub.HBEGF or sgRNA2.sub.HBEGF or pgRNA.sub.HBEGF, and the corresponding donors. As a control, HeLa.sub.oc cells were transfected with linear donor DNAs (Donor.sub.HBEGF-sg1, Donor.sub.HBEGF-sg2 and Donor.sub.HBEGF-pg) alone. Puromycin was added to the cells for resistance screening. The results are shown in
(105) Similar to the results in HeLa cells, only the donors plus sgRNAs obtained a large number of puro+clones.
(106) 4. Verification of Gene Knockout Efficiency
(107) For puro+single clones, PCR verification was performed on the integration site of donor Donor.sub.HBEGF-sg1 in the HBEGF gene. The L2/R2 primer sequences used in PCR amplification are shown in Table 5. It was found that most of the puro+clones contain donor inserts at the sgRNA-targeting sites (
(108) TABLE-US-00005 TABLE 5 Primers for amplifying the linear donor- integrated HBEGF locus in HeLa.sub.OC cells Primer pair Sequence L2/R2 5′-GCCGCTTCGAAAGTGACTGG-3′ (SEQ ID NO: 9) /5′-GATCCCCCAGTGCCCATCAG-3′ (SEQ ID NO: 10)
Example 6. Double Gene Knockout in HeLa.SUB.oc .Cells Using Linear Donor DNAs
(109) 1. Design of sgRNAs
(110) Two target genes, PSEN1 and PSEN2, in HeLa.sub.oc cells were selected. Two sgRNAs targeting these two target genes respectively were designed, and their efficiency in producing indels at target sites was verified by T7E1 assay. The results are shown in Table 6. The target sequence that sgRNA.sub.PSEN1 targets is referred to as sg.sub.PSEN1 in this example, and the target sequence that sgRNA.sub.PSEN2 targets is referred to as sg.sub.PSEN2 in this example.
(111) TABLE-US-00006 TABLE 6 The sgRNAs targeting PSEN1 and PSEN2 genes in HeLa.sub.OC cells Mean Indels ± s.d. sgRNA Target sequence (5′ to 3′) SEQ ID NO: (%, n = 3) sgRNA.sub.PSEN1 CCAGAATGCACAGATGTCTG(AGG) 11 13.03 ± 3.04 sgRNA.sub.PSEN2 TTCATGGCCTCTGACAGCG(AGG) 12 13.67 ± 0.55
(112) 2. Construction of Linear Donor DNAs
(113) Two types of donors were constructed. One type had two separate donors (Donor.sub.PSEN1+Donor.sub.PSEN2), and each donor had a corresponding sgRNA target sequence; and the other type of donor (Donor.sub.PSEN) had two sgRNA-targeting sequences at both ends, respectively, as shown in
(114) Donor.sub.PSEN1 comprises, from 5′-end to 3′-end: a 20-bp protective sequence, sg.sub.PSEN1, a reverse termination codon, CMV promoter-driven puromycin-resistant gene, a forward termination codon, and a 20-bp protective sequence, respectively.
(115) Donor.sub.PSEN2 comprises, from 5′-end to 3′-end: a 20-bp protective sequence, sg.sub.PSEN2, a reverse termination codon, CMV promoter-driven puromycin-resistant gene, a forward termination codon, and a 20-bp protective sequence, respectively.
(116) Donor.sub.PSEN comprises, from 5′-end to 3′-end: a 20-bp protective sequence, sg.sub.PSEN1, a reverse termination codon, CMV promoter-driven puromycin-resistant gene, a forward termination codon, Sg.sub.PSEN2, and a 20-bp protective sequence, respectively.
(117) 3. Verification of transfection and gene knockout efficiency
(118) HeLa.sub.oc cells were co-transfected with a Cas9-expressing plasmid, sgRNA.sub.PSEN1 or sgRNA.sub.PSEN2; or HeLa.sub.oc cells were co-transfected with a Cas9-expressing plasmid and pgRNA.sub.PSEN to produce indels at specific sites of the PSEN1 and PSEN2 genes. T7E1 analysis was performed on the indels production efficiency [26] (see Table 7 for the primers used). The results are shown in
(119) HeLa.sub.oc cells were co-transfected with a Cas9-expressing plasmid, pgRNA.sub.PSEN, and Donor.sub.PSEN; or HeLa.sub.oc cells were co-transfected with a Cas9-expressing plasmid, pgRNA.sub.PSEN, and Donor.sub.PSEN1+Donor.sub.PSEN2. To the cells puromycin was added for resistance screening to obtain puro+clones (see
(120) For each transfection result, puro+single clones were subjected to PCR verification of the integration sites of donors Donor.sub.PSEN and Donor.sub.PSEN1+Donor.sub.PSEN2 in the PSEN1 and PSEN2 genes. The primers used in the PCR amplification are shown in Table 7, wherein L3/R3 were used to amplify the integration site in PSEN1, and L4/R4 were used to amplify the integration site in PSEN2. The PCR verification results are shown in
(121) TABLE-US-00007 TABLE 7 Primers for amplifying the linear donor-integrated PSEN1 and PSEN2 sites in HeLa.sub.OC cells Primer pair Sequence L3/R3 5′-TGGTGTCTCAGGCGGTTCTA-3′ (SEQ ID NO: 13) /5′-TGAACTATGAGGCGCTGCAC-3′ (SEQ ID NO: 14) L4/R4 5′-TGACTTTCGTGGCTATGCGT-3′ (SEQ ID NO: 15) /5′-CTAGCACCCAGGCATCCAAA-3′ (SEQ ID NO: 16)
Example 7. Multi-Gene Knockout in HeLa.SUB.oc .Cells Using a Linear Donor DNA
(122) 1. Selection of Target Genes and Design of an sgRNA
(123) The HSPA gene family in HeLa.sub.oc cells was selected, which includes five homologous genes, HSAPA1A, HSPA1B, HSBA1L, HSPA6 and HSPA2. The sgRNA.sub.HSPA simultaneously targeting HSAPA1A,HSPA1B and HSBA1L was designed. The target sequence that the sgRNA targets has a mismatch with the corresponding sequence in HSPA6 and two mismatches with HSPA2. These are shown in
(124) 2. Construction of Linear Donor DNAs
(125) A linear donor Donor.sub.HSPA was constructed, comprising, from 5′-end to 3′-end: a 20-bp protective sequence, sg.sub.HSPA, a reverse termination codon, CMV promoter-driven puromycin-resistant gene, a forward termination codon, and a 20-bp protective sequence, respectively (
(126) 3. Verification of Transfection and Gene Knockout Efficiency
(127) HeLa.sub.oc cells were co-transfected using a Cas9-expressing plasmid and sgRNA.sub.HSPA, with or without its corresponding donor Donor.sub.HSPA, indels were triggered, and resistance screening was performed by puromycin. The group without the donor added was co-transfected with a plasmid expressing a puromycin-resistant gene instead. The indels efficiency for all five genes was evaluated by T7E1 assay (see Table 8 for the primers used). The results are shown in
(128) The target regions of HSPA family genes in pooled population co-transfected with and without the donor were sequenced. The results are shown in
(129) Notably, the T7E1 assay demonstrates that the selected pooled clones are highly rich in cells carrying target mutations, and the enrichment factor is approximately 753 (5.5*6.1*3.4*6.6) compared with traditional methods without using a donor. Considering that this calculation does not consider genes with donor insertions, the actual efficiency is even higher.
(130) For puromycin-resistant single clones, PCR verification was performed at five target sites. Specific primers (L5/R5, L6/R6, L7/R7, L8/R8, and L9/R9) used to amplify target sites of all five genes are listed in Table 8. The results are shown in
(131) TABLE-US-00008 TABLE 8 Primers for amplifying five gene target sites of the HSPA family in HeLa.sub.OC cells Primer pair Sequence L5/R5 5′-GAGAGTGACTCCCGTTGTCC-3′ (SEQ ID NO: 17) /5′-ACATTGCAAACACAGGAAATTGAG-3′ (SEQ ID NO: 18) L6/R6 5′-GTGTTGAGTTTCCGGCGTTC-3′ (SEQ ID NO: 19) /5′-TCGCTTGTTCTGGCTGATGT-3′ (SEQ ID NO: 20) L7/R7 5′-GCACTCTCCCAAAACAGTATCTTA-3′ (SEQ ID NO: 21) /5′-GTGCCTCCACCCAGATCAAA-3′ (SEQ ID NO: 22) L8/R8 5′-GGGTGAGGCGCAAAAGGATA-3′ (SEQ ID NO: 23) /5′-ACACCAGCGTCAATGGAGAG-3′ (SEQ ID NO: 24)
Example 8. Enrichment of CSPG4 Gene Knockout Events in SC-8 Cells Using Linear Donor DNAs Containing Universal sgRNAs
(132) 1. Screening of universal sgRNAs
(133) The following 10 sgRNAs were selected as candidate sequences for screening, as shown in the following table:
(134) TABLE-US-00009 Predicted gene sgRNA Target Sequence (PAM) (5′ to 3′) knockout efficiency sgRNA.sub.Universal_1 GTACGGGGCGATCATCCACACGG 0.982784325 (SEQ ID NO: 25) sgRNA.sub.Universal_2 GCAAAAGTGGCATAAAACCGCGG 0.971302462 (SEQ ID NO: 26) sgRNA.sub.Universal_3 TATCGCTTCCGATTAGTCCGCGG 0.96832667 (SEQ ID NO: 27) sgRNA.sub.Universal_4 CTATCTCGAGTGGTAATGCGCGG 0.966411034 (SEQ ID NO: 28) sgRNA.sub.Universal_5 GTAGCTGCTGTAAATCGCATCGG 0.963330804 (SEQ ID NO: 29) sgRNA.sub.Universal_6 TATACCAGACCACAGCGCCGCGG 0.963267571 (SEQ ID NO: 30) sgRNA.sub.Universal_7 GCACGAGGTGAACAGCCGCTCGG 0.960224565 (SEQ ID NO: 31) sgRNA.sub.Universal_8 ATGATATCTGACATGCAGCGCGG 0.95578653 (SEQ ID NO: 32) sgRNA.sub.Universal_9 AATCGACTCGAACTTCGTGTCGG 0.950640031 (SEQ ID NO: 33) sgRNA.sub.Universal_10 CGAATCGGAACTTTGTACCGCGG 0.948431616 (SEQ ID NO: 34)
(135) 2. Construction of Linear Donor DNAs
(136) 10 linear donor DNAs (Donor.sub.sSGRMA_Imoversa;_1˜10 puro) were constructed based on the above-mentioned 10 universal sgRNAs, respectively.
(137) These linear donor DNAs comprise, from 5′-end to 3′-end: a 20-bp protective sequence, a target sequence that sgRNA.sub.Universal_1˜10 targets, a reverse termination codon, CMV promoter-driven puromycin-resistant gene, a forward termination codon, and a 20-bp protective sequence, respectively.
(138) 3. Construction of Tandem sgRNA Plasmids
(139) 10 tandem sgRNA plasmids (Plasmid.sub.pgRNA_Universal_1˜10) were constructed based on the above-mentioned 10 linear donors, respectively, the structures of which are shown in
(140) Two tandem sgRNAs were sgRNA.sub.CSPG4 and sgRNA.sub.Universal_1˜10, respectively. Among these sgRNAs, sgRNA.sub.CSPG4 targets CSPG4, the receptor of TcdB toxin, while sgRNA.sub.Universal_1˜10 targets the target sequence in the corresponding donor DNA
(141) (Donor.sub.sgRNA_Universal_1˜10puro)
(142) 4. Transfection
(143) The cell line used in the transfection experiment was a cell line stably expressing Cas9 (SC-8). SC-8 cells were co-transfected with ten tandem plasmids Plasmid.sub.pgRNA_Universal_1˜10 and the corresponding donor DNAs (Donors.sub.sgRNA_Universal_1˜10puro). As a control, SC-8 cells were co-transfected with ten linear donor DNAs (Donors.sub.sgRNA_Universal_1˜10puro) alone. Puromycin was added to the cells for resistance screening. A pooled population was obtained. The screening results are shown in the following table.
(144) TABLE-US-00010 Viable colonies Experimental condition (resistant to puromycin) Dnonr.sub.sgRNA.sub.
(145) According to the above-mentioned results, four sgRNAs, i.e., sgRNA.sub.Universal_1, sgRNA.sub.Universal_3, sgRNA.sub.Universal_6 and sgRNA.sub.Universal_9, had better effects. Therefore, in subsequent experiments, pooled clones corresponding to these four sgRNAs were used as the experimental objects.
(146) 4. Verification of Gene Knockout Efficiency
(147) TcdB toxin was added to the four pooled clones for screening, and the cell survival was observed after 23 hours. The experimental results are shown in
(148) The materials and methods used in Examples 1-7 above were as follows:
(149) Cell Culture and Transfection
(150) HeLa, HeLa.sub.oc and HEK293T cells were maintained in a Dulbecco's modified Eagle's medium (DMEM, 10-013-CV, Corning, Tewksbury, MA, USA) supplemented with 10% fetal bovine serum (FBS, Lanzhou Bailing Biotechnology Co., Ltd., Lanzhou, China) at a temperature of 37° C., and supplied with 5% CO2. For transfection, all cells were seeded on a 6-well plate and transfected with X-tremeGENE HP (06366546001, Roche, Mannheim, Germany) according to the supplier's instructions. Briefly, 2 μg of DNA and 4 μl of X-tremeGENE HP were added to 200 μl of Opti-MEM I Reduced Serum Medium (31985088, Thermo Fisher Scientific, Grand Island, N.Y., USA). The mixture was incubated for 15 minutes at room temperature and then added to the cells.
(151) Cloning of a Plasmid Expressing a gRNA
(152) For plasmids expressing sgRNAs, the oligonucleotide of each sgRNA coding sequence was designed separately (see Table 9) and synthesized (Beijing Ruibo Xingke Biotechnology Co., Ltd.).
(153) TABLE-US-00011 TABLE 9 Primers for construction of an sgRNA or a pgRNA sgRNA Forward primer Reverse primer sgRNA1.sub.ANTXR1 5′-ACCGAGCGGAGAGCCCTCGGCAT-3′ 5′-AAACATGCCGAGGGCTCTCCGCT-3′ (SEQ ID NO: 35) (SEQ ID NO: 36) sgRNA2.sub.ANTXR1 5′-ACCGTGCTCATCTGCGCCGGGCAA-3′ 5′-AAACTTGCCCGGCGCAGATGAGCA-3′ (SEQ ID NO: 37) (SEQ ID NO: 38) pgRNA.sub.ANTXR1 5′-TATACGTCTCAACCGAGCGGAGAGCCCTC 5′-TATACGTCTCAAAACTTGCCCGGCGCAG GGCATGTTTAAGAGCTATGCTGGAAACAG-3′ ATGAGCACGGTGTTTCGTCCTTTCCACA-3′ (SEQ ID NO: 39) (SEQ ID NO: 40) sgRNA1.sub.HBEGF 5′-ACCGGACTGGCGAGAGCCTGGAG-3′ 5′-AAACCTCCAGGCTCTCGCCAGTC-3′ (SEQ ID NO: 41) (SEQ ID NO: 42) sgRNA2.sub.HBEGF 5′-ACCGCGGACCAGCTGCTACCCCT-3′ 5′-AAACAGGGGTAGCAGCTGGTCCG-3′ (SEQ ID NO: 43) (SEQ ID NO: 44) pgRNA.sub.HBEGF 5′-TATACGTCTCAACCGGACTGGCGAGAGCC 5′-TATACGTCTCAAAACAGGGGTAGCAGCT TGGAGGTTTAAGAGCTATGCTGGAAACAG-3′ GGTCCGCGGTGTTTCGTCCTTTCCACA-3′ (SEQ ID NO: 45) (SEQ ID NO: 46) sgRNA.sub.PSEN1 5′-ACCGCCAGAATGCACAGATGTCTG-3′ 5′-AAACCAGACATCTGTGCATTCTGG-3′ (SEQ ID NO: 47) (SEQ ID NO: 48) sgRNA.sub.PSEN2 5′-ACCGTTCATGGCCTCTGACAGCG-3′ 5′-AAACCGCTGTCAGAGGCCATGAA-3′ (SEQ ID NO: 49) (SEQ ID NO: 50) pgRNA.sub.PSEN 5′-TATACGTCTCaACCGCCAGAATGCACAGA 5′-TATACGTCTCaAAACCGCTGTCAGAGGC TGTCTGGTTTAAGAGCTATGCTGGAAACA-3′ CATGAACGGTGTTTCGTCCTTTCCACA-3′ (SEQ ID NO: 51) (SEQ ID NO: 52) sgRNA.sub.HSPA 5′-ACCGCAGGAGTAGGTGGTGCCC-3′ 5′-AAACGGGCACCACCTACTCCTG-3′ (SEQ ID NO: 53) (SEQ ID NO: 54)
(154) Oligonucleotides were dissolved to a concentration of 10 μM in 1×TE, and the paired oligonucleotides were mixed with TransTaq HiFi Buffer II (K10222, Beijing TransGen Biotech Co., Ltd.), heated to 95° C. for 3 minutes, and then slowly cooled to 4° C. These annealed oligonucleotide pairs were phosphorylated for 30 minutes at 37° C. After heat inactivation, the product was ligated into the sgRNA backbone vector using the “Golden Gate” method. For plasmids expressing pgRNAs, the scaffold sequence of the gRNA and the U6 promoter were amplified with primers comprising two gRNA coding sequences (Table 5), and the PCR product was then purified and ligated into the sgRNA backbone vector using the “Golden Gate” method. Compared with the previously reported sgRNA backbone vector [17], the sgRNA backbone vector of the present invention has modifications in the sgRNA backbone [38], and has the EGFP sequence replaced with the mCherry coding sequence.
(155) T7E1 Assay
(156) Genomic DNAs were extracted using a DNeasy Blood & Tissue kit (69504, Qiagen, Hilden, Germany), and the genomic region comprising the gRNA target sequence was subjected to PCR amplification. The primer sequences used in the assay are shown in Table 2, Table 5, Table 7, and Table 8. 300-500 ng of PCR product obtained using these primer sequences was mixed with 10× NEB Buffer2 in a 50 μl of system, heated at 95° C. for 3 minutes, and slowly cooled to room temperature. The resulting product was incubated with 0.5 μl of T7E1 for 15 min at 37° C. for agarose gel electrophoresis. The electropherogram was analyzed by Image J image analysis software for the band cleavage efficiency which indicates the efficiency of generating Indels by sgRNAs.
(157) Construction of a Linear Donor
(158) A donor sequence comprising a CMV-driven puromycin-resistant gene or EGFP gene, and termination codon sequences were pre-produced, and cloned into the pEASY-T5-Zero clone vector (CT501-02, Beijing TransGen Biotech Co., Ltd.) as a universal template. The template was amplified using primers comprising sgRNA cleavage target sites and protective sequences. The primer sequences are shown in Table 10.
(159) TABLE-US-00012 TABLE 10 Primers for construction of a linear donor Primer (forward/reverse) Donor Step 1 Step 2 Donor.sub.ANTXR1-sg1 5′-AGAGCCCTCGGCATCGGCTTCCAGTGGCTCTCTTT 5′-TCCACTGCGACGTCGCGAGTAGCGGAGAGCCC GGTTAGTCACCTACTAGTTAGTCA-3′ TCGGCATCGGCTTCCAGTGGCTCTC-3′ (SEQ ID NO: 55) (SEQ ID NO: 57) /5′-GGCTTAGGATTGTTACGCCCTCACTTATCTACTA /5′-GGCTTAGGATTGTTACGCCCTCACTTATCTA ATCAATTA-3′ CTAATCAATTA-3′ (SEQ ID NO: 56) (SEQ ID NO: 56) Donor.sub.ANTXR1-sg2 5′-TCTGCGCCGGGCAAGGGGGACGCAGGGAGGATGGG 5′-TCCACTGCGACGTCGCGAGTTGCTCATCTGCG GGTTAGTCACCTACTAGTTAGTCA-3′ CCGGGCAAGGGGGACGCAGGGCAGGAT-3′ (SEQ ID NO: 58) (SEQ ID NO: 59) /5′-GGCTTAGGATTGTTACGCCCTCACTTATCTACTA /5′-GGCTTAGGATTGTTACGCCCTCACTTATCTA ATCAATTA-3′ CTAATCAATTA-3′ (SEQ ID NO: 56) (SEQ ID NO: 56) Donor.sub.ANTRXR1-pgRNA 5′-AGAGCCCTCGGCATCGGCTTCCAGTGGCTCTCTTT 5′-TCCACTGCGACGTCGCGAGTAGCGGAGAGCCC GGTTAGTCACCTACTAGTTAGTCA-3′ TCGGCATCGGCTTCCAGTGGCTCTC-3′ (SEQ ID NO: 55) (SEQ ID NO: 57) /5′-CCCGGCGCAGATGAGCACCAGAGTGGCCAAAGAG /5′-GGCTTAGGATTGTTACGCCCCCCTTGCCCGG AGCTCACTTATCTACTAATCAATTA-3′ CGCAGATGAGCACCAGAGTGG-3′ (SEQ ID NO: 60) (SEQ ID NO: 61) Donor.sub.HBEGF-sg1 5′-GCGAGAGCCTGGAGCGGCTTCGGAGAGGGCTAGCT 5′-TCCACTGCGACGTCGCGAGTGACTGGCGAGAG GCTTAGTCACCTACTAGTTAGTCA-3′ CCTGGAGCGGCTTCGGAGAGGGCT-3′ (SEQ ID NO: 62) (SEQ ID NO: 63) /5′-GGCTTAGGATTGTTACGCCCTCACTTATCTACTA /5′-GGCTTAGGATTGTTACGCCCTCACTTATCTA ATCAATTA-3′ CTAATCAATTA-3′ (SEQ ID NO: 56) (SEQ ID NO: 56) Donor.sub.HBEGF-sg2 5′-AGCTGCTACCCCTAGGAGGCGGCCGGGACCGGAAA 5′-TCCACTGCGACGTCGCGAGTCGGACCAGCTGC GTTAGTCACCTACTAGTTAGTCA-3′ TACCCCTAGGAGGCGGCCGGGACCGGA-3′ (SEQ ID NO: 64) (SEQ ID NO: 65) /5′-GGCTTAGGATTGTTACGCCCTCACTTATCTACTA /5′-GGCTTAGGATTGTTACGGGCCACTTATCTAC ATCAATTA-3′ TAATCAATTA-3′ (SEQ ID NO: 56) (SEQ ID NO: 56) Donor.sub.HBGEF-pgRNA 5′-GCGAGAGCCTGGAGCGGCTTCGGAGAGGGCTAGCT 5′-TCCACTGCGACGTCGCGAGTGACTGGCGAGAG GCTTAGTCACCTACTAGTTAGTCA-3′ CCTGGAGCGGCTTCGGAGAGGGCT-3′ (SEQ ID NO: 62) (SEQ ID NO: 63) /5′-GGGTAGCAGCTGGTCCGTGGATACAGTGGGAGGG /5′-GGCTTAGGATTGTTACGCCCCCTAGGGGTAG TCCTCACTTATCTACTAATCAATTA-3′ CAGCTGGTCCGTGGATACAGTGGGA-3′ (SEQ ID NO: 66) (SEQ ID NO: 67) Donor.sub.PSEN1 5′-GATGTCTGAGGACAACCACCTGAGCAATACTTTAG 5′-TCCACTGCGACGTCGCGAGTCCAGAATGCACA TCACCTACTAGTTAGTCA-3′ GATGTCTGAGGACAACCACCTG-3′ (SEQ ID NO: 68) (SEQ ID NO: 69) /5′-GGCTTAGGATTGTTACGCCCTCACTTATCTACTA /5′-GGCTTAGGATTGTTACGCCCTCACTTATCTA ATCAATTA-3′ CTAATCAATTA-3′ (SEQ ID NO: 56) (SEQ ID NO: 56) Donor.sub.PSEN2 5′-CTGACAGCGAGGAAGAAGTGTGTGATGAGCGGTTA 5′-TCCACTGCGACGTCGCGAGTTTCATGGCCTCT GTCACCTACTAGTTAGTCA-3′ GACAGCGAGGAAGAAGTG-3′ (SEQ ID NO: 70) (SEQ ID NO: 71) /5′-GGCTTAGGATTGTTACGCCCTCACTTATCTACTA /5′-GGCTTAGGATTGTTACGCCCTCACTTATCTA ATCAATTA-3′ CTAATCAATTA-3′ (SEQ ID NO: 56) (SEQ ID NO: 56) Donor.sub.PSEN1+PSEN2 5′-GATGTCTGAGGACAACCACCTGAGCAATACTTTAG 5′-TCCACTGCGACGTCGCGAGTCCAGAATGCACA TCACCTACTAGTTAGTCA-3′ GATGTCTGAGGACAACCACCTG-3′ (SEQ ID NO: 68) (SEQ ID NO: 69) /5′-GGCTTAGGATTGTTACGCCCCCTCGCTGTCAGAG /5′-GCCATGAATGTGAGCATAGCCCTGCCTCTCA GCCATGAATGTGAGCATAGCC-3′ CTTATCTACTAATCAATTA-3′ (SEQ ID NO: 72) (SEQ ID NO: 73) Donor.sub.HSPA 5′-ACCACCTACTCCTGCGTGGGGGTGTTCCAACACGT 5′-TCCACTGCGACGTCGCGAGTCCTGGGCACCAC TAGTCACCTACTAGTTAGTCA-3′ CTACTCCTGCGTGGGGGTGTTC-3′ (SEQ ID NO: 74) (SEQ ID NO: 75) /5′-GGCTTAGGATTGTTACGCCCTCACTTATCTACTA /5′-GGCTTAGGATTGTTACGCCCTCACTTATCTA ATCAATTA-3′ CTAATCAATTA-3′ (SEQ ID NO: 56) (SEQ ID NO: 56)
(160) NHEJ-Based Donor Insertion and Cell Selection
(161) HeLa.sub.oc cells were transfected with 1 μg of purified linear donor PCR product and 1 μg of sgRNA/pgRNA, and treated with 1 μg/ml puromycin two weeks after transfection. HeLa and HEK293T cells were transfected with 1 μg of a donor, 0.5 μg of sgRNA/pgRNA and 0.5 μg of Cas9 plasmid. The cells were then treated with 1 μg/ml puromycin two weeks after transfection, or determined to be EGFP positive by the fluorescence activated cell sorting (FACS), depending on which type of donor was used.
(162) Splinkerette PCR
(163) The splinkerette PCR method has been previously reported (Potter, C. J. & Luo, L. Splinkerette PCR for mapping transposable elements in Drosophila. PLoS One 5, e10168 (2010); Uren, A. G. et al. A high-throughput splinkerette-PCR method for the isolation and sequencing of retroviral insertion sites. Nat Protoc 4, 789-798 (2009); and Yin, B. & Largaespada, D.A. PCR-based procedures to isolate insertion sites of DNA elements. Biotechniques 43, 79-84 (2007)). The primer and adaptor sequences used are shown in Table 11.
(164) TABLE-US-00013 TABLE 11 Primers for Splinkerette PCR Primer Sequence Long-strand 5′-CGAAGAGTAACCGTTGCTAGGAGAGACCGTGGC adaptor TGAATGAGACTGGTGTCGACACTAGTGG-3′ (SEQ ID NO: 76) Short-strand 5′-CGCGCCACTAGTGTCGACACCAGTCTCTAATTT adaptor TTTTTTTCAAAAAAA (SEQ ID NO: 77) Splink1 5′-CGAAGAGTAACCGTTGCTAGGAGAGACC-3′ (SEQ ID NO: 78) Splink2 5′-GTGGCTGAATGAGACTGGTGTCGAC-3′ (SEQ ID NO: 79) R1 5′-GCAACCTCCCCTTCTACGAGCGGC-3′ (SEQ ID NO: 80) R2 5′-GCATGGCCGTGTTGAGCGGTTCCC-3′ (SEQ ID NO: 81)
REFERENCES
(165) 1. Kim, Y. G., Cha, J. & Chandrasegaran, S. Hybrid restriction enzymes: zinc finger fusions to Fok I cleavage domain. Proc Natl Acad Sci USA 93, 1156-1160 (1996). 2. Boch, J. et al. Breaking the code of DNA binding specificity of TAL-type III effectors. Science 326, 1509-1512 (2009). 3. Moscou, M. J. & Bogdanove, A. J. A simple cipher governs DNA recognition by TAL effectors. Science 326, 1501 (2009). 4. Miller, J. C. et al. A TALE nuclease architecture for efficient genome editing. Nat Biotechnol 29, 143-148 (2011). 5. Jinek, M. et al. A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science 337, 816-821 (2012). 6. Mali, P. et al. RNA-guided human genome engineering via Cas9. Science 339, 823-826 (2013). 7. Cong, L. et al. Multiplex Genome Engineering Using CRISPR/Cas Systems. Science 339, 819-823 (2013). 8. Phillips, E. R. & McKinnon, P. J. DNA double-strand break repair and development. Oncogene 26, 7799-7808 (2007). 9. Chapman, J. R., Taylor, M. R. & Boulton, S. J. Playing the end game: DNA double-strand break repair pathway choice. Mol Cell 47, 497-510 (2012). 10. Gilbert, L. A. et al. CRISPR-mediated modular RNA-guided regulation of transcription in eukaryotes. Cell 154, 442-451 (2013). 11. Ma, H. et al. Multicolor CRISPR labeling of chromosomal loci in human cells. Proc Natl Acad Sci USA 112, 3002-3007 (2015). 12. Chen, B. et al. Dynamic imaging of genomic loci in living human cells by an optimized CRISPR/Cas system. Cell 155, 1479-1491 (2013). 13. Li, H.L., Gee, P., Ishida, K. & Hotta, A. Efficient genomic correction methods in human iPS cells using CRISPR-Cas9 system. Methods (2015). 14. Savic, N. & Schwank, G. Advances in therapeutic CRISPR/Cas9 genome editing. Translational research: the journal of laboratory and clinical medicine 168, 15-21 (2016). 15. Miyaoka, Y. et al. Isolation of single-base genome-edited human iPS cells without antibiotic selection. Nat Methods 11, 291-293 (2014). 16. Fu, Y. et al. High-frequency off-target mutagenesis induced by CRISPR-Cas nucleases in human cells. Nat Biotechnol 31, 822-826 (2013). 17. Zhou, Y. et al. High-throughput screening of a CRISPR/Cas9 library for functional genomics in human cells. Nature 509, 487-491 (2014). 18. Yu, C. et al. Small Molecules Enhance CRISPR Genome Editing in Pluripotent Stem
(166) Cells. Cell Stem Cell 16, 142-147 (2015). 19. Liao, S., Tammaro, M. & Yan, H. Enriching CRISPR-Cas9 targeted cells by co-targeting the HPRT gene. Nucleic Acids Res 43, e134 (2015). 20. Kim, H. et al. Surrogate reporters for enrichment of cells with nuclease-induced mutations. Nat Methods 8, 941-943 (2011). 21. Ramakrishna, S. et al. Surrogate reporter-based enrichment of cells containing RNA-guided Cas9 nuclease-induced mutations. Nature communications 5, 3378 (2014). 22. Wang, T., Wei, J. J., Sabatini, D. M. & Lander, E. S. Genetic screens in human cells using the CRISPR-Cas9 system. Science 343, 80-84 (2014). 23. Yuan, P. et al. Chondroitin sulfate proteoglycan 4 functions as the cellular receptor for Clostridium difficile toxin B. Cell Res 25, 157-168 (2015). 24. Yang, J. et al. ULtiMATE System for Rapid Assembly of Customized TAL Effectors. PLoS One 8, e75649 (2013). 25. Perez, E. E. et al. Establishment of HIV-1 resistance in CD4+ T cells by genome editing using zinc-finger nucleases. Nat Biotechnol 26, 808-816 (2008). 26. Kim, H. J., Lee, H. J., Kim, H., Cho, S. W. & Kim, J. S. Targeted genome editing in human cells with zinc finger nucleases constructed via modular assembly. Genome Res 19, 1279-1288 (2009). 27. Lackner, D. H. et al. A generic strategy for CRISPR-Cas9-mediated gene tagging. Nature communications 6, 10237 (2015). 28. Auer, T. O. & Del Bene, F. CRISPR/Cas9 and TALEN-mediated knock-in approaches in zebrafish. Methods (2014). 29. Li, K., Wang, G., Andersen, T., Zhou, P. & Pu, W. T. Optimization of Genome Engineering Approaches with the CRISPR/Cas9 System. PLoS One 9, e105779 (2014). 30. Orlando, S. J. et al. Zinc-finger nuclease-driven targeted integration into mammalian genomes using donors with limited chromosomal homology. Nucleic Acids Res 38, e152 (2010). 31. Sakuma, T., Nakade, S., Sakane, Y., Suzuki, K. T. & Yamamoto, T. MMEJ-assisted gene knock-in using TALENs and CRISPR-Cas9 with the PITCh systems. Nat Protoc 11, 118-133 (2016). 32. Nakade, S. et al. Microhomology-mediated end-joining-dependent integration of donor DNA in cells and animals using TALENs and CRISPR/Cas9. Nature communications 5, 5560 (2014). 33. Cristea, S. et al. In vivo cleavage of transgene donors promotes nuclease-mediated targeted integration. Biotechnol Bioeng 110, 871-880 (2013). 34. Chen, F. et al. High-frequency genome editing using ssDNA oligonucleotides with zinc-finger nucleases. Nat Methods 8, 753-755 (2011). 35. Potter, C. J. & Luo, L. Splinkerette PCR for mapping transposable elements in Drosophila. PLoS One 5, e10168 (2010). 36. Uren, A. G. et al. A high-throughput splinkerette-PCR method for the isolation and sequencing of retroviral insertion sites. Nat Protoc 4, 789-798 (2009). 37. Yin, B. & Largaespada, D. A. PCR-based procedures to isolate insertion sites of DNA elements. Biotechniques 43, 79-84 (2007). 38. Peng, J., Zhou, Y., Zhu, S. & Wei, W. High-throughput screens in mammalian cells using the CRISPR-Cas9 system. FEBS J 282, 2089-2096 (2015).