COMPOSITIONS AND METHODS FOR EFFICIENT GENOME EDITING
20200370070 ยท 2020-11-26
Inventors
Cpc classification
C12N15/873
CHEMISTRY; METALLURGY
C12N15/88
CHEMISTRY; METALLURGY
International classification
C12N15/90
CHEMISTRY; METALLURGY
C12N15/88
CHEMISTRY; METALLURGY
Abstract
The present invention relates to the field of genome editing. More specifically, the present invention provides compositions and methods useful in clustered regularly interspaced short palindromic repeats (CRISPR)-based techniques. In one embodiment, the present invention provides a double-stranded, linear donor polynucleotide comprising a template polynucleotide flanked by a first homology arm and a second homology arm, wherein the homology arms are between 30-35 bases in length.
Claims
1. A double-stranded, linear donor polynucleotide comprising a polynucleotide encoding a fluorescent protein flanked by a first homology arm and a second homology arm.
2. The polynucleotide of claim 1, wherein the homology arms are 15-60 bases in length.
3. The polynucleotide of claim 1, wherein the homology arms are 25-45 bases in length.
4. The polynucleotide of claim 1, wherein the homology arms are 30-40 bases in length.
5. A double-stranded, linear donor polynucleotide comprising a polynucleotide encoding a fluorescent protein flanked by a first homology arm and a second homology arm, wherein the first and second homology arms are between 30-35 bases in length.
6. A double-stranded, linear donor polynucleotide comprising a template polynucleotide encoding an edit flanked by an intervening sequence and two homology arms.
7. The polynucleotide of claim 6, wherein the homology arms are 15-60 bases in length.
8. The polynucleotide of claim 6, wherein the homology arms are 25-45 bases in length.
9. The polynucleotide of claim 6, wherein the homology arms are 30-40 bases in length.
10. The polynucleotide of claim 6, wherein the template polynucleotide is up to 1 kb in length.
11. The polynucleotide of claim 6, wherein the template polynucleotide comprises a sequence designed to change at least one nucleotide base within 30 bases of a double-stranded break (DSB) of a target nucleic acid.
12. The polynucleotide of claim 11, wherein the template polynucleotide further comprises a restriction enzyme site.
13. A double-stranded, linear donor polynucleotide comprising a template polynucleotide flanked by a first homology arm and a second homology arm, wherein the homology arms are between 30-35 bases in length.
14. The polynucleotide of claim 14, wherein the template polynucleotide is up to 1 kb in length.
15. The polynucleotide of claim 14, wherein the template polynucleotide comprises a sequence designed to change at least one nucleotide base within 30 bases of a DSB of a target nucleic acid.
16. The polynucleotide of claim 15, wherein the template polynucleotide further comprises a restriction enzyme site.
17. A method comprising the step of performing a clustered regularly interspaced short palindromic repeats (CRISPR)-based technique using a double-stranded, linear donor polynucleotide of claim 6 as the donor polynucleotide.
18. A method comprising injecting into a target cell a composition comprising (a) an RNA-guided DNA endonuclease; (b) a guide RNA; and (c) a double-stranded, linear donor polynucleotide of claim 6.
Description
BRIEF DESCRIPTION OF THE FIGURES
[0010]
[0011]
[0012]
[0013]
[0014]
[0015]
[0016]
[0017]
[0018]
[0019]
[0020]
[0021]
[0022]
[0023]
[0024]
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
DETAILED DESCRIPTION OF THE INVENTION
[0031] It is understood that the present invention is not limited to the particular methods and components, etc., described herein, as these may vary. It is also to be understood that the terminology used herein is used for the purpose of describing particular embodiments only, and is not intended to limit the scope of the present invention. It must be noted that as used herein and in the appended claims, the singular forms a, an, and the include the plural reference unless the context clearly dictates otherwise. Thus, for example, a reference to a protein is a reference to one or more proteins, and includes equivalents thereof known to those skilled in the art and so forth.
[0032] Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Specific methods, devices, and materials are described, although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention.
[0033] All publications cited herein are hereby incorporated by reference including all journal articles, books, manuals, published patent applications, and issued patents. In addition, the meaning of certain terms and phrases employed in the specification, examples, and appended claims are provided. The definitions are not meant to be limiting in nature and serve to provide a clearer understanding of certain aspects of the present invention.
I. Definitions
[0034] Unless otherwise indicated, the terms polynucleotide and nucleic acid refer to a deoxyribonucleotide or ribonucleotide polymer, in linear or circular conformation, and in either single- or double-stranded form. The terms can encompass known analogs of natural nucleotides, as well as nucleotides that are modified in the base, sugar and/or phosphate moieties (e.g., phosphorothioate backbones). In general, an analog of a particular nucleotide has the same base-pairing specificity; i.e., an analog of A will base-pair with T.
[0035] The term nucleotide refers to deoxyribonucleotides or ribonucleotides. The nucleotides may be standard nucleotides (i.e., adenosine, guanosine, cytidine, thymidine, and uridine) or nucleotide analogs. A nucleotide analog refers to a nucleotide having a modified purine or pyrimidine base or a modified ribose moiety. A nucleotide analog may be a naturally occurring nucleotide (e.g., inosine) or a non-naturally occurring nucleotide. Non-limiting examples of modifications on the sugar or base moieties of a nucleotide include the addition (or removal) of acetyl groups, amino groups, carboxyl groups, carboxymethyl groups, hydroxyl groups, methyl groups, phosphoryl groups, and thiol groups, as well as the substitution of the carbon and nitrogen atoms of the bases with other atoms (e.g., 7-deaza purines). Nucleotide analogs also include dideoxy nucleotides, 2-O-methyl nucleotides, locked nucleic acids (LNA), peptide nucleic acids (PNA), and morpholinos.
[0036] A gene, as used herein, refers to a DNA region (including exons and introns) encoding a gene product, as well as all DNA regions which regulate the production of the gene product, whether or not such regulatory sequences are adjacent to coding and/or transcribed sequences. Accordingly, a gene includes, but is not necessarily limited to, promoter sequences, terminators, translational regulatory sequences such as ribosome binding sites and internal ribosome entry sites, enhancers, silencers, insulators, boundary elements, replication origins, matrix attachment sites, and locus control regions.
[0037] As used herein, an edit is the desired modification to be introduced into the genome. In other words, an edit is any change in the genomic sequence that is included in the repair template polynucleotide. Edits can include, for example, base pair insertions, deletions or changes.
[0038] The term intervening sequence refers to a sequence between the edit and the double-stranded break (DSB). An intervening sequence can be unmodified (identical to genome sequence) or can be modified (for example, see
[0039] As used herein, a homology arm, homology sequence or sequence homologous to a reference or target gene/sequence describes a polynucleotide sequence that has substantial sequence identity to a corresponding segment of the reference or target gene/sequence, e.g., at least 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99% identical or even 100% identical, to the nucleotide sequence of the reference or target gene/sequence, such that, when placed under appropriate conditions, homologous recombination can take place between a pair of homologous sequences and their reference or target gene/sequence. The homology arms have substantial sequence identity to the sequence upstream and downstream of the targeted site in the target nucleic acid molecule.
[0040] For edits inserted to the right of a DSB: the right homology arm corresponds to the genomic sequence immediately to the right of the insertion point of the edit and the left homology arm corresponds to the genomic sequence immediately on the left side of the DSB.
[0041] For edits inserted to the left of a DSB: the left homology arm corresponds to the genomic sequence immediately to the left of the insertion point of the edit and the right homology arm corresponds to the genomic sequence immediately on the right side of the DSB.
[0042] The terms target sequence, target nucleic acid or target DNA sequence, when used to refer to a pre-determined segment of a genomic sequence or polynucleotide is similarly defined in regard to the percentage sequence identity between the target sequence and its corresponding guide RNA. On the other hand, a homology arm or target sequence is of the appropriate length that ensures its purpose. Typically, a homology arm is in the size range of about 10-100, 10-90, 10-80, 15-75, 15-70, 15-65, 15-60, 15-55, 15-50, 15-45, 15-40, 15-35, 20-50, 20-45, 20-40, 20-35, 25-40, 25-35 or 30-35 nucleotides (e.g., about 30, 35, 40, 45, 50, 55 or 60 nucleotides in length); whereas a target sequence may vary in the size range of about 10-50, 15-45, or 20-40 (e.g., about 20, 25, or 30) nucleotides. In some embodiments, the target sequence contains a sequence that is suitable as a substrate for an RNA-guided DNA endonuclease (e.g., a Cas9 nuclease) (i.e., a nuclease target sequence site). In some embodiments, the target sequence contains a sequence that is suitable as a substrate for Cfp1 endonuclease (i.e., an endonuclease target sequence site).
[0043] Techniques for determining nucleic acid and amino acid sequence identity are known in the art. Typically, such techniques include determining the nucleotide sequence of the mRNA for a gene and/or determining the amino acid sequence encoded thereby, and comparing these sequences to a second nucleotide or amino acid sequence. Genomic sequences can also be determined and compared in this fashion. In general, identity refers to an exact nucleotide-to-nucleotide or amino acid-to-amino acid correspondence of two polynucleotides or polypeptide sequences, respectively. Two or more sequences (polynucleotide or amino acid) can be compared by determining their percent identity. The percent identity of two sequences, whether nucleic acid or amino acid sequences, is the number of exact matches between two aligned sequences divided by the length of the shorter sequences and multiplied by 100. An approximate alignment for nucleic acid sequences is provided by the local homology algorithm of Smith and Waterman, 2 A
[0044] Cas9 or (CRISPR associated protein 9) is an RNA-guided DNA endonuclease enzyme associated with the CRISPR (Clustered Regularly Interspersed Palindromic Repeats) adaptive immunity system in Streptococcus pyogenes, among other bacteria. S. pyogenes utilizes the CRISPR system to memorize and later interrogate and cleave foreign DNA, such as the DNA of an invading bacteriophage. Cas9, complexed with a guide RNA, performs this interrogation by unwinding foreign DNA and checking whether the DNA contains any sequence segment complementary to a spacer region of the guide RNA. If the guide RNA finds sequence complementarity in the DNA, it is cleaved by Cas9.
[0045] Cpf1 or CRISPR/Cpf1 is a DNA editing technology analogous to the CRISPR/Cas9 system. Cpf1 is an RNA-guided DNA endonuclease enzyme associated with the CRISPR adaptive immunity system in Prevotella and Francisella, among other bacteria. Cpf1 is a smaller and simpler endonuclease as compared to Cas9 because Cpf1 only requires one RNA molecule to cut DNA while Cas9 requires two. Cpf1 is a Type V CRISPR/Cas system containing a 1,300 amino acid protein.
[0046] As used herein, sgRNA or small guide RNA refers to a short RNA molecule that is capable of forming a complex with Cas9 protein and contains a segment of about 20 nucleotides complementary to a target DNA sequence, such that the Cas9-sgRNA complex directs Cas9 cleavage of a target DNA sequence upon the sgRNA recognizing the complementary sequence in the target DNA sequence. Accordingly, a sgRNA is approximately a 20-base sequence (ranging from about 10-50, 15-45, or 20-40, for example, 15, 20, 25, or 30 bases) specific to the target DNA 5 of a non-variable scaffold sequence.
[0047] As used herein, the term endogenous sequence refers to a chromosomal sequence that is native to the cell.
[0048] The term exogenous, as used herein, refers to a sequence that is not native to the cell, or a chromosomal sequence whose native location in the genome of the cell is in a different chromosomal location.
[0049] The term heterologous refers to an entity that is not endogenous or native to the cell of interest. For example, a heterologous protein refers to a protein that is derived from or was originally derived from an exogenous source, such as an exogenously introduced nucleic acid sequence. In some instances, the heterologous protein is not normally produced by the cell of interest.
II. RNA-Guided Endonucleases
[0050] In particular embodiments, the compositions and methods of the present invention utilize RNA-guided endonucleases. In some embodiments, the endonuclease comprises at least one nuclear localization signal, which permits entry of the endonuclease into the nuclei of eukaryotic cells and embryos such as, for example, non-human one-cell embryos. In other embodiments, RNA-guided endonucleases comprise at least one nuclease domain and at least one domain that interacts with a guide RNA. An RNA-guided endonuclease is directed to a specific nucleic acid sequence (or target sequence/site) by a guide RNA. The guide RNA interacts with the RNA-guided endonuclease as well as the target site such that, once directed to the target site, the RNA-guided endonuclease is able to introduce a double-stranded break into the target site nucleic acid sequence. Since the guide RNA provides the specificity for the targeted cleavage, the endonuclease of the RNA-guided endonuclease is universal and can be used with different guide RNAs to cleave different target nucleic acid sequences. The RNA-guided endonuclease can be derived from a clustered regularly interspersed short palindromic repeats (CRISPR)/CRISPR-associated (Cas) system. The CRISPR/Cas system can be a type I, a type II, or a type III system. Non-limiting examples of suitable CRISPR/Cas proteins include Cas3, Cas4, Cas5, Cas5e (or CasD), Cas6, Cas6e, Cas6f, Cas7, Cas8a1, Cas8a2, Cas8b, Cas8c, Cas9, Cas10, Cas10d, CasF, CasG, CasH, Csy1, Csy2, Csy3, Cse1 (or CasA), Cse2 (or CasB), Cse3 (or CasE), Cse4 (or CasC), Csc1, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Csz1, Csx15, Csf1, Csf2, Csf3, Csf4, and Cu1966.
[0051] In one embodiment, the RNA-guided endonuclease is derived from a type II CRISPR/Cas system. In specific embodiments, the RNA-guided endonuclease is derived from a Cas9 protein. The Cas9 protein can be from Streptococcus pyogenes, Streptococcus thermophilus, Streptococcus sp., Nocardiopsis dassonvillei, Streptomyces pristinaespiralis, Streptomyces viridochromogenes, Streptomyces viridochromogenes, Streptosporangium roseum, Streptosporangium roseum, Alicyclobacillus acidocaldarius, Bacillus pseudomycoides, Bacillus selenitireducens, Exiguobacterium sibiricum, Lactobacillus delbrueckii, Lactobacillus salivarius, Microscilla marina, Burkholderiales bacterium, Polaromonas naphthalenivorans, Polaromonas sp., Crocosphaera watsonii, Cyanothece sp., Microcystis aeruginosa, Synechococcus sp., Acetohalobium arabaticum, Ammonifex degensii, Caldicelulosiruptor becscii, Candidatus Desulforudis, Clostridium botulinum, Clostridium difficile, Finegoldia magna, Natranaerobius thermophilus, Pelotomaculum the mopropionicum, Acidithiobacillus caldus, Acidithiobacillus ferrooxidans, Allochromatium vinosum, Marinobacter sp., Nitrosococcus halophilus, Nitrosococcus watsoni, Pseudoalteromonas haloplanktis, Ktedonobacter racemifer, Methanohalobium evestigatum, Anabaena variabilis, Nodularia spumigena, Nostoc sp., Arthrospira maxima, Arthrospira platensis, Arthrospira sp., Lyngbya sp., Microcoleus chthonoplastes, Oscillatoria sp., Petrotoga mobilis, Thermosipho africanus, or Acaryochloris marina.
[0052] In other embodiments, the RNA-guided endonuclease is derived from another Cas nuclease including, but not limited to, Cpf1, C2c1, C2c2, and C2c3 proteins. Cpf1 is similar to Cas9, and contains a RuvC-like nuclease domain. See Zetsche et al., 163 C
III. Guide RNA
[0053] In some embodiments of the present disclosure, a CRISPR/Cas nuclease system includes at least one guide RNA. In some embodiments, the guide RNA and the Cas protein may form a ribonucleoprotein (RNP), e.g., a CRISPR/Cas complex. The guide RNA may guide the Cas protein to a target sequence on a target nucleic acid molecule, where the guide RNA hybridizes with, and the Cas protein cleaves, the target sequence. In some embodiments, the CRISPR/Cas complex may be a Cpf1/guide RNA complex. In some embodiments, the CRISPR complex may be a Type-II CRISPR/Cas9 complex. In some embodiments, the Cas protein may be a Cas9 protein. In some embodiments, the CRISPR/Cas9 complex may be a Cas9/guide RNA complex.
[0054] A guide RNA for a CRISPR/Cas9 nuclease system comprises a CRISPR RNA (crRNA) and a tracr RNA (tracr). In another embodiment, a single guide RNA (sgRNA)a chimer of cr/tracrRNAcan be used. See Doudna, J. A. & Charpentier, E., 346(6213) S
[0055] The guide RNA may target any sequence of interest via the targeting sequence of the crRNA. In some embodiments, the degree of complementarity between the targeting sequence of the guide RNA and the target sequence on the target nucleic acid molecule may be about 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, 99%, or 100%. In some embodiments, the targeting sequence of the guide RNA and the target sequence on the target nucleic acid molecule may be 100% complementary. In other embodiments, the targeting sequence of the guide RNA and the target sequence on the target nucleic acid molecule may contain at least one mismatch. For example, the targeting sequence of the guide RNA and the target sequence on the target nucleic acid molecule may contain 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 mismatches. In some embodiments, the targeting sequence of the guide RNA and the target sequence on the target nucleic acid molecule may contain 1-6 mismatches. In some embodiments, the targeting sequence of the guide RNA and the target sequence on the target nucleic acid molecule may contain 5 or 6 mismatches.
[0056] The length of the targeting sequence of the guide RNA may depend on the CRISPR/Cas9 system and components used. For example, different Cas9 proteins from different bacterial species have varying optimal targeting sequence lengths. Accordingly, the targeting sequence may comprise 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, or more than 50 nucleotides in length. In some embodiments, the targeting sequence may comprise 18-24 nucleotides in length. In some embodiments, the targeting sequence may comprise 19-21 nucleotides in length. In some embodiments, the targeting sequence may comprise 20 nucleotides in length.
IV. Target Site/Sequence of the Target Nucleic Acid Molecule
[0057] An RNA-guided endonuclease in conjunction with a guide RNA is directed to a target site in the chromosomal sequence, wherein the RNA-guided endonuclease introduces a double-stranded break in the chromosomal sequence. The target site has no sequence limitation except that the sequence is immediately followed (downstream) by a consensus sequence. This consensus sequence is also known as a protospacer adjacent motif (PAM). Examples of PAMs include, but are not limited to, NGG, NGGNG, and NNAGAAW (wherein N is defined as any nucleotide and W is defined as either A or T). In particular embodiments, the first region (at the 5 end) of the guide RNA is complementary to the protospacer of the target sequence. Typically, the first region of the guide RNA is about 19 to 21 nucleotides in length. Thus, in certain aspects, the sequence of the target site in the chromosomal sequence is 5-N.sub.19-21-NGG-3. The PAM is in italics.
[0058] The target site can be in the coding region of a gene, in an intron of a gene, in a control region of a gene, in a non-coding region between genes, etc. The gene can be a protein coding gene or an RNA coding gene. The gene can be any gene of interest.
V. Linear Donor Polynucleotides & Design Parameters Thereof
[0059] In certain embodiments, the present invention provides a double-stranded, linear donor polynucleotide comprising a template polynucleotide encoding an edit flanked by an intervening sequence and two homology arms. In other embodiments, the donor polynucleotide comprises a template polynucleotide encoding an edit flanked by two homology arms.
[0060] In some embodiments, the template polynucleotide of the double-stranded, linear donor polynucleotide may correspond to an endogenous sequence of a target cell. In some embodiments, the endogenous sequence may be a genomic sequence of the cell. In some embodiments, the endogenous sequence may be a chromosomal or extrachromosomal sequence. In some embodiments, the endogenous sequence may be a plasmid sequence of the cell. In some embodiments, the template sequence may be substantially identical to a portion of the endogenous sequence in a cell at or near the cleavage site, but comprise at least one nucleotide change (i.e., an edit as defined herein). In some embodiments, the repair of the cleaved target nucleic acid molecule with the template may result in an edit comprising an insertion, deletion, or substitution of one or more nucleotides of the target nucleic acid molecule. In some embodiments, the edit may result in one or more amino acid changes in a protein expressed from a gene comprising the target sequence. In some embodiments, the edit or mutation may result in one or more nucleotide changes in an RNA expressed from the target gene. In some embodiments, the edit may alter the expression level of the target gene. In some embodiments, the edit may result in increased or decreased expression of the target gene. In some embodiments, the edit may result in gene knockdown. In some embodiments, the edit may result in gene knockout. In some embodiments, the repair of the cleaved target nucleic acid molecule with the template may result in replacement of an exon sequence, an intron sequence, a transcriptional control sequence, a translational control sequence, or a non-coding sequence of the target gene.
[0061] In other embodiments, the double-stranded, linear donor polynucleotide encoding an edit may comprise an exogenous sequence. In some embodiments, the exogenous sequence may comprise a protein or RNA coding sequence operably linked to an exogenous promoter sequence such that, upon integration of the exogenous sequence into the target nucleic acid molecule, the cell is capable of expressing the protein or RNA encoded by the integrated sequence. In other embodiments, upon integration of the exogenous sequence into the target nucleic acid molecule, the expression of the integrated sequence may be regulated by an endogenous promoter sequence. In some embodiments, the exogenous sequence may be a chromosomal or extrachromosomal sequence. In some embodiments, the exogenous sequence may provide a cDNA sequence encoding a protein or a portion of the protein. In yet other embodiments, the exogenous sequence may comprise an exon sequence, an intron sequence, a transcriptional control sequence, a translational control sequence, or a non-coding sequence. In some embodiments, the integration of the exogenous sequence may result in gene knock-in.
[0062] In the double-stranded, linear donor polynucleotide, the template polynucleotide is flanked by a first homology arm and a second homology arm, e.g., a left homology arm and a right homology arm. These sequences to the left and right of the template polynucleotide have substantial sequence identity to sequences located to the left and right, respectively, of the target site of the RNA-guided endonuclease in the target nucleic acid molecule. Because of these sequence similarities, homology arms permit homologous recombination between the donor polynucleotide and the targeted sequence such that the template polynucleotide can serve as a template for DNA synthesis. In certain embodiments, the linear donor polynucleotide comprises a template polynucleotide encoding an edit flanked by an intervening sequence and two homology arms.
[0063] In certain embodiments, specifically, for edits inserted to the right of a DSB, the right homology arm corresponds to the genomic sequence immediately to the right of the insertion point of the edit and the left homology arm corresponds to the genomic sequence immediately on the left side of the DSB. In other embodiments, specifically, for edits inserted to the left of a DSB, the left homology arm corresponds to the genomic sequence immediately to the left of the insertion point of the edit and the right homology arm corresponds to the genomic sequence immediately on the right side of the DSB.
[0064] In particular embodiments, each homology arm can range in length from about 10 nucleotides to about 100 nucleotides. The recited range includes ranges within the recited range including, but not limited to, 10-100, 10-90, 10-80, 15-75, 15-70, 15-65, 15-60, 15-55, 15-50, 15-45, 15-40, 15-35, 20-50, 20-45, 20-40, 20-35, 25-40, 25-35 or 30-35 nucleotides (e.g., about 30, 35, 40, 45, 50, 55 or 60 nucleotides in length). In a specific embodiment, a homology arm is 15-60 nucleotides in length. In another embodiment, a homology arm is 25-45 nucleotides in length. In yet another embodiment, a homology arm is 30-40 nucleotides in length. In a further embodiment, a homology arm is 35 nucleotides in length. In certain embodiments, homology arms can comprise different lengths within the range.
VI. Introducing Genome Editing Compositions into the Cell or Embryo
[0065] The RNA-guided endonuclease(s) (or encoding nucleic acid), the guide RNA(s) (or encoding DNA), and the double-stranded, linear donor polynucleotide can be introduced into a cell or embryo by a variety of means. In some embodiments, the cell or embryo is transfected. Suitable transfection methods include calcium phosphate-mediated transfection, nucleofection (or electroporation), cationic polymer transfection (e.g., DEAE-dextran or polyethylenimine), viral transduction, virosome transfection, virion transfection, liposome transfection, cationic liposome transfection, immunoliposome transfection, nonliposomal lipid transfection, dendrimer transfection, heat shock transfection, magnetofection, lipofection, gene gun delivery, impalefection, sonoporation, optical transfection, and proprietary agent-enhanced uptake of nucleic acids. Transfection methods are well known in the art. In other embodiments, the molecules are introduced into the cell or embryo by microinjection. In certain embodiments, the embryo is a fertilized one-cell stage embryo of the species of interest. In such embodiments, the molecules can be injected into the pronuclei of one-cell embryos.
[0066] The RNA-guided endonuclease(s) (or encoding nucleic acid), the guide RNA(s) (or DNAs encoding the guide RNA), and the double-stranded, linear donor polynucleotide(s) can be introduced into the cell or embryo simultaneously or sequentially. The ratio of the RNA-guided endonuclease(s) (or encoding nucleic acid) to the guide RNA(s) (or encoding DNA) generally will be about stoichiometric such that they can form an RNA-protein complex. In one embodiment, DNA encoding an RNA-guided endonuclease and DNA encoding a guide RNA are delivered together within a plasmid vector.
[0067] In further embodiments, the method comprises maintaining the cell or embryo under appropriate conditions such that the guide RNA(s) directs the RNA-guided endonuclease(s) to the targeted site(s) in the chromosomal sequence, and the RNA-guided endonuclease(s) introduce at least one double-stranded break in the chromosomal sequence. A double-stranded break can be repaired by a DNA repair process such that the chromosomal sequence is modified by a deletion of at least one nucleotide, an insertion of at least one nucleotide, a substitution of at least one nucleotide, or a combination thereof.
[0068] In general, the cell is maintained under conditions appropriate for cell growth and/or maintenance. Suitable cell culture conditions are well known in the art. Those of skill in the art appreciate that methods for culturing cells can and will vary depending on the cell type. Routine optimization may be used, in all cases, to determine the best techniques for a particular cell type.
[0069] An embryo can be cultured in vitro (e.g., in cell culture). Typically, the embryo is cultured at an appropriate temperature and in appropriate media with the necessary O.sub.2/CO.sub.2 ratio to allow the expression of the RNA endonuclease and guide RNA, if necessary. Suitable non-limiting examples of media include M2, M16, KSOM, BMOC, and HTF media. A skilled artisan will appreciate that culture conditions can and will vary depending on the species of embryo. Routine optimization may be used, in all cases, to determine the best culture conditions for a particular species of embryo. In some cases, a cell line may be derived from an in vitro-cultured embryo (e.g., an embryonic stem cell line).
[0070] Alternatively, an embryo may be cultured in vivo by transferring the embryo into the uterus of a female host. Generally speaking the female host is from the same or similar species as the embryo. In certain embodiments, the female host is pseudo-pregnant. Methods of preparing pseudo-pregnant female hosts are known in the art. Additionally, methods of transferring an embryo into a female host are known. Culturing an embryo in vivo permits the embryo to develop and can result in a live birth of an animal derived from the embryo. Such an animal would comprise the modified chromosomal sequence in every cell of the body.
VII. Cell and Embryo Types
[0071] A variety of eukaryotic cells and embryos are suitable for use in the method. For example, the cell can be a human cell, a non-human mammalian cell, a non-mammalian vertebrate cell, an invertebrate cell, an insect cell, a plant cell, a yeast cell, or a single cell eukaryotic organism. In general, the embryo is non-human mammalian embryo. In specific embodiments, the embryos can be a one-cell non-human mammalian embryo. Exemplary mammalian embryos, including one-cell embryos, include without limit mouse, rat, hamster, rodent, rabbit, feline, canine, ovine, porcine, bovine, equine, and primate embryos. In still other embodiments, the cell can be a stem cell. Suitable stem cells include without limit embryonic stem cells, ES-like stem cells, fetal stem cells, adult stem cells, pluripotent stem cells, induced pluripotent stem cells, multipotent stem cells, oligopotent stem cells, unipotent stem cells and others. In exemplary embodiments, the cell is a mammalian cell.
[0072] Non-limiting examples of suitable mammalian cells include Chinese hamster ovary (CHO) cells, baby hamster kidney (BHK) cells; mouse myeloma NSO cells, mouse embryonic fibroblast 3T3 cells (NIH3T3), mouse B lymphoma A20 cells; mouse melanoma B16 cells; mouse myoblast C2C12 cells; mouse myeloma SP2/0 cells; mouse embryonic mesenchymal C3H-10T1/2 cells; mouse carcinoma CT26 cells, mouse prostate DuCuP cells; mouse breast EMT6 cells; mouse hepatoma Hepalclc7 cells; mouse myeloma J5582 cells; mouse epithelial MTD-1A cells; mouse myocardial MyEnd cells; mouse renal RenCa cells; mouse pancreatic RIN-5F cells; mouse melanoma X64 cells; mouse lymphoma YAC-1 cells; rat glioblastoma 9L cells; rat B lymphoma RBL cells; rat neuroblastoma B35 cells; rat hepatoma cells (HTC); buffalo rat liver BRL 3A cells; canine kidney cells (MDCK); canine mammary (CMT) cells; rat osteosarcoma D17 cells; rat monocyte/macrophage DH82 cells; monkey kidney SV-40 transformed fibroblast (COS7) cells; monkey kidney CVI-76 cells; African green monkey kidney (VERO-76) cells; human embryonic kidney cells (HEK293, HEK293T); human cervical carcinoma cells (HELA); human lung cells (W138); human liver cells (Hep G2); human U2-OS osteosarcoma cells, human A549 cells, human A-431 cells, and human K562 cells. An extensive list of mammalian cell lines may be found in the American Type Culture Collection catalog (ATCC, Manassas, Va.).
[0073] Without further elaboration, it is believed that one skilled in the art, using the preceding description, can utilize the present invention to the fullest extent. The following examples are illustrative only, and not limiting of the remainder of the disclosure in any way whatsoever.
EXAMPLES
[0074] The following examples are put forth so as to provide those of ordinary skill in the art with a complete disclosure and description of how the compounds, compositions, articles, devices, and/or methods described and claimed herein are made and evaluated, and are intended to be purely illustrative and are not intended to limit the scope of what the inventors regard as their invention. Efforts have been made to ensure accuracy with respect to numbers (e.g., amounts, temperature, etc.) but some errors and deviations should be accounted for herein. Unless indicated otherwise, parts are parts by weight, temperature is in degrees Celsius or is at ambient temperature, and pressure is at or near atmospheric. There are numerous variations and combinations of reaction conditions, e.g., component concentrations, desired solvents, solvent mixtures, temperatures, pressures and other reaction ranges and conditions that can be used to optimize the product purity and yield obtained from the described process. Only reasonable and routine experimentation will be required to optimize such process conditions.
Precision Genome Editing Using Synthesis-Dependent Repair of Cas9-Induced DNA Breaks
[0075] The RNA-guided DNA endonuclease Cas9 has emerged as a powerful new tool for genome engineering. Cas9 creates targeted double-strand breaks (DSBs) in the genome. Knock-in of specific mutations (precision genome editing) requires homology-directed repair (HDR) of the DSB by synthetic donor DNAs containing the desired edits, but HDR has been reported to be variably efficient. Here, we report that linear DNAs (single and double-stranded) engage in a high-efficiency HDR mechanism that requires only about 35 nucleotides of homology with the targeted locus to introduce edits ranging from about 1 to 1000 nucleotides. We demonstrate the utility of linear donors by introducing fluorescent protein tags in human cells and mouse embryos using PCR fragments. We find that repair is local, polarity-sensitive, and prone to template switching, characteristics that are consistent with gene conversion by synthesis-dependent strand-annealing (SDSA). Our findings enable rational design of synthetic donor DNAs for efficient genome editing.
[0076] We documented previously that, in C. elegans, HDR can be very efficient provided that the donor DNAs are linear (Paix et al., 44(15) N
Materials and Methods
[0077] Detailed Results, Sequences and Solutions.
[0078] Tables 1-3 lists all experiments, including detailed conditions and results of experimental replicates. Table 5-14 lists sequences of linear donors, plasmids, PCR primers and cr/sgRNAs, respectively. Position of the cr/sgRNAs on the loci targeted in this study can be found in
[0079] Repair Templates, Cas9, Cr/tracrRNAs and Plasmids for Cell Culture.
[0080] ssODNs (ultramers) and PCR primers where ordered from IDT and reconstituted at 50 M and 100 M respectively in water. For the Illumina sequencing experiment shown in
[0081] Cas9 protein was purified as described in Paix et al., 201(1) G
[0082] Cas9 RNP Nucleofection.
[0083] With the exception of experiments at the PYM1 locus (see below), all experiments in this study used Cas9 RNP delivery (DeWitt et al., 121-122 M
[0084] The Cas9 RNP mix contains: 6.5 M of crRNA and tracrRNA, 9.8 M of Cas9 (1.6 g/l), a variable concentration of repair templates (see Tables 1-3 for details), 10.4% Glycerol, 131 mM KCl, 5.2 mM Hepes, 1 mM MgCl2, 0.5 mM Tris-HCl, pH7.5.
[0085] For sequencing of GFP edits at the Lamin A/C locus, cells were sorted (at the JHU Ross Flow Cytometry Core Facility) for GFP signal and cloned in 96 wells plates for genotyping or pooled in a 6-well plate for microscopy analysis. Single cell clones were lysed using QuickExtract DNA Extraction Solution (Epicentre) and genotyped by PCR using Phusion taq (NEB) with genomic primers outside of the HDR fragment. PCR products were analyzed on agarose gel and sequenced (see
[0086] Cas9 Plasmid Transfections.
[0087] For experiments at the PYM1 locus, Cas9 and the sgRNA were delivered on plasmids. HEK293T cells were grown to 50-75% confluency in 6 wells plate (with 2 ml of culture media per wells). 10.8 l of Cas9 plasmid mix (containing 3.6 l of X-tremeGENE 9 DNA Transfection Reagent from Roche, 892 ng of plasmid pX458 containing PYM1 sgRNA and 3.24 pmol of repair template) was added to 120l of optiMEM glutaMAX media (ThermoFisher), incubated for 15 min at room temperature, and next added to the cells. 48 h after transfection, cells were sorted for GFP signal (to select for cells that received pX458) and grown out as single cell clones. The single cell clones were lysed and genotyped by PCR. PCR products were directly analyzed on agarose gel or mix with EcoR1 (NEB) and the corresponding Restriction Enzyme (RE) buffer, digested over-night and analyzed on agarose gel.
[0088] Cytometer Analysis.
[0089] For each experiment, 5000 to 10000 cells were analyzed using a Guava EasyCyte 6/2L (Millipore) cytometer. Cells were scored as GFP+ if they exhibited a higher signal than 99.5% of non-transfected control cells. HEK293T (GFP1-10) cells exhibit a higher basal green fluorescence than wild-type HEK293T cells. Cytometer analysis could not be performed on these cells for GFP11-tagged Lamin A/C and SMC3. For those experiments, as well as for RFP tagging, cells were analyzed by fluorescence microscopy and scored manually.
[0090] Microscopy.
[0091] Cells were fixed in 4% PFA and mounted with DAPI. Cells were imaged using a confocal microscope with a 63 objective. >50 fields of cells (>1000 cells) were selected in the DAPI channel, photographed, and analyzed for GFP or RFP expression manually.
[0092] PCR Amplicons for Illumina Sequencing.
[0093] HEK293T (GFP1-10) were nucleofected with different combinations of repair ssODNs (
[0094] PCR amplicons were purified on a 10% non-denaturing TBE/PAGE gel and the band corresponding to the PCR product was cut from the gel, eluted over-night, and precipitated with isopropanol. After resuspension, sample concentrations were quantified on a bioanalyzer, and the barcoded samples were pooled to a concentration of 0.4 M per sample in 10p. This sample was submitted to the Johns Hopkins School of Medicine Genetics Resources Core Facility for 250 cycle paired-end sequencing on an Illumina MiSeq instrument.
[0095] Illumina Sequencing Analysis.
[0096] After de-multiplexing of barcoded samples, the 3 adaptor and all downstream nucleotides were trimmed from the forward reads using Cutadapt (Martin, M., 17(1) EMB
[0097] Cas9 RNP Injection in Mouse Zygotes.
[0098] All mouse experiments were carried out under protocols approved by the JHU animal care and use committee. The PCR fragment donor was synthesized as described in Paix et al. (2017). The plasmid donor was generated using a gBlock and restriction enzyme cloning, and purified by Qiagen midi-prep kit and eluted in injection buffer (10 mM Tris-HCl, pH 7.5, 0.1 mM EDTA). Pronuclear injections of zygotes (from B6SJLF1/J parents (Jackson labs)) was performed by the JHU Transgenic facility at a final concentration: 30 ng/l Cas9 protein (PNABio), 0.6 M each of crRNA/TracrRNA (Dharmacon) and PCR donor (3 ng/l or 5 ng/l) or plasmid donor (10 ng/l). The Cas9 protein, crRNA, tracrRNA were combined from stocks at 1000 ng/l, 20 M, 20 M respectively and incubated at 4 C. for 10 minutes. Then injection buffer was added to dilute to the final working concentrations above (Tables 1-3) along with repair vector or fragment. The solution was microcentrifuged 5 min at 13000g and the solution used for injection. Pups were genotyped using genomic primers immediately outside of the PCR donor sequence, or using one primer in mCherry and one upstream of the 483 bp homology arms in the case of the plasmid donor. Genomic DNA from all pups was also subjected to PCR amplification with internal mCherry specific primers to identify random insertions of the donor template (locus-specific mCherry negative/internal mCherry product positive).
[0099] We identified 7 pups (11%, out of 60 pups without mCherry insertion at the Adcy3 locus) with potential transgenic insertions of the PCR fragment at other undetermined loci. In contrast, we identified no transgenics (0%, out of 20 pups without mCherry insertion at the Adcy3 locus) when using the plasmid donor.
Results
[0100] mCherry-Tagging of a Mouse Locus Using a PCR Donor with Short Homology Arms.
[0101] In mammalian systems, ssODNs and plasmids are most commonly used as donors for genome editing (Danner et al., 28(708) M
[0102] GFP-Tagging of Human Loci Using PCR Donors with Short Homology Arms.
[0103] To determine whether PCR fragments can also function for genome editing in human cells, we attempted to knock-in GFP at three loci in HEK293T cells. We designed the HS to insert GFP 0, 11 and 5 bp away from a Cas9 cleavage site in the Lamin A/C, RAB11A, and SMC3 ORFs, respectively (
[0104] Reducing the molarity of the PCR fragments by 10-fold reduced efficiency by (Compare
[0105] Editing Efficiency is Sensitive to Insert Size.
[0106] To test the effect of insert size on editing efficiency, we added varied sizes of DNA sequence to the GFP insert. For ease of synthesis and to maintain equimolar amounts of donor DNAs, we introduced donor fragments at the same low molarity (0.12 M). We found that inserts beyond 1 kb performed very poorly, yielding less than 0.5% edits (
[0107] To test whether decreasing insert size below the size of GFP would increase editing efficiency, we took advantage of the split-GFP system (Kamiyama et al. (2016; Leonetti et al. (2016)). In this system, the 11.sup.th beta-strand of GFP (57 bp, GFP11) is knocked-in in cells expressing a complementary GFP fragment (GFP1-10). We generated PCR products containing the GFP11 insert and 35 bp HS and introduced these at 0.33 M. We obtain 45.4% edits at the Lamin A/C locus (
[0108] Accuracy of Repair is Asymmetric.
[0109] To investigate the accuracy of repair with PCR fragments, we isolated GFP+ and GFP cells by fluorescence-activated cell sorting from a single editing experiment targeting the Lamin A/C locus with a GFP-containing PCR fragment under optimal conditions (
[0110] In total, we sequenced 13 imprecise GFP edits and found only one internal deletion and one insertion in the wrong orientation (
[0111] Repair is a Polarity-Sensitive Process.
[0112] In the SDSA model, initiation and resolution of repair proceeds via distinct steps. First, the DSB is resected to yield 3 overhangs on both sides of the DSB (
[0113] Polarity of Single-Stranded Donors Affects Incorporation of Distal Edits.
[0114] We wondered whether the different requirements for homology on the 3 and 5 ends of single-stranded donors might also apply to donors that contain two HS at different distances from the DSB. Such HS are found in donors designed to insert an edit at a distance from the DSB. In these donors, one HS (proximal HS) matches sequences immediately next to the DSB and the other HS (recessed HS) matches sequences at a distance from the DSB on the distal side of the edit (
[0115] Recoding of Sequences Between the DSB and the Edit Increases Recovery of Distal Edits.
[0116] Editing efficiency has been observed to decrease with increasing distance between the edit and the DSB (Paquet et al., 533(7601) N
[0117] To test whether internal homologies can also participate in the repair process when using double-stranded donors, we performed a similar experiment with a PCR fragment designed to incorporate GFP11 at the DSB, and tagRFP 33 bases from the DSB in the Lamin A/C locus (
[0118] Repair is Prone to Template Switching Between Donors.
[0119] Another characteristic of SDSA first observed in yeast is the ability of the repair process to undergo sequential rounds of invasion and synthesis (29, 30). Template switching can create edits that combine sequences from overlapping donors (14). To test whether template switching also occurs in human cells, we used two donors to correct a single DSB. The first donor was an ssODN with two HS and a GFP11-coding insert containing a STOP codon to prevent translation of the full-length fusion (
[0120] To visualize template switching more directly, we combined wild-type donors with recoded donors where the GFP11 insert contained several silent mutations and used Illumina sequencing to sequence the insertional edits en masse (
Discussion
[0121] In this report, we demonstrate that PCR fragments are efficient donors for genome editing in mouse embryos and human cells. PCR fragments with short homology arms (HS 35 bp) can be used to integrate edits up to 1 kb, long enough to encode fluorescent reporters such as GFP. Experiments using single and double-stranded DNAs suggest that linear donors participate in a replicative repair mechanism that broadly conforms to the SDSA model for gene conversion. Our findings suggest simple guidelines to streamline donor design and maximize editing efficiency (
[0122] Linear DNAs Repair Cas9-Induced DSBs by Templating Repair Synthesis.
[0123] In principle, linear donors could repair Cas9-induced breaks by integrating directly at the DSB. For example, microhomology-mediated end-joining (MMEJ) could cause donor ends to become ligated to each side of the DSB (Yao et al., 20 EB
[0124] If partial edits are due to premature withdrawal of the newly replicated strand from the donor, partial edits should be less frequent when using donors with shorter inserts. Consistent with this prediction, we found that editing efficiency is inversely proportional to insert size. At the Lamin A/C locus, we obtained 45.4% edits for a 57 bp insert, 23.5% edits for 714 bp insert (GFP) and 17.9% edits for a 993 bp insert. The size of the insert, and not the overall size of the donor, correlated with efficiency, arguing against the possibility that breakage of longer donors contributes to reduced efficiency (
[0125] We also obtained evidence for dissociation and invasion events between donors. Such template switching was also observed in yeast and C. elegans and can cause sequences from overlapping donors to become incorporated in the same edit (Anand et al. (2014); Tspaonina et al. (2014)) (Paix et al., 44(15) N
[0126] SDSA as a Repair Mechanism for Cas9-Induced DSBs: Implications for Genome Editing.
[0127] The demonstration that ssODNs and PCR fragments engage in a SDSA-like mechanism to repair Cas9-induced DSBs has two important implications for genome editing. First, the SDSA model makes simple predictions for optimal donor design (
TABLE-US-00001 TABLE1 Detailedexperimentalconditionsandresults crRNA/sgRNA polarity Repairtemplateleft/ crRNA/ (relativeto righthomology)arms sgRNA genecoding Repairtemplate (nucleotides/ Edit Celltype Cas9delivery name) sequence) type/name basepairs) FIG.1 Pronuclear RNP/ crAdcy3 AS PlasmidpBS- 483/421 injectionof Injection AC3CtermGenomic- mouseembryos mCherry
Pronuclear RNP/ crAdcy3 AS PCR1596/1597 36/36 injectionof Injection mouseembryos FIGS.2/10/11
HEK293T RNP/ crRNA1629 S PCR1630/832 0/0 Nucleofection
HEK293T RNP/ crRNA1629 S PCR1685/1686or 16/16 Nucleofection PCR1858/1859
HEK293T RNP/ crRNA1629 S PCR1618/1619or 33/33 Nucleofection PCR1743/1744
HEK293T RNP/ crRNA1629 S PCR1743/1744 33/33 Nucleofection
HEK293T RNP/ crRNA1629 S PCR1741/1742 518/518 Nucleofection
HEK293T RNP/ crRNA1629 S Plasmid1716 518/518 Nucleofection
HEK293T RNP/ crRNA1648 AS PCR1630/832 0/0 Nucleofection
HEK293T RNP/ crRNA1648 AS PCR1838/1839 15/15 Nucleofection
HEK293T RNP/ crRNA1348 AS PCR1840/1841or 33/33 Nucleofection PCR1652/1653
HEK293T RNP/ crRNA1648 AS PCR1840/1841 33/33 Nucleofection
HEK293T RNP/ crRNA1648 AS PCR1846/1847 461/432 Nucleofection
HEK293T RNP/ crRNA1648 AS Plasmid1791 461/432 Nucleofection
HEK293T RNP/ crRNA1629 S PCR1652/1653 Nohomologyarm,PCR Nucleofection contatiningeGFPwith RAB11Ahomologyarms usedwithLaminA/C crRNA
HEK293T RNP/ crRNA1553 AS PCR1630/832 0/0 Nucleofection
HEK293T RNP/ crRNA1553 AS PCR1604/1605 16/17 Nucleofection
HEK293T RNP/ crRNA1553 AS PCR1554/1555 37/38 Nucleofection FIGS.3/11/14
HEK293T RNP/ crRNA1629 S PCR2005/2006 33/33 Nucleofection
HEK293T RNP/ crRNA1629 S PCR2005/2015 33/33 Nucleofection
HEK293T RNP/ crRNA1629 S PCR2049/1619 33/33 Nucleofection
HEK293T RNP/ crRNA1629 S PCR1618/1619 33/33 Nucleofection
HEK293T RNP/ crRNA1629 S PCR2058/2059 237/237 Nucleofection
HEK293T RNP/ crRNA1629 S PCR2049/1619 33/33 (GFP1-10) Nucleofection
HEK293T RNP/ crRNA1629 S PCR1618/1619 33/33 (GFP1-10) Nucleofection
HEK293T RNP/ crRNA1629 S PCR2051/2052 33/32 (GFP1-10) Nucleofection
HEK293T RNP/ crRNA1629 S PCR2003/2004 33/32 (GFP1-10) Nucleofection
HEK293T RNP/ crRNA1648 AS PCR1840/1841or 33/33 (GFP1-10) Nucleofection PCR1652/1653
HEK293T RNP/ crRNA1648 AS PCR2008/2009 33/33 (GFP1-10) Nucleofection
HEK293T RNP/ crRNA1777 S PCR2055/2054 33/34 (GFP1-10) Nucleofection
HEK293T RNP/ crRNA1777 S PCR2053/2054 33/34 (GFP1-10) Nucleofection
HEK293T RNP/ crRNA1777 S PCR2003/2004 Nohomologyarm,PCR (GFP1-10) Nucleofection contatiningGFP11with LaminA/Chomologyarms usedwithRAB11AcrRNA Repairtemplate polarity Distancebetween (relativeto DSBandedit Repairtemplate Insertsize genecoding (bp,relativeto concentration (nucleotides/ Edit sequence) DSB) (MforRNP/Nucleofection) basepairs) FIG.1
dsDNAcircular between4/+2 3.5nMofplasmidrepair 739 templateand30ng/lCas9 proteinand0.6Mof crRNA/TracrRNAininjection buffer
dsDNA between4/+2 6nMor10nMofPCRrepair 739 templateand30ng/lCas9 proteinand0.6Mof FIGS.2/10/11
dsDNA 0 0.33 714
dsDNA 0 0.33 714
dsDNA 0 0.33 714
dsDNA 0 0.03 714
dsDNA 0 0.03 714
dsDNA 0 0.03 714
dsDNA 11(recoded) 0.21 714
dsDNA 11(recoded) 0.21 714
dsDNA 11(recoded) 0.21 714
dsDNA 11(recoded) 0.02 714
dsDNA 11(recoded) 0.02 714
dsDNAcircular 11(recoded) 0.02 714
dsDNA 0 0.21 714
dsDNA +5(recoded) 0.33 714
dsDNA +5(recoded) 0.33 714
dsDNA +5(recoded) 0.33 714 FIGS.3/11/14
dsDNA 0 0.12 2229
dsDNA 0 0.12 1122
dsDNA 0 0.12 993
dsDNA 0 0.12 714
dsDNA 0 0.12 714
dsDNA 0 0.33 993
dsDNA 0 0.33 714
dsDNA 0 0.33 336
dsDNA 0 0.33 57
dsDNA 11(recoded) 0.21 714
dsDNA 11(recoded) 0.21 57
dsDNA 2 0.33 57
dsDNA 32(recoded) 0.33 57
dsDNA 0 0.33 57 Efficiency(%)(forcytometeranalysis:non-nucleofectedcells<0.5%)(SD: Edit StandardDeviation) FIG.1
PCRonmousetailDNA(predictedsizeshift):20.0(5/25).0%(0/20)ofthe negativeforAdcy3mCherryinsertionarepositivebyPCRusingmCherryinternal primers
PCRonmousetailDNA(predictedsizeshift):31.0(27/87,including3 homozygous),10outof10HetclonesformCherryhaveperfectinsertion,11.6% (7/60)ofthenegativeforAdcy3mCherryinsertionarepositivebyPCRusing mCherryinternalprimers FIGS.2/10/11
Cytometer:1.5(averageofindependentexperiments:1.3+1.7)(n=2,SD=0.3)
Cytometer:7.2(averageofindependentexperiments:4.9+9.4)(n=2,SD=3.2)
Cytometer:14.9(averageofindependentexperiments:19.2+14.9+13.2+15.8+ 16.3+12.5+14.9+12.7+11.8+18.1)(n=10,SD=2.5)Sequencingresultscanbe foundinFIG.S4andS5
Cytometer:7.8(averageofindependentexperiments:7.5+8)(n=2,SD=0.4)
Cytometer:8.9(averageofindependentexperiments:9.4+8.4)(n=2,SD=0.7)
Cytometer:1.0(aver,ofindependentexperiments:1.1+1.2+0.8)(n=23, SD=0.2)
Cytometer:1.2(averageofindependentexperiments:1.3+1.1)(n=2,SD=0.1)
Cytometer:9.9(averageofindependentexperiments:11.0+8.7)(n=2,SD=1.6)
Cytometer:17.5(averageofindependentexperiments:17.5+22.5+12.6)(n=3, SD=5.0)
Cytometer:7.6(averageofindependentexperiments:7.6+7.5)(n=2,SD=0.1)
Cytometer:5.3(averageofindependentexperiments:6.4+4.1)(n=2,SD=1.6)
Cytometer:2.7(averageofindependentexperiments:3.7+1.7)(n=2,SD=1.4)
Microscope:noLaminA/CorRAB11AeGFPsignal
Cytometer:1.4(averageofindependentexperiments:1.9+0.9)(n=2,SD=0.7)
Cytometer:12.0(averageofindependentexperiments:10.1+13.9)(n=2,SD=2.7)
Cytometer:14.0(averageofindependentexperiments:14.4+15.5+11.2+9.0+ 13.9+18l4+16.4+13.1)(n=8,SD=3.0) FIGS.3/11/14
Microscope:<0.1(averageofindependentexperiments:D(Q/1406,butfew positivescanbefoundwhentheentiremicroscopeslidewasexamined)+<0.1 (1/1529))(n=2,SD=0.3)
Microscope:0.3(averageofindependentexperiments:0.5(7/1350)+<0.1 (1/1674))(n=2,SD=0.3)
Microscope:3.1(averageofindependentexperiments:3.2+3.0)(n=2,SD=0.1)
Microscope:9.8(averageofindependentexperiments:11.4+8.1)(n=2,SD=2.3)
Microscope:8.5(averageofindependentexperiments:10.0+6.9)(n=2,SD=2.2)
Microscope:17.9(averageofindependentexperiments:20.4+15.3)(n=2, SD=3.6)
Microscope:23.5(averageofindependentexperiments:31.2+25.7+22.2+ 14.9)(n=4,SD=6.8)
Microscope:30.5(averageofindependentexperiments:29.0+32.0)(n=2, SD=2.1)
Microscope:45.4(averageofindependentexperiments:44.9+45.8)(n=2, SD=0.6)
Cytometer:12.8(averageofindependentexperiments:14.1+11.5)(n=2,SD=1.8)
Cytometer:32.8(averageofindependentexperiments:37.4+28.2)(n=2,SD=6.5)
Cytometer:50.0
Cytometer:20.7
Cytometer:1.3
indicates data missing or illegible when filed
TABLE-US-00002 TABLE2 Detailedexperimentalconditionsandresults Repair Distance template Repair between crRNA/sgRNA left/right template DSNand polarity homology polarity edit (relativeto Repair arms (relativeto (bp, genecoding template (nucleotides/ genecoding relative Edit Celltype Cas9delivery name sequence) type/name basepairs) sequence) toDSB) FIG.4 HEK293T RNP/ crRNA S ssODN1788 33/0 S 0 (GFP1-10) Nucleofection 1629
HEK293T RNP/ crRNA S ssODN1789 33/0 AS 0 (GFP1-10) Nucleofection 1629
HEK293T RNP/ crRNA S ssODN1705 0/32 S 0 (GFP1-10) Nucleofection 1629
HEK293T RNP/ crRNA S ssODN1706 0/32 AS 0 (GFP1-10) Nucleofection 1629
HEK293T RNP/ crRNA AS ssODN1816 33/0 S +1 (GFP1-10) Nucleofection 1648
HEK293T RNP/ crRNA AS ssODN1817 33/0 AS +1 (GFP1-10) Nucleofection 1648
HEK293T RNP/ crRNA AS ssODN1819 0/33 S 2 (GFP1-10) Nucleofection 1648
HEK293T RNP/ crRNA AS ssODN1820 0/33 AS 2 (GFP1-10) Nucleofection 1648 Repair Efficienty[%] template (forcytometer concentration analysis: (Mfor Insert non-nucleofected RNP/ size cells<0.5%) Nucleo- (nucleotides/ (SD:Standard Edit fection) basepairs) Deviation) FIG.4
5 126 Microscope:0.2
5 126 Microscope:20.8
5 126 Microscope:9.5
5 126 Microscope:0.4
5 126 Cytometer:1.6
5 126 Cytometer:21.9
5 126 Cytometer:15.4
5 126 Cytometer:1.1
indicates data missing or illegible when filed
TABLE-US-00003 TABLE3 Detailedexperimentalconditionsandresults Repair crRNA/sgRNA template polarity left/right crRNA/ (relativeto Repair homologyarms sgRNA genecoding template (nucleotides/ Edit Celltype Cas9delivery name sequence) type/name basepairs) FIGS5/14 HEK293T RNP/Nucleofection crRNA1629 S ssODN1620 33/32 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1629 S ssODN1732 33/32 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1629 S ssODN1678 38/35 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1629 S ssODN1679 38/35 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1629 S ssODN1793 33/33 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1629 S ssODN1794 33/33 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1629 S ssODN1795 33/33 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1629 S ssODN1796 33/33 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1729 AS ssODN1736 38/33 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1729 AS ssODN1737 38/33 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1728 S ssODN1736 38/34 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1728 S ssODN1737 38/34 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1648 AS ssODN1778 33/33 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1648 AS ssODN1779 33/33 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1648 AS ssODN1864 33/33 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1688 AS ssODN1865 33/33 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1648 AS ssODN1831 33/34 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1648 AS ssODN1832 33/34 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1777 S ssODN1782 33/34 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1777 S ssODN1783 33/34 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1777 S ssODN1833 33/34 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1777 S ssODN1834 33/34 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1777 S ssODN1827 33/34 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1777 S ssODN1828 33/34 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1776 AS ssODN1782 33/34 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1776 AS ssODN1783 33/34 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1776 AS ssODN1833 33/34 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1776 AS ssODN1834 33/34 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1776 AS ssODN1827 33/34 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1776 AS ssODN1828 33/34 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1910 AS ssODN1911 35/32 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1910 AS ssODN1912 35/32 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1910 AS ssODN1924 35/35 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1910 AS ssODN1925 35/35 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1909 S ssODN1911 33/35 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1909 S ssODN1912 33/35 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1909 S ssODN1922 35/35 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1909 S ssODN1923 35/35 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1748 AS ssODN1751 34/38 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1748 AS ssODN1752 34/38 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1747 S ssODN1753 34/38 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1747 S ssODN1754 34/38 (GFP1-10)
HEK293T Plasmid(CAS9:: sgPYM1 S ssODN1583 38/40 T2A::GFP;GFP sorting/ Transfection
HEK293T Plasmid(CAS9:: sgPYM1 S ssODN1584 38/40 T2A::GFP;GFP sorting/ Transfection
HEK293T RNP/Nucleofection crRNA1729 AS ssODN1734 33/33 (GFP1-10)
HEK293T RNP/Nucleofection crRNA1728 S ssODN1734 32/33 (GFP1-10) Distance Efficiency[%] Repair between (forcytometer template DSB Insert analysis:non- polarity andedit Repairtemplate size nucleofected (relativeto (bp, concentration (nucleo- cells<0.5%) genecoding relative (MforRNP/ tides/ (SD: Edit sequence toDSB) Nucleofection) basepairs) StandardDeviation FIGS.5/14
S 0 5 57 Microscope:11.6
AS 0 5 57 Microscope:11.8
S 12(recoded) 5 57 Microscope:17.9
AS 12(recoded) 5 57 Microscope:11.2
S +12(recoded) 5 57 Microscope:9.7
AS +12(recoded) 5 57 Microscope:11.9
S +33(recoded) 5 57 Microscope:6.3
AS +33(recoded) 5 57 Microscope:12.4
S 32(recoded) 5 57 Microscope:1.1
AS 32(recoded) 5 57 Microscope:0.2
S 31(recoded) 5 57 Microscope:12.8
AS 31(recoded) 5 57 Microscope:1.3
S 11(recoded) 5 57 Cytometer:14.2
AS 11(recoded) 5 57 Cytometer:9.7
S 2 5 57 Cytometer:25.7
AS 2 5 57 Cytometer:32.1
S +19(recoded) 5 57 Cytometer:2.5
AS +19(recoded) 5 57 Cytometer:14.3
S 32(recoded) 5 57 Cytometer:14.6
(experimentalreplicate:14.4)
AS 32(recoded) 5 57 Cytometer:2.3
(experimentalreplicate:1.3)
S 17(recoded) 5 57 Cytometer:31.0
AS 17(recoded) 5 57 Cytometer:7.3
S 2 5 57 Cytometer:36.6
AS 2 5 57 Cytometer:31.5
S 32(recoded) 5 57 Cytometer:10.5
(experimentalreplicate:10.1)
AS 32(recoded) 5 57 Cytometer:1.3
(experimentalreplicate:1.4)
S 17(recoded) 5 57 Cytometer:21.6
AS 17(recoded) 5 57 Cytometer:3.9
S 2 5 57 Cytometer:22.6
AS 2 5 57 Cytometer:24.6
S 3 5 57 Cytometer:26.5
AS 3 5 57 Cytometer:31.5
S +21(recoded) 5 57 Cytometer:1.0
AS +21(recoded) 5 57 Cytometer:5.5
S +2 5 57 Cytometer:15.5
AS +2 5 57 Cytometer:18.8
S +26(recoded) 5 57 Cytometer:0.7
AS +26(recoded) 5 57 Cytometer:3.4
S +29(recoded) 5 57 Microscope:1.8
AS +29(recoded) 5 57 Microscope:11.0
S +41(recoded) 5 57 Microscope:0.4
AS +41(recoded) 5 57 Microscope:10.3
S +1and+25 3.24pmolof 6and66 PCRsinglecellcolonies (recoded) ssODNand892ng (predictedsizeshiftand ofCAS9plasmid RestrictionEnzymedigest): inamixof10.8 3.7(2/58)forREand3xFlag lcontaining insertion,0(0/58)for3xFlag 3.6lofX- insertionalone,0(0/58)for tremeGENE9 REinsertionalone
AS +1and+25 3.24pmolof 6and66 PCRsinglecellcolonies (recoded) ssODNand892ng (predictedsizeshiftand ofCAS9plasmid RestrictionEnzymedigest): inamixof10.8 44.6(21/47)including7 lcontaining homozygous)forREand3xFlag 3.6lofX- insertion,0(0/47)for3xFlag tremeGENE9 insertionalong,10.6(5/47) forREinsertionalone
S +1 5 57 Microscope:2.0
S +2 5 57 Microscope:22.8
indicates data missing or illegible when filed
TABLE-US-00004 TABLE4 Detailedexperimentalconditionsandresults crRNA/ sgRNA polarity relative to crRNA/ gene Repairtemplateleft/right sgRNA coding Repairtemplate homologyarms Edit Celltype Cas9delivery name sequence) type/name (nucleotides/basepairs) FIGS.5/6/14 HEK293T Plasmid(CAS9::T2A:: sgPYM1 S ssODN1582 38/37 GFP;GFPsorting)/ Transfection
HEK293T Plasmid(CAS9::T2A:: sgPYM1 S ssODN1580 38/37 GFP;GFPsorting)/ Transfection
HEK293T Plasmid(CAS9::T2A:: sgPYM1 S ssODN1581 38/37 GFP;GFPsorting)/ Transfection
HEK293T Plasmid(CAS9::T2A:: sgPYM1 S ssODN1518 46/43 GFP;GFPsorting)/ Transfection
HEK293T RNP/Nucleofection crRNA S PCR1948/1949 33/33 (GFP1-10) 1629 (onplasmid1892)
HEK293T RNP/Nucleofection crRNA S PCR1948/1949 33/33 (GFP1-10) 1629 (onplasmid1893) FIGS.7/15
HEK293T RNP/Nucleofection crRNA AS ssODN1957and 33/33andnon-applicable (GFP1-10) 1648 ssODN1379 (unrelated)
HEK293T RNP/Nucleofection crRNA AS ssODN1955and nohomologyarmbut65/63 (GFP1-10) 1648 ssODN1379 homologoussequencesflankingthe (unrelated) STOP(/frameshift)inGFP11and non-applicable
HEK293T RNP/Nucleofection crRNA AS ssODN1957and 33/33andnohomologyarmbut (GFP1-10) 1648 ssODN1955 65/63homologoussequences flankingtheSTOP/frameshift
HEK293T RNP/Nucleofection crRNA AS ssODN1954and 33/33andnon-applicable (GFP1-10) 1648 ssODN1379 (unrelated)
HEK293T RNP/Nucleofection crRNA AS ssODN1956and 33/33andnon-applicable (GFP1-10) 1648 ssODN1379 (unrelated)
HEK293T RNP/Nucleofection crRNA AS ssODN1956and 33/33andnohomologyarmbut (GFP1-10) 1648 ssODN1955 65/63homologoussequences flankingtheSTOP
HEK293T RNP/Nucleofection crRNA AS PCR2083/2084 33/33andnon-applicable (GFP1-10) 1648 (onssODN1957)+ PCR2090/2091 (onssODN1379, unrelated)
HEK293T RNP/Nucleofection crRNA AS PCR2086/2087 nohomologyarmbut65/63 (GFP1-10) 1648 (onssODN1955)+ homologoussequencesflankingthe PCR2090/2091 STOP(/frameshift)inGFP11and (onssODN1379, non-applicable unrelated)
HEK293T RNP/Nucleofection crRNA AS PCR2083/2084 33/33andnohomologyarmbut (GFP1-10) 1648 (onssODN1957)+ 65/63homologoussequences PCR2086/2087 flankingtheSTOP/frameshift (onssODN1955)
HEK293T RNP/Nucleofection crRNA AS PCR2083/2084 33/33andnon-applicable (GFP1-10) 1648 (onssODN1954)+ PCR2090/2091 (onssODN1379, unrelated)
HEK293T RNP/Nucleofection crRNA S ssODN1799 33/32 (GFP1-10) 1629
HEK293T RNP/Nucleofection crRNA S ssODN1835 33/32 (GFP1-10) 1629
HEK293T RNP/Nucleofection crRNA S ssODN1799and 33/32andssODNwithouthomology (GFP1-10) 1629 ssODN1813 armandnomutations
HEK293T RNP/Nucleofection crRNA S ssODN1799and 33/32andssODNwithouthomology (GFP1-10) 1629 ssODN1804 armand1mutationevery3nt
HEK293T RNP/Nucleofection crRNA S ssODN1799and 33/32andssODNwithouthomology (GFP1-10) 1629 ssODN1805 armand1mutationevery6nt
HEK293T RNP/Nucleofection crRNA S ssODN1799and 33/32andssODNwithouthomology (GFP1-10) 1629 ssODN1806 armand1mutationevery12nt
HEK293T RNP/Nucleofection crRNA AS ssODN1799 Nohomologyarm,ssODNcontaining (GFP1-10) 1648 extra-sequence::GFP11-Mycwithout mutation::extra-sequencewith LaminA/Chomologyarmsusedwith RAB11AcrRNA Distance Repair between template DSBand polarity edit(bp, (relativeto relative Insertsize genecoding to Repairtemplateconcentration (nucleotides/ Edit sequence) DSB) (MforRNP/Nucleofection) basepairs) FIGS.5/6/14
S +1and23 3.24pmolofssODNand892ngof 6and66 Cas9plasmidinamixof10.8l containing3.6lofX-tremeGENE 9
S +1and23 3.24pmolofssODNand892ngof 6and66 (recoded) Cas9plasmidinamixof10.8l containing3.6lofX-tremeGENE 9
AS +1and23 3.24pmolofssODNand892ngof 6and66 (recoded) Cas9plasmidinamixof10.8l containing3.6lofX-tremeGENE 9
S +1 3.24pmolofssODNand892ngof 66 Cas9plasmidinamixof10.8l containing3.6lofX-tremeGENE 9
dsDNA 0and+33 0.23 57and708
dsDNA 0and+33 0.23 57and708 (recoded) FIGS.7/15
Sandnon- 0 5and5 133 applicable
Sandnon- 0 5and5 129 applicable
SandS 0 5and5 133and129
Sandnon- 0 5and5 132 applicable
Sandnon- 0 5and5 132 applicable
SandS 0 5and5 133and129
dsDNAand 0 0.33and0.33 133 dsDNA
dsDNAand 0 0.33and0.33 129 dsDNA
dsDNAand 0 0.33and0.33 133and129 dsDNA
dsDNAand 0 0.33and0.33 132 dsDNA
S 0 5 132
S 0 5 132
S 0 5and5 132
S 0 5and5 132
S 0 5and5 132
S 0 5and5 132
S 0 5 132 Efficiency(%)(forcytometeranalysis:non-nucleofectedcells<0.5%)(SD: Edit StandardDeviation) FIGS.5/6/14
PCRsinglecellcolonies(predictedsizeshiftandRestrictionEnzymedigest):12.6 (8/63)forREand3xFlaginsertion,0(0/63)for3xFlaginsertionalone,46.0(29/63) forREinsertionalone.Totaledits:58.7(37/63)
PCRsinglecellcolonies(predictedsizeshiftandRestrictionEnzymedigest):25.8 (15/58,including3homozygous)forREand3xFlaginsertion,1.7(1/58)for3xFlag insertionalone,10.3(6/58)forREinsertionalone.Totaledits:37.9(22/58)
PCRsinglecellcolonies(predictedsizeshiftandRestrictionEnzymedigest):3.2 (2/61)forREand3xFlaginsertion,0(0/61)for3xFlaginsertionalone,0(0/61)for REinsertionalone.Totaledits:3.2(2/61)
PCRsinglecellcolonies(predictedsizeshift):53.5(30/56,including7 homozygous)
Microscope:8.6(160/1842)forGFP11andtagRFPinsertion,0.7(13/1842)for tagRFPinsertionalone,10.8(199/1842)forGFP11insertionalone.Totaledits: 20.1(372/1842)
Microscope:17.6(288/1629)forGFP11andtagRFPinsertion,0.1(3/1629)for tagRFPinsertionalone,4.4(73/1629)forGFP11insertionalone.Totaledits:22.3 (364/1629) FIGS.7/15
Cytometer:0.3(averageofindependentexperiments:0.3+0.3)(n=2,SD=0)
Cytometer:0.4(averageofindependentexperiments:0.6+0.2)(n=2,SD=0.3)
Cytometer:3.2(averageofindependentexperiments:4.0+2.4)(n=2,SD=1.1) (GFPpositivecellsconfirmedbymicroscopy)
Cytometer:17.2(averageofindependentexperiments:19.6+14.7)(n=2,SD=3.5)
Cytometer:0.6
Cytometer:4.1
Cytometer:0.5(averageofindependentexperiments:0.4+0.5)(n=2,SD=0.1)
Cytometer:0.6(averageofindependentexperiments:0.5+0.7)(n=2,SD=0.1)
Cytometer:2.3(averageofindependentexperiments:2.2+2.3)(n=2,SD=0.1) (GFPpositivecellsconfirmedbymicroscopy)
Cytometer:21.7(averageofindependentexperiments:19.7+23.6)(n=2,SD=2.8)
DNAwasextractedfromcellseditedwithssODN1799orssODN1835.Nextthe
DNAfromtheseexperimentswasmixedandtheinsertionwasPCRamplifiedand sequencedbyIlluminatechnology(Barcode10).Thiscontrolwasperformedto ensurethattemplateswitchingdoesnotoccurduringPCRamplication(PCRcontrol inFIG.15andTable15)
PCRampliconsequencingusingIlluminatechnology(Barcode5,Nomutation). SeeFIG.15andTable15
PCRampliconsequencingusingIlluminatechnology(Barcode6,1/3mutations). SeeFIG.15andTable15
PCRampliconsequencingusingIlluminatechnology(Barcode7,1/6mutations). SeeFIG.15andTable15
PCRampliconsequencingusingIlluminatechnology(Barcode8,1/12mutations). SeeFIG.15andTable15
NoPCRpositivesignalwasdetected(Barcode9)
indicates data missing or illegible when filed
TABLE-US-00005 TABLE5 Repairtemplatesusedinthisstudy crRNA/ Repair Gene sgRNA template Homologyarms targeted used codingfor (nucleotides/basepairs) non- non- unrelated non-applicable applicable applicable ssODN (142nt) non- non- unrelated non-applicable applicable applicable PCRdonor (142nt) 1629
33/32
1629
33/32
1629
33/32
1629
0/0
1629
16/16
1629
33/33
1629
237/237
1629
518/518
1629
518/518
1629
33/32
1629
33/33
1629
33/33
1629
33/33
1629
33/0
1629
33/0
1629
0/32
1629
0/32
1629
33/33
1629
33/33
1629
33/32
1629
33/32
1629
nohomologyarmbuthomologoussequencestothe
1629
insertnohomologyarmbut23/24homologous sequencestotheinsertoneachside,and homeologoussequence(1/3mutations)tothe insertinthemiddle
1629
nohomologyarmbut23/24homologoussequences totheinsertoneachside,andhomeologous sequence(1/6mutations)totheinsert inthemiddle
1629
nohomologyarmbut23/24homologoussequences totheinsertoneachside,andhomeologous sequence(1/12mutations)totheinsert inthemiddle Gene Distance Polarity(S=Sense; targeted fromDSB TypeandName AS=AntiSense) non- non- ssODN1379 non-applicable applicable applicable non- non- PCR2090/2091(onssODN1379) dsDNA applicable applicable
0 ssODN1620 S
0 ssODN1732 AS
0 PCR2003/2004(onssODN dsDNA 1620/1732)
0 PCR1630/832(onplasmid1698) dsDNA
0 PCR1685/1686(onplasmid1698) dsDNA or PCR1858/1589(onplasmid1716)
0 PCR1618/1619(onplasmid1698) dsDNA or PCR1743/1744(onplasmid1716)
0 PCR2058/2059(onplasmid1716) dsDNA
0 PCR1741/1742(onplasmid1716) dsDNA
0 Plasmid1716 dsDNAcircular
0 PCR2051/2052(onplasmid2050) dsDNA
0 PCR2049/1619(onplasmid2042) dsDNA
0 PCR2005/2015(onplasmid1894) dsDNA
0 PCR2005/2006(onplasmid1894) dsDNA
0 ssODN1788 S
0 ssODN1789 AS
0 ssODN1705 S
0 ssODN1706 AS
0and+33 PCR1948/1949(onplasmid1892) dsDNA
0and+33 PCR1948/1949(onplasmid1893) dsDNA (recoded)
0 ssODN1799 S
0 ssODN1835 S
non- ssODN1813 S applicable
non- ssODN1804 S applicable
non- ssODN1805 S applicable
non- ssODN1806 S applicable Gene targeted Sequence non- ggttcgggtggtgctccacgaggtggtatgcgcaagcacacagaatacaaaacgcgactttgtgatgcgttccgccgtg applicable aaggatactgcccgtacaacgacattgcacatatgctcacggacaagatgagctgagagttc non- asssODN1379butdsDNA applicable
asssODN1620butAS
asssODN1620butdsDNA
seeTable8
asssODN1788butAS
asssODN1705butAS
indicates data missing or illegible when filed
TABLE-US-00006 TABLE6 Repairtemplatesusedinthisstudy Polarity (S= Repair Sense; crRNA/ template Homologyarms AS= Gene sgRNA coding (nucleotides/ Distance Anti- targeted used for basepairs) fromDSB TypeandName Sense) Sequence 1629
38/35 12(recoded) ssODN1678 S
1629
38/35 12(recoded) ssODN1679 AS asssODN1678butAS
1629
33/33 +12(recoded) ssODN1793 S
1629
33/33 +12(recoded) ssODN1794 AS asssODN1793butAS
1629
33/33 +33(recoded) ssODN1795 S
1629
33/33 +33(recoded) ssODN1796 AS asssODN1795butAS
1729
38/33 32(recoded) ssODN1736 S
1729
38/33 32(recoded) ssODN1737 AS asssODN1736butAS
1729
33/33 +1 ssODN1734 S
1728
38/34 31(recoded) ssODN1736 S seecr172932bp insertion
1728
38/34 31(recoded) ssODN1737 AS seecr172932bp insertion
1728
32/33 +2 ssODN1734 S seecr172932bp insertion
1648
33/33 11(recoded) ssODN1778 S
1648
33/33 11(recoded) ssODN1779 AS asssODN1778butAS
1648
33/33 11(recoded) PCR2008/2009 dsDNA asssODN1778butdsDNA (onssODN seeLamina/cwitheGFP 1778/1779) PCRwithouthomologyarm
1648
0/0 11(recoded) PCR1630/832 dsDNA
(onplasmid 1698)
1648
15/15 11(recoded) PCR1838/1839 dsDNA
(onplasmid 1791)
1648
33/33 11(recoded) PCR1652/1653 dsDNA
(onplasmid 1698)or PCR1840/1841 (onplasmid 1791)
1648
63/63 11(recoded) PCR1842/1843 dsDNA
(onplasmid 1791)
1648
461/432 11(recoded) PCR1846/1847 dsDNA
(onplasmid 1791)
1648
461/432 11(recoded) Plasmid1791 dsDNA seeTable9 circular
1648
33/0 +1 ssODN1816 S
1648
33/0 +1 ssODN1817 AS asssODN1816butAS
1648
0/33 2 ssODN1819 S
1648
0/33 2 ssoDN1820 AS asssODN1819butAS
1648
33/33 2 ssODN1864 S
1648
33/33 2 ssODN1865 AS asssODN1864butAS
1648
33/34 +19(recoded) ssODN1831 S
1648
33/34 +19(recoded) ssODN1832 AS asssODN1831butAS
1648
33/33 0 ssODN1957 S
1648
nohomologyarmbut non- ssODN1955 S
65/63homologous applicable sequencesflanking theSTOP(/frame- shift)inGFP11
1648
33/33 0 ssODN1956 S
1648
33/33 0 ssODN1954 S
indicates data missing or illegible when filed
TABLE-US-00007 TABLE7 Repairtemplatesusedinthisstudy Repair crRNA/ template Homologyarms Polarity Gene sgRNA coding (nudeotides/ Distance (S-Sense targeted used for basepairs) fromDSB TypeandName AS=AntiSense) Sequence 1648
33/33 0 PCR2083/2084 dsDNA
(onssODN1957)
1648
nohomologyarmbut non- PCR2086/2087 dsDNA
65/63homologous applicable (onssODN1955) sequencesflanking theSTOP
1648
33/33 0 PCR2083/2084 dsDNA
(onssODN1954)
1776and1777
33/34 32(recoded) ssODN1782 S
1776and1777
33/34 32(recoded) ssODN1783 AS
1776and1777
33/34 17(recoded) ssODN1833 S
1776and1777
33/34 17(recoded) ssODN1834 AS
1776and1777
33/34 2 ssODN1827 S
1776and1777
33/34 2 ssODN1828 AS
1777
33/34 2 PCR2055/2054 dsDNA
(onssODN 1827/1828)
1777
33/34 32(recoded) PCR2053/2054 dsDNA
(onssODN 1782/1783)
1910
35/32 3 ssODN1911 S
1910
35/32 3 ssODN1912 AS
1909
33/35 +2 ssODN1911 S
1909
33/35 +2 ssODN1912 AS
1910
35/35 +21(recoded) ssODN1924 S
1910
35/35 +21(recoded) ssODN1925 AS
1909
35/35 +26(recoded) ssODN1922 S
1909
35/35 +26(recoded) ssODN1923 AS
1553
0/0 +5(recoded) PCR1630/832 dsDNA
(onplasmid 1698)
1553
16/17 +5(recoded) PCR1604/1605 dsDNA
(onplasmid 1698)
1553
37/38 +5(recoded) PCR1554/1555 dsDNA
(onplasmid 1698)
1748
34/38 +29(recoded) ssODN1751 S
1748
34/38 +29(recoded) ssODN1752 AS
1747
34/38 +41(recoded) ssODN1753 S
1747
34/38 +41(recoded) ssODN1754 AS
sgPYM1
38/37 +1and23 ssODN1582 S
sgPYM1
38/37 +1and23 ssODN1580 S
(recoded)
sgPYM1
38/37 +1and23 ssODN1581 AS
(recoded)
sgPYM1
46/43 +1 ssODN1518 S
sgPYM1
38/40 +1and+25 ssODN1583 S
(recoded)
sgPYM1
38/40 +1and+25 ssODN1584 AS
(recoded)
crAC3
36/36 between4/+2 dsDNA
crAC3
483/421 between4/+2 dsDNAcircular
indicates data missing or illegible when filed
TABLE-US-00008 TABLE8 Plasmidsusedinthisstudy Plasmidname Backbone Insert Sequence 1698 pUC19
2050 pUC19 GFP11withextra-sequence
2042 pUC19 eGFPwithextra-sequence
1894 pUC19 eGFPwithextra-sequenceandtagRFP
1716 pUC19
indicates data missing or illegible when filed
TABLE-US-00009 TABLE9 Plasmidsusedinthisstudy Plasmidname Backbone Insert Sequence 1791 pUC19
1892 pUC19
1893 pUC19
sgPYM1 pX458 sgRNAforPYM1 seeTable14 pBS-AC3CtermGenomic- pBlueScript-KS
mCherry
indicates data missing or illegible when filed
TABLE-US-00010 TABLE10 Primersusedinthisstudy Primersname F/R Description Sequence(5to3) 2090 F amplificationofssODN1379 ggttcgggtggtgctccac 2091 R amplificationofssODN1379 gaactctcagctcatcttg 832 R amplificationof withouthomologyarm
1630 F amplificationof
withouthomologyarm
1676 F genotypingandsequencingof
insert
1677 R genotypingandsequencingof
insert
2014 R genotypingandsequencingof
insert
2044 R genotypingandsequencingof
insert
2045 F genotypingandsequencingof
insert
2047 F genotypingandsequencingof
insert
3001 F genotypingandsequencingof
insert
1762/HK13 F genotypingandsequencingofmCherryinsert
1763/HK14 R genotypingandsequencingofmCherryinsert
1766 R genotypingandsequencingofmCherryinsert
1767 F genotypingandsequencingofmCherryinsert
1768 R genotypingandsequencingofmCherryinsert
1769 F genotypingandsequencingofmCherryinsert
1773 F genotypingandsequencingofmCherryinsert
1685 F amplificationofLaminA/C
repairtemplate
(cr1629)with~15bphomologyarm 1686 R amplificationofLaminA/C
repairtemplate(cr1629)
(cr1629)with~15bphomologyarm 1858 F amplificationofLaminA/C
repairtemplate(cr1629)
(cr1629)with~15bphomologyarm 1859 R amplificationofLaminA/C
repairtemplate(cr1629)
(cr1629)with~15bphomologyarm 1618 F amplificationofLaminA/C
repairtemplate(cr1629)
with~35bphomologyarm 1619 R amplificationofLaminA/C
repairtemplate(cr1629)
with~35bphomologyarm 1743 F amplificationofLaminA/C
repairtemplate(cr1629)
with~35bphomologyarm
indicates data missing or illegible when filed
TABLE-US-00011 TABLE11 Primersusedinthisstudy Primers name F/R Description Sequence(5to3) 1744 R amplificationofLaminA/CeGFPrepairtemplate(cr1629) with~35bphomologyarm 2003 F amplificationofLaminA/CGFP11repairtemplate(cr1629)
with~35bphomologyarm 2004 R amplificationofLaminA/CGFP11repairtemplate(cr1629)
with~35bphomologyarm 2058 F amplificationofLaminA/CeGFPrepairtemplate(cr1629)
with~240bphomologyarm 2059 R amplificationofLaminA/CeGFPrepairtemplate(cr1629)
with~240bphomologyarm 1741 F amplificationofLaminA/CeGFPrepairtemplate(cr1629)
with~500bphomologyarm 1742 R amplificationofLaminA/CeGFPrepairtemplate(cr1629)
with~500bphomologyarm 2051 F amplificationofLaminA/CGFP11::extra-sequence(336bpinsert)
repairtemplate(cr1629)with~35bphomologyarm 2052 R amplificationofLaminA/CGFP11::extra-sequence(336bpinsert)
repairtemplate(cr1629)with~35bphomologyarm 2049 F amplificationofLaminA/Cextra-sequence::eGFP(993bpinsert)
repairtemplate(cr1629)with~35bphomologyarm 2015 R amplificationofLaminA/CTEV::eGFP::extra-sequence(1112bpinsert)
repairtemplate(cr1629)with~35bphomologyarm 2005 F amplificationofLaminA/CTEV::eGFP::extra-sequence::tagRFP(2229bp
insert)(cr1629)with~35bphomologyarm 2006 R amplificationofLaminA/CTEV::eGFP::extra-sequence::tagRFP(2229bp
insert)(cr1629)withwith~35bphomologyarm 1948 F amplificationofLaminA/CGFP11atcutandtagRFP33bpdownstream
(cr1629)with~35bphomologyarm 1949 R amplificationofLaminA/CGFP11atcutandtagRFP33bpdownstream
(cr1629)with~35bphomologyarm 1965 R LaminA/CIlluminasequencingwithbarcode5 caagcagaagacggcatacgagatacagcagtgactggagttcagacgtgtgctcttccgatc
1966 R LaminA/CIlluminasequencingwithbarcode6 caagcagaagacggcatacgagatcacgtgtgactggagttcagacgtgtgctcttccgatc
1967 R LaminA/CIlluminasequencingwithbarcode7 caagcagaagacggcatacgagatgtgatggtgactggagttcagacgtgtctcttccgatc
1968 R LaminA/CIlluminasequencingwithbarcode8 caagcagaagacggcatacgagattgttacgtgactggagttcagacgtgtgctcttccgatc
1969 R LaminA/CIlluminasequencingwithbarcode9 caagcagaagacggcatacgagatagatccgtgactggagttcagacgtgtgctcttccgatc
1970 R LaminA/CIlluminasequencingwithbarcode10 caagcagaagacggcatacgagatcccggagtgactggagttcagacgtgtgctcttccgatc
390 F pre-amplificationoftheeditforIlluminasequencing(ininsert)
1849 R pre-amplificationoftheeditforIlluminasequencing(inLaminA/C
1928 F illuminasequencing(ininsert) aatgatacggcgaccaccgagatctacactctttccctacacgacgctcttccgatct
indicates data missing or illegible when filed
TABLE-US-00012 TABLE12 Primersusedinthisstudy Primersname F/R Description Sequence(5to3) 1712 F LaminA/Cgenotypingandsequencing gaaggtctgaggcaatgggg 1713 R LaminA/Cgenotypingandsequencing gatgtagaccgccaagcgat 2076 F LaminA/Cgenotypingandsequencing ggccggcgcactccgactc 2077 R LaminA/Cgenotypingandsequencing caagcgatcattgagctcc 1838 F amplificationofRAB11A repairtemplate(cr1648)with ctcggccgcgcaatg
~15bphomologyarm 1839 R amplificationofRAB11AeGFPrepairtemplate(cr1648)with tagtcgtactcgtcgtcac ~15bphomologyarm 1652 F amplificationofRAB11A
repairtemplate(cr1648)with gctcccgccctttcgctctc ~35bphomologyarm ggccgcgcaatg
1653 R amplificationofRAB11A
repairtemplate(cr1648)with gcctcacctttaaagaggtagtc ~35bphomologyarm gtactcgtcgtcacgtgttcc
1840 F amplificationofRAB11AeGFPrepairtemplate(cr1648)with gctcccgcccttcgctc ~35bphomologyarm 1841 R amplificationofRAB11AeGFPrepairtemplate(cr1648)with gcctcacctttaaagagg ~35bphomologyarm 2008 F amplificationofRAB11AGFP11repairtemplate(cr1648)with gctccagccctttcgctcc ~35bphomologyarm 2009 R amplificationofRAB11AGFP11repairtemplate(cr1648)with gcctcacctttaacgaggtag ~35bphomologyarm 1846 F amplificationofRAB11AeGFPrepairtemplate(cr1648)with ggaaccgccacgcatgtg ~50bphomologyarm 1847 R amplificationofRAB11AeGFPrepairtemplate(cr1648)with cagaggggcttcgggagag ~50bphomologyarm 2053 F amplificationofRAB11AGFP11repairtemplate(cr1777, gctcccgccctttcgctcc at32bpfromDSB)with~35bphomologyarm 2055 F amplificationofRAB11AGFP11repairtemplate(cr1777, atgggctcccgcgacgacg at2bpfromDSB)with~35bphomologyarm 2054 R amplificationofRAB11AGFP11repairtemplate(cr1777) gtgtagagtgcgagagcc with~35bphomologyarm 2083 F amplificationofRAB11Aextra-sequence::GFP11(with/without ttcgctcctcggctgcgc STOP/frameshift)::extra-sequencerepairtemplate(cr1648) with~35bphomologyarm 2084 R amplificationofRAB11Aextra-sequence::GFP11(with/without gcctcaccttcaaagagg STOP/frameshift)::extra-sequencerepairtemplate(cr1648) with~35bphomologyarm 2086 F amplificationofextra-sequence::GFP11(withoutSTOP/ aaagatcatgatatcgattac frameshift)::extra-sequencerepairtemplate 2087 R amplificationofextra-sequence::GFP11(withoutSTOP/ cagatcctctcctgatatcag frameshift)::extra-sequencerepairtemplate 1604 F amplificationofSMC3
repairtemplate(cr1553)with gaagatgataccacacacgga
~15bphomologyarm 1605 R amplificationofSMC3
repairtemplate(cr1553)with gtagtattccccaatta
~15bphomologyarm 1554 F amplificationofSMC3
repairtemplate(cr1553)with gagatggccaaagactttgtaga ~35bphomologyarm agtgataccacacacgga
indicates data missing or illegible when filed
TABLE-US-00013 TABLE13 Primersusedinthisstudy Primersname F/R Description Sequence(5to3) 1555 R amplificationofSMC3 repairtemplate(cr1553)
with~35bphomologyarm 1483 F PYM1genotyping tgtacggtgtattggcactcg 1485 R PYM1genotyping gatagttgcccctccttcca sgPYM1cloning F PYM1sgRNAcloning caccgcgtcaacacagcgacctga sgPYM1cloning R PYM1sgRNAcloning aaactcaggtcgctgtgttgacgc 1596 F amplificationofmouseAdcy3mCherryrepair ctctgttacactgcccac template(crAdcy3)with~35bphomologyarm 1597 R amplificationofmouseAdcy3mCherryrepair ccccttcctatgtggacc template(crAdcy3)with~35bphomologyarm 1760/HK15 F mouseAdcy3genotypingandsequencing ccttcgagagtacggcttcc 1761/HK16 R mouseAdcy3genotypingandsequencing ggaacaccaggacttggtca 1772 F mouseAdcy3genotypingandsequencing ttgtgaggcgaggtcccatc 1764/HK11 F mouseAdcy3genotypingandsequencing gacatccggggcaatacggtc HK12 R mouseAdcy3genotypingandsequencing gactgtagcaagagctcagaaga 1765 R mouseAdcy3genotypingandsequencing gctcagaagacaaggcaatattg
indicates data missing or illegible when filed
TABLE-US-00014 TABLE14 crRNA/sgRNAusedinthisstudy Guide Name Target Type Polarity Sequence 1629 LaminA/C crRNA S ccatggagaccccgtcccag 1648 RAB11A crRNA AS ggtagtcgtactcgtcgtcg 1728 LaminA/C crRNA S gcggcgcgccacccgcagcg 1729 LaminA/C crRNA AS agctggcctgcgccccgctg 1776 RAB11A crRNA AS ccatggcctcacctttaaag 1777 RAB11A crRNA S gagtacgactacctctttaa 1909 RAB11A crRNA S aaccactgaaaacaagccaa 1910 RAB11A crRNA AS ttctgacagcactgcacctt 1553 SMC3 crRNA AS attttccaattaaccatgtg 1747 SMC3 crRNA S tgatgtgatcacagcagaga 1748 SMC3 crRNA AS atcatcttctacaaagtctt sgPYM1 PYM1 sgRNA S gcgtcaacacagcgacctga crAdcy3 mouse crRNA AS gtggagccagaggtcgctca Adcy3
TABLE-US-00015 TABLE 15 Classification of reads from the Illumina sequencing experiment Unexpected Reads with Total Below mutation at Used in switching sequencing Do not fully quality diagnostic downstream detected (% of the Sample reads map to template threshold position analysis previous column) No mutation 3,369,768 21.60% 42.30% 0.20% 36.00% 0.00% PCR control 3,241,689 20.10% 56.40% 0.20% 23.50% 0.02% 1/3 3,411,796 20.80% 42.30% 0.40% 36.60% 0.02% 1/6 5,680,820 21.20% 42.40% 0.20% 36.20% 0.50% 1/12 6,414,459 21.00% 42.40% 0.10% 36.50% 1.40%