COMPOSITION FOR TREATING HEMOPHILIA A BY CRISPR/CAS SYSTEM OF REVERTING FVIII GENE INVERSION
20200263204 ยท 2020-08-20
Inventors
Cpc classification
C12N2310/20
CHEMISTRY; METALLURGY
A61K31/7088
HUMAN NECESSITIES
C12N9/22
CHEMISTRY; METALLURGY
C12N2750/14143
CHEMISTRY; METALLURGY
C12N15/113
CHEMISTRY; METALLURGY
C12N15/90
CHEMISTRY; METALLURGY
A61K48/005
HUMAN NECESSITIES
A61P7/04
HUMAN NECESSITIES
C07K14/755
CHEMISTRY; METALLURGY
C12N15/88
CHEMISTRY; METALLURGY
C12N15/86
CHEMISTRY; METALLURGY
International classification
C12N15/90
CHEMISTRY; METALLURGY
A61P7/04
HUMAN NECESSITIES
C12N15/86
CHEMISTRY; METALLURGY
C12N9/22
CHEMISTRY; METALLURGY
Abstract
The present invention relates to a CRISPR/Cas system having an inversion correction potential, which uses at least one guide RNA targeting a sequence region where two different homologs present on genomic introns are conjugated to each other in an inversion manner, and a Cas protein, and a CRISPR/Cas system of FVIII gene inversion correction potential that uses at least one guide RNA targeting an int22-1/3 homolog or int22-1/2 homolog sequence region present on intron 22 of coagulation factor VIII (F8) gene and a Cas protein. A CRISPR/Cas system according to the present invention comprises a system which employs a small-size Cas9 and a guide RNA fitted thereto, thereby enabling all CRISPR/Cas instruments to be easily packaged in one AAV, which is impossible in conventional large-size Cas9. In addition, the CRISPR/Cas system can induce normal gene expression thanks to the inversion gene correction potential thereof and is excellent as a technology capable of effectively overcoming the difficult intracellular delivery of large-size gene mutation through gene editing. Particularly, the system can induce normal FVIII expression by restoring the inversion of FVIII gene and thus is useful for the treatment of hemophilia A.
Claims
1. A method for editing an inversion of a blood coagulation factor VIII (F8) gene,. comprising: subjecting a patient in need thereof to a composition comprising: (i) a Cas protein or a nucleotide encoding the Cas protein; and (ii) at least one guide RNA that specifically targets a sequence region of an int22-1/2 homolog or int22-1/3 homolog resulting from conjugation by inversion between homologs 1 (int22-1) and 2 (int22-2) or between homologs 1 (int22-1) and 3 (int22-3) of intron 22 in the blood coagulation factor VIII (F8) gene.
2. The method according to claim 1, wherein the guide RNA specifically targets a sequence comprising a proto-spacer-adjacent motif (PAM) sequence 5-NNNNRYAC-3 and a sequence comprising 1 bp or 2 bp mismatch not present on a human genome.
3. The method according to claim 1, wherein the guide RNA specifically targets a sequence selected from SEQ ID NOS: 1 to 42.
4. A method for preventing or treating hemophilia, the method comprising: subjecting a patient in need thereof to a composition comprising: (i) a Cas protein or a nucleotide encoding the Cas protein; and (ii) at least one guide RNA that specifically targets a sequence region of an int22-1/3 homolog or int22-1/2 homolog resulting from conjugation by inversion between homologs 1 (int22-1) and 2 (int22-2) or between homologs 1 (int22-1) and 3 (int22-3) of intron 22 in a blood coagulation factor VIII (F8) gene.
5. The method according to claim 4, wherein the guide RNA specifically targets a sequence comprising a proto-spacer-adjacent motif (PAM) sequence 5-NNNNRYAC-3 and a sequence comprising 1 bp or 2 bp mismatch not present on a human genome.
6. The method according to claim 4, wherein the guide RNA specifically targets a sequence selected from SEQ ID NOS: 1 to 42.
7. A method for inducing an inversion of a blood coagulation factor VIII (F8) gene, the method comprising: subjecting a patient in need thereof to a composition comprising: (i) a Cas protein or a nucleotide encoding the Cas protein; and (ii) at least one guide RNA that specifically targets a sequence region of an int22-1/3 homolog or int22-1/2 homolog resulting from conjugation by inversion between homologs 1 (int22-1) and 2 (int22-2) or between homologs 1 (int22-1) and 3 (int22-3) of intron 22 in the blood coagulation factor VIII (F8) gene.
8. The method according to claim 7, wherein the guide RNA specifically targets a sequence comprising a proto-spacer-adjacent motif (PAM) sequence 5-NNNNRYAC-3 and a sequence comprising 1 bp or 2 bp mismatch not present on a human genome.
9. The method according to claim 7, wherein the guide RNA specifically targets a sequence selected from SEQ ID NOS: 1 to 42.
10. A guide RNA that specifically targets one or more sequences selected from sequences represented by SEQ ID NOS: 1 to 42.
Description
DESCRIPTION OF DRAWINGS
[0025] The above and other objects, features and other advantages of the present invention will be more clearly understood from the following detailed description taken in conjunction with the accompanying drawings, in which:
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
DETAILED DESCRIPTION AND EXEMPLARY EMBODIMENTS
[0033] Unless defined otherwise, all technical and scientific terms as used herein have the same meanings as commonly understood by one of ordinary skill in the art to which the present invention pertains. Generally, the nomenclature used herein is well known in the art and is commonly used.
[0034] The present invention relates to a composition for editing an inversion of the blood coagulation factor VIII (F8) gene, which includes: (i) a Cas protein or a nucleotide encoding the Cas protein; and (ii) at least one guide RNA that specifically targets a sequence region of an int22-1/3 homolog or int22-1/2 homolog resulting from conjugation by inversion between homologs 1 (int22-1) and 2 (int22-2) or between homologs 1 (int22-1) and 3 (int22-3) of intron 22 in the blood coagulation factor VIII (F8) gene.
[0035] In relation to the development of a therapeutic agent for hemophilia A, to restore an inverted state of the FVIII gene using the CRISPR/Cas system, it is required to design a guide RNA targeting the sequence of the int22-1/3 homolog or the int22-1/2 homolog, which is a conjugated region between homolog 1 and homolog 2 or 3 of int22, which is an inversion position.
[0036] Generally, in guide RNAs, a guide sequence having a length of 30 bp or less present on the 5 end thereof binds to a target site, and the 3 end thereof is involved in forming a complex with the Cas protein, which is a nuclease, and since the Cas protein-binding site on the 3 end has a unique sequence due to the biological characteristics thereof, designing a guide RNA means a process of selecting a guide sequence that binds to the target site.
[0037] The term target of the guide RNA as used herein refers to a sequence on the genome that is cleaved by the CRISPR/Cas system or undergoes modification such as deletion, insertion, or substitution in a restoration process after cleavage, and to which a guide sequence binds. For example, in the present invention, the target is a gene region capable of being hybridized with the guide RNA in the int22-1/2 homolog or int22-1/3 homolog sequence region, and is a consecutive nucleotide sequence having a length of 17 bp to bp or 18 bp to 22 bp located adjacent to the 5 end and/or the 3 end of a protospacer-adjacent motif (PAM) present on DNA including the int22-1/2 homolog or int22-1/3 homolog sequence region. For example, an upstream sequence on the 5 end is set as a guide sequence, and the guide sequence may be designed to be positioned on the 5 end of the guide RNA. Actually, since the guide RNA hybridizes to a strand complementary to a DNA strand where the PAM is present, the sequence of the guide RNA has the same characteristics as those of the upstream sequence of the 5 end of the PAM.
[0038] The sequence of the guide RNA that can be hybridized with the target region of the target gene is a nucleotide sequence with at least 90% homology, at least 95% homology, at least 96% homology, at least 97% homology, at least 98% homology, at least 99% homology, or 100% homology to a nucleotide sequence of a strand complementary to a DNA strand at which the target sequence is positioned (i.e., a DNA strand at which the PAM sequence is positioned), and is capable of binding to the nucleotide sequence of the complementary strand.
[0039] The target sequence is represented by a nucleic acid sequence of a strand where the PAM sequence is located, among the two DNA strands of the int22-1/2 homolog or int22-1/3 homolog sequence region, which is a target gene. In this regard, since the DNA strand to which the guide RNA binds is actually a strand complementary to the strand where the PAM sequence is located, a sequence included in the guide RNA has the same nucleic acid sequence as the target sequence located in the target gene, except that T is changed to U due to the characteristics of RNA. Thus, in the present specification, the guide RNA sequence and the target sequence are represented by the same nucleic acid sequence, except that T is changed to U or vice versa.
[0040] The target sequence has very high genetic correction efficiency (e.g., indel frequency (%)) at on-target sites, and at sites except for the on-target sites, since the number of mismatching nucleotides is 3 or less, 2 or less, 1, or 0, there are almost no off-target sites, and thus there is little or no chance that genetic correction will occur in regions except for the on-target sites, such that the target sequence has excellent safety.
[0041] The guide RNA may be capable of being hybridized with a target including, a proto-spacer-adjacent motif (PAM) sequence 5-NNNNRYAC-3 and for example, a sequence comprising 1 bp or 2 bp mismatch not present on a human genome.
[0042] In one embodiment, the guide RNA includes a sequence that specifically targets a target including a sequence selected from SEQ ID NOS: 1 to 42 and is hybridizable with the target, and the guide RNA may include one or more sequences selected from sequences represented by SEQ ID NOS: 1 to 42 (provided that T in the sequence is changed to U).
[0043] Since the specificity of the CRISPR/Cas system is determined by the guide sequence of the guide RNA, an activity verification process must be performed before application to a gene-editing process: guide sequences are selected through an available program such as ATUM Scoring Algorithms or GenScript based on the protospacer-adjacent motif (PAM) present in the target gene, guide RNAs including the selected guide sequences are further subjected to mismatch-sensitive T7 endonuclease I (T7E1) analysis and targeted deep-sequencing to verify on-target and off-target effects, and finally, guide RNAs with high target specificity and no off-target effect are determined.
[0044] In the present invention, for 10 kb of the int22-1/2 homolog or int22-1/3 homolog sequence, for example, about 42 guide sequences where 1b-mismatched off-target or 2b-mismatched off-target is not present on the human genome were designed by considering the PAM sequence 5-NNNNRYAC-3 (each N independently being A, T, C, or G, R being A or G, and Y being C or T) for the Cas protein derived from Campylobacter jejuni, guide RNAs including the same were produced, and guide RNAs with excellent gene cleavage activity were selected through T7E1 assay and verification of each of the produced guide RNAs.
[0045] As a result of measuring the frequency of induction of an inversion of the FVIII gene in HEK293 cells, which are normal cells, 6 types with the highest cleavage activity among the selected guide RNAs of the present invention were found to have an efficiency of 5% to 10%. This result contrasted notably with a frequency of inversion induction of 1.5% to 2.2% in a Hela cell line, as disclosed in the related art (Park et al., Cell Stem Cell, 2015) which discloses targeting the external sequence of the int22-1/3 homolog with two guide RNAs, from which it was confirmed that the guide RNA of the present invention has considerably high activity. In particular, it was demonstrated that the CRISPR/Cas system of the present invention is capable of reversing an inversion of the FVIII gene using only a guide RNA and the Cas protein, unlike the related art, which uses two types of guide RNAs.
[0046] Since the sequence targeted by the guide RNA in the normal cells is the same sequence as that of a target for restoring the inverted FVIII gene, it will be obvious to those of ordinary skill in the art to predict gene restoration capability based on the frequency of inversion induction in normal cells rather than in a disease model.
[0047] An embodiment of the present invention relates to a CRISPR/Cas system having inversion correction capability, which uses at least one guide RNA targeting a sequence region where two different homologs present in an intron on the genome are inverted and conjugated, and the Cas protein.
[0048] As used herein, the term intron is a nucleic acid site present on the genome of eukaryotic cells, which is present in pre-mRNA during transcription, but is removed during matured mRNA production by splicing, and thus is a DNA sequence on the genome that is not involved in protein production.
[0049] As used herein, the term homolog is a section in which the same sequence is repeated on an intron, and may be interchangeably used with the term homologous copy or homolog repeat. Since homologs present at different positions on the genome have the same sequence, the homologs may cause homologous recombination during a genetic recombination process, and in the FVIII gene, representative examples of the homologs include an int22-1 homolog, an int22-2 homolog, and an int22-3 homolog. For reference, the sizes of the respective homologs of FVIII are 10 kb, 12 kb, and 11 kb, respectively.
[0050] As used herein, the term inversion is a structural abnormality of a chromosome caused by rearrangement of sequences generated by homologous recombination between different homologs on the chromosome. Most mutations caused by an inversion exhibit a normal phenotype, but when occurring in the meiosis stage of germ cells, such mutations may cause the formation of a recombinant chromosome through deletion or duplication of chromosomes, thus exhibiting an abnormal phenotype. Most commonly, there is hemophilia A caused by an inversion at 28q of the X chromosome, and there are also known diseases caused by inversions, such as Wolf-Hirschhorn's syndrome, in which 4p is deleted due to a structural abnormality in chromosome 4, and Hunter's disease, caused by an inversion of the IDS gene in the Xq chromosome.
[0051] As used herein, the term conjugation refers to a state in which a nucleic acid sequence artificially cleaved by a genetic recombination process or endonuclease is relinked, and can be used interchangeably with ligation.
[0052] As used herein, the term cleavage refers to breakage of the covalent backbone of a DNA molecule, and may be initiated by various methods, including enzymatic or chemical hydrolysis of a phosphate bond. Both single-strand cleavage and double-strand cleavage are possible, among which double-strand cleavage may occur as a result of cleaving two separate single strands. DNA cleavage may result in the production of blunt ends or staggered ends, and cleavage in the present invention refers to a blunt end.
[0053] The guide RNA for reversing an inversion according to the present invention targets the sequence region resulting from conjugation by inversion. The term sequence region resulting from conjugation refers to a range corresponding to the length of a recombinant homolog present on the genome, and particularly, may be a range including a sequence having a maximum size of 12 kb including the 5 and 3 directions with respect to the conjugated position.
[0054] Generally, the CRISPR/Cas system consists of the Cas protein, trans-activating RNA (tracrRNA), and crRNA, and the crRNA includes a spacer that binds to a target site and an RNA sequence capable of being hybridized with tracrRNA, and thus is first hybridized with tracrRNA before forming a complex with the Cas protein, thereby forming an RNA-RNA conjugate. In addition, crRNA and tracrRNA may be artificially fused and induced to form a complex with the Cas protein in the single-stranded state.
[0055] As used herein, the term guide RNA generally includes tracrRNA and crRNA, and specifically, refers to double-stranded RNA consisting of RNA including a spacer or guide sequence that binds to a target site (sequence) and RNA that binds to the Cas protein, or single-stranded RNA obtained by fusing said two RNAs. Thus, the term guide RNA may be used interchangeably with the terms single-stranded guide RNA, chimeric RNA, chimeric guide RNA, and synthetic guide RNA. In addition, any RNA that binds to the Cas protein (or to an endonuclease) to bind to the target site, or binds to the same to form a complex having a characteristic of cleaving the target site may be interpreted as being encompassed by the term guide RNA.
[0056] A fused molecule is a molecule in which two or more subunit molecules are preferably covalently linked. The subunit molecules may be molecules of the same chemical type or molecules of different chemical types. The fusion of crRNA and tracrRNA in the guide RNA refers to a state in which the backbone is connected by phosphorylation bonds, not hydrogen bonds, and consists of a single strand.
[0057] As used herein, the term guide sequence refers to an RNA sequence having a length within 30 bp at the 5 end of the guide RNA and having characteristics complementary to a target site, and may be used interchangeably with the term spacer or spacer sequence. Generally, all sequences that are not involved in binding to the Cas protein in the Cas protein/guide RNA complex and complementarily bind to the target site may be interpreted as being encompassed by the above term.
[0058] The guide sequence may be easily selected by one of ordinary skill in the art using CRISPR RGEN Tools (http://www.rgenome.net; Park et al, Bioinformatics, 31:4014-4016, 2015); sgRNA Designer (Doench et al., Nature Biotechnology 34(2):184-191, 2016); E-CRISP (http://www.e-crisp.org/E-CRISP/Heigwar et al., Nature Methods 11(2):122-123, 2014); Benchling (https://benchling.com); sgRNA scorer 2.0 (https://crispr.med.harvard.edu/sgRNAScorerV2; Chari et al., ACS Synthetic Biology, 2017. doi: 10.1021/acssynbio.6b00343), and CRISPy-web (Blin et al., Synthetic and Systems Biotechnology, 1(2): 118-121, 2016).
[0059] In the present invention, the guide RNA may consist of (a) crRNA including a guide sequence and tracrRNA, (b) a fusion body of crRNA including a guide sequence and tracrRNA (chimeric guide RNA), or (c) crRNA including a guide sequence, but the present invention is not limited thereto.
[0060] For the guide RNA of the present invention, modifications may be applied to nucleic acid backbones or bases: in backbone modifications, phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, aminoalkylphosphotriesters, 3-alkylene phosphonates, 5-alkylene phosphonates, chiral phosphonates, phosphinates, phosphoramidates including 3-amino phosphoramidate aminoalkylphosphoramidates, phosphorodiamidates, thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, selenophosphates, and boranophosphate may be used, and in sugar molecule residue modifications, 2-methoxyethoxy, 2-MOE, 2-dimethylaminooxyethoxy, 2-dimethylaminoethoxyethoxy, methoxy, aminopropoxy, fluoro, ally, and O-allyl may be used, but the present invention is not limited thereto. In addition, in base modifications, instead of generally using purine and pyrimidine, 5-methylcytosine(5meC), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 7-deaza-adenine, 7-deazaguanosine, 2-aminopyridine, 2-pyridone, 2-thiouracil, 2-thiothymine 2-thiocytosine, 5-halouracil, 5-propynyl uracil, 6-azo uracil, pseudouracil, 7-methylguanine, 7-methyladenine, 2-F-adenine, 2-aminoadenine, 8-azaguanine, 8-azaadenine, 7-deazaguanine, 7-deazaadenine, 3-deazaguanine, 3-deazaadenine, phenoxazine cytidine, phenothiazine cytidine, carbazole cytidine, and pyridoindole cytidine may be used, but the present invention is not limited thereto.
[0061] The Cas protein of the present invention may be derived from a microorganism including an ortholog of the Cas protein selected from the genus Corynebacter, the genus Sutterella, the genus Legionella, the genus Treponema, the genus Filifactor, the genus Eubacterium, the genus Streptococcus, the genus Lactobacillus, the genus Mycoplasma, the genus Bacteroides, the genus Flaviivola, the genus Flavobacterium, the genus Azospirillum, the genus Gluconacetobacter, the genus Neisseria, the genus Roseburia, the genus Parvibaculum, the genus Staphylococcus, the genus Nitratifractor, the genus Corynebacterium, and the genus Campylobacter, and may be at least one selected from simply isolated Cas proteins and recombined Cas proteins, particularly a Cas protein derived from the genus Campylobacter, but the present invention is not limited thereto.
[0062] The Cas protein of the present invention binds to the guide RNA and forms a complex, recognizes the PAM sequence present on the sequence of the target gene to guide the CRISPR/Cas system to the target gene, recognizes the target gene through complementary binding between the guide sequence of the guide RNA and the target site, and finally exhibits nucleic acid cleavage activity having a characteristic of a double-strand break due to active sites of Cas9, e.g., the HNH domain and the RuvC domain thereof.
[0063] Gene and protein information of the Cas protein can be obtained from GenBank of the National Center for Biotechnology Information (NCBI).
[0064] In the present invention, the Cas protein may be a nuclease selected from Cas3, Cas9, Cpf1, Cas6, and C2c2, particularly the Cas protein of CRISPR/Cas type II, and more particularly a Campylobacter jejuni-derived Cas protein.
[0065] In the present invention, the Cas protein may be a nuclease simply isolated from a microorganism, but may be modified to (a) a variant in which some amino acid sequences are substituted, (b) a fusion protein with an affinity tag, recombinase, transposon, or the like, (c) a variant to which nuclear localization sequences (NLS) are added, or the like.
[0066] The CRISPR/Cas system according to the present invention enables normal genes to be expressed by editing inverted genes, and thus may induce a semi-permanent treatment effect on a disease. The proportion of normal gene expression may range from 1% to 100%, particularly 1% to 80%, and more particularly 30% to 50%, but the present invention is not limited thereto.
[0067] At least one guide RNA and the Cas protein, which are detailed components of the CRISPR/Cas system of the present invention, may be intracellularly delivered via (a) a vector including nucleic acid sequences expressing at least one guide RNA and the Cas protein, (b) a ribonucleoprotein (RNP) or RNA-guided engineered nuclease (RGEN) consisting of at least one guide RNA and the Cas protein, or (c) a configuration of at least one guide RNA and mRNA by which the Cas protein is encoded, but the present invention is not limited thereto.
[0068] The vector including nucleic acid sequences expressing at least one guide RNA and the Cas protein may simultaneously include at least one guide RNA and the Cas protein, or may consist of two or more vectors respectively expressing the guide RNA and the Cas protein.
[0069] In the CRISPR/Cas system of the present invention, the vector may be one or more vectors, and may be a plasmid vector or a viral vector, and the viral vector may particularly be an adenovirus vector, an adeno-associated virus vector, a lentivirus vector, or a retrovirus vector, but the present invention is not limited thereto.
[0070] The vectors of the present invention may be delivered to a cell by microvehicles, liposomes, exosomes, nanoparticles, a gene gun, or the like, but the present invention is not limited thereto, and it will be obvious to those of ordinary skill in the art to select a delivery method suitable for the structural characteristics of vectors and the characteristics of cells to which the vectors are to be delivered.
[0071] To verify that the CRISPR/Cas system can restore inverted genes and enable normal gene expression, thus inducing therapeutic effects, the inventors of the present invention determined the restoration capacity by the CRISPR/Cas system via targeting a sequence region of the int22-1/3 homolog of the FVIII gene causative of hemophilia A, which is a representative disease caused by genetic inversions, as follows.
[0072] The inventors of the present invention first selected, from the 10 kb sequence of an int22-1/2 homolog or an int22-1/3 homolog of FVIII, which has the PAM sequence 5-NNNNRYAC-3, about 42 guide RNA sequences where 1b-mismatched off-target or 2b-mismatched off-target is not present on the human genome (see Tables 1 and 2).
TABLE-US-00001 TABLE1 Indels guideRNA Di- mismatch mismatch mismatch Activity inNGS w/oPAM SeqIDNO rection (0) (1) (2) (T7E1) (%) hF8-int22-Cj1 TCTCGTTGGTGT SEQIDNO.:1 + 3 0 0 TGTCTCAATG hF8-int22-Cj2 GGTATACTGTGT SEQIDNO.:2 3 0 0 ++ 13.3 TAAGCATTGA hF8-int22-Cj3 TTAGGTAGTAA SEQIDNO.:3 3 0 0 AATGGCACACA hF8-int22-Cj4 TCAGGTTTTAG SEQIDNO.:4 3 0 0 AATACCCTGTA hF8-int22-Cj5 CTTCTCATCATG SEQIDNO.:5 + 3 0 0 CTACAGATTC hF8-int22-Cj6 TCTCATCATGC SEQIDNO.:6 + 3 0 0 ACAGATTCTT hF8-int22-Cj7 TCATCATGCTAC SEQIDNO.:7 + 3 0 0 AGATTCTTTA hF8-int22-Cj8 CAGGGTTGGCA SEQIDNO.:8 3 0 0 TATTAGTCCGT hF8-int22-Cj9 AATCTGCAACA SEQIDNO.:9 + 3 0 0 GTCGGTTGTAT hF8-int22-Cj10 TGAAATTTTCA SEQIDNO.:10 3 0 0 GAGTATACATC hF8-int22-Cj11 AAGGCATTTTG SEQIDNO.:11 + 3 0 0 TGTTAACGGAT hF8-int22-Cj12 GGCATCTGGGG SEQIDNO.:12 + 3 0 0 +++ 42 ACGGCAGAGGG hF8-int22-Cj13 ATAGGAAGAGA SEQIDNO.:13 3 0 0 ++ 24.7 GGCCTGAAACG hF8-int22-Cj14 CACTACCAGCCC SEQIDNO.:14 3 0 0 CCACACCCTC hF8-int22-Cj15 ACCACTACCAG SEQIDNO.:15 3 0 0 CCCCCACACCC hF8-int22-Cj16 CAGTCACTTGCC SEQIDNO.:16 3 0 0 GCACGCCTCC hF8-int22-Cj17 GAACTTTCCCTT SEQIDNO.:17 3 0 0 + TCTACTGGAT hF8-int22-Cj18 TGTGGTGACGG SEQIDNO.:18 3 0 0 + CCCCTCACAAG hF8-int22-Cj19 CATGTGGTGAC SEQIDNO.:19 3 0 0 + GGCCCCTCACA hF8-int22-Cj20 AGGGGCTGGGC SEQIDNO.:20 + 3 0 0 CCTGGTCGCTA hF8-int22-Cj21 GCCTAGCTGGG SEQIDNO.:21 + 3 0 0 CCAAGCCGAGA hF8-int22-Cj22 CGACAGGCTAC SEQIDNO.:22 + 3 0 0 TGAGCACCCTT
TABLE-US-00002 TABLE2 guideRNA mismatch mismatch mismatch Activity Indelsin w/oPAM SEQIDNO Direction (0) (1) (2) (T7E1) NGS(%) hF8-int22- TGGCCCCT SEQIDNO.:23 + 3 0 0 Cj23 GGCGAGG ACTAGCT hF8-int22- CCCCCGGC SEQIDNO.:24 + 3 0 0 Cj24 CTGAACCC CCGGCC hF8-int22- GGGGAGT SEQIDNO.:25 + 3 0 0 + Cj25 GGCCCTGG GTGGGAA hF8-int22- GTCTTGAG SEQIDNO.:26 3 0 0 +++ 31.9 Cj26 GCCCACCC GCCCCA hF8-int22- GGCTGAAC SEQIDNO.:27 + 3 0 0 +++ 35.2 Cj30 GTTACCAG CACCCC hF8-int22- TTTATAGA SEQIDNO.:28 3 0 0 ++ 22.6 Cj31 CGACGGAC ACCAA hF8-int22- AGAGAGTC SEQIDNO.:29 3 0 0 Cj32 ACATTTTA TAGACG hF8-int22- CTGCCTTG SEQIDNO.:30 3 0 0 Cj33 CAGTCACG TGATCG hF8-int22- ACGGAACG SEQIDNO.:31 3 0 0 ++ 23 Cj34 CCATCCTG CTTTGG hF8-int22- AAAGCAG SEQIDNO.:32 + 3 0 0 Cj35 GATGGCGT TCCGTTT hF8-int22- AAATGCAT SEQIDNO.:33 + 3 0 0 ++ 21.4 Cj36 CCTTTGGG CCGAAG hF8-int22- CTGATCTT SEQIDNO.:34 + 3 0 0 Cj37 CGGGGCTG AGATGT hF8-int22- CCCCTGCA SEQIDNO.:35 3 0 0 Cj38 AAAAGAC GCAAAAC hF8-int22- CACTCAAC SEQIDNO.:36 3 0 0 + cj39 TCACAGTG ACAACC hF8-int22- CCTCTCCTG SEQIDNO.:37 3 0 0 cj40 TGCTTCCC ATTAC hF8-int22- TGGGTATC SEQIDNO.:38 + 3 0 0 ++ 20.2 Cj41 TGAGCAT ATTCTA hF8-int22- ATGTATCA SEQIDNO.:39 + 3 0 0 Cj42 CATCAGTA TCAAAA hF8-int22- TTTAACTT SEQIDNO.:40 + 3 0 0 Cj43 GAAAACA ATTCCAA hF8-int22- TTCGCTGC SEQIDNO.:41 3 0 0 Cj44 CACCATTT CCGACG hF8-int22- CGGACTGT SEQIDNO.:42 3 0 0 Cj45 GAGTTAAG AATAGT
[0073] Guide RNAs including the target sequences listed in Tables 1 and 2 were produced, and a CRISPR/Cas system including the same was used to verify the cleavage efficiency thereof on the int22-1/2 homolog or the int22-1/3 homolog of FVIII through a T7E1 assay, and as a result, guide RNAs including 13 guide sequences were selected. Thereafter, the off-target sequence of each guide RNA was found to a level of 3b-mismatch, and after confirming that no off-target was present, whether capability to restore inverted genes was realized was confirmed in vitro.
[0074] Subsequently, to confirm whether in-vivo treatment can be induced, an animal model in which an inversion occurred about 300 kb away from FVIII-int22 was produced, and the selected guide RNAs and the Cas protein were packaged in an AAV, which was then injected into the animal model, and as a result of performing genetic analysis, it was confirmed that the inverted FVIII was reversed. In addition, as a result of evaluating the activity of FVIII in blood vessels, it was verified that the activity was increased to a significant level.
[0075] Therefore, other embodiments of the present invention relate to a CRISPR/Cas system for editing an inversion of the blood coagulation factor VIII (FVIII) gene, which uses at least one guide RNA targeting a sequence region of an int22-1/3 homolog or int22-1/2 homolog of intron 22 in the blood coagulation factor VIII (FVIII) gene, and a Cas protein, and particularly to a composition for editing an inversion of the blood coagulation factor VIII (F8) gene or a composition for the prevention or treatment of hemophilia, which includes: (i) a Cas protein or a nucleotide encoding the Cas protein; and (ii) at least one guide RNA that specifically targets a sequence region of an int22-1/3 homolog or int22-1/2 homolog resulting from conjugation by inversion between homologs 1 (int22-1) and 2 (int22-2) or between homologs 1 (int22-1) and 3 (int22-3) of intron 22 in the blood coagulation factor VIII (F8) gene.
[0076] The present invention relates to a method of editing an inversion of the blood coagulation factor VIII (F8) gene or a method of preventing or treating hemophilia, the method including administering, to an individual, (i) a Cas protein or a nucleotide encoding the Cas protein; and (ii) at least one guide RNA that specifically targets a sequence region of an int22-1/3 homolog or int22-1/2 homolog resulting from conjugation by inversion between homologs 1 (int22-1) and 2 (int22-2) or homologs 1 (int22-1) and 3 (int22-3) of intron 22 in the blood coagulation factor VIII (F8) gene.
[0077] The present invention also relates to a composition for editing an inversion of the blood coagulation factor VIII (F8) gene, which includes: (i) a Cas protein or a nucleotide encoding the Cas protein; and (ii) at least one guide RNA which specifically targets a sequence region of an int22-1/3 homolog or int22-1/2 homolog resulting from conjugation by inversion between homologs 1 (int22-1) and 2 (int22-2) or between homologs 1 (int22-1) and 3 (int22-3) of intron 22 in the blood coagulation factor VIII (F8) gene.
[0078] Another embodiment of the present invention relates to a CRISPR/Cas system for editing an inversion of the blood coagulation factor VIII (FVIII) gene, which uses a guide RNA that targets at least one sequence selected from sequences represented by SEQ ID NOS: 1 to 42 and specifically targets the corresponding sequence, and a Cas protein.
[0079] The guide RNA for reversing an inversion according to the present invention targets the sequence region resulting from conjugation by inversion. The sequence region may be a range corresponding to an int22-1/2 homolog where int22-1 and int22-2 homologs are conjugated to each other, and an int22-1/3 homolog where int22-1 and int22-3 homologs are conjugated to each other, and particularly, may be a range including sequences having a maximum size of 12 kb, including 5 and 3 directions with respect to the conjugated position.
[0080] At least one guide RNA and the Cas protein which constitute the CRISPR/Cas system of the present invention, wherein the at least one guide RNA targets a sequence region resulting from conjugation between int22-1 and int22-2 or int22-3, which are homologs present in intron 22 (int22) in the FVIII gene, may be intracellularly delivered via (a) a vector including nucleic acid sequences expressing at least one guide RNA and the Cas protein, (b) a ribonucleoprotein (RNP) or RNA-guided engineered nuclease (RGEN) consisting of at least one guide RNA and the Cas protein, and (c) at least one guide RNA and mRNA by which the Cas protein is encoded, but the present invention is not limited thereto.
[0081] In the present invention, the Cas protein may be a nuclease selected from Cas3, Cas9, Cpf1, Cas6, and C2c2, particularly the Cas protein of CRISPR/Cas type II, and more particularly a Campylobacter-derived Cas protein.
[0082] In the present invention, the Cas protein may be a nuclease simply isolated from a microorganism, but may be modified to (a) a variant in which some amino acid sequences are substituted, (b) a fusion protein with an affinity tag, recombinase, transposon, or the like, or (c) a variant to which nuclear localization sequences (NLS) are added, but the present invention is not limited thereto.
[0083] In the present invention, the guide RNA may consist of (a) crRNA including a guide sequence and tracrRNA, (b) a fusion body of crRNA including a guide sequence and tracrRNA (chimeric guide RNA), or (c) crRNA including a guide sequence, but the present invention is not limited thereto.
[0084] The CRISPR/Cas system of the present invention may be included in a composition for editing an inversion of an int22-1/2 or int22-1/3 homolog of the FVIII gene for the treatment of hemophilia A.
[0085] The composition for editing an inversion enables normal FVIII expression by genetic correction using the CRISPR/Cas system, and thus may induce a semi-permanent treatment effect for hemophilia A.
[0086] A target disease to be treated using the composition of the present invention is severe hemophilia A, which corresponds to symptoms in the case where the activity of FVIII is less than 1% that of normal individuals.
[0087] The components of the CRISPR/Cas system included in the composition, i.e., at least one guide RNA targeting a sequence region of an int22-1/3 homolog or int22-1/2 homolog present in intron 22 (int22) in the FVIII gene, and a Cas protein may be intracellularly delivered via (a) a vector including nucleic acid sequences expressing at least one guide RNA and the Cas protein, (b) a ribonucleoprotein (RNP) consisting of at least one guide RNA and the Cas protein, or (c) at least one guide RNA and mRNA by which the Cas protein is encoded, but the present invention is not limited thereto.
[0088] In the CRISPR/Cas system of the present invention and the composition using the same, the vector may be one or more vectors, and may be a plasmid vector or a viral vector, and the viral vector may be, particularly, an adenovirus vector, an adeno-associated virus vector, a lentivirus vector, or a retrovirus vector, but the present invention is not limited thereto, and particularly, the viral vector may be an adeno-associated virus vector.
[0089] The vectors of the present invention may be delivered to a cell by microvehicles, liposomes, exosomes, nanoparticles, a gene gun, or the like, but the present invention is not limited thereto, and it will be obvious to those of ordinary skill in the art to select a delivery method suitable for the structural characteristics of vectors and the characteristics of cells to which the vectors are to be delivered.
[0090] The composition of the present invention may be administered, to a subject in need thereof, in a sufficient or effective amount. The effective amount or sufficient amount indicates an amount that is used alone or in combination with one or more other therapeutic compositions, protocols, and therapies in a single dose or multiple doses to benefit a subject or provide predicted or desired results for a subject for a certain period of time.
[0091] An AAV vector dose for achieving a therapeutic effect may be provided as a vector genome dose/kg (body weight) (vg/kg), but may vary depending on (a) the administration route, (b) expression levels of therapeutic genes required to achieve a therapeutic effect, (c) any host immune response to the AAV vector, and (d) the stability of an expressed protein. When using hemophilia as an example, generally, in order to achieve a therapeutic effect, a blood coagulation factor concentration that is greater than 1% of the factor concentration found in a normal individual is needed in order to change a severe disease phenotype to a moderate phenotype. A severe phenotype is characterized by joint damage and life-threatening bleeding. To convert a moderate disease phenotype into a mild phenotype, a blood coagulation factor concentration greater than 5% of a normal level is required.
[0092] The CRISPR/Cas system and composition of the present invention may be incorporated into pharmaceutically acceptable carriers or excipients along with an AAV vector, additional compositions, agents, drugs, and biological agents (proteins), thereby preparing the same as a pharmaceutical composition. In particular, such a pharmaceutical composition is useful for administration and delivery to a patient or subject in vivo or in vitro.
[0093] When prepared as such a pharmaceutical composition, the pharmaceutical composition includes a pharmaceutically acceptable carrier. Pharmaceutically acceptable carriers commonly used in formulation include lactose, dextrose, sucrose, sorbitol, mannitol, starch, acacia gum, calcium phosphate, alginates, gelatin, calcium silicate, microcrystalline cellulose, polyvinylpyrrolidone, cellulose, water, syrup, methyl cellulose, methylhydroxy benzoate, propylhydroxy benzoate, talc, magnesium stearate, mineral oil, saline, phosphate-buffered saline (PBS), or media, but the present invention is not limited thereto. The pharmaceutical composition may further include, in addition to the above-described components, a lubricant, a wetting agent, a sweetener, a flavor enhancer, an emulsifying agent, a suspension agent, a preservative, or the like. Suitable pharmaceutically acceptable carriers and formulations are described in detail in Remington's Pharmaceutical Sciences (19th ed., 1995). A suitable dose of the pharmaceutical composition may be variously prescribed according to factors, such as patient age, body weight, gender, pathological conditions, diet, administration time, administration routes, excretion speed, and reaction sensitivity.
[0094] In addition, the pharmaceutical composition may be formulated using a pharmaceutically acceptable carrier and/or excipient by a method that may be easily carried out by one of ordinary skill in the art to which the present invention pertains, to be prepared in a unit dose form or to be contained in a multi-dose container. In this regard, the formulation may be a solution in oil or an aqueous medium, a suspension, a syrup, an emulsion, an extract, powder, granules, tablets, or capsules, and may further include a dispersing agent or a stabilizing agent.
[0095] Administration routes for the pharmaceutical composition include oral, transdermal, or local delivery and administration via a systemic, topical, local, or other route, for example, injection, infusion, ingestion, or inhalation. Such delivery and administration routes include parenteral administration such as intravenous administration, intramuscular administration, intraperitoneal administration, intradermal administration, subcutaneous administration, intranasal administration, intracranial administration, transdermal or topical administration, mucosal administration, or intrarectal administration. Examples of administration and delivery routes include intravenous (i.v.) administration and delivery, intraperitoneal (i.p.) administration and delivery, intraarterial administration and delivery, intramuscular administration and delivery, parenteral administration and delivery, subcutaneous administration and delivery, intrapleural administration and delivery, topical administration and delivery, transdermal administration and delivery, intradermal administration and delivery, percutaneous administration and delivery, intracranial administration and delivery, intraspinal administration and delivery, oral (digestive) administration and delivery, transmucosal administration and delivery, respiratory administration and delivery, intranasal administration and delivery, administration and delivery via intubation, intrapulmonary administration and delivery, administration and delivery via intrapulmonary instillation, buccal administration and delivery, sublingual administration and delivery, intravascular administration and delivery, intraarterial administration and delivery, intraluminal administration and delivery, iontophoretic administration and delivery, intraocular administration and delivery, administration and delivery via the eye, intraglandular administration and delivery, intratracheal administration and delivery, and intralymphatic administration and delivery. Delivery and administration doses may be based on existing protocols, and may be determined selectively in human clinical trials or using animal disease models or experimentally. Doses in the initial study may be based on animal studies described herein for mice or dogs, for example.
[0096] The subject to be treated with the CRISPR/Cas system or the composition according to the present invention includes animals, typically mammals such as humans, non-human primates (apes, gibbons, gorillas, chimpanzees, orangutans, macaques), pets (dogs and cats), livestock (poultry such as chickens and ducks, horses, cows, goats, sheep, pigs), laboratory animals (mice, rats, rabbits, guinea pigs), and animal disease models, e.g., animal models including mice and other animals with blood-clotting diseases and other diseases known to those of ordinary skill in the art.
[0097] Each composition and process used in the present invention has already been described above, and thus descriptions thereof will be omitted herein to avoid excessively repeated descriptions.
[0098] Hereinafter, the present invention will be described in further detail with reference to the following examples. It will be obvious to those of ordinary skill in the art that these examples are provided for illustrative purposes only and are not intended to limit the scope of the present invention.
EXAMPLES
Example 1: Preparation of CRISPR/Cas System
[0099] 1) Selection of Guide RNAs
[0100] CRISPR RGEN Tools (http://www.rgenome.net; Park et al., Bioinformatics 31:4014-4016, 2015) was used to select guide RNAs having 5-NNNNRYAC-3 as a PAM sequence and present in all three regions of the human FVIII gene, i.e., Chr-X:154109091-154118603 (Consortium Human Build 37 (Grch37) standard) corresponding to the int22-1 homolog, Chr-X:154606156-154615709 present apart therefrom by approximately 500 kb and corresponding to the int22-2 homolog, and Chr-X:154684313-154693870 corresponding to the int22-3 homolog. Subsequently, guide RNAs without 1 base or base mismatch except for on-target sites on the human genome were further selected and used for an activity test.
[0101] 2) On-Target/Off-Target Effects of Guide RNAs
[0102] Each cloned pU6-CjSgRNA was transfected into HEK293, which is a human cell line, along with pCMV-CjCas9 using Lipofectamine 2000. After about 2 to 3 days, genomic DNA was extracted therefrom and on-target sites were amplified by PCR, and then additional PCR for ligation between adapters specific to sequencing primers for next-generation sequencing and TruSeq HT dual index primers was carried out. Thereafter, reads obtained through paired sequencing were analyzed to confirm insertions or deletions (Indels) at on-target genome locations, through which guide RNAs with activity were selected.
[0103] For off-target analysis of the selected guide RNAs, first, off-target lists with 3-base mismatch were selected by an in-silico method using Cas-Offinder of CRISPR RGEN Tools, a specific region on the genome corresponding to each off-target was verified through targeted deep sequencing. As a second method, human whole genomic DNA treated with each guide RNA and the CjCas9 protein at 37 C. overnight was subjected to whole genome sequencing, and then potential lists were obtained through Digenome-seq analysis. Thereafter, targeted-deep sequencing was performed on a specific region on the genome of each of off-target candidates to verify whether indels were introduced at off-target positions. In addition, next generation sequencing (NGS) was performed on a specific region on the genome of each of off-target candidates to verify whether indels were introduced at off-target positions. Off-target lists are shown in Tables 3 to 11 below. As illustrated in
TABLE-US-00003 TABLE3 Mismatch SEQID (base) Targetsequence-PAM No. Chr Location Direction mismatch(0) Cj2-ON GTATACTGTGTTAAGCATTGA-GACAA 43 X CAC mismatch(4) Off1 GTATAaTGTGTaAttCATTGA-CCTAATAC 44 chr8 99961923 Off2 GtgTACTtTGcTAAGCtTTGA-TTATACAC 45 chr3 175280721 Off3 GTATACTGTtTTAAGgtTTGt-ATCTGTAC 46 chr7 17171284 + Off4 cTATAaTGTGTTAAGtgTTGA-GAATACAC 47 chr7 121479898 Off5 cTATtCTtTGTTAAGCATTtA-CTAAGTAC 48 chr5 138186574 + Off6 GTgTACTGTGTcAtGCATTGt-CTAAGCAC 49 chr5 162926700 Off7 cTATACTGTGccAAGCATTGt-TCAGGTAC 50 chr1 235243115 Off8 tTATAaTGTGTTAtGCAgTGA-CTGAACAC 51 chr13 58076595 Off9 GTATACTGTGTgcAGCAaaGA-ATTAACAC 52 chr2 142505968 + Off10 GTATACTGgGTTAAGCccTaA-ACTAGTAC 53 chr2 225054936 Off11 caATACTGTtTTAAGCATTGc-TACTACAC 54 chr9 112040664 + Off12 aTATAaTGTGTTAgGCATTaA-TGAAGCAC 55 chrX 100444393 + Off13 agAcACTGTGTTAAGCATTaA-ACATACAC 56 chr11 1033137291
TABLE-US-00004 TABLE4 Mismatch SEQID (base) Targetsequence-PAM No. Chr Location Direction mismatch(0) Cj12-ON GCATCTGGGGACGGCAGAGGG-TATC 57 X ACAC mismatch(3) Off1 GCATaTGGGGcCGGCAGAGtG-GCTCACA 58 chr8 41337333 C Off2 GCATCTctGcACGGCAGAGGG-AAACACA 59 chr7 45582779 C Off3 GCATCTGGGGtgGGgAGAGGG-TTGGATA 60 chr9 92253496 + C mismatch(4) Off4 GgAgCTaGGGAgGGCAGAGGG-AGCAGC 61 chr12 2237879 AC Off5 GCAgCTGGtGACaaCAGAGGG-GCCGGCA 62 chr3 13066365 + C Off6 GCtTCTGGGGACGGCccAGaG-GCAGACA 63 chr1 15932696 C Off7 GCAgCTGGGGAtGGaAGAGGc-ATGAACA 64 chr1 161986421 + C Off8 GCATaTGaGGACGGCAGcaGG-TGAGGCA 65 chr13 113844761 + C Off9 tCATaTGGGGACGGCAGAGaa-AACGGCA 66 chr2 222511138 C Off10 GCcTCTGGGGtCaGCAGAaGG-CACGGCA 67 chr2 240624171 C Off11 GCAgCTcGGGACGcCAGAGGc-AAATGCA 68 chr15 63441723 + C
TABLE-US-00005 TABLE5 Mismatch SEQID (base) Targetsequence-PAM No. Chr Location Direction mismatch(0) Cj13-ON TAGGAAGAGAGGCCTGAAACG-ACA 69 X TACAC mismatch(3) Off1 TAGGAAGAtAGGCCTGccACG-TTTCA 70 chr10 126477396 + CAC mismatch(4) Off2 cAGGAAGAGAGcCCTGgAAgG-TCTT 71 chr3 68403798 + GCAC Off3 TAGagAGAGAGcCCTtAAACG-CCAG 72 chr3 112100249 ATAC Off4 TAaGAgGAGAtGCCTGAcACG-TGGC 73 chr1 244962338 ACAC Off5 TgGGgAGAGgGGCCTGAAACc-CAC 74 chr2 235938779 + ACAC Off6 TgGGcAGAGAGGCCTGAgACc-TGGA 75 chr19 32496611 ACAC Off7 TAaGAAGAGAGaCCTGAActG-ACAC 76 chr6 139741563 ACAC Off8 TAtGAAtAGAGGCCaGAAACc-AAGA 77 chr20 60943338 + GCAC
TABLE-US-00006 TABLE6 Mismatch SEQID (base) Targetsequence-PAM No. Chr Location Direction mismatch(0) Cj26-ON TCTTGAGGCCCACCCGCCCCA-ACG 78 X AACAC mismatch(3) Off1 TCaTtAGGCCCAgCCGCCCCA-CGAA 79 chr4 13851509 + GTAC mismatch(4) Off2 TCTTGAGGaCaACCCtCCCCc-CAGC 80 chr8 1709161 + ACAC Off3 TCTTaAGGCCCAgCgtCCCCA-CCATG 81 chr3 52380671 + CAC Off4 TCcTGAGGCCaACCaGCCCCt-ATTCA 82 chr3 184577591 CAC Off5 TCcTGAGGaCaAaCCGCCCCA-GGGC 83 chr7 73328692 GTAC Off6 TCaTGAGGCCCcaCCGCCCCg-CCCG 84 chr1 3590481 + GCAC Off7 TCTgGAGGgCCACCCGCaCCc-CCTC 85 chr2 10365768 ACAC Off8 aCcTcAGGCCCACCCGtCCCA-GGTC 86 chr22 39131563 + ACAC Off9 TCTTaAGGCCCAgCtaCCCCA-CCCTA 87 chr15 73941008 + TAC Off10 cCcTGAGGCCCACaCaCCCCA-CAGG 88 chr6 33430699 ATAC Off11 TCTTGAGGCCCcCCtGCtCCt-TGGTA 89 chr11 113288464 CAC
TABLE-US-00007 TABLE7 Mismatch SEQID (base) Targetsequence-PAM No. Chr Location Direction mismatch(0) Cj30-ON GCTGAACGTTACCAGCACCCC-GAGAACAC 90 X mismatch(4) Off1 GCTtAAtGTaACCAGCAtCCC-AACTGCAC 91 chr12 108947583 + Off2 GCaGAACaTTACCAGtACCCt-CACAACAC 92 chr5 80572029 + Off3 GgTGAAaaTTACCAcCACCCC-CTGCACAC 93 chr11 114050717 +
TABLE-US-00008 TABLE8 Mismatch SEQID (base) Targetsequence-PAM No. Chr Location Direction mismatch(0) Cj31-ON TTTATAGACGACGGACACCAA-AACCACAC 94 X mismatch(4) Off1 TTaATAGAtGAaGcACACCAA-CATGGCAC 95 chr3 135386579 + Off2 TTgATAGACGcCGGAtgCCAA-GCTGGTAC 96 chr10 76176232 Off3 TTTATAGACtACctACACCtA-AAAAGCAC 97 chrX 142567739
TABLE-US-00009 TABLE9 Mismatch SEQID (base) Targetsequence-PAM No. Chr Location Direction mismatch(0) Cj34-ON CGGAACGCCATCCTGCTTTGG-GGGAACAC 98 X mismatch(3) Off1 CtGAAaGCCATCCTGCcTTGG-TTATATAC 99 chr2 153036292 + mismatch(4) Off2 CGGAACtCCAcCtTGCTcTGG-CCCAGCAC 100 chr7 75597775 +
TABLE-US-00010 TABLE10 Mismatch SEQID (base) Targetsequence-PAM No. Chr Location Direction mismatch(0) Cj36-ON AATGCATCCTTTGGGCCGAAG-TTGCAC 101 X mismatch(4) Off1 AATGCtTCCTTTGGGCtGAgc-ACACACAC 102 chr1 71100731 + Off2 AATGCATCCTTccGGtCtAAG-AAACACAC 103 chr18 10387958
TABLE-US-00011 TABLE11 Mismatch SEQID (base) Targetsequence-PAM No. Chr Location Direction mismatch(0) Cj41-ON GGGTATCTGAGCATAATTCTA-GGAAAC 104 X AC mismatch(3) Off1 GGGTATCTGAtCtAcTTCTA-CCCAACAC 105 chr14 68310240 mismatch(4) Off2 GGGTATtTGAcCATtATTaTA-TGAGATAC 106 chr8 124870465 + Off3 GttTATCTGgGCATAATgCTA-CTGCACAC 107 chr12 22066265 Off4 GGGTAgCTGAtCATAATaCTg-AAAGACAC 108 chr7 86399626 + Off5 GtGTATCTGAGgATAATcCTg-GAAGATAC 109 chr10 5232205 + Off6 tGGTATCTGAGCATAAaTtaA-AAGAGTAC 110 chr6 70660363 + Off7 taGTATCTGAGCATcATTaTA-GAAGATAC 111 chr6 107643781 + Off8 GGGgAgCTGAtCATAATTCcA-AAAGACAC 112 chr11 58355632
Example 2: Verification of Inversion-Inducing Efficiency by CRISPR/Cas9 System in Normal Cells
[0104] Each of the guide RNAs found to have activity was transfected into HEK293 along with a pU6 plasmid and a pCMV-CjCas9 plasmid using Lipofectamine 2000 (manufacturer), and then genomic DNA was extracted. Subsequently, long-distance PCR (LD-PCR) assay was performed using Expand Long Template PCR system (Roche). LC-PCR was performed in a total volume of 25 l using 600 ng of genomic DNA, 7.5% DMSO, 310 M dGTP, 190 M 7-deaza-dGTP, 500 M of dATP, dCTP, and dTTP each, 0.2 M inversion-specific primer, 1 Unit Taq Polymerase, and 10PCR buffer #2. Thereafter, 15 l of samples were subjected to agarose electrophoresis to identify inversion-specific PCR bands corresponding to about 10.8 kb.
[0105] To quantitatively detect inversion efficiency, a standard curve was made through the LD-PCR results of a sample obtained by mixing genomic DNA obtained from induced pluripotent stem cells (iPSCs) of intron-22-inversion patients and genomic DNA of WT HEK293 in a specific ratio, which was then compared with the LD-PCR results by each guide RNA.
Example 3: Production of Animal Model with Int22-Inverted Genes of FVIII
[0106] sgRNAs of SpCas9, exhibiting activity in Chr-X:75243250-75243500 (GRCm38 standard), which is a FVIII int-22 position of mouse genome, and Chr-X:75562900-75563100, which is an intergenic locus region distanced therefrom by about 320 kb, were selected through screening. The sequences of the selected guide RNAs are as follows.
TABLE-US-00012 TABLE12 GuideRNA Sequence(5--->3) SEQIDNO. mF8-int22-Sp1 GTGGCCTGGTCAAGCTTATC 113 mF8-int22-Sp2 TTTGCAGAACAATCCCCTTT 114 mF8-int22-Sp3 TTACTAAGGGCTGAACAAGG 115 mF8-distal-Sp1 AGCGGGGCTAACACTGCACA 116 mF8-distal-Sp2 GCATACGGGTCAGATGCCTG 117 mF8-distal-Sp3 CCAAATACGGCAGGGCATAC 118
[0107] Target sequences containing the PAMs of hF8-INT22-Cj12 and hF8-INT22-Cj30 to be used for treatment were knocked in on opposite sides by inverted repeats, and at the same time, a single-stranded oligodeoxynucleotide (ssODN) for inducing inversion with respect to a sequence where double strand break occurs by a guide RNA in mouse genome was synthesized. The sequences of ssODNs are as follows.
TABLE-US-00013 TABLE13 SEQID ssODN Sequence NO. ssODN-1 acaccatctcactgggtgccatggaa 119 (mF8- ccacccctcatccaaacacaccattt int22) ggccagtgacttccagatGGCATCTG GGGACGGCAGAGGGTATCACACGGCT GAACGTTACCAGCACCCCGAGAACAC acaaggtacaaactaaaggtcccaag agggaccttactgaaaagttattgaa tttaaagaacaatataag ssODN-2 ttgttctattaacccaggttcattaa 120 (mF8- cataaaaatagcattgttcccccagt distal) acaacacaagtacctcctGTGTTCTC GGGGTGCTGGTAACGTTCAGCCGTGT GATACCCTCTGCCGTCCCCAGATGCC tgccctgccgtatttggttgtgccag cacctcttaccagacagtgatctgag gactcagtggacccagat
[0108] In the ssODN sequences, capital letters denote a human sequence to be knocked in, and underlined sequences denote homolog arm sequences for inversion. SpCas9 mRNA (50 ng/l), sgRNA (5 ng/l each), ssODN-1 (20 ng/l), and ssODN-2 (20 ng/l) were simultaneously microinjected into mouse fertilized eggs and implanted into surrogate. Subsequently, newborn FO mice were subjected to genotyping through PCR, TA-sequencing, and whole-genome sequencing to select a mouse model in which 320 kb containing exonl-22 of FVIII was inverted and a desired human sequence was knocked in on opposite sides of the 320 kb interval by an inverted repeat structure. Thereafter, a bleeding test was performed to determine whether symptoms of hemophilia by inversions were exhibited and ELISA measurement was performed on mouse FVIII, to further carry out verification for a hemophilia disease model.
Example 4: Production of Vector System for Delivering CRISPR/Cas9 System
[0109] 1) RGEN
[0110] A pU6-CjSgRNA vector, including a U6 promoter and the sequence of a sgRNA of Campylobacter jejuni from which a guide RNA sequence was excluded, was synthesized and produced. Subsequently, a DNA oligo corresponding to each guide RNA was synthesized and cloned into the pU6-CjSgRNA vector using a BsmBI site, thereby producing a plasmid capable of expressing each of the selected guide RNAs in mammalian cells. A CjSgRNA sequence expressed by the produced plasmid is as follows: 5-NNNNNNNNNNNNNNNNNNNNNNGUUUUAGUCCCUGAAAAGGGACUAAAAUAAAGAGUUUG CGGGACUCUGCGGGGUUACAAUCCCCUAAAACCGCUUUUU-3 (SEQ ID NO.: 121, N is an arbitrary nucleotide), wherein 22 Ns at the 5-end correspond to a target sequence. A vector including a CMV promoter and a sequence optimized with a human codon of CjCas9 or SpCas9 was synthesized and produced, and each Cas9 includes NLS and HA tags at the C-terminal thereof and further includes NLS also at the N-terminal thereof according to the type of vector.
[0111] 2) AAV
[0112] A vector including, between inverted tandem repeats (ITRs) of AAV2, a U6 promoter, an sgRNA sequence, a promoter for mammalian expression (CMV, EFS or LP1), and Cas9 having NLS and HA tags at the C- or N-terminal thereof and optimized with a human codon was synthesized and produced (pAAV-ITR-sgRNA-Cas9). For the production of an AAV, a vector for the pseudotype of an AAV capsid, pAAV-ITR-sgRNA-Cas9, and a pHelper vector were simultaneously transfected into HEK293 cells in a molar concentration ratio of 1:1:1. After 72 hours, the cells were lysed to obtain virus particles, followed by separation and purification by a step-gradient ultracentrifuge using iodixanol (Sigma-Aldrich), and the AAV were quantitatively measured by a titration method using qPCR.
Example 5: Confirmation of Indels and Reversed State of FVIII int22-1/3 in Animal Model
[0113] Mouse embryonic fibroblasts (MEFs) were isolated from FVIII-inverted mice at 2 to 8 weeks of age in which a human target was knocked in and treated with the AAV at a density of 110.sup.11 vg to 110.sup.12 vg through tail vein injection. After about 5-7 days, genomic DNA was extracted from the cells and PCR was performed thereon using PCR primers capable of amplifying the same by reversion of the inversion. Thereafter, the PCR product was cloned into a TA-vector and the reversion of the inversion was confirmed through sequencing. Similarly, the reversion of the inversion was confirmed in vivo. The AAV was injected at a density of 110.sup.11 vg to 110.sup.12 vg into the FVIII-inverted mice at 2 to 8 weeks of age in which a human target was knocked in, through tail vein injection. After 4 weeks or 8 weeks, genomic DNA was extracted from each organ, and PCR was performed thereon using PCR primers capable of amplifying the same by reversion of the inversion, and then the reversion of the inversion was confirmed through sequencing of the PCR product. In addition, genomic DNA extracted from each organ was subjected to a targeted deep-sequencing to confirm in-vivo indels by CRISPR.
INDUSTRIAL APPLICABILITY
[0114] Hemophilia A is a disease that is treated only by replacement therapies because there is no therapeutic agent for fundamental treatment, and even though techniques for gene therapy via therapeutic gene insertion are applied, it is difficult to deliver a therapeutic gene into the human body due to a limitation in packaging in a vector because the size of the FVIII gene is very large. Recently, to address this problem, methods using a minimal domain of FVIII have entered into clinical trials, but only short-term effects are expected, and thus there is a need to develop a fundamental and long-term treatment method. Genetic correction using a CRISPR/Cas9 system is a more fundamental and long-term treatment method, but commonly used Cas9s are difficult to package in an AAV, which is the most clinically proven delivery means, due to a large size thereof, and due to a relatively short PAM, there is a relatively high possibility of potential off-targeting, and thus existing Cas9s are not suitable for use as a component of a gene therapeutic agent.
[0115] A CRISPR/Cas system according to the present invention uses a small size of Cas9 and a guide RNA in accordance therewith and consists of a system capable of easily packaging all components of the CRISPR/Cas system in a single AAV, which is impossible in existing Cas9s having a large size. In addition, the CRISPR/Cas system can induce normal gene expression due to the capability to correct inverted genes, and thus is excellent as a technology for efficiently overcoming mutations of genes having a large size, which are difficult to deliver intracellularly, through genetic correction. In particular, the CRISPR/Cas system can induce normal FVIII expression by reversing an inversion of the FVIII gene, and thus is useful for the treatment of hemophilia A.
Sequence List Free Text
[0116] Electronic file attached.