MEANS AND METHODS FOR TREATING PARKINSON'S DISEASE
20190324017 ยท 2019-10-24
Inventors
Cpc classification
A61K31/192
HUMAN NECESSITIES
C12Q1/6897
CHEMISTRY; METALLURGY
C12Q1/6811
CHEMISTRY; METALLURGY
C12Q1/6897
CHEMISTRY; METALLURGY
A61K31/713
HUMAN NECESSITIES
C12Q2539/105
CHEMISTRY; METALLURGY
A61K31/7105
HUMAN NECESSITIES
C12Q1/6883
CHEMISTRY; METALLURGY
International classification
G01N33/50
PHYSICS
C12Q1/6811
CHEMISTRY; METALLURGY
A61K31/7105
HUMAN NECESSITIES
C12Q1/6897
CHEMISTRY; METALLURGY
C12Q1/6883
CHEMISTRY; METALLURGY
Abstract
The present invention relates to a method for screening a compound for Parkinson's disease treatment, comprising contacting a cell having a DJ-1 gene containing a c.192G>C mutation with a compound of interest, and testing whether said compound of interest prevents skipping of exon 3 of said DJ-1 gene, thereby identifying said compound as candidate for Parkinson's disease treatment. The present invention further relates to a compound which prevents skipping of exon 3 of the DJ-1 gene for use in a method of treatment of Parkinson's disease. The present invention further relates to a pharmaceutical composition comprising one or more compounds which prevents skipping of exon 3 of the DJ-1 gene for use in a method of treatment of Parkinson's disease.
Claims
1. An in vitro method for screening a compound for Parkinson's disease treatment, comprising (a) contacting a cell having a DJ-1 gene containing a c.192G>C mutation with a compound of interest; (b) testing whether said compound of interest prevents skipping of exon 3 of said DJ-1 gene, thereby identifying said compound as candidate for Parkinson's disease treatment.
2. The method of claim 1, wherein said cell has a c.192G>C mutation in at least one allele of the DJ-1 gene.
3. The method of any one of the preceding claims, wherein said cell has a c.192G>C mutation in both alleles of the DJ-1 gene.
4. The method of any one of the preceding claims, wherein said c.192G>C mutation leads to a p.E64D amino acid exchange in the DJ-1 protein.
5. The method of claim 1, wherein said cell is a mammalian cell, preferably a human cell line.
6. The method of claim 5, wherein said cell contains a DJ-1 gene having a c.192G>C mutation.
7. The method of any one of the preceding claims, wherein said cell is a primary fibroblast cell, an immortalized fibroblast cell, an induced pluripotent stem cell (iPSC), a small-molecule-derived neuronal precursor cell (smNPC), or a midbrain-specific dopaminergic neuro (mDA) from a patient having at least one c.192G>C mutation in a DJ-1 gene.
8. The method of any one of the preceding claims, wherein the DJ-1 gene containing a c.192G>C is fused to a reporter gene.
9. The method of any one of the preceding claims, wherein said testing in step (b) is accomplished by western blot, qPCR, mass spectrometry, detection of reporter gene activity, microscopy-based assay, or FACS-based assay
10. A compound which prevents skipping of exon 3 of the DJ-1 gene for use in a method of treatment of Parkinson's disease.
11. The compound for the use of claim 10, which is obtainable by the method of any one of claims 1 to 9.
12. The compound for the use of claim 10 or 11, which is a mutated U1 snRNA allowing the U1 complex to bind to the mutant pre-mRNA.
13. The compound for the use of claim 12, wherein said mutated U1 snRNA has the nucleotide sequence shown in SEQ ID NO: 7.
14. The compound for the use of claim 10 or 11, which is an optionally substituted C1-C10 alkyl carboxylic acid.
15. The compound for the use of claim 14, which is butyric acid.
16. The compound for the use of claim 14, which comprises at least one aryl and/or at least one heteroaryl substituent.
17. The compound for the use of claim 16, which is a compound having the structural formula (I): ##STR00003## wherein R1 is optionally substituted aryl or optionally substituted heteroaryl; and n is an integer of from 0 to 8.
18. The compound for the use of claim 17, which is 4-phenyl butyric acid.
19. The compound for the use of claim 10 or 11, which is a compound having the structural formula (II): ##STR00004## wherein R.sup.2 is hydrogen or optionally substituted alkyl; and X is hydrogen or halogen.
20. The compound for the use of claim 19, which is N6-furfuryladenine.
21. The compound for the use of claim 19, which is 2-chloro-N-[(furan-2-yl)methyl]-7H-purin-6-amine or 2-chloro-N-[(furan-2-yl)methyl]-N-methyl-7H-purin-6-amine.
22. A pharmaceutical composition comprising one or more compounds which prevents skipping of exon 3 of the DJ-1 gene for use in a method of treatment of Parkinson's disease.
23. The pharmaceutical composition for the use of claim 22, wherein the one or more compound comprises a compound as defined in any one of claims 10 to 21.
24. The pharmaceutical composition for the use of claim 22 or 23, wherein the composition comprises at least two compounds as defined in any one of the claims 10 to 21.
25. The pharmaceutical composition for the use of any one of claims 22 to 24, wherein the composition comprises a compound as defined in any one of claims 14 to 18 and a compound as defined in any one of claims 19 to 21.
26. The pharmaceutical composition for the use of any one of claims 22 to 25, wherein the composition comprises 4-phenyl butyric acid and 2-chloro-N-[(furan-2-yl)methyl]-7H-purin-6-amine.
27. The method, compound or pharmaceutical composition of any one of the preceding claims, wherein the Parkinson's disease is early onset Parkinson's disease, idiopathic Parkinson's Disease, Parkinson's disease associated with aberrant splicing, Parkinson's disease associated with a U1-dependent splicing defect, PARK7-associated Parkinson's disease, or Parkinson's disease associated with a c.G192C mutation in the PARK7 gene.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0078]
[0079]
[0080]
[0081]
[0082]
[0083]
[0084]
[0085]
[0086]
[0087]
[0088]
[0089]
[0090]
[0091]
[0092]
[0093]
[0094]
[0095]
[0096]
[0097]
[0098]
[0099]
[0100]
[0101]
[0102]
[0103]
[0104]
EXAMPLES
[0105] The following examples illustrate the invention. These examples should not be construed as to limit the scope of this invention. The examples are included for purposes of illustration and the present invention is limited only by the claims.
Example 1: Establishment of an iPSC Model from Patient Derived Fibroblasts
[0106] Reprogramming of both homozygous c.192G>C DJ-1 mutation carriers and the heterozygous c.192G>C DJ-1 mutation carrier was performed using retroviral vectors expressing the classical four Yamanaka factors. A minimum of 20 clones per individual was picked, passaged and stocks were frozen down for subsequent characterization of the clones. Characterization included tests about pluripotency and stem cell characteristics of the clones. Therefore, qPCR was performed to ensure silencing of the four exogenous viral factors and endogenous expression of pluripotency markers, ICC was used to show the expression of stem cell markers. Furthermore, the picked clones were tested for major chromosomal aberrations and mutations via G-banding and a genotyping microarray analysis. Additionally, the mutation carrying exon 3 of DJ-1 was sequenced in all clones to verify the correct genotype. Likewise, the ability of the clones to differentiate in cell types of all three germ layers was tested by ICC after differentiating the clones. At least two different clones per one individual were correctly characterized and a sufficient number of stocks was frozen down.
Example 2: DJ-1 Protein Level Varies Between Different Human Control Fibroblasts and c.192G>C DJ-1 Mutation Carriers do not Express DJ-1 Protein
[0107] Loss of DJ-1 function is a rare cause of familial PD. To calculate the steady state DJ-1 protein levels in different healthy individuals in comparison to the c.192G>C DJ-1 mutation carriers, samples from the respective individuals were analyzed on a low percentage SDS-PAGE gel (4% acrylamide). We took cell lysates from three different female human control individuals and from two different male human control individuals. Lysates from fibroblasts of family members of the c.192G>C DJ-1 family were taken as well. Lysates were taken from individual IV.9 who is a female heterozygous c.192G>C DJ-1 mutation carrier, from individual III.7, who is a male heterozygous c.192G>C DJ-1 mutation carrier and from individual III.11 and III.13 who are both homozygous c.192G>C DJ-1 mutation carriers; III.11 being female and premotor diseased and III.13 being the male index patient with motor symptoms.
[0108] DJ-1 proteins were examined by immunoblotting using a DJ-1 specific antibody (
[0109] Primary fibroblasts from patients are a useful tool but at the same time they only grow very slow and particularly from older people they become senescent after a certain times of passaging. To not lose the ability to work with fibroblasts from the c.192G>C DJ-1 family we immortalized fibroblasts using a lentiviral vector expressing SV40. This opened the opportunity to work with cells from patients that grow faster and can be used at higher passages.
[0110] To test if the immortalization had an effect on DJ-1 protein level and to see if the c.192G>C DJ-1 mutation still results in undetectable DJ-1 protein in immortal cells we repeated the Western blot for DJ-1 in this cell type which has a higher metabolic rate. Indeed DJ-1 protein could not be detected in both homozygous c.192G>C DJ-1 mutation carriers. The DJ-1 protein level of the heterozygous c.192G>C DJ-1 mutation carrier was reduced to about 20% of the amount seen in the healthy control (
Example 3: The c.192G>C DJ-1 Mutation Causes Skipping of DJ-1 Exon 3
[0111] The findings on DJ-1 protein level in c.192G>C DJ-1 mutation carriers raised interest in the underlying effect of the mutation. In a first experiment, RNA from patient derived fibroblasts was isolated, transcribed into cDNA and amplified DJ-1 by PCR. Agarose gel revealed a reduced length of c.192G>C DJ-1 carrying cDNA and a normal size band as well as the short band in the heterozygous c.192G>C carrying cDNA. cDNA from a healthy control served as positive control for correct length (
[0112] Subsequently, these results led us to sequence the amplified cDNA to find out which sequence segment was missing in the cDNA of c.192G>C DJ-1 mutation carriers. Sequencing result shows that the end of DJ-1 exon 2 is directly followed by the beginning of exon 4. Complete sequence of DJ-1 exon 3 was missing or skipped.
[0113] After obtaining these results, we wanted to validate the finding of skipping of DJ-1 exon 3 in c.192G>C DJ-1 mutation carrying cells. Therefore, minigene assay to assess the effect of DJ-1 exon 3 on splicing was performed.
[0114] Minigene assay is used to analyze splicing of an exon of interest by cloning it into a vector (pSPL3) between two given exons (Exon A and Exon B) and subsequent analysis of the cDNA by rtPCR and sequencing (
[0115] In the minigene assay we tested wt exon 3 of the control and mutant exon 3 (carrying the c.192G>C DJ-1 mutation) from the index patient. At the same time minigene constructs were tested where the wt exon 3 was mutated to carry the c.192G>C mutation and the corrected exon from the index patient now carrying the wt allele. Agarose gel electrophoresis (
[0116] The c.192G>C DJ-1 mutation in exon 3 which does not lead to a premature stop codon and which does not cause a frameshift in the RNA caused the skipping of exon 3 in the minigene assay. In addition to the correction of the mutation in the patient exon 3 and the mutating of exon 3 to carry the c.192G>C DJ-1 mutation from a healthy control in the minigene, sequencing of the entire DJ-1 locus of the patient's DNA was performed. This was done in order to have a second confirmation that the c.192G>C DJ-1 mutation suffices to cause skipping of exon 3 in c.192G>C DJ-1 mutation carriers and other sequence variants can be excluded.
[0117] The minigene assay showed, in an artificial system, that the c.192G>C DJ-1 mutation which lies at the end of exon 3 leads to skipping of the respective exon. Skipping of this exon does not lead to a frameshift or a premature stop codon.
[0118] Next, we wanted to verify our minigene assay results in the different cell types available from the c.192G>C DJ-1 mutation carriers. Therefore, cells were grown under standard conditions, harvested, RNA was extracted and transcribed into cDNA. Quantitative qPCR was performed using cDNA and two different pairs of primers for amplification. One primer pair to detect correctly spliced DJ-1 as well as cDNA lacking exon 3 and another primer pair where one primer is complementary with exon 2 and exon 3, leading to only correctly spliced (with exon 3) DJ-1 cDNA detected by qPCR. qPCR was performed with cDNA from immortalized fibroblasts (
[0119] qPCR results generated with primers that detect normal and misspliced DJ-1 cDNA show a reduction which is not significant of DJ-1 RNA expression in all c.192G>C DJ-1 mutation carriers (
[0120] When looking at the qPCR results in iPSCs derived from these fibroblasts one sees a reduction in RNA expression of DJ-1 with and without skipping of exon 3 of the c.192G>C DJ-1 mutation carriers. With two clones (homozygous premotor diseased c4 and homozygous affected c4) being particularly low (
Example 4: Genetically Modified U1 Rescues Skipping of Exon 3
[0121] Further experiments aimed at analyzing why the c.192G>C base exchange leads to missplicing of DJ-1 pre-mRNA and skipping of exon 3 during the splicing process. Hypothesized mechanism of splicing of pre-mRNA carrying the c.192G>C DJ-1 mutation is shown (
[0122] As the c.192G>C DJ-1 mutation lies in the splice donor site normally it would get recognized by the U1 snRNP which is initiating the formation of the spliceosome. U1 snRNP recognizes the 5 splice site through base pairing at the 5 end. Of note, a hundred percent match is not necessary. In case of DJ-1 exon 3 in the wt situation there are already four mismatches between U1 snRNA and the splice donor site (
[0123] To test this hypothesis, one base of the U1 snRNA was modified in the sense that it matches the c.192G>C mutation in DJ-1 exon 3, namely C>G were exchanged as indicated in cyan (
[0124] In a splicing reaction where C>G mutant U1 snRNP is present and recognizes the splice donor sequence of c.192G>C mutant DJ-1 exon 3 skipping of this exon does not take place.
[0125] In order to test the hypothesis, the previously performed minigene experiment was repeated and a plasmid expressing c.192G>C U1 snRNA was co-transfected.
[0126] The minigene construct expressing wt exon 3 resulted in the inclusion of the exon 3 when the minigene assay was performed with the minigene construct alone, when wt U1 snRNP was co-transfected and when c.192G>C U1 snRNA was co-transfected, this is shown by the long prominent bands on the agarose gel picture in (
[0127] As c.192G>C U1 snRNA was able to rescue skipping of c.192G>C DJ-1 mutation carrying exon 3 in the minigene assay in HEK cells, the next step was to test the effect of c.192G>C U1 snRNA in patient-derived cells.
[0128] Immortalised fibroblasts of the index patient carrying the c.192G>C DJ-1 mutation were electroporated with empty vector, wt U1 snRNA or c.192G>C U1 snRNA, respectively. Electroporation with wt U1 snRNA caused a significant increase in DJ-1 RNA expression of normal RNA and RNA lacking DJ-1 exon 3 in comparison to electroporation with empty vector or c.192G>C U1 snRNA. c.192G>C U1 snRNA also caused higher RNA levels in comparison to empty vector (
Example 5: Rescue of DJ-1 Protein in Patient Based Cellular Models
[0129] The c.192G>C mutation causes mis-splicing of the DJ-1 pre mRNA upon which Exon 3 is spliced out. Although the resulting shorter Ex3-mRNA is present in the patient derived small molecule-derived neuronal precursor cells (smNPC), the protein levels of DJ-1 are reduced to an almost undetectable level. The Ex3-mRNA is, although stably expressed in the cells, not translated into protein. Even overexpression of a recombinant Ex3-mRNA that doesn't undergo splicing does not translate into expression of a truncated Ex3-DJ-1 protein. However, overexpression of recombinant full length DJ-1 restores DJ-1 full length mRNA and protein levels in patient derived.
[0130] Both constructs (Ex3 and full length DJ-1) were delivered to cells by stable transduction using lentiviral vectors. The vectors co-express GFP and population of successfully transduced cells were enriched by FACS sorting for GFP+ cells to an purity of over 80%. (
[0131] As the DJ-1 Ex3-mRNA does not get translated into protein we aimed to rescue the pathological skipping of exon 3 genetically in the patient-derived cell model. By stably transducing the cells with a specifically designed mutated U1 snRNA (U1mut) that will allow the U1 complex to bind again to the mutant pre-mRNA, the amount of correctly spliced DJ-1 RNA was predicted to increase in the cell model. A lentiviral vector encoding wild type U1 snRNA (U1wt) was used as control to account for effects on protein levels that occur due to general increase of U1 levels in the cells. These lentiviral vectors were co-expressing GFP for later enrichment of transduced cells.
[0132] The enriched population of U1 transduced cells indeed showed a rescue of DJ-1 protein only when transduced with the mutant U1 snRNA (
[0133] Both rescue strategies, expression of DJ-1 full length and U1mut, increase level of full length RNA and protein levels not only in smNPC but also in DIV14 neurons differentiated from smNPC (
[0134] The genetic rescue shows that the DJ-1 protein can be successfully restored in patient cells after identification of the underlying mechanism by correcting the splicing defect.
[0135] This opens the opportunity for a second strategy to rescue DJ-1 protein level, the use of small chemical compounds. The patient-derived smNPC were used to screen for active compounds. Here, our experiments identified 4-Phenylbutric acid as active in rescuing DJ-1 protein expression in concentrations of 150 M and 300 M as shown by Western blot (
Example 6: Impaired Translation of c.192G>C Mutant mRNA Causing Loss of DJ-1 Protein
[0136] Since exon 3 skipping is in frame, a shorter DJ-1 peptide would be expected, that is identical to the wt protein except for the deleted 34 amino acids encoded by exon 3. Strikingly, no truncated protein can be detected (
[0137] Next, we investigated the predicted centroid secondary structure of mRNA, where 47.1% of the bases were predicted to be unpaired for wt DJ-1, while only 40.3% were unpaired for the ex3 variant. This difference in the number of unpaired bases in the native mRNA structure is also reflected by a slightly higher predicted minimum free energy (271.1 kcal/mol) as compared to the ex3 variant (273.1 kcal/mol). These energy predictions can only provide indicative estimates, but structure visualizations (
Example 7: Combination of Compounds
[0138] We hypothesized a synergistic effect of RECTAS when combined with PB to rescue DJ-1 in c.192G>C carriers. Therefore, DIV 30 neurons differentiated from the two patient-derived smNPC clones were treated for 4 days with 1 mM PB and indicated concentrations of RECTAS. mRNA levels of full-length DJ-1 were measured by qPCR. Compared to the untreated and the solvent control treated neurons correctly spliced full-length DJ-1 mRNA was significantly increased after treatment with 1 mM PB and 50 M RECTAS. Four days treatment with 1 mM PB and RECTAS concentrations of 10 M and 25 M increased the amount of correctly spliced mRNA in a dose-dependent manner (
[0139] We also tested the effect of RECTAS and PB as a single treatment for rescuing DJ-1 in c.192G>C carriers. The experiments were carried out with patient-derived neurons as described above. As shown in
Example 8: Treatment of Parkinson's Disease Associated with Aberrant Splicing
[0140] Here we describe a novel mechanistic concept for the pathogenic role of the PD-associated c.192G>C mutation in PARK7. In contrast to a missense mutation that was predicted to cause instability of E64D mutant DJ-1 protein, we identified a drastic reduction of DJ-1 protein levels in patient-based cellular models due to U1-dependent mis-splicing of pre-mRNA. The demonstration of the disease-causing ex3 DJ-1 mRNA splice variant as cause of the drastically reduced expression of DJ-1 protein reported here underscores the relevance of access to patient material for functional studies in order to validate predictions and define the underlying molecular pathology as basis for precision medicine approaches.
[0141] The observed discrepancy between the overall amount of mutant pre-mRNA and the drastically reduced levels of DJ-1 protein in homozygous c.192G>C carriers indicated the involvement of pathological mRNA processing and/or translation and argued against nonsense-mediated decay (NMD) as underlying mechanism. In our case, the c.192G>C mutation in the DJ-1 gene leads to an in-frame deletion of exon 3 without a PTC and therefore mutant mRNA was predicted to encode a smaller protein lacking the 34 amino acids encoded by exon 3. However, neither mass spectroscopy nor Western blot with different antibodies directed against different epitopes of DJ-1 revealed a truncated protein encoded by ex3 DJ-1 pre-mRNA (
[0142] In order to prove a U1-mediated pathogenic splicing mechanism underlying loss of DJ-1 function in PD, we performed genetic rescue experiments. This may open avenues for future gene transfer strategies to the central nervous system, as options for viral vectors delivered to the brain are becoming safer and more effective.
[0143] Here, we successfully applied an advanced literature mining approach for prioritization of candidate drugs to revert molecular and cellular phenotypes associated with mis-splicing of DJ-1 in homozygous carriers of the c.192G>C mutation. The novel combination of a RECTAS with PB significantly increased mRNA and protein levels of DJ-1 in patient-based cells across different tissues (
[0144] Our findings on the combination of RECTAS and PB as causative treatment for DJ-1 deficiency have an immediate impact for homozygous carriers of the c.192G>C mutation, and may qualify for treatment at the prodromal stage of PD for addressing neuroprotective strategies in PD. As a first drug candidate for correction of aberrant splicing, recently kinetin underwent clinical assessments for pharmacokinetics, safety and effectiveness in vivo (Axelrod, F. B. et al. Kinetin improves IKBKAP mRNA splicing in patients with familial dysautonomia. Pediatr. Res. 70, 480-3 (2011); Shetty, R. S. et al. Specific correction of a splice defect in brain by nutritional supplementation. Hum. Mol. Genet. 20, 4093-101 (2011)). Within a clinical trial kinetin was well-tolerated, safe, and resulted in increased levels of correctly spliced mRNA in vivo. This indicates that drugs targeting U1-mediated splicing defects have a high potential to become a therapeutic tool and supports future clinical trials.
[0145] The therapeutic potential of combined RECTAS and PB treatment for targeting defective splicing is further substantiated by additional mutations in monogenic PD. Homozygous mutations in PARK7 affecting the 5 consensus splice site for U1 were described as a c.91-2AG mutation in an Iranian family and a c.317-322del mutation in a Turkish family. Moreover, a novel c.1488+1 G>A mutation in the PINK1 gene was shown to affect a U1 binding site and cause an in-frame deletion of exon 7 (Samaranch, L. et al. PINK1-linked parkinsonism is associated with Lewy body pathology. Brain 133, 1128-42 (2010)). Interestingly, as for the c.192G>C mutation in PARK7, no NMD of the mutant PINK1 mRNA was observed in affected carriers.
[0146] These findings already indicate that the pathogenic relevance of exonic splicing mutations may be underestimated in PD, as it is known to represent a common cause for human diseases with approximately 15% of all mutations caused by aberrant spliceosome function, that translates into defective pre-mRNA processing (Nissim-Rafinia, M. & Kerem, B. Splicing regulation as a potential genetic modifier. Trends Genet. 18, 123-7 (2002)). To further assess the potential role of splicing-related mutations in PD, we performed an exploratory analysis of exonic mutations affecting U1-mediated splicing using publicly available databases including whole exome sequencing results from the PPMI study (Parkinson Progression Marker Initiative. The Parkinson Progression Marker Initiative (PPMI). Prog. Neurobiol. 95, 629-35 (2011).). We found a significantly higher burden of rare exonic variants affecting U1 snRNA binding sites in 380 PD patients compared to 162 control samples (n=109, P-value=0.015, OR=1.55, CI=1.05-2.29), with the majority of the burden defined by brain expressed genes (P-value=0.047, OR=1.50, CI=0.95-2.38). These results are in line with a recent study showing that approximately 10% of pathogenic missense variants predicted to alter protein code are essentially disrupting splicing (Soemedi, R. et al. Pathogenic variants that alter protein code often disrupt splicing. Nat. Genet. (2017). doi:10.1038/ng.3837). This supports our findings of an enrichment of missense and synonymous disease-associated variants in the exon boundary and underscores the therapeutic potential of compounds acting on pathological splicing.
[0147] Our study illustrates the promise for treatment concepts in precision medicine in PD that focus on genetic and molecular stratification. In order to account for the increasingly recognized heterogeneity in PD and other neurodegenerative disorders, new strategies have to be developed for the stratification of patients along shared pathogenic mechanisms.
[0148] Variant Annotation and Filtering:
[0149] In the current study we only focused on the intronic 5 splice single nucleotide variants (SNVs). Multi-allellic variants were decomposed by using variant-tests (Tan, A., Abecasis, G. R. & Kang, H. M. Unified representation of genetic variants. Bioinformatics 31, 2202-2204 (2015)) and left normalized by bcftools (Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078-2079 (2009)). Variants were annotated by using ANNOVAR (Wang, K., Li, M. & Hakonarson, H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, e164-e164 (2010)) version 2016December05 using RefSeq gene annotations and the dbNSFP v3.0 (Liu, X., Wu, C., Li, C. & Boerwinkle, E. dbNSFP v3.0: A One-Stop Database of Functional Predictions and Annotations for Human Nonsynonymous and Splice-Site SNVs. Hum. Mutat. 37, 235-241 (2016)) prediction scores. Only rare variants, as defined by variants with a MAF<0.05 in the current dataset and also European population of public databases such as 1000 genomes (The 1000 Genomes Project Consortium. A global reference for human genetic variation. Nature 526, 68-74 (2015)), ExAC (release 0.3), and the Exome variant server (http://evs.gs.washington.edu/EVS/) were selected. In order to prioritize the 5 splice variants based on their deleteriousness, we used three different scores. The first score is generated by using the MaxEntScan method (Yeo, G. & Burge, C. B. Maximum entropy modeling of short sequence motifs with applications to RNA splicing signals. J. Comput. Biol. J. Comput. Mol. Cell Biol. 11, 377-394 (2004).) which is based on the maximum entropy principle. The other two scores were ensemble scores (dbscSNV_ADA and dbscSNV_RF) generated from multiple splice site prediction tools (Jian, X., Boerwinkle, E. & Liu, X. In silico prediction of splice-altering single nucleotide variants in the human genome. Nucleic Acids Res. 42, 13534-13544 (2014)) which are available as part of dbNSFP database (Liu, X., Wu, C., Li, C. & Boerwinkle, E. dbNSFP v3.0: A One-Stop Database of Functional Predictions and Annotations for Human Nonsynonymous and Splice-Site SNVs. Hum. Mutat. 37, 235-241 (2016)).
[0150] Generation of MaxEntScan score:
[0151] In the current study we only focused on the variants present in the consensus splice site positions i.e., +3 to 6 from the exon/intron boundary. To prioritize such variants using MaxEntScan method, for each SNV that lies in the consensus splice site region (SNV.sub.splice) a wild type 9 mer (WT) was extracted from the reference genome (hg19). Then the variant was introduced within the WT by using the python module pyfaidx (Shirley, M. D., Ma, Z., Pedersen, B. S. & Wheelan, S. J. Efficient pythonic access to FASTA files using pyfaidx. (PeerJ PrePrints, 2015).), hence creating a mutated consensus splice site (MUT) sequence for each variant. In the next steps the scores were calculated for both WT and MUT sequences by using the scripts provided in the MaxEntScan website (http://genes.mit.edu/burgelab/maxent/Xmaxentscan_scoreseq.html). The relative percentage change (maxentscan_change) was calculated by using the formula below and all the variant sites were annotated with the WT.sub.score and maxentscan_change.
[0152] Deleterious splice site variants (SNV.sub.splicedef) were defined as SNVs with the following criteria: WT.sub.score>5 and maxentscan_change>70 and dbscSNV_ADA score>0.9 and dbscSNV_RF score>0.9. If the ensemble scores are not available for any particular variant only MaxEntScan methods (WT.sub.score>5 and maxentscan_change>70) was used.
[0153] Burden Tests:
[0154] To test the genome-wide differential burden of SNV.sub.splicedef variants in cases compared to the controls CMCFisherExact test available part of rvtests (Zhan, X., Hu, Y., Li, B., Abecasis, G. R. & Liu, D. J. RVTESTS: an efficient and comprehensive tool for rare variant association analysis using sequence data. Bioinformatics 32, 1423-1426 (2016)) with default parameters was used and one sided P-values were reported. CMCFisherExact test collapses the variants in the given regions (gene or gene-set) and performs the fisher exact test. We used a list of brain expressed genes to see if there is an increased burden in the brain expressed genes (n=14,177) compared to the non-brain expressed genes (n=6,428).
[0155] Results:
[0156] Burden Analysis:
[0157] After the filtering based on ethnicity, cryptic relatedness and quality parameters the final dataset comprised of 380 PD and 162 control samples. A total of 128 SNV.sub.splicedef variants were included in the final analysis. We observed a genome-wide significant burden in cases compared to the controls (P-value=0.021, OR=1.49, CI=1.02-2.18). The signal is coming mainly from the exonic SNV.sub.splicedef variants (n=109, P-value=0.015, OR=1.55, CI=1.05-2.29) rather than the intronic ones (n=19, P-value=0.24, OR=1.34, CI=0.68-2.65). In the exonic SNV.sub.splicedef variants majority of the burden is caused by the brain expressed genes (P-value 0.047, OR=1.50, CI=0.95-2.38), compared to the genes that are not expressed in brain (P-value 0.12, OR=1.37, CI=0.84-2.23). The SNV.sub.splicedef variants are provided in the supplemental table 1, along with their respective case-control counts.
[0158] Research co-funded by the Fonds National de la Recherche Luxembourg (FNR) under grant number FNR/P13/6682797/Krger
[0159] The invention illustratively described herein may suitably be practiced in the absence of any element or elements, limitation or limitations, not specifically disclosed herein. Additionally, the terms and expressions employed herein have been used as terms of description and not of limitation, and there is no intention in the use of such terms and expressions of excluding any equivalents of the features shown and described or portions thereof, but it is recognized that various modifications are possible within the scope of the invention claimed. Thus, it should be understood that although the present invention has been specifically disclosed by exemplary embodiments and optional features, modification and variation of the inventions embodied therein herein disclosed may be resorted to by those skilled in the art, and that such modifications and variations are considered to be within the scope of this invention.
[0160] The invention has been described broadly and generically herein. Each of the narrower species and subgeneric groupings falling within the generic disclosure also form part of the invention. This includes the generic description of the invention with a proviso or negative limitation removing any subject matter from the genus, regardless of whether or not the excised material is specifically recited herein.
[0161] Other embodiments are within the following claims. In addition, where features or aspects of the invention are described in terms of Markush groups, those skilled in the art will recognize that the invention is also thereby described in terms of any individual member or subgroup of members of the Markush group.