GAMETOGENESIS
20210071138 ยท 2021-03-11
Assignee
Inventors
- Petra HAJKOVA (Swindon, Wiltshire, GB)
- Peter HILL (Swindon, Wiltshire, GB)
- Harry LEITCH (Swindon, Wiltshire, GB)
Cpc classification
C12N2506/45
CHEMISTRY; METALLURGY
C12N2501/06
CHEMISTRY; METALLURGY
C12N2501/999
CHEMISTRY; METALLURGY
C12N5/0611
CHEMISTRY; METALLURGY
C12Q1/6876
CHEMISTRY; METALLURGY
International classification
Abstract
The present invention relates to in vitro methods of inducing gametogenesis by producing meiotically competent cells. Reagents and kits for use in the methods of the invention are also provided. The present invention finds use in the field of medicine, particularly in the study and treatment of infertility.
Claims
1. An in vitro method of producing a meiotically competent cell, the method comprising: (i) providing a precursor cell, (ii) inhibiting methylation of the genomic DNA of the precursor cell, (iii) treating the precursor cell with an inhibitor of a polycomb repressive complex, and then (iv) propagating the precursor cell for a period of time and under culture conditions suitable for the precursor cell to become a meiotically competent cell; wherein step (ii) and step (iii) may be performed simultaneously or sequentially in either order.
2. The method according to claim 1, wherein the precursor cell is derived from a sample that has been obtained from a subject.
3. The method according to claim 1, wherein the precursor cell is a stem cell or a primordial germ cell-like cell (PGCLC).
4. The method according to claim 3, wherein the stem cell is an iPS cell.
5. The method according to claim 1, wherein said precursor cell expresses Tet1, or begins expressing Tet1 following step (i) and/or (ii).
6. The method according to claim 1, wherein said inhibiting step (ii) and said treating step (iii) are sufficient to induce expression of germline reprogramming responsive (GRR) genes by the precursor cell during propagating step (iv).
7. The method according to claim 6, wherein the expression of the GRR genes is associated with or induced by recruitment of a transcriptional activator.
8. The method according to claim 7, wherein the transcriptional activator is Tet1.
9. The method according to claim 5, wherein Tet1 expression is exogenously provided or enhanced.
10. The method according to claim 1, wherein Tet1 protein is exogenously introduced into the precursor cell before or during step (iv).
11. The method according to claim 9, wherein the exogenously provided or exogenously introduced Tet1 is a Tet1 fusion construct that is targeted to one or more specific genomic regions.
12. The method according to claim 1, further comprising: (v) detecting the expression level of one or more GRR genes in the cell.
13. The method according to claim 12, wherein step (v) is performed on the meiotically competent cell following step (iv).
14. The method according to claim 1, wherein the inhibitor of polycomb repressive complex is a PRC1 inhibitor and/or a PRC2 inhibitor.
15. (canceled)
16. The method according to claim 14, wherein the PRC1 inhibitor is PRT4165.
17. The method according to claim 1, wherein the inhibitor of polycomb repressive complex is an RNAi molecule.
18. The method according to claim 1, wherein step (ii) is performed by treating the precursor cell with an agent that reduces genomic DNA methylation.
19. The method according to claim 18, wherein the agent that reduces genomic DNA methylation is a DNA methyltransferase inhibitor, an agent that prevents the deposition of DNA methylation, or an agent that inhibits the maintenance of DNA methylation.
20. The method according to claim 19, wherein the agent that reduces genomic DNA methylation is a DNA methyltransferase inhibitor, optionally wherein the DNA methyltransferase inhibitor is a DNMT1 inhibitor.
21. (canceled)
22. The method according to claim 20, wherein the DNA methyltransferase inhibitor is SGI 1027, 5-azacytidine, or an RNAi molecule.
23. (canceled)
24. The method according to claim 1, wherein step (ii) is performed by using gene-editing to inactivate a DNA methyltransferase gene or a component of DNA methylation machinery.
25. A meiotically competent cell produced by the method of any one of the preceding claims.
26. A method of inducing gametogenesis, the method comprising treating the meiotically competent cell according to claim 25 with retinoic acid.
27. (canceled)
28. (canceled)
29. A gametocyte produced by the method according to claim 21, or a gamete derived therefrom.
30. (canceled)
31. A kit for the in vitro production of the meiotically competent cell according to claim 25, the kit comprising a methylation inhibitor, and an inhibitor of a poly comb repressive complex.
32. (canceled)
33. A method of assessing the fertility of a mammal, the method comprising determining the nucleic acid sequence and/or epigenetic status of one or more germline reprogramming responsive (GRR) genes in a cell that has been obtained from the mammal.
34. A method of determining the meiotic competency of a cell, the method comprising determining the nucleic acid sequence and/or epigenetic status and/or gene expression level of one or more germline reprogramming responsive (GRR) genes in the genomic DNA of the cell.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0050] Embodiments of the invention will now be described by way of example with reference to the accompanying drawings:
[0051]
[0052]
[0053]
[0054]
[0055]
[0056]
[0057]
[0058]
[0059]
[0060]
[0061]
[0062]
[0063]
[0064]
[0065]
[0066]
[0067] The timely and efficient activation of germline reprogramming responsive (GRR) genes, involved in the PGC-to-gonocyte transition and successful gametogenesis, requires the interplay between the initiation of global DNA demethylation, Tet1 recruitment, and removal of PRC1-mediated repression. Both DNA demethylation-dependent (safeguarding against aberrant residual/de novo promoter DNA methylation) and -independent (such as the potential recruitment of OGT to gene promoters36, thus facilitating deposition of H3K4me3 via SET1/COMPASS38) functions of Tet1 are important for GRR gene activation.
[0068]
[0069] Mouse embryonic stem cells (mESCs) were treated with retinoic acid (RA). The J1 cell line was used in comparison to J1 TKO cells lack DNA methylation machinery by virtue of being Dnmt1/Dnmt3a/Dnmt3b triple knock-out. The black bars respectively show the fold-change in Dazl-Hormad1- and Mae1-expression in TKO cells compared with J1 controls (neither treated with RA). The grey bars respectively show the fold-change in Dazl-Hormad1- and Mae1-expression in J1 cells treated with RA compared with J1 cells not treated with RA. The white bars respectively show the fold-change in Dazl-Hormad1- and Mae1-expression in TKO cells treated with RA compared with J1 cells not treated with RA.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS AND FURTHER OPTIONAL FEATURES OF THE INVENTION
[0070] While the invention has been described in conjunction with the exemplary embodiments described above, many equivalent modifications and variations will be apparent to those skilled in the art when given this disclosure. Accordingly, the exemplary embodiments of the invention set forth above are considered to be illustrative and not limiting. Various changes to the described embodiments may be made without departing from the spirit and scope of the invention.
[0071] Epigenetic Reprogramming Enables the Primordial Germ Cell-to-Gonocyte Transition
[0072] Gametes are highly specialised cells that can give rise to the next generation through their ability to generate a totipotent zygote. In mouse, germ cells are first specified in the developing embryo as primordial germ cells (PGCs) starting around embryonic day (E) 6.25.sup.1 (
[0073] In order to address the potential role and underlying molecular mechanisms of gonadal germline reprogramming, the inventors first set out to investigate the dynamics of and relationship between 5mC and 5-hydroxymethylcytosine (5hmC), which has previously been implicated in DNA demethylation in PGCs.sup.3,6,9-11. The inventors did this quantitatively and at single base resolution using liquid chromatography/mass spectrometry (LC/MS) coupled with whole genome bisulphite sequencing (WGBS,
[0074] By LC/MS the inventors observed that global levels of genomic 5mC remain stable between migratory (E9.5) and early gonadal (E10.5) PGCs, followed by a significant reduction between E10.5 and E11.5 and much more limited DNA demethylation between E11.5 and E13.5 (
[0075] Consistent with our LC/MS measurements, WGBS analysis revealed near complete loss of combined 5mC/5hmC between E10.5 and E11.5 at features within uniquely mapped regions of the genome, with limited further DNA demethylation observed between E11.5 and E12.5 (
[0076] Detailed analysis of 5hmC patterns across examined developmental stages uncovered that the majority of 5hmC is lost from uniquely mapped regions of the genome and re-localised to repetitive elements (
[0077] We next explored the relationship between 5hmC deposition and DNA demethylation in gonadal PGCs between E10.5 and E12.5 for all initially methylated 2 kb windows (i.e. min. 20% methylation at E10.5). DNA demethylation involving a 5hmC intermediate predicts a direct correlation between 5hmC appearance and 5mC loss (
[0078] To expand on this observation, the inventors used a previously published Tet1-KO mouse model.sup.18 (
[0079] In support of our LC/MS measurements, only a limited number of differentially methylated regions were detected in E14.5 Tet1-KO PGCs by reduced representation bisulphite sequencing (RRBS) (
[0080] We next analysed the observed 5mC and 5hmC dynamics in combination with RNA-Seq datasets derived from E10.5-E14.5 PGCs (
[0081] As the influence of 5mC on transcriptional activity of a gene has been shown in mammals to be highly dependent on promoter CpG content.sup.19, the inventors performed clustering analysis specifically at genes with either high-CpG (HCPs), intermediate-CpG (ICPs) or low-CpG (LCPs) promoters.sup.19 (
[0082] Considering that GRR genes (n=45) constituted less than 25% of the entire subset of HCP genes that undergo DNA demethylation (n=226;
[0083] The observed binding of Tet1 is functionally relevant, as the extent of GRR gene upregulation is considerably lower in Tet1-KO PGCs (
[0084] To further mechanistically probe the causal relationship between epigenetic reprogramming and GRR gene activation, the inventors turned to an in vitro model. Serum-grown mESCs represented an ideal system, as these cells are not germ line-restricted yet have highly similar epigenetic modifications at GRR gene promoters to what is observed in vivo in pre-reprogramming gonadal PGCs (
[0085] Although these in vitro observations clearly supported our in vivo data with respect to the roles of 5mC and Tet1, the extent to which GRR genes were up-regulated in Dnmt-TKO mESCs (
[0086] Our study has identified a set of germline reprogramming responsive (GRR) genes crucial for the correct progression of gametogenesis. These genes have unique promoter sequence characteristics, with high levels of both 5mC and 5hmC, and are targets of Tet1 and PRC1. This disclosure shows that combined loss of DNA methylation and PRC1 repression is uniquely required for GRR gene activation, with this epigenetically poised state further requiring Tet1 to potentiate both full and efficient activation. Tet1 appears to be particularly important in female PGCs.sup.9, which initiate meiotic prophase soon after completion of epigenetic reprogramming, thus posing a requirement on the timely high expression of these genes. Importantly, although the inventors observed slight hypermethylation at GRR gene promoters in E14.5 Tet1-KO PGCs, our study clearly documents that Tet1 stimulates transcription of GRR genes also via a DNA demethylation-independent mechanism.sup.21,22. In this context, previous studies have shown that Tet1 recruits OGT to gene promoters.sup.22, thus facilitating deposition of H3K4me3 via SET1/COMPASS.sup.28 leading to transcriptional activation. In further support, GRR gene promoters in mESCs are marked by low but detectable H3K4me3, the levels of which are significantly decreased in the absence of Tet1 without changes in DNA methylation (
Methods
Statistics and Reproducibility
[0087] All statistical tests are clearly described in the figure legends and/or in the Methods section, and exact p-values or adjusted p-values are given where possible. For WGBS data (
[0088] Mice
[0089] All animal experiments were carried out under and in accordance with a UK Home Office Project Licence in a Home-Office designated facility. Except for direct comparison with Tet1-KO PGCs, wild type PGCs were isolated from embryos produced by crossing outbred MF1 females with mixed background GOF18PE-EGFP.sup.5 transgenic males. The sex of embryos from E12.5 onwards was determined by visual inspection of the gonads. For study of Tet1-KO PGCs, the Tet1 knockout mouse strain (B6;29S4-Tet1.sup.tml.IJae/J).sup.18 was purchased from Jackson Laboratory and bred onto the GOF18PE-EGFP.sup.5 transgenic mouse line. Wild type and Tet1-KO PGCs were isolated from embryos produced from crosses between Tet1-heterozygous GOF18PE-EGFP-homozygous females and males. For genotyping of embryos produced by crossing Tet1-heterozygous GOF18PE-EGFP-homozygous males and females, PCR was always carried out twice using two different sets of primers (see below) to confirm exon 4 deletion. The sex of the embryos from E12.5 onwards was determined by visual inspection of gonads and additionally confirmed by PCR for Sry. In all cases, the mating is timed in the way that appearance of a vaginal plug at noon is defined as E0.5.
[0090] Molecular Biology
[0091] The following genotyping primers were used in this study:
TABLE-US-00002 (Tet1forwardprimer1) TCAGGGAGCTCATGGAGACTA; (Tet1forwardprimer2) AACTGATTCCCTTCGTGCAG; (Tet1reverseprimer) TTAAAGCATGGGTGGGAGTC; (Sryforwardprimer) TTGTCTAGAGAGCATGGAGGGCCATGTCAA; (Sryreverseprimer) CCACTCCTCTGTGACACTTTAGCCCTCCGA.
[0092] PGC Isolation by Flow Cytometry
[0093] PGC isolation was carried out as previously described.sup.4. Briefly, the embryonic trunk (E10.5) or genital ridge (E11.5-E14.5) was digested at 37 C. for 3 min using 0.05% Trypsin-EDTA (lx) (Gibco) or TrypLE Express (Thermo). Enzymatic digestion was followed by neutralization with DMEM/F-12 (Gibco) containing 15% foetal bovine serum (Gibco) and manual dissociation by pipetting. Following centrifugation, cells were re-suspended in DMEM/F-12 supplemented with hyaluronidase (300 g/ml; Sigma), and a single cell suspension was generated by manual pipetting. Following centrifugation, cells were re-suspended in ice-cold PBS supplemented with poly-vinyl alcohol (10 g/ml) and EGTA (0.4 mg/ml, Sigma). GFP positive cells were isolated using an Aria IIu (BD Bioscience) or Aria III (BD Bioscience) flow cytometer and sorted into ice cold PBS supplemented with poly-vinyl alcohol (10 g/ml) and EGTA (0.4 mg/ml, Sigma).
[0094] Generation of Tet1-KO Dnmt-TKO mESCs
[0095] Tet1-KO Dnmt-TKO mESC line was generated by CRISPR/Cas9-mediated genome editing. pX330 (Addgene, #42230) with the sgRNA targeting Tet1.sup.31 (GGCTGCTGTCAGGGAGCTCA) was co-transfected with a reporter GFP plasmid in 510.sup.6 Dnmt-TKO mESCs.sup.23 using Lipofectamine 3000. The day after, GFP positive cells were sorted by FACS (BD FACS Aria III) in a 96-well plate. Cells were cultured for a week before being frozen down and extracting gDNA. Colonies were screened for mutations using surveyor assay (Surveyor Mutation Detection Kit from IDT, and Taq DNA polymerase from Qiagen). Tet1-KO Dnmt-TKO mESC selected clone was further analysed by genotype sequencing, which confirmed the presence of a frameshift mutation. Loss of Tet1 was verified by RNA-Seq and western blot. The following primers were used for genotype sequencing and surveyor assay: 5 TTGTTCTCTCCTCTGACTGC 3 and 5 TGATTGATCAAATAGGCCTGC 3.
[0096] mESC Cell Culture
[0097] J1 (wild type), Dnmt-TKO.sup.23 and Tet1-KO Dnmt-TKO mESCs were cultured in FCS/LIF medium without feeders on 0.1% gelatin. FCS/LIF medium consists of GMEM (Gibco) supplemented with 10% FCS, 0.1 mM MEM nonessential amino acids, 2 mM 1-glutamine, 1 mM sodium pyruvate, 0.1 mM 2-mercaptoethanol and mouse LIF (ESGRO, Millipore). For inhibitor experiments, mESCs were plated at a density of 1.510.sup.4/cm.sup.2 and left overnight. The next morning medium was exchanged for FCS/LIF medium containing either 50 M PRC1 inhibitor PRT4165 (Ismail et al., 2013) or DMSO control and cells pelleted at the indicated time for analysis.
[0098] AbaSeq Library Preparation
[0099] Total DNA was isolated from 10,000 sorted PGCs using the QIAamp DNA Micro Kit (Qiagen). AbaSeq libraries for 5hmC profiling were constructed as previously described.sup.15. In brief, genomic DNA was glucosylated, then digested by AbaSI enzyme (NEB). Biotinylated P1 adapters were ligated onto the AbaSI digested DNA then fragmented using a Covaris S2 sonicator (Covaris), following the manufacturer's instructions. Sheared P1-ligated DNA was then captured by mixing with Dynabeads MyOne Streptavidin C1 beads (Life Technologies) according to the manufacturer's specifications. End repair and dA-tailing were carried out on the beads by using the NEBNext End Repair Module (NEB) and the NEBNext dA-tailing Module (NEB) at 20 C. and 37 C. respectively for 30 min. P2 adapters were ligated to the random sheared ends of the dA-tailed DNA. Finally, the entire DNA was amplified using the Phusion DNA polymerase (NEB) with the addition of 300 nM forward primer (PCR_I) and 300 nM reverse primers (PCR_IIpe) for 16 cycles. The libraries were purified using AMPure XP beads (Beckman-Coulter) and sequenced on the Illumina HiSeq 2000 instrument.
[0100] Whole Genome Bisulphite Sequencing (WGBS) Library Preparation
[0101] Total DNA was isolated from 10,000 sorted PGCs using the QIAamp DNA Micro Kit (Qiagen). In some cases, unmethylated k phage DNA (Promega) was spiked in following DNA isolation to assess bisulphite conversion rate. DNA was fragmented using a Covaris S2 sonicator (Covaris), as per manufacturer's instructions. Libraries were made following the NEBNext Library Prep protocol, with methylated adaptors and the following modifications: following adaptor ligation, bisulphite conversion was carried out using the Imprint Modification Kit (Sigma); and PCR enrichment was carried out for 16 cycles using the NEXTflex Bisulphite-Seq Kit for Illumina Sequencing (Bioo Scientific) master mix and the NEBNext Library Prep universal and index primers (NEB). The libraries were purified by AMPure XP beads (Beckman-Coulter). Libraries were sequenced on the Illumina HiSeq 2000 or 2500 instrument.
[0102] Reduced Representation Bisulphite Sequencing (RRBS) Library Preparation
[0103] Total DNA from FACS-sorted PGCs isolated from individual Tet1-KO or wild type embryos was isolated using ZR-Duet DNA-RNA MiniPrep kit (Zymo), and DNA from between two to six embryos (equivalent to 1,000 to 8,000 cells) of the same genotype, stage and sex was pooled and concentrated to 26 L final volume using the Savant SpeedVac Concentrator (Thermo) and following the manufacturer's instructions. Genomic DNA was digested by 20 units of MspI enzyme (NEB) in NEB buffer 2 at 37 C. for 3 hrs, and digested DNA was purified using AMPure XP beads (Beckman-Coulter). Libraries were made following the NEBNext Ultra DNA Library Prep protocol with methylated adaptors and the following modifications: following adaptor ligation, bisulphite conversion was carried out using the Imprint Modification Kit (Sigma); and PCR enrichment was carried out for 18 cycles using the KAPA Uracil+DNA polymerase master mix (KAPA Biosystems) and the NEBNext Library Prep universal and index primers (NEB). The libraries were purified by AMPure XP beads (Beckman-Coulter). Pooled libraries were sequenced on the Illumina HiSeq 2500 instrument, using the dark sequencing protocol, as previously described.sup.32.
[0104] RNA-Seq Library Preparation
[0105] For study of Tet1-KO PGCs, total RNA from sorted PGCs isolated from individual Tet1-KO or wild type embryos was isolated using ZR-Duet DNA-RNA MiniPrep kit (Zymo), and RNA from between two to six embryos (equivalent to 1,000 to 8,000 cells) of the same genotype, stage and sex was pooled and concentrated to 6 L final volume using the RNA Clean and Concentrator 5 kit (Zymo). For study of wild type PGCs isolated from embryos produced by crossing MF1 females with GOF18PE-EGFP males, total RNA from 600-1,000 sorted E10.5 PGCs was isolated using the Nucleospin RNA XS kit (Macherey-Nagel). cDNA synthesis and amplification (15 cycles) was performed with the SMARTer Ultra Low Input RNA kit (Clontech) using between 100 pg and 3 ng total RNA and following the manufacturer's instructions. The amplified cDNA was fragmented by a Covaris S2 sonicator (Covaris) and following the manufacturer's instructions. Sheared cDNA was converted to sequencing libraries using the NEBNext DNA Library Prep kit (NEB), following the manufacturer's instructions and using 15 cycles of amplification. For study of mESCs, total RNA was isolated using ZR-Duet DNA-RNA MiniPrep kit (Zymo). cDNA synthesis and library prep was performed starting with 500 ng total RNA following manufacturer's instructions using the NEBNext Ultra Library Prep Kit (NEB) and the NEBNext Poly(A) mRNA Magnetic Isolation Module (NEB). All libraries were purified by AMPure XP beads (Beckman-Coulter) and sequenced on the Illumina HiSeq 2500 instrument.
[0106] Bioinformatics
[0107] Whole Genome Bisulphite Sequencing (WGBS) and Tet-Assisted Bisulphite Sequencing (TAB-Seq) Alignment and Downstream Analysis
[0108] Raw reads were first trimmed using Trim Galore (version 0.3.1) with the --paired --trim1 options. Alignments were carried out to the mouse genome (mm9, NCBI build 37) with Bismark (version 0.13.0) with the -n 1 parameter; where appropriate, the phage genome was added as an extra chromosome. Aligned reads were deduplicated with deduplicate_bismark. Where appropriate, the bisulphite conversion rate was computed using reads aligned to the phage genome and using the to-mr script (parameters: -m bismark) and bsrate script (paramters: -N) of Methpipe (version 3.3.1). CpG methylation calls were extracted from the deduplicated mapping output using the Bismark methylation extractor.
[0109] The number of methylated and unmethylated cytosines in a CpG context was extracted using bismark2bedGraph and coverage2cytosine. Symmetric CpGs were merged with custom R script. For all downstream analysis, only symmetric CpGs with minimum 8 coverage were used. All WGBS analysis was carried out on data from merged biological replicates. For assessing DNA modification levels at specific repetitive elements, Bismark (version 0.14.4) was used to map all reads from each data set against consensus sequences constructed from Repbase with the n 1 parameter set. CpG methylation calls were extracted from the mapping output using the Bismark methylation extractor (version 0.14.4).
[0110] The mapBed function of BEDtools (version 2.24.0) was used to compute the combined 5mC/5hmC level for the following genomic features: 1) all 2 kb windows (containing a minimum 4 symmetric CpGs); 2) gene promoters (defined as Ensembl 67 gene start sites 1 kB/+500 bp); 3) gene bodies (defined as the region contained within Ensembl 67 gene start and gene end sites); 4) putative active enhancers in day 6 PGCLCs.sup.33; 5) imprint control regions; 6) CpG islands (UCSC); 7) intergenic regions. For metagene plots, a genomic feature was divided into equally sized bins using BEDtools (version 2.24.0), including: 1) gene bodies (defined as the region contained within Ensembl 67 gene start and gene end sites)+/0.5*gene body length (100 bins); 2) putative active enhancers in day 6 PGCLCs 33 +/1*putative active enhancer length (90 bins); and 3) CpG islands (UCSC)+/1*CpG island length (90 bins). In all cases, the combined 5mC/5hmC level was expressed as the mean of individual CpG sites.
[0111] For k-means clustering of the combined mean 5mC/5hmC levels, high CpG (HCP), intermediate CpG (ICP) and low CpG (LCP) promoters, as defined using the same parameters as previously published.sup.1934. Briefly, LCPs contain no 500-bp window with a CpG ratio >0.45; HCPs contain at least one 500-bp window with a CpG ratio >0.65 and GC content >55%; ICPs do not meet the previous criteria.
[0112] For determining locus-specific methylation levels in wild type mESCs grown in serum-containing media, raw WGBS reads were downloaded from GSE48519.sup.30 and processed as above. TAB-Seq reads for E14 mESCs were downloaded from GSE36173.sup.35 and processed as above, with the exception that only symmetric CpGs with minimum 12 coverage were used.
[0113] AbaSeq Alignment and Downstream Analysis
[0114] For the uniquely mappable part of the genome, AbaSeq reads were processed as previously described.sup.15. In brief, raw sequencing reads were trimmed for adaptor sequences and low quality bases using Trim Galore. The trimmed reads were mapped to the mouse genome (mm9, NCBI build 37) using Bowtie (version 0.12.8) with parameters n 1 -l25 --best --strata -m 1. Calling of 5hmC was based on the recognition sequence and cleavage pattern of the AbaSI enzyme (5-C.sub.11-13N.sub.9-10G-3/3-GN.sub.9-10N.sub.11-13C-5) using custom Perl scripts. For assessing relative enrichment of 5hmC at repetitive elements and non-repetitive elements, AbaSeq alignments were divided into two groups: unique (single best alignment) and ambiguous (map to multiple locations with equal alignment score). Both groups were then mapped to the repetitive elements defined by the RepeatMasker track of mm9 (UCSC Genome Browser) separately. For comparison with 5hmC levels in mESCs, AbaSeq reads were downloaded from GSE42898.sup.15 and aligned in the same way.
[0115] For quantification of relative 5hmC levels at symmetric CpGs in the uniquely mapped part of the genome, the number of counts per symmetric CpG for a given sample were normalised to the combined number of uniquely mapped and ambiguously mapped reads for a given library, and then further multiplied by a stage-specific normalisation factor based on the mean 5hmC level for each stage as computed by LC/MS (E14 ESC=1.64; E10.5=1.0; E11.5=1.13; E12.5F=0.76; E12.5M=1.0). All symmetric CpGs falling within genomic intervals blacklisted by the mouse (mm9) ENCODE project were excluded from all further downstream analysis. Unless stated otherwise, all AbaSeq analysis was carried out on data from merged biological replicates.
[0116] The mapBed function of BEDtools (version 2.24.0) was used to compute the 5hmC level for the same genomic features as was carried out with WGBS datasets (see above). In all cases, the 5hmC level was expressed as the mean of individual CpG sites.
[0117] To identify 5hmC enriched or depleted regions in E10.5 and E11.5 PGCs, the mm9 genome was first divided into 2 kb windows (minimum 4 symmetric CpGs) and the mean 5hmC level for each window was computed using BEDtools (version 2.24.0). To determine the significance of 5hmC enrichment in each 2 kB window, upper-tail (to determine 5hmC enriched regions) or lower-tail (to determine 5hmC depleted regions) Poisson probability p-values were computed using ppois(x, ), where x is the observed 5hmC mean value for each 2 kb window and is the mean of 5hmC mean values for all 2 kb windows at E10.5. Benjamini-Hochberg correction was then applied to correct for multiple testing, giving a final adjusted upper-tail and lower-tail p-value for each 2 kb window. Windows with adjusted upper-tail p-value<0.05 were considered relatively enriched for 5hmC while windows with adjusted lower-tail p-value<0.05 were considered relatively depleted for 5hmC.
[0118] For assessing relative enrichment of 5hmC at specific repetitive elements, Bowtie was used to map all reads from each data set against consensus sequences constructed from Repbase with parameters n 1M 1 --strata -best. The number of reads mapped to each sequence within a given sample was first normalised to the library size of that particular sample, and then normalised to both a stage-specific normalisation factor based on the mean 5hmC level for each stage as computed by LC/MS (E10.5=1.0; E11.5=1.13; E12.5F=0.76; E12.5M=1.0) and the mean proportion of reads mapped to a given sequence in E10.5 PGCs.
[0119] Reduced Representation Bisulphite Sequencing (RRBS) Alignment and Downstream Analysis
[0120] Raw RRBS reads were first trimmed using Trim Galore (version 0.3.1) with --rrbs parameter. Alignments were carried out to the mouse genome (mm9, NCBI build 37) with Bismark (version 0.13.0) with the -n 1 parameter. CpG methylation calls were extracted from the mapping output using the Bismark methylation extractor (version 0.13.0). The number of methylated and unmethylated cytosines in a CpG context was extracted using bismark2bedGraph.
[0121] RnBeads (version 1.0.0) and RnBeads.mm9 (version 0.99.0) were used to identify differentially methylated regions between two test groups for the following genomic features, with filtering.missing.value.quantile set to 0.95 and filtering.missing.coverage.threshold set to 8: 1) all 2 kb windows (containing a minimum 4 symmetric CpGs); 2) gene promoters (defined as Ensembl 67 gene start sites 1 kB/+500 bp); and 3) imprint control regions (mm9 genome). The following was extracted from the output of RnBeads: 1) the mean methylation level for each group (i.e. stage, sex and/or genotype) for each commonly covered test region; 2) the difference in methylation means between two groups for each commonly covered test region; and 3) the p-value representing the significance of the difference in methylation means between two groups for each commonly covered test region. Differentially methylated regions were identified as regions with a p-value<0.05 and a difference in methylation means between two groups greater than 10%.
[0122] For assessing DNA modification levels at specific repetitive elements, Bismark (version 0.14.4) was used to map all reads from each data set against consensus sequences constructed from Repbase with the -n 1 parameter set. CpG methylation calls were extracted from the mapping output using the Bismark methylation extractor (version 0.14.4). The number of methylated and unmethylated cytosines in a CpG context were extract using bismark2bedGraph and coverage2cytosine. Differentially methylated consensus repeats were identified as regions with a p-value<0.05 (as computed by two-sided Student's t-test) and a difference in methylation means between two groups greater than 10%.
[0123] hMeDIP Alignment and Downstream Analysis
[0124] Raw hMeDIP-Seq and input reads for E14 mESCs were downloaded from GSE28500.sup.36 and aligned to the mouse genome (mm9, NCBI build 37) with Bowtie (version 0.12.8) with parameters -n 2 -l25 -m 1. BEDtools multicov was used to identify the number of hMeDIP and input reads overlapping each 2 kB window (containing a minimum 4 symmetric CpGs). Final 5hmC levels for each 2 kB window were determined by first normalising the number of overlapping hMeDIP reads (normalised to library size) by the number of overlapping input reads (normalised to library size) and then by dividing this value by the number of symmetric CpGs contained within the 2 kB window.
[0125] ChIP-Seq Alignment and Downstream Analysis
[0126] For putative active enhancer calling, raw ChIP-Seq reads for H3K4me3, H3K27me3 and H3K27Ac in day 6 PGC-like cells (PGCLCs) were downloaded from GSE60204.sup.33 and raw ChIP-Seq reads for H3K4me3, H3K27me3, H3K4mel and H3K27Ac in wild type mESCs were downloaded from GSE48519.sup.30. Reads were aligned to the mouse genome (mm9, NCBI build 37) with Bowtie (version 0.12.8 or version 1.0.0) with parameters -n 2 -l 25 -m l and -C where appropriate. Subsequent ChIP-Seq analysis was carried out on data from merged biological replicates. To identify putative active enhancers, the inventors first generated an 8-state chromatin model using ChromHMM. Putative active enhancers were defined as all regions not overlapping any potential promoter regions (Ensembl 67 gene start sites 1 kB/+500 bp) and contained within the (H3K27Ac.sup.+/H3K4me3.sup./H3K27me3.sup.) chromatin state in day 6 PGCLCs or (H3K4me.sup.+/H3K27Ac.sup.+/H3K4me3.sup./H3K27me3.sup.) in wild type mESCs.
[0127] For analysis of epigenetic modifications and modifiers around transcription start sites (Ensembl 67): raw ChIP-Seq reads for: Tet1 binding in wild type serum-grown mESCs was downloaded from GSE24843.sup.21; H2AK19Ub1 levels in wild type serum-grown mESCs were downloaded from GSE34520.sup.37; Ring1b binding in wild type serum-grown mESCs were downloaded from ERP005575.sup.38; and for H3K4me3 in wild type and Tet1-KO serum-grown mESCs were downloaded from GSE48519.sup.30. Reads were aligned to the mouse genome (mm9, NCBI build 37) with Bowtie (version 0.12.8 or version 1.0.0) with parameters -n 2 -l 25 -m 1. Subsequent ChIP-Seq analysis was carried out on data from merged biological replicates. For computing ChIP-Seq signal around transcription start sites (TSS), the genomic interval around the Ensembl 67 gene start sites+/5 kB (or 2 kB) was divided into 100 (or 40) equally sized bins using BED tools make windows. BED tools multicov was then used to compute the number of test and control reads overlapping each bin. The total number of test and control reads per bin for each sample were normalised to the appropriate library size, and fold enrichment for each bin was determined by dividing the number of normalised ChIP-Seq test sample reads by the number of normalised ChIP-Seq control sample reads. For computing ChIP-Seq signal at gene promoters, the genomic interval around the Ensembl 67 gene start sites +500 bp/1 kB was
[0128] RNA-Seq Alignment and Downstream Analysis
[0129] For study of Tet1-KO and Tet1-WT PGCs, Illumina and Smart-seq adapters from the sequencing reads were first trimmed using Trimmomatic. For other RNA-Seq libraries, fastq files generated from output of next generation sequencing were used directly for alignment. RNA-Seq reads were aligned to the mouse genome (mm9, NCBI build 37) with Bowtie (version 0.12.8) and Tophat (version 2.0.2) with options -N 2 --b2-very-sensitive --b2-L 25. Annotations from Ensembl Gene version 67 were used as gene model with Tophat. Read counts per annotated gene were computed using HTSeq (version 0.5.3p9) and expression level of each gene was quantified by computing the number of fragments detected per kilobase per million of reads (FPKM) using custom R script. Genes were assigned to an expression level bin based on the mean FPKM values of the two biological replicates. Differential expression analysis was performed using DESeq2 (version 1.6.3), and genes with an adjusted p-value<0.05 were considered differentially expressed. For determining gene expression levels in wild type and Dnmt1-conditional knockout and matched wild type E10.5 PGCs, raw RNA-Seq reads were downloaded from GSE74938.sup.24 and processed as above.
[0130] HCPs methylated and demethylating in PGCs during epigenetic reprogramming (cluster 3,
[0131] For classification of GRR genes (
[0132] Tet1 and 5mC/5hmC Detection by Immunofluorescence
[0133] The embryonic trunk (E10.5) or genital ridge (E12.5/E13.5) was first fixed in 2% PFA (in PBS) for 30 min at 4 C. Following fixation, tissue was washed in PBS three times for 10 min and then incubated in 15% sucrose in PBS overnight. After rinsing with 1% BSA in PBS the following day, the tissue was embedded in OCT Embedding Matrix (Thermo Scientific Raymond Lamb) and frozen using liquid nitrogen. Samples were subsequently stored at 80 C. A Leica CM 1950 cryostat was used to cut 10 m sections from the frozen embedded tissue. Sections were settled on poly-lysine slides (Thermo Scientific) and post-fixed with 2% PFA in PBS for 3 minutes.
[0134] For detection of Tet1, sections were washed three times for 5 min with PBS. After incubating for 30 min at room temperature in 1% BSA/PBS containing 0.1% Triton X-100, the sections were incubated with primary antibodies listed at 4 C. overnight in the same buffer. Sections were subsequently washed three times in 1% BSA/PBS containing 0.1% Triton X-100 for 5 min and incubated with secondary antibodies in the same buffer for 1 hour in the dark at room temperature. Secondary antibody incubation was followed by three 5 min washes with PBS. DNA was then stained with DAPI (100 ng/ml). After a final wash in PBS for 10 min, the sections were mounted with Vectashield (Vector Laboratories).
[0135] For detection of 5hmC/5mC, sections were washed three times for 5 min with PBS. Post-fixed sections were first permeabilized for 30 min with 0.5% Triton X-100 (in 1% BSA/PBS), and subsequently treated with RNase A (10 mg/ml; Roche) in 1% BSA/PBS for 1 hour at 37 C. Following three 5 min washes with PBS, sections were incubated with 4N HCl for 10-20 min at 37 C. to denature genomic DNA, followed by three 10 min washes with PBS. After incubating for 30 min at room temperature in 1% BSA/PBS containing 0.1% Triton X-100, the sections were incubated with primary antibodies listed at 4 C. overnight in the same buffer. Sections were subsequently washed three times in 1% BSA/PBS containing 0.1% Triton X-100 for 5 min and incubated secondary antibodies in the same buffer for 1 hour in the dark at room temperature. Secondary antibody incubation was followed by three 5 min washes with PBS. DNA was then stained with propidium iodide (PI) (0.25 mg/ml). After a final wash in PBS for 10 min, the sections were mounted with Vectashield (Vector Laboratories).
[0136] The following primary antibodies were used in this study: anti-SSEA1 (gifted by Dr P. Beverly via Dr G. Durcova Hills); anti-MVH (Abcam 27591 or Abcam 13840); anti-5hmC (Active motif 39791), anti-5mC (Diagenode C15200081-100); anti-Tet1 (GeneTex GTX125888); anti-GFP (Abcam 5450). The following secondary antibodies were used in this study: Alexa Fluor 647 Goat anti-Mouse IgM (Invitrogen A21238); Alexa Fluor 488 Goat anti-Rabbit IgG (Invitrogen A11008); Alexa Fluor 405 Goat anti-Mouse IgG 1:300 (Invitrogen A31553); Alexa Fluor 488 Goat anti-Mouse IgG 1:300 (Invitrogen A11001); Alexa Fluor 405 Goat anti-Rabbit IgG 1:300 (Invitrogen A31556); Alexa Fluor 568 Donkey anti-Rabbit IgG (Invitrogen A10042); Alexa Fluor 488 Donkey anti-Goat IgG (Invitrogen A11055).
[0137] Locus-Specific Bisulphite Sequencing
[0138] Bisulphite treatment of genomic DNA was carried out using the Imprint DNA modification kit (Sigma). The following primers were used for the semi-nested amplification of the Dazl promoter: F1: GATTTTTGTTATTTTTTAGTTTTTTTAGGAT; F2: TTTATTTAAGTTATTATTTTAAAAATGGTATT; R: AGAAACAAGCTAGGCCAGCTGAGAGAATTCT. The following primers were used for the semi-nested amplification of the IG-DMR ICR: F1: GTGTTAAGGTATATTATGTTAGTGTTAGG; F2: ATATTATGTTAGTGTTAGGAAGGATTGTG; R: TACAACCCTTCCCTCACTCCAAAAATT. The following primers were used for the nested amplification of the Peg3 ICR: F1: TTTTTAGATTTTGTTTGGGGGTTTTTAATA; F2: TTGATAATAGTAGTTTGATTGGTAGGGTGT; R1: AATCCCTATCACCTAAATAACATCCCTACA; R2: ATCTACAACCTTATCAATTACCCTTAAAAA. Methylation levels were assessed by QUMA, using default settings with duplicate bisulphite sequences excluded.
[0139] Mass Spectrometry
[0140] Genomic DNA from between 100 and 2,000 FACS-sorted PGCs was extracted using ZR-Duet DNA/RNA Miniprep kit (Zymo Reasearch) following manufacturer instructions and eluted in LC/MS grade water. DNA was digested to nucleosides using a digestion enzyme mix provided by NEB. A dilution-series made with known amounts of synthetic nucleosides and the digested DNA were spiked with a similar amount of isotope-labelled nucleosides (provided by Dr T. Carell (LMU, Germany)) and separated on an Agilent RRHD Eclipse Plus C18 2.1100 mm 1.8u column by using the UHPLC 1290 system (Agilent) and an Agilent 6490 triple quadrupole mass spectrometer. To calculate the quantity of individual nucleosides, standard curves representing the ratio of unlabelled over isotope-labelled nucleosides were generated and used to convert the peak-area values to corresponding quantity. Threshold for quantification is a signal-to-noise (calculated with a peak-to-peak method) above 10.
[0141] Western Blot
[0142] mESCs were lysed by sonication in RIPA buffer (150 mM sodium chloride, 1.0% Triton X-100, 0.5% sodium deoxychlorate, 0.1% sodium dodecylsulfate, 50 mM Tris pH 8.0) and protease-inhibitor cocktail (Roche, 11 697 498 001). Cell debris were removed by centrifugation at 14000 g 5 min 4 C. Protein was quantified using the BCA protein assay (Thermo, 23227). 2 g (for H2A and H2Aub) or 20 g (for Tet1) of each protein extract was loaded onto an 15% or 8% SDS polyacrylamide gel and transferred to a PVDF membrane after electrophoresis. Membranes were blocked with 5% BSA for 1 hour and then incubated overnight at 4 C. with primary antibodies at the following dilutions: anti-H2A antibody (Abcam, 18255) 1:2000; anti-ubiquityl H2A antibody (Cell Signalling 8240) 1:2000; anti-Tet1 antibody [N1] (GeneTex GTX125888) 1:1000; anti-Lamin B antibody (C20) (Santa Cruz Biotechnologies, sc-6216) 1:10000. Donkey anti-rabbit IgG-HRP (Santa Cruz Biotechnologies, sc-2077) or donkey anti-goat IgG-HRP (Santa Cruz Biotechnologies, sc-2056) secondary antibody were incubated for 1h at room temperature. Blots were developed by using Luminata Crescendo Western HRP substrate (EMD Milipore).
REFERENCES
[0143] Alphabetised References: [0144] Chung et al., 2008 Human Embryonic Stem Cell Lines Generated without Embryo Destruction. Cell Stem Cell. 2(2) 113-117. Epub 2008 Jan 10 [0145] Galdon et al. In Vitro Spermatogenesis: How Far from Clinical Application? Curr Urol Rep 17; 49 (2016) [0146] Hill et al. Epigenetic reprogramming enables the primordial germ cell-to-gonocyte transition, Nature; 555(7696): 392-396; 15 Mar. 2018. [0147] Jiang et al., Pluripotency of mesenchymal stem cells derived from adult marrow. Nature 418, 41-49, 2002 [0148] Kaji et al Virus-free induction of pluripotency and subsequent excision of reprogramming factors. Nature. Online publication 1 Mar. 2009 [0149] Kanatsu-Shinohara et al., Generation of pluripotent stem cells from neonatal mouse testis. Cell 119, 1001-1012,2004 [0150] Phillips et al. Spermatogonial stem cell regulation and spermatogenesis. Phil Trans R Soc B 365, 1663-1678 (2010) [0151] Matsui et al., Derivation of pluripotential embryonic stem cells from murine primordial germ cells in culture. Cell 70, 841-847, 1992 [0152] Medrano et al. Human somatic cells subjected to genetic induction with six germ line-related factors display meiotic germ cell-like features. Scientific Reports 6:24956: 10.1038 (2016) [0153] Numbered References (MAIN): [0154] 1 Lesch, B. & Page, D. Genetics of germ cell development. Nat Rev Genet 13, 781-794 (2012). [0155] 2 Guibert, S., Forn6, T. & Weber, M. Global profiling of DNA methylation erasure in mouse primordial germ cells. Genome Res 22, 633-641 (2012). [0156] 3 Hackett, J. et al. Germline DNA demethylation dynamics and imprint erasure through 5-hydroxymethylcytosine. Science 339, 448-452 (2013). [0157] 4 Hajkova, P. et al. Chromatin dynamics during epigenetic reprogramming in the mouse germ line. Nature 452, 877-881 (2008). [0158] Hajkova, P. et al. Epigenetic reprogramming in mouse primordial germ cells. Mech Dev 117, 15-23 (2002). [0159] 6 Hill, P., Amouroux, R. & Hajkova, P. DNA demethylation, Tet proteins and 5-hydroxymethylcytosine in epigenetic reprogramming: an emerging complex story. Genomics 104, 324-333 (2014). [0160] 7 Lee, J. et al. Erasing genomic imprinting memory in mouse clone embryos produced from day 11.5 primordial germ cells. Development 129, 1807-1817 (2002). [0161] 8 Seisenberger, S. et al. The dynamics of genome-wide DNA methylation reprogramming in mouse primordial germ cells. Mol Cell 48, 849-862 (2012). [0162] 9 Yamaguchi, S. et al. Tet1 controls meiosis by regulating meiotic gene expression. Nature 492, 443-447 (2012). [0163] Yamaguchi, S., Shen, L., Liu, Y., Sendler, D. & Zhang, Y. Role of Tet1 in erasure of genomic imprinting. Nature 504, 460-464 (2013). [0164] 11 Hajkova, P. et al. Genome-wide reprogramming in the mouse germ line entails the base excision repair pathway. Science 329, 78-82 (2010). [0165] 12 Hayashi, K. et al. Offspring from oocytes derived from in vitro primordial germ cell-like cells in mice. Science 338, 971-975 (2012). [0166] 13 Hayashi, K., Ohta, H., Kurimoto, K., Aramaki, S. & Saitou, M. Reconstitution of the mouse germ cell specification pathway in culture by pluripotent stem cells. Cell 146, 519-532 (2011). [0167] 14 Hikabe, O. et al. Reconstitution in vitro of the entire cycle of the mouse female germ line. Nature 539, 299-303 (2016). [0168] Sun, Z. et al. High-resolution enzymatic mapping of genomic 5-hydroxymethylcytosine in mouse embryonic stem cells. Cell Rep 3, 567-576 (2013). [0169] 16 Huang, Y. et al. The behaviour of 5-hydroxymethylcytosine in bisulfite sequencing. PLoS One 5, e8888 (2010). [0170] 17 Yamaguchi, S. et al. Dynamics of 5-methylcytosine and 5-hydroxymethylcytosine during germ cell reprogramming. Cell Res 23, 329-339 (2013). [0171] 18 Dawlaty, M. et al. Tet1 is dispensable for maintaining pluripotency and its loss is compatible with embryonic and postnatal development. Cell Stem Cell 9, 166-175 (2011). [0172] 19 Weber, M. et al. Distribution, silencing potential and evolutionary impact of promoter DNA methylation in the human genome. Nat Genet 39, 457-466 (2007). [0173] Tahiliani, M. et al. Conversion of 5-methylcytosine to 5-hydroxymethylcytosine in mammalian DNA by MLL partner TET1. Science 324, 930-935 (2009). [0174] 21 Williams, K. et al. TET1 and hydroxymethylcytosine in transcription and DNA methylation fidelity. Nature 473, 343-348 (2011). [0175] 22 Vella, P. et al. Tet proteins connect the O-linked N-acetylglucosamine transferase Ogt to chromatin in embryonic stem cells. Mol Cell 49, 645-656 (2013). [0176] 23 Tsumura, A. et al. Maintenance of self-renewal ability of mouse embryonic stem cells in the absence of DNA methyltransferases Dnmt1, Dnmt3a and Dnmt3b. Genes Cells 11, 805-814 (2006). [0177] 24 Hargan-Calvopina, J. et al. Stage-Specific Demethylation in Primordial Germ Cells Safeguards against Precocious Differentiation. Dev Cell 39, 75-86 (2016). [0178] Mansour, A. et al. The H3K27 demethylase Utx regulates somatic and germ cell epigenetic reprogramming. Nature 488, 409-413 (2012). [0179] 26 Yokobayashi, S. et al. PRC1 coordinates timing of sexual differentiation of female primordial germ cells. Nature 495, 236-240 (2013). [0180] 27 Ismail, I., McDonald, D., Strickfaden, H., Xu, Z. & Hendzel, M. A small molecule inhibitor of polycomb repressive complex 1 inhibits ubiquitin signaling at DNA double-strand breaks. J Biol Chem 288, 26944-26954 (2013). [0181] 28 Deplus, R. et al. TET2 and TET3 regulate GlcNAcylation and H3K4 methylation through OGT and SET1/COMPASS. EMBO J32, 645-655 (2013). [0182] 29 Amouroux, R. et al. De novo DNA methylation drives 5hmC accumulation in mouse zygotes. Nat Cell Biol 18, 225-233, doi:10.1038/ncb3296 (2016). [0183] 30 Hon, G. et al. 5mC Oxidation by Tet2 Modulates Enhancer Activity and Timing of Transcriptome Reprogramming during Differentiation. Mol Cell 56, 286-297 (2014). [0184] 31 Yang, H. et al. One-step generation of mice carrying reporter and conditional alleles by CRISPR/Cas-mediated genome engineering. Cell 154, 1370-1379 (2013). [0185] 32 Boyle, P. et al. Gel-free multiplexed reduced representation bisulfite sequencing for large-scale DNA methylation profiling. Genome Biol 13, R92 (2012). [0186] 33 Kurimoto, K. et al. Quantitative Dynamics of Chromatin Remodeling during Germ Cell Specification from Mouse Embryonic Stem Cells. Cell Stem Cell 16, 517-532 (2015). [0187] 34 Borgel, J. et al. Targets and dynamics of promoter DNA methylation during early mouse development. Nat Genet 42, 1093-1100 (2010). [0188] 35 Yu, M. et al. Base-resolution analysis of 5-hydroxymethylcytosine in the mammalian genome. Cell 149, 1368-1380 (2012). [0189] 36 Xu, Y. et al. Genome-wide regulation of 5hmC, 5mC, and gene expression by Tet1 hydroxylase in mouse embryonic stem cells. Mol Cell 42, 451-464 (2011). [0190] 37 Brookes, E. et al. Polycomb associates genome-wide with a specific RNA polymerase II variant, and regulates metabolic genes in ESCs. Cell Stem Cell 10, 157-170 (2012). [0191] 38 Cooper, S. et al. Targeting polycomb to pericentric heterochromatin in embryonic stem cells reveals a role for H2AK119ul in PRC2 recruitment. Cell Rep 7, 1456-1470 (2014). [0192] 39 Hashimoto, H. et al. Structure of a Naegleria Tet-like dioxygenase in complex with 5-methylcytosine DNA. Nature 506, 391-395 (2014).
[0193] All references referred to above are hereby incorporated by reference.