Mouse strains and experimentation
The Spocd1HA and Miwi2tdTomato (Miwi2tdTom) mouse alleles have been described previously4,31. Miwi2tdTom is a Miwi2 null allele and is used as such31. Both lines were kept on a mixed B6CBAF1/Crl;C57BL/6âN;Hsd:ICR (CD1) genetic background. The Spocd1ÎSPIN1 allele was generated by CRISPRâCas9 gene editing as previously described32,33. A single guide RNA (sgRNA) (GGGTCAGGAATCAGGCTTGT) together with Cas9 mRNA and a single-stranded DNA oligonucleotide containing the eight-alanine mutation flanked by 85âbase pairs (bp) of homology arm (AGATGGTAAACAGTTGAAGCCAAGGCAGGGAGGATTTCAGGCAGAGCCTTGCCATACTCTCTCTCAGCAGGTCTACACTGGGTCAGCTGCCGCAGCGGCCGCTGCCGCCGCTGCAAGTCAGCCAGGACAAATTGAACCTCTGGAGGAGTTGGACACCAACTCAGCCAGAAGGAAGAGAAGGCCCACAACTGCTCACCCTA) was injected into the cytoplasm of fertilized single-cell zygotes (B6CBA F1/Crl). F0 offspring were screened by PCR and the Spocd1ÎSPIN1 allele was confirmed by Sanger sequencing. The allele was established from one founder animal and back-crossed several times to a C57BL/6N genetic background. The Spocd1ÎSPIN1 mice were thus on a mixed B6CBAF1/Crl;C57BL/6N genetic background. Animals were genotyped using a PCR of four primers (F, GACCCTGTATTTATTGAAGTCACTG; R, CCTCAGTGACATCAGGCGGA; WT-F, CACTGGGTCAGGAATCAGGC; and âSpin-R, GTCCTGGCTGACTTGCAGC). Mice carrying the Oct4eGFP reporter allele34 were originally obtained from Jackson Laboratories (B6;129S4-Pou5f1tm2Jae/J (Oct4-eGFP), stock number 008214).
Male fertility was assessed by mating studs to Hsd:ICR (CD1) wild-type females and counting the number of pups born for each plugged female. For each experiment, animal tissue samples were collected from one or more litters and allocated to groups according to genotype. No further randomization or blinding was applied during data acquisition and analysis.
Animals were maintained at the University of Edinburgh, UK, in accordance with the regulation of the UK Home Office, or at the Institute for Molecular Biology in Mainz, Germany, in accordance with local and European animal-welfare laws. Ethical approval for the UK mouse experimentation has been given by the University of Edinburghâs Animal Welfare and Ethical Review Body and the work done under licence from the UK Home Office. Animal experiments done in Germany were approved by the ethical committees on animal care and use of the federal states of Rheinland-Pfalz, Germany, covered by LUA licence G 23-5-049.
Immunofluorescence
Immunofluorescence experiments were done as previously described35. The following primary antibodies were used in this study: anti-HA (Cell Signaling Technologies) 1:200; anti-LINE1-ORF1p (ref. 36) 1:500; anti-IAP-GAG (a gift from B. Cullen, Duke University) 1:500; anti-γH2AX (Bethyl Laboratories) 1:500; anti-MIWI2 (a gift from R. Pillai, Université de Genève) 1:500; anti-SPOCD1 rabbit serum rb175 1:500 (OâCarroll laboratory antibody); anti-SPIN1 (Cell Signaling Technologies) 1:500 (of a custom preparation of 1.1âμgâμlâ1 in PBS). Images were taken on a Zeiss Observer or Zeiss LSM880 with an Airyscan module. Images acquired using the Airyscan module were deconvoluted with the Zeiss Zen software âAiryscan processingâ with settings 3D and a strength of 6. ImageJ and Zeiss Zen software were used to process and analyse the images.
Cell culture, transfection, immunoprecipitation and western blotting
HEK293T cells (OâCarroll laboratory stock, not further authenticated, tested for mycoplasma contamination) were cultured and transfected as previously described4 with a minor modification, and 3âμl Jetprime reagent was used. On day 2 after transfection, cells were washed twice with PBS and resuspended in 1âml lysis buffer (IP buffer: 150âmM KCl, 2.5âmM MgCl2, 0.5% Triton X-100, 50âmM Tris-HCl, pH 8, supplemented with 1à protease inhibitors (cOmplete ULTRA EDTA-free, Roche) with 37 units per ml benzonase (Millipore)) and lysed for 30âmin, rotating at 4â°C. The lysate was cleared by centrifugation for 10âmin at 21,000g. Cleared lysate (800âμl) was incubated with 20âμl of anti-HA beads (Pierce) that had been calibrated in lysis buffer and incubated for 1âh at 4â°C on a rotating wheel. The beads were washed four times with lysis buffer. Immunoprecipitates were eluted at 50â°C for 10âmin in 20âμl 0.1% sodium dodecyl sulphate (SDS), 50âmM Tris-HCl, pH 8. Lysates and eluates were run on a 4â12% bisâtris acrylamide gel (Invitrogen) and blotted onto a nitrocellulose membrane (Amersham Protran 0.45 NC) according to standard laboratory procedures. The membrane was blocked with blocking buffer (4% (w/v) skimmed milk powder (Sigma-Aldrich) in PBS-T (phosphate buffered saline, 0.1% Tween-20)) and subsequently incubated for 1âh with primary antibodies (anti-HA (C29F4s, Cell Signaling Technologies), 1:1,000; anti-FLAG (M2, Sigma-Aldrich) 1:1,000, anti-SPOCD1 rabbit serum rb175 (OâCarroll laboratory antibody) 1:500 or anti-α-Tubulin (T9026, Sigma-Aldrich) 1:1,000) in blocking buffer. The anti-α-tubulin staining was used as loading control on the same blot as the experimental staining. After three PBS-T washes for 10âmin, the membrane was incubated with secondary antibodies (IRDye 680RD donkey anti-rabbit or IRDye 800CW donkey anti-mouse, LI-COR, 1:10,000) in blocking buffer for 1âh. It was washed three times for 10âmin in PBS-T and imaged on a LI-COR Odyssey CLx system. Exposure of the entire images was optimized in Image Studio Lite (LI-COR), and areas of interest were cropped for presentation.
Protein alignments and structure prediction
The mouse SPOCD1 AlphaFold2 protein structure prediction model22,23 was downloaded from the AlphaFold Protein Structure Database (https://www.alphafold.ebi.ac.uk/). Models for the SPOCD1âSPIN1 interaction, as well as the single SPOCD1 proteins from Anolis, Xenopus and Latimeria, were generated with AlphaFold2 (refs. 22,23) on ColabFold37. The model was visualized using PyMol38. Multiple sequence alignments of SPOCD1 and SPIN1 were generated with ClustalW39 and edited in Jalview40. For SPOCD1, alignments were edited based on secondary-structure elements of the AlphaFold2 model (B1ASB6) using Jalview40.
Protein purification
GST-tagged mouse SPOCD1 fragments (amino acids 203â409), Anolis SPOCD1 fragments (XP_008116112.1, amino acids 457â748), Xenopus SPOCD1 fragments (XP_031752218.1, amino acids 1â229), Latimeria SPOCD1 fragments (XP_014348336.1, amino acids 510â1009) and His-tagged SPIN1 (amino acids 49â262) were cloned in a pET-based backbone. Proteins were expressed in Escherichia coli BL21 (DE3). Bacteria were grown in 2xTY media at 37â°C until an optical density of 0.8 was reached. Then, the temperature was reduced to 18â°C, the bacteria were induced with 1âmM IPTG and grown for another 14â16âh. Cells were collected and pellets were stored at â80â°C until purification. The pellets were resuspended in 50âml lysis buffer (20âmM Tris-HCl, pH 7.5, 200âmM NaCl, 2.5âmM imidazole, 0.5âmM β-mercaptoethanol, Roche cOmplete EDTA-free Protease Inhibitor Cocktail, 0.01âmgâmlâ1 DNaseI (Sigma) and 2âmM AEBSF (Pefabloc) for SPIN1, or 20âmM Tris-HCl, pH 7.5, 200âmM NaCl, 1âmM DTT, Roche cOmplete EDTA-free Protease Inhibitor Cocktail, 0.01âmgâmlâ1 DNaseI (Sigma) and 2âmM AEBSF (Pefabloc) for SPOCD1) and cells were lysed with the Constant systems 1.1âkW TS cell disruptor at 25 kPSI. The cleared lysate was used to load on a cOmplete His-Tag Purification Column (Roche) for SPIN1 or incubated with 7âml glutathione sepharose high-performance beads (Cytiva) for SPOCD1 calibrated in the respective buffer. Elution from column/beads with increasing (2.5â500âmM) imidazole gradient for SPIN1 or GST elution buffer containing 20âmM reduced glutathione for SPOCD1. The fractions of interest were pooled and dialysed overnight in 20âmM Tris-HCl, pH 7.5, 100â150âmM NaCl, 1âmM DTT. The SPIN1 construct was cleaved with GSTâ3C protease (made in our lab) overnight. The SPOCD1 constructs were concentrated and stored at â80â°C until used. SPIN1 was further purified by ion exchange with a gradient of 100â1,000âmM NaCl (Resource Q, Cytiva) and size-exclusion chromatography (HiLoad 16/600 Superdex 200âpg, Cytiva). Finally, the protein was concentrated and stored at â80â°C until used.
Nucleosome pull-downs with recombinant SPIN1-SPOCD1 proteins
Histone H3 site-specifically modified with H3K4me3 and/or H3K9me3 was generated by native chemical ligation (NCL) and assembled into nucleosomes as described previously41,42. In brief, Xenopus H3 and H4 and human H2A and H2B were expressed in E. coli and purified from inclusion bodies. For NCL, a tail-less histone H3 lacking residues 1â31 and containing a threonine-to-cysteine substitution at position 32 and a cysteine-to-alanine substitution at position 110 of Xenopus H3 (H3Î1â31T32C C110A) was expressed in E. coli and purified in the same way. NCL reactions were carried out with synthetic carboxy-terminal benzyl thioester peptides spanning residues 1â31 of histone H3.1 and carrying the desired modifications at K4 and K9 (Peptide Protein Research) in 6âM guanidine HCl, 250âmM sodium phosphate buffer, pH 7.2, 150âmM 4-mercaptophenylacetic acid (MPAA, Sigma) and 50âmM TCEP for 72âh at room temperature. Ligated full-length modified histone H3 was purified through cation-exchange chromatography on a HiTrap SP column (Cytiva). Histone octamers were reconstituted by dialysis and purified by gel filtration on an S200 size-exclusion column (Cytiva). For the generation of trans-histone octamers carrying H3K4me3 and H3K9me3 on separate copies of histone H3, the H3XâH3Y system was used43, starting from H3Î1â31T32C C110A constructs that also contained the required H3X and H3Y mutations. H3X was used for H3K4me3 and H3Y for H3K9me3. A biotinylated 209-bp DNA fragment containing the 601 nucleosome positioning sequence was generated by PCR and purified by ion-exchange chromatography on a HiTrap Q column followed by ethanol precipitation. Mononucleosomes were then assembled from histone octamers and 601 DNA by gradient dialysis. Nucleosome assembly was verified by native gel electrophoresis on 6% acrylamide gels in 0.5Ã TGE buffer (12.5âmM Tris, pH 8.0, 95âmM glycine and 0.5âmM EDTA).
Nucleosome pull-down assays were done essentially as described previously44. All incubations and washes were performed at 4â°C with end-over-end rotation, and all centrifugation steps were done at 1,500g for 2âmin at 4â°C. Then, 23âpmol (3âµg) of recombinant, site-specifically modified nucleosomes were bound to streptavidin sepharose high-performance beads (Cytiva) by overnight incubation in pull-down buffer (20âmM HEPES, pH 7.9, 175âmM NaCl, 10% glycerol, 1âmM EDTA, 1âmM DTT, 0.1% NP-40, 0.1âmgâmlâ1 BSA). Before incubation, beads were blocked with 1âmgâmlâ1 BSA in pull-down buffer. Nucleosome-bound beads were washed three times with pull-down buffer before incubation with recombinant SPIN1 and SPOCD1 proteins for 2âh. His-tagged SPIN1 (49â262) and His-tagged SPOCD1 fragment 1b were expressed and purified as above. SPIN1âSPOCD1 fragment 1b complexes were purified by size-exclusion chromatography on an S200 increase column (Cytiva) as above. For the experiment shown in Fig. 1j, 23âpmol of protein was used. After incubation with recombinant proteins, beads were washed three times with high-salt pull-down buffer (as above but with 350âmM NaCl) for 5âmin. Nucleosomes and bound proteins were eluted by boiling in 1.5à SDS sample buffer (95âmM Tris HCl, pH 6.8, 15% glycerol, 3% SDS, 75âmM DTT, 0.15% bromophenol blue). Binding was analysed by western blotting with antibodies against His tag (Sigma H1029, lot 033m4785) 1:1,000. Antibodies against histone H3 (Abcam ab176842, lot GR1494741-36) 1:2,500, H3K4me3 (Cell Signaling) 1:2,000 and H3K9me3 (Abcam ab176916) 1:1,000 were used to verify nucleosome loading and modification state.
Analytical size-exclusion chromatography
For analytical size-exclusion chromatography, 125âμg SPIN1 and/or 500âμg mouse GSTâSPOCD1-F1b were used for each run. Proteins were diluted in 250âμl size-exclusion chromatography buffer (20âmM HEPES, pH 7.5, 150âmM NaCl, 1âmM DTT) and injected on a Superdex 200 10/300 GL column. Peak fractions were collected, loaded on an SDSâPAGE gel and visualized by Coomassie staining.
Crosslinking mass-spectrometry analysis
Recombinant fragments (25âμg) of SPOCD1 (GSTâF1b) and SPIN1 were incubated in 20âmM HEPES, pH 7.5, 150âmM NaCl, 1âmM DTT and crosslinked with BS3 (bis(sulfosuccinimidyl)suberate) (Thermo Fisher Scientific) at BS3:protein ratios of 1:1, 2:1 and 4:1 (w/w) for 2âh on ice. The crosslinking reaction was stopped by adding 2âμl ammonium bicarbonate (2.0âM). Crosslinking products were run on 4â12% bis-Tris NuPAGE (Invitrogen) for 15âmin and briefly stained using Instant Blue (Expedeon). Bands at more than 150âkD were excised and the proteins were reduced with 10âmM DTT for 30âmin at room temperature, alkylated with 55âmM iodoacetamide for 20âmin at room temperature and digested using 13ângâμlâ1 trypsin (Promega) overnight at 37â°C37. The digested peptides were loaded onto C18-Stage-tips38 for liquid chromatography with tandem mass spectrometry (LC-MS/MS) analysis. The LC-MS/MS analysis was performed using Orbitrap Fusion Lumos (Thermo Fisher Scientific) with a âhigh/highâ acquisition strategy. The peptide separation was done on an EASY-Spray column (50âcmâÃâ75âμm internal diameter, PepMap C18, 2-μm particles, 100âà pore size; Thermo Fisher Scientific). Mobile phase A consisted of water and 0.1% (v/v) formic acid. Mobile phase B consisted of 80% (v/v) acetonitrile and 0.1% (v/v) formic acid. Peptides were loaded at a flow rate of 0.3âμlâminâ1 and eluted at 0.25âμlâminâ1 using a linear gradient going from 2% mobile phase B to 40% mobile phase B over 102 or 132âmin (each sample was run twice with different gradients), followed by a linear increase from 40% to 95% mobile phase B in 11âmin. The eluted peptides were introduced directly into the mass spectrometer. MS data were acquired in the data-dependent mode with a 3âs acquisition cycle. Precursor spectra were recorded in the Orbitrap with a resolution of 120,000 and a mass-to-charge ratio (m/z) range of 350â1,700. Ions with a precursor charge state between 3+ and 8+ were isolated with a window size of m/zâ=â1.6 and fragmented using high-energy collision dissociation with a collision energy of 30. The fragmentation spectra were recorded in the Orbitrap with a resolution of 15,000. Dynamic exclusion was enabled with single repeat count and 60âs exclusion duration. The mass-spectrometric raw files were processed into peak lists using ProteoWizard (v.3.0)39 and crosslinked peptides were matched to spectra using Xi software (v.1.7.6.4)40 with in-search assignment of mono-isotopic peaks41. Search parameters were: MS accuracy, 3âppm; MS/MS accuracy, 5âppm; enzyme, trypsin; crosslinker, BS3; maximum missed cleavages, 4; fixed modification, carbamidomethylation on cysteine; variable modifications, oxidation on methionine; fragments b and y ions with loss of H2O, NH3 and CH3SOH. The linkage specificity for BS3 was assumed to be at lysine, serine, threonine, tyrosine and protein N termini. Identified candidates of crosslinked peptides were validated by Xi software40, and only auto-validated crosslinked peptides were used. Identified crosslinks underlying Fig. 2b are shown in Supplementary Table 1.
ChIP sequencing analysis
Raw fastq.gz sequencing files for ChIP-seq of H3K4me3 and H4K9me3 were downloaded from the Sequence Read Archive record SRP165187 (ref. 24). Paired-end reads were preprocessed to remove adapter sequences and trim low-quality bases using Trimmomatic v.0.35 (ref. 45). Tru-seq adapter sequences were used in the case of ChIP-seq samples. Trimmed reads were aligned to the mouse mm10 genome with bwa mem v.0.7.16 (ref. 46) using the -M parameter. Alignments were filtered to remove duplicate reads with Picard MarkDuplicates v.2.24.0 (http://broadinstitute.github.io/picard/) and improper alignments with Samtools view v.1.11 -F 260 -f 3 (ref. 47). In the case of multi-mapping reads, a single alignment (marked as primary by bwa) was selected for downstream analysis. BAM files were converted to normalized bigWig files for visualization and plotting using deepTools48 bamCoverage v.3.5.0 with the following parameters: -bs 1 –normalizeUsing BPM.
ChIP heatmaps and average profile plots
Genomic annotations for repetitive elements L1Md_A, L1Md_T, L1Md_F (combining elements classified as L1MD_F, L1Md_F2, L1Md_F3), L1Md_Gf, IAPEy and MMERVK_10C were extracted from Repeat Masker using the UCSC table browser. Normalized read coverage was computed across these elements using deepTools v.3.5.0 computeMatrix. The central regions were length-normalized to 5âkb with flanking regions ±2âkb from the start and end positions. Heatmaps were drawn using deepTools v.3.5.0 plotHeatmap, separating each repetitive element and sorting rows in descending order of total signal. LINE1 elements (L1Md_A, L1Md_F and L1Md_T) were further separated into young LINE1 elements based on a divergence of 38 bases per kb or less from a consensus sequence4 or the presence of an intact functional promoter denoted by the presence of specific monomer annotations49. Monomers associated with inert promoters (subtypes 6 and 2) were removed from the analysis. Average profiles were generated for each experiment and each category of repetitive element by calculating the mean signal between replicate samples. Computations were performed in R, with the seqplots package50, using bins of 50 bases, flanking regions of 2âkb and a central-region length normalized to 5âkb. Final plots were drawn and formatted using the tidyverse packages51.
IP-MS
IP-MS of SPOCD1âHA from Spocd1HA/+ E14.5 fetal testis using 50âμl of anti-HA beads (Pierce, 88837) was done as previously described4, with a reduced number of 25 testes per replicate. Wild-type fetal testes were used as controls.
Fluorescence-activated cell sorting (FACS)
To purify foetal germ cells for CUT&Tag analysis, E14.5 testes were dissected from embryos carrying the Oct4eGFP allele34. A single cell suspension was obtained by sequential treatment with 100âµl collagenase solution at 37â°C for 8âmin (10 units of collagenase A (Sigma-Aldrich 10103578001); 2à NEAAs (Gibco); 2à Na-pyruvate (Gibco); 25âmM HEPESâKOH, pH 7.5) and 200âµl TryPLE Express (Gibco) at 37â°C for 5âmin with gentle flicking and pipetting of the solution to aid dissociation. Digestion was neutralized by 70âµl prewarmed FBS and cells were collected by spinning at 600g for 4âmin at room temperature followed by two washes in FACS buffer (1à PBS; 2âmM EDTA, 25âmM HEPES-KOH, pH 7.5, 1.5% BSA, 10% FBS; 2âµgâmlâ1 DAPI) and filtering (Corning, 352235) just before sorting. Cell sorting was done on an Invitrogen Bigfoot using a 100âμm nozzle and gating for DAPI-negative (live), OCT4âeGFP-positive (germ cells) populations into collection tubes containing 100âµl 1à PBS.
For EM-seq, CD9+ spermatogonia were sorted from P14 testes as described previously52 using Fc block (eBioscience, 14-0161-86, clone 93, lot 2297433) 1:50; biotin-conjugated anti-CD45 (eBioscience, 13-0451-85, clone 30-F11, lot 2349865) 1:400, and biotin-conjugated anti-CD51 (Biolegend, 104104, clone RMV-7, lot B308465) 1:100 anti-CD9APC (eBioscience, 17-0091-82, clone eBioKMC8, lot 2450733) 1:200, anti-cKitPE-Cy7 (eBioscience, 25-1171-82, clone 2B8, lot 2191977) 1:1,600, streptavidinV450 (BD bioscience, 560797, lot 1354158) 1:400 and 1âμgâmlâ1 DAPI. Cells were sorted into DMEM media on a BD Aria II sorter, pelleted for 5âmin at 500g and snap frozen in liquid nitrogen.
For gating strategies, see Supplemental Fig. 2.
CUT&Tag assays
CUT&Tag was done on FACS-isolated fetal germ cells as previously described26, with some minor modifications. First, 10,000 to 20,000 germ cells were bound to 10âµl concanavalin A-coated beads (Polysciences, 86057-10). After binding to beads, cells were fixed with 0.2% formaldehyde for 2âmin followed by quenching with glycine (125âmM) and washed with Dig-Wash buffer while separated on the magnet. The remaining steps were as previously described26, using pAâTn5 at a 1:400 dilution (Diagenode, C01070001) and 15 PCR cycles of library amplification. Libraries were cleaned up by magnetic bead-based solid-phase separation and assessed on a Tapestation (Agilent). Antibodies and dilutions used for CUT&Tag were rabbit IgG control (Abcam, ab37415, lot GR3219601-1) at 1:50, rabbit anti-SPIN1 (Cell Signaling, 89139S, lot 2) at 1:50, rabbit anti-H3K4me3 (Merck-Milipore, 07-473, lot 403371) at 1:50, rabbit anti-H3K9me3 (Abcam, ab8898, lot GR27111-1) at 1:50, and guinea pig anti-rabbit IgG (Antibodies Online, ABIN101961, lot NE-200-032309) at 1:100. Pooled libraries were sequenced using paired-end 150âbp on a NextSeq 2000 instrument (Illumina).
CUT&Tag analysis
First, 150b and 155b paired-end CUT&Tag sequencing reads were processed and aligned to the mouse-genome assembly (version GRCm38) using the NF-core (https://doi.org/10.5281/zenodo.7715959) CUT&RUN Nextflow pipeline version 3.1 (ref. 53). The pipeline performed adapter trimming with Trim Galore (https://doi.org/10.5281/zenodo.5127898) and reference-genome alignment with Bowtie2 (ref. 54). Multimap reads were included using the parameter –minimum_alignment_q_score 0. The pipeline performed further filtering of reads to report only properly paired primary alignments and remove alignments to GRCm38 blacklisted regions. The default for the pipeline is to remove only duplicate reads (alignments that share common start and end points) from IgG controls. However, after further assessment of the sequence duplication rates in all samples, we decided to perform read deduplication on the SPIN1 replicate samples. Deduplication of SPIN1 samples was performed using Picard MarkDuplicates v.2.24.0 (http://broadinstitute.github.io/picard/) with the parameter –REMOVE_DUPLICATES. Individual replicates from each sample were then merged into a single BAM file using Samtools merge v.1.11 (ref. 47) for downstream analysis. Normalized bigWig files of read coverage were generated with deepTools bamCoverage v.3.50 (ref. 48), using the following parameters: -bs 1 –normalizeUsing CPM âexactScaling –ignoreForNormalization MT. Log2 enrichment profiles of CUT&Tag samples over IgG controls were generated with deepTools bamCompare using the following parameters: -bs 1 –normalizeUsing CPM –exactScaling –ignoreForNormalization MT –scaleFactorsMethod None.
Log2 enrichment profiles of CUT&Tag versus IgG control over various classes of repetitive elements (L1Md_A, L1Md_F, L1Md_Gf, L1Md_T, IAPEy-int and IAPEz-int) were plotted as heatmaps and average profiles, using computeMatrix from the deepTools48 package and the profilePlyr55 R package to include annotations of peak overlaps. Positions of repetitive elements were extracted from a table of mouse mm10 repeatMasker annotations downloaded from the UCSC table browser and filtered for elements greater than 5âkb in length. LINE1 elements (L1Md_A, L1Md_F, L1Md_T) were further separated into young LINE1 elements based on a divergence of 38 bases per kb or less from a consensus sequence4 or the presence of an intact functional promoter denoted by the presence of specific monomer annotations49. Monomers associated with inert promoters (subtypes 6 and 2) were removed from the analysis. The central regions of repetitive elements were length-normalized to 5âkb with flanking regions ±2âkb from the start and end positions. Heatmaps and profile plots show data in consecutive 10b bins with regions subdivided by elements and arranged in descending order of total enrichment across all samples.
Peak calling was done using MACS2 callpeak56 on individual replicates as well as all replicates together, with IgG samples set as a control. The parameter –keep-dup all was used to include duplicate reads, when present, in the peak calling model. To attain a set of high-confidence peaks, we selected peaks with a minimum coverage of 20 reads in the CUT&Tag sample and a peak score greater than the mean peak score. Peaks of co-localized H3K4me3 and H3K9me3 binding were attained by finding the intersection of both peak sets using the GenomicRanges R package57. Peak sets were overlapped with annotations to provide a breakdown of their intersection with specific genomic features, with each peak assigned to a single classification in the following hierarchy: LINEs, other repetitive elements, genes and intergenic. LINEs included all repeatMasker annotations included in the LINE class. Other repetitive elements included repeatMasker annotations in the classes LTR, Simple_repeat, Satellite, ERVK and Retrotransposon. Genes were defined as any coding or non-coding transcriptional unit plus 500 bases upstream, based on the ENSEMBL gene annotations GRCm38 v.79. Overlaps of peaks with genomic features was performed using the GenomicRanges R package57.
Downstream data analysis and plotting was predominantly performed using the R programming language (R Core Team, 2021, https://www.R-project.org/) and the Tidyverse libraries51. Genome snapshots and data tracks were prepared using pyGenomeTracks58.
Histology of mouse samples
Histology experiments on mouse samples were done as previously described4.
TUNEL assay
TUNEL assay experiments were done as previously described4.
RNA sequencing and analysis
RNA sequencing experiments and analysis were done as previously described4 with data for Spocd1â/â downloaded from GSE131377 (ref. 4).
Whole-genome methylation sequencing and analysis
Whole-genome methylation sequencing of DNA derived from Spocd1ÎSPIN1 and wild-type P14 spermatogonia was performed using the NEBNext Enzymatic Methyl-seq (EM-seq, New England Biolabs) as described4. Analysis of DNA methylation was done as described previously4. Data for Spocd1â/â and corresponding wild-type P14 spermatogonia were retrieved from E-MTAB-7997 (ref. 4).
Statistical information
Data were plotted in R (v.2022.07.01 and 554 running R v.4.0.3 (2020-10-10)) using the dplyr, ggplot2, tidyr, cowplot, reshape2, ggrepel, ggpubr, scales and RColorBrewer packages (versions dplyr_1.0.4, ggplot2_3.3.3, tidyr_1.1.2, cowplot_1.1.1, scales_1.1.1, reshape2_1.4.4, ggrepel_0.9.1, ggpubr_0.4.0, scales_1.1.1, RColorBrewer_1.1-2) or Microsoft Excel for Mac (v.16). Statistical testing was done with R (v.4.0.3 (2020-10-10)) using R Studio software or with Perseus59 (v.1.6.5.0) for the mass-spectrometry data and DEseq2 (ref. 60) for the RNA-seq data. We used the regioneR package55 in R to perform permutation tests to assess the statistical significance of overlaps of CUT&Tag peaks with LINE1 elements. Unpaired, two-tailed Studentâs t-tests were used to compare the differences between groups and adjusted for multiple testing using Bonferroni correction where indicated, except for RNA-seq data analysis, where Waldâs tests were used. Averaged data are presented as meanâ±âs.e.m., unless otherwise indicated. No statistical methods were used to predetermine the sample size. The experiments were not randomized and the investigators were not blinded to allocation during experiments and outcome assessment.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.