Skip to main content
Advertisement
  • Loading metrics

Evolutionary lineage-specific genomic imprinting at the ZNF791 locus

  • Jinsoo Ahn,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Department of Animal Sciences, The Ohio State University, Columbus, Ohio, United States of America

  • In-Sul Hwang,

    Roles Formal analysis, Investigation, Methodology, Resources, Validation

    Affiliations National Institute of Animal Science, Rural Development Administration, Jeonbuk 55365, Republic of Korea, Columbia Center for Translational Immunology, Columbia University Irving Medical Center, Columbia University, New York, New York, United States of America

  • Mi-Ryung Park,

    Roles Formal analysis, Investigation, Methodology, Validation

    Affiliation National Institute of Animal Science, Rural Development Administration, Jeonbuk 55365, Republic of Korea

  • Milca Rosa-Velazquez,

    Roles Formal analysis, Investigation

    Current address: Facultad de Medicina Veterinaria y Zootecnia, Universidad Veracruzana, Veracruz, Mexico

    Affiliation Department of Animal Sciences, The Ohio State University, Wooster, Ohio, United States of America

  • In-Cheol Cho,

    Roles Methodology, Resources

    Affiliation National Institute of Animal Science, Rural Development Administration, Jeonbuk 55365, Republic of Korea

  • Alejandro E. Relling,

    Roles Investigation, Resources

    Affiliation Department of Animal Sciences, The Ohio State University, Wooster, Ohio, United States of America

  • Seongsoo Hwang,

    Roles Conceptualization, Supervision, Validation

    Affiliation National Institute of Animal Science, Rural Development Administration, Jeonbuk 55365, Republic of Korea

  • Kichoon Lee

    Roles Conceptualization, Funding acquisition, Supervision, Validation, Writing – original draft, Writing – review & editing

    lee.2626@osu.edu

    Affiliation Department of Animal Sciences, The Ohio State University, Columbus, Ohio, United States of America

Abstract

Genomic imprinting is an epigenetic process that results in parent-of-origin effects on mammalian development and growth. Research on genomic imprinting in domesticated animals has lagged due to a primary focus on orthologs of mouse and human imprinted genes. This emphasis has limited the discovery of imprinted genes specific to livestock. To identify genomic imprinting in pigs, we generated parthenogenetic porcine embryos alongside biparental normal embryos, and then performed whole-genome bisulfite sequencing and RNA sequencing on these samples. In our analyses, we discovered a maternally methylated differentially methylated region within the orthologous ZNF791 locus in pigs. Additionally, we identified both a major imprinted isoform of the ZNF791-like gene and an unannotated antisense transcript that has not been previously annotated. Importantly, our comparative analyses of the orthologous ZNF791 gene in various eutherian mammals, including humans, non-human primates, rodents, artiodactyls, and dogs, revealed that this gene is subjected to genomic imprinting exclusively in domesticated animals, thereby highlighting lineage-specific imprinting. Furthermore, we explored the potential mechanisms behind the establishment of maternal DNA methylation imprints in porcine and bovine oocytes, supporting the notion that integration of transposable elements, active transcription, and histone modification may collectively contribute to the methylation of embedded intragenic CpG island promoters. Our findings convey fundamental insights into molecular and evolutionary aspects of livestock species-specific genomic imprinting and provide critical agricultural implications.

Author summary

Genomic imprinting, an epigenetic process, influences mammalian development and growth through gene expression dependent on parental origin. Research on imprinting in domesticated animals has primarily focused on imprinted genes previously identified in mice and humans. However, we discovered that, unlike in mice and humans, the porcine ZNF791 promoter is encompassed by a differentially methylated region with maternal methylation, and the ZNF791-like gene is expressed from only one allele in pigs, indicating novel imprinting of ZNF791. This imprinting appears to be conserved in the domesticated mammals we analyzed, including cattle, sheep, goats, horses, and dogs, but is absent in humans, non-human primates, and mice. Additionally, we provide mechanistic links between transcription and maternal methylation in porcine and bovine oocytes, highlighting the integration of transposable elements that leads to the methylation of intragenic CpG island promoters. Our findings suggest that genomic imprinting at the ZNF791 locus has selectively evolved in livestock species.

Introduction

Genomic imprinting is a primary epigenetic process that results in parent-of-origin-dependent gene expression in offspring. A subset of mammalian genes is affected by genomic imprinting, which fundamentally contributes to normal development and growth [1]. Epigenetic imprints, such as DNA methylation, are differentially established in the parental germ cells, i.e., oocytes and sperm. After fertilization, if these primary germline differentially methylated regions (gDMRs) resist genome-wide demethylation during pre-implantation development, they can serve as imprinting control regions (ICRs) [2]. Subsequently, ICRs dictate monoallelic expression depending on whether the allele is inherited from the mother or the father, leading to the formation of imprinted gene clusters in the majority of cases involving multiple coding and noncoding genes [3] or micro-imprinted domains in fewer cases in which surrounding genes escape imprinting [4].

The DMRs show spatially distinctive patterns. Sperm-derived gDMRs/ICRs are predominantly found in intergenic regions. In contrast, oocyte-derived gDMRs/ICRs, often located at CpG island (CGI) promoters, are associated with transcriptional repression that drives the silencing of the maternal allele and the expression of the paternal allele of corresponding genes [5]. Therefore, in this case of direct silencing, the paternal allele-specific expression is accompanied by a maternally methylated gDMR (hereafter, maternal gDMR) that originates from oocytes and is maintained throughout zygotic and post-implantation embryonic development. Since the discovery of genomic imprinting, molecular biology approaches have been applied to identify these gDMRs and the allelic expression of putative imprinted genes, utilizing single nucleotide polymorphisms (SNPs). It has been found that genomic imprinting occurs in therian mammals, including eutherians and marsupials, while there is no evidence of genomic imprinting in prototherians (monotremes), avians (birds), and lower vertebrates such as reptiles, amphibians, and fish [6].

Research on genomic imprinting in domesticated mammals has primarily targeted orthologous genes known to be imprinted in mice and humans [7]. This focus may restrict the identification of livestock-specific imprinted genes that do not have imprinted orthologs in mice and humans. Importantly, the generation of diploid parthenogenetic or gynogenetic embryos, which contain two maternal genomes in place of the paternal genome, leads to early lethality in mouse embryos, underscoring the necessity of both paternal and maternal genomes for normal embryonic development [8]. However, these types of uniparental embryos tend to survive longer in domesticated animals [9]. Based on this, our research focused on generating parthenogenetic porcine embryos and comparing them with biparental embryos to identify maternal imprints that have not been previously reported in mice and humans [10]. Utilizing these embryos, we aim to investigate a maternal DMR located at a CGI promoter, alongside assessing concurrent differential gene expression patterns.

Specifically, analyzing imprinting states at the transcript level will allow us to gain detailed insights into the imprinting status in a locus. Further comparative analyses in other mammalian species, including other domesticated mammals, humans, non-human primates, and rodents, will serve to highlight the evolutionary conservation or non-conservation of imprinting, as well as explore potential mechanisms of de novo DNA methylation. To facilitate these locus-wise comparative analyses at the transcript level, we focused on orthologues within one of the largest gene families, the zinc finger protein gene family. This group provides a substantial number of gene members for analysis, given that less than 1% of genes are generally estimated to be subject to imprinting [11], and it includes at least one known imprinted gene, PLAGL1 [12,13], in pigs, which serves as a positive control. Additionally, the imprinting statuses of porcine zinc finger protein genes have not yet been comprehensively studied, aside from their role in maintenance methylation [14].

In this study, we highlight the zinc finger protein 791-like (ZNF791-like) gene as an imprinted gene in pigs, which has not been extensively studied. Using whole-genome bisulfite sequencing (WGBS) and RNA sequencing (RNA-seq) on parthenogenetic and normal biparental porcine embryos, we identified a maternal DMR, an imprinted isoform, and a previously unannotated antisense transcript in pigs. Importantly, our comprehensive analyses of the orthologous genes across eutherian mammalian species revealed that the imprinting patterns were exclusive to domesticated mammals and absent in primates and rodents. Furthermore, we explored the potential mechanisms underlying the establishment of maternal imprints in oocytes, supporting the notion that a long terminal repeat (LTR) integration, upstream transcription initiation, and histone modification may act in concert to DNA methylation at embedded intragenic CpG island promoters. These findings describe lineage-specific ZNF791 imprinting, offering insights into molecular and evolutionary aspects of epigenetic regulation among eutherian mammals.

Results

DNA methylation profiling and detection of differentially methylated regions

Following the generation of parthenogenetically activated (PA) and control (CN) porcine embryos as detailed in the Materials and Methods section (S1 Fig), we conducted WGBS with a mean sequencing depth of 44.6× and a mean genome coverage of 96.67% (S1 Table). Subsequently, we analyzed DNA methylation levels in these pig embryos within genomic regions (between –10 kb from the transcription start site [TSS] and +10 kb from the transcription end site [TES]) and putative promoter regions (defined as ± 2 kb from the TSS) of all 2,900 porcine zinc finger transcripts from 642 genes (S2 Table). Our analysis revealed that mean DNA methylation levels in the putative promoter regions were generally low (Fig 1A), indicative of a condition that is conducive to robust detection of DMRs in the promoter regions of these embryos. Moreover, methylation levels were higher in gene bodies (from TSS to TES) compared to flanking regions across embryos, aligning with the gene-body methylation pattern in mammals (Fig 1A). Based on the methylation ratios between PA and CN embryos, we initially selected the top 100 transcripts, with a minimum methylation ratio being approximately 1.2, and identified human orthologs for potential comparative analyses. Of these, the top 88 porcine zinc finger transcripts with orthologous loci in humans were highlighted (S2 Table). Further sorting based on the PA1 methylation levels revealed that, while methylation levels can be identical if the putative 4 kb promoter regions overlap and share CpG sites, the promoter regions of 21 PLAGL1 transcripts and two ZNF791-like transcripts exhibited high methylation in PA embryos and moderate methylation in CN embryos (Fig 1B and S2 Table). During the initial methylation profiling, this pattern suggested a maternal methylation effect, where PA embryos with only maternal alleles tended to have higher methylation levels than the control embryos.

thumbnail
Fig 1. Overview of imprinted transcript discovery.

(A) Mean DNA methylation levels of genomic regions in porcine embryos, including transcript bodies (TSS–TES) and ± 10 kb flanking regions. The ± 2 kb region from TSS is highlighted in grey. Methylation levels of 2,900 porcine zinc finger transcripts are presented with heatmaps in descending order. (B) A heatmap of mean CpG methylation levels within ± 2 kb of TSSs of selected transcripts. Based on methylation ratios between PA and CN embryos, the top 88 porcine transcripts with human orthologs were identified and re-sorted by methylation levels in PA1. (C) Smoothed methylation profiles for PA (pink) and CN (blue) replicates. BSmooth-identified DMRs (pink-colored regions) for two genes are displayed, along with CpG sites, CpG coverages, transcripts, and CpG islands (CGIs). TSS positions are marked with red triangles on the first exons, with subsequent aligned transcripts sharing the same TSS not marked. Arrows indicate transcriptional directions. Mb, million base pairs (Mb). (D) From 2,900 porcine zinc finger transcripts, 23 mRNA transcripts from the PLAGL1 and ZNF791-like genes were identified as candidate DMR-associated transcripts.

https://doi.org/10.1371/journal.pgen.1011532.g001

We then employed a DMR detection software to identify differentially methylated regions between the PA and CN embryos within the putative promoter regions. Using the BSmooth method with default parameters, candidate DMRs were identified in the promoter regions (TSS ± 2 kb) with dense CpG sites, or CGIs, of two zinc finger protein genes, PLAGL1 and ZNF791-like (Fig 1C). The individual CpG coverages were generally well above the minimum threshold of three (Fig 1C), and the mean CpG coverages ranged from 10.53 to 11.41 for XM_005654368.3 (PLAGL1) and from 11.15 to 11.69 for XM_013990640.2 (ZNF791-like) across samples (S2A and S2B Fig). The mean CpG coverages for all transcript isoforms are provided in S2 Table. Among 2,900 porcine zinc finger transcripts, there are 2,670 mRNAs and 230 long-noncoding RNAs (lncRNA), and 23 mRNA transcripts of PLAGL1 and ZNF791-like were initially categorized as candidate transcripts that might be associated with DMRs in their putative promoter regions (Fig 1D). The known imprinted locus of PLAGL1 was utilized as a positive control for the maternally methylated DMR. This DMR was further validated using the metilene software (FDR < 0.05) at the promoter region of expressed transcripts, and our RNA-seq analysis showed that the corresponding paternal expression pattern was exclusively observed in the control embryos for a subset of transcripts with the IDs XM_005654368.3, XM_005654371.3, and XM_021084067.1 (S2C Fig). Another DMR identified upstream by metilene was more than 3kb apart from the TSS of XM_021084156.1, falling outside the TSS ± 2 kb range (S2C Fig). In summary, differentially methylated regions were identified within the putative promoter regions of zinc finger transcripts at the porcine ZNF791-like locus, as well as at the PLAGL1 locus, which served as a positive control.

Locus-specific paternal expression of ZNF791-like was attributed to a major transcript isoform

Following the current annotation of pig reference genome assembly (Sscrofa11.1/susScr11), the LOC100521431 (ZNF791-like) locus was explored to study the imprinting status. In the downstream (3′) of MAN2B1 gene, short transcripts (XM_031990640.2 and XM_021083735.1) of ZNF791-like appeared to be expressed from the CN embryos exclusively that are initiated from a CGI (Fig 2A and 2B). In addition to this NCBI genome annotation, we integrated the ENSSSCG00000033388 gene from the Ensembl genome annotation in the upstream (5′) of long transcripts of ZNF791-like. This Ensembl transcript is initiated from a CGI and appeared to be expressed from both PA and CN embryos. There also appeared to be an expression of previously unannotated antisense (AS) transcript overlapping the short transcripts of ZNF791-like (Fig 2A and 2B). Our profiling of DNA methylation status revealed a DMR spanning the CGI that regulates transcription of the short ZNF791-like transcripts (Fig 2C). This DMR resulted from full methylation in PA embryos and partial methylation in CN embryos, indicating maternal DNA methylation (i.e., maternal imprint) that could lead to silencing of the maternal allele of short ZNF791-like transcripts and their subsequent paternal expression. In contrast, the CGI regulating transcription of the ENSSSCG00000033388 gene was unmethylated in both PA and CN embryos (Fig 2C).

thumbnail
Fig 2. Regional view of the LOC100521431 (ZNF791-like) locus in pigs.

(A) RNA-seq read coverages of PA (pink, n = 3) and CN (blue, n = 6) embryos across a 15-Kb region (66.34−66.49 Mb), normalized to TPM values. (B) NCBI RefSeq annotation alongside an Ensembl-derived transcript (ENSSSCG00000033388) and an unannotated antisense (AS) transcript (purple). Protein-coding transcripts (brown) show translated regions as tall boxes and untranslated regions as short boxes. Arrows indicate transcriptional directions. CGI, CpG island; GC%, GC content. (C) WGBS DNA methylation levels (n = 2 for each of PA and CN embryos). Mean methylation ratios (PA and CN) are followed by their differences (PA-CN). A DMR (FDR < 0.05) is marked in red. (D) Close-up view of expressed transcripts with exon (E) numbers. Splicing donor (GT) and acceptor (AG) are denoted with arrows for orientation. The start codon (ATG) marks the beginning of a protein-coding open reading frame. (E) Quantification of expressed transcripts and genes. TPM values are represented as mean ± SEM. *, P < 0.05; ***, P < 0.001; †, a major predominant ZNF791-like transcript. (F) Zoom-in of the mean methylation levels at CpG resolution, with the DMR highlighted in red.

https://doi.org/10.1371/journal.pgen.1011532.g002

A zoomed visualization of the short ZNF791-like transcripts showed prominent exon 3 (E3) usage, instead of E3 skipping, as seen in the transcript XM_031990640.2 (Fig 2D). A combined close view revealed that exon 2 (E2) of the unannotated AS overlapped with exon 1 (E1) of the short ZNF791-like transcript (Fig 2D). The splicing of the unannotated AS was supported by the presence of splicing donor (GT) and accepter (AG) sites, allowing the inference of its transcriptional direction based on the orientation of the GT and AG sequences (Figs 2D and S3). Of note, expression of the unannotated AS tended to be higher in PA embryos than in CN embryos. Further quantification of transcript expression revealed that the mean expression level of the major short ZNF791-like transcript (XM_031990640.2) in the CN embryos was approximately 8-fold higher than that of another short transcript (XM_021083735.1): 5.36 ± 0.40 TPM for XM_031990640.2 and 0.66 ± 0.23 TPM for XM_021083735.1 (Fig 2E). Notably, expression of the unannotated AS was significantly 1.43-fold higher in the PA embryos than in the CN embryos (P < 0.05): 2.43 ± 0.22 TPM in CN and 3.49 ± 0.25 in PA embryos (Fig 2E), suggesting a potential maternally biased expression. This could further support the ZNF791-like imprinting and paternal expression, which might affect the expression of a nearby overlapping gene. The methylation levels at single CpG resolution highlighted the imprinted DMR, which encompasses the first exons and the CGI (Fig 2F). In short, the paternal expression of ZNF791-like was greatly due to one of the short transcripts (XM_031990640.2), that could be regulated by the DMR, and potentially led to greater expression of the overlapping unannotated AS transcript in the PA embryos than in the CN embryos.

DNA methylation imprint in the ZNF791 locus was species-specific

To examine conservation of ZNF791 imprinting in mammalian species, we analyzed publicly available WGBS data (listed in S3 Table) from gametes, tissues, and cells of 13 species. In humans, mice, and rats, the ZNF791 or Zfp791 promoter regions were unmethylated or significantly hypomethylated in both oocytes and sperm, indicating a lack of imprinted gametic methylation (Fig 3A). On the other hand, in pigs and cows, the promoter CGIs were fully methylated in oocytes and unmethylated in sperm, suggesting a conserved pattern of imprinted gametic methylation (Fig 3A). To further investigate maintenance of methylation patterns, various tissues and cells were analyzed. In humans, non-human primates (crab-eating macaque, chimpanzee, rhesus monkey, and gibbon), and rodents (mice and rats), all the CGI promoters or the promoter regions were unmethylated or significantly hypomethylated in fetal and adult tissues, including f-Br (fetal brain), Br (brain), f-Mu (fetal muscle), Mu (muscle), He (heart), Li (liver), Lu (lung), Ki (kidney), and Bl (blood) (Fig 3B).

thumbnail
Fig 3. DNA methylation status of the ZNF791 locus in mammalian cells and tissues.

(A) Methylation levels (0 = unmethylated, 1 = fully methylated) in oocytes (pink) and sperm (blue), with promoter regions of ZNF791 (or Zfp791) and ZNF791-like highlighted in grey, alongside CpG islands (green). (B) Methylation levels in tissues and cells (e.g., f-Br, fetal brain; see abbreviations in Results), with transcripts color-coded cyan for primates and rodents, and brown for other mammals. Partially methylated promoter regions are shaded in pink. (C) Analyses of partial methylation. WGBS reads were extracted and analyzed for hemi-methylation, applying a threshold of at least 30% of hypomethylated and hypermethylated reads defined in D. (D) Hemi-methylation status evaluated in two sequences with ≥ 10 reads and ≥ 3 consecutive CpG dinucleotides per read. The percentage of mCpGs was then calculated (see Materials and Methods). (E) Evaluation of hemi-methylation status across species. WGBS reads were extracted from indicated regions. Open and closed circles indicate unmethylated and methylated CpGs, respectively. (F) Full- and hemi-methylation states in PA and CN embryos, respectively. (G) A phylogenetic tree showing divergence. MYA, million years ago.

https://doi.org/10.1371/journal.pgen.1011532.g003

To the contrary, in domesticated mammals (pigs, cows, goat, sheep, horses, and dogs), all the CGI promoters were partially methylated in fetal and adult tissues and cells, including Fb (fibroblast), f-Li (fetal liver), y-Mu (young skeletal muscle, day 40), a-Mu (adult skeletal muscle, day 180), a-Sn (angen stage of skin), t-Sn (telogen stage of skin), e-Mu (embryonic muscle, embryonic day 110), a-Mu (adult skeletal muscle, 2-years-old), as well as some tissues that share the same abbreviation as those listed above (Fig 3B). Whether the partial methylation is originated from methylation difference between parental alleles (i.e., hemi-methylation pattern) could be analyzed based on WGBS reads. All reads overlapping the partially methylated regions tended to be either hypomethylated or hypermethylated, while the full methylated region in PA embryos showed hypermethylation only (Figs 3C and S4S10 and S4 Table). Our definition of hypo- or hypermethylation is the percentage of methylated CpG dinucleotides (mCpGs) less than 20% or more than 80% and is consistent with previous approaches [15] (Fig 3D). In at least two consecutive CpG dinucleotide sequences within reads overlapping the partially methylated regions, the percentage of mCpGs were either 0 ~ 20% (hypomethylation) or 80 ~ 100% (hypermethylation) in analyzed individual reads, indicating hemi-methylation status similar to that in CN embryos, whereas PA embryos showed full methylation (Fig 3E and 3F). Estimated divergence time between clade 1 (primates and rodents) and clade 2 (artiodactyls and carnivore) was circa 94 million years ago (MYA) (Fig 3G). Consequently, DNA methylation imprints in the ZNF791 or ZNF791-like locus appeared in one group of the analyzed mammalian species (artiodactyls and carnivore), but not in another group (primates and rodents), indicating a species-specific imprinting pattern.

Bi or mono-allelic expression of the ZNF791 gene across species based on individual-matched genomic DNA and RNA-seq reads

With the aim of investigating whether ZNF791 expression is non-imprinted or imprinted, its biallelic or monoallelic expression was to be examined across mammalian species. To achieve this, we first identified single nucleotide polymorphisms (SNPs) where two different alleles exist (i.e., heterozygous or informative SNPs), except for inbred mice. In humans, rhesus monkeys, chimpanzees, dogs, pigs, and cows, heterozygous SNPs were identified on genomic DNAs covering exons of ZNF791 (Fig 4 and S5 Table). Whether ZNF791 is expressed from both alleles or from a single allele was determined by analyzing the expression of alleles at the SNP sites in various normal tissues and cells. In RNA-seq data derived from the same individual whose genomic DNA was sequenced, both alleles were expressed in humans, rhesus monkeys, and chimpanzees in analyzed transcripts (Figs 4, S11 and S12, and S5 Table). On the other hand, in dogs, pigs, and cows, either a single alternative or reference allele was expressed in predominantly expressed isoforms in all analyzed tissues (Figs 4 and S13, and S5 Table). Additionally, the unannotated AS was expressed in the liver of a 60-day-old pig (S13C Fig). In stranded RNA-seq data from the liver of 180-day-old pigs, the transcriptional direction of the unannotated AS transcript was opposite to that of ZNF791-like (S13D and S13E Fig). In contrast, due to the presence of homozygous alleles at each SNP (i.e., homozygous SNPs) in inbred mice, a different approach was applied for the mouse analysis. It revealed that offspring from both initial and reciprocal crosses between CAST/EiJ and C57BL/6J strains consistently expressed both alleles in expressed transcripts (Figs 4 and S12D, and S5 Table). In summary, ZNF791 was found to be expressed biallelically in humans, non-human primates, and mice, indicating a non-imprinted expression in these species. Conversely, in dogs, pigs, and cows, ZNF791 or ZNF791-like exhibited monoallelic expression at the heterozygous SNP sites, indicative of an imprinted expression in these mammalian species.

thumbnail
Fig 4. Bi- or monoallelic expression of ZNF791 or ZNF791-like across mammalian species.

i) Whole genome/exome sequencing data was used to identify heterozygous SNPs for analyzing bi- or monoallelic mRNA expression. For inbred mouse strains, homozygous SNPs were chosen to analyze allelic expression in offspring from reciprocal crosses. ii) In the same individuals, mRNA expressions in normal tissues or cells were analyzed to determine allelic expression patterns. Reference SNP identifiers (rs IDs) are accompanied by reference/alternative alleles (‘ref/alt’). For SNPs without rs IDs, genomic coordinates and alleles are specified. Hu, humans; RM, rhesus monkey; CHM, chimpanzee; Mo, mouse with reciprocal crosses of CAST/EiJ (C) and C58BL/6J (B). Tissues: Lu, lung; Li, liver; Cx, cortex; Cm, cerebellum; Ad, adipose tissue; Sp, spleen; LCL, lymphoblastoid cell line; EB, embryonic brain; Sr, striatum; AN, arcuate nucleus; DN, dorsal raphe nucleus; Ht, hypothalamus; RV, right ventricle; Ki, kidney; LA, left atrium; BM, bone marrow; Ca, cartilage; Pa, pancreas; TG, thyroid gland; SI, small intestine; Hc, hippocampus; Md, midbrain; Th, thalamus; He, heart; Ar, adrenal cortex; Mu, skeletal muscle; Ma, mammary gland. Accession numbers and ID are listed in S5 Table. Alleles: HET, heterozygous; HOM REF, homozygous reference; HOM ALT, homozygous alternative.

https://doi.org/10.1371/journal.pgen.1011532.g004

LTR-mediated transcription might regulate ZNF791 imprinting in pigs and cattle

To investigate potential mechanisms behind the imprinting of the maternal allele of ZNF791 gene, we analyzed transcription in oocytes and histone modifications during early development. Because the CGI promoter of ZNF791 was maternally imprinted (Figs 2 and 3), there could be oocyte-specific events that contribute DNA methylation. In fully grown porcine oocytes, the ZNF791-like transcript (NCBI accession no. XM_021083732.1) was expressed, initiating from the TSS located near another putative CGI promoter and the LTR sequence, LTR52 (Fig 5A and 5B, grey shading, S14 and S15). This LTR52 was identified through a search in the Dfam database (E-value = 3.7 × 10−5), which uses profile hidden Markov models (HMMs) to improve the sensitivity of detection (S16 Fig). Additionally, H3K4me3, a hallmark of promoters, was found enriched in the CGI promoter region of this transcript XM_021083732.1 (Fig 5C, grey shading). From the site where H3K4me3 enrichment ended, both H3K36me2 and H3K36me3 tended to be enriched, supporting their role in de novo gene-body methylation in oocytes (Fig 5C and 5D) [16,17]. During zygotic development (4 cell, 8 cell, and blastocyst), H3K4me3 became enriched around the intragenic CGI at the blastocyst stage, indicating transcriptional permissiveness of the transcripts XM_013990640.2 and XM_021083735.1 (Fig 5B, †, and 5C, red shading).

thumbnail
Fig 5. Establishment of methylation imprints in porcine and bovine oocytes.

(A, B) RNA-seq read coverages (TPM) from porcine fully grown oocytes (FGOs) and the expressed ZNF791-like transcript. CGI and LTR52, near the TSS (chr2:66,440,912), are marked with green and red rectangles, respectively, and highlighted with grey shading. LTR52 is further indicated by a red arrow. Non-expressed transcripts are faded. (C) Enrichment of histone modifications, H3K4me3 and H3K36me2/me3, during porcine pre-implantation development. (D) WGBS methylation ratios: Full methylation in porcine oocytes (OO), unmethylation in sperm (SP), and partial methylation in somatic tissue (d120 muscle), shaded red. (E, F) RNA-seq coverages from bovine FGOs and expressed ZNF791 transcripts. CGI and LTR52 near the TSS (chr7:12,934,209) are marked and shaded as in B. Faded are non-expressed transcripts. (G) Histone modifications, H3K4me3 and H3K36me2/me3, during bovine pre-implantation development. (H) WGBS methylation ratios of bovine samples: Full methylation (OO), unmethylation (SP), and partial methylation (somatic tissue; muscle), shaded red. Stages: MII (MII oocytes), 4-cell, 8-cell, Blast, (blastocysts). From the red-shaded region, the major predominant ZNF791 transcripts, marked with † in (B) and (F), are preferentially transcribed at later developmental stages.

https://doi.org/10.1371/journal.pgen.1011532.g005

In bovine oocytes, the ZNF791 transcripts (XM_015471970.2, XM_015471971.2, and XM_024994880.1) were expressed, initiating from TSSs located near a putative CGI promoter and an LTR52 element (E-value = 1.2 × 10−11) (Fig 5E and 5F, grey shading, and S16−S17). Enrichment of H3K4me3 further supported the active transcription of those transcripts (Fig 5G, grey shading). From the site where H3K4me3 terminated, H3K36me2/me3 enrichment began in oocytes, indicating their role in de novo gene-body DNA methylation (Fig 5G and 5H). The demarcation of H3K36me2/me3 (i.e., the formation of a boundary) was somewhat more pronounced in bovine oocytes compared to porcine oocytes. As zygotes developed, H3K4me3 became enriched at the intragenic CGI during the blastocyst stage (Fig 5G, red shading).

Overall, the patterns of transcription initiation, histone modification, and DNA methylation were similar between porcine and bovine oocytes and developing zygotes. Taken together, these data suggest a mechanism whereby transcription was initiated near the LTR element during oogenesis, leading to the recruitment of DNA methyltransferase, subsequent de novo gene-body DNA methylation at the intragenic CGI, and the establishment of the DMR.

The ZNF791 locus in humans and mice lacks genomic imprinting

The imprinting mechanism described above was previously identified by Bogutz et al. (2019) [18] for the Impact gene (S18 Fig) and the Slc38a4 gene, both of which are imprinted in mice. The imprinted expression of Impact has also been reported in rats [19], but not in humans and pigs, indicating species-specific imprinting patterns. In contrast, the ZNF791 (or Zfp791) gene exhibited no maternal imprints in both humans and mice (Fig 3) and showed non-monoallelic expression in both species (Fig 4), indicating a non-imprinted status. To confirm the absence of genomic imprinting at this locus, we further explored the ZNF791 (or Zfp791) region. In human oocytes, the only annotated transcript of ZNF791 was expressed, to a lesser degree than the ZNF490 transcript, which is transcribed in the opposite direction (Fig 6A and 6B). Also, H3K4me3 was consistently enriched in the CGI promoter region during both gametic and zygotic stages (Fig 6C, grey shading). The absence of an additional CGI promoter in the downstream region suggests the lack of canonical maternal imprints in human oocytes (Fig 6D).

thumbnail
Fig 6. Non-establishment of methylation imprints in human and mouse oocytes.

(A, B) RNA-seq read coverages (in TPM) from human oocytes at FGO, MI, and MII stages and the expressed ZNF791 transcript. (C) H3K4me3 histone modifications during developmental stages of oocytes and zygotes. ICM, inner cell mass of the blastocyst. (D) WGBS methylation ratios of human samples. Unmethylation in oocytes (OO), sperm (SP), and somatic tissue (adult adipose tissue) are highlighted with grey shading. (E, F) RNA-seq read coverages from mouse growing oocytes (GOs) and the expressed Zfp791 transcript. Non-expressed transcripts are presented as faded. (G) Histone modifications, H3K4me3 and H3K36me3, in mouse oocytes. (H) WGBS methylation ratios of mouse samples. Unmethylation in OO, SP, and somatic tissue (kidney) are highlighted with grey shading. All mouse samples are from the C57BL/6 strain.

https://doi.org/10.1371/journal.pgen.1011532.g006

In mouse oocytes, only one Zfp791 transcript was expressed, with no other transcripts annotated to be transcribed from the downstream region (Fig 6E and 6F). The promoter region was enriched with H3K4me3 histone modification (Fig 6G, grey shading). Enrichment of H3K36me3 occurred in the gene body of Zfp791 (Fig 6G). The absence of a downstream CGI promoter region and additional annotated transcripts suggests a deficiency of canonical maternal imprints in mouse oocytes (Fig 6G), in contrast to the mouse Impact gene, which contains the canonical maternal imprint and exhibits partial methylation in somatic tissue (S18 Fig).

Unlike in pigs and cattle, maternal methylation imprints were not established in human and mouse oocytes, attributable to the absence of additional CGI promoters and downstream ZNF791 transcript isoforms. This suggests that divergence between these domesticated animals and humans and mice has led to lineage-specific imprinting at the ZNF791 locus.

ZNF791 imprinting implies selective forces of LTRs and evolution of lineage-specific imprinting

To further investigate orthologous imprinted ZNF791 loci, we first compared the loci and LTRs in artiodactyls and dogs. Focusing on the ZNF791 sequences immediately downstream (3′) of the MAN2B1 gene in each species, we found that LTR52-derived sequences are present near the second CGI in domesticated Artiodactyla species (pigs, cows, sheep, horses, and goat), while an LTR103-derived sequence is present near the second CGI in dogs (order Carnivora) (Fig 7A, grey shaded rectangles). E-values, or expectation values, for LTRs derived from the Dfam database were as follows: pig LTR52 (E-value = 3.7 × 10−5), cattle LTR52 (E-value = 1.2 × 10−11), sheep LTR52 (E-value = 6.8 × 10−6), horse LTR52 (E-value = 2.4 × 10−6), goat LTR52-int (E-value = 7.0 × 10−5), and dog LTR103_Mam (E-value = 0.051). These values indicate significant sequence similarity with the consensus sequences of LTR52 or LTR52-int (E-value < 0.05) and suggest a potential similarity with the consensus LTR103_Mam sequence in dogs. As imprinted ZNF791 transcripts are expected to be expressed from the third CGI downstream (3′) of LTRs (Fig 7A, red shaded rectangles) in pigs and cows (Figs 2 and 5, and S13B and S13C), in dogs (S13A Fig), and in sheep, horses, and goats (S19A and S19B Fig), it suggests that the conservation of ZNF791 imprinting in the analyzed species is also subject to an analogous pattern. The phylogenetic analysis of domesticated animals showed that the genetic distances between LTR52 sequences and the consensus LTR52 sequence are smaller than the distances between goat- and dog-derived sequences and the same consensus sequences (Figs 7B and S20). From the motif discovery/comparison and scanning, distribution of transcription factor binding sites (TFBSs) in LTR52 and its derivatives were identified (Fig 7C). From the motif discovery and comparison, DNA binding motifs of E2F3 and E2F2 (E2F2_DBD_1 and E2F3_DBD_1) were found to be significant in LTR52 sequences from pigs, cows, sheep, and horses (E-value < 0.05, q-value < 0.05) (Figs 7C, 7D and S21). Also, from motif scanning, binding sites for the MYBL2 transcription factor were found in the LTR52 sequences from pigs, cows, sheep, horses, and goat_LTR52-int (q-value < 0.05) (Fig 7C and 7D). Binding sites for the ZNF770 transcription factor were found in the LTR52 sequences from pigs, cows, sheep, horses, and dog_LTR103_Mam (q-value < 0.05) (Fig 7C and 7D).

thumbnail
Fig 7. LTR52 sequence comparison across the ZNF791 genomic regions in domesticated animals.

(A) LTR locations near CGIs are marked by red rectangles in grey shaded areas. Red shading indicates the 1st exon of candidate imprinted transcripts and CGI promoter regions. (B) Phylogenetic analyses of genetic distances between LTR sequences. The scale bar represents evolutionary distance, with a value of 0.1 indicating a substitution per nucleotide, as estimated by the Kimura 2-parameter (K2P) model. (C) Transcription factor binding motifs detected in LTR sequences. (D) Logos of the significant motifs. (E) Phylogenetic time tree with artiodactyls and carnivores highlighted in brown, and primates and rodents in cyan. Species silhouettes are from phylopic.org. Potential evolutionary events are marked with letters and arrows. MYA, million years ago; #, inclusion of the MAN2B1 gene; *, lineage-specific imprinting (Impact).

https://doi.org/10.1371/journal.pgen.1011532.g007

The phylogenetic time tree suggests that the insertion of the ZNF791 gene likely occurred around 94 MYA (Fig 7E, β), although the immediate downstream gene, MAN2B1, predates the emergence of genomic imprinting (Fig 7E, #). In Gallus gallus (chicken, avian), the whole locus containing both the MAN2B1 gene and the ZNF791 gene is missing from the NCBI RefSeq database. In Ornithorhynchus anatinus (platypus, monotreme), Monodelphis domestica (gray short-tailed opossum, marsupial), Dasypus novemcinctus (nine-banded armadillo, edentate), and Loxodonta africana (African savanna elephant, proboscidean), the MAN2B1 gene exists but the downstream ZNF791 gene is absent. In addition to the proposed emergence of imprinting at least 160 MYA following the divergence between monotremes and therians (marsupial and eutherian mammals) (Fig 7E, α), lineage-specific imprinting of ZNF791 might have occurred at least approximately 78 MYA in the common ancestors of artiodactyls and dogs (Fig 7E, γ). This lineage-specific imprinting might have involved the acquisition of upstream LTRs, overlapping transcripts, and the maternal gDMR locus, as illustrated by the ZNF791 locus in this study. Similarly, lineage-specific imprinting of Impact may have occurred relatively recently in the common ancestors of rodents, involving the acquisition of maternal gDMR (Fig 7E, *, and S18) [18,19]. The ZNF791 imprinting also leads to imprinted maternal expression of the overlapping unannotated AS in pigs (Fig 2D), and based on the pig QTLdb, a SNP located 327 bp upstream of the 1st exon of the unannotated AS is associated with residual feed intake (RFI) at a suggestive level (P = 2.89E-05) (S22 Fig), Taken together, in analyzed domesticated mammals (pigs, cows, sheep, horses, goat, and dogs), genomic imprinting at the ZNF791 or ZNF791-like locus might have been selectively evolved with LTR sequence integration.

Discussion

Herein, we conducted a comparative investigation of the imprinting statuses of ZNF791 across 13 mammalian genomes, focusing on previously unreported DMRs, unannotated transcripts, and several key factors: CpG island promoter methylation, allele-specific expression, transcription in oocytes, integration of transposable elements, and lineage-specific imprinting. To identify candidate maternal imprints, we initially compared DNA methylation levels between bimaternal parthenogenetic embryos and biparental control embryos from pigs, utilizing two biological replicates from WGBS with sequencing depths ranging from 43.8× to 45.2× (S1 Table) that met the ENCODE standards, which require a minimum of two biological replicates with at least 30× coverage for reproducibility (see Standards and Guidelines for Whole Genome Shotgun Bisulfite Sequencing (WGBS) under the ‘standards document’ section at ENCODE project; https://www.encodeproject.org/documents/). This comparison facilitated the robust detection of maternal DMRs at CGI promoters which implicated transcriptional repression of the maternal allele [5]. Subsequently, to determine whether the maternal DMR detected in the ZNF791-like locus constitutes a germline DMR, we analyzed WGBS methylation levels from oocytes and sperm. Interestingly, the CGI promoters exhibited full methylation in oocytes from pigs and cattle, yet remained unmethylated in oocytes from humans, mice, and rats, indicating species-specific maternal gDMR imprints. The maintenance of allele-specific DNA methylation in somatic tissues of outbred animals were analyzed by examining methylation levels of consecutive CpG dinucleotides in individual WGBS reads (a read-based method), independent of SNPs [15]. It revealed that hemi-methylation (i.e., reflection of full methylation in one of the parental alleles and unmethylation in the other) at the CGI promoters is observed only in artiodactyls and dogs. It suggests conservation of this maternal gDMR in domestic animals and led us to examine allele-specific gene expression to identify corresponding imprinted genes.

Genomic imprinting can affect all transcript isoforms in an isoform-independent manner, however when there are multiple promoters and they are imprinted in different ways, a mixed pattern of imprinting can occur (isoform-dependent imprinting) [20]. In this study, based on our methylation profiling which revealed maternal DMRs and suggested corresponding paternal gene expression, we included more control samples for RNA-seq to clearly show the paternal expression pattern, if present, in the CN group, using a different set of whole embryos having unique identifying numbers compared to those used for WGBS. Among previously annotated transcripts in the NCBI RefSeq database, we identified the major ZNF791 transcript isoforms that were predominantly expressed in pig CN embryos, as well as in adult dogs, pigs, cattle, sheep, horses, and goats, indicating that ZNF791 undergoes isoform-independent imprinting. Accordingly, we revealed monoallelic expression of ZNF791 in these domestic animals, contrasting with the biallelic expression observed in humans, non-human primates, and mice. Additionally, the presence of long non-coding RNAs (lncRNAs) within an imprinted locus can further complicate the imprinting landscape, especially when these lncRNAs overlap with protein-coding mRNA transcripts and run in an antisense orientation. This overlap can lead to allele-specific silencing of mRNAs and ncRNAs through two possible mechanisms: i) transcriptional interference of a promoter or an enhancer and ii) RNA-interference (RNAi) [3]. The former mechanism, transcriptional interference, was demonstrated by the paternal allele-specific silencing of Igf2r, which requires the overlap of Airn lncRNA transcription over the Igf2r promoter, thereby inhibiting RNA polymerase II recruitment [21]. Alternatively, transcriptional interference can occur over enhancer domains when so-called Enhancer Occlusion Transcripts (EOTrs) cover these regions [22]. In the current study, we identified a previously unannotated antisense transcript (unannotated AS). The maternally biased expression of this unannotated AS might occur through transcriptional interference of its intronic enhancer domains, which are occupied by ZNF791-like transcripts acting as EOTrs, or through the latter mechanism of RNAi, mediated by the paternally expressed ZNF791 transcripts. These processes occur without overlapping the promoter region of unannotated AS, indicating potential transcriptional interference at an enhancer or RNAi-mediated silencing of the unannotated AS. Of note, this novel unannotated AS transcript was not observed in any other analyzed domesticated species (Figs 5B, S13A, S13B and S19) except in pigs, specifically in pig embryos, oocytes, and liver tissues (Figs 2, 5A, S13C and S13E). Whether this unannotated transcript arose during the speciation of Sus scrofa remains an open question. Further in-depth transcript profiling across various tissues from diverse domesticated animals is needed to determine its evolutionary origin and status of imprinting.

During oogenesis, RNA transcription, initiating from an upstream transcription start site and extending across intragenic CpG island promoters, is associated with the establishment of DNA methylation imprints at these CGI promoters, which are situated within actively transcribed regions [18,2325] (S18 Fig). Accordingly, in our analyses on pigs and cattle, we discovered that the transcription of long ZNF791 mRNA in oocytes overlaps with intragenic CGI-spanning gDMRs, indicating acquisition of the upstream transcript over the existing CGI that resulted in establishing maternal imprints. Additionally, we identified upstream alternative promoters that are active in porcine and bovine oocytes, supporting the transcription of the long overlapping ZNF791 transcript. These upstream promoters overlapped with transposable elements (TEs) as observed in the mouse Impact locus [18]. It has been suggested that the mechanisms underlying genomic imprinting in therian genomes may have evolved from an epigenetic defense system against integrated TEs [26,27]. Furthermore, transcripts initiated from species-specific LTRs have been implicated in the establishment of DNA methylation in oocytes [18,24]. The LTR52, identified in the ZNF791 locus, may have undergone mutations during evolution but might act as artiodactyl-specific alternative promoters that induce transcription and CGI methylation. All ZNF791-like and ZNF791 genes in analyzed livestock are transcribed in the reverse direction relative to their respective genomes. A Dfam database search showed that, except in goats, LTRs are located on the + strand, suggesting that they may act as antisense promoters (S16 Fig). This is similar to the human SCIN locus, where the gene is transcribed in the forward direction and an upstream LTR is located on the–strand [18], as well as other loci with opposite transcriptional orientations, as listed in S1 Table of reference [18]. Moreover, in mouse oocytes, transcribed regions are enriched with H3K36me3, which mediates de novo DNA methylation, thereby leading to the establishment of maternal imprints [16]. Conversely, H3K36me2 is broadly deposited in intergenic regions, mediating moderate levels of DNA methylation [17,28]. However, in porcine and bovine oocytes, both H3K36me2 and H3K36me3 modifications have been observed to be enriched within transcribed regions and showed a high correlation with DNA methylation [29]. Although these modifications were somewhat more widespread in porcine oocytes than in bovine, their distribution appeared to overlap supporting their conserved role in methylation processes. During post-fertilization, the ZNF791 DMRs/ICRs might be selectively preserved by being protected against the genome-wide demethylation occurring before embryo implantation, potentially through the involvement of ZFP57 and/or ZNF445 as key factors [14,30,31]. Among these, ZNF445 is highly conserved across domesticated animals (S23A and S23B Fig) and is expressed starting from the stage of porcine and bovine oocytes [14], whereas ZFP57 is less conserved (S23C and S23D Fig), absent in porcine and bovine oocytes, and gradually increases in subsequent embryonic stages [14]. Notably, the imprinted ZNF791 DMRs of pigs and cattle, particularly in CGIs, contain the TGCCGC motif, which is recognized as the consensus binding motif for ZFP57 [30] and potentially for ZNF445 [31] (S23E and S23F Fig; S6 Table). This motif is also present in CGIs within the imprinted ZNF791 DMRs of other domestic species (S23E and S23F Fig; S6 Table). On the other hand, these ZNF791 CGIs do not contain the two previously predicted ZNF445 motifs (ZNF445 TO and ZNF445 TR) [32] (S23E Fig and S6 Table).

Over the past three decades, more than 200 imprinted genes have been identified in both mice and humans, with 63 of these genes being common to both species and the majority being species-specific, underscoring both evolutionary conservation and divergence [11]. In domestic pigs, approximately 40 genes have been identified as imprinted, with about 30 of these being common when compared to mice and humans and about a dozen being unique to pigs, according to a previous publication [7] and geneimprint.com (accessed Feb 2024). This indicates that the current catalog of porcine imprinted genes may be insufficient to fully understand the complexity and variability of the pig epigenome. Recently, the number of imprinted genes in marsupial mammals was estimated to be around 60 [33] suggesting divergence of the imprinting processes across Theria. This divergence can be attributed to molecular factors, as the maternal gDMRs for ZNF791 and Impact appear to have been established at irregular time points during mammalian evolution. Moreover, de novo DNA methylation-independent non-canonical imprinting discovered in rodents, particularly in the placenta, provides further targets for investigating the conservation and divergence of extraembryonic imprinting [3438]. In domestic animals, parent-of-origin effects on body size, body weight, and feed efficiency, which are transmitted either paternally or maternally, have been reported in cross-bred species [7]. The potential impact of genomic imprinting on these agronomically important traits can also be evaluated on a large scale using experimental livestock populations [39]. This approach is grounded in the parental conflict theory, which suggests that paternally expressed genes (PEGs) tend to promote growth, while maternally expressed genes (MEGs) favor resource preservation and reduced growth [40]. As mentioned, a SNP is located at a putative promoter of the unannotated AS, which may regulate expression of unannotated AS and consequently affect the overlapping ZNF791-like expression through transcriptional interference or RNAi-mediated silencing mechanisms. Because this SNP was associated with RFI, according to the pig QTLdb based on a recent GWAS [41], and imprinted genes are dosage-sensitive [42], further research on RFI and genomic imprinting in this locus is needed to investigate the functional relevance of the unannotated AS gene and the role of this imprinted gene cluster. Overall, our findings convey molecular and evolutionary insights into species-specific genomic imprinting in the ZFN791 locus and may benefit future investigations on genetic selection of livestock.

Methods

Ethics statement

All animal procedures were conducted All animal procedures were conducted in accordance with guidelines approved by the Institutional Animal Care and Use Committee (IACUC) of the National Institute of Animal Science, Rural Development Administration (RDA) of Korea (NIAS2015-670).

Sample preparation

The method of parthenogenesis has been described in our previous report [43]. Briefly, ovaries of Landrace × Yorkshire × Duroc (LYD) pigs were obtained from a slaughterhouse. Cumulus-oocyte complexes (COCs) were collected and washed in Tyrode’s lactate-HEPES-polyvinyl alcohol. Oocytes with several layers of cumulus cells were selected and washed three times in TCM-199 based medium (GIBCO, Grand Island, NY, USA). For in vitro maturation (IVM), 50 COCs were transferred into 500 μL of maturation medium in four-well dishes and matured for 40 h at 38.5°C in an incubator containing 5% CO2.

The cumulus cells were removed, and oocytes having the first polar body were selected and placed in a fusion chamber with 250 μm diameter wire electrodes (BLS, Budapest, Hungary) covered with 0.3 mol/L mannitol solution containing 0.1 mM MgSO4, 1.0 mM CaCl2, and 0.5 mM HEPES. To achieve fusion, two DC pulses (1 sec interval) of 1.2 kV/cm were applied for 30 μs using an LF101 Electro Cell Fusion Generator (Nepa Gene Co., Ltd., Chiba, Japan). After a 2-h stabilization period, the parthenogenetically activated (PA) embryos were transferred into the oviducts of two LYD surrogate gilts, each aged 12 months on the estrus phase, for further development.

To generate fertilized control (CN) embryos, two LYD gilts were naturally mated with boars twice with a 6-h interval during their natural heat period at the onset of estrus. Ten embryos from each of the parthenogenetically activated and control groups were recovered from two euthanized gilts 21 days after the onset of estrus, with half collected from each dam. This timing prevented the abnormal morphological changes in parthenogenetic embryos that typically appear around day 30, while the sizes of parthenogenetic embryos were smaller than that of controls prior to day 30 [44] (S1 Fig). Additionally, our previous histological assays showed that PA embryos developed with normal morphologies by day 26, forming major organs and tissues comparable to control embryos, although PA embryos were smaller than the control [45]. Embryos with intact morphology from both the PA and CN groups were used for sequencing. The recovery process involved sectioning the reproductive tracts, isolating the placenta from the uterus, and separating embryos from the surrounding placenta, and subsequently the samples were stored in liquid nitrogen until further use.

Whole-genome bisulfite sequencing and downstream analyses

Genomic DNA was isolated from the whole embryos (two biological replicates for each of the PA and CN groups) and fragmented. The fragmented DNA (200 ng) underwent bisulfite conversion using the EZ DNA Methylation-Gold Kit (Zymo Research, Irvine, CA, USA). To construct the DNA library, 1 ng of DNA was processed with the Accel-NGS Methyl-Seq DNA Library Kit (Swift Biosciences, Inc. Ann Arbor, MI, USA). PCR was conducted with adapter primers and Diastar EF-Taq DNA polymerase (Solgent, Daejeon, Korea) with thermal cycling conditions of a 3 m denaturation at 95°C, followed by 10 cycles of 30 s at 95°C, 30 s at 60°C, and 30 s at 72°C, and a final 5 m extension at 72°C. Following bead-based purification, the DNA library was sequenced to produce paired-end reads of 151 nucleotides using the HiSeqX system operated by Macrogen Inc. (Seoul, Korea).

Read quality was evaluated using FastQC (v0.12.1). The raw WGBS reads were trimmed and filtered, using the default parameters of Trim Galore (v0.12.1) to remove adapters and short reads less than 20bp, except for additional trimming to eliminate up to 18 bp of low complexity sequence tags, introduced during the library preparation, from the 3′ end of R1 (–three_prime_clip_R1 18) and from the 5′ end of R2 (–clip_R2 18). The resulting cleaned reads (approximately 420 million pairs for PA1, 425 million pairs for PA2, 431 million pairs for CN1, and 433 million pairs for CN2), were mapped to the pig reference genome (Sscrofa11.1/susScr11) using the Bismark aligner (v0.22.3) with default parameters [46]. Mapping efficiency ranged from 79.6% to 81.3% (S1 Table). Deduplication was performed with deduplicate_bismark, removing PCR-based duplications that comprised between 13.00% and 15.21% of the mapped pairs. Methylation percentages for cytosines in the CpG context were determined using bismark_methylation_extractor. CpG sites covered by at least three reads and with a median of mean CpG coverages greater than 10 for all analyzed transcripts in each sample (see S2 Table, footnote) were further analyzed.

Genomic coordinates of all 2,900 porcine zinc finger transcripts from autosomes (chr1 ~ chr18) were extracted from the GFF file for the pig reference genome, excluding unplaced scaffolds (chrUn), sex chromosomes (chrX and chrY), and the mitochondrial chromosome. Profile plots and heatmaps for the transcript bodies (TSS ~ TES) and ± 10 kb regions (–10 kb from TSS and +10 kb from TES) were generated using computeMatrix, plotProfile, and plotHeatmap from deepTools (v3.5.5) [47]. A heatmap for mean DNA methylation levels of all CpG sites within the putative promoter regions (±2 kb from TSS) of selected transcripts was generated using heatmap.2 of the gplots package (v3.1.3.1). For this transcript selection, all 2,900 zinc finger transcripts were initially sorted based on their methylation ratios, defined as the mean methylation level in PA embryos divided by the mean methylation level in CN embryos. From this sorted list, the top 88 transcripts having orthologous loci in humans were identified and re-sorted based on their mean methylation levels in the first PA embryo.

Using the BSmooth method from bsseq (v1.38.0) with default parameters [48], we initially obtained methylation profile curves and candidate DMRs within the putative promoter regions. To avoid possible false positives, we further applied additional procedures using the metilene software (v0.2–8) [49]. For the metilene analysis, we employed default parameters, which include a maximum distance between CpG dinucleotides (-M) of 300, a minimum number of CpGs (-m) of 10, and a mode of operation (-f) of de novo. Moreover, we specified a more stringent minimum mean methylation difference (-d) of 0.2 and applied multiple testing correction (-c) with a false discovery rate (FDR) adjustment. We designated regions with an FDR < 0.05 as DMRs. DNA methylation ratios were plotted as histograms using the R package Gviz (v1.44.0) [50].

Analyses of hemi-methylation

For the analysis of WGBS reads overlapping with partially methylated regions, we utilized MethylDackel v0.6.1 [51] to extract the names and locations of individual reads, count the CpG sites on each read, and calculate the methylation percentage (S4 Table). We retained only those WGBS reads containing at least three CpGs sites, referred to as ‘qualified reads’, and partially methylated regions supported by more than 30 such qualified reads for further analyses, based on a previously described SNP-free method for outbred animals [15]. From the percentage of methylated CpGs (mCpGs) for each qualified read, hypomethylated reads (0–20%) and hypermethylated reads (80–100%) were determined. After calculating the percentages of these hypo- and hypermethylated reads among all reads in the partially methylated regions, we plotted the hemi-methylation tendency of these regions, defined as regions with more than or equal to 30% of hypomethylated reads and more than or equal to 30% of hypermethylated reads (S4 Table). For consecutive CpG sites within the partially methylated regions, CpG methylation pattern plots, referenced from Figure S2A in Xie et al. (2012) [52], were drawn using a web-based tool, QUMA (quantification tool for methylation analysis) (S4C–S4E Fig) [53].

RNA sequencing and data processing

Total RNA was isolated from the whole embryos (three biological replicates for the PA group and six biological replicates for the CN group) using TRIzol reagent (Sigma-Aldrich, USA). The RNA samples were treated with DNase I to avoid genomic DNA contamination and electrophoresed in 1.2% agarose gels to evaluate RNA integrity, which was confirmed by a 28S/18S rRNA ratio greater than 2 and an RNA integrity number (RIN) greater than 7, using an Agilent 2100 BioAnalyzer. The concentration of RNA was assessed using the ratios of A260/A280 and A260/A230 (1.8–2.0). To select poly-A tails and construct cDNA libraries, 1 μg of total RNA was used with the TruSeq RNA Sample Prep Kit v.2 (Illumina, San Diego, CA, USA). The cDNA libraries were quantified by quantitative Real-Time PCR (qPCR). The Illumina HiSeq2500 RNA-seq platform was used to sequence the library products to produce 101 nucleotide paired-end reads.

Read quality was assessed using FastQC (v0.12.1). To remove adapters and low-quality reads from the raw RNA-seq data, we utilized Trimmomatic (v0.39), applying parameters LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:36 [54]. This process yielded approximately 27 to 43 million read pairs for the PA and CN embryo samples. Using the STAR aligner (v2.7.9a), the cleaned reads were aligned to the reference genome (Sscrofa11.1/susScr11) with default parameters [55], resulting in 96.0% to 96.5% of the reads being uniquely mapped (S1 Table). These reads were retrieved with SAMtools [56] and 20.0% to 25.0% of them were subsequently deduplicated using Picard MarkDuplicates for further analyses. BAM files were normalized to generate BigWig files using bamCoverage from deepTools (v3.5.5) [47], with options—binSize 10 and—smoothLength 15. These files were then visualized using Gviz (v1.44.0) [50]. Transcript quantification was performed using Salmon (v1.3.0) in mapping-based mode [57], calculating transcript per million (TPM) values.

Data mining and processing

Our generated data, along with publicly available data downloaded from the NCBI Gene Expression Omnibus (GEO) and other resources, are listed in S3 Table. WGBS and RNA-seq data were processed using the procedures described earlier in this report, while ChIP-seq, whole genome sequencing (WGS), and whole exome sequencing (WES) data were processed as previously outlined in our report [58]. Reference genomes used in this study include GRCh38.p14/hg38 (human), macFas5 (crab-eating macaque), panTro6 (chimpanzee), rheMac8 (rhesus monkey), Asia_NLE_v1 (gibbon), GRCm39/mm39 (mouse), rat (rn7), equCab3 (horse), canFam3 (dog), susScr11 (pig), bosTau9 (cow), oriAri4 (sheep) and CHIR_1.0 (goat). The VCF files used for SNP analyses are listed in S3 Table. For VCF files based on a previous genome, the Picard LiftoverVcf (v2.23.8) was used to convert them to match the reference genome mentioned above (S3 Table).

LTR search

When there are multiple ZNF791 orthologs, we focused on the ZNF791 gene immediately downstream (3′) of the MAN2B1 gene across species. We extracted sequences of the promoter regions of the ZNF791 gene and conducted searches for transposable elements (TEs) in the Dfam database [59] (release 3.8; accessed on December 28, 2023), which utilizes profile hidden Markov models (HMMs), as detailed in S16 Fig. For each species analyzed, the search was parameterized with the—species option, while employing the default gathering (GA) threshold via—cut_ga, the Dfam curated threshold, to ensure specificity to repetitive element families. For dogs, however, which did not yield significant matches under the GA threshold, we adapted our approach by utilizing an E-value threshold (-E 0.1) to identify potential LTRs.

Phylogenetic analyses

For phylogenetic analyses, LTR-derived sequences from domesticated animal species and the consensus LTR52 sequence (DF000000543.4) from the Dfam database [59] (accessed December 28, 2023) were aligned using MUSCLE for multiple sequence alignment, as implemented in the MEGA11 software [60]. Using MEGA11, the genetic distances were calculated by the Kimura 2-parameter substitution model with uniform rates among sites [61] and maximum-likelihood phylogenetic trees (500 bootstrap replications) were constructed. For another construction of phylogenetic trees with divergence times estimated by TimeTree 5, resulting Newick files were downloaded from http://www.timetree.org (accessed Feb 5, 2024) [62]. The phylogenetic trees were edited using FigTree (v.1.4.4) [63].

Motif analyses

De novo motif discovery was conducted using the MEME tool from the MEME Suite [64], and the discovered motifs were compared with the JASPAR2022 CORE vertebrates non-redundant database using the TOMTOM tool. Also, motif scanning was performed using the FIMO tool with transcription factor binding sites (TFBSs) data downloaded from the AnimalTFDB v4.0 database [64]. Motif logos were drawn using the R package universalmotif (v1.20.0).

Statistical analyses

For DEG analysis, the output files generated by Salmon were imported into R and analyzed using the DESeq2 package (v.1.28.1) [65]. DEGs were identified based on two criteria: an absolute log2-fold change greater than 1 and an FDR below 0.05. To compare mean expression levels at the transcript level between two unrelated groups, specifically PA and CN embryos, the Welch two sample t-test, implemented in R, was used.

Supporting information

S2 Fig. CpG coverages and a positive control for the pig imprinted gene.

https://doi.org/10.1371/journal.pgen.1011532.s002

(PDF)

S3 Fig. Identification of expressed transcripts from RNA-seq data of porcine PA and CN embryos.

https://doi.org/10.1371/journal.pgen.1011532.s003

(PDF)

S4 Fig. Partial DNA methylation at the ZNF791 locus in pigs downstream of the MAN2B1 gene.

https://doi.org/10.1371/journal.pgen.1011532.s004

(PDF)

S5 Fig. Partial DNA methylation at the ZNF791 locus in cattle downstream of the MAN2B1 gene.

https://doi.org/10.1371/journal.pgen.1011532.s005

(PDF)

S6 Fig. Partial DNA methylation at the ZNF791 locus in sheep downstream of the MAN2B1 gene.

https://doi.org/10.1371/journal.pgen.1011532.s006

(PDF)

S7 Fig. Partial DNA methylation at the ZNF791 locus in horses downstream of the MAN2B1 gene.

https://doi.org/10.1371/journal.pgen.1011532.s007

(PDF)

S8 Fig. Partial DNA methylation at the ZNF791 locus in goat downstream of the MAN2B1 gene.

https://doi.org/10.1371/journal.pgen.1011532.s008

(PDF)

S9 Fig. Partial DNA methylation at the ZNF791 locus in dogs downstream of the MAN2B1 gene.

https://doi.org/10.1371/journal.pgen.1011532.s009

(PDF)

S10 Fig. DNA methylation at the ZNF791 locus in pig embryos downstream of the MAN2B1 gene.

https://doi.org/10.1371/journal.pgen.1011532.s010

(PDF)

S11 Fig. Human lung exome from WES and RNA-seq.

https://doi.org/10.1371/journal.pgen.1011532.s011

(PDF)

S12 Fig. Expressed ZNF791 transcripts in humans, primates, and mice.

https://doi.org/10.1371/journal.pgen.1011532.s012

(PDF)

S13 Fig. Expressed ZNF791 transcripts in dogs, cattle, and pigs.

https://doi.org/10.1371/journal.pgen.1011532.s013

(PDF)

S14 Fig. Expressed transcripts at the ZNF791 locus in pig oocytes.

https://doi.org/10.1371/journal.pgen.1011532.s014

(PDF)

S15 Fig. An expressed unannotated transcript at the ZNF791 locus in pig oocytes.

https://doi.org/10.1371/journal.pgen.1011532.s015

(PDF)

S17 Fig. Expressed transcripts at the ZNF791 locus in cow oocytes.

https://doi.org/10.1371/journal.pgen.1011532.s017

(PDF)

S18 Fig. LTR-initiated transcription and establishment of methylation imprint in mouse oocytes.

https://doi.org/10.1371/journal.pgen.1011532.s018

(PDF)

S19 Fig. Expressed ZNF791 transcripts in sheep, horses, and goats.

https://doi.org/10.1371/journal.pgen.1011532.s019

(PDF)

S20 Fig. Multiple sequence alignment of LTRs.

https://doi.org/10.1371/journal.pgen.1011532.s020

(PDF)

S22 Fig. QTL analysis for the upstream of the unannotated antisense transcript.

https://doi.org/10.1371/journal.pgen.1011532.s022

(PDF)

S23 Fig. Sequence and binding motif analysis.

https://doi.org/10.1371/journal.pgen.1011532.s023

(PDF)

S2 Table. Identification of candidate DMR-associated transcripts.

https://doi.org/10.1371/journal.pgen.1011532.s025

(XLSX)

S4 Table. Analyses of partially methylated regions.

https://doi.org/10.1371/journal.pgen.1011532.s027

(XLSX)

S5 Table. Individual-matched analyses of genomic DNA and RNA-seq read counts on informative SNPs.

https://doi.org/10.1371/journal.pgen.1011532.s028

(XLSX)

Acknowledgments

We are thankful to Ms. Michelle Milligan for proofreading this manuscript.

References

  1. 1. Ferguson-Smith AC. Genomic imprinting: the emergence of an epigenetic paradigm. Nat Rev Genet. 2011;12(8):565–75. Epub 2011/07/19. pmid:21765458.
  2. 2. Reik W, Walter J. Genomic imprinting: parental influence on the genome. Nat Rev Genet. 2001;2(1):21–32. Epub 2001/03/17. pmid:11253064.
  3. 3. Barlow DP, Bartolomei MS. Genomic imprinting in mammals. Cold Spring Harb Perspect Biol. 2014;6(2). Epub 2014/02/05. pmid:24492710; PubMed Central PMCID: PMC3941233.
  4. 4. Iglesias-Platas I, Court F, Camprubi C, Sparago A, Guillaumet-Adkins A, Martin-Trujillo A, et al. Imprinting at the PLAGL1 domain is contained within a 70-kb CTCF/cohesin-mediated non-allelic chromatin loop. Nucleic Acids Res. 2013;41(4):2171–9. Epub 2013/01/09. pmid:23295672; PubMed Central PMCID: PMC3575839.
  5. 5. Smith ZD, Chan MM, Mikkelsen TS, Gu H, Gnirke A, Regev A, et al. A unique regulatory phase of DNA methylation in the early mammalian embryo. Nature. 2012;484(7394):339–44. Epub 2012/03/30. pmid:22456710; PubMed Central PMCID: PMC3331945.
  6. 6. Hore TA, Rapkins RW, Graves JA. Construction and evolution of imprinted loci in mammals. Trends Genet. 2007;23(9):440–8. Epub 2007/08/09. pmid:17683825.
  7. 7. Tian XC. Genomic imprinting in farm animals. Annu Rev Anim Biosci. 2014;2:23–40. Epub 2014/11/11. pmid:25384133.
  8. 8. Surani MA, Barton SC, Norris ML. Development of reconstituted mouse eggs suggests imprinting of the genome during gametogenesis. Nature. 1984;308(5959):548–50. Epub 1984/04/05. pmid:6709062.
  9. 9. Han X, Ouyang H, Chen X, Huang Y, Song Y, Zhang M, et al. Aberrant expression of Igf2/H19 in porcine parthenogenetic fetuses and placentas. Anim Reprod Sci. 2013;139(1–4):101–8. Epub 2013/05/11. pmid:23660367.
  10. 10. Ahn J, Hwang IS, Park MR, Hwang S, Lee K. Imprinting at the KBTBD6 locus involves species-specific maternal methylation and monoallelic expression in livestock animals. J Anim Sci Biotechnol. 2023;14(1):131. Epub 2023/10/11. pmid:37817239; PubMed Central PMCID: PMC10565993.
  11. 11. Tucci V, Isles AR, Kelsey G, Ferguson-Smith AC, Erice Imprinting G. Genomic Imprinting and Physiological Processes in Mammals. Cell. 2019;176(5):952–65. Epub 2019/02/23. pmid:30794780.
  12. 12. Zhang FW, Cheng HC, Jiang CD, Deng CY, Xiong YZ, Li FE, et al. Imprinted status of pleomorphic adenoma gene-like I and paternal expression gene 10 genes in pigs. J Anim Sci. 2007;85(4):886–90. Epub 2006/12/21. pmid:17178803.
  13. 13. Ahn J, Hwang IS, Park MR, Hwang S, Lee K. Genomic Imprinting at the Porcine PLAGL1 Locus and the Orthologous Locus in the Human. Genes (Basel). 2021;12(4). Epub 2021/05/01. pmid:33918057; PubMed Central PMCID: PMC8069715.
  14. 14. Ivanova E, Canovas S, Garcia-Martinez S, Romar R, Lopes JS, Rizos D, et al. DNA methylation changes during preimplantation development reveal inter-species differences and reprogramming events at imprinted genes. Clin Epigenetics. 2020;12(1):64. Epub 2020/05/13. pmid:32393379; PubMed Central PMCID: PMC7216732.
  15. 15. Chu C, Zhang W, Kang Y, Si C, Ji W, Niu Y, et al. Analysis of developmental imprinting dynamics in primates using SNP-free methods to identify imprinting defects in cloned placenta. Dev Cell. 2021;56(20):2826–40 e7. Epub 2021/10/08. pmid:34619096.
  16. 16. Xu Q, Xiang Y, Wang Q, Wang L, Brind’Amour J, Bogutz AB, et al. SETD2 regulates the maternal epigenome, genomic imprinting and embryonic development. Nat Genet. 2019;51(5):844–56. Epub 2019/05/02. pmid:31040401.
  17. 17. Yano S, Ishiuchi T, Abe S, Namekawa SH, Huang G, Ogawa Y, et al. Histone H3K36me2 and H3K36me3 form a chromatin platform essential for DNMT3A-dependent DNA methylation in mouse oocytes. Nat Commun. 2022;13(1):4440. Epub 2022/08/04. pmid:35922445; PubMed Central PMCID: PMC9349174.
  18. 18. Bogutz AB, Brind’Amour J, Kobayashi H, Jensen KN, Nakabayashi K, Imai H, et al. Evolution of imprinting via lineage-specific insertion of retroviral promoters. Nat Commun. 2019;10(1):5674. Epub 2019/12/14. pmid:31831741; PubMed Central PMCID: PMC6908575.
  19. 19. Okamura K, Sakaki Y, Ito T. Comparative genomics approach toward critical determinants for the imprinting of an evolutionarily conserved gene Impact. Biochem Biophys Res Commun. 2005;329(3):824–30. Epub 2005/03/09. pmid:15752730.
  20. 20. Gregg C, Zhang J, Weissbourd B, Luo S, Schroth GP, Haig D, et al. High-resolution analysis of parent-of-origin allelic expression in the mouse brain. Science. 2010;329(5992):643–8. Epub 2010/07/10. pmid:20616232; PubMed Central PMCID: PMC3005244.
  21. 21. Latos PA, Pauler FM, Koerner MV, Senergin HB, Hudson QJ, Stocsits RR, et al. Airn transcriptional overlap, but not its lncRNA products, induces imprinted Igf2r silencing. Science. 2012;338(6113):1469–72. Epub 2012/12/15. pmid:23239737.
  22. 22. Pande A, Makalowski W, Brosius J, Raabe CA. Enhancer occlusion transcripts regulate the activity of human enhancer domains via transcriptional interference: a computational perspective. Nucleic Acids Res. 2020;48(7):3435–54. Epub 2020/03/07. pmid:32133533; PubMed Central PMCID: PMC7144904.
  23. 23. Veselovska L, Smallwood SA, Saadeh H, Stewart KR, Krueger F, Maupetit-Mehouas S, et al. Deep sequencing and de novo assembly of the mouse oocyte transcriptome define the contribution of transcription to the DNA methylation landscape. Genome Biol. 2015;16:209. Epub 2015/09/27. pmid:26408185; PubMed Central PMCID: PMC4582738.
  24. 24. Brind’Amour J, Kobayashi H, Richard Albert J, Shirane K, Sakashita A, Kamio A, et al. LTR retrotransposons transcribed in oocytes drive species-specific and heritable changes in DNA methylation. Nat Commun. 2018;9(1):3331. Epub 2018/08/22. pmid:30127397; PubMed Central PMCID: PMC6102241.
  25. 25. Ferguson-Smith AC, Bourc’his D. The discovery and importance of genomic imprinting. Elife. 2018;7. Epub 2018/10/23. pmid:30343680; PubMed Central PMCID: PMC6197852.
  26. 26. Barlow DP. Methylation and imprinting: from host defense to gene regulation? Science. 1993;260(5106):309–10. Epub 1993/04/16. pmid:8469984.
  27. 27. Renfree MB, Suzuki S, Kaneko-Ishino T. The origin and evolution of genomic imprinting and viviparity in mammals. Philos Trans R Soc Lond B Biol Sci. 2013;368(1609):20120151. Epub 2012/11/21. pmid:23166401; PubMed Central PMCID: PMC3539366.
  28. 28. Weinberg DN, Papillon-Cavanagh S, Chen H, Yue Y, Chen X, Rajagopalan KN, et al. The histone mark H3K36me2 recruits DNMT3A and shapes the intergenic DNA methylation landscape. Nature. 2019;573(7773):281–6. Epub 2019/09/06. pmid:31485078; PubMed Central PMCID: PMC6742567.
  29. 29. Lu X, Zhang Y, Wang L, Wang L, Wang H, Xu Q, et al. Evolutionary epigenomic analyses in mammalian early embryos reveal species-specific innovations and conserved principles of imprinting. Sci Adv. 2021;7(48):eabi6178. Epub 2021/11/25. pmid:34818044; PubMed Central PMCID: PMC8612685.
  30. 30. Quenneville S, Verde G, Corsinotti A, Kapopoulou A, Jakobsson J, Offner S, et al. In embryonic stem cells, ZFP57/KAP1 recognize a methylated hexanucleotide to affect chromatin and DNA methylation of imprinting control regions. Mol Cell. 2011;44(3):361–72. Epub 2011/11/08. pmid:22055183; PubMed Central PMCID: PMC3210328.
  31. 31. Takahashi N, Coluccio A, Thorball CW, Planet E, Shi H, Offner S, et al. ZNF445 is a primary regulator of genomic imprinting. Genes Dev. 2019;33(1–2):49–54. Epub 2019/01/04. pmid:30602440; PubMed Central PMCID: PMC6317318.
  32. 32. Barazandeh M, Lambert SA, Albu M, Hughes TR. Comparison of ChIP-Seq Data and a Reference Motif Set for Human KRAB C2H2 Zinc Finger Proteins. G3 (Bethesda). 2018;8(1):219–29. Epub 2017/11/18. pmid:29146583; PubMed Central PMCID: PMC5765350.
  33. 33. Cao W, Douglas KC, Samollow PB, VandeBerg JL, Wang X, Clark AG. Origin and Evolution of Marsupial-specific Imprinting Clusters Through Lineage-specific Gene Duplications and Acquisition of Promoter Differential Methylation. Mol Biol Evol. 2023;40(2). Epub 2023/02/02. pmid:36721950; PubMed Central PMCID: PMC9937046.
  34. 34. Okae H, Hiura H, Nishida Y, Funayama R, Tanaka S, Chiba H, et al. Re-investigation and RNA sequencing-based identification of genes with placenta-specific imprinted expression. Hum Mol Genet. 2012;21(3):548–58. Epub 2011/10/26. pmid:22025075.
  35. 35. Okae H, Matoba S, Nagashima T, Mizutani E, Inoue K, Ogonuki N, et al. RNA sequencing-based identification of aberrant imprinting in cloned mice. Hum Mol Genet. 2014;23(4):992–1001. Epub 2013/10/10. pmid:24105465.
  36. 36. Inoue A, Jiang L, Lu F, Suzuki T, Zhang Y. Maternal H3K27me3 controls DNA methylation-independent imprinting. Nature. 2017;547(7664):419–24. Epub 2017/07/21. pmid:28723896; PubMed Central PMCID: PMC9674007.
  37. 37. Richard Albert J, Kobayashi T, Inoue A, Monteagudo-Sanchez A, Kumamoto S, Takashima T, et al. Conservation and divergence of canonical and non-canonical imprinting in murids. Genome Biol. 2023;24(1):48. Epub 2023/03/16. pmid:36918927; PubMed Central PMCID: PMC10012579.
  38. 38. Richard Albert J, Greenberg MVC. Non-canonical imprinting in the spotlight. Development. 2023;150(12). Epub 2023/06/13. pmid:37309910.
  39. 39. Hubert JN, Perret M, Riquet J, Demars J. Livestock species as emerging models for genomic imprinting. Front Cell Dev Biol. 2024;12:1348036. Epub 2024/03/19. pmid:38500688; PubMed Central PMCID: PMC10945557.
  40. 40. Haig D. Genomic imprinting and kinship: how good is the evidence? Annu Rev Genet. 2004;38:553–85. Epub 2004/12/01. pmid:15568986.
  41. 41. Li W, Wang Z, Luo S, Wu J, Zhou L, Liu J. Genome-Wide Association Analysis and Genetic Parameters for Feed Efficiency and Related Traits in Yorkshire and Duroc Pigs. Animals (Basel). 2022;12(15). Epub 2022/07/28. pmid:35892552; PubMed Central PMCID: PMC9329986.
  42. 42. Gallou-Kabani C, Gabory A, Tost J, Karimi M, Mayeur S, Lesage J, et al. Sex- and diet-specific changes of imprinted gene expression and DNA methylation in mouse placenta under a high-fat diet. PLoS One. 2010;5(12):e14398. Epub 2011/01/05. pmid:21200436; PubMed Central PMCID: PMC3006175 Alexandre Vige). This does not alter the authors’ adherence to all the PLoS ONE policies on sharing data and materials.
  43. 43. Ahn J, Wu H, Lee J, Hwang IS, Yu D, Ahn JS, et al. Identification of a Novel Imprinted Transcript in the Porcine GNAS Complex Locus Using Methylome and Transcriptome of Parthenogenetic Fetuses. Genes (Basel). 2020;11(1):96. Epub 2020/01/18. pmid:31947640; PubMed Central PMCID: PMC7017182.
  44. 44. Bischoff SR, Tsai S, Hardison N, Motsinger-Reif AA, Freking BA, Piedrahita JA. Functional genomic approaches for the study of fetal/placental development in swine with special emphasis on imprinted genes. Soc Reprod Fertil Suppl. 2009;66:245–64. Epub 2009/10/24. pmid:19848292.
  45. 45. Hwang IS, Park MR, Lee HS, Kwak TU, Son HY, Kang JK, et al. Developmental and Degenerative Characterization of Porcine Parthenogenetic Fetuses during Early Pregnancy. Animals (Basel). 2020;10(4). Epub 2020/04/09. pmid:32260352; PubMed Central PMCID: PMC7222715.
  46. 46. Krueger F, Andrews SR. Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications. Bioinformatics. 2011;27(11):1571–2. Epub 2011/04/16. pmid:21493656; PubMed Central PMCID: PMC3102221.
  47. 47. Ramirez F, Dundar F, Diehl S, Gruning BA, Manke T. deepTools: a flexible platform for exploring deep-sequencing data. Nucleic Acids Res. 2014;42(Web Server issue):W187–91. Epub 2014/05/07. pmid:24799436; PubMed Central PMCID: PMC4086134.
  48. 48. Hansen KD, Langmead B, Irizarry RA. BSmooth: from whole genome bisulfite sequencing reads to differentially methylated regions. Genome Biol. 2012;13(10):R83. Epub 2012/10/05. pmid:23034175; PubMed Central PMCID: PMC3491411.
  49. 49. Juhling F, Kretzmer H, Bernhart SH, Otto C, Stadler PF, Hoffmann S. metilene: fast and sensitive calling of differentially methylated regions from bisulfite sequencing data. Genome Res. 2016;26(2):256–62. Epub 2015/12/04. pmid:26631489; PubMed Central PMCID: PMC4728377.
  50. 50. Hahne F, Ivanek R. Visualizing Genomic Data Using Gviz and Bioconductor. Methods Mol Biol. 2016;1418:335–51. Epub 2016/03/24. pmid:27008022.
  51. 51. Ryan DP. MethylDackel: A (mostly) universal methylation extractor for BS-seq experiments. https://github.com/dpryan79/methyldackel. 2019.
  52. 52. Xie W, Barr CL, Kim A, Yue F, Lee AY, Eubanks J, et al. Base-resolution analyses of sequence and parent-of-origin dependent DNA methylation in the mouse genome. Cell. 2012;148(4):816–31. Epub 2012/02/22. pmid:22341451; PubMed Central PMCID: PMC3343639.
  53. 53. Kumaki Y, Oda M, Okano M. QUMA: quantification tool for methylation analysis. Nucleic Acids Res. 2008;36(Web Server issue):W170–5. Epub 2008/05/20. pmid:18487274; PubMed Central PMCID: PMC2447804.
  54. 54. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20. Epub 2014/04/04. pmid:24695404; PubMed Central PMCID: PMC4103590.
  55. 55. Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29(1):15–21. Epub 2012/10/30. pmid:23104886; PubMed Central PMCID: PMC3530905.
  56. 56. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25(16):2078–9. Epub 2009/06/10. pmid:19505943; PubMed Central PMCID: PMC2723002.
  57. 57. Patro R, Duggal G, Love MI, Irizarry RA, Kingsford C. Salmon provides fast and bias-aware quantification of transcript expression. Nat Methods. 2017;14(4):417–9. Epub 2017/03/07. pmid:28263959; PubMed Central PMCID: PMC5600148.
  58. 58. Ahn J, Lee J, Kim DH, Hwang IS, Park MR, Cho IC, et al. Loss of Monoallelic Expression of IGF2 in the Adult Liver Via Alternative Promoter Usage and Chromatin Reorganization. Front Genet. 2022;13:920641. Epub 2022/08/09. pmid:35938007; PubMed Central PMCID: PMC9355166.
  59. 59. Storer J, Hubley R, Rosen J, Wheeler TJ, Smit AF. The Dfam community resource of transposable element families, sequence models, and genome annotations. Mob DNA. 2021;12(1):2. Epub 2021/01/14. pmid:33436076; PubMed Central PMCID: PMC7805219.
  60. 60. Tamura K, Stecher G, Kumar S. MEGA11: Molecular Evolutionary Genetics Analysis Version 11. Mol Biol Evol. 2021;38(7):3022–7. Epub 2021/04/24. pmid:33892491; PubMed Central PMCID: PMC8233496.
  61. 61. Kimura M. A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J Mol Evol. 1980;16(2):111–20. Epub 1980/12/01. pmid:7463489.
  62. 62. Kumar S, Stecher G, Suleski M, Hedges SB. TimeTree: A Resource for Timelines, Timetrees, and Divergence Times. Mol Biol Evol. 2017;34(7):1812–9. Epub 2017/04/08. pmid:28387841.
  63. 63. Rambaut A. FigTree v.1.4.4. http://tree.bio.ed.ac.uk/software/figtree. 2018.
  64. 64. Shen WK, Chen SY, Gan ZQ, Zhang YZ, Yue T, Chen MM, et al. AnimalTFDB 4.0: a comprehensive animal transcription factor database updated with variation and expression annotations. Nucleic Acids Res. 2023;51(D1):D39–D45. Epub 2022/10/22. pmid:36268869; PubMed Central PMCID: PMC9825474.
  65. 65. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15(12):550. Epub 2014/12/18. pmid:25516281; PubMed Central PMCID: PMC4302049.