Schon Michael A, Nodine Michael D
Gregor Mendel Institute, Austrian Academy of Sciences, Vienna Biocenter, 1030 Vienna, Austria.
Gregor Mendel Institute, Austrian Academy of Sciences, Vienna Biocenter, 1030 Vienna, Austria
Plant Cell. 2017 Apr;29(4):608-617. doi: 10.1105/tpc.16.00845. Epub 2017 Mar 17.
A major goal of global gene expression profiling in plant seeds has been to investigate the parental contributions to the transcriptomes of early embryos and endosperm. However, consistency between independent studies has been poor, leading to considerable debate. We have developed a statistical tool that reveals the presence of substantial RNA contamination from maternal tissues in nearly all published endosperm and early embryo transcriptomes generated in these studies. We demonstrate that maternal RNA contamination explains the poor reproducibility of these transcriptomic data sets. Furthermore, we found that RNA contamination from maternal tissues has been repeatedly misinterpreted as epigenetic phenomena, which has resulted in inaccurate conclusions regarding the parental contributions to both the endosperm and early embryo transcriptomes. After accounting for maternal RNA contamination, no published genome-wide data set supports the concept of delayed paternal genome activation in plant embryos. Moreover, our analysis suggests that maternal and paternal genomic imprinting are equally rare events in Arabidopsis endosperm. Our publicly available software (https://github.com/Gregor-Mendel-Institute/tissue-enrichment-test) can help the community assess the level of contamination in transcriptome data sets generated from both seed and non-seed tissues.
植物种子中全球基因表达谱分析的一个主要目标是研究亲本对早期胚胎和胚乳转录组的贡献。然而,独立研究之间的一致性很差,引发了大量争论。我们开发了一种统计工具,该工具揭示了在这些研究中几乎所有已发表的胚乳和早期胚胎转录组中都存在来自母体组织的大量RNA污染。我们证明母体RNA污染解释了这些转录组数据集较差的可重复性。此外,我们发现来自母体组织的RNA污染一直被反复误解为表观遗传现象,这导致了关于亲本对胚乳和早期胚胎转录组贡献的不准确结论。在考虑母体RNA污染后,没有已发表的全基因组数据集支持植物胚胎中父本基因组延迟激活的概念。此外,我们的分析表明,在拟南芥胚乳中,母本和父本基因组印记同样罕见。我们公开可用的软件(https://github.com/Gregor-Mendel-Institute/tissue-enrichment-test)可以帮助科学界评估从种子和非种子组织生成的转录组数据集中的污染水平。