使用广义线性模型对植物全基因组印迹研究进行一致的重新分析可提高数据集之间的一致性。

Consistent Reanalysis of Genome-wide Imprinting Studies in Plants Using Generalized Linear Models Increases Concordance across Datasets.

机构信息

Department of Plant and Microbial Biology & Zurich-Basel Plant Science Center, University of Zurich, Zollikerstrasse 107, CH-8008, Zurich, Switzerland.

Centre for Organismal Studies, Heidelberg University, Im Neuenheimer Feld 230, 69120, Heidelberg, Germany.

出版信息

Sci Rep. 2019 Feb 4;9(1):1320. doi: 10.1038/s41598-018-36768-4.

DOI:10.1038/s41598-018-36768-4

PMID:30718537

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6362150/

Abstract

Genomic imprinting leads to different expression levels of maternally and paternally derived alleles. Over the last years, major progress has been made in identifying novel imprinted candidate genes in plants, owing to affordable next-generation sequencing technologies. However, reports on sequencing the transcriptome of hybrid F1 seed tissues strongly disagree about how many and which genes are imprinted. This raises questions about the relative impact of biological, environmental, technical, and analytic differences or biases. Here, we adopt a statistical approach, frequently used in RNA-seq data analysis, which properly models count overdispersion and considers replicate information of reciprocal crosses. We show that our statistical pipeline outperforms other methods in identifying imprinted genes in simulated and real data. Accordingly, reanalysis of genome-wide imprinting studies in Arabidopsis and maize shows that, at least for Arabidopsis, an increased agreement across datasets could be observed. For maize, however, consistent reanalysis did not yield a larger overlap between the datasets. This suggests that the discrepancy across publications might be partially due to different analysis pipelines but that technical, biological, and environmental factors underlie much of the discrepancy between datasets. Finally, we show that the set of genes that can be characterized regarding allelic bias by all studies with minimal confidence is small (~8,000/27,416 genes for Arabidopsis and ~12,000/39,469 for maize). In conclusion, we propose to use biologically replicated reciprocal crosses, high sequence coverage, and a generalized linear model approach to identify differentially expressed alleles in developing seeds.

摘要

基因组印迹导致来自母系和父系等位基因的不同表达水平。近年来，由于负担得起的下一代测序技术，在鉴定植物中新的印迹候选基因方面取得了重大进展。然而，关于杂交 F1 种子组织转录组测序的报告强烈不同意有多少和哪些基因是印迹的。这引发了关于生物、环境、技术和分析差异或偏差相对影响的问题。在这里，我们采用了一种统计方法，该方法常用于 RNA-seq 数据分析，能够正确地对计数过度分散进行建模，并考虑到相互交叉的重复信息。我们表明，我们的统计管道在识别模拟和真实数据中的印迹基因方面优于其他方法。相应地，对拟南芥和玉米中全基因组印迹研究的重新分析表明，至少对于拟南芥，可以观察到数据集之间的一致性增加。然而，对于玉米，一致的重新分析并没有在数据集之间产生更大的重叠。这表明，出版物之间的差异可能部分归因于不同的分析管道，但技术、生物和环境因素是数据集之间差异的主要原因。最后，我们表明，通过所有具有最小置信度的研究来描述等位基因偏倚的基因集很小（拟南芥为~~8000/27416 个基因，玉米为~~12000/39469 个基因）。总之，我们建议使用生物复制的相互交叉、高序列覆盖度和广义线性模型方法来识别发育种子中差异表达的等位基因。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b845/6362150/39380cacbed6/41598_2018_36768_Fig1_HTML.jpg

相似文献

Consistent Reanalysis of Genome-wide Imprinting Studies in Plants Using Generalized Linear Models Increases Concordance across Datasets.使用广义线性模型对植物全基因组印迹研究进行一致的重新分析可提高数据集之间的一致性。

Sci Rep. 2019 Feb 4;9(1):1320. doi: 10.1038/s41598-018-36768-4.

Dynamic expression of imprinted genes associates with maternally controlled nutrient allocation during maize endosperm development.印记基因的动态表达与玉米胚乳发育过程中母本控制的养分分配相关。

Plant Cell. 2013 Sep;25(9):3212-27. doi: 10.1105/tpc.113.115592. Epub 2013 Sep 20.

Identification and Comparison of Imprinted Genes Across Plant Species.不同植物物种中印迹基因的鉴定与比较

Methods Mol Biol. 2020;2093:173-201. doi: 10.1007/978-1-0716-0179-2_13.

Genomic imprinting during seed development.种子发育过程中的基因组印记

Adv Genet. 2002;46:165-214. doi: 10.1016/s0065-2660(02)46007-5.

High-resolution analysis of parent-of-origin allelic expression in the Arabidopsis Endosperm.拟南芥胚乳中亲本来源等位基因表达的高分辨率分析。

PLoS Genet. 2011 Jun;7(6):e1002126. doi: 10.1371/journal.pgen.1002126. Epub 2011 Jun 16.

Widespread imprinting of transposable elements and variable genes in the maize endosperm.玉米胚乳中转座元件和可变基因的广泛印迹。

PLoS Genet. 2021 Apr 8;17(4):e1009491. doi: 10.1371/journal.pgen.1009491. eCollection 2021 Apr.

Genomic imprinting in the Arabidopsis embryo is partly regulated by PRC2.拟南芥胚胎中的基因组印记部分受 PRC2 调控。

PLoS Genet. 2013;9(12):e1003862. doi: 10.1371/journal.pgen.1003862. Epub 2013 Dec 5.

Dynamic and Antagonistic Allele-Specific Epigenetic Modifications Controlling the Expression of Imprinted Genes in Maize Endosperm.动态且拮抗的等位基因特异性表观遗传修饰控制玉米胚乳中印迹基因的表达。

Mol Plant. 2017 Mar 6;10(3):442-455. doi: 10.1016/j.molp.2016.10.007. Epub 2016 Oct 25.

Imprinted gene expression in maize starchy endosperm and aleurone tissues of reciprocal F1 hybrids at a defined developmental stage.在特定发育阶段，正反交F1杂种玉米淀粉胚乳和糊粉层组织中的印记基因表达。

Genes Genomics. 2018 Jan;40(1):99-107. doi: 10.1007/s13258-017-0613-9. Epub 2017 Sep 30.

Identification of imprinted genes subject to parent-of-origin specific expression in Arabidopsis thaliana seeds.鉴定拟南芥种子中具有亲本来源特异性表达的印迹基因。

BMC Plant Biol. 2011 Aug 12;11:113. doi: 10.1186/1471-2229-11-113.

引用本文的文献

Multilayered epigenetic control of persistent and stage-specific imprinted genes in rice endosperm.水稻胚乳中持久的和阶段特异性印记基因的多层次表观遗传控制。

Nat Plants. 2024 Aug;10(8):1231-1245. doi: 10.1038/s41477-024-01754-4. Epub 2024 Jul 30.

Machine learning on alignment features for parent-of-origin classification of simulated hybrid RNA-seq.基于比对特征的机器学习方法用于模拟杂交 RNA-seq 的亲本来源分类。

BMC Bioinformatics. 2024 Mar 12;25(1):109. doi: 10.1186/s12859-024-05728-3.

Imprinting but not cytonuclear interactions determines seed size heterosis in Arabidopsis hybrids.印记而非细胞核与细胞质的相互作用决定了拟南芥杂交种的种子大小杂种优势。

Plant Physiol. 2024 May 31;195(2):1214-1228. doi: 10.1093/plphys/kiae061.

The evolution of imprinting in plants: beyond the seed.植物印迹的进化：超越种子。

Plant Reprod. 2021 Dec;34(4):373-383. doi: 10.1007/s00497-021-00410-7. Epub 2021 Apr 29.

Widespread imprinting of transposable elements and variable genes in the maize endosperm.玉米胚乳中转座元件和可变基因的广泛印迹。

PLoS Genet. 2021 Apr 8;17(4):e1009491. doi: 10.1371/journal.pgen.1009491. eCollection 2021 Apr.

Mutation of the imprinted gene OsEMF2a induces autonomous endosperm development and delayed cellularization in rice.印记基因 OsEMF2a 的突变诱导水稻自主胚乳发育和细胞延迟化。

Plant Cell. 2021 Mar 22;33(1):85-103. doi: 10.1093/plcell/koaa006.

Genomic imprinted genes in reciprocal hybrid endosperm of Brassica napus.甘蓝型油菜正反交胚乳的基因组印迹基因。

BMC Plant Biol. 2021 Mar 16;21(1):140. doi: 10.1186/s12870-021-02908-8.

Paternally Expressed Imprinted Genes under Positive Darwinian Selection in Arabidopsis thaliana.拟南芥中受正达尔文选择作用的父源印迹基因。

Mol Biol Evol. 2019 Jun 1;36(6):1239-1253. doi: 10.1093/molbev/msz063.

本文引用的文献

Construction of the third-generation Zea mays haplotype map.第三代玉米单倍型图谱的构建。

Gigascience. 2018 Apr 1;7(4):1-12. doi: 10.1093/gigascience/gix134.

Widespread Contamination of Arabidopsis Embryo and Endosperm Transcriptome Data Sets.拟南芥胚胎和胚乳转录组数据集的广泛污染

Plant Cell. 2017 Apr;29(4):608-617. doi: 10.1105/tpc.16.00845. Epub 2017 Mar 17.

iCOBRA: open, reproducible, standardized and live method benchmarking.iCOBRA：开放、可重复、标准化且实时的方法基准测试。

Nat Methods. 2016 Apr;13(4):283. doi: 10.1038/nmeth.3805.

Evolution and function of genomic imprinting in plants.植物基因组印记的进化与功能

Genes Dev. 2015 Dec 15;29(24):2517-31. doi: 10.1101/gad.269902.115.

Genomic imprinting in the human placenta.人类胎盘中的基因组印记

Am J Obstet Gynecol. 2015 Oct;213(4 Suppl):S152-62. doi: 10.1016/j.ajog.2015.06.032.

Tools and best practices for data processing in allelic expression analysis.等位基因表达分析中数据处理的工具及最佳实践

Genome Biol. 2015 Sep 17;16(1):195. doi: 10.1186/s13059-015-0762-6.

Different yet similar: evolution of imprinting in flowering plants and mammals.不同却又相似：开花植物和哺乳动物中印迹的进化

F1000Prime Rep. 2014 Aug 1;6:63. doi: 10.12703/P6-63. eCollection 2014.

Natural epigenetic polymorphisms lead to intraspecific variation in Arabidopsis gene imprinting.自然表观遗传多态性导致拟南芥基因印记的种内变异。

Elife. 2014 Jul 3;3:e03198. doi: 10.7554/eLife.03198.

Using next-generation RNA sequencing to identify imprinted genes.使用新一代RNA测序技术来鉴定印记基因。

Heredity (Edinb). 2014 Aug;113(2):156-66. doi: 10.1038/hdy.2014.18. Epub 2014 Mar 12.

Genomic imprinting in the Arabidopsis embryo is partly regulated by PRC2.拟南芥胚胎中的基因组印记部分受 PRC2 调控。

PLoS Genet. 2013;9(12):e1003862. doi: 10.1371/journal.pgen.1003862. Epub 2013 Dec 5.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用广义线性模型对植物全基因组印迹研究进行一致的重新分析可提高数据集之间的一致性。

Consistent Reanalysis of Genome-wide Imprinting Studies in Plants Using Generalized Linear Models Increases Concordance across Datasets.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献