Suppr超能文献

机器学习分析甲基化谱揭示了小麦组织特异性基因表达模式。

Machine learning analyses of methylation profiles uncovers tissue-specific gene expression patterns in wheat.

机构信息

Department of Plant Sciences and Crop Development Centre, University of Saskatchewan, Saskatoon, SK, Canada, S7N 5A8.

Global Institute for Food Security, Saskatoon, SK, Canada, S7N 0W9.

出版信息

Plant Genome. 2020 Jul;13(2):e20027. doi: 10.1002/tpg2.20027. Epub 2020 Jun 3.

Abstract

DNA methylation is a mechanism of epigenetic modification in eukaryotic organisms. Generally, methylation within genes promoter inhibits regulatory protein binding and represses transcription, whereas gene body methylation is associated with actively transcribed genes. However, it remains unclear whether there is interaction between methylation levels across genic regions and which site has the biggest impact on gene regulation. We investigated and used the methylation patterns of the bread wheat cultivar Chinese Spring to uncover differentially expressed genes (DEGs) between roots and leaves, using six machine learning algorithms and a deep neural network. As anticipated, genes with higher expression in leaves were mainly involved in photosynthesis and pigment biosynthesis processes whereas genes that were not differentially expressed between roots and leaves were involved in protein processes and membrane structures. Methylation occurred preponderantly (60%) in the CG context, whereas 35 and 5% of methylation occurred in CHG and CHH contexts, respectively. Methylation levels were highly correlated (r = 0.7 to 0.9) between all genic regions, except within the promoter (r = 0.4 to 0.5). Machine learning models gave a high (0.81) prediction accuracy of DEGs. There was a strong correlation (p-value = 9.20×10 ) between all features and gene expression, suggesting that methylation across all genic regions contribute to gene regulation. However, the methylation of the promoter, the CDS and the exon in CG context was the most impactful. Our study provides more insights into the interplay between DNA methylation and gene expression and paves the way for identifying tissue-specific genes using methylation profiles.

摘要

DNA 甲基化是真核生物中一种表观遗传修饰的机制。一般来说,基因启动子内的甲基化抑制调节蛋白结合并抑制转录,而基因体甲基化与活跃转录的基因有关。然而,甲基化水平在基因区域之间是否存在相互作用,以及哪个位点对基因调控的影响最大,仍不清楚。我们使用六种机器学习算法和一个深度神经网络,研究并利用面包小麦品种春小麦的甲基化模式,揭示根和叶之间差异表达的基因(DEGs)。正如预期的那样,在叶中表达较高的基因主要参与光合作用和色素生物合成过程,而在根和叶之间没有差异表达的基因则参与蛋白质过程和膜结构。CG 背景下发生的甲基化占主导地位(60%),而 CHG 和 CHH 背景下分别发生 35%和 5%的甲基化。除启动子(r = 0.4 到 0.5)外,所有基因区域之间的甲基化水平高度相关(r = 0.7 到 0.9)。机器学习模型对 DEGs 的预测准确率很高(0.81)。所有特征与基因表达之间存在很强的相关性(p 值 = 9.20×10 ),这表明所有基因区域的甲基化都有助于基因调控。然而,CG 背景下的启动子、CDS 和外显子的甲基化影响最大。我们的研究为 DNA 甲基化和基因表达之间的相互作用提供了更多的见解,并为使用甲基化谱识别组织特异性基因铺平了道路。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验