Suppr超能文献

基于表观基因组的剪接预测使用递归神经网络。

Epigenome-based splicing prediction using a recurrent neural network.

机构信息

Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America.

Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, United States of America.

出版信息

PLoS Comput Biol. 2020 Jun 25;16(6):e1008006. doi: 10.1371/journal.pcbi.1008006. eCollection 2020 Jun.

Abstract

Alternative RNA splicing provides an important means to expand metazoan transcriptome diversity. Contrary to what was accepted previously, splicing is now thought to predominantly take place during transcription. Motivated by emerging data showing the physical proximity of the spliceosome to Pol II, we surveyed the effect of epigenetic context on co-transcriptional splicing. In particular, we observed that splicing factors were not necessarily enriched at exon junctions and that most epigenetic signatures had a distinctly asymmetric profile around known splice sites. Given this, we tried to build an interpretable model that mimics the physical layout of splicing regulation where the chromatin context progressively changes as the Pol II moves along the guide DNA. We used a recurrent-neural-network architecture to predict the inclusion of a spliced exon based on adjacent epigenetic signals, and we showed that distinct spatio-temporal features of these signals were key determinants of model outcome, in addition to the actual nucleotide sequence of the guide DNA strand. After the model had been trained and tested (with >80% precision-recall curve metric), we explored the derived weights of the latent factors, finding they highlight the importance of the asymmetric time-direction of chromatin context during transcription.

摘要

可变剪接为后生动物转录组多样性的扩展提供了重要手段。与之前被广泛接受的观点相反,现在认为剪接主要发生在转录过程中。鉴于新兴数据表明剪接体与 Pol II 物理上接近,我们调查了表观遗传背景对共转录剪接的影响。具体来说,我们观察到剪接因子不一定在exon 交界处富集,并且大多数表观遗传特征在已知剪接位点周围呈现明显的不对称分布。有鉴于此,我们试图构建一个可解释的模型,模拟剪接调控的物理布局,其中染色质背景随着 Pol II 沿着指导 DNA 移动而逐渐变化。我们使用递归神经网络架构根据相邻的表观遗传信号来预测剪接exon 的包含情况,并且我们表明这些信号的不同时空特征是模型结果的关键决定因素,除了指导 DNA 链的实际核苷酸序列之外。在对模型进行训练和测试(准确率-召回率曲线指标>80%)之后,我们探索了潜在因子的衍生权重,发现它们突出了转录过程中染色质背景的不对称时间方向的重要性。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验