• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Evolutionarily informed deep learning methods for predicting relative transcript abundance from DNA sequence.从 DNA 序列预测相对转录丰度的进化信息深度学习方法。
Proc Natl Acad Sci U S A. 2019 Mar 19;116(12):5542-5549. doi: 10.1073/pnas.1814551116. Epub 2019 Mar 6.
2
tRNA-DL: A Deep Learning Approach to Improve tRNAscan-SE Prediction Results.tRNA-DL:一种用于改善tRNAscan-SE预测结果的深度学习方法。
Hum Hered. 2018;83(3):163-172. doi: 10.1159/000493215. Epub 2019 Jan 25.
3
CRMSNet: A deep learning model that uses convolution and residual multi-head self-attention block to predict RBPs for RNA sequence.CRMSNet:一种深度学习模型,使用卷积和残差多头自注意力块来预测 RNA 序列的 RBPs。
Proteins. 2023 Aug;91(8):1032-1041. doi: 10.1002/prot.26489. Epub 2023 Mar 28.
4
DeepHE: Accurately predicting human essential genes based on deep learning.DeepHE:基于深度学习的人类必需基因精准预测。
PLoS Comput Biol. 2020 Sep 16;16(9):e1008229. doi: 10.1371/journal.pcbi.1008229. eCollection 2020 Sep.
5
Deep learning based DNA:RNA triplex forming potential prediction.基于深度学习的 DNA:RNA 三链形成势能预测。
BMC Bioinformatics. 2020 Nov 12;21(1):522. doi: 10.1186/s12859-020-03864-0.
6
Context specific transcription factor prediction.上下文特异性转录因子预测
Ann Biomed Eng. 2007 Jun;35(6):1053-67. doi: 10.1007/s10439-007-9268-z. Epub 2007 Mar 22.
7
DEGnext: classification of differentially expressed genes from RNA-seq data using a convolutional neural network with transfer learning.DEGnext:使用具有迁移学习的卷积神经网络对 RNA-seq 数据进行差异表达基因分类。
BMC Bioinformatics. 2022 Jan 6;23(1):17. doi: 10.1186/s12859-021-04527-4.
8
TEC-miTarget: enhancing microRNA target prediction based on deep learning of ribonucleic acid sequences.TEC-miTarget:基于 RNA 序列深度学习的 miRNA 靶基因预测增强方法。
BMC Bioinformatics. 2024 Apr 20;25(1):159. doi: 10.1186/s12859-024-05780-z.
9
Local Epigenomic Data are more Informative than Local Genome Sequence Data in Predicting Enhancer-Promoter Interactions Using Neural Networks.利用神经网络进行增强子-启动子相互作用预测时,局部表观基因组数据比局部基因组序列数据更具信息量。
Genes (Basel). 2019 Dec 29;11(1):41. doi: 10.3390/genes11010041.
10
Improving the Quantification of DNA Sequences Using Evolutionary Information Based on Deep Learning.基于深度学习的利用进化信息提高 DNA 序列定量分析。
Cells. 2019 Dec 14;8(12):1635. doi: 10.3390/cells8121635.

引用本文的文献

1
Beyond the genome: the role of functional markers in contemporary plant breeding.超越基因组:功能标记在当代植物育种中的作用
Front Plant Sci. 2025 Aug 5;16:1637299. doi: 10.3389/fpls.2025.1637299. eCollection 2025.
2
In silico prediction of variant effects: promises and limitations for precision plant breeding.变异效应的计算机模拟预测:精准植物育种的前景与局限
Theor Appl Genet. 2025 Jul 28;138(8):193. doi: 10.1007/s00122-025-04973-1.
3
Discriminating models of trait evolution.性状进化的判别模型。
bioRxiv. 2025 Jun 13:2025.06.12.659377. doi: 10.1101/2025.06.12.659377.
4
Improving plant breeding through AI-supported data integration.通过人工智能支持的数据整合改进植物育种。
Theor Appl Genet. 2025 Jun 2;138(6):132. doi: 10.1007/s00122-025-04910-2.
5
From text to traits: exploring the role of large language models in plant breeding.从文本到性状:探索大语言模型在植物育种中的作用
Front Plant Sci. 2025 May 14;16:1583344. doi: 10.3389/fpls.2025.1583344. eCollection 2025.
6
Precise engineering of gene expression by editing plasticity.通过编辑可塑性实现基因表达的精确工程。
Genome Biol. 2025 Mar 10;26(1):51. doi: 10.1186/s13059-025-03516-7.
7
Big data and artificial intelligence-aided crop breeding: Progress and prospects.大数据与人工智能辅助作物育种:进展与展望
J Integr Plant Biol. 2025 Mar;67(3):722-739. doi: 10.1111/jipb.13791. Epub 2024 Oct 28.
8
Identification of lineage-specific cis-trans regulatory networks related to kiwifruit ripening initiation.与猕猴桃成熟起始相关的谱系特异性顺式-反式调控网络的鉴定
Plant J. 2024 Dec;120(5):1987-1999. doi: 10.1111/tpj.17093. Epub 2024 Oct 27.
9
Identification, Design, and Application of Noncoding Cis-Regulatory Elements.非编码顺式调控元件的鉴定、设计与应用。
Biomolecules. 2024 Aug 5;14(8):945. doi: 10.3390/biom14080945.
10
Applying interpretable machine learning in computational biology-pitfalls, recommendations and opportunities for new developments.将可解释机器学习应用于计算生物学——新发展的陷阱、建议和机会。
Nat Methods. 2024 Aug;21(8):1454-1461. doi: 10.1038/s41592-024-02359-7. Epub 2024 Aug 9.

本文引用的文献

1
Deep learning in biomedicine.深度学习在生物医学中的应用。
Nat Biotechnol. 2018 Oct;36(9):829-838. doi: 10.1038/nbt.4233. Epub 2018 Sep 6.
2
DNA sequence and shape are predictive for meiotic crossovers throughout the plant kingdom.DNA序列和形状在整个植物界中对减数分裂交叉具有预测性。
Plant J. 2018 May 29. doi: 10.1111/tpj.13979.
3
Identification of cis-regulatory elements by chromatin structure.通过染色质结构鉴定顺式调控元件。
Curr Opin Plant Biol. 2018 Apr;42:90-94. doi: 10.1016/j.pbi.2018.04.004. Epub 2018 Apr 25.
4
Dysregulation of expression correlates with rare-allele burden and fitness loss in maize.表达失调与玉米中稀有等位基因负担和适应度损失相关。
Nature. 2018 Mar 22;555(7697):520-523. doi: 10.1038/nature25966. Epub 2018 Mar 14.
5
Deep learning for biology.用于生物学的深度学习
Nature. 2018 Feb 22;554(7693):555-557. doi: 10.1038/d41586-018-02174-z.
6
DRUID: a pipeline for transcriptome-wide measurements of mRNA stability.DRUID:一种用于测量 mRNA 稳定性的转录组范围的方法。
RNA. 2018 May;24(5):623-632. doi: 10.1261/rna.062877.117. Epub 2018 Feb 8.
7
Determining mRNA half-lives on a transcriptome-wide scale.在转录组范围内测定 mRNA 半衰期。
Methods. 2018 Mar 15;137:90-98. doi: 10.1016/j.ymeth.2017.12.006. Epub 2017 Dec 13.
8
UTR-Dependent Control of Gene Expression in Plants.UTR 依赖的植物基因表达调控。
Trends Plant Sci. 2018 Mar;23(3):248-259. doi: 10.1016/j.tplants.2017.11.003. Epub 2017 Dec 6.
9
The Sorghum bicolor reference genome: improved assembly, gene annotations, a transcriptome atlas, and signatures of genome organization.高粱参考基因组:改进的组装、基因注释、转录组图谱和基因组结构特征。
Plant J. 2018 Jan;93(2):338-354. doi: 10.1111/tpj.13781. Epub 2017 Dec 28.
10
Differentially Regulated Orthologs in Sorghum and the Subgenomes of Maize.高粱和玉米亚基因组中差异调控的直系同源基因。
Plant Cell. 2017 Aug;29(8):1938-1951. doi: 10.1105/tpc.17.00354. Epub 2017 Jul 21.

从 DNA 序列预测相对转录丰度的进化信息深度学习方法。

Evolutionarily informed deep learning methods for predicting relative transcript abundance from DNA sequence.

机构信息

Institute for Genomic Diversity, Cornell University, Ithaca, NY 14853.

Institute for Genomic Diversity, Cornell University, Ithaca, NY 14853;

出版信息

Proc Natl Acad Sci U S A. 2019 Mar 19;116(12):5542-5549. doi: 10.1073/pnas.1814551116. Epub 2019 Mar 6.

DOI:10.1073/pnas.1814551116
PMID:30842277
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6431157/
Abstract

Deep learning methodologies have revolutionized prediction in many fields and show potential to do the same in molecular biology and genetics. However, applying these methods in their current forms ignores evolutionary dependencies within biological systems and can result in false positives and spurious conclusions. We developed two approaches that account for evolutionary relatedness in machine learning models: () gene-family-guided splitting and () ortholog contrasts. The first approach accounts for evolution by constraining model training and testing sets to include different gene families. The second approach uses evolutionarily informed comparisons between orthologous genes to both control for and leverage evolutionary divergence during the training process. The two approaches were explored and validated within the context of mRNA expression level prediction and have the area under the ROC curve (auROC) values ranging from 0.75 to 0.94. Model weight inspections showed biologically interpretable patterns, resulting in the hypothesis that the 3' UTR is more important for fine-tuning mRNA abundance levels while the 5' UTR is more important for large-scale changes.

摘要

深度学习方法已经彻底改变了许多领域的预测,并显示出在分子生物学和遗传学中也具有同样的潜力。然而,将这些方法应用于其当前形式忽略了生物系统内的进化依赖性,可能导致假阳性和错误的结论。我们开发了两种方法来解决机器学习模型中的进化相关性问题:(1)基因家族指导分割和(2)直系同源物对比。第一种方法通过限制模型训练和测试集来包含不同的基因家族,从而考虑进化。第二种方法使用进化信息在直系同源基因之间进行比较,既可以在训练过程中控制进化分歧,又可以利用进化分歧。这两种方法在 mRNA 表达水平预测的背景下进行了探索和验证,ROC 曲线下面积(auROC)值范围从 0.75 到 0.94。模型权重检查显示出具有生物学可解释性的模式,这导致了一个假设,即 3'UTR 对微调 mRNA 丰度水平更为重要,而 5'UTR 对大规模变化更为重要。