• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

深度注释:一种基于深度学习的新型可解释基因组选择模型,该模型整合了全面的功能注释。

DeepAnnotation: A novel interpretable deep learning-based genomic selection model that integrates comprehensive functional annotations.

作者信息

Ma Wenlong, Zheng Weigang, Qin Shenghua, Wang Chao, Lei Bowen, Liu Yuwen

机构信息

State Key Laboratory of Genome and Multi-omics Technologies, Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Livestock and Poultry Multi-Omics of MARA, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China.

Innovation Group of Pig Genome Design and Breeding, Research Centre for Animal Genome, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China.

出版信息

Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf083.

DOI:10.1093/gigascience/giaf083
PMID:40874866
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12392413/
Abstract

BACKGROUND

Genomic selection, which leverages genomic information to predict the breeding value of individuals, has dramatically accelerated the improvement of economically important traits. The growing availability of multiomics data in agricultural species offers an unprecedented opportunity to enrich this process with prior biological knowledge. However, fully harnessing these rich data sources for accurate phenotype prediction in genomic selection remains in its early stages.

RESULTS

In this study, we present DeepAnnotation, a novel interpretable genomic selection model designed for phenotype prediction by integrating comprehensive multiomics functional annotations using deep learning. To capture the complex information flow from genotype to phenotype, DeepAnnotation aligns multiomics biological annotations with sequential network layers in a deep learning architecture, mirroring the natural regulatory cascade from genotype to intermediate molecular phenotypes-such as cis-regulatory elements, genes, and gene modules-and ultimately to phenotypes of economic traits. Comparing against 7 classical models (rrBLUP, LightGBM, KAML, BLUP, BayesR, MBLUP, and BayesRC), DeepAnnotation demonstrated significantly superior prediction accuracy (Pearson correlation coefficient increased by 6.4% to 120.0%) and computational efficiency for 3 pork production traits (lean meat percentage, loin muscle depth, and back fat thickness) using a dataset of 1,700 training Duroc boars and 240 independent validation individuals, each genotyped for 11,633,164 single-nucleotide polymorphisms (SNPs), particularly in identifying top-performing individuals. Furthermore, the interpretability embedded within our framework enables the identification of potential causal SNPs and the exploration of their mediated molecular mechanisms underlying trait variation.

CONCLUSIONS

DeepAnnotation is an open-source, interpretable deep learning approach for phenotype prediction, leveraging comprehensive multiomics functional annotations. Freely accessible via GitHub and Docker, it provides a valuable tool for researchers and practitioners in genomic selection.

摘要

背景

基因组选择利用基因组信息预测个体的育种值,极大地加速了经济重要性状的改良。农业物种中多组学数据的日益丰富为利用先验生物学知识丰富这一过程提供了前所未有的机会。然而,在基因组选择中充分利用这些丰富的数据源进行准确的表型预测仍处于早期阶段。

结果

在本研究中,我们提出了DeepAnnotation,这是一种新颖的可解释基因组选择模型,旨在通过深度学习整合全面的多组学功能注释来进行表型预测。为了捕捉从基因型到表型的复杂信息流,DeepAnnotation将多组学生物学注释与深度学习架构中的顺序网络层对齐,反映了从基因型到中间分子表型(如顺式调控元件、基因和基因模块),最终到经济性状表型的自然调控级联。与7种经典模型(rrBLUP、LightGBM、KAML、BLUP、BayesR、MBLUP和BayesRC)相比,使用1700头杜洛克种公猪的训练数据集和240个独立验证个体(每个个体针对11,633,164个单核苷酸多态性(SNP)进行基因分型),DeepAnnotation在预测3个猪肉生产性状(瘦肉率、腰大肌深度和背膘厚度)时表现出显著更高的预测准确性(皮尔逊相关系数提高了6.4%至120.0%)和计算效率,特别是在识别表现最佳的个体方面。此外,我们框架中嵌入的可解释性能够识别潜在的因果SNP,并探索其介导的性状变异分子机制。

结论

DeepAnnotation是一种用于表型预测的开源、可解释的深度学习方法,利用了全面的多组学功能注释。通过GitHub和Docker可免费获取,它为基因组选择的研究人员和从业者提供了一个有价值的工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9216/12392413/e4298329c1c8/giaf083fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9216/12392413/4c86886a7a15/giaf083fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9216/12392413/e23da49e13d8/giaf083fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9216/12392413/f8f995dcbf3f/giaf083fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9216/12392413/587beb6db7b4/giaf083fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9216/12392413/b74dbf61f636/giaf083fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9216/12392413/e4298329c1c8/giaf083fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9216/12392413/4c86886a7a15/giaf083fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9216/12392413/e23da49e13d8/giaf083fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9216/12392413/f8f995dcbf3f/giaf083fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9216/12392413/587beb6db7b4/giaf083fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9216/12392413/b74dbf61f636/giaf083fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9216/12392413/e4298329c1c8/giaf083fig6.jpg

相似文献

1
DeepAnnotation: A novel interpretable deep learning-based genomic selection model that integrates comprehensive functional annotations.深度注释:一种基于深度学习的新型可解释基因组选择模型,该模型整合了全面的功能注释。
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf083.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
iACP-DPNet: a dual-pooling causal dilated convolutional network for interpretable anticancer peptide identification.iACP-DPNet:一种用于可解释抗癌肽识别的双池因果扩张卷积网络。
Funct Integr Genomics. 2025 Jul 4;25(1):147. doi: 10.1007/s10142-025-01641-x.
4
GPS: Harnessing data fusion strategies to improve the accuracy of machine learning-based genomic and phenotypic selection.GPS:利用数据融合策略提高基于机器学习的基因组和表型选择的准确性。
Plant Commun. 2025 Aug 11;6(8):101416. doi: 10.1016/j.xplc.2025.101416. Epub 2025 Jun 11.
5
Heritability estimates and genome-wide association study of methane emission traits in Nellore cattle.内罗尔牛甲烷排放性状的遗传力估计和全基因组关联研究。
J Anim Sci. 2024 Jan 3;102. doi: 10.1093/jas/skae182.
6
Deep Genomics: Deep Learning-Based Analysis of Genome-Sequenced Data for Identification of Gene Alterations.深度基因组学:基于深度学习的基因组测序数据分析以识别基因改变
Methods Mol Biol. 2025;2952:335-367. doi: 10.1007/978-1-0716-4690-8_20.
7
AdaptiveGS: an explainable genomic selection framework based on adaptive stacking ensemble machine learning.AdaptiveGS:一种基于自适应堆叠集成机器学习的可解释基因组选择框架。
Theor Appl Genet. 2025 Aug 7;138(9):204. doi: 10.1007/s00122-025-04991-z.
8
Enhancing Gene Expression Predictions Using Deep Learning and Functional Annotations.利用深度学习和功能注释增强基因表达预测
Genet Epidemiol. 2025 Jan;49(1):e22595. doi: 10.1002/gepi.22595. Epub 2024 Sep 30.
9
Genetic determinants of testicular sperm extraction outcomes: insights from a large multicentre study of men with non-obstructive azoospermia.睾丸精子提取结果的遗传决定因素:来自一项针对非梗阻性无精子症男性的大型多中心研究的见解
Hum Reprod Open. 2025 Aug 29;2025(3):hoaf049. doi: 10.1093/hropen/hoaf049. eCollection 2025.
10
Predicting cognitive decline: Deep-learning reveals subtle brain changes in pre-MCI stage.预测认知衰退:深度学习揭示轻度认知障碍前阶段大脑的细微变化。
J Prev Alzheimers Dis. 2025 May;12(5):100079. doi: 10.1016/j.tjpad.2025.100079. Epub 2025 Feb 6.

本文引用的文献

1
Disease prediction with multi-omics and biomarkers empowers case-control genetic discoveries in the UK Biobank.多组学和生物标志物的疾病预测使英国生物库中的病例对照遗传发现成为可能。
Nat Genet. 2024 Sep;56(9):1821-1831. doi: 10.1038/s41588-024-01898-1. Epub 2024 Sep 11.
2
KDM5 family as therapeutic targets in breast cancer: Pathogenesis and therapeutic opportunities and challenges.KDM5 家族作为乳腺癌的治疗靶点:发病机制及治疗机会与挑战。
Mol Cancer. 2024 May 20;23(1):109. doi: 10.1186/s12943-024-02011-0.
3
Constructing eRNA-mediated gene regulatory networks to explore the genetic basis of muscle and fat-relevant traits in pigs.
构建 eRNA 介导的基因调控网络,探索猪肌肉和脂肪相关性状的遗传基础。
Genet Sel Evol. 2024 Apr 9;56(1):28. doi: 10.1186/s12711-024-00897-4.
4
PigBiobank: a valuable resource for understanding genetic and biological mechanisms of diverse complex traits in pigs.猪生物银行:一个有价值的资源,用于了解猪的多种复杂性状的遗传和生物学机制。
Nucleic Acids Res. 2024 Jan 5;52(D1):D980-D989. doi: 10.1093/nar/gkad1080.
5
SIGNET: transcriptome-wide causal inference for gene regulatory networks.SIGNET:基因调控网络的转录组范围因果推断。
Sci Rep. 2023 Nov 8;13(1):19371. doi: 10.1038/s41598-023-46295-6.
6
Gene expression and RNA splicing explain large proportions of the heritability for complex traits in cattle.基因表达和RNA剪接解释了牛复杂性状遗传力的很大一部分。
Cell Genom. 2023 Aug 23;3(10):100385. doi: 10.1016/j.xgen.2023.100385. eCollection 2023 Oct 11.
7
Hypothesis-free phenotype prediction within a genetics-first framework.基于先验遗传学框架的无假设表型预测。
Nat Commun. 2023 Feb 17;14(1):919. doi: 10.1038/s41467-023-36634-6.
8
A comprehensive investigation of statistical and machine learning approaches for predicting complex human diseases on genomic variants.对用于基于基因组变异预测复杂人类疾病的统计和机器学习方法的全面研究。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac552.
9
Drawing a line between histone demethylase KDM5A and KDM5B: their roles in development and tumorigenesis.在组蛋白去甲基化酶 KDM5A 和 KDM5B 之间划清界限:它们在发育和肿瘤发生中的作用。
Exp Mol Med. 2022 Dec;54(12):2107-2117. doi: 10.1038/s12276-022-00902-0. Epub 2022 Dec 12.
10
How to estimate heritability: a guide for genetic epidemiologists.如何估计遗传率:遗传流行病学家指南。
Int J Epidemiol. 2023 Apr 19;52(2):624-632. doi: 10.1093/ije/dyac224.