
AdaptiveGS: an explainable genomic selection framework based on adaptive stacking ensemble machine learning.

Authors

Yang Zhen, Song Mei, Huang Xianggeng, Rao Quanrui, Zhang Shanghui, Zhang Zhongzheng, Wang Chenyang, Li Wenjia, Qin Ran, Zhao Chunhua, Wu Yongzhen, Sun Han, Liu Guangchen, Cui Fa

Affiliations

School of Mathematics and Statistics, Ludong University, Yantai, 264025, Shandong, China.

School of Information and Electrical Engineering, Ludong University, Yantai, 264025, Shandong, China.

Publication information

Theor Appl Genet. 2025 Aug 7;138(9):204. doi: 10.1007/s00122-025-04991-z.

DOI:10.1007/s00122-025-04991-z
PMID:40772967
Abstract

We developed an adaptive and unified stacking genomic selection framework and designed a model interpretation strategy to identify candidate significant SNPs for target traits. Genomic selection (GS) is an important technique in modern molecular breeding. Stacking ensemble learning (SEL), a powerful machine learning (ML) approach to GS, combines multiple base models (base learners, BLs) and blends their strengths to capture the complex relationships between phenotypes and genotypes. However, the key step of SEL, the selection of BLs, currently lacks an effective and unified framework. We developed adaptiveGS, the first adaptive and explainable data-driven BL selection strategy, to pre-screen the optimal BLs for a stacking GS framework and improve prediction accuracy. adaptiveGS ranks candidate models by the PR index, which combines the Pearson correlation coefficient (PCC) and the normalized root mean square error (NRMSE), and takes the top 3 of 7 (or user-defined) ML models as BLs. We compared adaptiveGS with 13 other GS algorithms on 21 traits (datasets) from 4 species. adaptiveGS outperformed the 13 models on most of the 21 traits, reaching an average prediction accuracy (PCC) of 0.703, an average improvement of 14.4%, and demonstrating superior predictive accuracy and robustness. Furthermore, we used the SHapley Additive exPlanations (SHAP) technique to interpret adaptiveGS and to identify significant SNPs for trait variation as well as potential interaction effects between SNPs. adaptiveGS provides an operable and unified solution for stacking GS users to improve prediction accuracy in breeding. The adaptiveGS package is available at https://github.com/yangzhen0117/adaptiveGS .
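
The base-learner pre-screening step described in the abstract can be sketched roughly as follows. This is an illustration only, not the adaptiveGS package's code. The abstract does not give the formula for the PR index, so the sketch assumes a placeholder score of PCC minus NRMSE, with NRMSE normalized by the range of the observed values. The function names (`pearson`, `nrmse`, `pr_index`, `select_base_learners`) are hypothetical, and the predictions are assumed to come from cross-validation of each candidate model.

```python
import math


def pearson(y_true, y_pred):
    """Pearson correlation coefficient (PCC) between observed and predicted values."""
    n = len(y_true)
    mean_t = sum(y_true) / n
    mean_p = sum(y_pred) / n
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(y_true, y_pred))
    var_t = sum((t - mean_t) ** 2 for t in y_true)
    var_p = sum((p - mean_p) ** 2 for p in y_pred)
    return cov / math.sqrt(var_t * var_p)


def nrmse(y_true, y_pred):
    """RMSE divided by the range of the observed values.

    ASSUMPTION: range normalization; the abstract does not say which
    normalization the authors use.
    """
    mse = sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)
    return math.sqrt(mse) / (max(y_true) - min(y_true))


def pr_index(y_true, y_pred):
    """Placeholder PR index: higher PCC and lower NRMSE give a higher score.

    ASSUMPTION: PCC - NRMSE is a stand-in, not the definition from the paper.
    """
    return pearson(y_true, y_pred) - nrmse(y_true, y_pred)


def select_base_learners(y_true, predictions, k=3):
    """Rank candidate models by PR index and return the names of the top k.

    `predictions` maps a model name to its (assumed cross-validated)
    predictions for the same samples as `y_true`.
    """
    ranked = sorted(
        predictions,
        key=lambda name: pr_index(y_true, predictions[name]),
        reverse=True,
    )
    return ranked[:k]
```

With seven candidate models in `predictions` and `k=3`, the returned names would be the ones passed on as base learners to the stacking step. Swapping in the paper's actual PR index only requires replacing the body of `pr_index`.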

Similar articles

1
AdaptiveGS: an explainable genomic selection framework based on adaptive stacking ensemble machine learning.
Theor Appl Genet. 2025 Aug 7;138(9):204. doi: 10.1007/s00122-025-04991-z.
2
Breeding perspectives on tackling trait genome-to-phenome (G2P) dimensionality using ensemble-based genomic prediction.
Theor Appl Genet. 2025 Jul 4;138(7):172. doi: 10.1007/s00122-025-04960-6.
3
Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.
J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.
4
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
5
Does the Presence of Missing Data Affect the Performance of the SORG Machine-learning Algorithm for Patients With Spinal Metastasis? Development of an Internet Application Algorithm.
Clin Orthop Relat Res. 2024 Jan 1;482(1):143-157. doi: 10.1097/CORR.0000000000002706. Epub 2023 Jun 12.
6
Approaches for predicting dairy cattle methane emissions: from traditional methods to machine learning.
J Anim Sci. 2024 Jan 3;102. doi: 10.1093/jas/skae219.
7
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
8
A Framework Integrating GWAS and Genomic Selection to Enhance Prediction Accuracy of Economical Traits in Common Carp.
Int J Mol Sci. 2025 Jul 21;26(14):7009. doi: 10.3390/ijms26147009.
9
Heritability estimates and genome-wide association study of methane emission traits in Nellore cattle.
J Anim Sci. 2024 Jan 3;102. doi: 10.1093/jas/skae182.
10
Impact of genomic selection for growth and carcass traits on foot structure in Angus cattle.
J Anim Sci. 2025 Jan 4;103. doi: 10.1093/jas/skaf158.