• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 L2,1-范数正则化的多元回归模型及其在基因组预测中的应用。

L2,1-norm regularized multivariate regression model with applications to genomic prediction.

机构信息

Systems Biology and Mathematical Modeling Group, Max Planck Institute of Molecular Plant Physiology, 14476 Potsdam-Golm, Germany.

Bioinformatics Group, Institute of Biochemistry and Biology, University of Potsdam, 14476 Potsdam-Golm, Germany.

出版信息

Bioinformatics. 2021 Sep 29;37(18):2896-2904. doi: 10.1093/bioinformatics/btab212.

DOI:10.1093/bioinformatics/btab212
PMID:33774677
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8479665/
Abstract

MOTIVATION

Genomic selection (GS) is currently deemed the most effective approach to speed up breeding of agricultural varieties. It has been recognized that consideration of multiple traits in GS can improve accuracy of prediction for traits of low heritability. However, since GS forgoes statistical testing with the idea of improving predictions, it does not facilitate mechanistic understanding of the contribution of particular single nucleotide polymorphisms (SNP).

RESULTS

Here, we propose a L2,1-norm regularized multivariate regression model and devise a fast and efficient iterative optimization algorithm, called L2,1-joint, applicable in multi-trait GS. The usage of the L2,1-norm facilitates variable selection in a penalized multivariate regression that considers the relation between individuals, when the number of SNPs is much larger than the number of individuals. The capacity for variable selection allows us to define master regulators that can be used in a multi-trait GS setting to dissect the genetic architecture of the analyzed traits. Our comparative analyses demonstrate that the proposed model is a favorable candidate compared to existing state-of-the-art approaches. Prediction and variable selection with datasets from Brassica napus, wheat and Arabidopsis thaliana diversity panels are conducted to further showcase the performance of the proposed model.

AVAILABILITY AND IMPLEMENTATION

: The model is implemented using R programming language and the code is freely available from https://github.com/alainmbebi/L21-norm-GS.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

基因组选择(GS)目前被认为是加速农业品种选育最有效的方法。人们已经认识到,在 GS 中考虑多个性状可以提高低遗传力性状预测的准确性。然而,由于 GS 放弃了统计检验的想法,以提高预测的准确性,它不利于对特定单核苷酸多态性(SNP)的贡献的机制理解。

结果

在这里,我们提出了一种 L2,1-范数正则化多变量回归模型,并设计了一种快速有效的迭代优化算法,称为 L2,1-联合,适用于多性状 GS。L2,1-范数的使用在考虑个体之间关系的惩罚多变量回归中促进了变量选择,当 SNP 的数量远远大于个体的数量时。变量选择的能力使我们能够定义主调节器,可用于多性状 GS 环境中,以剖析所分析性状的遗传结构。我们的比较分析表明,与现有的最先进的方法相比,所提出的模型是一个有吸引力的候选者。使用 Brassica napus、小麦和 Arabidopsis thaliana 多样性面板的数据集进行预测和变量选择,进一步展示了所提出模型的性能。

可用性和实现

该模型使用 R 编程语言实现,代码可从 https://github.com/alainmbebi/L21-norm-GS 免费获得。

补充信息

补充数据可在生物信息学在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8da7/8479665/4389a12f5610/btab212f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8da7/8479665/4389a12f5610/btab212f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8da7/8479665/4389a12f5610/btab212f1.jpg

相似文献

1
L2,1-norm regularized multivariate regression model with applications to genomic prediction.基于 L2,1-范数正则化的多元回归模型及其在基因组预测中的应用。
Bioinformatics. 2021 Sep 29;37(18):2896-2904. doi: 10.1093/bioinformatics/btab212.
2
A Comparative Study of Single-Trait and Multi-Trait Genomic Selection.单性状与多性状基因组选择的比较研究
J Comput Biol. 2019 Oct;26(10):1100-1112. doi: 10.1089/cmb.2019.0032. Epub 2019 Apr 17.
3
Regularized multi-trait multi-locus linear mixed models for genome-wide association studies and genomic selection in crops.作物全基因组关联研究和基因组选择的正则化多性状多基因座线性混合模型。
BMC Bioinformatics. 2023 Oct 26;24(1):399. doi: 10.1186/s12859-023-05519-2.
4
Genomic Prediction for Grain Yield and Yield-Related Traits in Chinese Winter Wheat.中国冬小麦产量及产量相关性状的基因组预测。
Int J Mol Sci. 2020 Feb 17;21(4):1342. doi: 10.3390/ijms21041342.
5
Genomic prediction of agronomic traits in wheat using different models and cross-validation designs.利用不同模型和交叉验证设计对小麦农艺性状进行基因组预测。
Theor Appl Genet. 2021 Jan;134(1):381-398. doi: 10.1007/s00122-020-03703-z. Epub 2020 Nov 1.
6
Genome-wide regression models considering general and specific combining ability predict hybrid performance in oilseed rape with similar accuracy regardless of trait architecture.考虑一般配合力和特殊配合力的全基因组回归模型,无论性状结构如何,都能以相似的准确性预测油菜的杂种表现。
Theor Appl Genet. 2018 Feb;131(2):299-317. doi: 10.1007/s00122-017-3002-5. Epub 2017 Oct 28.
7
The value of early-stage phenotyping for wheat breeding in the age of genomic selection.基因组选择时代小麦早期表型选择的价值。
Theor Appl Genet. 2020 Aug;133(8):2499-2520. doi: 10.1007/s00122-020-03613-0. Epub 2020 Jun 1.
8
Genomic Selection in Winter Wheat Breeding Using a Recommender Approach.利用推荐方法进行冬小麦育种中的基因组选择。
Genes (Basel). 2020 Jul 11;11(7):779. doi: 10.3390/genes11070779.
9
An efficient unified model for genome-wide association studies and genomic selection.一种用于全基因组关联研究和基因组选择的高效统一模型。
Genet Sel Evol. 2017 Aug 24;49(1):64. doi: 10.1186/s12711-017-0338-x.
10
Genomic selection for wheat traits and trait stability.基因组选择在小麦性状和性状稳定性方面的应用。
Theor Appl Genet. 2016 Sep;129(9):1697-710. doi: 10.1007/s00122-016-2733-z. Epub 2016 Jun 4.

引用本文的文献

1
External validation of the Oncotype DX breast cancer recurrence score nomogram and development and validation of a novel machine learning-based model to predict postoperative overall survival and guide adjuvant chemotherapy in ER positive, Her-2 negative breast cancer patients: a retrospective cohort study.Oncotype DX乳腺癌复发评分列线图的外部验证以及一种基于机器学习的新型模型的开发与验证,该模型用于预测雌激素受体(ER)阳性、人表皮生长因子受体2(Her-2)阴性乳腺癌患者的术后总生存期并指导辅助化疗:一项回顾性队列研究
Front Oncol. 2025 May 21;15:1586262. doi: 10.3389/fonc.2025.1586262. eCollection 2025.
2
Advances in multi-trait genomic prediction approaches: classification, comparative analysis, and perspectives.多性状基因组预测方法的进展:分类、比较分析及展望
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf211.
3

本文引用的文献

1
simplePHENOTYPES: SIMulation of pleiotropic, linked and epistatic phenotypes.简单表型:多效性、连锁和上位性表型的模拟。
BMC Bioinformatics. 2020 Oct 31;21(1):491. doi: 10.1186/s12859-020-03804-y.
2
A Comparative Study of Single-Trait and Multi-Trait Genomic Selection.单性状与多性状基因组选择的比较研究
J Comput Biol. 2019 Oct;26(10):1100-1112. doi: 10.1089/cmb.2019.0032. Epub 2019 Apr 17.
3
Evaluation of GBLUP, BayesB and elastic net for genomic prediction in Chinese Simmental beef cattle.GBLUP、BayesB 和弹性网络在西门塔尔牛肉用牛基因组预测中的评估。
Development and external validation of a machine learning-based model to predict postoperative recurrence in patients with duodenal adenocarcinoma: a multicenter, retrospective cohort study.基于机器学习的十二指肠腺癌患者术后复发预测模型的开发与外部验证:一项多中心回顾性队列研究
BMC Med. 2025 Feb 21;23(1):98. doi: 10.1186/s12916-025-03912-7.
4
A comparative analysis of genomic and phenomic predictions of growth-related traits in 3-way coffee hybrids.三种咖啡杂交种生长相关性状的基因组和表型预测的比较分析。
G3 (Bethesda). 2022 Aug 25;12(9). doi: 10.1093/g3journal/jkac170.
5
Bayesian multitrait kernel methods improve multienvironment genome-based prediction.贝叶斯多性状核方法可提高多环境基于基因组的预测。
G3 (Bethesda). 2022 Feb 4;12(2). doi: 10.1093/g3journal/jkab406.
PLoS One. 2019 Feb 28;14(2):e0210442. doi: 10.1371/journal.pone.0210442. eCollection 2019.
4
Genomic Prediction Using Multi-trait Weighted GBLUP Accounting for Heterogeneous Variances and Covariances Across the Genome.使用多性状加权基因组最佳线性无偏预测法进行基因组预测,该方法考虑了全基因组的异质方差和协方差。
G3 (Bethesda). 2018 Nov 6;8(11):3549-3558. doi: 10.1534/g3.118.200673.
5
The Calcium-Dependent Protein Kinases (CDPKs) and Their Roles in Plant Growth Regulation and Abiotic Stress Responses.钙依赖型蛋白激酶(CDPKs)及其在植物生长调控和非生物胁迫响应中的作用。
Int J Mol Sci. 2018 Jun 28;19(7):1900. doi: 10.3390/ijms19071900.
6
Genomic Prediction from Multiple-Trait Bayesian Regression Methods Using Mixture Priors.基于混合先验的多性状贝叶斯回归方法的基因组预测。
Genetics. 2018 May;209(1):89-103. doi: 10.1534/genetics.118.300650. Epub 2018 Mar 7.
7
SIP1, a novel SOS2 interaction protein, is involved in salt-stress tolerance in Arabidopsis.SIP1,一种新型 SOS2 互作蛋白,参与拟南芥的耐盐胁迫。
Plant Physiol Biochem. 2018 Mar;124:167-174. doi: 10.1016/j.plaphy.2018.01.018. Epub 2018 Feb 4.
8
Genomic Selection in Plant Breeding: Methods, Models, and Perspectives.基因组选择在植物育种中的应用:方法、模型与展望。
Trends Plant Sci. 2017 Nov;22(11):961-975. doi: 10.1016/j.tplants.2017.08.011. Epub 2017 Sep 28.
9
Omics-based hybrid prediction in maize.基于组学的玉米杂交预测
Theor Appl Genet. 2017 Sep;130(9):1927-1939. doi: 10.1007/s00122-017-2934-0. Epub 2017 Jun 24.
10
Combined Use of Genome-Wide Association Data and Correlation Networks Unravels Key Regulators of Primary Metabolism in Arabidopsis thaliana.全基因组关联数据与相关网络的联合使用揭示了拟南芥初级代谢的关键调控因子。
PLoS Genet. 2016 Oct 19;12(10):e1006363. doi: 10.1371/journal.pgen.1006363. eCollection 2016 Oct.