• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

高维特征选择的应用:人类基因组预测评估

Application of high-dimensional feature selection: evaluation for genomic prediction in man.

作者信息

Bermingham M L, Pong-Wong R, Spiliopoulou A, Hayward C, Rudan I, Campbell H, Wright A F, Wilson J F, Agakov F, Navarro P, Haley C S

机构信息

MRC Human Genetics Unit, MRC Institute of Genetics and Molecular Medicine, University of Edinburgh.

The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh.

出版信息

Sci Rep. 2015 May 19;5:10312. doi: 10.1038/srep10312.

DOI:10.1038/srep10312
PMID:25988841
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4437376/
Abstract

In this study, we investigated the effect of five feature selection approaches on the performance of a mixed model (G-BLUP) and a Bayesian (Bayes C) prediction method. We predicted height, high density lipoprotein cholesterol (HDL) and body mass index (BMI) within 2,186 Croatian and into 810 UK individuals using genome-wide SNP data. Using all SNP information Bayes C and G-BLUP had similar predictive performance across all traits within the Croatian data, and for the highly polygenic traits height and BMI when predicting into the UK data. Bayes C outperformed G-BLUP in the prediction of HDL, which is influenced by loci of moderate size, in the UK data. Supervised feature selection of a SNP subset in the G-BLUP framework provided a flexible, generalisable and computationally efficient alternative to Bayes C; but careful evaluation of predictive performance is required when supervised feature selection has been used.

摘要

在本研究中,我们调查了五种特征选择方法对混合模型(G-BLUP)和贝叶斯(Bayes C)预测方法性能的影响。我们使用全基因组SNP数据,对2186名克罗地亚人和810名英国个体的身高、高密度脂蛋白胆固醇(HDL)和体重指数(BMI)进行了预测。在克罗地亚数据中,使用所有SNP信息时,Bayes C和G-BLUP在所有性状上具有相似的预测性能;在对英国数据进行预测时,对于高度多基因性状身高和BMI也是如此。在英国数据中,对于受中等大小基因座影响的HDL,Bayes C在预测方面优于G-BLUP。在G-BLUP框架中对SNP子集进行监督特征选择,为Bayes C提供了一种灵活、可推广且计算高效的替代方法;但在使用监督特征选择时,需要仔细评估预测性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93c8/4437376/a905d6f79518/srep10312-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93c8/4437376/e5d98f8f27cf/srep10312-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93c8/4437376/09669fb40c55/srep10312-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93c8/4437376/a905d6f79518/srep10312-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93c8/4437376/e5d98f8f27cf/srep10312-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93c8/4437376/09669fb40c55/srep10312-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93c8/4437376/a905d6f79518/srep10312-f3.jpg

相似文献

1
Application of high-dimensional feature selection: evaluation for genomic prediction in man.高维特征选择的应用:人类基因组预测评估
Sci Rep. 2015 May 19;5:10312. doi: 10.1038/srep10312.
2
Genomic prediction of breeding values using previously estimated SNP variances.利用先前估计的单核苷酸多态性(SNP)方差进行育种值的基因组预测。
Genet Sel Evol. 2014 Sep 25;46(1):52. doi: 10.1186/s12711-014-0052-x.
3
Simultaneous fitting of genomic-BLUP and Bayes-C components in a genomic prediction model.在基因组预测模型中同时拟合基因组最佳线性无偏预测(genomic-BLUP)和贝叶斯C(Bayes-C)成分
Genet Sel Evol. 2017 Aug 24;49(1):63. doi: 10.1186/s12711-017-0339-9.
4
Performance of Bayesian and BLUP alphabets for genomic prediction: analysis, comparison and results.贝叶斯和 BLUP 字母在基因组预测中的性能:分析、比较和结果。
Heredity (Edinb). 2022 Jun;128(6):519-530. doi: 10.1038/s41437-022-00539-9. Epub 2022 May 4.
5
A comparison of five methods to predict genomic breeding values of dairy bulls from genome-wide SNP markers.比较五种方法从全基因组 SNP 标记预测奶牛公牛的基因组育种值。
Genet Sel Evol. 2009 Dec 31;41(1):56. doi: 10.1186/1297-9686-41-56.
6
Using markers with large effect in genetic and genomic predictions.在遗传和基因组预测中使用具有大效应的标记。
J Anim Sci. 2017 Jan;95(1):59-71. doi: 10.2527/jas.2016.0754.
7
Quantitative trait loci markers derived from whole genome sequence data increases the reliability of genomic prediction.源自全基因组序列数据的数量性状位点标记提高了基因组预测的可靠性。
J Dairy Sci. 2015 Jun;98(6):4107-16. doi: 10.3168/jds.2014-9005. Epub 2015 Apr 16.
8
Accuracy of whole-genome prediction using a genetic architecture-enhanced variance-covariance matrix.使用遗传结构增强的方差协方差矩阵进行全基因组预测的准确性。
G3 (Bethesda). 2015 Feb 9;5(4):615-27. doi: 10.1534/g3.114.016261.
9
Different models of genetic variation and their effect on genomic evaluation.不同遗传变异模型及其对基因组评估的影响。
Genet Sel Evol. 2011 May 17;43(1):18. doi: 10.1186/1297-9686-43-18.
10
Genomic predictions can accelerate selection for resistance against Piscirickettsia salmonis in Atlantic salmon (Salmo salar).基因组预测可以加速大西洋鲑(Salmo salar)对鲑鱼立克次氏体抗性的选育。
BMC Genomics. 2017 Jan 31;18(1):121. doi: 10.1186/s12864-017-3487-y.

引用本文的文献

1
Core Perturbomes of and Using a Machine Learning Approach.使用机器学习方法的[具体研究对象1]和[具体研究对象2]的核心扰动组。 (你原文中“of and ”表述不完整,这里是根据常见情况补充后翻译的,你可根据实际调整。)
Pathogens. 2025 Aug 7;14(8):788. doi: 10.3390/pathogens14080788.
2
Monitoring pilots' mental workload in real flight conditions using multinomial logistic regression with a ridge estimator.在实际飞行条件下使用带有岭估计器的多项逻辑回归监测飞行员的心理负荷。
Front Robot AI. 2025 Apr 24;12:1441801. doi: 10.3389/frobt.2025.1441801. eCollection 2025.
3
Exploring genomic feature selection: A comparative analysis of GWAS and machine learning algorithms in a large-scale soybean dataset.

本文引用的文献

1
Effectiveness of shrinkage and variable selection methods for the prediction of complex human traits using data from distantly related individuals.使用远亲个体数据的收缩和变量选择方法对复杂人类性状进行预测的有效性。
Ann Hum Genet. 2015 Mar;79(2):122-35. doi: 10.1111/ahg.12099. Epub 2015 Jan 20.
2
Special issues on advances in quantitative genetics: introduction.数量遗传学进展专题:引言
Heredity (Edinb). 2014 Jan;112(1):1-3. doi: 10.1038/hdy.2013.115.
3
Inference of the genetic architecture underlying BMI and height with the use of 20,240 sibling pairs.
探索基因组特征选择:大规模大豆数据集中全基因组关联研究(GWAS)与机器学习算法的比较分析
Plant Genome. 2025 Mar;18(1):e20503. doi: 10.1002/tpg2.20503. Epub 2024 Sep 10.
4
Identification of Psychological Treatment Dropout Predictors Using Machine Learning Models on Italian Patients Living with Overweight and Obesity Ineligible for Bariatric Surgery.使用机器学习模型识别意大利超重和肥胖且不符合减重手术条件的患者心理治疗脱落的预测因子。
Nutrients. 2024 Aug 8;16(16):2605. doi: 10.3390/nu16162605.
5
Feature engineering of environmental covariates improves plant genomic-enabled prediction.环境协变量的特征工程改进了基于植物基因组的预测。
Front Plant Sci. 2024 May 15;15:1349569. doi: 10.3389/fpls.2024.1349569. eCollection 2024.
6
Reservoir temperature prediction based on characterization of water chemistry data-case study of western Anatolia, Turkey.基于水化学数据特征的储层温度预测——以土耳其安纳托利亚西部为例
Sci Rep. 2024 May 6;14(1):10339. doi: 10.1038/s41598-024-59409-5.
7
Enhancing prediction accuracy of coronary artery disease through machine learning-driven genomic variant selection.通过机器学习驱动的基因组变异选择提高冠状动脉疾病预测准确性。
J Transl Med. 2024 Apr 16;22(1):356. doi: 10.1186/s12967-024-05090-1.
8
Selective Genotyping and Phenotyping for Optimization of Genomic Prediction Models for Populations with Different Diversity.针对不同多样性群体优化基因组预测模型的选择性基因分型和表型分析。
Plants (Basel). 2024 Mar 28;13(7):975. doi: 10.3390/plants13070975.
9
An enhanced and efficient approach for feature selection for chronic human disease prediction: A breast cancer study.一种用于慢性人类疾病预测的特征选择的增强型高效方法:一项乳腺癌研究。
Heliyon. 2024 Feb 28;10(5):e26799. doi: 10.1016/j.heliyon.2024.e26799. eCollection 2024 Mar 15.
10
Deep self-supervised machine learning algorithms with a novel feature elimination and selection approaches for blood test-based multi-dimensional health risks classification.基于血液检测的多维健康风险分类的新型特征消除和选择方法的深度自我监督机器学习算法。
BMC Bioinformatics. 2024 Mar 8;25(1):103. doi: 10.1186/s12859-024-05729-2.
利用 20240 对兄弟姐妹对推断 BMI 和身高的遗传结构。
Am J Hum Genet. 2013 Nov 7;93(5):865-75. doi: 10.1016/j.ajhg.2013.10.005. Epub 2013 Oct 31.
4
Prediction of complex human traits using the genomic best linear unbiased predictor.利用基因组最佳线性无偏预测器预测复杂人类特征。
PLoS Genet. 2013;9(7):e1003608. doi: 10.1371/journal.pgen.1003608. Epub 2013 Jul 11.
5
Pitfalls of predicting complex traits from SNPs.从单核苷酸多态性预测复杂性状的陷阱。
Nat Rev Genet. 2013 Jul;14(7):507-15. doi: 10.1038/nrg3457.
6
Genomic BLUP decoded: a look into the black box of genomic prediction.基因组 BLUP 解码:探索基因组预测的黑箱。
Genetics. 2013 Jul;194(3):597-607. doi: 10.1534/genetics.113.152207. Epub 2013 May 2.
7
Genomic prediction in CIMMYT maize and wheat breeding programs.CIMMYT 玉米和小麦育种计划中的基因组预测。
Heredity (Edinb). 2014 Jan;112(1):48-60. doi: 10.1038/hdy.2013.16. Epub 2013 Apr 10.
8
Phenotype prediction from genome-wide association studies: application to smoking behaviors.基于全基因组关联研究的表型预测:在吸烟行为中的应用
BMC Syst Biol. 2012;6 Suppl 2(Suppl 2):S11. doi: 10.1186/1752-0509-6-S2-S11. Epub 2012 Dec 12.
9
Whole-genome regression and prediction methods applied to plant and animal breeding.全基因组回归和预测方法在动植物育种中的应用。
Genetics. 2013 Feb;193(2):327-45. doi: 10.1534/genetics.112.143313. Epub 2012 Jun 28.
10
Effect of the prior distribution of SNP effects on the estimation of total breeding value.单核苷酸多态性效应的先验分布对总育种值估计的影响。
BMC Proc. 2012 May 21;6 Suppl 2(Suppl 2):S6. doi: 10.1186/1753-6561-6-S2-S6.