
Complex-Trait Prediction in the Era of Big Data.

Affiliations

Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, MI 48824, USA; Department of Statistics and Probability, Michigan State University, East Lansing, MI 48824, USA; Institute for Quantitative Health Science and Engineering, Michigan State University, East Lansing, MI 48824, USA.

Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, MI 48824, USA; Department of Statistics and Probability, Michigan State University, East Lansing, MI 48824, USA.

Publication information

Trends Genet. 2018 Oct;34(10):746-754. doi: 10.1016/j.tig.2018.07.004. Epub 2018 Aug 20.

DOI:10.1016/j.tig.2018.07.004
PMID:30139641
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6150788/
Abstract

Accurate prediction of complex traits requires using a large number of DNA variants. Advances in statistical and machine learning methodology enable the identification of complex patterns in high-dimensional settings. However, training these highly parameterized methods requires very large data sets. Until recently, such data sets were not available. But the situation is changing rapidly as very large biomedical data sets comprising individual genotype-phenotype data for hundreds of thousands of individuals become available in public and private domains. We argue that the convergence of advances in methodology and the advent of Big Genomic Data will enable unprecedented improvements in complex-trait prediction; we review theory and evidence supporting our claim and discuss challenges and opportunities that Big Data will bring to complex-trait prediction.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10b2/6150788/5733a1a17b9c/nihms-1500559-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10b2/6150788/b2b189761bdf/nihms-1500559-f0002.jpg

