• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于家庭测序数据的随机场方法风险预测建模

Risk Prediction Modeling on Family-Based Sequencing Data Using a Random Field Method.

作者信息

Wen Yalu, Burt Alexandra, Lu Qing

机构信息

Institute of Cancer Stem Cell, Dalian Medical University, Liaoning, 116044, China

Department of Statistics, University of Auckland, 1010, New Zealand.

出版信息

Genetics. 2017 Sep;207(1):63-73. doi: 10.1534/genetics.117.199752. Epub 2017 Jul 5.

DOI:10.1534/genetics.117.199752
PMID:28679544
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5586386/
Abstract

Family-based design is one of the most popular designs in genetic studies and has many unique features for risk-prediction research. It is robust against genetic heterogeneity, and the relatedness among family members can be informative for predicting an individual's risk for disease with polygenic and shared environmental components of risk. Despite these strengths, family-based designs have been used infrequently in current risk-prediction studies, and their related statistical methods have not been well developed. In this article, we developed a generalized random field (GRF) method for family-based risk-prediction modeling on sequencing data. In GRF, subjects' phenotypes are viewed as stochastic realizations of a random field in a space, and a subject's phenotype is predicted by adjacent subjects, where adjacencies between subjects are determined by their genetic and within-family similarities. Different from existing methods that adjust for familial correlations, the GRF uses this information to form surrogates to further improve prediction accuracy. It also uses within-family information to capture predictors (, rare mutations) that are homogeneous in families. Through simulations, we have demonstrated that the GRF method attained better performance than an existing method by considering additional information from family members and accounting for genetic heterogeneity. We further provided practical recommendations for designing family-based risk prediction studies. Finally, we illustrated the GRF method with an application to a whole-genome exome data set from the Michigan State University Twin Registry study.

摘要

基于家系的设计是基因研究中最常用的设计之一,对于风险预测研究具有许多独特的特征。它对基因异质性具有稳健性,家庭成员之间的亲缘关系对于预测个体患具有多基因和共享环境风险因素的疾病风险可能具有参考价值。尽管有这些优势,但基于家系的设计在当前的风险预测研究中使用并不频繁,其相关的统计方法也没有得到很好的发展。在本文中,我们开发了一种广义随机场(GRF)方法,用于对测序数据进行基于家系的风险预测建模。在GRF中,将受试者的表型视为空间中随机场的随机实现,并通过相邻受试者来预测某个受试者的表型,其中受试者之间的相邻关系由其基因和家庭内相似性确定。与现有的用于调整家族相关性的方法不同,GRF利用这些信息来形成替代指标,以进一步提高预测准确性。它还利用家庭内信息来捕获在家族中具有同质性的预测因子(如罕见突变)。通过模拟,我们证明了GRF方法通过考虑来自家庭成员的额外信息并考虑基因异质性,比现有方法具有更好的性能。我们还为设计基于家系的风险预测研究提供了实用建议。最后,我们通过应用密歇根州立大学双胞胎登记研究的全基因组外显子数据集来说明GRF方法。

相似文献

1
Risk Prediction Modeling on Family-Based Sequencing Data Using a Random Field Method.基于家庭测序数据的随机场方法风险预测建模
Genetics. 2017 Sep;207(1):63-73. doi: 10.1534/genetics.117.199752. Epub 2017 Jul 5.
2
Detecting Rare Mutations with Heterogeneous Effects Using a Family-Based Genetic Random Field Method.基于家系遗传随机场方法检测具有异质性效应的罕见突变。
Genetics. 2018 Oct;210(2):463-476. doi: 10.1534/genetics.118.301266. Epub 2018 Aug 13.
3
Single Marker Family-Based Association Analysis Not Conditional on Parental Information.基于单标记家系的关联分析,不依赖父母信息。
Methods Mol Biol. 2017;1666:409-439. doi: 10.1007/978-1-4939-7274-6_20.
4
An ensemble-based likelihood ratio approach for family-based genomic risk prediction.基于集成的似然比方法进行基于家系的基因组风险预测。
J Zhejiang Univ Sci B. 2018;19(12):935-947. doi: 10.1631/jzus.B1800162.
5
Risk Prediction Modeling of Sequencing Data Using a Forward Random Field Method.使用前向随机场方法对测序数据进行风险预测建模。
Sci Rep. 2016 Feb 19;6:21120. doi: 10.1038/srep21120.
6
Bayesian linear mixed model with multiple random effects for family-based genetic studies.用于基于家系的基因研究的具有多个随机效应的贝叶斯线性混合模型。
Front Genet. 2023 Oct 19;14:1267704. doi: 10.3389/fgene.2023.1267704. eCollection 2023.
7
Improving Disease Prediction by Incorporating Family Disease History in Risk Prediction Models with Large-Scale Genetic Data.利用大规模遗传数据在风险预测模型中纳入家族病史以改善疾病预测。
Genetics. 2017 Nov;207(3):1147-1155. doi: 10.1534/genetics.117.300283. Epub 2017 Sep 12.
8
Selecting cases and controls for DNA sequencing studies using family histories of disease.利用疾病家族史为DNA测序研究选择病例和对照。
Stat Med. 2017 Jun 15;36(13):2081-2099. doi: 10.1002/sim.7248. Epub 2017 Feb 21.
9
Design considerations for genetic linkage and association studies.基因连锁与关联研究的设计考量
Methods Mol Biol. 2012;850:237-62. doi: 10.1007/978-1-61779-555-8_13.
10
Multi-syndrome, multi-gene risk modeling for individuals with a family history of cancer with the novel R package PanelPRO.多症状、多基因风险模型构建用于具有癌症家族史的个体:新型 R 包 PanelPRO 的应用。
Elife. 2021 Aug 18;10:e68699. doi: 10.7554/eLife.68699.

引用本文的文献

1
Bayesian linear mixed model with multiple random effects for family-based genetic studies.用于基于家系的基因研究的具有多个随机效应的贝叶斯线性混合模型。
Front Genet. 2023 Oct 19;14:1267704. doi: 10.3389/fgene.2023.1267704. eCollection 2023.
2
An ensemble-based likelihood ratio approach for family-based genomic risk prediction.基于集成的似然比方法进行基于家系的基因组风险预测。
J Zhejiang Univ Sci B. 2018;19(12):935-947. doi: 10.1631/jzus.B1800162.

本文引用的文献

1
A Clustered Multiclass Likelihood-Ratio Ensemble Method for Family-Based Association Analysis Accounting for Phenotypic Heterogeneity.一种用于基于家系的关联分析的聚类多类似然比集成方法,该方法考虑了表型异质性。
Genet Epidemiol. 2016 Sep;40(6):512-9. doi: 10.1002/gepi.21987. Epub 2016 Jun 19.
2
Rare variant in scavenger receptor BI raises HDL cholesterol and increases risk of coronary heart disease.清道夫受体BI中的罕见变异会升高高密度脂蛋白胆固醇并增加冠心病风险。
Science. 2016 Mar 11;351(6278):1166-71. doi: 10.1126/science.aad3517.
3
Risk Prediction Modeling of Sequencing Data Using a Forward Random Field Method.使用前向随机场方法对测序数据进行风险预测建模。
Sci Rep. 2016 Feb 19;6:21120. doi: 10.1038/srep21120.
4
Genomic risk prediction of complex human disease and its clinical application.复杂人类疾病的基因组风险预测及其临床应用。
Curr Opin Genet Dev. 2015 Aug;33:10-6. doi: 10.1016/j.gde.2015.06.005. Epub 2015 Jul 24.
5
A new initiative on precision medicine.一项关于精准医学的新倡议。
N Engl J Med. 2015 Feb 26;372(9):793-5. doi: 10.1056/NEJMp1500523. Epub 2015 Jan 30.
6
The contribution of rare and common variants in 30 genes to risk nicotine dependence.30个基因中的罕见和常见变异对尼古丁依赖风险的贡献。
Mol Psychiatry. 2015 Nov;20(11):1467-78. doi: 10.1038/mp.2014.156. Epub 2014 Dec 2.
7
Effective genetic-risk prediction using mixed models.使用混合模型进行有效的遗传风险预测。
Am J Hum Genet. 2014 Oct 2;95(4):383-93. doi: 10.1016/j.ajhg.2014.09.007.
8
MultiBLUP: improved SNP-based prediction for complex traits.MultiBLUP:基于单核苷酸多态性(SNP)的复杂性状预测方法的改进
Genome Res. 2014 Sep;24(9):1550-7. doi: 10.1101/gr.169375.113. Epub 2014 Jun 24.
9
FFBSKAT: fast family-based sequence kernel association test.FFBSKAT:基于家系的快速序列核关联检验
PLoS One. 2014 Jun 6;9(6):e99407. doi: 10.1371/journal.pone.0099407. eCollection 2014.
10
Poly-omic prediction of complex traits: OmicKriging.复杂性状的多组学预测:全基因组克里金法
Genet Epidemiol. 2014 Jul;38(5):402-15. doi: 10.1002/gepi.21808. Epub 2014 May 2.