• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于自适应套索的空间自回归模型进行遗传风险预测。

Genetic risk prediction using a spatial autoregressive model with adaptive lasso.

机构信息

Department of Statistics, University of Auckland, Auckland, New Zealand.

Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, Michigan, USA.

出版信息

Stat Med. 2018 Nov 20;37(26):3764-3775. doi: 10.1002/sim.7832. Epub 2018 May 31.

DOI:10.1002/sim.7832
PMID:29855063
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6492943/
Abstract

With rapidly evolving high-throughput technologies, studies are being initiated to accelerate the process toward precision medicine. The collection of the vast amounts of sequencing data provides us with great opportunities to systematically study the role of a deep catalog of sequencing variants in risk prediction. Nevertheless, the massive amount of noise signals and low frequencies of rare variants in sequencing data pose great analytical challenges on risk prediction modeling. Motivated by the development in spatial statistics, we propose a spatial autoregressive model with adaptive lasso (SARAL) for risk prediction modeling using high-dimensional sequencing data. The SARAL is a set-based approach, and thus, it reduces the data dimension and accumulates genetic effects within a single-nucleotide variant (SNV) set. Moreover, it allows different SNV sets having various magnitudes and directions of effect sizes, which reflects the nature of complex diseases. With the adaptive lasso implemented, SARAL can shrink the effects of noise SNV sets to be zero and, thus, further improve prediction accuracy. Through simulation studies, we demonstrate that, overall, SARAL is comparable to, if not better than, the genomic best linear unbiased prediction method. The method is further illustrated by an application to the sequencing data from the Alzheimer's Disease Neuroimaging Initiative.

摘要

随着高通量技术的快速发展,正在开展研究以加速精准医学的发展进程。大量测序数据的收集为我们提供了系统研究深度测序变体目录在风险预测中的作用的巨大机会。然而,测序数据中大量的噪声信号和罕见变体的低频对风险预测建模提出了巨大的分析挑战。受空间统计学发展的启发,我们提出了一种基于空间自回归模型的自适应套索(SARAL)方法,用于使用高维测序数据进行风险预测建模。SARAL 是一种基于集合的方法,因此可以降低数据维度,并在单个单核苷酸变体 (SNV) 集合内累积遗传效应。此外,它允许具有不同幅度和方向的效应大小的不同 SNV 集合,反映了复杂疾病的本质。通过实施自适应套索,SARAL 可以将噪声 SNV 集合的效应收缩为零,从而进一步提高预测准确性。通过模拟研究,我们证明 SARAL 总体上与基因组最佳线性无偏预测方法相当,如果不是更好的话。该方法通过对阿尔茨海默病神经影像学倡议的测序数据的应用进一步说明了这一点。

相似文献

1
Genetic risk prediction using a spatial autoregressive model with adaptive lasso.基于自适应套索的空间自回归模型进行遗传风险预测。
Stat Med. 2018 Nov 20;37(26):3764-3775. doi: 10.1002/sim.7832. Epub 2018 May 31.
2
Multikernel linear mixed model with adaptive lasso for complex phenotype prediction.多内核线性混合模型与自适应套索在复杂表型预测中的应用。
Stat Med. 2020 Apr 30;39(9):1311-1327. doi: 10.1002/sim.8477. Epub 2020 Jan 27.
3
Risk Prediction Modeling of Sequencing Data Using a Forward Random Field Method.使用前向随机场方法对测序数据进行风险预测建模。
Sci Rep. 2016 Feb 19;6:21120. doi: 10.1038/srep21120.
4
Multi-kernel linear mixed model with adaptive lasso for prediction analysis on high-dimensional multi-omics data.基于自适应套索的多核线性混合模型在高维多组学数据分析中的预测分析。
Bioinformatics. 2020 Mar 1;36(6):1785-1794. doi: 10.1093/bioinformatics/btz822.
5
IPF-LASSO: Integrative -Penalized Regression with Penalty Factors for Prediction Based on Multi-Omics Data.IPF-LASSO:基于多组学数据的带惩罚因子的整合惩罚回归用于预测
Comput Math Methods Med. 2017;2017:7691937. doi: 10.1155/2017/7691937. Epub 2017 May 4.
6
QQ-SNV: single nucleotide variant detection at low frequency by comparing the quality quantiles.QQ-SNV:通过比较质量分位数进行低频单核苷酸变异检测
BMC Bioinformatics. 2015 Nov 10;16:379. doi: 10.1186/s12859-015-0812-9.
7
Stratified polygenic risk prediction model with application to CAGI bipolar disorder sequencing data.应用于CAGI双相情感障碍测序数据的分层多基因风险预测模型。
Hum Mutat. 2017 Sep;38(9):1235-1239. doi: 10.1002/humu.23229. Epub 2017 Jun 13.
8
Robust estimation of the expected survival probabilities from high-dimensional Cox models with biomarker-by-treatment interactions in randomized clinical trials.在随机临床试验中,通过生物标志物与治疗的相互作用,从高维Cox模型中稳健估计预期生存概率。
BMC Med Res Methodol. 2017 May 22;17(1):83. doi: 10.1186/s12874-017-0354-0.
9
Adjusted adaptive Lasso for covariate model-building in nonlinear mixed-effect pharmacokinetic models.用于非线性混合效应药代动力学模型中协变量模型构建的调整自适应套索法
J Pharmacokinet Pharmacodyn. 2017 Feb;44(1):55-66. doi: 10.1007/s10928-017-9504-6. Epub 2017 Jan 31.
10
An Empirical Bayes risk prediction model using multiple traits for sequencing data.一种使用多性状的经验贝叶斯风险预测模型用于测序数据。
Stat Appl Genet Mol Biol. 2015 Dec;14(6):551-73. doi: 10.1515/sagmb-2015-0060.

引用本文的文献

1
Explainable deep transfer learning model for disease risk prediction using high-dimensional genomic data.基于高维基因组数据的疾病风险预测可解释深度迁移学习模型。
PLoS Comput Biol. 2022 Jul 15;18(7):e1010328. doi: 10.1371/journal.pcbi.1010328. eCollection 2022 Jul.
2
GWAS findings improved genomic prediction accuracy of lipid profile traits: Tehran Cardiometabolic Genetic Study.GWAS 研究结果提高了血脂谱特征的基因组预测准确性:德黑兰心脏代谢遗传研究。
Sci Rep. 2021 Mar 11;11(1):5780. doi: 10.1038/s41598-021-85203-8.

本文引用的文献

1
Genomic risk prediction of complex human disease and its clinical application.复杂人类疾病的基因组风险预测及其临床应用。
Curr Opin Genet Dev. 2015 Aug;33:10-6. doi: 10.1016/j.gde.2015.06.005. Epub 2015 Jul 24.
2
A new initiative on precision medicine.一项关于精准医学的新倡议。
N Engl J Med. 2015 Feb 26;372(9):793-5. doi: 10.1056/NEJMp1500523. Epub 2015 Jan 30.
3
MultiBLUP: improved SNP-based prediction for complex traits.MultiBLUP:基于单核苷酸多态性(SNP)的复杂性状预测方法的改进
Genome Res. 2014 Sep;24(9):1550-7. doi: 10.1101/gr.169375.113. Epub 2014 Jun 24.
4
Modeling and testing for joint association using a genetic random field model.使用遗传随机场模型进行联合关联的建模与测试。
Biometrics. 2014 Sep;70(3):471-9. doi: 10.1111/biom.12160. Epub 2014 Mar 13.
5
A variant within FGF1 is associated with Alzheimer's disease in the Han Chinese population.一个 FGF1 内的变异与汉族人群的阿尔茨海默病相关。
Am J Med Genet B Neuropsychiatr Genet. 2014 Mar;165B(2):131-6. doi: 10.1002/ajmg.b.32205. Epub 2014 Jan 24.
6
Nodes and biological processes identified on the basis of network analysis in the brain of the senescence accelerated mice as an Alzheimer's disease animal model.基于网络分析在衰老加速小鼠大脑中确定的与阿尔茨海默病动物模型相关的节点和生物过程。
Front Aging Neurosci. 2013 Oct 29;5:65. doi: 10.3389/fnagi.2013.00065. eCollection 2013.
7
Prediction of complex human traits using the genomic best linear unbiased predictor.利用基因组最佳线性无偏预测器预测复杂人类特征。
PLoS Genet. 2013;9(7):e1003608. doi: 10.1371/journal.pgen.1003608. Epub 2013 Jul 11.
8
The value of statistical or bioinformatics annotation for rare variant association with quantitative trait.统计或生物信息学注释对罕见变异与数量性状关联的价值。
Genet Epidemiol. 2013 Nov;37(7):666-74. doi: 10.1002/gepi.21747. Epub 2013 Jul 8.
9
Projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies.基于全基因组关联研究的多基因分析预测风险的性能。
Nat Genet. 2013 Apr;45(4):400-5, 405e1-3. doi: 10.1038/ng.2579. Epub 2013 Mar 3.
10
Genome-wide efficient mixed-model analysis for association studies.全基因组高效混合模型关联分析。
Nat Genet. 2012 Jun 17;44(7):821-4. doi: 10.1038/ng.2310.