• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

全基因组关联研究中连锁不平衡的考量:一种惩罚回归方法。

Accounting for linkage disequilibrium in genome-wide association studies: A penalized regression method.

作者信息

Liu Jin, Wang Kai, Ma Shuangge, Huang Jian

机构信息

School of Public Health, Yale University, New Haven, CT 06520, USA.

Department of Biostatistics, University of Iowa, Iowa City, IA 52242, USA.

出版信息

Stat Interface. 2013 Jan 1;6(1):99-115. doi: 10.4310/SII.2013.v6.n1.a10.

DOI:10.4310/SII.2013.v6.n1.a10
PMID:25258655
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4172344/
Abstract

Penalized regression methods are becoming increasingly popular in genome-wide association studies (GWAS) for identifying genetic markers associated with disease. However, standard penalized methods such as LASSO do not take into account the possible linkage disequilibrium between adjacent markers. We propose a novel penalized approach for GWAS using a dense set of single nucleotide polymorphisms (SNPs). The proposed method uses the minimax concave penalty (MCP) for marker selection and incorporates linkage disequilibrium (LD) information by penalizing the difference of the genetic effects at adjacent SNPs with high correlation. A coordinate descent algorithm is derived to implement the proposed method. This algorithm is efficient in dealing with a large number of SNPs. A multi-split method is used to calculate the -values of the selected SNPs for assessing their significance. We refer to the proposed penalty function as the smoothed MCP and the proposed approach as the SMCP method. Performance of the proposed SMCP method and its comparison with LASSO and MCP approaches are evaluated through simulation studies, which demonstrate that the proposed method is more accurate in selecting associated SNPs. Its applicability to real data is illustrated using heterogeneous stock mice data and a rheumatoid arthritis.

摘要

惩罚回归方法在全基因组关联研究(GWAS)中越来越受欢迎,用于识别与疾病相关的遗传标记。然而,诸如LASSO等标准惩罚方法没有考虑相邻标记之间可能存在的连锁不平衡。我们提出了一种使用密集单核苷酸多态性(SNP)集的新型GWAS惩罚方法。所提出的方法使用极小极大凹惩罚(MCP)进行标记选择,并通过惩罚具有高相关性的相邻SNP的遗传效应差异来纳入连锁不平衡(LD)信息。推导了一种坐标下降算法来实现所提出的方法。该算法在处理大量SNP时效率很高。使用多分割方法来计算所选SNP的值以评估其显著性。我们将所提出的惩罚函数称为平滑MCP,将所提出的方法称为SMCP方法。通过模拟研究评估了所提出的SMCP方法的性能及其与LASSO和MCP方法的比较,结果表明所提出的方法在选择相关SNP方面更准确。使用异质品系小鼠数据和类风湿性关节炎说明了其在实际数据中的适用性。

相似文献

1
Accounting for linkage disequilibrium in genome-wide association studies: A penalized regression method.全基因组关联研究中连锁不平衡的考量:一种惩罚回归方法。
Stat Interface. 2013 Jan 1;6(1):99-115. doi: 10.4310/SII.2013.v6.n1.a10.
2
Regularized regression method for genome-wide association studies.用于全基因组关联研究的正则化回归方法
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S67. doi: 10.1186/1753-6561-5-S9-S67.
3
Iterative hard thresholding for model selection in genome-wide association studies.全基因组关联研究中用于模型选择的迭代硬阈值法
Genet Epidemiol. 2017 Dec;41(8):756-768. doi: 10.1002/gepi.22068. Epub 2017 Sep 6.
4
Performance of a blockwise approach in variable selection using linkage disequilibrium information.使用连锁不平衡信息进行变量选择时的分块方法性能。
BMC Bioinformatics. 2015 May 8;16:148. doi: 10.1186/s12859-015-0556-6.
5
SNP selection in genome-wide and candidate gene studies via penalized logistic regression.通过惩罚逻辑回归进行全基因组和候选基因研究中的 SNP 选择。
Genet Epidemiol. 2010 Dec;34(8):879-91. doi: 10.1002/gepi.20543.
6
Penalized multimarker vs. single-marker regression methods for genome-wide association studies of quantitative traits.用于数量性状全基因组关联研究的惩罚多标记与单标记回归方法
Genetics. 2015 Jan;199(1):205-22. doi: 10.1534/genetics.114.167817. Epub 2014 Oct 28.
7
Efficient cross-trait penalized regression increases prediction accuracy in large cohorts using secondary phenotypes.利用次要表型,有效的跨性状惩罚回归提高了大队列中的预测准确性。
Nat Commun. 2019 Feb 4;10(1):569. doi: 10.1038/s41467-019-08535-0.
8
Penalized-regression-based multimarker genotype analysis of Genetic Analysis Workshop 17 data.基于惩罚回归的遗传分析研讨会17数据多标记基因型分析
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S92. doi: 10.1186/1753-6561-5-S9-S92.
9
Majorization Minimization by Coordinate Descent for Concave Penalized Generalized Linear Models.基于坐标下降法的凹惩罚广义线性模型的优化最小化
Stat Comput. 2014 Sep;24(5):871-883. doi: 10.1007/s11222-013-9407-3.
10
Genome-wide association studies using a penalized moving-window regression.基于惩罚移动窗口回归的全基因组关联研究。
Bioinformatics. 2017 Dec 15;33(24):3887-3894. doi: 10.1093/bioinformatics/btx522.

引用本文的文献

1
Quantitative genomics-enabled selection for simultaneous improvement of lint yield and seed traits in cotton (Gossypium hirsutum L.).基于全基因组关联分析的棉花纤维产量和种子性状的协同改良选择。
Theor Appl Genet. 2024 May 26;137(6):142. doi: 10.1007/s00122-024-04645-6.
2
Bi-level structured functional analysis for genome-wide association studies.基于双层结构的全基因组关联研究功能分析。
Biometrics. 2023 Dec;79(4):3359-3373. doi: 10.1111/biom.13871. Epub 2023 May 7.
3
Potential application of elastic nets for shared polygenicity detection with adapted threshold selection.

本文引用的文献

1
: Coordinate Descent With Nonconvex Penalties.带非凸惩罚项的坐标下降法
J Am Stat Assoc. 2011;106(495):1125-1138. doi: 10.1198/jasa.2011.tm09738.
2
COORDINATE DESCENT ALGORITHMS FOR NONCONVEX PENALIZED REGRESSION, WITH APPLICATIONS TO BIOLOGICAL FEATURE SELECTION.用于非凸惩罚回归的坐标下降算法及其在生物特征选择中的应用
Ann Appl Stat. 2011 Jan 1;5(1):232-253. doi: 10.1214/10-AOAS388.
3
Regularization Paths for Generalized Linear Models via Coordinate Descent.基于坐标下降法的广义线性模型正则化路径
弹性网络在具有自适应阈值选择的共享多基因性检测中的潜在应用。
Int J Biostat. 2022 Nov 3;19(2):417-438. doi: 10.1515/ijb-2020-0108. eCollection 2023 Nov 1.
4
A Physics-Guided Neural Network for Predicting Protein-Ligand Binding Free Energy: From Host-Guest Systems to the PDBbind Database.基于物理的神经网络预测蛋白质-配体结合自由能:从主客体体系到 PDBbind 数据库。
Biomolecules. 2022 Jun 29;12(7):919. doi: 10.3390/biom12070919.
5
A maximum flow-based network approach for identification of stable noncoding biomarkers associated with the multigenic neurological condition, autism.一种基于最大流的网络方法,用于识别与多基因神经疾病——自闭症相关的稳定非编码生物标志物。
BioData Min. 2021 May 3;14(1):28. doi: 10.1186/s13040-021-00262-x.
6
Studying the effects of haplotype partitioning methods on the RA-associated genomic results from the North American Rheumatoid Arthritis Consortium (NARAC) dataset.研究单倍型划分方法对来自北美类风湿关节炎协会(NARAC)数据集的类风湿关节炎相关基因组结果的影响。
J Adv Res. 2019 Jan 18;18:113-126. doi: 10.1016/j.jare.2019.01.006. eCollection 2019 Jul.
7
Fast and flexible linear mixed models for genome-wide genetics.快速灵活的全基因组遗传学线性混合模型。
PLoS Genet. 2019 Feb 8;15(2):e1007978. doi: 10.1371/journal.pgen.1007978. eCollection 2019 Feb.
8
Efficient cross-trait penalized regression increases prediction accuracy in large cohorts using secondary phenotypes.利用次要表型,有效的跨性状惩罚回归提高了大队列中的预测准确性。
Nat Commun. 2019 Feb 4;10(1):569. doi: 10.1038/s41467-019-08535-0.
9
Genome-wide association studies using a penalized moving-window regression.基于惩罚移动窗口回归的全基因组关联研究。
Bioinformatics. 2017 Dec 15;33(24):3887-3894. doi: 10.1093/bioinformatics/btx522.
10
A pharmacogenomic study on the pharmacokinetics of tacrolimus in healthy subjects using the DMET Plus platform.一项使用DMET Plus平台对健康受试者中他克莫司药代动力学进行的药物基因组学研究。
Pharmacogenomics J. 2017 Mar;17(2):174-179. doi: 10.1038/tpj.2015.99. Epub 2016 Feb 16.
J Stat Softw. 2010;33(1):1-22.
4
Data for Genetic Analysis Workshop 16 Problem 1, association analysis of rheumatoid arthritis data.遗传分析研讨会16问题1的数据,类风湿性关节炎数据的关联分析。
BMC Proc. 2009 Dec 15;3 Suppl 7(Suppl 7):S2. doi: 10.1186/1753-6561-3-s7-s2.
5
Genome-wide association analysis by lasso penalized logistic regression.基于套索惩罚逻辑回归的全基因组关联分析。
Bioinformatics. 2009 Mar 15;25(6):714-21. doi: 10.1093/bioinformatics/btp041. Epub 2009 Jan 28.
6
Genetic and environmental effects on complex traits in mice.基因和环境对小鼠复杂性状的影响。
Genetics. 2006 Oct;174(2):959-84. doi: 10.1534/genetics.106.060004. Epub 2006 Aug 3.
7
Genome-wide genetic association of complex traits in heterogeneous stock mice.异质种群小鼠复杂性状的全基因组遗传关联研究
Nat Genet. 2006 Aug;38(8):879-87. doi: 10.1038/ng1840. Epub 2006 Jul 9.
8
Replication of putative candidate-gene associations with rheumatoid arthritis in >4,000 samples from North America and Sweden: association of susceptibility with PTPN22, CTLA4, and PADI4.在来自北美和瑞典的4000多个样本中对类风湿关节炎假定候选基因关联进行复制研究:易感性与蛋白酪氨酸磷酸酶非受体型22(PTPN22)、细胞毒性T淋巴细胞相关抗原4(CTLA4)和肽基精氨酸脱亚氨酶4(PADI4)的关联。
Am J Hum Genet. 2005 Dec;77(6):1044-60. doi: 10.1086/498651. Epub 2005 Nov 1.
9
A missense single-nucleotide polymorphism in a gene encoding a protein tyrosine phosphatase (PTPN22) is associated with rheumatoid arthritis.编码蛋白酪氨酸磷酸酶(PTPN22)的基因中的一个错义单核苷酸多态性与类风湿性关节炎相关。
Am J Hum Genet. 2004 Aug;75(2):330-7. doi: 10.1086/422827. Epub 2004 Jun 18.
10
A review of the MHC genetics of rheumatoid arthritis.类风湿关节炎的主要组织相容性复合体遗传学综述。
Genes Immun. 2004 May;5(3):151-7. doi: 10.1038/sj.gene.6364045.