• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SHARE:一种用于为候选基因关联选择信息量最大的单核苷酸多态性(SNP)集合的自适应算法。

SHARE: an adaptive algorithm to select the most informative set of SNPs for candidate genetic association.

作者信息

Dai James Y, Leblanc Michael, Smith Nicholas L, Psaty Bruce, Kooperberg Charles

机构信息

Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue N, M2-C200, Seattle, WA 98109, USA.

出版信息

Biostatistics. 2009 Oct;10(4):680-93. doi: 10.1093/biostatistics/kxp023. Epub 2009 Jul 15.

DOI:10.1093/biostatistics/kxp023
PMID:19605740
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2742496/
Abstract

Association studies have been widely used to identify genetic liability variants for complex diseases. While scanning the chromosomal region 1 single nucleotide polymorphism (SNP) at a time may not fully explore linkage disequilibrium, haplotype analyses tend to require a fairly large number of parameters, thus potentially losing power. Clustering algorithms, such as the cladistic approach, have been proposed to reduce the dimensionality, yet they have important limitations. We propose a SNP-Haplotype Adaptive REgression (SHARE) algorithm that seeks the most informative set of SNPs for genetic association in a targeted candidate region by growing and shrinking haplotypes with 1 more or less SNP in a stepwise fashion, and comparing prediction errors of different models via cross-validation. Depending on the evolutionary history of the disease mutations and the markers, this set may contain a single SNP or several SNPs that lay a foundation for haplotype analyses. Haplotype phase ambiguity is effectively accounted for by treating haplotype reconstruction as a part of the learning procedure. Simulations and a data application show that our method has improved power over existing methodologies and that the results are informative in the search for disease-causal loci.

摘要

关联研究已被广泛用于识别复杂疾病的遗传易感性变异。虽然一次扫描染色体区域的单个单核苷酸多态性(SNP)可能无法充分探索连锁不平衡,但单倍型分析往往需要相当多的参数,因此可能会降低效能。已经提出了聚类算法,如分支方法,以降低维度,但它们有重要的局限性。我们提出了一种单核苷酸多态性-单倍型自适应回归(SHARE)算法,该算法通过逐步增加或减少一个SNP来生长和收缩单倍型,并通过交叉验证比较不同模型的预测误差,从而在目标候选区域中寻找用于遗传关联的最具信息性的SNP集合。根据疾病突变和标记的进化历史,该集合可能包含单个SNP或几个SNP,为单倍型分析奠定基础。通过将单倍型重建视为学习过程的一部分,有效地解决了单倍型相位模糊问题。模拟和数据应用表明,我们的方法比现有方法具有更高的效能,并且结果在寻找疾病致病基因座方面具有参考价值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5de4/2742496/43b18081b78f/biostskxp023f03_lw.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5de4/2742496/f956ccdc1388/biostskxp023f01_lw.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5de4/2742496/4c5c8db733df/biostskxp023f02_lw.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5de4/2742496/43b18081b78f/biostskxp023f03_lw.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5de4/2742496/f956ccdc1388/biostskxp023f01_lw.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5de4/2742496/4c5c8db733df/biostskxp023f02_lw.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5de4/2742496/43b18081b78f/biostskxp023f03_lw.jpg

相似文献

1
SHARE: an adaptive algorithm to select the most informative set of SNPs for candidate genetic association.SHARE:一种用于为候选基因关联选择信息量最大的单核苷酸多态性(SNP)集合的自适应算法。
Biostatistics. 2009 Oct;10(4):680-93. doi: 10.1093/biostatistics/kxp023. Epub 2009 Jul 15.
2
The impact of missing and erroneous genotypes on tagging SNP selection and power of subsequent association tests.缺失和错误基因型对标签单核苷酸多态性选择及后续关联检验效能的影响。
Hum Hered. 2006;61(1):31-44. doi: 10.1159/000092141. Epub 2006 Mar 23.
3
Comparison of multimarker logistic regression models, with application to a genomewide scan of schizophrenia.多标志物逻辑回归模型的比较及其在精神分裂症全基因组扫描中的应用。
BMC Genet. 2010 Sep 9;11:80. doi: 10.1186/1471-2156-11-80.
4
Selecting Closely-Linked SNPs Based on Local Epistatic Effects for Haplotype Construction Improves Power of Association Mapping.基于局部上位效应选择紧密连锁 SNPs 进行单倍型构建可提高关联作图的功效。
G3 (Bethesda). 2019 Dec 3;9(12):4115-4126. doi: 10.1534/g3.119.400451.
5
Multi-locus stepwise regression: a haplotype-based algorithm for finding genetic associations applied to atopic dermatitis.多基因逐步回归:一种基于单体型的寻找遗传关联的算法,应用于特应性皮炎。
BMC Med Genet. 2012 Jan 27;13:8. doi: 10.1186/1471-2350-13-8.
6
Genome-wide association data classification and SNPs selection using two-stage quality-based Random Forests.使用基于质量的两阶段随机森林进行全基因组关联数据分类和单核苷酸多态性选择。
BMC Genomics. 2015;16 Suppl 2(Suppl 2):S5. doi: 10.1186/1471-2164-16-S2-S5. Epub 2015 Jan 21.
7
Efficient haplotype block partitioning and tag SNP selection algorithms under various constraints.各种约束条件下的高效单倍型块划分及标签单核苷酸多态性选择算法。
Biomed Res Int. 2013;2013:984014. doi: 10.1155/2013/984014. Epub 2013 Nov 11.
8
A hidden Markov random field model for genome-wide association studies.基于隐马尔可夫随机场模型的全基因组关联研究。
Biostatistics. 2010 Jan;11(1):139-50. doi: 10.1093/biostatistics/kxp043. Epub 2009 Oct 12.
9
A systematic search for SNPs/haplotypes associated with disease phenotypes using a haplotype-based stepwise procedure.使用基于单倍型的逐步程序对与疾病表型相关的单核苷酸多态性/单倍型进行系统搜索。
BMC Genet. 2008 Dec 22;9:90. doi: 10.1186/1471-2156-9-90.
10
Performance of random forest when SNPs are in linkage disequilibrium.单核苷酸多态性处于连锁不平衡状态时随机森林的性能。
BMC Bioinformatics. 2009 Mar 5;10:78. doi: 10.1186/1471-2105-10-78.

引用本文的文献

1
Analysis of dog breed diversity using a composite selection index.利用复合选择指数分析犬种多样性。
Sci Rep. 2023 Jan 30;13(1):1674. doi: 10.1038/s41598-023-28826-3.
2
Selecting Closely-Linked SNPs Based on Local Epistatic Effects for Haplotype Construction Improves Power of Association Mapping.基于局部上位效应选择紧密连锁 SNPs 进行单倍型构建可提高关联作图的功效。
G3 (Bethesda). 2019 Dec 3;9(12):4115-4126. doi: 10.1534/g3.119.400451.
3
Sliding window haplotype approaches overcome single SNP analysis limitations in identifying genes for meat tenderness in Nelore cattle.

本文引用的文献

1
A second generation human haplotype map of over 3.1 million SNPs.一张包含超过310万个单核苷酸多态性的第二代人类单倍型图谱。
Nature. 2007 Oct 18;449(7164):851-61. doi: 10.1038/nature06258.
2
Imputation-based analysis of association studies: candidate regions and quantitative traits.基于归因的关联研究分析:候选区域和数量性状
PLoS Genet. 2007 Jul;3(7):e114. doi: 10.1371/journal.pgen.0030114. Epub 2007 May 30.
3
Sequential haplotype scan methods for association analysis.用于关联分析的序列单倍型扫描方法。
滑动窗口单体型方法克服了单 SNP 分析在鉴定内罗尔牛嫩度相关基因方面的局限性。
BMC Genet. 2019 Jan 14;20(1):8. doi: 10.1186/s12863-019-0713-4.
4
Multi-locus stepwise regression: a haplotype-based algorithm for finding genetic associations applied to atopic dermatitis.多基因逐步回归:一种基于单体型的寻找遗传关联的算法,应用于特应性皮炎。
BMC Med Genet. 2012 Jan 27;13:8. doi: 10.1186/1471-2350-13-8.
5
Shrunken methodology to genome-wide SNPs selection and construction of SNPs networks.用于全基因组单核苷酸多态性(SNP)选择和SNP网络构建的缩减方法。
BMC Syst Biol. 2010 Sep 13;4 Suppl 2(Suppl 2):S5. doi: 10.1186/1752-0509-4-S2-S5.
6
Structures and Assumptions: Strategies to Harness Gene × Gene and Gene × Environment Interactions in GWAS.结构与假设:全基因组关联研究中利用基因×基因和基因×环境相互作用的策略
Stat Sci. 2009;24(4):472-488. doi: 10.1214/09-sts287.
Genet Epidemiol. 2007 Sep;31(6):553-64. doi: 10.1002/gepi.20228.
4
Association mapping via regularized regression analysis of single-nucleotide-polymorphism haplotypes in variable-sized sliding windows.通过对可变大小滑动窗口中的单核苷酸多态性单倍型进行正则化回归分析进行关联作图。
Am J Hum Genet. 2007 Apr;80(4):705-15. doi: 10.1086/513205. Epub 2007 Feb 19.
5
Association of genetic variations with nonfatal venous thrombosis in postmenopausal women.绝经后女性基因变异与非致命性静脉血栓形成的关联
JAMA. 2007 Feb 7;297(5):489-98. doi: 10.1001/jama.297.5.489.
6
LdCompare: rapid computation of single- and multiple-marker r2 and genetic coverage.LdCompare:单标记和多标记r2及遗传覆盖率的快速计算
Bioinformatics. 2007 Jan 15;23(2):252-4. doi: 10.1093/bioinformatics/btl574. Epub 2006 Dec 5.
7
Testing untyped alleles (TUNA)-applications to genome-wide association studies.非分型等位基因检测(TUNA)在全基因组关联研究中的应用
Genet Epidemiol. 2006 Dec;30(8):718-27. doi: 10.1002/gepi.20182.
8
A flexible Bayesian framework for modeling haplotype association with disease, allowing for dominance effects of the underlying causative variants.一种用于对单倍型与疾病关联进行建模的灵活贝叶斯框架,该框架考虑了潜在致病变异的显性效应。
Am J Hum Genet. 2006 Oct;79(4):679-94. doi: 10.1086/508264. Epub 2006 Aug 31.
9
Evaluating and improving power in whole-genome association studies using fixed marker sets.使用固定标记集评估和提高全基因组关联研究的效能
Nat Genet. 2006 Jun;38(6):663-7. doi: 10.1038/ng1816. Epub 2006 May 21.
10
Multilocus association mapping using variable-length Markov chains.使用可变长度马尔可夫链的多位点关联作图
Am J Hum Genet. 2006 Jun;78(6):903-13. doi: 10.1086/503876. Epub 2006 Apr 7.