• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

全基因组关联研究中的一种高效加权标签单核苷酸多态性集分析方法。

An efficient weighted tag SNP-set analytical method in genome-wide association studies.

作者信息

Yan Bin, Wang Shudong, Jia Huaqian, Liu Xing, Wang Xinzeng

机构信息

College of Mathematics and Systems Science, Shandong University of Science and Technology, Qingdao, Shandong, 266590, China.

College of Computer and Communication Engineering, China University of Petroleum, Qingdao, Shandong, 266580, China.

出版信息

BMC Genet. 2015 Mar 13;16:25. doi: 10.1186/s12863-015-0182-3.

DOI:10.1186/s12863-015-0182-3
PMID:25879733
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4373116/
Abstract

BACKGROUND

Single-nucleotide polymorphism (SNP)-set analysis in Genome-wide association studies (GWAS) has emerged as a research hotspot for identifying genetic variants associated with disease susceptibility. But most existing methods of SNP-set analysis are affected by the quality of SNP-set, and poor quality of SNP-set can lead to low power in GWAS.

RESULTS

In this research, we propose an efficient weighted tag-SNP-set analytical method to detect the disease associations. In our method, we first design a fast algorithm to select a subset of SNPs (called tag SNP-set) from a given original SNP-set based on the linkage disequilibrium (LD) between SNPs, then assign a proper weight to each of the selected tag SNP respectively and test the joint effect of these weighted tag SNPs. The intensive simulation results show that the power of weighted tag SNP-set-based test is much higher than that of weighted original SNP-set-based test and that of un-weighted tag SNP-set-based test. We also compare the powers of the weighted tag SNP-set-based test based on four types of tag SNP-sets. The simulation results indicate the method of selecting tag SNP-set impacts the power greatly and the power of our proposed method is the highest.

CONCLUSIONS

From the analysis of simulated replicated data sets, we came to a conclusion that weighted tag SNP-set-based test is a powerful SNP-set test in GWAS. We also designed a faster algorithm of selecting tag SNPs which include most of information of original SNP-set, and a better weighted function which can describe the status of each tag SNP in GWAS.

摘要

背景

全基因组关联研究(GWAS)中的单核苷酸多态性(SNP)集分析已成为识别与疾病易感性相关的遗传变异的研究热点。但现有的大多数SNP集分析方法都受SNP集质量的影响,而SNP集质量不佳会导致GWAS功效较低。

结果

在本研究中,我们提出了一种有效的加权标签SNP集分析方法来检测疾病关联。在我们的方法中,我们首先设计了一种快速算法,基于SNP之间的连锁不平衡(LD)从给定的原始SNP集中选择一个SNP子集(称为标签SNP集),然后分别为每个选定的标签SNP赋予适当的权重,并测试这些加权标签SNP的联合效应。大量模拟结果表明,基于加权标签SNP集的检验功效远高于基于加权原始SNP集的检验和基于未加权标签SNP集的检验。我们还比较了基于四种类型标签SNP集的加权标签SNP集检验的功效。模拟结果表明,标签SNP集的选择方法对功效有很大影响,我们提出的方法功效最高。

结论

通过对模拟重复数据集的分析,我们得出结论,基于加权标签SNP集的检验是GWAS中一种强大的SNP集检验方法。我们还设计了一种更快的选择标签SNP的算法,该算法包含原始SNP集的大部分信息,以及一个更好的加权函数,该函数可以描述每个标签SNP在GWAS中的状态。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e79c/4373116/17bacd253785/12863_2015_182_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e79c/4373116/4fd7a14cfea1/12863_2015_182_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e79c/4373116/41162ab810be/12863_2015_182_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e79c/4373116/17bacd253785/12863_2015_182_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e79c/4373116/4fd7a14cfea1/12863_2015_182_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e79c/4373116/41162ab810be/12863_2015_182_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e79c/4373116/17bacd253785/12863_2015_182_Fig3_HTML.jpg

相似文献

1
An efficient weighted tag SNP-set analytical method in genome-wide association studies.全基因组关联研究中的一种高效加权标签单核苷酸多态性集分析方法。
BMC Genet. 2015 Mar 13;16:25. doi: 10.1186/s12863-015-0182-3.
2
Tagging SNP-set selection with maximum information based on linkage disequilibrium structure in genome-wide association studies.基于全基因组关联研究中连锁不平衡结构的最大信息进行 SNP 集选择标记。
Bioinformatics. 2017 Jul 15;33(14):2078-2081. doi: 10.1093/bioinformatics/btx151.
3
A multi-SNP association test for complex diseases incorporating an optimal P-value threshold algorithm in nuclear families.一种在核心家庭中纳入最优P值阈值算法的复杂疾病多单核苷酸多态性关联测试。
BMC Genomics. 2015 May 15;16(1):381. doi: 10.1186/s12864-015-1620-3.
4
FastTagger: an efficient algorithm for genome-wide tag SNP selection using multi-marker linkage disequilibrium.FastTagger:一种利用多标记连锁不平衡进行全基因组标签 SNP 选择的高效算法。
BMC Bioinformatics. 2010 Jan 29;11:66. doi: 10.1186/1471-2105-11-66.
5
Genome-wide selection of tag SNPs using multiple-marker correlation.使用多标记相关性进行全基因组标签单核苷酸多态性选择。
Bioinformatics. 2007 Dec 1;23(23):3178-84. doi: 10.1093/bioinformatics/btm496. Epub 2007 Nov 15.
6
Weighted SNP set analysis in genome-wide association study.基于全基因组关联研究的加权 SNP 集分析。
PLoS One. 2013 Sep 30;8(9):e75897. doi: 10.1371/journal.pone.0075897. eCollection 2013.
7
Multi-marker-LD based genetic algorithm for tag SNP selection.基于多标记连锁不平衡的标签单核苷酸多态性选择遗传算法
Interdiscip Sci. 2014 Dec;6(4):303-11. doi: 10.1007/s12539-012-0060-x. Epub 2014 Aug 9.
8
Increasing power of genome-wide association studies by collecting additional single-nucleotide polymorphisms.通过收集额外的单核苷酸多态性来提高全基因组关联研究的效力。
Genetics. 2011 Jun;188(2):449-60. doi: 10.1534/genetics.111.128595. Epub 2011 Apr 5.
9
A new model of multi-marker correlation for genome-wide tag SNP selection.一种用于全基因组标签单核苷酸多态性选择的多标记相关性新模型。
Genome Inform. 2008;21:27-41.
10
SNP-based pathway enrichment analysis for genome-wide association studies.基于 SNP 的通路富集分析在全基因组关联研究中的应用。
BMC Bioinformatics. 2011 Apr 15;12:99. doi: 10.1186/1471-2105-12-99.

引用本文的文献

1
The mutation-free embryo for in vitro fertilization selected by MALBAC-PGD resulted in a healthy live birth from a family carrying PKD 1 mutation.MALBAC-PGD 筛选的无突变胚胎导致携带 PKD1 突变的家庭健康活产。
J Assist Reprod Genet. 2017 Dec;34(12):1653-1658. doi: 10.1007/s10815-017-1018-z. Epub 2017 Aug 19.
2
Prioritizing individual genetic variants after kernel machine testing using variable selection.在使用变量选择的核机器测试后对个体遗传变异进行优先级排序。
Genet Epidemiol. 2016 Dec;40(8):722-731. doi: 10.1002/gepi.21993. Epub 2016 Aug 3.
3
A powerful score-based test statistic for detecting gene-gene co-association.

本文引用的文献

1
Linkage disequilibrium in finite populations.有限群体中的连锁不平衡。
Theor Appl Genet. 1968 Jun;38(6):226-31. doi: 10.1007/BF01245622.
2
Nonparametric tests of associations with disease based on U-statistics.基于U统计量的疾病关联非参数检验。
Ann Hum Genet. 2014 Mar;78(2):141-53. doi: 10.1111/ahg.12049. Epub 2013 Dec 16.
3
SNP set analysis for detecting disease association using exon sequence data.使用外显子序列数据进行疾病关联检测的单核苷酸多态性(SNP)集分析。
一种用于检测基因-基因共关联的基于分数的强大检验统计量。
BMC Genet. 2016 Jan 29;17:31. doi: 10.1186/s12863-016-0331-3.
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S91. doi: 10.1186/1753-6561-5-S9-S91.
4
Statistical analysis of rare sequence variants: an overview of collapsing methods.稀有序列变异的统计分析:压缩方法概述。
Genet Epidemiol. 2011;35 Suppl 1(Suppl 1):S12-7. doi: 10.1002/gepi.20643.
5
HAPGEN2: simulation of multiple disease SNPs.HAPGEN2:模拟多种疾病 SNP。
Bioinformatics. 2011 Aug 15;27(16):2304-5. doi: 10.1093/bioinformatics/btr341. Epub 2011 Jun 8.
6
Powerful SNP-set analysis for case-control genome-wide association studies.基于全基因组关联研究的病例对照 SNP 集分析。
Am J Hum Genet. 2010 Jun 11;86(6):929-42. doi: 10.1016/j.ajhg.2010.05.002.
7
Association tests using kernel-based measures of multi-locus genotype similarity between individuals.基于核函数的个体间多基因座基因型相似性的关联测试。
Genet Epidemiol. 2010 Apr;34(3):213-21. doi: 10.1002/gepi.20451.
8
Estimation and testing for the effect of a genetic pathway on a disease outcome using logistic kernel machine regression via logistic mixed models.通过逻辑混合模型,使用逻辑核机器回归估计和检验遗传通路对疾病结局的影响。
BMC Bioinformatics. 2008 Jun 24;9:292. doi: 10.1186/1471-2105-9-292.
9
On multiple-testing correction in genome-wide association studies.关于全基因组关联研究中的多重检验校正
Genet Epidemiol. 2008 Sep;32(6):567-73. doi: 10.1002/gepi.20331.
10
A genome-wide association study identifies alleles in FGFR2 associated with risk of sporadic postmenopausal breast cancer.一项全基因组关联研究确定了FGFR2基因中的等位基因与散发性绝经后乳腺癌风险相关。
Nat Genet. 2007 Jul;39(7):870-4. doi: 10.1038/ng2075. Epub 2007 May 27.