• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过高效的单核苷酸多态性选择增强基于基因的关联研究中的信号。

Boosting signals in gene-based association studies via efficient SNP selection.

作者信息

Wu Cen, Cui Yuehua

机构信息

Department of Statistics and Probability, Michigan State University, 619 Red Cedar Road, Rm C432, East Lansing, MI 48824, USA. Tel.: +1-517-432-7098; Fax: +1-517-432-1405;

出版信息

Brief Bioinform. 2014 Mar;15(2):279-91. doi: 10.1093/bib/bbs087. Epub 2013 Jan 15.

DOI:10.1093/bib/bbs087
PMID:23325548
Abstract

Set-based association studies based on genes or pathways have shown great promise in interpreting association signals associated with complex diseases. These approaches are particularly useful when variants in a set have moderate effects and are difficult to be detected with single marker analysis, especially when variants function jointly in a complicated manner. The set-based analyses use a summary statistic such as the maximum or average of individual signal (e.g. a chi-square statistic) over all variants in a set, or consider their joint distribution to assess the significance of the set. The signal obtained with this treatment, however, could be potentially diluted when noisy variants are not taken good care of, leading to either inflated false negatives or false positives. Thus, the selection of disease informative single-nucleotide polymorphism (diSNPs) plays a crucial role in improving the power of the set-based association study. In this work, we propose an efficient diSNP selection method based on the information theory. We select diSNP variants by considering their relative information contribution to a disease status, which is different from the usual tag SNP selection. The relative merit of pre-selecting diSNPs in a set-based association analysis is demonstrated through extensive simulation studies and real data analysis.

摘要

基于基因或通路的集合关联研究在解释与复杂疾病相关的关联信号方面显示出巨大潜力。当集合中的变异具有中等效应且难以通过单标记分析检测到时,这些方法特别有用,尤其是当变异以复杂的方式共同发挥作用时。基于集合的分析使用诸如集合中所有变异的单个信号(例如卡方统计量)的最大值或平均值之类的汇总统计量,或者考虑它们的联合分布来评估集合的显著性。然而,当没有妥善处理有噪声的变异时,通过这种处理获得的信号可能会被潜在稀释,导致假阴性或假阳性的膨胀。因此,疾病信息性单核苷酸多态性(diSNP)的选择在提高基于集合的关联研究的效能方面起着关键作用。在这项工作中,我们提出了一种基于信息论的有效diSNP选择方法。我们通过考虑它们对疾病状态的相对信息贡献来选择diSNP变异,这与通常的标签SNP选择不同。通过广泛的模拟研究和实际数据分析,证明了在基于集合的关联分析中预先选择diSNP的相对优点。

相似文献

1
Boosting signals in gene-based association studies via efficient SNP selection.通过高效的单核苷酸多态性选择增强基于基因的关联研究中的信号。
Brief Bioinform. 2014 Mar;15(2):279-91. doi: 10.1093/bib/bbs087. Epub 2013 Jan 15.
2
A multi-SNP association test for complex diseases incorporating an optimal P-value threshold algorithm in nuclear families.一种在核心家庭中纳入最优P值阈值算法的复杂疾病多单核苷酸多态性关联测试。
BMC Genomics. 2015 May 15;16(1):381. doi: 10.1186/s12864-015-1620-3.
3
An efficient weighted tag SNP-set analytical method in genome-wide association studies.全基因组关联研究中的一种高效加权标签单核苷酸多态性集分析方法。
BMC Genet. 2015 Mar 13;16:25. doi: 10.1186/s12863-015-0182-3.
4
Dealing with heterogeneity between cohorts in genomewide SNP association studies.应对全基因组单核苷酸多态性关联研究中不同队列间的异质性。
Stat Appl Genet Mol Biol. 2010;9:Article 8. doi: 10.2202/1544-6115.1503. Epub 2010 Jan 13.
5
On optimal gene-based analysis of genome scans.基于基因的最优基因组扫描分析。
Genet Epidemiol. 2012 May;36(4):333-9. doi: 10.1002/gepi.21625. Epub 2012 Apr 16.
6
The power of genome-wide association studies of complex disease genes: statistical limitations of indirect approaches using SNP markers.复杂疾病基因的全基因组关联研究的力量:使用单核苷酸多态性(SNP)标记的间接方法的统计局限性。
J Hum Genet. 2001;46(8):478-82. doi: 10.1007/s100380170048.
7
The impact of missing and erroneous genotypes on tagging SNP selection and power of subsequent association tests.缺失和错误基因型对标签单核苷酸多态性选择及后续关联检验效能的影响。
Hum Hered. 2006;61(1):31-44. doi: 10.1159/000092141. Epub 2006 Mar 23.
8
Genome-wide selection of tag SNPs using multiple-marker correlation.使用多标记相关性进行全基因组标签单核苷酸多态性选择。
Bioinformatics. 2007 Dec 1;23(23):3178-84. doi: 10.1093/bioinformatics/btm496. Epub 2007 Nov 15.
9
Tag SNP selection via a genetic algorithm.通过遗传算法进行标签 SNP 选择。
J Biomed Inform. 2010 Oct;43(5):800-4. doi: 10.1016/j.jbi.2010.05.011. Epub 2010 May 28.
10
Is there still a need for candidate gene approaches in the era of genome-wide association studies?在全基因组关联研究时代,候选基因方法是否仍有必要?
Genomics. 2009 May;93(5):415-9. doi: 10.1016/j.ygeno.2008.12.011. Epub 2009 Jan 20.

引用本文的文献

1
Springer: An R package for bi-level variable selection of high-dimensional longitudinal data.施普林格:用于高维纵向数据双层变量选择的R包。
Front Genet. 2023 Apr 6;14:1088223. doi: 10.3389/fgene.2023.1088223. eCollection 2023.
2
Integrating Multi-Omics Data for Gene-Environment Interactions.整合多组学数据以研究基因-环境相互作用
BioTech (Basel). 2021 Jan 29;10(1):3. doi: 10.3390/biotech10010003.
3
Interep: An R Package for High-Dimensional Interaction Analysis of the Repeated Measurement Data.Interep:一个用于重复测量数据高维交互分析的R软件包。
Genes (Basel). 2022 Mar 19;13(3):544. doi: 10.3390/genes13030544.
4
Identifying Gene-Environment Interactions With Robust Marginal Bayesian Variable Selection.利用稳健边际贝叶斯变量选择识别基因-环境相互作用
Front Genet. 2021 Dec 8;12:667074. doi: 10.3389/fgene.2021.667074. eCollection 2021.
5
Gene-Environment Interaction: A Variable Selection Perspective.基因-环境相互作用:变量选择视角
Methods Mol Biol. 2021;2212:191-223. doi: 10.1007/978-1-0716-0947-7_13.
6
Semiparametric Bayesian variable selection for gene-environment interactions.用于基因-环境相互作用的半参数贝叶斯变量选择
Stat Med. 2020 Feb 28;39(5):617-638. doi: 10.1002/sim.8434. Epub 2019 Dec 21.
7
Penalized Variable Selection for Lipid-Environment Interactions in a Longitudinal Lipidomics Study.纵向脂质组学研究中脂质环境相互作用的惩罚变量选择。
Genes (Basel). 2019 Dec 3;10(12):1002. doi: 10.3390/genes10121002.
8
Considering Genetic Heterogeneity in the Association Analysis Finds Genes Associated With Nicotine Dependence.在关联分析中考虑基因异质性发现与尼古丁依赖相关的基因。
Front Genet. 2019 May 17;10:448. doi: 10.3389/fgene.2019.00448. eCollection 2019.
9
A Selective Review of Multi-Level Omics Data Integration Using Variable Selection.使用变量选择对多组学数据整合进行的选择性综述
High Throughput. 2019 Jan 18;8(1):4. doi: 10.3390/ht8010004.
10
A powerful score-based test statistic for detecting gene-gene co-association.一种用于检测基因-基因共关联的基于分数的强大检验统计量。
BMC Genet. 2016 Jan 29;17:31. doi: 10.1186/s12863-016-0331-3.