• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过有效的自适应评分检验识别与表型相关的遗传标记集。

Identifying genetic marker sets associated with phenotypes via an efficient adaptive score test.

机构信息

Department of Biostatistics, Harvard School of Public Health, Boston, MA 02115, USA.

出版信息

Biostatistics. 2012 Sep;13(4):776-90. doi: 10.1093/biostatistics/kxs015. Epub 2012 Jun 25.

DOI:10.1093/biostatistics/kxs015
PMID:22734045
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3440238/
Abstract

In recent years, genome-wide association studies (GWAS) and gene-expression profiling have generated a large number of valuable datasets for assessing how genetic variations are related to disease outcomes. With such datasets, it is often of interest to assess the overall effect of a set of genetic markers, assembled based on biological knowledge. Genetic marker-set analyses have been advocated as more reliable and powerful approaches compared with the traditional marginal approaches (Curtis and others, 2005. Pathways to the analysis of microarray data. TRENDS in Biotechnology 23, 429-435; Efroni and others, 2007. Identification of key processes underlying cancer phenotypes using biologic pathway analysis. PLoS One 2, 425). Procedures for testing the overall effect of a marker-set have been actively studied in recent years. For example, score tests derived under an Empirical Bayes (EB) framework (Liu and others, 2007. Semiparametric regression of multidimensional genetic pathway data: least-squares kernel machines and linear mixed models. Biometrics 63, 1079-1088; Liu and others, 2008. Estimation and testing for the effect of a genetic pathway on a disease outcome using logistic kernel machine regression via logistic mixed models. BMC bioinformatics 9, 292-2; Wu and others, 2010. Powerful SNP-set analysis for case-control genome-wide association studies. American Journal of Human Genetics 86, 929) have been proposed as powerful alternatives to the standard Rao score test (Rao, 1948. Large sample tests of statistical hypotheses concerning several parameters with applications to problems of estimation. Mathematical Proceedings of the Cambridge Philosophical Society, 44, 50-57). The advantages of these EB-based tests are most apparent when the markers are correlated, due to the reduction in the degrees of freedom. In this paper, we propose an adaptive score test which up- or down-weights the contributions from each member of the marker-set based on the Z-scores of their effects. Such an adaptive procedure gains power over the existing procedures when the signal is sparse and the correlation among the markers is weak. By combining evidence from both the EB-based score test and the adaptive test, we further construct an omnibus test that attains good power in most settings. The null distributions of the proposed test statistics can be approximated well either via simple perturbation procedures or via distributional approximations. Through extensive simulation studies, we demonstrate that the proposed procedures perform well in finite samples. We apply the tests to a breast cancer genetic study to assess the overall effect of the FGFR2 gene on breast cancer risk.

摘要

近年来,全基因组关联研究(GWAS)和基因表达谱分析产生了大量有价值的数据集,可用于评估遗传变异与疾病结果之间的关系。有了这些数据集,通常有兴趣评估一组遗传标记的总体效应,这些标记是根据生物学知识组装的。与传统的边际方法(Curtis 等人,2005 年。微阵列数据分析的途径。生物技术趋势 23,429-435;Efroni 等人,2007 年。使用生物途径分析鉴定癌症表型的关键过程。公共科学图书馆一号 2,425)相比,遗传标记集分析已被提倡为更可靠和强大的方法。近年来,人们积极研究了检验标记集总体效应的程序。例如,在经验贝叶斯(EB)框架下推导的评分检验(Liu 等人,2007 年。多维遗传途径数据的半参数回归:最小二乘核机和线性混合模型。生物统计学 63,1079-1088;Liu 等人,2008 年。使用逻辑核机回归通过逻辑混合模型对遗传途径对疾病结果的影响进行估计和检验。生物信息学 9,292-2;Wu 等人,2010 年。用于病例对照全基因组关联研究的强大 SNP 集分析。美国人类遗传学杂志 86,929)已被提议作为标准 Rao 评分检验(Rao,1948 年。关于多个参数的统计假设的大样本检验及其在估计问题中的应用。剑桥哲学学会数学会刊,44,50-57)的有力替代品。由于自由度的减少,当标记物相关时,这些基于 EB 的检验的优势最为明显。在本文中,我们提出了一种自适应评分检验,根据其效应的 Z 分数对标记集的每个成员的贡献进行上下加权。当信号稀疏且标记物之间的相关性较弱时,这种自适应程序比现有程序更具优势。通过结合基于 EB 的评分检验和自适应检验的证据,我们进一步构建了一个综合检验,在大多数情况下都能获得良好的功效。所提出的检验统计量的零分布可以通过简单的摄动程序或分布逼近很好地近似。通过广泛的模拟研究,我们证明了这些程序在有限样本中表现良好。我们将这些检验应用于乳腺癌遗传研究,以评估 FGFR2 基因对乳腺癌风险的总体影响。

相似文献

1
Identifying genetic marker sets associated with phenotypes via an efficient adaptive score test.通过有效的自适应评分检验识别与表型相关的遗传标记集。
Biostatistics. 2012 Sep;13(4):776-90. doi: 10.1093/biostatistics/kxs015. Epub 2012 Jun 25.
2
A small-sample multivariate kernel machine test for microbiome association studies.用于微生物组关联研究的小样本多变量核机器测试。
Genet Epidemiol. 2017 Apr;41(3):210-220. doi: 10.1002/gepi.22030. Epub 2016 Dec 26.
3
Powerful gene set analysis in GWAS with the Generalized Berk-Jones statistic.基于广义 Berk-Jones 统计量的 GWAS 中强大的基因集分析。
PLoS Genet. 2019 Mar 15;15(3):e1007530. doi: 10.1371/journal.pgen.1007530. eCollection 2019 Mar.
4
An Adaptive Association Test for Multiple Phenotypes with GWAS Summary Statistics.一种利用全基因组关联研究汇总统计数据对多种表型进行的适应性关联测试。
Genet Epidemiol. 2015 Dec;39(8):651-63. doi: 10.1002/gepi.21931. Epub 2015 Oct 22.
5
Powerful SNP-set analysis for case-control genome-wide association studies.基于全基因组关联研究的病例对照 SNP 集分析。
Am J Hum Genet. 2010 Jun 11;86(6):929-42. doi: 10.1016/j.ajhg.2010.05.002.
6
Multivariate phenotype association analysis by marker-set kernel machine regression.基于标记集核机器回归的多变量表型关联分析。
Genet Epidemiol. 2012 Nov;36(7):686-95. doi: 10.1002/gepi.21663. Epub 2012 Aug 16.
7
Prioritizing individual genetic variants after kernel machine testing using variable selection.在使用变量选择的核机器测试后对个体遗传变异进行优先级排序。
Genet Epidemiol. 2016 Dec;40(8):722-731. doi: 10.1002/gepi.21993. Epub 2016 Aug 3.
8
Testing and estimation in marker-set association study using semiparametric quantile regression kernel machine.使用半参数分位数回归核机器进行标记集关联研究中的检验与估计。
Biometrics. 2016 Jun;72(2):364-71. doi: 10.1111/biom.12438. Epub 2015 Nov 17.
9
Inference on phenotype-specific effects of genes using multivariate kernel machine regression.使用多变量核机器回归推断基因的表型特异性效应。
Genet Epidemiol. 2018 Feb;42(1):64-79. doi: 10.1002/gepi.22096. Epub 2018 Jan 3.
10
A gene based combination test using GWAS summary data.基于 GWAS 汇总数据的基因组合测试。
BMC Bioinformatics. 2023 Jan 3;24(1):2. doi: 10.1186/s12859-022-05114-x.

引用本文的文献

1
A high-dimensional omnibus test for set-based association analysis.基于集合的关联分析的高维综合检验。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae456.
2
A Two-Part Mixed Model for Differential Expression Analysis in Single-Cell High-Throughput Gene Expression Data.一种用于单细胞高通量基因表达数据分析中差异表达分析的两部分混合模型。
Genes (Basel). 2022 Feb 18;13(2):377. doi: 10.3390/genes13020377.
3
An Adaptive Multivariate Two-Sample Test With Application to Microbiome Differential Abundance Analysis.一种应用于微生物组差异丰度分析的自适应多变量双样本检验
Front Genet. 2019 Apr 24;10:350. doi: 10.3389/fgene.2019.00350. eCollection 2019.
4
Investigating Drought Tolerance in Chickpea Using Genome-Wide Association Mapping and Genomic Selection Based on Whole-Genome Resequencing Data.利用基于全基因组重测序数据的全基因组关联图谱和基因组选择研究鹰嘴豆的耐旱性。
Front Plant Sci. 2018 Feb 19;9:190. doi: 10.3389/fpls.2018.00190. eCollection 2018.
5
Integrating Clinical and Multiple Omics Data for Prognostic Assessment across Human Cancers.整合临床和多种组学数据以进行跨人类癌症的预后评估。
Sci Rep. 2017 Dec 5;7(1):16954. doi: 10.1038/s41598-017-17031-8.
6
Integrated genomic analysis of biological gene sets with applications in lung cancer prognosis.生物基因集的综合基因组分析及其在肺癌预后中的应用
BMC Bioinformatics. 2017 Jul 11;18(1):336. doi: 10.1186/s12859-017-1737-2.
7
A unified powerful set-based test for sequencing data analysis of GxE interactions.一种用于基因与环境相互作用测序数据分析的统一的基于强大集的检验。
Biostatistics. 2017 Jan;18(1):119-131. doi: 10.1093/biostatistics/kxw034. Epub 2016 Jul 28.
8
An Adaptive Genetic Association Test Using Double Kernel Machines.一种使用双内核机器的自适应基因关联测试。
Stat Biosci. 2015 Oct 1;7(2):262-281. doi: 10.1007/s12561-014-9116-2. Epub 2014 Jun 24.
9
iGWAS: Integrative Genome-Wide Association Studies of Genetic and Genomic Data for Disease Susceptibility Using Mediation Analysis.整合全基因组关联研究:使用中介分析对疾病易感性的遗传和基因组数据进行整合全基因组关联研究。
Genet Epidemiol. 2015 Jul;39(5):347-56. doi: 10.1002/gepi.21905. Epub 2015 May 22.
10
Integrative analysis of micro-RNA, gene expression, and survival of glioblastoma multiforme.多形性胶质母细胞瘤的微小RNA、基因表达及生存情况的综合分析
Genet Epidemiol. 2015 Feb;39(2):134-43. doi: 10.1002/gepi.21875. Epub 2014 Dec 23.

本文引用的文献

1
Powerful SNP-set analysis for case-control genome-wide association studies.基于全基因组关联研究的病例对照 SNP 集分析。
Am J Hum Genet. 2010 Jun 11;86(6):929-42. doi: 10.1016/j.ajhg.2010.05.002.
2
A note on the effect on power of score tests via dimension reduction by penalized regression under the null.关于在原假设下通过惩罚回归进行降维对得分检验功效的影响的一则注释。
Int J Biostat. 2010 Mar 29;6(1):Article 12. doi: 10.2202/1557-4679.1231.
3
Genome-wide association for major depressive disorder: a possible role for the presynaptic protein piccolo.全基因组关联研究重度抑郁症:突触前蛋白 piccolo 的潜在作用。
Mol Psychiatry. 2009 Apr;14(4):359-75. doi: 10.1038/mp.2008.125. Epub 2008 Dec 9.
4
Estimation and testing for the effect of a genetic pathway on a disease outcome using logistic kernel machine regression via logistic mixed models.通过逻辑混合模型,使用逻辑核机器回归估计和检验遗传通路对疾病结局的影响。
BMC Bioinformatics. 2008 Jun 24;9:292. doi: 10.1186/1471-2105-9-292.
5
On multiple-testing correction in genome-wide association studies.关于全基因组关联研究中的多重检验校正
Genet Epidemiol. 2008 Sep;32(6):567-73. doi: 10.1002/gepi.20331.
6
A powerful and flexible multilocus association test for quantitative traits.一种用于数量性状的强大且灵活的多位点关联检验。
Am J Hum Genet. 2008 Feb;82(2):386-97. doi: 10.1016/j.ajhg.2007.10.010.
7
Genome-wide association study identifies genes for biomarkers of cardiovascular disease: serum urate and dyslipidemia.全基因组关联研究确定心血管疾病生物标志物的基因:血清尿酸和血脂异常。
Am J Hum Genet. 2008 Jan;82(1):139-49. doi: 10.1016/j.ajhg.2007.11.001.
8
Semiparametric regression of multidimensional genetic pathway data: least-squares kernel machines and linear mixed models.多维遗传通路数据的半参数回归:最小二乘核机器与线性混合模型
Biometrics. 2007 Dec;63(4):1079-88. doi: 10.1111/j.1541-0420.2007.00799.x.
9
Reproducibility of differential gene detection across multiple microarray studies.多个微阵列研究中差异基因检测的可重复性。
Annu Int Conf IEEE Eng Med Biol Soc. 2007;2007:4231-4. doi: 10.1109/IEMBS.2007.4353270.
10
Genome-wide association studies provide new insights into type 2 diabetes aetiology.全基因组关联研究为2型糖尿病的病因学提供了新的见解。
Nat Rev Genet. 2007 Sep;8(9):657-62. doi: 10.1038/nrg2178.