• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多表型遗传研究中关联筛选的协变量选择

Covariate selection for association screening in multiphenotype genetic studies.

作者信息

Aschard Hugues, Guillemot Vincent, Vilhjalmsson Bjarni, Patel Chirag J, Skurnik David, Ye Chun J, Wolpin Brian, Kraft Peter, Zaitlen Noah

机构信息

Centre de Bioinformatique, Biostatistique et Biologie Intégrative (C3BI), Institut Pasteur, Paris, France.

Department of Epidemiology, Harvard TH Chan School of Public Health, Boston, Massachusetts, USA.

出版信息

Nat Genet. 2017 Dec;49(12):1789-1795. doi: 10.1038/ng.3975. Epub 2017 Oct 16.

DOI:10.1038/ng.3975
PMID:29038595
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5797835/
Abstract

Testing for associations in big data faces the problem of multiple comparisons, wherein true signals are difficult to detect on the background of all associations queried. This difficulty is particularly salient in human genetic association studies, in which phenotypic variation is often driven by numerous variants of small effect. The current strategy to improve power to identify these weak associations consists of applying standard marginal statistical approaches and increasing study sample sizes. Although successful, this approach does not leverage the environmental and genetic factors shared among the multiple phenotypes collected in contemporary cohorts. Here we developed covariates for multiphenotype studies (CMS), an approach that improves power when correlated phenotypes are measured on the same samples. Our analyses of real and simulated data provide direct evidence that correlated phenotypes can be used to achieve increases in power to levels often surpassing the power gained by a twofold increase in sample size.

摘要

在大数据中检测关联面临多重比较问题,即在所有查询的关联背景下,真实信号难以检测。这种困难在人类基因关联研究中尤为突出,其中表型变异通常由众多小效应变体驱动。目前提高识别这些弱关联能力的策略包括应用标准的边际统计方法和增加研究样本量。尽管这种方法取得了成功,但它没有利用当代队列中收集的多个表型之间共享的环境和遗传因素。在此,我们开发了多表型研究协变量(CMS),这是一种在对相同样本测量相关表型时提高能力的方法。我们对真实数据和模拟数据的分析提供了直接证据,表明相关表型可用于将能力提高到通常超过样本量翻倍所获得的能力水平。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b08/5797835/7f047ace480d/nihms908080f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b08/5797835/4cfc4d9ee4ca/nihms908080f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b08/5797835/c0494a651cd7/nihms908080f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b08/5797835/9315cde766d8/nihms908080f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b08/5797835/1e1264cb9fa3/nihms908080f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b08/5797835/7f047ace480d/nihms908080f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b08/5797835/4cfc4d9ee4ca/nihms908080f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b08/5797835/c0494a651cd7/nihms908080f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b08/5797835/9315cde766d8/nihms908080f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b08/5797835/1e1264cb9fa3/nihms908080f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b08/5797835/7f047ace480d/nihms908080f5.jpg

相似文献

1
Covariate selection for association screening in multiphenotype genetic studies.多表型遗传研究中关联筛选的协变量选择
Nat Genet. 2017 Dec;49(12):1789-1795. doi: 10.1038/ng.3975. Epub 2017 Oct 16.
2
MARS: leveraging allelic heterogeneity to increase power of association testing.MARS:利用等位基因异质性提高关联测试的功效。
Genome Biol. 2021 Apr 30;22(1):128. doi: 10.1186/s13059-021-02353-8.
3
Backward genotype-transcript-phenotype association mapping.反向基因型-转录本-表型关联映射。
Methods. 2017 Oct 1;129:18-23. doi: 10.1016/j.ymeth.2017.09.004. Epub 2017 Sep 14.
4
Phenotypic complexity, measurement bias, and poor phenotypic resolution contribute to the missing heritability problem in genetic association studies.表型复杂性、测量偏差和表型分辨率低是导致遗传关联研究中遗传力缺失问题的原因。
PLoS One. 2010 Nov 10;5(11):e13929. doi: 10.1371/journal.pone.0013929.
5
Model-based assessment of replicability for genome-wide association meta-analysis.基于模型的全基因组关联荟萃分析可重复性评估。
Nat Commun. 2021 Mar 30;12(1):1964. doi: 10.1038/s41467-021-21226-z.
6
Improving the power of genetic association tests with imperfect phenotype derived from electronic medical records.利用源自电子病历的不完美表型提高基因关联测试的效能。
Hum Genet. 2014 Nov;133(11):1369-82. doi: 10.1007/s00439-014-1466-9. Epub 2014 Jul 26.
7
Detecting sample misidentifications in genetic association studies.在基因关联研究中检测样本误认
Stat Appl Genet Mol Biol. 2012;11(3):Article 13. doi: 10.1515/1544-6115.1772.
8
Simultaneous Modeling of Disease Status and Clinical Phenotypes To Increase Power in Genome-Wide Association Studies.疾病状态与临床表型的联合建模以提高全基因组关联研究的效能
Genetics. 2017 Mar;205(3):1041-1047. doi: 10.1534/genetics.116.198473. Epub 2017 Jan 27.
9
Matching whole genomes to rare genetic disorders: Identification of potential causative variants using phenotype-weighted knowledge in the CAGI SickKids5 clinical genomes challenge.将全基因组与罕见遗传疾病相匹配:在 CAGI SickKids5 临床基因组挑战中使用表型加权知识鉴定潜在的致病变异。
Hum Mutat. 2020 Feb;41(2):347-362. doi: 10.1002/humu.23933. Epub 2019 Nov 15.
10
Semiparametric Allelic Tests for Mapping Multiple Phenotypes: Binomial Regression and Mahalanobis Distance.用于定位多种表型的半参数等位基因检验:二项式回归和马氏距离
Genet Epidemiol. 2015 Dec;39(8):635-50. doi: 10.1002/gepi.21930. Epub 2015 Oct 23.

引用本文的文献

1
Improving GWAS performance in underrepresented groups by appropriate modeling of genetics, environment, and sociocultural factors.通过对遗传、环境和社会文化因素进行适当建模,提高代表性不足群体中的全基因组关联研究(GWAS)性能。
bioRxiv. 2024 Oct 29:2024.10.28.620716. doi: 10.1101/2024.10.28.620716.
2
Joint regression analysis of multiple traits based on genetic relationships.基于遗传关系的多性状联合回归分析
Bioinform Adv. 2024 Jan 4;4(1):vbad192. doi: 10.1093/bioadv/vbad192. eCollection 2024.
3
Phenotype integration improves power and preserves specificity in biobank-based genetic studies of major depressive disorder.

本文引用的文献

1
Adjusting for Principal Components of Molecular Phenotypes Induces Replicating False Positives.调整分子表型的主成分会导致复制的假阳性。
Genetics. 2019 Apr;211(4):1179-1189. doi: 10.1534/genetics.118.301768. Epub 2019 Jan 28.
2
Consensus Genome-Wide Expression Quantitative Trait Loci and Their Relationship with Human Complex Trait Disease.全基因组表达数量性状位点共识及其与人类复杂性状疾病的关系。
OMICS. 2016 Jul;20(7):400-14. doi: 10.1089/omi.2016.0063.
3
A multiple-phenotype imputation method for genetic studies.一种用于基因研究的多表型插补方法。
表型整合提高了基于生物库的重度抑郁症遗传研究的功效并保持了特异性。
Nat Genet. 2023 Dec;55(12):2082-2093. doi: 10.1038/s41588-023-01559-9. Epub 2023 Nov 20.
4
An epidemiological introduction to human metabolomic investigations.人类代谢组学研究的流行病学概论。
Trends Endocrinol Metab. 2023 Sep;34(9):505-525. doi: 10.1016/j.tem.2023.06.006. Epub 2023 Jul 17.
5
Leveraging pleiotropy for joint analysis of genome-wide association studies with per trait interpretations.利用多效性对全基因组关联研究进行联合分析,并对每个性状进行解释。
PLoS Genet. 2022 Nov 7;18(11):e1010447. doi: 10.1371/journal.pgen.1010447. eCollection 2022 Nov.
6
Contrasting Water Withholding Responses of Young Maize Plants Reveal Link Between Lipid Peroxidation and Osmotic Regulation Corroborated by Genetic Analysis.玉米幼苗不同水分胁迫响应揭示脂质过氧化与渗透调节的联系并得到遗传分析的证实
Front Plant Sci. 2022 Jul 6;13:804630. doi: 10.3389/fpls.2022.804630. eCollection 2022.
7
Accounting for age of onset and family history improves power in genome-wide association studies.考虑发病年龄和家族史可提高全基因组关联研究的效能。
Am J Hum Genet. 2022 Mar 3;109(3):417-432. doi: 10.1016/j.ajhg.2022.01.009. Epub 2022 Feb 8.
8
Transcriptomic profiling of whole blood in 22q11.2 reciprocal copy number variants reveals that cell proportion highly impacts gene expression.22q11.2相互拷贝数变异中全血的转录组分析表明细胞比例对基因表达有高度影响。
Brain Behav Immun Health. 2021 Nov 9;18:100386. doi: 10.1016/j.bbih.2021.100386. eCollection 2021 Dec.
9
Wavelet Screening: a novel approach to analyzing GWAS data.小波筛选:一种分析 GWAS 数据的新方法。
BMC Bioinformatics. 2021 Oct 7;22(1):484. doi: 10.1186/s12859-021-04356-5.
10
Mitochondrial DNA variants modulate N-formylmethionine, proteostasis and risk of late-onset human diseases.线粒体 DNA 变异体调节 N-甲酰甲硫氨酸、蛋白质平衡和晚年发病风险。
Nat Med. 2021 Sep;27(9):1564-1575. doi: 10.1038/s41591-021-01441-3. Epub 2021 Aug 23.
Nat Genet. 2016 Apr;48(4):466-72. doi: 10.1038/ng.3513. Epub 2016 Feb 22.
4
Iterative Usage of Fixed and Random Effect Models for Powerful and Efficient Genome-Wide Association Studies.用于强大且高效的全基因组关联研究的固定效应模型和随机效应模型的迭代使用
PLoS Genet. 2016 Feb 1;12(2):e1005767. doi: 10.1371/journal.pgen.1005767. eCollection 2016 Feb.
5
Many Phenotypes Without Many False Discoveries: Error Controlling Strategies for Multitrait Association Studies.无众多错误发现的多种表型:多性状关联研究的误差控制策略
Genet Epidemiol. 2016 Jan;40(1):45-56. doi: 10.1002/gepi.21942. Epub 2015 Dec 2.
6
An atlas of genetic correlations across human diseases and traits.人类疾病与性状的遗传相关性图谱。
Nat Genet. 2015 Nov;47(11):1236-41. doi: 10.1038/ng.3406. Epub 2015 Sep 28.
7
Genetic studies of body mass index yield new insights for obesity biology.遗传研究体重指数为肥胖生物学提供了新的见解。
Nature. 2015 Feb 12;518(7538):197-206. doi: 10.1038/nature14177.
8
Adjusting for heritable covariates can bias effect estimates in genome-wide association studies.在全基因组关联研究中,对可遗传的协变量进行调整可能会使效应估计产生偏差。
Am J Hum Genet. 2015 Feb 5;96(2):329-39. doi: 10.1016/j.ajhg.2014.12.021. Epub 2015 Jan 29.
9
Elevation of circulating branched-chain amino acids is an early event in human pancreatic adenocarcinoma development.循环中支链氨基酸水平升高是人类胰腺腺癌发生过程中的早期事件。
Nat Med. 2014 Oct;20(10):1193-1198. doi: 10.1038/nm.3686. Epub 2014 Sep 28.
10
Genome-wide association study identifies multiple susceptibility loci for pancreatic cancer.全基因组关联研究确定了胰腺癌的多个易感基因座。
Nat Genet. 2014 Sep;46(9):994-1000. doi: 10.1038/ng.3052. Epub 2014 Aug 3.