• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

将家族疾病史纳入基于人群的遗传关联研究,并控制病例-对照失衡。

Incorporating family disease history and controlling case-control imbalance for population-based genetic association studies.

机构信息

Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA.

Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA.

出版信息

Bioinformatics. 2022 Sep 15;38(18):4337-4343. doi: 10.1093/bioinformatics/btac459.

DOI:10.1093/bioinformatics/btac459
PMID:35876838
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9477535/
Abstract

MOTIVATION

In the genome-wide association analysis of population-based biobanks, most diseases have low prevalence, which results in low detection power. One approach to tackle the problem is using family disease history, yet existing methods are unable to address type I error inflation induced by increased correlation of phenotypes among closely related samples, as well as unbalanced phenotypic distribution.

RESULTS

We propose a new method for genetic association test with family disease history, mixed-model-based Test with Adjusted Phenotype and Empirical saddlepoint approximation, which controls for increased phenotype correlation by adopting a two-variance-component mixed model, accounts for case-control imbalance by using empirical saddlepoint approximation, and is flexible to incorporate any existing adjusted phenotypes, such as phenotypes from the LT-FH method. We show through simulation studies and analysis of UK Biobank data of white British samples and the Korean Genome and Epidemiology Study of Korean samples that the proposed method is robust and yields better calibration compared to existing methods while gaining power for detection of variant-phenotype associations.

AVAILABILITY AND IMPLEMENTATION

The summary statistics and code generated in this study are available at https://github.com/styvon/TAPE.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

在基于人群的生物库的全基因组关联分析中,大多数疾病的患病率较低,导致检测能力较低。一种解决该问题的方法是使用家族病史,但现有的方法无法解决由于密切相关样本中表型相关性增加而导致的 I 型错误膨胀,以及表型分布不平衡的问题。

结果

我们提出了一种新的具有家族病史的遗传关联测试方法,混合模型的基于调整表型的测试和经验鞍点逼近,该方法通过采用两方差分量混合模型来控制表型相关性的增加,通过经验鞍点逼近来考虑病例对照不平衡,并灵活地结合任何现有的调整表型,例如 LT-FH 方法中的表型。我们通过模拟研究和对英国生物库中白种英国人样本和韩国基因组和流行病学研究中韩国样本的数据分析表明,与现有方法相比,所提出的方法具有稳健性和更好的校准效果,同时提高了检测变异-表型关联的能力。

可用性和实现

本研究生成的汇总统计数据和代码可在 https://github.com/styvon/TAPE 上获得。

补充信息

补充数据可在生物信息学在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/816c/9477535/192a9cd31557/btac459f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/816c/9477535/8bb16f8820a0/btac459f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/816c/9477535/f838a41e17fb/btac459f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/816c/9477535/7c12c93ae582/btac459f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/816c/9477535/192a9cd31557/btac459f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/816c/9477535/8bb16f8820a0/btac459f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/816c/9477535/f838a41e17fb/btac459f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/816c/9477535/7c12c93ae582/btac459f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/816c/9477535/192a9cd31557/btac459f4.jpg

相似文献

1
Incorporating family disease history and controlling case-control imbalance for population-based genetic association studies.将家族疾病史纳入基于人群的遗传关联研究,并控制病例-对照失衡。
Bioinformatics. 2022 Sep 15;38(18):4337-4343. doi: 10.1093/bioinformatics/btac459.
2
Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies.在大规模的遗传关联研究中,有效地控制病例-对照不平衡和样本相关性。
Nat Genet. 2018 Sep;50(9):1335-1341. doi: 10.1038/s41588-018-0184-y. Epub 2018 Aug 13.
3
A Fast and Accurate Method for Genome-wide Scale Phenome-wide G × E Analysis and Its Application to UK Biobank.一种用于全基因组规模表型全基因组 G × E 分析的快速准确方法及其在 UK Biobank 中的应用。
Am J Hum Genet. 2019 Dec 5;105(6):1182-1192. doi: 10.1016/j.ajhg.2019.10.008. Epub 2019 Nov 14.
4
UK Biobank Whole-Exome Sequence Binary Phenome Analysis with Robust Region-Based Rare-Variant Test.英国生物银行全外显子组序列双表型分析与稳健基于区域的罕见变异测试。
Am J Hum Genet. 2020 Jan 2;106(1):3-12. doi: 10.1016/j.ajhg.2019.11.012. Epub 2019 Dec 19.
5
Joint analysis of multiple phenotypes for extremely unbalanced case-control association studies using multi-layer network.利用多层网络对极度不平衡病例对照关联研究的多种表型进行联合分析。
Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad707.
6
Joint analysis of multiple phenotypes for extremely unbalanced case-control association studies.极端不平衡病例对照关联研究的多表型联合分析。
Genet Epidemiol. 2023 Mar;47(2):185-197. doi: 10.1002/gepi.22513. Epub 2023 Jan 24.
7
Liability threshold modeling of case-control status and family history of disease increases association power.病例对照状态和疾病家族史的 LIABILITY 阈值建模增加了关联力度。
Nat Genet. 2020 May;52(5):541-547. doi: 10.1038/s41588-020-0613-6. Epub 2020 Apr 20.
8
Robust meta-analysis of biobank-based genome-wide association studies with unbalanced binary phenotypes.基于生物库的不平衡二分类表型全基因组关联研究的稳健荟萃分析。
Genet Epidemiol. 2019 Jul;43(5):462-476. doi: 10.1002/gepi.22197. Epub 2019 Feb 22.
9
A Fast and Accurate Method for Genome-Wide Time-to-Event Data Analysis and Its Application to UK Biobank.一种用于全基因组事件时间数据分析的快速而准确的方法及其在 UK Biobank 中的应用。
Am J Hum Genet. 2020 Aug 6;107(2):222-233. doi: 10.1016/j.ajhg.2020.06.003. Epub 2020 Jun 25.
10
Leveraging family history in genetic association analyses of binary traits.利用家族史进行二元性状的遗传关联分析。
BMC Genomics. 2022 Oct 1;23(1):678. doi: 10.1186/s12864-022-08897-8.

引用本文的文献

1
Genome-scale metabolic network reconstruction analysis identifies bacterial vaginosis-associated metabolic interactions.全基因组规模代谢网络重建分析确定了与细菌性阴道病相关的代谢相互作用。
Nat Commun. 2025 May 22;16(1):4768. doi: 10.1038/s41467-025-59965-y.
2
Genome-wide study on 72,298 individuals in Korean biobank data for 76 traits.针对韩国生物样本库数据中72298名个体的76种性状进行全基因组研究。
Cell Genom. 2022 Oct 5;2(10):100189. doi: 10.1016/j.xgen.2022.100189. eCollection 2022 Oct 12.
3
The HUNT study: A population-based cohort for genetic research.

本文引用的文献

1
Remote modulation of lncRNA by risk variant at 16p13 underlying genetic susceptibility to gastric cancer.长链非编码 RNA 通过位于 16p13 的风险变异体的远程调控与胃癌遗传易感性有关。
Sci Adv. 2020 May 20;6(21):eaay5525. doi: 10.1126/sciadv.aay5525. eCollection 2020 May.
2
A Fast and Accurate Method for Genome-Wide Time-to-Event Data Analysis and Its Application to UK Biobank.一种用于全基因组事件时间数据分析的快速而准确的方法及其在 UK Biobank 中的应用。
Am J Hum Genet. 2020 Aug 6;107(2):222-233. doi: 10.1016/j.ajhg.2020.06.003. Epub 2020 Jun 25.
3
Liability threshold modeling of case-control status and family history of disease increases association power.
HUNT研究:一项基于人群的基因研究队列。
Cell Genom. 2022 Oct 12;2(10):100193. doi: 10.1016/j.xgen.2022.100193.
病例对照状态和疾病家族史的 LIABILITY 阈值建模增加了关联力度。
Nat Genet. 2020 May;52(5):541-547. doi: 10.1038/s41588-020-0613-6. Epub 2020 Apr 20.
4
A resource-efficient tool for mixed model association analysis of large-scale data.一种资源高效的工具,用于大规模数据的混合模型关联分析。
Nat Genet. 2019 Dec;51(12):1749-1755. doi: 10.1038/s41588-019-0530-8. Epub 2019 Nov 25.
5
Meta-analysis of genome-wide association studies and functional assays decipher susceptibility genes for gastric cancer in Chinese populations.基于全基因组关联研究和功能测定的荟萃分析,揭示了中国人群胃癌易感性的基因。
Gut. 2020 Apr;69(4):641-651. doi: 10.1136/gutjnl-2019-318760. Epub 2019 Aug 5.
6
The UK Biobank resource with deep phenotyping and genomic data.英国生物银行资源库,具有深度表型和基因组数据。
Nature. 2018 Oct;562(7726):203-209. doi: 10.1038/s41586-018-0579-z. Epub 2018 Oct 10.
7
Genome-wide association study identifies gastric cancer susceptibility loci at 12q24.11-12 and 20q11.21.全基因组关联研究鉴定出 12q24.11-12 和 20q11.21 两个胃癌易感性位点。
Cancer Sci. 2018 Dec;109(12):4015-4024. doi: 10.1111/cas.13815. Epub 2018 Oct 31.
8
Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies.在大规模的遗传关联研究中,有效地控制病例-对照不平衡和样本相关性。
Nat Genet. 2018 Sep;50(9):1335-1341. doi: 10.1038/s41588-018-0184-y. Epub 2018 Aug 13.
9
Overview of the BioBank Japan Project: Study design and profile.日本生物样本库项目概述:研究设计与概况
J Epidemiol. 2017 Mar;27(3S):S2-S8. doi: 10.1016/j.je.2016.12.005. Epub 2017 Feb 8.
10
Case-control association mapping by proxy using family history of disease.基于疾病家族史的代理病例对照关联映射。
Nat Genet. 2017 Mar;49(3):325-331. doi: 10.1038/ng.3766. Epub 2017 Jan 16.