• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在全基因组关联研究(GWAS)中使用先前基因分型的对照:应用于中风遗传学网络(SiGN)。

Using previously genotyped controls in genome-wide association studies (GWAS): application to the Stroke Genetics Network (SiGN).

机构信息

Department of Medicine and Program for Personalized and Genomic Medicine, University of Maryland School of Medicine Baltimore, MD, USA ; Veterans Administration Medical Center Baltimore, MD, USA.

Department of Medicine, University of Texas Health Science Center Houston, TX, USA.

出版信息

Front Genet. 2014 Apr 29;5:95. doi: 10.3389/fgene.2014.00095. eCollection 2014.

DOI:10.3389/fgene.2014.00095
PMID:24808905
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4010766/
Abstract

Genome-wide association studies (GWAS) are widely applied to identify susceptibility loci for a variety of diseases using genotyping arrays that interrogate known polymorphisms throughout the genome. A particular strength of GWAS is that it is unbiased with respect to specific genomic elements (e.g., coding or regulatory regions of genes), and it has revealed important associations that would have never been suspected based on prior knowledge or assumptions. To date, the discovered SNPs associated with complex human traits tend to have small effect sizes, requiring very large sample sizes to achieve robust statistical power. To address these issues, a number of efficient strategies have emerged for conducting GWAS, including combining study results across multiple studies using meta-analysis, collecting cases through electronic health records, and using samples collected from other studies as controls that have already been genotyped and made publicly available (e.g., through deposition of de-identified data into dbGaP or EGA). In certain scenarios, it may be attractive to use already genotyped controls and divert resources to standardized collection, phenotyping, and genotyping of cases only. This strategy, however, requires that careful attention be paid to the choice of "public controls" and to the comparability of genetic data between cases and the public controls to ensure that any allele frequency differences observed between groups is attributable to locus-specific effects rather than to a systematic bias due to poor matching (population stratification) or differential genotype calling (batch effects). The goal of this paper is to describe some of the potential pitfalls in using previously genotyped control data. We focus on considerations related to the choice of control groups, the use of different genotyping platforms, and approaches to deal with population stratification when cases and controls are genotyped across different platforms.

摘要

全基因组关联研究(GWAS)广泛应用于使用基因分型阵列识别各种疾病的易感基因座,这些阵列可以检测基因组中的已知多态性。GWAS 的一个特别优势是它对特定基因组元素(例如基因的编码或调控区域)是无偏的,并且它揭示了重要的关联,这些关联是基于先前的知识或假设永远不会被怀疑的。迄今为止,与复杂人类特征相关的发现的 SNP 往往具有较小的效应大小,需要非常大的样本量才能实现稳健的统计功效。为了解决这些问题,已经出现了许多用于进行 GWAS 的有效策略,包括使用荟萃分析结合多个研究的结果、通过电子健康记录收集病例,以及使用已经进行基因分型并公开可用的其他研究的样本作为对照(例如,通过将去识别数据存入 dbGaP 或 EGA)。在某些情况下,使用已经基因分型的对照并将资源转移到仅对病例进行标准化收集、表型和基因分型可能会很有吸引力。然而,这种策略需要仔细注意“公共对照”的选择以及病例和公共对照之间遗传数据的可比性,以确保观察到的群体之间的任何等位基因频率差异归因于特定基因座的影响,而不是由于匹配不良(群体分层)或差异基因型调用(批次效应)引起的系统偏差。本文的目的是描述使用先前基因分型对照数据的一些潜在陷阱。我们专注于与对照组选择、使用不同基因分型平台以及在病例和对照在不同平台上进行基因分型时处理群体分层的方法相关的考虑因素。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11f8/4010766/0cb609a162ac/fgene-05-00095-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11f8/4010766/6d6415156c0d/fgene-05-00095-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11f8/4010766/0cb609a162ac/fgene-05-00095-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11f8/4010766/6d6415156c0d/fgene-05-00095-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11f8/4010766/0cb609a162ac/fgene-05-00095-g0002.jpg

相似文献

1
Using previously genotyped controls in genome-wide association studies (GWAS): application to the Stroke Genetics Network (SiGN).在全基因组关联研究(GWAS)中使用先前基因分型的对照:应用于中风遗传学网络(SiGN)。
Front Genet. 2014 Apr 29;5:95. doi: 10.3389/fgene.2014.00095. eCollection 2014.
2
Design considerations for genetic linkage and association studies.基因连锁与关联研究的设计考量
Methods Mol Biol. 2012;850:237-62. doi: 10.1007/978-1-61779-555-8_13.
3
Endometrial vezatin and its association with endometriosis risk.子宫内膜 vezatin 及其与子宫内膜异位症风险的关联。
Hum Reprod. 2016 May;31(5):999-1013. doi: 10.1093/humrep/dew047. Epub 2016 Mar 22.
4
Pleiotropy informed adaptive association test of multiple traits using genome-wide association study summary data.利用全基因组关联研究汇总数据进行多性状的多效性知情适应性关联测试。
Biometrics. 2019 Dec;75(4):1076-1085. doi: 10.1111/biom.13076. Epub 2019 Aug 2.
5
Extending the use of GWAS data by combining data from different genetic platforms.通过整合来自不同基因平台的数据来扩展全基因组关联研究(GWAS)数据的应用。
PLoS One. 2017 Feb 28;12(2):e0172082. doi: 10.1371/journal.pone.0172082. eCollection 2017.
6
Shared genetic factors for age at natural menopause in Iranian and European women.伊朗和欧洲女性自然绝经年龄的共享遗传因素。
Hum Reprod. 2013 Jul;28(7):1987-94. doi: 10.1093/humrep/det106. Epub 2013 Apr 16.
7
The eMERGE Network: a consortium of biorepositories linked to electronic medical records data for conducting genomic studies.eMERGE 网络:一个由生物库组成的联盟,与电子病历数据相关联,用于进行基因组研究。
BMC Med Genomics. 2011 Jan 26;4:13. doi: 10.1186/1755-8794-4-13.
8
GACT: a Genome build and Allele definition Conversion Tool for SNP imputation and meta-analysis in genetic association studies.GACT:一种用于基因关联研究中SNP插补和荟萃分析的基因组构建与等位基因定义转换工具。
BMC Genomics. 2014 Jul 19;15:610. doi: 10.1186/1471-2164-15-610.
9
Combining multi-population datasets for joint genome-wide association and meta-analyses: The case of bovine milk fat composition traits.联合多群体数据集进行全基因组关联和荟萃分析:以牛乳脂肪成分性状为例。
J Dairy Sci. 2019 Dec;102(12):11124-11141. doi: 10.3168/jds.2019-16676. Epub 2019 Sep 25.
10
Assessing batch effects of genotype calling algorithm BRLMM for the Affymetrix GeneChip Human Mapping 500 K array set using 270 HapMap samples.使用270个HapMap样本评估基因分型算法BRLMM对Affymetrix GeneChip Human Mapping 500 K芯片组的批次效应。
BMC Bioinformatics. 2008 Aug 12;9 Suppl 9(Suppl 9):S17. doi: 10.1186/1471-2105-9-S9-S17.

引用本文的文献

1
Genetic Variants Associated with Suspected Neonatal Hypoxic Ischaemic Encephalopathy: A Study in a South African Context.与疑似新生儿缺氧缺血性脑病相关的基因变异:南非背景下的一项研究。
Int J Mol Sci. 2025 Feb 27;26(5):2075. doi: 10.3390/ijms26052075.
2
Rare and common coding variants in lipid metabolism-related genes and their association with coronary artery disease.脂质代谢相关基因的罕见和常见编码变异及其与冠心病的关系。
BMC Cardiovasc Disord. 2024 Feb 9;24(1):97. doi: 10.1186/s12872-024-03759-5.
3
Increase in power by obtaining 10 or more controls per case when type-1 error is small in large-scale association studies.

本文引用的文献

1
Heart disease and stroke statistics--2014 update: a report from the American Heart Association.《2014年心脏病和中风统计数据更新:美国心脏协会报告》
Circulation. 2014 Jan 21;129(3):e28-e292. doi: 10.1161/01.cir.0000441139.02102.80. Epub 2013 Dec 18.
2
Stroke Genetics Network (SiGN) study: design and rationale for a genome-wide association study of ischemic stroke subtypes.中风遗传学网络(SiGN)研究:一项针对缺血性中风亚型的全基因组关联研究的设计和原理。
Stroke. 2013 Oct;44(10):2694-702. doi: 10.1161/STROKEAHA.113.001857. Epub 2013 Sep 10.
3
The power of meta-analysis in genome-wide association studies.
在大规模关联研究中,当Ⅰ类错误较小时,通过每例获得 10 个或更多对照来增加功效。
BMC Med Res Methodol. 2023 Jun 29;23(1):153. doi: 10.1186/s12874-023-01973-x.
4
Genetic Associations with Smoking Relapse and Proportion of Follow-up in Smoking Relapse throughout Adulthood in Pre- and Postmenopausal Women.绝经前和绝经后女性整个成年期吸烟复发与遗传相关性及吸烟复发随访比例。
Cancer Prev Res (Phila). 2023 May 1;16(5):269-279. doi: 10.1158/1940-6207.CAPR-22-0421.
5
A combination of HLA-DP α and β chain polymorphisms paired with a SNP in the DPB1 3' UTR region, denoting expression levels, are associated with atopic dermatitis.HLA-DP α链和β链多态性与DPB1 3'非翻译区(UTR)区域中的一个单核苷酸多态性(SNP)相结合,该SNP表示表达水平,与特应性皮炎相关。
Front Genet. 2023 Jan 23;14:1004138. doi: 10.3389/fgene.2023.1004138. eCollection 2023.
6
Variants Tagging Haplotype Block in Association with First Myocardial Infarction and Plasma Galectin-3 Six Months after the Acute Event.变异标签 与急性事件后 6 个月首次心肌梗死和血浆半乳糖凝集素-3 相关的单倍型块。
Genes (Basel). 2022 Dec 29;14(1):109. doi: 10.3390/genes14010109.
7
Pathogen exposure misclassification can bias association signals in GWAS of infectious diseases when using population-based common control subjects.当使用基于人群的常见对照受试者进行传染病的 GWAS 时,病原体暴露分类错误可能会使关联信号产生偏差。
Am J Hum Genet. 2023 Feb 2;110(2):336-348. doi: 10.1016/j.ajhg.2022.12.013. Epub 2023 Jan 16.
8
A Genome-Wide Association Study Reveals a -Centered Molecular Network Associated with Alcohol Dependence and Related Clinical Measures.一项全基因组关联研究揭示了一个与酒精依赖及相关临床指标相关的以-为中心的分子网络。 (注:原文中“-Centered”处有信息缺失,这里按照完整格式翻译,但该部分信息不完整可能影响对整体内容的准确理解。)
Biomedicines. 2022 Nov 22;10(12):3007. doi: 10.3390/biomedicines10123007.
9
A data harmonization pipeline to leverage external controls and boost power in GWAS.利用外部对照和提高 GWAS 效能的数据协调管道。
Hum Mol Genet. 2022 Feb 3;31(3):481-489. doi: 10.1093/hmg/ddab261.
10
In-Depth Analysis of Genetic Variation Associated with Severe West Nile Viral Disease.与严重西尼罗河病毒病相关的基因变异深入分析
Vaccines (Basel). 2020 Dec 8;8(4):744. doi: 10.3390/vaccines8040744.
荟萃分析在全基因组关联研究中的作用。
Annu Rev Genomics Hum Genet. 2013;14:441-65. doi: 10.1146/annurev-genom-091212-153520. Epub 2013 May 24.
4
Integration of genome-wide association studies with biological knowledge identifies six novel genes related to kidney function.全基因组关联研究与生物学知识的整合确定了与肾功能相关的六个新基因。
Hum Mol Genet. 2012 Dec 15;21(24):5329-43. doi: 10.1093/hmg/dds369. Epub 2012 Sep 8.
5
zCall: a rare variant caller for array-based genotyping: genetics and population analysis.zCall:一种用于基于阵列的基因分型的罕见变体调用者:遗传学和群体分析。
Bioinformatics. 2012 Oct 1;28(19):2543-5. doi: 10.1093/bioinformatics/bts479. Epub 2012 Jul 27.
6
Results on differential and dependent measurement error of the exposure and the outcome using signed directed acyclic graphs.使用有向无环图(DAG)对暴露和结局的差异和相关测量误差进行分析。
Am J Epidemiol. 2012 Jun 15;175(12):1303-10. doi: 10.1093/aje/kwr458. Epub 2012 May 8.
7
The genetics of type 2 diabetes: what have we learned from GWAS?2 型糖尿病的遗传学:全基因组关联研究告诉了我们什么?
Ann N Y Acad Sci. 2010 Nov;1212:59-77. doi: 10.1111/j.1749-6632.2010.05838.x.
8
Robust relationship inference in genome-wide association studies.全基因组关联研究中的稳健关系推断。
Bioinformatics. 2010 Nov 15;26(22):2867-73. doi: 10.1093/bioinformatics/btq559. Epub 2010 Oct 5.
9
Quality control and quality assurance in genotypic data for genome-wide association studies.全基因组关联研究中基因型数据的质量控制和质量保证。
Genet Epidemiol. 2010 Sep;34(6):591-602. doi: 10.1002/gepi.20516.
10
Biological, clinical and population relevance of 95 loci for blood lipids.95 个与血脂相关的生物学、临床和人群相关性位点。
Nature. 2010 Aug 5;466(7307):707-13. doi: 10.1038/nature09270.