Suppr超能文献

遗传病例对照关联研究中的数据质量控制。

Data quality control in genetic case-control association studies.

机构信息

Genetic and Genomic Epidemiology Unit, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK.

出版信息

Nat Protoc. 2010 Sep;5(9):1564-73. doi: 10.1038/nprot.2010.116. Epub 2010 Aug 26.

Abstract

This protocol details the steps for data quality assessment and control that are typically carried out during case-control association studies. The steps described involve the identification and removal of DNA samples and markers that introduce bias. These critical steps are paramount to the success of a case-control study and are necessary before statistically testing for association. We describe how to use PLINK, a tool for handling SNP data, to perform assessments of failure rate per individual and per SNP and to assess the degree of relatedness between individuals. We also detail other quality-control procedures, including the use of SMARTPCA software for the identification of ancestral outliers. These platforms were selected because they are user-friendly, widely used and computationally efficient. Steps needed to detect and establish a disease association using case-control data are not discussed here. Issues concerning study design and marker selection in case-control studies have been discussed in our earlier protocols. This protocol, which is routinely used in our labs, should take approximately 8 h to complete.

摘要

本方案详细介绍了病例对照关联研究中通常进行的数据质量评估和控制步骤。所描述的步骤涉及鉴定和去除引入偏倚的 DNA 样本和标记物。这些关键步骤对于病例对照研究的成功至关重要,并且在进行关联的统计检验之前是必要的。我们描述了如何使用 PLINK,一种用于处理 SNP 数据的工具,来执行个体和 SNP 的失效率评估,并评估个体之间的亲缘关系程度。我们还详细介绍了其他质量控制程序,包括使用 SMARTPCA 软件识别祖先异常值。选择这些平台是因为它们易于使用、广泛使用且计算效率高。本方案未讨论使用病例对照数据检测和建立疾病关联所需的步骤。病例对照研究中有关研究设计和标记选择的问题已在我们之前的方案中讨论过。本方案在我们的实验室中经常使用,大约需要 8 小时才能完成。

相似文献

1
2
Basic statistical analysis in genetic case-control studies.遗传病例对照研究中的基本统计学分析。
Nat Protoc. 2011 Feb;6(2):121-33. doi: 10.1038/nprot.2010.182. Epub 2011 Feb 3.
7
coPLINK: A complementary tool to PLINK.coPLINK:PLINK 的补充工具。
PLoS One. 2020 Sep 18;15(9):e0239144. doi: 10.1371/journal.pone.0239144. eCollection 2020.

引用本文的文献

9
Genetics of growth rate in induced pluripotent stem cells.诱导多能干细胞生长速率的遗传学
bioRxiv. 2025 Jul 3:2025.07.02.662844. doi: 10.1101/2025.07.02.662844.

本文引用的文献

2
Marker selection for genetic case-control association studies.遗传病例对照关联研究的标记选择
Nat Protoc. 2009;4(5):743-52. doi: 10.1038/nprot.2009.38. Epub 2009 Apr 23.
9
A genotype calling algorithm for the Illumina BeadArray platform.一种适用于Illumina BeadArray平台的基因型分型算法。
Bioinformatics. 2007 Oct 15;23(20):2741-6. doi: 10.1093/bioinformatics/btm443. Epub 2007 Sep 10.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验