• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

遗传病例对照关联研究中的数据质量控制。

Data quality control in genetic case-control association studies.

机构信息

Genetic and Genomic Epidemiology Unit, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK.

出版信息

Nat Protoc. 2010 Sep;5(9):1564-73. doi: 10.1038/nprot.2010.116. Epub 2010 Aug 26.

DOI:10.1038/nprot.2010.116
PMID:21085122
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3025522/
Abstract

This protocol details the steps for data quality assessment and control that are typically carried out during case-control association studies. The steps described involve the identification and removal of DNA samples and markers that introduce bias. These critical steps are paramount to the success of a case-control study and are necessary before statistically testing for association. We describe how to use PLINK, a tool for handling SNP data, to perform assessments of failure rate per individual and per SNP and to assess the degree of relatedness between individuals. We also detail other quality-control procedures, including the use of SMARTPCA software for the identification of ancestral outliers. These platforms were selected because they are user-friendly, widely used and computationally efficient. Steps needed to detect and establish a disease association using case-control data are not discussed here. Issues concerning study design and marker selection in case-control studies have been discussed in our earlier protocols. This protocol, which is routinely used in our labs, should take approximately 8 h to complete.

摘要

本方案详细介绍了病例对照关联研究中通常进行的数据质量评估和控制步骤。所描述的步骤涉及鉴定和去除引入偏倚的 DNA 样本和标记物。这些关键步骤对于病例对照研究的成功至关重要,并且在进行关联的统计检验之前是必要的。我们描述了如何使用 PLINK,一种用于处理 SNP 数据的工具,来执行个体和 SNP 的失效率评估,并评估个体之间的亲缘关系程度。我们还详细介绍了其他质量控制程序,包括使用 SMARTPCA 软件识别祖先异常值。选择这些平台是因为它们易于使用、广泛使用且计算效率高。本方案未讨论使用病例对照数据检测和建立疾病关联所需的步骤。病例对照研究中有关研究设计和标记选择的问题已在我们之前的方案中讨论过。本方案在我们的实验室中经常使用,大约需要 8 小时才能完成。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46c8/3025522/0ea962a2441a/ukmss-33586-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46c8/3025522/b8593200fd0d/ukmss-33586-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46c8/3025522/fcc4e5bf390c/ukmss-33586-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46c8/3025522/0ea962a2441a/ukmss-33586-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46c8/3025522/b8593200fd0d/ukmss-33586-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46c8/3025522/fcc4e5bf390c/ukmss-33586-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46c8/3025522/0ea962a2441a/ukmss-33586-f0003.jpg

相似文献

1
Data quality control in genetic case-control association studies.遗传病例对照关联研究中的数据质量控制。
Nat Protoc. 2010 Sep;5(9):1564-73. doi: 10.1038/nprot.2010.116. Epub 2010 Aug 26.
2
Basic statistical analysis in genetic case-control studies.遗传病例对照研究中的基本统计学分析。
Nat Protoc. 2011 Feb;6(2):121-33. doi: 10.1038/nprot.2010.182. Epub 2011 Feb 3.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
Erratum: High-Throughput Identification of Resistance to Pseudomonas syringae pv. Tomato in Tomato using Seedling Flood Assay.勘误:利用幼苗浸没法高通量鉴定番茄对丁香假单胞菌 pv.番茄的抗性。
J Vis Exp. 2023 Oct 18(200). doi: 10.3791/6576.
5
Odyssey: a semi-automated pipeline for phasing, imputation, and analysis of genome-wide genetic data.奥德赛:一个用于全基因组遗传数据相位、插补和分析的半自动流水线。
BMC Bioinformatics. 2019 Jun 28;20(1):364. doi: 10.1186/s12859-019-2964-5.
6
fcGENE: a versatile tool for processing and transforming SNP datasets.fcGENE:一种用于处理和转换单核苷酸多态性数据集的通用工具。
PLoS One. 2014 Jul 22;9(7):e97589. doi: 10.1371/journal.pone.0097589. eCollection 2014.
7
coPLINK: A complementary tool to PLINK.coPLINK:PLINK 的补充工具。
PLoS One. 2020 Sep 18;15(9):e0239144. doi: 10.1371/journal.pone.0239144. eCollection 2020.
8
[Standard technical specifications for methacholine chloride (Methacholine) bronchial challenge test (2023)].[氯化乙酰甲胆碱支气管激发试验标准技术规范(2023年)]
Zhonghua Jie He He Hu Xi Za Zhi. 2024 Feb 12;47(2):101-119. doi: 10.3760/cma.j.cn112147-20231019-00247.
9
PSReliP: an integrated pipeline for analysis and visualization of population structure and relatedness based on genome-wide genetic variant data.PSReliP:一个基于全基因组遗传变异数据的分析和可视化群体结构及亲缘关系的集成分析工具。
BMC Bioinformatics. 2023 Apr 5;24(1):135. doi: 10.1186/s12859-023-05169-4.
10
Shrinkage estimation for robust and efficient screening of single-SNP association from case-control genome-wide association studies.基于病例对照全基因组关联研究的单 SNP 关联稳健高效筛选的收缩估计。
Genet Epidemiol. 2009 Dec;33(8):740-50. doi: 10.1002/gepi.20428.

引用本文的文献

1
Regulatory polymorphisms of , , and genes affect survival of patients with immunotherapy-treated lung cancer.、、和基因的调控多态性影响接受免疫治疗的肺癌患者的生存率。
J Immunother Cancer. 2025 Sep 1;13(9):e011526. doi: 10.1136/jitc-2025-011526.
2
The interaction between candidate SNPs and social factors significantly influences the prevalence of alcohol use disorder in Chinese young male adults.候选单核苷酸多态性与社会因素之间的相互作用显著影响中国年轻男性成年人酒精使用障碍的患病率。
PLoS One. 2025 Aug 26;20(8):e0330822. doi: 10.1371/journal.pone.0330822. eCollection 2025.
3
Mediterranean-DASH Intervention for Neurodegenerative Delay (MIND) Trial: Genetic Resource for Precision Nutrition.

本文引用的文献

1
An evaluation of statistical approaches to rare variant analysis in genetic association studies.遗传关联研究中罕见变异分析的统计方法评估。
Genet Epidemiol. 2010 Feb;34(2):188-93. doi: 10.1002/gepi.20450.
2
Marker selection for genetic case-control association studies.遗传病例对照关联研究的标记选择
Nat Protoc. 2009;4(5):743-52. doi: 10.1038/nprot.2009.38. Epub 2009 Apr 23.
3
Genome-wide association study for early-onset and morbid adult obesity identifies three new risk loci in European populations.针对早发性和病态成人肥胖的全基因组关联研究在欧洲人群中发现了三个新的风险基因座。
地中海饮食延缓神经退行性变干预试验(MIND):精准营养的遗传资源
Nutrients. 2025 Aug 4;17(15):2548. doi: 10.3390/nu17152548.
4
Health and Aging Brain Study-Health Disparities (HABS-HD) methods and partner characteristics.健康与衰老大脑研究-健康差异(HABS-HD)方法及合作伙伴特征
Alzheimers Dement (N Y). 2025 Aug 12;11(3):e70140. doi: 10.1002/trc2.70140. eCollection 2025 Jul-Sep.
5
Genome-wide association study and molecular marker development for resistance to soybean cyst nematode in soybean.大豆抗大豆胞囊线虫的全基因组关联研究及分子标记开发
Plant Genome. 2025 Sep;18(3):e70083. doi: 10.1002/tpg2.70083.
6
Genome-wide association study reveals that the SUS domain protein SS7 positively regulates seed storability in rice (Oryza sativa L.).全基因组关联研究表明,SUS结构域蛋白SS7正向调控水稻(Oryza sativa L.)种子的耐贮藏性。
Theor Appl Genet. 2025 Aug 7;138(9):202. doi: 10.1007/s00122-025-04976-y.
7
Measuring the Associations Between Brain Morphometry and Polygenic Risk Scores for Substance use Disorders in Drug-Naive Adolescents.测量初涉毒品青少年的脑形态测量学与物质使用障碍多基因风险评分之间的关联。
Behav Genet. 2025 Jul 25. doi: 10.1007/s10519-025-10227-z.
8
Genetic prediction of lower limb isometric strength changes after 12 weeks of resistance training.抗阻训练12周后下肢等长肌力变化的遗传预测
Biol Sport. 2025 Jan 14;42(3):3-15. doi: 10.5114/biolsport.2025.145918. eCollection 2025 Jul.
9
Genetics of growth rate in induced pluripotent stem cells.诱导多能干细胞生长速率的遗传学
bioRxiv. 2025 Jul 3:2025.07.02.662844. doi: 10.1101/2025.07.02.662844.
10
Increased genetic protection against Alzheimer's disease in centenarians.百岁老人对阿尔茨海默病的遗传保护增强。
Geroscience. 2025 Jul 4. doi: 10.1007/s11357-025-01774-8.
Nat Genet. 2009 Feb;41(2):157-9. doi: 10.1038/ng.301. Epub 2009 Jan 18.
4
Ulcerative colitis-risk loci on chromosomes 1p36 and 12q15 found by genome-wide association study.通过全基因组关联研究发现的1号染色体p36区域和12号染色体q15区域的溃疡性结肠炎风险基因座。
Nat Genet. 2009 Feb;41(2):216-20. doi: 10.1038/ng.275. Epub 2009 Jan 4.
5
Long-range LD can confound genome scans in admixed populations.长程连锁不平衡可能会混淆混合群体中的基因组扫描结果。
Am J Hum Genet. 2008 Jul;83(1):132-5; author reply 135-9. doi: 10.1016/j.ajhg.2008.06.005.
6
Genetic determinants of ulcerative colitis include the ECM1 locus and five loci implicated in Crohn's disease.溃疡性结肠炎的遗传决定因素包括ECM1基因座和与克罗恩病相关的五个基因座。
Nat Genet. 2008 Jun;40(6):710-2. doi: 10.1038/ng.145. Epub 2008 Apr 27.
7
Goldsurfer2 (Gs2): a comprehensive tool for the analysis and visualization of genome wide association studies.金冲浪者2(Gs2):一种用于全基因组关联研究分析和可视化的综合工具。
BMC Bioinformatics. 2008 Mar 4;9:138. doi: 10.1186/1471-2105-9-138.
8
Designing candidate gene and genome-wide case-control association studies.设计候选基因和全基因组病例对照关联研究。
Nat Protoc. 2007;2(10):2492-501. doi: 10.1038/nprot.2007.366.
9
A genotype calling algorithm for the Illumina BeadArray platform.一种适用于Illumina BeadArray平台的基因型分型算法。
Bioinformatics. 2007 Oct 15;23(20):2741-6. doi: 10.1093/bioinformatics/btm443. Epub 2007 Sep 10.
10
PLINK: a tool set for whole-genome association and population-based linkage analyses.PLINK:一个用于全基因组关联分析和基于群体的连锁分析的工具集。
Am J Hum Genet. 2007 Sep;81(3):559-75. doi: 10.1086/519795. Epub 2007 Jul 25.