• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Illumina 基因分型芯片的处理和质量控制策略。

Strategies for processing and quality control of Illumina genotyping arrays.

机构信息

Department of Cancer Biology, Vanderbilt University, Nashville, TN, USA.

Department of Molecular Physics and Biology, Vanderbilt University, Nashville, TN, USA.

出版信息

Brief Bioinform. 2018 Sep 28;19(5):765-775. doi: 10.1093/bib/bbx012.

DOI:10.1093/bib/bbx012
PMID:28334151
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6171493/
Abstract

Illumina genotyping arrays have powered thousands of large-scale genome-wide association studies over the past decade. Yet, because of the tremendous volume and complicated genetic assumptions of Illumina genotyping data, processing and quality control (QC) of these data remain a challenge. Thorough QC ensures the accurate identification of single-nucleotide polymorphisms and is required for the correct interpretation of genetic association results. By processing genotyping data on > 100 000 subjects from >10 major Illumina genotyping arrays, we have accumulated extensive experience in handling some of the most peculiar scenarios related to the processing and QC of Illumina genotyping data. Here, we describe strategies for processing Illumina genotyping data from the raw data to an analysis ready format, and we elaborate on the necessary QC procedures required at each processing step. High-quality Illumina genotyping data sets can be obtained by following our detailed QC strategies.

摘要

Illumina 基因分型阵列在过去十年中为数以千计的大规模全基因组关联研究提供了支持。然而,由于 Illumina 基因分型数据的巨大数量和复杂的遗传假设,这些数据的处理和质量控制 (QC) 仍然是一个挑战。彻底的 QC 可确保单核苷酸多态性的准确识别,并且是正确解释遗传关联结果所必需的。通过对来自 10 多个主要 Illumina 基因分型阵列的 >100,000 名受试者的基因分型数据进行处理,我们积累了处理与 Illumina 基因分型数据的处理和 QC 相关的一些最特殊情况的丰富经验。在这里,我们描述了从原始数据到可分析格式处理 Illumina 基因分型数据的策略,并详细说明了每个处理步骤所需的必要 QC 程序。通过遵循我们详细的 QC 策略,可以获得高质量的 Illumina 基因分型数据集。

相似文献

1
Strategies for processing and quality control of Illumina genotyping arrays.Illumina 基因分型芯片的处理和质量控制策略。
Brief Bioinform. 2018 Sep 28;19(5):765-775. doi: 10.1093/bib/bbx012.
2
The COPILOT Raw Illumina Genotyping QC Protocol.COPILOT原始Illumina基因分型质量控制方案。
Curr Protoc. 2022 Apr;2(4):e373. doi: 10.1002/cpz1.373.
3
SNPQC--an R pipeline for quality control of Illumina SNP genotyping array data.SNPQC——用于Illumina SNP基因分型阵列数据质量控制的R流程
Anim Genet. 2014 Oct;45(5):758-61. doi: 10.1111/age.12198. Epub 2014 Jul 18.
4
argyle: An R Package for Analysis of Illumina Genotyping Arrays.阿盖尔:一个用于分析Illumina基因分型阵列的R软件包。
G3 (Bethesda). 2015 Dec 18;6(2):281-6. doi: 10.1534/g3.115.023739.
5
Genotyping Informatics and Quality Control for 100,000 Subjects in the Genetic Epidemiology Research on Adult Health and Aging (GERA) Cohort.成人健康与衰老基因流行病学研究(GERA)队列中10万名受试者的基因分型信息学与质量控制
Genetics. 2015 Aug;200(4):1051-60. doi: 10.1534/genetics.115.178905. Epub 2015 Jun 19.
6
Low-depth genotyping-by-sequencing (GBS) in a bovine population: strategies to maximize the selection of high quality genotypes and the accuracy of imputation.牛群中的低深度测序基因分型(GBS):最大化高质量基因型选择和归因准确性的策略。
BMC Genet. 2017 Apr 5;18(1):32. doi: 10.1186/s12863-017-0501-y.
7
Illumina human exome genotyping array clustering and quality control.Illumina人类外显子基因分型阵列聚类与质量控制
Nat Protoc. 2014 Nov;9(11):2643-62. doi: 10.1038/nprot.2014.174. Epub 2014 Oct 16.
8
StrandScript: evaluation of Illumina genotyping array design and strand correction.StrandScript:评估 Illumina 基因分型阵列设计和链校正。
Bioinformatics. 2017 Aug 1;33(15):2399-2401. doi: 10.1093/bioinformatics/btx186.
9
Genome-Wide SNP Calling from Genotyping by Sequencing (GBS) Data: A Comparison of Seven Pipelines and Two Sequencing Technologies.基于测序基因分型(GBS)数据的全基因组单核苷酸多态性(SNP)检测:七种流程和两种测序技术的比较
PLoS One. 2016 Aug 22;11(8):e0161333. doi: 10.1371/journal.pone.0161333. eCollection 2016.
10
iCall: a genotype-calling algorithm for rare, low-frequency and common variants on the Illumina exome array.iCall:一种用于 Illumina 外显子组阵列上罕见、低频和常见变异的基因型调用算法。
Bioinformatics. 2014 Jun 15;30(12):1714-20. doi: 10.1093/bioinformatics/btu107. Epub 2014 Feb 23.

引用本文的文献

1
Genetic Diversity and Population Structure of Five Pig Breeds from Chongqing, China.中国重庆五个猪品种的遗传多样性与群体结构
Animals (Basel). 2025 Sep 5;15(17):2610. doi: 10.3390/ani15172610.
2
Polygenic score from MODY genes is associated with type 1 diabetes and disease characteristics.来自青少年发病的成年型糖尿病(MODY)基因的多基因评分与1型糖尿病及疾病特征相关。
Acta Diabetol. 2025 Jun 20. doi: 10.1007/s00592-025-02544-w.
3
GWAS links APOE to neuropsychiatric symptoms in mild cognitive impairment and dementia.全基因组关联研究将载脂蛋白E与轻度认知障碍和痴呆中的神经精神症状联系起来。
Alzheimers Dement. 2025 Jun;21(6):e70329. doi: 10.1002/alz.70329.
4
An Analysis of the Genetic Diversity, Genetic Structure, and Selection Signal of Beagle Dogs Using SNP Chips.利用单核苷酸多态性芯片分析比格犬的遗传多样性、遗传结构和选择信号
Genes (Basel). 2025 Mar 21;16(4):358. doi: 10.3390/genes16040358.
5
Transcriptional impacts of substance use disorder and HIV on human ventral midbrain neurons and microglia.物质使用障碍和艾滋病毒对人类腹侧中脑神经元和小胶质细胞的转录影响。
bioRxiv. 2025 Feb 8:2025.02.05.636667. doi: 10.1101/2025.02.05.636667.
6
GWAS links to neuropsychiatric symptoms in mild cognitive impairment and dementia.全基因组关联研究将轻度认知障碍和痴呆症中的神经精神症状联系起来。
medRxiv. 2025 Feb 2:2025.01.31.25321498. doi: 10.1101/2025.01.31.25321498.
7
PGSXplorer: an integrated nextflow pipeline for comprehensive quality control and polygenic score model development.PGSXplorer:一个用于全面质量控制和多基因评分模型开发的集成式Nextflow工作流程。
PeerJ. 2025 Feb 12;13:e18973. doi: 10.7717/peerj.18973. eCollection 2025.
8
A brain DNA co-methylation network analysis of psychosis in Alzheimer's disease.阿尔茨海默病精神病的脑DNA共甲基化网络分析
Alzheimers Dement. 2025 Feb;21(2):e14501. doi: 10.1002/alz.14501.
9
Severe COVID-19 disease is associated with genetic factors affecting plasma ACE2 receptor and CRP concentrations.重症 COVID-19 疾病与影响血浆 ACE2 受体和 CRP 浓度的遗传因素有关。
Sci Rep. 2025 Feb 8;15(1):4708. doi: 10.1038/s41598-025-89306-4.
10
CNV-Finder: Streamlining Copy Number Variation Discovery.CNV-Finder:简化拷贝数变异检测
bioRxiv. 2024 Nov 23:2024.11.22.624040. doi: 10.1101/2024.11.22.624040.

本文引用的文献

1
Multi-perspective quality control of Illumina RNA sequencing data analysis.Illumina RNA 测序数据分析的多角度质量控制。
Brief Funct Genomics. 2017 Jul 1;16(4):194-204. doi: 10.1093/bfgp/elw035.
2
Practicability of mitochondrial heteroplasmy detection through an Illumina genotyping array.通过Illumina基因分型芯片检测线粒体异质性的可行性
Mitochondrion. 2016 Nov;31:75-78. doi: 10.1016/j.mito.2016.08.018. Epub 2016 Sep 11.
3
Heterozygosity Ratio, a Robust Global Genomic Measure of Autozygosity and Its Association with Height and Disease Risk.杂合度比率,一种衡量纯合性的稳健全基因组指标及其与身高和疾病风险的关联。
Genetics. 2016 Nov;204(3):893-904. doi: 10.1534/genetics.116.189936. Epub 2016 Aug 31.
4
Illumina human exome genotyping array clustering and quality control.Illumina人类外显子基因分型阵列聚类与质量控制
Nat Protoc. 2014 Nov;9(11):2643-62. doi: 10.1038/nprot.2014.174. Epub 2014 Oct 16.
5
Genome measures used for quality control are dependent on gene function and ancestry.用于质量控制的基因组测量取决于基因功能和祖先。
Bioinformatics. 2015 Feb 1;31(3):318-23. doi: 10.1093/bioinformatics/btu668. Epub 2014 Oct 8.
6
Alternative applications for distinct RNA sequencing strategies.不同RNA测序策略的替代应用。
Brief Bioinform. 2015 Jul;16(4):629-39. doi: 10.1093/bib/bbu032. Epub 2014 Sep 22.
7
MultiRankSeq: multiperspective approach for RNAseq differential expression analysis and quality control.MultiRankSeq:用于RNA测序差异表达分析和质量控制的多视角方法。
Biomed Res Int. 2014;2014:248090. doi: 10.1155/2014/248090. Epub 2014 May 27.
8
Multi-perspective quality control of Illumina exome sequencing data using QC3.使用QC3对Illumina外显子组测序数据进行多视角质量控制。
Genomics. 2014 May-Jun;103(5-6):323-8. doi: 10.1016/j.ygeno.2014.03.006. Epub 2014 Apr 3.
9
Three-stage quality control strategies for DNA re-sequencing data.DNA 重测序数据的三阶段质量控制策略。
Brief Bioinform. 2014 Nov;15(6):879-89. doi: 10.1093/bib/bbt069. Epub 2013 Sep 24.
10
Rare variants in CFI, C3 and C9 are associated with high risk of advanced age-related macular degeneration.CFI、C3 和 C9 中的罕见变异与高龄相关性黄斑变性的高风险相关。
Nat Genet. 2013 Nov;45(11):1366-70. doi: 10.1038/ng.2741. Epub 2013 Sep 15.