Variant identification in multi-sample pools by Illumina Genome Analyzer sequencing.

Authors

Margraf Rebecca L, Durtschi Jacob D, Dames Shale, Pattison David C, Stephens Jack E, Voelkerding Karl V

Affiliation

ARUP Institute for Clinical & Experimental Pathology®, Salt Lake City, Utah, USA.

Publication

J Biomol Tech. 2011 Jul;22(2):74-84.

PMID:21738440

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3121147/

Abstract

Multi-sample pooling and Illumina Genome Analyzer (GA) sequencing allow high-throughput sequencing of multiple samples to determine population sequence variation. A preliminary experiment, using the RET proto-oncogene as a model, predicted that ≤ 30 samples could be pooled to reliably detect singleton variants without requiring additional confirmation testing. This report used 30- and 50-sample pools to test the hypothesized pooling limit and also to test recent protocol improvements, Illumina GAIIx upgrades, and longer read chemistry. The SequalPrep™ method was used to normalize amplicons before pooling. For comparison, a single 'control' sample was run in a different flow cell lane. Data were evaluated by variant read percentages and by the subtractive correction method, which utilizes the control sample. In total, 59 variants were detected within the pooled samples, which included all 47 known true variants. The 15 singleton variants known from Sanger sequencing had an average of 1.62 ± 0.26% variant reads for the 30-sample pool (expected 1.67% for a singleton variant, i.e., a variant unique within the pool) and 1.01 ± 0.19% for the 50-sample pool (expected 1%). The 76-base reads had higher error rates than the shorter reads (33 and 50 bases), which eliminated the distinction between true singleton variants and background error. This report demonstrated pooling limits of 30 to 50 samples (depending on error rates and coverage) for reliable singleton variant detection. The presented pooling protocols and analysis methods can be used for variant discovery in other genes, facilitating molecular diagnostic test design and interpretation.
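
The expected singleton percentages quoted in the abstract (1.67% for 30 samples, 1% for 50) are consistent with one heterozygous carrier among equally represented diploid samples, i.e. one variant allele out of 2N. The sketch below reproduces that arithmetic. The function names are hypothetical, and the subtraction shown is only an assumed reading of the "subtractive correction method" (pool percentage minus control percentage at the same position); the abstract does not give the exact formula.

```python
def expected_singleton_percent(n_samples: int, ploidy: int = 2) -> float:
    """Expected variant-read percentage for a single heterozygous carrier.

    Assumes every sample contributes equally to the pool (the amplicons are
    normalized before pooling) and that the carrier has one variant allele.
    """
    if n_samples < 1 or ploidy < 1:
        raise ValueError("n_samples and ploidy must be positive")
    return 100.0 / (ploidy * n_samples)


def subtractive_correction(pool_percent: float, control_percent: float) -> float:
    """Assumed form of the correction: remove the control lane's background.

    Hypothetical helper; the abstract only states that the method
    "utilizes the control sample".
    """
    return pool_percent - control_percent


for n in (30, 50):
    print(f"{n}-sample pool: {expected_singleton_percent(n):.2f}% expected")
# 30-sample pool: 1.67% expected
# 50-sample pool: 1.00% expected
```

Under these assumptions, the observed averages of 1.62% and 1.01% sit close to the expected values, which is why a true singleton can be separated from background only when the per-position error rate stays well below 1/(2N).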
