• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于人群的罕见变异检测,通过组合外显子组或定制杂交捕获,有或没有个体索引。

Population-based rare variant detection via pooled exome or custom hybridization capture with or without individual indexing.

机构信息

Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St, Louis, MO 63108, USA.

出版信息

BMC Genomics. 2012 Dec 6;13:683. doi: 10.1186/1471-2164-13-683.

DOI:10.1186/1471-2164-13-683
PMID:23216810
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3534616/
Abstract

BACKGROUND

Rare genetic variation in the human population is a major source of pathophysiological variability and has been implicated in a host of complex phenotypes and diseases. Finding disease-related genes harboring disparate functional rare variants requires sequencing of many individuals across many genomic regions and comparing against unaffected cohorts. However, despite persistent declines in sequencing costs, population-based rare variant detection across large genomic target regions remains cost prohibitive for most investigators. In addition, DNA samples are often precious and hybridization methods typically require large amounts of input DNA. Pooled sample DNA sequencing is a cost and time-efficient strategy for surveying populations of individuals for rare variants. We set out to 1) create a scalable, multiplexing method for custom capture with or without individual DNA indexing that was amenable to low amounts of input DNA and 2) expand the functionality of the SPLINTER algorithm for calling substitutions, insertions and deletions across either candidate genes or the entire exome by integrating the variant calling algorithm with the dynamic programming aligner, Novoalign.

RESULTS

We report methodology for pooled hybridization capture with pre-enrichment, indexed multiplexing of up to 48 individuals or non-indexed pooled sequencing of up to 92 individuals with as little as 70 ng of DNA per person. Modified solid phase reversible immobilization bead purification strategies enable no sample transfers from sonication in 96-well plates through adapter ligation, resulting in 50% less library preparation reagent consumption. Custom Y-shaped adapters containing novel 7 base pair index sequences with a Hamming distance of ≥2 were directly ligated onto fragmented source DNA eliminating the need for PCR to incorporate indexes, and was followed by a custom blocking strategy using a single oligonucleotide regardless of index sequence. These results were obtained aligning raw reads against the entire genome using Novoalign followed by variant calling of non-indexed pools using SPLINTER or SAMtools for indexed samples. With these pipelines, we find sensitivity and specificity of 99.4% and 99.7% for pooled exome sequencing. Sensitivity, and to a lesser degree specificity, proved to be a function of coverage. For rare variants (≤2% minor allele frequency), we achieved sensitivity and specificity of ≥94.9% and ≥99.99% for custom capture of 2.5 Mb in multiplexed libraries of 22-48 individuals with only ≥5-fold coverage/chromosome, but these parameters improved to ≥98.7 and 100% with 20-fold coverage/chromosome.

CONCLUSIONS

This highly scalable methodology enables accurate rare variant detection, with or without individual DNA sample indexing, while reducing the amount of required source DNA and total costs through less hybridization reagent consumption, multi-sample sonication in a standard PCR plate, multiplexed pre-enrichment pooling with a single hybridization and lesser sequencing coverage required to obtain high sensitivity.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abdc/3534616/ce7869322089/1471-2164-13-683-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abdc/3534616/9fa104e334c3/1471-2164-13-683-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abdc/3534616/77fd87251ae4/1471-2164-13-683-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abdc/3534616/fbac320d3f99/1471-2164-13-683-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abdc/3534616/ce7869322089/1471-2164-13-683-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abdc/3534616/9fa104e334c3/1471-2164-13-683-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abdc/3534616/77fd87251ae4/1471-2164-13-683-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abdc/3534616/fbac320d3f99/1471-2164-13-683-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abdc/3534616/ce7869322089/1471-2164-13-683-4.jpg
摘要

背景

人类群体中罕见的遗传变异是病理生理学变异性的主要来源,并与许多复杂表型和疾病有关。发现携带不同功能罕见变异的疾病相关基因需要对许多个体在许多基因组区域进行测序,并与未受影响的队列进行比较。然而,尽管测序成本持续下降,但对大多数研究人员来说,在大型基因组目标区域进行基于人群的罕见变异检测仍然成本过高。此外,DNA 样本通常很珍贵,杂交方法通常需要大量的输入 DNA。 pooled sample DNA sequencing 是一种用于调查个体罕见变异的经济高效的策略。我们着手 1)创建一种可扩展的、带有或不带有个体 DNA 索引的定制捕获的多路复用方法,该方法适用于少量输入 DNA;2)通过将变体调用算法与动态规划对齐器 Novoalign 集成,扩展 SPLINTER 算法用于在候选基因或整个外显子中调用替换、插入和缺失的功能。

结果

我们报告了一种带有预富集的 pooled hybridization capture 方法学,最多可对 48 个个体或多达 92 个个体进行索引多路复用,每个个体的 DNA 量低至 70ng。改良的固相可逆固定珠纯化策略可避免从 96 孔板的超声处理到接头连接的样品转移,从而使文库制备试剂的消耗减少 50%。包含新型 7 碱基对索引序列的 Y 形定制接头与 Hamming 距离≥2 的索引序列直接连接到片段化的源 DNA 上,从而无需 PCR 即可掺入索引,并且随后使用单个寡核苷酸(无论索引序列如何)进行定制阻塞策略。这些结果是通过使用 Novoalign 对原始读数进行全基因组比对,并使用 SPLINTER 或 SAMtools 对非索引池进行变体调用而获得的。使用这些管道,我们发现 pooled exome sequencing 的灵敏度和特异性分别为 99.4%和 99.7%。灵敏度,在较小程度上是特异性,是覆盖范围的函数。对于罕见变异(≤2%的次要等位基因频率),我们在 22-48 个个体的多路复用文库中对 2.5Mb 进行定制捕获时,灵敏度和特异性分别≥94.9%和≥99.99%,并且仅需要≥5 倍覆盖/染色体,但这些参数在 20 倍覆盖/染色体时提高到≥98.7%和 100%。

结论

该高度可扩展的方法学可实现罕见变异的准确检测,无论是带有个体 DNA 样本索引还是不带有个体 DNA 样本索引,同时通过减少杂交试剂的消耗、在标准 PCR 板中进行多样本超声处理、使用单个杂交进行预富集多路复用以及获得高灵敏度所需的较少测序覆盖范围,来减少所需的源 DNA 量和总成本。

相似文献

1
Population-based rare variant detection via pooled exome or custom hybridization capture with or without individual indexing.基于人群的罕见变异检测,通过组合外显子组或定制杂交捕获,有或没有个体索引。
BMC Genomics. 2012 Dec 6;13:683. doi: 10.1186/1471-2164-13-683.
2
Detection of rare genomic variants from pooled sequencing using SPLINTER.使用SPLINTER从混合测序中检测罕见基因组变异。
J Vis Exp. 2012 Jun 23(64):3943. doi: 10.3791/3943.
3
Evaluation of Hybridization Capture Versus Amplicon-Based Methods for Whole-Exome Sequencing.杂交捕获法与基于扩增子的全外显子组测序方法的评估
Hum Mutat. 2015 Sep;36(9):903-14. doi: 10.1002/humu.22825. Epub 2015 Jul 15.
4
A unified approach for allele frequency estimation, SNP detection and association studies based on pooled sequencing data using EM algorithms.基于 EM 算法的基于测序数据的等位基因频率估计、SNP 检测和关联研究的统一方法。
BMC Genomics. 2013;14 Suppl 1(Suppl 1):S1. doi: 10.1186/1471-2164-14-S1-S1. Epub 2013 Jan 21.
5
Pooled-DNA Sequencing for Elucidating New Genomic Risk Factors, Rare Variants Underlying Alzheimer's Disease.用于阐明新的基因组风险因素、阿尔茨海默病潜在罕见变异的混合DNA测序
Methods Mol Biol. 2016;1303:299-314. doi: 10.1007/978-1-4939-2627-5_18.
6
SNP calling by sequencing pooled samples.基于测序的混合样本 SNP 检测。
BMC Bioinformatics. 2012 Sep 20;13:239. doi: 10.1186/1471-2105-13-239.
7
Investigation of rare and low-frequency variants using high-throughput sequencing with pooled DNA samples.使用高通量测序对混合 DNA 样本进行罕见和低频变异的研究。
Sci Rep. 2016 Sep 16;6:33256. doi: 10.1038/srep33256.
8
Imputation-based assessment of next generation rare exome variant arrays.基于插补法的新一代罕见外显子变异阵列评估
Pac Symp Biocomput. 2014:241-52.
9
iCall: a genotype-calling algorithm for rare, low-frequency and common variants on the Illumina exome array.iCall:一种用于 Illumina 外显子组阵列上罕见、低频和常见变异的基因型调用算法。
Bioinformatics. 2014 Jun 15;30(12):1714-20. doi: 10.1093/bioinformatics/btu107. Epub 2014 Feb 23.
10
Comparison of solution-based exome capture methods for next generation sequencing.基于溶液的外显子组捕获方法在下一代测序中的比较。
Genome Biol. 2011 Sep 28;12(9):R94. doi: 10.1186/gb-2011-12-9-r94.

引用本文的文献

1
Whole Exome-Sequencing of Pooled Genomic DNA Samples to Detect Quantitative Trait Loci in Esotropia and Exotropia of Strabismus in Japanese.对日本斜视患者内斜视和外斜视的混合基因组DNA样本进行全外显子组测序以检测数量性状位点
Life (Basel). 2021 Dec 27;12(1):41. doi: 10.3390/life12010041.
2
Pre-capture multiplexing provides additional power to detect copy number variation in exome sequencing.预捕获多重分析为外显子测序中检测拷贝数变异提供了额外的功效。
BMC Bioinformatics. 2021 Jul 20;22(1):374. doi: 10.1186/s12859-021-04246-w.
3
Targeted sequencing of both DNA strands barcoded and captured individually by RNA probes to identify genome-wide ultra-rare mutations.

本文引用的文献

1
Cost-effective, high-throughput DNA sequencing libraries for multiplexed target capture.经济高效、高通量的用于多重靶标捕获的 DNA 测序文库。
Genome Res. 2012 May;22(5):939-46. doi: 10.1101/gr.128124.111. Epub 2012 Jan 20.
2
An evaluation of different target enrichment methods in pooled sequencing designs for complex disease association studies.在复杂疾病关联研究的合并测序设计中,不同目标富集方法的评估。
PLoS One. 2011;6(11):e26279. doi: 10.1371/journal.pone.0026279. Epub 2011 Nov 1.
3
Multiplexed array-based and in-solution genomic enrichment for flexible and cost-effective targeted next-generation sequencing.
靶向测序两条 DNA 链,每条 DNA 链均通过 RNA 探针进行单独标记和捕获,以鉴定全基因组超罕见突变。
Sci Rep. 2017 Jun 13;7(1):3356. doi: 10.1038/s41598-017-03448-8.
4
Exome sequencing in pooled DNA samples to identify maternal pre-eclampsia risk variants.在合并的 DNA 样本中进行外显子组测序,以鉴定母体子痫前期风险变异。
Sci Rep. 2016 Jul 7;6:29085. doi: 10.1038/srep29085.
5
VarElect: the phenotype-based variation prioritizer of the GeneCards Suite.VarElect:基因卡片套件中基于表型的变异优先级排序工具。
BMC Genomics. 2016 Jun 23;17 Suppl 2(Suppl 2):444. doi: 10.1186/s12864-016-2722-2.
6
The minimal amount of starting DNA for Agilent's hybrid capture-based targeted massively parallel sequencing.安捷伦基于杂交捕获的靶向大规模平行测序所需的起始DNA最小量。
Sci Rep. 2016 May 25;6:26732. doi: 10.1038/srep26732.
7
Candidate gene resequencing to identify rare, pedigree-specific variants influencing healthy aging phenotypes in the long life family study.在长寿家族研究中进行候选基因重测序,以鉴定影响健康衰老表型的罕见、特定家系变异。
BMC Geriatr. 2016 Apr 9;16:80. doi: 10.1186/s12877-016-0253-y.
8
Targeted single molecule mutation detection with massively parallel sequencing.利用大规模平行测序进行靶向单分子突变检测。
Nucleic Acids Res. 2016 Feb 18;44(3):e22. doi: 10.1093/nar/gkv915. Epub 2015 Sep 17.
9
Genetic Analysis of Substrain Divergence in Non-Obese Diabetic (NOD) Mice.非肥胖型糖尿病(NOD)小鼠亚系差异的遗传分析
G3 (Bethesda). 2015 Mar 3;5(5):771-5. doi: 10.1534/g3.115.017046.
10
Rare variants in FBN1 and FBN2 are associated with severe adolescent idiopathic scoliosis.FBN1和FBN2基因中的罕见变异与重度青少年特发性脊柱侧凸有关。
Hum Mol Genet. 2014 Oct 1;23(19):5271-82. doi: 10.1093/hmg/ddu224. Epub 2014 May 15.
基于多重微阵列和溶液内基因组富集的灵活且经济高效的靶向下一代测序。
Nat Protoc. 2011 Nov 3;6(12):1870-86. doi: 10.1038/nprot.2011.396.
4
Double indexing overcomes inaccuracies in multiplex sequencing on the Illumina platform.双索引克服了 Illumina 平台上多重测序的不准确性。
Nucleic Acids Res. 2012 Jan;40(1):e3. doi: 10.1093/nar/gkr771. Epub 2011 Oct 21.
5
Deep resequencing of GWAS loci identifies independent rare variants associated with inflammatory bowel disease.全基因组关联研究位点的深度重测序鉴定出与炎症性肠病相关的独立稀有变异。
Nat Genet. 2011 Oct 9;43(11):1066-73. doi: 10.1038/ng.952.
6
Efficient and cost effective population resequencing by pooling and in-solution hybridization.通过池化和溶液杂交进行高效且具有成本效益的群体重测序。
PLoS One. 2011 Mar 30;6(3):e18353. doi: 10.1371/journal.pone.0018353.
7
A framework for variation discovery and genotyping using next-generation DNA sequencing data.利用下一代 DNA 测序数据进行变异发现和基因分型的框架。
Nat Genet. 2011 May;43(5):491-8. doi: 10.1038/ng.806. Epub 2011 Apr 10.
8
Cost-effective multiplexing before capture allows screening of 25 000 clinically relevant SNPs in childhood acute lymphoblastic leukemia.在捕获前进行具有成本效益的多重分析,可筛选 25000 个与儿童急性淋巴细胞白血病相关的临床相关 SNP。
Leukemia. 2011 Jun;25(6):1001-6. doi: 10.1038/leu.2011.32. Epub 2011 Mar 18.
9
Genomic DNA pooling strategy for next-generation sequencing-based rare variant discovery in abdominal aortic aneurysm regions of interest-challenges and limitations.基于下一代测序的腹主动脉瘤感兴趣区域稀有变异发现的基因组 DNA 池化策略——挑战和局限性。
J Cardiovasc Transl Res. 2011 Jun;4(3):271-80. doi: 10.1007/s12265-011-9263-5. Epub 2011 Mar 1.
10
A scalable, fully automated process for construction of sequence-ready human exome targeted capture libraries.一种可扩展的、全自动的人类外显子靶向捕获文库构建方法。
Genome Biol. 2011;12(1):R1. doi: 10.1186/gb-2011-12-1-r1. Epub 2011 Jan 4.