• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用Illumina平台实现低多样性样本高测序准确性及避免样本交叉污染的策略。

Strategies for achieving high sequencing accuracy for low diversity samples and avoiding sample bleeding using illumina platform.

作者信息

Mitra Abhishek, Skrzypczak Magdalena, Ginalski Krzysztof, Rowicka Maga

机构信息

Department of Biochemistry and Molecular Biology, University of Texas Medical Branch at Galveston, 301 University Blvd, Galveston, TX, 77555, USA; Institute for Translational Sciences, University of Texas Medical Branch at Galveston, 301 University Blvd, Galveston, TX, 77555, USA.

Laboratory of Bioinformatics and Systems Biology, Centre of New Technologies, University of Warsaw, Zwirki i Wigury 93, 02-089 Warsaw, Poland.

出版信息

PLoS One. 2015 Apr 10;10(4):e0120520. doi: 10.1371/journal.pone.0120520. eCollection 2015.

DOI:10.1371/journal.pone.0120520
PMID:25860802
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4393298/
Abstract

Sequencing microRNA, reduced representation sequencing, Hi-C technology and any method requiring the use of in-house barcodes result in sequencing libraries with low initial sequence diversity. Sequencing such data on the Illumina platform typically produces low quality data due to the limitations of the Illumina cluster calling algorithm. Moreover, even in the case of diverse samples, these limitations are causing substantial inaccuracies in multiplexed sample assignment (sample bleeding). Such inaccuracies are unacceptable in clinical applications, and in some other fields (e.g. detection of rare variants). Here, we discuss how both problems with quality of low-diversity samples and sample bleeding are caused by incorrect detection of clusters on the flowcell during initial sequencing cycles. We propose simple software modifications (Long Template Protocol) that overcome this problem. We present experimental results showing that our Long Template Protocol remarkably increases data quality for low diversity samples, as compared with the standard analysis protocol; it also substantially reduces sample bleeding for all samples. For comprehensiveness, we also discuss and compare experimental results from alternative approaches to sequencing low diversity samples. First, we discuss how the low diversity problem, if caused by barcodes, can be avoided altogether at the barcode design stage. Second and third, we present modified guidelines, which are more stringent than the manufacturer's, for mixing low diversity samples with diverse samples and lowering cluster density, which in our experience consistently produces high quality data from low diversity samples. Fourth and fifth, we present rescue strategies that can be applied when sequencing results in low quality data and when there is no more biological material available. In such cases, we propose that the flowcell be re-hybridized and sequenced again using our Long Template Protocol. Alternatively, we discuss how analysis can be repeated from saved sequencing images using the Long Template Protocol to increase accuracy.

摘要

对微小RNA进行测序、简化代表性测序、Hi-C技术以及任何需要使用内部条形码的方法,都会导致测序文库的初始序列多样性较低。由于Illumina簇识别算法的局限性,在Illumina平台上对这类数据进行测序通常会产生低质量的数据。此外,即使是在样本多样的情况下,这些局限性也会在多重样本分配(样本串扰)中导致大量不准确的情况。这种不准确在临床应用以及其他一些领域(例如罕见变异的检测)中是不可接受的。在此,我们讨论了低多样性样本质量问题和样本串扰这两个问题是如何在初始测序循环期间由流动池上簇的错误检测所导致的。我们提出了简单的软件修改方法(长模板协议)来克服这个问题。我们展示的实验结果表明,与标准分析协议相比,我们的长模板协议显著提高了低多样性样本的数据质量;它还大幅减少了所有样本的样本串扰。为了全面性,我们还讨论并比较了对低多样性样本进行测序的替代方法的实验结果。首先,我们讨论如果低多样性问题是由条形码引起的,如何在条形码设计阶段完全避免这个问题。第二和第三,我们给出了比制造商的指南更严格的修改后的指南,用于将低多样性样本与多样样本混合以及降低簇密度,根据我们的经验,这始终能从低多样性样本中产生高质量的数据。第四和第五,我们提出了在测序产生低质量数据且没有更多生物材料可用时可以应用的挽救策略。在这种情况下,我们建议使用我们的长模板协议对流动池进行重新杂交并再次测序。或者,我们讨论如何使用长模板协议从保存的测序图像中重复进行分析以提高准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8ab/4393298/9bb20bfef227/pone.0120520.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8ab/4393298/abdc3441de91/pone.0120520.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8ab/4393298/233d23d05c30/pone.0120520.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8ab/4393298/6111db8abcce/pone.0120520.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8ab/4393298/778ff001c6b2/pone.0120520.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8ab/4393298/9bb20bfef227/pone.0120520.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8ab/4393298/abdc3441de91/pone.0120520.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8ab/4393298/233d23d05c30/pone.0120520.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8ab/4393298/6111db8abcce/pone.0120520.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8ab/4393298/778ff001c6b2/pone.0120520.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b8ab/4393298/9bb20bfef227/pone.0120520.g005.jpg

相似文献

1
Strategies for achieving high sequencing accuracy for low diversity samples and avoiding sample bleeding using illumina platform.利用Illumina平台实现低多样性样本高测序准确性及避免样本交叉污染的策略。
PLoS One. 2015 Apr 10;10(4):e0120520. doi: 10.1371/journal.pone.0120520. eCollection 2015.
2
A novel ultra high-throughput 16S rRNA gene amplicon sequencing library preparation method for the Illumina HiSeq platform.一种新型的超高通量 16S rRNA 基因扩增子测序文库制备方法,适用于 Illumina HiSeq 平台。
Microbiome. 2017 Jul 6;5(1):68. doi: 10.1186/s40168-017-0279-1.
3
Deep sequencing analysis of phage libraries using Illumina platform.利用 Illumina 平台进行噬菌体文库的深度测序分析。
Methods. 2012 Sep;58(1):47-55. doi: 10.1016/j.ymeth.2012.07.006. Epub 2012 Jul 20.
4
Software updates in the Illumina HiSeq platform affect whole-genome bisulfite sequencing.Illumina HiSeq平台的软件更新会影响全基因组亚硫酸氢盐测序。
BMC Genomics. 2017 Jan 5;18(1):31. doi: 10.1186/s12864-016-3392-9.
5
Double indexing overcomes inaccuracies in multiplex sequencing on the Illumina platform.双索引克服了 Illumina 平台上多重测序的不准确性。
Nucleic Acids Res. 2012 Jan;40(1):e3. doi: 10.1093/nar/gkr771. Epub 2011 Oct 21.
6
Large scale loss of data in low-diversity illumina sequencing libraries can be recovered by deferred cluster calling.大规模的低多样性 Illumina 测序文库数据丢失可以通过延迟聚类调用来恢复。
PLoS One. 2011 Jan 28;6(1):e16607. doi: 10.1371/journal.pone.0016607.
7
Multi-centre evaluation of a comprehensive preimplantation genetic test through haplotyping-by-sequencing.通过测序单倍型分析对综合植入前遗传学检测的多中心评估。
Hum Reprod. 2019 Aug 1;34(8):1608-1619. doi: 10.1093/humrep/dez106.
8
Increased efficiency in identifying mixed pollen samples by meta-barcoding with a dual-indexing approach.通过双索引方法进行元条形码分析提高识别混合花粉样本的效率。
BMC Ecol. 2015 Jul 22;15:20. doi: 10.1186/s12898-015-0051-y.
9
Performance evaluation of a mitogenome capture and Illumina sequencing protocol using non-probative, case-type skeletal samples: Implications for the use of a positive control in a next-generation sequencing procedure.使用非验证性、病例型骨骼样本对线粒体基因组捕获和Illumina测序方案的性能评估:对下一代测序程序中使用阳性对照的启示。
Forensic Sci Int Genet. 2017 Nov;31:198-206. doi: 10.1016/j.fsigen.2017.09.001. Epub 2017 Sep 8.
10
Biomarker discovery: quantification of microRNAs and other small non-coding RNAs using next generation sequencing.生物标志物发现:使用下一代测序技术对微小RNA和其他小非编码RNA进行定量分析。
BMC Med Genomics. 2015 Jul 1;8:35. doi: 10.1186/s12920-015-0109-x.

引用本文的文献

1
First isolation of a methanotrophic reveals ammonia- and pH-tolerant methane oxidation.首次分离出一种甲烷营养菌,揭示了其对氨和pH的耐受性甲烷氧化作用。
Appl Environ Microbiol. 2025 Aug 20;91(8):e0079625. doi: 10.1128/aem.00796-25. Epub 2025 Jul 31.
2
Optimized in-solution enrichment of over a million ancient human SNPs.超过一百万个古代人类单核苷酸多态性的优化溶液内富集。
Genome Biol. 2025 Jul 3;26(1):190. doi: 10.1186/s13059-025-03622-6.
3
Development and field application of metabarcoding-adapted mt-ND4 markers shows substantial gene flow and varying local pressures on Haemonchus contortus and Teladorsagia circumcincta populations in the UK.

本文引用的文献

1
Nucleotide-resolution DNA double-strand break mapping by next-generation sequencing.基于新一代测序的核苷酸分辨率 DNA 双链断裂作图
Nat Methods. 2013 Apr;10(4):361-5. doi: 10.1038/nmeth.2408. Epub 2013 Mar 17.
2
Fast gapped-read alignment with Bowtie 2.快速缺口读对准与 Bowtie 2。
Nat Methods. 2012 Mar 4;9(4):357-9. doi: 10.1038/nmeth.1923.
3
Double indexing overcomes inaccuracies in multiplex sequencing on the Illumina platform.双索引克服了 Illumina 平台上多重测序的不准确性。
适用于元条形码分析的线粒体 ND4 标记的开发与田间应用表明,在英国,捻转血矛线虫和环形泰勒虫种群存在大量基因流动以及不同的局部压力。
PLoS One. 2025 Jul 2;20(7):e0327254. doi: 10.1371/journal.pone.0327254. eCollection 2025.
4
Section-level genome sequencing and comparative genomics of sections and .组和组的区段水平基因组测序及比较基因组学
Stud Mycol. 2025 Jun;111:101-114. doi: 10.3114/sim.2025.111.03. Epub 2025 Mar 5.
5
Early detection and population dynamics of in naturally contaminated drains from a meat processing plant.肉类加工厂自然污染排水系统中[具体对象未给出]的早期检测与种群动态
Front Microbiol. 2025 Apr 9;16:1541481. doi: 10.3389/fmicb.2025.1541481. eCollection 2025.
6
Comparative Community Ecology Reveals Conserved Ectoparasite Microbiomes Amidst Variable Host and Environment Microbiomes.比较群落生态学揭示了在宿主和环境微生物群落多变的情况下,体表寄生虫微生物群落具有保守性。
Ecol Evol. 2025 Apr 2;15(4):e71120. doi: 10.1002/ece3.71120. eCollection 2025 Apr.
7
How do phytophagous insects affect phyllosphere fungi? Tracking fungi from milkweed to monarch caterpillar frass reveals communities dominated by fungal yeast.植食性昆虫如何影响叶际真菌?从马利筋到帝王蝶毛虫粪便中追踪真菌,揭示了以真菌酵母为主导的群落。
Environ Microbiol Rep. 2024 Jun;16(3):e13213. doi: 10.1111/1758-2229.13213.
8
Defining a metagenomic threshold for detecting low abundances of in canine faecal samples.定义检测犬粪便样本中低丰度的宏基因组学阈值。
Front Cell Infect Microbiol. 2024 Feb 28;14:1305742. doi: 10.3389/fcimb.2024.1305742. eCollection 2024.
9
Comparison of four different human papillomavirus genotyping methods in cervical samples: Addressing method-specific advantages and limitations.宫颈样本中四种不同人乳头瘤病毒基因分型方法的比较:探讨方法特异性的优势与局限性。
Heliyon. 2024 Jan 30;10(3):e25474. doi: 10.1016/j.heliyon.2024.e25474. eCollection 2024 Feb 15.
10
Fatal neural angiostrongyliasis in the Bolivian squirrel monkey () leading to defining risk map at a zoo in Australia.玻利维亚松鼠猴()致命性嗜酸性粒细胞增多性脑膜炎导致澳大利亚一家动物园绘制风险地图。
One Health. 2023 Sep 15;17:100628. doi: 10.1016/j.onehlt.2023.100628. eCollection 2023 Dec.
Nucleic Acids Res. 2012 Jan;40(1):e3. doi: 10.1093/nar/gkr771. Epub 2011 Oct 21.
4
Large scale loss of data in low-diversity illumina sequencing libraries can be recovered by deferred cluster calling.大规模的低多样性 Illumina 测序文库数据丢失可以通过延迟聚类调用来恢复。
PLoS One. 2011 Jan 28;6(1):e16607. doi: 10.1371/journal.pone.0016607.
5
The sequence read archive.序列读取存档库。
Nucleic Acids Res. 2011 Jan;39(Database issue):D19-21. doi: 10.1093/nar/gkq1019. Epub 2010 Nov 9.
6
MicroRNA profiling: separating signal from noise.miRNA 谱分析:从噪声中分离信号。
Nat Methods. 2010 Sep;7(9):687-92. doi: 10.1038/nmeth0910-687.
7
Illumina sequencing library preparation for highly multiplexed target capture and sequencing.用于高度多重目标捕获和测序的Illumina测序文库制备。
Cold Spring Harb Protoc. 2010 Jun;2010(6):pdb.prot5448. doi: 10.1101/pdb.prot5448.
8
Hi-C: a method to study the three-dimensional architecture of genomes.Hi-C:一种研究基因组三维结构的方法。
J Vis Exp. 2010 May 6(39):1869. doi: 10.3791/1869.
9
Target-enrichment strategies for next-generation sequencing.下一代测序的靶向富集策略。
Nat Methods. 2010 Feb;7(2):111-8. doi: 10.1038/nmeth.1419.
10
Direct multiplex sequencing (DMPS)--a novel method for targeted high-throughput sequencing of ancient and highly degraded DNA.直接多重测序(DMPS)——一种用于古代和高度降解DNA靶向高通量测序的新方法。
Genome Res. 2009 Oct;19(10):1843-8. doi: 10.1101/gr.095760.109. Epub 2009 Jul 27.