利用独特分子标识符和 SiMSen-Seq 方法进行 STR 标记的超灵敏测序。

Ultrasensitive sequencing of STR markers utilizing unique molecular identifiers and the SiMSen-Seq method.

机构信息

National Forensic Centre, Swedish Police Authority, Linköping SE-581 94, Sweden.

National Institute of Standards and Technology, 100 Bureau Drive, M/S 8314, Gaithersburg, MD 20899, USA.

出版信息

Forensic Sci Int Genet. 2024 Jul;71:103047. doi: 10.1016/j.fsigen.2024.103047. Epub 2024 Apr 3.

DOI:10.1016/j.fsigen.2024.103047

PMID:38598919

Abstract

Massively parallel sequencing (MPS) is increasingly applied in forensic short tandem repeat (STR) analysis. The presence of stutter artefacts and other PCR or sequencing errors in the MPS-STR data partly limits the detection of low DNA amounts, e.g., in complex mixtures. Unique molecular identifiers (UMIs) have been applied in several scientific fields to reduce noise in sequencing. UMIs consist of a stretch of random nucleotides, a unique barcode for each starting DNA molecule, that is incorporated in the DNA template using either ligation or PCR. The barcode is used to generate consensus reads, thus removing errors. The SiMSen-Seq (Simple, multiplexed, PCR-based barcoding of DNA for sensitive mutation detection using sequencing) method relies on PCR-based introduction of UMIs and includes a sophisticated hairpin design to reduce unspecific primer binding as well as PCR protocol adjustments to further optimize the reaction. In this study, SiMSen-Seq is applied to develop a proof-of-concept seven STR multiplex for MPS library preparation and an associated bioinformatics pipeline. Additionally, machine learning (ML) models were evaluated to further improve UMI allele calling. Overall, the seven STR multiplex resulted in complete detection and concordant alleles for 47 single-source samples at 1 ng input DNA as well as for low-template samples at 62.5 pg input DNA. For twelve challenging mixtures with minor contributions of 10 pg to 150 pg and ratios of 1-15% relative to the major donor, 99.2% of the expected alleles were detected by applying the UMIs in combination with an ML filter. The main impact of UMIs was a substantially lowered number of artefacts as well as reduced stutter ratios, which were generally below 5% of the parental allele. In conclusion, UMI-based STR sequencing opens new means for improved analysis of challenging crime scene samples including complex mixtures.

摘要

大规模并行测序（MPS）越来越多地应用于法医短串联重复序列（STR）分析。MPS-STR 数据中存在的重迭伪像和其他 PCR 或测序错误部分限制了低 DNA 量的检测，例如在复杂混合物中。独特分子标识符（UMI）已在多个科学领域中应用，以减少测序中的噪声。UMI 由一段随机核苷酸组成，每个起始 DNA 分子都有一个独特的条形码，该条形码通过连接或 PCR 掺入 DNA 模板中。条形码用于生成一致的读取，从而消除错误。SiMSen-Seq（使用测序对 DNA 进行简单、多重、基于 PCR 的 UMI 条形码标记，以灵敏检测突变）方法依赖于基于 PCR 的 UMI 引入，并且包括一种复杂的发夹设计，以减少非特异性引物结合，以及 PCR 协议调整，以进一步优化反应。在这项研究中，SiMSen-Seq 被应用于开发用于 MPS 文库制备的概念验证七重 STR 多重扩增，以及相关的生物信息学管道。此外，还评估了机器学习（ML）模型，以进一步提高 UMI 等位基因调用的准确性。总体而言，该七重 STR 多重扩增在 47 个单源样本（输入 DNA 为 1ng）和低模板样本（输入 DNA 为 62.5pg）中实现了完全检测和一致的等位基因。对于 12 个具有挑战性的混合物，其次要贡献为 10pg 至 150pg，相对主要供体的比例为 1-15%，通过应用 ML 滤波器与 UMI 结合，检测到了 99.2%的预期等位基因。UMI 的主要影响是显著降低了伪像数量和重迭比率，总体低于亲本等位基因的 5%。总之，基于 UMI 的 STR 测序为包括复杂混合物在内的具有挑战性的犯罪现场样本的分析提供了新的手段。

相似文献

Ultrasensitive sequencing of STR markers utilizing unique molecular identifiers and the SiMSen-Seq method.

Forensic Sci Int Genet. 2024 Jul;71:103047. doi: 10.1016/j.fsigen.2024.103047. Epub 2024 Apr 3.

Reducing noise and stutter in short tandem repeat loci with unique molecular identifiers.

Forensic Sci Int Genet. 2021 Mar;51:102459. doi: 10.1016/j.fsigen.2020.102459. Epub 2020 Dec 25.

Using unique molecular identifiers to improve allele calling in low-template mixtures.

Forensic Sci Int Genet. 2023 Mar;63:102807. doi: 10.1016/j.fsigen.2022.102807. Epub 2022 Nov 24.

Mixture deconvolution by massively parallel sequencing of microhaplotypes.

Int J Legal Med. 2019 May;133(3):719-729. doi: 10.1007/s00414-019-02010-7. Epub 2019 Feb 13.

FDSTools: A software package for analysis of massively parallel sequencing data with the ability to recognise and correct STR stutter and other PCR or sequencing noise.

Forensic Sci Int Genet. 2017 Mar;27:27-40. doi: 10.1016/j.fsigen.2016.11.007. Epub 2016 Nov 27.

Characterizing the amplification of STR markers in multiplex polymerase chain displacement reaction using massively parallel sequencing.

Forensic Sci Int Genet. 2023 Jan;62:102802. doi: 10.1016/j.fsigen.2022.102802. Epub 2022 Oct 21.

Development of a multiplex forensic identity panel for massively parallel sequencing and its systematic optimization using design of experiments.

Forensic Sci Int Genet. 2019 Mar;39:32-43. doi: 10.1016/j.fsigen.2018.11.023. Epub 2018 Nov 30.

High sensitivity multiplex short tandem repeat loci analyses with massively parallel sequencing.

Forensic Sci Int Genet. 2015 May;16:38-47. doi: 10.1016/j.fsigen.2014.11.022. Epub 2014 Dec 3.

Evaluation of Promega PowerSeq™ Auto/Y systems prototype on an admixed sample of Rio de Janeiro, Brazil: Population data, sensitivity, stutter and mixture studies.

Forensic Sci Int Genet. 2021 Jul;53:102516. doi: 10.1016/j.fsigen.2021.102516. Epub 2021 Apr 6.

Investigation into the sequence structure of 23 Y chromosomal STR loci using massively parallel sequencing.

Forensic Sci Int Genet. 2016 Nov;25:132-141. doi: 10.1016/j.fsigen.2016.08.010. Epub 2016 Aug 28.

引用本文的文献

Evaluation of automatic cell free DNA extraction metrics using different blood collection tubes.

Sci Rep. 2025 Jun 3;15(1):19364. doi: 10.1038/s41598-025-03508-4.

Digital sequencing is improved by using structured unique molecular identifiers.

Genome Biol. 2025 Feb 25;26(1):37. doi: 10.1186/s13059-025-03504-x.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用独特分子标识符和 SiMSen-Seq 方法进行 STR 标记的超灵敏测序。

Ultrasensitive sequencing of STR markers utilizing unique molecular identifiers and the SiMSen-Seq method.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献