基于一小部分单核苷酸多态性（SNP）的靶向二代测序（NGS）数据进行基因分型能够正确匹配患者样本。

Genotyping from targeted NGS data based on a small set of SNPs correctly matches patient samples.

作者信息

Yosifov Deyan Yordanov, Schneider Christof, Stilgenbauer Stephan, Mertens Daniel, Tausch Eugen

机构信息

Division of CLL, Department of Internal Medicine III, Ulm University Hospital, Ulm, Germany.

Cooperation Unit "Mechanisms of Leukemogenesis", German Cancer Research Center (DKFZ), Heidelberg, Germany.

出版信息

BMC Res Notes. 2025 Jul 2;18(1):270. doi: 10.1186/s13104-025-07348-3.

DOI:10.1186/s13104-025-07348-3

PMID:40605045

Abstract

OBJECTIVE

Mislabelling and swapping of laboratory samples are handling errors that can lead to erroneous interpretation of data and/or patient harm. Sequenced samples can be traced back to the respective donors by matching of single nucleotide polymorphisms (SNPs). Frameworks and software to do this have been developed for use with whole genome/exome sequencing data but not for targeted next-generation sequencing (tNGS), possibly due to the limited genomic coverage with tNGS and the need for individualization of the set of interrogated SNPs. We decided to adapt a popular tool for use with tNGS data, to demonstrate the possibility of selecting informative SNPs from a typical tNGS panel and to create an automated workflow for detection of sample handling errors.

RESULTS

We compiled a custom list of 28 SNPs and with its help we demonstrated the practicability of using only tNGS data to cost-effectively detect mislabelled samples. In two cohorts of totally 1441 patients with sequential samples, we could identify 3 sample swaps, 7 mislabelled samples (3 externally and 4 internally) and 1 mistake of unknown origin. We provide an R function for automated detection of sample swaps and mislabelling to the community as a free and open-source tool.

摘要

目的

实验室样本标记错误和样本交换是操作失误，可能导致数据解读错误和/或对患者造成伤害。通过单核苷酸多态性（SNP）匹配，测序样本可追溯至各自的捐赠者。用于全基因组/外显子组测序数据的相关框架和软件已开发出来，但针对靶向新一代测序（tNGS）的尚未开发，这可能是由于tNGS的基因组覆盖范围有限，以及需要对所检测的SNP集进行个体化处理。我们决定改编一种常用工具以用于tNGS数据，证明从典型的tNGS面板中选择信息性SNP的可能性，并创建一个用于检测样本处理错误的自动化工作流程。

结果

我们编制了一份包含28个SNP的自定义列表，并借助该列表证明了仅使用tNGS数据经济高效地检测标记错误样本的可行性。在两组共1,441例有连续样本的患者中，我们能够识别出3次样本交换、7个标记错误的样本（3个外部样本和4个内部样本）以及1个来源不明的错误。我们向社区提供了一个用于自动检测样本交换和标记错误的R函数，作为免费的开源工具。

相似文献

Genotyping from targeted NGS data based on a small set of SNPs correctly matches patient samples.基于一小部分单核苷酸多态性（SNP）的靶向二代测序（NGS）数据进行基因分型能够正确匹配患者样本。

BMC Res Notes. 2025 Jul 2;18(1):270. doi: 10.1186/s13104-025-07348-3.

Laboratory validation of targeted next-generation sequencing assay for pathogen detection in lower respiratory infection.用于下呼吸道感染病原体检测的靶向新一代测序检测方法的实验室验证

Microbiol Spectr. 2025 Jul;13(7):e0175124. doi: 10.1128/spectrum.01751-24. Epub 2025 May 21.

Interventions to reduce harm from continued tobacco use.减少持续吸烟危害的干预措施。

Cochrane Database Syst Rev. 2016 Oct 13;10(10):CD005231. doi: 10.1002/14651858.CD005231.pub3.

Diagnostic test accuracy and cost-effectiveness of tests for codeletion of chromosomal arms 1p and 19q in people with glioma.染色体臂 1p 和 19q 缺失的检测在胶质瘤患者中的诊断准确性和成本效益。

Cochrane Database Syst Rev. 2022 Mar 2;3(3):CD013387. doi: 10.1002/14651858.CD013387.pub2.

Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益

Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.

Rapid, point-of-care antigen tests for diagnosis of SARS-CoV-2 infection.用于 SARS-CoV-2 感染诊断的快速、即时抗原检测。

Cochrane Database Syst Rev. 2022 Jul 22;7(7):CD013705. doi: 10.1002/14651858.CD013705.pub3.

Multiplex amplicon sequencing for the comprehensive genotyping of .用于全面基因分型的多重扩增子测序。不过你提供的原文似乎不完整，后面应该还有具体的研究对象等内容。

Microbiol Spectr. 2025 Jul;13(7):e0271924. doi: 10.1128/spectrum.02719-24. Epub 2025 May 22.

Clinical evaluation of two pathogen enrichment approaches for next-generation sequencing in the diagnosis of lower respiratory tract infections.两种病原体富集方法用于下一代测序诊断下呼吸道感染的临床评估

Microbiol Spectr. 2025 Jul;13(7):e0092225. doi: 10.1128/spectrum.00922-25. Epub 2025 Jun 11.

Technological aids for the rehabilitation of memory and executive functioning in children and adolescents with acquired brain injury.脑损伤儿童和青少年记忆与执行功能康复的技术辅助手段。

Cochrane Database Syst Rev. 2016 Jul 1;7(7):CD011020. doi: 10.1002/14651858.CD011020.pub2.

Assessing the comparative effects of interventions in COPD: a tutorial on network meta-analysis for clinicians.评估慢性阻塞性肺疾病干预措施的比较效果：面向临床医生的网状Meta分析教程

Respir Res. 2024 Dec 21;25(1):438. doi: 10.1186/s12931-024-03056-x.

本文引用的文献

Genetic sex validation for sample tracking in next-generation sequencing clinical testing.用于下一代测序临床检测中样本追踪的遗传性别验证。

BMC Res Notes. 2024 Mar 3;17(1):62. doi: 10.1186/s13104-024-06723-w.

A community effort to identify and correct mislabeled samples in proteogenomic studies.一项旨在识别和纠正蛋白质基因组学研究中错误标记样本的社区行动。

Patterns (N Y). 2021 May 7;2(5):100245. doi: 10.1016/j.patter.2021.100245. eCollection 2021 May 14.

Uniform genomic data analysis in the NCI Genomic Data Commons.在 NCI 基因组数据共享中心进行统一的基因组数据分析。

Nat Commun. 2021 Feb 22;12(1):1226. doi: 10.1038/s41467-021-21254-9.

SMaSH: Sample matching using SNPs in humans.SMaSH：基于人类 SNP 进行样本匹配。

BMC Genomics. 2019 Dec 30;20(Suppl 12):1001. doi: 10.1186/s12864-019-6332-7.

Maftools: efficient and comprehensive analysis of somatic variants in cancer.Maftools：癌症体细胞变异的高效全面分析。

Genome Res. 2018 Nov;28(11):1747-1756. doi: 10.1101/gr.239244.118. Epub 2018 Oct 19.

A SNP panel for identification of DNA and RNA specimens.用于鉴定 DNA 和 RNA 样本的 SNP 面板。

BMC Genomics. 2018 Jan 25;19(1):90. doi: 10.1186/s12864-018-4482-7.

Patient and Sample Identification. Out of the Maze?患者及样本识别。走出迷宫？

J Med Biochem. 2017 Apr 22;36(2):107-112. doi: 10.1515/jomb-2017-0003. eCollection 2017 Apr.

NGSCheckMate: software for validating sample identity in next-generation sequencing studies within and across data types.NGSCheckMate：用于在下一代测序研究中验证样本身份的数据类型内和跨数据类型的软件。

Nucleic Acids Res. 2017 Jun 20;45(11):e103. doi: 10.1093/nar/gkx193.

LDlink: a web-based application for exploring population-specific haplotype structure and linking correlated alleles of possible functional variants.LDlink：一个基于网络的应用程序，用于探索特定人群的单倍型结构，并链接可能具有功能变异的相关等位基因。

Bioinformatics. 2015 Nov 1;31(21):3555-7. doi: 10.1093/bioinformatics/btv402. Epub 2015 Jul 2.

Patient misidentification in laboratory medicine: a qualitative analysis of 227 root cause analysis reports in the Veterans Health Administration.实验室医学中的患者身份识别错误：退伍军人健康管理局 227 份根本原因分析报告的定性分析。

Arch Pathol Lab Med. 2010 Feb;134(2):244-55. doi: 10.5858/134.2.244.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于一小部分单核苷酸多态性（SNP）的靶向二代测序（NGS）数据进行基因分型能够正确匹配患者样本。

Genotyping from targeted NGS data based on a small set of SNPs correctly matches patient samples.

作者信息

机构信息

出版信息

OBJECTIVE

RESULTS

目的

结果

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献