Suppr超能文献

通过假阳性预测算法减少桑格确认测试。

Reducing Sanger confirmation testing through false positive prediction algorithms.

机构信息

HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA.

出版信息

Genet Med. 2021 Jul;23(7):1255-1262. doi: 10.1038/s41436-021-01148-3. Epub 2021 Mar 25.

Abstract

PURPOSE

Clinical genome sequencing (cGS) followed by orthogonal confirmatory testing is standard practice. While orthogonal testing significantly improves specificity, it also results in increased turnaround time and cost of testing. The purpose of this study is to evaluate machine learning models trained to identify false positive variants in cGS data to reduce the need for orthogonal testing.

METHODS

We sequenced five reference human genome samples characterized by the Genome in a Bottle Consortium (GIAB) and compared the results with an established set of variants for each genome referred to as a truth set. We then trained machine learning models to identify variants that were labeled as false positives.

RESULTS

After training, the models identified 99.5% of the false positive heterozygous single-nucleotide variants (SNVs) and heterozygous insertions/deletions variants (indels) while reducing confirmatory testing of nonactionable, nonprimary SNVs by 85% and indels by 75%. Employing the algorithm in clinical practice reduced overall orthogonal testing using dideoxynucleotide (Sanger) sequencing by 71%.

CONCLUSION

Our results indicate that a low false positive call rate can be maintained while significantly reducing the need for confirmatory testing. The framework that generated our models and results is publicly available at https://github.com/HudsonAlpha/STEVE .

摘要

目的

临床基因组测序(cGS)后进行正交确认性测试是标准做法。虽然正交测试显著提高了特异性,但也导致了测试周转时间和成本的增加。本研究的目的是评估经过训练以识别 cGS 数据中假阳性变体的机器学习模型,以减少对正交测试的需求。

方法

我们对五个由基因组瓶联盟(GIAB)表征的参考人类基因组样本进行测序,并将结果与每个基因组的一组已建立的变体进行比较,这些变体称为真实集。然后,我们训练机器学习模型来识别被标记为假阳性的变体。

结果

经过训练,模型识别出 99.5%的假阳性杂合单核苷酸变体(SNV)和杂合插入/缺失变体(indels),同时将非操作性、非主要 SNV 的确认性测试减少了 85%,indels 减少了 75%。在临床实践中使用该算法可将双脱氧核苷酸(Sanger)测序的总体正交测试减少 71%。

结论

我们的结果表明,在保持低假阳性率的同时,可以显著减少确认性测试的需求。生成我们模型和结果的框架可在 https://github.com/HudsonAlpha/STEVE 上公开获取。

相似文献

引用本文的文献

本文引用的文献

7
Variant Review with the Integrative Genomics Viewer.使用综合基因组浏览器进行变异审查。
Cancer Res. 2017 Nov 1;77(21):e31-e34. doi: 10.1158/0008-5472.CAN-17-0337.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验