Suppr超能文献

纳米STR:一种基于纳米孔测序数据检测目标短串联重复序列的方法。

NanoSTR: A method for detection of target short tandem repeats based on nanopore sequencing data.

作者信息

Lang Jidong, Xu Zhihua, Wang Yue, Sun Jiguo, Yang Zhi

机构信息

Department of Bioinformatics and Application Development, Qitan Technology Co., Ltd., Beijing, China.

出版信息

Front Mol Biosci. 2023 Jan 18;10:1093519. doi: 10.3389/fmolb.2023.1093519. eCollection 2023.

Abstract

Short tandem repeats (STRs) are widely present in the human genome. Studies have confirmed that STRs are associated with more than 30 diseases, and they have also been used in forensic identification and paternity testing. However, there are few methods for STR detection based on nanopore sequencing due to the challenges posed by the sequencing principles and the data characteristics of nanopore sequencing. We developed NanoSTR for detection of target STR loci based on the length-number-rank (LNR) information of reads. NanoSTR can be used for STR detection and genotyping based on long-read data from nanopore sequencing with improved accuracy and efficiency compared with other existing methods, such as Tandem-Genotypes and TRiCoLOR. NanoSTR showed 100% concordance with the expected genotypes using error-free simulated data, and also achieved >85% concordance using the standard samples (containing autosomal and Y-chromosomal loci) with MinION sequencing platform, respectively. NanoSTR showed high performance for detection of target STR markers. Although NanoSTR needs further optimization and development, it is useful as an analytical method for the detection of STR loci by nanopore sequencing. This method adds to the toolbox for nanopore-based STR analysis and expands the applications of nanopore sequencing in scientific research and clinical scenarios. The main code and the data are available at https://github.com/langjidong/NanoSTR.

摘要

短串联重复序列(STRs)广泛存在于人类基因组中。研究证实,STRs与30多种疾病相关,并且它们也已被用于法医鉴定和亲子鉴定。然而,由于纳米孔测序的测序原理和数据特征带来的挑战,基于纳米孔测序的STR检测方法很少。我们开发了NanoSTR,用于基于reads的长度-数量-排名(LNR)信息检测目标STR位点。与其他现有方法(如Tandem-Genotypes和TRiCoLOR)相比,NanoSTR可用于基于纳米孔测序的长读长数据进行STR检测和基因分型,具有更高的准确性和效率。使用无错误模拟数据时,NanoSTR与预期基因型的一致性为100%,使用MinION测序平台对标准样本(包含常染色体和Y染色体位点)进行检测时,一致性也分别达到了>85%。NanoSTR在检测目标STR标记方面表现出高性能。尽管NanoSTR需要进一步优化和开发,但它作为一种通过纳米孔测序检测STR位点的分析方法很有用。该方法为基于纳米孔的STR分析增添了工具,并扩展了纳米孔测序在科研和临床场景中的应用。主要代码和数据可在https://github.com/langjidong/NanoSTR获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e395/9889824/86923d52359d/fmolb-10-1093519-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验