Suppr超能文献

CmVCall:一种自动化和可调的纳米孔分析管道,用于检测人类线粒体基因组控制区的异质体。

CmVCall: An automated and adjustable nanopore analysis pipeline for heteroplasmy detection of the control region in human mitochondrial genome.

机构信息

Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu 610041, China.

School of Life Sciences, Jilin University, Changchun 130012, China.

出版信息

Forensic Sci Int Genet. 2023 Nov;67:102930. doi: 10.1016/j.fsigen.2023.102930. Epub 2023 Aug 14.

Abstract

Genetic associations between human mitochondrial DNA (mtDNA) heteroplasmy and mitochondrial diseases, aging, and cancer have been elaborated, contributing a lot to the further understanding of mtDNA polymorphic spectrum in anthropology, population, and forensic genetics. In the past decade, heteroplasmy detection using Sanger sequencing and next generation sequencing (NGS) was hampered by the former's inefficiency and the latter's inherent bias due to amplification and mapping of short reads, respectively. Nanopore sequencing stands out for its ability to yield long contiguous segments of DNA, providing a new insight into heterogeneity authentication. In addition to MinION from Oxford Nanopore Technologies, an alternative nanopore sequencer QNome (Qitan Technology) has also been applied to various biological research and the forensic applicability of this platform has been proved recently. In this study, we evaluated the performance of four commonly used variant callers in the heterogeneity authentication of the control region of human mtDNA based on simulations of different ratios generated by mixing QNome nanopore sequencing reads of two synthetic sequences. Then, an open-source and python-based nanopore analytics pipeline, CmVCall was developed and incorporated multiple programs including reads filtering, removal of nuclear mitochondrial sequences (NUMTs), alignment, optional 'Correction' mode, and heterogeneity identification. CmVCall can achieve high precision, accuracy, and recall of 100%, 99.9%, and 92.3% with a 5% heteroplasmy level in 'Correction' mode. Moreover, blood, saliva, and hair shaft samples from monozygotic (MZ) twins were used for heterogeneity evaluation and comparison with the NGS data. Results of MZ twin samples showed that CmVCall could identify more point heteroplasmy sites, revealing significant levels of inter- and intra-individual mtDNA polymorphism. In conclusion, we believe that this analysis pipeline will lay a solid foundation for the development of a comprehensive nanopore analysis pipeline targeting the whole mitochondrial genome.

摘要

人类线粒体 DNA(mtDNA)异质性与线粒体疾病、衰老和癌症之间的遗传关联已经得到了详细阐述,这对进一步了解人类学、人群和法医遗传学中的 mtDNA 多态性谱有很大帮助。在过去的十年中,桑格测序和下一代测序(NGS)在检测异质性方面受到了限制,前者效率低下,后者由于短读长的扩增和映射而存在固有偏差。纳米孔测序因其能够产生长的连续 DNA 片段而脱颖而出,为异质性验证提供了新的视角。除了牛津纳米孔技术公司的 MinION 之外,另一种纳米孔测序仪 QNome(奇天技术)也已经应用于各种生物学研究中,最近证明了该平台在法医学中的适用性。在这项研究中,我们评估了基于混合两种人工序列的 QNome 纳米孔测序reads 生成的不同比例的模拟数据,使用四种常用的变异调用器在人类 mtDNA 控制区异质性验证中的性能。然后,开发了一个开源的、基于 Python 的纳米孔分析管道 CmVCall,并集成了多个程序,包括读取过滤、核线粒体序列(NUMTs)去除、比对、可选的“校正”模式和异质性识别。在“校正”模式下,CmVCall 可以达到 100%的高精度、99.9%的准确率和 92.3%的召回率,异质性水平为 5%。此外,还使用来自同卵双胞胎的血液、唾液和毛发样本进行异质性评估,并与 NGS 数据进行比较。MZ 双胞胎样本的结果表明,CmVCall 可以识别更多的点异质性位点,揭示了个体间和个体内 mtDNA 多态性的显著水平。总之,我们相信这个分析管道将为开发针对整个线粒体基因组的综合纳米孔分析管道奠定坚实的基础。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验