Cuenca-Guardiola Javier, de la Morena-Barrio Belén, Corral Javier, Fernández-Breis Jesualdo Tomás
Departamento de Informática y Sistemas, IMIB-Pascual Parrilla, CEIR Campus Mare Nostrum, Universidad de Murcia, 30100, Murcia, Spain.
Servicio de Hematología, CIBERER-ISCIII, IMIB-Pascual Parrilla, Centro Regional de Hemodonación, Hospital Universitario Morales Meseguer, Universidad de Murcia, 30003, Murcia, Spain.
Sci Rep. 2025 Apr 25;15(1):14489. doi: 10.1038/s41598-025-98847-7.
Transposable elements (TEs) make up 45% of the human genome, are a source of genetic variability difficult to detect, and involved in processes related to gene regulation and disease. Nanopore sequencing is recognized as one of the best technologies for detecting TEs; however, tools for analyzing of human TE insertions and deletions with nanopore-based data can be improved. RetroInspector is an easy to use, configurable Snakemake pipeline that performs detection, annotation, enrichment, and genotyping of TEs. RetroInspector requires the FASTQ files of the samples and the reference genome to start the identification and analysis of TEs. The user can also set the threshold for the number of supporting reads for the variant filtering. RetroInspector also allows users to compare the results of two samples. Different versions of the reference genome can be used and the presence of retrotransposition features can be annotated. RetroInspector has been run on three nanopore sequencing datasets and validated experimentally using proprietary and public data with over 80% precision.
转座元件(TEs)占人类基因组的45%,是一种难以检测的遗传变异来源,并参与与基因调控和疾病相关的过程。纳米孔测序被认为是检测TEs的最佳技术之一;然而,用于分析基于纳米孔数据的人类TE插入和缺失的工具仍有改进空间。RetroInspector是一个易于使用、可配置的Snakemake工作流程,可对TEs进行检测、注释、富集和基因分型。RetroInspector需要样本的FASTQ文件和参考基因组来启动TEs的识别和分析。用户还可以设置变异过滤的支持读数数量阈值。RetroInspector还允许用户比较两个样本的结果。可以使用不同版本的参考基因组,并注释反转录转座特征的存在。RetroInspector已在三个纳米孔测序数据集上运行,并使用专有数据和公共数据进行了实验验证,精度超过80%。