RNA测序数据中融合转录本检测方法的比较评估

Comparative assessment of methods for the fusion transcripts detection from RNA-Seq data.

作者信息

Kumar Shailesh, Vo Angie Duy, Qin Fujun, Li Hui

机构信息

Department of Pathology, School of Medicine, University of Virginia, Charlottesville, VA 22908.

Department of Biochemistry and Molecular Genetics, School of Medicine, University of Virginia, Charlottesville, VA 22908.

出版信息

Sci Rep. 2016 Feb 10;6:21597. doi: 10.1038/srep21597.

DOI:10.1038/srep21597

PMID:26862001

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4748267/

Abstract

RNA-Seq made possible the global identification of fusion transcripts, i.e. "chimeric RNAs". Even though various software packages have been developed to serve this purpose, they behave differently in different datasets provided by different developers. It is important for both users, and developers to have an unbiased assessment of the performance of existing fusion detection tools. Toward this goal, we compared the performance of 12 well-known fusion detection software packages. We evaluated the sensitivity, false discovery rate, computing time, and memory usage of these tools in four different datasets (positive, negative, mixed, and test). We conclude that some tools are better than others in terms of sensitivity, positive prediction value, time consumption and memory usage. We also observed small overlaps of the fusions detected by different tools in the real dataset (test dataset). This could be due to false discoveries by various tools, but could also be due to the reason that none of the tools are inclusive. We have found that the performance of the tools depends on the quality, read length, and number of reads of the RNA-Seq data. We recommend that users choose the proper tools for their purpose based on the properties of their RNA-Seq data.

摘要

RNA测序使得对融合转录本（即“嵌合RNA”）进行全面鉴定成为可能。尽管已经开发了各种软件包来实现这一目的，但它们在不同开发者提供的不同数据集中表现各异。对于用户和开发者而言，对现有融合检测工具的性能进行公正评估都很重要。为实现这一目标，我们比较了12个知名融合检测软件包的性能。我们在四个不同数据集（阳性、阴性、混合和测试）中评估了这些工具的灵敏度、错误发现率、计算时间和内存使用情况。我们得出结论，在灵敏度、阳性预测值、时间消耗和内存使用方面，一些工具比其他工具表现更好。我们还观察到在真实数据集（测试数据集）中，不同工具检测到的融合存在少量重叠。这可能是由于各种工具的错误发现，但也可能是因为没有一个工具是包罗万象的。我们发现工具的性能取决于RNA测序数据的质量、读长和读数数量。我们建议用户根据其RNA测序数据的特性为自己的目的选择合适的工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5828/4748267/77ac2f13e86a/srep21597-f1.jpg

相似文献

Comparative assessment of methods for the fusion transcripts detection from RNA-Seq data.RNA测序数据中融合转录本检测方法的比较评估

Sci Rep. 2016 Feb 10;6:21597. doi: 10.1038/srep21597.

InFusion: Advancing Discovery of Fusion Genes and Chimeric Transcripts from Deep RNA-Sequencing Data.InFusion：从深度RNA测序数据中推进融合基因和嵌合转录本的发现

PLoS One. 2016 Dec 1;11(12):e0167417. doi: 10.1371/journal.pone.0167417. eCollection 2016.

Identifying fusion transcripts using next generation sequencing.使用下一代测序技术鉴定融合转录本。

Wiley Interdiscip Rev RNA. 2016 Nov;7(6):811-823. doi: 10.1002/wrna.1382. Epub 2016 Aug 2.

SimBA: A methodology and tools for evaluating the performance of RNA-Seq bioinformatic pipelines.SimBA：一种用于评估RNA测序生物信息学流程性能的方法和工具。

BMC Bioinformatics. 2017 Sep 29;18(1):428. doi: 10.1186/s12859-017-1831-5.

Comparative study of bioinformatic tools for the identification of chimeric RNAs from RNA Sequencing.生物信息学工具在 RNA 测序中鉴定嵌合 RNA 的比较研究。

RNA Biol. 2021 Oct 15;18(sup1):254-267. doi: 10.1080/15476286.2021.1940047. Epub 2021 Jun 18.

ChimPipe: accurate detection of fusion genes and transcription-induced chimeras from RNA-seq data.ChimPipe：从RNA测序数据中准确检测融合基因和转录诱导嵌合体。

BMC Genomics. 2017 Jan 3;18(1):7. doi: 10.1186/s12864-016-3404-9.

SOAPfusion: a robust and effective computational fusion discovery tool for RNA-seq reads.SOAPfusion：一种用于 RNA-seq 读段的强大而有效的计算融合发现工具。

Bioinformatics. 2013 Dec 1;29(23):2971-8. doi: 10.1093/bioinformatics/btt522. Epub 2013 Oct 11.

State of art fusion-finder algorithms are suitable to detect transcription-induced chimeras in normal tissues?最先进的融合查找算法是否适合检测正常组织中转录诱导的嵌合体？

BMC Bioinformatics. 2013;14 Suppl 7(Suppl 7):S2. doi: 10.1186/1471-2105-14-S7-S2. Epub 2013 Apr 22.

SimFuse: A Novel Fusion Simulator for RNA Sequencing (RNA-Seq) Data.SimFuse：一种用于RNA测序（RNA-Seq）数据的新型融合模拟器

Biomed Res Int. 2015;2015:780519. doi: 10.1155/2015/780519. Epub 2015 Dec 29.

Discovering chimeric transcripts in paired-end RNA-seq data by using EricScript.使用 EricScript 从 RNA-seq 数据中发现嵌合转录本。

Bioinformatics. 2012 Dec 15;28(24):3232-9. doi: 10.1093/bioinformatics/bts617. Epub 2012 Oct 23.

引用本文的文献

Whole transcriptome analysis identifies fusion as a novel biomarker in metastatic colorectal cancer.全转录组分析确定融合为转移性结直肠癌中的一种新型生物标志物。

Cancer Pathog Ther. 2025 Feb 4;3(5):420-433. doi: 10.1016/j.cpt.2025.02.002. eCollection 2025 Sep.

Profiling chimeric RNA in prostate cancer in Chinese cohorts reveals similarities and differences compared to Western populations.在中国人群中对前列腺癌嵌合RNA进行分析，揭示了与西方人群相比的异同。

Imeta. 2025 Mar 13;4(2):e70014. doi: 10.1002/imt2.70014. eCollection 2025 Apr.

SplitFusion enables ultrasensitive gene fusion detection and reveals fusion variant-associated tumor heterogeneity.SplitFusion可实现超灵敏的基因融合检测，并揭示与融合变异相关的肿瘤异质性。

Patterns (N Y). 2025 Feb 14;6(2):101174. doi: 10.1016/j.patter.2025.101174.

Architects and Partners: The Dual Roles of Non-coding RNAs in Gene Fusion Events.架构师与合作伙伴：非编码RNA在基因融合事件中的双重作用

Methods Mol Biol. 2025;2883:231-255. doi: 10.1007/978-1-0716-4290-0_10.

Oncogenic fusion protein interacts with polypyrimidine tract binding protein 1 to facilitate bladder cancer proliferation and metastasis by regulating mRNA stability.致癌融合蛋白与多嘧啶序列结合蛋白1相互作用，通过调节mRNA稳定性促进膀胱癌的增殖和转移。

MedComm (2020). 2024 Aug 14;5(9):e685. doi: 10.1002/mco2.685. eCollection 2024 Sep.

Precision Medicine in Cytopathology.细胞病理学中的精准医学。

Surg Pathol Clin. 2024 Sep;17(3):329-345. doi: 10.1016/j.path.2024.04.002. Epub 2024 May 18.

A Protocol for the Detection of Fusion Transcripts Using RNA-Sequencing Data.使用 RNA 测序数据检测融合转录本的方案。

Methods Mol Biol. 2024;2812:243-258. doi: 10.1007/978-1-0716-3886-6_14.

RTCpredictor: identification of read-through chimeric RNAs from RNA sequencing data.RTCpredictor：从 RNA 测序数据中识别通读嵌合 RNA。

Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae251.

Fusion InPipe, an integrative pipeline for gene fusion detection from RNA-seq data in acute pediatric leukemia.Fusion InPipe，一种用于从急性小儿白血病的RNA测序数据中检测基因融合的综合流程。

Front Mol Biosci. 2023 Jun 9;10:1141310. doi: 10.3389/fmolb.2023.1141310. eCollection 2023.

Targeted characterization of fusion transcripts in tumor and normal tissues via FusionInspector.通过 FusionInspector 对肿瘤和正常组织中的融合转录本进行靶向特征分析。

Cell Rep Methods. 2023 May 8;3(5):100467. doi: 10.1016/j.crmeth.2023.100467. eCollection 2023 May 22.

本文引用的文献

Characterization of fusion genes and the significantly expressed fusion isoforms in breast cancer by hybrid sequencing.通过混合测序技术对乳腺癌中的融合基因和显著表达的融合异构体进行特征分析。

Nucleic Acids Res. 2015 Oct 15;43(18):e116. doi: 10.1093/nar/gkv562. Epub 2015 Jun 3.

JAFFA: High sensitivity transcriptome-focused fusion gene detection.贾法：高灵敏度的转录组聚焦融合基因检测。

Genome Med. 2015 May 11;7(1):43. doi: 10.1186/s13073-015-0167-x. eCollection 2015.

Discovery of CTCF-sensitive Cis-spliced fusion RNAs between adjacent genes in human prostate cells.人类前列腺细胞中相邻基因间CTCF敏感的顺式剪接融合RNA的发现。

PLoS Genet. 2015 Feb 6;11(2):e1005001. doi: 10.1371/journal.pgen.1005001. eCollection 2015 Feb.

Chimeric RNAs generated by intergenic splicing in normal and cancer cells.正常细胞和癌细胞中由基因间剪接产生的嵌合RNA。

Genes Chromosomes Cancer. 2014 Dec;53(12):963-71. doi: 10.1002/gcc.22207. Epub 2014 Aug 11.

TIGRA: a targeted iterative graph routing assembler for breakpoint assembly.TIGRA：一种用于断点组装的靶向迭代图路由组装器。

Genome Res. 2014 Feb;24(2):310-7. doi: 10.1101/gr.162883.113. Epub 2013 Dec 4.

State of art fusion-finder algorithms are suitable to detect transcription-induced chimeras in normal tissues?最先进的融合查找算法是否适合检测正常组织中转录诱导的嵌合体？

BMC Bioinformatics. 2013;14 Suppl 7(Suppl 7):S2. doi: 10.1186/1471-2105-14-S7-S2. Epub 2013 Apr 22.

TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions.TopHat2：在存在插入、缺失和基因融合的情况下对转录组进行精确比对。

Genome Biol. 2013 Apr 25;14(4):R36. doi: 10.1186/gb-2013-14-4-r36.

State-of-the-art fusion-finder algorithms sensitivity and specificity.最先进的融合发现算法的灵敏度和特异性。

Biomed Res Int. 2013;2013:340620. doi: 10.1155/2013/340620. Epub 2013 Feb 17.

SOAPfuse: an algorithm for identifying fusion transcripts from paired-end RNA-Seq data.SOAPfuse：一种用于从双末端RNA测序数据中识别融合转录本的算法。

Genome Biol. 2013 Feb 14;14(2):R12. doi: 10.1186/gb-2013-14-2-r12.

Recurrent reciprocal RNA chimera involving YPEL5 and PPP1CB in chronic lymphocytic leukemia.慢性淋巴细胞白血病中涉及 YPEL5 和 PPP1CB 的反复 RNA 嵌合体。

Proc Natl Acad Sci U S A. 2013 Feb 19;110(8):3035-40. doi: 10.1073/pnas.1214326110. Epub 2013 Feb 4.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

RNA测序数据中融合转录本检测方法的比较评估

Comparative assessment of methods for the fusion transcripts detection from RNA-Seq data.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献