• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

VirDiG:一种用于冠状病毒的转录组组装工具

VirDiG: a transcriptome assembler for coronavirus.

作者信息

Li Minghao, Guo Xuaoyu, Zhao Jin

机构信息

School of Computer Science and Technology, Qingdao University, Shandong 266071, China.

出版信息

Bioinform Adv. 2025 Apr 8;5(1):vbaf075. doi: 10.1093/bioadv/vbaf075. eCollection 2025.

DOI:10.1093/bioadv/vbaf075
PMID:40291015
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12034387/
Abstract

MOTIVATION

The discontinuous transcription mechanism of coronaviruses contributes to their adaptation to different host environments and plays a critical role in their lifecycle. Accurate assembly of coronavirus transcripts is vital for understanding the virus's biological traits and developing precise prevention and treatment strategies. However, existing assembly algorithms are primarily designed for alternative splicing events in eukaryotes and are not suitable for assembling coronavirus transcriptome, which consists of both genomic RNA and subgenomic mRNAs. Coronavirus transcriptome reconstruction from short reads remains a challenging problem.

RESULTS

In this work, we present VirDiG, a transcriptome assembler specifically designed for coronaviruses. VirDiG utilizes a discontinuous graph to facilitate accurate transcript assembly by incorporating information from paired-end reads, sequence depth, and start and stop codons. Experimental results from both simulated and real datasets show that VirDiG exhibits significant advantages in reconstructing the transcriptome of coronaviruses when compared to traditional assemblers tailored for classical eukaryotic transcriptome assembly.

AVAILABILITY AND IMPLEMENTATION

VirDiG is freely available at https://github.com/Limh616/VirDiG.git.

摘要

动机

冠状病毒的不连续转录机制有助于其适应不同的宿主环境,并在其生命周期中发挥关键作用。准确组装冠状病毒转录本对于理解病毒的生物学特性以及制定精确的预防和治疗策略至关重要。然而,现有的组装算法主要是为真核生物中的可变剪接事件设计的,并不适用于组装由基因组RNA和亚基因组mRNA组成的冠状病毒转录组。从短读长中重建冠状病毒转录组仍然是一个具有挑战性的问题。

结果

在这项工作中,我们提出了VirDiG,一种专门为冠状病毒设计的转录组组装器。VirDiG利用不连续图,通过整合来自双端读长、序列深度以及起始和终止密码子的信息,来促进准确的转录本组装。来自模拟数据集和真实数据集的实验结果表明,与为经典真核生物转录组组装量身定制的传统组装器相比,VirDiG在重建冠状病毒转录组方面具有显著优势。

可用性和实现方式

VirDiG可在https://github.com/Limh616/VirDiG.git上免费获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34ea/12034387/282aafe8e111/vbaf075f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34ea/12034387/93249baef2f9/vbaf075f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34ea/12034387/c30f33edcc28/vbaf075f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34ea/12034387/11f388f5de4e/vbaf075f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34ea/12034387/282aafe8e111/vbaf075f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34ea/12034387/93249baef2f9/vbaf075f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34ea/12034387/c30f33edcc28/vbaf075f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34ea/12034387/11f388f5de4e/vbaf075f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34ea/12034387/282aafe8e111/vbaf075f4.jpg

相似文献

1
VirDiG: a transcriptome assembler for coronavirus.VirDiG:一种用于冠状病毒的转录组组装工具
Bioinform Adv. 2025 Apr 8;5(1):vbaf075. doi: 10.1093/bioadv/vbaf075. eCollection 2025.
2
Cov-trans: an efficient algorithm for discontinuous transcript assembly in coronaviruses.Cov-trans:一种用于冠状病毒中不连续转录本组装的高效算法。
BMC Genomics. 2024 Dec 30;25(1):1257. doi: 10.1186/s12864-024-11179-0.
3
Ameliorated de novo transcriptome assembly using Illumina paired end sequence data with Trinity Assembler.使用Trinity组装器结合Illumina双端序列数据改进从头转录组组装。
Genom Data. 2015 Jul 15;5:352-9. doi: 10.1016/j.gdata.2015.07.012. eCollection 2015 Sep.
4
DTA-SiST: de novo transcriptome assembly by using simplified suffix trees.DTA-SiST:使用简化后缀树进行从头转录组组装。
BMC Bioinformatics. 2019 Dec 24;20(Suppl 25):698. doi: 10.1186/s12859-019-3272-9.
5
Playing hide and seek with repeats in local and global de novo transcriptome assembly of short RNA-seq reads.在短RNA测序读数的局部和全局从头转录组组装中与重复序列玩捉迷藏游戏。
Algorithms Mol Biol. 2017 Feb 22;12:2. doi: 10.1186/s13015-017-0091-2. eCollection 2017.
6
Improving the quality of barley transcriptome de novo assembling by using a hybrid approach for lines with varying spike and stem coloration.通过使用混合方法提高不同穗部和茎部颜色品系大麦转录组从头组装的质量。
Vavilovskii Zhurnal Genet Selektsii. 2021 Feb;25(1):30-38. doi: 10.18699/VJ21.004.
7
PERGA: a paired-end read guided de novo assembler for extending contigs using SVM and look ahead approach.PERGA:一种用于使用支持向量机和前瞻方法扩展重叠群的双端读段引导的从头组装器。
PLoS One. 2014 Dec 2;9(12):e114253. doi: 10.1371/journal.pone.0114253. eCollection 2014.
8
TransLiG: a de novo transcriptome assembler that uses line graph iteration.TransLiG:一种基于线图迭代的从头转录组组装算法。
Genome Biol. 2019 Apr 23;20(1):81. doi: 10.1186/s13059-019-1690-7.
9
TraRECo: a greedy approach based de novo transcriptome assembler with read error correction using consensus matrix.TraRECo:一种基于贪心策略的从头转录组组装方法,使用一致矩阵进行读错误校正。
BMC Genomics. 2018 Sep 4;19(1):653. doi: 10.1186/s12864-018-5034-x.
10
ClusTrast: a short read de novo transcript isoform assembler guided by clustered contigs.ClusTrast:一种基于聚类 contigs 的短读从头转录本异构体组装工具。
BMC Bioinformatics. 2024 Feb 1;25(1):54. doi: 10.1186/s12859-024-05663-3.

本文引用的文献

1
Cov-trans: an efficient algorithm for discontinuous transcript assembly in coronaviruses.Cov-trans:一种用于冠状病毒中不连续转录本组装的高效算法。
BMC Genomics. 2024 Dec 30;25(1):1257. doi: 10.1186/s12864-024-11179-0.
2
The coronavirus recombination pathway.冠状病毒重组途径。
Cell Host Microbe. 2023 Jun 14;31(6):874-889. doi: 10.1016/j.chom.2023.05.003.
3
Improved transcriptome assembly using a hybrid of long and short reads with StringTie.使用长读长和短读长混合的方法进行转录组组装,可提高组装质量。
PLoS Comput Biol. 2022 Jun 1;18(6):e1009730. doi: 10.1371/journal.pcbi.1009730. eCollection 2022 Jun.
4
Structures and functions of coronavirus replication-transcription complexes and their relevance for SARS-CoV-2 drug design.冠状病毒复制-转录复合物的结构和功能及其与 SARS-CoV-2 药物设计的相关性。
Nat Rev Mol Cell Biol. 2022 Jan;23(1):21-39. doi: 10.1038/s41580-021-00432-z. Epub 2021 Nov 25.
5
Jumper enables discontinuous transcript assembly in coronaviruses.跳跃基因使冠状病毒的不连续转录组装成为可能。
Nat Commun. 2021 Nov 18;12(1):6728. doi: 10.1038/s41467-021-26944-y.
6
coronaSPAdes: from biosynthetic gene clusters to RNA viral assemblies.coronaSPAdes:从生物合成基因簇到 RNA 病毒组装。
Bioinformatics. 2021 Dec 22;38(1):1-8. doi: 10.1093/bioinformatics/btab597.
7
MultiTrans: An Algorithm for Path Extraction Through Mixed Integer Linear Programming for Transcriptome Assembly.MultiTrans:一种通过混合整数线性规划进行转录组组装的路径提取算法。
IEEE/ACM Trans Comput Biol Bioinform. 2022 Jan-Feb;19(1):48-56. doi: 10.1109/TCBB.2021.3083277. Epub 2022 Feb 3.
8
The coding capacity of SARS-CoV-2.SARS-CoV-2 的编码能力。
Nature. 2021 Jan;589(7840):125-130. doi: 10.1038/s41586-020-2739-1. Epub 2020 Sep 9.
9
The Architecture of SARS-CoV-2 Transcriptome.SARS-CoV-2 转录组的结构。
Cell. 2020 May 14;181(4):914-921.e10. doi: 10.1016/j.cell.2020.04.011. Epub 2020 Apr 23.
10
rnaSPAdes: a de novo transcriptome assembler and its application to RNA-Seq data.rnaSPAdes:一种从头转录组组装程序及其在 RNA-Seq 数据中的应用。
Gigascience. 2019 Sep 1;8(9). doi: 10.1093/gigascience/giz100.