• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TransIntegrator:通过整合 Illumina 和 PacBio 转录组,捕获几乎完整的蛋白质编码转录变体。

TransIntegrator: capture nearly full protein-coding transcript variants via integrating Illumina and PacBio transcriptomes.

机构信息

State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Faculty of Medicine and Life Sciences, Xiamen University, 361102, Xiamen, China.

National Institute for Data Science in Health and Medicine, Xiamen University, 361102, Xiamen, China.

出版信息

Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad334.

DOI:10.1093/bib/bbad334
PMID:37779246
Abstract

Genes have the ability to produce transcript variants that perform specific cellular functions. However, accurately detecting all transcript variants remains a long-standing challenge, especially when working with poorly annotated genomes or without a known genome. To address this issue, we have developed a new computational method, TransIntegrator, which enables transcriptome-wide detection of novel transcript variants. For this, we determined 10 Illumina sequencing transcriptomes and a PacBio full-length transcriptome for consecutive embryo development stages of amphioxus, a species of great evolutionary importance. Based on the transcriptomes, we employed TransIntegrator to create a comprehensive transcript variant library, namely iTranscriptome. The resulting iTrancriptome contained 91 915 distinct transcript variants, with an average of 2.4 variants per gene. This substantially improved current amphioxus genome annotation by expanding the number of genes from 21 954 to 38 777. Further analysis manifested that the gene expansion was largely ascribed to integration of multiple Illumina datasets instead of involving the PacBio data. Moreover, we demonstrated an example application of TransIntegrator, via generating iTrancriptome, in aiding accurate transcriptome assembly, which significantly outperformed other hybrid methods such as IDP-denovo and Trinity. For user convenience, we have deposited the source codes of TransIntegrator on GitHub as well as a conda package in Anaconda. In summary, this study proposes an affordable but efficient method for reliable transcriptomic research in most species.

摘要

基因具有产生执行特定细胞功能的转录变体的能力。然而,准确检测所有转录变体仍然是一个长期存在的挑战,尤其是在处理注释不良的基因组或没有已知基因组的情况下。为了解决这个问题,我们开发了一种新的计算方法 TransIntegrator,它能够在全转录组范围内检测新的转录变体。为此,我们确定了 10 个 Illumina 测序转录组和一个 PacBio 全长转录组,用于研究文昌鱼连续胚胎发育阶段,文昌鱼是一种具有重要进化意义的物种。基于转录组,我们使用 TransIntegrator 创建了一个全面的转录变体文库,即 iTranscriptome。生成的 iTranscriptome 包含 91,915 个独特的转录变体,每个基因的平均转录变体数为 2.4 个。这大大改进了当前的文昌鱼基因组注释,将基因数量从 21,954 个扩展到 38,777 个。进一步的分析表明,基因扩展主要归因于多个 Illumina 数据集的整合,而不是涉及 PacBio 数据。此外,我们通过生成 iTranscriptome 展示了 TransIntegrator 的一个应用示例,即帮助准确的转录组组装,这明显优于其他混合方法,如 IDP-denovo 和 Trinity。为了方便用户,我们将 TransIntegrator 的源代码存储在 GitHub 上,并在 Anaconda 中创建了一个 conda 包。总之,本研究提出了一种经济高效的方法,可用于大多数物种中可靠的转录组学研究。

相似文献

1
TransIntegrator: capture nearly full protein-coding transcript variants via integrating Illumina and PacBio transcriptomes.TransIntegrator:通过整合 Illumina 和 PacBio 转录组,捕获几乎完整的蛋白质编码转录变体。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad334.
2
IDP-denovo: de novo transcriptome assembly and isoform annotation by hybrid sequencing.IDP-denovo:通过混合测序进行从头转录组组装和异构体注释。
Bioinformatics. 2018 Jul 1;34(13):2168-2176. doi: 10.1093/bioinformatics/bty098.
3
Illuminating the dark side of the human transcriptome with long read transcript sequencing.利用长读转录组测序揭示人类转录组的暗面。
BMC Genomics. 2020 Oct 30;21(1):751. doi: 10.1186/s12864-020-07123-7.
4
A Full-Length mRNA Transcriptome Generated From Hybrid-Corrected PacBio Long-Reads Improves the Transcript Annotation and Identifies Thousands of Novel Splice Variants in Atlantic Salmon.通过混合校正的PacBio长读长生成的全长mRNA转录组改善了转录本注释并鉴定了大西洋鲑鱼中数千种新的剪接变体。
Front Genet. 2021 Apr 27;12:656334. doi: 10.3389/fgene.2021.656334. eCollection 2021.
5
A survey of the complex transcriptome from the highly polyploid sugarcane genome using full-length isoform sequencing and de novo assembly from short read sequencing.利用全长异构体测序和短读长测序的从头组装对高度多倍体甘蔗基因组的复杂转录组进行的一项调查。
BMC Genomics. 2017 May 22;18(1):395. doi: 10.1186/s12864-017-3757-8.
6
Transcript Profiling Using Long-Read Sequencing Technologies.使用长读长测序技术进行转录本分析
Methods Mol Biol. 2018;1783:121-147. doi: 10.1007/978-1-4939-7834-2_6.
7
PacBio single molecule long-read sequencing provides insight into the complexity and diversity of the Pinctada fucata martensii transcriptome.太平洋生物单分子长读测序技术为了解马氏珠母贝转录组的复杂性和多样性提供了新视角。
BMC Genomics. 2020 Jul 13;21(1):481. doi: 10.1186/s12864-020-06894-3.
8
Unveiling the transcriptomic complexity of Miscanthus sinensis using a combination of PacBio long read- and Illumina short read sequencing platforms.利用 PacBio 长读测序平台和 Illumina 短读测序平台组合揭示芒的转录组复杂性。
BMC Genomics. 2021 Sep 22;22(1):690. doi: 10.1186/s12864-021-07971-x.
9
A high-quality annotated transcriptome of swine peripheral blood.猪外周血的高质量注释转录组。
BMC Genomics. 2017 Jun 24;18(1):479. doi: 10.1186/s12864-017-3863-7.
10
Reconstruction and functional annotation of Ascosphaera apis full-length transcriptome utilizing PacBio long reads combined with Illumina short reads.利用 PacBio 长读长测序和 Illumina 短读长测序重建并功能注释蜜蜂球囊菌全长转录组。
J Invertebr Pathol. 2020 Oct;176:107475. doi: 10.1016/j.jip.2020.107475. Epub 2020 Sep 23.

引用本文的文献

1
Efficacy and mechanisms of Angelica sinensis in treating endometrial cancer: an integrated study.当归治疗子宫内膜癌的疗效及机制:一项综合研究。
Discov Oncol. 2025 May 24;16(1):904. doi: 10.1007/s12672-025-02619-8.
2
Characterization of alternative splicing during mammalian brain development reveals the extent of isoform diversity and potential effects on protein structural changes.哺乳动物大脑发育过程中可变剪接的特征分析揭示了异构体多样性的程度及其对蛋白质结构变化的潜在影响。
Biol Open. 2024 Jul 15;13(10). doi: 10.1242/bio.061721. Epub 2024 Oct 10.