• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

UnigeneFinder:一种用于在没有参考基因组的情况下从转录组组装中进行基因识别的自动化流程。

UnigeneFinder: An Automated Pipeline for Gene Calling From Transcriptome Assemblies Without a Reference Genome.

作者信息

Xue Bo, Prado Karine, Rhee Seung Yon, Stata Matt

机构信息

Plant Resilience Institute Michigan State University East Lansing Michigan USA.

Department of Biochemistry and Molecular Biology Michigan State University East Lansing Michigan USA.

出版信息

Plant Direct. 2025 Apr 22;9(4):e70056. doi: 10.1002/pld3.70056. eCollection 2025 Apr.

DOI:10.1002/pld3.70056
PMID:40264544
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12012387/
Abstract

For most species, transcriptome data are much more readily available than genome data. Without a reference genome, gene calling is cumbersome and inaccurate because of the high degree of redundancy in de novo transcriptome assemblies. To simplify and increase the accuracy of de novo transcriptome assembly in the absence of a reference genome, we developed UnigeneFinder. Combining several clustering methods, UnigeneFinder substantially reduces the redundancy typical of raw transcriptome assemblies. This pipeline offers an effective solution to the problem of inflated transcript numbers, achieving a closer representation of the actual underlying genome. UnigeneFinder performs comparably or better, compared with existing tools, on plant species with varying genome complexities. UnigeneFinder is the only available transcriptome redundancy solution that fully automates the generation of primary transcript, coding region, and protein sequences, analogous to those available for high-quality reference genomes. These features, coupled with the pipeline's cross-platform implementation, focus on automation, and an accessible, user-friendly interface, make UnigeneFinder a useful tool for many downstream sequence-based analyses in nonmodel organisms lacking a reference genome, including differential gene expression analysis, accurate ortholog identification, functional enrichments, and evolutionary analyses. UnigeneFinder also runs efficiently both on high-performance computing (HPC) systems and personal computers, further reducing barriers to use.

摘要

对于大多数物种而言,转录组数据比基因组数据更容易获取。在没有参考基因组的情况下,由于从头转录组组装中的高度冗余性,基因识别既繁琐又不准确。为了在没有参考基因组的情况下简化并提高从头转录组组装的准确性,我们开发了UnigeneFinder。通过结合多种聚类方法,UnigeneFinder大幅减少了原始转录组组装中典型的冗余性。该流程为转录本数量膨胀的问题提供了有效的解决方案,更接近实际潜在基因组的表现形式。与现有工具相比,UnigeneFinder在具有不同基因组复杂性的植物物种上表现相当或更优。UnigeneFinder是唯一可用的转录组冗余解决方案,它能完全自动化生成初级转录本、编码区和蛋白质序列,类似于为高质量参考基因组提供的序列。这些特性,再加上该流程的跨平台实现、对自动化的关注以及易于访问且用户友好的界面,使得UnigeneFinder成为在缺乏参考基因组的非模式生物中进行许多基于序列的下游分析的有用工具,包括差异基因表达分析、准确的直系同源物鉴定、功能富集和进化分析。UnigeneFinder在高性能计算(HPC)系统和个人计算机上也能高效运行,进一步降低了使用门槛。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/393e/12012387/cb5e60b29b6a/PLD3-9-e70056-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/393e/12012387/99a43bb6a286/PLD3-9-e70056-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/393e/12012387/6605f0dff6a4/PLD3-9-e70056-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/393e/12012387/cb5e60b29b6a/PLD3-9-e70056-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/393e/12012387/99a43bb6a286/PLD3-9-e70056-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/393e/12012387/6605f0dff6a4/PLD3-9-e70056-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/393e/12012387/cb5e60b29b6a/PLD3-9-e70056-g001.jpg

相似文献

1
UnigeneFinder: An Automated Pipeline for Gene Calling From Transcriptome Assemblies Without a Reference Genome.UnigeneFinder:一种用于在没有参考基因组的情况下从转录组组装中进行基因识别的自动化流程。
Plant Direct. 2025 Apr 22;9(4):e70056. doi: 10.1002/pld3.70056. eCollection 2025 Apr.
2
HPC-T-Assembly: a pipeline for de novo transcriptome assembly of large multi-specie datasets.HPC-T-Assembly:一种用于大型多物种数据集从头转录组组装的流程。
BMC Bioinformatics. 2025 Apr 28;26(1):113. doi: 10.1186/s12859-025-06121-4.
3
RNA-Seq in Nonmodel Organisms.非模式生物的 RNA-Seq。
Methods Mol Biol. 2021;2243:143-167. doi: 10.1007/978-1-0716-1103-6_8.
4
TransPi-a comprehensive TRanscriptome ANalysiS PIpeline for de novo transcriptome assembly.TransPi-一个用于从头组装转录组的全面转录组分析管道。
Mol Ecol Resour. 2022 Jul;22(5):2070-2086. doi: 10.1111/1755-0998.13593. Epub 2022 Feb 18.
5
Optimization of de novo transcriptome assembly from high-throughput short read sequencing data improves functional annotation for non-model organisms.优化从头转录组组装从高通量短读测序数据提高非模式生物的功能注释。
BMC Bioinformatics. 2012 Jul 18;13:170. doi: 10.1186/1471-2105-13-170.
6
Combining independent de novo assemblies optimizes the coding transcriptome for nonconventional model eukaryotic organisms.结合独立的从头组装可优化非传统模式真核生物的编码转录组。
BMC Bioinformatics. 2016 Dec 9;17(1):525. doi: 10.1186/s12859-016-1406-x.
7
A consensus-based ensemble approach to improve transcriptome assembly.基于共识的集成方法提高转录组组装。
BMC Bioinformatics. 2021 Oct 21;22(1):513. doi: 10.1186/s12859-021-04434-8.
8
Comparative performance of transcriptome assembly methods for non-model organisms.非模式生物转录组组装方法的比较性能
BMC Genomics. 2016 Jul 27;17:523. doi: 10.1186/s12864-016-2923-8.
9
Challenges and advances for transcriptome assembly in non-model species.非模式物种转录组组装面临的挑战与进展
PLoS One. 2017 Sep 20;12(9):e0185020. doi: 10.1371/journal.pone.0185020. eCollection 2017.
10
Re-assembly, quality evaluation, and annotation of 678 microbial eukaryotic reference transcriptomes.重新组装、质量评估和 678 个微生物真核参考转录组的注释。
Gigascience. 2019 Apr 1;8(4). doi: 10.1093/gigascience/giy158.

本文引用的文献

1
A simple guide to de novo transcriptome assembly and annotation.从头转录组组装与注释简明指南。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab563.
2
Representation and participation across 20 years of plant genome sequencing.二十年来植物基因组测序的表现与参与。
Nat Plants. 2021 Dec;7(12):1571-1578. doi: 10.1038/s41477-021-01031-8. Epub 2021 Nov 29.
3
Twelve years of SAMtools and BCFtools.SAMtools 和 BCFtools 十二年。
Gigascience. 2021 Feb 16;10(2). doi: 10.1093/gigascience/giab008.
4
Compacta: a fast contig clustering tool for de novo assembled transcriptomes.Compacta:一种用于从头组装转录组的快速重叠群聚类工具。
BMC Genomics. 2020 Feb 11;21(1):148. doi: 10.1186/s12864-020-6528-x.
5
Optimized sequencing depth and de novo assembler for deeply reconstructing the transcriptome of the tea plant, an economically important plant species.优化测序深度和从头组装算法,以深度重建经济重要植物物种茶树的转录组。
BMC Bioinformatics. 2019 Nov 6;20(1):553. doi: 10.1186/s12859-019-3166-x.
6
A chromosome-anchored eggplant genome sequence reveals key events in Solanaceae evolution.一个染色体锚定的茄子基因组序列揭示了茄科进化中的关键事件。
Sci Rep. 2019 Aug 13;9(1):11769. doi: 10.1038/s41598-019-47985-w.
7
Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype.基于图的基因组比对和基因分型与 HISAT2 和 HISAT-genotype。
Nat Biotechnol. 2019 Aug;37(8):907-915. doi: 10.1038/s41587-019-0201-4. Epub 2019 Aug 2.
8
Inferring Orthology and Paralogy.推断直系同源和旁系同源关系。
Methods Mol Biol. 2019;1910:149-175. doi: 10.1007/978-1-4939-9074-0_5.
9
Effect of de novo transcriptome assembly on transcript quantification.从头转录组组装对转录本定量的影响。
Sci Rep. 2019 Jun 5;9(1):8304. doi: 10.1038/s41598-019-44499-3.
10
De novo transcriptome assembly: A comprehensive cross-species comparison of short-read RNA-Seq assemblers.从头转录组组装:短读 RNA-Seq 组装器的全面跨物种比较。
Gigascience. 2019 May 1;8(5). doi: 10.1093/gigascience/giz039.