• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

解决真核藻类中普遍存在的结构注释稀缺问题。

Addressing the pervasive scarcity of structural annotation in eukaryotic algae.

机构信息

Genomics and Bioanalytics, Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, USA.

出版信息

Sci Rep. 2023 Jan 30;13(1):1687. doi: 10.1038/s41598-023-27881-0.

DOI:10.1038/s41598-023-27881-0
PMID:36717613
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9886943/
Abstract

Despite a continuous increase in algal genome sequencing, structural annotations of most algal genome assemblies remain unavailable. This pervasive scarcity of genome annotation has restricted rigorous investigation of these genomic resources and may have precipitated misleading biological interpretations. However, the annotation process for eukaryotic algal species is often challenging as genomic resources and transcriptomic evidence are not always available. To address this challenge, we benchmark the cutting-edge gene prediction methods that can be generalized for a broad range of non-model eukaryotes. Using the most accurate methods selected based on high-quality algal genomes, we predict structural annotations for 135 unannotated algal genomes. Using previously available genomic data pooled together with new data obtained in this study, we identified the core orthologous genes and the multi-gene phylogeny of eukaryotic algae, including of previously unexplored algal species. This study not only provides a benchmark for the use of structural annotation methods on a variety of non-model eukaryotes, but also compensates for missing data in the current spectrum of algal genomic resources. These results bring us one step closer to the full potential of eukaryotic algal genomics.

摘要

尽管藻类基因组测序不断增加,但大多数藻类基因组组装的结构注释仍然不可用。这种普遍缺乏基因组注释的情况限制了对这些基因组资源的严格研究,并可能导致了误导性的生物学解释。然而,真核藻类物种的注释过程通常具有挑战性,因为基因组资源和转录组证据并不总是可用。为了解决这一挑战,我们对可广泛应用于多种非模式真核生物的最先进的基因预测方法进行了基准测试。我们使用基于高质量藻类基因组选择的最准确方法,为 135 个未注释的藻类基因组预测结构注释。利用以前可用的基因组数据和本研究中获得的新数据,我们确定了真核藻类的核心直系同源基因和多基因系统发育,包括以前未探索的藻类物种。这项研究不仅为在各种非模式真核生物上使用结构注释方法提供了基准,还弥补了当前藻类基因组资源谱中缺失的数据。这些结果使我们更接近真核藻类基因组学的全部潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d5/9886943/55f4c716cf7f/41598_2023_27881_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d5/9886943/532e6b1d8bc1/41598_2023_27881_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d5/9886943/f467de6d5182/41598_2023_27881_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d5/9886943/ce1c357f37d3/41598_2023_27881_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d5/9886943/ae9628945db4/41598_2023_27881_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d5/9886943/55f4c716cf7f/41598_2023_27881_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d5/9886943/532e6b1d8bc1/41598_2023_27881_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d5/9886943/f467de6d5182/41598_2023_27881_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d5/9886943/ce1c357f37d3/41598_2023_27881_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d5/9886943/ae9628945db4/41598_2023_27881_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9d5/9886943/55f4c716cf7f/41598_2023_27881_Fig5_HTML.jpg

相似文献

1
Addressing the pervasive scarcity of structural annotation in eukaryotic algae.解决真核藻类中普遍存在的结构注释稀缺问题。
Sci Rep. 2023 Jan 30;13(1):1687. doi: 10.1038/s41598-023-27881-0.
2
Algal Functional Annotation Tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data.藻种功能注释工具:一个基于网络的分析套件,可使用集成注释和表达数据来对大型基因列表进行功能解释。
BMC Bioinformatics. 2011 Jul 12;12:282. doi: 10.1186/1471-2105-12-282.
3
4
GASS: genome structural annotation for Eukaryotes based on species similarity.GASS:基于物种相似性的真核生物基因组结构注释
BMC Genomics. 2015 Mar 4;16(1):150. doi: 10.1186/s12864-015-1353-3.
5
PhycoCosm, a comparative algal genomics resource.藻宇宙,一个藻类比较基因组学资源。
Nucleic Acids Res. 2021 Jan 8;49(D1):D1004-D1011. doi: 10.1093/nar/gkaa898.
6
Peptimapper: proteogenomics workflow for the expert annotation of eukaryotic genomes.Peptimapper:用于真核生物基因组专家注释的蛋白质基因组学工作流程。
BMC Genomics. 2019 Jan 17;20(1):56. doi: 10.1186/s12864-019-5431-9.
7
A benchmark study of ab initio gene prediction methods in diverse eukaryotic organisms.不同真核生物中从头基因预测方法的基准研究。
BMC Genomics. 2020 Apr 9;21(1):293. doi: 10.1186/s12864-020-6707-9.
8
In the grip of algal genomics.在藻类基因组学的掌控之中。
Adv Exp Med Biol. 2007;616:54-76. doi: 10.1007/978-0-387-75532-8_6.
9
The Rhodoexplorer Platform for Red Algal Genomics and Whole-Genome Assemblies for Several Gracilaria Species.Rhodoexplorer 平台用于红藻基因组学和几种江蓠属物种的全基因组组装。
Genome Biol Evol. 2023 Jul 3;15(7). doi: 10.1093/gbe/evad124.
10
Saccharomyces cerevisiae: gene annotation and genome variability, state of the art through comparative genomics.酿酒酵母:基因注释与基因组变异性,通过比较基因组学呈现的最新技术水平
Methods Mol Biol. 2011;759:31-40. doi: 10.1007/978-1-61779-173-4_2.

引用本文的文献

1
Global characterization of biosynthetic gene clusters in non-model eukaryotes using domain architectures.利用结构域架构对非模式真核生物中的生物合成基因簇进行全局表征。
Sci Rep. 2024 Jan 17;14(1):1534. doi: 10.1038/s41598-023-50095-3.
2
Comparative Genomic Analysis of a MB200 Isolated from Biogas Digesters Provided New Insights into the Carbon Metabolism of Methylotrophic Bacteria.从沼气消化器中分离出的一株 MB200 的比较基因组分析为研究甲基营养菌的碳代谢提供了新的见解。
Int J Mol Sci. 2023 Apr 19;24(8):7521. doi: 10.3390/ijms24087521.

本文引用的文献

1
FINDER: an automated software package to annotate eukaryotic genes from RNA-Seq data and associated protein sequences.FINDER:一个自动化软件包,用于从 RNA-Seq 数据和相关蛋白质序列中注释真核基因。
BMC Bioinformatics. 2021 Apr 20;22(1):205. doi: 10.1186/s12859-021-04120-9.
2
BRAKER2: automatic eukaryotic genome annotation with GeneMark-EP+ and AUGUSTUS supported by a protein database.BRAKER2:借助蛋白质数据库,由GeneMark-EP+和AUGUSTUS支持的真核生物基因组自动注释工具。
NAR Genom Bioinform. 2021 Jan 6;3(1):lqaa108. doi: 10.1093/nargab/lqaa108. eCollection 2021 Mar.
3
GenBank.
GenBank
Nucleic Acids Res. 2021 Jan 8;49(D1):D92-D96. doi: 10.1093/nar/gkaa1023.
4
GFF Utilities: GffRead and GffCompare.
F1000Res. 2020 Apr 28;9. doi: 10.12688/f1000research.23297.2. eCollection 2020.
5
GeneMark-EP+: eukaryotic gene prediction with self-training in the space of genes and proteins.GeneMark-EP+:在基因和蛋白质空间中进行自我训练的真核基因预测
NAR Genom Bioinform. 2020 Jun;2(2):lqaa026. doi: 10.1093/nargab/lqaa026. Epub 2020 May 13.
6
A benchmark study of ab initio gene prediction methods in diverse eukaryotic organisms.不同真核生物中从头基因预测方法的基准研究。
BMC Genomics. 2020 Apr 9;21(1):293. doi: 10.1186/s12864-020-6707-9.
7
IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era.IQ-TREE 2:基因组时代系统发育推断的新模型和有效方法。
Mol Biol Evol. 2020 May 1;37(5):1530-1534. doi: 10.1093/molbev/msaa015.
8
OrthoFinder: phylogenetic orthology inference for comparative genomics.OrthoFinder:用于比较基因组学的系统发育直系同源推断。
Genome Biol. 2019 Nov 14;20(1):238. doi: 10.1186/s13059-019-1832-y.
9
Evolview v3: a webserver for visualization, annotation, and management of phylogenetic trees.Evolview v3:一个用于可视化、注释和管理系统发育树的网络服务器。
Nucleic Acids Res. 2019 Jul 2;47(W1):W270-W275. doi: 10.1093/nar/gkz357.
10
New Phylogenomic Analysis of the Enigmatic Phylum Telonemia Further Resolves the Eukaryote Tree of Life.新的神秘门纲目 Telonemia 的系统基因组分析进一步解析了真核生物的生命之树。
Mol Biol Evol. 2019 Apr 1;36(4):757-765. doi: 10.1093/molbev/msz012.