• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

宏基因组数据基因预测程序的基准测试。

Benchmarking of gene prediction programs for metagenomic data.

作者信息

Yok Non, Rosen Gail

机构信息

Drexel University, Electrical and Computer Engineering Department, 3141 Chestnut Street, PA 19104, USA.

出版信息

Annu Int Conf IEEE Eng Med Biol Soc. 2010;2010:6190-3. doi: 10.1109/IEMBS.2010.5627744.

DOI:10.1109/IEMBS.2010.5627744
PMID:21097156
Abstract

This manuscript presents the most rigorous benchmarking of gene annotation algorithms for metagenomic datasets to date. We compare three different programs: GeneMark, MetaGeneAnnotator (MGA) and Orphelia. The comparisons are based on their performances over simulated fragments from one hundred species of diverse lineages. We defined four different types of fragments; two types come from the inter- and intra-coding regions and the other types are from the gene edges. Hoff et al. used only 12 species in their comparison; therefore, their sample is too small to represent an environmental sample. Also, no predecessors has separately examined fragments that contain gene edges as opposed to intra-coding regions. General observations in our results are that performances of all these programs improve as we increase the length of the fragment. On the other hand, intra-coding fragments of our data show low annotation error in all of the programs if compared to the gene edge fragments. Overall, we found an upper-bound performance by combining all the methods.

摘要

本手稿展示了迄今为止针对宏基因组数据集的基因注释算法最严格的基准测试。我们比较了三种不同的程序:GeneMark、MetaGeneAnnotator(MGA)和Orphelia。这些比较基于它们对来自一百个不同谱系物种的模拟片段的性能表现。我们定义了四种不同类型的片段;两种类型来自编码区之间和编码区内,其他类型来自基因边缘。霍夫等人在他们的比较中仅使用了12个物种;因此,他们的样本太小,无法代表环境样本。此外,没有前人分别检查过包含基因边缘而非编码区内的片段。我们结果中的一般观察是,随着片段长度的增加,所有这些程序的性能都会提高。另一方面,与基因边缘片段相比,我们数据中的编码区内片段在所有程序中显示出较低的注释错误。总体而言,通过结合所有方法,我们发现了性能上限。

相似文献

1
Benchmarking of gene prediction programs for metagenomic data.宏基因组数据基因预测程序的基准测试。
Annu Int Conf IEEE Eng Med Biol Soc. 2010;2010:6190-3. doi: 10.1109/IEMBS.2010.5627744.
2
Combining gene prediction methods to improve metagenomic gene annotation.结合基因预测方法以提高宏基因组基因注释。
BMC Bioinformatics. 2011 Jan 13;12:20. doi: 10.1186/1471-2105-12-20.
3
Short-read reading-frame predictors are not created equal: sequence error causes loss of signal.短读阅读框预测器并不相同:序列错误导致信号丢失。
BMC Bioinformatics. 2012 Jul 28;13:183. doi: 10.1186/1471-2105-13-183.
4
The effect of sequencing errors on metagenomic gene prediction.测序错误对宏基因组基因预测的影响。
BMC Genomics. 2009 Nov 12;10:520. doi: 10.1186/1471-2164-10-520.
5
From Gene Annotation to Function Prediction for Metagenomics.从宏基因组学的基因注释到功能预测
Methods Mol Biol. 2017;1611:27-34. doi: 10.1007/978-1-4939-7015-5_3.
6
A multi-source domain annotation pipeline for quantitative metagenomic and metatranscriptomic functional profiling.用于定量宏基因组和宏转录组功能分析的多源域注释管道。
Microbiome. 2018 Aug 28;6(1):149. doi: 10.1186/s40168-018-0532-2.
7
geneRFinder: gene finding in distinct metagenomic data complexities.基因 RFinder:在不同元基因组数据复杂性中进行基因预测。
BMC Bioinformatics. 2021 Feb 25;22(1):87. doi: 10.1186/s12859-021-03997-w.
8
MG-RAST, a Metagenomics Service for Analysis of Microbial Community Structure and Function.MG-RAST,一种用于分析微生物群落结构和功能的宏基因组学服务。
Methods Mol Biol. 2016;1399:207-33. doi: 10.1007/978-1-4939-3369-3_13.
9
Analysis of sequencing strategies and tools for taxonomic annotation: Defining standards for progressive metagenomics.分析分类注释的测序策略和工具:为逐步宏基因组学定义标准。
Sci Rep. 2018 Aug 13;8(1):12034. doi: 10.1038/s41598-018-30515-5.
10
Fast and accurate average genome size and 16S rRNA gene average copy number computation in metagenomic data.快速准确地计算宏基因组数据中的平均基因组大小和 16S rRNA 基因平均拷贝数。
BMC Bioinformatics. 2019 Sep 5;20(1):453. doi: 10.1186/s12859-019-3031-y.

引用本文的文献

1
EBI Metagenomics in 2017: enriching the analysis of microbial communities, from sequence reads to assemblies.2017 年 EBI 宏基因组学:从序列读取到组装,丰富微生物群落的分析。
Nucleic Acids Res. 2018 Jan 4;46(D1):D726-D735. doi: 10.1093/nar/gkx967.
2
Integrative workflows for metagenomic analysis.宏基因组分析的综合工作流程。
Front Cell Dev Biol. 2014 Nov 19;2:70. doi: 10.3389/fcell.2014.00070. eCollection 2014.
3
Reduce manual curation by combining gene predictions from multiple annotation engines, a case study of start codon prediction.
通过结合来自多个注释引擎的基因预测来减少人工注释,以起始密码子预测为例。
PLoS One. 2013 May 10;8(5):e63523. doi: 10.1371/journal.pone.0063523. Print 2013.
4
Combining gene prediction methods to improve metagenomic gene annotation.结合基因预测方法以提高宏基因组基因注释。
BMC Bioinformatics. 2011 Jan 13;12:20. doi: 10.1186/1471-2105-12-20.