• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MetaGene:从环境基因组鸟枪法测序中寻找原核生物基因

MetaGene: prokaryotic gene finding from environmental genome shotgun sequences.

作者信息

Noguchi Hideki, Park Jungho, Takagi Toshihisa

机构信息

Department of Computational Biology, Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Chiba 277-8562, Japan.

出版信息

Nucleic Acids Res. 2006;34(19):5623-30. doi: 10.1093/nar/gkl723. Epub 2006 Oct 5.

DOI:10.1093/nar/gkl723
PMID:17028096
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1636498/
Abstract

Exhaustive gene identification is a fundamental goal in all metagenomics projects. However, most metagenomic sequences are unassembled anonymous fragments, and conventional gene-finding methods cannot be applied. We have developed a prokaryotic gene-finding program, MetaGene, which utilizes di-codon frequencies estimated by the GC content of a given sequence with other various measures. MetaGene can predict a whole range of prokaryotic genes based on the anonymous genomic sequences of a few hundred bases, with a sensitivity of 95% and a specificity of 90% for artificial shotgun sequences (700 bp fragments from 12 species). MetaGene has two sets of codon frequency interpolations, one for bacteria and one for archaea, and automatically selects the proper set for a given sequence using the domain classification method we propose. The domain classification works properly, correctly assigning domain information to more than 90% of the artificial shotgun sequences. Applied to the Sargasso Sea dataset, MetaGene predicted almost all of the annotated genes and a notable number of novel genes. MetaGene can be applied to wide variety of metagenomic projects and expands the utility of metagenomics.

摘要

详尽的基因识别是所有宏基因组学项目的基本目标。然而,大多数宏基因组序列是未组装的匿名片段,传统的基因发现方法无法应用。我们开发了一个原核生物基因发现程序MetaGene,它利用给定序列的GC含量估计的双密码子频率以及其他各种指标。MetaGene可以基于几百个碱基的匿名基因组序列预测整个原核生物基因范围,对于人工鸟枪法序列(来自12个物种的700bp片段),其灵敏度为95%,特异性为90%。MetaGene有两组密码子频率插值,一组用于细菌,一组用于古细菌,并使用我们提出的域分类方法为给定序列自动选择合适的一组。域分类工作正常,能将域信息正确分配给超过90%的人工鸟枪法序列。应用于马尾藻海数据集时,MetaGene预测了几乎所有注释基因以及大量新基因。MetaGene可应用于各种各样的宏基因组学项目,扩展了宏基因组学的效用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11af/1636498/ea3be807a5cf/gkl723f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11af/1636498/53a1d09fce1f/gkl723f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11af/1636498/64fe3d0e2a96/gkl723f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11af/1636498/635382799741/gkl723f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11af/1636498/c192aebedeef/gkl723f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11af/1636498/ea3be807a5cf/gkl723f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11af/1636498/53a1d09fce1f/gkl723f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11af/1636498/64fe3d0e2a96/gkl723f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11af/1636498/635382799741/gkl723f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11af/1636498/c192aebedeef/gkl723f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11af/1636498/ea3be807a5cf/gkl723f5.jpg

相似文献

1
MetaGene: prokaryotic gene finding from environmental genome shotgun sequences.MetaGene:从环境基因组鸟枪法测序中寻找原核生物基因
Nucleic Acids Res. 2006;34(19):5623-30. doi: 10.1093/nar/gkl723. Epub 2006 Oct 5.
2
GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions.GeneMarkS:一种用于预测微生物基因组中基因起始位点的自训练方法。对在调控区域中寻找序列基序的启示。
Nucleic Acids Res. 2001 Jun 15;29(12):2607-18. doi: 10.1093/nar/29.12.2607.
3
Re-annotation of genome microbial coding-sequences: finding new genes and inaccurately annotated genes.基因组微生物编码序列的重新注释:发现新基因和注释不准确的基因。
BMC Bioinformatics. 2002;3:5. doi: 10.1186/1471-2105-3-5. Epub 2002 Feb 5.
4
Comparative Genomics for Prokaryotes.原核生物的比较基因组学
Methods Mol Biol. 2018;1704:55-78. doi: 10.1007/978-1-4939-7463-4_3.
5
Environmental genome shotgun sequencing of the Sargasso Sea.马尾藻海的环境基因组鸟枪法测序
Science. 2004 Apr 2;304(5667):66-74. doi: 10.1126/science.1093857. Epub 2004 Mar 4.
6
Large-scale, multi-genome analysis of alternate open reading frames in bacteria and archaea.细菌和古细菌中交替开放阅读框的大规模多基因组分析。
OMICS. 2005 Spring;9(1):91-105. doi: 10.1089/omi.2005.9.91.
7
Ab initio gene identification in metagenomic sequences.从头鉴定宏基因组序列中的基因。
Nucleic Acids Res. 2010 Jul;38(12):e132. doi: 10.1093/nar/gkq275. Epub 2010 Apr 19.
8
Dictionary-driven prokaryotic gene finding.基于字典驱动的原核生物基因查找。
Nucleic Acids Res. 2002 Jun 15;30(12):2710-25. doi: 10.1093/nar/gkf338.
9
Gene identification in prokaryotic genomes, phages, metagenomes, and EST sequences with GeneMarkS suite.使用GeneMarkS套件在原核生物基因组、噬菌体、宏基因组和EST序列中进行基因识别。
Curr Protoc Bioinformatics. 2011 Sep;Chapter 4:4.5.1-4.5.17. doi: 10.1002/0471250953.bi0405s35.
10
GISMO--gene identification using a support vector machine for ORF classification.GISMO——使用支持向量机进行开放阅读框分类的基因识别方法
Nucleic Acids Res. 2007;35(2):540-9. doi: 10.1093/nar/gkl1083. Epub 2006 Dec 14.

引用本文的文献

1
Soyasaponin and vertical microbial transmission: Maternal effect on the intestinal development and health of early chicks.大豆皂苷与垂直微生物传播:母体对雏鸡早期肠道发育和健康的影响。
Imeta. 2025 May 20;4(4):e70044. doi: 10.1002/imt2.70044. eCollection 2025 Aug.
2
Macrogenomics-based analysis of rumen microbial composition and their metabolic pathways in yaks under different dietary concentrate-to-forage ratios.基于宏基因组学分析不同精粗比条件下牦牛瘤胃微生物组成及其代谢途径
Front Microbiol. 2025 Aug 6;16:1587474. doi: 10.3389/fmicb.2025.1587474. eCollection 2025.
3
Impact of Habitat Transformation on Soil Microbial Diversity and Functionality in Karst Mountainous Parks: A Comparative Study of Remnant Forests and Artificial Green Spaces.

本文引用的文献

1
Metagenomics to paleogenomics: large-scale sequencing of mammoth DNA.从宏基因组学到古基因组学:猛犸象DNA的大规模测序
Science. 2006 Jan 20;311(5759):392-4. doi: 10.1126/science.1123360. Epub 2005 Dec 20.
2
Large-scale prokaryotic gene prediction and comparison to genome annotation.大规模原核生物基因预测及与基因组注释的比较。
Bioinformatics. 2005 Dec 15;21(24):4322-9. doi: 10.1093/bioinformatics/bti701. Epub 2005 Oct 25.
3
Bioinformatics for whole-genome shotgun sequencing of microbial communities.用于微生物群落全基因组鸟枪法测序的生物信息学
喀斯特山地公园栖息地转变对土壤微生物多样性和功能的影响:残存森林与人工绿地的比较研究
Ecol Evol. 2025 Aug 14;15(8):e72021. doi: 10.1002/ece3.72021. eCollection 2025 Aug.
4
Comparative analysis of rhizosphere microbial communities in monoculture and mixed oak-pine forests: structural and functional insights.单一栽培和橡树-松树混交林根际微生物群落的比较分析:结构和功能见解
Front Microbiol. 2025 Jul 25;16:1646535. doi: 10.3389/fmicb.2025.1646535. eCollection 2025.
5
Rhizosphere Microbiome-Root Exudate Synergy in Pteris vittata: Coordinated Arsenic Speciation and Multielement Metabolic Coupling Drive Hyperaccumulation Efficiency.蜈蚣草根际微生物组与根系分泌物的协同作用:砷形态的协调与多元素代谢耦合驱动超积累效率
Microb Ecol. 2025 Jul 23;88(1):79. doi: 10.1007/s00248-025-02584-3.
6
Direct-fed microbials optimize ruminal fermentation, microbial ecosystem and milk quality to enhance the lactation performance of Sanhe dairy cows.直接投喂微生物可优化瘤胃发酵、微生物生态系统和牛奶品质,从而提高三河奶牛的泌乳性能。
Anim Microbiome. 2025 Jul 19;7(1):75. doi: 10.1186/s42523-025-00437-5.
7
Thermophilic microbial agents promote the fermentation progression of spent mushroom compost and pig manure.嗜热微生物制剂促进了蘑菇废料和猪粪的发酵进程。
Front Microbiol. 2025 Jun 18;16:1575397. doi: 10.3389/fmicb.2025.1575397. eCollection 2025.
8
Nitrogen-shaped microbiotas with nutrient competition accelerate early-stage residue decomposition in agricultural soils.具有养分竞争的氮形态微生物群加速了农业土壤中早期残留物的分解。
Nat Commun. 2025 Jul 1;16(1):5793. doi: 10.1038/s41467-025-60948-2.
9
Integrated multi-omics reveals the impact of ruminal keystone bacteria and microbial metabolites on average daily gain in Xuzhou cattle.整合多组学揭示瘤胃关键细菌和微生物代谢产物对徐州牛平均日增重的影响。
Microbiol Spectr. 2025 Aug 5;13(8):e0076925. doi: 10.1128/spectrum.00769-25. Epub 2025 Jun 30.
10
Effect of different bulking agents on the quality, microbial community structure and metabolic functions during human feces composting in foam composting device.不同膨松剂对泡沫堆肥装置中人类粪便堆肥过程中质量、微生物群落结构和代谢功能的影响
Front Microbiol. 2025 Jun 10;16:1556537. doi: 10.3389/fmicb.2025.1556537. eCollection 2025.
PLoS Comput Biol. 2005 Jul;1(2):106-12. doi: 10.1371/journal.pcbi.0010024.
4
Genomic sequencing of Pleistocene cave bears.更新世洞熊的基因组测序。
Science. 2005 Jul 22;309(5734):597-9. doi: 10.1126/science.1113485. Epub 2005 Jun 2.
5
Comparative metagenomics of microbial communities.微生物群落的比较宏基因组学
Science. 2005 Apr 22;308(5721):554-7. doi: 10.1126/science.1107851.
6
Reverse methanogenesis: testing the hypothesis with environmental genomics.反向产甲烷作用:利用环境基因组学验证该假说
Science. 2004 Sep 3;305(5689):1457-62. doi: 10.1126/science.1100025.
7
Environmental genome shotgun sequencing of the Sargasso Sea.马尾藻海的环境基因组鸟枪法测序
Science. 2004 Apr 2;304(5667):66-74. doi: 10.1126/science.1093857. Epub 2004 Mar 4.
8
Community structure and metabolism through reconstruction of microbial genomes from the environment.通过从环境中重建微生物基因组来研究群落结构与代谢。
Nature. 2004 Mar 4;428(6978):37-43. doi: 10.1038/nature02340. Epub 2004 Feb 1.
9
The uncultured microbial majority.未培养的微生物主体
Annu Rev Microbiol. 2003;57:369-94. doi: 10.1146/annurev.micro.57.030502.090759.
10
Detecting pathogenicity islands and anomalous gene clusters by iterative discriminant analysis.通过迭代判别分析检测致病岛和异常基因簇。
FEMS Microbiol Lett. 2003 Apr 25;221(2):269-75. doi: 10.1016/S0378-1097(03)00204-0.