• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因标记-HM:改进人类微生物组DNA序列中的基因预测

GeneMark-HM: improving gene prediction in DNA sequences of human microbiome.

作者信息

Lomsadze Alexandre, Bonny Christophe, Strozzi Francesco, Borodovsky Mark

机构信息

Gene Probe, Inc., 1106 Wrights Mill Ct, Atlanta, GA 30324, USA.

Enterome, 94/96 avenue Ledru-Rollin, 75011 Paris, France.

出版信息

NAR Genom Bioinform. 2021 May 26;3(2):lqab047. doi: 10.1093/nargab/lqab047. eCollection 2021 Jun.

DOI:10.1093/nargab/lqab047
PMID:34056597
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8153819/
Abstract

Computational reconstruction of nearly complete genomes from metagenomic reads may identify thousands of new uncultured candidate bacterial species. We have shown that reconstructed prokaryotic genomes along with genomes of sequenced microbial isolates can be used to support more accurate gene prediction in novel sequences. We have proposed an approach that used three types of gene prediction algorithms and found for all contigs in a metagenome nearly optimal models of protein-coding regions either in libraries of pre-computed models or constructed de novo. The model selection process and gene annotation were done by the new GeneMark-HM pipeline. We have created a database of the species level pan-genomes for the human microbiome. To create a library of models representing each pan-genome we used a self-training algorithm GeneMarkS-2. Genes initially predicted in each contig served as queries for a fast similarity search through the pan-genome database. The best matches led to selection of the model for gene prediction. Contigs not assigned to pan-genomes were analyzed by crude, but still accurate models designed for sequences with particular GC compositions. Tests of GeneMark-HM on simulated metagenomes demonstrated improvement in gene annotation of human metagenomic sequences in comparison with the current state-of-the-art gene prediction tools.

摘要

从宏基因组读数中进行近乎完整基因组的计算重建可能会识别出数千种新的未培养候选细菌物种。我们已经表明,重建的原核基因组以及已测序微生物分离株的基因组可用于支持对新序列进行更准确的基因预测。我们提出了一种方法,该方法使用三种类型的基因预测算法,并在预计算模型库中或从头构建中为宏基因组中的所有重叠群找到蛋白质编码区域的近乎最优模型。模型选择过程和基因注释由新的GeneMark-HM管道完成。我们已经为人微生物组创建了一个物种水平泛基因组数据库。为了创建一个代表每个泛基因组的模型库,我们使用了一种自训练算法GeneMarkS-2。最初在每个重叠群中预测的基因用作通过泛基因组数据库进行快速相似性搜索的查询。最佳匹配导致选择用于基因预测的模型。未分配到泛基因组的重叠群通过为具有特定GC组成的序列设计的粗略但仍然准确的模型进行分析。与当前最先进的基因预测工具相比,GeneMark-HM对模拟宏基因组的测试表明人类宏基因组序列的基因注释有所改进。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8256/8153819/7278ad7eb05d/lqab047fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8256/8153819/212816a91640/lqab047fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8256/8153819/2a4730788dd6/lqab047fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8256/8153819/fbb3eb21610c/lqab047fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8256/8153819/dc6b7119f222/lqab047fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8256/8153819/1be15a0136e0/lqab047fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8256/8153819/b44471fdc1dd/lqab047fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8256/8153819/7278ad7eb05d/lqab047fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8256/8153819/212816a91640/lqab047fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8256/8153819/2a4730788dd6/lqab047fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8256/8153819/fbb3eb21610c/lqab047fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8256/8153819/dc6b7119f222/lqab047fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8256/8153819/1be15a0136e0/lqab047fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8256/8153819/b44471fdc1dd/lqab047fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8256/8153819/7278ad7eb05d/lqab047fig7.jpg

相似文献

1
GeneMark-HM: improving gene prediction in DNA sequences of human microbiome.基因标记-HM:改进人类微生物组DNA序列中的基因预测
NAR Genom Bioinform. 2021 May 26;3(2):lqab047. doi: 10.1093/nargab/lqab047. eCollection 2021 Jun.
2
Gene identification in prokaryotic genomes, phages, metagenomes, and EST sequences with GeneMarkS suite.使用GeneMarkS套件在原核生物基因组、噬菌体、宏基因组和EST序列中进行基因识别。
Curr Protoc Microbiol. 2014 Feb 6;32:Unit 1E.7.. doi: 10.1002/9780471729259.mc01e07s32.
3
Gene identification in prokaryotic genomes, phages, metagenomes, and EST sequences with GeneMarkS suite.使用GeneMarkS套件在原核生物基因组、噬菌体、宏基因组和EST序列中进行基因识别。
Curr Protoc Bioinformatics. 2011 Sep;Chapter 4:4.5.1-4.5.17. doi: 10.1002/0471250953.bi0405s35.
4
VirFinder: a novel k-mer based tool for identifying viral sequences from assembled metagenomic data.VirFinder:一种新型的基于 k-mer 的工具,用于从组装的宏基因组数据中识别病毒序列。
Microbiome. 2017 Jul 6;5(1):69. doi: 10.1186/s40168-017-0283-5.
5
Probabilistic methods of identifying genes in prokaryotic genomes: connections to the HMM theory.原核生物基因组中基因识别的概率方法:与隐马尔可夫模型理论的联系。
Brief Bioinform. 2004 Jun;5(2):118-30. doi: 10.1093/bib/5.2.118.
6
Optimizing and evaluating the reconstruction of Metagenome-assembled microbial genomes.优化和评估宏基因组组装微生物基因组的重建。
BMC Genomics. 2017 Nov 28;18(1):915. doi: 10.1186/s12864-017-4294-1.
7
8
GeneMark: web software for gene finding in prokaryotes, eukaryotes and viruses.基因标记:用于在原核生物、真核生物和病毒中寻找基因的网络软件。
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W451-4. doi: 10.1093/nar/gki487.
9
Fragmentation and Coverage Variation in Viral Metagenome Assemblies, and Their Effect in Diversity Calculations.病毒宏基因组组装中的碎片化和覆盖度变化,及其对多样性计算的影响。
Front Bioeng Biotechnol. 2015 Sep 17;3:141. doi: 10.3389/fbioe.2015.00141. eCollection 2015.
10
How to interpret an anonymous bacterial genome: machine learning approach to gene identification.如何解读匿名细菌基因组:用于基因识别的机器学习方法
Genome Res. 1998 Nov;8(11):1154-71. doi: 10.1101/gr.8.11.1154.

引用本文的文献

1
Regional antimicrobial resistance gene flow among the One Health sectors in China.中国“同一健康”各领域间的区域抗菌药物耐药基因流动
Microbiome. 2025 Jan 7;13(1):3. doi: 10.1186/s40168-024-01983-x.
2
The miniature genome of broad mite, Polyphagotarsonemus latus (Tarsonemidae: Acari).宽带盲蝽微小基因组(盲蝽科:蜱螨目)。
Sci Data. 2024 Jul 9;11(1):748. doi: 10.1038/s41597-024-03579-4.
3
Metagenomics: An Effective Approach for Exploring Microbial Diversity and Functions.宏基因组学:探索微生物多样性与功能的有效方法。

本文引用的文献

1
A new genomic blueprint of the human gut microbiota.人类肠道微生物组的新基因组蓝图。
Nature. 2019 Apr;568(7753):499-504. doi: 10.1038/s41586-019-0965-1. Epub 2019 Feb 11.
2
Extensive Unexplored Human Microbiome Diversity Revealed by Over 150,000 Genomes from Metagenomes Spanning Age, Geography, and Lifestyle.从来自不同年龄、地理和生活方式的宏基因组中超过 15 万条基因组揭示了广泛未被探索的人类微生物组多样性。
Cell. 2019 Jan 24;176(3):649-662.e20. doi: 10.1016/j.cell.2019.01.001. Epub 2019 Jan 17.
3
High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries.
Foods. 2023 May 25;12(11):2140. doi: 10.3390/foods12112140.
4
Genome Characterization of Bacteriophage KPP-1, a Novel Member in the Subfamily , and Use of Its Endolysin for the Lysis of Multidrug-Resistant In Vitro.噬菌体KPP-1的基因组特征分析,该噬菌体属于亚家族中的一个新成员,及其溶菌酶在体外裂解耐多药菌中的应用
Microorganisms. 2023 Jan 13;11(1):207. doi: 10.3390/microorganisms11010207.
5
Low-dose IL-2 shapes a tolerogenic gut microbiota that improves autoimmunity and gut inflammation.低剂量白介素 2 可塑造出具有免疫耐受性的肠道微生物群,从而改善自身免疫和肠道炎症。
JCI Insight. 2022 Sep 8;7(17):e159406. doi: 10.1172/jci.insight.159406.
高通量 ANI 分析 9 万余组原核基因组揭示了清晰的物种界限。
Nat Commun. 2018 Nov 30;9(1):5114. doi: 10.1038/s41467-018-07641-9.
4
Modeling leaderless transcription and atypical genes results in more accurate gene prediction in prokaryotes.无领导转录和非典型基因的建模可提高原核生物中基因预测的准确性。
Genome Res. 2018 Jul;28(7):1079-1089. doi: 10.1101/gr.230615.117. Epub 2018 May 17.
5
RefSeq: an update on prokaryotic genome annotation and curation.RefSeq:原核生物基因组注释和管理的最新进展。
Nucleic Acids Res. 2018 Jan 4;46(D1):D851-D860. doi: 10.1093/nar/gkx1068.
6
Comparing and Evaluating Metagenome Assembly Tools from a Microbiologist's Perspective - Not Only Size Matters!从微生物学家的视角比较和评估宏基因组组装工具——重要的不只是大小!
PLoS One. 2017 Jan 18;12(1):e0169662. doi: 10.1371/journal.pone.0169662. eCollection 2017.
7
Intestinal microbiome analyses identify melanoma patients at risk for checkpoint-blockade-induced colitis.肠道微生物组分析可识别出有发生检查点阻断诱导性结肠炎风险的黑色素瘤患者。
Nat Commun. 2016 Feb 2;7:10391. doi: 10.1038/ncomms10391.
8
Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation.美国国立生物技术信息中心的参考序列(RefSeq)数据库:当前状态、分类扩展及功能注释。
Nucleic Acids Res. 2016 Jan 4;44(D1):D733-45. doi: 10.1093/nar/gkv1189. Epub 2015 Nov 8.
9
Metagenome Sequencing of the Hadza Hunter-Gatherer Gut Microbiota.哈扎部落觅食者肠道微生物组的宏基因组测序。
Curr Biol. 2015 Jun 29;25(13):1682-93. doi: 10.1016/j.cub.2015.04.055. Epub 2015 May 14.
10
Subsistence strategies in traditional societies distinguish gut microbiomes.传统社会中的生存策略区分肠道微生物群。
Nat Commun. 2015 Mar 25;6:6505. doi: 10.1038/ncomms7505.