• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

奥菲莉亚:宏基因组测序读段中的基因预测

Orphelia: predicting genes in metagenomic sequencing reads.

作者信息

Hoff Katharina J, Lingner Thomas, Meinicke Peter, Tech Maike

机构信息

Abteilung Bioinformatik, Institut für Mikrobiologie und Genetik, Georg-August-Universität Göttingen, Goldschmidtstr. 1, 37077 Göttingen, Germany.

出版信息

Nucleic Acids Res. 2009 Jul;37(Web Server issue):W101-5. doi: 10.1093/nar/gkp327. Epub 2009 May 8.

DOI:10.1093/nar/gkp327
PMID:19429689
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2703946/
Abstract

Metagenomic sequencing projects yield numerous sequencing reads of a diverse range of uncultivated and mostly yet unknown microorganisms. In many cases, these sequencing reads cannot be assembled into longer contigs. Thus, gene prediction tools that were originally developed for whole-genome analysis are not suitable for processing metagenomes. Orphelia is a program for predicting genes in short DNA sequences that is available through a web server application (http://orphelia.gobics.de). Orphelia utilizes prediction models that were created with machine learning techniques on the basis of a wide range of annotated genomes. In contrast to other methods for metagenomic gene prediction, Orphelia has fragment length-specific prediction models for the two most popular sequencing techniques in metagenomics, chain termination sequencing and pyrosequencing. These models ensure highly specific gene predictions.

摘要

宏基因组测序项目产生了大量来自各种未培养且大多未知微生物的测序读数。在许多情况下,这些测序读数无法组装成更长的重叠群。因此,最初为全基因组分析开发的基因预测工具不适用于处理宏基因组。Orphelia是一个用于预测短DNA序列中基因的程序,可通过网络服务器应用程序(http://orphelia.gobics.de)获得。Orphelia利用基于广泛注释基因组通过机器学习技术创建的预测模型。与其他宏基因组基因预测方法不同,Orphelia针对宏基因组学中两种最流行的测序技术——链终止测序和焦磷酸测序,具有片段长度特异性预测模型。这些模型确保了高度特异性的基因预测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71f8/2703946/14727bf88a4a/gkp327f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71f8/2703946/755f23b0acd5/gkp327f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71f8/2703946/6bf2c59e37ab/gkp327f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71f8/2703946/14727bf88a4a/gkp327f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71f8/2703946/755f23b0acd5/gkp327f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71f8/2703946/6bf2c59e37ab/gkp327f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/71f8/2703946/14727bf88a4a/gkp327f3.jpg

相似文献

1
Orphelia: predicting genes in metagenomic sequencing reads.奥菲莉亚:宏基因组测序读段中的基因预测
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W101-5. doi: 10.1093/nar/gkp327. Epub 2009 May 8.
2
The effect of sequencing errors on metagenomic gene prediction.测序错误对宏基因组基因预测的影响。
BMC Genomics. 2009 Nov 12;10:520. doi: 10.1186/1471-2164-10-520.
3
Short-read reading-frame predictors are not created equal: sequence error causes loss of signal.短读阅读框预测器并不相同:序列错误导致信号丢失。
BMC Bioinformatics. 2012 Jul 28;13:183. doi: 10.1186/1471-2105-13-183.
4
An ORFome assembly approach to metagenomics sequences analysis.一种用于宏基因组学序列分析的开放阅读框(ORF)集组装方法。
Comput Syst Bioinformatics Conf. 2008;7:3-13.
5
MGC: a metagenomic gene caller.MGC:一种宏基因组基因调用器。
BMC Bioinformatics. 2013;14 Suppl 9(Suppl 9):S6. doi: 10.1186/1471-2105-14-S9-S6. Epub 2013 Jun 28.
6
Artificial and natural duplicates in pyrosequencing reads of metagenomic data.元基因组数据焦磷酸测序reads 中的人工和天然重复。
BMC Bioinformatics. 2010 Apr 13;11:187. doi: 10.1186/1471-2105-11-187.
7
FragGeneScan: predicting genes in short and error-prone reads.FragGeneScan:预测短读和易错读中的基因。
Nucleic Acids Res. 2010 Nov;38(20):e191. doi: 10.1093/nar/gkq747. Epub 2010 Aug 30.
8
Evaluating the Quantitative Capabilities of Metagenomic Analysis Software.评估宏基因组分析软件的定量能力。
Curr Microbiol. 2016 May;72(5):612-6. doi: 10.1007/s00284-016-0991-2. Epub 2016 Jan 30.
9
Selection of marker genes for genetic barcoding of microorganisms and binning of metagenomic reads by Barcoder software tools.微生物遗传条形码标记基因的选择和 Barcoder 软件工具对宏基因组读段的分类。
BMC Bioinformatics. 2018 Aug 30;19(1):309. doi: 10.1186/s12859-018-2320-1.
10
A better sequence-read simulator program for metagenomics.一个更好的宏基因组学序列读取模拟程序。
BMC Bioinformatics. 2014;15 Suppl 9(Suppl 9):S14. doi: 10.1186/1471-2105-15-S9-S14. Epub 2014 Sep 10.

引用本文的文献

1
A toolbox of machine learning software to support microbiome analysis.一个支持微生物组分析的机器学习软件工具箱。
Front Microbiol. 2023 Nov 22;14:1250806. doi: 10.3389/fmicb.2023.1250806. eCollection 2023.
2
Integrated gene prediction and peptide assembly of metagenomic sequencing data.宏基因组测序数据的整合基因预测与肽段组装
NAR Genom Bioinform. 2023 Mar 11;5(1):lqad023. doi: 10.1093/nargab/lqad023. eCollection 2023 Mar.
3
FragGeneScanRs: faster gene prediction for short reads.FragGeneScanRs:用于短读长的更快基因预测。

本文引用的文献

1
MetaGeneAnnotator: detecting species-specific patterns of ribosomal binding site for precise gene prediction in anonymous prokaryotic and phage genomes.MetaGeneAnnotator:检测核糖体结合位点的物种特异性模式,以在未知原核生物和噬菌体基因组中进行精确的基因预测。
DNA Res. 2008 Dec;15(6):387-96. doi: 10.1093/dnares/dsn027. Epub 2008 Oct 21.
2
MetaSim: a sequencing simulator for genomics and metagenomics.MetaSim:一款用于基因组学和宏基因组学的测序模拟器。
PLoS One. 2008 Oct 8;3(10):e3373. doi: 10.1371/journal.pone.0003373.
3
Millimeter-scale genetic gradients and community-level molecular convergence in a hypersaline microbial mat.
BMC Bioinformatics. 2022 May 28;23(1):198. doi: 10.1186/s12859-022-04736-5.
4
NGS read classification using AI.使用人工智能进行二代测序(NGS)读数分类
PLoS One. 2021 Dec 22;16(12):e0261548. doi: 10.1371/journal.pone.0261548. eCollection 2021.
5
geneRFinder: gene finding in distinct metagenomic data complexities.基因 RFinder:在不同元基因组数据复杂性中进行基因预测。
BMC Bioinformatics. 2021 Feb 25;22(1):87. doi: 10.1186/s12859-021-03997-w.
6
Updated review of research on the gut microbiota and their relation to depression in animals and human beings.更新的关于肠道微生物群及其与动物和人类抑郁症关系的研究综述。
Mol Psychiatry. 2020 Nov;25(11):2759-2772. doi: 10.1038/s41380-020-0729-1. Epub 2020 Apr 24.
7
PlasGUN: gene prediction in plasmid metagenomic short reads using deep learning.PlasGUN:使用深度学习进行质粒宏基因组短读测序中的基因预测。
Bioinformatics. 2020 May 1;36(10):3239-3241. doi: 10.1093/bioinformatics/btaa103.
8
Machine Learning Approaches for Epidemiological Investigations of Food-Borne Disease Outbreaks.用于食源性疾病暴发流行病学调查的机器学习方法
Front Microbiol. 2019 Aug 6;10:1722. doi: 10.3389/fmicb.2019.01722. eCollection 2019.
9
Comparison of Bacterial Populations in the Ceca of Swine at Two Different Stages and their Functional Annotations.比较两个不同阶段猪盲肠中的细菌种群及其功能注释。
Genes (Basel). 2019 May 20;10(5):382. doi: 10.3390/genes10050382.
10
CNN-MGP: Convolutional Neural Networks for Metagenomics Gene Prediction.CNN-MGP:用于宏基因组基因预测的卷积神经网络。
Interdiscip Sci. 2019 Dec;11(4):628-635. doi: 10.1007/s12539-018-0313-4. Epub 2018 Dec 27.
高盐度微生物席中的毫米级遗传梯度和群落水平的分子趋同
Mol Syst Biol. 2008;4:198. doi: 10.1038/msb.2008.35. Epub 2008 Jun 3.
4
Gene prediction in metagenomic fragments: a large scale machine learning approach.宏基因组片段中的基因预测:一种大规模机器学习方法。
BMC Bioinformatics. 2008 Apr 28;9:217. doi: 10.1186/1471-2105-9-217.
5
Gene identification and protein classification in microbial metagenomic sequence data via incremental clustering.通过增量聚类对微生物宏基因组序列数据进行基因识别和蛋白质分类。
BMC Bioinformatics. 2008 Apr 10;9:182. doi: 10.1186/1471-2105-9-182.
6
Use of simulated data sets to evaluate the fidelity of metagenomic processing methods.使用模拟数据集评估宏基因组学处理方法的保真度。
Nat Methods. 2007 Jun;4(6):495-500. doi: 10.1038/nmeth1043. Epub 2007 Apr 29.
7
MetaGene: prokaryotic gene finding from environmental genome shotgun sequences.MetaGene:从环境基因组鸟枪法测序中寻找原核生物基因
Nucleic Acids Res. 2006;34(19):5623-30. doi: 10.1093/nar/gkl723. Epub 2006 Oct 5.
8
Finding novel genes in bacterial communities isolated from the environment.在从环境中分离出的细菌群落中寻找新基因。
Bioinformatics. 2006 Jul 15;22(14):e281-9. doi: 10.1093/bioinformatics/btl247.
9
BLAT--the BLAST-like alignment tool.BLAT——类BLAST比对工具。
Genome Res. 2002 Apr;12(4):656-64. doi: 10.1101/gr.229202.
10
Heuristic approach to deriving models for gene finding.用于推导基因发现模型的启发式方法。
Nucleic Acids Res. 1999 Oct 1;27(19):3911-20. doi: 10.1093/nar/27.19.3911.