• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

OrthoReD:一种计算需求低的快速且准确的直系同源预测工具。

OrthoReD: a rapid and accurate orthology prediction tool with low computational requirement.

作者信息

Battenberg Kai, Lee Ernest K, Chiu Joanna C, Berry Alison M, Potter Daniel

机构信息

Department of Plant Sciences, University of California, Davis, CA, USA.

Department of Entomology and Nematology, University of California, Davis, CA, USA.

出版信息

BMC Bioinformatics. 2017 Jun 21;18(1):310. doi: 10.1186/s12859-017-1726-5.

DOI:10.1186/s12859-017-1726-5
PMID:28633662
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5479036/
Abstract

BACKGROUND

Identifying orthologous genes is an initial step required for phylogenetics, and it is also a common strategy employed in functional genetics to find candidates for functionally equivalent genes across multiple species. At the same time, in silico orthology prediction tools often require large computational resources only available on computing clusters. Here we present OrthoReD, an open-source orthology prediction tool with accuracy comparable to published tools that requires only a desktop computer. The low computational resource requirement of OrthoReD is achieved by repeating orthology searches on one gene of interest at a time, thereby generating a reduced dataset to limit the scope of orthology search for each gene of interest.

RESULTS

The output of OrthoReD was highly similar to the outputs of two other published orthology prediction tools, OrthologID and/or OrthoDB, for the three dataset tested, which represented three phyla with different ranges of species diversity and different number of genomes included. Median CPU time for ortholog prediction per gene by OrthoReD executed on a desktop computer was <15 min even for the largest dataset tested, which included all coding sequences of 100 bacterial species.

CONCLUSIONS

With high-throughput sequencing, unprecedented numbers of genes from non-model organisms are available with increasing need for clear information about their orthologies and/or functional equivalents in model organisms. OrthoReD is not only fast and accurate as an orthology prediction tool, but also gives researchers flexibility in the number of genes analyzed at a time, without requiring a high-performance computing cluster.

摘要

背景

识别直系同源基因是系统发育学所需的第一步,也是功能遗传学中常用的策略,用于在多个物种中寻找功能等效基因的候选者。同时,计算机模拟直系同源预测工具通常需要大型计算资源,而这些资源通常只有在计算集群上才能获得。在此,我们展示了OrthoReD,这是一种开源的直系同源预测工具,其准确性与已发表的工具相当,且只需要一台台式计算机。OrthoReD对计算资源的低要求是通过一次对一个感兴趣的基因重复进行直系同源搜索来实现的,从而生成一个简化的数据集,以限制对每个感兴趣基因的直系同源搜索范围。

结果

对于所测试的三个数据集,OrthoReD的输出与另外两个已发表的直系同源预测工具(OrthologID和/或OrthoDB)的输出高度相似,这三个数据集代表了具有不同物种多样性范围和不同基因组数量的三个门。即使对于测试的最大数据集,即在台式计算机上运行的OrthoReD预测每个基因直系同源的中位数CPU时间也<15分钟,该最大数据集包含100种细菌物种的所有编码序列。

结论

随着高通量测序技术的发展,来自非模式生物的基因数量空前增加,人们越来越需要了解它们在模式生物中的直系同源基因和/或功能等效基因的清晰信息。OrthoReD作为一种直系同源预测工具,不仅快速准确,而且使研究人员在一次分析的基因数量上具有灵活性,而无需高性能计算集群。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84c2/5479036/e95881aa5dc4/12859_2017_1726_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84c2/5479036/fa5bef9bfb7d/12859_2017_1726_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84c2/5479036/01c0eb8403ee/12859_2017_1726_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84c2/5479036/e95881aa5dc4/12859_2017_1726_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84c2/5479036/fa5bef9bfb7d/12859_2017_1726_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84c2/5479036/01c0eb8403ee/12859_2017_1726_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84c2/5479036/e95881aa5dc4/12859_2017_1726_Fig3_HTML.jpg

相似文献

1
OrthoReD: a rapid and accurate orthology prediction tool with low computational requirement.OrthoReD:一种计算需求低的快速且准确的直系同源预测工具。
BMC Bioinformatics. 2017 Jun 21;18(1):310. doi: 10.1186/s12859-017-1726-5.
2
Orthograph: a versatile tool for mapping coding nucleotide sequences to clusters of orthologous genes.Orthograph:一种将编码核苷酸序列映射到直系同源基因簇的多功能工具。
BMC Bioinformatics. 2017 Feb 16;18(1):111. doi: 10.1186/s12859-017-1529-8.
3
OrthologID: automation of genome-scale ortholog identification within a parsimony framework.直系同源物ID:简约框架内全基因组规模直系同源物鉴定的自动化
Bioinformatics. 2006 Mar 15;22(6):699-707. doi: 10.1093/bioinformatics/btk040. Epub 2006 Jan 12.
4
SwiftOrtho: A fast, memory-efficient, multiple genome orthology classifier.SwiftOrtho:一种快速、内存高效、多基因组同源分类器。
Gigascience. 2019 Oct 1;8(10). doi: 10.1093/gigascience/giz118.
5
Gene orthology assessment with OrthologID.使用OrthologID进行基因直系同源性评估。
Methods Mol Biol. 2009;537:23-38. doi: 10.1007/978-1-59745-251-9_2.
6
Protein-Coding Gene Families in Prokaryote Genome Comparisons.原核生物基因组比较中的蛋白质编码基因家族。
Methods Mol Biol. 2024;2802:33-55. doi: 10.1007/978-1-0716-3838-5_2.
7
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]
Yi Chuan Xue Bao. 2004 May;31(5):431-43.
8
DrosOMA: the Orthologous Matrix browser.果蝇直系同源矩阵浏览器(DrosOMA)
F1000Res. 2024 Jan 16;12:936. doi: 10.12688/f1000research.135250.2. eCollection 2023.
9
ExUTR: a novel pipeline for large-scale prediction of 3'-UTR sequences from NGS data.ExUTR:一种从 NGS 数据中大规模预测 3'-UTR 序列的新管道。
BMC Genomics. 2017 Nov 6;18(1):847. doi: 10.1186/s12864-017-4241-1.
10
PorthoMCL: Parallel orthology prediction using MCL for the realm of massive genome availability.PorthoMCL:在海量基因组可用的领域中使用马尔可夫聚类算法(MCL)进行平行直系同源预测
Big Data Anal. 2017;2. doi: 10.1186/s41044-016-0019-8. Epub 2017 Jan 10.

引用本文的文献

1
DAVID Ortholog: an integrative tool to enhance functional analysis through orthologs.DAVID 直系同源基因:通过直系同源基因增强功能分析的综合工具。
Bioinformatics. 2024 Oct 1;40(10). doi: 10.1093/bioinformatics/btae615.
2
SNARE Proteins LjVAMP72a and LjVAMP72b Are Required for Root Symbiosis and Root Hair Formation in .SNARE蛋白LjVAMP72a和LjVAMP72b是根共生和根毛形成所必需的。
Front Plant Sci. 2019 Jan 16;9:1992. doi: 10.3389/fpls.2018.01992. eCollection 2018.
3
Multilevel comparative bioinformatics to investigate evolutionary relationships and specificities in gene annotations: an example for tomato and grapevine.

本文引用的文献

1
Orthology Analysis and In Vivo Complementation Studies to Elucidate the Role of DIR1 during Systemic Acquired Resistance in Arabidopsis thaliana and Cucumis sativus.通过直系同源分析和体内互补研究阐明DIR1在拟南芥和黄瓜系统获得性抗性中的作用
Front Plant Sci. 2016 May 3;7:566. doi: 10.3389/fpls.2016.00566. eCollection 2016.
2
Heterogeneous molecular processes among the causes of how sequence similarity scores can fail to recapitulate phylogeny.序列相似性得分无法概括系统发育的原因中存在的异质分子过程。
Brief Bioinform. 2017 May 1;18(3):451-457. doi: 10.1093/bib/bbw034.
3
OrthoDB v8: update of the hierarchical catalog of orthologs and the underlying free software.
多层次比较生物信息学研究基因注释中的进化关系和特异性:以番茄和葡萄为例。
BMC Bioinformatics. 2018 Nov 30;19(Suppl 15):435. doi: 10.1186/s12859-018-2420-y.
4
Comparative Transcriptomic Analysis of Two Actinorhizal Plants and the Legume Supports the Homology of Root Nodule Symbioses and Is Congruent With a Two-Step Process of Evolution in the Nitrogen-Fixing Clade of Angiosperms.两种放线菌根植物与豆科植物的比较转录组分析支持根瘤共生的同源性,并且与被子植物固氮分支中的两步进化过程一致。
Front Plant Sci. 2018 Oct 8;9:1256. doi: 10.3389/fpls.2018.01256. eCollection 2018.
5
OrthoGNC: A Software for Accurate Identification of Orthologs Based on Gene Neighborhood Conservation.OrthoGNC:一款基于基因邻域保守性准确鉴定直系同源基因的软件。
Genomics Proteomics Bioinformatics. 2017 Dec;15(6):361-370. doi: 10.1016/j.gpb.2017.07.002. Epub 2017 Nov 11.
OrthoDB v8:直系同源基因分层目录及底层免费软件的更新
Nucleic Acids Res. 2015 Jan;43(Database issue):D250-6. doi: 10.1093/nar/gku1220. Epub 2014 Nov 26.
4
Phylotranscriptomic analysis of the origin and early diversification of land plants.陆地植物起源与早期多样化的系统发育转录组学分析
Proc Natl Acad Sci U S A. 2014 Nov 11;111(45):E4859-68. doi: 10.1073/pnas.1323926111. Epub 2014 Oct 29.
5
Phylogeny of the class Actinobacteria revisited in the light of complete genomes. The orders 'Frankiales' and Micrococcales should be split into coherent entities: proposal of Frankiales ord. nov., Geodermatophilales ord. nov., Acidothermales ord. nov. and Nakamurellales ord. nov.基于全基因组对放线菌纲系统发育的重新审视。“弗兰克氏菌目”和微球菌目应拆分为连贯的分类单元:提议设立新目弗兰克氏菌目、新目嗜皮菌目、新目嗜酸热菌目和新目中村氏菌目。
Int J Syst Evol Microbiol. 2014 Nov;64(Pt 11):3821-3832. doi: 10.1099/ijs.0.063966-0. Epub 2014 Aug 28.
6
Orthology inference in nonmodel organisms using transcriptomes and low-coverage genomes: improving accuracy and matrix occupancy for phylogenomics.利用转录组和低覆盖度基因组在非模式生物中进行直系同源基因推断:提高系统发育基因组学的准确性和矩阵占有率
Mol Biol Evol. 2014 Nov;31(11):3081-92. doi: 10.1093/molbev/msu245. Epub 2014 Aug 25.
7
Orthology detection combining clustering and synteny for very large datasets.结合聚类和共线性分析的直系同源性检测方法用于超大型数据集
PLoS One. 2014 Aug 19;9(8):e105015. doi: 10.1371/journal.pone.0105015. eCollection 2014.
8
Gene duplication as a driver of plant morphogenetic evolution.基因复制作为植物形态发生进化的驱动力。
Curr Opin Plant Biol. 2014 Feb;17:43-8. doi: 10.1016/j.pbi.2013.11.002. Epub 2013 Nov 28.
9
RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies.RAxML 版本 8:用于系统发育分析和大型系统发育后分析的工具。
Bioinformatics. 2014 May 1;30(9):1312-3. doi: 10.1093/bioinformatics/btu033. Epub 2014 Jan 21.
10
PhylomeDB v4: zooming into the plurality of evolutionary histories of a genome.PhylomeDB v4:深入研究基因组的多种进化历史。
Nucleic Acids Res. 2014 Jan;42(Database issue):D897-902. doi: 10.1093/nar/gkt1177. Epub 2013 Nov 25.