QuartetS-DB: a large-scale orthology database for prokaryotes and eukaryotes inferred by evolutionary evidence.

Author affiliation

United States Department of Defense Biotechnology High Performance Computing Software Applications Institute, Telemedicine and Advanced Technology Research Center, US Army Medical Research and Materiel Command, Fort Detrick, MD 21702, USA.

Publication information

BMC Bioinformatics. 2012 Jun 22;13:143. doi: 10.1186/1471-2105-13-143.

DOI:10.1186/1471-2105-13-143
PMID:22726705
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3434046/
Abstract

BACKGROUND

The concept of orthology is key to decoding evolutionary relationships among genes across different species using comparative genomics. QuartetS is a recently reported algorithm for large-scale orthology detection. Based on the well-established evolutionary principle that gene duplication events discriminate paralogous from orthologous genes, QuartetS has been shown to improve orthology detection accuracy while maintaining computational efficiency.

DESCRIPTION

QuartetS-DB is a new orthology database constructed using the QuartetS algorithm. The database provides orthology predictions among 1621 complete genomes (1365 bacterial, 92 archaeal, and 164 eukaryotic), covering more than seven million proteins and four million pairwise orthologs. It is a major source of orthologous groups, containing more than 300,000 groups of orthologous proteins and 236,000 corresponding gene trees. The database also provides over 500,000 groups of inparalogs. In addition to its size, a distinguishing feature of QuartetS-DB is the ability to allow users to select a cutoff value that modulates the balance between prediction accuracy and coverage of the retrieved pairwise orthologs. The database is accessible at https://applications.bioanalysis.org/quartetsdb.
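
The user-selectable cutoff described above can be pictured with a minimal sketch. It assumes that each pairwise ortholog prediction carries a numeric confidence score and that higher scores mean stronger evidence. The abstract does not specify the score scale or the database's data format, so the `OrthologPair` type, the `score` field, the gene names, and the sample values are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class OrthologPair:
    """One predicted pairwise ortholog (hypothetical record layout)."""
    gene_a: str
    gene_b: str
    score: float  # assumed confidence score; higher = stronger evidence


def filter_by_cutoff(pairs, cutoff):
    """Keep only the pairs whose score meets or exceeds the cutoff."""
    return [p for p in pairs if p.score >= cutoff]


def coverage(pairs, cutoff):
    """Fraction of all predicted pairs that survive the cutoff."""
    if not pairs:
        return 0.0
    return len(filter_by_cutoff(pairs, cutoff)) / len(pairs)


# Illustrative values only; these are not taken from QuartetS-DB.
pairs = [
    OrthologPair("geneA1", "geneB1", 0.95),
    OrthologPair("geneA2", "geneB2", 0.60),
    OrthologPair("geneA3", "geneB3", 0.30),
]

strict = filter_by_cutoff(pairs, 0.9)   # fewer pairs, each with stronger support
lenient = filter_by_cutoff(pairs, 0.5)  # more pairs, weaker ones included
```

Under these assumptions, raising the cutoff shrinks the retrieved set to the best-supported pairs, while lowering it retrieves more pairs at the cost of admitting weaker predictions, which is the accuracy versus coverage balance the description refers to.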

CONCLUSIONS

QuartetS-DB is one of the largest orthology resources available to date. Because its orthology predictions are underpinned by evolutionary evidence obtained from sequenced genomes, we expect its accuracy to continue to increase in future releases as the genomes of additional species are sequenced.
