TBMap: a taxonomic perspective on the phylogenetic database TreeBASE.

Author information

Page Roderic D M

Affiliation

Division of Environmental and Evolutionary Biology, Institute of Biomedical and Life Sciences, Graham Kerr Building, University of Glasgow, Glasgow, UK.

Publication information

BMC Bioinformatics. 2007 May 18;8:158. doi: 10.1186/1471-2105-8-158.

DOI:10.1186/1471-2105-8-158
PMID:17511869
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC1885449/
Abstract

BACKGROUND

TreeBASE is currently the only available large-scale database of published organismal phylogenies. Its utility is hampered by a lack of taxonomic consistency, both within the database, and with names of organisms in external genomic, specimen, and taxonomic databases. The extent to which the phylogenetic knowledge in TreeBASE becomes integrated with these other sources is limited by this lack of consistency.

DESCRIPTION

Taxonomic names in TreeBASE were mapped onto names in the external taxonomic databases IPNI, ITIS, NCBI, and uBio, and a graph G of these mappings was constructed. Additional edges representing taxonomic synonymies were added to G, then all connected components of G were extracted. These components correspond to "name clusters", and group together names in TreeBASE that are inferred to refer to the same taxon. The mapping to NCBI enables hierarchical queries to be performed, which can improve TreeBASE information retrieval by an order of magnitude.
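
The clustering step described above can be illustrated with a minimal sketch. It is not the author's implementation: the function name `name_clusters`, the node labelling, and all names and identifiers in the example data are hypothetical, and a union-find structure is used here simply as one common way to extract the connected components of a graph.

```python
from collections import defaultdict


def name_clusters(mappings, synonyms):
    """Group TreeBASE names into clusters via connected components.

    mappings: iterable of (treebase_name, external_id) pairs
    synonyms: iterable of (external_id, external_id) pairs
    Both inputs are hypothetical stand-ins for the real mapping tables.
    """
    parent = {}

    def find(x):
        # Return the representative of x's component (with path halving).
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    def union(a, b):
        # Merge the components containing a and b.
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[rb] = ra

    treebase_nodes = set()
    for name, ext in mappings:
        node = ("treebase", name)
        treebase_nodes.add(node)
        union(node, ("external", ext))   # edge: TreeBASE name -> external name
    for a, b in synonyms:
        union(("external", a), ("external", b))  # edge: synonymy

    # Collect the TreeBASE names found in each component.
    clusters = defaultdict(set)
    for node in treebase_nodes:
        clusters[find(node)].add(node[1])
    return sorted(sorted(c) for c in clusters.values())


# Made-up example data: placeholder names and identifiers only.
example_mappings = [
    ("Aus bus", "ncbi:1"),
    ("Aus bus GB123", "ncbi:1"),
    ("Aus bus", "itis:10"),
    ("Aus cus", "ipni:5"),
    ("Xus yus", "ubio:7"),
]
example_synonyms = [("itis:10", "ipni:5")]
clusters = name_clusters(example_mappings, example_synonyms)
```

In this toy input, "Aus bus" and "Aus bus GB123" share an external name, and "Aus cus" joins them only through the synonymy edge, so the three end up in one cluster while "Xus yus" stays alone.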

CONCLUSION

The TBMap database provides a mapping of the bulk of the names in TreeBASE to names in external taxonomic databases, and a clustering of those mappings into sets of names that can be regarded as equivalent. This mapping enables queries and visualisations that cannot otherwise be constructed. A simple query interface to the mapping and name clusters is available at http://linnaeus.zoology.gla.ac.uk/~rpage/tbmap.
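
One kind of query that a mapping to a hierarchical classification makes possible is a search by higher taxon. The sketch below is only an illustration under stated assumptions: the function `names_under`, the child-to-parent dictionary, and the name-to-taxon mapping are hypothetical, and the lineage is a heavily simplified toy rather than the actual NCBI taxonomy.

```python
def names_under(taxon, parent_of, name_to_taxon):
    """Return TreeBASE names mapped to `taxon` or to any of its descendants.

    parent_of: dict mapping each taxon to its parent (assumed acyclic)
    name_to_taxon: dict mapping a TreeBASE name to its mapped taxon
    """
    def is_within(node):
        # Walk up the classification until we hit `taxon` or run out of parents.
        while node is not None:
            if node == taxon:
                return True
            node = parent_of.get(node)
        return False

    return sorted(n for n, t in name_to_taxon.items() if is_within(t))


# Simplified toy classification (intermediate ranks omitted).
toy_parent_of = {
    "Homo sapiens": "Homo",
    "Homo": "Hominidae",
    "Pan troglodytes": "Pan",
    "Pan": "Hominidae",
    "Hominidae": "Primates",
    "Mus musculus": "Mus",
    "Mus": "Muridae",
    "Muridae": "Rodentia",
}
# Hypothetical TreeBASE name strings and the taxa they map to.
toy_name_to_taxon = {
    "Homo sapiens": "Homo sapiens",
    "H. sapiens": "Homo sapiens",
    "Pan troglodytes": "Pan troglodytes",
    "Mus musculus": "Mus musculus",
}
primate_names = names_under("Primates", toy_parent_of, toy_name_to_taxon)
```

A plain string search for "Primates" would match none of these names, whereas the hierarchical lookup returns every name mapped somewhere below that taxon.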

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93d2/1885449/eec414df5ec4/1471-2105-8-158-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93d2/1885449/a3568101a600/1471-2105-8-158-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93d2/1885449/d60b11c9f8b8/1471-2105-8-158-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93d2/1885449/7301510739f6/1471-2105-8-158-4.jpg

