Suppr超能文献

TBMap:关于系统发育数据库TreeBASE的分类学视角。

TBMap: a taxonomic perspective on the phylogenetic database TreeBASE.

作者信息

Page Roderic D M

机构信息

Division of Environmental and Evolutionary Biology, Institute of Biomedical and Life Sciences, Graham Kerr Building, University of Glasgow, Glasgow, UK.

出版信息

BMC Bioinformatics. 2007 May 18;8:158. doi: 10.1186/1471-2105-8-158.

Abstract

BACKGROUND

TreeBASE is currently the only available large-scale database of published organismal phylogenies. Its utility is hampered by a lack of taxonomic consistency, both within the database, and with names of organisms in external genomic, specimen, and taxonomic databases. The extent to which the phylogenetic knowledge in TreeBASE becomes integrated with these other sources is limited by this lack of consistency.

DESCRIPTION

Taxonomic names in TreeBASE were mapped onto names in the external taxonomic databases IPNI, ITIS, NCBI, and uBio, and graph G of these mappings was constructed. Additional edges representing taxonomic synonymies were added to G, then all components of G were extracted. These components correspond to "name clusters", and group together names in TreeBASE that are inferred to refer to the same taxon. The mapping to NCBI enables hierarchical queries to be performed, which can improve TreeBASE information retrieval by an order of magnitude.

CONCLUSION

TBMap database provides a mapping of the bulk of the names in TreeBASE to names in external taxonomic databases, and a clustering of those mappings into sets of names that can be regarded as equivalent. This mapping enables queries and visualisations that cannot otherwise be constructed. A simple query interface to the mapping and names clusters is available at http://linnaeus.zoology.gla.ac.uk/~rpage/tbmap.

摘要

背景

TreeBASE是目前唯一可用的已发表生物系统发育的大规模数据库。由于在数据库内部以及与外部基因组、标本和分类数据库中的生物名称缺乏分类一致性,其效用受到阻碍。由于缺乏这种一致性,TreeBASE中的系统发育知识与其他这些来源整合的程度受到限制。

描述

将TreeBASE中的分类名称映射到外部分类数据库IPNI、ITIS、NCBI和uBio中的名称,并构建这些映射的图G。将表示分类同义词的附加边添加到G中,然后提取G的所有组件。这些组件对应于“名称簇”,并将TreeBASE中推断指同一分类单元的名称组合在一起。与NCBI的映射使得能够执行分层查询,这可以将TreeBASE信息检索提高一个数量级。

结论

TBMap数据库提供了TreeBASE中大部分名称到外部分类数据库中名称的映射,并将这些映射聚类为可视为等效的名称集。这种映射使得能够进行否则无法构建的查询和可视化。可在http://linnaeus.zoology.gla.ac.uk/~rpage/tbmap获得一个到该映射和名称簇的简单查询界面。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93d2/1885449/eec414df5ec4/1471-2105-8-158-1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验