TreeHub: a comprehensive dataset of phylogenetic trees.

Authors

Wu Ping, Cao Yawei, Yang Jiajie, Wu Hui

Affiliations

College of Life Science, Sichuan Normal University, Chengdu, Sichuan, 610101, China.

Big Data and AI Biodiversity Conservation Research Center, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China.

Publication information

Sci Data. 2025 Jun 2;12(1):924. doi: 10.1038/s41597-025-05282-4.

Abstract

Phylogenetic relationships are crucial for answering a wide range of biological questions and constitute fundamental knowledge in biology. However, the application of phylogenetic trees has been limited by inadequate coverage of recently published phylogenies and by the scarcity of reliable, comprehensive datasets. In this study, we present a novel approach for automatically extracting phylogenetic data and integrating relevant species information from scientific papers and public databases. Using this approach, we constructed TreeHub, a dataset of 135,502 phylogenetic trees from 7,879 phylogenetic research articles across 609 academic journals. This database will serve as a reliable and accessible resource for the scientific community, accelerating innovation in biodiversity studies and evolutionary theory based on high-density data.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2852/12130454/5a06f1928110/41597_2025_5282_Fig1_HTML.jpg
