• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

系统发育体系:一个基于Git的用于社区策划系统发育估计的数据存储库。

Phylesystem: a git-based data store for community-curated phylogenetic estimates.

作者信息

McTavish Emily Jane, Hinchliff Cody E, Allman James F, Brown Joseph W, Cranston Karen A, Holder Mark T, Rees Jonathan A, Smith Stephen A

机构信息

Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS, USA, Heidelberg Institute for Theoretical Studies, Heidelberg 69118, Germany.

Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, USA.

出版信息

Bioinformatics. 2015 Sep 1;31(17):2794-800. doi: 10.1093/bioinformatics/btv276. Epub 2015 May 4.

DOI:10.1093/bioinformatics/btv276
PMID:25940563
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4547614/
Abstract

MOTIVATION

Phylogenetic estimates from published studies can be archived using general platforms like Dryad (Vision, 2010) or TreeBASE (Sanderson et al., 1994). Such services fulfill a crucial role in ensuring transparency and reproducibility in phylogenetic research. However, digital tree data files often require some editing (e.g. rerooting) to improve the accuracy and reusability of the phylogenetic statements. Furthermore, establishing the mapping between tip labels used in a tree and taxa in a single common taxonomy dramatically improves the ability of other researchers to reuse phylogenetic estimates. As the process of curating a published phylogenetic estimate is not error-free, retaining a full record of the provenance of edits to a tree is crucial for openness, allowing editors to receive credit for their work and making errors introduced during curation easier to correct.

RESULTS

Here, we report the development of software infrastructure to support the open curation of phylogenetic data by the community of biologists. The backend of the system provides an interface for the standard database operations of creating, reading, updating and deleting records by making commits to a git repository. The record of the history of edits to a tree is preserved by git's version control features. Hosting this data store on GitHub (http://github.com/) provides open access to the data store using tools familiar to many developers. We have deployed a server running the 'phylesystem-api', which wraps the interactions with git and GitHub. The Open Tree of Life project has also developed and deployed a JavaScript application that uses the phylesystem-api and other web services to enable input and curation of published phylogenetic statements.

AVAILABILITY AND IMPLEMENTATION

Source code for the web service layer is available at https://github.com/OpenTreeOfLife/phylesystem-api. The data store can be cloned from: https://github.com/OpenTreeOfLife/phylesystem. A web application that uses the phylesystem web services is deployed at http://tree.opentreeoflife.org/curator. Code for that tool is available from https://github.com/OpenTreeOfLife/opentree.

CONTACT

mtholder@gmail.com.

摘要

动机

已发表研究中的系统发育估计可以使用Dryad(Vision,2010)或TreeBASE(Sanderson等人,1994)等通用平台进行存档。此类服务在确保系统发育研究的透明度和可重复性方面发挥着关键作用。然而,数字树数据文件通常需要一些编辑(例如重新定根)以提高系统发育陈述的准确性和可重用性。此外,在单个通用分类法中建立树中使用的末端标签与分类单元之间的映射,可显著提高其他研究人员重用系统发育估计的能力。由于整理已发表的系统发育估计的过程并非无差错,保留对树的编辑来源的完整记录对于开放性至关重要,这使编辑能够因他们的工作而获得认可,并使整理过程中引入的错误更容易纠正。

结果

在此,我们报告了软件基础设施的开发,以支持生物学家群体对系统发育数据进行开放整理。该系统的后端通过向git存储库提交来提供用于创建、读取、更新和删除记录的标准数据库操作的接口。git的版本控制功能保留了对树的编辑历史记录。将此数据存储托管在GitHub(http://github.com/)上,可使用许多开发人员熟悉的工具对数据存储进行开放访问。我们已经部署了一台运行“phylesystem-api”的服务器,它封装了与git和GitHub的交互。生命之树开放项目还开发并部署了一个JavaScript应用程序,该应用程序使用phylesystem-api和其他网络服务来实现已发表系统发育陈述的输入和整理。

可用性与实现

网络服务层的源代码可在https://github.com/OpenTreeOfLife/phylesystem-api获取。数据存储可从https://github.com/OpenTreeOfLife/phylesystem克隆。使用phylesystem网络服务的网络应用程序部署在http://tree.opentreeoflife.org/curator。该工具的代码可从https://github.com/OpenTreeOfLife/opentree获取。

联系方式

mtholder@gmail.com。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c65/4547614/280c38e810c6/btv276f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c65/4547614/280c38e810c6/btv276f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c65/4547614/280c38e810c6/btv276f1p.jpg

相似文献

1
Phylesystem: a git-based data store for community-curated phylogenetic estimates.系统发育体系:一个基于Git的用于社区策划系统发育估计的数据存储库。
Bioinformatics. 2015 Sep 1;31(17):2794-800. doi: 10.1093/bioinformatics/btv276. Epub 2015 May 4.
2
DendroPy: a Python library for phylogenetic computing.DendroPy:一个用于系统发育计算的 Python 库。
Bioinformatics. 2010 Jun 15;26(12):1569-71. doi: 10.1093/bioinformatics/btq228. Epub 2010 Apr 25.
3
Phylo.io: Interactive Viewing and Comparison of Large Phylogenetic Trees on the Web.Phylo.io:在网络上对大型系统发育树进行交互式查看和比较。
Mol Biol Evol. 2016 Aug;33(8):2163-6. doi: 10.1093/molbev/msw080. Epub 2016 Apr 19.
4
Treehouse: a user-friendly application to obtain subtrees from large phylogenies.树屋:一款从大型系统发育树中获取子树的用户友好型应用程序。
BMC Res Notes. 2019 Aug 27;12(1):541. doi: 10.1186/s13104-019-4577-5.
5
LAILAPS-QSM: A RESTful API and JAVA library for semantic query suggestions.Lailaps-QSM:用于语义查询建议的 RESTful API 和 JAVA 库。
PLoS Comput Biol. 2018 Mar 12;14(3):e1006058. doi: 10.1371/journal.pcbi.1006058. eCollection 2018 Mar.
6
The Ensembl REST API: Ensembl Data for Any Language.Ensembl REST应用程序编程接口:适用于任何语言的Ensembl数据。
Bioinformatics. 2015 Jan 1;31(1):143-5. doi: 10.1093/bioinformatics/btu613. Epub 2014 Sep 17.
7
FirebrowseR: an R client to the Broad Institute's Firehose Pipeline.FirebrowseR:一款用于连接布罗德研究所Firehose管道的R客户端。
Database (Oxford). 2017 Jan 6;2017. doi: 10.1093/database/baw160. Print 2017.
8
AlgoRun: a Docker-based packaging system for platform-agnostic implemented algorithms.AlgoRun:一种用于与平台无关的已实现算法的基于Docker的打包系统。
Bioinformatics. 2016 Aug 1;32(15):2396-8. doi: 10.1093/bioinformatics/btw120. Epub 2016 Mar 2.
9
IcyTree: rapid browser-based visualization for phylogenetic trees and networks.IcyTree:基于浏览器的快速进化树和网络图可视化工具。
Bioinformatics. 2017 Aug 1;33(15):2392-2394. doi: 10.1093/bioinformatics/btx155.
10
A Java API for working with PubChem datasets.一个用于处理 PubChem 数据集的 Java API。
Bioinformatics. 2011 Mar 1;27(5):741-2. doi: 10.1093/bioinformatics/btq715. Epub 2011 Jan 6.

引用本文的文献

1
A complete and dynamic tree of birds.一个完整且动态的鸟类谱系图。
Proc Natl Acad Sci U S A. 2025 May 6;122(18):e2409658122. doi: 10.1073/pnas.2409658122. Epub 2025 Apr 29.
2
Is variation in female aggressiveness across species associated with reproductive potential?雌性在不同物种间的攻击性差异与繁殖潜力有关吗?
Proc Biol Sci. 2025 Apr;292(2044):20242301. doi: 10.1098/rspb.2024.2301. Epub 2025 Apr 9.
3
More social species live longer, have longer generation times and longer reproductive windows.社会性越强的物种寿命越长,世代时间越长,繁殖窗口也越长。

本文引用的文献

1
Synthesis of phylogeny and taxonomy into a comprehensive tree of life.将系统发育学和分类学整合为一个全面的生命之树。
Proc Natl Acad Sci U S A. 2015 Oct 13;112(41):12764-9. doi: 10.1073/pnas.1423041112. Epub 2015 Sep 18.
2
The dawn of open access to phylogenetic data.系统发育数据开放获取的开端。
PLoS One. 2014 Oct 24;9(10):e110268. doi: 10.1371/journal.pone.0110268. eCollection 2014.
3
Best practices for data sharing in phylogenetic research.系统发育研究中数据共享的最佳实践。
Philos Trans R Soc Lond B Biol Sci. 2024 Dec 16;379(1916):20220459. doi: 10.1098/rstb.2022.0459. Epub 2024 Oct 28.
4
DateLife: Leveraging Databases and Analytical Tools to Reveal the Dated Tree of Life.DateLife:利用数据库和分析工具揭示有日期的生命之树。
Syst Biol. 2024 Jul 27;73(2):470-485. doi: 10.1093/sysbio/syae015.
5
Genomic Assessment of the Contribution of the Endosymbiont of to Gall Induction.对 内共生体诱导虫瘿形成的贡献的基因组评估。
Int J Mol Sci. 2023 Jun 1;24(11):9613. doi: 10.3390/ijms24119613.
6
The electronic tree of life (eToL): a net of long probes to characterize the microbiome from RNA-seq data.电子生命之树 (eToL):从 RNA-seq 数据中描述微生物组的长探针网络。
BMC Microbiol. 2022 Dec 22;22(1):317. doi: 10.1186/s12866-022-02671-2.
7
Genetic sex determination, sex chromosome size and sex-specific lifespans across tetrapods.四足动物的遗传性别决定、性染色体大小和性别特异性寿命。
J Evol Biol. 2023 Feb;36(2):480-494. doi: 10.1111/jeb.14130. Epub 2022 Dec 20.
8
Quantifying research interests in 7,521 mammalian species with h-index: a case study.用 h 指数量化 7,521 种哺乳动物的研究兴趣:案例研究。
Gigascience. 2022 Aug 13;11. doi: 10.1093/gigascience/giac074.
9
A synthesis tree of the Copepoda: integrating phylogenetic and taxonomic data reveals multiple origins of parasitism.桡足纲的综合树:整合系统发育和分类数据揭示了寄生现象的多个起源。
PeerJ. 2021 Aug 18;9:e12034. doi: 10.7717/peerj.12034. eCollection 2021.
10
Physcraper: a Python package for continually updated phylogenetic trees using the Open Tree of Life.Physcraper:一个使用生命之树开放图谱持续更新系统发育树的Python软件包。
BMC Bioinformatics. 2021 Jun 29;22(1):355. doi: 10.1186/s12859-021-04274-6.
PLoS Curr. 2014 Jun 19;6:ecurrents.tol.bf01eff4a6b60ca4825c69293dc59645. doi: 10.1371/currents.tol.bf01eff4a6b60ca4825c69293dc59645.
4
Scientific names of organisms: attribution, rights, and licensing.生物体的学名:归属、权利和许可。
BMC Res Notes. 2014 Feb 4;7:79. doi: 10.1186/1756-0500-7-79.
5
Lost branches on the tree of life.生命之树上的失落枝丫。
PLoS Biol. 2013 Sep;11(9):e1001636. doi: 10.1371/journal.pbio.1001636. Epub 2013 Sep 3.
6
Git can facilitate greater reproducibility and increased transparency in science.Git有助于提高科学研究的可重复性和透明度。
Source Code Biol Med. 2013 Feb 28;8(1):7. doi: 10.1186/1751-0473-8-7.
7
Sharing and re-use of phylogenetic trees (and associated data) to facilitate synthesis.共享和重新使用系统发育树(及相关数据)以促进综合分析。
BMC Res Notes. 2012 Oct 22;5:574. doi: 10.1186/1756-0500-5-574.
8
NeXML: rich, extensible, and verifiable representation of comparative data and metadata.NeXML:用于比较数据和元数据的丰富、可扩展和可验证的表示形式。
Syst Biol. 2012 Jul;61(4):675-89. doi: 10.1093/sysbio/sys025. Epub 2012 Feb 22.
9
Missing the forest for the trees: phylogenetic compression and its implications for inferring complex evolutionary histories.只见树木不见森林:系统发育压缩及其对推断复杂进化历史的影响。
Syst Biol. 2005 Feb;54(1):146-57. doi: 10.1080/10635150590905984.
10
NEXUS: an extensible file format for systematic information.NEXUS:一种用于系统信息的可扩展文件格式。
Syst Biol. 1997 Dec;46(4):590-621. doi: 10.1093/sysbio/46.4.590.