Suppr超能文献

《世界植物区系》:一个R软件包,用于根据《世界植物区系在线》分类学主干数据对植物名称进行精确和模糊匹配。

WorldFlora: An R package for exact and fuzzy matching of plant names against the World Flora Online taxonomic backbone data.

作者信息

Kindt Roeland

机构信息

Tree Productivity and Diversity World Agroforestry P.O. Box 30677-00100 Nairobi Kenya.

出版信息

Appl Plant Sci. 2020 Sep 25;8(9):e11388. doi: 10.1002/aps3.11388. eCollection 2020 Sep.

Abstract

PREMISE

The standardization of plant names is a critical step in various fields of biology, including biodiversity, biogeography, and vegetation research. The WorldFlora package is introduced here to help achieve this goal by matching lists of plant names with a static copy from World Flora Online (WFO), an ongoing global effort to complete an online flora of all known vascular plants and bryophytes by 2020.

METHODS AND RESULTS

Based on direct and fuzzy matching, WorldFlora inserts matching cases from the WFO to a submitted data set containing taxonomic names. The results and success rates for selecting the expected best single matches are presented for four data sets, including two data sets used in recent comparisons of software tools for correcting taxon names.

CONCLUSIONS

WorldFlora offers a straightforward pipeline for semi-automatic plant name checking. For the four data sets, the success rate of credible matches ranged from 94.7% to 99.9%.

摘要

前提

植物名称的标准化是生物学各个领域的关键步骤,包括生物多样性、生物地理学和植被研究。本文介绍了WorldFlora软件包,旨在通过将植物名称列表与来自《世界植物在线》(WFO)的静态副本进行匹配来帮助实现这一目标。WFO是一项正在进行的全球努力,目标是到2020年完成所有已知维管植物和苔藓植物的在线植物志。

方法与结果

基于直接匹配和模糊匹配,WorldFlora将来自WFO的匹配案例插入到包含分类学名称的提交数据集中。给出了四个数据集选择预期最佳单一匹配的结果和成功率,其中包括最近用于分类群名称校正软件工具比较的两个数据集。

结论

WorldFlora提供了一个用于半自动植物名称检查的简单流程。对于这四个数据集,可靠匹配的成功率在94.7%至99.9%之间。

相似文献

1
WorldFlora: An R package for exact and fuzzy matching of plant names against the World Flora Online taxonomic backbone data.
Appl Plant Sci. 2020 Sep 25;8(9):e11388. doi: 10.1002/aps3.11388. eCollection 2020 Sep.
2
The big four of plant taxonomy - a comparison of global checklists of vascular plant names.
New Phytol. 2023 Nov;240(4):1687-1702. doi: 10.1111/nph.18961. Epub 2023 May 27.
3
The taxonomic name resolution service: an online tool for automated standardization of plant names.
BMC Bioinformatics. 2013 Jan 16;14:16. doi: 10.1186/1471-2105-14-16.
4
U.Taxonstand: An R package for standardizing scientific names of plants and animals.
Plant Divers. 2022 Sep 8;45(1):1-5. doi: 10.1016/j.pld.2022.09.001. eCollection 2023 Jan.
5
Treemendous: an R package for integrating taxonomic information across backbones.
PeerJ. 2024 Feb 28;12:e16896. doi: 10.7717/peerj.16896. eCollection 2024.
7
Solr-Plant: efficient extraction of plant names from text.
BMC Bioinformatics. 2019 May 22;20(1):263. doi: 10.1186/s12859-019-2874-6.
9
Taxamatch, an algorithm for near ('fuzzy') matching of scientific names in taxonomic databases.
PLoS One. 2014 Sep 23;9(9):e107510. doi: 10.1371/journal.pone.0107510. eCollection 2014.
10
"gnparser": a powerful parser for scientific names based on Parsing Expression Grammar.
BMC Bioinformatics. 2017 May 26;18(1):279. doi: 10.1186/s12859-017-1663-3.

引用本文的文献

1
An updated checklist of vascular plants of Myanmar.
PhytoKeys. 2025 Aug 11;261:135-164. doi: 10.3897/phytokeys.261.154986. eCollection 2025.
4
5
A Pantropical Analysis of Fire Impacts and Post-Fire Species Recovery of Plant Life Forms.
Ecol Evol. 2025 Feb 17;15(2):e71018. doi: 10.1002/ece3.71018. eCollection 2025 Feb.
6
Pytaxon: A Python software for resolving and correcting taxonomic names in biodiversity data.
Biodivers Data J. 2025 Jan 8;13:e138257. doi: 10.3897/BDJ.13.e138257. eCollection 2025.
7
Pollination Across the Diel Cycle: A Global Meta-Analysis.
Ecol Lett. 2025 Jan;28(1):e70036. doi: 10.1111/ele.70036.
9
expowo: An R package for mining global plant diversity and distribution data.
Appl Plant Sci. 2024 Jul 30;12(6):e11609. doi: 10.1002/aps3.11609. eCollection 2024 Nov-Dec.
10
florabr: An R package to explore and spatialize species distribution using Flora e Funga do Brasil.
Appl Plant Sci. 2024 Aug 29;12(6):e11616. doi: 10.1002/aps3.11616. eCollection 2024 Nov-Dec.

本文引用的文献

1
A method to implement continuous characters in digital identification keys that estimates the probability of an annotation.
Appl Plant Sci. 2019 May 8;7(5):e01247. doi: 10.1002/aps3.1247. eCollection 2019 May.
2
Solr-Plant: efficient extraction of plant names from text.
BMC Bioinformatics. 2019 May 22;20(1):263. doi: 10.1186/s12859-019-2874-6.
3
Towards a dynamic list of Amazonian tree species.
Sci Rep. 2019 Mar 5;9(1):3501. doi: 10.1038/s41598-019-40101-y.
4
Multidimensional biases, gaps and uncertainties in global plant occurrence information.
Ecol Lett. 2016 Aug;19(8):992-1006. doi: 10.1111/ele.12624. Epub 2016 Jun 2.
5
Taxamatch, an algorithm for near ('fuzzy') matching of scientific names in taxonomic databases.
PLoS One. 2014 Sep 23;9(9):e107510. doi: 10.1371/journal.pone.0107510. eCollection 2014.
6
The taxonomic name resolution service: an online tool for automated standardization of plant names.
BMC Bioinformatics. 2013 Jan 16;14:16. doi: 10.1186/1471-2105-14-16.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验