Mesibov Robert
West Ulverstone, Tasmania, Australia 7315.
Zookeys. 2018 Apr 20(751):129-146. doi: 10.3897/zookeys.751.24791. eCollection 2018.
A total of ca 800,000 occurrence records from the Australian Museum (AM), Museums Victoria (MV) and the New Zealand Arthropod Collection (NZAC) were audited for changes in selected Darwin Core fields after processing by the Atlas of Living Australia (ALA; for AM and MV records) and the Global Biodiversity Information Facility (GBIF; for AM, MV and NZAC records). Formal taxon names in the genus- and species-groups were changed in 13-21% of AM and MV records, depending on dataset and aggregator. There was little agreement between the two aggregators on processed names, with names changed in two to three times as many records by one aggregator alone compared to records with names changed by both aggregators. The type status of specimen records did not change with name changes, resulting in confusion as to the name with which a type was associated. Data losses of up to 100% were found after processing in some fields, apparently due to programming errors. The taxonomic usefulness of occurrence records could be improved if aggregators included both original and the processed taxonomic data items for each record. It is recommended that end-users check original and processed records for data loss and name replacements after processing by aggregators.
对来自澳大利亚博物馆(AM)、维多利亚博物馆(MV)和新西兰节肢动物收藏馆(NZAC)的总计约800,000条出现记录进行了审核,以检查在经过澳大利亚生物图谱(ALA;用于AM和MV记录)和全球生物多样性信息设施(GBIF;用于AM、MV和NZAC记录)处理后,选定的达尔文核心字段是否有变化。在AM和MV记录中,属级和种级的正式分类名称在13%-21%的记录中发生了变化,具体取决于数据集和聚合器。两个聚合器在处理后的名称上几乎没有一致性,仅一个聚合器更改名称的记录数量是两个聚合器都更改名称的记录数量的两到三倍。标本记录的模式状态并未随名称变化而改变,这导致了与模式相关联的名称产生混淆。在某些字段处理后发现数据损失高达100%,显然是由于编程错误。如果聚合器为每条记录同时包含原始和处理后的分类数据项,出现记录的分类学实用性可能会得到提高。建议最终用户在聚合器处理后检查原始记录和处理后的记录,以查看是否存在数据损失和名称替换情况。