Quantitative Disease Ecology and Conservation (QDEC) Lab Group, Department of Geography, University of Florida, Gainesville, FL 32611, USA.
Emerging Pathogens Institute, University of Florida, Gainesville, FL 32610, USA.
J Med Entomol. 2023 Mar 6;60(2):247-254. doi: 10.1093/jme/tjad009.
A growing body of information on vector-borne diseases has arisen as increasing research focus has been directed towards the need for anticipating risk, optimizing surveillance, and understanding the fundamental biology of vector-borne diseases to direct control and mitigation efforts. The scope and scale of this information, in the form of data, comprising database efforts, data storage, and serving approaches, means that it is distributed across many formats and data types. Data ranges from collections records to molecular characterization, geospatial data to interactions of vectors and traits, infection experiments to field trials. New initiatives arise, often spanning the effort traditionally siloed in specific research disciplines, and other efforts wane, perhaps in response to funding declines, different research directions, or lack of sustained interest. Thusly, the world of vector data - the Vector Data Ecosystem - can become unclear in scope, and the flows of data through these various efforts can become stymied by obsolescence, or simply by gaps in access and interoperability. As increasing attention is paid to creating FAIR (Findable Accessible Interoperable, and Reusable) data, simply characterizing what is 'out there', and how these existing data aggregation and collection efforts interact, or interoperate with each other, is a useful exercise. This study presents a snapshot of current vector data efforts, reporting on level of accessibility, and commenting on interoperability using an illustration to track a specimen through the data ecosystem to understand where it occurs for the database efforts anticipated to describe it (or parts of its extended specimen data).
随着越来越多的研究关注于预测风险、优化监测以及理解虫媒病的基本生物学,以指导控制和缓解工作,有关虫媒病的信息不断增加。这些信息的范围和规模,以数据的形式出现,包括数据库工作、数据存储和服务方法,意味着它分布在许多格式和数据类型中。数据范围从馆藏记录到分子特征描述、地理空间数据到媒介和特征的相互作用、感染实验到野外试验。新的举措不断涌现,通常跨越了传统上在特定研究学科中孤立的努力,而其他努力则减弱,可能是由于资金减少、不同的研究方向或缺乏持续的兴趣。因此,虫媒数据的世界——虫媒数据生态系统——在范围上可能变得不清晰,而这些各种努力中的数据流动可能会因为过时、或者仅仅是因为访问和互操作性的差距而受阻。随着人们越来越关注创建 FAIR(可发现、可访问、可互操作和可重用)数据,简单地描述“现有”的数据是什么,以及这些现有的数据聚合和收集工作如何相互作用或互操作,是一项有用的工作。本研究展示了当前虫媒数据工作的一个快照,报告了可访问性的水平,并通过一个示例来评论互操作性,以跟踪标本在数据生态系统中的位置,从而了解它在预期描述它的数据库工作中出现的位置(或其扩展标本数据的部分位置)。