Lucas-Dominguez Rut, Alonso-Arroyo Adolfo, Vidal-Infer Antonio, Aleixandre-Benavent Rafael
Department of the History of Science and Information Science, School of Medicine and Dentistry, University of Valencia, Avda. Blasco Ibañez 15, 46010 Valencia, Spain.
UISYS, Joint Research Unit CSIC-University of Valencia, Pza. Cisneros 4, 46003 Valencia, Spain.
Scientometrics. 2021;126(6):4975-4990. doi: 10.1007/s11192-021-03971-6. Epub 2021 Apr 26.
During the previous Ebola and Zika outbreaks, researchers shared their data, allowing many published epidemiological studies to be produced only from open research data, to speed up investigations and control of these infections. This study aims to evaluate the dissemination of the COVID-19 research data underlying scientific publications. Analysis of COVID-19 publications from December 1, 2019, to April 30, 2020, was conducted through the PubMed Central repository to evaluate the research data available through its publication as supplementary material or deposited in repositories. The PubMed Central search generated 5,905 records, of which 804 papers included complementary research data, especially as supplementary material (77.4%). The most productive journals were , and , the most frequent keyword was pneumonia, and the most used repositories were GitHub and GenBank. An expected growth in the number of published articles following the course of the pandemics is confirmed in this work, while the underlying research data are only 13.6%. It can be deduced that data sharing is not a common practice, even in health emergencies, such as the present one. High-impact generalist journals have accounted for a large share of global publishing. The topics most often covered are related to epidemiological and public health concepts, genetics, virology and respiratory diseases, such as pneumonia. However, it is essential to interpret these data with caution following the evolution of publications and their funding in the coming months.
在之前的埃博拉和寨卡疫情期间,研究人员分享了他们的数据,使得许多已发表的流行病学研究仅基于公开研究数据得以开展,从而加速了对这些感染的调查和控制。本研究旨在评估新冠病毒疾病(COVID-19)科学出版物所依据的研究数据的传播情况。通过美国国立医学图书馆的生物医学文献数据库(PubMed Central)对2019年12月1日至2020年4月30日期间的COVID-19出版物进行分析,以评估可通过其作为补充材料发表或存入数据库获得的研究数据。PubMed Central搜索产生了5905条记录,其中804篇论文包含补充研究数据,尤其是作为补充材料(占77.4%)。发文量最多的期刊是《 》、《 》和《 》,最常见的关键词是肺炎,使用最多的数据库是GitHub和GenBank。本研究证实了随着疫情发展发表文章数量的预期增长,而其基础研究数据仅占13.6%。可以推断,即使在当前这样的卫生紧急情况下,数据共享也并非普遍做法。高影响力的综合性期刊在全球出版中占很大比例。最常涉及的主题与流行病学和公共卫生概念、遗传学、病毒学以及呼吸系统疾病(如肺炎)相关。然而,随着未来几个月出版物及其资金情况的发展,谨慎解读这些数据至关重要。