CREAGEN - Environmental, Genetic and Nutritional Epidemiology Research Center, Section of Public Health, Department of Biomedical, Metabolic and Neural Sciences, University of Modena and Reggio Emilia, Modena, Italy.
School of Public Health, University of California Berkeley, Berkeley, CA, USA.
Ann Ig. 2023 May-Jun;35(3):344-358. doi: 10.7416/ai.2022.2514. Epub 2022 Sep 29.
Since the beginning of the COVID-19 outbreak in Italy, health authorities have released epidemiologic data on the disease. These periodically updated data were the most important source of information for researchers seeking to predict the spread of the epidemic. However, comprehensive and timely data on the evolution of COVID-19 have not always been made available to researchers and physicians.
The aim of our work was to investigate the quality, availability and format of epidemiologic data on COVID-19 in Italy across different geographic areas and time periods. We tried to access the online resources made available by each of the 19 Italian Regions and the two autonomous Provinces and, in more detail, by the Local Health Authorities of one of them, the Emilia-Romagna Region. We analyzed the main sources and flows of data (namely new and cumulative cases of infection, total swabs, and new and cumulative COVID-19 deaths, overall and by sex), describing characteristics such as accessibility, format and completeness. Finally, we reviewed the data published by the Italian Ministry of Health, the National Institute of Health (ISS) and the Civil Protection Department. The Tim Berners-Lee scale (the 5-star open data scheme) was used to evaluate the open data format.
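As an illustration of how a format-based rating on the Tim Berners-Lee scale might work, the following minimal Python sketch assigns a star level to a published file. It is not the procedure used in the study: the extension-to-star mapping, the function name `star_rating` and its parameters are assumptions made for the example, and a real assessment would also need to check licensing and linkage by inspection.

```python
from pathlib import PurePosixPath

# Hypothetical mapping (an assumption for this sketch) from file extension to
# the highest star level that the format alone can justify.
FORMAT_STARS = {
    ".pdf": 1, ".png": 1, ".jpg": 1,    # on the web, but not structured data
    ".xls": 2, ".xlsx": 2,              # structured, proprietary format
    ".csv": 3, ".json": 3, ".xml": 3,   # structured, non-proprietary format
    ".rdf": 4, ".ttl": 4,               # uses URIs to identify things
}


def star_rating(filename, open_licence=True, links_to_other_data=False):
    """Return a 0-5 star level for a published data file.

    0 means the file does not even reach one star (no open licence).
    Unknown extensions are conservatively treated as one star.
    """
    if not open_licence:
        return 0
    suffix = PurePosixPath(filename).suffix.lower()
    stars = FORMAT_STARS.get(suffix, 1)
    # The fifth star requires linked data that also points to other datasets.
    if stars == 4 and links_to_other_data:
        stars = 5
    return stars
```

Under these assumptions, a daily bulletin released only as a PDF would score one star, whereas the same counts released as a CSV file would score three.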
The flow of COVID-19 epidemiologic data in Italy originated from the Local Health Authorities, which transmitted the data daily to the regional authorities, which in turn transferred them to the national authorities. We found considerable heterogeneity in both the content and the format of the released data, at both the local and the regional level. Few Regions released data in open format. ISS was the only national source that, since spring 2020, provided the number of COVID-19 health outcomes by sex and age group.
Although multiple potentially useful sources of COVID-19 epidemiologic data exist in Italy, very few open format data were available at either the macro-geographic level (e.g. per Region) or the provincial level. Access to open format epidemiologic data should be made easier, so that researchers can adequately assess future epidemics and thereby support timely and effective public health interventions.