Institute of Health Information and Statistics of the Czech Republic, Prague, Czech Republic.
Institute of Biostatistics and Analyses, Faculty of Medicine, Masaryk University, Brno, Czech Republic.
PLoS One. 2022 Apr 21;17(4):e0267397. doi: 10.1371/journal.pone.0267397. eCollection 2022.
At the time of the COVID-19 pandemic, providing access to data (properly optimised regarding personal data protection) plays a crucial role in providing the general public and media with up-to-date information. Open datasets also represent one of the means for evaluation of the pandemic on a global level. The primary aim of this paper is to describe the methodological and technical framework for publishing datasets describing characteristics related to the COVID-19 epidemic in the Czech Republic (epidemiology, hospital-based care, vaccination), including the use of these datasets in practice. Practical aspects and experience with data sharing are discussed. As a reaction to the epidemic situation, a new portal COVID-19: Current Situation in the Czech Republic (https://onemocneni-aktualne.mzcr.cz/covid-19) was developed and launched in March 2020 to provide a fully-fledged and trustworthy source of information for the public and media. The portal also contains a section for the publication of (i) public open datasets available for download in CSV and JSON formats and (ii) authorised-access-only section where the authorised persons can (through an online generated token) safely visualise or download regional datasets with aggregated data at the level of the individual municipalities and regions. The data are also provided to the local open data catalogue (covering only open data on healthcare, provided by the Ministry of Health) and to the National Catalogue of Open Data (covering all open data sets, provided by various authorities/publishers, and harversting all data from local catalogues). The datasets have been published in various authentication regimes and widely used by general public, scientists, public authorities and decision-makers. The total number of API calls since its launch in March 2020 to 15 December 2020 exceeded 13 million. The datasets have been adopted as an official and guaranteed source for outputs of third parties, including public authorities, non-governmental organisations, scientists and online news portals. Datasets currently published as open data meet the 3-star open data requirements, which makes them machine-readable and facilitates their further usage without restrictions. This is essential for making the data more easily understandable and usable for data consumers. In conjunction with the strategy of the MH in the field of data opening, additional datasets meeting the already implemented standards will be also released, both on COVID-19 related and unrelated topics.
在 COVID-19 大流行期间,提供数据(在个人数据保护方面进行适当优化)对于向公众和媒体提供最新信息至关重要。开放数据集也是评估全球大流行的手段之一。本文的主要目的是描述发布描述与捷克共和国 COVID-19 流行相关特征的数据集的方法学和技术框架(流行病学、基于医院的护理、疫苗接种),包括在实践中使用这些数据集。讨论了数据共享的实际方面和经验。作为对疫情的反应,2020 年 3 月开发并推出了一个新的门户 COVID-19:捷克共和国当前状况(https://onemocneni-aktualne.mzcr.cz/covid-19),为公众和媒体提供一个全面可靠的信息来源。该门户还包含一个用于发布的部分(i)可下载 CSV 和 JSON 格式的公共开放数据集,以及(ii)仅授权访问部分,授权人员可以(通过在线生成的令牌)安全地查看或下载具有单个直辖市和地区聚合数据的区域数据集。这些数据还提供给本地开放数据目录(仅涵盖卫生部提供的医疗保健方面的开放数据)和国家开放数据目录(涵盖各当局/发布者提供的所有开放数据集,并从本地目录中获取所有数据)。这些数据集以各种认证制度发布,并被公众、科学家、公共当局和决策者广泛使用。自 2020 年 3 月推出以来,到 2020 年 12 月 15 日,API 调用总数超过 1300 万次。这些数据集已被采用为第三方(包括公共当局、非政府组织、科学家和在线新闻门户)输出的官方和有保障的来源。目前作为开放数据发布的数据集符合 3 星级开放数据要求,这使其具有机器可读性,并在没有限制的情况下促进了进一步使用。这对于使数据更容易被数据消费者理解和使用至关重要。结合卫生部在数据开放领域的战略,将发布更多符合已实施标准的数据集,包括与 COVID-19 相关和不相关的主题。