通过聚类分析监测印度的新型冠状病毒（COVID-19）感染情况。

Monitoring Novel Corona Virus (COVID-19) Infections in India by Cluster Analysis.

作者信息

Kumar Sanjay

机构信息

Department of Statistics, Central University of Rajasthan, Bandarsindri, Kishangarh, Ajmer, Rajasthan 305817 India.

出版信息

Ann Data Sci. 2020;7(3):417-425. doi: 10.1007/s40745-020-00289-7. Epub 2020 May 19.

DOI:10.1007/s40745-020-00289-7

PMID:38624317

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7236640/

Abstract

It is a great challenge of identification as well as formation of groups of infectious disease data set. Data mining, a process of uncovering silent characteristics of big data is one of such techniques which have nowadays become more popular for treating massive volume of infectious disease data set. In the current study, we apply cluster analysis, one of the data mining techniques to classify real groups of infectious disease "novel corona virus disease (COVID-19)" data set of different states and union territories (UTs) in India according to their high similarity to each other. The results obtained permit us to have a sense of clusters of affected Indian states and UTs. The main objective of clustering in this study is to optimize monitoring techniques in affected states and UTs in India which will be very valuable to the government, doctors, the police and others involved in understanding seriousness of the spread of novel coronavirus (COVID-19) to improve government policies, decisions, medical facilities (ventilators, testing kits, masks etc.), treatment etc. to reduce number of infected and deceased persons.

摘要

识别和形成传染病数据集组是一项巨大的挑战。数据挖掘作为一种揭示大数据潜在特征的过程，是如今处理大量传染病数据集时更受欢迎的技术之一。在本研究中，我们应用数据挖掘技术之一的聚类分析，根据印度不同邦和联邦属地（UTs）的“新型冠状病毒病（COVID-19）”数据集彼此之间的高度相似性，对其实际组进行分类。所获得的结果使我们能够了解受影响的印度邦和联邦属地的聚类情况。本研究中聚类的主要目的是优化印度受影响邦和联邦属地的监测技术，这对政府、医生、警方及其他参与了解新型冠状病毒（COVID-19）传播严重性的人员非常有价值，有助于改进政府政策、决策、医疗设施（呼吸机、检测试剂盒、口罩等）、治疗等，以减少感染和死亡人数。

相似文献

Monitoring Novel Corona Virus (COVID-19) Infections in India by Cluster Analysis.通过聚类分析监测印度的新型冠状病毒（COVID-19）感染情况。

Ann Data Sci. 2020;7(3):417-425. doi: 10.1007/s40745-020-00289-7. Epub 2020 May 19.

Monitoring COVID-19 Cases and Vaccination in Indian States and Union Territories Using Unsupervised Machine Learning Algorithm.使用无监督机器学习算法监测印度各邦和中央直辖区的新冠肺炎病例及疫苗接种情况。

Ann Data Sci. 2023;10(4):967-989. doi: 10.1007/s40745-022-00404-w. Epub 2022 May 4.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Modeling and tracking Covid-19 cases using Big Data analytics on HPCC system platformm.在惠普高性能计算集群（HPCC）系统平台上使用大数据分析对新冠病毒疾病（Covid-19）病例进行建模和追踪。

J Big Data. 2021;8(1):33. doi: 10.1186/s40537-021-00423-z. Epub 2021 Feb 15.

Assessment of bio-medical waste before and during the emergency of novel Coronavirus disease pandemic in India: A gap analysis.评估印度新型冠状病毒病大流行前后的生物医学废物：差距分析。

Waste Manag Res. 2022 Apr;40(4):470-481. doi: 10.1177/0734242X211021473. Epub 2021 May 27.

Prediction of COVID-19 Trend in India and Its Four Worst-Affected States Using Modified SEIRD and LSTM Models.使用改进的SEIRD和LSTM模型预测印度及其四个受影响最严重邦的COVID-19趋势。

SN Comput Sci. 2021;2(3):224. doi: 10.1007/s42979-021-00598-5. Epub 2021 Apr 20.

COVID-19: Situation analysis in the district of Ernakulam.新型冠状病毒肺炎：埃纳库拉姆地区情况分析

J Family Med Prim Care. 2022 Jan;11(1):67-73. doi: 10.4103/jfmpc.jfmpc_469_21. Epub 2022 Jan 31.

Efficiency analysis in the management of COVID-19 pandemic in India based on data envelopment analysis.基于数据包络分析的印度新冠疫情管理效率分析

Curr Res Behav Sci. 2021 Nov;2:100063. doi: 10.1016/j.crbeha.2021.100063. Epub 2021 Oct 30.

Physical interventions to interrupt or reduce the spread of respiratory viruses.中断或减少呼吸道病毒传播的物理干预措施。

Cochrane Database Syst Rev. 2020 Nov 20;11(11):CD006207. doi: 10.1002/14651858.CD006207.pub5.

Tracking COVID-19 burden in India: A review using SMAART RAPID tracker.追踪印度的新冠疫情负担：使用SMAART RAPID追踪器的综述

Online J Public Health Inform. 2021 Mar 12;13(1):e4. doi: 10.5210/ojphi.v13i1.11456. eCollection 2021.

引用本文的文献

Impact of COVID-19 on Stock Indices Volatility: Long-Memory Persistence, Structural Breaks, or Both?新冠疫情对股票指数波动率的影响：长期记忆持续性、结构突变，还是两者皆有？

Ann Data Sci. 2022 Sep 12:1-28. doi: 10.1007/s40745-022-00446-0.

Intervention Analysis of COVID-19 Vaccination in Nigeria: The Naive Solution Versus Interrupted Time Series.尼日利亚新冠疫苗接种的干预分析：朴素解法与中断时间序列法

Ann Data Sci. 2023 Jan 19:1-26. doi: 10.1007/s40745-023-00462-8.

Ann Data Sci. 2023;10(4):967-989. doi: 10.1007/s40745-022-00404-w. Epub 2022 May 4.

Forecasting the Trends of Covid-19 and Causal Impact of Vaccines Using Bayesian Structural time Series and ARIMA.使用贝叶斯结构时间序列和自回归积分移动平均模型预测新冠疫情趋势及疫苗的因果影响。

Ann Data Sci. 2022;9(5):1025-1047. doi: 10.1007/s40745-022-00418-4. Epub 2022 Jun 13.

Bayesian Hierarchical Spatial Modeling of COVID-19 Cases in Bangladesh.孟加拉国新冠肺炎病例的贝叶斯分层空间建模

Ann Data Sci. 2023 Jan 22:1-27. doi: 10.1007/s40745-022-00461-1.

Interval-Valued Intuitionistic Fuzzy Soft Rough Approximation Operators and Their Applications in Decision Making Problem.区间值直觉模糊软粗糙近似算子及其在决策问题中的应用

Ann Data Sci. 2022;9(3):611-625. doi: 10.1007/s40745-022-00370-3. Epub 2022 Feb 7.

Effective Learning During COVID-19: Multilevel Covariates Matching and Propensity Score Matching.COVID-19期间的有效学习：多层次协变量匹配和倾向得分匹配

Ann Data Sci. 2022;9(5):967-982. doi: 10.1007/s40745-022-00392-x. Epub 2022 Apr 4.

Content Analysis of the Economic Problems of Covid-19 Disease on Businesses: A Case Study of Tehran Province, Iran.新冠疫情对企业经济问题的内容分析：以伊朗德黑兰省为例

Ann Data Sci. 2022;9(5):1069-1083. doi: 10.1007/s40745-022-00380-1. Epub 2022 Mar 19.

Dictionary Based Global Twitter Sentiment Analysis of Coronavirus (COVID-19) Effects and Response.基于词典的全球推特对冠状病毒（COVID-19）影响及应对的情绪分析

Ann Data Sci. 2022;9(1):175-186. doi: 10.1007/s40745-021-00358-5. Epub 2022 Jan 20.

Action for Action: Mad COVID-19, Falling Markets and Rising Volatility of SAARC Region.行动应对行动：疯狂的新冠疫情、下跌的市场与南盟地区不断上升的波动性

Ann Data Sci. 2022;9(1):33-54. doi: 10.1007/s40745-021-00349-6. Epub 2021 Jul 3.

本文引用的文献

What are the underlying transmission patterns of COVID-19 outbreak? An age-specific social contact characterization.新冠疫情的潜在传播模式是什么？特定年龄的社会接触特征分析。

EClinicalMedicine. 2020 Apr 18;22:100354. doi: 10.1016/j.eclinm.2020.100354. eCollection 2020 May.

Assessment of water quality using cluster analysis in coastal region of Mumbai, India.利用聚类分析评估印度孟买沿海地区的水质。

Environ Monit Assess. 2011 Jul;178(1-4):321-32. doi: 10.1007/s10661-010-1692-0. Epub 2010 Sep 14.

Multivariate statistical techniques for the evaluation of spatial and temporal variations in water quality of Gomti River (India)--a case study.用于评估印度贡蒂河水质时空变化的多元统计技术——案例研究

Water Res. 2004 Nov;38(18):3980-92. doi: 10.1016/j.watres.2004.06.011.

Using cluster analysis for medical resource decision making.运用聚类分析进行医疗资源决策。

Med Decis Making. 1995 Oct-Dec;15(4):333-47. doi: 10.1177/0272989X9501500404.

Cluster analysis and related techniques in medical research.医学研究中的聚类分析及相关技术。

Stat Methods Med Res. 1992;1(1):27-48. doi: 10.1177/096228029200100103.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验