Barbieri Davide, Chawla Nitesh, Zaccagni Luciana, Grgurinović Tonći, Šarac Jelena, Čoklo Miran, Missoni Saša
Department of Biomedical and Specialty Surgical Sciences, Faculty of Medicine, Pharmacy and Prevention, University of Ferrara, 44121 Ferrara, Italy.
Interdisciplinary Center for Network Science and Applications, University of Notre Dame, Notre Dame, IN 46556, USA.
Int J Environ Res Public Health. 2020 Oct 28;17(21):7923. doi: 10.3390/ijerph17217923.
Cardiovascular diseases are the main cause of death worldwide. The aim of the present study is to verify the performances of a data mining methodology in the evaluation of cardiovascular risk in athletes, and whether the results may be used to support clinical decision making. Anthropometric (height and weight), demographic (age and sex) and biomedical (blood pressure and pulse rate) data of 26,002 athletes were collected in 2012 during routine sport medical examinations, which included electrocardiography at rest. Subjects were involved in competitive sport practice, for which medical clearance was needed. Outcomes were negative for the largest majority, as expected in an active population. Resampling was applied to balance positive/negative class ratio. A decision tree and logistic regression were used to classify individuals as either at risk or not. The receiver operating characteristic curve was used to assess classification performances. Data mining and resampling improved cardiovascular risk assessment in terms of increased area under the curve. The proposed methodology can be effectively applied to biomedical data in order to optimize clinical decision making, and-at the same time-minimize the amount of unnecessary examinations.
心血管疾病是全球主要的死因。本研究的目的是验证一种数据挖掘方法在评估运动员心血管风险方面的性能,以及其结果是否可用于支持临床决策。2012年在常规运动医学检查期间收集了26002名运动员的人体测量数据(身高和体重)、人口统计学数据(年龄和性别)以及生物医学数据(血压和脉搏率),这些检查包括静息心电图。受试者参与竞技体育活动,需要进行医学许可。正如在活跃人群中所预期的那样,大多数结果为阴性。采用重采样来平衡阳性/阴性类比例。使用决策树和逻辑回归将个体分类为有风险或无风险。使用受试者工作特征曲线来评估分类性能。数据挖掘和重采样在增加曲线下面积方面改善了心血管风险评估。所提出的方法可以有效地应用于生物医学数据,以优化临床决策,同时尽量减少不必要检查的数量。