Jiang Jinglu, Cameron Ann-Frances, Yang Ming
Binghamton University, Binghamton, NY, United States.
HEC Montreal, Montreal, QC, Canada.
JMIR Med Inform. 2020 Feb 18;8(2):e16765. doi: 10.2196/16765.
Online health care consultation has become increasingly popular and is considered a potential solution to health care resource shortages and inefficient resource distribution. However, many online medical consultation platforms are struggling to attract and retain patients who are willing to pay, and health care providers on the platform have the additional challenge of standing out in a crowd of physicians who can provide comparable services.
This study used machine learning (ML) approaches to mine massive service data to (1) identify the important features that are associated with patient payment, as opposed to free trial-only appointments; (2) explore the relative importance of these features; and (3) understand how these features interact, linearly or nonlinearly, in relation to payment.
The dataset is from the largest China-based online medical consultation platform, which covers 1,582,564 consultation records between patient-physician pairs from 2009 to 2018. ML techniques (ie, hyperparameter tuning, model training, and validation) were applied with four classifiers-logistic regression, decision tree (DT), random forest, and gradient boost-to identify the most important features and their relative importance for predicting paid vs free-only appointments.
After applying the ML feature selection procedures, we identified 11 key features on the platform, which are potentially useful to predict payment. For the binary ML classification task (paid vs free services), the 11 features as a whole system achieved very good prediction performance across all four classifiers. DT analysis further identified five distinct subgroups of patients delineated by five top-ranked features: previous offline connection, total dialog, physician response rate, patient privacy concern, and social return. These subgroups interact with the physician differently, resulting in different payment outcomes.
The results show that, compared with features related to physician reputation, service-related features, such as service delivery quality (eg, consultation dialog intensity and physician response rate), patient source (eg, online vs offline returning patients), and patient involvement (eg, provide social returns and reveal previous treatment), appear to contribute more to the patient's payment decision. Promoting multiple timely responses in patient-provider interactions is essential to encourage payment.
在线医疗咨询日益普及,被视为解决医疗资源短缺和资源分配低效问题的潜在方案。然而,许多在线医疗咨询平台难以吸引并留住愿意付费的患者,而且平台上的医疗服务提供者还面临着在众多能提供类似服务的医生中脱颖而出的额外挑战。
本研究采用机器学习(ML)方法挖掘海量服务数据,以(1)识别与患者付费相关而非仅与免费试用预约相关的重要特征;(2)探究这些特征的相对重要性;(3)了解这些特征如何与付费呈线性或非线性交互。
数据集来自中国最大的在线医疗咨询平台,涵盖2009年至2018年医患对之间的1582564条咨询记录。运用ML技术(即超参数调整、模型训练和验证),使用四个分类器——逻辑回归、决策树(DT)、随机森林和梯度提升——来识别最重要的特征及其对预测付费预约与仅免费预约的相对重要性。
应用ML特征选择程序后,我们在该平台上识别出11个关键特征,这些特征可能有助于预测付费情况。对于二元ML分类任务(付费服务与免费服务),这11个特征作为一个整体系统在所有四个分类器中均取得了非常好的预测性能。DT分析进一步识别出由五个排名靠前的特征所划分的五个不同患者亚组:既往线下联系、对话总数、医生回复率、患者隐私关注和社会回报。这些亚组与医生的互动方式不同,导致不同的付费结果。
结果表明,与医生声誉相关的特征相比,服务相关特征,如服务提供质量(如咨询对话强度和医生回复率)、患者来源(如线上与线下复诊患者)和患者参与度(如提供社会回报和透露既往治疗情况),似乎对患者的付费决策贡献更大。在医患互动中促进多次及时回复对于鼓励付费至关重要。