School of Languages and Cultures, University of Sydney, Sydney 2006, Australia.
Department of Computer Science, City University of Hong Kong, Hong Kong 999077, China.
Int J Environ Res Public Health. 2021 Oct 13;18(20):10743. doi: 10.3390/ijerph182010743.
We aimed to develop a quantitative instrument to assist with the automatic evaluation of the actionability of mental healthcare information. We collected and classified two large sets of mental health information from certified mental health websites: generic and patient-specific mental healthcare information. We compared the performance of the optimised classifier with popular readability tools and non-optimised classifiers in predicting mental health information of high actionability for people with mental disorders. sensitivity of the classifier using both semantic and structural features as variables achieved statistically higher than that of the binary classifier using either semantic ( < 0.001) or structural features ( = 0.0010). The specificity of the optimized classifier was statistically higher than that of the classifier using structural variables ( = 0.002) and the classifier using semantic variables ( = 0.001). Differences in specificity between the full-variable classifier and the optimised classifier were statistically insignificant ( = 0.687). These findings suggest the optimised classifier using as few as 19 semantic-structural variables was the best-performing classifier. By combining insights of linguistics and statistical analyses, we effectively increased the interpretability and the diagnostic utility of the binary classifiers to guide the development, evaluation of the actionability and usability of mental healthcare information.
我们旨在开发一种定量工具,以协助自动评估心理健康保健信息的可操作性。我们从认证的心理健康网站收集和分类了两类大型心理健康信息:通用信息和患者特定的心理健康保健信息。我们比较了优化分类器与流行的可读性工具和非优化分类器在预测精神障碍患者高可操作性心理健康信息方面的性能。使用语义和结构特征作为变量的分类器的敏感性在统计学上高于使用语义特征(<0.001)或结构特征(=0.0010)的二进制分类器。优化分类器的特异性在统计学上高于使用结构变量的分类器(=0.002)和使用语义变量的分类器(=0.001)。全变量分类器和优化分类器之间的特异性差异在统计学上无显著意义(=0.687)。这些发现表明,使用 19 个语义-结构变量的优化分类器是性能最佳的分类器。通过结合语言学和统计分析的见解,我们有效地提高了二进制分类器的可解释性和诊断效用,以指导心理健康保健信息的开发、可操作性和可用性评估。