Computational Health Science Research Group, Department of Psychology, Brigham Young University, Provo, UT, United States.
JMIR Ment Health. 2016 May 16;3(2):e21. doi: 10.2196/mental.4822.
One of the leading causes of death in the United States (US) is suicide and new methods of assessment are needed to track its risk in real time.
Our objective is to validate the use of machine learning algorithms for Twitter data against empirically validated measures of suicidality in the US population.
Using a machine learning algorithm, the Twitter feeds of 135 Mechanical Turk (MTurk) participants were compared with validated, self-report measures of suicide risk.
Our findings show that people who are at high suicidal risk can be easily differentiated from those who are not by machine learning algorithms, which accurately identify the clinically significant suicidal rate in 92% of cases (sensitivity: 53%, specificity: 97%, positive predictive value: 75%, negative predictive value: 93%).
Machine learning algorithms are efficient in differentiating people who are at a suicidal risk from those who are not. Evidence for suicidality can be measured in nonclinical populations using social media data.
在美国,自杀是导致死亡的主要原因之一,因此需要新的评估方法来实时跟踪其风险。
我们的目的是验证机器学习算法在推特数据上的应用,以评估美国人群的自杀倾向。
使用机器学习算法,将 135 名 Mechanical Turk(MTurk)参与者的推特消息与经过验证的、自我报告的自杀风险评估方法进行比较。
我们的研究结果表明,机器学习算法可以轻松区分高自杀风险人群和低自杀风险人群,能够准确识别 92%的临床显著自杀率(敏感性:53%,特异性:97%,阳性预测值:75%,阴性预测值:93%)。
机器学习算法在区分有自杀风险的人和没有自杀风险的人方面非常有效。可以使用社交媒体数据来衡量非临床人群的自杀倾向。