Ambalavan Ashwin Karthik, Moulahi Bilel, Azé Jérome, Bringay Sandra
Arizona State University, Tempe, AZ, USA.
Acelys Informatique, Montpellier, France.
Stud Health Technol Inform. 2019 Aug 21;264:50-54. doi: 10.3233/SHTI190181.
Suicide is a growing public health concern in online communities. In this paper, we analyze online communications on the topic of suicide on the social networking platform Reddit. We combine lexical text characteristics with semantic information to identify comments with features of suicide attempts and methods. We then develop a set of machine learning methods to automatically extract suicide methods and classify user comments. The performance of our classification methods varied across suicide experiences, with F1-scores up to 0.92 for "drugs" and greater than 0.82 for "hanging" and "other methods". Our exploratory analysis reveals that the most frequently reported suicide methods are drug overdose, hanging, and wrist-cutting.