School of Software Engineering, Beijing University of Technology, Beijing, China.
Interdisciplinary Laboratory of Digital Sciences, Centre national de la recherche scientifique, Université Paris-Saclay, Orsay, France.
J Med Internet Res. 2021 Aug 26;23(8):e26119. doi: 10.2196/26119.
Web-based social media provides common people with a platform to express their emotions conveniently and anonymously. There have been nearly 2 million messages in a particular Chinese social media data source, and several thousands more are generated each day. Therefore, it has become impossible to analyze these messages manually. However, these messages have been identified as an important data source for the prevention of suicide related to depression disorder.
We proposed in this paper a distant supervision approach to developing a system that can automatically identify textual comments that are indicative of a high suicide risk.
To avoid expensive manual data annotations, we used a knowledge graph method to produce approximate annotations for distant supervision, which provided a basis for a deep learning architecture that was built and refined by interactions with psychology experts. There were three annotation levels, as follows: free annotations (zero cost), easy annotations (by psychology students), and hard annotations (by psychology experts).
Our system was evaluated accordingly and showed that its performance at each level was promising. By combining our system with several important psychology features from user blogs, we obtained a precision of 80.75%, a recall of 75.41%, and an F1 score of 77.98% for the hardest test data.
In this paper, we proposed a distant supervision approach to develop an automatic system that can classify high and low suicide risk based on social media comments. The model can therefore provide volunteers with early warnings to prevent social media users from committing suicide.
基于网络的社交媒体为普通人提供了一个方便、匿名表达情感的平台。在一个特定的中文社交媒体数据源中,已经有近 200 万条消息,而且每天还会产生数千条新消息。因此,手动分析这些消息已经变得不可能。然而,这些消息已被确定为预防与抑郁症相关自杀的重要数据源。
本文提出了一种远程监督方法,用于开发一种能够自动识别提示高自杀风险的文本评论的系统。
为避免昂贵的手动数据标注,我们使用知识图谱方法为远程监督生成近似标注,为通过与心理学专家交互构建和完善的深度学习架构提供了基础。有三个标注级别,如下所示:自由标注(零成本)、容易标注(由心理学学生完成)和困难标注(由心理学专家完成)。
我们的系统进行了相应的评估,结果表明其在每个级别上的性能都很有前景。通过将我们的系统与用户博客中的几个重要心理学特征相结合,我们获得了最困难测试数据的 80.75%的准确率、75.41%的召回率和 77.98%的 F1 分数。
在本文中,我们提出了一种远程监督方法来开发一种能够根据社交媒体评论对高和低自杀风险进行分类的自动系统。因此,该模型可以为志愿者提供预警,以防止社交媒体用户自杀。