Garg Sanjana, Taylor Jordan, El Sherief Mai, Kasson Erin, Aledavood Talayeh, Riordan Raven, Kaiser Nina, Cavazos-Rehg Patricia, De Choudhury Munmun
College of Computing, Georgia Institute of Technology, Atlanta, GA 30332, United States of America.
Department of Psychiatry, Washington University School of Medicine, St Louis, MO 63130, United States of America.
Internet Interv. 2021 Oct 20;26:100467. doi: 10.1016/j.invent.2021.100467. eCollection 2021 Dec.
Opioid misuse is a public health crisis in the US, and misuse of synthetic opioids such as fentanyl have driven the most recent waves of opioid-related deaths. Because those who misuse fentanyl are often a hidden and high-risk group, innovative methods for identifying individuals at risk for fentanyl misuse are needed. Machine learning has been used in the past to investigate discussions surrounding substance use on Reddit, and this study leverages similar techniques to identify risky content from discussions of fentanyl on this platform.
A codebook was developed by clinical domain experts with 12 categories indicative of fentanyl misuse risk, and this was used to manually label 391 Reddit posts and comments. Using this data, we built machine learning classification models to identify fentanyl risk.
Our machine learning risk model was able to detect posts or comments labeled as risky by our clinical experts with 76% accuracy and 76% sensitivity. Furthermore, we provide a vocabulary of community-specific, colloquial words for fentanyl and its analogues.
This study uses an interdisciplinary approach leveraging machine learning techniques and clinical domain expertise to automatically detect risky discourse, which may elicit and benefit from timely intervention. Moreover, our vocabulary of online terms for fentanyl and its analogues expands our understanding of online "street" nomenclature for opiates. Through an improved understanding of substance misuse risk factors, these findings allow for identification of risk concepts among those misusing fentanyl to inform outreach and intervention strategies tailored to this at-risk group.
阿片类药物滥用是美国的一个公共卫生危机,而芬太尼等合成阿片类药物的滥用引发了最近几波与阿片类药物相关的死亡潮。由于滥用芬太尼的人群往往是一个隐蔽且高危的群体,因此需要创新方法来识别有芬太尼滥用风险的个体。机器学习过去曾被用于调查Reddit上围绕药物使用的讨论,本研究利用类似技术从该平台上关于芬太尼的讨论中识别出有风险的内容。
临床领域专家制定了一个包含12个表明芬太尼滥用风险类别的编码手册,并用其对391条Reddit帖子和评论进行人工标注。利用这些数据,我们构建了机器学习分类模型来识别芬太尼风险。
我们的机器学习风险模型能够以76%的准确率和76%的灵敏度检测出被临床专家标注为有风险的帖子或评论。此外,我们提供了一份针对芬太尼及其类似物的特定社区、口语化词汇表。
本研究采用跨学科方法,利用机器学习技术和临床领域专业知识自动检测有风险的话语,这可能会引发及时干预并从中受益。此外,我们关于芬太尼及其类似物的在线术语词汇表扩展了我们对阿片类药物在线“街头”命名法的理解。通过更好地理解药物滥用风险因素,这些发现有助于识别芬太尼滥用者中的风险概念,为针对这一高危群体的外展和干预策略提供信息。