# #ChronicPain: Automated Building of a Chronic Pain Cohort from Twitter Using Machine Learning

Author information

Sarker Abeed, Lakamana Sahithi, Guo Yuting, Ge Yao, Leslie Abimbola, Okunromade Omolola, Gonzalez-Polledo Elena, Perrone Jeanmarie, McKenzie-Brown Anne Marie

Affiliations

Department of Biomedical Informatics, School of Medicine, Emory University, Atlanta, GA, USA.

Department of Radiology, Robert Larner College of Medicine, University of Vermont, Burlington, VT, USA.

Publication information

Health Data Sci. 2023;3. doi: 10.34133/hds.0078. Epub 2023 Jul 4.

DOI:10.34133/hds.0078

PMID:38333075

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC10852024/

Abstract

BACKGROUND

Due to the high burden of chronic pain, and the detrimental public health consequences of its treatment with opioids, there is a high-priority need to identify effective alternative therapies. Social media is a potentially valuable resource for knowledge about self-reported therapies by chronic pain sufferers.

METHODS

We attempted to (a) verify the presence of large-scale chronic pain-related chatter on Twitter, (b) develop natural language processing and machine learning methods for automatically detecting self-disclosures of chronic pain, (c) collect longitudinal data posted by the disclosing users, and (d) semiautomatically analyze the types of chronic pain-related information these users reported. We collected data using chronic pain-related hashtags and keywords and manually annotated 4,998 posts to indicate whether they were self-reports of chronic pain experiences. We trained and evaluated several state-of-the-art supervised text classification models and deployed the best-performing classifier. We collected all publicly available posts from detected cohort members and conducted manual and natural language processing-driven descriptive analyses.
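The collection and cohort-detection steps described above can be sketched roughly as follows. This is a minimal illustration, not the authors' pipeline: the search terms in `PAIN_TERMS`, the function names, and the `toy_self_report_classifier` stand-in (a simple first-person pattern used in place of a trained supervised model) are all assumptions, since the abstract does not list the actual keywords or implementation details.

```python
import re
from typing import Callable, Iterable, Set, Tuple

# Hypothetical search terms; the abstract does not list the real hashtags or keywords.
PAIN_TERMS = ("#chronicpain", "chronic pain", "fibromyalgia")


def matches_pain_terms(text: str, terms: Iterable[str] = PAIN_TERMS) -> bool:
    """Return True if the post mentions any of the search terms (case-insensitive)."""
    lowered = text.lower()
    return any(term in lowered for term in terms)


def toy_self_report_classifier(text: str) -> bool:
    """Stand-in for a trained classifier: flags posts written in the first person.

    This is an assumption for illustration only; a real system would call a
    supervised model trained on annotated posts.
    """
    return re.search(r"\b(i|my|me)\b", text.lower()) is not None


def build_cohort(
    posts: Iterable[Tuple[str, str]],
    is_self_report: Callable[[str], bool],
) -> Set[str]:
    """Collect the IDs of users with at least one post classified as a self-report.

    `posts` is an iterable of (user_id, text) pairs.
    """
    cohort = set()
    for user_id, text in posts:
        if matches_pain_terms(text) and is_self_report(text):
            cohort.add(user_id)
    return cohort


sample_posts = [
    ("u1", "My #ChronicPain flares up every winter"),
    ("u2", "New study on chronic pain treatments released"),
    ("u3", "I love hiking"),
]
cohort = build_cohort(sample_posts, toy_self_report_classifier)
```

Once a user is in the cohort, their remaining public posts would be collected for the longitudinal analysis; that retrieval step depends on the platform's API and is left out here.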

RESULTS

Interannotator agreement for the binary annotation was 0.82 (Cohen's kappa). The RoBERTa model performed best (F score: 0.84; 95% confidence interval: 0.80 to 0.89), and we used this model to classify all collected unlabeled posts. We discovered 22,795 self-reported chronic pain sufferers and collected over 3 million of their past posts. Further analyses revealed information about, but not limited to, alternative treatments, patient sentiments about treatments, side effects, and self-management strategies.
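For readers unfamiliar with the two reported metrics, the following sketch shows how Cohen's kappa and an F score with a confidence interval can be computed. The abstract does not say how the interval was derived or which F score variant was used, so the percentile bootstrap and the positive-class F1 here are assumptions, and the labels are toy data rather than the study's annotations.

```python
import random
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa: agreement between two annotators, corrected for chance."""
    n = len(labels_a)
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    counts_a, counts_b = Counter(labels_a), Counter(labels_b)
    # Chance agreement: probability both annotators pick the same label independently.
    expected = sum(counts_a[k] * counts_b[k] for k in counts_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both annotators used a single identical label throughout
    return (observed - expected) / (1 - expected)


def f1_positive(gold, pred, positive=1):
    """F1 score for the positive class; returns 0.0 when it is undefined."""
    tp = sum(g == positive and p == positive for g, p in zip(gold, pred))
    fp = sum(g != positive and p == positive for g, p in zip(gold, pred))
    fn = sum(g == positive and p != positive for g, p in zip(gold, pred))
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def bootstrap_ci(gold, pred, n_resamples=1000, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for the positive-class F1."""
    rng = random.Random(seed)
    n = len(gold)
    scores = []
    for _ in range(n_resamples):
        idx = [rng.randrange(n) for _ in range(n)]  # resample with replacement
        scores.append(f1_positive([gold[i] for i in idx], [pred[i] for i in idx]))
    scores.sort()
    lower = scores[int((alpha / 2) * n_resamples)]
    upper = scores[int((1 - alpha / 2) * n_resamples) - 1]
    return lower, upper


# Toy labels (1 = self-report, 0 = not a self-report); not data from the study.
annotator_a = [1, 1, 0, 0, 1, 0, 1, 0, 1, 1]
annotator_b = [1, 1, 0, 0, 1, 0, 0, 0, 1, 1]

kappa = cohen_kappa(annotator_a, annotator_b)
f1 = f1_positive(annotator_a, annotator_b)
ci_low, ci_high = bootstrap_ci(annotator_a, annotator_b)
```

In the toy data the annotators agree on 9 of 10 posts and chance agreement is 0.5, so kappa works out to 0.8. Treating the first list as gold labels and the second as predictions gives an F1 of 10/11.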

CONCLUSION

Our social media-based approach will produce a large cohort that grows automatically over time, and the data can be leveraged to identify effective opioid-alternative therapies for diverse chronic pain types.
