Shankar Ravi, Xu Qian, Bundele Anjali
Medical Affairs - Research Innovation & Enterprise, Alexandra Hospital, Singapore, Singapore.
School of Civil, Aerospace and Design Engineering, University of Bristol, Bristol, United Kingdom.
J Med Internet Res. 2025 May 15;27:e70128. doi: 10.2196/70128.
BACKGROUND: Patients with end-stage kidney disease undergoing dialysis face significant physical, psychological, and social challenges that impact their quality of life. Social media platforms such as X (formerly known as Twitter) have become important outlets for these patients to share experiences and exchange information. OBJECTIVE: This study aimed to uncover key themes, emotions, and challenges expressed by the dialysis community on X from April 2006 to August 2024 by leveraging natural language processing techniques, specifically sentiment analysis and topic modeling. METHODS: We collected 12,976 publicly available X posts related to dialysis using the platform's application programming interface version 2 and Python's Tweepy library. After rigorous preprocessing, 58.13% (7543/12,976) of the posts were retained for analysis. Sentiment analysis using the Valence Aware Dictionary and Sentiment Reasoner (VADER) model, which is a rule-based sentiment analyzer specifically attuned to social media content, classified the emotional tone of posts. VADER uses a human-curated lexicon that maps lexical features to sentiment scores, considering punctuation, capitalization, and modifiers. For topic modeling, posts with <50 tokens were removed, leaving 53.81% (4059/7543) of the posts, which were analyzed using latent Dirichlet allocation with coherence score optimization to identify the optimal number of topics (k=8). The analysis pipeline was implemented using Python's Natural Language Toolkit, Gensim, and scikit-learn libraries, with hyperparameter tuning to maximize model performance. RESULTS: Sentiment analysis revealed 49.2% (3711/7543) positive, 26.2% (1976/7543) negative, and 24.7% (1863/7543) neutral sentiment posts. Latent Dirichlet allocation topic modeling identified 8 key thematic clusters: medical procedures and outcomes (722/4059, 17.8% prevalence), daily life impact (666/4059, 16.4%), risks and complications (621/4059, 15.3%), patient education and support (544/4059, 13.4%), health care access and costs (499/4059, 12.3%), symptoms and side effects (442/4059, 10.9%), patient experiences and socioeconomic challenges (406/4059, 10%), and diet and fluid management (162/4059, 4%). Cross-analysis of topics and sentiment revealed that negative sentiment was highest for daily life impact (580/666, 87.1%) and socioeconomic challenges (145/406, 35.8%), whereas the education and support topic exhibited more positive sentiment (250/544, 46%). Topic coherence scores ranged from 0.38 to 0.52, with the medical procedures topic showing the highest semantic coherence. Intertopic distance mapping via multidimensional scaling revealed conceptual relationships between identified themes, with lifestyle impact and socioeconomic challenges clustering closely. Our longitudinal analysis demonstrated evolving discourse patterns, with technology-related discussions increasing by 24% in recent years, whereas financial concerns remained consistently prominent. CONCLUSIONS: This study provides a comprehensive, data-driven understanding of the complex lived experiences of patients undergoing dialysis shared on social media. The findings underscore the need for more holistic, patient-centered care models and policies that address the multidimensional challenges illuminated by patients' voices.
J Med Internet Res. 2025-5-15
J Med Internet Res. 2023-8-15
J Med Internet Res. 2022-11-18
Kidney Int Rep. 2024-10-28
NPJ Digit Med. 2024-11-12
Front Artif Intell. 2024-2-12
Health Informatics J. 2024
Clin J Am Soc Nephrol. 2024-1-1
Front Artif Intell. 2023-7-19