Pepin Kristen, Bologna Federica, Thalken Rosamond, Wilkens Matthew
Department of Obstetrics and Gynecology, Weill Cornell Medicine (Dr. Pepin), New York, New York.
Department of Information Science, Cornell University (Drs. Bologna, Thalken, Wilkens), Ithaca, New York.
J Minim Invasive Gynecol. 2024 Dec;31(12):1011-1018.e3. doi: 10.1016/j.jmig.2024.08.001. Epub 2024 Aug 10.
Use machine learning to characterize the content of endometriosis online community posts and comments.
Retrospective Descriptive Study.
Endometriosis online health communities (OHCs) on the platform Reddit.
Users of the endometriosis OHCs r/Endo and r/endometriosis.
Machine learning was used to analyze thousands of posts made to endometriosis OHCs. Content of posts and comments was interpreted using topic modeling, persona identification, and intent labeling. Measurements included baseline characteristics of users, posts, and comments to the OHCs. Machine-learning techniques; topic modeling, intent labeling, and persona identification were used to identify the most common topics of conversation, the intents behind the posts, and the subjects of people discussed in posts. System performance was assessed via accuracy at F-score.
A total of 34 715 posts and 353 162 comments responding to posts were evaluated. The topics most likely to be a subject of a post were menstruation (8%), sharing symptoms (8%), medical appointments (8%), medical story (9%), and empathy (7%). The majority of posts were written with the intent of seeking information about endometriosis (49%) or seeking the experiences of others with endometriosis (29%). Users expressed a strong preference for surgeons performing excision rather than ablation of endometriosis.
Endometriosis OHCs are mostly used to learn about symptoms of endometriosis and share one's medical experiences. Posts and comments from users highlight the need for more empathy in the clinical care of endometriosis and easier access for patients to high-quality information about endometriosis.
运用机器学习来描述子宫内膜异位症在线社区帖子及评论的内容。
回顾性描述性研究。
Reddit平台上的子宫内膜异位症在线健康社区(OHCs)。
子宫内膜异位症OHCs的r/Endo和r/endometriosis子版块的用户。
运用机器学习分析向子宫内膜异位症OHCs发布的数千篇帖子。通过主题建模、人物角色识别和意图标注来解读帖子及评论的内容。测量内容包括用户、帖子及对OHCs评论的基线特征。运用机器学习技术;主题建模、意图标注和人物角色识别来确定最常见的话题、帖子背后的意图以及帖子中所讨论人物的主题。通过F值的准确性来评估系统性能。
共评估了34715篇帖子以及对这些帖子的353162条评论。最有可能成为帖子主题的话题有月经(8%)、分享症状(8%)、医疗预约(8%)、医疗经历(9%)和同理心(7%)。大多数帖子的写作意图是寻求有关子宫内膜异位症的信息(49%)或寻求其他子宫内膜异位症患者的经历(29%)。用户强烈倾向于由外科医生进行子宫内膜异位症的切除而非消融手术。
子宫内膜异位症OHCs主要用于了解子宫内膜异位症的症状并分享个人医疗经历。用户的帖子和评论凸显了在子宫内膜异位症临床护理中需要更多同理心,以及患者更容易获取有关子宫内膜异位症的高质量信息。