Yamaguchi Atsuko, Queralt-Rosinach Núria
Tokyo City University, Setagaya, Tokyo 157-0087, Japan.
Leiden University Medical Center, Leiden, 2333 ZA, The Netherlands.
Genomics Inform. 2020 Jun;18(2):e17. doi: 10.5808/GI.2020.18.2.e17. Epub 2020 Jun 18.
The amount of content on social media platforms such as Twitter is expanding rapidly. Simultaneously, the lack of patient information seriously hinders the diagnosis and treatment of rare/intractable diseases. However, these patient communities are especially active on social media. Data from social media could serve as a source of patient-centric knowledge for these diseases complementary to the information collected in clinical settings and patient registries, and may also have potential for research use. To explore this question, we attempted to extract patient-centric knowledge from social media as a task for the 3-day Biomedical Linked Annotation Hackathon 6 (BLAH6). We selected amyotrophic lateral sclerosis and multiple sclerosis as use cases of rare and intractable diseases, respectively, and we extracted patient histories related to these health conditions from Twitter. Four diagnosed patients for each disease were selected. From the user timelines of these eight patients, we extracted tweets that might be related to health conditions. Based on our experiment, we show that our approach has considerable potential, although we identified problems that should be addressed in future attempts to mine information about rare/intractable diseases from Twitter.
推特等社交媒体平台上的内容量正在迅速增长。与此同时,患者信息的匮乏严重阻碍了罕见/疑难疾病的诊断和治疗。然而,这些患者群体在社交媒体上格外活跃。社交媒体数据可作为这些疾病以患者为中心的知识来源,补充临床环境和患者登记处收集的信息,并且可能也具有研究用途。为了探究这个问题,我们试图从社交媒体中提取以患者为中心的知识,作为为期三天的生物医学关联注释黑客马拉松6(BLAH6)的一项任务。我们分别选择肌萎缩侧索硬化症和多发性硬化症作为罕见病和疑难病的案例,并从推特中提取了与这些健康状况相关的患者病史。每种疾病选取了四名确诊患者。从这八名患者的用户动态中,我们提取了可能与健康状况相关的推文。基于我们的实验,我们表明我们的方法具有相当大的潜力,尽管我们也发现了一些问题,这些问题在未来从推特挖掘罕见/疑难疾病信息的尝试中需要加以解决。