Matsuda Shinichi, Aoki Kotonari, Tomizawa Shiho, Sone Masayoshi, Tanaka Riwa, Kuriki Hiroshi, Takahashi Yoichiro
Chugai Pharmaceutical Co Ltd, Drug Safety Data Management Department, Tokyo, Japan.
Chugai Pharmaceutical Co Ltd, Pharmacovigilance Department, Tokyo, Japan.
JMIR Public Health Surveill. 2017 Feb 24;3(1):e10. doi: 10.2196/publichealth.6872.
Although several reports have suggested that patient-generated data from Internet sources could be used to improve drug safety and pharmacovigilance, few studies have identified such data sources in Japan. We introduce a unique Japanese data source: tōbyōki, which translates literally as "an account of a struggle with disease."
The objective of this study was to evaluate the basic characteristics of the TOBYO database, a collection of tōbyōki blogs on the Internet, and discuss potential applications for pharmacovigilance.
We analyzed the overall gender and age distribution of the patient-generated TOBYO database and compared this with other external databases generated by health care professionals. For detailed analysis, we prepared separate datasets for blogs written by patients with depression and blogs written by patients with rheumatoid arthritis (RA), because these conditions were expected to entail subjective patient symptoms such as discomfort, insomnia, and pain. Frequently appearing medical terms were counted, and their variations were compared with those in an external adverse drug reaction (ADR) reporting database. Frequently appearing words regarding patients with depression and patients with RA were visualized using word clouds and word cooccurrence networks.
As of June 4, 2016, the TOBYO database comprised 54,010 blogs representing 1405 disorders. Overall, more entries were written by female bloggers (68.8%) than by male bloggers (30.8%). The most frequently observed disorders were breast cancer (4983 blogs), depression (3556), infertility (2430), RA (1118), and panic disorder (1090). Comparison of medical terms observed in tōbyōki blogs with those in an external ADR reporting database showed that subjective and symptomatic events and general terms tended to be frequently observed in tōbyōki blogs (eg, anxiety, headache, and pain), whereas events using more technical medical terms (eg, syndrome and abnormal laboratory test result) tended to be observed frequently in the ADR database. We also confirmed the feasibility of using visualization techniques to obtain insights from unstructured text-based tōbyōki blog data. Word clouds described the characteristics of each disorder, such as "sleeping" and "anxiety" in depression and "pain" and "painful" in RA.
Pharmacovigilance should maintain a strong focus on patients' actual experiences, concerns, and outcomes, and this approach can be expected to uncover hidden adverse event signals earlier and to help us understand adverse events in a patient-centered way. Patient-generated tōbyōki blogs in the TOBYO database showed unique characteristics that were different from the data in existing sources generated by health care professionals. Analysis of tōbyōki blogs would add value to the assessment of disorders with a high prevalence in women, psychiatric disorders in which subjective symptoms have important clinical meaning, refractory disorders, and other chronic disorders.
尽管有几份报告表明,来自互联网来源的患者生成数据可用于改善药物安全性和药物警戒,但在日本,很少有研究确定此类数据源。我们介绍一种独特的日本数据源:“病历”,直译为“与疾病斗争的记录”。
本研究的目的是评估TOBYO数据库(互联网上“病历”博客的集合)的基本特征,并讨论其在药物警戒中的潜在应用。
我们分析了患者生成的TOBYO数据库的总体性别和年龄分布,并将其与医疗保健专业人员生成的其他外部数据库进行比较。为了进行详细分析,我们为抑郁症患者撰写的博客和类风湿性关节炎(RA)患者撰写的博客准备了单独的数据集,因为这些疾病预计会出现诸如不适、失眠和疼痛等主观患者症状。统计经常出现的医学术语,并将其变体与外部药物不良反应(ADR)报告数据库中的变体进行比较。使用词云图和词共现网络对抑郁症患者和RA患者经常出现的词汇进行可视化。
截至2016年6月4日,TOBYO数据库包含代表1405种疾病的54,010篇博客。总体而言,女性博主撰写的条目(68.8%)多于男性博主(30.8%)。最常观察到的疾病是乳腺癌(4983篇博客)、抑郁症(3556篇)、不孕症(2430篇)、RA(1118篇)和惊恐障碍(1090篇)。将“病历”博客中观察到的医学术语与外部ADR报告数据库中的术语进行比较,结果表明,主观和症状性事件以及通用术语在“病历”博客中往往经常出现(例如,焦虑、头痛和疼痛),而使用更多专业医学术语的事件(例如,综合征和异常实验室检查结果)在ADR数据库中往往经常出现。我们还证实了使用可视化技术从基于非结构化文本的“病历”博客数据中获取见解的可行性。词云图描述了每种疾病的特征,例如抑郁症中的“睡眠”和“焦虑”以及RA中的“疼痛”和“痛苦”。
药物警戒应高度关注患者的实际经历、担忧和结果,预计这种方法能够更早地发现隐藏的不良事件信号,并帮助我们以患者为中心理解不良事件。TOBYO数据库中患者生成的“病历”博客显示出与医疗保健专业人员生成的现有来源数据不同的独特特征。对“病历”博客的分析将为评估女性中高患病率的疾病、主观症状具有重要临床意义的精神疾病、难治性疾病和其他慢性疾病增加价值。