Kusama Hiroki, Takahashi Yoshimitsu, Orihara Shunichiro, Adachi Kayo, Ishizuka Yumiko, Semba Ryoko, Shima Hidetaka, Horimoto Yoshiya, Kaise Hiroshi, Taguri Masataka, Inoue Sho, Nakayama Takeo, Ishikawa Takashi
Department of Breast Surgical Oncology, Tokyo Medical University Hospital, 6-7-1, Nishishinjuku, Shinjuku-ku, Tokyo, 160-0023, Japan, 81 3-3342-6111.
Department of Implementation Science in Public Health, School of Public Health, Kyoto University, Kyoto, Japan.
JMIR Infodemiology. 2025 Jun 11;5:e66416. doi: 10.2196/66416.
There is breast cancer-related medical information on social media, but there is no established method for objectively evaluating the quality of this information. Principles for Health-Related Information on Social Media (PRHISM) is a newly developed tool for objectively assessing the quality of health-related information on social media; however, there have been no reports evaluating its reliability and validity.
The purpose of this study was to statistically examine the reliability and validity of PRHISM using videos about breast cancer treatment on YouTube (Google).
In total, 60 YouTube videos were selected on January 5, 2024, with the Japanese words for "breast cancer," "treatment," and "chemotherapy," and assessed by 6 Japanese physicians with expertise in breast cancer. These evaluators independently evaluated the videos using PRHISM and an established tool for assessing the quality of health-related information, DISCERN, as well as through subjective assessments. We calculated interrater and intrarater agreement among evaluators with CIs, measuring agreement using weighted Cohen kappa.
The interrater agreement for PRHISM overall quality was κ=0.52 (90% CI 0.49-0.55), indicating that the expected level of agreement, statistically defined by the lower limit of the 90% CI exceeding 0.53, was not achieved. However, PRHISM demonstrated higher agreement compared with DISCERN overall quality, which had a κ=0.45 (90% CI 0.41-0.48). In terms of validity, the intrarater agreement between PRHISM and subjective assessments by breast experts was κ=0.37 (95% CI 0.14-0.60), while DISCERN showed an agreement of κ=0.27 (95% CI 0.07-0.48), indicating fair agreement and no significant difference in validity.
PRHISM has demonstrated sufficient reliability and validity for evaluating the quality of health-related information on YouTube, making it a promising new metric. To further enhance objectivity, it is necessary to explore the use of artificial intelligence and other approaches.
社交媒体上存在与乳腺癌相关的医学信息,但尚无客观评估此类信息质量的既定方法。《社交媒体健康相关信息原则》(PRHISM)是一种新开发的用于客观评估社交媒体上健康相关信息质量的工具;然而,尚无评估其可靠性和有效性的报告。
本研究旨在使用YouTube(谷歌)上关于乳腺癌治疗的视频,对PRHISM的可靠性和有效性进行统计学检验。
2024年1月5日,总共选择了60个带有日语词汇“乳腺癌”“治疗”和“化疗”的YouTube视频,并由6名具有乳腺癌专业知识的日本医生进行评估。这些评估者使用PRHISM和一种既定的健康相关信息质量评估工具DISCERN以及通过主观评估对视频进行独立评估。我们计算了评估者之间的评分者间一致性和评分者内一致性,并给出置信区间,使用加权科恩kappa系数测量一致性。
PRHISM整体质量的评分者间一致性为κ=0.52(90%置信区间0.49 - 0.55),这表明由90%置信区间下限超过0.53统计定义的预期一致性水平未达到。然而,与DISCERN整体质量相比,PRHISM表现出更高的一致性,DISCERN的κ=0.45(90%置信区间0.41 - 0.48)。在有效性方面,PRHISM与乳腺癌专家主观评估之间的评分者内一致性为κ=0.37(95%置信区间0.14 - 0.60),而DISCERN的一致性为κ=0.27(95%置信区间0.07 - 0.48),表明一致性尚可且有效性无显著差异。
PRHISM已证明在评估YouTube上健康相关信息质量方面具有足够的可靠性和有效性,使其成为一个有前景的新指标。为进一步提高客观性,有必要探索使用人工智能等方法。