• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

检测关于抑郁症的患者健康信息神经机器翻译中的症状错误:开发可解释的贝叶斯机器学习分类器

Detecting Symptom Errors in Neural Machine Translation of Patient Health Information on Depressive Disorders: Developing Interpretable Bayesian Machine Learning Classifiers.

作者信息

Xie Wenxiu, Ji Meng, Zhao Mengdan, Zhou Tianqi, Yang Fan, Qian Xiaobo, Chow Chi-Yin, Lam Kam-Yiu, Hao Tianyong

机构信息

Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong, SAR China.

School of Languages and Cultures, The University of Sydney, Darlington, NSW, Australia.

出版信息

Front Psychiatry. 2021 Oct 21;12:771562. doi: 10.3389/fpsyt.2021.771562. eCollection 2021.

DOI:10.3389/fpsyt.2021.771562
PMID:34744846
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8566668/
Abstract

Due to its convenience, wide availability, low usage cost, neural machine translation (NMT) has increasing applications in diverse clinical settings and web-based self-diagnosis of diseases. Given the developing nature of NMT tools, this can pose safety risks to multicultural communities with limited bilingual skills, low education, and low health literacy. Research is needed to scrutinise the reliability, credibility, usability of automatically translated patient health information. We aimed to develop high-performing Bayesian machine learning classifiers to assist clinical professionals and healthcare workers in assessing the quality and usability of NMT on depressive disorders. The tool did not require any prior knowledge from frontline health and medical professionals of the target language used by patients. We used Relevance Vector Machine (RVM) to increase generalisability and clinical interpretability of classifiers. It is a typical sparse Bayesian classifier less prone to overfitting with small training datasets. We optimised RVM by leveraging automatic recursive feature elimination and expert feature refinement from the perspective of health linguistics. We evaluated the diagnostic utility of the Bayesian classifier under different probability cut-offs in terms of sensitivity, specificity, positive and negative likelihood ratios against clinical thresholds for diagnostic tests. Finally, we illustrated interpretation of RVM tool in clinic using Bayes' nomogram. After automatic and expert-based feature optimisation, the best-performing RVM classifier (RVM_DUFS12) gained the highest AUC (0.8872) among 52 competing models with distinct optimised, normalised features sets. It also had statistically higher sensitivity and specificity compared to other models. We evaluated the diagnostic utility of the best-performing model using Bayes' nomogram: it had a positive likelihood ratio (LR+) of 4.62 (95% C.I.: 2.53, 8.43), and the associated posterior probability (odds) was 83% (5.0) (95% C.I.: 73%, 90%), meaning that approximately 10 in 12 English texts with positive test are likely to contain information that would cause clinically significant conceptual errors if translated by Google; it had a negative likelihood ratio (LR-) of 0.18 (95% C.I.: 0.10,0.35) and associated posterior probability (odds) was 16% (0.2) (95% C.I: 10%, 27%), meaning that about 10 in 12 English texts with negative test can be safely translated using Google.

摘要

由于其便利性、广泛可用性和低使用成本,神经机器翻译(NMT)在各种临床环境和基于网络的疾病自我诊断中的应用越来越多。鉴于NMT工具的发展性质,这可能会给双语技能有限、教育程度低和健康素养低的多元文化社区带来安全风险。需要进行研究以审查自动翻译的患者健康信息的可靠性、可信度和可用性。我们旨在开发高性能的贝叶斯机器学习分类器,以协助临床专业人员和医护人员评估NMT在抑郁症方面的质量和可用性。该工具不需要一线健康和医疗专业人员具备患者所使用目标语言的任何先验知识。我们使用相关向量机(RVM)来提高分类器的泛化能力和临床可解释性。它是一种典型的稀疏贝叶斯分类器,在处理小训练数据集时不太容易出现过拟合。我们从健康语言学的角度利用自动递归特征消除和专家特征细化来优化RVM。我们根据针对诊断测试临床阈值的敏感性、特异性、阳性和阴性似然比,评估了不同概率截断值下贝叶斯分类器的诊断效用。最后,我们使用贝叶斯列线图说明了RVM工具在临床中的解释。经过基于自动和专家的特征优化后,性能最佳的RVM分类器(RVM_DUFS12)在52个具有不同优化、归一化特征集的竞争模型中获得了最高的AUC(0.8872)。与其他模型相比,它在统计学上也具有更高的敏感性和特异性。我们使用贝叶斯列线图评估了性能最佳模型的诊断效用:其阳性似然比(LR+)为4.62(95%置信区间:2.53,8.43),相关的后验概率(比值)为83%(5.0)(95%置信区间:73%,90%),这意味着如果由谷歌翻译,每12篇测试呈阳性的英文文本中约有10篇可能包含会导致具有临床意义的概念性错误的信息;其阴性似然比(LR-)为0.18(95%置信区间:0.10,0.35),相关的后验概率(比值)为16%(0.2)(95%置信区间:10%,27%),这意味着每12篇测试呈阴性的英文文本中约有10篇可以安全地使用谷歌进行翻译。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc22/8566668/2f6b21c76b20/fpsyt-12-771562-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc22/8566668/d9070d5b2e15/fpsyt-12-771562-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc22/8566668/ed859990238c/fpsyt-12-771562-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc22/8566668/2f6b21c76b20/fpsyt-12-771562-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc22/8566668/d9070d5b2e15/fpsyt-12-771562-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc22/8566668/ed859990238c/fpsyt-12-771562-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc22/8566668/2f6b21c76b20/fpsyt-12-771562-g0003.jpg

相似文献

1
Detecting Symptom Errors in Neural Machine Translation of Patient Health Information on Depressive Disorders: Developing Interpretable Bayesian Machine Learning Classifiers.检测关于抑郁症的患者健康信息神经机器翻译中的症状错误:开发可解释的贝叶斯机器学习分类器
Front Psychiatry. 2021 Oct 21;12:771562. doi: 10.3389/fpsyt.2021.771562. eCollection 2021.
2
Forecasting Erroneous Neural Machine Translation of Disease Symptoms: Development of Bayesian Probabilistic Classifiers for Cross-Lingual Health Translation.预测疾病症状的神经机器翻译错误:用于跨语言健康翻译的贝叶斯概率分类器的开发。
Int J Environ Res Public Health. 2021 Sep 19;18(18):9873. doi: 10.3390/ijerph18189873.
3
Forecasting the Suitability of Online Mental Health Information for Effective Self-Care Developing Machine Learning Classifiers Using Natural Language Features.使用自然语言特征预测在线心理健康信息对有效自我保健的适用性:开发机器学习分类器。
Int J Environ Res Public Health. 2021 Sep 24;18(19):10048. doi: 10.3390/ijerph181910048.
4
Probabilistic Prediction of Nonadherence to Psychiatric Disorder Medication from Mental Health Forum Data: Developing and Validating Bayesian Machine Learning Classifiers.从心理健康论坛数据中预测精神障碍药物治疗不依从的概率:开发和验证贝叶斯机器学习分类器。
Comput Intell Neurosci. 2022 Apr 15;2022:6722321. doi: 10.1155/2022/6722321. eCollection 2022.
5
Supporting Risk-Aware Use of Online Translation Tools in Delivering Mental Healthcare Services among Spanish-Speaking Populations.支持西班牙语人群的精神卫生保健服务中使用在线翻译工具并降低风险。
Comput Intell Neurosci. 2021 Oct 28;2021:1011197. doi: 10.1155/2021/1011197. eCollection 2021.
6
Predicting Risks of Machine Translations of Public Health Resources by Developing Interpretable Machine Learning Classifiers.通过开发可解释的机器学习分类器来预测公共卫生资源机器翻译的风险。
Int J Environ Res Public Health. 2021 Aug 20;18(16):8789. doi: 10.3390/ijerph18168789.
7
Developing Machine Learning and Statistical Tools to Evaluate the Accessibility of Public Health Advice on Infectious Diseases among Vulnerable People.开发机器学习和统计工具,以评估弱势群体获取传染病公共卫生建议的可及性。
Comput Intell Neurosci. 2021 Dec 17;2021:1916690. doi: 10.1155/2021/1916690. eCollection 2021.
8
Predicting the Easiness and Complexity of English Health Materials for International Tertiary Students With Linguistically Enhanced Machine Learning Algorithms: Development and Validation Study.使用语言增强机器学习算法预测国际大学生英语健康材料的难易程度:开发与验证研究
JMIR Med Inform. 2021 Oct 26;9(10):e25110. doi: 10.2196/25110.
9
Predicting Health Material Accessibility: Development of Machine Learning Algorithms.预测卫生材料可及性:机器学习算法的开发
JMIR Med Inform. 2021 Sep 1;9(9):e29175. doi: 10.2196/29175.
10
Automatic Diagnosis of Mental Healthcare Information Actionability: Developing Binary Classifiers.自动诊断精神保健信息可操作性:开发二进制分类器。
Int J Environ Res Public Health. 2021 Oct 13;18(20):10743. doi: 10.3390/ijerph182010743.

本文引用的文献

1
A Pragmatic Assessment of Google Translate for Emergency Department Instructions.Google 翻译在急诊科医嘱中的实用性评估。
J Gen Intern Med. 2021 Nov;36(11):3361-3365. doi: 10.1007/s11606-021-06666-z. Epub 2021 Mar 5.
2
Assessment of the Readability of Online Patient Education Material from Major Geriatric Associations.评估主要老年病学协会在线患者教育材料的可读性。
J Am Geriatr Soc. 2021 Apr;69(4):1051-1056. doi: 10.1111/jgs.16960. Epub 2020 Nov 25.
3
Bayes' theorem, COVID19, and screening tests.贝叶斯定理、COVID19 和筛查试验。
Am J Emerg Med. 2020 Oct;38(10):2011-2013. doi: 10.1016/j.ajem.2020.06.054. Epub 2020 Jun 25.
4
Assessing the Use of Google Translate for Spanish and Chinese Translations of Emergency Department Discharge Instructions.评估 Google 翻译在西班牙语和中文急诊出院指导翻译中的使用。
JAMA Intern Med. 2019 Apr 1;179(4):580-582. doi: 10.1001/jamainternmed.2018.7653.
5
A prospective observational study of machine translation software to overcome the challenge of including ethnic diversity in healthcare research.一项关于机器翻译软件的前瞻性观察性研究,旨在克服医疗保健研究中纳入种族多样性所面临的挑战。
Nurs Open. 2015 Jan 29;2(1):14-23. doi: 10.1002/nop2.13. eCollection 2015 Apr.
6
Third Molars on the Internet: A Guide for Assessing Information Quality and Readability.互联网上的第三磨牙:信息质量与可读性评估指南
Interact J Med Res. 2015 Oct 6;4(4):e19. doi: 10.2196/ijmr.4712.
7
Addressing language barriers to healthcare in India.解决印度医疗保健中的语言障碍问题。
Natl Med J India. 2013 Jul-Aug;26(4):236-8.
8
Operator functional state classification using least-square support vector machine based recursive feature elimination technique.基于最小二乘支持向量机的递归特征消除技术的操作员功能状态分类。
Comput Methods Programs Biomed. 2014;113(1):101-15. doi: 10.1016/j.cmpb.2013.09.007. Epub 2013 Sep 19.
9
Using machine translation in clinical practice.在临床实践中使用机器翻译。
Can Fam Physician. 2013 Apr;59(4):382-3.
10
The two-step Fagan's nomogram: ad hoc interpretation of a diagnostic test result without calculation.两步法费根氏列线图:无需计算即可对诊断测试结果进行特殊解读。
Evid Based Med. 2013 Aug;18(4):125-8. doi: 10.1136/eb-2013-101243. Epub 2013 Mar 6.