Forecasting Erroneous Neural Machine Translation of Disease Symptoms: Development of Bayesian Probabilistic Classifiers for Cross-Lingual Health Translation.

Affiliations

School of Languages and Cultures, University of Sydney, Sydney 2006, Australia.

Department of Computer Science, City University of Hong Kong, Hong Kong 518057, China.

Publication information

Int J Environ Res Public Health. 2021 Sep 19;18(18):9873. doi: 10.3390/ijerph18189873.

DOI:10.3390/ijerph18189873
PMID:34574795
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8466164/
Abstract

BACKGROUND

Machine translation (MT) technologies have increasing applications in healthcare. Despite their convenience, cost-effectiveness, and constantly improved accuracy, research shows that the use of MT tools in medical or healthcare settings poses risks to vulnerable populations.

OBJECTIVES

We aimed to develop two machine learning classifiers, multinomial naive Bayes (MNB) and relevance vector machine (RVM), to forecast nuanced yet significant MT errors of clinical symptoms in Chinese neural MT outputs.

METHODS

We screened human translations of MSD Manuals for information on self-diagnosis of infectious diseases and produced their matching neural MT outputs for subsequent pairwise quality assessment by trained bilingual health researchers. Different feature optimisation and normalisation techniques were used to identify the best feature set.
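The abstract does not say which normalisation techniques were compared. As a minimal sketch only, the following shows two common choices, min-max scaling of a feature column and L2 (unit-length) scaling of a feature vector. Reading the abstract's "L normalisation" as L2 is an assumption, and the function names and sample values are hypothetical rather than taken from the study.

```python
import math

def min_max_scale(column):
    """Rescale one feature column to the range [0, 1]."""
    lo, hi = min(column), max(column)
    if hi == lo:
        # A constant column carries no information; map it to zeros.
        return [0.0 for _ in column]
    return [(v - lo) / (hi - lo) for v in column]

def l2_normalise(row):
    """Scale one feature vector so its Euclidean (L2) norm is 1."""
    norm = math.sqrt(sum(v * v for v in row))
    if norm == 0:
        # An all-zero vector cannot be scaled; return a copy unchanged.
        return list(row)
    return [v / norm for v in row]

# Hypothetical semantic-feature counts for one text (not from the paper).
features = [3, 4]
print(l2_normalise(features))
print(min_max_scale([2, 4, 6]))
```

Min-max scaling works per feature across all texts, whereas L2 scaling works per text across all features, so the two can lead a classifier to different feature rankings.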

RESULTS

The RVM classifier using optimised, normalised (L normalisation) semantic features achieved the highest sensitivity, specificity, AUC, and accuracy. MNB achieved similarly high performance using the same optimised semantic feature set. The best probability threshold of the best-performing RVM classifier was 0.6, with a very high positive likelihood ratio (LR+) of 27.82 (95% CI: 3.99, 193.76) and a low negative likelihood ratio (LR-) of 0.19 (95% CI: 0.08, 0.46). These values suggest that the model has high diagnostic utility for predicting the probability of erroneous MT of disease symptoms. Such predictions could help counter potentially inaccurate self-diagnosis among vulnerable people who lack adequate medical knowledge or the ability to judge the reliability of MT outputs.
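To make the likelihood ratios concrete, here is a minimal sketch of how LR+ and LR- follow from a confusion matrix once predicted probabilities are cut at a threshold such as 0.6. The probabilities and labels are invented for illustration and do not reproduce the study's figures. The log-based confidence interval is a standard approximation and is assumed here, not confirmed as the authors' method.

```python
import math

def confusion_counts(probs, labels, threshold=0.6):
    """Count TP, FP, FN, TN, flagging a text as erroneous when prob >= threshold."""
    tp = fp = fn = tn = 0
    for p, y in zip(probs, labels):
        flagged = p >= threshold
        if flagged and y:
            tp += 1
        elif flagged and not y:
            fp += 1
        elif y:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn

def likelihood_ratios(tp, fp, fn, tn, z=1.96):
    """Return (LR+, CI) and (LR-, CI); assumes every cell count is non-zero."""
    sens = tp / (tp + fn)
    spec = tn / (tn + fp)
    lr_pos = sens / (1 - spec)
    lr_neg = (1 - sens) / spec
    # Standard errors of ln(LR) under the usual log-normal approximation.
    se_pos = math.sqrt(1 / tp - 1 / (tp + fn) + 1 / fp - 1 / (fp + tn))
    se_neg = math.sqrt(1 / fn - 1 / (tp + fn) + 1 / tn - 1 / (fp + tn))
    ci_pos = (lr_pos * math.exp(-z * se_pos), lr_pos * math.exp(z * se_pos))
    ci_neg = (lr_neg * math.exp(-z * se_neg), lr_neg * math.exp(z * se_neg))
    return (lr_pos, ci_pos), (lr_neg, ci_neg)

# Hypothetical classifier outputs; label 1 means the MT output contains an error.
probs = [0.9, 0.8, 0.7, 0.65, 0.4, 0.3, 0.2, 0.1, 0.55, 0.62]
labels = [1, 1, 1, 0, 1, 0, 0, 0, 0, 1]

counts = confusion_counts(probs, labels, threshold=0.6)
(lr_pos, ci_pos), (lr_neg, ci_neg) = likelihood_ratios(*counts)
print(counts, round(lr_pos, 2), round(lr_neg, 2))
```

An LR+ well above 1 means a flagged translation is much more likely to be truly erroneous than not, and an LR- well below 1 means an unflagged translation is unlikely to be erroneous. With small samples the interval around LR+ becomes very wide, as in the reported 3.99 to 193.76.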

CONCLUSION

Our study demonstrated the viability, flexibility, and efficiency of introducing machine learning models to help promote risk-aware use of MT technologies to achieve optimal, safer digital health outcomes for vulnerable people.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34ab/8466164/dd344f9ca889/ijerph-18-09873-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34ab/8466164/88390662cffc/ijerph-18-09873-g001.jpg

Similar articles

1
Forecasting Erroneous Neural Machine Translation of Disease Symptoms: Development of Bayesian Probabilistic Classifiers for Cross-Lingual Health Translation.
Int J Environ Res Public Health. 2021 Sep 19;18(18):9873. doi: 10.3390/ijerph18189873.
2
Detecting Symptom Errors in Neural Machine Translation of Patient Health Information on Depressive Disorders: Developing Interpretable Bayesian Machine Learning Classifiers.
Front Psychiatry. 2021 Oct 21;12:771562. doi: 10.3389/fpsyt.2021.771562. eCollection 2021.
3
Predicting Risks of Machine Translations of Public Health Resources by Developing Interpretable Machine Learning Classifiers.
Int J Environ Res Public Health. 2021 Aug 20;18(16):8789. doi: 10.3390/ijerph18168789.
4
Forecasting the Suitability of Online Mental Health Information for Effective Self-Care Developing Machine Learning Classifiers Using Natural Language Features.
Int J Environ Res Public Health. 2021 Sep 24;18(19):10048. doi: 10.3390/ijerph181910048.
5
Supporting Risk-Aware Use of Online Translation Tools in Delivering Mental Healthcare Services among Spanish-Speaking Populations.
Comput Intell Neurosci. 2021 Oct 28;2021:1011197. doi: 10.1155/2021/1011197. eCollection 2021.
6
Probabilistic Prediction of Nonadherence to Psychiatric Disorder Medication from Mental Health Forum Data: Developing and Validating Bayesian Machine Learning Classifiers.
Comput Intell Neurosci. 2022 Apr 15;2022:6722321. doi: 10.1155/2022/6722321. eCollection 2022.
7
Developing Machine Learning and Statistical Tools to Evaluate the Accessibility of Public Health Advice on Infectious Diseases among Vulnerable People.
Comput Intell Neurosci. 2021 Dec 17;2021:1916690. doi: 10.1155/2021/1916690. eCollection 2021.
8
Machine Translation of Public Health Materials From English to Chinese: A Feasibility Study.
JMIR Public Health Surveill. 2015 Nov 17;1(2):e17. doi: 10.2196/publichealth.4779. eCollection 2015 Jul-Dec.
9
Screening of COVID-19 based on the extracted radiomics features from chest CT images.
J Xray Sci Technol. 2021;29(2):229-243. doi: 10.3233/XST-200831.
10
Predicting Health Material Accessibility: Development of Machine Learning Algorithms.
JMIR Med Inform. 2021 Sep 1;9(9):e29175. doi: 10.2196/29175.

Cited by

1
The Technology-Oriented Pathway for Auxiliary Diagnosis in the Digital Health Age: A Self-Adaptive Disease Prediction Model.
Int J Environ Res Public Health. 2022 Sep 30;19(19):12509. doi: 10.3390/ijerph191912509.
2
Analysis of Language Translations of State Governments' Coronavirus Disease 2019 Vaccine Websites.
Health Equity. 2022 Sep 22;6(1):738-749. doi: 10.1089/heq.2021.0189. eCollection 2022.
