• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ChatGPT 4.0对护理评估量表翻译的语义评估:词汇计量分析

Semantic Evaluation of Nursing Assessment Scales Translations by ChatGPT 4.0: A Lexicometric Analysis.

作者信息

Parozzi Mauro, Bozzetti Mattia, Lo Cascio Alessio, Napolitano Daniele, Pendoni Roberta, Marcomini Ilaria, Cangelosi Giovanni, Mancin Stefano, Bonacaro Antonio

机构信息

Medicine and Surgery Department, University of Parma, Via Gramsci 14, 43126 Parma, Italy.

Direction of Health Professions, ASST Cremona, 26100 Cremona, Italy.

出版信息

Nurs Rep. 2025 Jun 11;15(6):211. doi: 10.3390/nursrep15060211.

DOI:10.3390/nursrep15060211
PMID:40559502
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12196477/
Abstract

: The use of standardized assessment tools within the nursing care process is a globally established practice, widely recognized as a foundation for evidence-based evaluation. Accurate translation is essential to ensure their correct and consistent clinical use. While effective, traditional procedures are time-consuming and resource-intensive, leading to increasing interest in whether artificial intelligence can assist or streamline this process for nursing researchers. Therefore, this study aimed to assess the translation's quality of nursing assessment scales performed by ChatGPT 4.0. : A total of 31 nursing rating scales with 772 items were translated from English to Italian using two different prompts, and then underwent a deep lexicometric analysis. To assess the semantic accuracy of the translations the Sentence-BERT, Jaccard similarity, TF-IDF cosine similarity, and Overlap ratio were used. Sensitivity, specificity, AUC, and AUROC were calculated to assess the quality of the translation classification. Paired-sample -tests were conducted to compare the similarity scores. : The Maastricht prompt produced translations that are marginally but consistently more semantically and lexically faithful to the original. While all differences were found to be statistically significant, the corresponding effect sizes indicate that the advantage of the Maastricht prompt is slight but consistent across all measures. The sensitivity of the prompts was 0.929 (92.9%) for York and 0.932 (93.2%) for Maastricht. Specificity and precision remained for both at 1.000. : Findings highlight the potential of prompt engineering as a low-cost, effective method to enhance translation outcomes. Nonetheless, as translation represents only a preliminary step in the full validation process, further studies should investigate the integration of AI-assisted translation within the broader framework of instrument adaptation and validation.

摘要

在护理过程中使用标准化评估工具是一种全球公认的做法,被广泛视为循证评估的基础。准确翻译对于确保其在临床中的正确和一致使用至关重要。虽然传统方法有效,但耗时且资源密集,这使得人们越来越关注人工智能是否可以协助或简化护理研究人员的这一过程。因此,本研究旨在评估ChatGPT 4.0对护理评估量表的翻译质量。

使用两种不同的提示将总共31个护理评定量表(共772个项目)从英语翻译成意大利语,然后进行深入的词汇分析。为了评估翻译的语义准确性,使用了句子BERT、杰卡德相似度、TF-IDF余弦相似度和重叠率。计算敏感性、特异性、AUC和AUROC以评估翻译分类的质量。进行配对样本检验以比较相似度得分。

马斯特里赫特提示生成的翻译在语义和词汇上对原文的忠实度略高且较为一致。虽然所有差异均具有统计学意义,但相应的效应大小表明,马斯特里赫特提示的优势微小但在所有指标上均保持一致。约克提示的敏感性为0.929(92.9%),马斯特里赫特提示的敏感性为0.932(93.2%)。两者的特异性和精确性均为1.000。

研究结果凸显了提示工程作为一种低成本、有效方法来提高翻译效果的潜力。尽管如此,由于翻译只是完整验证过程的初步步骤,进一步的研究应探讨在工具改编和验证的更广泛框架内整合人工智能辅助翻译的情况。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/886a/12196477/9e6dffa8fe7d/nursrep-15-00211-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/886a/12196477/533862584f85/nursrep-15-00211-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/886a/12196477/9e6dffa8fe7d/nursrep-15-00211-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/886a/12196477/533862584f85/nursrep-15-00211-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/886a/12196477/9e6dffa8fe7d/nursrep-15-00211-g002.jpg

相似文献

1
Semantic Evaluation of Nursing Assessment Scales Translations by ChatGPT 4.0: A Lexicometric Analysis.ChatGPT 4.0对护理评估量表翻译的语义评估:词汇计量分析
Nurs Rep. 2025 Jun 11;15(6):211. doi: 10.3390/nursrep15060211.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗:一项系统综述
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.
4
The development of a novel, standardized, norm-referenced Arabic Discourse Assessment Tool (ADAT), including an examination of psychometric properties of discourse measures in aphasia.开发一种新型、标准化、基于常模的阿拉伯语语篇评估工具(ADAT),包括评估失语症患者语篇测量的心理测量特性。
Int J Lang Commun Disord. 2024 Sep-Oct;59(5):2103-2117. doi: 10.1111/1460-6984.13083. Epub 2024 Jun 18.
5
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
6
The measurement of collaboration within healthcare settings: a systematic review of measurement properties of instruments.医疗机构内协作的测量:对测量工具属性的系统评价
JBI Database System Rev Implement Rep. 2016 Apr;14(4):138-97. doi: 10.11124/JBISRIR-2016-2159.
7
The agreement of phonetic transcriptions between paediatric speech and language therapists transcribing a disordered speech sample.儿科言语和语言治疗师转写语音样本的音标转录的一致性。
Int J Lang Commun Disord. 2024 Sep-Oct;59(5):1981-1995. doi: 10.1111/1460-6984.13043. Epub 2024 Jun 8.
8
Intravenous magnesium sulphate and sotalol for prevention of atrial fibrillation after coronary artery bypass surgery: a systematic review and economic evaluation.静脉注射硫酸镁和索他洛尔预防冠状动脉搭桥术后房颤:系统评价与经济学评估
Health Technol Assess. 2008 Jun;12(28):iii-iv, ix-95. doi: 10.3310/hta12280.
9
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
10
Eliciting adverse effects data from participants in clinical trials.从临床试验参与者中获取不良反应数据。
Cochrane Database Syst Rev. 2018 Jan 16;1(1):MR000039. doi: 10.1002/14651858.MR000039.pub2.

引用本文的文献

1
Correction: Parozzi et al. Semantic Evaluation of Nursing Assessment Scales Translations by ChatGPT 4.0: A Lexicometric Analysis. 2025, , 211.更正:帕罗齐等人。ChatGPT 4.0对护理评估量表翻译的语义评估:词汇计量分析。2025年,第211页。
Nurs Rep. 2025 Jul 10;15(7):251. doi: 10.3390/nursrep15070251.

本文引用的文献

1
The impact of a risk assessment tool on hospital pressure injury prevalence and prevention: a quantitative pre-post evaluation.一种风险评估工具对医院压力性损伤发生率及预防的影响:一项前后对照定量评估。
Int J Nurs Stud Adv. 2025 May 1;8:100342. doi: 10.1016/j.ijnsa.2025.100342. eCollection 2025 Jun.
2
Integrating human expertise & automated methods for a dynamic and multi-parametric evaluation of large language models' feasibility in clinical decision-making.整合人类专业知识和自动化方法,对大型语言模型在临床决策中的可行性进行动态和多参数评估。
Int J Med Inform. 2024 Aug;188:105501. doi: 10.1016/j.ijmedinf.2024.105501. Epub 2024 May 26.
3
Presenting artificial intelligence, deep learning, and machine learning studies to clinicians and healthcare stakeholders: an introductory reference with a guideline and a Clinical AI Research (CAIR) checklist proposal.
向临床医生和医疗保健利益相关者介绍人工智能、深度学习和机器学习研究:一份带有指南和临床人工智能研究 (CAIR) 清单提案的入门参考资料。
Acta Orthop. 2021 Oct;92(5):513-525. doi: 10.1080/17453674.2021.1918389. Epub 2021 May 14.
4
Use of Standardized and Non-Standardized Tools for Measuring the Risk of Falls and Independence in Clinical Practice.在临床实践中使用标准化和非标准化工具测量跌倒风险和独立性。
Int J Environ Res Public Health. 2021 Mar 20;18(6):3226. doi: 10.3390/ijerph18063226.
5
Use of Standardized Assessment Tools to Improve the Effectiveness of Palliative Care Rounds: A Quality Improvement Initiative.使用标准化评估工具提高姑息治疗查房的有效性:一项质量改进举措。
J Palliat Care. 2017 Jul/Oct;32(3-4):134-140. doi: 10.1177/0825859717740051. Epub 2017 Nov 3.
6
English language skills requirements for internationally educated nurses working in the care industry: Barriers to UK registration or institutionalised discrimination?
Int J Nurs Stud. 2016 Feb;54:1-4. doi: 10.1016/j.ijnurstu.2014.12.006. Epub 2015 Jan 3.
7
Evaluation of online machine translation by nursing users.护理用户对在线机器翻译的评估。
Comput Inform Nurs. 2013 Aug;31(8):382-7. doi: 10.1097/NXN.0b013e3182999dc2.
8
Use of online machine translation for nursing literature: a questionnaire-based survey.在线机器翻译在护理文献中的应用:一项基于问卷调查的研究。
Open Nurs J. 2013;7:22-8. doi: 10.2174/1874434601307010022. Epub 2013 Feb 1.
9
Online machine translation use with nursing literature: evaluation method and usability.在线机器翻译在护理文献中的应用:评估方法与可用性
Comput Inform Nurs. 2013 Feb;31(2):59-65. doi: 10.1097/NXN.0b013e3182701056.
10
Guidelines for the process of cross-cultural adaptation of self-report measures.自我报告测量工具跨文化适应过程指南。
Spine (Phila Pa 1976). 2000 Dec 15;25(24):3186-91. doi: 10.1097/00007632-200012150-00014.