• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

电子认知行为疗法失眠对话系统:大语言模型及失眠治疗适应策略的比较评估

eCBT-I dialogue system: a comparative evaluation of large language models and adaptation strategies for insomnia treatment.

作者信息

Bao Xueying, Zhu Xingyu, Yang Dongren, Lou Hao, Wang Ruoyun, Wu Yutong, Li Wenhui, Xia Yu, Zeng Li, Pan Yingying, Wang Xiqin, Zhang Xian, Ling Cheng, Ling Youhui, Zhang Yan, Zhao Qi, Yang Mei

机构信息

Department of Intensive Care Unit, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, 325000, China.

Department of Physics, Research Institute for Biomimetics and Soft Matter, Fujian Provincial Key Laboratory for Soft Functional Materials Research, Xiamen University, Xiamen, 361005, China.

出版信息

J Transl Med. 2025 Aug 5;23(1):862. doi: 10.1186/s12967-025-06871-y.

DOI:10.1186/s12967-025-06871-y
PMID:40764995
Abstract

BACKGROUND

Traditional face-to-face mental health treatments are often limited by time and space. Thanks to the development of advanced large language models (LLMs), digital mental health treatments can provide personalized advice to patients and improve compliance. However, in the field of CBT-I, specialized, real-time interactive dialogue platforms have not been fully developed.

METHODS

Our research team construct an eCBT-I intelligent dialogue system based on the RAG architecture, aiming to provide an example of the deep integration of CBT-I knowledge graphs and large language models. Furthermore, in order to optimize the performance of the system's core language generation module on the insomnia dialogue dataset, we systematically include eight mainstream large language models (ChatGLM2-6b, ChatGLM3-6b, Baichuan-7b, Baichuan-13b, Qwen-7b, Qwen2-7b, Llama-2-7b-chat-hf, and Llama-2-13b-chat-hf) and three adaptation strategies (LoRA, QLoRA, and Freeze). We screen the suitability of the three adaptation strategies for the eight major language models in the group, and thus determine the best adaptation method for each language model to maximize performance improvement. The eight best-adapted language models are then evaluated in three dimensions to compare their performance on the small sample sleep dialogue dataset and the C-eval dataset. All subjects that evaluated under experimental conditions are historical medical records and patients who did not exhibit delirium and had normal language expression abilities.

RESULTS

Through the matching of model characteristics to adaptation strategies and the horizontal evaluation of multiple models, we compare the contribution of different fine-tuning strategies to the performance improvement of different language models on the small insomnia dialogue dataset, and finally determine that Qwen2-7b (Freeze) is the model with the best performance on the insomnia dialogue dataset.

CONCLUSIONS

This study effectively integrates the CBT-I knowledge graph with the large language model through the RAG architecture, which improves the professionalism of the eCBT-I intelligent dialogue system. The systematic fine-tuning method selection process and the confirmation of the optimal model not only improve the adaptability of the large language model in the CBT-I task, but also provide a useful paradigm for AI applications in medical subfields with resource constraints and difficulties in data collection, laying a solid foundation for more accurate and efficient digital CBT-I clinical practice in the future.

摘要

背景

传统的面对面心理健康治疗往往受到时间和空间的限制。得益于先进的大语言模型(LLMs)的发展,数字心理健康治疗可以为患者提供个性化建议并提高依从性。然而,在失眠认知行为疗法(CBT-I)领域,专门的实时交互式对话平台尚未得到充分发展。

方法

我们的研究团队构建了一个基于检索增强生成(RAG)架构的eCBT-I智能对话系统,旨在提供一个CBT-I知识图谱与大语言模型深度整合的示例。此外,为了优化系统核心语言生成模块在失眠对话数据集上的性能,我们系统地纳入了八个主流大语言模型(ChatGLM2-6B、ChatGLM3-6B、百川-7B、百川-13B、通义千问-7B、通义千问2-7B、Llama-2-7B-chat-hf和Llama-2-13B-chat-hf)以及三种适配策略(LoRA、QLoRA和Freeze)。我们筛选这三种适配策略对组内八个主要语言模型的适用性,从而确定每个语言模型的最佳适配方法,以最大限度地提高性能。然后从三个维度对八个最佳适配语言模型进行评估,以比较它们在小样本睡眠对话数据集和C-eval数据集上的性能。所有在实验条件下评估的受试者均为历史病历以及未出现谵妄且语言表达能力正常的患者。

结果

通过模型特征与适配策略的匹配以及多个模型的横向评估,我们比较了不同微调策略对小失眠对话数据集上不同语言模型性能提升的贡献,最终确定通义千问2-7B(Freeze)是在失眠对话数据集上性能最佳的模型。

结论

本研究通过RAG架构有效地将CBT-I知识图谱与大语言模型整合在一起,提高了eCBT-I智能对话系统的专业性。系统的微调方法选择过程以及最优模型的确定不仅提高了大语言模型在CBT-I任务中的适应性,还为资源受限且数据收集困难的医学子领域中的人工智能应用提供了有用的范例,为未来更准确、高效的数字CBT-I临床实践奠定了坚实基础。

相似文献

1
eCBT-I dialogue system: a comparative evaluation of large language models and adaptation strategies for insomnia treatment.电子认知行为疗法失眠对话系统:大语言模型及失眠治疗适应策略的比较评估
J Transl Med. 2025 Aug 5;23(1):862. doi: 10.1186/s12967-025-06871-y.
2
Network meta-analysis comparing the effectiveness of a prescription digital therapeutic for chronic insomnia to medications and face-to-face cognitive behavioral therapy in adults.网络荟萃分析:比较一种治疗成人慢性失眠的处方数字疗法与药物及面对面认知行为疗法的有效性
Curr Med Res Opin. 2022 Oct;38(10):1727-1738. doi: 10.1080/03007995.2022.2108616. Epub 2022 Aug 23.
3
Evaluating and Enhancing Japanese Large Language Models for Genetic Counseling Support: Comparative Study of Domain Adaptation and the Development of an Expert-Evaluated Dataset.评估和增强用于遗传咨询支持的日本大语言模型:领域适应的比较研究与专家评估数据集的开发
JMIR Med Inform. 2025 Jan 16;13:e65047. doi: 10.2196/65047.
4
Development of a Voice-Activated Virtual Assistant to Improve Insomnia Among Young Adult Cancer Survivors: Mixed Methods Feasibility and Acceptability Study.开发语音激活虚拟助手以改善年轻成年癌症幸存者的失眠问题:混合方法可行性和可接受性研究
JMIR Form Res. 2025 Mar 10;9:e64869. doi: 10.2196/64869.
5
Psychological therapies for panic disorder with or without agoraphobia in adults: a network meta-analysis.成人伴或不伴有广场恐惧症的惊恐障碍的心理治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2016 Apr 13;4(4):CD011004. doi: 10.1002/14651858.CD011004.pub2.
6
Implementing Large Language Models in Health Care: Clinician-Focused Review With Interactive Guideline.在医疗保健中应用大语言模型:以临床医生为重点的回顾与交互式指南
J Med Internet Res. 2025 Jul 11;27:e71916. doi: 10.2196/71916.
7
Performance of ChatGPT-4o and Four Open-Source Large Language Models in Generating Diagnoses Based on China's Rare Disease Catalog: Comparative Study.ChatGPT-4o与四个开源大语言模型基于中国罕见病目录生成诊断的性能:比较研究
J Med Internet Res. 2025 Jun 18;27:e69929. doi: 10.2196/69929.
8
Evaluating and Improving Syndrome Differentiation Thinking Ability in Large Language Models: Method Development Study.评估和提高大语言模型中的辨证思维能力:方法开发研究
JMIR Med Inform. 2025 Jun 20;13:e75103. doi: 10.2196/75103.
9
Effects of cognitive behavioral therapy for insomnia (CBT-I) on quality of life: A systematic review and meta-analysis.失眠认知行为疗法(CBT-I)对生活质量的影响:系统评价和荟萃分析。
Sleep Med Rev. 2022 Aug;64:101646. doi: 10.1016/j.smrv.2022.101646. Epub 2022 May 10.
10
Digital cognitive behavioural therapy for patients with insomnia and depression: A systematic review and meta-analysis.数字认知行为疗法治疗失眠伴抑郁患者的系统评价和荟萃分析。
J Psychiatr Ment Health Nurs. 2024 Aug;31(4):654-667. doi: 10.1111/jpm.13024. Epub 2024 Jan 16.

本文引用的文献

1
A Machine Learning Model for Diagnosing Opportunistic Infections in HIV Patients: Broad Applicability Across Infection Types.一种用于诊断HIV患者机会性感染的机器学习模型:在各种感染类型中的广泛适用性。
J Cell Mol Med. 2025 Mar;29(6):e70497. doi: 10.1111/jcmm.70497.
2
Investigation of cell development and tissue structure network based on natural Language processing of scRNA-seq data.基于单细胞RNA测序数据自然语言处理的细胞发育和组织结构网络研究
J Transl Med. 2025 Mar 4;23(1):264. doi: 10.1186/s12967-025-06263-2.
3
An active inference strategy for prompting reliable responses from large language models in medical practice.
一种用于在医学实践中促使大语言模型给出可靠回答的主动推理策略。
NPJ Digit Med. 2025 Feb 22;8(1):119. doi: 10.1038/s41746-025-01516-2.
4
A Multi-Task Self-Supervised Strategy for Predicting Molecular Properties and FGFR1 Inhibitors.一种用于预测分子性质和FGFR1抑制剂的多任务自监督策略
Adv Sci (Weinh). 2025 Apr;12(13):e2412987. doi: 10.1002/advs.202412987. Epub 2025 Feb 8.
5
Design and implementation of a radiomic-driven intelligent dental hospital diversion system utilizing multilabel imaging data.利用多标签成像数据的基于放射组学的智能牙科医院分流系统的设计与实现
J Transl Med. 2024 Dec 20;22(1):1123. doi: 10.1186/s12967-024-05958-2.
6
MSA-MaxNet: Multi-Scale Attention Enhanced Multi-Axis Vision Transformer Network for Medical Image Segmentation.MSA-MaxNet:用于医学图像分割的多尺度注意力增强多轴视觉Transformer网络
J Cell Mol Med. 2024 Dec;28(24):e70315. doi: 10.1111/jcmm.70315.
7
LMCD-OR: a large-scale, multilevel categorized diagnostic dataset for oral radiography.LMCD-OR:用于口腔放射学的大规模多层次分类诊断数据集。
J Transl Med. 2024 Oct 14;22(1):930. doi: 10.1186/s12967-024-05741-3.
8
Artificial intelligence-based evaluation of prognosis in cirrhosis.基于人工智能的肝硬化预后评估。
J Transl Med. 2024 Oct 14;22(1):933. doi: 10.1186/s12967-024-05726-2.
9
Predicting the potential associations between circRNA and drug sensitivity using a multisource feature-based approach.基于多源特征的方法预测 circRNA 与药物敏感性的潜在关联。
J Cell Mol Med. 2024 Oct;28(19):e18591. doi: 10.1111/jcmm.18591.
10
PepNet: an interpretable neural network for anti-inflammatory and antimicrobial peptides prediction using a pre-trained protein language model.PepNet:一种基于预训练蛋白质语言模型的可解释神经网络,用于预测抗炎和抗菌肽。
Commun Biol. 2024 Sep 28;7(1):1198. doi: 10.1038/s42003-024-06911-1.