Xiao Xiao, Wang Shuqin, Jiang Feng, Qi Tingyue, Wang Wei
Department of Ultrasound, The Affiliated Hospital of Yangzhou University, Yangzhou University, Yangzhou, Jiangsu, China.
Department of Information Engineering, Yangzhou University, Yangzhou, Jiangsu, China.
Front Comput Neurosci. 2024 Dec 18;18:1468519. doi: 10.3389/fncom.2024.1468519. eCollection 2024.
It is a universal phenomenon for patients who do not know which clinical department to register in large general hospitals. Although triage nurses can help patients, due to the larger number of patients, they have to stand in a queue for minutes to consult. Recently, there have already been some efforts to devote deep-learning techniques or pre-trained language models (PLMs) to triage recommendations. However, these methods may suffer two main limitations: (1) These methods typically require a certain amount of labeled or unlabeled data for model training, which are not always accessible and costly to acquire. (2) These methods have not taken into account the distortion of semantic feature structure and the loss of category discriminability in the model training. To overcome these limitations, in this study, we propose a cross-domain text classification method based on prompt-tuning, which can classify patients' questions or texts about their symptoms into several given categories to give suggestions on which kind of consulting room patients could choose. Specifically, first, different prompt templates are manually crafted based on various data contents, embedding source domain information into the prompt templates to generate another text with similar semantic feature structures for performing classification tasks. Then, five different strategies are employed to expand the label word space for modifying prompts, and the integration of these strategies is used as the final verbalizer. The extensive experiments on Chinese Triage datasets demonstrate that our method achieved state-of-the-art performance.
在大型综合医院中,患者不知道该挂哪个临床科室是一种普遍现象。尽管分诊护士可以帮助患者,但由于患者数量众多,他们不得不排队几分钟进行咨询。最近,已经有人致力于将深度学习技术或预训练语言模型(PLM)用于分诊推荐。然而,这些方法可能存在两个主要局限性:(1)这些方法通常需要一定数量的标记或未标记数据用于模型训练,而这些数据并不总是容易获取且获取成本高昂。(2)这些方法在模型训练中没有考虑语义特征结构的扭曲和类别可辨别性的损失。为了克服这些局限性,在本研究中,我们提出了一种基于提示调整的跨域文本分类方法,该方法可以将患者关于其症状的问题或文本分类为几个给定类别,以给出患者可以选择哪种咨询室的建议。具体来说,首先,根据各种数据内容手动制作不同的提示模板,将源域信息嵌入到提示模板中,以生成具有相似语义特征结构的另一个文本用于执行分类任务。然后,采用五种不同的策略来扩展标签词空间以修改提示,并将这些策略的整合用作最终的语言生成器。在中文分诊数据集上的广泛实验表明,我们的方法取得了领先的性能。