School of Software, Henan University, Kaifeng, China.
Institute of Data Science, City University of Macau, Taipa, Macau, China.
J Healthc Eng. 2021 Feb 23;2021:6664893. doi: 10.1155/2021/6664893. eCollection 2021.
There are a large number of symptom consultation texts in medical and healthcare Internet communities, and Chinese health segmentation is more complex, which leads to the low accuracy of the existing algorithms for medical text classification. The deep learning model has advantages in extracting abstract features of text effectively. However, for a large number of samples of complex text data, especially for words with ambiguous meanings in the field of Chinese medical diagnosis, the word-level neural network model is insufficient. Therefore, in order to solve the triage and precise treatment of patients, we present an improved Double Channel (DC) mechanism as a significant enhancement to Long Short-Term Memory (LSTM). In this DC mechanism, two channels are used to receive word-level and char-level embedding, respectively, at the same time. Hybrid attention is proposed to combine the current time output with the current time unit state and then using attention to calculate the weight. By calculating the probability distribution of each timestep input data weight, the weight score is obtained, and then weighted summation is performed. At last, the data input by each timestep is subjected to trade-off learning to improve the generalization ability of the model learning. Moreover, we conduct an extensive performance evaluation on two different datasets: cMedQA and Sentiment140. The experimental results show that the DC-LSTM model proposed in this paper has significantly superior accuracy and ROC compared with the basic CNN-LSTM model.
医疗和健康互联网社区中存在大量症状咨询文本,且中文分词更为复杂,这导致现有医学文本分类算法的准确率较低。深度学习模型在有效提取文本的抽象特征方面具有优势。然而,对于大量复杂文本数据的样本,尤其是对于中文医疗诊断领域中含义模糊的单词,词级神经网络模型就显得力不从心。因此,为了解决患者分诊和精准治疗的问题,我们提出了一种改进的双通道(DC)机制,作为对长短期记忆(LSTM)的重大改进。在这个 DC 机制中,两个通道同时接收词级和字符级的嵌入。混合注意力机制用于结合当前时间的输出和当前时间单元的状态,然后使用注意力计算权重。通过计算每个时间步输入数据权重的概率分布,得到权重得分,然后进行加权求和。最后,对每个时间步输入的数据进行权衡学习,以提高模型学习的泛化能力。此外,我们在两个不同的数据集 cMedQA 和 Sentiment140 上进行了广泛的性能评估。实验结果表明,与基本的 CNN-LSTM 模型相比,本文提出的 DC-LSTM 模型在准确率和 ROC 方面具有显著优势。
BMC Med Inform Decis Mak. 2019-4-4
Comput Intell Neurosci. 2022
Sci Rep. 2025-2-5
Comput Intell Neurosci. 2022
IEEE Trans Neural Netw Learn Syst. 2022-9
Comput Intell Neurosci. 2021
Sensors (Basel). 2020-9-28
Proc Conf Empir Methods Nat Lang Process. 2016-11