• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Entity perception of Two-Step-Matching framework for public opinions.舆情两步匹配框架的实体感知
J Saf Sci Resil. 2020 Sep;1(1):36-43. doi: 10.1016/j.jnlssr.2020.06.005. Epub 2020 Jun 30.
2
Extracting clinical named entity for pituitary adenomas from Chinese electronic medical records.从中文电子病历中提取垂体腺瘤的临床命名实体。
BMC Med Inform Decis Mak. 2022 Mar 23;22(1):72. doi: 10.1186/s12911-022-01810-z.
3
A deep learning model incorporating part of speech and self-matching attention for named entity recognition of Chinese electronic medical records.基于词性和自匹配注意力的深度学习模型在中文电子病历命名实体识别中的应用。
BMC Med Inform Decis Mak. 2019 Apr 9;19(Suppl 2):65. doi: 10.1186/s12911-019-0762-7.
4
Where No Universal Health Care Identifier Exists: Comparison and Determination of the Utility of Score-Based Persons Matching Algorithms Using Demographic Data.在不存在通用医疗保健标识符的情况下:使用人口统计学数据对基于分数的人员匹配算法的效用进行比较和判定。
JMIR Public Health Surveill. 2018 Dec 13;4(4):e10436. doi: 10.2196/10436.
5
A hybrid approach for named entity recognition in Chinese electronic medical record.中文电子病历命名实体识别的混合方法。
BMC Med Inform Decis Mak. 2019 Apr 9;19(Suppl 2):64. doi: 10.1186/s12911-019-0767-2.
6
Document-level attention-based BiLSTM-CRF incorporating disease dictionary for disease named entity recognition.基于文档级注意力的 BiLSTM-CRF 结合疾病词典的疾病命名实体识别。
Comput Biol Med. 2019 May;108:122-132. doi: 10.1016/j.compbiomed.2019.04.002. Epub 2019 Apr 7.
7
Research on named entity recognition of Traditional Chinese Medicine chest discomfort cases incorporating domain vocabulary features.基于领域词汇特征的中医胸痹病案命名实体识别研究。
Comput Biol Med. 2023 Nov;166:107466. doi: 10.1016/j.compbiomed.2023.107466. Epub 2023 Sep 9.
8
Research on named entity recognition of adverse drug reactions based on NLP and deep learning.基于自然语言处理和深度学习的药物不良反应命名实体识别研究
Front Pharmacol. 2023 Jun 1;14:1121796. doi: 10.3389/fphar.2023.1121796. eCollection 2023.
9
A diffusion enhanced CRF and BiLSTM framework for accurate entity recognition.一种用于精确实体识别的扩散增强条件随机场和双向长短期记忆网络框架。
Sci Rep. 2025 Jun 4;15(1):19670. doi: 10.1038/s41598-025-04036-x.
10
Robust multi-source geographic entities matching by maximizing geometric and semantic similarity.通过最大化几何和语义相似度实现稳健的多源地理实体匹配。
Sci Rep. 2024 Dec 30;14(1):31616. doi: 10.1038/s41598-024-79812-2.

本文引用的文献

1
Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study.实时预测和预报源自中国武汉的 2019-nCoV 疫情在国内和国际的潜在传播:一项建模研究。
Lancet. 2020 Feb 29;395(10225):689-697. doi: 10.1016/S0140-6736(20)30260-9. Epub 2020 Jan 31.
2
An attention-based BiLSTM-CRF approach to document-level chemical named entity recognition.基于注意力机制的 BiLSTM-CRF 方法在文档级化学命名实体识别中的应用。
Bioinformatics. 2018 Apr 15;34(8):1381-1388. doi: 10.1093/bioinformatics/btx761.
3
PKDE4J: Entity and relation extraction for public knowledge discovery.PKDE4J:用于公共知识发现的实体与关系提取
J Biomed Inform. 2015 Oct;57:320-32. doi: 10.1016/j.jbi.2015.08.008. Epub 2015 Aug 12.

舆情两步匹配框架的实体感知

Entity perception of Two-Step-Matching framework for public opinions.

作者信息

Li Ren-De, Ma Hao-Tian, Wang Zi-Yi, Guo Qiang, Liu Jian-Guo

机构信息

Library and Research Center of Computer Systems Science, University of Shanghai for Science and Technology, Shanghai 200093, PR China.

School of Accountancy and Shanghai Key Laboratory of Financial Information Technology, Shanghai University of Finance and Economics, Shanghai 200433, PR China.

出版信息

J Saf Sci Resil. 2020 Sep;1(1):36-43. doi: 10.1016/j.jnlssr.2020.06.005. Epub 2020 Jun 30.

DOI:10.1016/j.jnlssr.2020.06.005
PMID:40477064
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7324343/
Abstract

Entity perception of ambiguous user comments is a critical problem of target identification for huge amount of public opinions. In this paper, a Two-Step-Matching method is proposed to identify the precise target entity from multiple entities mentioned. Firstly, potential entities are extracted by BiLSTM-CRF model and characteristic words by TF-IDF model from public comments. Secondly, the first matching is implemented between potential entities and an official business directory by Jaro-Winkler distance algorithm. Then, in order to find the precise one, an industry-characteristic dictionary is developed into the second matching process. The precise entity is identified according to the count of characteristic words matching to industry-characteristic dictionary. In addition, associated rate (global indicator) and accuracy rate (sample indicator) are defined for evaluation of matching accuracy. The results for three data sets of public opinions about major public health events show that the highest associated rate and accuracy rate arrive at 0.93 and 0.95, averagely enhanced by 32% and 30% above the case of using the first matching process alone. This framework provides the method to find the true target entity of really wanted expression from public opinions.

摘要

模糊用户评论的实体感知是海量舆情目标识别中的关键问题。本文提出一种两步匹配方法,从提及的多个实体中识别出精确的目标实体。首先,利用双向长短期记忆网络-条件随机场(BiLSTM-CRF)模型从公众评论中提取潜在实体,利用词频-逆文档频率(TF-IDF)模型提取特征词。其次,通过Jaro-Winkler距离算法在潜在实体与官方业务目录之间进行第一次匹配。然后,为了找到精确的实体,将行业特征词典引入到第二次匹配过程中。根据与行业特征词典匹配的特征词数量来识别精确实体。此外,定义关联率(全局指标)和准确率(样本指标)来评估匹配精度。关于重大公共卫生事件的三个舆情数据集的结果表明,最高关联率和准确率分别达到0.93和0.95,比仅使用第一次匹配过程的情况平均提高了32%和30%。该框架提供了一种从舆情中找到真正想要表达的真实目标实体的方法。