Organizing Tagged Knowledge: Similarity Measures and Semantic Fluency in Structure Mining.

Authors

Sexton Rachael, Fuge Mark

Affiliations

Systems Integration Division, Engineering Laboratory, National Institute of Standards and Technology, Gaithersburg, Maryland 20871.

Dept. of Mechanical Engineering, University of Maryland, College Park, Maryland 20742.

Publication information

J Mech Des N Y. 2020;142(3). doi: 10.1115/1.4045686.

DOI: 10.1115/1.4045686
PMID: 33613016
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7890696/
Abstract

Recovering a system's underlying structure from its historical records (also called structure mining) is essential to making valid inferences about that system's behavior. For example, making reliable predictions about system failures based on maintenance work-order data requires determining how concepts described within the work order are related. Obtaining such structural information is challenging, requiring system understanding, synthesis, and representation design. This is often either too difficult or too time-consuming to produce. Consequently, a common approach to quickly eliciting tacit structural knowledge from experts is to gather uncontrolled keywords as record labels, i.e., "tags." One can then map those tags to concepts within the structure and quantitatively infer relationships between them. Existing models of tag similarity tend to depend either on correlation strength (e.g., overall co-occurrence frequencies) or on conditional strength (e.g., tag sequence probabilities). A key difficulty in applying either model is understanding under what conditions one is better than the other for overall structure recovery. In this paper, we investigate the core assumptions and implications of these two classes of similarity measures on structure recovery tasks. Then, using lessons from this characterization, we borrow from recent psychology literature on semantic fluency tasks to construct a tag similarity measure that emulates how humans recall tags from memory. We show through empirical testing that this method combines strengths of both common modeling paradigms. We also demonstrate its potential as a pre-processor for structure mining tasks via a case study in semi-supervised learning on real excavator maintenance work orders.
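
The two classes of tag similarity named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation, and it does not reproduce their semantic-fluency measure; it only contrasts a correlation-style score (cosine-normalized co-occurrence counts) with a conditional-style score (first-order transition probabilities between adjacent tags). The toy records, tag names, and function names are all hypothetical.

```python
from collections import Counter
from itertools import combinations
from math import sqrt

# Hypothetical toy records: each is an ordered list of tags on one work order.
records = [
    ["hydraulic", "leak", "hose"],
    ["hydraulic", "hose", "replace"],
    ["engine", "leak", "oil"],
    ["hydraulic", "leak"],
]

def cooccurrence_similarity(records):
    """Correlation-style measure: cosine-normalized co-occurrence counts.

    Keys are alphabetically sorted tag pairs, so the score is symmetric.
    """
    tag_counts = Counter()
    pair_counts = Counter()
    for rec in records:
        tags = sorted(set(rec))  # ignore order and repeats within a record
        tag_counts.update(tags)
        pair_counts.update(combinations(tags, 2))
    return {
        pair: n / sqrt(tag_counts[pair[0]] * tag_counts[pair[1]])
        for pair, n in pair_counts.items()
    }

def transition_probabilities(records):
    """Conditional-style measure: P(next tag | current tag) for adjacent tags.

    Keys are ordered (current, next) pairs, so the score is asymmetric.
    """
    follow_counts = Counter()
    source_counts = Counter()
    for rec in records:
        for current, nxt in zip(rec, rec[1:]):
            follow_counts[(current, nxt)] += 1
            source_counts[current] += 1
    return {
        (current, nxt): n / source_counts[current]
        for (current, nxt), n in follow_counts.items()
    }

cooc = cooccurrence_similarity(records)
trans = transition_probabilities(records)
```

In this toy data the co-occurrence score treats "hydraulic" and "leak" symmetrically, while the transition model records only the direction in which the tags were entered ("hydraulic" followed by "leak"). That difference in what each measure can see is the kind of trade-off the abstract refers to.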

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43f2/7890696/263d4cc6a8c6/nihms-1613391-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43f2/7890696/14dcc322c877/nihms-1613391-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43f2/7890696/b0c6f8da6dc7/nihms-1613391-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43f2/7890696/536510b3f48b/nihms-1613391-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/43f2/7890696/010f687120a0/nihms-1613391-f0005.jpg

Similar articles

1
Organizing Tagged Knowledge: Similarity Measures and Semantic Fluency in Structure Mining.
J Mech Des N Y. 2020;142(3). doi: 10.1115/1.4045686.
2
Qualitative Study
3
Verbal fluency difficulties in aphasia: A combination of lexical and executive control deficits.
Int J Lang Commun Disord. 2022 May;57(3):593-614. doi: 10.1111/1460-6984.12710. Epub 2022 Mar 23.
4
Implementation and evaluation of a multivariate abstraction-based, interval-based dynamic time-warping method as a similarity measure for longitudinal medical records.
J Biomed Inform. 2021 Nov;123:103919. doi: 10.1016/j.jbi.2021.103919. Epub 2021 Oct 8.
5
Short-Term Memory Impairment
6
Incorporating Domain Knowledge Into Language Models by Using Graph Convolutional Networks for Assessing Semantic Textual Similarity: Model Development and Performance Comparison.
JMIR Med Inform. 2021 Nov 26;9(11):e23101. doi: 10.2196/23101.
7
Towards tacit knowledge mining within context: Visual cognitive graph model and eye movement image interpretation.
Comput Methods Programs Biomed. 2022 Nov;226:107107. doi: 10.1016/j.cmpb.2022.107107. Epub 2022 Sep 6.
8
Evolving knowledge graph similarity for supervised learning in complex biomedical domains.
BMC Bioinformatics. 2020 Jan 3;21(1):6. doi: 10.1186/s12859-019-3296-1.
9
Combining unsupervised, supervised and rule-based learning: the case of detecting patient allergies in electronic health records.
BMC Med Inform Decis Mak. 2023 Sep 18;23(1):188. doi: 10.1186/s12911-023-02271-8.
10
Context Matters: Recovering Human Semantic Structure from Machine Learning Analysis of Large-Scale Text Corpora.
Cogn Sci. 2022 Feb;46(2):e13085. doi: 10.1111/cogs.13085.

Cited by

1
Fusion-Learning of Bayesian Network Models for Fault Diagnostics.
Sensors (Basel). 2021 Nov 17;21(22):7633. doi: 10.3390/s21227633.