• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于序列数据建模的无穷阶条件随机场模型。

The infinite-order conditional random field model for sequential data modeling.

机构信息

Department of Electrical Engineering, Computer Engineering, and Informatics, Cyprus University of Technology, 33 Saripolou Str., Limassol 3036, Cyprus.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2013 Jun;35(6):1523-34. doi: 10.1109/TPAMI.2012.208.

DOI:10.1109/TPAMI.2012.208
PMID:23599063
Abstract

Sequential data labeling is a fundamental task in machine learning applications, with speech and natural language processing, activity recognition in video sequences, and biomedical data analysis being characteristic examples, to name just a few. The conditional random field (CRF), a log-linear model representing the conditional distribution of the observation labels, is one of the most successful approaches for sequential data labeling and classification, and has lately received significant attention in machine learning as it achieves superb prediction performance in a variety of scenarios. Nevertheless, existing CRF formulations can capture only one- or few-timestep interactions and neglect higher order dependences, which are potentially useful in many real-life sequential data modeling applications. To resolve these issues, in this paper we introduce a novel CRF formulation, based on the postulation of an energy function which entails infinitely long time-dependences between the modeled data. Building blocks of our novel approach are: 1) the sequence memoizer (SM), a recently proposed nonparametric Bayesian approach for modeling label sequences with infinitely long time dependences, and 2) a mean-field-like approximation of the model marginal likelihood, which allows for the derivation of computationally efficient inference algorithms for our model. The efficacy of the so-obtained infinite-order CRF (CRF(∞)) model is experimentally demonstrated.

摘要

顺序数据标注是机器学习应用中的一项基本任务,在语音和自然语言处理、视频序列中的活动识别以及生物医学数据分析等领域都有典型的应用案例。条件随机场 (CRF) 是一种表示观测标签条件分布的对数线性模型,是顺序数据标注和分类的最成功方法之一,最近在机器学习中受到了广泛关注,因为它在各种场景下都能实现出色的预测性能。然而,现有的 CRF 公式只能捕捉一到几个时间步的交互作用,而忽略了更高阶的依赖关系,而这些依赖关系在许多实际的顺序数据建模应用中可能是有用的。为了解决这些问题,在本文中,我们提出了一种新的 CRF 公式,基于一个能量函数的假设,该函数涉及到建模数据之间的无限长时间依赖关系。我们新方法的构建块包括:1) 序列记忆器 (SM),这是一种最近提出的用于对具有无限长时间依赖关系的标签序列进行建模的非参数贝叶斯方法,以及 2) 模型边际似然的均值场似然近似,它允许为我们的模型导出计算效率高的推断算法。实验证明了所得到的无限阶 CRF (CRF(∞)) 模型的有效性。

相似文献

1
The infinite-order conditional random field model for sequential data modeling.用于序列数据建模的无穷阶条件随机场模型。
IEEE Trans Pattern Anal Mach Intell. 2013 Jun;35(6):1523-34. doi: 10.1109/TPAMI.2012.208.
2
On the computational aspects of Gibbs-Markov random field modeling of missing-data in image sequences.关于图像序列中缺失数据的吉布斯 - 马尔可夫随机场建模的计算方面
IEEE Trans Image Process. 1999;8(8):1139-42. doi: 10.1109/83.777096.
3
Learning conditional random fields for classification of hyperspectral images.学习条件随机场进行高光谱图像分类。
IEEE Trans Image Process. 2010 Jul;19(7):1890-907. doi: 10.1109/TIP.2010.2045034. Epub 2010 Mar 15.
4
Combining features in a graphical model to predict protein binding sites.在图形模型中结合特征以预测蛋白质结合位点。
Proteins. 2015 May;83(5):844-52. doi: 10.1002/prot.24775. Epub 2015 Mar 14.
5
Discriminative learning for dynamic state prediction.用于动态状态预测的判别式学习
IEEE Trans Pattern Anal Mach Intell. 2009 Oct;31(10):1847-61. doi: 10.1109/TPAMI.2009.37.
6
Mixtures of Conditional Random Fields for Improved Structured Output Prediction.条件随机场混合模型提高结构输出预测。
IEEE Trans Neural Netw Learn Syst. 2017 May;28(5):1233-1240. doi: 10.1109/TNNLS.2016.2521875. Epub 2016 Feb 16.
7
Precursor-induced conditional random fields: connecting separate entities by induction for improved clinical named entity recognition.诱导前条件随机场:通过诱导连接独立实体以提高临床命名实体识别。
BMC Med Inform Decis Mak. 2019 Jul 15;19(1):132. doi: 10.1186/s12911-019-0865-1.
8
Training an active random field for real-time image denoising.训练用于实时图像去噪的主动随机场
IEEE Trans Image Process. 2009 Nov;18(11):2451-62. doi: 10.1109/TIP.2009.2028254. Epub 2009 Jul 24.
9
Hidden Markov models with nonelliptically contoured state densities.具有非椭圆轮廓状态密度的隐马尔可夫模型。
IEEE Trans Pattern Anal Mach Intell. 2010 Dec;32(12):2297-304. doi: 10.1109/TPAMI.2010.153.
10
Joint random field model for all-weather moving vehicle detection.全天候移动车辆检测的联合随机场模型。
IEEE Trans Image Process. 2010 Sep;19(9):2491-501. doi: 10.1109/TIP.2010.2048970. Epub 2010 Apr 22.

引用本文的文献

1
Determining the prevalence of cannabis, tobacco, and vaping device mentions in online communities using natural language processing.使用自然语言处理技术确定在线社区中关于大麻、烟草和蒸气设备的提及率。
Drug Alcohol Depend. 2021 Nov 1;228:109016. doi: 10.1016/j.drugalcdep.2021.109016. Epub 2021 Sep 6.
2
Semi-supervised morphosyntactic classification of Old Icelandic.古冰岛语的半监督形态句法分类
PLoS One. 2014 Jul 16;9(7):e102366. doi: 10.1371/journal.pone.0102366. eCollection 2014.