Bauer Alexander, Nakajima Shinichi, Görnitz Nico, Müller Klaus-Robert
IEEE Trans Neural Netw Learn Syst. 2020 Jul;31(7):2680-2684. doi: 10.1109/TNNLS.2019.2934225. Epub 2019 Sep 5.
Many learning tasks in the field of natural language processing, including sequence tagging, sequence segmentation, and syntactic parsing, have been successfully approached by means of structured prediction methods. An appealing property of the corresponding training algorithms is their ability to integrate the loss function of interest into the optimization process, improving the final results according to the chosen measure of performance. Here, we focus on the task of constituency parsing and show how to optimize the model for the F-score in the max-margin framework of a structural support vector machine (SVM). For reasons of computational efficiency, it is a common approach to binarize the corresponding grammar before training. Unfortunately, this introduces a bias during the training procedure, as the corresponding loss function is evaluated on the binary representation, while the resulting performance is measured on the original unbinarized trees. Here, we address this problem by extending the inference procedure presented by Bauer et al. Specifically, we propose an algorithmic modification that allows evaluating the loss on the unbinarized trees. The new approach properly models the loss function of interest, resulting in better prediction accuracy, and still benefits from the computational efficiency of the binarized representation. The presented idea can be easily transferred to other structured loss functions.
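The bias described in the abstract can be made concrete with a small sketch. This is not the authors' implementation; it only illustrates, under common conventions, why a bracket F-score computed on a binarized tree differs from the F-score on the original tree: binarization introduces intermediate nodes (marked here with a hypothetical "@" prefix) whose spans count as errors unless the tree is collapsed back before scoring.

```python
def brackets(tree):
    """Collect labeled spans (label, start, end) of a tree given as nested
    tuples (label, children), where a leaf is (label, word)."""
    spans = set()

    def walk(node, start):
        label, children = node
        if isinstance(children, str):      # preterminal over a single word
            return start + 1               # advance word position, no span
        end = start
        for child in children:
            end = walk(child, end)
        spans.add((label, start, end))
        return end

    walk(tree, 0)
    return spans


def debinarize(tree):
    """Collapse intermediate nodes introduced by binarization. The '@' label
    prefix is an assumed convention for such nodes, not taken from the paper."""
    label, children = tree
    if isinstance(children, str):
        return (label, children)
    new_children = []
    for child in children:
        child = debinarize(child)
        clabel, cchildren = child
        if clabel.startswith("@") and not isinstance(cchildren, str):
            new_children.extend(cchildren)  # splice the binarization node away
        else:
            new_children.append(child)
    return (label, tuple(new_children))


def f1(pred, gold):
    """Bracket F-score between predicted and gold span sets."""
    p, g = brackets(pred), brackets(gold)
    if not p or not g:
        return 0.0
    prec = len(p & g) / len(p)
    rec = len(p & g) / len(g)
    return 0.0 if prec + rec == 0 else 2 * prec * rec / (prec + rec)
```

For a gold ternary node `("VP", (("V","gave"), ("NP","her"), ("NP","flowers")))` and its binarized prediction containing an extra `@VP` node, `f1` scores below 1.0 on the binarized form but exactly 1.0 after `debinarize`, which is the gap between the loss used during training and the measure reported at test time that the proposed modification removes.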