• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种归一化的莱文斯坦距离度量。

A normalized Levenshtein distance metric.

作者信息

Yujian Li, Bo Liu

机构信息

College of Computer Science and Technology, Beijing University of Technology, Pingleyuan 100, Chaoyang District, Beijing 100022, P.R. China.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2007 Jun;29(6):1091-5. doi: 10.1109/TPAMI.2007.1078.

DOI:10.1109/TPAMI.2007.1078
PMID:17431306
Abstract

Although a number of normalized edit distances presented so far may offer good performance in some applications, none of them can be regarded as a genuine metric between strings because they do not satisfy the triangle inequality. Given two strings X and Y over a finite alphabet, this paper defines a new normalized edit distance between X and Y as a simple function of their lengths (|X| and |Y|) and the Generalized Levenshtein Distance (GLD) between them. The new distance can be easily computed through GLD with a complexity of O(|X|.|Y|) and it is a metric valued in [0, 1] under the condition that the weight function is a metric over the set of elementary edit operations with all costs of insertions/deletions having the same weight. Experiments using the AESA algorithm in handwritten digit recognition show that the new distance can generally provide similar results to some other normalized edit distances and may perform slightly better if the triangle inequality is violated in a particular data set.

摘要

尽管迄今为止提出的一些归一化编辑距离在某些应用中可能表现良好,但它们都不能被视为字符串之间真正的度量标准,因为它们不满足三角不等式。给定有限字母表上的两个字符串X和Y,本文将X和Y之间的新归一化编辑距离定义为它们长度(|X|和|Y|)以及它们之间的广义莱文斯坦距离(GLD)的简单函数。新距离可以通过GLD轻松计算,复杂度为O(|X|.|Y|),并且在权重函数是基本编辑操作集上的度量标准且所有插入/删除成本具有相同权重的条件下,它是一个取值在[0, 1]的度量标准。在手写数字识别中使用AESA算法的实验表明,新距离通常可以提供与其他一些归一化编辑距离相似的结果,并且如果在特定数据集中违反三角不等式,可能会表现得稍好一些。

相似文献

1
A normalized Levenshtein distance metric.一种归一化的莱文斯坦距离度量。
IEEE Trans Pattern Anal Mach Intell. 2007 Jun;29(6):1091-5. doi: 10.1109/TPAMI.2007.1078.
2
A binary linear programming formulation of the graph edit distance.图编辑距离的二元线性规划公式化表述。
IEEE Trans Pattern Anal Mach Intell. 2006 Aug;28(8):1200-14. doi: 10.1109/TPAMI.2006.152.
3
Where are linear feature extraction methods applicable?线性特征提取方法适用于哪些领域?
IEEE Trans Pattern Anal Mach Intell. 2005 Dec;27(12):1934-44. doi: 10.1109/TPAMI.2005.250.
4
Markov edit distance.马尔可夫编辑距离
IEEE Trans Pattern Anal Mach Intell. 2004 Mar;26(3):311-21. doi: 10.1109/TPAMI.2004.1262315.
5
A systolic algorithm for Euclidean distance transform.一种用于欧几里得距离变换的收缩算法。
IEEE Trans Pattern Anal Mach Intell. 2006 Jul;28(7):1127-34. doi: 10.1109/TPAMI.2006.133.
6
Effective feature extraction in high-dimensional space.高维空间中的有效特征提取。
IEEE Trans Syst Man Cybern B Cybern. 2008 Dec;38(6):1652-6. doi: 10.1109/TSMCB.2008.927276.
7
An efficient Earth Mover's Distance algorithm for robust histogram comparison.一种用于稳健直方图比较的高效推土机距离算法。
IEEE Trans Pattern Anal Mach Intell. 2007 May;29(5):840-53. doi: 10.1109/TPAMI.2007.1058.
8
Effective proximity retrieval by ordering permutations.通过排列排序实现有效的近似检索。
IEEE Trans Pattern Anal Mach Intell. 2008 Sep;30(9):1647-58. doi: 10.1109/TPAMI.2007.70815.
9
Metric learning for text documents.文本文献的度量学习
IEEE Trans Pattern Anal Mach Intell. 2006 Apr;28(4):497-508. doi: 10.1109/TPAMI.2006.77.
10
Robust pose estimation and recognition using non-gaussian modeling of appearance subspaces.使用外观子空间的非高斯建模进行鲁棒姿态估计与识别。
IEEE Trans Pattern Anal Mach Intell. 2007 May;29(5):901-5. doi: 10.1109/TPAMI.2007.1028.

引用本文的文献

1
Get in the Virtual Hole! Examination of Gaze and Performance of Experts, Athletes, and Novices While Putting in Virtual Reality and in the Real-World.进入虚拟球洞!对专家、运动员和新手在虚拟现实和现实世界中推杆时的注视及表现进行研究。
Eur J Sport Sci. 2025 Sep;25(9):e70049. doi: 10.1002/ejsc.70049.
2
Hyper-speed meaning and form predictions: An EEG-based representational similarity analysis.超高速意义和形式预测:基于脑电图的表征相似性分析。
Psychon Bull Rev. 2025 Jul 25. doi: 10.3758/s13423-025-02731-4.
3
ACES-GNN: can graph neural network learn to explain activity cliffs?
ACES-GNN:图神经网络能学会解释活性断崖吗?
Digit Discov. 2025 Jun 30. doi: 10.1039/d5dd00012b.
4
On automatic decipherment of lost ancient scripts relying on combinatorial optimisation and coupled simulated annealing.基于组合优化和耦合模拟退火算法对失传古代文字的自动破译
Front Artif Intell. 2025 May 30;8:1581129. doi: 10.3389/frai.2025.1581129. eCollection 2025.
5
MOPSOGAT: Predicting CircRNA-Disease Associations via Improved Multi-objective Particle Swarm Optimization and Graph Attention Network.MOPSOGAT:通过改进的多目标粒子群优化和图注意力网络预测环状RNA与疾病的关联
Interdiscip Sci. 2025 Jun 13. doi: 10.1007/s12539-025-00725-3.
6
A ground truth approach for assessing process mining techniques.一种用于评估过程挖掘技术的真实情况方法。
Process Sci. 2025;2(1):1. doi: 10.1007/s44311-025-00006-8. Epub 2025 Mar 20.
7
PAPreC: A Pipeline for Antigenicity Prediction Comparison Methods across Bacteria.PAPreC:一种用于比较细菌抗原性预测方法的流程
ACS Omega. 2025 Feb 3;10(6):5415-5429. doi: 10.1021/acsomega.4c07147. eCollection 2025 Feb 18.
8
Enhancing Activity Recognition using CPD-based Activity Segmentation.使用基于条件随机场的活动分割增强活动识别
Pervasive Mob Comput. 2019 Feb;53:75-89. doi: 10.1016/j.pmcj.2019.01.004. Epub 2019 Jan 16.
9
Tracking the Spread of Pollen on Social Media Using Pollen-Related Messages From Twitter: Retrospective Analysis.利用 Twitter 上与花粉相关的信息追踪花粉在社交媒体上的传播:回顾性分析。
J Med Internet Res. 2024 Oct 21;26:e58309. doi: 10.2196/58309.
10
Using Machine Learning and Electronic Health Records to Identify Neuropsychiatric Risk Scores for Delirium in ICU and General Hospital Settings.利用机器学习和电子健康记录识别重症监护病房及综合医院环境中谵妄的神经精神风险评分。
Neuropsychiatr Dis Treat. 2024 Oct 2;20:1861-1876. doi: 10.2147/NDT.S479756. eCollection 2024.