• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过最大化几何和语义相似度实现稳健的多源地理实体匹配。

Robust multi-source geographic entities matching by maximizing geometric and semantic similarity.

作者信息

Yan YuHan, Wu PengDa, Yin Yong, Guo PeiPei

机构信息

Department of Geographic Information System, Chinese Academy of Surveying and mapping, Beijing, 100036, China.

School of Information Engineering, China University of Geosciences, Beijing, 100083, China.

出版信息

Sci Rep. 2024 Dec 30;14(1):31616. doi: 10.1038/s41598-024-79812-2.

DOI:10.1038/s41598-024-79812-2
PMID:39738125
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11685609/
Abstract

Geographic entity matching is an important means for multi-source spatial data fusion and information association and sharing. Corresponding matching methods have been designed by existing studies for different types of entity data characteristics, such as line and area. However, these approaches are often limited in the generalization ability for matching heterogeneous data from multiple sources and the accuracy for complex pattern matching. To resolve these problems, robust multi-source geographic entities matching by maximizing geometric and semantic similarity is proposed. First, the entire entity is segmented based on shape features, and the partitioned feature segments are extracted as matching primitives; Second, feature segments are grouped into patterns, encompassing three major categories and fourteen subcategories; Following this, pattern matching is performed based on spatial similarity metric such as maximum projection distance, etc.; Finally, the spatial matches are detected and refined through semantic similarity calculation. The proposed method is tested using two datasets from regions in southeast and northwest China. The experimental results demonstrate that our method can be effectively applied to both area and line entity matching with strong generalization and application capability and significantly improved matching accuracy. Specifically, nine feature segment matching patterns for matching area entities and six for line entities are utilized, and the precision and recall are nearly 90%.

摘要

地理实体匹配是多源空间数据融合、信息关联与共享的重要手段。现有研究针对不同类型的实体数据特征(如线和区域)设计了相应的匹配方法。然而,这些方法在匹配多源异构数据的泛化能力以及复杂模式匹配的准确性方面往往存在局限。为解决这些问题,提出了通过最大化几何和语义相似度进行稳健的多源地理实体匹配方法。首先,基于形状特征对整个实体进行分割,并提取分割后的特征片段作为匹配基元;其次,将特征片段分组为模式,包括三大类和十四个子类;接着,基于最大投影距离等空间相似性度量进行模式匹配;最后,通过语义相似度计算检测并优化空间匹配。使用来自中国东南部和西北部地区的两个数据集对所提方法进行了测试。实验结果表明,我们的方法能够有效应用于区域和线状实体匹配,具有很强的泛化和应用能力,且匹配精度显著提高。具体而言,利用了九个用于区域实体匹配的特征片段匹配模式和六个用于线状实体匹配的模式,精确率和召回率接近90%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/f8e96ef9819d/41598_2024_79812_Fig24_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/5311d48c85f7/41598_2024_79812_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/aa363f458cad/41598_2024_79812_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/c29252c65123/41598_2024_79812_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/27a0d5abd23f/41598_2024_79812_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/5ccb9e93c256/41598_2024_79812_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/73afd495d1d9/41598_2024_79812_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/1e921cfe9a8f/41598_2024_79812_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/76ba781ab4d6/41598_2024_79812_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/6baeabe1bf31/41598_2024_79812_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/d192c384c77d/41598_2024_79812_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/c19c1a0ca191/41598_2024_79812_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/1d330b9321d6/41598_2024_79812_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/a435388210c6/41598_2024_79812_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/ab071d17ed1c/41598_2024_79812_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/a0a7ac2b15c3/41598_2024_79812_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/8ddc7ad887ac/41598_2024_79812_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/a0e9947b9532/41598_2024_79812_Fig17_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/e5c47f66aee6/41598_2024_79812_Fig18_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/7cd9f1c5f6cf/41598_2024_79812_Fig19_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/6b7557b0260c/41598_2024_79812_Fig20_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/e1d186604b46/41598_2024_79812_Fig21_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/da651689e245/41598_2024_79812_Fig22_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/ae03a515d3ff/41598_2024_79812_Fig23_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/f8e96ef9819d/41598_2024_79812_Fig24_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/5311d48c85f7/41598_2024_79812_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/aa363f458cad/41598_2024_79812_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/c29252c65123/41598_2024_79812_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/27a0d5abd23f/41598_2024_79812_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/5ccb9e93c256/41598_2024_79812_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/73afd495d1d9/41598_2024_79812_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/1e921cfe9a8f/41598_2024_79812_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/76ba781ab4d6/41598_2024_79812_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/6baeabe1bf31/41598_2024_79812_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/d192c384c77d/41598_2024_79812_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/c19c1a0ca191/41598_2024_79812_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/1d330b9321d6/41598_2024_79812_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/a435388210c6/41598_2024_79812_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/ab071d17ed1c/41598_2024_79812_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/a0a7ac2b15c3/41598_2024_79812_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/8ddc7ad887ac/41598_2024_79812_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/a0e9947b9532/41598_2024_79812_Fig17_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/e5c47f66aee6/41598_2024_79812_Fig18_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/7cd9f1c5f6cf/41598_2024_79812_Fig19_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/6b7557b0260c/41598_2024_79812_Fig20_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/e1d186604b46/41598_2024_79812_Fig21_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/da651689e245/41598_2024_79812_Fig22_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/ae03a515d3ff/41598_2024_79812_Fig23_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4cc/11685609/f8e96ef9819d/41598_2024_79812_Fig24_HTML.jpg

相似文献

1
Robust multi-source geographic entities matching by maximizing geometric and semantic similarity.通过最大化几何和语义相似度实现稳健的多源地理实体匹配。
Sci Rep. 2024 Dec 30;14(1):31616. doi: 10.1038/s41598-024-79812-2.
2
A Knowledge Graph Entity Disambiguation Method Based on Entity-Relationship Embedding and Graph Structure Embedding.基于实体关系嵌入和图结构嵌入的知识图谱实体消歧方法。
Comput Intell Neurosci. 2021 Sep 23;2021:2878189. doi: 10.1155/2021/2878189. eCollection 2021.
3
Multi-Knowledge Graph and Multi-View Entity Feature Learning for Predicting Drug-Related Side Effects.用于预测药物相关副作用的多知识图谱与多视图实体特征学习
J Chem Inf Model. 2025 May 26;65(10):5124-5138. doi: 10.1021/acs.jcim.5c00136. Epub 2025 May 6.
4
An effective knowledge graph entity alignment model based on multiple information.基于多源信息的知识图谱实体对齐模型
Neural Netw. 2023 May;162:83-98. doi: 10.1016/j.neunet.2023.02.029. Epub 2023 Feb 24.
5
Chinese Clinical Named Entity Recognition From Electronic Medical Records Based on Multisemantic Features by Using Robustly Optimized Bidirectional Encoder Representation From Transformers Pretraining Approach Whole Word Masking and Convolutional Neural Networks: Model Development and Validation.基于多语义特征,利用经过稳健优化的基于变换器预训练方法的全词掩码和卷积神经网络从电子病历中进行中文临床命名实体识别:模型开发与验证
JMIR Med Inform. 2023 May 10;11:e44597. doi: 10.2196/44597.
6
Discovering biomedical semantic relations in PubMed queries for information retrieval and database curation.在PubMed查询中发现生物医学语义关系以进行信息检索和数据库管理。
Database (Oxford). 2016 Mar 25;2016. doi: 10.1093/database/baw025. Print 2016.
7
Multi-Level Representation Learning for Chinese Medical Entity Recognition: Model Development and Validation.用于中文医学实体识别的多层次表示学习:模型开发与验证
JMIR Med Inform. 2020 May 4;8(5):e17637. doi: 10.2196/17637.
8
An Improved Method for Stable Feature Points Selection in Structure-from-Motion Considering Image Semantic and Structural Characteristics.考虑图像语义和结构特征的运动结构中稳定特征点选择的改进方法。
Sensors (Basel). 2021 Apr 1;21(7):2416. doi: 10.3390/s21072416.
9
A deep learning model incorporating part of speech and self-matching attention for named entity recognition of Chinese electronic medical records.基于词性和自匹配注意力的深度学习模型在中文电子病历命名实体识别中的应用。
BMC Med Inform Decis Mak. 2019 Apr 9;19(Suppl 2):65. doi: 10.1186/s12911-019-0762-7.
10
Multi-line laser 3D reconstruction method based on spatial quadric surface and geometric estimation.基于空间二次曲面和几何估计的多线激光三维重建方法
Sci Rep. 2024 Oct 9;14(1):23589. doi: 10.1038/s41598-024-74331-6.