EPITA Research Lab. (LRE), Kremlin-Bicêtre, France.
LASTIG, Univ Gustave Eiffel, IGN, ENSG, Saint-Mande, France.
PLoS One. 2024 Feb 15;19(2):e0298217. doi: 10.1371/journal.pone.0298217. eCollection 2024.
Shape vectorization is a key stage of the digitization of large-scale historical maps, especially city maps that exhibit complex and valuable details. Having access to digitized buildings, building blocks, street networks and other geographic content opens numerous new approaches for historical studies such as change tracking, morphological analysis and density estimations. In the context of the digitization of Paris atlases created in the 19th and early 20th centuries, we have designed a supervised pipeline that reliably extract closed shapes from historical maps. This pipeline is based on a supervised edge filtering stage using deep filters, and a closed shape extraction stage using a watershed transform. It relies on probable multiple suboptimal methodological choices that hamper the vectorization performances in terms of accuracy and completeness. Objectively investigating which solutions are the most adequate among the numerous possibilities is comprehensively addressed in this paper. The following contributions are subsequently introduced: (i) we propose an improved training protocol for map digitization; (ii) we introduce a joint optimization of the edge detection and shape extraction stages; (iii) we compare the performance of state-of-the-art deep edge filters with topology-preserving loss functions, including vision transformers; (iv) we evaluate the end-to-end deep learnable watershed against Meyer watershed. We subsequently design the critical path for a fully automatic extraction of key elements of historical maps. All the data, code, benchmark results are freely available at https://github.com/soduco/Benchmark_historical_map_vectorization.
图形矢量化是大规模历史地图数字化的关键阶段,特别是对于展示复杂而有价值细节的城市地图。获取数字化的建筑物、建筑块、街道网络和其他地理内容为历史研究开辟了许多新途径,例如变化跟踪、形态分析和密度估计。在 19 世纪和 20 世纪初创建的巴黎地图集数字化的背景下,我们设计了一种监督管道,可以从历史地图中可靠地提取封闭形状。该管道基于使用深度滤波器的监督边缘过滤阶段和使用分水岭变换的封闭形状提取阶段。它依赖于可能存在的多个次优方法选择,这些选择会影响准确性和完整性方面的矢量化性能。本文全面探讨了客观调查众多可能性中哪些解决方案最合适的问题。随后介绍了以下贡献:(i)我们提出了一种改进的地图数字化训练协议;(ii)我们引入了边缘检测和形状提取阶段的联合优化;(iii)我们比较了具有拓扑保持损失函数的最先进的深度边缘滤波器与视觉转换器的性能;(iv)我们评估了端到端可学习的深度分水岭与 Meyer 分水岭的性能。随后,我们设计了从历史地图中自动提取关键元素的关键路径。所有数据、代码和基准结果均可在 https://github.com/soduco/Benchmark_historical_map_vectorization 上免费获取。