基于颜色空间分割的古文献修复与内容分析。

Restoration and content analysis of ancient manuscripts via color space based segmentation.

机构信息

Faculty of Computer Science and Engineering, GIK Institute, Topi, Pakistan.

Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" Area della Ricerca CNR di Pisa, Pisa, Italy.

出版信息

PLoS One. 2023 Mar 22;18(3):e0282142. doi: 10.1371/journal.pone.0282142. eCollection 2023.

DOI:10.1371/journal.pone.0282142

PMID:36947504

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10032482/

Abstract

Ancient manuscripts are a rich source of history and civilization. Unfortunately, these documents are often affected by different age and storage related degradation which impinge on their readability and information contents. In this paper, we propose a document restoration method that removes the unwanted interfering degradation patterns from color ancient manuscripts. We exploit different color spaces to highlight the spectral differences in various layers of information usually present in these documents. At each image pixel, the spectral representations of all color spaces are stacked to form a feature vector. PCA is applied to the whole data cube to eliminate correlation of the color planes and enhance separation among the patterns. The reduced data cube, along with the pixel spatial information, is used to perform a pixel based segmentation, where each cluster represents a class of pixels that share similar color properties in the decorrelated color spaces. The interfering, unwanted classes can thus be removed by inpainting their pixels with the background texture. Assuming Gaussian distributions for the various classes, a Gaussian Mixture Model (GMM) is estimated through the Expectation Maximization (EM) algorithm from the data, and then used to find appropriate labels for each pixel. In order to preserve the original appearance of the document and reproduce the background texture, the detected degraded pixels are replaced based on Gaussian conditional simulation, according to the surrounding context. Experiments are shown on manuscripts affected by different kinds of degradations, including manuscripts from the DIBCO 2018 and 2019 publicaly available dataset. We observe that the use of a few PCA dominant components accelerates the clustering process and provides a more accurate segmentation.

摘要

古文献是历史和文明的丰富来源。不幸的是，这些文档经常受到不同年代和存储相关退化的影响，从而影响其可读性和信息内容。在本文中，我们提出了一种从彩色古文献中去除不需要的干扰退化模式的文档恢复方法。我们利用不同的颜色空间来突出显示这些文档中通常存在的各种信息层的光谱差异。在每个图像像素处，所有颜色空间的光谱表示都被堆叠在一起以形成特征向量。PCA 应用于整个数据立方体以消除颜色平面的相关性并增强模式之间的分离。经过降维的数据立方体，以及像素的空间信息，用于执行基于像素的分割，其中每个聚类代表具有相似颜色属性的一类像素在去相关颜色空间中。因此，可以通过用背景纹理填充这些像素来去除干扰的、不需要的类。假设各种类别的分布为高斯分布，通过 EM 算法从数据中估计出高斯混合模型 (GMM)，然后将其用于为每个像素找到适当的标签。为了保持文档的原始外观并再现背景纹理，根据周围的上下文，通过高斯条件模拟替换检测到的退化像素。实验在受到不同类型退化影响的手稿上进行，包括来自 DIBCO 2018 和 2019 年公开可用数据集的手稿。我们观察到使用几个 PCA 主导成分可以加速聚类过程并提供更准确的分割。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e14/10032482/05209532ddf2/pone.0282142.g001.jpg

相似文献

Restoration and content analysis of ancient manuscripts via color space based segmentation.基于颜色空间分割的古文献修复与内容分析。

PLoS One. 2023 Mar 22;18(3):e0282142. doi: 10.1371/journal.pone.0282142. eCollection 2023.

Metaheuristic Algorithms Applied to Color Image Segmentation on HSV Space.应用于HSV空间彩色图像分割的元启发式算法

J Imaging. 2022 Jan 5;8(1):6. doi: 10.3390/jimaging8010006.

Gaussian-mixture-model-based spatial neighborhood relationships for pixel labeling problem.基于高斯混合模型的像素标记问题空间邻域关系

IEEE Trans Syst Man Cybern B Cybern. 2012 Feb;42(1):193-202. doi: 10.1109/TSMCB.2011.2161284. Epub 2011 Aug 15.

Skin segmentation using color pixel classification: analysis and comparison.基于颜色像素分类的皮肤分割：分析与比较

IEEE Trans Pattern Anal Mach Intell. 2005 Jan;27(1):148-54. doi: 10.1109/TPAMI.2005.17.

Robust spatial fuzzy GMM based MRI segmentation and carotid artery plaque detection in ultrasound images.基于鲁棒空间模糊 GMM 的 MRI 分割和超声图像中颈动脉斑块检测。

Comput Methods Programs Biomed. 2019 Jul;175:179-192. doi: 10.1016/j.cmpb.2019.04.026. Epub 2019 Apr 23.

Partitioning histopathological images: an integrated framework for supervised color-texture segmentation and cell splitting.分割组织病理学图像：一种用于监督的颜色-纹理分割和细胞分裂的集成框架。

IEEE Trans Med Imaging. 2011 Sep;30(9):1661-77. doi: 10.1109/TMI.2011.2141674. Epub 2011 Apr 11.

An extension Gaussian mixture model for brain MRI segmentation.一种用于脑部磁共振成像分割的扩展高斯混合模型。

Annu Int Conf IEEE Eng Med Biol Soc. 2014;2014:4711-4. doi: 10.1109/EMBC.2014.6944676.

Manifold regularized semi-supervised Gaussian mixture model.流形正则化半监督高斯混合模型

J Opt Soc Am A Opt Image Sci Vis. 2015 Apr 1;32(4):566-75. doi: 10.1364/JOSAA.32.000566.

A Rough Set Bounded Spatially Constrained Asymmetric Gaussian Mixture Model for Image Segmentation.一种用于图像分割的粗糙集有界空间约束非对称高斯混合模型

PLoS One. 2017 Jan 3;12(1):e0168449. doi: 10.1371/journal.pone.0168449. eCollection 2017.

Background based Gaussian mixture model lesion segmentation in PET.PET中基于背景的高斯混合模型病变分割

Med Phys. 2016 May;43(5):2662. doi: 10.1118/1.4947483.

引用本文的文献

Minimizing Bleed-Through Effect in Medieval Manuscripts with Machine Learning and Robust Statistics.利用机器学习和稳健统计方法减少中世纪手稿中的渗色效应

J Imaging. 2025 Apr 28;11(5):136. doi: 10.3390/jimaging11050136.

本文引用的文献

Blind Bleed-Through Removal for Scanned Historical Document Image With Conditional Random Fields.基于条件随机场的扫描历史文档图像盲渗色去除

IEEE Trans Image Process. 2016 Dec;25(12):5702-5712. doi: 10.1109/TIP.2016.2614133. Epub 2016 Sep 27.

User-assisted ink-bleed reduction.用户辅助墨滴洇散减少。

IEEE Trans Image Process. 2010 Oct;19(10):2646-58. doi: 10.1109/TIP.2010.2048971. Epub 2010 Apr 22.

A Bayesian framework for image segmentation with spatially varying mixtures.基于空间变化混合的图像分割的贝叶斯框架。

IEEE Trans Image Process. 2010 Sep;19(9):2278-89. doi: 10.1109/TIP.2010.2047903. Epub 2010 Apr 8.

A spatially constrained mixture model for image segmentation.一种用于图像分割的空间约束混合模型。

IEEE Trans Neural Netw. 2005 Mar;16(2):494-8. doi: 10.1109/TNN.2004.841773.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于颜色空间分割的古文献修复与内容分析。

Restoration and content analysis of ancient manuscripts via color space based segmentation.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献