• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于条件随机场的扫描历史文档图像盲渗色去除

Blind Bleed-Through Removal for Scanned Historical Document Image With Conditional Random Fields.

出版信息

IEEE Trans Image Process. 2016 Dec;25(12):5702-5712. doi: 10.1109/TIP.2016.2614133. Epub 2016 Sep 27.

DOI:10.1109/TIP.2016.2614133
PMID:28114067
Abstract

Scanned images of historical documents often suffer from bleed-through, which refers to the ink on one side seeping through the paper and appearing on the other side. In this paper, a new conditional random field (CRF)-based method is proposed to remove the bleed-through from the scanned images of historical images. The proposed method only requires the scanned image of one side, referred as a blind method. In general, the scanned historical document image is composed of three components: foreground, bleed-through, and background. By assuming Gaussian distributions of the three components, the proposed method establishes conditional probability distribution (CPD) models of the three components first. The parameters of the component CPD models are estimated based on an initial segmentation of the input image. Then, CRFs are used to capture the relations between observed pixels in the scanned image and the corresponding labels as well as the spatial relation between the adjacent labels. The belief propagation algorithm is used to calculate the probabilities of different labels for each pixel. Once the labeling is completed by choosing the most possible label for each pixel, the bleed-through component is removed from the input historical image by a random-filling inpainting algorithm. Experimental results on the real data set show that the proposed method preserves the foreground component very well and removes the bleed-through effectively.

摘要

历史文档的扫描图像常常存在渗色问题,即纸张一侧的墨水渗透到另一侧并显现出来。本文提出了一种基于条件随机场(CRF)的新方法,用于去除历史图像扫描图像中的渗色。该方法仅需要一侧的扫描图像,称为盲法。一般来说,扫描的历史文档图像由三个部分组成:前景、渗色和背景。通过假设这三个部分的高斯分布,该方法首先建立这三个部分的条件概率分布(CPD)模型。基于输入图像的初始分割来估计各部分CPD模型的参数。然后,使用CRF来捕捉扫描图像中观察到的像素与其相应标签之间的关系以及相邻标签之间的空间关系。使用置信传播算法计算每个像素不同标签的概率。一旦通过为每个像素选择最可能的标签完成标注,就通过随机填充修复算法从输入的历史图像中去除渗色部分。在真实数据集上的实验结果表明,该方法能很好地保留前景部分并有效去除渗色。

相似文献

1
Blind Bleed-Through Removal for Scanned Historical Document Image With Conditional Random Fields.基于条件随机场的扫描历史文档图像盲渗色去除
IEEE Trans Image Process. 2016 Dec;25(12):5702-5712. doi: 10.1109/TIP.2016.2614133. Epub 2016 Sep 27.
2
Document ink bleed-through removal with two hidden Markov random fields and a single observation field.用两个隐马尔可夫随机场和一个单观测场去除文档墨迹洇透。
IEEE Trans Pattern Anal Mach Intell. 2010 Mar;32(3):431-47. doi: 10.1109/TPAMI.2009.33.
3
User-assisted ink-bleed reduction.用户辅助墨滴洇散减少。
IEEE Trans Image Process. 2010 Oct;19(10):2646-58. doi: 10.1109/TIP.2010.2048971. Epub 2010 Apr 22.
4
Restoration and content analysis of ancient manuscripts via color space based segmentation.基于颜色空间分割的古文献修复与内容分析。
PLoS One. 2023 Mar 22;18(3):e0282142. doi: 10.1371/journal.pone.0282142. eCollection 2023.
5
A dynamic conditional random field model for foreground and shadow segmentation.一种用于前景和阴影分割的动态条件随机场模型。
IEEE Trans Pattern Anal Mach Intell. 2006 Feb;28(2):279-89. doi: 10.1109/TPAMI.2006.25.
6
A label fusion method using conditional random fields with higher-order potentials: Application to hippocampal segmentation.一种使用具有高阶势的条件随机场的标签融合方法:在海马体分割中的应用。
Artif Intell Med. 2015 Jun;64(2):117-29. doi: 10.1016/j.artmed.2015.04.005. Epub 2015 May 4.
7
Ontology-Based Semantic Image Segmentation Using Mixture Models and Multiple CRFs.基于本体论的混合模型和多个条件随机场的语义图像分割。
IEEE Trans Image Process. 2016 Jul;25(7):3233-3248. doi: 10.1109/TIP.2016.2552401. Epub 2016 Apr 8.
8
Using Paper Texture for Choosing a Suitable Algorithm for Scanned Document Image Binarization.利用纸张纹理选择适用于扫描文档图像二值化的算法。
J Imaging. 2022 Oct 5;8(10):272. doi: 10.3390/jimaging8100272.
9
A vessel segmentation method for multi-modality angiographic images based on multi-scale filtering and statistical models.一种基于多尺度滤波和统计模型的多模态血管造影图像血管分割方法。
Biomed Eng Online. 2016 Nov 8;15(1):120. doi: 10.1186/s12938-016-0241-7.
10
Auto-context and its application to high-level vision tasks and 3D brain image segmentation.自动上下文及其在高级视觉任务和 3D 脑图像分割中的应用。
IEEE Trans Pattern Anal Mach Intell. 2010 Oct;32(10):1744-57. doi: 10.1109/TPAMI.2009.186.

引用本文的文献

1
Minimizing Bleed-Through Effect in Medieval Manuscripts with Machine Learning and Robust Statistics.利用机器学习和稳健统计方法减少中世纪手稿中的渗色效应
J Imaging. 2025 Apr 28;11(5):136. doi: 10.3390/jimaging11050136.
2
Restoration and content analysis of ancient manuscripts via color space based segmentation.基于颜色空间分割的古文献修复与内容分析。
PLoS One. 2023 Mar 22;18(3):e0282142. doi: 10.1371/journal.pone.0282142. eCollection 2023.