Wong Tak-Shing, Bouman Charles A, Pollak Ilya, Fan Zhigang
School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN 47907, USA.
IEEE Trans Image Process. 2009 Nov;18(11):2518-35. doi: 10.1109/TIP.2009.2028252. Epub 2009 Jul 24.
The JPEG standard is one of the most prevalent image compression schemes in use today. While JPEG was designed for use with natural images, it is also widely used for the encoding of raster documents. Unfortunately, JPEG's characteristic blocking and ringing artifacts can severely degrade the quality of text and graphics in complex documents. We propose a JPEG decompression algorithm which is designed to produce substantially higher quality images from the same standard JPEG encodings. The method works by incorporating a document image model into the decoding process which accounts for the wide variety of content in modern complex color documents. The method works by first segmenting the JPEG encoded document into regions corresponding to background, text, and picture content. The regions corresponding to text and background are then decoded using maximum a posteriori (MAP) estimation. Most importantly, the MAP reconstruction of the text regions uses a model which accounts for the spatial characteristics of text and graphics. Our experimental comparisons to the baseline JPEG decoding as well as to three other decoding schemes, demonstrate that our method substantially improves the quality of decoded images, both visually and as measured by PSNR.
JPEG标准是当今使用最广泛的图像压缩方案之一。虽然JPEG是为处理自然图像而设计的,但它也被广泛用于光栅文档的编码。不幸的是,JPEG特有的块状和振铃伪像会严重降低复杂文档中文字和图形的质量。我们提出了一种JPEG解压缩算法,旨在从相同的标准JPEG编码中生成质量更高的图像。该方法通过将文档图像模型纳入解码过程来实现,该模型考虑了现代复杂彩色文档中的各种内容。该方法首先将JPEG编码文档分割成对应于背景、文本和图片内容的区域。然后,使用最大后验(MAP)估计对对应于文本和背景的区域进行解码。最重要的是,文本区域的MAP重建使用了一个考虑文本和图形空间特征的模型。我们与基线JPEG解码以及其他三种解码方案的实验比较表明,我们的方法在视觉上以及通过PSNR测量都显著提高了解码图像的质量。