• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用分类、检测和透视校正解决图表识别问题的一种真实方法。

A Real-World Approach on the Problem of Chart Recognition Using Classification, Detection and Perspective Correction.

机构信息

Computer Science Graduate Program (PPGCC), Federal University of Pará (UFPA), 66075-110 Belém, Brazil.

Institute of Electronics and Informatics Engineering of Aveiro (IEETA), Department of Electronics, Telecommunications e Informatics (DETI), University of Aveiro (UA), 3810-193 Aveiro, Portugal.

出版信息

Sensors (Basel). 2020 Aug 5;20(16):4370. doi: 10.3390/s20164370.

DOI:10.3390/s20164370
PMID:32764352
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7472071/
Abstract

Data charts are widely used in our daily lives, being present in regular media, such as newspapers, magazines, web pages, books, and many others. In general, a well-constructed data chart leads to an intuitive understanding of its underlying data. In the same way, when data charts have wrong design choices, a redesign of these representations might be needed. However, in most cases, these charts are shown as a static image, which means that the original data are not usually available. Therefore, automatic methods could be applied to extract the underlying data from the chart images to allow these changes. The task of recognizing charts and extracting data from them is complex, largely due to the variety of chart types and their visual characteristics. Other features in real-world images that can make this task difficult are photo distortions, noise, alignment, etc. Two computer vision techniques that can assist this task and have been little explored in this context are perspective detection and correction. These methods transform a distorted and noisy chart in a clear chart, with its type ready for data extraction or other uses. This paper proposes a classification, detection, and perspective correction process that is suitable for real-world usage, when considering the data used for training a state-of-the-art model for the extraction of a chart in real-world photography. The results showed that, with slight changes, chart recognition methods are now ready for real-world charts, when taking time and accuracy into consideration.

摘要

数据图表在我们的日常生活中被广泛使用,存在于各种媒体中,如报纸、杂志、网页、书籍等。通常,一个构造良好的数据图表可以直观地理解其底层数据。同样,当数据图表有错误的设计选择时,可能需要重新设计这些表示。然而,在大多数情况下,这些图表是以静态图像的形式呈现的,这意味着原始数据通常不可用。因此,可以应用自动方法从图表图像中提取底层数据,以允许进行这些更改。识别图表并从中提取数据的任务非常复杂,主要是因为图表类型的多样性及其视觉特征。现实世界图像中的其他特征也会使这项任务变得困难,例如照片失真、噪声、对齐等。有两种计算机视觉技术可以辅助这项任务,而在这种情况下很少被探索,即透视检测和校正。这些方法可以将扭曲和嘈杂的图表转换为清晰的图表,并准备好提取数据或其他用途。本文提出了一种分类、检测和透视校正的过程,适用于真实世界的使用情况,同时考虑到用于在真实世界摄影中提取图表的最先进模型的数据。结果表明,考虑到时间和准确性,图表识别方法现在已经可以适用于真实世界的图表了。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/7472071/1c415f967f31/sensors-20-04370-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/7472071/0b52fb75defc/sensors-20-04370-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/7472071/5a2a9159caab/sensors-20-04370-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/7472071/c92054ddf16d/sensors-20-04370-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/7472071/05efb5c15677/sensors-20-04370-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/7472071/610444573983/sensors-20-04370-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/7472071/a2922c3af2fb/sensors-20-04370-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/7472071/1c415f967f31/sensors-20-04370-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/7472071/0b52fb75defc/sensors-20-04370-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/7472071/5a2a9159caab/sensors-20-04370-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/7472071/c92054ddf16d/sensors-20-04370-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/7472071/05efb5c15677/sensors-20-04370-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/7472071/610444573983/sensors-20-04370-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/7472071/a2922c3af2fb/sensors-20-04370-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e778/7472071/1c415f967f31/sensors-20-04370-g007.jpg

相似文献

1
A Real-World Approach on the Problem of Chart Recognition Using Classification, Detection and Perspective Correction.使用分类、检测和透视校正解决图表识别问题的一种真实方法。
Sensors (Basel). 2020 Aug 5;20(16):4370. doi: 10.3390/s20164370.
2
Performance of a Computational Model of the Mammalian Olfactory System哺乳动物嗅觉系统计算模型的性能
3
Data Extraction of Circular-Shaped and Grid-like Chart Images.圆形和网格状图表图像的数据提取
J Imaging. 2022 May 12;8(5):136. doi: 10.3390/jimaging8050136.
4
Chartem: Reviving Chart Images with Data Embedding.Chartem:通过数据嵌入来恢复图表图像。
IEEE Trans Vis Comput Graph. 2021 Feb;27(2):337-346. doi: 10.1109/TVCG.2020.3030351. Epub 2021 Jan 28.
5
VividGraph: Learning to Extract and Redesign Network Graphs From Visualization Images.VividGraph:从可视化图像中学习提取和重新设计网络图。
IEEE Trans Vis Comput Graph. 2023 Jul;29(7):3169-3181. doi: 10.1109/TVCG.2022.3153514. Epub 2023 May 26.
6
An analysis of the paper-based health record: information content and its implications for electronic patient records.纸质健康记录分析:信息内容及其对电子病历的影响。
Medinfo. 1995;8 Pt 1:295.
7
A synthetic dataset of different chart types for advancements in chart identification and visualization.一个用于图表识别和可视化进展的不同图表类型的合成数据集。
Data Brief. 2024 Feb 21;53:110233. doi: 10.1016/j.dib.2024.110233. eCollection 2024 Apr.
8
ChartKG: A Knowledge-Graph-Based Representation for Chart Images.ChartKG:一种基于知识图谱的图表图像表示法。
IEEE Trans Vis Comput Graph. 2025 Sep;31(9):5854-5868. doi: 10.1109/TVCG.2024.3476508.
9
Graphical Overlays: Using Layered Elements to Aid Chart Reading.图形叠加:使用分层元素辅助图表阅读。
IEEE Trans Vis Comput Graph. 2012 Dec;18(12):2631-8. doi: 10.1109/TVCG.2012.229.
10
Chart Mining: A Survey of Methods for Automated Chart Analysis.图表挖掘:自动化图表分析方法综述。
IEEE Trans Pattern Anal Mach Intell. 2021 Nov;43(11):3799-3819. doi: 10.1109/TPAMI.2020.2992028. Epub 2021 Oct 1.

引用本文的文献

1
A Multi-Purpose Shallow Convolutional Neural Network for Chart Images.多用途浅层卷积神经网络在图表图像中的应用。
Sensors (Basel). 2022 Oct 11;22(20):7695. doi: 10.3390/s22207695.
2
Document-Image Related Visual Sensors and Machine Learning Techniques.文档-图像相关的视觉传感器和机器学习技术。
Sensors (Basel). 2021 Aug 30;21(17):5849. doi: 10.3390/s21175849.

本文引用的文献

1
Nanosystems, Edge Computing, and the Next Generation Computing Systems.纳米系统、边缘计算和下一代计算系统。
Sensors (Basel). 2019 Sep 19;19(18):4048. doi: 10.3390/s19184048.
2
Bar charts detection and analysis in biomedical literature of PubMed Central.PubMed Central生物医学文献中的柱状图检测与分析
AMIA Annu Symp Proc. 2018 Apr 16;2017:859-865. eCollection 2017.
3
Show and Tell: Lessons Learned from the 2015 MSCOCO Image Captioning Challenge.展示与讲述:从 2015 年 MSCOCO 图像字幕挑战赛中学到的经验教训。
IEEE Trans Pattern Anal Mach Intell. 2017 Apr;39(4):652-663. doi: 10.1109/TPAMI.2016.2587640. Epub 2016 Jul 7.
4
Region-Based Convolutional Networks for Accurate Object Detection and Segmentation.基于区域的卷积神经网络用于精确的目标检测和分割。
IEEE Trans Pattern Anal Mach Intell. 2016 Jan;38(1):142-58. doi: 10.1109/TPAMI.2015.2437384.
5
Deep learning.深度学习。
Nature. 2015 May 28;521(7553):436-44. doi: 10.1038/nature14539.