• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

图表挖掘:自动化图表分析方法综述。

Chart Mining: A Survey of Methods for Automated Chart Analysis.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2021 Nov;43(11):3799-3819. doi: 10.1109/TPAMI.2020.2992028. Epub 2021 Oct 1.

DOI:10.1109/TPAMI.2020.2992028
PMID:32365018
Abstract

Charts are useful communication tools for the presentation of data in a visually appealing format that facilitates comprehension. There have been many studies dedicated to chart mining, which refers to the process of automatic detection, extraction and analysis of charts to reproduce the tabular data that was originally used to create them. By allowing access to data which might not be available in other formats, chart mining facilitates the creation of many downstream applications. This paper presents a comprehensive survey of approaches across all components of the automated chart mining pipeline, such as (i) automated extraction of charts from documents; (ii) processing of multi-panel charts; (iii) automatic image classifiers to collect chart images at scale; (iv) automated extraction of data from each chart image, for popular chart types as well as selected specialized classes; (v) applications of chart mining; and (vi) datasets for training and evaluation, and the methods that were used to build them. Finally, we summarize the main trends found in the literature and provide pointers to areas for further research in chart mining.

摘要

图表是一种有用的沟通工具,以吸引人的视觉格式呈现数据,便于理解。已经有许多专门针对图表挖掘的研究,图表挖掘是指自动检测、提取和分析图表以再现最初用于创建图表的表格数据的过程。通过允许访问可能在其他格式中不可用的数据,图表挖掘促进了许多下游应用的创建。本文全面调查了自动化图表挖掘管道的所有组件的方法,例如(i)从文档中自动提取图表;(ii)处理多面板图表;(iii)自动图像分类器,可大规模收集图表图像;(iv)自动从每个图表图像中提取数据,适用于流行的图表类型以及选定的专业图表类型;(v)图表挖掘的应用;以及(vi)用于训练和评估的数据集,以及用于构建它们的方法。最后,我们总结了文献中发现的主要趋势,并为图表挖掘的进一步研究提供了方向。

相似文献

1
Chart Mining: A Survey of Methods for Automated Chart Analysis.图表挖掘:自动化图表分析方法综述。
IEEE Trans Pattern Anal Mach Intell. 2021 Nov;43(11):3799-3819. doi: 10.1109/TPAMI.2020.2992028. Epub 2021 Oct 1.
2
Data Extraction of Circular-Shaped and Grid-like Chart Images.圆形和网格状图表图像的数据提取
J Imaging. 2022 May 12;8(5):136. doi: 10.3390/jimaging8050136.
3
ChartKG: A Knowledge-Graph-Based Representation for Chart Images.ChartKG:一种基于知识图谱的图表图像表示法。
IEEE Trans Vis Comput Graph. 2025 Sep;31(9):5854-5868. doi: 10.1109/TVCG.2024.3476508.
4
Categorizing biomedicine images using novel image features and sparse coding representation.使用新颖的图像特征和稀疏编码表示对生物医学图像进行分类。
BMC Med Genomics. 2013;6 Suppl 3(Suppl 3):S8. doi: 10.1186/1755-8794-6-S3-S8. Epub 2013 Nov 11.
5
A Real-World Approach on the Problem of Chart Recognition Using Classification, Detection and Perspective Correction.使用分类、检测和透视校正解决图表识别问题的一种真实方法。
Sensors (Basel). 2020 Aug 5;20(16):4370. doi: 10.3390/s20164370.
6
FamPlex: a resource for entity recognition and relationship resolution of human protein families and complexes in biomedical text mining.FamPlex:生物医学文本挖掘中人类蛋白质家族和复合物的实体识别和关系解析资源。
BMC Bioinformatics. 2018 Jun 28;19(1):248. doi: 10.1186/s12859-018-2211-5.
7
Bar charts detection and analysis in biomedical literature of PubMed Central.PubMed Central生物医学文献中的柱状图检测与分析
AMIA Annu Symp Proc. 2018 Apr 16;2017:859-865. eCollection 2017.
8
[Creation of a Stereo-paired Bone Anatomical Chart Using Human Bone Specimens for X-ray Anatomical Education Part II: Addition of Stereo-paired X-ray Bone Images to the Anatomical Chart].[利用人体骨骼标本制作用于X射线解剖学教学的立体配对骨骼解剖图 第二部分:在解剖图中添加立体配对X射线骨图像]
Nihon Hoshasen Gijutsu Gakkai Zasshi. 2021;77(4):365-370. doi: 10.6009/jjrt.2021_JSRT_77.4.365.
9
Text Mining in Biomedical Domain with Emphasis on Document Clustering.生物医学领域中的文本挖掘,重点在于文档聚类
Healthc Inform Res. 2017 Jul;23(3):141-146. doi: 10.4258/hir.2017.23.3.141. Epub 2017 Jul 31.
10
Intelligent bar chart plagiarism detection in documents.文档中的智能柱状图抄袭检测
ScientificWorldJournal. 2014;2014:612787. doi: 10.1155/2014/612787. Epub 2014 Sep 17.

引用本文的文献

1
Compound Figure Separation of Biomedical Images: Mining Large Datasets for Self-supervised Learning.生物医学图像的复合图形分离:挖掘大型数据集用于自监督学习。
J Mach Learn Biomed Imaging. 2022 Aug;1. Epub 2022 Sep 4.
2
Graphical integrity issues in open access publications: Detection and patterns of proportional ink violations.开放获取出版物中的图形完整性问题:比例墨水违规的检测和模式。
PLoS Comput Biol. 2021 Dec 13;17(12):e1009650. doi: 10.1371/journal.pcbi.1009650. eCollection 2021 Dec.