• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从科学出版物中进行视觉摘要识别 自监督学习

Visual Summary Identification From Scientific Publications Self-Supervised Learning.

作者信息

Yamamoto Shintaro, Lauscher Anne, Ponzetto Simone Paolo, Glavaš Goran, Morishima Shigeo

机构信息

Department of Pure and Applied Physics, Waseda University, Tokyo, Japan.

Data and Web Science Group, University of Mannheim, Mannheim, Germany.

出版信息

Front Res Metr Anal. 2021 Aug 19;6:719004. doi: 10.3389/frma.2021.719004. eCollection 2021.

DOI:10.3389/frma.2021.719004
PMID:34490413
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8418328/
Abstract

The exponential growth of scientific literature yields the need to support users to both effectively and efficiently analyze and understand the some body of research work. This exploratory process can be facilitated by providing graphical abstracts-a visual summary of a scientific publication. Accordingly, previous work recently presented an initial study on automatic identification of a central figure in a scientific publication, to be used as the publication's visual summary. This study, however, have been limited only to a single (biomedical) domain. This is primarily because the current state-of-the-art relies on supervised machine learning, typically relying on the existence of large amounts of labeled data: the only existing annotated data set until now covered only the biomedical publications. In this work, we build a novel benchmark data set for visual summary identification from scientific publications, which consists of papers presented at conferences from several areas of computer science. We couple this contribution with a new self-supervised learning approach to learn a heuristic matching of in-text references to figures with figure captions. Our self-supervised pre-training, executed on a large unlabeled collection of publications, attenuates the need for large annotated data sets for visual summary identification and facilitates domain transfer for this task. We evaluate our self-supervised pretraining for visual summary identification on both the existing biomedical and our newly presented computer science data set. The experimental results suggest that the proposed method is able to outperform the previous state-of-the-art without any task-specific annotations.

摘要

科学文献的指数级增长使得有必要支持用户有效且高效地分析和理解某一研究工作主体。通过提供图形摘要(科学出版物的视觉总结)可以促进这一探索过程。因此,先前的工作最近提出了一项关于自动识别科学出版物中核心人物以用作出版物视觉总结的初步研究。然而,这项研究仅限于单一(生物医学)领域。这主要是因为当前的先进技术依赖于监督机器学习,通常依赖于大量标记数据的存在:到目前为止,唯一现有的注释数据集仅涵盖生物医学出版物。在这项工作中,我们为从科学出版物中识别视觉总结构建了一个新的基准数据集,该数据集由计算机科学几个领域的会议上发表的论文组成。我们将这一贡献与一种新的自监督学习方法相结合,以学习文本参考文献与带有图注的图表之间启发式匹配。我们在大量未标记的出版物集合上执行的自监督预训练减少了对用于视觉总结识别的大型注释数据集的需求,并促进了该任务的领域转移。我们在现有的生物医学数据集和我们新提出的计算机科学数据集上评估了用于视觉总结识别的自监督预训练。实验结果表明,所提出的方法能够在没有任何特定任务注释的情况下优于先前的先进技术。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/210c/8418328/845f0cbebb2f/frma-06-719004-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/210c/8418328/1f7881a70403/frma-06-719004-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/210c/8418328/e7321f593448/frma-06-719004-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/210c/8418328/10e520dbf72e/frma-06-719004-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/210c/8418328/a515f081c07b/frma-06-719004-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/210c/8418328/fddf767a99ad/frma-06-719004-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/210c/8418328/06a566773b47/frma-06-719004-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/210c/8418328/845f0cbebb2f/frma-06-719004-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/210c/8418328/1f7881a70403/frma-06-719004-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/210c/8418328/e7321f593448/frma-06-719004-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/210c/8418328/10e520dbf72e/frma-06-719004-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/210c/8418328/a515f081c07b/frma-06-719004-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/210c/8418328/fddf767a99ad/frma-06-719004-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/210c/8418328/06a566773b47/frma-06-719004-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/210c/8418328/845f0cbebb2f/frma-06-719004-g007.jpg

相似文献

1
Visual Summary Identification From Scientific Publications Self-Supervised Learning.从科学出版物中进行视觉摘要识别 自监督学习
Front Res Metr Anal. 2021 Aug 19;6:719004. doi: 10.3389/frma.2021.719004. eCollection 2021.
2
Figure and caption extraction from biomedical documents.从生物医学文献中提取图和标题。
Bioinformatics. 2019 Nov 1;35(21):4381-4388. doi: 10.1093/bioinformatics/btz228.
3
Weakly supervised learning of information structure of scientific abstracts--is it accurate enough to benefit real-world tasks in biomedicine?科学文摘信息结构的弱监督学习——其准确性足以有益于生物医学中的实际任务吗?
Bioinformatics. 2011 Nov 15;27(22):3179-85. doi: 10.1093/bioinformatics/btr536. Epub 2011 Sep 22.
4
Entity linking for biomedical literature.生物医学文献的实体链接
BMC Med Inform Decis Mak. 2015;15 Suppl 1(Suppl 1):S4. doi: 10.1186/1472-6947-15-S1-S4. Epub 2015 May 20.
5
Exploiting the potential of unlabeled endoscopic video data with self-supervised learning.利用自监督学习挖掘未标记内镜视频数据的潜力。
Int J Comput Assist Radiol Surg. 2018 Jun;13(6):925-933. doi: 10.1007/s11548-018-1772-0. Epub 2018 Apr 27.
6
Unsupervised inference of implicit biomedical events using context triggers.使用上下文触发器进行无监督的隐含生物医学事件推断。
BMC Bioinformatics. 2020 Jan 28;21(1):29. doi: 10.1186/s12859-020-3341-0.
7
Learning to rank figures within a biomedical article.学习对生物医学文章中的图表进行排序。
PLoS One. 2014 Mar 13;9(3):e61567. doi: 10.1371/journal.pone.0061567. eCollection 2014.
8
Semi-Supervised Semantic Segmentation With High- and Low-Level Consistency.具有高低级一致性的半监督语义分割
IEEE Trans Pattern Anal Mach Intell. 2021 Apr;43(4):1369-1379. doi: 10.1109/TPAMI.2019.2960224. Epub 2021 Mar 4.
9
Knowledge based word-concept model estimation and refinement for biomedical text mining.用于生物医学文本挖掘的基于知识的词概念模型估计与优化。
J Biomed Inform. 2015 Feb;53:300-7. doi: 10.1016/j.jbi.2014.11.015. Epub 2014 Dec 12.
10
Self-Supervised Contextual Language Representation of Radiology Reports to Improve the Identification of Communication Urgency.用于提高沟通紧迫性识别的放射学报告自监督上下文语言表示
AMIA Jt Summits Transl Sci Proc. 2020 May 30;2020:413-421. eCollection 2020.

本文引用的文献

1
A Picture Is Worth a Thousand Views: A Triple Crossover Trial of Visual Abstracts to Examine Their Impact on Research Dissemination.一图胜千言:三重交叉试验研究视觉摘要对研究传播的影响。
J Med Internet Res. 2020 Dec 4;22(12):e22327. doi: 10.2196/22327.
2
Learning to rank figures within a biomedical article.学习对生物医学文章中的图表进行排序。
PLoS One. 2014 Mar 13;9(3):e61567. doi: 10.1371/journal.pone.0061567. eCollection 2014.
3
Automatic figure ranking and user interfacing for intelligent figure search.智能图像搜索的自动图像排序和用户界面。
PLoS One. 2010 Oct 7;5(10):e12983. doi: 10.1371/journal.pone.0012983.
4
Pictorial superiority effect.图像优势效应
J Exp Psychol Hum Learn. 1976 Sep;2(5):523-8.