Few-Shot Pixel-Precise Document Layout Segmentation via Dynamic Instance Generation and Local Thresholding.

Affiliations

Department of Mathematics, Computer Science and Physics, Università degli Studi di Udine, Via delle Scienze 206, 33100 Udine, Italy.

Department of Humanities and Cultural Heritage, Università degli Studi di Udine, Vicolo Florio 2/b, 33100 Udine, Italy.

Publication information

Int J Neural Syst. 2023 Oct;33(10):2350052. doi: 10.1142/S0129065723500521. Epub 2023 Aug 10.

DOI:10.1142/S0129065723500521
PMID:37567858
Abstract

Over the years, the humanities community has increasingly called for artificial intelligence frameworks to support the study of cultural heritage. Document layout segmentation, which aims to identify the different structural components of a document page, is a particularly interesting task connected to this trend, especially for handwritten texts. While there are many effective approaches to this problem, they all rely on large amounts of training data, which are rarely available in real-world scenarios: producing ground-truth segmentations with pixel-level precision is very time-consuming and often requires a degree of domain knowledge about the documents at hand. For this reason, in this paper we propose an effective few-shot learning framework for document layout segmentation that relies on two novel components, namely a dynamic instance generation module and a segmentation refinement module. This approach achieves performance comparable to the current state of the art on the popular Diva-HisDB dataset while relying on just a fraction of the available data.
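
The abstract names the two components but does not describe how they work. The sketch below is only an illustration of the general ideas, not the authors' implementation. It assumes that "dynamic instance generation" can be approximated by sampling fresh, aligned image/mask crops from the few annotated pages, and that the refinement step can be approximated by a mean-based local threshold that keeps a predicted label only where the pixel is darker than its neighbourhood. The function names, the window radius, and the offset are all hypothetical.

```python
import random


def random_crops(page, mask, size, n, seed=None):
    """Sample n aligned (image, mask) crops of size x size from one annotated page.

    Hypothetical stand-in for dynamic instance generation: new crops can be
    drawn at every epoch, so a handful of pages yields many training instances.
    """
    rng = random.Random(seed)
    h, w = len(page), len(page[0])
    crops = []
    for _ in range(n):
        top = rng.randint(0, h - size)
        left = rng.randint(0, w - size)
        img_crop = [row[left:left + size] for row in page[top:top + size]]
        mask_crop = [row[left:left + size] for row in mask[top:top + size]]
        crops.append((img_crop, mask_crop))
    return crops


def local_mean(gray, y, x, radius):
    """Mean grey level in the (2*radius+1)^2 window around (y, x), clipped at borders."""
    h, w = len(gray), len(gray[0])
    total = count = 0
    for yy in range(max(0, y - radius), min(h, y + radius + 1)):
        for xx in range(max(0, x - radius), min(w, x + radius + 1)):
            total += gray[yy][xx]
            count += 1
    return total / count


def refine_with_local_threshold(gray, coarse, radius=1, offset=10, background=0):
    """Keep a coarse label only on pixels darker than their local mean minus offset.

    Assumption: ink is darker than the page, so non-ink pixels inside a coarse
    region are reset to the background class.
    """
    refined = []
    for y, row in enumerate(coarse):
        out = []
        for x, label in enumerate(row):
            is_ink = gray[y][x] < local_mean(gray, y, x, radius) - offset
            out.append(label if (label != background and is_ink) else background)
        refined.append(out)
    return refined


# Toy 4x4 page: a dark vertical stroke and an over-wide coarse prediction.
gray = [[200, 200, 200, 200],
        [200,  50, 200, 200],
        [200,  50, 200, 200],
        [200, 200, 200, 200]]
coarse = [[0, 0, 0, 0],
          [1, 1, 1, 0],
          [1, 1, 1, 0],
          [0, 0, 0, 0]]
refined = refine_with_local_threshold(gray, coarse)
```

On the toy page, the coarse region covers three columns, while the refined mask retains label 1 only on the two dark stroke pixels. A real pipeline would operate on full-resolution scans and would more likely use an array library than nested lists.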

Similar articles

1. Few-Shot Pixel-Precise Document Layout Segmentation via Dynamic Instance Generation and Local Thresholding.
Int J Neural Syst. 2023 Oct;33(10):2350052. doi: 10.1142/S0129065723500521. Epub 2023 Aug 10.
2. Image generation by GAN and style transfer for agar plate image segmentation.
Comput Methods Programs Biomed. 2020 Feb;184:105268. doi: 10.1016/j.cmpb.2019.105268. Epub 2019 Dec 17.
3. Script-independent text line segmentation in freestyle handwritten documents.
IEEE Trans Pattern Anal Mach Intell. 2008 Aug;30(8):1313-29. doi: 10.1109/TPAMI.2007.70792.
4. A transfer learning approach to few-shot segmentation of novel white matter tracts.
Med Image Anal. 2022 Jul;79:102454. doi: 10.1016/j.media.2022.102454. Epub 2022 Apr 12.
5. ADNet++: A few-shot learning framework for multi-class medical image volume segmentation with uncertainty-guided feature refinement.
Med Image Anal. 2023 Oct;89:102870. doi: 10.1016/j.media.2023.102870. Epub 2023 Jun 26.
6. Anomaly detection-inspired few-shot medical image segmentation through self-supervision with supervoxels.
Med Image Anal. 2022 May;78:102385. doi: 10.1016/j.media.2022.102385. Epub 2022 Feb 11.
7. SinGAN-Seg: Synthetic training data generation for medical image segmentation.
PLoS One. 2022 May 2;17(5):e0267976. doi: 10.1371/journal.pone.0267976. eCollection 2022.
8. DAN: A Segmentation-Free Document Attention Network for Handwritten Document Recognition.
IEEE Trans Pattern Anal Mach Intell. 2023 Jul;45(7):8227-8243. doi: 10.1109/TPAMI.2023.3235826. Epub 2023 Jun 5.
9. MaskMitosis: a deep learning framework for fully supervised, weakly supervised, and unsupervised mitosis detection in histopathology images.
Med Biol Eng Comput. 2020 Jul;58(7):1603-1623. doi: 10.1007/s11517-020-02175-z. Epub 2020 May 22.
10. Signature detection and matching for document image retrieval.
IEEE Trans Pattern Anal Mach Intell. 2009 Nov;31(11):2015-31. doi: 10.1109/TPAMI.2008.237.