• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于音译对齐的弱监督楔形文字符号检测深度学习。

Deep learning of cuneiform sign detection with weak supervision using transliteration alignment.

机构信息

Heidelberg Collaboratory for Image Processing, Interdisciplinary Center for Scientific Computing, Heidelberg University, Heidelberg, Germany.

Department of the Languages and Cultures of the Near East, Institute for Assyriology, Heidelberg University, Heidelberg, Germany.

出版信息

PLoS One. 2020 Dec 16;15(12):e0243039. doi: 10.1371/journal.pone.0243039. eCollection 2020.

DOI:10.1371/journal.pone.0243039
PMID:33326435
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7743970/
Abstract

The cuneiform script provides a glimpse into our ancient history. However, reading age-old clay tablets is time-consuming and requires years of training. To simplify this process, we propose a deep-learning based sign detector that locates and classifies cuneiform signs in images of clay tablets. Deep learning requires large amounts of training data in the form of bounding boxes around cuneiform signs, which are not readily available and costly to obtain in the case of cuneiform script. To tackle this problem, we make use of existing transliterations, a sign-by-sign representation of the tablet content in Latin script. Since these do not provide sign localization, we propose a weakly supervised approach: We align tablet images with their corresponding transliterations to localize the transliterated signs in the tablet image, before using these localized signs in place of annotations to re-train the sign detector. A better sign detector in turn boosts the quality of the alignments. We combine these steps in an iterative process that enables training a cuneiform sign detector from transliterations only. While our method works weakly supervised, a small number of annotations further boost the performance of the cuneiform sign detector which we evaluate on a large collection of clay tablets from the Neo-Assyrian period. To enable experts to directly apply the sign detector in their study of cuneiform texts, we additionally provide a web application for the analysis of clay tablets with a trained cuneiform sign detector.

摘要

楔形文字为我们了解古代历史提供了线索。然而,阅读古老的泥板需要花费大量的时间和多年的训练。为了简化这个过程,我们提出了一种基于深度学习的符号检测器,可以在泥板图像中定位和分类楔形符号。深度学习需要大量的训练数据,这些数据以楔形符号的边界框的形式呈现,但对于楔形文字来说,这些数据并不容易获得,而且获取成本很高。为了解决这个问题,我们利用现有的音译,即拉丁字母表示的泥板内容的符号到符号的表示。由于这些音译没有提供符号定位,因此我们提出了一种弱监督的方法:我们将泥板图像与其对应的音译对齐,以在泥板图像中定位音译符号,然后使用这些本地化的符号代替注释来重新训练符号检测器。一个更好的符号检测器反过来又会提高对齐的质量。我们将这些步骤结合在一个迭代过程中,从而可以仅从音译中训练楔形符号检测器。虽然我们的方法是弱监督的,但少量的注释进一步提高了楔形符号检测器的性能,我们在大量来自新亚述时期的泥板上对其进行了评估。为了使专家能够在对楔形文字文本的研究中直接应用符号检测器,我们还提供了一个带有训练好的楔形符号检测器的泥板分析网络应用程序。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41c3/7743970/df26e4c3b670/pone.0243039.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41c3/7743970/0ae4f7ea04d8/pone.0243039.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41c3/7743970/ce7f32cca3ec/pone.0243039.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41c3/7743970/d4568b466dea/pone.0243039.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41c3/7743970/9f5a78811646/pone.0243039.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41c3/7743970/50e5a6867886/pone.0243039.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41c3/7743970/d232c609cc4f/pone.0243039.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41c3/7743970/df26e4c3b670/pone.0243039.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41c3/7743970/0ae4f7ea04d8/pone.0243039.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41c3/7743970/ce7f32cca3ec/pone.0243039.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41c3/7743970/d4568b466dea/pone.0243039.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41c3/7743970/9f5a78811646/pone.0243039.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41c3/7743970/50e5a6867886/pone.0243039.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41c3/7743970/d232c609cc4f/pone.0243039.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41c3/7743970/df26e4c3b670/pone.0243039.g007.jpg

相似文献

1
Deep learning of cuneiform sign detection with weak supervision using transliteration alignment.基于音译对齐的弱监督楔形文字符号检测深度学习。
PLoS One. 2020 Dec 16;15(12):e0243039. doi: 10.1371/journal.pone.0243039. eCollection 2020.
2
Create distinctive databases of ancient languages and using a computer vision model to accurately recognize and classify them.创建独特的古代语言数据库,并使用计算机视觉模型对其进行准确识别和分类。
Data Brief. 2024 Aug 10;56:110809. doi: 10.1016/j.dib.2024.110809. eCollection 2024 Oct.
3
Reading Akkadian cuneiform using natural language processing.使用自然语言处理阅读阿卡德语楔形文字。
PLoS One. 2020 Oct 28;15(10):e0240511. doi: 10.1371/journal.pone.0240511. eCollection 2020.
4
Synthesizing Supervision for Learning Deep Saliency Network without Human Annotation.无人工标注的深度学习显著图网络合成监督学习
IEEE Trans Pattern Anal Mach Intell. 2020 Jul;42(7):1755-1769. doi: 10.1109/TPAMI.2019.2900649. Epub 2019 Feb 20.
5
Application of Deep Learning Convolutional Neural Networks for Internal Tablet Defect Detection: High Accuracy, Throughput, and Adaptability.深度学习卷积神经网络在内部片剂缺陷检测中的应用:高准确率、高吞吐量和高适应性。
J Pharm Sci. 2020 Apr;109(4):1547-1557. doi: 10.1016/j.xphs.2020.01.014. Epub 2020 Jan 23.
6
Joint Weakly and Semi-Supervised Deep Learning for Localization and Classification of Masses in Breast Ultrasound Images.联合弱监督深度学习在乳腺超声图像中肿块的定位和分类。
IEEE Trans Med Imaging. 2019 Mar;38(3):762-774. doi: 10.1109/TMI.2018.2872031. Epub 2018 Sep 24.
7
Constrained-CNN losses for weakly supervised segmentation.约束卷积神经网络损失的弱监督分割。
Med Image Anal. 2019 May;54:88-99. doi: 10.1016/j.media.2019.02.009. Epub 2019 Feb 13.
8
Restoration of fragmentary Babylonian texts using recurrent neural networks.使用递归神经网络修复残缺的巴比伦文本。
Proc Natl Acad Sci U S A. 2020 Sep 15;117(37):22743-22751. doi: 10.1073/pnas.2003794117. Epub 2020 Sep 1.
9
Weakly supervised mitosis detection in breast histopathology images using concentric loss.使用同心损失的乳腺组织病理学图像弱监督有丝分裂检测。
Med Image Anal. 2019 Apr;53:165-178. doi: 10.1016/j.media.2019.01.013. Epub 2019 Feb 15.
10
Image generation by GAN and style transfer for agar plate image segmentation.基于 GAN 和风格迁移的琼脂平板图像分割的图像生成。
Comput Methods Programs Biomed. 2020 Feb;184:105268. doi: 10.1016/j.cmpb.2019.105268. Epub 2019 Dec 17.

引用本文的文献

1
Create distinctive databases of ancient languages and using a computer vision model to accurately recognize and classify them.创建独特的古代语言数据库,并使用计算机视觉模型对其进行准确识别和分类。
Data Brief. 2024 Aug 10;56:110809. doi: 10.1016/j.dib.2024.110809. eCollection 2024 Oct.
2
Deep Aramaic: Towards a synthetic data paradigm enabling machine learning in epigraphy.深层阿拉米语:在金石学中实现机器学习的综合数据范例。
PLoS One. 2024 Apr 19;19(4):e0299297. doi: 10.1371/journal.pone.0299297. eCollection 2024.
3
Interpol questioned documents review 2019-2022.

本文引用的文献

1
Characterization of the cuneiform signs by the use of a multifunctional optoelectronic device.利用多功能光电器件对楔形文字符号进行表征。
Appl Opt. 1996 Oct 10;35(29):5811-20. doi: 10.1364/AO.35.005811.
2
A novel connectionist system for unconstrained handwriting recognition.一种用于无约束手写识别的新型连接主义系统。
IEEE Trans Pattern Anal Mach Intell. 2009 May;31(5):855-68. doi: 10.1109/TPAMI.2008.137.
国际刑警组织对2019年至2022年文件的审查
Forensic Sci Int Synerg. 2023 Feb 24;6:100300. doi: 10.1016/j.fsisyn.2022.100300. eCollection 2023.
4
Restoring and attributing ancient texts using deep neural networks.利用深度神经网络修复和归因古代文本。
Nature. 2022 Mar;603(7900):280-283. doi: 10.1038/s41586-022-04448-z. Epub 2022 Mar 9.