• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于改进Mask R-CNN的数学函数图形元素检测与分割

Element detection and segmentation of mathematical function graphs based on improved Mask R-CNN.

作者信息

Lu Jiale, Chen Jianjun, Xu Taihua, Song Jingjing, Yang Xibei

机构信息

School of Computer Science, Jiangsu University of Science and Technology, Zhenjiang 212100, Jiangsu, China.

出版信息

Math Biosci Eng. 2023 May 31;20(7):12772-12801. doi: 10.3934/mbe.2023570.

DOI:10.3934/mbe.2023570
PMID:37501466
Abstract

There are approximately 2.2 billion people around the world with varying degrees of visual impairments. Among them, individuals with severe visual impairments predominantly rely on hearing and touch to gather external information. At present, there are limited reading materials for the visually impaired, mostly in the form of audio or text, which cannot satisfy the needs for the visually impaired to comprehend graphical content. Although many scholars have devoted their efforts to investigating methods for converting visual images into tactile graphics, tactile graphic translation fails to meet the reading needs of visually impaired individuals due to image type diversity and limitations in image recognition technology. The primary goal of this paper is to enable the visually impaired to gain a greater understanding of the natural sciences by transforming images of mathematical functions into an electronic format for the production of tactile graphics. In an effort to enhance the accuracy and efficiency of graph element recognition and segmentation of function graphs, this paper proposes an MA Mask R-CNN model which utilizes MA ConvNeXt as its improved feature extraction backbone network and MA BiFPN as its improved feature fusion network. The MA ConvNeXt is a novel feature extraction network proposed in this paper, while the MA BiFPN is a novel feature fusion network introduced in this paper. This model combines the information of local relations, global relations and different channels to form an attention mechanism that is able to establish multiple connections, thus increasing the detection capability of the original Mask R-CNN model on slender and multi-type targets by combining a variety of multi-scale features. Finally, the experimental results show that MA Mask R-CNN attains an 89.6% mAP value for target detection and 72.3% mAP value for target segmentation in the instance segmentation of function graphs. This results in a 9% mAP improvement for target detection and 12.8% mAP improvement for target segmentation compared to the original Mask R-CNN.

摘要

全球约有22亿人有不同程度的视力障碍。其中,严重视力障碍者主要依靠听觉和触觉来获取外部信息。目前,可供视障人士阅读的材料有限,大多为音频或文本形式,无法满足视障人士理解图形内容的需求。尽管许多学者致力于研究将视觉图像转换为触觉图形的方法,但由于图像类型的多样性和图像识别技术的局限性,触觉图形翻译无法满足视障人士的阅读需求。本文的主要目标是通过将数学函数图像转换为电子格式以制作触觉图形,使视障人士更好地理解自然科学。为了提高函数图像的图元识别和分割的准确性和效率,本文提出了一种MA Mask R-CNN模型,该模型利用MA ConvNeXt作为改进的特征提取主干网络,MA BiFPN作为改进的特征融合网络。MA ConvNeXt是本文提出的一种新型特征提取网络,MA BiFPN是本文引入的一种新型特征融合网络。该模型结合了局部关系、全局关系和不同通道的信息,形成了一种能够建立多重连接的注意力机制,从而通过结合多种多尺度特征提高了原始Mask R-CNN模型对细长和多类型目标的检测能力。最后,实验结果表明,在函数图像的实例分割中,MA Mask R-CNN在目标检测方面的mAP值达到89.6%,在目标分割方面的mAP值达到72.3%。与原始Mask R-CNN相比,目标检测的mAP提高了9%,目标分割的mAP提高了12.8%。

相似文献

1
Element detection and segmentation of mathematical function graphs based on improved Mask R-CNN.基于改进Mask R-CNN的数学函数图形元素检测与分割
Math Biosci Eng. 2023 May 31;20(7):12772-12801. doi: 10.3934/mbe.2023570.
2
Improved Mask R-CNN Multi-Target Detection and Segmentation for Autonomous Driving in Complex Scenes.改进的 Mask R-CNN 多目标检测与分割在复杂场景下的自动驾驶。
Sensors (Basel). 2023 Apr 10;23(8):3853. doi: 10.3390/s23083853.
3
Mask-Refined R-CNN: A Network for Refining Object Details in Instance Segmentation.Mask-Refined R-CNN:用于实例分割中细化对象细节的网络。
Sensors (Basel). 2020 Feb 13;20(4):1010. doi: 10.3390/s20041010.
4
Fusing attention mechanism with Mask R-CNN for instance segmentation of grape cluster in the field.将注意力机制与Mask R-CNN融合用于田间葡萄串的实例分割。
Front Plant Sci. 2022 Jul 22;13:934450. doi: 10.3389/fpls.2022.934450. eCollection 2022.
5
Apple detection and instance segmentation in natural environments using an improved Mask Scoring R-CNN Model.使用改进的掩码评分R-CNN模型在自然环境中进行苹果检测和实例分割。
Front Plant Sci. 2022 Dec 2;13:1016470. doi: 10.3389/fpls.2022.1016470. eCollection 2022.
6
Deep Learning-Based Segmentation of Peach Diseases Using Convolutional Neural Network.基于深度学习的卷积神经网络对桃病害进行分割
Front Plant Sci. 2022 May 25;13:876357. doi: 10.3389/fpls.2022.876357. eCollection 2022.
7
A Multiscale Instance Segmentation Method Based on Cleaning Rubber Ball Images.基于清理橡皮球图像的多尺度实例分割方法。
Sensors (Basel). 2023 Apr 25;23(9):4261. doi: 10.3390/s23094261.
8
A deep semantic network-based image segmentation of soybean rust pathogens.基于深度语义网络的大豆锈病病原体图像分割
Front Plant Sci. 2024 Mar 27;15:1340584. doi: 10.3389/fpls.2024.1340584. eCollection 2024.
9
Fine Segmentation of Chinese Character Strokes Based on Coordinate Awareness and Enhanced BiFPN.基于坐标感知与增强型双向特征金字塔网络的汉字笔画精细分割
Sensors (Basel). 2024 May 28;24(11):3480. doi: 10.3390/s24113480.
10
Improved-Mask R-CNN: Towards an accurate generic MSK MRI instance segmentation platform (data from the Osteoarthritis Initiative).改进型掩膜 R-CNN:迈向准确的通用 MSK MRI 实例分割平台(来自骨关节炎倡议的数据)。
Comput Med Imaging Graph. 2022 Apr;97:102056. doi: 10.1016/j.compmedimag.2022.102056. Epub 2022 Mar 19.