• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用差分进化优化RetinaNet锚框以改进目标检测

Optimizing RetinaNet anchors using differential evolution for improved object detection.

作者信息

Mohammed Asaad, Ibrahim Hosny M, Omar Nagwa M

机构信息

Information Technology Department, Faculty of Computers and Information, Assiut University, Assiut, 71515, Egypt.

出版信息

Sci Rep. 2025 Jun 20;15(1):20101. doi: 10.1038/s41598-025-02888-x.

DOI:10.1038/s41598-025-02888-x
PMID:40542001
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12181318/
Abstract

Object detection is a fundamental task in computer vision. It has two primary types: one-stage detectors known for their high speed and efficiency, and two-stage detectors, which offer higher accuracy but are often slower due to their complex architecture. Balancing these two aspects has been a significant challenge in the field. RetinaNet, a premier single-stage object detector, is renowned for its remarkable balance between speed and accuracy. Its success is largely due to the groundbreaking focal loss function, which adeptly addresses the issue of class imbalance prevalent in object detection tasks. This innovative approach significantly enhances detection accuracy while maintaining high speed, making RetinaNet an ideal choice for a wide range of real-world applications. However, its performance decreases when applied to datasets containing objects with unique characteristics, such as objects with elongated or squat shapes. In such cases, the default anchor parameters may not fully meet the requirements of these specialized objects. To overcome this limitation, we present an enhancement to the RetinaNet model to improve its ability to handle variations in objects across different domains. Specifically, we propose an optimization algorithm based on Differential Evolution (DE) that adjusts anchor scales and ratios while determining the most appropriate number of these parameters for each dataset based on the annotated data. Through extensive experiments on datasets spanning diverse domains such as the Karlsruhe Institute of Technology and Toyota Technological Institute (KITTI), the Unconstrained Face Detection Dataset (UFDD), the TomatoPlantFactoryDataset, and the widely used Common Objects in Context (COCO) 2017 benchmark, we demonstrate that our proposed method significantly outperforms both the original RetinaNet and anchor-free methods by a considerable margin.

摘要

目标检测是计算机视觉中的一项基本任务。它主要有两种类型:一种是单阶段检测器,以其高速和高效著称;另一种是两阶段检测器,虽然精度更高,但由于其复杂的架构,速度往往较慢。在该领域中,平衡这两个方面一直是一项重大挑战。RetinaNet作为一种卓越的单阶段目标检测器,以其在速度和精度之间的出色平衡而闻名。它的成功很大程度上归功于开创性的焦点损失函数,该函数巧妙地解决了目标检测任务中普遍存在的类别不平衡问题。这种创新方法在保持高速的同时显著提高了检测精度,使RetinaNet成为广泛的实际应用的理想选择。然而,当将其应用于包含具有独特特征的物体的数据集时,例如具有细长或矮胖形状的物体,其性能会下降。在这种情况下,默认的锚点参数可能无法完全满足这些特殊物体的要求。为了克服这一限制,我们提出了对RetinaNet模型的一种改进,以提高其处理不同领域中物体变化的能力。具体而言,我们提出了一种基于差分进化(DE)的优化算法,该算法在根据标注数据为每个数据集确定最合适的锚点参数数量的同时,调整锚点的比例和比率。通过在跨越不同领域的数据集上进行广泛实验,如卡尔斯鲁厄理工学院和丰田技术研究所(KITTI)、无约束人脸检测数据集(UFDD)、番茄工厂数据集以及广泛使用的上下文常见物体(COCO)2017基准,我们证明了我们提出的方法在很大程度上显著优于原始的RetinaNet和无锚点方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/4e28e07a6c76/41598_2025_2888_Fig19_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/3450893923d8/41598_2025_2888_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/9271aa8e8b84/41598_2025_2888_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/9bf80c4b8f1b/41598_2025_2888_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/d52e2eebac38/41598_2025_2888_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/f10ae20661f6/41598_2025_2888_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/75120555b13b/41598_2025_2888_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/dc5430b49c53/41598_2025_2888_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/870b9eb73b83/41598_2025_2888_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/bb24bec417bc/41598_2025_2888_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/2d06523d66f7/41598_2025_2888_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/0b7678520ba8/41598_2025_2888_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/7100c7593112/41598_2025_2888_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/1aa991a3b001/41598_2025_2888_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/1d56b433907e/41598_2025_2888_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/42fbf0cbeba2/41598_2025_2888_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/e5d552b8837e/41598_2025_2888_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/680ac6c5b7dd/41598_2025_2888_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/28d726ebc76f/41598_2025_2888_Fig17_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/e257f269e693/41598_2025_2888_Fig18_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/4e28e07a6c76/41598_2025_2888_Fig19_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/3450893923d8/41598_2025_2888_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/9271aa8e8b84/41598_2025_2888_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/9bf80c4b8f1b/41598_2025_2888_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/d52e2eebac38/41598_2025_2888_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/f10ae20661f6/41598_2025_2888_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/75120555b13b/41598_2025_2888_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/dc5430b49c53/41598_2025_2888_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/870b9eb73b83/41598_2025_2888_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/bb24bec417bc/41598_2025_2888_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/2d06523d66f7/41598_2025_2888_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/0b7678520ba8/41598_2025_2888_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/7100c7593112/41598_2025_2888_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/1aa991a3b001/41598_2025_2888_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/1d56b433907e/41598_2025_2888_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/42fbf0cbeba2/41598_2025_2888_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/e5d552b8837e/41598_2025_2888_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/680ac6c5b7dd/41598_2025_2888_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/28d726ebc76f/41598_2025_2888_Fig17_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/e257f269e693/41598_2025_2888_Fig18_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6872/12181318/4e28e07a6c76/41598_2025_2888_Fig19_HTML.jpg

相似文献

1
Optimizing RetinaNet anchors using differential evolution for improved object detection.使用差分进化优化RetinaNet锚框以改进目标检测
Sci Rep. 2025 Jun 20;15(1):20101. doi: 10.1038/s41598-025-02888-x.
2
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
3
Systemic treatments for metastatic cutaneous melanoma.转移性皮肤黑色素瘤的全身治疗
Cochrane Database Syst Rev. 2018 Feb 6;2(2):CD011123. doi: 10.1002/14651858.CD011123.pub2.
4
Immunogenicity and seroefficacy of pneumococcal conjugate vaccines: a systematic review and network meta-analysis.肺炎球菌结合疫苗的免疫原性和血清效力:系统评价和网络荟萃分析。
Health Technol Assess. 2024 Jul;28(34):1-109. doi: 10.3310/YWHA3079.
5
Eliciting adverse effects data from participants in clinical trials.从临床试验参与者中获取不良反应数据。
Cochrane Database Syst Rev. 2018 Jan 16;1(1):MR000039. doi: 10.1002/14651858.MR000039.pub2.
6
Computer and mobile technology interventions for self-management in chronic obstructive pulmonary disease.用于慢性阻塞性肺疾病自我管理的计算机和移动技术干预措施。
Cochrane Database Syst Rev. 2017 May 23;5(5):CD011425. doi: 10.1002/14651858.CD011425.pub2.
7
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
8
Reading aids for adults with low vision.针对视力低下成年人的阅读辅助工具。
Cochrane Database Syst Rev. 2018 Apr 17;4(4):CD003303. doi: 10.1002/14651858.CD003303.pub4.
9
Interventions for central serous chorioretinopathy: a network meta-analysis.中心性浆液性脉络膜视网膜病变的干预措施:一项网状Meta分析
Cochrane Database Syst Rev. 2025 Jun 16;6(6):CD011841. doi: 10.1002/14651858.CD011841.pub3.
10
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.

本文引用的文献

1
Enhanced object detection in remote sensing images by applying metaheuristic and hybrid metaheuristic optimizers to YOLOv7 and YOLOv8.通过将元启发式和混合元启发式优化器应用于YOLOv7和YOLOv8来增强遥感图像中的目标检测。
Sci Rep. 2025 Feb 28;15(1):7226. doi: 10.1038/s41598-025-89124-8.
2
Enhancing two-stage object detection models via data-driven anchor box optimization in UAV-based maritime SAR.通过基于无人机的海事合成孔径雷达中数据驱动的锚框优化来增强两阶段目标检测模型。
Sci Rep. 2024 Feb 27;14(1):4765. doi: 10.1038/s41598-024-55570-z.
3
A dataset of tomato fruits images for object detection in the complex lighting environment of plant factories.
一个用于植物工厂复杂光照环境下目标检测的番茄果实图像数据集。
Data Brief. 2023 Jun 3;48:109291. doi: 10.1016/j.dib.2023.109291. eCollection 2023 Jun.
4
Unsupervised Pre-Training for Detection Transformers.用于检测变压器的无监督预训练
IEEE Trans Pattern Anal Mach Intell. 2023 Nov;45(11):12772-12782. doi: 10.1109/TPAMI.2022.3216514. Epub 2023 Oct 3.
5
Object Detection in Medical Images Based on Hierarchical Transformer and Mask Mechanism.基于分层 Transformer 和掩模机制的医学图像目标检测。
Comput Intell Neurosci. 2022 Aug 4;2022:5863782. doi: 10.1155/2022/5863782. eCollection 2022.
6
A Real-Time Object Detector for Autonomous Vehicles Based on YOLOv4.基于 YOLOv4 的自动驾驶车辆实时目标检测器。
Comput Intell Neurosci. 2021 Dec 10;2021:9218137. doi: 10.1155/2021/9218137. eCollection 2021.
7
Improving Performance and Adaptivity of Anchor-Based Detector Using Differentiable Anchoring With Efficient Target Generation.使用具有高效目标生成的可微锚定来提高基于锚定的检测器的性能和适应性。
IEEE Trans Image Process. 2021;30:712-724. doi: 10.1109/TIP.2020.3038349. Epub 2020 Dec 4.
8
Intact Detection of Highly Occluded Immature Tomatoes on Plants Using Deep Learning Techniques.基于深度学习技术的植株上高度遮挡未成熟番茄的完整检测。
Sensors (Basel). 2020 May 25;20(10):2984. doi: 10.3390/s20102984.
9
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.