Informatics Institute, Gazi University, 06680 Ankara, Turkey.
Computer Engineering Department, Faculty of Engineering, Gazi University, 06570 Ankara, Turkey.
Sensors (Basel). 2024 Jan 31;24(3):922. doi: 10.3390/s24030922.
In the field of unmanned systems, the combination of artificial intelligence with self-operating functionalities is becoming increasingly important. This study introduces a new method for autonomously detecting humans in indoor environments using unmanned aerial vehicles, utilizing the advanced techniques of a deep learning framework commonly known as "You Only Look Once" (YOLO). The key contribution of this research is the development of a new model (YOLO-IHD), specifically designed for human detection in indoor using drones. This model is created using a unique dataset gathered from aerial vehicle footage in various indoor environments. It significantly improves the accuracy of detecting people in these complex environments. The model achieves a notable advancement in autonomous monitoring and search-and-rescue operations, highlighting its importance for tasks that require precise human detection. The improved performance of the new model is due to its optimized convolutional layers and an attention mechanism that process complex visual data from indoor environments. This results in more dependable operation in critical situations like disaster response and indoor rescue missions. Moreover, when combined with an accelerating processing library, the model shows enhanced real-time detection capabilities and operates effectively in a real-world environment with a custom designed indoor drone. This research lays the groundwork for future enhancements designed to significantly increase the model's accuracy and the reliability of indoor human detection in real-time drone applications.
在无人系统领域,人工智能与自主功能的结合变得越来越重要。本研究介绍了一种使用无人机在室内环境中自主检测人类的新方法,利用深度学习框架的先进技术,通常称为“你只看一次”(YOLO)。这项研究的主要贡献是开发了一种新的模型(YOLO-IHD),专门用于使用无人机在室内检测人类。该模型是使用从各种室内环境的飞行器镜头中收集的独特数据集创建的。它大大提高了在这些复杂环境中检测人员的准确性。该模型在自主监控和搜索-救援行动中取得了显著进展,突出了其在需要精确人员检测的任务中的重要性。新模型的改进性能归功于其优化的卷积层和注意力机制,这些机制可以处理来自室内环境的复杂视觉数据。这使得在灾难响应和室内救援任务等关键情况下的操作更加可靠。此外,当与加速处理库结合使用时,该模型显示出增强的实时检测能力,并在具有自定义设计的室内无人机的真实环境中有效运行。这项研究为未来的增强奠定了基础,旨在显著提高模型的准确性和实时无人机应用中室内人类检测的可靠性。