Roos-Hoefgeest Sara, Roos-Hoefgeest Mario, Álvarez Ignacio, González Rafael C
Department of Electrical, Computer Electronics and Systems Engineering, University of Oviedo, 33003 Oviedo, Spain.
CIN Advanced Systems Group, 33211 Gijón, Spain.
Sensors (Basel). 2025 Apr 3;25(7):2271. doi: 10.3390/s25072271.
High-precision surface defect detection in manufacturing often relies on laser triangulation profilometric sensors for detailed surface measurements, providing detailed and accurate surface measurements over a line. Accurate motion between the sensor and workpiece, usually managed by robotic systems, is critical for maintaining optimal distance and orientation. This paper introduces a novel Reinforcement Learning (RL) approach to optimize inspection trajectories for profilometric sensors based on the boustrophedon scanning method. The RL model dynamically adjusts sensor position and tilt to ensure consistent profile distribution and high-quality scanning. We use a simulated environment replicating real-world conditions, including sensor noise and surface irregularities, to plan trajectories offline using CAD models. Key contributions include designing a state space, action space, and reward function tailored for profilometric sensor inspection. The Proximal Policy Optimization (PPO) algorithm trains the RL agent to optimize these trajectories effectively. Validation involves testing the model on various parts in simulation and performing real-world inspection with a UR3e robotic arm, demonstrating the approach's practicality and effectiveness.
制造中的高精度表面缺陷检测通常依赖激光三角测量轮廓传感器进行详细的表面测量,可对一条线上的表面进行详细且准确的测量。传感器与工件之间的精确运动通常由机器人系统控制,这对于保持最佳距离和方向至关重要。本文介绍了一种基于双向扫描方法的新型强化学习(RL)方法,用于优化轮廓传感器的检测轨迹。RL模型动态调整传感器的位置和倾斜度,以确保一致的轮廓分布和高质量扫描。我们使用一个模拟真实世界条件的环境,包括传感器噪声和表面不规则性,利用CAD模型离线规划轨迹。主要贡献包括设计了一个针对轮廓传感器检测量身定制的状态空间、动作空间和奖励函数。近端策略优化(PPO)算法训练RL智能体有效地优化这些轨迹。验证包括在模拟中对各种零件测试模型,并使用UR3e机器人手臂进行实际检测,证明了该方法的实用性和有效性。