Wenbo Lin, Tingting Li, Xiao Li
School of Geology, Gansu Industrial Vocational and Technical College, Tianshui, Gansu, China.
School of Electronic Information, Gansu Industrial Vocational and Technical College, Tianshui, Gansu, China.
Front Neurorobot. 2025 Apr 1;19:1544694. doi: 10.3389/fnbot.2025.1544694. eCollection 2025.
Foundation pit deformation prediction is a critical aspect of underground engineering safety assessment, influencing construction quality and personnel safety. However, due to complex geological conditions and numerous environmental interference factors, traditional prediction methods struggle to achieve precise modeling. Conventional approaches, including numerical simulations, empirical formulas, and machine learning models, suffer from limitations such as high computational costs, poor generalization, or excessive dependence on specific data distributions. Recently, deep learning models, particularly cross-modal architectures, have demonstrated great potential in engineering applications. However, effectively integrating multi-modal data for improved prediction accuracy remains a significant challenge.
This study proposes a Multi-Scale Contrastive Language-Image Pretraining (CLP) framework, ModuCLIP, designed for foundation pit deformation prediction in multi-modal robotic systems. The framework leverages a self-supervised contrastive learning mechanism to integrate multi-source information, including images, textual descriptions, and sensor data, while employing a multi-scale feature learning approach to enhance adaptability to complex conditions. Experiments conducted on multiple foundation pit engineering datasets demonstrate that ModuCLIP outperforms existing methods in terms of prediction accuracy, generalization, and robustness.
The findings suggest that this framework provides an efficient and precise solution for foundation pit deformation prediction while offering new insights into multi-modal robotic perception and engineering monitoring applications.
基坑变形预测是地下工程安全评估的关键环节,影响着施工质量和人员安全。然而,由于地质条件复杂且环境干扰因素众多,传统预测方法难以实现精确建模。包括数值模拟、经验公式和机器学习模型在内的传统方法存在计算成本高、泛化性差或过度依赖特定数据分布等局限性。近年来,深度学习模型,特别是跨模态架构,在工程应用中展现出了巨大潜力。然而,有效整合多模态数据以提高预测精度仍然是一项重大挑战。
本研究提出了一种用于多模态机器人系统中基坑变形预测的多尺度对比语言-图像预训练(CLP)框架,即ModuCLIP。该框架利用自监督对比学习机制整合多源信息,包括图像、文本描述和传感器数据,同时采用多尺度特征学习方法来增强对复杂条件的适应性。在多个基坑工程数据集上进行的实验表明,ModuCLIP在预测精度、泛化性和鲁棒性方面优于现有方法。
研究结果表明,该框架为基坑变形预测提供了一种高效且精确的解决方案,同时为多模态机器人感知和工程监测应用提供了新的见解。