• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

InCrowd-VI:一个用于评估在室内人员密集空间中进行同步定位与地图构建以实现人员导航的真实视觉惯性数据集。

InCrowd-VI: A Realistic Visual-Inertial Dataset for Evaluating Simultaneous Localization and Mapping in Indoor Pedestrian-Rich Spaces for Human Navigation.

作者信息

Bamdad Marziyeh, Hutter Hans-Peter, Darvishy Alireza

机构信息

Institute of Computer Science, Zurich University of Applied Sciences, 8400 Winterthur, Switzerland.

Department of Informatics, University of Zurich, 8050 Zurich, Switzerland.

出版信息

Sensors (Basel). 2024 Dec 21;24(24):8164. doi: 10.3390/s24248164.

DOI:10.3390/s24248164
PMID:39771900
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11679079/
Abstract

Simultaneous localization and mapping (SLAM) techniques can be used to navigate the visually impaired, but the development of robust SLAM solutions for crowded spaces is limited by the lack of realistic datasets. To address this, we introduce InCrowd-VI, a novel visual-inertial dataset specifically designed for human navigation in indoor pedestrian-rich environments. Recorded using Meta Aria Project glasses, it captures realistic scenarios without environmental control. InCrowd-VI features 58 sequences totaling a 5 km trajectory length and 1.5 h of recording time, including RGB, stereo images, and IMU measurements. The dataset captures important challenges such as pedestrian occlusions, varying crowd densities, complex layouts, and lighting changes. Ground-truth trajectories, accurate to approximately 2 cm, are provided in the dataset, originating from the Meta Aria project machine perception SLAM service. In addition, a semi-dense 3D point cloud of scenes is provided for each sequence. The evaluation of state-of-the-art visual odometry (VO) and SLAM algorithms on InCrowd-VI revealed severe performance limitations in these realistic scenarios. Under challenging conditions, systems exceeded the required localization accuracy of 0.5 m and the 1% drift threshold, with classical methods showing drift up to 5-10%. While deep learning-based approaches maintained high pose estimation coverage (>90%), they failed to achieve real-time processing speeds necessary for walking pace navigation. These results demonstrate the need and value of a new dataset to advance SLAM research for visually impaired navigation in complex indoor environments.

摘要

同时定位与地图构建(SLAM)技术可用于帮助视障人士导航,但针对拥挤空间开发强大的SLAM解决方案受到缺乏真实数据集的限制。为了解决这个问题,我们引入了InCrowd-VI,这是一个专门为室内行人密集环境中的人类导航设计的新型视觉惯性数据集。该数据集使用Meta Aria Project眼镜录制,可在不受环境控制的情况下捕捉真实场景。InCrowd-VI包含58个序列,总轨迹长度为5公里,录制时间为1.5小时,包括RGB图像、立体图像和惯性测量单元(IMU)测量数据。该数据集捕捉了诸如行人遮挡、人群密度变化、复杂布局和光照变化等重要挑战。数据集中提供了精确到约2厘米的地面真值轨迹,这些轨迹来自Meta Aria项目的机器感知SLAM服务。此外,还为每个序列提供了场景的半密集三维点云。在InCrowd-VI上对先进的视觉里程计(VO)和SLAM算法进行的评估表明,在这些真实场景中存在严重的性能限制。在具有挑战性的条件下,系统超过了所需的0.5米定位精度和1%的漂移阈值,经典方法的漂移高达5%-10%。虽然基于深度学习的方法保持了较高的姿态估计覆盖率(>90%),但它们未能实现步行速度导航所需的实时处理速度。这些结果证明了一个新数据集对于推进复杂室内环境中视障人士导航的SLAM研究的必要性和价值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/729a/11679079/09edc4e5fc6a/sensors-24-08164-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/729a/11679079/3ef40f7fa6c6/sensors-24-08164-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/729a/11679079/32061f831e10/sensors-24-08164-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/729a/11679079/efdc3945d11e/sensors-24-08164-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/729a/11679079/26c3a4c93c3d/sensors-24-08164-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/729a/11679079/a53e9468c0ee/sensors-24-08164-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/729a/11679079/7292590e2c2b/sensors-24-08164-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/729a/11679079/a22fc91eda73/sensors-24-08164-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/729a/11679079/09edc4e5fc6a/sensors-24-08164-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/729a/11679079/3ef40f7fa6c6/sensors-24-08164-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/729a/11679079/32061f831e10/sensors-24-08164-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/729a/11679079/efdc3945d11e/sensors-24-08164-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/729a/11679079/26c3a4c93c3d/sensors-24-08164-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/729a/11679079/a53e9468c0ee/sensors-24-08164-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/729a/11679079/7292590e2c2b/sensors-24-08164-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/729a/11679079/a22fc91eda73/sensors-24-08164-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/729a/11679079/09edc4e5fc6a/sensors-24-08164-g008.jpg

相似文献

1
InCrowd-VI: A Realistic Visual-Inertial Dataset for Evaluating Simultaneous Localization and Mapping in Indoor Pedestrian-Rich Spaces for Human Navigation.InCrowd-VI:一个用于评估在室内人员密集空间中进行同步定位与地图构建以实现人员导航的真实视觉惯性数据集。
Sensors (Basel). 2024 Dec 21;24(24):8164. doi: 10.3390/s24248164.
2
An Enhanced Pedestrian Visual-Inertial SLAM System Aided with Vanishing Point in Indoor Environments.增强型室内环境视惯性 SLAM 系统,辅助灭点估计。
Sensors (Basel). 2021 Nov 9;21(22):7428. doi: 10.3390/s21227428.
3
Assistive Navigation Using Deep Reinforcement Learning Guiding Robot With UWB/Voice Beacons and Semantic Feedbacks for Blind and Visually Impaired People.使用深度强化学习的辅助导航:借助超宽带/语音信标和语义反馈引导机器人为盲人和视力障碍者服务
Front Robot AI. 2021 Jun 22;8:654132. doi: 10.3389/frobt.2021.654132. eCollection 2021.
4
Research on PF-SLAM Indoor Pedestrian Localization Algorithm Based on Feature Point Map.基于特征点地图的PF-SLAM室内行人定位算法研究
Micromachines (Basel). 2018 May 28;9(6):267. doi: 10.3390/mi9060267.
5
Pedestrian Navigation Using Foot-Mounted Inertial Sensor and LIDAR.使用足部惯性传感器和激光雷达的行人导航
Sensors (Basel). 2016 Jan 19;16(1):120. doi: 10.3390/s16010120.
6
Stereo Visual Odometry Pose Correction through Unsupervised Deep Learning.通过无监督深度学习进行立体视觉里程计位姿校正。
Sensors (Basel). 2021 Jul 11;21(14):4735. doi: 10.3390/s21144735.
7
Accurate localization of indoor high similarity scenes using visual slam combined with loop closure detection algorithm.结合回环检测算法,利用视觉即时定位与地图构建实现室内高相似度场景的精确定位。
PLoS One. 2024 Dec 30;19(12):e0312358. doi: 10.1371/journal.pone.0312358. eCollection 2024.
8
HeadSLAM: Pedestrian SLAM with Head-Mounted Sensors.头戴式传感器的行人 SLAM:HeadSLAM
Sensors (Basel). 2022 Feb 18;22(4):1593. doi: 10.3390/s22041593.
9
Solving Monocular Visual Odometry Scale Factor with Adaptive Step Length Estimates for Pedestrians Using Handheld Devices.使用手持设备解决行人单目视觉里程计尺度因子问题的自适应步长估计方法。
Sensors (Basel). 2019 Feb 23;19(4):953. doi: 10.3390/s19040953.
10
RTK/Pseudolite/LAHDE/IMU-PDR Integrated Pedestrian Navigation System for Urban and Indoor Environments.RTK/Pseudolite/LAHDE/IMU-PDR 集成行人导航系统,适用于城市和室内环境。
Sensors (Basel). 2020 Mar 24;20(6):1791. doi: 10.3390/s20061791.