• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于在不同天气和光照条件下进行人行横道分割的合成数据集和真实世界数据集。

Synthetic and real-world datasets for crosswalk segmentation under diverse weather and lighting conditions.

作者信息

Romić Krešimir, Leventić Hrvoje, Habijan Marija, Galić Irena

机构信息

Faculty of Electrical Engineering, Computer Science and Information Technology Osijek, Kneza Trpimira 2B, Osijek HR-31000, Croatia.

出版信息

Data Brief. 2025 Jun 7;61:111755. doi: 10.1016/j.dib.2025.111755. eCollection 2025 Aug.

DOI:10.1016/j.dib.2025.111755
PMID:40586087
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12206053/
Abstract

This article presents a new dataset for crosswalk segmentation targeting assistive technologies for visually impaired individuals. The dataset combines synthetic and real-world first-person view images with corresponding binary segmentation masks. The synthetic portion contains 3000 images generated using a fine-tuned Stable Diffusion model, with 1500 images created using a standard prompt ("a crosswalk image") and 1500 additional images incorporating various environmental conditions (sunny, cloudy, rainy, and night) through specialized prompts. The real-world component comprises 300 images extracted from chest-mounted smartphone video recordings of pedestrians approaching crosswalks, carefully distributed across different environmental conditions (120 sunny, 60 cloudy, 60 rainy, and 60 night images). To ensure diversity, each physical crosswalk location appears in at most two images from different approach directions. All images in both synthetic and real-world sets were manually annotated using a custom interface where annotators defined crosswalk regions as quadrilateral polygons, creating binary masks. The dataset is organized hierarchically by image source (synthetic/real-world) and environmental condition, with consistent subfolder structures for images and their corresponding masks. This dataset addresses the scarcity of publicly available crosswalk segmentation data with environmental diversity and has potential applications in developing and benchmarking computer vision algorithms for assistive navigation systems, investigating synthetic data augmentation efficacy, and advancing pedestrian safety technologies.

摘要

本文提出了一个用于人行横道分割的新数据集,旨在为视障人士提供辅助技术。该数据集将合成的和真实世界的第一人称视角图像与相应的二值分割掩码相结合。合成部分包含使用微调后的Stable Diffusion模型生成的3000张图像,其中1500张图像是使用标准提示(“一张人行横道图像”)创建的,另外1500张图像则通过专门提示融入了各种环境条件(晴天、多云、雨天和夜晚)。真实世界部分由从行人接近人行横道时佩戴在胸前的智能手机视频记录中提取的300张图像组成,这些图像仔细分布在不同的环境条件下(120张晴天、60张多云、60张雨天和60张夜晚图像)。为确保多样性,每个实际人行横道位置在来自不同接近方向的图像中最多出现两次。合成集和真实世界集中的所有图像都使用自定义界面进行了手动标注,标注人员将人行横道区域定义为四边形多边形,从而创建二值掩码。该数据集按图像来源(合成/真实世界)和环境条件进行分层组织,图像及其相应掩码具有一致的子文件夹结构。这个数据集解决了具有环境多样性的公开可用人行横道分割数据稀缺的问题,并且在开发和基准测试辅助导航系统的计算机视觉算法、研究合成数据增强效果以及推进行人安全技术方面具有潜在应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4482/12206053/9c7b53910e6d/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4482/12206053/d6532aa21095/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4482/12206053/295894da6c94/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4482/12206053/671e22858739/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4482/12206053/9c7b53910e6d/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4482/12206053/d6532aa21095/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4482/12206053/295894da6c94/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4482/12206053/671e22858739/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4482/12206053/9c7b53910e6d/gr4.jpg

相似文献

1
Synthetic and real-world datasets for crosswalk segmentation under diverse weather and lighting conditions.用于在不同天气和光照条件下进行人行横道分割的合成数据集和真实世界数据集。
Data Brief. 2025 Jun 7;61:111755. doi: 10.1016/j.dib.2025.111755. eCollection 2025 Aug.
2
A reproducible framework for synthetic data generation and instance segmentation in robotic suturing.用于机器人缝合中合成数据生成和实例分割的可重复框架。
Int J Comput Assist Radiol Surg. 2025 Jun 24. doi: 10.1007/s11548-025-03460-8.
3
Integrating computer vision algorithms and RFID system for identification and tracking of group-housed animals: an example with pigs.整合计算机视觉算法和射频识别系统用于群居动物的识别与跟踪:以猪为例。
J Anim Sci. 2024 Jan 3;102. doi: 10.1093/jas/skae174.
4
Antidepressants for pain management in adults with chronic pain: a network meta-analysis.抗抑郁药治疗成人慢性疼痛的疼痛管理:一项网络荟萃分析。
Health Technol Assess. 2024 Oct;28(62):1-155. doi: 10.3310/MKRT2948.
5
Proposal for Using AI to Assess Clinical Data Integrity and Generate Metadata: Algorithm Development and Validation.关于使用人工智能评估临床数据完整性并生成元数据的提案:算法开发与验证
JMIR Med Inform. 2025 Jun 30;13:e60204. doi: 10.2196/60204.
6
Cauliflower leaf diseases: A computer vision dataset for smart agriculture.花椰菜叶部病害:一个用于智慧农业的计算机视觉数据集。
Data Brief. 2025 Apr 28;60:111594. doi: 10.1016/j.dib.2025.111594. eCollection 2025 Jun.
7
VIIDA and InViDe: computational approaches for generating and evaluating inclusive image paragraphs for the visually impaired.VIIDA和InViDe:为视障人士生成和评估包容性图像段落的计算方法。
Disabil Rehabil Assist Technol. 2025 Jul;20(5):1470-1495. doi: 10.1080/17483107.2024.2437567. Epub 2024 Dec 11.
8
Management of urinary stones by experts in stone disease (ESD 2025).结石病专家对尿路结石的管理(2025年结石病专家共识)
Arch Ital Urol Androl. 2025 Jun 30;97(2):14085. doi: 10.4081/aiua.2025.14085.
9
Drugs for preventing postoperative nausea and vomiting in adults after general anaesthesia: a network meta-analysis.成人全身麻醉后预防术后恶心呕吐的药物:网状Meta分析
Cochrane Database Syst Rev. 2020 Oct 19;10(10):CD012859. doi: 10.1002/14651858.CD012859.pub2.
10
A Head-On Comparison of EQ-VT- and Crosswalk-Based EQ-5D-5L Value Sets.基于EQ-VT和人行横道法的EQ-5D-5L值集的直接比较。
Appl Health Econ Health Policy. 2025 Mar 11. doi: 10.1007/s40258-025-00954-z.

本文引用的文献

1
UrOAC: Urban objects in any-light conditions.UrOAC:任何光照条件下的城市物体。
Data Brief. 2022 Apr 14;42:108172. doi: 10.1016/j.dib.2022.108172. eCollection 2022 Jun.
2
Smartphone-based computer vision travelling aids for blind and visually impaired individuals: A systematic review.用于盲人和视力受损者的基于智能手机的计算机视觉移动辅助设备:一项系统综述。
Assist Technol. 2022 Mar 4;34(2):178-194. doi: 10.1080/10400435.2020.1743381. Epub 2020 Apr 17.