• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多模态颗粒物预测:利用移动设备和深度学习模型实现可扩展且高精度的空气质量监测。

Multimodal Particulate Matter Prediction: Enabling Scalable and High-Precision Air Quality Monitoring Using Mobile Devices and Deep Learning Models.

作者信息

Madokoro Hirokazu, Nix Stephanie

机构信息

Faculty of Software and Information Science, Iwate Prefectural University, Takizawa 020-0693, Iwate, Japan.

出版信息

Sensors (Basel). 2025 Jun 29;25(13):4053. doi: 10.3390/s25134053.

DOI:10.3390/s25134053
PMID:40648310
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12251630/
Abstract

This paper presents a novel approach for predicting Particulate Matter (PM) concentrations using mobile camera devices. In response to persistent air pollution challenges across Japan, we developed a system that utilizes cutting-edge transformer-based deep learning architectures to estimate PM values from imagery captured by smartphone cameras. Our approach employs Contrastive Language-Image Pre-Training (CLIP) as a multimodal framework to extract visual features associated with PM concentration from environmental scenes. We first developed a baseline through comparative analysis of time-series models for 1D PM signal prediction, finding that linear models, particularly NLinear, outperformed complex transformer architectures for short-term forecasting tasks. Building on these insights, we implemented a CLIP-based system for 2D image analysis that achieved a Top-1 accuracy of 0.24 and a Top-5 accuracy of 0.52 when tested on diverse smartphone-captured images. The performance evaluations on Graphics Processing Unit (GPU) and Single-Board Computer (SBC) platforms highlight a viable path toward edge deployment. Processing times of 0.29 s per image on the GPU versus 2.68 s on the SBC demonstrate the potential for scalable, real-time environmental monitoring. We consider that this research connects high-performance computing with energy-efficient hardware solutions, creating a practical framework for distributed environmental monitoring that reduces reliance on costly centralized monitoring systems. Our findings indicate that transformer-based multimodal models present a promising approach for mobile sensing applications, with opportunities for further improvement through seasonal data expansion and architectural refinements.

摘要

本文提出了一种使用移动摄像设备预测颗粒物(PM)浓度的新方法。针对日本各地持续存在的空气污染挑战,我们开发了一种系统,该系统利用基于前沿Transformer的深度学习架构,从智能手机摄像头拍摄的图像中估计PM值。我们的方法采用对比语言-图像预训练(CLIP)作为多模态框架,从环境场景中提取与PM浓度相关的视觉特征。我们首先通过对一维PM信号预测的时间序列模型进行对比分析,建立了一个基线,发现线性模型,特别是NLinear,在短期预测任务中优于复杂的Transformer架构。基于这些见解,我们实现了一个基于CLIP的二维图像分析系统,在对各种智能手机拍摄的图像进行测试时,其Top-1准确率达到0.24,Top-5准确率达到0.52。在图形处理单元(GPU)和单板计算机(SBC)平台上的性能评估突出了一条可行的边缘部署路径。GPU上每张图像的处理时间为0.29秒,而SBC上为2.68秒,这表明了可扩展的实时环境监测的潜力。我们认为,这项研究将高性能计算与节能硬件解决方案联系起来,创建了一个分布式环境监测的实用框架,减少了对昂贵的集中监测系统的依赖。我们的研究结果表明,基于Transformer的多模态模型为移动传感应用提供了一种很有前景的方法,通过季节性数据扩展和架构优化有进一步改进的机会。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1380/12251630/8a11ef4c020f/sensors-25-04053-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1380/12251630/23e235487eb0/sensors-25-04053-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1380/12251630/bdf769daa66d/sensors-25-04053-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1380/12251630/0818d0f1b304/sensors-25-04053-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1380/12251630/092f50f2d6c4/sensors-25-04053-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1380/12251630/11dba4db59e9/sensors-25-04053-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1380/12251630/fa86caf484af/sensors-25-04053-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1380/12251630/8a11ef4c020f/sensors-25-04053-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1380/12251630/23e235487eb0/sensors-25-04053-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1380/12251630/bdf769daa66d/sensors-25-04053-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1380/12251630/0818d0f1b304/sensors-25-04053-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1380/12251630/092f50f2d6c4/sensors-25-04053-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1380/12251630/11dba4db59e9/sensors-25-04053-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1380/12251630/fa86caf484af/sensors-25-04053-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1380/12251630/8a11ef4c020f/sensors-25-04053-g007.jpg

相似文献

1
Multimodal Particulate Matter Prediction: Enabling Scalable and High-Precision Air Quality Monitoring Using Mobile Devices and Deep Learning Models.多模态颗粒物预测:利用移动设备和深度学习模型实现可扩展且高精度的空气质量监测。
Sensors (Basel). 2025 Jun 29;25(13):4053. doi: 10.3390/s25134053.
2
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.
3
Computer and mobile technology interventions for self-management in chronic obstructive pulmonary disease.用于慢性阻塞性肺疾病自我管理的计算机和移动技术干预措施。
Cochrane Database Syst Rev. 2017 May 23;5(5):CD011425. doi: 10.1002/14651858.CD011425.pub2.
4
Short-Term Memory Impairment短期记忆障碍
5
Exploring the Potential of Electroencephalography Signal-Based Image Generation Using Diffusion Models: Integrative Framework Combining Mixed Methods and Multimodal Analysis.利用扩散模型探索基于脑电图信号的图像生成潜力:结合混合方法和多模态分析的综合框架
JMIR Med Inform. 2025 Jun 25;13:e72027. doi: 10.2196/72027.
6
Influence of early through late fusion on pancreas segmentation from imperfectly registered multimodal magnetic resonance imaging.早期至晚期融合对来自配准不完善的多模态磁共振成像的胰腺分割的影响。
J Med Imaging (Bellingham). 2025 Mar;12(2):024008. doi: 10.1117/1.JMI.12.2.024008. Epub 2025 Apr 26.
7
Artificial intelligence for diagnosing exudative age-related macular degeneration.人工智能在渗出性年龄相关性黄斑变性诊断中的应用。
Cochrane Database Syst Rev. 2024 Oct 17;10(10):CD015522. doi: 10.1002/14651858.CD015522.pub2.
8
A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.深度学习方法在自身免疫性大疱性疾病中的直接免疫荧光模式识别。
Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.
9
Comparison of self-administered survey questionnaire responses collected using mobile apps versus other methods.使用移动应用程序与其他方法收集的自我管理调查问卷回复的比较。
Cochrane Database Syst Rev. 2015 Jul 27;2015(7):MR000042. doi: 10.1002/14651858.MR000042.pub2.
10
Effectiveness of interventions to reduce household air pollution and/or improve health in homes using solid fuel in low-and-middle income countries: A systematic review and meta-analysis.中低收入国家使用固体燃料的家庭中,减少室内空气污染和/或改善健康的干预措施的效果:系统评价和荟萃分析。
Environ Int. 2017 Jun;103:73-90. doi: 10.1016/j.envint.2017.03.010. Epub 2017 Mar 22.

本文引用的文献

1
Low-cost video-based air quality estimation system using structured deep learning with selective state space modeling.基于低成本视频的空气质量评估系统,采用带选择性状态空间建模的结构化深度学习技术。
Environ Int. 2025 May;199:109496. doi: 10.1016/j.envint.2025.109496. Epub 2025 Apr 26.
2
Epidemiological studies on the health impact of air pollution in Japan: their contribution to the improvement of ambient air quality.日本空气污染对健康影响的流行病学研究:其对改善环境空气质量的贡献。
Environ Health Prev Med. 2025;30:30. doi: 10.1265/ehpm.25-00020.
3
Machine learning-enhanced high-resolution exposure assessment of ultrafine particles.
机器学习增强的超细颗粒高分辨率暴露评估
Nat Commun. 2025 Jan 31;16(1):1209. doi: 10.1038/s41467-025-56581-8.
4
A hybrid model for enhanced forecasting of PM spatiotemporal concentrations with high resolution and accuracy.一种混合模型,用于提高具有高分辨率和高精度的 PM 时空浓度预测。
Environ Pollut. 2024 Aug 15;355:124263. doi: 10.1016/j.envpol.2024.124263. Epub 2024 May 28.
5
Vision-Language Models for Vision Tasks: A Survey.用于视觉任务的视觉语言模型:一项综述。
IEEE Trans Pattern Anal Mach Intell. 2024 Aug;46(8):5625-5644. doi: 10.1109/TPAMI.2024.3369699. Epub 2024 Jul 2.
6
Improving PM prediction in New Delhi using a hybrid extreme learning machine coupled with snake optimization algorithm.利用混合极限学习机和蛇型优化算法提高新德里的 PM 预测精度。
Sci Rep. 2023 Nov 29;13(1):21057. doi: 10.1038/s41598-023-47492-z.
7
Multimodal Learning With Transformers: A Survey.基于Transformer的多模态学习:一项综述。
IEEE Trans Pattern Anal Mach Intell. 2023 Oct;45(10):12113-12132. doi: 10.1109/TPAMI.2023.3275156. Epub 2023 Sep 5.
8
The characteristics of PM emissions from construction sites during the earthwork and foundation stages: an empirical study evidence.施工工地在土方和基础阶段的 PM 排放特征:实证研究证据。
Environ Sci Pollut Res Int. 2023 May;30(22):62716-62732. doi: 10.1007/s11356-023-26494-4. Epub 2023 Mar 22.
9
A Review of Generalized Zero-Shot Learning Methods.广义零样本学习方法综述
IEEE Trans Pattern Anal Mach Intell. 2023 Apr;45(4):4051-4070. doi: 10.1109/TPAMI.2022.3191696. Epub 2023 Mar 7.
10
Enhancing PM Prediction Using NARX-Based Combined CNN and LSTM Hybrid Model.基于 NARX 的卷积神经网络和长短期记忆网络混合模型增强 PM 预测。
Sensors (Basel). 2022 Jun 11;22(12):4418. doi: 10.3390/s22124418.