• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于支持向量机方法和成本敏感分类方案的地面臭氧预测

Ground-level ozone prediction by support vector machine approach with a cost-sensitive classification scheme.

作者信息

Lu Wei-Zhen, Wang Dong

机构信息

Department of Building and Construction, City University of Hong Kong, Kowloon Tong, Kowloon, Hong Kong.

出版信息

Sci Total Environ. 2008 Jun 1;395(2-3):109-16. doi: 10.1016/j.scitotenv.2008.01.035. Epub 2008 Mar 10.

DOI:10.1016/j.scitotenv.2008.01.035
PMID:18329697
Abstract

For ground-level ozone (O(3)) prediction, a predictive model, with reliable performance not only on non-polluted days but, more importantly, on polluted days, is favored by public authorities to issue alerts, so that concerned citizens and industrial organizations could take precautions to avoid exposure and reduce harmful emissions. However, the class imbalance problem, i.e., in some collected field data, number of O(3) polluted days are much smaller than that of non-polluted days, will deteriorate the model performance on minority class-O(3) polluted days. Despite support vector machine (SVM) obtaining promising results in air quality prediction, in this study, a cost-sensitive classification scheme is proposed for the standard support vector classification model (S-SVC) in order to investigate whether the class imbalance plagues S-SVC. The S-SVC with such scheme is named as CS-SVC. Experiments on imbalanced data sets collected from two air quality monitoring sites in Hong Kong show that 1) S-SVC is still sensitive to class imbalance problem; 2) compared with S-SVC, CS-SVC effectively avoids class imbalance problem with lower percentage of false negative on O(3) polluted days but with higher percentage of false positive on non-polluted days; 3) compared with both S-SVC and CS-SVC, support vector regression model (SVR), after converting its output to binary one, only has similar performance with S-SVC, which indicates class imbalance problem also impairs the regressor model. From point of protecting public health, CS-SVC, which less likely misses to forecast O(3) polluted days, is recommended here.

摘要

对于地面臭氧(O₃)预测而言,一种预测模型不仅在无污染日表现可靠,更重要的是在污染日也表现可靠,这受到公共当局的青睐,以便发布警报,让相关公民和工业组织能够采取预防措施以避免暴露并减少有害排放。然而,类别不平衡问题,即在一些收集的现场数据中,O₃污染日的数量远少于无污染日的数量,会使模型在少数类别——O₃污染日的性能变差。尽管支持向量机(SVM)在空气质量预测中取得了不错的结果,但在本研究中,为标准支持向量分类模型(S - SVC)提出了一种成本敏感分类方案,以研究类别不平衡是否困扰S - SVC。具有这种方案的S - SVC被命名为CS - SVC。对从香港两个空气质量监测站点收集的不平衡数据集进行的实验表明:1)S - SVC对类别不平衡问题仍然敏感;2)与S - SVC相比,CS - SVC有效避免了类别不平衡问题,在O₃污染日的假阴性百分比更低,但在无污染日的假阳性百分比更高;3)与S - SVC和CS - SVC相比,支持向量回归模型(SVR)在将其输出转换为二元输出后,仅具有与S - SVC相似的性能,这表明类别不平衡问题也损害了回归模型。从保护公众健康的角度出发,这里推荐CS - SVC,因为它不太可能漏报O₃污染日。

相似文献

1
Ground-level ozone prediction by support vector machine approach with a cost-sensitive classification scheme.基于支持向量机方法和成本敏感分类方案的地面臭氧预测
Sci Total Environ. 2008 Jun 1;395(2-3):109-16. doi: 10.1016/j.scitotenv.2008.01.035. Epub 2008 Mar 10.
2
Interval estimation of urban ozone level and selection of influential factors by employing automatic relevance determination model.利用自动相关性确定模型对城市臭氧水平进行区间估计及影响因素选择。
Chemosphere. 2006 Mar;62(10):1600-11. doi: 10.1016/j.chemosphere.2005.06.047. Epub 2005 Aug 9.
3
Prediction of maximum daily ozone level using combined neural network and statistical characteristics.利用组合神经网络和统计特征预测每日最大臭氧水平
Environ Int. 2003 Aug;29(5):555-62. doi: 10.1016/S0160-4120(03)00013-8.
4
A study of ozone variation trend within area of affecting human health in Hong Kong.香港人类健康影响区内臭氧变化趋势研究。
Chemosphere. 2003 Sep;52(9):1405-10. doi: 10.1016/S0045-6535(03)00476-4.
5
Potential assessment of the "support vector machine" method in forecasting ambient air pollutant trends.“支持向量机”方法在预测环境空气污染物趋势中的潜力评估。
Chemosphere. 2005 Apr;59(5):693-701. doi: 10.1016/j.chemosphere.2004.10.032. Epub 2004 Dec 16.
6
Forecasting of ozone episode days by cost-sensitive neural network methods.基于成本敏感神经网络方法的臭氧事件天数预测
Sci Total Environ. 2009 Mar 1;407(6):2124-35. doi: 10.1016/j.scitotenv.2008.12.007. Epub 2009 Jan 20.
7
Using air quality modeling to study source-receptor relationships between nitrogen oxides emissions and ozone exposures over the United States.利用空气质量模型研究美国氮氧化物排放与臭氧暴露之间的源-受体关系。
Environ Int. 2009 Nov;35(8):1109-17. doi: 10.1016/j.envint.2009.06.008. Epub 2009 Aug 4.
8
Evaluation of the Mesoscale Meteorological Model (MM5)-Community Multi-Scale Air Quality Model (CMAQ) performance in hindcast and forecast of ground-level ozone.中尺度气象模型(MM5)-社区多尺度空气质量模型(CMAQ)在地面臭氧后报和预报中的性能评估。
J Air Waste Manag Assoc. 2008 Oct;58(10):1341-50.
9
Potential assessment of a neural network model with PCA/RBF approach for forecasting pollutant trends in Mong Kok urban air, Hong Kong.采用主成分分析/径向基函数方法的神经网络模型对香港旺角市区空气污染趋势进行预测的潜力评估。
Environ Res. 2004 Sep;96(1):79-87. doi: 10.1016/j.envres.2003.11.003.
10
Simulation of ozone formation at different elevations in mountainous area of Hong Kong using WRF-CMAQ model.利用 WRF-CMAQ 模式模拟香港山区不同海拔高度的臭氧形成。
Sci Total Environ. 2015 Feb 1;505:939-51. doi: 10.1016/j.scitotenv.2014.10.070. Epub 2014 Nov 7.

引用本文的文献

1
Optimization of Skewed Data Using Sampling-Based Preprocessing Approach.基于抽样预处理方法的偏态数据优化。
Front Public Health. 2020 Jul 16;8:274. doi: 10.3389/fpubh.2020.00274. eCollection 2020.
2
Prediction of Indoor Air Exposure from Outdoor Air Quality Using an Artificial Neural Network Model for Inner City Commercial Buildings.使用人工神经网络模型预测市中心商业建筑室内空气暴露量与室外空气质量的关系
Int J Environ Res Public Health. 2015 Dec 1;12(12):15233-53. doi: 10.3390/ijerph121214975.
3
Observational studies and a statistical early warning of surface ozone pollution in Tangshan, the largest heavy industry city of North China.
观测研究和统计预警:华北最大重工业城市唐山的地表臭氧污染。
Int J Environ Res Public Health. 2013 Mar 13;10(3):1048-61. doi: 10.3390/ijerph10031048.
4
Improving predictions in imbalanced data using Pairwise Expanded Logistic Regression.使用成对扩展逻辑回归改善不平衡数据中的预测。
AMIA Annu Symp Proc. 2011;2011:625-34. Epub 2011 Oct 22.