Wang Jing, Han Yongxiang, Yu Xingna, Zhang Zefeng, Song Tongai
Key Laboratory for Aerosol-Cloud-Precipitation of China Meteorological Administration, Nanjing University of Information Science & Technology, Nanjing 210044, China.
Key Laboratory for Aerosol-Cloud-Precipitation of China Meteorological Administration, Nanjing University of Information Science & Technology, Nanjing 210044, China.
Sci Total Environ. 2024 Jul 20;935:172743. doi: 10.1016/j.scitotenv.2024.172743. Epub 2024 Apr 26.
Accurately identifying the authentic local aerosol types is one of the fundamental tasks in studying aerosol radiative effects and model assessment. In this paper, improvements were made to the traditional Gaussian Mixture Model, leading to the following results: 1) This study introduces several improvements to the traditional Gaussian Mixture Model (GMM), referred to as M-GMMs. These improvements include the incorporation of multivariate kurtosis coefficients, Mahalanobis distance instead of Euclidean distance, and weights of variables. The M-GMMs overcome the issues related to dimensional units and correlations among multiple parameters, thereby enhancing the estimation of the covariance matrix. 2) The proposed M-GMMs model was evaluated for its clustering performance using machine-generated data with known classifications and real iris flower data. The results demonstrated that the classification performance of M-GMMs was superior to other models. Furthermore, compared to the slightly less effective K-means algorithm (which requires manual definition of the number of aerosol types), the M-GMMs model was able to automatically iterate and produce consistent classification results based on similar characteristics. 3) There is still a significant disparity between the characteristics of real stations and typical aerosols. Directly evaluating local aerosols using the characteristics of typical aerosols results in substantial errors. However, the M-GMMs model can effectively reflect the authentic aerosol characteristics at the local level. 4) The M-GMMs model was utilized to perform cluster analysis on the Xuzhou and Nanjing stations of AERONET. This analysis yielded quantitative proportions, temporal distribution characteristics, and spectral distribution features of aerosol types in the two regions. The improved M-GMMs model presented in this paper enables more accurate and continuous characterization of aerosol type variations. Its findings hold significant theoretical and practical value in reassessing aerosol radiative effects.
准确识别真实的局地气溶胶类型是研究气溶胶辐射效应和模型评估的基本任务之一。本文对传统高斯混合模型进行了改进,得到以下结果:1)本研究对传统高斯混合模型(GMM)进行了若干改进,称为M-GMMs。这些改进包括纳入多元峰度系数、马氏距离而非欧氏距离以及变量权重。M-GMMs克服了与维度单位和多个参数之间相关性相关的问题,从而增强了协方差矩阵的估计。2)使用具有已知分类的机器生成数据和真实鸢尾花数据对所提出的M-GMMs模型的聚类性能进行了评估。结果表明,M-GMMs的分类性能优于其他模型。此外,与效果稍差的K-means算法(需要手动定义气溶胶类型数量)相比,M-GMMs模型能够自动迭代并根据相似特征产生一致的分类结果。3)实际站点的特征与典型气溶胶之间仍存在显著差异。直接使用典型气溶胶的特征评估局地气溶胶会导致大量误差。然而,M-GMMs模型能够有效反映局地真实的气溶胶特征。4)利用M-GMMs模型对AERONET的徐州和南京站点进行聚类分析。该分析得出了两个地区气溶胶类型的定量比例、时间分布特征和光谱分布特征。本文提出的改进M-GMMs模型能够更准确、连续地表征气溶胶类型变化。其研究结果在重新评估气溶胶辐射效应方面具有重要的理论和实用价值。