Suppr超能文献

使用Adaboost算法和近红外光谱法区分不同品牌香烟的可行性研究。

Study of the feasibility of distinguishing cigarettes of different brands using an Adaboost algorithm and near-infrared spectroscopy.

作者信息

Tan Chao, Li Menglong, Qin Xin

机构信息

College of Chemistry, Sichuan University, Chengdu, Sichuan 610064, China.

出版信息

Anal Bioanal Chem. 2007 Sep;389(2):667-74. doi: 10.1007/s00216-007-1461-2. Epub 2007 Jul 20.

Abstract

The feasibility of utilizing an Adaboost algorithm in conjuction with near-infrared (NIR) spectroscopy to automatically distinguish cigarettes of different brands was explored. Simple linear discriminant analysis (LDA) was used as the base algorithm to train all weak classifiers in Adaboost. Both principal component analysis (PCA) and its kernel version (kernel principal component analysis, KPCA) were used for feature extraction and were also compared to each other. The influence of the training set size on the final classification model was also investigated. Using a case study, it was demonstrated that Adaboost coupled with PCA or KPCA can obviously improve the ability to discriminate between samples that cannot be separated by a single linear classifier. However, in term of the overall performance, KPCA appears preferable to PCA for feature extraction, especially when the samples used for training are relatively small. The results also indicate that more training samples should be applied, if possible, in order to fully demonstrate the superiority of Adaboost. It seems that the use of an Adaboost algorithm in conjunction with NIR spectroscopy in combination with KPCA for feature extraction comprises a promising tool for distinguishing cigarettes of different brands, especially in situations where there is an obvious overlap between the NIR spectra afforded by cigarettes of different brands.

摘要

探讨了将Adaboost算法与近红外(NIR)光谱相结合以自动区分不同品牌香烟的可行性。使用简单线性判别分析(LDA)作为基础算法来训练Adaboost中的所有弱分类器。主成分分析(PCA)及其核版本(核主成分分析,KPCA)均用于特征提取,并且也相互进行了比较。还研究了训练集大小对最终分类模型的影响。通过一个案例研究表明,Adaboost与PCA或KPCA相结合可以明显提高区分单个线性分类器无法分离的样本的能力。然而,就整体性能而言,KPCA在特征提取方面似乎比PCA更可取,尤其是当用于训练的样本相对较少时。结果还表明,如果可能的话,应使用更多的训练样本,以充分展示Adaboost的优越性。将Adaboost算法与NIR光谱相结合并结合KPCA进行特征提取,似乎是区分不同品牌香烟的一种有前途的工具,尤其是在不同品牌香烟的近红外光谱存在明显重叠的情况下。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验