Ma Shuai, Shen Tao, Wang Rui-qi, Lai Hua, Yu Zheng-tao
Guang Pu Xue Yu Guang Pu Fen Xi. 2015 Dec;35(12):3325-9.
Feature extraction and classification are the key issues of terahertz spectroscopy identification. Because many materials have no apparent absorption peaks in the terahertz band, it is difficult to extract theirs terahertz spectroscopy feature and identify. To this end, a novel of identify terahertz spectroscopy approach with Deep Belief Network (DBN) was studied in this paper, which combines the advantages of DBN and K-Nearest Neighbors (KNN) classifier. Firstly, cubic spline interpolation and S-G filter were used to normalize the eight kinds of substances (ATP, Acetylcholine Bromide, Bifenthrin, Buprofezin, Carbazole, Bleomycin, Buckminster and Cylotriphosphazene) terahertz transmission spectra in the range of 0.9-6 THz. Secondly, the DBN model was built by two restricted Boltzmann machine (RBM) and then trained layer by layer using unsupervised approach. Instead of using handmade features, the DBN was employed to learn suitable features automatically with raw input data. Finally, a KNN classifier was applied to identify the terahertz spectrum. Experimental results show that using the feature learned by DBN can identify the terahertz spectrum of different substances with the recognition rate of over 90%, which demonstrates that the proposed method can automatically extract the effective features of terahertz spectrum. Furthermore, this KNN classifier was compared with others (BP neural network, SOM neural network and RBF neural network). Comparisons showed that the recognition rate of KNN classifier is better than the other three classifiers. Using the approach that automatic extract terahertz spectrum features by DBN can greatly reduce the workload of feature extraction. This proposed method shows a promising future in the application of identifying the mass terahertz spectroscopy.
特征提取与分类是太赫兹光谱识别的关键问题。由于许多材料在太赫兹波段没有明显的吸收峰,因此难以提取其太赫兹光谱特征并进行识别。为此,本文研究了一种基于深度置信网络(DBN)的新型太赫兹光谱识别方法,该方法结合了DBN和K近邻(KNN)分类器的优点。首先,采用三次样条插值和S-G滤波器对0.9 - 6 THz范围内的八种物质(三磷酸腺苷、溴化乙酰胆碱、联苯菊酯、噻嗪酮、咔唑、博来霉素、富勒烯和环三聚磷腈)的太赫兹透射光谱进行归一化处理。其次,通过两个受限玻尔兹曼机(RBM)构建DBN模型,然后采用无监督方法逐层训练。DBN不是使用手工制作的特征,而是利用原始输入数据自动学习合适的特征。最后,应用KNN分类器对太赫兹光谱进行识别。实验结果表明,利用DBN学习到的特征能够识别不同物质的太赫兹光谱,识别率超过90%,这表明所提方法能够自动提取太赫兹光谱的有效特征。此外,将该KNN分类器与其他分类器(BP神经网络、SOM神经网络和RBF神经网络)进行了比较。比较结果表明,KNN分类器的识别率优于其他三种分类器。利用DBN自动提取太赫兹光谱特征的方法可以大大减少特征提取的工作量。所提方法在大量太赫兹光谱识别应用中显示出广阔的前景。