Guo Jianru, Chen QianQian, Wang Caiyun, Qiu Hongcong, Liu Buming, Jiang Zhi-Hong, Zhang Wei
State Key Laboratory of Quality Research in Chinese Medicines, Macau Institute for Applied Research in Medicine and Health, Macau University of Science and Technology, Taipa, Macau, China.
Anal Bioanal Chem. 2015 Feb;407(5):1389-401. doi: 10.1007/s00216-014-8371-x. Epub 2014 Dec 13.
In this study, unsupervised and supervised classification methods were compared for comprehensive analysis of the fingerprints of 26 Phyllanthus samples from different geographical regions and species. A total of 63 compounds were identified and tentatively assigned structures for the establishment of fingerprints using high-performance liquid chromatography time-of-flight mass spectrometry (HPLC/TOFMS). Unsupervised and supervised pattern recognition technologies including principal component analysis (PCA), nearest neighbors algorithm (NN), partial least squares discriminant analysis (PLS-DA), and artificial neural network (ANN) were employed. Results showed that Phyllanthus could be correctly classified according to their geographical locations and species through ANN and PLS-DA. Important variables for clusters discrimination were also identified by PCA. Although unsupervised and supervised pattern recognitions have their own disadvantage and application scope, they are effective and reliable for studying fingerprints of traditional Chinese medicines (TCM). These two technologies are complementary and can be superimposed. Our study is the first holistic comparison of supervised and unsupervised pattern recognition technologies in the TCM chemical fingerprinting. They showed advantages in sample classification and data mining, respectively.
在本研究中,对无监督和有监督分类方法进行了比较,以综合分析来自不同地理区域和物种的26种叶下珠样品的指纹图谱。使用高效液相色谱-飞行时间质谱(HPLC/TOFMS)共鉴定出63种化合物,并初步确定其结构以建立指纹图谱。采用了包括主成分分析(PCA)、最近邻算法(NN)、偏最小二乘判别分析(PLS-DA)和人工神经网络(ANN)在内的无监督和有监督模式识别技术。结果表明,通过人工神经网络和偏最小二乘判别分析,叶下珠可以根据其地理位置和物种进行正确分类。主成分分析还确定了用于聚类判别的重要变量。尽管无监督和有监督模式识别都有各自的缺点和应用范围,但它们对于研究中药指纹图谱是有效且可靠的。这两种技术是互补的,可以叠加使用。我们的研究是首次对中药化学指纹图谱中的有监督和无监督模式识别技术进行全面比较。它们分别在样品分类和数据挖掘方面显示出优势。