Um Jung A, Choi Young-Geun, Lee Dong-Kyu, Lee Yun Sun, Lim Chang Ju, Youn Young A, Lee Hwa Dong, Cho Hi Jae, Park Jeong Hill, Seo Young Bae, Kuo Hsun-chih, Lim Johan, Yang Tae-Jin, Kwon Sung Won, Lee Jeongmi
College of Pharmacy, Seoul National University, Seoul, 151-742, South Korea.
Anal Bioanal Chem. 2013 Sep;405(23):7523-34. doi: 10.1007/s00216-013-7182-9. Epub 2013 Jul 16.
Sixty peony root training samples of the same age were collected from various regions in Korea and China, and their genetic diversity was investigated for 23 chloroplast intergenic space regions. All samples were genetically indistinguishable, indicating that the DNA-based techniques employed were not appropriate for determining the samples' regions of origin. In contrast, (1)H-nuclear magnetic resonance ((1)H-NMR) spectroscopy-based metabolomics coupled with multivariate statistical analysis revealed a clear difference between the metabolic profiles of the Korean and Chinese samples. Orthogonal projections on the latent structure-discrimination analysis allowed the identification of potential metabolite markers, including γ-aminobutyric acid, arginine, alanine, paeoniflorin, and albiflorin, that could be useful for classifying the samples' regions of origin. The validity of the discrimination model was tested using the response permutation test and blind prediction test for internal and external validations, respectively. Metabolomic data of 21 blended samples consisting of Korean and Chinese samples mixed at various proportions were also acquired by (1)H-NMR analysis. After data preprocessing which was designed to eliminate uncontrolled deviations in the spectral data between the testing and training sets, a new statistical procedure for estimating the mixing proportions of blended samples was established using the constrained least squares method for the first time. The predictive procedure exhibited relatively good predictability (adjusted R (2) = 0.7669), and thus has the potential to be used in the quality control of peony root by providing correct indications for a sample's geographical origins.
从韩国和中国的不同地区收集了60个同龄的芍药根训练样本,并对其23个叶绿体基因间隔区的遗传多样性进行了研究。所有样本在基因上无法区分,这表明所采用的基于DNA的技术不适用于确定样本的原产地。相比之下,基于氢核磁共振(¹H-NMR)光谱的代谢组学结合多变量统计分析显示,韩国和中国样本的代谢谱存在明显差异。潜在结构判别分析的正交投影能够识别潜在的代谢物标记物,包括γ-氨基丁酸、精氨酸、丙氨酸、芍药苷和 albiflorin,这些标记物可用于对样本的原产地进行分类。分别使用响应置换检验和盲预测检验对内部和外部验证进行判别模型的有效性测试。还通过¹H-NMR分析获得了21个由韩国和中国样本按不同比例混合而成的混合样本的代谢组学数据。在进行旨在消除测试集和训练集之间光谱数据中不受控制偏差的数据预处理后,首次使用约束最小二乘法建立了一种估计混合样本混合比例的新统计程序。该预测程序表现出相对较好的预测能力(调整后R² = 0.7669),因此有潜力通过为样本的地理来源提供正确指示而用于芍药根的质量控制。