Zhao Li-Ting, Xiang Yu-Hong, Dai Yin-Mei, Zhang Zhuo-Yong
Department of Chemistry, Capital Normal University, Beijing 100048, China.
Guang Pu Xue Yu Guang Pu Fen Xi. 2010 Apr;30(4):901-5.
Near infrared spectroscopy was applied to measure the tissue slice of endometrial tissues for collecting the spectra. A total of 154 spectra were obtained from 154 samples. The number of normal, hyperplasia, and malignant samples was 36, 60, and 58, respectively. Original near infrared spectra are composed of many variables, for example, interference information including instrument errors and physical effects such as particle size and light scatter. In order to reduce these influences, original spectra data should be performed with different spectral preprocessing methods to compress variables and extract useful information. So the methods of spectral preprocessing and wavelength selection have played an important role in near infrared spectroscopy technique. In the present paper the raw spectra were processed using various preprocessing methods including first derivative, multiplication scatter correction, Savitzky-Golay first derivative algorithm, standard normal variate, smoothing, and moving-window median. Standard deviation was used to select the optimal spectral region of 4 000-6 000 cm(-1). Then principal component analysis was used for classification. Principal component analysis results showed that three types of samples could be discriminated completely and the accuracy almost achieved 100%. This study demonstrated that near infrared spectroscopy technology and chemometrics method could be a fast, efficient, and novel means to diagnose cancer. The proposed methods would be a promising and significant diagnosis technique of early stage cancer.
采用近红外光谱法对子宫内膜组织切片进行测量以采集光谱。共从154个样本中获得了154个光谱。正常、增生和恶性样本的数量分别为36、60和58。原始近红外光谱由许多变量组成,例如包括仪器误差在内的干扰信息以及诸如粒度和光散射等物理效应。为了减少这些影响,应使用不同的光谱预处理方法对原始光谱数据进行处理,以压缩变量并提取有用信息。因此,光谱预处理和波长选择方法在近红外光谱技术中发挥了重要作用。在本文中,对原始光谱使用了各种预处理方法进行处理,包括一阶导数、多元散射校正、Savitzky-Golay一阶导数算法、标准正态变量变换、平滑处理和移动窗口中位数法。使用标准差来选择4000 - 6000 cm(-1)的最佳光谱区域。然后采用主成分分析进行分类。主成分分析结果表明,三种类型的样本能够被完全区分,准确率几乎达到100%。本研究表明,近红外光谱技术和化学计量学方法可以成为一种快速、高效且新颖的癌症诊断手段。所提出的方法将是一种有前景且重要的早期癌症诊断技术。