Jia Menghan, Hui Zhaoyan, Zhang Hui, Gao Yu, Tong Meiqi, Ma Yinan
School of Energy and Environmental Engineering, University of Science and Technology Beijing, Beijing 100083, China.
Se Pu. 2021 Jun;39(6):670-677. doi: 10.3724/SP.J.1123.2020.11009.
The detection and analysis of spectral peaks play an important role in research on chromatography technology. However, in the process of collecting and transmitting chromatographic data, it is very difficult to detect spectral peaks owing to the interference of different levels of noise. Most of the traditional spectral peak detection algorithms follow three steps: spectral smoothing, baseline correction, and spectral peak recognition, which require high denoising and curve smoothing, and therefore increase the complexity of the algorithm. In addition, a traditional spectrum peak detection algorithm generally defines the shape of the spectrum peak by applying the base deduction method, and divides the spectrum peak into a single peak, overlapping peaks, and so on. Different detection methods are used for different types of spectral peaks, which lead to the shortcomings of traditional peak detection algorithms, such as high complexity, low automation, and susceptibility to distortion. Therefore, this study proposes a novel peak detection algorithm developed using a different point of view. The algorithm omits the base subtraction and spectral peak classification steps and instead detects spectral peaks directly based on the source data curve. In a traditional spectrum peak detection algorithm, the spectrum peak classification depends on determining a baseline. If the baseline is adjusted, the baseline will fit the spectrum peak more closely. At this time, the overlapping peaks can be regarded as two connected peaks. However, there is no so-called baseline in the source data curve, and therefore the proposed algorithm cannot classify the spectral peaks using the baseline approach. Instead, an obvious bulge or depression in the source curve is considered to be the spectral peak. This algorithm essentially performs three steps: discrete difference, trend accumulation, and searching for all peaks. First, the difference between adjacent data is obtained using a discrete difference process. The difference value is compared with 0, and either a 1 or -1 value is used to replace the difference value to reflect the data fluctuation trend. The signals representing the trend are accumulated, and the spectrum peak is located according to the sum of the accumulated signals. The algorithm uses three-point location; that is, the peak starting point, extreme point, and peak end point are used to describe the position of a spectral peak. Finally, according to the spectrum peaks obtained in the previous step, the magnitude of each peak is calculated, and the spectrum peaks are screened by a sorting method. In this manner, the algorithm skips the base subtraction part and obtains the spectrum peak directly. Therefore, to obtain the base part, the peak subtraction method is applied. This study used the C language to design and write the algorithm, and nitrogen adsorption and desorption chromatographic curves measured by several dynamic specific surface area analyzers were detected and analyzed. The results indicate that the proposed algorithm can accurately distinguish the peak part from the base part, and is robust to data curve burr, vibration, and other types of noise. The three-point location of the spectrum peak is very accurate and is not affected by its complex morphology. Therefore, it has strong universality. Compared with other algorithms, this algorithm has the advantages of accurate positioning, clear structure, and good stability and reliability. The application of the proposed peak detection methods such as base-free deduction and trend accumulation, in the adsorption and desorption chromatographic curve and has been proven effective in the determination of absorption and desorption chromatographic peaks.
光谱峰的检测与分析在色谱技术研究中起着重要作用。然而,在收集和传输色谱数据的过程中,由于不同程度噪声的干扰,很难检测到光谱峰。大多数传统的光谱峰检测算法遵循三个步骤:光谱平滑、基线校正和光谱峰识别,这需要高去噪和曲线平滑,因此增加了算法的复杂性。此外,传统的光谱峰检测算法通常通过应用基线扣除方法来定义光谱峰的形状,并将光谱峰分为单峰、重叠峰等。针对不同类型的光谱峰使用不同的检测方法,这导致了传统峰检测算法的缺点,如高复杂性、低自动化和易失真。因此,本研究提出了一种从不同角度开发的新型峰检测算法。该算法省略了基线扣除和光谱峰分类步骤,而是直接基于源数据曲线检测光谱峰。在传统的光谱峰检测算法中,光谱峰分类取决于确定基线。如果调整基线,基线将更紧密地拟合光谱峰。此时,重叠峰可被视为两个相连的峰。然而,源数据曲线中没有所谓的基线,因此所提出的算法不能使用基线方法对光谱峰进行分类。相反,源曲线中明显的凸起或凹陷被视为光谱峰。该算法主要执行三个步骤:离散差分、趋势累加和搜索所有峰。首先,通过离散差分过程获得相邻数据之间的差值。将差值与0进行比较,并用1或 -1值替换差值以反映数据波动趋势。表示趋势的信号被累加,并根据累加信号的总和定位光谱峰。该算法使用三点定位;即,峰起点、极值点和峰终点用于描述光谱峰的位置。最后,根据上一步获得的光谱峰,计算每个峰的幅度,并通过排序方法筛选光谱峰。通过这种方式,该算法跳过了基线扣除部分,直接获得了光谱峰。因此,为了获得基线部分,应用了峰减法。本研究使用C语言设计并编写了该算法,并对几种动态比表面积分析仪测量的氮吸附和解吸色谱曲线进行了检测和分析。结果表明,所提出的算法能够准确地将峰部分与基线部分区分开来,并且对数据曲线毛刺、振动和其他类型的噪声具有鲁棒性。光谱峰的三点定位非常准确,不受其复杂形态的影响。因此,它具有很强的通用性。与其他算法相比,该算法具有定位准确、结构清晰、稳定性和可靠性好的优点。所提出的无基线扣除和趋势累加等峰检测方法在吸附和解吸色谱曲线中的应用,已被证明在吸附和解吸色谱峰的测定中是有效的。