Radioisotope Science and Technology Division, 6146Oak Ridge National Laboratory, Oak Ridge, TN, USA.
Appl Spectrosc. 2021 Sep;75(9):1155-1167. doi: 10.1177/0003702820987281. Epub 2021 Jan 22.
Implementing remote, real-time spectroscopic monitoring of radiochemical processing streams in hot cell environments requires efficiency and simplicity. The success of optical spectroscopy for the quantification of species in chemical systems highly depends on representative training sets and suitable validation sets. Selecting a training set (i.e., calibration standards) to build multivariate regression models is both time- and resource-consuming using standard one-factor-at-a-time approaches. This study describes the use of experimental design to generate spectral training sets and a validation set for the quantification of sodium nitrate (0-1 M) and nitric acid (0.1-10 M) using the near-infrared water band centered at 1440 nm. Partial least squares regression models were built from training sets generated by both D- and I-optimal experimental designs and a one-factor-at-a-time approach. The prediction performance of each model was evaluated by comparing the bias and standard error of prediction for statistical significance. D- and I-optimal designs reduced the number of samples required to build regression models compared with one-factor-at-a-time while also improving performance. Models must be confirmed against a validation sample set when minimizing the number of samples in the training set. The D-optimal design performed the best when considering both performance and efficiency by improving predictive capability and reducing number of samples in the training set by 64% compared with the one-factor-at-a-time approach. The experimental design approach objectively selects calibration and validation spectral data sets based on statistical criterion to optimize performance and minimize resources.
在热室环境中实现放射性化学处理流的远程实时光谱监测需要高效和简单。光学光谱法在化学系统中定量分析物种的成功高度依赖于有代表性的训练集和合适的验证集。使用标准的单因素逐一方法选择用于构建多元回归模型的训练集(即校准标准)既费时又费力。本研究描述了使用实验设计来生成光谱训练集和验证集,用于使用近红外水带(中心在 1440nm)定量分析硝酸钠(0-1M)和硝酸(0.1-10M)。偏最小二乘回归模型是由 D-和 I-最优实验设计以及单因素逐一方法生成的训练集构建的。通过比较预测的偏差和预测的标准误差来评估每个模型的预测性能,以确定统计显著性。与单因素逐一方法相比,D-和 I-最优设计减少了构建回归模型所需的样本数量,同时也提高了性能。在最小化训练集样本数量时,模型必须经过验证样本集的确认。与单因素逐一方法相比,D-最优设计通过提高预测能力和将训练集样本数量减少 64%,在考虑性能和效率方面表现最佳。实验设计方法基于统计标准客观地选择校准和验证光谱数据集,以优化性能并最小化资源。