Daba Sintayehu D, Honigs David, McGee Rebecca J, Kiszonas Alecia M
USDA-ARS Western Wheat Quality Laboratory, E-202 Food Quality Building, Washington State University, Pullman, WA 99164, USA.
PerkinElmer Inc., Waltham, MA 02451, USA.
Foods. 2022 Nov 18;11(22):3701. doi: 10.3390/foods11223701.
Breeding for increased protein concentration is a priority in field peas. Having a quick, accurate, and non-destructive protein quantification method is critical for screening breeding materials, which the near-infrared spectroscopy (NIRS) system can provide. Partial least square regression (PLSR) models to predict protein concentration were developed and compared for DA7250 and FT9700 NIRS systems. The reference protein data were accurate and exhibited a wider range of variation (15.3−29.8%). Spectral pre-treatments had no clear advantage over analyses based on raw spectral data. Due to the large number of samples used in this study, prediction accuracies remained similar across calibration sizes. The final PLSR models for the DA7250 and FT9700 systems required 10 and 13 latent variables, respectively, and performed well and were comparable (R2 = 0.72, RMSE = 1.22, and bias = 0.003 for DA7250; R2 = 0.79, RMSE = 1.23, and bias = 0.055 for FT9700). Considering three groupings for protein concentration (Low: <20%, Medium: ≥20%, but ≤25%, and High: >25%), none of the entries changed from low to high or vice versa between the observed and predicted values for the DA7250 system. Only a single entry moved from a low category in the observed data to a high category in the predicted data for the FT9700 system in the calibration set. Although the FT9700 system outperformed the DA7250 system by a small margin, both systems had the potential to predict protein concentration in pea seeds for breeding purposes. Wavelengths between 950 nm and 1650 nm accounted for most of the variation in pea protein concentration.
提高蛋白质含量的育种是豌豆育种的一个重点。拥有一种快速、准确且无损的蛋白质定量方法对于筛选育种材料至关重要,近红外光谱(NIRS)系统能够提供这样的方法。针对DA7250和FT9700近红外光谱系统,开发并比较了用于预测蛋白质含量的偏最小二乘回归(PLSR)模型。参考蛋白质数据准确,且呈现出更广泛的变化范围(15.3 - 29.8%)。光谱预处理相较于基于原始光谱数据的分析并无明显优势。由于本研究中使用的样本数量众多,不同校准规模下的预测准确性保持相似。DA7250和FT9700系统的最终PLSR模型分别需要10个和13个潜变量,表现良好且具有可比性(DA7250的R2 = 0.72,RMSE = 1.22,偏差 = 0.003;FT9700的R2 = 0.79,RMSE = 1.23,偏差 = 0.055)。考虑蛋白质含量的三个分组(低:<20%,中:≥20%但≤25%,高:>25%),对于DA7250系统,在观测值和预测值之间,没有条目从低类别变为高类别或反之亦然。在校准集中,对于FT9700系统,只有一个条目在观测数据中属于低类别,而在预测数据中变为高类别。尽管FT9700系统略优于DA7250系统,但两个系统都有潜力用于预测豌豆种子中蛋白质含量以用于育种目的。波长在950纳米至1650纳米之间的波段占豌豆蛋白质含量变化的大部分。