傅里叶变换红外模型校准中观测值数量对牛奶奶脂肪成分预测数据估计遗传参数的影响。

The effect of the number of observations used for Fourier transform infrared model calibration for bovine milk fat composition on the estimated genetic parameters of the predicted data.

机构信息

Animal Breeding and Genomics Centre, Wageningen University, PO Box 338, 6700 AH Wageningen, the Netherlands.

出版信息

J Dairy Sci. 2010 Oct;93(10):4872-82. doi: 10.3168/jds.2010-3157.

DOI:10.3168/jds.2010-3157

PMID:20855022

Abstract

Fourier transform infrared spectroscopy is a suitable method to determine bovine milk fat composition. However, the determination of fat composition by gas chromatography, required for calibration of the infrared prediction model, is expensive and labor intensive. It has recently been shown that the number of calibration samples is strongly related to the model's validation r(2) (i.e., accuracy of prediction). However, the effect of the number of calibration samples used, and therefore validation r(2), on the estimated genetic parameters of data predicted using the model needs to be established. To this end, 235 calibration data subsets of different sizes were sampled: n=100, n=250, n=500, and n=1,000 calibration samples. Subsequently, these data subsets were used to calibrate fat composition prediction models for 2 specific fatty acids: C16:0 and C18u (where u=unsaturated). Next, genetic parameters were estimated on predicted fat composition data for these fatty acids. Strong relationships between the number of calibration samples and validation r(2), as well as strong genetic correlations were found. However, the use of n=100 calibration samples resulted in a broad range of validation r(2) values and genetic correlations. Subsequent increases of the number of calibration samples resulted in narrowing patterns for validation r(2) as well as genetic correlations. The use of n=1,000 calibration samples resulted in estimated genetic correlations varying within a range of 0.10 around the average, which seems acceptable. Genetic analyses for the human health-related fatty acids C14:0, C16:0, and C18u, and the ratio of saturated fatty acids to unsaturated fatty acids showed that replacing observations on fat composition determined by gas chromatography by predictions based on infrared spectra reduced the potential genetic gain to 98, 86, 96, and 99% for the 4 fatty acid traits, respectively, in dairy breeding schemes where progeny testing is practiced. We conclude that a relatively large number of calibration samples is required to be able to obtain genetic correlations that lie within a limited range. Considering that the routine recording of infrared spectra is relatively cheap and straightforward, we concluded that this methodology provides an excellent means for the dairy industry to genetically alter milk fat composition.

摘要

傅里叶变换红外光谱是一种测定牛乳脂肪组成的合适方法。然而，为了校准红外预测模型，需要进行气相色谱法测定脂肪组成，这种方法既昂贵又费力。最近有人指出，校准样品的数量与模型验证 r²（即预测的准确性）密切相关。然而，用于建立模型的校准样品数量（即验证 r²）对使用模型预测的数据的估计遗传参数的影响尚待确定。为此，我们从不同大小的 235 个校准数据子集中进行采样：n=100、n=250、n=500 和 n=1000 个校准样本。随后，我们使用这些数据子集来校准用于 2 种特定脂肪酸（C16:0 和 C18u（其中 u 表示不饱和））的脂肪组成预测模型。接下来，我们对这些脂肪酸的预测脂肪组成数据进行遗传参数估计。我们发现，校准样品数量与验证 r²之间以及遗传相关性之间存在很强的关系。然而，使用 n=100 个校准样本会导致验证 r²和遗传相关性的范围很广。随后增加校准样本数量会导致验证 r²和遗传相关性的模式变窄。使用 n=1000 个校准样本会导致遗传相关性在平均值的 0.10 范围内变化，这似乎是可以接受的。对与人类健康相关的脂肪酸 C14:0、C16:0 和 C18u 以及饱和脂肪酸与不饱和脂肪酸的比例进行的遗传分析表明，在进行后裔测试的奶牛育种计划中，用基于红外光谱的预测值代替气相色谱法测定的脂肪组成观测值，分别使 4 种脂肪酸性状的潜在遗传增益降低到 98%、86%、96%和 99%。我们得出的结论是，需要相当数量的校准样本才能获得遗传相关性，其范围限制在一定范围内。考虑到红外光谱的常规记录相对便宜且简单，我们得出的结论是，这种方法为奶制品行业提供了一种改变牛奶脂肪组成的绝佳途径。