Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA.
Council on Dairy Cattle Breeding (CDCB), Bowie, MD, 20716, USA.
Genet Sel Evol. 2024 Mar 8;56(1):18. doi: 10.1186/s12711-024-00883-w.
Validation by data truncation is a common practice in genetic evaluations because of the interest in predicting the genetic merit of a set of young selection candidates. Two of the most used validation methods in genetic evaluations use a single data partition: predictivity or predictive ability (correlation between pre-adjusted phenotypes and estimated breeding values (EBV) divided by the square root of the heritability) and the linear regression (LR) method (comparison of "early" and "late" EBV). Both methods compare predictions with the whole dataset and a partial dataset that is obtained by removing the information related to a set of validation individuals. EBV obtained with the partial dataset are compared against adjusted phenotypes for the predictivity or EBV obtained with the whole dataset in the LR method. Confidence intervals for predictivity and the LR method can be obtained by replicating the validation for different samples (or folds), or bootstrapping. Analytical confidence intervals would be beneficial to avoid running several validations and to test the quality of the bootstrap intervals. However, analytical confidence intervals are unavailable for predictivity and the LR method.
We derived standard errors and Wald confidence intervals for the predictivity and statistics included in the LR method (bias, dispersion, ratio of accuracies, and reliability). The confidence intervals for the bias, dispersion, and reliability depend on the relationships and prediction error variances and covariances across the individuals in the validation set. We developed approximations for large datasets that only need the reliabilities of the individuals in the validation set. The confidence intervals for the ratio of accuracies and predictivity were obtained through the Fisher transformation. We show the adequacy of both the analytical and approximated analytical confidence intervals and compare them versus bootstrap confidence intervals using two simulated examples. The analytical confidence intervals were closer to the simulated ones for both examples. Bootstrap confidence intervals tend to be narrower than the simulated ones. The approximated analytical confidence intervals were similar to those obtained by bootstrapping.
Estimating the sampling variation of predictivity and the statistics in the LR method without replication or bootstrap is possible for any dataset with the formulas presented in this study.
由于人们对预测一组年轻的候选个体的遗传优势感兴趣,因此在遗传评估中,数据截断验证是一种常见的做法。遗传评估中最常用的两种验证方法都使用单一的数据分区:预测性或预测能力(预调整表型与估计育种值(EBV)之间的相关性除以遗传力的平方根)和线性回归(LR)方法(“早期”和“晚期” EBV 的比较)。这两种方法都将预测值与整个数据集以及通过删除与一组验证个体相关的信息获得的部分数据集进行比较。在预测性方法中,使用部分数据集获得的 EBV 与调整后的表型进行比较;在 LR 方法中,使用整个数据集获得的 EBV 与调整后的表型进行比较。通过对不同样本(或折叠)进行重复验证或自举法,可以获得预测性和 LR 方法的置信区间。分析置信区间将有助于避免多次验证,并测试自举区间的质量。但是,无法为预测性和 LR 方法提供分析置信区间。
我们为预测性和 LR 方法中包含的统计量(偏差、分散度、准确性比和可靠性)导出了标准误差和 Wald 置信区间。偏差、分散度和可靠性的置信区间取决于验证集中个体之间的关系和预测误差方差和协方差。我们为大型数据集开发了仅需要验证集中个体可靠性的近似值。准确性比和预测性的置信区间通过 Fisher 变换获得。我们通过两个模拟示例展示了分析和近似分析置信区间的充分性,并将其与自举置信区间进行了比较。对于这两个示例,分析置信区间都更接近模拟置信区间。自举置信区间往往比模拟置信区间窄。近似分析置信区间与自举获得的置信区间相似。
对于本研究中提出的公式,对于任何数据集,无需重复或自举即可估计预测性和 LR 方法中的统计量的抽样变化。