Angelopoulos Anastasios N, Bates Stephen, Fannjiang Clara, Jordan Michael I, Zrnic Tijana
Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley, CA 94720, USA.
Science. 2023 Nov 10;382(6671):669-674. doi: 10.1126/science.adi6000. Epub 2023 Nov 9.
Prediction-powered inference is a framework for performing valid statistical inference when an experimental dataset is supplemented with predictions from a machine-learning system. The framework yields simple algorithms for computing provably valid confidence intervals for quantities such as means, quantiles, and linear and logistic regression coefficients without making any assumptions about the machine-learning algorithm that supplies the predictions. Furthermore, more accurate predictions translate to smaller confidence intervals. Prediction-powered inference could enable researchers to draw valid and more data-efficient conclusions using machine learning. The benefits of prediction-powered inference were demonstrated with datasets from proteomics, astronomy, genomics, remote sensing, census analysis, and ecology.
预测驱动推理是一种在实验数据集辅以机器学习系统的预测时进行有效统计推断的框架。该框架产生了简单的算法,用于计算均值、分位数以及线性和逻辑回归系数等数量的可证明有效的置信区间,而无需对提供预测的机器学习算法做任何假设。此外,更准确的预测会转化为更小的置信区间。预测驱动推理能够使研究人员利用机器学习得出有效且数据效率更高的结论。蛋白质组学、天文学、基因组学、遥感、人口普查分析和生态学等数据集证明了预测驱动推理的优势。