Nabi Razieh, Malinsky Daniel, Shpitser Ilya
Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA.
Proc Mach Learn Res. 2019 Jun;97:4674-4682.
Systematic discriminatory biases present in our society influence the way data is collected and stored, the way variables are defined, and the way scientific findings are put into practice as policy. Automated decision procedures and learning algorithms applied to such data may serve to perpetuate existing injustice or unfairness in our society. In this paper, we consider how to make optimal but fair decisions, which "break the cycle of injustice" by correcting for the unfair dependence of both decisions and outcomes on sensitive features (e.g., variables that correspond to gender, race, disability, or other protected attributes). We use methods from causal inference and constrained optimization to learn optimal policies in a way that addresses multiple potential biases which afflict data analysis in sensitive contexts, extending the approach of Nabi & Shpitser (2018). Our proposal comes equipped with the theoretical guarantee that the chosen fair policy will induce a joint distribution for new instances that satisfies given fairness constraints. We illustrate our approach with both synthetic data and real criminal justice data.
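To make the abstract's core idea concrete, below is a minimal, self-contained Python sketch (not the authors' implementation) of learning a policy by constrained optimization: policy parameters are chosen to maximize empirical expected utility subject to a bound on the policy's dependence on a sensitive attribute. The paper constrains causal path-specific effects estimated from a causal model; this sketch substitutes a crude associational proxy (the gap in mean decision rates across levels of the sensitive attribute A), and all names and data (tau, policy_prob, the simulated variables) are illustrative assumptions.

# Sketch of fair policy learning as constrained optimization.
# The fairness quantity here is an associational proxy, NOT the
# path-specific effect used in the paper.
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(0)
n = 2000
A = rng.integers(0, 2, n)               # sensitive attribute (e.g., race)
X = rng.normal(size=n) + 0.8 * A        # covariate influenced by A
Y = 1.0 + 0.5 * X + rng.normal(size=n)  # outcome used to score decisions

def policy_prob(theta, X):
    # Logistic policy pi(D=1 | X); A is deliberately excluded as an input.
    return 1.0 / (1.0 + np.exp(-(theta[0] + theta[1] * X)))

def neg_expected_utility(theta):
    # Utility of deciding D=1 is taken to be Y (a stand-in utility).
    p = policy_prob(theta, X)
    return -np.mean(p * Y)

def fairness_gap(theta):
    # Proxy constraint: decision rates should be similar across A.
    p = policy_prob(theta, X)
    return abs(p[A == 1].mean() - p[A == 0].mean())

tau = 0.05  # illustrative tolerance on the unfair dependence
res = minimize(
    neg_expected_utility,
    x0=np.zeros(2),
    method="SLSQP",
    constraints=[{"type": "ineq", "fun": lambda t: tau - fairness_gap(t)}],
)
print("policy parameters:", res.x, "fairness gap:", fairness_gap(res.x))

The inequality constraint (tau - fairness_gap >= 0) caps how much the learned decision rule may depend on A, mirroring the paper's guarantee that the chosen policy induces a distribution satisfying the given fairness constraints; replacing fairness_gap with an estimated path-specific effect would bring the sketch closer to the actual method.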