Department of Computer Science and Engineering, SUNY at Buffalo, United States.
Comput Methods Programs Biomed. 2013 Oct;112(1):135-45. doi: 10.1016/j.cmpb.2013.05.023. Epub 2013 Jul 18.
Statistical tests are powerful tools for data analysis. The Kruskal-Wallis test is a non-parametric statistical test that evaluates whether two or more samples are drawn from the same distribution, and it is widely used across many fields. In fields such as biomedical research and clinical data analysis, however, its use is often impeded by privacy concerns, because the data contain confidential information. In this work, we give a privacy-preserving solution for the Kruskal-Wallis test that enables two or more parties to jointly perform the test on the union of their data without compromising the privacy of any party's data. To the best of our knowledge, this is the first work to address the privacy issues that arise when the Kruskal-Wallis test is applied to distributed data.
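For orientation, the quantity that the parties ultimately need is the ordinary Kruskal-Wallis H statistic over the pooled ranks. The sketch below computes that statistic in the clear, with mid-ranks and the usual tie correction. It is not the privacy-preserving protocol described in this work; the function name `kruskal_wallis_h` and the sample values are illustrative assumptions only.

```python
from collections import Counter


def kruskal_wallis_h(*samples):
    """Tie-corrected Kruskal-Wallis H statistic for two or more samples.

    Plain, non-private computation on pooled data (illustrative only).
    """
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    pooled = [x for s in samples for x in s]
    n = len(pooled)
    counts = Counter(pooled)
    if len(counts) < 2:
        raise ValueError("all observations are identical; H is undefined")

    # Assign each distinct value its mid-rank: the average of the
    # 1-based positions it occupies in the sorted pooled data.
    rank_of = {}
    position = 0
    for value in sorted(counts):
        t = counts[value]
        rank_of[value] = position + (t + 1) / 2
        position += t

    # H = 12 / (n (n + 1)) * sum_i (R_i^2 / n_i) - 3 (n + 1),
    # where R_i is the rank sum of sample i and n_i its size.
    rank_term = sum(sum(rank_of[x] for x in s) ** 2 / len(s) for s in samples)
    h = 12.0 / (n * (n + 1)) * rank_term - 3 * (n + 1)

    # Tie correction: divide by 1 - sum(t^3 - t) / (n^3 - n).
    ties = sum(t ** 3 - t for t in counts.values())
    return h / (1 - ties / (n ** 3 - n))


# Hypothetical data held by two parties; here it is simply pooled.
party_a = [1, 2, 3]
party_b = [4, 5, 6]
h_stat = kruskal_wallis_h(party_a, party_b)
```

Under the null hypothesis, H is approximately chi-squared distributed with k - 1 degrees of freedom for k samples. Note that the ranks depend on the union of all parties' data, which is why a distributed setting cannot simply combine locally computed ranks.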