Yu Xiyu, Liu Tongliang, Gong Mingming, Batmanghelich Kayhan, Tao Dacheng
UBTECH Sydney AI Centre, SIT, FEIT, The University of Sydney, Australia.
Department of Biomedical Informatics, University of Pittsburgh.
Conf Comput Vis Pattern Recognit Workshops. 2018 Jun;2018:4480-4489. doi: 10.1109/CVPR.2018.00471. Epub 2018 Dec 17.
In this paper, we study the mixture proportion estimation (MPE) problem in a new setting: given samples from the mixture and the component distributions, we identify the proportions of the components in the mixture distribution. To address this problem, we make use of a linear independence assumption, i.e., the component distributions are independent from each other, which is much weaker than assumptions exploited in the previous MPE methods. Based on this assumption, we propose a method (1) that uniquely identifies the mixture proportions, (2) whose output provably converges to the optimal solution, and (3) that is computationally efficient. We show the superiority of the proposed method over the state-of-the-art methods in two applications including learning with label noise and semi-supervised learning on both synthetic and real-world datasets.
在本文中,我们在一种新的设定下研究混合比例估计(MPE)问题:给定来自混合分布和各成分分布的样本,我们要确定混合分布中各成分的比例。为解决此问题,我们利用线性独立性假设,即各成分分布相互独立,这一假设比先前MPE方法中所采用的假设要弱得多。基于此假设,我们提出一种方法:(1)能唯一确定混合比例;(2)其输出可证明收敛到最优解;(3)计算效率高。我们在包括带标签噪声学习和半监督学习的两个应用中,在合成数据集和真实世界数据集上展示了所提方法相对于现有最先进方法的优越性。