Meng Zihang, Chakraborty Rudrasis, Singh Vikas
University of Wisconsin-Madison.
Butlr.
Adv Neural Inf Process Syst. 2021 Dec;34:14056-14068.
We present an efficient stochastic algorithm (RSG+) for canonical correlation analysis (CCA) using a reparametrization of the projection matrices. We show how this reparametrization (into structured matrices), simple in hindsight, directly presents an opportunity to repurpose/adjust mature techniques for numerical optimization on Riemannian manifolds. Our developments nicely complement existing methods for this problem which either require ( ) time complexity per iteration with convergence rate (where is the dimensionality) or only extract the top 1 component with convergence rate. In contrast, our algorithm offers an improvement: it achieves ( ) runtime complexity per iteration for extracting the top canonical components with convergence rate. While our paper focuses more on the formulation and the algorithm, our experiments show that the empirical behavior on common datasets is quite promising. We also explore a potential application in training fair models with missing sensitive attributes.
我们提出了一种高效的随机算法(RSG+),用于使用投影矩阵的重新参数化进行典型相关分析(CCA)。我们展示了这种事后看来很简单的重新参数化(转换为结构化矩阵)如何直接为在黎曼流形上重新利用/调整成熟的数值优化技术提供了机会。我们的进展很好地补充了针对此问题的现有方法,这些方法要么每次迭代需要()时间复杂度且收敛速度为(其中为维度),要么仅以收敛速度提取前1个分量。相比之下,我们的算法有一个改进:它以收敛速度提取前个典型分量时,每次迭代实现()运行时复杂度。虽然我们的论文更多地关注公式和算法,但我们的实验表明,在常见数据集上的实证表现很有前景。我们还探索了在训练具有缺失敏感属性的公平模型方面的潜在应用。