Wang Xin, Chen Hong, Tang Si'ao, Wu Zihao, Zhu Wenwu
IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):9677-9696. doi: 10.1109/TPAMI.2024.3420937. Epub 2024 Nov 6.
Disentangled Representation Learning (DRL) aims to learn a model capable of identifying and disentangling the underlying factors hidden in the observable data in representation form. The process of separating underlying factors of variation into variables with semantic meaning benefits in learning explainable representations of data, which imitates the meaningful understanding process of humans when observing an object or relation. As a general learning strategy, DRL has demonstrated its power in improving the model explainability, controlability, robustness, as well as generalization capacity in a wide range of scenarios such as computer vision, natural language processing, and data mining. In this article, we comprehensively investigate DRL from various aspects including motivations, definitions, methodologies, evaluations, applications, and model designs. We first present two well-recognized definitions, i.e., Intuitive Definition and Group Theory Definition for disentangled representation learning. We further categorize the methodologies for DRL into four groups from the following perspectives, the model type, representation structure, supervision signal, and independence assumption. We also analyze principles to design different DRL models that may benefit different tasks in practical applications. Finally, we point out challenges in DRL as well as potential research directions deserving future investigations. We believe this work may provide insights for promoting the DRL research in the community.
解缠表征学习(DRL)旨在学习一种模型,该模型能够以表征形式识别并解开隐藏在可观测数据中的潜在因素。将潜在变化因素分离为具有语义意义的变量的过程,有助于学习数据的可解释表征,这模仿了人类观察物体或关系时的有意义理解过程。作为一种通用的学习策略,DRL已在诸如计算机视觉、自然语言处理和数据挖掘等广泛场景中,展现出其在提高模型可解释性、可控性、鲁棒性以及泛化能力方面的强大作用。在本文中,我们从动机、定义、方法、评估、应用和模型设计等各个方面对DRL进行了全面研究。我们首先给出了两个公认的定义,即解缠表征学习的直观定义和群论定义。我们进一步从模型类型、表征结构、监督信号和独立性假设等角度,将DRL的方法分为四类。我们还分析了设计不同DRL模型的原则,这些原则可能在实际应用中对不同任务有益。最后,我们指出了DRL中的挑战以及值得未来研究的潜在方向。我们相信这项工作可能为推动该领域的DRL研究提供见解。
IEEE Trans Pattern Anal Mach Intell. 2024-12
IEEE Trans Pattern Anal Mach Intell. 2022-9
Med Image Anal. 2022-8
IEEE Trans Neural Netw Learn Syst. 2025-4
Proc SIGCHI Conf Hum Factor Comput Syst. 2023-4
IEEE Trans Neural Netw Learn Syst. 2024-4
Comput Methods Programs Biomed. 2022-11
Predict Intell Med. 2025
Natl Sci Rev. 2024-8-23
Biomed Eng Online. 2024-8-31
Comput Biol Med. 2024-9
Entropy (Basel). 2023-10-31