Siddiquee Md Mahfuzur Rahman, Zhou Zongwei, Tajbakhsh Nima, Feng Ruibin, Gotway Michael B, Bengio Yoshua, Liang Jianming
Arizona State University.
Mila - Quebec Artificial Intelligence Institute.
Proc IEEE Int Conf Comput Vis. 2019 Nov;2019:191-200. doi: 10.1109/iccv.2019.00028. Epub 2020 Feb 27.
Generative adversarial networks (GANs) have ushered in a revolution in image-to-image translation. The development and proliferation of GANs raises an interesting question: can we train a GAN to remove an object, if present, from an image while otherwise preserving the image? Specifically, can a GAN "virtually heal" anyone by turning his medical image, with an unknown health status (diseased or healthy), into a healthy one, so that diseased regions could be revealed by subtracting those two images? Such a task requires a GAN to identify a minimal subset of target pixels for domain translation, an ability that we call fixed-point translation, which no GAN is equipped with yet. Therefore, we propose a new GAN, called Fixed-Point GAN, trained by (1) supervising same-domain translation through a conditional identity loss, and (2) regularizing cross-domain translation through revised adversarial, domain classification, and cycle consistency loss. Based on fixed-point translation, we further derive a novel framework for disease detection and localization using only image-level annotation. Qualitative and quantitative evaluations demonstrate that the proposed method outperforms the state of the art in multi-domain image-to-image translation and that it surpasses predominant weakly-supervised localization methods in both disease detection and localization. Implementation is available at https://github.com/jlianglab/Fixed-Point-GAN.
生成对抗网络(GAN)在图像到图像的翻译领域引发了一场革命。GAN的发展与普及引发了一个有趣的问题:我们能否训练一个GAN,在图像中去除存在的物体,同时保留图像的其他部分?具体而言,GAN能否通过将健康状况未知(患病或健康)的医学图像转换为健康图像,从而“虚拟治愈”任何人,以便通过减去这两张图像来揭示患病区域?这样的任务要求GAN识别用于域翻译的目标像素的最小子集,我们将这种能力称为定点翻译,而目前还没有GAN具备这种能力。因此,我们提出了一种新的GAN,称为定点GAN,它通过以下方式进行训练:(1)通过条件身份损失监督同域翻译,(2)通过修正的对抗损失、域分类损失和循环一致性损失对跨域翻译进行正则化。基于定点翻译,我们进一步推导了一种仅使用图像级注释的疾病检测和定位的新框架。定性和定量评估表明,所提出的方法在多域图像到图像翻译方面优于现有技术,并且在疾病检测和定位方面超过了主要的弱监督定位方法。实现代码可在https://github.com/jlianglab/Fixed-Point-GAN获取。