Abstract
We propose MAD-GAN, an intuitive generalization to the Generative Adversarial Networks (GANs) and its conditional variants to address the well known problem of mode collapse. First, MAD-GAN is a multi-agent GAN architecture incorporating multiple generators and one discriminator. Second, to enforce that different generators capture diverse high probability modes, the discriminator of MAD-GAN is designed such that along with finding the real and fake samples, it is also required to identify the generator that generated the given fake sample. Intuitively, to succeed in this task, the discriminator must learn to push different generators towards different identifiable modes. We perform extensive experiments on synthetic and real datasets and compare MAD-GAN with different variants of GAN. We show high quality diverse sample generations for challenging tasks such as image-to-image translation and face generation. In addition, we also show that MAD-GAN is able to disentangle different modalities when trained using highly challenging diverse-class dataset (e.g. dataset with images of forests, icebergs, and bedrooms). In the end, we show its efficacy on the unsupervised feature representation task.
Original language | English |
---|---|
Title of host publication | Proceedings - 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2018 |
Publisher | IEEE |
Pages | 8513-8521 |
Number of pages | 9 |
ISBN (Electronic) | 9781538664209 |
DOIs | |
Publication status | Published - 17 Dec 2018 |
Event | 31st Meeting of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2018 - Salt Lake City, USA United States Duration: 18 Jun 2018 → 22 Jun 2018 |
Publication series
Name | Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition |
---|---|
ISSN (Print) | 1063-6919 |
Conference
Conference | 31st Meeting of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2018 |
---|---|
Country/Territory | USA United States |
City | Salt Lake City |
Period | 18/06/18 → 22/06/18 |
ASJC Scopus subject areas
- Software
- Computer Vision and Pattern Recognition
Fingerprint
Dive into the research topics of 'Multi-agent Diverse Generative Adversarial Networks'. Together they form a unique fingerprint.Profiles
-
Vinay Namboodiri
- Department of Computer Science - Senior Lecturer
- Visual Computing
- Bath Institute for the Augmented Human
- Artificial Intelligence and Machine Learning
Person: Research & Teaching, Core staff, Affiliate staff