HoloGAN: Unsupervised Learning of 3D Representations From Natural Images

Thu Nguyen Phuoc, Chuan Li, Lucas Theis, Christian Richardt, Yongliang Yang

Research output: Chapter or section in a book/report/conference proceedingChapter in a published conference proceeding

258 Citations (SciVal)
181 Downloads (Pure)

Abstract

We propose a novel generative adversarial network (GAN) for the task of unsupervised learning of 3D representations from natural images. Most generative models rely on 2D kernels to generate images and make few assumptions about the 3D world. These models therefore tend to create blurry images or artefacts in tasks that require a strong 3D understanding, such as novel-view synthesis. HoloGAN instead learns a 3D representation of the world, and to render this representation in a realistic manner. Unlike other GANs, HoloGAN provides explicit control over the pose of generated objects through rigid-body transformations of the learnt 3D features. Our experiments show that using explicit 3D features enables HoloGAN to disentangle 3D pose and identity, which is further decomposed into shape and appearance, while still being able to generate images with similar or higher visual quality than other generative models. HoloGAN can be trained end-to-end from unlabelled 2D images only. Particularly, we do not require pose labels, 3D shapes, or multiple views of the same objects. This shows that HoloGAN is the first generative model that learns 3D representations from natural images in an entirely unsupervised manner.


This project has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No 665992
Original languageEnglish
Title of host publication2019 IEEE/CVF International Conference on Computer Vision (ICCV)
PublisherIEEE
Pages7588-7597
Number of pages10
ISBN (Electronic)9781728148038
ISBN (Print)9781728148045
DOIs
Publication statusPublished - 27 Feb 2020
EventInternational Conference on Computer Vision 2019 -
Duration: 27 Oct 20192 Nov 2019

Publication series

Name2019 IEEE/CVF International Conference on Computer Vision (ICCV)
PublisherIEEE
ISSN (Print)1550-5499
ISSN (Electronic)2380-7504

Conference

ConferenceInternational Conference on Computer Vision 2019
Period27/10/192/11/19

Fingerprint

Dive into the research topics of 'HoloGAN: Unsupervised Learning of 3D Representations From Natural Images'. Together they form a unique fingerprint.

Cite this