Skip to main navigation Skip to search Skip to main content

Robust Shape Fitting for 3D Scene Abstraction

Florian Kluger, Eric Brachmann, Michael Ying Yang, Bodo Rosenhahn

Research output: Contribution to journalArticlepeer-review

5   Link opens in a new tab Citations (SciVal)
303 Downloads (Pure)

Abstract

Humans perceive and construct the world as an arrangement of simple parametric models. In particular, we can often describe man-made environments using volumetric primitives such as cuboids or cylinders. Inferring these primitives is important for attaining high-level, abstract scene descriptions. Previous approaches for primitive-based abstraction estimate shape parameters directly and are only able to reproduce simple objects. In contrast, we propose a robust estimator for primitive fitting, which meaningfully abstracts complex real-world environments using cuboids. A RANSAC estimator guided by a neural network fits these primitives to a depth map. We condition the network on previously detected parts of the scene, parsing it one-by-one. To obtain cuboids from single RGB images, we additionally optimise a depth estimation CNN end-to-end. Naively minimising point-to-primitive distances leads to large or spurious cuboids occluding parts of the scene. We thus propose an improved occlusion-aware distance metric correctly handling opaque scenes. Furthermore, we present a neural network based cuboid solver which provides more parsimonious scene abstractions while also reducing inference time. The proposed algorithm does not require labour-intensive labels, such as cuboid annotations, for training. Results on the NYU Depth v2 dataset demonstrate that the proposed algorithm successfully abstracts cluttered real-world 3D scene layouts.
Original languageEnglish
Pages (from-to)6306-6325
Number of pages20
JournalIEEE Transactions on Pattern Analysis and Machine Intelligence
Volume46
Issue number9
Early online date19 Mar 2024
DOIs
Publication statusPublished - 19 Mar 2024

Funding

This work was supported by the BMBF grant LeibnizAILab (01DD20003), by the DFG grant COVMAP (RO 2497/12-2), by the DFG Cluster of Excellence PhoenixD (EXC 2122), and by the Center for Digital Innovations (ZDIN).

Keywords

  • Estimation
  • Image reconstruction
  • Scene abstraction
  • Shape
  • Solid modeling
  • Surface reconstruction
  • Three-dimensional displays
  • Training
  • cuboid fitting
  • minimal solver
  • multi-model fitting
  • shape decomposition

ASJC Scopus subject areas

  • Software
  • Artificial Intelligence
  • Applied Mathematics
  • Computer Vision and Pattern Recognition
  • Computational Theory and Mathematics

Fingerprint

Dive into the research topics of 'Robust Shape Fitting for 3D Scene Abstraction'. Together they form a unique fingerprint.

Cite this