Abstract
Humans perceive and construct the world as an arrangement of simple parametric models. In particular, we can often describe man-made environments using volumetric primitives such as cuboids or cylinders. Inferring these primitives is important for attaining high-level, abstract scene descriptions. Previous approaches for primitive-based abstraction estimate shape parameters directly and are only able to reproduce simple objects. In contrast, we propose a robust estimator for primitive fitting, which meaningfully abstracts complex real-world environments using cuboids. A RANSAC estimator guided by a neural network fits these primitives to a depth map. We condition the network on previously detected parts of the scene, parsing it one-by-one. To obtain cuboids from single RGB images, we additionally optimise a depth estimation CNN end-to-end. Naively minimising point-to-primitive distances leads to large or spurious cuboids occluding parts of the scene. We thus propose an improved occlusion-aware distance metric correctly handling opaque scenes. Furthermore, we present a neural network based cuboid solver which provides more parsimonious scene abstractions while also reducing inference time. The proposed algorithm does not require labour-intensive labels, such as cuboid annotations, for training. Results on the NYU Depth v2 dataset demonstrate that the proposed algorithm successfully abstracts cluttered real-world 3D scene layouts.
| Original language | English |
|---|---|
| Pages (from-to) | 6306-6325 |
| Number of pages | 20 |
| Journal | IEEE Transactions on Pattern Analysis and Machine Intelligence |
| Volume | 46 |
| Issue number | 9 |
| Early online date | 19 Mar 2024 |
| DOIs | |
| Publication status | Published - 19 Mar 2024 |
Funding
This work was supported by the BMBF grant LeibnizAILab (01DD20003), by the DFG grant COVMAP (RO 2497/12-2), by the DFG Cluster of Excellence PhoenixD (EXC 2122), and by the Center for Digital Innovations (ZDIN).
Keywords
- Estimation
- Image reconstruction
- Scene abstraction
- Shape
- Solid modeling
- Surface reconstruction
- Three-dimensional displays
- Training
- cuboid fitting
- minimal solver
- multi-model fitting
- shape decomposition
ASJC Scopus subject areas
- Software
- Artificial Intelligence
- Applied Mathematics
- Computer Vision and Pattern Recognition
- Computational Theory and Mathematics
Fingerprint
Dive into the research topics of 'Robust Shape Fitting for 3D Scene Abstraction'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS