Skip to main navigation Skip to search Skip to main content

Planner3D: LLM-enhanced Graph Prior Meets 3D Indoor Scene Explicit Regularization

Yao Wei, Martin Renqiang Min, George Vosselman, Li Erran Li, Michael Ying Yang

Research output: Contribution to journalArticlepeer-review

1   Link opens in a new tab Citation (SciVal)
237 Downloads (Pure)

Abstract

Compositional 3D scene synthesis has diverse applications across a spectrum of industries such as robotics, films, and video games, as it closely mirrors the complexity of real-world multi-object environments. Conventional works typically employ shape retrieval based frameworks which naturally suffer from limited shape diversity. Recent progresses have been made in object shape generation with generative models such as diffusion models, which increases the shape fidelity. However, these approaches separately treat 3D shape generation and layout generation. The synthesized scenes are usually hampered by layout collision, which suggests that the scene-level fidelity is still under-explored. In this paper, we aim at generating realistic and reasonable 3D indoor scenes from scene graph. To enrich the priors of the given scene graph inputs, large language model is utilized to aggregate the global-wise features with local node-wise and edge-wise features. With a unified graph encoder, graph features are extracted to guide joint layout-shape generation. Additional regularization is introduced to explicitly constrain the produced 3D layouts. Benchmarked on the SG-FRONT dataset, our method achieves better 3D scene synthesis, especially in terms of scene-level fidelity. The source code will be released after publication. 3D indoor scene synthesis, generative model, scene graph, large language model, spatial arrangement, latent diffusion.

Original languageEnglish
Number of pages17
JournalIEEE Transactions on Pattern Analysis and Machine Intelligence
Early online date25 Aug 2025
Publication statusE-pub ahead of print - 25 Aug 2025

Funding

This work has been partially supported by EU HORIZON-CL42023-HUMAN-01-CNECT XTREME (grant no. 101136006). We sincerely appreciate all valuable comments and suggestions from all reviewers, which helped us to improve the quality of this article.

FundersFunder number
EU HORIZON-CL42023-HUMAN-01-CNECT101136006

    ASJC Scopus subject areas

    • Software
    • Computer Vision and Pattern Recognition
    • Computational Theory and Mathematics
    • Applied Mathematics
    • Artificial Intelligence

    Fingerprint

    Dive into the research topics of 'Planner3D: LLM-enhanced Graph Prior Meets 3D Indoor Scene Explicit Regularization'. Together they form a unique fingerprint.

    Cite this