结合程序内容生成与扩散模型的图像到三维瓷瓶生成技术

Translated title of the contribution: Image to 3D vase generation technology combining procedural content generation and diffusion models

Heyi Sun, Yixiao Li, Xi Tian, Songhai Zhang

Research output: Contribution to journal, Article, peer-review

Abstract

In the traditional manual production of 3D content, 3D meshes and textures are the foundational elements of 3D assets. To improve the visual quality and rendering performance of these assets, meshes are typically built from quadrilateral faces with clean topology and UV mapping, and textures must match the geometric shape and remain globally consistent. However, current 3D content generation technologies based on latent diffusion models fail to meet these standards, which limits their potential in practical applications. At the same time, procedural content generation techniques are widely used in the gaming and architectural industries because they can systematically produce large numbers of 3D assets that conform to industry best practices. To improve the usability of generated assets, an integrated solution combining procedural content generation with diffusion model techniques was proposed. Using the vase, a 3D rotational body, as the example, the image-to-3D asset generation problem was divided into two principal tasks: 3D mesh reconstruction and 3D texture generation. For 3D mesh reconstruction, a novel vase generation program was developed, and a deep neural network was trained to learn the mapping between image features and procedural parameters, enabling reconstruction of a 3D model from a 2D image. For 3D texture generation, a novel two-stage texturing strategy was introduced, combining multi-view image synthesis and multi-view consistency sampling techniques to produce high-quality texture maps with global coherence. In summary, a scheme for the automatic construction of 3D vase assets from images was presented, which can be generalized to generate other 3D rotational body content and holds promise for generating other types of 3D content.
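The abstract does not describe the vase generation program itself. As an illustration only, the sketch below shows one common way a rotational body can be generated procedurally as a quad mesh with UV coordinates: a 2D profile curve is revolved around the vertical axis. The function names `vase_profile` and `revolve_profile` and the parameters `base_radius`, `belly_radius`, `neck_radius` and `height` are hypothetical and are not taken from the article.

```python
import math


def vase_profile(base_radius, belly_radius, neck_radius, height, rings=9):
    """Hypothetical parametric profile: a list of (radius, y) pairs.

    The radius blends smoothly from base to belly over the lower half
    and from belly to neck over the upper half (cosine easing).
    """
    profile = []
    for i in range(rings):
        t = i / (rings - 1)  # 0 at the base, 1 at the rim
        if t < 0.5:
            s, r0, r1 = t / 0.5, base_radius, belly_radius
        else:
            s, r0, r1 = (t - 0.5) / 0.5, belly_radius, neck_radius
        blend = 0.5 - 0.5 * math.cos(math.pi * s)
        profile.append((r0 + (r1 - r0) * blend, t * height))
    return profile


def revolve_profile(profile, segments=16):
    """Revolve a profile around the y axis into an all-quad mesh.

    Returns (vertices, uvs, quads). Each ring has segments + 1 vertices:
    the last one duplicates the first so that u runs from 0 to 1 across
    the seam without wrapping.
    """
    rings = len(profile)
    cols = segments + 1
    vertices, uvs = [], []
    for i, (radius, y) in enumerate(profile):
        v = i / (rings - 1)
        for j in range(cols):
            u = j / segments
            angle = 2.0 * math.pi * u
            vertices.append((radius * math.cos(angle), y, radius * math.sin(angle)))
            uvs.append((u, v))
    quads = []
    for i in range(rings - 1):
        for j in range(segments):
            a = i * cols + j  # corner on ring i, column j
            quads.append((a, a + 1, a + cols + 1, a + cols))
    return vertices, uvs, quads


# Example: a small vase with 9 rings and 16 segments around the axis.
profile = vase_profile(1.0, 2.0, 0.5, 4.0, rings=9)
vertices, uvs, quads = revolve_profile(profile, segments=16)
```

In a setup like this, a handful of scalar parameters fully determines the mesh, which is the kind of low-dimensional target a network could regress from image features. The actual parameterization used in the article may differ.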

Original language: Chinese (Traditional)
Pages (from-to): 332-344
Number of pages: 13
Journal: Journal of Graphics
Volume: 46
Issue number: 2
Early online date: 30 Apr 2025
Publication status: Published - 30 Apr 2025

Keywords

  • 3D reconstruction
  • deep learning
  • diffusion models
  • procedural content generation
  • texture generation

ASJC Scopus subject areas

  • Engineering (miscellaneous)
  • Computer Vision and Pattern Recognition
  • Computer Science Applications
  • Computer Graphics and Computer-Aided Design
  • Industrial and Manufacturing Engineering
