MWFormer: Mesh Understanding with Window-based Transformer

Hao Yang Peng, Meng Hao Guo, Zheng Ning Liu, Yong Liang Yang, Tai Jiang Mu

Research output: Contribution to journalArticlepeer-review

3 Citations (SciVal)

Abstract

Polygonal mesh has been proven to be a powerful representation of 3D shapes, given its efficiency in expressing shape surface while maintaining geometric and topological information. Increasing efforts have been made to design elaborate deep convolutional neural networks for meshes. However, these methods naturally ignore the global connectivity among mesh primitives due to the locality nature of convolution operations. In this paper, we introduce a transformer-like self-attention mechanism with down-sampling architectures for mesh learning to capture both the global and local relationships among mesh faces. To achieve this, we propose BFS-Pooling, which can convert a connected mesh into discrete tokens (i.e., a set of adjacent faces) with breath-first-search (BFS) and naturally build hierarchical architectures for mesh learning by pooling mesh tokens. Benefiting from BFS-Pooling, we design a hierarchical transformer architecture with a window-based local attention mechanism, Mesh Window Transformer (MWFormer). Experimental results demonstrate that MWFormer achieves the best or competitive performance in both mesh classification and mesh segmentation tasks. Code will be available.

Original languageEnglish
Pages (from-to)382-391
Number of pages10
JournalComputers and Graphics (Pergamon)
Volume115
Early online date16 Jul 2023
DOIs
Publication statusPublished - 31 Oct 2023

Bibliographical note

Funding Information:
This work was supported by the National Key Research and Development Program of China ( 2021ZD0112902 ), and the National Natural Science Foundation of China (Grant No. 62220106003 , 61902210 ).

Funding

This work was supported by the National Key Research and Development Program of China ( 2021ZD0112902 ), and the National Natural Science Foundation of China (Grant No. 62220106003 , 61902210 ).

Keywords

  • Mesh classification
  • Mesh learning
  • Mesh processing
  • Mesh segmentation
  • Transformer

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • General Engineering
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Graphics and Computer-Aided Design

Fingerprint

Dive into the research topics of 'MWFormer: Mesh Understanding with Window-based Transformer'. Together they form a unique fingerprint.

Cite this