Projects per year
Abstract
Polygonal mesh has been proven to be a powerful representation of 3D shapes, given its efficiency in expressing shape surface while maintaining geometric and topological information. Increasing efforts have been made to design elaborate deep convolutional neural networks for meshes. However, these methods naturally ignore the global connectivity among mesh primitives due to the locality nature of convolution operations. In this paper, we introduce a transformer-like self-attention mechanism with down-sampling architectures for mesh learning to capture both the global and local relationships among mesh faces. To achieve this, we propose BFS-Pooling, which can convert a connected mesh into discrete tokens (i.e., a set of adjacent faces) with breath-first-search (BFS) and naturally build hierarchical architectures for mesh learning by pooling mesh tokens. Benefiting from BFS-Pooling, we design a hierarchical transformer architecture with a window-based local attention mechanism, Mesh Window Transformer (MWFormer). Experimental results demonstrate that MWFormer achieves the best or competitive performance in both mesh classification and mesh segmentation tasks. Code will be available.
Original language | English |
---|---|
Pages (from-to) | 382-391 |
Number of pages | 10 |
Journal | Computers and Graphics (Pergamon) |
Volume | 115 |
Early online date | 16 Jul 2023 |
DOIs | |
Publication status | Published - 31 Oct 2023 |
Bibliographical note
Funding Information:This work was supported by the National Key Research and Development Program of China ( 2021ZD0112902 ), and the National Natural Science Foundation of China (Grant No. 62220106003 , 61902210 ).
Funding
This work was supported by the National Key Research and Development Program of China ( 2021ZD0112902 ), and the National Natural Science Foundation of China (Grant No. 62220106003 , 61902210 ).
Keywords
- Mesh classification
- Mesh learning
- Mesh processing
- Mesh segmentation
- Transformer
ASJC Scopus subject areas
- Software
- Signal Processing
- General Engineering
- Human-Computer Interaction
- Computer Vision and Pattern Recognition
- Computer Graphics and Computer-Aided Design
Fingerprint
Dive into the research topics of 'MWFormer: Mesh Understanding with Window-based Transformer'. Together they form a unique fingerprint.-
Centre for the Analysis of Motion, Entertainment Research and Applications (CAMERA) - 2.0
Campbell, N. (PI), Cosker, D. (PI), Bilzon, J. (CoI), Campbell, N. (CoI), Cazzola, D. (CoI), Colyer, S. (CoI), Cosker, D. (CoI), Lutteroth, C. (CoI), McGuigan, P. (CoI), O'Neill, E. (CoI), Petrini, K. (CoI), Proulx, M. (CoI) & Yang, Y. (CoI)
Engineering and Physical Sciences Research Council
1/11/20 → 31/10/25
Project: Research council
-
Centre for the Analysis of Motion, Entertainment Research and Applications (CAMERA)
Cosker, D. (PI), Bilzon, J. (CoI), Campbell, N. (CoI), Cazzola, D. (CoI), Colyer, S. (CoI), Fincham Haines, T. (CoI), Hall, P. (CoI), Kim, K. I. (CoI), Lutteroth, C. (CoI), McGuigan, P. (CoI), O'Neill, E. (CoI), Richardt, C. (CoI), Salo, A. (CoI), Seminati, E. (CoI), Tabor, A. (CoI) & Yang, Y. (CoI)
Engineering and Physical Sciences Research Council
1/09/15 → 28/02/21
Project: Research council