H4MER: Human 4D Modeling by Learning Neural Compositional Representation with Transformer


¹Fudan University     ²Google



Overview of Our H4D Representation. We introduce a compact and compositional representation for 4D humans, which encodes a temporally dynamic human with low-dimensional codes for body shape, initial pose, skeleton motion, and auxiliary information.



Abstract

Despite the impressive results achieved by deep-learning-based 3D reconstruction, techniques for directly learning to model 4D human captures with detailed geometry remain less studied. This work presents a novel neural compositional representation for Human 4D Modeling with transformER (H4MER). Specifically, H4MER is a compact and compositional representation for dynamic humans that exploits the human body prior from the widely used SMPL parametric model. H4MER can thus represent a dynamic 3D human over a temporal span with codes for shape, initial pose, motion, and auxiliary information. A simple yet effective linear motion model provides a rough, regularized motion estimate, which is followed by per-frame compensation for pose and geometry details using the residual encoded in the auxiliary codes. We present a novel Transformer-based feature extractor and a conditional GRU decoder to facilitate learning and improve the representation capability. Extensive experiments demonstrate that our method is not only effective in recovering dynamic humans with accurate motion and detailed geometry, but also amenable to various 4D human-related tasks, including monocular video fitting, motion retargeting, 4D completion, and future prediction.
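As a rough illustration of the compositional idea in the abstract, the sketch below decodes a pose sequence from an initial pose and a motion code through a linear map, then adds per-frame residuals. It is a minimal sketch under stated assumptions, not the released implementation: the function `decode_poses`, the random `motion_basis`, the code sizes `MOTION_DIM` and `NUM_FRAMES`, and the random `residuals` (a stand-in for what the auxiliary code and GRU decoder would supply) are all hypothetical. Only the 72-dimensional axis-angle pose and the 10 shape coefficients follow the standard SMPL convention.

```python
import numpy as np

POSE_DIM = 72    # SMPL pose vector: 24 joints x 3 axis-angle values
SHAPE_DIM = 10   # commonly used number of SMPL shape coefficients
MOTION_DIM = 16  # hypothetical size of the motion code
NUM_FRAMES = 30  # hypothetical clip length


def decode_poses(initial_pose, motion_code, motion_basis, residuals=None):
    """Return per-frame poses of shape (num_frames, POSE_DIM).

    The coarse motion is linear in the motion code:
        pose[t] = initial_pose + motion_basis[t] @ motion_code
    Optional residuals are added per frame on top of the coarse motion.
    """
    # (T, P, K) contracted with (K,) gives per-frame pose offsets (T, P).
    offsets = np.einsum("tpk,k->tp", motion_basis, motion_code)
    poses = initial_pose[None, :] + offsets
    if residuals is not None:
        poses = poses + residuals
    return poses


rng = np.random.default_rng(0)

# The four codes of the representation (random placeholders here).
shape_code = rng.normal(size=SHAPE_DIM)              # constant over the clip
initial_pose = rng.normal(scale=0.1, size=POSE_DIM)  # pose at the first frame
motion_code = rng.normal(size=MOTION_DIM)

# Hypothetical linear motion basis; frame 0 is zeroed so that the
# sequence starts exactly at the initial pose.
motion_basis = rng.normal(scale=0.01, size=(NUM_FRAMES, POSE_DIM, MOTION_DIM))
motion_basis[0] = 0.0

# Stand-in for the per-frame compensation decoded from the auxiliary code.
residuals = rng.normal(scale=0.001, size=(NUM_FRAMES, POSE_DIM))

coarse = decode_poses(initial_pose, motion_code, motion_basis)
refined = decode_poses(initial_pose, motion_code, motion_basis, residuals)
```

In this sketch `shape_code` stays fixed for the whole clip and would be passed, together with each row of `refined`, to an SMPL layer to obtain per-frame meshes. Because the coarse motion is linear in `motion_code`, swapping the shape code while keeping the motion code is one simple way to picture motion retargeting.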


Paper and Code of Conference Version

B. Jiang, Y. Zhang, X. Wei, X. Xue, Y. Fu

H4D: Human 4D Modeling by Learning Neural Compositional Representation

CVPR 2022

[arXiv]     [Project Page]     [GitHub]    





Results

Monocular Video Fitting











Acknowledgements

We thank Xingkui Wei for providing technical support for several baseline methods. The corresponding author is Yanwei Fu. The website is modified from this template.