The middle part blue ▇ displays the overall pipeline for 3D co-speech gesture generation from emotion transition human speech. The left part green ▇ depicts our proposed Motion Transition Infusion Mechanism that enhances the coordination of transition gestures \wrt head/tail ones. The right part orange ▇ shows the designed Emotion Mixture Strategy to provide weak supervision of the generated transition gestures, thereby achieving authority producing.
Visualization of our generated 3D co-speech gestures against various state-of-the-art methods. The samples of the left part are from our newly collected TED-ETrans dataset, and the samples of the right part are from our BEAT-ETrans dataset. Best view on screen.
Visual comparisons of ablation study on our newly collected BEAT-ETrans dataset. We show the key frames of the generated motions given the emotion transition of human speech. Best view on screen.
Visual comparisons of ablation study on our newly collected TED-ETrans dataset. We show the key frames of the generated motions given the emotion transition of human speech. Best view on screen.
@misc{qi2023weaklysupervised,
title={Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-speech Gesture Generation},
author={Xingqun Qi and Jiahao Pan and Peng Li and Ruibin Yuan and Xiaowei Chi and Mengfei Li and Wenhan Luo and Wei Xue and Shanghang Zhang and Qifeng Liu and Yike Guo},
year={2023},
eprint={2311.17532},
archivePrefix={arXiv},
primaryClass={cs.CV}
}