(1/2) New CVPR paper on speech-to-gesture prediction! Human speech is often accompanied by hand and arm gestures. Given audio speech input, we generate plausible gestures to go along with the sound and synthesize a corresponding video of the speaker.
00:00










