AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars
We propose AvatarCLIP, a zero-shot text-driven framework for 3D avatar generation and animation.
Our key insight is to take advantage of the powerful vision-language model CLIP for supervising neural human generation, in terms of 3D geometry, texture and animation.
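To make the idea of CLIP supervision concrete, the following is a minimal sketch of one common way such guidance is expressed: a loss that shrinks as the embedding of a rendered image aligns with the embedding of the text prompt. It is not taken from the AvatarCLIP implementation. The function names `cosine_similarity` and `clip_guidance_loss` are hypothetical, and the short vectors are placeholders standing in for real CLIP image and text embeddings, which would come from a pretrained CLIP model.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def clip_guidance_loss(image_embedding, text_embedding):
    """Hypothetical guidance loss: 1 - cosine similarity.

    The loss is 0 when the two embeddings point in the same direction
    and grows as they diverge (maximum 2 for opposite directions).
    """
    return 1.0 - cosine_similarity(image_embedding, text_embedding)


# Placeholder embeddings (assumption: real ones would be produced by
# CLIP's image and text encoders and have hundreds of dimensions).
text_emb = [0.2, 0.9, 0.1]          # embedding of the text prompt
render_close = [0.25, 0.85, 0.05]   # render that matches the prompt well
render_far = [0.9, 0.1, 0.3]        # render that matches the prompt poorly

loss_close = clip_guidance_loss(render_close, text_emb)
loss_far = clip_guidance_loss(render_far, text_emb)

# The better-matching render receives the lower loss.
print(loss_close < loss_far)
```

In a text-driven pipeline of this kind, a loss like this would be minimized with respect to the parameters that produce the render (geometry, texture, or pose), so that the rendered avatar moves toward the text description without any paired 3D training data.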