arXiv preprint - Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

In this episode, we discuss "Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation" by Li Hu, Xin Gao, Peng Zhang, Ke Sun, Bang Zhang, and Liefeng Bo. The paper presents a diffusion-based framework for character animation that synthesizes consistent, controllable videos from still images. It introduces ReferenceNet, which uses spatial attention to keep the character’s appearance consistent, and it adds a pose guider for controllable movement along with a technique for smooth temporal transitions between frames. The authors report that the method outperforms other image-to-video methods on character animation benchmarks, including fashion video and human dance synthesis.
