Arxiv paper – Video-T1: Test-Time Scaling for Video Generation


In this episode, we discuss Video-T1: Test-Time Scaling for Video Generation by Fangfu Liu, Hanyang Wang, Yimo Cai, Kaiyan Zhang, Xiaohang Zhan, Yueqi Duan. The paper investigates Test-Time Scaling (TTS) for video generation, aiming to enhance video quality by leveraging additional inference-time computation instead of expanding model size or training data. The authors treat video generation as a search problem, introducing the Tree-of-Frames (ToF) method, which efficiently navigates the search space by adaptively expanding and pruning video branches based on feedback from test-time verifiers. Experimental results on text-conditioned video benchmarks show that increasing inference-time compute through TTS significantly improves the quality of the generated videos.


Posted

in

by

Tags:

Comments

Leave a Reply

Your email address will not be published. Required fields are marked *