In this episode, we discuss Video-T1: Test-Time Scaling for Video Generation by Fangfu Liu, Hanyang Wang, Yimo Cai, Kaiyan Zhang, Xiaohang Zhan, Yueqi Duan. The paper investigates Test-Time Scaling (TTS) for video generation, aiming to enhance video quality by leveraging additional inference-time computation instead of expanding model size or training data. The authors treat video generation as a search problem, introducing the Tree-of-Frames (ToF) method, which efficiently navigates the search space by adaptively expanding and pruning video branches based on feedback from test-time verifiers. Experimental results on text-conditioned video benchmarks show that increasing inference-time compute through TTS significantly improves the quality of the generated videos.
Arxiv paper – Video-T1: Test-Time Scaling for Video Generation
by
Tags:
Leave a Reply