Video Generation from Text Employing Latent Path Construction for Temporal Modeling
ICPR 2022
Amir Mazaheri, Mubarak Shah
2021 – 2022
Pioneering text-to-video generation on realistic datasets using latent path construction for temporal modeling.
Introduces an approach to text-to-video generation that is among the first to use realistic datasets such as A2D and UCF101. The method regresses latent representations of the initial and final frames, then uses context-aware interpolation between them to synthesize the intermediate frames, so that a natural language description is rendered as a coherent video sequence.
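The latent path idea can be illustrated with a minimal sketch. It assumes the start and end latent codes are already available as vectors and builds the intermediate codes by linear interpolation, with an optional context-dependent offset that vanishes at both endpoints. The function name `latent_path`, the `context` vector, and the `strength` parameter are hypothetical and chosen for illustration only; the paper's actual interpolation module is learned and is not reproduced here.

```python
import numpy as np


def latent_path(z_start, z_end, num_frames, context=None, strength=0.1):
    """Return a (num_frames, D) array of latent codes from z_start to z_end.

    Illustrative assumption: plain linear interpolation plus an optional
    context offset. This is not the learned module described in the paper.
    """
    z_start = np.asarray(z_start, dtype=float)
    z_end = np.asarray(z_end, dtype=float)
    # One interpolation weight per frame, shaped (T, 1) for broadcasting.
    t = np.linspace(0.0, 1.0, num_frames)[:, None]
    path = (1.0 - t) * z_start + t * z_end
    if context is not None:
        # t * (1 - t) is zero at t = 0 and t = 1, so the regressed
        # first and last latent codes are left unchanged.
        path = path + strength * t * (1.0 - t) * np.asarray(context, dtype=float)
    return path


# Toy example: 4-dimensional latents, 5 frames.
z0 = np.zeros(4)
z1 = np.ones(4)
ctx = np.array([1.0, -1.0, 0.0, 2.0])  # stand-in for a text/context embedding
path = latent_path(z0, z1, num_frames=5, context=ctx)
print(path.shape)
```

In a full system, each row of `path` would be passed to a frame decoder; here the sketch only shows how a sequence of intermediate latent codes can be tied to fixed first and last frames.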