← All projects

Text-to-Video Generation via Latent Path Construction

2021 – 2022

Pioneering text-to-video generation on realistic datasets using latent path construction for temporal modeling.

video-generationtext-to-videotemporal-modelinggenerative

Introduces a pioneering approach to text-to-video generation — among the first to use realistic datasets such as A2D and UCF101. The method regresses latent representations of initial and final frames, then employs context-aware interpolation to synthesize intermediate frames, addressing the challenge of visualizing natural language descriptions as a coherent video sequence.

Publication

Video Generation from Text Employing Latent Path Construction for Temporal Modeling

ICPR 2022

Amir Mazaheri, Mubarak Shah