Works citing this work
3 works
Work: Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning