← Back to work
Works cited by this work
28 works
Work: MIRA-CAP: Memory-Integrated Retrieval-Augmented Captioning for State-of-the-Art Image and Video Captioning
Microsoft COCO: Common Objects in Context
Tsung-Yi Lin, Michael Maire, Serge Belongie +5
Chapter20147 citationsABIActivityNet: A large-scale video benchmark for human activity understanding
Fabian Caba Heilbron, Víctor Escorcia, Bernard Ghanem +1
Article20153 citationsABITowards Automatic Learning of Procedures From Web Instructional Videos
Luowei Zhou, Chenliang Xu, Jason J. Corso
Article20183 citationsABIVid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning
Antoine Yang, Arsha Nagrani, Paul Hongsuck Seo +5
Article20233 citationsABIVideo Captioning Using Large Language Models
Priyanshu Malaviya, Dhruvit Patel, Santosh Kumar Bharti
Article20243 citationsABICluster-guided temporal modeling for action recognition
Jeong-Hun Kim, Fei Hao, Carson K. Leung +1
Article20232 citationsABI