← Назад к работе
Работы, на которые ссылается эта работа
Работ: 28
Работа: MIRA-CAP: Memory-Integrated Retrieval-Augmented Captioning for State-of-the-Art Image and Video Captioning
Microsoft COCO: Common Objects in Context
Tsung-Yi Lin, Michael Maire, Serge Belongie +5
Глава2014Цитирований: 7ABIActivityNet: A large-scale video benchmark for human activity understanding
Fabian Caba Heilbron, Víctor Escorcia, Bernard Ghanem +1
Статья2015Цитирований: 3ABITowards Automatic Learning of Procedures From Web Instructional Videos
Luowei Zhou, Chenliang Xu, Jason J. Corso
Статья2018Цитирований: 3ABIVid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning
Antoine Yang, Arsha Nagrani, Paul Hongsuck Seo +5
Статья2023Цитирований: 3ABIVideo Captioning Using Large Language Models
Priyanshu Malaviya, Dhruvit Patel, Santosh Kumar Bharti
Статья2024Цитирований: 3ABICluster-guided temporal modeling for action recognition
Jeong-Hun Kim, Fei Hao, Carson K. Leung +1
Статья2023Цитирований: 2ABIStreaming Dense Video Captioning
Xingyi Zhou, Anurag Arnab, Shyamal Buch +5
Статья2024Цитирований: 2ABI