Conceptual Captions: A Cleaned, Hypernymed, Image Alt-text Dataset For Automatic Image Captioning
Piyush SharmaGoogle AI Venice, CA 90291Nan DingGoogle AI Venice, CA 90291Sebastian GoodmanGoogle AI Venice, CA 90291Radu SoricutGoogle AI Venice, CA 90291
2018en
ABI
Abstract
We present a new dataset of image caption annotations, Conceptual Captions, which contains an order of magnitude more images than the MS-COCO dataset We achieve this by extracting and filtering image caption annotations from billions of webpages. We also present quantitative evaluations of a number of image captioning models and show that a model architecture based on Inception- ResNet-v2 (Szegedy et al., 2016) for image-feature extraction and Transformer
Identifiers
Citations and references
Cited by 20 references