Skip to main content
Article

Conceptual Captions: A Cleaned, Hypernymed, Image Alt-text Dataset For Automatic Image Captioning

Piyush SharmaGoogle AI Venice, CA 90291Nan DingGoogle AI Venice, CA 90291Sebastian GoodmanGoogle AI Venice, CA 90291Radu SoricutGoogle AI Venice, CA 90291
2018en
ABI

Abstract

We present a new dataset of image caption annotations, Conceptual Captions, which contains an order of magnitude more images than the MS-COCO dataset We achieve this by extracting and filtering image caption annotations from billions of webpages. We also present quantitative evaluations of a number of image captioning models and show that a model architecture based on Inception- ResNet-v2 (Szegedy et al., 2016) for image-feature extraction and Transformer

Identifiers

Citations and references

Cited by 20 references