This repository contains the code for the M4C-Captioner model from the following paper, released under MMF.
- O. Sidorov, R. Hu, M. Rohrbach, A. Singh, TextCaps: a Dataset for Image Captioning with Reading Comprehension. In ECCV, 2020
@inproceedings{sidorov2019textcaps,
title={TextCaps: a Dataset for Image Captioning with Reading Comprehension},
author={Sidorov, Oleksii and Hu, Ronghang and Rohrbach, Marcus and Singh, Amanpreet},
booktitle={European Conference on Computer Vision},
year={2020}
}
Please see https://mmf.sh/docs/projects/m4c_captioner for more details on how to use the M4C-Captioner model.