Deep Learning models for Image Caption Generation
v1 is based on https://www.analyticsvidhya.com/blog/2021/12/step-by-step-guide-to-build-image-caption-generator-using-deep-learning/
v2 is based on https://huggingface.co/docs/transformers/tasks/image_captioning
v3 is based on v2, using a BLIP model instead of a GIT model
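The v2 to v3 change can be pictured as swapping one pretrained checkpoint for another while keeping the surrounding code the same. The sketch below is a minimal illustration, assuming the Hugging Face `transformers` library with PyTorch installed. The checkpoint names (`microsoft/git-base`, `Salesforce/blip-image-captioning-base`) and the helpers `MODEL_CHOICES`, `load_captioner` and `caption` are assumptions chosen for illustration and may differ from what this repository actually uses.

```python
# Hypothetical mapping from version to (checkpoint, transformers class name).
# These checkpoints are assumptions, not taken from this repository.
MODEL_CHOICES = {
    "v2": ("microsoft/git-base", "AutoModelForCausalLM"),
    "v3": ("Salesforce/blip-image-captioning-base", "BlipForConditionalGeneration"),
}


def load_captioner(version):
    """Return (processor, model) for a version; downloads weights on first use."""
    # Imported lazily so the mapping above can be inspected without the library.
    import transformers

    checkpoint, class_name = MODEL_CHOICES[version]
    processor = transformers.AutoProcessor.from_pretrained(checkpoint)
    model = getattr(transformers, class_name).from_pretrained(checkpoint)
    return processor, model


def caption(image, processor, model, max_length=50):
    """Generate one caption for a PIL image with the given processor and model."""
    inputs = processor(images=image, return_tensors="pt")
    generated_ids = model.generate(
        pixel_values=inputs.pixel_values, max_length=max_length
    )
    return processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
```

With this layout, moving from GIT to BLIP only changes the key passed to `load_captioner`, for example `load_captioner("v3")` instead of `load_captioner("v2")`.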
Image training dataset available at https://github.com/jbrownlee/Datasets/releases/tag/Flickr8k
M. Hodosh, P. Young and J. Hockenmaier (2013) "Framing Image Description as a Ranking Task: Data, Models and Evaluation Metrics", Journal of Artificial Intelligence Research, Volume 47, pages 853-899. http://www.jair.org/papers/paper3994.html
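Before training, the Flickr8k captions need to be grouped by image. The sketch below assumes the caption file layout used by `Flickr8k.token.txt` in the dataset's text archive, where each line is `<image name>#<caption index>`, a tab, then the caption. The `parse_captions` helper and the sample lines are made up for illustration and are not real dataset entries.

```python
from collections import defaultdict


def parse_captions(lines):
    """Group captions by image name from lines shaped like 'name.jpg#0<TAB>caption'."""
    captions = defaultdict(list)
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        image_ref, _, text = line.partition("\t")
        image_name = image_ref.split("#")[0]  # drop the '#<index>' suffix
        captions[image_name].append(text.strip())
    return dict(captions)


# Invented sample lines in the assumed format (not real dataset entries).
sample = [
    "example_1.jpg#0\tA dog runs across the grass .",
    "example_1.jpg#1\tA brown dog is running outside .",
    "example_2.jpg#0\tTwo children play on a beach .",
]

captions = parse_captions(sample)
print(captions["example_1.jpg"])
```

To run this on the real file, pass an open file handle, for example `parse_captions(open("Flickr8k.token.txt", encoding="utf-8"))`, adjusting the path to wherever the text archive was extracted.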