Some papers about image caption

1 Deep Visual-Semantic Alignments for Generating Image Description (CVPR2015)

2 long-term recurrent convolutional networks for visual recognition and description cvpr2015

3 Show and tell, a neural image caption generator.

4 Unifying visual-semantic embeddins with multimodal neural language model.

   This work has published code on github.  

github.com

and author also published other interesting work such as skip-thought.

5 Explain images with multimodal recurrent neural networks

6 Show, Attend and Tell, neural image caption generation with visual attention. 

 aslo very interesting work. use MCMC method.

7 Mind's eye: a recurrent visual representation for image caption generation. cvpr2015 CMU

8 From captions to visual concepts and back.

9 Order-embeddings of images and language

10 deep compositional captions: describing novel object categories without paired training data.