Multimodal Medical Caption Generation with Deep Learning