Cartinoe
VLP: Unified Vision-Language Pre-Traning for Image Captioning and VQA ๋…ผ๋ฌธ ๋ฆฌ๋ทฐ