Cartinoe
ALIGN: Scaling up Visual and Vision-Language Representation with Noisy Text Supervision Paper Review