Research Intern – Multimodal Foundation Model for Vision