We are currently looking for a talented Multimodal Generative AI Researcher, to research and develop next-generation multimodal generative AI technologies.
You will join a close-knit team of highly accomplished researchers and research engineers to deliver cutting-edge technologies that impact millions of users.
The responsibility involves designing, implementing, and optimizing multimodal AI models towards various impactful applicational domains, including but not limited to multimodal conversation, interaction, and content generation.
As a member of this team, you will be tackling meaningful technical problems together with one of the world’s most innovative product development teams.
Requirements & Skills:
Ph.D. (or MSc with 4+ years’ experience) in Generative AI, Multimodal Machine Learning with Vision and Language
Experienced with at least one common DL framework, e.g., PyTorch, TensorFlow
Strong Expertise in at least one of the following areas: Computer Vision, Natural Language Processing, Reinforcement Learning, Conversational AI
Solid Programming skills with Python
Fluent in English
Experience working with large-scale dataset construction and model training
Proven records in top-tier publications, e.g., TPAMI, CVPR, ICCV, ACL, EMNLP, etc.
Experience in project management
Knowledge of Diffusion Models and Transformers.
Experience in training large-scale deep neural networks on distributed systems.