Develop Large Language Models as part of our applied research projects and in support of the AI21 Platform, including designing, implementing, and training massive-scale deep language models
Implement, optimize, scale, and test new cutting-edge ideas and architectures
Perform large-scale evaluations and comparisons of trained models across a range of benchmarks, as well as adding support for new benchmarks
Requirements & Skills:
B.Sc. in computer science, software engineering, or equivalent
Self learner, and proven record of ability to remove technical roadblocks
5+ years of experience developing software for production systems and/or internal infrastructure/tools
Prior experience working with cloud computing platforms (e.g., AWS, GCP, Docker, Kubernetes)
Skilled at writing production-grade Python code
Hands-on experience in deep learning and machine learning (TensorFlow/PyTorch)