Leverage your strong software engineering expertise and experience to develop robust solutions for real-world problems
Collaborate with researchers on projects related to machine learning and artificial intelligence
Develop algorithms, implement software, and profile and optimize performance for a range of AI/ML workloads
AI Platform Development: Assist in the design and development of AI/machine learning-based platforms for various safety-critical technical problems
AI/ML Model Deployment: Assist in deploying AI/machine learning models into production environments
AI/ML Pipeline and Model Monitoring: Contribute to code development and tool rollout for monitoring and maintenance of model performance and infrastructure health
Automation: Participate in the development and maintenance of automated AI systems pipelines
Collaboration: Under guidance from project leads, collaborate with domain experts to integrate machine learning models into production systems
Documentation: Maintain documentation of AI platform processes and procedures
Problem Solving: Assist in troubleshooting and resolving AI platform-related issues
Research and stay updated on the latest trends and technologies in DevOps, MLOps, and AIOps
Requirements & Skills:
Bachelor’s degree or higher in Computer Science, Engineering, Mathematics, or related field
5+ years of experience in software engineering, data engineering, or machine learning engineering
Proficient in one or more programming languages, such as Python, Java, C++
Proficient in working with AI frameworks and tools, such as TensorFlow, PyTorch, Scikit-learn
Experience developing, training, and debugging ML pipelines for computer vision and natural language processing (NLP)
Experience in working with cloud platforms, especially AWS and Azure
Experience in working with CI/CD frameworks such as GitLab, Jenkins
Experience in working with container and workload management frameworks such as Docker, Kubernetes, Airflow
Experience in infrastructure-as-code technologies such as CloudFormation, Terraform, and OpenTofu
Strong analytical and problem-solving skills
Excellent communication and collaboration skills
Ability to work independently and in a team environment
Deep expertise across a variety of DevOps, MLOps, and AIOps tools and technologies
Demonstrated experience in prototyping and shipping AI-based tools
Experience in working with big data technologies, such as Hadoop, Spark, Kafka
Experience in working with platforms such as Databricks, Amazon SageMaker, W&B (Weights & Biases), Kubeflow, Snorkel Flow, Labelbox, DVC, Doit, Great Expectations
Experience in working with DevOps tools such as Puppet, Ansible, Packer, Vagrant
Experience with security-focused languages and technologies such as Rust, WebAssembly
Experience developing, training, debugging, and maintaining ML pipelines for Large Language Models (LLMs), foundation models, time-series forecasting, and anomaly detection
Experience working with the DoD and other government agencies
Experience in open-source software development
U.S. Person (U.S. citizen, permanent resident, refugee, or asylee) or eligible to obtain any required export authorizations.