
Company: Unlikely AI
Job title: Senior Research Engineer
Job location: London, United Kingdom
Type: Full Time
Responsibilities:
- Implementing, deploying, and monitoring deep learning models, including LLMs.
- Optimising model deployments and designing deep learning model feature systems.
- Conducting comprehensive performance evaluations, focusing on latency and accuracy across different implementations.
- Communicating complex solutions to colleagues, facilitating collaboration and knowledge sharing.
- Analysing and inspecting large-scale datasets, effectively managing data scalability and integrity.
Requirements & Skills:
- 5+ years' experience in a Machine Learning/Research Engineer role.
- Experience utilising & deploying deep learning models.
- Strong coding skills in Python, including the use of PyTorch or TensorFlow.
- Enthusiasm to learn and get up to speed with cutting-edge technologies that you may not already be deeply familiar with.
- Strong verbal and written communication skills.
- Experience with cloud infrastructure (e.g. AWS / GCP / Azure).
- Experience with MLOps, with strong expertise in Docker for containerization and orchestration.
- Knowledge of ML model deployment, including technologies such as TorchServe, SageMaker, or Vertex AI.
- Understanding of modern best practices for agile software development.
- Knowledge of the latest developments in NLP, including LLMs and the transformer architecture.
- SRE: An understanding of how to keep models stable and performant in production settings.
- Experience with building CI/CD workflows.
- Experience working in a startup.
- Experience with retrieval augmented generation for LLMs and semantic vector search.
- Experience optimising model deployments in terms of latency and throughput.
- Infrastructure-as-code tools, such as Terraform.
