AI Inference Engineer, Tether

AI Inference Engineer, Tether

Company Tether
Job title AI Inference Engineer
Job location LondonEnglandUnited Kingdom / Remote (Europe)
Type Full Time

Responsibilities:

  • Work on deploying machine learning models to edge devices using frameworks such as TVM, MLC, and IREE (MLIR).
  • Collaborate closely with researchers to assist in coding, training, and transitioning models from research to production environments.
  • Integrate AI features into existing products, enriching them with the latest advancements in machine learning.

Requirements & Skills:

  • Excellent programming skills in Python, C and C++.
  • Experience with platforms such as TVM, MLC, and IREE (MLIR), which facilitate the deployment of models to specific GPU architectures.
  • Experience in NLP, computer vision, TensorFlow, PyTorch, JAX, and CUDA toolkits.
  • Experience with different aspects of Large Language Models (LLMs), such as fine-tuning techniques to tailor models to specific tasks and prompt engineering.
  • Extensive experience in training models using multi-GPU setups.
  • Demonstrated ability to rapidly assimilate new technologies and techniques.
  • A degree in Computer Science, AI, Machine Learning, or a related field, complemented by a solid track record in AI R&D.

apply for job button