AI Inference Engineer, Tether

Company	Tether
Job title	AI Inference Engineer
Job location	London, England, United Kingdom / Remote (Europe)
Type	Full Time

Responsibilities:

Work on deploying machine learning models to edge devices using frameworks such as TVM, MLC, and IREE (MLIR).
Collaborate closely with researchers to assist in coding, training, and transitioning models from research to production environments.
Integrate AI features into existing products, enriching them with the latest advancements in machine learning.

Excellent programming skills in Python, C and C++.
Experience with platforms such as TVM, MLC, and IREE (MLIR), which facilitate the deployment of models to specific GPU architectures.
Experience in NLP, computer vision, TensorFlow, PyTorch, JAX, and CUDA toolkits.
Experience with different aspects of Large Language Models (LLMs), such as fine-tuning techniques to tailor models to specific tasks and prompt engineering.
Extensive experience in training models using multi-GPU setups.
Demonstrated ability to rapidly assimilate new technologies and techniques.
A degree in Computer Science, AI, Machine Learning, or a related field, complemented by a solid track record in AI R&D.