Machine Learning Engineer — Distillation

This listing is synced directly from the company ATS.

Role Overview

The Machine Learning Engineer role at Featherless AI focuses on model distillation, where you'll design and implement distillation pipelines, run large-scale experiments, and optimize model performance. This mid-level position offers real ownership, with a direct impact on product development and efficiency. You will collaborate closely with both research and engineering teams to translate cutting-edge techniques into production-ready systems.

Perks & Benefits

Featherless AI offers a remote-friendly and async-first work environment, promoting flexibility and work-life balance. As part of a small, senior team, you'll have high ownership of your work and a direct impact on the product roadmap. Competitive compensation and meaningful equity are provided, alongside opportunities for growth in a dynamic, fast-moving startup culture.

⚠️ This job was posted over 5 months ago and may no longer be open. We recommend checking the company's site for the latest status.

Full Job Description

About the Role

We’re looking for a Machine Learning Engineer focused on model distillation to help us build smaller, faster, and more efficient models without sacrificing quality. You’ll work at the intersection of research and production—taking cutting-edge techniques and turning them into systems that scale.

This is a hands-on role with real ownership: you’ll design distillation pipelines, run large-scale experiments, and ship models used in production.

What You’ll Do

Design and implement knowledge distillation pipelines (teacher–student, self-distillation, multi-teacher, etc.)
Distill large foundation models into smaller, faster, and cheaper models for inference
Run and analyze large-scale training experiments to evaluate quality, latency, and cost tradeoffs
Collaborate with research to translate new distillation ideas into production-ready code
Optimize training and inference performance (memory, throughput, latency)
Contribute to internal tooling, evaluation frameworks, and experiment tracking
(Optional) Contribute back to open-source models, tooling, or research

What We’re Looking For

Strong background in machine learning or deep learning
Hands-on experience with model distillation (LLMs or other neural networks)
Solid understanding of training dynamics, loss functions, and optimization
Experience with PyTorch (or JAX) and modern ML tooling
Comfort running experiments on multi-GPU or distributed setups
Ability to reason about model quality vs. performance tradeoffs
Pragmatic mindset: you care about shipping, not just papers

Nice to Have

Experience distilling LLMs or large sequence models
Experience with inference optimization (quantization, pruning, kernels, etc.)
Familiarity with evaluation for language models
Open-source contributions or research publications
Experience in early-stage or fast-moving startups

Why Join

Work on core model quality and cost efficiency—not side projects
High ownership and direct impact on product and roadmap
Small, senior team with strong research + engineering culture
Competitive compensation + meaningful equity
Remote-friendly, async-first environment

Apply on original site

Similar jobs

Found 6 similar jobs

Founding Business Development Rep (AI Cloud US/CA)

Featherless AI • Remote

Content Marketer

Featherless AI • Remote

Founding Account Executive (AI Cloud)

Featherless AI • Remote

Business Development Rep (AI Cloud)

Featherless AI • Remote

AI Researcher — Training Optimization

Featherless AI • Remote

AI Researcher – Multilingual Data

Featherless AI • Remote

Browse more jobs in:

Machine Learning Engineer Jobs

Featherless AI

featherless.ai

Featherless AI specializes in developing lightweight and efficient artificial intelligence solutions tailored for resource-constrained environments. Their typical customers include tech startups, IoT device manufacturers, and enterprises seeking to integrate AI into mobile and edge computing applications. The company's main product is a suite of optimized AI models and tools that reduce computational overhead while maintaining high performance. As a fully remote organization, Featherless AI fosters a distributed work culture that emphasizes asynchronous communication and flexible scheduling to support a global team.

Industry

Artificial Intelligence

Fully remote

23 open positions

About this company (remote-wise)

Headquarters:: Distributed / remote-first
Team style:: Async-ish, remote-first

View company profile →

About the job

Posted onJan 22, 2026

LocationRemote

Skills

Machine LearningDeep LearningModel DistillationPyTorchJAXMulti-GPUDistributed SystemsOptimizationExperiment Tracking

Share this job

💌 Get remote jobs in your inbox

Subscribe to get the latest curated remote jobs every week.