Quantization-Aware Distillation Jobs

Browse 39 Quantization-Aware Distillation jobs on Inference Jobs.

21-39 of 39 jobs

Posted 2w ago

Software Engineer - Model Performance

Baseten

San Francisco, California, United States (On-site) | $150k – $250k Yearly

Posted 4w ago

Research Scientist, AI Accelerator SW HW Co-Design - New College Grad 2026

NVIDIA

Santa Clara, California, United States (On-site) | $168k – $264.5k Yearly

Posted 2w ago

Research Engineer - Midtraining

Periodic Labs

Menlo Park, California, United States (On-site)

Posted 3d ago

Staff Software Engineer, ML Infrastructure

Decagon

San Francisco, California, United States (On-site) | $300k – $430k Yearly

Posted 2w ago

Senior Research Scientist, AI Accelerator Design and VLSI

NVIDIA

Santa Clara, California, United States (On-site) | $192k – $356.5k Yearly

Posted 2w ago

Research-Hardware Codesign Engineer

OpenAI

San Francisco, California, United States (Hybrid) | $230k – $460k Yearly

Posted 6d ago

Research Engineer / Machine Learning Engineer - B2B Applications

OpenAI

San Francisco, California, United States (On-site) | $295k – $445k Yearly

Posted 6d ago

Future AI Global leaders - Applied Science

Mistral AI

Île de Ré, Charente-Maritime, France (On-site)

Posted 1w ago

Research Engineer, Frontier Speculative Decoding

Together AI

San Francisco, California, United States (On-site) | $190k – $270k Yearly

Posted 2w ago

Member of Engineering (Pre-training / Data)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa, North America)

Posted 6d ago

Research Engineer, Infrastructure, Numerics

Thinking Machines Lab

San Francisco, California, United States (On-site) | $350k – $475k Yearly

Posted 2w ago

Senior Software Engineer – TensorRT Edge-LLM

NVIDIA

Santa Clara, California, United States (Hybrid) | $152k – $287.5k Yearly

Posted 1d ago

Senior Deep Learning Compiler Engineer - XLA

NVIDIA

Santa Clara, California, United States (On-site) | $152k – $241.5k Yearly

Posted 2w ago

Software Engineer - Data Flywheel

Perplexity

London, England, United Kingdom (On-site) | $210k – $385k Yearly