Quantization-Aware Distillation Jobs
Browse 39 Quantization-Aware Distillation jobs on Inference Jobs.
Posted 2w ago
Software Engineer - Model Performance
Baseten
San Francisco, California, United States (On-site)
$150k – $250k Yearly
Posted 4w ago
Research Scientist, AI Accelerator SW HW Co-Design - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)
$168k – $264.5k Yearly
Posted 3d ago
Staff Software Engineer, ML Infrastructure
Decagon
San Francisco, California, United States (On-site)
$300k – $430k Yearly
Posted 2w ago
Senior Research Scientist, AI Accelerator Design and VLSI
NVIDIA
Santa Clara, California, United States (On-site)
$192k – $356.5k Yearly
Posted 2w ago
Research-Hardware Codesign Engineer
OpenAI
San Francisco, California, United States (Hybrid)
$230k – $460k Yearly
Posted 6d ago
Research Engineer / Machine Learning Engineer - B2B Applications
OpenAI
San Francisco, California, United States (On-site)
$295k – $445k Yearly
Posted 1w ago
Research Engineer, Frontier Speculative Decoding
Together AI
San Francisco, California, United States (On-site)
$190k – $270k Yearly
Posted 2w ago
Member of Engineering (Pre-training / Data)
Poolside
United Kingdom or Remote (Europe, Middle East, and Africa, North America)
Posted 6d ago
Research Engineer, Infrastructure, Numerics
Thinking Machines Lab
San Francisco, California, United States (On-site)
$350k – $475k Yearly
Posted 2w ago
Senior Software Engineer – TensorRT Edge-LLM
NVIDIA
Santa Clara, California, United States (Hybrid)
$152k – $287.5k Yearly
Posted 1d ago
Senior Deep Learning Compiler Engineer - XLA
NVIDIA
Santa Clara, California, United States (On-site)
$152k – $241.5k Yearly
Posted 2w ago
Member of Technical Staff - Data Quality Engineer (Pre-training)
Reflection AI
San Francisco, California, United States (On-site)
Posted 2w ago
Software Engineer - Data Flywheel
Perplexity
London, England, United Kingdom (On-site)
$210k – $385k Yearly