Quantization Jobs
Browse 30 Quantization jobs on Inference Jobs.
30 jobs
2dNV
Senior Machine Learning Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
5mNV
Senior Software Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
3wXA
Member of Technical Staff, Inference
xAI
Palo Alto, California, United States (On-site)$180k – $440k Yearly
2wNV
Senior Deep Learning Engineer
NVIDIA
Warszawa, Masovian Voivodeship, Poland (Hybrid)zł 292.5k – zł 507k Yearly
5dAN
Performance Engineer, GPU
Anthropic
San Francisco, California, United States (Hybrid)$315k – $560k Yearly
5dTA
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site)$160k – $230k Yearly
5dAI
ML Runtime Optimization Engineer
Applied Intuition
Mountain View, California, United States (On-site)$159.1k – $199.3k Yearly
2wAI
ML Runtime Optimization Engineer - Lead
Applied Intuition
Sunnyvale, California, United States (On-site)$199.3k – $264.5k Yearly
2wD-
Senior Staff Machine Learning Engineer -Frameworks
d-Matrix
Santa Clara, California, United States (Hybrid)$155k – $250k Yearly
2wNV
Senior Research Scientist, AI Accelerator Design and VLSI
NVIDIA
Santa Clara, California, United States (On-site)$192k – $356.5k Yearly
1wBA
Software Engineer - Model Performance
Baseten
San Francisco, California, United States (On-site)$150k – $250k Yearly
2wTE
1wOP
Research-Hardware Codesign Engineer
OpenAI
San Francisco, California, United States (Hybrid)$230k – $460k Yearly
4wNV
Research Scientist, AI Accelerator SW HW Co-Design - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)$168k – $264.5k Yearly
2dDE
Staff Software Engineer, ML Infrastructure
Decagon
San Francisco, California, United States (On-site)$300k – $430k Yearly