Model Optimization Jobs
Browse 1,124 Model Optimization jobs on Inference Jobs.
1,124 jobs
6dAI
ML Runtime Optimization Engineer
Applied Intuition
Mountain View, California, United States (On-site)$159.1k – $199.3k Yearly
2wAI
ML Runtime Optimization Engineer - Lead
Applied Intuition
Sunnyvale, California, United States (On-site)$199.3k – $264.5k Yearly
3wPE
Research Engineering Manager - Model Training
Perplexity
San Francisco, California, United States (On-site)$300k – $470k Yearly
6dTA
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site)$160k – $230k Yearly
2wTA
Research Intern, Model Shaping (Summer 2026)
Together AI
San Francisco, California, United States (On-site)
6dTA
AI Researcher, Core ML
Together AI
San Francisco, California, United States (On-site)$160k – $230k Yearly
2wNV
Senior Developer Relations Manager - COSMOS and Foundation Models
NVIDIA
Santa Clara, California, United States (On-site)$184k – $356.5k Yearly
1wOP
Distributed Training Engineer, Sora
OpenAI
San Francisco, California, United States (Hybrid)$380k – $555k Yearly
4wCE
Python / PyTorch Developer — Frontend Inference Compiler – Dubai
Cerebras
United Arab Emirates (On-site)
2wSE
ML Model Serving Engineer
Sesame
San Francisco, California, United States (On-site)$175k – $280k Yearly
6dVA
Systems/GPU Research Engineer
Vast.ai
San Francisco, California, United States (On-site)$160k – $320k Yearly
6dNV
Senior Design Optimization Engineer - LPU Packaging
NVIDIA
Santa Clara, California, United States (Hybrid)$184k – $345k Yearly
2wRA
Member of Technical Staff - Post-Training
Reflection AI
San Francisco, California, United States (On-site)
6dTM
Research Engineer, Infrastructure, Inference
Thinking Machines Lab
San Francisco, California, United States (On-site)$350k – $475k Yearly