Model Inference Jobs
Browse 869 Model Inference jobs on Inference Jobs.
1w
Software Engineer, Model Inference
OpenAI
San Francisco, California, United States (On-site) $325k – $490k Yearly
6d
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site) $160k – $230k Yearly
3w
Inference Compiler and Frontend Engineer – Dubai
Cerebras
Dubai, Dubai, United Arab Emirates (On-site)
6d
Inference Runtime, Engineering Manager
OpenAI
San Francisco, California, United States (On-site) $455k – $555k Yearly
6d
Senior Deep Learning Engineer - Model Evaluation & AI Systems
NVIDIA
Santa Clara, California, United States (On-site) $224k – $431.3k Yearly
1w
Senior ML Framework Performance Engineer - AI for Science at Scale
NVIDIA
Santa Clara, California, United States (On-site) $184k – $287.5k Yearly
5d
Director of Engineering, Inference Services
CoreWeave
Sunnyvale, California, United States (Hybrid) $206k – $303k Yearly
2w
Inference Technical Lead, Sora
OpenAI
San Francisco, California, United States (Hybrid) $380k – $380k Yearly
4w
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Campbell, California, United States or Remote (California, United States) $30 – $59 Hourly
15h
Senior Software Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site) $152k – $287.5k Yearly
4w
Python / PyTorch Developer — Frontend Inference Compiler – Dubai
Cerebras
United Arab Emirates (On-site)
3d
Senior Machine Learning Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site) $152k – $287.5k Yearly
3w
Member of Technical Staff, Model Evaluation
xAI
Palo Alto, California, United States (On-site) $180k – $440k Yearly
1w
Senior Technical Product Manager Token Factory - Inference
Nebius
United States (Remote) $204k – $255k Yearly
2w
Member of Technical Staff, Model Efficiency
Cohere
New York, New York, United States or Remote (New York, United States + 3 more)