Inference Engineer Jobs
Explore Inference Engineer roles on Inference Jobs and apply today.
1mo agoNV
AI Inference Performance Engineer - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)$124K – $241.5K Yearly
3mo agoOP
Inference Technical Lead, Sora
OpenAI
San Francisco, California, United States (Hybrid)$380K – $380K Yearly
3w agoOP
Inference Technical Lead, On-Device Transformers
OpenAI
San Francisco, California, United States (Hybrid)$445K – $445K Yearly
2mo agoNV
Senior AI Inference Compiler Engineer
NVIDIA
Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
2mo agoXA
Member of Technical Staff, Inference
xAI
Palo Alto, California, United States (On-site)$180K – $440K Yearly
2mo agoNV
Principal Software Engineer - AI Inference
NVIDIA
Santa Clara, California, United States (On-site)$272K – $431.3K Yearly
3mo agoCO
Site Reliability Engineer, Inference Infrastructure
Cohere
Toronto, Ontario, Canada or Remote (Canada + 2 more)
2mo agoCO
Software Engineer, Inference AI/ML
CoreWeave
Sunnyvale, California, United States (Hybrid)$92K – $135K Yearly
2mo agoNV
Senior System Software Engineer - Dynamo-Triton Inference Server
NVIDIA
Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
3w agoTA
Senior Backend Engineer, Inference Platform
Together AI
San Francisco, California, United States (On-site)$160K – $250K Yearly
2mo agoNV
Senior Software Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)$152K – $287.5K Yearly
3mo agoPO
Member of Engineering (Pre-training and inference software)
Poolside
United Kingdom or Remote (Europe, Middle East, and Africa, North America)
1mo agoNV
Senior Deep Learning Engineer
NVIDIA
Redmond, Washington, United States (On-site)$152K – $287.5K Yearly
1mo agoAN
Sr. Software Engineer, Inference
Anthropic
London, England, United Kingdom (Hybrid)£225K – £325K Yearly
3mo agoOP
Software Engineer, Inference – AMD GPU Enablement
OpenAI
San Francisco, California, United States (On-site)$325K – $490K Yearly
5d agoAN
2mo agoCE
Python / PyTorch Developer — Frontend Inference Compiler – Dubai
Cerebras
United Arab Emirates (On-site)