Inference Optimization Jobs
Explore Inference Optimization roles on Inference Jobs and apply today.
3mo agoOP
Software Engineer, Inference – AMD GPU Enablement
OpenAI
San Francisco, California, United States (On-site)$325K – $490K Yearly
2w agoGR
Distinguished Engineer - Inference Serving Network and Storage
Graphcore
Austin, Texas, United States (On-site)
2mo agoAN
Technical Program Manager, Inference Performance
Anthropic
San Francisco, California, United States (Hybrid)$290K – $365K Yearly
2mo agoXA
Member of Technical Staff, Inference
xAI
Palo Alto, California, United States (On-site)$180K – $440K Yearly
2mo agoCO
Software Engineer, Inference AI/ML
CoreWeave
Sunnyvale, California, United States (Hybrid)$92K – $135K Yearly
3mo agoCO
Site Reliability Engineer, Inference Infrastructure
Cohere
Toronto, Ontario, Canada or Remote (Canada + 2 more)
2w agoNV
2mo agoNV
Senior System Software Engineer - Dynamo-Triton Inference Server
NVIDIA
Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
3mo agoCR
Staff Product Manager, Managed Inference (SF/Sunnyvale/New York)
Crusoe
San Francisco, California, United States or Remote (California, United States + 1 more)$204K – $247K Yearly
2mo agoNV
Senior Machine Learning Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)$152K – $287.5K Yearly
2mo agoNV
Senior Software Engineer, Deep Learning Inference - TensorRT
NVIDIA
Santa Clara, California, US$152K – $287.5K Yearly
2mo agoNV
Principal Software Engineer - AI Inference
NVIDIA
Santa Clara, California, United States (On-site)$272K – $431.3K Yearly
4w agoAN
Engineering Manager, Inference Routing and Performance
Anthropic
San Francisco, California, United States (Hybrid)$405K – $485K Yearly
2mo agoAN
Software Engineer, Inference Deployment
Anthropic
San Francisco, California, United States (Hybrid)$320K – $485K Yearly
2mo agoNV
Senior Systems Software Engineer - Deep Learning Solutions
NVIDIA
Toronto, Ontario, Canada (On-site)C$225K – C$275K Yearly
2mo agoNV
Senior Compiler Engineer, AI Inference Platforms
NVIDIA
Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
3mo agoBA
Software Engineer - Model Performance
Baseten
San Francisco, California, United States (On-site)$150K – $250K Yearly
3w agoCE
4w agoAI
ML Runtime Optimization Engineer
Applied Intuition
Sunnyvale, California, United States (On-site)$159.1K – $199.3K Yearly