- Home
- Jobs
- United States
- Inference Optimization
Inference Optimization Jobs in United States
Discover Inference Optimization roles in United States on Inference Jobs and apply today.
1mo agoAN
Engineering Manager, Inference Routing and Performance
Anthropic
San Francisco, California, United States (Hybrid)$405K – $485K Yearly
3mo agoOP
Software Engineer, Inference - Multi Modal
OpenAI
San Francisco, California, United States (On-site)$325K – $490K Yearly
1mo agoMA
Member of Technical Staff, Inference & RL Systems
Magic
San Francisco, California, United States (On-site)$225K – $550K Yearly
2mo agoNV
Senior Software Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)$152K – $287.5K Yearly
2mo agoNV
Senior Machine Learning Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)$152K – $287.5K Yearly
2mo agoTA
Research Engineer, Core ML
Together AI
San Francisco, California, United States (On-site)$200K – $280K Yearly
2mo agoNV
Senior Compiler Engineer, AI Inference Platforms
NVIDIA
Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
4w agoOP
Inference Technical Lead, On-Device Transformers
OpenAI
San Francisco, California, United States (Hybrid)$445K – $445K Yearly
4w agoTA
Machine Learning Engineer - Inference
Together AI
San Francisco, California, United States (On-site)$160K – $230K Yearly
2w agoGR
Distinguished Engineer - Inference Serving Network and Storage
Graphcore
Austin, Texas, United States (On-site)
2w agoAN
Engineering Manager, Inference
Anthropic
San Francisco, California, United States (Hybrid)$425K – $560K Yearly
3d agoNV
Senior Deep Learning Software Engineer, LLM Performance
NVIDIA
Santa Clara, California, United States (On-site)$184K – $356.5K Yearly
3mo agoSE
ML Model Serving Engineer
Sesame
San Francisco, California, United States (On-site)$175K – $280K Yearly
3mo agoCO
Member of Technical Staff, Model Efficiency
Cohere
New York, United States or Remote (New York, United States + 3 more)
2mo agoAN
Technical Program Manager, Inference Performance
Anthropic
San Francisco, California, United States (Hybrid)$290K – $365K Yearly
3mo agoBA
Software Engineer - Model Performance
Baseten
San Francisco, California, United States (On-site)$150K – $250K Yearly
2mo agoNV
Senior Software Engineer – TensorRT Edge-LLM
NVIDIA
Santa Clara, California, United States (Hybrid)$152K – $287.5K Yearly
5d agoCE