- Home
- Jobs
- United States
- Inference Optimization
Inference Optimization Jobs in United States
Discover Inference Optimization roles in United States on Inference Jobs and apply today.
4w agoTA
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site)$160K – $230K Yearly
3mo agoOP
Inference Technical Lead, Sora
OpenAI
San Francisco, California, United States (Hybrid)$380K – $380K Yearly
4w agoOP
3mo agoPE
AI Inference Engineer (San Francisco)
Perplexity
San Francisco, California, United States (On-site)$210K – $385K Yearly
2mo agoNV
Senior DL Algorithms Engineer - Inference Performance
NVIDIA
Santa Clara, California, United States (On-site)$184K – $356.5K Yearly
3mo agoCO
Audio Inference Engineer, Model Efficiency
Cohere
New York, United States or Remote (New York, United States + 3 more)
3mo agoOP
Software Engineer, Model Inference
OpenAI
San Francisco, California, United States (On-site)$325K – $490K Yearly
2mo agoNV
AI Inference Performance Engineer - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)$124K – $241.5K Yearly
3w agoTM
Research Engineer, Infrastructure, Inference
Thinking Machines Lab
San Francisco, California, United States (On-site)$350K – $475K Yearly
3mo agoD-
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Santa Clara, Ca, Ca, United States or Remote (California, United States)$30 – $59 Hourly
2mo agoNV
Senior AI Inference Compiler Engineer
NVIDIA
Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
3mo agoCA
2mo agoNV
Principal Software Engineer - AI Inference
NVIDIA
Santa Clara, California, United States (On-site)$272K – $431.3K Yearly
3mo agoOP
Software Engineer, Inference – AMD GPU Enablement
OpenAI
San Francisco, California, United States (On-site)$325K – $490K Yearly
2w agoAN
Senior/Staff Software Engineer, Inference
Anthropic
San Francisco, California, United States (Hybrid)$300K – $485K Yearly