- Home
- Jobs
- United States
- California
- San Francisco
- Inference Engineer
Inference Engineer jobs in San Francisco, California, United States
Discover Inference Engineer roles in San Francisco, California, United States on Inference Jobs and apply today.
20 jobs
AI Inference Engineer (San Francisco)
Perplexity
San Francisco, California, United States (On-site)
$210k – $385k Yearly
Inference Engineering Manager
Perplexity
San Francisco, California, United States (On-site)
$300k – $385k Yearly
Machine Learning Engineer - Inference
Together AI
San Francisco, California, United States (On-site)
$160k – $230k Yearly
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site)
$160k – $230k Yearly
Inference Runtime, Engineering Manager
OpenAI
San Francisco, California, United States (On-site)
$455k – $555k Yearly
Senior Site Reliability Engineer — Token Factory (Inference Platform)
Nebius
Netherlands + 4 more (Remote)
Inference Technical Lead, Sora
OpenAI
San Francisco, California, United States (Hybrid)
$380k – $380k Yearly
Software Engineer, Inference – AMD GPU Enablement
OpenAI
San Francisco, California, United States (On-site)
$325k – $490k Yearly
Member of Engineering (Pre-training and inference software)
Poolside
United Kingdom or Remote (Europe, Middle East, and Africa, North America)
Software Engineer, Load Balancing - Inference
OpenAI
San Francisco, California, United States (On-site)
$325k – $490k Yearly
Full-Stack Software Engineer, Inference
Cohere
Toronto, Ontario, Canada or Remote (Canada + 2 more)
Software Engineer, Inference Deployment
Anthropic
San Francisco, California, United States (Hybrid)
$320k – $485k Yearly
Engineering Manager, Inference
Anthropic
San Francisco, California, United States (Hybrid)
$425k – $560k Yearly
Fullstack Engineer, Applied AI
LangChain
San Francisco, California, United States (On-site)
$170k – $195k Yearly
Engineering Manager - Forward Deployed Engineering (LLM)
Baseten
San Francisco, California, United States (On-site)
$220k – $285k Yearly
Research Engineer, Infrastructure, Inference
Thinking Machines Lab
San Francisco, California, United States (On-site)
$350k – $475k Yearly
Software Engineer, Model Inference
OpenAI
San Francisco, California, United States (On-site)
$325k – $490k Yearly
GPU Systems Engineer – HPC / Parallel Computing
Vast.ai
San Francisco, California, United States (On-site)
$160k – $320k Yearly