Inference jobs in San Francisco, California, United States
Discover Inference roles in San Francisco, California, United States on Inference Jobs and apply today.
20 jobs
Inference Engineering Manager
Perplexity
San Francisco, California, United States (On-site)
$300k – $385k Yearly
Inference Runtime, Engineering Manager
OpenAI
San Francisco, California, United States (On-site)
$455k – $555k Yearly
Inference Technical Lead, Sora
OpenAI
San Francisco, California, United States (Hybrid)
$380k – $380k Yearly
AI Inference Engineer (San Francisco)
Perplexity
San Francisco, California, United States (On-site)
$210k – $385k Yearly
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site)
$160k – $230k Yearly
Staff Product Manager, Managed Inference (SF/Sunnyvale/New York)
Crusoe
San Francisco, California, United States or Remote (California, United States + 1 more)
$204k – $247k Yearly
Senior Site Reliability Engineer — Token Factory (Inference Platform)
Nebius
Netherlands + 4 more (Remote)
Software Engineer, Inference – AMD GPU Enablement
OpenAI
San Francisco, California, United States (On-site)
$325k – $490k Yearly
Member of Engineering (Pre-training and inference software)
Poolside
United Kingdom or Remote (Europe, Middle East, and Africa, North America)
Machine Learning Engineer - Inference
Together AI
San Francisco, California, United States (On-site)
$160k – $230k Yearly
Engineering Manager, Inference
Anthropic
San Francisco, California, United States (Hybrid)
$425k – $560k Yearly
Full-Stack Software Engineer, Inference
Cohere
Toronto, Ontario, Canada or Remote (Canada + 2 more)
Engineering Manager - Forward Deployed Engineering (LLM)
Baseten
San Francisco, California, United States (On-site)
$220k – $285k Yearly
Software Engineer, Load Balancing - Inference
OpenAI
San Francisco, California, United States (On-site)
$325k – $490k Yearly
Software Engineer, Inference Deployment
Anthropic
San Francisco, California, United States (Hybrid)
$320k – $485k Yearly
Research Economist, Economic Research
Anthropic
San Francisco, California, United States (Hybrid)
$275k – $370k Yearly
Research Engineer, Infrastructure, Inference
Thinking Machines Lab
San Francisco, California, United States (On-site)
$350k – $475k Yearly
Senior Technical Product Manager Token Factory - Inference
Nebius
United States (Remote)
$204k – $255k Yearly