LLM Inference Jobs in California, United States
Discover LLM Inference roles in California, United States on Inference Jobs and apply today.
3mo ago
Inference Engineering Manager
Perplexity
San Francisco, California, United States (On-site) · $300K – $385K Yearly
2mo ago
Senior Research Scientist, Fundamental LLM Research for Knowledge, Reasoning, and Agents
NVIDIA
Santa Clara, California, United States (On-site) · $224K – $356.5K Yearly
1mo ago
Research Scientist, Fundamental LLM Research for Knowledge, Reasoning, and Agents - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site) · $168K – $264.5K Yearly
2w ago
Senior/Staff Software Engineer, Inference
Anthropic
San Francisco, California, United States (Hybrid) · $300K – $485K Yearly
2mo ago
Senior Software Engineer – TensorRT Edge-LLM
NVIDIA
Santa Clara, California, United States (Hybrid) · $152K – $287.5K Yearly
2mo ago
Staff Software Engineer, ML Infrastructure
Decagon
San Francisco, California, United States (On-site) · $300K – $430K Yearly
4w ago
Machine Learning Engineer - Inference
Together AI
San Francisco, California, United States (On-site) · $160K – $230K Yearly
3w ago
Research Engineer, Infrastructure, Inference
Thinking Machines Lab
San Francisco, California, United States (On-site) · $350K – $475K Yearly
2mo ago
Software Engineer, Inference AI/ML
CoreWeave
Sunnyvale, California, United States (Hybrid) · $92K – $135K Yearly
4w ago
ML Research Engineer, ML Systems
Scale
San Francisco, California, United States (On-site) · $218.4K – $273K Yearly
1mo ago
Engineering Manager, Cloud Inference AWS
Anthropic
San Francisco, California, United States (Hybrid) · $405K – $485K Yearly
3mo ago
ML Model Serving Engineer
Sesame
San Francisco, California, United States (On-site) · $175K – $280K Yearly
2mo ago
Staff + Senior Software Engineer, Cloud Inference
Anthropic
San Francisco, California, United States (Hybrid) · $300K – $485K Yearly
3mo ago
Software Engineer, Data Infrastructure - Research
OpenAI
San Francisco, California, United States (On-site) · $250K – $380K Yearly
2mo ago
Senior System Software Engineer - Dynamo-Triton Inference Server
NVIDIA
Santa Clara, California, United States (On-site) · $152K – $241.5K Yearly
3mo ago
Senior Software Engineer I, Inference
CoreWeave
Sunnyvale, California, United States (Hybrid) · $139K – $204K Yearly
1mo ago
Engineering Manager, Model Serving
Together AI
San Francisco, California, United States (On-site) · $250K – $300K Yearly
3mo ago
Senior Software Engineer II, Inference
CoreWeave
Sunnyvale, California, United States (Hybrid) · $165K – $242K Yearly