- Home
- Jobs
- United States
- California
- LLM Inference
LLM Inference Jobs in California, United States
Discover LLM Inference roles in California, United States on Inference Jobs and apply today.
4w agoTA
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site)$160K – $230K Yearly
4w agoTA
Machine Learning Engineer
Together AI
San Francisco, California, United States (On-site)$160K – $220K Yearly
3mo agoCO
Member of Technical Staff, Model Efficiency
Cohere
New York, United States or Remote (New York, United States + 3 more)
3mo agoD-
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Santa Clara, Ca, Ca, United States or Remote (California, United States)$30 – $59 Hourly
2mo agoNV
Principal Software Engineer - AI Inference
NVIDIA
Santa Clara, California, United States (On-site)$272K – $431.3K Yearly
3mo agoBA
Software Engineer - Model Performance
Baseten
San Francisco, California, United States (On-site)$150K – $250K Yearly
1mo agoNV
AI Inference Performance Engineer - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)$124K – $241.5K Yearly
3mo agoPE
AI Inference Engineer (San Francisco)
Perplexity
San Francisco, California, United States (On-site)$210K – $385K Yearly
2mo agoNV
Senior AI Inference Compiler Engineer
NVIDIA
Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
3mo agoOP
3mo agoPO
Member of Engineering (Pre-training and inference software)
Poolside
United Kingdom or Remote (Europe, Middle East, and Africa, North America)
4w agoAN
Engineering Manager, Inference Routing and Performance
Anthropic
San Francisco, California, United States (Hybrid)$405K – $485K Yearly
3mo agoOP
Software Engineer, Inference - Multi Modal
OpenAI
San Francisco, California, United States (On-site)$325K – $490K Yearly
1mo agoNV
Senior DL Algorithms Engineer - Inference Performance
NVIDIA
Santa Clara, California, United States (On-site)$184K – $356.5K Yearly
20h agoCE