vLLM Jobs in California, United States
Discover vLLM roles in California, United States on Inference Jobs and apply today.
3mo agoSE
ML Model Serving Engineer
Sesame
San Francisco, California, United States (On-site)$175K – $280K Yearly
20h agoCE
3w agoTM
Research Engineer, Infrastructure, Inference
Thinking Machines Lab
San Francisco, California, United States (On-site)$350K – $475K Yearly
3w agoCE
4w agoTA
AI Researcher, Core ML
Together AI
San Francisco, California, United States (On-site)$200K – $280K Yearly
2mo agoNV
Principal Software Engineer - AI Inference
NVIDIA
Santa Clara, California, United States (On-site)$272K – $431.3K Yearly
3mo agoCO
Member of Technical Staff, Model Efficiency
Cohere
New York, United States or Remote (New York, United States + 3 more)
4w agoTA
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site)$160K – $230K Yearly
3mo agoCE
4w agoTA
Machine Learning Engineer
Together AI
San Francisco, California, United States (On-site)$160K – $220K Yearly
1mo agoNV
AI Inference Performance Engineer - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)$124K – $241.5K Yearly
3mo agoOP
Software Engineer, Inference - Multi Modal
OpenAI
San Francisco, California, United States (On-site)$325K – $490K Yearly
2mo agoCO
Software Engineer, Inference AI/ML
CoreWeave
Sunnyvale, California, United States (Hybrid)$92K – $135K Yearly
2mo agoTA
Research Engineer, Core ML
Together AI
San Francisco, California, United States (On-site)$200K – $280K Yearly
3mo agoBA
Software Engineer - Model API's
Baseten
San Francisco, California, United States (On-site)$150K – $230K Yearly
2mo agoNV
Senior Software Engineer – TensorRT Edge-LLM
NVIDIA
Santa Clara, California, United States (Hybrid)$152K – $287.5K Yearly