vLLM Jobs in California, United States
Discover vLLM roles in California, United States on Inference Jobs and apply today.
3mo agoCA
3mo agoOP
Software Engineer, Inference – AMD GPU Enablement
OpenAI
San Francisco, California, United States (On-site)$325K – $490K Yearly
1mo agoNV
Senior Deep Learning Algorithm Engineer, Training Framework
NVIDIA
Santa Clara, California, United States (Hybrid)$184K – $287.5K Yearly
2mo agoNV
Senior Software Engineer, AI Inference Systems
NVIDIA
Santa Clara, California, United States (Hybrid)$184K – $356.5K Yearly
2mo agoNE
3mo agoPO
Member of Engineering (Pre-training / Data Engineering)
Poolside
United Kingdom or Remote (Europe, Middle East, and Africa + 1 more)
4w agoSC
AI Infrastructure Engineer, Model Serving Platform
Scale
San Francisco, California, United States (On-site)$179.4K – $224.3K Yearly
1mo agoTA
Engineering Manager, Model Serving
Together AI
San Francisco, California, United States (On-site)$250K – $300K Yearly
4w agoNE
Senior Site Reliability Engineer — Token Factory (Inference Platform)
Nebius
United States + 4 more (Remote)
3mo agoCO
2d agoNV
Principal Deep Learning Communication Architect
NVIDIA
Santa Clara, California, United States (On-site)$272K – $431.3K Yearly
2mo agoBA
Software Engineer — GPU Networking & Distributed Systems
Baseten
San Francisco, California, United States (On-site)$150K – $250K Yearly
2w agoTA
Senior Machine Learning Engineer, Voice AI
Together AI
San Francisco, California, United States (On-site)$200K – $260K Yearly
4w agoTA
Machine Learning Engineer - Inference
Together AI
San Francisco, California, United States (On-site)$160K – $230K Yearly
3mo agoBA
Software Engineer - Model Performance
Baseten
San Francisco, California, United States (On-site)$150K – $250K Yearly
2mo agoNV
Principal GenAI Engagement Lead, Partner Platforms
NVIDIA
Santa Clara, California, United States (Hybrid)$272K – $431.3K Yearly