vLLM Jobs in San Francisco, California, United States
Discover vLLM roles in San Francisco, California, United States on Inference Jobs and apply today.
3mo agoSE
ML Model Serving Engineer
Sesame
San Francisco, California, United States (On-site)$175K – $280K Yearly
3w agoTM
Research Engineer, Infrastructure, Inference
Thinking Machines Lab
San Francisco, California, United States (On-site)$350K – $475K Yearly
4w agoTA
AI Researcher, Core ML
Together AI
San Francisco, California, United States (On-site)$200K – $280K Yearly
3mo agoCO
Member of Technical Staff, Model Efficiency
Cohere
New York, United States or Remote (New York, United States + 3 more)
4w agoTA
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site)$160K – $230K Yearly
3mo agoCE
4w agoTA
Machine Learning Engineer
Together AI
San Francisco, California, United States (On-site)$160K – $220K Yearly
3mo agoOP
Software Engineer, Inference - Multi Modal
OpenAI
San Francisco, California, United States (On-site)$325K – $490K Yearly
2mo agoTA
Research Engineer, Core ML
Together AI
San Francisco, California, United States (On-site)$200K – $280K Yearly
3mo agoBA
Software Engineer - Model API's
Baseten
San Francisco, California, United States (On-site)$150K – $230K Yearly
3mo agoCA
3mo agoOP
Software Engineer, Inference – AMD GPU Enablement
OpenAI
San Francisco, California, United States (On-site)$325K – $490K Yearly
2mo agoNE
3mo agoPO
Member of Engineering (Pre-training / Data Engineering)
Poolside
United Kingdom or Remote (Europe, Middle East, and Africa + 1 more)
4w agoSC
AI Infrastructure Engineer, Model Serving Platform
Scale
San Francisco, California, United States (On-site)$179.4K – $224.3K Yearly
1mo agoTA
Engineering Manager, Model Serving
Together AI
San Francisco, California, United States (On-site)$250K – $300K Yearly
4w agoNE
Senior Site Reliability Engineer — Token Factory (Inference Platform)
Nebius
United States + 4 more (Remote)