vLLM Jobs in San Francisco, California, United States

Discover vLLM roles in San Francisco, California, United States on Inference Jobs and apply today.

3mo agoSE

ML Model Serving Engineer

Sesame

San Francisco, California, United States (On-site)$175K – $280K Yearly
3w agoTM

Research Engineer, Infrastructure, Inference

Thinking Machines Lab

San Francisco, California, United States (On-site)$350K – $475K Yearly
4w agoTA

AI Researcher, Core ML

Together AI

San Francisco, California, United States (On-site)$200K – $280K Yearly
3mo agoCO

Member of Technical Staff, Model Efficiency

Cohere

New York, United States or Remote (New York, United States + 3 more)
4w agoTA

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site)$160K – $230K Yearly
3mo agoCE

AI Models, Product Manager

Cerebras

Sunnyvale, California, United States or Remote (United States)
1mo agoNE
4w agoTA

Machine Learning Engineer

Together AI

San Francisco, California, United States (On-site)$160K – $220K Yearly
3mo agoOP
2mo agoTA

Research Engineer, Core ML

Together AI

San Francisco, California, United States (On-site)$200K – $280K Yearly
3mo agoBA

Software Engineer - Model API's

Baseten

San Francisco, California, United States (On-site)$150K – $230K Yearly
3mo agoCA

Inference Engineer

Cartesia

San Francisco, California, United States (On-site)$180K – $250K Yearly
3mo agoOP

Software Engineer, Inference – AMD GPU Enablement

OpenAI

San Francisco, California, United States (On-site)$325K – $490K Yearly
3mo agoPO

Member of Engineering (Pre-training / Data Engineering)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa + 1 more)
4w agoSC

AI Infrastructure Engineer, Model Serving Platform

Scale

San Francisco, California, United States (On-site)$179.4K – $224.3K Yearly
1mo agoTA

Engineering Manager, Model Serving

Together AI

San Francisco, California, United States (On-site)$250K – $300K Yearly