vLLM Jobs in California, United States

Discover vLLM roles in California, United States on Inference Jobs and apply today.

3mo agoCA

Inference Engineer

Cartesia

San Francisco, California, United States (On-site)$180K – $250K Yearly
3mo agoOP

Software Engineer, Inference – AMD GPU Enablement

OpenAI

San Francisco, California, United States (On-site)$325K – $490K Yearly
1mo agoNV
2mo agoNV

Senior Software Engineer, AI Inference Systems

NVIDIA

Santa Clara, California, United States (Hybrid)$184K – $356.5K Yearly
3mo agoHA
3mo agoPO

Member of Engineering (Pre-training / Data Engineering)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa + 1 more)
4w agoSC

AI Infrastructure Engineer, Model Serving Platform

Scale

San Francisco, California, United States (On-site)$179.4K – $224.3K Yearly
1mo agoTA

Engineering Manager, Model Serving

Together AI

San Francisco, California, United States (On-site)$250K – $300K Yearly
3w agoET
2d agoNV

Principal Deep Learning Communication Architect

NVIDIA

Santa Clara, California, United States (On-site)$272K – $431.3K Yearly
2mo agoBA

Software Engineer — GPU Networking & Distributed Systems

Baseten

San Francisco, California, United States (On-site)$150K – $250K Yearly
2w agoTA

Senior Machine Learning Engineer, Voice AI

Together AI

San Francisco, California, United States (On-site)$200K – $260K Yearly
4w agoTA

Machine Learning Engineer - Inference

Together AI

San Francisco, California, United States (On-site)$160K – $230K Yearly
3mo agoBA

Software Engineer - Model Performance

Baseten

San Francisco, California, United States (On-site)$150K – $250K Yearly
2mo agoNV

Principal GenAI Engagement Lead, Partner Platforms

NVIDIA

Santa Clara, California, United States (Hybrid)$272K – $431.3K Yearly