vLLM Jobs
Explore vLLM roles on Inference Jobs and apply today.
1mo ago
AI Inference Performance Engineer - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)
$124K – $241.5K Yearly
2mo ago
Research Engineer, Core ML
Together AI
San Francisco, California, United States (On-site)
$200K – $280K Yearly
3mo ago
Software Engineer - Model API's
Baseten
San Francisco, California, United States (On-site)
$150K – $230K Yearly
2mo ago
Senior Software Engineer – TensorRT Edge-LLM
NVIDIA
Santa Clara, California, United States (Hybrid)
$152K – $287.5K Yearly
2w ago
Senior Deep Learning Software Engineer, Inference
NVIDIA
Netherlands + 1 more (Remote)
zł 221.3K – zł 383.5K Yearly
2mo ago
Senior Software Engineer, AI Inference Systems
NVIDIA
Santa Clara, California, United States (Hybrid)
$184K – $356.5K Yearly
3mo ago
Member of Technical Staff - ML Performance
Modal
New York, United States (On-site)
$150K – $270K Yearly
4w ago
AI Infrastructure Engineer, Model Serving Platform
Scale
San Francisco, California, United States (On-site)
$179.4K – $224.3K Yearly
1mo ago
Engineering Manager, Model Serving
Together AI
San Francisco, California, United States (On-site)
$250K – $300K Yearly
4w ago
Senior Site Reliability Engineer — Token Factory (Inference Platform)
Nebius
United States + 4 more (Remote)