vLLM Jobs
Explore vLLM roles on Inference Jobs and apply today.
2mo ago
Senior Software Engineer - VLM Microservices for Neural Reconstruction
NVIDIA
Santa Clara, California, US
$152K – $287.5K Yearly
2mo ago
Senior Software Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)
$152K – $287.5K Yearly
3mo ago
ML Model Serving Engineer
Sesame
San Francisco, California, United States (On-site)
$175K – $280K Yearly
3w ago
Research Engineer, Infrastructure, Inference
Thinking Machines Lab
San Francisco, California, United States (On-site)
$350K – $475K Yearly
3w ago
AI Researcher, Core ML
Together AI
San Francisco, California, United States (On-site)
$200K – $280K Yearly
2mo ago
Principal Software Engineer - AI Inference
NVIDIA
Santa Clara, California, United States (On-site)
$272K – $431.3K Yearly
3mo ago
Machine Learning Engineer, Open-Source Software - Paris/London
Mistral AI
Paris, Paris, France (Hybrid)
3mo ago
Member of Technical Staff, Model Efficiency
Cohere
New York, United States or Remote (New York, United States + 3 more)
3w ago
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site)
$160K – $230K Yearly
2mo ago
Senior Software Engineer, AI Inference Systems
NVIDIA
Toronto, Ontario, Canada (Hybrid)
C$170K – C$275K Yearly
3w ago
Machine Learning Engineer
Together AI
San Francisco, California, United States (On-site)
$160K – $220K Yearly