vLLM Jobs
Browse 73 vLLM jobs on Inference Jobs.
Posted 2w ago
Senior Software Engineer - VLM Microservices for Neural Reconstruction
NVIDIA
Santa Clara, California, United States (On-site) · $152k – $287.5k Yearly
Posted 2w ago
Engineering Manager - Forward Deployed Engineering (LLM)
Baseten
San Francisco, California, United States (On-site) · $220k – $285k Yearly
Posted 2w ago
Senior Software Engineer – TensorRT Edge-LLM
NVIDIA
Santa Clara, California, United States (Hybrid) · $152k – $287.5k Yearly
Posted 2w ago
Member of Technical Staff, Model Efficiency
Cohere
New York, New York, United States or Remote (New York, United States + 3 more)
Posted 6d ago
Research Engineer, Infrastructure, Inference
Thinking Machines Lab
San Francisco, California, United States (On-site) · $350k – $475k Yearly
Posted 1d ago
Senior Software Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site) · $152k – $287.5k Yearly
Posted 6d ago
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site) · $160k – $230k Yearly
Posted 2w ago
Senior Software Engineer, AI Inference Systems
NVIDIA
Santa Clara, California, United States (Hybrid) · $184k – $356.5k Yearly
Posted 6d ago
Machine Learning Engineer - Inference
Together AI
San Francisco, California, United States (On-site) · $160k – $230k Yearly
Posted 6d ago
Principal Engineer, Inference
CoreWeave
Sunnyvale, California, United States (Hybrid) · $206k – $303k Yearly
Posted 2w ago
Senior Deep Learning Engineer
NVIDIA
Warszawa, Masovian Voivodeship, Poland (Hybrid) · zł 292.5k – zł 507k Yearly
Posted 5d ago
Senior Software Engineer, AI Inference Systems
NVIDIA
Toronto, Ontario, Canada (Hybrid) · C$170k – C$275k Yearly
Posted 6d ago
AI Infrastructure Engineer, Model Serving Platform
Scale
San Francisco, California, United States (On-site) · $179.4k – $224.3k Yearly