vLLM Jobs
Browse 73 vLLM jobs on Inference Jobs.
2w ago
Member of Technical Staff - GPU Infrastructure
Reflection AI
San Francisco, California, United States (On-site)
7d ago
Staff Engineer - Perf and Benchmarking
CoreWeave
Sunnyvale, California, United States (Hybrid)
$188k – $275k Yearly
7d ago
Senior Site Reliability Engineer — Token Factory (Inference Platform)
Nebius
Netherlands + 4 more (Remote)
7d ago
Sr. Software Engineer - Perf and Benchmarking
CoreWeave
Sunnyvale, California, United States (Hybrid)
$139k – $204k Yearly
2w ago
Member of Technical Staff - ML Performance
Modal
New York, New York, United States (On-site)
$150k – $270k Yearly
2w ago
Tech Lead Manager, Machine Learning Research Scientist - LLM Evals
Scale
San Francisco, California, United States (On-site)
$280k – $380k Yearly
4w ago
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Campbell, California, United States or Remote (California, United States)
$30 – $59 Hourly
3w ago
Staff Machine Learning Research Scientist, LLM Evals
Scale
San Francisco, California, United States (On-site)
$280k – $380k Yearly
2w ago
Senior Staff ML Researcher - LLM Algorithmic Optimization
d-Matrix
Bengaluru, Karnataka, India (Hybrid)
₹4M – ₹6M Yearly
2w ago
High-Performance LLM Training Engineer - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)
$124k – $195.5k Yearly
6d ago
Senior Research Scientist, Fundamental LLM Research for Knowledge, Reasoning, and Agents
NVIDIA
Santa Clara, California, United States (On-site)
$224k – $356.5k Yearly
4w ago
Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI
Scale
San Francisco, California, United States (On-site)
$252k – $315k Yearly
3w ago
Senior Research Scientist, Multi-Modal Language Models
NVIDIA
Santa Clara, California, United States (On-site)
$192k – $356.5k Yearly