vLLM Jobs

Browse 73 vLLM jobs on Inference Jobs.

41-60 of 73 jobs

2wRA

Member of Technical Staff - GPU Infrastructure

Reflection AI

San Francisco, California, United States (On-site)
7dCO

Staff Engineer - Perf and Benchmarking

CoreWeave

Sunnyvale, California, United States (Hybrid)$188k – $275k Yearly
7dCO

Sr. Software Engineer - Perf and Benchmarking

CoreWeave

Sunnyvale, California, United States (Hybrid)$139k – $204k Yearly
2wMO

Member of Technical Staff - ML Performance

Modal

New York, New York, United States (On-site)$150k – $270k Yearly
2wPE

Data Scientist, Evals

Perplexity

London, England, United Kingdom (On-site)$210k – $385k Yearly
2wSC

Tech Lead Manager, Machine Learning Research Scientist- LLM Evals

Scale

San Francisco, California, United States (On-site)$280k – $380k Yearly
4wD-

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

d-Matrix

Campbell, California, United States or Remote (California, United States)$30 – $59 Hourly
7dCE

Full Stack LLM Engineer

Cerebras

Toronto, Ontario, Canada (On-site)
3wSC

Staff Machine Learning Research Scientist, LLM Evals

Scale

San Francisco, California, United States (On-site)$280k – $380k Yearly
2wD-

Senior Staff ML Researcher - LLM Algorithmic Optimization

d-Matrix

Bengaluru, Karnataka, India (Hybrid)₹4M – ₹6M Yearly
2wNV

High-Performance LLM Training Engineer - New College Grad 2026

NVIDIA

Santa Clara, California, United States (On-site)$124k – $195.5k Yearly
2wCE

Senior Full Stack LLM Engineer - Training

Cerebras

Sunnyvale, California, United States (On-site)
6dNV
4wSC

Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI

Scale

San Francisco, California, United States (On-site)$252k – $315k Yearly
3wNV

Senior Research Scientist, Multi-Modal Language Models

NVIDIA

Santa Clara, California, United States (On-site)$192k – $356.5k Yearly