vLLM Jobs in United States

Discover vLLM roles in United States on Inference Jobs and apply today.

3mo agoSE

ML Model Serving Engineer

Sesame

San Francisco, California, United States (On-site)$175K – $280K Yearly
3mo agoHA
3mo agoPL
4w agoTA

Machine Learning Engineer

Together AI

San Francisco, California, United States (On-site)$160K – $220K Yearly
2mo agoNV

Senior Software Engineer – TensorRT Edge-LLM

NVIDIA

Santa Clara, California, United States (Hybrid)$152K – $287.5K Yearly
3d agoNV

Senior Deep Learning Software Engineer, LLM Performance

NVIDIA

Santa Clara, California, United States (On-site)$184K – $356.5K Yearly
2mo agoNV

Principal Software Engineer - AI Inference

NVIDIA

Santa Clara, California, United States (On-site)$272K – $431.3K Yearly
3mo agoCO

Member of Technical Staff, Model Efficiency

Cohere

New York, United States or Remote (New York, United States + 3 more)
4w agoTA

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site)$160K – $230K Yearly
2mo agoNV

Senior Machine Learning Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)$152K – $287.5K Yearly
3mo agoBA

Software Engineer - Model Performance

Baseten

San Francisco, California, United States (On-site)$150K – $250K Yearly
1mo agoNV
3mo agoD-
4w agoSC

AI Infrastructure Engineer, Model Serving Platform

Scale

San Francisco, California, United States (On-site)$179.4K – $224.3K Yearly
2mo agoNV

Senior Software Engineer, AI Inference Systems

NVIDIA

Santa Clara, California, United States (Hybrid)$184K – $356.5K Yearly
2mo agoCE

Sr. MTS - Inference ML Eng

Cerebras

Sunnyvale, California, United States (On-site)
3w agoTA

Senior Machine Learning Engineer, Voice AI

Together AI

San Francisco, California, United States (On-site)$200K – $260K Yearly