vLLM Jobs in Santa Clara, California, United States

Discover vLLM roles in Santa Clara, California, United States on Inference Jobs and apply today.

2mo agoNV

Principal Software Engineer - AI Inference

NVIDIA

Santa Clara, California, United States (On-site)$272K – $431.3K Yearly
3mo agoCO

Member of Technical Staff, Model Efficiency

Cohere

New York, United States or Remote (New York, United States + 3 more)
3mo agoCE

AI Models, Product Manager

Cerebras

Sunnyvale, California, United States or Remote (United States)
1mo agoNE
1mo agoNV
2mo agoNV

Senior Software Engineer – TensorRT Edge-LLM

NVIDIA

Santa Clara, California, United States (Hybrid)$152K – $287.5K Yearly
2mo agoNV

Senior Scientist, Synthetic Data and Privacy

NVIDIA

Santa Clara, California, United States (On-site)$192K – $356.5K Yearly
1mo agoNV
2mo agoNV

Senior Software Engineer, AI Inference Systems

NVIDIA

Santa Clara, California, United States (Hybrid)$184K – $356.5K Yearly
3mo agoPO

Member of Engineering (Pre-training / Data Engineering)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa + 1 more)
1d agoNV

Principal Deep Learning Communication Architect

NVIDIA

Santa Clara, California, United States (On-site)$272K – $431.3K Yearly
1mo agoPE

Backend Software Engineer

Perplexity

Worldwide (Remote)$220K – $405K Yearly
2mo agoNV

Principal GenAI Engagement Lead, Partner Platforms

NVIDIA

Santa Clara, California, United States (Hybrid)$272K – $431.3K Yearly
2mo agoNV
2mo agoNV
3mo agoCO

Audio Inference Engineer, Model Efficiency

Cohere

New York, United States or Remote (New York, United States + 3 more)