vLLM Jobs in California, United States

Discover vLLM roles in California, United States on Inference Jobs and apply today.

3mo agoPL
3mo agoSE

ML Model Serving Engineer

Sesame

San Francisco, California, United States (On-site)$175K – $280K Yearly
20h agoCE
3w agoTM

Research Engineer, Infrastructure, Inference

Thinking Machines Lab

San Francisco, California, United States (On-site)$350K – $475K Yearly
4w agoTA

AI Researcher, Core ML

Together AI

San Francisco, California, United States (On-site)$200K – $280K Yearly
2mo agoNV

Principal Software Engineer - AI Inference

NVIDIA

Santa Clara, California, United States (On-site)$272K – $431.3K Yearly
2mo agoCE

Sr. MTS - Inference ML Eng

Cerebras

Sunnyvale, California, United States (On-site)
3mo agoCO

Member of Technical Staff, Model Efficiency

Cohere

New York, United States or Remote (New York, United States + 3 more)
4w agoTA

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site)$160K – $230K Yearly
3mo agoCE

AI Models, Product Manager

Cerebras

Sunnyvale, California, United States or Remote (United States)
1mo agoNE
4w agoTA

Machine Learning Engineer

Together AI

San Francisco, California, United States (On-site)$160K – $220K Yearly
1mo agoNV
3mo agoOP
2mo agoCO

Software Engineer, Inference AI/ML

CoreWeave

Sunnyvale, California, United States (Hybrid)$92K – $135K Yearly
2mo agoTA

Research Engineer, Core ML

Together AI

San Francisco, California, United States (On-site)$200K – $280K Yearly
3w agoET
3mo agoBA

Software Engineer - Model API's

Baseten

San Francisco, California, United States (On-site)$150K – $230K Yearly
2mo agoNV

Senior Software Engineer – TensorRT Edge-LLM

NVIDIA

Santa Clara, California, United States (Hybrid)$152K – $287.5K Yearly