vLLM Jobs in Washington, United States
Discover vLLM roles in Washington, United States on Inference Jobs and apply today.
2mo ago
Senior Software Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)
$152K – $287.5K Yearly
3mo ago
Member of Technical Staff, Model Efficiency
Cohere
New York, United States or Remote (New York, United States + 3 more)
2mo ago
Member of Engineering (Pre-training / Data Engineering)
Poolside
United Kingdom or Remote (Europe, Middle East, and Africa + 1 more)
4w ago
Senior Site Reliability Engineer — Token Factory (Inference Platform)
Nebius
United States + 4 more (Remote)
2mo ago
Senior Machine Learning Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)
$152K – $287.5K Yearly
3mo ago
Audio Inference Engineer, Model Efficiency
Cohere
New York, United States or Remote (New York, United States + 3 more)
2mo ago
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Santa Clara, CA, United States or Remote (California, United States)
$30 – $59 Hourly
3mo ago
Member of Technical Staff, Integration/RL Team (Research Engineer)
Cohere
Paris, Paris, France or Remote (United States + 3 more)
3mo ago
Software Engineer, Staff - SIMD Kernels
d-Matrix
Santa Clara, CA, United States or Remote (United States)
$190K – $300K Yearly