vLLM Jobs in Potrero Hill, California, United States
Discover vLLM roles in Potrero Hill, California, United States on Inference Jobs and apply today.
3mo agoCO
Member of Technical Staff, Model Efficiency
Cohere
New York, United States or Remote (New York, United States + 3 more)
3mo agoCE
2mo agoNE
2mo agoPO
Member of Engineering (Pre-training / Data Engineering)
Poolside
United Kingdom or Remote (Europe, Middle East, and Africa + 1 more)
4w agoNE
Senior Site Reliability Engineer — Token Factory (Inference Platform)
Nebius
United States + 4 more (Remote)
3mo agoCO
3mo agoCO
Audio Inference Engineer, Model Efficiency
Cohere
New York, United States or Remote (New York, United States + 3 more)
1mo agoCO
2mo agoD-
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Santa Clara, Ca, Ca, United States or Remote (California, United States)$30 – $59 Hourly
3w agoBA
3mo agoCO
Member of Technical Staff, Integration/RL Team (Research Engineer)
Cohere
Paris, Paris, France or Remote (United States + 3 more)
3mo agoD-
Software Engineer, Staff - SIMD Kernels
d-Matrix
Santa Clara, Ca, Ca, United States or Remote (United States)$190K – $300K Yearly
3w agoD-
Compiler Architect
d-Matrix
Santa Clara, California, US or Remote (United States + 1 more)$190K – $300K Yearly
3mo agoPO
Member of Engineering (Pre-training and inference software)
Poolside
United Kingdom or Remote (Europe, Middle East, and Africa, North America)