vLLM Jobs in Austin, Texas, United States
Discover vLLM roles in Austin, Texas, United States on Inference Jobs and apply today.
3mo agoCO
Member of Technical Staff, Model Efficiency
Cohere
New York, United States or Remote (New York, United States + 3 more)
3mo agoCE
2mo agoNE
2mo agoPO
Member of Engineering (Pre-training / Data Engineering)
Poolside
United Kingdom or Remote (Europe, Middle East, and Africa + 1 more)
4w agoNE
Senior Site Reliability Engineer — Token Factory (Inference Platform)
Nebius
United States + 4 more (Remote)
3mo agoCO
3mo agoCO
Audio Inference Engineer, Model Efficiency
Cohere
New York, United States or Remote (New York, United States + 3 more)
1mo agoCO
2mo agoNV
Senior Power Methodology and Modeling Engineer
NVIDIA
Austin, Texas, United States (On-site)$136K – $264.5K Yearly
3mo agoD-
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Santa Clara, Ca, Ca, United States or Remote (California, United States)$30 – $59 Hourly
2mo agoTE
C++ Machine Learning Engineer, Models Training
Tenstorrent
Austin, Texas, United States (Hybrid)$100K – $500K Yearly
3w agoBA
3mo agoCO
Member of Technical Staff, Integration/RL Team (Research Engineer)
Cohere
Paris, Paris, France or Remote (United States + 3 more)
3mo agoD-
Software Engineer, Staff - SIMD Kernels
d-Matrix
Santa Clara, Ca, Ca, United States or Remote (United States)$190K – $300K Yearly