Latency Optimization Jobs
Explore Latency Optimization roles on Inference Jobs and apply today.
2mo agoNV
Senior AI Inference Compiler Engineer
NVIDIA
Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
3mo agoNV
Senior Power Analysis and Optimization Engineer, AI-LLM Systems
NVIDIA
Santa Clara, California, United States (On-site)$136K – $264.5K Yearly
3mo agoPE
AI Inference Engineer (San Francisco)
Perplexity
San Francisco, California, United States (On-site)$210K – $385K Yearly
3w agoOP
2mo agoNV
Senior Software Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)$152K – $287.5K Yearly
3mo agoOP
Inference Technical Lead, Sora
OpenAI
San Francisco, California, United States (Hybrid)$380K – $380K Yearly
2w agoTA
Senior Machine Learning Engineer, Voice AI
Together AI
San Francisco, California, United States (On-site)$200K – $260K Yearly
3mo agoD-
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Santa Clara, Ca, Ca, United States or Remote (California, United States)$30 – $59 Hourly
3mo agoSE
ML Model Serving Engineer
Sesame
San Francisco, California, United States (On-site)$175K – $280K Yearly