1. Home
  2. Jobs
  3. Latency Optimization

Latency Optimization Jobs

Explore Latency Optimization roles on Inference Jobs and apply today.

2mo agoNV

Senior AI Inference Compiler Engineer

NVIDIA

Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
3w agoET
4w agoCE
1mo agoNV
3mo agoPE

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site)$210K – $385K Yearly
3w agoOP

TL, Research Inference

OpenAI

San Francisco, California, United States (On-site)$380K – $555K Yearly
3mo agoHA
3mo agoPL
2mo agoNV

Senior Software Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)$152K – $287.5K Yearly
3mo agoOP

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid)$380K – $380K Yearly
4w agoCE
2w agoTA

Senior Machine Learning Engineer, Voice AI

Together AI

San Francisco, California, United States (On-site)$200K – $260K Yearly
3mo agoD-
3mo agoSE

ML Model Serving Engineer

Sesame

San Francisco, California, United States (On-site)$175K – $280K Yearly