Latency Optimization Jobs
Explore Latency Optimization roles on Inference Jobs and apply today.
2mo agoNV
Senior Machine Learning Applications and Compiler Engineer
NVIDIA
Cambridge, England, United Kingdom (Hybrid)
3mo agoBA
Engineering Manager - Model Performance
Baseten
San Francisco, California, United States (On-site)$230K – $300K Yearly
1w agoAN
3mo agoCO
Audio Inference Engineer, Model Efficiency
Cohere
New York, United States or Remote (New York, United States + 3 more)
2mo agoAN
[P] Compute Efficiency Engineer
Anthropic
San Francisco, California, United States (Hybrid)$1 – $2 Yearly
2mo agoNV
Senior Software Engineer – TensorRT Edge-LLM
NVIDIA
Santa Clara, California, United States (Hybrid)$152K – $287.5K Yearly
3mo agoCO
Member of Technical Staff, Model Efficiency
Cohere
New York, United States or Remote (New York, United States + 3 more)
2w agoNV
1mo agoNV
Senior Performance Engineer - Deep Learning
NVIDIA
Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
3mo agoOP
Software Engineer, Model Inference
OpenAI
San Francisco, California, United States (On-site)$325K – $490K Yearly
1mo agoNV
Senior DL Algorithms Engineer - Inference Performance
NVIDIA
Santa Clara, California, United States (On-site)$184K – $356.5K Yearly