1. Home
  2. Jobs
  3. Latency Optimization

Latency Optimization Jobs

Explore Latency Optimization roles on Inference Jobs and apply today.

3mo agoBA

Engineering Manager - Model Performance

Baseten

San Francisco, California, United States (On-site)$230K – $300K Yearly
1w agoAN

TPU Kernel Engineer

Anthropic

San Francisco, California, United States (Hybrid)$280K – $850K Yearly
3mo agoCO

Audio Inference Engineer, Model Efficiency

Cohere

New York, United States or Remote (New York, United States + 3 more)
2mo agoAN

[P] Compute Efficiency Engineer

Anthropic

San Francisco, California, United States (Hybrid)$1 – $2 Yearly
2mo agoNV

Senior Software Engineer – TensorRT Edge-LLM

NVIDIA

Santa Clara, California, United States (Hybrid)$152K – $287.5K Yearly
3mo agoCO

Member of Technical Staff, Model Efficiency

Cohere

New York, United States or Remote (New York, United States + 3 more)
1mo agoNV

Senior Performance Engineer - Deep Learning

NVIDIA

Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
3mo agoOP

Software Engineer, Model Inference

OpenAI

San Francisco, California, United States (On-site)$325K – $490K Yearly
1mo agoNV

Senior DL Algorithms Engineer - Inference Performance

NVIDIA

Santa Clara, California, United States (On-site)$184K – $356.5K Yearly
3w agoET

Front-End Power Engineer

Etched

Cupertino, California, United States (On-site)