Latency Optimization Jobs
Browse 317 Latency Optimization jobs on Inference Jobs.
81-100 of 317 jobs
6dTM
Research Engineer, Infrastructure, Kernels
Thinking Machines Lab
San Francisco, California, United States (On-site)$350k – $475k Yearly
3dNV
Senior Machine Learning Applications and Compiler Engineer
NVIDIA
Cambridge, England, United Kingdom (Hybrid)
1dNV
Senior Software Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
2wAI
Software Engineer - Controls
Applied Intuition
Sunnyvale, California, United States (On-site)$170k – $250k Yearly
2wSE
Embedded ML Engineer – Gesture Recognition
Sesame
San Francisco, California, United States (On-site)$175k – $280k Yearly
3dNV
Senior Compiler Engineer - AI
NVIDIA
Santa Clara, California, United States (On-site)$184k – $287.5k Yearly
6dAN
Research Engineer, Pretraining Scaling
Anthropic
San Francisco, California, United States (On-site)$315k – $560k Yearly
4wNV
Deep Learning Algorithm Engineer - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)$124k – $241.5k Yearly
6dAN
Research Engineer, Pretraining Scaling (London)
Anthropic
London, England, United Kingdom (On-site)£250k – £435k Yearly
1wNV
Senior Software R&D Engineer, VLSI Physical Design
NVIDIA
Santa Clara, California, United States (Hybrid)$168k – $264.5k Yearly
4wNV
Software Engineer, CPU and SoC Profiling Tools
NVIDIA
Ustroń, Silesian Voivodeship, Poland or Remote (Poland)zł 183.8k – zł 318.5k Yearly
4wEL
3wAN
[P] Compute Efficiency Engineer
Anthropic
San Francisco, California, United States (Hybrid)$1 – $2 Yearly
2wOP
Inference Technical Lead, Sora
OpenAI
San Francisco, California, United States (Hybrid)$380k – $380k Yearly
2wPE