1. Home
  2. Jobs
  3. Low-Latency Inference

Low-Latency Inference Jobs

Explore Low-Latency Inference roles on Inference Jobs and apply today.

3mo agoOP

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid)$380K – $380K Yearly
3mo agoD-
1mo agoAN
3w agoET
2mo agoAN

Staff Software Engineer, Inference

Anthropic

London, England, United Kingdom (Hybrid)£325K – £390K Yearly
3mo agoOP

Software Engineer, Model Inference

OpenAI

San Francisco, California, United States (On-site)$325K – $490K Yearly
4w agoAN
17h agoNV

Senior Software Engineer - AI Inference

NVIDIA

Santa Clara, California, United States (On-site)$152K – $287.5K Yearly
3mo agoOP

Software Engineer, Inference – AMD GPU Enablement

OpenAI

San Francisco, California, United States (On-site)$325K – $490K Yearly
2mo agoNV

Senior Software Engineer – TensorRT Edge-LLM

NVIDIA

Santa Clara, California, United States (Hybrid)$152K – $287.5K Yearly
3w agoTM

Research, Audio Expertise

Thinking Machines Lab

San Francisco, California, United States (On-site)$350K – $475K Yearly