Latency Jobs
Browse 13 Latency jobs on Inference Jobs.
13 jobs
1w
Senior Software Engineer, Voice Agent
Decagon
San Francisco, California, United States (On-site) $250k – $330k Yearly
5d
Inference Runtime, Engineering Manager
OpenAI
San Francisco, California, United States (On-site) $455k – $555k Yearly
3d
Software Engineer, ChatGPT Infrastructure
OpenAI
San Francisco, California, United States (On-site) $255k – $405k Yearly
2w
Forward-Deployed Engineer - API Platform | London, NYC, Seattle, SF
Perplexity
New York, New York, United States (On-site) $205k – $335k Yearly
2w
System Software Architecture Researcher - PhD Program
NVIDIA
Roskilde, Region Zealand, Denmark (On-site)
2w
AI Inference Engineer (San Francisco)
Perplexity
San Francisco, California, United States (On-site) $210k – $385k Yearly
1w
Software Engineer - Model API's
Baseten
San Francisco, California, United States (On-site) $150k – $230k Yearly
5d
Principal Engineer, Inference
CoreWeave
Sunnyvale, California, United States (Hybrid) $206k – $303k Yearly
3d
Senior Systems Software Engineer – Cloud Networking
NVIDIA
Santa Clara, California, United States (On-site) $184k – $287.5k Yearly
2w
Senior Software Engineer – TensorRT Edge-LLM
NVIDIA
Santa Clara, California, United States (Hybrid) $152k – $287.5k Yearly