1. Home
  2. Jobs
  3. Latency

Latency Jobs

Browse 13 Latency jobs on Inference Jobs.

13 jobs

1wDE

Senior Software Engineer, Voice Agent

Decagon

San Francisco, California, United States (On-site)$250k – $330k Yearly
5dOP

Inference Runtime, Engineering Manager

OpenAI

San Francisco, California, United States (On-site)$455k – $555k Yearly
2wMA

Software Engineer, Technical Lead, Inference

Mistral AI

Île de Ré, Charente-Maritime, France (Hybrid)
3dOP

Software Engineer, ChatGPT Infrastructure

OpenAI

San Francisco, California, United States (On-site)$255k – $405k Yearly
2wPE

Forward-Deployed Engineer - API Platform | London, NYC, Seattle, SF

Perplexity

New York, New York, United States (On-site)$205k – $335k Yearly
2wPL

LLM Inference Engineer

Periodic Labs

Menlo Park, California, United States (On-site)
2wPE

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site)$210k – $385k Yearly
1wBA

Software Engineer - Model API's

Baseten

San Francisco, California, United States (On-site)$150k – $230k Yearly
5dCO

Principal Engineer, Inference

CoreWeave

Sunnyvale, California, United States (Hybrid)$206k – $303k Yearly
3dNV

Senior Systems Software Engineer – Cloud Networking

NVIDIA

Santa Clara, California, United States (On-site)$184k – $287.5k Yearly
2wNV

Senior Software Engineer – TensorRT Edge-LLM

NVIDIA

Santa Clara, California, United States (Hybrid)$152k – $287.5k Yearly