
Real-time Inference Jobs

Explore Real-time Inference roles on Inference Jobs and apply today.

3mo ago

Audio Inference Engineer, Model Efficiency

Cohere

New York, United States or Remote (New York, United States + 3 more)

4w ago

TL, Research Inference

OpenAI

San Francisco, California, United States (On-site)
$380K – $555K Yearly

3mo ago

Inference Engineer

Cartesia

San Francisco, California, United States (On-site)
$180K – $250K Yearly

3mo ago

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site)
$210K – $385K Yearly

3mo ago

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid)
$380K – $380K Yearly

2mo ago

Sr. MTS - Inference ML Eng

Cerebras

Sunnyvale, California, United States (On-site)

3mo ago

Software Engineer, Inference – AMD GPU Enablement

OpenAI

San Francisco, California, United States (On-site)
$325K – $490K Yearly

4w ago

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site)
$160K – $230K Yearly

1mo ago

Senior DL Algorithms Engineer - Inference Performance

NVIDIA

Santa Clara, California, United States (On-site)
$184K – $356.5K Yearly

2mo ago

Principal Software Engineer - AI Inference

NVIDIA

Santa Clara, California, United States (On-site)
$272K – $431.3K Yearly

3mo ago

Software Engineer, Model Inference

OpenAI

San Francisco, California, United States (On-site)
$325K – $490K Yearly