Inference Serving Jobs

Explore Inference Serving roles on Inference Jobs and apply today.

2mo ago · Together AI

Engineering Manager, Model Serving

Together AI

San Francisco, California, United States (On-site) · $250K – $300K Yearly
1mo ago · Anthropic
1mo ago · Anthropic
1mo ago · Together AI

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site) · $160K – $230K Yearly
2mo ago · NVIDIA

Senior DL Algorithms Engineer - Inference Performance

NVIDIA

Santa Clara, California, United States (On-site) · $184K – $356.5K Yearly
2mo ago · NVIDIA
3mo ago · Cohere

Audio Inference Engineer, Model Efficiency

Cohere

New York, United States or Remote (New York, United States + 3 more)
3w ago · Together AI

Senior Machine Learning Engineer, Voice AI

Together AI

San Francisco, California, United States (On-site) · $200K – $260K Yearly
4w ago · Thinking Machines Lab

Research Engineer, Infrastructure, Inference

Thinking Machines Lab

San Francisco, California, United States (On-site) · $350K – $475K Yearly
4w ago · Etched
3mo ago · OpenAI

Software Engineer, Model Inference

OpenAI

San Francisco, California, United States (On-site) · $325K – $490K Yearly
3mo ago · Cohere

Staff Software Engineer, Inference Infrastructure

Cohere

San Francisco, California, United States or Remote (United States + 2 more)
1w ago · Cerebras
2mo ago · NVIDIA