1. Home
  2. Jobs
  3. Inference Routing

Inference Routing Jobs

Browse 206 Inference Routing jobs on Inference Jobs.

41-60 of 206 jobs
4w agoOP

Software Engineer, Caching Infrastructure

OpenAI

San Francisco, California, United States (On-site)$255k – $405k Yearly
2w agoNE

Senior Tech Recruiter (ML,LLM)

Nebius

Amsterdam, North Holland, Netherlands (Hybrid)
2w agoTA

Engineering Manager, Model Serving

Together AI

San Francisco, California, United States (On-site)$250k – $300k Yearly
3w agoNV

Senior Compiler Engineer, AI Inference Performance

NVIDIA

Santa Clara, California, United States (On-site)$152k – $241.5k Yearly
2w agoCE

Applied Machine Learning Research Scientist

Cerebras

Sunnyvale, California, United States (On-site)
3w agoNV

Senior Compiler Engineer, AI Inference Platforms

NVIDIA

Santa Clara, California, United States (On-site)$152k – $241.5k Yearly
4w agoOP

Research Engineer / Research Scientist - Foundations Retrieval Lead

OpenAI

San Francisco, California, United States (Hybrid)$460k – $555k Yearly
3w agoNV

Senior Software Engineer, AI Inference Systems

NVIDIA

Toronto, Ontario, Canada (Hybrid)C$170k – C$275k Yearly
7d agoAC

Infrastructure Engineer, ML Systems

Applied Compute

San Francisco, California, United States (On-site)
3w agoNV

Senior Machine Learning Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
4w agoTM

Research, Audio Expertise

Thinking Machines Lab

San Francisco, California, United States (On-site)$350k – $475k Yearly
1d agoXA

Member of Technical Staff, RL Training Framework

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
4w agoAN

Research Compute Operations

Anthropic

San Francisco, California, United States (Hybrid)$270k – $290k Yearly
4w agoTA

Research Engineer, Core ML

Together AI

San Francisco, California, United States (On-site)$200k – $280k Yearly
3w agoNV

Senior Systems Software Engineer - Deep Learning Solutions

NVIDIA

Toronto, Ontario, Canada (On-site)C$225k – C$275k Yearly
7d agoD-

Principal Architect, Performance Analysis and Modeling

d-Matrix

Santa Clara, California, United States (Hybrid)$190k – $280k Yearly
4w agoAN

Research Engineer, Discovery

Anthropic

San Francisco, California, United States (Hybrid)$340k – $425k Yearly