1. Home
  2. Jobs
  3. Inference Workloads

Inference Workloads Jobs

Browse 244 Inference Workloads jobs on Inference Jobs.

244 jobs

6dNE

ML/AI Engineer

Nebius

Amsterdam, North Holland, Netherlands (On-site)
3wNV

Platform Architecture Engineer, GeForce NOW

NVIDIA

Santa Clara, California, United States (On-site)$184k – $287.5k Yearly
2wNV

Principal Software Engineer - Inference as a Service

NVIDIA

Santa Clara, California, United States (On-site)$248k – $391k Yearly
2wPE

Inference Engineering Manager

Perplexity

San Francisco, California, United States (On-site)$300k – $385k Yearly
1wOP

Software Engineer, Load Balancing - Inference

OpenAI

San Francisco, California, United States (On-site)$325k – $490k Yearly
6dCR

Group Product Manager, Bare Metal as a Service (Seattle)

Crusoe

Seattle, Washington, United States (On-site)$210.4k – $255k Yearly
3wXA

Software Engineer - Applied Inference

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
2wNV

Senior Software Engineer - Inference as a Service

NVIDIA

Santa Clara, California, United States (On-site)$200k – $391k Yearly
5dAN

Staff Software Engineer, Inference

Anthropic

Dublin, County Dublin, Ireland (Hybrid)€295k – €355k Yearly
5dAN

Senior/Staff Software Engineer, Inference

Anthropic

New York, New York, United States (Hybrid)$300k – $485k Yearly
6dOP

Inference Runtime, Engineering Manager

OpenAI

San Francisco, California, United States (On-site)$455k – $555k Yearly
3wCE

Inference Frontend

Cerebras

Sunnyvale, California, United States (On-site)
3dNV

Senior Compiler Engineer - AI

NVIDIA

Santa Clara, California, United States (On-site)$184k – $287.5k Yearly
6dAN

Senior Software Engineer, Inference

Anthropic

Dublin, Dublin, Ireland (Hybrid)€235k – €295k Yearly
3wAN

Software Engineer, Inference Deployment

Anthropic

San Francisco, California, United States (Hybrid)$320k – $485k Yearly
2wPE

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site)$210k – $385k Yearly
2wOP

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid)$380k – $380k Yearly
6dTA

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site)$160k – $230k Yearly