Inference Services Jobs

Browse 281 Inference Services jobs on Inference Jobs.

2w

Principal Software Engineer - Inference as a Service

NVIDIA

Santa Clara, California, United States (On-site) $248k – $391k Yearly

5d

Director of Engineering, Inference Services

CoreWeave

Sunnyvale, California, United States (Hybrid) $206k – $303k Yearly

2w

Staff Product Manager, Managed Inference (SF/Sunnyvale/New York)

Crusoe

San Francisco, California, United States or Remote (California, United States + 1 more) $204k – $247k Yearly

2w

Senior Software Engineer - Inference as a Service

NVIDIA

Santa Clara, California, United States (On-site) $200k – $391k Yearly

3w

Software Engineer, Inference Deployment

Anthropic

San Francisco, California, United States (Hybrid) $320k – $485k Yearly

3d

Group Product Manager, Bare Metal as a Service (SF, Sunnyvale, New York)

Crusoe

California, United States + 1 more (Remote) $237.6k – $288k Yearly

2w

Inference Engineering Manager

Perplexity

San Francisco, California, United States (On-site) $300k – $385k Yearly

5d

Staff Software Engineer, Inference

Anthropic

Dublin, County Dublin, Ireland (Hybrid) €295k – €355k Yearly

5d

Inference Runtime, Engineering Manager

OpenAI

San Francisco, California, United States (On-site) $455k – $555k Yearly

1w

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid) $380k – $380k Yearly

1w

LLM Inference Engineer

Hippocratic AI

Palo Alto, California, United States (On-site)

5d

Group Product Manager, Bare Metal as a Service (Seattle)

Crusoe

Seattle, Washington, United States (On-site) $210.4k – $255k Yearly

2w

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site) $210k – $385k Yearly

2w

LLM Inference Engineer

Periodic Labs

Menlo Park, California, United States (On-site)

2w

Inference Compiler and Frontend Engineer – Dubai

Cerebras

Dubai, Dubai, United Arab Emirates (On-site)

3w

Inference Frontend

Cerebras

Sunnyvale, California, United States (On-site)

2w

AI Inference Engineer (London)

Perplexity

London, England, United Kingdom (On-site)

1w

Software Engineer, Load Balancing - Inference

OpenAI

San Francisco, California, United States (On-site) $325k – $490k Yearly

2w