1. Home
  2. Jobs
  3. Cluster Reliability

Cluster Reliability Jobs

Browse 219 Cluster Reliability jobs on Inference Jobs.

21-40 of 219 jobs

1wNV

Manager, Next-Generation AI Cluster Architecture

NVIDIA

Santa Clara, California, United States (On-site)$224k – $356.5k Yearly
3wNV

Senior HPC and AI Cluster Administrator

NVIDIA

Yokneam Ilit, Northern District, Israel (Hybrid)
5dCR

Site Reliability Engineer, Managed AI

Crusoe

San Francisco, California, United States (On-site)$204k – $247k Yearly
1wTA

Site Reliability Engineer

Together AI

San Francisco, California, United States (On-site)$150k – $200k Yearly
2wMA

Site Reliability Engineer

Mistral AI

Europe + 7 more (Remote)
19hCO

Staff Software Engineer, Cluster Orchestration

CoreWeave

Bellevue, Washington, United States (Hybrid)$185k – $275k Yearly
2wCR

Site Reliability Engineer

Crusoe

Dublin, Dublin, Ireland (On-site)
2wAN

Software Engineer, AI Reliability

Anthropic

San Francisco, California, United States (Hybrid)$325k – $485k Yearly
2wRE

Site Reliability Engineer

Replit

Foster City, California, United States (Hybrid)$160k – $250k Yearly
1wFI

Reliability Engineer (All Levels)

Figure

San Jose, California, United States (On-site)$120k – $250k Yearly
3wCR

Senior Site Reliability Engineer, Managed AI

Crusoe

San Francisco, California, United States (On-site)$172k – $209k Yearly
1wTM

Research Engineer, Infrastructure, RL Systems

Thinking Machines Lab

San Francisco, California, United States (On-site)$350k – $475k Yearly