Cluster Reliability Jobs
Browse 219 Cluster Reliability jobs on Inference Jobs.
21-40 of 219 jobs
Posted 1w ago
Manager, Next-Generation AI Cluster Architecture
NVIDIA
Santa Clara, California, United States (On-site)
$224k – $356.5k Yearly
Posted 5d ago
Site Reliability Engineer, Managed AI
Crusoe
San Francisco, California, United States (On-site)
$204k – $247k Yearly
Posted 1w ago
Site Reliability Engineer
Together AI
San Francisco, California, United States (On-site)
$150k – $200k Yearly
Posted 19h ago
Staff Software Engineer, Cluster Orchestration
CoreWeave
Bellevue, Washington, United States (Hybrid)
$185k – $275k Yearly
Posted 2w ago
Senior Networking Solution Test Engineer, AI Cluster Debugging
NVIDIA
Yokne'am, Northern District, Israel (Hybrid)
Posted 2w ago
Software Engineer, AI Reliability
Anthropic
San Francisco, California, United States (Hybrid)
$325k – $485k Yearly
Posted 1w ago
Reliability Engineer (All Levels)
Figure
San Jose, California, United States (On-site)
$120k – $250k Yearly
Posted 3w ago
Senior Site Reliability Engineer, Managed AI
Crusoe
San Francisco, California, United States (On-site)
$172k – $209k Yearly
Posted 1w ago
Research Engineer, Infrastructure, RL Systems
Thinking Machines Lab
San Francisco, California, United States (On-site)
$350k – $475k Yearly