Operational Resilience Jobs
Browse 9 Operational Resilience jobs on Inference Jobs.
9 jobs
4dNV
Senior Resiliency and Safety Architect, GPU Workloads and Failure Analysis
NVIDIA
Santa Clara, California, United States (On-site)$184k – $356.5k Yearly
5dNV
Principal Datacenter Resiliency Architect, RAS Features and Modeling
NVIDIA
Santa Clara, California, United States (On-site)$272k – $431.3k Yearly
2wNV
Senior Software Engineer, AI Resiliency
NVIDIA
Redmond, Washington, United States (On-site)$184k – $287.5k Yearly
6dCO
Operations Engineering Manager, Fleet Reliability
CoreWeave
Dublin, Dublin, Ireland (Hybrid)€97k – €130k Yearly
3wCO
Operations Engineering Manager, Fleet Reliability
CoreWeave
Livingston, New Jersey, United States (Hybrid)$143k – $210k Yearly
2dQD
Senior SRE Engineer - Cloud Operations (Remote, Americas Only)
Qdrant
Berlin, Berlin, Germany or Remote (Americas)
3wCR
Manager, Field Operations - Spark
Crusoe
Denver, Colorado, United States (On-site)$140.3k – $170k Yearly
2wCR
Senior+ Site Reliability Engineer
Crusoe
San Francisco, California, United States (On-site)$172k – $209k Yearly