RAS (Reliability, Availability, Serviceability) Jobs
Browse 59 RAS (Reliability, Availability, Serviceability) jobs on Inference Jobs.
59 jobs
4dNV
Principal Datacenter Resiliency Architect, RAS Features and Modeling
NVIDIA
Santa Clara, California, United States (On-site)$272k – $431.3k Yearly
3dNV
Senior Reliability Engineer - LPU Packaging
NVIDIA
Santa Clara, California, United States (On-site)$168k – $310.5k Yearly
2wAN
Software Engineer, AI Reliability
Anthropic
San Francisco, California, United States (Hybrid)$325k – $485k Yearly
1wOP
Reliability/DFX Engineer
OpenAI
San Francisco, California, United States (On-site)$285k – $460k Yearly
2wNV
Manager, Software TPM - Server Firmware and System Software
NVIDIA
Santa Clara, California, United States (On-site)$200k – $379.5k Yearly
2wCR
Staff+ Software Engineer - Cloud Availability Platform Engineering (CAPE)
Crusoe
San Francisco, California, United States (On-site)$209k – $253k Yearly
2dCR
Site Reliability Engineer, Managed AI
Crusoe
San Francisco, California, United States (On-site)$204k – $247k Yearly
2wNV
Senior Silicon Reliability Engineer
NVIDIA
Santa Clara, California, United States (On-site)$168k – $264.5k Yearly
3wXA
Software Engineer - Reliability
xAI
Palo Alto, California, United States (On-site)$180k – $440k Yearly
5dCO
Operations Engineering Manager, Fleet Reliability
CoreWeave
Dublin, Dublin, Ireland (Hybrid)€97k – €130k Yearly
3wCR
Senior Site Reliability Engineer, Managed AI
Crusoe
San Francisco, California, United States (On-site)$172k – $209k Yearly
1wSI
Software Engineer, Site Reliability (SRE)
Sierra
San Francisco, California, United States (On-site)$230k – $390k Yearly
4wOP
Engineering Manager, Identity Infrastructure
OpenAI
San Francisco, California, United States (Hybrid)$405k – $490k Yearly
2wNV
Reliability Test Manager
NVIDIA
Santa Clara, California, United States (On-site)$184k – $287.5k Yearly
2wOP
Engineering Manager, Core Services
OpenAI
San Francisco, California, United States (On-site)$293k – $385k Yearly