1. Home
  2. Jobs
  3. RAS (Reliability, Availability, Serviceability)

RAS (Reliability, Availability, Serviceability) Jobs

Browse 59 RAS (Reliability, Availability, Serviceability) jobs on Inference Jobs.

59 jobs

4dNV

Principal Datacenter Resiliency Architect, RAS Features and Modeling

NVIDIA

Santa Clara, California, United States (On-site)$272k – $431.3k Yearly
2wNV
3dNV

Senior Reliability Engineer - LPU Packaging

NVIDIA

Santa Clara, California, United States (On-site)$168k – $310.5k Yearly
2wAN

Software Engineer, AI Reliability

Anthropic

San Francisco, California, United States (Hybrid)$325k – $485k Yearly
1wOP

Reliability/DFX Engineer

OpenAI

San Francisco, California, United States (On-site)$285k – $460k Yearly
3dCE

Performance Reliability Engineer

Cerebras

Sunnyvale, California, United States (On-site)
2wNV

Manager, Software TPM - Server Firmware and System Software

NVIDIA

Santa Clara, California, United States (On-site)$200k – $379.5k Yearly
2wCR

Staff+ Software Engineer - Cloud Availability Platform Engineering (CAPE)

Crusoe

San Francisco, California, United States (On-site)$209k – $253k Yearly
2dCR

Site Reliability Engineer, Managed AI

Crusoe

San Francisco, California, United States (On-site)$204k – $247k Yearly
2wNV

Senior Silicon Reliability Engineer

NVIDIA

Santa Clara, California, United States (On-site)$168k – $264.5k Yearly
2wOP

Site Reliability

OpenEvidence

San Francisco, California, United States (On-site)
3wXA

Software Engineer - Reliability

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
5dCO

Operations Engineering Manager, Fleet Reliability

CoreWeave

Dublin, Dublin, Ireland (Hybrid)€97k – €130k Yearly
3wCR

Senior Site Reliability Engineer, Managed AI

Crusoe

San Francisco, California, United States (On-site)$172k – $209k Yearly
1wSI

Software Engineer, Site Reliability (SRE)

Sierra

San Francisco, California, United States (On-site)$230k – $390k Yearly
4wOP

Engineering Manager, Identity Infrastructure

OpenAI

San Francisco, California, United States (Hybrid)$405k – $490k Yearly
2wNV

Reliability Test Manager

NVIDIA

Santa Clara, California, United States (On-site)$184k – $287.5k Yearly
2wOP

Engineering Manager, Core Services

OpenAI

San Francisco, California, United States (On-site)$293k – $385k Yearly