
Fault Detection Jobs

Browse 82 Fault Detection jobs on Inference Jobs.

41-60 of 82 jobs

4d

Senior Resiliency and Safety Architect, GPU Workloads and Failure Analysis

NVIDIA

Santa Clara, California, United States (On-site) | $184k – $356.5k Yearly

1w

Staff Hardware Diagnostics Engineer

Cerebras

Sunnyvale, California, United States (On-site) | $150k – $260k Yearly

1w

Reliability Engineer (All Levels)

Figure

San Jose, California, United States (On-site) | $120k – $250k Yearly

6d

Staff/Sr. Staff Engineer, Diagnostic Development

Tenstorrent

Toronto, Ontario, Canada (Hybrid) | $100k – $500k Yearly

2w

Senior+ Site Reliability Engineer

Crusoe

San Francisco, California, United States (On-site) | $172k – $209k Yearly

6d

Engineering Manager, Kernel Reliability

Cerebras

Sunnyvale, California, United States (On-site)

2w

Senior Silicon Reliability Engineer

NVIDIA

Santa Clara, California, United States (On-site) | $168k – $264.5k Yearly

2d

Systems Quality and Reliability Lead - LPU

NVIDIA

Santa Clara, California, United States (On-site) | $168k – $310.5k Yearly

5d

Principal Datacenter Resiliency Architect, RAS Features and Modeling

NVIDIA

Santa Clara, California, United States (On-site) | $272k – $431.3k Yearly

2w

Senior System Software Engineer, Data Center Diagnostics

NVIDIA

Santa Clara, California, United States (On-site) | $152k – $287.5k Yearly

4d

Distinguished Resiliency and Safety Architect, GPU Diagnostics

NVIDIA

Santa Clara, California, United States (On-site) | $320k – $488.8k Yearly

3w

Senior Site Reliability Engineer

Heidi

Sydney, New South Wales, Australia (Hybrid)

3w

Member of Technical Staff

xAI

Palo Alto, California, United States (On-site)

3w

Incident Manager

Nebius

Amsterdam, North Holland, Netherlands (On-site)

2w

Site Reliability Engineer

Crusoe

Dublin, Dublin, Ireland (On-site)

2w

Product Engineer - RMA and Rework Lead

NVIDIA

Mexico City, Mexico City, Mexico or Remote (Mexico)