
Fault Isolation Jobs

Browse 25 Fault Isolation jobs on Inference Jobs.

Posted 4d ago

Software Engineer, ChatGPT Infrastructure

OpenAI

San Francisco, California, United States (On-site) $255k – $405k Yearly

Posted 3d ago

Senior Software Engineer - New Products

Baseten

San Francisco, California, United States (On-site) $185k – $285k Yearly

Posted 6d ago

Senior DFT ATPG Engineer

NVIDIA

Yokneam Ilit, Northern District, Israel (On-site)

Posted 6d ago

C++ Software Engineer — Systems

Vast.ai

San Francisco, California, United States (On-site) $120k – $180k Yearly

Posted 2w ago

Senior Software Engineer, AI Resiliency

NVIDIA

Redmond, Washington, United States (On-site) $184k – $287.5k Yearly

Posted 3w ago

DFT ATPG Engineer

NVIDIA

Yokne'am, Northern District, Israel (On-site)

Posted 2w ago

Senior Software Engineer, Observability

Together AI

San Francisco, California, United States (Hybrid) $160k – $260k Yearly

Posted 2w ago

Software Engineer, Platform Systems

OpenAI

San Francisco, California, United States (On-site) $310k – $460k Yearly

Posted 1w ago

Training: ML Framework Engineer

OpenAI

San Francisco, California, United States (Hybrid) $245k – $385k Yearly

Posted 1w ago

Reliability/DFX Engineer

OpenAI

San Francisco, California, United States (On-site) $285k – $460k Yearly

Posted 3w ago

Staff Diagnostics Engineer

Figure

San Jose, California, United States (On-site) $150k – $250k Yearly

Posted 3d ago

Senior Resiliency and Safety Architect, GPU Workloads and Failure Analysis

NVIDIA

Santa Clara, California, United States (On-site) $184k – $356.5k Yearly

Posted 5d ago

Principal Datacenter Resiliency Architect, RAS Features and Modeling

NVIDIA

Santa Clara, California, United States (On-site) $272k – $431.3k Yearly

Posted 3d ago

Distinguished Resiliency and Safety Architect, GPU Diagnostics

NVIDIA

Santa Clara, California, United States (On-site) $320k – $488.8k Yearly

Posted 16h ago

Systems Quality and Reliability Lead - LPU

NVIDIA

Santa Clara, California, United States (On-site) $168k – $310.5k Yearly