Fault Isolation Jobs
Browse 25 Fault Isolation jobs on Inference Jobs.
Posted 4d ago
Software Engineer, ChatGPT Infrastructure
OpenAI
San Francisco, California, United States (On-site)
$255k – $405k Yearly
Posted 3d ago
Senior Software Engineer - New Products
Baseten
San Francisco, California, United States (On-site)
$185k – $285k Yearly
Posted 6d ago
C++ Software Engineer — Systems
Vast.ai
San Francisco, California, United States (On-site)
$120k – $180k Yearly
Posted 2w ago
Senior Software Engineer, AI Resiliency
NVIDIA
Redmond, Washington, United States (On-site)
$184k – $287.5k Yearly
Posted 2w ago
Senior Software Engineer, Observability
Together AI
San Francisco, California, United States (Hybrid)
$160k – $260k Yearly
Posted 2w ago
Software Engineer, Platform Systems
OpenAI
San Francisco, California, United States (On-site)
$310k – $460k Yearly
Posted 1w ago
Training: ML Framework Engineer
OpenAI
San Francisco, California, United States (Hybrid)
$245k – $385k Yearly
Posted 1w ago
Reliability/DFX Engineer
OpenAI
San Francisco, California, United States (On-site)
$285k – $460k Yearly
Posted 3d ago
Senior Resiliency and Safety Architect, GPU Workloads and Failure Analysis
NVIDIA
Santa Clara, California, United States (On-site)
$184k – $356.5k Yearly
Posted 5d ago
Principal Datacenter Resiliency Architect, RAS Features and Modeling
NVIDIA
Santa Clara, California, United States (On-site)
$272k – $431.3k Yearly
Posted 3d ago
Distinguished Resiliency and Safety Architect, GPU Diagnostics
NVIDIA
Santa Clara, California, United States (On-site)
$320k – $488.8k Yearly
Posted 16h ago
Systems Quality and Reliability Lead - LPU
NVIDIA
Santa Clara, California, United States (On-site)
$168k – $310.5k Yearly