Production Reliability Jobs
Explore Production Reliability roles on Inference Jobs and apply today.
3w agoCR
2mo agoNV
Distinguished Resiliency and Safety Architect, GPU Diagnostics
NVIDIA
Santa Clara, California, United States (On-site)$320K – $488.8K Yearly
2mo agoCR
Site Operations Hardware Technician
Crusoe
Springfield, Ohio, United States (On-site)$127K – $154K Yearly
3mo agoOP
Software Engineer, GPU Infrastructure - HPC
OpenAI
San Francisco, California, United States (On-site)$255K – $490K Yearly
6d agoCR
Field Acceptance Testing Technician
Crusoe
Brighton, Colorado, United States (On-site)$33 – $36 Hourly
3mo agoMA
3w agoFI
Senior Hardware Failure Analysis Engineer
Figure
San Jose, California, United States (On-site)$120K – $250K Yearly
3mo agoHA
Staff Software Engineer, Developer Experience (DevEx)
Harvey
San Francisco, California, United States (On-site)$238K – $290K Yearly
3mo agoCR
Senior+ Software Engineer - Cloud Availability Platform Engineering (Observability)
Crusoe
San Francisco, California, US$166K – $201K Yearly
5d agoCO
Engineering Manager, Data Infrastructure
CoreWeave
New York, United States (Hybrid)$165K – $242K Yearly
5d agoCO
Software Engineer - Data Infrastructure Services
CoreWeave
Sunnyvale, California, United States (Hybrid)$109K – $160K Yearly
2mo agoNV
Senior Technical Program Manager, Software Compute Platform
NVIDIA
Santa Clara, California, United States (On-site)$200K – $322K Yearly
3mo agoHA
Product Operations Engineer
HappyRobot
San Francisco, California, United States (On-site)$120K – $180K Yearly