
Cluster Reliability Jobs

Browse 219 Cluster Reliability jobs on Inference Jobs.

141-160 of 219 jobs

2w

TPM Manager, Compute & Infrastructure

Anthropic

San Francisco, California, United States (Hybrid) $435k – $565k Yearly
5d

Staff Software Engineer, ML Infrastructure

Decagon

San Francisco, California, United States (On-site) $300k – $430k Yearly
1w

Infrastructure Engineer, Security

Thinking Machines Lab

San Francisco, California, United States (On-site) $200k – $475k Yearly
3w

Deployment Engineer, AI Inference

Cerebras

Sunnyvale, California, United States (On-site)
22h

Senior Staff Engineer - Telemetry

Graphcore

Gdańsk, Pomeranian Voivodeship, Poland (On-site) zł 350.7k – zł 474.4k Yearly
1w

System Software Engineer (Embedded)

Cerebras

Sunnyvale, California, United States (On-site) $175k – $275k Yearly
3d

Senior Software Engineer - Deep Learning Compiler Verification and Infrastructure

NVIDIA

Santa Clara, California, United States (On-site) $140k – $224.3k Yearly
1w

Research Engineer, Frontier Speculative Decoding

Together AI

San Francisco, California, United States (On-site) $190k – $270k Yearly
2w

Technical Lead, Host Assurance

OpenAI

San Francisco, California, United States (Hybrid) $347k – $385k Yearly
1w

Developer Advocate - AI cloud

Nebius

United States (Remote) $220k – $300k Yearly
3w

Data Center IT Technician

Nebius

Béthune, Pas-de-Calais, France (On-site)
3w

Senior Hardware Support Engineer

Nebius

United States (Remote) $125k – $180k Yearly
6d

Director, NSV Automation

NVIDIA

Yokne'am, Northern District, Israel (Hybrid)
1w

Reliability Test Engineer, Electrical (All Levels)

Figure

San Jose, California, United States (On-site) $120k – $250k Yearly
2w

Software Engineer, Distributed Systems

Replit

Foster City, California, United States (Hybrid) $130k – $290k Yearly
2w

Technical Program Manager - Dallas

Lambda

Dallas, Texas, United States (On-site) $214k – $321k Yearly
4w

Infrastructure Engineer - US Government

xAI

Palo Alto, California, United States (On-site) $180k – $440k Yearly
2w

Senior Software Research Architect, AI Networking

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-site)
1w

Software Engineer, Observability

CoreWeave

Livingston, New Jersey, United States (Hybrid) $109k – $145k Yearly