Inference Runtimes jobs

Explore Inference Runtimes roles on Inference Jobs and apply today.

181-200 of 240 jobs

Posted 4d ago

Software Engineering Intern - Kernels

d-Matrix

Ontario, Canada (Remote)

C$40 – C$70 Hourly

Posted 3d ago

Senior Software Engineer - Deep Learning Compiler Verification and Infrastructure

NVIDIA

Santa Clara, California, United States (On-site)

$140k – $224.3k Yearly

Posted 2w ago

Deep Learning Performance Architect - Intern - 2026

NVIDIA

Shanghai, Shanghai, China (On-site)

Posted 1w ago

Raytracing Compiler Engineer - Developer and Performance Technology

NVIDIA

Santa Clara, California, United States (On-site)

$184k – $356.5k Yearly

Posted 3w ago

Senior Software Engineer—Kernels

d-Matrix

Bengaluru, Karnataka, India (Hybrid)

Posted 1w ago

Research Engineer, Infrastructure, Kernels

Thinking Machines Lab

San Francisco, California, United States (On-site)

$350k – $475k Yearly

Posted 5d ago

Senior Principal Performance Engineering

Graphcore

Austin, Texas, United States (Hybrid)

Posted 2w ago

Software Engineer, Senior Staff - Kernels

d-Matrix

Santa Clara, California, United States (Hybrid)

$180k – $300k Yearly

Posted 3w ago

Performance Architect, AI HW

Tenstorrent

Toronto, Ontario, Canada (Hybrid)

$100k – $500k Yearly

Posted 5d ago

Senior Software Engineer - Infrastructure

Baseten

San Francisco, California, United States (On-site)

$150k – $230k Yearly

Posted 1w ago

Research Engineer, Infrastructure, Numerics

Thinking Machines Lab

San Francisco, California, United States (On-site)

$350k – $475k Yearly

Posted 5d ago

Senior AI Compiler Engineer, MLIR

NVIDIA

Santa Clara, California, United States (On-site)

$152k – $241.5k Yearly

Posted 3d ago

Senior Engineer - Deep Learning Compiler Verification and Infrastructure

NVIDIA

Santa Clara, California, United States (On-site)

$140k – $224.3k Yearly

Posted 2w ago

Senior ML Engineer (Token Factory)

Nebius

Europe + 6 more (Remote)

Posted 2w ago

Senior Full Stack LLM Engineer - Training

Cerebras

Sunnyvale, California, United States (On-site)

Posted 2w ago

Senior Software Engineer - ML Infrastructure

Applied Intuition

Sunnyvale, California, United States (On-site)

$153k – $222k Yearly

Posted 3d ago

Senior Deep Learning Compiler Engineer - XLA

NVIDIA

Santa Clara, California, United States (On-site)

$152k – $241.5k Yearly

Posted 2w ago

Senior Hypervisor and RTOS Engineer - Performance

NVIDIA

Santa Clara, California, United States (On-site)

$184k – $356.5k Yearly

Posted 1w ago

Staff Engineer - Perf and Benchmarking

CoreWeave

Sunnyvale, California, United States (Hybrid)

$188k – $275k Yearly

Posted 1w ago

Engineering Manager - ML Platform and Infrastructure

Applied Intuition

Sunnyvale, California, United States (On-site)

$204k – $343k Yearly