GPU Performance Engineering jobs

Explore GPU Performance Engineering roles on Inference Jobs and apply today.

41-60 of 1,042 jobs

Posted 2w ago

Senior ML Engineer (Token Factory)

Nebius

Amsterdam, North Holland, Netherlands (On-site)

Posted 1w ago

CoDesign & NextGen - New College Grad

Cerebras

Sunnyvale, California, United States (On-site)

$145k – $155k Yearly

Posted 4w ago

Deep Learning Algorithm Engineer - New College Grad 2026

NVIDIA

Santa Clara, California, United States (On-site)

$124k – $241.5k Yearly

Posted 1w ago

TPU Kernel Engineer

Anthropic

San Francisco, California, United States (Hybrid)

$280k – $560k Yearly

Posted 2w ago

Senior High-Performance AI Training Engineer

NVIDIA

Santa Clara, California, United States (On-site)

$184k – $356.5k Yearly

Posted 4w ago

AI Developer Technology Engineer

NVIDIA

München, Bavaria, Germany (On-site)

Posted 2w ago

Staff Research Engineer, Model Efficiency

Cohere

New York, New York, United States (Hybrid)

Posted 5d ago

Senior Performance Compiler Engineer - Triton

NVIDIA

Redmond, Washington, United States (On-site)

$184k – $287.5k Yearly

Posted 2w ago

ML Runtime Optimization Engineer - Lead

Applied Intuition

Sunnyvale, California, United States (On-site)

$199.3k – $264.5k Yearly

Posted 4w ago

Manager, Software Engineering - Profiling Services

NVIDIA

Santa Clara, California, United States (On-site)

$224k – $356.5k Yearly

Posted 5d ago

Principal Software Engineer - AI Inference

NVIDIA

Santa Clara, California, United States (On-site)

$272k – $431.3k Yearly

Posted 1w ago

Research Engineer, Core ML

Together AI

San Francisco, California, United States (On-site)

$200k – $280k Yearly

Posted 5d ago

Senior AI Inference Compiler Engineer

NVIDIA

Santa Clara, California, United States (On-site)

$152k – $241.5k Yearly

Posted 2w ago

LLM Inference Engineer

Periodic Labs

Menlo Park, California, United States (On-site)

Posted 3w ago

GPU PCIe and Boot Architect - New College Grad 2026

NVIDIA

Santa Clara, California, United States (Hybrid)

$108k – $195.5k Yearly

Posted 2w ago

Member of Technical Staff, Model Efficiency

Cohere

New York, New York, United States or Remote (New York, United States + 3 more)

Posted 1w ago

Inference Runtime, Engineering Manager

OpenAI

San Francisco, California, United States (On-site)

$455k – $555k Yearly

Posted 1w ago

GPU Systems Engineer – HPC / Parallel Computing

Vast.ai

San Francisco, California, United States (On-site)

$160k – $320k Yearly

Posted 3w ago

Platform Architecture Engineer, GeForce NOW

NVIDIA

Santa Clara, California, United States (On-site)

$184k – $287.5k Yearly

Posted 2w ago

Senior GPU Floorplan Design Engineer

NVIDIA

Bengaluru, Karnataka, India (Hybrid)