Inference Systems jobs

Explore Inference Systems roles on Inference Jobs and apply today.

101-120 of 120 jobs

Posted 2w ago

Engineering Manager - Forward Deployed Engineering (LLM)

Baseten

San Francisco, California, United States (On-site)

$220k – $285k Yearly

Posted 2w ago

Software Engineering Intern, Developer and Qualification Tools

d-Matrix

Santa Clara, California, United States (Hybrid)

$30 – $59 Hourly

Posted 1w ago

Lead Tech Recruiter (ML, LLM)

Nebius

United States (Remote)

Posted 2w ago

Staff Software Engineer

Hippocratic AI

Palo Alto, California, United States (On-site)

Posted 2w ago

Senior ML Engineer (Token Factory)

Nebius

Amsterdam, North Holland, Netherlands (On-site)

Posted 4d ago

Staff Software Engineer, ML Infrastructure

Decagon

San Francisco, California, United States (On-site)

$300k – $430k Yearly

Posted 5d ago

Software Engineer, TensorRT Specialized Platforms - New College Grad 2025

NVIDIA

Santa Clara, California, United States (On-site)

$124k – $195.5k Yearly

Posted 5d ago

Senior Compiler Engineer, AI Inference Performance

NVIDIA

Santa Clara, California, United States (On-site)

$152k – $241.5k Yearly

Posted 4w ago

Member of Technical Staff, Model Evaluation

xAI

Palo Alto, California, United States (On-site)

$180k – $440k Yearly

Posted 1w ago

TPU Kernel Engineer

Anthropic

San Francisco, California, United States (Hybrid)

$280k – $560k Yearly

Posted 4w ago

Technical Marketing Engineer, World Models - AV Physical AI

NVIDIA

Santa Clara, California, United States (On-site)

$148k – $287.5k Yearly

Posted 2w ago

Senior Deep Learning Engineer

NVIDIA

Warszawa, Masovian Voivodeship, Poland (Hybrid)

zł 292.5k – zł 507k Yearly

Posted 2w ago

Research Engineer / Research Scientist - Foundations Retrieval Lead

OpenAI

San Francisco, California, United States (Hybrid)

$460k – $555k Yearly

Posted 2w ago

Senior Machine Learning Applications and Compiler Engineer

NVIDIA

Toronto, Ontario, Canada (Hybrid)

C$135k – C$220k Yearly

Posted 3w ago

Platform Engineer Intern

Cartesia

San Francisco, California, United States (On-site)

$8k Monthly

Posted 5d ago

Senior Machine Learning Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)

$152k – $287.5k Yearly

Posted 2w ago

Senior Staff Machine Learning Engineer - Frameworks

d-Matrix

Santa Clara, California, United States (Hybrid)

$155k – $250k Yearly

Posted 2w ago

Member of Technical Staff - ML Performance

Modal

New York, New York, United States (On-site)

$150k – $270k Yearly

Posted 2w ago

Machine Learning Engineer - Defense

Applied Intuition

Ann Arbor, Michigan, United States (On-site)

$130k – $200k Yearly

Posted 2w ago

Senior Software Engineer – TensorRT Edge-LLM

NVIDIA

Santa Clara, California, United States (Hybrid)

$152k – $287.5k Yearly