1. Home
  2. Jobs
  3. LLM Inference

LLM Inference jobs

Explore LLM Inference roles on Inference Jobs and apply today.

101-120 of 424 jobs

OP2w

Software Engineer, Model Inference

OpenAI

San Francisco, California, United States (On-site)

$325k – $490k Yearly

OP2w

Backend Software Engineer (Evals) – Support Automation Engineering

OpenAI

San Francisco, California, United States (On-site)

$255k – $405k Yearly

NE1w

Senior Site Reliability Engineer — Token Factory (Inference Platform)

Nebius

Netherlands + 4 more (Remote)

PL2w

Research Engineer - Posttraining

Periodic Labs

Menlo Park, California, United States (On-site)

SC2w

AI Research Engineer, Enterprise Evaluations

Scale

San Francisco, California, United States (On-site)

$179.4k – $224.3k Yearly

AN1w

Research Engineer, Pretraining Scaling

Anthropic

San Francisco, California, United States (On-site)

$315k – $560k Yearly

CE3w

Inference Compiler and Frontend Engineer – Dubai

Cerebras

Dubai, Dubai, United Arab Emirates (On-site)

NV2w

Senior Capability Development Engineer

NVIDIA

Shenzhen Shi, Guangdong, China (On-site)

PO4w

Member of Engineering (Pre-training / Synthetic Data)

Poolside

United States or Remote (United States + 2 more)

AN6d

Machine Learning Systems Engineer, RL Engineering

Anthropic

San Francisco, California, United States (Hybrid)

$300k – $405k Yearly

NV2w

Senior Software Engineer, AI Inference Systems

NVIDIA

Santa Clara, California, United States (Hybrid)

$184k – $356.5k Yearly

RA2w

Member of Technical Staff - Evaluations

Reflection AI

San Francisco, California, United States (On-site)

PE2w

Data Scientist, Evals

Perplexity

London, England, United Kingdom (On-site)

$210k – $385k Yearly

NV2w

AI Safety Scientist, Deep Learning

NVIDIA

Ho Chi Minh City, Ho Chi Minh City, Vietnam (On-site)

NV2w

Senior Software Engineer - Inference as a Service

NVIDIA

Santa Clara, California, United States (On-site)

$200k – $391k Yearly

XA1w

Member of Technical Staff - Reasoning Efficiency

xAI

Palo Alto, California, United States (On-site)

$180k – $440k Yearly

CO2w

Member of Technical Staff, Post-Training

Cohere

London, England, United Kingdom (Hybrid)

AN1w

Research Scientist, Interpretability

Anthropic

San Francisco, California, United States (Hybrid)

$315k – $560k Yearly

OP1w

Inference Runtime, Engineering Manager

OpenAI

San Francisco, California, United States (On-site)

$455k – $555k Yearly

NV2w

High-Performance LLM Training Engineer - New College Grad 2026

NVIDIA

Santa Clara, California, United States (On-site)

$124k – $195.5k Yearly