LLM Inference jobs

Explore LLM Inference roles on Inference Jobs and apply today.

241-260 of 424 jobs

Posted 1w ago

Machine Learning Research Engineer, GenAI Applied ML

Scale

San Francisco, California, United States (On-site)

$176k – $220k Yearly

Posted 1w ago

Senior ML Framework Performance Engineer - AI for Science at Scale

NVIDIA

Santa Clara, California, United States (On-site)

$184k – $287.5k Yearly

Posted 1w ago

Member of Technical Staff - Reasoning Post-training

xAI

Palo Alto, California, United States (On-site)

$180k – $440k Yearly

Posted 3d ago

Research Engineer / Research Scientist, Pre-training

Anthropic

Zürich, Zürich, Switzerland (Hybrid)

Posted 2w ago

Senior Research Engineer

Decagon

San Francisco, California, United States (On-site)

£200k – £300k Yearly

Posted 2w ago

Data Scientist/Engineer – Online Metrics

Perplexity

London, England, United Kingdom (On-site)

Posted 2w ago

Software Engineer

Cartesia

San Francisco, California, United States (On-site)

$180k – $250k Yearly

Posted 6d ago

Engineering Manager, ML Acceleration

Anthropic

San Francisco, California, United States (Hybrid)

$425k – $560k Yearly

Posted 5d ago

Machine Learning, Platform Engineer

Together AI

San Francisco, California, United States (On-site)

$160k – $250k Yearly

Posted 1w ago

Staff Research Engineer, Discovery Team

Anthropic

San Francisco, California, United States (Hybrid)

$340k – $425k Yearly

Posted 2w ago

Member of Technical Staff - Safety Lead

Reflection AI

San Francisco, California, United States (On-site)

Posted 2w ago

Member of Technical Staff - Alignment Lead

Reflection AI

San Francisco, California, United States (On-site)

Posted 4w ago

Machine Learning Research Engineer, Agent Data Foundation - Enterprise GenAI

Scale

San Francisco, California, United States (On-site)

$252k – $315k Yearly

Posted 4w ago

Software Product Manager - Nemotron

NVIDIA

Santa Clara, California, United States (On-site)

$240k – $379.5k Yearly

Posted 1w ago

GPU Systems Engineer – HPC / Parallel Computing

Vast.ai

San Francisco, California, United States (On-site)

$160k – $320k Yearly

Posted 2w ago

Member of Technical Staff - Pre-Training

Reflection AI

San Francisco, California, United States (On-site)

Posted 5d ago

AI Engineer - Agent Team

FurtherAI

San Francisco, California, United States (On-site)

$150k – $250k Yearly

Posted 1w ago

Machine Learning Research Intern (Summer 2026)

Scale

San Francisco, California, United States (On-site)

Posted 3w ago

Senior Research Scientist, Multi-Modal Language Models

NVIDIA

Santa Clara, California, United States (On-site)

$192k – $356.5k Yearly

Posted 1w ago

AI Strategy Consultant, Frontier Tech

Scale

United States (Remote)

$60 – $80 Hourly