1. Home
  2. Jobs
  3. LLM Optimization

LLM Optimization jobs

Explore LLM Optimization roles on Inference Jobs and apply today.

121-140 of 536 jobs

LA2w

Deployed Engineer (West)

LangChain

San Francisco, California, United States (On-site)

$150k – $270k Yearly

LO4d

AI Engineer

Lovable

Stockholm, Stockholm, Sweden (On-site)

NV2d

Senior Scientist, Synthetic Data and Privacy

NVIDIA

Santa Clara, California, United States (On-site)

$192k – $356.5k Yearly

HA4w

Mid/Senior/Staff Software Engineer, Agents

Harvey

San Francisco, California, United States (On-site)

$165k – $312k Yearly

AI1w

ML Runtime Optimization Engineer

Applied Intuition

Mountain View, California, United States (On-site)

$159.1k – $199.3k Yearly

NV2w

Senior Deep Learning Test Development Engineer, SDET

NVIDIA

Shanghai, Shanghai, China (On-site)

HE2w

Staff AI Engineer (Orchestration)

Heidi

Sydney, New South Wales, Australia (Hybrid)

VE3w

LLMOps Engineer

Vertiv

Westerville, Ohio, United States (On-site)

DE2w

Senior Research Engineer

Decagon

San Francisco, California, United States (On-site)

£200k – £300k Yearly

TA1w

Machine Learning Engineer

Together AI

San Francisco, California, United States (On-site)

$160k – $220k Yearly

NV2w

Senior Performance Architect - Heterogeneous Workload Optimization

NVIDIA

Santa Clara, California, United States (Hybrid)

$184k – $356.5k Yearly

PE2w

Search Senior Machine Learning Engineer (London, Belgrade, Berlin)

Perplexity

Belgrade, Belgrade, Serbia (On-site)

OP2w

TLM, Machine Learning, Integrity

OpenAI

San Francisco, California, United States (On-site)

$405k – $490k Yearly

NE2w

Senior ML Engineer (Token Factory)

Nebius

Europe + 6 more (Remote)

PE2w

Data Scientist, Evals

Perplexity

London, England, United Kingdom (On-site)

$210k – $385k Yearly

BA2w

Software Engineer, Model Performance Tooling

Baseten

Canada or Remote (Canada + 1 more)

C$130k – C$200k Yearly

NV5d

Principal Software Engineer - AI Inference

NVIDIA

Santa Clara, California, United States (On-site)

$272k – $431.3k Yearly

PE2w

Search Machine Learning Research Engineer (Berlin)

Perplexity

Berlin, Berlin, Germany (On-site)

TA1w

Research Engineer, Frontier Speculative Decoding

Together AI

San Francisco, California, United States (On-site)

$190k – $270k Yearly

LA6d

Applied Research Intern

Labelbox

San Francisco, California, United States (Hybrid)

$35 – $45 Yearly