1. Home
  2. Jobs
  3. LLM Inference Optimization

LLM Inference Optimization jobs

Explore LLM Inference Optimization roles on Inference Jobs and apply today.

141-160 of 447 jobs

TM1w

Research Engineer, Infrastructure, RL Systems

Thinking Machines Lab

San Francisco, California, United States (On-site)

$350k – $475k Yearly

HE2w

Staff AI Engineer (Orchestration)

Heidi

Sydney, New South Wales, Australia (Hybrid)

PO4w

Member of Engineering (Pre-training / Synthetic Data)

Poolside

United States or Remote (United States + 2 more)

CE3w

Deployment Engineer, AI Inference

Cerebras

Sunnyvale, California, United States (On-site)

PO2w

Member of Engineering (Inference)

Poolside

United Kingdom or Remote (Europe + 1 more)

CO2w

Member of Technical Staff, Post-Training

Cohere

London, England, United Kingdom (Hybrid)

PE2w

AI Engineer, Applied ML

Perplexity

San Francisco, California, United States (On-site)

$210k – $385k Yearly

SC1w

AI Infrastructure Engineer, Model Serving Platform

Scale

San Francisco, California, United States (On-site)

$179.4k – $224.3k Yearly

VE3w

LLMOps Engineer

Vertiv

Westerville, Ohio, United States (On-site)

CO2w

Member of Technical Staff, Pretraining evaluations

Cohere

London, England, United Kingdom or Remote (Worldwide)

NV2w

Senior Software Engineer, AI Inference Systems

NVIDIA

Santa Clara, California, United States (Hybrid)

$184k – $356.5k Yearly

PL2w

Research Engineer - Midtraining

Periodic Labs

Menlo Park, California, United States (On-site)

PO2w

Member of Engineering (Pre-training / Data)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa, North America)

MA1w

AI Scientist - Paris (Internship Phd)

Mistral AI

Paris, Paris, France (On-site)

XA4w

Member of Technical Staff, Inference

xAI

Palo Alto, California, United States (On-site)

$180k – $440k Yearly

TA1w

Research Engineer, Frontier Speculative Decoding

Together AI

San Francisco, California, United States (On-site)

$190k – $270k Yearly

CO2w

Product Marketing Manager, CoreWeave Inference

CoreWeave

Livingston, New Jersey, United States (Hybrid)

$143k – $210k Yearly

NV1w

Senior Deep Learning Engineer - Model Evaluation & AI Systems

NVIDIA

Santa Clara, California, United States (On-site)

$224k – $431.3k Yearly

CR2w

Staff Product Manager, Managed Inference (SF/Sunnyvale/New York)

Crusoe

San Francisco, California, United States or Remote (California, United States + 1 more)

$204k – $247k Yearly

RA2w

Member of Technical Staff - GPU Infrastructure

Reflection AI

San Francisco, California, United States (On-site)