
LLM Inference Optimization Jobs

Browse 455 LLM Inference Optimization jobs on Inference Jobs.

121-140 of 455 jobs

2w

Senior/Staff Machine Learning Engineer, General Agents, Enterprise GenAI

Scale

San Francisco, California, United States (On-site) $218k – $273k Yearly
2w

ML Runtime Optimization Engineer - Lead

Applied Intuition

Sunnyvale, California, United States (On-site) $199.3k – $264.5k Yearly
4w

Deep Learning Algorithm Engineer - New College Grad 2026

NVIDIA

Santa Clara, California, United States (On-site) $124k – $241.5k Yearly
2w

Software Engineer, Inference – AMD GPU Enablement

OpenAI

San Francisco, California, United States (On-site) $325k – $490k Yearly
2w

AI Research Engineer, Enterprise Evaluations

Scale

San Francisco, California, United States (On-site) $179.4k – $224.3k Yearly
2w

Senior ML Solutions Architect - Token Factory

Nebius

United States (Remote) $215k – $275k Yearly
2w

Senior Software Engineer, Deep Learning Inference - TensorRT

NVIDIA

Santa Clara, California, United States (Hybrid) $152k – $287.5k Yearly
3w

Inference Frontend

Cerebras

Sunnyvale, California, United States (On-site)
2w

Member of Technical Staff - Evaluations

Reflection AI

San Francisco, California, United States (On-site)
4w

Software Engineer - Applied Inference

xAI

Palo Alto, California, United States (On-site) $180k – $440k Yearly
2w

High-Performance LLM Training Engineer - New College Grad 2026

NVIDIA

Santa Clara, California, United States (On-site) $124k – $195.5k Yearly
2w

Distributed Training Engineer

Periodic Labs

Menlo Park, California, United States (Hybrid)
2w

Senior ML Engineer (Token Factory)

Nebius

Amsterdam, North Holland, Netherlands (On-site)
7d

ML Runtime Optimization Engineer

Applied Intuition

Mountain View, California, United States (On-site) $159.1k – $199.3k Yearly
6d

Applied Research Intern

Labelbox

San Francisco, California, United States (Hybrid) $35 – $45 Yearly
7d

TPU Kernel Engineer

Anthropic

San Francisco, California, United States (Hybrid) $280k – $560k Yearly
2w

AI Researcher

Perplexity

San Francisco, California, United States (On-site) $210k – $470k Yearly
3w

Principal Engineer, AI Model LifeCycle

Crusoe

San Francisco, California, United States (On-site) $256k – $320k Yearly