LLM Inference Optimization Jobs

Browse 452 LLM Inference Optimization jobs on Inference Jobs.

101-120 of 452 jobs

4w

ML API Features SDET

Cerebras

Sunnyvale, California, United States (Hybrid)

2w

Tech Lead Manager, Machine Learning Research Scientist - LLM Evals

Scale

San Francisco, California, United States (On-site) $280k – $380k Yearly

7d

Research Engineer, Pretraining Scaling (London)

Anthropic

London, England, United Kingdom (On-site) £250k – £435k Yearly

3w

Platform Architecture Engineer, GeForce NOW

NVIDIA

Santa Clara, California, United States (On-site) $184k – $287.5k Yearly

7d

Principal ML Investigator

Cerebras

Sunnyvale, California, United States (On-site)

2d

Senior Deep Learning Compiler Engineer - XLA

NVIDIA

Santa Clara, California, United States (On-site) $152k – $241.5k Yearly

2w

Software Engineer, Productivity

OpenAI

San Francisco, California, United States (On-site) $255k – $405k Yearly

7d

Senior LLM Agents Architect

NVIDIA

Yokneam Ilit, Northern District, Israel (Hybrid)

7d

Staff Research Engineer, Discovery Team

Anthropic

San Francisco, California, United States (Hybrid) $340k – $425k Yearly

7d

Member of Technical Staff, RL Training Framework

xAI

Palo Alto, California, United States (On-site) $180k – $440k Yearly

4d

Senior AI Compiler Engineer, MLIR

NVIDIA

Santa Clara, California, United States (On-site) $152k – $241.5k Yearly

6d

Engineering Manager, Inference

Anthropic

San Francisco, California, United States (Hybrid) $425k – $560k Yearly

2w

Software Engineer, Technical Lead, Inference

Mistral AI

Île de Ré, Charente-Maritime, France (Hybrid)

6d

Applied Research Engineer, Agents

Labelbox

San Francisco, California, United States (Hybrid) $250k – $300k Yearly

7d

Member of Technical Staff - Reasoning Efficiency

xAI

Palo Alto, California, United States (On-site) $180k – $440k Yearly

3w

Staff Machine Learning Research Scientist, LLM Evals

Scale

San Francisco, California, United States (On-site) $280k – $380k Yearly

7d

GPU Systems Engineer – HPC / Parallel Computing

Vast.ai

San Francisco, California, United States (On-site) $160k – $320k Yearly

7d

Research Engineer, Discovery

Anthropic

San Francisco, California, United States (Hybrid) $340k – $425k Yearly