LLM Inference Optimization Jobs
Browse 452 LLM Inference Optimization jobs on Inference Jobs.
101-120 of 452 jobs
2wSC
Tech Lead Manager, Machine Learning Research Scientist- LLM Evals
Scale
San Francisco, California, United States (On-site)$280k – $380k Yearly
7dAN
Research Engineer, Pretraining Scaling (London)
Anthropic
London, England, United Kingdom (On-site)£250k – £435k Yearly
3dAN
3wNV
Platform Architecture Engineer, GeForce NOW
NVIDIA
Santa Clara, California, United States (On-site)$184k – $287.5k Yearly
1wNE
Senior Technical Product Manager Token Factory - Inference
Nebius
United States (Remote)$204k – $255k Yearly
2dNV
Senior Deep Learning Compiler Engineer - XLA
NVIDIA
Santa Clara, California, United States (On-site)$152k – $241.5k Yearly
2wOP
Software Engineer, Productivity
OpenAI
San Francisco, California, United States (On-site)$255k – $405k Yearly
7dAN
Staff Research Engineer, Discovery Team
Anthropic
San Francisco, California, United States (Hybrid)$340k – $425k Yearly
7dXA
Member of Technical Staff, RL Training Framework
xAI
Palo Alto, California, United States (On-site)$180k – $440k Yearly
4dNV
Senior AI Compiler Engineer, MLIR
NVIDIA
Santa Clara, California, United States (On-site)$152k – $241.5k Yearly
6dAN
Engineering Manager, Inference
Anthropic
San Francisco, California, United States (Hybrid)$425k – $560k Yearly
2wMA
6dLA
Applied Research Engineer, Agents
Labelbox
San Francisco, California, United States (Hybrid)$250k – $300k Yearly
7dXA
Member of Technical Staff - Reasoning Efficiency
xAI
Palo Alto, California, United States (On-site)$180k – $440k Yearly
3wSC
Staff Machine Learning Research Scientist, LLM Evals
Scale
San Francisco, California, United States (On-site)$280k – $380k Yearly
7dVA
GPU Systems Engineer – HPC / Parallel Computing
Vast.ai
San Francisco, California, United States (On-site)$160k – $320k Yearly
7dAN
Research Engineer, Discovery
Anthropic
San Francisco, California, United States (Hybrid)$340k – $425k Yearly