LLM Inference Optimization jobs
Explore LLM Inference Optimization roles on Inference Jobs and apply today.
141-160 of 447 jobs
Research Engineer, Infrastructure, RL Systems
Thinking Machines Lab
San Francisco, California, United States (On-site)
$350k – $475k Yearly
Member of Engineering (Pre-training / Synthetic Data)
Poolside
United States or Remote (United States + 2 more)
AI Engineer, Applied ML
Perplexity
San Francisco, California, United States (On-site)
$210k – $385k Yearly
AI Infrastructure Engineer, Model Serving Platform
Scale
San Francisco, California, United States (On-site)
$179.4k – $224.3k Yearly
Member of Technical Staff, Pretraining evaluations
Cohere
London, England, United Kingdom or Remote (Worldwide)
Senior Software Engineer, AI Inference Systems
NVIDIA
Santa Clara, California, United States (Hybrid)
$184k – $356.5k Yearly
Member of Engineering (Pre-training / Data)
Poolside
United Kingdom or Remote (Europe, Middle East, and Africa, North America)
Member of Technical Staff, Inference
xAI
Palo Alto, California, United States (On-site)
$180k – $440k Yearly
Research Engineer, Frontier Speculative Decoding
Together AI
San Francisco, California, United States (On-site)
$190k – $270k Yearly
Product Marketing Manager, CoreWeave Inference
CoreWeave
Livingston, New Jersey, United States (Hybrid)
$143k – $210k Yearly
Senior Deep Learning Engineer - Model Evaluation & AI Systems
NVIDIA
Santa Clara, California, United States (On-site)
$224k – $431.3k Yearly
Staff Product Manager, Managed Inference (SF/Sunnyvale/New York)
Crusoe
San Francisco, California, United States or Remote (California, United States + 1 more)
$204k – $247k Yearly
Member of Technical Staff - GPU Infrastructure
Reflection AI
San Francisco, California, United States (On-site)