LLM Inference jobs
Explore LLM Inference roles on Inference Jobs and apply today.
141-160 of 424 jobs
AI Research Lead
Perplexity
San Francisco, California, United States (On-site)
$300k – $470k Yearly
Senior Manager Forward Deployed Engineers
CoreWeave
Livingston, New Jersey, United States (Hybrid)
$188k – $275k Yearly
AI Infrastructure Engineer, Model Serving Platform
Scale
San Francisco, California, United States (On-site)
$179.4k – $224.3k Yearly
Staff Product Manager, Managed Inference (SF/Sunnyvale/New York)
Crusoe
San Francisco, California, United States or Remote (California, United States + 1 more)
$204k – $247k Yearly
AI Engineer, Applied ML
Perplexity
San Francisco, California, United States (On-site)
$210k – $385k Yearly
Software Engineer, Load Balancing - Inference
OpenAI
San Francisco, California, United States (On-site)
$325k – $490k Yearly
Senior Deep Learning Engineer - Model Evaluation & AI Systems
NVIDIA
Santa Clara, California, United States (On-site)
$224k – $431.3k Yearly
Product Marketing Manager, CoreWeave Inference
CoreWeave
Livingston, New Jersey, United States (Hybrid)
$143k – $210k Yearly
Deep Learning Compiler Verification and Infra Development Intern - 2026
NVIDIA
Shanghai, Shanghai, China (On-site)
Member of Technical Staff, Inference
xAI
Palo Alto, California, United States (On-site)
$180k – $440k Yearly
Deployed Engineer (West)
LangChain
San Francisco, California, United States (On-site)
$150k – $270k Yearly
Engineering Manager, Inference
Anthropic
San Francisco, California, United States (Hybrid)
$425k – $560k Yearly