LLM Serving jobs
Explore LLM Serving roles on Inference Jobs and apply today.
221-240 of 289 jobs
Forward Deployed Engineer
HappyRobot
San Francisco, California, United States (On-site)
$120k – $200k Yearly
Solutions Architect, Applied AI
Anthropic
Île de Ré, Charente-Maritime, France (Hybrid)
€190k – €200k Yearly
Full Stack Software Engineer - Applied AI
Perplexity
San Francisco, California, United States (On-site)
$210k – $385k Yearly
Deep Learning Compiler Verification and Infra Development Intern - 2026
NVIDIA
Shanghai, Shanghai, China (On-site)
Research Engineer / Research Scientist - Foundations Retrieval Lead
OpenAI
San Francisco, California, United States (Hybrid)
$460k – $555k Yearly
Machine Learning Research Engineer, Agents - Enterprise GenAI
Scale
San Francisco, California, United States (On-site)
$252k – $315k Yearly
Applied AI, Forward Deployed Machine Learning Engineer - Morocco
Mistral AI
Casablanca, Casablanca-Settat, Morocco (On-site)
Senior Data Scientist – Enterprise AI Systems
NVIDIA
Santa Clara, California, United States (On-site)
$168k – $322k Yearly
Performance Engineer
Anthropic
San Francisco, California, United States (Hybrid)
$315k – $560k Yearly
Research Engineer, Infrastructure, Kernels
Thinking Machines Lab
San Francisco, California, United States (On-site)
$350k – $475k Yearly
Staff Research Engineer, Voice
Decagon
San Francisco, California, United States (On-site)
$350k – $475k Yearly
Director, Business Systems
Scale
San Francisco, California, United States (On-site)
$231k – $288.8k Yearly
Open Source Engineer - Go
Braintrust
San Francisco, California, United States or Remote (United States)
Sr. Engineer, Inference Ecosystem Engineering
Cerebras
Sunnyvale, California, United States (On-site)