LLM Serving jobs
Explore LLM Serving roles on Inference Jobs and apply today.
81-100 of 289 jobs
Forward Deployed Engineer
CoreWeave
Livingston, New Jersey, United States (Hybrid)
$188k – $275k Yearly
Senior AI Software Engineer, GenAI Framework
NVIDIA
Santa Clara, California, United States (On-site)
$152k – $287.5k Yearly
Senior Research Scientist
EliseAI
New York, New York, United States (On-site)
$200k – $320k Yearly
Senior Technical Product Manager Token Factory - Inference
Nebius
United States (Remote)
$204k – $255k Yearly
Senior Forward Deployed Engineer
Harvey
New York, New York, United States (On-site)
$200k – $260k Yearly
Senior Manager Forward Deployed Engineers
CoreWeave
Livingston, New Jersey, United States (Hybrid)
$188k – $275k Yearly
Senior Deep Learning Research Engineer
NVIDIA
Tel Aviv-Yafo, Tel Aviv District, Israel (On-site)
Senior Software Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)
$152k – $287.5k Yearly
Senior Site Reliability Engineer, Managed AI
Crusoe
San Francisco, California, United States (On-site)
$172k – $209k Yearly
Staff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAI
Scale
San Francisco, California, United States (On-site)
$252k – $315k Yearly
Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI
Scale
San Francisco, California, United States (On-site)
$252k – $315k Yearly
Software Engineer, Applied Evals
OpenAI
San Francisco, California, United States (Hybrid)
$255k – $325k Yearly
AI Research Lead
Perplexity
San Francisco, California, United States (On-site)
$300k – $470k Yearly
Senior Full Stack Engineer, Observability & Evals Platform
LangChain
San Francisco, California, United States (On-site)
$175k – $225k Yearly
Research Engineer, Pretraining Scaling
Anthropic
San Francisco, California, United States (On-site)
$315k – $560k Yearly