LLM Serving Jobs
Browse 296 LLM Serving jobs on Inference Jobs.
61-80 of 296 jobs
2wBA
Software Engineer - Model Performance
Baseten
San Francisco, California, United States (On-site)$150k – $250k Yearly
2wBA
Software Engineer - Model API's
Baseten
San Francisco, California, United States (On-site)$150k – $230k Yearly
2wLA
2wXA
Member of Technical Staff, Grokipedia - Synthetic Data & Epistemics
xAI
Palo Alto, California, United States (On-site)$180k – $440k Yearly
3wCR
Principal Engineer, AI Model LifeCycle
Crusoe
San Francisco, California, United States (On-site)$256k – $320k Yearly
3dNV
Senior Machine Learning Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
2wRA
Member of Technical Staff - Safety Lead
Reflection AI
San Francisco, California, United States (On-site)
6dAN
Staff Software Engineer, Inference
Anthropic
Dublin, County Dublin, Ireland (Hybrid)€295k – €355k Yearly
3wLA
Deployed Engineer (Central)
LangChain
Chicago, Illinois, United States or Remote (Illinois, United States + 1 more)$150k – $270k Yearly
2wBA
Software Engineer, Model Performance Tooling
Baseten
Canada or Remote (Canada + 1 more)C$130k – C$200k Yearly
3wNV
Senior Software Engineer - NIM Factory Container and Cloud Infrastructure
NVIDIA
Santa Clara, California, United States (On-site)$184k – $356.5k Yearly
6dTM
Research Engineer, Infrastructure, Inference
Thinking Machines Lab
San Francisco, California, United States (On-site)$350k – $475k Yearly
1wCO
Forward Deployed Engineer
CoreWeave
Livingston, New Jersey, United States (Hybrid)$188k – $275k Yearly