LLM Serving Engineering Jobs
Explore LLM Serving Engineering roles on Inference Jobs and apply today.
3mo ago
ML Model Serving Engineer
Sesame
San Francisco, California, United States (On-site)
$175K – $280K Yearly
1mo ago
Engineering Manager, Model Serving
Together AI
San Francisco, California, United States (On-site)
$250K – $300K Yearly
4w ago
AI Infrastructure Engineer, Model Serving Platform
Scale
San Francisco, California, United States (On-site)
$179.4K – $224.3K Yearly
17h ago
Senior Deep Learning Software Engineer, LLM Performance
NVIDIA
Santa Clara, California, United States (On-site)
$184K – $356.5K Yearly
4w ago
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site)
$160K – $230K Yearly
2w ago
Senior Machine Learning Engineer, Voice AI
Together AI
San Francisco, California, United States (On-site)
$200K – $260K Yearly
2mo ago
Senior Software Engineer – TensorRT Edge-LLM
NVIDIA
Santa Clara, California, United States (Hybrid)
$152K – $287.5K Yearly
2mo ago
High-Performance LLM Training Engineer - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)
$124K – $195.5K Yearly
3mo ago
Software Engineer - Model APIs
Baseten
San Francisco, California, United States (On-site)
$150K – $230K Yearly
1mo ago
Senior AI Software Development Engineer, TensorRT-LLM
NVIDIA
Yokne'am, Northern District, Israel (Hybrid)