Distributed Serving Jobs
Explore Distributed Serving roles on Inference Jobs and apply today.
2mo ago
Research Infrastructure Engineer, Research Acceleration
Thinking Machines Lab
San Francisco, California, United States (On-site) $350K – $475K Yearly
3w ago
Senior Software Engineer - Together Cloud Platform
Together AI
San Francisco, California, United States (Hybrid) $160K – $230K Yearly
3mo ago
Software Engineer II - Artifact Management
CoreWeave
Livingston, New Jersey, United States (Hybrid) $109K – $160K Yearly
3mo ago
Software Engineer, Accelerators
OpenAI
San Francisco, California, United States (On-site) $310K – $380K Yearly
2w ago
Senior Platform Engineer, Voice AI
Together AI
San Francisco, California, United States (On-site) $200K – $260K Yearly
1mo ago
Engineering Manager, Model Serving
Together AI
San Francisco, California, United States (On-site) $250K – $300K Yearly
3w ago
AI Infrastructure Engineer, Model Serving Platform
Scale
San Francisco, California, United States (On-site) $179.4K – $224.3K Yearly
2mo ago
Staff Software Engineer, ML Infrastructure
Decagon
San Francisco, California, United States (On-site) $300K – $430K Yearly
1mo ago
AI Inference Performance Engineer - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site) $124K – $241.5K Yearly
3mo ago
Inference Technical Lead, Sora
OpenAI
San Francisco, California, United States (Hybrid) $380K Yearly
2mo ago
Software Engineer - Applied Inference
xAI
Palo Alto, California, United States (On-site) $180K – $440K Yearly
3mo ago
Software Engineer, Data Infrastructure - Research
OpenAI
San Francisco, California, United States (On-site) $250K – $380K Yearly
2mo ago
Senior System Software Engineer - Dynamo-Triton Inference Server
NVIDIA
Santa Clara, California, United States (On-site) $152K – $241.5K Yearly
3w ago
Research, Audio Expertise
Thinking Machines Lab
San Francisco, California, United States (On-site) $350K – $475K Yearly