Distributed Inference Jobs
Browse 712 Distributed Inference jobs on Inference Jobs.
1w
Engineering Manager - Forward Deployed Engineering (LLM)
Baseten
San Francisco, California, United States (On-site)
$220k – $285k Yearly
5d
Inference Runtime, Engineering Manager
OpenAI
San Francisco, California, United States (On-site)
$455k – $555k Yearly
5d
Director of Engineering, Inference Services
CoreWeave
Sunnyvale, California, United States (Hybrid)
$206k – $303k Yearly
2w
Senior Staff Machine Learning Engineer - Frameworks
d-Matrix
Santa Clara, California, United States (Hybrid)
$155k – $250k Yearly
1w
Software Engineer, Load Balancing - Inference
OpenAI
San Francisco, California, United States (On-site)
$325k – $490k Yearly
2w
Inference Engineering Manager
Perplexity
San Francisco, California, United States (On-site)
$300k – $385k Yearly
1w
Software Engineer, Inference – AMD GPU Enablement
OpenAI
San Francisco, California, United States (On-site)
$325k – $490k Yearly
5d
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site)
$160k – $230k Yearly
3w
Software Engineer - Applied Inference
xAI
Palo Alto, California, United States (On-site)
$180k – $440k Yearly
5d
Staff Software Engineer, Inference
Anthropic
Dublin, County Dublin, Ireland (Hybrid)
€295k – €355k Yearly
1w
Software Engineer, Model Inference
OpenAI
San Francisco, California, United States (On-site)
$325k – $490k Yearly
3w
Member of Technical Staff, Inference
xAI
Palo Alto, California, United States (On-site)
$180k – $440k Yearly
5d
Senior/Staff Software Engineer, Inference
Anthropic
New York, New York, United States (Hybrid)
$300k – $485k Yearly
2w
Senior Software Engineer - Inference as a Service
NVIDIA
Santa Clara, California, United States (On-site)
$200k – $391k Yearly