1. Home
  2. Jobs
  3. Model Serving

Model Serving Jobs

Explore Model Serving roles on Inference Jobs and apply today.

3w agoOP

TL, Research Inference

OpenAI

San Francisco, California, United States (On-site)$380K – $555K Yearly
2mo agoNV

Senior Software Engineer – TensorRT Edge-LLM

NVIDIA

Santa Clara, California, United States (Hybrid)$152K – $287.5K Yearly
3mo agoBA
2mo agoNV

Principal Software Engineer - AI Inference

NVIDIA

Santa Clara, California, United States (On-site)$272K – $431.3K Yearly
2mo agoDE

Staff Software Engineer, ML Infrastructure

Decagon

San Francisco, California, United States (On-site)$300K – $430K Yearly
3w agoSC

ML Research Engineer, ML Systems

Scale

San Francisco, California, United States (On-site)$218.4K – $273K Yearly
2mo agoOP
2mo agoXA

Software Engineer - Applied Inference

xAI

Palo Alto, California, United States (On-site)$180K – $440K Yearly
4w agoDE

Product Manager, AI Models

Descript

Mission District, San Francisco, California, US or Remote (Worldwide)$171K – $235K Yearly
2mo agoNV

Senior Power Methodology and Modeling Engineer

NVIDIA

Austin, Texas, United States (On-site)$136K – $264.5K Yearly