Model Serving Jobs

Browse 921 Model Serving jobs on Inference Jobs.

121-140 of 921 jobs

4d

Senior Machine Learning Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site) $152k – $287.5k Yearly

4d

Senior Software Engineer I, Inference

CoreWeave

Sunnyvale, California, United States (Hybrid) $139k – $204k Yearly

7d

Inference Runtime, Engineering Manager

OpenAI

San Francisco, California, United States (On-site) $455k – $555k Yearly

2w

Senior AI Application Developer - GPU and SOC Architecture Modeling

NVIDIA

Santa Clara, California, United States (On-site) $152k – $287.5k Yearly

7d

Senior Software Engineer II, Inference

CoreWeave

Sunnyvale, California, United States (Hybrid) $165k – $242k Yearly

2w

Senior Staff Machine Learning Engineer - Frameworks

d-Matrix

Santa Clara, California, United States (Hybrid) $155k – $250k Yearly

2w

Training: ML Framework Engineer

OpenAI

San Francisco, California, United States (Hybrid) $245k – $385k Yearly

2w

Senior Applied Researcher, Audio Understanding

Cartesia

San Francisco, California, United States (On-site) $200k – $350k Yearly

2w

Software Engineer - Data Flywheel

Perplexity

London, England, United Kingdom (On-site) $210k – $385k Yearly

2w

Senior AI Software Engineer, GenAI Framework

NVIDIA

Santa Clara, California, United States (On-site) $152k – $287.5k Yearly

7d

AI Scientist - Paris/London

Mistral AI

Île de Ré, Charente-Maritime, France (Hybrid)

2w

Senior Full Stack LLM Engineer - Training

Cerebras

Sunnyvale, California, United States (On-site)

7d

Applied AI Engineering Manager, Enterprise

Scale

San Francisco, California, United States (On-site) $248.4k – $310.5k Yearly

7d

Research Engineer, Pretraining Scaling

Anthropic

San Francisco, California, United States (On-site) $315k – $560k Yearly

2w

Engineering Manager - Forward Deployed Engineering (LLM)

Baseten

San Francisco, California, United States (On-site) $220k – $285k Yearly