Model Serving Jobs

Browse 947 Model Serving jobs on Inference Jobs.

21-40 of 947 jobs

Posted 1w ago

Software Engineer, Model Performance Tooling

Baseten

Canada or Remote (Canada + 1 more)
C$130k – C$200k Yearly

Posted 5d ago

Member of Technical Staff - Enterprise Model Evaluation

xAI

Palo Alto, California, United States (On-site)
$180k – $440k Yearly

Posted 5d ago

[Omni] Member of Technical Staff, World Model

xAI

Bay Area, California, United States (On-site)
$180k – $440k Yearly

Posted 1w ago

Software Engineer - Model Performance

Baseten

San Francisco, California, United States (On-site)
$150k – $250k Yearly

Posted 1w ago

Senior ML Framework Performance Engineer - AI for Science at Scale

NVIDIA

Santa Clara, California, United States (On-site)
$184k – $287.5k Yearly

Posted 1w ago

Member of Technical Staff, Model Efficiency

Cohere

New York, New York, United States or Remote (New York, United States + 3 more)

Posted 2w ago

Research Intern, Model Shaping (Summer 2026)

Together AI

San Francisco, California, United States (On-site)

Posted 2w ago

Technical Program Manager – Adversarial Model Research

OpenAI

San Francisco, California, United States (Hybrid)
$230k – $285k Yearly

Posted 3w ago

Research Engineering Manager - Model Training

Perplexity

San Francisco, California, United States (On-site)
$300k – $470k Yearly

Posted 4w ago

Model Policy Manager

OpenAI

San Francisco, California, United States (Hybrid)
$255k – $325k Yearly

Posted 2w ago

Model Behavior Architect

Perplexity

San Francisco, California, United States (On-site)
$180k – $260k Yearly

Posted 1w ago

Senior Member of Technical Staff, Multimodal AI

Cohere

San Francisco, California, United States or Remote (Worldwide)

Posted 2w ago

Senior Datacenter Power Systems Modeling Engineer

NVIDIA

Santa Clara, California, United States (On-site)
$168k – $322k Yearly

Posted 2w ago

Model Policy Manager, Chemical & Biological Risk

OpenAI

San Francisco, California, United States (Hybrid)
$207k – $295k Yearly

Posted 2w ago

Member of Technical Staff, AI

Parallel Web Systems

San Francisco, California, United States (On-site)

Posted 2w ago

Research Scientist - Model Team

Mirelo AI

Berlin, Berlin, Germany (Hybrid)

Posted 1w ago

Researcher: Model Architecture, UK

Cartesia

London, England, United Kingdom (On-site)