Model Serving Jobs

Browse 899 Model Serving jobs on Inference Jobs.

181-200 of 899 jobs

2w

Senior Product Engineer - Training Platform

Baseten

San Francisco, California, United States (On-site) $200k – $275k Yearly
2w

Machine Learning Engineer - Defense

Applied Intuition

Ann Arbor, Michigan, United States (On-site) $130k – $200k Yearly
1w

Machine Learning Research Engineer, GenAI Applied ML

Scale

San Francisco, California, United States (On-site) $176k – $220k Yearly
4w

Machine Learning Engineer - Defense

Applied Intuition

Washington, District of Columbia, United States (On-site) $150k – $225k Yearly
2w

AI Inference Engineer (London)

Perplexity

London, England, United Kingdom (On-site)
2w

AI Inference Engineer (San Francisco)

Perplexity

San Francisco, California, United States (On-site) $210k – $385k Yearly
2w

AI Engineer, Applied ML

Perplexity

San Francisco, California, United States (On-site) $210k – $385k Yearly
5d

Principal Software Engineer - AI Inference

NVIDIA

Santa Clara, California, United States (On-site) $272k – $431.3k Yearly
2w

Senior Software Engineer - VLM Microservices for Neural Reconstruction

NVIDIA

Santa Clara, California, United States (On-site) $152k – $287.5k Yearly
1w

Machine Learning Engineer - Inference

Together AI

San Francisco, California, United States (On-site) $160k – $230k Yearly
1wCL
3wCE
1w

Senior Software Engineer, Inference

Anthropic

Dublin, Dublin, Ireland (Hybrid) €235k – €295k Yearly
1w

ML Systems Engineer, Robotics

Scale

San Francisco, California, United States (On-site) $218.4k – $273k Yearly
3d

Senior Systems Software Engineer - Deep Learning Solutions

NVIDIA

Toronto, Ontario, Canada (On-site) C$225k – C$275k Yearly
4w

Software Engineer - Applied Inference

xAI

Palo Alto, California, United States (On-site) $180k – $440k Yearly
6d

Senior Software Engineer - Infrastructure

Baseten

San Francisco, California, United States (On-site) $150k – $230k Yearly
2w

Senior Software Engineer – TensorRT Edge-LLM

NVIDIA

Santa Clara, California, United States (Hybrid) $152k – $287.5k Yearly
2w

Inference Engineering Manager

Perplexity

San Francisco, California, United States (On-site) $300k – $385k Yearly