Model Serving Jobs

Browse 947 Model Serving jobs on Inference Jobs.

81-100 of 947 jobs

4d

Forward Deployed ML Engineer

Modal

New York, New York, United States (On-site) $180k – $250k Yearly

3w

C++ Machine Learning Engineer, Models Training

Tenstorrent

Austin, Texas, United States (Hybrid) $100k – $500k Yearly

4w

Software Engineering Intern, Simulation and Modeling

d-Matrix

Santa Clara, California, United States (Hybrid) $30 – $59 Hourly

3w

Senior Software Engineer, Managed AI

Crusoe

San Francisco, California, United States (On-site) $166k – $201k Yearly

1w

Machine Learning Data Scientist, Forecasting

OpenAI

San Francisco, California, United States (Hybrid) $255k – $405k Yearly

6d

Applied Scientist / Research Engineer - EMEA

Mistral AI

Île de Ré, Charente-Maritime, France (Hybrid)

6d

Senior GPU Functional Modeling Architect

NVIDIA

Santa Clara, California, United States (On-site) $152k – $287.5k Yearly

6d

Applied Research Engineer

Applied Compute

San Francisco, California, United States (On-site)

2w

Engineer, ML Models

Tenstorrent

Santa Clara, California, United States (Hybrid) $100k – $500k Yearly

2w

AI & Provider Operations Engineer

OpenRouter

United States or Remote (United States)

2w

Forward Deployed Engineer (FDE), Life Sciences - SF

OpenAI

San Francisco, California, United States (Hybrid) $220k – $280k Yearly

4w

Senior Power Methodology and Modeling Engineer

NVIDIA

Austin, Texas, United States (On-site) $136k – $264.5k Yearly

1w

Research Engineer / Research Scientist - Foundations Retrieval Lead

OpenAI

San Francisco, California, United States (Hybrid) $460k – $555k Yearly

1w

Software Engineer, India

Cartesia

Bengaluru, Karnataka, India (On-site) ₹7M – ₹9M Yearly

2w

ML Runtime Optimization Engineer - Lead

Applied Intuition

Sunnyvale, California, United States (On-site) $199.3k – $264.5k Yearly

6d

Research Engineer, Infrastructure, Inference

Thinking Machines Lab

San Francisco, California, United States (On-site) $350k – $475k Yearly

6d

Research Manager, Foundation Models

Runway

New York, New York, United States or Remote (North America + 1 more) $360k – $450k Yearly

4w

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

d-Matrix

Campbell, California, United States or Remote (California, United States) $30 – $59 Hourly