1. Home
  2. Jobs
  3. LLM Serving

LLM Serving Jobs

Browse 296 LLM Serving jobs on Inference Jobs.

61-80 of 296 jobs

2wBA

Software Engineer - Model Performance

Baseten

San Francisco, California, United States (On-site)$150k – $250k Yearly
2wBA

Software Engineer - Model API's

Baseten

San Francisco, California, United States (On-site)$150k – $230k Yearly
2wLA

Deployed Engineer (East)

LangChain

New York, New York, United States (On-site)$150k – $270k Yearly
2wXA

Member of Technical Staff, Grokipedia - Synthetic Data & Epistemics

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
2wLA
3wCR

Principal Engineer, AI Model LifeCycle

Crusoe

San Francisco, California, United States (On-site)$256k – $320k Yearly
3dNV

Senior Machine Learning Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
2wRA

Member of Technical Staff - Safety Lead

Reflection AI

San Francisco, California, United States (On-site)
2wAN

Solutions Architect, Applied AI

Anthropic

Bengaluru, Karnataka, India (Hybrid)
6dAN

Staff Software Engineer, Inference

Anthropic

Dublin, County Dublin, Ireland (Hybrid)€295k – €355k Yearly
3wLA

Deployed Engineer (Central)

LangChain

Chicago, Illinois, United States or Remote (Illinois, United States + 1 more)$150k – $270k Yearly
2wNE

Senior ML Solutions Architect - Token Factory

Nebius

United States (Remote)$215k – $275k Yearly
2wBA

Software Engineer, Model Performance Tooling

Baseten

Canada or Remote (Canada + 1 more)C$130k – C$200k Yearly
2wMA

Research Engineer

Magic

San Francisco, California, United States (On-site)$225k – $550k Yearly
6dAN

Solutions Architect, Applied AI

Anthropic

München, Bavaria, Germany (Hybrid)€190k – €200k Yearly
3wNV

Senior Software Engineer - NIM Factory Container and Cloud Infrastructure

NVIDIA

Santa Clara, California, United States (On-site)$184k – $356.5k Yearly
6dTM

Research Engineer, Infrastructure, Inference

Thinking Machines Lab

San Francisco, California, United States (On-site)$350k – $475k Yearly
4wLA

Deployed Engineer (Toronto)

LangChain

Toronto, Ontario, Canada or Remote (Canada)
1wCO

Forward Deployed Engineer

CoreWeave

Livingston, New Jersey, United States (Hybrid)$188k – $275k Yearly