1. Home
  2. Jobs
  3. LLM Serving

LLM Serving jobs

Explore LLM Serving roles on Inference Jobs and apply today.

221-240 of 289 jobs

HA2w

Forward Deployed Engineer

HappyRobot

San Francisco, California, United States (On-site)

$120k – $200k Yearly

AN1w

Solutions Architect, Applied AI

Anthropic

Île de Ré, Charente-Maritime, France (Hybrid)

€190k – €200k Yearly

AN2w

Product Engineer, Applied AI

Anthropic

Seoul, Seoul, South Korea (Hybrid)

PE2w

Full Stack Software Engineer - Applied AI

Perplexity

San Francisco, California, United States (On-site)

$210k – $385k Yearly

NV2w

Deep Learning Compiler Verification and Infra Development Intern - 2026

NVIDIA

Shanghai, Shanghai, China (On-site)

OP2w

Research Engineer / Research Scientist - Foundations Retrieval Lead

OpenAI

San Francisco, California, United States (Hybrid)

$460k – $555k Yearly

AN5d

Solutions Architect, Applied AI

Anthropic

東京都, Tokyo Prefecture, Japan (Hybrid)

SC4w

Machine Learning Research Engineer, Agents - Enterprise GenAI

Scale

San Francisco, California, United States (On-site)

$252k – $315k Yearly

VE3w

Lead Engineer (API Integration Specialist (Device/Gateway))

Vertiv

India (On-site)

CE2w

Forward Deployed Product Manager

Cerebras

San Francisco, California, United States (Hybrid)

MA2w

Applied AI, Forward Deployed Machine Learning Engineer - Morocco

Mistral AI

Casablanca, Casablanca-Settat, Morocco (On-site)

NV2w

Senior Data Scientist – Enterprise AI Systems

NVIDIA

Santa Clara, California, United States (On-site)

$168k – $322k Yearly

MA1w

AI Scientist - Palo Alto

Mistral AI

Palo Alto, California, United States (Hybrid)

AN1w

Performance Engineer

Anthropic

San Francisco, California, United States (Hybrid)

$315k – $560k Yearly

TM1w

Research Engineer, Infrastructure, Kernels

Thinking Machines Lab

San Francisco, California, United States (On-site)

$350k – $475k Yearly

DE2w

Staff Research Engineer, Voice

Decagon

San Francisco, California, United States (On-site)

$350k – $475k Yearly

SC3w

Director, Business Systems

Scale

San Francisco, California, United States (On-site)

$231k – $288.8k Yearly

BR2w

Open Source Engineer - Go

Braintrust

San Francisco, California, United States or Remote (United States)

CE3w

Sr. Engineer, Inference Ecosystem Engineering

Cerebras

Sunnyvale, California, United States (On-site)

AD1w

Senior Software Engineer, Training

ada

Canada (Remote)