1. Home
  2. Jobs
  3. LLM Serving

LLM Serving Jobs

Browse 292 LLM Serving jobs on Inference Jobs.

21-40 of 292 jobs

2wCE

Senior Full Stack LLM Engineer - Training

Cerebras

Sunnyvale, California, United States (On-site)
3wSC

Staff Machine Learning Research Scientist, LLM Evals

Scale

San Francisco, California, United States (On-site)$280k – $380k Yearly
6dTA

LLM Inference Frameworks and Optimization Engineer

Together AI

San Francisco, California, United States (On-site)$160k – $230k Yearly
2wCO

Member of Technical Staff, Post-Training

Cohere

London, England, United Kingdom (Hybrid)
5dAN

Machine Learning Systems Engineer, RL Engineering

Anthropic

San Francisco, California, United States (Hybrid)$300k – $405k Yearly
6dCO

Principal Engineer, Inference

CoreWeave

Sunnyvale, California, United States (Hybrid)$206k – $303k Yearly
2wPO

Member of Engineering (Pre-training and inference software)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa, North America)
3dCO

Senior Software Engineer I, Inference

CoreWeave

Sunnyvale, California, United States (Hybrid)$139k – $204k Yearly
6dCO

Senior Software Engineer II, Inference

CoreWeave

Sunnyvale, California, United States (Hybrid)$165k – $242k Yearly
6dNV

Senior LLM Agents Architect

NVIDIA

Yokneam Ilit, Northern District, Israel (Hybrid)
2wCO

Staff Research Engineer, Model Efficiency

Cohere

New York, New York, United States (Hybrid)
2wLA

Deployed Engineer (EMEA)

LangChain

London, England, United Kingdom (On-site)
2wNV

Principal Software Engineer - Inference as a Service

NVIDIA

Santa Clara, California, United States (On-site)$248k – $391k Yearly
2wRA
2wLA

Customer Engineer

LangChain

London, England, United Kingdom or Remote (United Kingdom + 1 more)
2wRA

Member of Technical Staff - Evaluations

Reflection AI

San Francisco, California, United States (On-site)
2wSE

ML Engineer

Sesame

New York, New York, United States (On-site)$190k – $320k Yearly
3wCO

Software Engineer, Inference AI/ML

CoreWeave

Sunnyvale, California, United States (Hybrid)$92k – $135k Yearly
5dLA

Applied Research Engineer, Agents

Labelbox

San Francisco, California, United States (Hybrid)$250k – $300k Yearly