1. Home
  2. Jobs
  3. LLM Serving Frameworks

LLM Serving Frameworks Jobs

Browse 422 LLM Serving Frameworks jobs on Inference Jobs.

61-80 of 422 jobs

2wSC

Senior/Staff Machine Learning Engineer, General Agents, Enterprise GenAI

Scale

San Francisco, California, United States (On-site)$218k – $273k Yearly
1wPO

Member of Engineering (Pre-training and inference software)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa, North America)
2wLA

Deployed Engineer (EMEA)

LangChain

London, England, United Kingdom (On-site)
1wCA

Software Engineer

Cartesia

San Francisco, California, United States (On-site)$180k – $250k Yearly
5dNV
6dCE

Full Stack LLM Engineer

Cerebras

Toronto, Ontario, Canada (On-site)
2wLA
3wCR

Senior Site Reliability Engineer, Managed AI

Crusoe

San Francisco, California, United States (On-site)$172k – $209k Yearly
5dAN

Machine Learning Systems Engineer, RL Engineering

Anthropic

San Francisco, California, United States (Hybrid)$300k – $405k Yearly
1wHA

Forward Deployed Engineer - Portuguese Speaking

HappyRobot

Madrid, Madrid, Spain or Remote (Madrid, Spain)
5dLA

Applied Research Engineer, Agents

Labelbox

San Francisco, California, United States (Hybrid)$250k – $300k Yearly
2wSC

Tech Lead Manager, Machine Learning Research Scientist- LLM Evals

Scale

San Francisco, California, United States (On-site)$280k – $380k Yearly
2wNE

Senior ML Solutions Architect - Token Factory

Nebius

United States (Remote)$215k – $275k Yearly
2wCE

Senior Full Stack LLM Engineer - Training

Cerebras

Sunnyvale, California, United States (On-site)
1wCO

Staff Research Engineer, Model Efficiency

Cohere

New York, New York, United States (Hybrid)
2wNV

Senior Data Scientist – Enterprise AI Systems

NVIDIA

Santa Clara, California, United States (On-site)$168k – $322k Yearly
2wNV

High-Performance LLM Training Engineer - New College Grad 2026

NVIDIA

Santa Clara, California, United States (On-site)$124k – $195.5k Yearly
2wLA

Senior Full Stack Engineer, Observability & Evals Platform

LangChain

San Francisco, California, United States (On-site)$175k – $225k Yearly
4wCO

Applied AI Engineer – Agentic Workflows (Korea)

Cohere

Seoul, Seoul, South Korea or Remote (South Korea)