1. Home
  2. Jobs
  3. LLM Inference

LLM Inference jobs

Explore LLM Inference roles on Inference Jobs and apply today.

141-160 of 424 jobs

MA2w

Applied AI, Evaluation Engineer

Mistral AI

Île de Ré, Charente-Maritime, France (On-site)

LA2w

Deployed Engineer (EMEA)

LangChain

London, England, United Kingdom (On-site)

PE2w

AI Research Lead

Perplexity

San Francisco, California, United States (On-site)

$300k – $470k Yearly

NE2w

Senior ML Solutions Architect - Token Factory

Nebius

United States (Remote)

$215k – $275k Yearly

CO1w

Senior Manager Forward Deployed Engineers

CoreWeave

Livingston, New Jersey, United States (Hybrid)

$188k – $275k Yearly

SC1w

AI Infrastructure Engineer, Model Serving Platform

Scale

San Francisco, California, United States (On-site)

$179.4k – $224.3k Yearly

CR2w

Staff Product Manager, Managed Inference (SF/Sunnyvale/New York)

Crusoe

San Francisco, California, United States or Remote (California, United States + 1 more)

$204k – $247k Yearly

MA1w

AI Scientist - Paris (Internship Phd)

Mistral AI

Paris, Paris, France (On-site)

PE2w

AI Engineer, Applied ML

Perplexity

San Francisco, California, United States (On-site)

$210k – $385k Yearly

OP2w

Software Engineer, Load Balancing - Inference

OpenAI

San Francisco, California, United States (On-site)

$325k – $490k Yearly

PE2w

UK Internship Program

Perplexity

London, England, United Kingdom (Hybrid)

NV1w

Senior Deep Learning Engineer - Model Evaluation & AI Systems

NVIDIA

Santa Clara, California, United States (On-site)

$224k – $431.3k Yearly

CO2w

Product Marketing Manager, CoreWeave Inference

CoreWeave

Livingston, New Jersey, United States (Hybrid)

$143k – $210k Yearly

NV2w

Deep Learning Compiler Verification and Infra Development Intern - 2026

NVIDIA

Shanghai, Shanghai, China (On-site)

PL2w

Research Engineer - Midtraining

Periodic Labs

Menlo Park, California, United States (On-site)

CE3w

Principal Engineer, AI Inference Reliability

Cerebras

United States + 1 more (Remote)

XA4w

Member of Technical Staff, Inference

xAI

Palo Alto, California, United States (On-site)

$180k – $440k Yearly

LA2w

Deployed Engineer (West)

LangChain

San Francisco, California, United States (On-site)

$150k – $270k Yearly

AN6d

Engineering Manager, Inference

Anthropic

San Francisco, California, United States (Hybrid)

$425k – $560k Yearly

VE3w

LLMOps Engineer

Vertiv

Westerville, Ohio, United States (On-site)