
LLM Inference Jobs

Browse 426 LLM Inference jobs on Inference Jobs.

281-300 of 426 jobs

5d

Engineering Manager, ML Acceleration

Anthropic

San Francisco, California, United States (Hybrid) $425k – $560k Yearly

2w

Software Engineer

Cartesia

San Francisco, California, United States (On-site) $180k – $250k Yearly

2w

Open Source Engineer - Python

Braintrust

San Francisco, California, United States or Remote (California, United States + 2 more)

6d

Technical Product Manager (Cluster Experience)

Nebius

Amsterdam, North Holland, Netherlands or Remote (Europe)

6d

Staff Research Engineer, Discovery Team

Anthropic

San Francisco, California, United States (Hybrid) $340k – $425k Yearly

6d

Research Engineer, Infrastructure, Numerics

Thinking Machines Lab

San Francisco, California, United States (On-site) $350k – $475k Yearly

6d

Applied Research Lead, Language

Runway

North America + 1 more (Remote) $280k – $380k Yearly

6d

Systems/GPU Research Engineer

Vast.ai

San Francisco, California, United States (On-site) $160k – $320k Yearly

2w

Software Engineering Manager, Observability & Evals Platform

LangChain

San Francisco, California, United States (On-site) $200k – $250k Yearly

6d

Research, Post-Training

Thinking Machines Lab

San Francisco, California, United States (On-site) $350k – $475k Yearly

4w

Member of Engineering (Pre-training / Data Engineering)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa + 1 more)

6d

Research Engineer, Infrastructure, RL Systems

Thinking Machines Lab

San Francisco, California, United States (On-site) $350k – $475k Yearly

2w

FullStack Engineer, Observability & Evals Platform (LangSmith)

LangChain

San Francisco, California, United States (On-site) $145k – $180k Yearly

2w

Full-Stack Software Engineer, Inference

Cohere

Toronto, Ontario, Canada or Remote (Canada + 2 more)

2w

Applied AI, AI Engineer for Mistral

Mistral AI

Île de Ré, Charente-Maritime, France (On-site)

4w

Software Product Manager - Nemotron

NVIDIA

Santa Clara, California, United States (On-site) $240k – $379.5k Yearly

2w

AI & Provider Operations Engineer

OpenRouter

United States or Remote (United States)

2w

Staff Research Engineer

Decagon

San Francisco, California, United States (On-site) $350k – $475k Yearly

2w

JavaScript Engineer (Open Source Team)

LangChain

San Francisco, California, United States (On-site) $150k – $225k Yearly