LLM Stack Jobs

Browse 453 LLM Stack jobs on Inference Jobs.

101-120 of 453 jobs

3w

Deployed Engineer (Central)

LangChain

Chicago, Illinois, United States or Remote (Illinois, United States + 1 more) $150k – $270k Yearly
6d

AI Infrastructure Engineer, Model Serving Platform

Scale

San Francisco, California, United States (On-site) $179.4k – $224.3k Yearly
2w

Software Engineer, AI Platform

Perplexity

San Francisco, California, United States (On-site) $210k – $385k Yearly
3w

Principal Engineer, AI Model LifeCycle

Crusoe

San Francisco, California, United States (On-site) $256k – $320k Yearly
6d

Senior/Staff Software Engineer, Inference

Anthropic

New York, New York, United States (Hybrid) $300k – $485k Yearly
2w

Software Engineer - Model Performance

Baseten

San Francisco, California, United States (On-site) $150k – $250k Yearly
2w

GTM Engineer

LangChain

San Francisco, California, United States (On-site) $160k – $180k Yearly
3w

Sr. Engineer, Inference Ecosystem Engineering

Cerebras

Sunnyvale, California, United States (On-site)
2w

Research Engineer - Posttraining

Periodic Labs

Menlo Park, California, United States (On-site)
4w

Deployed Engineer (Toronto)

LangChain

Toronto, Ontario, Canada or Remote (Canada)
2w

Deployed Engineer (West)

LangChain

San Francisco, California, United States (On-site) $150k – $270k Yearly
6d

Principal ML Investigator

Cerebras

Sunnyvale, California, United States (On-site)
2w

Education Engineer, Fullstack

LangChain

San Francisco, California, United States (On-site) $150k – $185k Yearly
6d

Member of Technical Staff - Multimodal Post-training

xAI

Palo Alto, California, United States (On-site) $180k – $440k Yearly
1w

Backend Software Engineer (Evals) – Support Automation Engineering

OpenAI

San Francisco, California, United States (On-site) $255k – $405k Yearly
2w

Data Scientist, Evals

Perplexity

London, England, United Kingdom (On-site) $210k – $385k Yearly
2w

Inference Engineering Manager

Perplexity

San Francisco, California, United States (On-site) $300k – $385k Yearly
2wLA
4w

Deep Learning Software Engineer, FlashInfer - New College Grad 2025

NVIDIA

Santa Clara, California, United States (On-site) $108k – $195.5k Yearly