1. Home
  2. Jobs
  3. LLM Stack

LLM Stack jobs

Explore LLM Stack roles on Inference Jobs and apply today.

201-220 of 447 jobs

CO2w

Member of Technical Staff, MLE (Korea)

Cohere

Seoul, Seoul, South Korea or Remote (South Korea)

OP2w

Research Engineer, Codex

OpenAI

San Francisco, California, United States (Hybrid)

$380k – $460k Yearly

MA2w

Research Engineer

Magic

San Francisco, California, United States (On-site)

$225k – $550k Yearly

NV2w

Solutions Architect - Financial Service and Retail

NVIDIA

Beijing, Beijing, China (On-site)

CO1w

Staff Engineer - Perf and Benchmarking

CoreWeave

Sunnyvale, California, United States (Hybrid)

$188k – $275k Yearly

RA2w

Member of Technical Staff - Data Quality Engineer (Pre-training)

Reflection AI

San Francisco, California, United States (On-site)

NV2w

Senior Deep Learning Test Development Engineer, SDET

NVIDIA

Shanghai, Shanghai, China (On-site)

NV2d

Senior Software Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)

$152k – $287.5k Yearly

OP2w

Full-Stack Software Engineer, Frontier Exploration

OpenAI

San Francisco, California, United States (Hybrid)

$255k – $325k Yearly

BA2w

Software Engineer - Model API's

Baseten

San Francisco, California, United States (On-site)

$150k – $230k Yearly

LA2w

Senior Fullstack Engineer, Growth & Monetization

LangChain

San Francisco, California, United States (On-site)

$175k – $225k Yearly

OP2w

Software Engineer, Full-Stack — Developer Experience

OpenAI

San Francisco, California, United States (On-site)

$210k – $490k Yearly

RE2w

Member of Technical Staff (Applied AI)

Reka

Unknown or Remote (United States + 1 more)

CR4d

Site Reliability Engineer, Managed AI

Crusoe

San Francisco, California, United States (On-site)

$204k – $247k Yearly

LA2w

Python OSS Engineer

LangChain

San Francisco, California, United States (On-site)

$160k – $225k Yearly

OP2w

Full Stack Software Engineer - Enterprise Agents

OpenAI

San Francisco, California, United States (On-site)

$255k – $405k Yearly

PE3w

Research Engineering Manager - Model Training

Perplexity

San Francisco, California, United States (On-site)

$300k – $470k Yearly

PO5d

Member of Engineering (Pre-training / CUDA)

Poolside

Europe + 1 more (Remote)

HE2w

Senior Full Stack Software Engineer

Heidi

San Francisco, California, United States (On-site)

$150k – $220k Yearly

PE2w

Internship - Search Machine Learning Engineer (Belgrade)

Perplexity

Belgrade, Belgrade, Serbia (On-site)