1. Home
  2. Jobs
  3. LLM Runtimes

LLM Runtimes Jobs

Browse 298 LLM Runtimes jobs on Inference Jobs.

21-40 of 298 jobs

1wNV

Senior LLM Agents Architect

NVIDIA

Yokneam Ilit, Northern District, Israel (Hybrid)
3wSC

Staff Machine Learning Research Scientist, LLM Evals

Scale

San Francisco, California, United States (On-site)$280k – $380k Yearly
2wHA

LLM Inference Engineer

Hippocratic AI

Palo Alto, California, United States (On-site)
1wAN

Research Engineer, Pretraining Scaling

Anthropic

San Francisco, California, United States (On-site)$315k – $560k Yearly
2wLA

Python OSS Engineer

LangChain

San Francisco, California, United States (On-site)$160k – $225k Yearly
2wCO

Member of Technical Staff, Model Efficiency

Cohere

New York, New York, United States or Remote (New York, United States + 3 more)
2wSC

Senior/Staff Machine Learning Engineer, General Agents, Enterprise GenAI

Scale

San Francisco, California, United States (On-site)$218k – $273k Yearly
2wCE

Senior Full Stack LLM Engineer - Training

Cerebras

Sunnyvale, California, United States (On-site)
2wPL

Research Engineer - Midtraining

Periodic Labs

Menlo Park, California, United States (On-site)
6dLA

Applied Research Engineer, Agents

Labelbox

San Francisco, California, United States (Hybrid)$250k – $300k Yearly
2wPL

Distributed Training Engineer

Periodic Labs

Menlo Park, California, United States (Hybrid)
2wCO

Staff Research Engineer, Model Efficiency

Cohere

New York, New York, United States (Hybrid)
2wPL

Research Engineer - Posttraining

Periodic Labs

Menlo Park, California, United States (On-site)
2wLA

Deployed Engineer (EMEA)

LangChain

London, England, United Kingdom (On-site)
2wPO

Member of Engineering (Pre-training and inference software)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa, North America)
5dNV

Senior Machine Learning Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
2wNE

Senior ML Solutions Architect - Token Factory

Nebius

United States (Remote)$215k – $275k Yearly
2wLA

JavaScript Engineer (Open Source Team)

LangChain

San Francisco, California, United States (On-site)$150k – $225k Yearly
1wCE

Senior Research Engineer - Inference ML

Cerebras

Sunnyvale, California, United States (Hybrid)