1. Home
  2. Jobs
  3. LLM Runtimes

LLM Runtimes Jobs

Browse 307 LLM Runtimes jobs on Inference Jobs.

201-220 of 307 jobs

3wCO

Software Engineer, Inference AI/ML

CoreWeave

Sunnyvale, California, United States (Hybrid)$92k – $135k Yearly
2wD-

Software Machine Learning Test Engineer - Staff

d-Matrix

Bengaluru, Karnataka, India (Hybrid)$155.3k – $234.3k Yearly
6dTM

Research Engineer, Infrastructure, Numerics

Thinking Machines Lab

San Francisco, California, United States (On-site)$350k – $475k Yearly
1wOP

Software Engineer, Inference – AMD GPU Enablement

OpenAI

San Francisco, California, United States (On-site)$325k – $490k Yearly
17hNV

Senior Deep Learning Compiler Engineer - XLA

NVIDIA

Santa Clara, California, United States (On-site)$152k – $241.5k Yearly
4wNV

Agentic AI Solution Engineering Intern - Summer 2026

NVIDIA

Austin, Texas, United States (On-site)$20 – $71 Hourly
1wSI

Software Engineer, Security

Sierra

San Francisco, California, United States (On-site)$200k – $330k Yearly
3wNV

Senior Applied Deep Learning Research Scientist, Efficiency

NVIDIA

Santa Clara, California, United States (On-site)$192k – $356.5k Yearly
6dAN

TPU Kernel Engineer

Anthropic

San Francisco, California, United States (Hybrid)$280k – $560k Yearly
5dAN

Research Engineer, Machine Learning (Horizons)

Anthropic

San Francisco, California, United States (Hybrid)$280k – $425k Yearly
2wRA

Member of Technical Staff - Evaluations

Reflection AI

San Francisco, California, United States (On-site)
2wOP

Software Engineer, Applied Evals

OpenAI

San Francisco, California, United States (Hybrid)$255k – $325k Yearly
4wNV
4wNV

Senior Software Test Development Engineer - Deep Learning

NVIDIA

Santa Clara, California, United States (On-site)$140k – $270.3k Yearly
4wNV

Software Product Manager - Nemotron

NVIDIA

Santa Clara, California, United States (On-site)$240k – $379.5k Yearly
2wHE
17hNV

Senior Systems Software Engineer - Deep Learning Solutions

NVIDIA

Toronto, Ontario, Canada (On-site)C$225k – C$275k Yearly
4wGR
2wLA

FullStack Engineer, Observability & Evals Platform (LangSmith)

LangChain

San Francisco, California, United States (On-site)$145k – $180k Yearly