
LLM Inference Jobs

Browse 437 LLM Inference jobs on Inference Jobs.

41-60 of 437 jobs

4w

Deep Learning Software Engineer, FlashInfer - New College Grad 2025

NVIDIA

Santa Clara, California, United States (On-site) $108k – $195.5k Yearly

2w

Technical Enablement Lead

Baseten

San Francisco, California, United States (On-site) $175k – $210k Yearly

2w

Member of Technical Staff - Post-Training

Reflection AI

San Francisco, California, United States (On-site)

2w

Senior AI Software Engineer, GenAI Framework

NVIDIA

Santa Clara, California, United States (On-site) $152k – $287.5k Yearly

4w

Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI

Scale

San Francisco, California, United States (On-site) $252k – $315k Yearly

2w

Senior Software Research Architect, AI Networking

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-site)

3w

Senior Software Engineer - NIM Factory Container and Cloud Infrastructure

NVIDIA

Santa Clara, California, United States (On-site) $184k – $356.5k Yearly

5d

Senior Forward Deployed Data Scientist/Engineer

Scale

San Francisco, California, United States (Hybrid) $198k – $247.5k Yearly

2w

ML Model Serving Engineer

Sesame

San Francisco, California, United States (On-site) $175k – $280k Yearly

1w

Research Engineer, Codex

OpenAI

San Francisco, California, United States (Hybrid) $380k – $460k Yearly

13h

Senior ML Compiler Engineer

NVIDIA

Redmond, Washington, United States (On-site) $152k – $287.5k Yearly

1w

Member of Technical Staff, Grokipedia - Synthetic Data & Epistemics

xAI

Palo Alto, California, United States (On-site) $180k – $440k Yearly

5d

Senior System Software Engineer - Dynamo-Triton Inference Server

NVIDIA

Santa Clara, California, United States (On-site) $152k – $241.5k Yearly

5d

ML Research Engineer, ML Systems

Scale

San Francisco, California, United States (On-site) $218.4k – $273k Yearly

5d

Machine Learning Engineer - Inference

Together AI

San Francisco, California, United States (On-site) $160k – $230k Yearly

2w

Research Engineer

Magic

San Francisco, California, United States (On-site) $225k – $550k Yearly