LLM Inference Optimization Jobs

Browse 438 LLM Inference Optimization jobs on Inference Jobs.

21-40 of 438 jobs

Posted 6d ago

Staff Software Engineer, Inference

Anthropic

Dublin, County Dublin, Ireland (Hybrid)

€295k – €355k Yearly

Posted 1d ago

Senior ML Compiler Engineer

NVIDIA

Redmond, Washington, United States (On-site)

$152k – $287.5k Yearly

Posted 2w ago

Member of Technical Staff - Post-Training

Reflection AI

San Francisco, California, United States (On-site)

Posted 2w ago

Senior Deep Learning Engineer

NVIDIA

Warszawa, Masovian Voivodeship, Poland (Hybrid)

zł 292.5k – zł 507k Yearly

Posted 6d ago

Systems/GPU Research Engineer

Vast.ai

San Francisco, California, United States (On-site)

$160k – $320k Yearly

Posted 2w ago

Research Engineer

Magic

San Francisco, California, United States (On-site)

$225k – $550k Yearly

Posted 2w ago

ML Model Serving Engineer

Sesame

San Francisco, California, United States (On-site)

$175k – $280k Yearly

Posted 2w ago

Research Engineer, Codex

OpenAI

San Francisco, California, United States (Hybrid)

$380k – $460k Yearly

Posted 6d ago

ML Research Engineer, ML Systems

Scale

San Francisco, California, United States (On-site)

$218.4k – $273k Yearly

Posted 4d ago

Senior Compiler Engineer - AI

NVIDIA

Santa Clara, California, United States (On-site)

$184k – $287.5k Yearly

Posted 2w ago

Senior Machine Learning Applications and Compiler Engineer

NVIDIA

Santa Clara, California, United States (Hybrid)

$152k – $287.5k Yearly

Posted 2w ago

Inference Engineering Manager

Perplexity

San Francisco, California, United States (On-site)

$300k – $385k Yearly

Posted 4d ago

Senior Machine Learning Engineer, Quantized Inference

NVIDIA

Redmond, Washington, United States (On-site)

$152k – $287.5k Yearly

Posted 6d ago

Director of Engineering, Inference Services

CoreWeave

Sunnyvale, California, United States (Hybrid)

$206k – $303k Yearly

Posted 2w ago

ML Engineer

Sesame

New York, New York, United States (On-site)

$190k – $320k Yearly

Posted 2w ago

Research Engineer / Research Scientist - Foundations Retrieval Lead

OpenAI

San Francisco, California, United States (Hybrid)

$460k – $555k Yearly

Posted 2w ago

Member of Engineering (Pre-training and inference software)

Poolside

United Kingdom or Remote (Europe, Middle East, and Africa, North America)