1. Home
  2. Jobs
  3. LLM Inference Optimization

LLM Inference Optimization Jobs

Browse 444 LLM Inference Optimization jobs on Inference Jobs.

61-80 of 444 jobs

2wBA

Software Engineer - Model Performance

Baseten

San Francisco, California, United States (On-site)$150k – $250k Yearly
6dTA

Machine Learning Engineer - Inference

Together AI

San Francisco, California, United States (On-site)$160k – $230k Yearly
2wOP

Inference Technical Lead, Sora

OpenAI

San Francisco, California, United States (Hybrid)$380k – $380k Yearly
1wTA

Research Engineer, Core ML

Together AI

San Francisco, California, United States (On-site)$200k – $280k Yearly
3wCE

Inference Compiler and Frontend Engineer – Dubai

Cerebras

Dubai, Dubai, United Arab Emirates (On-site)
2dNV

Senior Scientist, Synthetic Data and Privacy

NVIDIA

Santa Clara, California, United States (On-site)$192k – $356.5k Yearly
4dCR

Research Engineer

Crusoe

Tel Aviv-Yafo, Tel Aviv District, Israel (On-site)
2wLA

Fullstack Engineer, Applied AI

LangChain

San Francisco, California, United States (On-site)$170k – $195k Yearly
3wNV

Senior Software Engineer - NIM Factory Container and Cloud Infrastructure

NVIDIA

Santa Clara, California, United States (On-site)$184k – $356.5k Yearly
6dNV

Senior System Software Engineer - Dynamo-Triton Inference Server

NVIDIA

Santa Clara, California, United States (On-site)$152k – $241.5k Yearly
6dSC

Senior Forward Deployed Data Scientist/Engineer

Scale

San Francisco, California, United States (Hybrid)$198k – $247.5k Yearly
4wSC

Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI

Scale

San Francisco, California, United States (On-site)$252k – $315k Yearly
2wNV

Senior AI Software Engineer, GenAI Framework

NVIDIA

Santa Clara, California, United States (On-site)$152k – $287.5k Yearly
6dAN

Research Engineer, Pretraining Scaling

Anthropic

San Francisco, California, United States (On-site)$315k – $560k Yearly
2wXA

Member of Technical Staff, Grokipedia - Synthetic Data & Epistemics

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
6dNE

GPU Cluster Architect

Nebius

United States (Remote)$150k – $180k Yearly
6dCO

Principal Engineer, Inference

CoreWeave

Sunnyvale, California, United States (Hybrid)$206k – $303k Yearly