1. Home
  2. Jobs
  3. LLM Inference Optimization

LLM Inference Optimization Jobs

Browse 452 LLM Inference Optimization jobs on Inference Jobs.

81-100 of 452 jobs

7dCO

Senior Software Engineer II, Inference

CoreWeave

Sunnyvale, California, United States (Hybrid)$165k – $242k Yearly
2wCE

Forward Deployed Product Manager

Cerebras

San Francisco, California, United States (Hybrid)
3wCO

Software Engineer, Inference AI/ML

CoreWeave

Sunnyvale, California, United States (Hybrid)$92k – $135k Yearly
7dAN

ML Infrastructure Engineer, Safeguards

Anthropic

San Francisco, California, United States (Hybrid)$320k – $405k Yearly
2wOP

Software Engineer, Model Inference

OpenAI

San Francisco, California, United States (On-site)$325k – $490k Yearly
4wNV

Product Manager - BioNeMo Inference

NVIDIA

New York, New York, United States (On-site)$168k – $258.8k Yearly
4dNV

Senior Compiler Engineer, AI Inference Platforms

NVIDIA

Santa Clara, California, United States (On-site)$152k – $241.5k Yearly
4dCO

Senior Software Engineer I, Inference

CoreWeave

Sunnyvale, California, United States (Hybrid)$139k – $204k Yearly
2wNV

Senior Deep Learning Performance Architect

NVIDIA

California, United States (Hybrid)$152k – $287.5k Yearly
7dTM

Research Engineer, Infrastructure, Numerics

Thinking Machines Lab

San Francisco, California, United States (On-site)$350k – $475k Yearly
2wNV
7dTA

Machine Learning Engineer

Together AI

San Francisco, California, United States (On-site)$160k – $220k Yearly
7dSC

Machine Learning Research Scientist / Research Engineer, Post-Training

Scale

San Francisco, California, United States (On-site)$252k – $315k Yearly
3wAN

Software Engineer, Encoding Libraries

Anthropic

San Francisco, California, United States (Hybrid)$300k – $405k Yearly
7dTM

Research Engineer, Infrastructure, Kernels

Thinking Machines Lab

San Francisco, California, United States (On-site)$350k – $475k Yearly
4dNV

Senior Compiler Engineer, AI Inference Performance

NVIDIA

Santa Clara, California, United States (On-site)$152k – $241.5k Yearly
2wRA

Forward Deployed Engineer Lead

Reflection AI

New York, New York, United States (On-site)
3wCR

Staff Software Engineer, Model LifeCycle

Crusoe

San Francisco, California, United States (On-site)$204k – $247k Yearly