LLM Inference Optimization Jobs
Browse 444 LLM Inference Optimization jobs on Inference Jobs.
61-80 of 444 jobs
2wBA
Software Engineer - Model Performance
Baseten
San Francisco, California, United States (On-site)$150k – $250k Yearly
6dTA
Machine Learning Engineer - Inference
Together AI
San Francisco, California, United States (On-site)$160k – $230k Yearly
2wOP
Inference Technical Lead, Sora
OpenAI
San Francisco, California, United States (Hybrid)$380k – $380k Yearly
1wTA
Research Engineer, Core ML
Together AI
San Francisco, California, United States (On-site)$200k – $280k Yearly
3wCE
Inference Compiler and Frontend Engineer – Dubai
Cerebras
Dubai, Dubai, United Arab Emirates (On-site)
2dNV
Senior Scientist, Synthetic Data and Privacy
NVIDIA
Santa Clara, California, United States (On-site)$192k – $356.5k Yearly
2wLA
Fullstack Engineer, Applied AI
LangChain
San Francisco, California, United States (On-site)$170k – $195k Yearly
3wNV
Senior Software Engineer - NIM Factory Container and Cloud Infrastructure
NVIDIA
Santa Clara, California, United States (On-site)$184k – $356.5k Yearly
6dNV
Senior System Software Engineer - Dynamo-Triton Inference Server
NVIDIA
Santa Clara, California, United States (On-site)$152k – $241.5k Yearly
6dSC
Senior Forward Deployed Data Scientist/Engineer
Scale
San Francisco, California, United States (Hybrid)$198k – $247.5k Yearly
4wSC
Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI
Scale
San Francisco, California, United States (On-site)$252k – $315k Yearly
2wNV
Senior AI Software Engineer, GenAI Framework
NVIDIA
Santa Clara, California, United States (On-site)$152k – $287.5k Yearly
6dAN
Research Engineer, Pretraining Scaling
Anthropic
San Francisco, California, United States (On-site)$315k – $560k Yearly
2wXA
Member of Technical Staff, Grokipedia - Synthetic Data & Epistemics
xAI
Palo Alto, California, United States (On-site)$180k – $440k Yearly
6dCO
Principal Engineer, Inference
CoreWeave
Sunnyvale, California, United States (Hybrid)$206k – $303k Yearly