LLM Performance Engineering Jobs
Explore LLM Performance Engineering roles on Inference Jobs and apply today.
13h agoNV
Senior Deep Learning Software Engineer, LLM Performance
NVIDIA
Santa Clara, California, United States (On-site)$184K – $356.5K Yearly
2mo agoNV
High-Performance LLM Training Engineer - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)$124K – $195.5K Yearly
4w agoTA
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site)$160K – $230K Yearly
1mo agoNV
Senior AI Software Development Engineer, TensorRT-LLM
NVIDIA
Yokne'am, Northern District, Israel (Hybrid)
3mo agoBA
Software Engineer - Model Performance
Baseten
San Francisco, California, United States (On-site)$150K – $250K Yearly
1mo agoNV
AI Inference Performance Engineer - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)$124K – $241.5K Yearly
2mo agoNV
Senior Software Engineer – TensorRT Edge-LLM
NVIDIA
Santa Clara, California, United States (Hybrid)$152K – $287.5K Yearly
1mo agoNV
Senior DL Algorithms Engineer - Inference Performance
NVIDIA
Santa Clara, California, United States (On-site)$184K – $356.5K Yearly
3mo agoSE
ML Model Serving Engineer
Sesame
San Francisco, California, United States (On-site)$175K – $280K Yearly
3mo agoD-
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Santa Clara, Ca, Ca, United States or Remote (California, United States)$30 – $59 Hourly
2mo agoSC
Tech Lead Manager, Machine Learning Research Scientist- LLM Evals
Scale
San Francisco, California, US$280K – $380K Yearly
3mo agoBA
Software Engineer, Model Performance Tooling
Baseten
Canada or Remote (Canada + 1 more)C$130K – C$200K Yearly
2mo agoNV
Senior Research Scientist, Fundamental LLM Research for Knowledge, Reasoning, and Agents
NVIDIA
Santa Clara, California, United States (On-site)$224K – $356.5K Yearly