LLM Inference Jobs
Browse 437 LLM Inference jobs on Inference Jobs.
437 jobs
5dTA
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site)$160k – $230k Yearly
1wBA
Engineering Manager - Forward Deployed Engineering (LLM)
Baseten
San Francisco, California, United States (On-site)$220k – $285k Yearly
5dCO
Director of Engineering, Inference Services
CoreWeave
Sunnyvale, California, United States (Hybrid)$206k – $303k Yearly
1wCO
Member of Technical Staff, Model Efficiency
Cohere
New York, New York, United States or Remote (New York, United States + 3 more)
4wD-
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Campbell, California, United States or Remote (California, United States)$30 – $59 Hourly
2wPE
AI Inference Engineer (San Francisco)
Perplexity
San Francisco, California, United States (On-site)$210k – $385k Yearly
3dNV
Senior Machine Learning Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)$152k – $287.5k Yearly
2wPE
Inference Engineering Manager
Perplexity
San Francisco, California, United States (On-site)$300k – $385k Yearly
2wNV
Principal Software Engineer - Inference as a Service
NVIDIA
Santa Clara, California, United States (On-site)$248k – $391k Yearly
4dNV
Senior Research Scientist, Fundamental LLM Research for Knowledge, Reasoning, and Agents
NVIDIA
Santa Clara, California, United States (On-site)$224k – $356.5k Yearly
9hNV
Senior Software Engineer, Quantized Inference
NVIDIA
Redmond, Washington, United States (On-site)$152k – $287.5k Yearly