LLM Inference Optimization jobs
Explore LLM Inference Optimization roles on Inference Jobs and apply today.
161-180 of 447 jobs
Senior Software Engineer, AI Inference Systems
NVIDIA
Toronto, Ontario, Canada (Hybrid)
C$170k – C$275k Yearly
Member of Technical Staff - ML Performance
Modal
New York, New York, United States (On-site)
$150k – $270k Yearly
Software Engineer, Inference Deployment
Anthropic
San Francisco, California, United States (Hybrid)
$320k – $485k Yearly
Software Engineer, Load Balancing - Inference
OpenAI
San Francisco, California, United States (On-site)
$325k – $490k Yearly
Senior Software Engineer - VLM Microservices for Neural Reconstruction
NVIDIA
Santa Clara, California, United States (On-site)
$152k – $287.5k Yearly
Senior ML Framework Performance Engineer - AI for Science at Scale
NVIDIA
Santa Clara, California, United States (On-site)
$184k – $287.5k Yearly
Senior Systems Software Engineer - Deep Learning Solutions
NVIDIA
Toronto, Ontario, Canada (On-site)
C$225k – C$275k Yearly
AI Researcher, Core ML
Together AI
San Francisco, California, United States (On-site)
$160k – $230k Yearly
Member of Technical Staff, Model Evaluation
xAI
Palo Alto, California, United States (On-site)
$180k – $440k Yearly
Infrastructure Engineer, ML Systems
Applied Compute
San Francisco, California, United States (On-site)
Software Engineer, Model Performance Tooling
Baseten
Canada or Remote (Canada + 1 more)
C$130k – C$200k Yearly
Software Engineer - Model API's
Baseten
San Francisco, California, United States (On-site)
$150k – $230k Yearly
Research Engineering Manager - Model Training
Perplexity
San Francisco, California, United States (On-site)
$300k – $470k Yearly
Internship - Machine Learning Research Engineer (Berlin)
Perplexity
Berlin, Berlin, Germany (On-site)