Inference Optimization Jobs
Browse 482 Inference Optimization jobs on Inference Jobs.
41-60 of 482 jobs
5dVA
Systems/GPU Research Engineer
Vast.ai
San Francisco, California, United States (On-site) $160k – $320k Yearly
2wNV
Senior Software Engineer – TensorRT Edge-LLM
NVIDIA
Santa Clara, California, United States (Hybrid) $152k – $287.5k Yearly
1wCO
Member of Technical Staff, Model Efficiency
Cohere
New York, New York, United States or Remote (New York, United States + 3 more)
2wPE
Inference Engineering Manager
Perplexity
San Francisco, California, United States (On-site) $300k – $385k Yearly
1wOP
Software Engineer, Monetization Delivery
OpenAI
San Francisco, California, United States (On-site) $255k – $405k Yearly
5dVA
GPU Systems Engineer – HPC / Parallel Computing
Vast.ai
San Francisco, California, United States (On-site) $160k – $320k Yearly
3dNV
Senior Compiler Engineer - AI
NVIDIA
Santa Clara, California, United States (On-site) $184k – $287.5k Yearly
2wSE
ML Model Serving Engineer
Sesame
San Francisco, California, United States (On-site) $175k – $280k Yearly
2wHF
Senior Open-Source Machine Learning Engineer, Computer Vision - EMEA Remote
Hugging Face
Île de Ré, Charente-Maritime, France or Remote (Europe, Middle East, and Africa)
5dAN
Research Engineer, Discovery
Anthropic
San Francisco, California, United States (Hybrid) $340k – $425k Yearly
5dAN
Staff Research Engineer, Discovery Team
Anthropic
San Francisco, California, United States (Hybrid) $340k – $425k Yearly
3dNV
Senior Machine Learning Applications and Compiler Engineer
NVIDIA
Cambridge, England, United Kingdom (Hybrid)
1wOP
Research Engineer / Research Scientist - Foundations Retrieval Lead
OpenAI
San Francisco, California, United States (Hybrid) $460k – $555k Yearly
5dSC
ML Research Engineer, ML Systems
Scale
San Francisco, California, United States (On-site) $218.4k – $273k Yearly