Large Model Inference Jobs
Explore Large Model Inference roles on Inference Jobs and apply today.
2mo ago
Senior Deep Learning Engineer - Model Evaluation & AI Systems
NVIDIA
Santa Clara, California, United States (On-site) $224K – $431.3K Yearly
2mo ago
Senior Research Scientist, Fundamental LLM Research for Knowledge, Reasoning, and Agents
NVIDIA
Santa Clara, California, United States (On-site) $224K – $356.5K Yearly
3mo ago
Member of Technical Staff - Post-Training
Reflection AI
San Francisco, California, United States (On-site)
3mo ago
Member of Technical Staff - ML Performance
Modal
New York, United States (On-site) $150K – $270K Yearly
3w ago
ML Research Engineer, ML Systems
Scale
San Francisco, California, United States (On-site) $218.4K – $273K Yearly
3mo ago
Software Engineer - Model Performance
Baseten
San Francisco, California, United States (On-site) $150K – $250K Yearly
3mo ago
Software Engineer, Inference - Multi Modal
OpenAI
San Francisco, California, United States (On-site) $325K – $490K Yearly
3mo ago
Software Engineer, Model Inference
OpenAI
San Francisco, California, United States (On-site) $325K – $490K Yearly
3mo ago
Inference Technical Lead, Sora
OpenAI
San Francisco, California, United States (Hybrid) $380K Yearly
3w ago
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site) $160K – $230K Yearly
3w ago
Inference Technical Lead, On-Device Transformers
OpenAI
San Francisco, California, United States (Hybrid) $445K Yearly
3mo ago
Site Reliability Engineer, Inference Infrastructure
Cohere
Toronto, Ontario, Canada or Remote (Canada + 2 more)
3mo ago
ML Model Serving Engineer
Sesame
San Francisco, California, United States (On-site) $175K – $280K Yearly
3mo ago
Software Engineer, Inference – AMD GPU Enablement
OpenAI
San Francisco, California, United States (On-site) $325K – $490K Yearly