AI Model Inference Jobs
Browse 1,278 AI Model Inference jobs on Inference Jobs.
1w
Senior ML Framework Performance Engineer - AI for Science at Scale
NVIDIA
Santa Clara, California, United States (On-site) | $184k – $287.5k Yearly
6d
Senior Deep Learning Engineer - Model Evaluation & AI Systems
NVIDIA
Santa Clara, California, United States (On-site) | $224k – $431.3k Yearly
2w
Inference Technical Lead, Sora
OpenAI
San Francisco, California, United States (Hybrid) | $380k – $380k Yearly
2w
AI Inference Engineer (San Francisco)
Perplexity
San Francisco, California, United States (On-site) | $210k – $385k Yearly
3w
Inference Compiler and Frontend Engineer – Dubai
Cerebras
Dubai, Dubai, United Arab Emirates (On-site)
1w
Senior Technical Product Manager Token Factory - Inference
Nebius
United States (Remote) | $204k – $255k Yearly
3w
Platform Architecture Engineer, GeForce NOW
NVIDIA
Santa Clara, California, United States (On-site) | $184k – $287.5k Yearly
2w
AI Scientist, BioMedical AI
Xaira Therapeutics
South San Francisco, California, United States (On-site) | $150k – $240k Yearly
5d
Engineering Manager, Inference
Anthropic
San Francisco, California, United States (Hybrid) | $425k – $560k Yearly
4w
Agentic AI Solution Engineering Intern - Summer 2026
NVIDIA
Austin, Texas, United States (On-site) | $20 – $71 Hourly
4w
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Campbell, California, United States or Remote (California, United States) | $30 – $59 Hourly
4w
Technical Marketing Engineer, World Models - AV Physical AI
NVIDIA
Santa Clara, California, United States (On-site) | $148k – $287.5k Yearly
2w
Staff Product Manager, Managed Inference (SF/Sunnyvale/New York)
Crusoe
San Francisco, California, United States or Remote (California, United States + 1 more) | $204k – $247k Yearly
2w
Fullstack Engineer, Applied AI
LangChain
San Francisco, California, United States (On-site) | $170k – $195k Yearly
1w
Software Engineer, Model Inference
OpenAI
San Francisco, California, United States (On-site) | $325k – $490k Yearly
6d
Inference Runtime, Engineering Manager
OpenAI
San Francisco, California, United States (On-site) | $455k – $555k Yearly
2w
Research Engineer, Privacy
OpenAI
San Francisco, California, United States (On-site) | $380k – $460k Yearly
4w
Python / PyTorch Developer — Frontend Inference Compiler – Dubai
Cerebras
United Arab Emirates (On-site)
6d
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site) | $160k – $230k Yearly