- Home
- Jobs
- United States
- California
- Santa Clara
- AI Inference Engineering
AI Inference Engineering Jobs in Santa Clara, California, United States
Discover AI Inference Engineering roles in Santa Clara, California, United States on Inference Jobs and apply today.
2mo agoNV
Principal Software Engineer - AI Inference
NVIDIA
Santa Clara, California, United States (On-site)$272K – $431.3K Yearly
1mo agoNV
AI Inference Performance Engineer - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)$124K – $241.5K Yearly
2mo agoNV
Senior AI Inference Compiler Engineer
NVIDIA
Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
2mo agoNV
Senior Compiler Engineer, AI Inference Platforms
NVIDIA
Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
1mo agoNV
Senior DL Algorithms Engineer - Inference Performance
NVIDIA
Santa Clara, California, United States (On-site)$184K – $356.5K Yearly
2mo agoNV
Senior System Software Engineer - Dynamo-Triton Inference Server
NVIDIA
Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
3mo agoCO
Audio Inference Engineer, Model Efficiency
Cohere
New York, United States or Remote (New York, United States + 3 more)
2mo agoNV
Senior Software Engineer, AI Inference Systems
NVIDIA
Santa Clara, California, United States (Hybrid)$184K – $356.5K Yearly
18h agoNV
Senior Software Engineer - AI Inference
NVIDIA
Santa Clara, California, United States (On-site)$152K – $287.5K Yearly
2mo agoNV
Senior Compiler Engineer, AI Inference Performance
NVIDIA
Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
3mo agoCR
Staff Product Manager, Managed Inference (SF/Sunnyvale/New York)
Crusoe
San Francisco, California, United States or Remote (California, United States + 1 more)$204K – $247K Yearly
18h agoNV
Software Engineer, Machine Learning Inference - New College Grad 2026
NVIDIA
Santa Clara, California, United States (Hybrid)$108K – $195.5K Yearly
3mo agoD-
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Santa Clara, Ca, Ca, United States or Remote (California, United States)$30 – $59 Hourly
3mo agoCO
Staff Software Engineer, Inference Infrastructure
Cohere
San Francisco, California, United States or Remote (United States + 2 more)
2mo agoNV
Senior Deep Learning Engineer - Model Evaluation & AI Systems
NVIDIA
Santa Clara, California, United States (On-site)$224K – $431.3K Yearly
2mo agoNV
Senior ML Framework Performance Engineer - AI for Science at Scale
NVIDIA
Santa Clara, California, United States (On-site)$184K – $287.5K Yearly
2mo agoNV
Lead Principal Engineer, Enterprise Agentic AI Platform
NVIDIA
Santa Clara, California, United States (On-site)$272K – $431.3K Yearly