Inference Jobs in Santa Clara, California, United States
Discover Inference roles in Santa Clara, California, United States on Inference Jobs and apply today.
4w agoNE
Senior Site Reliability Engineer — Token Factory (Inference Platform)
Nebius
United States + 4 more (Remote)
1mo agoNV
AI Inference Performance Engineer - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)$124K – $241.5K Yearly
3mo agoCR
Staff Product Manager, Managed Inference (SF/Sunnyvale/New York)
Crusoe
San Francisco, California, United States or Remote (California, United States + 1 more)$204K – $247K Yearly
2mo agoNV
Principal Software Engineer - AI Inference
NVIDIA
Santa Clara, California, United States (On-site)$272K – $431.3K Yearly
3mo agoPO
Member of Engineering (Pre-training and inference software)
Poolside
United Kingdom or Remote (Europe, Middle East, and Africa, North America)
3mo agoCO
3mo agoCO
Audio Inference Engineer, Model Efficiency
Cohere
New York, United States or Remote (New York, United States + 3 more)
1mo agoNV
Senior DL Algorithms Engineer - Inference Performance
NVIDIA
Santa Clara, California, United States (On-site)$184K – $356.5K Yearly
2mo agoNV
Senior AI Inference Compiler Engineer
NVIDIA
Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
3mo agoCO
Staff Software Engineer, Inference Infrastructure
Cohere
San Francisco, California, United States or Remote (United States + 2 more)
2mo agoNV
Senior System Software Engineer - Dynamo-Triton Inference Server
NVIDIA
Santa Clara, California, United States (On-site)$152K – $241.5K Yearly
3mo agoCO
Site Reliability Engineer, Inference Infrastructure
Cohere
Toronto, Ontario, Canada or Remote (Canada + 2 more)
3d agoNV
Senior Software Engineer, Machine Learning Inference
NVIDIA
Santa Clara, California, United States (Hybrid)$152K – $287.5K Yearly
2mo agoD-
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Santa Clara, Ca, Ca, United States or Remote (California, United States)$30 – $59 Hourly
3mo agoCO
Member of Technical Staff, Model Efficiency
Cohere
New York, United States or Remote (New York, United States + 3 more)
2mo agoNV
Senior Software Engineer, AI Inference Systems
NVIDIA
Santa Clara, California, United States (Hybrid)$184K – $356.5K Yearly
2mo agoNV
Senior Compiler Engineer, AI Inference Platforms
NVIDIA
Santa Clara, California, United States (On-site)$152K – $241.5K Yearly