Cache Optimization Jobs
Browse 303 Cache Optimization jobs on Inference Jobs.
101-120 of 303 jobs
1wNV
Senior Performance Verification Engineer
NVIDIA
Santa Clara, California, United States (On-site)$136k – $264.5k Yearly
3wRE
Senior Marketing Manager, SEO & Organic Growth
Replit
Foster City, California, United States (Hybrid)$165k – $215k Yearly
3wNV
Senior Deep Learning Performance Architect
NVIDIA
California, United States (Hybrid)$152k – $287.5k Yearly
2wBA
Software Engineer - Model API's
Baseten
San Francisco, California, United States (On-site)$150k – $230k Yearly
3wRE
Staff Product Engineer, Product Platform
Replit
Foster City, California, United States (Hybrid)$225k – $320k Yearly
4dNV
Developer Technology Intern, High-Performance Databases - Summer 2026
NVIDIA
Santa Clara, California, United States (On-site)$20 – $71 Hourly
1wOP
Inference Runtime, Engineering Manager
OpenAI
San Francisco, California, United States (On-site)$455k – $555k Yearly
2wOP
Software Engineer, Hardware
OpenAI
San Francisco, California, United States (Hybrid)$310k – $460k Yearly
6dNV
Senior Performance Compiler Engineer - Triton
NVIDIA
Redmond, Washington, United States (On-site)$184k – $287.5k Yearly
2wPW
Member of Technical Staff, Infrastructure & Scaling
Parallel Web Systems
San Francisco, California, United States (On-site)
2wOP
Software Engineer, Model Inference
OpenAI
San Francisco, California, United States (On-site)$325k – $490k Yearly
1wNV
Senior Software Engineer, AI Inference Systems
NVIDIA
Toronto, Ontario, Canada (Hybrid)C$170k – C$275k Yearly
2wNV
Senior Machine Learning Applications and Compiler Engineer
NVIDIA
Santa Clara, California, United States (Hybrid)$152k – $287.5k Yearly
2wPE
AI Inference Engineer (San Francisco)
Perplexity
San Francisco, California, United States (On-site)$210k – $385k Yearly
6dNV
Senior Compiler Engineer, AI Inference Performance
NVIDIA
Santa Clara, California, United States (On-site)$152k – $241.5k Yearly