Caching Jobs in California, United States
Discover Caching roles in California, United States on Inference Jobs and apply today.
3mo ago
Training Performance Engineer
OpenAI
San Francisco, California, United States (Hybrid)
$250K – $460K Yearly
3mo ago
Member of Engineering (Scalability)
Poolside
United Kingdom or Remote (Europe, Middle East, and Africa, North America)
3mo ago
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Santa Clara, California, United States or Remote (California, United States)
$30 – $59 Hourly
3mo ago
Audio Inference Engineer, Model Efficiency
Cohere
New York, United States or Remote (New York, United States + 3 more)
3mo ago
Software Engineer - Model Performance
Baseten
San Francisco, California, United States (On-site)
$150K – $250K Yearly
1mo ago
AI Inference Performance Engineer - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)
$124K – $241.5K Yearly
3w ago
Research Engineer, Infrastructure, Kernels
Thinking Machines Lab
San Francisco, California, United States (On-site)
$350K – $475K Yearly
1mo ago
Senior Performance Engineer - Deep Learning
NVIDIA
Santa Clara, California, United States (On-site)
$152K – $241.5K Yearly
4w ago
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site)
$160K – $230K Yearly
2mo ago
Developer Technology Intern, High-Performance Databases - Summer 2026
NVIDIA
Santa Clara, California, United States (On-site)
$20 – $71 Hourly
3mo ago
Senior/Staff Web Platform Engineer | NYC, Seattle, SF
Perplexity
San Francisco, California, United States (On-site)
$250K – $385K Yearly
2mo ago
High-Performance LLM Training Engineer - New College Grad 2026
NVIDIA
Santa Clara, California, United States (On-site)
$124K – $195.5K Yearly
2mo ago
Member of Technical Staff, Inference
xAI
Palo Alto, California, United States (On-site)
$180K – $440K Yearly