Cache Optimization Jobs
Explore Cache Optimization roles on Inference Jobs and apply today.
3mo ago
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Santa Clara, California, United States or Remote (California, United States)
$30 – $59 Hourly
1mo ago
Member of Technical Staff, Inference & RL Systems
Magic
San Francisco, California, United States (On-site)
$225K – $550K Yearly
3mo ago
Member of Technical Staff, Model Efficiency
Cohere
New York, United States or Remote (New York, United States + 3 more)
4w ago
ML Runtime Optimization Engineer
Applied Intuition
Sunnyvale, California, United States (On-site)
$159.1K – $199.3K Yearly
2mo ago
ML Runtime Optimization Engineer - Lead
Applied Intuition
Sunnyvale, California, United States
$199.3K – $264.5K Yearly
2mo ago
Software Engineer, Kernel Development and Optimization
Tenstorrent
Gdańsk, Pomeranian Voivodeship, Poland (Hybrid)
3w ago
LLM Inference Frameworks and Optimization Engineer
Together AI
San Francisco, California, United States (On-site)
$160K – $230K Yearly
2mo ago
Senior Performance Architect - Heterogeneous Workload Optimization
NVIDIA
Santa Clara, California, United States (Hybrid)
$184K – $356.5K Yearly
5d ago