
KV Cache Management Jobs

Browse 24 KV Cache Management jobs on Inference Jobs.


4w

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

d-Matrix

Campbell, California, United States or Remote (California, United States)
$30 – $59 Hourly

2w

LLM Inference Engineer

Periodic Labs

Menlo Park, California, United States (On-site)

2w

Member of Technical Staff, Model Efficiency

Cohere

New York, New York, United States or Remote (New York, United States + 3 more)

5d

Senior ASIC Physical Design Engineer, Cache Coherent Interconnects

NVIDIA

Santa Clara, California, United States (Hybrid)
$136k – $264.5k Yearly

6d

CPU Architect, Load-Store

Tenstorrent

United States (Remote)
$100k – $500k Yearly

3w

Staff Software Engineer - Artifact Management

CoreWeave

Livingston, New Jersey, United States (Hybrid)
$188k – $275k Yearly

2w

Member of Technical Staff - GPU Infrastructure

Reflection AI

San Francisco, California, United States (On-site)

3d

Software Engineer II - Artifact Management

CoreWeave

Livingston, New Jersey, United States (Hybrid)
$109k – $160k Yearly

3w

Senior Software Engineer - Artifact Management

CoreWeave

Livingston, New Jersey, United States (Hybrid)
$139k – $204k Yearly

2w

Engineering Manager, Storage Security

CoreWeave

Livingston, New Jersey, United States (Hybrid)
$165k – $242k Yearly

2w

Backend Software Engineer - Mobile (San Francisco, Palo Alto, New York, Belgrade, London)

Perplexity

San Francisco, California, United States (On-site)
$210k – $385k Yearly

4d

Staff Engineer, Distributed Storage and HPC & AI Infrastructure

Together AI

San Francisco, California, United States (On-site)
$160k – $260k Yearly

6d

Senior/Staff Software Engineer, Inference

Anthropic

New York, New York, United States (Hybrid)
$300k – $485k Yearly

6d

Senior Software Engineer, Inference

Anthropic

Dublin, Dublin, Ireland (Hybrid)
€235k – €295k Yearly

2w

Staff Engineer, Distributed Storage and HPC & AI Infrastructure

Together AI

Amsterdam, North Holland, Netherlands (Hybrid)

6d

Software Engineer - Real-Time Storage

xAI

Palo Alto, California, United States (On-site)
$180k – $440k Yearly

2w

Software Engineer, Caching Infrastructure

OpenAI

San Francisco, California, United States (On-site)
$255k – $405k Yearly

3d

Senior Software Engineer, Data Center Infrastructure Tooling

CoreWeave

Livingston, New Jersey, United States (Hybrid)
$165k – $242k Yearly