1. Home
  2. Jobs
  3. KV-Cache

KV-Cache Jobs

Browse 24 KV-Cache jobs on Inference Jobs.

24 jobs

4wD-

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

d-Matrix

Campbell, California, United States or Remote (California, United States)$30 – $59 Hourly
2wPL

LLM Inference Engineer

Periodic Labs

Menlo Park, California, United States (On-site)
1wCO

Member of Technical Staff, Model Efficiency

Cohere

New York, New York, United States or Remote (New York, United States + 3 more)
4dNV

Senior ASIC Physical Design Engineer, Cache Coherent Interconnects

NVIDIA

Santa Clara, California, United States (Hybrid)$136k – $264.5k Yearly
5dTE

CPU Architect, Load-Store

Tenstorrent

United States (Remote)$100k – $500k Yearly
3dTA

Staff Engineer, Distributed Storage and HPC & AI Infrastructure

Together AI

San Francisco, California, United States (On-site)$160k – $260k Yearly
5dCO

Sr. Engineer, Storage

CoreWeave

Livingston, New Jersey, United States (Hybrid)$165k – $220k Yearly
2wTA

Staff Engineer, Distributed Storage and HPC & AI Infrastructure

Together AI

Amsterdam, North Holland, Netherlands (Hybrid)
2dCO

Software Engineer II - Artifact Management

CoreWeave

Livingston, New Jersey, United States (Hybrid)$109k – $160k Yearly
2wPE

Backend Software Engineer - Mobile (San Francisco, Palo Alto, New York, Belgrade, London)

Perplexity

San Francisco, California, United States (On-site)$210k – $385k Yearly
5dXA

Software Engineer - Real-Time Storage

xAI

Palo Alto, California, United States (On-site)$180k – $440k Yearly
3wNV

Senior Software Engineer - NIM Factory Container and Cloud Infrastructure

NVIDIA

Santa Clara, California, United States (On-site)$184k – $356.5k Yearly
3wCO

Staff Software Engineer - Artifact Management

CoreWeave

Livingston, New Jersey, United States (Hybrid)$188k – $275k Yearly
1wOP

Software Engineer, Caching Infrastructure

OpenAI

San Francisco, California, United States (On-site)$255k – $405k Yearly
5dAN

Senior/Staff Software Engineer, Inference

Anthropic

New York, New York, United States (Hybrid)$300k – $485k Yearly
3wCO

Senior Software Engineer - Artifact Management

CoreWeave

Livingston, New Jersey, United States (Hybrid)$139k – $204k Yearly
1wOP

Software Engineer, Online Storage

OpenAI

Seattle, Washington, United States (On-site)$255k – $405k Yearly