Caching Jobs

Browse 24 Caching jobs on Inference Jobs.

1w

Software Engineer, Caching Infrastructure

OpenAI

San Francisco, California, United States (On-site) $255k – $405k Yearly

6d

Senior Software Engineer, Inference

Anthropic

Dublin, Dublin, Ireland (Hybrid) €235k – €295k Yearly

2w

Backend Software Engineer

Perplexity

San Francisco, California, United States (On-site) $210k – $385k Yearly

2d

Senior Software Engineer, Data Center Infrastructure Tooling

CoreWeave

Livingston, New Jersey, United States (Hybrid) $165k – $242k Yearly

5d

Senior/Staff Software Engineer, Inference

Anthropic

New York, New York, United States (Hybrid) $300k – $485k Yearly

2w

Distributed Systems Engineer

Magic

San Francisco, California, United States (On-site) $225k – $550k Yearly

2w

Software Engineer, Core Services

OpenAI

San Francisco, California, United States (Hybrid) $255k – $405k Yearly

4d

Software Engineer, ChatGPT Infrastructure

OpenAI

San Francisco, California, United States (On-site) $255k – $405k Yearly

2w

ML Model Serving Engineer

Sesame

San Francisco, California, United States (On-site) $175k – $280k Yearly

1w

Software Engineer, Online Storage

OpenAI

Seattle, Washington, United States (On-site) $255k – $405k Yearly

3d

Software Engineer II - Artifact Management

CoreWeave

Livingston, New Jersey, United States (Hybrid) $109k – $160k Yearly

2w

Backend Software Engineer - Mobile (San Francisco, Palo Alto, New York, Belgrade, London)

Perplexity

San Francisco, California, United States (On-site) $210k – $385k Yearly

2w

Engineering Manager, Storage Security

CoreWeave

Livingston, New Jersey, United States (Hybrid) $165k – $242k Yearly

4w

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

d-Matrix

Campbell, California, United States or Remote (California, United States) $30 – $59 Hourly

6d

Performance Engineer

Cerebras

Toronto, Ontario, Canada (On-site)

3w

Senior Software Engineer - Artifact Management

CoreWeave

Livingston, New Jersey, United States (Hybrid) $139k – $204k Yearly

2w

LLM Inference Engineer

Periodic Labs

Menlo Park, California, United States (On-site)

5d

Senior ASIC Physical Design Engineer, Cache Coherent Interconnects

NVIDIA

Santa Clara, California, United States (Hybrid) $136k – $264.5k Yearly

2w

Member of Technical Staff, Model Efficiency

Cohere

New York, New York, United States or Remote (New York, United States + 3 more)