1. Home
  2. Jobs
  3. Sparse Attention

Sparse Attention Jobs

Browse 13 Sparse Attention jobs on Inference Jobs.

13 jobs

5dCE

Senior Research Engineer - Inference ML

Cerebras

Sunnyvale, California, United States (Hybrid)
1wD-

Senior Runtime Software Engineer

d-Matrix

Sydney, New South Wales, Australia (Hybrid)
5dTE

Software Engineer

Tenstorrent

東京都, Tokyo Prefecture, Japan (On-site)
4wD-

Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference

d-Matrix

Campbell, California, United States or Remote (California, United States)$30 – $59 Hourly
1wCA

Senior Applied Researcher, Audio Understanding

Cartesia

San Francisco, California, United States (On-site)$200k – $350k Yearly
1wCA

Researcher: Model Architecture, UK

Cartesia

London, England, United Kingdom (On-site)
2wTA

Research Intern, Model Shaping (Summer 2026)

Together AI

San Francisco, California, United States (On-site)
1wTA

Research Engineer, Frontier Speculative Decoding

Together AI

San Francisco, California, United States (On-site)$190k – $270k Yearly
5dTM

Research, Audio Expertise

Thinking Machines Lab

San Francisco, California, United States (On-site)$350k – $475k Yearly