Sparse Attention Jobs
Browse 13 Sparse Attention jobs on Inference Jobs.
13 jobs
4wD-
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix
Campbell, California, United States or Remote (California, United States)$30 – $59 Hourly
1wCA
Senior Applied Researcher, Audio Understanding
Cartesia
San Francisco, California, United States (On-site)$200k – $350k Yearly
2wPE
2wTA
Research Intern, Model Shaping (Summer 2026)
Together AI
San Francisco, California, United States (On-site)
1wTA
Research Engineer, Frontier Speculative Decoding
Together AI
San Francisco, California, United States (On-site)$190k – $270k Yearly
5dTM
Research, Audio Expertise
Thinking Machines Lab
San Francisco, California, United States (On-site)$350k – $475k Yearly